Miyakogusa Predicted Gene

Lj4g3v0684090.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj4g3v0684090.1 Non Characterized Hit- tr|A9P8G1|A9P8G1_POPTR
Putative uncharacterized protein OS=Populus
trichocarp,26.78,3e-18,seg,NULL,CUFF.47908.1
         (356 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT1G26650.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   416   e-116
AT1G69430.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   406   e-113
AT4G19950.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    72   7e-13
AT5G61340.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    64   1e-10
AT1G31130.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    59   4e-09
AT5G44860.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    54   2e-07
AT2G18680.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    50   3e-06

>AT1G26650.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G69430.1); Has 205 Blast hits to 204 proteins
           in 17 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 205; Viruses - 0; Other Eukaryotes -
           0 (source: NCBI BLink). | chr1:9210335-9211342 FORWARD
           LENGTH=335
          Length = 335

 Score =  416 bits (1068), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 200/311 (64%), Positives = 248/311 (79%), Gaps = 1/311 (0%)

Query: 39  EIQSMNALDILRETIRILRFNSWAFMAITVLLICPVSAVLLSNVTVDESIVKNLTIRLML 98
           +  S NAL+ILRET+RILR+N  A M  T +LICPVSA+LL N  VD+S+V  LT++L+L
Sbjct: 21  QFHSSNALEILRETVRILRYNLGALMLTTAVLICPVSALLLPNFLVDQSLVNKLTVKLLL 80

Query: 99  VARTSGLPLRSIIKQSCQRFAEMVISSASCFPLYATLLLLSKAAVVYSVDCTYSRKKFDV 158
           VA++SGLPL+  +K SCQ+FAE  +SSA CFP++ T+ LLSKAAVVYSVDC+YSR+  D+
Sbjct: 81  VAKSSGLPLQPFVKHSCQKFAETAVSSAMCFPVFITVSLLSKAAVVYSVDCSYSREVVDI 140

Query: 159 SKFCVIIAKFWRKILSTYMWACTIVIGCITLFCVFLVAFCSALSVLGFSPDXXXXXXXXX 218
           SKF VI+ K WR+++ TY+W C +++GC T FCV LVA CS+ SVLGFSPD         
Sbjct: 141 SKFLVILQKIWRRVVFTYVWICILIVGCFTFFCVLLVAICSSFSVLGFSPDFNVYGAMLV 200

Query: 219 XXXFSVVFANAIIICNIAMVITVLEDVSGAQAMLRSSILIKGQTQVGLLIFLGSTIGTAF 278
              FSVVFANAIIICN A+VI+VLEDVSG  A++R+S LIKGQ QVGLL+FLGST+G AF
Sbjct: 201 GLAFSVVFANAIIICNTAIVISVLEDVSGLGALMRASDLIKGQIQVGLLMFLGSTLGLAF 260

Query: 279 VEGLFEHRVKTLSYGDGSSRMWEGPLLVIMHSFVVLIDSMMSAVFYFSCR-SFSMENSEG 337
           VEGLF+HRVK +SYGDGSSR+WEGPLLV+M+SFV LIDSMMSAVFYFSCR  +SME S G
Sbjct: 261 VEGLFDHRVKKVSYGDGSSRLWEGPLLVLMYSFVTLIDSMMSAVFYFSCRVYYSMEASRG 320

Query: 338 EGNSILETMAI 348
           E   I+ET+ +
Sbjct: 321 ETQPIMETVTV 331


>AT1G69430.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G26650.1); Has 216 Blast hits to 215 proteins
           in 17 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 216; Viruses - 0; Other Eukaryotes -
           0 (source: NCBI BLink). | chr1:26098025-26099077 FORWARD
           LENGTH=350
          Length = 350

 Score =  406 bits (1043), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 200/301 (66%), Positives = 249/301 (82%)

Query: 37  NYEIQSMNALDILRETIRILRFNSWAFMAITVLLICPVSAVLLSNVTVDESIVKNLTIRL 96
           +++  SMNAL+ILRET+RILR+N  AFM I +LLICPVSA+LL N+ VD+S+V +LT+RL
Sbjct: 35  DHKFHSMNALEILRETVRILRYNLGAFMLIALLLICPVSAILLPNLLVDQSVVNSLTVRL 94

Query: 97  MLVARTSGLPLRSIIKQSCQRFAEMVISSASCFPLYATLLLLSKAAVVYSVDCTYSRKKF 156
           +LV+++SGLPL   ++ SCQ+F+E  +SSA CFPL+ TL LLS+AAVVYSVDCTYSRKK 
Sbjct: 95  LLVSKSSGLPLLPFVRNSCQKFSETAVSSAMCFPLFITLSLLSRAAVVYSVDCTYSRKKV 154

Query: 157 DVSKFCVIIAKFWRKILSTYMWACTIVIGCITLFCVFLVAFCSALSVLGFSPDXXXXXXX 216
            V+KF VI+ + W++++ TY+W CT+++ C+T FCVFLVA CS+  VLGFSPD       
Sbjct: 155 VVTKFVVIMQRLWKRLVITYLWICTVIVVCLTSFCVFLVAVCSSFYVLGFSPDFNAYGAI 214

Query: 217 XXXXXFSVVFANAIIICNIAMVITVLEDVSGAQAMLRSSILIKGQTQVGLLIFLGSTIGT 276
                FSVVFANAIIICN  +VI++LEDVSG  A++R+S LIKGQTQVGLLIFLGSTIG 
Sbjct: 215 LVGLVFSVVFANAIIICNTTIVISILEDVSGPGALVRASDLIKGQTQVGLLIFLGSTIGL 274

Query: 277 AFVEGLFEHRVKTLSYGDGSSRMWEGPLLVIMHSFVVLIDSMMSAVFYFSCRSFSMENSE 336
            FVEGLFEHRVK+LSYGDGSSR+WEGPLLV+M+SFVVLID+MMSAVFYFSCRS+SME  E
Sbjct: 275 TFVEGLFEHRVKSLSYGDGSSRLWEGPLLVVMYSFVVLIDTMMSAVFYFSCRSYSMEAVE 334

Query: 337 G 337
            
Sbjct: 335 A 335


>AT4G19950.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT5G44860.1); Has 338 Blast hits to 330 proteins
           in 72 species: Archae - 2; Bacteria - 94; Metazoa - 7;
           Fungi - 0; Plants - 232; Viruses - 0; Other Eukaryotes -
           3 (source: NCBI BLink). | chr4:10809977-10810942 FORWARD
           LENGTH=321
          Length = 321

 Score = 72.0 bits (175), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 75/300 (25%), Positives = 136/300 (45%), Gaps = 20/300 (6%)

Query: 39  EIQSMNALDILRETIRILRFNSWAFMAITVLLICPVSAVLLSNVTVDESIVKNLTIRLML 98
           E+Q +N   ILRE+  I +++   F  IT+ LI P+S  +L++    + I+         
Sbjct: 7   ELQFLNKRGILRESTSIPQYSLKTFYLITLTLIFPLSFAILAHSLFTQPIL--------- 57

Query: 99  VARTSGLPLRSIIKQSCQRFAEMVISSASCFPLYA-TLLLLSKAAVVYSVDCTYSRKKFD 157
            A+    P     +   Q    +++    C+ ++     LLS AAVV++V   Y+ K   
Sbjct: 58  -AQIDTYPQAD--QSQLQHEWTVLLVFQFCYIIFLFAFSLLSTAAVVFTVASLYTGKPVS 114

Query: 158 VSKFCVIIAKFWRKILSTYMWACTIVIGCITLFCVFLVAFCSALSVLGFSPDXXXXXXXX 217
            S     I    +++  T++W   +++   T+F +FLV    A+ +              
Sbjct: 115 FSSTMSAIPLVLKRLFITFLWVSLLMLAYNTVFLIFLVTLIVAVDLQNVVLAVFSLVVIF 174

Query: 218 XXXXFSVVFANAIIICNIAMVITVLEDVSGAQAMLRSSILIKGQTQVGLLIFLGSTIGTA 277
                  V+  A  + ++A V++VLE + G  AM +S  L+KG+T +   +     +   
Sbjct: 175 VLFLVVHVYMTA--LWHLASVVSVLEPIYGLAAMKKSYELLKGKTLMACSMVFIYLVHCG 232

Query: 278 FVEGLFEHRVKTLSYGDGS---SRMWEGPLLVIMHSFVVLIDSMMSAVFYFSCRSFSMEN 334
           F+ G+F   V  +  GD     +R+  G  LV +   V LI  ++ +VFY+ C+SF  + 
Sbjct: 233 FIAGVFGAVV--VRGGDDYGIFARIVAGGFLVGVLVIVNLIGLLVQSVFYYVCKSFHHQE 290


>AT5G61340.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G26650.1); Has 30201 Blast hits to 17322
           proteins in 780 species: Archae - 12; Bacteria - 1396;
           Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
           0; Other Eukaryotes - 2996 (source: NCBI BLink). |
           chr5:24662064-24663044 REVERSE LENGTH=326
          Length = 326

 Score = 64.3 bits (155), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 70/274 (25%), Positives = 129/274 (47%), Gaps = 22/274 (8%)

Query: 73  PVSAVLLSNVTVDESIVKNLTIRLMLVARTSGL-PLRSIIKQSCQRFAEMVISSASCFPL 131
           P SA LL +     S    L +RL ++ R +G             + ++ + SS    P 
Sbjct: 35  PFSAGLLLSQPFFSSSSSTLHMRLNMLFRGAGFSSSHDFFNILSLKLSQTLSSSLFTLPF 94

Query: 132 YATLLLLSKAAVVYSVDCTYSRKKFDVSKFCVIIAKFWRKILSTYMWACTIVIGC-ITLF 190
             T LLLSKA V+  +   +S     V         F+ ++L TY+     ++    + F
Sbjct: 95  SLTFLLLSKAYVIKLLSNNHSADSSSV---------FYLRLLKTYVCNFFFLLSANASAF 145

Query: 191 CVFLVAFCSALSVLGFSP-DXXXXXXXXXXXXFSVVFANAIIICNIAMVITVLEDVSGAQ 249
            +F +A+ + L   GFS  +            +S++ ANA +I N+A+V +      G  
Sbjct: 146 ALFFLAY-NTLEAFGFSSRNFYTFLSLSSAIIYSIIIANAFVISNLALVSSPSSSSGGYT 204

Query: 250 AMLRSSILIKGQTQVGLLIFLGSTIGTAFVEGLFEHRVKTLSYGDGS----SRMWEGPLL 305
            +L++ +LI+G+    + + L + +G A VE LF++RV   SY +G     S   EG  +
Sbjct: 205 NILKACLLIRGRNSTAMALALPTNLGLAGVEALFQYRVMR-SYYNGDRDIISIALEGTFI 263

Query: 306 VIMHSFVVLIDSMMSAVFYFSCRSFSMENSEGEG 339
             +++  +++D++++ +FY SC    ++N E + 
Sbjct: 264 AYLYALFLVLDTIVNFLFYQSC----IKNEEDQN 293


>AT1G31130.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 23 plant
           structures; EXPRESSED DURING: 13 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT4G19950.1); Has 246 Blast hits to 244 proteins
           in 29 species: Archae - 2; Bacteria - 16; Metazoa - 0;
           Fungi - 0; Plants - 222; Viruses - 0; Other Eukaryotes -
           6 (source: NCBI BLink). | chr1:11114963-11115928 REVERSE
           LENGTH=321
          Length = 321

 Score = 59.3 bits (142), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 66/248 (26%), Positives = 111/248 (44%), Gaps = 24/248 (9%)

Query: 39  EIQSMNALDILRETIRILRFNSWAFMAITVLLICPVSAVLLSNVTVDESIVKNLTIRLML 98
           E+Q +    +L+E+I I + +   F  IT+  I P+S  +L++    + I+  L      
Sbjct: 7   ELQFLTIPQLLQESISIKKRSPRTFYLITLSFIFPLSFAILAHSLFTQPILAKLD----- 61

Query: 99  VARTSGLPLRSIIKQSCQRFAEMVISSASCFPLYATLLLLSKAAVVYSVDCTYSRKKFDV 158
                  P  S   +S   +  ++I   S         LLS AAVV++V   Y+ K    
Sbjct: 62  ----KSDPPNS--DRSRHDWTVLLIFQFSYLIFLFAFSLLSTAAVVFTVASLYTGKPVSF 115

Query: 159 SKFCVIIAKFWRKILSTYMWACTIVIGCITLFCVFLVAFCSALSV--LGFSPDXXXXXXX 216
           S     I K ++++  T++W   ++     +F VFLV    AL +  LG +         
Sbjct: 116 SSTLSAIPKVFKRLFITFLWVALLMFAYNAVFFVFLVMLLVALDLNSLGLA---IVAGVI 172

Query: 217 XXXXXFSV-VFANAIIICNIAMVITVLEDVSGAQAMLRSSILIKGQTQVGL-----LIFL 270
                F V V+  A  + ++  VI+VLE V G  AM ++  L+KG+T++ +      +FL
Sbjct: 173 ISVLYFGVHVYFTA--LWHLGSVISVLEPVYGIAAMRKAYELLKGKTKMAMGLIFVYLFL 230

Query: 271 GSTIGTAF 278
              IG  F
Sbjct: 231 CGLIGVVF 238


>AT5G44860.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G19950.1); Has 233 Blast hits to 227 proteins
           in 25 species: Archae - 0; Bacteria - 13; Metazoa - 1;
           Fungi - 0; Plants - 216; Viruses - 0; Other Eukaryotes -
           3 (source: NCBI BLink). | chr5:18110688-18111653 REVERSE
           LENGTH=321
          Length = 321

 Score = 53.5 bits (127), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 71/297 (23%), Positives = 130/297 (43%), Gaps = 16/297 (5%)

Query: 39  EIQSMNALDILRETIRILRFNSWAFMAITVLLICPVSAVLLSNVTVDESIVKNLTIRLML 98
           E+Q +N   ILRE+  I +F+   F  IT+ LI P+S  +L++    + I+         
Sbjct: 7   ELQFLNIQGILRESTTIPKFSPKTFYLITLTLIFPLSFAILAHSLFTQPIL--------- 57

Query: 99  VARTSGLPLRSIIKQSCQRFAEMVISSASCFPLYATLLLLSKAAVVYSVDCTYSRKKFDV 158
            A+    P     K + +    ++        L+A   LLS AAVV++V   Y+ K    
Sbjct: 58  -AQLDATPPSDQSKTNHEWTLLLIYQFIYVIFLFA-FSLLSTAAVVFTVASLYTGKPVSF 115

Query: 159 SKFCVIIAKFWRKILSTYMWACTIVIGCITLFCVFLVAFCSALSVLGFSPDXXXXXXXXX 218
           S     I    +++  T++W   +++   ++F +FLV    A+ +   S           
Sbjct: 116 SSTMSAIPLVLKRLFITFLWVSLMMLVYNSVFLLFLVVLIVAIDLQ--SVILAVFSMVVI 173

Query: 219 XXXFSVVFANAIIICNIAMVITVLEDVSGAQAMLRSSILIKGQTQVGL-LIFLGSTI-GT 276
              F  V        ++A V++VLE + G  AM +S  L+ G+T +   ++F+   + G 
Sbjct: 174 FVLFLGVHVYMTAWWHLASVVSVLEPIYGIAAMKKSYELLNGRTNMACSMVFMYLALCGI 233

Query: 277 AFVEGLFEHRVKTLSYGDGSSRMWEGPLLVIMHSFVVLIDSMMSAVFYFSCRSFSME 333
                          +G   +++  G  LV +   V L+  ++ +VFY+ C+SF  +
Sbjct: 234 TAGVFGGVVVHGGDDFG-LFTKIVVGGFLVGILVIVNLVGLLVQSVFYYVCKSFHHQ 289


>AT2G18680.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: endomembrane
           system; EXPRESSED IN: male gametophyte, pollen tube;
           EXPRESSED DURING: L mature pollen stage, M germinated
           pollen stage; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT2G18690.1); Has 30201 Blast
           hits to 17322 proteins in 780 species: Archae - 12;
           Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants -
           5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI
           BLink). | chr2:8094780-8095643 FORWARD LENGTH=287
          Length = 287

 Score = 50.1 bits (118), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 36/126 (28%), Positives = 64/126 (50%), Gaps = 8/126 (6%)

Query: 137 LLSKAAVVYSVDCTYSRKKFDVSKFCVIIAKFWRKILSTYMWACTIVIGCITLFCVFLVA 196
           LLS   +V++   T+    F++  F ++  K+W+  L T  +     +G   LF  F+V 
Sbjct: 77  LLSTLVMVHASALTHKDDSFEIKDFPILTLKYWKGPLVTNFYIVLFSLGYWFLF--FIVL 134

Query: 197 FCSALSVLGFSP--DXXXXXXXXXXXXFSVVFANAIIICNIAMVITVLEDVSGAQAMLRS 254
           F    S++ FS   D            F+V  +   I+ N++MVI++LED  G QA+ ++
Sbjct: 135 F----SIVFFSTKLDSLAAKSRALFIVFAVFESYLAIVWNLSMVISILEDTYGIQALGKA 190

Query: 255 SILIKG 260
           + ++KG
Sbjct: 191 AKIVKG 196