Miyakogusa Predicted Gene

Lj0g3v0333789.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj0g3v0333789.1 Non Characterized Hit- tr|J3MHY1|J3MHY1_ORYBR
Uncharacterized protein OS=Oryza brachyantha
GN=OB07G1,33.47,0.000000000000002,seg,NULL,CUFF.22768.1
         (497 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT2G37960.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   285   7e-77
AT2G37960.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   285   7e-77
AT3G54060.2 | Symbols:  | unknown protein; BEST Arabidopsis thal...   218   8e-57
AT3G54060.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   217   1e-56

>AT2G37960.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT3G54060.2);
           Has 30201 Blast hits to 17322 proteins in 780 species:
           Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi -
           3422; Plants - 5037; Viruses - 0; Other Eukaryotes -
           2996 (source: NCBI BLink). | chr2:15886962-15889180
           REVERSE LENGTH=480
          Length = 480

 Score =  285 bits (728), Expect = 7e-77,   Method: Compositional matrix adjust.
 Identities = 192/493 (38%), Positives = 273/493 (55%), Gaps = 54/493 (10%)

Query: 13  FGKGKVTPIQVAFIVDRYLCDNNFTETRSAFRNEASSLIANSPINQVPKSLLSLGEMLNE 72
            G G+VTPIQVAF+VDRYLCDN F++TRS FR+EASSLI+NSP+ +VP SLL L E+LNE
Sbjct: 14  IGNGEVTPIQVAFLVDRYLCDNRFSKTRSLFRSEASSLISNSPVREVPNSLLPLNEILNE 73

Query: 73  YICLKEQKVMVDRERVLVEQEKNRVQMLLQGMHNVMNAYNASRSNAAPNVHVMNAKS--- 129
           YI LK++K+++D+E+  ++QEK RVQ LL GM +VMNAYN+S + A P   V+ + +   
Sbjct: 74  YIRLKKEKIVMDQEKSKLDQEKTRVQNLLNGMQDVMNAYNSSTAAAPPPPPVITSAAPMD 133

Query: 130 -AVVPQPKLQNGXXXXXXXXXXXXXXXXXXIHSLPPSINTNPETGNFSTPIVSVSSRKRK 188
             VV     QN                   + SLP     N   GNF+ P ++ S  K++
Sbjct: 134 KQVVASTSKQNNFGVSSSGCTVYNTQNAMTV-SLP----GNKRVGNFTGPCITQSITKKR 188

Query: 189 DTNTVDVPXXXXXXXXXXXXXXIPVKGKKPLLQSTS-VNNQAVSQLSCPTQSSAGNCIXX 247
            +  V V               +  KG K + Q+ + +  Q  S++  P  +        
Sbjct: 189 KSPEVSV-----------GAPSVSRKGMKKIPQAANYLTFQTPSEMQTPLNNGVA---TN 234

Query: 248 XXXXXXXXXXKCLFNHPALTIPTNSPVPKTPPRSNSCHSDTNISPTEISSVATCNGEATP 307
                     KCLF+    + P+NS  P+TP +  S  SD                E TP
Sbjct: 235 ESSDLTSSVAKCLFDKSGTSPPSNSTCPRTPQQKVSPQSD---------------KEVTP 279

Query: 308 TCYSVVSTKRVLVSPAKQMA--YIESSHCI---SPVKAVNSDKVSKRDHVRSRLDFDASD 362
           T  ++V+ +R+ VSP KQ+A   +E SH +   SPVK+ N    SKRDHV+ RL+FD ++
Sbjct: 280 TNCTIVTKERITVSPLKQIASYTVERSHTVSSFSPVKS-NLKMSSKRDHVKGRLNFDDTE 338

Query: 363 MPESLDKSSPNEM---STSESDKELDLFDIDFSNLDALGMDFSFTEMLNDLEIPCEGIDF 419
               LD  +  +M   S+S S+ E DLFDIDFSN+D L  DFSF+E+L D +I CE +  
Sbjct: 339 ATMHLDAPATVDMVSTSSSGSEAEADLFDIDFSNIDLLSEDFSFSELLFDFDIGCEEMSN 398

Query: 420 SDNPA-SSHSKDNPSGSSHECK-----ANQVISGLSSTKAEVLSEKDMNTLGPDCLTTMN 473
              P  S+   +  SGSS E +      +QV+S  +ST  E++  KDMNT G D +TT+ 
Sbjct: 399 HSLPQPSNFHIETASGSSPESRNTNLEPDQVVSEYTSTVTEMIQGKDMNTQGSDSMTTVK 458

Query: 474 TVTKCIKTISPVK 486
           ++TKC++ +SP K
Sbjct: 459 SITKCLRILSPAK 471


>AT2G37960.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G54060.2); Has 418 Blast hits to 247 proteins
           in 92 species: Archae - 0; Bacteria - 163; Metazoa - 49;
           Fungi - 80; Plants - 28; Viruses - 0; Other Eukaryotes -
           98 (source: NCBI BLink). | chr2:15886962-15889180
           REVERSE LENGTH=480
          Length = 480

 Score =  285 bits (728), Expect = 7e-77,   Method: Compositional matrix adjust.
 Identities = 192/493 (38%), Positives = 273/493 (55%), Gaps = 54/493 (10%)

Query: 13  FGKGKVTPIQVAFIVDRYLCDNNFTETRSAFRNEASSLIANSPINQVPKSLLSLGEMLNE 72
            G G+VTPIQVAF+VDRYLCDN F++TRS FR+EASSLI+NSP+ +VP SLL L E+LNE
Sbjct: 14  IGNGEVTPIQVAFLVDRYLCDNRFSKTRSLFRSEASSLISNSPVREVPNSLLPLNEILNE 73

Query: 73  YICLKEQKVMVDRERVLVEQEKNRVQMLLQGMHNVMNAYNASRSNAAPNVHVMNAKS--- 129
           YI LK++K+++D+E+  ++QEK RVQ LL GM +VMNAYN+S + A P   V+ + +   
Sbjct: 74  YIRLKKEKIVMDQEKSKLDQEKTRVQNLLNGMQDVMNAYNSSTAAAPPPPPVITSAAPMD 133

Query: 130 -AVVPQPKLQNGXXXXXXXXXXXXXXXXXXIHSLPPSINTNPETGNFSTPIVSVSSRKRK 188
             VV     QN                   + SLP     N   GNF+ P ++ S  K++
Sbjct: 134 KQVVASTSKQNNFGVSSSGCTVYNTQNAMTV-SLP----GNKRVGNFTGPCITQSITKKR 188

Query: 189 DTNTVDVPXXXXXXXXXXXXXXIPVKGKKPLLQSTS-VNNQAVSQLSCPTQSSAGNCIXX 247
            +  V V               +  KG K + Q+ + +  Q  S++  P  +        
Sbjct: 189 KSPEVSV-----------GAPSVSRKGMKKIPQAANYLTFQTPSEMQTPLNNGVA---TN 234

Query: 248 XXXXXXXXXXKCLFNHPALTIPTNSPVPKTPPRSNSCHSDTNISPTEISSVATCNGEATP 307
                     KCLF+    + P+NS  P+TP +  S  SD                E TP
Sbjct: 235 ESSDLTSSVAKCLFDKSGTSPPSNSTCPRTPQQKVSPQSD---------------KEVTP 279

Query: 308 TCYSVVSTKRVLVSPAKQMA--YIESSHCI---SPVKAVNSDKVSKRDHVRSRLDFDASD 362
           T  ++V+ +R+ VSP KQ+A   +E SH +   SPVK+ N    SKRDHV+ RL+FD ++
Sbjct: 280 TNCTIVTKERITVSPLKQIASYTVERSHTVSSFSPVKS-NLKMSSKRDHVKGRLNFDDTE 338

Query: 363 MPESLDKSSPNEM---STSESDKELDLFDIDFSNLDALGMDFSFTEMLNDLEIPCEGIDF 419
               LD  +  +M   S+S S+ E DLFDIDFSN+D L  DFSF+E+L D +I CE +  
Sbjct: 339 ATMHLDAPATVDMVSTSSSGSEAEADLFDIDFSNIDLLSEDFSFSELLFDFDIGCEEMSN 398

Query: 420 SDNPA-SSHSKDNPSGSSHECK-----ANQVISGLSSTKAEVLSEKDMNTLGPDCLTTMN 473
              P  S+   +  SGSS E +      +QV+S  +ST  E++  KDMNT G D +TT+ 
Sbjct: 399 HSLPQPSNFHIETASGSSPESRNTNLEPDQVVSEYTSTVTEMIQGKDMNTQGSDSMTTVK 458

Query: 474 TVTKCIKTISPVK 486
           ++TKC++ +SP K
Sbjct: 459 SITKCLRILSPAK 471


>AT3G54060.2 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G37960.2); Has 35333 Blast hits to 34131
           proteins in 2444 species: Archae - 798; Bacteria -
           22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
           - 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
           chr3:20018902-20020826 REVERSE LENGTH=442
          Length = 442

 Score =  218 bits (555), Expect = 8e-57,   Method: Compositional matrix adjust.
 Identities = 157/436 (36%), Positives = 231/436 (52%), Gaps = 68/436 (15%)

Query: 1   MVKQSKPRKPDSFGKGKVTPIQVAFIVDRYLCDNNFTETRSAFRNEASSLIANSPINQVP 60
           M K S+ +  +  GKG+VTP QVAFIVDRYL DN F+ETR+ FR+EASSLI++SPI  VP
Sbjct: 1   MGKSSRSKGSNLIGKGEVTPTQVAFIVDRYLHDNRFSETRALFRSEASSLISDSPIRNVP 60

Query: 61  KSLLSLGEMLNEYICLKEQKVMVDRERVLVEQEKNRVQMLLQGMHNVMNAYNASRSNAAP 120
            SL++L  MLN Y+ LK+QKV +D+E++ ++QEK RVQ LLQGM NVMN YNAS +   P
Sbjct: 61  NSLMTLDAMLNHYVSLKKQKVSLDQEKLKLDQEKIRVQNLLQGMENVMNTYNASLTAPPP 120

Query: 121 NVHVMNAKSAVVPQPKLQNGXXXXXXXXXXXXXXXXXXIHSLPPSI--NTNPETGNFSTP 178
                    A  P  + +N                   ++ +  S+  N   + GNFSTP
Sbjct: 121 ---------ASAPTSQQKN------HSISSSGLSQYNTLNGMSVSLLGNKRVDFGNFSTP 165

Query: 179 IVS--VSSRKRKDTNTVDVPXXXXXXXXXXXXXXIPVKGKKPLLQSTSVN-----NQAVS 231
             S  ++ +++    +V  P               PV  K  + ++T  N     ++A +
Sbjct: 166 STSQSITGKRKGPEVSVTAP---------------PVSRKSRITRATGTNKLPQADKAAN 210

Query: 232 QLSCPTQSSAGNCIXXXXXXXXXXXXKCLFNHPALTIPTNSPVPKTPPRSNSCHSDTNIS 291
             +  T + A N              KCLFN    ++PT+S   +TP +          +
Sbjct: 211 NFTSETLAVAKNSASNELIGNGSSVVKCLFNKADSSVPTSSTCFRTPQKH---------A 261

Query: 292 PTEISSVATCNGEATPT---CYSVVSTKRVLVSPAKQM-AY-IESSHCIS---PVKAVNS 343
            +      +   E TPT   C ++V+ +R  +SP KQ+ +Y +E SH IS   PVK+ N 
Sbjct: 262 SSGSDKSNSSQKEVTPTNTNC-TIVTKERFTISPLKQITSYSVERSHLISFSSPVKS-NL 319

Query: 344 DKVSKRDHVRSRLDFDASDMPESLDKSSPNEM---STSESDKELDLFDIDFSNLDALGMD 400
              +KRDHV+ +L+FD +D    L+  +  ++   S S S+ E+DLFD+DFSNLD     
Sbjct: 320 KMSNKRDHVKGKLNFDDTDTETCLEAPATADLVSTSPSGSEPEVDLFDMDFSNLD----- 374

Query: 401 FSFTEMLNDLEIPCEG 416
             F+E+L D ++ CEG
Sbjct: 375 --FSELLVDFDLGCEG 388


>AT3G54060.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G37960.2); Has 455 Blast hits to 322 proteins
           in 98 species: Archae - 0; Bacteria - 178; Metazoa - 88;
           Fungi - 75; Plants - 28; Viruses - 2; Other Eukaryotes -
           84 (source: NCBI BLink). | chr3:20018915-20020826
           REVERSE LENGTH=456
          Length = 456

 Score =  217 bits (553), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 157/436 (36%), Positives = 231/436 (52%), Gaps = 68/436 (15%)

Query: 1   MVKQSKPRKPDSFGKGKVTPIQVAFIVDRYLCDNNFTETRSAFRNEASSLIANSPINQVP 60
           M K S+ +  +  GKG+VTP QVAFIVDRYL DN F+ETR+ FR+EASSLI++SPI  VP
Sbjct: 1   MGKSSRSKGSNLIGKGEVTPTQVAFIVDRYLHDNRFSETRALFRSEASSLISDSPIRNVP 60

Query: 61  KSLLSLGEMLNEYICLKEQKVMVDRERVLVEQEKNRVQMLLQGMHNVMNAYNASRSNAAP 120
            SL++L  MLN Y+ LK+QKV +D+E++ ++QEK RVQ LLQGM NVMN YNAS +   P
Sbjct: 61  NSLMTLDAMLNHYVSLKKQKVSLDQEKLKLDQEKIRVQNLLQGMENVMNTYNASLTAPPP 120

Query: 121 NVHVMNAKSAVVPQPKLQNGXXXXXXXXXXXXXXXXXXIHSLPPSI--NTNPETGNFSTP 178
                    A  P  + +N                   ++ +  S+  N   + GNFSTP
Sbjct: 121 ---------ASAPTSQQKN------HSISSSGLSQYNTLNGMSVSLLGNKRVDFGNFSTP 165

Query: 179 IVS--VSSRKRKDTNTVDVPXXXXXXXXXXXXXXIPVKGKKPLLQSTSVN-----NQAVS 231
             S  ++ +++    +V  P               PV  K  + ++T  N     ++A +
Sbjct: 166 STSQSITGKRKGPEVSVTAP---------------PVSRKSRITRATGTNKLPQADKAAN 210

Query: 232 QLSCPTQSSAGNCIXXXXXXXXXXXXKCLFNHPALTIPTNSPVPKTPPRSNSCHSDTNIS 291
             +  T + A N              KCLFN    ++PT+S   +TP +          +
Sbjct: 211 NFTSETLAVAKNSASNELIGNGSSVVKCLFNKADSSVPTSSTCFRTPQKH---------A 261

Query: 292 PTEISSVATCNGEATPT---CYSVVSTKRVLVSPAKQM-AY-IESSHCIS---PVKAVNS 343
            +      +   E TPT   C ++V+ +R  +SP KQ+ +Y +E SH IS   PVK+ N 
Sbjct: 262 SSGSDKSNSSQKEVTPTNTNC-TIVTKERFTISPLKQITSYSVERSHLISFSSPVKS-NL 319

Query: 344 DKVSKRDHVRSRLDFDASDMPESLDKSSPNEM---STSESDKELDLFDIDFSNLDALGMD 400
              +KRDHV+ +L+FD +D    L+  +  ++   S S S+ E+DLFD+DFSNLD     
Sbjct: 320 KMSNKRDHVKGKLNFDDTDTETCLEAPATADLVSTSPSGSEPEVDLFDMDFSNLD----- 374

Query: 401 FSFTEMLNDLEIPCEG 416
             F+E+L D ++ CEG
Sbjct: 375 --FSELLVDFDLGCEG 388