Miyakogusa Predicted Gene

Lj1g3v4027100.1
BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj1g3v4027100.1 Non Characterized Hit- tr|Q9AV94|Q9AV94_SOYBN
Putative uncharacterized protein OS=Glycine max PE=2
S,70.43,0,seg,NULL,CUFF.31787.1
         (808 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G17910.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   234   2e-61
AT5G17910.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   234   2e-61
AT2G29620.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   116   8e-26
AT1G07330.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   109   8e-24
AT5G58880.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    89   1e-17

>AT5G17910.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: mitochondrion;
           EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT2G29620.1). |
           chr5:5927906-5932292 FORWARD LENGTH=1342
          Length = 1342

 Score =  234 bits (596), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 179/436 (41%), Positives = 242/436 (55%), Gaps = 61/436 (13%)

Query: 9   SKSAIKWTEDDQKNLLDLGTXXXXXXXXXXXXIXXXXXX---XLMAEKNLIDFDSADIPC 65
           SKSAIKWTE DQ+N++DLG+            I          LMAE+NLIDFDSADIP 
Sbjct: 351 SKSAIKWTEADQRNVMDLGSLELERNQRLENLIARRRARHNMRLMAERNLIDFDSADIPF 410

Query: 66  NVAPIAV-RRNPFDFPDDSYAGMGLPPIPGSAPSILQPRRNPFDIPYDSNEEKPDLKGDS 124
           N+ PI+  R NPFD   DSY  M   PIPGSAPSI+  RRNPFD+PY+ NEEKPDLKGD 
Sbjct: 411 NMPPISTARHNPFDVSYDSYDDM---PIPGSAPSIMFARRNPFDLPYEPNEEKPDLKGDG 467

Query: 125 FQQEFTQFVQKDTFFRRHESFSMGPSVLGISKQERHDISWKPVFISERMASEGTSYPSFQ 184
           FQ+EF+    KD  FRRHESFS+GPS+LG  + +R     +P F+ ER+A+EGTSY  F+
Sbjct: 468 FQEEFSSQQPKDPMFRRHESFSVGPSMLGGPRHDR----LRPFFVLERLANEGTSYYPFE 523

Query: 185 RQSSEASDSKLSSVPDTESVSSG-DQDERKFSEQELSQETELTSNMDHLSDVVEHGSQSS 243
           RQ SE S+SK+SS+PDTESV +  + DE+K  E    +ET++ + +D +SD  E  + S+
Sbjct: 524 RQLSEVSESKVSSIPDTESVCTVLEDDEKKVDENNADRETKI-AKVDMVSDNDEENNHSA 582

Query: 244 GEND----------------SVELINVDESS----VHHDEGEIVLGGVE---DPSEMELY 280
            ++D                S E  + DE +    +HHD  EIVLG  E   + S+M + 
Sbjct: 583 SDHDEENSHSASDHDEEKSHSSEDSDFDEQADSKKLHHDVAEIVLGSGETHHEQSDM-ME 641

Query: 281 TETGEVGTHEHFNAGETHLRREPSDEVXXXXXXXXXXXLQSEVID-------------GI 327
            ET + G  +  +  ++ L  +  +E            +  +V+D             G 
Sbjct: 642 GETSDKGKLDEVSDSDSSLSEK--EEKIRDISEDEAMLISEQVVDLHEELGASSLPSFGE 699

Query: 328 PDENMEQTSNSQQEDSHLPESR-----ISRQASLEGSNFQNESGDVEDILHVEPVYDSSP 382
            + NM   +   ++D H  E+R     I+   SL+ S      G + D  H EPVYDSSP
Sbjct: 700 LEINM---ARGVEDDYHHDEARAEESFITAHPSLDESAIHVLCG-LGDGDHEEPVYDSSP 755

Query: 383 SAAENLHSFPSVSSDF 398
            +     SF SVSSD+
Sbjct: 756 PSGSRFPSFSSVSSDY 771


>AT5G17910.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: mitochondrion;
           EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT2G29620.1); Has 1807 Blast
           hits to 1807 proteins in 277 species: Archae - 0;
           Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385;
           Viruses - 0; Other Eukaryotes - 339 (source: NCBI
           BLink). | chr5:5927906-5932292 FORWARD LENGTH=1342
          Length = 1342

 Score =  234 bits (596), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 179/436 (41%), Positives = 242/436 (55%), Gaps = 61/436 (13%)

Query: 9   SKSAIKWTEDDQKNLLDLGTXXXXXXXXXXXXIXXXXXX---XLMAEKNLIDFDSADIPC 65
           SKSAIKWTE DQ+N++DLG+            I          LMAE+NLIDFDSADIP 
Sbjct: 351 SKSAIKWTEADQRNVMDLGSLELERNQRLENLIARRRARHNMRLMAERNLIDFDSADIPF 410

Query: 66  NVAPIAV-RRNPFDFPDDSYAGMGLPPIPGSAPSILQPRRNPFDIPYDSNEEKPDLKGDS 124
           N+ PI+  R NPFD   DSY  M   PIPGSAPSI+  RRNPFD+PY+ NEEKPDLKGD 
Sbjct: 411 NMPPISTARHNPFDVSYDSYDDM---PIPGSAPSIMFARRNPFDLPYEPNEEKPDLKGDG 467

Query: 125 FQQEFTQFVQKDTFFRRHESFSMGPSVLGISKQERHDISWKPVFISERMASEGTSYPSFQ 184
           FQ+EF+    KD  FRRHESFS+GPS+LG  + +R     +P F+ ER+A+EGTSY  F+
Sbjct: 468 FQEEFSSQQPKDPMFRRHESFSVGPSMLGGPRHDR----LRPFFVLERLANEGTSYYPFE 523

Query: 185 RQSSEASDSKLSSVPDTESVSSG-DQDERKFSEQELSQETELTSNMDHLSDVVEHGSQSS 243
           RQ SE S+SK+SS+PDTESV +  + DE+K  E    +ET++ + +D +SD  E  + S+
Sbjct: 524 RQLSEVSESKVSSIPDTESVCTVLEDDEKKVDENNADRETKI-AKVDMVSDNDEENNHSA 582

Query: 244 GEND----------------SVELINVDESS----VHHDEGEIVLGGVE---DPSEMELY 280
            ++D                S E  + DE +    +HHD  EIVLG  E   + S+M + 
Sbjct: 583 SDHDEENSHSASDHDEEKSHSSEDSDFDEQADSKKLHHDVAEIVLGSGETHHEQSDM-ME 641

Query: 281 TETGEVGTHEHFNAGETHLRREPSDEVXXXXXXXXXXXLQSEVID-------------GI 327
            ET + G  +  +  ++ L  +  +E            +  +V+D             G 
Sbjct: 642 GETSDKGKLDEVSDSDSSLSEK--EEKIRDISEDEAMLISEQVVDLHEELGASSLPSFGE 699

Query: 328 PDENMEQTSNSQQEDSHLPESR-----ISRQASLEGSNFQNESGDVEDILHVEPVYDSSP 382
            + NM   +   ++D H  E+R     I+   SL+ S      G + D  H EPVYDSSP
Sbjct: 700 LEINM---ARGVEDDYHHDEARAEESFITAHPSLDESAIHVLCG-LGDGDHEEPVYDSSP 755

Query: 383 SAAENLHSFPSVSSDF 398
            +     SF SVSSD+
Sbjct: 756 PSGSRFPSFSSVSSDY 771


>AT2G29620.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G07330.1); Has 887 Blast hits to 750 proteins
           in 151 species: Archae - 2; Bacteria - 63; Metazoa -
           270; Fungi - 51; Plants - 111; Viruses - 6; Other
           Eukaryotes - 384 (source: NCBI BLink). |
           chr2:12663200-12665803 REVERSE LENGTH=747
          Length = 747

 Score =  116 bits (290), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 66/139 (47%), Positives = 78/139 (56%), Gaps = 10/139 (7%)

Query: 10  KSAIKWTEDDQKNLLDLGTXXXXXXXXXXXXIXXXXXXXL---MAEKNLIDFDSADIPCN 66
           K  + WTEDDQKNL+DLGT            I            AE +L+D         
Sbjct: 222 KVVVAWTEDDQKNLMDLGTSEIERNKRLENLISRRRSRRFFLLAAEGSLMD------DME 275

Query: 67  VAPIAVRRNPFDFPDDSYAGMGLPPIPGSAPSILQPRRNPFDIPYDSNEEKPDLKGDSFQ 126
           V  I + RN + F   +Y   GL  +PGSAPS+L PRRNPFD+PYD  EEKP+L GDSFQ
Sbjct: 276 VPRICIGRNFYGFDKGNYEIDGLV-MPGSAPSVLLPRRNPFDLPYDPLEEKPNLTGDSFQ 334

Query: 127 QEFTQFVQKDTFFRRHESF 145
           QEF +   KD FF RHESF
Sbjct: 335 QEFAETNPKDIFFCRHESF 353


>AT1G07330.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G29620.1); Has 597 Blast hits to 536 proteins
           in 121 species: Archae - 2; Bacteria - 47; Metazoa -
           170; Fungi - 43; Plants - 98; Viruses - 0; Other
           Eukaryotes - 237 (source: NCBI BLink). |
           chr1:2251131-2253585 FORWARD LENGTH=685
          Length = 685

 Score =  109 bits (273), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 69/160 (43%), Positives = 83/160 (51%), Gaps = 16/160 (10%)

Query: 10  KSAIKWTEDDQKNLLDLGTXXXXXXXXXXXXIXXXXXXXL---MAEKNLIDFDSADIPCN 66
           K  + WTEDDQKNL+DLG             I       L    AE +L+D +       
Sbjct: 156 KKIVAWTEDDQKNLMDLGNSEMERNKRLEHLITRRRMRRLVRLAAESSLMDME------- 208

Query: 67  VAPIAVRRNPFDFPDDSYAGMGLPPIPGSAPSILQPRRNPFDIPYDSNEEKPDLKGDSFQ 126
           V P+ V RN F    ++Y   GL  +P SAPS+L P +NPFDIPYD  EEKP+L GDSFQ
Sbjct: 209 VPPVCVGRNYFGLDQENYIVDGLQ-MPESAPSVLLPTKNPFDIPYDPQEEKPNLSGDSFQ 267

Query: 127 QEFTQFVQKDTFFRRHESFSMGPSVLGISKQERHDISWKP 166
           QEF      D FF RHESF     V  +  Q   D  W+P
Sbjct: 268 QEFAAN-PNDIFFCRHESFCR--RVFPLDNQ--LDTKWEP 302


>AT5G58880.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G29620.1); Has 1807 Blast hits to 1807 proteins
           in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
           Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
           - 339 (source: NCBI BLink). | chr5:23775966-23779504
           FORWARD LENGTH=1088
          Length = 1088

 Score = 89.0 bits (219), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 38/55 (69%), Positives = 43/55 (78%)

Query: 92  IPGSAPSILQPRRNPFDIPYDSNEEKPDLKGDSFQQEFTQFVQKDTFFRRHESFS 146
           IPGSAPS++   RNPFDIPYD  EE+P+L GDSF QEF+ F QKD FF RHESF 
Sbjct: 284 IPGSAPSVMLQGRNPFDIPYDPQEERPNLTGDSFDQEFSLFNQKDLFFCRHESFC 338
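

Note on the per-alignment statistics: the "Score" and "Identities" lines above follow a fixed layout, so they can be pulled out with two regular expressions. The sketch below is illustrative only and is not part of the BLAST output. The names REPORT, parse_hsp_stats and truncated_percent are hypothetical, and the two sample records are copied from the AT5G17910 and AT1G07330.1 alignments. It also checks an observation from the numbers in this report: the printed percentages look truncated rather than rounded (83/160 is 51.9% but is printed as 51%).

```python
import re

# Statistics lines copied from two alignments in the report above.
REPORT = """\
 Score =  234 bits (596), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 179/436 (41%), Positives = 242/436 (55%), Gaps = 61/436 (13%)
 Score =  109 bits (273), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 69/160 (43%), Positives = 83/160 (51%), Gaps = 16/160 (10%)
"""

# Bit score, raw score in parentheses, then the E-value.
SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+) bits \((\d+)\),\s*Expect\s*=\s*([0-9.eE+-]+)"
)
# Each "Name = numerator/denominator (percent%)" field on the second line.
FRACTION_RE = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_hsp_stats(text):
    """Return one dict per 'Score =' line, with any fraction fields that follow."""
    hsps = []
    for line in text.splitlines():
        score = SCORE_RE.search(line)
        if score:
            hsps.append({
                "bits": float(score.group(1)),
                "raw": int(score.group(2)),
                "evalue": float(score.group(3)),
            })
            continue
        for name, num, den, pct in FRACTION_RE.findall(line):
            if hsps:  # ignore fraction lines that precede any score line
                hsps[-1][name.lower()] = (int(num), int(den), int(pct))
    return hsps


def truncated_percent(num, den):
    """Integer percentage with the fractional part dropped (not rounded)."""
    return 100 * num // den


if __name__ == "__main__":
    for hsp in parse_hsp_stats(REPORT):
        print(hsp["bits"], hsp["evalue"])
        for key in ("identities", "positives", "gaps"):
            num, den, printed = hsp[key]
            print(" ", key, f"{num}/{den}", printed, truncated_percent(num, den))
```

An alignment without gaps, such as the AT5G58880.1 hit, has no "Gaps" field on its statistics line, so code that consumes the parsed records should treat that key as optional.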