Miyakogusa Predicted Gene

Lj0g3v0101629.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj0g3v0101629.1 Non Chatacterized Hit- tr|G8A265|G8A265_MEDTR
Putative uncharacterized protein OS=Medicago
truncatul,55.17,0.000000000000002,seg,NULL,CUFF.5712.1
         (495 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT2G47010.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   708   0.0  
AT2G47010.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   708   0.0  
AT1G17030.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   596   e-170
AT4G09965.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   103   3e-22

>AT2G47010.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: endomembrane
           system; EXPRESSED IN: 17 plant structures; EXPRESSED
           DURING: 10 growth stages; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT1G17030.1);
           Has 72 Blast hits to 72 proteins in 13 species: Archae -
           0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 71;
           Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink).
           | chr2:19317505-19319252 FORWARD LENGTH=493
          Length = 493

 Score =  708 bits (1828), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 336/472 (71%), Positives = 389/472 (82%), Gaps = 22/472 (4%)

Query: 34  SAVGDPGMQRDGLRVAFEAWNFCNEVGEEAPHMGSPRAADCFDLS--------------- 78
           SAVGDPGM+RDGLRVAFEAWNFCNEVG EAPHMGSPRAADCFDLS               
Sbjct: 23  SAVGDPGMKRDGLRVAFEAWNFCNEVGFEAPHMGSPRAADCFDLSSKCIKAYTEDQSNKT 82

Query: 79  --GSSLIHKVTEADNKLGVGDSLPGLTPED-INNADLYAAHKELYLGSLCEVPDTPRPWQ 135
             GSSL+HKV+++DN+LG+G   PG+  E  ++N DLYA  KELYLGSLC+V D P PW 
Sbjct: 83  TSGSSLVHKVSDSDNELGIGKPKPGIISESALHNPDLYAVEKELYLGSLCQVSDKPNPWS 142

Query: 136 FWMVMLKNGNYDTRSGLCPKDGKKVPPF-APGRFPCFGEGCMNQPIFCHQQTQL-KDG-T 192
           FWMVMLKNGNYDT+S LCPK+GKK+PPF  PG FPCFG GCMNQP   H +T+L +DG T
Sbjct: 143 FWMVMLKNGNYDTKSALCPKNGKKIPPFNQPGLFPCFGSGCMNQPTLNHGKTELQRDGQT 202

Query: 193 MRGGFSGSYDLGSDCGSEHDGLSYYEVVWEKKVNAGSWVFKHKLRTSKKYPWLMLYLRAD 252
           M+G F+G+Y+ G+D G+  DG+SYYEVVWEK+V  G WVFKHKL+TS KYPWLMLYLRAD
Sbjct: 203 MKGWFNGTYEQGADFGNGLDGISYYEVVWEKRVGVGGWVFKHKLKTSAKYPWLMLYLRAD 262

Query: 253 ATKGFSGGYHYDTRGMLKTLPQSPNFKVRLSLDIKKGGGSKSQFYLLDIGSCWKNNGAAC 312
           ATKGFSGGYHYDTRGMLKTLP+SPNFKVRL+L++K+GGG+KSQFYLLDIGSCWKNNG  C
Sbjct: 263 ATKGFSGGYHYDTRGMLKTLPESPNFKVRLTLNVKQGGGAKSQFYLLDIGSCWKNNGKPC 322

Query: 313 DGDVLTDVTRYSEMIINPETPAWCSPTGLGNCPPFHITPDNRKIYRNDTANFPYSAYHFY 372
           DGDV TDVTRYSEMIINPETP WC+P  L NCPP+H   +  +++R D  +FPY AYH Y
Sbjct: 323 DGDVTTDVTRYSEMIINPETPLWCNPKSLHNCPPYHTFRNGTRVHRTDHRSFPYEAYHVY 382

Query: 373 CAPGNAQHLEKPVSTCDPYSNPQAQEIVQLLPHPIWAEYGYPTKKDDGWVGDGRTWELDV 432
           CAPGNA+HLE PV TCD YSNPQAQEI+QLLPHP+W EYGYPT+  DGWVGD RTW+LDV
Sbjct: 383 CAPGNAEHLELPVGTCDAYSNPQAQEILQLLPHPVWGEYGYPTRLGDGWVGDPRTWDLDV 442

Query: 433 GGLSSRLYFYQDPGTPPAKRVWTSIDSGTEIFVSDKDEVAEWSLSDFDVIVT 484
           GGLSSRL+FYQDPGT PA+R+WTS+D GTEI+  D + +AEW LSDFDV++T
Sbjct: 443 GGLSSRLFFYQDPGTIPARRIWTSVDVGTEIYKED-EAIAEWDLSDFDVLIT 493


>AT2G47010.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: endomembrane
           system; EXPRESSED IN: 17 plant structures; EXPRESSED
           DURING: 10 growth stages; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT1G17030.1);
           Has 72 Blast hits to 72 proteins in 13 species: Archae -
           0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 71;
           Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink).
           | chr2:19317505-19319252 FORWARD LENGTH=493
          Length = 493

 Score =  708 bits (1828), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 336/472 (71%), Positives = 389/472 (82%), Gaps = 22/472 (4%)

Query: 34  SAVGDPGMQRDGLRVAFEAWNFCNEVGEEAPHMGSPRAADCFDLS--------------- 78
           SAVGDPGM+RDGLRVAFEAWNFCNEVG EAPHMGSPRAADCFDLS               
Sbjct: 23  SAVGDPGMKRDGLRVAFEAWNFCNEVGFEAPHMGSPRAADCFDLSSKCIKAYTEDQSNKT 82

Query: 79  --GSSLIHKVTEADNKLGVGDSLPGLTPED-INNADLYAAHKELYLGSLCEVPDTPRPWQ 135
             GSSL+HKV+++DN+LG+G   PG+  E  ++N DLYA  KELYLGSLC+V D P PW 
Sbjct: 83  TSGSSLVHKVSDSDNELGIGKPKPGIISESALHNPDLYAVEKELYLGSLCQVSDKPNPWS 142

Query: 136 FWMVMLKNGNYDTRSGLCPKDGKKVPPF-APGRFPCFGEGCMNQPIFCHQQTQL-KDG-T 192
           FWMVMLKNGNYDT+S LCPK+GKK+PPF  PG FPCFG GCMNQP   H +T+L +DG T
Sbjct: 143 FWMVMLKNGNYDTKSALCPKNGKKIPPFNQPGLFPCFGSGCMNQPTLNHGKTELQRDGQT 202

Query: 193 MRGGFSGSYDLGSDCGSEHDGLSYYEVVWEKKVNAGSWVFKHKLRTSKKYPWLMLYLRAD 252
           M+G F+G+Y+ G+D G+  DG+SYYEVVWEK+V  G WVFKHKL+TS KYPWLMLYLRAD
Sbjct: 203 MKGWFNGTYEQGADFGNGLDGISYYEVVWEKRVGVGGWVFKHKLKTSAKYPWLMLYLRAD 262

Query: 253 ATKGFSGGYHYDTRGMLKTLPQSPNFKVRLSLDIKKGGGSKSQFYLLDIGSCWKNNGAAC 312
           ATKGFSGGYHYDTRGMLKTLP+SPNFKVRL+L++K+GGG+KSQFYLLDIGSCWKNNG  C
Sbjct: 263 ATKGFSGGYHYDTRGMLKTLPESPNFKVRLTLNVKQGGGAKSQFYLLDIGSCWKNNGKPC 322

Query: 313 DGDVLTDVTRYSEMIINPETPAWCSPTGLGNCPPFHITPDNRKIYRNDTANFPYSAYHFY 372
           DGDV TDVTRYSEMIINPETP WC+P  L NCPP+H   +  +++R D  +FPY AYH Y
Sbjct: 323 DGDVTTDVTRYSEMIINPETPLWCNPKSLHNCPPYHTFRNGTRVHRTDHRSFPYEAYHVY 382

Query: 373 CAPGNAQHLEKPVSTCDPYSNPQAQEIVQLLPHPIWAEYGYPTKKDDGWVGDGRTWELDV 432
           CAPGNA+HLE PV TCD YSNPQAQEI+QLLPHP+W EYGYPT+  DGWVGD RTW+LDV
Sbjct: 383 CAPGNAEHLELPVGTCDAYSNPQAQEILQLLPHPVWGEYGYPTRLGDGWVGDPRTWDLDV 442

Query: 433 GGLSSRLYFYQDPGTPPAKRVWTSIDSGTEIFVSDKDEVAEWSLSDFDVIVT 484
           GGLSSRL+FYQDPGT PA+R+WTS+D GTEI+  D + +AEW LSDFDV++T
Sbjct: 443 GGLSSRLFFYQDPGTIPARRIWTSVDVGTEIYKED-EAIAEWDLSDFDVLIT 493


>AT1G17030.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: endomembrane
           system; BEST Arabidopsis thaliana protein match is:
           unknown protein (TAIR:AT2G47010.2); Has 70 Blast hits to
           70 proteins in 13 species: Archae - 0; Bacteria - 0;
           Metazoa - 0; Fungi - 0; Plants - 69; Viruses - 0; Other
           Eukaryotes - 1 (source: NCBI BLink). |
           chr1:5822487-5824424 FORWARD LENGTH=502
          Length = 502

 Score =  596 bits (1537), Expect = e-170,   Method: Compositional matrix adjust.
 Identities = 288/469 (61%), Positives = 352/469 (75%), Gaps = 16/469 (3%)

Query: 28  TTKTNF---SAVGDPGMQRDGLRVAFEAWNFCNEVGEEAPHMGSPRAADCFDLSGSS--- 81
           T +TN    SAVGDPGM+ D LRVA EAWN CNEVGEEA +MGSPR ADCFD+  SS   
Sbjct: 32  TERTNINYVSAVGDPGMRNDNLRVAIEAWNQCNEVGEEATNMGSPRMADCFDIDNSSFPV 91

Query: 82  -LIHKVTEADNKLGVGD-SLPGLTPEDINNADLYAAHKELYLGSLCEVPDTPRPWQFWMV 139
            +IHKV E DN+LGVG+ +  G++  D  NAD+YAA KE+YLG+ C+V D P PWQFWM+
Sbjct: 92  KIIHKVDERDNRLGVGNGTYGGISAGD--NADIYAAQKEVYLGNKCQVVDKPNPWQFWMI 149

Query: 140 MLKNGNYDTRSGLCPKDGKKVPPFAP-GRFPCFGEGCMNQPIFCHQQTQLKD---GTMRG 195
           MLKNGN DT + +CP++GKK  PF P GRFPCFG+GCMN P   H+ T L D   G M G
Sbjct: 150 MLKNGNTDTLAAICPENGKKAKPFPPTGRFPCFGKGCMNMPSMHHEYTSLVDNEEGHMSG 209

Query: 196 GFSGSYDLGSDCGSEHDGLSYYEVVWEKKVNAG-SWVFKHKLRTSKKYPWLMLYLRADAT 254
            F G++DL +D        SYY+V WEKK+    SWVF H L+TS KYPWLMLYLRADA+
Sbjct: 210 SFYGTWDLDNDQKDPVGNNSYYKVKWEKKIGGNESWVFHHLLKTSSKYPWLMLYLRADAS 269

Query: 255 KGFSGGYHYDTRGMLKTLPQSPNFKVRLSLDIKKGGGSKSQFYLLDIGSCWKNNGAACDG 314
           +GFSGGYHYDTRGM+K   +SP+FKV+  L+I KGGGS SQFYL+D+GSCWKN+G  CDG
Sbjct: 270 RGFSGGYHYDTRGMMKMTLKSPDFKVKFKLEIIKGGGSGSQFYLMDMGSCWKNDGRDCDG 329

Query: 315 DVLTDVTRYSEMIINPETPAWCSPTGLGNCPPFHITPDNRKIYRNDTANFPYSAYHFYCA 374
           DV TDVTRYSEMIINP   A C+   LG CPP H  P+  K++R D   FP+ AYH+YC 
Sbjct: 330 DVTTDVTRYSEMIINPGATAVCTRNRLGACPPEHTFPNGTKVHRTDKEKFPFEAYHYYCV 389

Query: 375 PGNAQHLEKPVSTCDPYSNPQAQEIVQLLPHPIWAEYGYPTKKDDGWVGDGRTWELDVGG 434
           PGNA+  E P   CDPYSNPQ QEI+Q+LPHP+W ++GYPTKK  GW+GD RTWELDVG 
Sbjct: 390 PGNARFAESPYEVCDPYSNPQPQEILQILPHPVWEQFGYPTKKGQGWIGDPRTWELDVGK 449

Query: 435 LSSRLYFYQDPGTPPAKRVWTSIDSGTEIFVSDKDEVAEWSLSDFDVIV 483
           LS  L+FYQDPGT P +R W+SID GTEI++S K+++AEW+++DFD+++
Sbjct: 450 LSQSLFFYQDPGTKPVERHWSSIDLGTEIYMS-KNQIAEWTVTDFDIVI 497


>AT4G09965.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G47010.2); Has 30201 Blast hits to 17322
           proteins in 780 species: Archae - 12; Bacteria - 1396;
           Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
           0; Other Eukaryotes - 2996 (source: NCBI BLink). |
           chr4:6244740-6245686 REVERSE LENGTH=223
          Length = 223

 Score =  103 bits (256), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 46/74 (62%), Positives = 62/74 (83%), Gaps = 2/74 (2%)

Query: 416 KKDDGWVGDGRTWELDVGGLSSRLYFYQD-PGTPPAKRVWTSIDSGTEIFVSDKDEVAEW 474
           K+ +GW+GD RTWE++ G LSSRLYFYQ+ PGT PAKR+WTSI+  T+I+VS++ E AEW
Sbjct: 148 KQGNGWIGDSRTWEVN-GALSSRLYFYQEYPGTKPAKRMWTSINVVTDIYVSNRQETAEW 206

Query: 475 SLSDFDVIVTQPKT 488
           ++SDFDV+V Q +T
Sbjct: 207 TVSDFDVLVQQKET 220