Miyakogusa Predicted Gene
- Lj0g3v0101629.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0101629.1 Non Chatacterized Hit- tr|G8A265|G8A265_MEDTR
Putative uncharacterized protein OS=Medicago
truncatul,55.17,0.000000000000002,seg,NULL,CUFF.5712.1
(495 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT2G47010.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 708 0.0
AT2G47010.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 708 0.0
AT1G17030.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 596 e-170
AT4G09965.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 103 3e-22
>AT2G47010.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; EXPRESSED IN: 17 plant structures; EXPRESSED
DURING: 10 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT1G17030.1);
Has 72 Blast hits to 72 proteins in 13 species: Archae -
0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 71;
Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink).
| chr2:19317505-19319252 FORWARD LENGTH=493
Length = 493
Score = 708 bits (1828), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 336/472 (71%), Positives = 389/472 (82%), Gaps = 22/472 (4%)
Query: 34 SAVGDPGMQRDGLRVAFEAWNFCNEVGEEAPHMGSPRAADCFDLS--------------- 78
SAVGDPGM+RDGLRVAFEAWNFCNEVG EAPHMGSPRAADCFDLS
Sbjct: 23 SAVGDPGMKRDGLRVAFEAWNFCNEVGFEAPHMGSPRAADCFDLSSKCIKAYTEDQSNKT 82
Query: 79 --GSSLIHKVTEADNKLGVGDSLPGLTPED-INNADLYAAHKELYLGSLCEVPDTPRPWQ 135
GSSL+HKV+++DN+LG+G PG+ E ++N DLYA KELYLGSLC+V D P PW
Sbjct: 83 TSGSSLVHKVSDSDNELGIGKPKPGIISESALHNPDLYAVEKELYLGSLCQVSDKPNPWS 142
Query: 136 FWMVMLKNGNYDTRSGLCPKDGKKVPPF-APGRFPCFGEGCMNQPIFCHQQTQL-KDG-T 192
FWMVMLKNGNYDT+S LCPK+GKK+PPF PG FPCFG GCMNQP H +T+L +DG T
Sbjct: 143 FWMVMLKNGNYDTKSALCPKNGKKIPPFNQPGLFPCFGSGCMNQPTLNHGKTELQRDGQT 202
Query: 193 MRGGFSGSYDLGSDCGSEHDGLSYYEVVWEKKVNAGSWVFKHKLRTSKKYPWLMLYLRAD 252
M+G F+G+Y+ G+D G+ DG+SYYEVVWEK+V G WVFKHKL+TS KYPWLMLYLRAD
Sbjct: 203 MKGWFNGTYEQGADFGNGLDGISYYEVVWEKRVGVGGWVFKHKLKTSAKYPWLMLYLRAD 262
Query: 253 ATKGFSGGYHYDTRGMLKTLPQSPNFKVRLSLDIKKGGGSKSQFYLLDIGSCWKNNGAAC 312
ATKGFSGGYHYDTRGMLKTLP+SPNFKVRL+L++K+GGG+KSQFYLLDIGSCWKNNG C
Sbjct: 263 ATKGFSGGYHYDTRGMLKTLPESPNFKVRLTLNVKQGGGAKSQFYLLDIGSCWKNNGKPC 322
Query: 313 DGDVLTDVTRYSEMIINPETPAWCSPTGLGNCPPFHITPDNRKIYRNDTANFPYSAYHFY 372
DGDV TDVTRYSEMIINPETP WC+P L NCPP+H + +++R D +FPY AYH Y
Sbjct: 323 DGDVTTDVTRYSEMIINPETPLWCNPKSLHNCPPYHTFRNGTRVHRTDHRSFPYEAYHVY 382
Query: 373 CAPGNAQHLEKPVSTCDPYSNPQAQEIVQLLPHPIWAEYGYPTKKDDGWVGDGRTWELDV 432
CAPGNA+HLE PV TCD YSNPQAQEI+QLLPHP+W EYGYPT+ DGWVGD RTW+LDV
Sbjct: 383 CAPGNAEHLELPVGTCDAYSNPQAQEILQLLPHPVWGEYGYPTRLGDGWVGDPRTWDLDV 442
Query: 433 GGLSSRLYFYQDPGTPPAKRVWTSIDSGTEIFVSDKDEVAEWSLSDFDVIVT 484
GGLSSRL+FYQDPGT PA+R+WTS+D GTEI+ D + +AEW LSDFDV++T
Sbjct: 443 GGLSSRLFFYQDPGTIPARRIWTSVDVGTEIYKED-EAIAEWDLSDFDVLIT 493
>AT2G47010.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; EXPRESSED IN: 17 plant structures; EXPRESSED
DURING: 10 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT1G17030.1);
Has 72 Blast hits to 72 proteins in 13 species: Archae -
0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 71;
Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink).
| chr2:19317505-19319252 FORWARD LENGTH=493
Length = 493
Score = 708 bits (1828), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 336/472 (71%), Positives = 389/472 (82%), Gaps = 22/472 (4%)
Query: 34 SAVGDPGMQRDGLRVAFEAWNFCNEVGEEAPHMGSPRAADCFDLS--------------- 78
SAVGDPGM+RDGLRVAFEAWNFCNEVG EAPHMGSPRAADCFDLS
Sbjct: 23 SAVGDPGMKRDGLRVAFEAWNFCNEVGFEAPHMGSPRAADCFDLSSKCIKAYTEDQSNKT 82
Query: 79 --GSSLIHKVTEADNKLGVGDSLPGLTPED-INNADLYAAHKELYLGSLCEVPDTPRPWQ 135
GSSL+HKV+++DN+LG+G PG+ E ++N DLYA KELYLGSLC+V D P PW
Sbjct: 83 TSGSSLVHKVSDSDNELGIGKPKPGIISESALHNPDLYAVEKELYLGSLCQVSDKPNPWS 142
Query: 136 FWMVMLKNGNYDTRSGLCPKDGKKVPPF-APGRFPCFGEGCMNQPIFCHQQTQL-KDG-T 192
FWMVMLKNGNYDT+S LCPK+GKK+PPF PG FPCFG GCMNQP H +T+L +DG T
Sbjct: 143 FWMVMLKNGNYDTKSALCPKNGKKIPPFNQPGLFPCFGSGCMNQPTLNHGKTELQRDGQT 202
Query: 193 MRGGFSGSYDLGSDCGSEHDGLSYYEVVWEKKVNAGSWVFKHKLRTSKKYPWLMLYLRAD 252
M+G F+G+Y+ G+D G+ DG+SYYEVVWEK+V G WVFKHKL+TS KYPWLMLYLRAD
Sbjct: 203 MKGWFNGTYEQGADFGNGLDGISYYEVVWEKRVGVGGWVFKHKLKTSAKYPWLMLYLRAD 262
Query: 253 ATKGFSGGYHYDTRGMLKTLPQSPNFKVRLSLDIKKGGGSKSQFYLLDIGSCWKNNGAAC 312
ATKGFSGGYHYDTRGMLKTLP+SPNFKVRL+L++K+GGG+KSQFYLLDIGSCWKNNG C
Sbjct: 263 ATKGFSGGYHYDTRGMLKTLPESPNFKVRLTLNVKQGGGAKSQFYLLDIGSCWKNNGKPC 322
Query: 313 DGDVLTDVTRYSEMIINPETPAWCSPTGLGNCPPFHITPDNRKIYRNDTANFPYSAYHFY 372
DGDV TDVTRYSEMIINPETP WC+P L NCPP+H + +++R D +FPY AYH Y
Sbjct: 323 DGDVTTDVTRYSEMIINPETPLWCNPKSLHNCPPYHTFRNGTRVHRTDHRSFPYEAYHVY 382
Query: 373 CAPGNAQHLEKPVSTCDPYSNPQAQEIVQLLPHPIWAEYGYPTKKDDGWVGDGRTWELDV 432
CAPGNA+HLE PV TCD YSNPQAQEI+QLLPHP+W EYGYPT+ DGWVGD RTW+LDV
Sbjct: 383 CAPGNAEHLELPVGTCDAYSNPQAQEILQLLPHPVWGEYGYPTRLGDGWVGDPRTWDLDV 442
Query: 433 GGLSSRLYFYQDPGTPPAKRVWTSIDSGTEIFVSDKDEVAEWSLSDFDVIVT 484
GGLSSRL+FYQDPGT PA+R+WTS+D GTEI+ D + +AEW LSDFDV++T
Sbjct: 443 GGLSSRLFFYQDPGTIPARRIWTSVDVGTEIYKED-EAIAEWDLSDFDVLIT 493
>AT1G17030.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; BEST Arabidopsis thaliana protein match is:
unknown protein (TAIR:AT2G47010.2); Has 70 Blast hits to
70 proteins in 13 species: Archae - 0; Bacteria - 0;
Metazoa - 0; Fungi - 0; Plants - 69; Viruses - 0; Other
Eukaryotes - 1 (source: NCBI BLink). |
chr1:5822487-5824424 FORWARD LENGTH=502
Length = 502
Score = 596 bits (1537), Expect = e-170, Method: Compositional matrix adjust.
Identities = 288/469 (61%), Positives = 352/469 (75%), Gaps = 16/469 (3%)
Query: 28 TTKTNF---SAVGDPGMQRDGLRVAFEAWNFCNEVGEEAPHMGSPRAADCFDLSGSS--- 81
T +TN SAVGDPGM+ D LRVA EAWN CNEVGEEA +MGSPR ADCFD+ SS
Sbjct: 32 TERTNINYVSAVGDPGMRNDNLRVAIEAWNQCNEVGEEATNMGSPRMADCFDIDNSSFPV 91
Query: 82 -LIHKVTEADNKLGVGD-SLPGLTPEDINNADLYAAHKELYLGSLCEVPDTPRPWQFWMV 139
+IHKV E DN+LGVG+ + G++ D NAD+YAA KE+YLG+ C+V D P PWQFWM+
Sbjct: 92 KIIHKVDERDNRLGVGNGTYGGISAGD--NADIYAAQKEVYLGNKCQVVDKPNPWQFWMI 149
Query: 140 MLKNGNYDTRSGLCPKDGKKVPPFAP-GRFPCFGEGCMNQPIFCHQQTQLKD---GTMRG 195
MLKNGN DT + +CP++GKK PF P GRFPCFG+GCMN P H+ T L D G M G
Sbjct: 150 MLKNGNTDTLAAICPENGKKAKPFPPTGRFPCFGKGCMNMPSMHHEYTSLVDNEEGHMSG 209
Query: 196 GFSGSYDLGSDCGSEHDGLSYYEVVWEKKVNAG-SWVFKHKLRTSKKYPWLMLYLRADAT 254
F G++DL +D SYY+V WEKK+ SWVF H L+TS KYPWLMLYLRADA+
Sbjct: 210 SFYGTWDLDNDQKDPVGNNSYYKVKWEKKIGGNESWVFHHLLKTSSKYPWLMLYLRADAS 269
Query: 255 KGFSGGYHYDTRGMLKTLPQSPNFKVRLSLDIKKGGGSKSQFYLLDIGSCWKNNGAACDG 314
+GFSGGYHYDTRGM+K +SP+FKV+ L+I KGGGS SQFYL+D+GSCWKN+G CDG
Sbjct: 270 RGFSGGYHYDTRGMMKMTLKSPDFKVKFKLEIIKGGGSGSQFYLMDMGSCWKNDGRDCDG 329
Query: 315 DVLTDVTRYSEMIINPETPAWCSPTGLGNCPPFHITPDNRKIYRNDTANFPYSAYHFYCA 374
DV TDVTRYSEMIINP A C+ LG CPP H P+ K++R D FP+ AYH+YC
Sbjct: 330 DVTTDVTRYSEMIINPGATAVCTRNRLGACPPEHTFPNGTKVHRTDKEKFPFEAYHYYCV 389
Query: 375 PGNAQHLEKPVSTCDPYSNPQAQEIVQLLPHPIWAEYGYPTKKDDGWVGDGRTWELDVGG 434
PGNA+ E P CDPYSNPQ QEI+Q+LPHP+W ++GYPTKK GW+GD RTWELDVG
Sbjct: 390 PGNARFAESPYEVCDPYSNPQPQEILQILPHPVWEQFGYPTKKGQGWIGDPRTWELDVGK 449
Query: 435 LSSRLYFYQDPGTPPAKRVWTSIDSGTEIFVSDKDEVAEWSLSDFDVIV 483
LS L+FYQDPGT P +R W+SID GTEI++S K+++AEW+++DFD+++
Sbjct: 450 LSQSLFFYQDPGTKPVERHWSSIDLGTEIYMS-KNQIAEWTVTDFDIVI 497
>AT4G09965.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT2G47010.2); Has 30201 Blast hits to 17322
proteins in 780 species: Archae - 12; Bacteria - 1396;
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
0; Other Eukaryotes - 2996 (source: NCBI BLink). |
chr4:6244740-6245686 REVERSE LENGTH=223
Length = 223
Score = 103 bits (256), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 46/74 (62%), Positives = 62/74 (83%), Gaps = 2/74 (2%)
Query: 416 KKDDGWVGDGRTWELDVGGLSSRLYFYQD-PGTPPAKRVWTSIDSGTEIFVSDKDEVAEW 474
K+ +GW+GD RTWE++ G LSSRLYFYQ+ PGT PAKR+WTSI+ T+I+VS++ E AEW
Sbjct: 148 KQGNGWIGDSRTWEVN-GALSSRLYFYQEYPGTKPAKRMWTSINVVTDIYVSNRQETAEW 206
Query: 475 SLSDFDVIVTQPKT 488
++SDFDV+V Q +T
Sbjct: 207 TVSDFDVLVQQKET 220