
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0200.6
(398 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gb|AAF21309.1| seed maturation protein PM23 [Glycine max] 521 e-146
gb|AAM20153.1| unknown protein [Arabidopsis thaliana] gi|1738088... 400 e-110
gb|AAR87215.1| expressed protein [Oryza sativa (japonica cultiva... 360 5e-98
ref|NP_973464.1| expressed protein [Arabidopsis thaliana] 350 3e-95
gb|AAM60961.1| seed maturation-like protein [Arabidopsis thaliana] 195 2e-48
gb|AAP42757.1| At4g33110 [Arabidopsis thaliana] gi|9755664|emb|C... 194 3e-48
gb|AAP21300.1| At1g63610 [Arabidopsis thaliana] gi|42562912|ref|... 105 3e-21
ref|NP_974078.1| expressed protein [Arabidopsis thaliana] 100 6e-20
dbj|BAD28454.1| seed maturation-like protein [Oryza sativa (japo... 100 8e-20
gb|AAF19695.1| F2K11.3 [Arabidopsis thaliana] 72 3e-11
ref|ZP_00326944.1| hypothetical protein Tery02002317 [Trichodesm... 68 6e-10
ref|ZP_00517341.1| hypothetical protein CwatDRAFT_2590 [Crocosph... 65 3e-09
ref|NP_441676.1| hypothetical protein slr1674 [Synechocystis sp.... 63 2e-08
gb|AAQ00101.1| Uncharacterized protein [Prochlorococcus marinus ... 62 4e-08
gb|AAF75756.1| unknown [Nostoc sp. PCC 7120] gi|17132945|dbj|BAB... 60 1e-07
ref|ZP_00162200.2| hypothetical protein Avar03001506 [Anabaena v... 60 1e-07
ref|ZP_00109571.1| hypothetical protein Npun02003207 [Nostoc pun... 59 2e-07
ref|NP_927326.1| hypothetical protein gll4380 [Gloeobacter viola... 57 8e-07
ref|NP_892723.1| hypothetical protein PMM0605 [Prochlorococcus m... 57 1e-06
emb|CAE20583.1| conserved hypothetical protein [Prochlorococcus ... 56 2e-06
>gb|AAF21309.1| seed maturation protein PM23 [Glycine max]
Length = 404
Score = 521 bits (1342), Expect = e-146
Identities = 288/402 (71%), Positives = 318/402 (78%), Gaps = 18/402 (4%)
Query: 8 RQREEREGKSGCIG----GCSSWQPPFPSS-PLFLFLFSTSALHPPHPSLSLLSATTQTL 62
R RE REGK + G ++ SS PLF P+PSL S + +
Sbjct: 10 RHRERREGKVVLVVVLLIGMATLSSILSSSLPLF------HRYPNPNPSLFRPSLSL-SF 62
Query: 63 QNRNPLSWFLLPQPPPSGISPPNPR----ELIQEIEPLDVSHIQKDVPPTTADAMKRTIS 118
P S FL+ + + + ELIQEIEPLDVSHIQKDVPPTTADAMKRTIS
Sbjct: 63 SRTKPRSPFLVLAASSHDFASNSKKSVLTELIQEIEPLDVSHIQKDVPPTTADAMKRTIS 122
Query: 119 GMLGLLPSDQFHVVIEALWEPLSKLLISSMMTGYTLRNAEYRLCLEKNLDMCERDLEKPK 178
GMLGLLPSDQFHVVIEALWEPLSKLLISSMMTGYTLRN EYRLCLEKNLDM E D+EKPK
Sbjct: 123 GMLGLLPSDQFHVVIEALWEPLSKLLISSMMTGYTLRNVEYRLCLEKNLDMFEGDIEKPK 182
Query: 179 AESTPMDLQGLLHDSVNVIDFGRKSNLSSKVEKVHEDVDIQDLGEISSEAQQYILNLQSR 238
AES +DLQGL+HDSVN I+FG+ NLSSKVEK+HE+VDIQ+LGEIS+EAQQYI NLQSR
Sbjct: 183 AESMKVDLQGLMHDSVNAIEFGKNKNLSSKVEKLHEEVDIQELGEISAEAQQYIFNLQSR 242
Query: 239 LSSMKKELHEVKRKNAALQMQQFVGEEKNDLLDYLRSLQPEQVAQLSEFTSPELKEIIIS 298
LSSMKKELHEVKRK+AALQMQQFVGEEKNDLLDYLRSLQPEQVAQLSEFTSPELK+ I+S
Sbjct: 243 LSSMKKELHEVKRKSAALQMQQFVGEEKNDLLDYLRSLQPEQVAQLSEFTSPELKDTILS 302
Query: 299 VVHGLLATLSPKMHSKPSTMSENATIGTANAGSEDCAEVVENSSIQFHPVISLTRDYLAR 358
VVHGLLATLSPKMHSKPSTMSEN T+G NAGSEDCAEV+ENS++QF PVISLTRDYLAR
Sbjct: 303 VVHGLLATLSPKMHSKPSTMSENTTVGATNAGSEDCAEVLENSALQFQPVISLTRDYLAR 362
Query: 359 LLFWCMLLGHYLRGLEYRVDLTELLSLTSDAEN--NGNEQIA 398
LLFWCML L GL LT+LLSLTSDAEN +G++ IA
Sbjct: 363 LLFWCMLWDTILEGLSVDWKLTDLLSLTSDAENDVSGSQPIA 404
>gb|AAM20153.1| unknown protein [Arabidopsis thaliana] gi|17380884|gb|AAL36254.1|
unknown protein [Arabidopsis thaliana]
gi|3650033|gb|AAC61288.1| unknown protein [Arabidopsis
thaliana] gi|19698843|gb|AAL91157.1| unknown protein
[Arabidopsis thaliana] gi|15226027|ref|NP_179097.1|
expressed protein [Arabidopsis thaliana]
gi|25368604|pir||H84522 hypothetical protein At2g14910
[imported] - Arabidopsis thaliana
Length = 386
Score = 400 bits (1028), Expect = e-110
Identities = 213/311 (68%), Positives = 247/311 (78%), Gaps = 7/311 (2%)
Query: 88 ELIQEIEPLDVSHIQKDVPPTTADAMKRTISGMLGLLPSDQFHVVIEALWEPLSKLLISS 147
+LIQEIEPLDVS IQKDVP TT DAMKRTISGMLGLLPSD+F V IE+LWEPLSKLL+SS
Sbjct: 83 DLIQEIEPLDVSLIQKDVPVTTLDAMKRTISGMLGLLPSDRFQVHIESLWEPLSKLLVSS 142
Query: 148 MMTGYTLRNAEYRLCLEKNLDMCERDLEKPKAESTPMDLQGLLHDSVNVIDFGRKSNLSS 207
MMTGYTLRNAEYRL LEKNLDM L+ +E+T D++G D +V S S
Sbjct: 143 MMTGYTLRNAEYRLFLEKNLDMSGGGLDSHASENTEYDMEGTFPDEDHV-----SSKRDS 197
Query: 208 KVEKVHEDVDIQDLGEISSEAQQYILNLQSRLSSMKKELHEVKRKNAALQMQQFVGEEKN 267
+ + + E +D + LG +SSEAQ+YIL LQS+LSS+KKEL E++RKNAALQMQQFVGEEKN
Sbjct: 198 RTQNLSETIDEEGLGRVSSEAQEYILRLQSQLSSVKKELQEMRRKNAALQMQQFVGEEKN 257
Query: 268 DLLDYLRSLQPEQVAQLSEFTSPELKEIIISVVHGLLATLSPKMHSKPSTMSENATIGTA 327
DLLDYLRSLQPE+VA+LSE +PE+KE I SVVHGLLATLSPKMHSK T
Sbjct: 258 DLLDYLRSLQPEKVAELSEPAAPEVKETIHSVVHGLLATLSPKMHSKFPASEVPPTETVK 317
Query: 328 NAGSEDCAEVVENSSIQFHPVISLTRDYLARLLFWCMLLGHYLRGLEYRVDLTELLSLTS 387
EDCAE+VEN+S+QF P+ISLTRDYLARLLFWCMLLGHYLRGLEYR++L E+LSLT
Sbjct: 318 AKSDEDCAELVENTSLQFQPLISLTRDYLARLLFWCMLLGHYLRGLEYRMELMEVLSLTC 377
Query: 388 DAENNGNEQIA 398
DA NG+E +A
Sbjct: 378 DA--NGSENVA 386
>gb|AAR87215.1| expressed protein [Oryza sativa (japonica cultivar-group)]
gi|50901384|ref|XP_463125.1| expressed protein [Oryza
sativa (japonica cultivar-group)]
Length = 405
Score = 360 bits (923), Expect = 5e-98
Identities = 188/309 (60%), Positives = 235/309 (75%), Gaps = 6/309 (1%)
Query: 89 LIQEIEPLDVSHIQKDVPPTTADAMKRTISGMLGLLPSDQFHVVIEALWEPLSKLLISSM 148
LIQ+IEPLD+S IQKDVPP T DAMKRTISGMLGLLPSDQF VV+EALW P KLL+SS+
Sbjct: 88 LIQDIEPLDLSVIQKDVPPETVDAMKRTISGMLGLLPSDQFRVVVEALWNPFFKLLVSSI 147
Query: 149 MTGYTLRNAEYRLCLEKNLDMCERDLEKPKAESTPMDLQGLLHDSVNVIDFGRKSNLSSK 208
MTGYTLRNAEYRL E+NL++ E D E + + + + S I + ++
Sbjct: 148 MTGYTLRNAEYRLSFERNLELSEEDSEGQNRDISEDNHHNINLGSPVTIFRLSEEDMLQD 207
Query: 209 VEKVHEDVDIQ----DLGEISSEAQQYILNLQSRLSSMKKELHEVKRKNAALQMQQFVGE 264
EK E++ + DLG ++ +A+ YI+ LQSRL +MKKELH+++RKN+ALQMQQFVGE
Sbjct: 208 TEKNDEELPCETVGEDLGNLTPQAEDYIIQLQSRLDAMKKELHDLRRKNSALQMQQFVGE 267
Query: 265 EKNDLLDYLRSLQPEQVAQLSEFTSPELKEIIISVVHGLLATLSPKMHSKPSTMSENATI 324
EKNDLLDYLRSL PE+VA+LSE TSP ++E I SVVHGLLATLSPK+HSK NA+
Sbjct: 268 EKNDLLDYLRSLTPEKVAELSESTSPGVQEAIHSVVHGLLATLSPKIHSKAPPPLGNASG 327
Query: 325 GTANAGSE--DCAEVVENSSIQFHPVISLTRDYLARLLFWCMLLGHYLRGLEYRVDLTEL 382
G N G E DCAE+VEN+S+ F P+IS+ RDYLARLLFWCMLLGHY+RGLEYR++L +L
Sbjct: 328 GVLNLGGEDDDCAELVENASLPFQPLISVPRDYLARLLFWCMLLGHYIRGLEYRLELAQL 387
Query: 383 LSLTSDAEN 391
L +++D E+
Sbjct: 388 LRISTDVES 396
>ref|NP_973464.1| expressed protein [Arabidopsis thaliana]
Length = 366
Score = 350 bits (899), Expect = 3e-95
Identities = 187/275 (68%), Positives = 216/275 (78%), Gaps = 5/275 (1%)
Query: 88 ELIQEIEPLDVSHIQKDVPPTTADAMKRTISGMLGLLPSDQFHVVIEALWEPLSKLLISS 147
+LIQEIEPLDVS IQKDVP TT DAMKRTISGMLGLLPSD+F V IE+LWEPLSKLL+SS
Sbjct: 83 DLIQEIEPLDVSLIQKDVPVTTLDAMKRTISGMLGLLPSDRFQVHIESLWEPLSKLLVSS 142
Query: 148 MMTGYTLRNAEYRLCLEKNLDMCERDLEKPKAESTPMDLQGLLHDSVNVIDFGRKSNLSS 207
MMTGYTLRNAEYRL LEKNLDM L+ +E+T D++G D +V S S
Sbjct: 143 MMTGYTLRNAEYRLFLEKNLDMSGGGLDSHASENTEYDMEGTFPDEDHV-----SSKRDS 197
Query: 208 KVEKVHEDVDIQDLGEISSEAQQYILNLQSRLSSMKKELHEVKRKNAALQMQQFVGEEKN 267
+ + + E +D + LG +SSEAQ+YIL LQS+LSS+KKEL E++RKNAALQMQQFVGEEKN
Sbjct: 198 RTQNLSETIDEEGLGRVSSEAQEYILRLQSQLSSVKKELQEMRRKNAALQMQQFVGEEKN 257
Query: 268 DLLDYLRSLQPEQVAQLSEFTSPELKEIIISVVHGLLATLSPKMHSKPSTMSENATIGTA 327
DLLDYLRSLQPE+VA+LSE +PE+KE I SVVHGLLATLSPKMHSK T
Sbjct: 258 DLLDYLRSLQPEKVAELSEPAAPEVKETIHSVVHGLLATLSPKMHSKFPASEVPPTETVK 317
Query: 328 NAGSEDCAEVVENSSIQFHPVISLTRDYLARLLFW 362
EDCAE+VEN+S+QF P+ISLTRDYLARLLFW
Sbjct: 318 AKSDEDCAELVENTSLQFQPLISLTRDYLARLLFW 352
>gb|AAM60961.1| seed maturation-like protein [Arabidopsis thaliana]
Length = 355
Score = 195 bits (495), Expect = 2e-48
Identities = 124/297 (41%), Positives = 174/297 (57%), Gaps = 25/297 (8%)
Query: 89 LIQEIEPLDVSHIQKDVPPTTADAMKRTISGMLGLLPSDQFHVVIEALWEPLSKLLISSM 148
L+ I+PLD S I K + + D+MK+TIS MLGLLPSDQF V + +PL +LLISS+
Sbjct: 83 LVNRIQPLDTSVISKGLSDSAKDSMKQTISSMLGLLPSDQFSVSVTISEQPLYRLLISSI 142
Query: 149 MTGYTLRNAEYRLCLEKNLDMCERDLEKPKAESTPMDLQGLLHDSVNVIDFGRKSNLSSK 208
+TGYTL NAEYR+ L +N D+ ++ K E + S + G +L +
Sbjct: 143 ITGYTLWNAEYRVSLRRNFDI---PIDPRKEEEDQSSKDNVRFGS----EKGMSEDLGNC 195
Query: 209 VEKVHEDVDIQDLGEISSEAQQYILNLQSRLSSMKKELHEVKRKNAALQMQQFVGEEKND 268
VE+ E + Q G++S EA YI LQS LSSMK+EL K+K ++ ++ +ND
Sbjct: 196 VEE-FERLSPQVFGDLSPEALSYIQLLQSELSSMKEELDSQKKKALRIECEK---GNRND 251
Query: 269 LLDYLRSLQPEQVAQLSEFTSPELKEIIISVVHGLLATLSPKMHSKPSTMSENATIGTAN 328
LLDYLRSL PE V +LS+ +SPE++EI+ +V +L L + S +N I T +
Sbjct: 252 LLDYLRSLDPEMVTELSQLSSPEVEEIVNQLVQNVLERLFEDQTT--SNFMQNPGIRTTD 309
Query: 329 AGSEDCAEVVENSSIQFHPVISLTRDYLARLLFWCMLLGHYLRGLEYRVDLTELLSL 385
G +V +RDYLA+LLFWCMLLGH+LRGLE R+ L+ ++ L
Sbjct: 310 GGDGTGRKV------------DTSRDYLAKLLFWCMLLGHHLRGLENRLHLSCVVGL 354
>gb|AAP42757.1| At4g33110 [Arabidopsis thaliana] gi|9755664|emb|CAC01816.1| seed
maturation-like protein [Arabidopsis thaliana]
gi|22655278|gb|AAM98229.1| seed maturation-like protein
[Arabidopsis thaliana] gi|15242177|ref|NP_197001.1|
expressed protein [Arabidopsis thaliana]
gi|11346180|pir||T51442 seed maturation-like protein -
Arabidopsis thaliana
Length = 355
Score = 194 bits (494), Expect = 3e-48
Identities = 124/297 (41%), Positives = 173/297 (57%), Gaps = 25/297 (8%)
Query: 89 LIQEIEPLDVSHIQKDVPPTTADAMKRTISGMLGLLPSDQFHVVIEALWEPLSKLLISSM 148
L+ I+PLD S I K + + D+MK+TIS MLGLLPSDQF V + +PL +LLISS+
Sbjct: 83 LVNRIQPLDTSVISKGLSDSAKDSMKQTISSMLGLLPSDQFSVSVTISEQPLYRLLISSI 142
Query: 149 MTGYTLRNAEYRLCLEKNLDMCERDLEKPKAESTPMDLQGLLHDSVNVIDFGRKSNLSSK 208
+TGYTL NAEYR+ L +N D+ ++ K E + S + G +L +
Sbjct: 143 ITGYTLWNAEYRVSLRRNFDI---PIDPRKEEEDQSSKDNVRFGS----EKGMSEDLGNC 195
Query: 209 VEKVHEDVDIQDLGEISSEAQQYILNLQSRLSSMKKELHEVKRKNAALQMQQFVGEEKND 268
VE+ E + Q G++S EA YI LQS LSSMK+EL K+K ++ ++ +ND
Sbjct: 196 VEE-FERLSPQVFGDLSPEALSYIQLLQSELSSMKEELDSQKKKALRIECEK---GNRND 251
Query: 269 LLDYLRSLQPEQVAQLSEFTSPELKEIIISVVHGLLATLSPKMHSKPSTMSENATIGTAN 328
LLDYLRSL PE V +LS+ +SPE++EI+ +V +L L + S +N I T
Sbjct: 252 LLDYLRSLDPEMVTELSQLSSPEVEEIVNQLVQNVLERLFEDQTT--SNFMQNPGIRTTE 309
Query: 329 AGSEDCAEVVENSSIQFHPVISLTRDYLARLLFWCMLLGHYLRGLEYRVDLTELLSL 385
G +V +RDYLA+LLFWCMLLGH+LRGLE R+ L+ ++ L
Sbjct: 310 GGDGTGRKV------------DTSRDYLAKLLFWCMLLGHHLRGLENRLHLSCVVGL 354
>gb|AAP21300.1| At1g63610 [Arabidopsis thaliana] gi|42562912|ref|NP_176549.3|
expressed protein [Arabidopsis thaliana]
gi|12324944|gb|AAG52423.1| unknown protein; 83181-85105
[Arabidopsis thaliana] gi|25404427|pir||B96661 unknown
protein, 83181-85105 [imported] - Arabidopsis thaliana
Length = 340
Score = 105 bits (261), Expect = 3e-21
Identities = 84/312 (26%), Positives = 150/312 (47%), Gaps = 66/312 (21%)
Query: 77 PPSGISPPNPR-----ELIQEIEPLDVSHIQKDVPPTTADAMKRTISGMLGLLPSDQFHV 131
PP+G P R E +Q ++P + K P +AM++T++ M+G LP F V
Sbjct: 75 PPNGTRQPKSRRDILLEYVQNVKPEFMEMFVKRAPKHVVEAMRQTVTNMIGTLPPQFFAV 134
Query: 132 VIEALWEPLSKLLISSMMTGYTLRNAEYRLCLEKNLDMCERDLEKPKAESTPMDLQGLLH 191
+ ++ E L++L++S +MTGY RNA+YRL L+++L+ A P D +G
Sbjct: 135 TVTSVAENLAQLMMSVLMTGYMFRNAQYRLELQQSLEQV--------ALPEPRDQKGGDE 186
Query: 192 DSVNVIDFGRKSNLSSKVEKVHEDVDIQDLGEISSEAQQYILNLQSRLSSMKKELHEVKR 251
D G + N+S +V + + G +A++YI L++ + + + +V R
Sbjct: 187 DYAP----GTQKNVSGEVIRWN-----NVSGPEKIDAKKYIELLEAEIEELNR---QVGR 234
Query: 252 KNAALQMQQFVGEEKNDLLDYLRSLQPEQVAQLSEFTSPELKEIIISVVHGLLATLSPKM 311
K+A ++N++L+YL+SL+P+ + +L+ ++ + + V LLA
Sbjct: 235 KSA---------NQQNEILEYLKSLEPQNLKELTSTAGEDVAVAMNTFVKRLLAV----- 280
Query: 312 HSKPSTMSENATIGTANAGSEDCAEVVENSSIQFHPVISLTRDYLARLLFWCMLLGHYLR 371
S P+ M N T E S+ LA+LL+W M++G+ +R
Sbjct: 281 -SDPNQMKTNVT---------------ETSAAD-----------LAKLLYWLMVVGYSIR 313
Query: 372 GLEYRVDLTELL 383
+E R D+ +L
Sbjct: 314 NIEVRFDMERVL 325
Score = 38.1 bits (87), Expect = 0.48
Identities = 26/94 (27%), Positives = 43/94 (45%), Gaps = 1/94 (1%)
Query: 88 ELIQEIEPLDVSHIQKDVPPTTADAMKRTISGMLGLLPSDQFHV-VIEALWEPLSKLLIS 146
E ++ +EP ++ + A AM + +L + +Q V E L+KLL
Sbjct: 245 EYLKSLEPQNLKELTSTAGEDVAVAMNTFVKRLLAVSDPNQMKTNVTETSAADLAKLLYW 304
Query: 147 SMMTGYTLRNAEYRLCLEKNLDMCERDLEKPKAE 180
M+ GY++RN E R +E+ L + E P E
Sbjct: 305 LMVVGYSIRNIEVRFDMERVLGTQPKLAELPPGE 338
>ref|NP_974078.1| expressed protein [Arabidopsis thaliana]
Length = 341
Score = 100 bits (250), Expect = 6e-20
Identities = 83/313 (26%), Positives = 152/313 (48%), Gaps = 67/313 (21%)
Query: 77 PPSGI--SPPNPREL----IQEIEPLDVSHIQKDVPPTTADAMKRTISGMLGLLPSDQFH 130
PP+G P + R++ +Q ++P + K P +AM++T++ M+G LP F
Sbjct: 75 PPNGTRQQPKSRRDILLEYVQNVKPEFMEMFVKRAPKHVVEAMRQTVTNMIGTLPPQFFA 134
Query: 131 VVIEALWEPLSKLLISSMMTGYTLRNAEYRLCLEKNLDMCERDLEKPKAESTPMDLQGLL 190
V + ++ E L++L++S +MTGY RNA+YRL L+++L+ A P D +G
Sbjct: 135 VTVTSVAENLAQLMMSVLMTGYMFRNAQYRLELQQSLEQV--------ALPEPRDQKGGD 186
Query: 191 HDSVNVIDFGRKSNLSSKVEKVHEDVDIQDLGEISSEAQQYILNLQSRLSSMKKELHEVK 250
D G + N+S +V + + G +A++YI L++ + + + +V
Sbjct: 187 EDYAP----GTQKNVSGEVIRWN-----NVSGPEKIDAKKYIELLEAEIEELNR---QVG 234
Query: 251 RKNAALQMQQFVGEEKNDLLDYLRSLQPEQVAQLSEFTSPELKEIIISVVHGLLATLSPK 310
RK+A ++N++L+YL+SL+P+ + +L+ ++ + + V LLA
Sbjct: 235 RKSA---------NQQNEILEYLKSLEPQNLKELTSTAGEDVAVAMNTFVKRLLAV---- 281
Query: 311 MHSKPSTMSENATIGTANAGSEDCAEVVENSSIQFHPVISLTRDYLARLLFWCMLLGHYL 370
S P+ M N T E S+ LA+LL+W M++G+ +
Sbjct: 282 --SDPNQMKTNVT---------------ETSAAD-----------LAKLLYWLMVVGYSI 313
Query: 371 RGLEYRVDLTELL 383
R +E R D+ +L
Sbjct: 314 RNIEVRFDMERVL 326
Score = 38.1 bits (87), Expect = 0.48
Identities = 26/94 (27%), Positives = 43/94 (45%), Gaps = 1/94 (1%)
Query: 88 ELIQEIEPLDVSHIQKDVPPTTADAMKRTISGMLGLLPSDQFHV-VIEALWEPLSKLLIS 146
E ++ +EP ++ + A AM + +L + +Q V E L+KLL
Sbjct: 246 EYLKSLEPQNLKELTSTAGEDVAVAMNTFVKRLLAVSDPNQMKTNVTETSAADLAKLLYW 305
Query: 147 SMMTGYTLRNAEYRLCLEKNLDMCERDLEKPKAE 180
M+ GY++RN E R +E+ L + E P E
Sbjct: 306 LMVVGYSIRNIEVRFDMERVLGTQPKLAELPPGE 339
>dbj|BAD28454.1| seed maturation-like protein [Oryza sativa (japonica
cultivar-group)]
Length = 336
Score = 100 bits (249), Expect = 8e-20
Identities = 78/305 (25%), Positives = 144/305 (46%), Gaps = 66/305 (21%)
Query: 83 PPNPRELIQE----IEPLDVSHIQKDVPPTTADAMKRTISGMLGLLPSDQFHVVIEALWE 138
P N R+++ E ++P + K PP DAM++T++ M+G LP F V + + E
Sbjct: 79 PKNRRDILLEYVKNVQPEFMELFIKRAPPQVVDAMRQTVTNMIGTLPPQFFAVTVTTVAE 138
Query: 139 PLSKLLISSMMTGYTLRNAEYRLCLEKNLDMCERDLEKPKAESTPMDLQGLLHDSVNVID 198
L++L+ S +MTGY RNA+YRL L+++L+ L +PK E+ D
Sbjct: 139 NLAQLMYSVLMTGYMFRNAQYRLELQQSLEQIA--LPEPKEENDSADYAP---------- 186
Query: 199 FGRKSNLSSKVEKVHEDVDIQDLGEISSEAQQYILNLQSRLSSMKKELHEVKRKNAALQM 258
G + ++ +V + ++ G +A +YI L++ + + H+V RK++
Sbjct: 187 -GTQKKVTGEVIRWNKTT-----GPEKIDAVKYIELLEAEIDELS---HQVARKSS---- 233
Query: 259 QQFVGEEKNDLLDYLRSLQPEQVAQLSEFTSPELKEIIISVVHGLLATLSPKMHSKPSTM 318
+ N+LL+YL++L+P+ + +L+ ++ + + + LLA S P+ M
Sbjct: 234 -----QGSNELLEYLKTLEPQNLKELASSAGEDVVFAMNAFIKRLLAV------SDPAQM 282
Query: 319 SENATIGTANAGSEDCAEVVENSSIQFHPVISLTRDYLARLLFWCMLLGHYLRGLEYRVD 378
+ +AN LA L+FW M++G+ +R +E R D
Sbjct: 283 KTTVSETSANQ--------------------------LANLMFWLMIVGYSMRNIEVRFD 316
Query: 379 LTELL 383
+ +L
Sbjct: 317 MERVL 321
Score = 35.4 bits (80), Expect = 3.1
Identities = 24/95 (25%), Positives = 42/95 (43%), Gaps = 1/95 (1%)
Query: 88 ELIQEIEPLDVSHIQKDVPPTTADAMKRTISGMLGLL-PSDQFHVVIEALWEPLSKLLIS 146
E ++ +EP ++ + AM I +L + P+ V E L+ L+
Sbjct: 241 EYLKTLEPQNLKELASSAGEDVVFAMNAFIKRLLAVSDPAQMKTTVSETSANQLANLMFW 300
Query: 147 SMMTGYTLRNAEYRLCLEKNLDMCERDLEKPKAES 181
M+ GY++RN E R +E+ L + E P E+
Sbjct: 301 LMIVGYSMRNIEVRFDMERVLGAAPKIGELPPGEN 335
>gb|AAF19695.1| F2K11.3 [Arabidopsis thaliana]
Length = 222
Score = 72.0 bits (175), Expect = 3e-11
Identities = 43/136 (31%), Positives = 78/136 (56%), Gaps = 7/136 (5%)
Query: 77 PPSGISPPNPREL----IQEIEPLDVSHIQKDVPPTTADAMKRTISGMLGLLPSDQFHVV 132
PP+G P + R++ +Q ++P + K P +AM++T++ M+G LP F V
Sbjct: 75 PPNGTRPKSRRDILLEYVQNVKPEFMEMFVKRAPKHVVEAMRQTVTNMIGTLPPQFFAVT 134
Query: 133 IEALWEPLSKLLISSMMTGYTLRNAEYRLCLEKNLDMCERDLEKPKAESTPMDLQGLLHD 192
+ ++ E L++L++S +MTGY RNA+YRL L+++L+ L +P+ T + L+ L+
Sbjct: 135 VTSVAENLAQLMMSVLMTGYMFRNAQYRLELQQSLEQVA--LPEPRG-ITYLALRKLMQK 191
Query: 193 SVNVIDFGRKSNLSSK 208
S++ R N + K
Sbjct: 192 SISSFWKQRSKNSTVK 207
>ref|ZP_00326944.1| hypothetical protein Tery02002317 [Trichodesmium erythraeum IMS101]
Length = 113
Score = 67.8 bits (164), Expect = 6e-10
Identities = 34/79 (43%), Positives = 49/79 (61%)
Query: 88 ELIQEIEPLDVSHIQKDVPPTTADAMKRTISGMLGLLPSDQFHVVIEALWEPLSKLLISS 147
+ +Q + P VSH+ K M+R I G+LG +PS+QF+V + E L KLL S+
Sbjct: 24 QYVQSMSPDTVSHLSKPTSQEVFQVMERNIVGLLGNIPSEQFNVNVTTSRENLGKLLASA 83
Query: 148 MMTGYTLRNAEYRLCLEKN 166
M++GY LRNAE R+ EK+
Sbjct: 84 MISGYFLRNAEQRMTFEKS 102
>ref|ZP_00517341.1| hypothetical protein CwatDRAFT_2590 [Crocosphaera watsonii WH 8501]
gi|67854274|gb|EAM49575.1| hypothetical protein
CwatDRAFT_2590 [Crocosphaera watsonii WH 8501]
Length = 97
Score = 65.5 bits (158), Expect = 3e-09
Identities = 35/89 (39%), Positives = 49/89 (54%)
Query: 90 IQEIEPLDVSHIQKDVPPTTADAMKRTISGMLGLLPSDQFHVVIEALWEPLSKLLISSMM 149
+Q + P +S + K M+R I G+LG LPS+ F V + + L KLL S+MM
Sbjct: 9 VQSLSPETISQLSKPDSKEVFQVMERNIIGLLGNLPSEHFGVTVSTSRDHLGKLLASAMM 68
Query: 150 TGYTLRNAEYRLCLEKNLDMCERDLEKPK 178
+GY LRNAE RL EK+L ++ K
Sbjct: 69 SGYFLRNAEQRLNFEKSLQAINSTTQEDK 97
>ref|NP_441676.1| hypothetical protein slr1674 [Synechocystis sp. PCC 6803]
gi|1653442|dbj|BAA18356.1| slr1674 [Synechocystis sp.
PCC 6803] gi|7446610|pir||S75897 hypothetical protein
slr1674 - Synechocystis sp. (strain PCC 6803)
Length = 116
Score = 62.8 bits (151), Expect = 2e-08
Identities = 37/95 (38%), Positives = 51/95 (52%), Gaps = 3/95 (3%)
Query: 74 PQPPPSGISPPNPREL---IQEIEPLDVSHIQKDVPPTTADAMKRTISGMLGLLPSDQFH 130
PQP +G P L +QE+ P ++ + + M+R I G+LG LP + F
Sbjct: 8 PQPLFAGNEAPGKDSLWTYVQELSPETIAQLSRPDSQEVFQVMERNIIGLLGNLPPEHFG 67
Query: 131 VVIEALWEPLSKLLISSMMTGYTLRNAEYRLCLEK 165
V I E L +LL S+MM+GY LRNAE RL E+
Sbjct: 68 VTISTSRENLGRLLASAMMSGYFLRNAEQRLGFEQ 102
Score = 45.4 bits (106), Expect = 0.003
Identities = 36/135 (26%), Positives = 56/135 (40%), Gaps = 36/135 (26%)
Query: 261 FVGEE---KNDLLDYLRSLQPEQVAQLSEFTSPELKEIIISVVHGLLATLSPKMHSKPST 317
F G E K+ L Y++ L PE +AQLS S E+ +++ + GLL L P+
Sbjct: 12 FAGNEAPGKDSLWTYVQELSPETIAQLSRPDSQEVFQVMERNIIGLLGNLPPE------- 64
Query: 318 MSENATIGTANAGSEDCAEVVENSSIQFHPVISLTRDYLARLLFWCMLLGHYLRGLEYRV 377
F IS +R+ L RLL M+ G++LR E R+
Sbjct: 65 --------------------------HFGVTISTSRENLGRLLASAMMSGYFLRNAEQRL 98
Query: 378 DLTELLSLTSDAENN 392
+ +S++ N
Sbjct: 99 GFEQAFKSSSNSNEN 113
>gb|AAQ00101.1| Uncharacterized protein [Prochlorococcus marinus subsp. marinus
str. CCMP1375] gi|33240506|ref|NP_875448.1| hypothetical
protein Pro1056 [Prochlorococcus marinus subsp. marinus
str. CCMP1375]
Length = 116
Score = 61.6 bits (148), Expect = 4e-08
Identities = 34/90 (37%), Positives = 49/90 (53%)
Query: 88 ELIQEIEPLDVSHIQKDVPPTTADAMKRTISGMLGLLPSDQFHVVIEALWEPLSKLLISS 147
+ +QE P + + K P D ++ + G+LG+LP DQF V I + + + LL S+
Sbjct: 22 QYLQEQSPDVLQRVAKSASPEIQDIIRHNVQGLLGMLPGDQFEVKITSSRDHFANLLASA 81
Query: 148 MMTGYTLRNAEYRLCLEKNLDMCERDLEKP 177
MMTGY LR E R LE++L E KP
Sbjct: 82 MMTGYFLRQMEQRKELEESLITDEEMSIKP 111
Score = 47.8 bits (112), Expect = 6e-04
Identities = 37/128 (28%), Positives = 57/128 (43%), Gaps = 35/128 (27%)
Query: 263 GEEKNDLLDYLRSLQPEQVAQLSEFTSPELKEIIISVVHGLLATLSPKMHSKPSTMSENA 322
G E N L+ YL+ P+ + ++++ SPE+++II V GLL L
Sbjct: 14 GSEGNALIQYLQEQSPDVLQRVAKSASPEIQDIIRHNVQGLLGML--------------- 58
Query: 323 TIGTANAGSEDCAEVVENSSIQFHPVISLTRDYLARLLFWCMLLGHYLRGLEYRVDLTEL 382
QF I+ +RD+ A LL M+ G++LR +E R +L E
Sbjct: 59 ------------------PGDQFEVKITSSRDHFANLLASAMMTGYFLRQMEQRKELEE- 99
Query: 383 LSLTSDAE 390
SL +D E
Sbjct: 100 -SLITDEE 106
>gb|AAF75756.1| unknown [Nostoc sp. PCC 7120] gi|17132945|dbj|BAB75510.1| alr3811
[Nostoc sp. PCC 7120] gi|25531703|pir||AD2282
hypothetical protein alr3811 [imported] - Nostoc sp.
(strain PCC 7120) gi|17231303|ref|NP_487851.1|
hypothetical protein alr3811 [Nostoc sp. PCC 7120]
Length = 157
Score = 60.1 bits (144), Expect = 1e-07
Identities = 32/84 (38%), Positives = 46/84 (54%)
Query: 88 ELIQEIEPLDVSHIQKDVPPTTADAMKRTISGMLGLLPSDQFHVVIEALWEPLSKLLISS 147
+ ++ + P V+ + K P M+R I G+LG LP + F V I E L +LL S+
Sbjct: 67 QYVKSLSPETVTQLSKPTSPEVFQVMERNIIGLLGNLPPEHFGVTITTSREHLGRLLASA 126
Query: 148 MMTGYTLRNAEYRLCLEKNLDMCE 171
M++GY LRNAE R+ E L E
Sbjct: 127 MISGYFLRNAEQRMSFETVLQGIE 150
Score = 53.1 bits (126), Expect = 1e-05
Identities = 44/153 (28%), Positives = 67/153 (43%), Gaps = 39/153 (25%)
Query: 246 LHEVKRKNAALQMQQFVG---EEKNDLLDYLRSLQPEQVAQLSEFTSPELKEIIISVVHG 302
L E K N + ++ +F E N L Y++SL PE V QLS+ TSPE+ +++ + G
Sbjct: 39 LQEEKVSNQSNRVSEFFNSDSETANLLWQYVKSLSPETVTQLSKPTSPEVFQVMERNIIG 98
Query: 303 LLATLSPKMHSKPSTMSENATIGTANAGSEDCAEVVENSSIQFHPVISLTRDYLARLLFW 362
LL L P+ F I+ +R++L RLL
Sbjct: 99 LLGNLPPE---------------------------------HFGVTITTSREHLGRLLAS 125
Query: 363 CMLLGHYLRGLEYRVDLTELLSLTSDAENNGNE 395
M+ G++LR E R+ +L E+N NE
Sbjct: 126 AMISGYFLRNAEQRMSFETVL---QGIESNHNE 155
>ref|ZP_00162200.2| hypothetical protein Avar03001506 [Anabaena variabilis ATCC 29413]
Length = 157
Score = 60.1 bits (144), Expect = 1e-07
Identities = 32/84 (38%), Positives = 46/84 (54%)
Query: 88 ELIQEIEPLDVSHIQKDVPPTTADAMKRTISGMLGLLPSDQFHVVIEALWEPLSKLLISS 147
+ ++ + P V+ + K P M+R I G+LG LP + F V I E L +LL S+
Sbjct: 67 QYVKSLSPETVTQLSKPTSPEVFQVMERNIIGLLGNLPPEHFGVTITTSREHLGRLLASA 126
Query: 148 MMTGYTLRNAEYRLCLEKNLDMCE 171
M++GY LRNAE R+ E L E
Sbjct: 127 MISGYFLRNAEQRMSFETVLQGSE 150
Score = 51.2 bits (121), Expect = 5e-05
Identities = 40/141 (28%), Positives = 62/141 (43%), Gaps = 36/141 (25%)
Query: 246 LHEVKRKNAALQMQQFVG---EEKNDLLDYLRSLQPEQVAQLSEFTSPELKEIIISVVHG 302
L E K N + ++ +F E N L Y++SL PE V QLS+ TSPE+ +++ + G
Sbjct: 39 LQEEKVSNQSNRVSEFFNSDSEAANLLWQYVKSLSPETVTQLSKPTSPEVFQVMERNIIG 98
Query: 303 LLATLSPKMHSKPSTMSENATIGTANAGSEDCAEVVENSSIQFHPVISLTRDYLARLLFW 362
LL L P+ F I+ +R++L RLL
Sbjct: 99 LLGNLPPE---------------------------------HFGVTITTSREHLGRLLAS 125
Query: 363 CMLLGHYLRGLEYRVDLTELL 383
M+ G++LR E R+ +L
Sbjct: 126 AMISGYFLRNAEQRMSFETVL 146
>ref|ZP_00109571.1| hypothetical protein Npun02003207 [Nostoc punctiforme PCC 73102]
Length = 114
Score = 59.3 bits (142), Expect = 2e-07
Identities = 30/84 (35%), Positives = 47/84 (55%)
Query: 88 ELIQEIEPLDVSHIQKDVPPTTADAMKRTISGMLGLLPSDQFHVVIEALWEPLSKLLISS 147
+ ++ + P V+ + K M+R I+G+LG LPS+ F + + E L +LL S+
Sbjct: 24 QYVKSLSPETVTQLSKPTSAEVFQVMERNITGLLGNLPSEHFGITVSTSRESLGRLLASA 83
Query: 148 MMTGYTLRNAEYRLCLEKNLDMCE 171
M++GY LRNAE R+ E L E
Sbjct: 84 MISGYFLRNAEQRMNFELALQGTE 107
Score = 45.1 bits (105), Expect = 0.004
Identities = 40/129 (31%), Positives = 57/129 (44%), Gaps = 36/129 (27%)
Query: 265 EKNDLL-DYLRSLQPEQVAQLSEFTSPELKEIIISVVHGLLATLSPKMHSKPSTMSENAT 323
E NDLL Y++SL PE V QLS+ TS E+ +++ + GLL L P H
Sbjct: 17 ETNDLLWQYVKSLSPETVTQLSKPTSAEVFQVMERNITGLLGNL-PSEH----------- 64
Query: 324 IGTANAGSEDCAEVVENSSIQFHPVISLTRDYLARLLFWCMLLGHYLRGLEYRVDLTELL 383
F +S +R+ L RLL M+ G++LR E R++ L
Sbjct: 65 ---------------------FGITVSTSRESLGRLLASAMISGYFLRNAEQRMNFE--L 101
Query: 384 SLTSDAENN 392
+L NN
Sbjct: 102 ALQGTETNN 110
>ref|NP_927326.1| hypothetical protein gll4380 [Gloeobacter violaceus PCC 7421]
gi|35214955|dbj|BAC92321.1| gll4380 [Gloeobacter
violaceus PCC 7421]
Length = 114
Score = 57.4 bits (137), Expect = 8e-07
Identities = 31/71 (43%), Positives = 43/71 (59%)
Query: 98 VSHIQKDVPPTTADAMKRTISGMLGLLPSDQFHVVIEALWEPLSKLLISSMMTGYTLRNA 157
+S I + V P + I G++G LPS+QF+V + + LS LL S+MMTGY LRN
Sbjct: 37 LSQIAQSVTPEVHQMIAGNIQGLMGSLPSNQFNVQVSTNRDNLSALLASAMMTGYFLRNV 96
Query: 158 EYRLCLEKNLD 168
E R+ LE L+
Sbjct: 97 EQRMELEGRLN 107
Score = 47.0 bits (110), Expect = 0.001
Identities = 32/127 (25%), Positives = 57/127 (44%), Gaps = 33/127 (25%)
Query: 265 EKNDLLDYLRSLQPEQVAQLSEFTSPELKEIIISVVHGLLATLSPKMHSKPSTMSENATI 324
+ N L+ YLR PE ++Q+++ +PE+ ++I + GL+ +L
Sbjct: 21 QDNKLVHYLRMQSPELLSQIAQSVTPEVHQMIAGNIQGLMGSLP---------------- 64
Query: 325 GTANAGSEDCAEVVENSSIQFHPVISLTRDYLARLLFWCMLLGHYLRGLEYRVDLTELLS 384
S QF+ +S RD L+ LL M+ G++LR +E R++L L+
Sbjct: 65 -----------------SNQFNVQVSTNRDNLSALLASAMMTGYFLRNVEQRMELEGRLN 107
Query: 385 LTSDAEN 391
E+
Sbjct: 108 AALGGED 114
>ref|NP_892723.1| hypothetical protein PMM0605 [Prochlorococcus marinus subsp.
pastoris str. CCMP1986] gi|33639894|emb|CAE19064.1|
conserved hypothetical protein [Prochlorococcus marinus
subsp. pastoris str. CCMP1986]
Length = 112
Score = 57.0 bits (136), Expect = 1e-06
Identities = 32/88 (36%), Positives = 50/88 (56%), Gaps = 3/88 (3%)
Query: 83 PPNPRELIQEIE---PLDVSHIQKDVPPTTADAMKRTISGMLGLLPSDQFHVVIEALWEP 139
P + +LIQ ++ P + + K + ++ + G+LG+LPSDQF V I + +
Sbjct: 14 PNDENDLIQYLQKQSPEVMQRVAKSASEDIQEIIRHNVQGLLGMLPSDQFDVKITSSKDN 73
Query: 140 LSKLLISSMMTGYTLRNAEYRLCLEKNL 167
++ LL S+MMTGY LR E R LE+ L
Sbjct: 74 IANLLSSAMMTGYFLRQMEQRKELEQTL 101
Score = 45.1 bits (105), Expect = 0.004
Identities = 33/124 (26%), Positives = 55/124 (43%), Gaps = 33/124 (26%)
Query: 265 EKNDLLDYLRSLQPEQVAQLSEFTSPELKEIIISVVHGLLATLSPKMHSKPSTMSENATI 324
++NDL+ YL+ PE + ++++ S +++EII V GLL L
Sbjct: 16 DENDLIQYLQKQSPEVMQRVAKSASEDIQEIIRHNVQGLLGML----------------- 58
Query: 325 GTANAGSEDCAEVVENSSIQFHPVISLTRDYLARLLFWCMLLGHYLRGLEYRVDLTELLS 384
S QF I+ ++D +A LL M+ G++LR +E R +L + L
Sbjct: 59 ----------------PSDQFDVKITSSKDNIANLLSSAMMTGYFLRQMEQRKELEQTLK 102
Query: 385 LTSD 388
D
Sbjct: 103 SDED 106
>emb|CAE20583.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9313] gi|33862681|ref|NP_894241.1| hypothetical protein
PMT0408 [Prochlorococcus marinus str. MIT 9313]
Length = 116
Score = 56.2 bits (134), Expect = 2e-06
Identities = 28/80 (35%), Positives = 46/80 (57%)
Query: 88 ELIQEIEPLDVSHIQKDVPPTTADAMKRTISGMLGLLPSDQFHVVIEALWEPLSKLLISS 147
+ +Q+ P + + K D ++ + G+LG++P +QF V + A + L+ LL S+
Sbjct: 22 QYLQDQSPDVLQRVAKSASNDIQDIIRHNVQGLLGMIPGEQFEVKVTASRDNLASLLASA 81
Query: 148 MMTGYTLRNAEYRLCLEKNL 167
MMTGY LR E R LE++L
Sbjct: 82 MMTGYFLRQMEQRKELEESL 101
Score = 39.7 bits (91), Expect = 0.16
Identities = 32/125 (25%), Positives = 56/125 (44%), Gaps = 35/125 (28%)
Query: 264 EEKNDLLDYLRSLQPEQVAQLSEFTSPELKEIIISVVHGLLATLSPKMHSKPSTMSENAT 323
++ N L+ YL+ P+ + ++++ S ++++II V GLL + +
Sbjct: 15 QDGNGLIQYLQDQSPDVLQRVAKSASNDIQDIIRHNVQGLLGMIPGE------------- 61
Query: 324 IGTANAGSEDCAEVVENSSIQFHPVISLTRDYLARLLFWCMLLGHYLRGLEYRVDLTELL 383
QF ++ +RD LA LL M+ G++LR +E R +L E
Sbjct: 62 --------------------QFEVKVTASRDNLASLLASAMMTGYFLRQMEQRKELEE-- 99
Query: 384 SLTSD 388
SL SD
Sbjct: 100 SLFSD 104
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.315 0.132 0.380
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 687,189,256
Number of Sequences: 2540612
Number of extensions: 29722364
Number of successful extensions: 86756
Number of sequences better than 10.0: 298
Number of HSP's better than 10.0 without gapping: 65
Number of HSP's successfully gapped in prelim test: 234
Number of HSP's that attempted gapping in prelim test: 86458
Number of HSP's gapped (non-prelim): 470
length of query: 398
length of database: 863,360,394
effective HSP length: 130
effective length of query: 268
effective length of database: 533,080,834
effective search space: 142865663512
effective search space used: 142865663512
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 76 (33.9 bits)
Lotus: description of TM0200.6