
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0081b.1
(360 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gb|AAM60961.1| seed maturation-like protein [Arabidopsis thaliana] 361 2e-98
gb|AAP42757.1| At4g33110 [Arabidopsis thaliana] gi|9755664|emb|C... 360 4e-98
gb|AAM20153.1| unknown protein [Arabidopsis thaliana] gi|1738088... 199 1e-49
gb|AAR87215.1| expressed protein [Oryza sativa (japonica cultiva... 194 5e-48
ref|NP_973464.1| expressed protein [Arabidopsis thaliana] 169 1e-40
gb|AAF21309.1| seed maturation protein PM23 [Glycine max] 157 6e-37
ref|NP_974078.1| expressed protein [Arabidopsis thaliana] 100 9e-20
gb|AAP21300.1| At1g63610 [Arabidopsis thaliana] gi|42562912|ref|... 100 1e-19
dbj|BAD28454.1| seed maturation-like protein [Oryza sativa (japo... 91 4e-17
gb|AAF19695.1| F2K11.3 [Arabidopsis thaliana] 62 3e-08
gb|AAF75756.1| unknown [Nostoc sp. PCC 7120] gi|17132945|dbj|BAB... 61 6e-08
ref|ZP_00162200.2| hypothetical protein Avar03001506 [Anabaena v... 61 6e-08
ref|ZP_00109571.1| hypothetical protein Npun02003207 [Nostoc pun... 58 4e-07
ref|ZP_00517341.1| hypothetical protein CwatDRAFT_2590 [Crocosph... 55 3e-06
ref|ZP_00326944.1| hypothetical protein Tery02002317 [Trichodesm... 54 7e-06
ref|NP_682814.1| hypothetical protein tll2024 [Thermosynechococc... 54 1e-05
ref|ZP_00160351.2| hypothetical protein Avar03003448 [Anabaena v... 52 2e-05
dbj|BAB76875.1| all5176 [Nostoc sp. PCC 7120] gi|17232668|ref|NP... 51 5e-05
emb|CAE07510.1| conserved hypothetical protein [Synechococcus sp... 50 8e-05
ref|NP_441676.1| hypothetical protein slr1674 [Synechocystis sp.... 50 8e-05
>gb|AAM60961.1| seed maturation-like protein [Arabidopsis thaliana]
Length = 355
Score = 361 bits (926), Expect = 2e-98
Identities = 209/366 (57%), Positives = 260/366 (70%), Gaps = 17/366 (4%)
Query: 1 MAALSWRPF-ILSRLTDLSPNPLHPPKPPPLFLRRRRCF-----LTSCYADGFSSSSSSS 54
MAA S R F +LSR+TDLS L +PPP R + ++S S S
Sbjct: 1 MAAASARAFFMLSRVTDLSKKKLILHQPPPSSSPHRLPYAPNRAVSSSAVISCLSGGGVS 60
Query: 55 SSDDVVSTRKSTFDRGFTVIANMLRRIEPLDNSVISKGVSDAARDSMKQTISTMLGLLPS 114
S D VSTR+S DRGF VIAN++ RI+PLD SVISKG+SD+A+DSMKQTIS+MLGLLPS
Sbjct: 61 SDDSYVSTRRSKLDRGFAVIANLVNRIQPLDTSVISKGLSDSAKDSMKQTISSMLGLLPS 120
Query: 115 DHFSVTVTVSKHPLHRLLVSSIITGYTLWNAEYRMSLTRNLDIASPCGARDSDCEKRSEI 174
D FSV+VT+S+ PL+RLL+SSIITGYTLWNAEYR+SL RN DI P R + E +S
Sbjct: 121 DQFSVSVTISEQPLYRLLISSIITGYTLWNAEYRVSLRRNFDI--PIDPRKEE-EDQSSK 177
Query: 175 LEVKGGGEDGGEIEVASDLGLKDLENCSSSPRVFGDLPPQALNYIQQLQSELTNVKEELN 234
V+ G E G ++ DLG E SP+VFGDL P+AL+YIQ LQSEL+++KEEL+
Sbjct: 178 DNVRFGSEKG----MSEDLGNCVEEFERLSPQVFGDLSPEALSYIQLLQSELSSMKEELD 233
Query: 235 ARKQEMMQLEYDRGIRNNLLEYLRSLDPEMVTELSRPSSVEVEDIIHQLVQNILSRFFVD 294
++K++ +++E ++G RN+LL+YLRSLDPEMVTELS+ SS EVE+I++QLVQN+L R F D
Sbjct: 234 SQKKKALRIECEKGNRNDLLDYLRSLDPEMVTELSQLSSPEVEEIVNQLVQNVLERLFED 293
Query: 295 DASGSFMEQSVEGNIDNHPDNGDEFSDTVGTSRDYLAKLLFWCMLLGHHLRGLENRLQLS 354
+ +FM+ D GD V TSRDYLAKLLFWCMLLGHHLRGLENRL LS
Sbjct: 294 QTTSNFMQNPG----IRTTDGGDGTGRKVDTSRDYLAKLLFWCMLLGHHLRGLENRLHLS 349
Query: 355 CVVGLL 360
CVVGLL
Sbjct: 350 CVVGLL 355
>gb|AAP42757.1| At4g33110 [Arabidopsis thaliana] gi|9755664|emb|CAC01816.1| seed
maturation-like protein [Arabidopsis thaliana]
gi|22655278|gb|AAM98229.1| seed maturation-like protein
[Arabidopsis thaliana] gi|15242177|ref|NP_197001.1|
expressed protein [Arabidopsis thaliana]
gi|11346180|pir||T51442 seed maturation-like protein -
Arabidopsis thaliana
Length = 355
Score = 360 bits (924), Expect = 4e-98
Identities = 210/370 (56%), Positives = 263/370 (70%), Gaps = 25/370 (6%)
Query: 1 MAALSWRPF-ILSRLTDLSPNPLHPPKPPPLFLRRRRCF-----LTSCYADGFSSSSSSS 54
MAA S R F +LSR+TDLS L +PPP R + ++S S S
Sbjct: 1 MAAASARAFFMLSRVTDLSKKKLILHQPPPSSSPHRLPYAPNRAVSSSAVISCLSGGGVS 60
Query: 55 SSDDVVSTRKSTFDRGFTVIANMLRRIEPLDNSVISKGVSDAARDSMKQTISTMLGLLPS 114
S D VSTR+S DRGF VIAN++ RI+PLD SVISKG+SD+A+DSMKQTIS+MLGLLPS
Sbjct: 61 SDDSYVSTRRSKLDRGFAVIANLVNRIQPLDTSVISKGLSDSAKDSMKQTISSMLGLLPS 120
Query: 115 DHFSVTVTVSKHPLHRLLVSSIITGYTLWNAEYRMSLTRNLDIASPCGARDSDCEKRSEI 174
D FSV+VT+S+ PL+RLL+SSIITGYTLWNAEYR+SL RN DI P R + E +S
Sbjct: 121 DQFSVSVTISEQPLYRLLISSIITGYTLWNAEYRVSLRRNFDI--PIDPRKEE-EDQSSK 177
Query: 175 LEVKGGGEDGGEIEVASDLGLKDLENCSSSPRVFGDLPPQALNYIQQLQSELTNVKEELN 234
V+ G E G ++ DLG E SP+VFGDL P+AL+YIQ LQSEL+++KEEL+
Sbjct: 178 DNVRFGSEKG----MSEDLGNCVEEFERLSPQVFGDLSPEALSYIQLLQSELSSMKEELD 233
Query: 235 ARKQEMMQLEYDRGIRNNLLEYLRSLDPEMVTELSRPSSVEVEDIIHQLVQNILSRFFVD 294
++K++ +++E ++G RN+LL+YLRSLDPEMVTELS+ SS EVE+I++QLVQN+L R F D
Sbjct: 234 SQKKKALRIECEKGNRNDLLDYLRSLDPEMVTELSQLSSPEVEEIVNQLVQNVLERLFED 293
Query: 295 DASGSFME----QSVEGNIDNHPDNGDEFSDTVGTSRDYLAKLLFWCMLLGHHLRGLENR 350
+ +FM+ ++ EG GD V TSRDYLAKLLFWCMLLGHHLRGLENR
Sbjct: 294 QTTSNFMQNPGIRTTEG--------GDGTGRKVDTSRDYLAKLLFWCMLLGHHLRGLENR 345
Query: 351 LQLSCVVGLL 360
L LSCVVGLL
Sbjct: 346 LHLSCVVGLL 355
>gb|AAM20153.1| unknown protein [Arabidopsis thaliana] gi|17380884|gb|AAL36254.1|
unknown protein [Arabidopsis thaliana]
gi|3650033|gb|AAC61288.1| unknown protein [Arabidopsis
thaliana] gi|19698843|gb|AAL91157.1| unknown protein
[Arabidopsis thaliana] gi|15226027|ref|NP_179097.1|
expressed protein [Arabidopsis thaliana]
gi|25368604|pir||H84522 hypothetical protein At2g14910
[imported] - Arabidopsis thaliana
Length = 386
Score = 199 bits (505), Expect = 1e-49
Identities = 135/366 (36%), Positives = 203/366 (54%), Gaps = 29/366 (7%)
Query: 19 PNPLHPPKPP-------PLFLRRRRCFLTSCYADGFSSSSSSSSSDDVVSTRKSTFDRGF 71
P LH P P P F RR R + + S++ SS+ DD S T
Sbjct: 14 PQLLHKPTKPLPFLFLLPRFNRRFRSLTITSSSTTSSNNFSSNCGDDGFSLDDFTLHSDS 73
Query: 72 T-----VIANMLRRIEPLDNSVISKGVSDAARDSMKQTISTMLGLLPSDHFSVTVTVSKH 126
V++++++ IEPLD S+I K V D+MK+TIS MLGLLPSD F V +
Sbjct: 74 RSPKKCVLSDLIQEIEPLDVSLIQKDVPVTTLDAMKRTISGMLGLLPSDRFQVHIESLWE 133
Query: 127 PLHRLLVSSIITGYTLWNAEYRMSLTRNLDIASPCGARDSDCEKRSEILEVKGGGEDGGE 186
PL +LLVSS++TGYTL NAEYR+ L +NLD++ G DS + +E +++G D
Sbjct: 134 PLSKLLVSSMMTGYTLRNAEYRLFLEKNLDMSG--GGLDSHASENTE-YDMEGTFPDEDH 190
Query: 187 IEVASDLGLKDLENCSSSPRVFGDLPPQALNYIQQLQSELTNVKEELNARKQEMMQLEYD 246
+ D ++L + G + +A YI +LQS+L++VK+EL +++ L+
Sbjct: 191 VSSKRDSRTQNLSE-TIDEEGLGRVSSEAQEYILRLQSQLSSVKKELQEMRRKNAALQMQ 249
Query: 247 RGI---RNNLLEYLRSLDPEMVTELSRPSSVEVEDIIHQLVQNIL--------SRFFVDD 295
+ + +N+LL+YLRSL PE V ELS P++ EV++ IH +V +L S+F +
Sbjct: 250 QFVGEEKNDLLDYLRSLQPEKVAELSEPAAPEVKETIHSVVHGLLATLSPKMHSKFPASE 309
Query: 296 A--SGSFMEQSVEGNIDNHPDNGDEFSDTVGTSRDYLAKLLFWCMLLGHHLRGLENRLQL 353
+ + +S E + + +F + +RDYLA+LLFWCMLLGH+LRGLE R++L
Sbjct: 310 VPPTETVKAKSDEDCAELVENTSLQFQPLISLTRDYLARLLFWCMLLGHYLRGLEYRMEL 369
Query: 354 SCVVGL 359
V+ L
Sbjct: 370 MEVLSL 375
>gb|AAR87215.1| expressed protein [Oryza sativa (japonica cultivar-group)]
gi|50901384|ref|XP_463125.1| expressed protein [Oryza
sativa (japonica cultivar-group)]
Length = 405
Score = 194 bits (492), Expect = 5e-48
Identities = 141/368 (38%), Positives = 198/368 (53%), Gaps = 45/368 (12%)
Query: 21 PLHPPKPPP---LFLRRRRCFLTSCYADGFSSSSSSSSSDDVVSTRKSTFDRGFTVIANM 77
PL PP PPP LF R G ++S+S + R + +++ N+
Sbjct: 29 PLPPPPPPPQQQLFQAALRLPRRRLAGVGVVAASASPFDELYARGRPAHGSSKKSILWNL 88
Query: 78 LRRIEPLDNSVISKGVSDAARDSMKQTISTMLGLLPSDHFSVTVTVSKHPLHRLLVSSII 137
++ IEPLD SVI K V D+MK+TIS MLGLLPSD F V V +P +LLVSSI+
Sbjct: 89 IQDIEPLDLSVIQKDVPPETVDAMKRTISGMLGLLPSDQFRVVVEALWNPFFKLLVSSIM 148
Query: 138 TGYTLWNAEYRMSLTRNLDIASPCGARDSDCEKRSEILEVKGGGEDGGE----IEVASDL 193
TGYTL NAEYR+S RNL+++ DS+ + R +I E + G ++ +
Sbjct: 149 TGYTLRNAEYRLSFERNLELSE----EDSEGQNR-DISEDNHHNINLGSPVTIFRLSEED 203
Query: 194 GLKDLEN------CSSSPRVFGDLPPQALNYIQQLQSELTNVKEELN--ARKQEMMQLEY 245
L+D E C + G+L PQA +YI QLQS L +K+EL+ RK +Q++
Sbjct: 204 MLQDTEKNDEELPCETVGEDLGNLTPQAEDYIIQLQSRLDAMKKELHDLRRKNSALQMQQ 263
Query: 246 DRG-IRNNLLEYLRSLDPEMVTELSRPSSVEVEDIIHQLVQNILSRFF----------VD 294
G +N+LL+YLRSL PE V ELS +S V++ IH +V +L+ +
Sbjct: 264 FVGEEKNDLLDYLRSLTPEKVAELSESTSPGVQEAIHSVVHGLLATLSPKIHSKAPPPLG 323
Query: 295 DASGSFMEQSVEGNIDNHPDNGDE--------FSDTVGTSRDYLAKLLFWCMLLGHHLRG 346
+ASG + N+ D+ E F + RDYLA+LLFWCMLLGH++RG
Sbjct: 324 NASGGVL------NLGGEDDDCAELVENASLPFQPLISVPRDYLARLLFWCMLLGHYIRG 377
Query: 347 LENRLQLS 354
LE RL+L+
Sbjct: 378 LEYRLELA 385
>ref|NP_973464.1| expressed protein [Arabidopsis thaliana]
Length = 366
Score = 169 bits (428), Expect = 1e-40
Identities = 120/343 (34%), Positives = 184/343 (52%), Gaps = 29/343 (8%)
Query: 19 PNPLHPPKPP-------PLFLRRRRCFLTSCYADGFSSSSSSSSSDDVVSTRKSTFDRGF 71
P LH P P P F RR R + + S++ SS+ DD S T
Sbjct: 14 PQLLHKPTKPLPFLFLLPRFNRRFRSLTITSSSTTSSNNFSSNCGDDGFSLDDFTLHSDS 73
Query: 72 T-----VIANMLRRIEPLDNSVISKGVSDAARDSMKQTISTMLGLLPSDHFSVTVTVSKH 126
V++++++ IEPLD S+I K V D+MK+TIS MLGLLPSD F V +
Sbjct: 74 RSPKKCVLSDLIQEIEPLDVSLIQKDVPVTTLDAMKRTISGMLGLLPSDRFQVHIESLWE 133
Query: 127 PLHRLLVSSIITGYTLWNAEYRMSLTRNLDIASPCGARDSDCEKRSEILEVKGGGEDGGE 186
PL +LLVSS++TGYTL NAEYR+ L +NLD++ G DS + +E +++G D
Sbjct: 134 PLSKLLVSSMMTGYTLRNAEYRLFLEKNLDMSG--GGLDSHASENTE-YDMEGTFPDEDH 190
Query: 187 IEVASDLGLKDLENCSSSPRVFGDLPPQALNYIQQLQSELTNVKEELNARKQEMMQLEYD 246
+ D ++L + G + +A YI +LQS+L++VK+EL +++ L+
Sbjct: 191 VSSKRDSRTQNLSE-TIDEEGLGRVSSEAQEYILRLQSQLSSVKKELQEMRRKNAALQMQ 249
Query: 247 RGI---RNNLLEYLRSLDPEMVTELSRPSSVEVEDIIHQLVQNIL--------SRFFVDD 295
+ + +N+LL+YLRSL PE V ELS P++ EV++ IH +V +L S+F +
Sbjct: 250 QFVGEEKNDLLDYLRSLQPEKVAELSEPAAPEVKETIHSVVHGLLATLSPKMHSKFPASE 309
Query: 296 A--SGSFMEQSVEGNIDNHPDNGDEFSDTVGTSRDYLAKLLFW 336
+ + +S E + + +F + +RDYLA+LLFW
Sbjct: 310 VPPTETVKAKSDEDCAELVENTSLQFQPLISLTRDYLARLLFW 352
>gb|AAF21309.1| seed maturation protein PM23 [Glycine max]
Length = 404
Score = 157 bits (396), Expect = 6e-37
Identities = 113/327 (34%), Positives = 175/327 (52%), Gaps = 32/327 (9%)
Query: 53 SSSSDDVVSTRKSTFDRGFTVIANMLRRIEPLDNSVISKGVSDAARDSMKQTISTMLGLL 112
++SS D S K + V+ +++ IEPLD S I K V D+MK+TIS MLGLL
Sbjct: 75 AASSHDFASNSKKS------VLTELIQEIEPLDVSHIQKDVPPTTADAMKRTISGMLGLL 128
Query: 113 PSDHFSVTVTVSKHPLHRLLVSSIITGYTLWNAEYRMSLTRNLDIASPCGARDSDCEK-R 171
PSD F V + PL +LL+SS++TGYTL N EYR+ L +NLD+ + D EK +
Sbjct: 129 PSDQFHVVIEALWEPLSKLLISSMMTGYTLRNVEYRLCLEKNLDMF------EGDIEKPK 182
Query: 172 SEILEVKGGG---EDGGEIEVASDLGLKD-LENCSSSPRV--FGDLPPQALNYIQQLQSE 225
+E ++V G + IE + L +E + G++ +A YI LQS
Sbjct: 183 AESMKVDLQGLMHDSVNAIEFGKNKNLSSKVEKLHEEVDIQELGEISAEAQQYIFNLQSR 242
Query: 226 LTNVKEELNARKQEMMQLEYDRGI---RNNLLEYLRSLDPEMVTELSRPSSVEVEDIIHQ 282
L+++K+EL+ K++ L+ + + +N+LL+YLRSL PE V +LS +S E++D I
Sbjct: 243 LSSMKKELHEVKRKSAALQMQQFVGEEKNDLLDYLRSLQPEQVAQLSEFTSPELKDTILS 302
Query: 283 LVQNILSRFF--VDDASGSFMEQSVEGNIDNHPDNGDE--------FSDTVGTSRDYLAK 332
+V +L+ + + E + G + ++ E F + +RDYLA+
Sbjct: 303 VVHGLLATLSPKMHSKPSTMSENTTVGATNAGSEDCAEVLENSALQFQPVISLTRDYLAR 362
Query: 333 LLFWCMLLGHHLRGLENRLQLSCVVGL 359
LLFWCML L GL +L+ ++ L
Sbjct: 363 LLFWCMLWDTILEGLSVDWKLTDLLSL 389
>ref|NP_974078.1| expressed protein [Arabidopsis thaliana]
Length = 341
Score = 100 bits (248), Expect = 9e-20
Identities = 84/318 (26%), Positives = 151/318 (47%), Gaps = 54/318 (16%)
Query: 44 ADGFSSSSSSSSSDDVVSTRKSTFDRGFTVIANMLRRIEPLDNSVISKGVSDAARDSMKQ 103
A G SS SS+ SS TR+ R ++ ++ ++P + K ++M+Q
Sbjct: 61 AYGSSSDSSADSSTPPNGTRQQPKSRR-DILLEYVQNVKPEFMEMFVKRAPKHVVEAMRQ 119
Query: 104 TISTMLGLLPSDHFSVTVTVSKHPLHRLLVSSIITGYTLWNAEYRMSLTRNLDIASPCGA 163
T++ M+G LP F+VTVT L +L++S ++TGY NA+YR+ L ++L+ +
Sbjct: 120 TVTNMIGTLPPQFFAVTVTSVAENLAQLMMSVLMTGYMFRNAQYRLELQQSLEQVALPEP 179
Query: 164 RDSDCEKRSEILEVKGGGED---GGEIEVASDLGLKDLENCSSSPRVFGDLPPQALNYIQ 220
RD KGG ED G + V+ + + N S ++ A YI+
Sbjct: 180 RDQ-----------KGGDEDYAPGTQKNVSGE--VIRWNNVSGPEKI------DAKKYIE 220
Query: 221 QLQSELTNVKEELNARKQEMMQLEYDRGIRNNLLEYLRSLDPEMVTELSRPSSVEVEDII 280
L++E+ + ++ + +N +LEYL+SL+P+ + EL+ + +V +
Sbjct: 221 LLEAEIEELNRQVGRKSANQ---------QNEILEYLKSLEPQNLKELTSTAGEDVAVAM 271
Query: 281 HQLVQNILSRFFVDDASGSFMEQSVEGNIDNHPDNGDEFSDTVGTSRDYLAKLLFWCMLL 340
+ V+ +L+ V D + ++ N+ TS LAKLL+W M++
Sbjct: 272 NTFVKRLLA---VSDPN------QMKTNVTE-------------TSAADLAKLLYWLMVV 309
Query: 341 GHHLRGLENRLQLSCVVG 358
G+ +R +E R + V+G
Sbjct: 310 GYSIRNIEVRFDMERVLG 327
>gb|AAP21300.1| At1g63610 [Arabidopsis thaliana] gi|42562912|ref|NP_176549.3|
expressed protein [Arabidopsis thaliana]
gi|12324944|gb|AAG52423.1| unknown protein; 83181-85105
[Arabidopsis thaliana] gi|25404427|pir||B96661 unknown
protein, 83181-85105 [imported] - Arabidopsis thaliana
Length = 340
Score = 99.8 bits (247), Expect = 1e-19
Identities = 84/318 (26%), Positives = 151/318 (47%), Gaps = 55/318 (17%)
Query: 44 ADGFSSSSSSSSSDDVVSTRKSTFDRGFTVIANMLRRIEPLDNSVISKGVSDAARDSMKQ 103
A G SS SS+ SS TR+ R ++ ++ ++P + K ++M+Q
Sbjct: 61 AYGSSSDSSADSSTPPNGTRQPKSRRD--ILLEYVQNVKPEFMEMFVKRAPKHVVEAMRQ 118
Query: 104 TISTMLGLLPSDHFSVTVTVSKHPLHRLLVSSIITGYTLWNAEYRMSLTRNLDIASPCGA 163
T++ M+G LP F+VTVT L +L++S ++TGY NA+YR+ L ++L+ +
Sbjct: 119 TVTNMIGTLPPQFFAVTVTSVAENLAQLMMSVLMTGYMFRNAQYRLELQQSLEQVALPEP 178
Query: 164 RDSDCEKRSEILEVKGGGED---GGEIEVASDLGLKDLENCSSSPRVFGDLPPQALNYIQ 220
RD KGG ED G + V+ + + N S ++ A YI+
Sbjct: 179 RDQ-----------KGGDEDYAPGTQKNVSGE--VIRWNNVSGPEKI------DAKKYIE 219
Query: 221 QLQSELTNVKEELNARKQEMMQLEYDRGIRNNLLEYLRSLDPEMVTELSRPSSVEVEDII 280
L++E+ + ++ + +N +LEYL+SL+P+ + EL+ + +V +
Sbjct: 220 LLEAEIEELNRQVGRKSANQ---------QNEILEYLKSLEPQNLKELTSTAGEDVAVAM 270
Query: 281 HQLVQNILSRFFVDDASGSFMEQSVEGNIDNHPDNGDEFSDTVGTSRDYLAKLLFWCMLL 340
+ V+ +L+ V D + ++ N+ TS LAKLL+W M++
Sbjct: 271 NTFVKRLLA---VSDPN------QMKTNVTE-------------TSAADLAKLLYWLMVV 308
Query: 341 GHHLRGLENRLQLSCVVG 358
G+ +R +E R + V+G
Sbjct: 309 GYSIRNIEVRFDMERVLG 326
>dbj|BAD28454.1| seed maturation-like protein [Oryza sativa (japonica
cultivar-group)]
Length = 336
Score = 91.3 bits (225), Expect = 4e-17
Identities = 74/288 (25%), Positives = 136/288 (46%), Gaps = 52/288 (18%)
Query: 73 VIANMLRRIEPLDNSVISKGVSDAARDSMKQTISTMLGLLPSDHFSVTVTVSKHPLHRLL 132
++ ++ ++P + K D+M+QT++ M+G LP F+VTVT L +L+
Sbjct: 85 ILLEYVKNVQPEFMELFIKRAPPQVVDAMRQTVTNMIGTLPPQFFAVTVTTVAENLAQLM 144
Query: 133 VSSIITGYTLWNAEYRMSLTRNLD-IASPCGARDSDCEKRSEILEVKGGGEDGGEIEVAS 191
S ++TGY NA+YR+ L ++L+ IA P ++D + + K GE I
Sbjct: 145 YSVLMTGYMFRNAQYRLELQQSLEQIALPEPKEENDSADYAPGTQKKVTGE---VIRWNK 201
Query: 192 DLGLKDLENCSSSPRVFGDLPPQALNYIQQLQSELTNVKEELNARKQEMMQLEYDRGIRN 251
G + ++ A+ YI+ L++E+ + ++ ARK N
Sbjct: 202 TTGPEKID---------------AVKYIELLEAEIDELSHQV-ARKSSQGS--------N 237
Query: 252 NLLEYLRSLDPEMVTELSRPSSVEVEDIIHQLVQNILSRFFVDDASGSFMEQSVEGNIDN 311
LLEYL++L+P+ + EL+ + +V ++ ++ +L+ V D +
Sbjct: 238 ELLEYLKTLEPQNLKELASSAGEDVVFAMNAFIKRLLA---VSDPA-------------- 280
Query: 312 HPDNGDEFSDTVG-TSRDYLAKLLFWCMLLGHHLRGLENRLQLSCVVG 358
+ TV TS + LA L+FW M++G+ +R +E R + V+G
Sbjct: 281 ------QMKTTVSETSANQLANLMFWLMIVGYSMRNIEVRFDMERVLG 322
Score = 33.9 bits (76), Expect = 7.8
Identities = 26/104 (25%), Positives = 49/104 (47%), Gaps = 23/104 (22%)
Query: 250 RNNLLEYLRSLDPEMVTELSRPSSVEVEDIIHQLVQNILSRFFVDDASGSFMEQSVEGNI 309
R+ LLEY++++ PE + + + +V D + Q V N++ G+ Q
Sbjct: 83 RDILLEYVKNVQPEFMELFIKRAPPQVVDAMRQTVTNMI---------GTLPPQF----- 128
Query: 310 DNHPDNGDEFSDTVGTSRDYLAKLLFWCMLLGHHLRGLENRLQL 353
F+ TV T + LA+L++ ++ G+ R + RL+L
Sbjct: 129 ---------FAVTVTTVAENLAQLMYSVLMTGYMFRNAQYRLEL 163
>gb|AAF19695.1| F2K11.3 [Arabidopsis thaliana]
Length = 222
Score = 62.0 bits (149), Expect = 3e-08
Identities = 39/114 (34%), Positives = 64/114 (55%), Gaps = 5/114 (4%)
Query: 44 ADGFSSSSSSSSSDDVVSTR-KSTFDRGFTVIANMLRRIEPLDNSVISKGVSDAARDSMK 102
A G SS SS+ SS TR KS D ++ ++ ++P + K ++M+
Sbjct: 61 AYGSSSDSSADSSTPPNGTRPKSRRD----ILLEYVQNVKPEFMEMFVKRAPKHVVEAMR 116
Query: 103 QTISTMLGLLPSDHFSVTVTVSKHPLHRLLVSSIITGYTLWNAEYRMSLTRNLD 156
QT++ M+G LP F+VTVT L +L++S ++TGY NA+YR+ L ++L+
Sbjct: 117 QTVTNMIGTLPPQFFAVTVTSVAENLAQLMMSVLMTGYMFRNAQYRLELQQSLE 170
>gb|AAF75756.1| unknown [Nostoc sp. PCC 7120] gi|17132945|dbj|BAB75510.1| alr3811
[Nostoc sp. PCC 7120] gi|25531703|pir||AD2282
hypothetical protein alr3811 [imported] - Nostoc sp.
(strain PCC 7120) gi|17231303|ref|NP_487851.1|
hypothetical protein alr3811 [Nostoc sp. PCC 7120]
Length = 157
Score = 60.8 bits (146), Expect = 6e-08
Identities = 38/94 (40%), Positives = 53/94 (55%), Gaps = 4/94 (4%)
Query: 61 STRKSTFDRGFTVIANML----RRIEPLDNSVISKGVSDAARDSMKQTISTMLGLLPSDH 116
S R S F + AN+L + + P + +SK S M++ I +LG LP +H
Sbjct: 48 SNRVSEFFNSDSETANLLWQYVKSLSPETVTQLSKPTSPEVFQVMERNIIGLLGNLPPEH 107
Query: 117 FSVTVTVSKHPLHRLLVSSIITGYTLWNAEYRMS 150
F VT+T S+ L RLL S++I+GY L NAE RMS
Sbjct: 108 FGVTITTSREHLGRLLASAMISGYFLRNAEQRMS 141
Score = 56.6 bits (135), Expect = 1e-06
Identities = 42/143 (29%), Positives = 67/143 (46%), Gaps = 26/143 (18%)
Query: 218 YIQQLQSELTNVKEELNARKQEMMQLEY---DRGIRNNLLEYLRSLDPEMVTELSRPSSV 274
Y++ L S + +E Q E+ D N L +Y++SL PE VT+LS+P+S
Sbjct: 27 YLKPLSSHPPVLLQEEKVSNQSNRVSEFFNSDSETANLLWQYVKSLSPETVTQLSKPTSP 86
Query: 275 EVEDIIHQLVQNILSRFFVDDASGSFMEQSVEGNIDNHPDNGDEFSDTVGTSRDYLAKLL 334
EV + ME+++ G + N P + F T+ TSR++L +LL
Sbjct: 87 EVFQV---------------------MERNIIGLLGNLPP--EHFGVTITTSREHLGRLL 123
Query: 335 FWCMLLGHHLRGLENRLQLSCVV 357
M+ G+ LR E R+ V+
Sbjct: 124 ASAMISGYFLRNAEQRMSFETVL 146
>ref|ZP_00162200.2| hypothetical protein Avar03001506 [Anabaena variabilis ATCC 29413]
Length = 157
Score = 60.8 bits (146), Expect = 6e-08
Identities = 38/94 (40%), Positives = 53/94 (55%), Gaps = 4/94 (4%)
Query: 61 STRKSTFDRGFTVIANML----RRIEPLDNSVISKGVSDAARDSMKQTISTMLGLLPSDH 116
S R S F + AN+L + + P + +SK S M++ I +LG LP +H
Sbjct: 48 SNRVSEFFNSDSEAANLLWQYVKSLSPETVTQLSKPTSPEVFQVMERNIIGLLGNLPPEH 107
Query: 117 FSVTVTVSKHPLHRLLVSSIITGYTLWNAEYRMS 150
F VT+T S+ L RLL S++I+GY L NAE RMS
Sbjct: 108 FGVTITTSREHLGRLLASAMISGYFLRNAEQRMS 141
Score = 56.6 bits (135), Expect = 1e-06
Identities = 42/143 (29%), Positives = 67/143 (46%), Gaps = 26/143 (18%)
Query: 218 YIQQLQSELTNVKEELNARKQEMMQLEY---DRGIRNNLLEYLRSLDPEMVTELSRPSSV 274
Y++ L S + +E Q E+ D N L +Y++SL PE VT+LS+P+S
Sbjct: 27 YLKPLSSHPPVLLQEEKVSNQSNRVSEFFNSDSEAANLLWQYVKSLSPETVTQLSKPTSP 86
Query: 275 EVEDIIHQLVQNILSRFFVDDASGSFMEQSVEGNIDNHPDNGDEFSDTVGTSRDYLAKLL 334
EV + ME+++ G + N P + F T+ TSR++L +LL
Sbjct: 87 EVFQV---------------------MERNIIGLLGNLPP--EHFGVTITTSREHLGRLL 123
Query: 335 FWCMLLGHHLRGLENRLQLSCVV 357
M+ G+ LR E R+ V+
Sbjct: 124 ASAMISGYFLRNAEQRMSFETVL 146
>ref|ZP_00109571.1| hypothetical protein Npun02003207 [Nostoc punctiforme PCC 73102]
Length = 114
Score = 58.2 bits (139), Expect = 4e-07
Identities = 30/73 (41%), Positives = 47/73 (64%)
Query: 78 LRRIEPLDNSVISKGVSDAARDSMKQTISTMLGLLPSDHFSVTVTVSKHPLHRLLVSSII 137
++ + P + +SK S M++ I+ +LG LPS+HF +TV+ S+ L RLL S++I
Sbjct: 26 VKSLSPETVTQLSKPTSAEVFQVMERNITGLLGNLPSEHFGITVSTSRESLGRLLASAMI 85
Query: 138 TGYTLWNAEYRMS 150
+GY L NAE RM+
Sbjct: 86 SGYFLRNAEQRMN 98
Score = 53.9 bits (128), Expect = 7e-06
Identities = 36/102 (35%), Positives = 54/102 (52%), Gaps = 24/102 (23%)
Query: 251 NNLL-EYLRSLDPEMVTELSRPSSVEVEDIIHQLVQNILSRFFVDDASGSFMEQSVEGNI 309
N+LL +Y++SL PE VT+LS+P+S EV + ME+++ G +
Sbjct: 19 NDLLWQYVKSLSPETVTQLSKPTSAEVFQV---------------------MERNITGLL 57
Query: 310 DNHPDNGDEFSDTVGTSRDYLAKLLFWCMLLGHHLRGLENRL 351
N P + F TV TSR+ L +LL M+ G+ LR E R+
Sbjct: 58 GNLP--SEHFGITVSTSRESLGRLLASAMISGYFLRNAEQRM 97
>ref|ZP_00517341.1| hypothetical protein CwatDRAFT_2590 [Crocosphaera watsonii WH 8501]
gi|67854274|gb|EAM49575.1| hypothetical protein
CwatDRAFT_2590 [Crocosphaera watsonii WH 8501]
Length = 97
Score = 55.5 bits (132), Expect = 3e-06
Identities = 30/78 (38%), Positives = 49/78 (62%)
Query: 78 LRRIEPLDNSVISKGVSDAARDSMKQTISTMLGLLPSDHFSVTVTVSKHPLHRLLVSSII 137
++ + P S +SK S M++ I +LG LPS+HF VTV+ S+ L +LL S+++
Sbjct: 9 VQSLSPETISQLSKPDSKEVFQVMERNIIGLLGNLPSEHFGVTVSTSRDHLGKLLASAMM 68
Query: 138 TGYTLWNAEYRMSLTRNL 155
+GY L NAE R++ ++L
Sbjct: 69 SGYFLRNAEQRLNFEKSL 86
Score = 54.3 bits (129), Expect = 6e-06
Identities = 35/100 (35%), Positives = 52/100 (52%), Gaps = 23/100 (23%)
Query: 252 NLLEYLRSLDPEMVTELSRPSSVEVEDIIHQLVQNILSRFFVDDASGSFMEQSVEGNIDN 311
+L +Y++SL PE +++LS+P S EV + ME+++ G + N
Sbjct: 4 SLWKYVQSLSPETISQLSKPDSKEVFQV---------------------MERNIIGLLGN 42
Query: 312 HPDNGDEFSDTVGTSRDYLAKLLFWCMLLGHHLRGLENRL 351
P + F TV TSRD+L KLL M+ G+ LR E RL
Sbjct: 43 LPS--EHFGVTVSTSRDHLGKLLASAMMSGYFLRNAEQRL 80
>ref|ZP_00326944.1| hypothetical protein Tery02002317 [Trichodesmium erythraeum IMS101]
Length = 113
Score = 53.9 bits (128), Expect = 7e-06
Identities = 29/80 (36%), Positives = 48/80 (59%)
Query: 78 LRRIEPLDNSVISKGVSDAARDSMKQTISTMLGLLPSDHFSVTVTVSKHPLHRLLVSSII 137
++ + P S +SK S M++ I +LG +PS+ F+V VT S+ L +LL S++I
Sbjct: 26 VQSMSPDTVSHLSKPTSQEVFQVMERNIVGLLGNIPSEQFNVNVTTSRENLGKLLASAMI 85
Query: 138 TGYTLWNAEYRMSLTRNLDI 157
+GY L NAE RM+ ++ +
Sbjct: 86 SGYFLRNAEQRMTFEKSFKV 105
Score = 48.5 bits (114), Expect = 3e-04
Identities = 32/101 (31%), Positives = 52/101 (50%), Gaps = 23/101 (22%)
Query: 251 NNLLEYLRSLDPEMVTELSRPSSVEVEDIIHQLVQNILSRFFVDDASGSFMEQSVEGNID 310
N L +Y++S+ P+ V+ LS+P+S EV + ME+++ G +
Sbjct: 20 NLLWQYVQSMSPDTVSHLSKPTSQEVFQV---------------------MERNIVGLLG 58
Query: 311 NHPDNGDEFSDTVGTSRDYLAKLLFWCMLLGHHLRGLENRL 351
N P ++F+ V TSR+ L KLL M+ G+ LR E R+
Sbjct: 59 NIPS--EQFNVNVTTSRENLGKLLASAMISGYFLRNAEQRM 97
>ref|NP_682814.1| hypothetical protein tll2024 [Thermosynechococcus elongatus BP-1]
gi|22295751|dbj|BAC09576.1| tll2024 [Thermosynechococcus
elongatus BP-1]
Length = 110
Score = 53.5 bits (127), Expect = 1e-05
Identities = 37/117 (31%), Positives = 57/117 (48%), Gaps = 23/117 (19%)
Query: 242 QLEYDRGIRNNLLEYLRSLDPEMVTELSRPSSVEVEDIIHQLVQNILSRFFVDDASGSFM 301
Q D G N LL+YL+S PE++T ++R S E+ II
Sbjct: 12 QSSTDDGQPNLLLKYLQSQSPEVLTRIARSVSPEIRQII--------------------- 50
Query: 302 EQSVEGNIDNHPDNGDEFSDTVGTSRDYLAKLLFWCMLLGHHLRGLENRLQLSCVVG 358
Q+V+G + P ++F+ + T RD LA LL M+ G+ LR +E R++L +G
Sbjct: 51 SQNVQGLVGGLP--SEDFNVQIATDRDNLAGLLASAMMTGYFLRQMEQRMELEMSLG 105
Score = 47.4 bits (111), Expect = 7e-04
Identities = 31/99 (31%), Positives = 51/99 (51%), Gaps = 1/99 (1%)
Query: 63 RKSTFDRGFTVIANMLRRIEPLDNSVISKGVSDAARDSMKQTISTMLGLLPSDHFSVTVT 122
+ ST D ++ L+ P + I++ VS R + Q + ++G LPS+ F+V +
Sbjct: 12 QSSTDDGQPNLLLKYLQSQSPEVLTRIARSVSPEIRQIISQNVQGLVGGLPSEDFNVQIA 71
Query: 123 VSKHPLHRLLVSSIITGYTLWNAEYRMSLTRNL-DIASP 160
+ L LL S+++TGY L E RM L +L D+ P
Sbjct: 72 TDRDNLAGLLASAMMTGYFLRQMEQRMELEMSLGDLPMP 110
>ref|ZP_00160351.2| hypothetical protein Avar03003448 [Anabaena variabilis ATCC 29413]
Length = 133
Score = 52.4 bits (124), Expect = 2e-05
Identities = 35/103 (33%), Positives = 55/103 (52%), Gaps = 23/103 (22%)
Query: 251 NNLLEYLRSLDPEMVTELSRPSSVEVEDIIHQLVQNILSRFFVDDASGSFMEQSVEGNID 310
N L EY++S+ P+ VT+LSRP S EV +I + V L GN+
Sbjct: 20 NGLWEYVQSMSPQTVTQLSRPGSREVLQLIQRAVVATL------------------GNLP 61
Query: 311 NHPDNGDEFSDTVGTSRDYLAKLLFWCMLLGHHLRGLENRLQL 353
+ ++F+ + TSR+ L++LL M+ G+ LR +E RL+L
Sbjct: 62 H-----EQFNTNITTSREELSQLLGAAMVDGYFLRNVEQRLEL 99
Score = 40.4 bits (93), Expect = 0.083
Identities = 20/77 (25%), Positives = 42/77 (53%)
Query: 78 LRRIEPLDNSVISKGVSDAARDSMKQTISTMLGLLPSDHFSVTVTVSKHPLHRLLVSSII 137
++ + P + +S+ S +++ + LG LP + F+ +T S+ L +LL ++++
Sbjct: 26 VQSMSPQTVTQLSRPGSREVLQLIQRAVVATLGNLPHEQFNTNITTSREELSQLLGAAMV 85
Query: 138 TGYTLWNAEYRMSLTRN 154
GY L N E R+ L ++
Sbjct: 86 DGYFLRNVEQRLELEKS 102
>dbj|BAB76875.1| all5176 [Nostoc sp. PCC 7120] gi|17232668|ref|NP_489216.1|
hypothetical protein all5176 [Nostoc sp. PCC 7120]
gi|25308196|pir||AH2452 hypothetical protein all5176
[imported] - Nostoc sp. (strain PCC 7120)
Length = 135
Score = 51.2 bits (121), Expect = 5e-05
Identities = 34/103 (33%), Positives = 55/103 (53%), Gaps = 23/103 (22%)
Query: 251 NNLLEYLRSLDPEMVTELSRPSSVEVEDIIHQLVQNILSRFFVDDASGSFMEQSVEGNID 310
N L EY++S+ P+ VT+LS+P S EV +I + V L GN+
Sbjct: 22 NGLWEYVQSMSPQTVTQLSKPGSREVLQLIQRAVVATL------------------GNLP 63
Query: 311 NHPDNGDEFSDTVGTSRDYLAKLLFWCMLLGHHLRGLENRLQL 353
+ ++F+ + TSR+ L++LL M+ G+ LR +E RL+L
Sbjct: 64 H-----EQFNTNITTSREELSQLLGAAMVDGYFLRNVEQRLEL 101
Score = 41.6 bits (96), Expect = 0.037
Identities = 21/77 (27%), Positives = 42/77 (54%)
Query: 78 LRRIEPLDNSVISKGVSDAARDSMKQTISTMLGLLPSDHFSVTVTVSKHPLHRLLVSSII 137
++ + P + +SK S +++ + LG LP + F+ +T S+ L +LL ++++
Sbjct: 28 VQSMSPQTVTQLSKPGSREVLQLIQRAVVATLGNLPHEQFNTNITTSREELSQLLGAAMV 87
Query: 138 TGYTLWNAEYRMSLTRN 154
GY L N E R+ L ++
Sbjct: 88 DGYFLRNVEQRLELEKS 104
>emb|CAE07510.1| conserved hypothetical protein [Synechococcus sp. WH 8102]
gi|33865529|ref|NP_897088.1| hypothetical protein
SYNW0995 [Synechococcus sp. WH 8102]
Length = 117
Score = 50.4 bits (119), Expect = 8e-05
Identities = 23/67 (34%), Positives = 39/67 (57%)
Query: 89 ISKGVSDAARDSMKQTISTMLGLLPSDHFSVTVTVSKHPLHRLLVSSIITGYTLWNAEYR 148
++K S+ +D ++ + +LG+LP +HF V VT ++ L +L S+++TGY L E R
Sbjct: 35 VAKSASNDIQDIIRHNVQGLLGMLPGEHFEVKVTANRDNLANMLASAMMTGYFLRQMEQR 94
Query: 149 MSLTRNL 155
L L
Sbjct: 95 KELEETL 101
Score = 43.1 bits (100), Expect = 0.013
Identities = 27/103 (26%), Positives = 48/103 (46%), Gaps = 23/103 (22%)
Query: 251 NNLLEYLRSLDPEMVTELSRPSSVEVEDIIHQLVQNILSRFFVDDASGSFMEQSVEGNID 310
N+L++YL+ P+ + +++ +S +++DII VQ +L
Sbjct: 18 NSLIQYLQEQTPDTLQRVAKSASNDIQDIIRHNVQGLLGML------------------- 58
Query: 311 NHPDNGDEFSDTVGTSRDYLAKLLFWCMLLGHHLRGLENRLQL 353
G+ F V +RD LA +L M+ G+ LR +E R +L
Sbjct: 59 ----PGEHFEVKVTANRDNLANMLASAMMTGYFLRQMEQRKEL 97
>ref|NP_441676.1| hypothetical protein slr1674 [Synechocystis sp. PCC 6803]
gi|1653442|dbj|BAA18356.1| slr1674 [Synechocystis sp.
PCC 6803] gi|7446610|pir||S75897 hypothetical protein
slr1674 - Synechocystis sp. (strain PCC 6803)
Length = 116
Score = 50.4 bits (119), Expect = 8e-05
Identities = 26/72 (36%), Positives = 44/72 (61%)
Query: 78 LRRIEPLDNSVISKGVSDAARDSMKQTISTMLGLLPSDHFSVTVTVSKHPLHRLLVSSII 137
++ + P + +S+ S M++ I +LG LP +HF VT++ S+ L RLL S+++
Sbjct: 27 VQELSPETIAQLSRPDSQEVFQVMERNIIGLLGNLPPEHFGVTISTSRENLGRLLASAMM 86
Query: 138 TGYTLWNAEYRM 149
+GY L NAE R+
Sbjct: 87 SGYFLRNAEQRL 98
Score = 48.9 bits (115), Expect = 2e-04
Identities = 32/102 (31%), Positives = 50/102 (48%), Gaps = 23/102 (22%)
Query: 250 RNNLLEYLRSLDPEMVTELSRPSSVEVEDIIHQLVQNILSRFFVDDASGSFMEQSVEGNI 309
+++L Y++ L PE + +LSRP S EV + ME+++ G +
Sbjct: 20 KDSLWTYVQELSPETIAQLSRPDSQEVFQV---------------------MERNIIGLL 58
Query: 310 DNHPDNGDEFSDTVGTSRDYLAKLLFWCMLLGHHLRGLENRL 351
N P + F T+ TSR+ L +LL M+ G+ LR E RL
Sbjct: 59 GNLPP--EHFGVTISTSRENLGRLLASAMMSGYFLRNAEQRL 98
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.318 0.135 0.390
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 608,934,010
Number of Sequences: 2540612
Number of extensions: 24988874
Number of successful extensions: 113572
Number of sequences better than 10.0: 92
Number of HSP's better than 10.0 without gapping: 43
Number of HSP's successfully gapped in prelim test: 49
Number of HSP's that attempted gapping in prelim test: 113410
Number of HSP's gapped (non-prelim): 157
length of query: 360
length of database: 863,360,394
effective HSP length: 129
effective length of query: 231
effective length of database: 535,621,446
effective search space: 123728554026
effective search space used: 123728554026
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 76 (33.9 bits)
Lotus: description of TM0081b.1