
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0037a.11
(685 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC222193 UP|Q39873 (Q39873) Lea protein precursor, complete 42 0.001
NP005117 51 kDa seed maturation protein 42 0.001
BF010312 37 0.029
TC216024 similar to GB|AAM78036.1|21928015|AY125526 At2g01100/F2... 37 0.029
TC214018 weakly similar to UP|Q9FXI1 (Q9FXI1) F6F9.12 protein, p... 35 0.14
BU765158 33 0.32
TC228949 similar to UP|Q945N1 (Q945N1) AT5g50310/MXI22_1, partia... 33 0.32
BE822052 similar to GP|4557063|gb|A expressed protein {Arabidops... 33 0.42
TC212457 UP|Q39871 (Q39871) Maturation polypeptide, complete 33 0.55
TC210126 similar to UP|NUCL_HUMAN (P19338) Nucleolin (Protein C2... 33 0.55
CF921716 33 0.55
TC217651 similar to GB|AAP68268.1|31711824|BT008829 At5g47680 {A... 33 0.55
TC230134 weakly similar to UP|NUCL_HUMAN (P19338) Nucleolin (Pro... 32 0.71
TC206039 32 0.93
TC225337 similar to UP|O65848 (O65848) Annexin, complete 32 0.93
CF922098 32 1.2
TC209358 similar to UP|P92987 (P92987) Myosin heavy chain-like p... 32 1.2
TC219825 weakly similar to UP|MID2_YEAST (P36027) Mating process... 32 1.2
TC232681 weakly similar to GB|AAP21377.1|30102918|BT006569 At1g4... 32 1.2
TC215450 similar to UP|P93488 (P93488) NAP1Ps, partial (30%) 31 2.1
>TC222193 UP|Q39873 (Q39873) Lea protein precursor, complete
Length = 1748
Score = 42.0 bits (97), Expect = 0.001
Identities = 63/274 (22%), Positives = 109/274 (38%), Gaps = 9/274 (3%)
Frame = +3
Query: 332 KMKEYLAQSAAAAKKRAAETEQKKK---NEGTSGSDNVRDPKRQKTSGAAGGKPLHQSTL 388
K K+Y + + AAKK QK K ++ T + +D QKT A G Q T
Sbjct: 663 KTKDYASDATDAAKKTKDYAAQKTKDYASDATDAAKKTKDYAAQKTKDYASGGA--QKTK 836
Query: 389 DSKSRPAEKKKGHDNVPPPQQDSSTLINRPSTPFGQAGPSSA--IGGEAPPPLLNLSDPH 446
D S A+K K + D++ ++ Q A + A D
Sbjct: 837 DYASGAAQKTKDY------ASDAAQRTKDHASDGAQKSKEYAGDVAQNAKDYAQKSKDYA 998
Query: 447 FNGLEFMTRTFDNQIHKDISGQGPPNIASMAIHHALSAASTVAGMAQCVKELIAAKNRFE 506
+ ++ + ++ HK + A A H + A+ + AQ K+ + +
Sbjct: 999 GDAVQNIKDYANDAAHKS------KDYAGDASHKSKEASDYASETAQKTKDYVGDAAQKS 1160
Query: 507 KKAADYKT-AYERAKTDAETANKKLKSAEEKC---AKLTEDLAASDLLLQKTKSLKEAIN 562
K+A++Y + A +R K A A K+ K A + A+ T+D A+ Q+TK + I
Sbjct: 1161KEASEYASDAAQRTKEYAGDATKRSKEASDHANDMARKTKDYASD--TAQRTKEKLQDIA 1334
Query: 563 DKHTAVQAKYQKLEKKYDRLNASILGRASLQYAQ 596
+ + + K AS + +A+ Q +Q
Sbjct: 1335SEAGQYSTEKAREMKDAAAEKASDIAKAAKQKSQ 1436
>NP005117 51 kDa seed maturation protein
Length = 1422
Score = 42.0 bits (97), Expect = 0.001
Identities = 69/287 (24%), Positives = 111/287 (38%), Gaps = 22/287 (7%)
Frame = +1
Query: 332 KMKEYLAQSAAAAKKRAAETEQKKKNEGTSGSD---NVRDPKRQKTSGAAGGKPLHQSTL 388
K K+Y + + AAKK QK K+ + SD N +D QKT A G Q T
Sbjct: 598 KTKDYASDATDAAKKTKDYAAQKTKDYASEASDVAQNTKDYAAQKTKDYASGGA--QKTK 771
Query: 389 DSKSRPAEKKKGHDNVPPPQQDSSTLINRPSTPFGQAGPSSAIGGEAPPPLLNLSDPHFN 448
D S A+K K + D++ ++ Q A D N
Sbjct: 772 DYASGGAQKTKDY------ASDAAQKTKDYASDGAQKSKEYA------------GDVALN 897
Query: 449 GLEFMTRTFDNQIHKDISGQGPPNI---ASMAI-----------HHALSAASTVAGMAQC 494
++ ++ KD +G N+ AS A+ H + A+ + A+
Sbjct: 898 AKDYAQKS------KDYAGDAAQNVKDYASDAVQKRKEYSGDASHKSKEASDYASETAKK 1059
Query: 495 VKELIAAKNRFEKKAADYKT-AYERAKTDAETANKKLKSAE----EKCAKLTEDLAASDL 549
K+ + + K AA+Y + A +R K A A K+ K A A+ T+D A+
Sbjct: 1060TKDYVGDAAQRSKGAAEYASDAAQRTKEYAGDATKRSKEASNDHANDMAQKTKDYASD-- 1233
Query: 550 LLQKTKSLKEAINDKHTAVQAKYQKLEKKYDRLNASILGRASLQYAQ 596
Q+TK + I + A+ + K AS + +A+ Q +Q
Sbjct: 1234TAQRTKEKLQDIASEAGQYSAEKAREMKDAAAEKASDIAKAAKQKSQ 1374
>BF010312
Length = 415
Score = 37.0 bits (84), Expect = 0.029
Identities = 17/39 (43%), Positives = 24/39 (60%), Gaps = 1/39 (2%)
Frame = +2
Query: 236 KVWQKKYFKVMESPEI-KNLFRDSDNNPLFPFYWTKNPR 273
++++ Y+KV + KN F D NPLFPFYW + PR
Sbjct: 167 RIFKTGYYKVTIRLVVGKNFFYDLAGNPLFPFYWKQCPR 283
>TC216024 similar to GB|AAM78036.1|21928015|AY125526 At2g01100/F23H14.7
{Arabidopsis thaliana;} , partial (14%)
Length = 1433
Score = 37.0 bits (84), Expect = 0.029
Identities = 21/70 (30%), Positives = 35/70 (50%)
Frame = +2
Query: 330 PIKMKEYLAQSAAAAKKRAAETEQKKKNEGTSGSDNVRDPKRQKTSGAAGGKPLHQSTLD 389
P+K ++ L +A +++ T+ KK++ S SDN D + +KTS + K S D
Sbjct: 263 PLKWEQKLEAAAETKERKLKATKHKKRSGSDSDSDNDSDDESRKTSKRSHRKHRKHSHYD 442
Query: 390 SKSRPAEKKK 399
S K+K
Sbjct: 443 SGDHEKRKEK 472
>TC214018 weakly similar to UP|Q9FXI1 (Q9FXI1) F6F9.12 protein, partial (13%)
Length = 460
Score = 34.7 bits (78), Expect = 0.14
Identities = 29/114 (25%), Positives = 52/114 (45%), Gaps = 10/114 (8%)
Frame = +1
Query: 482 LSAASTVAGMAQCVKELIAAKNRF---EKKAADYKTAYERAKTDAETANKKLKSAEEKC- 537
L VA +++C + L K+R E+ A+ K+ A+ A +LK E
Sbjct: 52 LEKEKAVADLSKCAENLEMTKSRLLETEQHLAEVKSQLTSAQRSNSLAETQLKCMTESYR 231
Query: 538 ---AKLTEDLAASDLLLQKTKSLKEAINDK---HTAVQAKYQKLEKKYDRLNAS 585
A+ E + L KT++L+ + D+ H AKY+++E++ R +S
Sbjct: 232 TIEARTKEFETELNHLQMKTETLENELEDEKKAHEEALAKYKEIEEQLQRNESS 393
>BU765158
Length = 420
Score = 33.5 bits (75), Expect = 0.32
Identities = 20/60 (33%), Positives = 29/60 (48%)
Frame = +2
Query: 624 KDGQVVGDDDISLDLLPQFDDESEPEEDGEDGNEQHRNEDQEKEDPQAGTSQGNNANNEN 683
KD V+ DD +L L + +S +ED E G+E+ + E EDP + A EN
Sbjct: 98 KDSPVISDDKSNL-LEDDAEGQSGKDEDDESGSEEDSCSESEVEDPADIKAARKEARKEN 274
>TC228949 similar to UP|Q945N1 (Q945N1) AT5g50310/MXI22_1, partial (27%)
Length = 1134
Score = 33.5 bits (75), Expect = 0.32
Identities = 21/80 (26%), Positives = 37/80 (46%), Gaps = 17/80 (21%)
Frame = +3
Query: 618 GWLKEIKDGQVVGDDDISLDL-----------------LPQFDDESEPEEDGEDGNEQHR 660
G + EIKD ++ DD SL+L + DD+ E E+D ED ++
Sbjct: 42 GGMMEIKDQEITLDDLYSLNLSKLDEWKCIIPASESEWVEASDDDEENEDDDEDESDGDS 221
Query: 661 NEDQEKEDPQAGTSQGNNAN 680
D++++D + + NA+
Sbjct: 222 LTDEDEDDEEEEEEEAQNAS 281
>BE822052 similar to GP|4557063|gb|A expressed protein {Arabidopsis
thaliana}, partial (28%)
Length = 668
Score = 33.1 bits (74), Expect = 0.42
Identities = 14/53 (26%), Positives = 27/53 (50%)
Frame = -1
Query: 630 GDDDISLDLLPQFDDESEPEEDGEDGNEQHRNEDQEKEDPQAGTSQGNNANNE 682
GD+D S + + D E +PE +G G+ + D++ +D G G + + +
Sbjct: 458 GDEDFSGEGEEEADPEDDPEANGAGGSNDDDDSDEDDDDDNDGDDDGEDEDEK 300
Score = 30.4 bits (67), Expect = 2.7
Identities = 16/52 (30%), Positives = 27/52 (51%), Gaps = 12/52 (23%)
Frame = -1
Query: 644 DESEPEEDGEDGNEQHRNEDQE------------KEDPQAGTSQGNNANNEN 683
DE E EED D N+Q+ E E ++DP+A + G+N ++++
Sbjct: 515 DEDEDEEDDHDTNDQNDEEGDEDFSGEGEEEADPEDDPEANGAGGSNDDDDS 360
Score = 30.4 bits (67), Expect = 2.7
Identities = 17/46 (36%), Positives = 28/46 (59%), Gaps = 1/46 (2%)
Frame = -1
Query: 631 DDDISLDLLPQFDDESEPEEDGEDGNEQH-RNEDQEKEDPQAGTSQ 675
DDD D DD+++ ++DGED +E+ +E+ E+E PQ T +
Sbjct: 371 DDDSDED----DDDDNDGDDDGEDEDEKEGEDEEDEEEVPQPPTKK 246
>TC212457 UP|Q39871 (Q39871) Maturation polypeptide, complete
Length = 1741
Score = 32.7 bits (73), Expect = 0.55
Identities = 20/69 (28%), Positives = 31/69 (43%)
Frame = +1
Query: 332 KMKEYLAQSAAAAKKRAAETEQKKKNEGTSGSDNVRDPKRQKTSGAAGGKPLHQSTLDSK 391
++K+ A++A AAK + AET + KN+ D +D + T A Q T +K
Sbjct: 1039 EIKDRAAETAEAAKNKTAETAEVTKNKALEMKDAAKDRTAETTDAA------KQKTAQAK 1200
Query: 392 SRPAEKKKG 400
E G
Sbjct: 1201 ENTKENVSG 1227
>TC210126 similar to UP|NUCL_HUMAN (P19338) Nucleolin (Protein C23), partial
(7%)
Length = 735
Score = 32.7 bits (73), Expect = 0.55
Identities = 13/33 (39%), Positives = 24/33 (72%)
Frame = +3
Query: 643 DDESEPEEDGEDGNEQHRNEDQEKEDPQAGTSQ 675
DD+++ EED +DG ++ ED+E+++ + TSQ
Sbjct: 492 DDDNDNEEDDDDGEDEEEEEDEEEDEEE--TSQ 584
Score = 29.6 bits (65), Expect = 4.6
Identities = 11/35 (31%), Positives = 22/35 (62%)
Frame = +3
Query: 643 DDESEPEEDGEDGNEQHRNEDQEKEDPQAGTSQGN 677
DD+ + +ED EDG++Q + + E+ + G +G+
Sbjct: 342 DDDKDGDEDDEDGHDQDDDGEDEEFSGEEGDEEGD 446
>CF921716
Length = 367
Score = 32.7 bits (73), Expect = 0.55
Identities = 26/96 (27%), Positives = 36/96 (37%), Gaps = 7/96 (7%)
Frame = +3
Query: 350 ETEQKKKNEGTSGSDNVRDPKRQKTSGAAGGKPLHQSTLDSKSRPAEKKKGHDNVPPPQQ 409
E ++KK + + KR+K G A + + K +KKKG PPP +
Sbjct: 9 EKKKKKNFYPPTKKKTHKKKKRKKKKGGAPPRKKKKKKKKKKGGKKKKKKGGKTKPPPGK 188
Query: 410 DSS-----TLINRPSTPFGQAG--PSSAIGGEAPPP 438
+ TP G G P GG PPP
Sbjct: 189 KKDGPPPPAGKKKKKTPPGGGGGRPPKTPGGGGPPP 296
>TC217651 similar to GB|AAP68268.1|31711824|BT008829 At5g47680 {Arabidopsis
thaliana;} , partial (40%)
Length = 732
Score = 32.7 bits (73), Expect = 0.55
Identities = 28/117 (23%), Positives = 53/117 (44%), Gaps = 1/117 (0%)
Frame = +2
Query: 470 PPNIASMAIHHALSAASTVAGMAQCVKELIAAKNRFEKKAADYK-TAYERAKTDAETANK 528
PP A+ +++ ++ +A + RFE K A+ K A E+ + D E +
Sbjct: 86 PPQTTETENRKTDQTAADPPVLSKNARKKLAKQQRFEAKKAEKKAAAKEQKRRDVE---R 256
Query: 529 KLKSAEEKCAKLTEDLAASDLLLQKTKSLKEAINDKHTAVQAKYQKLEKKYDRLNAS 585
K K EE A ++E+ +K K ++ N + ++ + Q+ E K +RL +
Sbjct: 257 KRKEWEESLAGVSEE--------EKAKLIESRRNLRKERMEKRSQEKEDKRERLTVA 403
>TC230134 weakly similar to UP|NUCL_HUMAN (P19338) Nucleolin (Protein C23),
partial (8%)
Length = 581
Score = 32.3 bits (72), Expect = 0.71
Identities = 16/49 (32%), Positives = 26/49 (52%)
Frame = +2
Query: 622 EIKDGQVVGDDDISLDLLPQFDDESEPEEDGEDGNEQHRNEDQEKEDPQ 670
E G DDD D DD+ E+DG++ ++ +ED+++E PQ
Sbjct: 401 EANGGGESDDDDEDDD----DDDDDNDEDDGDEDDDDEEDEDEDEETPQ 535
Score = 28.9 bits (63), Expect = 7.9
Identities = 15/49 (30%), Positives = 26/49 (52%), Gaps = 4/49 (8%)
Frame = +2
Query: 631 DDDISLDLLPQFDDESEPEEDGEDGNEQ----HRNEDQEKEDPQAGTSQ 675
DDD + + DD+ E ++D +D N++ ++D+E ED T Q
Sbjct: 389 DDDPEANGGGESDDDDEDDDDDDDDNDEDDGDEDDDDEEDEDEDEETPQ 535
>TC206039
Length = 834
Score = 32.0 bits (71), Expect = 0.93
Identities = 18/75 (24%), Positives = 33/75 (44%), Gaps = 3/75 (4%)
Frame = -2
Query: 332 KMKEYLAQSAAAAKKRAAETEQKKKNEGTSGSDNVRDPKRQKTSGAAGGKPLHQSTLDSK 391
K K+ + KK+ + ++KKK + + K++K GG+P + +
Sbjct: 269 KKKKKKKKKKKKKKKKKKKKKKKKKKKKKIKKKKKKKKKKKKXXXGGGGEPKXXKKKEKR 90
Query: 392 SRP---AEKKKGHDN 403
RP +KK+G N
Sbjct: 89 GRPRGGXKKKRGRKN 45
Score = 31.2 bits (69), Expect = 1.6
Identities = 22/76 (28%), Positives = 32/76 (41%)
Frame = -3
Query: 332 KMKEYLAQSAAAAKKRAAETEQKKKNEGTSGSDNVRDPKRQKTSGAAGGKPLHQSTLDSK 391
K K+ + KK+ + ++KKK G N + KR+K GA GG +
Sbjct: 217 KKKKKKKKKKKKLKKKKKKKKKKKKX-XXGGGGNQKXXKRRKKGGAPGG---GXKKSGGE 50
Query: 392 SRPAEKKKGHDNVPPP 407
P +K N PPP
Sbjct: 49 KTPGANQKKPRNPPPP 2
>TC225337 similar to UP|O65848 (O65848) Annexin, complete
Length = 1222
Score = 32.0 bits (71), Expect = 0.93
Identities = 20/64 (31%), Positives = 28/64 (43%), Gaps = 3/64 (4%)
Frame = -2
Query: 116 VLPCGEDDCVLLRKEPADESPDFFFVYGYFFLD---LNIKLPFSPFICHVLSFLNVAPCQ 172
V+ + C L P + PD FF+ F LD NI F F ++S + + C
Sbjct: 708 VISVAVESCQKLGLAPCSKDPDGFFMVALFLLDRFMKNISFSFGQFCIDLISTIPI--CG 535
Query: 173 LQPN 176
QPN
Sbjct: 534 HQPN 523
>CF922098
Length = 263
Score = 31.6 bits (70), Expect = 1.2
Identities = 20/51 (39%), Positives = 25/51 (48%), Gaps = 5/51 (9%)
Frame = +3
Query: 393 RPAEKKKGHDNVPPPQQDSSTLINRPSTPFGQAGPSSAIGGE-----APPP 438
+P +KKKG N PPPQ++ R TP+G GGE PPP
Sbjct: 21 KPPKKKKGQKNPPPPQKE------RGKTPWG--------GGENKKFPTPPP 131
>TC209358 similar to UP|P92987 (P92987) Myosin heavy chain-like protein,
partial (17%)
Length = 740
Score = 31.6 bits (70), Expect = 1.2
Identities = 14/45 (31%), Positives = 27/45 (59%)
Frame = +1
Query: 508 KAADYKTAYERAKTDAETANKKLKSAEEKCAKLTEDLAASDLLLQ 552
K AD + A E+ + +A T+NKK+ +++ + D+ + LLL+
Sbjct: 190 KLADKQAALEKIQWEAMTSNKKVDKLQDELGSMQADITSFTLLLE 324
>TC219825 weakly similar to UP|MID2_YEAST (P36027) Mating process protein
MID2 (Serine-rich protein SMS1) (Protein kinase A
interference protein), partial (19%)
Length = 862
Score = 31.6 bits (70), Expect = 1.2
Identities = 24/78 (30%), Positives = 39/78 (49%)
Frame = -2
Query: 591 SLQYAQGFLAAKEQINVVEPGFDLSRIGWLKEIKDGQVVGDDDISLDLLPQFDDESEPEE 650
+L+ G L E ++ +E G D E+ +G GD D+ + D++SE EE
Sbjct: 438 ALEEVVGLLHGGEAVDGLEDGDDGG------EVPEG---GDVDVGD*KESEDDEDSEEEE 286
Query: 651 DGEDGNEQHRNEDQEKED 668
+G G + +EDQ +ED
Sbjct: 285 EGVVGVDGEDHEDQGEED 232
>TC232681 weakly similar to GB|AAP21377.1|30102918|BT006569 At1g47970
{Arabidopsis thaliana;} , partial (39%)
Length = 867
Score = 31.6 bits (70), Expect = 1.2
Identities = 14/42 (33%), Positives = 26/42 (61%), Gaps = 1/42 (2%)
Frame = +1
Query: 643 DDESEPEEDGEDGNEQHRNEDQEKE-DPQAGTSQGNNANNEN 683
D+E + E+D DG + +ED+E+E D Q G ++ N+++
Sbjct: 232 DEEDDDEDDAPDGGDDDDDEDEEEESDVQRGGEPDDDDNDDD 357
Score = 29.6 bits (65), Expect = 4.6
Identities = 14/41 (34%), Positives = 22/41 (53%)
Frame = +1
Query: 630 GDDDISLDLLPQFDDESEPEEDGEDGNEQHRNEDQEKEDPQ 670
GDDD D + D + E D +D ++ +ED+E E+ Q
Sbjct: 271 GDDDDDEDEEEESDVQRGGEPDDDDNDDDDEDEDEEDEEEQ 393
>TC215450 similar to UP|P93488 (P93488) NAP1Ps, partial (30%)
Length = 672
Score = 30.8 bits (68), Expect = 2.1
Identities = 24/96 (25%), Positives = 38/96 (39%), Gaps = 13/96 (13%)
Frame = +3
Query: 600 AAKEQINVVEPGFDLSR----------IGWLK-EIKDGQVVGD--DDISLDLLPQFDDES 646
AA E N +E +D+ + W E G GD DD + + + DDE
Sbjct: 78 AAXELQNQMEQDYDIGSTIRDKIIPHAVSWFTGEAIQGDEFGDLEDDEDDEDIEEDDDEE 257
Query: 647 EPEEDGEDGNEQHRNEDQEKEDPQAGTSQGNNANNE 682
E E++ +D +E E + K+ G+ E
Sbjct: 258 EEEDEDDDDDEDDEEESKTKKKKSGRAQLGDGQQGE 365
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.313 0.131 0.378
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 29,896,032
Number of Sequences: 63676
Number of extensions: 485105
Number of successful extensions: 2881
Number of sequences better than 10.0: 100
Number of HSP's better than 10.0 without gapping: 2660
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 2814
length of query: 685
length of database: 12,639,632
effective HSP length: 104
effective length of query: 581
effective length of database: 6,017,328
effective search space: 3496067568
effective search space used: 3496067568
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 62 (28.5 bits)
Lotus: description of TM0037a.11