
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0550.6
(680 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
NP005117 51 kDa seed maturation protein 44 2e-04
TC222193 UP|Q39873 (Q39873) Lea protein precursor, complete 43 4e-04
TC216024 similar to GB|AAM78036.1|21928015|AY125526 At2g01100/F2... 38 0.017
BF010312 37 0.029
CF921661 36 0.049
TC216004 UP|TF2B_SOYBN (P48513) Transcription initiation factor ... 35 0.14
CK606526 34 0.24
TC228949 similar to UP|Q945N1 (Q945N1) AT5g50310/MXI22_1, partia... 33 0.32
BU765158 33 0.32
BE822052 similar to GP|4557063|gb|A expressed protein {Arabidops... 33 0.41
TC216756 similar to PIR|T05768|T05768 subtilisin-like proteinase... 33 0.41
TC225336 similar to UP|ANX4_FRAAN (P51074) Annexin-like protein ... 33 0.54
TC210126 similar to UP|NUCL_HUMAN (P19338) Nucleolin (Protein C2... 33 0.54
TC230134 weakly similar to UP|NUCL_HUMAN (P19338) Nucleolin (Pro... 32 0.71
TC225337 similar to UP|O65848 (O65848) Annexin, complete 32 0.92
TC212457 UP|Q39871 (Q39871) Maturation polypeptide, complete 32 0.92
TC232681 weakly similar to GB|AAP21377.1|30102918|BT006569 At1g4... 32 1.2
TC209358 similar to UP|P92987 (P92987) Myosin heavy chain-like p... 32 1.2
TC206039 32 1.2
AI959820 similar to GP|14335118|gb| At1g30200/F12P21_1 {Arabidop... 32 1.2
>NP005117 51 kDa seed maturation protein
Length = 1422
Score = 44.3 bits (103), Expect = 2e-04
Identities = 68/273 (24%), Positives = 105/273 (37%), Gaps = 8/273 (2%)
Frame = +1
Query: 327 KMKEYLAQSAAAAKKRAAETEQKKKNEGTSGSD---NVRDPKRQKTSSAAGGRPLHQSTL 383
K K+Y + + AAKK QK K+ + SD N +D QKT A G Q T
Sbjct: 598 KTKDYASDATDAAKKTKDYAAQKTKDYASEASDVAQNTKDYAAQKTKDYASGGA--QKTK 771
Query: 384 DPRSHPAEKKKGHDNVPPPQQDSSALINRPPTPFNQAGPSLAIGGEAPPPLLNLSDPHFN 443
D S A+K K + + + A + +L A D N
Sbjct: 772 DYASGGAQKTKDYASDAAQKTKDYASDGAQKSKEYAGDVALNAKDYAQKSKDYAGDAAQN 951
Query: 444 GLEFMTRTFDNRIHKDISGQGPPNIASVAIHHALSAASTVAGMAQCVKELISAKNRYEKK 503
++ + R K+ SG A H + A+ + A+ K+ + + K
Sbjct: 952 VKDYASDAVQKR--KEYSGD--------ASHKSKEASDYASETAKKTKDYVGDAAQRSKG 1101
Query: 504 AADYKT-AYERAKTDAETANKKLKSAE----EKCAKLTEDLAASDLLLQKTKSLKEAIND 558
AA+Y + A +R K A A K+ K A A+ T+D A+ Q+TK + I
Sbjct: 1102AAEYASDAAQRTKEYAGDATKRSKEASNDHANDMAQKTKDYASD--TAQRTKEKLQDIAS 1275
Query: 559 KHTAIQAKYQKLEKKYDRLNASIIGRASLQYDQ 591
+ A+ + K AS I +A+ Q Q
Sbjct: 1276EAGQYSAEKAREMKDAAAEKASDIAKAAKQKSQ 1374
>TC222193 UP|Q39873 (Q39873) Lea protein precursor, complete
Length = 1748
Score = 43.1 bits (100), Expect = 4e-04
Identities = 65/273 (23%), Positives = 106/273 (38%), Gaps = 8/273 (2%)
Frame = +3
Query: 327 KMKEYLAQSAAAAKKRAAETEQKKK---NEGTSGSDNVRDPKRQKTSSAAGGRPLHQSTL 383
K K+Y + + AAKK QK K ++ T + +D QKT A G Q T
Sbjct: 663 KTKDYASDATDAAKKTKDYAAQKTKDYASDATDAAKKTKDYAAQKTKDYASGGA--QKTK 836
Query: 384 DPRSHPAEKKKGHDNVPPPQQDSSALINRPPTPFNQAGPSLAIGGEAPPPLLNLSDPHFN 443
D S A+K K + S A + A S G+ N D
Sbjct: 837 DYASGAAQKTKDY--------ASDAAQRTKDHASDGAQKSKEYAGDVAQ---NAKDYAQK 983
Query: 444 GLEFMTRTFDN-RIHKDISGQGPPNIASVAIHHALSAASTVAGMAQCVKELISAKNRYEK 502
++ N + + + + + A A H + A+ + AQ K+ + + K
Sbjct: 984 SKDYAGDAVQNIKDYANDAAHKSKDYAGDASHKSKEASDYASETAQKTKDYVGDAAQKSK 1163
Query: 503 KAADYKT-AYERAKTDAETANKKLKSAEEKC---AKLTEDLAASDLLLQKTKSLKEAIND 558
+A++Y + A +R K A A K+ K A + A+ T+D A+ Q+TK + I
Sbjct: 1164EASEYASDAAQRTKEYAGDATKRSKEASDHANDMARKTKDYASD--TAQRTKEKLQDIAS 1337
Query: 559 KHTAIQAKYQKLEKKYDRLNASIIGRASLQYDQ 591
+ + + K AS I +A+ Q Q
Sbjct: 1338EAGQYSTEKAREMKDAAAEKASDIAKAAKQKSQ 1436
>TC216024 similar to GB|AAM78036.1|21928015|AY125526 At2g01100/F23H14.7
{Arabidopsis thaliana;} , partial (14%)
Length = 1433
Score = 37.7 bits (86), Expect = 0.017
Identities = 20/70 (28%), Positives = 35/70 (49%)
Frame = +2
Query: 325 PIKMKEYLAQSAAAAKKRAAETEQKKKNEGTSGSDNVRDPKRQKTSSAAGGRPLHQSTLD 384
P+K ++ L +A +++ T+ KK++ S SDN D + +KTS + + S D
Sbjct: 263 PLKWEQKLEAAAETKERKLKATKHKKRSGSDSDSDNDSDDESRKTSKRSHRKHRKHSHYD 442
Query: 385 PRSHPAEKKK 394
H K+K
Sbjct: 443 SGDHEKRKEK 472
>BF010312
Length = 415
Score = 37.0 bits (84), Expect = 0.029
Identities = 17/39 (43%), Positives = 24/39 (60%), Gaps = 1/39 (2%)
Frame = +2
Query: 231 KVWQKKYFKVMESPEI-KNLFRDSDNNPLFPFYWTKNPR 268
++++ Y+KV + KN F D NPLFPFYW + PR
Sbjct: 167 RIFKTGYYKVTIRLVVGKNFFYDLAGNPLFPFYWKQCPR 283
>CF921661
Length = 293
Score = 36.2 bits (82), Expect = 0.049
Identities = 28/102 (27%), Positives = 40/102 (38%), Gaps = 8/102 (7%)
Frame = +2
Query: 340 KKRAAETEQKKKNEGTSGSDNVRDPKRQKTSSAAG---GRPLHQSTLDPRSHPAEKKKGH 396
+K+ KKK +G G + P+ S G G P + P +P KG
Sbjct: 2 QKKKGGLPLKKKQKGLWGKGQKKTPRGPPNSGEKGKKKGTPRY-----PHGNPPPPPKGM 166
Query: 397 DNVPP-----PQQDSSALINRPPTPFNQAGPSLAIGGEAPPP 433
+ PP P + + +PP NQ GP G+ PPP
Sbjct: 167 EKPPPQGKKTPGKKKKKGLCKPPKTENQIGPLERGMGKFPPP 292
>TC216004 UP|TF2B_SOYBN (P48513) Transcription initiation factor IIB (General
transcription factor TFIIB), partial (34%)
Length = 416
Score = 34.7 bits (78), Expect = 0.14
Identities = 26/103 (25%), Positives = 47/103 (45%), Gaps = 1/103 (0%)
Frame = -3
Query: 380 QSTLDPRSHPAEKKKGHDNVPPPQQDSSALINRPPT-PFNQAGPSLAIGGEAPPPLLNLS 438
+S L+PR +++ ++ PPP + + RPP+ GP +G +P L +
Sbjct: 411 RSGLEPRFCQRPREEERNSPPPPPLGLAMTVERPPSVRRGLEGPPTRLGSLSPDSLAKVR 232
Query: 439 DPHFNGLEFMTRTFDNRIHKDISGQGPPNIASVAIHHALSAAS 481
+ +E+ +RT + H +S N SV + +L AS
Sbjct: 231 HSEVSSMEWDSRTRPHSEH-TVSPAEWSNTTSVCLLQSLQNAS 106
>CK606526
Length = 513
Score = 33.9 bits (76), Expect = 0.24
Identities = 22/56 (39%), Positives = 27/56 (47%), Gaps = 2/56 (3%)
Frame = +3
Query: 380 QSTLDPRSHPAEKKKGHDNVP--PPQQDSSALINRPPTPFNQAGPSLAIGGEAPPP 433
+ TL P+ P KK G P PPQ + + PPTP + P GG APPP
Sbjct: 36 EKTLPPKQTPGGKKGGAPPEPRAPPQGEKT--FGSPPTPNKKPPPENR-GGGAPPP 194
>TC228949 similar to UP|Q945N1 (Q945N1) AT5g50310/MXI22_1, partial (27%)
Length = 1134
Score = 33.5 bits (75), Expect = 0.32
Identities = 21/80 (26%), Positives = 37/80 (46%), Gaps = 17/80 (21%)
Frame = +3
Query: 613 GWLKEIKDGQVVGDDDISLDL-----------------LPQFDDESEPEEDGEDGNEQHR 655
G + EIKD ++ DD SL+L + DD+ E E+D ED ++
Sbjct: 42 GGMMEIKDQEITLDDLYSLNLSKLDEWKCIIPASESEWVEASDDDEENEDDDEDESDGDS 221
Query: 656 NEDQEKEDPQAGTSQGNNAN 675
D++++D + + NA+
Sbjct: 222 LTDEDEDDEEEEEEEAQNAS 281
>BU765158
Length = 420
Score = 33.5 bits (75), Expect = 0.32
Identities = 20/60 (33%), Positives = 29/60 (48%)
Frame = +2
Query: 619 KDGQVVGDDDISLDLLPQFDDESEPEEDGEDGNEQHRNEDQEKEDPQAGTSQGNNANNEN 678
KD V+ DD +L L + +S +ED E G+E+ + E EDP + A EN
Sbjct: 98 KDSPVISDDKSNL-LEDDAEGQSGKDEDDESGSEEDSCSESEVEDPADIKAARKEARKEN 274
>BE822052 similar to GP|4557063|gb|A expressed protein {Arabidopsis
thaliana}, partial (28%)
Length = 668
Score = 33.1 bits (74), Expect = 0.41
Identities = 14/53 (26%), Positives = 27/53 (50%)
Frame = -1
Query: 625 GDDDISLDLLPQFDDESEPEEDGEDGNEQHRNEDQEKEDPQAGTSQGNNANNE 677
GD+D S + + D E +PE +G G+ + D++ +D G G + + +
Sbjct: 458 GDEDFSGEGEEEADPEDDPEANGAGGSNDDDDSDEDDDDDNDGDDDGEDEDEK 300
Score = 30.4 bits (67), Expect = 2.7
Identities = 16/52 (30%), Positives = 27/52 (51%), Gaps = 12/52 (23%)
Frame = -1
Query: 639 DESEPEEDGEDGNEQHRNEDQE------------KEDPQAGTSQGNNANNEN 678
DE E EED D N+Q+ E E ++DP+A + G+N ++++
Sbjct: 515 DEDEDEEDDHDTNDQNDEEGDEDFSGEGEEEADPEDDPEANGAGGSNDDDDS 360
Score = 30.4 bits (67), Expect = 2.7
Identities = 17/46 (36%), Positives = 28/46 (59%), Gaps = 1/46 (2%)
Frame = -1
Query: 626 DDDISLDLLPQFDDESEPEEDGEDGNEQH-RNEDQEKEDPQAGTSQ 670
DDD D DD+++ ++DGED +E+ +E+ E+E PQ T +
Sbjct: 371 DDDSDED----DDDDNDGDDDGEDEDEKEGEDEEDEEEVPQPPTKK 246
>TC216756 similar to PIR|T05768|T05768 subtilisin-like proteinase -
Arabidopsis thaliana {Arabidopsis thaliana;} , partial
(13%)
Length = 764
Score = 33.1 bits (74), Expect = 0.41
Identities = 30/99 (30%), Positives = 41/99 (41%), Gaps = 1/99 (1%)
Frame = -3
Query: 337 AAAKKRAAETEQKKKNEGTSGSDNVRDPKRQKTSSAAGGRPLHQSTLDPRSHPAEKKKGH 396
AA ++++ T + +N GS V RQ S R H ST R+ P
Sbjct: 543 AA*RRKSQATRRIDQNGHDRGSITVLFDSRQWGSEP---RASHPSTSATRTPPRTDPSSG 373
Query: 397 DNVPPPQQDSSALINRPP-TPFNQAGPSLAIGGEAPPPL 434
PPP S++ + PP T A PS + APP L
Sbjct: 372 SGYPPPPSPHSSVSSPPPRTRAATASPSRSPHLRAPPRL 256
>TC225336 similar to UP|ANX4_FRAAN (P51074) Annexin-like protein RJ4, partial
(55%)
Length = 831
Score = 32.7 bits (73), Expect = 0.54
Identities = 20/64 (31%), Positives = 28/64 (43%), Gaps = 3/64 (4%)
Frame = -3
Query: 111 VLPCGEDDCVLLRKEPADESPDFFFVYGYFFLD---LNIKLPFSPFICHVLSFLNVAPCQ 167
V+ + C L P E PD FF+ F LD NI F F +++ + + C
Sbjct: 208 VISVAVESCQKLCLAPCGEDPDGFFMVALFLLDRFMKNISFSFGQFCIDLITIIPI--CX 35
Query: 168 LQPN 171
QPN
Sbjct: 34 HQPN 23
>TC210126 similar to UP|NUCL_HUMAN (P19338) Nucleolin (Protein C23), partial
(7%)
Length = 735
Score = 32.7 bits (73), Expect = 0.54
Identities = 13/33 (39%), Positives = 24/33 (72%)
Frame = +3
Query: 638 DDESEPEEDGEDGNEQHRNEDQEKEDPQAGTSQ 670
DD+++ EED +DG ++ ED+E+++ + TSQ
Sbjct: 492 DDDNDNEEDDDDGEDEEEEEDEEEDEEE--TSQ 584
Score = 29.6 bits (65), Expect = 4.6
Identities = 11/35 (31%), Positives = 22/35 (62%)
Frame = +3
Query: 638 DDESEPEEDGEDGNEQHRNEDQEKEDPQAGTSQGN 672
DD+ + +ED EDG++Q + + E+ + G +G+
Sbjct: 342 DDDKDGDEDDEDGHDQDDDGEDEEFSGEEGDEEGD 446
>TC230134 weakly similar to UP|NUCL_HUMAN (P19338) Nucleolin (Protein C23),
partial (8%)
Length = 581
Score = 32.3 bits (72), Expect = 0.71
Identities = 16/49 (32%), Positives = 26/49 (52%)
Frame = +2
Query: 617 EIKDGQVVGDDDISLDLLPQFDDESEPEEDGEDGNEQHRNEDQEKEDPQ 665
E G DDD D DD+ E+DG++ ++ +ED+++E PQ
Sbjct: 401 EANGGGESDDDDEDDD----DDDDDNDEDDGDEDDDDEEDEDEDEETPQ 535
Score = 28.9 bits (63), Expect = 7.8
Identities = 15/49 (30%), Positives = 26/49 (52%), Gaps = 4/49 (8%)
Frame = +2
Query: 626 DDDISLDLLPQFDDESEPEEDGEDGNEQ----HRNEDQEKEDPQAGTSQ 670
DDD + + DD+ E ++D +D N++ ++D+E ED T Q
Sbjct: 389 DDDPEANGGGESDDDDEDDDDDDDDNDEDDGDEDDDDEEDEDEDEETPQ 535
>TC225337 similar to UP|O65848 (O65848) Annexin, complete
Length = 1222
Score = 32.0 bits (71), Expect = 0.92
Identities = 20/64 (31%), Positives = 28/64 (43%), Gaps = 3/64 (4%)
Frame = -2
Query: 111 VLPCGEDDCVLLRKEPADESPDFFFVYGYFFLD---LNIKLPFSPFICHVLSFLNVAPCQ 167
V+ + C L P + PD FF+ F LD NI F F ++S + + C
Sbjct: 708 VISVAVESCQKLGLAPCSKDPDGFFMVALFLLDRFMKNISFSFGQFCIDLISTIPI--CG 535
Query: 168 LQPN 171
QPN
Sbjct: 534 HQPN 523
>TC212457 UP|Q39871 (Q39871) Maturation polypeptide, complete
Length = 1741
Score = 32.0 bits (71), Expect = 0.92
Identities = 22/75 (29%), Positives = 37/75 (49%), Gaps = 5/75 (6%)
Frame = +1
Query: 303 ASKSGTLNQFFTTMGKSKVDANPIK-----MKEYLAQSAAAAKKRAAETEQKKKNEGTSG 357
A+K+G + + T K+ A K +K+ A++A AAK + AET + KN+
Sbjct: 958 ANKAGEMKE--ATKKKTAETAEAAKNKAGEIKDRAAETAEAAKNKTAETAEVTKNKALEM 1131
Query: 358 SDNVRDPKRQKTSSA 372
D +D + T +A
Sbjct: 1132 KDAAKDRTAETTDAA 1176
>TC232681 weakly similar to GB|AAP21377.1|30102918|BT006569 At1g47970
{Arabidopsis thaliana;} , partial (39%)
Length = 867
Score = 31.6 bits (70), Expect = 1.2
Identities = 14/42 (33%), Positives = 26/42 (61%), Gaps = 1/42 (2%)
Frame = +1
Query: 638 DDESEPEEDGEDGNEQHRNEDQEKE-DPQAGTSQGNNANNEN 678
D+E + E+D DG + +ED+E+E D Q G ++ N+++
Sbjct: 232 DEEDDDEDDAPDGGDDDDDEDEEEESDVQRGGEPDDDDNDDD 357
Score = 29.6 bits (65), Expect = 4.6
Identities = 14/41 (34%), Positives = 22/41 (53%)
Frame = +1
Query: 625 GDDDISLDLLPQFDDESEPEEDGEDGNEQHRNEDQEKEDPQ 665
GDDD D + D + E D +D ++ +ED+E E+ Q
Sbjct: 271 GDDDDDEDEEEESDVQRGGEPDDDDNDDDDEDEDEEDEEEQ 393
>TC209358 similar to UP|P92987 (P92987) Myosin heavy chain-like protein,
partial (17%)
Length = 740
Score = 31.6 bits (70), Expect = 1.2
Identities = 14/45 (31%), Positives = 27/45 (59%)
Frame = +1
Query: 503 KAADYKTAYERAKTDAETANKKLKSAEEKCAKLTEDLAASDLLLQ 547
K AD + A E+ + +A T+NKK+ +++ + D+ + LLL+
Sbjct: 190 KLADKQAALEKIQWEAMTSNKKVDKLQDELGSMQADITSFTLLLE 324
>TC206039
Length = 834
Score = 31.6 bits (70), Expect = 1.2
Identities = 23/93 (24%), Positives = 38/93 (40%), Gaps = 3/93 (3%)
Frame = -2
Query: 327 KMKEYLAQSAAAAKKRAAETEQKKKNEGTSGSDNVRDPKRQKTSSAAGGRPLHQSTLDPR 386
K K+ + KK+ + ++KKK + + K++K GG P + R
Sbjct: 269 KKKKKKKKKKKKKKKKKKKKKKKKKKKKKIKKKKKKKKKKKKXXXGGGGEPKXXKKKEKR 90
Query: 387 SHP---AEKKKGHDNVPPPQQDSSALINRPPTP 416
P +KK+G N P Q + +PP P
Sbjct: 89 GRPRGGXKKKRGRKN-PRGQPKKTP---KPPPP 3
Score = 31.6 bits (70), Expect = 1.2
Identities = 21/85 (24%), Positives = 33/85 (38%)
Frame = -2
Query: 318 KSKVDANPIKMKEYLAQSAAAAKKRAAETEQKKKNEGTSGSDNVRDPKRQKTSSAAGGRP 377
K K K K+ + KK+ + ++KKK G + K++K GG
Sbjct: 245 KKKKKKKKKKKKKKKKKKKKKIKKKKKKKKKKKKXXXGGGGEPKXXKKKEKRGRPRGGXK 66
Query: 378 LHQSTLDPRSHPAEKKKGHDNVPPP 402
+ +PR P + K PPP
Sbjct: 65 KKRGRKNPRGQPKKTPK-----PPP 6
>AI959820 similar to GP|14335118|gb| At1g30200/F12P21_1 {Arabidopsis
thaliana}, partial (19%)
Length = 342
Score = 31.6 bits (70), Expect = 1.2
Identities = 26/98 (26%), Positives = 37/98 (37%)
Frame = +2
Query: 336 AAAAKKRAAETEQKKKNEGTSGSDNVRDPKRQKTSSAAGGRPLHQSTLDPRSHPAEKKKG 395
AAAA RAA T ++++ +S + P T++ P S P +
Sbjct: 23 AAAASSRAASTPSSRRSKTSSSASTASSP----TTTPLPPPPPPTSPAAPSGTSSASSSA 190
Query: 396 HDNVPPPQQDSSALINRPPTPFNQAGPSLAIGGEAPPP 433
P SS+ N PP P PS + PPP
Sbjct: 191 ES*NPSKPSVSSSAPNAPPPP-----PSRPLPPPPPPP 289
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.312 0.131 0.378
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 30,512,461
Number of Sequences: 63676
Number of extensions: 519669
Number of successful extensions: 3147
Number of sequences better than 10.0: 101
Number of HSP's better than 10.0 without gapping: 2880
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 3070
length of query: 680
length of database: 12,639,632
effective HSP length: 104
effective length of query: 576
effective length of database: 6,017,328
effective search space: 3465980928
effective search space used: 3465980928
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 62 (28.5 bits)
Lotus: description of TM0550.6