
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0219.2
(365 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E Value
TC77681 similar to GP|22597156|gb|AAN03465.1 nucleolar histone d... 35 0.060
BQ151362 weakly similar to GP|13561980|gb| flagelliform silk pro... 34 0.078
TC87237 similar to GP|160409|gb|AAA29651.1|| mature-parasite-inf... 33 0.17
TC77763 similar to GP|15215674|gb|AAK91382.1 AT4g27500/F27G19_10... 32 0.30
TC80893 similar to GP|4557063|gb|AAD22502.1| expressed protein {... 32 0.51
TC89427 weakly similar to GP|4557063|gb|AAD22502.1| expressed pr... 32 0.51
TC90297 similar to GP|3204101|emb|CAA07227.1 hypothetical protei... 31 0.66
TC77337 weakly similar to GP|4019275|gb|AAC95573.1| orf 48 {Atel... 31 0.66
BF641220 weakly similar to PIR|G86203|G86 probable N-arginine di... 31 0.66
CB066689 homologue to GP|13646986|dbj DNA-binding protein DF1 {P... 31 0.86
BI311490 weakly similar to GP|21751020|dbj unnamed protein produ... 31 0.86
TC86146 homologue to GP|10334499|emb|CAC10211. hypothetical prot... 30 1.1
TC86145 homologue to GP|10334499|emb|CAC10211. hypothetical prot... 30 1.1
BG648593 weakly similar to GP|9294451|dbj A37 protein; ethylene-... 30 1.1
TC91658 similar to GP|14329812|emb|CAC40753. putative nucleosome... 30 1.5
CA859251 similar to GP|21305823|gb DNA polymerase I {Hz-1 insect... 30 1.5
TC89984 similar to GP|16604601|gb|AAL24093.1 unknown protein {Ar... 30 1.5
TC77395 similar to GP|4874305|gb|AAD31367.1| expressed protein {... 30 1.9
TC80288 weakly similar to PIR|T06029|T06029 hypothetical protein... 30 1.9
TC87387 homologue to PIR|T47775|T47775 hypothetical protein F24I... 30 1.9
>TC77681 similar to GP|22597156|gb|AAN03465.1 nucleolar histone deacetylase
HD2-P39 {Glycine max}, partial (47%)
Length = 1233
Score = 34.7 bits (78), Expect = 0.060
Identities = 20/53 (37%), Positives = 26/53 (48%)
Frame = +2
Query: 311 DDDISLDLLPQFDDESEPEEDGEDGNEQHKNEDHEKEDPQAGTSQGNNANNEN 363
DDD S D + DDE E D + +E +ED E+E P QG NE+
Sbjct: 584 DDDESDDEIGSSDDEME-NADSDSEDEDDSDEDDEEETPVKKVDQGKKRPNES 739
>BQ151362 weakly similar to GP|13561980|gb| flagelliform silk protein
{Argiope trifasciata}, partial (3%)
Length = 1136
Score = 34.3 bits (77), Expect = 0.078
Identities = 41/151 (27%), Positives = 55/151 (36%), Gaps = 21/151 (13%)
Frame = +1
Query: 24 AKKRAAETEQKKKNEGTSGSDNVRDPKRQKTSSAAGVKPLHQSTLDPKGRPTEKKKGHDN 83
A R T Q KK G ++ Q+ SAAGV +S G K+ HD
Sbjct: 163 AGSRPKNTAQAKKTTGEREDPQPKEKTYQR--SAAGVATGRESA---GGTTNNSKRQHD- 324
Query: 84 VPPHQPDSGALINRPSTPFIQA---------GPSSAIGGE------------ALPPLLNL 122
PP+ P++ +N P P A GP++ G A PPLL
Sbjct: 325 -PPY-PNTMPCVNHPPPPTTPAKAPPAPRDSGPAAPTNGGGAKAPLRCRVACAFPPLLCC 498
Query: 123 SDPRFNGLDFMNRTFDNRIHKDVSGQGPPNI 153
+D D R + + Q PPNI
Sbjct: 499 NDKNHLK*DVDRRRSTRKKTQK*KNQAPPNI 591
>TC87237 similar to GP|160409|gb|AAA29651.1|| mature-parasite-infected
erythrocyte surface antigen {Plasmodium falciparum},
partial (2%)
Length = 2007
Score = 33.1 bits (74), Expect = 0.17
Identities = 19/65 (29%), Positives = 33/65 (50%), Gaps = 3/65 (4%)
Frame = +1
Query: 301 KEIKDGQVV---GDDDISLDLLPQFDDESEPEEDGEDGNEQHKNEDHEKEDPQAGTSQGN 357
+EIKDG+ + +++ + Q ++E++ EE + NE KNE EKE + +
Sbjct: 172 EEIKDGEKIQQENEENKDEEKSQQENEENKDEEKSQQENELKKNEGGEKETGEITEEKSK 351
Query: 358 NANNE 362
N E
Sbjct: 352 QENEE 366
>TC77763 similar to GP|15215674|gb|AAK91382.1 AT4g27500/F27G19_100
{Arabidopsis thaliana}, partial (38%)
Length = 2297
Score = 32.3 bits (72), Expect = 0.30
Identities = 23/99 (23%), Positives = 43/99 (43%), Gaps = 3/99 (3%)
Frame = +1
Query: 194 TAYEQAK---ADAETANKNLKAAEERCAKLTDDLAASDLLLQKTKSLKETINDKHTAVQA 250
T +Q K D + K +A + ++ D L A D+ +Q ++ + K
Sbjct: 922 TIQDQVKLIGGDLDGVKKERQAIRSKIKQIDDVLKAIDIDIQSLQAELVAVTQKREQAFE 1101
Query: 251 KCQKLEKKYERLNASILGRASLLFAQGFLAAKEQISVVE 289
QKL K+ + N+ +LL LAAK+ ++ ++
Sbjct: 1102 SIQKLRKQRDEGNSYFYQSRTLLTKARELAAKKDVAAID 1218
>TC80893 similar to GP|4557063|gb|AAD22502.1| expressed protein {Arabidopsis
thaliana}, partial (39%)
Length = 883
Score = 31.6 bits (70), Expect = 0.51
Identities = 12/26 (46%), Positives = 17/26 (65%)
Frame = +3
Query: 323 DDESEPEEDGEDGNEQHKNEDHEKED 348
DD+ EED EDG ++ ED E++D
Sbjct: 480 DDDDGDEEDEEDGEDEEDEEDEEEDD 557
Score = 30.4 bits (67), Expect = 1.1
Identities = 16/53 (30%), Positives = 28/53 (52%)
Frame = +3
Query: 311 DDDISLDLLPQFDDESEPEEDGEDGNEQHKNEDHEKEDPQAGTSQGNNANNEN 363
DDD D+ + DD E + G++G E+ ED DP+A + G++ ++
Sbjct: 336 DDDDDDDVQDEDDDGEEEDYSGDEGEEEGDPED----DPEANGAGGSDDGEDD 482
Score = 29.3 bits (64), Expect = 2.5
Identities = 17/63 (26%), Positives = 31/63 (48%), Gaps = 1/63 (1%)
Frame = +3
Query: 301 KEIKDGQVVGDDDISLDLLPQFDDESEPEE-DGEDGNEQHKNEDHEKEDPQAGTSQGNNA 359
KE K +DD D + DD+ E E+ G++G E+ ED + + G+ G +
Sbjct: 303 KENKSDTEDDEDDDDDDDVQDEDDDGEEEDYSGDEGEEEGDPEDDPEANGAGGSDDGEDD 482
Query: 360 NNE 362
+++
Sbjct: 483 DDD 491
Score = 28.9 bits (63), Expect = 3.3
Identities = 13/28 (46%), Positives = 18/28 (63%)
Frame = +3
Query: 323 DDESEPEEDGEDGNEQHKNEDHEKEDPQ 350
D + E EEDGED E ++E+ + E PQ
Sbjct: 489 DGDEEDEEDGED-EEDEEDEEEDDETPQ 569
>TC89427 weakly similar to GP|4557063|gb|AAD22502.1| expressed protein
{Arabidopsis thaliana}, partial (50%)
Length = 753
Score = 31.6 bits (70), Expect = 0.51
Identities = 28/97 (28%), Positives = 43/97 (43%), Gaps = 6/97 (6%)
Frame = +1
Query: 260 ERLNASILGRASLLFAQGFLAA--KEQISVVEPGFDLSRIG----WLKEIKDGQVVGDDD 313
+ L A+ A LL A G L IS ++ F L ++ +E KDG DDD
Sbjct: 142 QSLIATAKSLAYLLIATGSLLTDVNHLISPMDGRFPLDQLSKGNHTCEENKDGSETEDDD 321
Query: 314 ISLDLLPQFDDESEPEEDGEDGNEQHKNEDHEKEDPQ 350
+ DD+ +ED ++ + +ED E DP+
Sbjct: 322 ------DEDDDDDVNDEDDDNDEDFSGDEDDEDADPE 414
Score = 30.4 bits (67), Expect = 1.1
Identities = 15/47 (31%), Positives = 24/47 (50%)
Frame = +1
Query: 303 IKDGQVVGDDDISLDLLPQFDDESEPEEDGEDGNEQHKNEDHEKEDP 349
+ +G DDD D DD+ + +DGED +E + +D E + P
Sbjct: 424 VPNGAGGSDDDDEDD-----DDDDDDNDDGEDEDEDEEEDDDEDQPP 549
>TC90297 similar to GP|3204101|emb|CAA07227.1 hypothetical protein {Cicer
arietinum}, partial (56%)
Length = 853
Score = 31.2 bits (69), Expect = 0.66
Identities = 21/81 (25%), Positives = 33/81 (39%), Gaps = 3/81 (3%)
Frame = +3
Query: 6 VDANPIKMKEYLAQSAAAAKKRAAETEQKKKNEGTSGSDNVRDPKRQKTSS---AAGVKP 62
+D +P K+Y+ A + G +DN R P RQ T+ + P
Sbjct: 387 LDGDP---KQYVDSPARHDNASNRSSNDSTPRLGVGSADNRRRPSRQSTAGSEHSVERSP 557
Query: 63 LHQSTLDPKGRPTEKKKGHDN 83
LH+ P GR + +G +N
Sbjct: 558 LHRQARAPAGRDSPSWEGKNN 620
>TC77337 weakly similar to GP|4019275|gb|AAC95573.1| orf 48 {Ateline
herpesvirus 3}, partial (8%)
Length = 986
Score = 31.2 bits (69), Expect = 0.66
Identities = 14/38 (36%), Positives = 23/38 (59%)
Frame = +3
Query: 311 DDDISLDLLPQFDDESEPEEDGEDGNEQHKNEDHEKED 348
+DD D + DDE E ++D E+G E+ + E ++ED
Sbjct: 600 EDDEDGDDQDEDDDEDEDDDDEEEGGEEDEEEGVDEED 713
Score = 31.2 bits (69), Expect = 0.66
Identities = 11/25 (44%), Positives = 20/25 (80%)
Frame = +3
Query: 324 DESEPEEDGEDGNEQHKNEDHEKED 348
DE+ EED EDG++Q +++D +++D
Sbjct: 582 DENGEEEDDEDGDDQDEDDDEDEDD 656
Score = 29.3 bits (64), Expect = 2.5
Identities = 15/43 (34%), Positives = 25/43 (57%), Gaps = 2/43 (4%)
Frame = +3
Query: 310 GDDDISLDLLPQFDDESEP--EEDGEDGNEQHKNEDHEKEDPQ 350
GDD D + DD+ E EED E+G ++ NE+ E+++ +
Sbjct: 615 GDDQDEDDDEDEDDDDEEEGGEEDEEEGVDEEDNEEEEEDEDE 743
Score = 28.5 bits (62), Expect = 4.3
Identities = 15/45 (33%), Positives = 25/45 (55%), Gaps = 4/45 (8%)
Frame = +3
Query: 323 DDESEPEEDGE----DGNEQHKNEDHEKEDPQAGTSQGNNANNEN 363
DDE E ++DG+ +G ++ +ED G GNN+NN++
Sbjct: 432 DDEDEEDDDGDGAFGEGEDELSSED--------GGGYGNNSNNKS 542
>BF641220 weakly similar to PIR|G86203|G86 probable N-arginine dibasic
convertase [imported] - Arabidopsis thaliana, partial
(5%)
Length = 634
Score = 31.2 bits (69), Expect = 0.66
Identities = 40/150 (26%), Positives = 61/150 (40%), Gaps = 17/150 (11%)
Frame = +2
Query: 231 LQKTKSLKETIN---DKHTAVQAKCQKLEKKYERLNASILGRASLLFAQGFLAAKEQISV 287
+QKTK K I KH+ + +K+ L + + A L++ + + V
Sbjct: 65 VQKTKPNKTLITIVAPKHSLSPFRFST-DKESMGLKGAPAAATTAATAAVALSSSDDVIV 241
Query: 288 VEPGFD-LSRIGWLKEIKDGQVVGDDDISLDLLPQF-----DDESEPEEDGEDGNEQ--- 338
P + L R+ LK +V D +I + P+ DDE E +ED ED E
Sbjct: 242 KSPNDNRLYRLVHLKNGLQALIVHDPEIYPEGAPKDGSIDEDDEEEDDEDEEDDEEDDDE 421
Query: 339 -HKNEDHEKEDPQ----AGTSQGNNANNEN 363
+ED E ED G G A N++
Sbjct: 422 GEDDEDEEXEDEDEXXVXGREGGKGAANQS 511
>CB066689 homologue to GP|13646986|dbj DNA-binding protein DF1 {Pisum
sativum}, partial (8%)
Length = 508
Score = 30.8 bits (68), Expect = 0.86
Identities = 17/56 (30%), Positives = 28/56 (49%)
Frame = +3
Query: 307 QVVGDDDISLDLLPQFDDESEPEEDGEDGNEQHKNEDHEKEDPQAGTSQGNNANNE 362
QVV ++++ L+ Q + + P + GED EQ N E+ED G ++E
Sbjct: 171 QVVQPENMAAPLMVQPEQQWRPPQQGEDNMEQ--NRGQEEEDMDEDDKDGEEEDDE 332
>BI311490 weakly similar to GP|21751020|dbj unnamed protein product {Homo
sapiens}, partial (22%)
Length = 387
Score = 30.8 bits (68), Expect = 0.86
Identities = 18/47 (38%), Positives = 27/47 (57%)
Frame = -3
Query: 301 KEIKDGQVVGDDDISLDLLPQFDDESEPEEDGEDGNEQHKNEDHEKE 347
K+ KD + G+DD D +F +ES+PE + E+ E+ K E E E
Sbjct: 343 KQEKDA-IDGEDDGEEDNNDEFFNESQPEFEDENEKEESKEEIDESE 206
>TC86146 homologue to GP|10334499|emb|CAC10211. hypothetical protein {Cicer
arietinum}, partial (95%)
Length = 1084
Score = 30.4 bits (67), Expect = 1.1
Identities = 18/52 (34%), Positives = 25/52 (47%)
Frame = +3
Query: 310 GDDDISLDLLPQFDDESEPEEDGEDGNEQHKNEDHEKEDPQAGTSQGNNANN 361
GDD L DDE + E+D D E +++ED + D S G+ NN
Sbjct: 264 GDDFDDLHDGTDVDDEDDDEDD--DNEEDYEDEDEDAFDVHDHASVGDRENN 413
>TC86145 homologue to GP|10334499|emb|CAC10211. hypothetical protein {Cicer
arietinum}, partial (91%)
Length = 1311
Score = 30.4 bits (67), Expect = 1.1
Identities = 18/52 (34%), Positives = 25/52 (47%)
Frame = +1
Query: 310 GDDDISLDLLPQFDDESEPEEDGEDGNEQHKNEDHEKEDPQAGTSQGNNANN 361
GDD L DDE + E+D D E +++ED + D S G+ NN
Sbjct: 445 GDDFDDLHDGTDVDDEDDDEDD--DNEEDYEDEDEDAFDVHDHASVGDRENN 594
>BG648593 weakly similar to GP|9294451|dbj A37 protein; ethylene-inducible
protein-like {Arabidopsis thaliana}, partial (75%)
Length = 724
Score = 30.4 bits (67), Expect = 1.1
Identities = 16/40 (40%), Positives = 19/40 (47%), Gaps = 1/40 (2%)
Frame = +2
Query: 62 PLHQSTLDPKGRPTEKKKG-HDNVPPHQPDSGALINRPST 100
P QST P+ PT K H N P H+P A +R T
Sbjct: 5 PPSQSTTPPQSPPTHSKPHTHSNYPSHKPSVAAQFSRSPT 124
>TC91658 similar to GP|14329812|emb|CAC40753. putative nucleosome assembly
protein 1 {Atropa belladonna}, partial (46%)
Length = 583
Score = 30.0 bits (66), Expect = 1.5
Identities = 18/52 (34%), Positives = 27/52 (51%), Gaps = 3/52 (5%)
Frame = +2
Query: 312 DDISLDLLPQFDDESEPEE---DGEDGNEQHKNEDHEKEDPQAGTSQGNNAN 360
DDI +D DDE EE D +D ++ ++ED E+ED G S+ +
Sbjct: 71 DDIEVDE----DDEDGDEEDDDDDDDDDDDEEDEDDEEEDEGKGKSKSKRGS 214
>CA859251 similar to GP|21305823|gb DNA polymerase I {Hz-1 insect virus}
[Heliothis zea virus 1], partial (2%)
Length = 803
Score = 30.0 bits (66), Expect = 1.5
Identities = 20/47 (42%), Positives = 26/47 (54%)
Frame = -1
Query: 302 EIKDGQVVGDDDISLDLLPQFDDESEPEEDGEDGNEQHKNEDHEKED 348
E D ++V + D + DDE E +ED ED NE NED E+ED
Sbjct: 701 ECVDVELVDNSDKVRYKMDDDDDEEEEDED-EDENE---NEDEEEED 573
>TC89984 similar to GP|16604601|gb|AAL24093.1 unknown protein {Arabidopsis
thaliana}, partial (32%)
Length = 1303
Score = 30.0 bits (66), Expect = 1.5
Identities = 40/179 (22%), Positives = 77/179 (42%), Gaps = 28/179 (15%)
Frame = +3
Query: 162 LSAASIVAGMAQCVKELIATKNRYEKKAADYKTAYEQAKADAETANKNLKAAEERCAKLT 221
L+ SIV +++L A + + E + Y ++KAD + K +K+ +L
Sbjct: 603 LTKESIVQQNDVLLQDLDAAREQLEILSKQYGELEAKSKADIKVLVKEVKSLRSSQTELK 782
Query: 222 DDLAASDLLLQKTKSLKETINDKHTAVQA---------KCQKLEKK---------YERLN 263
+L S+ + +K ++ K ++++ + QA KC L K+ YE +
Sbjct: 783 KEL--SESIKEKYEAEKLLLHEREKSEQAEIAWRKQLEKCGLLLKQLQECSVELPYEDED 956
Query: 264 ASILGRASLLFAQGFL-AAKEQISVV---------EPGFDLSRIGWLKEIKDGQVVGDD 312
+ L +S A L + +QI ++ + G S + +IKDG + GD+
Sbjct: 957 RTFLQSSSSTDAFNKLKTSDDQIDILLAEVENLEKDAGSAASNVDKTNDIKDGVICGDE 1133
>TC77395 similar to GP|4874305|gb|AAD31367.1| expressed protein {Arabidopsis
thaliana}, partial (65%)
Length = 1649
Score = 29.6 bits (65), Expect = 1.9
Identities = 14/41 (34%), Positives = 21/41 (51%)
Frame = +2
Query: 311 DDDISLDLLPQFDDESEPEEDGEDGNEQHKNEDHEKEDPQA 351
D S D F+D E+ +DG+E+ + +DHE E A
Sbjct: 566 DPQPSEDKPEHFEDTESEEDILDDGDEEEEEQDHEPEPEHA 688
>TC80288 weakly similar to PIR|T06029|T06029 hypothetical protein T28I19.100
- Arabidopsis thaliana, partial (14%)
Length = 1460
Score = 29.6 bits (65), Expect = 1.9
Identities = 14/40 (35%), Positives = 24/40 (60%)
Frame = +3
Query: 323 DDESEPEEDGEDGNEQHKNEDHEKEDPQAGTSQGNNANNE 362
D+E + EE+G+D + E+ +KED + G N+ N+E
Sbjct: 699 DEEKDKEEEGDD-----ETENEDKEDEEKGGLVENHENHE 803
>TC87387 homologue to PIR|T47775|T47775 hypothetical protein F24I3.230 -
Arabidopsis thaliana, partial (14%)
Length = 1074
Score = 29.6 bits (65), Expect = 1.9
Identities = 24/88 (27%), Positives = 39/88 (44%), Gaps = 6/88 (6%)
Frame = +3
Query: 12 KMKEYLAQSAAAAKKRAAETEQKKKN--EGTSGSDNV----RDPKRQKTSSAAGVKPLHQ 65
K K+ AAA+ + E E+KKK+ +G GS V + K+ K +S G + +
Sbjct: 402 KKKKDKENGAAASDEEKVEKEKKKKHKEKGEDGSPEVEKSDKKKKKHKETSEVGSPEVDK 581
Query: 66 STLDPKGRPTEKKKGHDNVPPHQPDSGA 93
S K + E K ++ +S A
Sbjct: 582 SEKKKKKKDKEAKDNAADISNGNDESNA 665
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.308 0.127 0.350
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 8,715,531
Number of Sequences: 36976
Number of extensions: 108602
Number of successful extensions: 726
Number of sequences better than 10.0: 66
Number of HSP's better than 10.0 without gapping: 640
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 699
length of query: 365
length of database: 9,014,727
effective HSP length: 97
effective length of query: 268
effective length of database: 5,428,055
effective search space: 1454718740
effective search space used: 1454718740
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 59 (27.3 bits)
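The figures in the statistics block above can be cross-checked with a few lines of arithmetic. The following is a minimal sketch, not part of the BLAST output: the function names are illustrative, the formulas are the usual Karlin-Altschul conversions, and the choice of which Lambda/K pair belongs to which line (gapped for Score, X2, X3 and S2; ungapped for X1 and S1) is an assumption inferred from the numbers agreeing with the report. It also assumes that the "length of database" line is the nucleotide letter count divided by 3.

```python
import math

# Values copied from the report's statistics block.
QUERY_LEN = 365              # "length of query"
DB_LETTERS = 27_044_181      # "Number of letters in database" (nucleotides)
DB_SEQS = 36_976             # "Number of sequences in database"
HSP_LEN = 97                 # "effective HSP length"
UNGAPPED = (0.308, 0.127)    # first Lambda, K pair
GAPPED = (0.267, 0.0410)     # Lambda, K pair listed under "Gapped"


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (db_len, eff_query, eff_db, search_space).

    Assumption: the translated database length is db_letters // 3.
    """
    db_len = db_letters // 3
    eff_query = query_len - hsp_len
    eff_db = db_len - db_seqs * hsp_len
    return db_len, eff_query, eff_db, eff_query * eff_db


def bit_score(raw, lam, k):
    """Convert a raw score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw - math.log(k)) / math.log(2)


def dropoff_bits(raw, lam):
    """Convert a raw X-dropoff value to bits: lambda * X / ln 2."""
    return lam * raw / math.log(2)


if __name__ == "__main__":
    print(effective_search_space(QUERY_LEN, DB_LETTERS, DB_SEQS, HSP_LEN))
    # Raw score 78 is the top hit (TC77681), reported as 34.7 bits.
    print(round(bit_score(78, *GAPPED), 1))
    # X1 and X2 as listed in the parameter summary.
    print(round(dropoff_bits(16, UNGAPPED[0]), 1))
    print(round(dropoff_bits(38, GAPPED[0]), 1))
```

Under these assumptions the sketch recovers the reported database length, effective lengths, effective search space, and the bit values printed beside the raw scores. It does not attempt to recompute the Expect values.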
Lotus: description of TM0219.2