
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0152.2
(603 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
BF644046 50 3e-06
TC88472 similar to GP|19423958|gb|AAL87269.1 unknown protein {Ar... 36 0.048
BF641220 weakly similar to PIR|G86203|G86 probable N-arginine di... 35 0.11
AL388248 weakly similar to GP|6062758|gb|A NADH dehydrogenase su... 35 0.11
TC80288 weakly similar to PIR|T06029|T06029 hypothetical protein... 34 0.14
AW256780 similar to PIR|T12641|T126 NADH dehydrogenase (ubiquino... 33 0.31
TC82286 weakly similar to PIR|B96544|B96544 hypothetical protein... 33 0.41
TC92259 weakly similar to GP|10177335|dbj|BAB10684. nuclear matr... 32 0.70
AW690594 similar to GP|23498163|emb hypothetical protein {Plasmo... 32 0.70
TC87688 similar to GP|10177535|dbj|BAB10930. gene_id:K1F13.21~un... 32 0.70
TC81816 similar to GP|8096269|dbj|BAA95789.1 KED {Nicotiana taba... 32 0.70
TC79552 similar to GP|13877579|gb|AAK43867.1 putative T-complex ... 32 0.70
TC91658 similar to GP|14329812|emb|CAC40753. putative nucleosome... 32 0.70
TC81541 similar to GP|22597168|gb|AAN03471.1 unknown protein {Gl... 32 0.70
TC86145 homologue to GP|10334499|emb|CAC10211. hypothetical prot... 32 0.91
TC86146 homologue to GP|10334499|emb|CAC10211. hypothetical prot... 32 0.91
TC77101 similar to GP|15148920|gb|AAK84887.1 homeodomain leucine... 32 0.91
TC86552 similar to PIR|G85436|G85436 hypothetical protein AT4g36... 31 1.2
TC89336 weakly similar to GP|21554135|gb|AAM63215.1 unknown {Ara... 31 1.2
TC80296 similar to PIR|H86265|H86265 protein F3F19.18 [imported]... 31 1.2
>BF644046
Length = 597
Score = 49.7 bits (117), Expect = 3e-06
Identities = 24/55 (43%), Positives = 33/55 (59%)
Frame = +3
Query: 82 HTYFFENIFTDLKCKLPLSDFTCSVLTLLNVAPTQLHCNSWAYLRAFELLCQVLG 136
H Y F +F D+ K P ++F C L LNVA +QLH N A++ FE+ C+ LG
Sbjct: 9 HMYSF--VFEDIGFKFPFTNFECDFLKALNVASSQLHPNCCAFMCGFEISCESLG 167
>TC88472 similar to GP|19423958|gb|AAL87269.1 unknown protein {Arabidopsis
thaliana}, partial (46%)
Length = 1019
Score = 35.8 bits (81), Expect = 0.048
Identities = 31/114 (27%), Positives = 59/114 (51%), Gaps = 5/114 (4%)
Frame = +2
Query: 437 KQLEEKEREILKMKATMKL-LDSANKVNEKKA----ADLALENERLKKHVEDLNITQKAK 491
K++ + ERE +K + +L LD AN ++ + A + +E RL K DL ++
Sbjct: 296 KRIHKAEREKMKREHLNELFLDLANALDLSEPNNGKASILIEASRLLK---DLLCQIQSL 466
Query: 492 EEELVKSKAEITHLNSSNAELKNENSKLHSEVSELKNSVLDQFEAGFAKAKEQI 545
++E V +E ++ ELK ENS L +++ +L+ + +A A++K +
Sbjct: 467 KKENVSLLSESHYVTMEKNELKEENSSLETQIEKLQGEI----QARIAQSKPDL 616
>BF641220 weakly similar to PIR|G86203|G86 probable N-arginine dibasic
convertase [imported] - Arabidopsis thaliana, partial
(5%)
Length = 634
Score = 34.7 bits (78), Expect = 0.11
Identities = 13/25 (52%), Positives = 21/25 (84%)
Frame = +2
Query: 577 DEEDEGEEDKNEEDENVNDNEGEGE 601
DE+DE E+D++EED+ +D+EGE +
Sbjct: 359 DEDDEEEDDEDEEDDEEDDDEGEDD 433
Score = 30.8 bits (68), Expect = 1.6
Identities = 13/29 (44%), Positives = 18/29 (61%)
Frame = +2
Query: 571 ISPDTGDEEDEGEEDKNEEDENVNDNEGE 599
I D +E+DE EED E+D+ D+E E
Sbjct: 356 IDEDDEEEDDEDEEDDEEDDDEGEDDEDE 442
Score = 29.6 bits (65), Expect = 3.5
Identities = 10/36 (27%), Positives = 23/36 (63%)
Frame = +2
Query: 567 DGKLISPDTGDEEDEGEEDKNEEDENVNDNEGEGEN 602
DG + D +++++ E+D+ ++DE +D + E E+
Sbjct: 347 DGSIDEDDEEEDDEDEEDDEEDDDEGEDDEDEEXED 454
>AL388248 weakly similar to GP|6062758|gb|A NADH dehydrogenase subunit II
{Cynolebias alexandri}, partial (7%)
Length = 417
Score = 34.7 bits (78), Expect = 0.11
Identities = 23/90 (25%), Positives = 43/90 (47%)
Frame = -2
Query: 438 QLEEKEREILKMKATMKLLDSANKVNEKKAADLALENERLKKHVEDLNITQKAKEEELVK 497
+L KERE +K + + K + ++ L NERLK+ + + +A E E
Sbjct: 395 ELLRKERENMKAQIASLQAEMEEKGDSEEVGTLQKHNERLKEKLANWKEKYEASETEREA 216
Query: 498 SKAEITHLNSSNAELKNENSKLHSEVSELK 527
++ E + N+S L + +L ++ EL+
Sbjct: 215 AEGEASAANASVRRLTMKVLELSKQLKELQ 126
>TC80288 weakly similar to PIR|T06029|T06029 hypothetical protein T28I19.100
- Arabidopsis thaliana, partial (14%)
Length = 1460
Score = 34.3 bits (77), Expect = 0.14
Identities = 43/209 (20%), Positives = 87/209 (41%), Gaps = 16/209 (7%)
Frame = +3
Query: 411 TKGLEIAAISKMIDLESADFDGINSAKQLEEKEREILKMKATMKLLDSANKVNEKKAADL 470
+KG ++ + + I L F I K +K++E K + + +++ + DL
Sbjct: 183 SKGFKVKHVLQAILLLGVCFWLIYQVKHNHDKKKEFDKNDTKLPIRTETDQILKLGRKDL 362
Query: 471 ------ALENERLKKHVEDLNIT---QKAKEEELVKSKAEITHLNSSNAELKNENSKLHS 521
A +NE ++ ED +I Q +E + + + E + + + E ++ +
Sbjct: 363 HPGKVEADKNEGHEEEEEDEHIVYNMQNKREHDEQQQEGEEGNKHETEEESEDNVHERRE 542
Query: 522 EVSELKNSVLDQFEAGFAKAKEQILFLNPQVSINL-------AGSDPYARIVDGKLISPD 574
E E +N + + E++ V I+ A +D +VD + +
Sbjct: 543 EQDEEENKHGAEVQEENESKSEEVEDEGGDVEIDENDHEKSEADNDREDEVVDEEKDKEE 722
Query: 575 TGDEEDEGEEDKNEEDENVNDNEGEGENH 603
GD+E E E+ ++EE + +N ENH
Sbjct: 723 EGDDETENEDKEDEEKGGLVENH---ENH 800
>AW256780 similar to PIR|T12641|T126 NADH dehydrogenase (ubiquinone) (EC
1.6.5.3) chain 5 - Brachypodium arbuscula chloroplast
(fragment), partial (7%)
Length = 724
Score = 33.1 bits (74), Expect = 0.31
Identities = 16/49 (32%), Positives = 25/49 (50%)
Frame = -2
Query: 355 WKSLLKEFEELTSEEVTSLWDSKIDFNSLVETNLVFEADREKVKKIGLK 403
WK L++EFE+L + + LW SK + T F+ D ++ LK
Sbjct: 657 WKELIEEFEKLIVKLILGLWKSKSSYERKQHTEGNFDVDTGDIEFSQLK 511
>TC82286 weakly similar to PIR|B96544|B96544 hypothetical protein F4M15.4
[imported] - Arabidopsis thaliana, partial (5%)
Length = 1782
Score = 32.7 bits (73), Expect = 0.41
Identities = 14/29 (48%), Positives = 21/29 (72%)
Frame = +2
Query: 574 DTGDEEDEGEEDKNEEDENVNDNEGEGEN 602
D GDEE E EE++NEE E+ ++ +GE+
Sbjct: 404 DWGDEEKEEEEEENEEKEDEAEHMNDGES 490
>TC92259 weakly similar to GP|10177335|dbj|BAB10684. nuclear matrix
constituent protein 1 (NMCP1)-like {Arabidopsis
thaliana}, partial (7%)
Length = 630
Score = 32.0 bits (71), Expect = 0.70
Identities = 24/91 (26%), Positives = 45/91 (49%)
Frame = +1
Query: 399 KIGLKEACQAIMTKGLEIAAISKMIDLESADFDGINSAKQLEEKEREILKMKATMKLLDS 458
++ LKE + ++ LE+ A + + E A F+ + L+EK+ E+ K ++
Sbjct: 1 EVKLKEEIDLVRSQNLELLAQADKLKAEKAKFEV--EWELLDEKKEELRKEAEFIE---- 162
Query: 459 ANKVNEKKAADLALENERLKKHVEDLNITQK 489
NE+KA ++NER K E N+ ++
Sbjct: 163 ----NERKAVSTFVKNERDKLREEKENLRKQ 243
>AW690594 similar to GP|23498163|emb hypothetical protein {Plasmodium
falciparum 3D7}, partial (10%)
Length = 633
Score = 32.0 bits (71), Expect = 0.70
Identities = 13/26 (50%), Positives = 19/26 (73%)
Frame = +3
Query: 574 DTGDEEDEGEEDKNEEDENVNDNEGE 599
D D+E+E EE++ EE+E +D EGE
Sbjct: 165 DEHDDEEEEEEEEEEEEEEDDDEEGE 242
Score = 31.6 bits (70), Expect = 0.91
Identities = 14/28 (50%), Positives = 20/28 (71%)
Frame = +3
Query: 574 DTGDEEDEGEEDKNEEDENVNDNEGEGE 601
D DEE+E EE++ EEDE+ ++ E E E
Sbjct: 117 DEHDEEEEEEEEEEEEDEHDDEEEEEEE 200
Score = 29.3 bits (64), Expect = 4.5
Identities = 11/25 (44%), Positives = 18/25 (72%)
Frame = +3
Query: 577 DEEDEGEEDKNEEDENVNDNEGEGE 601
+ +DE EE++ EE+E D++ EGE
Sbjct: 168 EHDDEEEEEEEEEEEEEEDDDEEGE 242
>TC87688 similar to GP|10177535|dbj|BAB10930. gene_id:K1F13.21~unknown
protein {Arabidopsis thaliana}, partial (46%)
Length = 2077
Score = 32.0 bits (71), Expect = 0.70
Identities = 42/196 (21%), Positives = 70/196 (35%), Gaps = 7/196 (3%)
Frame = +2
Query: 403 KEACQAIMTKGLEIAAISKMIDLESADFDGI--NSAKQLEEKEREILKMKATMKLLDSAN 460
K ++ G + I IDL+S Q + EI + K LD
Sbjct: 173 KSPLDELLVDGYDAEQIWHQIDLQSQPLLSTLRRRLNQFVKNPEEIAQFKVP---LDVGK 343
Query: 461 KVNEKKAADLALE-----NERLKKHVEDLNITQKAKEEELVKSKAEITHLNSSNAELKNE 515
K+ +KK +L E +E L +D +K K + + + + + + E +
Sbjct: 344 KLEKKKRVELEEEESDDFDEELDDDDDDFEGVEKKKAKGGSEGEDDFEEEDDEDEEGSED 523
Query: 516 NSKLHSEVSELKNSVLDQFEAGFAKAKEQILFLNPQVSINLAGSDPYARIVDGKLISPDT 575
E ++K + E F K E +L + D Y +
Sbjct: 524 EDDEEDEKEKVKGGGI---EDKFLKIDELTEYLEKE-------EDNYEK----------- 640
Query: 576 GDEEDEGEEDKNEEDE 591
G+E DE +ED E+DE
Sbjct: 641 GEERDEADEDSEEDDE 688
>TC81816 similar to GP|8096269|dbj|BAA95789.1 KED {Nicotiana tabacum},
partial (13%)
Length = 663
Score = 32.0 bits (71), Expect = 0.70
Identities = 27/140 (19%), Positives = 60/140 (42%), Gaps = 1/140 (0%)
Frame = +3
Query: 461 KVNEKKAADLALENERLKKHVEDLNITQK-AKEEELVKSKAEITHLNSSNAELKNENSKL 519
K+++K A D+ + ++K +E ++ + K+E+ K K + T ++ + E K
Sbjct: 90 KIDDKSAGDVKEDKVEIEKDLEIKSVEKDDEKKEKKDKEKKDKTDVDEGKDKKDKEKKKK 269
Query: 520 HSEVSELKNSVLDQFEAGFAKAKEQILFLNPQVSINLAGSDPYARIVDGKLISPDTGDEE 579
+ +K D E + K++ + GK G+E+
Sbjct: 270 EKKEENVKGEEEDGDEKKDKEKKKK------------------EKKEKGKEDKDKDGEEK 395
Query: 580 DEGEEDKNEEDENVNDNEGE 599
++ + ++D+N +D+EGE
Sbjct: 396 KSKKDKEKKKDKNEDDDEGE 455
>TC79552 similar to GP|13877579|gb|AAK43867.1 putative T-complex protein 1
theta subunit; TCP-1-Theta {Arabidopsis thaliana},
partial (39%)
Length = 1063
Score = 32.0 bits (71), Expect = 0.70
Identities = 33/142 (23%), Positives = 63/142 (44%)
Frame = +3
Query: 384 VETNLVFEADREKVKKIGLKEACQAIMTKGLEIAAISKMIDLESADFDGINSAKQLEEKE 443
V++ V E +V + +E ++ T L + S + D+E A DG+N+ K +
Sbjct: 102 VDSVSVEEIGGARVTIVKNEEGGNSVATVVLRGSTDSILDDIERAVDDGVNTYKTMCRDS 281
Query: 444 REILKMKATMKLLDSANKVNEKKAADLALENERLKKHVEDLNITQKAKEEELVKSKAEIT 503
R + AT ++ A +V E + L+ + K E + + E + EI
Sbjct: 282 RIVPGAAATE--IELAKRVKEFSFKETGLDQYAIAKFAESFEMIPRTLAENAGLNAMEI- 452
Query: 504 HLNSSNAELKNENSKLHSEVSE 525
++S AE + N+K+ ++ E
Sbjct: 453 -ISSLYAEHASGNTKVGIDLDE 515
>TC91658 similar to GP|14329812|emb|CAC40753. putative nucleosome assembly
protein 1 {Atropa belladonna}, partial (46%)
Length = 583
Score = 32.0 bits (71), Expect = 0.70
Identities = 12/28 (42%), Positives = 20/28 (70%)
Frame = +2
Query: 574 DTGDEEDEGEEDKNEEDENVNDNEGEGE 601
+ GDEED+ ++D +++DE D+E E E
Sbjct: 98 EDGDEEDDDDDDDDDDDEEDEDDEEEDE 181
>TC81541 similar to GP|22597168|gb|AAN03471.1 unknown protein {Glycine max},
partial (36%)
Length = 927
Score = 32.0 bits (71), Expect = 0.70
Identities = 14/51 (27%), Positives = 24/51 (46%)
Frame = +3
Query: 302 ESNRPQKKKRKNETPESAKGKDSSQPSMEKFMVKGNPQHMTLKAGSSSAPP 352
+S++P+ ++ PE+ K KD P+ M+ P M + G PP
Sbjct: 735 DSDKPKDAEKPKPKPEAEKPKDKPAPTAMPMMIPQMPPPMAVPVGMCYVPP 887
>TC86145 homologue to GP|10334499|emb|CAC10211. hypothetical protein {Cicer
arietinum}, partial (91%)
Length = 1311
Score = 31.6 bits (70), Expect = 0.91
Identities = 16/49 (32%), Positives = 27/49 (54%), Gaps = 5/49 (10%)
Frame = +1
Query: 560 DPYARIVDGKLISPDTGDEEDEGEEDKNEEDENVND-----NEGEGENH 603
D + + DG + + DE+D+ EED +EDE+ D + G+ EN+
Sbjct: 448 DDFDDLHDGTDVDDEDDDEDDDNEEDYEDEDEDAFDVHDHASVGDRENN 594
>TC86146 homologue to GP|10334499|emb|CAC10211. hypothetical protein {Cicer
arietinum}, partial (95%)
Length = 1084
Score = 31.6 bits (70), Expect = 0.91
Identities = 16/49 (32%), Positives = 27/49 (54%), Gaps = 5/49 (10%)
Frame = +3
Query: 560 DPYARIVDGKLISPDTGDEEDEGEEDKNEEDENVND-----NEGEGENH 603
D + + DG + + DE+D+ EED +EDE+ D + G+ EN+
Sbjct: 267 DDFDDLHDGTDVDDEDDDEDDDNEEDYEDEDEDAFDVHDHASVGDRENN 413
>TC77101 similar to GP|15148920|gb|AAK84887.1 homeodomain leucine zipper
protein HDZ3 {Phaseolus vulgaris}, complete
Length = 1532
Score = 31.6 bits (70), Expect = 0.91
Identities = 35/137 (25%), Positives = 59/137 (42%)
Frame = +1
Query: 365 LTSEEVTSLWDSKIDFNSLVETNLVFEADREKVKKIGLKEACQAIMTKGLEIAAISKMID 424
LTSE+V L S + N L E + KK+GL+ A+ + +K ++
Sbjct: 643 LTSEQVHMLEKSFEEENKLEP-----ERKTQLAKKLGLQPRQVAVWFQNRRARWKTKQLE 807
Query: 425 LESADFDGINSAKQLEEKEREILKMKATMKLLDSANKVNEKKAADLALENERLKKHVEDL 484
D+D + S+ + + DS NK NEK +++ NE+L+ +D+
Sbjct: 808 ---RDYDVLKSSYD------------SLLSTYDSINKENEKLKSEVVSLNEKLQVQAKDM 942
Query: 485 NITQKAKEEELVKSKAE 501
EE L + KA+
Sbjct: 943 ------LEEPLSEKKAD 975
>TC86552 similar to PIR|G85436|G85436 hypothetical protein AT4g36980
[imported] - Arabidopsis thaliana, partial (59%)
Length = 1785
Score = 31.2 bits (69), Expect = 1.2
Identities = 13/34 (38%), Positives = 24/34 (70%)
Frame = +2
Query: 567 DGKLISPDTGDEEDEGEEDKNEEDENVNDNEGEG 600
+GK S + D+++E E+D+++ED N +D+ EG
Sbjct: 680 NGKEESQISDDDDEEDEDDEDDEDFNSDDSNDEG 781
>TC89336 weakly similar to GP|21554135|gb|AAM63215.1 unknown {Arabidopsis
thaliana}, partial (7%)
Length = 1241
Score = 31.2 bits (69), Expect = 1.2
Identities = 22/118 (18%), Positives = 54/118 (45%)
Frame = +1
Query: 428 ADFDGINSAKQLEEKEREILKMKATMKLLDSANKVNEKKAADLALENERLKKHVEDLNIT 487
+ F G K +E ++ +K + +++ + K++E++ L +NE+ + ++ +
Sbjct: 430 SQFVGEGENKSYDELLKKFIKNEEELRVSNLKLKLSEEEIIKLKNQNEKSEGQLDSVQKE 609
Query: 488 QKAKEEELVKSKAEITHLNSSNAELKNENSKLHSEVSELKNSVLDQFEAGFAKAKEQI 545
+EL K ++ L AEL+ L ++ E+ N L A+ ++++
Sbjct: 610 LTLNMDELEHKKGQVLELQKQKAELETHVPNLVEQL-EVANEHLKISNDEVARLRKEL 780
>TC80296 similar to PIR|H86265|H86265 protein F3F19.18 [imported] -
Arabidopsis thaliana, partial (34%)
Length = 1734
Score = 31.2 bits (69), Expect = 1.2
Identities = 15/45 (33%), Positives = 27/45 (59%), Gaps = 1/45 (2%)
Frame = +3
Query: 560 DPYARIVDGKLISPDTGDEEDEGEE-DKNEEDENVNDNEGEGENH 603
D R D + D + EDEG++ + +EED ++++EG+G+ H
Sbjct: 921 DENDRSSDYETSGDDADNVEDEGDDLEDSEEDGGISEHEGDGDLH 1055
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.313 0.130 0.368
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 17,041,234
Number of Sequences: 36976
Number of extensions: 245893
Number of successful extensions: 1521
Number of sequences better than 10.0: 83
Number of HSP's better than 10.0 without gapping: 1288
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 1419
length of query: 603
length of database: 9,014,727
effective HSP length: 102
effective length of query: 501
effective length of database: 5,243,175
effective search space: 2626830675
effective search space used: 2626830675
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 61 (28.1 bits)
Lotus: description of TM0152.2