
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0022.11
(334 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
BG586159 weakly similar to PIR|T47841|T4 hypothetical protein T2... 195 2e-50
AJ502495 weakly similar to GP|18071369|g putative gag-pol polypr... 128 4e-30
TC85125 weakly similar to SP|P10978|POLX_TOBAC Retrovirus-relate... 74 2e-28
BI309716 weakly similar to PIR|G96722|G9 hypothetical protein F2... 115 2e-26
TC89912 weakly similar to PIR|B84512|B84512 probable retroelemen... 110 6e-25
BF650113 weakly similar to GP|4753889|emb| Tpv2-1c {Phaseolus vu... 92 4e-19
BG644677 weakly similar to GP|7682800|gb| Hypothetical protein T... 69 2e-12
BE941052 weakly similar to PIR|B85188|B85 retrotransposon like p... 59 2e-09
BE123913 weakly similar to GP|22093573|d polyprotein {Oryza sati... 54 1e-07
CB893805 similar to GP|10177935|d copia-type polyprotein {Arabid... 52 4e-07
BG587141 similar to PIR|H86461|H86 hypothetical protein AAF32440... 48 6e-06
BF519055 similar to GP|21592754|gb| unknown {Arabidopsis thalian... 45 4e-05
BG453259 homologue to GP|21434|emb|CA ORF4 {Solanum tuberosum}, ... 42 3e-04
CB894419 weakly similar to GP|10177935|db copia-type polyprotein... 40 0.001
TC91257 similar to GP|21434|emb|CAA36616.1|| ORF4 {Solanum tuber... 34 0.069
AJ497569 weakly similar to PIR|T04833|T04 hypothetical protein F... 31 0.76
TC83624 homologue to PIR|G84581|G84581 copia-like retroelement p... 30 1.00
TC85182 homologue to PIR|JC4780|JC4780 peroxidase (EC 1.11.1.7) ... 30 1.3
TC83658 29 2.2
TC88226 similar to GP|15186732|dbj|BAB62890. aspartic proteinase... 29 2.2
>BG586159 weakly similar to PIR|T47841|T4 hypothetical protein T2O9.150 -
Arabidopsis thaliana, partial (11%)
Length = 732
Score = 195 bits (496), Expect = 2e-50
Identities = 95/217 (43%), Positives = 141/217 (64%), Gaps = 1/217 (0%)
Frame = +1
Query: 57 IQVDQQPEATYIHQSKYTKEFLKKFNMTECTPAKTPMHPTCILEKEDVSAKVCQKLYRGM 116
++V Q E YI Q KY + L++F M + ++ P+ P C L K++ KV Y+ +
Sbjct: 1 VEVIQNEEGIYICQRKYVTDLLERFGMEKSNLSRNPIAPRCKLIKDENGVKVDATKYKQI 180
Query: 117 IGSLLYLTASRPDILFSVHLCAQFQSDPRETHLTTVKRILKYLKGTTNLGLMYKKSS*YK 176
+G L+YL A+RPD+++ + L ++F + P E H+ VKR+L+YL GT NLG+MYK++ K
Sbjct: 181 VGCLMYLAATRPDLMYVLSLISRFMNCPTELHMHAVKRVLRYLNGTINLGIMYKRNGSEK 360
Query: 177 LSGYCDADYAGDRTERKSTYGNCQFLGSNLVSWASKRQSTIALSTAEAEYISAAICSTQM 236
L Y D+DYAGD +RKST G L S VSW+SK+Q + LST +AE+I+AA C+ Q
Sbjct: 361 LEAYTDSDYAGDLDDRKSTSGYVFMLSSGAVSWSSKKQPVVTLSTTKAEFIAAAFCACQS 540
Query: 237 LWMKHQLEDYQILES-NISIYCDNTAAISLSKNLILH 272
+WM+ LE +S +I++YCDN + I LSKN +LH
Sbjct: 541 VWMRRVLEKLGYTQSGSITMYCDNNSTIKLSKNPVLH 651
>AJ502495 weakly similar to GP|18071369|g putative gag-pol polyprotein {Oryza
sativa}, partial (9%)
Length = 542
Score = 128 bits (321), Expect = 4e-30
Identities = 62/146 (42%), Positives = 98/146 (66%), Gaps = 1/146 (0%)
Frame = +2
Query: 183 ADYAGDRTERKSTYGNCQFLGSNLVSWASKRQSTIALSTAEAEYISAAICSTQMLWMKHQ 242
+D+AGD RKST G LG+ +SW+SK+Q +A STAEAEYI++ C+TQ +W++
Sbjct: 2 SDWAGDTETRKSTSGYAFHLGTGAISWSSKKQPVVAFSTAEAEYIASTSCATQTVWLRRI 181
Query: 243 LEDYQILESN-ISIYCDNTAAISLSKNLILHSRAKHIEVKYHFIRDYVQKGVLLLKFVDT 301
LE ++ IYCDN +AI+LSKN + H R+KHI++++H IR+ + + +++++ T
Sbjct: 182 LEVMHHEQNTPTKIYCDNKSAIALSKNPVFHGRSKHIDIQFHKIRELIAEKEVVIEYCPT 361
Query: 302 DHQWADIFSKPLAEDRFKFILKNLNM 327
+ + ADIF+KPL + F + K L M
Sbjct: 362 EEKIADIFTKPLKIESFYKLKKMLGM 439
>TC85125 weakly similar to SP|P10978|POLX_TOBAC Retrovirus-related Pol
polyprotein from transposon TNT 1-94 [Contains: Protease
(EC 3.4.23.-);, partial (7%)
Length = 705
Score = 73.6 bits (179), Expect(2) = 2e-28
Identities = 38/87 (43%), Positives = 56/87 (63%)
Frame = +1
Query: 152 VKRILKYLKGTTNLGLMYKKSS*YKLSGYCDADYAGDRTERKSTYGNCQFLGSNLVSWAS 211
VKRI++Y+KGT+ + + + S + GY D+D+AGD +RKST G L VSW S
Sbjct: 1 VKRIMRYIKGTSGVAVCFGGSE-LTVRGYVDSDFAGDHDKRKSTTGYVFTLAGGAVSWLS 177
Query: 212 KRQSTIALSTAEAEYISAAICSTQMLW 238
K Q+ +ALST EAEY++A + + L+
Sbjct: 178 KLQTVVALSTTEAEYMAAYLKHARKLF 258
Score = 70.1 bits (170), Expect(2) = 2e-28
Identities = 27/84 (32%), Positives = 55/84 (65%)
Frame = +3
Query: 235 QMLWMKHQLEDYQILESNISIYCDNTAAISLSKNLILHSRAKHIEVKYHFIRDYVQKGVL 294
+ +WM+ +E+ + I++YCD+ +A+ +++N HSR KHI ++YHF+R+ V++G +
Sbjct: 249 EAIWMQRLMEELGHKQEQITVYCDSQSALHIARNPAFHSRTKHIGIQYHFVREVVEEGSV 428
Query: 295 LLKFVDTDHQWADIFSKPLAEDRF 318
++ + T+ AD +K + D+F
Sbjct: 429 DMQKIHTNDNLADAMTKSINTDKF 500
>BI309716 weakly similar to PIR|G96722|G9 hypothetical protein F20P5.25
[imported] - Arabidopsis thaliana, partial (10%)
Length = 744
Score = 115 bits (289), Expect = 2e-26
Identities = 66/188 (35%), Positives = 97/188 (51%)
Frame = +2
Query: 17 IYVDDIIFGSANTFLCKEFSKMMQAEFEMSMMGELKYFLGIQVDQQPEATYIHQSKYTKE 76
+YVDDI+ + + + F++ +G L+YFLG++V + + ++Q KYT E
Sbjct: 179 VYVDDIVLAGNDISEIQHVKCFLIDRFKIKDLGSLRYFLGLEVARSKQGILLNQRKYTLE 358
Query: 77 FLKKFNMTECTPAKTPMHPTCILEKEDVSAKVCQKLYRGMIGSLLYLTASRPDILFSVHL 136
L+ TP + L D + YR +IG L+YLT +RPDI F+V
Sbjct: 359 LLEDSGNLAVKSTLTPYDISLKLHNSDSPLYNDETQYRRLIGKLIYLTTTRPDISFAVQQ 538
Query: 137 CAQFQSDPRETHLTTVKRILKYLKGTTNLGLMYKKSS*YKLSGYCDADYAGDRTERKSTY 196
+QF S P++ H R+L+YLK GL Y +S KLS + D+D+A T RKS
Sbjct: 539 LSQFVSKPQQVHYQAAIRVLQYLKTAPAKGLFYSATSNLKLSSFADSDWATCPTTRKSVT 718
Query: 197 GNCQFLGS 204
G FLGS
Sbjct: 719 GYWVFLGS 742
>TC89912 weakly similar to PIR|B84512|B84512 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana, partial
(10%)
Length = 814
Score = 110 bits (276), Expect = 6e-25
Identities = 56/162 (34%), Positives = 95/162 (58%), Gaps = 2/162 (1%)
Frame = +1
Query: 152 VKRILKYLKGTTNLGLMYKKSS*YK--LSGYCDADYAGDRTERKSTYGNCQFLGSNLVSW 209
+K +LKYL + L Y K++ + L GY DADYAG+ RKS G L +SW
Sbjct: 7 LKWVLKYLNESLKSSLKYTKAAQEEDALEGYVDADYAGNVDTRKSLSGFVFTLYGTTISW 186
Query: 210 ASKRQSTIALSTAEAEYISAAICSTQMLWMKHQLEDYQILESNISIYCDNTAAISLSKNL 269
+ +QS + LST +AEYI+ +W+K + + I + + I+CD+ +AI L+ +
Sbjct: 187 KANQQSVVTLSTTQAEYIAFVEGVKDAIWLKGMIGELGITQEYVKIHCDSQSAIHLANHQ 366
Query: 270 ILHSRAKHIEVKYHFIRDYVQKGVLLLKFVDTDHQWADIFSK 311
+ H R KHI+++ HFIRD ++ ++++ + ++ AD+F+K
Sbjct: 367 VYHERTKHIDIRLHFIRDMIESKEIVVEKMASEENPADVFTK 492
>BF650113 weakly similar to GP|4753889|emb| Tpv2-1c {Phaseolus vulgaris},
partial (13%)
Length = 494
Score = 91.7 bits (226), Expect = 4e-19
Identities = 48/122 (39%), Positives = 72/122 (58%), Gaps = 3/122 (2%)
Frame = +1
Query: 127 RPDILFSVHLCAQFQSDPRETHLTTVKRILKYLKGTTNLGLMY---KKSS*YKLSGYCDA 183
RPDI +SV + ++F DPR+ HL RIL+Y++GT GL++ KS Y+L Y D+
Sbjct: 121 RPDICYSVSVISKFMHDPRKPHLIAANRILRYVRGTMEYGLLFPYGAKSEVYELICYSDS 300
Query: 184 DYAGDRTERKSTYGNCQFLGSNLVSWASKRQSTIALSTAEAEYISAAICSTQMLWMKHQL 243
D+ GD R+ST G +SW +K+Q ALS+ EAEYI+ + Q LW+ +
Sbjct: 301 DWCGD---RRSTSGYVFKFNDAAISWCTKKQPITALSSYEAEYIAGTFATFQALWLDSVI 471
Query: 244 ED 245
++
Sbjct: 472 KE 477
>BG644677 weakly similar to GP|7682800|gb| Hypothetical protein T15F17.l
{Arabidopsis thaliana}, partial (3%)
Length = 539
Score = 69.3 bits (168), Expect = 2e-12
Identities = 39/104 (37%), Positives = 57/104 (54%)
Frame = -3
Query: 121 LYLTASRPDILFSVHLCAQFQSDPRETHLTTVKRILKYLKGTTNLGLMYKKSS*YKLSGY 180
+ LT P+I FS++L +++ S P H +K I KYLKG ++GL Y K L GY
Sbjct: 531 ILLTLQGPNITFSINLLSRYSSAPTMRH*NGIKHICKYLKGIIDMGLFYSKDCSPDLIGY 352
Query: 181 CDADYAGDRTERKSTYGNCQFLGSNLVSWASKRQSTIALSTAEA 224
+A Y D + +S G G+ ++SW S + STIA S+ A
Sbjct: 351 VNA*YLSDPHKARS*TGYIFTCGNTVISWRSTK*STIATSSNHA 220
>BE941052 weakly similar to PIR|B85188|B85 retrotransposon like protein
[imported] - Arabidopsis thaliana, partial (4%)
Length = 480
Score = 59.3 bits (142), Expect = 2e-09
Identities = 28/71 (39%), Positives = 43/71 (60%)
Frame = +2
Query: 257 CDNTAAISLSKNLILHSRAKHIEVKYHFIRDYVQKGVLLLKFVDTDHQWADIFSKPLAED 316
CD +A L+ N + HSR KHI + HF+RD VQ+G L ++ V T Q AD +KPL++
Sbjct: 29 CDYLSATYLTHNPVYHSRMKHISIDIHFVRDLVQQGKLKVQHVCTVDQLADCLTKPLSKS 208
Query: 317 RFKFILKNLNM 327
R + + + +
Sbjct: 209 RHQLLRNKIGV 241
>BE123913 weakly similar to GP|22093573|d polyprotein {Oryza sativa (japonica
cultivar-group)}, partial (8%)
Length = 503
Score = 53.5 bits (127), Expect = 1e-07
Identities = 27/81 (33%), Positives = 41/81 (50%)
Frame = +1
Query: 17 IYVDDIIFGSANTFLCKEFSKMMQAEFEMSMMGELKYFLGIQVDQQPEATYIHQSKYTKE 76
+YVDDI + K ++ EFE+ +G LKYFLG++V + + + I Q KY +
Sbjct: 127 VYVDDIFLTGDHGK*IKRLKNLLAEEFEIKDLGNLKYFLGMEVARWKKGSSISQRKYVLD 306
Query: 77 FLKKFNMTECTPAKTPMHPTC 97
LK+ M C + P C
Sbjct: 307 LLKETRMIGCKTIRDPYGCNC 369
Score = 39.7 bits (91), Expect = 0.002
Identities = 21/53 (39%), Positives = 31/53 (57%)
Frame = +2
Query: 88 PAKTPMHPTCILEKEDVSAKVCQKLYRGMIGSLLYLTASRPDILFSVHLCAQF 140
P++TPM T L D V + Y+ ++G L+YL+ +RPDI F V +QF
Sbjct: 341 PSETPMDATVKLGTLDNGTLVDKGRYQRLVGKLIYLSHTRPDISFVVCTMSQF 499
>CB893805 similar to GP|10177935|d copia-type polyprotein {Arabidopsis
thaliana}, partial (14%)
Length = 778
Score = 51.6 bits (122), Expect = 4e-07
Identities = 23/52 (44%), Positives = 35/52 (67%)
Frame = +3
Query: 17 IYVDDIIFGSANTFLCKEFSKMMQAEFEMSMMGELKYFLGIQVDQQPEATYI 68
+YVDD+IF + + +EF K M+ EF MS +G++ YFLG++V Q + YI
Sbjct: 618 LYVDDLIFIGNDENMFEEFKKSMKKEFNMSDLGKMHYFLGVEVTQNEKGIYI 773
>BG587141 similar to PIR|H86461|H86 hypothetical protein AAF32440.1
[imported] - Arabidopsis thaliana, partial (20%)
Length = 731
Score = 47.8 bits (112), Expect = 6e-06
Identities = 29/95 (30%), Positives = 51/95 (53%), Gaps = 2/95 (2%)
Frame = +3
Query: 235 QMLWMKHQLED--YQILESNISIYCDNTAAISLSKNLILHSRAKHIEVKYHFIRDYVQKG 292
Q +W++ L + ++ E + I DN + I+L++N + H R HI +YHFIR+ V+ G
Sbjct: 135 QAMWLQDLLSEVTWEPCEE-VVIRIDNQSVIALTRNPVFHGRGNHIHKRYHFIRECVENG 311
Query: 293 VLLLKFVDTDHQWADIFSKPLAEDRFKFILKNLNM 327
+ ++ V + A I +K L F+ I + M
Sbjct: 312 QVEVEHVPGEKHRAYI*TKALGRIIFREIRYYIGM 416
>BF519055 similar to GP|21592754|gb| unknown {Arabidopsis thaliana}, partial
(30%)
Length = 675
Score = 45.1 bits (105), Expect = 4e-05
Identities = 24/83 (28%), Positives = 45/83 (53%)
Frame = -3
Query: 103 DVSAKVCQKLYRGMIGSLLYLTASRPDILFSVHLCAQFQSDPRETHLTTVKRILKYLKGT 162
D S KL ++G+LLY+T + PD+ FS++ +QF P + + +K+++++ K
Sbjct: 664 DGSITADSKLVHSIVGALLYITVTCPDLSFSINKPSQFMHKPTQINFQQLKKVMRHPK-- 491
Query: 163 TNLGLMYKKSS*YKLSGYCDADY 185
L KK ++ + DAD+
Sbjct: 490 ----LTIKKLFDLQIYAFSDADW 434
>BG453259 homologue to GP|21434|emb|CA ORF4 {Solanum tuberosum}, partial (5%)
Length = 657
Score = 42.0 bits (97), Expect = 3e-04
Identities = 34/132 (25%), Positives = 64/132 (47%), Gaps = 3/132 (2%)
Frame = -2
Query: 167 LMYKKSS*YKLSGYCDADYAGDRTERKSTYGNCQFLGSNLVSWASKRQSTIA--LSTAEA 224
+++K+ + Y + AG +R ST G FLG N+V +Q+ +A +
Sbjct: 530 IVFKRE*KLSMEVYXNTVCAGWIVDRGSTSGY*MFLGGNMVE*---KQNVVAR*VQRHNF 360
Query: 225 EYISAAICSTQMLWMKHQLEDYQI-LESNISIYCDNTAAISLSKNLILHSRAKHIEVKYH 283
E S + +K +L+D I + ++++ +N ++ N + H R KHIE+ H
Sbjct: 359 ELCSQGL*RVMDEELKIKLDDLIINYKDPMTLF*NNNFVSRIAHNPVQHYRTKHIEIDQH 180
Query: 284 FIRDYVQKGVLL 295
FI + + G++L
Sbjct: 179 FIIEKLYSGLIL 144
>CB894419 weakly similar to GP|10177935|db copia-type polyprotein
{Arabidopsis thaliana}, partial (2%)
Length = 170
Score = 40.4 bits (93), Expect = 0.001
Identities = 21/55 (38%), Positives = 28/55 (50%)
Frame = +3
Query: 72 KYTKEFLKKFNMTECTPAKTPMHPTCILEKEDVSAKVCQKLYRGMIGSLLYLTAS 126
K + LKKF M P TP+ L +E +V Y+ +IGSL YLTA+
Sbjct: 6 KCASDILKKFKMEHSKPISTPVEEKLKLTRESDGKRVDSTHYKSLIGSLRYLTAT 170
>TC91257 similar to GP|21434|emb|CAA36616.1|| ORF4 {Solanum tuberosum},
partial (8%)
Length = 854
Score = 34.3 bits (77), Expect = 0.069
Identities = 14/28 (50%), Positives = 21/28 (75%)
Frame = +3
Query: 113 YRGMIGSLLYLTASRPDILFSVHLCAQF 140
YR ++G L YLT +RPDI ++V + +QF
Sbjct: 762 YRRLVGKLNYLTMTRPDISYAVSVVSQF 845
>AJ497569 weakly similar to PIR|T04833|T04 hypothetical protein F21P8.50 -
Arabidopsis thaliana, partial (4%)
Length = 723
Score = 30.8 bits (68), Expect = 0.76
Identities = 18/60 (30%), Positives = 31/60 (51%), Gaps = 2/60 (3%)
Frame = +2
Query: 256 YCDNTAAISLSKNLILHSRAKHIEVKYHFIRDYVQKGVLLLKFVD--TDHQWADIFSKPL 313
YCDN +A+ ++ N++ H R H E Y+ +G +L+ + + Q A +KPL
Sbjct: 290 YCDNISALHIAANMVFHERT*HRETD-----PYIVQGSRMLQLMPSASKDQPAYSLTKPL 454
>TC83624 homologue to PIR|G84581|G84581 copia-like retroelement pol
polyprotein [imported] - Arabidopsis thaliana, partial
(1%)
Length = 831
Score = 30.4 bits (67), Expect = 1.00
Identities = 12/31 (38%), Positives = 18/31 (57%)
Frame = +2
Query: 255 IYCDNTAAISLSKNLILHSRAKHIEVKYHFI 285
IYC N + ++KN + H R KH E + F+
Sbjct: 482 IYCVNQITLYIAKNQVYHERTKH*ENNWTFL 574
>TC85182 homologue to PIR|JC4780|JC4780 peroxidase (EC 1.11.1.7) 1B precursor
- alfalfa, complete
Length = 1373
Score = 30.0 bits (66), Expect = 1.3
Identities = 21/73 (28%), Positives = 32/73 (43%), Gaps = 2/73 (2%)
Frame = -2
Query: 60 DQQPEATYIHQSKYTKEFLKK--FNMTECTPAKTPMHPTCILEKEDVSAKVCQKLYRGMI 117
D+ EAT S E K F ++ C P TP+ P I+ + K+ +K + +
Sbjct: 1183 DESVEATLTKPSSTDFELTKLHCFLISPCFPVNTPIFPILII----AALKLSKKAF*SVA 1016
Query: 118 GSLLYLTASRPDI 130
L L S PD+
Sbjct: 1015 NLLTMLMVSEPDV 977
>TC83658
Length = 834
Score = 29.3 bits (64), Expect = 2.2
Identities = 13/29 (44%), Positives = 18/29 (61%)
Frame = -2
Query: 215 STIALSTAEAEYISAAICSTQMLWMKHQL 243
S+I E +Y SA +C T ML ++HQL
Sbjct: 566 SSINFHNKEIQYQSARVCHTNMLIVRHQL 480
>TC88226 similar to GP|15186732|dbj|BAB62890. aspartic proteinase delta
{Glycine max}, partial (57%)
Length = 1361
Score = 29.3 bits (64), Expect = 2.2
Identities = 19/68 (27%), Positives = 34/68 (49%), Gaps = 2/68 (2%)
Frame = +2
Query: 194 STYGNCQFLGSNLVSWA--SKRQSTIALSTAEAEYISAAICSTQMLWMKHQLEDYQILES 251
S G C F G+ +S S + T +S+ + + ++C ++WM++QL+ Q E
Sbjct: 434 SQIGLCTFDGTQGISMGIQSVVEQTDRISSGGHQDATCSVCEMAVVWMQNQLKQNQ-TEE 610
Query: 252 NISIYCDN 259
I Y D+
Sbjct: 611 RIINYADS 634
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.325 0.137 0.413
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 10,770,489
Number of Sequences: 36976
Number of extensions: 148052
Number of successful extensions: 936
Number of sequences better than 10.0: 44
Number of HSP's better than 10.0 without gapping: 920
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 930
length of query: 334
length of database: 9,014,727
effective HSP length: 97
effective length of query: 237
effective length of database: 5,428,055
effective search space: 1286449035
effective search space used: 1286449035
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 58 (26.9 bits)
Lotus: description of TM0022.11