
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0346.12
(1332 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
CB893805 similar to GP|10177935|d copia-type polyprotein {Arabid... 274 2e-73
BG586159 weakly similar to PIR|T47841|T4 hypothetical protein T2... 273 3e-73
AJ502495 weakly similar to GP|18071369|g putative gag-pol polypr... 177 2e-44
AW689768 weakly similar to GP|10177485|d polyprotein {Arabidopsi... 172 8e-43
BG587156 similar to PIR|G85055|G8 probable polyprotein [imported... 167 2e-41
TC93066 weakly similar to GP|19920130|gb|AAM08562.1 Putative ret... 167 2e-41
BG644690 weakly similar to GP|18542179|gb putative pol protein {... 137 7e-40
BI309716 weakly similar to PIR|G96722|G9 hypothetical protein F2... 160 3e-39
TC85125 weakly similar to SP|P10978|POLX_TOBAC Retrovirus-relate... 89 8e-35
TC89912 weakly similar to PIR|B84512|B84512 probable retroelemen... 114 2e-25
BF650113 weakly similar to GP|4753889|emb| Tpv2-1c {Phaseolus vu... 109 8e-24
BI262917 weakly similar to GP|19920130|g Putative retroelement {... 102 8e-22
BG586293 weakly similar to PIR|E84473|E84 probable retroelement ... 97 9e-22
CB892913 similar to GP|1769897|emb lectin receptor kinase {Arabi... 66 8e-19
BI262818 weakly similar to GP|6642775|gb| gag-pol polyprotein {V... 91 4e-18
BG587170 similar to PIR|F86470|F8 probable retroelement polyprot... 91 4e-18
BG586273 weakly similar to PIR|F86470|F8 probable retroelement p... 77 4e-14
BG644677 weakly similar to GP|7682800|gb| Hypothetical protein T... 73 8e-13
BE941052 weakly similar to PIR|B85188|B85 retrotransposon like p... 70 5e-12
TC81992 weakly similar to PIR|T47841|T47841 hypothetical protein... 58 2e-11
>CB893805 similar to GP|10177935|d copia-type polyprotein {Arabidopsis
thaliana}, partial (14%)
Length = 778
Score = 274 bits (700), Expect = 2e-73
Identities = 145/259 (55%), Positives = 187/259 (71%), Gaps = 4/259 (1%)
Frame = +3
Query: 812 NAMMMVTENDPVSFGEAVKNKKWRDAMSAEIESIERNQSWELTVLPKGVKPIGVKWVFKT 871
N M+ +DP +F EAVK++KWR +M+ E+E+ ERN +WELT L G K IG+KW+FKT
Sbjct: 12 NLGMLTMTSDPTTFEEAVKSEKWRASMNNEMEATERNNTWELTDLRSGAKTIGLKWIFKT 191
Query: 872 KLNEDGDVEKFKARLVAKGYAQRHEVDYTEVFAPVARLDTIRVILEVAAQFSWEVFQLD- 930
KLNE+G++EK+KARLVAKGY+Q++ VDYTEVFAPVAR DTIR+++ +AAQ + +
Sbjct: 192 KLNENGEIEKYKARLVAKGYSQQYGVDYTEVFAPVARWDTIRMVIALAAQIKRDGVCIS* 371
Query: 931 --VKSAFLHGELKEEVFVQQPKGFIRKGEEDKVYRLKKALYGLKQALRAWYSRIEAYFVR 988
+ + +++ + + R R+K+ALYGLKQA RAWYSRIEAYF +
Sbjct: 372 M*KAHSCMEN*MRKFLLINHRVM*RRVIS*----RVKRALYGLKQAPRAWYSRIEAYFTK 539
Query: 989 EDFERCPSEHILFTK-SKGGRILIVSLYVDDLIFTGNDRVMCDEFKSSMMLEFDMSDLGK 1047
E FE+CP EH LF K S+GG+ILI+SLYVDDLIF GND M +EFK SM EF+MSDLGK
Sbjct: 540 EGFEKCPYEHTLFVKLSEGGKILIISLYVDDLIFIGNDENMFEEFKKSMKKEFNMSDLGK 719
Query: 1048 MKYFLGVEVKQC*DGIFIC 1066
M YFLGVEV Q GI+IC
Sbjct: 720 MHYFLGVEVTQNEKGIYIC 776
>BG586159 weakly similar to PIR|T47841|T4 hypothetical protein T2O9.150 -
Arabidopsis thaliana, partial (11%)
Length = 732
Score = 273 bits (698), Expect = 3e-73
Identities = 140/219 (63%), Positives = 164/219 (73%)
Frame = +1
Query: 1054 VEVKQC*DGIFIC*RRYARDVLARFDMRDSNAVKNPTVPGTKLSKDEGGVRVDETLFKQV 1113
VEV Q +GI+IC R+Y D+L RF M SN +NP P KL KDE GV+VD T +KQ+
Sbjct: 1 VEVIQNEEGIYICQRKYVTDLLERFGMEKSNLSRNPIAPRCKLIKDENGVKVDATKYKQI 180
Query: 1114 VGSLMYLTVTRPDLMYGVSLISRFMSSPTMSHWLAAKRILRYLKGTTDLGIFYKKGGSNM 1173
VG LMYL TRPDLMY +SLISRFM+ PT H A KR+LRYL GT +LGI YK+ GS
Sbjct: 181 VGCLMYLAATRPDLMYVLSLISRFMNCPTELHMHAVKRVLRYLNGTINLGIMYKRNGSE- 357
Query: 1174 KLMAFPDSDYAGDLDDRRSTSRFVFMLGSGVVSWSSKKQYVVALSTTEAEYIAAALCACQ 1233
KL A+ DSDYAGDLDDR+STS +VFML SG VSWSSKKQ VV LSTT+AE+IAAA CACQ
Sbjct: 358 KLEAYTDSDYAGDLDDRKSTSGYVFMLSSGAVSWSSKKQPVVTLSTTKAEFIAAAFCACQ 537
Query: 1234 CVWLIRVLEKIGVEEKTSTEIMCDNSFTIQLSKNPVFHG 1272
VW+ RVLEK+G + S + CDN+ TI+LSKNPV HG
Sbjct: 538 SVWMRRVLEKLGYTQSGSITMYCDNNSTIKLSKNPVLHG 654
>AJ502495 weakly similar to GP|18071369|g putative gag-pol polyprotein {Oryza
sativa}, partial (9%)
Length = 542
Score = 177 bits (449), Expect = 2e-44
Identities = 83/146 (56%), Positives = 111/146 (75%)
Frame = +2
Query: 1181 SDYAGDLDDRRSTSRFVFMLGSGVVSWSSKKQYVVALSTTEAEYIAAALCACQCVWLIRV 1240
SD+AGD + R+STS + F LG+G +SWSSKKQ VVA ST EAEYIA+ CA Q VWL R+
Sbjct: 2 SDWAGDTETRKSTSGYAFHLGTGAISWSSKKQPVVAFSTAEAEYIASTSCATQTVWLRRI 181
Query: 1241 LEKIGVEEKTSTEIMCDNSFTIQLSKNPVFHGKSKHIDVRFHFLRDLVNDGVIKLSYCNS 1300
LE + E+ T T+I CDN I LSKNPVFHG+SKHID++FH +R+L+ + + + YC +
Sbjct: 182 LEVMHHEQNTPTKIYCDNKSAIALSKNPVFHGRSKHIDIQFHKIRELIAEKEVVIEYCPT 361
Query: 1301 QDQIADIMTKPIKLEQFEKLRGMLGV 1326
+++IADI TKP+K+E F KL+ MLG+
Sbjct: 362 EEKIADIFTKPLKIESFYKLKKMLGM 439
>AW689768 weakly similar to GP|10177485|d polyprotein {Arabidopsis thaliana},
partial (9%)
Length = 675
Score = 172 bits (436), Expect = 8e-43
Identities = 87/206 (42%), Positives = 131/206 (63%)
Frame = +1
Query: 819 ENDPVSFGEAVKNKKWRDAMSAEIESIERNQSWELTVLPKGVKPIGVKWVFKTKLNEDGD 878
EN P+S + +W AM E +++ N++W+L LP K IG KWV++ K N DG
Sbjct: 70 ENLPLS------DPRWLQAMKTEYKALIDNKTWDLVPLPPHKKAIGCKWVYRVKENPDGS 231
Query: 879 VEKFKARLVAKGYAQRHEVDYTEVFAPVARLDTIRVILEVAAQFSWEVFQLDVKSAFLHG 938
V KFKARLVAKG++Q DYTE F+PV + TIR+IL +A + WE+ Q+D+ +AFL+G
Sbjct: 232 VNKFKARLVAKGFSQTLGCDYTETFSPVIKPVTIRLILTIAITYKWEIQQIDINNAFLNG 411
Query: 939 ELKEEVFVQQPKGFIRKGEEDKVYRLKKALYGLKQALRAWYSRIEAYFVREDFERCPSEH 998
L+EEV++ QP+GF + V +L K+LYGLKQA RAWY + + ++ F + +
Sbjct: 412 FLQEEVYMSQPQGF-EAANKSLVCKLNKSLYGLKQAPRAWYEXLTSAQIQFGFTKSRCDP 588
Query: 999 ILFTKSKGGRILIVSLYVDDLIFTGN 1024
L ++ G + + +YVDD++ TG+
Sbjct: 589 SLLIYNQNGACIYLXIYVDDILITGS 666
>BG587156 similar to PIR|G85055|G8 probable polyprotein [imported] -
Arabidopsis thaliana, partial (17%)
Length = 618
Score = 167 bits (423), Expect = 2e-41
Identities = 79/190 (41%), Positives = 123/190 (64%), Gaps = 1/190 (0%)
Frame = -1
Query: 814 MMMVTEND-PVSFGEAVKNKKWRDAMSAEIESIERNQSWELTVLPKGVKPIGVKWVFKTK 872
M+ + EN P S+ EA+++K+W++++ AE ++ +N +W + LPKG K + +W+F K
Sbjct: 585 MVNLNENHIPRSYEEAMEDKEWKESVGAEAGAMIKNDTWYESELPKGKKAVSSRWIFTIK 406
Query: 873 LNEDGDVEKFKARLVAKGYAQRHEVDYTEVFAPVARLDTIRVILEVAAQFSWEVFQLDVK 932
DG +E+ K RLVA+G+ + DY E FAPVA+L TIR++L +A W ++Q+DVK
Sbjct: 405 YKADGSIERKKTRLVARGFTLTYGEDYIETFAPVAKLHTIRIVLSLAVNLGWGLWQMDVK 226
Query: 933 SAFLHGELKEEVFVQQPKGFIRKGEEDKVYRLKKALYGLKQALRAWYSRIEAYFVREDFE 992
+AFL GEL++EV++ P G + V RLKKA+YGLKQ+ RAWY+++ F
Sbjct: 225 NAFLQGELEDEVYMYPPPGLEHLVKRGNVLRLKKAIYGLKQSPRAWYNKLSTTLNGRGFR 46
Query: 993 RCPSEHILFT 1002
+ +H LFT
Sbjct: 45 KSELDHTLFT 16
>TC93066 weakly similar to GP|19920130|gb|AAM08562.1 Putative retroelement
{Oryza sativa} [Oryza sativa (japonica cultivar-group)],
partial (10%)
Length = 823
Score = 167 bits (423), Expect = 2e-41
Identities = 94/231 (40%), Positives = 134/231 (57%), Gaps = 4/231 (1%)
Frame = +1
Query: 493 LQLVHSDICGPIKPSWNSDKRYILSFIDDHTRKTWVYFLHEKSEAFVKFKEYKAGVEKEI 552
L +HSD+ GP K + +RY+++ IDD RK WVYFL K+E F FK+++ VE +
Sbjct: 97 LDYIHSDLWGPSKVTSYGGRRYMMTIIDDFPRKVWVYFLRYKNETFPTFKKWRILVETQT 276
Query: 553 GAHITCLRTDRVCEFTSNEFDEFCRSQGINRQLTTTYTP*QNGVAERKNRTIMNVVRSML 612
G ++ L TD EF S++F+EFC + GI R T P QNGVAER RT++ R ML
Sbjct: 277 GKNVKKLITDN*LEFCSSDFNEFCTNHGIARHKTIPRNPQQNGVAERMIRTLLERARCML 456
Query: 613 NEKQV--PKVFWSEAVRWCVHIQNRCPIVAVENKTPEEAWSGEKPVVHYFRIFECVAHVH 670
+ + + W EA H+ NR P A++ K PE+ WSG RIF C A+
Sbjct: 457 SNAGL*N*RDLWVEAASTACHLVNRSPHSALDFKVPEDIWSGNLVDYSNLRIFGCPAYAL 636
Query: 671 VPDQRRSKLDDKSKKCVFLGVSDESKAWRLYV--PVSKKIIVSKDVVFEEE 719
V D KL ++ +C+FL + ESK +RL+ P S+K+I+S+DV F E+
Sbjct: 637 VND---GKLAPRAGECIFLSYASESKGYRLWCSDPKSQKLILSRDVTFNED 780
>BG644690 weakly similar to GP|18542179|gb putative pol protein {Zea mays},
partial (22%)
Length = 629
Score = 137 bits (344), Expect(2) = 7e-40
Identities = 63/141 (44%), Positives = 99/141 (69%)
Frame = -2
Query: 879 VEKFKARLVAKGYAQRHEVDYTEVFAPVARLDTIRVILEVAAQFSWEVFQLDVKSAFLHG 938
+ + K++LV +GY Q+ +DY E F+PVAR++ IR+++ AA ++++Q+DVKSAF++G
Sbjct: 424 ITRNKSKLVVQGYNQKEGIDYDEAFSPVARMEAIRILIAFAAFMGFKLYQMDVKSAFING 245
Query: 939 ELKEEVFVQQPKGFIRKGEEDKVYRLKKALYGLKQALRAWYSRIEAYFVREDFERCPSEH 998
+LKEEVFV+QP GF + V+RL K LYGLKQA RAWY R+ + ++ F+R ++
Sbjct: 244 DLKEEVFVKQPPGFEDAEVPNHVFRLNKTLYGLKQAPRAWYERLSKFLLKNGFKRGKIDN 65
Query: 999 ILFTKSKGGRILIVSLYVDDL 1019
LF + +LI+ +YVDD+
Sbjct: 64 TLFLLKRE*ELLIIQVYVDDI 2
Score = 47.0 bits (110), Expect(2) = 7e-40
Identities = 19/57 (33%), Positives = 34/57 (59%)
Frame = -3
Query: 817 VTENDPVSFGEAVKNKKWRDAMSAEIESIERNQSWELTVLPKGVKPIGVKWVFKTKL 873
++ +P + EA+++ W ++M E+ ER++ W L P+G IG +WVF+ KL
Sbjct: 600 ISSIEPKNVKEALRDADWINSMQEELHQFERSKVWYLVPRPEGKTVIGTRWVFRNKL 430
>BI309716 weakly similar to PIR|G96722|G9 hypothetical protein F20P5.25
[imported] - Arabidopsis thaliana, partial (10%)
Length = 744
Score = 160 bits (405), Expect = 3e-39
Identities = 91/246 (36%), Positives = 141/246 (56%), Gaps = 3/246 (1%)
Frame = +2
Query: 960 KVYRLKKALYGLKQALRAWYSRIEAYFVREDFERCPSEHILFTKSKGGRILIVSLYVDDL 1019
KV L+K++YGLKQA R WYS++ + + + S+ LFTK K + +YVDD+
Sbjct: 17 KVCELQKSIYGLKQASRQWYSKLSESLISFGYLQSSSDFSLFTKFKDSSFTTLLVYVDDI 196
Query: 1020 IFTGNDRVMCDEFKSSMMLEFDMSDLGKMKYFLGVEVKQC*DGIFIC*RRYARDVLARFD 1079
+ GND K ++ F + DLG ++YFLG+EV + GI + R+Y ++L +
Sbjct: 197 VLAGNDISEIQHVKCFLIDRFKIKDLGSLRYFLGLEVARSKQGILLNQRKYTLELL---E 367
Query: 1080 MRDSNAVKNPTVP---GTKLSKDEGGVRVDETLFKQVVGSLMYLTVTRPDLMYGVSLISR 1136
+ AVK+ P KL + + DET +++++G L+YLT TRPD+ + V +S+
Sbjct: 368 DSGNLAVKSTLTPYDISLKLHNSDSPLYNDETQYRRLIGKLIYLTTTRPDISFAVQQLSQ 547
Query: 1137 FMSSPTMSHWLAAKRILRYLKGTTDLGIFYKKGGSNMKLMAFPDSDYAGDLDDRRSTSRF 1196
F+S P H+ AA R+L+YLK G+FY SN+KL +F DSD+A R+S + +
Sbjct: 548 FVSKPQQVHYQAAIRVLQYLKTAPAKGLFY-SATSNLKLSSFADSDWATCPTTRKSVTGY 724
Query: 1197 VFMLGS 1202
LGS
Sbjct: 725 WVFLGS 742
>TC85125 weakly similar to SP|P10978|POLX_TOBAC Retrovirus-related Pol
polyprotein from transposon TNT 1-94 [Contains: Protease
(EC 3.4.23.-);, partial (7%)
Length = 705
Score = 89.0 bits (219), Expect(2) = 8e-35
Identities = 45/80 (56%), Positives = 59/80 (73%)
Frame = +1
Query: 1150 KRILRYLKGTTDLGIFYKKGGSNMKLMAFPDSDYAGDLDDRRSTSRFVFMLGSGVVSWSS 1209
KRI+RY+KGT+ + + + GGS + + + DSD+AGD D R+ST+ +VF L G VSW S
Sbjct: 4 KRIMRYIKGTSGVAVCF--GGSELTVRGYVDSDFAGDHDKRKSTTGYVFTLAGGAVSWLS 177
Query: 1210 KKQYVVALSTTEAEYIAAAL 1229
K Q VVALSTTEAEY+AA L
Sbjct: 178 KLQTVVALSTTEAEYMAAYL 237
Score = 78.2 bits (191), Expect(2) = 8e-35
Identities = 32/99 (32%), Positives = 65/99 (65%), Gaps = 1/99 (1%)
Frame = +3
Query: 1231 AC-QCVWLIRVLEKIGVEEKTSTEIMCDNSFTIQLSKNPVFHGKSKHIDVRFHFLRDLVN 1289
AC + +W+ R++E++G +++ T + CD+ + +++NP FH ++KHI +++HF+R++V
Sbjct: 240 ACKEAIWMQRLMEELGHKQEQIT-VYCDSQSALHIARNPAFHSRTKHIGIQYHFVREVVE 416
Query: 1290 DGVIKLSYCNSQDQIADIMTKPIKLEQFEKLRGMLGVTE 1328
+G + + ++ D +AD MTK I ++F R G+ E
Sbjct: 417 EGSVDMQKIHTNDNLADAMTKSINTDKFIWCRSSYGLLE 533
>TC89912 weakly similar to PIR|B84512|B84512 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana, partial
(10%)
Length = 814
Score = 114 bits (286), Expect = 2e-25
Identities = 60/164 (36%), Positives = 96/164 (57%), Gaps = 1/164 (0%)
Frame = +1
Query: 1148 AAKRILRYLKGTTDLGIFYKKGGSNMK-LMAFPDSDYAGDLDDRRSTSRFVFMLGSGVVS 1206
A K +L+YL + + Y K L + D+DYAG++D R+S S FVF L +S
Sbjct: 4 ALKWVLKYLNESLKSSLKYTKAAQEEDALEGYVDADYAGNVDTRKSLSGFVFTLYGTTIS 183
Query: 1207 WSSKKQYVVALSTTEAEYIAAALCACQCVWLIRVLEKIGVEEKTSTEIMCDNSFTIQLSK 1266
W + +Q VV LSTT+AEYIA +WL ++ ++G+ ++ +I CD+ I L+
Sbjct: 184 WKANQQSVVTLSTTQAEYIAFVEGVKDAIWLKGMIGELGITQE-YVKIHCDSQSAIHLAN 360
Query: 1267 NPVFHGKSKHIDVRFHFLRDLVNDGVIKLSYCNSQDQIADIMTK 1310
+ V+H ++KHID+R HF+RD++ I + S++ AD+ TK
Sbjct: 361 HQVYHERTKHIDIRLHFIRDMIESKEIVVEKMASEENPADVFTK 492
>BF650113 weakly similar to GP|4753889|emb| Tpv2-1c {Phaseolus vulgaris},
partial (13%)
Length = 494
Score = 109 bits (272), Expect = 8e-24
Identities = 55/126 (43%), Positives = 79/126 (62%), Gaps = 2/126 (1%)
Frame = +1
Query: 1124 RPDLMYGVSLISRFMSSPTMSHWLAAKRILRYLKGTTDLGIFYKKGGSN--MKLMAFPDS 1181
RPD+ Y VS+IS+FM P H +AA RILRY++GT + G+ + G + +L+ + DS
Sbjct: 121 RPDICYSVSVISKFMHDPRKPHLIAANRILRYVRGTMEYGLLFPYGAKSEVYELICYSDS 300
Query: 1182 DYAGDLDDRRSTSRFVFMLGSGVVSWSSKKQYVVALSTTEAEYIAAALCACQCVWLIRVL 1241
D+ GD RRSTS +VF +SW +KKQ + ALS+ EAEYIA Q +WL V+
Sbjct: 301 DWCGD---RRSTSGYVFKFNDAAISWCTKKQPITALSSYEAEYIAGTFATFQALWLDSVI 471
Query: 1242 EKIGVE 1247
+++ E
Sbjct: 472 KELKCE 489
>BI262917 weakly similar to GP|19920130|g Putative retroelement {Oryza
sativa} [Oryza sativa (japonica cultivar-group)],
partial (8%)
Length = 426
Score = 102 bits (255), Expect = 8e-22
Identities = 49/115 (42%), Positives = 69/115 (59%)
Frame = +1
Query: 589 YTP*QNGVAERKNRTIMNVVRSMLNEKQVPKVFWSEAVRWCVHIQNRCPIVAVENKTPEE 648
+TP QNGVAER NRT++ R+ML + K FW+EAV+ ++ NR P ++ KTP E
Sbjct: 82 HTPQQNGVAERMNRTLLERTRAMLKTAGMAKSFWAEAVKTACYVINRSPSTVIDLKTPME 261
Query: 649 AWSGEKPVVHYFRIFECVAHVHVPDQRRSKLDDKSKKCVFLGVSDESKAWRLYVP 703
W G+ +F C +V Q R+KLD KS+KC+FLG +D K + L+ P
Sbjct: 262 MWKGKPVDYSSLHVFGCPVYVMYNSQERTKLDPKSRKCIFLGYADNVKGYXLWDP 426
>BG586293 weakly similar to PIR|E84473|E84 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana, partial
(7%)
Length = 763
Score = 97.4 bits (241), Expect(2) = 9e-22
Identities = 46/92 (50%), Positives = 67/92 (72%)
Frame = +2
Query: 847 RNQSWELTVLPKGVKPIGVKWVFKTKLNEDGDVEKFKARLVAKGYAQRHEVDYTEVFAPV 906
+ Q+ +L P GVKPIG++W++K K NEDG + K+KARLVAKGY ++ +D+ EVFAPV
Sbjct: 47 QKQTLKLVKKPTGVKPIGLRWIYKIKRNEDGTLIKYKARLVAKGYVKQQGIDFDEVFAPV 226
Query: 907 ARLDTIRVILEVAAQFSWEVFQLDVKSAFLHG 938
R++TI ++L +AA + +DVK AFL+G
Sbjct: 227 VRIETI*LLLALAATNGC*IHHIDVKIAFLNG 322
Score = 25.8 bits (55), Expect(2) = 9e-22
Identities = 11/16 (68%), Positives = 13/16 (80%)
Frame = +3
Query: 834 WRDAMSAEIESIERNQ 849
W DAM AEIESI +N+
Sbjct: 9 WIDAMKAEIESIIKNR 56
>CB892913 similar to GP|1769897|emb lectin receptor kinase {Arabidopsis
thaliana}, partial (35%)
Length = 837
Score = 66.2 bits (160), Expect(2) = 8e-19
Identities = 37/121 (30%), Positives = 69/121 (56%), Gaps = 1/121 (0%)
Frame = +1
Query: 131 EKVDSFLGRTLIVVNKMKSNGETMEHSTVVSKILRSLTSKFNYVVCSIEESNDLSILSID 190
E+V + R + + N++K N E +E ++ KILRSL KF ++V IEE+ DL ++I+
Sbjct: 472 ERVRVYFSRVISISNQLKRNSEKLEDVRIMEKILRSLDPKFEHIVEKIEETKDLETMTIE 651
Query: 191 ELHGSLLVHEQRMQGHQEEEHVLK-VAQEDRSSRGRGRGAPRGGRGRGRGRQSLNKEVIE 249
+L GSL +E++ H++++ + + + + + G+ R + +GRG N + I
Sbjct: 652 KLQGSLQAYEEK---HKKKKDITE*LFKMQLKEKDEGQRNERIQQSQGRGST*KNCKFIN 822
Query: 250 C 250
C
Sbjct: 823 C 825
Score = 47.0 bits (110), Expect(2) = 8e-19
Identities = 24/99 (24%), Positives = 52/99 (52%)
Frame = +3
Query: 19 YDYWSMTMENFLRSKEMWSLVDEGIPVLETGTTPASEEQMKAVEEAKLKDLKVKNYLFQA 78
YD + M L + ++W +V++G + + ++ Q +++++ +D K ++QA
Sbjct: 150 YDNCFIKMMALLGAHDVWEVVEKGYKESQDEDS-LTKAQRDTLKDSRKRDKKALFLIYQA 326
Query: 79 IGREILETILDKETSKEIWNSMKQKYHGSSKVKREQLQV 117
+ + E I + ++KE W +K G KVK+ +LQ+
Sbjct: 327 LDEDEFEKISNATSAKEAWEKLKTSCQGEDKVKKVRLQI 443
>BI262818 weakly similar to GP|6642775|gb| gag-pol polyprotein {Vitis
vinifera}, partial (24%)
Length = 615
Score = 90.5 bits (223), Expect = 4e-18
Identities = 46/158 (29%), Positives = 82/158 (51%), Gaps = 1/158 (0%)
Frame = +1
Query: 1 MSDSEKFEQPAIPKFDG-HYDYWSMTMENFLRSKEMWSLVDEGIPVLETGTTPASEEQMK 59
M ++ P +P FDG +Y W+ +E +L + ++W V+E VL P Q+K
Sbjct: 142 MDSETSYKAPPLPVFDGENYHIWAARIEAYLEANDLWEAVEEDYEVLPLSDNPTMA-QIK 318
Query: 60 AVEEAKLKDLKVKNYLFQAIGREILETILDKETSKEIWNSMKQKYHGSSKVKREQLQVMR 119
+E K + K + LF A+ EI I+ +++ EIWN +K +Y G +++ Q +
Sbjct: 319 NHKERKTRKSKARATLFAAVSEEIFTRIMTIKSAFEIWNFLKTEYEGDERIRGMQALNLI 498
Query: 120 REFDLLAMKEGEKVDSFLGRTLIVVNKMKSNGETMEHS 157
REF++ MKE E + + + + + NK++ G + S
Sbjct: 499 REFEMQKMKESETIKEYANKLISIANKVRLLGSELXDS 612
>BG587170 similar to PIR|F86470|F8 probable retroelement polyprotein
[imported] - Arabidopsis thaliana, partial (13%)
Length = 718
Score = 90.5 bits (223), Expect = 4e-18
Identities = 58/184 (31%), Positives = 90/184 (48%), Gaps = 1/184 (0%)
Frame = -3
Query: 435 HCRFGHLNHKGLRTLSHKKMVVGLPSLESPEKICTTCLTGKQHREPVPKRSLWRASKQLQ 494
H R GH + + L + LP + K C C+ GK + P+ S +
Sbjct: 578 HARLGHPHGRALNLM--------LPGVVFENKNCEACILGKHCKNVFPRTSTVYENC-FD 426
Query: 495 LVHSDICGPIKPSWNSDK-RYILSFIDDHTRKTWVYFLHEKSEAFVKFKEYKAGVEKEIG 553
L+++D+ PS + D +Y ++FID+ ++ TW+ + K FK ++A V
Sbjct: 425 LIYTDLW--TAPSLSRDNHKYFVTFIDEKSKYTWLTLIPSKDRVIDAFKNFQAYVTNHYH 252
Query: 554 AHITCLRTDRVCEFTSNEFDEFCRSQGINRQLTTTYTP*QNGVAERKNRTIMNVVRSMLN 613
A I LR+D E+TS F GI Q + YTP QNGVA+RKN+ +M V RS++
Sbjct: 251 AKIKILRSDNGGEYTSYAFKSHLDHHGILHQTSCPYTPQQNGVAKRKNKHLMEVARSLMF 72
Query: 614 EKQV 617
+ V
Sbjct: 71 QANV 60
>BG586273 weakly similar to PIR|F86470|F8 probable retroelement polyprotein
[imported] - Arabidopsis thaliana, partial (7%)
Length = 705
Score = 77.0 bits (188), Expect = 4e-14
Identities = 43/145 (29%), Positives = 77/145 (52%)
Frame = -2
Query: 631 HIQNRCPIVAVENKTPEEAWSGEKPVVHYFRIFECVAHVHVPDQRRSKLDDKSKKCVFLG 690
++ NR P ++++ P E + KP + Y R+F C+ +V VP + R+KL+ +S+K +F+G
Sbjct: 698 YLINRIPTRVLKDQAPFEVLNQRKPSLTYMRVFGCLCYVLVPGELRNKLEARSRKAMFIG 519
Query: 691 VSDESKAWRLYVPVSKKIIVSKDVVFEEEESWDWGRIEEEIKLDILECGEEDQNEEENGR 750
S K ++ Y P +++++VS+DV F EE G EE+ + D+ + +
Sbjct: 518 YSTTQKGYKCYDPEARRVLVSRDVKFIEER----GYYEEKNQEDLRDLTSDKAGVLRVIL 351
Query: 751 TDLNNLSSNSSSSSNSLPESLPNEP 775
L + S+ + PE NEP
Sbjct: 350 EGLGIKMNQDQSTRSRQPEESSNEP 276
>BG644677 weakly similar to GP|7682800|gb| Hypothetical protein T15F17.l
{Arabidopsis thaliana}, partial (3%)
Length = 539
Score = 72.8 bits (177), Expect = 8e-13
Identities = 38/103 (36%), Positives = 61/103 (58%)
Frame = -3
Query: 1120 LTVTRPDLMYGVSLISRFMSSPTMSHWLAAKRILRYLKGTTDLGIFYKKGGSNMKLMAFP 1179
LT+ P++ + ++L+SR+ S+PTM H K I +YLKG D+G+FY K S L+ +
Sbjct: 525 LTLQGPNITFSINLLSRYSSAPTMRH*NGIKHICKYLKGIIDMGLFYSKDCS-PDLIGYV 349
Query: 1180 DSDYAGDLDDRRSTSRFVFMLGSGVVSWSSKKQYVVALSTTEA 1222
++ Y D RS + ++F G+ V+SW S K +A S+ A
Sbjct: 348 NA*YLSDPHKARS*TGYIFTCGNTVISWRSTK*STIATSSNHA 220
>BE941052 weakly similar to PIR|B85188|B85 retrotransposon like protein
[imported] - Arabidopsis thaliana, partial (4%)
Length = 480
Score = 70.1 bits (170), Expect = 5e-12
Identities = 30/75 (40%), Positives = 47/75 (62%)
Frame = +2
Query: 1254 IMCDNSFTIQLSKNPVFHGKSKHIDVRFHFLRDLVNDGVIKLSYCNSQDQIADIMTKPIK 1313
+ CD L+ NPV+H + KHI + HF+RDLV G +K+ + + DQ+AD +TKP+
Sbjct: 23 LRCDYLSATYLTHNPVYHSRMKHISIDIHFVRDLVQQGKLKVQHVCTVDQLADCLTKPLS 202
Query: 1314 LEQFEKLRGMLGVTE 1328
+ + LR +GVT+
Sbjct: 203 KSRHQLLRNKIGVTD 247
>TC81992 weakly similar to PIR|T47841|T47841 hypothetical protein T2O9.150 -
Arabidopsis thaliana, partial (3%)
Length = 952
Score = 58.2 bits (139), Expect(2) = 2e-11
Identities = 35/100 (35%), Positives = 59/100 (59%), Gaps = 5/100 (5%)
Frame = +3
Query: 70 KVKNYLFQAIG--REILETILD---KETSKEIWNSMKQKYHGSSKVKREQLQVMRREFDL 124
KV +L Q I ++++E L K + S+K+ Y +++ KR QLQ + R++ +
Sbjct: 600 KVVRFLRQKITSFKQLIELCLKQF*KTRHRRRSGSLKK*YQVTARGKRAQLQAL*RDW*I 779
Query: 125 LAMKEGEKVDSFLGRTLIVVNKMKSNGETMEHSTVVSKIL 164
L MK GE + + RT+ +VNK++ +GE M + TV+ KIL
Sbjct: 780 LQMKNGESIIDYFARTVTIVNKIRMHGEQMSNLTVIEKIL 899
Score = 42.0 bits (97), Expect = 0.002
Identities = 21/30 (70%), Positives = 23/30 (76%)
Frame = +2
Query: 67 KDLKVKNYLFQAIGREILETILDKETSKEI 96
K K KN+LFQAI R +LETIL ETSKEI
Sbjct: 608 KIFKAKNHLFQAIDRVVLETILKNETSKEI 697
Score = 30.0 bits (66), Expect(2) = 2e-11
Identities = 16/39 (41%), Positives = 22/39 (56%)
Frame = +1
Query: 30 LRSKEMWSLVDEGIPVLETGTTPASEEQMKAVEEAKLKD 68
L SKE W+L+ GIP G +E + K ++E KL D
Sbjct: 499 LASKEYWNLIYPGIPQQAVGVV-LTENEKKTLDELKL*D 612
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.320 0.136 0.403
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 39,852,354
Number of Sequences: 36976
Number of extensions: 559241
Number of successful extensions: 3727
Number of sequences better than 10.0: 99
Number of HSP's better than 10.0 without gapping: 3379
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 3588
length of query: 1332
length of database: 9,014,727
effective HSP length: 108
effective length of query: 1224
effective length of database: 5,021,319
effective search space: 6146094456
effective search space used: 6146094456
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 64 (29.3 bits)
Lotus: description of TM0346.12