
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0003.6
(1307 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC93066 weakly similar to GP|19920130|gb|AAM08562.1 Putative ret... 253 3e-67
AW689768 weakly similar to GP|10177485|d polyprotein {Arabidopsi... 165 1e-40
CB893805 similar to GP|10177935|d copia-type polyprotein {Arabid... 164 2e-40
BG644690 weakly similar to GP|18542179|gb putative pol protein {... 124 7e-36
BI262917 weakly similar to GP|19920130|g Putative retroelement {... 140 3e-33
CB893680 weakly similar to GP|1167523|db ORF(AA 1-1338) {Nicotia... 80 1e-32
TC89912 weakly similar to PIR|B84512|B84512 probable retroelemen... 129 1e-29
BF635063 weakly similar to PIR|F84486|F84 probable retroelement ... 127 4e-29
BG587156 similar to PIR|G85055|G8 probable polyprotein [imported... 121 2e-27
BG587170 similar to PIR|F86470|F8 probable retroelement polyprot... 110 3e-24
TC85125 weakly similar to SP|P10978|POLX_TOBAC Retrovirus-relate... 108 1e-23
AJ502495 weakly similar to GP|18071369|g putative gag-pol polypr... 102 1e-21
BI309716 weakly similar to PIR|G96722|G9 hypothetical protein F2... 91 4e-18
BG587174 similar to PIR|A47759|A4775 retrovirus-related reverse ... 80 4e-15
BG586293 weakly similar to PIR|E84473|E84 probable retroelement ... 73 6e-13
BE123913 weakly similar to GP|22093573|d polyprotein {Oryza sati... 72 1e-12
BG586273 weakly similar to PIR|F86470|F8 probable retroelement p... 69 9e-12
BF631997 weakly similar to GP|18542925|gb Putative pol polyprote... 58 2e-11
BE941052 weakly similar to PIR|B85188|B85 retrotransposon like p... 60 6e-09
BG586159 weakly similar to PIR|T47841|T4 hypothetical protein T2... 59 1e-08
>TC93066 weakly similar to GP|19920130|gb|AAM08562.1 Putative retroelement
{Oryza sativa} [Oryza sativa (japonica cultivar-group)],
partial (10%)
Length = 823
Score = 253 bits (647), Expect = 3e-67
Identities = 133/268 (49%), Positives = 179/268 (66%), Gaps = 8/268 (2%)
Frame = +1
Query: 452 CKYCVL-GKQCRVRFKTGHHKTKGILDYVHSDVRGPTKEPSV*GFRYFVTFTDDFSRKVW 510
CK+ + G + +V F T H+TKGILDY+HSD+ GP+K S G RY +T DDF RKVW
Sbjct: 22 CKHLLFFGNRKKVSFSTATHRTKGILDYIHSDLWGPSKVTSYGGRRYMMTIIDDFPRKVW 201
Query: 511 VYFMKYKSEVFAKFKLWKAEVENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSV 570
VYF++YK+E F FK W+ VE QTG+ +K L +DN E+ +F FC +GI RH ++
Sbjct: 202 VYFLRYKNETFPTFKKWRILVETQTGKNVKKLITDN*LEFCSSDFNEFCTNHGIARHKTI 381
Query: 571 RKTPQQNGVAERMNRTLTEKARCLRLNACL--SKCLWAATINMACYLVNRSPRASLDGKV 628
+ PQQNGVAERM RTL E+ARC+ NA L + LW + AC+LVNRSP ++LD KV
Sbjct: 382 PRNPQQNGVAERMIRTLLERARCMLSNAGL*N*RDLWVEAASTACHLVNRSPHSALDFKV 561
Query: 629 AEEVWTGNPIDLSNLRIFWRPAYVHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLW--DP 686
E++W+GN +D SNLRIF PAY ++ KL P++ CI + Y KGY+LW DP
Sbjct: 562 PEDIWSGNLVDYSNLRIFGCPAYALVND---GKLAPRAGECIFLSYASESKGYRLWCSDP 732
Query: 687 VKKKVIVSRDVVFDE*SML---KQSDVT 711
+K+I+SRDV F+E ++L KQS V+
Sbjct: 733 KSQKLILSRDVTFNEDALLSSGKQSFVS 816
>AW689768 weakly similar to GP|10177485|d polyprotein {Arabidopsis thaliana},
partial (9%)
Length = 675
Score = 165 bits (417), Expect = 1e-40
Identities = 85/196 (43%), Positives = 125/196 (63%), Gaps = 2/196 (1%)
Frame = +1
Query: 806 KWMSAMVEEMESLKKNET*NLVQLPHGKRVIGCKWVYKKKLAVT*KEREKFKAHLVTKGY 865
+W+ AM E ++L N+T +LV LP K+ IGCKWVY+ K KFKA LV KG+
Sbjct: 94 RWLQAMKTEYKALIDNKTWDLVPLPPHKKAIGCKWVYRVKENPD-GSVNKFKARLVAKGF 270
Query: 866 SQHKGIDYDEIFSPVVRHTSIRVVLALVASMDMHLEQMDVKTTFLHGNLEEQIYIEQPEG 925
SQ G DY E FSPV++ +IR++L + + ++Q+D+ FL+G L+E++Y+ QP+G
Sbjct: 271 SQTLGCDYTETFSPVIKPVTIRLILTIAITYKWEIQQIDINNAFLNGFLQEEVYMSQPQG 450
Query: 926 FSETGDGRLVCKLKRSLYGLKQSPRQWYKRFDSYMLRIGY--RRCDYDCCVYVMSLDDGS 983
F E + LVCKL +SLYGLKQ+PR WY+ S ++ G+ RCD +Y +G+
Sbjct: 451 F-EAANKSLVCKLNKSLYGLKQAPRAWYEXLTSAQIQFGFTKSRCDPSLLIY---NQNGA 618
Query: 984 FIFLLLYVDDMLIAAN 999
I+L +YVDD+LI +
Sbjct: 619 CIYLXIYVDDILITGS 666
>CB893805 similar to GP|10177935|d copia-type polyprotein {Arabidopsis
thaliana}, partial (14%)
Length = 778
Score = 164 bits (416), Expect = 2e-40
Identities = 92/251 (36%), Positives = 145/251 (57%), Gaps = 4/251 (1%)
Frame = +3
Query: 785 LLTSSGDPSTYHEAMAS*EKDKWMSAMVEEMESLKKNET*NLVQLPHGKRVIGCKWVYKK 844
+LT + DP+T+ EA+ S +KW ++M EME+ ++N T L L G + IG KW++K
Sbjct: 21 MLTMTSDPTTFEEAVKS---EKWRASMNNEMEATERNNTWELTDLRSGAKTIGLKWIFKT 191
Query: 845 KLAVT*KEREKFKAHLVTKGYSQHKGIDYDEIFSPVVRHTSIRVVLALVASM---DMHLE 901
KL E EK+KA LV KGYSQ G+DY E+F+PV R +IR+V+AL A + + +
Sbjct: 192 KLNEN-GEIEKYKARLVAKGYSQQYGVDYTEVFAPVARWDTIRMVIALAAQIKRDGVCIS 368
Query: 902 QMDVKTTFLHGNLEEQIYIEQPEGFSETGDGRLVC-KLKRSLYGLKQSPRQWYKRFDSYM 960
M + + + + + I R++ ++KR+LYGLKQ+PR WY R ++Y
Sbjct: 369 *M*KAHSCMEN*MRKFLLINH-----RVM*RRVIS*RVKRALYGLKQAPRAWYSRIEAYF 533
Query: 961 LRIGYRRCDYDCCVYVMSLDDGSFIFLLLYVDDMLIAANHLHDVNELKTKLGKEFDMKDL 1020
+ G+ +C Y+ ++V + G + + LYVDD++ N + E K + KEF+M DL
Sbjct: 534 TKEGFEKCPYEHTLFVKLSEGGKILIISLYVDDLIFIGNDENMFEEFKKSMKKEFNMSDL 713
Query: 1021 GAAKKILGMEI 1031
G LG+E+
Sbjct: 714 GKMHYFLGVEV 746
>BG644690 weakly similar to GP|18542179|gb putative pol protein {Zea mays},
partial (22%)
Length = 629
Score = 124 bits (312), Expect(2) = 7e-36
Identities = 57/138 (41%), Positives = 95/138 (68%)
Frame = -2
Query: 857 KAHLVTKGYSQHKGIDYDEIFSPVVRHTSIRVVLALVASMDMHLEQMDVKTTFLHGNLEE 916
K+ LV +GY+Q +GIDYDE FSPV R +IR+++A A M L QMDVK+ F++G+L+E
Sbjct: 412 KSKLVVQGYNQKEGIDYDEAFSPVARMEAIRILIAFAAFMGFKLYQMDVKSAFINGDLKE 233
Query: 917 QIYIEQPEGFSETGDGRLVCKLKRSLYGLKQSPRQWYKRFDSYMLRIGYRRCDYDCCVYV 976
+++++QP GF + V +L ++LYGLKQ+PR WY+R ++L+ G++R D +++
Sbjct: 232 EVFVKQPPGFEDAEVPNHVFRLNKTLYGLKQAPRAWYERLSKFLLKNGFKRGKIDNTLFL 53
Query: 977 MSLDDGSFIFLLLYVDDM 994
+ + + + +YVDD+
Sbjct: 52 LK-RE*ELLIIQVYVDDI 2
Score = 45.8 bits (107), Expect(2) = 7e-36
Identities = 22/68 (32%), Positives = 40/68 (58%)
Frame = -3
Query: 779 DLAAYALLTSSGDPSTYHEAMAS*EKDKWMSAMVEEMESLKKNET*NLVQLPHGKRVIGC 838
+L +++ SS +P EA+ + W+++M EE+ ++++ LV P GK VIG
Sbjct: 624 NLVSFSAFISSIEPKNVKEALRDAD---WINSMQEELHQFERSKVWYLVPRPEGKTVIGT 454
Query: 839 KWVYKKKL 846
+WV++ KL
Sbjct: 453 RWVFRNKL 430
>BI262917 weakly similar to GP|19920130|g Putative retroelement {Oryza
sativa} [Oryza sativa (japonica cultivar-group)],
partial (8%)
Length = 426
Score = 140 bits (353), Expect = 3e-33
Identities = 65/133 (48%), Positives = 87/133 (64%)
Frame = +1
Query: 554 NFMHFCEENGIQRHFSVRKTPQQNGVAERMNRTLTEKARCLRLNACLSKCLWAATINMAC 613
+F HF + + + V TPQQNGVAERMNRTL E+ R + A ++K WA + AC
Sbjct: 28 SF*HFVNKRVL*GNSXVAHTPQQNGVAERMNRTLLERTRAMLKTAGMAKSFWAEAVKTAC 207
Query: 614 YLVNRSPRASLDGKVAEEVWTGNPIDLSNLRIFWRPAYVHISSEDRSKLDPKSK*CIIIG 673
Y++NRSP +D K E+W G P+D S+L +F P YV +S++R+KLDPKS+ CI +G
Sbjct: 208 YVINRSPSTVIDLKTPMEMWKGKPVDYSSLHVFGCPVYVMYNSQERTKLDPKSRKCIFLG 387
Query: 674 YNKGVKGYKLWDP 686
Y VKGY LWDP
Sbjct: 388 YADNVKGYXLWDP 426
>CB893680 weakly similar to GP|1167523|db ORF(AA 1-1338) {Nicotiana tabacum},
partial (7%)
Length = 780
Score = 80.1 bits (196), Expect(2) = 1e-32
Identities = 42/89 (47%), Positives = 58/89 (64%)
Frame = -3
Query: 994 MLIAANHLHDVNELKTKLGKEFDMKDLGAAKKILGMEIHKDRGAKKLWLSQKSYVEGVLS 1053
+L+ +++ ++ LKT+ KE DMKDLG AKKI+GM+I D+ L LSQ Y+ VL
Sbjct: 271 LLVVGSNIDEIKNLKTRFSKEIDMKDLGPAKKIIGMQIMIDKQKGVL*LSQVEYITRVLQ 92
Query: 1054 RFDMSKANHVSTPLTNHFKLSLEQSPKID 1082
F+M A VST L +HF LS EQSP+ +
Sbjct: 91 IFNMGNAILVSTTLASHFCLSHEQSPQTE 5
Score = 79.7 bits (195), Expect(2) = 1e-32
Identities = 43/90 (47%), Positives = 61/90 (67%)
Frame = -2
Query: 877 FSPVVRHTSIRVVLALVASMDMHLEQMDVKTTFLHGNLEEQIYIEQPEGFSETGDGRLVC 936
F P+V+ +I +L++VA +++LE +DVKT FL G+L E IY+ QPEGFS G++V
Sbjct: 554 FVPIVKLNTIMFLLSIVAIENLYLE*LDVKTAFLRGDLVEDIYMHQPEGFS*E-VGKMVG 378
Query: 937 KLKRSLYGLKQSPRQWYKRFDSYMLRIGYR 966
KLK+S+YGLKQ PRQ + R G+R
Sbjct: 377 KLKKSMYGLKQGPRQCI*SLKALCTRKGFR 288
>TC89912 weakly similar to PIR|B84512|B84512 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana, partial
(10%)
Length = 814
Score = 129 bits (323), Expect = 1e-29
Identities = 57/108 (52%), Positives = 82/108 (75%)
Frame = +1
Query: 1184 GGPICWKSSVQSTVAMSTTEAEYMAVAEAAKEALWLTGLVKELGVEQGGVQLHCDSQSAI 1243
G I WK++ QS V +STT+AEY+A E K+A+WL G++ ELG+ Q V++HCDSQSAI
Sbjct: 169 GTTISWKANQQSVVTLSTTQAEYIAFVEGVKDAIWLKGMIGELGITQEYVKIHCDSQSAI 348
Query: 1244 YLTNNQVYHARTKHIDVRFHKIRELLASRQILLQKIHTSENTTDKLTK 1291
+L N+QVYH RTKHID+R H IR+++ S++I+++K+ + EN D TK
Sbjct: 349 HLANHQVYHERTKHIDIRLHFIRDMIESKEIVVEKMASEENPADVFTK 492
>BF635063 weakly similar to PIR|F84486|F84 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana, partial
(4%)
Length = 677
Score = 127 bits (318), Expect = 4e-29
Identities = 65/178 (36%), Positives = 108/178 (60%), Gaps = 3/178 (1%)
Frame = -2
Query: 3 GAKFEVTRFDGTGNFGLWQRMAKDLLAQKSLQKALRDEK--PADIATVDWNEMKEKAAGL 60
G+K ++ +F G +FGLW+ + +L Q+ +KAL+ E P ++ + EM +KA
Sbjct: 565 GSKRDIEKFTGDNDFGLWKVKMEAVLIQQKCEKALKGEVSLPVTMSRAEKTEMVDKARSA 386
Query: 61 ITLCVSDDVMNHILDLTTLKDVWDKLESLYMSKTPMNKLFAKQRLYSLKMQEGGDLQAHV 120
I LC+ D V+ + T +W KL SLYM+K+ ++ F KQ+LYS +M E + +
Sbjct: 385 IVLCLGDKVLREVAKERTAASMWAKL*SLYMTKSLAHRQFLKQQLYSFRMVESKAIMEQL 206
Query: 121 YAFNNILADMTRLGVTVDDEDKAIILLCSLPGSYDHLVTTLTYGKD-SITLDSISSTL 177
FN IL D+ + V ++DE+KAI+LLC+LP S++ T+ YGK+ ++TL+ + + L
Sbjct: 205 TEFNKILDDLENIEVQLEDEEKAILLLCALPKSFESFKDTMLYGKEGTVTLEEVQAAL 32
>BG587156 similar to PIR|G85055|G8 probable polyprotein [imported] -
Arabidopsis thaliana, partial (17%)
Length = 618
Score = 121 bits (304), Expect = 2e-27
Identities = 64/187 (34%), Positives = 106/187 (56%)
Frame = -1
Query: 792 PSTYHEAMAS*EKDKWMSAMVEEMESLKKNET*NLVQLPHGKRVIGCKWVYKKKLAVT*K 851
P +Y EAM E +W ++ E ++ KN+T +LP GK+ + +W++ K
Sbjct: 558 PRSYEEAM---EDKEWKESVGAEAGAMIKNDTWYESELPKGKKAVSSRWIFTIKYKAD-G 391
Query: 852 EREKFKAHLVTKGYSQHKGIDYDEIFSPVVRHTSIRVVLALVASMDMHLEQMDVKTTFLH 911
E+ K LV +G++ G DY E F+PV + +IR+VL+L ++ L QMDVK FL
Sbjct: 390 SIERKKTRLVARGFTLTYGEDYIETFAPVAKLHTIRIVLSLAVNLGWGLWQMDVKNAFLQ 211
Query: 912 GNLEEQIYIEQPEGFSETGDGRLVCKLKRSLYGLKQSPRQWYKRFDSYMLRIGYRRCDYD 971
G LE+++Y+ P G V +LK+++YGLKQSPR WY + + + G+R+ + D
Sbjct: 210 GELEDEVYMYPPPGLEHLVKRGNVLRLKKAIYGLKQSPRAWYNKLSTTLNGRGFRKSELD 31
Query: 972 CCVYVMS 978
++ ++
Sbjct: 30 HTLFTLT 10
>BG587170 similar to PIR|F86470|F8 probable retroelement polyprotein
[imported] - Arabidopsis thaliana, partial (13%)
Length = 718
Score = 110 bits (276), Expect = 3e-24
Identities = 75/202 (37%), Positives = 100/202 (49%)
Frame = -3
Query: 419 LWHMRLGHLSERGMMELHKRNMLKGVRSCIIGLCKYCVLGKQCRVRFKTGHHKTKGILDY 478
LWH RLGH R + ML GV C+ C+LGK C+ F + D
Sbjct: 584 LWHARLGHPHGRAL-----NLMLPGVVFENKN-CEACILGKHCKNVFPRTSTVYENCFDL 423
Query: 479 VHSDVRGPTKEPSV*GFRYFVTFTDDFSRKVWVYFMKYKSEVFAKFKLWKAEVENQTGRK 538
+++D+ S +YFVTF D+ S+ W+ + K V FK ++A V N K
Sbjct: 422 IYTDL-WTAPSLSRDNHKYFVTFIDEKSKYTWLTLIPSKDRVIDAFKNFQAYVTNHYHAK 246
Query: 539 IKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVRKTPQQNGVAERMNRTLTEKARCLRLNA 598
IK LRSDNG EYT F + +GI S TPQQNGVA+R N+ L E AR L
Sbjct: 245 IKILRSDNGGEYTSYAFKSHLDHHGILHQTSCPYTPQQNGVAKRKNKHLMEVARSL---- 78
Query: 599 CLSKCLWAATINMACYLVNRSP 620
++ A ++ ACYL+N P
Sbjct: 77 -----MFQANVSTACYLINWIP 27
>TC85125 weakly similar to SP|P10978|POLX_TOBAC Retrovirus-related Pol
polyprotein from transposon TNT 1-94 [Contains: Protease
(EC 3.4.23.-);, partial (7%)
Length = 705
Score = 108 bits (271), Expect = 1e-23
Identities = 59/163 (36%), Positives = 101/163 (61%), Gaps = 12/163 (7%)
Frame = +3
Query: 1150 *QGARCCS------ISC-GIC--GLCR*SR**KVYNRICLYSCGGPICWKSSVQSTVAMS 1200
*+ RCC ++C G+C CR*S * K+Y +C+++C K VQ T S
Sbjct: 24 *RNLRCCGMFRRIRVNCQGLC*FRFCR*S**KKIYYWLCVHACRRSS--KLVVQVTNGCS 197
Query: 1201 TTEAE---YMAVAEAAKEALWLTGLVKELGVEQGGVQLHCDSQSAIYLTNNQVYHARTKH 1257
+ + ++ +A KEA+W+ L++ELG +Q + ++CDSQSA+++ N +H+RTKH
Sbjct: 198 SVNDRSGVHGSLPQACKEAIWMQRLMEELGHKQEQITVYCDSQSALHIARNPAFHSRTKH 377
Query: 1258 IDVRFHKIRELLASRQILLQKIHTSENTTDKLTKPVTSDKFMF 1300
I +++H +RE++ + +QKIHT++N D +TK + +DKF++
Sbjct: 378 IGIQYHFVREVVEEGSVDMQKIHTNDNLADAMTKSINTDKFIW 506
>AJ502495 weakly similar to GP|18071369|g putative gag-pol polyprotein {Oryza
sativa}, partial (9%)
Length = 542
Score = 102 bits (254), Expect = 1e-21
Identities = 48/115 (41%), Positives = 74/115 (63%), Gaps = 1/115 (0%)
Frame = +2
Query: 1185 GPICWKSSVQSTVAMSTTEAEYMAVAEAAKEALWLTGLVKELGVEQGG-VQLHCDSQSAI 1243
G I W S Q VA ST EAEY+A A + +WL +++ + EQ +++CD++SAI
Sbjct: 68 GAISWSSKKQPVVAFSTAEAEYIASTSCATQTVWLRRILEVMHHEQNTPTKIYCDNKSAI 247
Query: 1244 YLTNNQVYHARTKHIDVRFHKIRELLASRQILLQKIHTSENTTDKLTKPVTSDKF 1298
L+ N V+H R+KHID++FHKIREL+A ++++++ T E D TKP+ + F
Sbjct: 248 ALSKNPVFHGRSKHIDIQFHKIRELIAEKEVVIEYCPTEEKIADIFTKPLKIESF 412
>BI309716 weakly similar to PIR|G96722|G9 hypothetical protein F20P5.25
[imported] - Arabidopsis thaliana, partial (10%)
Length = 744
Score = 90.5 bits (223), Expect = 4e-18
Identities = 59/191 (30%), Positives = 94/191 (48%), Gaps = 1/191 (0%)
Frame = +2
Query: 935 VCKLKRSLYGLKQSPRQWYKRFDSYMLRIGYRRCDYDCCVYVMSLDDGSFIFLLLYVDDM 994
VC+L++S+YGLKQ+ RQWY + ++ GY + D ++ D SF LL+YVDD+
Sbjct: 20 VCELQKSIYGLKQASRQWYSKLSESLISFGYLQSSSDFSLFT-KFKDSSFTTLLVYVDDI 196
Query: 995 LIAANHLHDVNELKTKLGKEFDMKDLGAAKKILGMEIHKDRGAKKLWLSQKSYVEGVLSR 1054
++A N + ++ +K L F +KDLG+ + LG+E+ R + + L+Q+ Y +L
Sbjct: 197 VLAGNDISEIQHVKCFLIDRFKIKDLGSLRYFLGLEV--ARSKQGILLNQRKYTLELLED 370
Query: 1055 FDMSKANHVSTPLTNHFKLSLEQSPKIDSEIEGMSKIPYAVQLVV*CM-LWFALDQIWHK 1113
TP KL SP + E + I + L + FA+ Q+
Sbjct: 371 SGNLAVKSTLTPYDISLKLHNSDSPLYNDETQYRRLIGKLIYLTTTRPDISFAVQQLSQF 550
Query: 1114 QLVKCQVHVQA 1124
QVH QA
Sbjct: 551 VSKPQQVHYQA 583
>BG587174 similar to PIR|A47759|A4775 retrovirus-related reverse
transcriptase homolog - rape retrotransposon copia-like
(fragment), partial (84%)
Length = 249
Score = 80.5 bits (197), Expect = 4e-15
Identities = 45/89 (50%), Positives = 56/89 (62%), Gaps = 7/89 (7%)
Frame = -1
Query: 914 LEEQIYIEQPEGFSETGDGRLVCKLKRSLYGLKQSPRQWYKRFDSYMLRIGYRRCDYDCC 973
LEE+IY+ QPEGF G VCKL++SLYGLKQSPRQWYKRFDSY R +
Sbjct: 246 LEEKIYMTQPEGFLFPGKEDHVCKLRKSLYGLKQSPRQWYKRFDSY-------RSSWATT 88
Query: 974 VYVMSLDD-------GSFIFLLLYVDDML 995
+M++ +I+L+LYVDDML
Sbjct: 87 GVLMTVVST*TR*RMSRYIYLVLYVDDML 1
>BG586293 weakly similar to PIR|E84473|E84 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana, partial
(7%)
Length = 763
Score = 73.2 bits (178), Expect = 6e-13
Identities = 40/94 (42%), Positives = 60/94 (63%)
Frame = +2
Query: 820 KNET*NLVQLPHGKRVIGCKWVYKKKLAVT*KEREKFKAHLVTKGYSQHKGIDYDEIFSP 879
+ +T LV+ P G + IG +W+YK K K+KA LV KGY + +GID+DE+F+P
Sbjct: 47 QKQTLKLVKKPTGVKPIGLRWIYKIKRNED-GTLIKYKARLVAKGYVKQQGIDFDEVFAP 223
Query: 880 VVRHTSIRVVLALVASMDMHLEQMDVKTTFLHGN 913
VVR +I ++LAL A+ + +DVK FL+G+
Sbjct: 224 VVRIETI*LLLALAATNGC*IHHIDVKIAFLNGH 325
>BE123913 weakly similar to GP|22093573|d polyprotein {Oryza sativa (japonica
cultivar-group)}, partial (8%)
Length = 503
Score = 72.0 bits (175), Expect = 1e-12
Identities = 40/120 (33%), Positives = 61/120 (50%)
Frame = +1
Query: 947 QSPRQWYKRFDSYMLRIGYRRCDYDCCVYVMSLDDGSFIFLLLYVDDMLIAANHLHDVNE 1006
QSPR W+ RF + + GY +C D +++ L++YVDD+ + +H +
Sbjct: 1 QSPRDWFDRFT*VVKKFGYIQCQTDHAMFIKHSSTVKKAILIVYVDDIFLTGDHGK*IKR 180
Query: 1007 LKTKLGKEFDMKDLGAAKKILGMEIHKDRGAKKLWLSQKSYVEGVLSRFDMSKANHVSTP 1066
LK L +EF++KDLG K LGME+ R K +SQ+ YV +L M + P
Sbjct: 181 LKNLLAEEFEIKDLGNLKYFLGMEV--ARWKKGSSISQRKYVLDLLKETRMIGCKTIRDP 354
>BG586273 weakly similar to PIR|F86470|F8 probable retroelement polyprotein
[imported] - Arabidopsis thaliana, partial (7%)
Length = 705
Score = 69.3 bits (168), Expect = 9e-12
Identities = 37/90 (41%), Positives = 53/90 (58%)
Frame = -2
Query: 612 ACYLVNRSPRASLDGKVAEEVWTGNPIDLSNLRIFWRPAYVHISSEDRSKLDPKSK*CII 671
ACYL+NR P L + EV L+ +R+F YV + E R+KL+ +S+ +
Sbjct: 704 ACYLINRIPTRVLKDQAPFEVLNQRKPSLTYMRVFGCLCYVLVPGELRNKLEARSRKAMF 525
Query: 672 IGYNKGVKGYKLWDPVKKKVIVSRDVVFDE 701
IGY+ KGYK +DP ++V+VSRDV F E
Sbjct: 524 IGYSTTQKGYKCYDPEARRVLVSRDVKFIE 435
>BF631997 weakly similar to GP|18542925|gb Putative pol polyprotein {Oryza
sativa}, partial (6%)
Length = 650
Score = 57.8 bits (138), Expect(2) = 2e-11
Identities = 30/75 (40%), Positives = 42/75 (56%)
Frame = -2
Query: 512 YFMKYKSEVFAKFKLWKAEVENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVR 571
+ MK+KSE F + + + GR+ + +NG E F C+E I R ++VR
Sbjct: 643 FMMKHKSEAFYFSRNGRFWSRVKQGRR*NVFKLNNGLEICSAEFNELCKEEHITRQYTVR 464
Query: 572 KTPQQNGVAERMNRT 586
TPQ+NGVAERMN T
Sbjct: 463 NTPQKNGVAERMNIT 419
Score = 45.4 bits (106), Expect = 1e-04
Identities = 46/150 (30%), Positives = 66/150 (43%), Gaps = 6/150 (4%)
Frame = -1
Query: 524 FKLWKAEVENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGI---QRHFSV---RKTPQQN 577
FK WK VE+QTG+K+K L+ TE N FC + H+ K +
Sbjct: 608 FKEWKILVESQTGKKVKRLQ----TE*RLGNL--FCRVQ*TLQGRTHYKAIYGEKYSTKE 447
Query: 578 GVAERMNRTLTEKARCLRLNACLSKCLWAATINMACYLVNRSPRASLDGKVAEEVWTGNP 637
E+ARC+ NA L++ A I+ CYLVN P ++
Sbjct: 446 WCC*ANEYNFLERARCMFSNAGLNRSFQAEAISTKCYLVNVLPLLL*TVRLHLRY---GL 276
Query: 638 IDLSNLRIFWRPAYVHISSEDRSKLDPKSK 667
I+L +I PAY H+ + KL+P+SK
Sbjct: 275 INLLITQILGCPAYYHV---NEGKLEPRSK 195
Score = 30.4 bits (67), Expect(2) = 2e-11
Identities = 24/80 (30%), Positives = 32/80 (40%), Gaps = 12/80 (15%)
Frame = -3
Query: 619 SPRASLDGKVAEEVWTGNPIDLSNLRIFWRPAYVHISSEDRSKLDPKSK*C--------- 669
SP ++D K EVW+ P + SN + S L P C
Sbjct: 324 SPSTAIDCKTPFEVWSNKPANYSNSWL--------------SCLLP----C**G*T*T*I 199
Query: 670 ---IIIGYNKGVKGYKLWDP 686
II+G GVKGY++W P
Sbjct: 198 *ERIILG*GNGVKGYRIWSP 139
>BE941052 weakly similar to PIR|B85188|B85 retrotransposon like protein
[imported] - Arabidopsis thaliana, partial (4%)
Length = 480
Score = 60.1 bits (144), Expect = 6e-09
Identities = 28/70 (40%), Positives = 44/70 (62%)
Frame = +2
Query: 1228 VEQGGVQLHCDSQSAIYLTNNQVYHARTKHIDVRFHKIRELLASRQILLQKIHTSENTTD 1287
+ Q G+ L CD SA YLT+N VYH+R KHI + H +R+L+ ++ +Q + T + D
Sbjct: 5 ISQSGL-LRCDYLSATYLTHNPVYHSRMKHISIDIHFVRDLVQQGKLKVQHVCTVDQLAD 181
Query: 1288 KLTKPVTSDK 1297
LTKP++ +
Sbjct: 182 CLTKPLSKSR 211
>BG586159 weakly similar to PIR|T47841|T4 hypothetical protein T2O9.150 -
Arabidopsis thaliana, partial (11%)
Length = 732
Score = 58.9 bits (141), Expect = 1e-08
Identities = 57/226 (25%), Positives = 104/226 (45%), Gaps = 13/226 (5%)
Frame = +1
Query: 1040 LWLSQKSYVEGVLSRFDMSKANHVSTPLTNHFKLSLEQSP-KIDSEIEGMSKIPYAVQLV 1098
+++ Q+ YV +L RF M K+N P+ KL +++ K+D+ + +
Sbjct: 28 IYICQRKYVTDLLERFGMEKSNLSRNPIAPRCKLIKDENGVKVDAT---------KYKQI 180
Query: 1099 V*CMLWFAL---DQIWHKQLVK------CQVHVQAREAALG--SSQVDPKILEGYNGSRY 1147
V C+++ A D ++ L+ ++H+ A + L + ++ I+ NGS
Sbjct: 181 VGCLMYLAATRPDLMYVLSLISRFMNCPTELHMHAVKRVLRYLNGTINLGIMYKRNGSEK 360
Query: 1148 HV*QGARCCSISCGICGLCR*SR**KVYNRICLYSCGGPICWKSSVQSTVAMSTTEAEYM 1207
A S G R K + G + W S Q V +STT+AE++
Sbjct: 361 ---LEAYTDSDYAGDLD----DR--KSTSGYVFMLSSGAVSWSSKKQPVVTLSTTKAEFI 513
Query: 1208 AVAEAAKEALWLTGLVKELGVEQ-GGVQLHCDSQSAIYLTNNQVYH 1252
A A A +++W+ ++++LG Q G + ++CD+ S I L+ N V H
Sbjct: 514 AAAFCACQSVWMRRVLEKLGYTQSGSITMYCDNNSTIKLSKNPVLH 651
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.323 0.138 0.419
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 40,871,356
Number of Sequences: 36976
Number of extensions: 598609
Number of successful extensions: 3143
Number of sequences better than 10.0: 89
Number of HSP's better than 10.0 without gapping: 2995
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 3106
length of query: 1307
length of database: 9,014,727
effective HSP length: 108
effective length of query: 1199
effective length of database: 5,021,319
effective search space: 6020561481
effective search space used: 6020561481
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 64 (29.3 bits)
Lotus: description of TM0003.6