
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0171.6
(799 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AW689768 weakly similar to GP|10177485|d polyprotein {Arabidopsi... 178 6e-45
BI309716 weakly similar to PIR|G96722|G9 hypothetical protein F2... 155 4e-38
BG587156 similar to PIR|G85055|G8 probable polyprotein [imported... 152 4e-37
CB893805 similar to GP|10177935|d copia-type polyprotein {Arabid... 151 8e-37
BG644690 weakly similar to GP|18542179|gb putative pol protein {... 112 2e-30
BE123913 weakly similar to GP|22093573|d polyprotein {Oryza sati... 97 2e-20
BG452576 weakly similar to GP|12005223|gb|A reverse transcriptas... 71 2e-19
BG647824 weakly similar to PIR|G96722|G96 hypothetical protein F... 73 5e-19
AJ502495 weakly similar to GP|18071369|g putative gag-pol polypr... 82 1e-15
TC92908 similar to GP|10140704|gb|AAG13538.1 putative gag-pol po... 81 1e-15
BG586293 weakly similar to PIR|E84473|E84 probable retroelement ... 80 3e-15
CB893680 weakly similar to GP|1167523|db ORF(AA 1-1338) {Nicotia... 69 2e-14
AJ497569 weakly similar to PIR|T04833|T04 hypothetical protein F... 45 5e-12
BG586159 weakly similar to PIR|T47841|T4 hypothetical protein T2... 69 5e-12
TC89912 weakly similar to PIR|B84512|B84512 probable retroelemen... 68 2e-11
BG587174 similar to PIR|A47759|A4775 retrovirus-related reverse ... 49 1e-05
AW685211 weakly similar to GP|4539662|gb| polyprotein {Sorghum b... 49 1e-05
BE942480 49 1e-05
TC85125 weakly similar to SP|P10978|POLX_TOBAC Retrovirus-relate... 49 1e-05
TC83624 homologue to PIR|G84581|G84581 copia-like retroelement p... 45 1e-04
>AW689768 weakly similar to GP|10177485|d polyprotein {Arabidopsis thaliana},
partial (9%)
Length = 675
Score = 178 bits (452), Expect = 6e-45
Identities = 87/190 (45%), Positives = 117/190 (60%)
Frame = +1
Query: 353 WVDAMNLEISALEANGTWSLVPLPPNVVPIDNKWVYKIKRRANGTVERYKARLVA*GYNQ 412
W+ AM E AL N TW LVPLPP+ I KWVY++K +G+V ++KARLVA G++Q
Sbjct: 97 WLQAMKTEYKALIDNKTWDLVPLPPHKKAIGCKWVYRVKENPDGSVNKFKARLVAKGFSQ 276
Query: 413 IEGIYYFDTFSPTAKLTIVRMVLALASVNNWHLHQLDVNNAFLLGDLYEDVYMKVPEGVS 472
G Y +TFSP K +R++L +A W + Q+D+NNAFL G L E+VYM P+G
Sbjct: 277 TLGCDYTETFSPVIKPVTIRLILTIAITYKWEIQQIDINNAFLNGFLQEEVYMSQPQGFE 456
Query: 473 CVDSGKVCKLHKSSYGLKQASRQWYAKFTSLLVTCGYKQAHSDHSLFSKTQGQSFTILLI 532
+ VCKL+KS YGLKQA R WY TS + G+ ++ D SL Q + L I
Sbjct: 457 AANKSLVCKLNKSLYGLKQAPRAWYEXLTSAQIQFGFTKSRCDPSLLIYNQNGACIYLXI 636
Query: 533 YVDDIILAGN 542
YVDDI++ G+
Sbjct: 637 YVDDILITGS 666
>BI309716 weakly similar to PIR|G96722|G9 hypothetical protein F20P5.25
[imported] - Arabidopsis thaliana, partial (10%)
Length = 744
Score = 155 bits (393), Expect = 4e-38
Identities = 80/137 (58%), Positives = 99/137 (71%)
Frame = +2
Query: 478 KVCKLHKSSYGLKQASRQWYAKFTSLLVTCGYKQAHSDHSLFSKTQGQSFTILLIYVDDI 537
KVC+L KS YGLKQASRQWY+K + L++ GY Q+ SD SLF+K + SFT LL+YVDDI
Sbjct: 17 KVCELQKSIYGLKQASRQWYSKLSESLISFGYLQSSSDFSLFTKFKDSSFTTLLVYVDDI 196
Query: 538 ILAGNFLDEFTRIKAALDNAFKIKDLGVLKYFLGLEVSHSAKGISLCQRKYCLDLVHDSG 597
+LAGN + E +K L + FKIKDLG L+YFLGLEV+ S +GI L QRKY L+L+ DSG
Sbjct: 197 VLAGNDISEIQHVKCFLIDRFKIKDLGSLRYFLGLEVARSKQGILLNQRKYTLELLEDSG 376
Query: 598 VLGSKPVSTPLDPSSRL 614
L K TP D S +L
Sbjct: 377 NLAVKSTLTPYDISLKL 427
>BG587156 similar to PIR|G85055|G8 probable polyprotein [imported] -
Arabidopsis thaliana, partial (17%)
Length = 618
Score = 152 bits (385), Expect = 4e-37
Identities = 80/203 (39%), Positives = 123/203 (60%), Gaps = 1/203 (0%)
Frame = -1
Query: 321 YSNLSIPHHAYAMSLSLDSEPHTYAEASKHKCWVDAMNLEISALEANGTWSLVPLPPNVV 380
+S H A+ ++L+ + P +Y EA + K W +++ E A+ N TW LP
Sbjct: 618 FSQYPEAHCAFMVNLNENHIPRSYEEAMEDKEWKESVGAEAGAMIKNDTWYESELPKGKK 439
Query: 381 PIDNKWVYKIKRRANGTVERYKARLVA*GYNQIEGIYYFDTFSPTAKLTIVRMVLALASV 440
+ ++W++ IK +A+G++ER K RLVA G+ G Y +TF+P AKL +R+VL+LA
Sbjct: 438 AVSSRWIFTIKYKADGSIERKKTRLVARGFTLTYGEDYIETFAPVAKLHTIRIVLSLAVN 259
Query: 441 NNWHLHQLDVNNAFLLGDLYEDVYMKVPEGVS-CVDSGKVCKLHKSSYGLKQASRQWYAK 499
W L Q+DV NAFL G+L ++VYM P G+ V G V +L K+ YGLKQ+ R WY K
Sbjct: 258 LGWGLWQMDVKNAFLQGELEDEVYMYPPPGLEHLVKRGNVLRLKKAIYGLKQSPRAWYNK 79
Query: 500 FTSLLVTCGYKQAHSDHSLFSKT 522
++ L G++++ DH+LF+ T
Sbjct: 78 LSTTLNGRGFRKSELDHTLFTLT 10
>CB893805 similar to GP|10177935|d copia-type polyprotein {Arabidopsis
thaliana}, partial (14%)
Length = 778
Score = 151 bits (382), Expect = 8e-37
Identities = 89/258 (34%), Positives = 141/258 (54%), Gaps = 8/258 (3%)
Frame = +3
Query: 335 LSLDSEPHTYAEASKHKCWVDAMNLEISALEANGTWSLVPLPPNVVPIDNKWVYKIKRRA 394
L++ S+P T+ EA K + W +MN E+ A E N TW L L I KW++K K
Sbjct: 24 LTMTSDPTTFEEAVKSEKWRASMNNEMEATERNNTWELTDLRSGAKTIGLKWIFKTKLNE 203
Query: 395 NGTVERYKARLVA*GYNQIEGIYYFDTFSPTAKLTIVRMVLALASVNNWHLHQLDVNNAF 454
NG +E+YKARLVA GY+Q G+ Y + F+P A+ +RMV+ALA+ Q+ +
Sbjct: 204 NGEIEKYKARLVAKGYSQQYGVDYTEVFAPVARWDTIRMVIALAA-------QIKRDGVC 362
Query: 455 LLGDLYEDVYMKVPEGVSCVDSGKVC-------KLHKSSYGLKQASRQWYAKFTSLLVTC 507
+ M+ + + +V ++ ++ YGLKQA R WY++ +
Sbjct: 363 IS*M*KAHSCMEN*MRKFLLINHRVM*RRVIS*RVKRALYGLKQAPRAWYSRIEAYFTKE 542
Query: 508 GYKQAHSDHSLFSK-TQGQSFTILLIYVDDIILAGNFLDEFTRIKAALDNAFKIKDLGVL 566
G+++ +H+LF K ++G I+ +YVDD+I GN + F K ++ F + DLG +
Sbjct: 543 GFEKCPYEHTLFVKLSEGGKILIISLYVDDLIFIGNDENMFEEFKKSMKKEFNMSDLGKM 722
Query: 567 KYFLGLEVSHSAKGISLC 584
YFLG+EV+ + KGI +C
Sbjct: 723 HYFLGVEVTQNEKGIYIC 776
>BG644690 weakly similar to GP|18542179|gb putative pol protein {Zea mays},
partial (22%)
Length = 629
Score = 112 bits (279), Expect(2) = 2e-30
Identities = 57/141 (40%), Positives = 90/141 (63%), Gaps = 1/141 (0%)
Frame = -2
Query: 398 VERYKARLVA*GYNQIEGIYYFDTFSPTAKLTIVRMVLALASVNNWHLHQLDVNNAFLLG 457
+ R K++LV GYNQ EGI Y + FSP A++ +R+++A A+ + L+Q+DV +AF+ G
Sbjct: 424 ITRNKSKLVVQGYNQKEGIDYDEAFSPVARMEAIRILIAFAAFMGFKLYQMDVKSAFING 245
Query: 458 DLYEDVYMKVPEGVSCVD-SGKVCKLHKSSYGLKQASRQWYAKFTSLLVTCGYKQAHSDH 516
DL E+V++K P G + V +L+K+ YGLKQA R WY + + L+ G+K+ D+
Sbjct: 244 DLKEEVFVKQPPGFEDAEVPNHVFRLNKTLYGLKQAPRAWYERLSKFLLKNGFKRGKIDN 65
Query: 517 SLFSKTQGQSFTILLIYVDDI 537
+LF + I+ +YVDDI
Sbjct: 64 TLFLLKRE*ELLIIQVYVDDI 2
Score = 39.3 bits (90), Expect(2) = 2e-30
Identities = 17/52 (32%), Positives = 26/52 (49%)
Frame = -3
Query: 340 EPHTYAEASKHKCWVDAMNLEISALEANGTWSLVPLPPNVVPIDNKWVYKIK 391
EP EA + W+++M E+ E + W LVP P I +WV++ K
Sbjct: 588 EPKNVKEALRDADWINSMQEELHQFERSKVWYLVPRPEGKTVIGTRWVFRNK 433
>BE123913 weakly similar to GP|22093573|d polyprotein {Oryza sativa (japonica
cultivar-group)}, partial (8%)
Length = 503
Score = 97.4 bits (241), Expect = 2e-20
Identities = 51/118 (43%), Positives = 73/118 (61%), Gaps = 1/118 (0%)
Frame = +1
Query: 491 QASRQWYAKFTSLLVTCGYKQAHSDHSLFSKTQGQ-SFTILLIYVDDIILAGNFLDEFTR 549
Q+ R W+ +FT ++ GY Q +DH++F K IL++YVDDI L G+ R
Sbjct: 1 QSPRDWFDRFT*VVKKFGYIQCQTDHAMFIKHSSTVKKAILIVYVDDIFLTGDHGK*IKR 180
Query: 550 IKAALDNAFKIKDLGVLKYFLGLEVSHSAKGISLCQRKYCLDLVHDSGVLGSKPVSTP 607
+K L F+IKDLG LKYFLG+EV+ KG S+ QRKY LDL+ ++ ++G K + P
Sbjct: 181 LKNLLAEEFEIKDLGNLKYFLGMEVARWKKGSSISQRKYVLDLLKETRMIGCKTIRDP 354
>BG452576 weakly similar to GP|12005223|gb|A reverse transcriptase-like
protein {Spiranthes hongkongensis}, partial (25%)
Length = 676
Score = 71.2 bits (173), Expect(4) = 2e-19
Identities = 44/106 (41%), Positives = 66/106 (61%)
Frame = +2
Query: 494 RQWYAKFTSLLVTCGYKQAHSDHSLFSKTQGQSFTILLIYVDDIILAGNFLDEFTRIKAA 553
R+ Y K S L++ GYKQ+ +DHS+FS G+ FTI L+Y+DDI+ A + E +K+
Sbjct: 236 RK*Y*KLLSTLISLGYKQSPNDHSIFSF--GRRFTIFLVYLDDIVFARDDHSETQLVKSH 409
Query: 554 LDNAFKIKDLGVLKYFLGLEVSHSAKGISLCQRKYCLDLVHDSGVL 599
LD F I LG L Y +G++++ S I Q KY ++L+ +SG L
Sbjct: 410 LDKNFIIIGLGTLHY-VGVKIA*SES*IIDDQCKYTIELLEESGQL 544
Score = 34.7 bits (78), Expect(4) = 2e-19
Identities = 15/22 (68%), Positives = 16/22 (72%)
Frame = +3
Query: 479 VCKLHKSSYGLKQASRQWYAKF 500
VCK H S YGLK A RQ Y+KF
Sbjct: 165 VCKFHNSIYGLKHAHRQ*YSKF 230
Score = 25.4 bits (54), Expect(4) = 2e-19
Identities = 11/16 (68%), Positives = 12/16 (74%)
Frame = +3
Query: 599 LGSKPVSTPLDPSSRL 614
LG KP STP DPS +L
Sbjct: 546 LGMKPSSTPRDPSLKL 593
Score = 21.9 bits (45), Expect(4) = 2e-19
Identities = 8/11 (72%), Positives = 8/11 (72%)
Frame = +1
Query: 443 WHLHQLDVNNA 453
WH LDVNNA
Sbjct: 40 WHQRLLDVNNA 72
>BG647824 weakly similar to PIR|G96722|G96 hypothetical protein F20P5.25
[imported] - Arabidopsis thaliana, partial (5%)
Length = 721
Score = 72.8 bits (177), Expect(2) = 5e-19
Identities = 34/60 (56%), Positives = 46/60 (76%), Gaps = 1/60 (1%)
Frame = -3
Query: 741 VYRALASATCEL*RLTYLLKDLQIEPVKPSVIYCDNQSAL-HIAANPVFHERTKHLEIEC 799
+YR++ S CE+ LTYLL DL+ +KP+++YCDNQSA HIAAN F ERTKH+E++C
Sbjct: 377 IYRSI*STICEIKWLTYLLNDLKFTFIKPAMLYCDNQSAARHIAANSSFLERTKHIELDC 198
Score = 40.4 bits (93), Expect(2) = 5e-19
Identities = 14/28 (50%), Positives = 24/28 (85%)
Frame = -2
Query: 704 CVDTRRSVTSYCFFIGNSLICWRSKKQQ 731
C+DT +S++ +C F+G+SLICW+S K++
Sbjct: 474 CLDT*KSISYFCIFLGDSLICWKS*KKK 391
>AJ502495 weakly similar to GP|18071369|g putative gag-pol polyprotein {Oryza
sativa}, partial (9%)
Length = 542
Score = 81.6 bits (200), Expect = 1e-15
Identities = 39/96 (40%), Positives = 60/96 (61%)
Frame = +2
Query: 703 GCVDTRRSVTSYCFFIGNSLICWRSKKQQTISKSSSEAVYRALASATCEL*RLTYLLKDL 762
G +TR+S + Y F +G I W SKKQ ++ S++EA Y A S + L +L+ +
Sbjct: 14 GDTETRKSTSGYAFHLGTGAISWSSKKQPVVAFSTAEAEYIASTSCATQTVWLRRILEVM 193
Query: 763 QIEPVKPSVIYCDNQSALHIAANPVFHERTKHLEIE 798
E P+ IYCDN+SA+ ++ NPVFH R+KH++I+
Sbjct: 194 HHEQNTPTKIYCDNKSAIALSKNPVFHGRSKHIDIQ 301
>TC92908 similar to GP|10140704|gb|AAG13538.1 putative gag-pol polyprotein
{Oryza sativa}, partial (1%)
Length = 638
Score = 81.3 bits (199), Expect = 1e-15
Identities = 53/144 (36%), Positives = 80/144 (54%)
Frame = +2
Query: 308 SSGIKYPISNYMSYSNLSIPHHAYAMSLSLDSEPHTYAEASKHKCWVDAMNLEISALEAN 367
SS Y I +Y+ YS S H A +++ + EP Y +A++ + W+ AM EI LE N
Sbjct: 185 SSSKSYHIFHYVGYS-FSAKHRASLAAITSNIEPKNYVQAAQ*QEWLAAMEQEIQVLEEN 361
Query: 368 GTWSLVPLPPNVVPIDNKWVYKIKRRANGTVERYKARLVA*GYNQIEGIYYFDTFSPTAK 427
T +L PL +D + VYKI +ANG +E+YKA+LVA + Q+EG + D + K
Sbjct: 362 NTSTLEPLREGKKWVDCRPVYKIIHKANGEIEKYKAQLVAKDFVQVEGEDF*D-LCLSNK 538
Query: 428 LTIVRMVLALASVNNWHLHQLDVN 451
R +L +A+ LH +DV+
Sbjct: 539 DDNCRCLLTIAAAKG*QLHLMDVS 610
>BG586293 weakly similar to PIR|E84473|E84 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana, partial
(7%)
Length = 763
Score = 79.7 bits (195), Expect(2) = 3e-15
Identities = 40/89 (44%), Positives = 58/89 (64%)
Frame = +2
Query: 369 TWSLVPLPPNVVPIDNKWVYKIKRRANGTVERYKARLVA*GYNQIEGIYYFDTFSPTAKL 428
T LV P V PI +W+YKIKR +GT+ +YKARLVA GY + +GI + + F+P ++
Sbjct: 56 TLKLVKKPTGVKPIGLRWIYKIKRNEDGTLIKYKARLVAKGYVKQQGIDFDEVFAPVVRI 235
Query: 429 TIVRMVLALASVNNWHLHQLDVNNAFLLG 457
+ ++LALA+ N +H +DV AFL G
Sbjct: 236 ETI*LLLALAATNGC*IHHIDVKIAFLNG 322
Score = 20.8 bits (42), Expect(2) = 3e-15
Identities = 7/15 (46%), Positives = 10/15 (66%)
Frame = +3
Query: 353 WVDAMNLEISALEAN 367
W+DAM EI ++ N
Sbjct: 9 WIDAMKAEIESIIKN 53
>CB893680 weakly similar to GP|1167523|db ORF(AA 1-1338) {Nicotiana tabacum},
partial (7%)
Length = 780
Score = 68.9 bits (167), Expect(2) = 2e-14
Identities = 38/89 (42%), Positives = 49/89 (54%)
Frame = -2
Query: 422 FSPTAKLTIVRMVLALASVNNWHLHQLDVNNAFLLGDLYEDVYMKVPEGVSCVDSGKVCK 481
F P KL + +L++ ++ N +L LDV AFL GDL ED+YM PEG S V K
Sbjct: 554 FVPIVKLNTIMFLLSIVAIENLYLE*LDVKTAFLRGDLVEDIYMHQPEGFS*EVGKMVGK 375
Query: 482 LHKSSYGLKQASRQWYAKFTSLLVTCGYK 510
L KS YGLKQ RQ +L G++
Sbjct: 374 LKKSMYGLKQGPRQCI*SLKALCTRKGFR 288
Score = 28.5 bits (62), Expect(2) = 2e-14
Identities = 10/38 (26%), Positives = 21/38 (54%)
Frame = -3
Query: 537 IILAGNFLDEFTRIKAALDNAFKIKDLGVLKYFLGLEV 574
+++ G+ +DE +K +KDLG K +G+++
Sbjct: 271 LLVVGSNIDEIKNLKTRFSKEIDMKDLGPAKKIIGMQI 158
>AJ497569 weakly similar to PIR|T04833|T04 hypothetical protein F21P8.50 -
Arabidopsis thaliana, partial (4%)
Length = 723
Score = 45.4 bits (106), Expect(2) = 5e-12
Identities = 25/48 (52%), Positives = 33/48 (68%)
Frame = +3
Query: 702 GGCVDTRRSVTSYCFFIGNSLICWRSKKQQTISKSSSEAVYRALASAT 749
G C + R T +CF + +SLI W+SKKQ +S+S SEA RALA+AT
Sbjct: 84 GSCPYSIR*TTEFCF-LSSSLISWKSKKQCVVSRSFSEA**RALANAT 224
Score = 43.9 bits (102), Expect(2) = 5e-12
Identities = 20/26 (76%), Positives = 21/26 (79%)
Frame = +2
Query: 773 YCDNQSALHIAANPVFHERTKHLEIE 798
YCDN SALHIAAN VFHERT H E +
Sbjct: 290 YCDNISALHIAANMVFHERT*HRETD 367
>BG586159 weakly similar to PIR|T47841|T4 hypothetical protein T2O9.150 -
Arabidopsis thaliana, partial (11%)
Length = 732
Score = 69.3 bits (168), Expect = 5e-12
Identities = 64/233 (27%), Positives = 100/233 (42%), Gaps = 15/233 (6%)
Frame = +1
Query: 572 LEVSHSAKGISLCQRKYCLDLVHDSGVLGSKPVSTPLDPSSRLSQDGGGATL*GCFFIQK 631
+EV + +GI +CQRKY DL+ G+ S P+ P +L +D G
Sbjct: 1 VEVIQNEEGIYICQRKYVTDLLERFGMEKSNLSRNPIAPRCKLIKDENGV---------- 150
Query: 632 THRKTVLSYYNQV*YNLCSSAAKSVPLQSYCDSLCCSS*NSKVS-ERESKKRVIFPKEFC 690
K + Y Q+ L AA L Y SL N + KRV+ +
Sbjct: 151 ---KVDATKYKQIVGCLMYLAATRPDLM-YVLSLISRFMNCPTELHMHAVKRVL--RYLN 312
Query: 691 STI-VGV**CRLG-------------GCVDTRRSVTSYCFFIGNSLICWRSKKQQTISKS 736
TI +G+ R G G +D R+S + Y F + + + W SKKQ ++ S
Sbjct: 313 GTINLGIMYKRNGSEKLEAYTDSDYAGDLDDRKSTSGYVFMLSSGAVSWSSKKQPVVTLS 492
Query: 737 SSEAVYRALASATCEL*RLTYLLKDLQIEPVKPSVIYCDNQSALHIAANPVFH 789
+++A + A A C+ + +L+ L +YCDN S + ++ NPV H
Sbjct: 493 TTKAEFIAAAFCACQSVWMRRVLEKLGYTQSGSITMYCDNNSTIKLSKNPVLH 651
>TC89912 weakly similar to PIR|B84512|B84512 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana, partial
(10%)
Length = 814
Score = 67.8 bits (164), Expect = 2e-11
Identities = 36/97 (37%), Positives = 62/97 (63%), Gaps = 2/97 (2%)
Frame = +1
Query: 703 GCVDTRRSVTSYCFFIGNSLICWRSKKQQTISKSSSEAVYRALASATCEL*RLTYLLKDL 762
G VDTR+S++ + F + + I W++ +Q ++ S+++A Y A + L ++ +L
Sbjct: 118 GNVDTRKSLSGFVFTLYGTTISWKANQQSVVTLSTTQAEYIAFVEGVKDAIWLKGMIGEL 297
Query: 763 QI--EPVKPSVIYCDNQSALHIAANPVFHERTKHLEI 797
I E VK I+CD+QSA+H+A + V+HERTKH++I
Sbjct: 298 GITQEYVK---IHCDSQSAIHLANHQVYHERTKHIDI 399
>BG587174 similar to PIR|A47759|A4775 retrovirus-related reverse
transcriptase homolog - rape retrotransposon copia-like
(fragment), partial (84%)
Length = 249
Score = 48.5 bits (114), Expect = 1e-05
Identities = 32/91 (35%), Positives = 49/91 (53%), Gaps = 10/91 (10%)
Frame = -1
Query: 458 DLYEDVYMKVPEGVSCVDSGK---VCKLHKSSYGLKQASRQWYAKFTSLLVTCGYKQAHS 514
+L E +YM PEG + GK VCKL KS YGLKQ+ RQWY +F S Y+ + +
Sbjct: 249 ELEEKIYMTQPEGF--LFPGKEDHVCKLRKSLYGLKQSPRQWYKRFDS------YRSSWA 94
Query: 515 DHSLF-------SKTQGQSFTILLIYVDDII 538
+ ++ + + L++YVDD++
Sbjct: 93 TTGVLMTVVST*TR*RMSRYIYLVLYVDDML 1
>AW685211 weakly similar to GP|4539662|gb| polyprotein {Sorghum bicolor},
partial (2%)
Length = 388
Score = 48.5 bits (114), Expect = 1e-05
Identities = 29/86 (33%), Positives = 44/86 (50%)
Frame = -2
Query: 356 AMNLEISALEANGTWSLVPLPPNVVPIDNKWVYKIKRRANGTVERYKARLVA*GYNQIEG 415
AM E++ALE+N TW+++PLP + YK R K R+ A GY+++ G
Sbjct: 219 AMIGEMTALESNRTWTIIPLPLEMP-------YK---------HRLKVRMAAKGYSKVYG 88
Query: 416 IYYFDTFSPTAKLTIVRMVLALASVN 441
++Y DTFSP + + L N
Sbjct: 87 LHYNDTFSPAEYCAMTHTTMELVCDN 10
>BE942480
Length = 396
Score = 48.5 bits (114), Expect = 1e-05
Identities = 20/44 (45%), Positives = 34/44 (76%)
Frame = -2
Query: 741 VYRALASATCEL*RLTYLLKDLQIEPVKPSVIYCDNQSALHIAA 784
+YRA++S CE+ LTY++ L+++ +KP++ Y DNQ+A HIA+
Sbjct: 320 IYRAMSSIVCEIEWLTYIVDVLKVQSIKPTLPYYDNQAARHIAS 189
>TC85125 weakly similar to SP|P10978|POLX_TOBAC Retrovirus-related Pol
polyprotein from transposon TNT 1-94 [Contains: Protease
(EC 3.4.23.-);, partial (7%)
Length = 705
Score = 48.5 bits (114), Expect = 1e-05
Identities = 19/27 (70%), Positives = 23/27 (84%)
Frame = +3
Query: 772 IYCDNQSALHIAANPVFHERTKHLEIE 798
+YCD+QSALHIA NP FH RTKH+ I+
Sbjct: 309 VYCDSQSALHIARNPAFHSRTKHIGIQ 389
>TC83624 homologue to PIR|G84581|G84581 copia-like retroelement pol
polyprotein [imported] - Arabidopsis thaliana, partial
(1%)
Length = 831
Score = 44.7 bits (104), Expect = 1e-04
Identities = 20/38 (52%), Positives = 28/38 (73%)
Frame = +2
Query: 759 LKDLQIEPVKPSVIYCDNQSALHIAANPVFHERTKHLE 796
L++LQ++ + +IYC NQ L+IA N V+HERTKH E
Sbjct: 443 LRNLQVQCTRLLLIYCVNQITLYIAKNQVYHERTKH*E 556
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.338 0.145 0.471
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 27,534,655
Number of Sequences: 36976
Number of extensions: 435780
Number of successful extensions: 4336
Number of sequences better than 10.0: 82
Number of HSP's better than 10.0 without gapping: 2515
Number of HSP's successfully gapped in prelim test: 292
Number of HSP's that attempted gapping in prelim test: 1713
Number of HSP's gapped (non-prelim): 2955
length of query: 799
length of database: 9,014,727
effective HSP length: 104
effective length of query: 695
effective length of database: 5,169,223
effective search space: 3592609985
effective search space used: 3592609985
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 15 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 39 (21.8 bits)
S2: 62 (28.5 bits)
Lotus: description of TM0171.6