
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0128.9
(1596 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
                                                                    Score  E
Sequences producing significant alignments:                         (bits) Value
BG644690 weakly similar to GP|18542179|gb putative pol protein {...   194  1e-67
BG586159 weakly similar to PIR|T47841|T4 hypothetical protein T2...   211  2e-54
AW689768 weakly similar to GP|10177485|d polyprotein {Arabidopsi...   172  1e-42
CB893805 similar to GP|10177935|d copia-type polyprotein {Arabid...   172  1e-42
BG587156 similar to PIR|G85055|G8 probable polyprotein [imported...   165  1e-40
BI309716 weakly similar to PIR|G96722|G9 hypothetical protein F2...   158  2e-38
TC93066 weakly similar to GP|19920130|gb|AAM08562.1 Putative ret...   143  5e-34
AJ502495 weakly similar to GP|18071369|g putative gag-pol polypr...   136  7e-32
TC85125 weakly similar to SP|P10978|POLX_TOBAC Retrovirus-relate...    78  8e-30
BI262917 weakly similar to GP|19920130|g Putative retroelement {...   118  2e-26
BG587170 similar to PIR|F86470|F8 probable retroelement polyprot...   116  6e-26
TC89912 weakly similar to PIR|B84512|B84512 probable retroelemen...   116  8e-26
BF650113 weakly similar to GP|4753889|emb| Tpv2-1c {Phaseolus vu...    97  7e-20
BG586293 weakly similar to PIR|E84473|E84 probable retroelement ...    96  1e-19
BG586273 weakly similar to PIR|F86470|F8 probable retroelement p...    80  6e-15
BG644677 weakly similar to GP|7682800|gb| Hypothetical protein T...    73  1e-12
BE123913 weakly similar to GP|22093573|d polyprotein {Oryza sati...    71  3e-12
CB893680 weakly similar to GP|1167523|db ORF(AA 1-1338) {Nicotia...    69  1e-11
BE941052 weakly similar to PIR|B85188|B85 retrotransposon like p...    64  6e-10
AJ499215 weakly similar to GP|6642775|gb| gag-pol polyprotein {V...    64  6e-10
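The one-line hit summaries above can be read programmatically. The following is a minimal sketch, not part of the BLAST distribution: the names `Hit` and `parse_hit_line` are hypothetical, and the pattern assumes each summary line ends with a bit score followed by an E-value written with a leading digit, as every line in this report does.

```python
import re
from typing import NamedTuple, Optional


class Hit(NamedTuple):
    """One row of the 'Sequences producing significant alignments' list."""
    accession: str
    description: str
    bits: float
    evalue: float


# Accession, free-text description, then the two trailing numeric columns.
# Whitespace between columns may be a single space or several.
_HIT = re.compile(
    r"^(?P<acc>\S+)\s+(?P<desc>.*?)\s+"
    r"(?P<bits>\d+(?:\.\d+)?)\s+"
    r"(?P<evalue>\d+(?:\.\d+)?(?:e[-+]?\d+)?)\s*$"
)


def parse_hit_line(line: str) -> Optional[Hit]:
    """Return a Hit for a summary line, or None if the line is not one."""
    m = _HIT.match(line.strip())
    if m is None:
        return None
    return Hit(m["acc"], m["desc"], float(m["bits"]), float(m["evalue"]))


if __name__ == "__main__":
    row = "TC85125 weakly similar to SP|P10978|POLX_TOBAC Retrovirus-relate... 78 8e-30"
    print(parse_hit_line(row))
```

Lines that do not end in two numeric columns, such as `Query= TM0128.9`, yield `None`, so the function can be mapped over the whole report and the results filtered.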
>BG644690 weakly similar to GP|18542179|gb putative pol protein {Zea mays},
partial (22%)
Length = 629
Score = 194 bits (492), Expect(2) = 1e-67
Identities = 91/141 (64%), Positives = 116/141 (81%)
Frame = -2
Query: 1146 VVRNKARLVAQGYSQQEGIDYTETFAPVARLEAIRLLISFSVNHNIVLHQMDVKSAFLNG 1205
+ RNK++LV QGY+Q+EGIDY E F+PVAR+EAIR+LI+F+ L+QMDVKSAF+NG
Sbjct: 424 ITRNKSKLVVQGYNQKEGIDYDEAFSPVARMEAIRILIAFAAFMGFKLYQMDVKSAFING 245
Query: 1206 YISEEVYVHQPPGFEDEKKPDHVFKLKKSLYGLKQAPRAWYERLSSFLLENEFVRGKVDT 1265
+ EEV+V QPPGFED + P+HVF+L K+LYGLKQAPRAWYERLS FLL+N F RGK+D
Sbjct: 244 DLKEEVFVKQPPGFEDAEVPNHVFRLNKTLYGLKQAPRAWYERLSKFLLKNGFKRGKIDN 65
Query: 1266 TLFCKTYKDDILIVQIYVDDI 1286
TLF + ++LI+Q+YVDDI
Sbjct: 64 TLFLLKRE*ELLIIQVYVDDI 2
Score = 83.2 bits (204), Expect(2) = 1e-67
Identities = 37/64 (57%), Positives = 49/64 (75%)
Frame = -3
Query: 1077 LLSLKGLVSLIEPKSIDEALQDKDWILAMEEELNQFSKNDVWSLVKKPESVLVIGTKWVF 1136
L+S +S IEPK++ EAL+D DWI +M+EEL+QF ++ VW LV +PE VIGT+WVF
Sbjct: 621 LVSFSAFISSIEPKNVKEALRDADWINSMQEELHQFERSKVWYLVPRPEGKTVIGTRWVF 442
Query: 1137 RNKL 1140
RNKL
Sbjct: 441 RNKL 430
>BG586159 weakly similar to PIR|T47841|T4 hypothetical protein T2O9.150 -
Arabidopsis thaliana, partial (11%)
Length = 732
Score = 211 bits (537), Expect = 2e-54
Identities = 103/217 (47%), Positives = 145/217 (66%), Gaps = 1/217 (0%)
Frame = +1
Query: 1321 IQVDQTPEGTYIHQSKYTKELLKKFNMLESTVAKTPMHPTCILEKEYKSGKVCQKLYRGM 1380
++V Q EG YI Q KY +LL++F M +S +++ P+ P C L K+ KV Y+ +
Sbjct: 1 VEVIQNEEGIYICQRKYVTDLLERFGMEKSNLSRNPIAPRCKLIKDENGVKVDATKYKQI 180
Query: 1381 IGSLLYLTASRPDILFSVHLCARFQSDPRETHLTAVKRILRYLKGTTNLGLMYKKTSEYK 1440
+G L+YL A+RPD+++ + L +RF + P E H+ AVKR+LRYL GT NLG+MYK+ K
Sbjct: 181 VGCLMYLAATRPDLMYVLSLISRFMNCPTELHMHAVKRVLRYLNGTINLGIMYKRNGSEK 360
Query: 1441 LSGYCDADYAGDRTERKSTSGNCQFLGSNLVSWASKRQSTIALSTAEAEYISAAICSTQM 1500
L Y D+DYAGD +RKSTSG L S VSW+SK+Q + LST +AE+I+AA C+ Q
Sbjct: 361 LEAYTDSDYAGDLDDRKSTSGYVFMLSSGAVSWSSKKQPVVTLSTTKAEFIAAAFCACQS 540
Query: 1501 LWMKHQLEDYQILES-NIPIYCDNTAAISLSKNPILH 1536
+WM+ LE +S +I +YCDN + I LSKNP+LH
Sbjct: 541 VWMRRVLEKLGYTQSGSITMYCDNNSTIKLSKNPVLH 651
>AW689768 weakly similar to GP|10177485|d polyprotein {Arabidopsis thaliana},
partial (9%)
Length = 675
Score = 172 bits (435), Expect = 1e-42
Identities = 96/218 (44%), Positives = 128/218 (58%), Gaps = 3/218 (1%)
Frame = +1
Query: 1080 LKGLVSLIEPKSID---EALQDKDWILAMEEELNQFSKNDVWSLVKKPESVLVIGTKWVF 1136
LK V L EP D L D W+ AM+ E N W LV P IG KWV+
Sbjct: 25 LKPKVFLTEPCPQDCENLPLSDPRWLQAMKTEYKALIDNKTWDLVPLPPHKKAIGCKWVY 204
Query: 1137 RNKLNEKGDVVRNKARLVAQGYSQQEGIDYTETFAPVARLEAIRLLISFSVNHNIVLHQM 1196
R K N G V + KARLVA+G+SQ G DYTETF+PV + IRL+++ ++ + + Q+
Sbjct: 205 RVKENPDGSVNKFKARLVAKGFSQTLGCDYTETFSPVIKPVTIRLILTIAITYKWEIQQI 384
Query: 1197 DVKSAFLNGYISEEVYVHQPPGFEDEKKPDHVFKLKKSLYGLKQAPRAWYERLSSFLLEN 1256
D+ +AFLNG++ EEVY+ QP GFE K V KL KSLYGLKQAPRAWYE L+S ++
Sbjct: 385 DINNAFLNGFLQEEVYMSQPQGFEAANK-SLVCKLNKSLYGLKQAPRAWYEXLTSAQIQF 561
Query: 1257 EFVRGKVDTTLFCKTYKDDILIVQIYVDDIIFGSANQS 1294
F + + D +L + + IYVDDI+ ++ S
Sbjct: 562 GFTKSRCDPSLLIYNQNGACIYLXIYVDDILITGSSAS 675
>CB893805 similar to GP|10177935|d copia-type polyprotein {Arabidopsis
thaliana}, partial (14%)
Length = 778
Score = 172 bits (435), Expect = 1e-42
Identities = 90/251 (35%), Positives = 149/251 (58%), Gaps = 6/251 (2%)
Frame = +3
Query: 1088 EPKSIDEALQDKDWILAMEEELNQFSKNDVWSLVKKPESVLVIGTKWVFRNKLNEKGDVV 1147
+P + +EA++ + W +M E+ +N+ W L IG KW+F+ KLNE G++
Sbjct: 39 DPTTFEEAVKSEKWRASMNNEMEATERNNTWELTDLRSGAKTIGLKWIFKTKLNENGEIE 218
Query: 1148 RNKARLVAQGYSQQEGIDYTETFAPVARLEAIRLLISFSV---NHNIVLHQMDVKSAFLN 1204
+ KARLVA+GYSQQ G+DYTE FAPVAR + IR++I+ + + + M + +
Sbjct: 219 KYKARLVAKGYSQQYGVDYTEVFAPVARWDTIRMVIALAAQIKRDGVCIS*M*KAHSCME 398
Query: 1205 GYISEEVYVHQPPGFEDEKKPDHVF--KLKKSLYGLKQAPRAWYERLSSFLLENEFVRGK 1262
+ + + ++ V ++K++LYGLKQAPRAWY R+ ++ + F +
Sbjct: 399 N*MRKFLLINH------RVM*RRVIS*RVKRALYGLKQAPRAWYSRIEAYFTKEGFEKCP 560
Query: 1263 VDTTLFCKTYK-DDILIVQIYVDDIIFGSANQSLCKEFSEMMQAEFEMSMMGELKYFMGI 1321
+ TLF K + ILI+ +YVDD+IF ++++ +EF + M+ EF MS +G++ YF+G+
Sbjct: 561 YEHTLFVKLSEGGKILIISLYVDDLIFIGNDENMFEEFKKSMKKEFNMSDLGKMHYFLGV 740
Query: 1322 QVDQTPEGTYI 1332
+V Q +G YI
Sbjct: 741 EVTQNEKGIYI 773
>BG587156 similar to PIR|G85055|G8 probable polyprotein [imported] -
Arabidopsis thaliana, partial (17%)
Length = 618
Score = 165 bits (418), Expect = 1e-40
Identities = 81/183 (44%), Positives = 118/183 (64%)
Frame = -1
Query: 1089 PKSIDEALQDKDWILAMEEELNQFSKNDVWSLVKKPESVLVIGTKWVFRNKLNEKGDVVR 1148
P+S +EA++DK+W ++ E KND W + P+ + ++W+F K G + R
Sbjct: 558 PRSYEEAMEDKEWKESVGAEAGAMIKNDTWYESELPKGKKAVSSRWIFTIKYKADGSIER 379
Query: 1149 NKARLVAQGYSQQEGIDYTETFAPVARLEAIRLLISFSVNHNIVLHQMDVKSAFLNGYIS 1208
K RLVA+G++ G DY ETFAPVA+L IR+++S +VN L QMDVK+AFL G +
Sbjct: 378 KKTRLVARGFTLTYGEDYIETFAPVAKLHTIRIVLSLAVNLGWGLWQMDVKNAFLQGELE 199
Query: 1209 EEVYVHQPPGFEDEKKPDHVFKLKKSLYGLKQAPRAWYERLSSFLLENEFVRGKVDTTLF 1268
+EVY++ PPG E K +V +LKK++YGLKQ+PRAWY +LS+ L F + ++D TLF
Sbjct: 198 DEVYMYPPPGLEHLVKRGNVLRLKKAIYGLKQSPRAWYNKLSTTLNGRGFRKSELDHTLF 19
Query: 1269 CKT 1271
T
Sbjct: 18 TLT 10
>BI309716 weakly similar to PIR|G96722|G9 hypothetical protein F20P5.25
[imported] - Arabidopsis thaliana, partial (10%)
Length = 744
Score = 158 bits (399), Expect = 2e-38
Identities = 89/241 (36%), Positives = 133/241 (54%)
Frame = +2
Query: 1228 VFKLKKSLYGLKQAPRAWYERLSSFLLENEFVRGKVDTTLFCKTYKDDILIVQIYVDDII 1287
V +L+KS+YGLKQA R WY +LS L+ +++ D +LF K + +YVDDI+
Sbjct: 20 VCELQKSIYGLKQASRQWYSKLSESLISFGYLQSSSDFSLFTKFKDSSFTTLLVYVDDIV 199
Query: 1288 FGSANQSLCKEFSEMMQAEFEMSMMGELKYFMGIQVDQTPEGTYIHQSKYTKELLKKFNM 1347
+ S + + F++ +G L+YF+G++V ++ +G ++Q KYT ELL+
Sbjct: 200 LAGNDISEIQHVKCFLIDRFKIKDLGSLRYFLGLEVARSKQGILLNQRKYTLELLEDSGN 379
Query: 1348 LESTVAKTPMHPTCILEKEYKSGKVCQKLYRGMIGSLLYLTASRPDILFSVHLCARFQSD 1407
L TP + L + YR +IG L+YLT +RPDI F+V ++F S
Sbjct: 380 LAVKSTLTPYDISLKLHNSDSPLYNDETQYRRLIGKLIYLTTTRPDISFAVQQLSQFVSK 559
Query: 1408 PRETHLTAVKRILRYLKGTTNLGLMYKKTSEYKLSGYCDADYAGDRTERKSTSGNCQFLG 1467
P++ H A R+L+YLK GL Y TS KLS + D+D+A T RKS +G FLG
Sbjct: 560 PQQVHYQAAIRVLQYLKTAPAKGLFYSATSNLKLSSFADSDWATCPTTRKSVTGYWVFLG 739
Query: 1468 S 1468
S
Sbjct: 740 S 742
>TC93066 weakly similar to GP|19920130|gb|AAM08562.1 Putative retroelement
{Oryza sativa} [Oryza sativa (japonica cultivar-group)],
partial (10%)
Length = 823
Score = 143 bits (361), Expect = 5e-34
Identities = 85/255 (33%), Positives = 131/255 (51%), Gaps = 3/255 (1%)
Frame = +1
Query: 731 GLPNLKFASDALCEACQKG-KFTKVPFKAKNVVSTSRPLELLHIDLFGPVKIESIGGKRY 789
G+ L+F L +K F+ + K + L+ +H DL+GP K+ S GG+RY
Sbjct: 1 GIDKLEFCKHLLFFGNRKKVSFSTATHRTKGI------LDYIHSDLWGPSKVTSYGGRRY 162
Query: 790 GMVIVDDYSRWTWVKFLTRKDESHVVFSTFIAQVQNEKACRIVRVRSDHGGEFENDKFES 849
M I+DD+ R WV FL K+E+ F + V+ + + ++ +D+ EF + F
Sbjct: 163 MMTIIDDFPRKVWVYFLRYKNETFPTFKKWRILVETQTGKNVKKLITDN*LEFCSSDFNE 342
Query: 850 LFDSYGIAHDFSCPRTPQQNGVVERKNRTLQEMARTMLQETGM--AKHFWAEAVNTACYI 907
++GIA + PR PQQNGV ER RTL E AR ML G+ + W EA +TAC++
Sbjct: 343 FCTNHGIARHKTIPRNPQQNGVAERMIRTLLERARCMLSNAGL*N*RDLWVEAASTACHL 522
Query: 908 QNRISVRPILNKTPYELWKNIKPNISYFHPFGCVCYVLNTKDRLHKFDAKSSKCLLLGYS 967
NR + K P ++W + S FGC Y L +L ++ +C+ L Y+
Sbjct: 523 VNRSPHSALDFKVPEDIWSGNLVDYSNLRIFGCPAYALVNDGKL---APRAGECIFLSYA 693
Query: 968 ERSKGFRFYNTDAKT 982
SKG+R + +D K+
Sbjct: 694 SESKGYRLWCSDPKS 738
>AJ502495 weakly similar to GP|18071369|g putative gag-pol polyprotein {Oryza
sativa}, partial (9%)
Length = 542
Score = 136 bits (342), Expect = 7e-32
Identities = 68/147 (46%), Positives = 100/147 (67%), Gaps = 2/147 (1%)
Frame = +2
Query: 1447 ADYAGDRTERKSTSGNCQFLGSNLVSWASKRQSTIALSTAEAEYISAAICSTQMLWMKHQ 1506
+D+AGD RKSTSG LG+ +SW+SK+Q +A STAEAEYI++ C+TQ +W++
Sbjct: 2 SDWAGDTETRKSTSGYAFHLGTGAISWSSKKQPVVAFSTAEAEYIASTSCATQTVWLRRI 181
Query: 1507 LEDYQILESNIP--IYCDNTAAISLSKNPILHSRAKHIEVKHHFIRDYVQKGVLLLKFVD 1564
LE E N P IYCDN +AI+LSKNP+ H R+KHI+++ H IR+ + + +++++
Sbjct: 182 LEVMHH-EQNTPTKIYCDNKSAIALSKNPVFHGRSKHIDIQFHKIRELIAEKEVVIEYCP 358
Query: 1565 TDHQWADIFTKPLAEDRFNFILKNLNM 1591
T+ + ADIFTKPL + F + K L M
Sbjct: 359 TEEKIADIFTKPLKIESFYKLKKMLGM 439
>TC85125 weakly similar to SP|P10978|POLX_TOBAC Retrovirus-related Pol
polyprotein from transposon TNT 1-94 [Contains: Protease
(EC 3.4.23.-);, partial (7%)
Length = 705
Score = 77.8 bits (190), Expect(2) = 8e-30
Identities = 40/87 (45%), Positives = 58/87 (65%)
Frame = +1
Query: 1416 VKRILRYLKGTTNLGLMYKKTSEYKLSGYCDADYAGDRTERKSTSGNCQFLGSNLVSWAS 1475
VKRI+RY+KGT+ + + + SE + GY D+D+AGD +RKST+G L VSW S
Sbjct: 1 VKRIMRYIKGTSGVAVCFGG-SELTVRGYVDSDFAGDHDKRKSTTGYVFTLAGGAVSWLS 177
Query: 1476 KRQSTIALSTAEAEYISAAICSTQMLW 1502
K Q+ +ALST EAEY++A + + L+
Sbjct: 178 KLQTVVALSTTEAEYMAAYLKHARKLF 258
Score = 72.8 bits (177), Expect(2) = 8e-30
Identities = 28/84 (33%), Positives = 55/84 (65%)
Frame = +3
Query: 1499 QMLWMKHQLEDYQILESNIPIYCDNTAAISLSKNPILHSRAKHIEVKHHFIRDYVQKGVL 1558
+ +WM+ +E+ + I +YCD+ +A+ +++NP HSR KHI +++HF+R+ V++G +
Sbjct: 249 EAIWMQRLMEELGHKQEQITVYCDSQSALHIARNPAFHSRTKHIGIQYHFVREVVEEGSV 428
Query: 1559 LLKFVDTDHQWADIFTKPLAEDRF 1582
++ + T+ AD TK + D+F
Sbjct: 429 DMQKIHTNDNLADAMTKSINTDKF 500
>BI262917 weakly similar to GP|19920130|g Putative retroelement {Oryza
sativa} [Oryza sativa (japonica cultivar-group)],
partial (8%)
Length = 426
Score = 118 bits (295), Expect = 2e-26
Identities = 58/113 (51%), Positives = 69/113 (60%)
Frame = +1
Query: 865 TPQQNGVVERKNRTLQEMARTMLQETGMAKHFWAEAVNTACYIQNRISVRPILNKTPYEL 924
TPQQNGV ER NRTL E R ML+ GMAK FWAEAV TACY+ NR I KTP E+
Sbjct: 85 TPQQNGVAERMNRTLLERTRAMLKTAGMAKSFWAEAVKTACYVINRSPSTVIDLKTPMEM 264
Query: 925 WKNIKPNISYFHPFGCVCYVLNTKDRLHKFDAKSSKCLLLGYSERSKGFRFYN 977
WK + S H FGC YV+ K D KS KC+ LGY++ KG+ ++
Sbjct: 265 WKGKPVDYSSLHVFGCPVYVMYNSQERTKLDPKSRKCIFLGYADNVKGYXLWD 423
>BG587170 similar to PIR|F86470|F8 probable retroelement polyprotein
[imported] - Arabidopsis thaliana, partial (13%)
Length = 718
Score = 116 bits (291), Expect = 6e-26
Identities = 80/239 (33%), Positives = 122/239 (50%), Gaps = 3/239 (1%)
Frame = -3
Query: 679 KNNIYKIRLSELEAQNVKCLLSVDE---EQWVWHRRLGHASMRKISQLSKLNLVRGLPNL 735
K ++Y + + N KC + + +WH RLGH R LNL+ LP +
Sbjct: 674 KGDLYMLEKLD-PVSNYKCSFTSSSSLNKDALWHARLGHPHGRA------LNLM--LPGV 522
Query: 736 KFASDALCEACQKGKFTKVPFKAKNVVSTSRPLELLHIDLFGPVKIESIGGKRYGMVIVD 795
F + CEAC GK K F + V + +L++ DL+ + S +Y + +D
Sbjct: 521 VFENKN-CEACILGKHCKNVFPRTSTVYENC-FDLIYTDLWTAPSL-SRDNHKYFVTFID 351
Query: 796 DYSRWTWVKFLTRKDESHVVFSTFIAQVQNEKACRIVRVRSDHGGEFENDKFESLFDSYG 855
+ S++TW+ + KD F F A V N +I +RSD+GGE+ + F+S D +G
Sbjct: 350 EKSKYTWLTLIPSKDRVIDAFKNFQAYVTNHYHAKIKILRSDNGGEYTSYAFKSHLDHHG 171
Query: 856 IAHDFSCPRTPQQNGVVERKNRTLQEMARTMLQETGMAKHFWAEAVNTACYIQNRISVR 914
I H SCP TPQQNGV +RKN+ L E+AR+++ + V+TACY+ N I +
Sbjct: 170 ILHQTSCPYTPQQNGVAKRKNKHLMEVARSLMFQAN---------VSTACYLINWIPTK 21
>TC89912 weakly similar to PIR|B84512|B84512 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana, partial
(10%)
Length = 814
Score = 116 bits (290), Expect = 8e-26
Identities = 58/163 (35%), Positives = 97/163 (58%), Gaps = 2/163 (1%)
Frame = +1
Query: 1415 AVKRILRYLKGTTNLGLMYKKTSEYK--LSGYCDADYAGDRTERKSTSGNCQFLGSNLVS 1472
A+K +L+YL + L Y K ++ + L GY DADYAG+ RKS SG L +S
Sbjct: 4 ALKWVLKYLNESLKSSLKYTKAAQEEDALEGYVDADYAGNVDTRKSLSGFVFTLYGTTIS 183
Query: 1473 WASKRQSTIALSTAEAEYISAAICSTQMLWMKHQLEDYQILESNIPIYCDNTAAISLSKN 1532
W + +QS + LST +AEYI+ +W+K + + I + + I+CD+ +AI L+ +
Sbjct: 184 WKANQQSVVTLSTTQAEYIAFVEGVKDAIWLKGMIGELGITQEYVKIHCDSQSAIHLANH 363
Query: 1533 PILHSRAKHIEVKHHFIRDYVQKGVLLLKFVDTDHQWADIFTK 1575
+ H R KHI+++ HFIRD ++ ++++ + ++ AD+FTK
Sbjct: 364 QVYHERTKHIDIRLHFIRDMIESKEIVVEKMASEENPADVFTK 492
>BF650113 weakly similar to GP|4753889|emb| Tpv2-1c {Phaseolus vulgaris},
partial (13%)
Length = 494
Score = 96.7 bits (239), Expect = 7e-20
Identities = 50/122 (40%), Positives = 74/122 (59%), Gaps = 3/122 (2%)
Frame = +1
Query: 1391 RPDILFSVHLCARFQSDPRETHLTAVKRILRYLKGTTNLGLMY---KKTSEYKLSGYCDA 1447
RPDI +SV + ++F DPR+ HL A RILRY++GT GL++ K+ Y+L Y D+
Sbjct: 121 RPDICYSVSVISKFMHDPRKPHLIAANRILRYVRGTMEYGLLFPYGAKSEVYELICYSDS 300
Query: 1448 DYAGDRTERKSTSGNCQFLGSNLVSWASKRQSTIALSTAEAEYISAAICSTQMLWMKHQL 1507
D+ GD R+STSG +SW +K+Q ALS+ EAEYI+ + Q LW+ +
Sbjct: 301 DWCGD---RRSTSGYVFKFNDAAISWCTKKQPITALSSYEAEYIAGTFATFQALWLDSVI 471
Query: 1508 ED 1509
++
Sbjct: 472 KE 477
>BG586293 weakly similar to PIR|E84473|E84 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana, partial
(7%)
Length = 763
Score = 95.9 bits (237), Expect = 1e-19
Identities = 49/116 (42%), Positives = 73/116 (62%)
Frame = +2
Query: 1091 SIDEALQDKDWILAMEEELNQFSKNDVWSLVKKPESVLVIGTKWVFRNKLNEKGDVVRNK 1150
S+D + +D I ++ L LVKKP V IG +W+++ K NE G +++ K
Sbjct: 5 SLDRCNEGRDRIYYQKQTLK---------LVKKPTGVKPIGLRWIYKIKRNEDGTLIKYK 157
Query: 1151 ARLVAQGYSQQEGIDYTETFAPVARLEAIRLLISFSVNHNIVLHQMDVKSAFLNGY 1206
ARLVA+GY +Q+GID+ E FAPV R+E I LL++ + + +H +DVK AFLNG+
Sbjct: 158 ARLVAKGYVKQQGIDFDEVFAPVVRIETI*LLLALAATNGC*IHHIDVKIAFLNGH 325
>BG586273 weakly similar to PIR|F86470|F8 probable retroelement polyprotein
[imported] - Arabidopsis thaliana, partial (7%)
Length = 705
Score = 80.1 bits (196), Expect = 6e-15
Identities = 40/113 (35%), Positives = 67/113 (58%)
Frame = -2
Query: 904 ACYIQNRISVRPILNKTPYELWKNIKPNISYFHPFGCVCYVLNTKDRLHKFDAKSSKCLL 963
ACY+ NRI R + ++ P+E+ KP+++Y FGC+CYVL + +K +A+S K +
Sbjct: 704 ACYLINRIPTRVLKDQAPFEVLNQRKPSLTYMRVFGCLCYVLVPGELRNKLEARSRKAMF 525
Query: 964 LGYSERSKGFRFYNTDAKTIEESIHVRFDDKLDSDQSKLVEKFADLSINVSDK 1016
+GYS KG++ Y+ +A+ + S V+F ++ + K E DL+ SDK
Sbjct: 524 IGYSTTQKGYKCYDPEARRVLVSRDVKFIEERGYYEEKNQEDLRDLT---SDK 375
>BG644677 weakly similar to GP|7682800|gb| Hypothetical protein T15F17.l
{Arabidopsis thaliana}, partial (3%)
Length = 539
Score = 72.8 bits (177), Expect = 1e-12
Identities = 39/104 (37%), Positives = 58/104 (55%)
Frame = -3
Query: 1385 LYLTASRPDILFSVHLCARFQSDPRETHLTAVKRILRYLKGTTNLGLMYKKTSEYKLSGY 1444
+ LT P+I FS++L +R+ S P H +K I +YLKG ++GL Y K L GY
Sbjct: 531 ILLTLQGPNITFSINLLSRYSSAPTMRH*NGIKHICKYLKGIIDMGLFYSKDCSPDLIGY 352
Query: 1445 CDADYAGDRTERKSTSGNCQFLGSNLVSWASKRQSTIALSTAEA 1488
+A Y D + +S +G G+ ++SW S + STIA S+ A
Sbjct: 351 VNA*YLSDPHKARS*TGYIFTCGNTVISWRSTK*STIATSSNHA 220
>BE123913 weakly similar to GP|22093573|d polyprotein {Oryza sativa (japonica
cultivar-group)}, partial (8%)
Length = 503
Score = 71.2 bits (173), Expect = 3e-12
Identities = 41/125 (32%), Positives = 67/125 (52%), Gaps = 3/125 (2%)
Frame = +1
Query: 1240 QAPRAWYERLSSFLLENEFVRGKVDTTLFCK---TYKDDILIVQIYVDDIIFGSANQSLC 1296
Q+PR W++R + + + +++ + D +F K T K ILIV YVDDI +
Sbjct: 1 QSPRDWFDRFT*VVKKFGYIQCQTDHAMFIKHSSTVKKAILIV--YVDDIFLTGDHGK*I 174
Query: 1297 KEFSEMMQAEFEMSMMGELKYFMGIQVDQTPEGTYIHQSKYTKELLKKFNMLESTVAKTP 1356
K ++ EFE+ +G LKYF+G++V + +G+ I Q KY +LLK+ M+ + P
Sbjct: 175 KRLKNLLAEEFEIKDLGNLKYFLGMEVARWKKGSSISQRKYVLDLLKETRMIGCKTIRDP 354
Query: 1357 MHPTC 1361
C
Sbjct: 355 YGCNC 369
Score = 32.7 bits (73), Expect = 1.2
Identities = 19/57 (33%), Positives = 31/57 (54%)
Frame = +2
Query: 1348 LESTVAKTPMHPTCILEKEYKSGKVCQKLYRGMIGSLLYLTASRPDILFSVHLCARF 1404
L+ ++TPM T L V + Y+ ++G L+YL+ +RPDI F V ++F
Sbjct: 329 LDVKPSETPMDATVKLGTLDNGTLVDKGRYQRLVGKLIYLSHTRPDISFVVCTMSQF 499
>CB893680 weakly similar to GP|1167523|db ORF(AA 1-1338) {Nicotiana tabacum},
partial (7%)
Length = 780
Score = 69.3 bits (168), Expect = 1e-11
Identities = 36/74 (48%), Positives = 47/74 (62%)
Frame = -2
Query: 1170 FAPVARLEAIRLLISFSVNHNIVLHQMDVKSAFLNGYISEEVYVHQPPGFEDEKKPDHVF 1229
F P+ +L I L+S N+ L +DVK+AFL G + E++Y+HQP GF E V
Sbjct: 554 FVPIVKLNTIMFLLSIVAIENLYLE*LDVKTAFLRGDLVEDIYMHQPEGFS*E-VGKMVG 378
Query: 1230 KLKKSLYGLKQAPR 1243
KLKKS+YGLKQ PR
Sbjct: 377 KLKKSMYGLKQGPR 336
>BE941052 weakly similar to PIR|B85188|B85 retrotransposon like protein
[imported] - Arabidopsis thaliana, partial (4%)
Length = 480
Score = 63.5 bits (153), Expect = 6e-10
Identities = 30/71 (42%), Positives = 43/71 (60%)
Frame = +2
Query: 1521 CDNTAAISLSKNPILHSRAKHIEVKHHFIRDYVQKGVLLLKFVDTDHQWADIFTKPLAED 1580
CD +A L+ NP+ HSR KHI + HF+RD VQ+G L ++ V T Q AD TKPL++
Sbjct: 29 CDYLSATYLTHNPVYHSRMKHISIDIHFVRDLVQQGKLKVQHVCTVDQLADCLTKPLSKS 208
Query: 1581 RFNFILKNLNM 1591
R + + +
Sbjct: 209 RHQLLRNKIGV 241
>AJ499215 weakly similar to GP|6642775|gb| gag-pol polyprotein {Vitis
vinifera}, partial (18%)
Length = 567
Score = 63.5 bits (153), Expect = 6e-10
Identities = 40/116 (34%), Positives = 58/116 (49%), Gaps = 3/116 (2%)
Frame = +3
Query: 561 LSMLQISLIAPLKHQSWYLDSGCSRHMTGEKRMFRELKLKPGGEVGFGGNEKGKIVGTGT 620
L+M + P K+ W +DSGC+ HMT ++ +F+EL +V ++ G GT
Sbjct: 66 LAMSTFATKQPSKY--WLIDSGCTHHMTHDRDLFKELNKSTISKVRMLNGAHIEVEGIGT 239
Query: 621 ICVDSS---PCIDNVLLVDGLTHNLLSISQLADKGYDVIFNQKSCRAVSQIDGSVL 673
+ V S I NVL L +LLS+ QL KGY V+F + C Q + VL
Sbjct: 240 VLVKSHSGYKQISNVLYAPKLNQSLLSVPQLLTKGYKVLFEHEKCVIKDQNNKEVL 407
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.332 0.142 0.437
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 48,574,853
Number of Sequences: 36976
Number of extensions: 693815
Number of successful extensions: 5838
Number of sequences better than 10.0: 63
Number of HSP's better than 10.0 without gapping: 3234
Number of HSP's successfully gapped in prelim test: 305
Number of HSP's that attempted gapping in prelim test: 2442
Number of HSP's gapped (non-prelim): 3769
length of query: 1596
length of database: 9,014,727
effective HSP length: 109
effective length of query: 1487
effective length of database: 4,984,343
effective search space: 7411718041
effective search space used: 7411718041
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 15 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 39 (21.5 bits)
S2: 65 (29.6 bits)
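The figures in the statistics footer are related by simple arithmetic, which the sketch below reproduces using only numbers printed in this report. It uses the standard Karlin-Altschul relations (bit score S' = (lambda*S - ln K) / ln 2, and E = search space * 2^-S'). The function names are illustrative, and the division of the nucleotide database size by three is inferred from the two database lengths shown above (27,044,181 letters versus 9,014,727), not from any BLAST documentation.

```python
import math

# Values copied from the footer of this report.
QUERY_LEN = 1596              # "length of query"
DB_LETTERS = 27_044_181       # nucleotide letters in MTGI
DB_SEQS = 36_976              # sequences in MTGI
HSP_LEN = 109                 # "effective HSP length"
LAMBDA_UNGAPPED, K_UNGAPPED = 0.332, 0.142
LAMBDA_GAPPED, K_GAPPED = 0.267, 0.0410


def effective_search_space():
    """Return (effective query length, effective db length, search space)."""
    db_len = DB_LETTERS // 3                 # translated length, as printed
    eff_query = QUERY_LEN - HSP_LEN
    eff_db = db_len - DB_SEQS * HSP_LEN
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw, lam, k):
    """Convert a raw alignment score to bits."""
    return (lam * raw - math.log(k)) / math.log(2)


def expect(bits, search_space):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    eff_q, eff_db, space = effective_search_space()
    print(eff_q, eff_db, space)
    # BG586159: raw score 537, reported as 211 bits with Expect = 2e-54.
    print(round(bit_score(537, LAMBDA_GAPPED, K_GAPPED), 1))
    print(expect(211, space))
    # Cutoffs S1 (ungapped parameters) and S2 (gapped parameters).
    print(round(bit_score(39, LAMBDA_UNGAPPED, K_UNGAPPED), 1))
    print(round(bit_score(65, LAMBDA_GAPPED, K_GAPPED), 1))
```

This covers single-HSP hits only. Entries reported as `Expect(2)`, such as BG644690 and TC85125, combine two HSPs with sum statistics, which this sketch does not attempt to reproduce.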