
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0359.3
(1572 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
BG644690 weakly similar to GP|18542179|gb putative pol protein {... 193 6e-67
BG586159 weakly similar to PIR|T47841|T4 hypothetical protein T2... 210 3e-54
CB893805 similar to GP|10177935|d copia-type polyprotein {Arabid... 174 2e-43
AW689768 weakly similar to GP|10177485|d polyprotein {Arabidopsi... 174 3e-43
BG587156 similar to PIR|G85055|G8 probable polyprotein [imported... 166 5e-41
BI309716 weakly similar to PIR|G96722|G9 hypothetical protein F2... 161 2e-39
TC93066 weakly similar to GP|19920130|gb|AAM08562.1 Putative ret... 141 2e-33
BI262917 weakly similar to GP|19920130|g Putative retroelement {... 119 7e-27
TC85125 weakly similar to SP|P10978|POLX_TOBAC Retrovirus-relate... 77 2e-26
BG587170 similar to PIR|F86470|F8 probable retroelement polyprot... 117 5e-26
AJ502495 weakly similar to GP|18071369|g putative gag-pol polypr... 113 7e-25
TC89912 weakly similar to PIR|B84512|B84512 probable retroelemen... 105 2e-22
BG586293 weakly similar to PIR|E84473|E84 probable retroelement ... 96 1e-19
BF650113 weakly similar to GP|4753889|emb| Tpv2-1c {Phaseolus vu... 94 3e-19
BG586273 weakly similar to PIR|F86470|F8 probable retroelement p... 80 6e-15
BE123913 weakly similar to GP|22093573|d polyprotein {Oryza sati... 72 2e-12
BG644677 weakly similar to GP|7682800|gb| Hypothetical protein T... 72 2e-12
CB893680 weakly similar to GP|1167523|db ORF(AA 1-1338) {Nicotia... 69 1e-11
AJ499215 weakly similar to GP|6642775|gb| gag-pol polyprotein {V... 64 6e-10
TC92908 similar to GP|10140704|gb|AAG13538.1 putative gag-pol po... 62 2e-09
>BG644690 weakly similar to GP|18542179|gb putative pol protein {Zea mays},
partial (22%)
Length = 629
Score = 193 bits (491), Expect(2) = 6e-67
Identities = 91/141 (64%), Positives = 115/141 (81%)
Frame = -2
Query: 1159 VVRNKARLVAQGYSQQEGIDYTETFAPVARLEAIRLLISFSVNHNIVLHQMDVKSAFLNG 1218
+ RNK++LV QGY+Q+EGIDY E F+PVAR+EAIR+LI+F+ L+QMDVKSAF+NG
Sbjct: 424 ITRNKSKLVVQGYNQKEGIDYDEAFSPVARMEAIRILIAFAAFMGFKLYQMDVKSAFING 245
Query: 1219 YISEEVYVHQPPGFEDERKPDHVFKLKKSLYGLKQAPRAWYERLSSFLLENEFVRGKVDT 1278
+ EEV+V QPPGFED P+HVF+L K+LYGLKQAPRAWYERLS FLL+N F RGK+D
Sbjct: 244 DLKEEVFVKQPPGFEDAEVPNHVFRLNKTLYGLKQAPRAWYERLSKFLLKNGFKRGKIDN 65
Query: 1279 TLFCKTYKDDILIVQIYVDDI 1299
TLF + ++LI+Q+YVDDI
Sbjct: 64 TLFLLKRE*ELLIIQVYVDDI 2
Score = 81.3 bits (199), Expect(2) = 6e-67
Identities = 36/64 (56%), Positives = 49/64 (76%)
Frame = -3
Query: 1090 LLSLKGLVSLIEPKSIDEALQDKEWILAMEEELNQFSKNDVWSLVKKPENVHVIGTKWVF 1149
L+S +S IEPK++ EAL+D +WI +M+EEL+QF ++ VW LV +PE VIGT+WVF
Sbjct: 621 LVSFSAFISSIEPKNVKEALRDADWINSMQEELHQFERSKVWYLVPRPEGKTVIGTRWVF 442
Query: 1150 RNKL 1153
RNKL
Sbjct: 441 RNKL 430
>BG586159 weakly similar to PIR|T47841|T4 hypothetical protein T2O9.150 -
Arabidopsis thaliana, partial (11%)
Length = 732
Score = 210 bits (535), Expect = 3e-54
Identities = 102/217 (47%), Positives = 145/217 (66%), Gaps = 1/217 (0%)
Frame = +1
Query: 1334 IQVDQTPEGTYIHQSKYTKELLKKFNMLESTVAKTPMHPTCILEKEDKSGKVCQKLYRGM 1393
++V Q EG YI Q KY +LL++F M +S +++ P+ P C L K++ KV Y+ +
Sbjct: 1 VEVIQNEEGIYICQRKYVTDLLERFGMEKSNLSRNPIAPRCKLIKDENGVKVDATKYKQI 180
Query: 1394 IGSLLYLTASRPDILFSVHLCARFQSDPRETHLTAVKRILRYLKGTTNLGLMYKKTSEYK 1453
+G L+YL A+RPD+++ + L +RF + P E H+ AVKR+LRYL GT NLG+MYK+ K
Sbjct: 181 VGCLMYLAATRPDLMYVLSLISRFMNCPTELHMHAVKRVLRYLNGTINLGIMYKRNGSEK 360
Query: 1454 LSGYCDADYAGDRTERKSTFGNCQFLGSNLVSWASKRQSTIALSTAEAEYISAAICSTQM 1513
L Y D+DYAGD +RKST G L S VSW+SK+Q + LST +AE+I+AA C+ Q
Sbjct: 361 LEAYTDSDYAGDLDDRKSTSGYVFMLSSGAVSWSSKKQPVVTLSTTKAEFIAAAFCACQS 540
Query: 1514 LWMKHQLEDYQILES-NIPIYCDNTAAISLSKNPILH 1549
+WM+ LE +S +I +YCDN + I LSKNP+LH
Sbjct: 541 VWMRRVLEKLGYTQSGSITMYCDNNSTIKLSKNPVLH 651
>CB893805 similar to GP|10177935|d copia-type polyprotein {Arabidopsis
thaliana}, partial (14%)
Length = 778
Score = 174 bits (442), Expect = 2e-43
Identities = 91/249 (36%), Positives = 150/249 (59%), Gaps = 4/249 (1%)
Frame = +3
Query: 1101 EPKSIDEALQDKEWILAMEEELNQFSKNDVWSLVKKPENVHVIGTKWVFRNKLNEKGEVV 1160
+P + +EA++ ++W +M E+ +N+ W L IG KW+F+ KLNE GE+
Sbjct: 39 DPTTFEEAVKSEKWRASMNNEMEATERNNTWELTDLRSGAKTIGLKWIFKTKLNENGEIE 218
Query: 1161 RNKARLVAQGYSQQEGIDYTETFAPVARLEAIRLLISFSV---NHNIVLHQMDVKSAFLN 1217
+ KARLVA+GYSQQ G+DYTE FAPVAR + IR++I+ + + + M + +
Sbjct: 219 KYKARLVAKGYSQQYGVDYTEVFAPVARWDTIRMVIALAAQIKRDGVCIS*M*KAHSCME 398
Query: 1218 GYISEEVYVHQPPGFEDERKPDHVFKLKKSLYGLKQAPRAWYERLSSFLLENEFVRGKVD 1277
+ + + ++ + ++K++LYGLKQAPRAWY R+ ++ + F + +
Sbjct: 399 N*MRKFLLINH----RVM*RRVIS*RVKRALYGLKQAPRAWYSRIEAYFTKEGFEKCPYE 566
Query: 1278 TTLFCKTYK-DDILIVQIYVDDIIFGSANQSLCKEFSEMMQAEFEMSMMGELKYFLGIQV 1336
TLF K + ILI+ +YVDD+IF ++++ +EF + M+ EF MS +G++ YFLG++V
Sbjct: 567 HTLFVKLSEGGKILIISLYVDDLIFIGNDENMFEEFKKSMKKEFNMSDLGKMHYFLGVEV 746
Query: 1337 DQTPEGTYI 1345
Q +G YI
Sbjct: 747 TQNEKGIYI 773
>AW689768 weakly similar to GP|10177485|d polyprotein {Arabidopsis thaliana},
partial (9%)
Length = 675
Score = 174 bits (440), Expect = 3e-43
Identities = 96/218 (44%), Positives = 129/218 (59%), Gaps = 3/218 (1%)
Frame = +1
Query: 1093 LKGLVSLIEPKSID---EALQDKEWILAMEEELNQFSKNDVWSLVKKPENVHVIGTKWVF 1149
LK V L EP D L D W+ AM+ E N W LV P + IG KWV+
Sbjct: 25 LKPKVFLTEPCPQDCENLPLSDPRWLQAMKTEYKALIDNKTWDLVPLPPHKKAIGCKWVY 204
Query: 1150 RNKLNEKGEVVRNKARLVAQGYSQQEGIDYTETFAPVARLEAIRLLISFSVNHNIVLHQM 1209
R K N G V + KARLVA+G+SQ G DYTETF+PV + IRL+++ ++ + + Q+
Sbjct: 205 RVKENPDGSVNKFKARLVAKGFSQTLGCDYTETFSPVIKPVTIRLILTIAITYKWEIQQI 384
Query: 1210 DVKSAFLNGYISEEVYVHQPPGFEDERKPDHVFKLKKSLYGLKQAPRAWYERLSSFLLEN 1269
D+ +AFLNG++ EEVY+ QP GFE K V KL KSLYGLKQAPRAWYE L+S ++
Sbjct: 385 DINNAFLNGFLQEEVYMSQPQGFEAANK-SLVCKLNKSLYGLKQAPRAWYEXLTSAQIQF 561
Query: 1270 EFVRGKVDTTLFCKTYKDDILIVQIYVDDIIFGSANQS 1307
F + + D +L + + IYVDDI+ ++ S
Sbjct: 562 GFTKSRCDPSLLIYNQNGACIYLXIYVDDILITGSSAS 675
>BG587156 similar to PIR|G85055|G8 probable polyprotein [imported] -
Arabidopsis thaliana, partial (17%)
Length = 618
Score = 166 bits (421), Expect = 5e-41
Identities = 82/183 (44%), Positives = 118/183 (63%)
Frame = -1
Query: 1102 PKSIDEALQDKEWILAMEEELNQFSKNDVWSLVKKPENVHVIGTKWVFRNKLNEKGEVVR 1161
P+S +EA++DKEW ++ E KND W + P+ + ++W+F K G + R
Sbjct: 558 PRSYEEAMEDKEWKESVGAEAGAMIKNDTWYESELPKGKKAVSSRWIFTIKYKADGSIER 379
Query: 1162 NKARLVAQGYSQQEGIDYTETFAPVARLEAIRLLISFSVNHNIVLHQMDVKSAFLNGYIS 1221
K RLVA+G++ G DY ETFAPVA+L IR+++S +VN L QMDVK+AFL G +
Sbjct: 378 KKTRLVARGFTLTYGEDYIETFAPVAKLHTIRIVLSLAVNLGWGLWQMDVKNAFLQGELE 199
Query: 1222 EEVYVHQPPGFEDERKPDHVFKLKKSLYGLKQAPRAWYERLSSFLLENEFVRGKVDTTLF 1281
+EVY++ PPG E K +V +LKK++YGLKQ+PRAWY +LS+ L F + ++D TLF
Sbjct: 198 DEVYMYPPPGLEHLVKRGNVLRLKKAIYGLKQSPRAWYNKLSTTLNGRGFRKSELDHTLF 19
Query: 1282 CKT 1284
T
Sbjct: 18 TLT 10
>BI309716 weakly similar to PIR|G96722|G9 hypothetical protein F20P5.25
[imported] - Arabidopsis thaliana, partial (10%)
Length = 744
Score = 161 bits (407), Expect = 2e-39
Identities = 91/241 (37%), Positives = 133/241 (54%)
Frame = +2
Query: 1241 VFKLKKSLYGLKQAPRAWYERLSSFLLENEFVRGKVDTTLFCKTYKDDILIVQIYVDDII 1300
V +L+KS+YGLKQA R WY +LS L+ +++ D +LF K + +YVDDI+
Sbjct: 20 VCELQKSIYGLKQASRQWYSKLSESLISFGYLQSSSDFSLFTKFKDSSFTTLLVYVDDIV 199
Query: 1301 FGSANQSLCKEFSEMMQAEFEMSMMGELKYFLGIQVDQTPEGTYIHQSKYTKELLKKFNM 1360
+ S + + F++ +G L+YFLG++V ++ +G ++Q KYT ELL+
Sbjct: 200 LAGNDISEIQHVKCFLIDRFKIKDLGSLRYFLGLEVARSKQGILLNQRKYTLELLEDSGN 379
Query: 1361 LESTVAKTPMHPTCILEKEDKSGKVCQKLYRGMIGSLLYLTASRPDILFSVHLCARFQSD 1420
L TP + L D + YR +IG L+YLT +RPDI F+V ++F S
Sbjct: 380 LAVKSTLTPYDISLKLHNSDSPLYNDETQYRRLIGKLIYLTTTRPDISFAVQQLSQFVSK 559
Query: 1421 PRETHLTAVKRILRYLKGTTNLGLMYKKTSEYKLSGYCDADYAGDRTERKSTFGNCQFLG 1480
P++ H A R+L+YLK GL Y TS KLS + D+D+A T RKS G FLG
Sbjct: 560 PQQVHYQAAIRVLQYLKTAPAKGLFYSATSNLKLSSFADSDWATCPTTRKSVTGYWVFLG 739
Query: 1481 S 1481
S
Sbjct: 740 S 742
>TC93066 weakly similar to GP|19920130|gb|AAM08562.1 Putative retroelement
{Oryza sativa} [Oryza sativa (japonica cultivar-group)],
partial (10%)
Length = 823
Score = 141 bits (356), Expect = 2e-33
Identities = 86/254 (33%), Positives = 130/254 (50%), Gaps = 2/254 (0%)
Frame = +1
Query: 744 GLPNLKFASDALCEACQKGKFTKVPFNAKNVVSTSRPLELLHIDLFGPVKTESIGGKRYG 803
G+ L+F L +K KV F+ T L+ +H DL+GP K S GG+RY
Sbjct: 1 GIDKLEFCKHLLFFGNRK----KVSFSTATH-RTKGILDYIHSDLWGPSKVTSYGGRRYM 165
Query: 804 MVIVDDYSRWTWVKFLTRKDESHAVFSTFIAQVQNEKACRIVRVRSDHGGEFENDKFESL 863
M I+DD+ R WV FL K+E+ F + V+ + + ++ +D+ EF + F
Sbjct: 166 MTIIDDFPRKVWVYFLRYKNETFPTFKKWRILVETQTGKNVKKLITDN*LEFCSSDFNEF 345
Query: 864 FDSYGISHDFSCPRTPQQNGVVERKNRTLQEMARTMLQETGM--AKHFWAEAVNTACYIP 921
++GI+ + PR PQQNGV ER RTL E AR ML G+ + W EA +TAC++
Sbjct: 346 CTNHGIARHKTIPRNPQQNGVAERMIRTLLERARCMLSNAGL*N*RDLWVEAASTACHLV 525
Query: 922 NRISVRPILNKTPYELWKNIKPNISYFHPFGCVCYVLNTKDRLHKFDAKSSKCLLLGYSD 981
NR + K P ++W + S FGC Y L +L ++ +C+ L Y+
Sbjct: 526 NRSPHSALDFKVPEDIWSGNLVDYSNLRIFGCPAYALVNDGKL---APRAGECIFLSYAS 696
Query: 982 RSKGFRFYNTDAKT 995
SKG+R + +D K+
Sbjct: 697 ESKGYRLWCSDPKS 738
>BI262917 weakly similar to GP|19920130|g Putative retroelement {Oryza
sativa} [Oryza sativa (japonica cultivar-group)],
partial (8%)
Length = 426
Score = 119 bits (299), Expect = 7e-27
Identities = 59/113 (52%), Positives = 69/113 (60%)
Frame = +1
Query: 878 TPQQNGVVERKNRTLQEMARTMLQETGMAKHFWAEAVNTACYIPNRISVRPILNKTPYEL 937
TPQQNGV ER NRTL E R ML+ GMAK FWAEAV TACY+ NR I KTP E+
Sbjct: 85 TPQQNGVAERMNRTLLERTRAMLKTAGMAKSFWAEAVKTACYVINRSPSTVIDLKTPMEM 264
Query: 938 WKNIKPNISYFHPFGCVCYVLNTKDRLHKFDAKSSKCLLLGYSDRSKGFRFYN 990
WK + S H FGC YV+ K D KS KC+ LGY+D KG+ ++
Sbjct: 265 WKGKPVDYSSLHVFGCPVYVMYNSQERTKLDPKSRKCIFLGYADNVKGYXLWD 423
>TC85125 weakly similar to SP|P10978|POLX_TOBAC Retrovirus-related Pol
polyprotein from transposon TNT 1-94 [Contains: Protease
(EC 3.4.23.-);, partial (7%)
Length = 705
Score = 76.6 bits (187), Expect(2) = 2e-26
Identities = 40/87 (45%), Positives = 57/87 (64%)
Frame = +1
Query: 1429 VKRILRYLKGTTNLGLMYKKTSEYKLSGYCDADYAGDRTERKSTFGNCQFLGSNLVSWAS 1488
VKRI+RY+KGT+ + + + SE + GY D+D+AGD +RKST G L VSW S
Sbjct: 1 VKRIMRYIKGTSGVAVCFGG-SELTVRGYVDSDFAGDHDKRKSTTGYVFTLAGGAVSWLS 177
Query: 1489 KRQSTIALSTAEAEYISAAICSTQMLW 1515
K Q+ +ALST EAEY++A + + L+
Sbjct: 178 KLQTVVALSTTEAEYMAAYLKHARKLF 258
Score = 62.8 bits (151), Expect(2) = 2e-26
Identities = 22/58 (37%), Positives = 41/58 (69%)
Frame = +3
Query: 1512 QMLWMKHQLEDYQILESNIPIYCDNTAAISLSKNPILHSRAKHIEVKYHFIRDYVQKG 1569
+ +WM+ +E+ + I +YCD+ +A+ +++NP HSR KHI ++YHF+R+ V++G
Sbjct: 249 EAIWMQRLMEELGHKQEQITVYCDSQSALHIARNPAFHSRTKHIGIQYHFVREVVEEG 422
>BG587170 similar to PIR|F86470|F8 probable retroelement polyprotein
[imported] - Arabidopsis thaliana, partial (13%)
Length = 718
Score = 117 bits (292), Expect = 5e-26
Identities = 82/240 (34%), Positives = 125/240 (51%), Gaps = 4/240 (1%)
Frame = -3
Query: 692 KNNIYKIRLSELEAQNVKCLL----SVNEEQWVWHRRLGHASMRKISQLSKLNLVRGLPN 747
K ++Y + + N KC S+N++ +WH RLGH R LNL+ LP
Sbjct: 674 KGDLYMLEKLD-PVSNYKCSFTSSSSLNKDA-LWHARLGHPHGRA------LNLM--LPG 525
Query: 748 LKFASDALCEACQKGKFTKVPFNAKNVVSTSRPLELLHIDLFGPVKTESIGGKRYGMVIV 807
+ F + CEAC GK K F + V + +L++ DL+ + S +Y + +
Sbjct: 524 VVFENKN-CEACILGKHCKNVFPRTSTVYENC-FDLIYTDLW-TAPSLSRDNHKYFVTFI 354
Query: 808 DDYSRWTWVKFLTRKDESHAVFSTFIAQVQNEKACRIVRVRSDHGGEFENDKFESLFDSY 867
D+ S++TW+ + KD F F A V N +I +RSD+GGE+ + F+S D +
Sbjct: 353 DEKSKYTWLTLIPSKDRVIDAFKNFQAYVTNHYHAKIKILRSDNGGEYTSYAFKSHLDHH 174
Query: 868 GISHDFSCPRTPQQNGVVERKNRTLQEMARTMLQETGMAKHFWAEAVNTACYIPNRISVR 927
GI H SCP TPQQNGV +RKN+ L E+AR+++ + V+TACY+ N I +
Sbjct: 173 GILHQTSCPYTPQQNGVAKRKNKHLMEVARSLMFQAN---------VSTACYLINWIPTK 21
>AJ502495 weakly similar to GP|18071369|g putative gag-pol polyprotein {Oryza
sativa}, partial (9%)
Length = 542
Score = 113 bits (282), Expect = 7e-25
Identities = 54/109 (49%), Positives = 77/109 (70%), Gaps = 2/109 (1%)
Frame = +2
Query: 1460 ADYAGDRTERKSTFGNCQFLGSNLVSWASKRQSTIALSTAEAEYISAAICSTQMLWMKHQ 1519
+D+AGD RKST G LG+ +SW+SK+Q +A STAEAEYI++ C+TQ +W++
Sbjct: 2 SDWAGDTETRKSTSGYAFHLGTGAISWSSKKQPVVAFSTAEAEYIASTSCATQTVWLRRI 181
Query: 1520 LEDYQILESNIP--IYCDNTAAISLSKNPILHSRAKHIEVKYHFIRDYV 1566
LE E N P IYCDN +AI+LSKNP+ H R+KHI++++H IR+ +
Sbjct: 182 LE-VMHHEQNTPTKIYCDNKSAIALSKNPVFHGRSKHIDIQFHKIRELI 325
>TC89912 weakly similar to PIR|B84512|B84512 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana, partial
(10%)
Length = 814
Score = 105 bits (261), Expect = 2e-22
Identities = 52/142 (36%), Positives = 83/142 (57%), Gaps = 2/142 (1%)
Frame = +1
Query: 1428 AVKRILRYLKGTTNLGLMYKKTSEYK--LSGYCDADYAGDRTERKSTFGNCQFLGSNLVS 1485
A+K +L+YL + L Y K ++ + L GY DADYAG+ RKS G L +S
Sbjct: 4 ALKWVLKYLNESLKSSLKYTKAAQEEDALEGYVDADYAGNVDTRKSLSGFVFTLYGTTIS 183
Query: 1486 WASKRQSTIALSTAEAEYISAAICSTQMLWMKHQLEDYQILESNIPIYCDNTAAISLSKN 1545
W + +QS + LST +AEYI+ +W+K + + I + + I+CD+ +AI L+ +
Sbjct: 184 WKANQQSVVTLSTTQAEYIAFVEGVKDAIWLKGMIGELGITQEYVKIHCDSQSAIHLANH 363
Query: 1546 PILHSRAKHIEVKYHFIRDYVQ 1567
+ H R KHI+++ HFIRD ++
Sbjct: 364 QVYHERTKHIDIRLHFIRDMIE 429
>BG586293 weakly similar to PIR|E84473|E84 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana, partial
(7%)
Length = 763
Score = 95.5 bits (236), Expect = 1e-19
Identities = 44/87 (50%), Positives = 63/87 (71%)
Frame = +2
Query: 1133 LVKKPENVHVIGTKWVFRNKLNEKGEVVRNKARLVAQGYSQQEGIDYTETFAPVARLEAI 1192
LVKKP V IG +W+++ K NE G +++ KARLVA+GY +Q+GID+ E FAPV R+E I
Sbjct: 65 LVKKPTGVKPIGLRWIYKIKRNEDGTLIKYKARLVAKGYVKQQGIDFDEVFAPVVRIETI 244
Query: 1193 RLLISFSVNHNIVLHQMDVKSAFLNGY 1219
LL++ + + +H +DVK AFLNG+
Sbjct: 245 *LLLALAATNGC*IHHIDVKIAFLNGH 325
>BF650113 weakly similar to GP|4753889|emb| Tpv2-1c {Phaseolus vulgaris},
partial (13%)
Length = 494
Score = 94.4 bits (233), Expect = 3e-19
Identities = 49/122 (40%), Positives = 73/122 (59%), Gaps = 3/122 (2%)
Frame = +1
Query: 1404 RPDILFSVHLCARFQSDPRETHLTAVKRILRYLKGTTNLGLMY---KKTSEYKLSGYCDA 1460
RPDI +SV + ++F DPR+ HL A RILRY++GT GL++ K+ Y+L Y D+
Sbjct: 121 RPDICYSVSVISKFMHDPRKPHLIAANRILRYVRGTMEYGLLFPYGAKSEVYELICYSDS 300
Query: 1461 DYAGDRTERKSTFGNCQFLGSNLVSWASKRQSTIALSTAEAEYISAAICSTQMLWMKHQL 1520
D+ GD R+ST G +SW +K+Q ALS+ EAEYI+ + Q LW+ +
Sbjct: 301 DWCGD---RRSTSGYVFKFNDAAISWCTKKQPITALSSYEAEYIAGTFATFQALWLDSVI 471
Query: 1521 ED 1522
++
Sbjct: 472 KE 477
>BG586273 weakly similar to PIR|F86470|F8 probable retroelement polyprotein
[imported] - Arabidopsis thaliana, partial (7%)
Length = 705
Score = 80.1 bits (196), Expect = 6e-15
Identities = 40/113 (35%), Positives = 67/113 (58%)
Frame = -2
Query: 917 ACYIPNRISVRPILNKTPYELWKNIKPNISYFHPFGCVCYVLNTKDRLHKFDAKSSKCLL 976
ACY+ NRI R + ++ P+E+ KP+++Y FGC+CYVL + +K +A+S K +
Sbjct: 704 ACYLINRIPTRVLKDQAPFEVLNQRKPSLTYMRVFGCLCYVLVPGELRNKLEARSRKAMF 525
Query: 977 LGYSDRSKGFRFYNTDAKTIEESIHVRFDDKLDSDQSKLVEKFADLSINVSDK 1029
+GYS KG++ Y+ +A+ + S V+F ++ + K E DL+ SDK
Sbjct: 524 IGYSTTQKGYKCYDPEARRVLVSRDVKFIEERGYYEEKNQEDLRDLT---SDK 375
>BE123913 weakly similar to GP|22093573|d polyprotein {Oryza sativa (japonica
cultivar-group)}, partial (8%)
Length = 503
Score = 72.0 bits (175), Expect = 2e-12
Identities = 42/125 (33%), Positives = 67/125 (53%), Gaps = 3/125 (2%)
Frame = +1
Query: 1253 QAPRAWYERLSSFLLENEFVRGKVDTTLFCK---TYKDDILIVQIYVDDIIFGSANQSLC 1309
Q+PR W++R + + + +++ + D +F K T K ILIV YVDDI +
Sbjct: 1 QSPRDWFDRFT*VVKKFGYIQCQTDHAMFIKHSSTVKKAILIV--YVDDIFLTGDHGK*I 174
Query: 1310 KEFSEMMQAEFEMSMMGELKYFLGIQVDQTPEGTYIHQSKYTKELLKKFNMLESTVAKTP 1369
K ++ EFE+ +G LKYFLG++V + +G+ I Q KY +LLK+ M+ + P
Sbjct: 175 KRLKNLLAEEFEIKDLGNLKYFLGMEVARWKKGSSISQRKYVLDLLKETRMIGCKTIRDP 354
Query: 1370 MHPTC 1374
C
Sbjct: 355 YGCNC 369
Score = 36.2 bits (82), Expect = 0.10
Identities = 20/57 (35%), Positives = 32/57 (56%)
Frame = +2
Query: 1361 LESTVAKTPMHPTCILEKEDKSGKVCQKLYRGMIGSLLYLTASRPDILFSVHLCARF 1417
L+ ++TPM T L D V + Y+ ++G L+YL+ +RPDI F V ++F
Sbjct: 329 LDVKPSETPMDATVKLGTLDNGTLVDKGRYQRLVGKLIYLSHTRPDISFVVCTMSQF 499
>BG644677 weakly similar to GP|7682800|gb| Hypothetical protein T15F17.l
{Arabidopsis thaliana}, partial (3%)
Length = 539
Score = 71.6 bits (174), Expect = 2e-12
Identities = 39/104 (37%), Positives = 57/104 (54%)
Frame = -3
Query: 1398 LYLTASRPDILFSVHLCARFQSDPRETHLTAVKRILRYLKGTTNLGLMYKKTSEYKLSGY 1457
+ LT P+I FS++L +R+ S P H +K I +YLKG ++GL Y K L GY
Sbjct: 531 ILLTLQGPNITFSINLLSRYSSAPTMRH*NGIKHICKYLKGIIDMGLFYSKDCSPDLIGY 352
Query: 1458 CDADYAGDRTERKSTFGNCQFLGSNLVSWASKRQSTIALSTAEA 1501
+A Y D + +S G G+ ++SW S + STIA S+ A
Sbjct: 351 VNA*YLSDPHKARS*TGYIFTCGNTVISWRSTK*STIATSSNHA 220
>CB893680 weakly similar to GP|1167523|db ORF(AA 1-1338) {Nicotiana tabacum},
partial (7%)
Length = 780
Score = 69.3 bits (168), Expect = 1e-11
Identities = 36/74 (48%), Positives = 47/74 (62%)
Frame = -2
Query: 1183 FAPVARLEAIRLLISFSVNHNIVLHQMDVKSAFLNGYISEEVYVHQPPGFEDERKPDHVF 1242
F P+ +L I L+S N+ L +DVK+AFL G + E++Y+HQP GF E V
Sbjct: 554 FVPIVKLNTIMFLLSIVAIENLYLE*LDVKTAFLRGDLVEDIYMHQPEGFS*E-VGKMVG 378
Query: 1243 KLKKSLYGLKQAPR 1256
KLKKS+YGLKQ PR
Sbjct: 377 KLKKSMYGLKQGPR 336
>AJ499215 weakly similar to GP|6642775|gb| gag-pol polyprotein {Vitis
vinifera}, partial (18%)
Length = 567
Score = 63.5 bits (153), Expect = 6e-10
Identities = 40/116 (34%), Positives = 58/116 (49%), Gaps = 3/116 (2%)
Frame = +3
Query: 574 LSMLQISLIAPLKHQSWYLDSGCSRHMTGEKRMFRELKLKPGGEVGFGGNEKGKIVGTGT 633
L+M + P K+ W +DSGC+ HMT ++ +F+EL +V ++ G GT
Sbjct: 66 LAMSTFATKQPSKY--WLIDSGCTHHMTHDRDLFKELNKSTISKVRMLNGAHIEVEGIGT 239
Query: 634 ICVDSS---PCIDNVLLVDGLTHNLLSISQLADKGYDVIFNQKSCRAVSQIDGSVL 686
+ V S I NVL L +LLS+ QL KGY V+F + C Q + VL
Sbjct: 240 VLVKSHSGYKQISNVLYAPKLNQSLLSVPQLLTKGYKVLFEHEKCVIKDQNNKEVL 407
>TC92908 similar to GP|10140704|gb|AAG13538.1 putative gag-pol polyprotein
{Oryza sativa}, partial (1%)
Length = 638
Score = 62.0 bits (149), Expect = 2e-09
Identities = 50/186 (26%), Positives = 88/186 (46%)
Frame = +2
Query: 1026 VSDKGKAPEEVEPEEDEPEEEAGPSNSQTLKKSRITAAHPKELILGNKDEPVRTRSAFRP 1085
+ + K+ EV+ E + EE G L+K + +H L +K + +
Sbjct: 53 IQEDFKSHGEVQEESNYIEEIKGFQEPTQLRKIKE*ESHLVTLNSSSKSYHIFHYVGYSF 232
Query: 1086 SEETLLSLKGLVSLIEPKSIDEALQDKEWILAMEEELNQFSKNDVWSLVKKPENVHVIGT 1145
S + SL + S IEPK+ +A Q +EW+ AME+E+ +N+ +L E +
Sbjct: 233 SAKHRASLAAITSNIEPKNYVQAAQ*QEWLAAMEQEIQVLEENNTSTLEPLREGKKWVDC 412
Query: 1146 KWVFRNKLNEKGEVVRNKARLVAQGYSQQEGIDYTETFAPVARLEAIRLLISFSVNHNIV 1205
+ V++ GE+ + KA+LVA+ + Q EG D+ + + + R L++ +
Sbjct: 413 RPVYKIIHKANGEIEKYKAQLVAKDFVQVEGEDF*D-LCLSNKDDNCRCLLTIAAAKG*Q 589
Query: 1206 LHQMDV 1211
LH MDV
Sbjct: 590 LHLMDV 607
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.332 0.142 0.436
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 47,346,617
Number of Sequences: 36976
Number of extensions: 668342
Number of successful extensions: 4292
Number of sequences better than 10.0: 65
Number of HSP's better than 10.0 without gapping: 3175
Number of HSP's successfully gapped in prelim test: 157
Number of HSP's that attempted gapping in prelim test: 998
Number of HSP's gapped (non-prelim): 3463
length of query: 1572
length of database: 9,014,727
effective HSP length: 109
effective length of query: 1463
effective length of database: 4,984,343
effective search space: 7292093809
effective search space used: 7292093809
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 15 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (22.0 bits)
S2: 65 (29.6 bits)
Lotus: description of TM0359.3