
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0003.6
(1307 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete 503 e-142
TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, co... 499 e-141
CF922226 156 6e-38
BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vi... 143 4e-34
AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza ... 135 2e-31
CO981347 108 1e-29
CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {V... 120 4e-27
NP005470 reverse transcriptase 117 3e-26
TC222253 similar to UP|Q9ZQE4 (Q9ZQE4) Copia-like retroelement p... 116 6e-26
TC234303 weakly similar to UP|Q8W153 (Q8W153) Polyprotein, parti... 113 5e-25
BM307983 112 1e-24
BI701169 109 7e-24
CA937893 similar to GP|20805072|dbj retrovirus-related pol polyp... 109 9e-24
CO981879 71 1e-23
AI959950 107 5e-23
TC235104 weakly similar to UP|Q850H8 (Q850H8) Gag-pol polyprotei... 103 4e-22
CO983516 99 1e-20
TC213365 similar to UP|Q6I8L6 (Q6I8L6) Reverse transcriptase (Fr... 97 6e-20
TC232995 92 1e-18
TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Frag... 92 1e-18
>TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete
Length = 4731
Score = 503 bits (1295), Expect = e-142
Identities = 337/1096 (30%), Positives = 546/1096 (49%), Gaps = 18/1096 (1%)
Frame = +1
Query: 221 RSKSKDRKTTECYSCKQIGHWKRDC------PNRSGKSGNNSSSANVVQSDGSCSEEDLL 274
+ K RK C+ C + GH K C P+ +S N+ V + S L+
Sbjct: 1477 QQKKSKRKKWRCHYCGKYGHIKPFCYHLHGHPHHGTQSSNSRKKMMWVPKHKAVS---LV 1647
Query: 275 CVSSVKCT--DAWVLDSGCSYHMTQHREWFNSFKSGDLGYVYLGDDKPCIIKGMRQVKIA 332
+S++ + + W LDSGCS HMT +E+ + + YV GD I GM +
Sbjct: 1648 VHTSLRASAKEDWYLDSGCSRHMTGVKEFLLNIEPCSTSYVTFGDGSKGKIIGMGK---- 1815
Query: 333 LDDGGVRTLSQVRYVLEVMKNLISLGTLHENGYSFKSEENRDILRVSKGAMTVMRAKRTA 392
L G+ +L++V V + NLIS+ L + G++ ++ ++ K + +M+ R+
Sbjct: 1816 LVHDGLPSLNKVLLVKGLTANLISISQLCDEGFNVNFTKSECLVTNEKSEV-LMKGSRSK 1992
Query: 393 GNIYKLLGGTIMGDVASVETDDDATKLWHMRLGHLSERGMMELHKRNMLKGVRSCIIG-- 450
N Y + + +D ++WH R GHL RGM ++ + ++G+ + I
Sbjct: 1993 DNCYLWTPQETSYSSTCLSSKEDEVRIWHQRFGHLHLRGMKKIIDKGAVRGIPNLKIEEG 2172
Query: 451 -LCKYCVLGKQCRVRF-KTGHHKTKGILDYVHSDVRGPTKEPSV*GFRYFVTFTDDFSRK 508
+C C +GKQ ++ K H T +L+ +H D+ GP + S+ G RY DDFSR
Sbjct: 2173 RICGECQIGKQVKMSHQKLQHQTTSRVLELLHMDLMGPMQVESLGGKRYAYVVVDDFSRF 2352
Query: 509 VWVYFMKYKSEVFAKFKLWKAEVENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHF 568
WV F++ KSE F FK ++ + IK +RSD+G E+ + F FC GI F
Sbjct: 2353 TWVNFIREKSETFEVFKELSLRLQREKDCVIKRIRSDHGREFENSRFTEFCTSEGITHEF 2532
Query: 569 SVRKTPQQNGVAERMNRTLTEKARCLRLNACLSKCLWAATINMACYLVNRSPRASLDGKV 628
S TPQQNG+ ER NRTL E AR + L LWA +N ACY+ NR
Sbjct: 2533 SAAITPQQNGIVERKNRTLQEAARVMLHAKELPYNLWAEAMNTACYIHNRVTLRRGTPTT 2712
Query: 629 AEEVWTGNPIDLSNLRIFWRPAYVHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVK 688
E+W G + + IF P Y+ E R K+DPKS I +GY+ + Y++++
Sbjct: 2713 LYEIWKGRKPSVKHFHIFGSPCYILADREQRRKMDPKSDAGIFLGYSTNSRAYRVFNSRT 2892
Query: 689 KKVIVSRDVVFDE*SMLKQSDVTVVPDTEVEN-SSQDKIQVDIEETPVSPRQIVAQQQSE 747
+ V+ S +VV D+ S ++ DV T +N + K + E + + + Q +
Sbjct: 2893 RTVMESINVVVDDLSPARKKDVEEDVRTSGDNVADAAKSGENAENSDSATDESNINQPDK 3072
Query: 748 PGSDSGEVQDYTLVRDREPSRITPPVRYGFEDLAAYALLTSSGDPSTYHEAMAS*EKDKW 807
S + + +P+R R ++ + + S +P EA+ + W
Sbjct: 3073 RSSTRIQKMHPKELIIGDPNR-GVTTRSREVEIVSNSCFVSKIEPKNVKEALTD---EFW 3240
Query: 808 MSAMVEEMESLKKNET*NLVQLPHGKRVIGCKWVYKKKLAVT*KER--EKFKAHLVTKGY 865
++AM EE+E K+NE LV P G VIG KW++K K T +E + KA LV +GY
Sbjct: 3241 INAMQEELEQFKRNEVWELVPRPEGTNVIGTKWIFKNK---TNEEGVITRNKARLVAQGY 3411
Query: 866 SQHKGIDYDEIFSPVVRHTSIRVVLALVASMDMHLEQMDVKTTFLHGNLEEQIYIEQPEG 925
+Q +G+D+DE F+PV R SIR++L + + L QMDVK+ FL+G L E++Y+EQP+G
Sbjct: 3412 TQIEGVDFDETFAPVARLESIRLLLGVACILKFKLYQMDVKSAFLNGYLNEEVYVEQPKG 3591
Query: 926 FSETGDGRLVCKLKRSLYGLKQSPRQWYKRFDSYMLRIGYRRCDYDCCVYVMSLDDGSFI 985
F++ V +LK++LYGLKQ+PR WY+R ++ + GYR+ D ++V D + +
Sbjct: 3592 FADPTHPDHVYRLKKALYGLKQAPRAWYERLTEFLTQQGYRKGGIDKTLFVKQ-DAENLM 3768
Query: 986 FLLLYVDDMLIAANHLHDVNELKTKLGKEFDMKDLGAAKKILGMEIHKDRGAKKLWLSQK 1045
+YVDD++ + ++ EF+M +G LG+++ + + ++LSQ
Sbjct: 3769 IAQIYVDDIVFGGMSNEMLRHFVQQMQSEFEMSLVGELTYFLGLQVKQMEDS--IFLSQS 3942
Query: 1046 SYVEGVLSRFDMSKANHVSTPLTNHFKLSLEQSPKIDSEIEGMSKIPYAVQLVV*CMLWF 1105
Y + ++ +F M A+H TP H KLS +++ G S + ++ +L+
Sbjct: 3943 RYAKNIVKKFGMENASHKRTPAPTHLKLSKDEA--------GTSVDQSLYRSMIGSLLYL 4098
Query: 1106 ALDQIWHKQLVKCQVHVQAREAALGSSQVDPKILEGYNGSRYHV*QGARCCS-ISCGICG 1164
+ V QA +QV +IL+ NG+ + C + + G C
Sbjct: 4099 TASRPDITYAVGVCARYQANPKISHLTQV-KRILKYVNGTSDYGIMYCHCSNPMLVGYCD 4275
Query: 1165 L--CR*SR**KVYNRICLYSCGGPICWKSSVQSTVAMSTTEAEYMAVAEAAKEALWLTGL 1222
+ K + C Y I W S Q+ V++ST EAEY+A + + +W+ +
Sbjct: 4276 ADWAGSADDRKSTSGGCFYLGNNLISWFSKKQNCVSLSTAEAEYIAAGSSCSQLVWMKQM 4455
Query: 1223 VKELGVEQGGVQLHCDSQSAIYLTNNQVYHARTKHIDVRFHKIRELLASRQILLQKIHTS 1282
+KE VEQ + L+CD+ SAI ++ N V H+RTKHID+R H IR+L+ + I L+ + T
Sbjct: 4456 LKEYNVEQDVMTLYCDNMSAINISKNPVQHSRTKHIDIRHHYIRDLVDDKVITLKHVDTE 4635
Query: 1283 ENTTDKLTKPVTSDKF 1298
E D TK + +++F
Sbjct: 4636 EQIADIFTKALDANQF 4683
>TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, complete
Length = 4734
Score = 499 bits (1286), Expect = e-141
Identities = 335/1094 (30%), Positives = 540/1094 (48%), Gaps = 16/1094 (1%)
Frame = +1
Query: 221 RSKSKDRKTTECYSCKQIGHWKRDCPNRSGKSGN---NSSSANVVQSDGSCSEEDLLCVS 277
+ K RK C+ C + GH K C + G + +SSS + L+ +
Sbjct: 1480 QQKKSKRKKWRCHYCGKYGHIKPFCYHLHGHPHHGTQSSSSGRKMMWVPKHKIVSLVVHT 1659
Query: 278 SVKCT--DAWVLDSGCSYHMTQHREWFNSFKSGDLGYVYLGDDKPCIIKGMRQVKIALDD 335
S++ + + W LDSGCS HMT +E+ + + YV GD I GM + L
Sbjct: 1660 SLRASAKEDWYLDSGCSRHMTGVKEFLVNIEPCSTSYVTFGDGSKGKITGMGK----LVH 1827
Query: 336 GGVRTLSQVRYVLEVMKNLISLGTLHENGYSFKSEENRDILRVSKGAMTVMRAKRTAGNI 395
G+ +L++V V + NLIS+ L + G++ ++ ++ K + +M+ R+ N
Sbjct: 1828 DGLPSLNKVLLVKGLTANLISISQLCDEGFNVNFTKSECLVTNEKSEV-LMKGSRSKDNC 2004
Query: 396 YKLLGGTIMGDVASVETDDDATKLWHMRLGHLSERGMMELHKRNMLKGVRSCIIG---LC 452
Y + + +D K+WH R GHL RGM ++ + ++G+ + I +C
Sbjct: 2005 YLWTPQETSYSSTCLFSKEDEVKIWHQRFGHLHLRGMKKIIDKGAVRGIPNLKIEEGRIC 2184
Query: 453 KYCVLGKQCRVRF-KTGHHKTKGILDYVHSDVRGPTKEPSV*GFRYFVTFTDDFSRKVWV 511
C +GKQ ++ K H T +L+ +H D+ GP + S+ G RY DDFSR WV
Sbjct: 2185 GECQIGKQVKMSHQKLQHQTTSRVLELLHMDLMGPMQVESLGGKRYAYVVVDDFSRFTWV 2364
Query: 512 YFMKYKSEVFAKFKLWKAEVENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVR 571
F++ KS+ F FK ++ + IK +RSD+G E+ + F FC GI FS
Sbjct: 2365 NFIREKSDTFEVFKELSLRLQREKDCVIKRIRSDHGREFENSKFTEFCTSEGITHEFSAA 2544
Query: 572 KTPQQNGVAERMNRTLTEKARCLRLNACLSKCLWAATINMACYLVNRSPRASLDGKVAEE 631
TPQQNG+ ER NRTL E AR + L LWA +N ACY+ NR E
Sbjct: 2545 ITPQQNGIVERKNRTLQEAARVMLHAKELPYNLWAEAMNTACYIHNRVTLRRGTPTTLYE 2724
Query: 632 VWTGNPIDLSNLRIFWRPAYVHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVKKKV 691
+W G + + IF P Y+ E R K+DPKS I +GY+ + Y++++ + V
Sbjct: 2725 IWKGRKPTVKHFHIFGSPCYILADREQRRKMDPKSDAGIFLGYSTNSRAYRVFNSRTRTV 2904
Query: 692 IVSRDVVFDE*SMLKQSDVTVVPDTEVENSSQDKIQVDIEET--PVSPRQIVAQQQSEPG 749
+ S +VV D+ + ++ DV T +N + + E + + Q P
Sbjct: 2905 MESINVVVDDLTPARKKDVEEDVRTSGDNVADTAKSAENAENSDSATDEPNINQPDKRPS 3084
Query: 750 SDSGEVQDYTLVRDREPSRITPPVRYGFEDLAAYALLTSSGDPSTYHEAMAS*EKDKWMS 809
++ L+ +T R ++ + + S +P EA+ + W++
Sbjct: 3085 IRIQKMHPKELIIGDPNRGVT--TRSREIEIVSNSCFVSKIEPKNVKEALTD---EFWIN 3249
Query: 810 AMVEEMESLKKNET*NLVQLPHGKRVIGCKWVYKKKLAVT*KER--EKFKAHLVTKGYSQ 867
AM EE+E K+NE LV P G VIG KW++K K T +E + KA LV +GY+Q
Sbjct: 3250 AMQEELEQFKRNEVWELVPRPEGTNVIGTKWIFKNK---TNEEGVITRNKARLVAQGYTQ 3420
Query: 868 HKGIDYDEIFSPVVRHTSIRVVLALVASMDMHLEQMDVKTTFLHGNLEEQIYIEQPEGFS 927
+G+D+DE F+PV R SIR++L + + L QMDVK+ FL+G L E+ Y+EQP+GF
Sbjct: 3421 IEGVDFDETFAPVARLESIRLLLGVACILKFKLYQMDVKSAFLNGYLNEEAYVEQPKGFV 3600
Query: 928 ETGDGRLVCKLKRSLYGLKQSPRQWYKRFDSYMLRIGYRRCDYDCCVYVMSLDDGSFIFL 987
+ V +LK++LYGLKQ+PR WY+R ++ + GYR+ D ++V D + +
Sbjct: 3601 DPTHPDHVYRLKKALYGLKQAPRAWYERLTEFLTQQGYRKGGIDKTLFVKQ-DAENLMIA 3777
Query: 988 LLYVDDMLIAANHLHDVNELKTKLGKEFDMKDLGAAKKILGMEIHKDRGAKKLWLSQKSY 1047
+YVDD++ + ++ EF+M +G LG+++ + + ++LSQ Y
Sbjct: 3778 QIYVDDIVFGGMSNEMLRHFVQQMQSEFEMSLVGELTYFLGLQVKQMEDS--IFLSQSKY 3951
Query: 1048 VEGVLSRFDMSKANHVSTPLTNHFKLSLEQSPKIDSEIEGMSKIPYAVQLVV*CMLWFAL 1107
+ ++ +F M A+H TP H KLS +++ G S + ++ +L+
Sbjct: 3952 AKNIVKKFGMENASHKRTPAPTHLKLSKDEA--------GTSVDQSLYRSMIGSLLYLTA 4107
Query: 1108 DQIWHKQLVKCQVHVQAREAALGSSQVDPKILEGYNGSRYHV*QGARCC-SISCGICGL- 1165
+ V QA +QV +IL+ NG+ + C S+ G C
Sbjct: 4108 SRPDITYAVGVCARYQANPKISHLNQV-KRILKYVNGTSDYGIMYCHCSDSMLVGYCDAD 4284
Query: 1166 -CR*SR**KVYNRICLYSCGGPICWKSSVQSTVAMSTTEAEYMAVAEAAKEALWLTGLVK 1224
+ K + C Y I W S Q+ V++ST EAEY+A + + +W+ ++K
Sbjct: 4285 WAGSADDRKSTSGGCFYLGTNLISWFSKKQNCVSLSTAEAEYIAAGSSCSQLVWMKQMLK 4464
Query: 1225 ELGVEQGGVQLHCDSQSAIYLTNNQVYHARTKHIDVRFHKIRELLASRQILLQKIHTSEN 1284
E VEQ + L+CD+ SAI ++ N V H+RTKHID+R H IR+L+ + I L+ + T E
Sbjct: 4465 EYNVEQDVMTLYCDNMSAINISKNPVQHSRTKHIDIRHHYIRDLVDDKVITLEHVDTEEQ 4644
Query: 1285 TTDKLTKPVTSDKF 1298
D TK + +++F
Sbjct: 4645 IADIFTKALDANQF 4686
Score = 45.4 bits (106), Expect = 2e-04
Identities = 69/303 (22%), Positives = 112/303 (36%), Gaps = 50/303 (16%)
Frame = +1
Query: 20 WQRMAKDLLAQKSLQ---KALRDEKPADIATVDWNEM---KEKAAGLITLCVSDDVMNHI 73
W+ + K K L K + KP + T + +E+ KA + V ++ I
Sbjct: 118 WKAVIKGWEHPKMLDTEGKPTNELKPEEDWTKEEDELALGNSKALNALFNGVDKNIFRLI 297
Query: 74 LDLTTLKDVWDKLESLY--MSKTPMNKL-FAKQRLYSLKMQEGGDLQAHVYAFNNILADM 130
T KD W+ L++ + SK M++L + +LKM+E + I
Sbjct: 298 NTCTVAKDAWEILKTTHEGTSKVKMSRLQLLATKFENLKMKEEECIHDFHMNILEIANAC 477
Query: 131 TRLGVTVDDEDKAIILLCSLPGSYDHLVTTLTYGKD--SITLDSISSTL------LPHAQ 182
T LG + DE +L SLP +D VT + +D ++ +D + +L L
Sbjct: 478 TALGERMTDEKLVRKILRSLPKRFDMKVTAIEEAQDICNMRVDELIGSLQTFELGLSDRT 657
Query: 183 RRQSVEEGGGSSGEGLFVKGGQDRGRGKGKAV---------------------------- 214
++S S+ EG + D G AV
Sbjct: 658 EKKSKNLAFVSNDEGEEDEYDLDTDEGLTNAVVLLGKQFNKVLNRMDRRQKPHVRNIPFD 837
Query: 215 -----DSGKKKRSKSKDRKTTECYSCKQIGHWKRDCPNRSGKSGNNSSSANVVQSDGSCS 269
+ KK K K +C+ C+ GH K +CP K S V +SD + S
Sbjct: 838 IRKGSEYQKKSDEKPSHSKGIQCHGCEGYGHIKAECPTHLKKQRKGLS---VCRSDDTES 1008
Query: 270 EED 272
E++
Sbjct: 1009EQE 1017
>CF922226
Length = 667
Score = 156 bits (394), Expect = 6e-38
Identities = 89/228 (39%), Positives = 129/228 (56%), Gaps = 13/228 (5%)
Frame = -3
Query: 91 MSKTPMNKLFAKQRLYSLKMQEGGDLQAHVYAFNNILADMTRLGVTVDDEDKAIILLCSL 150
M+K+ +N+L+ KQ LYS KM E + + FN ++ D+ + VT+DDED+A++LLC L
Sbjct: 665 MTKSLVNRLYXKQSLYSFKMHEDRSVGEQLDLFNKLILDLENIDVTIDDEDQALLLLCYL 486
Query: 151 PGSYDHLVTTLTYGKDSITLDSISSTLLPHAQRRQSVEEGGGSSGEGLFVKGGQDRGRGK 210
P SY H TL +G+DS++LD + T L + + E+ +SGEGL RGK
Sbjct: 485 PKSYSHFKETLLFGRDSVSLDEV-QTALNSKELNERKEKKSSASGEGL-------TARGK 330
Query: 211 GKAVDSG-KKKRSKSKDRKTTE-------CYSCKQIGHWKRDCPNRSGKSGNN-----SS 257
DS KK+ K +++K E CY CK+ GH ++ CP R G+N S
Sbjct: 329 TFKKDSEFDKKKQKPENQKNGEGNIFKIRCYHCKKEGHTRKVCPERQKNGGSNNRKKDSG 150
Query: 258 SANVVQSDGSCSEEDLLCVSSVKCTDAWVLDSGCSYHMTQHREWFNSF 305
+A +VQ DG S E L+ VS W++DSGCS+HMT ++ WF F
Sbjct: 149 NAAIVQDDGYESAEALM-VSEKNPETKWIMDSGCSWHMTPNKSWFEQF 9
>BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vinifera},
partial (34%)
Length = 407
Score = 143 bits (361), Expect = 4e-34
Identities = 71/130 (54%), Positives = 90/130 (68%)
Frame = -2
Query: 827 VQLPHGKRVIGCKWVYKKKLAVT*KEREKFKAHLVTKGYSQHKGIDYDEIFSPVVRHTSI 886
V LP GK +GC+WVY K+ T E ++ KA LV KGY+Q GIDY + FSPV + T++
Sbjct: 406 VPLPPGKTPVGCRWVYTVKVGPT-GEVDRLKARLVAKGYTQVYGIDYCDTFSPVAKLTTV 230
Query: 887 RVVLALVASMDMHLEQMDVKTTFLHGNLEEQIYIEQPEGFSETGDGRLVCKLKRSLYGLK 946
R+ LA+ A L Q+D+K FLHG+LEE IY+EQP GF G+ LVCKL RSLYGLK
Sbjct: 229 RLFLAMAAICHWPLHQLDIKNAFLHGDLEEDIYMEQPPGFVAQGEYGLVCKLHRSLYGLK 50
Query: 947 QSPRQWYKRF 956
QSPR W+ +F
Sbjct: 49 QSPRAWFGKF 20
>AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza sativa
(japonica cultivar-group)}, partial (10%)
Length = 463
Score = 135 bits (339), Expect = 2e-31
Identities = 70/155 (45%), Positives = 101/155 (65%), Gaps = 3/155 (1%)
Frame = -3
Query: 810 AMVEEMESLKKNET*NLVQLPHGKRVIGCKWVYKKKLAVT*KERE---KFKAHLVTKGYS 866
AM EE+ ++N LV+ P VIG KWV++ KL E + KA LV KGY+
Sbjct: 458 AMQEELNQFERNNVWKLVEKPENYPVIGTKWVFRNKL----DEHGIIIRNKARLVAKGYN 291
Query: 867 QHKGIDYDEIFSPVVRHTSIRVVLALVASMDMHLEQMDVKTTFLHGNLEEQIYIEQPEGF 926
Q +GIDY+E ++PV R IR++LA V+ M+ L QMDVK+ FL+G ++E++Y+EQP GF
Sbjct: 290 QEEGIDYEETYAPVARLEVIRMLLAYVSIMNFKLYQMDVKSAFLNGLIQEEVYVEQPPGF 111
Query: 927 SETGDGRLVCKLKRSLYGLKQSPRQWYKRFDSYML 961
V KL+++LYGLKQ+PR WY+R +++L
Sbjct: 110 EIPDKPTHVYKLQKALYGLKQAPRAWYERISNFLL 6
>CO981347
Length = 624
Score = 108 bits (269), Expect(2) = 1e-29
Identities = 54/128 (42%), Positives = 74/128 (57%)
Frame = +2
Query: 520 VFAKFKLWKAEVENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVRKTPQQNGV 579
+F KF+ + NQ G K+K LR+DNG E+ + F FC + GI+RH V TP QNG+
Sbjct: 104 IF*KFRE*HTLIGNQLGTKLKVLRTDNGLEFVLEQFNEFCRKIGIKRHKIVPHTP*QNGL 283
Query: 580 AERMNRTLTEKARCLRLNACLSKCLWAATINMACYLVNRSPRASLDGKVAEEVWTGNPID 639
AERMN T+ E+ RC+ L+A L K W N YL+NR P ++L K E W+G
Sbjct: 284 AERMNMTILERVRCMLLSARLPKTFWGEAANTTSYLINRCPSSTLGFKTPMEAWSGETT* 463
Query: 640 LSNLRIFW 647
L ++ W
Sbjct: 464 LFRIKGVW 487
Score = 44.3 bits (103), Expect = 4e-04
Identities = 23/59 (38%), Positives = 36/59 (60%), Gaps = 2/59 (3%)
Frame = +3
Query: 637 PIDLSNLRIFWRPAYVHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLW--DPVKKKVIV 693
P + S L++F A+ H+ + KLD ++ C+ IGY KGVK YKLW +P + + I+
Sbjct: 456 PPNYSGLKVFGSLAFDHVK---QGKLDARAVKCVFIGYPKGVKRYKLWKLEPGETRCII 623
Score = 41.6 bits (96), Expect(2) = 1e-29
Identities = 18/36 (50%), Positives = 25/36 (69%)
Frame = +3
Query: 486 PTKEPSV*GFRYFVTFTDDFSRKVWVYFMKYKSEVF 521
P++ + G YF+T DDFSR+VW+Y +K KSE F
Sbjct: 3 PSRVKTHGGSSYFLTIIDDFSRRVWLYVLKNKSESF 110
>CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {Vitis
vinifera}, partial (34%)
Length = 409
Score = 120 bits (301), Expect = 4e-27
Identities = 65/138 (47%), Positives = 88/138 (63%)
Frame = +3
Query: 786 LTSSGDPSTYHEAMAS*EKDKWMSAMVEEMESLKKNET*NLVQLPHGKRVIGCKWVYKKK 845
L+S PST EA+ + W AMV+EM++L+ N T LV LP GK +GC+WVY K
Sbjct: 3 LSSLTVPSTIREAL---DHPGWRQAMVDEMQALENNGTWELVPLPPGKTTVGCRWVYTVK 173
Query: 846 LAVT*KEREKFKAHLVTKGYSQHKGIDYDEIFSPVVRHTSIRVVLALVASMDMHLEQMDV 905
+ K ++ KA LV KGY+Q GI+Y + FSPV T++R+ LA+ A L Q+D+
Sbjct: 174 VGPNGKV-DRLKARLVAKGYTQVYGIEYCDTFSPVFFLTTVRLFLAMAAIRHWPLHQLDI 350
Query: 906 KTTFLHGNLEEQIYIEQP 923
K FLHG+LEE IY+EQP
Sbjct: 351 KNAFLHGDLEEDIYMEQP 404
>NP005470 reverse transcriptase
Length = 267
Score = 117 bits (293), Expect = 3e-26
Identities = 54/89 (60%), Positives = 66/89 (73%)
Frame = +1
Query: 906 KTTFLHGNLEEQIYIEQPEGFSETGDGRLVCKLKRSLYGLKQSPRQWYKRFDSYMLRIGY 965
+T FLHG LEE I ++QPEGF G R V +L+RSLYGLKQSPRQWY FDS++ G+
Sbjct: 1 RTAFLHGRLEENILMKQPEGFEVQGKERYVSQLQRSLYGLKQSPRQWYMSFDSFITNQGF 180
Query: 966 RRCDYDCCVYVMSLDDGSFIFLLLYVDDM 994
+R YDCCVY ++DG I+LLLYVDDM
Sbjct: 181 KRSLYDCCVYHNKVEDGLMIYLLLYVDDM 267
>TC222253 similar to UP|Q9ZQE4 (Q9ZQE4) Copia-like retroelement pol
polyprotein, partial (4%)
Length = 919
Score = 116 bits (291), Expect = 6e-26
Identities = 63/152 (41%), Positives = 91/152 (59%), Gaps = 1/152 (0%)
Frame = +1
Query: 444 VRSCIIGLCKYCVLGKQCRVRFKTGHH-KTKGILDYVHSDVRGPTKEPSV*GFRYFVTFT 502
+ + + G+C C +GK+ R F TG + K +L VH D+ + P+ YF+TF
Sbjct: 307 IMNILDGVCDTCEIGKKHRESFPTGKSWRMKKLLKIVHLDLC-TVEIPTHGDNNYFITFI 483
Query: 503 DDFSRKVWVYFMKYKSEVFAKFKLWKAEVENQTGRKIKYLRSDNGTEYTDKNFMHFCEEN 562
DDFS+K+WVYF+K KSE FK++KA E Q G K+K L D G EY ++ F E++
Sbjct: 484 DDFSKKMWVYFLKQKSEACNAFKMFKAFAEKQNGCKVKALIIDKGQEYL--SYTIFFEKH 657
Query: 563 GIQRHFSVRKTPQQNGVAERMNRTLTEKARCL 594
GIQ + + TPQ NGV ER N+T+ + RC+
Sbjct: 658 GIQHQLTTKYTPQHNGVTERKNKTIMDMVRCM 753
>TC234303 weakly similar to UP|Q8W153 (Q8W153) Polyprotein, partial (10%)
Length = 558
Score = 113 bits (283), Expect = 5e-25
Identities = 70/177 (39%), Positives = 97/177 (54%), Gaps = 1/177 (0%)
Frame = +1
Query: 854 EKFKAHLVTKGYSQHKGIDYDEIFSPVVRHTSIRVVLALVASMDMHLEQMDVKTTFLHGN 913
++FKA LV K Y+Q G DY FSPV + + ++ ++ L +D K FLHG
Sbjct: 37 DQFKARLVAKSYTQVYGQDYTGTFSPVAKMAYVHLLWSMAVVCHWPLF*LDAKNAFLHGY 216
Query: 914 LEEQIYIEQPEGFSETGD-GRLVCKLKRSLYGLKQSPRQWYKRFDSYMLRIGYRRCDYDC 972
LEE++Y+EQP GF G+ +VC+L RS YGLKQSPR W F I Y + D
Sbjct: 217 LEEEVYMEQPLGFVAQGESSNMVCQLCRSFYGLKQSPRAW--PFLYCGAAIWYDSHEADH 390
Query: 973 CVYVMSLDDGSFIFLLLYVDDMLIAANHLHDVNELKTKLGKEFDMKDLGAAKKILGM 1029
V+ G I+L++YVDD+ I + H + LK L +F KDLG + LG+
Sbjct: 391 SVFYCHSPQGC-IYLIVYVDDIGITGSDQHGIT*LK*XLCCQFQTKDLGKLRYFLGI 558
>BM307983
Length = 406
Score = 112 bits (280), Expect = 1e-24
Identities = 58/135 (42%), Positives = 82/135 (59%), Gaps = 3/135 (2%)
Frame = +2
Query: 836 IGCKWVYKKKLAVT*KEREKFKAHLVTKGYSQHKGIDYDEIFSPV---VRHTSIRVVLAL 892
+GC+W+Y K +++KA LV KGY Q GIDY+E F+ ++ S
Sbjct: 2 VGCRWIYTVKY*AD-DTLDRYKARLVAKGYIQTYGIDYEETFAQWQK*IQSGSSSP*QQA 178
Query: 893 VASMDMHLEQMDVKTTFLHGNLEEQIYIEQPEGFSETGDGRLVCKLKRSLYGLKQSPRQW 952
+MH Q DVK FLHG+LEE++Y+E P G+ + G VC+LK++LYGLKQSPR W
Sbjct: 179 QFGWEMH--QFDVKNAFLHGSLEEEVYMEIPPGYGASNGGNKVCRLKKALYGLKQSPRAW 352
Query: 953 YKRFDSYMLRIGYRR 967
+ RF ML +GY++
Sbjct: 353 FGRFTQAMLSLGYKQ 397
>BI701169
Length = 407
Score = 109 bits (273), Expect = 7e-24
Identities = 52/97 (53%), Positives = 67/97 (68%)
Frame = +2
Query: 498 FVTFTDDFSRKVWVYFMKYKSEVFAKFKLWKAEVENQTGRKIKYLRSDNGTEYTDKNFMH 557
F F DDFSRK WVYF K+K EVF FK +KA VE ++G KIK +RSD G E+ F
Sbjct: 113 FFLFIDDFSRKTWVYFFKHKLEVFENFKKFKAIVEKESGFKIKAMRSDRGGEF*SNEFQK 292
Query: 558 FCEENGIQRHFSVRKTPQQNGVAERMNRTLTEKARCL 594
+C+++GI+R V ++PQQNGVAER NRT+ AR +
Sbjct: 293 YCDDHGIRRPLMVLRSPQQNGVAERKNRTILNMARSM 403
>CA937893 similar to GP|20805072|dbj retrovirus-related pol polyprotein from
transposon TNT 1-94-like, partial (7%)
Length = 412
Score = 109 bits (272), Expect = 9e-24
Identities = 53/104 (50%), Positives = 70/104 (66%)
Frame = -2
Query: 3 GAKFEVTRFDGTGNFGLWQRMAKDLLAQKSLQKALRDEKPADIATVDWNEMKEKAAGLIT 62
GAKFEV +FDGTGNF LWQ+ KDLLA + L K LRD K + +DW E+ E+ A I
Sbjct: 315 GAKFEVGKFDGTGNFRLWQKRVKDLLA*QGLLKVLRDSKSNNTEALDWEEL*ERTATTIR 136
Query: 63 LCVSDDVMNHILDLTTLKDVWDKLESLYMSKTPMNKLFAKQRLY 106
LC+ D+ + H+++L +VW KLES YM K+ NKL+ Q+LY
Sbjct: 135 LCLVDEFLYHMMELAFPGEVWKKLESQYMLKSLTNKLYLMQKLY 4
>CO981879
Length = 576
Score = 70.9 bits (172), Expect(2) = 1e-23
Identities = 43/102 (42%), Positives = 52/102 (50%)
Frame = -1
Query: 524 FKLWKAEVENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVRKTPQQNGVAERM 583
FK + ++ Q KIK RSDNG EY +K+ ENGI S TPQQNGVAER
Sbjct: 567 FKTFFQMIQTQFQVKIKVFRSDNGREYFNKHLSKXXLENGIIHQSSCVDTPQQNGVAERK 388
Query: 584 NRTLTEKARCLRLNACLSKCLWAATINMACYLVNRSPRASLD 625
NR L E AR L K W I YL N++ +L+
Sbjct: 387 NRHLXEVARALLFQNKAPKYXWGEAILTGTYLKNKNA*QNLE 262
Score = 58.5 bits (140), Expect(2) = 1e-23
Identities = 26/59 (44%), Positives = 37/59 (62%)
Frame = -2
Query: 643 LRIFWRPAYVHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVKKKVIVSRDVVFDE 701
L+IF +VHI ++ KL+P++K C+ +GY KGYK +DP KK V+ DV F E
Sbjct: 194 LKIFGCTVFVHIHEPNQGKLEPRAKKCVFVGYAPNQKGYKCFDPTSKKTFVTIDVTFFE 18
>AI959950
Length = 466
Score = 107 bits (266), Expect = 5e-23
Identities = 57/133 (42%), Positives = 84/133 (62%)
Frame = -1
Query: 808 MSAMVEEMESLKKNET*NLVQLPHGKRVIGCKWVYKKKLAVT*KEREKFKAHLVTKGYSQ 867
M AM EE++ +KN * LV+LP K+V+G KW++ KL K ++KA LV KGYSQ
Sbjct: 397 MKAMQEELDQFQKNNV*KLVKLPKRKKVVGVKWIFCNKLDEDGKV-VRYKARLVAKGYSQ 221
Query: 868 HKGIDYDEIFSPVVRHTSIRVVLALVASMDMHLEQMDVKTTFLHGNLEEQIYIEQPEGFS 927
+GIDY + F+ V R I ++L+ +M L QMDVK+ FL+G +++++Y+EQP GF
Sbjct: 220 QEGIDYPKTFALVARLEVICILLSFATYSNMKLYQMDVKSAFLNGLIQKEVYVEQPPGFE 41
Query: 928 ETGDGRLVCKLKR 940
+ V KL +
Sbjct: 40 NETLHQHVFKLNK 2
>TC235104 weakly similar to UP|Q850H8 (Q850H8) Gag-pol polyprotein
(Fragment), partial (28%)
Length = 865
Score = 103 bits (258), Expect = 4e-22
Identities = 70/183 (38%), Positives = 95/183 (51%), Gaps = 2/183 (1%)
Frame = +2
Query: 418 KLWHMRLGH--LSERGMMELHKRNMLKGVRSCIIGLCKYCVLGKQCRVRFKTGHHKTKGI 475
KL H RLGH LS+ +M + +K + C+ C LGK R + +
Sbjct: 308 KLLHERLGHPHLSKLKIM-VPSLEKIKDL------FCESCQLGKHVRSSXRHVESRVDSP 466
Query: 476 LDYVHSDVRGPTKEPSV*GFRYFVTFTDDFSRKVWVYFMKYKSEVFAKFKLWKAEVENQT 535
+H D+ GP + S+ +RYFVTF D+FS+ V+ MK +SE+ + F +++ Q
Sbjct: 467 FLVIHXDIWGPNRVSSM-SYRYFVTFIDEFSQCTRVFLMKERSEILS-FLTSVNKIKTQF 640
Query: 536 GRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVRKTPQQNGVAERMNRTLTEKARCLR 595
G+ IK LRSDN EY F GI FS TPQQN +AER NR L E AR L
Sbjct: 641 GKTIKILRSDNAKEYFSSVISPFXSAQGILHQFSCPHTPQQNDIAERKNRHLVETARTLL 820
Query: 596 LNA 598
L+A
Sbjct: 821 LHA 829
>CO983516
Length = 724
Score = 99.0 bits (245), Expect = 1e-20
Identities = 50/122 (40%), Positives = 77/122 (62%)
Frame = +2
Query: 874 DEIFSPVVRHTSIRVVLALVASMDMHLEQMDVKTTFLHGNLEEQIYIEQPEGFSETGDGR 933
D+ F PV R SIR++L + + L QMDVK+ FL+G L E++Y+EQP+GF +
Sbjct: 356 DKEFHPVARLESIRLLLGVACILKFKLYQMDVKSAFLNGYLNEEVYVEQPKGFIDPTHPD 535
Query: 934 LVCKLKRSLYGLKQSPRQWYKRFDSYMLRIGYRRCDYDCCVYVMSLDDGSFIFLLLYVDD 993
V +LK++LYGLKQ+PR WY+R + + GYR+ D ++V D + + +YVDD
Sbjct: 536 HVYRLKKALYGLKQAPRAWYERLTELLTQQGYRKGGIDKTLFVKQ-DAENLMIAQIYVDD 712
Query: 994 ML 995
++
Sbjct: 713 IV 718
>TC213365 similar to UP|Q6I8L6 (Q6I8L6) Reverse transcriptase (Fragment),
partial (10%)
Length = 440
Score = 96.7 bits (239), Expect = 6e-20
Identities = 51/102 (50%), Positives = 66/102 (64%)
Frame = +3
Query: 1197 VAMSTTEAEYMAVAEAAKEALWLTGLVKELGVEQGGVQLHCDSQSAIYLTNNQVYHARTK 1256
VA+STTEA+YMA+ EAAKE +WL GL+ +L + Q ++ DS SAI +QV+H RTK
Sbjct: 9 VALSTTEAKYMALTEAAKEGIWLRGLINDLRINQEYANIYYDSLSAICFAKDQVHHDRTK 188
Query: 1257 HIDVRFHKIRELLASRQILLQKIHTSENTTDKLTKPVTSDKF 1298
HIDVR+H IR + R+I + KI T N D K V KF
Sbjct: 189 HIDVRYHFIR---SERRIKVHKISTLHNPADMFPKLVPKSKF 305
>TC232995
Length = 1009
Score = 92.4 bits (228), Expect = 1e-18
Identities = 47/159 (29%), Positives = 88/159 (54%)
Frame = +2
Query: 920 IEQPEGFSETGDGRLVCKLKRSLYGLKQSPRQWYKRFDSYMLRIGYRRCDYDCCVYVMSL 979
+EQP GF + V KL+++LYGLKQ+PR WY+R +++L + R D +++
Sbjct: 2 VEQPPGFEISDKPNHVYKLQKALYGLKQAPRAWYERLSNFLLEKEFSRGKVDTTLFI-KR 178
Query: 980 DDGSFIFLLLYVDDMLIAANHLHDVNELKTKLGKEFDMKDLGAAKKILGMEIHKDRGAKK 1039
+ + +YVDD++ + + E + EF+M +G K LG++I + +
Sbjct: 179 KHNDILLVQIYVDDIIFGSTNDSLCKEFSLDMQSEFEMSMMGELKYFLGLQIKQTQ*G-- 352
Query: 1040 LWLSQKSYVEGVLSRFDMSKANHVSTPLTNHFKLSLEQS 1078
++++Q Y + ++ RF M A H+STP++ + L ++S
Sbjct: 353 IFINQSKYCKELIKRFGMDSAKHMSTPMSTNCYLDKDES 469
>TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Fragment), partial
(30%)
Length = 687
Score = 92.0 bits (227), Expect = 1e-18
Identities = 45/120 (37%), Positives = 77/120 (63%), Gaps = 1/120 (0%)
Frame = +2
Query: 1179 CLYSCGGPICWKSSVQSTVAMSTTEAEYMAVAEAAKEALWLTGLVKELGV-EQGGVQLHC 1237
C++ G + WKS Q+ VA S+ EAEY ++A E +W+ ++EL E+ ++L+C
Sbjct: 83 CVFIGGNLVSWKSKKQTVVARSSAEAEYRSMAMVTCELMWIKQFLQELRFCEELQMKLYC 262
Query: 1238 DSQSAIYLTNNQVYHARTKHIDVRFHKIRELLASRQILLQKIHTSENTTDKLTKPVTSDK 1297
D+Q+A+++ +N V+H RTKHI++ H IRE L S++I+ + I +++ D LTK + K
Sbjct: 263 DNQAALHIASNPVFHERTKHIEIDCHFIREKLLSKEIVTEFIGSNDQPVDILTKSLRGPK 442
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.323 0.138 0.419
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 57,975,398
Number of Sequences: 63676
Number of extensions: 840875
Number of successful extensions: 4602
Number of sequences better than 10.0: 166
Number of HSP's better than 10.0 without gapping: 4375
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 4523
length of query: 1307
length of database: 12,639,632
effective HSP length: 108
effective length of query: 1199
effective length of database: 5,762,624
effective search space: 6909386176
effective search space used: 6909386176
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 65 (29.6 bits)
Lotus: description of TM0003.6