
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0007.12
(1387 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei... 467 e-131
RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protei... 464 e-130
RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protei... 464 e-130
POL3_DROME (P04323) Retrovirus-related Pol polyprotein from tran... 349 3e-95
POL2_DROME (P20825) Retrovirus-related Pol polyprotein from tran... 335 5e-91
POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from tran... 300 1e-80
POLY_DROME (P10401) Retrovirus-related Pol polyprotein from tran... 289 3e-77
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III 287 1e-76
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran... 269 4e-71
POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic pro... 162 6e-39
POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic prot... 160 2e-38
POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic pro... 160 3e-38
POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic pro... 160 3e-38
POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic pro... 159 5e-38
POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic pro... 158 9e-38
POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;... 143 4e-33
POL_FENV1 (P31792) Pol polyprotein [Contains: Reverse transcript... 140 2e-32
POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic prot... 138 9e-32
POL_BAEVM (P10272) Pol polyprotein [Contains: Protease (EC 3.4.2... 134 2e-30
M860_ARATH (P92523) Hypothetical mitochondrial protein AtMg00860... 132 7e-30
>RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protein
type 1
Length = 1333
Score = 467 bits (1202), Expect = e-131
Identities = 285/870 (32%), Positives = 449/870 (50%), Gaps = 37/870 (4%)
Query: 544 PTRFHDHHINLLPNTPPVNVRPYRYPHSQKEAMATILTDMLQEGIVVPSTSPYSSPVLLI 603
P + + + L + +R Y P + +AM + L+ GI+ S + + PV+ +
Sbjct: 396 PIKGLEFEVELTQENYRLPIRNYPLPPGKMQAMNDEINQGLKSGIIRESKAINACPVMFV 455
Query: 604 KKKDGTWRFCVDFRSLNSITIKDRFPIPTIDELLDELGGASHFSKLDLRSGFHQIRLATE 663
KK+GT R VD++ LN + +P+P I++LL ++ G++ F+KLDL+S +H IR+
Sbjct: 456 PKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKG 515
Query: 664 DTHKTAFRTVDGHYEFLVMPFGLTNAPSTFQAAMNDLLRPFLRRFVLVFFDDILVYSPSL 723
D HK AFR G +E+LVMP+G++ AP+ FQ +N +L V+ + DDIL++S S
Sbjct: 516 DEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINTILGEAKESHVVCYMDDILIHSKSE 575
Query: 724 SAHMTHLKEVLEVLLSHKFYAKLSKCIFGVTSVSYLGHIISANGVGPDPSKVQAIVDWPV 783
S H+ H+K+VL+ L + +KC F + V ++G+ IS G P + ++ W
Sbjct: 576 SEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQ 635
Query: 784 PRNLTALRGFLGLTGFYRRFIKNYAAHASHLTDLLKQDQKDKHTLLPWSSAAADSFQHLK 843
P+N LR FLG + R+FI + L +LLK+D + K W+ + +++K
Sbjct: 636 PKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWK-----WTPTQTQAIENIK 690
Query: 844 DLIISAPVLVLPDFSATFDIETDASGTAVGAVLSQKG-----HPISFFSKKLTLQMQHQS 898
++S PVL DFS +ETDAS AVGAVLSQK +P+ ++S K++ + S
Sbjct: 691 QCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYS 750
Query: 899 TYVREMYAVTEAVKKWRQYLIG--HKFRIYTDQQSLKHLMTQTFQTPDQ--IKWATKLLG 954
+EM A+ +++K WR YL F+I TD ++L +T + ++ +W L
Sbjct: 751 VSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQD 810
Query: 955 FDYEIFYKPGSENRVADALSRCHSSELPLLAAISSPVPEIITQLK-----------QYYK 1003
F++EI Y+PGS N +ADALSR P+ + Q+ +Y
Sbjct: 811 FNFEINYRPGSANHIADALSRIVDETEPIPKDSEDNSINFVNQISITDDFKNQVVTEYTN 870
Query: 1004 TPEGIQLI--VDKSTLPHFRVHHEVLY-FKDGLFVPEHDQWRTSILSEYHASPAAGHSGL 1060
+ + L+ DK + ++ +L KD + +P Q +I+ +YH H G+
Sbjct: 871 DTKLLNLLNNEDKRVEENIQLKDGLLINSKDQILLPNDTQLTRTIIKKYHEEGKLIHPGI 930
Query: 1061 KPTLARLMASFNWPGIQTETKTFIKQCLPCQYNKYVPAKKSGLLQPLPTPAQIWEDISMD 1120
+ ++ F W GI+ + + +++ C CQ NK K G LQP+P + WE +SMD
Sbjct: 931 ELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLSMD 990
Query: 1121 FITGLPPSHGHTVAWVIVDRLSKYAHFVALPANFTATSLANRFSSEICRLHGIPRSIVSD 1180
FIT LP S G+ +V+VDR SK A V + TA A F + G P+ I++D
Sbjct: 991 FITALPESSGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIAD 1050
Query: 1181 RDKIFLSHFWRDLFRVYGTKLRFSTAYHPETDGQTEVVNRGLETYLRCFAGEQPRSWYKF 1240
D IF S W+D Y ++FS Y P+TDGQTE N+ +E LRC P +W
Sbjct: 1051 NDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDH 1110
Query: 1241 LHLAELWYNTSFHSAAGMTPFQAVYGRPPPSLLAYVPGSSAIQSLDESLQQRT*ILESLK 1300
+ L + YN + HSA MTPF+ V+ P +P S DE+ Q+ + +++K
Sbjct: 1111 ISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLELPSFS--DKTDENSQETIQVFQTVK 1168
Query: 1301 ANLQRAQHRMKIQKDKSRREV-TFEENAWVLLRLQPYRQRSLAHRLSNKLAKRFYGPFRV 1359
+L +MK D +E+ F+ V+++ R ++ SNKLA F GPF V
Sbjct: 1169 EHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVK----RTKTGFLHKSNKLAPSFAGPFYV 1224
Query: 1360 KRRIGSVAYELDLPPTSK--LHNVFHVSLL 1387
++ G YELDLP + K + FHVS L
Sbjct: 1225 LQKSGPNNYELDLPDSIKHMFSSTFHVSHL 1254
>RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protein
type 3
Length = 1333
Score = 464 bits (1195), Expect = e-130
Identities = 284/870 (32%), Positives = 449/870 (50%), Gaps = 37/870 (4%)
Query: 544 PTRFHDHHINLLPNTPPVNVRPYRYPHSQKEAMATILTDMLQEGIVVPSTSPYSSPVLLI 603
P + + + L + +R Y P + +AM + L+ GI+ S + + PV+ +
Sbjct: 396 PIKGLEFEVELTQENYRLPIRNYPLPPGKMQAMNDEINQGLKSGIIRESKAINACPVMFV 455
Query: 604 KKKDGTWRFCVDFRSLNSITIKDRFPIPTIDELLDELGGASHFSKLDLRSGFHQIRLATE 663
KK+GT R VD++ LN + +P+P I++LL ++ G++ F+KLDL+S +H IR+
Sbjct: 456 PKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKG 515
Query: 664 DTHKTAFRTVDGHYEFLVMPFGLTNAPSTFQAAMNDLLRPFLRRFVLVFFDDILVYSPSL 723
D HK AFR G +E+LVMP+G++ AP+ FQ +N +L V+ + D+IL++S S
Sbjct: 516 DEHKLAFRCPRGVFEYLVMPYGISIAPAHFQYFINTILGEVKESHVVCYMDNILIHSKSE 575
Query: 724 SAHMTHLKEVLEVLLSHKFYAKLSKCIFGVTSVSYLGHIISANGVGPDPSKVQAIVDWPV 783
S H+ H+K+VL+ L + +KC F + V ++G+ IS G P + ++ W
Sbjct: 576 SEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQ 635
Query: 784 PRNLTALRGFLGLTGFYRRFIKNYAAHASHLTDLLKQDQKDKHTLLPWSSAAADSFQHLK 843
P+N LR FLG + R+FI + L +LLK+D + K W+ + +++K
Sbjct: 636 PKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWK-----WTPTQTQAIENIK 690
Query: 844 DLIISAPVLVLPDFSATFDIETDASGTAVGAVLSQKG-----HPISFFSKKLTLQMQHQS 898
++S PVL DFS +ETDAS AVGAVLSQK +P+ ++S K++ + S
Sbjct: 691 QCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYS 750
Query: 899 TYVREMYAVTEAVKKWRQYLIG--HKFRIYTDQQSLKHLMTQTFQTPDQ--IKWATKLLG 954
+EM A+ +++K WR YL F+I TD ++L +T + ++ +W L
Sbjct: 751 VSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQD 810
Query: 955 FDYEIFYKPGSENRVADALSRCHSSELPLLAAISSPVPEIITQLK-----------QYYK 1003
F++EI Y+PGS N +ADALSR P+ + Q+ +Y
Sbjct: 811 FNFEINYRPGSANHIADALSRIVDETEPIPKDSEDNSINFVNQISITDDFKNQVVTEYTN 870
Query: 1004 TPEGIQLI--VDKSTLPHFRVHHEVLY-FKDGLFVPEHDQWRTSILSEYHASPAAGHSGL 1060
+ + L+ DK + ++ +L KD + +P Q +I+ +YH H G+
Sbjct: 871 DTKLLNLLNNEDKRVEENIQLKDGLLINSKDQILLPNDTQLTRTIIKKYHEEGKLIHPGI 930
Query: 1061 KPTLARLMASFNWPGIQTETKTFIKQCLPCQYNKYVPAKKSGLLQPLPTPAQIWEDISMD 1120
+ ++ F W GI+ + + +++ C CQ NK K G LQP+P + WE +SMD
Sbjct: 931 ELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLSMD 990
Query: 1121 FITGLPPSHGHTVAWVIVDRLSKYAHFVALPANFTATSLANRFSSEICRLHGIPRSIVSD 1180
FIT LP S G+ +V+VDR SK A V + TA A F + G P+ I++D
Sbjct: 991 FITALPESSGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIAD 1050
Query: 1181 RDKIFLSHFWRDLFRVYGTKLRFSTAYHPETDGQTEVVNRGLETYLRCFAGEQPRSWYKF 1240
D IF S W+D Y ++FS Y P+TDGQTE N+ +E LRC P +W
Sbjct: 1051 NDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDH 1110
Query: 1241 LHLAELWYNTSFHSAAGMTPFQAVYGRPPPSLLAYVPGSSAIQSLDESLQQRT*ILESLK 1300
+ L + YN + HSA MTPF+ V+ P +P S DE+ Q+ + +++K
Sbjct: 1111 ISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLELPSFS--DKTDENSQETIQVFQTVK 1168
Query: 1301 ANLQRAQHRMKIQKDKSRREV-TFEENAWVLLRLQPYRQRSLAHRLSNKLAKRFYGPFRV 1359
+L +MK D +E+ F+ V+++ R ++ SNKLA F GPF V
Sbjct: 1169 EHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVK----RTKTGFLHKSNKLAPSFAGPFYV 1224
Query: 1360 KRRIGSVAYELDLPPTSK--LHNVFHVSLL 1387
++ G YELDLP + K + FHVS L
Sbjct: 1225 LQKSGPNNYELDLPDSIKHMFSSTFHVSHL 1254
>RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protein
type 2
Length = 1333
Score = 464 bits (1195), Expect = e-130
Identities = 284/870 (32%), Positives = 449/870 (50%), Gaps = 37/870 (4%)
Query: 544 PTRFHDHHINLLPNTPPVNVRPYRYPHSQKEAMATILTDMLQEGIVVPSTSPYSSPVLLI 603
P + + + L + +R Y P + +AM + L+ GI+ S + + PV+ +
Sbjct: 396 PIKGLEFEVELTQENYRLPIRNYPLPPGKMQAMNDEINQGLKSGIIRESKAINACPVMFV 455
Query: 604 KKKDGTWRFCVDFRSLNSITIKDRFPIPTIDELLDELGGASHFSKLDLRSGFHQIRLATE 663
KK+GT R VD++ LN + +P+P I++LL ++ G++ F+KLDL+S +H IR+
Sbjct: 456 PKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKG 515
Query: 664 DTHKTAFRTVDGHYEFLVMPFGLTNAPSTFQAAMNDLLRPFLRRFVLVFFDDILVYSPSL 723
D HK AFR G +E+LVMP+G++ AP+ FQ +N +L V+ + D+IL++S S
Sbjct: 516 DEHKLAFRCPRGVFEYLVMPYGISIAPAHFQYFINTILGEVKESHVVCYMDNILIHSKSE 575
Query: 724 SAHMTHLKEVLEVLLSHKFYAKLSKCIFGVTSVSYLGHIISANGVGPDPSKVQAIVDWPV 783
S H+ H+K+VL+ L + +KC F + V ++G+ IS G P + ++ W
Sbjct: 576 SEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQ 635
Query: 784 PRNLTALRGFLGLTGFYRRFIKNYAAHASHLTDLLKQDQKDKHTLLPWSSAAADSFQHLK 843
P+N LR FLG + R+FI + L +LLK+D + K W+ + +++K
Sbjct: 636 PKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWK-----WTPTQTQAIENIK 690
Query: 844 DLIISAPVLVLPDFSATFDIETDASGTAVGAVLSQKG-----HPISFFSKKLTLQMQHQS 898
++S PVL DFS +ETDAS AVGAVLSQK +P+ ++S K++ + S
Sbjct: 691 QCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYS 750
Query: 899 TYVREMYAVTEAVKKWRQYLIG--HKFRIYTDQQSLKHLMTQTFQTPDQ--IKWATKLLG 954
+EM A+ +++K WR YL F+I TD ++L +T + ++ +W L
Sbjct: 751 VSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQD 810
Query: 955 FDYEIFYKPGSENRVADALSRCHSSELPLLAAISSPVPEIITQLK-----------QYYK 1003
F++EI Y+PGS N +ADALSR P+ + Q+ +Y
Sbjct: 811 FNFEINYRPGSANHIADALSRIVDETEPIPKDSEDNSINFVNQISITDDFKNQVVTEYTN 870
Query: 1004 TPEGIQLI--VDKSTLPHFRVHHEVLY-FKDGLFVPEHDQWRTSILSEYHASPAAGHSGL 1060
+ + L+ DK + ++ +L KD + +P Q +I+ +YH H G+
Sbjct: 871 DTKLLNLLNNEDKRVEENIQLKDGLLINSKDQILLPNDTQLTRTIIKKYHEEGKLIHPGI 930
Query: 1061 KPTLARLMASFNWPGIQTETKTFIKQCLPCQYNKYVPAKKSGLLQPLPTPAQIWEDISMD 1120
+ ++ F W GI+ + + +++ C CQ NK K G LQP+P + WE +SMD
Sbjct: 931 ELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLSMD 990
Query: 1121 FITGLPPSHGHTVAWVIVDRLSKYAHFVALPANFTATSLANRFSSEICRLHGIPRSIVSD 1180
FIT LP S G+ +V+VDR SK A V + TA A F + G P+ I++D
Sbjct: 991 FITALPESSGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIAD 1050
Query: 1181 RDKIFLSHFWRDLFRVYGTKLRFSTAYHPETDGQTEVVNRGLETYLRCFAGEQPRSWYKF 1240
D IF S W+D Y ++FS Y P+TDGQTE N+ +E LRC P +W
Sbjct: 1051 NDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDH 1110
Query: 1241 LHLAELWYNTSFHSAAGMTPFQAVYGRPPPSLLAYVPGSSAIQSLDESLQQRT*ILESLK 1300
+ L + YN + HSA MTPF+ V+ P +P S DE+ Q+ + +++K
Sbjct: 1111 ISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLELPSFS--DKTDENSQETIQVFQTVK 1168
Query: 1301 ANLQRAQHRMKIQKDKSRREV-TFEENAWVLLRLQPYRQRSLAHRLSNKLAKRFYGPFRV 1359
+L +MK D +E+ F+ V+++ R ++ SNKLA F GPF V
Sbjct: 1169 EHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVK----RTKTGFLHKSNKLAPSFAGPFYV 1224
Query: 1360 KRRIGSVAYELDLPPTSK--LHNVFHVSLL 1387
++ G YELDLP + K + FHVS L
Sbjct: 1225 LQKSGPNNYELDLPDSIKHMFSSTFHVSHL 1254
>POL3_DROME (P04323) Retrovirus-related Pol polyprotein from
transposon 17.6 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1058
Score = 349 bits (895), Expect = 3e-95
Identities = 193/462 (41%), Positives = 269/462 (57%), Gaps = 12/462 (2%)
Query: 525 LWELLQSYAPV-FSTPHGLPPTRFHDHHINLLPNTPPVNVRPYRYPHSQKEAMATILTDM 583
L LLQ Y + + L T H IN N P + Y YP + ++ + + + DM
Sbjct: 173 LCALLQKYHDIQYHEGDKLTFTNQTKHTINTKHNLPLYS--KYSYPQAYEQEVESQIQDM 230
Query: 584 LQEGIVVPSTSPYSSPVLLIKKKDGT-----WRFCVDFRSLNSITIKDRFPIPTIDELLD 638
L +GI+ S SPY+SP+ ++ KK +R +D+R LN IT+ DR PIP +DE+L
Sbjct: 231 LNQGIIRTSNSPYNSPIWVVPKKQDASGKQKFRIVIDYRKLNEITVGDRHPIPNMDEILG 290
Query: 639 ELGGASHFSKLDLRSGFHQIRLATEDTHKTAFRTVDGHYEFLVMPFGLTNAPSTFQAAMN 698
+LG ++F+ +DL GFHQI + E KTAF T GHYE+L MPFGL NAP+TFQ MN
Sbjct: 291 KLGRCNYFTTIDLAKGFHQIEMDPESVSKTAFSTKHGHYEYLRMPFGLKNAPATFQRCMN 350
Query: 699 DLLRPFLRRFVLVFFDDILVYSPSLSAHMTHLKEVLEVLLSHKFYAKLSKCIFGVTSVSY 758
D+LRP L + LV+ DDI+V+S SL H+ L V E L +L KC F ++
Sbjct: 351 DILRPLLNKHCLVYLDDIIVFSTSLDEHLQSLGLVFEKLAKANLKLQLDKCEFLKQETTF 410
Query: 759 LGHIISANGVGPDPSKVQAIVDWPVPRNLTALRGFLGLTGFYRRFIKNYAAHASHLTDLL 818
LGH+++ +G+ P+P K++AI +P+P ++ FLGLTG+YR+FI N+A A +T L
Sbjct: 411 LGHVLTPDGIKPNPEKIEAIQKYPIPTKPKEIKAFLGLTGYYRKFIPNFADIAKPMTKCL 470
Query: 819 KQDQKDKHTLLPWSSAAADSFQHLKDLIISAPVLVLPDFSATFDIETDASGTAVGAVLSQ 878
K++ K T + SA F+ LK LI P+L +PDF+ F + TDAS A+GAVLSQ
Sbjct: 471 KKNMKIDTTNPEYDSA----FKKLKYLISEDPILKVPDFTKKFTLTTDASDVALGAVLSQ 526
Query: 879 KGHPISFFSKKLTLQMQHQSTYVREMYAVTEAVKKWRQYLIGHKFRIYTDQQSLKHLMTQ 938
GHP+S+ S+ L + ST +E+ A+ A K +R YL+G F I +D Q L L
Sbjct: 527 DGHPLSYISRTLNEHEINYSTIEKELLAIVWATKTFRHYLLGRHFEISSDHQPLSWLYRM 586
Query: 939 TFQTPDQIKWATKLLGFDYEIFYKPGSENRVADALSRCHSSE 980
+W KL FD++I Y G EN VADALSR E
Sbjct: 587 KDPNSKLTRWRVKLSEFDFDIKYIKGKENCVADALSRIKLEE 628
>POL2_DROME (P20825) Retrovirus-related Pol polyprotein from
transposon 297 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1059
Score = 335 bits (859), Expect = 5e-91
Identities = 181/436 (41%), Positives = 258/436 (58%), Gaps = 10/436 (2%)
Query: 550 HHINLLPNTPPVNVRPYRYPHSQKEAMATILTDMLQEGIVVPSTSPYSSPVLLIKKKDGT 609
H +N N+P + + Y + + + + +ML +G++ S SPY+SP ++ KK
Sbjct: 197 HVLNTTHNSP-IYSKQYPLAQTHEIEVENQVQEMLNQGLIRESNSPYNSPTWVVPKKPDA 255
Query: 610 -----WRFCVDFRSLNSITIKDRFPIPTIDELLDELGGASHFSKLDLRSGFHQIRLATED 664
+R +D+R LN ITI DR+PIP +DE+L +LG +F+ +DL GFHQI + E
Sbjct: 256 SGANKYRVVIDYRKLNEITIPDRYPIPNMDEILGKLGKCQYFTTIDLAKGFHQIEMDEES 315
Query: 665 THKTAFRTVDGHYEFLVMPFGLTNAPSTFQAAMNDLLRPFLRRFVLVFFDDILVYSPSLS 724
KTAF T GHYE+L MPFGL NAP+TFQ MN++LRP L + LV+ DDI+++S SL+
Sbjct: 316 ISKTAFSTKSGHYEYLRMPFGLRNAPATFQRCMNNILRPLLNKHCLVYLDDIIIFSTSLT 375
Query: 725 AHMTHLKEVLEVLLSHKFYAKLSKCIFGVTSVSYLGHIISANGVGPDPSKVQAIVDWPVP 784
H+ ++ V L +L KC F ++LGHI++ +G+ P+P KV+AIV +P+P
Sbjct: 376 EHLNSIQLVFTKLADANLKLQLDKCEFLKKEANFLGHIVTPDGIKPNPIKVKAIVSYPIP 435
Query: 785 RNLTALRGFLGLTGFYRRFIKNYAAHASHLTDLLKQDQKDKHTLLPWSSAAADSFQHLKD 844
+R FLGLTG+YR+FI NYA A +T LK+ K L + ++F+ LK
Sbjct: 436 TKDKEIRAFLGLTGYYRKFIPNYADIAKPMTSCLKKRTKIDTQKLEY----IEAFEKLKA 491
Query: 845 LIISAPVLVLPDFSATFDIETDASGTAVGAVLSQKGHPISFFSKKLTLQMQHQSTYVREM 904
LII P+L LPDF F + TDAS A+GAVLSQ GHPISF S+ L + S +E+
Sbjct: 492 LIIRDPILQLPDFEKKFVLTTDASNLALGAVLSQNGHPISFISRTLNDHELNYSAIEKEL 551
Query: 905 YAVTEAVKKWRQYLIGHKFRIYTDQQSLKHLMTQTFQTPDQIKWATKLLGFDYEIFYKPG 964
A+ A K +R YL+G +F I +D Q L+ L +W +L + ++I Y G
Sbjct: 552 LAIVWATKTFRHYLLGRQFLIASDHQPLRWLHNLKEPGAKLERWRVRLSEYQFKIDYIKG 611
Query: 965 SENRVADALSRCHSSE 980
EN VADALSR E
Sbjct: 612 KENSVADALSRIKIEE 627
>POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from
transposon opus [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1003
Score = 300 bits (769), Expect = 1e-80
Identities = 166/433 (38%), Positives = 250/433 (57%), Gaps = 18/433 (4%)
Query: 560 PVNVRPYRYPHSQKEAMATILTDMLQEGIVVPSTSPYSSPVLLIKKK-----DGTWRFCV 614
P+ + Y YP + + + + ++LQ+GI+ PS SPY+SP+ ++ KK + +R V
Sbjct: 123 PIYAKSYPYPVNMRGEVERQIDELLQDGIIRPSNSPYNSPIWIVPKKPKPNGEKQYRMVV 182
Query: 615 DFRSLNSITIKDRFPIPTIDELLDELGGASHFSKLDLRSGFHQIRLATEDTHKTAFRTVD 674
DF+ LN++TI D +PIP I+ L LG A +F+ LDL SGFHQI + D KTAF T++
Sbjct: 183 DFKRLNTVTIPDTYPIPDINATLASLGNAKYFTTLDLTSGFHQIHMKESDIPKTAFSTLN 242
Query: 675 GHYEFLVMPFGLTNAPSTFQAAMNDLLRPFLRRFVLVFFDDILVYSPSLSAHMTHLKEVL 734
G YEFL +PFGL NAP+ FQ ++D+LR + + V+ DDI+V+S H +L+ VL
Sbjct: 243 GKYEFLRLPFGLKNAPAIFQRMIDDILREHIGKVCYVYIDDIIVFSEDYDTHWKNLRLVL 302
Query: 735 EVLLSHKFYAKLSKCIFGVTSVSYLGHIISANGVGPDPSKVQAIVDWPVPRNLTALRGFL 794
L L K F T V +LG+I++A+G+ DP KV+AI + P P ++ L+ FL
Sbjct: 303 ASLSKANLQVNLEKSHFLDTQVEFLGYIVTADGIKADPKKVRAISEMPPPTSVKELKRFL 362
Query: 795 GLTGFYRRFIKNYAAHASHLTDL-------LKQDQKDKHTLLPWSSAAADSFQHLKDLII 847
G+T +YR+FI++YA A LT+L +K Q K + A SF LK ++
Sbjct: 363 GMTSYYRKFIQDYAKVAKPLTNLTRGLYANIKSSQSSK-VPITLDETALQSFNDLKSILC 421
Query: 848 SAPVLVLPDFSATFDIETDASGTAVGAVLSQ----KGHPISFFSKKLTLQMQHQSTYVRE 903
S+ +L P F+ F + TDAS A+GAVLSQ + PI++ S+ L ++ +T +E
Sbjct: 422 SSEILAFPCFTKPFHLTTDASNWAIGAVLSQDDQGRDRPIAYISRSLNKTEENYATIEKE 481
Query: 904 MYAVTEAVKKWRQYLIG-HKFRIYTDQQSLKHLMTQTFQTPDQIKWATKLLGFDYEIFYK 962
M A+ ++ R YL G ++YTD Q L + +W ++ ++ E+ YK
Sbjct: 482 MLAIIWSLDNLRAYLYGAGTIKVYTDHQPLTFALGNRNFNAKLKRWKARIEEYNCELIYK 541
Query: 963 PGSENRVADALSR 975
PG N VADALSR
Sbjct: 542 PGKSNVVADALSR 554
Score = 51.2 bits (121), Expect = 2e-05
Identities = 60/265 (22%), Positives = 100/265 (37%), Gaps = 20/265 (7%)
Query: 1057 HSGLKPTLARLMASFNWPGIQTETKTFIKQCLPCQYNKYVPAKKSGLLQPLPTPAQIWED 1116
H G +L+ + +P + + + C C+ KY LQP P P E
Sbjct: 705 HRGPTEIRLQLLEKYYFPRMSSTIRLQTSSCQCCKLYKYERHPNKPNLQPTPIPNYPCEI 764
Query: 1117 ISMDFITGLPPSHGHTVAWVIVDRLSKYAHFVALPANFTATSLANRFSSEICRLHGIPRS 1176
+ +D + + +D+ SK+A L + A+ E P+
Sbjct: 765 LHIDIF-----ALEKRLYLSCIDKFSKFAKLFHLQSK--ASVHLRETLVEALHYFTAPKV 817
Query: 1177 IVSDRDKIFLSHFWRDLFRVYGTKLRFSTAYHPETDGQTEVVNRGLETYLRCFAGEQPR- 1235
+VSD ++ L + R L ++ E +GQ E + RC E P
Sbjct: 818 LVSDNERGLLCPTVLNYLRSLDIDLYYAPTQKSEVNGQVERFHSTFLEIYRCLKDELPTF 877
Query: 1236 SWYKFLHLAELWYNTSFHSAAGMTPFQAVYGRPPPSLLAYVPGSSAIQSLDESLQQRT*I 1295
+ +H+A YNTS HS P + R S + Y Q L + +Q
Sbjct: 878 KPVELVHIAVDRYNTSVHSVTNRKPADVFFDR--SSRVNY-------QGLTDFRRQ---T 925
Query: 1296 LESLKANLQRAQHRMKIQKDKSRRE 1320
LE +K ++ Q R + ++K+R E
Sbjct: 926 LEDIKGLIEYKQIRGNMARNKNRDE 950
>POLY_DROME (P10401) Retrovirus-related Pol polyprotein from
transposon gypsy [Contains: Reverse transcriptase (EC
2.7.7.49); Endonuclease]
Length = 1035
Score = 289 bits (740), Expect = 3e-77
Identities = 200/607 (32%), Positives = 307/607 (49%), Gaps = 43/607 (7%)
Query: 404 IPDIEVTFAGNTFHIPFYVMDLQGA-DFVLGLDWLKTLGKVISDFSIPSMSFVVNGKT-- 460
+P IE AG T + ++D A +++ + LK + V S FS+ S ++G T
Sbjct: 12 LPFIERRLAGRTLKM---LIDTDAAKNYIRPVKELKNVMPVASPFSVSS----IHGSTEI 64
Query: 461 ---CTLEGEPLPPPSHA--SFNHFQRLIHTDAIAECHTITFLPSPP-------------S 502
C ++ P S N F +I D + + L S
Sbjct: 65 KHKCLMKVFKHISPFFLLDSLNAFDAIIGLDLLTQAGVKLNLAEDSLEYQGIAEKLHYFS 124
Query: 503 TPSPQFLTLENLSTPPPDFDPALWELLQSYAPVFSTPHGLPPTRFHDHHINLLPNTPPVN 562
PS F + ++ P +++ +T LP I + N P V
Sbjct: 125 CPSVNFTDVNDIVVPDSVKKEFKDTIIRRKKAFSTTNEALPFNTAVTATIRTVDNEP-VY 183
Query: 563 VRPYRYPHSQKEAMATILTDMLQEGIVVPSTSPYSSPVLLIKKK------DGTWRFCVDF 616
R Y + + + +L++GI+ PS SPY+SP ++ KK + R +DF
Sbjct: 184 SRAYPTLMGVSDFVNNEVKQLLKDGIIRPSRSPYNSPTWVVDKKGTDAFGNPNKRLVIDF 243
Query: 617 RSLNSITIKDRFPIPTIDELLDELGGASHFSKLDLRSGFHQIRLATEDTHKTAFRTVDGH 676
R LN TI DR+P+P+I +L LG A F+ LDL+SG+HQI LA D KT+F G
Sbjct: 244 RKLNEKTIPDRYPMPSIPMILANLGKAKFFTTLDLKSGYHQIYLAEHDREKTSFSVNGGK 303
Query: 677 YEFLVMPFGLTNAPSTFQAAMNDLLRPFLRRFVLVFFDDILVYSPSLSAHMTHLKEVLEV 736
YEF +PFGL NA S FQ A++D+LR + + V+ DD++++S + S H+ H+ VL+
Sbjct: 304 YEFCRLPFGLRNASSIFQRALDDVLREQIGKICYVYVDDVIIFSENESDHVRHIDTVLKC 363
Query: 737 LLSHKFYAKLSKCIFGVTSVSYLGHIISANGVGPDPSKVQAIVDWPVPRNLTALRGFLGL 796
L+ K F SV YLG I+S +G DP KV+AI ++P P + +R FLGL
Sbjct: 364 LIDANMRVSQEKTRFFKESVEYLGFIVSKDGTKSDPEKVKAIQEYPEPDCVYKVRSFLGL 423
Query: 797 TGFYRRFIKNYAAHASHLTDLLKQD------QKDKHTLLPWSSAAADSFQHLKDLIISAP 850
+YR FIK++AA A +TD+LK + K + ++ ++FQ L++++ S
Sbjct: 424 ASYYRVFIKDFAAIARPITDILKGENGSVSKHMSKKIPVEFNETQRNAFQRLRNILASED 483
Query: 851 VLV-LPDFSATFDIETDASGTAVGAVLSQKGHPISFFSKKLTLQMQHQSTYVREMYAVTE 909
V++ PDF FD+ TDAS + +GAVLSQ+G PI+ S+ L Q+ +T RE+ A+
Sbjct: 484 VILKYPDFKKPFDLTTDASASGIGAVLSQEGRPITMISRTLKQPEQNYATNERELLAIVW 543
Query: 910 AVKKWRQYLIG-HKFRIYTDQQSLKHLMTQTFQTPDQIKWATKLLGFDYEIFYKPGSENR 968
A+ K + +L G + I+TD Q L + +W + + + ++FYKPG EN
Sbjct: 544 ALGKLQNFLYGSREINIFTDHQPLTFAVADRNTNAKIKRWKSYIDQHNAKVFYKPGKENF 603
Query: 969 VADALSR 975
VADALSR
Sbjct: 604 VADALSR 610
>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
Length = 2186
Score = 287 bits (735), Expect = 1e-76
Identities = 241/901 (26%), Positives = 396/901 (43%), Gaps = 81/901 (8%)
Query: 522 DPALWELLQSYAPVFS-TPHGLPPTRFHDHHINLLPNTPPVNVRPYRYPHSQKEAMATIL 580
D +W++++ + VF+ + L + I L P+ +P P + K + ++
Sbjct: 903 DRKIWDVIEQFQDVFAISDDELGRNSGTECVIELKEGAEPIRQKPRPIPLALKPEIRKMI 962
Query: 581 TDMLQEGIVVPSTSPYSSPVLLIKKKDGTWRFCVDFRSLNSITIKDRFPIPTIDELLDEL 640
ML + ++ S SP+SSPV+L+KKKDG+ R C+D+R +N + + P+P I+ L L
Sbjct: 963 QKMLNQKVIRESKSPWSSPVVLVKKKDGSIRMCIDYRKVNKVVKNNAHPLPNIEATLQSL 1022
Query: 641 GGASHFSKLDLRSGFHQIRLATEDTHKTAFRTVDGHYEFLVMPFGLTNAPSTFQAAMNDL 700
G ++ D+ +GF QI L + TAF +E+ V+PFGL +P+ FQ M ++
Sbjct: 1023 AGKKLYTVFDMIAGFWQIPLDEKSKEITAFAIGSELFEWNVLPFGLVISPALFQGTMEEI 1082
Query: 701 LRPFLRRFVLVFFDDILVYSPSLSAHMTHLKEVLEVLLSHKFYAKLSKCIFGVTSVSYLG 760
+ L V+ DD+L+ S + H+ +KE L + + SKC V YLG
Sbjct: 1083 IGDLLGVCAFVYVDDLLIASKDMEQHLQDVKEALTRIRKSGMKLRASKCHIAKKEVEYLG 1142
Query: 761 HIISANGVGPDPSKVQAIVDWPVPRNLTALRGFLGLTGFYRRFIKNYAAHASHLTDLLKQ 820
H ++ +GV K + + P N+ L+ FLGL G+YR+FI N+A AS LT L+
Sbjct: 1143 HKVTLDGVETQEVKTDKMKQFSRPTNVKELQSFLGLVGYYRKFILNFAQIASSLTSLI-- 1200
Query: 821 DQKDKHTLLPWSSAAADSFQHLKDLIISAPVLVLPDFSAT------FDIETDASGTAVGA 874
W +FQ LK L+ PVL PD A F I TDAS +GA
Sbjct: 1201 ---SAKVAWIWEKEQEIAFQELKKLVCQTPVLAQPDVEAALKGDRPFMIYTDASRKGIGA 1257
Query: 875 VLSQKG-----HPISFFSKKLTLQMQHQSTYVREMYAVTEAVKKWRQYLIGHKFRIYTDQ 929
VL+Q+G HPI+F SK L+ E A+ A+++++ + G ++TD
Sbjct: 1258 VLAQEGPDGQQHPIAFASKALSPAETRYHITDLEALAMMFALRRFKTIIYGTAITVFTDH 1317
Query: 930 QSLKHLMTQTFQTPDQIKWATKLLGFDYEIFYKPGSENRVADALSR--CHSSEL------ 981
+ L L+ + +W+ ++L FD +I Y G N VADALSR C +EL
Sbjct: 1318 KPLISLLKGSPLADRLWRWSIEILEFDVKIVYLAGKANAVADALSRGGCPPNELEEEQTK 1377
Query: 982 ---PLLAAISSPVPEIITQ---LKQYYKTPEGIQLIV-------DKSTLPHFRVHHEVL- 1027
++ AI + +P+I+ L++ EG + ++ K T + E+
Sbjct: 1378 ELTSIVNAIQTELPDILDSSCWLERLKGEDEGWKEVIAALEGGKTKGTFKIVGIESEISL 1437
Query: 1028 -YFK--------------DGLFVPEHDQWRTSILSEYHASPAAGHSGLKPTLARLMASFN 1072
Y+K VPE + RT +L E H AGH G+K + F
Sbjct: 1438 EYYKIVGGVLKNTEIEEQSRSVVPE--KIRTPLLKELHEGMLAGHFGIKKMWRMVHRKFY 1495
Query: 1073 WPGIQTETKTFIKQCLPCQYNKYVPAKKSGLLQPLPTPAQI---WEDISMDFITGLPPSH 1129
WP ++ + ++ C C + A L TP ++ E ++ D +
Sbjct: 1496 WPQMRVCVENCVRTCAKC-----LCANDHSKLTSSLTPYRMTFPLEIVACDLMDVGLSVQ 1550
Query: 1130 GHTVAWVIVDRLSKYAHFVALPANFTATSLANRFSSEICRLHGIPRSIVSDRDKIFLSHF 1189
G+ I+D +KY V +P T L IP +++D+ K F++
Sbjct: 1551 GNRYILTIIDLFTKYGTAVPIPDKKAETVLKAFVERWAIGEGRIPLKLLTDQGKEFVNGL 1610
Query: 1190 WRDLFRVYGTKLRFSTAYHPETDGQTEVVNRGLETYLRCFAGEQPRSWYKFLHLAELWYN 1249
+ + + + Y+ +G E N+ + ++ P W + A YN
Sbjct: 1611 FAQFTHMLKIEHITTKGYNSRANGAVERFNKTIMHIMKKKTA-VPMEWDDQVVYAVYAYN 1669
Query: 1250 TSFHSAAGMTPFQAVYGRPPPSLLAYVPGSSAI----QSLDE----SLQQRT*ILESLKA 1301
H G TP ++GR L + G A+ +DE Q+ + + K
Sbjct: 1670 NCVHENTGETPMFLMHGRDVMGPLE-MSGEDAVGINYADMDEYKHLLTQELLKVQKIAKE 1728
Query: 1302 NLQRAQHRMKI---QKDKSRREVTFEENAWVLLRLQPYRQRSLAHRLSNKLAKRFYGPFR 1358
+ R Q K QK S++ + + VLL + + + +L NK + GP+R
Sbjct: 1729 HAMREQESYKSLFDQKYASKKHRFPQPGSRVLLEIPSEKLGAQCPKLVNK----WSGPYR 1784
Query: 1359 V 1359
V
Sbjct: 1785 V 1785
>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
transposon 412 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1237
Score = 269 bits (687), Expect = 4e-71
Identities = 164/473 (34%), Positives = 251/473 (52%), Gaps = 19/473 (4%)
Query: 555 LPNTPPVNVRPYRYPHSQKEAMATILTDMLQEGIVVPSTSPYSSPVLLIKKKDGT----- 609
L + PV + YR PHSQ E + + ++++ IV PS S Y+SP+LL+ KK
Sbjct: 309 LKDDEPVYTKNYRSPHSQVEEIQAQVQKLIKDKIVEPSVSQYNSPLLLVPKKSSPNSDKK 368
Query: 610 -WRFCVDFRSLNSITIKDRFPIPTIDELLDELGGASHFSKLDLRSGFHQIRLATEDTHKT 668
WR +D+R +N + D+FP+P ID++LD+LG A +FS LDL SGFHQI L T
Sbjct: 369 KWRLVIDYRQINKKLLADKFPLPRIDDILDQLGRAKYFSCLDLMSGFHQIELDEGSRDIT 428
Query: 669 AFRTVDGHYEFLVMPFGLTNAPSTFQAAMNDLLRPFLRRFVLVFFDDILVYSPSLSAHMT 728
+F T +G Y F +PFGL AP++FQ M ++ DD++V S +
Sbjct: 429 SFSTSNGSYRFTRLPFGLKIAPNSFQRMMTIAFSGIEPSQAFLYMDDLIVIGCSEKHMLK 488
Query: 729 HLKEVLEVLLSHKFYAKLSKCIFGVTSVSYLGHIISANGVGPDPSKVQAIVDWPVPRNLT 788
+L EV + KC F + V++LGH + G+ PD K I ++PVP +
Sbjct: 489 NLTEVFGKCREYNLKLHPEKCSFFMHEVTFLGHKCTDKGILPDDKKYDVIQNYPVPHDAD 548
Query: 789 ALRGFLGLTGFYRRFIKNYAAHASHLTDLLKQDQKDKHTLLPWSSAAADSFQHLKDLIIS 848
+ R F+ +YRRFIKN+A ++ H+T L K K+ W+ +F HLK +I+
Sbjct: 549 SARRFVAFCNYYRRFIKNFADYSRHITRLCK-----KNVPFEWTDECQKAFIHLKSQLIN 603
Query: 849 APVLVLPDFSATFDIETDASGTAVGAVLSQ--KGH--PISFFSKKLTLQMQHQSTYVREM 904
+L PDFS F I TDAS A GAVL+Q GH P+++ S+ T ++ST +E+
Sbjct: 604 PTLLQYPDFSKEFCITTDASKQACGAVLTQNHNGHQLPVAYASRAFTKGESNKSTTEQEL 663
Query: 905 YAVTEAVKKWRQYLIGHKFRIYTDQQSLKHLMTQTFQTPDQIKWATKLLGFDYEIFYKPG 964
A+ A+ +R Y+ G F + TD + L +L + + + +L +++ + Y G
Sbjct: 664 AAIHWAIIHFRPYIYGKHFTVKTDHRPLTYLFSMVNPSSKLTRIRLELEEYNFTVEYLKG 723
Query: 965 SENRVADALSRCHSSELPLLAAISSPVPEIITQLKQYYKTPEG-IQLIVDKST 1016
+N VADALSR E L I+ + ++ T+ + K+ G QL + K T
Sbjct: 724 KDNHVADALSRITIKE---LKDITGNILKVTTRFQSRQKSCAGKEQLDLQKQT 773
Score = 92.4 bits (228), Expect = 8e-18
Identities = 80/335 (23%), Positives = 153/335 (44%), Gaps = 28/335 (8%)
Query: 1038 HDQWRTSILSEYHASPA-AGHSGLKPTLARLMASFNWPGIQTETKTFIKQCLPCQYNKYV 1096
+++ + +ILS H P GH+G+ TLA++ + W + K ++++C CQ K
Sbjct: 889 NEKEKEAILSTLHDDPIQGGHTGITKTLAKVKRHYYWKNMSKYIKEYVRKCQKCQKAKTT 948
Query: 1097 PAKKSGLLQPLPTPAQIWEDISMDFITGLPPS-HGHTVAWVIVDRLSKYAHFVALP-ANF 1154
K+ + TP ++ + +D I LP S +G+ A ++ L+KY VA+P AN
Sbjct: 949 KHTKTPMTIT-ETPEHAFDRVVVDTIGPLPKSENGNEYAVTLICDLTKY--LVAIPIANK 1005
Query: 1155 TATSLANRFSSEICRLHGIPRSIVSDRDKIFLSHFWRDLFRVYGTKLRFSTAYHPETDGQ 1214
+A ++A +G ++ ++D + + DL + K STA+H +T G
Sbjct: 1006 SAKTVAKAIFESFILKYGPMKTFITDMGTEYKNSIITDLCKYLKIKNITSTAHHHQTVGV 1065
Query: 1215 TEVVNRGLETYLRCFAGEQPRSWYKFLHLAELWYNTSFHSAAGMTPFQAVYGRPP--PSL 1272
E +R L Y+R + W +L +NT+ P++ V+GR P
Sbjct: 1066 VERSHRTLNEYIRSYISTDKTDWDVWLQYFVYCFNTTQSMVHNYCPYELVFGRTSNLPKH 1125
Query: 1273 LAYVPGSSAIQSLDESLQQRT*ILESLKANLQRAQHRMKIQKDKSR-------REVTFEE 1325
+ I ++D+ ++ LE A RA+ ++ K+K++ +++ E
Sbjct: 1126 FNKLHSIEPIYNIDDYAKESKYRLEVAYA---RARKLLEAHKEKNKENYDLKVKDIELEV 1182
Query: 1326 NAWVLLRLQPYRQRSLAHRLSNKLAKRFYGPFRVK 1360
VLLR + + +KL ++ GP++++
Sbjct: 1183 GDKVLLR----------NEVGHKLDFKYTGPYKIE 1207
>POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 674
Score = 162 bits (410), Expect = 6e-39
Identities = 162/643 (25%), Positives = 267/643 (41%), Gaps = 49/643 (7%)
Query: 361 VDTGSSNNFVPPRTVSFLHLKVTPIPTFPVMVGNGAHIPCAGYIPDIEVTFAGNTFHIPF 420
VDTG+S + H P V + +G+ I DI++ AG FHIP
Sbjct: 46 VDTGASLCIASKFVIPEEHWINAERPIM-VKIADGSSITINKVCRDIDLIIAGEIFHIPT 104
Query: 421 YVMDLQGADFVLGLDWLKTLGKVISDFSIPSMSFVVNGKTCTLEGEPLPPPSHASFNHFQ 480
G DF++G ++ + + + I V+ K T P H +
Sbjct: 105 VYQQESGIDFIIGNNFCQ-----LYEPFIQFTDRVIFTKDRTY-------PVHIAKLTRA 152
Query: 481 RLIHTDAIAEC---HTITFLPSPPSTPSPQFLTLEN---LSTPPPDFDPALWELLQSYAP 534
+ T+ E + T P P + + + L LS + ++
Sbjct: 153 VRVGTEGFLESMKKRSKTQQPEPVNISTNKIAILSEGRRLSEEKLFITQQRMQKIEELLE 212
Query: 535 VFSTPHGLPPTR---FHDHHINLLPNTPPVNVRPYRYPHSQKEAMATILTDMLQEGIVVP 591
+ + L P + + I L + + V+P +Y +E + ++L ++ P
Sbjct: 213 KVCSENPLDPNKTKQWMKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKP 272
Query: 592 STSPYSSPVLLI----KKKDGTWRFCVDFRSLNSITIKDRFPIPTIDELLDELGGASHFS 647
S SP+ +P L+ +K+ G R V+++++N T+ D + P DELL + G FS
Sbjct: 273 SKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATVGDAYNPPNKDELLTLIRGKKIFS 332
Query: 648 KLDLRSGFHQIRLATEDTHKTAFRTVDGHYEFLVMPFGLTNAPSTFQAAMNDLLRPFLRR 707
D +SGF Q+ L E TAF GHYE+ V+PFGL APS FQ M++ R F R+
Sbjct: 333 SFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVF-RK 391
Query: 708 FVLVFFDDILVYSPSLSAHMTHLKEVLEVLLSHKFYAKLSKCIFGVTSVSYLGHIISANG 767
F V+ DDILV+S + H+ H+ +L+ H K +++LG I
Sbjct: 392 FCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGT 451
Query: 768 VGPDPSKVQAIVDWP-VPRNLTALRGFLGLTGFYRRFIKNYAAHASHLTDLLKQDQKDKH 826
P ++ I +P + L+ FLG+ + +I A L LK++
Sbjct: 452 HKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKEN----- 506
Query: 827 TLLPWSSAAADS--FQHLKDLIISAPVLVLPDFSATFDIETDAS----GTAVGAVLSQKG 880
+PW D+ Q +K + P L P IETDAS G + A+ +G
Sbjct: 507 --VPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEG 564
Query: 881 HPIS----FFSKKLTLQMQHQSTYVREMYAVTEAVKKWRQYLIGHKFRIYTDQQSLKHLM 936
+ S ++ + +E AV +KK+ YL F I TD K +
Sbjct: 565 TNTELICRYASGSFKAAEKNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFV 624
Query: 937 TQTFQTPDQ----IKWATKLLGFDYEIFYKPGSENRVADALSR 975
++ + I+W L + +++ + G++N AD LSR
Sbjct: 625 NLNYKGDSKLGRNIRWQAWLSHYSFDVEHIKGTDNHFADFLSR 667
>POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 666
Score = 160 bits (406), Expect = 2e-38
Identities = 152/621 (24%), Positives = 261/621 (41%), Gaps = 74/621 (11%)
Query: 390 VMVGNGAHIPCAGYIPDIEVTFAGNTFHIPFYVMDLQGADFVLG----------LDW--- 436
V + N I +++V FAG +F IP G DF++G + W
Sbjct: 81 VKIANQELIKITKVCKNLKVKFAGKSFEIPTVYQQETGIDFLIGNNFCRLYNPFIQWEDR 140
Query: 437 --------LKTLGKVISDFSIPSMSFVVNGKTCTLEGEPLPPPSHASFNHFQRLIHTDAI 488
+ + KV FS+ + SF+ N K + + E +P N + +I+ +
Sbjct: 141 IAFHLKNEMVLIKKVTKAFSVSNPSFLENMKKDS-KTEQIP-----GTNISKNIINPEER 194
Query: 489 AECHTITFLPSPPSTPSPQFLTLENLSTPPPDFDPALWELLQSYAPVFSTPHGLPPTRFH 548
FL + Q L P DP + ++
Sbjct: 195 Y------FLITEKYQKIEQLLDKVCSENP---IDP------------------IKSKQWM 227
Query: 549 DHHINLLPNTPPVNVRPYRYPHSQKEAMATILTDMLQEGIVVPSTSPYSSPVLLIK---- 604
I L+ + V+P Y +E A + ++L G+++PS S + SP L++
Sbjct: 228 KASIKLIDPLKVIRVKPMSYSPQDREGFAKQIKELLDLGLIIPSKSQHMSPAFLVENEAE 287
Query: 605 KKDGTWRFCVDFRSLNSITIKDRFPIPTIDELLDELGGASHFSKLDLRSGFHQIRLATED 664
++ G R V+++++N TI D +P + ELL L G S FS D +SGF Q+ L E
Sbjct: 288 RRRGKKRMVVNYKAINQATIGDSHNLPNMQELLTLLRGKSIFSSFDCKSGFWQVVLDEES 347
Query: 665 THKTAFRTVDGHYEFLVMPFGLTNAPSTFQAAMNDLLRPFLRRFVLVFFDDILVYSPSLS 724
TAF GH+++ V+PFGL APS FQ M L +F +V+ DDI+V+S S
Sbjct: 348 QKLTAFTCPQGHFQWKVVPFGLKQAPSIFQRHMQTALNG-ADKFCMVYVDDIIVFSNSEL 406
Query: 725 AHMTHLKEVLEVLLSHKFYAKLSKCIFGVTSVSYLGHIISANGVGPDPSKVQAIVDWPVP 784
H H+ VL+++ + K +++LG I P ++ I +P
Sbjct: 407 DHYNHVYAVLKIVEKYGIILSKKKANLFKEKINFLGLEIDKGTHCPQNHILENIHKFPDR 466
Query: 785 -RNLTALRGFLGLTGFYRRFIKNYAAHASHLTDLLKQDQKDKHTLLPWSSAAADSFQHLK 843
+ L+ FLG+ + +I A L LK+D W+ + +D + +K
Sbjct: 467 LEDKKHLQRFLGVLTYAETYIPKLAEIRKPLQVKLKKD-----VTWNWTQSDSDYVKKIK 521
Query: 844 DLIISAPVLVLPDFSATFDIETDASGTAVGAVLSQKGHP-----ISFFSKKLTLQMQHQS 898
+ S P L LP IETDAS + G VL + + S ++
Sbjct: 522 KNLGSFPKLYLPKPEDHLIIETDASDSFWGGVLKARALDGVELICRYSSGSFKQAEKNYH 581
Query: 899 TYVREMYAVTEAVKKWRQYLIGHKFRIYTDQQSLKHLMTQTFQTPDQ----IKWATKLLG 954
+ +E+ AV + + K+ YL +F + TD ++ + + + + ++W
Sbjct: 582 SNDKELLAVKQVITKFSAYLTPVRFTVRTDNKNFTYFLRINLKGDSKQGRLVRWQNWFSK 641
Query: 955 FDYEIFYKPGSENRVADALSR 975
+ +++ + G +N +AD L+R
Sbjct: 642 YQFDVEHLEGVKNVLADCLTR 662
>POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 679
Score = 160 bits (404), Expect = 3e-38
Identities = 125/443 (28%), Positives = 201/443 (45%), Gaps = 27/443 (6%)
Query: 552 INLLPNTPPVNVRPYRYPHSQKEAMATILTDMLQEGIVVPSTSPYSSPVLLI----KKKD 607
I L + + V+P +Y +E + ++L ++ PS SP+ +P L+ +K+
Sbjct: 238 IKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAEKRR 297
Query: 608 GTWRFCVDFRSLNSITIKDRFPIPTIDELLDELGGASHFSKLDLRSGFHQIRLATEDTHK 667
G R V+++++N TI D + +P DELL + G FS D +SGF Q+ L E
Sbjct: 298 GKKRMVVNYKAMNKATIGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPL 357
Query: 668 TAFRTVDGHYEFLVMPFGLTNAPSTFQAAMNDLLRPFLRRFVLVFFDDILVYSPSLSAHM 727
TAF GHYE+ V+PFGL APS FQ M++ R F R+F V+ DDILV+S + H+
Sbjct: 358 TAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDILVFSNNEEDHL 416
Query: 728 THLKEVLEVLLSHKFYAKLSKCIFGVTSVSYLGHIISANGVGPDPSKVQAIVDWP-VPRN 786
H+ +L+ H K +++LG I P ++ I +P +
Sbjct: 417 LHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLED 476
Query: 787 LTALRGFLGLTGFYRRFIKNYAAHASHLTDLLKQDQKDKHTLLPWSSAAADS--FQHLKD 844
L+ FLG+ + +I A L LK++ +PW D+ Q +K
Sbjct: 477 KKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKEN-------VPWKWTKEDTLYMQKVKK 529
Query: 845 LIISAPVLVLPDFSATFDIETDAS----GTAVGAVLSQKGHPIS----FFSKKLTLQMQH 896
+ P L P IETDAS G + A+ +G + S ++
Sbjct: 530 NLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAERN 589
Query: 897 QSTYVREMYAVTEAVKKWRQYLIGHKFRIYTDQQSLKHLMTQTFQTPDQ----IKWATKL 952
+ +E AV +KK+ YL F I TD K + ++ + I+W L
Sbjct: 590 YHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWL 649
Query: 953 LGFDYEIFYKPGSENRVADALSR 975
+ +++ + G++N AD LSR
Sbjct: 650 SHYSFDVEHIKGTDNHFADFLSR 672
>POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 679
Score = 160 bits (404), Expect = 3e-38
Identities = 125/443 (28%), Positives = 201/443 (45%), Gaps = 27/443 (6%)
Query: 552 INLLPNTPPVNVRPYRYPHSQKEAMATILTDMLQEGIVVPSTSPYSSPVLLI----KKKD 607
I L + + V+P +Y +E + ++L ++ PS SP+ +P L+ +K+
Sbjct: 238 IKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAEKRR 297
Query: 608 GTWRFCVDFRSLNSITIKDRFPIPTIDELLDELGGASHFSKLDLRSGFHQIRLATEDTHK 667
G R V+++++N TI D + +P DELL + G FS D +SGF Q+ L E
Sbjct: 298 GKKRMVVNYKAMNKATIGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPL 357
Query: 668 TAFRTVDGHYEFLVMPFGLTNAPSTFQAAMNDLLRPFLRRFVLVFFDDILVYSPSLSAHM 727
TAF GHYE+ V+PFGL APS FQ M++ R F R+F V+ DDILV+S + H+
Sbjct: 358 TAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDILVFSNNEEDHL 416
Query: 728 THLKEVLEVLLSHKFYAKLSKCIFGVTSVSYLGHIISANGVGPDPSKVQAIVDWP-VPRN 786
H+ +L+ H K +++LG I P ++ I +P +
Sbjct: 417 LHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLED 476
Query: 787 LTALRGFLGLTGFYRRFIKNYAAHASHLTDLLKQDQKDKHTLLPWSSAAADS--FQHLKD 844
L+ FLG+ + +I A L LK++ +PW D+ Q +K
Sbjct: 477 KKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKEN-------VPWKWTKEDTLYMQKVKK 529
Query: 845 LIISAPVLVLPDFSATFDIETDAS----GTAVGAVLSQKGHPIS----FFSKKLTLQMQH 896
+ P L P IETDAS G + A+ +G + S ++
Sbjct: 530 NLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAERN 589
Query: 897 QSTYVREMYAVTEAVKKWRQYLIGHKFRIYTDQQSLKHLMTQTFQTPDQ----IKWATKL 952
+ +E AV +KK+ YL F I TD K + ++ + I+W L
Sbjct: 590 YHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWL 649
Query: 953 LGFDYEIFYKPGSENRVADALSR 975
+ +++ + G++N AD LSR
Sbjct: 650 SHYSFDVEHIKGTDNHFADFLSR 672
>POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 679
Score = 159 bits (402), Expect = 5e-38
Identities = 124/443 (27%), Positives = 201/443 (44%), Gaps = 27/443 (6%)
Query: 552 INLLPNTPPVNVRPYRYPHSQKEAMATILTDMLQEGIVVPSTSPYSSPVLLI----KKKD 607
I L + + V+P +Y +E + ++L ++ PS SP+ +P L+ +K+
Sbjct: 238 IKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAEKRR 297
Query: 608 GTWRFCVDFRSLNSITIKDRFPIPTIDELLDELGGASHFSKLDLRSGFHQIRLATEDTHK 667
G R V+++++N T+ D + +P DELL + G FS D +SGF Q+ L E
Sbjct: 298 GKKRMVVNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPL 357
Query: 668 TAFRTVDGHYEFLVMPFGLTNAPSTFQAAMNDLLRPFLRRFVLVFFDDILVYSPSLSAHM 727
TAF GHYE+ V+PFGL APS FQ M++ R F R+F V+ DDILV+S + H+
Sbjct: 358 TAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDILVFSNNEEDHL 416
Query: 728 THLKEVLEVLLSHKFYAKLSKCIFGVTSVSYLGHIISANGVGPDPSKVQAIVDWP-VPRN 786
H+ +L+ H K +++LG I P ++ I +P +
Sbjct: 417 LHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLED 476
Query: 787 LTALRGFLGLTGFYRRFIKNYAAHASHLTDLLKQDQKDKHTLLPWSSAAADS--FQHLKD 844
L+ FLG+ + +I A L LK++ +PW D+ Q +K
Sbjct: 477 KKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKEN-------VPWRWTKEDTLYMQKVKK 529
Query: 845 LIISAPVLVLPDFSATFDIETDAS----GTAVGAVLSQKGHPIS----FFSKKLTLQMQH 896
+ P L P IETDAS G + A+ +G + S ++
Sbjct: 530 NLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAEKN 589
Query: 897 QSTYVREMYAVTEAVKKWRQYLIGHKFRIYTDQQSLKHLMTQTFQTPDQ----IKWATKL 952
+ +E AV +KK+ YL F I TD K + ++ + I+W L
Sbjct: 590 YHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWL 649
Query: 953 LGFDYEIFYKPGSENRVADALSR 975
+ +++ + G++N AD LSR
Sbjct: 650 SHYSFDVEHIKGTDNHFADFLSR 672
Score = 32.3 bits (72), Expect = 9.2
Identities = 21/73 (28%), Positives = 31/73 (41%), Gaps = 1/73 (1%)
Query: 361 VDTGSSNNFVPPRTVSFLHLKVTPIPTFPVMVGNGAHIPCAGYIPDIEVTFAGNTFHIPF 420
VDTG+S + H P V + +G+ I + DI++ AG F IP
Sbjct: 44 VDTGASLCIASKFVIPEEHWVNAERPIM-VKIADGSSITISKVCKDIDLIIAGEIFRIPT 102
Query: 421 YVMDLQGADFVLG 433
G DF++G
Sbjct: 103 VYQQESGIDFIIG 115
>POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 680
Score = 158 bits (400), Expect = 9e-38
Identities = 123/443 (27%), Positives = 200/443 (44%), Gaps = 27/443 (6%)
Query: 552 INLLPNTPPVNVRPYRYPHSQKEAMATILTDMLQEGIVVPSTSPYSSPVLLIKKKD---- 607
I L + + V+P +Y +E + ++L ++ PS SP+ +P L+ +
Sbjct: 239 IKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAENGR 298
Query: 608 GTWRFCVDFRSLNSITIKDRFPIPTIDELLDELGGASHFSKLDLRSGFHQIRLATEDTHK 667
G R V+++++N T+ D + +P DELL + G FS D +SGF Q+ L E
Sbjct: 299 GNKRMVVNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPL 358
Query: 668 TAFRTVDGHYEFLVMPFGLTNAPSTFQAAMNDLLRPFLRRFVLVFFDDILVYSPSLSAHM 727
TAF GHYE+ V+PFGL APS FQ M++ R F R+F V+ DDI+V+S + H+
Sbjct: 359 TAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDIVVFSNNEEDHL 417
Query: 728 THLKEVLEVLLSHKFYAKLSKCIFGVTSVSYLGHIISANGVGPDPSKVQAIVDWP-VPRN 786
H+ +L+ H K +++LG I P ++ I +P +
Sbjct: 418 LHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLED 477
Query: 787 LTALRGFLGLTGFYRRFIKNYAAHASHLTDLLKQDQKDKHTLLPWSSAAADS--FQHLKD 844
L+ FLG+ + +I N A L LK++ +PW D+ Q +K
Sbjct: 478 KKQLQRFLGILTYASDYIPNLAQMRQPLQAKLKEN-------VPWKWTKEDTLYMQKVKK 530
Query: 845 LIISAPVLVLPDFSATFDIETDAS----GTAVGAVLSQKGHPIS----FFSKKLTLQMQH 896
+ P L P IETDAS G + A+ +G + S ++
Sbjct: 531 NLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYRSGSFKAAERN 590
Query: 897 QSTYVREMYAVTEAVKKWRQYLIGHKFRIYTDQQSLKHLMTQTFQTPDQ----IKWATKL 952
+ +E AV +KK+ YL F I TD K + ++ + I+W L
Sbjct: 591 YHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWL 650
Query: 953 LGFDYEIFYKPGSENRVADALSR 975
+ +++ + G++N AD LSR
Sbjct: 651 SHYSFDVEHIKGTDNHFADFLSR 673
>POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;
Protease (EC 3.4.23.-); Reverse transcriptase (EC
2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
Length = 1886
Score = 143 bits (360), Expect = 4e-33
Identities = 116/454 (25%), Positives = 207/454 (45%), Gaps = 39/454 (8%)
Query: 552 INLLPNTPPVNVRPYRY-PHSQKEAMATILTDMLQEGIVVPSTSPYSSPVLLI------- 603
+N++ + RP ++ +EAM + +LQ ++ PS S + S ++
Sbjct: 1393 LNIINPDIKIMGRPIKHVTPGDEEAMTRQINLLLQMKVIRPSESKHRSTAFIVRSGTEID 1452
Query: 604 ----KKKDGTWRFCVDFRSLNSITIKDRFPIPTIDELLDELGGASHFSKLDLRSGFHQIR 659
K+K G R +++ LN T D++ +P I+ ++ ++G + +SK DL+SGF Q+
Sbjct: 1453 PITGKEKKGKERMVFNYKLLNENTESDQYSLPGINTIISKVGRSKIYSKFDLKSGFWQVA 1512
Query: 660 LATEDTHKTAFRTVDGHYEFLVMPFGLTNAPSTFQAAMNDLLRPFLRRFVLVFFDDILVY 719
+ E TAF + YE+LVMPFGL NAP+ FQ M+++ + +F+ V+ DDILV+
Sbjct: 1513 MEEESVPWTAFLAGNKLYEWLVMPFGLKNAPAIFQRKMDNVFKG-TEKFIAVYIDDILVF 1571
Query: 720 SPSLSAHMTHLKEVLEVLLSHKFYAKLSKCIFGVTSVSYLGHIISANGVGPDPSKVQAIV 779
S + H HL +L++ + +K G + +LG + + P + I
Sbjct: 1572 SETAEQHSQHLYTMLQLCKENGLILSPTKMKIGTPEIDFLGASLGCTKIKLQPHIISKIC 1631
Query: 780 DWPVPRNLT--ALRGFLGLTGFYRRFIKNYAAHASHLTDLLKQDQKDKHTLLPWSSAAAD 837
D+ + T +R +LG+ + R +I++ L + + W
Sbjct: 1632 DFSDEKLATPEGMRSWLGILSYARNYIQDIGKLVQPLRQKMAPTGDKRMNPETWKMV--- 1688
Query: 838 SFQHLKDLIISAPVLVLPDFSATFDIETDASGTAVGAVLSQKGHPISFFSKKLTLQM--- 894
+ +K+ + + P L LP + IETD T GAV K +S + T ++
Sbjct: 1689 --RQIKEKVKNLPDLQLPPKDSFIIIETDGCMTGWGAVCKWK---MSKHDPRSTERICAY 1743
Query: 895 ------QHQSTYVREMYAVTEAVKKWRQYLIGHK-FRIYTDQQSLKHLMTQTFQT-PDQI 946
+ST E+ A + K++ Y + K I +D +++ +T + P ++
Sbjct: 1744 ASGSFNPIKSTIDAEIQAAIHGLDKFKIYYLDKKELIIRSDCEAIIKFYNKTNENKPSRV 1803
Query: 947 KWAT-----KLLGFDYEIFYKPGSENRVADALSR 975
+W T LG + G N +ADALSR
Sbjct: 1804 RWLTFSDFLTGLGITVTFEHIDGKHNGLADALSR 1837
>POL_FENV1 (P31792) Pol polyprotein [Contains: Reverse
transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
(RT); Integrase (IN)] (Fragment)
Length = 1046
Score = 140 bits (353), Expect = 2e-32
Identities = 118/416 (28%), Positives = 186/416 (44%), Gaps = 29/416 (6%)
Query: 552 INLLPNTPPVNVRPYRYPHSQKEAMATILTDMLQEGIVVPSTSPYSSPVLLIKKKDGT-- 609
I+L P PV++R Y + +T L+ G++ P SP+++P+L +KK GT
Sbjct: 25 IDLKPTAMPVSIRQYPMSKEAHMGIQPHITRFLELGVLRPCRSPWNTPLLPVKKP-GTRD 83
Query: 610 WRFCVDFRSLNSITIKDRFPIPTIDELLDELG-GASHFSKLDLRSGFHQIRLATEDTHKT 668
+R D R +N T+ +P LL L + ++ LDL+ F + LA +
Sbjct: 84 YRPVQDLREVNKRTMDIHPTVPNPYNLLSTLSPDRTWYTVLDLKDAFFCLPLAPQSQELF 143
Query: 669 AF------RTVDGHYEFLVMPFGLTNAPSTFQAAMNDLLRPFLRRF----VLVFFDDILV 718
AF R + G + +P G N+P+ F A++ L F + +L + DD+L+
Sbjct: 144 AFEWRDPERGISGQLTWTRLPQGFKNSPTLFDEALHRDLTDFRTQHPEVTLLQYVDDLLL 203
Query: 719 YSPSLSAHMTHLKEVLEVLLSHKFYAKLSKCIFGVTSVSYLGHIISANGVGPDPSKVQAI 778
+P+ A + K +L L + A K T V+YLG+I+S P +++ +
Sbjct: 204 AAPTKEACIRGTKHLLRELGDKGYRASAKKAQICQTKVTYLGYILSEGKRWLTPGRIETV 263
Query: 779 VDWPVPRNLTALRGFLGLTGFYRRFIKNYAAHASHLTDLLKQDQKDKHTLLPWSSAAADS 838
P P+N +R FLG GF R +I +A A+ L L K+ W +
Sbjct: 264 AHIPPPQNPREVREFLGTAGFCRLWIPGFAELAAPLYALTKESAP-----FTWQEKHQSA 318
Query: 839 FQHLKDLIISAPVLVLPDFSATFDIETDASGTAVGAVLSQK----GHPISFFSKKLTLQM 894
F+ LK+ ++SAP L LPD S F + D VL+QK P+++ SKKL
Sbjct: 319 FEALKEALLSAPALGLPDTSKPFTLFIDEKQGIAKGVLTQKLGPWKRPVAYLSKKLDPVA 378
Query: 895 QHQSTYVREMYAVTEAVKKWRQYLIGHKFRIYTDQQSLKHLMTQTFQTPDQIKWAT 950
+R M A VK + +G + T L QTPD +W T
Sbjct: 379 AGWPPCLRIMAATAMLVKDSAKLTLGQPLTVITPHA----LEAIVRQTPD--RWIT 428
Score = 67.4 bits (163), Expect = 3e-10
Identities = 58/196 (29%), Positives = 89/196 (44%), Gaps = 10/196 (5%)
Query: 1110 PAQIWEDISMDFITGLPPSHGHTVAWVIVDRLSKYAHFVALPANF-TATSLANRFSSEIC 1168
P WE +DF P G+ V VD S + A P TA +A + EI
Sbjct: 761 PGVYWE---IDFTEVKPHYAGYKYLLVFVDTFSGWVE--AYPTRQETAHMVAKKILEEIF 815
Query: 1169 RLHGIPRSIVSDRDKIFLSHFWRDLFRVYGTKLRFSTAYHPETDGQTEVVNRGL-ETYLR 1227
G+P+ I SD F+S + L R G + AY P++ GQ E +NR + ET +
Sbjct: 816 PRFGLPKVIGSDNGPAFVSQVSQGLARTLGINWKLHCAYRPQSSGQVERMNRTIKETLTK 875
Query: 1228 CFAGEQPRSWYKFLHLAELWYNTSFHSAAGMTPFQAVYGRPPPSLLAYVPGSSAIQSLDE 1287
+ W + L LA L + + G+TP++ +YG PPP L+ + S +
Sbjct: 876 LTLETGLKDWRRLLSLALLRARNT-PNRFGLTPYEILYGGPPP--LSTLLNSFSPSDPKT 932
Query: 1288 SLQQRT*ILESLKANL 1303
LQ R L++++A +
Sbjct: 933 DLQARLKGLQAVQAQI 948
>POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 659
Score = 138 bits (348), Expect = 9e-32
Identities = 157/674 (23%), Positives = 274/674 (40%), Gaps = 61/674 (9%)
Query: 336 ALHGRPSPRT------LKFTAIVNGHPVVVLVDTGSSNNFVPPRTVSFLHLKVTPIPTFP 389
+L R +P + LKF + VDTGSS + + + P
Sbjct: 2 SLRNRTNPNSIYVKGILKFPGYQTNLDLHCYVDTGSSLCMASKYVIPEEYWQTAEKP-LN 60
Query: 390 VMVGNGAHIPCAGYIPDIEVTFAGNTFHIPFYVMDLQGADFVLGLDWLKTLGKVISDFSI 449
+ + NG I + + G F IP G D +LG ++ + I
Sbjct: 61 IKIANGKIIQLTKVCSKLPIRLGGERFLIPTLFQQESGIDLLLGNNFCQLYSPFIQ--YT 118
Query: 450 PSMSFVVNGKTCTLEGEPLPPPSHASFNHFQRLIHTDAIAECHTITFLPSPPSTPSPQFL 509
+ F +N K + G+ + + + + P P + S Q L
Sbjct: 119 DRIYFHLN-KQSVIIGKITKAYQYGVKGFLESMKKKSKVNR-------PEPINITSNQHL 170
Query: 510 TLENLSTPPPDFDPALWEL-------LQSYAPVFSTPHGLPPTR---FHDHHINLLPNTP 559
LE D L+E+ ++ S+ + + P + + I L+
Sbjct: 171 FLEEGGN---HVDEMLYEIQISKFSAIEEMLERVSSENPIDPEKSKQWMTATIELIDPKT 227
Query: 560 PVNVRPYRYPHSQKEAMATILTDMLQEGIVVPSTSPYSSPVLLIK----KKDGTWRFCVD 615
V V+P Y S +E + ++L+ ++ PS S + SP L++ ++ G R V+
Sbjct: 228 VVKVKPMSYSPSDREEFDRQIKELLELKVIKPSKSTHMSPAFLVENEAERRRGKKRMVVN 287
Query: 616 FRSLNSITIKDRFPIPTIDELLDELGGASHFSKLDLRSGFHQIRLATEDTHKTAFRTVDG 675
++++N T D +P DELL + G +S D +SG Q+ L E TAF G
Sbjct: 288 YKAMNKATKGDAHNLPNKDELLTLVRGKKIYSSFDCKSGLWQVLLDKESQLLTAFTCPQG 347
Query: 676 HYEFLVMPFGLTNAPSTFQAAMNDLLRPFLRRFVLVFFDDILVYS-PSLSAHMTHLKEVL 734
HY++ V+PFGL APS F + ++ V+ DDILV+S H H+ +L
Sbjct: 348 HYQWNVVPFGLKQAPSIFPKTYANSHSNQYSKYCCVYVDDILVFSNTGRKEHYIHVLNIL 407
Query: 735 E------VLLSHKFYAKLSKCIFGVTSVSYLGHIISANGVGPDPSKVQAIVDWPVP-RNL 787
++LS K A+L K +++LG I P ++ I +P +
Sbjct: 408 RRCEKLGIILSKK-KAQLFK-----EKINFLGLEIDQGTHCPQNHILEHIHKFPDRIEDK 461
Query: 788 TALRGFLGLTGFYRRFIKNYAAHASHLTDLLKQDQKDKHTLLPWSSAAADSFQHLKDLII 847
L+ FLG+ + +I A+ L LK+D + W+ + +K +
Sbjct: 462 KQLQRFLGILTYASDYIPKLASIRKPLQSKLKED-----STWTWNDTDSQYMAKIKKNLK 516
Query: 848 SAPVLVLPDFSATFDIETDAS----GTAVGAVLSQKGHPISFFSKKLTLQMQHQSTYVRE 903
S P L P+ + IETDAS G + A+ + + + S ++ + +E
Sbjct: 517 SFPKLYHPEPNDKLVIETDASEEFWGGILKAIHNSHEYICRYASGSFKAAERNYHSNEKE 576
Query: 904 MYAVTEAVKKWRQYLIGHKFRIYTDQQSLKHLMTQTFQTPDQ----IKWATKLLGFDYEI 959
+ AV +KK+ YL +F I TD ++ H + + + ++W L +D+++
Sbjct: 577 LLAVIRVIKKFSIYLTPSRFLIRTDNKNFTHFVNINLKGDRKQGRLVRWQMWLSQYDFDV 636
Query: 960 FYKPGSENRVADAL 973
+ G++N AD L
Sbjct: 637 EHIAGTKNVFADFL 650
>POL_BAEVM (P10272) Pol polyprotein [Contains: Protease (EC
3.4.23.-); Reverse transcriptase/ribonuclease H (EC
2.7.7.49) (EC 3.1.26.4) (RT); Integrase (IN)]
Length = 1189
Score = 134 bits (337), Expect = 2e-30
Identities = 111/393 (28%), Positives = 178/393 (45%), Gaps = 23/393 (5%)
Query: 552 INLLPNTPPVNVRPYRYPHSQKEAMATILTDMLQEGIVVPSTSPYSSPVLLIKKKDGT-- 609
I+L P PV+++ Y + + L+ G++ P SP+++P+L +KK GT
Sbjct: 168 IDLKPTAVPVSIKQYPMSLEAHMGIRQHIIKFLELGVLRPCRSPWNTPLLPVKKP-GTQD 226
Query: 610 WRFCVDFRSLNSITIKDRFPIPTIDELLDELG-GASHFSKLDLRSGFHQIRLATEDTHKT 668
+R D R +N T+ +P LL L S ++ LDL+ F + LA +
Sbjct: 227 YRPVQDLREINKRTVDIHPTVPNPYNLLSTLKPDYSWYTVLDLKDAFFCLPLAPQSQELF 286
Query: 669 AF------RTVDGHYEFLVMPFGLTNAPSTFQAAMNDLLRPFLRRF----VLVFFDDILV 718
AF R + G + +P G N+P+ F A++ L F + +L + DD+L+
Sbjct: 287 AFEWKDPERGISGQLTWTRLPQGFKNSPTLFDEALHRDLTDFRTQHPEVTLLQYVDDLLL 346
Query: 719 YSPSLSAHMTHLKEVLEVLLSHKFYAKLSKCIFGVTSVSYLGHIISANGVGPDPSKVQAI 778
+P+ A + +L+ L + A K T V+YLG+I+S P +++ +
Sbjct: 347 AAPTKKACTQGTRHLLQELGEKGYRASAKKAQICQTKVTYLGYILSEGKRWLTPGRIETV 406
Query: 779 VDWPVPRNLTALRGFLGLTGFYRRFIKNYAAHASHLTDLLKQDQKDKHTLLPWSSAAADS 838
P PRN +R FLG GF R +I +A A+ L L K+ T W + +
Sbjct: 407 ARIPPPRNPREVREFLGTAGFCRLWIPGFAELAAPLYALTKES-----TPFTWQTEHQLA 461
Query: 839 FQHLKDLIISAPVLVLPDFSATFDIETDASGTAVGAVLSQK----GHPISFFSKKLTLQM 894
F+ LK ++SAP L LPD S F + D VL+QK P+++ SKKL
Sbjct: 462 FEALKKALLSAPALGLPDTSKPFTLFLDERQGIAKGVLTQKLGPWKRPVAYLSKKLDPVA 521
Query: 895 QHQSTYVREMYAVTEAVKKWRQYLIGHKFRIYT 927
+R M A VK + +G + T
Sbjct: 522 AGWPPCLRIMAATAMLVKDSAKLTLGQPLTVIT 554
Score = 72.0 bits (175), Expect = 1e-11
Identities = 71/263 (26%), Positives = 113/263 (41%), Gaps = 13/263 (4%)
Query: 1044 SILSEYHASPAAGHSGLKPTLARLMASFNWPGIQTETKTFIKQCLPCQY-NKYVPAKKSG 1102
+++ + HA G+ LK + + F P T + C CQ N +G
Sbjct: 839 AMIQQMHAWTHLGNRKLKLLIEK--TDFLIPRASTLIEQVTSACKVCQQVNAGATRVPAG 896
Query: 1103 LLQPLPTPAQIWEDISMDFITGLPPSHGHTVAWVIVDRLSKYAHFVALPANF-TATSLAN 1161
P WE +DF P G+ V VD S + A P TA +A
Sbjct: 897 KRTRGNRPGVYWE---IDFTEVKPHYAGYKYLLVFVDTFSGWVE--AFPTRQETAHIVAK 951
Query: 1162 RFSSEICRLHGIPRSIVSDRDKIFLSHFWRDLFRVYGTKLRFSTAYHPETDGQTEVVNRG 1221
+ EI G+P+ I SD F+S + L R+ G + AY P++ GQ E +NR
Sbjct: 952 KILEEIFPRFGLPKVIGSDNGPAFVSQVSQGLARILGINWKLHCAYRPQSSGQVERMNRT 1011
Query: 1222 L-ETYLRCFAGEQPRSWYKFLHLAELWYNTSFHSAAGMTPFQAVYGRPPPSLLAYVPGSS 1280
+ ET + + W + L LA L + + G+TP++ +YG PPP L+ + S
Sbjct: 1012 IKETLTKLTLETGLKDWRRLLSLALLRARNT-PNRFGLTPYEILYGGPPP--LSTLLNSF 1068
Query: 1281 AIQSLDESLQQRT*ILESLKANL 1303
+ + LQ R L++++A +
Sbjct: 1069 SPSNSKTDLQARLKGLQAVQAQI 1091
>M860_ARATH (P92523) Hypothetical mitochondrial protein AtMg00860
(ORF158)
Length = 158
Score = 132 bits (332), Expect = 7e-30
Identities = 67/137 (48%), Positives = 87/137 (62%), Gaps = 8/137 (5%)
Query: 727 MTHLKEVLEVLLSHKFYAKLSKCIFGVTSVSYLGH--IISANGVGPDPSKVQAIVDWPVP 784
M HL VL++ H+FYA KC FG ++YLGH IIS GV DP+K++A+V WP P
Sbjct: 1 MNHLGMVLQIWEQHQFYANRKKCAFGQPQIAYLGHRHIISGEGVSADPAKLEAMVGWPEP 60
Query: 785 RNLTALRGFLGLTGFYRRFIKNYAAHASHLTDLLKQDQKDKHTLLPWSSAAADSFQHLKD 844
+N T LRGFLGLTG+YRRF+KNY LT+LLK++ L W+ AA +F+ LK
Sbjct: 61 KNTTELRGFLGLTGYYRRFVKNYGKIVRPLTELLKKNS------LKWTEMAALAFKALKG 114
Query: 845 LIISAPVLVLPDFSATF 861
+ + PVL LPD F
Sbjct: 115 AVTTLPVLALPDLKLPF 131
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.322 0.136 0.419
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 169,537,562
Number of Sequences: 164201
Number of extensions: 7549091
Number of successful extensions: 21203
Number of sequences better than 10.0: 164
Number of HSP's better than 10.0 without gapping: 96
Number of HSP's successfully gapped in prelim test: 71
Number of HSP's that attempted gapping in prelim test: 20708
Number of HSP's gapped (non-prelim): 338
length of query: 1387
length of database: 59,974,054
effective HSP length: 123
effective length of query: 1264
effective length of database: 39,777,331
effective search space: 50278546384
effective search space used: 50278546384
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 72 (32.3 bits)
Lotus: description of TM0007.12