
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC126790.10 + phase: 0
(1621 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E Value
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei... 499 e-140
RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protei... 496 e-139
RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protei... 496 e-139
POL3_DROME (P04323) Retrovirus-related Pol polyprotein from tran... 357 1e-97
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III 352 5e-96
POL2_DROME (P20825) Retrovirus-related Pol polyprotein from tran... 348 9e-95
POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from tran... 339 3e-92
POLY_DROME (P10401) Retrovirus-related Pol polyprotein from tran... 302 4e-81
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran... 276 3e-73
POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic pro... 194 1e-48
POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic pro... 191 1e-47
POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic pro... 191 1e-47
POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic pro... 190 3e-47
POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic prot... 187 2e-46
POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic pro... 186 3e-46
POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic prot... 170 3e-41
POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;... 164 2e-39
POL_BAEVM (P10272) Pol polyprotein [Contains: Protease (EC 3.4.2... 153 3e-36
POL_SOCMV (P15629) Enzymatic polyprotein [Contains: Aspartic pro... 152 6e-36
RRPO_OENBE (P31843) RNA-directed DNA polymerase homolog (Reverse... 151 2e-35
>RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protein
type 1
Length = 1333
Score = 499 bits (1285), Expect = e-140
Identities = 294/905 (32%), Positives = 480/905 (52%), Gaps = 42/905 (4%)
Query: 656 EFSDVFPE----ELPGIPPDREIEFSIDLIPGTQPISIPPYRMAPAELKELREQLQDLLD 711
EF D+ E +LP P + +EF ++L + I Y + P +++ + +++ L
Sbjct: 380 EFKDITAETNTEKLP--KPIKGLEFEVELTQENYRLPIRNYPLPPGKMQAMNDEINQGLK 437
Query: 712 KGFIRASTSPWGAPVLFVKKKDGSMRLCVDYRQLNKVTIKNKYPLPRIDELFDQLQGAQC 771
G IR S + PV+FV KK+G++R+ VDY+ LNK N YPLP I++L ++QG+
Sbjct: 438 SGIIRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTI 497
Query: 772 FSKIDLRSGYHQLKIKSEDISKTAFRTRYGHYEFLVMSFGLTNAPAAFMDLMNRVFKPFL 831
F+K+DL+S YH ++++ D K AFR G +E+LVM +G++ APA F +N +
Sbjct: 498 FTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINTILGEAK 557
Query: 832 DRFVIVFIDDILIYSKSREEHEQHLRLVLQTLREKQLYAKFSKCEFWLESVAFLGHIVSK 891
+ V+ ++DDILI+SKS EH +H++ VLQ L+ L +KCEF V F+G+ +S+
Sbjct: 558 ESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISE 617
Query: 892 NGISVDPSKVEAVQNWPRPTSVKEIRSFLGLAGYYRRFVKDFSKLAFPLTRLTQKKVEFK 951
G + ++ V W +P + KE+R FLG Y R+F+ S+L PL L +K V +K
Sbjct: 618 KGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWK 677
Query: 952 WTDACEESFQKLKECLISAPILALPTSGGGYVVYCDASRVGLGCVLMQHGK-----VIAY 1006
WT ++ + +K+CL+S P+L ++ DAS V +G VL Q + Y
Sbjct: 678 WTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGY 737
Query: 1007 ASRQLKRHEQNYPTHDLEMAAVIFALKIWRHYLYG--ETCEIYTDHKSL--KYIFEQRDL 1062
S ++ + + NY D EM A+I +LK WRHYL E +I TDH++L + E
Sbjct: 738 YSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESEPE 797
Query: 1063 NLRQRRWMELLKDYDCTILYHPGKANVVADALSRKSMGSLAHLTAIKRPIVKEFQEIVES 1122
N R RW L+D++ I Y PG AN +ADALSR IV E + I +
Sbjct: 798 NKRLARWQLFLQDFNFEINYRPGSANHIADALSR---------------IVDETEPIPKD 842
Query: 1123 GVQFEIDHSRTLLAHMKFRSTLIDDIKQAQSQDSELMKMVDNVRNGKVSNFSVDSEGVLW 1182
I+ + F++ ++ + + D++L+ +++N N + ++
Sbjct: 843 SEDNSINFVNQISITDDFKNQVVTE----YTNDTKLLNLLNNEDKRVEENIQLKDGLLIN 898
Query: 1183 LKSRICVPNVDDLRRKILEEAHHSSYTIHPGSNKMYQDLREFYWWEGMKRDVANFVSKCL 1242
K +I +PN L R I+++ H IHPG + + + W+G+++ + +V C
Sbjct: 899 SKDQILLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCH 958
Query: 1243 VCQQVKAEHQKPAGLLQPIEIPKWKWEGIAMDFVTGLPRTQKGFDSVWVIIDRLTKSAHF 1302
CQ K+ + KP G LQPI + WE ++MDF+T LP + G+++++V++DR +K A
Sbjct: 959 TCQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPES-SGYNALFVVVDRFSKMAIL 1017
Query: 1303 LPVKTTYTASQYAKIYLEEIVSLHGVPISIISDRGAQFTAQFWKSFQAALGTRLNLSTAF 1362
+P + TA Q A+++ + +++ G P II+D FT+Q WK F + S +
Sbjct: 1018 VPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPY 1077
Query: 1363 HPQTDGQSERTIQILEDMLRACVLDLGGSWDRYLPMMEFAYNNSYQSSIQMAPFEALYGR 1422
PQTDGQ+ERT Q +E +LR +W ++ +++ +YNN+ S+ QM PFE ++
Sbjct: 1078 RPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVH-- 1135
Query: 1423 RCRSPIGWFEVGEAKLVGPELIQDAIDKVKLIRDRLVTAQSRQKSYSDKRRRPL-EFTVG 1481
R + E+ E Q+ I + +++ L T + K Y D + + + EF G
Sbjct: 1136 RYSPALSPLELPSFSDKTDENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPG 1195
Query: 1482 EHVFLRVSPMKGVLRFGKKGKLTPRFIGPFEILERVGPVAYRLALPPDLSRV-HPVFHIS 1540
+ V ++ G L K KL P F GPF +L++ GP Y L LP + + FH+S
Sbjct: 1196 DLVMVK-RTKTGFLH--KSNKLAPSFAGPFYVLQKSGPNNYELDLPDSIKHMFSSTFHVS 1252
Query: 1541 MLRKY 1545
L KY
Sbjct: 1253 HLEKY 1257
>RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protein
type 3
Length = 1333
Score = 496 bits (1278), Expect = e-139
Identities = 293/905 (32%), Positives = 480/905 (52%), Gaps = 42/905 (4%)
Query: 656 EFSDVFPE----ELPGIPPDREIEFSIDLIPGTQPISIPPYRMAPAELKELREQLQDLLD 711
EF D+ E +LP P + +EF ++L + I Y + P +++ + +++ L
Sbjct: 380 EFKDITAETNTEKLP--KPIKGLEFEVELTQENYRLPIRNYPLPPGKMQAMNDEINQGLK 437
Query: 712 KGFIRASTSPWGAPVLFVKKKDGSMRLCVDYRQLNKVTIKNKYPLPRIDELFDQLQGAQC 771
G IR S + PV+FV KK+G++R+ VDY+ LNK N YPLP I++L ++QG+
Sbjct: 438 SGIIRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTI 497
Query: 772 FSKIDLRSGYHQLKIKSEDISKTAFRTRYGHYEFLVMSFGLTNAPAAFMDLMNRVFKPFL 831
F+K+DL+S YH ++++ D K AFR G +E+LVM +G++ APA F +N +
Sbjct: 498 FTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISIAPAHFQYFINTILGEVK 557
Query: 832 DRFVIVFIDDILIYSKSREEHEQHLRLVLQTLREKQLYAKFSKCEFWLESVAFLGHIVSK 891
+ V+ ++D+ILI+SKS EH +H++ VLQ L+ L +KCEF V F+G+ +S+
Sbjct: 558 ESHVVCYMDNILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISE 617
Query: 892 NGISVDPSKVEAVQNWPRPTSVKEIRSFLGLAGYYRRFVKDFSKLAFPLTRLTQKKVEFK 951
G + ++ V W +P + KE+R FLG Y R+F+ S+L PL L +K V +K
Sbjct: 618 KGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWK 677
Query: 952 WTDACEESFQKLKECLISAPILALPTSGGGYVVYCDASRVGLGCVLMQHGK-----VIAY 1006
WT ++ + +K+CL+S P+L ++ DAS V +G VL Q + Y
Sbjct: 678 WTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGY 737
Query: 1007 ASRQLKRHEQNYPTHDLEMAAVIFALKIWRHYLYG--ETCEIYTDHKSL--KYIFEQRDL 1062
S ++ + + NY D EM A+I +LK WRHYL E +I TDH++L + E
Sbjct: 738 YSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESEPE 797
Query: 1063 NLRQRRWMELLKDYDCTILYHPGKANVVADALSRKSMGSLAHLTAIKRPIVKEFQEIVES 1122
N R RW L+D++ I Y PG AN +ADALSR IV E + I +
Sbjct: 798 NKRLARWQLFLQDFNFEINYRPGSANHIADALSR---------------IVDETEPIPKD 842
Query: 1123 GVQFEIDHSRTLLAHMKFRSTLIDDIKQAQSQDSELMKMVDNVRNGKVSNFSVDSEGVLW 1182
I+ + F++ ++ + + D++L+ +++N N + ++
Sbjct: 843 SEDNSINFVNQISITDDFKNQVVTE----YTNDTKLLNLLNNEDKRVEENIQLKDGLLIN 898
Query: 1183 LKSRICVPNVDDLRRKILEEAHHSSYTIHPGSNKMYQDLREFYWWEGMKRDVANFVSKCL 1242
K +I +PN L R I+++ H IHPG + + + W+G+++ + +V C
Sbjct: 899 SKDQILLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCH 958
Query: 1243 VCQQVKAEHQKPAGLLQPIEIPKWKWEGIAMDFVTGLPRTQKGFDSVWVIIDRLTKSAHF 1302
CQ K+ + KP G LQPI + WE ++MDF+T LP + G+++++V++DR +K A
Sbjct: 959 TCQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPES-SGYNALFVVVDRFSKMAIL 1017
Query: 1303 LPVKTTYTASQYAKIYLEEIVSLHGVPISIISDRGAQFTAQFWKSFQAALGTRLNLSTAF 1362
+P + TA Q A+++ + +++ G P II+D FT+Q WK F + S +
Sbjct: 1018 VPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPY 1077
Query: 1363 HPQTDGQSERTIQILEDMLRACVLDLGGSWDRYLPMMEFAYNNSYQSSIQMAPFEALYGR 1422
PQTDGQ+ERT Q +E +LR +W ++ +++ +YNN+ S+ QM PFE ++
Sbjct: 1078 RPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVH-- 1135
Query: 1423 RCRSPIGWFEVGEAKLVGPELIQDAIDKVKLIRDRLVTAQSRQKSYSDKRRRPL-EFTVG 1481
R + E+ E Q+ I + +++ L T + K Y D + + + EF G
Sbjct: 1136 RYSPALSPLELPSFSDKTDENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPG 1195
Query: 1482 EHVFLRVSPMKGVLRFGKKGKLTPRFIGPFEILERVGPVAYRLALPPDLSRV-HPVFHIS 1540
+ V ++ G L K KL P F GPF +L++ GP Y L LP + + FH+S
Sbjct: 1196 DLVMVK-RTKTGFLH--KSNKLAPSFAGPFYVLQKSGPNNYELDLPDSIKHMFSSTFHVS 1252
Query: 1541 MLRKY 1545
L KY
Sbjct: 1253 HLEKY 1257
>RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protein
type 2
Length = 1333
Score = 496 bits (1278), Expect = e-139
Identities = 293/905 (32%), Positives = 480/905 (52%), Gaps = 42/905 (4%)
Query: 656 EFSDVFPE----ELPGIPPDREIEFSIDLIPGTQPISIPPYRMAPAELKELREQLQDLLD 711
EF D+ E +LP P + +EF ++L + I Y + P +++ + +++ L
Sbjct: 380 EFKDITAETNTEKLP--KPIKGLEFEVELTQENYRLPIRNYPLPPGKMQAMNDEINQGLK 437
Query: 712 KGFIRASTSPWGAPVLFVKKKDGSMRLCVDYRQLNKVTIKNKYPLPRIDELFDQLQGAQC 771
G IR S + PV+FV KK+G++R+ VDY+ LNK N YPLP I++L ++QG+
Sbjct: 438 SGIIRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTI 497
Query: 772 FSKIDLRSGYHQLKIKSEDISKTAFRTRYGHYEFLVMSFGLTNAPAAFMDLMNRVFKPFL 831
F+K+DL+S YH ++++ D K AFR G +E+LVM +G++ APA F +N +
Sbjct: 498 FTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISIAPAHFQYFINTILGEVK 557
Query: 832 DRFVIVFIDDILIYSKSREEHEQHLRLVLQTLREKQLYAKFSKCEFWLESVAFLGHIVSK 891
+ V+ ++D+ILI+SKS EH +H++ VLQ L+ L +KCEF V F+G+ +S+
Sbjct: 558 ESHVVCYMDNILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISE 617
Query: 892 NGISVDPSKVEAVQNWPRPTSVKEIRSFLGLAGYYRRFVKDFSKLAFPLTRLTQKKVEFK 951
G + ++ V W +P + KE+R FLG Y R+F+ S+L PL L +K V +K
Sbjct: 618 KGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWK 677
Query: 952 WTDACEESFQKLKECLISAPILALPTSGGGYVVYCDASRVGLGCVLMQHGK-----VIAY 1006
WT ++ + +K+CL+S P+L ++ DAS V +G VL Q + Y
Sbjct: 678 WTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGY 737
Query: 1007 ASRQLKRHEQNYPTHDLEMAAVIFALKIWRHYLYG--ETCEIYTDHKSL--KYIFEQRDL 1062
S ++ + + NY D EM A+I +LK WRHYL E +I TDH++L + E
Sbjct: 738 YSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESEPE 797
Query: 1063 NLRQRRWMELLKDYDCTILYHPGKANVVADALSRKSMGSLAHLTAIKRPIVKEFQEIVES 1122
N R RW L+D++ I Y PG AN +ADALSR IV E + I +
Sbjct: 798 NKRLARWQLFLQDFNFEINYRPGSANHIADALSR---------------IVDETEPIPKD 842
Query: 1123 GVQFEIDHSRTLLAHMKFRSTLIDDIKQAQSQDSELMKMVDNVRNGKVSNFSVDSEGVLW 1182
I+ + F++ ++ + + D++L+ +++N N + ++
Sbjct: 843 SEDNSINFVNQISITDDFKNQVVTE----YTNDTKLLNLLNNEDKRVEENIQLKDGLLIN 898
Query: 1183 LKSRICVPNVDDLRRKILEEAHHSSYTIHPGSNKMYQDLREFYWWEGMKRDVANFVSKCL 1242
K +I +PN L R I+++ H IHPG + + + W+G+++ + +V C
Sbjct: 899 SKDQILLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCH 958
Query: 1243 VCQQVKAEHQKPAGLLQPIEIPKWKWEGIAMDFVTGLPRTQKGFDSVWVIIDRLTKSAHF 1302
CQ K+ + KP G LQPI + WE ++MDF+T LP + G+++++V++DR +K A
Sbjct: 959 TCQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPES-SGYNALFVVVDRFSKMAIL 1017
Query: 1303 LPVKTTYTASQYAKIYLEEIVSLHGVPISIISDRGAQFTAQFWKSFQAALGTRLNLSTAF 1362
+P + TA Q A+++ + +++ G P II+D FT+Q WK F + S +
Sbjct: 1018 VPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPY 1077
Query: 1363 HPQTDGQSERTIQILEDMLRACVLDLGGSWDRYLPMMEFAYNNSYQSSIQMAPFEALYGR 1422
PQTDGQ+ERT Q +E +LR +W ++ +++ +YNN+ S+ QM PFE ++
Sbjct: 1078 RPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVH-- 1135
Query: 1423 RCRSPIGWFEVGEAKLVGPELIQDAIDKVKLIRDRLVTAQSRQKSYSDKRRRPL-EFTVG 1481
R + E+ E Q+ I + +++ L T + K Y D + + + EF G
Sbjct: 1136 RYSPALSPLELPSFSDKTDENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPG 1195
Query: 1482 EHVFLRVSPMKGVLRFGKKGKLTPRFIGPFEILERVGPVAYRLALPPDLSRV-HPVFHIS 1540
+ V ++ G L K KL P F GPF +L++ GP Y L LP + + FH+S
Sbjct: 1196 DLVMVK-RTKTGFLH--KSNKLAPSFAGPFYVLQKSGPNNYELDLPDSIKHMFSSTFHVS 1252
Query: 1541 MLRKY 1545
L KY
Sbjct: 1253 HLEKY 1257
>POL3_DROME (P04323) Retrovirus-related Pol polyprotein from
transposon 17.6 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1058
Score = 357 bits (916), Expect = 1e-97
Identities = 200/599 (33%), Positives = 334/599 (55%), Gaps = 45/599 (7%)
Query: 692 YRMAPAELKELREQLQDLLDKGFIRASTSPWGAPVLFV-KKKDGS----MRLCVDYRQLN 746
Y A +E+ Q+QD+L++G IR S SP+ +P+ V KK+D S R+ +DYR+LN
Sbjct: 213 YSYPQAYEQEVESQIQDMLNQGIIRTSNSPYNSPIWVVPKKQDASGKQKFRIVIDYRKLN 272
Query: 747 KVTIKNKYPLPRIDELFDQLQGAQCFSKIDLRSGYHQLKIKSEDISKTAFRTRYGHYEFL 806
++T+ +++P+P +DE+ +L F+ IDL G+HQ+++ E +SKTAF T++GHYE+L
Sbjct: 273 EITVGDRHPIPNMDEILGKLGRCNYFTTIDLAKGFHQIEMDPESVSKTAFSTKHGHYEYL 332
Query: 807 VMSFGLTNAPAAFMDLMNRVFKPFLDRFVIVFIDDILIYSKSREEHEQHLRLVLQTLREK 866
M FGL NAPA F MN + +P L++ +V++DDI+++S S +EH Q L LV + L +
Sbjct: 333 RMPFGLKNAPATFQRCMNDILRPLLNKHCLVYLDDIIVFSTSLDEHLQSLGLVFEKLAKA 392
Query: 867 QLYAKFSKCEFWLESVAFLGHIVSKNGISVDPSKVEAVQNWPRPTSVKEIRSFLGLAGYY 926
L + KCEF + FLGH+++ +GI +P K+EA+Q +P PT KEI++FLGL GYY
Sbjct: 393 NLKLQLDKCEFLKQETTFLGHVLTPDGIKPNPEKIEAIQKYPIPTKPKEIKAFLGLTGYY 452
Query: 927 RRFVKDFSKLAFPLTRLTQKKVEFKWTD-ACEESFQKLKECLISAPILALPTSGGGYVVY 985
R+F+ +F+ +A P+T+ +K ++ T+ + +F+KLK + PIL +P + +
Sbjct: 453 RKFIPNFADIAKPMTKCLKKNMKIDTTNPEYDSAFKKLKYLISEDPILKVPDFTKKFTLT 512
Query: 986 CDASRVGLGCVLMQHGKVIAYASRQLKRHEQNYPTHDLEMAAVIFALKIWRHYLYGETCE 1045
DAS V LG VL Q G ++Y SR L HE NY T + E+ A+++A K +RHYL G E
Sbjct: 513 TDASDVALGAVLSQDGHPLSYISRTLNEHEINYSTIEKELLAIVWATKTFRHYLLGRHFE 572
Query: 1046 IYTDHKSLKYIFEQRDLNLRQRRWMELLKDYDCTILYHPGKANVVADALSR--------- 1096
I +DH+ L +++ +D N + RW L ++D I Y GK N VADALSR
Sbjct: 573 ISSDHQPLSWLYRMKDPNSKLTRWRVKLSEFDFDIKYIKGKENCVADALSRIKLEETYLS 632
Query: 1097 --------KSMGSLAHLTAIKRPIVKEFQEIVESGVQFEIDHSRTLLAHMKFRSTLIDDI 1148
+ L +T +RP+ ++++ S +I ++ H+ + + DI
Sbjct: 633 EQTQHSAEEDNSDLIFIT--ERPLNTFNRQVIFSKGPPDIKVTKYFKKHI---TQIFYDI 687
Query: 1149 KQAQSQDSELMKMVDNVRNG-------------KVSNFSVDSEGVLWLKSRICVPNVDDL 1195
+ + L+ ++ +++++ L+S I + N+
Sbjct: 688 MTREKAEQYLIDHFCGKKSALYIESDADFEVIQAAHKLAINTKYTKILRSTILLKNITTY 747
Query: 1196 R--RKILEEAHHSSYTIHPGSNKMYQDLREFYWWEGMKRDVANFVSKCLVCQQVKAEHQ 1252
++++ AH +HPG K + E Y++ + + N +++C +C K EH+
Sbjct: 748 AEFKELILTAHEK--LLHPGIQKTTKLFGETYYFPNSQLLIQNIINECSICNLAKTEHR 804
>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
Length = 2186
Score = 352 bits (903), Expect = 5e-96
Identities = 264/904 (29%), Positives = 438/904 (48%), Gaps = 68/904 (7%)
Query: 653 IVCEFSDVFPEELPGIPPDREIEFSIDLIPGTQPISIPPYRMAPAELK-ELREQLQDLLD 711
++ +F DVF + + E I+L G +PI P R P LK E+R+ +Q +L+
Sbjct: 909 VIEQFQDVFAISDDELGRNSGTECVIELKEGAEPIRQKP-RPIPLALKPEIRKMIQKMLN 967
Query: 712 KGFIRASTSPWGAPVLFVKKKDGSMRLCVDYRQLNKVTIKNKYPLPRIDELFDQLQGAQC 771
+ IR S SPW +PV+ VKKKDGS+R+C+DYR++NKV N +PLP I+ L G +
Sbjct: 968 QKVIRESKSPWSSPVVLVKKKDGSIRMCIDYRKVNKVVKNNAHPLPNIEATLQSLAGKKL 1027
Query: 772 FSKIDLRSGYHQLKIKSEDISKTAFRTRYGHYEFLVMSFGLTNAPAAFMDLMNRVFKPFL 831
++ D+ +G+ Q+ + + TAF +E+ V+ FGL +PA F M + L
Sbjct: 1028 YTVFDMIAGFWQIPLDEKSKEITAFAIGSELFEWNVLPFGLVISPALFQGTMEEIIGDLL 1087
Query: 832 DRFVIVFIDDILIYSKSREEHEQHLRLVLQTLREKQLYAKFSKCEFWLESVAFLGHIVSK 891
V++DD+LI SK E+H Q ++ L +R+ + + SKC + V +LGH V+
Sbjct: 1088 GVCAFVYVDDLLIASKDMEQHLQDVKEALTRIRKSGMKLRASKCHIAKKEVEYLGHKVTL 1147
Query: 892 NGISVDPSKVEAVQNWPRPTSVKEIRSFLGLAGYYRRFVKDFSKLAFPLTRLTQKKVEFK 951
+G+ K + ++ + RPT+VKE++SFLGL GYYR+F+ +F+++A LT L KV +
Sbjct: 1148 DGVETQEVKTDKMKQFSRPTNVKELQSFLGLVGYYRKFILNFAQIASSLTSLISAKVAWI 1207
Query: 952 WTDACEESFQKLKECLISAPILALPTSGGG------YVVYCDASRVGLGCVLMQHG---- 1001
W E +FQ+LK+ + P+LA P +++Y DASR G+G VL Q G
Sbjct: 1208 WEKEQEIAFQELKKLVCQTPVLAQPDVEAALKGDRPFMIYTDASRKGIGAVLAQEGPDGQ 1267
Query: 1002 -KVIAYASRQLKRHEQNYPTHDLEMAAVIFALKIWRHYLYGETCEIYTDHKSLKYIFEQR 1060
IA+AS+ L E Y DLE A++FAL+ ++ +YG ++TDHK L + +
Sbjct: 1268 QHPIAFASKALSPAETRYHITDLEALAMMFALRRFKTIIYGTAITVFTDHKPLISLLKGS 1327
Query: 1061 DLNLRQRRWMELLKDYDCTILYHPGKANVVADALSRKSM-------GSLAHLTAIKRPIV 1113
L R RW + ++D I+Y GKAN VADALSR LT+I I
Sbjct: 1328 PLADRLWRWSIEILEFDVKIVYLAGKANAVADALSRGGCPPNELEEEQTKELTSIVNAIQ 1387
Query: 1114 KEFQEIVESG-----VQFEIDHSRTLLAHMKFRSTLIDDIKQAQSQDSELMKMVDNVRNG 1168
E +I++S ++ E + + ++A ++ T + +SE+ + G
Sbjct: 1388 TELPDILDSSCWLERLKGEDEGWKEVIAALEGGKT--KGTFKIVGIESEISLEYYKIVGG 1445
Query: 1169 KVSNFSVDSEGVLWLKSRICVPNVDDLRRKILEEAHHSSYTIHPGSNKMYQDLREFYWWE 1228
+ N ++ + SR VP + +R +L+E H H G KM++ + ++W
Sbjct: 1446 VLKNTEIEEQ------SRSVVP--EKIRTPLLKELHEGMLAGHFGIKKMWRMVHRKFYWP 1497
Query: 1229 GMKRDVANFVSKCLVCQQVKAEHQKPAGLLQPIEIPKWKWEGIAMDFV-TGLPRTQKGFD 1287
M+ V N V C C +H K L P + + E +A D + GL + +G
Sbjct: 1498 QMRVCVENCVRTCAKCLCAN-DHSKLTSSLTPYRM-TFPLEIVACDLMDVGL--SVQGNR 1553
Query: 1288 SVWVIIDRLTKSAHFLPVKTTYTASQYAKIYLEEIVSLHG-VPISIISDRGAQFTAQFWK 1346
+ IID TK +P+ A K ++E G +P+ +++D+G +F +
Sbjct: 1554 YILTIIDLFTKYGTAVPIPDK-KAETVLKAFVERWAIGEGRIPLKLLTDQGKEFVNGLFA 1612
Query: 1347 SFQAALGTRLNLSTAFHPQTDGQSER-TIQILEDMLRACVLDLGGSWDRYLPMMEFAYNN 1405
F L + ++ + +G ER I+ M + + + WD + +AYNN
Sbjct: 1613 QFTHMLKIEHITTKGYNSRANGAVERFNKTIMHIMKKKTAVPM--EWDDQVVYAVYAYNN 1670
Query: 1406 SYQSSIQMAPFEALYGRRCRSP----------IGWFEVGEAKLVGPELIQDAIDKVKLIR 1455
+ P ++GR P I + ++ E K + L Q+ + K+ +
Sbjct: 1671 CVHENTGETPMFLMHGRDVMGPLEMSGEDAVGINYADMDEYKHL---LTQELLKVQKIAK 1727
Query: 1456 DRLVTAQSRQKS-----YSDKRRRPLEFTVGEHVFLRVSPMKGVLRFGKKGKLTPRFIGP 1510
+ + Q KS Y+ K+ R + G V L + K + KL ++ GP
Sbjct: 1728 EHAMREQESYKSLFDQKYASKKHRFPQ--PGSRVLLEIPSEK---LGAQCPKLVNKWSGP 1782
Query: 1511 FEIL 1514
+ ++
Sbjct: 1783 YRVI 1786
Score = 39.3 bits (90), Expect = 0.089
Identities = 20/61 (32%), Positives = 31/61 (50%), Gaps = 5/61 (8%)
Query: 369 RHSSGTTRACSVCGR--FHSGTCFN-DDRECFQCGQKGHIKKNCPIPTTQPSSSRASAPV 425
+ ++ C C + +H +CF +R CF+C + GHI NC P ++S APV
Sbjct: 561 KSKDNASQKCDECQQSGWHMASCFKLKNRACFRCNEMGHIAWNC--PKKNENTSEKEAPV 618
Query: 426 A 426
A
Sbjct: 619 A 619
>POL2_DROME (P20825) Retrovirus-related Pol polyprotein from
transposon 297 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1059
Score = 348 bits (892), Expect = 9e-95
Identities = 233/732 (31%), Positives = 378/732 (50%), Gaps = 48/732 (6%)
Query: 686 PISIPPYRMAPAELKELREQLQDLLDKGFIRASTSPWGAPVLFVKKKDGSM-----RLCV 740
PI Y +A E+ Q+Q++L++G IR S SP+ +P V KK + R+ +
Sbjct: 206 PIYSKQYPLAQTHEIEVENQVQEMLNQGLIRESNSPYNSPTWVVPKKPDASGANKYRVVI 265
Query: 741 DYRQLNKVTIKNKYPLPRIDELFDQLQGAQCFSKIDLRSGYHQLKIKSEDISKTAFRTRY 800
DYR+LN++TI ++YP+P +DE+ +L Q F+ IDL G+HQ+++ E ISKTAF T+
Sbjct: 266 DYRKLNEITIPDRYPIPNMDEILGKLGKCQYFTTIDLAKGFHQIEMDEESISKTAFSTKS 325
Query: 801 GHYEFLVMSFGLTNAPAAFMDLMNRVFKPFLDRFVIVFIDDILIYSKSREEHEQHLRLVL 860
GHYE+L M FGL NAPA F MN + +P L++ +V++DDI+I+S S EH ++LV
Sbjct: 326 GHYEYLRMPFGLRNAPATFQRCMNNILRPLLNKHCLVYLDDIIIFSTSLTEHLNSIQLVF 385
Query: 861 QTLREKQLYAKFSKCEFWLESVAFLGHIVSKNGISVDPSKVEAVQNWPRPTSVKEIRSFL 920
L + L + KCEF + FLGHIV+ +GI +P KV+A+ ++P PT KEIR+FL
Sbjct: 386 TKLADANLKLQLDKCEFLKKEANFLGHIVTPDGIKPNPIKVKAIVSYPIPTKDKEIRAFL 445
Query: 921 GLAGYYRRFVKDFSKLAFPLTRLTQKKVEFKWTDACE--ESFQKLKECLISAPILALPTS 978
GL GYYR+F+ +++ +A P+T +K+ + T E E+F+KLK +I PIL LP
Sbjct: 446 GLTGYYRKFIPNYADIAKPMTSCLKKRTKID-TQKLEYIEAFEKLKALIIRDPILQLPDF 504
Query: 979 GGGYVVYCDASRVGLGCVLMQHGKVIAYASRQLKRHEQNYPTHDLEMAAVIFALKIWRHY 1038
+V+ DAS + LG VL Q+G I++ SR L HE NY + E+ A+++A K +RHY
Sbjct: 505 EKKFVLTTDASNLALGAVLSQNGHPISFISRTLNDHELNYSAIEKELLAIVWATKTFRHY 564
Query: 1039 LYGETCEIYTDHKSLKYIFEQRDLNLRQRRWMELLKDYDCTILYHPGKANVVADALSRKS 1098
L G I +DH+ L+++ ++ + RW L +Y I Y GK N VADALSR
Sbjct: 565 LLGRQFLIASDHQPLRWLHNLKEPGAKLERWRVRLSEYQFKIDYIKGKENSVADALSRIK 624
Query: 1099 MGSLAHLTAIKRPIVKEFQEIV---ESGVQF------EIDHSRTLLAHMKFRSTLIDDIK 1149
+ H A + ++ ++ E + + I + + H K I I+
Sbjct: 625 IEENHHSEATQHSAEEDNSNLIHLTEKPINYFKKQIIFIKSDKNKVEHSKIFGNSITTIQ 684
Query: 1150 -QAQSQDSELMKMVDNVRNGKVSNF---SVDSEGV-------------LWLKSRICVPNV 1192
+ + ++D+ + ++ + VD E V ++S + NV
Sbjct: 685 YDVMTLEKAKQILLDHFIHRNITIYIESDVDFEIVQRAHIEIVNTTYTKVIRSLFLLKNV 744
Query: 1193 DDLR--RKILEEAHHSSYTIHPGSNKMYQDLREFYWWEGMKRDVANFVSKCLVCQQVKAE 1250
++I+ ++H +HPG KM + +E +++ + + N +++C +C K E
Sbjct: 745 GSYAEFKEIILQSHEK--LLHPGIQKMTKLFKENHFFPNSQLLIQNIINECNICNLAKTE 802
Query: 1251 HQKPAGLLQPIEIPKWKWEGIAMDFVTGLPRTQKGFDSVWVIIDRLTKSAHFLPVKTTYT 1310
H+ L+ P+ E +D + K + S ID +K A +KT
Sbjct: 803 HRNTKMPLKITPNPEHCREKFVVDIYSS---EGKHYIS---CIDIYSKFATLEQIKTKDW 856
Query: 1311 ASQYAKIYLEEIVSLHGVPISIISDRGAQFTAQFWKSFQAALGTRLNLSTAFHPQTDGQS 1370
+ L I + G P + +DR F++ K + L L+TA + D
Sbjct: 857 IE--CRNALMRIFNQLGKPKLLKADRDGAFSSLALKRWLEEEEVELQLNTAKNGVAD--V 912
Query: 1371 ERTIQILEDMLR 1382
ER + + + +R
Sbjct: 913 ERLHKTINEKIR 924
>POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from
transposon opus [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1003
Score = 339 bits (870), Expect = 3e-92
Identities = 276/928 (29%), Positives = 434/928 (46%), Gaps = 105/928 (11%)
Query: 564 VVLNMVDFDVILGMDWLSLHHATVDCHNKVVKFKPPGEATFSFQGERSWVPNNLISSLRA 623
V+ N+ FD I+G D L A VD N + P + +P +S+
Sbjct: 24 VLPNLHSFDGIIGDDTLKDLKAIVDRKNNCLIITPGIK-----------IPLLARASINV 72
Query: 624 NKLLSRGCQGYLALVRDVQAGEEKLENVPIVCEFSDVFPEELPGIPPDREIEFSIDLIPG 683
N LL+ + G +++ N ++ EF +F L G+ + ++ I
Sbjct: 73 NPLLAA----------EHPDGTQEILN-SLLGEFPRIFEPPLSGMSVETAVKAEIRT--N 119
Query: 684 TQ-PISIPPYRMAPAELKELREQLQDLLDKGFIRASTSPWGAPVLFVKKK-----DGSMR 737
TQ PI Y E+ Q+ +LL G IR S SP+ +P+ V KK + R
Sbjct: 120 TQDPIYAKSYPYPVNMRGEVERQIDELLQDGIIRPSNSPYNSPIWIVPKKPKPNGEKQYR 179
Query: 738 LCVDYRQLNKVTIKNKYPLPRIDELFDQLQGAQCFSKIDLRSGYHQLKIKSEDISKTAFR 797
+ VD+++LN VTI + YP+P I+ L A+ F+ +DL SG+HQ+ +K DI KTAF
Sbjct: 180 MVVDFKRLNTVTIPDTYPIPDINATLASLGNAKYFTTLDLTSGFHQIHMKESDIPKTAFS 239
Query: 798 TRYGHYEFLVMSFGLTNAPAAFMDLMNRVFKPFLDRFVIVFIDDILIYSKSREEHEQHLR 857
T G YEFL + FGL NAPA F +++ + + + + V+IDDI+++S+ + H ++LR
Sbjct: 240 TLNGKYEFLRLPFGLKNAPAIFQRMIDDILREHIGKVCYVYIDDIIVFSEDYDTHWKNLR 299
Query: 858 LVLQTLREKQLYAKFSKCEFWLESVAFLGHIVSKNGISVDPSKVEAVQNWPRPTSVKEIR 917
LVL +L + L K F V FLG+IV+ +GI DP KV A+ P PTSVKE++
Sbjct: 300 LVLASLSKANLQVNLEKSHFLDTQVEFLGYIVTADGIKADPKKVRAISEMPPPTSVKELK 359
Query: 918 SFLGLAGYYRRFVKDFSKLAFPLTRLTQ-----------KKVEFKWTDACEESFQKLKEC 966
FLG+ YYR+F++D++K+A PLT LT+ KV + +SF LK
Sbjct: 360 RFLGMTSYYRKFIQDYAKVAKPLTNLTRGLYANIKSSQSSKVPITLDETALQSFNDLKSI 419
Query: 967 LISAPILALPTSGGGYVVYCDASRVGLGCVLMQ----HGKVIAYASRQLKRHEQNYPTHD 1022
L S+ ILA P + + DAS +G VL Q + IAY SR L + E+NY T +
Sbjct: 420 LCSSEILAFPCFTKPFHLTTDASNWAIGAVLSQDDQGRDRPIAYISRSLNKTEENYATIE 479
Query: 1023 LEMAAVIFALKIWRHYLYGE-TCEIYTDHKSLKYIFEQRDLNLRQRRWMELLKDYDCTIL 1081
EM A+I++L R YLYG T ++YTDH+ L + R+ N + +RW +++Y+C ++
Sbjct: 480 KEMLAIIWSLDNLRAYLYGAGTIKVYTDHQPLTFALGNRNFNAKLKRWKARIEEYNCELI 539
Query: 1082 YHPGKANVVADALSR-----------------KSMGSLA------HLTAIKRPIVKEFQE 1118
Y PGK+NVVADALSR M SLA H ++ P V+
Sbjct: 540 YKPGKSNVVADALSRIPPQLNQLSTDLDANPEDDMQSLATAHSALHDSSRLIPHVESPIN 599
Query: 1119 IVESGVQFEIDHSRTLLAH---------MKFRSTLIDDIKQAQSQDSELMKMVDN---VR 1166
+ ++ + F+ S+ L H + + + D+ S S L ++ N +
Sbjct: 600 VFKNQLIFDTTRSKYLCEHPFPGYTRHLIPLKDGSLADL--TNSLQSCLRPVIINGVKIP 657
Query: 1167 NGKVSNF-SVDSEGVLWLKSRICVPNVDDLRR-----KILEEAHHSSYTIHPGSNKMYQD 1220
+ F S+ L K RI V D+ +I+E+ H + H G ++
Sbjct: 658 EAHLQRFQSICLANFLLYKIRITQRLVADVSGAEEICEIIEKEHRRA---HRGPTEIRLQ 714
Query: 1221 LREFYWWEGMKRDVANFVSKCLVCQQVKAEHQKPAGLLQPIEIPKWKWEGIAMDFVTGLP 1280
L E Y++ M + S C C+ K E LQP IP + E + +D
Sbjct: 715 LLEKYYFPRMSSTIRLQTSSCQCCKLYKYERHPNKPNLQPTPIPNYPCEILHIDIFALEK 774
Query: 1281 RTQKGFDSVWVIIDRLTKSAHFLPVKTTYTASQYAKIYLEEIVSLHGVPISIISDRGAQF 1340
R ID+ +K A +++ AS + + L E + P ++SD
Sbjct: 775 RLYLS------CIDKFSKFAKLFHLQS--KASVHLRETLVEALHYFTAPKVLVSDNERGL 826
Query: 1341 TAQFWKSFQAALGTRLNLSTAFHPQTDGQSERTIQILEDMLRACVLDLGGSWDRYLPMME 1400
++ +L L + + +GQ ER ++ R C+ D ++ + + ++
Sbjct: 827 LCPTVLNYLRSLDIDLYYAPTQKSEVNGQVERFHSTFLEIYR-CLKDELPTF-KPVELVH 884
Query: 1401 FA---YNNSYQSSIQMAPFEALYGRRCR 1425
A YN S S P + + R R
Sbjct: 885 IAVDRYNTSVHSVTNRKPADVFFDRSSR 912
>POLY_DROME (P10401) Retrovirus-related Pol polyprotein from
transposon gypsy [Contains: Reverse transcriptase (EC
2.7.7.49); Endonuclease]
Length = 1035
Score = 302 bits (774), Expect = 4e-81
Identities = 233/856 (27%), Positives = 399/856 (46%), Gaps = 97/856 (11%)
Query: 702 LREQLQDLLDKGFIRASTSPWGAPVLFVKKK------DGSMRLCVDYRQLNKVTIKNKYP 755
+ +++ LL G IR S SP+ +P V KK + + RL +D+R+LN+ TI ++YP
Sbjct: 197 VNNEVKQLLKDGIIRPSRSPYNSPTWVVDKKGTDAFGNPNKRLVIDFRKLNEKTIPDRYP 256
Query: 756 LPRIDELFDQLQGAQCFSKIDLRSGYHQLKIKSEDISKTAFRTRYGHYEFLVMSFGLTNA 815
+P I + L A+ F+ +DL+SGYHQ+ + D KT+F G YEF + FGL NA
Sbjct: 257 MPSIPMILANLGKAKFFTTLDLKSGYHQIYLAEHDREKTSFSVNGGKYEFCRLPFGLRNA 316
Query: 816 PAAFMDLMNRVFKPFLDRFVIVFIDDILIYSKSREEHEQHLRLVLQTLREKQLYAKFSKC 875
+ F ++ V + + + V++DD++I+S++ +H +H+ VL+ L + + K
Sbjct: 317 SSIFQRALDDVLREQIGKICYVYVDDVIIFSENESDHVRHIDTVLKCLIDANMRVSQEKT 376
Query: 876 EFWLESVAFLGHIVSKNGISVDPSKVEAVQNWPRPTSVKEIRSFLGLAGYYRRFVKDFSK 935
F+ ESV +LG IVSK+G DP KV+A+Q +P P V ++RSFLGLA YYR F+KDF+
Sbjct: 377 RFFKESVEYLGFIVSKDGTKSDPEKVKAIQEYPEPDCVYKVRSFLGLASYYRVFIKDFAA 436
Query: 936 LAFPLTRLTQ-----------KKVEFKWTDACEESFQKLKECLISAP-ILALPTSGGGYV 983
+A P+T + + KK+ ++ + +FQ+L+ L S IL P +
Sbjct: 437 IARPITDILKGENGSVSKHMSKKIPVEFNETQRNAFQRLRNILASEDVILKYPDFKKPFD 496
Query: 984 VYCDASRVGLGCVLMQHGKVIAYASRQLKRHEQNYPTHDLEMAAVIFALKIWRHYLYG-E 1042
+ DAS G+G VL Q G+ I SR LK+ EQNY T++ E+ A+++AL +++LYG
Sbjct: 497 LTTDASASGIGAVLSQEGRPITMISRTLKQPEQNYATNERELLAIVWALGKLQNFLYGSR 556
Query: 1043 TCEIYTDHKSLKYIFEQRDLNLRQRRWMELLKDYDCTILYHPGKANVVADALSRKSMGSL 1102
I+TDH+ L + R+ N + +RW + ++ + Y PGK N VADALSR+++ +L
Sbjct: 557 EINIFTDHQPLTFAVADRNTNAKIKRWKSYIDQHNAKVFYKPGKENFVADALSRQNLNAL 616
Query: 1103 AH------------------LTAIKRPIVKEFQEIVESGVQFEIDHSRTL-------LAH 1137
+ + +P+ +I+ +F + + L L
Sbjct: 617 QNEPQSDAATIHSELSLTYTVETTDKPLNCFRNQIILEAARFPLKRNLVLFRSKSRHLIS 676
Query: 1138 MKFRSTLIDDIKQAQSQDSEL-----MKMVDNVRNGKVSNFSVDSEGVLWLKSRICVPNV 1192
+S L+ +K+ + D + + + ++ +++F + + V ++
Sbjct: 677 FTDKSWLLKTLKEVVNPDVVNAIHCDLPTLASFQHDLIAHFPATQ----FRHCKNVVLDI 732
Query: 1193 DDLRRKI-LEEAHHSSYTIHPGSNKMYQDLREFYWWEGMKRDVANFVSKCLVCQQVKAEH 1251
D +I + A H+ H + + + + Y++ M V+ C VC Q K +
Sbjct: 733 TDKNEQIEIVTAEHN--RAHRAAQENIKQVLRDYYFPKMGSLAKEVVANCRVCTQAKYDR 790
Query: 1252 QKPAGLLQPIEIPKWKWEGIAMDFVTGLPRTQKGFDSVWVIIDRLTKSAHFLPVKTTYTA 1311
L IP + E + +D + +K F ID+ +K A PV +
Sbjct: 791 HPKKQELGETPIPSYTGEMVHIDIFS---TDRKLF---LTCIDKFSKYAIVQPVVSRTIV 844
Query: 1312 SQYAKIYLEEIVSLHGVPISIISDRGAQFTAQFWKS-FQAALGTRLNLSTAFHPQTDGQS 1370
A L +I++L ++ D F ++ S + + G + + H ++GQ
Sbjct: 845 DITAP--LLQIINLFPNIKTVYCDNEPAFNSETVTSMLKNSFGIDIVNAPPLHSSSNGQV 902
Query: 1371 ERTIQILEDMLRACVLD-LGGSWDRYLPMMEFAYNNSYQSSIQMAPFEALYGRRCRSPIG 1429
ER L ++ R LD + YN + S + P E ++
Sbjct: 903 ERFHSTLAEIARCLKLDKKTNDTVELILRATIEYNKTVHSVTRERPIEVVH--------- 953
Query: 1430 WFEVGEAKLVGPELIQDAIDKVKLIRDRLVTAQSRQKSYSDKRRRPLEFTVGEHVFLRVS 1489
A ++ I+ RLV AQ ++ R+ F VGE VF++ +
Sbjct: 954 ---------------PGAHERCLEIKARLVKAQQDSIGRNNPSRQNRVFEVGERVFVKNN 998
Query: 1490 PMKGVLRFGKKGKLTP 1505
G KLTP
Sbjct: 999 KRLG-------NKLTP 1007
>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
transposon 412 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1237
Score = 276 bits (707), Expect = 3e-73
Identities = 164/483 (33%), Positives = 257/483 (52%), Gaps = 13/483 (2%)
Query: 645 EEKLENVPIVCEFSDVFPEELPGIPPDREIEFSIDLIPGTQPISIPPYRMAPAELKELRE 704
+ +LEN I E+ D+F E I + + + L +P+ YR ++++E++
Sbjct: 276 KSQLEN--ICSEYIDIFALESEPITVNNLYKQQLRL-KDDEPVYTKNYRSPHSQVEEIQA 332
Query: 705 QLQDLLDKGFIRASTSPWGAPVLFVKKKDG------SMRLCVDYRQLNKVTIKNKYPLPR 758
Q+Q L+ + S S + +P+L V KK RL +DYRQ+NK + +K+PLPR
Sbjct: 333 QVQKLIKDKIVEPSVSQYNSPLLLVPKKSSPNSDKKKWRLVIDYRQINKKLLADKFPLPR 392
Query: 759 IDELFDQLQGAQCFSKIDLRSGYHQLKIKSEDISKTAFRTRYGHYEFLVMSFGLTNAPAA 818
ID++ DQL A+ FS +DL SG+HQ+++ T+F T G Y F + FGL AP +
Sbjct: 393 IDDILDQLGRAKYFSCLDLMSGFHQIELDEGSRDITSFSTSNGSYRFTRLPFGLKIAPNS 452
Query: 819 FMDLMNRVFKPFLDRFVIVFIDDILIYSKSREEHEQHLRLVLQTLREKQLYAKFSKCEFW 878
F +M F +++DD+++ S + ++L V RE L KC F+
Sbjct: 453 FQRMMTIAFSGIEPSQAFLYMDDLIVIGCSEKHMLKNLTEVFGKCREYNLKLHPEKCSFF 512
Query: 879 LESVAFLGHIVSKNGISVDPSKVEAVQNWPRPTSVKEIRSFLGLAGYYRRFVKDFSKLAF 938
+ V FLGH + GI D K + +QN+P P R F+ YYRRF+K+F+ +
Sbjct: 513 MHEVTFLGHKCTDKGILPDDKKYDVIQNYPVPHDADSARRFVAFCNYYRRFIKNFADYSR 572
Query: 939 PLTRLTQKKVEFKWTDACEESFQKLKECLISAPILALPTSGGGYVVYCDASRVGLGCVLM 998
+TRL +K V F+WTD C+++F LK LI+ +L P + + DAS+ G VL
Sbjct: 573 HITRLCKKNVPFEWTDECQKAFIHLKSQLINPTLLQYPDFSKEFCITTDASKQACGAVLT 632
Query: 999 Q----HGKVIAYASRQLKRHEQNYPTHDLEMAAVIFALKIWRHYLYGETCEIYTDHKSLK 1054
Q H +AYASR + E N T + E+AA+ +A+ +R Y+YG+ + TDH+ L
Sbjct: 633 QNHNGHQLPVAYASRAFTKGESNKSTTEQELAAIHWAIIHFRPYIYGKHFTVKTDHRPLT 692
Query: 1055 YIFEQRDLNLRQRRWMELLKDYDCTILYHPGKANVVADALSRKSMGSLAHLTAIKRPIVK 1114
Y+F + + + R L++Y+ T+ Y GK N VADALSR ++ L +T +
Sbjct: 693 YLFSMVNPSSKLTRIRLELEEYNFTVEYLKGKDNHVADALSRITIKELKDITGNILKVTT 752
Query: 1115 EFQ 1117
FQ
Sbjct: 753 RFQ 755
Score = 91.7 bits (226), Expect = 2e-17
Identities = 80/321 (24%), Positives = 138/321 (42%), Gaps = 32/321 (9%)
Query: 1211 HPGSNKMYQDLREFYWWEGMKRDVANFVSKCLVCQQVKA-EHQKPAGLLQPIEIPKWKWE 1269
H G K ++ Y+W+ M + + +V KC CQ+ K +H K + E P+ ++
Sbjct: 909 HTGITKTLAKVKRHYYWKNMSKYIKEYVRKCQKCQKAKTTKHTKTPMTIT--ETPEHAFD 966
Query: 1270 GIAMDFVTGLPRTQKGFDSVWVIIDRLTKSAHFLPVKTTYTASQYAKIYLEEIVSLHGVP 1329
+ +D + LP+++ G + +I LTK +P+ +A AK E + +G
Sbjct: 967 RVVVDTIGPLPKSENGNEYAVTLICDLTKYLVAIPIANK-SAKTVAKAIFESFILKYGPM 1025
Query: 1330 ISIISDRGAQFTAQFWKSFQAALGTRLNLSTAFHPQTDGQSERTIQILEDMLRACVLDLG 1389
+ I+D G ++ L + STA H QT G ER+ + L + +R+ +
Sbjct: 1026 KTFITDMGTEYKNSIITDLCKYLKIKNITSTAHHHQTVGVVERSHRTLNEYIRSYISTDK 1085
Query: 1390 GSWDRYLPMMEFAYNNSYQSSIQMAPFEALYGRRCRSPIGWFEVGEAKLVGPELIQDAID 1449
WD +L + +N + P+E ++GR P + KL E I + D
Sbjct: 1086 TDWDVWLQYFVYCFNTTQSMVHNYCPYELVFGRTSNLPKHF-----NKLHSIEPIYNIDD 1140
Query: 1450 KVKLIRDRLVTAQSR-----------QKSYSDKRRRPLEFTVGEHVFLRVSPMKGVLRFG 1498
K + RL A +R K D + + +E VG+ V LR
Sbjct: 1141 YAKESKYRLEVAYARARKLLEAHKEKNKENYDLKVKDIELEVGDKVLLR----------N 1190
Query: 1499 KKG-KLTPRFIGPFEILERVG 1518
+ G KL ++ GP++I E +G
Sbjct: 1191 EVGHKLDFKYTGPYKI-ESIG 1210
>POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 194 bits (494), Expect = 1e-48
Identities = 168/638 (26%), Positives = 286/638 (44%), Gaps = 41/638 (6%)
Query: 498 DAHVLFDPGATYSFVSLYFAPRLGKSSSFLDETLVVTTPVGNNVLAKSVYYSCDVSIEGK 557
+ H D GA+ S + P ++ + ++V G+++ V D+ I G+
Sbjct: 39 ELHCFVDTGASLCIASKFVIPEEHWVNA--ERPIMVKIADGSSITISKVCKDIDLIIAGE 96
Query: 558 VLPADLVVLNMVDFDVILGMDWLSLHHATVDCHNKVVKFKP---PGEATFSFQGERSWVP 614
+ V D I+G ++ L+ + ++V+ K P T + R +
Sbjct: 97 IFKIPTVYQQESGIDFIIGNNFCQLYEPFIQFTDRVIFTKNKSYPVHITKLTRAVRVGIE 156
Query: 615 NNLISSLRANKL------------LSRGCQGYLALVRDVQAGEEKL----ENVPIVCEFS 658
L S + +K + + L + EEKL + + + E
Sbjct: 157 GFLESMKKRSKTQQPEPVNISTNKIENPLEEIAILSEGRRLSEEKLFITQQRMQKIEELL 216
Query: 659 DVFPEELPGIPPDRE--IEFSIDLIPGTQPISIPPYRMAPAELKELREQLQDLLDKGFIR 716
+ E P P + ++ SI L ++ I + P + +P + +E +Q+++LLD I+
Sbjct: 217 EKVCSENPLDPNKTKQWMKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIK 276
Query: 717 ASTSPWGAPVLFV----KKKDGSMRLCVDYRQLNKVTIKNKYPLPRIDELFDQLQGAQCF 772
S SP AP V +K+ G R+ V+Y+ +NK TI + Y LP DEL ++G + F
Sbjct: 277 PSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATIGDAYNLPNKDELLTLIRGKKIF 336
Query: 773 SKIDLRSGYHQLKIKSEDISKTAFRTRYGHYEFLVMSFGLTNAPAAFMDLMNRVFKPFLD 832
S D +SG+ Q+ + E TAF GHYE+ V+ FGL AP+ F M+ F+ F
Sbjct: 337 SSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVF-R 395
Query: 833 RFVIVFIDDILIYSKSREEHEQHLRLVLQTLREKQLYAKFSKCEFWLESVAFLGHIVSKN 892
+F V++DDIL++S + E+H H+ ++LQ + + K + + + + FLG + +
Sbjct: 396 KFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEG 455
Query: 893 GISVDPSKVEAVQNWPRP-TSVKEIRSFLGLAGYYRRFVKDFSKLAFPLTRLTQKKVEFK 951
+E + +P K+++ FLG+ Y ++ +++ PL ++ V +K
Sbjct: 456 THKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWK 515
Query: 952 WTDACEESFQKLKECLISAPILALPTSGGGYVVYCDASRVGLGCVL----MQHGK----V 1003
WT QK+K+ L P L P ++ DAS G +L + G +
Sbjct: 516 WTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELI 575
Query: 1004 IAYASRQLKRHEQNYPTHDLEMAAVIFALKIWRHYLYGETCEIYTDH---KSLKYIFEQR 1060
YAS K E+NY ++D E AVI +K + YL I TD+ KS + +
Sbjct: 576 CRYASGSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKG 635
Query: 1061 DLNL-RQRRWMELLKDYDCTILYHPGKANVVADALSRK 1097
D L R RW L Y + + G N AD LSR+
Sbjct: 636 DSKLGRNIRWQAWLSHYSFDVEHIKGTDNHFADFLSRE 673
>POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 191 bits (486), Expect = 1e-47
Identities = 165/639 (25%), Positives = 284/639 (43%), Gaps = 43/639 (6%)
Query: 498 DAHVLFDPGATYSFVSLYFAPRLGKSSSFLDETLVVTTPVGNNVLAKSVYYSCDVSIEGK 557
+ H D GA+ S + P ++ + ++V G+++ V D+ I G+
Sbjct: 39 ELHCFVDTGASLCIASKFVIPEEHWVNA--ERPIMVKIADGSSITISKVCKDIDLIIAGE 96
Query: 558 VLPADLVVLNMVDFDVILGMDWLSLHHATVDCHNKVVKFKPPG----------------E 601
+ V D I+G ++ L+ + ++V+ K E
Sbjct: 97 IFRIPTVYQQESGIDFIIGNNFCQLYEPFIQFTDRVIFTKNKSYPVHIAKLTRAVRVGTE 156
Query: 602 ATFSFQGERSWVPNNLISSLRANKLLSRGCQGYLALVRDVQAGEEKL----ENVPIVCEF 657
+RS ++ NK+ + L + EEKL + + + E
Sbjct: 157 GFLESMKKRSKTQQPEPVNISTNKI-ENPLEEIAILSEGRRLSEEKLFITQQRMQKIEEL 215
Query: 658 SDVFPEELPGIPPDRE--IEFSIDLIPGTQPISIPPYRMAPAELKELREQLQDLLDKGFI 715
+ E P P + ++ SI L ++ I + P + +P + +E +Q+++LLD I
Sbjct: 216 LEKVCSENPLDPNKTKQWMKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVI 275
Query: 716 RASTSPWGAPVLFV----KKKDGSMRLCVDYRQLNKVTIKNKYPLPRIDELFDQLQGAQC 771
+ S SP AP V +K+ G R+ V+Y+ +NK T+ + Y LP DEL ++G +
Sbjct: 276 KPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATVGDAYNLPNKDELLTLIRGKKI 335
Query: 772 FSKIDLRSGYHQLKIKSEDISKTAFRTRYGHYEFLVMSFGLTNAPAAFMDLMNRVFKPFL 831
FS D +SG+ Q+ + E TAF GHYE+ V+ FGL AP+ F M+ F+ F
Sbjct: 336 FSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVF- 394
Query: 832 DRFVIVFIDDILIYSKSREEHEQHLRLVLQTLREKQLYAKFSKCEFWLESVAFLGHIVSK 891
+F V++DDIL++S + E+H H+ ++LQ + + K + + + + FLG + +
Sbjct: 395 RKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDE 454
Query: 892 NGISVDPSKVEAVQNWPRP-TSVKEIRSFLGLAGYYRRFVKDFSKLAFPLTRLTQKKVEF 950
+E + +P K+++ FLG+ Y ++ +++ PL ++ V +
Sbjct: 455 GTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPW 514
Query: 951 KWTDACEESFQKLKECLISAPILALPTSGGGYVVYCDASRVGLGCVL----MQHGK---- 1002
+WT QK+K+ L P L P ++ DAS G +L + G
Sbjct: 515 RWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTEL 574
Query: 1003 VIAYASRQLKRHEQNYPTHDLEMAAVIFALKIWRHYLYGETCEIYTDH---KSLKYIFEQ 1059
+ YAS K E+NY ++D E AVI +K + YL I TD+ KS + +
Sbjct: 575 ICRYASGSFKAAEKNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYK 634
Query: 1060 RDLNL-RQRRWMELLKDYDCTILYHPGKANVVADALSRK 1097
D L R RW L Y + + G N AD LSR+
Sbjct: 635 GDSKLGRNIRWQAWLSHYSFDVEHIKGTDNHFADFLSRE 673
>POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 674
Score = 191 bits (485), Expect = 1e-47
Identities = 168/641 (26%), Positives = 286/641 (44%), Gaps = 54/641 (8%)
Query: 498 DAHVLFDPGATYSFVSLYFAPRLGKSSSFLDETLVVTTPVGNNVLAKSVYYSCDVSIEGK 557
+ H D GA+ S + P ++ + ++V G+++ V D+ I G+
Sbjct: 41 ELHCFVDTGASLCIASKFVIPEEHWINA--ERPIMVKIADGSSITINKVCRDIDLIIAGE 98
Query: 558 VLPADLVVLNMVDFDVILGMDWLSLHHATVDCHNKVVKFKPP---------------GEA 602
+ V D I+G ++ L+ + ++V+ K G
Sbjct: 99 IFHIPTVYQQESGIDFIIGNNFCQLYEPFIQFTDRVIFTKDRTYPVHIAKLTRAVRVGTE 158
Query: 603 TF---------SFQGERSWVPNNLISSLRANKLLSRGCQGYLALVRDVQAGEEKLENVPI 653
F + Q E + N I+ L + LS + + +Q EE LE V
Sbjct: 159 GFLESMKKRSKTQQPEPVNISTNKIAILSEGRRLSE--EKLFITQQRMQKIEELLEKV-- 214
Query: 654 VCEFSDVFPEELPGIPPDREIEFSIDLIPGTQPISIPPYRMAPAELKELREQLQDLLDKG 713
C + + P + + ++ SI L ++ I + P + +P + +E +Q+++LLD
Sbjct: 215 -CSENPLDPNKTK-----QWMKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLK 268
Query: 714 FIRASTSPWGAPVLFV----KKKDGSMRLCVDYRQLNKVTIKNKYPLPRIDELFDQLQGA 769
I+ S SP AP V +K+ G R+ V+Y+ +NK T+ + Y P DEL ++G
Sbjct: 269 VIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATVGDAYNPPNKDELLTLIRGK 328
Query: 770 QCFSKIDLRSGYHQLKIKSEDISKTAFRTRYGHYEFLVMSFGLTNAPAAFMDLMNRVFKP 829
+ FS D +SG+ Q+ + E TAF GHYE+ V+ FGL AP+ F M+ F+
Sbjct: 329 KIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFRV 388
Query: 830 FLDRFVIVFIDDILIYSKSREEHEQHLRLVLQTLREKQLYAKFSKCEFWLESVAFLGHIV 889
F +F V++DDIL++S + E+H H+ ++LQ + + K + + + + FLG +
Sbjct: 389 F-RKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEI 447
Query: 890 SKNGISVDPSKVEAVQNWPRP-TSVKEIRSFLGLAGYYRRFVKDFSKLAFPLTRLTQKKV 948
+ +E + +P K+++ FLG+ Y ++ +++ PL ++ V
Sbjct: 448 DEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENV 507
Query: 949 EFKWTDACEESFQKLKECLISAPILALPTSGGGYVVYCDASRVGLGCVL----MQHGK-- 1002
+KWT QK+K+ L P L P ++ DAS G +L + G
Sbjct: 508 PWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNT 567
Query: 1003 --VIAYASRQLKRHEQNYPTHDLEMAAVIFALKIWRHYLYGETCEIYTDH---KSLKYIF 1057
+ YAS K E+NY ++D E AVI +K + YL I TD+ KS +
Sbjct: 568 ELICRYASGSFKAAEKNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLN 627
Query: 1058 EQRDLNL-RQRRWMELLKDYDCTILYHPGKANVVADALSRK 1097
+ D L R RW L Y + + G N AD LSR+
Sbjct: 628 YKGDSKLGRNIRWQAWLSHYSFDVEHIKGTDNHFADFLSRE 668
>POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 190 bits (482), Expect = 3e-47
Identities = 166/639 (25%), Positives = 283/639 (43%), Gaps = 43/639 (6%)
Query: 498 DAHVLFDPGATYSFVSLYFAPRLGKSSSFLDETLVVTTPVGNNVLAKSVYYSCDVSIEGK 557
+ H D GA+ S + P ++ + ++V G+++ V D+ I +
Sbjct: 39 ELHCFVDTGASLCIASKFVIPEEHWVNA--ERPIMVKIADGSSITISKVCKDIDLIIARE 96
Query: 558 VLPADLVVLNMVDFDVILGMDWLSLHHATVDCHNKVVKFKPPG----------------E 601
+ V D I+G ++ L+ + ++V+ K E
Sbjct: 97 IFKIPTVYQQESGIDFIIGNNFCQLYEPFIQFTDRVIFTKNKSYPVHIAKLTRAVRVGTE 156
Query: 602 ATFSFQGERSWVPNNLISSLRANKLLSRGCQGYLALVRDVQAGEEKL----ENVPIVCEF 657
+RS ++ NK+ + L + EEKL + + + E
Sbjct: 157 GFLESMKKRSKTQQPEPVNISTNKI-ENPLKEIAILSEGRRLSEEKLFITQQRMQKIEEL 215
Query: 658 SDVFPEELPGIPPDRE--IEFSIDLIPGTQPISIPPYRMAPAELKELREQLQDLLDKGFI 715
+ E P P + ++ SI L ++ I + P + +P + +E +Q+++LLD I
Sbjct: 216 LEKVCSENPLDPNKTKQWMKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVI 275
Query: 716 RASTSPWGAPVLFV----KKKDGSMRLCVDYRQLNKVTIKNKYPLPRIDELFDQLQGAQC 771
+ S SP AP V +K+ G R+ V+Y+ +NK TI + Y LP DEL ++G +
Sbjct: 276 KPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATIGDAYNLPNKDELLTLIRGKKI 335
Query: 772 FSKIDLRSGYHQLKIKSEDISKTAFRTRYGHYEFLVMSFGLTNAPAAFMDLMNRVFKPFL 831
FS D +SG+ Q+ + E TAF GHYE+ V+ FGL AP+ F M+ F+ F
Sbjct: 336 FSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVF- 394
Query: 832 DRFVIVFIDDILIYSKSREEHEQHLRLVLQTLREKQLYAKFSKCEFWLESVAFLGHIVSK 891
+F V++DDIL++S + E+H H+ ++LQ + + K + + + + FLG + +
Sbjct: 395 RKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDE 454
Query: 892 NGISVDPSKVEAVQNWPRP-TSVKEIRSFLGLAGYYRRFVKDFSKLAFPLTRLTQKKVEF 950
+E + +P K+++ FLG+ Y ++ +++ PL ++ V +
Sbjct: 455 GTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPW 514
Query: 951 KWTDACEESFQKLKECLISAPILALPTSGGGYVVYCDASRVGLGCVL----MQHGK---- 1002
KWT QK+K+ L P L P ++ DAS G +L + G
Sbjct: 515 KWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTEL 574
Query: 1003 VIAYASRQLKRHEQNYPTHDLEMAAVIFALKIWRHYLYGETCEIYTDH---KSLKYIFEQ 1059
+ YAS K E+NY ++D E AVI +K + YL I TD+ KS + +
Sbjct: 575 ICRYASGSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYK 634
Query: 1060 RDLNL-RQRRWMELLKDYDCTILYHPGKANVVADALSRK 1097
D L R RW L Y + + G N AD LSR+
Sbjct: 635 GDSKLGRNIRWQAWLSHYSFDVEHIKGTDNHFADFLSRE 673
>POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 666
Score = 187 bits (475), Expect = 2e-46
Identities = 159/634 (25%), Positives = 275/634 (43%), Gaps = 50/634 (7%)
Query: 496 SQDAHVLFDPGATYSFVSLYFAPRLGKSSSFLDETLVVTTPVGNNVLAK--SVYYSCDVS 553
S + H D GA+ S Y P +S D + + N L K V + V
Sbjct: 46 SFNIHCYVDTGASLCIASRYIIPEELWENSPKD----IQVKIANQELIKITKVCKNLKVK 101
Query: 554 IEGKVLPADLVVLNMVDFDVILGMDWLSLHHATVDCHNKVVKFKPPGEATFSFQGERSWV 613
GK V D ++G ++ L++ + +++ F E + +++
Sbjct: 102 FAGKSFEIPTVYQQETGIDFLIGNNFCRLYNPFIQWEDRIA-FHLKNEMVLIKKVTKAFS 160
Query: 614 PNN--------------LISSLRANKLLSRGCQGYLALVRDVQAGEEKLENVPIVCEFSD 659
+N I +K + + Y + Q E+ L+ V C +
Sbjct: 161 VSNPSFLENMKKDSKTEQIPGTNISKNIINPEERYFLITEKYQKIEQLLDKV---CSENP 217
Query: 660 VFPEELPGIPPDREIEFSIDLIPGTQPISIPPYRMAPAELKELREQLQDLLDKGFIRAST 719
+ P I + ++ SI LI + I + P +P + + +Q+++LLD G I S
Sbjct: 218 IDP-----IKSKQWMKASIKLIDPLKVIRVKPMSYSPQDREGFAKQIKELLDLGLIIPSK 272
Query: 720 SPWGAPVLFVK----KKDGSMRLCVDYRQLNKVTIKNKYPLPRIDELFDQLQGAQCFSKI 775
S +P V+ ++ G R+ V+Y+ +N+ TI + + LP + EL L+G FS
Sbjct: 273 SQHMSPAFLVENEAERRRGKKRMVVNYKAINQATIGDSHNLPNMQELLTLLRGKSIFSSF 332
Query: 776 DLRSGYHQLKIKSEDISKTAFRTRYGHYEFLVMSFGLTNAPAAFMDLMNRVFKPFLDRFV 835
D +SG+ Q+ + E TAF GH+++ V+ FGL AP+ F M D+F
Sbjct: 333 DCKSGFWQVVLDEESQKLTAFTCPQGHFQWKVVPFGLKQAPSIFQRHMQTALNG-ADKFC 391
Query: 836 IVFIDDILIYSKSREEHEQHLRLVLQTLREKQLYAKFSKCEFWLESVAFLGHIVSKNGIS 895
+V++DDI+++S S +H H+ VL+ + + + K + E + FLG + K
Sbjct: 392 MVYVDDIIVFSNSELDHYNHVYAVLKIVEKYGIILSKKKANLFKEKINFLGLEIDKGTHC 451
Query: 896 VDPSKVEAVQNWP-RPTSVKEIRSFLGLAGYYRRFVKDFSKLAFPLTRLTQKKVEFKWTD 954
+E + +P R K ++ FLG+ Y ++ +++ PL +K V + WT
Sbjct: 452 PQNHILENIHKFPDRLEDKKHLQRFLGVLTYAETYIPKLAEIRKPLQVKLKKDVTWNWTQ 511
Query: 955 ACEESFQKLKECLISAPILALPTSGGGYVVYCDASRVGLGCVLMQHG-----KVIAYASR 1009
+ + +K+K+ L S P L LP ++ DAS G VL + Y+S
Sbjct: 512 SDSDYVKKIKKNLGSFPKLYLPKPEDHLIIETDASDSFWGGVLKARALDGVELICRYSSG 571
Query: 1010 QLKRHEQNYPTHDLEMAAVIFALKIWRHYLYGETCEIYTDHKSLKYIFEQRDLNL----- 1064
K+ E+NY ++D E+ AV + + YL + TD+K+ Y +NL
Sbjct: 572 SFKQAEKNYHSNDKELLAVKQVITKFSAYLTPVRFTVRTDNKNFTYFLR---INLKGDSK 628
Query: 1065 --RQRRWMELLKDYDCTILYHPGKANVVADALSR 1096
R RW Y + + G NV+AD L+R
Sbjct: 629 QGRLVRWQNWFSKYQFDVEHLEGVKNVLADCLTR 662
>POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 680
Score = 186 bits (473), Expect = 3e-46
Identities = 165/646 (25%), Positives = 284/646 (43%), Gaps = 57/646 (8%)
Query: 498 DAHVLFDPGATYSFVSLYFAPRLGKSSSFLDETLVVTTPVGNNVLAKSVYYSCDVSIEGK 557
+ H D GA+ S + P ++ + ++V G+++ V D+ I G
Sbjct: 40 ELHCFVDTGASLCIASKFVIPEEHWVNA--ERPIMVKIADGSSITISKVCKDIDLIIVGV 97
Query: 558 VLPADLVVLNMVDFDVILGMDWLSLHHATVDCHNKVVKFKPPG----------------E 601
+ V D I+G ++ L+ + ++V+ K E
Sbjct: 98 IFKIPTVYQQESGIDFIIGNNFCQLYEPFIQFTDRVIFTKNKSYPVHIAKLTRAVRVGTE 157
Query: 602 ATFSFQGERSWVP---------NNLISSLRANKLLSRGC----QGYLALVRDVQAGEEKL 648
+RS N + + L +LS G + + +Q EE L
Sbjct: 158 GFLESMKKRSKTQQPEPVNISTNKIENPLEEIAILSEGRRLSEEKLFITQQRMQKTEELL 217
Query: 649 ENVPIVCEFSDVFPEELPGIPPDREIEFSIDLIPGTQPISIPPYRMAPAELKELREQLQD 708
E V C + + P + + ++ SI L ++ I + P + +P + +E +Q+++
Sbjct: 218 EKV---CSENPLDPNKTK-----QWMKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKE 269
Query: 709 LLDKGFIRASTSPWGAPVLFVKKKD----GSMRLCVDYRQLNKVTIKNKYPLPRIDELFD 764
LLD I+ S SP AP V + G+ R+ V+Y+ +NK T+ + Y LP DEL
Sbjct: 270 LLDLKVIKPSKSPHMAPAFLVNNEAENGRGNKRMVVNYKAMNKATVGDAYNLPNKDELLT 329
Query: 765 QLQGAQCFSKIDLRSGYHQLKIKSEDISKTAFRTRYGHYEFLVMSFGLTNAPAAFMDLMN 824
++G + FS D +SG+ Q+ + E TAF GHYE+ V+ FGL AP+ F M+
Sbjct: 330 LIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMD 389
Query: 825 RVFKPFLDRFVIVFIDDILIYSKSREEHEQHLRLVLQTLREKQLYAKFSKCEFWLESVAF 884
F+ F +F V++DDI+++S + E+H H+ ++LQ + + K + + + + F
Sbjct: 390 EAFRVF-RKFCCVYVDDIVVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINF 448
Query: 885 LGHIVSKNGISVDPSKVEAVQNWPRP-TSVKEIRSFLGLAGYYRRFVKDFSKLAFPLTRL 943
LG + + +E + +P K+++ FLG+ Y ++ + +++ PL
Sbjct: 449 LGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPNLAQMRQPLQAK 508
Query: 944 TQKKVEFKWTDACEESFQKLKECLISAPILALPTSGGGYVVYCDASRVGLGCVL----MQ 999
++ V +KWT QK+K+ L P L P ++ DAS G +L +
Sbjct: 509 LKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKIN 568
Query: 1000 HGK----VIAYASRQLKRHEQNYPTHDLEMAAVIFALKIWRHYLYGETCEIYTDH---KS 1052
G + Y S K E+NY ++D E AVI +K + YL I TD+ KS
Sbjct: 569 EGTNTELICRYRSGSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKS 628
Query: 1053 LKYIFEQRDLNL-RQRRWMELLKDYDCTILYHPGKANVVADALSRK 1097
+ + D L R RW L Y + + G N AD LSR+
Sbjct: 629 FVNLNYKGDSKLGRNIRWQAWLSHYSFDVEHIKGTDNHFADFLSRE 674
>POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 659
Score = 170 bits (431), Expect = 3e-41
Identities = 120/440 (27%), Positives = 208/440 (47%), Gaps = 20/440 (4%)
Query: 677 SIDLIPGTQPISIPPYRMAPAELKELREQLQDLLDKGFIRASTSPWGAPVLFVK----KK 732
+I+LI + + P +P++ +E Q+++LL+ I+ S S +P V+ ++
Sbjct: 219 TIELIDPKTVVKVKPMSYSPSDREEFDRQIKELLELKVIKPSKSTHMSPAFLVENEAERR 278
Query: 733 DGSMRLCVDYRQLNKVTIKNKYPLPRIDELFDQLQGAQCFSKIDLRSGYHQLKIKSEDIS 792
G R+ V+Y+ +NK T + + LP DEL ++G + +S D +SG Q+ + E
Sbjct: 279 RGKKRMVVNYKAMNKATKGDAHNLPNKDELLTLVRGKKIYSSFDCKSGLWQVLLDKESQL 338
Query: 793 KTAFRTRYGHYEFLVMSFGLTNAPAAFMDLMNRVFKPFLDRFVIVFIDDILIYSKS-REE 851
TAF GHY++ V+ FGL AP+ F ++ V++DDIL++S + R+E
Sbjct: 339 LTAFTCPQGHYQWNVVPFGLKQAPSIFPKTYANSHSNQYSKYCCVYVDDILVFSNTGRKE 398
Query: 852 HEQHLRLVLQTLREKQLYAKFSKCEFWLESVAFLGHIVSKNGISVDPSKVEAVQNWP-RP 910
H H+ +L+ + + K + + E + FLG + + +E + +P R
Sbjct: 399 HYIHVLNILRRCEKLGIILSKKKAQLFKEKINFLGLEIDQGTHCPQNHILEHIHKFPDRI 458
Query: 911 TSVKEIRSFLGLAGYYRRFVKDFSKLAFPLTRLTQKKVEFKWTDACEESFQKLKECLISA 970
K+++ FLG+ Y ++ + + PL ++ + W D + K+K+ L S
Sbjct: 459 EDKKQLQRFLGILTYASDYIPKLASIRKPLQSKLKEDSTWTWNDTDSQYMAKIKKNLKSF 518
Query: 971 PILALPTSGGGYVVYCDASRVGLGCVLM----QHGKVIAYASRQLKRHEQNYPTHDLEMA 1026
P L P V+ DAS G +L H + YAS K E+NY +++ E+
Sbjct: 519 PKLYHPEPNDKLVIETDASEEFWGGILKAIHNSHEYICRYASGSFKAAERNYHSNEKELL 578
Query: 1027 AVIFALKIWRHYLYGETCEIYTDHKSLKYIFEQRDLNL-------RQRRWMELLKDYDCT 1079
AVI +K + YL I TD+K+ + ++NL R RW L YD
Sbjct: 579 AVIRVIKKFSIYLTPSRFLIRTDNKNFTHFV---NINLKGDRKQGRLVRWQMWLSQYDFD 635
Query: 1080 ILYHPGKANVVADALSRKSM 1099
+ + G NV AD L ++
Sbjct: 636 VEHIAGTKNVFADFLQENTL 655
>POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;
Protease (EC 3.4.23.-); Reverse transcriptase (EC
2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
Length = 1886
Score = 164 bits (414), Expect = 2e-39
Identities = 128/456 (28%), Positives = 224/456 (49%), Gaps = 34/456 (7%)
Query: 671 DREIEFSIDLI-PGTQPISIPPYRMAPAELKELREQLQDLLDKGFIRASTSPWGAPVLFV 729
+ +I+ +++I P + + P + P + + + Q+ LL IR S S + V
Sbjct: 1386 NNKIKCKLNIINPDIKIMGRPIKHVTPGDEEAMTRQINLLLQMKVIRPSESKHRSTAFIV 1445
Query: 730 -----------KKKDGSMRLCVDYRQLNKVTIKNKYPLPRIDELFDQLQGAQCFSKIDLR 778
K+K G R+ +Y+ LN+ T ++Y LP I+ + ++ ++ +SK DL+
Sbjct: 1446 RSGTEIDPITGKEKKGKERMVFNYKLLNENTESDQYSLPGINTIISKVGRSKIYSKFDLK 1505
Query: 779 SGYHQLKIKSEDISKTAFRTRYGHYEFLVMSFGLTNAPAAFMDLMNRVFKPFLDRFVIVF 838
SG+ Q+ ++ E + TAF YE+LVM FGL NAPA F M+ VFK ++F+ V+
Sbjct: 1506 SGFWQVAMEEESVPWTAFLAGNKLYEWLVMPFGLKNAPAIFQRKMDNVFKG-TEKFIAVY 1564
Query: 839 IDDILIYSKSREEHEQHLRLVLQTLREKQLYAKFSKCEFWLESVAFLGHIVSKNGISVDP 898
IDDIL++S++ E+H QHL +LQ +E L +K + + FLG + I + P
Sbjct: 1565 IDDILVFSETAEQHSQHLYTMLQLCKENGLILSPTKMKIGTPEIDFLGASLGCTKIKLQP 1624
Query: 899 SKVEAVQNW--PRPTSVKEIRSFLGLAGYYRRFVKDFSKLAFPLTRLTQKKVEFKWTDAC 956
+ + ++ + + + +RS+LG+ Y R +++D KL PL + + +
Sbjct: 1625 HIISKICDFSDEKLATPEGMRSWLGILSYARNYIQDIGKLVQPLRQKMAPTGDKRMNPET 1684
Query: 957 EESFQKLKECLISAPILALPTSGGGYVVYCDASRVGLGCV----LMQHG-----KVIAYA 1007
+ +++KE + + P L LP ++ D G G V + +H ++ AYA
Sbjct: 1685 WKMVRQIKEKVKNLPDLQLPPKDSFIIIETDGCMTGWGAVCKWKMSKHDPRSTERICAYA 1744
Query: 1008 SRQLKRHEQNYPTHDLEMAAVIFAL-KIWRHYLYGETCEIYTDHKSLKYIFEQRDLNLRQ 1066
S + T D E+ A I L K +YL + I +D +++ + + + N
Sbjct: 1745 SGSFNPIKS---TIDAEIQAAIHGLDKFKIYYLDKKELIIRSDCEAIIKFYNKTNENKPS 1801
Query: 1067 R-RWM---ELLKDYDCTILYH--PGKANVVADALSR 1096
R RW+ + L T+ + GK N +ADALSR
Sbjct: 1802 RVRWLTFSDFLTGLGITVTFEHIDGKHNGLADALSR 1837
>POL_BAEVM (P10272) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1189
Score = 153 bits (387), Expect = 3e-36
Identities = 119/460 (25%), Positives = 209/460 (44%), Gaps = 27/460 (5%)
Query: 645 EEKLENVPIVCEFSDVFPEELPGIPPDR--------EIEFSIDLIPGTQPISIPPYRMAP 696
E +L ++P+ DV+ ++ P + + IDL P P+SI Y M+
Sbjct: 127 EHRLFDIPVTTSLPDVWLQDFPQAWAETGGLGRAKCQAPIIIDLKPTAVPVSIKQYPMSL 186
Query: 697 AELKELREQLQDLLDKGFIRASTSPWGAPVLFVKKKDGS-MRLCVDYRQLNKVTIKNKYP 755
+R+ + L+ G +R SPW P+L VKK R D R++NK T+
Sbjct: 187 EAHMGIRQHIIKFLELGVLRPCRSPWNTPLLPVKKPGTQDYRPVQDLREINKRTVDIHPT 246
Query: 756 LPRIDELFDQLQ-GAQCFSKIDLRSGYHQLKIKSEDISKTAFRTR------YGHYEFLVM 808
+P L L+ ++ +DL+ + L + + AF + G + +
Sbjct: 247 VPNPYNLLSTLKPDYSWYTVLDLKDAFFCLPLAPQSQELFAFEWKDPERGISGQLTWTRL 306
Query: 809 SFGLTNAPAAFMDLMNRVFKPFLDRF----VIVFIDDILIYSKSREEHEQHLRLVLQTLR 864
G N+P F + ++R F + ++ ++DD+L+ + +++ Q R +LQ L
Sbjct: 307 PQGFKNSPTLFDEALHRDLTDFRTQHPEVTLLQYVDDLLLAAPTKKACTQGTRHLLQELG 366
Query: 865 EKQLYAKFSKCEFWLESVAFLGHIVSKNGISVDPSKVEAVQNWPRPTSVKEIRSFLGLAG 924
EK A K + V +LG+I+S+ + P ++E V P P + +E+R FLG AG
Sbjct: 367 EKGYRASAKKAQICQTKVTYLGYILSEGKRWLTPGRIETVARIPPPRNPREVREFLGTAG 426
Query: 925 YYRRFVKDFSKLAFPLTRLTQKKVEFKWTDACEESFQKLKECLISAPILALPTSGGGYVV 984
+ R ++ F++LA PL LT++ F W + +F+ LK+ L+SAP L LP + + +
Sbjct: 427 FCRLWIPGFAELAAPLYALTKESTPFTWQTEHQLAFEALKKALLSAPALGLPDTSKPFTL 486
Query: 985 YCDASR-VGLGCVLMQHG---KVIAYASRQLKRHEQNYPTHDLEMAAVIFALKIWRHYLY 1040
+ D + + G + + G + +AY S++L +P MAA +K
Sbjct: 487 FLDERQGIAKGVLTQKLGPWKRPVAYLSKKLDPVAAGWPPCLRIMAATAMLVKDSAKLTL 546
Query: 1041 GETCEIYTDHKSLKYIFEQRD---LNLRQRRWMELLKDYD 1077
G+ + T H + + D N R + LL D D
Sbjct: 547 GQPLTVITPHTLEAIVRQPPDRWITNARLTHYQALLLDTD 586
Score = 70.5 bits (171), Expect = 4e-11
Identities = 59/190 (31%), Positives = 82/190 (43%), Gaps = 8/190 (4%)
Query: 1234 VANFVSKCLVCQQVKAEHQK-PAGLLQPIEIPKWKWEGIAMDFVTGLPRTQKGFDSVWVI 1292
+ S C VCQQV A + PAG P WE +DF P G+ + V
Sbjct: 873 IEQVTSACKVCQQVNAGATRVPAGKRTRGNRPGVYWE---IDFTEVKPH-YAGYKYLLVF 928
Query: 1293 IDRLTKSAHFLPVKTTYTASQYAKIYLEEIVSLHGVPISIISDRGAQFTAQFWKSFQAAL 1352
+D + P + TA AK LEEI G+P I SD G F +Q + L
Sbjct: 929 VDTFSGWVEAFPTRQE-TAHIVAKKILEEIFPRFGLPKVIGSDNGPAFVSQVSQGLARIL 987
Query: 1353 GTRLNLSTAFHPQTDGQSERTIQILEDMLRACVLDLG-GSWDRYLPMMEFAYNNSYQSSI 1411
G L A+ PQ+ GQ ER + +++ L L+ G W R L + N+ +
Sbjct: 988 GINWKLHCAYRPQSSGQVERMNRTIKETLTKLTLETGLKDWRRLLSLALLRARNT-PNRF 1046
Query: 1412 QMAPFEALYG 1421
+ P+E LYG
Sbjct: 1047 GLTPYEILYG 1056
>POL_SOCMV (P15629) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 692
Score = 152 bits (385), Expect = 6e-36
Identities = 105/337 (31%), Positives = 176/337 (52%), Gaps = 17/337 (5%)
Query: 698 ELKELREQLQDLLDKGFIRASTSPWGAPVLFVKK----KDGSMRLCVDYRQLNKVTIKNK 753
+++E +E+ +DLL KG IR S SP AP +V+ K G R+ ++Y+++N+ TI +
Sbjct: 215 DVQEFKEECEDLLKKGLIRESQSPHSAPAFYVENHNEIKRGKRRMVINYKKMNEATIGDS 274
Query: 754 YPLPRIDELFDQLQGAQCFSKIDLRSGYHQLKIKSEDISKTAFR-TRYGHYEFLVMSFGL 812
Y LPR D + ++++G+ FS +D +SGY+QL++ TAF HYE+ V+SFGL
Sbjct: 275 YKLPRKDFILEKIKGSLWFSSLDAKSGYYQLRLHENTKPLTAFSCPPQKHYEWNVLSFGL 334
Query: 813 TNAPAAFMDLMNRVFKPFLDRFVIVFIDDILIYSK-SREEHEQHLRLVLQTLREKQLYAK 871
AP+ + M++ K L+ + +IDDILI++K S+E+H +R+VLQ ++EK +
Sbjct: 335 KQAPSIYQRFMDQSLKG-LEHICLAYIDDILIFTKGSKEQHVNDVRIVLQRIKEKGIIIS 393
Query: 872 FSKCEFWLESVAFLGHIVSKNG-ISVDPSKVEAVQNWP-RPTSVKEIRSFLGLAGYYRR- 928
K + + + +LG + NG I + P E + +P K+I+ FLG Y
Sbjct: 394 KKKSKLIQQEIEYLGLKIQGNGEIDLSPHTQEKILQFPDELEDRKQIQRFLGCINYIANE 453
Query: 929 -FVKDFSKLAFPLTRLTQKKVEFKWTDACEESFQKLKECLISAPILALPTSGGGYVVYCD 987
F K+ + L + K +KW + Q +K + S P L + +V D
Sbjct: 454 GFFKNLALERKHLQKKISVKNPWKWDTIDTKMVQSIKGKIQSLPKLYNASIQDFLIVETD 513
Query: 988 ASRVG-LGCVLMQHGKVIAYASRQLKRHEQNYPTHDL 1023
AS+ GC+ + + +++ E PT DL
Sbjct: 514 ASQHSWSGCL-----RALPKGKQKIGLDEFGIPTADL 545
>RRPO_OENBE (P31843) RNA-directed DNA polymerase homolog (Reverse
transcriptase homolog)
Length = 142
Score = 151 bits (381), Expect = 2e-35
Identities = 74/131 (56%), Positives = 97/131 (73%), Gaps = 3/131 (2%)
Query: 735 SMRLCVDYRQLNKVTIKNKYPLPRIDELFDQLQGAQCFSKIDLRSGYHQLKIKSEDISKT 794
S+R+C+DYR L KVTIKNKYP+PR+D+LFD+L A F+K+DLRSGY Q++I D KT
Sbjct: 5 SLRMCIDYRALTKVTIKNKYPIPRVDDLFDRLAQATWFTKLDLRSGYWQVRIAKGDEPKT 64
Query: 795 AFRTRYGHYEFLVMSFGLTNAPAAFMDLMNRVFKPFLDRFVIVFIDDIL---IYSKSREE 851
TRYG +EF VM FGLTNA A F +LMN V +LD FV+V++DD++ IYS S E
Sbjct: 65 TCVTRYGSFEFRVMPFGLTNALATFCNLMNNVLYEYLDHFVVVYLDDLVVYTIYSNSLHE 124
Query: 852 HEQHLRLVLQT 862
H +HLR+V ++
Sbjct: 125 HIKHLRVVRES 135
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.320 0.136 0.409
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 194,160,760
Number of Sequences: 164201
Number of extensions: 8588835
Number of successful extensions: 26554
Number of sequences better than 10.0: 214
Number of HSP's better than 10.0 without gapping: 120
Number of HSP's successfully gapped in prelim test: 96
Number of HSP's that attempted gapping in prelim test: 25444
Number of HSP's gapped (non-prelim): 633
length of query: 1621
length of database: 59,974,054
effective HSP length: 124
effective length of query: 1497
effective length of database: 39,613,130
effective search space: 59300855610
effective search space used: 59300855610
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 73 (32.7 bits)