
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC121241.13 - phase: 0 /pseudo
(869 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E Value
POL3_DROME (P04323) Retrovirus-related Pol polyprotein from tran... 289 3e-77
POL2_DROME (P20825) Retrovirus-related Pol polyprotein from tran... 280 2e-74
POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from tran... 249 3e-65
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei... 238 7e-62
RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protei... 235 4e-61
RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protei... 235 4e-61
POLY_DROME (P10401) Retrovirus-related Pol polyprotein from tran... 233 2e-60
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III 225 3e-58
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran... 217 1e-55
POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic pro... 157 1e-37
POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic pro... 157 1e-37
POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic pro... 157 1e-37
POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic prot... 155 3e-37
POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic pro... 155 6e-37
POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic pro... 154 1e-36
POL_RTBVP (P27502) Polyprotein (P194 protein) [Contains: Coat pr... 149 3e-35
POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;... 149 3e-35
POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic prot... 138 6e-32
RRPO_OENBE (P31843) RNA-directed DNA polymerase homolog (Reverse... 136 2e-31
POL_SOCMV (P15629) Enzymatic polyprotein [Contains: Aspartic pro... 120 1e-26
>POL3_DROME (P04323) Retrovirus-related Pol polyprotein from
transposon 17.6 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1058
Score = 289 bits (739), Expect = 3e-77
Identities = 142/348 (40%), Positives = 220/348 (62%), Gaps = 1/348 (0%)
Query: 2 IDYRQLNKVTIKNRYPLPRIDDLMDQLVGARVFSKIDLSSGYHQIKVKDEDMQKTAFRTR 61
IDYR+LN++T+ +R+P+P +D+++ +L F+ IDL+ G+HQI++ E + KTAF T+
Sbjct: 266 IDYRKLNEITVGDRHPIPNMDEILGKLGRCNYFTTIDLAKGFHQIEMDPESVSKTAFSTK 325
Query: 62 YGHYEYKVMPFGVTNAPGVFMEYMNRIFHAYLDKFVVVFSDDILIYSKNEEEHAEHMKIV 121
+GHYEY MPFG+ NAP F MN I L+K +V+ DDI+++S + +EH + + +V
Sbjct: 326 HGHYEYLRMPFGLKNAPATFQRCMNDILRPLLNKHCLVYLDDIIVFSTSLDEHLQSLGLV 385
Query: 122 LQLLKEKKLYAKLSKCEFWLSEVSFLGHIISGSGIVVDPSKVDAVSQWETPKSVTEIRSF 181
+ L + L +L KCEF E +FLGH+++ GI +P K++A+ ++ P EI++F
Sbjct: 386 FEKLAKANLKLQLDKCEFLKQETTFLGHVLTPDGIKPNPEKIEAIQKYPIPTKPKEIKAF 445
Query: 182 LGLAGYYRRFIEGFSKLALPLTQLTCKG-KSFVWDAQCENSFNELKRRLTTAPILILPKP 240
LGL GYYR+FI F+ +A P+T+ K K + + +++F +LK ++ PIL +P
Sbjct: 446 LGLTGYYRKFIPNFADIAKPMTKCLKKNMKIDTTNPEYDSAFKKLKYLISEDPILKVPDF 505
Query: 241 DEPFVVYCDASKLGLGGVLMQDGKVVAYASRQLRIHEKNYPTHDLELAAVVFVLKIWRHY 300
+ F + DAS + LG VL QDG ++Y SR L HE NY T + EL A+V+ K +RHY
Sbjct: 506 TKKFTLTTDASDVALGAVLSQDGHPLSYISRTLNEHEINYSTIEKELLAIVWATKTFRHY 565
Query: 301 LYGS*FEVFSDHKSLKYLFDQKELNMRQRRWLELLKDCDFGLNYHPGK 348
L G FE+ SDH+ L +L+ K+ N + RW L + DF + Y GK
Sbjct: 566 LLGRHFEISSDHQPLSWLYRMKDPNSKLTRWRVKLSEFDFDIKYIKGK 613
>POL2_DROME (P20825) Retrovirus-related Pol polyprotein from
transposon 297 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1059
Score = 280 bits (715), Expect = 2e-74
Identities = 147/350 (42%), Positives = 214/350 (61%), Gaps = 5/350 (1%)
Query: 2 IDYRQLNKVTIKNRYPLPRIDDLMDQLVGARVFSKIDLSSGYHQIKVKDEDMQKTAFRTR 61
IDYR+LN++TI +RYP+P +D+++ +L + F+ IDL+ G+HQI++ +E + KTAF T+
Sbjct: 265 IDYRKLNEITIPDRYPIPNMDEILGKLGKCQYFTTIDLAKGFHQIEMDEESISKTAFSTK 324
Query: 62 YGHYEYKVMPFGVTNAPGVFMEYMNRIFHAYLDKFVVVFSDDILIYSKNEEEHAEHMKIV 121
GHYEY MPFG+ NAP F MN I L+K +V+ DDI+I+S + EH +++V
Sbjct: 325 SGHYEYLRMPFGLRNAPATFQRCMNNILRPLLNKHCLVYLDDIIIFSTSLTEHLNSIQLV 384
Query: 122 LQLLKEKKLYAKLSKCEFWLSEVSFLGHIISGSGIVVDPSKVDAVSQWETPKSVTEIRSF 181
L + L +L KCEF E +FLGHI++ GI +P KV A+ + P EIR+F
Sbjct: 385 FTKLADANLKLQLDKCEFLKKEANFLGHIVTPDGIKPNPIKVKAIVSYPIPTKDKEIRAF 444
Query: 182 LGLAGYYRRFIEGFSKLALPLTQLTCKGKSFVWDAQ---CENSFNELKRRLTTAPILILP 238
LGL GYYR+FI ++ +A P+T +C K D Q +F +LK + PIL LP
Sbjct: 445 LGLTGYYRKFIPNYADIAKPMT--SCLKKRTKIDTQKLEYIEAFEKLKALIIRDPILQLP 502
Query: 239 KPDEPFVVYCDASKLGLGGVLMQDGKVVAYASRQLRIHEKNYPTHDLELAAVVFVLKIWR 298
++ FV+ DAS L LG VL Q+G +++ SR L HE NY + EL A+V+ K +R
Sbjct: 503 DFEKKFVLTTDASNLALGAVLSQNGHPISFISRTLNDHELNYSAIEKELLAIVWATKTFR 562
Query: 299 HYLYGS*FEVFSDHKSLKYLFDQKELNMRQRRWLELLKDCDFGLNYHPGK 348
HYL G F + SDH+ L++L + KE + RW L + F ++Y GK
Sbjct: 563 HYLLGRQFLIASDHQPLRWLHNLKEPGAKLERWRVRLSEYQFKIDYIKGK 612
>POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from
transposon opus [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1003
Score = 249 bits (635), Expect = 3e-65
Identities = 133/363 (36%), Positives = 212/363 (57%), Gaps = 16/363 (4%)
Query: 2 IDYRQLNKVTIKNRYPLPRIDDLMDQLVGARVFSKIDLSSGYHQIKVKDEDMQKTAFRTR 61
+D+++LN VTI + YP+P I+ + L A+ F+ +DL+SG+HQI +K+ D+ KTAF T
Sbjct: 182 VDFKRLNTVTIPDTYPIPDINATLASLGNAKYFTTLDLTSGFHQIHMKESDIPKTAFSTL 241
Query: 62 YGHYEYKVMPFGVTNAPGVFMEYMNRIFHAYLDKFVVVFSDDILIYSKNEEEHAEHMKIV 121
G YE+ +PFG+ NAP +F ++ I ++ K V+ DDI+++S++ + H +++++V
Sbjct: 242 NGKYEFLRLPFGLKNAPAIFQRMIDDILREHIGKVCYVYIDDIIVFSEDYDTHWKNLRLV 301
Query: 122 LQLLKEKKLYAKLSKCEFWLSEVSFLGHIISGSGIVVDPSKVDAVSQWETPKSVTEIRSF 181
L L + L L K F ++V FLG+I++ GI DP KV A+S+ P SV E++ F
Sbjct: 302 LASLSKANLQVNLEKSHFLDTQVEFLGYIVTADGIKADPKKVRAISEMPPPTSVKELKRF 361
Query: 182 LGLAGYYRRFIEGFSKLALPLTQLT-----------CKGKSFVWDAQCENSFNELKRRLT 230
LG+ YYR+FI+ ++K+A PLT LT D SFN+LK L
Sbjct: 362 LGMTSYYRKFIQDYAKVAKPLTNLTRGLYANIKSSQSSKVPITLDETALQSFNDLKSILC 421
Query: 231 TAPILILPKPDEPFVVYCDASKLGLGGVLMQD----GKVVAYASRQLRIHEKNYPTHDLE 286
++ IL P +PF + DAS +G VL QD + +AY SR L E+NY T + E
Sbjct: 422 SSEILAFPCFTKPFHLTTDASNWAIGAVLSQDDQGRDRPIAYISRSLNKTEENYATIEKE 481
Query: 287 LAAVVFVLKIWRHYLYGS-*FEVFSDHKSLKYLFDQKELNMRQRRWLELLKDCDFGLNYH 345
+ A+++ L R YLYG+ +V++DH+ L + + N + +RW +++ + L Y
Sbjct: 482 MLAIIWSLDNLRAYLYGAGTIKVYTDHQPLTFALGNRNFNAKLKRWKARIEEYNCELIYK 541
Query: 346 PGK 348
PGK
Sbjct: 542 PGK 544
>RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protein
type 1
Length = 1333
Score = 238 bits (606), Expect = 7e-62
Identities = 135/355 (38%), Positives = 198/355 (55%), Gaps = 9/355 (2%)
Query: 2 IDYRQLNKVTIKNRYPLPRIDDLMDQLVGARVFSKIDLSSGYHQIKVKDEDMQKTAFRTR 61
+DY+ LNK N YPLP I+ L+ ++ G+ +F+K+DL S YH I+V+ D K AFR
Sbjct: 466 VDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCP 525
Query: 62 YGHYEYKVMPFGVTNAPGVFMEYMNRIFHAYLDKFVVVFSDDILIYSKNEEEHAEHMKIV 121
G +EY VMP+G++ AP F ++N I + VV + DDILI+SK+E EH +H+K V
Sbjct: 526 RGVFEYLVMPYGISTAPAHFQYFINTILGEAKESHVVCYMDDILIHSKSESEHVKHVKDV 585
Query: 122 LQLLKEKKLYAKLSKCEFWLSEVSFLGHIISGSGIVVDPSKVDAVSQWETPKSVTEIRSF 181
LQ LK L +KCEF S+V F+G+ IS G +D V QW+ PK+ E+R F
Sbjct: 586 LQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQF 645
Query: 182 LGLAGYYRRFIEGFSKLALPLTQLTCKGKSFVWDAQCENSFNELKRRLTTAPILILPKPD 241
LG Y R+FI S+L PL L K + W + +K+ L + P+L
Sbjct: 646 LGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFS 705
Query: 242 EPFVVYCDASKLGLGGVLMQ---DGKV--VAYASRQLRIHEKNYPTHDLELAAVVFVLKI 296
+ ++ DAS + +G VL Q D K V Y S ++ + NY D E+ A++ LK
Sbjct: 706 KKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKH 765
Query: 297 WRHYLYGS--*FEVFSDHKSL--KYLFDQKELNMRQRRWLELLKDCDFGLNYHPG 347
WRHYL + F++ +DH++L + + + N R RW L+D +F +NY PG
Sbjct: 766 WRHYLESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPG 820
Score = 124 bits (311), Expect = 1e-27
Identities = 75/235 (31%), Positives = 116/235 (48%), Gaps = 7/235 (2%)
Query: 564 QLAEIYIHNIVKLHGVPSSIVSDRNFRFTSRFWKSLQDALGSKLRVSSAYHPQADGHSER 623
Q A ++ ++ G P I++D + FTS+ WK ++ S Y PQ DG +ER
Sbjct: 1028 QTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTER 1087
Query: 624 TIQSLEDLLRVCVLEQGGAWDSHLPLIEFTYNNSYHSSIGMAPFETLYGRRCRTPLCWFE 683
T Q++E LLR W H+ L++ +YNN+ HS+ M PFE ++ R L E
Sbjct: 1088 TNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVH--RYSPALSPLE 1145
Query: 684 SGESVLLGPDLVHQTIEKVQMIREKMKASQSRQKSYHD-KRKKALEFQEGDHVFLRVTPM 742
+ +TI+ Q ++E + + + K Y D K ++ EFQ GD V ++
Sbjct: 1146 LPSFSDKTDENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVK---R 1202
Query: 743 TGVGRALKSKKLTPKFIGPYQISERVGTVAYRVGLPPHLSNL-HDVFHVSQLRKY 796
T G KS KL P F GP+ + ++ G Y + LP + ++ FHVS L KY
Sbjct: 1203 TKTGFLHKSNKLAPSFAGPFYVLQKSGPNNYELDLPDSIKHMFSSTFHVSHLEKY 1257
>RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protein
type 3
Length = 1333
Score = 235 bits (599), Expect = 4e-61
Identities = 134/355 (37%), Positives = 198/355 (55%), Gaps = 9/355 (2%)
Query: 2 IDYRQLNKVTIKNRYPLPRIDDLMDQLVGARVFSKIDLSSGYHQIKVKDEDMQKTAFRTR 61
+DY+ LNK N YPLP I+ L+ ++ G+ +F+K+DL S YH I+V+ D K AFR
Sbjct: 466 VDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCP 525
Query: 62 YGHYEYKVMPFGVTNAPGVFMEYMNRIFHAYLDKFVVVFSDDILIYSKNEEEHAEHMKIV 121
G +EY VMP+G++ AP F ++N I + VV + D+ILI+SK+E EH +H+K V
Sbjct: 526 RGVFEYLVMPYGISIAPAHFQYFINTILGEVKESHVVCYMDNILIHSKSESEHVKHVKDV 585
Query: 122 LQLLKEKKLYAKLSKCEFWLSEVSFLGHIISGSGIVVDPSKVDAVSQWETPKSVTEIRSF 181
LQ LK L +KCEF S+V F+G+ IS G +D V QW+ PK+ E+R F
Sbjct: 586 LQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQF 645
Query: 182 LGLAGYYRRFIEGFSKLALPLTQLTCKGKSFVWDAQCENSFNELKRRLTTAPILILPKPD 241
LG Y R+FI S+L PL L K + W + +K+ L + P+L
Sbjct: 646 LGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFS 705
Query: 242 EPFVVYCDASKLGLGGVLMQ---DGKV--VAYASRQLRIHEKNYPTHDLELAAVVFVLKI 296
+ ++ DAS + +G VL Q D K V Y S ++ + NY D E+ A++ LK
Sbjct: 706 KKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKH 765
Query: 297 WRHYLYGS--*FEVFSDHKSL--KYLFDQKELNMRQRRWLELLKDCDFGLNYHPG 347
WRHYL + F++ +DH++L + + + N R RW L+D +F +NY PG
Sbjct: 766 WRHYLESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPG 820
Score = 124 bits (311), Expect = 1e-27
Identities = 75/235 (31%), Positives = 116/235 (48%), Gaps = 7/235 (2%)
Query: 564 QLAEIYIHNIVKLHGVPSSIVSDRNFRFTSRFWKSLQDALGSKLRVSSAYHPQADGHSER 623
Q A ++ ++ G P I++D + FTS+ WK ++ S Y PQ DG +ER
Sbjct: 1028 QTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTER 1087
Query: 624 TIQSLEDLLRVCVLEQGGAWDSHLPLIEFTYNNSYHSSIGMAPFETLYGRRCRTPLCWFE 683
T Q++E LLR W H+ L++ +YNN+ HS+ M PFE ++ R L E
Sbjct: 1088 TNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVH--RYSPALSPLE 1145
Query: 684 SGESVLLGPDLVHQTIEKVQMIREKMKASQSRQKSYHD-KRKKALEFQEGDHVFLRVTPM 742
+ +TI+ Q ++E + + + K Y D K ++ EFQ GD V ++
Sbjct: 1146 LPSFSDKTDENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVK---R 1202
Query: 743 TGVGRALKSKKLTPKFIGPYQISERVGTVAYRVGLPPHLSNL-HDVFHVSQLRKY 796
T G KS KL P F GP+ + ++ G Y + LP + ++ FHVS L KY
Sbjct: 1203 TKTGFLHKSNKLAPSFAGPFYVLQKSGPNNYELDLPDSIKHMFSSTFHVSHLEKY 1257
>RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protein
type 2
Length = 1333
Score = 235 bits (599), Expect = 4e-61
Identities = 134/355 (37%), Positives = 198/355 (55%), Gaps = 9/355 (2%)
Query: 2 IDYRQLNKVTIKNRYPLPRIDDLMDQLVGARVFSKIDLSSGYHQIKVKDEDMQKTAFRTR 61
+DY+ LNK N YPLP I+ L+ ++ G+ +F+K+DL S YH I+V+ D K AFR
Sbjct: 466 VDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCP 525
Query: 62 YGHYEYKVMPFGVTNAPGVFMEYMNRIFHAYLDKFVVVFSDDILIYSKNEEEHAEHMKIV 121
G +EY VMP+G++ AP F ++N I + VV + D+ILI+SK+E EH +H+K V
Sbjct: 526 RGVFEYLVMPYGISIAPAHFQYFINTILGEVKESHVVCYMDNILIHSKSESEHVKHVKDV 585
Query: 122 LQLLKEKKLYAKLSKCEFWLSEVSFLGHIISGSGIVVDPSKVDAVSQWETPKSVTEIRSF 181
LQ LK L +KCEF S+V F+G+ IS G +D V QW+ PK+ E+R F
Sbjct: 586 LQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQF 645
Query: 182 LGLAGYYRRFIEGFSKLALPLTQLTCKGKSFVWDAQCENSFNELKRRLTTAPILILPKPD 241
LG Y R+FI S+L PL L K + W + +K+ L + P+L
Sbjct: 646 LGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFS 705
Query: 242 EPFVVYCDASKLGLGGVLMQ---DGKV--VAYASRQLRIHEKNYPTHDLELAAVVFVLKI 296
+ ++ DAS + +G VL Q D K V Y S ++ + NY D E+ A++ LK
Sbjct: 706 KKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKH 765
Query: 297 WRHYLYGS--*FEVFSDHKSL--KYLFDQKELNMRQRRWLELLKDCDFGLNYHPG 347
WRHYL + F++ +DH++L + + + N R RW L+D +F +NY PG
Sbjct: 766 WRHYLESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPG 820
Score = 124 bits (311), Expect = 1e-27
Identities = 75/235 (31%), Positives = 116/235 (48%), Gaps = 7/235 (2%)
Query: 564 QLAEIYIHNIVKLHGVPSSIVSDRNFRFTSRFWKSLQDALGSKLRVSSAYHPQADGHSER 623
Q A ++ ++ G P I++D + FTS+ WK ++ S Y PQ DG +ER
Sbjct: 1028 QTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTER 1087
Query: 624 TIQSLEDLLRVCVLEQGGAWDSHLPLIEFTYNNSYHSSIGMAPFETLYGRRCRTPLCWFE 683
T Q++E LLR W H+ L++ +YNN+ HS+ M PFE ++ R L E
Sbjct: 1088 TNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVH--RYSPALSPLE 1145
Query: 684 SGESVLLGPDLVHQTIEKVQMIREKMKASQSRQKSYHD-KRKKALEFQEGDHVFLRVTPM 742
+ +TI+ Q ++E + + + K Y D K ++ EFQ GD V ++
Sbjct: 1146 LPSFSDKTDENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVK---R 1202
Query: 743 TGVGRALKSKKLTPKFIGPYQISERVGTVAYRVGLPPHLSNL-HDVFHVSQLRKY 796
T G KS KL P F GP+ + ++ G Y + LP + ++ FHVS L KY
Sbjct: 1203 TKTGFLHKSNKLAPSFAGPFYVLQKSGPNNYELDLPDSIKHMFSSTFHVSHLEKY 1257
>POLY_DROME (P10401) Retrovirus-related Pol polyprotein from
transposon gypsy [Contains: Reverse transcriptase (EC
2.7.7.49); Endonuclease]
Length = 1035
Score = 233 bits (594), Expect = 2e-60
Identities = 128/362 (35%), Positives = 206/362 (56%), Gaps = 17/362 (4%)
Query: 2 IDYRQLNKVTIKNRYPLPRIDDLMDQLVGARVFSKIDLSSGYHQIKVKDEDMQKTAFRTR 61
ID+R+LN+ TI +RYP+P I ++ L A+ F+ +DL SGYHQI + + D +KT+F
Sbjct: 241 IDFRKLNEKTIPDRYPMPSIPMILANLGKAKFFTTLDLKSGYHQIYLAEHDREKTSFSVN 300
Query: 62 YGHYEYKVMPFGVTNAPGVFMEYMNRIFHAYLDKFVVVFSDDILIYSKNEEEHAEHMKIV 121
G YE+ +PFG+ NA +F ++ + + K V+ DD++I+S+NE +H H+ V
Sbjct: 301 GGKYEFCRLPFGLRNASSIFQRALDDVLREQIGKICYVYVDDVIIFSENESDHVRHIDTV 360
Query: 122 LQLLKEKKLYAKLSKCEFWLSEVSFLGHIISGSGIVVDPSKVDAVSQWETPKSVTEIRSF 181
L+ L + + K F+ V +LG I+S G DP KV A+ ++ P V ++RSF
Sbjct: 361 LKCLIDANMRVSQEKTRFFKESVEYLGFIVSKDGTKSDPEKVKAIQEYPEPDCVYKVRSF 420
Query: 182 LGLAGYYRRFIEGFSKLALPLTQLTCKGKS------------FVWDAQCENSFNELKRRL 229
LGLA YYR FI+ F+ +A P+T + KG++ ++ N+F L+ L
Sbjct: 421 LGLASYYRVFIKDFAAIARPITDI-LKGENGSVSKHMSKKIPVEFNETQRNAFQRLRNIL 479
Query: 230 TTAPILILPKPD--EPFVVYCDASKLGLGGVLMQDGKVVAYASRQLRIHEKNYPTHDLEL 287
+ + IL PD +PF + DAS G+G VL Q+G+ + SR L+ E+NY T++ EL
Sbjct: 480 ASEDV-ILKYPDFKKPFDLTTDASASGIGAVLSQEGRPITMISRTLKQPEQNYATNEREL 538
Query: 288 AAVVFVLKIWRHYLYGS-*FEVFSDHKSLKYLFDQKELNMRQRRWLELLKDCDFGLNYHP 346
A+V+ L +++LYGS +F+DH+ L + + N + +RW + + + Y P
Sbjct: 539 LAIVWALGKLQNFLYGSREINIFTDHQPLTFAVADRNTNAKIKRWKSYIDQHNAKVFYKP 598
Query: 347 GK 348
GK
Sbjct: 599 GK 600
>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
Length = 2186
Score = 225 bits (574), Expect = 3e-58
Identities = 125/358 (34%), Positives = 199/358 (54%), Gaps = 11/358 (3%)
Query: 2 IDYRQLNKVTIKNRYPLPRIDDLMDQLVGARVFSKIDLSSGYHQIKVKDEDMQKTAFRTR 61
IDYR++NKV N +PLP I+ + L G ++++ D+ +G+ QI + ++ + TAF
Sbjct: 996 IDYRKVNKVVKNNAHPLPNIEATLQSLAGKKLYTVFDMIAGFWQIPLDEKSKEITAFAIG 1055
Query: 62 YGHYEYKVMPFGVTNAPGVFMEYMNRIFHAYLDKFVVVFSDDILIYSKNEEEHAEHMKIV 121
+E+ V+PFG+ +P +F M I L V+ DD+LI SK+ E+H + +K
Sbjct: 1056 SELFEWNVLPFGLVISPALFQGTMEEIIGDLLGVCAFVYVDDLLIASKDMEQHLQDVKEA 1115
Query: 122 LQLLKEKKLYAKLSKCEFWLSEVSFLGHIISGSGIVVDPSKVDAVSQWETPKSVTEIRSF 181
L +++ + + SKC EV +LGH ++ G+ K D + Q+ P +V E++SF
Sbjct: 1116 LTRIRKSGMKLRASKCHIAKKEVEYLGHKVTLDGVETQEVKTDKMKQFSRPTNVKELQSF 1175
Query: 182 LGLAGYYRRFIEGFSKLALPLTQLTCKGKSFVWDAQCENSFNELKRRLTTAPILILP--- 238
LGL GYYR+FI F+++A LT L +++W+ + E +F ELK+ + P+L P
Sbjct: 1176 LGLVGYYRKFILNFAQIASSLTSLISAKVAWIWEKEQEIAFQELKKLVCQTPVLAQPDVE 1235
Query: 239 ---KPDEPFVVYCDASKLGLGGVLMQDG-----KVVAYASRQLRIHEKNYPTHDLELAAV 290
K D PF++Y DAS+ G+G VL Q+G +A+AS+ L E Y DLE A+
Sbjct: 1236 AALKGDRPFMIYTDASRKGIGAVLAQEGPDGQQHPIAFASKALSPAETRYHITDLEALAM 1295
Query: 291 VFVLKIWRHYLYGS*FEVFSDHKSLKYLFDQKELNMRQRRWLELLKDCDFGLNYHPGK 348
+F L+ ++ +YG+ VF+DHK L L L R RW + + D + Y GK
Sbjct: 1296 MFALRRFKTIIYGTAITVFTDHKPLISLLKGSPLADRLWRWSIEILEFDVKIVYLAGK 1353
Score = 47.8 bits (112), Expect = 1e-04
Identities = 53/218 (24%), Positives = 91/218 (41%), Gaps = 21/218 (9%)
Query: 579 VPSSIVSDRNFRFTSRFWKSLQDALGSKLRVSSAYHPQADGHSERTIQSLEDLLRVCVLE 638
+P +++D+ F + + L + + Y+ +A+G ER +++ +++
Sbjct: 1594 IPLKLLTDQGKEFVNGLFAQFTHMLKIEHITTKGYNSRANGAVERFNKTIMHIMKKKTAV 1653
Query: 639 QGGAWDSHLPLIEFTYNNSYHSSIGMAPFETLYGRRCRTPLCWFESGESVL--------L 690
WD + + YNN H + G P ++GR PL SGE +
Sbjct: 1654 PM-EWDDQVVYAVYAYNNCVHENTGETPMFLMHGRDVMGPL--EMSGEDAVGINYADMDE 1710
Query: 691 GPDLVHQTIEKVQMI-REKMKASQSRQKSYHDKR---KKALEFQEGDHVFLRVTPMTGVG 746
L+ Q + KVQ I +E Q KS D++ KK Q G V L + P +G
Sbjct: 1711 YKHLLTQELLKVQKIAKEHAMREQESYKSLFDQKYASKKHRFPQPGSRVLLEI-PSEKLG 1769
Query: 747 RALKSKKLTPKFIGPYQI---SERVGTVAYRVGLPPHL 781
+ KL K+ GPY++ SE + +G H+
Sbjct: 1770 --AQCPKLVNKWSGPYRVISCSENSAEITPVLGKRKHI 1805
>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
transposon 412 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1237
Score = 217 bits (552), Expect = 1e-55
Identities = 122/351 (34%), Positives = 185/351 (51%), Gaps = 4/351 (1%)
Query: 2 IDYRQLNKVTIKNRYPLPRIDDLMDQLVGARVFSKIDLSSGYHQIKVKDEDMQKTAFRTR 61
IDYRQ+NK + +++PLPRIDD++DQL A+ FS +DL SG+HQI++ + T+F T
Sbjct: 374 IDYRQINKKLLADKFPLPRIDDILDQLGRAKYFSCLDLMSGFHQIELDEGSRDITSFSTS 433
Query: 62 YGHYEYKVMPFGVTNAPGVFMEYMNRIFHAYLDKFVVVFSDDILIYSKNEEEHAEHMKIV 121
G Y + +PFG+ AP F M F ++ DD+++ +E+ +++ V
Sbjct: 434 NGSYRFTRLPFGLKIAPNSFQRMMTIAFSGIEPSQAFLYMDDLIVIGCSEKHMLKNLTEV 493
Query: 122 LQLLKEKKLYAKLSKCEFWLSEVSFLGHIISGSGIVVDPSKVDAVSQWETPKSVTEIRSF 181
+E L KC F++ EV+FLGH + GI+ D K D + + P R F
Sbjct: 494 FGKCREYNLKLHPEKCSFFMHEVTFLGHKCTDKGILPDDKKYDVIQNYPVPHDADSARRF 553
Query: 182 LGLAGYYRRFIEGFSKLALPLTQLTCKGKSFVWDAQCENSFNELKRRLTTAPILILPKPD 241
+ YYRRFI+ F+ + +T+L K F W +C+ +F LK +L +L P
Sbjct: 554 VAFCNYYRRFIKNFADYSRHITRLCKKNVPFEWTDECQKAFIHLKSQLINPTLLQYPDFS 613
Query: 242 EPFVVYCDASKLGLGGVLMQDGK----VVAYASRQLRIHEKNYPTHDLELAAVVFVLKIW 297
+ F + DASK G VL Q+ VAYASR E N T + ELAA+ + + +
Sbjct: 614 KEFCITTDASKQACGAVLTQNHNGHQLPVAYASRAFTKGESNKSTTEQELAAIHWAIIHF 673
Query: 298 RHYLYGS*FEVFSDHKSLKYLFDQKELNMRQRRWLELLKDCDFGLNYHPGK 348
R Y+YG F V +DH+ L YLF + + R L++ +F + Y GK
Sbjct: 674 RPYIYGKHFTVKTDHRPLTYLFSMVNPSSKLTRIRLELEEYNFTVEYLKGK 724
Score = 56.6 bits (135), Expect = 3e-07
Identities = 49/211 (23%), Positives = 93/211 (43%), Gaps = 16/211 (7%)
Query: 565 LAEIYIHNIVKLHGVPSSIVSDRNFRFTSRFWKSLQDALGSKLRVSSAYHPQADGHSERT 624
+A+ + + +G + ++D + + L L K S+A+H Q G ER+
Sbjct: 1010 VAKAIFESFILKYGPMKTFITDMGTEYKNSIITDLCKYLKIKNITSTAHHHQTVGVVERS 1069
Query: 625 IQSLEDLLRVCVLEQGGAWDSHLPLIEFTYNNSYHSSIGMAPFETLYGRRCRTPLCW--F 682
++L + +R + WD L + +N + P+E ++GR P +
Sbjct: 1070 HRTLNEYIRSYISTDKTDWDVWLQYFVYCFNTTQSMVHNYCPYELVFGRTSNLPKHFNKL 1129
Query: 683 ESGESVLLGPDLVHQTIEKVQM----IREKMKASQSRQKSYHDKRKKALEFQEGDHVFLR 738
S E + D ++ ++++ R+ ++A + + K +D + K +E + GD V LR
Sbjct: 1130 HSIEPIYNIDDYAKESKYRLEVAYARARKLLEAHKEKNKENYDLKVKDIELEVGDKVLLR 1189
Query: 739 VTPMTGVGRALKSKKLTPKFIGPYQISERVG 769
VG KL K+ GPY+I E +G
Sbjct: 1190 ----NEVGH-----KLDFKYTGPYKI-ESIG 1210
>POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 679
Score = 157 bits (397), Expect = 1e-37
Identities = 100/347 (28%), Positives = 176/347 (49%), Gaps = 17/347 (4%)
Query: 2 IDYRQLNKVTIKNRYPLPRIDDLMDQLVGARVFSKIDLSSGYHQIKVKDEDMQKTAFRTR 61
++Y+ +NK T+ + Y LP D+L+ + G ++FS D SG+ Q+ + E TAF
Sbjct: 304 VNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCP 363
Query: 62 YGHYEYKVMPFGVTNAPGVFMEYMNRIFHAYLDKFVVVFSDDILIYSKNEEEHAEHMKIV 121
GHYE+ V+PFG+ AP +F +M+ F + KF V+ DDIL++S NEE+H H+ ++
Sbjct: 364 QGHYEWNVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDILVFSNNEEDHLLHVAMI 422
Query: 122 LQLLKEKKLYAKLSKCEFWLSEVSFLGHIISGSGIVVDPSKVDAVSQW-ETPKSVTEIRS 180
LQ + + K + + +++FLG I ++ ++++ +T + +++
Sbjct: 423 LQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQR 482
Query: 181 FLGLAGYYRRFIEGFSKLALPLTQLTCKGKSFVWDAQCENSFNELKRRLTTAPILILPKP 240
FLG+ Y +I +++ PL + + W + ++K+ L P L P P
Sbjct: 483 FLGILTYASDYIPKLAQIRKPLQAKLKENVPWRWTKEDTLYMQKVKKNLQGFPPLHHPLP 542
Query: 241 DEPFVVYCDASKLGLGGVL----MQDGK----VVAYASRQLRIHEKNYPTHDLELAAVVF 292
+E ++ DAS GG+L + +G + YAS + EKNY ++D E AV+
Sbjct: 543 EEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAEKNYHSNDKETLAVIN 602
Query: 293 VLKIWRHYLYGS*FEVFSDHK------SLKYLFDQK-ELNMRQRRWL 332
+K + YL F + +D+ +L Y D K N+R + WL
Sbjct: 603 TIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWL 649
>POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 679
Score = 157 bits (396), Expect = 1e-37
Identities = 100/347 (28%), Positives = 176/347 (49%), Gaps = 17/347 (4%)
Query: 2 IDYRQLNKVTIKNRYPLPRIDDLMDQLVGARVFSKIDLSSGYHQIKVKDEDMQKTAFRTR 61
++Y+ +NK TI + Y LP D+L+ + G ++FS D SG+ Q+ + E TAF
Sbjct: 304 VNYKAMNKATIGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCP 363
Query: 62 YGHYEYKVMPFGVTNAPGVFMEYMNRIFHAYLDKFVVVFSDDILIYSKNEEEHAEHMKIV 121
GHYE+ V+PFG+ AP +F +M+ F + KF V+ DDIL++S NEE+H H+ ++
Sbjct: 364 QGHYEWNVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDILVFSNNEEDHLLHVAMI 422
Query: 122 LQLLKEKKLYAKLSKCEFWLSEVSFLGHIISGSGIVVDPSKVDAVSQW-ETPKSVTEIRS 180
LQ + + K + + +++FLG I ++ ++++ +T + +++
Sbjct: 423 LQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQR 482
Query: 181 FLGLAGYYRRFIEGFSKLALPLTQLTCKGKSFVWDAQCENSFNELKRRLTTAPILILPKP 240
FLG+ Y +I +++ PL + + W + ++K+ L P L P P
Sbjct: 483 FLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLP 542
Query: 241 DEPFVVYCDASKLGLGGVL----MQDGK----VVAYASRQLRIHEKNYPTHDLELAAVVF 292
+E ++ DAS GG+L + +G + YAS + E+NY ++D E AV+
Sbjct: 543 EEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAERNYHSNDKETLAVIN 602
Query: 293 VLKIWRHYLYGS*FEVFSDHK------SLKYLFDQK-ELNMRQRRWL 332
+K + YL F + +D+ +L Y D K N+R + WL
Sbjct: 603 TIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWL 649
>POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 679
Score = 157 bits (396), Expect = 1e-37
Identities = 100/347 (28%), Positives = 176/347 (49%), Gaps = 17/347 (4%)
Query: 2 IDYRQLNKVTIKNRYPLPRIDDLMDQLVGARVFSKIDLSSGYHQIKVKDEDMQKTAFRTR 61
++Y+ +NK TI + Y LP D+L+ + G ++FS D SG+ Q+ + E TAF
Sbjct: 304 VNYKAMNKATIGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCP 363
Query: 62 YGHYEYKVMPFGVTNAPGVFMEYMNRIFHAYLDKFVVVFSDDILIYSKNEEEHAEHMKIV 121
GHYE+ V+PFG+ AP +F +M+ F + KF V+ DDIL++S NEE+H H+ ++
Sbjct: 364 QGHYEWNVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDILVFSNNEEDHLLHVAMI 422
Query: 122 LQLLKEKKLYAKLSKCEFWLSEVSFLGHIISGSGIVVDPSKVDAVSQW-ETPKSVTEIRS 180
LQ + + K + + +++FLG I ++ ++++ +T + +++
Sbjct: 423 LQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQR 482
Query: 181 FLGLAGYYRRFIEGFSKLALPLTQLTCKGKSFVWDAQCENSFNELKRRLTTAPILILPKP 240
FLG+ Y +I +++ PL + + W + ++K+ L P L P P
Sbjct: 483 FLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLP 542
Query: 241 DEPFVVYCDASKLGLGGVL----MQDGK----VVAYASRQLRIHEKNYPTHDLELAAVVF 292
+E ++ DAS GG+L + +G + YAS + E+NY ++D E AV+
Sbjct: 543 EEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAERNYHSNDKETLAVIN 602
Query: 293 VLKIWRHYLYGS*FEVFSDHK------SLKYLFDQK-ELNMRQRRWL 332
+K + YL F + +D+ +L Y D K N+R + WL
Sbjct: 603 TIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWL 649
>POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 666
Score = 155 bits (393), Expect = 3e-37
Identities = 93/322 (28%), Positives = 170/322 (51%), Gaps = 7/322 (2%)
Query: 2 IDYRQLNKVTIKNRYPLPRIDDLMDQLVGARVFSKIDLSSGYHQIKVKDEDMQKTAFRTR 61
++Y+ +N+ TI + + LP + +L+ L G +FS D SG+ Q+ + +E + TAF
Sbjct: 297 VNYKAINQATIGDSHNLPNMQELLTLLRGKSIFSSFDCKSGFWQVVLDEESQKLTAFTCP 356
Query: 62 YGHYEYKVMPFGVTNAPGVFMEYMNRIFHAYLDKFVVVFSDDILIYSKNEEEHAEHMKIV 121
GH+++KV+PFG+ AP +F +M + DKF +V+ DDI+++S +E +H H+ V
Sbjct: 357 QGHFQWKVVPFGLKQAPSIFQRHMQTALNG-ADKFCMVYVDDIIVFSNSELDHYNHVYAV 415
Query: 122 LQLLKEKKLYAKLSKCEFWLSEVSFLGHIISGSGIVVDPSKVDAVSQW-ETPKSVTEIRS 180
L+++++ + K + +++FLG I ++ + ++ + + ++
Sbjct: 416 LKIVEKYGIILSKKKANLFKEKINFLGLEIDKGTHCPQNHILENIHKFPDRLEDKKHLQR 475
Query: 181 FLGLAGYYRRFIEGFSKLALPLTQLTCKGKSFVWDAQCENSFNELKRRLTTAPILILPKP 240
FLG+ Y +I +++ PL K ++ W + ++K+ L + P L LPKP
Sbjct: 476 FLGVLTYAETYIPKLAEIRKPLQVKLKKDVTWNWTQSDSDYVKKIKKNLGSFPKLYLPKP 535
Query: 241 DEPFVVYCDASKLGLGGVLMQ---DG--KVVAYASRQLRIHEKNYPTHDLELAAVVFVLK 295
++ ++ DAS GGVL DG + Y+S + EKNY ++D EL AV V+
Sbjct: 536 EDHLIIETDASDSFWGGVLKARALDGVELICRYSSGSFKQAEKNYHSNDKELLAVKQVIT 595
Query: 296 IWRHYLYGS*FEVFSDHKSLKY 317
+ YL F V +D+K+ Y
Sbjct: 596 KFSAYLTPVRFTVRTDNKNFTY 617
>POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 674
Score = 155 bits (391), Expect = 6e-37
Identities = 99/347 (28%), Positives = 175/347 (49%), Gaps = 17/347 (4%)
Query: 2 IDYRQLNKVTIKNRYPLPRIDDLMDQLVGARVFSKIDLSSGYHQIKVKDEDMQKTAFRTR 61
++Y+ +NK T+ + Y P D+L+ + G ++FS D SG+ Q+ + E TAF
Sbjct: 299 VNYKAMNKATVGDAYNPPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCP 358
Query: 62 YGHYEYKVMPFGVTNAPGVFMEYMNRIFHAYLDKFVVVFSDDILIYSKNEEEHAEHMKIV 121
GHYE+ V+PFG+ AP +F +M+ F + KF V+ DDIL++S NEE+H H+ ++
Sbjct: 359 QGHYEWNVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDILVFSNNEEDHLLHVAMI 417
Query: 122 LQLLKEKKLYAKLSKCEFWLSEVSFLGHIISGSGIVVDPSKVDAVSQW-ETPKSVTEIRS 180
LQ + + K + + +++FLG I ++ ++++ +T + +++
Sbjct: 418 LQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQR 477
Query: 181 FLGLAGYYRRFIEGFSKLALPLTQLTCKGKSFVWDAQCENSFNELKRRLTTAPILILPKP 240
FLG+ Y +I +++ PL + + W + ++K+ L P L P P
Sbjct: 478 FLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLP 537
Query: 241 DEPFVVYCDASKLGLGGVL----MQDGK----VVAYASRQLRIHEKNYPTHDLELAAVVF 292
+E ++ DAS GG+L + +G + YAS + EKNY ++D E AV+
Sbjct: 538 EEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAEKNYHSNDKETLAVIN 597
Query: 293 VLKIWRHYLYGS*FEVFSDHK------SLKYLFDQK-ELNMRQRRWL 332
+K + YL F + +D+ +L Y D K N+R + WL
Sbjct: 598 TIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWL 644
>POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 680
Score = 154 bits (389), Expect = 1e-36
Identities = 97/347 (27%), Positives = 175/347 (49%), Gaps = 17/347 (4%)
Query: 2 IDYRQLNKVTIKNRYPLPRIDDLMDQLVGARVFSKIDLSSGYHQIKVKDEDMQKTAFRTR 61
++Y+ +NK T+ + Y LP D+L+ + G ++FS D SG+ Q+ + E TAF
Sbjct: 305 VNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCP 364
Query: 62 YGHYEYKVMPFGVTNAPGVFMEYMNRIFHAYLDKFVVVFSDDILIYSKNEEEHAEHMKIV 121
GHYE+ V+PFG+ AP +F +M+ F + KF V+ DDI+++S NEE+H H+ ++
Sbjct: 365 QGHYEWNVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDIVVFSNNEEDHLLHVAMI 423
Query: 122 LQLLKEKKLYAKLSKCEFWLSEVSFLGHIISGSGIVVDPSKVDAVSQW-ETPKSVTEIRS 180
LQ + + K + + +++FLG I ++ ++++ +T + +++
Sbjct: 424 LQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQR 483
Query: 181 FLGLAGYYRRFIEGFSKLALPLTQLTCKGKSFVWDAQCENSFNELKRRLTTAPILILPKP 240
FLG+ Y +I +++ PL + + W + ++K+ L P L P P
Sbjct: 484 FLGILTYASDYIPNLAQMRQPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLP 543
Query: 241 DEPFVVYCDASKLGLGGVL----MQDGK----VVAYASRQLRIHEKNYPTHDLELAAVVF 292
+E ++ DAS GG+L + +G + Y S + E+NY ++D E AV+
Sbjct: 544 EEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYRSGSFKAAERNYHSNDKETLAVIN 603
Query: 293 VLKIWRHYLYGS*FEVFSDHK------SLKYLFDQK-ELNMRQRRWL 332
+K + YL F + +D+ +L Y D K N+R + WL
Sbjct: 604 TIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWL 650
>POL_RTBVP (P27502) Polyprotein (P194 protein) [Contains: Coat
protein; Protease (EC 3.4.23.-); Reverse transcriptase
(EC 2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
Length = 1675
Score = 149 bits (376), Expect = 3e-35
Identities = 101/344 (29%), Positives = 175/344 (50%), Gaps = 16/344 (4%)
Query: 3 DYRQLNKVTIKNRYPLPRIDDLMDQLVGARVFSKIDLSSGYHQIKVKDEDMQKTAFRTRY 62
+Y++LN + + +P +++ + A +FSK DL +G+H +K+KD+ T F
Sbjct: 1244 NYKRLNDNMHTDPFNIPHKISMINLIQKANIFSKFDLKAGFHHMKLKDDFKDWTTFTCSE 1303
Query: 63 GHYEYKVMPFGVTNAPGVFMEYMNRIFHAYLDKFVVVFSDDILIYSKNEEEHAEHMKIVL 122
G Y + V PFG+ NAP F +M F KF +++ DDILI S NE+EH EH+KI
Sbjct: 1304 GLYTWNVCPFGIANAPCAFQRFMQESFGDL--KFALLYIDDILIASNNEKEHIEHLKIFF 1361
Query: 123 QLLKEKKLYAKLSKCEFWLSEVSFLGHIISGSGIVVDPSKVDAVSQWETPK--SVTEIRS 180
+KE K + +L EV +LG I I + P VD + +++ K ++ +++
Sbjct: 1362 NRVKEVGCVLSKKKSKMFLKEVEYLGVEIKEGKISLQPHIVDKIKKFDKNKLNTLKGLQA 1421
Query: 181 FLGLAGYYRRFIEGFSKLALPLTQLTCKGKSFVWDAQCENSFNELKRRLTTAPILILPKP 240
+LGL Y R +I+ SKL PL + T K +++ + N +++R ++ L PK
Sbjct: 1422 YLGLLNYARGYIKDLSKLVGPLYKKTGKNGQRIFNKEDWNIIFKIEREVSKIKPLERPKE 1481
Query: 241 DEPFVVYCDASKLGLGGVLM---------QDGKVVAYASRQLRIHEKNYPTHDLELAAVV 291
+ ++ DAS+ G G VL+ K+ YAS +K + + D E+ A+
Sbjct: 1482 TDYIIIETDASEEGWGAVLVCKPDKYSGKDTEKIAGYASGNFG-EKKTWTSLDYEIEAIN 1540
Query: 292 FVLKIWRHYLYGS*FEVFSDHKSLKYLFDQKELNMRQR-RWLEL 334
L ++ YL F + +D +++ ++ R + RW++L
Sbjct: 1541 EALNKFQIYL-DKDFTIRTDCEAIVKGIKTEDYKKRSKTRWIKL 1583
>POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;
Protease (EC 3.4.23.-); Reverse transcriptase (EC
2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
Length = 1886
Score = 149 bits (376), Expect = 3e-35
Identities = 99/343 (28%), Positives = 170/343 (48%), Gaps = 17/343 (4%)
Query: 3 DYRQLNKVTIKNRYPLPRIDDLMDQLVGARVFSKIDLSSGYHQIKVKDEDMQKTAFRTRY 62
+Y+ LN+ T ++Y LP I+ ++ ++ ++++SK DL SG+ Q+ +++E + TAF
Sbjct: 1468 NYKLLNENTESDQYSLPGINTIISKVGRSKIYSKFDLKSGFWQVAMEEESVPWTAFLAGN 1527
Query: 63 GHYEYKVMPFGVTNAPGVFMEYMNRIFHAYLDKFVVVFSDDILIYSKNEEEHAEHMKIVL 122
YE+ VMPFG+ NAP +F M+ +F +KF+ V+ DDIL++S+ E+H++H+ +L
Sbjct: 1528 KLYEWLVMPFGLKNAPAIFQRKMDNVFKG-TEKFIAVYIDDILVFSETAEQHSQHLYTML 1586
Query: 123 QLLKEKKLYAKLSKCEFWLSEVSFLGHIISGSGIVVDPSKVDAVSQWETPKSVTE--IRS 180
QL KE L +K + E+ FLG + + I + P + + + K T +RS
Sbjct: 1587 QLCKENGLILSPTKMKIGTPEIDFLGASLGCTKIKLQPHIISKICDFSDEKLATPEGMRS 1646
Query: 181 FLGLAGYYRRFIEGFSKLALPLTQLTCKGKSFVWDAQCENSFNELKRRLTTAPILILPKP 240
+LG+ Y R +I+ KL PL Q + + ++K ++ P L LP
Sbjct: 1647 WLGILSYARNYIQDIGKLVQPLRQKMAPTGDKRMNPETWKMVRQIKEKVKNLPDLQLPPK 1706
Query: 241 DEPFVVYCDASKLGLGGVL---------MQDGKVVAYASRQLRIHEKNYPTHDLELAAVV 291
D ++ D G G V ++ AYAS + T D E+ A +
Sbjct: 1707 DSFIIIETDGCMTGWGAVCKWKMSKHDPRSTERICAYASGSFNPIKS---TIDAEIQAAI 1763
Query: 292 FVL-KIWRHYLYGS*FEVFSDHKSLKYLFDQKELNMRQR-RWL 332
L K +YL + SD +++ +++ N R RWL
Sbjct: 1764 HGLDKFKIYYLDKKELIIRSDCEAIIKFYNKTNENKPSRVRWL 1806
>POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 659
Score = 138 bits (348), Expect = 6e-32
Identities = 93/357 (26%), Positives = 177/357 (49%), Gaps = 12/357 (3%)
Query: 2 IDYRQLNKVTIKNRYPLPRIDDLMDQLVGARVFSKIDLSSGYHQIKVKDEDMQKTAFRTR 61
++Y+ +NK T + + LP D+L+ + G +++S D SG Q+ + E TAF
Sbjct: 286 VNYKAMNKATKGDAHNLPNKDELLTLVRGKKIYSSFDCKSGLWQVLLDKESQLLTAFTCP 345
Query: 62 YGHYEYKVMPFGVTNAPGVFME-YMNRIFHAYLDKFVVVFSDDILIYSK-NEEEHAEHMK 119
GHY++ V+PFG+ AP +F + Y N + Y K+ V+ DDIL++S +EH H+
Sbjct: 346 QGHYQWNVVPFGLKQAPSIFPKTYANSHSNQY-SKYCCVYVDDILVFSNTGRKEHYIHVL 404
Query: 120 IVLQLLKEKKLYAKLSKCEFWLSEVSFLGHIISGSGIVVDPSKVDAVSQW-ETPKSVTEI 178
+L+ ++ + K + + +++FLG I ++ + ++ + + ++
Sbjct: 405 NILRRCEKLGIILSKKKAQLFKEKINFLGLEIDQGTHCPQNHILEHIHKFPDRIEDKKQL 464
Query: 179 RSFLGLAGYYRRFIEGFSKLALPLTQLTCKGKSFVWDAQCENSFNELKRRLTTAPILILP 238
+ FLG+ Y +I + + PL + ++ W+ ++K+ L + P L P
Sbjct: 465 QRFLGILTYASDYIPKLASIRKPLQSKLKEDSTWTWNDTDSQYMAKIKKNLKSFPKLYHP 524
Query: 239 KPDEPFVVYCDASKLGLGGVLM----QDGKVVAYASRQLRIHEKNYPTHDLELAAVVFVL 294
+P++ V+ DAS+ GG+L + YAS + E+NY +++ EL AV+ V+
Sbjct: 525 EPNDKLVIETDASEEFWGGILKAIHNSHEYICRYASGSFKAAERNYHSNEKELLAVIRVI 584
Query: 295 KIWRHYLYGS*FEVFSDHKSLKYLFDQKELNMRQR----RWLELLKDCDFGLNYHPG 347
K + YL S F + +D+K+ + + R++ RW L DF + + G
Sbjct: 585 KKFSIYLTPSRFLIRTDNKNFTHFVNINLKGDRKQGRLVRWQMWLSQYDFDVEHIAG 641
>RRPO_OENBE (P31843) RNA-directed DNA polymerase homolog (Reverse
transcriptase homolog)
Length = 142
Score = 136 bits (343), Expect = 2e-31
Identities = 64/123 (52%), Positives = 87/123 (70%), Gaps = 3/123 (2%)
Query: 2 IDYRQLNKVTIKNRYPLPRIDDLMDQLVGARVFSKIDLSSGYHQIKVKDEDMQKTAFRTR 61
IDYR L KVTIKN+YP+PR+DDL D+L A F+K+DL SGY Q+++ D KT TR
Sbjct: 10 IDYRALTKVTIKNKYPIPRVDDLFDRLAQATWFTKLDLRSGYWQVRIAKGDEPKTTCVTR 69
Query: 62 YGHYEYKVMPFGVTNAPGVFMEYMNRIFHAYLDKFVVVFSDDIL---IYSKNEEEHAEHM 118
YG +E++VMPFG+TNA F MN + + YLD FVVV+ DD++ IYS + EH +H+
Sbjct: 70 YGSFEFRVMPFGLTNALATFCNLMNNVLYEYLDHFVVVYLDDLVVYTIYSNSLHEHIKHL 129
Query: 119 KIV 121
++V
Sbjct: 130 RVV 132
>POL_SOCMV (P15629) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 692
Score = 120 bits (302), Expect = 1e-26
Identities = 86/292 (29%), Positives = 150/292 (50%), Gaps = 15/292 (5%)
Query: 2 IDYRQLNKVTIKNRYPLPRIDDLMDQLVGARVFSKIDLSSGYHQIKVKDEDMQKTAFRTR 61
I+Y+++N+ TI + Y LPR D +++++ G+ FS +D SGY+Q+++ + TAF
Sbjct: 261 INYKKMNEATIGDSYKLPRKDFILEKIKGSLWFSSLDAKSGYYQLRLHENTKPLTAFSCP 320
Query: 62 -YGHYEYKVMPFGVTNAPGVFMEYMNRIFHAYLDKFVVVFSDDILIYSK-NEEEHAEHMK 119
HYE+ V+ FG+ AP ++ +M++ L+ + + DDILI++K ++E+H ++
Sbjct: 321 PQKHYEWNVLSFGLKQAPSIYQRFMDQSLKG-LEHICLAYIDDILIFTKGSKEQHVNDVR 379
Query: 120 IVLQLLKEKKLYAKLSKCEFWLSEVSFLGHIISGSG-IVVDPSKVDAVSQW-ETPKSVTE 177
IVLQ +KEK + K + E+ +LG I G+G I + P + + Q+ + + +
Sbjct: 380 IVLQRIKEKGIIISKKKSKLIQQEIEYLGLKIQGNGEIDLSPHTQEKILQFPDELEDRKQ 439
Query: 178 IRSFLGLAGYYRRFIEGFSK-LALPLTQLTCK---GKSFVWDAQCENSFNELKRRLTTAP 233
I+ FLG Y EGF K LAL L K + WD +K ++ + P
Sbjct: 440 IQRFLGCINYIAN--EGFFKNLALERKHLQKKISVKNPWKWDTIDTKMVQSIKGKIQSLP 497
Query: 234 ILILPKPDEPFVVYCDASKLGLGGVLMQDGKVVAYASRQLRIHEKNYPTHDL 285
L + +V DAS+ G L + + +++ + E PT DL
Sbjct: 498 KLYNASIQDFLIVETDASQHSWSGCL----RALPKGKQKIGLDEFGIPTADL 545
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.337 0.148 0.477
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 96,177,587
Number of Sequences: 164201
Number of extensions: 3884685
Number of successful extensions: 14871
Number of sequences better than 10.0: 123
Number of HSP's better than 10.0 without gapping: 48
Number of HSP's successfully gapped in prelim test: 75
Number of HSP's that attempted gapping in prelim test: 14649
Number of HSP's gapped (non-prelim): 210
length of query: 869
length of database: 59,974,054
effective HSP length: 119
effective length of query: 750
effective length of database: 40,434,135
effective search space: 30325601250
effective search space used: 30325601250
T: 11
A: 40
X1: 15 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 39 (21.7 bits)
S2: 70 (31.6 bits)
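The trailer statistics above are related by simple arithmetic, and a short sketch can make that explicit. The Python below is an illustration, not code taken from BLAST: it assumes the usual Karlin-Altschul relations (bit score = (lambda*S - ln K) / ln 2, E = K * m * n * exp(-lambda*S)) and assumes the effective lengths are obtained by subtracting the effective HSP length once from the query and once per database sequence. The function names are hypothetical. All constants are copied from this report, and because lambda and K are printed to only three significant figures, the recomputed E-value agrees with the reported one only approximately.

```python
import math

# Values copied from the report trailer above.
QUERY_LEN = 869            # "length of query"
DB_LETTERS = 59_974_054    # "length of database"
DB_SEQS = 164_201          # "Number of Sequences"
HSP_LEN = 119              # "effective HSP length"
LAMBDA_GAPPED = 0.267      # gapped Lambda
K_GAPPED = 0.0410          # gapped K


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space).

    Assumption: the HSP length is subtracted once from the query and
    once for every sequence in the database.
    """
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to bits: (lambda*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, lam, k, search_space):
    """Expected number of chance hits: K * m * n * exp(-lambda*S)."""
    return k * search_space * math.exp(-lam * raw_score)


if __name__ == "__main__":
    eff_q, eff_db, space = effective_search_space(
        QUERY_LEN, DB_LETTERS, DB_SEQS, HSP_LEN
    )
    print("effective length of query:   ", eff_q)
    print("effective length of database:", eff_db)
    print("effective search space:      ", space)

    # Top hit POL3_DROME: raw score 739, reported as 289 bits, E = 3e-77.
    print("bits:", round(bit_score(739, LAMBDA_GAPPED, K_GAPPED)))
    print("E:   %.0e" % expect_value(739, LAMBDA_GAPPED, K_GAPPED, space))
```

Under these assumptions the three effective-length figures reproduce the trailer exactly, and the raw scores 739 and 715 convert to the 289 and 280 bits reported for POL3_DROME and POL2_DROME.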
Medicago: description of AC121241.13