
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC146759.6 - phase: 0
(1825 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E Value
POL3_DROME (P04323) Retrovirus-related Pol polyprotein from tran... 300 2e-80
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III 300 3e-80
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei... 298 1e-79
RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protei... 296 5e-79
RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protei... 296 5e-79
POL2_DROME (P20825) Retrovirus-related Pol polyprotein from tran... 285 6e-76
POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from tran... 265 7e-70
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran... 246 4e-64
POLY_DROME (P10401) Retrovirus-related Pol polyprotein from tran... 236 6e-61
POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic prot... 176 7e-43
POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic prot... 170 3e-41
POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic pro... 155 1e-36
POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic pro... 155 1e-36
POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic pro... 155 1e-36
POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic pro... 152 8e-36
POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic pro... 151 2e-35
POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;... 145 1e-33
POL_SFV1 (P23074) Pol polyprotein [Contains: Protease (EC 3.4.23... 129 6e-29
POL_SFV3L (P27401) Pol polyprotein [Contains: Protease (EC 3.4.2... 128 1e-28
POL_RTBVP (P27502) Polyprotein (P194 protein) [Contains: Coat pr... 119 1e-25
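
The one-line hit summaries above can be read programmatically. The following is a minimal sketch, assuming each summary line has the shape "ENTRY (ACCESSION) description score e-value" as printed in this report; the names `Hit` and `parse_hit_line` are hypothetical helpers for illustration and are not part of BLAST or any library.

```python
import re
from typing import NamedTuple, Optional


class Hit(NamedTuple):
    """One row of the 'Sequences producing significant alignments' list."""
    entry: str        # e.g. POL3_DROME
    accession: str    # e.g. P04323
    description: str  # possibly truncated with '...'
    bits: float       # bit score
    evalue: float     # expectation value


# Entry name, accession in parentheses, free-text description,
# then the last two whitespace-separated tokens: bit score and E value.
_HIT_RE = re.compile(r"^(\S+)\s+\((\w+)\)\s+(.*?)\s+(\d+(?:\.\d+)?)\s+(\S+)$")


def parse_hit_line(line: str) -> Optional[Hit]:
    """Return a Hit for a summary line, or None if the line does not match."""
    m = _HIT_RE.match(line.strip())
    if m is None:
        return None
    entry, accession, description, bits, evalue = m.groups()
    # Assumption: very small E values may be printed without a leading
    # digit (for example 'e-100'); float() needs the mantissa.
    if evalue.startswith("e"):
        evalue = "1" + evalue
    try:
        return Hit(entry, accession, description, float(bits), float(evalue))
    except ValueError:
        return None


if __name__ == "__main__":
    sample = ("RT21_SCHPO (Q05654) Retrotransposable element Tf2 "
              "155 kDa protei... 298 1e-79")
    print(parse_hit_line(sample))
```

Because the description is matched lazily and the score and E value are anchored at the end of the line, numbers inside the description (such as "155 kDa") stay part of the description.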
>POL3_DROME (P04323) Retrovirus-related Pol polyprotein from
transposon 17.6 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1058
Score = 300 bits (768), Expect = 2e-80
Identities = 219/717 (30%), Positives = 343/717 (47%), Gaps = 79/717 (11%)
Query: 949 KEVVKKEVLKLLDAGMIYPISDSSWVSPVHVVPKKGGMTVVVNEKNELIPTRTVTGR--- 1005
++ V+ ++ +L+ G+I S+S + SP+ VVPKK + +G+
Sbjct: 220 EQEVESQIQDMLNQGIIRT-SNSPYNSPIWVVPKK----------------QDASGKQKF 262
Query: 1006 RMCIDYRRLNTATRKDHFPLPFMDQMIERLAGQAFYCFLDGYSGYNQIAVAPEDQEKTAF 1065
R+ IDYR+LN T D P+P MD+++ +L ++ +D G++QI + PE KTAF
Sbjct: 263 RIVIDYRKLNEITVGDRHPIPNMDEILGKLGRCNYFTTIDLAKGFHQIEMDPESVSKTAF 322
Query: 1066 TCPFGVFAYRRMPFGLCGAPATFQRCMLSIFSDMIEKNIEVFMDDFSVFGKSFDQCLFHL 1125
+ G + Y RMPFGL APATFQRCM I ++ K+ V++DD VF S D+ L L
Sbjct: 323 STKHGHYEYLRMPFGLKNAPATFQRCMNDILRPLLNKHCLVYLDDIIVFSTSLDEHLQSL 382
Query: 1126 NAVLKRCTETNLILNWEKCHFMVTEGIVLGHKISSKGIEVDQAKIEVIEKLPPPMNVKGV 1185
V ++ + NL L +KC F+ E LGH ++ GI+ + KIE I+K P P K +
Sbjct: 383 GLVFEKLAKANLKLQLDKCEFLKQETTFLGHVLTPDGIKPNPEKIEAIQKYPIPTKPKEI 442
Query: 1186 RSFLGHAGFYRRFIKDFSKIAKPLCNLLVKETEFD-FDVECLNAFSLIKNKLVTAPIIIA 1244
++FLG G+YR+FI +F+ IAKP+ L K + D + E +AF +K + PI+
Sbjct: 443 KAFLGLTGYYRKFIPNFADIAKPMTKCLKKNMKIDTTNPEYDSAFKKLKYLISEDPILKV 502
Query: 1245 PNWDLHFELMCDASDYAVGAVLGQRKNKFFHAIYYASKVLNESQVNYSTTEKELLAVIFA 1304
P++ F L DASD A+GAVL Q H + Y S+ LNE ++NYST EKELLA+++A
Sbjct: 503 PDFTKKFTLTTDASDVALGAVLSQDG----HPLSYISRTLNEHEINYSTIEKELLAIVWA 558
Query: 1305 LEKFRSYLIGSKVIVFTDHAALKYLLTKGDSKPRLLRWVLLLQEFDLEIRDKKGVENVVA 1364
+ FR YL+G + +DH L +L D +L RW + L EFD +I+ KG EN VA
Sbjct: 559 TKTFRHYLLGRHFEISSDHQPLSWLYRMKDPNSKLTRWRVKLSEFDFDIKYIKGKENCVA 618
Query: 1365 DHLSRLENNEV-TKKEGAIMAEFPDEQLFAIRERP--WFADMANFKAG------------ 1409
D LSR++ E ++ AE + L I ERP F F G
Sbjct: 619 DALSRIKLEETYLSEQTQHSAEEDNSDLIFITERPLNTFNRQVIFSKGPPDIKVTKYFKK 678
Query: 1410 ---NIIPDDMEQHQRKKFFKD----------------------ANHYLWDDPYLFKVSTD 1444
I D M + + +++ D A+ + Y + +
Sbjct: 679 HITQIFYDIMTREKAEQYLIDHFCGKKSALYIESDADFEVIQAAHKLAINTKYTKILRST 738
Query: 1445 GLIRRCVAGEEIKNIVWHCHSSAYGGHHSGERTAAKVLQSGFWWPTLFKDCHDFVRRCDN 1504
L++ E K ++ H H G + K+ +++P + + C
Sbjct: 739 ILLKNITTYAEFKELILTAHEKLL---HPGIQKTTKLFGETYYFPNSQLLIQNIINECSI 795
Query: 1505 CQRTGSISKRNEMPLTGIIEVEPFDC---WGIDFMGPFPPSSSYLHILVCVDYVTKWVEA 1561
C + + +MP +P C + ID SS H + C+D +K+
Sbjct: 796 CNLAKTEHRNTDMPTK--TTPKPEHCREKFMIDIY-----SSEGKHYVSCIDIYSKFATL 848
Query: 1562 IPCVANDSKTVVNFLRKNIFTRFGTPRVLISDGGKHFCNNFLETVLKKYNIKHKVAT 1618
D N L + IF + G P++L +D F + L+ L+ ++ ++ T
Sbjct: 849 EEIKTKDWIECKNALMR-IFNQLGKPKLLKADRDGAFSSLALKRWLESEEVELQLNT 904
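
Each alignment in this report opens with a "Score = ..., Expect = ..." line followed by an "Identities = ..., Positives = ..., Gaps = ..." line. The sketch below shows one way to extract those fields, assuming the exact wording used in this report; `parse_score_line` and `parse_fraction_line` are hypothetical names chosen for illustration.

```python
import re
from typing import Dict, Tuple

# "Score = 300 bits (768), Expect = 2e-80"
_SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)
# "Identities = 219/717 (30%), Positives = 343/717 (47%), Gaps = 79/717 (11%)"
_FRAC_RE = re.compile(r"(Identities|Positives|Gaps)\s*=\s*(\d+)/(\d+)\s*\((\d+)%\)")


def parse_score_line(line: str) -> Tuple[float, int, float]:
    """Return (bit score, raw score, expect) from a 'Score = ...' line."""
    m = _SCORE_RE.search(line)
    if m is None:
        raise ValueError(f"not a score line: {line!r}")
    bits, raw, expect = m.groups()
    expect = expect.rstrip(",")
    # Assumption: an E value printed as 'e-100' means 1e-100.
    if expect.startswith("e"):
        expect = "1" + expect
    return float(bits), int(raw), float(expect)


def parse_fraction_line(line: str) -> Dict[str, Tuple[int, int, int]]:
    """Map each statistic name to (count, alignment length, printed percent)."""
    return {
        name: (int(count), int(length), int(percent))
        for name, count, length, percent in _FRAC_RE.findall(line)
    }


if __name__ == "__main__":
    print(parse_score_line("Score = 300 bits (768), Expect = 2e-80"))
    print(parse_fraction_line(
        "Identities = 219/717 (30%), Positives = 343/717 (47%), Gaps = 79/717 (11%)"
    ))
```

The percentages are kept as printed rather than recomputed from the counts, since this sketch makes no assumption about how the program rounds them.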
>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
Length = 2186
Score = 300 bits (767), Expect = 3e-80
Identities = 253/926 (27%), Positives = 419/926 (44%), Gaps = 67/926 (7%)
Query: 894 EEEKLMRVLRENEGALGWKISDLKGISPAYCMHRIHMEAEYKSVVQPQRRLNPTMKEVVK 953
++ K+ V+ + + +L S C+ + AE + Q R + +K ++
Sbjct: 902 DDRKIWDVIEQFQDVFAISDDELGRNSGTECVIELKEGAE--PIRQKPRPIPLALKPEIR 959
Query: 954 KEVLKLLDAGMIYPISDSSWVSPVHVVPKKGGMTVVVNEKNELIPTRTVTGRRMCIDYRR 1013
K + K+L+ +I S S W SPV +V KK G RMCIDYR+
Sbjct: 960 KMIQKMLNQKVIRE-SKSPWSSPVVLVKKKDGSI------------------RMCIDYRK 1000
Query: 1014 LNTATRKDHFPLPFMDQMIERLAGQAFYCFLDGYSGYNQIAVAPEDQEKTAFTCPFGVFA 1073
+N + + PLP ++ ++ LAG+ Y D +G+ QI + + +E TAF +F
Sbjct: 1001 VNKVVKNNAHPLPNIEATLQSLAGKKLYTVFDMIAGFWQIPLDEKSKEITAFAIGSELFE 1060
Query: 1074 YRRMPFGLCGAPATFQRCMLSIFSDMIEKNIEVFMDDFSVFGKSFDQCLFHLNAVLKRCT 1133
+ +PFGL +PA FQ M I D++ V++DD + K +Q L + L R
Sbjct: 1061 WNVLPFGLVISPALFQGTMEEIIGDLLGVCAFVYVDDLLIASKDMEQHLQDVKEALTRIR 1120
Query: 1134 ETNLILNWEKCHFMVTEGIVLGHKISSKGIEVDQAKIEVIEKLPPPMNVKGVRSFLGHAG 1193
++ + L KCH E LGHK++ G+E + K + +++ P NVK ++SFLG G
Sbjct: 1121 KSGMKLRASKCHIAKKEVEYLGHKVTLDGVETQEVKTDKMKQFSRPTNVKELQSFLGLVG 1180
Query: 1194 FYRRFIKDFSKIAKPLCNLLVKETEFDFDVECLNAFSLIKNKLVTAPIIIAPN------W 1247
+YR+FI +F++IA L +L+ + + ++ E AF +K + P++ P+
Sbjct: 1181 YYRKFILNFAQIASSLTSLISAKVAWIWEKEQEIAFQELKKLVCQTPVLAQPDVEAALKG 1240
Query: 1248 DLHFELMCDASDYAVGAVLGQR-KNKFFHAIYYASKVLNESQVNYSTTEKELLAVIFALE 1306
D F + DAS +GAVL Q + H I +ASK L+ ++ Y T+ E LA++FAL
Sbjct: 1241 DRPFMIYTDASRKGIGAVLAQEGPDGQQHPIAFASKALSPAETRYHITDLEALAMMFALR 1300
Query: 1307 KFRSYLIGSKVIVFTDHAALKYLLTKGDSKPRLLRWVLLLQEFDLEIRDKKGVENVVADH 1366
+F++ + G+ + VFTDH L LL RL RW + + EFD++I G N VAD
Sbjct: 1301 RFKTIIYGTAITVFTDHKPLISLLKGSPLADRLWRWSIEILEFDVKIVYLAGKANAVADA 1360
Query: 1367 LSR----------LENNEVTKKEGAIMAEFPDEQLFAIRERPWFADMANFKAG-NIIPDD 1415
LSR + E+T AI E PD + W + G +
Sbjct: 1361 LSRGGCPPNELEEEQTKELTSIVNAIQTELPD----ILDSSCWLERLKGEDEGWKEVIAA 1416
Query: 1416 MEQHQRKKFFKDANHYLWDDPYLFKVSTDGLIR--------RCVAGEEIKN-IVWHCHSS 1466
+E + K FK +K+ G+++ R V E+I+ ++ H
Sbjct: 1417 LEGGKTKGTFKIVGIESEISLEYYKI-VGGVLKNTEIEEQSRSVVPEKIRTPLLKELHEG 1475
Query: 1467 AYGGHHSGERTAAKVLQSGFWWPTLFKDCHDFVRRCDNCQRTGSISKRNEMPLTGIIEVE 1526
GH G + +++ F+WP + + VR C C SK LT
Sbjct: 1476 MLAGHF-GIKKMWRMVHRKFYWPQMRVCVENCVRTCAKCLCANDHSKLTS-SLTPYRMTF 1533
Query: 1527 PFDCWGIDFMGPFPPSSSYLHILVCVDYVTKWVEAIPCVANDSKTVVN-FLRKNIFTRFG 1585
P + D M +IL +D TK+ A+P ++TV+ F+ +
Sbjct: 1534 PLEIVACDLMDVGLSVQGNRYILTIIDLFTKYGTAVPIPDKKAETVLKAFVERWAIGEGR 1593
Query: 1586 TPRVLISDGGKHFCNNFLETVLKKYNIKHKVATPYHPQTSGQVEVSNRQLKQILEKTVAS 1645
P L++D GK F N I+H Y+ + +G VE N+ + I++K A
Sbjct: 1594 IPLKLLTDQGKEFVNGLFAQFTHMLKIEHITTKGYNSRANGAVERFNKTIMHIMKKKTAV 1653
Query: 1646 SRKDWSRKLDDALWAYRIAFKTHLGLSPYQLVFGKACHLPVELEHKAYWAIKALNFDQTL 1705
+W ++ A++AY + G +P L+ G+ P+E+ + I + D+
Sbjct: 1654 P-MEWDDQVVYAVYAYNNCVHENTGETPMFLMHGRDVMGPLEMSGEDAVGINYADMDE-- 1710
Query: 1706 AGKKRLLKLNELEEMRLGAYENAVIYKERTKRYHDKGLVRREFYVGQ-----LVLLFNSR 1760
K LL EL +++ A E+A+ +E K D+ ++ Q L+ + + +
Sbjct: 1711 --YKHLL-TQELLKVQKIAKEHAMREQESYKSLFDQKYASKKHRFPQPGSRVLLEIPSEK 1767
Query: 1761 LKLFPGKLKSKWSGPFMIESISPYGA 1786
L KL +KWSGP+ + S S A
Sbjct: 1768 LGAQCPKLVNKWSGPYRVISCSENSA 1793
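
The "Query:" and "Sbjct:" rows of an alignment carry a start coordinate, the aligned residues (with "-" marking gaps), and an end coordinate. As a rough sketch under that assumption, the rows can be split into their parts as follows; `AlignRow`, `parse_align_row`, `gap_count` and `residue_count` are hypothetical helpers, and the unlabeled match line between the two rows is deliberately skipped.

```python
import re
from typing import NamedTuple, Optional


class AlignRow(NamedTuple):
    label: str  # "Query" or "Sbjct"
    start: int  # coordinate of the first residue in this row
    seq: str    # aligned residues, '-' marks a gap
    end: int    # coordinate of the last residue in this row


_ROW_RE = re.compile(r"^(Query|Sbjct):\s+(\d+)\s+(\S+)\s+(\d+)$")


def parse_align_row(line: str) -> Optional[AlignRow]:
    """Parse a labeled alignment row; return None for any other line."""
    m = _ROW_RE.match(line.strip())
    if m is None:
        return None
    label, start, seq, end = m.groups()
    return AlignRow(label, int(start), seq, int(end))


def gap_count(row: AlignRow) -> int:
    """Number of gap characters in the row."""
    return row.seq.count("-")


def residue_count(row: AlignRow) -> int:
    """Number of actual residues (row length minus gaps)."""
    return len(row.seq) - gap_count(row)


if __name__ == "__main__":
    row = parse_align_row("Sbjct: 1768 LGAQCPKLVNKWSGPYRVISCSENSA 1793")
    if row is not None:
        # For a well-formed row, residues should span start..end inclusive.
        print(row.label, row.start, row.end,
              residue_count(row) == row.end - row.start + 1)
```

Comparing `residue_count` with `end - start + 1` is a cheap consistency check that a row was not damaged when the report was copied or reflowed.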
>RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protein
type 1
Length = 1333
Score = 298 bits (762), Expect = 1e-79
Identities = 243/873 (27%), Positives = 403/873 (45%), Gaps = 78/873 (8%)
Query: 944 LNPTMKEVVKKEVLKLLDAGMIYPISDSSWVSPVHVVPKKGGMTVVVNEKNELIPTRTVT 1003
L P + + E+ + L +G+I S + PV VPKK G
Sbjct: 420 LPPGKMQAMNDEINQGLKSGIIRE-SKAINACPVMFVPKKEGTL---------------- 462
Query: 1004 GRRMCIDYRRLNTATRKDHFPLPFMDQMIERLAGQAFYCFLDGYSGYNQIAVAPEDQEKT 1063
RM +DY+ LN + + +PLP ++Q++ ++ G + LD S Y+ I V D+ K
Sbjct: 463 --RMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKL 520
Query: 1064 AFTCPFGVFAYRRMPFGLCGAPATFQRCMLSIFSDMIEKNIEVFMDDFSVFGKSFDQCLF 1123
AF CP GVF Y MP+G+ APA FQ + +I + E ++ +MDD + KS + +
Sbjct: 521 AFRCPRGVFEYLVMPYGISTAPAHFQYFINTILGEAKESHVVCYMDDILIHSKSESEHVK 580
Query: 1124 HLNAVLKRCTETNLILNWEKCHFMVTEGIVLGHKISSKGIEVDQAKIEVIEKLPPPMNVK 1183
H+ VL++ NLI+N KC F ++ +G+ IS KG Q I+ + + P N K
Sbjct: 581 HVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRK 640
Query: 1184 GVRSFLGHAGFYRRFIKDFSKIAKPLCNLLVKETEFDFDVECLNAFSLIKNKLVTAPIII 1243
+R FLG + R+FI S++ PL NLL K+ + + A IK LV+ P++
Sbjct: 641 ELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLR 700
Query: 1244 APNWDLHFELMCDASDYAVGAVLGQR--KNKFFHAIYYASKVLNESQVNYSTTEKELLAV 1301
++ L DASD AVGAVL Q+ +K++ YY++K ++++Q+NYS ++KE+LA+
Sbjct: 701 HFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAK-MSKAQLNYSVSDKEMLAI 759
Query: 1302 IFALEKFRSYLIGS--KVIVFTDHAALKYLLTKGDSKP---RLLRWVLLLQEFDLEIRDK 1356
I +L+ +R YL + + TDH L +T +S+P RL RW L LQ+F+ EI +
Sbjct: 760 IKSLKHWRHYLESTIEPFKILTDHRNLIGRIT-NESEPENKRLARWQLFLQDFNFEINYR 818
Query: 1357 KGVENVVADHLSRLENNEVTKKEGAIMAEFPDEQLFAIRERPWFADMAN---------FK 1407
G N +AD LSR + + I + D + + + D N K
Sbjct: 819 PGSANHIADALSR-----IVDETEPIPKDSEDNSINFVNQISITDDFKNQVVTEYTNDTK 873
Query: 1408 AGNIIPDDMEQHQRKKFFKDANHYLWDDPYLFKVSTDGLIRRCVA--GEEIKNIVWHCHS 1465
N++ ++ ++ + KD D L T L R + EE K I
Sbjct: 874 LLNLLNNEDKRVEENIQLKDGLLINSKDQILLPNDTQ-LTRTIIKKYHEEGKLI------ 926
Query: 1466 SAYGGHHSGERTAAKVLQSGFWWPTLFKDCHDFVRRCDNCQRTGSISKRNEMPLTGIIEV 1525
H G ++ F W + K ++V+ C CQ S + + PL I
Sbjct: 927 ------HPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPS 980
Query: 1526 E-PFDCWGIDFMGPFPPSSSYLHILVCVDYVTKWVEAIPCVAN-DSKTVVNFLRKNIFTR 1583
E P++ +DF+ P SS Y + V VD +K +PC + ++ + +
Sbjct: 981 ERPWESLSMDFITALPESSGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAY 1040
Query: 1584 FGTPRVLISDGGKHFCNNFLETVLKKYNIKHKVATPYHPQTSGQVEVSNRQLKQILEKTV 1643
FG P+ +I+D F + + KYN K + PY PQT GQ E +N+ ++++L
Sbjct: 1041 FGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVC 1100
Query: 1644 ASSRKDWSRKLDDALWAYRIAFKTHLGLSPYQLVFGKACHL-PVELEHKAYWAIKALNFD 1702
++ W + +Y A + ++P+++V + L P+EL + D
Sbjct: 1101 STHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLELPSFS---------D 1151
Query: 1703 QTLAGKKRLLKLNELEEMRLGAYENAVIYKERTKRYHDKGLVR-REFYVGQLVLLFNSRL 1761
+T + +++ + + L N + + K+Y D + EF G LV++ ++
Sbjct: 1152 KTDENSQETIQVFQTVKEHLNT--NNI----KMKKYFDMKIQEIEEFQPGDLVMVKRTKT 1205
Query: 1762 KLF--PGKLKSKWSGPFMIESISPYGAVELSKP 1792
KL ++GPF + S EL P
Sbjct: 1206 GFLHKSNKLAPSFAGPFYVLQKSGPNNYELDLP 1238
>RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protein
type 3
Length = 1333
Score = 296 bits (757), Expect = 5e-79
Identities = 242/873 (27%), Positives = 404/873 (45%), Gaps = 78/873 (8%)
Query: 944 LNPTMKEVVKKEVLKLLDAGMIYPISDSSWVSPVHVVPKKGGMTVVVNEKNELIPTRTVT 1003
L P + + E+ + L +G+I S + PV VPKK G
Sbjct: 420 LPPGKMQAMNDEINQGLKSGIIRE-SKAINACPVMFVPKKEGTL---------------- 462
Query: 1004 GRRMCIDYRRLNTATRKDHFPLPFMDQMIERLAGQAFYCFLDGYSGYNQIAVAPEDQEKT 1063
RM +DY+ LN + + +PLP ++Q++ ++ G + LD S Y+ I V D+ K
Sbjct: 463 --RMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKL 520
Query: 1064 AFTCPFGVFAYRRMPFGLCGAPATFQRCMLSIFSDMIEKNIEVFMDDFSVFGKSFDQCLF 1123
AF CP GVF Y MP+G+ APA FQ + +I ++ E ++ +MD+ + KS + +
Sbjct: 521 AFRCPRGVFEYLVMPYGISIAPAHFQYFINTILGEVKESHVVCYMDNILIHSKSESEHVK 580
Query: 1124 HLNAVLKRCTETNLILNWEKCHFMVTEGIVLGHKISSKGIEVDQAKIEVIEKLPPPMNVK 1183
H+ VL++ NLI+N KC F ++ +G+ IS KG Q I+ + + P N K
Sbjct: 581 HVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRK 640
Query: 1184 GVRSFLGHAGFYRRFIKDFSKIAKPLCNLLVKETEFDFDVECLNAFSLIKNKLVTAPIII 1243
+R FLG + R+FI S++ PL NLL K+ + + A IK LV+ P++
Sbjct: 641 ELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLR 700
Query: 1244 APNWDLHFELMCDASDYAVGAVLGQR--KNKFFHAIYYASKVLNESQVNYSTTEKELLAV 1301
++ L DASD AVGAVL Q+ +K++ YY++K ++++Q+NYS ++KE+LA+
Sbjct: 701 HFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAK-MSKAQLNYSVSDKEMLAI 759
Query: 1302 IFALEKFRSYLIGS--KVIVFTDHAALKYLLTKGDSKP---RLLRWVLLLQEFDLEIRDK 1356
I +L+ +R YL + + TDH L +T +S+P RL RW L LQ+F+ EI +
Sbjct: 760 IKSLKHWRHYLESTIEPFKILTDHRNLIGRIT-NESEPENKRLARWQLFLQDFNFEINYR 818
Query: 1357 KGVENVVADHLSRLENNEVTKKEGAIMAEFPDEQLFAIRERPWFADMAN---------FK 1407
G N +AD LSR + + I + D + + + D N K
Sbjct: 819 PGSANHIADALSR-----IVDETEPIPKDSEDNSINFVNQISITDDFKNQVVTEYTNDTK 873
Query: 1408 AGNIIPDDMEQHQRKKFFKDANHYLWDDPYLFKVSTDGLIRRCVA--GEEIKNIVWHCHS 1465
N++ ++ ++ + KD D L T L R + EE K I
Sbjct: 874 LLNLLNNEDKRVEENIQLKDGLLINSKDQILLPNDTQ-LTRTIIKKYHEEGKLI------ 926
Query: 1466 SAYGGHHSGERTAAKVLQSGFWWPTLFKDCHDFVRRCDNCQRTGSISKRNEMPLTGIIEV 1525
H G ++ F W + K ++V+ C CQ S + + PL I
Sbjct: 927 ------HPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPS 980
Query: 1526 E-PFDCWGIDFMGPFPPSSSYLHILVCVDYVTKWVEAIPCVAN-DSKTVVNFLRKNIFTR 1583
E P++ +DF+ P SS Y + V VD +K +PC + ++ + +
Sbjct: 981 ERPWESLSMDFITALPESSGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAY 1040
Query: 1584 FGTPRVLISDGGKHFCNNFLETVLKKYNIKHKVATPYHPQTSGQVEVSNRQLKQILEKTV 1643
FG P+ +I+D F + + KYN K + PY PQT GQ E +N+ ++++L
Sbjct: 1041 FGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVC 1100
Query: 1644 ASSRKDWSRKLDDALWAYRIAFKTHLGLSPYQLVFGKACHL-PVELEHKAYWAIKALNFD 1702
++ W + +Y A + ++P+++V + L P+EL + D
Sbjct: 1101 STHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLELPSFS---------D 1151
Query: 1703 QTLAGKKRLLKLNELEEMRLGAYENAVIYKERTKRYHDKGLVR-REFYVGQLVLLFNSRL 1761
+T + +++ + + L N + + K+Y D + EF G LV++ ++
Sbjct: 1152 KTDENSQETIQVFQTVKEHLNT--NNI----KMKKYFDMKIQEIEEFQPGDLVMVKRTKT 1205
Query: 1762 KLF--PGKLKSKWSGPFMIESISPYGAVELSKP 1792
KL ++GPF + S EL P
Sbjct: 1206 GFLHKSNKLAPSFAGPFYVLQKSGPNNYELDLP 1238
>RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protein
type 2
Length = 1333
Score = 296 bits (757), Expect = 5e-79
Identities = 242/873 (27%), Positives = 404/873 (45%), Gaps = 78/873 (8%)
Query: 944 LNPTMKEVVKKEVLKLLDAGMIYPISDSSWVSPVHVVPKKGGMTVVVNEKNELIPTRTVT 1003
L P + + E+ + L +G+I S + PV VPKK G
Sbjct: 420 LPPGKMQAMNDEINQGLKSGIIRE-SKAINACPVMFVPKKEGTL---------------- 462
Query: 1004 GRRMCIDYRRLNTATRKDHFPLPFMDQMIERLAGQAFYCFLDGYSGYNQIAVAPEDQEKT 1063
RM +DY+ LN + + +PLP ++Q++ ++ G + LD S Y+ I V D+ K
Sbjct: 463 --RMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKL 520
Query: 1064 AFTCPFGVFAYRRMPFGLCGAPATFQRCMLSIFSDMIEKNIEVFMDDFSVFGKSFDQCLF 1123
AF CP GVF Y MP+G+ APA FQ + +I ++ E ++ +MD+ + KS + +
Sbjct: 521 AFRCPRGVFEYLVMPYGISIAPAHFQYFINTILGEVKESHVVCYMDNILIHSKSESEHVK 580
Query: 1124 HLNAVLKRCTETNLILNWEKCHFMVTEGIVLGHKISSKGIEVDQAKIEVIEKLPPPMNVK 1183
H+ VL++ NLI+N KC F ++ +G+ IS KG Q I+ + + P N K
Sbjct: 581 HVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRK 640
Query: 1184 GVRSFLGHAGFYRRFIKDFSKIAKPLCNLLVKETEFDFDVECLNAFSLIKNKLVTAPIII 1243
+R FLG + R+FI S++ PL NLL K+ + + A IK LV+ P++
Sbjct: 641 ELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLR 700
Query: 1244 APNWDLHFELMCDASDYAVGAVLGQR--KNKFFHAIYYASKVLNESQVNYSTTEKELLAV 1301
++ L DASD AVGAVL Q+ +K++ YY++K ++++Q+NYS ++KE+LA+
Sbjct: 701 HFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAK-MSKAQLNYSVSDKEMLAI 759
Query: 1302 IFALEKFRSYLIGS--KVIVFTDHAALKYLLTKGDSKP---RLLRWVLLLQEFDLEIRDK 1356
I +L+ +R YL + + TDH L +T +S+P RL RW L LQ+F+ EI +
Sbjct: 760 IKSLKHWRHYLESTIEPFKILTDHRNLIGRIT-NESEPENKRLARWQLFLQDFNFEINYR 818
Query: 1357 KGVENVVADHLSRLENNEVTKKEGAIMAEFPDEQLFAIRERPWFADMAN---------FK 1407
G N +AD LSR + + I + D + + + D N K
Sbjct: 819 PGSANHIADALSR-----IVDETEPIPKDSEDNSINFVNQISITDDFKNQVVTEYTNDTK 873
Query: 1408 AGNIIPDDMEQHQRKKFFKDANHYLWDDPYLFKVSTDGLIRRCVA--GEEIKNIVWHCHS 1465
N++ ++ ++ + KD D L T L R + EE K I
Sbjct: 874 LLNLLNNEDKRVEENIQLKDGLLINSKDQILLPNDTQ-LTRTIIKKYHEEGKLI------ 926
Query: 1466 SAYGGHHSGERTAAKVLQSGFWWPTLFKDCHDFVRRCDNCQRTGSISKRNEMPLTGIIEV 1525
H G ++ F W + K ++V+ C CQ S + + PL I
Sbjct: 927 ------HPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPS 980
Query: 1526 E-PFDCWGIDFMGPFPPSSSYLHILVCVDYVTKWVEAIPCVAN-DSKTVVNFLRKNIFTR 1583
E P++ +DF+ P SS Y + V VD +K +PC + ++ + +
Sbjct: 981 ERPWESLSMDFITALPESSGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAY 1040
Query: 1584 FGTPRVLISDGGKHFCNNFLETVLKKYNIKHKVATPYHPQTSGQVEVSNRQLKQILEKTV 1643
FG P+ +I+D F + + KYN K + PY PQT GQ E +N+ ++++L
Sbjct: 1041 FGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVC 1100
Query: 1644 ASSRKDWSRKLDDALWAYRIAFKTHLGLSPYQLVFGKACHL-PVELEHKAYWAIKALNFD 1702
++ W + +Y A + ++P+++V + L P+EL + D
Sbjct: 1101 STHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLELPSFS---------D 1151
Query: 1703 QTLAGKKRLLKLNELEEMRLGAYENAVIYKERTKRYHDKGLVR-REFYVGQLVLLFNSRL 1761
+T + +++ + + L N + + K+Y D + EF G LV++ ++
Sbjct: 1152 KTDENSQETIQVFQTVKEHLNT--NNI----KMKKYFDMKIQEIEEFQPGDLVMVKRTKT 1205
Query: 1762 KLF--PGKLKSKWSGPFMIESISPYGAVELSKP 1792
KL ++GPF + S EL P
Sbjct: 1206 GFLHKSNKLAPSFAGPFYVLQKSGPNNYELDLP 1238
>POL2_DROME (P20825) Retrovirus-related Pol polyprotein from
transposon 297 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1059
Score = 285 bits (730), Expect = 6e-76
Identities = 214/728 (29%), Positives = 350/728 (47%), Gaps = 76/728 (10%)
Query: 937 VVQPQRRLNPTMKEVVKKEVLKLLDAGMIYPISDSSWVSPVHVVPKKGGMTVVVNEKNEL 996
+ Q L T + V+ +V ++L+ G+I S+S + SP VVPKK
Sbjct: 207 IYSKQYPLAQTHEIEVENQVQEMLNQGLIRE-SNSPYNSPTWVVPKK------------- 252
Query: 997 IPTRTVTGR-RMCIDYRRLNTATRKDHFPLPFMDQMIERLAGQAFYCFLDGYSGYNQIAV 1055
P + + R+ IDYR+LN T D +P+P MD+++ +L ++ +D G++QI +
Sbjct: 253 -PDASGANKYRVVIDYRKLNEITIPDRYPIPNMDEILGKLGKCQYFTTIDLAKGFHQIEM 311
Query: 1056 APEDQEKTAFTCPFGVFAYRRMPFGLCGAPATFQRCMLSIFSDMIEKNIEVFMDDFSVFG 1115
E KTAF+ G + Y RMPFGL APATFQRCM +I ++ K+ V++DD +F
Sbjct: 312 DEESISKTAFSTKSGHYEYLRMPFGLRNAPATFQRCMNNILRPLLNKHCLVYLDDIIIFS 371
Query: 1116 KSFDQCLFHLNAVLKRCTETNLILNWEKCHFMVTEGIVLGHKISSKGIEVDQAKIEVIEK 1175
S + L + V + + NL L +KC F+ E LGH ++ GI+ + K++ I
Sbjct: 372 TSLTEHLNSIQLVFTKLADANLKLQLDKCEFLKKEANFLGHIVTPDGIKPNPIKVKAIVS 431
Query: 1176 LPPPMNVKGVRSFLGHAGFYRRFIKDFSKIAKPLCNLLVKETEFDFD-VECLNAFSLIKN 1234
P P K +R+FLG G+YR+FI +++ IAKP+ + L K T+ D +E + AF +K
Sbjct: 432 YPIPTKDKEIRAFLGLTGYYRKFIPNYADIAKPMTSCLKKRTKIDTQKLEYIEAFEKLKA 491
Query: 1235 KLVTAPIIIAPNWDLHFELMCDASDYAVGAVLGQRKNKFFHAIYYASKVLNESQVNYSTT 1294
++ PI+ P+++ F L DAS+ A+GAVL Q H I + S+ LN+ ++NYS
Sbjct: 492 LIIRDPILQLPDFEKKFVLTTDASNLALGAVLSQNG----HPISFISRTLNDHELNYSAI 547
Query: 1295 EKELLAVIFALEKFRSYLIGSKVIVFTDHAALKYLLTKGDSKPRLLRWVLLLQEFDLEIR 1354
EKELLA+++A + FR YL+G + ++ +DH L++L + +L RW + L E+ +I
Sbjct: 548 EKELLAIVWATKTFRHYLLGRQFLIASDHQPLRWLHNLKEPGAKLERWRVRLSEYQFKID 607
Query: 1355 DKKGVENVVADHLSRLENNEVTKKEGA-IMAEFPDEQLFAIRERP--WFADMANFKA--- 1408
KG EN VAD LSR++ E E AE + L + E+P +F F
Sbjct: 608 YIKGKENSVADALSRIKIEENHHSEATQHSAEEDNSNLIHLTEKPINYFKKQIIFIKSDK 667
Query: 1409 ---------GNIIP----DDMEQHQRKKFFKD----------------------ANHYLW 1433
GN I D M + K+ D A+ +
Sbjct: 668 NKVEHSKIFGNSITTIQYDVMTLEKAKQILLDHFIHRNITIYIESDVDFEIVQRAHIEIV 727
Query: 1434 DDPYLFKVSTDGLIRRCVAGEEIKNIVWHCHSSAYGGHHSGERTAAKVLQSGFWWPTLFK 1493
+ Y + + L++ + E K I+ H H G + K+ + ++P
Sbjct: 728 NTTYTKVIRSLFLLKNVGSYAEFKEIILQSHEKLL---HPGIQKMTKLFKENHFFPNSQL 784
Query: 1494 DCHDFVRRCDNCQRTGSISKRNEMPLTGIIEVEPFDC---WGIDFMGPFPPSSSYLHILV 1550
+ + C+ C + + +MPL I P C + +D SS H +
Sbjct: 785 LIQNIINECNICNLAKTEHRNTKMPLK--ITPNPEHCREKFVVDIY-----SSEGKHYIS 837
Query: 1551 CVDYVTKWVEAIPCVANDSKTVVNFLRKNIFTRFGTPRVLISDGGKHFCNNFLETVLKKY 1610
C+D +K+ D N L + IF + G P++L +D F + L+ L++
Sbjct: 838 CIDIYSKFATLEQIKTKDWIECRNALMR-IFNQLGKPKLLKADRDGAFSSLALKRWLEEE 896
Query: 1611 NIKHKVAT 1618
++ ++ T
Sbjct: 897 EVELQLNT 904
>POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from
transposon opus [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1003
Score = 265 bits (678), Expect = 7e-70
Identities = 159/435 (36%), Positives = 252/435 (57%), Gaps = 26/435 (5%)
Query: 948 MKEVVKKEVLKLLDAGMIYPISDSSWVSPVHVVPKKGGMTVVVNEKNELIPTRTVTGRRM 1007
M+ V++++ +LL G+I P S+S + SP+ +VPKK EK RM
Sbjct: 135 MRGEVERQIDELLQDGIIRP-SNSPYNSPIWIVPKKPKPN---GEKQY----------RM 180
Query: 1008 CIDYRRLNTATRKDHFPLPFMDQMIERLAGQAFYCFLDGYSGYNQIAVAPEDQEKTAFTC 1067
+D++RLNT T D +P+P ++ + L ++ LD SG++QI + D KTAF+
Sbjct: 181 VVDFKRLNTVTIPDTYPIPDINATLASLGNAKYFTTLDLTSGFHQIHMKESDIPKTAFST 240
Query: 1068 PFGVFAYRRMPFGLCGAPATFQRCMLSIFSDMIEKNIEVFMDDFSVFGKSFDQCLFHLNA 1127
G + + R+PFGL APA FQR + I + I K V++DD VF + +D +L
Sbjct: 241 LNGKYEFLRLPFGLKNAPAIFQRMIDDILREHIGKVCYVYIDDIIVFSEDYDTHWKNLRL 300
Query: 1128 VLKRCTETNLILNWEKCHFMVTEGIVLGHKISSKGIEVDQAKIEVIEKLPPPMNVKGVRS 1187
VL ++ NL +N EK HF+ T+ LG+ +++ GI+ D K+ I ++PPP +VK ++
Sbjct: 301 VLASLSKANLQVNLEKSHFLDTQVEFLGYIVTADGIKADPKKVRAISEMPPPTSVKELKR 360
Query: 1188 FLGHAGFYRRFIKDFSKIAKPLCNLL------VKETE-----FDFDVECLNAFSLIKNKL 1236
FLG +YR+FI+D++K+AKPL NL +K ++ D L +F+ +K+ L
Sbjct: 361 FLGMTSYYRKFIQDYAKVAKPLTNLTRGLYANIKSSQSSKVPITLDETALQSFNDLKSIL 420
Query: 1237 VTAPIIIAPNWDLHFELMCDASDYAVGAVLGQRKNKFFHAIYYASKVLNESQVNYSTTEK 1296
++ I+ P + F L DAS++A+GAVL Q I Y S+ LN+++ NY+T EK
Sbjct: 421 CSSEILAFPCFTKPFHLTTDASNWAIGAVLSQDDQGRDRPIAYISRSLNKTEENYATIEK 480
Query: 1297 ELLAVIFALEKFRSYLIGSKVI-VFTDHAALKYLLTKGDSKPRLLRWVLLLQEFDLEIRD 1355
E+LA+I++L+ R+YL G+ I V+TDH L + L + +L RW ++E++ E+
Sbjct: 481 EMLAIIWSLDNLRAYLYGAGTIKVYTDHQPLTFALGNRNFNAKLKRWKARIEEYNCELIY 540
Query: 1356 KKGVENVVADHLSRL 1370
K G NVVAD LSR+
Sbjct: 541 KPGKSNVVADALSRI 555
>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
transposon 412 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1237
Score = 246 bits (628), Expect = 4e-64
Identities = 151/434 (34%), Positives = 230/434 (52%), Gaps = 13/434 (2%)
Query: 950 EVVKKEVLKLLDAGMIYPISDSSWVSPVHVVPKKGGMTVVVNEKNELIPTRTVTGRRMCI 1009
E ++ +V KL+ ++ P S S + SP+ +VPKK P R+ I
Sbjct: 328 EEIQAQVQKLIKDKIVEP-SVSQYNSPLLLVPKKSS------------PNSDKKKWRLVI 374
Query: 1010 DYRRLNTATRKDHFPLPFMDQMIERLAGQAFYCFLDGYSGYNQIAVAPEDQEKTAFTCPF 1069
DYR++N D FPLP +D ++++L ++ LD SG++QI + ++ T+F+
Sbjct: 375 DYRQINKKLLADKFPLPRIDDILDQLGRAKYFSCLDLMSGFHQIELDEGSRDITSFSTSN 434
Query: 1070 GVFAYRRMPFGLCGAPATFQRCMLSIFSDMIEKNIEVFMDDFSVFGKSFDQCLFHLNAVL 1129
G + + R+PFGL AP +FQR M FS + ++MDD V G S L +L V
Sbjct: 435 GSYRFTRLPFGLKIAPNSFQRMMTIAFSGIEPSQAFLYMDDLIVIGCSEKHMLKNLTEVF 494
Query: 1130 KRCTETNLILNWEKCHFMVTEGIVLGHKISSKGIEVDQAKIEVIEKLPPPMNVKGVRSFL 1189
+C E NL L+ EKC F + E LGHK + KGI D K +VI+ P P + R F+
Sbjct: 495 GKCREYNLKLHPEKCSFFMHEVTFLGHKCTDKGILPDDKKYDVIQNYPVPHDADSARRFV 554
Query: 1190 GHAGFYRRFIKDFSKIAKPLCNLLVKETEFDFDVECLNAFSLIKNKLVTAPIIIAPNWDL 1249
+YRRFIK+F+ ++ + L K F++ EC AF +K++L+ ++ P++
Sbjct: 555 AFCNYYRRFIKNFADYSRHITRLCKKNVPFEWTDECQKAFIHLKSQLINPTLLQYPDFSK 614
Query: 1250 HFELMCDASDYAVGAVLGQRKNKFFHAIYYASKVLNESQVNYSTTEKELLAVIFALEKFR 1309
F + DAS A GAVL Q N + YAS+ + + N STTE+EL A+ +A+ FR
Sbjct: 615 EFCITTDASKQACGAVLTQNHNGHQLPVAYASRAFTKGESNKSTTEQELAAIHWAIIHFR 674
Query: 1310 SYLIGSKVIVFTDHAALKYLLTKGDSKPRLLRWVLLLQEFDLEIRDKKGVENVVADHLSR 1369
Y+ G V TDH L YL + + +L R L L+E++ + KG +N VAD LSR
Sbjct: 675 PYIYGKHFTVKTDHRPLTYLFSMVNPSSKLTRIRLELEEYNFTVEYLKGKDNHVADALSR 734
Query: 1370 LENNEVTKKEGAIM 1383
+ E+ G I+
Sbjct: 735 ITIKELKDITGNIL 748
Score = 123 bits (308), Expect = 5e-27
Identities = 93/326 (28%), Positives = 154/326 (46%), Gaps = 20/326 (6%)
Query: 1469 GGHHSGERTAAKVLQSGFWWPTLFKDCHDFVRRCDNCQRTGSISKRNEMPLTGIIEV--E 1526
GGH +T AKV + ++W + K ++VR+C CQ+ + +K + P+T I E
Sbjct: 907 GGHTGITKTLAKVKRH-YYWKNMSKYIKEYVRKCQKCQKAKT-TKHTKTPMT-ITETPEH 963
Query: 1527 PFDCWGIDFMGPFPPSSS---YLHILVCVDYVTKWVEAIPCVANDSKTVVNFLRKNIFTR 1583
FD +D +GP P S + Y L+C +TK++ AIP +KTV + ++ +
Sbjct: 964 AFDRVVVDTIGPLPKSENGNEYAVTLICD--LTKYLVAIPIANKSAKTVAKAIFESFILK 1021
Query: 1584 FGTPRVLISDGGKHFCNNFLETVLKKYNIKHKVATPYHPQTSGQVEVSNRQLKQILEKTV 1643
+G + I+D G + N+ + + K IK+ +T +H QT G VE S+R L + + +
Sbjct: 1022 YGPMKTFITDMGTEYKNSIITDLCKYLKIKNITSTAHHHQTVGVVERSHRTLNEYIRSYI 1081
Query: 1644 ASSRKDWSRKLDDALWAYRIAFKTHLGLSPYQLVFGKACHLPVELEHKAYWAIKALNFDQ 1703
++ + DW L ++ + PY+LVFG+ +LP +K + N D
Sbjct: 1082 STDKTDWDVWLQYFVYCFNTTQSMVHNYCPYELVFGRTSNLPKHF-NKLHSIEPIYNIDD 1140
Query: 1704 TLAGKKRLLKLNELEEMRLGAYENAVIYKERTKRYHDKGLVRREFYVGQLVLLFNSRLKL 1763
K L++ +L +KE+ K +D + E VG VLL N
Sbjct: 1141 YAKESKYRLEVAYARARKL-----LEAHKEKNKENYDLKVKDIELEVGDKVLLRNE---- 1191
Query: 1764 FPGKLKSKWSGPFMIESISPYGAVEL 1789
KL K++GP+ IESI + L
Sbjct: 1192 VGHKLDFKYTGPYKIESIGDNNNITL 1217
>POLY_DROME (P10401) Retrovirus-related Pol polyprotein from
transposon gypsy [Contains: Reverse transcriptase (EC
2.7.7.49); Endonuclease]
Length = 1035
Score = 236 bits (601), Expect = 6e-61
Identities = 155/447 (34%), Positives = 241/447 (53%), Gaps = 33/447 (7%)
Query: 943 RLNPTMKEV---VKKEVLKLLDAGMIYPISDSSWVSPVHVVPKKGGMTVVVNEKNELIPT 999
R PT+ V V EV +LL G+I P S S + SP VV KKG T N
Sbjct: 185 RAYPTLMGVSDFVNNEVKQLLKDGIIRP-SRSPYNSPTWVVDKKG--TDAFGNPN----- 236
Query: 1000 RTVTGRRMCIDYRRLNTATRKDHFPLPFMDQMIERLAGQAFYCFLDGYSGYNQIAVAPED 1059
+R+ ID+R+LN T D +P+P + ++ L F+ LD SGY+QI +A D
Sbjct: 237 -----KRLVIDFRKLNEKTIPDRYPMPSIPMILANLGKAKFFTTLDLKSGYHQIYLAEHD 291
Query: 1060 QEKTAFTCPFGVFAYRRMPFGLCGAPATFQRCMLSIFSDMIEKNIEVFMDDFSVFGKSFD 1119
+EKT+F+ G + + R+PFGL A + FQR + + + I K V++DD +F ++
Sbjct: 292 REKTSFSVNGGKYEFCRLPFGLRNASSIFQRALDDVLREQIGKICYVYVDDVIIFSENES 351
Query: 1120 QCLFHLNAVLKRCTETNLILNWEKCHFMVTEGIVLGHKISSKGIEVDQAKIEVIEKLPPP 1179
+ H++ VLK + N+ ++ EK F LG +S G + D K++ I++ P P
Sbjct: 352 DHVRHIDTVLKCLIDANMRVSQEKTRFFKESVEYLGFIVSKDGTKSDPEKVKAIQEYPEP 411
Query: 1180 MNVKGVRSFLGHAGFYRRFIKDFSKIAKPLCNLL-----------VKETEFDFDVECLNA 1228
V VRSFLG A +YR FIKDF+ IA+P+ ++L K+ +F+ NA
Sbjct: 412 DCVYKVRSFLGLASYYRVFIKDFAAIARPITDILKGENGSVSKHMSKKIPVEFNETQRNA 471
Query: 1229 FSLIKNKLVTAPIIIA-PNWDLHFELMCDASDYAVGAVLGQRKNKFFHAIYYASKVLNES 1287
F ++N L + +I+ P++ F+L DAS +GAVL Q I S+ L +
Sbjct: 472 FQRLRNILASEDVILKYPDFKKPFDLTTDASASGIGAVLSQEG----RPITMISRTLKQP 527
Query: 1288 QVNYSTTEKELLAVIFALEKFRSYLIGSKVI-VFTDHAALKYLLTKGDSKPRLLRWVLLL 1346
+ NY+T E+ELLA+++AL K +++L GS+ I +FTDH L + + ++ ++ RW +
Sbjct: 528 EQNYATNERELLAIVWALGKLQNFLYGSREINIFTDHQPLTFAVADRNTNAKIKRWKSYI 587
Query: 1347 QEFDLEIRDKKGVENVVADHLSRLENN 1373
+ + ++ K G EN VAD LSR N
Sbjct: 588 DQHNAKVFYKPGKENFVADALSRQNLN 614
>POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 666
Score = 176 bits (445), Expect = 7e-43
Identities = 164/562 (29%), Positives = 263/562 (46%), Gaps = 69/562 (12%)
Query: 831 NEQLL--KPTKLEEMSNEGKLGAKSLSNEEEKIPELKELPSHLKYVFLSKDVSKPA---- 884
NE +L K TK +SN L ++ E+IP +SK++ P
Sbjct: 147 NEMVLIKKVTKAFSVSNPSFLENMKKDSKTEQIPGTN----------ISKNIINPEERYF 196
Query: 885 IISSTLTPLEEEKLMRVLRENE----GALGWKISDLKGISPAYCMHRIHMEAEYKSVVQP 940
+I+ +E+ L +V EN + W + +K I P + V+P
Sbjct: 197 LITEKYQKIEQ-LLDKVCSENPIDPIKSKQWMKASIKLIDPLKVIR-----------VKP 244
Query: 941 QRRLNPTMKEVVKKEVLKLLDAGMIYPISDSSWVSPVHVVPKKGGMTVVVNEKNELIPTR 1000
+P +E K++ +LLD G+I P S S +SP +V + R
Sbjct: 245 MS-YSPQDREGFAKQIKELLDLGLIIP-SKSQHMSPAFLVENEA--------------ER 288
Query: 1001 TVTGRRMCIDYRRLNTATRKDHFPLPFMDQMIERLAGQAFYCFLDGYSGYNQIAVAPEDQ 1060
+RM ++Y+ +N AT D LP M +++ L G++ + D SG+ Q+ + E Q
Sbjct: 289 RRGKKRMVVNYKAINQATIGDSHNLPNMQELLTLLRGKSIFSSFDCKSGFWQVVLDEESQ 348
Query: 1061 EKTAFTCPFGVFAYRRMPFGLCGAPATFQRCMLSIFSDMIEKNIEVFMDDFSVFGKSFDQ 1120
+ TAFTCP G F ++ +PFGL AP+ FQR M + + +K V++DD VF S
Sbjct: 349 KLTAFTCPQGHFQWKVVPFGLKQAPSIFQRHMQTALNG-ADKFCMVYVDDIIVFSNSELD 407
Query: 1121 CLFHLNAVLKRCTETNLILNWEKCHFMVTEGIVLGHKISSKGIEVDQAK-------IEVI 1173
H+ AVLK + +IL+ +K + + KI+ G+E+D+ +E I
Sbjct: 408 HYNHVYAVLKIVEKYGIILSKKKAN-------LFKEKINFLGLEIDKGTHCPQNHILENI 460
Query: 1174 EKLPPPM-NVKGVRSFLGHAGFYRRFIKDFSKIAKPLCNLLVKETEFDFDVECLNAFSLI 1232
K P + + K ++ FLG + +I ++I KPL L K+ +++ + I
Sbjct: 461 HKFPDRLEDKKHLQRFLGVLTYAETYIPKLAEIRKPLQVKLKKDVTWNWTQSDSDYVKKI 520
Query: 1233 KNKLVTAPIIIAPNWDLHFELMCDASDYAVGAVLGQRKNKFFHAI-YYASKVLNESQVNY 1291
K L + P + P + H + DASD G VL R I Y+S +++ NY
Sbjct: 521 KKNLGSFPKLYLPKPEDHLIIETDASDSFWGGVLKARALDGVELICRYSSGSFKQAEKNY 580
Query: 1292 STTEKELLAVIFALEKFRSYLIGSKVIVFTDHAALKYLL---TKGDSKP-RLLRWVLLLQ 1347
+ +KELLAV + KF +YL + V TD+ Y L KGDSK RL+RW
Sbjct: 581 HSNDKELLAVKQVITKFSAYLTPVRFTVRTDNKNFTYFLRINLKGDSKQGRLVRWQNWFS 640
Query: 1348 EFDLEIRDKKGVENVVADHLSR 1369
++ ++ +GV+NV+AD L+R
Sbjct: 641 KYQFDVEHLEGVKNVLADCLTR 662
>POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 659
Score = 170 bits (431), Expect = 3e-41
Identities = 145/522 (27%), Positives = 247/522 (46%), Gaps = 61/522 (11%)
Query: 864 LKELPSHLKYVFLSKDVSKPAIISSTLTPLEEEKLMRVLREN----EGALGWKISDLKGI 919
L+E +H+ + +SK + I EE L RV EN E + W + ++ I
Sbjct: 172 LEEGGNHVDEMLYEIQISKFSAI--------EEMLERVSSENPIDPEKSKQWMTATIELI 223
Query: 920 SPAYCMHRIHMEAEYKSVVQPQ-RRLNPTMKEVVKKEVLKLLDAGMIYPISDSSWVSPVH 978
P K+VV+ + +P+ +E +++ +LL+ +I P S S+ +SP
Sbjct: 224 DP-------------KTVVKVKPMSYSPSDREEFDRQIKELLELKVIKP-SKSTHMSPAF 269
Query: 979 VVPKKGGMTVVVNEKNELIPTRTVTGRRMCIDYRRLNTATRKDHFPLPFMDQMIERLAGQ 1038
+V + R +RM ++Y+ +N AT+ D LP D+++ + G+
Sbjct: 270 LVENEA--------------ERRRGKKRMVVNYKAMNKATKGDAHNLPNKDELLTLVRGK 315
Query: 1039 AFYCFLDGYSGYNQIAVAPEDQEKTAFTCPFGVFAYRRMPFGLCGAPATFQRCMLSIFSD 1098
Y D SG Q+ + E Q TAFTCP G + + +PFGL AP+ F + + S+
Sbjct: 316 KIYSSFDCKSGLWQVLLDKESQLLTAFTCPQGHYQWNVVPFGLKQAPSIFPKTYANSHSN 375
Query: 1099 MIEKNIEVFMDDFSVFGKS-FDQCLFHLNAVLKRCTETNLILNWEKCHFMVTEGIVLGHK 1157
K V++DD VF + + H+ +L+RC + +IL+ +K + K
Sbjct: 376 QYSKYCCVYVDDILVFSNTGRKEHYIHVLNILRRCEKLGIILSKKKAQ-------LFKEK 428
Query: 1158 ISSKGIEVDQAK-------IEVIEKLPPPM-NVKGVRSFLGHAGFYRRFIKDFSKIAKPL 1209
I+ G+E+DQ +E I K P + + K ++ FLG + +I + I KPL
Sbjct: 429 INFLGLEIDQGTHCPQNHILEHIHKFPDRIEDKKQLQRFLGILTYASDYIPKLASIRKPL 488
Query: 1210 CNLLVKETEFDFDVECLNAFSLIKNKLVTAPIIIAPNWDLHFELMCDASDYAVGAVLGQR 1269
+ L +++ + ++ + IK L + P + P + + DAS+ G +L
Sbjct: 489 QSKLKEDSTWTWNDTDSQYMAKIKKNLKSFPKLYHPEPNDKLVIETDASEEFWGGILKAI 548
Query: 1270 KNKFFHAIYYASKVLNESQVNYSTTEKELLAVIFALEKFRSYLIGSKVIVFTDHAALKYL 1329
N + YAS ++ NY + EKELLAVI ++KF YL S+ ++ TD+ +
Sbjct: 549 HNSHEYICRYASGSFKAAERNYHSNEKELLAVIRVIKKFSIYLTPSRFLIRTDNKNFTHF 608
Query: 1330 LT---KGDSKP-RLLRWVLLLQEFDLEIRDKKGVENVVADHL 1367
+ KGD K RL+RW + L ++D ++ G +NV AD L
Sbjct: 609 VNINLKGDRKQGRLVRWQMWLSQYDFDVEHIAGTKNVFADFL 650
>POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 155 bits (391), Expect = 1e-36
Identities = 133/457 (29%), Positives = 218/457 (47%), Gaps = 40/457 (8%)
Query: 935 KSVVQPQRRLNPTMKEVVKKEVLKLLDAGMIYPISDSSWVSPVHVVPKKGGMTVVVNEKN 994
K++ + +P +E K++ +LLD +I P S S ++P +V N
Sbjct: 245 KAIKVKPMKYSPMDREEFDKQIKELLDLKVIKP-SKSPHMAPAFLV------------NN 291
Query: 995 ELIPTRTVTGRRMCIDYRRLNTATRKDHFPLPFMDQMIERLAGQAFYCFLDGYSGYNQIA 1054
E R +RM ++Y+ +N AT D + LP D+++ + G+ + D SG+ Q+
Sbjct: 292 EAEKRRGK--KRMVVNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVL 349
Query: 1055 VAPEDQEKTAFTCPFGVFAYRRMPFGLCGAPATFQRCMLSIFSDMIEKNIEVFMDDFSVF 1114
+ E + TAFTCP G + + +PFGL AP+ FQR M F + K V++DD VF
Sbjct: 350 LDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFR-VFRKFCCVYVDDILVF 408
Query: 1115 GKSFDQCLFHLNAVLKRCTETNLILNWEKCHFMVTEGIVLGHKISSKGIEVDQAK----- 1169
+ + L H+ +L++C + +IL+ +K + KI+ G+E+D+
Sbjct: 409 SNNEEDHLLHVAMILQKCNQHGIILSKKKAQ-------LFKKKINFLGLEIDEGTHKPQG 461
Query: 1170 --IEVIEKLPPPM-NVKGVRSFLGHAGFYRRFIKDFSKIAKPLCNLLVKETEFDFDVECL 1226
+E I K P + + K ++ FLG + +I ++I KPL L + + + E
Sbjct: 462 HILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWRWTKEDT 521
Query: 1227 NAFSLIKNKLVTAPIIIAPNWDLHFELMCDASDYAVGAVLGQRK-NKFFHA---IYYASK 1282
+K L P + P + + DASD G +L K N+ + YAS
Sbjct: 522 LYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASG 581
Query: 1283 VLNESQVNYSTTEKELLAVIFALEKFRSYLIGSKVIVFTDHAALKYLLT---KGDSK-PR 1338
++ NY + +KE LAVI ++KF YL ++ TD+ K + KGDSK R
Sbjct: 582 SFKAAEKNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGR 641
Query: 1339 LLRWVLLLQEFDLEIRDKKGVENVVADHLSRLENNEV 1375
+RW L + ++ KG +N AD LSR E N+V
Sbjct: 642 NIRWQAWLSHYSFDVEHIKGTDNHFADFLSR-EFNKV 677
>POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 155 bits (391), Expect = 1e-36
Identities = 133/457 (29%), Positives = 218/457 (47%), Gaps = 40/457 (8%)
Query: 935 KSVVQPQRRLNPTMKEVVKKEVLKLLDAGMIYPISDSSWVSPVHVVPKKGGMTVVVNEKN 994
K++ + +P +E K++ +LLD +I P S S ++P +V N
Sbjct: 245 KAIKVKPMKYSPMDREEFDKQIKELLDLKVIKP-SKSPHMAPAFLV------------NN 291
Query: 995 ELIPTRTVTGRRMCIDYRRLNTATRKDHFPLPFMDQMIERLAGQAFYCFLDGYSGYNQIA 1054
E R +RM ++Y+ +N AT D + LP D+++ + G+ + D SG+ Q+
Sbjct: 292 EAEKRRGK--KRMVVNYKAMNKATIGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVL 349
Query: 1055 VAPEDQEKTAFTCPFGVFAYRRMPFGLCGAPATFQRCMLSIFSDMIEKNIEVFMDDFSVF 1114
+ E + TAFTCP G + + +PFGL AP+ FQR M F + K V++DD VF
Sbjct: 350 LDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFR-VFRKFCCVYVDDILVF 408
Query: 1115 GKSFDQCLFHLNAVLKRCTETNLILNWEKCHFMVTEGIVLGHKISSKGIEVDQAK----- 1169
+ + L H+ +L++C + +IL+ +K + KI+ G+E+D+
Sbjct: 409 SNNEEDHLLHVAMILQKCNQHGIILSKKKAQ-------LFKKKINFLGLEIDEGTHKPQG 461
Query: 1170 --IEVIEKLPPPM-NVKGVRSFLGHAGFYRRFIKDFSKIAKPLCNLLVKETEFDFDVECL 1226
+E I K P + + K ++ FLG + +I ++I KPL L + + + E
Sbjct: 462 HILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDT 521
Query: 1227 NAFSLIKNKLVTAPIIIAPNWDLHFELMCDASDYAVGAVLGQRK-NKFFHA---IYYASK 1282
+K L P + P + + DASD G +L K N+ + YAS
Sbjct: 522 LYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASG 581
Query: 1283 VLNESQVNYSTTEKELLAVIFALEKFRSYLIGSKVIVFTDHAALKYLLT---KGDSK-PR 1338
++ NY + +KE LAVI ++KF YL ++ TD+ K + KGDSK R
Sbjct: 582 SFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGR 641
Query: 1339 LLRWVLLLQEFDLEIRDKKGVENVVADHLSRLENNEV 1375
+RW L + ++ KG +N AD LSR E N+V
Sbjct: 642 NIRWQAWLSHYSFDVEHIKGTDNHFADFLSR-EFNKV 677
>POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 155 bits (391), Expect = 1e-36
Identities = 133/457 (29%), Positives = 218/457 (47%), Gaps = 40/457 (8%)
Query: 935 KSVVQPQRRLNPTMKEVVKKEVLKLLDAGMIYPISDSSWVSPVHVVPKKGGMTVVVNEKN 994
K++ + +P +E K++ +LLD +I P S S ++P +V N
Sbjct: 245 KAIKVKPMKYSPMDREEFDKQIKELLDLKVIKP-SKSPHMAPAFLV------------NN 291
Query: 995 ELIPTRTVTGRRMCIDYRRLNTATRKDHFPLPFMDQMIERLAGQAFYCFLDGYSGYNQIA 1054
E R +RM ++Y+ +N AT D + LP D+++ + G+ + D SG+ Q+
Sbjct: 292 EAEKRRGK--KRMVVNYKAMNKATIGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVL 349
Query: 1055 VAPEDQEKTAFTCPFGVFAYRRMPFGLCGAPATFQRCMLSIFSDMIEKNIEVFMDDFSVF 1114
+ E + TAFTCP G + + +PFGL AP+ FQR M F + K V++DD VF
Sbjct: 350 LDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFR-VFRKFCCVYVDDILVF 408
Query: 1115 GKSFDQCLFHLNAVLKRCTETNLILNWEKCHFMVTEGIVLGHKISSKGIEVDQAK----- 1169
+ + L H+ +L++C + +IL+ +K + KI+ G+E+D+
Sbjct: 409 SNNEEDHLLHVAMILQKCNQHGIILSKKKAQ-------LFKKKINFLGLEIDEGTHKPQG 461
Query: 1170 --IEVIEKLPPPM-NVKGVRSFLGHAGFYRRFIKDFSKIAKPLCNLLVKETEFDFDVECL 1226
+E I K P + + K ++ FLG + +I ++I KPL L + + + E
Sbjct: 462 HILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDT 521
Query: 1227 NAFSLIKNKLVTAPIIIAPNWDLHFELMCDASDYAVGAVLGQRK-NKFFHA---IYYASK 1282
+K L P + P + + DASD G +L K N+ + YAS
Sbjct: 522 LYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASG 581
Query: 1283 VLNESQVNYSTTEKELLAVIFALEKFRSYLIGSKVIVFTDHAALKYLLT---KGDSK-PR 1338
++ NY + +KE LAVI ++KF YL ++ TD+ K + KGDSK R
Sbjct: 582 SFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGR 641
Query: 1339 LLRWVLLLQEFDLEIRDKKGVENVVADHLSRLENNEV 1375
+RW L + ++ KG +N AD LSR E N+V
Sbjct: 642 NIRWQAWLSHYSFDVEHIKGTDNHFADFLSR-EFNKV 677
>POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 674
Score = 152 bits (384), Expect = 8e-36
Identities = 129/451 (28%), Positives = 213/451 (46%), Gaps = 39/451 (8%)
Query: 935 KSVVQPQRRLNPTMKEVVKKEVLKLLDAGMIYPISDSSWVSPVHVVPKKGGMTVVVNEKN 994
K++ + +P +E K++ +LLD +I P S S ++P +V N
Sbjct: 240 KAIKVKPMKYSPMDREEFDKQIKELLDLKVIKP-SKSPHMAPAFLV------------NN 286
Query: 995 ELIPTRTVTGRRMCIDYRRLNTATRKDHFPLPFMDQMIERLAGQAFYCFLDGYSGYNQIA 1054
E R +RM ++Y+ +N AT D + P D+++ + G+ + D SG+ Q+
Sbjct: 287 EAEKRRGK--KRMVVNYKAMNKATVGDAYNPPNKDELLTLIRGKKIFSSFDCKSGFWQVL 344
Query: 1055 VAPEDQEKTAFTCPFGVFAYRRMPFGLCGAPATFQRCMLSIFSDMIEKNIEVFMDDFSVF 1114
+ E + TAFTCP G + + +PFGL AP+ FQR M F + K V++DD VF
Sbjct: 345 LDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFR-VFRKFCCVYVDDILVF 403
Query: 1115 GKSFDQCLFHLNAVLKRCTETNLILNWEKCHFMVTEGIVLGHKISSKGIEVDQAK----- 1169
+ + L H+ +L++C + +IL+ +K + KI+ G+E+D+
Sbjct: 404 SNNEEDHLLHVAMILQKCNQHGIILSKKKAQ-------LFKKKINFLGLEIDEGTHKPQG 456
Query: 1170 --IEVIEKLPPPM-NVKGVRSFLGHAGFYRRFIKDFSKIAKPLCNLLVKETEFDFDVECL 1226
+E I K P + + K ++ FLG + +I ++I KPL L + + + E
Sbjct: 457 HILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDT 516
Query: 1227 NAFSLIKNKLVTAPIIIAPNWDLHFELMCDASDYAVGAVLGQRK-NKFFHA---IYYASK 1282
+K L P + P + + DASD G +L K N+ + YAS
Sbjct: 517 LYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASG 576
Query: 1283 VLNESQVNYSTTEKELLAVIFALEKFRSYLIGSKVIVFTDHAALKYLLT---KGDSK-PR 1338
++ NY + +KE LAVI ++KF YL ++ TD+ K + KGDSK R
Sbjct: 577 SFKAAEKNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGR 636
Query: 1339 LLRWVLLLQEFDLEIRDKKGVENVVADHLSR 1369
+RW L + ++ KG +N AD LSR
Sbjct: 637 NIRWQAWLSHYSFDVEHIKGTDNHFADFLSR 667
>POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 680
Score = 151 bits (381), Expect = 2e-35
Identities = 130/457 (28%), Positives = 218/457 (47%), Gaps = 40/457 (8%)
Query: 935 KSVVQPQRRLNPTMKEVVKKEVLKLLDAGMIYPISDSSWVSPVHVVPKKGGMTVVVNEKN 994
K++ + +P +E K++ +LLD +I P S S ++P +V N
Sbjct: 246 KAIKVKPMKYSPMDREEFDKQIKELLDLKVIKP-SKSPHMAPAFLV------------NN 292
Query: 995 ELIPTRTVTGRRMCIDYRRLNTATRKDHFPLPFMDQMIERLAGQAFYCFLDGYSGYNQIA 1054
E R +RM ++Y+ +N AT D + LP D+++ + G+ + D SG+ Q+
Sbjct: 293 EAENGRG--NKRMVVNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVL 350
Query: 1055 VAPEDQEKTAFTCPFGVFAYRRMPFGLCGAPATFQRCMLSIFSDMIEKNIEVFMDDFSVF 1114
+ E + TAFTCP G + + +PFGL AP+ FQR M F + K V++DD VF
Sbjct: 351 LDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFR-VFRKFCCVYVDDIVVF 409
Query: 1115 GKSFDQCLFHLNAVLKRCTETNLILNWEKCHFMVTEGIVLGHKISSKGIEVDQAK----- 1169
+ + L H+ +L++C + +IL+ +K + KI+ G+E+D+
Sbjct: 410 SNNEEDHLLHVAMILQKCNQHGIILSKKKAQ-------LFKKKINFLGLEIDEGTHKPQG 462
Query: 1170 --IEVIEKLPPPM-NVKGVRSFLGHAGFYRRFIKDFSKIAKPLCNLLVKETEFDFDVECL 1226
+E I K P + + K ++ FLG + +I + +++ +PL L + + + E
Sbjct: 463 HILEHINKFPDTLEDKKQLQRFLGILTYASDYIPNLAQMRQPLQAKLKENVPWKWTKEDT 522
Query: 1227 NAFSLIKNKLVTAPIIIAPNWDLHFELMCDASDYAVGAVLGQRK-NKFFHA---IYYASK 1282
+K L P + P + + DASD G +L K N+ + Y S
Sbjct: 523 LYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYRSG 582
Query: 1283 VLNESQVNYSTTEKELLAVIFALEKFRSYLIGSKVIVFTDHAALKYLLT---KGDSK-PR 1338
++ NY + +KE LAVI ++KF YL ++ TD+ K + KGDSK R
Sbjct: 583 SFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGR 642
Query: 1339 LLRWVLLLQEFDLEIRDKKGVENVVADHLSRLENNEV 1375
+RW L + ++ KG +N AD LSR E N+V
Sbjct: 643 NIRWQAWLSHYSFDVEHIKGTDNHFADFLSR-EFNKV 678
>POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;
Protease (EC 3.4.23.-); Reverse transcriptase (EC
2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
Length = 1886
Score = 145 bits (366), Expect = 1e-33
Identities = 146/537 (27%), Positives = 235/537 (43%), Gaps = 59/537 (10%)
Query: 872 KYVFLSKDVSKPAIISSTLTP-----LEEEKLMRVLRENEGALGWKISDLKGISPAYCMH 926
+Y+ ++ V P+ + L+E K M+ + EN WK + +K C
Sbjct: 1341 EYLNIAASVETPSFLDQEFARKNKDLLKEMKEMKYIGENPMEF-WKNNKIK------CKL 1393
Query: 927 RIHMEAEYKSVVQPQRRLNPTMKEVVKKEVLKLLDAGMIYPISDSSWVSPVHVVPKKGGM 986
I + + K + +P + + P +E + +++ LL +I P S+S K
Sbjct: 1394 NI-INPDIKIMGRPIKHVTPGDEEAMTRQINLLLQMKVIRP-SES----------KHRST 1441
Query: 987 TVVVNEKNELIPTRTVTGR------RMCIDYRRLNTATRKDHFPLPFMDQMIERLAGQAF 1040
+V E+ P +TG+ RM +Y+ LN T D + LP ++ +I ++
Sbjct: 1442 AFIVRSGTEIDP---ITGKEKKGKERMVFNYKLLNENTESDQYSLPGINTIISKVGRSKI 1498
Query: 1041 YCFLDGYSGYNQIAVAPEDQEKTAFTCPFGVFAYRRMPFGLCGAPATFQRCMLSIFSDMI 1100
Y D SG+ Q+A+ E TAF ++ + MPFGL APA FQR M ++F
Sbjct: 1499 YSKFDLKSGFWQVAMEEESVPWTAFLAGNKLYEWLVMPFGLKNAPAIFQRKMDNVFKG-T 1557
Query: 1101 EKNIEVFMDDFSVFGKSFDQCLFHLNAVLKRCTETNLILNWEKCHFMVTEGIVLGHKISS 1160
EK I V++DD VF ++ +Q HL +L+ C E LIL+ K E LG +
Sbjct: 1558 EKFIAVYIDDILVFSETAEQHSQHLYTMLQLCKENGLILSPTKMKIGTPEIDFLGASLGC 1617
Query: 1161 KGIEVDQAKIEVI-----EKLPPPMNVKGVRSFLGHAGFYRRFIKDFSKIAKPLCNLLVK 1215
I++ I I EKL P +G+RS+LG + R +I+D K+ +PL +
Sbjct: 1618 TKIKLQPHIISKICDFSDEKLATP---EGMRSWLGILSYARNYIQDIGKLVQPLRQKMAP 1674
Query: 1216 ETEFDFDVECLNAFSLIKNKLVTAPIIIAPNWDLHFELMCDASDYAVGAVLGQRKNKF-- 1273
+ + E IK K+ P + P D + D GAV + +K
Sbjct: 1675 TGDKRMNPETWKMVRQIKEKVKNLPDLQLPPKDSFIIIETDGCMTGWGAVCKWKMSKHDP 1734
Query: 1274 ---FHAIYYASKVLNESQVNYSTTEKELLAVIFALEKFRSYLIGSKVIVFTD--HAALKY 1328
YAS N + ST + E+ A I L+KF+ Y + K ++ A +K+
Sbjct: 1735 RSTERICAYASGSFNPIK---STIDAEIQAAIHGLDKFKIYYLDKKELIIRSDCEAIIKF 1791
Query: 1329 LLTKGDSKPRLLRWVLLLQEF------DLEIRDKKGVENVVADHLSRLENNEVTKKE 1379
++KP +RW L +F + G N +AD LSR+ N V K +
Sbjct: 1792 YNKTNENKPSRVRW-LTFSDFLTGLGITVTFEHIDGKHNGLADALSRMINFIVEKND 1847
>POL_SFV1 (P23074) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1161
Score = 129 bits (325), Expect = 6e-29
Identities = 100/359 (27%), Positives = 161/359 (43%), Gaps = 30/359 (8%)
Query: 1465 SSAYGGHHSGERTAAKVLQSGFWWPTLFKDCHDFVRRCDNCQRTGSISKRNEMPLTGIIE 1524
S+A+ H+G + S +WWP L KD +R+C C T + + + L +
Sbjct: 821 STAHNIAHTGRDATFLKVSSKYWWPNLRKDVVKSIRQCKQCLVTNATNLTSPPILRPVKP 880
Query: 1525 VEPFDCWGIDFMGPFPPSSSYLHILVCVDYVTKWVEAIPCVANDSKTVVNFLRKNIFTRF 1584
++PFD + ID++GP PPS+ YLH+LV VD +T +V P A + V L N+ T
Sbjct: 881 LKPFDKFYIDYIGPLPPSNGYLHVLVVVDSMTGFVWLYPTKAPSTSATVKAL--NMLTSI 938
Query: 1585 GTPRVLISDGGKHFCNNFLETVLKKYNIKHKVATPYHPQTSGQVEVSNRQLKQILEKTVA 1644
P+VL SD G F ++ K+ I+ + +TPYHPQ+SG+VE N +K++L K +
Sbjct: 939 AIPKVLHSDQGAAFTSSTFADWAKEKGIQLEFSTPYHPQSSGKVERKNSDIKRLLTKLLI 998
Query: 1645 SSRKDWSRKLDDALWAYRIAFKTHLGLSPYQLVFGKACHLPVELEHKAYWAIKALNFDQT 1704
W L A ++ +P+QL+FG + P N D
Sbjct: 999 GRPAKWYDLLPVVQLALNNSYSPSSKYTPHQLLFGVDSNTPF------------ANSDTL 1046
Query: 1705 LAGKKRLLKLNELEEMRLGAYENAVIYKERTKRYHDKGLVRREFYVGQLVLLFNSRLKLF 1764
++ L L L+E+R ++ + R VGQLV +R
Sbjct: 1047 DLSREEELSL--LQEIRSSLHQPT--SPPASSRSWSPS-------VGQLVQERVAR---- 1091
Query: 1765 PGKLKSKWSGP-FMIESISPYGAVELSKPGEPGTFKVNAQRIKPYLGGELPTNKGGVVL 1822
P L+ +W P ++E ++P + L G T V+ ++ Y + G + L
Sbjct: 1092 PASLRPRWHKPTAILEVVNPRTVIILDHLGNRRTVSVDNLKLTAYQDNGTSNDSGTMAL 1150
Score = 83.6 bits (205), Expect = 5e-15
Identities = 91/454 (20%), Positives = 189/454 (41%), Gaps = 50/454 (11%)
Query: 939 QPQRRLNPTMKEVVKKEVLKLLDAGMIYPISDSSWVSPVHVVPKKGGMTVVVNEKNELIP 998
Q Q +NP K ++ + LL G++ +S+ +PV+ VPK G
Sbjct: 177 QKQYPINPKAKPSIQIVIDDLLKQGVLIQ-QNSTMNTPVYPVPKPDGKW----------- 224
Query: 999 TRTVTGRRMCIDYRRLNTATRKDHFPLPFMDQMIERLAG-------QAFYCFLDGYSGYN 1051
RM +DYR +N +P + + AG + LD +G+
Sbjct: 225 -------RMVLDYREVNKT-------IPLIAAQNQHSAGILSSIYRGKYKTTLDLTNGFW 270
Query: 1052 QIAVAPEDQEKTAFTCPFGVFAYRRMPFGLCGAPATFQRCMLSIFSDMIEKNIEVFMDDF 1111
+ PE TAFT + + R+P G +PA F ++ + ++ N++ ++DD
Sbjct: 271 AHPITPESYWLTAFTWQGKQYCWTRLPQGFLNSPALFTADVVDLLKEI--PNVQAYVDDI 328
Query: 1112 SVFGKSFDQCLFHLNAVLKRCTETNLILNWEKCHFMVTEGIVLGHKISSKGIEVDQAKIE 1171
+ + L L + +++ +K E LG I+ +G + +
Sbjct: 329 YISHDDPQEHLEQLEKIFSILLNAGYVVSLKKSEIAQREVEFLGFNITKEGRGLTDTFKQ 388
Query: 1172 VIEKLPPPMNVKGVRSFLGHAGFYRRFIKDFSKIAKPLCNLLVKETE--FDFDVECLNAF 1229
+ + PP ++K ++S LG F R FI ++S++ KPL ++ + + N
Sbjct: 389 KLLNITPPKDLKQLQSILGLLNFARNFIPNYSELVKPLYTIVANANGKFISWTEDNSNQL 448
Query: 1230 SLIKNKLVTAPIIIAPNWDLHFELMCDASDYAVGAVLGQRKNKFFHAIYYASKVLNESQV 1289
I + L A + N + + ++S A +K I Y + + ++++
Sbjct: 449 QHIISVLNQADNLEERNPETRLIIKVNSSPSAGYIRYYNEGSK--RPIMYVNYIFSKAEA 506
Query: 1290 NYSTTEKELLAVIFALEKFRSYLIGSKVIVFTDHAALKYL----LTKGDSKP-RLLRWVL 1344
++ TEK L + L K +G +++V++ ++ + L + + P R + W+
Sbjct: 507 KFTQTEKLLTTMHKGLIKAMDLAMGQEILVYSPIVSMTKIQRTPLPERKALPVRWITWMT 566
Query: 1345 LLQE------FDLEIRDKKGVENVVADHLSRLEN 1372
L++ +D + + + + NV D +++ ++
Sbjct: 567 YLEDPRIQFHYDKSLPELQQIPNVTEDVIAKTKH 600
>POL_SFV3L (P27401) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1157
Score = 128 bits (322), Expect = 1e-28
Identities = 106/364 (29%), Positives = 156/364 (42%), Gaps = 34/364 (9%)
Query: 1459 IVWHCHSSAYGGHHSGERTAAKVLQSGFWWPTLFKDCHDFVRRCDNCQRTGSISKRNEMP 1518
I+ H+ A+ G S T KV S +WWP L KD +R+C C T + +
Sbjct: 821 IILQAHNIAHTGRDS---TFLKV-SSKYWWPNLRKDVVKVIRQCKQCLVTNAATLAAPPI 876
Query: 1519 LTGIIEVEPFDCWGIDFMGPFPPSSSYLHILVCVDYVTKWVEAIPCVANDSKTVVNFLRK 1578
L V+PFD + ID++GP PPS+ YLH+LV VD +T +V P A + V L
Sbjct: 877 LRPERPVKPFDKFFIDYIGPLPPSNGYLHVLVVVDSMTGFVWLYPTKAPSTSATVKAL-- 934
Query: 1579 NIFTRFGTPRVLISDGGKHFCNNFLETVLKKYNIKHKVATPYHPQTSGQVEVSNRQLKQI 1638
N+ T P+V+ SD G F + K I+ + +TPYHPQ+SG+VE N +K++
Sbjct: 935 NMLTSIAVPKVIHSDQGAAFTSATFADWAKNKGIQLEFSTPYHPQSSGKVERKNSDIKRL 994
Query: 1639 LEKTVASSRKDWSRKLDDALWAYRIAFKTHLGLSPYQLVFGKACHLPVELEHKAYWAIKA 1698
L K + W L A ++ +P+QL+FG + P
Sbjct: 995 LTKLLVGRPAKWYDLLPVVQLALNNSYSPSSKYTPHQLLFGIDSNTPF------------ 1042
Query: 1699 LNFDQTLAGKKRLLKLNELEEMRLGAYENAVIYKERTKRYHDKGLVRREFYVGQLVLLFN 1758
L L+ EE+ L + +Y T + VGQLV
Sbjct: 1043 --------ANSDTLDLSREEELSLLQEIRSSLYLPSTP---PASIRAWSPSVGQLVQERV 1091
Query: 1759 SRLKLFPGKLKSKWSGPF-MIESISPYGAVELSKPGEPGTFKVNAQRIKPYLGGELPTNK 1817
+R P L+ +W P ++E I+P V L G T V+ ++ Y P
Sbjct: 1092 AR----PASLRPRWHKPTPVLEVINPRAVVILDHLGNRRTVSVDNLKLTAYQKDGTPNES 1147
Query: 1818 GGVV 1821
VV
Sbjct: 1148 AAVV 1151
Score = 88.2 bits (217), Expect = 2e-16
Identities = 100/455 (21%), Positives = 196/455 (42%), Gaps = 52/455 (11%)
Query: 939 QPQRRLNPTMKEVVKKEVLKLLDAGMIYPISDSSWVSPVHVVPKKGGMTVVVNEKNELIP 998
Q Q +NP K ++ + LL G++ +S +PV+ VPK G
Sbjct: 179 QKQYPINPKAKASIQTVINDLLKQGVLIQ-QNSIMNTPVYPVPKPDGKW----------- 226
Query: 999 TRTVTGRRMCIDYRRLNTATRKDHFPLPFMDQMIERLAGQAFYCF-------LDGYSGYN 1051
RM +DYR +N +P + + AG F LD +G+
Sbjct: 227 -------RMVLDYREVNKT-------IPLIAAQNQHSAGILSSIFRGKYKTTLDLSNGFW 272
Query: 1052 QIAVAPEDQEKTAFTCPFGVFAYRRMPFGLCGAPATFQRCMLSIFSDMIEKNIEVFMDDF 1111
++ PE TAFT + + R+P G +PA F ++ + ++ N++V++DD
Sbjct: 273 AHSITPESYWLTAFTWLGQQYCWTRLPQGFLNSPALFTADVVDLLKEV--PNVQVYVDDI 330
Query: 1112 SVFGKSFDQCLFHLNAVLKRCTETNLILNWEKCHFMVTEGIVLGHKISSKGIEVDQAKIE 1171
+ + L L V +++ +K E LG I+ +G + + +
Sbjct: 331 YISHDDPREHLEQLEKVFSLLLNAGYVVSLKKSEIAQHEVEFLGFNITKEGRGLTETFKQ 390
Query: 1172 VIEKLPPPMNVKGVRSFLGHAGFYRRFIKDFSKIAKPLCNLLVKET--EFDFDVECLNAF 1229
+ + PP ++K ++S LG F R FI +FS++ KPL N++ + +
Sbjct: 391 KLLNITPPRDLKQLQSILGLLNFARNFIPNFSELVKPLYNIIATANGKYITWTTDNSQQL 450
Query: 1230 SLIKNKLVTAPIIIAPNWDLHFELMCDASDYAVGAVLGQRKNKFF-HAIYYASKVLNESQ 1288
I + L +A + N ++ + + S A G + + N+F I Y + V +++
Sbjct: 451 QNIISMLNSAENLEERNPEVRLIMKVNTSPSA-GYI--RFYNEFAKRPIMYLNYVYTKAE 507
Query: 1289 VNYSTTEKELLAVIFALEKFRSYLIGSKVIVFTDHAAL----KYLLTKGDSKP-RLLRWV 1343
V ++ TEK L + L K +G +++V++ ++ K L + + P R + W+
Sbjct: 508 VKFTNTEKLLTTIHKGLIKALDLGMGQEILVYSPIVSMTKIQKTPLPERKALPIRWITWM 567
Query: 1344 LLLQE------FDLEIRDKKGVENVVADHLSRLEN 1372
L++ +D + + + V V D ++++++
Sbjct: 568 SYLEDPRIQFHYDKTLPELQQVPTVTDDIIAKIKH 602
>POL_RTBVP (P27502) Polyprotein (P194 protein) [Contains: Coat
protein; Protease (EC 3.4.23.-); Reverse transcriptase
(EC 2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
Length = 1675
Score = 119 bits (297), Expect = 1e-25
Identities = 108/436 (24%), Positives = 190/436 (42%), Gaps = 28/436 (6%)
Query: 946 PTMKEVVKKEVLKLLDAGMIYPISDSSWVSPVHVVPKKGGMTVVVNEKNELIPTRTVTGR 1005
P KEV +K++ +LLD +I + + +V +E + +
Sbjct: 1193 PADKEVFEKQIKELLDNKLIKKADPTC---------RHRTAAFIVRNHSEEVAQKP---- 1239
Query: 1006 RMCIDYRRLNTATRKDHFPLPFMDQMIERLAGQAFYCFLDGYSGYNQIAVAPEDQEKTAF 1065
R+ +Y+RLN D F +P MI + + D +G++ + + + ++ T F
Sbjct: 1240 RIVYNYKRLNDNMHTDPFNIPHKISMINLIQKANIFSKFDLKAGFHHMKLKDDFKDWTTF 1299
Query: 1066 TCPFGVFAYRRMPFGLCGAPATFQRCMLSIFSDMIEKNIEVFMDDFSVFGKSFDQCLFHL 1125
TC G++ + PFG+ AP FQR M F D+ K +++DD + + + + HL
Sbjct: 1300 TCSEGLYTWNVCPFGIANAPCAFQRFMQESFGDL--KFALLYIDDILIASNNEKEHIEHL 1357
Query: 1126 NAVLKRCTETNLILNWEKCHFMVTEGIVLGHKISSKGIEVDQAKIEVIEKLPPPM--NVK 1183
R E +L+ +K + E LG +I I + ++ I+K +K
Sbjct: 1358 KIFFNRVKEVGCVLSKKKSKMFLKEVEYLGVEIKEGKISLQPHIVDKIKKFDKNKLNTLK 1417
Query: 1184 GVRSFLGHAGFYRRFIKDFSKIAKPLCNLLVKETEFDFDVECLNAFSLIKNKLVTAPIII 1243
G++++LG + R +IKD SK+ PL K + F+ E N I+ ++ +
Sbjct: 1418 GLQAYLGLLNYARGYIKDLSKLVGPLYKKTGKNGQRIFNKEDWNIIFKIEREVSKIKPLE 1477
Query: 1244 APNWDLHFELMCDASDYAVGAVLGQRKNKFF-----HAIYYASKVLNESQVNYSTTEKEL 1298
P + + DAS+ GAVL + +K+ YAS E + +++ + E+
Sbjct: 1478 RPKETDYIIIETDASEEGWGAVLVCKPDKYSGKDTEKIAGYASGNFGEKK-TWTSLDYEI 1536
Query: 1299 LAVIFALEKFRSYLIGSKVIVFTDHAALKYLLTKGDSKPRLLRWV-----LLLQEFDLEI 1353
A+ AL KF+ YL I A +K + T+ K RW+ LL +
Sbjct: 1537 EAINEALNKFQIYLDKDFTIRTDCEAIVKGIKTEDYKKRSKTRWIKLRDNLLKDGYKPTF 1596
Query: 1354 RDKKGVENVVADHLSR 1369
KG +N + + LSR
Sbjct: 1597 EHIKGNKNFLPNFLSR 1612
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.318 0.136 0.404
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 224,574,501
Number of Sequences: 164201
Number of extensions: 10279482
Number of successful extensions: 35363
Number of sequences better than 10.0: 354
Number of HSP's better than 10.0 without gapping: 140
Number of HSP's successfully gapped in prelim test: 228
Number of HSP's that attempted gapping in prelim test: 33398
Number of HSP's gapped (non-prelim): 1513
length of query: 1825
length of database: 59,974,054
effective HSP length: 125
effective length of query: 1700
effective length of database: 39,448,929
effective search space: 67063179300
effective search space used: 67063179300
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 73 (32.7 bits)