
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC146971.7 - phase: 0 /pseudo
(2360 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E Value
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran... 118 2e-25
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III 105 2e-21
RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protei... 105 2e-21
RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protei... 105 2e-21
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei... 105 2e-21
POL_MLVMO (P03355) Pol polyprotein [Contains: Protease (EC 3.4.2... 94 4e-18
POL_MLVFF (P26809) Pol polyprotein [Contains: Protease (EC 3.4.2... 94 4e-18
POL_MLVFP (P26808) Pol polyprotein [Contains: Protease (EC 3.4.2... 93 8e-18
POL_MLVF5 (P26810) Pol polyprotein [Contains: Protease (EC 3.4.2... 93 1e-17
POL_MLVAV (P03356) Pol polyprotein [Contains: Protease (EC 3.4.2... 91 5e-17
YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II 88 2e-16
POL_MLVCB (P08361) Pol polyprotein [Contains: Reverse transcript... 88 2e-16
POL_MLVAK (P03357) Pol polyprotein [Contains: Reverse transcript... 88 2e-16
POL_SFV1 (P23074) Pol polyprotein [Contains: Protease (EC 3.4.23... 88 3e-16
POL_MLVRK (P31795) Pol polyprotein [Contains: Protease (EC 3.4.2... 87 7e-16
POL_MLVRD (P11227) Pol polyprotein [Contains: Protease (EC 3.4.2... 87 7e-16
POL3_MOUSE (P11367) Retrovirus-related Pol polyprotein (Endonucl... 86 1e-15
POL_SFV3L (P27401) Pol polyprotein [Contains: Protease (EC 3.4.2... 83 8e-15
POL_AVIRE (P03360) Pol polyprotein [Contains: Reverse transcript... 82 1e-14
POL_FENV1 (P31792) Pol polyprotein [Contains: Reverse transcript... 80 7e-14
>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
transposon 412 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1237
Score = 118 bits (296), Expect = 2e-25
Identities = 107/427 (25%), Positives = 189/427 (44%), Gaps = 36/427 (8%)
Query: 1950 DIKVFLQTREYPPGASNKD-------KKTLRRLSSNFFLNGDILYKRNFDTVLLRCV--- 1999
D+ FLQ E G + KK +S + F N +N LL V
Sbjct: 828 DLDQFLQRLELQAGIYDISQIKMAPWKKIFEHVSIDKFKNMGNKILKNLKVALLNPVTQI 887
Query: 2000 -DKYEADLLIHEIHEGSFGIHPNGHTMAKKIL---RAGYYWMTMESDCYKHTRKCHKCQI 2055
++ E + ++ +H+ GHT K L + YYW M ++ RKC KCQ
Sbjct: 888 NNEKEKEAILSTLHDDPI---QGGHTGITKTLAKVKRHYYWKNMSKYIKEYVRKCQKCQK 944
Query: 2056 YADKIHMPPTTLNLLSSP-WPFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASY 2114
H T + + +P F +D IG + PK+ NG+ + + I TK++ A
Sbjct: 945 AKTTKHTK-TPMTITETPEHAFDRVVVDTIGPL-PKSENGNEYAVTLICDLTKYLVAIPI 1002
Query: 2115 ANVTKQVVVKFIKNHIICRYGIPNRIITDNGTNLNNKMMKELCDDFKIEHHNSSPYRPQM 2174
AN + + V K I I +YG ITD GT N ++ +LC KI++ S+ + Q
Sbjct: 1003 ANKSAKTVAKAIFESFILKYGPMKTFITDMGTEYKNSIITDLCKYLKIKNITSTAHHHQT 1062
Query: 2175 NGAVEAANKNIKRIVQKMVVTYK-DWHEMLPFALHGYRTSVRTSIGATPFSLVYGMEAVL 2233
G VE +++ + ++ + T K DW L + ++ + T+ P+ LV+G + L
Sbjct: 1063 VGVVERSHRTLNEYIRSYISTDKTDWDVWLQYFVYCFNTTQSMVHNYCPYELVFGRTSNL 1122
Query: 2234 PVEV-EIPSLRVLMEVDLSEAEWVQNRYDQLNLIEEKRMAALCHGQLYQKRMKQAFDKKV 2292
P ++ S+ + +D ++ + +L + + L + ++++ K+ +D KV
Sbjct: 1123 PKHFNKLHSIEPIYNID----DYAKESKYRLEVAYARARKLL---EAHKEKNKENYDLKV 1175
Query: 2293 RPREFKEGDLVLKKIFSFQPDSRGKWAPNYEGPYVVKKAFSGGAMTLQTMDGEELPRPVN 2352
+ E + GD VL + + K Y GPY ++ +TL T ++ + V+
Sbjct: 1176 KDIELEVGDKVL-----LRNEVGHKLDFKYTGPYKIESIGDNNNITLLTNKNKK--QIVH 1228
Query: 2353 TDTVKKY 2359
D +KK+
Sbjct: 1229 KDRLKKF 1235
Score = 39.3 bits (90), Expect = 0.13
Identities = 26/103 (25%), Positives = 48/103 (46%), Gaps = 1/103 (0%)
Query: 1653 IYYLSKKFTECESRYSMLEKTCCALAWAAKRLRHYMINHTTWLISKMDPIKYIFEKPALI 1712
+ Y S+ FT+ ES S E+ A+ WA R Y+ + + P+ Y+F
Sbjct: 642 VAYASRAFTKGESNKSTTEQELAAIHWAIIHFRPYIYGKHFTVKTDHRPLTYLFSMVNPS 701
Query: 1713 GRIARWQMLLSEYDIEYRSQKAIKGSILADHLAHQPLEDYRPI 1755
++ R ++ L EY+ K K + +AD L+ +++ + I
Sbjct: 702 SKLTRIRLELEEYNFTVEYLKG-KDNHVADALSRITIKELKDI 743
>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
Length = 2186
Score = 105 bits (261), Expect = 2e-21
Identities = 87/330 (26%), Positives = 149/330 (44%), Gaps = 20/330 (6%)
Query: 2007 LIHEIHEGSFGIHPNGHTMAKKILRA---GYYWMTMESDCYKHTRKCHKCQIYADKIHMP 2063
L+ E+HEG GH KK+ R +YW M R C KC D +
Sbjct: 1468 LLKELHEGMLA----GHFGIKKMWRMVHRKFYWPQMRVCVENCVRTCAKCLCANDHSKLT 1523
Query: 2064 PTTLNLLSSPWPFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYANVTKQVVV 2123
++L +P + D++ + G+R+IL ID FTK+ A + + V+
Sbjct: 1524 -SSLTPYRMTFPLEIVACDLMD--VGLSVQGNRYILTIIDLFTKYGTAVPIPDKKAETVL 1580
Query: 2124 K-FIKNHIICRYGIPNRIITDNGTNLNNKMMKELCDDFKIEHHNSSPYRPQMNGAVEAAN 2182
K F++ I IP +++TD G N + + KIEH + Y + NGAVE N
Sbjct: 1581 KAFVERWAIGEGRIPLKLLTDQGKEFVNGLFAQFTHMLKIEHITTKGYNSRANGAVERFN 1640
Query: 2183 KNIKRIVQKMVVTYKDWHEMLPFALHGYRTSVRTSIGATPFSLVYGMEAVLPVEVEIPSL 2242
K I I++K +W + + +A++ Y V + G TP L++G + + P+E+
Sbjct: 1641 KTIMHIMKKKTAVPMEWDDQVVYAVYAYNNCVHENTGETPMFLMHGRDVMGPLEMSGEDA 1700
Query: 2243 RVLMEVDLSEAEWVQNRYDQLNLIEEKRMAALCHGQLYQKRMKQAFDKKVRPREFK---E 2299
+ D+ E + + + L++ +++A H Q+ K FD+K ++ +
Sbjct: 1701 VGINYADMDEYKHLLTQ----ELLKVQKIAKE-HAMREQESYKSLFDQKYASKKHRFPQP 1755
Query: 2300 GDLVLKKIFSFQPDSR-GKWAPNYEGPYVV 2328
G VL +I S + ++ K + GPY V
Sbjct: 1756 GSRVLLEIPSEKLGAQCPKLVNKWSGPYRV 1785
Score = 43.1 bits (100), Expect = 0.009
Identities = 27/110 (24%), Positives = 55/110 (49%), Gaps = 2/110 (1%)
Query: 1636 MGCVLGQQDETGRKEHAIYYLSKKFTECESRYSMLEKTCCALAWAAKRLRHYMINHTTWL 1695
+G VL Q+ G+ +H I + SK + E+RY + + A+ +A +R + + +
Sbjct: 1255 IGAVLAQEGPDGQ-QHPIAFASKALSPAETRYHITDLEALAMMFALRRFKTIIYGTAITV 1313
Query: 1696 ISKMDPIKYIFEKPALIGRIARWQMLLSEYDIEYRSQKAIKGSILADHLA 1745
+ P+ + + L R+ RW + + E+D++ A K + +AD L+
Sbjct: 1314 FTDHKPLISLLKGSPLADRLWRWSIEILEFDVKI-VYLAGKANAVADALS 1362
>RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protein
type 3
Length = 1333
Score = 105 bits (261), Expect = 2e-21
Identities = 105/447 (23%), Positives = 193/447 (42%), Gaps = 38/447 (8%)
Query: 1891 ELHHIPRDENQMADALATLSSMIKVNHHNDVPLISVKFLDRPAYVFAAEVVFDDKPWFHD 1950
E+++ P N +ADAL+ + V+ +P K + + F ++ D
Sbjct: 814 EINYRPGSANHIADALSRI-----VDETEPIP----KDSEDNSINFVNQISITDDFKNQV 864
Query: 1951 IKVFLQTREYPPGASNKDKKTLRRLSSNFFLNGDILYKRNFDTVLLRCVDKYEADLLIHE 2010
+ + + +N+DK R+ N L +L D +LL D +I +
Sbjct: 865 VTEYTNDTKLLNLLNNEDK----RVEENIQLKDGLLINSK-DQILLPN-DTQLTRTIIKK 918
Query: 2011 IHEGSFGIHPNGHTMAKKILRAGYYWMTMESDCYKHTRKCHKCQIYADKIHMPPTTLNLL 2070
HE IHP + ILR + W + ++ + CH CQI + H P L +
Sbjct: 919 YHEEGKLIHPGIELLTNIILRR-FTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPI 977
Query: 2071 S-SPWPFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKW-VEAASYANVTKQVVVKFIKN 2128
S P+ +D I + S+G+ + V +D F+K + ++T + +
Sbjct: 978 PPSERPWESLSMDFITALPE--SSGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQ 1035
Query: 2129 HIICRYGIPNRIITDNGTNLNNKMMKELCDDFKIEHHNSSPYRPQMNGAVEAANKNIKRI 2188
+I +G P II DN ++ K+ + S PYRPQ +G E N+ ++++
Sbjct: 1036 RVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKL 1095
Query: 2189 VQKMVVTYKD-WHEMLPFALHGYRTSVRTSIGATPFSLVYGMEAVLPVEVEIPSLRVLME 2247
++ + T+ + W + + Y ++ ++ TPF +V+ L +E+PS +
Sbjct: 1096 LRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALS-PLELPSFS--DK 1152
Query: 2248 VDLSEAEWVQNRYDQLNLIEEKRMAALCHGQLYQKRMKQAFDKKVRP-REFKEGDLVL-K 2305
D + E +Q ++E H +MK+ FD K++ EF+ GDLV+ K
Sbjct: 1153 TDENSQETIQ----VFQTVKE-------HLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVK 1201
Query: 2306 KIFSFQPDSRGKWAPNYEGP-YVVKKA 2331
+ + K AP++ GP YV++K+
Sbjct: 1202 RTKTGFLHKSNKLAPSFAGPFYVLQKS 1228
Score = 50.1 bits (118), Expect = 7e-05
Identities = 42/146 (28%), Positives = 72/146 (48%), Gaps = 25/146 (17%)
Query: 1635 SMGCVLGQQDETGRKEHAIYYLSKKFTECESRYSMLEKTCCALAWAAKRLRHYMINHTTW 1694
++G VL Q+ + K + + Y S K ++ + YS+ +K A+ + K RHY
Sbjct: 718 AVGAVLSQKHDDD-KYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHY------- 769
Query: 1695 LISKMDPIKYIFEKPALIGRI-----------ARWQMLLSE--YDIEYRSQKAIKGSILA 1741
L S ++P K + + LIGRI ARWQ+ L + ++I YR A + +A
Sbjct: 770 LESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGSA---NHIA 826
Query: 1742 DHLAHQPLEDYRPIKFDFPDEEIMYL 1767
D L+ + +++ PI D D I ++
Sbjct: 827 DALS-RIVDETEPIPKDSEDNSINFV 851
>RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protein
type 2
Length = 1333
Score = 105 bits (261), Expect = 2e-21
Identities = 105/447 (23%), Positives = 193/447 (42%), Gaps = 38/447 (8%)
Query: 1891 ELHHIPRDENQMADALATLSSMIKVNHHNDVPLISVKFLDRPAYVFAAEVVFDDKPWFHD 1950
E+++ P N +ADAL+ + V+ +P K + + F ++ D
Sbjct: 814 EINYRPGSANHIADALSRI-----VDETEPIP----KDSEDNSINFVNQISITDDFKNQV 864
Query: 1951 IKVFLQTREYPPGASNKDKKTLRRLSSNFFLNGDILYKRNFDTVLLRCVDKYEADLLIHE 2010
+ + + +N+DK R+ N L +L D +LL D +I +
Sbjct: 865 VTEYTNDTKLLNLLNNEDK----RVEENIQLKDGLLINSK-DQILLPN-DTQLTRTIIKK 918
Query: 2011 IHEGSFGIHPNGHTMAKKILRAGYYWMTMESDCYKHTRKCHKCQIYADKIHMPPTTLNLL 2070
HE IHP + ILR + W + ++ + CH CQI + H P L +
Sbjct: 919 YHEEGKLIHPGIELLTNIILRR-FTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPI 977
Query: 2071 S-SPWPFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKW-VEAASYANVTKQVVVKFIKN 2128
S P+ +D I + S+G+ + V +D F+K + ++T + +
Sbjct: 978 PPSERPWESLSMDFITALPE--SSGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQ 1035
Query: 2129 HIICRYGIPNRIITDNGTNLNNKMMKELCDDFKIEHHNSSPYRPQMNGAVEAANKNIKRI 2188
+I +G P II DN ++ K+ + S PYRPQ +G E N+ ++++
Sbjct: 1036 RVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKL 1095
Query: 2189 VQKMVVTYKD-WHEMLPFALHGYRTSVRTSIGATPFSLVYGMEAVLPVEVEIPSLRVLME 2247
++ + T+ + W + + Y ++ ++ TPF +V+ L +E+PS +
Sbjct: 1096 LRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALS-PLELPSFS--DK 1152
Query: 2248 VDLSEAEWVQNRYDQLNLIEEKRMAALCHGQLYQKRMKQAFDKKVRP-REFKEGDLVL-K 2305
D + E +Q ++E H +MK+ FD K++ EF+ GDLV+ K
Sbjct: 1153 TDENSQETIQ----VFQTVKE-------HLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVK 1201
Query: 2306 KIFSFQPDSRGKWAPNYEGP-YVVKKA 2331
+ + K AP++ GP YV++K+
Sbjct: 1202 RTKTGFLHKSNKLAPSFAGPFYVLQKS 1228
Score = 50.1 bits (118), Expect = 7e-05
Identities = 42/146 (28%), Positives = 72/146 (48%), Gaps = 25/146 (17%)
Query: 1635 SMGCVLGQQDETGRKEHAIYYLSKKFTECESRYSMLEKTCCALAWAAKRLRHYMINHTTW 1694
++G VL Q+ + K + + Y S K ++ + YS+ +K A+ + K RHY
Sbjct: 718 AVGAVLSQKHDDD-KYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHY------- 769
Query: 1695 LISKMDPIKYIFEKPALIGRI-----------ARWQMLLSE--YDIEYRSQKAIKGSILA 1741
L S ++P K + + LIGRI ARWQ+ L + ++I YR A + +A
Sbjct: 770 LESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGSA---NHIA 826
Query: 1742 DHLAHQPLEDYRPIKFDFPDEEIMYL 1767
D L+ + +++ PI D D I ++
Sbjct: 827 DALS-RIVDETEPIPKDSEDNSINFV 851
>RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protein
type 1
Length = 1333
Score = 105 bits (261), Expect = 2e-21
Identities = 105/447 (23%), Positives = 193/447 (42%), Gaps = 38/447 (8%)
Query: 1891 ELHHIPRDENQMADALATLSSMIKVNHHNDVPLISVKFLDRPAYVFAAEVVFDDKPWFHD 1950
E+++ P N +ADAL+ + V+ +P K + + F ++ D
Sbjct: 814 EINYRPGSANHIADALSRI-----VDETEPIP----KDSEDNSINFVNQISITDDFKNQV 864
Query: 1951 IKVFLQTREYPPGASNKDKKTLRRLSSNFFLNGDILYKRNFDTVLLRCVDKYEADLLIHE 2010
+ + + +N+DK R+ N L +L D +LL D +I +
Sbjct: 865 VTEYTNDTKLLNLLNNEDK----RVEENIQLKDGLLINSK-DQILLPN-DTQLTRTIIKK 918
Query: 2011 IHEGSFGIHPNGHTMAKKILRAGYYWMTMESDCYKHTRKCHKCQIYADKIHMPPTTLNLL 2070
HE IHP + ILR + W + ++ + CH CQI + H P L +
Sbjct: 919 YHEEGKLIHPGIELLTNIILRR-FTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPI 977
Query: 2071 S-SPWPFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKW-VEAASYANVTKQVVVKFIKN 2128
S P+ +D I + S+G+ + V +D F+K + ++T + +
Sbjct: 978 PPSERPWESLSMDFITALPE--SSGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQ 1035
Query: 2129 HIICRYGIPNRIITDNGTNLNNKMMKELCDDFKIEHHNSSPYRPQMNGAVEAANKNIKRI 2188
+I +G P II DN ++ K+ + S PYRPQ +G E N+ ++++
Sbjct: 1036 RVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKL 1095
Query: 2189 VQKMVVTYKD-WHEMLPFALHGYRTSVRTSIGATPFSLVYGMEAVLPVEVEIPSLRVLME 2247
++ + T+ + W + + Y ++ ++ TPF +V+ L +E+PS +
Sbjct: 1096 LRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALS-PLELPSFS--DK 1152
Query: 2248 VDLSEAEWVQNRYDQLNLIEEKRMAALCHGQLYQKRMKQAFDKKVRP-REFKEGDLVL-K 2305
D + E +Q ++E H +MK+ FD K++ EF+ GDLV+ K
Sbjct: 1153 TDENSQETIQ----VFQTVKE-------HLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVK 1201
Query: 2306 KIFSFQPDSRGKWAPNYEGP-YVVKKA 2331
+ + K AP++ GP YV++K+
Sbjct: 1202 RTKTGFLHKSNKLAPSFAGPFYVLQKS 1228
Score = 50.1 bits (118), Expect = 7e-05
Identities = 42/146 (28%), Positives = 72/146 (48%), Gaps = 25/146 (17%)
Query: 1635 SMGCVLGQQDETGRKEHAIYYLSKKFTECESRYSMLEKTCCALAWAAKRLRHYMINHTTW 1694
++G VL Q+ + K + + Y S K ++ + YS+ +K A+ + K RHY
Sbjct: 718 AVGAVLSQKHDDD-KYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHY------- 769
Query: 1695 LISKMDPIKYIFEKPALIGRI-----------ARWQMLLSE--YDIEYRSQKAIKGSILA 1741
L S ++P K + + LIGRI ARWQ+ L + ++I YR A + +A
Sbjct: 770 LESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGSA---NHIA 826
Query: 1742 DHLAHQPLEDYRPIKFDFPDEEIMYL 1767
D L+ + +++ PI D D I ++
Sbjct: 827 DALS-RIVDETEPIPKDSEDNSINFV 851
>POL_MLVMO (P03355) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1199
Score = 94.0 bits (232), Expect = 4e-18
Identities = 82/298 (27%), Positives = 135/298 (44%), Gaps = 25/298 (8%)
Query: 2034 YYWMTMESDCYKHTRKCHKC-QIYADKIHMPPTTLNLLSSPWPFSMWGIDMIGRIEPKAS 2092
YY + + T C C Q+ A K + T + P + W ID I+P
Sbjct: 869 YYMLNRDRTLKNITETCKACAQVNASKSAVKQGTR--VRGHRPGTHWEIDFT-EIKP-GL 924
Query: 2093 NGHRFILVAIDYFTKWVEAASYANVTKQVVVKFIKNHIICRYGIPNRIITDNGTNLNNKM 2152
G++++LV ID F+ W+EA T +VV K + I R+G+P + TDNG +K+
Sbjct: 925 YGYKYLLVFIDTFSGWIEAFPTKKETAKVVTKKLLEEIFPRFGMPQVLGTDNGPAFVSKV 984
Query: 2153 MKELCDDFKIEHHNSSPYRPQMNGAVEAANKNIKRIVQKMVVT--YKDWHEMLPFALHGY 2210
+ + D I+ YRPQ +G VE N+ IK + K+ + +DW +LP AL+
Sbjct: 985 SQTVADLLGIDWKLHCAYRPQSSGQVERMNRTIKETLTKLTLATGSRDWVLLLPLALYRA 1044
Query: 2211 RTSVRTSIGATPFSLVYGMEAVLPVEVEIPSLRVLMEVDLSEAEWVQNRYDQLNLIEEKR 2270
R + G TP+ ++YG L V P + ++ + +Q L L++ +
Sbjct: 1045 RNTPGPH-GLTPYEILYGAPPPL-VNFPDPDM-----TRVTNSPSLQAHLQALYLVQHEV 1097
Query: 2271 MAALCHGQLYQKRMKQAFDKKVRPREFKEGDLVLKKIFSFQPDSRGKWAPNYEGPYVV 2328
L YQ+++ D+ V P ++ GD V + + P ++GPY V
Sbjct: 1098 WRPL--AAAYQEQL----DRPVVPHPYRVGDTVWVRRHQTK-----NLEPRWKGPYTV 1144
>POL_MLVFF (P26809) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1204
Score = 94.0 bits (232), Expect = 4e-18
Identities = 82/298 (27%), Positives = 134/298 (44%), Gaps = 25/298 (8%)
Query: 2034 YYWMTMESDCYKHTRKCHKC-QIYADKIHMPPTTLNLLSSPWPFSMWGIDMIGRIEPKAS 2092
YY + + T C C Q+ A K + T + P + W ID ++P
Sbjct: 874 YYMLNRDRTLKDITETCQACAQVNASKSAVKQGTR--VRGHRPGTHWEIDFT-EVKP-GL 929
Query: 2093 NGHRFILVAIDYFTKWVEAASYANVTKQVVVKFIKNHIICRYGIPNRIITDNGTNLNNKM 2152
G++++LV ID F+ WVEA T +VV K + I R+G+P + TDNG +K+
Sbjct: 930 YGYKYLLVFIDTFSGWVEAFPTKKETAKVVTKKLLEEIFPRFGMPQVLGTDNGPAFVSKV 989
Query: 2153 MKELCDDFKIEHHNSSPYRPQMNGAVEAANKNIKRIVQKMVVT--YKDWHEMLPFALHGY 2210
+ + D ++ YRPQ +G VE N+ IK + K+ + +DW +LP AL+
Sbjct: 990 SQTVADLLGVDWKLHCAYRPQSSGQVERMNRTIKETLTKLTLATGSRDWVLLLPLALYRA 1049
Query: 2211 RTSVRTSIGATPFSLVYGMEAVLPVEVEIPSLRVLMEVDLSEAEWVQNRYDQLNLIEEKR 2270
R + G TP+ ++YG L V P + ++ +Q L L++ +
Sbjct: 1050 RNTPGPH-GLTPYEILYGAPPPL-VNFPDPDM-----AKVTHNPSLQAHLQALYLVQHEV 1102
Query: 2271 MAALCHGQLYQKRMKQAFDKKVRPREFKEGDLVLKKIFSFQPDSRGKWAPNYEGPYVV 2328
L YQ+++ D+ V P F+ GD V + + P ++GPY V
Sbjct: 1103 WRPL--AAAYQEQL----DRPVVPHPFRVGDTVWVRRHQTK-----NLEPRWKGPYTV 1149
>POL_MLVFP (P26808) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1204
Score = 93.2 bits (230), Expect = 8e-18
Identities = 81/298 (27%), Positives = 134/298 (44%), Gaps = 25/298 (8%)
Query: 2034 YYWMTMESDCYKHTRKCHKC-QIYADKIHMPPTTLNLLSSPWPFSMWGIDMIGRIEPKAS 2092
YY + + T C C Q+ A K + T + P + W ID ++P
Sbjct: 874 YYMLNRDRTLKDITETCKACAQVNASKSAVKQGTR--VRGHRPGTHWEIDFT-EVKP-GL 929
Query: 2093 NGHRFILVAIDYFTKWVEAASYANVTKQVVVKFIKNHIICRYGIPNRIITDNGTNLNNKM 2152
G++++LV +D F+ WVEA T +VV K + I R+G+P + TDNG +K+
Sbjct: 930 YGYKYLLVFVDTFSGWVEAFPTKKETAKVVTKKLLEEIFPRFGMPQVLGTDNGPAFVSKV 989
Query: 2153 MKELCDDFKIEHHNSSPYRPQMNGAVEAANKNIKRIVQKMVVT--YKDWHEMLPFALHGY 2210
+ + D ++ YRPQ +G VE N+ IK + K+ + +DW +LP AL+
Sbjct: 990 SQTVADLLGVDWKLHCAYRPQSSGQVERMNRTIKETLTKLTLATGSRDWVLLLPLALYRA 1049
Query: 2211 RTSVRTSIGATPFSLVYGMEAVLPVEVEIPSLRVLMEVDLSEAEWVQNRYDQLNLIEEKR 2270
R + G TP+ ++YG L V P + ++ +Q L L++ +
Sbjct: 1050 RNTPGPH-GLTPYEILYGAPPPL-VNFPDPDM-----AKVTHNPSLQAHLQALYLVQHEV 1102
Query: 2271 MAALCHGQLYQKRMKQAFDKKVRPREFKEGDLVLKKIFSFQPDSRGKWAPNYEGPYVV 2328
L YQ+++ D+ V P F+ GD V + + P ++GPY V
Sbjct: 1103 WRPL--AAAYQEQL----DRPVVPHPFRVGDTVWVRRHQTK-----NLEPRWKGPYTV 1149
>POL_MLVF5 (P26810) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1204
Score = 92.8 bits (229), Expect = 1e-17
Identities = 89/337 (26%), Positives = 150/337 (44%), Gaps = 37/337 (10%)
Query: 2000 DKYEADLL--IHEIHEGSFGIHPNGHTMAKKILRAGY---YWMTMESDCYKHTRKCHKC- 2053
D++ +LL +H++ SF + K +L Y Y + + T C C
Sbjct: 842 DQFTFELLDFLHQLTHLSF-------SKTKALLERSYSPSYMLNRDRTLKDITETCKACA 894
Query: 2054 QIYADKIHMPPTTLNLLSSPWPFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAAS 2113
Q+ A K + T + P + W ID ++P G++++LV +D F+ WVEA
Sbjct: 895 QVNASKSAVKQGTR--VRGHRPGTHWEIDFT-EVKP-GLYGYKYLLVFVDTFSGWVEAFP 950
Query: 2114 YANVTKQVVVKFIKNHIICRYGIPNRIITDNGTNLNNKMMKELCDDFKIEHHNSSPYRPQ 2173
T +VV K + I R+G+P + TDNG +K+ + + D ++ YRPQ
Sbjct: 951 TKKETAKVVTKKLLEEIFPRFGMPQVLGTDNGPAFVSKVSQTVADLLGVDWKLHCAYRPQ 1010
Query: 2174 MNGAVEAANKNIKRIVQKMVVT--YKDWHEMLPFALHGYRTSVRTSIGATPFSLVYGMEA 2231
+G VE N+ IK + K+ + +DW +LP AL+ R + G TP+ ++YG
Sbjct: 1011 SSGQVERMNRTIKETLTKLTLATGSRDWVLLLPLALYRARNTPGPH-GLTPYEILYGAPP 1069
Query: 2232 VLPVEVEIPSLRVLMEVDLSEAEWVQNRYDQLNLIEEKRMAALCHGQLYQKRMKQAFDKK 2291
L V P + ++ +Q L L++ + L YQ+++ D+
Sbjct: 1070 PL-VNFPDPDM-----AKVTHNPSLQAHLQALYLVQHEVWRPL--AAAYQEQL----DRP 1117
Query: 2292 VRPREFKEGDLVLKKIFSFQPDSRGKWAPNYEGPYVV 2328
V P F+ GD V + + P ++GPY V
Sbjct: 1118 VVPHPFRVGDTVWVRRHQTK-----NLEPRWKGPYTV 1149
>POL_MLVAV (P03356) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1196
Score = 90.5 bits (223), Expect = 5e-17
Identities = 83/330 (25%), Positives = 142/330 (42%), Gaps = 28/330 (8%)
Query: 2005 DLLIHEIHEGSFGIHPNGHTMAKKILRAG---YYWMTMESDCYKHTRKCHKC-QIYADKI 2060
D + E+ + + G+ K +L G YY + + C C Q+ A K
Sbjct: 837 DQFVFELLDSLHRLTHLGYQKMKALLDRGESPYYMLNRDKTLQYVADSCTVCAQVNASKA 896
Query: 2061 HMPPTTLNLLSSPWPFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYANVTKQ 2120
+ + P S W ID ++P G++++LV +D F+ WVEA T +
Sbjct: 897 KIGAGVR--VRGHRPGSHWEIDFT-EVKP-GLYGYKYLLVFVDTFSGWVEAFPTKRETAR 952
Query: 2121 VVVKFIKNHIICRYGIPNRIITDNGTNLNNKMMKELCDDFKIEHHNSSPYRPQMNGAVEA 2180
VV K + I R+G+P + +DNG +++ + + D I+ YRPQ +G VE
Sbjct: 953 VVSKKLLEEIFPRFGMPQVLGSDNGPAFTSQVSQSVADLLGIDWKLHCAYRPQSSGQVER 1012
Query: 2181 ANKNIKRIVQKMVVT--YKDWHEMLPFALHGYRTSVRTSIGATPFSLVYGMEAVLPVEVE 2238
N+ IK + K+ + +DW +LP AL+ R + G TP+ ++YG L V
Sbjct: 1013 MNRTIKETLTKLTLAAGTRDWVLLLPLALYRARNTPGPH-GLTPYEILYGAPPPL-VNFH 1070
Query: 2239 IPSLRVLMEVDLSEAEWVQNRYDQLNLIEEKRMAALCHGQLYQKRMKQAFDKKVRPREFK 2298
P + +L+ + +Q L ++ + L + + D+ V P F+
Sbjct: 1071 DPDMS-----ELTNSPSLQAHLQALQTVQREIWKPLA------EAYRDQLDQPVIPHPFR 1119
Query: 2299 EGDLVLKKIFSFQPDSRGKWAPNYEGPYVV 2328
GD V + + P ++GPY V
Sbjct: 1120 IGDSVWVRRHQTK-----NLEPRWKGPYTV 1144
>YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II
Length = 1268
Score = 88.2 bits (217), Expect = 2e-16
Identities = 70/260 (26%), Positives = 122/260 (46%), Gaps = 24/260 (9%)
Query: 1999 VDKYEADLLIHEIHEGSFGIHPNGHTMAKKILRAGYYWMTMESDCYKHTRKCHKCQIYAD 2058
V K +++ ++HEG HP G K+ R+ +W ++SD R C+ CQ +
Sbjct: 778 VPKSLQKIVLKQLHEG----HP-GIVQMKQKARSFVFWRGLDSDIENMVRHCNNCQENSK 832
Query: 2059 KIHMPPTTLNLLSSPWPF--SMWG---IDMIGRIEPKASNGHRFILVAIDYFTKWVEAAS 2113
+ P +PWP + W ID G + NG ++LV +D TK+ E
Sbjct: 833 MPRVVPL------NPWPVPEAPWKRIHIDFAGPL-----NGC-YLLVVVDAKTKYAEVKL 880
Query: 2114 YANVTKQVVVKFIKNHIICRYGIPNRIITDNGTNLNNKMMKELCDDFKIEHHNSSPYRPQ 2173
+++ + ++ I +G P II+DNGT L + + ++C IEH S+ Y P+
Sbjct: 881 TRSISAVTTIDLLEE-IFSIHGYPETIISDNGTQLTSHLFAQMCQSHGIEHKTSAVYYPR 939
Query: 2174 MNGAVEAANKNIKRIVQKMVVTYKDWHEMLPFALHGYRTSVRTSI-GATPFSLVYGMEAV 2232
NGA E +KR + K+ ++L L YR + +++ G+TP +G +
Sbjct: 940 SNGAAERFVDTLKRGIAKIKGEGSVNQQILNKFLISYRNTPHSALNGSTPAECHFGRKIR 999
Query: 2233 LPVEVEIPSLRVLMEVDLSE 2252
+ + +P+ RVL L++
Sbjct: 1000 TTMSLLMPTDRVLKVPKLTQ 1019
>POL_MLVCB (P08361) Pol polyprotein [Contains: Reverse
transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
(RT); Integrase (IN)] (Fragment)
Length = 282
Score = 88.2 bits (217), Expect = 2e-16
Identities = 66/237 (27%), Positives = 113/237 (46%), Gaps = 20/237 (8%)
Query: 2094 GHRFILVAIDYFTKWVEAASYANVTKQVVVKFIKNHIICRYGIPNRIITDNGTNLNNKMM 2153
G++++LV +D F+ W+EA T +VV K + I R+G+P + TDNG +K+
Sbjct: 12 GYKYLLVFVDTFSGWIEAFPTKKETAKVVTKKLLEEIFPRFGMPQVLGTDNGPAFVSKVS 71
Query: 2154 KELCDDFKIEHHNSSPYRPQMNGAVEAANKNIKRIVQKMVVT--YKDWHEMLPFALHGYR 2211
+ + D I+ YRPQ +G VE N+ IK + K+ + +DW +LP AL+ R
Sbjct: 72 QTVADLLGIDWKLHCAYRPQSSGQVERMNRTIKETLTKLTLATGSRDWVLLLPLALYRAR 131
Query: 2212 TSVRTSIGATPFSLVYGMEAVLPVEVEIPSLRVLMEVDLSEAEWVQNRYDQLNLIEEKRM 2271
+ G TP+ ++YG L V P + ++ + +Q L L++ +
Sbjct: 132 NTPGPH-GLTPYEILYGAPPPL-VNFPDPDM-----TRVTNSPSLQAHLQALYLVQHEVW 184
Query: 2272 AALCHGQLYQKRMKQAFDKKVRPREFKEGDLVLKKIFSFQPDSRGKWAPNYEGPYVV 2328
L YQ+++ D+ V P ++ GD V + + P ++GPY V
Sbjct: 185 RPL--AAAYQEQL----DRPVVPHPYRVGDTVWVRRHQTK-----NLEPRWKGPYTV 230
>POL_MLVAK (P03357) Pol polyprotein [Contains: Reverse
transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
(RT); Integrase (IN)] (Fragment)
Length = 843
Score = 88.2 bits (217), Expect = 2e-16
Identities = 83/330 (25%), Positives = 144/330 (43%), Gaps = 29/330 (8%)
Query: 2005 DLLIHEIHEGSFGIHPNGHTMAKKILRAG---YYWMTMESDCYKHTRKCHKC-QIYADKI 2060
D + E+ + + G+ K +L G YY + + C C Q+ A K
Sbjct: 485 DQFVFELLDSLHRLTHLGYQKMKALLDRGESPYYMLNRDKTLQYVADSCTVCAQVNASKA 544
Query: 2061 HMPPTTLNLLSSPWPFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYANVTKQ 2120
+ + P S W ID ++P G++++LV +D F+ WVEA T +
Sbjct: 545 KIGAGVR--VRGHRPGSHWEIDFT-EVKP-GLYGYKYLLVFVDTFSGWVEAFPTKRETAR 600
Query: 2121 VVVKFIKNHIICRYGIPNRIITDNGTNLNNKMMKELCDDFKIEHHNSSPYRPQMNGAVEA 2180
VV K + I R+G+P + +DNG +++ + + D I+ + + YRPQ +G VE
Sbjct: 601 VVSKKLLEEIFPRFGMPQVLGSDNGPAFTSQVSQSVADLLGIDKLHCA-YRPQSSGQVER 659
Query: 2181 ANKNIKRIVQKMVVT--YKDWHEMLPFALHGYRTSVRTSIGATPFSLVYGMEAVLPVEVE 2238
N+ IK + K+ + +DW +LP AL+ R + G TP+ ++YG L V
Sbjct: 660 MNRTIKETLTKLTLAAGTRDWVLLLPLALYRARNTPGPH-GLTPYEILYGAPPPL-VNFH 717
Query: 2239 IPSLRVLMEVDLSEAEWVQNRYDQLNLIEEKRMAALCHGQLYQKRMKQAFDKKVRPREFK 2298
P + +L+ + +Q L ++ + L + + D+ V P F+
Sbjct: 718 DPDMS-----ELTNSPSLQAHLQALQTVQREIWKPLA------EAYRDQLDQPVIPHPFR 766
Query: 2299 EGDLVLKKIFSFQPDSRGKWAPNYEGPYVV 2328
GD V + + P ++GPY V
Sbjct: 767 IGDSVWVRRHQTK-----NLEPRWKGPYTV 791
>POL_SFV1 (P23074) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1161
Score = 87.8 bits (216), Expect = 3e-16
Identities = 100/402 (24%), Positives = 165/402 (40%), Gaps = 51/402 (12%)
Query: 1900 NQMADALATLSSMIKVNHHNDVPLISVKFLDRPAYVFAAEVVFDDKPWFHDIKVFLQTRE 1959
N +AD LAT S + H N P + + LD+ L
Sbjct: 748 NNLADKLATQGSYVV--HCNTTPSLDAE-LDQ-----------------------LLQGH 781
Query: 1960 YPPGASNKDKKTLRRLSSNFFLNGDILYKRNFDTVLLRCVDKYEADLLIHEI-HEGSFGI 2018
YPPG + K TL N I+ + N ++ D+ + H I H G
Sbjct: 782 YPPGYPKQYKYTLEE-------NKLIVERPNGIRIVPPKADREKIISTAHNIAHTG---- 830
Query: 2019 HPNGHTMAKKILRAGYYWMTMESDCYKHTRKCHKCQIYADKIHMPPTTLNLLSSPWPFSM 2078
T K + + Y+W + D K R+C +C + P L + PF
Sbjct: 831 --RDATFLK--VSSKYWWPNLRKDVVKSIRQCKQCLVTNATNLTSPPILRPVKPLKPFDK 886
Query: 2079 WGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYANVTKQVVVKFIKNHIICRYGIPN 2138
+ ID IG + P SNG+ +LV +D T +V + VK + +++ IP
Sbjct: 887 FYIDYIGPLPP--SNGYLHVLVVVDSMTGFVWLYPTKAPSTSATVKAL--NMLTSIAIPK 942
Query: 2139 RIITDNGTNLNNKMMKELCDDFKIEHHNSSPYRPQMNGAVEAANKNIKRIVQKMVV-TYK 2197
+ +D G + + + I+ S+PY PQ +G VE N +IKR++ K+++
Sbjct: 943 VLHSDQGAAFTSSTFADWAKEKGIQLEFSTPYHPQSSGKVERKNSDIKRLLTKLLIGRPA 1002
Query: 2198 DWHEMLPFALHGYRTSVRTSIGATPFSLVYGMEAVLPVEVEIPSLRVLMEVDLSEAEWVQ 2257
W+++LP S S TP L++G+++ P +L + E +LS + ++
Sbjct: 1003 KWYDLLPVVQLALNNSYSPSSKYTPHQLLFGVDSNTPF-ANSDTLDLSREEELSLLQEIR 1061
Query: 2258 NRYDQ-LNLIEEKRMAALCHGQLYQKRMKQAFDKKVRPREFK 2298
+ Q + R + GQL Q+R+ A +RPR K
Sbjct: 1062 SSLHQPTSPPASSRSWSPSVGQLVQERV--ARPASLRPRWHK 1101
>POL_MLVRK (P31795) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)] (Fragment)
Length = 581
Score = 86.7 bits (213), Expect = 7e-16
Identities = 84/333 (25%), Positives = 142/333 (42%), Gaps = 34/333 (10%)
Query: 2005 DLLIHEIHEGSFGIHPNGHTMAKKILRAG---YYWMTMESDCYKHTRKCHKC-QIYADKI 2060
D + E+ + + G+ K +L G YY + + C C Q+ A K
Sbjct: 222 DQFVFELLDSLHRLTHLGYQKMKALLDRGESPYYMLNRDKTLQYVADSCTVCAQVNASKA 281
Query: 2061 HMPPTTLNLLSSPWPFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYANVTKQ 2120
+ + P + W ID ++P G++++LV +D F+ WVEA + T +
Sbjct: 282 KIGAGVR--VRGHRPGTHWEIDFT-EVKP-GLYGYKYLLVFVDTFSGWVEAFPTKHETAK 337
Query: 2121 VVVKFIKNHIICRYGIPNRIITDNGTNLNNKMMKELCDDFKIEHHNSSPYRPQMNGAVEA 2180
+V K + I R+G+P + TDNG +++ + + I+ YRPQ +G VE
Sbjct: 338 IVTKKLLEEIFPRFGMPQVLGTDNGPAFVSQVSQSVAKLLGIDWKLHCAYRPQSSGQVER 397
Query: 2181 ANKNIKRIVQKMVVT--YKDWHEMLPFALHGYRTSVRTSIGATPFSLVYGMEAVLPVEVE 2238
N+ IK + K+ + +DW +LP AL+ R + G TP+ ++YG L V
Sbjct: 398 MNRTIKETLTKLTLATGTRDWVLLLPLALYRARNTPGPH-GLTPYEILYGAPPPL-VNFH 455
Query: 2239 IPSLRVLMEVDLSEAEWVQNRYDQLNLIEE---KRMAALCHGQLYQKRMKQAFDKKVRPR 2295
P + + + +Q L ++ K +AA QL D+ V P
Sbjct: 456 DPEMS-----KFTNSPSLQAHLQALQAVQREVWKPLAAAYQDQL---------DQPVIPH 501
Query: 2296 EFKEGDLVLKKIFSFQPDSRGKWAPNYEGPYVV 2328
F+ GD V + + P ++GPY V
Sbjct: 502 PFRVGDTVWVRRHQTK-----NLEPRWKGPYTV 529
>POL_MLVRD (P11227) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1196
Score = 86.7 bits (213), Expect = 7e-16
Identities = 84/333 (25%), Positives = 142/333 (42%), Gaps = 34/333 (10%)
Query: 2005 DLLIHEIHEGSFGIHPNGHTMAKKILRAG---YYWMTMESDCYKHTRKCHKC-QIYADKI 2060
D + E+ + + G+ K +L G YY + + C C Q+ A K
Sbjct: 837 DQFVFELLDSLHRLTHLGYQKMKALLDRGESPYYMLNRDKTLQYVADSCTVCAQVNASKA 896
Query: 2061 HMPPTTLNLLSSPWPFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYANVTKQ 2120
+ + P + W ID ++P G++++LV +D F+ WVEA + T +
Sbjct: 897 KIGAGVR--VRGHRPGTHWEIDFT-EVKP-GLYGYKYLLVFVDTFSGWVEAFPTKHETAK 952
Query: 2121 VVVKFIKNHIICRYGIPNRIITDNGTNLNNKMMKELCDDFKIEHHNSSPYRPQMNGAVEA 2180
+V K + I R+G+P + TDNG +++ + + I+ YRPQ +G VE
Sbjct: 953 IVTKKLLEEIFPRFGMPQVLGTDNGPAFVSQVSQSVAKLLGIDWKLHCAYRPQSSGQVER 1012
Query: 2181 ANKNIKRIVQKMVVT--YKDWHEMLPFALHGYRTSVRTSIGATPFSLVYGMEAVLPVEVE 2238
N+ IK + K+ + +DW +LP AL+ R + G TP+ ++YG L V
Sbjct: 1013 MNRTIKETLTKLTLATGTRDWVLLLPLALYRARNTPGPH-GLTPYEILYGAPPPL-VNFH 1070
Query: 2239 IPSLRVLMEVDLSEAEWVQNRYDQLNLIEE---KRMAALCHGQLYQKRMKQAFDKKVRPR 2295
P + + + +Q L ++ K +AA QL D+ V P
Sbjct: 1071 DPEMS-----KFTNSPSLQAHLQALQAVQREVWKPLAAAYQDQL---------DQPVIPH 1116
Query: 2296 EFKEGDLVLKKIFSFQPDSRGKWAPNYEGPYVV 2328
F+ GD V + + P ++GPY V
Sbjct: 1117 PFRVGDTVWVRRHQTK-----NLEPRWKGPYTV 1144
>POL3_MOUSE (P11367) Retrovirus-related Pol polyprotein (Endonuclease)
(Fragment)
Length = 390
Score = 85.9 bits (211), Expect = 1e-15
Identities = 78/301 (25%), Positives = 132/301 (42%), Gaps = 31/301 (10%)
Query: 2034 YYWMTMESDCYKHTRKCHKC-QIYADKIHMPPTTLNLLSSPWPFSMWGIDMIGRIEPKAS 2092
YY + + ++ C C Q+ A K + T + + W ID ++P
Sbjct: 78 YYMLNKDKILHEVAESCQACVQVNASKTKIRAGTR--VRGHRLGTHWEIDFT-EVKP-GL 133
Query: 2093 NGHRFILVAIDYFTKWVEAASYANVTKQVVVKFIKNHIICRYGIPNRIITDNGTNLNNKM 2152
G++++LV +D F+ WVEA + T ++V K + I R+G+P + TDNG +++
Sbjct: 134 YGYKYLLVFVDTFSGWVEAFPTKHETAKIVTKKLLEEIFPRFGMPQVLGTDNGPAFVSQV 193
Query: 2153 MKELCDDFKIEHHNSSPYRPQMNGAVEAANKNIKRIVQKMVVT--YKDWHEMLPFALHGY 2210
+ + I+ YRPQ +G VE N+ IK + K+ + +DW +LP AL+
Sbjct: 194 SQSVAKLLGIDWKLHCAYRPQSSGQVERMNRTIKETLTKLTLATGTRDWVLLLPLALYRA 253
Query: 2211 RTSVRTSIGATPFSLVYGMEAVLPVEVEIPSLRVLMEVDLSEAEWVQNRYDQLNLIEE-- 2268
R + G TP+ ++YG L V P + + + +Q L ++
Sbjct: 254 RNTPGPH-GLTPYEILYGAPPPL-VNFHDPEMS-----KFTNSPSLQAHLQALQAVQREV 306
Query: 2269 -KRMAALCHGQLYQKRMKQAFDKKVRPREFKEGDLVLKKIFSFQPDSRGKWAPNYEGPYV 2327
K +AA QL D+ V P F+ GD V + + P ++GPY
Sbjct: 307 WKPLAAAYQDQL---------DQPVIPHPFRVGDTVWVRRHQTK-----NLEPRWKGPYT 352
Query: 2328 V 2328
V
Sbjct: 353 V 353
>POL_SFV3L (P27401) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1157
Score = 83.2 bits (204), Expect = 8e-15
Identities = 71/267 (26%), Positives = 119/267 (43%), Gaps = 9/267 (3%)
Query: 2034 YYWMTMESDCYKHTRKCHKCQIYADKIHMPPTTLNLLSSPWPFSMWGIDMIGRIEPKASN 2093
Y+W + D K R+C +C + P L PF + ID IG + P SN
Sbjct: 844 YWWPNLRKDVVKVIRQCKQCLVTNAATLAAPPILRPERPVKPFDKFFIDYIGPLPP--SN 901
Query: 2094 GHRFILVAIDYFTKWVEAASYANVTKQVVVKFIKNHIICRYGIPNRIITDNGTNLNNKMM 2153
G+ +LV +D T +V + VK + +++ +P I +D G +
Sbjct: 902 GYLHVLVVVDSMTGFVWLYPTKAPSTSATVKAL--NMLTSIAVPKVIHSDQGAAFTSATF 959
Query: 2154 KELCDDFKIEHHNSSPYRPQMNGAVEAANKNIKRIVQKMVV-TYKDWHEMLPFALHGYRT 2212
+ + I+ S+PY PQ +G VE N +IKR++ K++V W+++LP
Sbjct: 960 ADWAKNKGIQLEFSTPYHPQSSGKVERKNSDIKRLLTKLLVGRPAKWYDLLPVVQLALNN 1019
Query: 2213 SVRTSIGATPFSLVYGMEAVLPVEVEIPSLRVLMEVDLSEAEWVQNR-YDQLNLIEEKRM 2271
S S TP L++G+++ P +L + E +LS + +++ Y R
Sbjct: 1020 SYSPSSKYTPHQLLFGIDSNTPF-ANSDTLDLSREEELSLLQEIRSSLYLPSTPPASIRA 1078
Query: 2272 AALCHGQLYQKRMKQAFDKKVRPREFK 2298
+ GQL Q+R+ A +RPR K
Sbjct: 1079 WSPSVGQLVQERV--ARPASLRPRWHK 1103
>POL_AVIRE (P03360) Pol polyprotein [Contains: Reverse
transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
(RT); Integrase (IN)] (Fragment)
Length = 473
Score = 82.4 bits (202), Expect = 1e-14
Identities = 48/156 (30%), Positives = 80/156 (50%), Gaps = 2/156 (1%)
Query: 2075 PFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYANVTKQVVVKFIKNHIICRY 2134
P W +D I K G++++LV +D F+ WVEA T QVV+K + II R+
Sbjct: 188 PGEHWEVDFTEMITAKG--GYKYLLVLVDTFSGWVEAYPAKRETSQVVIKHLILDIIPRF 245
Query: 2135 GIPNRIITDNGTNLNNKMMKELCDDFKIEHHNSSPYRPQMNGAVEAANKNIKRIVQKMVV 2194
G+P +I +DNG K+ ++LC+ + YRPQ +G VE N+ +K+ + K+
Sbjct: 246 GLPVQIGSDNGPAFVAKVTQQLCEALNVSWKLHCAYRPQSSGQVERMNRTLKKAIAKLED 305
Query: 2195 TYKDWHEMLPFALHGYRTSVRTSIGATPFSLVYGME 2230
+ + P + T G +PF ++YG++
Sbjct: 306 RDRRGLGLPPPSGFAPGTVYPGREGLSPFEILYGLK 341
>POL_FENV1 (P31792) Pol polyprotein [Contains: Reverse
transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
(RT); Integrase (IN)] (Fragment)
Length = 1046
Score = 80.1 bits (196), Expect = 7e-14
Identities = 57/185 (30%), Positives = 88/185 (46%), Gaps = 8/185 (4%)
Query: 2047 TRKCHKCQ-IYADKIHMPPTTLNLLSSPWPFSMWGIDMIGRIEPKASNGHRFILVAIDYF 2105
T C CQ + A +P + P + W ID ++P + G++++LV +D F
Sbjct: 734 TSACKVCQQVNAGATRVPEGKRTRGNRPGVY--WEIDFT-EVKPHYA-GYKYLLVFVDTF 789
Query: 2106 TKWVEAASYANVTKQVVVKFIKNHIICRYGIPNRIITDNGTNLNNKMMKELCDDFKIEHH 2165
+ WVEA T +V K I I R+G+P I +DNG +++ + L I
Sbjct: 790 SGWVEAYPTRQETAHMVAKKILEEIFPRFGLPKVIGSDNGPAFVSQVSQGLARTLGINWK 849
Query: 2166 NSSPYRPQMNGAVEAANKNIKRIVQKMVVT--YKDWHEMLPFALHGYRTSVRTSIGATPF 2223
YRPQ +G VE N+ IK + K+ + KDW +L AL R + G TP+
Sbjct: 850 LHCAYRPQSSGQVERMNRTIKETLTKLTLETGLKDWRRLLSLALLRAR-NTPNRFGLTPY 908
Query: 2224 SLVYG 2228
++YG
Sbjct: 909 EILYG 913
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.327 0.140 0.434
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 275,180,715
Number of Sequences: 164201
Number of extensions: 12109897
Number of successful extensions: 58744
Number of sequences better than 10.0: 514
Number of HSP's better than 10.0 without gapping: 283
Number of HSP's successfully gapped in prelim test: 249
Number of HSP's that attempted gapping in prelim test: 46577
Number of HSP's gapped (non-prelim): 4108
length of query: 2360
length of database: 59,974,054
effective HSP length: 127
effective length of query: 2233
effective length of database: 39,120,527
effective search space: 87356136791
effective search space used: 87356136791
T: 11
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.7 bits)
S2: 74 (33.1 bits)
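
The figures in the statistics footer above can be cross-checked against each other. The sketch below is not part of the BLAST output. It assumes the standard Karlin-Altschul relations (bit score = (lambda * raw - ln K) / ln 2, and E = search space * 2^-bits) and uses the gapped Lambda and K printed above. All function names are hypothetical, chosen for illustration. Because the report prints Lambda and K rounded, recomputed bit scores and E-values should only be expected to land close to the printed ones.

```python
import math

# Values copied from the statistics footer of this report.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
QUERY_LEN = 2360
DB_LETTERS = 59_974_054
DB_SEQS = 164_201
EFF_HSP_LEN = 127


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Effective query length times effective database length."""
    eff_query = query_len - hsp_len            # 2360 - 127
    eff_db = db_letters - db_seqs * hsp_len    # one HSP length removed per sequence
    return eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


SPACE = effective_search_space(QUERY_LEN, DB_LETTERS, DB_SEQS, EFF_HSP_LEN)

if __name__ == "__main__":
    print("effective search space:", SPACE)
    # Raw scores 296 and 90 are the two POL4_DROME HSPs listed above.
    for raw in (296, 90):
        bits = bit_score(raw, GAPPED_LAMBDA, GAPPED_K)
        print(raw, round(bits, 1), f"{expect_value(bits, SPACE):.1e}")
```

The recomputed search space can be compared directly with the "effective search space" line, and the per-HSP values with the "Score = ... bits (...), Expect = ..." lines of the corresponding hit.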