
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC146806.4 + phase: 0 /pseudo
(796 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E Value
RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protei... 227 8e-59
RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protei... 227 8e-59
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei... 227 8e-59
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran... 106 2e-22
POL_SFV1 (P23074) Pol polyprotein [Contains: Protease (EC 3.4.23... 101 9e-21
YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II 99 4e-20
POL_FOAMV (P14350) Pol polyprotein [Contains: Reverse transcript... 97 1e-19
POL_SFV3L (P27401) Pol polyprotein [Contains: Protease (EC 3.4.2... 97 2e-19
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III 74 2e-12
POL_GALV (P21414) Pol polyprotein [Contains: Protease (EC 3.4.23... 68 1e-10
POL_MLVAV (P03356) Pol polyprotein [Contains: Protease (EC 3.4.2... 67 2e-10
POL_BAEVM (P10272) Pol polyprotein [Contains: Protease (EC 3.4.2... 66 3e-10
POL_FENV1 (P31792) Pol polyprotein [Contains: Reverse transcript... 64 2e-09
POL_MLVRK (P31795) Pol polyprotein [Contains: Protease (EC 3.4.2... 64 2e-09
POL_MLVRD (P11227) Pol polyprotein [Contains: Protease (EC 3.4.2... 64 2e-09
POL_MLVAK (P03357) Pol polyprotein [Contains: Reverse transcript... 63 3e-09
POL3_MOUSE (P11367) Retrovirus-related Pol polyprotein (Endonucl... 61 1e-08
POL_MLVFP (P26808) Pol polyprotein [Contains: Protease (EC 3.4.2... 57 1e-07
POL_MLVFF (P26809) Pol polyprotein [Contains: Protease (EC 3.4.2... 57 1e-07
POL_MLVF5 (P26810) Pol polyprotein [Contains: Protease (EC 3.4.2... 57 1e-07
>RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protein
type 3
Length = 1333
Score = 227 bits (579), Expect = 8e-59
Identities = 142/464 (30%), Positives = 240/464 (51%), Gaps = 23/464 (4%)
Query: 258 PGKANVVADVLSRKTLHMYALMVKELELIEQFGELSLVSELTPDGVRLGMLKLTSNILEE 317
PG AN +AD LSR +V E E I + E + ++ + + +T + +
Sbjct: 819 PGSANHIADALSR--------IVDETEPIPKDSEDNSINFVN-------QISITDDFKNQ 863
Query: 318 IKNGQKEDLELVDRVTLVNQGKGGDFRLDENDVLMFRDRVCVPDVLDLKRQILDEGHRSS 377
+ D +L++ + ++ + +L + ++ +D++ +P+ L R I+ + H
Sbjct: 864 VVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLINSKDQILLPNDTQLTRTIIKKYHEEG 923
Query: 378 LSIHPGATKMYQDLKRLFWWPGMKKEIAEFVYACLVCQKSMIEHQRPSGLMQPLFVPEWT 437
IHPG + + R F W G++K+I E+V C CQ + + +P G +QP+ E
Sbjct: 924 KLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERP 983
Query: 438 WDSISMDFVGALPKTSKSFDTIWVIVDRLTKSTHFVPIKTSMSIARLAEIYIEQIVRLHG 497
W+S+SMDF+ ALP++S ++ ++V+VDR +K VP S++ + A ++ ++++ G
Sbjct: 984 WESLSMDFITALPESS-GYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFG 1042
Query: 498 IPSSIVSDRDPRFTSNFWESLQAALGTKLSLSSAYHPQTDGQTERTIQSLEDLLRACVLE 557
P I++D D FTS W+ + S Y PQTDGQTERT Q++E LLR
Sbjct: 1043 NPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCST 1102
Query: 558 QGVSWDECLPLIEFTYNNSFHSSIRMAPFEALYGRRCRTPLCWFESGESAMLAPEVVQET 617
+W + + L++ +YNN+ HS+ +M PFE ++ R L E + E QET
Sbjct: 1103 HPNTWVDHISLVQQSYNNAIHSATQMTPFEIVH--RYSPALSPLELPSFSDKTDENSQET 1160
Query: 618 TEKVKMIQEKMKASQST*KSYHDKRRKDI-EFQVGDHVFLWVNPVTGVGHALKCRKLTPR 676
+ + ++E + + K Y D + ++I EFQ GD V + T G K KL P
Sbjct: 1161 IQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMV---KRTKTGFLHKSNKLAPS 1217
Query: 677 FVGPFDVIEKVGVVAYRIALPPSLSNL-HNVFHVPKLRKYVHDA 719
F GPF V++K G Y + LP S+ ++ + FHV L KY H++
Sbjct: 1218 FAGPFYVLQKSGPNNYELDLPDSIKHMFSSTFHVSHLEKYRHNS 1261
>RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protein
type 2
Length = 1333
Score = 227 bits (579), Expect = 8e-59
Identities = 142/464 (30%), Positives = 240/464 (51%), Gaps = 23/464 (4%)
Query: 258 PGKANVVADVLSRKTLHMYALMVKELELIEQFGELSLVSELTPDGVRLGMLKLTSNILEE 317
PG AN +AD LSR +V E E I + E + ++ + + +T + +
Sbjct: 819 PGSANHIADALSR--------IVDETEPIPKDSEDNSINFVN-------QISITDDFKNQ 863
Query: 318 IKNGQKEDLELVDRVTLVNQGKGGDFRLDENDVLMFRDRVCVPDVLDLKRQILDEGHRSS 377
+ D +L++ + ++ + +L + ++ +D++ +P+ L R I+ + H
Sbjct: 864 VVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLINSKDQILLPNDTQLTRTIIKKYHEEG 923
Query: 378 LSIHPGATKMYQDLKRLFWWPGMKKEIAEFVYACLVCQKSMIEHQRPSGLMQPLFVPEWT 437
IHPG + + R F W G++K+I E+V C CQ + + +P G +QP+ E
Sbjct: 924 KLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERP 983
Query: 438 WDSISMDFVGALPKTSKSFDTIWVIVDRLTKSTHFVPIKTSMSIARLAEIYIEQIVRLHG 497
W+S+SMDF+ ALP++S ++ ++V+VDR +K VP S++ + A ++ ++++ G
Sbjct: 984 WESLSMDFITALPESS-GYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFG 1042
Query: 498 IPSSIVSDRDPRFTSNFWESLQAALGTKLSLSSAYHPQTDGQTERTIQSLEDLLRACVLE 557
P I++D D FTS W+ + S Y PQTDGQTERT Q++E LLR
Sbjct: 1043 NPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCST 1102
Query: 558 QGVSWDECLPLIEFTYNNSFHSSIRMAPFEALYGRRCRTPLCWFESGESAMLAPEVVQET 617
+W + + L++ +YNN+ HS+ +M PFE ++ R L E + E QET
Sbjct: 1103 HPNTWVDHISLVQQSYNNAIHSATQMTPFEIVH--RYSPALSPLELPSFSDKTDENSQET 1160
Query: 618 TEKVKMIQEKMKASQST*KSYHDKRRKDI-EFQVGDHVFLWVNPVTGVGHALKCRKLTPR 676
+ + ++E + + K Y D + ++I EFQ GD V + T G K KL P
Sbjct: 1161 IQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMV---KRTKTGFLHKSNKLAPS 1217
Query: 677 FVGPFDVIEKVGVVAYRIALPPSLSNL-HNVFHVPKLRKYVHDA 719
F GPF V++K G Y + LP S+ ++ + FHV L KY H++
Sbjct: 1218 FAGPFYVLQKSGPNNYELDLPDSIKHMFSSTFHVSHLEKYRHNS 1261
>RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protein
type 1
Length = 1333
Score = 227 bits (579), Expect = 8e-59
Identities = 142/464 (30%), Positives = 240/464 (51%), Gaps = 23/464 (4%)
Query: 258 PGKANVVADVLSRKTLHMYALMVKELELIEQFGELSLVSELTPDGVRLGMLKLTSNILEE 317
PG AN +AD LSR +V E E I + E + ++ + + +T + +
Sbjct: 819 PGSANHIADALSR--------IVDETEPIPKDSEDNSINFVN-------QISITDDFKNQ 863
Query: 318 IKNGQKEDLELVDRVTLVNQGKGGDFRLDENDVLMFRDRVCVPDVLDLKRQILDEGHRSS 377
+ D +L++ + ++ + +L + ++ +D++ +P+ L R I+ + H
Sbjct: 864 VVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLINSKDQILLPNDTQLTRTIIKKYHEEG 923
Query: 378 LSIHPGATKMYQDLKRLFWWPGMKKEIAEFVYACLVCQKSMIEHQRPSGLMQPLFVPEWT 437
IHPG + + R F W G++K+I E+V C CQ + + +P G +QP+ E
Sbjct: 924 KLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERP 983
Query: 438 WDSISMDFVGALPKTSKSFDTIWVIVDRLTKSTHFVPIKTSMSIARLAEIYIEQIVRLHG 497
W+S+SMDF+ ALP++S ++ ++V+VDR +K VP S++ + A ++ ++++ G
Sbjct: 984 WESLSMDFITALPESS-GYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFG 1042
Query: 498 IPSSIVSDRDPRFTSNFWESLQAALGTKLSLSSAYHPQTDGQTERTIQSLEDLLRACVLE 557
P I++D D FTS W+ + S Y PQTDGQTERT Q++E LLR
Sbjct: 1043 NPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCST 1102
Query: 558 QGVSWDECLPLIEFTYNNSFHSSIRMAPFEALYGRRCRTPLCWFESGESAMLAPEVVQET 617
+W + + L++ +YNN+ HS+ +M PFE ++ R L E + E QET
Sbjct: 1103 HPNTWVDHISLVQQSYNNAIHSATQMTPFEIVH--RYSPALSPLELPSFSDKTDENSQET 1160
Query: 618 TEKVKMIQEKMKASQST*KSYHDKRRKDI-EFQVGDHVFLWVNPVTGVGHALKCRKLTPR 676
+ + ++E + + K Y D + ++I EFQ GD V + T G K KL P
Sbjct: 1161 IQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMV---KRTKTGFLHKSNKLAPS 1217
Query: 677 FVGPFDVIEKVGVVAYRIALPPSLSNL-HNVFHVPKLRKYVHDA 719
F GPF V++K G Y + LP S+ ++ + FHV L KY H++
Sbjct: 1218 FAGPFYVLQKSGPNNYELDLPDSIKHMFSSTFHVSHLEKYRHNS 1261
>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
transposon 412 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1237
Score = 106 bits (265), Expect = 2e-22
Identities = 82/314 (26%), Positives = 143/314 (45%), Gaps = 18/314 (5%)
Query: 381 HPGATKMYQDLKRLFWWPGMKKEIAEFVYACLVCQKSMIEHQRPSGLMQPLFVPEWTWDS 440
H G TK +KR ++W M K I E+V C CQK+ + M PE +D
Sbjct: 909 HTGITKTLAKVKRHYYWKNMSKYIKEYVRKCQKCQKAKTTKHTKTP-MTITETPEHAFDR 967
Query: 441 ISMDFVGALPKTSKSFDTIWVIVDRLTKSTHFVPIKTSMSIARLAEIYIEQIVRLHGIPS 500
+ +D +G LPK+ + ++ LTK +PI + S +A+ E + +G
Sbjct: 968 VVVDTIGPLPKSENGNEYAVTLICDLTKYLVAIPI-ANKSAKTVAKAIFESFILKYGPMK 1026
Query: 501 SIVSDRDPRFTSNFWESLQAALGTKLSLSSAYHPQTDGQTERTIQSLEDLLRACVLEQGV 560
+ ++D + ++ L L K S+A+H QT G ER+ ++L + +R+ +
Sbjct: 1027 TFITDMGTEYKNSIITDLCKYLKIKNITSTAHHHQTVGVVERSHRTLNEYIRSYISTDKT 1086
Query: 561 SWDECLPLIEFTYNNSFHSSIRMAPFEALYGRRCRTPLCW--FESGESAMLAPEVVQETT 618
WD L + +N + P+E ++GR P + S E + +E+
Sbjct: 1087 DWDVWLQYFVYCFNTTQSMVHNYCPYELVFGRTSNLPKHFNKLHSIEPIYNIDDYAKESK 1146
Query: 619 EKVKM----IQEKMKASQST*KSYHDKRRKDIEFQVGDHVFLWVNPVTGVGHALKCRKLT 674
++++ ++ ++A + K +D + KDIE +VGD V L VGH KL
Sbjct: 1147 YRLEVAYARARKLLEAHKEKNKENYDLKVKDIELEVGDKVLL----RNEVGH-----KLD 1197
Query: 675 PRFVGPFDVIEKVG 688
++ GP+ IE +G
Sbjct: 1198 FKYTGPYK-IESIG 1210
>POL_SFV1 (P23074) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1161
Score = 101 bits (251), Expect = 9e-21
Identities = 69/286 (24%), Positives = 135/286 (47%), Gaps = 20/286 (6%)
Query: 343 FRLDENDVLMFRDRVC--VPDVLDLKRQILDEGHRSSLSIHPGATKMYQDLKRLFWWPGM 400
+ L+EN +++ R VP D + +I+ H + H G + + +WWP +
Sbjct: 792 YTLEENKLIVERPNGIRIVPPKAD-REKIISTAHNIA---HTGRDATFLKVSSKYWWPNL 847
Query: 401 KKEIAEFVYACLVCQKSMIEHQRPSGLMQPLFVPEWTWDSISMDFVGALPKTSKSFDTIW 460
+K++ + + C C + + +++P+ P +D +D++G LP S + +
Sbjct: 848 RKDVVKSIRQCKQCLVTNATNLTSPPILRPV-KPLKPFDKFYIDYIGPLPP-SNGYLHVL 905
Query: 461 VIVDRLTKSTHFVPIKTSMSIARLAEIYIEQIVRLHGIPSSIVSDRDPRFTSNFWESLQA 520
V+VD +T P K + A + + + + IP + SD+ FTS+ +
Sbjct: 906 VVVDSMTGFVWLYPTKAPSTSATVKALNMLTSI---AIPKVLHSDQGAAFTSSTFADWAK 962
Query: 521 ALGTKLSLSSAYHPQTDGQTERTIQSLEDLLRACVLEQGVSWDECLPLIEFTYNNSFHSS 580
G +L S+ YHPQ+ G+ ER ++ LL ++ + W + LP+++ NNS+ S
Sbjct: 963 EKGIQLEFSTPYHPQSSGKVERKNSDIKRLLTKLLIGRPAKWYDLLPVVQLALNNSYSPS 1022
Query: 581 IRMAPFEALYGRRCRTPLCWFESGESAMLAPEVVQETTEKVKMIQE 626
+ P + L+G TP F + ++ L+ E E++ ++QE
Sbjct: 1023 SKYTPHQLLFGVDSNTP---FANSDTLDLSRE------EELSLLQE 1059
>YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II
Length = 1268
Score = 99.0 bits (245), Expect = 4e-20
Identities = 75/268 (27%), Positives = 130/268 (47%), Gaps = 21/268 (7%)
Query: 351 LMFRDRVCVPDVLDLKRQILDEGHRSSLSIHPGATKMYQDLKRLFWWPGMKKEIAEFVYA 410
L+ DRV VP L++ +L + H HPG +M Q + +W G+ +I V
Sbjct: 770 LLLDDRVIVPK--SLQKIVLKQLHEG----HPGIVQMKQKARSFVFWRGLDSDIENMVRH 823
Query: 411 CLVCQK-SMIEHQRPSGLMQPLFVPEWTWDSISMDFVGALPKTSKSFDTIWVIVDRLTKS 469
C CQ+ S + P + P VPE W I +DF G L + V+VD TK
Sbjct: 824 CNNCQENSKMPRVVP---LNPWPVPEAPWKRIHIDFAGPLNGCY-----LLVVVDAKTK- 874
Query: 470 THFVPIKTSMSIARLAEI-YIEQIVRLHGIPSSIVSDRDPRFTSNFWESLQAALGTKLSL 528
+ +K + SI+ + I +E+I +HG P +I+SD + TS+ + + + G +
Sbjct: 875 --YAEVKLTRSISAVTTIDLLEEIFSIHGYPETIISDNGTQLTSHLFAQMCQSHGIEHKT 932
Query: 529 SSAYHPQTDGQTERTIQSLEDLLRACVLEQGVSWDECLPLIEFTYNNSFHSSIR-MAPFE 587
S+ Y+P+++G ER + +L+ + A + +G + L +Y N+ HS++ P E
Sbjct: 933 SAVYYPRSNGAAERFVDTLKRGI-AKIKGEGSVNQQILNKFLISYRNTPHSALNGSTPAE 991
Query: 588 ALYGRRCRTPLCWFESGESAMLAPEVVQ 615
+GR+ RT + + + P++ Q
Sbjct: 992 CHFGRKIRTTMSLLMPTDRVLKVPKLTQ 1019
>POL_FOAMV (P14350) Pol polyprotein [Contains: Reverse
transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
(RT); Integrase (IN)]
Length = 886
Score = 97.4 bits (241), Expect = 1e-19
Identities = 55/217 (25%), Positives = 104/217 (47%), Gaps = 5/217 (2%)
Query: 381 HPGATKMYQDLKRLFWWPGMKKEIAEFVYACLVCQKSMIEHQRPSGLMQPLFVPEWTWDS 440
H G + L+WWP M+K++ + + C C + ++ +++P P+ +D
Sbjct: 620 HTGREATLLKIANLYWWPNMRKDVVKQLGRCQQCLITNASNKASGPILRP-DRPQKPFDK 678
Query: 441 ISMDFVGALPKTSKSFDTIWVIVDRLTKSTHFVPIKTSMSIARLAEIYIEQIVRLHGIPS 500
+D++G LP S+ + + V+VD +T T P K + A + + + + IP
Sbjct: 679 FFIDYIGPLPP-SQGYLYVLVVVDGMTGFTWLYPTKAPSTSATVKSLNVLTSI---AIPK 734
Query: 501 SIVSDRDPRFTSNFWESLQAALGTKLSLSSAYHPQTDGQTERTIQSLEDLLRACVLEQGV 560
I SD+ FTS+ + G L S+ YHPQ+ + ER ++ LL ++ +
Sbjct: 735 VIHSDQGAAFTSSTFAEWAKERGIHLEFSTPYHPQSGSKVERKNSDIKRLLTKLLVGRPT 794
Query: 561 SWDECLPLIEFTYNNSFHSSIRMAPFEALYGRRCRTP 597
W + LP+++ NN++ ++ P + L+G TP
Sbjct: 795 KWYDLLPVVQLALNNTYSPVLKYTPHQLLFGIDSNTP 831
>POL_SFV3L (P27401) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1157
Score = 96.7 bits (239), Expect = 2e-19
Identities = 68/286 (23%), Positives = 133/286 (45%), Gaps = 20/286 (6%)
Query: 343 FRLDENDVLMFRD--RVCVPDVLDLKRQILDEGHRSSLSIHPGATKMYQDLKRLFWWPGM 400
++L+ V++ R + +P D + QI+ + H + H G + + +WWP +
Sbjct: 794 YQLENGQVMVTRPNGKRIIPPKSD-RPQIILQAHNIA---HTGRDSTFLKVSSKYWWPNL 849
Query: 401 KKEIAEFVYACLVCQKSMIEHQRPSGLMQPLFVPEWTWDSISMDFVGALPKTSKSFDTIW 460
+K++ + + C C + +++P P +D +D++G LP S + +
Sbjct: 850 RKDVVKVIRQCKQCLVTNAATLAAPPILRPER-PVKPFDKFFIDYIGPLPP-SNGYLHVL 907
Query: 461 VIVDRLTKSTHFVPIKTSMSIARLAEIYIEQIVRLHGIPSSIVSDRDPRFTSNFWESLQA 520
V+VD +T P K + A + + + + +P I SD+ FTS +
Sbjct: 908 VVVDSMTGFVWLYPTKAPSTSATVKALNMLTSI---AVPKVIHSDQGAAFTSATFADWAK 964
Query: 521 ALGTKLSLSSAYHPQTDGQTERTIQSLEDLLRACVLEQGVSWDECLPLIEFTYNNSFHSS 580
G +L S+ YHPQ+ G+ ER ++ LL ++ + W + LP+++ NNS+ S
Sbjct: 965 NKGIQLEFSTPYHPQSSGKVERKNSDIKRLLTKLLVGRPAKWYDLLPVVQLALNNSYSPS 1024
Query: 581 IRMAPFEALYGRRCRTPLCWFESGESAMLAPEVVQETTEKVKMIQE 626
+ P + L+G TP F + ++ L+ E E++ ++QE
Sbjct: 1025 SKYTPHQLLFGIDSNTP---FANSDTLDLSRE------EELSLLQE 1061
>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
Length = 2186
Score = 73.6 bits (179), Expect = 2e-12
Identities = 109/459 (23%), Positives = 193/459 (41%), Gaps = 49/459 (10%)
Query: 255 VIIPGKANVVADVLSRKTLHMYALMVKELELIEQFGEL-SLVSELTPDGVRLGMLKLTSN 313
V + GKAN VAD LSR ELE EQ EL S+V+ + + L + +S
Sbjct: 1348 VYLAGKANAVADALSRG-----GCPPNELEE-EQTKELTSIVNAIQTE---LPDILDSSC 1398
Query: 314 ILEEIKN---GQKEDLELVD----RVTLVNQGKGGDFRLD---------ENDVLMFRDRV 357
LE +K G KE + ++ + T G + L+ +N + + R
Sbjct: 1399 WLERLKGEDEGWKEVIAALEGGKTKGTFKIVGIESEISLEYYKIVGGVLKNTEIEEQSRS 1458
Query: 358 CVPDVLDLKRQILDEGHRSSLSIHPGATKMYQDLKRLFWWPGMKKEIAEFVYACLVCQKS 417
VP+ ++ +L E H L+ H G KM++ + R F+WP M+ + V C C +
Sbjct: 1459 VVPE--KIRTPLLKELHEGMLAGHFGIKKMWRMVHRKFYWPQMRVCVENCVRTCAKCLCA 1516
Query: 418 MIEHQRPSGLMQPLFVPEWTWDSISMDFVGALPKTSKSFDTIWVIVDRLTKSTHFVPIKT 477
+H + + + P + + + ++ D + + + + I I+D TK VPI
Sbjct: 1517 N-DHSKLTSSLTP-YRMTFPLEIVACDLMD-VGLSVQGNRYILTIIDLFTKYGTAVPIPD 1573
Query: 478 SMSIARLAEIYIEQIVRLHGIPSSIVSDRDPRFTSNFWESLQAALGTKLSLSSAYHPQTD 537
+ L + IP +++D+ F + + L + + Y+ + +
Sbjct: 1574 KKAETVLKAFVERWAIGEGRIPLKLLTDQGKEFVNGLFAQFTHMLKIEHITTKGYNSRAN 1633
Query: 538 GQTERTIQSLEDLLRACVLEQGVSWDECLPLIEFTYNNSFHSSIRMAPFEALYGRRCRTP 597
G ER +++ +++ + WD+ + + YNN H + P ++GR P
Sbjct: 1634 GAVERFNKTIMHIMKKKTAVP-MEWDDQVVYAVYAYNNCVHENTGETPMFLMHGRDVMGP 1692
Query: 598 LCWFESGESAM---------LAPEVVQETTEKVKMIQEKMKASQST*KSYHDKR---RKD 645
L SGE A+ + QE + K+ +E Q + KS D++ +K
Sbjct: 1693 L--EMSGEDAVGINYADMDEYKHLLTQELLKVQKIAKEHAMREQESYKSLFDQKYASKKH 1750
Query: 646 IEFQVGDHVFLWVNPVTGVGHALKCRKLTPRFVGPFDVI 684
Q G V L + P +G +C KL ++ GP+ VI
Sbjct: 1751 RFPQPGSRVLLEI-PSEKLG--AQCPKLVNKWSGPYRVI 1786
>POL_GALV (P21414) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1165
Score = 67.8 bits (164), Expect = 1e-10
Identities = 80/316 (25%), Positives = 128/316 (40%), Gaps = 38/316 (12%)
Query: 381 HPGATKMYQDLKRL-FWWPGMKKEIAEFVYACLVC-QKSMIEHQRPSGLMQPLFVPEWTW 438
H G K+ Q + R P ++ + E C C + + R +G Q P W
Sbjct: 821 HLGPEKLLQLVNRTSLLIPNLQSAVREVTSQCQACAMTNAVTTYRETGKRQRGDRPGVYW 880
Query: 439 DSISMDFVGALPKTSKSFDTIWVIVDRLTKSTHFVPIKTSMSIARLAEIYIEQIVRLHGI 498
+ +DF P + + V +D + P KT ++ +I +E+I+ GI
Sbjct: 881 E---VDFTEIKPGRYGN-KYLLVFIDTFSGWVEAFPTKTETALIVCKKI-LEEILPRFGI 935
Query: 499 PSSIVSDRDPRFTSNFWESLQAALGTKLSLSSAYHPQTDGQTERTIQSLEDLLRACVLEQ 558
P + SD P F + + L LG L AY PQ+ GQ ER +++++ L LE
Sbjct: 936 PKVLGSDNGPAFVAQVSQGLATQLGINWKLHCAYRPQSSGQVERMNRTIKETLTKLALET 995
Query: 559 -GVSWDECLPLIEFTYNNSFHSSIRMAPFEALYGRRCRTPLCWFESGESAMLAPEVVQET 617
G W LPL N+ + P+E LYG P ESGE+ L P+
Sbjct: 996 GGKDWVTLLPLALLRARNT-PGRFGLTPYEILYG----GPPPILESGET--LGPD----- 1043
Query: 618 TEKVKMIQEKMKASQST*KSYHDKRRKDIE---------FQVGDHVFLWVNPVTGVGHAL 668
+ ++ +KA + D+ ++ + FQVGD V +
Sbjct: 1044 DRFLPVLFTHLKALEIVRTQIWDQIKEVYKPGTVTIPHPFQVGDQVLV---------RRH 1094
Query: 669 KCRKLTPRFVGPFDVI 684
+ L PR+ GP+ V+
Sbjct: 1095 RPSSLEPRWKGPYLVL 1110
>POL_MLVAV (P03356) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1196
Score = 67.0 bits (162), Expect = 2e-10
Identities = 84/342 (24%), Positives = 147/342 (42%), Gaps = 33/342 (9%)
Query: 352 MFRDRVCVPDVLDLKRQILDEGHRSSLSIHPGATKMYQDLKR---LFWWPGMKKEIAEFV 408
+F+ + +PD ++LD HR + H G KM L R ++ K +
Sbjct: 828 VFQGKPVMPDQFVF--ELLDSLHRLT---HLGYQKMKALLDRGESPYYMLNRDKTLQYVA 882
Query: 409 YACLVC-QKSMIEHQRPSGLMQPLFVPEWTWDSISMDFVGALPKTSKSFDTIWVIVDRLT 467
+C VC Q + + + +G+ P W+ +DF P + + V VD +
Sbjct: 883 DSCTVCAQVNASKAKIGAGVRVRGHRPGSHWE---IDFTEVKPGLY-GYKYLLVFVDTFS 938
Query: 468 KSTHFVPIKTSMSIARLAEIYIEQIVRLHGIPSSIVSDRDPRFTSNFWESLQAALGTKLS 527
P K + +++ +E+I G+P + SD P FTS +S+ LG
Sbjct: 939 GWVEAFPTKRETARV-VSKKLLEEIFPRFGMPQVLGSDNGPAFTSQVSQSVADLLGIDWK 997
Query: 528 LSSAYHPQTDGQTERTIQSLEDLLRACVLEQGV-SWDECLPLIEFTYNNSFHSSIRMAPF 586
L AY PQ+ GQ ER +++++ L L G W LPL + N+ + P+
Sbjct: 998 LHCAYRPQSSGQVERMNRTIKETLTKLTLAAGTRDWVLLLPLALYRARNT-PGPHGLTPY 1056
Query: 587 EALYGRRCRTPLCWFESGE-SAMLAPEVVQETTEKVKMIQEKMKASQST*KSYHDKRRKD 645
E LYG PL F + S + +Q + ++ +Q ++ + ++Y D+ +
Sbjct: 1057 EILYG--APPPLVNFHDPDMSELTNSPSLQAHLQALQTVQREIWKPLA--EAYRDQLDQP 1112
Query: 646 I---EFQVGDHVFLWVNPVTGVGHALKCRKLTPRFVGPFDVI 684
+ F++GD V WV + + L PR+ GP+ V+
Sbjct: 1113 VIPHPFRIGDSV--WV-------RRHQTKNLEPRWKGPYTVL 1145
>POL_BAEVM (P10272) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1189
Score = 66.2 bits (160), Expect = 3e-10
Identities = 75/293 (25%), Positives = 122/293 (41%), Gaps = 22/293 (7%)
Query: 395 FWWPGMKKEIAEFVYACLVCQKSMIEHQR-PSGLMQPLFVPEWTWDSISMDFVGALPKTS 453
F P I + AC VCQ+ R P+G P W+ +DF P +
Sbjct: 864 FLIPRASTLIEQVTSACKVCQQVNAGATRVPAGKRTRGNRPGVYWE---IDFTEVKPHYA 920
Query: 454 KSFDTIWVIVDRLTKSTHFVPIKTSMSIARLAEIYIEQIVRLHGIPSSIVSDRDPRFTSN 513
+ + V VD + P + + +A+ +E+I G+P I SD P F S
Sbjct: 921 -GYKYLLVFVDTFSGWVEAFPTRQETAHI-VAKKILEEIFPRFGLPKVIGSDNGPAFVSQ 978
Query: 514 FWESLQAALGTKLSLSSAYHPQTDGQTERTIQSLEDLLRACVLEQGV-SWDECLPLIEFT 572
+ L LG L AY PQ+ GQ ER +++++ L LE G+ W L L
Sbjct: 979 VSQGLARILGINWKLHCAYRPQSSGQVERMNRTIKETLTKLTLETGLKDWRRLLSLALLR 1038
Query: 573 YNNSFHSSIRMAPFEALYGRRCRTPLCWFESGESAMLAPEVVQETTEKVKMIQEKMKASQ 632
N+ + + P+E LYG PL + S + +Q + ++ +Q ++ A
Sbjct: 1039 ARNT-PNRFGLTPYEILYGG--PPPLSTLLNSFSPSNSKTDLQARLKGLQAVQAQIWAPL 1095
Query: 633 ST*KSYHDKRRKDIE-FQVGDHVFLWVNPVTGVGHALKCRKLTPRFVGPFDVI 684
+ + Y + FQVGD V++ + + L PR+ GP+ V+
Sbjct: 1096 A--ELYRPGHSQTSHPFQVGDSVYV---------RRHRSQGLEPRWKGPYIVL 1137
>POL_FENV1 (P31792) Pol polyprotein [Contains: Reverse
transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
(RT); Integrase (IN)] (Fragment)
Length = 1046
Score = 63.9 bits (154), Expect = 2e-09
Identities = 73/284 (25%), Positives = 117/284 (40%), Gaps = 22/284 (7%)
Query: 404 IAEFVYACLVCQKSMIEHQR-PSGLMQPLFVPEWTWDSISMDFVGALPKTSKSFDTIWVI 462
I + AC VCQ+ R P G P W+ +DF P + + + V
Sbjct: 730 IEQVTSACKVCQQVNAGATRVPEGKRTRGNRPGVYWE---IDFTEVKPHYA-GYKYLLVF 785
Query: 463 VDRLTKSTHFVPIKTSMSIARLAEIYIEQIVRLHGIPSSIVSDRDPRFTSNFWESLQAAL 522
VD + P + + +A+ +E+I G+P I SD P F S + L L
Sbjct: 786 VDTFSGWVEAYPTRQETA-HMVAKKILEEIFPRFGLPKVIGSDNGPAFVSQVSQGLARTL 844
Query: 523 GTKLSLSSAYHPQTDGQTERTIQSLEDLLRACVLEQGV-SWDECLPLIEFTYNNSFHSSI 581
G L AY PQ+ GQ ER +++++ L LE G+ W L L N+ +
Sbjct: 845 GINWKLHCAYRPQSSGQVERMNRTIKETLTKLTLETGLKDWRRLLSLALLRARNT-PNRF 903
Query: 582 RMAPFEALYGRRCRTPLCWFESGESAMLAPEVVQETTEKVKMIQEKMKASQST*KSYHDK 641
+ P+E LYG PL + S +Q + ++ +Q ++ + + Y
Sbjct: 904 GLTPYEILYGG--PPPLSTLLNSFSPSDPKTDLQARLKGLQAVQAQIWTPLA--ELYRPG 959
Query: 642 R-RKDIEFQVGDHVFLWVNPVTGVGHALKCRKLTPRFVGPFDVI 684
+ FQVGD V++ + G L PR+ GP+ V+
Sbjct: 960 HPQTSYPFQVGDSVYVRWHRSQG---------LEPRWKGPYIVL 994
>POL_MLVRK (P31795) Pol polyprotein [Contains: Protease (EC
3.4.23.-); Reverse transcriptase/ribonuclease H (EC
2.7.7.49) (EC 3.1.26.4) (RT); Integrase (IN)] (Fragment)
Length = 581
Score = 63.5 bits (153), Expect = 2e-09
Identities = 84/342 (24%), Positives = 143/342 (41%), Gaps = 33/342 (9%)
Query: 352 MFRDRVCVPDVLDLKRQILDEGHRSSLSIHPGATKMYQDLKR---LFWWPGMKKEIAEFV 408
+F+ + +PD ++LD HR + H G KM L R ++ K +
Sbjct: 213 VFQGKPVMPDQFVF--ELLDSLHRLT---HLGYQKMKALLDRGESPYYMLNRDKTLQYVA 267
Query: 409 YACLVC-QKSMIEHQRPSGLMQPLFVPEWTWDSISMDFVGALPKTSKSFDTIWVIVDRLT 467
+C VC Q + + + +G+ P W+ +DF P + + V VD +
Sbjct: 268 DSCTVCAQVNASKAKIGAGVRVRGHRPGTHWE---IDFTEVKPGLY-GYKYLLVFVDTFS 323
Query: 468 KSTHFVPIKTSMSIARLAEIYIEQIVRLHGIPSSIVSDRDPRFTSNFWESLQAALGTKLS 527
P K + ++ +E+I G+P + +D P F S +S+ LG
Sbjct: 324 GWVEAFPTKHETAKIVTKKL-LEEIFPRFGMPQVLGTDNGPAFVSQVSQSVAKLLGIDWK 382
Query: 528 LSSAYHPQTDGQTERTIQSLEDLLRACVLEQGV-SWDECLPLIEFTYNNSFHSSIRMAPF 586
L AY PQ+ GQ ER +++++ L L G W LPL + N+ + P+
Sbjct: 383 LHCAYRPQSSGQVERMNRTIKETLTKLTLATGTRDWVLLLPLALYRARNT-PGPHGLTPY 441
Query: 587 EALYGRRCRTPLCWFESGE-SAMLAPEVVQETTEKVKMIQEKMKASQST*KSYHDKRRKD 645
E LYG PL F E S +Q + ++ +Q ++ + +Y D+ +
Sbjct: 442 EILYG--APPPLVNFHDPEMSKFTNSPSLQAHLQALQAVQREVWKPLAA--AYQDQLDQP 497
Query: 646 I---EFQVGDHVFLWVNPVTGVGHALKCRKLTPRFVGPFDVI 684
+ F+VGD V WV + + L PR+ GP+ V+
Sbjct: 498 VIPHPFRVGDTV--WV-------RRHQTKNLEPRWKGPYTVL 530
>POL_MLVRD (P11227) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1196
Score = 63.5 bits (153), Expect = 2e-09
Identities = 84/342 (24%), Positives = 143/342 (41%), Gaps = 33/342 (9%)
Query: 352 MFRDRVCVPDVLDLKRQILDEGHRSSLSIHPGATKMYQDLKR---LFWWPGMKKEIAEFV 408
+F+ + +PD ++LD HR + H G KM L R ++ K +
Sbjct: 828 VFQGKPVMPDQFVF--ELLDSLHRLT---HLGYQKMKALLDRGESPYYMLNRDKTLQYVA 882
Query: 409 YACLVC-QKSMIEHQRPSGLMQPLFVPEWTWDSISMDFVGALPKTSKSFDTIWVIVDRLT 467
+C VC Q + + + +G+ P W+ +DF P + + V VD +
Sbjct: 883 DSCTVCAQVNASKAKIGAGVRVRGHRPGTHWE---IDFTEVKPGLY-GYKYLLVFVDTFS 938
Query: 468 KSTHFVPIKTSMSIARLAEIYIEQIVRLHGIPSSIVSDRDPRFTSNFWESLQAALGTKLS 527
P K + ++ +E+I G+P + +D P F S +S+ LG
Sbjct: 939 GWVEAFPTKHETAKIVTKKL-LEEIFPRFGMPQVLGTDNGPAFVSQVSQSVAKLLGIDWK 997
Query: 528 LSSAYHPQTDGQTERTIQSLEDLLRACVLEQGV-SWDECLPLIEFTYNNSFHSSIRMAPF 586
L AY PQ+ GQ ER +++++ L L G W LPL + N+ + P+
Sbjct: 998 LHCAYRPQSSGQVERMNRTIKETLTKLTLATGTRDWVLLLPLALYRARNT-PGPHGLTPY 1056
Query: 587 EALYGRRCRTPLCWFESGE-SAMLAPEVVQETTEKVKMIQEKMKASQST*KSYHDKRRKD 645
E LYG PL F E S +Q + ++ +Q ++ + +Y D+ +
Sbjct: 1057 EILYG--APPPLVNFHDPEMSKFTNSPSLQAHLQALQAVQREVWKPLAA--AYQDQLDQP 1112
Query: 646 I---EFQVGDHVFLWVNPVTGVGHALKCRKLTPRFVGPFDVI 684
+ F+VGD V WV + + L PR+ GP+ V+
Sbjct: 1113 VIPHPFRVGDTV--WV-------RRHQTKNLEPRWKGPYTVL 1145
>POL_MLVAK (P03357) Pol polyprotein [Contains: Reverse
transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
(RT); Integrase (IN)] (Fragment)
Length = 843
Score = 63.2 bits (152), Expect = 3e-09
Identities = 84/342 (24%), Positives = 147/342 (42%), Gaps = 34/342 (9%)
Query: 352 MFRDRVCVPDVLDLKRQILDEGHRSSLSIHPGATKMYQDLKR---LFWWPGMKKEIAEFV 408
+F+ + +PD ++LD HR + H G KM L R ++ K +
Sbjct: 476 VFQGKPVMPDQFVF--ELLDSLHRLT---HLGYQKMKALLDRGESPYYMLNRDKTLQYVA 530
Query: 409 YACLVC-QKSMIEHQRPSGLMQPLFVPEWTWDSISMDFVGALPKTSKSFDTIWVIVDRLT 467
+C VC Q + + + +G+ P W+ +DF P + + V VD +
Sbjct: 531 DSCTVCAQVNASKAKIGAGVRVRGHRPGSHWE---IDFTEVKPGLY-GYKYLLVFVDTFS 586
Query: 468 KSTHFVPIKTSMSIARLAEIYIEQIVRLHGIPSSIVSDRDPRFTSNFWESLQAALGTKLS 527
P K + +++ +E+I G+P + SD P FTS +S+ LG
Sbjct: 587 GWVEAFPTKRETARV-VSKKLLEEIFPRFGMPQVLGSDNGPAFTSQVSQSVADLLGID-K 644
Query: 528 LSSAYHPQTDGQTERTIQSLEDLLRACVLEQGV-SWDECLPLIEFTYNNSFHSSIRMAPF 586
L AY PQ+ GQ ER +++++ L L G W LPL + N+ + P+
Sbjct: 645 LHCAYRPQSSGQVERMNRTIKETLTKLTLAAGTRDWVLLLPLALYRARNT-PGPHGLTPY 703
Query: 587 EALYGRRCRTPLCWFESGE-SAMLAPEVVQETTEKVKMIQEKMKASQST*KSYHDKRRKD 645
E LYG PL F + S + +Q + ++ +Q ++ + ++Y D+ +
Sbjct: 704 EILYG--APPPLVNFHDPDMSELTNSPSLQAHLQALQTVQREIWKPLA--EAYRDQLDQP 759
Query: 646 I---EFQVGDHVFLWVNPVTGVGHALKCRKLTPRFVGPFDVI 684
+ F++GD V WV + + L PR+ GP+ V+
Sbjct: 760 VIPHPFRIGDSV--WV-------RRHQTKNLEPRWKGPYTVL 792
>POL3_MOUSE (P11367) Retrovirus-related Pol polyprotein
(Endonuclease) (Fragment)
Length = 390
Score = 61.2 bits (147), Expect = 1e-08
Identities = 70/287 (24%), Positives = 118/287 (40%), Gaps = 26/287 (9%)
Query: 403 EIAEFVYACLVCQKSMIEHQRPSGLMQPLFVPEWTWDSISMDFVGALPKTSKSFDTIWVI 462
E+AE AC+ S + + + + W +DF P + + V
Sbjct: 89 EVAESCQACVQVNASKTKIRAGTRVRGHRLGTHW-----EIDFTEVKPGLY-GYKYLLVF 142
Query: 463 VDRLTKSTHFVPIKTSMSIARLAEIYIEQIVRLHGIPSSIVSDRDPRFTSNFWESLQAAL 522
VD + P K + ++ +E+I G+P + +D P F S +S+ L
Sbjct: 143 VDTFSGWVEAFPTKHETAKIVTKKL-LEEIFPRFGMPQVLGTDNGPAFVSQVSQSVAKLL 201
Query: 523 GTKLSLSSAYHPQTDGQTERTIQSLEDLLRACVLEQGV-SWDECLPLIEFTYNNSFHSSI 581
G L AY PQ+ GQ ER +++++ L L G W LPL + N+
Sbjct: 202 GIDWKLHCAYRPQSSGQVERMNRTIKETLTKLTLATGTRDWVLLLPLALYRARNT-PGPH 260
Query: 582 RMAPFEALYGRRCRTPLCWFESGE-SAMLAPEVVQETTEKVKMIQEKMKASQST*KSYHD 640
+ P+E LYG PL F E S +Q + ++ +Q ++ + +Y D
Sbjct: 261 GLTPYEILYG--APPPLVNFHDPEMSKFTNSPSLQAHLQALQAVQREVWKPLAA--AYQD 316
Query: 641 KRRKDI---EFQVGDHVFLWVNPVTGVGHALKCRKLTPRFVGPFDVI 684
+ + + F+VGD V WV + + L PR+ GP+ V+
Sbjct: 317 QLDQPVIPHPFRVGDTV--WV-------RRHQTKNLEPRWKGPYTVL 354
>POL_MLVFP (P26808) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1204
Score = 57.4 bits (137), Expect = 1e-07
Identities = 52/201 (25%), Positives = 91/201 (44%), Gaps = 19/201 (9%)
Query: 489 IEQIVRLHGIPSSIVSDRDPRFTSNFWESLQAALGTKLSLSSAYHPQTDGQTERTIQSLE 548
+E+I G+P + +D P F S +++ LG L AY PQ+ GQ ER ++++
Sbjct: 964 LEEIFPRFGMPQVLGTDNGPAFVSKVSQTVADLLGVDWKLHCAYRPQSSGQVERMNRTIK 1023
Query: 549 DLLRACVLEQGV-SWDECLPLIEFTYNNSFHSSIRMAPFEALYGRRCRTPLCWFESGESA 607
+ L L G W LPL + N+ + P+E LYG PL F + A
Sbjct: 1024 ETLTKLTLATGSRDWVLLLPLALYRARNT-PGPHGLTPYEILYG--APPPLVNFPDPDMA 1080
Query: 608 MLAPE-VVQETTEKVKMIQEKMKASQST*KSYHDKRRKDI---EFQVGDHVFLWVNPVTG 663
+ +Q + + ++Q ++ + +Y ++ + + F+VGD V WV
Sbjct: 1081 KVTHNPSLQAHLQALYLVQHEVWRPLAA--AYQEQLDRPVVPHPFRVGDTV--WV----- 1131
Query: 664 VGHALKCRKLTPRFVGPFDVI 684
+ + L PR+ GP+ V+
Sbjct: 1132 --RRHQTKNLEPRWKGPYTVL 1150
>POL_MLVFF (P26809) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1204
Score = 57.4 bits (137), Expect = 1e-07
Identities = 52/201 (25%), Positives = 91/201 (44%), Gaps = 19/201 (9%)
Query: 489 IEQIVRLHGIPSSIVSDRDPRFTSNFWESLQAALGTKLSLSSAYHPQTDGQTERTIQSLE 548
+E+I G+P + +D P F S +++ LG L AY PQ+ GQ ER ++++
Sbjct: 964 LEEIFPRFGMPQVLGTDNGPAFVSKVSQTVADLLGVDWKLHCAYRPQSSGQVERMNRTIK 1023
Query: 549 DLLRACVLEQGV-SWDECLPLIEFTYNNSFHSSIRMAPFEALYGRRCRTPLCWFESGESA 607
+ L L G W LPL + N+ + P+E LYG PL F + A
Sbjct: 1024 ETLTKLTLATGSRDWVLLLPLALYRARNT-PGPHGLTPYEILYG--APPPLVNFPDPDMA 1080
Query: 608 MLAPE-VVQETTEKVKMIQEKMKASQST*KSYHDKRRKDI---EFQVGDHVFLWVNPVTG 663
+ +Q + + ++Q ++ + +Y ++ + + F+VGD V WV
Sbjct: 1081 KVTHNPSLQAHLQALYLVQHEVWRPLAA--AYQEQLDRPVVPHPFRVGDTV--WV----- 1131
Query: 664 VGHALKCRKLTPRFVGPFDVI 684
+ + L PR+ GP+ V+
Sbjct: 1132 --RRHQTKNLEPRWKGPYTVL 1150
>POL_MLVF5 (P26810) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1204
Score = 57.4 bits (137), Expect = 1e-07
Identities = 52/201 (25%), Positives = 91/201 (44%), Gaps = 19/201 (9%)
Query: 489 IEQIVRLHGIPSSIVSDRDPRFTSNFWESLQAALGTKLSLSSAYHPQTDGQTERTIQSLE 548
+E+I G+P + +D P F S +++ LG L AY PQ+ GQ ER ++++
Sbjct: 964 LEEIFPRFGMPQVLGTDNGPAFVSKVSQTVADLLGVDWKLHCAYRPQSSGQVERMNRTIK 1023
Query: 549 DLLRACVLEQGV-SWDECLPLIEFTYNNSFHSSIRMAPFEALYGRRCRTPLCWFESGESA 607
+ L L G W LPL + N+ + P+E LYG PL F + A
Sbjct: 1024 ETLTKLTLATGSRDWVLLLPLALYRARNT-PGPHGLTPYEILYG--APPPLVNFPDPDMA 1080
Query: 608 MLAPE-VVQETTEKVKMIQEKMKASQST*KSYHDKRRKDI---EFQVGDHVFLWVNPVTG 663
+ +Q + + ++Q ++ + +Y ++ + + F+VGD V WV
Sbjct: 1081 KVTHNPSLQAHLQALYLVQHEVWRPLAA--AYQEQLDRPVVPHPFRVGDTV--WV----- 1131
Query: 664 VGHALKCRKLTPRFVGPFDVI 684
+ + L PR+ GP+ V+
Sbjct: 1132 --RRHQTKNLEPRWKGPYTVL 1150
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.340 0.151 0.500
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 81,886,107
Number of Sequences: 164201
Number of extensions: 3224535
Number of successful extensions: 11413
Number of sequences better than 10.0: 54
Number of HSP's better than 10.0 without gapping: 36
Number of HSP's successfully gapped in prelim test: 18
Number of HSP's that attempted gapping in prelim test: 11334
Number of HSP's gapped (non-prelim): 71
length of query: 796
length of database: 59,974,054
effective HSP length: 118
effective length of query: 678
effective length of database: 40,598,336
effective search space: 27525671808
effective search space used: 27525671808
T: 11
A: 40
X1: 15 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 39 (21.9 bits)
S2: 70 (31.6 bits)
Medicago: description of AC146806.4
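
Note on the search statistics: the footer values above can be cross-checked against each other. The sketch below is not part of the BLAST output. It assumes the standard Karlin-Altschul relations (effective lengths obtained by subtracting the effective HSP length, bit score S' = (lambda*S - ln K) / ln 2, and E = K * m' * n' * exp(-lambda*S)) and uses only numbers printed in this report. The function and constant names are illustrative. Because the report prints lambda and K rounded to three significant digits, the recomputed bit scores and E-values are expected to agree only approximately with the printed ones.

```python
import math

# Values copied from the statistics footer of this report.
QUERY_LEN = 796            # "length of query"
DB_LETTERS = 59_974_054    # "length of database"
DB_SEQS = 164_201          # "Number of sequences in database"
HSP_LEN = 118              # "effective HSP length"
LAMBDA_GAPPED = 0.267      # gapped Lambda (rounded in the printout)
K_GAPPED = 0.0410          # gapped K (rounded in the printout)


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space)."""
    eff_query = query_len - hsp_len
    # The HSP length is subtracted once per database sequence.
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, lam, k, search_space):
    """Expected number of chance hits scoring at least raw_score."""
    return k * search_space * math.exp(-lam * raw_score)


eff_query, eff_db, space = effective_search_space(
    QUERY_LEN, DB_LETTERS, DB_SEQS, HSP_LEN
)
print("effective length of query:   ", eff_query)
print("effective length of database:", eff_db)
print("effective search space:      ", space)

# Top hit (RT23_SCHPO): raw score 579, reported as 227 bits, E = 8e-59.
top_bits = bit_score(579, LAMBDA_GAPPED, K_GAPPED)
top_e = expect_value(579, LAMBDA_GAPPED, K_GAPPED, space)
print(f"top hit: {top_bits:.1f} bits, E = {top_e:.1e}")

# Gapped cutoff S2: raw score 70, reported as 31.6 bits.
print(f"S2 cutoff: {bit_score(70, LAMBDA_GAPPED, K_GAPPED):.1f} bits")
```

The three effective-length figures are integer arithmetic and should reproduce the footer exactly; the bit score and E-value for the top hit should land close to the reported 227 bits and 8e-59.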