
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0083.15
(489 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran... 107 7e-23
RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protei... 102 2e-21
RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protei... 102 2e-21
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei... 102 2e-21
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III 97 9e-20
POL_GALV (P21414) Pol polyprotein [Contains: Protease (EC 3.4.23... 95 5e-19
POL_SFV1 (P23074) Pol polyprotein [Contains: Protease (EC 3.4.23... 92 4e-18
YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II 91 9e-18
POL_SFV3L (P27401) Pol polyprotein [Contains: Protease (EC 3.4.2... 91 9e-18
POL_MLVFP (P26808) Pol polyprotein [Contains: Protease (EC 3.4.2... 90 1e-17
POL_MLVFF (P26809) Pol polyprotein [Contains: Protease (EC 3.4.2... 90 1e-17
POL_MLVF5 (P26810) Pol polyprotein [Contains: Protease (EC 3.4.2... 90 1e-17
POL_MLVAV (P03356) Pol polyprotein [Contains: Protease (EC 3.4.2... 87 7e-17
POL_MLVMO (P03355) Pol polyprotein [Contains: Protease (EC 3.4.2... 86 2e-16
POL_MLVCB (P08361) Pol polyprotein [Contains: Reverse transcript... 85 4e-16
POL_MLVAK (P03357) Pol polyprotein [Contains: Reverse transcript... 84 8e-16
POL3_MOUSE (P11367) Retrovirus-related Pol polyprotein (Endonucl... 83 2e-15
POL_BAEVM (P10272) Pol polyprotein [Contains: Protease (EC 3.4.2... 82 2e-15
POL_MLVRK (P31795) Pol polyprotein [Contains: Protease (EC 3.4.2... 82 4e-15
POL_MLVRD (P11227) Pol polyprotein [Contains: Protease (EC 3.4.2... 82 4e-15
>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
transposon 412 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1237
Score = 107 bits (267), Expect = 7e-23
Identities = 80/323 (24%), Positives = 142/323 (43%), Gaps = 15/323 (4%)
Query: 138 VLAEIHEGSC-GHHLGGRSLARKVLRAGYYWPTLEKDAADHVKQCDPCQRHADLHHAPPA 196
+L+ +H+ G H G KV R YYW + K ++V++C CQ+ H
Sbjct: 896 ILSTLHDDPIQGGHTGITKTLAKVKRH-YYWKNMSKYIKEYVRKCQKCQKAKTTKHTKTP 954
Query: 197 RLSSLVSPWPFHQWGMDLLGPFDTAPG*LKYLVVAVDYYTKWIEAEPLATITSARVQRFF 256
+ F + +D +GP + +Y V + TK++ A P+A ++ V +
Sbjct: 955 MTITETPEHAFDRVVVDTIGPLPKSENGNEYAVTLICDLTKYLVAIPIANKSAKTVAKAI 1014
Query: 257 YKNVIRRFGVPGVLITDNGTQFTSKGFRDLLDGLHIKQRFTSVEHPQTNGQAESANRVIL 316
+++ I ++G ITD GT++ + DL L IK ++ H QT G E ++R +
Sbjct: 1015 FESFILKYGPMKTFITDMGTEYKNSIITDLCKYLKIKNITSTAHHHQTVGVVERSHRTLN 1074
Query: 317 RGLRKRLGDAKGDWAEQLDHVLWAYRTTPHSTTGESPYRLTFGTEAVIPAEIREPSARTA 376
+R + K DW L + ++ + TT PY L FG + +P + +
Sbjct: 1075 EYIRSYISTDKTDWDVWLQYFVYCFNTTQSMVHNYCPYELVFGRTSNLPKHFNKLHSIEP 1134
Query: 377 GFDLNQNDQLLGEDLDLLAERRAIANCRELIA--KQEAAIRYNKKVVLRSFAAGDLVLQN 434
++++ + L++ A A R+L+ K++ Y+ KV GD VL
Sbjct: 1135 IYNIDDYAKESKYRLEV-----AYARARKLLEAHKEKNKENYDLKVKDIELEVGDKVLLR 1189
Query: 435 ASIGARSTERGKLAAKWEGPYRV 457
+G KL K+ GPY++
Sbjct: 1190 NEVG------HKLDFKYTGPYKI 1206
>RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protein
type 3
Length = 1333
Score = 102 bits (254), Expect = 2e-21
Identities = 84/335 (25%), Positives = 147/335 (43%), Gaps = 16/335 (4%)
Query: 138 VLAEIHEGSCGHHLGGRSLARKVLRAGYYWPTLEKDAADHVKQCDPCQRHADLHHAPPAR 197
++ + HE H G L +LR + W + K ++V+ C CQ + +H P
Sbjct: 915 IIKKYHEEGKLIHPGIELLTNIILRR-FTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGP 973
Query: 198 LSSLV-SPWPFHQWGMDLLGPFDTAPG*LKYLVVAVDYYTKWIEAEPLA-TITSARVQRF 255
L + S P+ MD + + G L V VD ++K P +IT+ + R
Sbjct: 974 LQPIPPSERPWESLSMDFITALPESSG-YNALFVVVDRFSKMAILVPCTKSITAEQTARM 1032
Query: 256 FYKNVIRRFGVPGVLITDNGTQFTSKGFRDLLDGLHIKQRFTSVEHPQTNGQAESANRVI 315
F + VI FG P +I DN FTS+ ++D + +F+ PQT+GQ E N+ +
Sbjct: 1033 FDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTV 1092
Query: 316 LRGLRKRLGDAKGDWAEQLDHVLWAYRTTPHSTTGESPYRLTFG-TEAVIPAEIREPSAR 374
+ LR W + + V +Y HS T +P+ + + A+ P E+ S +
Sbjct: 1093 EKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLELPSFSDK 1152
Query: 375 TAGFDLNQNDQLLGEDLDLLAERRAIANCRELIAKQEAAIRYNKKVVLRSFAAGDLVLQN 434
T ++N Q E + + + N + K+ ++ + + F GDLV+
Sbjct: 1153 T-----DENSQ---ETIQVFQTVKEHLNTNNIKMKKYFDMKIQE---IEEFQPGDLVMVK 1201
Query: 435 ASIGARSTERGKLAAKWEGPYRVVEATGTGAYKLE 469
+ + KLA + GP+ V++ +G Y+L+
Sbjct: 1202 RTKTGFLHKSNKLAPSFAGPFYVLQKSGPNNYELD 1236
>RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protein
type 2
Length = 1333
Score = 102 bits (254), Expect = 2e-21
Identities = 84/335 (25%), Positives = 147/335 (43%), Gaps = 16/335 (4%)
Query: 138 VLAEIHEGSCGHHLGGRSLARKVLRAGYYWPTLEKDAADHVKQCDPCQRHADLHHAPPAR 197
++ + HE H G L +LR + W + K ++V+ C CQ + +H P
Sbjct: 915 IIKKYHEEGKLIHPGIELLTNIILRR-FTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGP 973
Query: 198 LSSLV-SPWPFHQWGMDLLGPFDTAPG*LKYLVVAVDYYTKWIEAEPLA-TITSARVQRF 255
L + S P+ MD + + G L V VD ++K P +IT+ + R
Sbjct: 974 LQPIPPSERPWESLSMDFITALPESSG-YNALFVVVDRFSKMAILVPCTKSITAEQTARM 1032
Query: 256 FYKNVIRRFGVPGVLITDNGTQFTSKGFRDLLDGLHIKQRFTSVEHPQTNGQAESANRVI 315
F + VI FG P +I DN FTS+ ++D + +F+ PQT+GQ E N+ +
Sbjct: 1033 FDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTV 1092
Query: 316 LRGLRKRLGDAKGDWAEQLDHVLWAYRTTPHSTTGESPYRLTFG-TEAVIPAEIREPSAR 374
+ LR W + + V +Y HS T +P+ + + A+ P E+ S +
Sbjct: 1093 EKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLELPSFSDK 1152
Query: 375 TAGFDLNQNDQLLGEDLDLLAERRAIANCRELIAKQEAAIRYNKKVVLRSFAAGDLVLQN 434
T ++N Q E + + + N + K+ ++ + + F GDLV+
Sbjct: 1153 T-----DENSQ---ETIQVFQTVKEHLNTNNIKMKKYFDMKIQE---IEEFQPGDLVMVK 1201
Query: 435 ASIGARSTERGKLAAKWEGPYRVVEATGTGAYKLE 469
+ + KLA + GP+ V++ +G Y+L+
Sbjct: 1202 RTKTGFLHKSNKLAPSFAGPFYVLQKSGPNNYELD 1236
>RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protein
type 1
Length = 1333
Score = 102 bits (254), Expect = 2e-21
Identities = 84/335 (25%), Positives = 147/335 (43%), Gaps = 16/335 (4%)
Query: 138 VLAEIHEGSCGHHLGGRSLARKVLRAGYYWPTLEKDAADHVKQCDPCQRHADLHHAPPAR 197
++ + HE H G L +LR + W + K ++V+ C CQ + +H P
Sbjct: 915 IIKKYHEEGKLIHPGIELLTNIILRR-FTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGP 973
Query: 198 LSSLV-SPWPFHQWGMDLLGPFDTAPG*LKYLVVAVDYYTKWIEAEPLA-TITSARVQRF 255
L + S P+ MD + + G L V VD ++K P +IT+ + R
Sbjct: 974 LQPIPPSERPWESLSMDFITALPESSG-YNALFVVVDRFSKMAILVPCTKSITAEQTARM 1032
Query: 256 FYKNVIRRFGVPGVLITDNGTQFTSKGFRDLLDGLHIKQRFTSVEHPQTNGQAESANRVI 315
F + VI FG P +I DN FTS+ ++D + +F+ PQT+GQ E N+ +
Sbjct: 1033 FDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTV 1092
Query: 316 LRGLRKRLGDAKGDWAEQLDHVLWAYRTTPHSTTGESPYRLTFG-TEAVIPAEIREPSAR 374
+ LR W + + V +Y HS T +P+ + + A+ P E+ S +
Sbjct: 1093 EKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLELPSFSDK 1152
Query: 375 TAGFDLNQNDQLLGEDLDLLAERRAIANCRELIAKQEAAIRYNKKVVLRSFAAGDLVLQN 434
T ++N Q E + + + N + K+ ++ + + F GDLV+
Sbjct: 1153 T-----DENSQ---ETIQVFQTVKEHLNTNNIKMKKYFDMKIQE---IEEFQPGDLVMVK 1201
Query: 435 ASIGARSTERGKLAAKWEGPYRVVEATGTGAYKLE 469
+ + KLA + GP+ V++ +G Y+L+
Sbjct: 1202 RTKTGFLHKSNKLAPSFAGPFYVLQKSGPNNYELD 1236
>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
Length = 2186
Score = 97.1 bits (240), Expect = 9e-20
Identities = 82/333 (24%), Positives = 145/333 (42%), Gaps = 12/333 (3%)
Query: 138 VLAEIHEGSCGHHLGGRSLARKVLRAGYYWPTLEKDAADHVKQCDPCQRHADLHHAPPAR 197
+L E+HEG H G + + R V R +YWP + + V+ C C D H +
Sbjct: 1468 LLKELHEGMLAGHFGIKKMWRMVHRK-FYWPQMRVCVENCVRTCAKCLCAND-HSKLTSS 1525
Query: 198 LSSLVSPWPFHQWGMDLLGPFDTAPG*LKYLVVAVDYYTKWIEAEPLATITSARVQRFFY 257
L+ +P DL+ + G +Y++ +D +TK+ A P+ + V + F
Sbjct: 1526 LTPYRMTFPLEIVACDLMDVGLSVQG-NRYILTIIDLFTKYGTAVPIPDKKAETVLKAFV 1584
Query: 258 KN-VIRRFGVPGVLITDNGTQFTSKGFRDLLDGLHIKQRFTSVEHPQTNGQAESANRVIL 316
+ I +P L+TD G +F + F L I+ T + + NG E N+ I+
Sbjct: 1585 ERWAIGEGRIPLKLLTDQGKEFVNGLFAQFTHMLKIEHITTKGYNSRANGAVERFNKTIM 1644
Query: 317 RGLRKRLGDAKGDWAEQLDHVLWAYRTTPHSTTGESPYRLTFGTEAVIPAEIREPSARTA 376
++K+ +W +Q+ + ++AY H TGE+P L G + + P E+ A
Sbjct: 1645 HIMKKKTA-VPMEWDDQVVYAVYAYNNCVHENTGETPMFLMHGRDVMGPLEMSGEDAVGI 1703
Query: 377 GF-DLNQNDQLLGEDLDLLAERRAIANCRELIAKQEAAIRYNKKVVLRSF---AAGDLVL 432
+ D+++ LL ++ L + + IA + ++ +++K + G VL
Sbjct: 1704 NYADMDEYKHLLTQE---LLKVQKIAKEHAMREQESYKSLFDQKYASKKHRFPQPGSRVL 1760
Query: 433 QNASIGARSTERGKLAAKWEGPYRVVEATGTGA 465
+ KL KW GPYRV+ + A
Sbjct: 1761 LEIPSEKLGAQCPKLVNKWSGPYRVISCSENSA 1793
>POL_GALV (P21414) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1165
Score = 94.7 bits (234), Expect = 5e-19
Identities = 95/348 (27%), Positives = 144/348 (41%), Gaps = 33/348 (9%)
Query: 130 LAKDRAAYVLAEIHEGSCGHHLGGRSLARKVLRAGYYWPTLEKDAADHVKQCDPC-QRHA 188
L DR + +H+ + HLG L + V R P L+ + QC C +A
Sbjct: 804 LTPDRGKEFIKRLHQLT---HLGPEKLLQLVNRTSLLIPNLQSAVREVTSQCQACAMTNA 860
Query: 189 DLHHAPPARLSSLVSPWPFHQWGMDLLGPFDTAPG*L--KYLVVAVDYYTKWIEAEPLAT 246
+ + P + W +D + PG KYL+V +D ++ W+EA P T
Sbjct: 861 VTTYRETGKRQRGDRPGVY--WEVDFT---EIKPGRYGNKYLLVFIDTFSGWVEAFPTKT 915
Query: 247 ITSARVQRFFYKNVIRRFGVPGVLITDNGTQFTSKGFRDLLDGLHIKQRFTSVEHPQTNG 306
T+ V + + ++ RFG+P VL +DNG F ++ + L L I + PQ++G
Sbjct: 916 ETALIVCKKILEEILPRFGIPKVLGSDNGPAFVAQVSQGLATQLGINWKLHCAYRPQSSG 975
Query: 307 QAESANRVILRGLRKRLGDAKG-DWAEQLDHVLWAYRTTPHSTTGESPYRLTFGTEAVIP 365
Q E NR I L K + G DW L L R TP G +PY + +G
Sbjct: 976 QVERMNRTIKETLTKLALETGGKDWVTLLPLALLRARNTP-GRFGLTPYEILYG------ 1028
Query: 366 AEIREPSARTAGFDLNQNDQLLGEDLDLLAERRAIANCRELIAKQ-EAAIRYNKKVVLRS 424
P +G L +D+ L L +A+ R I Q + + +
Sbjct: 1029 ---GPPPILESGETLGPDDRFLPV---LFTHLKALEIVRTQIWDQIKEVYKPGTVTIPHP 1082
Query: 425 FAAGDLVLQNASIGARSTERGKLAAKWEGPYRVVEATGTGAYKLETLA 472
F GD VL R L +W+GPY V+ T T A K++ +A
Sbjct: 1083 FQVGDQVL------VRRHRPSSLEPRWKGPYLVLLTTPT-AVKVDGIA 1123
>POL_SFV1 (P23074) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1161
Score = 91.7 bits (226), Expect = 4e-18
Identities = 76/257 (29%), Positives = 117/257 (44%), Gaps = 18/257 (7%)
Query: 146 SCGHHLG--GRSLARKVLRAGYYWPTLEKDAADHVKQCDPCQRHADLHHAPPARLSSLVS 203
S H++ GR + + Y+WP L KD ++QC C + P L +
Sbjct: 821 STAHNIAHTGRDATFLKVSSKYWWPNLRKDVVKSIRQCKQCLVTNATNLTSPPILRPVKP 880
Query: 204 PWPFHQWGMDLLGPFDTAPG*LKYLVVAVDYYTKWIEAEPL-ATITSARVQRFFYKNVIR 262
PF ++ +D +GP + G L LVV VD T ++ P A TSA V+ N++
Sbjct: 881 LKPFDKFYIDYIGPLPPSNGYLHVLVV-VDSMTGFVWLYPTKAPSTSATVKAL---NMLT 936
Query: 263 RFGVPGVLITDNGTQFTSKGFRDLLDGLHIKQRFTSVEHPQTNGQAESANRVILRGLRKR 322
+P VL +D G FTS F D I+ F++ HPQ++G+ E N I R L K
Sbjct: 937 SIAIPKVLHSDQGAAFTSSTFADWAKEKGIQLEFSTPYHPQSSGKVERKNSDIKRLLTKL 996
Query: 323 LGDAKGDWAEQLDHVLWAYRTTPHSTTGESPYRLTFGTEAVIPAEIREPSARTAGFDLNQ 382
L W + L V A + ++ +P++L FG ++ P A + DL++
Sbjct: 997 LIGRPAKWYDLLPVVQLALNNSYSPSSKYTPHQLLFGVDS------NTPFANSDTLDLSR 1050
Query: 383 NDQLLGEDLDLLAERRA 399
E+L LL E R+
Sbjct: 1051 E-----EELSLLQEIRS 1062
>YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II
Length = 1268
Score = 90.5 bits (223), Expect = 9e-18
Identities = 71/243 (29%), Positives = 116/243 (47%), Gaps = 24/243 (9%)
Query: 138 VLAEIHEGSCGHHLGGRSLARKVLRAGYYWPTLEKDAADHVKQCDPCQRHADLHHAPPAR 197
VL ++HEG H G + +K R+ +W L+ D + V+ C+ CQ ++ + P
Sbjct: 786 VLKQLHEG----HPGIVQMKQKA-RSFVFWRGLDSDIENMVRHCNNCQENSKMPRVVP-- 838
Query: 198 LSSLVSPWPFHQ--WG---MDLLGPFDTAPG*LKYLVVAVDYYTKWIEAEPLATITSARV 252
++PWP + W +D GP + YL+V VD TK+ E + +I SA
Sbjct: 839 ----LNPWPVPEAPWKRIHIDFAGPLNGC-----YLLVVVDAKTKYAEVKLTRSI-SAVT 888
Query: 253 QRFFYKNVIRRFGVPGVLITDNGTQFTSKGFRDLLDGLHIKQRFTSVEHPQTNGQAESAN 312
+ + G P +I+DNGTQ TS F + I+ + ++V +P++NG AE
Sbjct: 889 TIDLLEEIFSIHGYPETIISDNGTQLTSHLFAQMCQSHGIEHKTSAVYYPRSNGAAERFV 948
Query: 313 RVILRGLRKRLGDAKGDWAEQLDHVLWAYRTTPHST-TGESPYRLTFGTEAVIPAEIREP 371
+ RG+ K G+ + + L+ L +YR TPHS G +P FG + + P
Sbjct: 949 DTLKRGIAKIKGEGSVN-QQILNKFLISYRNTPHSALNGSTPAECHFGRKIRTTMSLLMP 1007
Query: 372 SAR 374
+ R
Sbjct: 1008 TDR 1010
>POL_SFV3L (P27401) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1157
Score = 90.5 bits (223), Expect = 9e-18
Identities = 98/338 (28%), Positives = 143/338 (41%), Gaps = 34/338 (10%)
Query: 150 HLGGRSLARKVLRAGYYWPTLEKDAADHVKQCDPCQRHADLHHAPPARLSSLVSPWPFHQ 209
H G S KV + Y+WP L KD ++QC C A P L PF +
Sbjct: 830 HTGRDSTFLKV-SSKYWWPNLRKDVVKVIRQCKQCLVTNAATLAAPPILRPERPVKPFDK 888
Query: 210 WGMDLLGPFDTAPG*LKYLVVAVDYYTKWIEAEPL-ATITSARVQRFFYKNVIRRFGVPG 268
+ +D +GP + G L LVV VD T ++ P A TSA V+ N++ VP
Sbjct: 889 FFIDYIGPLPPSNGYLHVLVV-VDSMTGFVWLYPTKAPSTSATVKAL---NMLTSIAVPK 944
Query: 269 VLITDNGTQFTSKGFRDLLDGLHIKQRFTSVEHPQTNGQAESANRVILRGLRKRLGDAKG 328
V+ +D G FTS F D I+ F++ HPQ++G+ E N I R L K L
Sbjct: 945 VIHSDQGAAFTSATFADWAKNKGIQLEFSTPYHPQSSGKVERKNSDIKRLLTKLLVGRPA 1004
Query: 329 DWAEQLDHVLWAYRTTPHSTTGESPYRLTFGTEAVIPAEIREPSARTAGFDLNQNDQLLG 388
W + L V A + ++ +P++L FG ++ P A + DL++
Sbjct: 1005 KWYDLLPVVQLALNNSYSPSSKYTPHQLLFGIDS------NTPFANSDTLDLSRE----- 1053
Query: 389 EDLDLLAERRAIANCRELIAKQEAAIRYNKKVVLRSFAAGDLVLQNASIGARSTERGKLA 448
E+L LL E I + L + A+IR S + G LV + R L
Sbjct: 1054 EELSLLQE---IRSSLYLPSTPPASIR------AWSPSVGQLVQE------RVARPASLR 1098
Query: 449 AKWEGPYRVVEATGTGAYKLETLAGKEIPRTWNATNLR 486
+W P V+E A + G RT + NL+
Sbjct: 1099 PRWHKPTPVLEVINPRAVVILDHLGNR--RTVSVDNLK 1134
>POL_MLVFP (P26808) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1204
Score = 89.7 bits (221), Expect = 1e-17
Identities = 78/250 (31%), Positives = 114/250 (45%), Gaps = 22/250 (8%)
Query: 226 KYLVVAVDYYTKWIEAEPLATITSARVQRFFYKNVIRRFGVPGVLITDNGTQFTSKGFRD 285
KYL+V VD ++ W+EA P T+ V + + + RFG+P VL TDNG F SK +
Sbjct: 933 KYLLVFVDTFSGWVEAFPTKKETAKVVTKKLLEEIFPRFGMPQVLGTDNGPAFVSKVSQT 992
Query: 286 LLDGLHIKQRFTSVEHPQTNGQAESANRVILRGLRK-RLGDAKGDWAEQLDHVLWAYRTT 344
+ D L + + PQ++GQ E NR I L K L DW L L+ R T
Sbjct: 993 VADLLGVDWKLHCAYRPQSSGQVERMNRTIKETLTKLTLATGSRDWVLLLPLALYRARNT 1052
Query: 345 PHSTTGESPYRLTFGTEAVIPAEIREPSARTAGFDLNQNDQLLGEDLDLLAER--RAIAN 402
P G +PY + +G P + P A N + Q + L L+ R +A
Sbjct: 1053 P-GPHGLTPYEILYGAP---PPLVNFPDPDMAKVTHNPSLQAHLQALYLVQHEVWRPLA- 1107
Query: 403 CRELIAKQEAAIRYNKKVVLRSFAAGDLVLQNASIGARSTERGKLAAKWEGPYRVVEATG 462
A QE + ++ VV F GD ++ R + L +W+GPY V+ T
Sbjct: 1108 ----AAYQE---QLDRPVVPHPFRVGD------TVWVRRHQTKNLEPRWKGPYTVLLTTP 1154
Query: 463 TGAYKLETLA 472
T A K++ +A
Sbjct: 1155 T-ALKVDGIA 1163
>POL_MLVFF (P26809) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1204
Score = 89.7 bits (221), Expect = 1e-17
Identities = 87/313 (27%), Positives = 135/313 (42%), Gaps = 28/313 (8%)
Query: 165 YYWPTLEKDAADHVKQCDPCQRHADLHHAPPARLSSLVSPWPFHQWGMDLLGPFDTAPG* 224
YY ++ D + C C + + + + + + P W +D + PG
Sbjct: 874 YYMLNRDRTLKDITETCQACAQ-VNASKSAVKQGTRVRGHRPGTHWEIDFT---EVKPGL 929
Query: 225 L--KYLVVAVDYYTKWIEAEPLATITSARVQRFFYKNVIRRFGVPGVLITDNGTQFTSKG 282
KYL+V +D ++ W+EA P T+ V + + + RFG+P VL TDNG F SK
Sbjct: 930 YGYKYLLVFIDTFSGWVEAFPTKKETAKVVTKKLLEEIFPRFGMPQVLGTDNGPAFVSKV 989
Query: 283 FRDLLDGLHIKQRFTSVEHPQTNGQAESANRVILRGLRK-RLGDAKGDWAEQLDHVLWAY 341
+ + D L + + PQ++GQ E NR I L K L DW L L+
Sbjct: 990 SQTVADLLGVDWKLHCAYRPQSSGQVERMNRTIKETLTKLTLATGSRDWVLLLPLALYRA 1049
Query: 342 RTTPHSTTGESPYRLTFGTEAVIPAEIREPSARTAGFDLNQNDQLLGEDLDLLAER--RA 399
R TP G +PY + +G P + P A N + Q + L L+ R
Sbjct: 1050 RNTP-GPHGLTPYEILYGAP---PPLVNFPDPDMAKVTHNPSLQAHLQALYLVQHEVWRP 1105
Query: 400 IANCRELIAKQEAAIRYNKKVVLRSFAAGDLVLQNASIGARSTERGKLAAKWEGPYRVVE 459
+A A QE + ++ VV F GD ++ R + L +W+GPY V+
Sbjct: 1106 LA-----AAYQE---QLDRPVVPHPFRVGD------TVWVRRHQTKNLEPRWKGPYTVLL 1151
Query: 460 ATGTGAYKLETLA 472
T T A K++ +A
Sbjct: 1152 TTPT-ALKVDGIA 1163
>POL_MLVF5 (P26810) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1204
Score = 89.7 bits (221), Expect = 1e-17
Identities = 78/250 (31%), Positives = 114/250 (45%), Gaps = 22/250 (8%)
Query: 226 KYLVVAVDYYTKWIEAEPLATITSARVQRFFYKNVIRRFGVPGVLITDNGTQFTSKGFRD 285
KYL+V VD ++ W+EA P T+ V + + + RFG+P VL TDNG F SK +
Sbjct: 933 KYLLVFVDTFSGWVEAFPTKKETAKVVTKKLLEEIFPRFGMPQVLGTDNGPAFVSKVSQT 992
Query: 286 LLDGLHIKQRFTSVEHPQTNGQAESANRVILRGLRK-RLGDAKGDWAEQLDHVLWAYRTT 344
+ D L + + PQ++GQ E NR I L K L DW L L+ R T
Sbjct: 993 VADLLGVDWKLHCAYRPQSSGQVERMNRTIKETLTKLTLATGSRDWVLLLPLALYRARNT 1052
Query: 345 PHSTTGESPYRLTFGTEAVIPAEIREPSARTAGFDLNQNDQLLGEDLDLLAER--RAIAN 402
P G +PY + +G P + P A N + Q + L L+ R +A
Sbjct: 1053 P-GPHGLTPYEILYGAP---PPLVNFPDPDMAKVTHNPSLQAHLQALYLVQHEVWRPLA- 1107
Query: 403 CRELIAKQEAAIRYNKKVVLRSFAAGDLVLQNASIGARSTERGKLAAKWEGPYRVVEATG 462
A QE + ++ VV F GD ++ R + L +W+GPY V+ T
Sbjct: 1108 ----AAYQE---QLDRPVVPHPFRVGD------TVWVRRHQTKNLEPRWKGPYTVLLTTP 1154
Query: 463 TGAYKLETLA 472
T A K++ +A
Sbjct: 1155 T-ALKVDGIA 1163
>POL_MLVAV (P03356) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1196
Score = 87.4 bits (215), Expect = 7e-17
Identities = 72/242 (29%), Positives = 109/242 (44%), Gaps = 23/242 (9%)
Query: 226 KYLVVAVDYYTKWIEAEPLATITSARVQRFFYKNVIRRFGVPGVLITDNGTQFTSKGFRD 285
KYL+V VD ++ W+EA P T+ V + + + RFG+P VL +DNG FTS+ +
Sbjct: 928 KYLLVFVDTFSGWVEAFPTKRETARVVSKKLLEEIFPRFGMPQVLGSDNGPAFTSQVSQS 987
Query: 286 LLDGLHIKQRFTSVEHPQTNGQAESANRVILRGLRK-RLGDAKGDWAEQLDHVLWAYRTT 344
+ D L I + PQ++GQ E NR I L K L DW L L+ R T
Sbjct: 988 VADLLGIDWKLHCAYRPQSSGQVERMNRTIKETLTKLTLAAGTRDWVLLLPLALYRARNT 1047
Query: 345 PHSTTGESPYRLTFGTEAVIPAEIREPSARTAGFDLNQNDQLLGEDLDLLAERRAIANCR 404
P G +PY + +G + +P D+++ L L A +A+ +
Sbjct: 1048 P-GPHGLTPYEILYGAPPPL-VNFHDP-------DMSE----LTNSPSLQAHLQALQTVQ 1094
Query: 405 ELIAKQEAAI---RYNKKVVLRSFAAGDLVLQNASIGARSTERGKLAAKWEGPYRVVEAT 461
I K A + ++ V+ F GD S+ R + L +W+GPY V+ T
Sbjct: 1095 REIWKPLAEAYRDQLDQPVIPHPFRIGD------SVWVRRHQTKNLEPRWKGPYTVLLTT 1148
Query: 462 GT 463
T
Sbjct: 1149 PT 1150
>POL_MLVMO (P03355) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1199
Score = 85.9 bits (211), Expect = 2e-16
Identities = 74/251 (29%), Positives = 114/251 (44%), Gaps = 24/251 (9%)
Query: 226 KYLVVAVDYYTKWIEAEPLATITSARVQRFFYKNVIRRFGVPGVLITDNGTQFTSKGFRD 285
KYL+V +D ++ WIEA P T+ V + + + RFG+P VL TDNG F SK +
Sbjct: 928 KYLLVFIDTFSGWIEAFPTKKETAKVVTKKLLEEIFPRFGMPQVLGTDNGPAFVSKVSQT 987
Query: 286 LLDGLHIKQRFTSVEHPQTNGQAESANRVILRGLRK-RLGDAKGDWAEQLDHVLWAYRTT 344
+ D L I + PQ++GQ E NR I L K L DW L L+ R T
Sbjct: 988 VADLLGIDWKLHCAYRPQSSGQVERMNRTIKETLTKLTLATGSRDWVLLLPLALYRARNT 1047
Query: 345 PHSTTGESPYRLTFGTEAVIPAEIREPSARTAGFDLNQNDQLLGEDLDLLAERRAIANCR 404
P G +PY + +G P + P D+ + + L A +A+ +
Sbjct: 1048 P-GPHGLTPYEILYGAP---PPLVNFPDP-----DMTR----VTNSPSLQAHLQALYLVQ 1094
Query: 405 ELIAKQEAAI---RYNKKVVLRSFAAGDLVLQNASIGARSTERGKLAAKWEGPYRVVEAT 461
+ + AA + ++ VV + GD ++ R + L +W+GPY V+ T
Sbjct: 1095 HEVWRPLAAAYQEQLDRPVVPHPYRVGD------TVWVRRHQTKNLEPRWKGPYTVLLTT 1148
Query: 462 GTGAYKLETLA 472
T A K++ +A
Sbjct: 1149 PT-ALKVDGIA 1158
>POL_MLVCB (P08361) Pol polyprotein [Contains: Reverse
transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
(RT); Integrase (IN)] (Fragment)
Length = 282
Score = 85.1 bits (209), Expect = 4e-16
Identities = 72/242 (29%), Positives = 108/242 (43%), Gaps = 23/242 (9%)
Query: 226 KYLVVAVDYYTKWIEAEPLATITSARVQRFFYKNVIRRFGVPGVLITDNGTQFTSKGFRD 285
KYL+V VD ++ WIEA P T+ V + + + RFG+P VL TDNG F SK +
Sbjct: 14 KYLLVFVDTFSGWIEAFPTKKETAKVVTKKLLEEIFPRFGMPQVLGTDNGPAFVSKVSQT 73
Query: 286 LLDGLHIKQRFTSVEHPQTNGQAESANRVILRGLRK-RLGDAKGDWAEQLDHVLWAYRTT 344
+ D L I + PQ++GQ E NR I L K L DW L L+ R T
Sbjct: 74 VADLLGIDWKLHCAYRPQSSGQVERMNRTIKETLTKLTLATGSRDWVLLLPLALYRARNT 133
Query: 345 PHSTTGESPYRLTFGTEAVIPAEIREPSARTAGFDLNQNDQLLGEDLDLLAERRAIANCR 404
P G +PY + +G P + P D+ + + L A +A+ +
Sbjct: 134 P-GPHGLTPYEILYGAP---PPLVNFPDP-----DMTR----VTNSPSLQAHLQALYLVQ 180
Query: 405 ELIAKQEAAI---RYNKKVVLRSFAAGDLVLQNASIGARSTERGKLAAKWEGPYRVVEAT 461
+ + AA + ++ VV + GD ++ R + L +W+GPY V+ T
Sbjct: 181 HEVWRPLAAAYQEQLDRPVVPHPYRVGD------TVWVRRHQTKNLEPRWKGPYTVLLTT 234
Query: 462 GT 463
T
Sbjct: 235 PT 236
>POL_MLVAK (P03357) Pol polyprotein [Contains: Reverse
transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
(RT); Integrase (IN)] (Fragment)
Length = 843
Score = 84.0 bits (206), Expect = 8e-16
Identities = 72/242 (29%), Positives = 109/242 (44%), Gaps = 24/242 (9%)
Query: 226 KYLVVAVDYYTKWIEAEPLATITSARVQRFFYKNVIRRFGVPGVLITDNGTQFTSKGFRD 285
KYL+V VD ++ W+EA P T+ V + + + RFG+P VL +DNG FTS+ +
Sbjct: 576 KYLLVFVDTFSGWVEAFPTKRETARVVSKKLLEEIFPRFGMPQVLGSDNGPAFTSQVSQS 635
Query: 286 LLDGLHIKQRFTSVEHPQTNGQAESANRVILRGLRK-RLGDAKGDWAEQLDHVLWAYRTT 344
+ D L I + PQ++GQ E NR I L K L DW L L+ R T
Sbjct: 636 VADLLGI-DKLHCAYRPQSSGQVERMNRTIKETLTKLTLAAGTRDWVLLLPLALYRARNT 694
Query: 345 PHSTTGESPYRLTFGTEAVIPAEIREPSARTAGFDLNQNDQLLGEDLDLLAERRAIANCR 404
P G +PY + +G + +P D+++ L L A +A+ +
Sbjct: 695 P-GPHGLTPYEILYGAPPPL-VNFHDP-------DMSE----LTNSPSLQAHLQALQTVQ 741
Query: 405 ELIAKQEAAI---RYNKKVVLRSFAAGDLVLQNASIGARSTERGKLAAKWEGPYRVVEAT 461
I K A + ++ V+ F GD S+ R + L +W+GPY V+ T
Sbjct: 742 REIWKPLAEAYRDQLDQPVIPHPFRIGD------SVWVRRHQTKNLEPRWKGPYTVLLTT 795
Query: 462 GT 463
T
Sbjct: 796 PT 797
>POL3_MOUSE (P11367) Retrovirus-related Pol polyprotein
(Endonuclease) (Fragment)
Length = 390
Score = 82.8 bits (203), Expect = 2e-15
Identities = 72/251 (28%), Positives = 112/251 (43%), Gaps = 24/251 (9%)
Query: 226 KYLVVAVDYYTKWIEAEPLATITSARVQRFFYKNVIRRFGVPGVLITDNGTQFTSKGFRD 285
KYL+V VD ++ W+EA P T+ V + + + RFG+P VL TDNG F S+ +
Sbjct: 137 KYLLVFVDTFSGWVEAFPTKHETAKIVTKKLLEEIFPRFGMPQVLGTDNGPAFVSQVSQS 196
Query: 286 LLDGLHIKQRFTSVEHPQTNGQAESANRVILRGLRK-RLGDAKGDWAEQLDHVLWAYRTT 344
+ L I + PQ++GQ E NR I L K L DW L L+ R T
Sbjct: 197 VAKLLGIDWKLHCAYRPQSSGQVERMNRTIKETLTKLTLATGTRDWVLLLPLALYRARNT 256
Query: 345 PHSTTGESPYRLTFGTEAVIPAEIREPSARTAGFDLNQNDQLLGEDLDLLAERRAIANCR 404
P G +PY + +G P + + F + + L A +A+ +
Sbjct: 257 P-GPHGLTPYEILYGAP---PPLVNFHDPEMSKFTNSPS---------LQAHLQALQAVQ 303
Query: 405 ELIAKQEAAI---RYNKKVVLRSFAAGDLVLQNASIGARSTERGKLAAKWEGPYRVVEAT 461
+ K AA + ++ V+ F GD ++ R + L +W+GPY V+ T
Sbjct: 304 REVWKPLAAAYQDQLDQPVIPHPFRVGD------TVWVRRHQTKNLEPRWKGPYTVLLTT 357
Query: 462 GTGAYKLETLA 472
T A K++ +A
Sbjct: 358 PT-ALKVDGIA 367
>POL_BAEVM (P10272) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1189
Score = 82.4 bits (202), Expect = 2e-15
Identities = 88/336 (26%), Positives = 132/336 (39%), Gaps = 25/336 (7%)
Query: 139 LAEIHEGSCGHHLGGRSLARKVLRAGYYWPTLEKDAADHVKQCDPCQR-HADLHHAPPAR 197
LA I + HLG R L + + + P C CQ+ +A P +
Sbjct: 838 LAMIQQMHAWTHLGNRKLKLLIEKTDFLIPRASTLIEQVTSACKVCQQVNAGATRVPAGK 897
Query: 198 LSSLVSPWPFHQWGMDLLGPFDTAPG*LKYLVVAVDYYTKWIEAEPLATITSARVQRFFY 257
+ P + W +D G KYL+V VD ++ W+EA P T+ V +
Sbjct: 898 RTRGNRPGVY--WEIDFTEVKPHYAG-YKYLLVFVDTFSGWVEAFPTRQETAHIVAKKIL 954
Query: 258 KNVIRRFGVPGVLITDNGTQFTSKGFRDLLDGLHIKQRFTSVEHPQTNGQAESANRVILR 317
+ + RFG+P V+ +DNG F S+ + L L I + PQ++GQ E NR I
Sbjct: 955 EEIFPRFGLPKVIGSDNGPAFVSQVSQGLARILGINWKLHCAYRPQSSGQVERMNRTIKE 1014
Query: 318 GLRK-RLGDAKGDWAEQLDHVLWAYRTTPHSTTGESPYRLTFGTEAVIPAEIREPSARTA 376
L K L DW L L R TP + G +PY + +G + + S +
Sbjct: 1015 TLTKLTLETGLKDWRRLLSLALLRARNTP-NRFGLTPYEILYGGPPPLSTLLNSFSPSNS 1073
Query: 377 GFDLNQNDQLLGEDLDLLAERRAIANCRELIAKQEAAIRYNKKVVLRSFAAGDLVLQNAS 436
DL L +A+ ++ A R F GD S
Sbjct: 1074 KTDLQAR----------LKGLQAVQ--AQIWAPLAELYRPGHSQTSHPFQVGD------S 1115
Query: 437 IGARSTERGKLAAKWEGPYRVVEATGTGAYKLETLA 472
+ R L +W+GPY V+ T T A K++ +A
Sbjct: 1116 VYVRRHRSQGLEPRWKGPYIVLLTTPT-AIKVDGIA 1150
>POL_MLVRK (P31795) Pol polyprotein [Contains: Protease (EC
3.4.23.-); Reverse transcriptase/ribonuclease H (EC
2.7.7.49) (EC 3.1.26.4) (RT); Integrase (IN)] (Fragment)
Length = 581
Score = 81.6 bits (200), Expect = 4e-15
Identities = 69/242 (28%), Positives = 106/242 (43%), Gaps = 23/242 (9%)
Query: 226 KYLVVAVDYYTKWIEAEPLATITSARVQRFFYKNVIRRFGVPGVLITDNGTQFTSKGFRD 285
KYL+V VD ++ W+EA P T+ V + + + RFG+P VL TDNG F S+ +
Sbjct: 313 KYLLVFVDTFSGWVEAFPTKHETAKIVTKKLLEEIFPRFGMPQVLGTDNGPAFVSQVSQS 372
Query: 286 LLDGLHIKQRFTSVEHPQTNGQAESANRVILRGLRK-RLGDAKGDWAEQLDHVLWAYRTT 344
+ L I + PQ++GQ E NR I L K L DW L L+ R T
Sbjct: 373 VAKLLGIDWKLHCAYRPQSSGQVERMNRTIKETLTKLTLATGTRDWVLLLPLALYRARNT 432
Query: 345 PHSTTGESPYRLTFGTEAVIPAEIREPSARTAGFDLNQNDQLLGEDLDLLAERRAIANCR 404
P G +PY + +G P + + F + + L A +A+ +
Sbjct: 433 P-GPHGLTPYEILYGAP---PPLVNFHDPEMSKFTNSPS---------LQAHLQALQAVQ 479
Query: 405 ELIAKQEAAI---RYNKKVVLRSFAAGDLVLQNASIGARSTERGKLAAKWEGPYRVVEAT 461
+ K AA + ++ V+ F GD ++ R + L +W+GPY V+ T
Sbjct: 480 REVWKPLAAAYQDQLDQPVIPHPFRVGD------TVWVRRHQTKNLEPRWKGPYTVLLTT 533
Query: 462 GT 463
T
Sbjct: 534 PT 535
>POL_MLVRD (P11227) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1196
Score = 81.6 bits (200), Expect = 4e-15
Identities = 69/242 (28%), Positives = 106/242 (43%), Gaps = 23/242 (9%)
Query: 226 KYLVVAVDYYTKWIEAEPLATITSARVQRFFYKNVIRRFGVPGVLITDNGTQFTSKGFRD 285
KYL+V VD ++ W+EA P T+ V + + + RFG+P VL TDNG F S+ +
Sbjct: 928 KYLLVFVDTFSGWVEAFPTKHETAKIVTKKLLEEIFPRFGMPQVLGTDNGPAFVSQVSQS 987
Query: 286 LLDGLHIKQRFTSVEHPQTNGQAESANRVILRGLRK-RLGDAKGDWAEQLDHVLWAYRTT 344
+ L I + PQ++GQ E NR I L K L DW L L+ R T
Sbjct: 988 VAKLLGIDWKLHCAYRPQSSGQVERMNRTIKETLTKLTLATGTRDWVLLLPLALYRARNT 1047
Query: 345 PHSTTGESPYRLTFGTEAVIPAEIREPSARTAGFDLNQNDQLLGEDLDLLAERRAIANCR 404
P G +PY + +G P + + F + + L A +A+ +
Sbjct: 1048 P-GPHGLTPYEILYGAP---PPLVNFHDPEMSKFTNSPS---------LQAHLQALQAVQ 1094
Query: 405 ELIAKQEAAI---RYNKKVVLRSFAAGDLVLQNASIGARSTERGKLAAKWEGPYRVVEAT 461
+ K AA + ++ V+ F GD ++ R + L +W+GPY V+ T
Sbjct: 1095 REVWKPLAAAYQDQLDQPVIPHPFRVGD------TVWVRRHQTKNLEPRWKGPYTVLLTT 1148
Query: 462 GT 463
T
Sbjct: 1149 PT 1150
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.321 0.137 0.424
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 58,888,728
Number of Sequences: 164201
Number of extensions: 2547676
Number of successful extensions: 6093
Number of sequences better than 10.0: 118
Number of HSP's better than 10.0 without gapping: 65
Number of HSP's successfully gapped in prelim test: 53
Number of HSP's that attempted gapping in prelim test: 5948
Number of HSP's gapped (non-prelim): 136
length of query: 489
length of database: 59,974,054
effective HSP length: 114
effective length of query: 375
effective length of database: 41,255,140
effective search space: 15470677500
effective search space used: 15470677500
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 68 (30.8 bits)
Lotus: description of TM0083.15