
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0101.14
(1733 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value
POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic pro...   332   4e-90
POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic pro...   332   7e-90
POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic pro...   330   2e-89
POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic pro...   327   2e-88
POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic pro...   326   3e-88
POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic prot...   322   4e-87
POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic prot...   317   1e-85
POL_SOCMV (P15629) Enzymatic polyprotein [Contains: Aspartic pro...   254   1e-66
POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;...   238   1e-61
POL_RTBVP (P27502) Polyprotein (P194 protein) [Contains: Coat pr...   228   9e-59
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei...   226   6e-58
RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protei...   223   5e-57
RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protei...   220   2e-56
POL3_DROME (P04323) Retrovirus-related Pol polyprotein from tran...   218   9e-56
POL2_DROME (P20825) Retrovirus-related Pol polyprotein from tran...   215   1e-54
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III    185   1e-45
POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from tran...   179   6e-44
POLY_DROME (P10401) Retrovirus-related Pol polyprotein from tran...   176   7e-43
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran...   164   3e-39
POL_SFV1 (P23074) Pol polyprotein [Contains: Protease (EC 3.4.23...   102   1e-20
>POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 332 bits (852), Expect = 4e-90
Identities = 223/653 (34%), Positives = 337/653 (51%), Gaps = 55/653 (8%)
Query: 1120 LETPALFDTGANSSCISEGLIPTRYFEKTTEKLSA--AEGSKLIIK-------------- 1163
+E DTGA+ S+ +IP ++ + A+GS + I
Sbjct: 38 IELHCFVDTGASLCIASKFVIPEEHWVNAERPIMVKIADGSSITISKVCKDIDLIIAREI 97
Query: 1164 YKIPSA---------IIKNDSLEIETPFLLVRNLTHKVIIGTPFIKKLFPYNTDEKGITV 1214
+KIP+ II N+ ++ PF+ T +VI K +P + + V
Sbjct: 98 FKIPTVYQQESGIDFIIGNNFCQLYEPFI---QFTDRVIFTK---NKSYPVHIAKLTRAV 151
Query: 1215 QHLGKPILFKFSK-------PPFSKTLNIISYKEKQINFLKE--EISHKSIEVQLQQPSV 1265
+ + L K P + + N I K+I L E +S + + + Q+
Sbjct: 152 RVGTEGFLESMKKRSKTQQPEPVNISTNKIENPLKEIAILSEGRRLSEEKLFITQQRMQ- 210
Query: 1266 KTRIGNILENIQSSICSDLPNAFWERKSHMVELPYEKDFSDKQIPTKARPIQMNEELLQF 1325
+I +LE + S D PN + ++L SD K +P++ + +
Sbjct: 211 --KIEELLEKVCSENPLD-PNKTKQWMKASIKL------SDPSKAIKVKPMKYSPMDREE 261
Query: 1326 CQKEINDLLQKKLIRRSKSPWSCATFYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIP 1385
K+I +LL K+I+ SKSP F VN +AE RG R+V+NYK +N+A Y +P
Sbjct: 262 FDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATIGDAYNLP 321
Query: 1386 NKKDLLARLHDAKIFSKFDMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLKNAPS 1445
NK +LL + KIFS FD KSGFWQ+ L ++ R TAFT P G YEWNV+PFGLK APS
Sbjct: 322 NKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPS 381
Query: 1446 EFQRIMNEIFNPYSNFTIVYIDDVLIFSQSIDQHFKHLNTFISVIKKNGLAVSKTKVSLF 1505
FQR M+E F + F VY+DD+L+FS + + H H+ + ++G+ +SK K LF
Sbjct: 382 IFQRHMDEAFRVFRKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLF 441
Query: 1506 QTKIRFLGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTII 1565
+ KI FLG I +GT P +E +KFPD + DK QLQRFLG L Y +D+ P+L+ I
Sbjct: 442 KKKINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIR 501
Query: 1566 KLLHDRLKKDPP-PWSDVHTNVVKQIKLRIKNLPCLYLPNPQAFKIVETDASDIGFGGIL 1624
K L +LK++ P W+ T ++++K ++ P L+ P P+ I+ETDASD +GG+L
Sbjct: 502 KPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGML 561
Query: 1625 K----QKIFDNEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISKFQYDLINQTFLVRVD 1680
K + + E I + S + A++NY + KE LA++ +I KF L FL+R D
Sbjct: 562 KAIKINEGTNTELICRYASGSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTD 621
Query: 1681 CKSAKDILQKDVKNLASKHIFARWQAILSVFDFEIEYIKGSTNSLPDFLTREY 1733
K + + K + RWQA LS + F++E+IKG+ N DFL+RE+
Sbjct: 622 NTHFKSFVNLNYKGDSKLGRNIRWQAWLSHYSFDVEHIKGTDNHFADFLSREF 674
>POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 332 bits (850), Expect = 7e-90
Identities = 220/647 (34%), Positives = 335/647 (51%), Gaps = 43/647 (6%)
Query: 1120 LETPALFDTGANSSCISEGLIPTRYFEKTTEKLSA--AEGSKLIIK-------------- 1163
+E DTGA+ S+ +IP ++ + A+GS + I
Sbjct: 38 IELHCFVDTGASLCIASKFVIPEEHWVNAERPIMVKIADGSSITISKVCKDIDLIIAGEI 97
Query: 1164 YKIPSA---------IIKNDSLEIETPFLLVRNLTHKVIIGTPFIKKLFPYNTDEKGITV 1214
+KIP+ II N+ ++ PF+ T +VI K +P + + V
Sbjct: 98 FKIPTVYQQESGIDFIIGNNFCQLYEPFI---QFTDRVIFTK---NKSYPVHITKLTRAV 151
Query: 1215 QHLGKPILFKFSKPPFSKTLNIISYKEKQINFLKEEISHKSIEVQLQQPSV---KTRIGN 1271
+ + L K ++ ++ +I EEI+ S +L + + + R+
Sbjct: 152 RVGIEGFLESMKKRSKTQQPEPVNISTNKIENPLEEIAILSEGRRLSEEKLFITQQRMQK 211
Query: 1272 ILENIQSSICSDLPNAFWERKSHMVELPYEKDFSDKQIPTKARPIQMNEELLQFCQKEIN 1331
I E + +CS+ P + K M SD K +P++ + + K+I
Sbjct: 212 I-EELLEKVCSENPLDPNKTKQWMKA---SIKLSDPSKAIKVKPMKYSPMDREEFDKQIK 267
Query: 1332 DLLQKKLIRRSKSPWSCATFYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIPNKKDLL 1391
+LL K+I+ SKSP F VN +AE RG R+V+NYK +N+A Y +PNK +LL
Sbjct: 268 ELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATIGDAYNLPNKDELL 327
Query: 1392 ARLHDAKIFSKFDMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLKNAPSEFQRIM 1451
+ KIFS FD KSGFWQ+ L ++ R TAFT P G YEWNV+PFGLK APS FQR M
Sbjct: 328 TLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHM 387
Query: 1452 NEIFNPYSNFTIVYIDDVLIFSQSIDQHFKHLNTFISVIKKNGLAVSKTKVSLFQTKIRF 1511
+E F + F VY+DD+L+FS + + H H+ + ++G+ +SK K LF+ KI F
Sbjct: 388 DEAFRVFRKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINF 447
Query: 1512 LGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTIIKLLHDR 1571
LG I +GT P +E +KFPD + DK QLQRFLG L Y +D+ P+L+ I K L +
Sbjct: 448 LGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAK 507
Query: 1572 LKKDPP-PWSDVHTNVVKQIKLRIKNLPCLYLPNPQAFKIVETDASDIGFGGILK----Q 1626
LK++ P W+ T ++++K ++ P L+ P P+ I+ETDASD +GG+LK
Sbjct: 508 LKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKIN 567
Query: 1627 KIFDNEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISKFQYDLINQTFLVRVDCKSAKD 1686
+ + E I + S + A++NY + KE LA++ +I KF L FL+R D K
Sbjct: 568 EGTNTELICRYASGSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKS 627
Query: 1687 ILQKDVKNLASKHIFARWQAILSVFDFEIEYIKGSTNSLPDFLTREY 1733
+ + K + RWQA LS + F++E+IKG+ N DFL+RE+
Sbjct: 628 FVNLNYKGDSKLGRNIRWQAWLSHYSFDVEHIKGTDNHFADFLSREF 674
>POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 330 bits (847), Expect = 2e-89
Identities = 219/647 (33%), Positives = 335/647 (50%), Gaps = 43/647 (6%)
Query: 1120 LETPALFDTGANSSCISEGLIPTRYFEKTTEKLSA--AEGSKLIIK-------------- 1163
+E DTGA+ S+ +IP ++ + A+GS + I
Sbjct: 38 IELHCFVDTGASLCIASKFVIPEEHWVNAERPIMVKIADGSSITISKVCKDIDLIIAGEI 97
Query: 1164 YKIPSA---------IIKNDSLEIETPFLLVRNLTHKVIIGTPFIKKLFPYNTDEKGITV 1214
++IP+ II N+ ++ PF+ T +VI K +P + + V
Sbjct: 98 FRIPTVYQQESGIDFIIGNNFCQLYEPFI---QFTDRVIFTK---NKSYPVHIAKLTRAV 151
Query: 1215 QHLGKPILFKFSKPPFSKTLNIISYKEKQINFLKEEISHKSIEVQLQQPSV---KTRIGN 1271
+ + L K ++ ++ +I EEI+ S +L + + + R+
Sbjct: 152 RVGTEGFLESMKKRSKTQQPEPVNISTNKIENPLEEIAILSEGRRLSEEKLFITQQRMQK 211
Query: 1272 ILENIQSSICSDLPNAFWERKSHMVELPYEKDFSDKQIPTKARPIQMNEELLQFCQKEIN 1331
I E + +CS+ P + K M SD K +P++ + + K+I
Sbjct: 212 I-EELLEKVCSENPLDPNKTKQWMKA---SIKLSDPSKAIKVKPMKYSPMDREEFDKQIK 267
Query: 1332 DLLQKKLIRRSKSPWSCATFYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIPNKKDLL 1391
+LL K+I+ SKSP F VN +AE RG R+V+NYK +N+A Y +PNK +LL
Sbjct: 268 ELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATVGDAYNLPNKDELL 327
Query: 1392 ARLHDAKIFSKFDMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLKNAPSEFQRIM 1451
+ KIFS FD KSGFWQ+ L ++ R TAFT P G YEWNV+PFGLK APS FQR M
Sbjct: 328 TLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHM 387
Query: 1452 NEIFNPYSNFTIVYIDDVLIFSQSIDQHFKHLNTFISVIKKNGLAVSKTKVSLFQTKIRF 1511
+E F + F VY+DD+L+FS + + H H+ + ++G+ +SK K LF+ KI F
Sbjct: 388 DEAFRVFRKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINF 447
Query: 1512 LGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTIIKLLHDR 1571
LG I +GT P +E +KFPD + DK QLQRFLG L Y +D+ P+L+ I K L +
Sbjct: 448 LGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAK 507
Query: 1572 LKKDPP-PWSDVHTNVVKQIKLRIKNLPCLYLPNPQAFKIVETDASDIGFGGILK----Q 1626
LK++ P W+ T ++++K ++ P L+ P P+ I+ETDASD +GG+LK
Sbjct: 508 LKENVPWRWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKIN 567
Query: 1627 KIFDNEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISKFQYDLINQTFLVRVDCKSAKD 1686
+ + E I + S + A++NY + KE LA++ +I KF L FL+R D K
Sbjct: 568 EGTNTELICRYASGSFKAAEKNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKS 627
Query: 1687 ILQKDVKNLASKHIFARWQAILSVFDFEIEYIKGSTNSLPDFLTREY 1733
+ + K + RWQA LS + F++E+IKG+ N DFL+RE+
Sbjct: 628 FVNLNYKGDSKLGRNIRWQAWLSHYSFDVEHIKGTDNHFADFLSREF 674
>POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 674
Score = 327 bits (837), Expect = 2e-88
Identities = 219/647 (33%), Positives = 335/647 (50%), Gaps = 50/647 (7%)
Query: 1120 LETPALFDTGANSSCISEGLIPTRYFEKTTEKLSA--AEGSKLIIK-------------- 1163
+E DTGA+ S+ +IP ++ + A+GS + I
Sbjct: 40 IELHCFVDTGASLCIASKFVIPEEHWINAERPIMVKIADGSSITINKVCRDIDLIIAGEI 99
Query: 1164 YKIPSA---------IIKNDSLEIETPFLLVRNLTHKVIIGTPFIK-KLFPYNTDEKGIT 1213
+ IP+ II N+ ++ PF+ T +VI F K + +P + +
Sbjct: 100 FHIPTVYQQESGIDFIIGNNFCQLYEPFI---QFTDRVI----FTKDRTYPVHIAKLTRA 152
Query: 1214 VQHLGKPILFKFSKPPFSKTLNIISYKEKQINFLKE--EISHKSIEVQLQQPSVKTRIGN 1271
V+ + L K ++ ++ +I L E +S + + + Q+ +I
Sbjct: 153 VRVGTEGFLESMKKRSKTQQPEPVNISTNKIAILSEGRRLSEEKLFITQQRMQ---KIEE 209
Query: 1272 ILENIQSSICSDLPNAFWERKSHMVELPYEKDFSDKQIPTKARPIQMNEELLQFCQKEIN 1331
+LE + S D PN + ++L SD K +P++ + + K+I
Sbjct: 210 LLEKVCSENPLD-PNKTKQWMKASIKL------SDPSKAIKVKPMKYSPMDREEFDKQIK 262
Query: 1332 DLLQKKLIRRSKSPWSCATFYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIPNKKDLL 1391
+LL K+I+ SKSP F VN +AE RG R+V+NYK +N+A Y PNK +LL
Sbjct: 263 ELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATVGDAYNPPNKDELL 322
Query: 1392 ARLHDAKIFSKFDMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLKNAPSEFQRIM 1451
+ KIFS FD KSGFWQ+ L ++ R TAFT P G YEWNV+PFGLK APS FQR M
Sbjct: 323 TLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHM 382
Query: 1452 NEIFNPYSNFTIVYIDDVLIFSQSIDQHFKHLNTFISVIKKNGLAVSKTKVSLFQTKIRF 1511
+E F + F VY+DD+L+FS + + H H+ + ++G+ +SK K LF+ KI F
Sbjct: 383 DEAFRVFRKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINF 442
Query: 1512 LGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTIIKLLHDR 1571
LG I +GT P +E +KFPD + DK QLQRFLG L Y +D+ P+L+ I K L +
Sbjct: 443 LGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAK 502
Query: 1572 LKKDPP-PWSDVHTNVVKQIKLRIKNLPCLYLPNPQAFKIVETDASDIGFGGILK----Q 1626
LK++ P W+ T ++++K ++ P L+ P P+ I+ETDASD +GG+LK
Sbjct: 503 LKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKIN 562
Query: 1627 KIFDNEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISKFQYDLINQTFLVRVDCKSAKD 1686
+ + E I + S + A++NY + KE LA++ +I KF L FL+R D K
Sbjct: 563 EGTNTELICRYASGSFKAAEKNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKS 622
Query: 1687 ILQKDVKNLASKHIFARWQAILSVFDFEIEYIKGSTNSLPDFLTREY 1733
+ + K + RWQA LS + F++E+IKG+ N DFL+RE+
Sbjct: 623 FVNLNYKGDSKLGRNIRWQAWLSHYSFDVEHIKGTDNHFADFLSREF 669
>POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 680
Score = 326 bits (836), Expect = 3e-88
Identities = 220/653 (33%), Positives = 334/653 (50%), Gaps = 55/653 (8%)
Query: 1120 LETPALFDTGANSSCISEGLIPTRYFEKTTEKLSA--AEGSKLIIK-------------- 1163
+E DTGA+ S+ +IP ++ + A+GS + I
Sbjct: 39 IELHCFVDTGASLCIASKFVIPEEHWVNAERPIMVKIADGSSITISKVCKDIDLIIVGVI 98
Query: 1164 YKIPSA---------IIKNDSLEIETPFLLVRNLTHKVIIGTPFIKKLFPYNTDEKGITV 1214
+KIP+ II N+ ++ PF+ T +VI K +P + + V
Sbjct: 99 FKIPTVYQQESGIDFIIGNNFCQLYEPFI---QFTDRVIFTK---NKSYPVHIAKLTRAV 152
Query: 1215 QHLGKPILFKFSK-------PPFSKTLNIISYKEKQINFLKE--EISHKSIEVQLQQPSV 1265
+ + L K P + + N I ++I L E +S + + + QQ
Sbjct: 153 RVGTEGFLESMKKRSKTQQPEPVNISTNKIENPLEEIAILSEGRRLSEEKLFIT-QQRMQ 211
Query: 1266 KTRIGNILENIQSSICSDLPNAFWERKSHMVELPYEKDFSDKQIPTKARPIQMNEELLQF 1325
KT E + +CS+ P + K M SD K +P++ + +
Sbjct: 212 KT------EELLEKVCSENPLDPNKTKQWMKA---SIKLSDPSKAIKVKPMKYSPMDREE 262
Query: 1326 CQKEINDLLQKKLIRRSKSPWSCATFYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIP 1385
K+I +LL K+I+ SKSP F VN +AE RG R+V+NYK +N+A Y +P
Sbjct: 263 FDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAENGRGNKRMVVNYKAMNKATVGDAYNLP 322
Query: 1386 NKKDLLARLHDAKIFSKFDMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLKNAPS 1445
NK +LL + KIFS FD KSGFWQ+ L ++ R TAFT P G YEWNV+PFGLK APS
Sbjct: 323 NKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPS 382
Query: 1446 EFQRIMNEIFNPYSNFTIVYIDDVLIFSQSIDQHFKHLNTFISVIKKNGLAVSKTKVSLF 1505
FQR M+E F + F VY+DD+++FS + + H H+ + ++G+ +SK K LF
Sbjct: 383 IFQRHMDEAFRVFRKFCCVYVDDIVVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLF 442
Query: 1506 QTKIRFLGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTII 1565
+ KI FLG I +GT P +E +KFPD + DK QLQRFLG L Y +D+ P L+ +
Sbjct: 443 KKKINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPNLAQMR 502
Query: 1566 KLLHDRLKKDPP-PWSDVHTNVVKQIKLRIKNLPCLYLPNPQAFKIVETDASDIGFGGIL 1624
+ L +LK++ P W+ T ++++K ++ P L+ P P+ I+ETDASD +GG+L
Sbjct: 503 QPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGML 562
Query: 1625 K----QKIFDNEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISKFQYDLINQTFLVRVD 1680
K + + E I + S + A++NY + KE LA++ +I KF L FL+R D
Sbjct: 563 KAIKINEGTNTELICRYRSGSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTD 622
Query: 1681 CKSAKDILQKDVKNLASKHIFARWQAILSVFDFEIEYIKGSTNSLPDFLTREY 1733
K + + K + RWQA LS + F++E+IKG+ N DFL+RE+
Sbjct: 623 NTHFKSFVNLNYKGDSKLGRNIRWQAWLSHYSFDVEHIKGTDNHFADFLSREF 675
>POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 666
Score = 322 bits (826), Expect = 4e-87
Identities = 206/615 (33%), Positives = 318/615 (51%), Gaps = 12/615 (1%)
Query: 1127 DTGANSSCISEGLIPTRYFEKTTEKLSAAEGSKLIIKYK--IPSAIIKNDSLEIETPFLL 1184
DTGA+ S +IP +E + + + ++ +IK + +K E P +
Sbjct: 54 DTGASLCIASRYIIPEELWENSPKDIQVKIANQELIKITKVCKNLKVKFAGKSFEIPTVY 113
Query: 1185 VRNLTHKVIIGTPFIKKLFPYNTDEKGITVQHLGKPILFKFSKPPFSKTLNIISYKEKQI 1244
+ +IG F + P+ E I + +L K FS + N + +
Sbjct: 114 QQETGIDFLIGNNFCRLYNPFIQWEDRIAFHLKNEMVLIKKVTKAFSVS-NPSFLENMKK 172
Query: 1245 NFLKEEISHKSIEVQLQQPSVK----TRIGNILENIQSSICSDLPNAFWERKSHMVELPY 1300
+ E+I +I + P + T +E + +CS+ P + K M
Sbjct: 173 DSKTEQIPGTNISKNIINPEERYFLITEKYQKIEQLLDKVCSENPIDPIKSKQWMKA--- 229
Query: 1301 EKDFSDKQIPTKARPIQMNEELLQFCQKEINDLLQKKLIRRSKSPWSCATFYVNKQAEIE 1360
D + +P+ + + + K+I +LL LI SKS F V +AE
Sbjct: 230 SIKLIDPLKVIRVKPMSYSPQDREGFAKQIKELLDLGLIIPSKSQHMSPAFLVENEAERR 289
Query: 1361 RGTPRLVINYKPLNQALCWIRYPIPNKKDLLARLHDAKIFSKFDMKSGFWQIQLQEKDRY 1420
RG R+V+NYK +NQA + +PN ++LL L IFS FD KSGFWQ+ L E+ +
Sbjct: 290 RGKKRMVVNYKAINQATIGDSHNLPNMQELLTLLRGKSIFSSFDCKSGFWQVVLDEESQK 349
Query: 1421 KTAFTVPFGQYEWNVMPFGLKNAPSEFQRIMNEIFNPYSNFTIVYIDDVLIFSQSIDQHF 1480
TAFT P G ++W V+PFGLK APS FQR M N F +VY+DD+++FS S H+
Sbjct: 350 LTAFTCPQGHFQWKVVPFGLKQAPSIFQRHMQTALNGADKFCMVYVDDIIVFSNSELDHY 409
Query: 1481 KHLNTFISVIKKNGLAVSKTKVSLFQTKIRFLGHNIHQGTIIPINRAIEFTDKFPDQIID 1540
H+ + +++K G+ +SK K +LF+ KI FLG I +GT P N +E KFPD++ D
Sbjct: 410 NHVYAVLKIVEKYGIILSKKKANLFKEKINFLGLEIDKGTHCPQNHILENIHKFPDRLED 469
Query: 1541 KTQLQRFLGCLNYVADFCPQLSTIIKLLHDRLKKDPP-PWSDVHTNVVKQIKLRIKNLPC 1599
K LQRFLG L Y + P+L+ I K L +LKKD W+ ++ VK+IK + + P
Sbjct: 470 KKHLQRFLGVLTYAETYIPKLAEIRKPLQVKLKKDVTWNWTQSDSDYVKKIKKNLGSFPK 529
Query: 1600 LYLPNPQAFKIVETDASDIGFGGILKQKIFDNEQIIA-FTSKHWNPAQQNYSTVKKEVLA 1658
LYLP P+ I+ETDASD +GG+LK + D ++I ++S + A++NY + KE+LA
Sbjct: 530 LYLPKPEDHLIIETDASDSFWGGVLKARALDGVELICRYSSGSFKQAEKNYHSNDKELLA 589
Query: 1659 IVLSISKFQYDLINQTFLVRVDCKSAKDILQKDVKNLASKHIFARWQAILSVFDFEIEYI 1718
+ I+KF L F VR D K+ L+ ++K + + RWQ S + F++E++
Sbjct: 590 VKQVITKFSAYLTPVRFTVRTDNKNFTYFLRINLKGDSKQGRLVRWQNWFSKYQFDVEHL 649
Query: 1719 KGSTNSLPDFLTREY 1733
+G N L D LTR++
Sbjct: 650 EGVKNVLADCLTRDF 664
>POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 659
Score = 317 bits (813), Expect = 1e-85
Identities = 211/634 (33%), Positives = 330/634 (51%), Gaps = 34/634 (5%)
Query: 1120 LETPALFDTGANSSCISEGLIPTRYFEKTTEKLSAA-EGSKLIIKYKIPSAI-IKNDSLE 1177
L+ DTG++ S+ +IP Y++ + L+ K+I K+ S + I+
Sbjct: 27 LDLHCYVDTGSSLCMASKYVIPEEYWQTAEKPLNIKIANGKIIQLTKVCSKLPIRLGGER 86
Query: 1178 IETPFLLVRNLTHKVIIGTPFIKKLFPY---------NTDEKGITVQHLGKPILF----- 1223
P L + +++G F + P+ + +++ + + + K +
Sbjct: 87 FLIPTLFQQESGIDLLLGNNFCQLYSPFIQYTDRIYFHLNKQSVIIGKITKAYQYGVKGF 146
Query: 1224 -----KFSKPPFSKTLNIISYKEKQINFLKEEISHKSIEVQLQQPSVKTRIGNILENIQS 1278
K SK + +NI S Q FL+E +H + Q S + I +LE + S
Sbjct: 147 LESMKKKSKVNRPEPINITS---NQHLFLEEGGNHVDEMLYEIQISKFSAIEEMLERVSS 203
Query: 1279 SICSDLPNAFWERKSHMVELPYEKDFSDKQIPTKARPIQMNEELLQFCQKEINDLLQKKL 1338
D P + + +EL D + K +P+ + + ++I +LL+ K+
Sbjct: 204 ENPID-PEKSKQWMTATIEL------IDPKTVVKVKPMSYSPSDREEFDRQIKELLELKV 256
Query: 1339 IRRSKSPWSCATFYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIPNKKDLLARLHDAK 1398
I+ SKS F V +AE RG R+V+NYK +N+A + +PNK +LL + K
Sbjct: 257 IKPSKSTHMSPAFLVENEAERRRGKKRMVVNYKAMNKATKGDAHNLPNKDELLTLVRGKK 316
Query: 1399 IFSKFDMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLKNAPSEFQR-IMNEIFNP 1457
I+S FD KSG WQ+ L ++ + TAFT P G Y+WNV+PFGLK APS F + N N
Sbjct: 317 IYSSFDCKSGLWQVLLDKESQLLTAFTCPQGHYQWNVVPFGLKQAPSIFPKTYANSHSNQ 376
Query: 1458 YSNFTIVYIDDVLIFSQS-IDQHFKHLNTFISVIKKNGLAVSKTKVSLFQTKIRFLGHNI 1516
YS + VY+DD+L+FS + +H+ H+ + +K G+ +SK K LF+ KI FLG I
Sbjct: 377 YSKYCCVYVDDILVFSNTGRKEHYIHVLNILRRCEKLGIILSKKKAQLFKEKINFLGLEI 436
Query: 1517 HQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTIIKLLHDRLKKDP 1576
QGT P N +E KFPD+I DK QLQRFLG L Y +D+ P+L++I K L +LK+D
Sbjct: 437 DQGTHCPQNHILEHIHKFPDRIEDKKQLQRFLGILTYASDYIPKLASIRKPLQSKLKEDS 496
Query: 1577 P-PWSDVHTNVVKQIKLRIKNLPCLYLPNPQAFKIVETDASDIGFGGILKQKIFDNEQII 1635
W+D + + +IK +K+ P LY P P ++ETDAS+ +GGILK +E I
Sbjct: 497 TWTWNDTDSQYMAKIKKNLKSFPKLYHPEPNDKLVIETDASEEFWGGILKAIHNSHEYIC 556
Query: 1636 AFTSKHWNPAQQNYSTVKKEVLAIVLSISKFQYDLINQTFLVRVDCKSAKDILQKDVKNL 1695
+ S + A++NY + +KE+LA++ I KF L FL+R D K+ + ++K
Sbjct: 557 RYASGSFKAAERNYHSNEKELLAVIRVIKKFSIYLTPSRFLIRTDNKNFTHFVNINLKGD 616
Query: 1696 ASKHIFARWQAILSVFDFEIEYIKGSTNSLPDFL 1729
+ RWQ LS +DF++E+I G+ N DFL
Sbjct: 617 RKQGRLVRWQMWLSQYDFDVEHIAGTKNVFADFL 650
>POL_SOCMV (P15629) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 692
Score = 254 bits (649), Expect = 1e-66
Identities = 208/694 (29%), Positives = 329/694 (46%), Gaps = 100/694 (14%)
Query: 1111 IKIIIG--DFILETPALFDTGANSSCISEGLIPTRY-FEKTTEKLSAAEGSKLIIKYKIP 1167
IK+ IG +F+ A DTGA + C + I + K +++ A+ SK I+ I
Sbjct: 22 IKVSIGKRNFL----AYIDTGA-TLCFGKRKISNNWEILKQPKEIIIADKSKHYIREAIS 76
Query: 1168 SAIIKNDSLEIETPFLLVRNLTHKVIIGTPFIKKLFPYNTDEKGITVQ--HLGKPILFKF 1225
+ +K ++ E P + + + +IIG F+K P+ + I ++ +L P +
Sbjct: 77 NVFLKIENKEFLIPIIYLHDSGLDLIIGNNFLKLYQPFIQRLETIELRWKNLNNPKESQM 136
Query: 1226 SKPPFSKTLNIISYKEKQINF-LKEEISHKSIEVQLQQPSVKTRIGNILENIQSSICSDL 1284
++ ++I+ L++ + K+IE QL++ +CS+
Sbjct: 137 ISTKILTKNEVLKLSFEKIHICLEKYLFFKTIEEQLEE-----------------VCSEH 179
Query: 1285 PNAFWERKSHM-VEL----PYEKDFSDKQIPTKARPIQMNEELLQFCQKEINDLLQKKLI 1339
P + K+ + +E+ P ++ +IP R +Q +E E DLL+K LI
Sbjct: 180 PLDETKNKNGLLIEIRLKDPLQEINVTNRIPYTIRDVQEFKE-------ECEDLLKKGLI 232
Query: 1340 RRSKSPWSCATFYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIPNKKDLLARLHDAKI 1399
R S+SP S FYV EI+RG R+VINYK +N+A Y +P K +L ++ +
Sbjct: 233 RESQSPHSAPAFYVENHNEIKRGKRRMVINYKKMNEATIGDSYKLPRKDFILEKIKGSLW 292
Query: 1400 FSKFDMKSGFWQIQLQEKDRYKTAFTV-PFGQYEWNVMPFGLKNAPSEFQRIMNEIFNPY 1458
FS D KSG++Q++L E + TAF+ P YEWNV+ FGLK APS +QR M++
Sbjct: 293 FSSLDAKSGYYQLRLHENTKPLTAFSCPPQKHYEWNVLSFGLKQAPSIYQRFMDQSLKGL 352
Query: 1459 SNFTIVYIDDVLIFSQ-SIDQHFKHLNTFISVIKKNGLAVSKTKVSLFQTKIRFLGHNIH 1517
+ + YIDD+LIF++ S +QH + + IK+ G+ +SK K L Q +I +LG I
Sbjct: 353 EHICLAYIDDILIFTKGSKEQHVNDVRIVLQRIKEKGIIISKKKSKLIQQEIEYLGLKIQ 412
Query: 1518 -QGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVAD--FCPQLSTIIKLLHDRLK- 1573
G I E +FPD++ D+ Q+QRFLGC+NY+A+ F L+ K L ++
Sbjct: 413 GNGEIDLSPHTQEKILQFPDELEDRKQIQRFLGCINYIANEGFFKNLALERKHLQKKISV 472
Query: 1574 KDPPPWSDVHTNVVKQIKLRIKNLPCLYLPNPQAFKIVETDASDIGFGGIL------KQK 1627
K+P W + T +V+ IK +I++LP LY + Q F IVETDAS + G L KQK
Sbjct: 473 KNPWKWDTIDTKMVQSIKGKIQSLPKLYNASIQDFLIVETDASQHSWSGCLRALPKGKQK 532
Query: 1628 I-----------------------------------------------FDNEQIIA-FTS 1639
I +NE ++ + S
Sbjct: 533 IGLDEFGIPTADLCTGSSSASSDNSPAEIDKCHSASKQDTHVASKIKKLENELLLCKYVS 592
Query: 1640 KHWNPAQQNYSTVKKEVLAIVLSISKFQYDLINQTFLVRVDCKSAKDILQKDVKNLASKH 1699
+ + Y + EVLA V + K++ DL+ FL+R D K + ++K
Sbjct: 593 GTFTDTETRYPIAELEVLAGVKVLEKWRIDLLQTRFLLRTDSKYFAGFCRYNIKTDYRNG 652
Query: 1700 IFARWQAILSVFDFEIEYIKGSTNSLPDFLTREY 1733
RWQ L + +E IK N D LTRE+
Sbjct: 653 RLIRWQLRLQAYQPYVELIKSENNPFADTLTREW 686
>POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;
Protease (EC 3.4.23.-); Reverse transcriptase (EC
2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
Length = 1886
Score = 238 bits (607), Expect = 1e-61
Identities = 211/775 (27%), Positives = 370/775 (47%), Gaps = 117/775 (15%)
Query: 1023 DTFRRLEKSTIKPVTIQDL-QSEVHTLQAEVKSLKQ------IQISQQLILDK------- 1068
+ + EK++ TIQ+ ++E++ ++ E++ K+ Q+ + +I+ +
Sbjct: 1114 EALKHSEKASRVFSTIQESDEAELNLIKEELRQFKEETRMAIAQLKEAIIVQEEDTIEER 1173
Query: 1069 ----LTEENSEE--SSSSSSTPNSASNNNVGDFLEIINNVIIQKFYINIKIIIGDFILET 1122
L E+++E S+++ + N N VG I ++ +YIN
Sbjct: 1174 CAMILEEKHTENIYSATAKAEYNGLYNVKVG-----IKPDNMEPYYIN------------ 1216
Query: 1123 PALFDTGANSSCISEGLIPTRYFE--KTTEKLSAAEGSKLIIKY-KIPSAIIKNDSLEIE 1179
A+ DTGA + I IP Y+E K T + G + K +I +
Sbjct: 1217 -AIVDTGATACLIQISAIPENYYEDAKVTVNFRSVLGIGTSTQMIKAGRILIGEQYFRMP 1275
Query: 1180 TPFLLVRNLTH--KVIIGTPFIKKLFPYNTDEKGITVQHLGKPIL--FKFSKPPFSKTLN 1235
+++ L+ ++IIG FI+ L E G+ ++ K I+ +K +
Sbjct: 1276 VTYVMNMGLSPGIQMIIGCSFIRSL------EGGLRIE---KDIITFYKLVTSIETSRTT 1326
Query: 1236 IISYKEKQINFLKEEISHKSIEVQ----LQQPSVKTRIGNILENIQSSICSDLPNAFWER 1291
++ +++ ++E + + V+ L Q + + E + + P FW+
Sbjct: 1327 QVANSIEELELSEDEYLNIAASVETPSFLDQEFARKNKDLLKEMKEMKYIGENPMEFWKN 1386
Query: 1292 KSHMVELPYEKDFSDKQIPTKARPIQM----NEELLQFCQKEINDLLQKKLIRRSKSPWS 1347
+L + + I RPI+ +EE + ++IN LLQ K+IR S+S
Sbjct: 1387 NKIKCKL----NIINPDIKIMGRPIKHVTPGDEEAMT---RQINLLLQMKVIRPSESKHR 1439
Query: 1348 CATFYVNKQAEIE-------RGTPRLVINYKPLNQALCWIRYPIPNKKDLLARLHDAKIF 1400
F V EI+ +G R+V NYK LN+ +Y +P +++++ +KI+
Sbjct: 1440 STAFIVRSGTEIDPITGKEKKGKERMVFNYKLLNENTESDQYSLPGINTIISKVGRSKIY 1499
Query: 1401 SKFDMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLKNAPSEFQRIMNEIFNPYSN 1460
SKFD+KSGFWQ+ ++E+ TAF YEW VMPFGLKNAP+ FQR M+ +F
Sbjct: 1500 SKFDLKSGFWQVAMEEESVPWTAFLAGNKLYEWLVMPFGLKNAPAIFQRKMDNVFKGTEK 1559
Query: 1461 FTIVYIDDVLIFSQSIDQHFKHLNTFISVIKKNGLAVSKTKVSLFQTKIRFLGHNIHQGT 1520
F VYIDD+L+FS++ +QH +HL T + + K+NGL +S TK+ + +I FLG ++
Sbjct: 1560 FIAVYIDDILVFSETAEQHSQHLYTMLQLCKENGLILSPTKMKIGTPEIDFLGASLGCTK 1619
Query: 1521 I----IPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTIIKLLHDRL---- 1572
I I++ +F+D +++ ++ +LG L+Y ++ + +++ L ++
Sbjct: 1620 IKLQPHIISKICDFSD---EKLATPEGMRSWLGILSYARNYIQDIGKLVQPLRQKMAPTG 1676
Query: 1573 --KKDPPPWSDVHTNVVKQIKLRIKNLPCLYLPNPQAFKIVETDASDIGFGGILKQKIF- 1629
+ +P W +V+QIK ++KNLP L LP +F I+ETD G+G + K K+
Sbjct: 1677 DKRMNPETW-----KMVRQIKEKVKNLPDLQLPPKDSFIIIETDGCMTGWGAVCKWKMSK 1731
Query: 1630 ----DNEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISKFQ-YDLINQTFLVRVDCKSA 1684
E+I A+ S +NP + ST+ E+ A + + KF+ Y L + ++R DC++
Sbjct: 1732 HDPRSTERICAYASGSFNPIK---STIDAEIQAAIHGLDKFKIYYLDKKELIIRSDCEAI 1788
Query: 1685 KDILQKDVKNLASKHIFARWQAILSVFDF--------EIEYIKGSTNSLPDFLTR 1731
K +N S+ RW L+ DF E+I G N L D L+R
Sbjct: 1789 IKFYNKTNENKPSR---VRW---LTFSDFLTGLGITVTFEHIDGKHNGLADALSR 1837
Score = 37.0 bits (84), Expect = 0.48
Identities = 52/336 (15%), Positives = 123/336 (36%), Gaps = 50/336 (14%)
Query: 726 LSNLKCKSLGDFRWYKDTFLTRVYTR----EDSQQAFWKEKFLAGLPKSFGDKVREKLRS 781
L L C + R Y +LT +++ E+ +P + G++V + +
Sbjct: 735 LKQLVCPNYQSIRRYLMDYLTLAAETGLMWSETEGPAISEELFTKMPAAIGERVAQAYKI 794
Query: 782 QNPGGEIPYQTLSYGQLIAIIQRVALKICQDDKIQQQLTKEKSQNRRDLGTFCEQFGIQG 841
+P + + Y + + ++ C++ + L FC F I+G
Sbjct: 795 MDPTSAVNLPSRVYFTINYLTEQ-----CKEASYMRSLKALD---------FCRDFPIEG 840
Query: 842 CPKKPKPRKHDPPPKQQWRRNSSRNHDHRKPKPRSKPHSTQAAKNPPENRPSQGKNVTCY 901
+ +K K + + + HD+ ++K + CY
Sbjct: 841 YYGRSGEKKKYTARKAT--KYTGKAHDNHIRVTKAKYQ----------------RKCKCY 882
Query: 902 NCGKPGHISRYCRLKRRISELHLEPEIEDKINNLLIQTSDEEESASSDSEVSEDLNQIQN 961
CG+ GH + CR K H + + + +L ++ ++E SA E +++ +
Sbjct: 883 ICGQEGHYANQCRNK------HKDQQRVAILQSLDLKENEEVVSADDKEEEDDEIFSVLG 936
Query: 962 DDDPQSSSSINVLTNEQDLLFRAINSIPDPDEKKIYLERLKFTLEDKPPKNPITTNKFNL 1021
++D Q + + + ++ + + + D + + + P +
Sbjct: 937 EEDYQEETIMVLEEDDIQQIIKEFSKFGDLSRRNVG--------PNFPGPAEVQMGVLKP 988
Query: 1022 RDTFRRLEKSTIKPVTIQDLQSEVHTLQAEVKSLKQ 1057
+ ++RR ++T++ + + + T Q +S KQ
Sbjct: 989 KSSWRRPIQATLEEINCHHNWTAISTGQLACRSCKQ 1024
>POL_RTBVP (P27502) Polyprotein (P194 protein) [Contains: Coat
protein; Protease (EC 3.4.23.-); Reverse transcriptase
(EC 2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
Length = 1675
Score = 228 bits (582), Expect = 9e-59
Identities = 235/906 (25%), Positives = 407/906 (43%), Gaps = 98/906 (10%)
Query: 881 TQAAKNPPEN---RPSQGKNVTCYNCGKPGHISRYCRLKRRISELHLEPEIEDKINNLLI 937
T KN +N RPS K CY C H++ C RR + ++ LI
Sbjct: 752 TNYNKNRRKNYVRRPSIKKKCRCYICQDENHLANRC--PRRYT---------NQARASLI 800
Query: 938 QTSDEE--ESASSDSEVSEDLNQIQNDDDPQSSSSINVLTNE----QDLLFRAINSIPDP 991
DE+ AS D ++ L I+ D+ SS + T E +D + + D
Sbjct: 801 DGLDEDIVSIASDDEDIENFLEIIELDEFIAHSSQEHEHTWEIGGKKDKVCEICSYFTDY 860
Query: 992 DEKKIYLERLKFTLEDKPPKNPITTNKFNLRDTFRRLEKSTIKPVTIQDLQSEVHTLQAE 1051
+ K + + T E + K +++ L T ++K T + I DL+ V L+
Sbjct: 861 N-KTVSCK----TCETQYCKT--CSDQLALEVT--EVKKPTKEETMIDDLKLNVKNLEFR 911
Query: 1052 VKSLKQIQISQQLILDKLTEENSEESSSSSSTPNS--ASNNNVGDFLEIINNVIIQKFYI 1109
V L+ ++ Q + DK S + P + A N ++++ N Y+
Sbjct: 912 VTILEH-KVEMQNLQDKFETMQIRNKSEITEIPTTSLAMRANESNYIKTSINKTAG-CYV 969
Query: 1110 NIKIIIGDFILETPALFDTGANSSCISEGLIPTRYFEKTTEKLS--AAEGSKLIIKYKIP 1167
KI + AL D+G+ + I LIP + T ++ A + SK + ++
Sbjct: 970 ETKISFNNENRIITALIDSGSTHNIICPTLIPASWINNTHREIIMFAVDNSKYNLNQELI 1029
Query: 1168 SAIIKNDSLEIETPFLLVRNLTHKVIIGTPFIKKLFPYN--TDEKG--------ITVQ-- 1215
I K E++ F + L + P + + T+E G IT+Q
Sbjct: 1030 DDI-KLQFQEVDETFGIKYKLGQTYVAPKPTKTFIIGHRFLTNENGSVTIHKDYITIQKT 1088
Query: 1216 ------------------HLGKPILFKFSKPPFSKTLNIISYKEKQINFLKEEISHKSIE 1257
H G+P LF ++K ++ SY+ + I K EI ++S+
Sbjct: 1089 TGIYPTARHELKSEFARKHGGRPPLFSNIPETYNKIPHLHSYQPQPILGYKNEIGNQSLI 1148
Query: 1258 VQLQQPSVKTRIGNILENIQSSICSDLPNAFWERKSHMVELPYEKDFSDKQIPTKARPIQ 1317
+++ IG+ + +++ D + +PY +DK++
Sbjct: 1149 TMVKELEALGFIGDDITKNRTTWVCDFKIINPDINITCATIPYTP--ADKEV-------- 1198
Query: 1318 MNEELLQFCQKEINDLLQKKLIRRSKSPWS--CATFYVNKQAEIERGTPRLVINYKPLNQ 1375
+K+I +LL KLI+++ A F V +E PR+V NYK LN
Sbjct: 1199 --------FEKQIKELLDNKLIKKADPTCRHRTAAFIVRNHSEEVAQKPRIVYNYKRLND 1250
Query: 1376 ALCWIRYPIPNKKDLLARLHDAKIFSKFDMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNV 1435
+ + IP+K ++ + A IFSKFD+K+GF ++L++ + T FT G Y WNV
Sbjct: 1251 NMHTDPFNIPHKISMINLIQKANIFSKFDLKAGFHHMKLKDDFKDWTTFTCSEGLYTWNV 1310
Query: 1436 MPFGLKNAPSEFQRIMNEIFNPYSNFTIVYIDDVLIFSQSIDQHFKHLNTFISVIKKNGL 1495
PFG+ NAP FQR M E F F ++YIDD+LI S + +H +HL F + +K+ G
Sbjct: 1311 CPFGIANAPCAFQRFMQESFGDL-KFALLYIDDILIASNNEKEHIEHLKIFFNRVKEVGC 1369
Query: 1496 AVSKTKVSLFQTKIRFLGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQ-LQRFLGCLNYV 1554
+SK K +F ++ +LG I +G I ++ KF ++ + LQ +LG LNY
Sbjct: 1370 VLSKKKSKMFLKEVEYLGVEIKEGKISLQPHIVDKIKKFDKNKLNTLKGLQAYLGLLNYA 1429
Query: 1555 ADFCPQLSTIIKLLHDRLKKDPPP-WSDVHTNVVKQIKLRIKNLPCLYLPNPQAFKIVET 1613
+ LS ++ L+ + K+ ++ N++ +I+ + + L P + I+ET
Sbjct: 1430 RGYIKDLSKLVGPLYKKTGKNGQRIFNKEDWNIIFKIEREVSKIKPLERPKETDYIIIET 1489
Query: 1614 DASDIGFGGIL-----KQKIFDNEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISKFQY 1668
DAS+ G+G +L K D E+I + S ++ ++ ++++ E+ AI +++KFQ
Sbjct: 1490 DASEEGWGAVLVCKPDKYSGKDTEKIAGYASGNFG-EKKTWTSLDYEIEAINEALNKFQI 1548
Query: 1669 DLINQTFLVRVDCKS-AKDILQKDVKNLA-SKHIFARWQAILSVFDFEIEYIKGSTNSLP 1726
+++ F +R DC++ K I +D K + ++ I R + + E+IKG+ N LP
Sbjct: 1549 -YLDKDFTIRTDCEAIVKGIKTEDYKKRSKTRWIKLRDNLLKDGYKPTFEHIKGNKNFLP 1607
Query: 1727 DFLTRE 1732
+FL+RE
Sbjct: 1608 NFLSRE 1613
>RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protein
type 1
Length = 1333
Score = 226 bits (575), Expect = 6e-58
Identities = 185/691 (26%), Positives = 333/691 (47%), Gaps = 70/691 (10%)
Query: 1053 KSLKQIQI-SQQLILDKLTEENSEESSSSSSTPN-SASNNNVGDFLEIINNVIIQKFYIN 1110
K+ K+ Q+ ++ L + L+++N+ +S + N S + FL N +++ +
Sbjct: 199 KTHKRFQLQNKNLGKESLSKKNNTTNSRNLRKTNVSRIEYSSNKFL----NHTRKRYEMV 254
Query: 1111 IKIIIGDFILETPALFDTGANSSCISEGLI-----PTRYFEKTTEKLSAAEGSKLIIKYK 1165
++ + DF P L DTGA ++ I+E + PTR + K+ I K
Sbjct: 255 LQAELPDFKCSIPCLIDTGAQANIITEETVRAHKLPTRPWSKSVIYGGVYPNK---INRK 311
Query: 1166 IPSAIIKNDSLEIETPFLLVRNLTHKVIIGTPFIKKLFPYNTDEKGITVQHLGKPILFKF 1225
I + + I+T FL+V+ +H I L+ N +
Sbjct: 312 TIKLNISLNGISIKTEFLVVKKFSHPAAIS---FTTLYDNNIE----------------- 351
Query: 1226 SKPPFSKTLNIISYKEKQINFLKEEISHKSIEVQLQQPSVKTRIGNILENIQSSICSDLP 1285
S + + +S K N +KE I + + + +T + + I+
Sbjct: 352 ----ISSSKHTLSQMNKVSNIVKEP-ELPDIYKEFKDITAETNTEKLPKPIKG------- 399
Query: 1286 NAFWERKSHMVELPYEKDFSDKQIPTKARPIQMNEELLQFCQKEINDLLQKKLIRRSKSP 1345
+E E + ++P + P+ + +Q EIN L+ +IR SK+
Sbjct: 400 ----------LEFEVELTQENYRLPIRNYPLPPGK--MQAMNDEINQGLKSGIIRESKAI 447
Query: 1346 WSCATFYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIPNKKDLLARLHDAKIFSKFDM 1405
+C +V K+ GT R+V++YKPLN+ + YP+P + LLA++ + IF+K D+
Sbjct: 448 NACPVMFVPKK----EGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDL 503
Query: 1406 KSGFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLKNAPSEFQRIMNEIFNPYSNFTIV- 1464
KS + I++++ D +K AF P G +E+ VMP+G+ AP+ FQ +N I +V
Sbjct: 504 KSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINTILGEAKESHVVC 563
Query: 1465 YIDDVLIFSQSIDQHFKHLNTFISVIKKNGLAVSKTKVSLFQTKIRFLGHNIHQGTIIPI 1524
Y+DD+LI S+S +H KH+ + +K L +++ K Q++++F+G++I + P
Sbjct: 564 YMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPC 623
Query: 1525 NRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTIIKLLHDRLKKDPP-PWSDVH 1583
I+ ++ Q ++ +L++FLG +NY+ F P+ S + L++ LKKD W+
Sbjct: 624 QENIDKVLQW-KQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQ 682
Query: 1584 TNVVKQIKLRIKNLPCLYLPNPQAFKIVETDASDIGFGGILKQKIFDNEQI-IAFTSKHW 1642
T ++ IK + + P L + ++ETDASD+ G +L QK D++ + + S
Sbjct: 683 TQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKM 742
Query: 1643 NPAQQNYSTVKKEVLAIVLSISKFQYDLIN--QTFLVRVDCKSAKDILQKDVKNLASKHI 1700
+ AQ NYS KE+LAI+ S+ +++ L + + F + D ++ + + +
Sbjct: 743 SKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESE--PENKR 800
Query: 1701 FARWQAILSVFDFEIEYIKGSTNSLPDFLTR 1731
ARWQ L F+FEI Y GS N + D L+R
Sbjct: 801 LARWQLFLQDFNFEINYRPGSANHIADALSR 831
>RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protein
type 2
Length = 1333
Score = 223 bits (567), Expect = 5e-57
Identities = 181/673 (26%), Positives = 323/673 (47%), Gaps = 71/673 (10%)
Query: 1075 EESSSSSSTPNSAS--NNNVGDFLEIINNVII----QKFYINIKIIIGDFILETPALFDT 1128
E S ++T NS + NV +E +N + +++ + ++ + DF P L DT
Sbjct: 214 ESLSKKNNTTNSRNLRKTNVSR-IEYSSNKFLNHTRKRYEMVLQAELPDFKCSIPCLIDT 272
Query: 1129 GANSSCISEGLI-----PTRYFEKTTEKLSAAEGSKLIIKYKIPSAIIKNDSLEIETPFL 1183
GA ++ I+E + PTR + K+ I K I + + I+T FL
Sbjct: 273 GAQANIITEETVRAHKLPTRPWSKSVIYGGVYPNK---INRKTIKLNISLNGISIKTEFL 329
Query: 1184 LVRNLTHKVIIGTPFIKKLFPYNTDEKGITVQHLGKPILFKFSKPPFSKTLNIISYKEKQ 1243
+V+ +H I L+ N + S + + +S K
Sbjct: 330 VVKKFSHPAAIS---FTTLYDNNIE---------------------ISSSKHTLSQMNKV 365
Query: 1244 INFLKEEISHKSIEVQLQQPSVKTRIGNILENIQSSICSDLPNAFWERKSHMVELPYEKD 1303
N +KE I + + + +T + + I+ +E E
Sbjct: 366 SNIVKEP-ELPDIYKEFKDITAETNTEKLPKPIKG-----------------LEFEVELT 407
Query: 1304 FSDKQIPTKARPIQMNEELLQFCQKEINDLLQKKLIRRSKSPWSCATFYVNKQAEIERGT 1363
+ ++P + P+ + +Q EIN L+ +IR SK+ +C +V K+ GT
Sbjct: 408 QENYRLPIRNYPLPPGK--MQAMNDEINQGLKSGIIRESKAINACPVMFVPKK----EGT 461
Query: 1364 PRLVINYKPLNQALCWIRYPIPNKKDLLARLHDAKIFSKFDMKSGFWQIQLQEKDRYKTA 1423
R+V++YKPLN+ + YP+P + LLA++ + IF+K D+KS + I++++ D +K A
Sbjct: 462 LRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLA 521
Query: 1424 FTVPFGQYEWNVMPFGLKNAPSEFQRIMNEIFNPYSNFTIV-YIDDVLIFSQSIDQHFKH 1482
F P G +E+ VMP+G+ AP+ FQ +N I +V Y+D++LI S+S +H KH
Sbjct: 522 FRCPRGVFEYLVMPYGISIAPAHFQYFINTILGEVKESHVVCYMDNILIHSKSESEHVKH 581
Query: 1483 LNTFISVIKKNGLAVSKTKVSLFQTKIRFLGHNIHQGTIIPINRAIEFTDKFPDQIIDKT 1542
+ + +K L +++ K Q++++F+G++I + P I+ ++ Q ++
Sbjct: 582 VKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQW-KQPKNRK 640
Query: 1543 QLQRFLGCLNYVADFCPQLSTIIKLLHDRLKKDPP-PWSDVHTNVVKQIKLRIKNLPCLY 1601
+L++FLG +NY+ F P+ S + L++ LKKD W+ T ++ IK + + P L
Sbjct: 641 ELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLR 700
Query: 1602 LPNPQAFKIVETDASDIGFGGILKQKIFDNEQI-IAFTSKHWNPAQQNYSTVKKEVLAIV 1660
+ ++ETDASD+ G +L QK D++ + + S + AQ NYS KE+LAI+
Sbjct: 701 HFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAII 760
Query: 1661 LSISKFQYDLIN--QTFLVRVDCKSAKDILQKDVKNLASKHIFARWQAILSVFDFEIEYI 1718
S+ +++ L + + F + D ++ + + + ARWQ L F+FEI Y
Sbjct: 761 KSLKHWRHYLESTIEPFKILTDHRNLIGRITNESE--PENKRLARWQLFLQDFNFEINYR 818
Query: 1719 KGSTNSLPDFLTR 1731
GS N + D L+R
Sbjct: 819 PGSANHIADALSR 831
>RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protein
type 3
Length = 1333
Score = 220 bits (561), Expect = 2e-56
Identities = 171/637 (26%), Positives = 307/637 (47%), Gaps = 64/637 (10%)
Query: 1105 QKFYINIKIIIGDFILETPALFDTGANSSCISEGLI-----PTRYFEKTTEKLSAAEGSK 1159
+++ + ++ + DF P L DTG ++ I+E + PTR + K+
Sbjct: 249 KRYEMVLQAELPDFKCSIPCLIDTGTQANIITEETVRAHKLPTRPWSKSVIYGGVYPNK- 307
Query: 1160 LIIKYKIPSAIIKNDSLEIETPFLLVRNLTHKVIIGTPFIKKLFPYNTDEKGITVQHLGK 1219
I K I + + I+T FL+V+ +H I L+ N +
Sbjct: 308 --INRKTIKLNISLNGISIKTEFLVVKKFSHPAAIS---FTTLYDNNIE----------- 351
Query: 1220 PILFKFSKPPFSKTLNIISYKEKQINFLKEEISHKSIEVQLQQPSVKTRIGNILENIQSS 1279
S + + +S K N +KE I + + + +T + + I+
Sbjct: 352 ----------ISSSKHTLSQMNKVSNIVKEP-ELPDIYKEFKDITAETNTEKLPKPIKG- 399
Query: 1280 ICSDLPNAFWERKSHMVELPYEKDFSDKQIPTKARPIQMNEELLQFCQKEINDLLQKKLI 1339
+E E + ++P + P+ + +Q EIN L+ +I
Sbjct: 400 ----------------LEFEVELTQENYRLPIRNYPLPPGK--MQAMNDEINQGLKSGII 441
Query: 1340 RRSKSPWSCATFYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIPNKKDLLARLHDAKI 1399
R SK+ +C +V K+ GT R+V++YKPLN+ + YP+P + LLA++ + I
Sbjct: 442 RESKAINACPVMFVPKK----EGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTI 497
Query: 1400 FSKFDMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLKNAPSEFQRIMNEIFNPYS 1459
F+K D+KS + I++++ D +K AF P G +E+ VMP+G+ AP+ FQ +N I
Sbjct: 498 FTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISIAPAHFQYFINTILGEVK 557
Query: 1460 NFTIV-YIDDVLIFSQSIDQHFKHLNTFISVIKKNGLAVSKTKVSLFQTKIRFLGHNIHQ 1518
+V Y+D++LI S+S +H KH+ + +K L +++ K Q++++F+G++I +
Sbjct: 558 ESHVVCYMDNILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISE 617
Query: 1519 GTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTIIKLLHDRLKKDPP- 1577
P I+ ++ Q ++ +L++FLG +NY+ F P+ S + L++ LKKD
Sbjct: 618 KGFTPCQENIDKVLQW-KQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRW 676
Query: 1578 PWSDVHTNVVKQIKLRIKNLPCLYLPNPQAFKIVETDASDIGFGGILKQKIFDNEQI-IA 1636
W+ T ++ IK + + P L + ++ETDASD+ G +L QK D++ +
Sbjct: 677 KWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVG 736
Query: 1637 FTSKHWNPAQQNYSTVKKEVLAIVLSISKFQYDLIN--QTFLVRVDCKSAKDILQKDVKN 1694
+ S + AQ NYS KE+LAI+ S+ +++ L + + F + D ++ + + +
Sbjct: 737 YYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESE- 795
Query: 1695 LASKHIFARWQAILSVFDFEIEYIKGSTNSLPDFLTR 1731
ARWQ L F+FEI Y GS N + D L+R
Sbjct: 796 -PENKRLARWQLFLQDFNFEINYRPGSANHIADALSR 831
>POL3_DROME (P04323) Retrovirus-related Pol polyprotein from
transposon 17.6 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1058
Score = 218 bits (556), Expect = 9e-56
Identities = 138/412 (33%), Positives = 221/412 (53%), Gaps = 13/412 (3%)
Query: 1324 QFCQKEINDLLQKKLIRRSKSPWSCATFYVNKQAEIE-RGTPRLVINYKPLNQALCWIRY 1382
Q + +I D+L + +IR S SP++ + V K+ + + R+VI+Y+ LN+ R+
Sbjct: 221 QEVESQIQDMLNQGIIRTSNSPYNSPIWVVPKKQDASGKQKFRIVIDYRKLNEITVGDRH 280
Query: 1383 PIPNKKDLLARLHDAKIFSKFDMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLKN 1442
PIPN ++L +L F+ D+ GF QI++ + KTAF+ G YE+ MPFGLKN
Sbjct: 281 PIPNMDEILGKLGRCNYFTTIDLAKGFHQIEMDPESVSKTAFSTKHGHYEYLRMPFGLKN 340
Query: 1443 APSEFQRIMNEIFNPYSN-FTIVYIDDVLIFSQSIDQHFKHLNTFISVIKKNGLAVSKTK 1501
AP+ FQR MN+I P N +VY+DD+++FS S+D+H + L + K L + K
Sbjct: 341 APATFQRCMNDILRPLLNKHCLVYLDDIIVFSTSLDEHLQSLGLVFEKLAKANLKLQLDK 400
Query: 1502 VSLFQTKIRFLGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQL 1561
+ + FLGH + I P IE K+P K +++ FLG Y F P
Sbjct: 401 CEFLKQETTFLGHVLTPDGIKPNPEKIEAIQKYPIPTKPK-EIKAFLGLTGYYRKFIPNF 459
Query: 1562 STIIKLLHDRLKKDP--PPWSDVHTNVVKQIKLRIKNLPCLYLPNPQAFKIVETDASDIG 1619
+ I K + LKK+ + + + K++K I P L +P+ + TDASD+
Sbjct: 460 ADIAKPMTKCLKKNMKIDTTNPEYDSAFKKLKYLISEDPILKVPDFTKKFTLTTDASDVA 519
Query: 1620 FGGILKQKIFDNEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISKFQYDLINQTFLVRV 1679
G +L Q + +++ S+ N + NYST++KE+LAIV + F++ L+ + F +
Sbjct: 520 LGAVLSQ----DGHPLSYISRTLNEHEINYSTIEKELLAIVWATKTFRHYLLGRHFEISS 575
Query: 1680 DCKSAKDILQKDVKNLASKHIFARWQAILSVFDFEIEYIKGSTNSLPDFLTR 1731
D + + + +K+ SK RW+ LS FDF+I+YIKG N + D L+R
Sbjct: 576 DHQPLSWLYR--MKDPNSK--LTRWRVKLSEFDFDIKYIKGKENCVADALSR 623
>POL2_DROME (P20825) Retrovirus-related Pol polyprotein from
transposon 297 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1059
Score = 215 bits (547), Expect = 1e-54
Identities = 138/427 (32%), Positives = 226/427 (52%), Gaps = 15/427 (3%)
Query: 1309 IPTKARPIQMNEELLQFCQKEINDLLQKKLIRRSKSPWSCATFYVNKQAEIERGTP-RLV 1367
I +K P+ E+ + ++ ++L + LIR S SP++ T+ V K+ + R+V
Sbjct: 207 IYSKQYPLAQTHEIE--VENQVQEMLNQGLIRESNSPYNSPTWVVPKKPDASGANKYRVV 264
Query: 1368 INYKPLNQALCWIRYPIPNKKDLLARLHDAKIFSKFDMKSGFWQIQLQEKDRYKTAFTVP 1427
I+Y+ LN+ RYPIPN ++L +L + F+ D+ GF QI++ E+ KTAF+
Sbjct: 265 IDYRKLNEITIPDRYPIPNMDEILGKLGKCQYFTTIDLAKGFHQIEMDEESISKTAFSTK 324
Query: 1428 FGQYEWNVMPFGLKNAPSEFQRIMNEIFNPYSN-FTIVYIDDVLIFSQSIDQHFKHLNTF 1486
G YE+ MPFGL+NAP+ FQR MN I P N +VY+DD++IFS S+ +H +
Sbjct: 325 SGHYEYLRMPFGLRNAPATFQRCMNNILRPLLNKHCLVYLDDIIIFSTSLTEHLNSIQLV 384
Query: 1487 ISVIKKNGLAVSKTKVSLFQTKIRFLGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQR 1546
+ + L + K + + FLGH + I P ++ +P DK +++
Sbjct: 385 FTKLADANLKLQLDKCEFLKKEANFLGHIVTPDGIKPNPIKVKAIVSYPIPTKDK-EIRA 443
Query: 1547 FLGCLNYVADFCPQLSTIIKLLHDRLKKDPPPWSD--VHTNVVKQIKLRIKNLPCLYLPN 1604
FLG Y F P + I K + LKK + + +++K I P L LP+
Sbjct: 444 FLGLTGYYRKFIPNYADIAKPMTSCLKKRTKIDTQKLEYIEAFEKLKALIIRDPILQLPD 503
Query: 1605 PQAFKIVETDASDIGFGGILKQKIFDNEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSIS 1664
+ ++ TDAS++ G +L Q N I+F S+ N + NYS ++KE+LAIV +
Sbjct: 504 FEKKFVLTTDASNLALGAVLSQ----NGHPISFISRTLNDHELNYSAIEKELLAIVWATK 559
Query: 1665 KFQYDLINQTFLVRVDCKSAKDILQKDVKNLASKHIFARWQAILSVFDFEIEYIKGSTNS 1724
F++ L+ + FL+ D + + + ++K +K RW+ LS + F+I+YIKG NS
Sbjct: 560 TFRHYLLGRQFLIASDHQPLRWL--HNLKEPGAK--LERWRVRLSEYQFKIDYIKGKENS 615
Query: 1725 LPDFLTR 1731
+ D L+R
Sbjct: 616 VADALSR 622
>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
Length = 2186
Score = 185 bits (469), Expect = 1e-45
Identities = 134/434 (30%), Positives = 217/434 (49%), Gaps = 24/434 (5%)
Query: 1309 IPTKARPIQMNEELLQFCQKEINDLLQKKLIRRSKSPWSCATFYVNKQAEIERGTPRLVI 1368
I K RPI + L +K I +L +K+IR SKSPWS V K+ G+ R+ I
Sbjct: 943 IRQKPRPIPL--ALKPEIRKMIQKMLNQKVIRESKSPWSSPVVLVKKKD----GSIRMCI 996
Query: 1369 NYKPLNQALCWIRYPIPNKKDLLARLHDAKIFSKFDMKSGFWQIQLQEKDRYKTAFTVPF 1428
+Y+ +N+ + +P+PN + L L K+++ FDM +GFWQI L EK + TAF +
Sbjct: 997 DYRKVNKVVKNNAHPLPNIEATLQSLAGKKLYTVFDMIAGFWQIPLDEKSKEITAFAIGS 1056
Query: 1429 GQYEWNVMPFGLKNAPSEFQRIMNEIFNPYSNF-TIVYIDDVLIFSQSIDQHFKHLNTFI 1487
+EWNV+PFGL +P+ FQ M EI VY+DD+LI S+ ++QH + + +
Sbjct: 1057 ELFEWNVLPFGLVISPALFQGTMEEIIGDLLGVCAFVYVDDLLIASKDMEQHLQDVKEAL 1116
Query: 1488 SVIKKNGLAVSKTKVSLFQTKIRFLGHNIHQGTIIPINRAIEFTDKFP--DQIIDKTQLQ 1545
+ I+K+G+ + +K + + ++ +LGH + T+ + TDK + + +LQ
Sbjct: 1117 TRIRKSGMKLRASKCHIAKKEVEYLGHKV---TLDGVETQEVKTDKMKQFSRPTNVKELQ 1173
Query: 1546 RFLGCLNYVADFCPQLSTIIKLLHDRLK-KDPPPWSDVHTNVVKQIKLRIKNLPCLYLPN 1604
FLG + Y F + I L + K W +++K + P L P+
Sbjct: 1174 SFLGLVGYYRKFILNFAQIASSLTSLISAKVAWIWEKEQEIAFQELKKLVCQTPVLAQPD 1233
Query: 1605 PQAFK------IVETDASDIGFGGILKQKIFDNEQ-IIAFTSKHWNPAQQNYSTVKKEVL 1657
+A ++ TDAS G G +L Q+ D +Q IAF SK +PA+ Y E L
Sbjct: 1234 VEAALKGDRPFMIYTDASRKGIGAVLAQEGPDGQQHPIAFASKALSPAETRYHITDLEAL 1293
Query: 1658 AIVLSISKFQYDLINQTFLVRVDCKSAKDILQKDVKNLASKHIFARWQAILSVFDFEIEY 1717
A++ ++ +F+ + V D K +L+ LA + RW + FD +I Y
Sbjct: 1294 AMMFALRRFKTIIYGTAITVFTDHKPLISLLKG--SPLADR--LWRWSIEILEFDVKIVY 1349
Query: 1718 IKGSTNSLPDFLTR 1731
+ G N++ D L+R
Sbjct: 1350 LAGKANAVADALSR 1363
>POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from
transposon opus [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1003
Score = 179 bits (454), Expect = 6e-44
Identities = 140/479 (29%), Positives = 236/479 (49%), Gaps = 30/479 (6%)
Query: 1274 ENIQSSICSDLPNAFWERKSHM-VELPYEKDF-SDKQIPTKAR----PIQMNEELLQFCQ 1327
+ I +S+ + P F S M VE + + ++ Q P A+ P+ M E+ +
Sbjct: 85 QEILNSLLGEFPRIFEPPLSGMSVETAVKAEIRTNTQDPIYAKSYPYPVNMRGEV----E 140
Query: 1328 KEINDLLQKKLIRRSKSPWSCATFYVNKQAEIERGTP-RLVINYKPLNQALCWIRYPIPN 1386
++I++LLQ +IR S SP++ + V K+ + R+V+++K LN YPIP+
Sbjct: 141 RQIDELLQDGIIRPSNSPYNSPIWIVPKKPKPNGEKQYRMVVDFKRLNTVTIPDTYPIPD 200
Query: 1387 KKDLLARLHDAKIFSKFDMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLKNAPSE 1446
LA L +AK F+ D+ SGF QI ++E D KTAF+ G+YE+ +PFGLKNAP+
Sbjct: 201 INATLASLGNAKYFTTLDLTSGFHQIHMKESDIPKTAFSTLNGKYEFLRLPFGLKNAPAI 260
Query: 1447 FQRIMNEIFNPY-SNFTIVYIDDVLIFSQSIDQHFKHLNTFISVIKKNGLAVSKTKVSLF 1505
FQR++++I + VYIDD+++FS+ D H+K+L ++ + K L V+ K
Sbjct: 261 FQRMIDDILREHIGKVCYVYIDDIIVFSEDYDTHWKNLRLVLASLSKANLQVNLEKSHFL 320
Query: 1506 QTKIRFLGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTII 1565
T++ FLG+ + I + + + P K +L+RFLG +Y F + +
Sbjct: 321 DTQVEFLGYIVTADGIKADPKKVRAISEMPPPTSVK-ELKRFLGMTSYYRKFIQDYAKVA 379
Query: 1566 KLL------------HDRLKKDPPPWSDVHTNVVKQIKLRIKNLPCLYLPNPQAFKIVET 1613
K L + K P + +K + + L P + T
Sbjct: 380 KPLTNLTRGLYANIKSSQSSKVPITLDETALQSFNDLKSILCSSEILAFPCFTKPFHLTT 439
Query: 1614 DASDIGFGGILKQKIFDNEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISKFQYDLINQ 1673
DAS+ G +L Q ++ IA+ S+ N ++NY+T++KE+LAI+ S+ + L
Sbjct: 440 DASNWAIGAVLSQDDQGRDRPIAYISRSLNKTEENYATIEKEMLAIIWSLDNLRAYLYGA 499
Query: 1674 -TFLVRVDCKSAKDILQKDVKNLASKHIFARWQAILSVFDFEIEYIKGSTNSLPDFLTR 1731
T V D + L +N +K RW+A + ++ E+ Y G +N + D L+R
Sbjct: 500 GTIKVYTDHQPLTFALGN--RNFNAK--LKRWKARIEEYNCELIYKPGKSNVVADALSR 554
>POLY_DROME (P10401) Retrovirus-related Pol polyprotein from
transposon gypsy [Contains: Reverse transcriptase (EC
2.7.7.49); Endonuclease]
Length = 1035
Score = 176 bits (445), Expect = 7e-43
Identities = 139/531 (26%), Positives = 260/531 (48%), Gaps = 45/531 (8%)
Query: 1234 LNIISYKEKQINFLKEEISHKSIEVQLQQ---PSVK-TRIGNILENIQSSICSDLPNAFW 1289
L++++ ++N ++ + ++ I +L PSV T + +I+ + S+ + +
Sbjct: 94 LDLLTQAGVKLNLAEDSLEYQGIAEKLHYFSCPSVNFTDVNDIV--VPDSVKKEFKDTII 151
Query: 1290 ERKSHMVE----LPYE-------KDFSDKQIPTKARPIQMNEELLQFCQKEINDLLQKKL 1338
RK LP+ + ++ + ++A P M + F E+ LL+ +
Sbjct: 152 RRKKAFSTTNEALPFNTAVTATIRTVDNEPVYSRAYPTLMG--VSDFVNNEVKQLLKDGI 209
Query: 1339 IRRSKSPWSCATFYVNKQAEIERGTP--RLVINYKPLNQALCWIRYPIPNKKDLLARLHD 1396
IR S+SP++ T+ V+K+ G P RLVI+++ LN+ RYP+P+ +LA L
Sbjct: 210 IRPSRSPYNSPTWVVDKKGTDAFGNPNKRLVIDFRKLNEKTIPDRYPMPSIPMILANLGK 269
Query: 1397 AKIFSKFDMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLKNAPSEFQRIMNEIF- 1455
AK F+ D+KSG+ QI L E DR KT+F+V G+YE+ +PFGL+NA S FQR ++++
Sbjct: 270 AKFFTTLDLKSGYHQIYLAEHDREKTSFSVNGGKYEFCRLPFGLRNASSIFQRALDDVLR 329
Query: 1456 NPYSNFTIVYIDDVLIFSQSIDQHFKHLNTFISVIKKNGLAVSKTKVSLFQTKIRFLGHN 1515
VY+DDV+IFS++ H +H++T + + + VS+ K F+ + +LG
Sbjct: 330 EQIGKICYVYVDDVIIFSENESDHVRHIDTVLKCLIDANMRVSQEKTRFFKESVEYLGFI 389
Query: 1516 IHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTIIKLLHDRL--- 1572
+ + ++ ++P+ +++ FLG +Y F + I + + D L
Sbjct: 390 VSKDGTKSDPEKVKAIQEYPEPDC-VYKVRSFLGLASYYRVFIKDFAAIARPITDILKGE 448
Query: 1573 ---------KKDPPPWSDVHTNVVKQIK--LRIKNLPCLYLPNPQAFKIVETDASDIGFG 1621
KK P +++ N ++++ L +++ Y + F + TDAS G G
Sbjct: 449 NGSVSKHMSKKIPVEFNETQRNAFQRLRNILASEDVILKYPDFKKPFDLT-TDASASGIG 507
Query: 1622 GILKQKIFDNEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISKFQYDLINQTFLVRVDC 1681
+L Q+ + I S+ +QNY+T ++E+LAIV ++ K Q L ++
Sbjct: 508 AVLSQE----GRPITMISRTLKQPEQNYATNERELLAIVWALGKLQNFLYGSR---EINI 560
Query: 1682 KSAKDILQKDVKNLASKHIFARWQAILSVFDFEIEYIKGSTNSLPDFLTRE 1732
+ L V + + RW++ + + ++ Y G N + D L+R+
Sbjct: 561 FTDHQPLTFAVADRNTNAKIKRWKSYIDQHNAKVFYKPGKENFVADALSRQ 611
>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
transposon 412 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1237
Score = 164 bits (414), Expect = 3e-39
Identities = 132/497 (26%), Positives = 235/497 (46%), Gaps = 22/497 (4%)
Query: 1245 NFLKEEISHKSIEVQLQQPSVKTRIGNILENIQSSICSDLPNAF-WERKSHMVELPYEKD 1303
N ++ H++ V Q +K + ++ +ICS+ + F E + V Y++
Sbjct: 250 NVVQANSEHRNKTVLSQ---LKKNFPELFKSQLENICSEYIDIFALESEPITVNNLYKQQ 306
Query: 1304 F---SDKQIPTK--ARPIQMNEELLQFCQKEINDLLQKKLIRRSKSPWSCATFYVNKQAE 1358
D+ + TK P EE+ Q ++ L++ K++ S S ++ V K++
Sbjct: 307 LRLKDDEPVYTKNYRSPHSQVEEI----QAQVQKLIKDKIVEPSVSQYNSPLLLVPKKSS 362
Query: 1359 --IERGTPRLVINYKPLNQALCWIRYPIPNKKDLLARLHDAKIFSKFDMKSGFWQIQLQE 1416
++ RLVI+Y+ +N+ L ++P+P D+L +L AK FS D+ SGF QI+L E
Sbjct: 363 PNSDKKKWRLVIDYRQINKKLLADKFPLPRIDDILDQLGRAKYFSCLDLMSGFHQIELDE 422
Query: 1417 KDRYKTAFTVPFGQYEWNVMPFGLKNAPSEFQRIMNEIFNPYS-NFTIVYIDDVLIFSQS 1475
R T+F+ G Y + +PFGLK AP+ FQR+M F+ + +Y+DD+++ S
Sbjct: 423 GSRDITSFSTSNGSYRFTRLPFGLKIAPNSFQRMMTIAFSGIEPSQAFLYMDDLIVIGCS 482
Query: 1476 IDQHFKHLNTFISVIKKNGLAVSKTKVSLFQTKIRFLGHNIHQGTIIPINRAIEFTDKFP 1535
K+L ++ L + K S F ++ FLGH I+P ++ + +P
Sbjct: 483 EKHMLKNLTEVFGKCREYNLKLHPEKCSFFMHEVTFLGHKCTDKGILPDDKKYDVIQNYP 542
Query: 1536 DQIIDKTQLQRFLGCLNYVADFCPQLSTIIKLLHDRLKKDPP-PWSDVHTNVVKQIKLRI 1594
D +RF+ NY F + + + KK+ P W+D +K ++
Sbjct: 543 VP-HDADSARRFVAFCNYYRRFIKNFADYSRHITRLCKKNVPFEWTDECQKAFIHLKSQL 601
Query: 1595 KNLPCLYLPNPQAFKIVETDASDIGFGGILKQKIFDNEQIIAFTSKHWNPAQQNYSTVKK 1654
N L P+ + TDAS G +L Q ++ +A+ S+ + + N ST ++
Sbjct: 602 INPTLLQYPDFSKEFCITTDASKQACGAVLTQNHNGHQLPVAYASRAFTKGESNKSTTEQ 661
Query: 1655 EVLAIVLSISKFQYDLINQTFLVRVDCKSAKDILQKDVKNLASKHIFARWQAILSVFDFE 1714
E+ AI +I F+ + + F V+ D + + + N +SK R + L ++F
Sbjct: 662 ELAAIHWAIIHFRPYIYGKHFTVKTDHRPLTYLF--SMVNPSSK--LTRIRLELEEYNFT 717
Query: 1715 IEYIKGSTNSLPDFLTR 1731
+EY+KG N + D L+R
Sbjct: 718 VEYLKGKDNHVADALSR 734
>POL_SFV1 (P23074) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1161
Score = 102 bits (253), Expect = 1e-20
Identities = 87/377 (23%), Positives = 167/377 (44%), Gaps = 25/377 (6%)
Query: 1310 PTKARPIQMNEELLQFCQKEINDLLQKKLIRRSKSPWSCATFYVNKQAEIERGTPRLVIN 1369
P K PI N + Q I+DLL++ ++ + S + + V K G R+V++
Sbjct: 176 PQKQYPI--NPKAKPSIQIVIDDLLKQGVLIQQNSTMNTPVYPVPKPD----GKWRMVLD 229
Query: 1370 YKPLNQALCWIRYPIPNKKDLLARLHDAKIFSKFDMKSGFWQIQLQEKDRYKTAFTVPFG 1429
Y+ +N+ + I + +L+ ++ K + D+ +GFW + + + TAFT
Sbjct: 230 YREVNKTIPLIAAQNQHSAGILSSIYRGKYKTTLDLTNGFWAHPITPESYWLTAFTWQGK 289
Query: 1430 QYEWNVMPFGLKNAPSEFQRIMNEIFNPYSNFTIVYIDDVLIFSQSIDQHFKHLNTFISV 1489
QY W +P G N+P+ F + ++ N Y+DD+ I +H + L S+
Sbjct: 290 QYCWTRLPQGFLNSPALFTADVVDLLKEIPNVQ-AYVDDIYISHDDPQEHLEQLEKIFSI 348
Query: 1490 IKKNGLAVSKTKVSLFQTKIRFLGHNIHQGTIIPINRAIEFTDKFPDQII------DKTQ 1543
+ G VS K + Q ++ FLG NI TD F +++ D Q
Sbjct: 349 LLNAGYVVSLKKSEIAQREVEFLGFNI-------TKEGRGLTDTFKQKLLNITPPKDLKQ 401
Query: 1544 LQRFLGCLNYVADFCPQLSTIIKLLHDRLKKDPP---PWSDVHTNVVKQIKLRIKNLPCL 1600
LQ LG LN+ +F P S ++K L+ + W++ ++N ++ I + L
Sbjct: 402 LQSILGLLNFARNFIPNYSELVKPLYTIVANANGKFISWTEDNSNQLQHIISVLNQADNL 461
Query: 1601 YLPNPQAFKIVETDASDIGFGGILKQKIFDNEQIIAFTSKHWNPAQQNYSTVKKEVLAIV 1660
NP+ I++ ++S G ++ +++ I + + ++ A+ ++ +K + +
Sbjct: 462 EERNPETRLIIKVNSSP--SAGYIRYYNEGSKRPIMYVNYIFSKAEAKFTQTEKLLTTMH 519
Query: 1661 LSISKFQYDLINQTFLV 1677
+ K + Q LV
Sbjct: 520 KGLIKAMDLAMGQEILV 536
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.339 0.149 0.490
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 190,646,649
Number of Sequences: 164201
Number of extensions: 8027356
Number of successful extensions: 41685
Number of sequences better than 10.0: 304
Number of HSP's better than 10.0 without gapping: 90
Number of HSP's successfully gapped in prelim test: 216
Number of HSP's that attempted gapping in prelim test: 40848
Number of HSP's gapped (non-prelim): 789
length of query: 1733
length of database: 59,974,054
effective HSP length: 124
effective length of query: 1609
effective length of database: 39,613,130
effective search space: 63737526170
effective search space used: 63737526170
T: 11
A: 40
X1: 15 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 39 (21.8 bits)
S2: 73 (32.7 bits)
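The statistics above are enough to reproduce the bit scores and Expect values reported for each hit. The following is a minimal sketch in Python, assuming the standard Karlin-Altschul relations: bit score = (Lambda * raw score - ln K) / ln 2, and E = effective search space * 2^(-bit score). The function and constant names are illustrative and are not part of BLAST. Because the report prints Lambda and K rounded, the results agree with the report only approximately.

```python
import math

# Values copied from the statistics footer of this report.
GAPPED_LAMBDA = 0.267          # "Gapped" Lambda
GAPPED_K = 0.0410              # "Gapped" K
QUERY_LENGTH = 1733            # length of query
DB_LETTERS = 59_974_054        # length of database
DB_SEQUENCES = 164_201         # Number of Sequences
EFFECTIVE_HSP_LENGTH = 124     # effective HSP length


def effective_search_space():
    """Effective query length times effective database length."""
    eff_query = QUERY_LENGTH - EFFECTIVE_HSP_LENGTH
    eff_db = DB_LETTERS - DB_SEQUENCES * EFFECTIVE_HSP_LENGTH
    return eff_query * eff_db


def bit_score(raw_score):
    """Convert a raw alignment score to bits using the gapped Lambda and K."""
    return (GAPPED_LAMBDA * raw_score - math.log(GAPPED_K)) / math.log(2)


def expect_value(raw_score):
    """Expected number of chance hits scoring at least raw_score."""
    return effective_search_space() * 2.0 ** (-bit_score(raw_score))


if __name__ == "__main__":
    # Raw score 567 is the value in parentheses on the RT22_SCHPO Score line.
    print(effective_search_space())
    print(bit_score(567))
    print(expect_value(567))
```

For the RT22_SCHPO hit (raw score 567), this gives the effective search space 63737526170 shown above, a bit score of about 223, and an Expect value close to the reported 5e-57.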