
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0173b.7
(1526 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic pro... 336 3e-91
POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic pro... 335 4e-91
POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic pro... 334 1e-90
POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic pro... 331 8e-90
POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic pro... 328 7e-89
POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic prot... 325 4e-88
POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic prot... 320 1e-86
POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;... 246 3e-64
POL_SOCMV (P15629) Enzymatic polyprotein [Contains: Aspartic pro... 246 5e-64
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei... 233 2e-60
RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protei... 231 1e-59
RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protei... 227 2e-58
POL_RTBVP (P27502) Polyprotein (P194 protein) [Contains: Coat pr... 224 2e-57
POL3_DROME (P04323) Retrovirus-related Pol polyprotein from tran... 219 5e-56
POL2_DROME (P20825) Retrovirus-related Pol polyprotein from tran... 213 3e-54
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III 185 9e-46
POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from tran... 184 2e-45
POLY_DROME (P10401) Retrovirus-related Pol polyprotein from tran... 180 3e-44
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran... 162 5e-39
POL_SFV1 (P23074) Pol polyprotein [Contains: Protease (EC 3.4.23... 103 4e-21
>POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 336 bits (861), Expect = 3e-91
Identities = 224/645 (34%), Positives = 338/645 (51%), Gaps = 41/645 (6%)
Query: 914 LETPALFDTGADSSCISEGLIPTRYFEKTTEKLSG--AEGSKLIIKY--KIPSAIIKNDS 969
+E DTGA S+ +IP ++ + A+GS + I K II +
Sbjct: 38 IELHCFVDTGASLCIASKFVIPEEHWVNAERPIMVKIADGSSITISKVCKDIDLIIAREI 97
Query: 970 LEIETSFLLVRNLTHKVIIETPFIKKLFPY-NTDEKGITVQHHGQPI-VFKFSKPPFVKT 1027
+I T + + II F + P+ ++ I ++ P+ + K ++ V T
Sbjct: 98 FKIPTVYQQESGIDF--IIGNNFCQLYEPFIQFTDRVIFTKNKSYPVHIAKLTRAVRVGT 155
Query: 1028 LNIISYKEKQINFLKEE---ISYKNIEVQLQQPSVKS------------------RIENI 1066
+ +K+ + E IS IE L++ ++ S +IE +
Sbjct: 156 EGFLESMKKRSKTQQPEPVNISTNKIENPLKEIAILSEGRRLSEEKLFITQQRMQKIEEL 215
Query: 1067 LENIQSSICFDLPNAFWERKSHMVELPYEKDFSDKQIPTKARPIQMNEELLQFFQREIND 1126
LE + S D PN + ++L SD K +P++ + + F ++I +
Sbjct: 216 LEKVCSENPLD-PNKTKQWMKASIKL------SDPSKAIKVKPMKYSPMDREEFDKQIKE 268
Query: 1127 LLQKKLIRRSKSPWSCAAFYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIPNKKDLLA 1186
LL K+I+ SKSP AF VN +AE RG R+V+NYK +N+A Y +PNK +LL
Sbjct: 269 LLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATIGDAYNLPNKDELLT 328
Query: 1187 RLHDAKIFSKFDMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLKNAPSEFQRIMN 1246
+ KIFS FD KSGFWQ+ L ++ R TAFT P G YEWNV+PFGLK APS FQR M+
Sbjct: 329 LIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMD 388
Query: 1247 EIFNP*SKFAIVYIDDVLIFSQSIDQHFKHLNTFISIIKKNGMAVSKTKVSLFQTKIRFL 1306
E F KF VY+DD+L+FS + + H H+ + ++G+ +SK K LF+ KI FL
Sbjct: 389 EAFRVFRKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFL 448
Query: 1307 GHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTIIKPLHDRL 1366
G I +GT P +E +KFPD + DK QLQRFLG L Y +D+ P+L+ I KPL +L
Sbjct: 449 GLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKL 508
Query: 1367 KKDPP-PWSDIHTNVVKQIKLRIKNLPCLYLPNPQAFKIVETDASDIGFGGILKQKIIDK 1425
K++ P W+ T ++++K ++ P L+ P P+ I+ETDASD +GG+LK I++
Sbjct: 509 KENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINE 568
Query: 1426 ----EQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISNFQSDLINQKFLVRVDCKSAKDI 1481
E I + S + A++NY + KE LA++ +I F L FL+R D K
Sbjct: 569 GTNTELICRYASGSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSF 628
Query: 1482 LQKDVKNLASKHIFARWQAILSVFDFEIEYIKGSTNSLPDFLTRE 1526
+ + K + RWQA LS + F++E+IKG+ N DFL+RE
Sbjct: 629 VNLNYKGDSKLGRNIRWQAWLSHYSFDVEHIKGTDNHFADFLSRE 673
>POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 335 bits (860), Expect = 4e-91
Identities = 224/645 (34%), Positives = 337/645 (51%), Gaps = 41/645 (6%)
Query: 914 LETPALFDTGADSSCISEGLIPTRYFEKTTEKLSG--AEGSKLIIKY--KIPSAIIKNDS 969
+E DTGA S+ +IP ++ + A+GS + I K II +
Sbjct: 38 IELHCFVDTGASLCIASKFVIPEEHWVNAERPIMVKIADGSSITISKVCKDIDLIIAGEI 97
Query: 970 LEIETSFLLVRNLTHKVIIETPFIKKLFPY-NTDEKGITVQHHGQPI-VFKFSKPPFVKT 1027
I T + + II F + P+ ++ I ++ P+ + K ++ V T
Sbjct: 98 FRIPTVYQQESGIDF--IIGNNFCQLYEPFIQFTDRVIFTKNKSYPVHIAKLTRAVRVGT 155
Query: 1028 LNIISYKEKQINFLKEE---ISYKNIEVQLQQPSVKS------------------RIENI 1066
+ +K+ + E IS IE L++ ++ S +IE +
Sbjct: 156 EGFLESMKKRSKTQQPEPVNISTNKIENPLEEIAILSEGRRLSEEKLFITQQRMQKIEEL 215
Query: 1067 LENIQSSICFDLPNAFWERKSHMVELPYEKDFSDKQIPTKARPIQMNEELLQFFQREIND 1126
LE + S D PN + ++L SD K +P++ + + F ++I +
Sbjct: 216 LEKVCSENPLD-PNKTKQWMKASIKL------SDPSKAIKVKPMKYSPMDREEFDKQIKE 268
Query: 1127 LLQKKLIRRSKSPWSCAAFYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIPNKKDLLA 1186
LL K+I+ SKSP AF VN +AE RG R+V+NYK +N+A Y +PNK +LL
Sbjct: 269 LLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATVGDAYNLPNKDELLT 328
Query: 1187 RLHDAKIFSKFDMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLKNAPSEFQRIMN 1246
+ KIFS FD KSGFWQ+ L ++ R TAFT P G YEWNV+PFGLK APS FQR M+
Sbjct: 329 LIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMD 388
Query: 1247 EIFNP*SKFAIVYIDDVLIFSQSIDQHFKHLNTFISIIKKNGMAVSKTKVSLFQTKIRFL 1306
E F KF VY+DD+L+FS + + H H+ + ++G+ +SK K LF+ KI FL
Sbjct: 389 EAFRVFRKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFL 448
Query: 1307 GHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTIIKPLHDRL 1366
G I +GT P +E +KFPD + DK QLQRFLG L Y +D+ P+L+ I KPL +L
Sbjct: 449 GLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKL 508
Query: 1367 KKDPP-PWSDIHTNVVKQIKLRIKNLPCLYLPNPQAFKIVETDASDIGFGGILKQKIIDK 1425
K++ P W+ T ++++K ++ P L+ P P+ I+ETDASD +GG+LK I++
Sbjct: 509 KENVPWRWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINE 568
Query: 1426 ----EQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISNFQSDLINQKFLVRVDCKSAKDI 1481
E I + S + A++NY + KE LA++ +I F L FL+R D K
Sbjct: 569 GTNTELICRYASGSFKAAEKNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSF 628
Query: 1482 LQKDVKNLASKHIFARWQAILSVFDFEIEYIKGSTNSLPDFLTRE 1526
+ + K + RWQA LS + F++E+IKG+ N DFL+RE
Sbjct: 629 VNLNYKGDSKLGRNIRWQAWLSHYSFDVEHIKGTDNHFADFLSRE 673
>POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 334 bits (856), Expect = 1e-90
Identities = 223/645 (34%), Positives = 337/645 (51%), Gaps = 41/645 (6%)
Query: 914 LETPALFDTGADSSCISEGLIPTRYFEKTTEKLSG--AEGSKLIIKY--KIPSAIIKNDS 969
+E DTGA S+ +IP ++ + A+GS + I K II +
Sbjct: 38 IELHCFVDTGASLCIASKFVIPEEHWVNAERPIMVKIADGSSITISKVCKDIDLIIAGEI 97
Query: 970 LEIETSFLLVRNLTHKVIIETPFIKKLFPY-NTDEKGITVQHHGQPI-VFKFSKPPFVKT 1027
+I T + + II F + P+ ++ I ++ P+ + K ++ V
Sbjct: 98 FKIPTVYQQESGIDF--IIGNNFCQLYEPFIQFTDRVIFTKNKSYPVHITKLTRAVRVGI 155
Query: 1028 LNIISYKEKQINFLKEE---ISYKNIEVQLQQPSVKS------------------RIENI 1066
+ +K+ + E IS IE L++ ++ S +IE +
Sbjct: 156 EGFLESMKKRSKTQQPEPVNISTNKIENPLEEIAILSEGRRLSEEKLFITQQRMQKIEEL 215
Query: 1067 LENIQSSICFDLPNAFWERKSHMVELPYEKDFSDKQIPTKARPIQMNEELLQFFQREIND 1126
LE + S D PN + ++L SD K +P++ + + F ++I +
Sbjct: 216 LEKVCSENPLD-PNKTKQWMKASIKL------SDPSKAIKVKPMKYSPMDREEFDKQIKE 268
Query: 1127 LLQKKLIRRSKSPWSCAAFYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIPNKKDLLA 1186
LL K+I+ SKSP AF VN +AE RG R+V+NYK +N+A Y +PNK +LL
Sbjct: 269 LLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATIGDAYNLPNKDELLT 328
Query: 1187 RLHDAKIFSKFDMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLKNAPSEFQRIMN 1246
+ KIFS FD KSGFWQ+ L ++ R TAFT P G YEWNV+PFGLK APS FQR M+
Sbjct: 329 LIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMD 388
Query: 1247 EIFNP*SKFAIVYIDDVLIFSQSIDQHFKHLNTFISIIKKNGMAVSKTKVSLFQTKIRFL 1306
E F KF VY+DD+L+FS + + H H+ + ++G+ +SK K LF+ KI FL
Sbjct: 389 EAFRVFRKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFL 448
Query: 1307 GHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTIIKPLHDRL 1366
G I +GT P +E +KFPD + DK QLQRFLG L Y +D+ P+L+ I KPL +L
Sbjct: 449 GLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKL 508
Query: 1367 KKDPP-PWSDIHTNVVKQIKLRIKNLPCLYLPNPQAFKIVETDASDIGFGGILKQKIIDK 1425
K++ P W+ T ++++K ++ P L+ P P+ I+ETDASD +GG+LK I++
Sbjct: 509 KENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINE 568
Query: 1426 ----EQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISNFQSDLINQKFLVRVDCKSAKDI 1481
E I + S + A++NY + KE LA++ +I F L FL+R D K
Sbjct: 569 GTNTELICRYASGSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSF 628
Query: 1482 LQKDVKNLASKHIFARWQAILSVFDFEIEYIKGSTNSLPDFLTRE 1526
+ + K + RWQA LS + F++E+IKG+ N DFL+RE
Sbjct: 629 VNLNYKGDSKLGRNIRWQAWLSHYSFDVEHIKGTDNHFADFLSRE 673
>POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 674
Score = 331 bits (849), Expect = 8e-90
Identities = 190/470 (40%), Positives = 273/470 (57%), Gaps = 12/470 (2%)
Query: 1062 RIENILENIQSSICFDLPNAFWERKSHMVELPYEKDFSDKQIPTKARPIQMNEELLQFFQ 1121
+IE +LE + S D PN + ++L SD K +P++ + + F
Sbjct: 206 KIEELLEKVCSENPLD-PNKTKQWMKASIKL------SDPSKAIKVKPMKYSPMDREEFD 258
Query: 1122 REINDLLQKKLIRRSKSPWSCAAFYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIPNK 1181
++I +LL K+I+ SKSP AF VN +AE RG R+V+NYK +N+A Y PNK
Sbjct: 259 KQIKELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATVGDAYNPPNK 318
Query: 1182 KDLLARLHDAKIFSKFDMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLKNAPSEF 1241
+LL + KIFS FD KSGFWQ+ L ++ R TAFT P G YEWNV+PFGLK APS F
Sbjct: 319 DELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIF 378
Query: 1242 QRIMNEIFNP*SKFAIVYIDDVLIFSQSIDQHFKHLNTFISIIKKNGMAVSKTKVSLFQT 1301
QR M+E F KF VY+DD+L+FS + + H H+ + ++G+ +SK K LF+
Sbjct: 379 QRHMDEAFRVFRKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKK 438
Query: 1302 KIRFLGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTIIKP 1361
KI FLG I +GT P +E +KFPD + DK QLQRFLG L Y +D+ P+L+ I KP
Sbjct: 439 KINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKP 498
Query: 1362 LHDRLKKDPP-PWSDIHTNVVKQIKLRIKNLPCLYLPNPQAFKIVETDASDIGFGGILKQ 1420
L +LK++ P W+ T ++++K ++ P L+ P P+ I+ETDASD +GG+LK
Sbjct: 499 LQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKA 558
Query: 1421 KIIDK----EQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISNFQSDLINQKFLVRVDCK 1476
I++ E I + S + A++NY + KE LA++ +I F L FL+R D
Sbjct: 559 IKINEGTNTELICRYASGSFKAAEKNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNT 618
Query: 1477 SAKDILQKDVKNLASKHIFARWQAILSVFDFEIEYIKGSTNSLPDFLTRE 1526
K + + K + RWQA LS + F++E+IKG+ N DFL+RE
Sbjct: 619 HFKSFVNLNYKGDSKLGRNIRWQAWLSHYSFDVEHIKGTDNHFADFLSRE 668
>POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 680
Score = 328 bits (841), Expect = 7e-89
Identities = 219/656 (33%), Positives = 338/656 (51%), Gaps = 63/656 (9%)
Query: 914 LETPALFDTGADSSCISEGLIPTRYFEKTTEKLSG--AEGSKLIIK-------------- 957
+E DTGA S+ +IP ++ + A+GS + I
Sbjct: 39 IELHCFVDTGASLCIASKFVIPEEHWVNAERPIMVKIADGSSITISKVCKDIDLIIVGVI 98
Query: 958 YKIPSAIIKNDSLEIETSFLLVRNLTHKVIIETPFIKKLFPYNTDEKGITVQHHGQPI-V 1016
+KIP+ + ++ F++ N + PFI+ ++ I ++ P+ +
Sbjct: 99 FKIPTVYQQESGID----FIIGNNFCQ---LYEPFIQ------FTDRVIFTKNKSYPVHI 145
Query: 1017 FKFSKPPFVKTLNIISYKEKQINFLKEE---ISYKNIEVQLQQPSVKS------------ 1061
K ++ V T + +K+ + E IS IE L++ ++ S
Sbjct: 146 AKLTRAVRVGTEGFLESMKKRSKTQQPEPVNISTNKIENPLEEIAILSEGRRLSEEKLFI 205
Query: 1062 ------RIENILENIQSSICFDLPNAFWERKSHMVELPYEKDFSDKQIPTKARPIQMNEE 1115
+ E +LE + S D PN + ++L SD K +P++ +
Sbjct: 206 TQQRMQKTEELLEKVCSENPLD-PNKTKQWMKASIKL------SDPSKAIKVKPMKYSPM 258
Query: 1116 LLQFFQREINDLLQKKLIRRSKSPWSCAAFYVNKQAEIERGTPRLVINYKPLNQALCWIR 1175
+ F ++I +LL K+I+ SKSP AF VN +AE RG R+V+NYK +N+A
Sbjct: 259 DREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAENGRGNKRMVVNYKAMNKATVGDA 318
Query: 1176 YPIPNKKDLLARLHDAKIFSKFDMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLK 1235
Y +PNK +LL + KIFS FD KSGFWQ+ L ++ R TAFT P G YEWNV+PFGLK
Sbjct: 319 YNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLK 378
Query: 1236 NAPSEFQRIMNEIFNP*SKFAIVYIDDVLIFSQSIDQHFKHLNTFISIIKKNGMAVSKTK 1295
APS FQR M+E F KF VY+DD+++FS + + H H+ + ++G+ +SK K
Sbjct: 379 QAPSIFQRHMDEAFRVFRKFCCVYVDDIVVFSNNEEDHLLHVAMILQKCNQHGIILSKKK 438
Query: 1296 VSLFQTKIRFLGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQL 1355
LF+ KI FLG I +GT P +E +KFPD + DK QLQRFLG L Y +D+ P L
Sbjct: 439 AQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPNL 498
Query: 1356 STIIKPLHDRLKKDPP-PWSDIHTNVVKQIKLRIKNLPCLYLPNPQAFKIVETDASDIGF 1414
+ + +PL +LK++ P W+ T ++++K ++ P L+ P P+ I+ETDASD +
Sbjct: 499 AQMRQPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYW 558
Query: 1415 GGILKQKIIDK----EQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISNFQSDLINQKFL 1470
GG+LK I++ E I + S + A++NY + KE LA++ +I F L FL
Sbjct: 559 GGMLKAIKINEGTNTELICRYRSGSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFL 618
Query: 1471 VRVDCKSAKDILQKDVKNLASKHIFARWQAILSVFDFEIEYIKGSTNSLPDFLTRE 1526
+R D K + + K + RWQA LS + F++E+IKG+ N DFL+RE
Sbjct: 619 IRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLSHYSFDVEHIKGTDNHFADFLSRE 674
>POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 666
Score = 325 bits (834), Expect = 4e-88
Identities = 215/627 (34%), Positives = 328/627 (52%), Gaps = 38/627 (6%)
Query: 921 DTGADSSCISEGLIPTRYFEKTTE----KLSGAEGSKLIIKYKIPSAIIKNDSLEIETSF 976
DTGA S +IP +E + + K++ E K+ K S EI T +
Sbjct: 54 DTGASLCIASRYIIPEELWENSPKDIQVKIANQELIKITKVCKNLKVKFAGKSFEIPTVY 113
Query: 977 LLVRNLTHKVIIETPFIKKLFPYNTDEKGITVQHHGQPIV-------FKFSKPPFVKTLN 1029
+ +I F + P+ E I + ++ F S P F++ +
Sbjct: 114 QQETGIDF--LIGNNFCRLYNPFIQWEDRIAFHLKNEMVLIKKVTKAFSVSNPSFLENMK 171
Query: 1030 IISYKEKQI-------NFLKEEISYKNIEVQLQQPSVKSRIENILENIQSSICFD-LPNA 1081
S K +QI N + E Y I + Q +IE +L+ + S D + +
Sbjct: 172 KDS-KTEQIPGTNISKNIINPEERYFLITEKYQ------KIEQLLDKVCSENPIDPIKSK 224
Query: 1082 FWERKSHMVELPYEKDFSDKQIPTKARPIQMNEELLQFFQREINDLLQKKLIRRSKSPWS 1141
W + S + P + + +P+ + + + F ++I +LL LI SKS
Sbjct: 225 QWMKASIKLIDPLKV--------IRVKPMSYSPQDREGFAKQIKELLDLGLIIPSKSQHM 276
Query: 1142 CAAFYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIPNKKDLLARLHDAKIFSKFDMKS 1201
AF V +AE RG R+V+NYK +NQA + +PN ++LL L IFS FD KS
Sbjct: 277 SPAFLVENEAERRRGKKRMVVNYKAINQATIGDSHNLPNMQELLTLLRGKSIFSSFDCKS 336
Query: 1202 GFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLKNAPSEFQRIMNEIFNP*SKFAIVYID 1261
GFWQ+ L E+ + TAFT P G ++W V+PFGLK APS FQR M N KF +VY+D
Sbjct: 337 GFWQVVLDEESQKLTAFTCPQGHFQWKVVPFGLKQAPSIFQRHMQTALNGADKFCMVYVD 396
Query: 1262 DVLIFSQSIDQHFKHLNTFISIIKKNGMAVSKTKVSLFQTKIRFLGHNIHQGTIIPINRA 1321
D+++FS S H+ H+ + I++K G+ +SK K +LF+ KI FLG I +GT P N
Sbjct: 397 DIIVFSNSELDHYNHVYAVLKIVEKYGIILSKKKANLFKEKINFLGLEIDKGTHCPQNHI 456
Query: 1322 IEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTIIKPLHDRLKKDPP-PWSDIHTNV 1380
+E KFPD++ DK LQRFLG L Y + P+L+ I KPL +LKKD W+ ++
Sbjct: 457 LENIHKFPDRLEDKKHLQRFLGVLTYAETYIPKLAEIRKPLQVKLKKDVTWNWTQSDSDY 516
Query: 1381 VKQIKLRIKNLPCLYLPNPQAFKIVETDASDIGFGGILKQKIIDKEQIIA-FTSKHWNPA 1439
VK+IK + + P LYLP P+ I+ETDASD +GG+LK + +D ++I ++S + A
Sbjct: 517 VKKIKKNLGSFPKLYLPKPEDHLIIETDASDSFWGGVLKARALDGVELICRYSSGSFKQA 576
Query: 1440 QQNYSTVKKEVLAIVLSISNFQSDLINQKFLVRVDCKSAKDILQKDVKNLASKHIFARWQ 1499
++NY + KE+LA+ I+ F + L +F VR D K+ L+ ++K + + RWQ
Sbjct: 577 EKNYHSNDKELLAVKQVITKFSAYLTPVRFTVRTDNKNFTYFLRINLKGDSKQGRLVRWQ 636
Query: 1500 AILSVFDFEIEYIKGSTNSLPDFLTRE 1526
S + F++E+++G N L D LTR+
Sbjct: 637 NWFSKYQFDVEHLEGVKNVLADCLTRD 663
>POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 659
Score = 320 bits (821), Expect = 1e-86
Identities = 190/496 (38%), Positives = 281/496 (56%), Gaps = 10/496 (2%)
Query: 1031 ISYKEKQINFLKEEISYKNIEVQLQQPSVKSRIENILENIQSSICFDLPNAFWERKSHMV 1090
I+ Q FL+E ++ + + Q S S IE +LE + S D P + + +
Sbjct: 162 INITSNQHLFLEEGGNHVDEMLYEIQISKFSAIEEMLERVSSENPID-PEKSKQWMTATI 220
Query: 1091 ELPYEKDFSDKQIPTKARPIQMNEELLQFFQREINDLLQKKLIRRSKSPWSCAAFYVNKQ 1150
EL D + K +P+ + + F R+I +LL+ K+I+ SKS AF V +
Sbjct: 221 EL------IDPKTVVKVKPMSYSPSDREEFDRQIKELLELKVIKPSKSTHMSPAFLVENE 274
Query: 1151 AEIERGTPRLVINYKPLNQALCWIRYPIPNKKDLLARLHDAKIFSKFDMKSGFWQIQLQE 1210
AE RG R+V+NYK +N+A + +PNK +LL + KI+S FD KSG WQ+ L +
Sbjct: 275 AERRRGKKRMVVNYKAMNKATKGDAHNLPNKDELLTLVRGKKIYSSFDCKSGLWQVLLDK 334
Query: 1211 KDRYKTAFTVPFGQYEWNVMPFGLKNAPSEFQR-IMNEIFNP*SKFAIVYIDDVLIFSQS 1269
+ + TAFT P G Y+WNV+PFGLK APS F + N N SK+ VY+DD+L+FS +
Sbjct: 335 ESQLLTAFTCPQGHYQWNVVPFGLKQAPSIFPKTYANSHSNQYSKYCCVYVDDILVFSNT 394
Query: 1270 -IDQHFKHLNTFISIIKKNGMAVSKTKVSLFQTKIRFLGHNIHQGTIIPINRAIEFTDKF 1328
+H+ H+ + +K G+ +SK K LF+ KI FLG I QGT P N +E KF
Sbjct: 395 GRKEHYIHVLNILRRCEKLGIILSKKKAQLFKEKINFLGLEIDQGTHCPQNHILEHIHKF 454
Query: 1329 PDQIIDKTQLQRFLGCLNYVADFCPQLSTIIKPLHDRLKKDPP-PWSDIHTNVVKQIKLR 1387
PD+I DK QLQRFLG L Y +D+ P+L++I KPL +LK+D W+D + + +IK
Sbjct: 455 PDRIEDKKQLQRFLGILTYASDYIPKLASIRKPLQSKLKEDSTWTWNDTDSQYMAKIKKN 514
Query: 1388 IKNLPCLYLPNPQAFKIVETDASDIGFGGILKQKIIDKEQIIAFTSKHWNPAQQNYSTVK 1447
+K+ P LY P P ++ETDAS+ +GGILK E I + S + A++NY + +
Sbjct: 515 LKSFPKLYHPEPNDKLVIETDASEEFWGGILKAIHNSHEYICRYASGSFKAAERNYHSNE 574
Query: 1448 KEVLAIVLSISNFQSDLINQKFLVRVDCKSAKDILQKDVKNLASKHIFARWQAILSVFDF 1507
KE+LA++ I F L +FL+R D K+ + ++K + RWQ LS +DF
Sbjct: 575 KELLAVIRVIKKFSIYLTPSRFLIRTDNKNFTHFVNINLKGDRKQGRLVRWQMWLSQYDF 634
Query: 1508 EIEYIKGSTNSLPDFL 1523
++E+I G+ N DFL
Sbjct: 635 DVEHIAGTKNVFADFL 650
>POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;
Protease (EC 3.4.23.-); Reverse transcriptase (EC
2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
Length = 1886
Score = 246 bits (629), Expect = 3e-64
Identities = 220/783 (28%), Positives = 372/783 (47%), Gaps = 129/783 (16%)
Query: 813 DTFKRLEKSTIKPVTIQDL-QSEVHILKAEVKSLKQ------IQTSQQLILEK------- 858
+ K EK++ TIQ+ ++E++++K E++ K+ Q + +I+++
Sbjct: 1114 EALKHSEKASRVFSTIQESDEAELNLIKEELRQFKEETRMAIAQLKEAIIVQEEDTIEER 1173
Query: 859 --LTEENSNGGSSSSSSSTSTSNRAPNNNVGDFLEIINYVTIQKFYINITIIIGDFILET 916
+ E + + S+++ + N N VG I ++ +YIN
Sbjct: 1174 CAMILEEKHTENIYSATAKAEYNGLYNVKVG-----IKPDNMEPYYIN------------ 1216
Query: 917 PALFDTGADSSCISEGLIPTRYFE--KTTEKLSGAEG---SKLIIK----------YKIP 961
A+ DTGA + I IP Y+E K T G S +IK +++P
Sbjct: 1217 -AIVDTGATACLIQISAIPENYYEDAKVTVNFRSVLGIGTSTQMIKAGRILIGEQYFRMP 1275
Query: 962 SAIIKNDSLEIETSFLL----VRNLTHKVIIETPFIKKLFPYNTDEKGITVQHHGQPIVF 1017
+ N L ++ +R+L + IE I + E T Q
Sbjct: 1276 VTYVMNMGLSPGIQMIIGCSFIRSLEGGLRIEKDIITFYKLVTSIETSRTTQVANSIEEL 1335
Query: 1018 KFSKPPFVKTLNIISYKEKQINFLKEEISYKNIEVQLQQPSVKSRIENILENIQSSICFD 1077
+ S+ + LNI + E +FL +E + KN ++ + +K EN +E
Sbjct: 1336 ELSEDEY---LNIAASVETP-SFLDQEFARKNKDLLKEMKEMKYIGENPME--------- 1382
Query: 1078 LPNAFWERKSHMVELPYEKDFSDKQIPTKARPIQM----NEELLQFFQREINDLLQKKLI 1133
FW+ +L + + I RPI+ +EE + R+IN LLQ K+I
Sbjct: 1383 ----FWKNNKIKCKL----NIINPDIKIMGRPIKHVTPGDEEAMT---RQINLLLQMKVI 1431
Query: 1134 RRSKSPWSCAAFYVNKQAEIE-------RGTPRLVINYKPLNQALCWIRYPIPNKKDLLA 1186
R S+S AF V EI+ +G R+V NYK LN+ +Y +P +++
Sbjct: 1432 RPSESKHRSTAFIVRSGTEIDPITGKEKKGKERMVFNYKLLNENTESDQYSLPGINTIIS 1491
Query: 1187 RLHDAKIFSKFDMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLKNAPSEFQRIMN 1246
++ +KI+SKFD+KSGFWQ+ ++E+ TAF YEW VMPFGLKNAP+ FQR M+
Sbjct: 1492 KVGRSKIYSKFDLKSGFWQVAMEEESVPWTAFLAGNKLYEWLVMPFGLKNAPAIFQRKMD 1551
Query: 1247 EIFNP*SKFAIVYIDDVLIFSQSIDQHFKHLNTFISIIKKNGMAVSKTKVSLFQTKIRFL 1306
+F KF VYIDD+L+FS++ +QH +HL T + + K+NG+ +S TK+ + +I FL
Sbjct: 1552 NVFKGTEKFIAVYIDDILVFSETAEQHSQHLYTMLQLCKENGLILSPTKMKIGTPEIDFL 1611
Query: 1307 GHNIHQGTI----IPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTIIKPL 1362
G ++ I I++ +F+D +++ ++ +LG L+Y ++ + +++PL
Sbjct: 1612 GASLGCTKIKLQPHIISKICDFSD---EKLATPEGMRSWLGILSYARNYIQDIGKLVQPL 1668
Query: 1363 HDRL------KKDPPPWSDIHTNVVKQIKLRIKNLPCLYLPNPQAFKIVETDASDIGFGG 1416
++ + +P W +V+QIK ++KNLP L LP +F I+ETD G+G
Sbjct: 1669 RQKMAPTGDKRMNPETW-----KMVRQIKEKVKNLPDLQLPPKDSFIIIETDGCMTGWGA 1723
Query: 1417 ILKQKII-----DKEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISNFQSDLINQK-FL 1470
+ K K+ E+I A+ S +NP + ST+ E+ A + + F+ +++K +
Sbjct: 1724 VCKWKMSKHDPRSTERICAYASGSFNPIK---STIDAEIQAAIHGLDKFKIYYLDKKELI 1780
Query: 1471 VRVDCKSAKDILQKDVKNLASKHIFARWQAILSVFDF--------EIEYIKGSTNSLPDF 1522
+R DC++ K +N S+ RW L+ DF E+I G N L D
Sbjct: 1781 IRSDCEAIIKFYNKTNENKPSR---VRW---LTFSDFLTGLGITVTFEHIDGKHNGLADA 1834
Query: 1523 LTR 1525
L+R
Sbjct: 1835 LSR 1837
Score = 37.4 bits (85), Expect = 0.32
Identities = 47/249 (18%), Positives = 94/249 (36%), Gaps = 42/249 (16%)
Query: 518 LSNLKCKSLGDFRWYKDTFLTRVYTR----EDSQHAFWKEKFLAGLPKSFGDKVREKLRS 573
L L C + R Y +LT +++ E+ +P + G++V + +
Sbjct: 735 LKQLVCPNYQSIRRYLMDYLTLAAETGLMWSETEGPAISEELFTKMPAAIGERVAQAYKI 794
Query: 574 QNPGGEIPYQTLSYGQLIAIIQRVALKICQDDKIQQQLTKEKSQNRRDLGTFCEQFGIQG 633
+P + + Y + + ++ C++ + L FC F I+G
Sbjct: 795 MDPTSAVNLPSRVYFTINYLTEQ-----CKEASYMRSLKALD---------FCRDFPIEG 840
Query: 634 CPKKPKPRKQDPPPKQQWRRKSSQNHDHRKPKPRSKPHSTQAAKTPPENRPQGKDVTCYN 693
+ +K+ RK++ K K H T + + + K CY
Sbjct: 841 YYGRSGEKKK------YTARKAT--------KYTGKAHDNHIRVTKAKYQRKCK---CYI 883
Query: 694 CGKPGHISRYCRLKRRISE-LHLEPEIEDKINNLLIQTSDEEE------SVPSDSEVSED 746
CG+ GH + CR K + + + + ++ K N ++ D+EE SV + + E+
Sbjct: 884 CGQEGHYANQCRNKHKDQQRVAILQSLDLKENEEVVSADDKEEEDDEIFSVLGEEDYQEE 943
Query: 747 LNQIQNDDD 755
+ +DD
Sbjct: 944 TIMVLEEDD 952
>POL_SOCMV (P15629) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 692
Score = 246 bits (627), Expect = 5e-64
Identities = 203/685 (29%), Positives = 311/685 (44%), Gaps = 108/685 (15%)
Query: 918 ALFDTGADSSCISEGLIPTRY-FEKTTEKLSGAEGSKLIIKYKIPSAIIKNDSLEIETSF 976
A DTGA + C + I + K +++ A+ SK I+ I + +K ++ E
Sbjct: 33 AYIDTGA-TLCFGKRKISNNWEILKQPKEIIIADKSKHYIREAISNVFLKIENKEFLIPI 91
Query: 977 LLVRNLTHKVIIETPFIKKLFPYNTDEKGITVQHHGQPIVFKFSKPPFVKTLNIISYKEK 1036
+ + + +II F+K PF++ L I + K
Sbjct: 92 IYLHDSGLDLIIGNNFLKLY-------------------------QPFIQRLETIELRWK 126
Query: 1037 QINFLKE---------------EISYKNIEVQLQQPSVKSRIENILENIQSSICFDLPNA 1081
+N KE ++S++ I + L++ IE LE + S D
Sbjct: 127 NLNNPKESQMISTKILTKNEVLKLSFEKIHICLEKYLFFKTIEEQLEEVCSEHPLDETK- 185
Query: 1082 FWERKSHMVELPYEKDFSDKQIPTKARPIQMNEELLQFFQREINDLLQKKLIRRSKSPWS 1141
+ ++E+ + + + T P + + +Q F+ E DLL+K LIR S+SP S
Sbjct: 186 --NKNGLLIEIRLKDPLQEINV-TNRIPYTIRD--VQEFKEECEDLLKKGLIRESQSPHS 240
Query: 1142 CAAFYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIPNKKDLLARLHDAKIFSKFDMKS 1201
AFYV EI+RG R+VINYK +N+A Y +P K +L ++ + FS D KS
Sbjct: 241 APAFYVENHNEIKRGKRRMVINYKKMNEATIGDSYKLPRKDFILEKIKGSLWFSSLDAKS 300
Query: 1202 GFWQIQLQEKDRYKTAFTV-PFGQYEWNVMPFGLKNAPSEFQRIMNEIFNP*SKFAIVYI 1260
G++Q++L E + TAF+ P YEWNV+ FGLK APS +QR M++ + YI
Sbjct: 301 GYYQLRLHENTKPLTAFSCPPQKHYEWNVLSFGLKQAPSIYQRFMDQSLKGLEHICLAYI 360
Query: 1261 DDVLIFSQ-SIDQHFKHLNTFISIIKKNGMAVSKTKVSLFQTKIRFLGHNIH-QGTIIPI 1318
DD+LIF++ S +QH + + IK+ G+ +SK K L Q +I +LG I G I
Sbjct: 361 DDILIFTKGSKEQHVNDVRIVLQRIKEKGIIISKKKSKLIQQEIEYLGLKIQGNGEIDLS 420
Query: 1319 NRAIEFTDKFPDQIIDKTQLQRFLGCLNYVAD--FCPQLSTIIKPLHDRLK-KDPPPWSD 1375
E +FPD++ D+ Q+QRFLGC+NY+A+ F L+ K L ++ K+P W
Sbjct: 421 PHTQEKILQFPDELEDRKQIQRFLGCINYIANEGFFKNLALERKHLQKKISVKNPWKWDT 480
Query: 1376 IHTNVVKQIKLRIKNLPCLYLPNPQAFKIVETDASDIGFGGIL------KQKI------- 1422
I T +V+ IK +I++LP LY + Q F IVETDAS + G L KQKI
Sbjct: 481 IDTKMVQSIKGKIQSLPKLYNASIQDFLIVETDASQHSWSGCLRALPKGKQKIGLDEFGI 540
Query: 1423 --------------------IDKEQ---------------------IIAFTSKHWNPAQQ 1441
IDK + + S + +
Sbjct: 541 PTADLCTGSSSASSDNSPAEIDKCHSASKQDTHVASKIKKLENELLLCKYVSGTFTDTET 600
Query: 1442 NYSTVKKEVLAIVLSISNFQSDLINQKFLVRVDCKSAKDILQKDVKNLASKHIFARWQAI 1501
Y + EVLA V + ++ DL+ +FL+R D K + ++K RWQ
Sbjct: 601 RYPIAELEVLAGVKVLEKWRIDLLQTRFLLRTDSKYFAGFCRYNIKTDYRNGRLIRWQLR 660
Query: 1502 LSVFDFEIEYIKGSTNSLPDFLTRE 1526
L + +E IK N D LTRE
Sbjct: 661 LQAYQPYVELIKSENNPFADTLTRE 685
>RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protein
type 1
Length = 1333
Score = 233 bits (595), Expect = 2e-60
Identities = 201/740 (27%), Positives = 359/740 (48%), Gaps = 77/740 (10%)
Query: 807 NKFNLKDTFKRLEKSTIKPVTIQDLQSEVHILKAEVKSLKQIQTSQQLILEKLTE----- 861
+K N+ D RL + P +++ L+ E + K+E+ + ++ T
Sbjct: 148 SKANVDDFHTRLFILWMLPYSLRKLK-ERNYWKSEISEIYDFLEDKRTASYGKTHKRFQL 206
Query: 862 ENSNGGSSSSSSSTSTSN----RAPNNNVGDFL--EIINYVTIQKFYINITIIIGDFILE 915
+N N G S S +T+N R N + ++ + +N+ T +++ + + + DF
Sbjct: 207 QNKNLGKESLSKKNNTTNSRNLRKTNVSRIEYSSNKFLNH-TRKRYEMVLQAELPDFKCS 265
Query: 916 TPALFDTGADSSCISEGLI-----PTRYFEKTTEKLSGAEGSKLIIKYKIPSAIIKNDSL 970
P L DTGA ++ I+E + PTR + K+ G +K I K I + +
Sbjct: 266 IPCLIDTGAQANIITEETVRAHKLPTRPWSKSVI-YGGVYPNK--INRKTIKLNISLNGI 322
Query: 971 EIETSFLLVRNLTHKVIIETPFIKKLFPYNTDEKGITVQHHGQPIVFKFSKPPFVKTLNI 1030
I+T FL+V+ +H I L+ N + I+ H + K S NI
Sbjct: 323 SIKTEFLVVKKFSHPAAIS---FTTLYDNNIE---ISSSKHTLSQMNKVS--------NI 368
Query: 1031 ISYKEKQINFLKEEISYKNIEVQLQQPSVKSRIENILENIQSSICFDLPNAFWERKSHMV 1090
+ KE ++ + +E +K+I + + I+ +
Sbjct: 369 V--KEPELPDIYKE--FKDITAETNTEKLPKPIKGL------------------------ 400
Query: 1091 ELPYEKDFSDKQIPTKARPIQMNEELLQFFQREINDLLQKKLIRRSKSPWSCAAFYVNKQ 1150
E E + ++P + P+ + +Q EIN L+ +IR SK+ +C +V K+
Sbjct: 401 EFEVELTQENYRLPIRNYPLPPGK--MQAMNDEINQGLKSGIIRESKAINACPVMFVPKK 458
Query: 1151 AEIERGTPRLVINYKPLNQALCWIRYPIPNKKDLLARLHDAKIFSKFDMKSGFWQIQLQE 1210
GT R+V++YKPLN+ + YP+P + LLA++ + IF+K D+KS + I++++
Sbjct: 459 ----EGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRK 514
Query: 1211 KDRYKTAFTVPFGQYEWNVMPFGLKNAPSEFQRIMNEIFNP*SKFAIV-YIDDVLIFSQS 1269
D +K AF P G +E+ VMP+G+ AP+ FQ +N I + +V Y+DD+LI S+S
Sbjct: 515 GDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINTILGEAKESHVVCYMDDILIHSKS 574
Query: 1270 IDQHFKHLNTFISIIKKNGMAVSKTKVSLFQTKIRFLGHNIHQGTIIPINRAIEFTDKFP 1329
+H KH+ + +K + +++ K Q++++F+G++I + P I+ ++
Sbjct: 575 ESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQW- 633
Query: 1330 DQIIDKTQLQRFLGCLNYVADFCPQLSTIIKPLHDRLKKDPP-PWSDIHTNVVKQIKLRI 1388
Q ++ +L++FLG +NY+ F P+ S + PL++ LKKD W+ T ++ IK +
Sbjct: 634 KQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCL 693
Query: 1389 KNLPCLYLPNPQAFKIVETDASDIGFGGILKQK-IIDKEQIIAFTSKHWNPAQQNYSTVK 1447
+ P L + ++ETDASD+ G +L QK DK + + S + AQ NYS
Sbjct: 694 VSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSD 753
Query: 1448 KEVLAIVLSISNFQSDLIN--QKFLVRVDCKSAKDILQKDVKNLASKHIFARWQAILSVF 1505
KE+LAI+ S+ +++ L + + F + D ++ + + + ARWQ L F
Sbjct: 754 KEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESE--PENKRLARWQLFLQDF 811
Query: 1506 DFEIEYIKGSTNSLPDFLTR 1525
+FEI Y GS N + D L+R
Sbjct: 812 NFEINYRPGSANHIADALSR 831
>RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protein
type 2
Length = 1333
Score = 231 bits (589), Expect = 1e-59
Identities = 199/740 (26%), Positives = 360/740 (47%), Gaps = 77/740 (10%)
Query: 807 NKFNLKDTFKRLEKSTIKPVTIQDLQSEVHILKAEVKSLKQIQTSQQLIL-----EKLTE 861
+K N+ D RL + P +++ L+ E + K+E+ + ++ ++
Sbjct: 148 SKANVDDFHTRLFILWMLPYSLRKLK-ERNYWKSEISEIYDFLEDKRTASYGKTHKRFQP 206
Query: 862 ENSNGGSSSSSSSTSTSN----RAPNNNVGDFL--EIINYVTIQKFYINITIIIGDFILE 915
+N N G S S +T+N R N + ++ + +N+ T +++ + + + DF
Sbjct: 207 QNKNLGKESLSKKNNTTNSRNLRKTNVSRIEYSSNKFLNH-TRKRYEMVLQAELPDFKCS 265
Query: 916 TPALFDTGADSSCISEGLI-----PTRYFEKTTEKLSGAEGSKLIIKYKIPSAIIKNDSL 970
P L DTGA ++ I+E + PTR + K+ G +K I K I + +
Sbjct: 266 IPCLIDTGAQANIITEETVRAHKLPTRPWSKSVI-YGGVYPNK--INRKTIKLNISLNGI 322
Query: 971 EIETSFLLVRNLTHKVIIETPFIKKLFPYNTDEKGITVQHHGQPIVFKFSKPPFVKTLNI 1030
I+T FL+V+ +H I L+ N + I+ H + K S NI
Sbjct: 323 SIKTEFLVVKKFSHPAAIS---FTTLYDNNIE---ISSSKHTLSQMNKVS--------NI 368
Query: 1031 ISYKEKQINFLKEEISYKNIEVQLQQPSVKSRIENILENIQSSICFDLPNAFWERKSHMV 1090
+ KE ++ + +E +K+I + + I+ +
Sbjct: 369 V--KEPELPDIYKE--FKDITAETNTEKLPKPIKGL------------------------ 400
Query: 1091 ELPYEKDFSDKQIPTKARPIQMNEELLQFFQREINDLLQKKLIRRSKSPWSCAAFYVNKQ 1150
E E + ++P + P+ + +Q EIN L+ +IR SK+ +C +V K+
Sbjct: 401 EFEVELTQENYRLPIRNYPLPPGK--MQAMNDEINQGLKSGIIRESKAINACPVMFVPKK 458
Query: 1151 AEIERGTPRLVINYKPLNQALCWIRYPIPNKKDLLARLHDAKIFSKFDMKSGFWQIQLQE 1210
GT R+V++YKPLN+ + YP+P + LLA++ + IF+K D+KS + I++++
Sbjct: 459 ----EGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRK 514
Query: 1211 KDRYKTAFTVPFGQYEWNVMPFGLKNAPSEFQRIMNEIFNP*SKFAIV-YIDDVLIFSQS 1269
D +K AF P G +E+ VMP+G+ AP+ FQ +N I + +V Y+D++LI S+S
Sbjct: 515 GDEHKLAFRCPRGVFEYLVMPYGISIAPAHFQYFINTILGEVKESHVVCYMDNILIHSKS 574
Query: 1270 IDQHFKHLNTFISIIKKNGMAVSKTKVSLFQTKIRFLGHNIHQGTIIPINRAIEFTDKFP 1329
+H KH+ + +K + +++ K Q++++F+G++I + P I+ ++
Sbjct: 575 ESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQW- 633
Query: 1330 DQIIDKTQLQRFLGCLNYVADFCPQLSTIIKPLHDRLKKDPP-PWSDIHTNVVKQIKLRI 1388
Q ++ +L++FLG +NY+ F P+ S + PL++ LKKD W+ T ++ IK +
Sbjct: 634 KQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCL 693
Query: 1389 KNLPCLYLPNPQAFKIVETDASDIGFGGILKQK-IIDKEQIIAFTSKHWNPAQQNYSTVK 1447
+ P L + ++ETDASD+ G +L QK DK + + S + AQ NYS
Sbjct: 694 VSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSD 753
Query: 1448 KEVLAIVLSISNFQSDLIN--QKFLVRVDCKSAKDILQKDVKNLASKHIFARWQAILSVF 1505
KE+LAI+ S+ +++ L + + F + D ++ + + + ARWQ L F
Sbjct: 754 KEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESE--PENKRLARWQLFLQDF 811
Query: 1506 DFEIEYIKGSTNSLPDFLTR 1525
+FEI Y GS N + D L+R
Sbjct: 812 NFEINYRPGSANHIADALSR 831
>RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protein
type 3
Length = 1333
Score = 227 bits (578), Expect = 2e-58
Identities = 187/693 (26%), Positives = 340/693 (48%), Gaps = 70/693 (10%)
Query: 843 KSLKQIQTSQQLILEKLTEENSNGGSSSSSSSTSTSNRAPNNNVGDFLEIINYVTIQKFY 902
K+ K+ Q + + ++ + +N +S + T+ S ++N + +N+ T +++
Sbjct: 199 KTHKRFQPQNKNLGKEFLPKKNNTTNSRNLRKTNISRIEYSSN-----KFLNH-TRKRYE 252
Query: 903 INITIIIGDFILETPALFDTGADSSCISEGLI-----PTRYFEKTTEKLSGAEGSKLIIK 957
+ + + DF P L DTG ++ I+E + PTR + K+ G +K I
Sbjct: 253 MVLQAELPDFKCSIPCLIDTGTQANIITEETVRAHKLPTRPWSKSVI-YGGVYPNK--IN 309
Query: 958 YKIPSAIIKNDSLEIETSFLLVRNLTHKVIIETPFIKKLFPYNTDEKGITVQHHGQPIVF 1017
K I + + I+T FL+V+ +H I L+ N + I+ H +
Sbjct: 310 RKTIKLNISLNGISIKTEFLVVKKFSHPAAIS---FTTLYDNNIE---ISSSKHTLSQMN 363
Query: 1018 KFSKPPFVKTLNIISYKEKQINFLKEEISYKNIEVQLQQPSVKSRIENILENIQSSICFD 1077
K S NI+ KE ++ + +E +K+I + + I+ +
Sbjct: 364 KVS--------NIV--KEPELPDIYKE--FKDITAETNTEKLPKPIKGL----------- 400
Query: 1078 LPNAFWERKSHMVELPYEKDFSDKQIPTKARPIQMNEELLQFFQREINDLLQKKLIRRSK 1137
E E + ++P + P+ + +Q EIN L+ +IR SK
Sbjct: 401 -------------EFEVELTQENYRLPIRNYPLPPGK--MQAMNDEINQGLKSGIIRESK 445
Query: 1138 SPWSCAAFYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIPNKKDLLARLHDAKIFSKF 1197
+ +C +V K+ GT R+V++YKPLN+ + YP+P + LLA++ + IF+K
Sbjct: 446 AINACPVMFVPKK----EGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKL 501
Query: 1198 DMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLKNAPSEFQRIMNEIFNP*SKFAI 1257
D+KS + I++++ D +K AF P G +E+ VMP+G+ AP+ FQ +N I + +
Sbjct: 502 DLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISIAPAHFQYFINTILGEVKESHV 561
Query: 1258 V-YIDDVLIFSQSIDQHFKHLNTFISIIKKNGMAVSKTKVSLFQTKIRFLGHNIHQGTII 1316
V Y+D++LI S+S +H KH+ + +K + +++ K Q++++F+G++I +
Sbjct: 562 VCYMDNILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFT 621
Query: 1317 PINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTIIKPLHDRLKKDPP-PWSD 1375
P I+ ++ Q ++ +L++FLG +NY+ F P+ S + PL++ LKKD W+
Sbjct: 622 PCQENIDKVLQW-KQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTP 680
Query: 1376 IHTNVVKQIKLRIKNLPCLYLPNPQAFKIVETDASDIGFGGILKQK-IIDKEQIIAFTSK 1434
T ++ IK + + P L + ++ETDASD+ G +L QK DK + + S
Sbjct: 681 TQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSA 740
Query: 1435 HWNPAQQNYSTVKKEVLAIVLSISNFQSDLIN--QKFLVRVDCKSAKDILQKDVKNLASK 1492
+ AQ NYS KE+LAI+ S+ +++ L + + F + D ++ + + +
Sbjct: 741 KMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESE--PEN 798
Query: 1493 HIFARWQAILSVFDFEIEYIKGSTNSLPDFLTR 1525
ARWQ L F+FEI Y GS N + D L+R
Sbjct: 799 KRLARWQLFLQDFNFEINYRPGSANHIADALSR 831
>POL_RTBVP (P27502) Polyprotein (P194 protein) [Contains: Coat
protein; Protease (EC 3.4.23.-); Reverse transcriptase
(EC 2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
Length = 1675
Score = 224 bits (570), Expect = 2e-57
Identities = 226/885 (25%), Positives = 404/885 (45%), Gaps = 86/885 (9%)
Query: 687 KDVTCYNCGKPGHISRYCRLKRRISELHLEPEIEDKINNLLIQTSDEEESVPSDSEVSED 746
K CY C H++ C RR + + D ++ ++ + ++E + + E+ E
Sbjct: 770 KKCRCYICQDENHLANRC--PRRYTN-QARASLIDGLDEDIVSIASDDEDIENFLEIIE- 825
Query: 747 LNQIQNDDDQSSSSINVLTNEQDLIFRAIDSIPDPDEKKVYLERLKLTLEDRPPKSPITT 806
L++ Q + ++D + D + K V + T E + K+ +
Sbjct: 826 LDEFIAHSSQEHEHTWEIGGKKDKVCEICSYFTDYN-KTVSCK----TCETQYCKT--CS 878
Query: 807 NKFNLKDTFKRLEKSTIKPVTIQDLQSEVHILKAEVKSLKQIQTSQQLILEKLTEENSNG 866
++ L+ T ++K T + I DL+ V L+ V L+ Q L + T + N
Sbjct: 879 DQLALEVT--EVKKPTKEETMIDDLKLNVKNLEFRVTILEHKVEMQNLQDKFETMQIRNK 936
Query: 867 GSSSSSSSTSTSNRAPNNNVGDFLEIINYVTIQKFYINITIIIGDFILETPALFDTGADS 926
+ +TS + RA +N IN Y+ I + AL D+G+
Sbjct: 937 SEITEIPTTSLAMRANESNY--IKTSINKTA--GCYVETKISFNNENRIITALIDSGSTH 992
Query: 927 SCISEGLIPTRYFEKTTEKLS--GAEGSKLIIKYKIPSAIIKNDSLEIETSFLLVRNLTH 984
+ I LIP + T ++ + SK + ++ I K E++ +F + L
Sbjct: 993 NIICPTLIPASWINNTHREIIMFAVDNSKYNLNQELIDDI-KLQFQEVDETFGIKYKLGQ 1051
Query: 985 KVIIETPFIKKLFPYN--TDEKG--------ITVQ--------------------HHGQP 1014
+ P + + T+E G IT+Q H G+P
Sbjct: 1052 TYVAPKPTKTFIIGHRFLTNENGSVTIHKDYITIQKTTGIYPTARHELKSEFARKHGGRP 1111
Query: 1015 IVFKFSKPPFVKTLNIISYKEKQINFLKEEISYKNIEVQLQQPSVKSRI-ENILENIQSS 1073
+F + K ++ SY+ + I K EI +++ +++ I ++I +N +
Sbjct: 1112 PLFSNIPETYNKIPHLHSYQPQPILGYKNEIGNQSLITMVKELEALGFIGDDITKNRTTW 1171
Query: 1074 IC-FDLPNAFWERKSHMVELPYEKDFSDKQIPTKARPIQMNEELLQFFQREINDLLQKKL 1132
+C F + N + +PY +DK++ F+++I +LL KL
Sbjct: 1172 VCDFKIINP--DINITCATIPYTP--ADKEV----------------FEKQIKELLDNKL 1211
Query: 1133 IRRSKSPWS--CAAFYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIPNKKDLLARLHD 1190
I+++ AAF V +E PR+V NYK LN + + IP+K ++ +
Sbjct: 1212 IKKADPTCRHRTAAFIVRNHSEEVAQKPRIVYNYKRLNDNMHTDPFNIPHKISMINLIQK 1271
Query: 1191 AKIFSKFDMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLKNAPSEFQRIMNEIFN 1250
A IFSKFD+K+GF ++L++ + T FT G Y WNV PFG+ NAP FQR M E F
Sbjct: 1272 ANIFSKFDLKAGFHHMKLKDDFKDWTTFTCSEGLYTWNVCPFGIANAPCAFQRFMQESFG 1331
Query: 1251 P*SKFAIVYIDDVLIFSQSIDQHFKHLNTFISIIKKNGMAVSKTKVSLFQTKIRFLGHNI 1310
KFA++YIDD+LI S + +H +HL F + +K+ G +SK K +F ++ +LG I
Sbjct: 1332 D-LKFALLYIDDILIASNNEKEHIEHLKIFFNRVKEVGCVLSKKKSKMFLKEVEYLGVEI 1390
Query: 1311 HQGTIIPINRAIEFTDKFPDQIIDKTQ-LQRFLGCLNYVADFCPQLSTIIKPLHDRLKKD 1369
+G I ++ KF ++ + LQ +LG LNY + LS ++ PL+ + K+
Sbjct: 1391 KEGKISLQPHIVDKIKKFDKNKLNTLKGLQAYLGLLNYARGYIKDLSKLVGPLYKKTGKN 1450
Query: 1370 PPP-WSDIHTNVVKQIKLRIKNLPCLYLPNPQAFKIVETDASDIGFGGIL-----KQKII 1423
++ N++ +I+ + + L P + I+ETDAS+ G+G +L K
Sbjct: 1451 GQRIFNKEDWNIIFKIEREVSKIKPLERPKETDYIIIETDASEEGWGAVLVCKPDKYSGK 1510
Query: 1424 DKEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISNFQSDLINQKFLVRVDCKS-AKDIL 1482
D E+I + S ++ ++ ++++ E+ AI +++ FQ +++ F +R DC++ K I
Sbjct: 1511 DTEKIAGYASGNFG-EKKTWTSLDYEIEAINEALNKFQI-YLDKDFTIRTDCEAIVKGIK 1568
Query: 1483 QKDVKNLA-SKHIFARWQAILSVFDFEIEYIKGSTNSLPDFLTRE 1526
+D K + ++ I R + + E+IKG+ N LP+FL+RE
Sbjct: 1569 TEDYKKRSKTRWIKLRDNLLKDGYKPTFEHIKGNKNFLPNFLSRE 1613
>POL3_DROME (P04323) Retrovirus-related Pol polyprotein from
transposon 17.6 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1058
Score = 219 bits (558), Expect = 5e-56
Identities = 138/412 (33%), Positives = 221/412 (53%), Gaps = 13/412 (3%)
Query: 1118 QFFQREINDLLQKKLIRRSKSPWSCAAFYVNKQAEIE-RGTPRLVINYKPLNQALCWIRY 1176
Q + +I D+L + +IR S SP++ + V K+ + + R+VI+Y+ LN+ R+
Sbjct: 221 QEVESQIQDMLNQGIIRTSNSPYNSPIWVVPKKQDASGKQKFRIVIDYRKLNEITVGDRH 280
Query: 1177 PIPNKKDLLARLHDAKIFSKFDMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLKN 1236
PIPN ++L +L F+ D+ GF QI++ + KTAF+ G YE+ MPFGLKN
Sbjct: 281 PIPNMDEILGKLGRCNYFTTIDLAKGFHQIEMDPESVSKTAFSTKHGHYEYLRMPFGLKN 340
Query: 1237 APSEFQRIMNEIFNP-*SKFAIVYIDDVLIFSQSIDQHFKHLNTFISIIKKNGMAVSKTK 1295
AP+ FQR MN+I P +K +VY+DD+++FS S+D+H + L + K + + K
Sbjct: 341 APATFQRCMNDILRPLLNKHCLVYLDDIIVFSTSLDEHLQSLGLVFEKLAKANLKLQLDK 400
Query: 1296 VSLFQTKIRFLGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQL 1355
+ + FLGH + I P IE K+P K +++ FLG Y F P
Sbjct: 401 CEFLKQETTFLGHVLTPDGIKPNPEKIEAIQKYPIPTKPK-EIKAFLGLTGYYRKFIPNF 459
Query: 1356 STIIKPLHDRLKKDP--PPWSDIHTNVVKQIKLRIKNLPCLYLPNPQAFKIVETDASDIG 1413
+ I KP+ LKK+ + + + K++K I P L +P+ + TDASD+
Sbjct: 460 ADIAKPMTKCLKKNMKIDTTNPEYDSAFKKLKYLISEDPILKVPDFTKKFTLTTDASDVA 519
Query: 1414 FGGILKQKIIDKEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISNFQSDLINQKFLVRV 1473
G +L Q +++ S+ N + NYST++KE+LAIV + F+ L+ + F +
Sbjct: 520 LGAVLSQ----DGHPLSYISRTLNEHEINYSTIEKELLAIVWATKTFRHYLLGRHFEISS 575
Query: 1474 DCKSAKDILQKDVKNLASKHIFARWQAILSVFDFEIEYIKGSTNSLPDFLTR 1525
D + + + +K+ SK RW+ LS FDF+I+YIKG N + D L+R
Sbjct: 576 DHQPLSWLYR--MKDPNSK--LTRWRVKLSEFDFDIKYIKGKENCVADALSR 623
>POL2_DROME (P20825) Retrovirus-related Pol polyprotein from
transposon 297 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1059
Score = 213 bits (542), Expect = 3e-54
Identities = 136/427 (31%), Positives = 226/427 (52%), Gaps = 15/427 (3%)
Query: 1103 IPTKARPIQMNEELLQFFQREINDLLQKKLIRRSKSPWSCAAFYVNKQAEIERGTP-RLV 1161
I +K P+ E+ + ++ ++L + LIR S SP++ + V K+ + R+V
Sbjct: 207 IYSKQYPLAQTHEIE--VENQVQEMLNQGLIRESNSPYNSPTWVVPKKPDASGANKYRVV 264
Query: 1162 INYKPLNQALCWIRYPIPNKKDLLARLHDAKIFSKFDMKSGFWQIQLQEKDRYKTAFTVP 1221
I+Y+ LN+ RYPIPN ++L +L + F+ D+ GF QI++ E+ KTAF+
Sbjct: 265 IDYRKLNEITIPDRYPIPNMDEILGKLGKCQYFTTIDLAKGFHQIEMDEESISKTAFSTK 324
Query: 1222 FGQYEWNVMPFGLKNAPSEFQRIMNEIFNP-*SKFAIVYIDDVLIFSQSIDQHFKHLNTF 1280
G YE+ MPFGL+NAP+ FQR MN I P +K +VY+DD++IFS S+ +H +
Sbjct: 325 SGHYEYLRMPFGLRNAPATFQRCMNNILRPLLNKHCLVYLDDIIIFSTSLTEHLNSIQLV 384
Query: 1281 ISIIKKNGMAVSKTKVSLFQTKIRFLGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQR 1340
+ + + + K + + FLGH + I P ++ +P DK +++
Sbjct: 385 FTKLADANLKLQLDKCEFLKKEANFLGHIVTPDGIKPNPIKVKAIVSYPIPTKDK-EIRA 443
Query: 1341 FLGCLNYVADFCPQLSTIIKPLHDRLKKDPPPWSD--IHTNVVKQIKLRIKNLPCLYLPN 1398
FLG Y F P + I KP+ LKK + + +++K I P L LP+
Sbjct: 444 FLGLTGYYRKFIPNYADIAKPMTSCLKKRTKIDTQKLEYIEAFEKLKALIIRDPILQLPD 503
Query: 1399 PQAFKIVETDASDIGFGGILKQKIIDKEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSIS 1458
+ ++ TDAS++ G +L Q I+F S+ N + NYS ++KE+LAIV +
Sbjct: 504 FEKKFVLTTDASNLALGAVLSQ----NGHPISFISRTLNDHELNYSAIEKELLAIVWATK 559
Query: 1459 NFQSDLINQKFLVRVDCKSAKDILQKDVKNLASKHIFARWQAILSVFDFEIEYIKGSTNS 1518
F+ L+ ++FL+ D + + + ++K +K RW+ LS + F+I+YIKG NS
Sbjct: 560 TFRHYLLGRQFLIASDHQPLRWL--HNLKEPGAK--LERWRVRLSEYQFKIDYIKGKENS 615
Query: 1519 LPDFLTR 1525
+ D L+R
Sbjct: 616 VADALSR 622
>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
Length = 2186
Score = 185 bits (469), Expect = 9e-46
Identities = 135/434 (31%), Positives = 218/434 (50%), Gaps = 24/434 (5%)
Query: 1103 IPTKARPIQMNEELLQFFQREINDLLQKKLIRRSKSPWSCAAFYVNKQAEIERGTPRLVI 1162
I K RPI + L ++ I +L +K+IR SKSPWS V K+ G+ R+ I
Sbjct: 943 IRQKPRPIPL--ALKPEIRKMIQKMLNQKVIRESKSPWSSPVVLVKKKD----GSIRMCI 996
Query: 1163 NYKPLNQALCWIRYPIPNKKDLLARLHDAKIFSKFDMKSGFWQIQLQEKDRYKTAFTVPF 1222
+Y+ +N+ + +P+PN + L L K+++ FDM +GFWQI L EK + TAF +
Sbjct: 997 DYRKVNKVVKNNAHPLPNIEATLQSLAGKKLYTVFDMIAGFWQIPLDEKSKEITAFAIGS 1056
Query: 1223 GQYEWNVMPFGLKNAPSEFQRIMNEIFNP-*SKFAIVYIDDVLIFSQSIDQHFKHLNTFI 1281
+EWNV+PFGL +P+ FQ M EI A VY+DD+LI S+ ++QH + + +
Sbjct: 1057 ELFEWNVLPFGLVISPALFQGTMEEIIGDLLGVCAFVYVDDLLIASKDMEQHLQDVKEAL 1116
Query: 1282 SIIKKNGMAVSKTKVSLFQTKIRFLGHNIHQGTIIPINRAIEFTDKFP--DQIIDKTQLQ 1339
+ I+K+GM + +K + + ++ +LGH + T+ + TDK + + +LQ
Sbjct: 1117 TRIRKSGMKLRASKCHIAKKEVEYLGHKV---TLDGVETQEVKTDKMKQFSRPTNVKELQ 1173
Query: 1340 RFLGCLNYVADFCPQLSTIIKPLHDRLK-KDPPPWSDIHTNVVKQIKLRIKNLPCLYLPN 1398
FLG + Y F + I L + K W +++K + P L P+
Sbjct: 1174 SFLGLVGYYRKFILNFAQIASSLTSLISAKVAWIWEKEQEIAFQELKKLVCQTPVLAQPD 1233
Query: 1399 PQAFK------IVETDASDIGFGGILKQKIIDKEQ-IIAFTSKHWNPAQQNYSTVKKEVL 1451
+A ++ TDAS G G +L Q+ D +Q IAF SK +PA+ Y E L
Sbjct: 1234 VEAALKGDRPFMIYTDASRKGIGAVLAQEGPDGQQHPIAFASKALSPAETRYHITDLEAL 1293
Query: 1452 AIVLSISNFQSDLINQKFLVRVDCKSAKDILQKDVKNLASKHIFARWQAILSVFDFEIEY 1511
A++ ++ F++ + V D K +L+ LA + RW + FD +I Y
Sbjct: 1294 AMMFALRRFKTIIYGTAITVFTDHKPLISLLKG--SPLADR--LWRWSIEILEFDVKIVY 1349
Query: 1512 IKGSTNSLPDFLTR 1525
+ G N++ D L+R
Sbjct: 1350 LAGKANAVADALSR 1363
>POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from
transposon opus [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1003
Score = 184 bits (467), Expect = 2e-45
Identities = 140/478 (29%), Positives = 239/478 (49%), Gaps = 28/478 (5%)
Query: 1068 ENIQSSICFDLPNAFWERKSHM-VELPYEKDF-SDKQIPTKAR----PIQMNEELLQFFQ 1121
+ I +S+ + P F S M VE + + ++ Q P A+ P+ M E+ +
Sbjct: 85 QEILNSLLGEFPRIFEPPLSGMSVETAVKAEIRTNTQDPIYAKSYPYPVNMRGEV----E 140
Query: 1122 REINDLLQKKLIRRSKSPWSCAAFYVNKQAEIERGTP-RLVINYKPLNQALCWIRYPIPN 1180
R+I++LLQ +IR S SP++ + V K+ + R+V+++K LN YPIP+
Sbjct: 141 RQIDELLQDGIIRPSNSPYNSPIWIVPKKPKPNGEKQYRMVVDFKRLNTVTIPDTYPIPD 200
Query: 1181 KKDLLARLHDAKIFSKFDMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLKNAPSE 1240
LA L +AK F+ D+ SGF QI ++E D KTAF+ G+YE+ +PFGLKNAP+
Sbjct: 201 INATLASLGNAKYFTTLDLTSGFHQIHMKESDIPKTAFSTLNGKYEFLRLPFGLKNAPAI 260
Query: 1241 FQRIMNEIFNP*-SKFAIVYIDDVLIFSQSIDQHFKHLNTFISIIKKNGMAVSKTKVSLF 1299
FQR++++I K VYIDD+++FS+ D H+K+L ++ + K + V+ K
Sbjct: 261 FQRMIDDILREHIGKVCYVYIDDIIVFSEDYDTHWKNLRLVLASLSKANLQVNLEKSHFL 320
Query: 1300 QTKIRFLGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTII 1359
T++ FLG+ + I + + + P K +L+RFLG +Y F + +
Sbjct: 321 DTQVEFLGYIVTADGIKADPKKVRAISEMPPPTSVK-ELKRFLGMTSYYRKFIQDYAKVA 379
Query: 1360 KPLHDRLK------------KDPPPWSDIHTNVVKQIKLRIKNLPCLYLPNPQAFKIVET 1407
KPL + + K P + +K + + L P + T
Sbjct: 380 KPLTNLTRGLYANIKSSQSSKVPITLDETALQSFNDLKSILCSSEILAFPCFTKPFHLTT 439
Query: 1408 DASDIGFGGILKQKIIDKEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISNFQSDLINQ 1467
DAS+ G +L Q +++ IA+ S+ N ++NY+T++KE+LAI+ S+ N ++ L
Sbjct: 440 DASNWAIGAVLSQDDQGRDRPIAYISRSLNKTEENYATIEKEMLAIIWSLDNLRAYLYGA 499
Query: 1468 KFLVRVDCKSAKDILQKDVKNLASKHIFARWQAILSVFDFEIEYIKGSTNSLPDFLTR 1525
++V +N +K RW+A + ++ E+ Y G +N + D L+R
Sbjct: 500 G-TIKVYTDHQPLTFALGNRNFNAK--LKRWKARIEEYNCELIYKPGKSNVVADALSR 554
>POLY_DROME (P10401) Retrovirus-related Pol polyprotein from
transposon gypsy [Contains: Reverse transcriptase (EC
2.7.7.49); Endonuclease]
Length = 1035
Score = 180 bits (456), Expect = 3e-44
Identities = 151/589 (25%), Positives = 280/589 (46%), Gaps = 58/589 (9%)
Query: 979 VRNLTHKVIIETPFIKKLFPYNTDEKGITVQHHGQPIVFKFSKPPFVKT----------L 1028
V+ L + + + +PF +T+ ++H VFK P F+ L
Sbjct: 40 VKELKNVMPVASPFSVSSIHGSTE-----IKHKCLMKVFKHISPFFLLDSLNAFDAIIGL 94
Query: 1029 NIISYKEKQINFLKEEISYKNIEVQLQQ---PSVKSRIENILENIQSSICFDLPNAFWER 1085
++++ ++N ++ + Y+ I +L PSV N + + S+ + + R
Sbjct: 95 DLLTQAGVKLNLAEDSLEYQGIAEKLHYFSCPSVNFTDVNDIV-VPDSVKKEFKDTIIRR 153
Query: 1086 KSHMVE----LPYE-------KDFSDKQIPTKARPIQMNEELLQFFQREINDLLQKKLIR 1134
K LP+ + ++ + ++A P M + F E+ LL+ +IR
Sbjct: 154 KKAFSTTNEALPFNTAVTATIRTVDNEPVYSRAYPTLMG--VSDFVNNEVKQLLKDGIIR 211
Query: 1135 RSKSPWSCAAFYVNKQAEIERGTP--RLVINYKPLNQALCWIRYPIPNKKDLLARLHDAK 1192
S+SP++ + V+K+ G P RLVI+++ LN+ RYP+P+ +LA L AK
Sbjct: 212 PSRSPYNSPTWVVDKKGTDAFGNPNKRLVIDFRKLNEKTIPDRYPMPSIPMILANLGKAK 271
Query: 1193 IFSKFDMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLKNAPSEFQRIMNEIF-NP 1251
F+ D+KSG+ QI L E DR KT+F+V G+YE+ +PFGL+NA S FQR ++++
Sbjct: 272 FFTTLDLKSGYHQIYLAEHDREKTSFSVNGGKYEFCRLPFGLRNASSIFQRALDDVLREQ 331
Query: 1252 *SKFAIVYIDDVLIFSQSIDQHFKHLNTFISIIKKNGMAVSKTKVSLFQTKIRFLGHNIH 1311
K VY+DDV+IFS++ H +H++T + + M VS+ K F+ + +LG +
Sbjct: 332 IGKICYVYVDDVIIFSENESDHVRHIDTVLKCLIDANMRVSQEKTRFFKESVEYLGFIVS 391
Query: 1312 QGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTIIKPLHDRL----- 1366
+ ++ ++P+ +++ FLG +Y F + I +P+ D L
Sbjct: 392 KDGTKSDPEKVKAIQEYPEPDC-VYKVRSFLGLASYYRVFIKDFAAIARPITDILKGENG 450
Query: 1367 -------KKDPPPWSDIHTNVVKQIK--LRIKNLPCLYLPNPQAFKIVETDASDIGFGGI 1417
KK P +++ N ++++ L +++ Y + F + TDAS G G +
Sbjct: 451 SVSKHMSKKIPVEFNETQRNAFQRLRNILASEDVILKYPDFKKPFDLT-TDASASGIGAV 509
Query: 1418 LKQKIIDKEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISNFQSDLINQKFLVRVDCKS 1477
L Q + + I S+ +QNY+T ++E+LAIV ++ Q+ L + ++ +
Sbjct: 510 LSQ----EGRPITMISRTLKQPEQNYATNERELLAIVWALGKLQNFLYGSR---EINIFT 562
Query: 1478 AKDILQKDVKNLASKHIFARWQAILSVFDFEIEYIKGSTNSLPDFLTRE 1526
L V + + RW++ + + ++ Y G N + D L+R+
Sbjct: 563 DHQPLTFAVADRNTNAKIKRWKSYIDQHNAKVFYKPGKENFVADALSRQ 611
>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
transposon 412 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1237
Score = 162 bits (411), Expect = 5e-39
Identities = 146/586 (24%), Positives = 261/586 (43%), Gaps = 32/586 (5%)
Query: 959 KIPSAIIKNDSLEIETSFLLVRN-LTHKVIIETPFIKKLFPYNTDEKGITVQHHGQPIVF 1017
K P I S I T+ L R+ + ++I+ + L P + GI V +
Sbjct: 162 KFPIYIPIAYSSGINTTLLPARSQVVRRLIVSSKDDNILIPNQEIQTGIYVAN-----TI 216
Query: 1018 KFSKPPFVKTLNIISYKE------------KQINFLKEEISYKNIEVQLQQPSVKSRIEN 1065
S FV+ LN + N ++ ++N V Q +K
Sbjct: 217 ATSSNTFVRILNTTDSDQLVNMDTLKYEPLSNYNVVQANSEHRNKTVLSQ---LKKNFPE 273
Query: 1066 ILENIQSSICFDLPNAF-WERKSHMVELPYEKDFSDKQI-PTKARPIQMNEELLQFFQRE 1123
+ ++ +IC + + F E + V Y++ K P + + ++ Q +
Sbjct: 274 LFKSQLENICSEYIDIFALESEPITVNNLYKQQLRLKDDEPVYTKNYRSPHSQVEEIQAQ 333
Query: 1124 INDLLQKKLIRRSKSPWSCAAFYVNKQAE--IERGTPRLVINYKPLNQALCWIRYPIPNK 1181
+ L++ K++ S S ++ V K++ ++ RLVI+Y+ +N+ L ++P+P
Sbjct: 334 VQKLIKDKIVEPSVSQYNSPLLLVPKKSSPNSDKKKWRLVIDYRQINKKLLADKFPLPRI 393
Query: 1182 KDLLARLHDAKIFSKFDMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLKNAPSEF 1241
D+L +L AK FS D+ SGF QI+L E R T+F+ G Y + +PFGLK AP+ F
Sbjct: 394 DDILDQLGRAKYFSCLDLMSGFHQIELDEGSRDITSFSTSNGSYRFTRLPFGLKIAPNSF 453
Query: 1242 QRIMNEIFNP*S-KFAIVYIDDVLIFSQSIDQHFKHLNTFISIIKKNGMAVSKTKVSLFQ 1300
QR+M F+ A +Y+DD+++ S K+L ++ + + K S F
Sbjct: 454 QRMMTIAFSGIEPSQAFLYMDDLIVIGCSEKHMLKNLTEVFGKCREYNLKLHPEKCSFFM 513
Query: 1301 TKIRFLGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTIIK 1360
++ FLGH I+P ++ + +P D +RF+ NY F + +
Sbjct: 514 HEVTFLGHKCTDKGILPDDKKYDVIQNYPVP-HDADSARRFVAFCNYYRRFIKNFADYSR 572
Query: 1361 PLHDRLKKDPP-PWSDIHTNVVKQIKLRIKNLPCLYLPNPQAFKIVETDASDIGFGGILK 1419
+ KK+ P W+D +K ++ N L P+ + TDAS G +L
Sbjct: 573 HITRLCKKNVPFEWTDECQKAFIHLKSQLINPTLLQYPDFSKEFCITTDASKQACGAVLT 632
Query: 1420 QKIIDKEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISNFQSDLINQKFLVRVDCKSAK 1479
Q + +A+ S+ + + N ST ++E+ AI +I +F+ + + F V+ D +
Sbjct: 633 QNHNGHQLPVAYASRAFTKGESNKSTTEQELAAIHWAIIHFRPYIYGKHFTVKTDHRPLT 692
Query: 1480 DILQKDVKNLASKHIFARWQAILSVFDFEIEYIKGSTNSLPDFLTR 1525
+ + N +SK R + L ++F +EY+KG N + D L+R
Sbjct: 693 YLF--SMVNPSSK--LTRIRLELEEYNFTVEYLKGKDNHVADALSR 734
>POL_SFV1 (P23074) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1161
Score = 103 bits (257), Expect = 4e-21
Identities = 83/320 (25%), Positives = 144/320 (44%), Gaps = 31/320 (9%)
Query: 1104 PTKARPIQMNEELLQFFQREINDLLQKKLIRRSKSPWSCAAFYVNKQAEIERGTPRLVIN 1163
P K PI N + Q I+DLL++ ++ + S + + V K G R+V++
Sbjct: 176 PQKQYPI--NPKAKPSIQIVIDDLLKQGVLIQQNSTMNTPVYPVPKPD----GKWRMVLD 229
Query: 1164 YKPLNQALCWIRYPIPNKKDLLARLHDAKIFSKFDMKSGFWQIQLQEKDRYKTAFTVPFG 1223
Y+ +N+ + I + +L+ ++ K + D+ +GFW + + + TAFT
Sbjct: 230 YREVNKTIPLIAAQNQHSAGILSSIYRGKYKTTLDLTNGFWAHPITPESYWLTAFTWQGK 289
Query: 1224 QYEWNVMPFGLKNAPSEFQR----IMNEIFNP*SKFAIVYIDDVLIFSQSIDQHFKHLNT 1279
QY W +P G N+P+ F ++ EI N Y+DD+ I +H + L
Sbjct: 290 QYCWTRLPQGFLNSPALFTADVVDLLKEIPN-----VQAYVDDIYISHDDPQEHLEQLEK 344
Query: 1280 FISIIKKNGMAVSKTKVSLFQTKIRFLGHNIHQGTIIPINRAIEFTDKFPDQII------ 1333
SI+ G VS K + Q ++ FLG NI TD F +++
Sbjct: 345 IFSILLNAGYVVSLKKSEIAQREVEFLGFNI-------TKEGRGLTDTFKQKLLNITPPK 397
Query: 1334 DKTQLQRFLGCLNYVADFCPQLSTIIKPLHDRLKKDPP---PWSDIHTNVVKQIKLRIKN 1390
D QLQ LG LN+ +F P S ++KPL+ + W++ ++N ++ I +
Sbjct: 398 DLKQLQSILGLLNFARNFIPNYSELVKPLYTIVANANGKFISWTEDNSNQLQHIISVLNQ 457
Query: 1391 LPCLYLPNPQAFKIVETDAS 1410
L NP+ I++ ++S
Sbjct: 458 ADNLEERNPETRLIIKVNSS 477
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.316 0.134 0.389
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 183,035,463
Number of Sequences: 164201
Number of extensions: 8287028
Number of successful extensions: 34081
Number of sequences better than 10.0: 342
Number of HSP's better than 10.0 without gapping: 96
Number of HSP's successfully gapped in prelim test: 251
Number of HSP's that attempted gapping in prelim test: 33042
Number of HSP's gapped (non-prelim): 951
length of query: 1526
length of database: 59,974,054
effective HSP length: 123
effective length of query: 1403
effective length of database: 39,777,331
effective search space: 55807595393
effective search space used: 55807595393
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 73 (32.7 bits)
Lotus: description of TM0173b.7