
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0019a.7
(1703 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic pro... 295 6e-79
POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic pro... 293 2e-78
POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic pro... 293 4e-78
POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic prot... 291 8e-78
POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic prot... 290 2e-77
POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic pro... 288 1e-76
POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic pro... 287 2e-76
POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;... 235 7e-61
POL_SOCMV (P15629) Enzymatic polyprotein [Contains: Aspartic pro... 227 2e-58
POL_RTBVP (P27502) Polyprotein (P194 protein) [Contains: Coat pr... 210 2e-53
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei... 201 1e-50
RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protei... 199 4e-50
RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protei... 197 3e-49
POL3_DROME (P04323) Retrovirus-related Pol polyprotein from tran... 191 1e-47
POL2_DROME (P20825) Retrovirus-related Pol polyprotein from tran... 189 7e-47
POLY_DROME (P10401) Retrovirus-related Pol polyprotein from tran... 172 7e-42
POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from tran... 170 4e-41
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III 168 1e-40
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran... 138 1e-31
POL_SFV1 (P23074) Pol polyprotein [Contains: Protease (EC 3.4.23... 97 4e-19
>POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 295 bits (756), Expect = 6e-79
Identities = 205/600 (34%), Positives = 311/600 (51%), Gaps = 55/600 (9%)
Query: 1131 LEIPALFDTGADSSCISEGLIPTRYFEKTTEKLSA--AEGSRLKIK-------------- 1174
+E+ DTGA S+ +IP ++ + A+GS + I
Sbjct: 38 IELHCFVDTGASLCIASKFVIPEEHWVNAERPIMVKIADGSSITISKVCKDIDLIIAREI 97
Query: 1175 YKIPSAIIKNGSLEIETPFLLVRNLSQKIIIGTPFIK----------KLFPYNTDENGIT 1224
+KIP+ + ++ F++ N Q + PFI+ K +P + +
Sbjct: 98 FKIPTVYQQESGID----FIIGNNFCQ---LYEPFIQFTDRVIFTKNKSYPVHIAKLTRA 150
Query: 1225 VQHLGQPIL------FKFSEP-PIDKTLNVISYKEKQINFLKEEISYRTIEEQLQQPSVK 1277
V+ + L K +P P++ + N I K+I L E R EE+L +
Sbjct: 151 VRVGTEGFLESMKKRSKTQQPEPVNISTNKIENPLKEIAILSE--GRRLSEEKLFITQQR 208
Query: 1278 -SRIEDILENIQSSICSDLPNAFWERKSHMVELPYEKDFSDKQISTKARPIQMNEELLHF 1336
+IE++LE + S D PN + ++L SD + K +P++ +
Sbjct: 209 MQKIEELLEKVCSENPLD-PNKTKQWMKASIKL------SDPSKAIKVKPMKYSPMDREE 261
Query: 1337 CQKEINDLLEKKLIRRSKSPWSCAAFYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIP 1396
K+I +LL+ K+I+ SKSP AF VN +AE RG R+V+NYK +N+A Y +P
Sbjct: 262 FDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATIGDAYNLP 321
Query: 1397 NKKDLLARQHDAKVFSKFDMKSGFWQIQLQEKDKYKTAFTVPFGQYEWNVMPFGLKNAPS 1456
NK +LL K+FS FD KSGFWQ+ L ++ + TAFT P G YEWNV+PFGLK APS
Sbjct: 322 NKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPS 381
Query: 1457 EFQRIMNEIFNPYSKLTIVYIDDVLIFSQTLDQHFKHLNTFISVIKRNGLAVSKTKVSLF 1516
FQR M+E F + K VY+DD+L+FS + H H+ + ++G+ +SK K LF
Sbjct: 382 IFQRHMDEAFRVFRKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLF 441
Query: 1517 QTKIRFLGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTII 1576
+ KI FLG I +GT P +E +KFPD + DK QLQRFLG L Y +D+ P+L+ I
Sbjct: 442 KKKINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIR 501
Query: 1577 KPLHDRLKKDPP-P*SDIHTNVVKQIKLRVKNLPCLYLPNPQAFKIIETDASDIGFGGIL 1635
KPL +LK++ P + T ++++K ++ P L+ P P+ IIETDASD +GG+L
Sbjct: 502 KPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGML 561
Query: 1636 K----QKVFDKEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISKFQSDLINQKFLVCVD 1691
K + + E I + S + A++NY + KE LA++ +I KF L FL+ D
Sbjct: 562 KAIKINEGTNTELICRYASGSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTD 621
>POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 293 bits (751), Expect = 2e-78
Identities = 204/600 (34%), Positives = 311/600 (51%), Gaps = 55/600 (9%)
Query: 1131 LEIPALFDTGADSSCISEGLIPTRYFEKTTEKLSA--AEGSRLKIK-------------- 1174
+E+ DTGA S+ +IP ++ + A+GS + I
Sbjct: 38 IELHCFVDTGASLCIASKFVIPEEHWVNAERPIMVKIADGSSITISKVCKDIDLIIAGEI 97
Query: 1175 YKIPSAIIKNGSLEIETPFLLVRNLSQKIIIGTPFIK----------KLFPYNTDENGIT 1224
+KIP+ + ++ F++ N Q + PFI+ K +P + +
Sbjct: 98 FKIPTVYQQESGID----FIIGNNFCQ---LYEPFIQFTDRVIFTKNKSYPVHITKLTRA 150
Query: 1225 VQHLGQPIL------FKFSEP-PIDKTLNVISYKEKQINFLKEEISYRTIEEQLQQPSVK 1277
V+ + L K +P P++ + N I ++I L E R EE+L +
Sbjct: 151 VRVGIEGFLESMKKRSKTQQPEPVNISTNKIENPLEEIAILSE--GRRLSEEKLFITQQR 208
Query: 1278 -SRIEDILENIQSSICSDLPNAFWERKSHMVELPYEKDFSDKQISTKARPIQMNEELLHF 1336
+IE++LE + S D PN + ++L SD + K +P++ +
Sbjct: 209 MQKIEELLEKVCSENPLD-PNKTKQWMKASIKL------SDPSKAIKVKPMKYSPMDREE 261
Query: 1337 CQKEINDLLEKKLIRRSKSPWSCAAFYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIP 1396
K+I +LL+ K+I+ SKSP AF VN +AE RG R+V+NYK +N+A Y +P
Sbjct: 262 FDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATIGDAYNLP 321
Query: 1397 NKKDLLARQHDAKVFSKFDMKSGFWQIQLQEKDKYKTAFTVPFGQYEWNVMPFGLKNAPS 1456
NK +LL K+FS FD KSGFWQ+ L ++ + TAFT P G YEWNV+PFGLK APS
Sbjct: 322 NKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPS 381
Query: 1457 EFQRIMNEIFNPYSKLTIVYIDDVLIFSQTLDQHFKHLNTFISVIKRNGLAVSKTKVSLF 1516
FQR M+E F + K VY+DD+L+FS + H H+ + ++G+ +SK K LF
Sbjct: 382 IFQRHMDEAFRVFRKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLF 441
Query: 1517 QTKIRFLGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTII 1576
+ KI FLG I +GT P +E +KFPD + DK QLQRFLG L Y +D+ P+L+ I
Sbjct: 442 KKKINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIR 501
Query: 1577 KPLHDRLKKDPP-P*SDIHTNVVKQIKLRVKNLPCLYLPNPQAFKIIETDASDIGFGGIL 1635
KPL +LK++ P + T ++++K ++ P L+ P P+ IIETDASD +GG+L
Sbjct: 502 KPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGML 561
Query: 1636 K----QKVFDKEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISKFQSDLINQKFLVCVD 1691
K + + E I + S + A++NY + KE LA++ +I KF L FL+ D
Sbjct: 562 KAIKINEGTNTELICRYASGSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTD 621
>POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 293 bits (749), Expect = 4e-78
Identities = 203/598 (33%), Positives = 307/598 (50%), Gaps = 51/598 (8%)
Query: 1131 LEIPALFDTGADSSCISEGLIPTRYFEKTTEKLSA--AEGSRLKIKYKIPSAIIKNGSLE 1188
+E+ DTGA S+ +IP ++ + A+GS + I S + K+ L
Sbjct: 38 IELHCFVDTGASLCIASKFVIPEEHWVNAERPIMVKIADGSSITI-----SKVCKDIDLI 92
Query: 1189 IETPFLLVRNLSQK-----IIIGTPFIKKLFPY-NTDENGITVQHLGQPILF-------- 1234
I + + Q+ IIG F + P+ + I ++ P+
Sbjct: 93 IAGEIFRIPTVYQQESGIDFIIGNNFCQLYEPFIQFTDRVIFTKNKSYPVHIAKLTRAVR 152
Query: 1235 --------------KFSEP-PIDKTLNVISYKEKQINFLKEEISYRTIEEQLQQPSVK-S 1278
K +P P++ + N I ++I L E R EE+L +
Sbjct: 153 VGTEGFLESMKKRSKTQQPEPVNISTNKIENPLEEIAILSE--GRRLSEEKLFITQQRMQ 210
Query: 1279 RIEDILENIQSSICSDLPNAFWERKSHMVELPYEKDFSDKQISTKARPIQMNEELLHFCQ 1338
+IE++LE + S D PN + ++L SD + K +P++ +
Sbjct: 211 KIEELLEKVCSENPLD-PNKTKQWMKASIKL------SDPSKAIKVKPMKYSPMDREEFD 263
Query: 1339 KEINDLLEKKLIRRSKSPWSCAAFYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIPNK 1398
K+I +LL+ K+I+ SKSP AF VN +AE RG R+V+NYK +N+A Y +PNK
Sbjct: 264 KQIKELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATVGDAYNLPNK 323
Query: 1399 KDLLARQHDAKVFSKFDMKSGFWQIQLQEKDKYKTAFTVPFGQYEWNVMPFGLKNAPSEF 1458
+LL K+FS FD KSGFWQ+ L ++ + TAFT P G YEWNV+PFGLK APS F
Sbjct: 324 DELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIF 383
Query: 1459 QRIMNEIFNPYSKLTIVYIDDVLIFSQTLDQHFKHLNTFISVIKRNGLAVSKTKVSLFQT 1518
QR M+E F + K VY+DD+L+FS + H H+ + ++G+ +SK K LF+
Sbjct: 384 QRHMDEAFRVFRKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKK 443
Query: 1519 KIRFLGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTIIKP 1578
KI FLG I +GT P +E +KFPD + DK QLQRFLG L Y +D+ P+L+ I KP
Sbjct: 444 KINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKP 503
Query: 1579 LHDRLKKDPP-P*SDIHTNVVKQIKLRVKNLPCLYLPNPQAFKIIETDASDIGFGGILK- 1636
L +LK++ P + T ++++K ++ P L+ P P+ IIETDASD +GG+LK
Sbjct: 504 LQAKLKENVPWRWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKA 563
Query: 1637 ---QKVFDKEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISKFQSDLINQKFLVCVD 1691
+ + E I + S + A++NY + KE LA++ +I KF L FL+ D
Sbjct: 564 IKINEGTNTELICRYASGSFKAAEKNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTD 621
>POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 666
Score = 291 bits (746), Expect = 8e-78
Identities = 193/575 (33%), Positives = 292/575 (50%), Gaps = 32/575 (5%)
Query: 1138 DTGADSSCISEGLIPTRYFEKTTEKLSA--AEGSRLKIKYKIPSAIIKNGSLEIETPFLL 1195
DTGA S +IP +E + + + A +KI + +K E P +
Sbjct: 54 DTGASLCIASRYIIPEELWENSPKDIQVKIANQELIKITKVCKNLKVKFAGKSFEIPTVY 113
Query: 1196 VRNLSQKIIIGTPFIKKLFPYNTDENGITVQHLGQPIL-------FKFSEPPIDKTLNVI 1248
+ +IG F + P+ E+ I + +L F S P + +
Sbjct: 114 QQETGIDFLIGNNFCRLYNPFIQWEDRIAFHLKNEMVLIKKVTKAFSVSNPSFLENMKKD 173
Query: 1249 SYKEKQI-------NFLKEEISYRTIEEQLQQPSVKSRIEDILENIQSSICSDLPNAFWE 1301
S K +QI N + E Y I E+ Q+ +E + +CS+ P +
Sbjct: 174 S-KTEQIPGTNISKNIINPEERYFLITEKYQK----------IEQLLDKVCSENPIDPIK 222
Query: 1302 RKSHMVELPYEKDFSDKQISTKARPIQMNEELLHFCQKEINDLLEKKLIRRSKSPWSCAA 1361
K M D + +P+ + + K+I +LL+ LI SKS A
Sbjct: 223 SKQWMKA---SIKLIDPLKVIRVKPMSYSPQDREGFAKQIKELLDLGLIIPSKSQHMSPA 279
Query: 1362 FYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIPNKKDLLARQHDAKVFSKFDMKSGFW 1421
F V +AE RG R+V+NYK +NQA + +PN ++LL +FS FD KSGFW
Sbjct: 280 FLVENEAERRRGKKRMVVNYKAINQATIGDSHNLPNMQELLTLLRGKSIFSSFDCKSGFW 339
Query: 1422 QIQLQEKDKYKTAFTVPFGQYEWNVMPFGLKNAPSEFQRIMNEIFNPYSKLTIVYIDDVL 1481
Q+ L E+ + TAFT P G ++W V+PFGLK APS FQR M N K +VY+DD++
Sbjct: 340 QVVLDEESQKLTAFTCPQGHFQWKVVPFGLKQAPSIFQRHMQTALNGADKFCMVYVDDII 399
Query: 1482 IFSQTLDQHFKHLNTFISVIKRNGLAVSKTKVSLFQTKIRFLGHNIHQGTIIPINRAIEF 1541
+FS + H+ H+ + ++++ G+ +SK K +LF+ KI FLG I +GT P N +E
Sbjct: 400 VFSNSELDHYNHVYAVLKIVEKYGIILSKKKANLFKEKINFLGLEIDKGTHCPQNHILEN 459
Query: 1542 TDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTIIKPLHDRLKKDPP-P*SDIHTNVVKQ 1600
KFPD++ DK LQRFLG L Y + P+L+ I KPL +LKKD + ++ VK+
Sbjct: 460 IHKFPDRLEDKKHLQRFLGVLTYAETYIPKLAEIRKPLQVKLKKDVTWNWTQSDSDYVKK 519
Query: 1601 IKLRVKNLPCLYLPNPQAFKIIETDASDIGFGGILKQKVFDKEQIIA-FTSKHWNPAQQN 1659
IK + + P LYLP P+ IIETDASD +GG+LK + D ++I ++S + A++N
Sbjct: 520 IKKNLGSFPKLYLPKPEDHLIIETDASDSFWGGVLKARALDGVELICRYSSGSFKQAEKN 579
Query: 1660 YSTVKKEVLAIVLSISKFQSDLINQKFLVCVDCKS 1694
Y + KE+LA+ I+KF + L +F V D K+
Sbjct: 580 YHSNDKELLAVKQVITKFSAYLTPVRFTVRTDNKN 614
>POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 659
Score = 290 bits (743), Expect = 2e-77
Identities = 199/591 (33%), Positives = 305/591 (50%), Gaps = 40/591 (6%)
Query: 1131 LEIPALFDTGADSSCISEGLIPTRYFEKTTEK---LSAAEGSRLKIKYKIPSAIIKNGSL 1187
L++ DTG+ S+ +IP Y++ T EK + A G +++ I+ G
Sbjct: 27 LDLHCYVDTGSSLCMASKYVIPEEYWQ-TAEKPLNIKIANGKIIQLTKVCSKLPIRLGGE 85
Query: 1188 EIETPFLLVRNLSQKIIIGTPFIKKLFPYNTDENGITVQHLGQPILF------------- 1234
P L + +++G F + P+ + I Q ++
Sbjct: 86 RFLIPTLFQQESGIDLLLGNNFCQLYSPFIQYTDRIYFHLNKQSVIIGKITKAYQYGVKG 145
Query: 1235 ------KFSEPPIDKTLNVISYKEKQINFLKEEISYRTIEEQLQQPSVK--SRIEDILEN 1286
K S+ + +N+ S Q FL+E ++ ++E L + + S IE++LE
Sbjct: 146 FLESMKKKSKVNRPEPINITS---NQHLFLEEGGNH--VDEMLYEIQISKFSAIEEMLER 200
Query: 1287 IQSSICSDLPNAFWERKSHMVELPYEKDFSDKQISTKARPIQMNEELLHFCQKEINDLLE 1346
+ S D P + + +EL D + K +P+ + ++I +LLE
Sbjct: 201 VSSENPID-PEKSKQWMTATIEL------IDPKTVVKVKPMSYSPSDREEFDRQIKELLE 253
Query: 1347 KKLIRRSKSPWSCAAFYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIPNKKDLLARQH 1406
K+I+ SKS AF V +AE RG R+V+NYK +N+A + +PNK +LL
Sbjct: 254 LKVIKPSKSTHMSPAFLVENEAERRRGKKRMVVNYKAMNKATKGDAHNLPNKDELLTLVR 313
Query: 1407 DAKVFSKFDMKSGFWQIQLQEKDKYKTAFTVPFGQYEWNVMPFGLKNAPSEFQR-IMNEI 1465
K++S FD KSG WQ+ L ++ + TAFT P G Y+WNV+PFGLK APS F + N
Sbjct: 314 GKKIYSSFDCKSGLWQVLLDKESQLLTAFTCPQGHYQWNVVPFGLKQAPSIFPKTYANSH 373
Query: 1466 FNPYSKLTIVYIDDVLIFSQT-LDQHFKHLNTFISVIKRNGLAVSKTKVSLFQTKIRFLG 1524
N YSK VY+DD+L+FS T +H+ H+ + ++ G+ +SK K LF+ KI FLG
Sbjct: 374 SNQYSKYCCVYVDDILVFSNTGRKEHYIHVLNILRRCEKLGIILSKKKAQLFKEKINFLG 433
Query: 1525 HNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTIIKPLHDRLK 1584
I QGT P N +E KFPD+I DK QLQRFLG L Y +D+ P+L++I KPL +LK
Sbjct: 434 LEIDQGTHCPQNHILEHIHKFPDRIEDKKQLQRFLGILTYASDYIPKLASIRKPLQSKLK 493
Query: 1585 KDPP-P*SDIHTNVVKQIKLRVKNLPCLYLPNPQAFKIIETDASDIGFGGILKQKVFDKE 1643
+D +D + + +IK +K+ P LY P P +IETDAS+ +GGILK E
Sbjct: 494 EDSTWTWNDTDSQYMAKIKKNLKSFPKLYHPEPNDKLVIETDASEEFWGGILKAIHNSHE 553
Query: 1644 QIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISKFQSDLINQKFLVCVDCKS 1694
I + S + A++NY + +KE+LA++ I KF L +FL+ D K+
Sbjct: 554 YICRYASGSFKAAERNYHSNEKELLAVIRVIKKFSIYLTPSRFLIRTDNKN 604
>POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 680
Score = 288 bits (736), Expect = 1e-76
Identities = 203/602 (33%), Positives = 311/602 (50%), Gaps = 59/602 (9%)
Query: 1131 LEIPALFDTGADSSCISEGLIPTRYFEKTTEKLSA--AEGSRLKIK-------------- 1174
+E+ DTGA S+ +IP ++ + A+GS + I
Sbjct: 39 IELHCFVDTGASLCIASKFVIPEEHWVNAERPIMVKIADGSSITISKVCKDIDLIIVGVI 98
Query: 1175 YKIPSAIIKNGSLEIETPFLLVRNLSQKIIIGTPFIK----------KLFPYNTDENGIT 1224
+KIP+ + ++ F++ N Q + PFI+ K +P + +
Sbjct: 99 FKIPTVYQQESGID----FIIGNNFCQ---LYEPFIQFTDRVIFTKNKSYPVHIAKLTRA 151
Query: 1225 VQHLGQPIL------FKFSEP-PIDKTLNVISYKEKQINFLKEEISYRTIEEQL---QQP 1274
V+ + L K +P P++ + N I ++I L E R EE+L QQ
Sbjct: 152 VRVGTEGFLESMKKRSKTQQPEPVNISTNKIENPLEEIAILSE--GRRLSEEKLFITQQR 209
Query: 1275 SVKSRIEDILENIQSSICSDLPNAFWERKSHMVELPYEKDFSDKQISTKARPIQMNEELL 1334
K+ E++LE + S D PN + ++L SD + K +P++ +
Sbjct: 210 MQKT--EELLEKVCSENPLD-PNKTKQWMKASIKL------SDPSKAIKVKPMKYSPMDR 260
Query: 1335 HFCQKEINDLLEKKLIRRSKSPWSCAAFYVNKQAEIERGTPRLVINYKPLNQALCWIRYP 1394
K+I +LL+ K+I+ SKSP AF VN +AE RG R+V+NYK +N+A Y
Sbjct: 261 EEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAENGRGNKRMVVNYKAMNKATVGDAYN 320
Query: 1395 IPNKKDLLARQHDAKVFSKFDMKSGFWQIQLQEKDKYKTAFTVPFGQYEWNVMPFGLKNA 1454
+PNK +LL K+FS FD KSGFWQ+ L ++ + TAFT P G YEWNV+PFGLK A
Sbjct: 321 LPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQA 380
Query: 1455 PSEFQRIMNEIFNPYSKLTIVYIDDVLIFSQTLDQHFKHLNTFISVIKRNGLAVSKTKVS 1514
PS FQR M+E F + K VY+DD+++FS + H H+ + ++G+ +SK K
Sbjct: 381 PSIFQRHMDEAFRVFRKFCCVYVDDIVVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQ 440
Query: 1515 LFQTKIRFLGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLST 1574
LF+ KI FLG I +GT P +E +KFPD + DK QLQRFLG L Y +D+ P L+
Sbjct: 441 LFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPNLAQ 500
Query: 1575 IIKPLHDRLKKDPP-P*SDIHTNVVKQIKLRVKNLPCLYLPNPQAFKIIETDASDIGFGG 1633
+ +PL +LK++ P + T ++++K ++ P L+ P P+ IIETDASD +GG
Sbjct: 501 MRQPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGG 560
Query: 1634 ILK----QKVFDKEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISKFQSDLINQKFLVC 1689
+LK + + E I + S + A++NY + KE LA++ +I KF L FL+
Sbjct: 561 MLKAIKINEGTNTELICRYRSGSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIR 620
Query: 1690 VD 1691
D
Sbjct: 621 TD 622
>POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 674
Score = 287 bits (735), Expect = 2e-76
Identities = 167/418 (39%), Positives = 242/418 (56%), Gaps = 12/418 (2%)
Query: 1279 RIEDILENIQSSICSDLPNAFWERKSHMVELPYEKDFSDKQISTKARPIQMNEELLHFCQ 1338
+IE++LE + S D PN + ++L SD + K +P++ +
Sbjct: 206 KIEELLEKVCSENPLD-PNKTKQWMKASIKL------SDPSKAIKVKPMKYSPMDREEFD 258
Query: 1339 KEINDLLEKKLIRRSKSPWSCAAFYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIPNK 1398
K+I +LL+ K+I+ SKSP AF VN +AE RG R+V+NYK +N+A Y PNK
Sbjct: 259 KQIKELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATVGDAYNPPNK 318
Query: 1399 KDLLARQHDAKVFSKFDMKSGFWQIQLQEKDKYKTAFTVPFGQYEWNVMPFGLKNAPSEF 1458
+LL K+FS FD KSGFWQ+ L ++ + TAFT P G YEWNV+PFGLK APS F
Sbjct: 319 DELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIF 378
Query: 1459 QRIMNEIFNPYSKLTIVYIDDVLIFSQTLDQHFKHLNTFISVIKRNGLAVSKTKVSLFQT 1518
QR M+E F + K VY+DD+L+FS + H H+ + ++G+ +SK K LF+
Sbjct: 379 QRHMDEAFRVFRKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKK 438
Query: 1519 KIRFLGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTIIKP 1578
KI FLG I +GT P +E +KFPD + DK QLQRFLG L Y +D+ P+L+ I KP
Sbjct: 439 KINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKP 498
Query: 1579 LHDRLKKDPP-P*SDIHTNVVKQIKLRVKNLPCLYLPNPQAFKIIETDASDIGFGGILK- 1636
L +LK++ P + T ++++K ++ P L+ P P+ IIETDASD +GG+LK
Sbjct: 499 LQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKA 558
Query: 1637 ---QKVFDKEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISKFQSDLINQKFLVCVD 1691
+ + E I + S + A++NY + KE LA++ +I KF L FL+ D
Sbjct: 559 IKINEGTNTELICRYASGSFKAAEKNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTD 616
>POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;
Protease (EC 3.4.23.-); Reverse transcriptase (EC
2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
Length = 1886
Score = 235 bits (600), Expect = 7e-61
Identities = 207/714 (28%), Positives = 345/714 (47%), Gaps = 80/714 (11%)
Query: 1035 DTFKRLEKSTVKPVTIQDL-HFEINSLKTEVKSLKQ-----IQKSQQLILEKLTKNYEED 1088
+ K EK++ TIQ+ E+N +K E++ K+ I + ++ I+ + EE
Sbjct: 1114 EALKHSEKASRVFSTIQESDEAELNLIKEELRQFKEETRMAIAQLKEAIIVQEEDTIEER 1173
Query: 1089 DSSIPDSNPAPNDNCEDFLENINQVTIQKFF---IHVKILIGDFVLE---IPALFDTGAD 1142
+ I + E ENI T + + +VK+ I +E I A+ DTGA
Sbjct: 1174 CAMILE---------EKHTENIYSATAKAEYNGLYNVKVGIKPDNMEPYYINAIVDTGAT 1224
Query: 1143 SSCISEGLIPTRYFE--KTTEKLSAAEGSRLKIKYKIPSAIIKNGSLEIETPFLLVRNLS 1200
+ I IP Y+E K T + G + I + I G P V N+
Sbjct: 1225 ACLIQISAIPENYYEDAKVTVNFRSVLGIGTSTQM-IKAGRILIGEQYFRMPVTYVMNMG 1283
Query: 1201 Q----KIIIGTPFIKKLFPYNTDENGITVQHLGQPILFKFSEPPIDKTLNVI-SYKEKQI 1255
++IIG FI+ L E G+ ++ + +T V S +E ++
Sbjct: 1284 LSPGIQMIIGCSFIRSL------EGGLRIEKDIITFYKLVTSIETSRTTQVANSIEELEL 1337
Query: 1256 NFLKEEISYRTIEEQLQQPSVKS-----RIEDILENIQS-SICSDLPNAFWERKSHMVEL 1309
+ E Y I ++ PS + +D+L+ ++ + P FW+ +L
Sbjct: 1338 S----EDEYLNIAASVETPSFLDQEFARKNKDLLKEMKEMKYIGENPMEFWKNNKIKCKL 1393
Query: 1310 PYEKDFSDKQISTKARPIQM----NEELLHFCQKEINDLLEKKLIRRSKSPWSCAAFYVN 1365
+ + I RPI+ +EE + ++IN LL+ K+IR S+S AF V
Sbjct: 1394 ----NIINPDIKIMGRPIKHVTPGDEEAM---TRQINLLLQMKVIRPSESKHRSTAFIVR 1446
Query: 1366 KQAEIE-------RGTPRLVINYKPLNQALCWIRYPIPNKKDLLARQHDAKVFSKFDMKS 1418
EI+ +G R+V NYK LN+ +Y +P ++++ +K++SKFD+KS
Sbjct: 1447 SGTEIDPITGKEKKGKERMVFNYKLLNENTESDQYSLPGINTIISKVGRSKIYSKFDLKS 1506
Query: 1419 GFWQIQLQEKDKYKTAFTVPFGQYEWNVMPFGLKNAPSEFQRIMNEIFNPYSKLTIVYID 1478
GFWQ+ ++E+ TAF YEW VMPFGLKNAP+ FQR M+ +F K VYID
Sbjct: 1507 GFWQVAMEEESVPWTAFLAGNKLYEWLVMPFGLKNAPAIFQRKMDNVFKGTEKFIAVYID 1566
Query: 1479 DVLIFSQTLDQHFKHLNTFISVIKRNGLAVSKTKVSLFQTKIRFLGHNIHQGTI----IP 1534
D+L+FS+T +QH +HL T + + K NGL +S TK+ + +I FLG ++ I
Sbjct: 1567 DILVFSETAEQHSQHLYTMLQLCKENGLILSPTKMKIGTPEIDFLGASLGCTKIKLQPHI 1626
Query: 1535 INRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTIIKPLHDRLKKDPPP*SDIH 1594
I++ +F+D +++ ++ +LG L+Y ++ + +++PL ++ +
Sbjct: 1627 ISKICDFSD---EKLATPEGMRSWLGILSYARNYIQDIGKLVQPLRQKMAPTGDKRMNPE 1683
Query: 1595 T-NVVKQIKLRVKNLPCLYLPNPQAFKIIETDASDIGFGGILKQKVF-----DKEQIIAF 1648
T +V+QIK +VKNLP L LP +F IIETD G+G + K K+ E+I A+
Sbjct: 1684 TWKMVRQIKEKVKNLPDLQLPPKDSFIIIETDGCMTGWGAVCKWKMSKHDPRSTERICAY 1743
Query: 1649 TSKHWNPAQQNYSTVKKEVLAIVLSISKFQSDLINQKFLVC-VDCKSAKEILQK 1701
S +NP + ST+ E+ A + + KF+ +++K L+ DC++ + K
Sbjct: 1744 ASGSFNPIK---STIDAEIQAAIHGLDKFKIYYLDKKELIIRSDCEAIIKFYNK 1794
Score = 33.9 bits (76), Expect = 3.9
Identities = 43/251 (17%), Positives = 90/251 (35%), Gaps = 43/251 (17%)
Query: 735 LSNLKCKSLGDFRWYKDTFLTRVYTR----EDSQQAFWKEKFLAGLPKSFGDKVREKLRS 790
L L C + R Y +LT +++ E+ +P + G++V + +
Sbjct: 735 LKQLVCPNYQSIRRYLMDYLTLAAETGLMWSETEGPAISEELFTKMPAAIGERVAQAYKI 794
Query: 791 QNPGGEIPYHTLSYGQLIAIIQRVALKICQDDKIQQQLTKEKSQNRRDLGTFCEQFGIQG 850
+P + + Y + + ++ C++ + L FC F I+G
Sbjct: 795 MDPTSAVNLPSRVYFTINYLTEQ-----CKEASYMRSLKALD---------FCRDFPIEG 840
Query: 851 CPKKPKPRKQDPPPKQQWRKRSSRNDHRKPKPRSKPQSSQIPKNPPETRPSQGKDVTCYN 910
+ +K+ K + ++H + T+ + CY
Sbjct: 841 YYGRSGEKKKYTARKATKYTGKAHDNHIRV-----------------TKAKYQRKCKCYI 883
Query: 911 CGKPGHISRYCRLKRRISE-LHLEPEIEDKINNLLIQTSDEEESNPSDSEV-------SE 962
CG+ GH + CR K + + + + ++ K N ++ D+EE + V E
Sbjct: 884 CGQEGHYANQCRNKHKDQQRVAILQSLDLKENEEVVSADDKEEEDDEIFSVLGEEDYQEE 943
Query: 963 DLNQIQNDDSQ 973
+ ++ DD Q
Sbjct: 944 TIMVLEEDDIQ 954
>POL_SOCMV (P15629) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 692
Score = 227 bits (578), Expect = 2e-58
Identities = 173/537 (32%), Positives = 276/537 (51%), Gaps = 34/537 (6%)
Query: 1119 FIHVKILIGDFVLEIPALFDTGADSSCISEGLIPTRY-FEKTTEKLSAAEGSRLKIKYKI 1177
FI V I +F+ A DTGA + C + I + K +++ A+ S+ I+ I
Sbjct: 21 FIKVSIGKRNFL----AYIDTGA-TLCFGKRKISNNWEILKQPKEIIIADKSKHYIREAI 75
Query: 1178 PSAIIKNGSLEIETPFLLVRNLSQKIIIGTPFIKKLFPYNTDENGITVQ--HLGQPILFK 1235
+ +K + E P + + + +IIG F+K P+ I ++ +L P +
Sbjct: 76 SNVFLKIENKEFLIPIIYLHDSGLDLIIGNNFLKLYQPFIQRLETIELRWKNLNNPKESQ 135
Query: 1236 FSEPPIDKTLNVISYKEKQINF-LKEEISYRTIEEQLQQPSVKSRIEDILENIQSSICSD 1294
I V+ ++I+ L++ + ++TIEEQL++ +CS+
Sbjct: 136 MISTKILTKNEVLKLSFEKIHICLEKYLFFKTIEEQLEE-----------------VCSE 178
Query: 1295 LPNAFWERKSHMVELPYEKDFSDKQISTKARPIQMNEELLHFCQKEINDLLEKKLIRRSK 1354
P + K+ ++ KD + T P + + + ++E DLL+K LIR S+
Sbjct: 179 HPLDETKNKNGLLIEIRLKDPLQEINVTNRIPYTIRD--VQEFKEECEDLLKKGLIRESQ 236
Query: 1355 SPWSCAAFYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIPNKKDLLARQHDAKVFSKF 1414
SP S AFYV EI+RG R+VINYK +N+A Y +P K +L + + FS
Sbjct: 237 SPHSAPAFYVENHNEIKRGKRRMVINYKKMNEATIGDSYKLPRKDFILEKIKGSLWFSSL 296
Query: 1415 DMKSGFWQIQLQEKDKYKTAFTV-PFGQYEWNVMPFGLKNAPSEFQRIMNEIFNPYSKLT 1473
D KSG++Q++L E K TAF+ P YEWNV+ FGLK APS +QR M++ +
Sbjct: 297 DAKSGYYQLRLHENTKPLTAFSCPPQKHYEWNVLSFGLKQAPSIYQRFMDQSLKGLEHIC 356
Query: 1474 IVYIDDVLIFSQ-TLDQHFKHLNTFISVIKRNGLAVSKTKVSLFQTKIRFLGHNIH-QGT 1531
+ YIDD+LIF++ + +QH + + IK G+ +SK K L Q +I +LG I G
Sbjct: 357 LAYIDDILIFTKGSKEQHVNDVRIVLQRIKEKGIIISKKKSKLIQQEIEYLGLKIQGNGE 416
Query: 1532 IIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVAD--FCPQLSTIIKPLHDRLK-KDPP 1588
I E +FPD++ D+ Q+QRFLGC+NY+A+ F L+ K L ++ K+P
Sbjct: 417 IDLSPHTQEKILQFPDELEDRKQIQRFLGCINYIANEGFFKNLALERKHLQKKISVKNPW 476
Query: 1589 P*SDIHTNVVKQIKLRVKNLPCLYLPNPQAFKIIETDASDIGFGGILKQKVFDKEQI 1645
I T +V+ IK ++++LP LY + Q F I+ETDAS + G L+ K++I
Sbjct: 477 KWDTIDTKMVQSIKGKIQSLPKLYNASIQDFLIVETDASQHSWSGCLRALPKGKQKI 533
>POL_RTBVP (P27502) Polyprotein (P194 protein) [Contains: Coat
protein; Protease (EC 3.4.23.-); Reverse transcriptase
(EC 2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
Length = 1675
Score = 210 bits (535), Expect = 2e-53
Identities = 213/844 (25%), Positives = 374/844 (44%), Gaps = 94/844 (11%)
Query: 899 RPSQGKDVTCYNCGKPGHISRYCRLKRRISELHLEPEIEDKINNLLIQTSDEEESNPSDS 958
RPS K CY C H++ C RR + I+ +++ SD+E+
Sbjct: 765 RPSIKKKCRCYICQDENHLANRC--PRRYTNQARASLIDGLDEDIVSIASDDED------ 816
Query: 959 EVSEDLNQIQNDDSQSSSSVNTLSINTLTNEQDLLFRAINSIPDPEEK---KIYLERLRS 1015
+ L I+ D+ + SS + ++D + + D + K +
Sbjct: 817 -IENFLEIIELDEFIAHSSQEHEHTWEIGGKKDKVCEICSYFTDYNKTVSCKTCETQYCK 875
Query: 1016 TLEDRPPKSPITINKFNLRDTFKRLEKSTVKPVTIQDLHFEINSLKTEVKSLKQIQKSQQ 1075
T D+ L ++K T + I DL + +L+ V L+ + Q
Sbjct: 876 TCSDQ------------LALEVTEVKKPTKEETMIDDLKLNVKNLEFRVTILEHKVEMQN 923
Query: 1076 LI--LEKLTKNYEEDDSSIPDSNPAPNDNCEDFLE-NINQVTIQKFFIHVKILIGDFVLE 1132
L E + + + + IP ++ A N ++++ +IN+ ++ KI +
Sbjct: 924 LQDKFETMQIRNKSEITEIPTTSLAMRANESNYIKTSINKTA--GCYVETKISFNNENRI 981
Query: 1133 IPALFDTGADSSCISEGLIPTRYFEKTTEKLS--AAEGSRLKIKYKIPSAIIKNGSLEIE 1190
I AL D+G+ + I LIP + T ++ A + S+ + ++ I K E++
Sbjct: 982 ITALIDSGSTHNIICPTLIPASWINNTHREIIMFAVDNSKYNLNQELIDDI-KLQFQEVD 1040
Query: 1191 TPFLLVRNLSQKIIIGTPFIKKLFPYN--TDENG--------ITVQ-------------- 1226
F + L Q + P + + T+ENG IT+Q
Sbjct: 1041 ETFGIKYKLGQTYVAPKPTKTFIIGHRFLTNENGSVTIHKDYITIQKTTGIYPTARHELK 1100
Query: 1227 ------HLGQPILFKFSEPPIDKTLNVISYKEKQINFLKEEISYRTIEEQLQQPSVKSRI 1280
H G+P LF +K ++ SY+ + I K EI +++ +++ I
Sbjct: 1101 SEFARKHGGRPPLFSNIPETYNKIPHLHSYQPQPILGYKNEIGNQSLITMVKELEALGFI 1160
Query: 1281 -EDILENIQSSICSDLPNAFWERKSHMVELPYEKDFSDKQISTKARPIQMNEELLHFCQK 1339
+DI +N + +C D + +PY +DK++ +K
Sbjct: 1161 GDDITKNRTTWVC-DFKIINPDINITCATIPYTP--ADKEVF----------------EK 1201
Query: 1340 EINDLLEKKLIRRSKSPWS--CAAFYVNKQAEIERGTPRLVINYKPLNQALCWIRYPIPN 1397
+I +LL+ KLI+++ AAF V +E PR+V NYK LN + + IP+
Sbjct: 1202 QIKELLDNKLIKKADPTCRHRTAAFIVRNHSEEVAQKPRIVYNYKRLNDNMHTDPFNIPH 1261
Query: 1398 KKDLLARQHDAKVFSKFDMKSGFWQIQLQEKDKYKTAFTVPFGQYEWNVMPFGLKNAPSE 1457
K ++ A +FSKFD+K+GF ++L++ K T FT G Y WNV PFG+ NAP
Sbjct: 1262 KISMINLIQKANIFSKFDLKAGFHHMKLKDDFKDWTTFTCSEGLYTWNVCPFGIANAPCA 1321
Query: 1458 FQRIMNEIFNPYSKLTIVYIDDVLIFSQTLDQHFKHLNTFISVIKRNGLAVSKTKVSLFQ 1517
FQR M E F K ++YIDD+LI S +H +HL F + +K G +SK K +F
Sbjct: 1322 FQRFMQESFGDL-KFALLYIDDILIASNNEKEHIEHLKIFFNRVKEVGCVLSKKKSKMFL 1380
Query: 1518 TKIRFLGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQ-LQRFLGCLNYVADFCPQLSTII 1576
++ +LG I +G I ++ KF ++ + LQ +LG LNY + LS ++
Sbjct: 1381 KEVEYLGVEIKEGKISLQPHIVDKIKKFDKNKLNTLKGLQAYLGLLNYARGYIKDLSKLV 1440
Query: 1577 KPLHDRLKKDPPP*SDIHT-NVVKQIKLRVKNLPCLYLPNPQAFKIIETDASDIGFGGIL 1635
PL+ + K+ + N++ +I+ V + L P + IIETDAS+ G+G +L
Sbjct: 1441 GPLYKKTGKNGQRIFNKEDWNIIFKIEREVSKIKPLERPKETDYIIIETDASEEGWGAVL 1500
Query: 1636 -----KQKVFDKEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISKFQSDLINQKFLVCV 1690
K D E+I + S ++ ++ ++++ E+ AI +++KFQ +++ F +
Sbjct: 1501 VCKPDKYSGKDTEKIAGYASGNFG-EKKTWTSLDYEIEAINEALNKFQI-YLDKDFTIRT 1558
Query: 1691 DCKS 1694
DC++
Sbjct: 1559 DCEA 1562
Score = 43.9 bits (102), Expect = 0.004
Identities = 25/107 (23%), Positives = 53/107 (49%), Gaps = 2/107 (1%)
Query: 6 YLHIGSVQVGLKPLTRKSLDIAVLLCLRDVRHNQFHDSLLGTVETSLSNG-PIFFNCFPD 64
Y HIG + +G+K L R+ + V++ D + ++ +G++E ++ G +F++C PD
Sbjct: 112 YYHIGMMAIGVKGLHRRKIGTKVMIMFYDDSFGKAREASIGSIEMDMNAGCGVFYSC-PD 170
Query: 65 LTVSLEDKNILDVLFLNIKLHGLDMKEDSIPISLIYRVQYKVMNSIK 111
++D + L + + + K S+ I I R+ + + K
Sbjct: 171 FAKYIKDLSHLKIGIQTLGYENYEGKNLSVAIKTIGRLTTNIQSKYK 217
>RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protein
type 1
Length = 1333
Score = 201 bits (512), Expect = 1e-50
Identities = 173/674 (25%), Positives = 313/674 (45%), Gaps = 76/674 (11%)
Query: 1028 INKFNLRDTFKRLEKSTVKPVTIQDLHFEINSLKTEVKSLKQ-IQKSQQLILEKLTKNYE 1086
++K N+ D RL + P +++ L E N K+E+ + ++ + K K ++
Sbjct: 147 MSKANVDDFHTRLFILWMLPYSLRKLK-ERNYWKSEISEIYDFLEDKRTASYGKTHKRFQ 205
Query: 1087 EDDSSIPDSNPAPNDNCEDFLE----NINQV--TIQKFFIHVK--------ILIGDFVLE 1132
+ ++ + + +N + N++++ + KF H + + DF
Sbjct: 206 LQNKNLGKESLSKKNNTTNSRNLRKTNVSRIEYSSNKFLNHTRKRYEMVLQAELPDFKCS 265
Query: 1133 IPALFDTGADSSCISEGLI-----PTRYFEKTTEKLSAAEGSRLKIKYKIPSAIIKNGSL 1187
IP L DTGA ++ I+E + PTR + K+ KI K I +
Sbjct: 266 IPCLIDTGAQANIITEETVRAHKLPTRPWSKSVIYGGVYPN---KINRKTIKLNISLNGI 322
Query: 1188 EIETPFLLVRNLSQKIIIGTPFIKKLFPYNTDENGITVQHLGQPILFKFSEPPIDKTLNV 1247
I+T FL+V+ S I L+ N + I + +
Sbjct: 323 SIKTEFLVVKKFSHPAAIS---FTTLYDNNIE---------------------ISSSKHT 358
Query: 1248 ISYKEKQINFLKEEISYRTIEEQLQQPSVKSRIEDILENIQSSICSDLPNAFWERKSHMV 1307
+S K N +KE + P + +DI + LP +
Sbjct: 359 LSQMNKVSNIVKEP----------ELPDIYKEFKDITAETNTE---KLPKP-------IK 398
Query: 1308 ELPYEKDFSDKQISTKARPIQMNEELLHFCQKEINDLLEKKLIRRSKSPWSCAAFYVNKQ 1367
L +E + + + R + + EIN L+ +IR SK+ +C +V K+
Sbjct: 399 GLEFEVELTQENYRLPIRNYPLPPGKMQAMNDEINQGLKSGIIRESKAINACPVMFVPKK 458
Query: 1368 AEIERGTPRLVINYKPLNQALCWIRYPIPNKKDLLARQHDAKVFSKFDMKSGFWQIQLQE 1427
GT R+V++YKPLN+ + YP+P + LLA+ + +F+K D+KS + I++++
Sbjct: 459 ----EGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRK 514
Query: 1428 KDKYKTAFTVPFGQYEWNVMPFGLKNAPSEFQRIMNEIFNPYSKLTIV-YIDDVLIFSQT 1486
D++K AF P G +E+ VMP+G+ AP+ FQ +N I + +V Y+DD+LI S++
Sbjct: 515 GDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINTILGEAKESHVVCYMDDILIHSKS 574
Query: 1487 LDQHFKHLNTFISVIKRNGLAVSKTKVSLFQTKIRFLGHNIHQGTIIPINRAIEFTDKFP 1546
+H KH+ + +K L +++ K Q++++F+G++I + P I+ ++
Sbjct: 575 ESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQW- 633
Query: 1547 DQIIDKTQLQRFLGCLNYVADFCPQLSTIIKPLHDRLKKDPP-P*SDIHTNVVKQIKLRV 1605
Q ++ +L++FLG +NY+ F P+ S + PL++ LKKD + T ++ IK +
Sbjct: 634 KQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCL 693
Query: 1606 KNLPCLYLPNPQAFKIIETDASDIGFGGILKQK-VFDKEQIIAFTSKHWNPAQQNYSTVK 1664
+ P L + ++ETDASD+ G +L QK DK + + S + AQ NYS
Sbjct: 694 VSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSD 753
Query: 1665 KEVLAIVLSISKFQ 1678
KE+LAI+ S+ ++
Sbjct: 754 KEMLAIIKSLKHWR 767
>RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protein
type 2
Length = 1333
Score = 199 bits (507), Expect = 4e-50
Identities = 172/674 (25%), Positives = 313/674 (45%), Gaps = 76/674 (11%)
Query: 1028 INKFNLRDTFKRLEKSTVKPVTIQDLHFEINSLKTEVKSLKQ-IQKSQQLILEKLTKNYE 1086
++K N+ D RL + P +++ L E N K+E+ + ++ + K K ++
Sbjct: 147 MSKANVDDFHTRLFILWMLPYSLRKLK-ERNYWKSEISEIYDFLEDKRTASYGKTHKRFQ 205
Query: 1087 EDDSSIPDSNPAPNDNCEDFLE----NINQV--TIQKFFIHVK--------ILIGDFVLE 1132
+ ++ + + +N + N++++ + KF H + + DF
Sbjct: 206 PQNKNLGKESLSKKNNTTNSRNLRKTNVSRIEYSSNKFLNHTRKRYEMVLQAELPDFKCS 265
Query: 1133 IPALFDTGADSSCISEGLI-----PTRYFEKTTEKLSAAEGSRLKIKYKIPSAIIKNGSL 1187
IP L DTGA ++ I+E + PTR + K+ KI K I +
Sbjct: 266 IPCLIDTGAQANIITEETVRAHKLPTRPWSKSVIYGGVYPN---KINRKTIKLNISLNGI 322
Query: 1188 EIETPFLLVRNLSQKIIIGTPFIKKLFPYNTDENGITVQHLGQPILFKFSEPPIDKTLNV 1247
I+T FL+V+ S I L+ N + I + +
Sbjct: 323 SIKTEFLVVKKFSHPAAIS---FTTLYDNNIE---------------------ISSSKHT 358
Query: 1248 ISYKEKQINFLKEEISYRTIEEQLQQPSVKSRIEDILENIQSSICSDLPNAFWERKSHMV 1307
+S K N +KE + P + +DI + LP +
Sbjct: 359 LSQMNKVSNIVKEP----------ELPDIYKEFKDITAETNTE---KLPKP-------IK 398
Query: 1308 ELPYEKDFSDKQISTKARPIQMNEELLHFCQKEINDLLEKKLIRRSKSPWSCAAFYVNKQ 1367
L +E + + + R + + EIN L+ +IR SK+ +C +V K+
Sbjct: 399 GLEFEVELTQENYRLPIRNYPLPPGKMQAMNDEINQGLKSGIIRESKAINACPVMFVPKK 458
Query: 1368 AEIERGTPRLVINYKPLNQALCWIRYPIPNKKDLLARQHDAKVFSKFDMKSGFWQIQLQE 1427
GT R+V++YKPLN+ + YP+P + LLA+ + +F+K D+KS + I++++
Sbjct: 459 ----EGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRK 514
Query: 1428 KDKYKTAFTVPFGQYEWNVMPFGLKNAPSEFQRIMNEIFNPYSKLTIV-YIDDVLIFSQT 1486
D++K AF P G +E+ VMP+G+ AP+ FQ +N I + +V Y+D++LI S++
Sbjct: 515 GDEHKLAFRCPRGVFEYLVMPYGISIAPAHFQYFINTILGEVKESHVVCYMDNILIHSKS 574
Query: 1487 LDQHFKHLNTFISVIKRNGLAVSKTKVSLFQTKIRFLGHNIHQGTIIPINRAIEFTDKFP 1546
+H KH+ + +K L +++ K Q++++F+G++I + P I+ ++
Sbjct: 575 ESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQW- 633
Query: 1547 DQIIDKTQLQRFLGCLNYVADFCPQLSTIIKPLHDRLKKDPP-P*SDIHTNVVKQIKLRV 1605
Q ++ +L++FLG +NY+ F P+ S + PL++ LKKD + T ++ IK +
Sbjct: 634 KQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCL 693
Query: 1606 KNLPCLYLPNPQAFKIIETDASDIGFGGILKQK-VFDKEQIIAFTSKHWNPAQQNYSTVK 1664
+ P L + ++ETDASD+ G +L QK DK + + S + AQ NYS
Sbjct: 694 VSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSD 753
Query: 1665 KEVLAIVLSISKFQ 1678
KE+LAI+ S+ ++
Sbjct: 754 KEMLAIIKSLKHWR 767
>RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protein
type 3
Length = 1333
Score = 197 bits (500), Expect = 3e-49
Identities = 173/675 (25%), Positives = 307/675 (44%), Gaps = 78/675 (11%)
Query: 1028 INKFNLRDTFKRLEKSTVKPVTIQDLHFEINSLKTEVKSLKQ-IQKSQQLILEKLTKNYE 1086
++K N+ D RL + P +++ L E N K+E+ + ++ + K K ++
Sbjct: 147 MSKANVDDFHTRLFILWMLPYSLRKLK-ERNYWKSEISEIYDFLEDKRTASYGKTHKRFQ 205
Query: 1087 EDDSSIPDSNPAPNDNCEDFLENINQVTIQ-------KFFIHVK--------ILIGDFVL 1131
+ ++ P N N+ + I KF H + + DF
Sbjct: 206 PQNKNL-GKEFLPKKNNTTNSRNLRKTNISRIEYSSNKFLNHTRKRYEMVLQAELPDFKC 264
Query: 1132 EIPALFDTGADSSCISEGLI-----PTRYFEKTTEKLSAAEGSRLKIKYKIPSAIIKNGS 1186
IP L DTG ++ I+E + PTR + K+ KI K I
Sbjct: 265 SIPCLIDTGTQANIITEETVRAHKLPTRPWSKSVIYGGVYPN---KINRKTIKLNISLNG 321
Query: 1187 LEIETPFLLVRNLSQKIIIGTPFIKKLFPYNTDENGITVQHLGQPILFKFSEPPIDKTLN 1246
+ I+T FL+V+ S I L+ N + I + +
Sbjct: 322 ISIKTEFLVVKKFSHPAAIS---FTTLYDNNIE---------------------ISSSKH 357
Query: 1247 VISYKEKQINFLKEEISYRTIEEQLQQPSVKSRIEDILENIQSSICSDLPNAFWERKSHM 1306
+S K N +KE + P + +DI + LP +
Sbjct: 358 TLSQMNKVSNIVKEP----------ELPDIYKEFKDITAETNTE---KLPKP-------I 397
Query: 1307 VELPYEKDFSDKQISTKARPIQMNEELLHFCQKEINDLLEKKLIRRSKSPWSCAAFYVNK 1366
L +E + + + R + + EIN L+ +IR SK+ +C +V K
Sbjct: 398 KGLEFEVELTQENYRLPIRNYPLPPGKMQAMNDEINQGLKSGIIRESKAINACPVMFVPK 457
Query: 1367 QAEIERGTPRLVINYKPLNQALCWIRYPIPNKKDLLARQHDAKVFSKFDMKSGFWQIQLQ 1426
+ GT R+V++YKPLN+ + YP+P + LLA+ + +F+K D+KS + I+++
Sbjct: 458 K----EGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVR 513
Query: 1427 EKDKYKTAFTVPFGQYEWNVMPFGLKNAPSEFQRIMNEIFNPYSKLTIV-YIDDVLIFSQ 1485
+ D++K AF P G +E+ VMP+G+ AP+ FQ +N I + +V Y+D++LI S+
Sbjct: 514 KGDEHKLAFRCPRGVFEYLVMPYGISIAPAHFQYFINTILGEVKESHVVCYMDNILIHSK 573
Query: 1486 TLDQHFKHLNTFISVIKRNGLAVSKTKVSLFQTKIRFLGHNIHQGTIIPINRAIEFTDKF 1545
+ +H KH+ + +K L +++ K Q++++F+G++I + P I+ ++
Sbjct: 574 SESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQW 633
Query: 1546 PDQIIDKTQLQRFLGCLNYVADFCPQLSTIIKPLHDRLKKDPP-P*SDIHTNVVKQIKLR 1604
Q ++ +L++FLG +NY+ F P+ S + PL++ LKKD + T ++ IK
Sbjct: 634 -KQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQC 692
Query: 1605 VKNLPCLYLPNPQAFKIIETDASDIGFGGILKQK-VFDKEQIIAFTSKHWNPAQQNYSTV 1663
+ + P L + ++ETDASD+ G +L QK DK + + S + AQ NYS
Sbjct: 693 LVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVS 752
Query: 1664 KKEVLAIVLSISKFQ 1678
KE+LAI+ S+ ++
Sbjct: 753 DKEMLAIIKSLKHWR 767
>POL3_DROME (P04323) Retrovirus-related Pol polyprotein from
transposon 17.6 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1058
Score = 191 bits (485), Expect = 1e-47
Identities = 116/358 (32%), Positives = 190/358 (52%), Gaps = 9/358 (2%)
Query: 1338 QKEINDLLEKKLIRRSKSPWSCAAFYVNKQAEIE-RGTPRLVINYKPLNQALCWIRYPIP 1396
+ +I D+L + +IR S SP++ + V K+ + + R+VI+Y+ LN+ R+PIP
Sbjct: 224 ESQIQDMLNQGIIRTSNSPYNSPIWVVPKKQDASGKQKFRIVIDYRKLNEITVGDRHPIP 283
Query: 1397 NKKDLLARQHDAKVFSKFDMKSGFWQIQLQEKDKYKTAFTVPFGQYEWNVMPFGLKNAPS 1456
N ++L + F+ D+ GF QI++ + KTAF+ G YE+ MPFGLKNAP+
Sbjct: 284 NMDEILGKLGRCNYFTTIDLAKGFHQIEMDPESVSKTAFSTKHGHYEYLRMPFGLKNAPA 343
Query: 1457 EFQRIMNEIFNP-YSKLTIVYIDDVLIFSQTLDQHFKHLNTFISVIKRNGLAVSKTKVSL 1515
FQR MN+I P +K +VY+DD+++FS +LD+H + L + + L + K
Sbjct: 344 TFQRCMNDILRPLLNKHCLVYLDDIIVFSTSLDEHLQSLGLVFEKLAKANLKLQLDKCEF 403
Query: 1516 FQTKIRFLGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTI 1575
+ + FLGH + I P IE K+P K +++ FLG Y F P + I
Sbjct: 404 LKQETTFLGHVLTPDGIKPNPEKIEAIQKYPIPTKPK-EIKAFLGLTGYYRKFIPNFADI 462
Query: 1576 IKPLHDRLKKDP--PP*SDIHTNVVKQIKLRVKNLPCLYLPNPQAFKIIETDASDIGFGG 1633
KP+ LKK+ + + + K++K + P L +P+ + TDASD+ G
Sbjct: 463 AKPMTKCLKKNMKIDTTNPEYDSAFKKLKYLISEDPILKVPDFTKKFTLTTDASDVALGA 522
Query: 1634 ILKQKVFDKEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISKFQSDLINQKFLVCVD 1691
+L Q +++ S+ N + NYST++KE+LAIV + F+ L+ + F + D
Sbjct: 523 VLSQ----DGHPLSYISRTLNEHEINYSTIEKELLAIVWATKTFRHYLLGRHFEISSD 576
>POL2_DROME (P20825) Retrovirus-related Pol polyprotein from
transposon 297 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1059
Score = 189 bits (479), Expect = 7e-47
Identities = 118/376 (31%), Positives = 196/376 (51%), Gaps = 11/376 (2%)
Query: 1320 ISTKARPIQMNEELLHFCQKEINDLLEKKLIRRSKSPWSCAAFYVNKQAEIERGTP-RLV 1378
I +K P+ E+ + ++ ++L + LIR S SP++ + V K+ + R+V
Sbjct: 207 IYSKQYPLAQTHEIE--VENQVQEMLNQGLIRESNSPYNSPTWVVPKKPDASGANKYRVV 264
Query: 1379 INYKPLNQALCWIRYPIPNKKDLLARQHDAKVFSKFDMKSGFWQIQLQEKDKYKTAFTVP 1438
I+Y+ LN+ RYPIPN ++L + + F+ D+ GF QI++ E+ KTAF+
Sbjct: 265 IDYRKLNEITIPDRYPIPNMDEILGKLGKCQYFTTIDLAKGFHQIEMDEESISKTAFSTK 324
Query: 1439 FGQYEWNVMPFGLKNAPSEFQRIMNEIFNP-YSKLTIVYIDDVLIFSQTLDQHFKHLNTF 1497
G YE+ MPFGL+NAP+ FQR MN I P +K +VY+DD++IFS +L +H +
Sbjct: 325 SGHYEYLRMPFGLRNAPATFQRCMNNILRPLLNKHCLVYLDDIIIFSTSLTEHLNSIQLV 384
Query: 1498 ISVIKRNGLAVSKTKVSLFQTKIRFLGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQR 1557
+ + L + K + + FLGH + I P ++ +P DK +++
Sbjct: 385 FTKLADANLKLQLDKCEFLKKEANFLGHIVTPDGIKPNPIKVKAIVSYPIPTKDK-EIRA 443
Query: 1558 FLGCLNYVADFCPQLSTIIKPLHDRLKKDPPP*SD--IHTNVVKQIKLRVKNLPCLYLPN 1615
FLG Y F P + I KP+ LKK + + +++K + P L LP+
Sbjct: 444 FLGLTGYYRKFIPNYADIAKPMTSCLKKRTKIDTQKLEYIEAFEKLKALIIRDPILQLPD 503
Query: 1616 PQAFKIIETDASDIGFGGILKQKVFDKEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSIS 1675
+ ++ TDAS++ G +L Q I+F S+ N + NYS ++KE+LAIV +
Sbjct: 504 FEKKFVLTTDASNLALGAVLSQ----NGHPISFISRTLNDHELNYSAIEKELLAIVWATK 559
Query: 1676 KFQSDLINQKFLVCVD 1691
F+ L+ ++FL+ D
Sbjct: 560 TFRHYLLGRQFLIASD 575
>POLY_DROME (P10401) Retrovirus-related Pol polyprotein from
transposon gypsy [Contains: Reverse transcriptase (EC
2.7.7.49); Endonuclease]
Length = 1035
Score = 172 bits (436), Expect = 7e-42
Identities = 130/473 (27%), Positives = 238/473 (49%), Gaps = 42/473 (8%)
Query: 1245 LNVISYKEKQINFLKEEISYRTIEEQLQQ---PSVK-SRIEDILENIQSSICSDLPNAFW 1300
L++++ ++N ++ + Y+ I E+L PSV + + DI+ + S+ + +
Sbjct: 94 LDLLTQAGVKLNLAEDSLEYQGIAEKLHYFSCPSVNFTDVNDIV--VPDSVKKEFKDTII 151
Query: 1301 ERKSHMVE----LPYE-------KDFSDKQISTKARPIQMNEELLHFCQKEINDLLEKKL 1349
RK LP+ + ++ + ++A P M + F E+ LL+ +
Sbjct: 152 RRKKAFSTTNEALPFNTAVTATIRTVDNEPVYSRAYPTLMG--VSDFVNNEVKQLLKDGI 209
Query: 1350 IRRSKSPWSCAAFYVNKQAEIERGTP--RLVINYKPLNQALCWIRYPIPNKKDLLARQHD 1407
IR S+SP++ + V+K+ G P RLVI+++ LN+ RYP+P+ +LA
Sbjct: 210 IRPSRSPYNSPTWVVDKKGTDAFGNPNKRLVIDFRKLNEKTIPDRYPMPSIPMILANLGK 269
Query: 1408 AKVFSKFDMKSGFWQIQLQEKDKYKTAFTVPFGQYEWNVMPFGLKNAPSEFQRIMNEIF- 1466
AK F+ D+KSG+ QI L E D+ KT+F+V G+YE+ +PFGL+NA S FQR ++++
Sbjct: 270 AKFFTTLDLKSGYHQIYLAEHDREKTSFSVNGGKYEFCRLPFGLRNASSIFQRALDDVLR 329
Query: 1467 NPYSKLTIVYIDDVLIFSQTLDQHFKHLNTFISVIKRNGLAVSKTKVSLFQTKIRFLGHN 1526
K+ VY+DDV+IFS+ H +H++T + + + VS+ K F+ + +LG
Sbjct: 330 EQIGKICYVYVDDVIIFSENESDHVRHIDTVLKCLIDANMRVSQEKTRFFKESVEYLGFI 389
Query: 1527 IHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFCPQLSTIIKPLHDRL--- 1583
+ + ++ ++P+ +++ FLG +Y F + I +P+ D L
Sbjct: 390 VSKDGTKSDPEKVKAIQEYPEPDC-VYKVRSFLGLASYYRVFIKDFAAIARPITDILKGE 448
Query: 1584 ---------KKDPPP*SDIHTNVVKQIK--LRVKNLPCLYLPNPQAFKIIETDASDIGFG 1632
KK P ++ N ++++ L +++ Y + F + TDAS G G
Sbjct: 449 NGSVSKHMSKKIPVEFNETQRNAFQRLRNILASEDVILKYPDFKKPFD-LTTDASASGIG 507
Query: 1633 GILKQKVFDKEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISKFQSDLINQK 1685
+L Q + + I S+ +QNY+T ++E+LAIV ++ K Q+ L +
Sbjct: 508 AVLSQ----EGRPITMISRTLKQPEQNYATNERELLAIVWALGKLQNFLYGSR 556
>POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from
transposon opus [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1003
Score = 170 bits (430), Expect = 4e-41
Identities = 123/426 (28%), Positives = 215/426 (49%), Gaps = 21/426 (4%)
Query: 1274 PSVKSRIEDILENIQSSICSDLPNAFWERKSHM-VELPYEKDF---SDKQISTKARPIQM 1329
P + + D + I +S+ + P F S M VE + + + I K+ P +
Sbjct: 74 PLLAAEHPDGTQEILNSLLGEFPRIFEPPLSGMSVETAVKAEIRTNTQDPIYAKSYPYPV 133
Query: 1330 NEELLHFCQKEINDLLEKKLIRRSKSPWSCAAFYVNKQAEIERGTP-RLVINYKPLNQAL 1388
N + +++I++LL+ +IR S SP++ + V K+ + R+V+++K LN
Sbjct: 134 N--MRGEVERQIDELLQDGIIRPSNSPYNSPIWIVPKKPKPNGEKQYRMVVDFKRLNTVT 191
Query: 1389 CWIRYPIPNKKDLLARQHDAKVFSKFDMKSGFWQIQLQEKDKYKTAFTVPFGQYEWNVMP 1448
YPIP+ LA +AK F+ D+ SGF QI ++E D KTAF+ G+YE+ +P
Sbjct: 192 IPDTYPIPDINATLASLGNAKYFTTLDLTSGFHQIHMKESDIPKTAFSTLNGKYEFLRLP 251
Query: 1449 FGLKNAPSEFQRIMNEIFNPY-SKLTIVYIDDVLIFSQTLDQHFKHLNTFISVIKRNGLA 1507
FGLKNAP+ FQR++++I + K+ VYIDD+++FS+ D H+K+L ++ + + L
Sbjct: 252 FGLKNAPAIFQRMIDDILREHIGKVCYVYIDDIIVFSEDYDTHWKNLRLVLASLSKANLQ 311
Query: 1508 VSKTKVSLFQTKIRFLGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVAD 1567
V+ K T++ FLG+ + I + + + P K +L+RFLG +Y
Sbjct: 312 VNLEKSHFLDTQVEFLGYIVTADGIKADPKKVRAISEMPPPTSVK-ELKRFLGMTSYYRK 370
Query: 1568 FCPQLSTIIKPLHDRLK------------KDPPP*SDIHTNVVKQIKLRVKNLPCLYLPN 1615
F + + KPL + + K P + +K + + L P
Sbjct: 371 FIQDYAKVAKPLTNLTRGLYANIKSSQSSKVPITLDETALQSFNDLKSILCSSEILAFPC 430
Query: 1616 PQAFKIIETDASDIGFGGILKQKVFDKEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSIS 1675
+ TDAS+ G +L Q +++ IA+ S+ N ++NY+T++KE+LAI+ S+
Sbjct: 431 FTKPFHLTTDASNWAIGAVLSQDDQGRDRPIAYISRSLNKTEENYATIEKEMLAIIWSLD 490
Query: 1676 KFQSDL 1681
++ L
Sbjct: 491 NLRAYL 496
>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
Length = 2186
Score = 168 bits (425), Expect = 1e-40
Identities = 146/499 (29%), Positives = 236/499 (47%), Gaps = 37/499 (7%)
Query: 1217 NTDENGITVQH-LGQPILFKFSEPPIDKTLNVISYKEKQINFLKEEISYRTIEEQLQQPS 1275
N E ITV+ L P LF E + + V+ E F TI E L++ +
Sbjct: 849 NKAEQDITVEEVLNDPTLFSEIETDTN-SCEVVKTAETYERFT-------TICEHLKREN 900
Query: 1276 VKSR-IEDILENIQS--SICSDLPNAFWERKSHMVELPYEKDFSDKQISTKARPIQMNEE 1332
R I D++E Q +I D ++ E E + I K RPI +
Sbjct: 901 GDDRKIWDVIEQFQDVFAISDDELG-----RNSGTECVIELKEGAEPIRQKPRPIPL--A 953
Query: 1333 LLHFCQKEINDLLEKKLIRRSKSPWSCAAFYVNKQAEIERGTPRLVINYKPLNQALCWIR 1392
L +K I +L +K+IR SKSPWS V K+ G+ R+ I+Y+ +N+ +
Sbjct: 954 LKPEIRKMIQKMLNQKVIRESKSPWSSPVVLVKKKD----GSIRMCIDYRKVNKVVKNNA 1009
Query: 1393 YPIPNKKDLLARQHDAKVFSKFDMKSGFWQIQLQEKDKYKTAFTVPFGQYEWNVMPFGLK 1452
+P+PN + L K+++ FDM +GFWQI L EK K TAF + +EWNV+PFGL
Sbjct: 1010 HPLPNIEATLQSLAGKKLYTVFDMIAGFWQIPLDEKSKEITAFAIGSELFEWNVLPFGLV 1069
Query: 1453 NAPSEFQRIMNEIFNP-YSKLTIVYIDDVLIFSQTLDQHFKHLNTFISVIKRNGLAVSKT 1511
+P+ FQ M EI VY+DD+LI S+ ++QH + + ++ I+++G+ + +
Sbjct: 1070 ISPALFQGTMEEIIGDLLGVCAFVYVDDLLIASKDMEQHLQDVKEALTRIRKSGMKLRAS 1129
Query: 1512 KVSLFQTKIRFLGHNIHQGTIIPINRAIEFTDKFP--DQIIDKTQLQRFLGCLNYVADFC 1569
K + + ++ +LGH + T+ + TDK + + +LQ FLG + Y F
Sbjct: 1130 KCHIAKKEVEYLGHKV---TLDGVETQEVKTDKMKQFSRPTNVKELQSFLGLVGYYRKFI 1186
Query: 1570 PQLSTIIKPLHDRLKKDPPP*SDIHTNVV-KQIKLRVKNLPCLYLPNPQAFK------II 1622
+ I L + + + +++K V P L P+ +A +I
Sbjct: 1187 LNFAQIASSLTSLISAKVAWIWEKEQEIAFQELKKLVCQTPVLAQPDVEAALKGDRPFMI 1246
Query: 1623 ETDASDIGFGGILKQKVFDKEQ-IIAFTSKHWNPAQQNYSTVKKEVLAIVLSISKFQSDL 1681
TDAS G G +L Q+ D +Q IAF SK +PA+ Y E LA++ ++ +F++ +
Sbjct: 1247 YTDASRKGIGAVLAQEGPDGQQHPIAFASKALSPAETRYHITDLEALAMMFALRRFKTII 1306
Query: 1682 INQKFLVCVDCKSAKEILQ 1700
V D K +L+
Sbjct: 1307 YGTAITVFTDHKPLISLLK 1325
>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
transposon 412 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1237
Score = 138 bits (348), Expect = 1e-31
Identities = 111/426 (26%), Positives = 197/426 (46%), Gaps = 15/426 (3%)
Query: 1276 VKSRIEDILENIQSSICSDLPNAF-WERKSHMVELPYEKDF---SDKQISTK--ARPIQM 1329
+K ++ ++ +ICS+ + F E + V Y++ D+ + TK P
Sbjct: 267 LKKNFPELFKSQLENICSEYIDIFALESEPITVNNLYKQQLRLKDDEPVYTKNYRSPHSQ 326
Query: 1330 NEELLHFCQKEINDLLEKKLIRRSKSPWSCAAFYVNKQAE--IERGTPRLVINYKPLNQA 1387
EE+ Q ++ L++ K++ S S ++ V K++ ++ RLVI+Y+ +N+
Sbjct: 327 VEEI----QAQVQKLIKDKIVEPSVSQYNSPLLLVPKKSSPNSDKKKWRLVIDYRQINKK 382
Query: 1388 LCWIRYPIPNKKDLLARQHDAKVFSKFDMKSGFWQIQLQEKDKYKTAFTVPFGQYEWNVM 1447
L ++P+P D+L + AK FS D+ SGF QI+L E + T+F+ G Y + +
Sbjct: 383 LLADKFPLPRIDDILDQLGRAKYFSCLDLMSGFHQIELDEGSRDITSFSTSNGSYRFTRL 442
Query: 1448 PFGLKNAPSEFQRIMNEIFNPYS-KLTIVYIDDVLIFSQTLDQHFKHLNTFISVIKRNGL 1506
PFGLK AP+ FQR+M F+ +Y+DD+++ + K+L + L
Sbjct: 443 PFGLKIAPNSFQRMMTIAFSGIEPSQAFLYMDDLIVIGCSEKHMLKNLTEVFGKCREYNL 502
Query: 1507 AVSKTKVSLFQTKIRFLGHNIHQGTIIPINRAIEFTDKFPDQIIDKTQLQRFLGCLNYVA 1566
+ K S F ++ FLGH I+P ++ + +P D +RF+ NY
Sbjct: 503 KLHPEKCSFFMHEVTFLGHKCTDKGILPDDKKYDVIQNYPVP-HDADSARRFVAFCNYYR 561
Query: 1567 DFCPQLSTIIKPLHDRLKKDPP-P*SDIHTNVVKQIKLRVKNLPCLYLPNPQAFKIIETD 1625
F + + + KK+ P +D +K ++ N L P+ I TD
Sbjct: 562 RFIKNFADYSRHITRLCKKNVPFEWTDECQKAFIHLKSQLINPTLLQYPDFSKEFCITTD 621
Query: 1626 ASDIGFGGILKQKVFDKEQIIAFTSKHWNPAQQNYSTVKKEVLAIVLSISKFQSDLINQK 1685
AS G +L Q + +A+ S+ + + N ST ++E+ AI +I F+ + +
Sbjct: 622 ASKQACGAVLTQNHNGHQLPVAYASRAFTKGESNKSTTEQELAAIHWAIIHFRPYIYGKH 681
Query: 1686 FLVCVD 1691
F V D
Sbjct: 682 FTVKTD 687
>POL_SFV1 (P23074) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1161
Score = 97.1 bits (240), Expect = 4e-19
Identities = 78/322 (24%), Positives = 142/322 (43%), Gaps = 26/322 (8%)
Query: 1268 EEQLQQPSVKSRIEDILENIQSSICSDLPNAFWERKSHMVELPYEKDFSDKQISTKARPI 1327
E LQQ ++ +++L+ + +A W+ + V K + + RP
Sbjct: 123 ERLLQQTALPKEQKELLQKLFLKY-----DALWQHWENQVGHRRIKPHNIATGTLAPRPQ 177
Query: 1328 Q---MNEELLHFCQKEINDLLEKKLIRRSKSPWSCAAFYVNKQAEIERGTPRLVINYKPL 1384
+ +N + Q I+DLL++ ++ + S + + V K G R+V++Y+ +
Sbjct: 178 KQYPINPKAKPSIQIVIDDLLKQGVLIQQNSTMNTPVYPVPKPD----GKWRMVLDYREV 233
Query: 1385 NQALCWIRYPIPNKKDLLARQHDAKVFSKFDMKSGFWQIQLQEKDKYKTAFTVPFGQYEW 1444
N+ + I + +L+ + K + D+ +GFW + + + TAFT QY W
Sbjct: 234 NKTIPLIAAQNQHSAGILSSIYRGKYKTTLDLTNGFWAHPITPESYWLTAFTWQGKQYCW 293
Query: 1445 NVMPFGLKNAPSEFQRIMNEIFNPYSKLTIVYIDDVLIFSQTLDQHFKHLNTFISVIKRN 1504
+P G N+P+ F + ++ + Y+DD+ I +H + L S++
Sbjct: 294 TRLPQGFLNSPALFTADVVDLLKEIPNVQ-AYVDDIYISHDDPQEHLEQLEKIFSILLNA 352
Query: 1505 GLAVSKTKVSLFQTKIRFLGHNIHQGTIIPINRAIEFTDKFPDQII------DKTQLQRF 1558
G VS K + Q ++ FLG NI TD F +++ D QLQ
Sbjct: 353 GYVVSLKKSEIAQREVEFLGFNI-------TKEGRGLTDTFKQKLLNITPPKDLKQLQSI 405
Query: 1559 LGCLNYVADFCPQLSTIIKPLH 1580
LG LN+ +F P S ++KPL+
Sbjct: 406 LGLLNFARNFIPNYSELVKPLY 427
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.316 0.134 0.389
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 206,823,710
Number of Sequences: 164201
Number of extensions: 9477616
Number of successful extensions: 42479
Number of sequences better than 10.0: 431
Number of HSP's better than 10.0 without gapping: 142
Number of HSP's successfully gapped in prelim test: 298
Number of HSP's that attempted gapping in prelim test: 39796
Number of HSP's gapped (non-prelim): 2213
length of query: 1703
length of database: 59,974,054
effective HSP length: 124
effective length of query: 1579
effective length of database: 39,613,130
effective search space: 62549132270
effective search space used: 62549132270
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 73 (32.7 bits)
Lotus: description of TM0019a.7