
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC148289.13 - phase: 0
(2306 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
POL3_DROME (P04323) Retrovirus-related Pol polyprotein from tran... 230 4e-59
POL2_DROME (P20825) Retrovirus-related Pol polyprotein from tran... 220 3e-56
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran... 213 7e-54
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III 208 2e-52
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei... 201 3e-50
RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protei... 198 1e-49
RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protei... 198 1e-49
POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from tran... 196 6e-49
POLY_DROME (P10401) Retrovirus-related Pol polyprotein from tran... 183 6e-45
POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic pro... 130 4e-29
POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic pro... 129 9e-29
POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic pro... 129 1e-28
POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic pro... 125 1e-27
POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic prot... 125 1e-27
POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic pro... 124 4e-27
YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II 121 2e-26
POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic prot... 118 2e-25
POL_FOAMV (P14350) Pol polyprotein [Contains: Reverse transcript... 117 3e-25
POL_GALV (P21414) Pol polyprotein [Contains: Protease (EC 3.4.23... 115 2e-24
POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;... 108 2e-22
>POL3_DROME (P04323) Retrovirus-related Pol polyprotein from
transposon 17.6 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1058
Score = 230 bits (586), Expect = 4e-59
Identities = 147/485 (30%), Positives = 252/485 (51%), Gaps = 36/485 (7%)
Query: 1227 LETSVKKQVIELLKEYVDV-------FAWSYQDMPGLDTDIVVHHLPL--KPECPPVKQK 1277
L K+++ LL++Y D+ ++ Q ++T H+LPL K P ++
Sbjct: 165 LNNEEKQRLCALLQKYHDIQYHEGDKLTFTNQTKHTINTK---HNLPLYSKYSYPQAYEQ 221
Query: 1278 LRRTRPDMALKIKEEVQKQIDAGFLITSNYPQWLANIVPVPKKDG-----KVRMCVDYRD 1332
+++ ++Q ++ G + TSN P + + I VPKK K R+ +DYR
Sbjct: 222 ----------EVESQIQDMLNQGIIRTSNSP-YNSPIWVVPKKQDASGKQKFRIVIDYRK 270
Query: 1333 LNKASPKDDFPLPHIDVLVDSTAKSKVFSFMDGFSGYNQIKMAPEDREKTSFITPWGTFC 1392
LN+ + D P+P++D ++ + F+ +D G++QI+M PE KT+F T G +
Sbjct: 271 LNEITVGDRHPIPNMDEILGKLGRCNYFTTIDLAKGFHQIEMDPESVSKTAFSTKHGHYE 330
Query: 1393 YKVMPFGLINAGATYQRGMTTLFHDMIHKEIEVYVDDMIVKSVTEEDHVKYLQKMFQRLR 1452
Y MPFGL NA AT+QR M + +++K VY+DD+IV S + ++H++ L +F++L
Sbjct: 331 YLRMPFGLKNAPATFQRCMNDILRPLLNKHCLVYLDDIIVFSTSLDEHLQSLGLVFEKLA 390
Query: 1453 KYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVKAIREMPAPRTEKEVRGFLGRLN 1512
K L+L +KC F + LG +++ GI+ +P+K++AI++ P P KE++ FLG
Sbjct: 391 KANLKLQLDKCEFLKQETTFLGHVLTPDGIKPNPEKIEAIQKYPIPTKPKEIKAFLGLTG 450
Query: 1513 YISRFISHMTATCGPIFKLLRKEQGIVWTE-DCQKAFDSIKKYLLEPPILIPPVEGRPLI 1571
Y +FI + P+ K L+K I T + AF +K + E PIL P +
Sbjct: 451 YYRKFIPNFADIAKPMTKCLKKNMKIDTTNPEYDSAFKKLKYLISEDPILKVPDFTKKFT 510
Query: 1572 MYLTVLENSMGCVLGQQDETGRKEHAIYYLSKKFTECESRYSMLEKTCCALAWAAKRLRH 1631
+ + ++G VL Q H + Y+S+ E E YS +EK A+ WA K RH
Sbjct: 511 LTTDASDVALGAVLSQDG------HPLSYISRTLNEHEINYSTIEKELLAIVWATKTFRH 564
Query: 1632 YMINHTTWLVSKMDPIKYIFEKPALTGRIARWQMLLSEYDIECRSQKAIKGSILADHLAH 1691
Y++ + S P+ +++ ++ RW++ LSE+D + + K K + +AD L+
Sbjct: 565 YLLGRHFEISSDHQPLSWLYRMKDPNSKLTRWRVKLSEFDFDIKYIKG-KENCVADALSR 623
Query: 1692 QPLED 1696
LE+
Sbjct: 624 IKLEE 628
>POL2_DROME (P20825) Retrovirus-related Pol polyprotein from
transposon 297 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1059
Score = 220 bits (561), Expect = 3e-56
Identities = 131/419 (31%), Positives = 217/419 (51%), Gaps = 20/419 (4%)
Query: 1287 LKIKEEVQKQIDAGFLITSNYPQ----WLANIVPVPKKDGKVRMCVDYRDLNKASPKDDF 1342
++++ +VQ+ ++ G + SN P W+ P K R+ +DYR LN+ + D +
Sbjct: 220 IEVENQVQEMLNQGLIRESNSPYNSPTWVVPKKPDASGANKYRVVIDYRKLNEITIPDRY 279
Query: 1343 PLPHIDVLVDSTAKSKVFSFMDGFSGYNQIKMAPEDREKTSFITPWGTFCYKVMPFGLIN 1402
P+P++D ++ K + F+ +D G++QI+M E KT+F T G + Y MPFGL N
Sbjct: 280 PIPNMDEILGKLGKCQYFTTIDLAKGFHQIEMDEESISKTAFSTKSGHYEYLRMPFGLRN 339
Query: 1403 AGATYQRGMTTLFHDMIHKEIEVYVDDMIVKSVTEEDHVKYLQKMFQRLRKYKLRLNPNK 1462
A AT+QR M + +++K VY+DD+I+ S + +H+ +Q +F +L L+L +K
Sbjct: 340 APATFQRCMNNILRPLLNKHCLVYLDDIIIFSTSLTEHLNSIQLVFTKLADANLKLQLDK 399
Query: 1463 CTFGVRSGKLLGFIVSQKGIEVDPDKVKAIREMPAPRTEKEVRGFLGRLNYISRFISHMT 1522
C F + LG IV+ GI+ +P KVKAI P P +KE+R FLG Y +FI +
Sbjct: 400 CEFLKKEANFLGHIVTPDGIKPNPIKVKAIVSYPIPTKDKEIRAFLGLTGYYRKFIPNYA 459
Query: 1523 ATCGPIFKLLRKEQGIVWTEDCQK-----AFDSIKKYLLEPPILIPPVEGRPLIMYLTVL 1577
P+ L+K I D QK AF+ +K ++ PIL P + ++
Sbjct: 460 DIAKPMTSCLKKRTKI----DTQKLEYIEAFEKLKALIIRDPILQLPDFEKKFVLTTDAS 515
Query: 1578 ENSMGCVLGQQDETGRKEHAIYYLSKKFTECESRYSMLEKTCCALAWAAKRLRHYMINHT 1637
++G VL Q H I ++S+ + E YS +EK A+ WA K RHY++
Sbjct: 516 NLALGAVLSQNG------HPISFISRTLNDHELNYSAIEKELLAIVWATKTFRHYLLGRQ 569
Query: 1638 TWLVSKMDPIKYIFEKPALTGRIARWQMLLSEYDIECRSQKAIKGSILADHLAHQPLED 1696
+ S P++++ ++ RW++ LSEY + K + S+ AD L+ +E+
Sbjct: 570 FLIASDHQPLRWLHNLKEPGAKLERWRVRLSEYQFKIDYIKGKENSV-ADALSRIKIEE 627
>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
transposon 412 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1237
Score = 213 bits (541), Expect = 7e-54
Identities = 144/475 (30%), Positives = 238/475 (49%), Gaps = 13/475 (2%)
Query: 1232 KKQVIELLKEYVDVFAWSYQDMPGLDTDIVVHHLPLKPECPPVKQKLRRTRPDMALKIKE 1291
K Q+ + EY+D+FA + P ++ L LK + PV K R+ +I+
Sbjct: 276 KSQLENICSEYIDIFA--LESEPITVNNLYKQQLRLKDD-EPVYTKNYRSPHSQVEEIQA 332
Query: 1292 EVQKQIDAGFLITSNYPQWLANIVPVPKKDG------KVRMCVDYRDLNKASPKDDFPLP 1345
+VQK I ++ + Q+ + ++ VPKK K R+ +DYR +NK D FPLP
Sbjct: 333 QVQKLIKDK-IVEPSVSQYNSPLLLVPKKSSPNSDKKKWRLVIDYRQINKKLLADKFPLP 391
Query: 1346 HIDVLVDSTAKSKVFSFMDGFSGYNQIKMAPEDREKTSFITPWGTFCYKVMPFGLINAGA 1405
ID ++D ++K FS +D SG++QI++ R+ TSF T G++ + +PFGL A
Sbjct: 392 RIDDILDQLGRAKYFSCLDLMSGFHQIELDEGSRDITSFSTSNGSYRFTRLPFGLKIAPN 451
Query: 1406 TYQRGMTTLFHDMIHKEIEVYVDDMIVKSVTEEDHVKYLQKMFQRLRKYKLRLNPNKCTF 1465
++QR MT F + + +Y+DD+IV +E+ +K L ++F + R+Y L+L+P KC+F
Sbjct: 452 SFQRMMTIAFSGIEPSQAFLYMDDLIVIGCSEKHMLKNLTEVFGKCREYNLKLHPEKCSF 511
Query: 1466 GVRSGKLLGFIVSQKGIEVDPDKVKAIREMPAPRTEKEVRGFLGRLNYISRFISHMTATC 1525
+ LG + KGI D K I+ P P R F+ NY RFI +
Sbjct: 512 FMHEVTFLGHKCTDKGILPDDKKYDVIQNYPVPHDADSARRFVAFCNYYRRFIKNFADYS 571
Query: 1526 GPIFKLLRKEQGIVWTEDCQKAFDSIKKYLLEPPILIPPVEGRPLIMYLTVLENSMGCVL 1585
I +L +K WT++CQKAF +K L+ P +L P + + + + G VL
Sbjct: 572 RHITRLCKKNVPFEWTDECQKAFIHLKSQLINPTLLQYPDFSKEFCITTDASKQACGAVL 631
Query: 1586 GQQDETGRKEHAIYYLSKKFTECESRYSMLEKTCCALAWAAKRLRHYMINHTTWLVSKMD 1645
Q+ G + + Y S+ FT+ ES S E+ A+ WA R Y+ + +
Sbjct: 632 -TQNHNGH-QLPVAYASRAFTKGESNKSTTEQELAAIHWAIIHFRPYIYGKHFTVKTDHR 689
Query: 1646 PIKYIFEKPALTGRIARWQMLLSEYDIECRSQKAIKGSILADHLAHQPLEDYRPI 1700
P+ Y+F + ++ R ++ L EY+ K K + +AD L+ +++ + I
Sbjct: 690 PLTYLFSMVNPSSKLTRIRLELEEYNFTVEYLKG-KDNHVADALSRITIKELKDI 743
Score = 117 bits (292), Expect = 5e-25
Identities = 107/427 (25%), Positives = 188/427 (43%), Gaps = 36/427 (8%)
Query: 1895 DIKVFLQTREYPPGASNKD-------KKTLRRLSSNFFLNGDILYKRNFDTVLLRCV--- 1944
D+ FLQ E G + KK +S + F N +N LL V
Sbjct: 828 DLDQFLQRLELQAGIYDISQIKMAPWKKIFEHVSIDKFKNMGNKILKNLKVALLNPVTQI 887
Query: 1945 -DKYEADLLIHEIHEGSFGIHPNGHTMAKKIL---RAGYYWMTMESDCYKHTRKCHKCQI 2000
++ E + ++ +H+ GHT K L + YYW M ++ RKC KCQ
Sbjct: 888 NNEKEKEAILSTLHDDPI---QGGHTGITKTLAKVKRHYYWKNMSKYIKEYVRKCQKCQK 944
Query: 2001 YADKIHMPPTTLNLLSSP-WPFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASY 2059
H T + + +P F +D IG + PK+ NG+ + + I TK++ A
Sbjct: 945 AKTTKHTK-TPMTITETPEHAFDRVVVDTIGPL-PKSENGNEYAVTLICDLTKYLVAIPI 1002
Query: 2060 ANVTKQVVVKFIKNHIICRYGIPNRIITDNGTNLNNKMMKELCDDFKIEHHNSSPYRPQM 2119
AN + + V K I I +YG ITD GT N ++ +LC KI++ S+ + Q
Sbjct: 1003 ANKSAKTVAKAIFESFILKYGPMKTFITDMGTEYKNSIITDLCKYLKIKNITSTAHHHQT 1062
Query: 2120 NGAVEAANKNIKRIVQKMVVTYK-DWHEMLPFALHGYRTSVRTSTGATPFSLVYGMEAVL 2178
G VE +++ + ++ + T K DW L + ++ + T+ P+ LV+G + L
Sbjct: 1063 VGVVERSHRTLNEYIRSYISTDKTDWDVWLQYFVYCFNTTQSMVHNYCPYELVFGRTSNL 1122
Query: 2179 PVEV-EIPSLRVLMEADLSEAEWVQNRYDQLNLIEEKRMTALCHGQLYQKRMKQAFDKKV 2237
P ++ S+ + D ++ + +L + + L + ++++ K+ +D KV
Sbjct: 1123 PKHFNKLHSIEPIYNID----DYAKESKYRLEVAYARARKLL---EAHKEKNKENYDLKV 1175
Query: 2238 RPREFKEGDLVLKKIFSFQPDSRGKWAPNYEGPYVVKRAFSGGAMTLQTMDGEELPRPVN 2297
+ E + GD VL + + K Y GPY ++ +TL T ++ + V+
Sbjct: 1176 KDIELEVGDKVL-----LRNEVGHKLDFKYTGPYKIESIGDNNNITLLTNKNKK--QIVH 1228
Query: 2298 TDAVKKY 2304
D +KK+
Sbjct: 1229 KDRLKKF 1235
>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
Length = 2186
Score = 208 bits (529), Expect = 2e-52
Identities = 128/470 (27%), Positives = 242/470 (51%), Gaps = 13/470 (2%)
Query: 1228 ETSVKKQVIELLKEYVDVFAWSYQDMP-GLDTDIVVHHLPLKPECPPVKQKLRRTRPDMA 1286
E +++ ++++++ DVFA S ++ T+ V+ LK P++QK R +
Sbjct: 899 ENGDDRKIWDVIEQFQDVFAISDDELGRNSGTECVIE---LKEGAEPIRQKPRPIPLALK 955
Query: 1287 LKIKEEVQKQIDAGFLITSNYPQWLANIVPVPKKDGKVRMCVDYRDLNKASPKDDFPLPH 1346
+I++ +QK ++ + S P W + +V V KKDG +RMC+DYR +NK + PLP+
Sbjct: 956 PEIRKMIQKMLNQKVIRESKSP-WSSPVVLVKKKDGSIRMCIDYRKVNKVVKNNAHPLPN 1014
Query: 1347 IDVLVDSTAKSKVFSFMDGFSGYNQIKMAPEDREKTSFITPWGTFCYKVMPFGLINAGAT 1406
I+ + S A K+++ D +G+ QI + + +E T+F F + V+PFGL+ + A
Sbjct: 1015 IEATLQSLAGKKLYTVFDMIAGFWQIPLDEKSKEITAFAIGSELFEWNVLPFGLVISPAL 1074
Query: 1407 YQRGMTTLFHDMIHKEIEVYVDDMIVKSVTEEDHVKYLQKMFQRLRKYKLRLNPNKCTFG 1466
+Q M + D++ VYVDD+++ S E H++ +++ R+RK ++L +KC
Sbjct: 1075 FQGTMEEIIGDLLGVCAFVYVDDLLIASKDMEQHLQDVKEALTRIRKSGMKLRASKCHIA 1134
Query: 1467 VRSGKLLGFIVSQKGIEVDPDKVKAIREMPAPRTEKEVRGFLGRLNYISRFISHMTATCG 1526
+ + LG V+ G+E K +++ P KE++ FLG + Y +FI +
Sbjct: 1135 KKEVEYLGHKVTLDGVETQEVKTDKMKQFSRPTNVKELQSFLGLVGYYRKFILNFAQIAS 1194
Query: 1527 PIFKLLRKEQGIVWTEDCQKAFDSIKKYLLEPPILI-PPVEG-----RPLIMYLTVLENS 1580
+ L+ + +W ++ + AF +KK + + P+L P VE RP ++Y
Sbjct: 1195 SLTSLISAKVAWIWEKEQEIAFQELKKLVCQTPVLAQPDVEAALKGDRPFMIYTDASRKG 1254
Query: 1581 MGCVLGQQDETGRKEHAIYYLSKKFTECESRYSMLEKTCCALAWAAKRLRHYMINHTTWL 1640
+G VL Q+ G ++H I + SK + E+RY + + A+ +A +R + + +
Sbjct: 1255 IGAVLAQEGPDG-QQHPIAFASKALSPAETRYHITDLEALAMMFALRRFKTIIYGTAITV 1313
Query: 1641 VSKMDPIKYIFEKPALTGRIARWQMLLSEYDIECRSQKAIKGSILADHLA 1690
+ P+ + + L R+ RW + + E+D++ A K + +AD L+
Sbjct: 1314 FTDHKPLISLLKGSPLADRLWRWSIEILEFDVKI-VYLAGKANAVADALS 1362
Score = 108 bits (269), Expect = 2e-22
Identities = 89/330 (26%), Positives = 151/330 (44%), Gaps = 20/330 (6%)
Query: 1952 LIHEIHEGSFGIHPNGHTMAKKILRA---GYYWMTMESDCYKHTRKCHKCQIYADKIHMP 2008
L+ E+HEG GH KK+ R +YW M R C KC D +
Sbjct: 1468 LLKELHEGMLA----GHFGIKKMWRMVHRKFYWPQMRVCVENCVRTCAKCLCANDHSKLT 1523
Query: 2009 PTTLNLLSSPWPFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYANVTKQVVV 2068
++L +P + D++ + G+R+IL ID FTK+ A + + V+
Sbjct: 1524 -SSLTPYRMTFPLEIVACDLMD--VGLSVQGNRYILTIIDLFTKYGTAVPIPDKKAETVL 1580
Query: 2069 K-FIKNHIICRYGIPNRIITDNGTNLNNKMMKELCDDFKIEHHNSSPYRPQMNGAVEAAN 2127
K F++ I IP +++TD G N + + KIEH + Y + NGAVE N
Sbjct: 1581 KAFVERWAIGEGRIPLKLLTDQGKEFVNGLFAQFTHMLKIEHITTKGYNSRANGAVERFN 1640
Query: 2128 KNIKRIVQKMVVTYKDWHEMLPFALHGYRTSVRTSTGATPFSLVYGMEAVLPVEVEIPSL 2187
K I I++K +W + + +A++ Y V +TG TP L++G + + P+E+
Sbjct: 1641 KTIMHIMKKKTAVPMEWDDQVVYAVYAYNNCVHENTGETPMFLMHGRDVMGPLEMSGEDA 1700
Query: 2188 RVLMEADLSEAEWVQNRYDQLNLIEEKRMTALCHGQLYQKRMKQAFDKKVRPREFK---E 2244
+ AD+ E + + + L++ +++ A H Q+ K FD+K ++ +
Sbjct: 1701 VGINYADMDEYKHLLTQ----ELLKVQKI-AKEHAMREQESYKSLFDQKYASKKHRFPQP 1755
Query: 2245 GDLVLKKIFSFQPDSR-GKWAPNYEGPYVV 2273
G VL +I S + ++ K + GPY V
Sbjct: 1756 GSRVLLEIPSEKLGAQCPKLVNKWSGPYRV 1785
>RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protein
type 1
Length = 1333
Score = 201 bits (510), Expect = 3e-50
Identities = 125/441 (28%), Positives = 227/441 (51%), Gaps = 22/441 (4%)
Query: 1283 PDMALKIKEEVQKQIDAGFLITSNYPQWLANIVPVPKKDGKVRMCVDYRDLNKASPKDDF 1342
P + +E+ + + +G + S ++ VPKK+G +RM VDY+ LNK + +
Sbjct: 422 PGKMQAMNDEINQGLKSGIIRESKAIN-ACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIY 480
Query: 1343 PLPHIDVLVDSTAKSKVFSFMDGFSGYNQIKMAPEDREKTSFITPWGTFCYKVMPFGLIN 1402
PLP I+ L+ S +F+ +D S Y+ I++ D K +F P G F Y VMP+G+
Sbjct: 481 PLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGIST 540
Query: 1403 AGATYQRGMTTLFHDMIHKEIEVYVDDMIVKSVTEEDHVKYLQKMFQRLRKYKLRLNPNK 1462
A A +Q + T+ + + Y+DD+++ S +E +HVK+++ + Q+L+ L +N K
Sbjct: 541 APAHFQYFINTILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAK 600
Query: 1463 CTFGVRSGKLLGFIVSQKGIEVDPDKVKAIREMPAPRTEKEVRGFLGRLNYISRFISHMT 1522
C F K +G+ +S+KG + + + + P+ KE+R FLG +NY+ +FI +
Sbjct: 601 CEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTS 660
Query: 1523 ATCGPIFKLLRKEQGIVWTEDCQKAFDSIKKYLLEPPILIPPVEGRPLIMYLTVLENSMG 1582
P+ LL+K+ WT +A ++IK+ L+ PP+L + +++ + ++G
Sbjct: 661 QLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVG 720
Query: 1583 CVLGQQDETGRKEHAIYYLSKKFTECESRYSMLEKTCCALAWAAKRLRHYMINHTTWLVS 1642
VL Q+ + K + + Y S K ++ + YS+ +K A+ + K RHY L S
Sbjct: 721 AVLSQKHDDD-KYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHY-------LES 772
Query: 1643 KMDPIKYIFEKPALTGRI-----------ARWQMLLSEYDIECRSQKAIKGSILADHLAH 1691
++P K + + L GRI ARWQ+ L +++ E + + +AD L+
Sbjct: 773 TIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPG-SANHIADALS- 830
Query: 1692 QPLEDYRPIKFDFPDEEIMYL 1712
+ +++ PI D D I ++
Sbjct: 831 RIVDETEPIPKDSEDNSINFV 851
Score = 107 bits (268), Expect = 3e-22
Identities = 103/443 (23%), Positives = 192/443 (43%), Gaps = 37/443 (8%)
Query: 1836 ELHHIPRDENQMADALATLSSMIKVNHHNDVPLISVKFLDRPAYVFAAEVVFDDKPWFHD 1895
E+++ P N +ADAL+ + V+ +P K + + F ++ D
Sbjct: 814 EINYRPGSANHIADALSRI-----VDETEPIP----KDSEDNSINFVNQISITDDFKNQV 864
Query: 1896 IKVFLQTREYPPGASNKDKKTLRRLSSNFFLNGDILYKRNFDTVLLRCVDKYEADLLIHE 1955
+ + + +N+DK R+ N L +L D +LL D +I +
Sbjct: 865 VTEYTNDTKLLNLLNNEDK----RVEENIQLKDGLLINSK-DQILLPN-DTQLTRTIIKK 918
Query: 1956 IHEGSFGIHPNGHTMAKKILRAGYYWMTMESDCYKHTRKCHKCQIYADKIHMPPTTLNLL 2015
HE IHP + ILR + W + ++ + CH CQI + H P L +
Sbjct: 919 YHEEGKLIHPGIELLTNIILRR-FTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPI 977
Query: 2016 S-SPWPFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKW-VEAASYANVTKQVVVKFIKN 2073
S P+ +D I + S+G+ + V +D F+K + ++T + +
Sbjct: 978 PPSERPWESLSMDFITALPE--SSGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQ 1035
Query: 2074 HIICRYGIPNRIITDNGTNLNNKMMKELCDDFKIEHHNSSPYRPQMNGAVEAANKNIKRI 2133
+I +G P II DN ++ K+ + S PYRPQ +G E N+ ++++
Sbjct: 1036 RVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKL 1095
Query: 2134 VQKMVVTYKD-WHEMLPFALHGYRTSVRTSTGATPFSLVYGMEAVLPVEVEIPSLRVLME 2192
++ + T+ + W + + Y ++ ++T TPF +V+ L +E+PS
Sbjct: 1096 LRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALS-PLELPSF----- 1149
Query: 2193 ADLSEAEWVQNRYDQLNLIEEKRMTALCHGQLYQKRMKQAFDKKVRP-REFKEGDLVL-K 2250
+D ++ +N + + + + T H +MK+ FD K++ EF+ GDLV+ K
Sbjct: 1150 SDKTD----ENSQETIQVFQ----TVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVK 1201
Query: 2251 KIFSFQPDSRGKWAPNYEGPYVV 2273
+ + K AP++ GP+ V
Sbjct: 1202 RTKTGFLHKSNKLAPSFAGPFYV 1224
>RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protein
type 3
Length = 1333
Score = 198 bits (504), Expect = 1e-49
Identities = 124/441 (28%), Positives = 228/441 (51%), Gaps = 22/441 (4%)
Query: 1283 PDMALKIKEEVQKQIDAGFLITSNYPQWLANIVPVPKKDGKVRMCVDYRDLNKASPKDDF 1342
P + +E+ + + +G + S ++ VPKK+G +RM VDY+ LNK + +
Sbjct: 422 PGKMQAMNDEINQGLKSGIIRESKAIN-ACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIY 480
Query: 1343 PLPHIDVLVDSTAKSKVFSFMDGFSGYNQIKMAPEDREKTSFITPWGTFCYKVMPFGLIN 1402
PLP I+ L+ S +F+ +D S Y+ I++ D K +F P G F Y VMP+G+
Sbjct: 481 PLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISI 540
Query: 1403 AGATYQRGMTTLFHDMIHKEIEVYVDDMIVKSVTEEDHVKYLQKMFQRLRKYKLRLNPNK 1462
A A +Q + T+ ++ + Y+D++++ S +E +HVK+++ + Q+L+ L +N K
Sbjct: 541 APAHFQYFINTILGEVKESHVVCYMDNILIHSKSESEHVKHVKDVLQKLKNANLIINQAK 600
Query: 1463 CTFGVRSGKLLGFIVSQKGIEVDPDKVKAIREMPAPRTEKEVRGFLGRLNYISRFISHMT 1522
C F K +G+ +S+KG + + + + P+ KE+R FLG +NY+ +FI +
Sbjct: 601 CEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTS 660
Query: 1523 ATCGPIFKLLRKEQGIVWTEDCQKAFDSIKKYLLEPPILIPPVEGRPLIMYLTVLENSMG 1582
P+ LL+K+ WT +A ++IK+ L+ PP+L + +++ + ++G
Sbjct: 661 QLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVG 720
Query: 1583 CVLGQQDETGRKEHAIYYLSKKFTECESRYSMLEKTCCALAWAAKRLRHYMINHTTWLVS 1642
VL Q+ + K + + Y S K ++ + YS+ +K A+ + K RHY L S
Sbjct: 721 AVLSQKHDDD-KYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHY-------LES 772
Query: 1643 KMDPIKYIFEKPALTGRI-----------ARWQMLLSEYDIECRSQKAIKGSILADHLAH 1691
++P K + + L GRI ARWQ+ L +++ E + + +AD L+
Sbjct: 773 TIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPG-SANHIADALS- 830
Query: 1692 QPLEDYRPIKFDFPDEEIMYL 1712
+ +++ PI D D I ++
Sbjct: 831 RIVDETEPIPKDSEDNSINFV 851
Score = 107 bits (268), Expect = 3e-22
Identities = 103/443 (23%), Positives = 192/443 (43%), Gaps = 37/443 (8%)
Query: 1836 ELHHIPRDENQMADALATLSSMIKVNHHNDVPLISVKFLDRPAYVFAAEVVFDDKPWFHD 1895
E+++ P N +ADAL+ + V+ +P K + + F ++ D
Sbjct: 814 EINYRPGSANHIADALSRI-----VDETEPIP----KDSEDNSINFVNQISITDDFKNQV 864
Query: 1896 IKVFLQTREYPPGASNKDKKTLRRLSSNFFLNGDILYKRNFDTVLLRCVDKYEADLLIHE 1955
+ + + +N+DK R+ N L +L D +LL D +I +
Sbjct: 865 VTEYTNDTKLLNLLNNEDK----RVEENIQLKDGLLINSK-DQILLPN-DTQLTRTIIKK 918
Query: 1956 IHEGSFGIHPNGHTMAKKILRAGYYWMTMESDCYKHTRKCHKCQIYADKIHMPPTTLNLL 2015
HE IHP + ILR + W + ++ + CH CQI + H P L +
Sbjct: 919 YHEEGKLIHPGIELLTNIILRR-FTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPI 977
Query: 2016 S-SPWPFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKW-VEAASYANVTKQVVVKFIKN 2073
S P+ +D I + S+G+ + V +D F+K + ++T + +
Sbjct: 978 PPSERPWESLSMDFITALPE--SSGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQ 1035
Query: 2074 HIICRYGIPNRIITDNGTNLNNKMMKELCDDFKIEHHNSSPYRPQMNGAVEAANKNIKRI 2133
+I +G P II DN ++ K+ + S PYRPQ +G E N+ ++++
Sbjct: 1036 RVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKL 1095
Query: 2134 VQKMVVTYKD-WHEMLPFALHGYRTSVRTSTGATPFSLVYGMEAVLPVEVEIPSLRVLME 2192
++ + T+ + W + + Y ++ ++T TPF +V+ L +E+PS
Sbjct: 1096 LRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALS-PLELPSF----- 1149
Query: 2193 ADLSEAEWVQNRYDQLNLIEEKRMTALCHGQLYQKRMKQAFDKKVRP-REFKEGDLVL-K 2250
+D ++ +N + + + + T H +MK+ FD K++ EF+ GDLV+ K
Sbjct: 1150 SDKTD----ENSQETIQVFQ----TVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVK 1201
Query: 2251 KIFSFQPDSRGKWAPNYEGPYVV 2273
+ + K AP++ GP+ V
Sbjct: 1202 RTKTGFLHKSNKLAPSFAGPFYV 1224
>RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protein
type 2
Length = 1333
Score = 198 bits (504), Expect = 1e-49
Identities = 124/441 (28%), Positives = 228/441 (51%), Gaps = 22/441 (4%)
Query: 1283 PDMALKIKEEVQKQIDAGFLITSNYPQWLANIVPVPKKDGKVRMCVDYRDLNKASPKDDF 1342
P + +E+ + + +G + S ++ VPKK+G +RM VDY+ LNK + +
Sbjct: 422 PGKMQAMNDEINQGLKSGIIRESKAIN-ACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIY 480
Query: 1343 PLPHIDVLVDSTAKSKVFSFMDGFSGYNQIKMAPEDREKTSFITPWGTFCYKVMPFGLIN 1402
PLP I+ L+ S +F+ +D S Y+ I++ D K +F P G F Y VMP+G+
Sbjct: 481 PLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISI 540
Query: 1403 AGATYQRGMTTLFHDMIHKEIEVYVDDMIVKSVTEEDHVKYLQKMFQRLRKYKLRLNPNK 1462
A A +Q + T+ ++ + Y+D++++ S +E +HVK+++ + Q+L+ L +N K
Sbjct: 541 APAHFQYFINTILGEVKESHVVCYMDNILIHSKSESEHVKHVKDVLQKLKNANLIINQAK 600
Query: 1463 CTFGVRSGKLLGFIVSQKGIEVDPDKVKAIREMPAPRTEKEVRGFLGRLNYISRFISHMT 1522
C F K +G+ +S+KG + + + + P+ KE+R FLG +NY+ +FI +
Sbjct: 601 CEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTS 660
Query: 1523 ATCGPIFKLLRKEQGIVWTEDCQKAFDSIKKYLLEPPILIPPVEGRPLIMYLTVLENSMG 1582
P+ LL+K+ WT +A ++IK+ L+ PP+L + +++ + ++G
Sbjct: 661 QLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVG 720
Query: 1583 CVLGQQDETGRKEHAIYYLSKKFTECESRYSMLEKTCCALAWAAKRLRHYMINHTTWLVS 1642
VL Q+ + K + + Y S K ++ + YS+ +K A+ + K RHY L S
Sbjct: 721 AVLSQKHDDD-KYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHY-------LES 772
Query: 1643 KMDPIKYIFEKPALTGRI-----------ARWQMLLSEYDIECRSQKAIKGSILADHLAH 1691
++P K + + L GRI ARWQ+ L +++ E + + +AD L+
Sbjct: 773 TIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPG-SANHIADALS- 830
Query: 1692 QPLEDYRPIKFDFPDEEIMYL 1712
+ +++ PI D D I ++
Sbjct: 831 RIVDETEPIPKDSEDNSINFV 851
Score = 107 bits (268), Expect = 3e-22
Identities = 103/443 (23%), Positives = 192/443 (43%), Gaps = 37/443 (8%)
Query: 1836 ELHHIPRDENQMADALATLSSMIKVNHHNDVPLISVKFLDRPAYVFAAEVVFDDKPWFHD 1895
E+++ P N +ADAL+ + V+ +P K + + F ++ D
Sbjct: 814 EINYRPGSANHIADALSRI-----VDETEPIP----KDSEDNSINFVNQISITDDFKNQV 864
Query: 1896 IKVFLQTREYPPGASNKDKKTLRRLSSNFFLNGDILYKRNFDTVLLRCVDKYEADLLIHE 1955
+ + + +N+DK R+ N L +L D +LL D +I +
Sbjct: 865 VTEYTNDTKLLNLLNNEDK----RVEENIQLKDGLLINSK-DQILLPN-DTQLTRTIIKK 918
Query: 1956 IHEGSFGIHPNGHTMAKKILRAGYYWMTMESDCYKHTRKCHKCQIYADKIHMPPTTLNLL 2015
HE IHP + ILR + W + ++ + CH CQI + H P L +
Sbjct: 919 YHEEGKLIHPGIELLTNIILRR-FTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPI 977
Query: 2016 S-SPWPFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKW-VEAASYANVTKQVVVKFIKN 2073
S P+ +D I + S+G+ + V +D F+K + ++T + +
Sbjct: 978 PPSERPWESLSMDFITALPE--SSGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQ 1035
Query: 2074 HIICRYGIPNRIITDNGTNLNNKMMKELCDDFKIEHHNSSPYRPQMNGAVEAANKNIKRI 2133
+I +G P II DN ++ K+ + S PYRPQ +G E N+ ++++
Sbjct: 1036 RVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKL 1095
Query: 2134 VQKMVVTYKD-WHEMLPFALHGYRTSVRTSTGATPFSLVYGMEAVLPVEVEIPSLRVLME 2192
++ + T+ + W + + Y ++ ++T TPF +V+ L +E+PS
Sbjct: 1096 LRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALS-PLELPSF----- 1149
Query: 2193 ADLSEAEWVQNRYDQLNLIEEKRMTALCHGQLYQKRMKQAFDKKVRP-REFKEGDLVL-K 2250
+D ++ +N + + + + T H +MK+ FD K++ EF+ GDLV+ K
Sbjct: 1150 SDKTD----ENSQETIQVFQ----TVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVK 1201
Query: 2251 KIFSFQPDSRGKWAPNYEGPYVV 2273
+ + K AP++ GP+ V
Sbjct: 1202 RTKTGFLHKSNKLAPSFAGPFYV 1224
>POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from
transposon opus [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1003
Score = 196 bits (498), Expect = 6e-49
Identities = 131/427 (30%), Positives = 210/427 (48%), Gaps = 23/427 (5%)
Query: 1287 LKIKEEVQKQIDA----GFLITSNYPQ----WLANIVPVPKKDGKVRMCVDYRDLNKASP 1338
+ ++ EV++QID G + SN P W+ P P + + RM VD++ LN +
Sbjct: 133 VNMRGEVERQIDELLQDGIIRPSNSPYNSPIWIVPKKPKPNGEKQYRMVVDFKRLNTVTI 192
Query: 1339 KDDFPLPHIDVLVDSTAKSKVFSFMDGFSGYNQIKMAPEDREKTSFITPWGTFCYKVMPF 1398
D +P+P I+ + S +K F+ +D SG++QI M D KT+F T G + + +PF
Sbjct: 193 PDTYPIPDINATLASLGNAKYFTTLDLTSGFHQIHMKESDIPKTAFSTLNGKYEFLRLPF 252
Query: 1399 GLINAGATYQRGMTTLFHDMIHKEIEVYVDDMIVKSVTEEDHVKYLQKMFQRLRKYKLRL 1458
GL NA A +QR + + + I K VY+DD+IV S + H K L+ + L K L++
Sbjct: 253 GLKNAPAIFQRMIDDILREHIGKVCYVYIDDIIVFSEDYDTHWKNLRLVLASLSKANLQV 312
Query: 1459 NPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVKAIREMPAPRTEKEVRGFLGRLNYISRFI 1518
N K F + LG+IV+ GI+ DP KV+AI EMP P + KE++ FLG +Y +FI
Sbjct: 313 NLEKSHFLDTQVEFLGYIVTADGIKADPKKVRAISEMPPPTSVKELKRFLGMTSYYRKFI 372
Query: 1519 SHMTATCGPIFKLLR-----------KEQGIVWTEDCQKAFDSIKKYLLEPPILIPPVEG 1567
P+ L R + I E ++F+ +K L IL P
Sbjct: 373 QDYAKVAKPLTNLTRGLYANIKSSQSSKVPITLDETALQSFNDLKSILCSSEILAFPCFT 432
Query: 1568 RPLIMYLTVLENSMGCVLGQQDETGRKEHAIYYLSKKFTECESRYSMLEKTCCALAWAAK 1627
+P + ++G VL Q D+ ++ I Y+S+ + E Y+ +EK A+ W+
Sbjct: 433 KPFHLTTDASNWAIGAVLSQDDQ--GRDRPIAYISRSLNKTEENYATIEKEMLAIIWSLD 490
Query: 1628 RLRHYMINHTTWLV-SKMDPIKYIFEKPALTGRIARWQMLLSEYDIECRSQKAIKGSILA 1686
LR Y+ T V + P+ + ++ RW+ + EY+ E K K +++A
Sbjct: 491 NLRAYLYGAGTIKVYTDHQPLTFALGNRNFNAKLKRWKARIEEYNCEL-IYKPGKSNVVA 549
Query: 1687 DHLAHQP 1693
D L+ P
Sbjct: 550 DALSRIP 556
Score = 40.4 bits (93), Expect = 0.058
Identities = 56/248 (22%), Positives = 98/248 (38%), Gaps = 21/248 (8%)
Query: 1935 NFDTVLLRCVDKYEADLL----IHEIHEGSFGIHPNGHTMAKKILRAGYYWMTMESDCYK 1990
NF +R + AD+ I EI E G T + L YY+ M S
Sbjct: 671 NFLLYKIRITQRLVADVSGAEEICEIIEKEHRRAHRGPTEIRLQLLEKYYFPRMSSTIRL 730
Query: 1991 HTRKCHKCQIYADKIHMPPTTLNLLSSP---WPFSMWGIDMIGRIEPKASNGHRFILVAI 2047
T C C++Y + H P NL +P +P + ID+ + R L I
Sbjct: 731 QTSSCQCCKLYKYERH--PNKPNLQPTPIPNYPCEILHIDIFALEK-------RLYLSCI 781
Query: 2048 DYFTKWVEAASYANVTKQVVVKFIKNHIICRYGIPNRIITDNGTNLNNKMMKELCDDFKI 2107
D F+K+ + + V ++ + + P +++DN L + I
Sbjct: 782 DKFSKFAK-LFHLQSKASVHLRETLVEALHYFTAPKVLVSDNERGLLCPTVLNYLRSLDI 840
Query: 2108 EHHNSSPYRPQMNGAVEAANK---NIKRIVQKMVVTYKDWHEMLPFALHGYRTSVRTSTG 2164
+ + + + ++NG VE + I R ++ + T+K E++ A+ Y TSV + T
Sbjct: 841 DLYYAPTQKSEVNGQVERFHSTFLEIYRCLKDELPTFKP-VELVHIAVDRYNTSVHSVTN 899
Query: 2165 ATPFSLVY 2172
P + +
Sbjct: 900 RKPADVFF 907
>POLY_DROME (P10401) Retrovirus-related Pol polyprotein from
transposon gypsy [Contains: Reverse transcriptase (EC
2.7.7.49); Endonuclease]
Length = 1035
Score = 183 bits (464), Expect = 6e-45
Identities = 141/488 (28%), Positives = 239/488 (48%), Gaps = 38/488 (7%)
Query: 1230 SVKKQVIELLKEYVDVFAWSYQDMPGLDTDIVVHHLPLKPECPPVKQKLRRTRPDMALKI 1289
SVKK+ + + F+ + + +P +T + + E PV + T ++ +
Sbjct: 141 SVKKEFKDTIIRRKKAFSTTNEALP-FNTAVTATIRTVDNE--PVYSRAYPTLMGVSDFV 197
Query: 1290 KEEVQKQIDAGFLITS----NYPQWLANIVPVPKK------DGKVRMCVDYRDLNKASPK 1339
EV++ + G + S N P W+ V KK + R+ +D+R LN+ +
Sbjct: 198 NNEVKQLLKDGIIRPSRSPYNSPTWV-----VDKKGTDAFGNPNKRLVIDFRKLNEKTIP 252
Query: 1340 DDFPLPHIDVLVDSTAKSKVFSFMDGFSGYNQIKMAPEDREKTSFITPWGTFCYKVMPFG 1399
D +P+P I +++ + K+K F+ +D SGY+QI +A DREKTSF G + + +PFG
Sbjct: 253 DRYPMPSIPMILANLGKAKFFTTLDLKSGYHQIYLAEHDREKTSFSVNGGKYEFCRLPFG 312
Query: 1400 LINAGATYQRGMTTLFHDMIHKEIEVYVDDMIVKSVTEEDHVKYLQKMFQRLRKYKLRLN 1459
L NA + +QR + + + I K VYVDD+I+ S E DHV+++ + + L +R++
Sbjct: 313 LRNASSIFQRALDDVLREQIGKICYVYVDDVIIFSENESDHVRHIDTVLKCLIDANMRVS 372
Query: 1460 PNKCTFGVRSGKLLGFIVSQKGIEVDPDKVKAIREMPAPRTEKEVRGFLGRLNYISRFIS 1519
K F S + LGFIVS+ G + DP+KVKAI+E P P +VR FLG +Y FI
Sbjct: 373 QEKTRFFKESVEYLGFIVSKDGTKSDPEKVKAIQEYPEPDCVYKVRSFLGLASYYRVFIK 432
Query: 1520 HMTATCGPIFKLLRKEQGIV-----------WTEDCQKAFDSIKKYLL-EPPILIPPVEG 1567
A PI +L+ E G V + E + AF ++ L E IL P
Sbjct: 433 DFAAIARPITDILKGENGSVSKHMSKKIPVEFNETQRNAFQRLRNILASEDVILKYPDFK 492
Query: 1568 RPLIMYLTVLENSMGCVLGQQDETGRKEHAIYYLSKKFTECESRYSMLEKTCCALAWAAK 1627
+P + + +G VL Q+ GR I +S+ + E Y+ E+ A+ WA
Sbjct: 493 KPFDLTTDASASGIGAVLSQE---GR---PITMISRTLKQPEQNYATNERELLAIVWALG 546
Query: 1628 RLRHYMI-NHTTWLVSKMDPIKYIFEKPALTGRIARWQMLLSEYDIECRSQKAIKGSILA 1686
+L++++ + + + P+ + +I RW+ + +++ + K K + +A
Sbjct: 547 KLQNFLYGSREINIFTDHQPLTFAVADRNTNAKIKRWKSYIDQHNAKV-FYKPGKENFVA 605
Query: 1687 DHLAHQPL 1694
D L+ Q L
Sbjct: 606 DALSRQNL 613
>POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 130 bits (327), Expect = 4e-29
Identities = 133/527 (25%), Positives = 225/527 (42%), Gaps = 42/527 (7%)
Query: 1193 LEQEKKTIQPYGDELEVINLGTKEDKKEIKVGASLETSVKKQVIELLKEYVDVFAWSYQD 1252
+++ KT QP E +N+ T + + +K E ++ + L +E + + Q
Sbjct: 162 MKKRSKTQQP-----EPVNISTNKIENPLK-----EIAILSEGRRLSEEKLFITQQRMQK 211
Query: 1253 MPGLDTDIVVHHLPLKPECPP--VKQKLRRTRPDMALKIK---------EEVQKQIDAGF 1301
+ L + V PL P +K ++ + P A+K+K EE KQI
Sbjct: 212 IEEL-LEKVCSENPLDPNKTKQWMKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELL 270
Query: 1302 LITSNYPQWLANIVPV-------PKKDGKVRMCVDYRDLNKASPKDDFPLPHIDVLVDST 1354
+ P ++ P K+ GK RM V+Y+ +NKA+ D + LP+ D L+
Sbjct: 271 DLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATIGDAYNLPNKDELLTLI 330
Query: 1355 AKSKVFSFMDGFSGYNQIKMAPEDREKTSFITPWGTFCYKVMPFGLINAGATYQRGMTTL 1414
K+FS D SG+ Q+ + E R T+F P G + + V+PFGL A + +QR M
Sbjct: 331 RGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEA 390
Query: 1415 FHDMIHKEIEVYVDDMIVKSVTEEDHVKYLQKMFQRLRKYKLRLNPNKCTFGVRSGKLLG 1474
F + K VYVDD++V S EEDH+ ++ + Q+ ++ + L+ K + LG
Sbjct: 391 FR-VFRKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLG 449
Query: 1475 FIVSQKGIEVDPDKVKAIREMP-APRTEKEVRGFLGRLNYISRFISHMTATCGPIFKLLR 1533
+ + + ++ I + P +K+++ FLG L Y S +I + P+ L+
Sbjct: 450 LEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLK 509
Query: 1534 KEQGIVWTEDCQKAFDSIKKYLLEPPILIPPVEGRPLIMYLTVLENSMGCVLG--QQDET 1591
+ WT++ +KK L P L P+ LI+ ++ G +L + +E
Sbjct: 510 ENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEG 569
Query: 1592 GRKEHAIYYLSKKFTECESRYSMLEKTCCALAWAAKRLR------HYMINHTTWLVSKMD 1645
E Y S F E Y +K A+ K+ H++I
Sbjct: 570 TNTELICRYASGSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFV 629
Query: 1646 PIKYIFEKPALTGRIARWQMLLSEYDIECRSQKAIKGSILADHLAHQ 1692
+ Y + + GR RWQ LS Y + K AD L+ +
Sbjct: 630 NLNY--KGDSKLGRNIRWQAWLSHYSFDVEHIKGTDNH-FADFLSRE 673
>POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 129 bits (324), Expect = 9e-29
Identities = 132/527 (25%), Positives = 225/527 (42%), Gaps = 42/527 (7%)
Query: 1193 LEQEKKTIQPYGDELEVINLGTKEDKKEIKVGASLETSVKKQVIELLKEYVDVFAWSYQD 1252
+++ KT QP E +N+ T + + ++ E ++ + L +E + + Q
Sbjct: 162 MKKRSKTQQP-----EPVNISTNKIENPLE-----EIAILSEGRRLSEEKLFITQQRMQK 211
Query: 1253 MPGLDTDIVVHHLPLKPECPP--VKQKLRRTRPDMALKIK---------EEVQKQIDAGF 1301
+ L + V PL P +K ++ + P A+K+K EE KQI
Sbjct: 212 IEEL-LEKVCSENPLDPNKTKQWMKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELL 270
Query: 1302 LITSNYPQWLANIVPV-------PKKDGKVRMCVDYRDLNKASPKDDFPLPHIDVLVDST 1354
+ P ++ P K+ GK RM V+Y+ +NKA+ D + LP+ D L+
Sbjct: 271 DLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATVGDAYNLPNKDELLTLI 330
Query: 1355 AKSKVFSFMDGFSGYNQIKMAPEDREKTSFITPWGTFCYKVMPFGLINAGATYQRGMTTL 1414
K+FS D SG+ Q+ + E R T+F P G + + V+PFGL A + +QR M
Sbjct: 331 RGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEA 390
Query: 1415 FHDMIHKEIEVYVDDMIVKSVTEEDHVKYLQKMFQRLRKYKLRLNPNKCTFGVRSGKLLG 1474
F + K VYVDD++V S EEDH+ ++ + Q+ ++ + L+ K + LG
Sbjct: 391 FR-VFRKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLG 449
Query: 1475 FIVSQKGIEVDPDKVKAIREMP-APRTEKEVRGFLGRLNYISRFISHMTATCGPIFKLLR 1533
+ + + ++ I + P +K+++ FLG L Y S +I + P+ L+
Sbjct: 450 LEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLK 509
Query: 1534 KEQGIVWTEDCQKAFDSIKKYLLEPPILIPPVEGRPLIMYLTVLENSMGCVLG--QQDET 1591
+ WT++ +KK L P L P+ LI+ ++ G +L + +E
Sbjct: 510 ENVPWRWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEG 569
Query: 1592 GRKEHAIYYLSKKFTECESRYSMLEKTCCALAWAAKRLR------HYMINHTTWLVSKMD 1645
E Y S F E Y +K A+ K+ H++I
Sbjct: 570 TNTELICRYASGSFKAAEKNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFV 629
Query: 1646 PIKYIFEKPALTGRIARWQMLLSEYDIECRSQKAIKGSILADHLAHQ 1692
+ Y + + GR RWQ LS Y + K AD L+ +
Sbjct: 630 NLNY--KGDSKLGRNIRWQAWLSHYSFDVEHIKGTDNH-FADFLSRE 673
>POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 129 bits (323), Expect = 1e-28
Identities = 132/527 (25%), Positives = 225/527 (42%), Gaps = 42/527 (7%)
Query: 1193 LEQEKKTIQPYGDELEVINLGTKEDKKEIKVGASLETSVKKQVIELLKEYVDVFAWSYQD 1252
+++ KT QP E +N+ T + + ++ E ++ + L +E + + Q
Sbjct: 162 MKKRSKTQQP-----EPVNISTNKIENPLE-----EIAILSEGRRLSEEKLFITQQRMQK 211
Query: 1253 MPGLDTDIVVHHLPLKPECPP--VKQKLRRTRPDMALKIK---------EEVQKQIDAGF 1301
+ L + V PL P +K ++ + P A+K+K EE KQI
Sbjct: 212 IEEL-LEKVCSENPLDPNKTKQWMKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELL 270
Query: 1302 LITSNYPQWLANIVPV-------PKKDGKVRMCVDYRDLNKASPKDDFPLPHIDVLVDST 1354
+ P ++ P K+ GK RM V+Y+ +NKA+ D + LP+ D L+
Sbjct: 271 DLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATIGDAYNLPNKDELLTLI 330
Query: 1355 AKSKVFSFMDGFSGYNQIKMAPEDREKTSFITPWGTFCYKVMPFGLINAGATYQRGMTTL 1414
K+FS D SG+ Q+ + E R T+F P G + + V+PFGL A + +QR M
Sbjct: 331 RGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEA 390
Query: 1415 FHDMIHKEIEVYVDDMIVKSVTEEDHVKYLQKMFQRLRKYKLRLNPNKCTFGVRSGKLLG 1474
F + K VYVDD++V S EEDH+ ++ + Q+ ++ + L+ K + LG
Sbjct: 391 FR-VFRKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLG 449
Query: 1475 FIVSQKGIEVDPDKVKAIREMP-APRTEKEVRGFLGRLNYISRFISHMTATCGPIFKLLR 1533
+ + + ++ I + P +K+++ FLG L Y S +I + P+ L+
Sbjct: 450 LEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLK 509
Query: 1534 KEQGIVWTEDCQKAFDSIKKYLLEPPILIPPVEGRPLIMYLTVLENSMGCVLG--QQDET 1591
+ WT++ +KK L P L P+ LI+ ++ G +L + +E
Sbjct: 510 ENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEG 569
Query: 1592 GRKEHAIYYLSKKFTECESRYSMLEKTCCALAWAAKRLR------HYMINHTTWLVSKMD 1645
E Y S F E Y +K A+ K+ H++I
Sbjct: 570 TNTELICRYASGSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFV 629
Query: 1646 PIKYIFEKPALTGRIARWQMLLSEYDIECRSQKAIKGSILADHLAHQ 1692
+ Y + + GR RWQ LS Y + K AD L+ +
Sbjct: 630 NLNY--KGDSKLGRNIRWQAWLSHYSFDVEHIKGTDNH-FADFLSRE 673
>POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 674
Score = 125 bits (315), Expect = 1e-27
Identities = 115/444 (25%), Positives = 191/444 (42%), Gaps = 29/444 (6%)
Query: 1274 VKQKLRRTRPDMALKIK---------EEVQKQIDAGFLITSNYPQWLANIVPV------- 1317
+K ++ + P A+K+K EE KQI + P ++ P
Sbjct: 229 MKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEA 288
Query: 1318 PKKDGKVRMCVDYRDLNKASPKDDFPLPHIDVLVDSTAKSKVFSFMDGFSGYNQIKMAPE 1377
K+ GK RM V+Y+ +NKA+ D + P+ D L+ K+FS D SG+ Q+ + E
Sbjct: 289 EKRRGKKRMVVNYKAMNKATVGDAYNPPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQE 348
Query: 1378 DREKTSFITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEIEVYVDDMIVKSVTE 1437
R T+F P G + + V+PFGL A + +QR M F + K VYVDD++V S E
Sbjct: 349 SRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFR-VFRKFCCVYVDDILVFSNNE 407
Query: 1438 EDHVKYLQKMFQRLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVKAIREMP- 1496
EDH+ ++ + Q+ ++ + L+ K + LG + + + ++ I + P
Sbjct: 408 EDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPD 467
Query: 1497 APRTEKEVRGFLGRLNYISRFISHMTATCGPIFKLLRKEQGIVWTEDCQKAFDSIKKYLL 1556
+K+++ FLG L Y S +I + P+ L++ WT++ +KK L
Sbjct: 468 TLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKNLQ 527
Query: 1557 EPPILIPPVEGRPLIMYLTVLENSMGCVLG--QQDETGRKEHAIYYLSKKFTECESRYSM 1614
P L P+ LI+ ++ G +L + +E E Y S F E Y
Sbjct: 528 GFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAEKNYHS 587
Query: 1615 LEKTCCALAWAAKRLR------HYMINHTTWLVSKMDPIKYIFEKPALTGRIARWQMLLS 1668
+K A+ K+ H++I + Y + + GR RWQ LS
Sbjct: 588 NDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNY--KGDSKLGRNIRWQAWLS 645
Query: 1669 EYDIECRSQKAIKGSILADHLAHQ 1692
Y + K AD L+ +
Sbjct: 646 HYSFDVEHIKGTDNH-FADFLSRE 668
>POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 666
Score = 125 bits (314), Expect = 1e-27
Identities = 136/529 (25%), Positives = 229/529 (42%), Gaps = 33/529 (6%)
Query: 1187 DEISRLLEQEKKTIQPYGDELEVINLGTKED-KKEIKVGASLETSVKKQVIELLKEYVDV 1245
D I+ L+ E I+ V N E+ KK+ K T++ K +I + Y +
Sbjct: 139 DRIAFHLKNEMVLIKKVTKAFSVSNPSFLENMKKDSKTEQIPGTNISKNIINPEERYF-L 197
Query: 1246 FAWSYQDMPGLDTDIVVHHLPLKPECPP--VKQKLRRTRPDMALKIK------------- 1290
YQ + L D V P+ P +K ++ P +++K
Sbjct: 198 ITEKYQKIEQL-LDKVCSENPIDPIKSKQWMKASIKLIDPLKVIRVKPMSYSPQDREGFA 256
Query: 1291 EEVQKQIDAGFLITSNY----PQWLANIVPVPKKDGKVRMCVDYRDLNKASPKDDFPLPH 1346
+++++ +D G +I S P +L ++ GK RM V+Y+ +N+A+ D LP+
Sbjct: 257 KQIKELLDLGLIIPSKSQHMSPAFLVEN-EAERRRGKKRMVVNYKAINQATIGDSHNLPN 315
Query: 1347 IDVLVDSTAKSKVFSFMDGFSGYNQIKMAPEDREKTSFITPWGTFCYKVMPFGLINAGAT 1406
+ L+ +FS D SG+ Q+ + E ++ T+F P G F +KV+PFGL A +
Sbjct: 316 MQELLTLLRGKSIFSSFDCKSGFWQVVLDEESQKLTAFTCPQGHFQWKVVPFGLKQAPSI 375
Query: 1407 YQRGMTTLFHDMIHKEIEVYVDDMIVKSVTEEDHVKYLQKMFQRLRKYKLRLNPNKCTFG 1466
+QR M T + K VYVDD+IV S +E DH ++ + + + KY + L+ K
Sbjct: 376 FQRHMQTALNG-ADKFCMVYVDDIIVFSNSELDHYNHVYAVLKIVEKYGIILSKKKANLF 434
Query: 1467 VRSGKLLGFIVSQKGIEVDPDKV-KAIREMP-APRTEKEVRGFLGRLNYISRFISHMTAT 1524
LG + KG + + + I + P +K ++ FLG L Y +I +
Sbjct: 435 KEKINFLGLEI-DKGTHCPQNHILENIHKFPDRLEDKKHLQRFLGVLTYAETYIPKLAEI 493
Query: 1525 CGPIFKLLRKEQGIVWTEDCQKAFDSIKKYLLEPPILIPPVEGRPLIMYLTVLENSMGCV 1584
P+ L+K+ WT+ IKK L P L P LI+ ++ G V
Sbjct: 494 RKPLQVKLKKDVTWNWTQSDSDYVKKIKKNLGSFPKLYLPKPEDHLIIETDASDSFWGGV 553
Query: 1585 LGQQDETGRKEHAIYYLSKKFTECESRYSMLEKTCCALAWAAKRLRHYM--INHTTWLVS 1642
L + G E Y S F + E Y +K A+ + Y+ + T +
Sbjct: 554 LKARALDG-VELICRYSSGSFKQAEKNYHSNDKELLAVKQVITKFSAYLTPVRFTVRTDN 612
Query: 1643 KMDP--IKYIFEKPALTGRIARWQMLLSEYDIECRSQKAIKGSILADHL 1689
K ++ + + GR+ RWQ S+Y + + +K ++LAD L
Sbjct: 613 KNFTYFLRINLKGDSKQGRLVRWQNWFSKYQFDVEHLEGVK-NVLADCL 660
>POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 680
Score = 124 bits (310), Expect = 4e-27
Identities = 130/527 (24%), Positives = 222/527 (41%), Gaps = 42/527 (7%)
Query: 1193 LEQEKKTIQPYGDELEVINLGTKEDKKEIKVGASLETSVKKQVIELLKEYVDVFAWSYQD 1252
+++ KT QP E +N+ T + + ++ E ++ + L +E + + Q
Sbjct: 163 MKKRSKTQQP-----EPVNISTNKIENPLE-----EIAILSEGRRLSEEKLFITQQRMQK 212
Query: 1253 MPGLDTDIVVHHLPLKPECPP--VKQKLRRTRPDMALKIK---------EEVQKQIDAGF 1301
L + V PL P +K ++ + P A+K+K EE KQI
Sbjct: 213 TEEL-LEKVCSENPLDPNKTKQWMKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELL 271
Query: 1302 LITSNYPQWLANIVPV-------PKKDGKVRMCVDYRDLNKASPKDDFPLPHIDVLVDST 1354
+ P ++ P G RM V+Y+ +NKA+ D + LP+ D L+
Sbjct: 272 DLKVIKPSKSPHMAPAFLVNNEAENGRGNKRMVVNYKAMNKATVGDAYNLPNKDELLTLI 331
Query: 1355 AKSKVFSFMDGFSGYNQIKMAPEDREKTSFITPWGTFCYKVMPFGLINAGATYQRGMTTL 1414
K+FS D SG+ Q+ + E R T+F P G + + V+PFGL A + +QR M
Sbjct: 332 RGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEA 391
Query: 1415 FHDMIHKEIEVYVDDMIVKSVTEEDHVKYLQKMFQRLRKYKLRLNPNKCTFGVRSGKLLG 1474
F + K VYVDD++V S EEDH+ ++ + Q+ ++ + L+ K + LG
Sbjct: 392 FR-VFRKFCCVYVDDIVVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLG 450
Query: 1475 FIVSQKGIEVDPDKVKAIREMP-APRTEKEVRGFLGRLNYISRFISHMTATCGPIFKLLR 1533
+ + + ++ I + P +K+++ FLG L Y S +I ++ P+ L+
Sbjct: 451 LEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPNLAQMRQPLQAKLK 510
Query: 1534 KEQGIVWTEDCQKAFDSIKKYLLEPPILIPPVEGRPLIMYLTVLENSMGCVLG--QQDET 1591
+ WT++ +KK L P L P+ LI+ ++ G +L + +E
Sbjct: 511 ENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEG 570
Query: 1592 GRKEHAIYYLSKKFTECESRYSMLEKTCCALAWAAKRLR------HYMINHTTWLVSKMD 1645
E Y S F E Y +K A+ K+ H++I
Sbjct: 571 TNTELICRYRSGSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFV 630
Query: 1646 PIKYIFEKPALTGRIARWQMLLSEYDIECRSQKAIKGSILADHLAHQ 1692
+ Y + + GR RWQ LS Y + K AD L+ +
Sbjct: 631 NLNY--KGDSKLGRNIRWQAWLSHYSFDVEHIKGTDNH-FADFLSRE 674
>YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II
Length = 1268
Score = 121 bits (304), Expect = 2e-26
Identities = 87/294 (29%), Positives = 148/294 (49%), Gaps = 12/294 (4%)
Query: 1228 ETSVKKQVIELLKEYVDVFAWSYQDMPGLDTDIVVHHLPLKPECPPVKQKLRRTRPDMAL 1287
ET + + L ++ +VF +D GL T + PV ++ R
Sbjct: 400 ETEASRLEVMLKNDFPEVF----KDGLGLCTKEKAE-FRTEENAVPVFKRARPVPYGSLE 454
Query: 1288 KIKEEVQKQIDAGFLITSNYPQWLANIVPVPKKD-GKVRMCVDYR--DLNKASPKDDFPL 1344
++ E+ + + G ++ Y +W A IV + KK GK+R+C D++ LN A + PL
Sbjct: 455 AVETELNRLQEMGVIVPITYAKWAAPIVVIKKKGTGKIRVCADFKCSGLNAALKDEFHPL 514
Query: 1345 PHIDVLVDSTAKSKVFSFMDGFSGYNQIKMAPEDREKTSFITPWGTFCYKVMPFGLINAG 1404
P + + S K V+S +D Y Q+++ E ++ T G F Y M FGL A
Sbjct: 515 PTSEDIF-SRLKGTVYSQIDLKDAYLQVELDEEAQKLAVINTHRGIFKYLRMTFGLKPAP 573
Query: 1405 ATYQRGMTTLFHDMIHKEIEVYVDDMIVKSVTEEDHVKYLQKMFQRLRKYKLRLNPNKCT 1464
A++Q+ M + + + VY DD+I+ + + E+H K L+++F+R ++Y R++ KC
Sbjct: 574 ASFQKIMDKMVSGLTG--VAVYWDDIIISASSIEEHEKILRELFERFKEYGFRVSAEKCA 631
Query: 1465 FGVRSGKLLGFIVSQKGIEVDPDKVKAIREMPAPRTEKEVRGFLGRLNYISRFI 1518
F + LGF V + G D K +AIR M AP +K++ FLG +++SR +
Sbjct: 632 FAQKQVTFLGF-VDEHGRRPDSKKTEAIRSMKAPTDQKQLASFLGAADWLSRMM 684
Score = 87.8 bits (216), Expect = 3e-16
Identities = 70/260 (26%), Positives = 121/260 (45%), Gaps = 24/260 (9%)
Query: 1944 VDKYEADLLIHEIHEGSFGIHPNGHTMAKKILRAGYYWMTMESDCYKHTRKCHKCQIYAD 2003
V K +++ ++HEG HP G K+ R+ +W ++SD R C+ CQ +
Sbjct: 778 VPKSLQKIVLKQLHEG----HP-GIVQMKQKARSFVFWRGLDSDIENMVRHCNNCQENSK 832
Query: 2004 KIHMPPTTLNLLSSPWPF--SMWG---IDMIGRIEPKASNGHRFILVAIDYFTKWVEAAS 2058
+ P +PWP + W ID G + NG ++LV +D TK+ E
Sbjct: 833 MPRVVPL------NPWPVPEAPWKRIHIDFAGPL-----NGC-YLLVVVDAKTKYAEVKL 880
Query: 2059 YANVTKQVVVKFIKNHIICRYGIPNRIITDNGTNLNNKMMKELCDDFKIEHHNSSPYRPQ 2118
+++ + ++ I +G P II+DNGT L + + ++C IEH S+ Y P+
Sbjct: 881 TRSISAVTTIDLLEE-IFSIHGYPETIISDNGTQLTSHLFAQMCQSHGIEHKTSAVYYPR 939
Query: 2119 MNGAVEAANKNIKRIVQKMVVTYKDWHEMLPFALHGYRTSVRTS-TGATPFSLVYGMEAV 2177
NGA E +KR + K+ ++L L YR + ++ G+TP +G +
Sbjct: 940 SNGAAERFVDTLKRGIAKIKGEGSVNQQILNKFLISYRNTPHSALNGSTPAECHFGRKIR 999
Query: 2178 LPVEVEIPSLRVLMEADLSE 2197
+ + +P+ RVL L++
Sbjct: 1000 TTMSLLMPTDRVLKVPKLTQ 1019
>POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 659
Score = 118 bits (296), Expect = 2e-25
Identities = 107/423 (25%), Positives = 182/423 (42%), Gaps = 12/423 (2%)
Query: 1282 RPDMALKIKEEVQKQIDAGFLITSNYPQWLANIVPVPKKDGKVRMCVDYRDLNKASPKDD 1341
R + +IKE ++ ++ T P +L ++ GK RM V+Y+ +NKA+ D
Sbjct: 241 REEFDRQIKELLELKVIKPSKSTHMSPAFLVEN-EAERRRGKKRMVVNYKAMNKATKGDA 299
Query: 1342 FPLPHIDVLVDSTAKSKVFSFMDGFSGYNQIKMAPEDREKTSFITPWGTFCYKVMPFGLI 1401
LP+ D L+ K++S D SG Q+ + E + T+F P G + + V+PFGL
Sbjct: 300 HNLPNKDELLTLVRGKKIYSSFDCKSGLWQVLLDKESQLLTAFTCPQGHYQWNVVPFGLK 359
Query: 1402 NAGATYQRGMTTLFHDMIHKEIEVYVDDMIVKSVT-EEDHVKYLQKMFQRLRKYKLRLNP 1460
A + + + + K VYVDD++V S T ++H ++ + +R K + L+
Sbjct: 360 QAPSIFPKTYANSHSNQYSKYCCVYVDDILVFSNTGRKEHYIHVLNILRRCEKLGIILSK 419
Query: 1461 NKCTFGVRSGKLLGFIVSQKGIEVDPDKVKAIREMP-APRTEKEVRGFLGRLNYISRFIS 1519
K LG + Q ++ I + P +K+++ FLG L Y S +I
Sbjct: 420 KKAQLFKEKINFLGLEIDQGTHCPQNHILEHIHKFPDRIEDKKQLQRFLGILTYASDYIP 479
Query: 1520 HMTATCGPIFKLLRKEQGIVWTEDCQKAFDSIKKYLLEPPILIPPVEGRPLIMYLTVLEN 1579
+ + P+ L+++ W + + IKK L P L P L++ E
Sbjct: 480 KLASIRKPLQSKLKEDSTWTWNDTDSQYMAKIKKNLKSFPKLYHPEPNDKLVIETDASEE 539
Query: 1580 SMGCVLGQQDETGRKEHAIYYLSKKFTECESRYSMLEKTCCALAWAAKRLRHYMINHTTW 1639
G +L + E+ Y S F E Y EK A+ K+ Y + + +
Sbjct: 540 FWGGILKAIHNS--HEYICRYASGSFKAAERNYHSNEKELLAVIRVIKKFSIY-LTPSRF 596
Query: 1640 LVSKMDP-----IKYIFEKPALTGRIARWQMLLSEYDIECRSQKAIKGSILADHLAHQPL 1694
L+ + + + GR+ RWQM LS+YD + K ++ AD L L
Sbjct: 597 LIRTDNKNFTHFVNINLKGDRKQGRLVRWQMWLSQYDFDVEHIAGTK-NVFADFLQENTL 655
Query: 1695 EDY 1697
+Y
Sbjct: 656 TNY 658
>POL_FOAMV (P14350) Pol polyprotein [Contains: Reverse
transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
(RT); Integrase (IN)]
Length = 886
Score = 117 bits (294), Expect = 3e-25
Identities = 101/406 (24%), Positives = 186/406 (44%), Gaps = 18/406 (4%)
Query: 1314 IVPVPKKDGKVRMCVDYRDLNKASPKDDFPLPHIDVLVDSTAKSKVFSFMDGFSGYNQIK 1373
+ PVPK DG+ RM +DYR++NK P H ++ + + K + +D +G+
Sbjct: 5 VYPVPKPDGRWRMVLDYREVNKTIPLTAAQNQHSAGILATIVRQKYKTTLDLANGFWAHP 64
Query: 1374 MAPEDREKTSFITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEIEVYVDDMIVK 1433
+ PE T+F +C+ +P G +N+ A + + L ++ ++VYVDD+ +
Sbjct: 65 ITPESYWLTAFTWQGKQYCWTRLPQGFLNSPALFTADVVDLLKEI--PNVQVYVDDIYLS 122
Query: 1434 SVTEEDHVKYLQKMFQRLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEV-DPDKVKAI 1492
++HV+ L+K+FQ L + ++ K G ++ + LGF ++++G + D K K +
Sbjct: 123 HDDPKEHVQQLEKVFQILLQAGYVVSLKKSEIGQKTVEFLGFNITKEGRGLTDTFKTKLL 182
Query: 1493 REMPAPRTEKEVRGFLGRLNYISRFISHMTATCGPIFKLLRKEQG--IVWTEDCQKAFDS 1550
P P+ K+++ LG LN+ FI + P++ L+ +G I W+E+ K +
Sbjct: 183 NITP-PKDLKQLQSILGLLNFARNFIPNFAELVQPLYNLIASAKGKYIEWSEENTKQLNM 241
Query: 1551 IKKYLLEPPILIPPVEGRPLIMYLTVLENSMGCVLGQQDETGRKEHAIYYLSKKFTECES 1610
+ + L L + + L++ + S G V +ETG+K I YL+ F++ E
Sbjct: 242 VIEALNTASNLEERLPEQRLVIKVNT-SPSAGYV-RYYNETGKK--PIMYLNYVFSKAEL 297
Query: 1611 RYSMLEKTCCALAWAAKRLRHYMINHTTWLVSKMDPIKYIFEKP-----ALTGRIARWQM 1665
++SMLEK + A + + + S + + I + P AL R W
Sbjct: 298 KFSMLEKLLTTMHKALIKAMDLAMGQEILVYSPIVSMTKIQKTPLPERKALPIRWITWMT 357
Query: 1666 LLSEYDIECRSQKAIKGSILADHLAHQPLEDYRPIKFDFPDEEIMY 1711
L + I+ K + H+ P+K E + Y
Sbjct: 358 YLEDPRIQFHYDKTLPE---LKHIPDVYTSSQSPVKHPSQYEGVFY 400
Score = 74.3 bits (181), Expect = 4e-12
Identities = 52/202 (25%), Positives = 85/202 (41%), Gaps = 5/202 (2%)
Query: 1979 YYWMTMESDCYKHTRKCHKCQIYADKIHMPPTTLNLLSSPWPFSMWGIDMIGRIEPKASN 2038
Y+W M D K +C +C I L PF + ID IG + P S
Sbjct: 634 YWWPNMRKDVVKQLGRCQQCLITNASNKASGPILRPDRPQKPFDKFFIDYIGPLPP--SQ 691
Query: 2039 GHRFILVAIDYFTKWVEAASYANVTKQVVVKFIKNHIICRYGIPNRIITDNGTNLNNKMM 2098
G+ ++LV +D T + + VK + +++ IP I +D G +
Sbjct: 692 GYLYVLVVVDGMTGFTWLYPTKAPSTSATVKSL--NVLTSIAIPKVIHSDQGAAFTSSTF 749
Query: 2099 KELCDDFKIEHHNSSPYRPQMNGAVEAANKNIKRIVQKMVV-TYKDWHEMLPFALHGYRT 2157
E + I S+PY PQ VE N +IKR++ K++V W+++LP
Sbjct: 750 AEWAKERGIHLEFSTPYHPQSGSKVERKNSDIKRLLTKLLVGRPTKWYDLLPVVQLALNN 809
Query: 2158 SVRTSTGATPFSLVYGMEAVLP 2179
+ TP L++G+++ P
Sbjct: 810 TYSPVLKYTPHQLLFGIDSNTP 831
>POL_GALV (P21414) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1165
Score = 115 bits (287), Expect = 2e-24
Identities = 102/395 (25%), Positives = 169/395 (41%), Gaps = 48/395 (12%)
Query: 1267 LKPECPPVKQKLRRTRPDMALK-----------IKEEVQKQIDAGFLITSNYPQWLANIV 1315
L + PPV +LR +A++ I+ +QK +D G L+ P W ++
Sbjct: 158 LANQVPPVVVELRSGASPVAVRQYPMSKEAREGIRPHIQKFLDLGVLVPCRSP-WNTPLL 216
Query: 1316 PVPKKD-GKVRMCVDYRDLNKASPKDDFPLPHIDVLVDSTAKSKV-FSFMDGFSGYNQIK 1373
PV K R D R++NK +P+ L+ S S +S +D + ++
Sbjct: 217 PVKKPGTNDYRPVQDLREINKRVQDIHPTVPNPYNLLSSLPPSYTWYSVLDLKDAFFCLR 276
Query: 1374 MAPEDREKTSFITPW--------GTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEIEV 1425
+ P + +F W G + +P G N+ TLF + +H+++
Sbjct: 277 LHPNSQPLFAF--EWKDPEKGNTGQLTWTRLPQGFKNS--------PTLFDEALHRDLAP 326
Query: 1426 ------------YVDDMIVKSVTEEDHVKYLQKMFQRLRKYKLRLNPNKCTFGVRSGKLL 1473
YVDD++V + T ED K QK+ Q L K R++ K R L
Sbjct: 327 FRALNPQVVLLQYVDDLLVAAPTYEDCKKGTQKLLQELSKLGYRVSAKKAQLCQREVTYL 386
Query: 1474 GFIVSQKGIEVDPDKVKAIREMPAPRTEKEVRGFLGRLNYISRFISHMTATCGPIFKLLR 1533
G+++ + + P + + ++P P T ++VR FLG + +I + P++ L +
Sbjct: 387 GYLLKEGKRWLTPARKATVMKIPVPTTPRQVREFLGTAGFCRLWIPGFASLAAPLYPLTK 446
Query: 1534 KEQGIVWTEDCQKAFDSIKKYLLEPPILIPPVEGRPLIMYLTVLENSMGCVLGQQDET-G 1592
+ +WTE+ Q+AFD IKK LL P L P +P +Y ++ G G +T G
Sbjct: 447 ESIPFIWTEEHQQAFDHIKKALLSAPALALPDLTKPFTLY---IDERAGVARGVLTQTLG 503
Query: 1593 RKEHAIYYLSKKFTECESRYSMLEKTCCALAWAAK 1627
+ YLSKK S + K A+A K
Sbjct: 504 PWRRPVAYLSKKLDPVASGWPTCLKAVAAVALLLK 538
Score = 79.0 bits (193), Expect = 1e-13
Identities = 53/156 (33%), Positives = 78/156 (49%), Gaps = 5/156 (3%)
Query: 2020 PFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYANVTKQVVVKFIKNHIICRY 2079
P W +D I+P G++++LV ID F+ WVEA T +V K I I+ R+
Sbjct: 876 PGVYWEVDFT-EIKP-GRYGNKYLLVFIDTFSGWVEAFPTKTETALIVCKKILEEILPRF 933
Query: 2080 GIPNRIITDNGTNLNNKMMKELCDDFKIEHHNSSPYRPQMNGAVEAANKNIKRIVQKMVV 2139
GIP + +DNG ++ + L I YRPQ +G VE N+ IK + K+ +
Sbjct: 934 GIPKVLGSDNGPAFVAQVSQGLATQLGINWKLHCAYRPQSSGQVERMNRTIKETLTKLAL 993
Query: 2140 TY--KDWHEMLPFALHGYRTSVRTSTGATPFSLVYG 2173
KDW +LP AL R + G TP+ ++YG
Sbjct: 994 ETGGKDWVTLLPLALLRAR-NTPGRFGLTPYEILYG 1028
>POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;
Protease (EC 3.4.23.-); Reverse transcriptase (EC
2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
Length = 1886
Score = 108 bits (270), Expect = 2e-22
Identities = 90/339 (26%), Positives = 158/339 (46%), Gaps = 22/339 (6%)
Query: 1256 LDTDIVVHHLPLKPECPPVKQKLRRTRPDMALKIK---EEVQKQIDAGFLITSNYPQWLA 1312
++ DI + P+K P ++ + R + ++ L++K K F++ S
Sbjct: 1396 INPDIKIMGRPIKHVTPGDEEAMTR-QINLLLQMKVIRPSESKHRSTAFIVRSG-----T 1449
Query: 1313 NIVPVPKKD--GKVRMCVDYRDLNKASPKDDFPLPHIDVLVDSTAKSKVFSFMDGFSGYN 1370
I P+ K+ GK RM +Y+ LN+ + D + LP I+ ++ +SK++S D SG+
Sbjct: 1450 EIDPITGKEKKGKERMVFNYKLLNENTESDQYSLPGINTIISKVGRSKIYSKFDLKSGFW 1509
Query: 1371 QIKMAPEDREKTSFITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEIEVYVDDM 1430
Q+ M E T+F+ + + VMPFGL NA A +QR M +F K I VY+DD+
Sbjct: 1510 QVAMEEESVPWTAFLAGNKLYEWLVMPFGLKNAPAIFQRKMDNVFKG-TEKFIAVYIDDI 1568
Query: 1431 IVKSVTEEDHVKYLQKMFQRLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVK 1490
+V S T E H ++L M Q ++ L L+P K G LG + I++ P +
Sbjct: 1569 LVFSETAEQHSQHLYTMLQLCKENGLILSPTKMKIGTPEIDFLGASLGCTKIKLQPHIIS 1628
Query: 1491 AIREMPAPR--TEKEVRGFLGRLNYISRFISHMTATCGPIFKLLRKEQGIVWTEDCQKAF 1548
I + + T + +R +LG L+Y +I + P+ + + + K
Sbjct: 1629 KICDFSDEKLATPEGMRSWLGILSYARNYIQDIGKLVQPLRQKMAPTGDKRMNPETWKMV 1688
Query: 1549 DSIKKYLLE-PPILIPPVEGRPLIMYLTVLENSMGCVLG 1586
IK+ + P + +PP + ++ + GC+ G
Sbjct: 1689 RQIKEKVKNLPDLQLPPKDS-------FIIIETDGCMTG 1720
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.319 0.136 0.407
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 286,273,292
Number of Sequences: 164201
Number of extensions: 13251735
Number of successful extensions: 67125
Number of sequences better than 10.0: 721
Number of HSP's better than 10.0 without gapping: 412
Number of HSP's successfully gapped in prelim test: 338
Number of HSP's that attempted gapping in prelim test: 46427
Number of HSP's gapped (non-prelim): 6263
length of query: 2306
length of database: 59,974,054
effective HSP length: 126
effective length of query: 2180
effective length of database: 39,284,728
effective search space: 85640707040
effective search space used: 85640707040
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 74 (33.1 bits)
Medicago: description of AC148289.13