
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0306.1
(1224 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III 210 2e-53
POL3_DROME (P04323) Retrovirus-related Pol polyprotein from tran... 209 5e-53
POL2_DROME (P20825) Retrovirus-related Pol polyprotein from tran... 197 1e-49
POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from tran... 192 6e-48
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran... 180 2e-44
POL_SFV1 (P23074) Pol polyprotein [Contains: Protease (EC 3.4.23... 174 1e-42
POL_SFV3L (P27401) Pol polyprotein [Contains: Protease (EC 3.4.2... 172 7e-42
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei... 169 4e-41
RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protei... 166 5e-40
RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protei... 166 5e-40
POLY_DROME (P10401) Retrovirus-related Pol polyprotein from tran... 155 8e-37
POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic pro... 121 1e-26
POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic pro... 121 1e-26
POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic pro... 121 1e-26
POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic pro... 121 1e-26
YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II 120 2e-26
POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic pro... 119 7e-26
POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic prot... 110 2e-23
POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic prot... 108 7e-23
POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;... 99 7e-20
>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
Length = 2186
Score = 210 bits (534), Expect = 2e-53
Identities = 148/495 (29%), Positives = 243/495 (48%), Gaps = 14/495 (2%)
Query: 144 DIDQEGVCLDPRE-GFLEHKMTPEEETKTVKVGERNLKVGVNLTAIQ--EARLTQLLAEN 200
DI E V DP +E E KT + ER + +L + ++ ++ +
Sbjct: 854 DITVEEVLNDPTLFSEIETDTNSCEVVKTAETYERFTTICEHLKRENGDDRKIWDVIEQF 913
Query: 201 MDLFAWSAQDLPGIDPNFICHKLALNPGVKPIAQMKRKMGEEKAQAVKAETNKLIDAGFI 260
D+FA S +L G + C + L G +PI Q R + ++ K+++ I
Sbjct: 914 QDVFAISDDEL-GRNSGTEC-VIELKEGAEPIRQKPRPIPLALKPEIRKMIQKMLNQKVI 971
Query: 261 REVKYPTWLANVVMVKKSNGKWRMCTDYTDLNKHCPKDSYPLPNIDKLVDRASGFGMLSL 320
RE K P W + VV+VKK +G RMC DY +NK +++PLPNI+ + +G + ++
Sbjct: 972 RESKSP-WSSPVVLVKKKDGSIRMCIDYRKVNKVVKNNAHPLPNIEATLQSLAGKKLYTV 1030
Query: 321 MDAYSGYHQIRMYAPDEEKTAFMTNQANYCYQTMPFGLKNAGATYQRLMDRVFEGQVGRN 380
D +G+ QI + +E TAF + + +PFGL + A +Q M+ + +G
Sbjct: 1031 FDMIAGFWQIPLDEKSKEITAFAIGSELFEWNVLPFGLVISPALFQGTMEEIIGDLLGVC 1090
Query: 381 MEIYVDDMVVKSEEMGGHCLDLAEAFGEIRKHNMRLNPEKCSFGIQSGKFLGFMITRRGI 440
+YVDD+++ S++M H D+ EA IRK M+L KC + ++LG +T G+
Sbjct: 1091 AFVYVDDLLIASKDMEQHLQDVKEALTRIRKSGMKLRASKCHIAKKEVEYLGHKVTLDGV 1150
Query: 441 EVNPDKCKAILEMQSPTSVKEVQKLIGRIAALSRFLPCSGSKATPFFQCLRKNRVFQWTD 500
E K + + PT+VKE+Q +G + +F+ A+ + + W
Sbjct: 1151 ETQEVKTDKMKQFSRPTNVKELQSFLGLVGYYRKFILNFAQIASSLTSLISAKVAWIWEK 1210
Query: 501 ECEQAFQSLKELLSKPPILSRP-----IPG-TPLSVFISISDNAVSSVLLQECKD-ELRI 553
E E AFQ LK+L+ + P+L++P + G P ++ S + +VL QE D +
Sbjct: 1211 EQEIAFQELKKLVCQTPVLAQPDVEAALKGDRPFMIYTDASRKGIGAVLAQEGPDGQQHP 1270
Query: 554 IYFVSHALQGAELRYQKIEKAALALIISARKLRPYFQGFQIKVKTDF-PLRQVLQKPDLA 612
I F S AL AE RY + ALA++ + R+ + G I V TD PL +L+ LA
Sbjct: 1271 IAFASKALSPAETRYHITDLEALAMMFALRRFKTIIYGTAITVFTDHKPLISLLKGSPLA 1330
Query: 613 GRMVSWAVELSEFGI 627
R+ W++E+ EF +
Sbjct: 1331 DRLWRWSIEILEFDV 1345
Score = 112 bits (281), Expect = 5e-24
Identities = 137/597 (22%), Positives = 244/597 (39%), Gaps = 48/597 (8%)
Query: 628 VFEKKGQVKAQVLADFVNEM----SPEVKVSEEAEWILSVDGSSYLKGSGAGVVLEGPGG 683
++EK+ ++ Q L V + P+V+ + + + + + KG GA + EGP G
Sbjct: 1207 IWEKEQEIAFQELKKLVCQTPVLAQPDVEAALKGDRPFMIYTDASRKGIGAVLAQEGPDG 1266
Query: 684 VIIEQSLKFDFKASN------NQAEYEAIIAGINLAIEMNV---HCLVIKTDSQLVANQI 734
+ + F KA + + + EA+ L + + + TD + + + +
Sbjct: 1267 Q--QHPIAFASKALSPAETRYHITDLEALAMMFALRRFKTIIYGTAITVFTDHKPLISLL 1324
Query: 735 KGDYQAKDIQLAKYLTKTQELMKRMDSVQVNHVPREENTRADVLCKLASTKKPGNNKSVI 794
KG LA L + + D V++ ++ + N AD L + P N
Sbjct: 1325 KGS------PLADRLWRWSIEILEFD-VKIVYLAGKANAVADALSRGGC---PPNELEEE 1374
Query: 795 QETLKSPSINEDDVVMVTGAAPSDWMDRIKMCLEADGADLALFSKDQVR----------- 843
Q + +N + S W++R+K E +A + +
Sbjct: 1375 QTKELTSIVNAIQTELPDILDSSCWLERLKGEDEGWKEVIAALEGGKTKGTFKIVGIESE 1434
Query: 844 -EASHYVLLGDQLYRRGVGVPLLRCVTRDEADRIMFEVHEGVCASHVGGRSLAAKVLRAG 902
+Y ++G L + V ++ E+HEG+ A H G + + V R
Sbjct: 1435 ISLEYYKIVGGVLKNTEIEEQSRSVVPEKIRTPLLKELHEGMLAGHFGIKKMWRMVHRK- 1493
Query: 903 FYWPTLKNDCMGYAKKCEKCQIYADLHRAPPEVLSSMSSAWPFAMWGVDILGPFTPAGAQ 962
FYWP ++ + C KC + A+ H L+ +P + D++
Sbjct: 1494 FYWPQMRVCVENCVRTCAKC-LCANDHSKLTSSLTPYRMTFPLEIVACDLMDVGLSVQGN 1552
Query: 963 IRFVLVAVDYFTKWIEAESMAKITAEKV-KKFYWRKIICRFGVPATLVSDNGTQFTSRIV 1021
R++L +D FTK+ A + AE V K F R I +P L++D G +F + +
Sbjct: 1553 -RYILTIIDLFTKYGTAVPIPDKKAETVLKAFVERWAIGEGRIPLKLLTDQGKEFVNGLF 1611
Query: 1022 RDFCNEMGIEMRFASVEHPQSNGQVEAANKVILNGIKKRLGDAKGLWADELLTVVWAYNT 1081
F + + IE + ++NG VE NK I++ +KK+ W D+++ V+AYN
Sbjct: 1612 AQFTHMLKIEHITTKGYNSRANGAVERFNKTIMHIMKKKTAVPME-WDDQVVYAVYAYNN 1670
Query: 1082 TPQSTTGETPFRLTYGVDAMVPVEIQDMTFRVAAYDENENHENRLID--LNLAEEVKTEV 1139
TGETP L +G D M P+E+ Y + + +++ L L + + K
Sbjct: 1671 CVHENTGETPMFLMHGRDVMGPLEMSGEDAVGINYADMDEYKHLLTQELLKVQKIAKEHA 1730
Query: 1140 RLRQAAVKQRSERRYNTRVVPRHMQVGDLVLRR---KAKGPDDSKLSPNWEGPYRIL 1193
Q + K +++Y ++ R Q G VL + G KL W GPYR++
Sbjct: 1731 MREQESYKSLFDQKYASK-KHRFPQPGSRVLLEIPSEKLGAQCPKLVNKWSGPYRVI 1786
>POL3_DROME (P04323) Retrovirus-related Pol polyprotein from
transposon 17.6 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1058
Score = 209 bits (531), Expect = 5e-53
Identities = 132/432 (30%), Positives = 217/432 (49%), Gaps = 12/432 (2%)
Query: 245 QAVKAETNKLIDAGFIRE----VKYPTWLANVVMVKKSNGKWRMCTDYTDLNKHCPKDSY 300
Q V+++ +++ G IR P W+ K+R+ DY LN+ D +
Sbjct: 221 QEVESQIQDMLNQGIIRTSNSPYNSPIWVVPKKQDASGKQKFRIVIDYRKLNEITVGDRH 280
Query: 301 PLPNIDKLVDRASGFGMLSLMDAYSGYHQIRMYAPDEEKTAFMTNQANYCYQTMPFGLKN 360
P+PN+D+++ + + +D G+HQI M KTAF T +Y Y MPFGLKN
Sbjct: 281 PIPNMDEILGKLGRCNYFTTIDLAKGFHQIEMDPESVSKTAFSTKHGHYEYLRMPFGLKN 340
Query: 361 AGATYQRLMDRVFEGQVGRNMEIYVDDMVVKSEEMGGHCLDLAEAFGEIRKHNMRLNPEK 420
A AT+QR M+ + + ++ +Y+DD++V S + H L F ++ K N++L +K
Sbjct: 341 APATFQRCMNDILRPLLNKHCLVYLDDIIVFSTSLDEHLQSLGLVFEKLAKANLKLQLDK 400
Query: 421 CSFGIQSGKFLGFMITRRGIEVNPDKCKAILEMQSPTSVKEVQKLIGRIAALSRFLPCSG 480
C F Q FLG ++T GI+ NP+K +AI + PT KE++ +G +F+P
Sbjct: 401 CEFLKQETTFLGHVLTPDGIKPNPEKIEAIQKYPIPTKPKEIKAFLGLTGYYRKFIPNFA 460
Query: 481 SKATPFFQCLRKNRVFQWTD-ECEQAFQSLKELLSKPPILSRPIPGTPLSVFISISDNAV 539
A P +CL+KN T+ E + AF+ LK L+S+ PIL P ++ SD A+
Sbjct: 461 DIAKPMTKCLKKNMKIDTTNPEYDSAFKKLKYLISEDPILKVPDFTKKFTLTTDASDVAL 520
Query: 540 SSVLLQECKDELRIIYFVSHALQGAELRYQKIEKAALALIISARKLRPYFQGFQIKVKTD 599
+VL Q+ + ++S L E+ Y IEK LA++ + + R Y G ++ +D
Sbjct: 521 GAVLSQDGHP----LSYISRTLNEHEINYSTIEKELLAIVWATKTFRHYLLGRHFEISSD 576
Query: 600 F-PLRQVLQKPDLAGRMVSWAVELSEFGIVFEKKGQVKAQVLADFVNEMS-PEVKVSEEA 657
PL + + D ++ W V+LSEF K + K +AD ++ + E +SE+
Sbjct: 577 HQPLSWLYRMKDPNSKLTRWRVKLSEFDFDI-KYIKGKENCVADALSRIKLEETYLSEQT 635
Query: 658 EWILSVDGSSYL 669
+ D S +
Sbjct: 636 QHSAEEDNSDLI 647
>POL2_DROME (P20825) Retrovirus-related Pol polyprotein from
transposon 297 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1059
Score = 197 bits (501), Expect = 1e-49
Identities = 130/434 (29%), Positives = 210/434 (47%), Gaps = 13/434 (2%)
Query: 231 PIAQMKRKMGEEKAQAVKAETNKLIDAGFIRE----VKYPTWLANVVMVKKSNGKWRMCT 286
PI + + + V+ + ++++ G IRE PTW+ K+R+
Sbjct: 206 PIYSKQYPLAQTHEIEVENQVQEMLNQGLIRESNSPYNSPTWVVPKKPDASGANKYRVVI 265
Query: 287 DYTDLNKHCPKDSYPLPNIDKLVDRASGFGMLSLMDAYSGYHQIRMYAPDEEKTAFMTNQ 346
DY LN+ D YP+PN+D+++ + + +D G+HQI M KTAF T
Sbjct: 266 DYRKLNEITIPDRYPIPNMDEILGKLGKCQYFTTIDLAKGFHQIEMDEESISKTAFSTKS 325
Query: 347 ANYCYQTMPFGLKNAGATYQRLMDRVFEGQVGRNMEIYVDDMVVKSEEMGGHCLDLAEAF 406
+Y Y MPFGL+NA AT+QR M+ + + ++ +Y+DD+++ S + H + F
Sbjct: 326 GHYEYLRMPFGLRNAPATFQRCMNNILRPLLNKHCLVYLDDIIIFSTSLTEHLNSIQLVF 385
Query: 407 GEIRKHNMRLNPEKCSFGIQSGKFLGFMITRRGIEVNPDKCKAILEMQSPTSVKEVQKLI 466
++ N++L +KC F + FLG ++T GI+ NP K KAI+ PT KE++ +
Sbjct: 386 TKLADANLKLQLDKCEFLKKEANFLGHIVTPDGIKPNPIKVKAIVSYPIPTKDKEIRAFL 445
Query: 467 GRIAALSRFLPCSGSKATPFFQCLRK-NRVFQWTDECEQAFQSLKELLSKPPILSRPIPG 525
G +F+P A P CL+K ++ E +AF+ LK L+ + PIL P
Sbjct: 446 GLTGYYRKFIPNYADIAKPMTSCLKKRTKIDTQKLEYIEAFEKLKALIIRDPILQLPDFE 505
Query: 526 TPLSVFISISDNAVSSVLLQECKDELRIIYFVSHALQGAELRYQKIEKAALALIISARKL 585
+ S+ A+ +VL Q I F+S L EL Y IEK LA++ + +
Sbjct: 506 KKFVLTTDASNLALGAVLSQNGHP----ISFISRTLNDHELNYSAIEKELLAIVWATKTF 561
Query: 586 RPYFQGFQIKVKTDF-PLRQVLQKPDLAGRMVSWAVELSEFGIVFEK-KGQVKAQVLADF 643
R Y G Q + +D PLR + + ++ W V LSE+ + KG K +AD
Sbjct: 562 RHYLLGRQFLIASDHQPLRWLHNLKEPGAKLERWRVRLSEYQFKIDYIKG--KENSVADA 619
Query: 644 VNEMSPEVKVSEEA 657
++ + E EA
Sbjct: 620 LSRIKIEENHHSEA 633
>POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from
transposon opus [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1003
Score = 192 bits (487), Expect = 6e-48
Identities = 135/425 (31%), Positives = 212/425 (49%), Gaps = 24/425 (5%)
Query: 247 VKAETNKLIDAGFIRE----VKYPTWLANVVMVKKSNGK--WRMCTDYTDLNKHCPKDSY 300
V+ + ++L+ G IR P W+ V K NG+ +RM D+ LN D+Y
Sbjct: 139 VERQIDELLQDGIIRPSNSPYNSPIWI--VPKKPKPNGEKQYRMVVDFKRLNTVTIPDTY 196
Query: 301 PLPNIDKLVDRASGFGMLSLMDAYSGYHQIRMYAPDEEKTAFMTNQANYCYQTMPFGLKN 360
P+P+I+ + + +D SG+HQI M D KTAF T Y + +PFGLKN
Sbjct: 197 PIPDINATLASLGNAKYFTTLDLTSGFHQIHMKESDIPKTAFSTLNGKYEFLRLPFGLKN 256
Query: 361 AGATYQRLMDRVFEGQVGRNMEIYVDDMVVKSEEMGGHCLDLAEAFGEIRKHNMRLNPEK 420
A A +QR++D + +G+ +Y+DD++V SE+ H +L + K N+++N EK
Sbjct: 257 APAIFQRMIDDILREHIGKVCYVYIDDIIVFSEDYDTHWKNLRLVLASLSKANLQVNLEK 316
Query: 421 CSFGIQSGKFLGFMITRRGIEVNPDKCKAILEMQSPTSVKEVQKLIGRIAALSRFLPCSG 480
F +FLG+++T GI+ +P K +AI EM PTSVKE+++ +G + +F+
Sbjct: 317 SHFLDTQVEFLGYIVTADGIKADPKKVRAISEMPPPTSVKELKRFLGMTSYYRKFIQDYA 376
Query: 481 SKATPFFQCLR----------KNRVFQWTDECE-QAFQSLKELLSKPPILSRPIPGTPLS 529
A P R ++V DE Q+F LK +L IL+ P P
Sbjct: 377 KVAKPLTNLTRGLYANIKSSQSSKVPITLDETALQSFNDLKSILCSSEILAFPCFTKPFH 436
Query: 530 VFISISDNAVSSVLLQECKDELRIIYFVSHALQGAELRYQKIEKAALALIISARKLRPYF 589
+ S+ A+ +VL Q+ + R I ++S +L E Y IEK LA+I S LR Y
Sbjct: 437 LTTDASNWAIGAVLSQDDQGRDRPIAYISRSLNKTEENYATIEKEMLAIIWSLDNLRAYL 496
Query: 590 QGF-QIKVKTDF-PLRQVLQKPDLAGRMVSWAVELSEFGI-VFEKKGQVKAQVLADFVNE 646
G IKV TD PL L + ++ W + E+ + K G K+ V+AD ++
Sbjct: 497 YGAGTIKVYTDHQPLTFALGNRNFNAKLKRWKARIEEYNCELIYKPG--KSNVVADALSR 554
Query: 647 MSPEV 651
+ P++
Sbjct: 555 IPPQL 559
Score = 47.0 bits (110), Expect = 3e-04
Identities = 44/212 (20%), Positives = 88/212 (40%), Gaps = 20/212 (9%)
Query: 887 SHVGGRSLAAKVLRAGFYWPTLKNDCMGYAKKCEKCQIYA-DLHRAPPEVLSSMSSAWPF 945
+H G + ++L +Y+P + + C+ C++Y + H P + + +P
Sbjct: 704 AHRGPTEIRLQLLEK-YYFPRMSSTIRLQTSSCQCCKLYKYERHPNKPNLQPTPIPNYPC 762
Query: 946 AMWGVDILGPFTPAGAQIRFVLVAVDYFTKW-----IEAESMAKITAEKVKKFYWRKIIC 1000
+ +DI + R L +D F+K+ +++++ + V+ ++
Sbjct: 763 EILHIDIFA------LEKRLYLSCIDKFSKFAKLFHLQSKASVHLRETLVEALHY----- 811
Query: 1001 RFGVPATLVSDNGTQFTSRIVRDFCNEMGIEMRFASVEHPQSNGQVEAANKVILNGIKKR 1060
F P LVSDN V ++ + I++ +A + + NGQVE + L +
Sbjct: 812 -FTAPKVLVSDNERGLLCPTVLNYLRSLDIDLYYAPTQKSEVNGQVERFHSTFLEIYRCL 870
Query: 1061 LGDAKGLWADELLTV-VWAYNTTPQSTTGETP 1091
+ EL+ + V YNT+ S T P
Sbjct: 871 KDELPTFKPVELVHIAVDRYNTSVHSVTNRKP 902
>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
transposon 412 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1237
Score = 180 bits (457), Expect = 2e-44
Identities = 122/454 (26%), Positives = 205/454 (44%), Gaps = 11/454 (2%)
Query: 184 NLTAIQEARLTQLLAENMDLFAWSAQDLPGIDPNFICHKLALNPGVKPIAQMKRKMGEEK 243
N + +++L + +E +D+FA ++ P N +L L +P+ + +
Sbjct: 270 NFPELFKSQLENICSEYIDIFALESE--PITVNNLYKQQLRLKDD-EPVYTKNYRSPHSQ 326
Query: 244 AQAVKAETNKLIDAGFIREVKYPTWLANVVMVKKSNG------KWRMCTDYTDLNKHCPK 297
+ ++A+ KLI + E + + +++V K + KWR+ DY +NK
Sbjct: 327 VEEIQAQVQKLIKDKIV-EPSVSQYNSPLLLVPKKSSPNSDKKKWRLVIDYRQINKKLLA 385
Query: 298 DSYPLPNIDKLVDRASGFGMLSLMDAYSGYHQIRMYAPDEEKTAFMTNQANYCYQTMPFG 357
D +PLP ID ++D+ S +D SG+HQI + + T+F T+ +Y + +PFG
Sbjct: 386 DKFPLPRIDDILDQLGRAKYFSCLDLMSGFHQIELDEGSRDITSFSTSNGSYRFTRLPFG 445
Query: 358 LKNAGATYQRLMDRVFEGQVGRNMEIYVDDMVVKSEEMGGHCLDLAEAFGEIRKHNMRLN 417
LK A ++QR+M F G +Y+DD++V +L E FG+ R++N++L+
Sbjct: 446 LKIAPNSFQRMMTIAFSGIEPSQAFLYMDDLIVIGCSEKHMLKNLTEVFGKCREYNLKLH 505
Query: 418 PEKCSFGIQSGKFLGFMITRRGIEVNPDKCKAILEMQSPTSVKEVQKLIGRIAALSRFLP 477
PEKCSF + FLG T +GI + K I P ++ + RF+
Sbjct: 506 PEKCSFFMHEVTFLGHKCTDKGILPDDKKYDVIQNYPVPHDADSARRFVAFCNYYRRFIK 565
Query: 478 CSGSKATPFFQCLRKNRVFQWTDECEQAFQSLKELLSKPPILSRPIPGTPLSVFISISDN 537
+ + +KN F+WTDEC++AF LK L P +L P + S
Sbjct: 566 NFADYSRHITRLCKKNVPFEWTDECQKAFIHLKSQLINPTLLQYPDFSKEFCITTDASKQ 625
Query: 538 AVSSVLLQECKDELRIIYFVSHALQGAELRYQKIEKAALALIISARKLRPYFQGFQIKVK 597
A +VL Q + + S A E E+ A+ + RPY G VK
Sbjct: 626 ACGAVLTQNHNGHQLPVAYASRAFTKGESNKSTTEQELAAIHWAIIHFRPYIYGKHFTVK 685
Query: 598 TDF-PLRQVLQKPDLAGRMVSWAVELSEFGIVFE 630
TD PL + + + ++ +EL E+ E
Sbjct: 686 TDHRPLTYLFSMVNPSSKLTRIRLELEEYNFTVE 719
Score = 106 bits (265), Expect = 3e-22
Identities = 78/331 (23%), Positives = 144/331 (42%), Gaps = 8/331 (2%)
Query: 863 PLLRCVTRDEADRIMFEVHEG-VCASHVGGRSLAAKVLRAGFYWPTLKNDCMGYAKKCEK 921
P+ + E + I+ +H+ + H G AKV R +YW + Y +KC+K
Sbjct: 883 PVTQINNEKEKEAILSTLHDDPIQGGHTGITKTLAKVKRH-YYWKNMSKYIKEYVRKCQK 941
Query: 922 CQIYADLHRAPPEVLSSMSSAWPFAMWGVDILGPFTPAGAQIRFVLVAVDYFTKWIEAES 981
CQ + + + F VD +GP + + + + TK++ A
Sbjct: 942 CQKAKTTKHTKTPMTITETPEHAFDRVVVDTIGPLPKSENGNEYAVTLICDLTKYLVAIP 1001
Query: 982 MAKITAEKVKKFYWRKIICRFGVPATLVSDNGTQFTSRIVRDFCNEMGIEMRFASVEHPQ 1041
+A +A+ V K + I ++G T ++D GT++ + I+ D C + I+ ++ H Q
Sbjct: 1002 IANKSAKTVAKAIFESFILKYGPMKTFITDMGTEYKNSIITDLCKYLKIKNITSTAHHHQ 1061
Query: 1042 SNGQVEAANKVILNGIKKRLGDAKGLWADELLTVVWAYNTTPQSTTGETPFRLTYGVDAM 1101
+ G VE +++ + I+ + K W L V+ +NTT P+ L +G +
Sbjct: 1062 TVGVVERSHRTLNEYIRSYISTDKTDWDVWLQYFVYCFNTTQSMVHNYCPYELVFGRTSN 1121
Query: 1102 VPVEIQDMTFRVAAYDENENHENRLIDLNLAEEVKTEVRLRQAAVKQRSERRYNTRVVPR 1161
+P + Y+ ++ + L +A R A K++++ Y+ +V
Sbjct: 1122 LPKHFNKLHSIEPIYNIDDYAKESKYRLEVA---YARARKLLEAHKEKNKENYDLKVKDI 1178
Query: 1162 HMQVGDLVLRRKAKGPDDSKLSPNWEGPYRI 1192
++VGD VL R G KL + GPY+I
Sbjct: 1179 ELEVGDKVLLRNEVG---HKLDFKYTGPYKI 1206
>POL_SFV1 (P23074) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1161
Score = 174 bits (441), Expect = 1e-42
Identities = 215/981 (21%), Positives = 390/981 (38%), Gaps = 112/981 (11%)
Query: 272 VVMVKKSNGKWRMCTDYTDLNKHCPKDSYPLPNIDKLVDRASGFGMLSLMDAYSGYHQIR 331
V V K +GKWRM DY ++NK P + + ++ + +D +G+
Sbjct: 214 VYPVPKPDGKWRMVLDYREVNKTIPLIAAQNQHSAGILSSIYRGKYKTTLDLTNGFWAHP 273
Query: 332 MYAPDEEKTAFMTNQANYCYQTMPFGLKNAGATYQR-LMDRVFEGQVGRNMEIYVDDMVV 390
+ TAF YC+ +P G N+ A + ++D + E N++ YVDD+ +
Sbjct: 274 ITPESYWLTAFTWQGKQYCWTRLPQGFLNSPALFTADVVDLLKEIP---NVQAYVDDIYI 330
Query: 391 KSEEMGGHCLDLAEAFGEIRKHNMRLNPEKCSFGIQSGKFLGFMITRRGIEVNPDKCKAI 450
++ H L + F + ++ +K + +FLGF IT+ G + + +
Sbjct: 331 SHDDPQEHLEQLEKIFSILLNAGYVVSLKKSEIAQREVEFLGFNITKEGRGLTDTFKQKL 390
Query: 451 LEMQSPTSVKEVQKLIGRIAALSRFLPCSGSKATPFFQCLRK--NRVFQWTDECEQAFQS 508
L + P +K++Q ++G + F+P P + + + WT++ Q
Sbjct: 391 LNITPPKDLKQLQSILGLLNFARNFIPNYSELVKPLYTIVANANGKFISWTEDNSNQLQH 450
Query: 509 LKELLSKPPILSRPIPGTPLSVFISISDNAVSSVLLQECKDELRIIYFVSHALQGAELRY 568
+ +L++ L P T L + ++ S +A + + R I +V++ AE ++
Sbjct: 451 IISVLNQADNLEERNPETRLIIKVNSSPSA--GYIRYYNEGSKRPIMYVNYIFSKAEAKF 508
Query: 569 QKIEKAALALIISARKLRPYFQGFQIKVKTDFPLRQVLQKPDLAGRM------VSWAVEL 622
+ EK + K G +I V + +Q+ L R ++W L
Sbjct: 509 TQTEKLLTTMHKGLIKAMDLAMGQEILVYSPIVSMTKIQRTPLPERKALPVRWITWMTYL 568
Query: 623 SEFGIVFE-KKGQVKAQVLADFVNEMSPEVKVSEEAEWILSVDGS---------SYLKGS 672
+ I F K + Q + + ++ + K E + DGS S+ G
Sbjct: 569 EDPRIQFHYDKSLPELQQIPNVTEDVIAKTKHPSEFAMVFYTDGSAIKHPDVNKSHSAGM 628
Query: 673 GAGVVLEGPGGVIIEQSLKFDFKASNNQAEYEAIIAGINLAIEMNVHCLVIKTDSQLVAN 732
G V P I+ Q + AE A+ A++++ L++ TDS VA
Sbjct: 629 GIAQVQFIPEYKIVHQWSIPLGDHTAQLAEIAAVEFACKKALKISGPVLIV-TDSFYVAE 687
Query: 733 QIKGDY-----------QAKDIQLAKYLTKTQELMKRMDSVQVNH----------VPREE 771
+ + K ++ E ++ + + H + E
Sbjct: 688 SANKELPYWKSNGFLNNKKKPLRHVSKWKSIAECLQLKPDIIIMHEKGHQQPMTTLHTEG 747
Query: 772 NTRADVLCKLASTKKPGNNKSVIQETLKSPSINEDDVVMVTGAAPSDWMDRIKMCLEADG 831
N AD KLA+ S + +PS++ + ++ G P + + K LE +
Sbjct: 748 NNLAD---KLAT------QGSYVVHCNTTPSLDAELDQLLQGHYPPGYPKQYKYTLEENK 798
Query: 832 ADLALFSKDQVREASHYVLLGDQLYRRGVGVPLLRCVTRDEADRIMFEVHEGVCASHVGG 891
+ R G+ ++ + + ++I+ H +H G
Sbjct: 799 L----------------------IVERPNGIRIVP--PKADREKIISTAHN---IAHTGR 831
Query: 892 RSLAAKVLRAGFYWPTLKNDCMGYAKKCEKCQIYADLHRAPPEVLSSMSSAWPFAMWGVD 951
+ KV + ++WP L+ D + ++C++C + + P +L + PF + +D
Sbjct: 832 DATFLKV-SSKYWWPNLRKDVVKSIRQCKQCLVTNATNLTSPPILRPVKPLKPFDKFYID 890
Query: 952 ILGPFTPAGAQIRFVLVAVDYFTKWIEA-ESMAKITAEKVKKFYWRKIICRFGVPATLVS 1010
+GP P+ + VLV VD T ++ + A T+ VK ++ +P L S
Sbjct: 891 YIGPLPPSNGYLH-VLVVVDSMTGFVWLYPTKAPSTSATVKAL---NMLTSIAIPKVLHS 946
Query: 1011 DNGTQFTSRIVRDFCNEMGIEMRFASVEHPQSNGQVEAANKVILNGIKKRLGDAKGLWAD 1070
D G FTS D+ E GI++ F++ HPQS+G+VE N I + K L W D
Sbjct: 947 DQGAAFTSSTFADWAKEKGIQLEFSTPYHPQSSGKVERKNSDIKRLLTKLLIGRPAKWYD 1006
Query: 1071 ELLTVVWAYNTTPQSTTGETPFRLTYGVDAMVPVEIQDMTFRVAAYDENENHENRLIDLN 1130
L V A N + ++ TP +L +GVD+ P D T ++ +E L+
Sbjct: 1007 LLPVVQLALNNSYSPSSKYTPHQLLFGVDSNTPFANSD-TLDLSREEE----------LS 1055
Query: 1131 LAEEVKTEVRLRQAAVKQRSERRYNTRVVPRHMQVGDLVLRRKAKGPDDSKLSPNWEGPY 1190
L +E+++ L Q S R ++ VG LV R A+ + L P W P
Sbjct: 1056 LLQEIRSS--LHQPTSPPASSRSWSP-------SVGQLVQERVAR---PASLRPRWHKPT 1103
Query: 1191 RILRDLG-QGAYHLEELSGRR 1210
IL + + L+ L RR
Sbjct: 1104 AILEVVNPRTVIILDHLGNRR 1124
>POL_SFV3L (P27401) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1157
Score = 172 bits (435), Expect = 7e-42
Identities = 218/1012 (21%), Positives = 403/1012 (39%), Gaps = 89/1012 (8%)
Query: 229 VKPIAQMKRKMGEEKAQAVKAETNKLIDAGFIREVKYPTWLANVVMVKKSNGKWRMCTDY 288
V P Q + + + +++ N L+ G + + + V V K +GKWRM DY
Sbjct: 174 VNPRPQKQYPINPKAKASIQTVINDLLKQGVLIQ-QNSIMNTPVYPVPKPDGKWRMVLDY 232
Query: 289 TDLNKHCPKDSYPLPNIDKLVDRASGFGMLSLMDAYSGYHQIRMYAPDEEKTAFMTNQAN 348
++NK P + + ++ + +D +G+ + TAF
Sbjct: 233 REVNKTIPLIAAQNQHSAGILSSIFRGKYKTTLDLSNGFWAHSITPESYWLTAFTWLGQQ 292
Query: 349 YCYQTMPFGLKNAGATYQR-LMDRVFEGQVGRNMEIYVDDMVVKSEEMGGHCLDLAEAFG 407
YC+ +P G N+ A + ++D + E N+++YVDD+ + ++ H L + F
Sbjct: 293 YCWTRLPQGFLNSPALFTADVVDLLKEVP---NVQVYVDDIYISHDDPREHLEQLEKVFS 349
Query: 408 EIRKHNMRLNPEKCSFGIQSGKFLGFMITRRGIEVNPDKCKAILEMQSPTSVKEVQKLIG 467
+ ++ +K +FLGF IT+ G + + +L + P +K++Q ++G
Sbjct: 350 LLLNAGYVVSLKKSEIAQHEVEFLGFNITKEGRGLTETFKQKLLNITPPRDLKQLQSILG 409
Query: 468 RIAALSRFLPCSGSKATPFFQCLR--KNRVFQWTDECEQAFQSLKELLSKPPILSRPIPG 525
+ F+P P + + + WT + Q Q++ +L+ L P
Sbjct: 410 LLNFARNFIPNFSELVKPLYNIIATANGKYITWTTDNSQQLQNIISMLNSAENLEERNPE 469
Query: 526 TPLSVFISISDNAVSSVLLQECKDELRIIYFVSHALQGAELRYQKIEKAALALIISARKL 585
L + ++ S +A E R I ++++ AE+++ EK + K
Sbjct: 470 VRLIMKVNTSPSAGYIRFYNEFAK--RPIMYLNYVYTKAEVKFTNTEKLLTTIHKGLIKA 527
Query: 586 RPYFQGFQIKVKTDFPLRQVLQKPDLAGRM------VSWAVELSEFGIVFE-KKGQVKAQ 638
G +I V + +QK L R ++W L + I F K + Q
Sbjct: 528 LDLGMGQEILVYSPIVSMTKIQKTPLPERKALPIRWITWMSYLEDPRIQFHYDKTLPELQ 587
Query: 639 VLADFVNEMSPEVKVSEEAEWILSVDGS---------SYLKGSGAGVVLEGPGGVIIEQS 689
+ +++ ++K E + DGS S+ G G V P +I
Sbjct: 588 QVPTVTDDIIAKIKHPSEFSMVFYTDGSAIKHPNVNKSHNAGMGIAQVQFKPEFTVINTW 647
Query: 690 LKFDFKASNNQAEYEAIIAGINLAIEMNVHCLVIKTDSQLVANQIK---------GDYQA 740
+ AE A+ A++++ L++ TDS VA + G +
Sbjct: 648 SIPLGDHTAQLAEVAAVEFACKKALKIDGPVLIV-TDSFYVAESVNKELPYWQSNGFFNN 706
Query: 741 KDIQLAKYLTKTQELMKRMDSVQVNHVPREENTRADVLCKLASTKKPGNNKSVIQETLKS 800
K L K+++K + + D +Q+ + D++ +P T +
Sbjct: 707 KKKPL-KHVSKWKSIA---DCIQL---------KPDIIIIHEKGHQP------TASTFHT 747
Query: 801 PSINEDDVVMVTGAAPSDWMDRIKMCLEADGADLALFSKDQVREASHYVLLGDQLYRRGV 860
N D + G+ + + E D + K + + + G + R
Sbjct: 748 EGNNLADKLATQGSYVVNINTTPSLDAELDQLLQGQYPKGFPKHYQYQLENGQVMVTRPN 807
Query: 861 GVPLLRCVTRDEADRIMFEVHEGVCASHVGGRSLAAKVLRAGFYWPTLKNDCMGYAKKCE 920
G ++ + + +I+ + H +H G S KV + ++WP L+ D + ++C+
Sbjct: 808 GKRIIP--PKSDRPQIILQAHN---IAHTGRDSTFLKV-SSKYWWPNLRKDVVKVIRQCK 861
Query: 921 KCQIYADLHRAPPEVLSSMSSAWPFAMWGVDILGPFTPAGAQIRFVLVAVDYFTKWIEA- 979
+C + A P +L PF + +D +GP P+ + VLV VD T ++
Sbjct: 862 QCLVTNAATLAAPPILRPERPVKPFDKFFIDYIGPLPPSNGYLH-VLVVVDSMTGFVWLY 920
Query: 980 ESMAKITAEKVKKFYWRKIICRFGVPATLVSDNGTQFTSRIVRDFCNEMGIEMRFASVEH 1039
+ A T+ VK ++ VP + SD G FTS D+ GI++ F++ H
Sbjct: 921 PTKAPSTSATVKAL---NMLTSIAVPKVIHSDQGAAFTSATFADWAKNKGIQLEFSTPYH 977
Query: 1040 PQSNGQVEAANKVILNGIKKRLGDAKGLWADELLTVVWAYNTTPQSTTGETPFRLTYGVD 1099
PQS+G+VE N I + K L W D L V A N + ++ TP +L +G+D
Sbjct: 978 PQSSGKVERKNSDIKRLLTKLLVGRPAKWYDLLPVVQLALNNSYSPSSKYTPHQLLFGID 1037
Query: 1100 AMVPVEIQDMTFRVAAYDENENHENRLIDLNLAEEVKTEVRLRQAAVKQRSERRYNTRVV 1159
+ P D T ++ +E L+L +E+++ + L + S R ++
Sbjct: 1038 SNTPFANSD-TLDLSREEE----------LSLLQEIRSSLYL--PSTPPASIRAWSP--- 1081
Query: 1160 PRHMQVGDLVLRRKAKGPDDSKLSPNWEGPYRILRDLG-QGAYHLEELSGRR 1210
VG LV R A+ + L P W P +L + + L+ L RR
Sbjct: 1082 ----SVGQLVQERVAR---PASLRPRWHKPTPVLEVINPRAVVILDHLGNRR 1126
>RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protein
type 1
Length = 1333
Score = 169 bits (428), Expect = 4e-41
Identities = 116/423 (27%), Positives = 206/423 (48%), Gaps = 10/423 (2%)
Query: 243 KAQAVKAETNKLIDAGFIREVKYPTWLANVVMVKKSNGKWRMCTDYTDLNKHCPKDSYPL 302
K QA+ E N+ + +G IRE K V+ V K G RM DY LNK+ + YPL
Sbjct: 424 KMQAMNDEINQGLKSGIIRESKAIN-ACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPL 482
Query: 303 PNIDKLVDRASGFGMLSLMDAYSGYHQIRMYAPDEEKTAFMTNQANYCYQTMPFGLKNAG 362
P I++L+ + G + + +D S YH IR+ DE K AF + + Y MP+G+ A
Sbjct: 483 PLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAP 542
Query: 363 ATYQRLMDRVFEGQVGRNMEIYVDDMVVKSEEMGGHCLDLAEAFGEIRKHNMRLNPEKCS 422
A +Q ++ + ++ Y+DD+++ S+ H + + +++ N+ +N KC
Sbjct: 543 AHFQYFINTILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCE 602
Query: 423 FGIQSGKFLGFMITRRGIEVNPDKCKAILEMQSPTSVKEVQKLIGRIAALSRFLPCSGSK 482
F KF+G+ I+ +G + +L+ + P + KE+++ +G + L +F+P +
Sbjct: 603 FHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQL 662
Query: 483 ATPFFQCLRKNRVFQWTDECEQAFQSLKELLSKPPILSRPIPGTPLSVFISISDNAVSSV 542
P L+K+ ++WT QA +++K+ L PP+L + + SD AV +V
Sbjct: 663 THPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAV 722
Query: 543 LLQECKDELRI-IYFVSHALQGAELRYQKIEKAALALIISARKLRPYFQGF--QIKVKTD 599
L Q+ D+ + + S + A+L Y +K LA+I S + R Y + K+ TD
Sbjct: 723 LSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTD 782
Query: 600 FP--LRQVLQKPDLAG-RMVSWAVELSEFGI-VFEKKGQVK--AQVLADFVNEMSPEVKV 653
+ ++ + + R+ W + L +F + + G A L+ V+E P K
Sbjct: 783 HRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDETEPIPKD 842
Query: 654 SEE 656
SE+
Sbjct: 843 SED 845
Score = 115 bits (287), Expect = 9e-25
Identities = 114/497 (22%), Positives = 212/497 (41%), Gaps = 36/497 (7%)
Query: 714 IEMNVHCLVIKTDSQLVANQIKGDYQAKDIQLAKYLTKTQELMKRMDSVQVNHVPREENT 773
+E + I TD + + +I + + ++ +LA++ Q+ + ++N+ P N
Sbjct: 770 LESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDF-----NFEINYRPGSANH 824
Query: 774 RADVLCKLASTKKPGNNKSVIQETLKSPSINEDDVVMVTGAAPSDWMDRIKMCLEADGAD 833
AD L ++ +P I + + SIN + + +T D+ +++ D
Sbjct: 825 IADALSRIVDETEP------IPKDSEDNSINFVNQISIT----DDFKNQVVTEYTNDTKL 874
Query: 834 LALFSKDQVREASHYVLLGDQLYRRGVGVPLLRCVTRDEADRIMFEVHEGVCASHVGGRS 893
L L + + R + + L D L LL T+ I+ + HE H G
Sbjct: 875 LNLLNNEDKRVEEN-IQLKDGLLINSKDQILLPNDTQ-LTRTIIKKYHEEGKLIHPGIEL 932
Query: 894 LAAKVLRAGFYWPTLKNDCMGYAKKCEKCQIYADLHRAPPEVLSSMS-SAWPFAMWGVDI 952
L +LR F W ++ Y + C CQI + P L + S P+ +D
Sbjct: 933 LTNIILRR-FTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLSMDF 991
Query: 953 LGPFTPAGAQIRFVLVAVDYFTKW-IEAESMAKITAEKVKKFYWRKIICRFGVPATLVSD 1011
+ P + + V VD F+K I ITAE+ + + +++I FG P +++D
Sbjct: 992 ITAL-PESSGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIAD 1050
Query: 1012 NGTQFTSRIVRDFCNEMGIEMRFASVEHPQSNGQVEAANKVILNGIKKRLGDAKGLWADE 1071
N FTS+ +DF ++ M+F+ PQ++GQ E N+ + ++ W D
Sbjct: 1051 NDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDH 1110
Query: 1072 LLTVVWAYNTTPQSTTGETPFRLTYGVD-AMVPVEIQDMTFRVAAYDENENHENRLIDLN 1130
+ V +YN S T TPF + + A+ P+E+ + ++ EN +
Sbjct: 1111 ISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLELPSFS--------DKTDENSQETIQ 1162
Query: 1131 LAEEVKTEVRLRQAAVKQRSERRYNTRVVPRHMQVGDLVLRRKAKG---PDDSKLSPNWE 1187
+ + VK + +K+ + + Q GDLV+ ++ K +KL+P++
Sbjct: 1163 VFQTVKEHLNTNNIKMKKYFDMKIQE---IEEFQPGDLVMVKRTKTGFLHKSNKLAPSFA 1219
Query: 1188 GPYRILRDLGQGAYHLE 1204
GP+ +L+ G Y L+
Sbjct: 1220 GPFYVLQKSGPNNYELD 1236
>RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protein
type 3
Length = 1333
Score = 166 bits (419), Expect = 5e-40
Identities = 115/423 (27%), Positives = 206/423 (48%), Gaps = 10/423 (2%)
Query: 243 KAQAVKAETNKLIDAGFIREVKYPTWLANVVMVKKSNGKWRMCTDYTDLNKHCPKDSYPL 302
K QA+ E N+ + +G IRE K V+ V K G RM DY LNK+ + YPL
Sbjct: 424 KMQAMNDEINQGLKSGIIRESKAIN-ACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPL 482
Query: 303 PNIDKLVDRASGFGMLSLMDAYSGYHQIRMYAPDEEKTAFMTNQANYCYQTMPFGLKNAG 362
P I++L+ + G + + +D S YH IR+ DE K AF + + Y MP+G+ A
Sbjct: 483 PLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISIAP 542
Query: 363 ATYQRLMDRVFEGQVGRNMEIYVDDMVVKSEEMGGHCLDLAEAFGEIRKHNMRLNPEKCS 422
A +Q ++ + ++ Y+D++++ S+ H + + +++ N+ +N KC
Sbjct: 543 AHFQYFINTILGEVKESHVVCYMDNILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCE 602
Query: 423 FGIQSGKFLGFMITRRGIEVNPDKCKAILEMQSPTSVKEVQKLIGRIAALSRFLPCSGSK 482
F KF+G+ I+ +G + +L+ + P + KE+++ +G + L +F+P +
Sbjct: 603 FHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQL 662
Query: 483 ATPFFQCLRKNRVFQWTDECEQAFQSLKELLSKPPILSRPIPGTPLSVFISISDNAVSSV 542
P L+K+ ++WT QA +++K+ L PP+L + + SD AV +V
Sbjct: 663 THPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAV 722
Query: 543 LLQECKDELRI-IYFVSHALQGAELRYQKIEKAALALIISARKLRPYFQGF--QIKVKTD 599
L Q+ D+ + + S + A+L Y +K LA+I S + R Y + K+ TD
Sbjct: 723 LSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTD 782
Query: 600 FP--LRQVLQKPDLAG-RMVSWAVELSEFGI-VFEKKGQVK--AQVLADFVNEMSPEVKV 653
+ ++ + + R+ W + L +F + + G A L+ V+E P K
Sbjct: 783 HRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDETEPIPKD 842
Query: 654 SEE 656
SE+
Sbjct: 843 SED 845
Score = 115 bits (287), Expect = 9e-25
Identities = 114/497 (22%), Positives = 212/497 (41%), Gaps = 36/497 (7%)
Query: 714 IEMNVHCLVIKTDSQLVANQIKGDYQAKDIQLAKYLTKTQELMKRMDSVQVNHVPREENT 773
+E + I TD + + +I + + ++ +LA++ Q+ + ++N+ P N
Sbjct: 770 LESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDF-----NFEINYRPGSANH 824
Query: 774 RADVLCKLASTKKPGNNKSVIQETLKSPSINEDDVVMVTGAAPSDWMDRIKMCLEADGAD 833
AD L ++ +P I + + SIN + + +T D+ +++ D
Sbjct: 825 IADALSRIVDETEP------IPKDSEDNSINFVNQISIT----DDFKNQVVTEYTNDTKL 874
Query: 834 LALFSKDQVREASHYVLLGDQLYRRGVGVPLLRCVTRDEADRIMFEVHEGVCASHVGGRS 893
L L + + R + + L D L LL T+ I+ + HE H G
Sbjct: 875 LNLLNNEDKRVEEN-IQLKDGLLINSKDQILLPNDTQ-LTRTIIKKYHEEGKLIHPGIEL 932
Query: 894 LAAKVLRAGFYWPTLKNDCMGYAKKCEKCQIYADLHRAPPEVLSSMS-SAWPFAMWGVDI 952
L +LR F W ++ Y + C CQI + P L + S P+ +D
Sbjct: 933 LTNIILRR-FTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLSMDF 991
Query: 953 LGPFTPAGAQIRFVLVAVDYFTKW-IEAESMAKITAEKVKKFYWRKIICRFGVPATLVSD 1011
+ P + + V VD F+K I ITAE+ + + +++I FG P +++D
Sbjct: 992 ITAL-PESSGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIAD 1050
Query: 1012 NGTQFTSRIVRDFCNEMGIEMRFASVEHPQSNGQVEAANKVILNGIKKRLGDAKGLWADE 1071
N FTS+ +DF ++ M+F+ PQ++GQ E N+ + ++ W D
Sbjct: 1051 NDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDH 1110
Query: 1072 LLTVVWAYNTTPQSTTGETPFRLTYGVD-AMVPVEIQDMTFRVAAYDENENHENRLIDLN 1130
+ V +YN S T TPF + + A+ P+E+ + ++ EN +
Sbjct: 1111 ISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLELPSFS--------DKTDENSQETIQ 1162
Query: 1131 LAEEVKTEVRLRQAAVKQRSERRYNTRVVPRHMQVGDLVLRRKAKG---PDDSKLSPNWE 1187
+ + VK + +K+ + + Q GDLV+ ++ K +KL+P++
Sbjct: 1163 VFQTVKEHLNTNNIKMKKYFDMKIQE---IEEFQPGDLVMVKRTKTGFLHKSNKLAPSFA 1219
Query: 1188 GPYRILRDLGQGAYHLE 1204
GP+ +L+ G Y L+
Sbjct: 1220 GPFYVLQKSGPNNYELD 1236
>RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protein
type 2
Length = 1333
Score = 166 bits (419), Expect = 5e-40
Identities = 115/423 (27%), Positives = 206/423 (48%), Gaps = 10/423 (2%)
Query: 243 KAQAVKAETNKLIDAGFIREVKYPTWLANVVMVKKSNGKWRMCTDYTDLNKHCPKDSYPL 302
K QA+ E N+ + +G IRE K V+ V K G RM DY LNK+ + YPL
Sbjct: 424 KMQAMNDEINQGLKSGIIRESKAIN-ACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPL 482
Query: 303 PNIDKLVDRASGFGMLSLMDAYSGYHQIRMYAPDEEKTAFMTNQANYCYQTMPFGLKNAG 362
P I++L+ + G + + +D S YH IR+ DE K AF + + Y MP+G+ A
Sbjct: 483 PLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISIAP 542
Query: 363 ATYQRLMDRVFEGQVGRNMEIYVDDMVVKSEEMGGHCLDLAEAFGEIRKHNMRLNPEKCS 422
A +Q ++ + ++ Y+D++++ S+ H + + +++ N+ +N KC
Sbjct: 543 AHFQYFINTILGEVKESHVVCYMDNILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCE 602
Query: 423 FGIQSGKFLGFMITRRGIEVNPDKCKAILEMQSPTSVKEVQKLIGRIAALSRFLPCSGSK 482
F KF+G+ I+ +G + +L+ + P + KE+++ +G + L +F+P +
Sbjct: 603 FHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQL 662
Query: 483 ATPFFQCLRKNRVFQWTDECEQAFQSLKELLSKPPILSRPIPGTPLSVFISISDNAVSSV 542
P L+K+ ++WT QA +++K+ L PP+L + + SD AV +V
Sbjct: 663 THPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAV 722
Query: 543 LLQECKDELRI-IYFVSHALQGAELRYQKIEKAALALIISARKLRPYFQGF--QIKVKTD 599
L Q+ D+ + + S + A+L Y +K LA+I S + R Y + K+ TD
Sbjct: 723 LSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTD 782
Query: 600 FP--LRQVLQKPDLAG-RMVSWAVELSEFGI-VFEKKGQVK--AQVLADFVNEMSPEVKV 653
+ ++ + + R+ W + L +F + + G A L+ V+E P K
Sbjct: 783 HRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDETEPIPKD 842
Query: 654 SEE 656
SE+
Sbjct: 843 SED 845
Score = 115 bits (287), Expect = 9e-25
Identities = 114/497 (22%), Positives = 212/497 (41%), Gaps = 36/497 (7%)
Query: 714 IEMNVHCLVIKTDSQLVANQIKGDYQAKDIQLAKYLTKTQELMKRMDSVQVNHVPREENT 773
+E + I TD + + +I + + ++ +LA++ Q+ + ++N+ P N
Sbjct: 770 LESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDF-----NFEINYRPGSANH 824
Query: 774 RADVLCKLASTKKPGNNKSVIQETLKSPSINEDDVVMVTGAAPSDWMDRIKMCLEADGAD 833
AD L ++ +P I + + SIN + + +T D+ +++ D
Sbjct: 825 IADALSRIVDETEP------IPKDSEDNSINFVNQISIT----DDFKNQVVTEYTNDTKL 874
Query: 834 LALFSKDQVREASHYVLLGDQLYRRGVGVPLLRCVTRDEADRIMFEVHEGVCASHVGGRS 893
L L + + R + + L D L LL T+ I+ + HE H G
Sbjct: 875 LNLLNNEDKRVEEN-IQLKDGLLINSKDQILLPNDTQ-LTRTIIKKYHEEGKLIHPGIEL 932
Query: 894 LAAKVLRAGFYWPTLKNDCMGYAKKCEKCQIYADLHRAPPEVLSSMS-SAWPFAMWGVDI 952
L +LR F W ++ Y + C CQI + P L + S P+ +D
Sbjct: 933 LTNIILRR-FTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLSMDF 991
Query: 953 LGPFTPAGAQIRFVLVAVDYFTKW-IEAESMAKITAEKVKKFYWRKIICRFGVPATLVSD 1011
+ P + + V VD F+K I ITAE+ + + +++I FG P +++D
Sbjct: 992 ITAL-PESSGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIAD 1050
Query: 1012 NGTQFTSRIVRDFCNEMGIEMRFASVEHPQSNGQVEAANKVILNGIKKRLGDAKGLWADE 1071
N FTS+ +DF ++ M+F+ PQ++GQ E N+ + ++ W D
Sbjct: 1051 NDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDH 1110
Query: 1072 LLTVVWAYNTTPQSTTGETPFRLTYGVD-AMVPVEIQDMTFRVAAYDENENHENRLIDLN 1130
+ V +YN S T TPF + + A+ P+E+ + ++ EN +
Sbjct: 1111 ISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLELPSFS--------DKTDENSQETIQ 1162
Query: 1131 LAEEVKTEVRLRQAAVKQRSERRYNTRVVPRHMQVGDLVLRRKAKG---PDDSKLSPNWE 1187
+ + VK + +K+ + + Q GDLV+ ++ K +KL+P++
Sbjct: 1163 VFQTVKEHLNTNNIKMKKYFDMKIQE---IEEFQPGDLVMVKRTKTGFLHKSNKLAPSFA 1219
Query: 1188 GPYRILRDLGQGAYHLE 1204
GP+ +L+ G Y L+
Sbjct: 1220 GPFYVLQKSGPNNYELD 1236
>POLY_DROME (P10401) Retrovirus-related Pol polyprotein from
transposon gypsy [Contains: Reverse transcriptase (EC
2.7.7.49); Endonuclease]
Length = 1035
Score = 155 bits (391), Expect = 8e-37
Identities = 119/422 (28%), Positives = 197/422 (46%), Gaps = 26/422 (6%)
Query: 247 VKAETNKLIDAGFIREVKYPTWLANVVMVKKS-----NGKWRMCTDYTDLNKHCPKDSYP 301
V E +L+ G IR + P V+ KK N R+ D+ LN+ D YP
Sbjct: 197 VNNEVKQLLKDGIIRPSRSPYNSPTWVVDKKGTDAFGNPNKRLVIDFRKLNEKTIPDRYP 256
Query: 302 LPNIDKLVDRASGFGMLSLMDAYSGYHQIRMYAPDEEKTAFMTNQANYCYQTMPFGLKNA 361
+P+I ++ + +D SGYHQI + D EKT+F N Y + +PFGL+NA
Sbjct: 257 MPSIPMILANLGKAKFFTTLDLKSGYHQIYLAEHDREKTSFSVNGGKYEFCRLPFGLRNA 316
Query: 362 GATYQRLMDRVFEGQVGRNMEIYVDDMVVKSEEMGGHCLDLAEAFGEIRKHNMRLNPEKC 421
+ +QR +D V Q+G+ +YVDD+++ SE H + + NMR++ EK
Sbjct: 317 SSIFQRALDDVLREQIGKICYVYVDDVIIFSENESDHVRHIDTVLKCLIDANMRVSQEKT 376
Query: 422 SFGIQSGKFLGFMITRRGIEVNPDKCKAILEMQSPTSVKEVQKLIGRIAALSRFLPCSGS 481
F +S ++LGF++++ G + +P+K KAI E P V +V+ +G + F+ +
Sbjct: 377 RFFKESVEYLGFIVSKDGTKSDPEKVKAIQEYPEPDCVYKVRSFLGLASYYRVFIKDFAA 436
Query: 482 KATPFFQCLR-----------KNRVFQWTDECEQAFQSLKELL-SKPPILSRPIPGTPLS 529
A P L+ K ++ + AFQ L+ +L S+ IL P P
Sbjct: 437 IARPITDILKGENGSVSKHMSKKIPVEFNETQRNAFQRLRNILASEDVILKYPDFKKPFD 496
Query: 530 VFISISDNAVSSVLLQECKDELRIIYFVSHALQGAELRYQKIEKAALALIISARKLRPYF 589
+ S + + +VL QE R I +S L+ E Y E+ LA++ + KL+ +
Sbjct: 497 LTTDASASGIGAVLSQEG----RPITMISRTLKQPEQNYATNERELLAIVWALGKLQNFL 552
Query: 590 QGF-QIKVKTDF-PLRQVLQKPDLAGRMVSWAVELSEFGI-VFEKKGQVKAQVLADFVNE 646
G +I + TD PL + + ++ W + + VF K G K +AD ++
Sbjct: 553 YGSREINIFTDHQPLTFAVADRNTNAKIKRWKSYIDQHNAKVFYKPG--KENFVADALSR 610
Query: 647 MS 648
+
Sbjct: 611 QN 612
Score = 42.0 bits (97), Expect = 0.010
Identities = 62/270 (22%), Positives = 103/270 (37%), Gaps = 32/270 (11%)
Query: 887 SHVGGRSLAAKVLRAGFYWPTLKNDCMGYAKKCEKC-QIYADLHRAPPEVLSSMSSAWPF 945
+H + +VLR +Y+P + + C C Q D H E+ + ++
Sbjct: 749 AHRAAQENIKQVLR-DYYFPKMGSLAKEVVANCRVCTQAKYDRHPKKQELGETPIPSYTG 807
Query: 946 AMWGVDILGPFTPAGAQIRFVLVAVDYFTKW-----IEAESMAKITAEKVKKFYWRKIIC 1000
M +DI + L +D F+K+ + + ++ ITA ++ II
Sbjct: 808 EMVHIDIFS------TDRKLFLTCIDKFSKYAIVQPVVSRTIVDITAPLLQ------IIN 855
Query: 1001 RFGVPATLVSDNGTQFTSRIVRDFC-NEMGIEMRFASVEHPQSNGQVEAANKVILN---- 1055
F T+ DN F S V N GI++ A H SNGQVE + +
Sbjct: 856 LFPNIKTVYCDNEPAFNSETVTSMLKNSFGIDIVNAPPLHSSSNGQVERFHSTLAEIARC 915
Query: 1056 -GIKKRLGDAKGLWADELLTVVWAYNTTPQSTTGETPFRLTYGVDAMVPVEIQDMTFRV- 1113
+ K+ D L +L YN T S T E P + + +EI+ +
Sbjct: 916 LKLDKKTNDTVEL----ILRATIEYNKTVHSVTRERPIEVVHPGAHERCLEIKARLVKAQ 971
Query: 1114 --AAYDENENHENRLIDLNLAEEVKTEVRL 1141
+ N + +NR+ ++ VK RL
Sbjct: 972 QDSIGRNNPSRQNRVFEVGERVFVKNNKRL 1001
>POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 679
Score = 121 bits (304), Expect = 1e-26
Identities = 114/468 (24%), Positives = 200/468 (42%), Gaps = 26/468 (5%)
Query: 156 EGFLEH--KMTPEEETKTVKVGERNLKVGVNLTAI---------QEARLTQLLAENMDLF 204
EGFLE K + ++ + V + ++ + AI ++ +TQ + ++
Sbjct: 156 EGFLESMKKRSKTQQPEPVNISTNKIENPLEEIAILSEGRRLSEEKLFITQQRMQKIEEL 215
Query: 205 AWSAQDLPGIDPN----FICHKLALNPGVKPIAQMKRKMGEEKAQAVKAETNKLIDAGFI 260
+DPN ++ + L+ K I K + + +L+D I
Sbjct: 216 LEKVCSENPLDPNKTKQWMKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVI 275
Query: 261 REVKYPTWLANVVM---VKKSNGKWRMCTDYTDLNKHCPKDSYPLPNIDKLVDRASGFGM 317
+ K P ++ +K GK RM +Y +NK D+Y LPN D+L+ G +
Sbjct: 276 KPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATVGDAYNLPNKDELLTLIRGKKI 335
Query: 318 LSLMDAYSGYHQIRMYAPDEEKTAFMTNQANYCYQTMPFGLKNAGATYQRLMDRVFEGQV 377
S D SG+ Q+ + TAF Q +Y + +PFGLK A + +QR MD F +V
Sbjct: 336 FSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAF--RV 393
Query: 378 GRNM-EIYVDDMVVKSEEMGGHCLDLAEAFGEIRKHNMRLNPEKCSFGIQSGKFLGFMIT 436
R +YVDD++V S H L +A + +H + L+ +K + FLG I
Sbjct: 394 FRKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEID 453
Query: 437 RRGIEVNPDKCKAILEM-QSPTSVKEVQKLIGRIAALSRFLPCSGSKATPFFQCLRKNRV 495
+ + I + + K++Q+ +G + S ++P P L++N
Sbjct: 454 EGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVP 513
Query: 496 FQWTDECEQAFQSLKELLSKPPILSRPIPGTPLSVFISISDN----AVSSVLLQECKDEL 551
++WT E Q +K+ L P L P+P L + SD+ + ++ + E +
Sbjct: 514 WRWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTE 573
Query: 552 RIIYFVSHALQGAELRYQKIEKAALALIISARKLRPYFQGFQIKVKTD 599
I + S + + AE Y +K LA+I + +K Y ++TD
Sbjct: 574 LICRYASGSFKAAEKNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTD 621
>POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 674
Score = 121 bits (304), Expect = 1e-26
Identities = 113/461 (24%), Positives = 197/461 (42%), Gaps = 19/461 (4%)
Query: 156 EGFLEH--KMTPEEETKTVKVGERNLKVGVNLTAIQEARL--TQLLAENMDLFAWSAQDL 211
EGFLE K + ++ + V + + + + E +L TQ + ++
Sbjct: 158 EGFLESMKKRSKTQQPEPVNISTNKIAILSEGRRLSEEKLFITQQRMQKIEELLEKVCSE 217
Query: 212 PGIDPN----FICHKLALNPGVKPIAQMKRKMGEEKAQAVKAETNKLIDAGFIREVKYPT 267
+DPN ++ + L+ K I K + + +L+D I+ K P
Sbjct: 218 NPLDPNKTKQWMKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPH 277
Query: 268 WLANVVM---VKKSNGKWRMCTDYTDLNKHCPKDSYPLPNIDKLVDRASGFGMLSLMDAY 324
++ +K GK RM +Y +NK D+Y PN D+L+ G + S D
Sbjct: 278 MAPAFLVNNEAEKRRGKKRMVVNYKAMNKATVGDAYNPPNKDELLTLIRGKKIFSSFDCK 337
Query: 325 SGYHQIRMYAPDEEKTAFMTNQANYCYQTMPFGLKNAGATYQRLMDRVFEGQVGRNM-EI 383
SG+ Q+ + TAF Q +Y + +PFGLK A + +QR MD F +V R +
Sbjct: 338 SGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAF--RVFRKFCCV 395
Query: 384 YVDDMVVKSEEMGGHCLDLAEAFGEIRKHNMRLNPEKCSFGIQSGKFLGFMITRRGIEVN 443
YVDD++V S H L +A + +H + L+ +K + FLG I +
Sbjct: 396 YVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQ 455
Query: 444 PDKCKAILEM-QSPTSVKEVQKLIGRIAALSRFLPCSGSKATPFFQCLRKNRVFQWTDEC 502
+ I + + K++Q+ +G + S ++P P L++N ++WT E
Sbjct: 456 GHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKED 515
Query: 503 EQAFQSLKELLSKPPILSRPIPGTPLSVFISISDN----AVSSVLLQECKDELRIIYFVS 558
Q +K+ L P L P+P L + SD+ + ++ + E + I + S
Sbjct: 516 TLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYAS 575
Query: 559 HALQGAELRYQKIEKAALALIISARKLRPYFQGFQIKVKTD 599
+ + AE Y +K LA+I + +K Y ++TD
Sbjct: 576 GSFKAAEKNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTD 616
>POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 679
Score = 121 bits (303), Expect = 1e-26
Identities = 114/468 (24%), Positives = 200/468 (42%), Gaps = 26/468 (5%)
Query: 156 EGFLEH--KMTPEEETKTVKVGERNLKVGVNLTAI---------QEARLTQLLAENMDLF 204
EGFLE K + ++ + V + ++ + AI ++ +TQ + ++
Sbjct: 156 EGFLESMKKRSKTQQPEPVNISTNKIENPLKEIAILSEGRRLSEEKLFITQQRMQKIEEL 215
Query: 205 AWSAQDLPGIDPN----FICHKLALNPGVKPIAQMKRKMGEEKAQAVKAETNKLIDAGFI 260
+DPN ++ + L+ K I K + + +L+D I
Sbjct: 216 LEKVCSENPLDPNKTKQWMKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVI 275
Query: 261 REVKYPTWLANVVM---VKKSNGKWRMCTDYTDLNKHCPKDSYPLPNIDKLVDRASGFGM 317
+ K P ++ +K GK RM +Y +NK D+Y LPN D+L+ G +
Sbjct: 276 KPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATIGDAYNLPNKDELLTLIRGKKI 335
Query: 318 LSLMDAYSGYHQIRMYAPDEEKTAFMTNQANYCYQTMPFGLKNAGATYQRLMDRVFEGQV 377
S D SG+ Q+ + TAF Q +Y + +PFGLK A + +QR MD F +V
Sbjct: 336 FSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAF--RV 393
Query: 378 GRNM-EIYVDDMVVKSEEMGGHCLDLAEAFGEIRKHNMRLNPEKCSFGIQSGKFLGFMIT 436
R +YVDD++V S H L +A + +H + L+ +K + FLG I
Sbjct: 394 FRKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEID 453
Query: 437 RRGIEVNPDKCKAILEM-QSPTSVKEVQKLIGRIAALSRFLPCSGSKATPFFQCLRKNRV 495
+ + I + + K++Q+ +G + S ++P P L++N
Sbjct: 454 EGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVP 513
Query: 496 FQWTDECEQAFQSLKELLSKPPILSRPIPGTPLSVFISISDN----AVSSVLLQECKDEL 551
++WT E Q +K+ L P L P+P L + SD+ + ++ + E +
Sbjct: 514 WKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTE 573
Query: 552 RIIYFVSHALQGAELRYQKIEKAALALIISARKLRPYFQGFQIKVKTD 599
I + S + + AE Y +K LA+I + +K Y ++TD
Sbjct: 574 LICRYASGSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTD 621
>POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 679
Score = 121 bits (303), Expect = 1e-26
Identities = 114/468 (24%), Positives = 200/468 (42%), Gaps = 26/468 (5%)
Query: 156 EGFLEH--KMTPEEETKTVKVGERNLKVGVNLTAI---------QEARLTQLLAENMDLF 204
EGFLE K + ++ + V + ++ + AI ++ +TQ + ++
Sbjct: 156 EGFLESMKKRSKTQQPEPVNISTNKIENPLEEIAILSEGRRLSEEKLFITQQRMQKIEEL 215
Query: 205 AWSAQDLPGIDPN----FICHKLALNPGVKPIAQMKRKMGEEKAQAVKAETNKLIDAGFI 260
+DPN ++ + L+ K I K + + +L+D I
Sbjct: 216 LEKVCSENPLDPNKTKQWMKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVI 275
Query: 261 REVKYPTWLANVVM---VKKSNGKWRMCTDYTDLNKHCPKDSYPLPNIDKLVDRASGFGM 317
+ K P ++ +K GK RM +Y +NK D+Y LPN D+L+ G +
Sbjct: 276 KPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATIGDAYNLPNKDELLTLIRGKKI 335
Query: 318 LSLMDAYSGYHQIRMYAPDEEKTAFMTNQANYCYQTMPFGLKNAGATYQRLMDRVFEGQV 377
S D SG+ Q+ + TAF Q +Y + +PFGLK A + +QR MD F +V
Sbjct: 336 FSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAF--RV 393
Query: 378 GRNM-EIYVDDMVVKSEEMGGHCLDLAEAFGEIRKHNMRLNPEKCSFGIQSGKFLGFMIT 436
R +YVDD++V S H L +A + +H + L+ +K + FLG I
Sbjct: 394 FRKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEID 453
Query: 437 RRGIEVNPDKCKAILEM-QSPTSVKEVQKLIGRIAALSRFLPCSGSKATPFFQCLRKNRV 495
+ + I + + K++Q+ +G + S ++P P L++N
Sbjct: 454 EGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVP 513
Query: 496 FQWTDECEQAFQSLKELLSKPPILSRPIPGTPLSVFISISDN----AVSSVLLQECKDEL 551
++WT E Q +K+ L P L P+P L + SD+ + ++ + E +
Sbjct: 514 WKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTE 573
Query: 552 RIIYFVSHALQGAELRYQKIEKAALALIISARKLRPYFQGFQIKVKTD 599
I + S + + AE Y +K LA+I + +K Y ++TD
Sbjct: 574 LICRYASGSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTD 621
>YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II
Length = 1268
Score = 120 bits (302), Expect = 2e-26
Identities = 73/249 (29%), Positives = 129/249 (51%), Gaps = 7/249 (2%)
Query: 231 PIAQMKRKMGEEKAQAVKAETNKLIDAGFIREVKYPTWLANVVMVKKSN-GKWRMCTDY- 288
P+ + R + +AV+ E N+L + G I + Y W A +V++KK GK R+C D+
Sbjct: 440 PVFKRARPVPYGSLEAVETELNRLQEMGVIVPITYAKWAAPIVVIKKKGTGKIRVCADFK 499
Query: 289 -TDLNKHCPKDSYPLPNIDKLVDRASGFGMLSLMDAYSGYHQIRMYAPDEEKTAFMTNQA 347
+ LN + +PLP + + R G + S +D Y Q+ + ++ T++
Sbjct: 500 CSGLNAALKDEFHPLPTSEDIFSRLKGT-VYSQIDLKDAYLQVELDEEAQKLAVINTHRG 558
Query: 348 NYCYQTMPFGLKNAGATYQRLMDRVFEGQVGRNMEIYVDDMVVKSEEMGGHCLDLAEAFG 407
+ Y M FGLK A A++Q++MD++ G G + +Y DD+++ + + H L E F
Sbjct: 559 IFKYLRMTFGLKPAPASFQKIMDKMVSGLTG--VAVYWDDIIISASSIEEHEKILRELFE 616
Query: 408 EIRKHNMRLNPEKCSFGIQSGKFLGFMITRRGIEVNPDKCKAILEMQSPTSVKEVQKLIG 467
+++ R++ EKC+F + FLGF + G + K +AI M++PT K++ +G
Sbjct: 617 RFKEYGFRVSAEKCAFAQKQVTFLGF-VDEHGRRPDSKKTEAIRSMKAPTDQKQLASFLG 675
Query: 468 RIAALSRFL 476
LSR +
Sbjct: 676 AADWLSRMM 684
Score = 82.8 bits (203), Expect = 5e-15
Identities = 72/264 (27%), Positives = 113/264 (42%), Gaps = 32/264 (12%)
Query: 876 IMFEVHEGVCASHVGGRSLAAKVLRAGFYWPTLKNDCMGYAKKCEKCQIYADLHRAPPEV 935
++ ++HEG H G + K R+ +W L +D + C CQ + + R P
Sbjct: 786 VLKQLHEG----HPGIVQMKQKA-RSFVFWRGLDSDIENMVRHCNNCQENSKMPRVVP-- 838
Query: 936 LSSMSSAWPF--AMWG---VDILGPFTPAGAQIRFVLVAVDYFTKWIEAESMAKITAEKV 990
+ WP A W +D GP ++LV VD TK+ E + I+A
Sbjct: 839 ----LNPWPVPEAPWKRIHIDFAGPLNGC-----YLLVVVDAKTKYAEVKLTRSISAVTT 889
Query: 991 KKFYWRKIICRFGVPATLVSDNGTQFTSRIVRDFCNEMGIEMRFASVEHPQSNGQVEAAN 1050
+I G P T++SDNGTQ TS + C GIE + ++V +P+SNG E
Sbjct: 890 IDLL-EEIFSIHGYPETIISDNGTQLTSHLFAQMCQSHGIEHKTSAVYYPRSNGAAERFV 948
Query: 1051 KVILNGIKKRLGDAKGLWADELLT-VVWAYNTTPQST-TGETPFRLTYG------VDAMV 1102
+ GI K G+ G ++L + +Y TP S G TP +G + ++
Sbjct: 949 DTLKRGIAKIKGE--GSVNQQILNKFLISYRNTPHSALNGSTPAECHFGRKIRTTMSLLM 1006
Query: 1103 PVEIQDMTFRVAAYDENENHENRL 1126
P + ++ Y +N H L
Sbjct: 1007 PTDRVLKVPKLTQYQQNMKHHYEL 1030
>POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 680
Score = 119 bits (297), Expect = 7e-26
Identities = 116/469 (24%), Positives = 200/469 (41%), Gaps = 28/469 (5%)
Query: 156 EGFLEH--KMTPEEETKTVKVGERNLKVGVNLTAI-------QEARL---TQLLAENMDL 203
EGFLE K + ++ + V + ++ + AI E +L Q + + +L
Sbjct: 157 EGFLESMKKRSKTQQPEPVNISTNKIENPLEEIAILSEGRRLSEEKLFITQQRMQKTEEL 216
Query: 204 FAWSAQDLPGIDPN----FICHKLALNPGVKPIAQMKRKMGEEKAQAVKAETNKLIDAGF 259
+ P +DPN ++ + L+ K I K + + +L+D
Sbjct: 217 LEKVCSENP-LDPNKTKQWMKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKV 275
Query: 260 IREVKYPTWLANVVM---VKKSNGKWRMCTDYTDLNKHCPKDSYPLPNIDKLVDRASGFG 316
I+ K P ++ + G RM +Y +NK D+Y LPN D+L+ G
Sbjct: 276 IKPSKSPHMAPAFLVNNEAENGRGNKRMVVNYKAMNKATVGDAYNLPNKDELLTLIRGKK 335
Query: 317 MLSLMDAYSGYHQIRMYAPDEEKTAFMTNQANYCYQTMPFGLKNAGATYQRLMDRVFEGQ 376
+ S D SG+ Q+ + TAF Q +Y + +PFGLK A + +QR MD F +
Sbjct: 336 IFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAF--R 393
Query: 377 VGRNM-EIYVDDMVVKSEEMGGHCLDLAEAFGEIRKHNMRLNPEKCSFGIQSGKFLGFMI 435
V R +YVDD+VV S H L +A + +H + L+ +K + FLG I
Sbjct: 394 VFRKFCCVYVDDIVVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEI 453
Query: 436 TRRGIEVNPDKCKAILEM-QSPTSVKEVQKLIGRIAALSRFLPCSGSKATPFFQCLRKNR 494
+ + I + + K++Q+ +G + S ++P P L++N
Sbjct: 454 DEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPNLAQMRQPLQAKLKENV 513
Query: 495 VFQWTDECEQAFQSLKELLSKPPILSRPIPGTPLSVFISISDN----AVSSVLLQECKDE 550
++WT E Q +K+ L P L P+P L + SD+ + ++ + E +
Sbjct: 514 PWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNT 573
Query: 551 LRIIYFVSHALQGAELRYQKIEKAALALIISARKLRPYFQGFQIKVKTD 599
I + S + + AE Y +K LA+I + +K Y ++TD
Sbjct: 574 ELICRYRSGSFKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTD 622
>POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 666
Score = 110 bits (276), Expect = 2e-23
Identities = 98/383 (25%), Positives = 163/383 (41%), Gaps = 27/383 (7%)
Query: 276 KKSNGKWRMCTDYTDLNKHCPKDSYPLPNIDKLVDRASGFGMLSLMDAYSGYHQIRMYAP 335
++ GK RM +Y +N+ DS+ LPN+ +L+ G + S D SG+ Q+ +
Sbjct: 287 ERRRGKKRMVVNYKAINQATIGDSHNLPNMQELLTLLRGKSIFSSFDCKSGFWQVVLDEE 346
Query: 336 DEEKTAFMTNQANYCYQTMPFGLKNAGATYQRLMDRVFEGQVGRNMEIYVDDMVVKSEEM 395
++ TAF Q ++ ++ +PFGLK A + +QR M G + +YVDD++V S
Sbjct: 347 SQKLTAFTCPQGHFQWKVVPFGLKQAPSIFQRHMQTALNG-ADKFCMVYVDDIIVFSNSE 405
Query: 396 GGHCLDLAEAFGEIRKHNMRLNPEKCSFGIQSGKFLGFMITR----------RGIEVNPD 445
H + + K+ + L+ +K + + FLG I + I PD
Sbjct: 406 LDHYNHVYAVLKIVEKYGIILSKKKANLFKEKINFLGLEIDKGTHCPQNHILENIHKFPD 465
Query: 446 KCKAILEMQSPTSVKEVQKLIGRIAALSRFLPCSGSKATPFFQCLRKNRVFQWTDECEQA 505
+ + K +Q+ +G + ++P P L+K+ + WT
Sbjct: 466 RLE---------DKKHLQRFLGVLTYAETYIPKLAEIRKPLQVKLKKDVTWNWTQSDSDY 516
Query: 506 FQSLKELLSKPPILSRPIPGTPLSVFISISDNAVSSVLLQECKDELRII-YFVSHALQGA 564
+ +K+ L P L P P L + SD+ VL D + +I + S + + A
Sbjct: 517 VKKIKKNLGSFPKLYLPKPEDHLIIETDASDSFWGGVLKARALDGVELICRYSSGSFKQA 576
Query: 565 ELRYQKIEKAALALIISARKLRPYFQGFQIKVKTD-----FPLRQVLQKPDLAGRMVSWA 619
E Y +K LA+ K Y + V+TD + LR L+ GR+V W
Sbjct: 577 EKNYHSNDKELLAVKQVITKFSAYLTPVRFTVRTDNKNFTYFLRINLKGDSKQGRLVRWQ 636
Query: 620 VELSEFGIVFEKKGQVKAQVLAD 642
S++ E VK VLAD
Sbjct: 637 NWFSKYQFDVEHLEGVK-NVLAD 658
>POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 659
Score = 108 bits (271), Expect = 7e-23
Identities = 96/387 (24%), Positives = 158/387 (40%), Gaps = 26/387 (6%)
Query: 276 KKSNGKWRMCTDYTDLNKHCPKDSYPLPNIDKLVDRASGFGMLSLMDAYSGYHQIRMYAP 335
++ GK RM +Y +NK D++ LPN D+L+ G + S D SG Q+ +
Sbjct: 276 ERRRGKKRMVVNYKAMNKATKGDAHNLPNKDELLTLVRGKKIYSSFDCKSGLWQVLLDKE 335
Query: 336 DEEKTAFMTNQANYCYQTMPFGLKNAGATYQRLMDRVFEGQVGRNMEIYVDDMVVKSEE- 394
+ TAF Q +Y + +PFGLK A + + + Q + +YVDD++V S
Sbjct: 336 SQLLTAFTCPQGHYQWNVVPFGLKQAPSIFPKTYANSHSNQYSKYCCVYVDDILVFSNTG 395
Query: 395 MGGHCLDLAEAFGEIRKHNMRLNPEKCSFGIQSGKFLGFMITR----------RGIEVNP 444
H + + K + L+ +K + FLG I + I P
Sbjct: 396 RKEHYIHVLNILRRCEKLGIILSKKKAQLFKEKINFLGLEIDQGTHCPQNHILEHIHKFP 455
Query: 445 DKCKAILEMQSPTSVKEVQKLIGRIAALSRFLPCSGSKATPFFQCLRKNRVFQWTDECEQ 504
D+ + K++Q+ +G + S ++P S P L+++ + W D Q
Sbjct: 456 DRIE---------DKKQLQRFLGILTYASDYIPKLASIRKPLQSKLKEDSTWTWNDTDSQ 506
Query: 505 AFQSLKELLSKPPILSRPIPGTPLSVFISISDNAVSSVLLQECKDELRIIYFVSHALQGA 564
+K+ L P L P P L + S+ +L I + S + + A
Sbjct: 507 YMAKIKKNLKSFPKLYHPEPNDKLVIETDASEEFWGGILKAIHNSHEYICRYASGSFKAA 566
Query: 565 ELRYQKIEKAALALIISARKLRPYFQGFQIKVKTDFP-----LRQVLQKPDLAGRMVSWA 619
E Y EK LA+I +K Y + ++TD + L+ GR+V W
Sbjct: 567 ERNYHSNEKELLAVIRVIKKFSIYLTPSRFLIRTDNKNFTHFVNINLKGDRKQGRLVRWQ 626
Query: 620 VELSEFGIVFEKKGQVKAQVLADFVNE 646
+ LS++ E K V ADF+ E
Sbjct: 627 MWLSQYDFDVEHIAGTK-NVFADFLQE 652
>POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;
Protease (EC 3.4.23.-); Reverse transcriptase (EC
2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
Length = 1886
Score = 99.0 bits (245), Expect = 7e-20
Identities = 85/341 (24%), Positives = 146/341 (41%), Gaps = 17/341 (4%)
Query: 220 CHKLALNPGVKPIAQMKRKMGEEKAQAVKAETNKLIDAGFIR--EVKYPTWLANV----- 272
C +NP +K + + + + +A+ + N L+ IR E K+ + V
Sbjct: 1391 CKLNIINPDIKIMGRPIKHVTPGDEEAMTRQINLLLQMKVIRPSESKHRSTAFIVRSGTE 1450
Query: 273 ---VMVKKSNGKWRMCTDYTDLNKHCPKDSYPLPNIDKLVDRASGFGMLSLMDAYSGYHQ 329
+ K+ GK RM +Y LN++ D Y LP I+ ++ + + S D SG+ Q
Sbjct: 1451 IDPITGKEKKGKERMVFNYKLLNENTESDQYSLPGINTIISKVGRSKIYSKFDLKSGFWQ 1510
Query: 330 IRMYAPDEEKTAFMTNQANYCYQTMPFGLKNAGATYQRLMDRVFEGQVGRNMEIYVDDMV 389
+ M TAF+ Y + MPFGLKNA A +QR MD VF+G + + +Y+DD++
Sbjct: 1511 VAMEEESVPWTAFLAGNKLYEWLVMPFGLKNAPAIFQRKMDNVFKG-TEKFIAVYIDDIL 1569
Query: 390 VKSEEMGGHCLDLAEAFGEIRKHNMRLNPEKCSFGIQSGKFLGFMITRRGIEVNPDKCKA 449
V SE H L +++ + L+P K G FLG + I++ P
Sbjct: 1570 VFSETAEQHSQHLYTMLQLCKENGLILSPTKMKIGTPEIDFLGASLGCTKIKLQPHIISK 1629
Query: 450 ILEM--QSPTSVKEVQKLIGRIAALSRFLPCSGSKATPFFQCLRKNRVFQWTDECEQAFQ 507
I + + + + ++ +G ++ ++ G P Q + + E + +
Sbjct: 1630 ICDFSDEKLATPEGMRSWLGILSYARNYIQDIGKLVQPLRQKMAPTGDKRMNPETWKMVR 1689
Query: 508 SLKELLSKPPILSRPIPGTPLSVFISISDNAVSSVLLQECK 548
+KE + P L P P FI I + + CK
Sbjct: 1690 QIKEKVKNLPDLQLP----PKDSFIIIETDGCMTGWGAVCK 1726
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.320 0.136 0.403
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 142,668,683
Number of Sequences: 164201
Number of extensions: 6095076
Number of successful extensions: 15102
Number of sequences better than 10.0: 169
Number of HSP's better than 10.0 without gapping: 109
Number of HSP's successfully gapped in prelim test: 60
Number of HSP's that attempted gapping in prelim test: 14727
Number of HSP's gapped (non-prelim): 320
length of query: 1224
length of database: 59,974,054
effective HSP length: 122
effective length of query: 1102
effective length of database: 39,941,532
effective search space: 44015568264
effective search space used: 44015568264
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 72 (32.3 bits)
Lotus: description of TM0306.1