
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC148652.2 - phase: 0 /pseudo
(1145 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
POL3_DROME (P04323) Retrovirus-related Pol polyprotein from tran... 223 2e-57
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III 203 2e-51
POL2_DROME (P20825) Retrovirus-related Pol polyprotein from tran... 202 6e-51
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran... 197 1e-49
POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from tran... 192 4e-48
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei... 185 7e-46
RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protei... 182 3e-45
RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protei... 182 3e-45
POLY_DROME (P10401) Retrovirus-related Pol polyprotein from tran... 174 9e-43
POL_FOAMV (P14350) Pol polyprotein [Contains: Reverse transcript... 156 3e-37
POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic pro... 123 2e-27
POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic pro... 122 7e-27
YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II 121 9e-27
POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic pro... 120 2e-26
POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic pro... 116 4e-25
POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic pro... 115 5e-25
POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic prot... 109 4e-23
POL_RTBVP (P27502) Polyprotein (P194 protein) [Contains: Coat pr... 106 3e-22
POL_MLVFF (P26809) Pol polyprotein [Contains: Protease (EC 3.4.2... 101 1e-20
POL_MLVRK (P31795) Pol polyprotein [Contains: Protease (EC 3.4.2... 100 2e-20
>POL3_DROME (P04323) Retrovirus-related Pol polyprotein from
transposon 17.6 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1058
Score = 223 bits (569), Expect = 2e-57
Identities = 146/473 (30%), Positives = 244/473 (50%), Gaps = 22/473 (4%)
Query: 64 KRKIFQLLREYPDIFAWSYEDMPGLD-PKIVEHRIPTKPEYPPVRQKLRRTHPDM-ALKI 121
K+++ LL++Y DI Y + L +H I TK P + ++P ++
Sbjct: 170 KQRLCALLQKYHDI---QYHEGDKLTFTNQTKHTINTKHNLPLYS---KYSYPQAYEQEV 223
Query: 122 KSEVQKQIDAGFLMTVEYPEWVANIVPVPKKDG-----KVRMCVDFRDLIKASPKDNFPL 176
+S++Q ++ G + T P + + I VPKK K R+ +D+R L + + D P+
Sbjct: 224 ESQIQDMLNQGIIRTSNSP-YNSPIWVVPKKQDASGKQKFRIVIDYRKLNEITVGDRHPI 282
Query: 177 PHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSPEDREKTSFITP*GTFCYKVMPFGLINAG 236
P++D ++ + F+ +D G++QI+M PE KT+F T G + Y MPFGL NA
Sbjct: 283 PNMDEILGKLGRCNYFTTIDLAKGFHQIEMDPESVSKTAFSTKHGHYEYLRMPFGLKNAP 342
Query: 237 ATY*RGMTTLFHDMIHKEVEVYVDDMIVKSKDEEQHVEYLTKMFERLRKYKLRLNPNKCT 296
AT+ R M + +++K VY+DD+IV S ++H++ L +FE+L K L+L +KC
Sbjct: 343 ATFQRCMNDILRPLLNKHCLVYLDDIIVFSTSLDEHLQSLGLVFEKLAKANLKLQLDKCE 402
Query: 297 FGVRSGKLLGFIVSQKGIEVDPDKVRAIREMPAPQTEKQVRGFLGRLNYISRFISHMTAT 356
F + LG +++ GI+ +P+K+ AI++ P P K+++ FLG Y +FI +
Sbjct: 403 FLKQETTFLGHVLTPDGIKPNPEKIEAIQKYPIPTKPKEIKAFLGLTGYYRKFIPNFADI 462
Query: 357 CGPIFKLLRKNQPI-VWNDECQGAFDSIKNYLLEPPILVPPVEGRPLIMYLSVFDESVGC 415
P+ K L+KN I N E AF +K + E PIL P + + D ++G
Sbjct: 463 AKPMTKCLKKNMKIDTTNPEYDSAFKKLKYLISEDPILKVPDFTKKFTLTTDASDVALGA 522
Query: 416 VLGQQDETGKKEHAIYYLSKKFTDCETRYTMLEKTCCALVWAAKRLRHYLVNHTTWLISR 475
VL Q H + Y+S+ + E Y+ +EK A+VWA K RHYL+ + S
Sbjct: 523 VLSQDG------HPLSYISRTLNEHEINYSTIEKELLAIVWATKTFRHYLLGRHFEISSD 576
Query: 476 MDQIKYIFEKPAVTRKIARWQMLLSEYDIVFKAQKAIKGSILADHLAYQPLDD 528
+ +++ K+ RW++ LSE+D K K K + +AD L+ L++
Sbjct: 577 HQPLSWLYRMKDPNSKLTRWRVKLSEFDFDIKYIKG-KENCVADALSRIKLEE 628
>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
Length = 2186
Score = 203 bits (517), Expect = 2e-51
Identities = 126/460 (27%), Positives = 233/460 (50%), Gaps = 12/460 (2%)
Query: 60 EEGVKRKIFQLLREYPDIFAWSYEDMPGLDPKIVEHRIPTKPEYPPVRQKLRRTHPDMAL 119
E G RKI+ ++ ++ D+FA S +++ E I K P+RQK R +
Sbjct: 899 ENGDDRKIWDVIEQFQDVFAISDDELGRNSG--TECVIELKEGAEPIRQKPRPIPLALKP 956
Query: 120 KIKSEVQKQIDAGFLMTVEYPEWVANIVPVPKKDGKVRMCVDFRDLIKASPKDNFPLPHI 179
+I+ +QK ++ + + P W + +V V KKDG +RMC+D+R + K + PLP+I
Sbjct: 957 EIRKMIQKMLNQKVIRESKSP-WSSPVVLVKKKDGSIRMCIDYRKVNKVVKNNAHPLPNI 1015
Query: 180 DVLVDNTAQSKVFSFMDGFSGYNQIKMSPEDREKTSFITP*GTFCYKVMPFGLINAGATY 239
+ + + A K+++ D +G+ QI + + +E T+F F + V+PFGL+ + A +
Sbjct: 1016 EATLQSLAGKKLYTVFDMIAGFWQIPLDEKSKEITAFAIGSELFEWNVLPFGLVISPALF 1075
Query: 240 *RGMTTLFHDMIHKEVEVYVDDMIVKSKDEEQHVEYLTKMFERLRKYKLRLNPNKCTFGV 299
M + D++ VYVDD+++ SKD EQH++ + + R+RK ++L +KC
Sbjct: 1076 QGTMEEIIGDLLGVCAFVYVDDLLIASKDMEQHLQDVKEALTRIRKSGMKLRASKCHIAK 1135
Query: 300 RSGKLLGFIVSQKGIEVDPDKVRAIREMPAPQTEKQVRGFLGRLNYISRFISHMTATCGP 359
+ + LG V+ G+E K +++ P K+++ FLG + Y +FI +
Sbjct: 1136 KEVEYLGHKVTLDGVETQEVKTDKMKQFSRPTNVKELQSFLGLVGYYRKFILNFAQIASS 1195
Query: 360 IFKLLRKNQPIVWNDECQGAFDSIKNYLLEPPILV-PPVEG-----RPLIMYLSVFDESV 413
+ L+ +W E + AF +K + + P+L P VE RP ++Y + +
Sbjct: 1196 LTSLISAKVAWIWEKEQEIAFQELKKLVCQTPVLAQPDVEAALKGDRPFMIYTDASRKGI 1255
Query: 414 GCVLGQQDETGKKEHAIYYLSKKFTDCETRYTMLEKTCCALVWAAKRLRHYLVNHTTWLI 473
G VL Q+ G ++H I + SK + ETRY + + A+++A +R + + +
Sbjct: 1256 GAVLAQEGPDG-QQHPIAFASKALSPAETRYHITDLEALAMMFALRRFKTIIYGTAITVF 1314
Query: 474 SRMDQIKYIFEKPAVTRKIARWQMLLSEYD--IVFKAQKA 511
+ + + + + ++ RW + + E+D IV+ A KA
Sbjct: 1315 TDHKPLISLLKGSPLADRLWRWSIEILEFDVKIVYLAGKA 1354
Score = 104 bits (260), Expect = 1e-21
Identities = 90/339 (26%), Positives = 149/339 (43%), Gaps = 22/339 (6%)
Query: 784 VDEHEAEQLMHDVHDGTFGTHATGHTMSRKLLRAGYYWMTMEHDCYQHARKCHKCQIYAD 843
V E L+ ++H+G H M R + R +YW M R C KC D
Sbjct: 1460 VPEKIRTPLLKELHEGMLAGHFGIKKMWRMVHRK-FYWPQMRVCVENCVRTCAKCLCAND 1518
Query: 844 KIHVPPHALNVISSPWPFSMWGIDMIGRIEPKASNGHRFILVAIDYYTKWVEAASYANVT 903
+ +L +P + D++ + G+R+IL ID +TK+ A +
Sbjct: 1519 HSKLTS-SLTPYRMTFPLEIVACDLMD--VGLSVQGNRYILTIIDLFTKYGTAVPIPDKK 1575
Query: 904 KQVVAK-FIRNNIICRYGVPSKIITDNGTNLNNNVVQALCEEFKIEHHISSPYRPQMNGA 962
+ V K F+ I +P K++TD G N + KIEH + Y + NGA
Sbjct: 1576 AETVLKAFVERWAIGEGRIPLKLLTDQGKEFVNGLFAQFTHMLKIEHITTKGYNSRANGA 1635
Query: 963 VEAANKNIQKIVQKMVTTYKDWHEMLPYALHGYRTTVRSSTGATPFSLVYGMEAVLPLEV 1022
VE NK I I++K +W + + YA++ Y V +TG TP L++G + + PLE+
Sbjct: 1636 VERFNKTIMHIMKKKTAVPMEWDDQVVYAVYAYNNCVHENTGETPMFLMHGRDVMGPLEM 1695
Query: 1023 EIPSLRVIMEAKLSEAEWCQSRYDQLNLVEEKRMDAMARGQSYQARMKTAFDKKVRPREF 1082
I A + E + ++ + L + + + AM +SY K+ FD+K ++
Sbjct: 1696 SGEDAVGINYADMDEYKHLLTQ-ELLKVQKIAKEHAMREQESY----KSLFDQKYASKKH 1750
Query: 1083 K----GGELVL----KRRISQQPDPRGKWTPNYEGPYVV 1113
+ G ++L ++ +Q P KW+ GPY V
Sbjct: 1751 RFPQPGSRVLLEIPSEKLGAQCPKLVNKWS----GPYRV 1785
>POL2_DROME (P20825) Retrovirus-related Pol polyprotein from
transposon 297 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1059
Score = 202 bits (513), Expect = 6e-51
Identities = 125/418 (29%), Positives = 211/418 (49%), Gaps = 15/418 (3%)
Query: 110 LRRTHPDMALKIKSEVQKQIDAGFLMT----VEYPEWVANIVPVPKKDGKVRMCVDFRDL 165
L +TH ++++++VQ+ ++ G + P WV P K R+ +D+R L
Sbjct: 214 LAQTHE---IEVENQVQEMLNQGLIRESNSPYNSPTWVVPKKPDASGANKYRVVIDYRKL 270
Query: 166 IKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSPEDREKTSFITP*GTFCY 225
+ + D +P+P++D ++ + + F+ +D G++QI+M E KT+F T G + Y
Sbjct: 271 NEITIPDRYPIPNMDEILGKLGKCQYFTTIDLAKGFHQIEMDEESISKTAFSTKSGHYEY 330
Query: 226 KVMPFGLINAGATY*RGMTTLFHDMIHKEVEVYVDDMIVKSKDEEQHVEYLTKMFERLRK 285
MPFGL NA AT+ R M + +++K VY+DD+I+ S +H+ + +F +L
Sbjct: 331 LRMPFGLRNAPATFQRCMNNILRPLLNKHCLVYLDDIIIFSTSLTEHLNSIQLVFTKLAD 390
Query: 286 YKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMPAPQTEKQVRGFLGRLNY 345
L+L +KC F + LG IV+ GI+ +P KV+AI P P +K++R FLG Y
Sbjct: 391 ANLKLQLDKCEFLKKEANFLGHIVTPDGIKPNPIKVKAIVSYPIPTKDKEIRAFLGLTGY 450
Query: 346 ISRFISHMTATCGPIFKLLRKNQPI-VWNDECQGAFDSIKNYLLEPPILVPPVEGRPLIM 404
+FI + P+ L+K I E AF+ +K ++ PIL P + ++
Sbjct: 451 YRKFIPNYADIAKPMTSCLKKRTKIDTQKLEYIEAFEKLKALIIRDPILQLPDFEKKFVL 510
Query: 405 YLSVFDESVGCVLGQQDETGKKEHAIYYLSKKFTDCETRYTMLEKTCCALVWAAKRLRHY 464
+ ++G VL Q H I ++S+ D E Y+ +EK A+VWA K RHY
Sbjct: 511 TTDASNLALGAVLSQNG------HPISFISRTLNDHELNYSAIEKELLAIVWATKTFRHY 564
Query: 465 LVNHTTWLISRMDQIKYIFEKPAVTRKIARWQMLLSEYDIVFKAQKAIKGSILADHLA 522
L+ + S ++++ K+ RW++ LSEY K + S+ AD L+
Sbjct: 565 LLGRQFLIASDHQPLRWLHNLKEPGAKLERWRVRLSEYQFKIDYIKGKENSV-ADALS 621
>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
transposon 412 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1237
Score = 197 bits (502), Expect = 1e-49
Identities = 136/465 (29%), Positives = 228/465 (48%), Gaps = 13/465 (2%)
Query: 64 KRKIFQLLREYPDIFAWSYEDMPGLDPKIVEHRIPTKPEYPPVRQKLRRTHPDMALKIKS 123
K ++ + EY DIFA E P + + ++ K + P + R H + +I++
Sbjct: 276 KSQLENICSEYIDIFA--LESEPITVNNLYKQQLRLKDDEPVYTKNYRSPHSQVE-EIQA 332
Query: 124 EVQKQIDAGFLMTVEYPEWVANIVPVPKKDG------KVRMCVDFRDLIKASPKDNFPLP 177
+VQK I ++ ++ + ++ VPKK K R+ +D+R + K D FPLP
Sbjct: 333 QVQKLIKDK-IVEPSVSQYNSPLLLVPKKSSPNSDKKKWRLVIDYRQINKKLLADKFPLP 391
Query: 178 HIDVLVDNTAQSKVFSFMDGFSGYNQIKMSPEDREKTSFITP*GTFCYKVMPFGLINAGA 237
ID ++D ++K FS +D SG++QI++ R+ TSF T G++ + +PFGL A
Sbjct: 392 RIDDILDQLGRAKYFSCLDLMSGFHQIELDEGSRDITSFSTSNGSYRFTRLPFGLKIAPN 451
Query: 238 TY*RGMTTLFHDMIHKEVEVYVDDMIVKSKDEEQHVEYLTKMFERLRKYKLRLNPNKCTF 297
++ R MT F + + +Y+DD+IV E+ ++ LT++F + R+Y L+L+P KC+F
Sbjct: 452 SFQRMMTIAFSGIEPSQAFLYMDDLIVIGCSEKHMLKNLTEVFGKCREYNLKLHPEKCSF 511
Query: 298 GVRSGKLLGFIVSQKGIEVDPDKVRAIREMPAPQTEKQVRGFLGRLNYISRFISHMTATC 357
+ LG + KGI D K I+ P P R F+ NY RFI +
Sbjct: 512 FMHEVTFLGHKCTDKGILPDDKKYDVIQNYPVPHDADSARRFVAFCNYYRRFIKNFADYS 571
Query: 358 GPIFKLLRKNQPIVWNDECQGAFDSIKNYLLEPPILVPPVEGRPLIMYLSVFDESVGCVL 417
I +L +KN P W DECQ AF +K+ L+ P +L P + + ++ G VL
Sbjct: 572 RHITRLCKKNVPFEWTDECQKAFIHLKSQLINPTLLQYPDFSKEFCITTDASKQACGAVL 631
Query: 418 GQQDETGKKEHAIYYLSKKFTDCETRYTMLEKTCCALVWAAKRLRHYLVNHTTWLISRMD 477
Q + + Y S+ FT E+ + E+ A+ WA R Y+ + +
Sbjct: 632 TQNH--NGHQLPVAYASRAFTKGESNKSTTEQELAAIHWAIIHFRPYIYGKHFTVKTDHR 689
Query: 478 QIKYIFEKPAVTRKIARWQMLLSEYDIVFKAQKAIKGSILADHLA 522
+ Y+F + K+ R ++ L EY+ + K K + +AD L+
Sbjct: 690 PLTYLFSMVNPSSKLTRIRLELEEYNFTVEYLKG-KDNHVADALS 733
Score = 121 bits (304), Expect = 9e-27
Identities = 100/381 (26%), Positives = 171/381 (44%), Gaps = 29/381 (7%)
Query: 774 RNYDMVLLRCV----DEHEAEQLMHDVHDGTFGTHATGHTMSRKLLRAGYYWMTMEHDCY 829
+N + LL V +E E E ++ +HD TG T + ++ YYW M
Sbjct: 874 KNLKVALLNPVTQINNEKEKEAILSTLHDDPIQGGHTGITKTLAKVKRHYYWKNMSKYIK 933
Query: 830 QHARKCHKCQIYADKIHVPPHALNVISSPWPFSMWGIDMIGRIEPKASNGHRFILVAIDY 889
++ RKC KCQ H + F +D IG + PK+ NG+ + + I
Sbjct: 934 EYVRKCQKCQKAKTTKHTKTPMTITETPEHAFDRVVVDTIGPL-PKSENGNEYAVTLICD 992
Query: 890 YTKWVEAASYANVTKQVVAKFIRNNIICRYGVPSKIITDNGTNLNNNVVQALCEEFKIEH 949
TK++ A AN + + VAK I + I +YG ITD GT N+++ LC+ KI++
Sbjct: 993 LTKYLVAIPIANKSAKTVAKAIFESFILKYGPMKTFITDMGTEYKNSIITDLCKYLKIKN 1052
Query: 950 HISSPYRPQMNGAVEAANKNIQKIVQKMVTTYK-DWHEMLPYALHGYRTTVRSSTGATPF 1008
S+ + Q G VE +++ + + ++ ++T K DW L Y ++ + TT P+
Sbjct: 1053 ITSTAHHHQTVGVVERSHRTLNEYIRSYISTDKTDWDVWLQYFVYCFNTTQSMVHNYCPY 1112
Query: 1009 SLVYGMEAVLPLEVEIPSLRVIMEAKLSEAEWCQSRYDQLNLVEEKRMDAMARG----QS 1064
LV+G + LP KL E + D + + A AR ++
Sbjct: 1113 ELVFGRTSNLPKHFN----------KLHSIEPIYNIDDYAKESKYRLEVAYARARKLLEA 1162
Query: 1065 YQARMKTAFDKKVRPREFKGGELVLKRRISQQPDPRGKWTPNYEGPYVVKK-AFSGGALI 1123
++ + K +D KV+ E + G+ VL R + K Y GPY ++ + +
Sbjct: 1163 HKEKNKENYDLKVKDIELEVGDKVLLRN-----EVGHKLDFKYTGPYKIESIGDNNNITL 1217
Query: 1124 LTHMDGVELPNPVNADIVKKY 1144
LT+ + ++ V+ D +KK+
Sbjct: 1218 LTNKNKKQI---VHKDRLKKF 1235
>POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from
transposon opus [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1003
Score = 192 bits (488), Expect = 4e-48
Identities = 139/487 (28%), Positives = 231/487 (46%), Gaps = 34/487 (6%)
Query: 61 EGVKRKIFQLLREYPDIFAWSYEDMPGLDPKIVEHRIPTKPEYPPVRQKLRRTHPDMALK 120
+G + + LL E+P IF P L VE + + +++P +
Sbjct: 82 DGTQEILNSLLGEFPRIFE------PPLSGMSVETAVKAEIRTNTQDPIYAKSYP-YPVN 134
Query: 121 IKSEVQKQIDA----GFLMTVEYPE----WVANIVPVPKKDGKVRMCVDFRDLIKASPKD 172
++ EV++QID G + P W+ P P + + RM VDF+ L + D
Sbjct: 135 MRGEVERQIDELLQDGIIRPSNSPYNSPIWIVPKKPKPNGEKQYRMVVDFKRLNTVTIPD 194
Query: 173 NFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSPEDREKTSFITP*GTFCYKVMPFGL 232
+P+P I+ + + +K F+ +D SG++QI M D KT+F T G + + +PFGL
Sbjct: 195 TYPIPDINATLASLGNAKYFTTLDLTSGFHQIHMKESDIPKTAFSTLNGKYEFLRLPFGL 254
Query: 233 INAGATY*RGMTTLFHDMIHKEVEVYVDDMIVKSKDEEQHVEYLTKMFERLRKYKLRLNP 292
NA A + R + + + I K VY+DD+IV S+D + H + L + L K L++N
Sbjct: 255 KNAPAIFQRMIDDILREHIGKVCYVYIDDIIVFSEDYDTHWKNLRLVLASLSKANLQVNL 314
Query: 293 NKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMPAPQTEKQVRGFLGRLNYISRFISH 352
K F + LG+IV+ GI+ DP KVRAI EMP P + K+++ FLG +Y +FI
Sbjct: 315 EKSHFLDTQVEFLGYIVTADGIKADPKKVRAISEMPPPTSVKELKRFLGMTSYYRKFIQD 374
Query: 353 MTATCGPIFKLLR-----------KNQPIVWNDECQGAFDSIKNYLLEPPILVPPVEGRP 401
P+ L R PI ++ +F+ +K+ L IL P +P
Sbjct: 375 YAKVAKPLTNLTRGLYANIKSSQSSKVPITLDETALQSFNDLKSILCSSEILAFPCFTKP 434
Query: 402 LIMYLSVFDESVGCVLGQQDETGKKEHAIYYLSKKFTDCETRYTMLEKTCCALVWAAKRL 461
+ + ++G VL Q D+ ++ I Y+S+ E Y +EK A++W+ L
Sbjct: 435 FHLTTDASNWAIGAVLSQDDQ--GRDRPIAYISRSLNKTEENYATIEKEMLAIIWSLDNL 492
Query: 462 RHYLVNHTTWLISRMDQ-IKYIFEKPAVTRKIARWQMLLSEY--DIVFKAQKAIKGSILA 518
R YL T + Q + + K+ RW+ + EY ++++K K+ +++A
Sbjct: 493 RAYLYGAGTIKVYTDHQPLTFALGNRNFNAKLKRWKARIEEYNCELIYKPGKS---NVVA 549
Query: 519 DHLAYQP 525
D L+ P
Sbjct: 550 DALSRIP 556
Score = 35.0 bits (79), Expect = 1.2
Identities = 44/212 (20%), Positives = 84/212 (38%), Gaps = 17/212 (8%)
Query: 807 GHTMSRKLLRAGYYWMTMEHDCYQHARKCHKCQIYADKIHVPPHALNVISSP---WPFSM 863
G T R L YY+ M C C++Y + H P+ N+ +P +P +
Sbjct: 707 GPTEIRLQLLEKYYFPRMSSTIRLQTSSCQCCKLYKYERH--PNKPNLQPTPIPNYPCEI 764
Query: 864 WGIDMIGRIEPKASNGHRFILVAIDYYTKWVEAASYANVTKQVVAKFIRNNIICRYGVPS 923
ID+ + R L ID ++K+ + + V + + + P
Sbjct: 765 LHIDIFALEK-------RLYLSCIDKFSKFAKLF-HLQSKASVHLRETLVEALHYFTAPK 816
Query: 924 KIITDNGTNLNNNVVQALCEEFKIEHHISSPYRPQMNGAVEAANK---NIQKIVQKMVTT 980
+++DN L V I+ + + + ++NG VE + I + ++ + T
Sbjct: 817 VLVSDNERGLLCPTVLNYLRSLDIDLYYAPTQKSEVNGQVERFHSTFLEIYRCLKDELPT 876
Query: 981 YKDWHEMLPYALHGYRTTVRSSTGATPFSLVY 1012
+K E++ A+ Y T+V S T P + +
Sbjct: 877 FKP-VELVHIAVDRYNTSVHSVTNRKPADVFF 907
>RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protein
type 1
Length = 1333
Score = 185 bits (469), Expect = 7e-46
Identities = 124/460 (26%), Positives = 229/460 (48%), Gaps = 29/460 (6%)
Query: 95 HRIPTKPEYPPVRQKLRRTHPDMALKIKSEVQKQIDAGFLMTVEYPEWVANIVPVPKKDG 154
+R+P + YP K++ + ++ +KS + ++ A V + VPKK+G
Sbjct: 411 YRLPIR-NYPLPPGKMQAMNDEINQGLKSGIIRESKAINACPVMF---------VPKKEG 460
Query: 155 KVRMCVDFRDLIKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSPEDREKT 214
+RM VD++ L K + +PLP I+ L+ S +F+ +D S Y+ I++ D K
Sbjct: 461 TLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKL 520
Query: 215 SFITP*GTFCYKVMPFGLINAGATY*RGMTTLFHDMIHKEVEVYVDDMIVKSKDEEQHVE 274
+F P G F Y VMP+G+ A A + + T+ + V Y+DD+++ SK E +HV+
Sbjct: 521 AFRCPRGVFEYLVMPYGISTAPAHFQYFINTILGEAKESHVVCYMDDILIHSKSESEHVK 580
Query: 275 YLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMPAPQTEK 334
++ + ++L+ L +N KC F K +G+ +S+KG + + + + P+ K
Sbjct: 581 HVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRK 640
Query: 335 QVRGFLGRLNYISRFISHMTATCGPIFKLLRKNQPIVWNDECQGAFDSIKNYLLEPPILV 394
++R FLG +NY+ +FI + P+ LL+K+ W A ++IK L+ PP+L
Sbjct: 641 ELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLR 700
Query: 395 PPVEGRPLIMYLSVFDESVGCVLGQQDETGKKEHAIYYLSKKFTDCETRYTMLEKTCCAL 454
+ +++ D +VG VL Q+ + K + + Y S K + + Y++ +K A+
Sbjct: 701 HFDFSKKILLETDASDVAVGAVLSQKHD-DDKYYPVGYYSAKMSKAQLNYSVSDKEMLAI 759
Query: 455 VWAAKRLRHY----------LVNHTTWLISRMDQIKYIFEKPAVTRKIARWQMLLSEYDI 504
+ + K RHY L +H LI R+ E +++ARWQ+ L +++
Sbjct: 760 IKSLKHWRHYLESTIEPFKILTDHRN-LIGRITN-----ESEPENKRLARWQLFLQDFNF 813
Query: 505 VFKAQKAIKGSILADHLAYQPLDDYQPIKFDFPDEDIMYL 544
+ + +AD L+ + +D+ +PI D D I ++
Sbjct: 814 EINYRPG-SANHIADALS-RIVDETEPIPKDSEDNSINFV 851
Score = 99.4 bits (246), Expect = 5e-20
Identities = 114/506 (22%), Positives = 215/506 (41%), Gaps = 55/506 (10%)
Query: 619 IDMRIKHLNIYGDSALVINQIKGEWETHHAKLIPYRDYARRLLTYFTKVELHHIPRDENQ 678
++ I+ I D +I +I E E + +L ++ + + E+++ P N
Sbjct: 770 LESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDF-----NFEINYRPGSANH 824
Query: 679 MADALGTISSMFRVNHWNDVPIIKVQR-LERPSHVFTIEDMINRADGNVVDDRPWYYDTK 737
+ADAL I V+ +P + + + +D N+ VV + + DTK
Sbjct: 825 IADALSRI-----VDETEPIPKDSEDNSINFVNQISITDDFKNQ----VVTE--YTNDTK 873
Query: 738 QFLLSREYPPGASNKDKKTLRRLAGRFLLDGDILYKRNYDMVLLRCVDEHEAEQLMHDVH 797
L +N+DK+ + L DG ++ + D +LL D ++ H
Sbjct: 874 LLNL-------LNNEDKRVEENIQ---LKDGLLINSK--DQILLPN-DTQLTRTIIKKYH 920
Query: 798 DGTFGTHATGHTMSRKLLRAGYYWMTMEHDCYQHARKCHKCQIYADKIHVPPHALNVIS- 856
+ H ++ +LR + W + ++ + CH CQI + H P L I
Sbjct: 921 EEGKLIHPGIELLTNIILRR-FTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPP 979
Query: 857 SPWPFSMWGIDMIGRIEPKASNGHRFILVAIDYYTKW-VEAASYANVTKQVVAKFIRNNI 915
S P+ +D I + S+G+ + V +D ++K + ++T + A+ +
Sbjct: 980 SERPWESLSMDFITALPE--SSGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRV 1037
Query: 916 ICRYGVPSKIITDNGTNLNNNVVQALCEEFKIEHHISSPYRPQMNGAVEAANKNIQKIVQ 975
I +G P +II DN + + ++ S PYRPQ +G E N+ ++K+++
Sbjct: 1038 IAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLR 1097
Query: 976 KMVTTYKD-WHEMLPYALHGYRTTVRSSTGATPFSLVYGMEAVLPLEVEIPSLRVIMEAK 1034
+ +T+ + W + + Y + S+T TPF +V+ L +E+PS +
Sbjct: 1098 CVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALS-PLELPSFSDKTDEN 1156
Query: 1035 LSEA-EWCQSRYDQLNLVEEKRMDAMARGQSYQARMKTAFDKKVRP-REFKGGELVL-KR 1091
E + Q+ + LN + +MK FD K++ EF+ G+LV+ KR
Sbjct: 1157 SQETIQVFQTVKEHLN--------------TNNIKMKKYFDMKIQEIEEFQPGDLVMVKR 1202
Query: 1092 RISQQPDPRGKWTPNYEGP-YVVKKA 1116
+ K P++ GP YV++K+
Sbjct: 1203 TKTGFLHKSNKLAPSFAGPFYVLQKS 1228
>RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protein
type 3
Length = 1333
Score = 182 bits (463), Expect = 3e-45
Identities = 123/460 (26%), Positives = 230/460 (49%), Gaps = 29/460 (6%)
Query: 95 HRIPTKPEYPPVRQKLRRTHPDMALKIKSEVQKQIDAGFLMTVEYPEWVANIVPVPKKDG 154
+R+P + YP K++ + ++ +KS + ++ A V + VPKK+G
Sbjct: 411 YRLPIR-NYPLPPGKMQAMNDEINQGLKSGIIRESKAINACPVMF---------VPKKEG 460
Query: 155 KVRMCVDFRDLIKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSPEDREKT 214
+RM VD++ L K + +PLP I+ L+ S +F+ +D S Y+ I++ D K
Sbjct: 461 TLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKL 520
Query: 215 SFITP*GTFCYKVMPFGLINAGATY*RGMTTLFHDMIHKEVEVYVDDMIVKSKDEEQHVE 274
+F P G F Y VMP+G+ A A + + T+ ++ V Y+D++++ SK E +HV+
Sbjct: 521 AFRCPRGVFEYLVMPYGISIAPAHFQYFINTILGEVKESHVVCYMDNILIHSKSESEHVK 580
Query: 275 YLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMPAPQTEK 334
++ + ++L+ L +N KC F K +G+ +S+KG + + + + P+ K
Sbjct: 581 HVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRK 640
Query: 335 QVRGFLGRLNYISRFISHMTATCGPIFKLLRKNQPIVWNDECQGAFDSIKNYLLEPPILV 394
++R FLG +NY+ +FI + P+ LL+K+ W A ++IK L+ PP+L
Sbjct: 641 ELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLR 700
Query: 395 PPVEGRPLIMYLSVFDESVGCVLGQQDETGKKEHAIYYLSKKFTDCETRYTMLEKTCCAL 454
+ +++ D +VG VL Q+ + K + + Y S K + + Y++ +K A+
Sbjct: 701 HFDFSKKILLETDASDVAVGAVLSQKHD-DDKYYPVGYYSAKMSKAQLNYSVSDKEMLAI 759
Query: 455 VWAAKRLRHY----------LVNHTTWLISRMDQIKYIFEKPAVTRKIARWQMLLSEYDI 504
+ + K RHY L +H LI R+ E +++ARWQ+ L +++
Sbjct: 760 IKSLKHWRHYLESTIEPFKILTDHRN-LIGRITN-----ESEPENKRLARWQLFLQDFNF 813
Query: 505 VFKAQKAIKGSILADHLAYQPLDDYQPIKFDFPDEDIMYL 544
+ + +AD L+ + +D+ +PI D D I ++
Sbjct: 814 EINYRPG-SANHIADALS-RIVDETEPIPKDSEDNSINFV 851
Score = 99.4 bits (246), Expect = 5e-20
Identities = 114/506 (22%), Positives = 215/506 (41%), Gaps = 55/506 (10%)
Query: 619 IDMRIKHLNIYGDSALVINQIKGEWETHHAKLIPYRDYARRLLTYFTKVELHHIPRDENQ 678
++ I+ I D +I +I E E + +L ++ + + E+++ P N
Sbjct: 770 LESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDF-----NFEINYRPGSANH 824
Query: 679 MADALGTISSMFRVNHWNDVPIIKVQR-LERPSHVFTIEDMINRADGNVVDDRPWYYDTK 737
+ADAL I V+ +P + + + +D N+ VV + + DTK
Sbjct: 825 IADALSRI-----VDETEPIPKDSEDNSINFVNQISITDDFKNQ----VVTE--YTNDTK 873
Query: 738 QFLLSREYPPGASNKDKKTLRRLAGRFLLDGDILYKRNYDMVLLRCVDEHEAEQLMHDVH 797
L +N+DK+ + L DG ++ + D +LL D ++ H
Sbjct: 874 LLNL-------LNNEDKRVEENIQ---LKDGLLINSK--DQILLPN-DTQLTRTIIKKYH 920
Query: 798 DGTFGTHATGHTMSRKLLRAGYYWMTMEHDCYQHARKCHKCQIYADKIHVPPHALNVIS- 856
+ H ++ +LR + W + ++ + CH CQI + H P L I
Sbjct: 921 EEGKLIHPGIELLTNIILRR-FTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPP 979
Query: 857 SPWPFSMWGIDMIGRIEPKASNGHRFILVAIDYYTKW-VEAASYANVTKQVVAKFIRNNI 915
S P+ +D I + S+G+ + V +D ++K + ++T + A+ +
Sbjct: 980 SERPWESLSMDFITALPE--SSGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRV 1037
Query: 916 ICRYGVPSKIITDNGTNLNNNVVQALCEEFKIEHHISSPYRPQMNGAVEAANKNIQKIVQ 975
I +G P +II DN + + ++ S PYRPQ +G E N+ ++K+++
Sbjct: 1038 IAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLR 1097
Query: 976 KMVTTYKD-WHEMLPYALHGYRTTVRSSTGATPFSLVYGMEAVLPLEVEIPSLRVIMEAK 1034
+ +T+ + W + + Y + S+T TPF +V+ L +E+PS +
Sbjct: 1098 CVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALS-PLELPSFSDKTDEN 1156
Query: 1035 LSEA-EWCQSRYDQLNLVEEKRMDAMARGQSYQARMKTAFDKKVRP-REFKGGELVL-KR 1091
E + Q+ + LN + +MK FD K++ EF+ G+LV+ KR
Sbjct: 1157 SQETIQVFQTVKEHLN--------------TNNIKMKKYFDMKIQEIEEFQPGDLVMVKR 1202
Query: 1092 RISQQPDPRGKWTPNYEGP-YVVKKA 1116
+ K P++ GP YV++K+
Sbjct: 1203 TKTGFLHKSNKLAPSFAGPFYVLQKS 1228
>RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protein
type 2
Length = 1333
Score = 182 bits (463), Expect = 3e-45
Identities = 123/460 (26%), Positives = 230/460 (49%), Gaps = 29/460 (6%)
Query: 95 HRIPTKPEYPPVRQKLRRTHPDMALKIKSEVQKQIDAGFLMTVEYPEWVANIVPVPKKDG 154
+R+P + YP K++ + ++ +KS + ++ A V + VPKK+G
Sbjct: 411 YRLPIR-NYPLPPGKMQAMNDEINQGLKSGIIRESKAINACPVMF---------VPKKEG 460
Query: 155 KVRMCVDFRDLIKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSPEDREKT 214
+RM VD++ L K + +PLP I+ L+ S +F+ +D S Y+ I++ D K
Sbjct: 461 TLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKL 520
Query: 215 SFITP*GTFCYKVMPFGLINAGATY*RGMTTLFHDMIHKEVEVYVDDMIVKSKDEEQHVE 274
+F P G F Y VMP+G+ A A + + T+ ++ V Y+D++++ SK E +HV+
Sbjct: 521 AFRCPRGVFEYLVMPYGISIAPAHFQYFINTILGEVKESHVVCYMDNILIHSKSESEHVK 580
Query: 275 YLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMPAPQTEK 334
++ + ++L+ L +N KC F K +G+ +S+KG + + + + P+ K
Sbjct: 581 HVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRK 640
Query: 335 QVRGFLGRLNYISRFISHMTATCGPIFKLLRKNQPIVWNDECQGAFDSIKNYLLEPPILV 394
++R FLG +NY+ +FI + P+ LL+K+ W A ++IK L+ PP+L
Sbjct: 641 ELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLR 700
Query: 395 PPVEGRPLIMYLSVFDESVGCVLGQQDETGKKEHAIYYLSKKFTDCETRYTMLEKTCCAL 454
+ +++ D +VG VL Q+ + K + + Y S K + + Y++ +K A+
Sbjct: 701 HFDFSKKILLETDASDVAVGAVLSQKHD-DDKYYPVGYYSAKMSKAQLNYSVSDKEMLAI 759
Query: 455 VWAAKRLRHY----------LVNHTTWLISRMDQIKYIFEKPAVTRKIARWQMLLSEYDI 504
+ + K RHY L +H LI R+ E +++ARWQ+ L +++
Sbjct: 760 IKSLKHWRHYLESTIEPFKILTDHRN-LIGRITN-----ESEPENKRLARWQLFLQDFNF 813
Query: 505 VFKAQKAIKGSILADHLAYQPLDDYQPIKFDFPDEDIMYL 544
+ + +AD L+ + +D+ +PI D D I ++
Sbjct: 814 EINYRPG-SANHIADALS-RIVDETEPIPKDSEDNSINFV 851
Score = 99.4 bits (246), Expect = 5e-20
Identities = 114/506 (22%), Positives = 215/506 (41%), Gaps = 55/506 (10%)
Query: 619 IDMRIKHLNIYGDSALVINQIKGEWETHHAKLIPYRDYARRLLTYFTKVELHHIPRDENQ 678
++ I+ I D +I +I E E + +L ++ + + E+++ P N
Sbjct: 770 LESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDF-----NFEINYRPGSANH 824
Query: 679 MADALGTISSMFRVNHWNDVPIIKVQR-LERPSHVFTIEDMINRADGNVVDDRPWYYDTK 737
+ADAL I V+ +P + + + +D N+ VV + + DTK
Sbjct: 825 IADALSRI-----VDETEPIPKDSEDNSINFVNQISITDDFKNQ----VVTE--YTNDTK 873
Query: 738 QFLLSREYPPGASNKDKKTLRRLAGRFLLDGDILYKRNYDMVLLRCVDEHEAEQLMHDVH 797
L +N+DK+ + L DG ++ + D +LL D ++ H
Sbjct: 874 LLNL-------LNNEDKRVEENIQ---LKDGLLINSK--DQILLPN-DTQLTRTIIKKYH 920
Query: 798 DGTFGTHATGHTMSRKLLRAGYYWMTMEHDCYQHARKCHKCQIYADKIHVPPHALNVIS- 856
+ H ++ +LR + W + ++ + CH CQI + H P L I
Sbjct: 921 EEGKLIHPGIELLTNIILRR-FTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPP 979
Query: 857 SPWPFSMWGIDMIGRIEPKASNGHRFILVAIDYYTKW-VEAASYANVTKQVVAKFIRNNI 915
S P+ +D I + S+G+ + V +D ++K + ++T + A+ +
Sbjct: 980 SERPWESLSMDFITALPE--SSGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRV 1037
Query: 916 ICRYGVPSKIITDNGTNLNNNVVQALCEEFKIEHHISSPYRPQMNGAVEAANKNIQKIVQ 975
I +G P +II DN + + ++ S PYRPQ +G E N+ ++K+++
Sbjct: 1038 IAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLR 1097
Query: 976 KMVTTYKD-WHEMLPYALHGYRTTVRSSTGATPFSLVYGMEAVLPLEVEIPSLRVIMEAK 1034
+ +T+ + W + + Y + S+T TPF +V+ L +E+PS +
Sbjct: 1098 CVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALS-PLELPSFSDKTDEN 1156
Query: 1035 LSEA-EWCQSRYDQLNLVEEKRMDAMARGQSYQARMKTAFDKKVRP-REFKGGELVL-KR 1091
E + Q+ + LN + +MK FD K++ EF+ G+LV+ KR
Sbjct: 1157 SQETIQVFQTVKEHLN--------------TNNIKMKKYFDMKIQEIEEFQPGDLVMVKR 1202
Query: 1092 RISQQPDPRGKWTPNYEGP-YVVKKA 1116
+ K P++ GP YV++K+
Sbjct: 1203 TKTGFLHKSNKLAPSFAGPFYVLQKS 1228
>POLY_DROME (P10401) Retrovirus-related Pol polyprotein from
transposon gypsy [Contains: Reverse transcriptase (EC
2.7.7.49); Endonuclease]
Length = 1035
Score = 174 bits (442), Expect = 9e-43
Identities = 119/388 (30%), Positives = 196/388 (49%), Gaps = 22/388 (5%)
Query: 157 RMCVDFRDLIKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSPEDREKTSF 216
R+ +DFR L + + D +P+P I +++ N ++K F+ +D SGY+QI ++ DREKTSF
Sbjct: 238 RLVIDFRKLNEKTIPDRYPMPSIPMILANLGKAKFFTTLDLKSGYHQIYLAEHDREKTSF 297
Query: 217 ITP*GTFCYKVMPFGLINAGATY*RGMTTLFHDMIHKEVEVYVDDMIVKSKDEEQHVEYL 276
G + + +PFGL NA + + R + + + I K VYVDD+I+ S++E HV ++
Sbjct: 298 SVNGGKYEFCRLPFGLRNASSIFQRALDDVLREQIGKICYVYVDDVIIFSENESDHVRHI 357
Query: 277 TKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMPAPQTEKQV 336
+ + L +R++ K F S + LGFIVS+ G + DP+KV+AI+E P P +V
Sbjct: 358 DTVLKCLIDANMRVSQEKTRFFKESVEYLGFIVSKDGTKSDPEKVKAIQEYPEPDCVYKV 417
Query: 337 RGFLGRLNYISRFISHMTATCGPIFKLLR-----------KNQPIVWNDECQGAFDSIKN 385
R FLG +Y FI A PI +L+ K P+ +N+ + AF ++N
Sbjct: 418 RSFLGLASYYRVFIKDFAAIARPITDILKGENGSVSKHMSKKIPVEFNETQRNAFQRLRN 477
Query: 386 YLL-EPPILVPPVEGRPLIMYLSVFDESVGCVLGQQDETGKKEHAIYYLSKKFTDCETRY 444
L E IL P +P + +G VL Q+ I +S+ E Y
Sbjct: 478 ILASEDVILKYPDFKKPFDLTTDASASGIGAVLSQEG------RPITMISRTLKQPEQNY 531
Query: 445 TMLEKTCCALVWAAKRLRHYLV-NHTTWLISRMDQIKYIFEKPAVTRKIARWQMLLSEYD 503
E+ A+VWA +L+++L + + + + + KI RW+ + +++
Sbjct: 532 ATNERELLAIVWALGKLQNFLYGSREINIFTDHQPLTFAVADRNTNAKIKRWKSYIDQHN 591
Query: 504 I-VFKAQKAIKGSILADHLAYQPLDDYQ 530
VF K K + +AD L+ Q L+ Q
Sbjct: 592 AKVF--YKPGKENFVADALSRQNLNALQ 617
>POL_FOAMV (P14350) Pol polyprotein [Contains: Reverse
transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
(RT); Integrase (IN)]
Length = 886
Score = 156 bits (395), Expect = 3e-37
Identities = 195/887 (21%), Positives = 352/887 (38%), Gaps = 73/887 (8%)
Query: 146 IVPVPKKDGKVRMCVDFRDLIKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIK 205
+ PVPK DG+ RM +D+R++ K P H ++ + K + +D +G+
Sbjct: 5 VYPVPKPDGRWRMVLDYREVNKTIPLTAAQNQHSAGILATIVRQKYKTTLDLANGFWAHP 64
Query: 206 MSPEDREKTSFITP*GTFCYKVMPFGLINAGATY*RGMTTLFHDMIHKEVEVYVDDMIVK 265
++PE T+F +C+ +P G +N+ A + + L ++ V+VYVDD+ +
Sbjct: 65 ITPESYWLTAFTWQGKQYCWTRLPQGFLNSPALFTADVVDLLKEI--PNVQVYVDDIYLS 122
Query: 266 SKDEEQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIR 325
D ++HV+ L K+F+ L + ++ K G ++ + LGF ++++G + +
Sbjct: 123 HDDPKEHVQQLEKVFQILLQAGYVVSLKKSEIGQKTVEFLGFNITKEGRGLTDTFKTKLL 182
Query: 326 EMPAPQTEKQVRGFLGRLNYISRFISHMTATCGPIFKLL--RKNQPIVWNDECQGAFDSI 383
+ P+ KQ++ LG LN+ FI + P++ L+ K + I W++E + +
Sbjct: 183 NITPPKDLKQLQSILGLLNFARNFIPNFAELVQPLYNLIASAKGKYIEWSEENTKQLNMV 242
Query: 384 KNYLLEPPILVPPVEGRPLIMYLSVFDESVGCVLGQQDETGKKEHAIYYLSKKFTDCETR 443
L L + + L++ ++ S G V +ETGKK I YL+ F+ E +
Sbjct: 243 IEALNTASNLEERLPEQRLVIKVNT-SPSAGYV-RYYNETGKK--PIMYLNYVFSKAELK 298
Query: 444 YTMLEKTCCALVWAAKRLRHYLVNHTTWLISRMDQIKYIFEKPAVTRKI-----ARWQML 498
++MLEK + A + + + S + + I + P RK W
Sbjct: 299 FSMLEKLLTTMHKALIKAMDLAMGQEILVYSPIVSMTKIQKTPLPERKALPIRWITWMTY 358
Query: 499 LSEYDIVFKAQKAIKGSILADHLAYQPLDDYQPIKFDFPDEDIMYLKSKDCEEPLIGEGP 558
L + I F K + H+ P+K E + Y + P
Sbjct: 359 LEDPRIQFHYDKTLPE---LKHIPDVYTSSQSPVKHPSQYEGVFYTDGSAI------KSP 409
Query: 559 DPDSKWGLVFDGDVNAYGKGIGAVIVSPQGHHIPFTARILFECTNNMAEYEACIFGIEEA 618
DP N G GI P+ + + L T MAE A F ++A
Sbjct: 410 DPTKS---------NNAGMGIVHATYKPEYQVLNQWSIPLGNHTAQMAEIAAVEFACKKA 460
Query: 619 IDMRIKHLNIYGDSALVINQIKGEWETHHAKLIPYRDYARRLLTYFTKVELHHIPRDENQ 678
+ + L + DS V E +PY + K L HI + ++
Sbjct: 461 LKIPGPVL-VITDSFYVAESANKE--------LPY--WKSNGFVNNKKKPLKHISKWKS- 508
Query: 679 MADALGTISSMFRVNHWNDVPIIKVQRLERPSHVF---TIEDMINRADGNVVDDRPWYYD 735
+A+ L ++ + H + L+ P + + D + VV+ +
Sbjct: 509 IAECL-SMKPDITIQHEKGI------SLQIPVFILKGNALADKLATQGSYVVN-----CN 556
Query: 736 TKQFLLSREYPPGASNKDKKTLRRLAGRFLLDGDILYKRNYDMVLLRCVDEHEAEQLMHD 795
TK+ L E K + FL DG + R + ++ + + ++++
Sbjct: 557 TKKPNLDAELDQLLQGHYIKGYPKQYTYFLEDGKVKVSRPEGVKII--PPQSDRQKIVLQ 614
Query: 796 VHDGTFGTHATGHTMSRKLLRAGYYWMTMEHDCYQHARKCHKCQIYADKIHVPPHALNVI 855
H+ TG + + Y+W M D + +C +C I L
Sbjct: 615 AHN----LAHTGREATLLKIANLYWWPNMRKDVVKQLGRCQQCLITNASNKASGPILRPD 670
Query: 856 SSPWPFSMWGIDMIGRIEPKASNGHRFILVAIDYYT--KWVEAASYANVTKQVVAKFIRN 913
PF + ID IG + P S G+ ++LV +D T W+ Y A
Sbjct: 671 RPQKPFDKFFIDYIGPLPP--SQGYLYVLVVVDGMTGFTWL----YPTKAPSTSATVKSL 724
Query: 914 NIICRYGVPSKIITDNGTNLNNNVVQALCEEFKIEHHISSPYRPQMNGAVEAANKNIQKI 973
N++ +P I +D G ++ +E I S+PY PQ VE N +I+++
Sbjct: 725 NVLTSIAIPKVIHSDQGAAFTSSTFAEWAKERGIHLEFSTPYHPQSGSKVERKNSDIKRL 784
Query: 974 VQK-MVTTYKDWHEMLPYALHGYRTTVRSSTGATPFSLVYGMEAVLP 1019
+ K +V W+++LP T TP L++G+++ P
Sbjct: 785 LTKLLVGRPTKWYDLLPVVQLALNNTYSPVLKYTPHQLLFGIDSNTP 831
>POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 679
Score = 123 bits (309), Expect = 2e-27
Identities = 130/525 (24%), Positives = 222/525 (41%), Gaps = 53/525 (10%)
Query: 18 PYEITRLLEQERKAIEPHQEEI---------ELINLGTEENKREIKVGAALEEGVK---R 65
P IT+L R IE E + E +N+ T + + ++ A L EG +
Sbjct: 141 PVHITKLTRAVRVGIEGFLESMKKRSKTQQPEPVNISTNKIENPLEEIAILSEGRRLSEE 200
Query: 66 KIF---QLLREYPDIFAWSYEDMPGLDPKIVEHRIPTKPEYPPVRQKLRRTHPDMALKIK 122
K+F Q +++ ++ + P LDP + + + ++ + P A+K+K
Sbjct: 201 KLFITQQRMQKIEELLEKVCSENP-LDPNKTKQWM---------KASIKLSDPSKAIKVK 250
Query: 123 ---------SEVQKQIDAGFLMTVEYPEWVANIVPV-------PKKDGKVRMCVDFRDLI 166
E KQI + V P ++ P K+ GK RM V+++ +
Sbjct: 251 PMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMN 310
Query: 167 KASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSPEDREKTSFITP*GTFCYK 226
KA+ D + LP+ D L+ K+FS D SG+ Q+ + E R T+F P G + +
Sbjct: 311 KATIGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWN 370
Query: 227 VMPFGLINAGATY*RGMTTLFHDMIHKEVEVYVDDMIVKSKDEEQHVEYLTKMFERLRKY 286
V+PFGL A + + R M F + K VYVDD++V S +EE H+ ++ + ++ ++
Sbjct: 371 VVPFGLKQAPSIFQRHMDEAFR-VFRKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQH 429
Query: 287 KLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMP-APQTEKQVRGFLGRLNY 345
+ L+ K + LG + + + + I + P + +KQ++ FLG L Y
Sbjct: 430 GIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTY 489
Query: 346 ISRFISHMTATCGPIFKLLRKNQPIVWNDECQGAFDSIKNYLLEPPILVPPVEGRPLIMY 405
S +I + P+ L++N P W E +K L P L P+ LI+
Sbjct: 490 ASDYIPKLAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIE 549
Query: 406 LSVFDESVGCVLG--QQDETGKKEHAIYYLSKKFTDCETRYTMLEKTCCALVWAAKRLRH 463
D+ G +L + +E E Y S F E Y +K A++ K+
Sbjct: 550 TDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAERNYHSNDKETLAVINTIKKFSI 609
Query: 464 YLVNHTTWLISRMDQ------IKYIFEKPAVTRKIARWQMLLSEY 502
YL + R D + ++ + + RWQ LS Y
Sbjct: 610 YLT--PVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLSHY 652
>POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 679
Score = 122 bits (305), Expect = 7e-27
Identities = 127/509 (24%), Positives = 218/509 (41%), Gaps = 46/509 (9%)
Query: 25 LEQERKAIEPHQEEIELINLGTEENKREIKVGAALEEGVK---RKIF---QLLREYPDIF 78
LE +K + Q E +N+ T + + +K A L EG + K+F Q +++ ++
Sbjct: 159 LESMKKRSKTQQPEP--VNISTNKIENPLKEIAILSEGRRLSEEKLFITQQRMQKIEELL 216
Query: 79 AWSYEDMPGLDPKIVEHRIPTKPEYPPVRQKLRRTHPDMALKIK---------SEVQKQI 129
+ P LDP + + + ++ + P A+K+K E KQI
Sbjct: 217 EKVCSENP-LDPNKTKQWM---------KASIKLSDPSKAIKVKPMKYSPMDREEFDKQI 266
Query: 130 DAGFLMTVEYPEWVANIVPV-------PKKDGKVRMCVDFRDLIKASPKDNFPLPHIDVL 182
+ V P ++ P K+ GK RM V+++ + KA+ D + LP+ D L
Sbjct: 267 KELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATIGDAYNLPNKDEL 326
Query: 183 VDNTAQSKVFSFMDGFSGYNQIKMSPEDREKTSFITP*GTFCYKVMPFGLINAGATY*RG 242
+ K+FS D SG+ Q+ + E R T+F P G + + V+PFGL A + + R
Sbjct: 327 LTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRH 386
Query: 243 MTTLFHDMIHKEVEVYVDDMIVKSKDEEQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSG 302
M F + K VYVDD++V S +EE H+ ++ + ++ ++ + L+ K +
Sbjct: 387 MDEAFR-VFRKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKI 445
Query: 303 KLLGFIVSQKGIEVDPDKVRAIREMP-APQTEKQVRGFLGRLNYISRFISHMTATCGPIF 361
LG + + + + I + P + +KQ++ FLG L Y S +I + P+
Sbjct: 446 NFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQ 505
Query: 362 KLLRKNQPIVWNDECQGAFDSIKNYLLEPPILVPPVEGRPLIMYLSVFDESVGCVLG--Q 419
L++N P W E +K L P L P+ LI+ D+ G +L +
Sbjct: 506 AKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIK 565
Query: 420 QDETGKKEHAIYYLSKKFTDCETRYTMLEKTCCALVWAAKRLRHYLVNHTTWLISRMDQ- 478
+E E Y S F E Y +K A++ K+ YL + R D
Sbjct: 566 INEGTNTELICRYASGSFKAAERNYHSNDKETLAVINTIKKFSIYLT--PVHFLIRTDNT 623
Query: 479 -----IKYIFEKPAVTRKIARWQMLLSEY 502
+ ++ + + RWQ LS Y
Sbjct: 624 HFKSFVNLNYKGDSKLGRNIRWQAWLSHY 652
>YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II
Length = 1268
Score = 121 bits (304), Expect = 9e-27
Identities = 89/294 (30%), Positives = 148/294 (50%), Gaps = 12/294 (4%)
Query: 60 EEGVKRKIFQLLREYPDIFAWSYEDMPGLDPKIVEHRIPTKPEYPPVRQKLRRTHPDMAL 119
E R L ++P++F +D GL K + T+ PV ++ R
Sbjct: 400 ETEASRLEVMLKNDFPEVF----KDGLGLCTK-EKAEFRTEENAVPVFKRARPVPYGSLE 454
Query: 120 KIKSEVQKQIDAGFLMTVEYPEWVANIVPVPKKD-GKVRMCVDFR-DLIKASPKDNF-PL 176
+++E+ + + G ++ + Y +W A IV + KK GK+R+C DF+ + A+ KD F PL
Sbjct: 455 AVETELNRLQEMGVIVPITYAKWAAPIVVIKKKGTGKIRVCADFKCSGLNAALKDEFHPL 514
Query: 177 PHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSPEDREKTSFITP*GTFCYKVMPFGLINAG 236
P + + + V+S +D Y Q+++ E ++ T G F Y M FGL A
Sbjct: 515 PTSEDIFSRL-KGTVYSQIDLKDAYLQVELDEEAQKLAVINTHRGIFKYLRMTFGLKPAP 573
Query: 237 ATY*RGMTTLFHDMIHKEVEVYVDDMIVKSKDEEQHVEYLTKMFERLRKYKLRLNPNKCT 296
A++ + M + + V VY DD+I+ + E+H + L ++FER ++Y R++ KC
Sbjct: 574 ASFQKIMDKMVSGLTG--VAVYWDDIIISASSIEEHEKILRELFERFKEYGFRVSAEKCA 631
Query: 297 FGVRSGKLLGFIVSQKGIEVDPDKVRAIREMPAPQTEKQVRGFLGRLNYISRFI 350
F + LGF V + G D K AIR M AP +KQ+ FLG +++SR +
Sbjct: 632 FAQKQVTFLGF-VDEHGRRPDSKKTEAIRSMKAPTDQKQLASFLGAADWLSRMM 684
Score = 85.5 bits (210), Expect = 8e-16
Identities = 76/320 (23%), Positives = 138/320 (42%), Gaps = 44/320 (13%)
Query: 757 LRRLAGRFLLDGDILYKRNYDMVLLRCVDEHEAEQLMHDVHDGTFGTHATGHTMSRKLLR 816
L+ + G LLD ++ ++ ++L+ +H+ H G ++ R
Sbjct: 763 LKLIHGCLLLDDRVIVPKSLQKIVLK---------QLHEGHPGI--------VQMKQKAR 805
Query: 817 AGYYWMTMEHDCYQHARKCHKCQIYADKIHVPPHALNVISSPWPF--SMWG---IDMIGR 871
+ +W ++ D R C+ CQ + V P +PWP + W ID G
Sbjct: 806 SFVFWRGLDSDIENMVRHCNNCQENSKMPRVVP------LNPWPVPEAPWKRIHIDFAGP 859
Query: 872 IEPKASNGHRFILVAIDYYTKWVEAASYANVTKQVVAKFIRNNIICRYGVPSKIITDNGT 931
+ NG ++LV +D TK+ E +++ + I +G P II+DNGT
Sbjct: 860 L-----NGC-YLLVVVDAKTKYAEVKLTRSISAVTTIDLLEE-IFSIHGYPETIISDNGT 912
Query: 932 NLNNNVVQALCEEFKIEHHISSPYRPQMNGAVEAANKNIQKIVQKMVTTYKDWHEMLPYA 991
L +++ +C+ IEH S+ Y P+ NGA E +++ + K+ ++L
Sbjct: 913 QLTSHLFAQMCQSHGIEHKTSAVYYPRSNGAAERFVDTLKRGIAKIKGEGSVNQQILNKF 972
Query: 992 LHGYRTTVRSS-TGATPFSLVYGMEAVLPLEVEIPSLRVIMEAKLSEAEWCQSRYDQLNL 1050
L YR T S+ G+TP +G + + + +P+ RV+ KL++ Q N+
Sbjct: 973 LISYRNTPHSALNGSTPAECHFGRKIRTTMSLLMPTDRVLKVPKLTQY--------QQNM 1024
Query: 1051 VEEKRMDAMARGQSYQARMK 1070
+ AR +++Q K
Sbjct: 1025 KHHYELRNGARAKAFQVNQK 1044
>POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 679
Score = 120 bits (301), Expect = 2e-26
Identities = 126/509 (24%), Positives = 218/509 (42%), Gaps = 46/509 (9%)
Query: 25 LEQERKAIEPHQEEIELINLGTEENKREIKVGAALEEGVK---RKIF---QLLREYPDIF 78
LE +K + Q E +N+ T + + ++ A L EG + K+F Q +++ ++
Sbjct: 159 LESMKKRSKTQQPEP--VNISTNKIENPLEEIAILSEGRRLSEEKLFITQQRMQKIEELL 216
Query: 79 AWSYEDMPGLDPKIVEHRIPTKPEYPPVRQKLRRTHPDMALKIK---------SEVQKQI 129
+ P LDP + + + ++ + P A+K+K E KQI
Sbjct: 217 EKVCSENP-LDPNKTKQWM---------KASIKLSDPSKAIKVKPMKYSPMDREEFDKQI 266
Query: 130 DAGFLMTVEYPEWVANIVPV-------PKKDGKVRMCVDFRDLIKASPKDNFPLPHIDVL 182
+ V P ++ P K+ GK RM V+++ + KA+ D + LP+ D L
Sbjct: 267 KELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATVGDAYNLPNKDEL 326
Query: 183 VDNTAQSKVFSFMDGFSGYNQIKMSPEDREKTSFITP*GTFCYKVMPFGLINAGATY*RG 242
+ K+FS D SG+ Q+ + E R T+F P G + + V+PFGL A + + R
Sbjct: 327 LTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRH 386
Query: 243 MTTLFHDMIHKEVEVYVDDMIVKSKDEEQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSG 302
M F + K VYVDD++V S +EE H+ ++ + ++ ++ + L+ K +
Sbjct: 387 MDEAFR-VFRKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKI 445
Query: 303 KLLGFIVSQKGIEVDPDKVRAIREMP-APQTEKQVRGFLGRLNYISRFISHMTATCGPIF 361
LG + + + + I + P + +KQ++ FLG L Y S +I + P+
Sbjct: 446 NFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQ 505
Query: 362 KLLRKNQPIVWNDECQGAFDSIKNYLLEPPILVPPVEGRPLIMYLSVFDESVGCVLG--Q 419
L++N P W E +K L P L P+ LI+ D+ G +L +
Sbjct: 506 AKLKENVPWRWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIK 565
Query: 420 QDETGKKEHAIYYLSKKFTDCETRYTMLEKTCCALVWAAKRLRHYLVNHTTWLISRMDQ- 478
+E E Y S F E Y +K A++ K+ YL + R D
Sbjct: 566 INEGTNTELICRYASGSFKAAEKNYHSNDKETLAVINTIKKFSIYLT--PVHFLIRTDNT 623
Query: 479 -----IKYIFEKPAVTRKIARWQMLLSEY 502
+ ++ + + RWQ LS Y
Sbjct: 624 HFKSFVNLNYKGDSKLGRNIRWQAWLSHY 652
>POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 674
Score = 116 bits (290), Expect = 4e-25
Identities = 107/422 (25%), Positives = 182/422 (42%), Gaps = 28/422 (6%)
Query: 106 VRQKLRRTHPDMALKIK---------SEVQKQIDAGFLMTVEYPEWVANIVPV------- 149
++ ++ + P A+K+K E KQI + V P ++ P
Sbjct: 229 MKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEA 288
Query: 150 PKKDGKVRMCVDFRDLIKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSPE 209
K+ GK RM V+++ + KA+ D + P+ D L+ K+FS D SG+ Q+ + E
Sbjct: 289 EKRRGKKRMVVNYKAMNKATVGDAYNPPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQE 348
Query: 210 DREKTSFITP*GTFCYKVMPFGLINAGATY*RGMTTLFHDMIHKEVEVYVDDMIVKSKDE 269
R T+F P G + + V+PFGL A + + R M F + K VYVDD++V S +E
Sbjct: 349 SRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFR-VFRKFCCVYVDDILVFSNNE 407
Query: 270 EQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMP- 328
E H+ ++ + ++ ++ + L+ K + LG + + + + I + P
Sbjct: 408 EDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPD 467
Query: 329 APQTEKQVRGFLGRLNYISRFISHMTATCGPIFKLLRKNQPIVWNDECQGAFDSIKNYLL 388
+ +KQ++ FLG L Y S +I + P+ L++N P W E +K L
Sbjct: 468 TLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKNLQ 527
Query: 389 EPPILVPPVEGRPLIMYLSVFDESVGCVLG--QQDETGKKEHAIYYLSKKFTDCETRYTM 446
P L P+ LI+ D+ G +L + +E E Y S F E Y
Sbjct: 528 GFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAEKNYHS 587
Query: 447 LEKTCCALVWAAKRLRHYLVNHTTWLISRMDQ------IKYIFEKPAVTRKIARWQMLLS 500
+K A++ K+ YL + R D + ++ + + RWQ LS
Sbjct: 588 NDKETLAVINTIKKFSIYLT--PVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLS 645
Query: 501 EY 502
Y
Sbjct: 646 HY 647
>POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 680
Score = 115 bits (289), Expect = 5e-25
Identities = 124/509 (24%), Positives = 216/509 (42%), Gaps = 46/509 (9%)
Query: 25 LEQERKAIEPHQEEIELINLGTEENKREIKVGAALEEGVK---RKIF---QLLREYPDIF 78
LE +K + Q E +N+ T + + ++ A L EG + K+F Q +++ ++
Sbjct: 160 LESMKKRSKTQQPEP--VNISTNKIENPLEEIAILSEGRRLSEEKLFITQQRMQKTEELL 217
Query: 79 AWSYEDMPGLDPKIVEHRIPTKPEYPPVRQKLRRTHPDMALKIK---------SEVQKQI 129
+ P LDP + + + ++ + P A+K+K E KQI
Sbjct: 218 EKVCSENP-LDPNKTKQWM---------KASIKLSDPSKAIKVKPMKYSPMDREEFDKQI 267
Query: 130 DAGFLMTVEYPEWVANIVPV-------PKKDGKVRMCVDFRDLIKASPKDNFPLPHIDVL 182
+ V P ++ P G RM V+++ + KA+ D + LP+ D L
Sbjct: 268 KELLDLKVIKPSKSPHMAPAFLVNNEAENGRGNKRMVVNYKAMNKATVGDAYNLPNKDEL 327
Query: 183 VDNTAQSKVFSFMDGFSGYNQIKMSPEDREKTSFITP*GTFCYKVMPFGLINAGATY*RG 242
+ K+FS D SG+ Q+ + E R T+F P G + + V+PFGL A + + R
Sbjct: 328 LTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRH 387
Query: 243 MTTLFHDMIHKEVEVYVDDMIVKSKDEEQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSG 302
M F + K VYVDD++V S +EE H+ ++ + ++ ++ + L+ K +
Sbjct: 388 MDEAFR-VFRKFCCVYVDDIVVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKI 446
Query: 303 KLLGFIVSQKGIEVDPDKVRAIREMP-APQTEKQVRGFLGRLNYISRFISHMTATCGPIF 361
LG + + + + I + P + +KQ++ FLG L Y S +I ++ P+
Sbjct: 447 NFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPNLAQMRQPLQ 506
Query: 362 KLLRKNQPIVWNDECQGAFDSIKNYLLEPPILVPPVEGRPLIMYLSVFDESVGCVLG--Q 419
L++N P W E +K L P L P+ LI+ D+ G +L +
Sbjct: 507 AKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIK 566
Query: 420 QDETGKKEHAIYYLSKKFTDCETRYTMLEKTCCALVWAAKRLRHYLVNHTTWLISRMDQ- 478
+E E Y S F E Y +K A++ K+ YL + R D
Sbjct: 567 INEGTNTELICRYRSGSFKAAERNYHSNDKETLAVINTIKKFSIYLT--PVHFLIRTDNT 624
Query: 479 -----IKYIFEKPAVTRKIARWQMLLSEY 502
+ ++ + + RWQ LS Y
Sbjct: 625 HFKSFVNLNYKGDSKLGRNIRWQAWLSHY 653
>POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 659
Score = 109 bits (273), Expect = 4e-23
Identities = 117/462 (25%), Positives = 191/462 (41%), Gaps = 30/462 (6%)
Query: 80 WSYEDMPGLDPKIVEHRIPTKP-EYPPV-RQKLRRTHPDMA-LK-IKSEVQKQIDAGFLM 135
W + +DPK V + KP Y P R++ R ++ LK IK + FL+
Sbjct: 215 WMTATIELIDPKTV---VKVKPMSYSPSDREEFDRQIKELLELKVIKPSKSTHMSPAFLV 271
Query: 136 TVEYPEWVANIVPVPKKDGKVRMCVDFRDLIKASPKDNFPLPHIDVLVDNTAQSKVFSFM 195
E ++ GK RM V+++ + KA+ D LP+ D L+ K++S
Sbjct: 272 ENE----------AERRRGKKRMVVNYKAMNKATKGDAHNLPNKDELLTLVRGKKIYSSF 321
Query: 196 DGFSGYNQIKMSPEDREKTSFITP*GTFCYKVMPFGLINAGATY*RGMTTLFHDMIHKEV 255
D SG Q+ + E + T+F P G + + V+PFGL A + + + + K
Sbjct: 322 DCKSGLWQVLLDKESQLLTAFTCPQGHYQWNVVPFGLKQAPSIFPKTYANSHSNQYSKYC 381
Query: 256 EVYVDDMIVKSK-DEEQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGI 314
VYVDD++V S ++H ++ + R K + L+ K LG + Q
Sbjct: 382 CVYVDDILVFSNTGRKEHYIHVLNILRRCEKLGIILSKKKAQLFKEKINFLGLEIDQGTH 441
Query: 315 EVDPDKVRAIREMP-APQTEKQVRGFLGRLNYISRFISHMTATCGPIFKLLRKNQPIVWN 373
+ I + P + +KQ++ FLG L Y S +I + + P+ L+++ WN
Sbjct: 442 CPQNHILEHIHKFPDRIEDKKQLQRFLGILTYASDYIPKLASIRKPLQSKLKEDSTWTWN 501
Query: 374 DECQGAFDSIKNYLLEPPILVPPVEGRPLIMYLSVFDESVGCVLGQQDETGKKEHAIYYL 433
D IK L P L P L++ +E G +L + E+ Y
Sbjct: 502 DTDSQYMAKIKKNLKSFPKLYHPEPNDKLVIETDASEEFWGGIL--KAIHNSHEYICRYA 559
Query: 434 SKKFTDCETRYTMLEKTCCALVWAAKRLRHYLVNHTTWLISRMDQ------IKYIFEKPA 487
S F E Y EK A++ K+ YL + + R D + +
Sbjct: 560 SGSFKAAERNYHSNEKELLAVIRVIKKFSIYLT--PSRFLIRTDNKNFTHFVNINLKGDR 617
Query: 488 VTRKIARWQMLLSEYDIVFKAQKAIKGSILADHLAYQPLDDY 529
++ RWQM LS+YD + K ++ AD L L +Y
Sbjct: 618 KQGRLVRWQMWLSQYDFDVEHIAGTK-NVFADFLQENTLTNY 658
>POL_RTBVP (P27502) Polyprotein (P194 protein) [Contains: Coat
protein; Protease (EC 3.4.23.-); Reverse transcriptase
(EC 2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
Length = 1675
Score = 106 bits (265), Expect = 3e-22
Identities = 87/346 (25%), Positives = 163/346 (46%), Gaps = 8/346 (2%)
Query: 155 KVRMCVDFRDLIKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSPEDREKT 214
K R+ +++ L D F +PH +++ ++ +FS D +G++ +K+ + ++ T
Sbjct: 1238 KPRIVYNYKRLNDNMHTDPFNIPHKISMINLIQKANIFSKFDLKAGFHHMKLKDDFKDWT 1297
Query: 215 SFITP*GTFCYKVMPFGLINAGATY*RGMTTLFHDMIHKEVEVYVDDMIVKSKDEEQHVE 274
+F G + + V PFG+ NA + R M F D+ K +Y+DD+++ S +E++H+E
Sbjct: 1298 TFTCSEGLYTWNVCPFGIANAPCAFQRFMQESFGDL--KFALLYIDDILIASNNEKEHIE 1355
Query: 275 YLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMPAPQ--T 332
+L F R+++ L+ K ++ + LG + + I + P V I++ + T
Sbjct: 1356 HLKIFFNRVKEVGCVLSKKKSKMFLKEVEYLGVEIKEGKISLQPHIVDKIKKFDKNKLNT 1415
Query: 333 EKQVRGFLGRLNYISRFISHMTATCGPIFKLLRKNQPIVWNDECQGAFDSIKNYLLEPPI 392
K ++ +LG LNY +I ++ GP++K KN ++N E I+ + +
Sbjct: 1416 LKGLQAYLGLLNYARGYIKDLSKLVGPLYKKTGKNGQRIFNKEDWNIIFKIEREVSKIKP 1475
Query: 393 LVPPVEGRPLIMYLSVFDESVGCVLGQQDE--TGKKEHAIY-YLSKKFTDCETRYTMLEK 449
L P E +I+ +E G VL + + +GK I Y S F + +T +T L+
Sbjct: 1476 LERPKETDYIIIETDASEEGWGAVLVCKPDKYSGKDTEKIAGYASGNFGEKKT-WTSLDY 1534
Query: 450 TCCALVWAAKRLRHYLVNHTTWLISRMDQIKYIFEKPAVTRKIARW 495
A+ A + + YL T +K I + R RW
Sbjct: 1535 EIEAINEALNKFQIYLDKDFTIRTDCEAIVKGIKTEDYKKRSKTRW 1580
>POL_MLVFF (P26809) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1204
Score = 101 bits (251), Expect = 1e-20
Identities = 85/298 (28%), Positives = 137/298 (45%), Gaps = 25/298 (8%)
Query: 819 YYWMTMEHDCYQHARKCHKC-QIYADKIHVPPHALNVISSPWPFSMWGIDMIGRIEPKAS 877
YY + + C C Q+ A K V + P + W ID ++P
Sbjct: 874 YYMLNRDRTLKDITETCQACAQVNASKSAVKQGTR--VRGHRPGTHWEIDFT-EVKP-GL 929
Query: 878 NGHRFILVAIDYYTKWVEAASYANVTKQVVAKFIRNNIICRYGVPSKIITDNGTNLNNNV 937
G++++LV ID ++ WVEA T +VV K + I R+G+P + TDNG + V
Sbjct: 930 YGYKYLLVFIDTFSGWVEAFPTKKETAKVVTKKLLEEIFPRFGMPQVLGTDNGPAFVSKV 989
Query: 938 VQALCEEFKIEHHISSPYRPQMNGAVEAANKNIQKIVQK--MVTTYKDWHEMLPYALHGY 995
Q + + ++ + YRPQ +G VE N+ I++ + K + T +DW +LP AL+
Sbjct: 990 SQTVADLLGVDWKLHCAYRPQSSGQVERMNRTIKETLTKLTLATGSRDWVLLLPLALYRA 1049
Query: 996 RTTVRSSTGATPFSLVYGMEAVLPLEVEIPSLRVIMEAKLSEAEWCQSRYDQLNLVEEKR 1055
R T G TP+ ++YG P V P + AK++ Q+ L LV+ +
Sbjct: 1050 RNT-PGPHGLTPYEILYGAP---PPLVNFPDPDM---AKVTHNPSLQAHLQALYLVQHEV 1102
Query: 1056 MDAMARGQSYQARMKTAFDKKVRPREFKGGELVLKRRISQQPDPRGKWTPNYEGPYVV 1113
+A +YQ ++ D+ V P F+ G+ V RR + P ++GPY V
Sbjct: 1103 WRPLA--AAYQEQL----DRPVVPHPFRVGDTVWVRRHQTK-----NLEPRWKGPYTV 1149
Score = 78.2 bits (191), Expect = 1e-13
Identities = 85/387 (21%), Positives = 154/387 (38%), Gaps = 35/387 (9%)
Query: 71 LREYPDIFAWSYEDMPGLDPKIVEHRIPTKPEYPPVRQKLRRTHPDMALKIKSEVQKQID 130
L ++P AW+ GL + IP K PV K + L IK +Q+ +D
Sbjct: 151 LSDFPQ--AWAETGGMGLAVRQAPLIIPLKATSTPVSIKQYPMSQEARLGIKPHIQRLLD 208
Query: 131 AGFLMTVEYPEWVANIVPVPKKD-GKVRMCVDFRDLIKASPKDNFPLPHIDVLVDNTAQS 189
G L+ + P W ++PV K R D R++ K + +P+ L+ S
Sbjct: 209 QGILVPCQSP-WNTPLLPVKKPGTNDYRPVQDLREVNKRVEDIHPTVPNPYNLLSGLPPS 267
Query: 190 -KVFSFMDGFSGYNQIKMSPEDREKTSF------ITP*GTFCYKVMPFGLINAGATY*RG 242
+ ++ +D + +++ P + +F + G + +P G N+
Sbjct: 268 HQWYTVLDLKDAFFCLRLHPTSQSLFAFEWRDPEMGISGQLTWTRLPQGFKNS------- 320
Query: 243 MTTLFHDMIHKEVE------------VYVDDMIVKSKDEEQHVEYLTKMFERLRKYKLRL 290
TLF + +H+++ YVDD+++ + E + + + L R
Sbjct: 321 -PTLFDEALHRDLADFRIQHPDLILLQYVDDLLLAATSELDCQQGTRALLQTLGDLGYRA 379
Query: 291 NPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMPAPQTEKQVRGFLGRLNYISRFI 350
+ K + K LG+++ + + + + P P+T +Q+R FLG + +I
Sbjct: 380 SAKKAQICQKQVKYLGYLLKEGQRWLTEARKETVMGQPTPKTPRQLREFLGTAGFCRLWI 439
Query: 351 SHMTATCGPIFKLLRKNQPIVWNDECQGAFDSIKNYLLEPPILVPPVEGRPLIMYLSVFD 410
P++ L + W + Q A+ IK LL P L P +P +++ D
Sbjct: 440 PGFAEMAAPLYPLTKTGTLFEWGPDQQKAYQEIKQALLTAPALGLPDLTKPFELFV---D 496
Query: 411 ESVGCVLG-QQDETGKKEHAIYYLSKK 436
E G G + G + YLSKK
Sbjct: 497 EKQGYAKGVLTQKLGPWRRPVAYLSKK 523
>POL_MLVRK (P31795) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)] (Fragment)
Length = 581
Score = 100 bits (250), Expect = 2e-20
Identities = 87/330 (26%), Positives = 151/330 (45%), Gaps = 28/330 (8%)
Query: 790 EQLMHDVHDGTFGTHATGHTMSRKLLRAG---YYWMTMEHDCYQHARKCHKC-QIYADKI 845
+Q + ++ D G+ + LL G YY + + A C C Q+ A K
Sbjct: 222 DQFVFELLDSLHRLTHLGYQKMKALLDRGESPYYMLNRDKTLQYVADSCTVCAQVNASKA 281
Query: 846 HVPPHALNVISSPWPFSMWGIDMIGRIEPKASNGHRFILVAIDYYTKWVEAASYANVTKQ 905
+ + P + W ID ++P G++++LV +D ++ WVEA + T +
Sbjct: 282 KIGAGVR--VRGHRPGTHWEIDFT-EVKP-GLYGYKYLLVFVDTFSGWVEAFPTKHETAK 337
Query: 906 VVAKFIRNNIICRYGVPSKIITDNGTNLNNNVVQALCEEFKIEHHISSPYRPQMNGAVEA 965
+V K + I R+G+P + TDNG + V Q++ + I+ + YRPQ +G VE
Sbjct: 338 IVTKKLLEEIFPRFGMPQVLGTDNGPAFVSQVSQSVAKLLGIDWKLHCAYRPQSSGQVER 397
Query: 966 ANKNIQKIVQK--MVTTYKDWHEMLPYALHGYRTTVRSSTGATPFSLVYGMEAVLPLEVE 1023
N+ I++ + K + T +DW +LP AL+ R T G TP+ ++YG L +
Sbjct: 398 MNRTIKETLTKLTLATGTRDWVLLLPLALYRARNT-PGPHGLTPYEILYGAPPPL-VNFH 455
Query: 1024 IPSLRVIMEAKLSEAEWCQSRYDQLNLVEEKRMDAMARGQSYQARMKTAFDKKVRPREFK 1083
P + +K + + Q+ L V+ + +A +YQ ++ D+ V P F+
Sbjct: 456 DPEM-----SKFTNSPSLQAHLQALQAVQREVWKPLA--AAYQDQL----DQPVIPHPFR 504
Query: 1084 GGELVLKRRISQQPDPRGKWTPNYEGPYVV 1113
G+ V RR + P ++GPY V
Sbjct: 505 VGDTVWVRRHQTK-----NLEPRWKGPYTV 529
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.322 0.139 0.427
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 143,085,919
Number of Sequences: 164201
Number of extensions: 6469833
Number of successful extensions: 14935
Number of sequences better than 10.0: 135
Number of HSP's better than 10.0 without gapping: 121
Number of HSP's successfully gapped in prelim test: 14
Number of HSP's that attempted gapping in prelim test: 14645
Number of HSP's gapped (non-prelim): 225
length of query: 1145
length of database: 59,974,054
effective HSP length: 121
effective length of query: 1024
effective length of database: 40,105,733
effective search space: 41068270592
effective search space used: 41068270592
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 71 (32.0 bits)
Medicago: description of AC148652.2