
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC139745.5 + phase: 0 /pseudo
(2129 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E Value
POL3_DROME (P04323) Retrovirus-related Pol polyprotein from tran... 235 1e-60
POL2_DROME (P20825) Retrovirus-related Pol polyprotein from tran... 218 2e-55
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III 207 3e-52
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran... 207 3e-52
POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from tran... 203 5e-51
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei... 190 4e-47
RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protei... 187 2e-46
RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protei... 187 2e-46
POLY_DROME (P10401) Retrovirus-related Pol polyprotein from tran... 184 3e-45
POL_FOAMV (P14350) Pol polyprotein [Contains: Reverse transcript... 161 2e-38
POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic pro... 130 4e-29
POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic pro... 129 9e-29
POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic pro... 129 1e-28
YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II 125 1e-27
POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic pro... 124 2e-27
POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic pro... 124 4e-27
POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic prot... 118 2e-25
POL_RTBVP (P27502) Polyprotein (P194 protein) [Contains: Coat pr... 114 4e-24
POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic prot... 110 3e-23
POL_GALV (P21414) Pol polyprotein [Contains: Protease (EC 3.4.23... 107 3e-22
>POL3_DROME (P04323) Retrovirus-related Pol polyprotein from
transposon 17.6 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1058
Score = 235 bits (599), Expect = 1e-60
Identities = 161/545 (29%), Positives = 268/545 (48%), Gaps = 52/545 (9%)
Query: 1007 RLLEQEKKAIQPHQEEIELIN--------IGTEEN--------------KQEIKIGATLE 1044
+LL + K I +E+ L N I T E +Q KI LE
Sbjct: 97 KLLAEAKATISYRDQEVTLYNNKYKLIEGIATHEQSHFQNVNMIPDTMLRQPNKISPILE 156
Query: 1045 EGV----------KQKVIQLLREYPDIFAWSYEDMPGLDPMIVEHRIPTKPECPPVRQKL 1094
+ KQ++ LL++Y DI + + + +H I TK P
Sbjct: 157 SDLYRLEHLNNEEKQRLCALLQKYHDIQYHEGDKLTFTNQ--TKHTINTKHNLPLYS--- 211
Query: 1095 RRTHPDM-ALKIKNEVQKQIDAGFLMTVEYPEWVANIVPVPKKDG-----KVRMCVDFRD 1148
+ ++P ++++++Q ++ G + T P + + I VPKK K R+ +D+R
Sbjct: 212 KYSYPQAYEQEVESQIQDMLNQGIIRTSNSP-YNSPIWVVPKKQDASGKQKFRIVIDYRK 270
Query: 1149 LNKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSPEDREKTSFITPWGTFC 1208
LN+ + D P+P++D ++ + F+ +D G++QI+M PE KT+F T G +
Sbjct: 271 LNEITVGDRHPIPNMDEILGKLGRCNYFTTIDLAKGFHQIEMDPESVSKTAFSTKHGHYE 330
Query: 1209 YKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSTDEEQHVEYLTKMFERLR 1268
Y MPFGL NA AT+QR M + +++K VY+DD+IV ST ++H++ L +FE+L
Sbjct: 331 YLRMPFGLKNAPATFQRCMNDILRPLLNKHCLVYLDDIIVFSTSLDEHLQSLGLVFEKLA 390
Query: 1269 KYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMPAPQTEKQVRGFLGRLN 1328
K L+L +KC F + LG +++ GI+ +P+K+ AI++ P P K+++ FLG
Sbjct: 391 KANLKLQLDKCEFLKQETTFLGHVLTPDGIKPNPEKIEAIQKYPIPTKPKEIKAFLGLTG 450
Query: 1329 YISRFISHMTATCGPIFKLLRKNQPV-VWNDECQEAFDSIKNYLLEPPILVPPVEGRPLI 1387
Y +FI + P+ K L+KN + N E AF +K + E PIL P +
Sbjct: 451 YYRKFIPNFADIAKPMTKCLKKNMKIDTTNPEYDSAFKKLKYLISEDPILKVPDFTKKFT 510
Query: 1388 MYLAVFDESMGCVLGQQDETGKKEHAIYYLSKKFTDCETRYTMLEKTCCALAWAAKRLRH 1447
+ D ++G VL Q H + Y+S+ + E Y+ +EK A+ WA K RH
Sbjct: 511 LTTDASDVALGAVLSQDG------HPLSYISRTLNEHEINYSTIEKELLAIVWATKTFRH 564
Query: 1448 YLVNHTTWLISRMDPIKYIFEKAAVTGKIARWQMLLSEYDIVFKTQKAIKGSILADHLAY 1507
YL+ + S P+ +++ K+ RW++ LSE+D K K K + +AD L+
Sbjct: 565 YLLGRHFEISSDHQPLSWLYRMKDPNSKLTRWRVKLSEFDFDIKYIKG-KENCVADALSR 623
Query: 1508 QPLDD 1512
L++
Sbjct: 624 IKLEE 628
>POL2_DROME (P20825) Retrovirus-related Pol polyprotein from
transposon 297 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1059
Score = 218 bits (554), Expect = 2e-55
Identities = 132/437 (30%), Positives = 223/437 (50%), Gaps = 17/437 (3%)
Query: 1077 VEHRIPTKPECPPVRQK--LRRTHPDMALKIKNEVQKQIDAGFLMT----VEYPEWVANI 1130
++H + T P ++ L +TH ++++N+VQ+ ++ G + P WV
Sbjct: 195 IKHVLNTTHNSPIYSKQYPLAQTHE---IEVENQVQEMLNQGLIRESNSPYNSPTWVVPK 251
Query: 1131 VPVPKKDGKVRMCVDFRDLNKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKM 1190
P K R+ +D+R LN+ + D +P+P++D ++ + + F+ +D G++QI+M
Sbjct: 252 KPDASGANKYRVVIDYRKLNEITIPDRYPIPNMDEILGKLGKCQYFTTIDLAKGFHQIEM 311
Query: 1191 SPEDREKTSFITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKS 1250
E KT+F T G + Y MPFGL NA AT+QR M + +++K VY+DD+I+ S
Sbjct: 312 DEESISKTAFSTKSGHYEYLRMPFGLRNAPATFQRCMNNILRPLLNKHCLVYLDDIIIFS 371
Query: 1251 TDEEQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIRE 1310
T +H+ + +F +L L+L +KC F + LG IV+ GI+ +P KV+AI
Sbjct: 372 TSLTEHLNSIQLVFTKLADANLKLQLDKCEFLKKEANFLGHIVTPDGIKPNPIKVKAIVS 431
Query: 1311 MPAPQTEKQVRGFLGRLNYISRFISHMTATCGPIFKLLRKNQPV-VWNDECQEAFDSIKN 1369
P P +K++R FLG Y +FI + P+ L+K + E EAF+ +K
Sbjct: 432 YPIPTKDKEIRAFLGLTGYYRKFIPNYADIAKPMTSCLKKRTKIDTQKLEYIEAFEKLKA 491
Query: 1370 YLLEPPILVPPVEGRPLIMYLAVFDESMGCVLGQQDETGKKEHAIYYLSKKFTDCETRYT 1429
++ PIL P + ++ + ++G VL Q H I ++S+ D E Y+
Sbjct: 492 LIIRDPILQLPDFEKKFVLTTDASNLALGAVLSQNG------HPISFISRTLNDHELNYS 545
Query: 1430 MLEKTCCALAWAAKRLRHYLVNHTTWLISRMDPIKYIFEKAAVTGKIARWQMLLSEYDIV 1489
+EK A+ WA K RHYL+ + S P++++ K+ RW++ LSEY
Sbjct: 546 AIEKELLAIVWATKTFRHYLLGRQFLIASDHQPLRWLHNLKEPGAKLERWRVRLSEYQFK 605
Query: 1490 FKTQKAIKGSILADHLA 1506
K + S+ AD L+
Sbjct: 606 IDYIKGKENSV-ADALS 621
>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
Length = 2186
Score = 207 bits (527), Expect = 3e-52
Identities = 126/469 (26%), Positives = 237/469 (49%), Gaps = 11/469 (2%)
Query: 1044 EEGVKQKVIQLLREYPDIFAWSYEDMPGLDPMIVEHRIPTKPECPPVRQKLRRTHPDMAL 1103
E G +K+ ++ ++ D+FA S +++ E I K P+RQK R +
Sbjct: 899 ENGDDRKIWDVIEQFQDVFAISDDELGRNSG--TECVIELKEGAEPIRQKPRPIPLALKP 956
Query: 1104 KIKNEVQKQIDAGFLMTVEYPEWVANIVPVPKKDGKVRMCVDFRDLNKASPKDNFPLPHI 1163
+I+ +QK ++ + + P W + +V V KKDG +RMC+D+R +NK + PLP+I
Sbjct: 957 EIRKMIQKMLNQKVIRESKSP-WSSPVVLVKKKDGSIRMCIDYRKVNKVVKNNAHPLPNI 1015
Query: 1164 DVLVDNTAQSKVFSFMDGFSGYNQIKMSPEDREKTSFITPWGTFCYKVMPFGLINAGATY 1223
+ + + A K+++ D +G+ QI + + +E T+F F + V+PFGL+ + A +
Sbjct: 1016 EATLQSLAGKKLYTVFDMIAGFWQIPLDEKSKEITAFAIGSELFEWNVLPFGLVISPALF 1075
Query: 1224 QRGMTTLFHDMIHKEVEVYVDDMIVKSTDEEQHVEYLTKMFERLRKYKLRLNPNKCTFGV 1283
Q M + D++ VYVDD+++ S D EQH++ + + R+RK ++L +KC
Sbjct: 1076 QGTMEEIIGDLLGVCAFVYVDDLLIASKDMEQHLQDVKEALTRIRKSGMKLRASKCHIAK 1135
Query: 1284 RSGKLLGFIVSQKGIEVDPDKVRAIREMPAPQTEKQVRGFLGRLNYISRFISHMTATCGP 1343
+ + LG V+ G+E K +++ P K+++ FLG + Y +FI +
Sbjct: 1136 KEVEYLGHKVTLDGVETQEVKTDKMKQFSRPTNVKELQSFLGLVGYYRKFILNFAQIASS 1195
Query: 1344 IFKLLRKNQPVVWNDECQEAFDSIKNYLLEPPILV-PPVEG-----RPLIMYLAVFDESM 1397
+ L+ +W E + AF +K + + P+L P VE RP ++Y + +
Sbjct: 1196 LTSLISAKVAWIWEKEQEIAFQELKKLVCQTPVLAQPDVEAALKGDRPFMIYTDASRKGI 1255
Query: 1398 GCVLGQQDETGKKEHAIYYLSKKFTDCETRYTMLEKTCCALAWAAKRLRHYLVNHTTWLI 1457
G VL Q+ G ++H I + SK + ETRY + + A+ +A +R + + +
Sbjct: 1256 GAVLAQEGPDG-QQHPIAFASKALSPAETRYHITDLEALAMMFALRRFKTIIYGTAITVF 1314
Query: 1458 SRMDPIKYIFEKAAVTGKIARWQMLLSEYDIVFKTQKAIKGSILADHLA 1506
+ P+ + + + + ++ RW + + E+D+ A K + +AD L+
Sbjct: 1315 TDHKPLISLLKGSPLADRLWRWSIEILEFDVKI-VYLAGKANAVADALS 1362
Score = 107 bits (268), Expect = 3e-22
Identities = 89/335 (26%), Positives = 146/335 (43%), Gaps = 14/335 (4%)
Query: 1768 VDEHEAEQLMHDVHDGTFGTHATGHTMSRKLLRAGYYWMAMEHDCYQYARKCHKCQIYAD 1827
V E L+ ++H+G H M R + R +YW M R C KC D
Sbjct: 1460 VPEKIRTPLLKELHEGMLAGHFGIKKMWRMVHRK-FYWPQMRVCVENCVRTCAKCLCAND 1518
Query: 1828 KIHVPPHALNVMSSPWPFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYTNVT 1887
+ +L +P + D++ + G+R+IL ID FTK+ A +
Sbjct: 1519 HSKLTS-SLTPYRMTFPLEIVACDLMD--VGLSVQGNRYILTIIDLFTKYGTAVPIPDKK 1575
Query: 1888 KQVVAK-FIKNNIICRYGVPSKIITDNGTNLNNNVVQALCEEFKIEHHNSSPYRPQMNGA 1946
+ V K F++ I +P K++TD G N + KIEH + Y + NGA
Sbjct: 1576 AETVLKAFVERWAIGEGRIPLKLLTDQGKEFVNGLFAQFTHMLKIEHITTKGYNSRANGA 1635
Query: 1947 VEAANKNIKRIVQKMVTTYKDWHEMLPYALHGYRTTVRSSTGATPFSLVYGMEAVLPLEV 2006
VE NK I I++K +W + + YA++ Y V +TG TP L++G + + PLE+
Sbjct: 1636 VERFNKTIMHIMKKKTAVPMEWDDQVVYAVYAYNNCVHENTGETPMFLMHGRDVMGPLEM 1695
Query: 2007 EIPSLRVIMEAKLSEAEWCQSRYDQLNLIEEKRMDAMARGQSYQARMKTAFDKKVH---- 2062
I A + E + ++ + L + + + AM +SY++ + K H
Sbjct: 1696 SGEDAVGINYADMDEYKHLLTQ-ELLKVQKIAKEHAMREQESYKSLFDQKYASKKHRFPQ 1754
Query: 2063 PREFKVGELVLKRKISQQPDPRGKWTPNYEGPYVV 2097
P + E+ ++ +Q P KW+ GPY V
Sbjct: 1755 PGSRVLLEIPSEKLGAQCPKLVNKWS----GPYRV 1785
>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
transposon 412 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1237
Score = 207 bits (527), Expect = 3e-52
Identities = 139/465 (29%), Positives = 231/465 (48%), Gaps = 13/465 (2%)
Query: 1048 KQKVIQLLREYPDIFAWSYEDMPGLDPMIVEHRIPTKPECPPVRQKLRRTHPDMALKIKN 1107
K ++ + EY DIFA E P + + ++ K + P + R H + +I+
Sbjct: 276 KSQLENICSEYIDIFA--LESEPITVNNLYKQQLRLKDDEPVYTKNYRSPHSQVE-EIQA 332
Query: 1108 EVQKQIDAGFLMTVEYPEWVANIVPVPKKDG------KVRMCVDFRDLNKASPKDNFPLP 1161
+VQK I ++ ++ + ++ VPKK K R+ +D+R +NK D FPLP
Sbjct: 333 QVQKLIKDK-IVEPSVSQYNSPLLLVPKKSSPNSDKKKWRLVIDYRQINKKLLADKFPLP 391
Query: 1162 HIDVLVDNTAQSKVFSFMDGFSGYNQIKMSPEDREKTSFITPWGTFCYKVMPFGLINAGA 1221
ID ++D ++K FS +D SG++QI++ R+ TSF T G++ + +PFGL A
Sbjct: 392 RIDDILDQLGRAKYFSCLDLMSGFHQIELDEGSRDITSFSTSNGSYRFTRLPFGLKIAPN 451
Query: 1222 TYQRGMTTLFHDMIHKEVEVYVDDMIVKSTDEEQHVEYLTKMFERLRKYKLRLNPNKCTF 1281
++QR MT F + + +Y+DD+IV E+ ++ LT++F + R+Y L+L+P KC+F
Sbjct: 452 SFQRMMTIAFSGIEPSQAFLYMDDLIVIGCSEKHMLKNLTEVFGKCREYNLKLHPEKCSF 511
Query: 1282 GVRSGKLLGFIVSQKGIEVDPDKVRAIREMPAPQTEKQVRGFLGRLNYISRFISHMTATC 1341
+ LG + KGI D K I+ P P R F+ NY RFI +
Sbjct: 512 FMHEVTFLGHKCTDKGILPDDKKYDVIQNYPVPHDADSARRFVAFCNYYRRFIKNFADYS 571
Query: 1342 GPIFKLLRKNQPVVWNDECQEAFDSIKNYLLEPPILVPPVEGRPLIMYLAVFDESMGCVL 1401
I +L +KN P W DECQ+AF +K+ L+ P +L P + + ++ G VL
Sbjct: 572 RHITRLCKKNVPFEWTDECQKAFIHLKSQLINPTLLQYPDFSKEFCITTDASKQACGAVL 631
Query: 1402 GQQDETGKKEHAIYYLSKKFTDCETRYTMLEKTCCALAWAAKRLRHYLVNHTTWLISRMD 1461
Q + + Y S+ FT E+ + E+ A+ WA R Y+ + +
Sbjct: 632 TQNH--NGHQLPVAYASRAFTKGESNKSTTEQELAAIHWAIIHFRPYIYGKHFTVKTDHR 689
Query: 1462 PIKYIFEKAAVTGKIARWQMLLSEYDIVFKTQKAIKGSILADHLA 1506
P+ Y+F + K+ R ++ L EY+ + K K + +AD L+
Sbjct: 690 PLTYLFSMVNPSSKLTRIRLELEEYNFTVEYLKG-KDNHVADALS 733
Score = 123 bits (309), Expect = 5e-27
Identities = 101/381 (26%), Positives = 169/381 (43%), Gaps = 29/381 (7%)
Query: 1758 RNYDMVLLRCV----DEHEAEQLMHDVHDGTFGTHATGHTMSRKLLRAGYYWMAMEHDCY 1813
+N + LL V +E E E ++ +HD TG T + ++ YYW M
Sbjct: 874 KNLKVALLNPVTQINNEKEKEAILSTLHDDPIQGGHTGITKTLAKVKRHYYWKNMSKYIK 933
Query: 1814 QYARKCHKCQIYADKIHVPPHALNVMSSPWPFSMWGIDMIGRIEPKASNGHRFILVAIDY 1873
+Y RKC KCQ H + F +D IG + PK+ NG+ + + I
Sbjct: 934 EYVRKCQKCQKAKTTKHTKTPMTITETPEHAFDRVVVDTIGPL-PKSENGNEYAVTLICD 992
Query: 1874 FTKWVEAASYTNVTKQVVAKFIKNNIICRYGVPSKIITDNGTNLNNNVVQALCEEFKIEH 1933
TK++ A N + + VAK I + I +YG ITD GT N+++ LC+ KI++
Sbjct: 993 LTKYLVAIPIANKSAKTVAKAIFESFILKYGPMKTFITDMGTEYKNSIITDLCKYLKIKN 1052
Query: 1934 HNSSPYRPQMNGAVEAANKNIKRIVQKMVTTYK-DWHEMLPYALHGYRTTVRSSTGATPF 1992
S+ + Q G VE +++ + ++ ++T K DW L Y ++ + TT P+
Sbjct: 1053 ITSTAHHHQTVGVVERSHRTLNEYIRSYISTDKTDWDVWLQYFVYCFNTTQSMVHNYCPY 1112
Query: 1993 SLVYGMEAVLPLEVEIPSLRVIMEAKLSEAEWCQSRYDQLNLIEEKRMDAMARG----QS 2048
LV+G + LP KL E + D + + A AR ++
Sbjct: 1113 ELVFGRTSNLPKHFN----------KLHSIEPIYNIDDYAKESKYRLEVAYARARKLLEA 1162
Query: 2049 YQARMKTAFDKKVHPREFKVGELVLKRKISQQPDPRGKWTPNYEGPYVVKK-AFSGGALI 2107
++ + K +D KV E +VG+ VL R + K Y GPY ++ + +
Sbjct: 1163 HKEKNKENYDLKVKDIELEVGDKVLLRN-----EVGHKLDFKYTGPYKIESIGDNNNITL 1217
Query: 2108 LTHMDGVELPNPVNADIVKKY 2128
LT+ + ++ V+ D +KK+
Sbjct: 1218 LTNKNKKQI---VHKDRLKKF 1235
>POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from
transposon opus [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1003
Score = 203 bits (516), Expect = 5e-51
Identities = 142/493 (28%), Positives = 237/493 (47%), Gaps = 34/493 (6%)
Query: 1039 IGATLEEGVKQKVIQLLREYPDIFAWSYEDMPGLDPMIVEHRIPTKPECPPVRQKLRRTH 1098
+ A +G ++ + LL E+P IF P L M VE + + +++
Sbjct: 76 LAAEHPDGTQEILNSLLGEFPRIFE------PPLSGMSVETAVKAEIRTNTQDPIYAKSY 129
Query: 1099 PDMALKIKNEVQKQIDA----GFLMTVEYPE----WVANIVPVPKKDGKVRMCVDFRDLN 1150
P + ++ EV++QID G + P W+ P P + + RM VDF+ LN
Sbjct: 130 P-YPVNMRGEVERQIDELLQDGIIRPSNSPYNSPIWIVPKKPKPNGEKQYRMVVDFKRLN 188
Query: 1151 KASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSPEDREKTSFITPWGTFCYK 1210
+ D +P+P I+ + + +K F+ +D SG++QI M D KT+F T G + +
Sbjct: 189 TVTIPDTYPIPDINATLASLGNAKYFTTLDLTSGFHQIHMKESDIPKTAFSTLNGKYEFL 248
Query: 1211 VMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSTDEEQHVEYLTKMFERLRKY 1270
+PFGL NA A +QR + + + I K VY+DD+IV S D + H + L + L K
Sbjct: 249 RLPFGLKNAPAIFQRMIDDILREHIGKVCYVYIDDIIVFSEDYDTHWKNLRLVLASLSKA 308
Query: 1271 KLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMPAPQTEKQVRGFLGRLNYI 1330
L++N K F + LG+IV+ GI+ DP KVRAI EMP P + K+++ FLG +Y
Sbjct: 309 NLQVNLEKSHFLDTQVEFLGYIVTADGIKADPKKVRAISEMPPPTSVKELKRFLGMTSYY 368
Query: 1331 SRFISHMTATCGPIFKLLR-----------KNQPVVWNDECQEAFDSIKNYLLEPPILVP 1379
+FI P+ L R P+ ++ ++F+ +K+ L IL
Sbjct: 369 RKFIQDYAKVAKPLTNLTRGLYANIKSSQSSKVPITLDETALQSFNDLKSILCSSEILAF 428
Query: 1380 PVEGRPLIMYLAVFDESMGCVLGQQDETGKKEHAIYYLSKKFTDCETRYTMLEKTCCALA 1439
P +P + + ++G VL Q D+ ++ I Y+S+ E Y +EK A+
Sbjct: 429 PCFTKPFHLTTDASNWAIGAVLSQDDQ--GRDRPIAYISRSLNKTEENYATIEKEMLAII 486
Query: 1440 WAAKRLRHYLVN-HTTWLISRMDPIKYIFEKAAVTGKIARWQMLLSEY--DIVFKTQKAI 1496
W+ LR YL T + + P+ + K+ RW+ + EY ++++K K+
Sbjct: 487 WSLDNLRAYLYGAGTIKVYTDHQPLTFALGNRNFNAKLKRWKARIEEYNCELIYKPGKS- 545
Query: 1497 KGSILADHLAYQP 1509
+++AD L+ P
Sbjct: 546 --NVVADALSRIP 556
Score = 37.4 bits (85), Expect = 0.45
Identities = 46/212 (21%), Positives = 84/212 (38%), Gaps = 17/212 (8%)
Query: 1791 GHTMSRKLLRAGYYWMAMEHDCYQYARKCHKCQIYADKIHVPPHALNVMSSP---WPFSM 1847
G T R L YY+ M C C++Y + H P+ N+ +P +P +
Sbjct: 707 GPTEIRLQLLEKYYFPRMSSTIRLQTSSCQCCKLYKYERH--PNKPNLQPTPIPNYPCEI 764
Query: 1848 WGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYTNVTKQVVAKFIKNNIICRYGVPS 1907
ID+ + R L ID F+K+ + + V + + + P
Sbjct: 765 LHIDIFALEK-------RLYLSCIDKFSKFAK-LFHLQSKASVHLRETLVEALHYFTAPK 816
Query: 1908 KIITDNGTNLNNNVVQALCEEFKIEHHNSSPYRPQMNGAVEAANK---NIKRIVQKMVTT 1964
+++DN L V I+ + + + ++NG VE + I R ++ + T
Sbjct: 817 VLVSDNERGLLCPTVLNYLRSLDIDLYYAPTQKSEVNGQVERFHSTFLEIYRCLKDELPT 876
Query: 1965 YKDWHEMLPYALHGYRTTVRSSTGATPFSLVY 1996
+K E++ A+ Y T+V S T P + +
Sbjct: 877 FKP-VELVHIAVDRYNTSVHSVTNRKPADVFF 907
>RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protein
type 1
Length = 1333
Score = 190 bits (482), Expect = 4e-47
Identities = 116/419 (27%), Positives = 212/419 (49%), Gaps = 21/419 (5%)
Query: 1133 VPKKDGKVRMCVDFRDLNKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSP 1192
VPKK+G +RM VD++ LNK + +PLP I+ L+ S +F+ +D S Y+ I++
Sbjct: 455 VPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRK 514
Query: 1193 EDREKTSFITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSTD 1252
D K +F P G F Y VMP+G+ A A +Q + T+ + V Y+DD+++ S
Sbjct: 515 GDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINTILGEAKESHVVCYMDDILIHSKS 574
Query: 1253 EEQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMP 1312
E +HV+++ + ++L+ L +N KC F K +G+ +S+KG + + + +
Sbjct: 575 ESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWK 634
Query: 1313 APQTEKQVRGFLGRLNYISRFISHMTATCGPIFKLLRKNQPVVWNDECQEAFDSIKNYLL 1372
P+ K++R FLG +NY+ +FI + P+ LL+K+ W +A ++IK L+
Sbjct: 635 QPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLV 694
Query: 1373 EPPILVPPVEGRPLIMYLAVFDESMGCVLGQQDETGKKEHAIYYLSKKFTDCETRYTMLE 1432
PP+L + +++ D ++G VL Q+ + K + + Y S K + + Y++ +
Sbjct: 695 SPPVLRHFDFSKKILLETDASDVAVGAVLSQKHD-DDKYYPVGYYSAKMSKAQLNYSVSD 753
Query: 1433 KTCCALAWAAKRLRHYLVNHTTWLISRMDPIKYIFEKAAVTGKI-----------ARWQM 1481
K A+ + K RHYL S ++P K + + + G+I ARWQ+
Sbjct: 754 KEMLAIIKSLKHWRHYLE-------STIEPFKILTDHRNLIGRITNESEPENKRLARWQL 806
Query: 1482 LLSEYDIVFKTQKAIKGSILADHLAYQPLDDYQPIEFDFPDEEIMYLKSKDCEEPLINE 1540
L +++ + + +AD L+ + +D+ +PI D D I ++ + N+
Sbjct: 807 FLQDFNFEINYRPG-SANHIADALS-RIVDETEPIPKDSEDNSINFVNQISITDDFKNQ 863
Score = 98.6 bits (244), Expect = 2e-19
Identities = 114/508 (22%), Positives = 210/508 (40%), Gaps = 59/508 (11%)
Query: 1603 IDMRIKHLDIYGDSALVINQIKGEWETHHANLIPYRDYARRLLTYFTKVELHHIPRDENQ 1662
++ I+ I D +I +I E E + L ++ + + E+++ P N
Sbjct: 770 LESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDF-----NFEINYRPGSANH 824
Query: 1663 MADALATLSSMFRVNHWNDVPIIRVQRLERPSHVFAIGDVIDQAGENVIDYRPWYYDIKQ 1722
+ADAL+ + PI + + V I D + V +Y D K
Sbjct: 825 IADALSRIVD-------ETEPIPKDSEDNSINFVNQISITDDFKNQVVTEYTN---DTKL 874
Query: 1723 F-LLSREYPPGASKQDKKTLRRLAGRFLLDGDILYKRNYDMVLLRCVDE--HEAEQLMHD 1779
LL+ E K+ ++ ++ G + D + N D L R + + HE +L+H
Sbjct: 875 LNLLNNE-----DKRVEENIQLKDGLLINSKDQILLPN-DTQLTRTIIKKYHEEGKLIHP 928
Query: 1780 VHDGTFGTHATGHTMSRKLLRAGYYWMAMEHDCYQYARKCHKCQIYADKIHVPPHALN-V 1838
G + ++ + W + +Y + CH CQI + H P L +
Sbjct: 929 -----------GIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPI 977
Query: 1839 MSSPWPFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYT-NVTKQVVAKFIKN 1897
S P+ +D I + S+G+ + V +D F+K T ++T + A+
Sbjct: 978 PPSERPWESLSMDFITALPE--SSGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQ 1035
Query: 1898 NIICRYGVPSKIITDNGTNLNNNVVQALCEEFKIEHHNSSPYRPQMNGAVEAANKNIKRI 1957
+I +G P +II DN + + ++ S PYRPQ +G E N+ ++++
Sbjct: 1036 RVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKL 1095
Query: 1958 VQKMVTTYKD-WHEMLPYALHGYRTTVRSSTGATPFSLVYGMEAVLPLEVEIPSLRVIME 2016
++ + +T+ + W + + Y + S+T TPF +V+ L +E+PS +
Sbjct: 1096 LRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALS-PLELPSFSDKTD 1154
Query: 2017 AKLSEA-EWCQSRYDQLNLIEEKRMDAMARGQSYQARMKTAFDKKVHP-REFKVGELVL- 2073
E + Q+ + LN + +MK FD K+ EF+ G+LV+
Sbjct: 1155 ENSQETIQVFQTVKEHLN--------------TNNIKMKKYFDMKIQEIEEFQPGDLVMV 1200
Query: 2074 KRKISQQPDPRGKWTPNYEGP-YVVKKA 2100
KR + K P++ GP YV++K+
Sbjct: 1201 KRTKTGFLHKSNKLAPSFAGPFYVLQKS 1228
>RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protein
type 3
Length = 1333
Score = 187 bits (476), Expect = 2e-46
Identities = 115/419 (27%), Positives = 213/419 (50%), Gaps = 21/419 (5%)
Query: 1133 VPKKDGKVRMCVDFRDLNKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSP 1192
VPKK+G +RM VD++ LNK + +PLP I+ L+ S +F+ +D S Y+ I++
Sbjct: 455 VPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRK 514
Query: 1193 EDREKTSFITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSTD 1252
D K +F P G F Y VMP+G+ A A +Q + T+ ++ V Y+D++++ S
Sbjct: 515 GDEHKLAFRCPRGVFEYLVMPYGISIAPAHFQYFINTILGEVKESHVVCYMDNILIHSKS 574
Query: 1253 EEQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMP 1312
E +HV+++ + ++L+ L +N KC F K +G+ +S+KG + + + +
Sbjct: 575 ESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWK 634
Query: 1313 APQTEKQVRGFLGRLNYISRFISHMTATCGPIFKLLRKNQPVVWNDECQEAFDSIKNYLL 1372
P+ K++R FLG +NY+ +FI + P+ LL+K+ W +A ++IK L+
Sbjct: 635 QPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLV 694
Query: 1373 EPPILVPPVEGRPLIMYLAVFDESMGCVLGQQDETGKKEHAIYYLSKKFTDCETRYTMLE 1432
PP+L + +++ D ++G VL Q+ + K + + Y S K + + Y++ +
Sbjct: 695 SPPVLRHFDFSKKILLETDASDVAVGAVLSQKHD-DDKYYPVGYYSAKMSKAQLNYSVSD 753
Query: 1433 KTCCALAWAAKRLRHYLVNHTTWLISRMDPIKYIFEKAAVTGKI-----------ARWQM 1481
K A+ + K RHYL S ++P K + + + G+I ARWQ+
Sbjct: 754 KEMLAIIKSLKHWRHYLE-------STIEPFKILTDHRNLIGRITNESEPENKRLARWQL 806
Query: 1482 LLSEYDIVFKTQKAIKGSILADHLAYQPLDDYQPIEFDFPDEEIMYLKSKDCEEPLINE 1540
L +++ + + +AD L+ + +D+ +PI D D I ++ + N+
Sbjct: 807 FLQDFNFEINYRPG-SANHIADALS-RIVDETEPIPKDSEDNSINFVNQISITDDFKNQ 863
Score = 98.6 bits (244), Expect = 2e-19
Identities = 114/508 (22%), Positives = 210/508 (40%), Gaps = 59/508 (11%)
Query: 1603 IDMRIKHLDIYGDSALVINQIKGEWETHHANLIPYRDYARRLLTYFTKVELHHIPRDENQ 1662
++ I+ I D +I +I E E + L ++ + + E+++ P N
Sbjct: 770 LESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDF-----NFEINYRPGSANH 824
Query: 1663 MADALATLSSMFRVNHWNDVPIIRVQRLERPSHVFAIGDVIDQAGENVIDYRPWYYDIKQ 1722
+ADAL+ + PI + + V I D + V +Y D K
Sbjct: 825 IADALSRIVD-------ETEPIPKDSEDNSINFVNQISITDDFKNQVVTEYTN---DTKL 874
Query: 1723 F-LLSREYPPGASKQDKKTLRRLAGRFLLDGDILYKRNYDMVLLRCVDE--HEAEQLMHD 1779
LL+ E K+ ++ ++ G + D + N D L R + + HE +L+H
Sbjct: 875 LNLLNNE-----DKRVEENIQLKDGLLINSKDQILLPN-DTQLTRTIIKKYHEEGKLIHP 928
Query: 1780 VHDGTFGTHATGHTMSRKLLRAGYYWMAMEHDCYQYARKCHKCQIYADKIHVPPHALN-V 1838
G + ++ + W + +Y + CH CQI + H P L +
Sbjct: 929 -----------GIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPI 977
Query: 1839 MSSPWPFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYT-NVTKQVVAKFIKN 1897
S P+ +D I + S+G+ + V +D F+K T ++T + A+
Sbjct: 978 PPSERPWESLSMDFITALPE--SSGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQ 1035
Query: 1898 NIICRYGVPSKIITDNGTNLNNNVVQALCEEFKIEHHNSSPYRPQMNGAVEAANKNIKRI 1957
+I +G P +II DN + + ++ S PYRPQ +G E N+ ++++
Sbjct: 1036 RVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKL 1095
Query: 1958 VQKMVTTYKD-WHEMLPYALHGYRTTVRSSTGATPFSLVYGMEAVLPLEVEIPSLRVIME 2016
++ + +T+ + W + + Y + S+T TPF +V+ L +E+PS +
Sbjct: 1096 LRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALS-PLELPSFSDKTD 1154
Query: 2017 AKLSEA-EWCQSRYDQLNLIEEKRMDAMARGQSYQARMKTAFDKKVHP-REFKVGELVL- 2073
E + Q+ + LN + +MK FD K+ EF+ G+LV+
Sbjct: 1155 ENSQETIQVFQTVKEHLN--------------TNNIKMKKYFDMKIQEIEEFQPGDLVMV 1200
Query: 2074 KRKISQQPDPRGKWTPNYEGP-YVVKKA 2100
KR + K P++ GP YV++K+
Sbjct: 1201 KRTKTGFLHKSNKLAPSFAGPFYVLQKS 1228
>RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protein
type 2
Length = 1333
Score = 187 bits (476), Expect = 2e-46
Identities = 115/419 (27%), Positives = 213/419 (50%), Gaps = 21/419 (5%)
Query: 1133 VPKKDGKVRMCVDFRDLNKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSP 1192
VPKK+G +RM VD++ LNK + +PLP I+ L+ S +F+ +D S Y+ I++
Sbjct: 455 VPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRK 514
Query: 1193 EDREKTSFITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSTD 1252
D K +F P G F Y VMP+G+ A A +Q + T+ ++ V Y+D++++ S
Sbjct: 515 GDEHKLAFRCPRGVFEYLVMPYGISIAPAHFQYFINTILGEVKESHVVCYMDNILIHSKS 574
Query: 1253 EEQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMP 1312
E +HV+++ + ++L+ L +N KC F K +G+ +S+KG + + + +
Sbjct: 575 ESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWK 634
Query: 1313 APQTEKQVRGFLGRLNYISRFISHMTATCGPIFKLLRKNQPVVWNDECQEAFDSIKNYLL 1372
P+ K++R FLG +NY+ +FI + P+ LL+K+ W +A ++IK L+
Sbjct: 635 QPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLV 694
Query: 1373 EPPILVPPVEGRPLIMYLAVFDESMGCVLGQQDETGKKEHAIYYLSKKFTDCETRYTMLE 1432
PP+L + +++ D ++G VL Q+ + K + + Y S K + + Y++ +
Sbjct: 695 SPPVLRHFDFSKKILLETDASDVAVGAVLSQKHD-DDKYYPVGYYSAKMSKAQLNYSVSD 753
Query: 1433 KTCCALAWAAKRLRHYLVNHTTWLISRMDPIKYIFEKAAVTGKI-----------ARWQM 1481
K A+ + K RHYL S ++P K + + + G+I ARWQ+
Sbjct: 754 KEMLAIIKSLKHWRHYLE-------STIEPFKILTDHRNLIGRITNESEPENKRLARWQL 806
Query: 1482 LLSEYDIVFKTQKAIKGSILADHLAYQPLDDYQPIEFDFPDEEIMYLKSKDCEEPLINE 1540
L +++ + + +AD L+ + +D+ +PI D D I ++ + N+
Sbjct: 807 FLQDFNFEINYRPG-SANHIADALS-RIVDETEPIPKDSEDNSINFVNQISITDDFKNQ 863
Score = 98.6 bits (244), Expect = 2e-19
Identities = 114/508 (22%), Positives = 210/508 (40%), Gaps = 59/508 (11%)
Query: 1603 IDMRIKHLDIYGDSALVINQIKGEWETHHANLIPYRDYARRLLTYFTKVELHHIPRDENQ 1662
++ I+ I D +I +I E E + L ++ + + E+++ P N
Sbjct: 770 LESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDF-----NFEINYRPGSANH 824
Query: 1663 MADALATLSSMFRVNHWNDVPIIRVQRLERPSHVFAIGDVIDQAGENVIDYRPWYYDIKQ 1722
+ADAL+ + PI + + V I D + V +Y D K
Sbjct: 825 IADALSRIVD-------ETEPIPKDSEDNSINFVNQISITDDFKNQVVTEYTN---DTKL 874
Query: 1723 F-LLSREYPPGASKQDKKTLRRLAGRFLLDGDILYKRNYDMVLLRCVDE--HEAEQLMHD 1779
LL+ E K+ ++ ++ G + D + N D L R + + HE +L+H
Sbjct: 875 LNLLNNE-----DKRVEENIQLKDGLLINSKDQILLPN-DTQLTRTIIKKYHEEGKLIHP 928
Query: 1780 VHDGTFGTHATGHTMSRKLLRAGYYWMAMEHDCYQYARKCHKCQIYADKIHVPPHALN-V 1838
G + ++ + W + +Y + CH CQI + H P L +
Sbjct: 929 -----------GIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPI 977
Query: 1839 MSSPWPFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYT-NVTKQVVAKFIKN 1897
S P+ +D I + S+G+ + V +D F+K T ++T + A+
Sbjct: 978 PPSERPWESLSMDFITALPE--SSGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQ 1035
Query: 1898 NIICRYGVPSKIITDNGTNLNNNVVQALCEEFKIEHHNSSPYRPQMNGAVEAANKNIKRI 1957
+I +G P +II DN + + ++ S PYRPQ +G E N+ ++++
Sbjct: 1036 RVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKL 1095
Query: 1958 VQKMVTTYKD-WHEMLPYALHGYRTTVRSSTGATPFSLVYGMEAVLPLEVEIPSLRVIME 2016
++ + +T+ + W + + Y + S+T TPF +V+ L +E+PS +
Sbjct: 1096 LRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALS-PLELPSFSDKTD 1154
Query: 2017 AKLSEA-EWCQSRYDQLNLIEEKRMDAMARGQSYQARMKTAFDKKVHP-REFKVGELVL- 2073
E + Q+ + LN + +MK FD K+ EF+ G+LV+
Sbjct: 1155 ENSQETIQVFQTVKEHLN--------------TNNIKMKKYFDMKIQEIEEFQPGDLVMV 1200
Query: 2074 KRKISQQPDPRGKWTPNYEGP-YVVKKA 2100
KR + K P++ GP YV++K+
Sbjct: 1201 KRTKTGFLHKSNKLAPSFAGPFYVLQKS 1228
>POLY_DROME (P10401) Retrovirus-related Pol polyprotein from
transposon gypsy [Contains: Reverse transcriptase (EC
2.7.7.49); Endonuclease]
Length = 1035
Score = 184 bits (466), Expect = 3e-45
Identities = 132/434 (30%), Positives = 213/434 (48%), Gaps = 37/434 (8%)
Query: 1105 IKNEVQKQIDAGFLMT----VEYPEWVANIVPVPKK------DGKVRMCVDFRDLNKASP 1154
+ NEV++ + G + P WV V KK + R+ +DFR LN+ +
Sbjct: 197 VNNEVKQLLKDGIIRPSRSPYNSPTWV-----VDKKGTDAFGNPNKRLVIDFRKLNEKTI 251
Query: 1155 KDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSPEDREKTSFITPWGTFCYKVMPF 1214
D +P+P I +++ N ++K F+ +D SGY+QI ++ DREKTSF G + + +PF
Sbjct: 252 PDRYPMPSIPMILANLGKAKFFTTLDLKSGYHQIYLAEHDREKTSFSVNGGKYEFCRLPF 311
Query: 1215 GLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSTDEEQHVEYLTKMFERLRKYKLRL 1274
GL NA + +QR + + + I K VYVDD+I+ S +E HV ++ + + L +R+
Sbjct: 312 GLRNASSIFQRALDDVLREQIGKICYVYVDDVIIFSENESDHVRHIDTVLKCLIDANMRV 371
Query: 1275 NPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMPAPQTEKQVRGFLGRLNYISRFI 1334
+ K F S + LGFIVS+ G + DP+KV+AI+E P P +VR FLG +Y FI
Sbjct: 372 SQEKTRFFKESVEYLGFIVSKDGTKSDPEKVKAIQEYPEPDCVYKVRSFLGLASYYRVFI 431
Query: 1335 SHMTATCGPIFKLLR-----------KNQPVVWNDECQEAFDSIKNYLL-EPPILVPPVE 1382
A PI +L+ K PV +N+ + AF ++N L E IL P
Sbjct: 432 KDFAAIARPITDILKGENGSVSKHMSKKIPVEFNETQRNAFQRLRNILASEDVILKYPDF 491
Query: 1383 GRPLIMYLAVFDESMGCVLGQQDETGKKEHAIYYLSKKFTDCETRYTMLEKTCCALAWAA 1442
+P + +G VL Q+ I +S+ E Y E+ A+ WA
Sbjct: 492 KKPFDLTTDASASGIGAVLSQEG------RPITMISRTLKQPEQNYATNERELLAIVWAL 545
Query: 1443 KRLRHYLV-NHTTWLISRMDPIKYIFEKAAVTGKIARWQMLLSEYDI-VFKTQKAIKGSI 1500
+L+++L + + + P+ + KI RW+ + +++ VF K K +
Sbjct: 546 GKLQNFLYGSREINIFTDHQPLTFAVADRNTNAKIKRWKSYIDQHNAKVF--YKPGKENF 603
Query: 1501 LADHLAYQPLDDYQ 1514
+AD L+ Q L+ Q
Sbjct: 604 VADALSRQNLNALQ 617
>POL_FOAMV (P14350) Pol polyprotein [Contains: Reverse
transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
(RT); Integrase (IN)]
Length = 886
Score = 161 bits (408), Expect = 2e-38
Identities = 198/890 (22%), Positives = 356/890 (39%), Gaps = 79/890 (8%)
Query: 1130 IVPVPKKDGKVRMCVDFRDLNKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIK 1189
+ PVPK DG+ RM +D+R++NK P H ++ + K + +D +G+
Sbjct: 5 VYPVPKPDGRWRMVLDYREVNKTIPLTAAQNQHSAGILATIVRQKYKTTLDLANGFWAHP 64
Query: 1190 MSPEDREKTSFITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVK 1249
++PE T+F +C+ +P G +N+ A + + L ++ V+VYVDD+ +
Sbjct: 65 ITPESYWLTAFTWQGKQYCWTRLPQGFLNSPALFTADVVDLLKEI--PNVQVYVDDIYLS 122
Query: 1250 STDEEQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIR 1309
D ++HV+ L K+F+ L + ++ K G ++ + LGF ++++G + +
Sbjct: 123 HDDPKEHVQQLEKVFQILLQAGYVVSLKKSEIGQKTVEFLGFNITKEGRGLTDTFKTKLL 182
Query: 1310 EMPAPQTEKQVRGFLGRLNYISRFISHMTATCGPIFKLL--RKNQPVVWNDECQEAFDSI 1367
+ P+ KQ++ LG LN+ FI + P++ L+ K + + W++E + + +
Sbjct: 183 NITPPKDLKQLQSILGLLNFARNFIPNFAELVQPLYNLIASAKGKYIEWSEENTKQLNMV 242
Query: 1368 KNYLLEPPILVPPVEGRPLIMYLAVFDESMGCVLGQQDETGKKEHAIYYLSKKFTDCETR 1427
L L + + L++ + S G V +ETGKK I YL+ F+ E +
Sbjct: 243 IEALNTASNLEERLPEQRLVIKVNT-SPSAGYV-RYYNETGKK--PIMYLNYVFSKAELK 298
Query: 1428 YTMLEKTCCALAWAAKRLRHYLVNHTTWLISRMDPIKYIF-----EKAAVTGKIARWQML 1482
++MLEK + A + + + S + + I E+ A+ + W
Sbjct: 299 FSMLEKLLTTMHKALIKAMDLAMGQEILVYSPIVSMTKIQKTPLPERKALPIRWITWMTY 358
Query: 1483 LSEYDIVFKTQKAIKGSILADHLAYQPLDDYQPIEFDFPDEEIMYLKSKDCEEPLINEGP 1542
L + I F K + H+ P++ E + Y + P
Sbjct: 359 LEDPRIQFHYDKTLPE---LKHIPDVYTSSQSPVKHPSQYEGVFYTDGSAI------KSP 409
Query: 1543 DPNSKWGLVFDGAVNAYGKGIGAVIVSPQGHHIPFTARILFECTNNMAEYEACIFGIEEA 1602
DP N G GI P+ + + L T MAE A F ++A
Sbjct: 410 DPTKS---------NNAGMGIVHATYKPEYQVLNQWSIPLGNHTAQMAEIAAVEFACKKA 460
Query: 1603 IDMRIKHLDIYGDSALVINQIKGEWETHHANLIPYRDYARRLLTYFTKVELHHIPRDENQ 1662
+ + L + DS V E +PY + K L HI + ++
Sbjct: 461 LKIPGPVL-VITDSFYVAESANKE--------LPY--WKSNGFVNNKKKPLKHISKWKS- 508
Query: 1663 MADALATLSSMFRVNHWNDVPIIRVQRLERPSHVF---AIGDVIDQAGENVID---YRPW 1716
+A+ L ++ + H + L+ P + A+ D + G V++ +P
Sbjct: 509 IAECL-SMKPDITIQHEKGI------SLQIPVFILKGNALADKLATQGSYVVNCNTKKPN 561
Query: 1717 YYDIKQFLLSREYPPGASKQDKKTLRRLAGRFLLDGDILYKRNYDMVLLRCVDEHEAEQL 1776
LL Y G KQ FL DG + R + ++ + + +++
Sbjct: 562 LDAELDQLLQGHYIKGYPKQ--------YTYFLEDGKVKVSRPEGVKII--PPQSDRQKI 611
Query: 1777 MHDVHDGTFGTHATGHTMSRKLLRAGYYWMAMEHDCYQYARKCHKCQIYADKIHVPPHAL 1836
+ H+ TG + + Y+W M D + +C +C I L
Sbjct: 612 VLQAHN----LAHTGREATLLKIANLYWWPNMRKDVVKQLGRCQQCLITNASNKASGPIL 667
Query: 1837 NVMSSPWPFSMWGIDMIGRIEPKASNGHRFILVAIDYFT--KWVEAASYTNVTKQVVAKF 1894
PF + ID IG + P S G+ ++LV +D T W+ Y A
Sbjct: 668 RPDRPQKPFDKFFIDYIGPLPP--SQGYLYVLVVVDGMTGFTWL----YPTKAPSTSATV 721
Query: 1895 IKNNIICRYGVPSKIITDNGTNLNNNVVQALCEEFKIEHHNSSPYRPQMNGAVEAANKNI 1954
N++ +P I +D G ++ +E I S+PY PQ VE N +I
Sbjct: 722 KSLNVLTSIAIPKVIHSDQGAAFTSSTFAEWAKERGIHLEFSTPYHPQSGSKVERKNSDI 781
Query: 1955 KRIVQK-MVTTYKDWHEMLPYALHGYRTTVRSSTGATPFSLVYGMEAVLP 2003
KR++ K +V W+++LP T TP L++G+++ P
Sbjct: 782 KRLLTKLLVGRPTKWYDLLPVVQLALNNTYSPVLKYTPHQLLFGIDSNTP 831
>POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 130 bits (327), Expect = 4e-29
Identities = 130/512 (25%), Positives = 217/512 (41%), Gaps = 52/512 (10%)
Query: 1009 LEQEKKAIQPHQEEIELINIGTEENKQEIKIGATLEEG---------VKQKVIQLLREYP 1059
LE KK + Q E +NI T + + +K A L EG + Q+ +Q + E
Sbjct: 159 LESMKKRSKTQQPEP--VNISTNKIENPLKEIAILSEGRRLSEEKLFITQQRMQKIEEL- 215
Query: 1060 DIFAWSYEDMPGLDPMIVEHRIPTKPECPPVRQKLRRTHPDMALKIK---------NEVQ 1110
L+ + E+ + ++ ++ + P A+K+K E
Sbjct: 216 ------------LEKVCSENPLDPNKTKQWMKASIKLSDPSKAIKVKPMKYSPMDREEFD 263
Query: 1111 KQIDAGFLMTVEYPEWVANIVPV-------PKKDGKVRMCVDFRDLNKASPKDNFPLPHI 1163
KQI + V P ++ P K+ GK RM V+++ +NKA+ D + LP+
Sbjct: 264 KQIKELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATIGDAYNLPNK 323
Query: 1164 DVLVDNTAQSKVFSFMDGFSGYNQIKMSPEDREKTSFITPWGTFCYKVMPFGLINAGATY 1223
D L+ K+FS D SG+ Q+ + E R T+F P G + + V+PFGL A + +
Sbjct: 324 DELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIF 383
Query: 1224 QRGMTTLFHDMIHKEVEVYVDDMIVKSTDEEQHVEYLTKMFERLRKYKLRLNPNKCTFGV 1283
QR M F + K VYVDD++V S +EE H+ ++ + ++ ++ + L+ K
Sbjct: 384 QRHMDEAFR-VFRKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFK 442
Query: 1284 RSGKLLGFIVSQKGIEVDPDKVRAIREMP-APQTEKQVRGFLGRLNYISRFISHMTATCG 1342
+ LG + + + + I + P + +KQ++ FLG L Y S +I +
Sbjct: 443 KKINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRK 502
Query: 1343 PIFKLLRKNQPVVWNDECQEAFDSIKNYLLEPPILVPPVEGRPLIMYLAVFDESMGCVLG 1402
P+ L++N P W E +K L P L P+ LI+ D+ G +L
Sbjct: 503 PLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLK 562
Query: 1403 --QQDETGKKEHAIYYLSKKFTDCETRYTMLEKTCCALAWAAKRLRHYLVNHTTWLISRM 1460
+ +E E Y S F E Y +K A+ K+ YL + R
Sbjct: 563 AIKINEGTNTELICRYASGSFKAAERNYHSNDKETLAVINTIKKFSIYLT--PVHFLIRT 620
Query: 1461 DP------IKYIFEKAAVTGKIARWQMLLSEY 1486
D + ++ + G+ RWQ LS Y
Sbjct: 621 DNTHFKSFVNLNYKGDSKLGRNIRWQAWLSHY 652
>POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 129 bits (324), Expect = 9e-29
Identities = 130/516 (25%), Positives = 218/516 (42%), Gaps = 52/516 (10%)
Query: 1005 ITRLLEQEKKAIQPHQEEIELINIGTEENKQEIKIGATLEEG---------VKQKVIQLL 1055
I LE KK + Q E +NI T + + ++ A L EG + Q+ +Q +
Sbjct: 155 IEGFLESMKKRSKTQQPEP--VNISTNKIENPLEEIAILSEGRRLSEEKLFITQQRMQKI 212
Query: 1056 REYPDIFAWSYEDMPGLDPMIVEHRIPTKPECPPVRQKLRRTHPDMALKIK--------- 1106
E L+ + E+ + ++ ++ + P A+K+K
Sbjct: 213 EEL-------------LEKVCSENPLDPNKTKQWMKASIKLSDPSKAIKVKPMKYSPMDR 259
Query: 1107 NEVQKQIDAGFLMTVEYPEWVANIVPV-------PKKDGKVRMCVDFRDLNKASPKDNFP 1159
E KQI + V P ++ P K+ GK RM V+++ +NKA+ D +
Sbjct: 260 EEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATIGDAYN 319
Query: 1160 LPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSPEDREKTSFITPWGTFCYKVMPFGLINA 1219
LP+ D L+ K+FS D SG+ Q+ + E R T+F P G + + V+PFGL A
Sbjct: 320 LPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQA 379
Query: 1220 GATYQRGMTTLFHDMIHKEVEVYVDDMIVKSTDEEQHVEYLTKMFERLRKYKLRLNPNKC 1279
+ +QR M F + K VYVDD++V S +EE H+ ++ + ++ ++ + L+ K
Sbjct: 380 PSIFQRHMDEAFR-VFRKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKA 438
Query: 1280 TFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMP-APQTEKQVRGFLGRLNYISRFISHMT 1338
+ LG + + + + I + P + +KQ++ FLG L Y S +I +
Sbjct: 439 QLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLA 498
Query: 1339 ATCGPIFKLLRKNQPVVWNDECQEAFDSIKNYLLEPPILVPPVEGRPLIMYLAVFDESMG 1398
P+ L++N P W E +K L P L P+ LI+ D+ G
Sbjct: 499 QIRKPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWG 558
Query: 1399 CVLG--QQDETGKKEHAIYYLSKKFTDCETRYTMLEKTCCALAWAAKRLRHYLVNHTTWL 1456
+L + +E E Y S F E Y +K A+ K+ YL
Sbjct: 559 GMLKAIKINEGTNTELICRYASGSFKAAERNYHSNDKETLAVINTIKKFSIYLT--PVHF 616
Query: 1457 ISRMDP------IKYIFEKAAVTGKIARWQMLLSEY 1486
+ R D + ++ + G+ RWQ LS Y
Sbjct: 617 LIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLSHY 652
>POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 129 bits (323), Expect = 1e-28
Identities = 129/512 (25%), Positives = 217/512 (42%), Gaps = 52/512 (10%)
Query: 1009 LEQEKKAIQPHQEEIELINIGTEENKQEIKIGATLEEG---------VKQKVIQLLREYP 1059
LE KK + Q E +NI T + + ++ A L EG + Q+ +Q + E
Sbjct: 159 LESMKKRSKTQQPEP--VNISTNKIENPLEEIAILSEGRRLSEEKLFITQQRMQKIEEL- 215
Query: 1060 DIFAWSYEDMPGLDPMIVEHRIPTKPECPPVRQKLRRTHPDMALKIK---------NEVQ 1110
L+ + E+ + ++ ++ + P A+K+K E
Sbjct: 216 ------------LEKVCSENPLDPNKTKQWMKASIKLSDPSKAIKVKPMKYSPMDREEFD 263
Query: 1111 KQIDAGFLMTVEYPEWVANIVPV-------PKKDGKVRMCVDFRDLNKASPKDNFPLPHI 1163
KQI + V P ++ P K+ GK RM V+++ +NKA+ D + LP+
Sbjct: 264 KQIKELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATVGDAYNLPNK 323
Query: 1164 DVLVDNTAQSKVFSFMDGFSGYNQIKMSPEDREKTSFITPWGTFCYKVMPFGLINAGATY 1223
D L+ K+FS D SG+ Q+ + E R T+F P G + + V+PFGL A + +
Sbjct: 324 DELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIF 383
Query: 1224 QRGMTTLFHDMIHKEVEVYVDDMIVKSTDEEQHVEYLTKMFERLRKYKLRLNPNKCTFGV 1283
QR M F + K VYVDD++V S +EE H+ ++ + ++ ++ + L+ K
Sbjct: 384 QRHMDEAFR-VFRKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFK 442
Query: 1284 RSGKLLGFIVSQKGIEVDPDKVRAIREMP-APQTEKQVRGFLGRLNYISRFISHMTATCG 1342
+ LG + + + + I + P + +KQ++ FLG L Y S +I +
Sbjct: 443 KKINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRK 502
Query: 1343 PIFKLLRKNQPVVWNDECQEAFDSIKNYLLEPPILVPPVEGRPLIMYLAVFDESMGCVLG 1402
P+ L++N P W E +K L P L P+ LI+ D+ G +L
Sbjct: 503 PLQAKLKENVPWRWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLK 562
Query: 1403 --QQDETGKKEHAIYYLSKKFTDCETRYTMLEKTCCALAWAAKRLRHYLVNHTTWLISRM 1460
+ +E E Y S F E Y +K A+ K+ YL + R
Sbjct: 563 AIKINEGTNTELICRYASGSFKAAEKNYHSNDKETLAVINTIKKFSIYLT--PVHFLIRT 620
Query: 1461 DP------IKYIFEKAAVTGKIARWQMLLSEY 1486
D + ++ + G+ RWQ LS Y
Sbjct: 621 DNTHFKSFVNLNYKGDSKLGRNIRWQAWLSHY 652
>YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II
Length = 1268
Score = 125 bits (315), Expect = 1e-27
Identities = 88/297 (29%), Positives = 149/297 (49%), Gaps = 12/297 (4%)
Query: 1041 ATLEEGVKQKVIQLLREYPDIFAWSYEDMPGLDPMIVEHRIPTKPECPPVRQKLRRTHPD 1100
+T E + + L ++P++F +D GL + T+ PV ++ R
Sbjct: 397 STSETEASRLEVMLKNDFPEVF----KDGLGLCTK-EKAEFRTEENAVPVFKRARPVPYG 451
Query: 1101 MALKIKNEVQKQIDAGFLMTVEYPEWVANIVPVPKKD-GKVRMCVDFR--DLNKASPKDN 1157
++ E+ + + G ++ + Y +W A IV + KK GK+R+C DF+ LN A +
Sbjct: 452 SLEAVETELNRLQEMGVIVPITYAKWAAPIVVIKKKGTGKIRVCADFKCSGLNAALKDEF 511
Query: 1158 FPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSPEDREKTSFITPWGTFCYKVMPFGLI 1217
PLP + + + V+S +D Y Q+++ E ++ T G F Y M FGL
Sbjct: 512 HPLPTSEDIFSRL-KGTVYSQIDLKDAYLQVELDEEAQKLAVINTHRGIFKYLRMTFGLK 570
Query: 1218 NAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSTDEEQHVEYLTKMFERLRKYKLRLNPN 1277
A A++Q+ M + + V VY DD+I+ ++ E+H + L ++FER ++Y R++
Sbjct: 571 PAPASFQKIMDKMVSGLTG--VAVYWDDIIISASSIEEHEKILRELFERFKEYGFRVSAE 628
Query: 1278 KCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMPAPQTEKQVRGFLGRLNYISRFI 1334
KC F + LGF V + G D K AIR M AP +KQ+ FLG +++SR +
Sbjct: 629 KCAFAQKQVTFLGF-VDEHGRRPDSKKTEAIRSMKAPTDQKQLASFLGAADWLSRMM 684
Score = 89.7 bits (221), Expect = 8e-17
Identities = 80/320 (25%), Positives = 137/320 (42%), Gaps = 44/320 (13%)
Query: 1741 LRRLAGRFLLDGDILYKRNYDMVLLRCVDEHEAEQLMHDVHDGTFGTHATGHTMSRKLLR 1800
L+ + G LLD ++ ++ ++L+ +H+ H G ++ R
Sbjct: 763 LKLIHGCLLLDDRVIVPKSLQKIVLK---------QLHEGHPGI--------VQMKQKAR 805
Query: 1801 AGYYWMAMEHDCYQYARKCHKCQIYADKIHVPPHALNVMSSPWPF--SMWG---IDMIGR 1855
+ +W ++ D R C+ CQ + V P +PWP + W ID G
Sbjct: 806 SFVFWRGLDSDIENMVRHCNNCQENSKMPRVVP------LNPWPVPEAPWKRIHIDFAGP 859
Query: 1856 IEPKASNGHRFILVAIDYFTKWVEAASYTNVTKQVVAKFIKNNIICRYGVPSKIITDNGT 1915
+ NG ++LV +D TK+ E T V + I +G P II+DNGT
Sbjct: 860 L-----NGC-YLLVVVDAKTKYAEV-KLTRSISAVTTIDLLEEIFSIHGYPETIISDNGT 912
Query: 1916 NLNNNVVQALCEEFKIEHHNSSPYRPQMNGAVEAANKNIKRIVQKMVTTYKDWHEMLPYA 1975
L +++ +C+ IEH S+ Y P+ NGA E +KR + K+ ++L
Sbjct: 913 QLTSHLFAQMCQSHGIEHKTSAVYYPRSNGAAERFVDTLKRGIAKIKGEGSVNQQILNKF 972
Query: 1976 LHGYRTTVRSS-TGATPFSLVYGMEAVLPLEVEIPSLRVIMEAKLSEAEWCQSRYDQLNL 2034
L YR T S+ G+TP +G + + + +P+ RV+ KL++ Q N+
Sbjct: 973 LISYRNTPHSALNGSTPAECHFGRKIRTTMSLLMPTDRVLKVPKLTQY--------QQNM 1024
Query: 2035 IEEKRMDAMARGQSYQARMK 2054
+ AR +++Q K
Sbjct: 1025 KHHYELRNGARAKAFQVNQK 1044
>POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 674
Score = 124 bits (312), Expect = 2e-27
Identities = 110/422 (26%), Positives = 184/422 (43%), Gaps = 28/422 (6%)
Query: 1090 VRQKLRRTHPDMALKIK---------NEVQKQIDAGFLMTVEYPEWVANIVPV------- 1133
++ ++ + P A+K+K E KQI + V P ++ P
Sbjct: 229 MKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEA 288
Query: 1134 PKKDGKVRMCVDFRDLNKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSPE 1193
K+ GK RM V+++ +NKA+ D + P+ D L+ K+FS D SG+ Q+ + E
Sbjct: 289 EKRRGKKRMVVNYKAMNKATVGDAYNPPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQE 348
Query: 1194 DREKTSFITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSTDE 1253
R T+F P G + + V+PFGL A + +QR M F + K VYVDD++V S +E
Sbjct: 349 SRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFR-VFRKFCCVYVDDILVFSNNE 407
Query: 1254 EQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMP- 1312
E H+ ++ + ++ ++ + L+ K + LG + + + + I + P
Sbjct: 408 EDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPD 467
Query: 1313 APQTEKQVRGFLGRLNYISRFISHMTATCGPIFKLLRKNQPVVWNDECQEAFDSIKNYLL 1372
+ +KQ++ FLG L Y S +I + P+ L++N P W E +K L
Sbjct: 468 TLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKNLQ 527
Query: 1373 EPPILVPPVEGRPLIMYLAVFDESMGCVLG--QQDETGKKEHAIYYLSKKFTDCETRYTM 1430
P L P+ LI+ D+ G +L + +E E Y S F E Y
Sbjct: 528 GFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAEKNYHS 587
Query: 1431 LEKTCCALAWAAKRLRHYLVNHTTWLISRMDP------IKYIFEKAAVTGKIARWQMLLS 1484
+K A+ K+ YL + R D + ++ + G+ RWQ LS
Sbjct: 588 NDKETLAVINTIKKFSIYLT--PVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLS 645
Query: 1485 EY 1486
Y
Sbjct: 646 HY 647
>POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 680
Score = 124 bits (310), Expect = 4e-27
Identities = 126/503 (25%), Positives = 215/503 (42%), Gaps = 34/503 (6%)
Query: 1009 LEQEKKAIQPHQEEIELINIGTEENKQEIKIGATLEEGVKQKVIQLLREYPDIFAWSYED 1068
LE KK + Q E +NI T + + ++ A L EG + +L + E+
Sbjct: 160 LESMKKRSKTQQPEP--VNISTNKIENPLEEIAILSEGRRLSEEKLFITQQRM--QKTEE 215
Query: 1069 MPGLDPMIVEHRIPTKPECPPVRQKLRRTHPDMALKIK---------NEVQKQIDAGFLM 1119
+ L+ + E+ + ++ ++ + P A+K+K E KQI +
Sbjct: 216 L--LEKVCSENPLDPNKTKQWMKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDL 273
Query: 1120 TVEYPEWVANIVPV-------PKKDGKVRMCVDFRDLNKASPKDNFPLPHIDVLVDNTAQ 1172
V P ++ P G RM V+++ +NKA+ D + LP+ D L+
Sbjct: 274 KVIKPSKSPHMAPAFLVNNEAENGRGNKRMVVNYKAMNKATVGDAYNLPNKDELLTLIRG 333
Query: 1173 SKVFSFMDGFSGYNQIKMSPEDREKTSFITPWGTFCYKVMPFGLINAGATYQRGMTTLFH 1232
K+FS D SG+ Q+ + E R T+F P G + + V+PFGL A + +QR M F
Sbjct: 334 KKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFR 393
Query: 1233 DMIHKEVEVYVDDMIVKSTDEEQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFI 1292
+ K VYVDD++V S +EE H+ ++ + ++ ++ + L+ K + LG
Sbjct: 394 -VFRKFCCVYVDDIVVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLE 452
Query: 1293 VSQKGIEVDPDKVRAIREMP-APQTEKQVRGFLGRLNYISRFISHMTATCGPIFKLLRKN 1351
+ + + + I + P + +KQ++ FLG L Y S +I ++ P+ L++N
Sbjct: 453 IDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPNLAQMRQPLQAKLKEN 512
Query: 1352 QPVVWNDECQEAFDSIKNYLLEPPILVPPVEGRPLIMYLAVFDESMGCVLG--QQDETGK 1409
P W E +K L P L P+ LI+ D+ G +L + +E
Sbjct: 513 VPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTN 572
Query: 1410 KEHAIYYLSKKFTDCETRYTMLEKTCCALAWAAKRLRHYLVNHTTWLISRMDP------I 1463
E Y S F E Y +K A+ K+ YL + R D +
Sbjct: 573 TELICRYRSGSFKAAERNYHSNDKETLAVINTIKKFSIYLT--PVHFLIRTDNTHFKSFV 630
Query: 1464 KYIFEKAAVTGKIARWQMLLSEY 1486
++ + G+ RWQ LS Y
Sbjct: 631 NLNYKGDSKLGRNIRWQAWLSHY 653
>POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 659
Score = 118 bits (296), Expect = 2e-25
Identities = 101/387 (26%), Positives = 167/387 (43%), Gaps = 13/387 (3%)
Query: 1135 KKDGKVRMCVDFRDLNKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSPED 1194
++ GK RM V+++ +NKA+ D LP+ D L+ K++S D SG Q+ + E
Sbjct: 277 RRRGKKRMVVNYKAMNKATKGDAHNLPNKDELLTLVRGKKIYSSFDCKSGLWQVLLDKES 336
Query: 1195 REKTSFITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIV-KSTDE 1253
+ T+F P G + + V+PFGL A + + + + K VYVDD++V +T
Sbjct: 337 QLLTAFTCPQGHYQWNVVPFGLKQAPSIFPKTYANSHSNQYSKYCCVYVDDILVFSNTGR 396
Query: 1254 EQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMP- 1312
++H ++ + R K + L+ K LG + Q + I + P
Sbjct: 397 KEHYIHVLNILRRCEKLGIILSKKKAQLFKEKINFLGLEIDQGTHCPQNHILEHIHKFPD 456
Query: 1313 APQTEKQVRGFLGRLNYISRFISHMTATCGPIFKLLRKNQPVVWNDECQEAFDSIKNYLL 1372
+ +KQ++ FLG L Y S +I + + P+ L+++ WND + IK L
Sbjct: 457 RIEDKKQLQRFLGILTYASDYIPKLASIRKPLQSKLKEDSTWTWNDTDSQYMAKIKKNLK 516
Query: 1373 EPPILVPPVEGRPLIMYLAVFDESMGCVLGQQDETGKKEHAIYYLSKKFTDCETRYTMLE 1432
P L P L++ +E G +L + E+ Y S F E Y E
Sbjct: 517 SFPKLYHPEPNDKLVIETDASEEFWGGIL--KAIHNSHEYICRYASGSFKAAERNYHSNE 574
Query: 1433 KTCCALAWAAKRLRHYLVNHTTWLISRMDP------IKYIFEKAAVTGKIARWQMLLSEY 1486
K A+ K+ YL + + R D + + G++ RWQM LS+Y
Sbjct: 575 KELLAVIRVIKKFSIYLT--PSRFLIRTDNKNFTHFVNINLKGDRKQGRLVRWQMWLSQY 632
Query: 1487 DIVFKTQKAIKGSILADHLAYQPLDDY 1513
D + K ++ AD L L +Y
Sbjct: 633 DFDVEHIAGTK-NVFADFLQENTLTNY 658
>POL_RTBVP (P27502) Polyprotein (P194 protein) [Contains: Coat
protein; Protease (EC 3.4.23.-); Reverse transcriptase
(EC 2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
Length = 1675
Score = 114 bits (284), Expect = 4e-24
Identities = 83/316 (26%), Positives = 157/316 (49%), Gaps = 8/316 (2%)
Query: 1139 KVRMCVDFRDLNKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSPEDREKT 1198
K R+ +++ LN D F +PH +++ ++ +FS D +G++ +K+ + ++ T
Sbjct: 1238 KPRIVYNYKRLNDNMHTDPFNIPHKISMINLIQKANIFSKFDLKAGFHHMKLKDDFKDWT 1297
Query: 1199 SFITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSTDEEQHVE 1258
+F G + + V PFG+ NA +QR M F D+ K +Y+DD+++ S +E++H+E
Sbjct: 1298 TFTCSEGLYTWNVCPFGIANAPCAFQRFMQESFGDL--KFALLYIDDILIASNNEKEHIE 1355
Query: 1259 YLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMPAPQ--T 1316
+L F R+++ L+ K ++ + LG + + I + P V I++ + T
Sbjct: 1356 HLKIFFNRVKEVGCVLSKKKSKMFLKEVEYLGVEIKEGKISLQPHIVDKIKKFDKNKLNT 1415
Query: 1317 EKQVRGFLGRLNYISRFISHMTATCGPIFKLLRKNQPVVWNDECQEAFDSIKNYLLEPPI 1376
K ++ +LG LNY +I ++ GP++K KN ++N E I+ + +
Sbjct: 1416 LKGLQAYLGLLNYARGYIKDLSKLVGPLYKKTGKNGQRIFNKEDWNIIFKIEREVSKIKP 1475
Query: 1377 LVPPVEGRPLIMYLAVFDESMGCVLGQQDE--TGKKEHAIY-YLSKKFTDCETRYTMLEK 1433
L P E +I+ +E G VL + + +GK I Y S F + +T +T L+
Sbjct: 1476 LERPKETDYIIIETDASEEGWGAVLVCKPDKYSGKDTEKIAGYASGNFGEKKT-WTSLDY 1534
Query: 1434 TCCALAWAAKRLRHYL 1449
A+ A + + YL
Sbjct: 1535 EIEAINEALNKFQIYL 1550
>POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 666
Score = 110 bits (276), Expect = 3e-23
Identities = 101/377 (26%), Positives = 164/377 (42%), Gaps = 10/377 (2%)
Query: 1135 KKDGKVRMCVDFRDLNKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSPED 1194
++ GK RM V+++ +N+A+ D+ LP++ L+ +FS D SG+ Q+ + E
Sbjct: 288 RRRGKKRMVVNYKAINQATIGDSHNLPNMQELLTLLRGKSIFSSFDCKSGFWQVVLDEES 347
Query: 1195 REKTSFITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSTDEE 1254
++ T+F P G F +KV+PFGL A + +QR M T + K VYVDD+IV S E
Sbjct: 348 QKLTAFTCPQGHFQWKVVPFGLKQAPSIFQRHMQTALNG-ADKFCMVYVDDIIVFSNSEL 406
Query: 1255 QHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKV-RAIREMP- 1312
H ++ + + + KY + L+ K LG + KG + + I + P
Sbjct: 407 DHYNHVYAVLKIVEKYGIILSKKKANLFKEKINFLGLEI-DKGTHCPQNHILENIHKFPD 465
Query: 1313 APQTEKQVRGFLGRLNYISRFISHMTATCGPIFKLLRKNQPVVWNDECQEAFDSIKNYLL 1372
+ +K ++ FLG L Y +I + P+ L+K+ W + IK L
Sbjct: 466 RLEDKKHLQRFLGVLTYAETYIPKLAEIRKPLQVKLKKDVTWNWTQSDSDYVKKIKKNLG 525
Query: 1373 EPPILVPPVEGRPLIMYLAVFDESMGCVLGQQDETGKKEHAIYYLSKKFTDCETRYTMLE 1432
P L P LI+ D G VL + G E Y S F E Y +
Sbjct: 526 SFPKLYLPKPEDHLIIETDASDSFWGGVLKARALDG-VELICRYSSGSFKQAEKNYHSND 584
Query: 1433 KTCCALAWAAKRLRHYLVNHTTWLISRMDPIKYI----FEKAAVTGKIARWQMLLSEYDI 1488
K A+ + YL + + Y + + G++ RWQ S+Y
Sbjct: 585 KELLAVKQVITKFSAYLTPVRFTVRTDNKNFTYFLRINLKGDSKQGRLVRWQNWFSKYQF 644
Query: 1489 VFKTQKAIKGSILADHL 1505
+ + +K ++LAD L
Sbjct: 645 DVEHLEGVK-NVLADCL 660
>POL_GALV (P21414) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1165
Score = 107 bits (268), Expect = 3e-22
Identities = 96/392 (24%), Positives = 167/392 (42%), Gaps = 48/392 (12%)
Query: 1086 ECPPVRQKLRRTHPDMALK-----------IKNEVQKQIDAGFLMTVEYPEWVANIVPVP 1134
+ PPV +LR +A++ I+ +QK +D G L+ P W ++PV
Sbjct: 161 QVPPVVVELRSGASPVAVRQYPMSKEAREGIRPHIQKFLDLGVLVPCRSP-WNTPLLPVK 219
Query: 1135 KKD-GKVRMCVDFRDLNKASPKDNFPLPHIDVLVDNTAQSKV-FSFMDGFSGYNQIKMSP 1192
K R D R++NK + +P+ L+ + S +S +D + +++ P
Sbjct: 220 KPGTNDYRPVQDLREINKRVQDIHPTVPNPYNLLSSLPPSYTWYSVLDLKDAFFCLRLHP 279
Query: 1193 EDREKTSFITPW--------GTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEV--- 1241
+ +F W G + +P G N+ TLF + +H+++
Sbjct: 280 NSQPLFAF--EWKDPEKGNTGQLTWTRLPQGFKNS--------PTLFDEALHRDLAPFRA 329
Query: 1242 ---------YVDDMIVKSTDEEQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFI 1292
YVDD++V + E + K+ + L K R++ K R LG++
Sbjct: 330 LNPQVVLLQYVDDLLVAAPTYEDCKKGTQKLLQELSKLGYRVSAKKAQLCQREVTYLGYL 389
Query: 1293 VSQKGIEVDPDKVRAIREMPAPQTEKQVRGFLGRLNYISRFISHMTATCGPIFKLLRKNQ 1352
+ + + P + + ++P P T +QVR FLG + +I + P++ L +++
Sbjct: 390 LKEGKRWLTPARKATVMKIPVPTTPRQVREFLGTAGFCRLWIPGFASLAAPLYPLTKESI 449
Query: 1353 PVVWNDECQEAFDSIKNYLLEPPILVPPVEGRPLIMYLAVFDESMGCVLGQQDET-GKKE 1411
P +W +E Q+AFD IK LL P L P +P +Y+ DE G G +T G
Sbjct: 450 PFIWTEEHQQAFDHIKKALLSAPALALPDLTKPFTLYI---DERAGVARGVLTQTLGPWR 506
Query: 1412 HAIYYLSKKFTDCETRYTMLEKTCCALAWAAK 1443
+ YLSKK + + K A+A K
Sbjct: 507 RPVAYLSKKLDPVASGWPTCLKAVAAVALLLK 538
Score = 85.9 bits (211), Expect = 1e-15
Identities = 78/264 (29%), Positives = 118/264 (44%), Gaps = 40/264 (15%)
Query: 1844 PFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYTNVTKQVVAKFIKNNIICRY 1903
P W +D I+P G++++LV ID F+ WVEA T +V K I I+ R+
Sbjct: 876 PGVYWEVDFT-EIKP-GRYGNKYLLVFIDTFSGWVEAFPTKTETALIVCKKILEEILPRF 933
Query: 1904 GVPSKIITDNGTNLNNNVVQALCEEFKIEHHNSSPYRPQMNGAVEAANKNIKRIVQKMV- 1962
G+P + +DNG V Q L + I YRPQ +G VE N+ IK + K+
Sbjct: 934 GIPKVLGSDNGPAFVAQVSQGLATQLGINWKLHCAYRPQSSGQVERMNRTIKETLTKLAL 993
Query: 1963 -TTYKDWHEMLPYALHGYRTTVRSSTGATPFSLVYG--------MEAVLPLEVEIPSLRV 2013
T KDW +LP AL R T G TP+ ++YG E + P + +P L
Sbjct: 994 ETGGKDWVTLLPLALLRARNT-PGRFGLTPYEILYGGPPPILESGETLGPDDRFLPVLFT 1052
Query: 2014 IMEAKLSEAEWCQSRYDQLNLIEEKRMDAMARGQSYQARMKTAFDKKVHPREFKVGELVL 2073
++A L ++ + D + + Y+ T P F+VG+ VL
Sbjct: 1053 HLKA--------------LEIVRTQIWDQIK--EVYKPGTVTI------PHPFQVGDQVL 1090
Query: 2074 KRKISQQPDPRGKWTPNYEGPYVV 2097
R+ +P P ++GPY+V
Sbjct: 1091 VRR--HRP---SSLEPRWKGPYLV 1109
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.340 0.149 0.494
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 231,277,629
Number of Sequences: 164201
Number of extensions: 9571975
Number of successful extensions: 33042
Number of sequences better than 10.0: 136
Number of HSP's better than 10.0 without gapping: 119
Number of HSP's successfully gapped in prelim test: 17
Number of HSP's that attempted gapping in prelim test: 32642
Number of HSP's gapped (non-prelim): 300
length of query: 2129
length of database: 59,974,054
effective HSP length: 126
effective length of query: 2003
effective length of database: 39,284,728
effective search space: 78687310184
effective search space used: 78687310184
T: 11
A: 40
X1: 15 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 39 (21.9 bits)
S2: 74 (33.1 bits)
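
The derived figures in the statistics block above can be reproduced from the reported inputs. The following is a minimal sketch, not NCBI code: the function names are hypothetical, and the formulas are the standard Karlin-Altschul conversions, assumed here because they agree with the printed values. It further assumes that X1 and S1 use the ungapped Lambda/K pair and that X2, X3 and S2 use the gapped pair, which is what the printed bit values suggest. Lambda and K are printed to three significant digits, so results are only expected to match after rounding.

```python
import math

# Values copied from the statistics block of this report.
QUERY_LEN = 2129          # length of query
DB_LETTERS = 59_974_054   # length of database
DB_SEQS = 164_201         # number of sequences in database
HSP_LEN = 126             # effective HSP length

UNGAPPED = (0.340, 0.149)   # (Lambda, K) from the first Lambda/K/H row
GAPPED = (0.267, 0.0410)    # (Lambda, K) from the "Gapped" row


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space)."""
    eff_query = query_len - hsp_len
    # The HSP length correction is applied once per database sequence.
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def raw_to_bits(raw, lam, k):
    """Convert a raw score to bits: (Lambda * S - ln K) / ln 2."""
    return (lam * raw - math.log(k)) / math.log(2)


def dropoff_to_bits(raw, lam):
    """Convert a raw X drop-off to bits: Lambda * X / ln 2 (no K term)."""
    return lam * raw / math.log(2)


if __name__ == "__main__":
    print(effective_search_space(QUERY_LEN, DB_LETTERS, DB_SEQS, HSP_LEN))
    print(round(dropoff_to_bits(15, UNGAPPED[0]), 1))   # X1
    print(round(dropoff_to_bits(38, GAPPED[0]), 1))     # X2
    print(round(dropoff_to_bits(64, GAPPED[0]), 1))     # X3
    print(round(raw_to_bits(39, *UNGAPPED), 1))         # S1
    print(round(raw_to_bits(74, *GAPPED), 1))           # S2
```

The same `raw_to_bits` conversion applies to the raw scores shown in parentheses on each "Score =" line, although the rounded Lambda and K printed here may shift the result by a fraction of a bit relative to the reported value.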