
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC135162.7 + phase: 0 /pseudo
(2175 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E Value
POL3_DROME (P04323) Retrovirus-related Pol polyprotein from tran... 231 2e-59
POL2_DROME (P20825) Retrovirus-related Pol polyprotein from tran... 218 1e-55
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran... 207 3e-52
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III 203 4e-51
POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from tran... 199 7e-50
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei... 190 4e-47
RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protei... 187 2e-46
RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protei... 187 2e-46
POLY_DROME (P10401) Retrovirus-related Pol polyprotein from tran... 183 4e-45
POL_FOAMV (P14350) Pol polyprotein [Contains: Reverse transcript... 162 1e-38
POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic pro... 126 8e-28
POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic pro... 126 8e-28
POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic pro... 126 8e-28
YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II 124 4e-27
POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic pro... 124 4e-27
POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic pro... 122 1e-26
POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic prot... 116 6e-25
POL_RTBVP (P27502) Polyprotein (P194 protein) [Contains: Coat pr... 113 7e-24
POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic prot... 110 6e-23
POL_GALV (P21414) Pol polyprotein [Contains: Protease (EC 3.4.23... 108 2e-22
>POL3_DROME (P04323) Retrovirus-related Pol polyprotein from
transposon 17.6 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1058
Score = 231 bits (589), Expect = 2e-59
Identities = 134/415 (32%), Positives = 223/415 (53%), Gaps = 14/415 (3%)
Query: 1150 KIKNEVQKQIDAGFLMTVEYPEWVANIVPVPKKDG-----KVRMCVDFRDLNKASPKDNF 1204
++++++Q ++ G + T P + + I VPKK K R+ +D+R LN+ + D
Sbjct: 222 EVESQIQDMLNQGIIRTSNSP-YNSPIWVVPKKQDASGKQKFRIVIDYRKLNEITVGDRH 280
Query: 1205 PLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSPEDREKTSFITPWGTFCYKVMPFGLIN 1264
P+P++D ++ + F+ +D G++QI+M PE KT+F T G + Y MPFGL N
Sbjct: 281 PIPNMDEILGKLGRCNYFTTIDLAKGFHQIEMDPESVSKTAFSTKHGHYEYLRMPFGLKN 340
Query: 1265 AGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSADEEQHVEYLTKMFERLRKYKLRLNPNK 1324
A AT+QR M + +++K VY+DD+IV S ++H++ L +FE+L K L+L +K
Sbjct: 341 APATFQRCMNDILRPLLNKHCLVYLDDIIVFSTSLDEHLQSLGLVFEKLAKANLKLQLDK 400
Query: 1325 CTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMPAPQTEKQVRGFLGRLNYISRFISHMT 1384
C F + LG +++ GI+ +P+K+ AI++ P P K+++ FLG Y +FI +
Sbjct: 401 CEFLKQETTFLGHVLTPDGIKPNPEKIEAIQKYPIPTKPKEIKAFLGLTGYYRKFIPNFA 460
Query: 1385 ATCGPIFKLLRKNQPI-VWNDECQEAFDSIKNYLLEPPILVPPVEGRPLIMYLAVFDESM 1443
P+ K L+KN I N E AF +K + E PIL P + + D ++
Sbjct: 461 DIAKPMTKCLKKNMKIDTTNPEYDSAFKKLKYLISEDPILKVPDFTKKFTLTTDASDVAL 520
Query: 1444 GCVLGQQDETGKKEHAIYYLSKKFTDCETRYTMLEKTCCALAWATKRLRHYLVNHTTWLI 1503
G VL Q H + Y+S+ + E Y+ +EK A+ WATK RHYL+ +
Sbjct: 521 GAVLSQDG------HPLSYISRTLNEHEINYSTIEKELLAIVWATKTFRHYLLGRHFEIS 574
Query: 1504 SRMDPIKYIFEKAAVTGKIARWQMLLSEYDIVFKTQKAIKGSILADHLAYQPLDD 1558
S P+ +++ K+ RW++ LSE+D K K K + +AD L+ L++
Sbjct: 575 SDHQPLSWLYRMKDPNSKLTRWRVKLSEFDFDIKYIKG-KENCVADALSRIKLEE 628
>POL2_DROME (P20825) Retrovirus-related Pol polyprotein from
transposon 297 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1059
Score = 218 bits (555), Expect = 1e-55
Identities = 133/437 (30%), Positives = 223/437 (50%), Gaps = 17/437 (3%)
Query: 1123 VEHRIPTKPECPPVRQK--LRRTHPDMALKIKNEVQKQIDAGFLMT----VEYPEWVANI 1176
++H + T P ++ L +TH ++++N+VQ+ ++ G + P WV
Sbjct: 195 IKHVLNTTHNSPIYSKQYPLAQTHE---IEVENQVQEMLNQGLIRESNSPYNSPTWVVPK 251
Query: 1177 VPVPKKDGKVRMCVDFRDLNKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKM 1236
P K R+ +D+R LN+ + D +P+P++D ++ + + F+ +D G++QI+M
Sbjct: 252 KPDASGANKYRVVIDYRKLNEITIPDRYPIPNMDEILGKLGKCQYFTTIDLAKGFHQIEM 311
Query: 1237 SPEDREKTSFITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKS 1296
E KT+F T G + Y MPFGL NA AT+QR M + +++K VY+DD+I+ S
Sbjct: 312 DEESISKTAFSTKSGHYEYLRMPFGLRNAPATFQRCMNNILRPLLNKHCLVYLDDIIIFS 371
Query: 1297 ADEEQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIRE 1356
+H+ + +F +L L+L +KC F + LG IV+ GI+ +P KV+AI
Sbjct: 372 TSLTEHLNSIQLVFTKLADANLKLQLDKCEFLKKEANFLGHIVTPDGIKPNPIKVKAIVS 431
Query: 1357 MPAPQTEKQVRGFLGRLNYISRFISHMTATCGPIFKLLRKNQPI-VWNDECQEAFDSIKN 1415
P P +K++R FLG Y +FI + P+ L+K I E EAF+ +K
Sbjct: 432 YPIPTKDKEIRAFLGLTGYYRKFIPNYADIAKPMTSCLKKRTKIDTQKLEYIEAFEKLKA 491
Query: 1416 YLLEPPILVPPVEGRPLIMYLAVFDESMGCVLGQQDETGKKEHAIYYLSKKFTDCETRYT 1475
++ PIL P + ++ + ++G VL Q H I ++S+ D E Y+
Sbjct: 492 LIIRDPILQLPDFEKKFVLTTDASNLALGAVLSQNG------HPISFISRTLNDHELNYS 545
Query: 1476 MLEKTCCALAWATKRLRHYLVNHTTWLISRMDPIKYIFEKAAVTGKIARWQMLLSEYDIV 1535
+EK A+ WATK RHYL+ + S P++++ K+ RW++ LSEY
Sbjct: 546 AIEKELLAIVWATKTFRHYLLGRQFLIASDHQPLRWLHNLKEPGAKLERWRVRLSEYQFK 605
Query: 1536 FKTQKAIKGSILADHLA 1552
K + S+ AD L+
Sbjct: 606 IDYIKGKENSV-ADALS 621
>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
transposon 412 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1237
Score = 207 bits (527), Expect = 3e-52
Identities = 139/460 (30%), Positives = 228/460 (49%), Gaps = 13/460 (2%)
Query: 1099 NSLREYPDIFAWSYEDMPGLDPMIVEHRIPTKPECPPVRQKLRRTHPDMALKIKNEVQKQ 1158
N EY DIFA E P + + ++ K + P + R H + +I+ +VQK
Sbjct: 281 NICSEYIDIFA--LESEPITVNNLYKQQLRLKDDEPVYTKNYRSPHSQVE-EIQAQVQKL 337
Query: 1159 IDAGFLMTVEYPEWVANIVPVPKKDG------KVRMCVDFRDLNKASPKDNFPLPHIDVL 1212
I ++ ++ + ++ VPKK K R+ +D+R +NK D FPLP ID +
Sbjct: 338 IKDK-IVEPSVSQYNSPLLLVPKKSSPNSDKKKWRLVIDYRQINKKLLADKFPLPRIDDI 396
Query: 1213 VDNTAQSKVFSFMDGFSGYNQIKMSPEDREKTSFITPWGTFCYKVMPFGLINAGATYQRG 1272
+D ++K FS +D SG++QI++ R+ TSF T G++ + +PFGL A ++QR
Sbjct: 397 LDQLGRAKYFSCLDLMSGFHQIELDEGSRDITSFSTSNGSYRFTRLPFGLKIAPNSFQRM 456
Query: 1273 MTTLFHDMIHKEVEVYVDDMIVKSADEEQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSG 1332
MT F + + +Y+DD+IV E+ ++ LT++F + R+Y L+L+P KC+F +
Sbjct: 457 MTIAFSGIEPSQAFLYMDDLIVIGCSEKHMLKNLTEVFGKCREYNLKLHPEKCSFFMHEV 516
Query: 1333 KLLGFIVSQKGIEVDPDKVRAIREMPAPQTEKQVRGFLGRLNYISRFISHMTATCGPIFK 1392
LG + KGI D K I+ P P R F+ NY RFI + I +
Sbjct: 517 TFLGHKCTDKGILPDDKKYDVIQNYPVPHDADSARRFVAFCNYYRRFIKNFADYSRHITR 576
Query: 1393 LLRKNQPIVWNDECQEAFDSIKNYLLEPPILVPPVEGRPLIMYLAVFDESMGCVLGQQDE 1452
L +KN P W DECQ+AF +K+ L+ P +L P + + ++ G VL Q
Sbjct: 577 LCKKNVPFEWTDECQKAFIHLKSQLINPTLLQYPDFSKEFCITTDASKQACGAVLTQNH- 635
Query: 1453 TGKKEHAIYYLSKKFTDCETRYTMLEKTCCALAWATKRLRHYLVNHTTWLISRMDPIKYI 1512
+ + Y S+ FT E+ + E+ A+ WA R Y+ + + P+ Y+
Sbjct: 636 -NGHQLPVAYASRAFTKGESNKSTTEQELAAIHWAIIHFRPYIYGKHFTVKTDHRPLTYL 694
Query: 1513 FEKAAVTGKIARWQMLLSEYDIVFKTQKAIKGSILADHLA 1552
F + K+ R ++ L EY+ + K K + +AD L+
Sbjct: 695 FSMVNPSSKLTRIRLELEEYNFTVEYLKG-KDNHVADALS 733
Score = 121 bits (304), Expect = 2e-26
Identities = 103/387 (26%), Positives = 172/387 (43%), Gaps = 30/387 (7%)
Query: 1798 GNILYKRNYDMVLLRCV----DEHEAEQLMHDVHDGTFGTHATGHTMSRKLLRAGYYWMA 1853
GN + K N + LL V +E E E ++ +HD TG T + ++ YYW
Sbjct: 869 GNKILK-NLKVALLNPVTQINNEKEKEAILSTLHDDPIQGGHTGITKTLAKVKRHYYWKN 927
Query: 1854 MEHDCYQHARKCHKCQIYADKIHVPPHALNVMSSPWPFSMWGIDMIGRIEPKASNGHRFI 1913
M ++ RKC KCQ H + F +D IG + PK+ NG+ +
Sbjct: 928 MSKYIKEYVRKCQKCQKAKTTKHTKTPMTITETPEHAFDRVVVDTIGPL-PKSENGNEYA 986
Query: 1914 LVAIDYFTKWVEAASYTNVTKQVVAKFIKNNIICRYGVPSKIITDNGTNLNNNVVQALCE 1973
+ I TK++ A N + + VAK I + I +YG ITD GT N+++ LC+
Sbjct: 987 VTLICDLTKYLVAIPIANKSAKTVAKAIFESFILKYGPMKTFITDMGTEYKNSIITDLCK 1046
Query: 1974 EFKIEHHNSSPYRPQMNGAVEAANKNIKRIVQKMVTTYK-DWHEMLPYALHGYRTTVRSS 2032
KI++ S+ + Q G VE +++ + ++ ++T K DW L Y ++ + TT
Sbjct: 1047 YLKIKNITSTAHHHQTVGVVERSHRTLNEYIRSYISTDKTDWDVWLQYFVYCFNTTQSMV 1106
Query: 2033 TGATPFSLVYGMEAVLPLEVEIPSLRVIMEAKLSEAEWCQSRYDQLNLIEEKRMDAMARG 2092
P+ LV+G + LP KL E + D + + A AR
Sbjct: 1107 HNYCPYELVFGRTSNLPKHFN----------KLHSIEPIYNIDDYAKESKYRLEVAYARA 1156
Query: 2093 ----QSYQARMKTAFDKKVHPREFKVGELVLKRRISQQPDPRGKWTPNYEGPYVVKK-AF 2147
++++ + K +D KV E +VG+ VL R + K Y GPY ++
Sbjct: 1157 RKLLEAHKEKNKENYDLKVKDIELEVGDKVLLRN-----EVGHKLDFKYTGPYKIESIGD 1211
Query: 2148 SGGALILTHMDGIELPNPVNADIVKKY 2174
+ +LT+ + ++ V+ D +KK+
Sbjct: 1212 NNNITLLTNKNKKQI---VHKDRLKKF 1235
>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
Length = 2186
Score = 203 bits (517), Expect = 4e-51
Identities = 125/467 (26%), Positives = 236/467 (49%), Gaps = 11/467 (2%)
Query: 1092 GLNRRSSNSLREYPDIFAWSYEDMPGLDPMIVEHRIPTKPECPPVRQKLRRTHPDMALKI 1151
G +R+ + + ++ D+FA S +++ E I K P+RQK R + +I
Sbjct: 901 GDDRKIWDVIEQFQDVFAISDDELGRNSG--TECVIELKEGAEPIRQKPRPIPLALKPEI 958
Query: 1152 KNEVQKQIDAGFLMTVEYPEWVANIVPVPKKDGKVRMCVDFRDLNKASPKDNFPLPHIDV 1211
+ +QK ++ + + P W + +V V KKDG +RMC+D+R +NK + PLP+I+
Sbjct: 959 RKMIQKMLNQKVIRESKSP-WSSPVVLVKKKDGSIRMCIDYRKVNKVVKNNAHPLPNIEA 1017
Query: 1212 LVDNTAQSKVFSFMDGFSGYNQIKMSPEDREKTSFITPWGTFCYKVMPFGLINAGATYQR 1271
+ + A K+++ D +G+ QI + + +E T+F F + V+PFGL+ + A +Q
Sbjct: 1018 TLQSLAGKKLYTVFDMIAGFWQIPLDEKSKEITAFAIGSELFEWNVLPFGLVISPALFQG 1077
Query: 1272 GMTTLFHDMIHKEVEVYVDDMIVKSADEEQHVEYLTKMFERLRKYKLRLNPNKCTFGVRS 1331
M + D++ VYVDD+++ S D EQH++ + + R+RK ++L +KC +
Sbjct: 1078 TMEEIIGDLLGVCAFVYVDDLLIASKDMEQHLQDVKEALTRIRKSGMKLRASKCHIAKKE 1137
Query: 1332 GKLLGFIVSQKGIEVDPDKVRAIREMPAPQTEKQVRGFLGRLNYISRFISHMTATCGPIF 1391
+ LG V+ G+E K +++ P K+++ FLG + Y +FI + +
Sbjct: 1138 VEYLGHKVTLDGVETQEVKTDKMKQFSRPTNVKELQSFLGLVGYYRKFILNFAQIASSLT 1197
Query: 1392 KLLRKNQPIVWNDECQEAFDSIKNYLLEPPILV-PPVEG-----RPLIMYLAVFDESMGC 1445
L+ +W E + AF +K + + P+L P VE RP ++Y + +G
Sbjct: 1198 SLISAKVAWIWEKEQEIAFQELKKLVCQTPVLAQPDVEAALKGDRPFMIYTDASRKGIGA 1257
Query: 1446 VLGQQDETGKKEHAIYYLSKKFTDCETRYTMLEKTCCALAWATKRLRHYLVNHTTWLISR 1505
VL Q+ G ++H I + SK + ETRY + + A+ +A +R + + + +
Sbjct: 1258 VLAQEGPDG-QQHPIAFASKALSPAETRYHITDLEALAMMFALRRFKTIIYGTAITVFTD 1316
Query: 1506 MDPIKYIFEKAAVTGKIARWQMLLSEYDIVFKTQKAIKGSILADHLA 1552
P+ + + + + ++ RW + + E+D+ A K + +AD L+
Sbjct: 1317 HKPLISLLKGSPLADRLWRWSIEILEFDVKI-VYLAGKANAVADALS 1362
Score = 107 bits (268), Expect = 3e-22
Identities = 91/351 (25%), Positives = 150/351 (41%), Gaps = 14/351 (3%)
Query: 1798 GNILYKRNYDMVLLRCVDEHEAEQLMHDVHDGTFGTHATGHTMSRKLLRAGYYWMAMEHD 1857
G +L + V E L+ ++H+G H M R + R +YW M
Sbjct: 1444 GGVLKNTEIEEQSRSVVPEKIRTPLLKELHEGMLAGHFGIKKMWRMVHRK-FYWPQMRVC 1502
Query: 1858 CYQHARKCHKCQIYADKIHVPPHALNVMSSPWPFSMWGIDMIGRIEPKASNGHRFILVAI 1917
R C KC D + +L +P + D++ + G+R+IL I
Sbjct: 1503 VENCVRTCAKCLCANDHSKLTS-SLTPYRMTFPLEIVACDLMD--VGLSVQGNRYILTII 1559
Query: 1918 DYFTKWVEAASYTNVTKQVVAK-FIKNNIICRYGVPSKIITDNGTNLNNNVVQALCEEFK 1976
D FTK+ A + + V K F++ I +P K++TD G N + K
Sbjct: 1560 DLFTKYGTAVPIPDKKAETVLKAFVERWAIGEGRIPLKLLTDQGKEFVNGLFAQFTHMLK 1619
Query: 1977 IEHHNSSPYRPQMNGAVEAANKNIKRIVQKMVTTYKDWHEMLPYALHGYRTTVRSSTGAT 2036
IEH + Y + NGAVE NK I I++K +W + + YA++ Y V +TG T
Sbjct: 1620 IEHITTKGYNSRANGAVERFNKTIMHIMKKKTAVPMEWDDQVVYAVYAYNNCVHENTGET 1679
Query: 2037 PFSLVYGMEAVLPLEVEIPSLRVIMEAKLSEAEWCQSRYDQLNLIEEKRMDAMARGQSYQ 2096
P L++G + + PLE+ I A + E + ++ + L + + + AM +SY+
Sbjct: 1680 PMFLMHGRDVMGPLEMSGEDAVGINYADMDEYKHLLTQ-ELLKVQKIAKEHAMREQESYK 1738
Query: 2097 ARMKTAFDKKVH----PREFKVGELVLKRRISQQPDPRGKWTPNYEGPYVV 2143
+ + K H P + E+ ++ +Q P KW+ GPY V
Sbjct: 1739 SLFDQKYASKKHRFPQPGSRVLLEIPSEKLGAQCPKLVNKWS----GPYRV 1785
>POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from
transposon opus [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1003
Score = 199 bits (506), Expect = 7e-50
Identities = 140/477 (29%), Positives = 229/477 (47%), Gaps = 34/477 (7%)
Query: 1101 LREYPDIFAWSYEDMPGLDPMIVEHRIPTKPECPPVRQKLRRTHPDMALKIKNEVQKQID 1160
L E+P IF P L M VE + + +++P + ++ EV++QID
Sbjct: 92 LGEFPRIFE------PPLSGMSVETAVKAEIRTNTQDPIYAKSYP-YPVNMRGEVERQID 144
Query: 1161 A----GFLMTVEYPE----WVANIVPVPKKDGKVRMCVDFRDLNKASPKDNFPLPHIDVL 1212
G + P W+ P P + + RM VDF+ LN + D +P+P I+
Sbjct: 145 ELLQDGIIRPSNSPYNSPIWIVPKKPKPNGEKQYRMVVDFKRLNTVTIPDTYPIPDINAT 204
Query: 1213 VDNTAQSKVFSFMDGFSGYNQIKMSPEDREKTSFITPWGTFCYKVMPFGLINAGATYQRG 1272
+ + +K F+ +D SG++QI M D KT+F T G + + +PFGL NA A +QR
Sbjct: 205 LASLGNAKYFTTLDLTSGFHQIHMKESDIPKTAFSTLNGKYEFLRLPFGLKNAPAIFQRM 264
Query: 1273 MTTLFHDMIHKEVEVYVDDMIVKSADEEQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSG 1332
+ + + I K VY+DD+IV S D + H + L + L K L++N K F
Sbjct: 265 IDDILREHIGKVCYVYIDDIIVFSEDYDTHWKNLRLVLASLSKANLQVNLEKSHFLDTQV 324
Query: 1333 KLLGFIVSQKGIEVDPDKVRAIREMPAPQTEKQVRGFLGRLNYISRFISHMTATCGPIFK 1392
+ LG+IV+ GI+ DP KVRAI EMP P + K+++ FLG +Y +FI P+
Sbjct: 325 EFLGYIVTADGIKADPKKVRAISEMPPPTSVKELKRFLGMTSYYRKFIQDYAKVAKPLTN 384
Query: 1393 LLR-----------KNQPIVWNDECQEAFDSIKNYLLEPPILVPPVEGRPLIMYLAVFDE 1441
L R PI ++ ++F+ +K+ L IL P +P + +
Sbjct: 385 LTRGLYANIKSSQSSKVPITLDETALQSFNDLKSILCSSEILAFPCFTKPFHLTTDASNW 444
Query: 1442 SMGCVLGQQDETGKKEHAIYYLSKKFTDCETRYTMLEKTCCALAWATKRLRHYLVN-HTT 1500
++G VL Q D+ ++ I Y+S+ E Y +EK A+ W+ LR YL T
Sbjct: 445 AIGAVLSQDDQ--GRDRPIAYISRSLNKTEENYATIEKEMLAIIWSLDNLRAYLYGAGTI 502
Query: 1501 WLISRMDPIKYIFEKAAVTGKIARWQMLLSEY--DIVFKTQKAIKGSILADHLAYQP 1555
+ + P+ + K+ RW+ + EY ++++K K+ +++AD L+ P
Sbjct: 503 KVYTDHQPLTFALGNRNFNAKLKRWKARIEEYNCELIYKPGKS---NVVADALSRIP 556
Score = 37.7 bits (86), Expect = 0.35
Identities = 46/212 (21%), Positives = 84/212 (38%), Gaps = 17/212 (8%)
Query: 1837 GHTMSRKLLRAGYYWMAMEHDCYQHARKCHKCQIYADKIHVPPHALNVMSSP---WPFSM 1893
G T R L YY+ M C C++Y + H P+ N+ +P +P +
Sbjct: 707 GPTEIRLQLLEKYYFPRMSSTIRLQTSSCQCCKLYKYERH--PNKPNLQPTPIPNYPCEI 764
Query: 1894 WGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYTNVTKQVVAKFIKNNIICRYGVPS 1953
ID+ + R L ID F+K+ + + V + + + P
Sbjct: 765 LHIDIFALEK-------RLYLSCIDKFSKFAK-LFHLQSKASVHLRETLVEALHYFTAPK 816
Query: 1954 KIITDNGTNLNNNVVQALCEEFKIEHHNSSPYRPQMNGAVEAANK---NIKRIVQKMVTT 2010
+++DN L V I+ + + + ++NG VE + I R ++ + T
Sbjct: 817 VLVSDNERGLLCPTVLNYLRSLDIDLYYAPTQKSEVNGQVERFHSTFLEIYRCLKDELPT 876
Query: 2011 YKDWHEMLPYALHGYRTTVRSSTGATPFSLVY 2042
+K E++ A+ Y T+V S T P + +
Sbjct: 877 FKP-VELVHIAVDRYNTSVHSVTNRKPADVFF 907
>RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protein
type 1
Length = 1333
Score = 190 bits (482), Expect = 4e-47
Identities = 116/419 (27%), Positives = 212/419 (49%), Gaps = 21/419 (5%)
Query: 1179 VPKKDGKVRMCVDFRDLNKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSP 1238
VPKK+G +RM VD++ LNK + +PLP I+ L+ S +F+ +D S Y+ I++
Sbjct: 455 VPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRK 514
Query: 1239 EDREKTSFITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSAD 1298
D K +F P G F Y VMP+G+ A A +Q + T+ + V Y+DD+++ S
Sbjct: 515 GDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINTILGEAKESHVVCYMDDILIHSKS 574
Query: 1299 EEQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMP 1358
E +HV+++ + ++L+ L +N KC F K +G+ +S+KG + + + +
Sbjct: 575 ESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWK 634
Query: 1359 APQTEKQVRGFLGRLNYISRFISHMTATCGPIFKLLRKNQPIVWNDECQEAFDSIKNYLL 1418
P+ K++R FLG +NY+ +FI + P+ LL+K+ W +A ++IK L+
Sbjct: 635 QPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLV 694
Query: 1419 EPPILVPPVEGRPLIMYLAVFDESMGCVLGQQDETGKKEHAIYYLSKKFTDCETRYTMLE 1478
PP+L + +++ D ++G VL Q+ + K + + Y S K + + Y++ +
Sbjct: 695 SPPVLRHFDFSKKILLETDASDVAVGAVLSQKHD-DDKYYPVGYYSAKMSKAQLNYSVSD 753
Query: 1479 KTCCALAWATKRLRHYLVNHTTWLISRMDPIKYIFEKAAVTGKI-----------ARWQM 1527
K A+ + K RHYL S ++P K + + + G+I ARWQ+
Sbjct: 754 KEMLAIIKSLKHWRHYLE-------STIEPFKILTDHRNLIGRITNESEPENKRLARWQL 806
Query: 1528 LLSEYDIVFKTQKAIKGSILADHLAYQPLDDYQPIEFDFPDEEIMYLKSKDCEEPLINE 1586
L +++ + + +AD L+ + +D+ +PI D D I ++ + N+
Sbjct: 807 FLQDFNFEINYRPG-SANHIADALS-RIVDETEPIPKDSEDNSINFVNQISITDDFKNQ 863
Score = 97.4 bits (241), Expect = 4e-19
Identities = 112/505 (22%), Positives = 208/505 (41%), Gaps = 53/505 (10%)
Query: 1649 IDMRIKHLDIYGDSALIINQIKGEWETHHANLIPYRDYARRLLTYFTKVELHHIPRDENQ 1708
++ I+ I D +I +I E E + L ++ + + E+++ P N
Sbjct: 770 LESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDF-----NFEINYRPGSANH 824
Query: 1709 MADALATLSSMFRVNHWNDVPLIKVQRLERPSHVFAIGDVIDQAGENVVDYRPWYYDIKQ 1768
+ADAL+ + P+ K + V I D + V +Y D K
Sbjct: 825 IADALSRIVD-------ETEPIPKDSEDNSINFVNQISITDDFKNQVVTEYTN---DTKL 874
Query: 1769 FLLSREYPPGASKQDKKTLRRLAGRFLLDGNILYKRNYDMVLLRCVDEHEAEQLMHDVHD 1828
L + +DK+ + L DG ++ + D +LL D ++ H+
Sbjct: 875 LNL-------LNNEDKRVEENIQ---LKDGLLINSK--DQILLPN-DTQLTRTIIKKYHE 921
Query: 1829 GTFGTHATGHTMSRKLLRAGYYWMAMEHDCYQHARKCHKCQIYADKIHVPPHALN-VMSS 1887
H ++ +LR + W + ++ + CH CQI + H P L + S
Sbjct: 922 EGKLIHPGIELLTNIILRR-FTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPS 980
Query: 1888 PWPFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYT-NVTKQVVAKFIKNNII 1946
P+ +D I + S+G+ + V +D F+K T ++T + A+ +I
Sbjct: 981 ERPWESLSMDFITALPE--SSGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVI 1038
Query: 1947 CRYGVPSKIITDNGTNLNNNVVQALCEEFKIEHHNSSPYRPQMNGAVEAANKNIKRIVQK 2006
+G P +II DN + + ++ S PYRPQ +G E N+ ++++++
Sbjct: 1039 AYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRC 1098
Query: 2007 MVTTYKD-WHEMLPYALHGYRTTVRSSTGATPFSLVYGMEAVLPLEVEIPSLRVIMEAKL 2065
+ +T+ + W + + Y + S+T TPF +V+ L +E+PS +
Sbjct: 1099 VCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALS-PLELPSFSDKTDENS 1157
Query: 2066 SEA-EWCQSRYDQLNLIEEKRMDAMARGQSYQARMKTAFDKKVHP-REFKVGELVL-KRR 2122
E + Q+ + LN + +MK FD K+ EF+ G+LV+ KR
Sbjct: 1158 QETIQVFQTVKEHLN--------------TNNIKMKKYFDMKIQEIEEFQPGDLVMVKRT 1203
Query: 2123 ISQQPDPRGKWTPNYEGP-YVVKKA 2146
+ K P++ GP YV++K+
Sbjct: 1204 KTGFLHKSNKLAPSFAGPFYVLQKS 1228
>RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protein
type 3
Length = 1333
Score = 187 bits (476), Expect = 2e-46
Identities = 115/419 (27%), Positives = 213/419 (50%), Gaps = 21/419 (5%)
Query: 1179 VPKKDGKVRMCVDFRDLNKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSP 1238
VPKK+G +RM VD++ LNK + +PLP I+ L+ S +F+ +D S Y+ I++
Sbjct: 455 VPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRK 514
Query: 1239 EDREKTSFITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSAD 1298
D K +F P G F Y VMP+G+ A A +Q + T+ ++ V Y+D++++ S
Sbjct: 515 GDEHKLAFRCPRGVFEYLVMPYGISIAPAHFQYFINTILGEVKESHVVCYMDNILIHSKS 574
Query: 1299 EEQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMP 1358
E +HV+++ + ++L+ L +N KC F K +G+ +S+KG + + + +
Sbjct: 575 ESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWK 634
Query: 1359 APQTEKQVRGFLGRLNYISRFISHMTATCGPIFKLLRKNQPIVWNDECQEAFDSIKNYLL 1418
P+ K++R FLG +NY+ +FI + P+ LL+K+ W +A ++IK L+
Sbjct: 635 QPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLV 694
Query: 1419 EPPILVPPVEGRPLIMYLAVFDESMGCVLGQQDETGKKEHAIYYLSKKFTDCETRYTMLE 1478
PP+L + +++ D ++G VL Q+ + K + + Y S K + + Y++ +
Sbjct: 695 SPPVLRHFDFSKKILLETDASDVAVGAVLSQKHD-DDKYYPVGYYSAKMSKAQLNYSVSD 753
Query: 1479 KTCCALAWATKRLRHYLVNHTTWLISRMDPIKYIFEKAAVTGKI-----------ARWQM 1527
K A+ + K RHYL S ++P K + + + G+I ARWQ+
Sbjct: 754 KEMLAIIKSLKHWRHYLE-------STIEPFKILTDHRNLIGRITNESEPENKRLARWQL 806
Query: 1528 LLSEYDIVFKTQKAIKGSILADHLAYQPLDDYQPIEFDFPDEEIMYLKSKDCEEPLINE 1586
L +++ + + +AD L+ + +D+ +PI D D I ++ + N+
Sbjct: 807 FLQDFNFEINYRPG-SANHIADALS-RIVDETEPIPKDSEDNSINFVNQISITDDFKNQ 863
Score = 97.4 bits (241), Expect = 4e-19
Identities = 112/505 (22%), Positives = 208/505 (41%), Gaps = 53/505 (10%)
Query: 1649 IDMRIKHLDIYGDSALIINQIKGEWETHHANLIPYRDYARRLLTYFTKVELHHIPRDENQ 1708
++ I+ I D +I +I E E + L ++ + + E+++ P N
Sbjct: 770 LESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDF-----NFEINYRPGSANH 824
Query: 1709 MADALATLSSMFRVNHWNDVPLIKVQRLERPSHVFAIGDVIDQAGENVVDYRPWYYDIKQ 1768
+ADAL+ + P+ K + V I D + V +Y D K
Sbjct: 825 IADALSRIVD-------ETEPIPKDSEDNSINFVNQISITDDFKNQVVTEYTN---DTKL 874
Query: 1769 FLLSREYPPGASKQDKKTLRRLAGRFLLDGNILYKRNYDMVLLRCVDEHEAEQLMHDVHD 1828
L + +DK+ + L DG ++ + D +LL D ++ H+
Sbjct: 875 LNL-------LNNEDKRVEENIQ---LKDGLLINSK--DQILLPN-DTQLTRTIIKKYHE 921
Query: 1829 GTFGTHATGHTMSRKLLRAGYYWMAMEHDCYQHARKCHKCQIYADKIHVPPHALN-VMSS 1887
H ++ +LR + W + ++ + CH CQI + H P L + S
Sbjct: 922 EGKLIHPGIELLTNIILRR-FTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPS 980
Query: 1888 PWPFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYT-NVTKQVVAKFIKNNII 1946
P+ +D I + S+G+ + V +D F+K T ++T + A+ +I
Sbjct: 981 ERPWESLSMDFITALPE--SSGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVI 1038
Query: 1947 CRYGVPSKIITDNGTNLNNNVVQALCEEFKIEHHNSSPYRPQMNGAVEAANKNIKRIVQK 2006
+G P +II DN + + ++ S PYRPQ +G E N+ ++++++
Sbjct: 1039 AYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRC 1098
Query: 2007 MVTTYKD-WHEMLPYALHGYRTTVRSSTGATPFSLVYGMEAVLPLEVEIPSLRVIMEAKL 2065
+ +T+ + W + + Y + S+T TPF +V+ L +E+PS +
Sbjct: 1099 VCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALS-PLELPSFSDKTDENS 1157
Query: 2066 SEA-EWCQSRYDQLNLIEEKRMDAMARGQSYQARMKTAFDKKVHP-REFKVGELVL-KRR 2122
E + Q+ + LN + +MK FD K+ EF+ G+LV+ KR
Sbjct: 1158 QETIQVFQTVKEHLN--------------TNNIKMKKYFDMKIQEIEEFQPGDLVMVKRT 1203
Query: 2123 ISQQPDPRGKWTPNYEGP-YVVKKA 2146
+ K P++ GP YV++K+
Sbjct: 1204 KTGFLHKSNKLAPSFAGPFYVLQKS 1228
>RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protein
type 2
Length = 1333
Score = 187 bits (476), Expect = 2e-46
Identities = 115/419 (27%), Positives = 213/419 (50%), Gaps = 21/419 (5%)
Query: 1179 VPKKDGKVRMCVDFRDLNKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSP 1238
VPKK+G +RM VD++ LNK + +PLP I+ L+ S +F+ +D S Y+ I++
Sbjct: 455 VPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRK 514
Query: 1239 EDREKTSFITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSAD 1298
D K +F P G F Y VMP+G+ A A +Q + T+ ++ V Y+D++++ S
Sbjct: 515 GDEHKLAFRCPRGVFEYLVMPYGISIAPAHFQYFINTILGEVKESHVVCYMDNILIHSKS 574
Query: 1299 EEQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMP 1358
E +HV+++ + ++L+ L +N KC F K +G+ +S+KG + + + +
Sbjct: 575 ESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWK 634
Query: 1359 APQTEKQVRGFLGRLNYISRFISHMTATCGPIFKLLRKNQPIVWNDECQEAFDSIKNYLL 1418
P+ K++R FLG +NY+ +FI + P+ LL+K+ W +A ++IK L+
Sbjct: 635 QPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLV 694
Query: 1419 EPPILVPPVEGRPLIMYLAVFDESMGCVLGQQDETGKKEHAIYYLSKKFTDCETRYTMLE 1478
PP+L + +++ D ++G VL Q+ + K + + Y S K + + Y++ +
Sbjct: 695 SPPVLRHFDFSKKILLETDASDVAVGAVLSQKHD-DDKYYPVGYYSAKMSKAQLNYSVSD 753
Query: 1479 KTCCALAWATKRLRHYLVNHTTWLISRMDPIKYIFEKAAVTGKI-----------ARWQM 1527
K A+ + K RHYL S ++P K + + + G+I ARWQ+
Sbjct: 754 KEMLAIIKSLKHWRHYLE-------STIEPFKILTDHRNLIGRITNESEPENKRLARWQL 806
Query: 1528 LLSEYDIVFKTQKAIKGSILADHLAYQPLDDYQPIEFDFPDEEIMYLKSKDCEEPLINE 1586
L +++ + + +AD L+ + +D+ +PI D D I ++ + N+
Sbjct: 807 FLQDFNFEINYRPG-SANHIADALS-RIVDETEPIPKDSEDNSINFVNQISITDDFKNQ 863
Score = 97.4 bits (241), Expect = 4e-19
Identities = 112/505 (22%), Positives = 208/505 (41%), Gaps = 53/505 (10%)
Query: 1649 IDMRIKHLDIYGDSALIINQIKGEWETHHANLIPYRDYARRLLTYFTKVELHHIPRDENQ 1708
++ I+ I D +I +I E E + L ++ + + E+++ P N
Sbjct: 770 LESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDF-----NFEINYRPGSANH 824
Query: 1709 MADALATLSSMFRVNHWNDVPLIKVQRLERPSHVFAIGDVIDQAGENVVDYRPWYYDIKQ 1768
+ADAL+ + P+ K + V I D + V +Y D K
Sbjct: 825 IADALSRIVD-------ETEPIPKDSEDNSINFVNQISITDDFKNQVVTEYTN---DTKL 874
Query: 1769 FLLSREYPPGASKQDKKTLRRLAGRFLLDGNILYKRNYDMVLLRCVDEHEAEQLMHDVHD 1828
L + +DK+ + L DG ++ + D +LL D ++ H+
Sbjct: 875 LNL-------LNNEDKRVEENIQ---LKDGLLINSK--DQILLPN-DTQLTRTIIKKYHE 921
Query: 1829 GTFGTHATGHTMSRKLLRAGYYWMAMEHDCYQHARKCHKCQIYADKIHVPPHALN-VMSS 1887
H ++ +LR + W + ++ + CH CQI + H P L + S
Sbjct: 922 EGKLIHPGIELLTNIILRR-FTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPS 980
Query: 1888 PWPFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYT-NVTKQVVAKFIKNNII 1946
P+ +D I + S+G+ + V +D F+K T ++T + A+ +I
Sbjct: 981 ERPWESLSMDFITALPE--SSGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVI 1038
Query: 1947 CRYGVPSKIITDNGTNLNNNVVQALCEEFKIEHHNSSPYRPQMNGAVEAANKNIKRIVQK 2006
+G P +II DN + + ++ S PYRPQ +G E N+ ++++++
Sbjct: 1039 AYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRC 1098
Query: 2007 MVTTYKD-WHEMLPYALHGYRTTVRSSTGATPFSLVYGMEAVLPLEVEIPSLRVIMEAKL 2065
+ +T+ + W + + Y + S+T TPF +V+ L +E+PS +
Sbjct: 1099 VCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALS-PLELPSFSDKTDENS 1157
Query: 2066 SEA-EWCQSRYDQLNLIEEKRMDAMARGQSYQARMKTAFDKKVHP-REFKVGELVL-KRR 2122
E + Q+ + LN + +MK FD K+ EF+ G+LV+ KR
Sbjct: 1158 QETIQVFQTVKEHLN--------------TNNIKMKKYFDMKIQEIEEFQPGDLVMVKRT 1203
Query: 2123 ISQQPDPRGKWTPNYEGP-YVVKKA 2146
+ K P++ GP YV++K+
Sbjct: 1204 KTGFLHKSNKLAPSFAGPFYVLQKS 1228
>POLY_DROME (P10401) Retrovirus-related Pol polyprotein from
transposon gypsy [Contains: Reverse transcriptase (EC
2.7.7.49); Endonuclease]
Length = 1035
Score = 183 bits (465), Expect = 4e-45
Identities = 131/434 (30%), Positives = 213/434 (48%), Gaps = 37/434 (8%)
Query: 1151 IKNEVQKQIDAGFLMT----VEYPEWVANIVPVPKK------DGKVRMCVDFRDLNKASP 1200
+ NEV++ + G + P WV V KK + R+ +DFR LN+ +
Sbjct: 197 VNNEVKQLLKDGIIRPSRSPYNSPTWV-----VDKKGTDAFGNPNKRLVIDFRKLNEKTI 251
Query: 1201 KDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSPEDREKTSFITPWGTFCYKVMPF 1260
D +P+P I +++ N ++K F+ +D SGY+QI ++ DREKTSF G + + +PF
Sbjct: 252 PDRYPMPSIPMILANLGKAKFFTTLDLKSGYHQIYLAEHDREKTSFSVNGGKYEFCRLPF 311
Query: 1261 GLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSADEEQHVEYLTKMFERLRKYKLRL 1320
GL NA + +QR + + + I K VYVDD+I+ S +E HV ++ + + L +R+
Sbjct: 312 GLRNASSIFQRALDDVLREQIGKICYVYVDDVIIFSENESDHVRHIDTVLKCLIDANMRV 371
Query: 1321 NPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMPAPQTEKQVRGFLGRLNYISRFI 1380
+ K F S + LGFIVS+ G + DP+KV+AI+E P P +VR FLG +Y FI
Sbjct: 372 SQEKTRFFKESVEYLGFIVSKDGTKSDPEKVKAIQEYPEPDCVYKVRSFLGLASYYRVFI 431
Query: 1381 SHMTATCGPIFKLLR-----------KNQPIVWNDECQEAFDSIKNYLL-EPPILVPPVE 1428
A PI +L+ K P+ +N+ + AF ++N L E IL P
Sbjct: 432 KDFAAIARPITDILKGENGSVSKHMSKKIPVEFNETQRNAFQRLRNILASEDVILKYPDF 491
Query: 1429 GRPLIMYLAVFDESMGCVLGQQDETGKKEHAIYYLSKKFTDCETRYTMLEKTCCALAWAT 1488
+P + +G VL Q+ I +S+ E Y E+ A+ WA
Sbjct: 492 KKPFDLTTDASASGIGAVLSQEG------RPITMISRTLKQPEQNYATNERELLAIVWAL 545
Query: 1489 KRLRHYLV-NHTTWLISRMDPIKYIFEKAAVTGKIARWQMLLSEYDI-VFKTQKAIKGSI 1546
+L+++L + + + P+ + KI RW+ + +++ VF K K +
Sbjct: 546 GKLQNFLYGSREINIFTDHQPLTFAVADRNTNAKIKRWKSYIDQHNAKVF--YKPGKENF 603
Query: 1547 LADHLAYQPLDDYQ 1560
+AD L+ Q L+ Q
Sbjct: 604 VADALSRQNLNALQ 617
>POL_FOAMV (P14350) Pol polyprotein [Contains: Reverse
transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
(RT); Integrase (IN)]
Length = 886
Score = 162 bits (409), Expect = 1e-38
Identities = 201/888 (22%), Positives = 357/888 (39%), Gaps = 75/888 (8%)
Query: 1176 IVPVPKKDGKVRMCVDFRDLNKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIK 1235
+ PVPK DG+ RM +D+R++NK P H ++ + K + +D +G+
Sbjct: 5 VYPVPKPDGRWRMVLDYREVNKTIPLTAAQNQHSAGILATIVRQKYKTTLDLANGFWAHP 64
Query: 1236 MSPEDREKTSFITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVK 1295
++PE T+F +C+ +P G +N+ A + + L ++ V+VYVDD+ +
Sbjct: 65 ITPESYWLTAFTWQGKQYCWTRLPQGFLNSPALFTADVVDLLKEI--PNVQVYVDDIYLS 122
Query: 1296 SADEEQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIR 1355
D ++HV+ L K+F+ L + ++ K G ++ + LGF ++++G + +
Sbjct: 123 HDDPKEHVQQLEKVFQILLQAGYVVSLKKSEIGQKTVEFLGFNITKEGRGLTDTFKTKLL 182
Query: 1356 EMPAPQTEKQVRGFLGRLNYISRFISHMTATCGPIFKLL--RKNQPIVWNDECQEAFDSI 1413
+ P+ KQ++ LG LN+ FI + P++ L+ K + I W++E + + +
Sbjct: 183 NITPPKDLKQLQSILGLLNFARNFIPNFAELVQPLYNLIASAKGKYIEWSEENTKQLNMV 242
Query: 1414 KNYLLEPPILVPPVEGRPLIMYLAVFDESMGCVLGQQDETGKKEHAIYYLSKKFTDCETR 1473
L L + + L++ + S G V +ETGKK I YL+ F+ E +
Sbjct: 243 IEALNTASNLEERLPEQRLVIKVNT-SPSAGYV-RYYNETGKK--PIMYLNYVFSKAELK 298
Query: 1474 YTMLEKTCCALAWATKRLRHYLVNHTTWLISRMDPIKYIF-----EKAAVTGKIARWQML 1528
++MLEK + A + + + S + + I E+ A+ + W
Sbjct: 299 FSMLEKLLTTMHKALIKAMDLAMGQEILVYSPIVSMTKIQKTPLPERKALPIRWITWMTY 358
Query: 1529 LSEYDIVFKTQKAIKGSILADHLAYQPLDDYQPIEFDFPDEEIMYLKSKDCEEPLINEGP 1588
L + I F K + H+ P++ E + Y + P
Sbjct: 359 LEDPRIQFHYDKTLPE---LKHIPDVYTSSQSPVKHPSQYEGVFYTDGSAI------KSP 409
Query: 1589 DPDSKWGLVFDGAVNAYGKGIGAVIVSPQGHHIPFTTRILFECTNNMAEYEACIFGIEEA 1648
DP N G GI P+ + + L T MAE A F ++A
Sbjct: 410 DPTKS---------NNAGMGIVHATYKPEYQVLNQWSIPLGNHTAQMAEIAAVEFACKKA 460
Query: 1649 IDMRIKHLDIYGDSALIINQIKGEWETHHANLIPYRDYARRLLTYFTKVELHHIPRDENQ 1708
+ + L + DS + E +PY + K L HI + ++
Sbjct: 461 LKIPGPVL-VITDSFYVAESANKE--------LPY--WKSNGFVNNKKKPLKHISKWKS- 508
Query: 1709 MADALATLSSMFRVNHWNDVPL-IKVQRLERPSHVFAIGDVIDQAGENVVD---YRPWYY 1764
+A+ L ++ + H + L I V L+ A+ D + G VV+ +P
Sbjct: 509 IAECL-SMKPDITIQHEKGISLQIPVFILKGN----ALADKLATQGSYVVNCNTKKPNLD 563
Query: 1765 DIKQFLLSREYPPGASKQDKKTLRRLAGRFLLDGNILYKRNYDMVLLRCVDEHEAEQLMH 1824
LL Y G KQ FL DG + R + ++ + + ++++
Sbjct: 564 AELDQLLQGHYIKGYPKQ--------YTYFLEDGKVKVSRPEGVKII--PPQSDRQKIVL 613
Query: 1825 DVHDGTFGTHATGHTMSRKLLRAGYYWMAMEHDCYQHARKCHKCQIYADKIHVPPHALNV 1884
H+ TG + + Y+W M D + +C +C I L
Sbjct: 614 QAHN----LAHTGREATLLKIANLYWWPNMRKDVVKQLGRCQQCLITNASNKASGPILRP 669
Query: 1885 MSSPWPFSMWGIDMIGRIEPKASNGHRFILVAIDYFT--KWVEAASYTNVTKQVVAKFIK 1942
PF + ID IG + P S G+ ++LV +D T W+ Y A
Sbjct: 670 DRPQKPFDKFFIDYIGPLPP--SQGYLYVLVVVDGMTGFTWL----YPTKAPSTSATVKS 723
Query: 1943 NNIICRYGVPSKIITDNGTNLNNNVVQALCEEFKIEHHNSSPYRPQMNGAVEAANKNIKR 2002
N++ +P I +D G ++ +E I S+PY PQ VE N +IKR
Sbjct: 724 LNVLTSIAIPKVIHSDQGAAFTSSTFAEWAKERGIHLEFSTPYHPQSGSKVERKNSDIKR 783
Query: 2003 IVQK-MVTTYKDWHEMLPYALHGYRTTVRSSTGATPFSLVYGMEAVLP 2049
++ K +V W+++LP T TP L++G+++ P
Sbjct: 784 LLTKLLVGRPTKWYDLLPVVQLALNNTYSPVLKYTPHQLLFGIDSNTP 831
>POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 126 bits (316), Expect = 8e-28
Identities = 111/422 (26%), Positives = 185/422 (43%), Gaps = 28/422 (6%)
Query: 1136 VRQKLRRTHPDMALKIK---------NEVQKQIDAGFLMTVEYPEWVANIVPV------- 1179
++ ++ + P A+K+K E KQI + V P ++ P
Sbjct: 234 MKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEA 293
Query: 1180 PKKDGKVRMCVDFRDLNKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSPE 1239
K+ GK RM V+++ +NKA+ D + LP+ D L+ K+FS D SG+ Q+ + E
Sbjct: 294 EKRRGKKRMVVNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQE 353
Query: 1240 DREKTSFITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSADE 1299
R T+F P G + + V+PFGL A + +QR M F + K VYVDD++V S +E
Sbjct: 354 SRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFR-VFRKFCCVYVDDILVFSNNE 412
Query: 1300 EQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMP- 1358
E H+ ++ + ++ ++ + L+ K + LG + + + + I + P
Sbjct: 413 EDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPD 472
Query: 1359 APQTEKQVRGFLGRLNYISRFISHMTATCGPIFKLLRKNQPIVWNDECQEAFDSIKNYLL 1418
+ +KQ++ FLG L Y S +I + P+ L++N P W E +K L
Sbjct: 473 TLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWRWTKEDTLYMQKVKKNLQ 532
Query: 1419 EPPILVPPVEGRPLIMYLAVFDESMGCVLG--QQDETGKKEHAIYYLSKKFTDCETRYTM 1476
P L P+ LI+ D+ G +L + +E E Y S F E Y
Sbjct: 533 GFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAEKNYHS 592
Query: 1477 LEKTCCALAWATKRLRHYLVNHTTWLISRMDP------IKYIFEKAAVTGKIARWQMLLS 1530
+K A+ K+ YL + R D + ++ + G+ RWQ LS
Sbjct: 593 NDKETLAVINTIKKFSIYLT--PVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLS 650
Query: 1531 EY 1532
Y
Sbjct: 651 HY 652
>POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 126 bits (316), Expect = 8e-28
Identities = 111/422 (26%), Positives = 185/422 (43%), Gaps = 28/422 (6%)
Query: 1136 VRQKLRRTHPDMALKIK---------NEVQKQIDAGFLMTVEYPEWVANIVPV------- 1179
++ ++ + P A+K+K E KQI + V P ++ P
Sbjct: 234 MKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEA 293
Query: 1180 PKKDGKVRMCVDFRDLNKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSPE 1239
K+ GK RM V+++ +NKA+ D + LP+ D L+ K+FS D SG+ Q+ + E
Sbjct: 294 EKRRGKKRMVVNYKAMNKATIGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQE 353
Query: 1240 DREKTSFITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSADE 1299
R T+F P G + + V+PFGL A + +QR M F + K VYVDD++V S +E
Sbjct: 354 SRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFR-VFRKFCCVYVDDILVFSNNE 412
Query: 1300 EQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMP- 1358
E H+ ++ + ++ ++ + L+ K + LG + + + + I + P
Sbjct: 413 EDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPD 472
Query: 1359 APQTEKQVRGFLGRLNYISRFISHMTATCGPIFKLLRKNQPIVWNDECQEAFDSIKNYLL 1418
+ +KQ++ FLG L Y S +I + P+ L++N P W E +K L
Sbjct: 473 TLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKNLQ 532
Query: 1419 EPPILVPPVEGRPLIMYLAVFDESMGCVLG--QQDETGKKEHAIYYLSKKFTDCETRYTM 1476
P L P+ LI+ D+ G +L + +E E Y S F E Y
Sbjct: 533 GFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAERNYHS 592
Query: 1477 LEKTCCALAWATKRLRHYLVNHTTWLISRMDP------IKYIFEKAAVTGKIARWQMLLS 1530
+K A+ K+ YL + R D + ++ + G+ RWQ LS
Sbjct: 593 NDKETLAVINTIKKFSIYLT--PVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLS 650
Query: 1531 EY 1532
Y
Sbjct: 651 HY 652
>POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 126 bits (316), Expect = 8e-28
Identities = 111/422 (26%), Positives = 185/422 (43%), Gaps = 28/422 (6%)
Query: 1136 VRQKLRRTHPDMALKIK---------NEVQKQIDAGFLMTVEYPEWVANIVPV------- 1179
++ ++ + P A+K+K E KQI + V P ++ P
Sbjct: 234 MKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEA 293
Query: 1180 PKKDGKVRMCVDFRDLNKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSPE 1239
K+ GK RM V+++ +NKA+ D + LP+ D L+ K+FS D SG+ Q+ + E
Sbjct: 294 EKRRGKKRMVVNYKAMNKATIGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQE 353
Query: 1240 DREKTSFITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSADE 1299
R T+F P G + + V+PFGL A + +QR M F + K VYVDD++V S +E
Sbjct: 354 SRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFR-VFRKFCCVYVDDILVFSNNE 412
Query: 1300 EQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMP- 1358
E H+ ++ + ++ ++ + L+ K + LG + + + + I + P
Sbjct: 413 EDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPD 472
Query: 1359 APQTEKQVRGFLGRLNYISRFISHMTATCGPIFKLLRKNQPIVWNDECQEAFDSIKNYLL 1418
+ +KQ++ FLG L Y S +I + P+ L++N P W E +K L
Sbjct: 473 TLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKNLQ 532
Query: 1419 EPPILVPPVEGRPLIMYLAVFDESMGCVLG--QQDETGKKEHAIYYLSKKFTDCETRYTM 1476
P L P+ LI+ D+ G +L + +E E Y S F E Y
Sbjct: 533 GFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAERNYHS 592
Query: 1477 LEKTCCALAWATKRLRHYLVNHTTWLISRMDP------IKYIFEKAAVTGKIARWQMLLS 1530
+K A+ K+ YL + R D + ++ + G+ RWQ LS
Sbjct: 593 NDKETLAVINTIKKFSIYLT--PVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLS 650
Query: 1531 EY 1532
Y
Sbjct: 651 HY 652
>YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II
Length = 1268
Score = 124 bits (310), Expect = 4e-27
Identities = 80/255 (31%), Positives = 132/255 (51%), Gaps = 7/255 (2%)
Query: 1129 TKPECPPVRQKLRRTHPDMALKIKNEVQKQIDAGFLMTVEYPEWVANIVPVPKKD-GKVR 1187
T+ PV ++ R ++ E+ + + G ++ + Y +W A IV + KK GK+R
Sbjct: 434 TEENAVPVFKRARPVPYGSLEAVETELNRLQEMGVIVPITYAKWAAPIVVIKKKGTGKIR 493
Query: 1188 MCVDFR--DLNKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSPEDREKTS 1245
+C DF+ LN A + PLP + + + V+S +D Y Q+++ E ++
Sbjct: 494 VCADFKCSGLNAALKDEFHPLPTSEDIFSRL-KGTVYSQIDLKDAYLQVELDEEAQKLAV 552
Query: 1246 FITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSADEEQHVEY 1305
T G F Y M FGL A A++Q+ M + + V VY DD+I+ ++ E+H +
Sbjct: 553 INTHRGIFKYLRMTFGLKPAPASFQKIMDKMVSGLTG--VAVYWDDIIISASSIEEHEKI 610
Query: 1306 LTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMPAPQTEKQ 1365
L ++FER ++Y R++ KC F + LGF V + G D K AIR M AP +KQ
Sbjct: 611 LRELFERFKEYGFRVSAEKCAFAQKQVTFLGF-VDEHGRRPDSKKTEAIRSMKAPTDQKQ 669
Query: 1366 VRGFLGRLNYISRFI 1380
+ FLG +++SR +
Sbjct: 670 LASFLGAADWLSRMM 684
Score = 90.1 bits (222), Expect = 6e-17
Identities = 80/320 (25%), Positives = 137/320 (42%), Gaps = 44/320 (13%)
Query: 1787 LRRLAGRFLLDGNILYKRNYDMVLLRCVDEHEAEQLMHDVHDGTFGTHATGHTMSRKLLR 1846
L+ + G LLD ++ ++ ++L+ +H+ H G ++ R
Sbjct: 763 LKLIHGCLLLDDRVIVPKSLQKIVLK---------QLHEGHPGI--------VQMKQKAR 805
Query: 1847 AGYYWMAMEHDCYQHARKCHKCQIYADKIHVPPHALNVMSSPWPF--SMWG---IDMIGR 1901
+ +W ++ D R C+ CQ + V P +PWP + W ID G
Sbjct: 806 SFVFWRGLDSDIENMVRHCNNCQENSKMPRVVP------LNPWPVPEAPWKRIHIDFAGP 859
Query: 1902 IEPKASNGHRFILVAIDYFTKWVEAASYTNVTKQVVAKFIKNNIICRYGVPSKIITDNGT 1961
+ NG ++LV +D TK+ E T V + I +G P II+DNGT
Sbjct: 860 L-----NGC-YLLVVVDAKTKYAEV-KLTRSISAVTTIDLLEEIFSIHGYPETIISDNGT 912
Query: 1962 NLNNNVVQALCEEFKIEHHNSSPYRPQMNGAVEAANKNIKRIVQKMVTTYKDWHEMLPYA 2021
L +++ +C+ IEH S+ Y P+ NGA E +KR + K+ ++L
Sbjct: 913 QLTSHLFAQMCQSHGIEHKTSAVYYPRSNGAAERFVDTLKRGIAKIKGEGSVNQQILNKF 972
Query: 2022 LHGYRTTVRSS-TGATPFSLVYGMEAVLPLEVEIPSLRVIMEAKLSEAEWCQSRYDQLNL 2080
L YR T S+ G+TP +G + + + +P+ RV+ KL++ Q N+
Sbjct: 973 LISYRNTPHSALNGSTPAECHFGRKIRTTMSLLMPTDRVLKVPKLTQY--------QQNM 1024
Query: 2081 IEEKRMDAMARGQSYQARMK 2100
+ AR +++Q K
Sbjct: 1025 KHHYELRNGARAKAFQVNQK 1044
>POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 674
Score = 124 bits (310), Expect = 4e-27
Identities = 110/422 (26%), Positives = 184/422 (43%), Gaps = 28/422 (6%)
Query: 1136 VRQKLRRTHPDMALKIK---------NEVQKQIDAGFLMTVEYPEWVANIVPV------- 1179
++ ++ + P A+K+K E KQI + V P ++ P
Sbjct: 229 MKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEA 288
Query: 1180 PKKDGKVRMCVDFRDLNKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSPE 1239
K+ GK RM V+++ +NKA+ D + P+ D L+ K+FS D SG+ Q+ + E
Sbjct: 289 EKRRGKKRMVVNYKAMNKATVGDAYNPPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQE 348
Query: 1240 DREKTSFITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSADE 1299
R T+F P G + + V+PFGL A + +QR M F + K VYVDD++V S +E
Sbjct: 349 SRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFR-VFRKFCCVYVDDILVFSNNE 407
Query: 1300 EQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMP- 1358
E H+ ++ + ++ ++ + L+ K + LG + + + + I + P
Sbjct: 408 EDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPD 467
Query: 1359 APQTEKQVRGFLGRLNYISRFISHMTATCGPIFKLLRKNQPIVWNDECQEAFDSIKNYLL 1418
+ +KQ++ FLG L Y S +I + P+ L++N P W E +K L
Sbjct: 468 TLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKNLQ 527
Query: 1419 EPPILVPPVEGRPLIMYLAVFDESMGCVLG--QQDETGKKEHAIYYLSKKFTDCETRYTM 1476
P L P+ LI+ D+ G +L + +E E Y S F E Y
Sbjct: 528 GFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGSFKAAEKNYHS 587
Query: 1477 LEKTCCALAWATKRLRHYLVNHTTWLISRMDP------IKYIFEKAAVTGKIARWQMLLS 1530
+K A+ K+ YL + R D + ++ + G+ RWQ LS
Sbjct: 588 NDKETLAVINTIKKFSIYLT--PVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLS 645
Query: 1531 EY 1532
Y
Sbjct: 646 HY 647
>POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 680
Score = 122 bits (305), Expect = 1e-26
Identities = 109/422 (25%), Positives = 183/422 (42%), Gaps = 28/422 (6%)
Query: 1136 VRQKLRRTHPDMALKIK---------NEVQKQIDAGFLMTVEYPEWVANIVPV------- 1179
++ ++ + P A+K+K E KQI + V P ++ P
Sbjct: 235 MKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEA 294
Query: 1180 PKKDGKVRMCVDFRDLNKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSPE 1239
G RM V+++ +NKA+ D + LP+ D L+ K+FS D SG+ Q+ + E
Sbjct: 295 ENGRGNKRMVVNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQE 354
Query: 1240 DREKTSFITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSADE 1299
R T+F P G + + V+PFGL A + +QR M F + K VYVDD++V S +E
Sbjct: 355 SRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFR-VFRKFCCVYVDDIVVFSNNE 413
Query: 1300 EQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMP- 1358
E H+ ++ + ++ ++ + L+ K + LG + + + + I + P
Sbjct: 414 EDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPD 473
Query: 1359 APQTEKQVRGFLGRLNYISRFISHMTATCGPIFKLLRKNQPIVWNDECQEAFDSIKNYLL 1418
+ +KQ++ FLG L Y S +I ++ P+ L++N P W E +K L
Sbjct: 474 TLEDKKQLQRFLGILTYASDYIPNLAQMRQPLQAKLKENVPWKWTKEDTLYMQKVKKNLQ 533
Query: 1419 EPPILVPPVEGRPLIMYLAVFDESMGCVLG--QQDETGKKEHAIYYLSKKFTDCETRYTM 1476
P L P+ LI+ D+ G +L + +E E Y S F E Y
Sbjct: 534 GFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYRSGSFKAAERNYHS 593
Query: 1477 LEKTCCALAWATKRLRHYLVNHTTWLISRMDP------IKYIFEKAAVTGKIARWQMLLS 1530
+K A+ K+ YL + R D + ++ + G+ RWQ LS
Sbjct: 594 NDKETLAVINTIKKFSIYLT--PVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIRWQAWLS 651
Query: 1531 EY 1532
Y
Sbjct: 652 HY 653
>POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 659
Score = 116 bits (291), Expect = 6e-25
Identities = 100/387 (25%), Positives = 166/387 (42%), Gaps = 13/387 (3%)
Query: 1181 KKDGKVRMCVDFRDLNKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSPED 1240
++ GK RM V+++ +NKA+ D LP+ D L+ K++S D SG Q+ + E
Sbjct: 277 RRRGKKRMVVNYKAMNKATKGDAHNLPNKDELLTLVRGKKIYSSFDCKSGLWQVLLDKES 336
Query: 1241 REKTSFITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIV-KSADE 1299
+ T+F P G + + V+PFGL A + + + + K VYVDD++V +
Sbjct: 337 QLLTAFTCPQGHYQWNVVPFGLKQAPSIFPKTYANSHSNQYSKYCCVYVDDILVFSNTGR 396
Query: 1300 EQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMP- 1358
++H ++ + R K + L+ K LG + Q + I + P
Sbjct: 397 KEHYIHVLNILRRCEKLGIILSKKKAQLFKEKINFLGLEIDQGTHCPQNHILEHIHKFPD 456
Query: 1359 APQTEKQVRGFLGRLNYISRFISHMTATCGPIFKLLRKNQPIVWNDECQEAFDSIKNYLL 1418
+ +KQ++ FLG L Y S +I + + P+ L+++ WND + IK L
Sbjct: 457 RIEDKKQLQRFLGILTYASDYIPKLASIRKPLQSKLKEDSTWTWNDTDSQYMAKIKKNLK 516
Query: 1419 EPPILVPPVEGRPLIMYLAVFDESMGCVLGQQDETGKKEHAIYYLSKKFTDCETRYTMLE 1478
P L P L++ +E G +L + E+ Y S F E Y E
Sbjct: 517 SFPKLYHPEPNDKLVIETDASEEFWGGIL--KAIHNSHEYICRYASGSFKAAERNYHSNE 574
Query: 1479 KTCCALAWATKRLRHYLVNHTTWLISRMDP------IKYIFEKAAVTGKIARWQMLLSEY 1532
K A+ K+ YL + + R D + + G++ RWQM LS+Y
Sbjct: 575 KELLAVIRVIKKFSIYLT--PSRFLIRTDNKNFTHFVNINLKGDRKQGRLVRWQMWLSQY 632
Query: 1533 DIVFKTQKAIKGSILADHLAYQPLDDY 1559
D + K ++ AD L L +Y
Sbjct: 633 DFDVEHIAGTK-NVFADFLQENTLTNY 658
>POL_RTBVP (P27502) Polyprotein (P194 protein) [Contains: Coat
protein; Protease (EC 3.4.23.-); Reverse transcriptase
(EC 2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
Length = 1675
Score = 113 bits (282), Expect = 7e-24
Identities = 83/316 (26%), Positives = 157/316 (49%), Gaps = 8/316 (2%)
Query: 1185 KVRMCVDFRDLNKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSPEDREKT 1244
K R+ +++ LN D F +PH +++ ++ +FS D +G++ +K+ + ++ T
Sbjct: 1238 KPRIVYNYKRLNDNMHTDPFNIPHKISMINLIQKANIFSKFDLKAGFHHMKLKDDFKDWT 1297
Query: 1245 SFITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSADEEQHVE 1304
+F G + + V PFG+ NA +QR M F D+ K +Y+DD+++ S +E++H+E
Sbjct: 1298 TFTCSEGLYTWNVCPFGIANAPCAFQRFMQESFGDL--KFALLYIDDILIASNNEKEHIE 1355
Query: 1305 YLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKVRAIREMPAPQ--T 1362
+L F R+++ L+ K ++ + LG + + I + P V I++ + T
Sbjct: 1356 HLKIFFNRVKEVGCVLSKKKSKMFLKEVEYLGVEIKEGKISLQPHIVDKIKKFDKNKLNT 1415
Query: 1363 EKQVRGFLGRLNYISRFISHMTATCGPIFKLLRKNQPIVWNDECQEAFDSIKNYLLEPPI 1422
K ++ +LG LNY +I ++ GP++K KN ++N E I+ + +
Sbjct: 1416 LKGLQAYLGLLNYARGYIKDLSKLVGPLYKKTGKNGQRIFNKEDWNIIFKIEREVSKIKP 1475
Query: 1423 LVPPVEGRPLIMYLAVFDESMGCVLGQQDE--TGKKEHAIY-YLSKKFTDCETRYTMLEK 1479
L P E +I+ +E G VL + + +GK I Y S F + +T +T L+
Sbjct: 1476 LERPKETDYIIIETDASEEGWGAVLVCKPDKYSGKDTEKIAGYASGNFGEKKT-WTSLDY 1534
Query: 1480 TCCALAWATKRLRHYL 1495
A+ A + + YL
Sbjct: 1535 EIEAINEALNKFQIYL 1550
>POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 666
Score = 110 bits (274), Expect = 6e-23
Identities = 101/377 (26%), Positives = 164/377 (42%), Gaps = 10/377 (2%)
Query: 1181 KKDGKVRMCVDFRDLNKASPKDNFPLPHIDVLVDNTAQSKVFSFMDGFSGYNQIKMSPED 1240
++ GK RM V+++ +N+A+ D+ LP++ L+ +FS D SG+ Q+ + E
Sbjct: 288 RRRGKKRMVVNYKAINQATIGDSHNLPNMQELLTLLRGKSIFSSFDCKSGFWQVVLDEES 347
Query: 1241 REKTSFITPWGTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEVYVDDMIVKSADEE 1300
++ T+F P G F +KV+PFGL A + +QR M T + K VYVDD+IV S E
Sbjct: 348 QKLTAFTCPQGHFQWKVVPFGLKQAPSIFQRHMQTALNG-ADKFCMVYVDDIIVFSNSEL 406
Query: 1301 QHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFIVSQKGIEVDPDKV-RAIREMP- 1358
H ++ + + + KY + L+ K LG + KG + + I + P
Sbjct: 407 DHYNHVYAVLKIVEKYGIILSKKKANLFKEKINFLGLEI-DKGTHCPQNHILENIHKFPD 465
Query: 1359 APQTEKQVRGFLGRLNYISRFISHMTATCGPIFKLLRKNQPIVWNDECQEAFDSIKNYLL 1418
+ +K ++ FLG L Y +I + P+ L+K+ W + IK L
Sbjct: 466 RLEDKKHLQRFLGVLTYAETYIPKLAEIRKPLQVKLKKDVTWNWTQSDSDYVKKIKKNLG 525
Query: 1419 EPPILVPPVEGRPLIMYLAVFDESMGCVLGQQDETGKKEHAIYYLSKKFTDCETRYTMLE 1478
P L P LI+ D G VL + G E Y S F E Y +
Sbjct: 526 SFPKLYLPKPEDHLIIETDASDSFWGGVLKARALDG-VELICRYSSGSFKQAEKNYHSND 584
Query: 1479 KTCCALAWATKRLRHYLVNHTTWLISRMDPIKYI----FEKAAVTGKIARWQMLLSEYDI 1534
K A+ + YL + + Y + + G++ RWQ S+Y
Sbjct: 585 KELLAVKQVITKFSAYLTPVRFTVRTDNKNFTYFLRINLKGDSKQGRLVRWQNWFSKYQF 644
Query: 1535 VFKTQKAIKGSILADHL 1551
+ + +K ++LAD L
Sbjct: 645 DVEHLEGVK-NVLADCL 660
>POL_GALV (P21414) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1165
Score = 108 bits (269), Expect = 2e-22
Identities = 96/392 (24%), Positives = 167/392 (42%), Gaps = 48/392 (12%)
Query: 1132 ECPPVRQKLRRTHPDMALK-----------IKNEVQKQIDAGFLMTVEYPEWVANIVPVP 1180
+ PPV +LR +A++ I+ +QK +D G L+ P W ++PV
Sbjct: 161 QVPPVVVELRSGASPVAVRQYPMSKEAREGIRPHIQKFLDLGVLVPCRSP-WNTPLLPVK 219
Query: 1181 KKD-GKVRMCVDFRDLNKASPKDNFPLPHIDVLVDNTAQSKV-FSFMDGFSGYNQIKMSP 1238
K R D R++NK + +P+ L+ + S +S +D + +++ P
Sbjct: 220 KPGTNDYRPVQDLREINKRVQDIHPTVPNPYNLLSSLPPSYTWYSVLDLKDAFFCLRLHP 279
Query: 1239 EDREKTSFITPW--------GTFCYKVMPFGLINAGATYQRGMTTLFHDMIHKEVEV--- 1287
+ +F W G + +P G N+ TLF + +H+++
Sbjct: 280 NSQPLFAF--EWKDPEKGNTGQLTWTRLPQGFKNS--------PTLFDEALHRDLAPFRA 329
Query: 1288 ---------YVDDMIVKSADEEQHVEYLTKMFERLRKYKLRLNPNKCTFGVRSGKLLGFI 1338
YVDD++V + E + K+ + L K R++ K R LG++
Sbjct: 330 LNPQVVLLQYVDDLLVAAPTYEDCKKGTQKLLQELSKLGYRVSAKKAQLCQREVTYLGYL 389
Query: 1339 VSQKGIEVDPDKVRAIREMPAPQTEKQVRGFLGRLNYISRFISHMTATCGPIFKLLRKNQ 1398
+ + + P + + ++P P T +QVR FLG + +I + P++ L +++
Sbjct: 390 LKEGKRWLTPARKATVMKIPVPTTPRQVREFLGTAGFCRLWIPGFASLAAPLYPLTKESI 449
Query: 1399 PIVWNDECQEAFDSIKNYLLEPPILVPPVEGRPLIMYLAVFDESMGCVLGQQDET-GKKE 1457
P +W +E Q+AFD IK LL P L P +P +Y+ DE G G +T G
Sbjct: 450 PFIWTEEHQQAFDHIKKALLSAPALALPDLTKPFTLYI---DERAGVARGVLTQTLGPWR 506
Query: 1458 HAIYYLSKKFTDCETRYTMLEKTCCALAWATK 1489
+ YLSKK + + K A+A K
Sbjct: 507 RPVAYLSKKLDPVASGWPTCLKAVAAVALLLK 538
Score = 87.0 bits (214), Expect = 5e-16
Identities = 79/264 (29%), Positives = 118/264 (43%), Gaps = 40/264 (15%)
Query: 1890 PFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYTNVTKQVVAKFIKNNIICRY 1949
P W +D I+P G++++LV ID F+ WVEA T +V K I I+ R+
Sbjct: 876 PGVYWEVDFT-EIKP-GRYGNKYLLVFIDTFSGWVEAFPTKTETALIVCKKILEEILPRF 933
Query: 1950 GVPSKIITDNGTNLNNNVVQALCEEFKIEHHNSSPYRPQMNGAVEAANKNIKRIVQKMV- 2008
G+P + +DNG V Q L + I YRPQ +G VE N+ IK + K+
Sbjct: 934 GIPKVLGSDNGPAFVAQVSQGLATQLGINWKLHCAYRPQSSGQVERMNRTIKETLTKLAL 993
Query: 2009 -TTYKDWHEMLPYALHGYRTTVRSSTGATPFSLVYG--------MEAVLPLEVEIPSLRV 2059
T KDW +LP AL R T G TP+ ++YG E + P + +P L
Sbjct: 994 ETGGKDWVTLLPLALLRARNT-PGRFGLTPYEILYGGPPPILESGETLGPDDRFLPVLFT 1052
Query: 2060 IMEAKLSEAEWCQSRYDQLNLIEEKRMDAMARGQSYQARMKTAFDKKVHPREFKVGELVL 2119
++A L ++ + D + + Y+ T P F+VG+ VL
Sbjct: 1053 HLKA--------------LEIVRTQIWDQIK--EVYKPGTVTI------PHPFQVGDQVL 1090
Query: 2120 KRRISQQPDPRGKWTPNYEGPYVV 2143
RR +P P ++GPY+V
Sbjct: 1091 VRR--HRP---SSLEPRWKGPYLV 1109
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda     K        H
0.340      0.148    0.492
Gapped
Lambda     K        H
0.267      0.0410   0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 231,941,440
Number of Sequences: 164201
Number of extensions: 9394692
Number of successful extensions: 29928
Number of sequences better than 10.0: 137
Number of HSP's better than 10.0 without gapping: 120
Number of HSP's successfully gapped in prelim test: 17
Number of HSP's that attempted gapping in prelim test: 29546
Number of HSP's gapped (non-prelim): 285
length of query: 2175
length of database: 59,974,054
effective HSP length: 126
effective length of query: 2049
effective length of database: 39,284,728
effective search space: 80494407672
effective search space used: 80494407672
T: 11
A: 40
X1: 15 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 39 (21.9 bits)
S2: 74 (33.1 bits)
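The figures in the statistics block can be cross-checked with a short calculation. The sketch below is not part of the BLAST output. It assumes the standard Karlin-Altschul relations (bit score S' = (Lambda*S - ln K) / ln 2 and E = search_space * 2^-S') and that the effective lengths come from subtracting the effective HSP length from the query and from every database sequence. The function names are illustrative only. Because Lambda and K are printed rounded, the recomputed E-values agree with the report only approximately.

```python
import math

# Values copied from the statistics block of this report.
QUERY_LEN = 2175          # length of query
DB_LETTERS = 59_974_054   # length of database
DB_SEQS = 164_201         # number of sequences in database
EFF_HSP_LEN = 126         # effective HSP length
GAPPED_LAMBDA = 0.267     # gapped Lambda
GAPPED_K = 0.0410         # gapped K


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space)."""
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to a bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect(raw_score, lam, k, search_space):
    """Expected number of chance hits scoring at least raw_score."""
    return search_space * 2.0 ** (-bit_score(raw_score, lam, k))


if __name__ == "__main__":
    eff_q, eff_db, space = effective_search_space(
        QUERY_LEN, DB_LETTERS, DB_SEQS, EFF_HSP_LEN
    )
    print("effective length of query:   ", eff_q)
    print("effective length of database:", eff_db)
    print("effective search space:      ", space)
    # Raw score 316 is the value in parentheses for the POL_CAMVE hit.
    print("bits for raw score 316: %.1f" % bit_score(316, GAPPED_LAMBDA, GAPPED_K))
    print("E for raw score 316:    %.1e" % expect(316, GAPPED_LAMBDA, GAPPED_K, space))
```

The three effective-length figures reproduce the report's values exactly. The bit score for raw score 316 rounds to the 126 bits shown for POL_CAMVE, and the E-value lands close to the reported 8e-28.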