
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC140025.16 - phase: 0
(801 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
                                                                    Score   E
Sequences producing significant alignments:                        (bits) Value
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran...   121  8e-27
POL_BAEVM (P10272) Pol polyprotein [Contains: Protease (EC 3.4.2...   108  7e-23
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III    107  1e-22
POL_MLVFF (P26809) Pol polyprotein [Contains: Protease (EC 3.4.2...   103  2e-21
POL_MLVRK (P31795) Pol polyprotein [Contains: Protease (EC 3.4.2...   102  3e-21
POL_MLVRD (P11227) Pol polyprotein [Contains: Protease (EC 3.4.2...   102  3e-21
POL_MLVFP (P26808) Pol polyprotein [Contains: Protease (EC 3.4.2...   102  4e-21
POL_FENV1 (P31792) Pol polyprotein [Contains: Reverse transcript...   102  4e-21
POL3_MOUSE (P11367) Retrovirus-related Pol polyprotein (Endonucl...   102  5e-21
POL_MLVAV (P03356) Pol polyprotein [Contains: Protease (EC 3.4.2...   101  7e-21
POL_MLVMO (P03355) Pol polyprotein [Contains: Protease (EC 3.4.2...   101  9e-21
POL_MLVF5 (P26810) Pol polyprotein [Contains: Protease (EC 3.4.2...   101  9e-21
POL_MLVAK (P03357) Pol polyprotein [Contains: Reverse transcript...    99  3e-20
RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protei...    99  4e-20
RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protei...    99  4e-20
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei...    99  4e-20
POL_MLVCB (P08361) Pol polyprotein [Contains: Reverse transcript...    97  2e-19
POL_SFV1 (P23074) Pol polyprotein [Contains: Protease (EC 3.4.23...    89  4e-17
POL_AVIRE (P03360) Pol polyprotein [Contains: Reverse transcript...    89  6e-17
YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II      87  2e-16
>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
transposon 412 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1237
Score = 121 bits (303), Expect = 8e-27
Identities = 100/381 (26%), Positives = 169/381 (44%), Gaps = 29/381 (7%)
Query: 429 RNYDMVLLRCV----DEHEAEQLMHDVHDGTFGTHATGHTMSRKLLRAGYYWMAMEHDCY 484
+N + LL V +E E E ++ +HD TG T + ++ YYW M
Sbjct: 874 KNLKVALLNPVTQINNEKEKEAILSTLHDDPIQGGHTGITKTLAKVKRHYYWKNMSKYIK 933
Query: 485 QHARKCHKCQIYADKIHVPPHALNVISSPWPFSMWGIDMIGRIEPKASNGHRFILVAIDY 544
++ RKC KCQ H + F +D IG + PK+ NG+ + + I
Sbjct: 934 EYVRKCQKCQKAKTTKHTKTPMTITETPEHAFDRVVVDTIGPL-PKSENGNEYAVTLICD 992
Query: 545 FTKWVEAASYTNVTKQVVAKFIKNNIICRYGVPSKIITDNGTNLNNNVVQALCEEFKIEH 604
TK++ A N + + VAK I + I +YG ITD GT N+++ LC+ KI++
Sbjct: 993 LTKYLVAIPIANKSAKTVAKAIFESFILKYGPMKTFITDMGTEYKNSIITDLCKYLKIKN 1052
Query: 605 HNSSPYRPQMNGAVEAANKNIKRIVQKMVTTYK-DWHEMLPYALHGYRTTVRSSTGATPF 663
S+ + Q G VE +++ + ++ ++T K DW L Y ++ + TT P+
Sbjct: 1053 ITSTAHHHQTVGVVERSHRTLNEYIRSYISTDKTDWDVWLQYFVYCFNTTQSMVHNYCPY 1112
Query: 664 SLVYGMEAVLPLEVEIPSLRVIMEAKLSEAEWCQSRYDQLNLIEEKRMDAMARG----QS 719
LV+G + LP KL E + D + + A AR ++
Sbjct: 1113 ELVFGRTSNLPKHFN----------KLHSIEPIYNIDDYAKESKYRLEVAYARARKLLEA 1162
Query: 720 YQARMKTAFDKKVHPREFKVGELVLKRRVSQQPDPRGKWTPNYEGPYVVKK-AFSGGALI 778
++ + K +D KV E +VG+ VL R + K Y GPY ++ + +
Sbjct: 1163 HKEKNKENYDLKVKDIELEVGDKVLLRN-----EVGHKLDFKYTGPYKIESIGDNNNITL 1217
Query: 779 LTHMDGVELPNPVNADIVKKY 799
LT+ + ++ V+ D +KK+
Sbjct: 1218 LTNKNKKQI---VHKDRLKKF 1235
Score = 64.3 bits (155), Expect = 1e-09
Identities = 42/157 (26%), Positives = 72/157 (45%), Gaps = 3/157 (1%)
Query: 21 KNQPIVWNDECQEAFDSIKNYLLEPPILVPPVEGRPLILYLSVFDESVGCVLGQQDETGK 80
KN P W DECQ+AF +K+ L+ P +L P + + ++ G VL Q
Sbjct: 580 KNVPFEWTDECQKAFIHLKSQLINPTLLQYPDFSKEFCITTDASKQACGAVLTQNH--NG 637
Query: 81 KEHAIYYLSKKFTDCETRYTMLAKTCCALAWAAKRLRHYLVNHTTWLISRMDPIKYIFEK 140
+ + Y S+ FT E+ + + A+ WA R Y+ + + P+ Y+F
Sbjct: 638 HQLPVAYASRAFTKGESNKSTTEQELAAIHWAIIHFRPYIYGKHFTVKTDHRPLTYLFSM 697
Query: 141 AVVTGKIARWQMLLSEYDIVFKAQKAIKGSILADHLA 177
+ K+ R ++ L EY+ + K K + +AD L+
Sbjct: 698 VNPSSKLTRIRLELEEYNFTVEYLKG-KDNHVADALS 733
>POL_BAEVM (P10272) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1189
Score = 108 bits (269), Expect = 7e-23
Identities = 180/768 (23%), Positives = 295/768 (37%), Gaps = 99/768 (12%)
Query: 21 KNQPIVWNDECQEAFDSIKNYLLEPPILVPPVEGRPLILYLSVFDESVGCVLGQ-QDETG 79
++ P W E Q AF+++K LL P L P +P L+L DE G G + G
Sbjct: 448 ESTPFTWQTEHQLAFEALKKALLSAPALGLPDTSKPFTLFL---DERQGIAKGVLTQKLG 504
Query: 80 KKEHAIYYLSKKFTDCETRYTMLAKTCCALAWAAKRLRHYLVN--------HTTWLISRM 131
+ + YLSKK + + A A K + HT I R
Sbjct: 505 PWKRPVAYLSKKLDPVAAGWPPCLRIMAATAMLVKDSAKLTLGQPLTVITPHTLEAIVRQ 564
Query: 132 DPIKYIFEKAVVTGKIARWQMLLSEYDIVFKAQKAIKGSILADHLAYQPLDDYQPIEFDF 191
P ++I ++ +Q LL + D V + + P+ + QP D
Sbjct: 565 PPDRWI-----TNARLTHYQALLLDTDRV-----QFGPPVTLNPATLLPVPENQPSPHDC 614
Query: 192 PDEEIMYLKSKDCEEPLID-EGPDPNSKWGLVFDGAVNAYGKGIGAVIVSLQGHHIPFTA 250
+ ++ E L D E PD + W +++ + GA +V GH+ +
Sbjct: 615 RQ---VLAETHGTREDLKDQELPDADHTWYTDGSSYLDSGTRRAGAAVVD--GHNTIWAQ 669
Query: 251 RILFECTNNMAEYEACIFGIEEAIDM-RIKHLDIYGDSALVINQIKGEWETHHAKLIPYR 309
+ + AE + + +A+++ + K +IY DS + T H Y
Sbjct: 670 SLPPGTSAQKAE----LIALTKALELSKGKKANIYTDSRYA-------FATAHTHGSIYE 718
Query: 310 DYARRLLTYFTKVELHHIPRDENQMADALATLSSMFRVNHWNDVPIIKVQRLERPSHVFA 369
R LLT K + A+ +A L ++F +V II ++ A
Sbjct: 719 --RRGLLTSEGK--------EIKNKAEIIALLKALFLPQ---EVAIIHCPGHQKGQDPVA 765
Query: 370 IGD-----VIDQAG-ENVVDYKPWYYDIKQFLLSREYPSGASKQDKKTLRRLASRFLLDG 423
+G+ V QA V+ + + Y S +D++ R + + D
Sbjct: 766 VGNRQADRVARQAAMAEVLTLATEPDNTSHITIEHTYTS----EDQEEARAIGATENKD- 820
Query: 424 DILYKRNYDMVLLRCVDEHEAEQLMHDVHDGTFGTHATGHTMSRKLLRAGYYWMAMEHDC 483
RN++ + + EA ++ +H T H + + + +
Sbjct: 821 ----TRNWEKEGKIVLPQKEALAMIQQMHAWT---HLGNRKLKLLIEKTDFLIPRASTLI 873
Query: 484 YQHARKCHKCQ-IYADKIHVPPHALNVISSPWPFSMWGIDMIGRIEPKASNGHRFILVAI 542
Q C CQ + A VP + P + W ID ++P + G++++LV +
Sbjct: 874 EQVTSACKVCQQVNAGATRVPAGKRTRGNRPGVY--WEIDFT-EVKPHYA-GYKYLLVFV 929
Query: 543 DYFTKWVEAASYTNVTKQVVAKFIKNNIICRYGVPSKIITDNGTNLNNNVVQALCEEFKI 602
D F+ WVEA T +VAK I I R+G+P I +DNG + V Q L I
Sbjct: 930 DTFSGWVEAFPTRQETAHIVAKKILEEIFPRFGLPKVIGSDNGPAFVSQVSQGLARILGI 989
Query: 603 EHHNSSPYRPQMNGAVEAANKNIKRIVQKMV--TTYKDWHEMLPYALHGYRTTVRSSTGA 660
YRPQ +G VE N+ IK + K+ T KDW +L AL R T + G
Sbjct: 990 NWKLHCAYRPQSSGQVERMNRTIKETLTKLTLETGLKDWRRLLSLALLRARNT-PNRFGL 1048
Query: 661 TPFSLVYGMEAVLPLEVEIPSLRVIMEAKLSEAEWCQSRYDQLNLIEEKRMDAMARGQSY 720
TP+ ++YG PL + S + + Q+R L ++ + +A
Sbjct: 1049 TPYEILYG--GPPPLSTLLNSF-----SPSNSKTDLQARLKGLQAVQAQIWAPLAE---- 1097
Query: 721 QARMKTAFDKKVHPREFKVGELVLKRRVSQQPDPRGKWTPNYEGPYVV 768
+ + HP F+VG+ V RR Q P ++GPY+V
Sbjct: 1098 --LYRPGHSQTSHP--FQVGDSVYVRRHRSQ-----GLEPRWKGPYIV 1136
>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
Length = 2186
Score = 107 bits (268), Expect = 1e-22
Identities = 89/335 (26%), Positives = 146/335 (43%), Gaps = 14/335 (4%)
Query: 439 VDEHEAEQLMHDVHDGTFGTHATGHTMSRKLLRAGYYWMAMEHDCYQHARKCHKCQIYAD 498
V E L+ ++H+G H M R + R +YW M R C KC D
Sbjct: 1460 VPEKIRTPLLKELHEGMLAGHFGIKKMWRMVHRK-FYWPQMRVCVENCVRTCAKCLCAND 1518
Query: 499 KIHVPPHALNVISSPWPFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYTNVT 558
+ +L +P + D++ + G+R+IL ID FTK+ A +
Sbjct: 1519 HSKLTS-SLTPYRMTFPLEIVACDLMD--VGLSVQGNRYILTIIDLFTKYGTAVPIPDKK 1575
Query: 559 KQVVAK-FIKNNIICRYGVPSKIITDNGTNLNNNVVQALCEEFKIEHHNSSPYRPQMNGA 617
+ V K F++ I +P K++TD G N + KIEH + Y + NGA
Sbjct: 1576 AETVLKAFVERWAIGEGRIPLKLLTDQGKEFVNGLFAQFTHMLKIEHITTKGYNSRANGA 1635
Query: 618 VEAANKNIKRIVQKMVTTYKDWHEMLPYALHGYRTTVRSSTGATPFSLVYGMEAVLPLEV 677
VE NK I I++K +W + + YA++ Y V +TG TP L++G + + PLE+
Sbjct: 1636 VERFNKTIMHIMKKKTAVPMEWDDQVVYAVYAYNNCVHENTGETPMFLMHGRDVMGPLEM 1695
Query: 678 EIPSLRVIMEAKLSEAEWCQSRYDQLNLIEEKRMDAMARGQSYQARMKTAFDKKVH---- 733
I A + E + ++ + L + + + AM +SY++ + K H
Sbjct: 1696 SGEDAVGINYADMDEYKHLLTQ-ELLKVQKIAKEHAMREQESYKSLFDQKYASKKHRFPQ 1754
Query: 734 PREFKVGELVLKRRVSQQPDPRGKWTPNYEGPYVV 768
P + E+ ++ +Q P KW+ GPY V
Sbjct: 1755 PGSRVLLEIPSEKLGAQCPKLVNKWS----GPYRV 1785
Score = 62.0 bits (149), Expect = 6e-09
Identities = 39/149 (26%), Positives = 73/149 (48%), Gaps = 9/149 (6%)
Query: 26 VWNDECQEAFDSIKNYLLEPPILVPP-VEG-----RPLILYLSVFDESVGCVLGQQDETG 79
+W E + AF +K + + P+L P VE RP ++Y + +G VL Q+ G
Sbjct: 1207 IWEKEQEIAFQELKKLVCQTPVLAQPDVEAALKGDRPFMIYTDASRKGIGAVLAQEGPDG 1266
Query: 80 KKEHAIYYLSKKFTDCETRYTMLAKTCCALAWAAKRLRHYLVNHTTWLISRMDPIKYIFE 139
+ +H I + SK + ETRY + A+ +A +R + + + + P+ + +
Sbjct: 1267 Q-QHPIAFASKALSPAETRYHITDLEALAMMFALRRFKTIIYGTAITVFTDHKPLISLLK 1325
Query: 140 KAVVTGKIARWQMLLSEYD--IVFKAQKA 166
+ + ++ RW + + E+D IV+ A KA
Sbjct: 1326 GSPLADRLWRWSIEILEFDVKIVYLAGKA 1354
>POL_MLVFF (P26809) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1204
Score = 103 bits (256), Expect = 2e-21
Identities = 87/298 (29%), Positives = 136/298 (45%), Gaps = 25/298 (8%)
Query: 474 YYWMAMEHDCYQHARKCHKC-QIYADKIHVPPHALNVISSPWPFSMWGIDMIGRIEPKAS 532
YY + + C C Q+ A K V + P + W ID ++P
Sbjct: 874 YYMLNRDRTLKDITETCQACAQVNASKSAVKQGTR--VRGHRPGTHWEIDFT-EVKP-GL 929
Query: 533 NGHRFILVAIDYFTKWVEAASYTNVTKQVVAKFIKNNIICRYGVPSKIITDNGTNLNNNV 592
G++++LV ID F+ WVEA T +VV K + I R+G+P + TDNG + V
Sbjct: 930 YGYKYLLVFIDTFSGWVEAFPTKKETAKVVTKKLLEEIFPRFGMPQVLGTDNGPAFVSKV 989
Query: 593 VQALCEEFKIEHHNSSPYRPQMNGAVEAANKNIKRIVQK--MVTTYKDWHEMLPYALHGY 650
Q + + ++ YRPQ +G VE N+ IK + K + T +DW +LP AL+
Sbjct: 990 SQTVADLLGVDWKLHCAYRPQSSGQVERMNRTIKETLTKLTLATGSRDWVLLLPLALYRA 1049
Query: 651 RTTVRSSTGATPFSLVYGMEAVLPLEVEIPSLRVIMEAKLSEAEWCQSRYDQLNLIEEKR 710
R T G TP+ ++YG P V P + AK++ Q+ L L++ +
Sbjct: 1050 RNT-PGPHGLTPYEILYGAP---PPLVNFPDPDM---AKVTHNPSLQAHLQALYLVQHEV 1102
Query: 711 MDAMARGQSYQARMKTAFDKKVHPREFKVGELVLKRRVSQQPDPRGKWTPNYEGPYVV 768
+A +YQ ++ D+ V P F+VG+ V RR + P ++GPY V
Sbjct: 1103 WRPLA--AAYQEQL----DRPVVPHPFRVGDTVWVRRHQTK-----NLEPRWKGPYTV 1149
>POL_MLVRK (P31795) Pol polyprotein [Contains: Protease (EC
3.4.23.-); Reverse transcriptase/ribonuclease H (EC
2.7.7.49) (EC 3.1.26.4) (RT); Integrase (IN)] (Fragment)
Length = 581
Score = 102 bits (255), Expect = 3e-21
Identities = 89/330 (26%), Positives = 150/330 (44%), Gaps = 28/330 (8%)
Query: 445 EQLMHDVHDGTFGTHATGHTMSRKLLRAG---YYWMAMEHDCYQHARKCHKC-QIYADKI 500
+Q + ++ D G+ + LL G YY + + A C C Q+ A K
Sbjct: 222 DQFVFELLDSLHRLTHLGYQKMKALLDRGESPYYMLNRDKTLQYVADSCTVCAQVNASKA 281
Query: 501 HVPPHALNVISSPWPFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYTNVTKQ 560
+ + P + W ID ++P G++++LV +D F+ WVEA + T +
Sbjct: 282 KIGAGVR--VRGHRPGTHWEIDFT-EVKP-GLYGYKYLLVFVDTFSGWVEAFPTKHETAK 337
Query: 561 VVAKFIKNNIICRYGVPSKIITDNGTNLNNNVVQALCEEFKIEHHNSSPYRPQMNGAVEA 620
+V K + I R+G+P + TDNG + V Q++ + I+ YRPQ +G VE
Sbjct: 338 IVTKKLLEEIFPRFGMPQVLGTDNGPAFVSQVSQSVAKLLGIDWKLHCAYRPQSSGQVER 397
Query: 621 ANKNIKRIVQK--MVTTYKDWHEMLPYALHGYRTTVRSSTGATPFSLVYGMEAVLPLEVE 678
N+ IK + K + T +DW +LP AL+ R T G TP+ ++YG L +
Sbjct: 398 MNRTIKETLTKLTLATGTRDWVLLLPLALYRARNT-PGPHGLTPYEILYGAPPPL-VNFH 455
Query: 679 IPSLRVIMEAKLSEAEWCQSRYDQLNLIEEKRMDAMARGQSYQARMKTAFDKKVHPREFK 738
P + +K + + Q+ L ++ + +A +YQ ++ D+ V P F+
Sbjct: 456 DPEM-----SKFTNSPSLQAHLQALQAVQREVWKPLA--AAYQDQL----DQPVIPHPFR 504
Query: 739 VGELVLKRRVSQQPDPRGKWTPNYEGPYVV 768
VG+ V RR + P ++GPY V
Sbjct: 505 VGDTVWVRRHQTK-----NLEPRWKGPYTV 529
>POL_MLVRD (P11227) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1196
Score = 102 bits (255), Expect = 3e-21
Identities = 89/330 (26%), Positives = 150/330 (44%), Gaps = 28/330 (8%)
Query: 445 EQLMHDVHDGTFGTHATGHTMSRKLLRAG---YYWMAMEHDCYQHARKCHKC-QIYADKI 500
+Q + ++ D G+ + LL G YY + + A C C Q+ A K
Sbjct: 837 DQFVFELLDSLHRLTHLGYQKMKALLDRGESPYYMLNRDKTLQYVADSCTVCAQVNASKA 896
Query: 501 HVPPHALNVISSPWPFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYTNVTKQ 560
+ + P + W ID ++P G++++LV +D F+ WVEA + T +
Sbjct: 897 KIGAGVR--VRGHRPGTHWEIDFT-EVKP-GLYGYKYLLVFVDTFSGWVEAFPTKHETAK 952
Query: 561 VVAKFIKNNIICRYGVPSKIITDNGTNLNNNVVQALCEEFKIEHHNSSPYRPQMNGAVEA 620
+V K + I R+G+P + TDNG + V Q++ + I+ YRPQ +G VE
Sbjct: 953 IVTKKLLEEIFPRFGMPQVLGTDNGPAFVSQVSQSVAKLLGIDWKLHCAYRPQSSGQVER 1012
Query: 621 ANKNIKRIVQK--MVTTYKDWHEMLPYALHGYRTTVRSSTGATPFSLVYGMEAVLPLEVE 678
N+ IK + K + T +DW +LP AL+ R T G TP+ ++YG L +
Sbjct: 1013 MNRTIKETLTKLTLATGTRDWVLLLPLALYRARNT-PGPHGLTPYEILYGAPPPL-VNFH 1070
Query: 679 IPSLRVIMEAKLSEAEWCQSRYDQLNLIEEKRMDAMARGQSYQARMKTAFDKKVHPREFK 738
P + +K + + Q+ L ++ + +A +YQ ++ D+ V P F+
Sbjct: 1071 DPEM-----SKFTNSPSLQAHLQALQAVQREVWKPLA--AAYQDQL----DQPVIPHPFR 1119
Query: 739 VGELVLKRRVSQQPDPRGKWTPNYEGPYVV 768
VG+ V RR + P ++GPY V
Sbjct: 1120 VGDTVWVRRHQTK-----NLEPRWKGPYTV 1144
>POL_MLVFP (P26808) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1204
Score = 102 bits (254), Expect = 4e-21
Identities = 86/298 (28%), Positives = 136/298 (44%), Gaps = 25/298 (8%)
Query: 474 YYWMAMEHDCYQHARKCHKC-QIYADKIHVPPHALNVISSPWPFSMWGIDMIGRIEPKAS 532
YY + + C C Q+ A K V + P + W ID ++P
Sbjct: 874 YYMLNRDRTLKDITETCKACAQVNASKSAVKQGTR--VRGHRPGTHWEIDFT-EVKP-GL 929
Query: 533 NGHRFILVAIDYFTKWVEAASYTNVTKQVVAKFIKNNIICRYGVPSKIITDNGTNLNNNV 592
G++++LV +D F+ WVEA T +VV K + I R+G+P + TDNG + V
Sbjct: 930 YGYKYLLVFVDTFSGWVEAFPTKKETAKVVTKKLLEEIFPRFGMPQVLGTDNGPAFVSKV 989
Query: 593 VQALCEEFKIEHHNSSPYRPQMNGAVEAANKNIKRIVQK--MVTTYKDWHEMLPYALHGY 650
Q + + ++ YRPQ +G VE N+ IK + K + T +DW +LP AL+
Sbjct: 990 SQTVADLLGVDWKLHCAYRPQSSGQVERMNRTIKETLTKLTLATGSRDWVLLLPLALYRA 1049
Query: 651 RTTVRSSTGATPFSLVYGMEAVLPLEVEIPSLRVIMEAKLSEAEWCQSRYDQLNLIEEKR 710
R T G TP+ ++YG P V P + AK++ Q+ L L++ +
Sbjct: 1050 RNT-PGPHGLTPYEILYGAP---PPLVNFPDPDM---AKVTHNPSLQAHLQALYLVQHEV 1102
Query: 711 MDAMARGQSYQARMKTAFDKKVHPREFKVGELVLKRRVSQQPDPRGKWTPNYEGPYVV 768
+A +YQ ++ D+ V P F+VG+ V RR + P ++GPY V
Sbjct: 1103 WRPLA--AAYQEQL----DRPVVPHPFRVGDTVWVRRHQTK-----NLEPRWKGPYTV 1149
>POL_FENV1 (P31792) Pol polyprotein [Contains: Reverse
transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
(RT); Integrase (IN)] (Fragment)
Length = 1046
Score = 102 bits (254), Expect = 4e-21
Identities = 159/675 (23%), Positives = 259/675 (37%), Gaps = 93/675 (13%)
Query: 21 KNQPIVWNDECQEAFDSIKNYLLEPPILVPPVEGRPLILYLSVFDESVGCVLGQ-QDETG 79
++ P W ++ Q AF+++K LL P L P +P L++ DE G G + G
Sbjct: 305 ESAPFTWQEKHQSAFEALKEALLSAPALGLPDTSKPFTLFI---DEKQGIAKGVLTQKLG 361
Query: 80 KKEHAIYYLSKKFTDCETRYTMLAKTCCALAWAAKRLRHYLVN--------HTTWLISRM 131
+ + YLSKK + + A A K + H I R
Sbjct: 362 PWKRPVAYLSKKLDPVAAGWPPCLRIMAATAMLVKDSAKLTLGQPLTVITPHALEAIVRQ 421
Query: 132 DPIKYIFEKAVVTGKIARWQMLLSEYD-IVFKAQKAIKGSIL--------ADHLAYQPLD 182
P ++I ++ +Q LL + D I F + + L + H Q L
Sbjct: 422 TPDRWI-----TNARLTHYQALLLDTDRIQFGPPVTLNPATLLPAPEDQQSAHDCRQVLA 476
Query: 183 DYQPIEFDFPDEEIMYLKSKDCEEPLIDEGPDPNSKWGLVFDGAVNAYGKGIGAVIVSLQ 242
+ D D+E+ PD + W +++ + GA +V
Sbjct: 477 ETHGTREDLKDQEL----------------PDADHSWYTDGSSYIDSGTRRAGAAVVD-- 518
Query: 243 GHHIPFTARILFECTNNMAEYEACIFGIEEAIDMRI-KHLDIYGDSALVINQIKGEWETH 301
GHHI + + + AE + + +A+++ K +IY DS + T
Sbjct: 519 GHHIIWAQSLPPGTSAQKAE----LIALTKALELSEGKKANIYTDSRYA-------FATA 567
Query: 302 HAKLIPYRDYARRLLTYFTKVELHHIPRDENQMADALATLSSMFRVNHWNDVPIIKVQRL 361
H Y R LLT K + A+ +A L ++F V II
Sbjct: 568 HTHGSIYE--RRGLLTSEGK--------EIKNKAEIIALLKALFLPRK---VAIIHCPGH 614
Query: 362 ERPSHVFAIGD-VIDQAGENVVDYKPWYYDIK---QFLLSREYPSGASKQDK-KTLRRLA 416
++ A G+ DQ V + K L + +Y Q++ K + +
Sbjct: 615 QKGQDPIATGNRQADQVARQVAVAETLTLTTKLEETNLTTNKYAYTPEDQEEAKAIGAIL 674
Query: 417 SRFLLDGDILYKRNYDMVLLRCVDEHEAEQLMHDVHDGTFGTHATGHTMSRKLLRAGYYW 476
++ D +++ +VL R EA ++ +H T H + + + + +
Sbjct: 675 NQDTKD----WEKEGKIVLPR----KEALAMIQQMHAWT---HLSNQKLKLLIEKTDFLI 723
Query: 477 MAMEHDCYQHARKCHKCQ-IYADKIHVPPHALNVISSPWPFSMWGIDMIGRIEPKASNGH 535
Q C CQ + A VP + P + W ID ++P + G+
Sbjct: 724 PKAGTLIEQVTSACKVCQQVNAGATRVPEGKRTRGNRPGVY--WEIDFT-EVKPHYA-GY 779
Query: 536 RFILVAIDYFTKWVEAASYTNVTKQVVAKFIKNNIICRYGVPSKIITDNGTNLNNNVVQA 595
+++LV +D F+ WVEA T +VAK I I R+G+P I +DNG + V Q
Sbjct: 780 KYLLVFVDTFSGWVEAYPTRQETAHMVAKKILEEIFPRFGLPKVIGSDNGPAFVSQVSQG 839
Query: 596 LCEEFKIEHHNSSPYRPQMNGAVEAANKNIKRIVQKMV--TTYKDWHEMLPYALHGYRTT 653
L I YRPQ +G VE N+ IK + K+ T KDW +L AL R T
Sbjct: 840 LARTLGINWKLHCAYRPQSSGQVERMNRTIKETLTKLTLETGLKDWRRLLSLALLRARNT 899
Query: 654 VRSSTGATPFSLVYG 668
+ G TP+ ++YG
Sbjct: 900 -PNRFGLTPYEILYG 913
>POL3_MOUSE (P11367) Retrovirus-related Pol polyprotein
(Endonuclease) (Fragment)
Length = 390
Score = 102 bits (253), Expect = 5e-21
Identities = 88/316 (27%), Positives = 147/316 (45%), Gaps = 27/316 (8%)
Query: 458 THATGHTMSRKLLR--AGYYWMAMEHDCYQHARKCHKC-QIYADKIHVPPHALNVISSPW 514
TH + M L R + YY + + ++ A C C Q+ A K + A +
Sbjct: 60 THLSYQKMRALLDRKESPYYMLNKDKILHEVAESCQACVQVNASKTKI--RAGTRVRGHR 117
Query: 515 PFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYTNVTKQVVAKFIKNNIICRY 574
+ W ID ++P G++++LV +D F+ WVEA + T ++V K + I R+
Sbjct: 118 LGTHWEIDFT-EVKP-GLYGYKYLLVFVDTFSGWVEAFPTKHETAKIVTKKLLEEIFPRF 175
Query: 575 GVPSKIITDNGTNLNNNVVQALCEEFKIEHHNSSPYRPQMNGAVEAANKNIKRIVQK--M 632
G+P + TDNG + V Q++ + I+ YRPQ +G VE N+ IK + K +
Sbjct: 176 GMPQVLGTDNGPAFVSQVSQSVAKLLGIDWKLHCAYRPQSSGQVERMNRTIKETLTKLTL 235
Query: 633 VTTYKDWHEMLPYALHGYRTTVRSSTGATPFSLVYGMEAVLPLEVEIPSLRVIMEAKLSE 692
T +DW +LP AL+ R T G TP+ ++YG L + P + +K +
Sbjct: 236 ATGTRDWVLLLPLALYRARNT-PGPHGLTPYEILYGAPPPL-VNFHDPEM-----SKFTN 288
Query: 693 AEWCQSRYDQLNLIEEKRMDAMARGQSYQARMKTAFDKKVHPREFKVGELVLKRRVSQQP 752
+ Q+ L ++ + +A +YQ ++ D+ V P F+VG+ V RR +
Sbjct: 289 SPSLQAHLQALQAVQREVWKPLA--AAYQDQL----DQPVIPHPFRVGDTVWVRRHQTK- 341
Query: 753 DPRGKWTPNYEGPYVV 768
P ++GPY V
Sbjct: 342 ----NLEPRWKGPYTV 353
>POL_MLVAV (P03356) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1196
Score = 101 bits (252), Expect = 7e-21
Identities = 87/330 (26%), Positives = 151/330 (45%), Gaps = 28/330 (8%)
Query: 445 EQLMHDVHDGTFGTHATGHTMSRKLLRAG---YYWMAMEHDCYQHARKCHKC-QIYADKI 500
+Q + ++ D G+ + LL G YY + + A C C Q+ A K
Sbjct: 837 DQFVFELLDSLHRLTHLGYQKMKALLDRGESPYYMLNRDKTLQYVADSCTVCAQVNASKA 896
Query: 501 HVPPHALNVISSPWPFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYTNVTKQ 560
+ + P S W ID ++P G++++LV +D F+ WVEA T +
Sbjct: 897 KIGAGVR--VRGHRPGSHWEIDFT-EVKP-GLYGYKYLLVFVDTFSGWVEAFPTKRETAR 952
Query: 561 VVAKFIKNNIICRYGVPSKIITDNGTNLNNNVVQALCEEFKIEHHNSSPYRPQMNGAVEA 620
VV+K + I R+G+P + +DNG + V Q++ + I+ YRPQ +G VE
Sbjct: 953 VVSKKLLEEIFPRFGMPQVLGSDNGPAFTSQVSQSVADLLGIDWKLHCAYRPQSSGQVER 1012
Query: 621 ANKNIKRIVQKMVTT--YKDWHEMLPYALHGYRTTVRSSTGATPFSLVYGMEAVLPLEVE 678
N+ IK + K+ +DW +LP AL+ R T G TP+ ++YG L +
Sbjct: 1013 MNRTIKETLTKLTLAAGTRDWVLLLPLALYRARNT-PGPHGLTPYEILYGAPPPL-VNFH 1070
Query: 679 IPSLRVIMEAKLSEAEWCQSRYDQLNLIEEKRMDAMARGQSYQARMKTAFDKKVHPREFK 738
P + ++L+ + Q+ L ++ + +A ++Y+ ++ D+ V P F+
Sbjct: 1071 DPDM-----SELTNSPSLQAHLQALQTVQREIWKPLA--EAYRDQL----DQPVIPHPFR 1119
Query: 739 VGELVLKRRVSQQPDPRGKWTPNYEGPYVV 768
+G+ V RR + P ++GPY V
Sbjct: 1120 IGDSVWVRRHQTK-----NLEPRWKGPYTV 1144
>POL_MLVMO (P03355) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1199
Score = 101 bits (251), Expect = 9e-21
Identities = 90/316 (28%), Positives = 143/316 (44%), Gaps = 27/316 (8%)
Query: 458 THATGHTMSRKLLRAG--YYWMAMEHDCYQHARKCHKC-QIYADKIHVPPHALNVISSPW 514
TH + M L R+ YY + + C C Q+ A K V +
Sbjct: 851 THLSFSKMKALLERSHSPYYMLNRDRTLKNITETCKACAQVNASKSAVKQGTR--VRGHR 908
Query: 515 PFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYTNVTKQVVAKFIKNNIICRY 574
P + W ID I+P G++++LV ID F+ W+EA T +VV K + I R+
Sbjct: 909 PGTHWEIDFT-EIKP-GLYGYKYLLVFIDTFSGWIEAFPTKKETAKVVTKKLLEEIFPRF 966
Query: 575 GVPSKIITDNGTNLNNNVVQALCEEFKIEHHNSSPYRPQMNGAVEAANKNIKRIVQK--M 632
G+P + TDNG + V Q + + I+ YRPQ +G VE N+ IK + K +
Sbjct: 967 GMPQVLGTDNGPAFVSKVSQTVADLLGIDWKLHCAYRPQSSGQVERMNRTIKETLTKLTL 1026
Query: 633 VTTYKDWHEMLPYALHGYRTTVRSSTGATPFSLVYGMEAVLPLEVEIPSLRVIMEAKLSE 692
T +DW +LP AL+ R T G TP+ ++YG P V P + +++
Sbjct: 1027 ATGSRDWVLLLPLALYRARNT-PGPHGLTPYEILYGAP---PPLVNFPDPDM---TRVTN 1079
Query: 693 AEWCQSRYDQLNLIEEKRMDAMARGQSYQARMKTAFDKKVHPREFKVGELVLKRRVSQQP 752
+ Q+ L L++ + +A +YQ ++ D+ V P ++VG+ V RR +
Sbjct: 1080 SPSLQAHLQALYLVQHEVWRPLA--AAYQEQL----DRPVVPHPYRVGDTVWVRRHQTK- 1132
Query: 753 DPRGKWTPNYEGPYVV 768
P ++GPY V
Sbjct: 1133 ----NLEPRWKGPYTV 1144
>POL_MLVF5 (P26810) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1204
Score = 101 bits (251), Expect = 9e-21
Identities = 78/256 (30%), Positives = 124/256 (47%), Gaps = 22/256 (8%)
Query: 515 PFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYTNVTKQVVAKFIKNNIICRY 574
P + W ID ++P G++++LV +D F+ WVEA T +VV K + I R+
Sbjct: 914 PGTHWEIDFT-EVKP-GLYGYKYLLVFVDTFSGWVEAFPTKKETAKVVTKKLLEEIFPRF 971
Query: 575 GVPSKIITDNGTNLNNNVVQALCEEFKIEHHNSSPYRPQMNGAVEAANKNIKRIVQK--M 632
G+P + TDNG + V Q + + ++ YRPQ +G VE N+ IK + K +
Sbjct: 972 GMPQVLGTDNGPAFVSKVSQTVADLLGVDWKLHCAYRPQSSGQVERMNRTIKETLTKLTL 1031
Query: 633 VTTYKDWHEMLPYALHGYRTTVRSSTGATPFSLVYGMEAVLPLEVEIPSLRVIMEAKLSE 692
T +DW +LP AL+ R T G TP+ ++YG P V P + AK++
Sbjct: 1032 ATGSRDWVLLLPLALYRARNT-PGPHGLTPYEILYGAP---PPLVNFPDPDM---AKVTH 1084
Query: 693 AEWCQSRYDQLNLIEEKRMDAMARGQSYQARMKTAFDKKVHPREFKVGELVLKRRVSQQP 752
Q+ L L++ + +A +YQ ++ D+ V P F+VG+ V RR +
Sbjct: 1085 NPSLQAHLQALYLVQHEVWRPLA--AAYQEQL----DRPVVPHPFRVGDTVWVRRHQTK- 1137
Query: 753 DPRGKWTPNYEGPYVV 768
P ++GPY V
Sbjct: 1138 ----NLEPRWKGPYTV 1149
>POL_MLVAK (P03357) Pol polyprotein [Contains: Reverse
transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
(RT); Integrase (IN)] (Fragment)
Length = 843
Score = 99.4 bits (246), Expect = 3e-20
Identities = 87/330 (26%), Positives = 153/330 (46%), Gaps = 29/330 (8%)
Query: 445 EQLMHDVHDGTFGTHATGHTMSRKLLRAG---YYWMAMEHDCYQHARKCHKC-QIYADKI 500
+Q + ++ D G+ + LL G YY + + A C C Q+ A K
Sbjct: 485 DQFVFELLDSLHRLTHLGYQKMKALLDRGESPYYMLNRDKTLQYVADSCTVCAQVNASKA 544
Query: 501 HVPPHALNVISSPWPFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYTNVTKQ 560
+ + P S W ID ++P G++++LV +D F+ WVEA T +
Sbjct: 545 KIGAGVR--VRGHRPGSHWEIDFT-EVKP-GLYGYKYLLVFVDTFSGWVEAFPTKRETAR 600
Query: 561 VVAKFIKNNIICRYGVPSKIITDNGTNLNNNVVQALCEEFKIEHHNSSPYRPQMNGAVEA 620
VV+K + I R+G+P + +DNG + V Q++ + I+ + + YRPQ +G VE
Sbjct: 601 VVSKKLLEEIFPRFGMPQVLGSDNGPAFTSQVSQSVADLLGIDKLHCA-YRPQSSGQVER 659
Query: 621 ANKNIKRIVQKMVTT--YKDWHEMLPYALHGYRTTVRSSTGATPFSLVYGMEAVLPLEVE 678
N+ IK + K+ +DW +LP AL+ R T G TP+ ++YG L +
Sbjct: 660 MNRTIKETLTKLTLAAGTRDWVLLLPLALYRARNT-PGPHGLTPYEILYGAPPPL-VNFH 717
Query: 679 IPSLRVIMEAKLSEAEWCQSRYDQLNLIEEKRMDAMARGQSYQARMKTAFDKKVHPREFK 738
P + ++L+ + Q+ L ++ + +A ++Y+ ++ D+ V P F+
Sbjct: 718 DPDM-----SELTNSPSLQAHLQALQTVQREIWKPLA--EAYRDQL----DQPVIPHPFR 766
Query: 739 VGELVLKRRVSQQPDPRGKWTPNYEGPYVV 768
+G+ V RR + P ++GPY V
Sbjct: 767 IGDSVWVRRHQTK-----NLEPRWKGPYTV 791
>RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protein
type 3
Length = 1333
Score = 99.0 bits (245), Expect = 4e-20
Identities = 114/505 (22%), Positives = 209/505 (40%), Gaps = 53/505 (10%)
Query: 274 IDMRIKHLDIYGDSALVINQIKGEWETHHAKLIPYRDYARRLLTYFTKVELHHIPRDENQ 333
++ I+ I D +I +I E E + +L ++ + + E+++ P N
Sbjct: 770 LESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDF-----NFEINYRPGSANH 824
Query: 334 MADALATLSSMFRVNHWNDVPIIKVQRLERPSHVFAIGDVIDQAGENVVDYKPWYYDIKQ 393
+ADAL+ + PI K + V I D + V +Y D K
Sbjct: 825 IADALSRIVD-------ETEPIPKDSEDNSINFVNQISITDDFKNQVVTEYTN---DTKL 874
Query: 394 FLLSREYPSGASKQDKKTLRRLASRFLLDGDILYKRNYDMVLLRCVDEHEAEQLMHDVHD 453
L + +DK+ + L DG ++ + D +LL D ++ H+
Sbjct: 875 LNL-------LNNEDKRVEENIQ---LKDGLLINSK--DQILLPN-DTQLTRTIIKKYHE 921
Query: 454 GTFGTHATGHTMSRKLLRAGYYWMAMEHDCYQHARKCHKCQIYADKIHVPPHALNVIS-S 512
H ++ +LR + W + ++ + CH CQI + H P L I S
Sbjct: 922 EGKLIHPGIELLTNIILRR-FTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPS 980
Query: 513 PWPFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYT-NVTKQVVAKFIKNNII 571
P+ +D I + S+G+ + V +D F+K T ++T + A+ +I
Sbjct: 981 ERPWESLSMDFITALPE--SSGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVI 1038
Query: 572 CRYGVPSKIITDNGTNLNNNVVQALCEEFKIEHHNSSPYRPQMNGAVEAANKNIKRIVQK 631
+G P +II DN + + ++ S PYRPQ +G E N+ ++++++
Sbjct: 1039 AYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRC 1098
Query: 632 MVTTYKD-WHEMLPYALHGYRTTVRSSTGATPFSLVYGMEAVLPLEVEIPSLRVIMEAKL 690
+ +T+ + W + + Y + S+T TPF +V+ L +E+PS +
Sbjct: 1099 VCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALS-PLELPSFSDKTDENS 1157
Query: 691 SEA-EWCQSRYDQLNLIEEKRMDAMARGQSYQARMKTAFDKKVHP-REFKVGELVL-KRR 747
E + Q+ + LN + +MK FD K+ EF+ G+LV+ KR
Sbjct: 1158 QETIQVFQTVKEHLN--------------TNNIKMKKYFDMKIQEIEEFQPGDLVMVKRT 1203
Query: 748 VSQQPDPRGKWTPNYEGP-YVVKKA 771
+ K P++ GP YV++K+
Sbjct: 1204 KTGFLHKSNKLAPSFAGPFYVLQKS 1228
Score = 59.7 bits (143), Expect = 3e-08
Identities = 48/192 (25%), Positives = 88/192 (45%), Gaps = 21/192 (10%)
Query: 19 LSKNQPIVWNDECQEAFDSIKNYLLEPPILVPPVEGRPLILYLSVFDESVGCVLGQQDET 78
L K+ W +A ++IK L+ PP+L + ++L D +VG VL Q+ +
Sbjct: 670 LKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHD- 728
Query: 79 GKKEHAIYYLSKKFTDCETRYTMLAKTCCALAWAAKRLRHYLVNHTTWLISRMDPIKYIF 138
K + + Y S K + + Y++ K A+ + K RHYL S ++P K +
Sbjct: 729 DDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLE-------STIEPFKILT 781
Query: 139 EKAVVTGKI-----------ARWQMLLSEYDIVFKAQKAIKGSILADHLAYQPLDDYQPI 187
+ + G+I ARWQ+ L +++ + + +AD L+ + +D+ +PI
Sbjct: 782 DHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPG-SANHIADALS-RIVDETEPI 839
Query: 188 EFDFPDEEIMYL 199
D D I ++
Sbjct: 840 PKDSEDNSINFV 851
>RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protein
type 2
Length = 1333
Score = 99.0 bits (245), Expect = 4e-20
Identities = 114/505 (22%), Positives = 209/505 (40%), Gaps = 53/505 (10%)
Query: 274 IDMRIKHLDIYGDSALVINQIKGEWETHHAKLIPYRDYARRLLTYFTKVELHHIPRDENQ 333
++ I+ I D +I +I E E + +L ++ + + E+++ P N
Sbjct: 770 LESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDF-----NFEINYRPGSANH 824
Query: 334 MADALATLSSMFRVNHWNDVPIIKVQRLERPSHVFAIGDVIDQAGENVVDYKPWYYDIKQ 393
+ADAL+ + PI K + V I D + V +Y D K
Sbjct: 825 IADALSRIVD-------ETEPIPKDSEDNSINFVNQISITDDFKNQVVTEYTN---DTKL 874
Query: 394 FLLSREYPSGASKQDKKTLRRLASRFLLDGDILYKRNYDMVLLRCVDEHEAEQLMHDVHD 453
L + +DK+ + L DG ++ + D +LL D ++ H+
Sbjct: 875 LNL-------LNNEDKRVEENIQ---LKDGLLINSK--DQILLPN-DTQLTRTIIKKYHE 921
Query: 454 GTFGTHATGHTMSRKLLRAGYYWMAMEHDCYQHARKCHKCQIYADKIHVPPHALNVIS-S 512
H ++ +LR + W + ++ + CH CQI + H P L I S
Sbjct: 922 EGKLIHPGIELLTNIILRR-FTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPS 980
Query: 513 PWPFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYT-NVTKQVVAKFIKNNII 571
P+ +D I + S+G+ + V +D F+K T ++T + A+ +I
Sbjct: 981 ERPWESLSMDFITALPE--SSGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVI 1038
Query: 572 CRYGVPSKIITDNGTNLNNNVVQALCEEFKIEHHNSSPYRPQMNGAVEAANKNIKRIVQK 631
+G P +II DN + + ++ S PYRPQ +G E N+ ++++++
Sbjct: 1039 AYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRC 1098
Query: 632 MVTTYKD-WHEMLPYALHGYRTTVRSSTGATPFSLVYGMEAVLPLEVEIPSLRVIMEAKL 690
+ +T+ + W + + Y + S+T TPF +V+ L +E+PS +
Sbjct: 1099 VCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALS-PLELPSFSDKTDENS 1157
Query: 691 SEA-EWCQSRYDQLNLIEEKRMDAMARGQSYQARMKTAFDKKVHP-REFKVGELVL-KRR 747
E + Q+ + LN + +MK FD K+ EF+ G+LV+ KR
Sbjct: 1158 QETIQVFQTVKEHLN--------------TNNIKMKKYFDMKIQEIEEFQPGDLVMVKRT 1203
Query: 748 VSQQPDPRGKWTPNYEGP-YVVKKA 771
+ K P++ GP YV++K+
Sbjct: 1204 KTGFLHKSNKLAPSFAGPFYVLQKS 1228
Score = 59.7 bits (143), Expect = 3e-08
Identities = 48/192 (25%), Positives = 88/192 (45%), Gaps = 21/192 (10%)
Query: 19 LSKNQPIVWNDECQEAFDSIKNYLLEPPILVPPVEGRPLILYLSVFDESVGCVLGQQDET 78
L K+ W +A ++IK L+ PP+L + ++L D +VG VL Q+ +
Sbjct: 670 LKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHD- 728
Query: 79 GKKEHAIYYLSKKFTDCETRYTMLAKTCCALAWAAKRLRHYLVNHTTWLISRMDPIKYIF 138
K + + Y S K + + Y++ K A+ + K RHYL S ++P K +
Sbjct: 729 DDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLE-------STIEPFKILT 781
Query: 139 EKAVVTGKI-----------ARWQMLLSEYDIVFKAQKAIKGSILADHLAYQPLDDYQPI 187
+ + G+I ARWQ+ L +++ + + +AD L+ + +D+ +PI
Sbjct: 782 DHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPG-SANHIADALS-RIVDETEPI 839
Query: 188 EFDFPDEEIMYL 199
D D I ++
Sbjct: 840 PKDSEDNSINFV 851
>RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protein
type 1
Length = 1333
Score = 99.0 bits (245), Expect = 4e-20
Identities = 114/505 (22%), Positives = 209/505 (40%), Gaps = 53/505 (10%)
Query: 274 IDMRIKHLDIYGDSALVINQIKGEWETHHAKLIPYRDYARRLLTYFTKVELHHIPRDENQ 333
++ I+ I D +I +I E E + +L ++ + + E+++ P N
Sbjct: 770 LESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDF-----NFEINYRPGSANH 824
Query: 334 MADALATLSSMFRVNHWNDVPIIKVQRLERPSHVFAIGDVIDQAGENVVDYKPWYYDIKQ 393
+ADAL+ + PI K + V I D + V +Y D K
Sbjct: 825 IADALSRIVD-------ETEPIPKDSEDNSINFVNQISITDDFKNQVVTEYTN---DTKL 874
Query: 394 FLLSREYPSGASKQDKKTLRRLASRFLLDGDILYKRNYDMVLLRCVDEHEAEQLMHDVHD 453
L + +DK+ + L DG ++ + D +LL D ++ H+
Sbjct: 875 LNL-------LNNEDKRVEENIQ---LKDGLLINSK--DQILLPN-DTQLTRTIIKKYHE 921
Query: 454 GTFGTHATGHTMSRKLLRAGYYWMAMEHDCYQHARKCHKCQIYADKIHVPPHALNVIS-S 512
H ++ +LR + W + ++ + CH CQI + H P L I S
Sbjct: 922 EGKLIHPGIELLTNIILRR-FTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPS 980
Query: 513 PWPFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYT-NVTKQVVAKFIKNNII 571
P+ +D I + S+G+ + V +D F+K T ++T + A+ +I
Sbjct: 981 ERPWESLSMDFITALPE--SSGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVI 1038
Query: 572 CRYGVPSKIITDNGTNLNNNVVQALCEEFKIEHHNSSPYRPQMNGAVEAANKNIKRIVQK 631
+G P +II DN + + ++ S PYRPQ +G E N+ ++++++
Sbjct: 1039 AYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRC 1098
Query: 632 MVTTYKD-WHEMLPYALHGYRTTVRSSTGATPFSLVYGMEAVLPLEVEIPSLRVIMEAKL 690
+ +T+ + W + + Y + S+T TPF +V+ L +E+PS +
Sbjct: 1099 VCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALS-PLELPSFSDKTDENS 1157
Query: 691 SEA-EWCQSRYDQLNLIEEKRMDAMARGQSYQARMKTAFDKKVHP-REFKVGELVL-KRR 747
E + Q+ + LN + +MK FD K+ EF+ G+LV+ KR
Sbjct: 1158 QETIQVFQTVKEHLN--------------TNNIKMKKYFDMKIQEIEEFQPGDLVMVKRT 1203
Query: 748 VSQQPDPRGKWTPNYEGP-YVVKKA 771
+ K P++ GP YV++K+
Sbjct: 1204 KTGFLHKSNKLAPSFAGPFYVLQKS 1228
Score = 59.7 bits (143), Expect = 3e-08
Identities = 48/192 (25%), Positives = 88/192 (45%), Gaps = 21/192 (10%)
Query: 19 LSKNQPIVWNDECQEAFDSIKNYLLEPPILVPPVEGRPLILYLSVFDESVGCVLGQQDET 78
L K+ W +A ++IK L+ PP+L + ++L D +VG VL Q+ +
Sbjct: 670 LKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHD- 728
Query: 79 GKKEHAIYYLSKKFTDCETRYTMLAKTCCALAWAAKRLRHYLVNHTTWLISRMDPIKYIF 138
K + + Y S K + + Y++ K A+ + K RHYL S ++P K +
Sbjct: 729 DDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLE-------STIEPFKILT 781
Query: 139 EKAVVTGKI-----------ARWQMLLSEYDIVFKAQKAIKGSILADHLAYQPLDDYQPI 187
+ + G+I ARWQ+ L +++ + + +AD L+ + +D+ +PI
Sbjct: 782 DHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPG-SANHIADALS-RIVDETEPI 839
Query: 188 EFDFPDEEIMYL 199
D D I ++
Sbjct: 840 PKDSEDNSINFV 851
>POL_MLVCB (P08361) Pol polyprotein [Contains: Reverse
transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
(RT); Integrase (IN)] (Fragment)
Length = 282
Score = 97.1 bits (240), Expect = 2e-19
Identities = 70/237 (29%), Positives = 116/237 (48%), Gaps = 20/237 (8%)
Query: 534 GHRFILVAIDYFTKWVEAASYTNVTKQVVAKFIKNNIICRYGVPSKIITDNGTNLNNNVV 593
G++++LV +D F+ W+EA T +VV K + I R+G+P + TDNG + V
Sbjct: 12 GYKYLLVFVDTFSGWIEAFPTKKETAKVVTKKLLEEIFPRFGMPQVLGTDNGPAFVSKVS 71
Query: 594 QALCEEFKIEHHNSSPYRPQMNGAVEAANKNIKRIVQK--MVTTYKDWHEMLPYALHGYR 651
Q + + I+ YRPQ +G VE N+ IK + K + T +DW +LP AL+ R
Sbjct: 72 QTVADLLGIDWKLHCAYRPQSSGQVERMNRTIKETLTKLTLATGSRDWVLLLPLALYRAR 131
Query: 652 TTVRSSTGATPFSLVYGMEAVLPLEVEIPSLRVIMEAKLSEAEWCQSRYDQLNLIEEKRM 711
T G TP+ ++YG P V P + +++ + Q+ L L++ +
Sbjct: 132 NT-PGPHGLTPYEILYGAP---PPLVNFPDPDM---TRVTNSPSLQAHLQALYLVQHEVW 184
Query: 712 DAMARGQSYQARMKTAFDKKVHPREFKVGELVLKRRVSQQPDPRGKWTPNYEGPYVV 768
+A +YQ ++ D+ V P ++VG+ V RR + P ++GPY V
Sbjct: 185 RPLA--AAYQEQL----DRPVVPHPYRVGDTVWVRRHQTK-----NLEPRWKGPYTV 230
>POL_SFV1 (P23074) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1161
Score = 89.4 bits (220), Expect = 4e-17
Identities = 89/365 (24%), Positives = 150/365 (40%), Gaps = 43/365 (11%)
Query: 395 LLSREYPSGASKQDKKTLRRLASRFLLDGDILYKRNYDMVLLRCVDEHEAEQLMHDVHDG 454
LL YP G KQ K TL + I+ + N ++ D + H++
Sbjct: 777 LLQGHYPPGYPKQYKYTLEE-------NKLIVERPNGIRIVPPKADREKIISTAHNI--A 827
Query: 455 TFGTHATGHTMSRKLLRAGYYWMAMEHDCYQHARKCHKCQIYADKIHVPPHALNVISSPW 514
G AT +S K Y+W + D + R+C +C + P L +
Sbjct: 828 HTGRDATFLKVSSK-----YWWPNLRKDVVKSIRQCKQCLVTNATNLTSPPILRPVKPLK 882
Query: 515 PFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYTNVTKQVVAKFIKNNIICRY 574
PF + ID IG + P SNG+ +LV +D T +V Y A N++
Sbjct: 883 PFDKFYIDYIGPLPP--SNGYLHVLVVVDSMTGFVWL--YPTKAPSTSATVKALNMLTSI 938
Query: 575 GVPSKIITDNGTNLNNNVVQALCEEFKIEHHNSSPYRPQMNGAVEAANKNIKRIVQK-MV 633
+P + +D G ++ +E I+ S+PY PQ +G VE N +IKR++ K ++
Sbjct: 939 AIPKVLHSDQGAAFTSSTFADWAKEKGIQLEFSTPYHPQSSGKVERKNSDIKRLLTKLLI 998
Query: 634 TTYKDWHEMLPYALHGYRTTVRSSTGATPFSLVYGMEAVLPLEVEIPSLRVIMEAKLSEA 693
W+++LP + S+ TP L++G+++ P +
Sbjct: 999 GRPAKWYDLLPVVQLALNNSYSPSSKYTPHQLLFGVDSNTPF--------------ANSD 1044
Query: 694 EWCQSRYDQLNLIEEKRMDAMARGQSYQARMKTAFDKKVHPREFKVGELVLKRRVSQQPD 753
SR ++L+L++E R +Q A + P VG+LV + RV++
Sbjct: 1045 TLDLSREEELSLLQE------IRSSLHQPTSPPASSRSWSP---SVGQLV-QERVARPAS 1094
Query: 754 PRGKW 758
R +W
Sbjct: 1095 LRPRW 1099
>POL_AVIRE (P03360) Pol polyprotein [Contains: Reverse
transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
(RT); Integrase (IN)] (Fragment)
Length = 473
Score = 88.6 bits (218), Expect = 6e-17
Identities = 70/254 (27%), Positives = 108/254 (41%), Gaps = 19/254 (7%)
Query: 515 PFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYTNVTKQVVAKFIKNNIICRY 574
P W +D I K G++++LV +D F+ WVEA T QVV K + +II R+
Sbjct: 188 PGEHWEVDFTEMITAKG--GYKYLLVLVDTFSGWVEAYPAKRETSQVVIKHLILDIIPRF 245
Query: 575 GVPSKIITDNGTNLNNNVVQALCEEFKIEHHNSSPYRPQMNGAVEAANKNIKRIVQKMVT 634
G+P +I +DNG V Q LCE + YRPQ +G VE N+ +K+ + K+
Sbjct: 246 GLPVQIGSDNGPAFVAKVTQQLCEALNVSWKLHCAYRPQSSGQVERMNRTLKKAIAKLED 305
Query: 635 TYKDWHEMLPYALHGYRTTVRSSTGATPFSLVYGMEAVLPLEVEIPSLRVIMEAKLSEAE 694
+ + P + T G +PF ++YG++ + V L I
Sbjct: 306 RDRRGLGLPPPSGFAPGTVYPGREGLSPFEILYGLKPPVVPRVGCDKLASITN------- 358
Query: 695 WCQSRYDQLNLIEEKRMDAMARGQSYQARMKTAFDKKVHPREFKVGELVLKRRVSQQPDP 754
Q+ L ++ R A A + +A P V +K+ QQ P
Sbjct: 359 --QTLLKSLQALQATRSLARAAARPTAPERSSARPYPTVPN--LVTSFFVKKHDFQQLGP 414
Query: 755 RGKWTPNYEGPYVV 768
R ++GPY V
Sbjct: 415 R------WDGPYTV 422
>YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II
Length = 1268
Score = 87.0 bits (214), Expect = 2e-16
Identities = 79/320 (24%), Positives = 136/320 (41%), Gaps = 44/320 (13%)
Query: 412 LRRLASRFLLDGDILYKRNYDMVLLRCVDEHEAEQLMHDVHDGTFGTHATGHTMSRKLLR 471
L+ + LLD ++ ++ ++L+ +H+ H G ++ R
Sbjct: 763 LKLIHGCLLLDDRVIVPKSLQKIVLK---------QLHEGHPGI--------VQMKQKAR 805
Query: 472 AGYYWMAMEHDCYQHARKCHKCQIYADKIHVPPHALNVISSPWPF--SMWG---IDMIGR 526
+ +W ++ D R C+ CQ + V P +PWP + W ID G
Sbjct: 806 SFVFWRGLDSDIENMVRHCNNCQENSKMPRVVP------LNPWPVPEAPWKRIHIDFAGP 859
Query: 527 IEPKASNGHRFILVAIDYFTKWVEAASYTNVTKQVVAKFIKNNIICRYGVPSKIITDNGT 586
+ NG ++LV +D TK+ E T V + I +G P II+DNGT
Sbjct: 860 L-----NGC-YLLVVVDAKTKYAEV-KLTRSISAVTTIDLLEEIFSIHGYPETIISDNGT 912
Query: 587 NLNNNVVQALCEEFKIEHHNSSPYRPQMNGAVEAANKNIKRIVQKMVTTYKDWHEMLPYA 646
L +++ +C+ IEH S+ Y P+ NGA E +KR + K+ ++L
Sbjct: 913 QLTSHLFAQMCQSHGIEHKTSAVYYPRSNGAAERFVDTLKRGIAKIKGEGSVNQQILNKF 972
Query: 647 LHGYRTTVRSS-TGATPFSLVYGMEAVLPLEVEIPSLRVIMEAKLSEAEWCQSRYDQLNL 705
L YR T S+ G+TP +G + + + +P+ RV+ KL++ Q N+
Sbjct: 973 LISYRNTPHSALNGSTPAECHFGRKIRTTMSLLMPTDRVLKVPKLTQY--------QQNM 1024
Query: 706 IEEKRMDAMARGQSYQARMK 725
+ AR +++Q K
Sbjct: 1025 KHHYELRNGARAKAFQVNQK 1044
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.322 0.137 0.425
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 98,601,458
Number of Sequences: 164201
Number of extensions: 4298236
Number of successful extensions: 9031
Number of sequences better than 10.0: 124
Number of HSP's better than 10.0 without gapping: 106
Number of HSP's successfully gapped in prelim test: 18
Number of HSP's that attempted gapping in prelim test: 8858
Number of HSP's gapped (non-prelim): 153
length of query: 801
length of database: 59,974,054
effective HSP length: 118
effective length of query: 683
effective length of database: 40,598,336
effective search space: 27728663488
effective search space used: 27728663488
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 70 (31.6 bits)
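Note on the statistics above: the report gives the effective lengths, the Lambda and K values, and raw scores in parentheses next to each bit score, but it does not state how they are related. The sketch below assumes the standard Karlin-Altschul relations: bit score = (Lambda * raw - ln K) / ln 2, and Expect = K * search space * exp(-Lambda * raw). It also assumes that the effective lengths are obtained by subtracting the effective HSP length once from the query and once per database sequence. The function names are illustrative and are not part of BLAST. Because Lambda and K are printed to only three significant digits, results should agree with the report only after rounding.

```python
import math

# Values copied from the report footer.
QUERY_LEN = 801
DB_LETTERS = 59_974_054
DB_SEQS = 164_201
HSP_LEN = 118                               # "effective HSP length"
GAPPED_LAMBDA, GAPPED_K = 0.267, 0.0410     # "Gapped" Lambda / K
UNGAPPED_LAMBDA, UNGAPPED_K = 0.322, 0.137  # first Lambda / K line


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Assumed length correction: trim hsp_len from the query and from every database sequence."""
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query * eff_db


def bit_score(raw, lam, k):
    """Convert a raw score to bits with the Karlin-Altschul normalisation."""
    return (lam * raw - math.log(k)) / math.log(2)


def expect(raw, lam, k, space):
    """Expected number of chance hits scoring at least `raw` in a search space of size `space`."""
    return k * space * math.exp(-lam * raw)


space = effective_search_space(QUERY_LEN, DB_LETTERS, DB_SEQS, HSP_LEN)
print("effective search space:", space)

# POL_MLVCB hit: the report lists a raw score of 240 for this alignment.
print("bits:", round(bit_score(240, GAPPED_LAMBDA, GAPPED_K), 1))
print("expect: %.0e" % expect(240, GAPPED_LAMBDA, GAPPED_K, space))

# S1 uses the ungapped parameters; the report lists raw 41.
print("S1 bits:", round(bit_score(41, UNGAPPED_LAMBDA, UNGAPPED_K), 1))
```

Under these assumptions the computed search space equals the reported 27728663488, and the POL_MLVCB values round to the reported 97.1 bits and 2e-19. The S1 figure matches only with the ungapped parameters, while S2 and the per-hit scores match with the gapped ones.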