
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC148359.7 + phase: 0
(789 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E Value
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran... 121 8e-27
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III 106 3e-22
POL_BAEVM (P10272) Pol polyprotein [Contains: Protease (EC 3.4.2... 102 3e-21
POL_MLVFF (P26809) Pol polyprotein [Contains: Protease (EC 3.4.2... 101 7e-21
POL_MLVRK (P31795) Pol polyprotein [Contains: Protease (EC 3.4.2... 101 9e-21
POL_MLVRD (P11227) Pol polyprotein [Contains: Protease (EC 3.4.2... 101 9e-21
POL_MLVFP (P26808) Pol polyprotein [Contains: Protease (EC 3.4.2... 100 1e-20
RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protei... 100 1e-20
RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protei... 100 1e-20
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei... 100 1e-20
POL_MLVAV (P03356) Pol polyprotein [Contains: Protease (EC 3.4.2... 100 1e-20
POL3_MOUSE (P11367) Retrovirus-related Pol polyprotein (Endonucl... 100 1e-20
POL_FENV1 (P31792) Pol polyprotein [Contains: Reverse transcript... 100 2e-20
POL_MLVMO (P03355) Pol polyprotein [Contains: Protease (EC 3.4.2... 100 3e-20
POL_MLVF5 (P26810) Pol polyprotein [Contains: Protease (EC 3.4.2... 100 3e-20
POL_MLVAK (P03357) Pol polyprotein [Contains: Reverse transcript... 98 7e-20
POL_MLVCB (P08361) Pol polyprotein [Contains: Reverse transcript... 96 5e-19
POL_SFV1 (P23074) Pol polyprotein [Contains: Protease (EC 3.4.23... 92 7e-18
YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II 89 5e-17
POL_AVIRE (P03360) Pol polyprotein [Contains: Reverse transcript... 89 5e-17
>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
transposon 412 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1237
Score = 121 bits (303), Expect = 8e-27
Identities = 100/381 (26%), Positives = 168/381 (43%), Gaps = 29/381 (7%)
Query: 417 RNYDMVLLRCV----DEHEAEQLMHDVHDGTFGTHATGHTMSRKLLRAGYYWMAMEHDCY 472
+N + LL V +E E E ++ +HD TG T + ++ YYW M
Sbjct: 874 KNLKVALLNPVTQINNEKEKEAILSTLHDDPIQGGHTGITKTLAKVKRHYYWKNMSKYIK 933
Query: 473 QYARKCHKCQIYADKIHVPPHALNVMSSPWPFSMWGIDMIGRIEPKASNGHRFILVAIDY 532
+Y RKC KCQ H + F +D IG + PK+ NG+ + + I
Sbjct: 934 EYVRKCQKCQKAKTTKHTKTPMTITETPEHAFDRVVVDTIGPL-PKSENGNEYAVTLICD 992
Query: 533 FTKWVEAASYTNVTKQVVAKFIKNNIICRYGVPSKIITDNGTNLNNNVVQALCEEFKIEH 592
TK++ A N + + VAK I + I +YG ITD GT N+++ LC+ KI++
Sbjct: 993 LTKYLVAIPIANKSAKTVAKAIFESFILKYGPMKTFITDMGTEYKNSIITDLCKYLKIKN 1052
Query: 593 HNSSPYRPQMNGAVEAANKNIKRIVQKMVTTYK-NWHEMLPYALHGYRTTVRSSTGATPF 651
S+ + Q G VE +++ + ++ ++T K +W L Y ++ + TT P+
Sbjct: 1053 ITSTAHHHQTVGVVERSHRTLNEYIRSYISTDKTDWDVWLQYFVYCFNTTQSMVHNYCPY 1112
Query: 652 SLVYGMEAVLPLEVEIPSLRVIMEAKLSEAEWCQSRYDQLNLIEEKRMDAMARG----HS 707
LV+G + LP KL E + D + + A AR +
Sbjct: 1113 ELVFGRTSNLPKHFN----------KLHSIEPIYNIDDYAKESKYRLEVAYARARKLLEA 1162
Query: 708 YQARMKTAFDKKVNPREFKVGELVLKRRISQQPDPRGKWTPNYEGPYVVKK-AFSGGALI 766
++ + K +D KV E +VG+ VL R + K Y GPY ++ + +
Sbjct: 1163 HKEKNKENYDLKVKDIELEVGDKVLLRN-----EVGHKLDFKYTGPYKIESIGDNNNITL 1217
Query: 767 LTHMDGVELPNPVNADIVKKY 787
LT+ + ++ V+ D +KK+
Sbjct: 1218 LTNKNKKQI---VHKDRLKKF 1235
Score = 66.2 bits (160), Expect = 3e-10
Identities = 43/157 (27%), Positives = 73/157 (46%), Gaps = 3/157 (1%)
Query: 9 KNQPIVWNDECQEAFDSIKNYLLEPPILVPPVEGRPLILYLSVFDESVGCVLGQRDETGK 68
KN P W DECQ+AF +K+ L+ P +L P + + ++ G VL Q
Sbjct: 580 KNVPFEWTDECQKAFIHLKSQLINPTLLQYPDFSKEFCITTDASKQACGAVLTQNH--NG 637
Query: 69 KEHAIYYLSKKFTDCETRYTMLEKTCCALAWAAKRLRHYLVNHTTWLISRMDPIKYIFEK 128
+ + Y S+ FT E+ + E+ A+ WA R Y+ + + P+ Y+F
Sbjct: 638 HQLPVAYASRAFTKGESNKSTTEQELAAIHWAIIHFRPYIYGKHFTVKTDHRPLTYLFSM 697
Query: 129 AVVTGKIARGQMLLSEYDIVFKTQKAIKGSILADHLA 165
+ K+ R ++ L EY+ + K K + +AD L+
Sbjct: 698 VNPSSKLTRIRLELEEYNFTVEYLKG-KDNHVADALS 733
>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
Length = 2186
Score = 106 bits (264), Expect = 3e-22
Identities = 91/341 (26%), Positives = 146/341 (42%), Gaps = 26/341 (7%)
Query: 427 VDEHEAEQLMHDVHDGTFGTHATGHTMSRKLLRAGYYWMAMEHDCYQYARKCHKCQIYAD 486
V E L+ ++H+G H M R + R +YW M R C KC D
Sbjct: 1460 VPEKIRTPLLKELHEGMLAGHFGIKKMWRMVHRK-FYWPQMRVCVENCVRTCAKCLCAND 1518
Query: 487 KIHVPPHALNVMSSPWPFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYTNVT 546
+ +L +P + D++ + G+R+IL ID FTK+ A +
Sbjct: 1519 HSKLTS-SLTPYRMTFPLEIVACDLMD--VGLSVQGNRYILTIIDLFTKYGTAVPIPDKK 1575
Query: 547 KQVVAK-FIKNNIICRYGVPSKIITDNGTNLNNNVVQALCEEFKIEHHNSSPYRPQMNGA 605
+ V K F++ I +P K++TD G N + KIEH + Y + NGA
Sbjct: 1576 AETVLKAFVERWAIGEGRIPLKLLTDQGKEFVNGLFAQFTHMLKIEHITTKGYNSRANGA 1635
Query: 606 VEAANKNIKRIVQKMVTTYKNWHEMLPYALHGYRTTVRSSTGATPFSLVYGMEAVLPLEV 665
VE NK I I++K W + + YA++ Y V +TG TP L++G + + PLE+
Sbjct: 1636 VERFNKTIMHIMKKKTAVPMEWDDQVVYAVYAYNNCVHENTGETPMFLMHGRDVMGPLEM 1695
Query: 666 EIPSLRVIMEAKLSEAEWCQSRYDQLNLIEEKRMDAMARGHSY--QARMKTAFDKKVNPR 723
I A + E Y L E ++ +A+ H+ Q K+ FD+K +
Sbjct: 1696 SGEDAVGINYADMDE-------YKHLLTQELLKVQKIAKEHAMREQESYKSLFDQKYASK 1748
Query: 724 EFK--------VGELVLKRRISQQPDPRGKWTPNYEGPYVV 756
+ + + E+ ++ +Q P KW+ GPY V
Sbjct: 1749 KHRFPQPGSRVLLEIPSEKLGAQCPKLVNKWS----GPYRV 1785
Score = 57.0 bits (136), Expect = 2e-07
Identities = 38/158 (24%), Positives = 75/158 (47%), Gaps = 8/158 (5%)
Query: 14 VWNDECQEAFDSIKNYLLEPPILVPP-VEG-----RPLILYLSVFDESVGCVLGQRDETG 67
+W E + AF +K + + P+L P VE RP ++Y + +G VL Q G
Sbjct: 1207 IWEKEQEIAFQELKKLVCQTPVLAQPDVEAALKGDRPFMIYTDASRKGIGAVLAQEGPDG 1266
Query: 68 KKEHAIYYLSKKFTDCETRYTMLEKTCCALAWAAKRLRHYLVNHTTWLISRMDPIKYIFE 127
+ +H I + SK + ETRY + + A+ +A +R + + + + P+ + +
Sbjct: 1267 Q-QHPIAFASKALSPAETRYHITDLEALAMMFALRRFKTIIYGTAITVFTDHKPLISLLK 1325
Query: 128 KAVVTGKIARGQMLLSEYDIVFKTQKAIKGSILADHLA 165
+ + ++ R + + E+D+ A K + +AD L+
Sbjct: 1326 GSPLADRLWRWSIEILEFDVKI-VYLAGKANAVADALS 1362
>POL_BAEVM (P10272) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1189
Score = 102 bits (255), Expect = 3e-21
Identities = 177/772 (22%), Positives = 294/772 (37%), Gaps = 107/772 (13%)
Query: 9 KNQPIVWNDECQEAFDSIKNYLLEPPILVPPVEGRPLILYLSVFDESVGCVLGQ-RDETG 67
++ P W E Q AF+++K LL P L P +P L+L DE G G + G
Sbjct: 448 ESTPFTWQTEHQLAFEALKKALLSAPALGLPDTSKPFTLFL---DERQGIAKGVLTQKLG 504
Query: 68 KKEHAIYYLSKKFTDCETRYTMLEKTCCALAWAAKRLRHYLVN--------HTTWLISRM 119
+ + YLSKK + + A A K + HT I R
Sbjct: 505 PWKRPVAYLSKKLDPVAAGWPPCLRIMAATAMLVKDSAKLTLGQPLTVITPHTLEAIVRQ 564
Query: 120 DPIKYIFEKAVVTGKIARGQMLLSEYDIVFKTQKAIKGSILADHLAYQPLDDYQPIEFDF 179
P ++I ++ Q LL + D V + + P+ + QP D
Sbjct: 565 PPDRWI-----TNARLTHYQALLLDTDRV-----QFGPPVTLNPATLLPVPENQPSPHDC 614
Query: 180 PDEEIMYLKSKDCEEPLINEGPDPNSKWGLVFDGAVNAYGKGIGAVIVSPQGHHIPFTAR 239
+++ E+ E PD + W +++ + GA +V GH+ +
Sbjct: 615 --RQVLAETHGTREDLKDQELPDADHTWYTDGSSYLDSGTRRAGAAVVD--GHNTIWAQS 670
Query: 240 ILFECTNNMAEYEACIFGIEEAIDM-RIKHLDIYGDSALVINQIKGEWETHHANLIPYRD 298
+ + AE + + +A+++ + K +IY DS + T H + Y
Sbjct: 671 LPPGTSAQKAE----LIALTKALELSKGKKANIYTDSRYA-------FATAHTHGSIYE- 718
Query: 299 YARRLLTYFTKVELHHIPRDENQMADALATLSSMFRVNHWNDVPIIKVQRLERPSHVFAI 358
R LLT K + A+ +A L ++F +V II ++ A+
Sbjct: 719 -RRGLLTSEGK--------EIKNKAEIIALLKALFLPQ---EVAIIHCPGHQKGQDPVAV 766
Query: 359 GD-----VIDQAG-ENVVGYKPWYYDIKQFLLSREYPPGASKQDKKTLRRLAGRFLLDGD 412
G+ V QA V+ + + Y + +D++ R + D
Sbjct: 767 GNRQADRVARQAAMAEVLTLATEPDNTSHITIEHTY----TSEDQEEARAIGATENKD-- 820
Query: 413 ILYKRNYDMVLLRCVDEHEAEQLMHDVHDGTFGTHATGHTMSRKLLRAGYYWMAMEHDCY 472
RN++ + + EA ++ +H T H + + + +
Sbjct: 821 ---TRNWEKEGKIVLPQKEALAMIQQMHAWT---HLGNRKLKLLIEKTDFLIPRASTLIE 874
Query: 473 QYARKCHKCQ-IYADKIHVPPHALNVMSSPWPFSMWGIDMIGRIEPKASNGHRFILVAID 531
Q C CQ + A VP + P + W ID ++P + G++++LV +D
Sbjct: 875 QVTSACKVCQQVNAGATRVPAGKRTRGNRPGVY--WEIDFT-EVKPHYA-GYKYLLVFVD 930
Query: 532 YFTKWVEAASYTNVTKQVVAKFIKNNIICRYGVPSKIITDNGTNLNNNVVQALCEEFKIE 591
F+ WVEA T +VAK I I R+G+P I +DNG + V Q L I
Sbjct: 931 TFSGWVEAFPTRQETAHIVAKKILEEIFPRFGLPKVIGSDNGPAFVSQVSQGLARILGIN 990
Query: 592 HHNSSPYRPQMNGAVEAANKNIKRIVQKMV--TTYKNWHEMLPYALHGYRTTVRSSTGAT 649
YRPQ +G VE N+ IK + K+ T K+W +L AL R T + G T
Sbjct: 991 WKLHCAYRPQSSGQVERMNRTIKETLTKLTLETGLKDWRRLLSLALLRARNT-PNRFGLT 1049
Query: 650 PFSLVYGMEAVLPLEVEIPSL-----RVIMEAKLSEAEWCQSRYDQLNLIEEKRMDAMAR 704
P+ ++YG PL + S + ++A+L + Q++ I +
Sbjct: 1050 PYEILYG--GPPPLSTLLNSFSPSNSKTDLQARLKGLQAVQAQ------IWAPLAELYRP 1101
Query: 705 GHSYQARMKTAFDKKVNPREFKVGELVLKRRISQQPDPRGKWTPNYEGPYVV 756
GHS + F+VG+ V RR Q P ++GPY+V
Sbjct: 1102 GHSQTS------------HPFQVGDSVYVRRHRSQ-----GLEPRWKGPYIV 1136
>POL_MLVFF (P26809) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1204
Score = 101 bits (252), Expect = 7e-21
Identities = 85/298 (28%), Positives = 133/298 (44%), Gaps = 25/298 (8%)
Query: 462 YYWMAMEHDCYQYARKCHKC-QIYADKIHVPPHALNVMSSPWPFSMWGIDMIGRIEPKAS 520
YY + + C C Q+ A K V + P + W ID ++P
Sbjct: 874 YYMLNRDRTLKDITETCQACAQVNASKSAVKQGTR--VRGHRPGTHWEIDFT-EVKP-GL 929
Query: 521 NGHRFILVAIDYFTKWVEAASYTNVTKQVVAKFIKNNIICRYGVPSKIITDNGTNLNNNV 580
G++++LV ID F+ WVEA T +VV K + I R+G+P + TDNG + V
Sbjct: 930 YGYKYLLVFIDTFSGWVEAFPTKKETAKVVTKKLLEEIFPRFGMPQVLGTDNGPAFVSKV 989
Query: 581 VQALCEEFKIEHHNSSPYRPQMNGAVEAANKNIKRIVQK--MVTTYKNWHEMLPYALHGY 638
Q + + ++ YRPQ +G VE N+ IK + K + T ++W +LP AL+
Sbjct: 990 SQTVADLLGVDWKLHCAYRPQSSGQVERMNRTIKETLTKLTLATGSRDWVLLLPLALYRA 1049
Query: 639 RTTVRSSTGATPFSLVYGMEAVLPLEVEIPSLRVIMEAKLSEAEWCQSRYDQLNLIEEKR 698
R T G TP+ ++YG P V P + AK++ Q+ L L++ +
Sbjct: 1050 RNT-PGPHGLTPYEILYGAP---PPLVNFPDPDM---AKVTHNPSLQAHLQALYLVQHEV 1102
Query: 699 MDAMARGHSYQARMKTAFDKKVNPREFKVGELVLKRRISQQPDPRGKWTPNYEGPYVV 756
+A + Q D+ V P F+VG+ V RR + P ++GPY V
Sbjct: 1103 WRPLAAAYQEQ------LDRPVVPHPFRVGDTVWVRRHQTK-----NLEPRWKGPYTV 1149
>POL_MLVRK (P31795) Pol polyprotein [Contains: Protease (EC
3.4.23.-); Reverse transcriptase/ribonuclease H (EC
2.7.7.49) (EC 3.1.26.4) (RT); Integrase (IN)] (Fragment)
Length = 581
Score = 101 bits (251), Expect = 9e-21
Identities = 87/330 (26%), Positives = 147/330 (44%), Gaps = 28/330 (8%)
Query: 433 EQLMHDVHDGTFGTHATGHTMSRKLLRAG---YYWMAMEHDCYQYARKCHKC-QIYADKI 488
+Q + ++ D G+ + LL G YY + + A C C Q+ A K
Sbjct: 222 DQFVFELLDSLHRLTHLGYQKMKALLDRGESPYYMLNRDKTLQYVADSCTVCAQVNASKA 281
Query: 489 HVPPHALNVMSSPWPFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYTNVTKQ 548
+ + P + W ID ++P G++++LV +D F+ WVEA + T +
Sbjct: 282 KIGAGVR--VRGHRPGTHWEIDFT-EVKP-GLYGYKYLLVFVDTFSGWVEAFPTKHETAK 337
Query: 549 VVAKFIKNNIICRYGVPSKIITDNGTNLNNNVVQALCEEFKIEHHNSSPYRPQMNGAVEA 608
+V K + I R+G+P + TDNG + V Q++ + I+ YRPQ +G VE
Sbjct: 338 IVTKKLLEEIFPRFGMPQVLGTDNGPAFVSQVSQSVAKLLGIDWKLHCAYRPQSSGQVER 397
Query: 609 ANKNIKRIVQK--MVTTYKNWHEMLPYALHGYRTTVRSSTGATPFSLVYGMEAVLPLEVE 666
N+ IK + K + T ++W +LP AL+ R T G TP+ ++YG L +
Sbjct: 398 MNRTIKETLTKLTLATGTRDWVLLLPLALYRARNT-PGPHGLTPYEILYGAPPPL-VNFH 455
Query: 667 IPSLRVIMEAKLSEAEWCQSRYDQLNLIEEKRMDAMARGHSYQARMKTAFDKKVNPREFK 726
P + +K + + Q+ L ++ + +A + Q D+ V P F+
Sbjct: 456 DPEM-----SKFTNSPSLQAHLQALQAVQREVWKPLAAAYQDQ------LDQPVIPHPFR 504
Query: 727 VGELVLKRRISQQPDPRGKWTPNYEGPYVV 756
VG+ V RR + P ++GPY V
Sbjct: 505 VGDTVWVRRHQTK-----NLEPRWKGPYTV 529
>POL_MLVRD (P11227) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1196
Score = 101 bits (251), Expect = 9e-21
Identities = 87/330 (26%), Positives = 147/330 (44%), Gaps = 28/330 (8%)
Query: 433 EQLMHDVHDGTFGTHATGHTMSRKLLRAG---YYWMAMEHDCYQYARKCHKC-QIYADKI 488
+Q + ++ D G+ + LL G YY + + A C C Q+ A K
Sbjct: 837 DQFVFELLDSLHRLTHLGYQKMKALLDRGESPYYMLNRDKTLQYVADSCTVCAQVNASKA 896
Query: 489 HVPPHALNVMSSPWPFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYTNVTKQ 548
+ + P + W ID ++P G++++LV +D F+ WVEA + T +
Sbjct: 897 KIGAGVR--VRGHRPGTHWEIDFT-EVKP-GLYGYKYLLVFVDTFSGWVEAFPTKHETAK 952
Query: 549 VVAKFIKNNIICRYGVPSKIITDNGTNLNNNVVQALCEEFKIEHHNSSPYRPQMNGAVEA 608
+V K + I R+G+P + TDNG + V Q++ + I+ YRPQ +G VE
Sbjct: 953 IVTKKLLEEIFPRFGMPQVLGTDNGPAFVSQVSQSVAKLLGIDWKLHCAYRPQSSGQVER 1012
Query: 609 ANKNIKRIVQK--MVTTYKNWHEMLPYALHGYRTTVRSSTGATPFSLVYGMEAVLPLEVE 666
N+ IK + K + T ++W +LP AL+ R T G TP+ ++YG L +
Sbjct: 1013 MNRTIKETLTKLTLATGTRDWVLLLPLALYRARNT-PGPHGLTPYEILYGAPPPL-VNFH 1070
Query: 667 IPSLRVIMEAKLSEAEWCQSRYDQLNLIEEKRMDAMARGHSYQARMKTAFDKKVNPREFK 726
P + +K + + Q+ L ++ + +A + Q D+ V P F+
Sbjct: 1071 DPEM-----SKFTNSPSLQAHLQALQAVQREVWKPLAAAYQDQ------LDQPVIPHPFR 1119
Query: 727 VGELVLKRRISQQPDPRGKWTPNYEGPYVV 756
VG+ V RR + P ++GPY V
Sbjct: 1120 VGDTVWVRRHQTK-----NLEPRWKGPYTV 1144
>POL_MLVFP (P26808) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1204
Score = 100 bits (250), Expect = 1e-20
Identities = 84/298 (28%), Positives = 133/298 (44%), Gaps = 25/298 (8%)
Query: 462 YYWMAMEHDCYQYARKCHKC-QIYADKIHVPPHALNVMSSPWPFSMWGIDMIGRIEPKAS 520
YY + + C C Q+ A K V + P + W ID ++P
Sbjct: 874 YYMLNRDRTLKDITETCKACAQVNASKSAVKQGTR--VRGHRPGTHWEIDFT-EVKP-GL 929
Query: 521 NGHRFILVAIDYFTKWVEAASYTNVTKQVVAKFIKNNIICRYGVPSKIITDNGTNLNNNV 580
G++++LV +D F+ WVEA T +VV K + I R+G+P + TDNG + V
Sbjct: 930 YGYKYLLVFVDTFSGWVEAFPTKKETAKVVTKKLLEEIFPRFGMPQVLGTDNGPAFVSKV 989
Query: 581 VQALCEEFKIEHHNSSPYRPQMNGAVEAANKNIKRIVQK--MVTTYKNWHEMLPYALHGY 638
Q + + ++ YRPQ +G VE N+ IK + K + T ++W +LP AL+
Sbjct: 990 SQTVADLLGVDWKLHCAYRPQSSGQVERMNRTIKETLTKLTLATGSRDWVLLLPLALYRA 1049
Query: 639 RTTVRSSTGATPFSLVYGMEAVLPLEVEIPSLRVIMEAKLSEAEWCQSRYDQLNLIEEKR 698
R T G TP+ ++YG P V P + AK++ Q+ L L++ +
Sbjct: 1050 RNT-PGPHGLTPYEILYGAP---PPLVNFPDPDM---AKVTHNPSLQAHLQALYLVQHEV 1102
Query: 699 MDAMARGHSYQARMKTAFDKKVNPREFKVGELVLKRRISQQPDPRGKWTPNYEGPYVV 756
+A + Q D+ V P F+VG+ V RR + P ++GPY V
Sbjct: 1103 WRPLAAAYQEQ------LDRPVVPHPFRVGDTVWVRRHQTK-----NLEPRWKGPYTV 1149
>RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protein
type 3
Length = 1333
Score = 100 bits (249), Expect = 1e-20
Identities = 116/508 (22%), Positives = 209/508 (40%), Gaps = 59/508 (11%)
Query: 262 IDMRIKHLDIYGDSALVINQIKGEWETHHANLIPYRDYARRLLTYFTKVELHHIPRDENQ 321
++ I+ I D +I +I E E + L ++ + + E+++ P N
Sbjct: 770 LESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDF-----NFEINYRPGSANH 824
Query: 322 MADALATLSSMFRVNHWNDVPIIKVQRLERPSHVFAIGDVIDQAGENVVGYKPWYYDIKQ 381
+ADAL+ + PI K + V I D + V Y D K
Sbjct: 825 IADALSRIVD-------ETEPIPKDSEDNSINFVNQISITDDFKNQVVTEYTN---DTKL 874
Query: 382 F-LLSREYPPGASKQDKKTLRRLAGRFLLDGDILYKRNYDMVLLRCVDE--HEAEQLMHD 438
LL+ E K+ ++ ++ G + D + N D L R + + HE +L+H
Sbjct: 875 LNLLNNE-----DKRVEENIQLKDGLLINSKDQILLPN-DTQLTRTIIKKYHEEGKLIHP 928
Query: 439 VHDGTFGTHATGHTMSRKLLRAGYYWMAMEHDCYQYARKCHKCQIYADKIHVPPHALN-V 497
G + ++ + W + +Y + CH CQI + H P L +
Sbjct: 929 -----------GIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPI 977
Query: 498 MSSPWPFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYT-NVTKQVVAKFIKN 556
S P+ +D I + S+G+ + V +D F+K T ++T + A+
Sbjct: 978 PPSERPWESLSMDFITALPE--SSGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQ 1035
Query: 557 NIICRYGVPSKIITDNGTNLNNNVVQALCEEFKIEHHNSSPYRPQMNGAVEAANKNIKRI 616
+I +G P +II DN + + ++ S PYRPQ +G E N+ ++++
Sbjct: 1036 RVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKL 1095
Query: 617 VQKMVTTYKN-WHEMLPYALHGYRTTVRSSTGATPFSLVYGMEAVLPLEVEIPSLRVIME 675
++ + +T+ N W + + Y + S+T TPF +V+ L +E+PS +
Sbjct: 1096 LRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALS-PLELPSFSDKTD 1154
Query: 676 AKLSEA-EWCQSRYDQLNLIEEKRMDAMARGHSYQARMKTAFDKKVNP-REFKVGELVL- 732
E + Q+ + LN + +MK FD K+ EF+ G+LV+
Sbjct: 1155 ENSQETIQVFQTVKEHLN--------------TNNIKMKKYFDMKIQEIEEFQPGDLVMV 1200
Query: 733 KRRISQQPDPRGKWTPNYEGP-YVVKKA 759
KR + K P++ GP YV++K+
Sbjct: 1201 KRTKTGFLHKSNKLAPSFAGPFYVLQKS 1228
Score = 57.0 bits (136), Expect = 2e-07
Identities = 48/204 (23%), Positives = 91/204 (44%), Gaps = 21/204 (10%)
Query: 7 LSKNQPIVWNDECQEAFDSIKNYLLEPPILVPPVEGRPLILYLSVFDESVGCVLGQRDET 66
L K+ W +A ++IK L+ PP+L + ++L D +VG VL Q+ +
Sbjct: 670 LKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHD- 728
Query: 67 GKKEHAIYYLSKKFTDCETRYTMLEKTCCALAWAAKRLRHYLVNHTTWLISRMDPIKYIF 126
K + + Y S K + + Y++ +K A+ + K RHYL S ++P K +
Sbjct: 729 DDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLE-------STIEPFKILT 781
Query: 127 EKAVVTGKI-----------ARGQMLLSEYDIVFKTQKAIKGSILADHLAYQPLDDYQPI 175
+ + G+I AR Q+ L +++ + + +AD L+ + +D+ +PI
Sbjct: 782 DHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPG-SANHIADALS-RIVDETEPI 839
Query: 176 EFDFPDEEIMYLKSKDCEEPLINE 199
D D I ++ + N+
Sbjct: 840 PKDSEDNSINFVNQISITDDFKNQ 863
>RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protein
type 2
Length = 1333
Score = 100 bits (249), Expect = 1e-20
Identities = 116/508 (22%), Positives = 209/508 (40%), Gaps = 59/508 (11%)
Query: 262 IDMRIKHLDIYGDSALVINQIKGEWETHHANLIPYRDYARRLLTYFTKVELHHIPRDENQ 321
++ I+ I D +I +I E E + L ++ + + E+++ P N
Sbjct: 770 LESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDF-----NFEINYRPGSANH 824
Query: 322 MADALATLSSMFRVNHWNDVPIIKVQRLERPSHVFAIGDVIDQAGENVVGYKPWYYDIKQ 381
+ADAL+ + PI K + V I D + V Y D K
Sbjct: 825 IADALSRIVD-------ETEPIPKDSEDNSINFVNQISITDDFKNQVVTEYTN---DTKL 874
Query: 382 F-LLSREYPPGASKQDKKTLRRLAGRFLLDGDILYKRNYDMVLLRCVDE--HEAEQLMHD 438
LL+ E K+ ++ ++ G + D + N D L R + + HE +L+H
Sbjct: 875 LNLLNNE-----DKRVEENIQLKDGLLINSKDQILLPN-DTQLTRTIIKKYHEEGKLIHP 928
Query: 439 VHDGTFGTHATGHTMSRKLLRAGYYWMAMEHDCYQYARKCHKCQIYADKIHVPPHALN-V 497
G + ++ + W + +Y + CH CQI + H P L +
Sbjct: 929 -----------GIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPI 977
Query: 498 MSSPWPFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYT-NVTKQVVAKFIKN 556
S P+ +D I + S+G+ + V +D F+K T ++T + A+
Sbjct: 978 PPSERPWESLSMDFITALPE--SSGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQ 1035
Query: 557 NIICRYGVPSKIITDNGTNLNNNVVQALCEEFKIEHHNSSPYRPQMNGAVEAANKNIKRI 616
+I +G P +II DN + + ++ S PYRPQ +G E N+ ++++
Sbjct: 1036 RVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKL 1095
Query: 617 VQKMVTTYKN-WHEMLPYALHGYRTTVRSSTGATPFSLVYGMEAVLPLEVEIPSLRVIME 675
++ + +T+ N W + + Y + S+T TPF +V+ L +E+PS +
Sbjct: 1096 LRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALS-PLELPSFSDKTD 1154
Query: 676 AKLSEA-EWCQSRYDQLNLIEEKRMDAMARGHSYQARMKTAFDKKVNP-REFKVGELVL- 732
E + Q+ + LN + +MK FD K+ EF+ G+LV+
Sbjct: 1155 ENSQETIQVFQTVKEHLN--------------TNNIKMKKYFDMKIQEIEEFQPGDLVMV 1200
Query: 733 KRRISQQPDPRGKWTPNYEGP-YVVKKA 759
KR + K P++ GP YV++K+
Sbjct: 1201 KRTKTGFLHKSNKLAPSFAGPFYVLQKS 1228
Score = 57.0 bits (136), Expect = 2e-07
Identities = 48/204 (23%), Positives = 91/204 (44%), Gaps = 21/204 (10%)
Query: 7 LSKNQPIVWNDECQEAFDSIKNYLLEPPILVPPVEGRPLILYLSVFDESVGCVLGQRDET 66
L K+ W +A ++IK L+ PP+L + ++L D +VG VL Q+ +
Sbjct: 670 LKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHD- 728
Query: 67 GKKEHAIYYLSKKFTDCETRYTMLEKTCCALAWAAKRLRHYLVNHTTWLISRMDPIKYIF 126
K + + Y S K + + Y++ +K A+ + K RHYL S ++P K +
Sbjct: 729 DDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLE-------STIEPFKILT 781
Query: 127 EKAVVTGKI-----------ARGQMLLSEYDIVFKTQKAIKGSILADHLAYQPLDDYQPI 175
+ + G+I AR Q+ L +++ + + +AD L+ + +D+ +PI
Sbjct: 782 DHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPG-SANHIADALS-RIVDETEPI 839
Query: 176 EFDFPDEEIMYLKSKDCEEPLINE 199
D D I ++ + N+
Sbjct: 840 PKDSEDNSINFVNQISITDDFKNQ 863
>RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protein
type 1
Length = 1333
Score = 100 bits (249), Expect = 1e-20
Identities = 116/508 (22%), Positives = 209/508 (40%), Gaps = 59/508 (11%)
Query: 262 IDMRIKHLDIYGDSALVINQIKGEWETHHANLIPYRDYARRLLTYFTKVELHHIPRDENQ 321
++ I+ I D +I +I E E + L ++ + + E+++ P N
Sbjct: 770 LESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDF-----NFEINYRPGSANH 824
Query: 322 MADALATLSSMFRVNHWNDVPIIKVQRLERPSHVFAIGDVIDQAGENVVGYKPWYYDIKQ 381
+ADAL+ + PI K + V I D + V Y D K
Sbjct: 825 IADALSRIVD-------ETEPIPKDSEDNSINFVNQISITDDFKNQVVTEYTN---DTKL 874
Query: 382 F-LLSREYPPGASKQDKKTLRRLAGRFLLDGDILYKRNYDMVLLRCVDE--HEAEQLMHD 438
LL+ E K+ ++ ++ G + D + N D L R + + HE +L+H
Sbjct: 875 LNLLNNE-----DKRVEENIQLKDGLLINSKDQILLPN-DTQLTRTIIKKYHEEGKLIHP 928
Query: 439 VHDGTFGTHATGHTMSRKLLRAGYYWMAMEHDCYQYARKCHKCQIYADKIHVPPHALN-V 497
G + ++ + W + +Y + CH CQI + H P L +
Sbjct: 929 -----------GIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPI 977
Query: 498 MSSPWPFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYT-NVTKQVVAKFIKN 556
S P+ +D I + S+G+ + V +D F+K T ++T + A+
Sbjct: 978 PPSERPWESLSMDFITALPE--SSGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQ 1035
Query: 557 NIICRYGVPSKIITDNGTNLNNNVVQALCEEFKIEHHNSSPYRPQMNGAVEAANKNIKRI 616
+I +G P +II DN + + ++ S PYRPQ +G E N+ ++++
Sbjct: 1036 RVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKL 1095
Query: 617 VQKMVTTYKN-WHEMLPYALHGYRTTVRSSTGATPFSLVYGMEAVLPLEVEIPSLRVIME 675
++ + +T+ N W + + Y + S+T TPF +V+ L +E+PS +
Sbjct: 1096 LRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALS-PLELPSFSDKTD 1154
Query: 676 AKLSEA-EWCQSRYDQLNLIEEKRMDAMARGHSYQARMKTAFDKKVNP-REFKVGELVL- 732
E + Q+ + LN + +MK FD K+ EF+ G+LV+
Sbjct: 1155 ENSQETIQVFQTVKEHLN--------------TNNIKMKKYFDMKIQEIEEFQPGDLVMV 1200
Query: 733 KRRISQQPDPRGKWTPNYEGP-YVVKKA 759
KR + K P++ GP YV++K+
Sbjct: 1201 KRTKTGFLHKSNKLAPSFAGPFYVLQKS 1228
Score = 57.0 bits (136), Expect = 2e-07
Identities = 48/204 (23%), Positives = 91/204 (44%), Gaps = 21/204 (10%)
Query: 7 LSKNQPIVWNDECQEAFDSIKNYLLEPPILVPPVEGRPLILYLSVFDESVGCVLGQRDET 66
L K+ W +A ++IK L+ PP+L + ++L D +VG VL Q+ +
Sbjct: 670 LKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHD- 728
Query: 67 GKKEHAIYYLSKKFTDCETRYTMLEKTCCALAWAAKRLRHYLVNHTTWLISRMDPIKYIF 126
K + + Y S K + + Y++ +K A+ + K RHYL S ++P K +
Sbjct: 729 DDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLE-------STIEPFKILT 781
Query: 127 EKAVVTGKI-----------ARGQMLLSEYDIVFKTQKAIKGSILADHLAYQPLDDYQPI 175
+ + G+I AR Q+ L +++ + + +AD L+ + +D+ +PI
Sbjct: 782 DHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPG-SANHIADALS-RIVDETEPI 839
Query: 176 EFDFPDEEIMYLKSKDCEEPLINE 199
D D I ++ + N+
Sbjct: 840 PKDSEDNSINFVNQISITDDFKNQ 863
>POL_MLVAV (P03356) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1196
Score = 100 bits (249), Expect = 1e-20
Identities = 86/330 (26%), Positives = 147/330 (44%), Gaps = 28/330 (8%)
Query: 433 EQLMHDVHDGTFGTHATGHTMSRKLLRAG---YYWMAMEHDCYQYARKCHKC-QIYADKI 488
+Q + ++ D G+ + LL G YY + + A C C Q+ A K
Sbjct: 837 DQFVFELLDSLHRLTHLGYQKMKALLDRGESPYYMLNRDKTLQYVADSCTVCAQVNASKA 896
Query: 489 HVPPHALNVMSSPWPFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYTNVTKQ 548
+ + P S W ID ++P G++++LV +D F+ WVEA T +
Sbjct: 897 KIGAGVR--VRGHRPGSHWEIDFT-EVKP-GLYGYKYLLVFVDTFSGWVEAFPTKRETAR 952
Query: 549 VVAKFIKNNIICRYGVPSKIITDNGTNLNNNVVQALCEEFKIEHHNSSPYRPQMNGAVEA 608
VV+K + I R+G+P + +DNG + V Q++ + I+ YRPQ +G VE
Sbjct: 953 VVSKKLLEEIFPRFGMPQVLGSDNGPAFTSQVSQSVADLLGIDWKLHCAYRPQSSGQVER 1012
Query: 609 ANKNIKRIVQKMVTT--YKNWHEMLPYALHGYRTTVRSSTGATPFSLVYGMEAVLPLEVE 666
N+ IK + K+ ++W +LP AL+ R T G TP+ ++YG L +
Sbjct: 1013 MNRTIKETLTKLTLAAGTRDWVLLLPLALYRARNT-PGPHGLTPYEILYGAPPPL-VNFH 1070
Query: 667 IPSLRVIMEAKLSEAEWCQSRYDQLNLIEEKRMDAMARGHSYQARMKTAFDKKVNPREFK 726
P + ++L+ + Q+ L ++ + +A + Q D+ V P F+
Sbjct: 1071 DPDM-----SELTNSPSLQAHLQALQTVQREIWKPLAEAYRDQ------LDQPVIPHPFR 1119
Query: 727 VGELVLKRRISQQPDPRGKWTPNYEGPYVV 756
+G+ V RR + P ++GPY V
Sbjct: 1120 IGDSVWVRRHQTK-----NLEPRWKGPYTV 1144
>POL3_MOUSE (P11367) Retrovirus-related Pol polyprotein
(Endonuclease) (Fragment)
Length = 390
Score = 100 bits (249), Expect = 1e-20
Identities = 86/316 (27%), Positives = 144/316 (45%), Gaps = 27/316 (8%)
Query: 446 THATGHTMSRKLLR--AGYYWMAMEHDCYQYARKCHKC-QIYADKIHVPPHALNVMSSPW 502
TH + M L R + YY + + ++ A C C Q+ A K + A +
Sbjct: 60 THLSYQKMRALLDRKESPYYMLNKDKILHEVAESCQACVQVNASKTKI--RAGTRVRGHR 117
Query: 503 PFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYTNVTKQVVAKFIKNNIICRY 562
+ W ID ++P G++++LV +D F+ WVEA + T ++V K + I R+
Sbjct: 118 LGTHWEIDFT-EVKP-GLYGYKYLLVFVDTFSGWVEAFPTKHETAKIVTKKLLEEIFPRF 175
Query: 563 GVPSKIITDNGTNLNNNVVQALCEEFKIEHHNSSPYRPQMNGAVEAANKNIKRIVQK--M 620
G+P + TDNG + V Q++ + I+ YRPQ +G VE N+ IK + K +
Sbjct: 176 GMPQVLGTDNGPAFVSQVSQSVAKLLGIDWKLHCAYRPQSSGQVERMNRTIKETLTKLTL 235
Query: 621 VTTYKNWHEMLPYALHGYRTTVRSSTGATPFSLVYGMEAVLPLEVEIPSLRVIMEAKLSE 680
T ++W +LP AL+ R T G TP+ ++YG L + P + +K +
Sbjct: 236 ATGTRDWVLLLPLALYRARNT-PGPHGLTPYEILYGAPPPL-VNFHDPEM-----SKFTN 288
Query: 681 AEWCQSRYDQLNLIEEKRMDAMARGHSYQARMKTAFDKKVNPREFKVGELVLKRRISQQP 740
+ Q+ L ++ + +A + Q D+ V P F+VG+ V RR +
Sbjct: 289 SPSLQAHLQALQAVQREVWKPLAAAYQDQ------LDQPVIPHPFRVGDTVWVRRHQTK- 341
Query: 741 DPRGKWTPNYEGPYVV 756
P ++GPY V
Sbjct: 342 ----NLEPRWKGPYTV 353
>POL_FENV1 (P31792) Pol polyprotein [Contains: Reverse
transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
(RT); Integrase (IN)] (Fragment)
Length = 1046
Score = 100 bits (248), Expect = 2e-20
Identities = 158/674 (23%), Positives = 256/674 (37%), Gaps = 91/674 (13%)
Query: 9 KNQPIVWNDECQEAFDSIKNYLLEPPILVPPVEGRPLILYLSVFDESVGCVLGQ-RDETG 67
++ P W ++ Q AF+++K LL P L P +P L++ DE G G + G
Sbjct: 305 ESAPFTWQEKHQSAFEALKEALLSAPALGLPDTSKPFTLFI---DEKQGIAKGVLTQKLG 361
Query: 68 KKEHAIYYLSKKFTDCETRYTMLEKTCCALAWAAKRLRHYLVN--------HTTWLISRM 119
+ + YLSKK + + A A K + H I R
Sbjct: 362 PWKRPVAYLSKKLDPVAAGWPPCLRIMAATAMLVKDSAKLTLGQPLTVITPHALEAIVRQ 421
Query: 120 DPIKYIFEKAVVTGKIARGQMLLSEYD-IVFKTQKAIKGSIL--------ADHLAYQPLD 170
P ++I ++ Q LL + D I F + + L + H Q L
Sbjct: 422 TPDRWI-----TNARLTHYQALLLDTDRIQFGPPVTLNPATLLPAPEDQQSAHDCRQVLA 476
Query: 171 DYQPIEFDFPDEEIMYLKSKDCEEPLINEGPDPNSKWGLVFDGAVNAYGKGIGAVIVSPQ 230
+ D D+E+ PD + W +++ + GA +V
Sbjct: 477 ETHGTREDLKDQEL----------------PDADHSWYTDGSSYIDSGTRRAGAAVVD-- 518
Query: 231 GHHIPFTARILFECTNNMAEYEACIFGIEEAIDMRI-KHLDIYGDSALVINQIKGEWETH 289
GHHI + + + AE + + +A+++ K +IY DS + T
Sbjct: 519 GHHIIWAQSLPPGTSAQKAE----LIALTKALELSEGKKANIYTDSRYA-------FATA 567
Query: 290 HANLIPYRDYARRLLTYFTKVELHHIPRDENQMADALATLSSMFRVNHWNDVPIIKVQRL 349
H + Y R LLT K + A+ +A L ++F V II
Sbjct: 568 HTHGSIYE--RRGLLTSEGK--------EIKNKAEIIALLKALFLPRK---VAIIHCPGH 614
Query: 350 ERPSHVFAIGD-VIDQAGENVVGYKPWYYDIK---QFLLSREYPPGASKQDKKTLRRLAG 405
++ A G+ DQ V + K L + +Y Q++ + G
Sbjct: 615 QKGQDPIATGNRQADQVARQVAVAETLTLTTKLEETNLTTNKYAYTPEDQEEA---KAIG 671
Query: 406 RFLLDGDILYKRNYDMVLLRCVDEHEAEQLMHDVHDGTFGTHATGHTMSRKLLRAGYYWM 465
L +++ +VL R EA ++ +H T H + + + + +
Sbjct: 672 AILNQDTKDWEKEGKIVLPR----KEALAMIQQMHAWT---HLSNQKLKLLIEKTDFLIP 724
Query: 466 AMEHDCYQYARKCHKCQ-IYADKIHVPPHALNVMSSPWPFSMWGIDMIGRIEPKASNGHR 524
Q C CQ + A VP + P + W ID ++P + G++
Sbjct: 725 KAGTLIEQVTSACKVCQQVNAGATRVPEGKRTRGNRPGVY--WEIDFT-EVKPHYA-GYK 780
Query: 525 FILVAIDYFTKWVEAASYTNVTKQVVAKFIKNNIICRYGVPSKIITDNGTNLNNNVVQAL 584
++LV +D F+ WVEA T +VAK I I R+G+P I +DNG + V Q L
Sbjct: 781 YLLVFVDTFSGWVEAYPTRQETAHMVAKKILEEIFPRFGLPKVIGSDNGPAFVSQVSQGL 840
Query: 585 CEEFKIEHHNSSPYRPQMNGAVEAANKNIKRIVQKMV--TTYKNWHEMLPYALHGYRTTV 642
I YRPQ +G VE N+ IK + K+ T K+W +L AL R T
Sbjct: 841 ARTLGINWKLHCAYRPQSSGQVERMNRTIKETLTKLTLETGLKDWRRLLSLALLRARNT- 899
Query: 643 RSSTGATPFSLVYG 656
+ G TP+ ++YG
Sbjct: 900 PNRFGLTPYEILYG 913
>POL_MLVMO (P03355) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1199
Score = 99.8 bits (247), Expect = 3e-20
Identities = 88/316 (27%), Positives = 140/316 (43%), Gaps = 27/316 (8%)
Query: 446 THATGHTMSRKLLRAG--YYWMAMEHDCYQYARKCHKC-QIYADKIHVPPHALNVMSSPW 502
TH + M L R+ YY + + C C Q+ A K V +
Sbjct: 851 THLSFSKMKALLERSHSPYYMLNRDRTLKNITETCKACAQVNASKSAVKQGTR--VRGHR 908
Query: 503 PFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYTNVTKQVVAKFIKNNIICRY 562
P + W ID I+P G++++LV ID F+ W+EA T +VV K + I R+
Sbjct: 909 PGTHWEIDFT-EIKP-GLYGYKYLLVFIDTFSGWIEAFPTKKETAKVVTKKLLEEIFPRF 966
Query: 563 GVPSKIITDNGTNLNNNVVQALCEEFKIEHHNSSPYRPQMNGAVEAANKNIKRIVQK--M 620
G+P + TDNG + V Q + + I+ YRPQ +G VE N+ IK + K +
Sbjct: 967 GMPQVLGTDNGPAFVSKVSQTVADLLGIDWKLHCAYRPQSSGQVERMNRTIKETLTKLTL 1026
Query: 621 VTTYKNWHEMLPYALHGYRTTVRSSTGATPFSLVYGMEAVLPLEVEIPSLRVIMEAKLSE 680
T ++W +LP AL+ R T G TP+ ++YG P V P + +++
Sbjct: 1027 ATGSRDWVLLLPLALYRARNT-PGPHGLTPYEILYGAP---PPLVNFPDPDM---TRVTN 1079
Query: 681 AEWCQSRYDQLNLIEEKRMDAMARGHSYQARMKTAFDKKVNPREFKVGELVLKRRISQQP 740
+ Q+ L L++ + +A + Q D+ V P ++VG+ V RR +
Sbjct: 1080 SPSLQAHLQALYLVQHEVWRPLAAAYQEQ------LDRPVVPHPYRVGDTVWVRRHQTK- 1132
Query: 741 DPRGKWTPNYEGPYVV 756
P ++GPY V
Sbjct: 1133 ----NLEPRWKGPYTV 1144
>POL_MLVF5 (P26810) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1204
Score = 99.8 bits (247), Expect = 3e-20
Identities = 76/256 (29%), Positives = 121/256 (46%), Gaps = 22/256 (8%)
Query: 503 PFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYTNVTKQVVAKFIKNNIICRY 562
P + W ID ++P G++++LV +D F+ WVEA T +VV K + I R+
Sbjct: 914 PGTHWEIDFT-EVKP-GLYGYKYLLVFVDTFSGWVEAFPTKKETAKVVTKKLLEEIFPRF 971
Query: 563 GVPSKIITDNGTNLNNNVVQALCEEFKIEHHNSSPYRPQMNGAVEAANKNIKRIVQK--M 620
G+P + TDNG + V Q + + ++ YRPQ +G VE N+ IK + K +
Sbjct: 972 GMPQVLGTDNGPAFVSKVSQTVADLLGVDWKLHCAYRPQSSGQVERMNRTIKETLTKLTL 1031
Query: 621 VTTYKNWHEMLPYALHGYRTTVRSSTGATPFSLVYGMEAVLPLEVEIPSLRVIMEAKLSE 680
T ++W +LP AL+ R T G TP+ ++YG P V P + AK++
Sbjct: 1032 ATGSRDWVLLLPLALYRARNT-PGPHGLTPYEILYGAP---PPLVNFPDPDM---AKVTH 1084
Query: 681 AEWCQSRYDQLNLIEEKRMDAMARGHSYQARMKTAFDKKVNPREFKVGELVLKRRISQQP 740
Q+ L L++ + +A + Q D+ V P F+VG+ V RR +
Sbjct: 1085 NPSLQAHLQALYLVQHEVWRPLAAAYQEQ------LDRPVVPHPFRVGDTVWVRRHQTK- 1137
Query: 741 DPRGKWTPNYEGPYVV 756
P ++GPY V
Sbjct: 1138 ----NLEPRWKGPYTV 1149
>POL_MLVAK (P03357) Pol polyprotein [Contains: Reverse
transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
(RT); Integrase (IN)] (Fragment)
Length = 843
Score = 98.2 bits (243), Expect = 7e-20
Identities = 86/330 (26%), Positives = 149/330 (45%), Gaps = 29/330 (8%)
Query: 433 EQLMHDVHDGTFGTHATGHTMSRKLLRAG---YYWMAMEHDCYQYARKCHKC-QIYADKI 488
+Q + ++ D G+ + LL G YY + + A C C Q+ A K
Sbjct: 485 DQFVFELLDSLHRLTHLGYQKMKALLDRGESPYYMLNRDKTLQYVADSCTVCAQVNASKA 544
Query: 489 HVPPHALNVMSSPWPFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYTNVTKQ 548
+ + P S W ID ++P G++++LV +D F+ WVEA T +
Sbjct: 545 KIGAGVR--VRGHRPGSHWEIDFT-EVKP-GLYGYKYLLVFVDTFSGWVEAFPTKRETAR 600
Query: 549 VVAKFIKNNIICRYGVPSKIITDNGTNLNNNVVQALCEEFKIEHHNSSPYRPQMNGAVEA 608
VV+K + I R+G+P + +DNG + V Q++ + I+ + + YRPQ +G VE
Sbjct: 601 VVSKKLLEEIFPRFGMPQVLGSDNGPAFTSQVSQSVADLLGIDKLHCA-YRPQSSGQVER 659
Query: 609 ANKNIKRIVQKMVTT--YKNWHEMLPYALHGYRTTVRSSTGATPFSLVYGMEAVLPLEVE 666
N+ IK + K+ ++W +LP AL+ R T G TP+ ++YG L +
Sbjct: 660 MNRTIKETLTKLTLAAGTRDWVLLLPLALYRARNT-PGPHGLTPYEILYGAPPPL-VNFH 717
Query: 667 IPSLRVIMEAKLSEAEWCQSRYDQLNLIEEKRMDAMARGHSYQARMKTAFDKKVNPREFK 726
P + ++L+ + Q+ L ++ + +A + Q D+ V P F+
Sbjct: 718 DPDM-----SELTNSPSLQAHLQALQTVQREIWKPLAEAYRDQ------LDQPVIPHPFR 766
Query: 727 VGELVLKRRISQQPDPRGKWTPNYEGPYVV 756
+G+ V RR + P ++GPY V
Sbjct: 767 IGDSVWVRRHQTK-----NLEPRWKGPYTV 791
>POL_MLVCB (P08361) Pol polyprotein [Contains: Reverse
transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
(RT); Integrase (IN)] (Fragment)
Length = 282
Score = 95.5 bits (236), Expect = 5e-19
Identities = 68/237 (28%), Positives = 113/237 (46%), Gaps = 20/237 (8%)
Query: 522 GHRFILVAIDYFTKWVEAASYTNVTKQVVAKFIKNNIICRYGVPSKIITDNGTNLNNNVV 581
G++++LV +D F+ W+EA T +VV K + I R+G+P + TDNG + V
Sbjct: 12 GYKYLLVFVDTFSGWIEAFPTKKETAKVVTKKLLEEIFPRFGMPQVLGTDNGPAFVSKVS 71
Query: 582 QALCEEFKIEHHNSSPYRPQMNGAVEAANKNIKRIVQK--MVTTYKNWHEMLPYALHGYR 639
Q + + I+ YRPQ +G VE N+ IK + K + T ++W +LP AL+ R
Sbjct: 72 QTVADLLGIDWKLHCAYRPQSSGQVERMNRTIKETLTKLTLATGSRDWVLLLPLALYRAR 131
Query: 640 TTVRSSTGATPFSLVYGMEAVLPLEVEIPSLRVIMEAKLSEAEWCQSRYDQLNLIEEKRM 699
T G TP+ ++YG P V P + +++ + Q+ L L++ +
Sbjct: 132 NT-PGPHGLTPYEILYGAP---PPLVNFPDPDM---TRVTNSPSLQAHLQALYLVQHEVW 184
Query: 700 DAMARGHSYQARMKTAFDKKVNPREFKVGELVLKRRISQQPDPRGKWTPNYEGPYVV 756
+A + Q D+ V P ++VG+ V RR + P ++GPY V
Sbjct: 185 RPLAAAYQEQ------LDRPVVPHPYRVGDTVWVRRHQTK-----NLEPRWKGPYTV 230
>POL_SFV1 (P23074) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1161
Score = 91.7 bits (226), Expect = 7e-18
Identities = 89/365 (24%), Positives = 152/365 (41%), Gaps = 43/365 (11%)
Query: 383 LLSREYPPGASKQDKKTLRRLAGRFLLDGDILYKRNYDMVLLRCVDEHEAEQLMHDVHDG 442
LL YPPG KQ K TL + I+ + N ++ D + H++
Sbjct: 777 LLQGHYPPGYPKQYKYTLEE-------NKLIVERPNGIRIVPPKADREKIISTAHNI--A 827
Query: 443 TFGTHATGHTMSRKLLRAGYYWMAMEHDCYQYARKCHKCQIYADKIHVPPHALNVMSSPW 502
G AT +S K Y+W + D + R+C +C + P L +
Sbjct: 828 HTGRDATFLKVSSK-----YWWPNLRKDVVKSIRQCKQCLVTNATNLTSPPILRPVKPLK 882
Query: 503 PFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYTNVTKQVVAKFIKNNIICRY 562
PF + ID IG + P SNG+ +LV +D T +V Y A N++
Sbjct: 883 PFDKFYIDYIGPLPP--SNGYLHVLVVVDSMTGFVWL--YPTKAPSTSATVKALNMLTSI 938
Query: 563 GVPSKIITDNGTNLNNNVVQALCEEFKIEHHNSSPYRPQMNGAVEAANKNIKRIVQK-MV 621
+P + +D G ++ +E I+ S+PY PQ +G VE N +IKR++ K ++
Sbjct: 939 AIPKVLHSDQGAAFTSSTFADWAKEKGIQLEFSTPYHPQSSGKVERKNSDIKRLLTKLLI 998
Query: 622 TTYKNWHEMLPYALHGYRTTVRSSTGATPFSLVYGMEAVLPLEVEIPSLRVIMEAKLSEA 681
W+++LP + S+ TP L++G+++ P +
Sbjct: 999 GRPAKWYDLLPVVQLALNNSYSPSSKYTPHQLLFGVDSNTPF--------------ANSD 1044
Query: 682 EWCQSRYDQLNLIEEKRMDAMARGHSYQARMKTAFDKKVNPREFKVGELVLKRRISQQPD 741
SR ++L+L++E R +Q A + +P VG+LV + R+++
Sbjct: 1045 TLDLSREEELSLLQE------IRSSLHQPTSPPASSRSWSP---SVGQLV-QERVARPAS 1094
Query: 742 PRGKW 746
R +W
Sbjct: 1095 LRPRW 1099
>YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II
Length = 1268
Score = 89.0 bits (219), Expect = 5e-17
Identities = 80/320 (25%), Positives = 136/320 (42%), Gaps = 44/320 (13%)
Query: 400 LRRLAGRFLLDGDILYKRNYDMVLLRCVDEHEAEQLMHDVHDGTFGTHATGHTMSRKLLR 459
L+ + G LLD ++ ++ ++L+ +H+ H G ++ R
Sbjct: 763 LKLIHGCLLLDDRVIVPKSLQKIVLK---------QLHEGHPGI--------VQMKQKAR 805
Query: 460 AGYYWMAMEHDCYQYARKCHKCQIYADKIHVPPHALNVMSSPWPF--SMWG---IDMIGR 514
+ +W ++ D R C+ CQ + V P +PWP + W ID G
Sbjct: 806 SFVFWRGLDSDIENMVRHCNNCQENSKMPRVVP------LNPWPVPEAPWKRIHIDFAGP 859
Query: 515 IEPKASNGHRFILVAIDYFTKWVEAASYTNVTKQVVAKFIKNNIICRYGVPSKIITDNGT 574
+ NG ++LV +D TK+ E T V + I +G P II+DNGT
Sbjct: 860 L-----NGC-YLLVVVDAKTKYAEV-KLTRSISAVTTIDLLEEIFSIHGYPETIISDNGT 912
Query: 575 NLNNNVVQALCEEFKIEHHNSSPYRPQMNGAVEAANKNIKRIVQKMVTTYKNWHEMLPYA 634
L +++ +C+ IEH S+ Y P+ NGA E +KR + K+ ++L
Sbjct: 913 QLTSHLFAQMCQSHGIEHKTSAVYYPRSNGAAERFVDTLKRGIAKIKGEGSVNQQILNKF 972
Query: 635 LHGYRTTVRSS-TGATPFSLVYGMEAVLPLEVEIPSLRVIMEAKLSEAEWCQSRYDQLNL 693
L YR T S+ G+TP +G + + + +P+ RV+ KL++ Q N+
Sbjct: 973 LISYRNTPHSALNGSTPAECHFGRKIRTTMSLLMPTDRVLKVPKLTQY--------QQNM 1024
Query: 694 IEEKRMDAMARGHSYQARMK 713
+ AR ++Q K
Sbjct: 1025 KHHYELRNGARAKAFQVNQK 1044
>POL_AVIRE (P03360) Pol polyprotein [Contains: Reverse
transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
(RT); Integrase (IN)] (Fragment)
Length = 473
Score = 89.0 bits (219), Expect = 5e-17
Identities = 70/254 (27%), Positives = 107/254 (41%), Gaps = 19/254 (7%)
Query: 503 PFSMWGIDMIGRIEPKASNGHRFILVAIDYFTKWVEAASYTNVTKQVVAKFIKNNIICRY 562
P W +D I K G++++LV +D F+ WVEA T QVV K + +II R+
Sbjct: 188 PGEHWEVDFTEMITAKG--GYKYLLVLVDTFSGWVEAYPAKRETSQVVIKHLILDIIPRF 245
Query: 563 GVPSKIITDNGTNLNNNVVQALCEEFKIEHHNSSPYRPQMNGAVEAANKNIKRIVQKMVT 622
G+P +I +DNG V Q LCE + YRPQ +G VE N+ +K+ + K+
Sbjct: 246 GLPVQIGSDNGPAFVAKVTQQLCEALNVSWKLHCAYRPQSSGQVERMNRTLKKAIAKLED 305
Query: 623 TYKNWHEMLPYALHGYRTTVRSSTGATPFSLVYGMEAVLPLEVEIPSLRVIMEAKLSEAE 682
+ + P + T G +PF ++YG++ + V L I
Sbjct: 306 RDRRGLGLPPPSGFAPGTVYPGREGLSPFEILYGLKPPVVPRVGCDKLASITN------- 358
Query: 683 WCQSRYDQLNLIEEKRMDAMARGHSYQARMKTAFDKKVNPREFKVGELVLKRRISQQPDP 742
Q+ L ++ R A A +A P V +K+ QQ P
Sbjct: 359 --QTLLKSLQALQATRSLARAAARPTAPERSSARPYPTVPN--LVTSFFVKKHDFQQLGP 414
Query: 743 RGKWTPNYEGPYVV 756
R ++GPY V
Sbjct: 415 R------WDGPYTV 422
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.321 0.138 0.426
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 98,561,380
Number of Sequences: 164201
Number of extensions: 4346187
Number of successful extensions: 9153
Number of sequences better than 10.0: 127
Number of HSP's better than 10.0 without gapping: 107
Number of HSP's successfully gapped in prelim test: 20
Number of HSP's that attempted gapping in prelim test: 8987
Number of HSP's gapped (non-prelim): 155
length of query: 789
length of database: 59,974,054
effective HSP length: 118
effective length of query: 671
effective length of database: 40,598,336
effective search space: 27241483456
effective search space used: 27241483456
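The length figures above are related by simple arithmetic: the effective HSP length is subtracted once from the query and once per database sequence, and the effective search space is the product of the two effective lengths. A minimal sketch that re-derives the reported values from the numbers in this report; the function name is illustrative only, and it omits the lower bounds that a real BLAST implementation applies when a sequence is shorter than the HSP length correction:

```python
# Values copied from the statistics block of this report.
QUERY_LENGTH = 789
DB_LETTERS = 59_974_054
DB_SEQUENCES = 164_201
EFFECTIVE_HSP_LENGTH = 118


def effective_lengths(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space).

    Simplified: assumes every sequence is longer than hsp_len, so no
    clamping to a minimum length is needed.
    """
    eff_query = query_len - hsp_len
    # The correction is applied once for each database sequence.
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


eff_query, eff_db, search_space = effective_lengths(
    QUERY_LENGTH, DB_LETTERS, DB_SEQUENCES, EFFECTIVE_HSP_LENGTH
)
print("effective length of query:   ", eff_query)
print("effective length of database:", f"{eff_db:,}")
print("effective search space:      ", search_space)
```

With the inputs above, the three results agree with the "effective length of query", "effective length of database" and "effective search space" lines of the report.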
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 70 (31.6 bits)