
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0189.12
(963 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran... 170 2e-41
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III 168 6e-41
POL3_DROME (P04323) Retrovirus-related Pol polyprotein from tran... 162 4e-39
POL2_DROME (P20825) Retrovirus-related Pol polyprotein from tran... 158 8e-38
POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from tran... 154 8e-37
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei... 142 4e-33
RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protei... 139 5e-32
RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protei... 139 5e-32
POLY_DROME (P10401) Retrovirus-related Pol polyprotein from tran... 134 1e-30
YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II 115 6e-25
POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;... 101 1e-20
POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic pro... 99 7e-20
POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic pro... 99 7e-20
POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic pro... 98 1e-19
POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic pro... 97 2e-19
POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic pro... 97 3e-19
POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic prot... 88 1e-16
POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic prot... 87 2e-16
RRPO_OENBE (P31843) RNA-directed DNA polymerase homolog (Reverse... 86 4e-16
POL_SOCMV (P15629) Enzymatic polyprotein [Contains: Aspartic pro... 82 7e-15
>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
transposon 412 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1237
Score = 170 bits (430), Expect = 2e-41
Identities = 111/378 (29%), Positives = 181/378 (47%), Gaps = 10/378 (2%)
Query: 148 EQRISKLLGDNLDLFAWSHKYMPGIDPNFICHRLALNPGVEPITQTRRRMGNEKEKAIQQ 207
+ ++ + + +D+FA + P N +L L EP+ R + + + IQ
Sbjct: 276 KSQLENICSEYIDIFALESE--PITVNNLYKQQLRLKDD-EPVYTKNYRSPHSQVEEIQA 332
Query: 208 EVNKLLAADFIREIKYPTWLVNVVIVKKANG------KWRMCVDNTDLNKTCPKDSYPLP 261
+V KL+ D I E + +++V K + KWR+ +D +NK D +PLP
Sbjct: 333 QVQKLIK-DKIVEPSVSQYNSPLLLVPKKSSPNSDKKKWRLVIDYRQINKKLLADKFPLP 391
Query: 262 SIDKLVDGALGNELLSLMDAYSGYHQIKMHQSDEDKTTFMTARVNYCYQTMPFGLKNAGA 321
ID ++D + S +D SG+HQI++ + D T+F T+ +Y + +PFGLK A
Sbjct: 392 RIDDILDQLGRAKYFSCLDLMSGFHQIELDEGSRDITSFSTSNGSYRFTRLPFGLKIAPN 451
Query: 322 TYQRLMDRVFEGQVGRNMEVYVDDMIVKSVLGSNHHEDLMEAFGRIQKHNMRLNPEKCSF 381
++QR+M F G +Y+DD+IV + ++L E FG+ +++N++L+PEKCSF
Sbjct: 452 SFQRMMTIAFSGIEPSQAFLYMDDLIVIGCSEKHMLKNLTEVFGKCREYNLKLHPEKCSF 511
Query: 382 GIRGGKFLGFLITSRRIEINPDKCKAIQEMKSPSNVKEVQRLIGRITALSRFLLHSGDKS 441
+ FLG T + I + K IQ P + +R + RF+ + D S
Sbjct: 512 FMHEVTFLGHKCTDKGILPDDKKYDVIQNYPVPHDADSARRFVAFCNYYRRFIKNFADYS 571
Query: 442 APFIKCLKKNTTFEWNSECEEAFTRLKEMLSAPPVLSKPVQGLPLHLYFSVGDHAISSVI 501
+ KKN FEW EC++AF LK L P +L P + A +V+
Sbjct: 572 RHITRLCKKNVPFEWTDECQKAFIHLKSQLINPTLLQYPDFSKEFCITTDASKQACGAVL 631
Query: 502 LQEVDGEQKIVYFVSYHF 519
Q +G Q V + S F
Sbjct: 632 TQNHNGHQLPVAYASRAF 649
Score = 60.1 bits (144), Expect = 3e-08
Identities = 34/143 (23%), Positives = 70/143 (48%), Gaps = 2/143 (1%)
Query: 818 SPLLRCLPPEKYEAVMTEVHEG-VCASHIGGRSLASKVLRAGFYWPTIRKECAEFVRKCK 876
+P+ + ++ EA+++ +H+ + H G +KV R +YW + K E+VRKC+
Sbjct: 882 NPVTQINNEKEKEAILSTLHDDPIQGGHTGITKTLAKVKRH-YYWKNMSKYIKEYVRKCQ 940
Query: 877 KCQVFTDLPRAPPGQLVTISSPWPFAMWGVDLVGPFPTARSQMKFILVAVDYFTKWVEAE 936
KCQ +T + F VD +GP P + + ++ + + TK++ A
Sbjct: 941 KCQKAKTTKHTKTPMTITETPEHAFDRVVVDTIGPLPKSENGNEYAVTLICDLTKYLVAI 1000
Query: 937 PLASITAAKIISFYWKRIVCRFG 959
P+A+ +A + ++ + ++G
Sbjct: 1001 PIANKSAKTVAKAIFESFILKYG 1023
>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
Length = 2186
Score = 168 bits (426), Expect = 6e-41
Identities = 102/343 (29%), Positives = 174/343 (49%), Gaps = 8/343 (2%)
Query: 181 LALNPGVEPITQTRRRMGNEKEKAIQQEVNKLLAADFIREIKYPTWLVNVVIVKKANGKW 240
+ L G EPI Q R + + I++ + K+L IRE K P W VV+VKK +G
Sbjct: 934 IELKEGAEPIRQKPRPIPLALKPEIRKMIQKMLNQKVIRESKSP-WSSPVVLVKKKDGSI 992
Query: 241 RMCVDNTDLNKTCPKDSYPLPSIDKLVDGALGNELLSLMDAYSGYHQIKMHQSDEDKTTF 300
RMC+D +NK +++PLP+I+ + G +L ++ D +G+ QI + + ++ T F
Sbjct: 993 RMCIDYRKVNKVVKNNAHPLPNIEATLQSLAGKKLYTVFDMIAGFWQIPLDEKSKEITAF 1052
Query: 301 MTARVNYCYQTMPFGLKNAGATYQRLMDRVFEGQVGRNMEVYVDDMIVKSVLGSNHHEDL 360
+ + +PFGL + A +Q M+ + +G VYVDD+++ S H +D+
Sbjct: 1053 AIGSELFEWNVLPFGLVISPALFQGTMEEIIGDLLGVCAFVYVDDLLIASKDMEQHLQDV 1112
Query: 361 MEAFGRIQKHNMRLNPEKCSFGIRGGKFLGFLITSRRIEINPDKCKAIQEMKSPSNVKEV 420
EA RI+K M+L KC + ++LG +T +E K +++ P+NVKE+
Sbjct: 1113 KEALTRIRKSGMKLRASKCHIAKKEVEYLGHKVTLDGVETQEVKTDKMKQFSRPTNVKEL 1172
Query: 421 QRLIGRITALSRFLLHSGDKSAPFIKCLKKNTTFEWNSECEEAFTRLKEMLSAPPVLSKP 480
Q +G + +F+L+ ++ + + W E E AF LK+++ PVL++P
Sbjct: 1173 QSFLGLVGYYRKFILNFAQIASSLTSLISAKVAWIWEKEQEIAFQELKKLVCQTPVLAQP 1232
Query: 481 -----VQG-LPLHLYFSVGDHAISSVILQE-VDGEQKIVYFVS 516
++G P +Y I +V+ QE DG+Q + F S
Sbjct: 1233 DVEAALKGDRPFMIYTDASRKGIGAVLAQEGPDGQQHPIAFAS 1275
Score = 55.1 bits (131), Expect = 9e-07
Identities = 38/151 (25%), Positives = 66/151 (43%), Gaps = 3/151 (1%)
Query: 803 HYTLLDGILFRRGFSSPLLRCLPPEKYEAVMTEVHEGVCASHIGGRSLASKVLRAGFYWP 862
+Y ++ G+L +P + ++ E+HEG+ A H G + + V R FYWP
Sbjct: 1439 YYKIVGGVLKNTEIEEQSRSVVPEKIRTPLLKELHEGMLAGHFGIKKMWRMVHRK-FYWP 1497
Query: 863 TIRKECAEFVRKCKKCQVFTDLPRAPPGQLVTISSPWPFAMWGVDLVGPFPTARSQMKFI 922
+R VR C KC D + L +P + DL+ + + ++I
Sbjct: 1498 QMRVCVENCVRTCAKCLCANDHSKL-TSSLTPYRMTFPLEIVACDLMDVGLSVQGN-RYI 1555
Query: 923 LVAVDYFTKWVEAEPLASITAAKIISFYWKR 953
L +D FTK+ A P+ A ++ + +R
Sbjct: 1556 LTIIDLFTKYGTAVPIPDKKAETVLKAFVER 1586
>POL3_DROME (P04323) Retrovirus-related Pol polyprotein from
transposon 17.6 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1058
Score = 162 bits (410), Expect = 4e-39
Identities = 100/307 (32%), Positives = 159/307 (51%), Gaps = 3/307 (0%)
Query: 200 EKEKAIQQEVNKLLAADFIREIKYPTWLVNVVIVKKANGKWRMCVDNTDLNKTCPKDSYP 259
E E IQ +N+ + P W+V K+R+ +D LN+ D +P
Sbjct: 222 EVESQIQDMLNQGIIRTSNSPYNSPIWVVPKKQDASGKQKFRIVIDYRKLNEITVGDRHP 281
Query: 260 LPSIDKLVDGALGN-ELLSLMDAYSGYHQIKMHQSDEDKTTFMTARVNYCYQTMPFGLKN 318
+P++D+++ G LG + +D G+HQI+M KT F T +Y Y MPFGLKN
Sbjct: 282 IPNMDEIL-GKLGRCNYFTTIDLAKGFHQIEMDPESVSKTAFSTKHGHYEYLRMPFGLKN 340
Query: 319 AGATYQRLMDRVFEGQVGRNMEVYVDDMIVKSVLGSNHHEDLMEAFGRIQKHNMRLNPEK 378
A AT+QR M+ + + ++ VY+DD+IV S H + L F ++ K N++L +K
Sbjct: 341 APATFQRCMNDILRPLLNKHCLVYLDDIIVFSTSLDEHLQSLGLVFEKLAKANLKLQLDK 400
Query: 379 CSFGIRGGKFLGFLITSRRIEINPDKCKAIQEMKSPSNVKEVQRLIGRITALSRFLLHSG 438
C F + FLG ++T I+ NP+K +AIQ+ P+ KE++ +G +F+ +
Sbjct: 401 CEFLKQETTFLGHVLTPDGIKPNPEKIEAIQKYPIPTKPKEIKAFLGLTGYYRKFIPNFA 460
Query: 439 DKSAPFIKCLKKNTTFE-WNSECEEAFTRLKEMLSAPPVLSKPVQGLPLHLYFSVGDHAI 497
D + P KCLKKN + N E + AF +LK ++S P+L P L D A+
Sbjct: 461 DIAKPMTKCLKKNMKIDTTNPEYDSAFKKLKYLISEDPILKVPDFTKKFTLTTDASDVAL 520
Query: 498 SSVILQE 504
+V+ Q+
Sbjct: 521 GAVLSQD 527
>POL2_DROME (P20825) Retrovirus-related Pol polyprotein from
transposon 297 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1059
Score = 158 bits (399), Expect = 8e-38
Identities = 94/298 (31%), Positives = 153/298 (50%), Gaps = 7/298 (2%)
Query: 189 PITQTRRRMGNEKEKAIQQEVNKLLAADFIRE----IKYPTWLVNVVIVKKANGKWRMCV 244
PI + + E ++ +V ++L IRE PTW+V K+R+ +
Sbjct: 206 PIYSKQYPLAQTHEIEVENQVQEMLNQGLIRESNSPYNSPTWVVPKKPDASGANKYRVVI 265
Query: 245 DNTDLNKTCPKDSYPLPSIDKLVDGALGN-ELLSLMDAYSGYHQIKMHQSDEDKTTFMTA 303
D LN+ D YP+P++D+++ G LG + + +D G+HQI+M + KT F T
Sbjct: 266 DYRKLNEITIPDRYPIPNMDEIL-GKLGKCQYFTTIDLAKGFHQIEMDEESISKTAFSTK 324
Query: 304 RVNYCYQTMPFGLKNAGATYQRLMDRVFEGQVGRNMEVYVDDMIVKSVLGSNHHEDLMEA 363
+Y Y MPFGL+NA AT+QR M+ + + ++ VY+DD+I+ S + H +
Sbjct: 325 SGHYEYLRMPFGLRNAPATFQRCMNNILRPLLNKHCLVYLDDIIIFSTSLTEHLNSIQLV 384
Query: 364 FGRIQKHNMRLNPEKCSFGIRGGKFLGFLITSRRIEINPDKCKAIQEMKSPSNVKEVQRL 423
F ++ N++L +KC F + FLG ++T I+ NP K KAI P+ KE++
Sbjct: 385 FTKLADANLKLQLDKCEFLKKEANFLGHIVTPDGIKPNPIKVKAIVSYPIPTKDKEIRAF 444
Query: 424 IGRITALSRFLLHSGDKSAPFIKCLKKNTTFEWNS-ECEEAFTRLKEMLSAPPVLSKP 480
+G +F+ + D + P CLKK T + E EAF +LK ++ P+L P
Sbjct: 445 LGLTGYYRKFIPNYADIAKPMTSCLKKRTKIDTQKLEYIEAFEKLKALIIRDPILQLP 502
>POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from
transposon opus [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1003
Score = 154 bits (390), Expect = 8e-37
Identities = 96/328 (29%), Positives = 165/328 (50%), Gaps = 17/328 (5%)
Query: 205 IQQEVNKLLAADFIRE----IKYPTWLVNVVIVKKANGKWRMCVDNTDLNKTCPKDSYPL 260
+++++++LL IR P W+V ++RM VD LN D+YP+
Sbjct: 139 VERQIDELLQDGIIRPSNSPYNSPIWIVPKKPKPNGEKQYRMVVDFKRLNTVTIPDTYPI 198
Query: 261 PSIDKLVDGALGN-ELLSLMDAYSGYHQIKMHQSDEDKTTFMTARVNYCYQTMPFGLKNA 319
P I+ + +LGN + + +D SG+HQI M +SD KT F T Y + +PFGLKNA
Sbjct: 199 PDINATL-ASLGNAKYFTTLDLTSGFHQIHMKESDIPKTAFSTLNGKYEFLRLPFGLKNA 257
Query: 320 GATYQRLMDRVFEGQVGRNMEVYVDDMIVKSVLGSNHHEDLMEAFGRIQKHNMRLNPEKC 379
A +QR++D + +G+ VY+DD+IV S H ++L + K N+++N EK
Sbjct: 258 PAIFQRMIDDILREHIGKVCYVYIDDIIVFSEDYDTHWKNLRLVLASLSKANLQVNLEKS 317
Query: 380 SFGIRGGKFLGFLITSRRIEINPDKCKAIQEMKSPSNVKEVQRLIGRITALSRFLLHSGD 439
F +FLG+++T+ I+ +P K +AI EM P++VKE++R +G + +F+
Sbjct: 318 HFLDTQVEFLGYIVTADGIKADPKKVRAISEMPPPTSVKELKRFLGMTSYYRKFIQDYAK 377
Query: 440 KSAPFIKCLK-----------KNTTFEWNSECEEAFTRLKEMLSAPPVLSKPVQGLPLHL 488
+ P + + ++F LK +L + +L+ P P HL
Sbjct: 378 VAKPLTNLTRGLYANIKSSQSSKVPITLDETALQSFNDLKSILCSSEILAFPCFTKPFHL 437
Query: 489 YFSVGDHAISSVILQEVDGEQKIVYFVS 516
+ AI +V+ Q+ G + + ++S
Sbjct: 438 TTDASNWAIGAVLSQDDQGRDRPIAYIS 465
>RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protein
type 1
Length = 1333
Score = 142 bits (358), Expect = 4e-33
Identities = 88/318 (27%), Positives = 157/318 (48%), Gaps = 3/318 (0%)
Query: 201 KEKAIQQEVNKLLAADFIREIKYPTWLVNVVIVKKANGKWRMCVDNTDLNKTCPKDSYPL 260
K +A+ E+N+ L + IRE K V+ V K G RM VD LNK + YPL
Sbjct: 424 KMQAMNDEINQGLKSGIIRESKAIN-ACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPL 482
Query: 261 PSIDKLVDGALGNELLSLMDAYSGYHQIKMHQSDEDKTTFMTARVNYCYQTMPFGLKNAG 320
P I++L+ G+ + + +D S YH I++ + DE K F R + Y MP+G+ A
Sbjct: 483 PLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAP 542
Query: 321 ATYQRLMDRVFEGQVGRNMEVYVDDMIVKSVLGSNHHEDLMEAFGRIQKHNMRLNPEKCS 380
A +Q ++ + ++ Y+DD+++ S S H + + + +++ N+ +N KC
Sbjct: 543 AHFQYFINTILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCE 602
Query: 381 FGIRGGKFLGFLITSRRIEINPDKCKAIQEMKSPSNVKEVQRLIGRITALSRFLLHSGDK 440
F KF+G+ I+ + + + + K P N KE+++ +G + L +F+ +
Sbjct: 603 FHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQL 662
Query: 441 SAPFIKCLKKNTTFEWNSECEEAFTRLKEMLSAPPVLSKPVQGLPLHLYFSVGDHAISSV 500
+ P LKK+ ++W +A +K+ L +PPVL + L D A+ +V
Sbjct: 663 THPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAV 722
Query: 501 ILQEVDGEQKIVYFVSYH 518
+ Q+ D ++ Y V Y+
Sbjct: 723 LSQKHDDDK--YYPVGYY 738
Score = 62.0 bits (149), Expect = 7e-09
Identities = 73/317 (23%), Positives = 130/317 (40%), Gaps = 22/317 (6%)
Query: 651 QFKTSNNQAEYEALIAGLKL---AIEVQIDSLPVRTDSPLVANQVNGEFQVKELALIKYV 707
Q S + E A+I LK +E I+ + TD + ++ E + + K +
Sbjct: 746 QLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESEPEN----KRL 801
Query: 708 ERVRLLMGRLQQVVVEFVPRAQNQRADALAKLASTRKPGNNRSVIQETLANPNIEGEFVA 767
R +L + + + P + N ADAL+++ +P I + + +I FV
Sbjct: 802 ARWQLFLQDFN-FEINYRPGSANHIADALSRIVDETEP------IPKDSEDNSIN--FVN 852
Query: 768 SVDRQETWMNPIIDILAGEPSDVVKYSKAQRREAGHYTLLDGILFRRGFSSPLLRCLPPE 827
+ + + N ++ + + + +R + L DG+L +L +
Sbjct: 853 QISITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLINS--KDQILLPNDTQ 910
Query: 828 KYEAVMTEVHEGVCASHIGGRSLASKVLRAGFYWPTIRKECAEFVRKCKKCQVFTDLPRA 887
++ + HE H G L + +LR F W IRK+ E+V+ C CQ+
Sbjct: 911 LTRTIIKKYHEEGKLIHPGIELLTNIILRR-FTWKGIRKQIQEYVQNCHTCQINKSRNHK 969
Query: 888 PPGQLVTI-SSPWPFAMWGVDLVGPFPTARSQMKFILVAVDYFTKWVEAEPLA-SITAAK 945
P G L I S P+ +D + P + S + V VD F+K P SITA +
Sbjct: 970 PYGPLQPIPPSERPWESLSMDFITALPES-SGYNALFVVVDRFSKMAILVPCTKSITAEQ 1028
Query: 946 IISFYWKRIVCRFGVPR 962
+ +R++ FG P+
Sbjct: 1029 TARMFDQRVIAYFGNPK 1045
>RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protein
type 3
Length = 1333
Score = 139 bits (349), Expect = 5e-32
Identities = 87/318 (27%), Positives = 157/318 (49%), Gaps = 3/318 (0%)
Query: 201 KEKAIQQEVNKLLAADFIREIKYPTWLVNVVIVKKANGKWRMCVDNTDLNKTCPKDSYPL 260
K +A+ E+N+ L + IRE K V+ V K G RM VD LNK + YPL
Sbjct: 424 KMQAMNDEINQGLKSGIIRESKAIN-ACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPL 482
Query: 261 PSIDKLVDGALGNELLSLMDAYSGYHQIKMHQSDEDKTTFMTARVNYCYQTMPFGLKNAG 320
P I++L+ G+ + + +D S YH I++ + DE K F R + Y MP+G+ A
Sbjct: 483 PLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISIAP 542
Query: 321 ATYQRLMDRVFEGQVGRNMEVYVDDMIVKSVLGSNHHEDLMEAFGRIQKHNMRLNPEKCS 380
A +Q ++ + ++ Y+D++++ S S H + + + +++ N+ +N KC
Sbjct: 543 AHFQYFINTILGEVKESHVVCYMDNILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCE 602
Query: 381 FGIRGGKFLGFLITSRRIEINPDKCKAIQEMKSPSNVKEVQRLIGRITALSRFLLHSGDK 440
F KF+G+ I+ + + + + K P N KE+++ +G + L +F+ +
Sbjct: 603 FHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQL 662
Query: 441 SAPFIKCLKKNTTFEWNSECEEAFTRLKEMLSAPPVLSKPVQGLPLHLYFSVGDHAISSV 500
+ P LKK+ ++W +A +K+ L +PPVL + L D A+ +V
Sbjct: 663 THPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAV 722
Query: 501 ILQEVDGEQKIVYFVSYH 518
+ Q+ D ++ Y V Y+
Sbjct: 723 LSQKHDDDK--YYPVGYY 738
Score = 62.0 bits (149), Expect = 7e-09
Identities = 73/317 (23%), Positives = 130/317 (40%), Gaps = 22/317 (6%)
Query: 651 QFKTSNNQAEYEALIAGLKL---AIEVQIDSLPVRTDSPLVANQVNGEFQVKELALIKYV 707
Q S + E A+I LK +E I+ + TD + ++ E + + K +
Sbjct: 746 QLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESEPEN----KRL 801
Query: 708 ERVRLLMGRLQQVVVEFVPRAQNQRADALAKLASTRKPGNNRSVIQETLANPNIEGEFVA 767
R +L + + + P + N ADAL+++ +P I + + +I FV
Sbjct: 802 ARWQLFLQDFN-FEINYRPGSANHIADALSRIVDETEP------IPKDSEDNSIN--FVN 852
Query: 768 SVDRQETWMNPIIDILAGEPSDVVKYSKAQRREAGHYTLLDGILFRRGFSSPLLRCLPPE 827
+ + + N ++ + + + +R + L DG+L +L +
Sbjct: 853 QISITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLINS--KDQILLPNDTQ 910
Query: 828 KYEAVMTEVHEGVCASHIGGRSLASKVLRAGFYWPTIRKECAEFVRKCKKCQVFTDLPRA 887
++ + HE H G L + +LR F W IRK+ E+V+ C CQ+
Sbjct: 911 LTRTIIKKYHEEGKLIHPGIELLTNIILRR-FTWKGIRKQIQEYVQNCHTCQINKSRNHK 969
Query: 888 PPGQLVTI-SSPWPFAMWGVDLVGPFPTARSQMKFILVAVDYFTKWVEAEPLA-SITAAK 945
P G L I S P+ +D + P + S + V VD F+K P SITA +
Sbjct: 970 PYGPLQPIPPSERPWESLSMDFITALPES-SGYNALFVVVDRFSKMAILVPCTKSITAEQ 1028
Query: 946 IISFYWKRIVCRFGVPR 962
+ +R++ FG P+
Sbjct: 1029 TARMFDQRVIAYFGNPK 1045
>RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protein
type 2
Length = 1333
Score = 139 bits (349), Expect = 5e-32
Identities = 87/318 (27%), Positives = 157/318 (49%), Gaps = 3/318 (0%)
Query: 201 KEKAIQQEVNKLLAADFIREIKYPTWLVNVVIVKKANGKWRMCVDNTDLNKTCPKDSYPL 260
K +A+ E+N+ L + IRE K V+ V K G RM VD LNK + YPL
Sbjct: 424 KMQAMNDEINQGLKSGIIRESKAIN-ACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPL 482
Query: 261 PSIDKLVDGALGNELLSLMDAYSGYHQIKMHQSDEDKTTFMTARVNYCYQTMPFGLKNAG 320
P I++L+ G+ + + +D S YH I++ + DE K F R + Y MP+G+ A
Sbjct: 483 PLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISIAP 542
Query: 321 ATYQRLMDRVFEGQVGRNMEVYVDDMIVKSVLGSNHHEDLMEAFGRIQKHNMRLNPEKCS 380
A +Q ++ + ++ Y+D++++ S S H + + + +++ N+ +N KC
Sbjct: 543 AHFQYFINTILGEVKESHVVCYMDNILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCE 602
Query: 381 FGIRGGKFLGFLITSRRIEINPDKCKAIQEMKSPSNVKEVQRLIGRITALSRFLLHSGDK 440
F KF+G+ I+ + + + + K P N KE+++ +G + L +F+ +
Sbjct: 603 FHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQL 662
Query: 441 SAPFIKCLKKNTTFEWNSECEEAFTRLKEMLSAPPVLSKPVQGLPLHLYFSVGDHAISSV 500
+ P LKK+ ++W +A +K+ L +PPVL + L D A+ +V
Sbjct: 663 THPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAV 722
Query: 501 ILQEVDGEQKIVYFVSYH 518
+ Q+ D ++ Y V Y+
Sbjct: 723 LSQKHDDDK--YYPVGYY 738
Score = 62.0 bits (149), Expect = 7e-09
Identities = 73/317 (23%), Positives = 130/317 (40%), Gaps = 22/317 (6%)
Query: 651 QFKTSNNQAEYEALIAGLKL---AIEVQIDSLPVRTDSPLVANQVNGEFQVKELALIKYV 707
Q S + E A+I LK +E I+ + TD + ++ E + + K +
Sbjct: 746 QLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESEPEN----KRL 801
Query: 708 ERVRLLMGRLQQVVVEFVPRAQNQRADALAKLASTRKPGNNRSVIQETLANPNIEGEFVA 767
R +L + + + P + N ADAL+++ +P I + + +I FV
Sbjct: 802 ARWQLFLQDFN-FEINYRPGSANHIADALSRIVDETEP------IPKDSEDNSIN--FVN 852
Query: 768 SVDRQETWMNPIIDILAGEPSDVVKYSKAQRREAGHYTLLDGILFRRGFSSPLLRCLPPE 827
+ + + N ++ + + + +R + L DG+L +L +
Sbjct: 853 QISITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLINS--KDQILLPNDTQ 910
Query: 828 KYEAVMTEVHEGVCASHIGGRSLASKVLRAGFYWPTIRKECAEFVRKCKKCQVFTDLPRA 887
++ + HE H G L + +LR F W IRK+ E+V+ C CQ+
Sbjct: 911 LTRTIIKKYHEEGKLIHPGIELLTNIILRR-FTWKGIRKQIQEYVQNCHTCQINKSRNHK 969
Query: 888 PPGQLVTI-SSPWPFAMWGVDLVGPFPTARSQMKFILVAVDYFTKWVEAEPLA-SITAAK 945
P G L I S P+ +D + P + S + V VD F+K P SITA +
Sbjct: 970 PYGPLQPIPPSERPWESLSMDFITALPES-SGYNALFVVVDRFSKMAILVPCTKSITAEQ 1028
Query: 946 IISFYWKRIVCRFGVPR 962
+ +R++ FG P+
Sbjct: 1029 TARMFDQRVIAYFGNPK 1045
>POLY_DROME (P10401) Retrovirus-related Pol polyprotein from
transposon gypsy [Contains: Reverse transcriptase (EC
2.7.7.49); Endonuclease]
Length = 1035
Score = 134 bits (337), Expect = 1e-30
Identities = 97/321 (30%), Positives = 152/321 (47%), Gaps = 25/321 (7%)
Query: 205 IQQEVNKLLAADFIREIKYP----TWLVNVVIVKKA-----NGKWRMCVDNTDLNKTCPK 255
+ EV +LL IR + P TW+V+ KK N R+ +D LN+
Sbjct: 197 VNNEVKQLLKDGIIRPSRSPYNSPTWVVD----KKGTDAFGNPNKRLVIDFRKLNEKTIP 252
Query: 256 DSYPLPSIDKLVDGALGNELLSLMDAYSGYHQIKMHQSDEDKTTFMTARVNYCYQTMPFG 315
D YP+PSI ++ + + +D SGYHQI + + D +KT+F Y + +PFG
Sbjct: 253 DRYPMPSIPMILANLGKAKFFTTLDLKSGYHQIYLAEHDREKTSFSVNGGKYEFCRLPFG 312
Query: 316 LKNAGATYQRLMDRVFEGQVGRNMEVYVDDMIVKSVLGSNHHEDLMEAFGRIQKHNMRLN 375
L+NA + +QR +D V Q+G+ VYVDD+I+ S S+H + + NMR++
Sbjct: 313 LRNASSIFQRALDDVLREQIGKICYVYVDDVIIFSENESDHVRHIDTVLKCLIDANMRVS 372
Query: 376 PEKCSFGIRGGKFLGFLITSRRIEINPDKCKAIQEMKSPSNVKEVQRLIGRITALSRFLL 435
EK F ++LGF+++ + +P+K KAIQE P V +V+ +G + F+
Sbjct: 373 QEKTRFFKESVEYLGFIVSKDGTKSDPEKVKAIQEYPEPDCVYKVRSFLGLASYYRVFIK 432
Query: 436 HSGDKSAPFIKCLK-----------KNTTFEWNSECEEAFTRLKEMLSAPPVLSK-PVQG 483
+ P LK K E+N AF RL+ +L++ V+ K P
Sbjct: 433 DFAAIARPITDILKGENGSVSKHMSKKIPVEFNETQRNAFQRLRNILASEDVILKYPDFK 492
Query: 484 LPLHLYFSVGDHAISSVILQE 504
P L I +V+ QE
Sbjct: 493 KPFDLTTDASASGIGAVLSQE 513
>YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II
Length = 1268
Score = 115 bits (288), Expect = 6e-25
Identities = 76/251 (30%), Positives = 130/251 (51%), Gaps = 11/251 (4%)
Query: 189 PITQTRRRMGNEKEKAIQQEVNKLLAADFIREIKYPTWLVNVVIVKK-ANGKWRMCVDN- 246
P+ + R + +A++ E+N+L I I Y W +V++KK GK R+C D
Sbjct: 440 PVFKRARPVPYGSLEAVETELNRLQEMGVIVPITYAKWAAPIVVIKKKGTGKIRVCADFK 499
Query: 247 -TDLNKTCPKDSYPLPSIDKLVDGALGNELLSLMDAYSGYHQIKMHQSDEDKTTFMTARV 305
+ LN + +PLP+ + + G + S +D Y Q+++ + + T R
Sbjct: 500 CSGLNAALKDEFHPLPTSEDIFSRLKGT-VYSQIDLKDAYLQVELDEEAQKLAVINTHRG 558
Query: 306 NYCYQTMPFGLKNAGATYQRLMDRVFEGQVGRNMEVYVDDMIVKSVLGSNHHEDLMEAFG 365
+ Y M FGLK A A++Q++MD++ G G + VY DD+I+ + H + L E F
Sbjct: 559 IFKYLRMTFGLKPAPASFQKIMDKMVSGLTG--VAVYWDDIIISASSIEEHEKILRELFE 616
Query: 366 RIQKHNMRLNPEKCSFGIRGGKFLGFLITSRRIEINPD--KCKAIQEMKSPSNVKEVQRL 423
R +++ R++ EKC+F + FLGF+ R PD K +AI+ MK+P++ K++
Sbjct: 617 RFKEYGFRVSAEKCAFAQKQVTFLGFVDEHGR---RPDSKKTEAIRSMKAPTDQKQLASF 673
Query: 424 IGRITALSRFL 434
+G LSR +
Sbjct: 674 LGAADWLSRMM 684
Score = 46.6 bits (109), Expect = 3e-04
Identities = 37/129 (28%), Positives = 56/129 (42%), Gaps = 21/129 (16%)
Query: 824 LPPEKYEAVMTEVHEGVCASHIGGRSLASKVLRAGFYWPTIRKECAEFVRKCKKCQVFTD 883
+P + V+ ++HEG H G + K R+ +W + + VR C CQ +
Sbjct: 778 VPKSLQKIVLKQLHEG----HPGIVQMKQKA-RSFVFWRGLDSDIENMVRHCNNCQENSK 832
Query: 884 LPRAPPGQLVTISSPWPF--AMW---GVDLVGPFPTARSQMKFILVAVDYFTKWVEAEPL 938
+PR P +PWP A W +D GP ++LV VD TK+ E +
Sbjct: 833 MPRVVP------LNPWPVPEAPWKRIHIDFAGPLNGC-----YLLVVVDAKTKYAEVKLT 881
Query: 939 ASITAAKII 947
SI+A I
Sbjct: 882 RSISAVTTI 890
>POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;
Protease (EC 3.4.23.-); Reverse transcriptase (EC
2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
Length = 1886
Score = 101 bits (251), Expect = 1e-20
Identities = 84/349 (24%), Positives = 155/349 (44%), Gaps = 27/349 (7%)
Query: 147 QEQRISKLLGDNLDLFAWSHKYMPGIDPNFICHRLALNPGVEPITQTRRRMGNEKEKAIQ 206
+E + K +G+N F ++K C +NP ++ + + + + E+A+
Sbjct: 1368 KEMKEMKYIGENPMEFWKNNKIK--------CKLNIINPDIKIMGRPIKHVTPGDEEAMT 1419
Query: 207 QEVNKLLAADFIR--EIKYPTWL--------VNVVIVKKANGKWRMCVDNTDLNKTCPKD 256
+++N LL IR E K+ + ++ + K+ GK RM + LN+ D
Sbjct: 1420 RQINLLLQMKVIRPSESKHRSTAFIVRSGTEIDPITGKEKKGKERMVFNYKLLNENTESD 1479
Query: 257 SYPLPSIDKLVDGALGNELLSLMDAYSGYHQIKMHQSDEDKTTFMTARVNYCYQTMPFGL 316
Y LP I+ ++ +++ S D SG+ Q+ M + T F+ Y + MPFGL
Sbjct: 1480 QYSLPGINTIISKVGRSKIYSKFDLKSGFWQVAMEEESVPWTAFLAGNKLYEWLVMPFGL 1539
Query: 317 KNAGATYQRLMDRVFEGQVGRNMEVYVDDMIVKSVLGSNHHEDLMEAFGRIQKHNMRLNP 376
KNA A +QR MD VF+G + + VY+DD++V S H + L +++ + L+P
Sbjct: 1540 KNAPAIFQRKMDNVFKG-TEKFIAVYIDDILVFSETAEQHSQHLYTMLQLCKENGLILSP 1598
Query: 377 EKCSFGIRGGKFLGFLITSRRIEINPDKCKAI-----QEMKSPSNVKEVQRLIGRITALS 431
K G FLG + +I++ P I +++ +P ++ +G ++
Sbjct: 1599 TKMKIGTPEIDFLGASLGCTKIKLQPHIISKICDFSDEKLATPEGMRS---WLGILSYAR 1655
Query: 432 RFLLHSGDKSAPFIKCLKKNTTFEWNSECEEAFTRLKEMLSAPPVLSKP 480
++ G P + + N E + ++KE + P L P
Sbjct: 1656 NYIQDIGKLVQPLRQKMAPTGDKRMNPETWKMVRQIKEKVKNLPDLQLP 1704
>POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 679
Score = 98.6 bits (244), Expect = 7e-20
Identities = 99/386 (25%), Positives = 169/386 (43%), Gaps = 16/386 (4%)
Query: 108 ESLDPRGEGRVNRPTPIEETKDLKFGEKI--LKIGTKLSEEQ----EQRISKLLGDNLDL 161
ES+ R + + P I K E+I L G +LSEE+ +QR+ K+ + L+
Sbjct: 160 ESMKKRSKTQQPEPVNISTNKIENPLEEIAILSEGRRLSEEKLFITQQRMQKI-EELLEK 218
Query: 162 FAWSHKYMPGIDPNFICHRLALNPGVEPITQTRRRMGNEKEKAIQQEVNKLLAADFIREI 221
+ P ++ + L+ + I + + +++ +LL I+
Sbjct: 219 VCSENPLDPNKTKQWMKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPS 278
Query: 222 KYP----TWLVNVVIVKKANGKWRMCVDNTDLNKTCPKDSYPLPSIDKLVDGALGNELLS 277
K P +LVN +K GK RM V+ +NK D+Y LP+ D+L+ G ++ S
Sbjct: 279 KSPHMAPAFLVNNE-AEKRRGKKRMVVNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFS 337
Query: 278 LMDAYSGYHQIKMHQSDEDKTTFMTARVNYCYQTMPFGLKNAGATYQRLMDRVFEGQVGR 337
D SG+ Q+ + Q T F + +Y + +PFGLK A + +QR MD F +V R
Sbjct: 338 SFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAF--RVFR 395
Query: 338 NM-EVYVDDMIVKSVLGSNHHEDLMEAFGRIQKHNMRLNPEKCSFGIRGGKFLGFLITSR 396
VYVDD++V S +H + + +H + L+ +K + FLG I
Sbjct: 396 KFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEG 455
Query: 397 RIEINPDKCKAIQEMKSP-SNVKEVQRLIGRITALSRFLLHSGDKSAPFIKCLKKNTTFE 455
+ + I + + K++QR +G +T S ++ P LK+N +
Sbjct: 456 THKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWR 515
Query: 456 WNSECEEAFTRLKEMLSAPPVLSKPV 481
W E ++K+ L P L P+
Sbjct: 516 WTKEDTLYMQKVKKNLQGFPPLHHPL 541
>POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 679
Score = 98.6 bits (244), Expect = 7e-20
Identities = 99/386 (25%), Positives = 170/386 (43%), Gaps = 16/386 (4%)
Query: 108 ESLDPRGEGRVNRPTPIEETKDLKFGEKI--LKIGTKLSEEQ----EQRISKLLGDNLDL 161
ES+ R + + P I K E+I L G +LSEE+ +QR+ K+ + L+
Sbjct: 160 ESMKKRSKTQQPEPVNISTNKIENPLEEIAILSEGRRLSEEKLFITQQRMQKI-EELLEK 218
Query: 162 FAWSHKYMPGIDPNFICHRLALNPGVEPITQTRRRMGNEKEKAIQQEVNKLLAADFIREI 221
+ P ++ + L+ + I + + +++ +LL I+
Sbjct: 219 VCSENPLDPNKTKQWMKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPS 278
Query: 222 KYP----TWLVNVVIVKKANGKWRMCVDNTDLNKTCPKDSYPLPSIDKLVDGALGNELLS 277
K P +LVN +K GK RM V+ +NK D+Y LP+ D+L+ G ++ S
Sbjct: 279 KSPHMAPAFLVNNE-AEKRRGKKRMVVNYKAMNKATIGDAYNLPNKDELLTLIRGKKIFS 337
Query: 278 LMDAYSGYHQIKMHQSDEDKTTFMTARVNYCYQTMPFGLKNAGATYQRLMDRVFEGQVGR 337
D SG+ Q+ + Q T F + +Y + +PFGLK A + +QR MD F +V R
Sbjct: 338 SFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAF--RVFR 395
Query: 338 NM-EVYVDDMIVKSVLGSNHHEDLMEAFGRIQKHNMRLNPEKCSFGIRGGKFLGFLITSR 396
VYVDD++V S +H + + +H + L+ +K + FLG I
Sbjct: 396 KFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEG 455
Query: 397 RIEINPDKCKAIQEMKSP-SNVKEVQRLIGRITALSRFLLHSGDKSAPFIKCLKKNTTFE 455
+ + I + + K++QR +G +T S ++ P LK+N ++
Sbjct: 456 THKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWK 515
Query: 456 WNSECEEAFTRLKEMLSAPPVLSKPV 481
W E ++K+ L P L P+
Sbjct: 516 WTKEDTLYMQKVKKNLQGFPPLHHPL 541
>POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 679
Score = 97.8 bits (242), Expect = 1e-19
Identities = 92/356 (25%), Positives = 159/356 (43%), Gaps = 14/356 (3%)
Query: 136 ILKIGTKLSEEQ----EQRISKLLGDNLDLFAWSHKYMPGIDPNFICHRLALNPGVEPIT 191
IL G +LSEE+ +QR+ K+ + L+ + P ++ + L+ + I
Sbjct: 190 ILSEGRRLSEEKLFITQQRMQKI-EELLEKVCSENPLDPNKTKQWMKASIKLSDPSKAIK 248
Query: 192 QTRRRMGNEKEKAIQQEVNKLLAADFIREIKYP----TWLVNVVIVKKANGKWRMCVDNT 247
+ + +++ +LL I+ K P +LVN +K GK RM V+
Sbjct: 249 VKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNE-AEKRRGKKRMVVNYK 307
Query: 248 DLNKTCPKDSYPLPSIDKLVDGALGNELLSLMDAYSGYHQIKMHQSDEDKTTFMTARVNY 307
+NK D+Y LP+ D+L+ G ++ S D SG+ Q+ + Q T F + +Y
Sbjct: 308 AMNKATIGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHY 367
Query: 308 CYQTMPFGLKNAGATYQRLMDRVFEGQVGRNM-EVYVDDMIVKSVLGSNHHEDLMEAFGR 366
+ +PFGLK A + +QR MD F +V R VYVDD++V S +H + +
Sbjct: 368 EWNVVPFGLKQAPSIFQRHMDEAF--RVFRKFCCVYVDDILVFSNNEEDHLLHVAMILQK 425
Query: 367 IQKHNMRLNPEKCSFGIRGGKFLGFLITSRRIEINPDKCKAIQEMKSP-SNVKEVQRLIG 425
+H + L+ +K + FLG I + + I + + K++QR +G
Sbjct: 426 CNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQLQRFLG 485
Query: 426 RITALSRFLLHSGDKSAPFIKCLKKNTTFEWNSECEEAFTRLKEMLSAPPVLSKPV 481
+T S ++ P LK+N ++W E ++K+ L P L P+
Sbjct: 486 ILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPL 541
>POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 680
Score = 97.1 bits (240), Expect = 2e-19
Identities = 101/433 (23%), Positives = 182/433 (41%), Gaps = 43/433 (9%)
Query: 61 YPLSNGHIGGIVVDQRIARECYCNAVDRYGKKNAS--VGHRCSEVETPEESLDPRGEGRV 118
YP+ HI + R+ E + ++ + K V +++E P E + EGR
Sbjct: 141 YPV---HIAKLTRAVRVGTEGFLESMKKRSKTQQPEPVNISTNKIENPLEEIAILSEGR- 196
Query: 119 NRPTPIEETKDLKFGEKILKIGTKLSEEQEQRISKLLGDNLDLFAWSHKYMPGIDPN--- 175
+ E+ L I + ++ E+ + K+ +N +DPN
Sbjct: 197 ------------RLSEEKLFITQQRMQKTEELLEKVCSEN------------PLDPNKTK 232
Query: 176 -FICHRLALNPGVEPITQTRRRMGNEKEKAIQQEVNKLLAADFIREIKYP----TWLVNV 230
++ + L+ + I + + +++ +LL I+ K P +LVN
Sbjct: 233 QWMKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNN 292
Query: 231 VIVKKANGKWRMCVDNTDLNKTCPKDSYPLPSIDKLVDGALGNELLSLMDAYSGYHQIKM 290
+ G RM V+ +NK D+Y LP+ D+L+ G ++ S D SG+ Q+ +
Sbjct: 293 E-AENGRGNKRMVVNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLL 351
Query: 291 HQSDEDKTTFMTARVNYCYQTMPFGLKNAGATYQRLMDRVFEGQVGRNM-EVYVDDMIVK 349
Q T F + +Y + +PFGLK A + +QR MD F +V R VYVDD++V
Sbjct: 352 DQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAF--RVFRKFCCVYVDDIVVF 409
Query: 350 SVLGSNHHEDLMEAFGRIQKHNMRLNPEKCSFGIRGGKFLGFLITSRRIEINPDKCKAIQ 409
S +H + + +H + L+ +K + FLG I + + I
Sbjct: 410 SNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHIN 469
Query: 410 EMKSP-SNVKEVQRLIGRITALSRFLLHSGDKSAPFIKCLKKNTTFEWNSECEEAFTRLK 468
+ + K++QR +G +T S ++ + P LK+N ++W E ++K
Sbjct: 470 KFPDTLEDKKQLQRFLGILTYASDYIPNLAQMRQPLQAKLKENVPWKWTKEDTLYMQKVK 529
Query: 469 EMLSAPPVLSKPV 481
+ L P L P+
Sbjct: 530 KNLQGFPPLHHPL 542
>POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 674
Score = 96.7 bits (239), Expect = 3e-19
Identities = 97/384 (25%), Positives = 167/384 (43%), Gaps = 19/384 (4%)
Query: 108 ESLDPRGEGRVNRPTPIEETKDLKFGEKILKIGTKLSEEQ----EQRISKLLGDNLDLFA 163
ES+ R + + P I K IL G +LSEE+ +QR+ K+ + L+
Sbjct: 162 ESMKKRSKTQQPEPVNISTNKIA-----ILSEGRRLSEEKLFITQQRMQKI-EELLEKVC 215
Query: 164 WSHKYMPGIDPNFICHRLALNPGVEPITQTRRRMGNEKEKAIQQEVNKLLAADFIREIKY 223
+ P ++ + L+ + I + + +++ +LL I+ K
Sbjct: 216 SENPLDPNKTKQWMKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKS 275
Query: 224 P----TWLVNVVIVKKANGKWRMCVDNTDLNKTCPKDSYPLPSIDKLVDGALGNELLSLM 279
P +LVN +K GK RM V+ +NK D+Y P+ D+L+ G ++ S
Sbjct: 276 PHMAPAFLVNNE-AEKRRGKKRMVVNYKAMNKATVGDAYNPPNKDELLTLIRGKKIFSSF 334
Query: 280 DAYSGYHQIKMHQSDEDKTTFMTARVNYCYQTMPFGLKNAGATYQRLMDRVFEGQVGRNM 339
D SG+ Q+ + Q T F + +Y + +PFGLK A + +QR MD F +V R
Sbjct: 335 DCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAF--RVFRKF 392
Query: 340 -EVYVDDMIVKSVLGSNHHEDLMEAFGRIQKHNMRLNPEKCSFGIRGGKFLGFLITSRRI 398
VYVDD++V S +H + + +H + L+ +K + FLG I
Sbjct: 393 CCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTH 452
Query: 399 EINPDKCKAIQEMKSP-SNVKEVQRLIGRITALSRFLLHSGDKSAPFIKCLKKNTTFEWN 457
+ + I + + K++QR +G +T S ++ P LK+N ++W
Sbjct: 453 KPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWT 512
Query: 458 SECEEAFTRLKEMLSAPPVLSKPV 481
E ++K+ L P L P+
Sbjct: 513 KEDTLYMQKVKKNLQGFPPLHHPL 536
>POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 666
Score = 88.2 bits (217), Expect = 1e-16
Identities = 72/298 (24%), Positives = 131/298 (43%), Gaps = 21/298 (7%)
Query: 234 KKANGKWRMCVDNTDLNKTCPKDSYPLPSIDKLVDGALGNELLSLMDAYSGYHQIKMHQS 293
++ GK RM V+ +N+ DS+ LP++ +L+ G + S D SG+ Q+ + +
Sbjct: 287 ERRRGKKRMVVNYKAINQATIGDSHNLPNMQELLTLLRGKSIFSSFDCKSGFWQVVLDEE 346
Query: 294 DEDKTTFMTARVNYCYQTMPFGLKNAGATYQRLMDRVFEGQVGRNMEVYVDDMIVKSVLG 353
+ T F + ++ ++ +PFGLK A + +QR M G + VYVDD+IV S
Sbjct: 347 SQKLTAFTCPQGHFQWKVVPFGLKQAPSIFQRHMQTALNG-ADKFCMVYVDDIIVFSNSE 405
Query: 354 SNHHEDLMEAFGRIQKHNMRLNPEKCSFGIRGGKFLGFLITS----------RRIEINPD 403
+H+ + ++K+ + L+ +K + FLG I I PD
Sbjct: 406 LDHYNHVYAVLKIVEKYGIILSKKKANLFKEKINFLGLEIDKGTHCPQNHILENIHKFPD 465
Query: 404 KCKAIQEMKSPSNVKEVQRLIGRITALSRFLLHSGDKSAPFIKCLKKNTTFEWNSECEEA 463
+ + + K +QR +G +T ++ + P LKK+ T+ W +
Sbjct: 466 RLE---------DKKHLQRFLGVLTYAETYIPKLAEIRKPLQVKLKKDVTWNWTQSDSDY 516
Query: 464 FTRLKEMLSAPPVLSKPVQGLPLHLYFSVGDHAISSVI-LQEVDGEQKIVYFVSYHFR 520
++K+ L + P L P L + D V+ + +DG + I + S F+
Sbjct: 517 VKKIKKNLGSFPKLYLPKPEDHLIIETDASDSFWGGVLKARALDGVELICRYSSGSFK 574
>POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 659
Score = 87.4 bits (215), Expect = 2e-16
Identities = 93/394 (23%), Positives = 171/394 (42%), Gaps = 50/394 (12%)
Query: 115 EGRVNRPTPIEETKD----LKFG-----EKILKIGTKLSEEQEQRISKLLGDNLDLFAWS 165
+ +VNRP PI T + L+ G E + +I E+ + ++ +N S
Sbjct: 153 KSKVNRPEPINITSNQHLFLEEGGNHVDEMLYEIQISKFSAIEEMLERVSSENPIDPEKS 212
Query: 166 HKYMPG----IDPNFICHRLALNPGVEPITQTRRRMGNEKEKAIQQEVNKLLAADFIREI 221
++M IDP + V+P++ + +++E+ +++ +LL I+
Sbjct: 213 KQWMTATIELIDPKTVVK-------VKPMSYSP----SDREE-FDRQIKELLELKVIKPS 260
Query: 222 KY----PTWLVNVVIVKKANGKWRMCVDNTDLNKTCPKDSYPLPSIDKLVDGALGNELLS 277
K P +LV ++ GK RM V+ +NK D++ LP+ D+L+ G ++ S
Sbjct: 261 KSTHMSPAFLVENE-AERRRGKKRMVVNYKAMNKATKGDAHNLPNKDELLTLVRGKKIYS 319
Query: 278 LMDAYSGYHQIKMHQSDEDKTTFMTARVNYCYQTMPFGLKNAGATYQRLMDRVFEGQVGR 337
D SG Q+ + + + T F + +Y + +PFGLK A + + + Q +
Sbjct: 320 SFDCKSGLWQVLLDKESQLLTAFTCPQGHYQWNVVPFGLKQAPSIFPKTYANSHSNQYSK 379
Query: 338 NMEVYVDDMIVKSVLG-SNHHEDLMEAFGRIQKHNMRLNPEKCSFGIRGGKFLGFLITS- 395
VYVDD++V S G H+ ++ R +K + L+ +K FLG I
Sbjct: 380 YCCVYVDDILVFSNTGRKEHYIHVLNILRRCEKLGIILSKKKAQLFKEKINFLGLEIDQG 439
Query: 396 ---------RRIEINPDKCKAIQEMKSPSNVKEVQRLIGRITALSRFLLHSGDKSAPFIK 446
I PD+ + + K++QR +G +T S ++ P
Sbjct: 440 THCPQNHILEHIHKFPDRIE---------DKKQLQRFLGILTYASDYIPKLASIRKPLQS 490
Query: 447 CLKKNTTFEWNSECEEAFTRLKEMLSAPPVLSKP 480
LK+++T+ WN + ++K+ L + P L P
Sbjct: 491 KLKEDSTWTWNDTDSQYMAKIKKNLKSFPKLYHP 524
>RRPO_OENBE (P31843) RNA-directed DNA polymerase homolog (Reverse
transcriptase homolog)
Length = 142
Score = 86.3 bits (212), Expect = 4e-16
Identities = 43/120 (35%), Positives = 70/120 (57%)
Query: 241 RMCVDNTDLNKTCPKDSYPLPSIDKLVDGALGNELLSLMDAYSGYHQIKMHQSDEDKTTF 300
RMC+D L K K+ YP+P +D L D + +D SGY Q+++ + DE KTT
Sbjct: 7 RMCIDYRALTKVTIKNKYPIPRVDDLFDRLAQATWFTKLDLRSGYWQVRIAKGDEPKTTC 66
Query: 301 MTARVNYCYQTMPFGLKNAGATYQRLMDRVFEGQVGRNMEVYVDDMIVKSVLGSNHHEDL 360
+T ++ ++ MPFGL NA AT+ LM+ V + + VY+DD++V ++ ++ HE +
Sbjct: 67 VTRYGSFEFRVMPFGLTNALATFCNLMNNVLYEYLDHFVVVYLDDLVVYTIYSNSLHEHI 126
>POL_SOCMV (P15629) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 692
Score = 82.0 bits (201), Expect = 7e-15
Identities = 77/318 (24%), Positives = 152/318 (47%), Gaps = 16/318 (5%)
Query: 206 QQEVNKLLAADFIREIKYPT-----WLVNVVIVKKANGKWRMCVDNTDLNKTCPKDSYPL 260
++E LL IRE + P ++ N +K+ GK RM ++ +N+ DSY L
Sbjct: 220 KEECEDLLKKGLIRESQSPHSAPAFYVENHNEIKR--GKRRMVINYKKMNEATIGDSYKL 277
Query: 261 PSIDKLVDGALGNELLSLMDAYSGYHQIKMHQSDEDKTTF-MTARVNYCYQTMPFGLKNA 319
P D +++ G+ S +DA SGY+Q+++H++ + T F + +Y + + FGLK A
Sbjct: 278 PRKDFILEKIKGSLWFSSLDAKSGYYQLRLHENTKPLTAFSCPPQKHYEWNVLSFGLKQA 337
Query: 320 GATYQRLMDRVFEGQVGRNMEVYVDDMIVKSVLGSNHH-EDLMEAFGRIQKHNMRLNPEK 378
+ YQR MD+ +G + Y+DD+++ + H D+ RI++ + ++ +K
Sbjct: 338 PSIYQRFMDQSLKG-LEHICLAYIDDILIFTKGSKEQHVNDVRIVLQRIKEKGIIISKKK 396
Query: 379 CSFGIRGGKFLGFLITSR-RIEINP-DKCKAIQEMKSPSNVKEVQRLIGRITALSR--FL 434
+ ++LG I I+++P + K +Q + K++QR +G I ++ F
Sbjct: 397 SKLIQQEIEYLGLKIQGNGEIDLSPHTQEKILQFPDELEDRKQIQRFLGCINYIANEGFF 456
Query: 435 LHSGDKSAPFIKCLKKNTTFEWNSECEEAFTRLK-EMLSAPPVLSKPVQGLPLHLYFSVG 493
+ + K + ++W++ + +K ++ S P + + +Q L +
Sbjct: 457 KNLALERKHLQKKISVKNPWKWDTIDTKMVQSIKGKIQSLPKLYNASIQDF-LIVETDAS 515
Query: 494 DHAISSVILQEVDGEQKI 511
H+ S + G+QKI
Sbjct: 516 QHSWSGCLRALPKGKQKI 533
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.323 0.138 0.418
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 112,598,271
Number of Sequences: 164201
Number of extensions: 4793532
Number of successful extensions: 10652
Number of sequences better than 10.0: 128
Number of HSP's better than 10.0 without gapping: 43
Number of HSP's successfully gapped in prelim test: 85
Number of HSP's that attempted gapping in prelim test: 10452
Number of HSP's gapped (non-prelim): 210
length of query: 963
length of database: 59,974,054
effective HSP length: 120
effective length of query: 843
effective length of database: 40,269,934
effective search space: 33947554362
effective search space used: 33947554362
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 71 (32.0 bits)
Lotus: description of TM0189.12