
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0041.7
(1610 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III   218  1e-55
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran...  214  2e-54
POL3_DROME (P04323) Retrovirus-related Pol polyprotein from tran...  196  3e-49
POL2_DROME (P20825) Retrovirus-related Pol polyprotein from tran...  192  8e-48
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei...  188  9e-47
RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protei...  186  6e-46
RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protei...  186  6e-46
POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from tran...  184  1e-45
POL_SFV1 (P23074) Pol polyprotein [Contains: Protease (EC 3.4.23...  169  7e-41
POLY_DROME (P10401) Retrovirus-related Pol polyprotein from tran...  151  2e-35
POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic pro...  139  6e-32
POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic pro...  137  2e-31
POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic pro...  136  5e-31
POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic pro...  134  2e-30
POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic pro...  134  2e-30
POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic prot...  127  3e-28
POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic prot...  123  4e-27
POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;...  120  3e-26
YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II    115  7e-25
POL_FOAMV (P14350) Pol polyprotein [Contains: Reverse transcript...  110  2e-23
>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
Length = 2186
Score = 218 bits (554), Expect = 1e-55
Identities = 168/570 (29%), Positives = 280/570 (48%), Gaps = 38/570 (6%)
Query: 561 ELLKEFKDCFA*DNDEMPGLSRDLVELQLPIKEDKKLVK*LPRRFHPDVLVKIKEEIERL 620
+++++F+D FA +DE+ S E + +KE + ++ PR + +I++ I+++
Sbjct: 908 DVIEQFQDVFAISDDELGRNSG--TECVIELKEGAEPIRQKPRPIPLALKPEIRKMIQKM 965
Query: 621 LRCKFIRAAKYVEWLANVVPVIKKNGKMRVCIDFRDLNAATPKDEYHMPIAEMMVDSAAG 680
L K IR +K W + VV V KK+G +R+CID+R +N + + +P E + S AG
Sbjct: 966 LNQKVIRESKS-PWSSPVVLVKKKDGSIRMCIDYRKVNKVVKNNAHPLPNIEATLQSLAG 1024
Query: 681 HEYLSLLDGYSGYNQIFIAKEDVSKTAFRCSGALGTYEWVVMPFGLKNAGATYQRVMNTI 740
+ ++ D +G+ QI + ++ TAF L +EW V+PFGL + A +Q M I
Sbjct: 1025 KKLYTVFDMIAGFWQIPLDEKSKEITAFAIGSEL--FEWNVLPFGLVISPALFQGTMEEI 1082
Query: 741 FHDFIETFMQVYIDDLVVKSPSRDEHLSHLRKSFERMRIHGLKMNPLKCAFGVIAGDFLG 800
D + VY+DDL++ S ++HL ++++ R+R G+K+ KC ++LG
Sbjct: 1083 IGDLLGVCAFVYVDDLLIASKDMEQHLQDVKEALTRIRKSGMKLRASKCHIAKKEVEYLG 1142
Query: 801 FVVHKKGIEINKNKAKAILDTSPPTSKKQLQSLLGKVNFLRRFIDNLSDKTKPFSSLLRL 860
V G+E + K + S PT+ K+LQS LG V + R+FI N + +SL+
Sbjct: 1143 HKVTLDGVETQEVKTDKMKQFSRPTNVKELQSFLGLVGYYRKFILNFAQIASSLTSLISA 1202
Query: 861 KREDIFRWEA*HQKAFDELKNYLAIPPVMIPP-----IKG-KPMRLYISATDETIGSMLA 914
K I WE + AF ELK + PV+ P +KG +P +Y A+ + IG++LA
Sbjct: 1203 KVAWI--WEKEQEIAFQELKKLVCQTPVLAQPDVEAALKGDRPFMIYTDASRKGIGAVLA 1260
Query: 915 QEDEDGIERAVFYLSRVLNDAKTRYTMIEKLCLCLYFSCTKLKYYIKPIDVMVFSHFDII 974
QE DG + + + S+ L+ A+TRY + + L + F+ + K I + VF+ +
Sbjct: 1261 QEGPDGQQHPIAFASKALSPAETRYHITDLEALAMMFALRRFKTIIYGTAITVFTDHKPL 1320
Query: 975 KHMLSKPILHSRIGKWSLALTEYSLTYAPLKTIKGQAVADFLADHTLP---------EEI 1025
+L L R+ +WS+ + E+ + L K AVAD L+ P +E+
Sbjct: 1321 ISLLKGSPLADRLWRWSIEILEFDVKIVYLAG-KANAVADALSRGGCPPNELEEEQTKEL 1379
Query: 1026 AYV--GLQPWKLFFDDSS------HKEGSGIGMFIVSPQGIPTKFMFRIRESCSNNESEY 1077
+ +Q DSS E G I + +G TK F+I S EY
Sbjct: 1380 TSIVNAIQTELPDILDSSCWLERLKGEDEGWKEVIAALEGGKTKGTFKIVGIESEISLEY 1439
Query: 1078 EALVSGLEILLALGAKNVEVKGDSELVVKQ 1107
+V G+ KN E++ S VV +
Sbjct: 1440 YKIVGGV-------LKNTEIEEQSRSVVPE 1462
Score = 117 bits (294), Expect = 2e-25
Identities = 98/409 (23%), Positives = 185/409 (44%), Gaps = 25/409 (6%)
Query: 1209 TDIKVKYRALNYTIIGNELFKKNVDGTLLKCLSEVDAFIAVSAAHGGLCGAHQAGAKMKW 1268
++I ++Y Y I+G L ++ + E + H G+ H G K W
Sbjct: 1433 SEISLEY----YKIVGGVLKNTEIEEQSRSVVPEKIRTPLLKELHEGMLAGH-FGIKKMW 1487
Query: 1269 ILFRQGMYWPTI---MKDCIEYAKGCQDCQKHSGIQHVPASELHSIIKSWPFRGWAIDLI 1325
+ + YWP + +++C+ C HS + S L ++P A DL+
Sbjct: 1488 RMVHRKFYWPQMRVCVENCVRTCAKCLCANDHSKL----TSSLTPYRMTFPLEIVACDLM 1543
Query: 1326 GEIHPALSRQ-HKYIIVAIDYFIKWVEAIPLQSVTQETVIE-FIQNHIVYRFGLPESLTT 1383
LS Q ++YI+ ID F K+ A+P+ ETV++ F++ + +P L T
Sbjct: 1544 DV---GLSVQGNRYILTIIDLFTKYGTAVPIPDKKAETVLKAFVERWAIGEGRIPLKLLT 1600
Query: 1384 DQGTMFVGRKVAAFAESWGIKLLTSIPYYAQANGQVEAANKILISLIKKHVGRKPKSWHE 1443
DQG FV A F I+ +T+ Y ++ANG VE NK ++ ++KK P W +
Sbjct: 1601 DQGKEFVNGLFAQFTHMLKIEHITTKGYNSRANGAVERFNKTIMHIMKKKTA-VPMEWDD 1659
Query: 1444 SLSQVLWAYRNSPREATGTTPFRLAYGQEAVLPAEVYLQSFRIQRQEDIPSEVYWNMMLD 1503
+ ++AY N E TG TP L +G++ + P E+ + D+ + Y +++
Sbjct: 1660 QVVYAVYAYNNCVHENTGETPMFLMHGRDVMGPLEMSGEDAVGINYADM--DEYKHLLTQ 1717
Query: 1504 ELVNLDEERVLALDVLTRQKDRIAKAYNKKVKNRSFVTGDYVWKVIL--PTDKKDRAYGK 1561
EL+ + + +A + R+++ +++K ++ +V+L P++K K
Sbjct: 1718 ELLKVQK---IAKEHAMREQESYKSLFDQKYASKKHRFPQPGSRVLLEIPSEKLGAQCPK 1774
Query: 1562 WAPNWEGPFKVKKVLSNNAYVIKELSGQRQFVTINDKYLKAYKQMLHEV 1610
W GP++V N+A + L ++ + I + L+ + + ++
Sbjct: 1775 LVNKWSGPYRVISCSENSAEITPVLGKRKHILQIPFENLRVIPEAMPDI 1823
>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
transposon 412 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1237
Score = 214 bits (544), Expect = 2e-54
Identities = 250/1052 (23%), Positives = 442/1052 (41%), Gaps = 107/1052 (10%)
Query: 553 PEL-RGRMIELLKEFKDCFA*DNDEMPGLSRDLVELQLPIKEDKKLVK*LPRRFHPDVLV 611
PEL + ++ + E+ D FA +++ P +L + QL +K+D+ + R H V
Sbjct: 272 PELFKSQLENICSEYIDIFALESE--PITVNNLYKQQLRLKDDEPVYTKNYRSPHSQV-E 328
Query: 612 KIKEEIERLLRCKFIRAAKYVEWLANVVPVIKKNG------KMRVCIDFRDLNAATPKDE 665
+I+ ++++L++ K + + ++ + ++ V KK+ K R+ ID+R +N D+
Sbjct: 329 EIQAQVQKLIKDKIVEPS-VSQYNSPLLLVPKKSSPNSDKKKWRLVIDYRQINKKLLADK 387
Query: 666 YHMPIAEMMVDSAAGHEYLSLLDGYSGYNQIFIAKEDVSKTAFRCSGALGTYEWVVMPFG 725
+ +P + ++D +Y S LD SG++QI + + T+F S G+Y + +PFG
Sbjct: 388 FPLPRIDDILDQLGRAKYFSCLDLMSGFHQIELDEGSRDITSFSTSN--GSYRFTRLPFG 445
Query: 726 LKNAGATYQRVMNTIFHDFIETFMQVYIDDLVVKSPSRDEHLSHLRKSFERMRIHGLKMN 785
LK A ++QR+M F + +Y+DDL+V S L +L + F + R + LK++
Sbjct: 446 LKIAPNSFQRMMTIAFSGIEPSQAFLYMDDLIVIGCSEKHMLKNLTEVFGKCREYNLKLH 505
Query: 786 PLKCAFGVIAGDFLGFVVHKKGIEINKNKAKAILDTSPPTSKKQLQSLLGKVNFLRRFID 845
P KC+F + FLG KGI + K I + P + + N+ RRFI
Sbjct: 506 PEKCSFFMHEVTFLGHKCTDKGILPDDKKYDVIQNYPVPHDADSARRFVAFCNYYRRFIK 565
Query: 846 NLSDKTKPFSSLLRLKREDIFRWEA*HQKAFDELKNYLAIPPVMIPPIKGKPMRLYISAT 905
N +D ++ + L K+ F W QKAF LK+ L P ++ P K + A+
Sbjct: 566 NFADYSRHITRL--CKKNVPFEWTDECQKAFIHLKSQLINPTLLQYPDFSKEFCITTDAS 623
Query: 906 DETIGSMLAQEDEDGIERAVFYLSRVLNDAKTRYTMIEKLCLCLYFSCTKLKYYIKPIDV 965
+ G++L Q + +G + V Y SR ++ + E+ ++++ + YI
Sbjct: 624 KQACGAVLTQ-NHNGHQLPVAYASRAFTKGESNKSTTEQELAAIHWAIIHFRPYIYGKHF 682
Query: 966 MVFSHFDIIKHMLSKPILHSRIGKWSLALTEYSLTYAPLKTIKGQAVADFLADHTLPEEI 1025
V + + ++ S S++ + L L EY+ T LK K VAD L+ T+ E
Sbjct: 683 TVKTDHRPLTYLFSMVNPSSKLTRIRLELEEYNFTVEYLKG-KDNHVADALSRITIKELK 741
Query: 1026 AYVGLQPWKLFFDDSSHKEGSGIGMFIVSPQGIPTKFMFRIRESCSNNESEYEALVSGLE 1085
G + + T+F R ++SC+ E + + E
Sbjct: 742 DITG------------------------NILKVTTRFQSR-QKSCAGKE-QLDLQKQTKE 775
Query: 1086 ILLALGAKNVEVKGDSELVVKQLTKEYKCISKNLAKYYVKAMSLLANFDQAGVSYIPRV- 1144
I V + VV + C+ K+ K ++A +D G Y +
Sbjct: 776 IASEPNVYEVITNDEVRKVVTLQLNDSICLFKHGKK-------IIARYD-VGDLYTNGIL 827
Query: 1145 -SNQEANELAQIASGYMIDKQKLKELIRIKEKLSPLDLEVMVIDNLTPNDWRKPIVEYLQ 1203
+Q L A Y I + K+ +I E +S + M N K + L
Sbjct: 828 DLDQFLQRLELQAGIYDISQIKMAPWKKIFEHVSIDKFKNM------GNKILKNLKVALL 881
Query: 1204 NPVGSTDIKVKYRALNYTIIGNELFKKNVDGTLLKCLSEVDAFIAVSAAHGGLCGAHQAG 1263
NPV T I NE K+ + TL GG G +
Sbjct: 882 NPV--------------TQINNEKEKEAILSTLHD-----------DPIQGGHTGITKTL 916
Query: 1264 AKMKWILFRQGMYWPTIMKDCIEYAKGCQDCQKHSGIQHVPASELHSIIKSWPFRGWAID 1323
AK+K + YW + K EY + CQ CQK +H + F +D
Sbjct: 917 AKVK-----RHYYWKNMSKYIKEYVRKCQKCQKAKTTKHTKTPMTITETPEHAFDRVVVD 971
Query: 1324 LIGEIHPALSRQHKYIIVAIDYFIKWVEAIPLQSVTQETVIEFIQNHIVYRFGLPESLTT 1383
IG + P ++Y + I K++ AIP+ + + +TV + I + ++G ++ T
Sbjct: 972 TIGPL-PKSENGNEYAVTLICDLTKYLVAIPIANKSAKTVAKAIFESFILKYGPMKTFIT 1030
Query: 1384 DQGTMFVGRKVAAFAESWGIKLLTSIPYYAQANGQVEAANKILISLIKKHVGRKPKSWHE 1443
D GT + + + IK +TS ++ Q G VE +++ L I+ ++ W
Sbjct: 1031 DMGTEYKNSIITDLCKYLKIKNITSTAHHHQTVGVVERSHRTLNEYIRSYISTDKTDWDV 1090
Query: 1444 SLSQVLWAYRNSPREATGTTPFRLAYGQEAVLPAEV-YLQSFR-IQRQEDIPSEVYWNMM 1501
L ++ + + P+ L +G+ + LP L S I +D E +
Sbjct: 1091 WLQYFVYCFNTTQSMVHNYCPYELVFGRTSNLPKHFNKLHSIEPIYNIDDYAKESKY--- 1147
Query: 1502 LDELVNLDEERVLALDVLTRQKDRIAKAYNKKVKNRSFVTGDYVWKVILPTDKKDRAYGK 1561
L+ A +L K++ + Y+ KVK+ GD KV+L + + K
Sbjct: 1148 -----RLEVAYARARKLLEAHKEKNKENYDLKVKDIELEVGD---KVLLRNEVGHKLDFK 1199
Query: 1562 WAPNWEGPFKVKKVLSNNAYVIKELSGQRQFV 1593
+ GP+K++ + NN + ++Q V
Sbjct: 1200 YT----GPYKIESIGDNNNITLLTNKNKKQIV 1227
>POL3_DROME (P04323) Retrovirus-related Pol polyprotein from
transposon 17.6 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1058
Score = 196 bits (499), Expect = 3e-49
Identities = 172/667 (25%), Positives = 305/667 (44%), Gaps = 66/667 (9%)
Query: 373 KPLFVWAKVEDKGVNKILVDGGAAINLMPRFMMKRLGKTETDLIPHDMVLSDYEGKTSTS 432
KP ++ K ++ + K L+D G+ +N+ + + D+ + + TS
Sbjct: 12 KPQYITIKYKENNL-KCLIDTGSTVNMTSKNIF-------------DLPIQNTSTFIHTS 57
Query: 433 MGAIMLNITAGTVSR-----STLFIVVPSKANYNLLLGREWIHGVGAVPSTLHQRISIWK 487
G +++N + S+ + F++ P NY+LLLGR+ + A S Q ++++
Sbjct: 58 NGPLIVNKSIIIPSKILFPTTNEFLLHPFSENYDLLLGRKLLAEAKATISYRDQEVTLY- 116
Query: 488 PDGVVENVQADQSYYLAEA-GYVDKKNFEKSPLVDATKMEAQDPLEKVDLGDGSKKRPTY 546
+ Y L E ++ +F+ ++ T + + + + D Y
Sbjct: 117 ----------NNKYKLIEGIATHEQSHFQNVNMIPDTMLRQPNKISPILESD------LY 160
Query: 547 ISSLIDPELRGRMIELLKEFKDCFA*DNDEMPGLSRDLVELQ----LPIKEDKKLVK*LP 602
++ E + R+ LL+++ D + D++ ++ + LP+ P
Sbjct: 161 RLEHLNNEEKQRLCALLQKYHDIQYHEGDKLTFTNQTKHTINTKHNLPLYSKYSY----P 216
Query: 603 RRFHPDVLVKIKEEIERLLRCKFIRAAKYVE----WLANVVPVIKKNGKMRVCIDFRDLN 658
+ + +V + +I+ +L IR + W+ K R+ ID+R LN
Sbjct: 217 QAYEQEV----ESQIQDMLNQGIIRTSNSPYNSPIWVVPKKQDASGKQKFRIVIDYRKLN 272
Query: 659 AATPKDEYHMPIAEMMVDSAAGHEYLSLLDGYSGYNQIFIAKEDVSKTAFRCSGALGTYE 718
T D + +P + ++ Y + +D G++QI + E VSKTAF S G YE
Sbjct: 273 EITVGDRHPIPNMDEILGKLGRCNYFTTIDLAKGFHQIEMDPESVSKTAF--STKHGHYE 330
Query: 719 WVVMPFGLKNAGATYQRVMNTIFHDFIETFMQVYIDDLVVKSPSRDEHLSHLRKSFERMR 778
++ MPFGLKNA AT+QR MN I + VY+DD++V S S DEHL L FE++
Sbjct: 331 YLRMPFGLKNAPATFQRCMNDILRPLLNKHCLVYLDDIIVFSTSLDEHLQSLGLVFEKLA 390
Query: 779 IHGLKMNPLKCAFGVIAGDFLGFVVHKKGIEINKNKAKAILDTSPPTSKKQLQSLLGKVN 838
LK+ KC F FLG V+ GI+ N K +AI PT K++++ LG
Sbjct: 391 KANLKLQLDKCEFLKQETTFLGHVLTPDGIKPNPEKIEAIQKYPIPTKPKEIKAFLGLTG 450
Query: 839 FLRRFIDNLSDKTKPFSSLLRLKREDIFRWEA*HQKAFDELKNYLAIPPVMIPPIKGKPM 898
+ R+FI N +D KP + L+ K I + AF +LK ++ P++ P K
Sbjct: 451 YYRKFIPNFADIAKPMTKCLK-KNMKIDTTNPEYDSAFKKLKYLISEDPILKVPDFTKKF 509
Query: 899 RLYISATDETIGSMLAQEDEDGIERAVFYLSRVLNDAKTRYTMIEKLCLCLYFSCTKLKY 958
L A+D +G++L+Q+ + Y+SR LN+ + Y+ IEK L + ++ ++
Sbjct: 510 TLTTDASDVALGAVLSQDG-----HPLSYISRTLNEHEINYSTIEKELLAIVWATKTFRH 564
Query: 959 YIKPIDVMVFSHFDIIKHMLSKPILHSRIGKWSLALTEYSLTYAPLKTIKGQ--AVADFL 1016
Y+ + S + + +S++ +W + L+E+ +K IKG+ VAD L
Sbjct: 565 YLLGRHFEISSDHQPLSWLYRMKDPNSKLTRWRVKLSEFDF---DIKYIKGKENCVADAL 621
Query: 1017 ADHTLPE 1023
+ L E
Sbjct: 622 SRIKLEE 628
>POL2_DROME (P20825) Retrovirus-related Pol polyprotein from
transposon 297 [Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1059
Score = 192 bits (487), Expect = 8e-48
Identities = 167/665 (25%), Positives = 313/665 (46%), Gaps = 79/665 (11%)
Query: 376 FVWAKVEDKGVN-KILVDGGAAINLMPRFMMKRLGKTETDLIPHDMVLSDYEGKTSTSMG 434
+ + K+ KG + K L+D G+ IN++ + + + + + TS G
Sbjct: 13 YEYIKIVYKGRSYKCLLDTGSTINMINENIFC-------------LPIQNSRCEVLTSNG 59
Query: 435 AIMLN----ITAGTVSRSTL-FIVVPSKANYNLLLGREWIHGVGAVPSTLHQRISIWKPD 489
I LN + ++ + T F V NY++L+GR+ + +V + + ++++
Sbjct: 60 PITLNDLIMLPRNSIFKKTEPFYVHRFSNNYDMLIGRKLLKNAQSVINYKNDTVTLF--- 116
Query: 490 GVVENVQADQSYYLAEAGYVDKKNFEKSPLVDATKMEAQDPLEKVDLGDGSKKRPTYISS 549
DQ+Y L + +N ++ Q+ ++K+D S+ R +++
Sbjct: 117 --------DQTYKLITSESERNQNLYIQRTPESIASSDQESIKKLDF---SQFRLDHLNQ 165
Query: 550 LIDPELRGRMIELLKEFKDCFA*DNDEMPGLSR-----------DLVELQLPIKEDKKLV 598
+L+G LL +F++ + +++ + + Q P+ + +
Sbjct: 166 EETFKLKG----LLNKFRNLEYKEGEKLTFTNTIKHVLNTTHNSPIYSKQYPLAQTHE-- 219
Query: 599 K*LPRRFHPDVLVKIKEEIERLLRCKFIRAAKYV----EWLANVVPVIKKNGKMRVCIDF 654
++++ +++ +L IR + W+ P K RV ID+
Sbjct: 220 ------------IEVENQVQEMLNQGLIRESNSPYNSPTWVVPKKPDASGANKYRVVIDY 267
Query: 655 RDLNAATPKDEYHMPIAEMMVDSAAGHEYLSLLDGYSGYNQIFIAKEDVSKTAFRCSGAL 714
R LN T D Y +P + ++ +Y + +D G++QI + +E +SKTAF S
Sbjct: 268 RKLNEITIPDRYPIPNMDEILGKLGKCQYFTTIDLAKGFHQIEMDEESISKTAF--STKS 325
Query: 715 GTYEWVVMPFGLKNAGATYQRVMNTIFHDFIETFMQVYIDDLVVKSPSRDEHLSHLRKSF 774
G YE++ MPFGL+NA AT+QR MN I + VY+DD+++ S S EHL+ ++ F
Sbjct: 326 GHYEYLRMPFGLRNAPATFQRCMNNILRPLLNKHCLVYLDDIIIFSTSLTEHLNSIQLVF 385
Query: 775 ERMRIHGLKMNPLKCAFGVIAGDFLGFVVHKKGIEINKNKAKAILDTSPPTSKKQLQSLL 834
++ LK+ KC F +FLG +V GI+ N K KAI+ PT K++++ L
Sbjct: 386 TKLADANLKLQLDKCEFLKKEANFLGHIVTPDGIKPNPIKVKAIVSYPIPTKDKEIRAFL 445
Query: 835 GKVNFLRRFIDNLSDKTKPFSSLLRLKREDIFRWEA*HQKAFDELKNYLAIPPVMIPPIK 894
G + R+FI N +D KP +S L+ KR I + + +AF++LK + P++ P
Sbjct: 446 GLTGYYRKFIPNYADIAKPMTSCLK-KRTKIDTQKLEYIEAFEKLKALIIRDPILQLPDF 504
Query: 895 GKPMRLYISATDETIGSMLAQEDEDGIERAVFYLSRVLNDAKTRYTMIEKLCLCLYFSCT 954
K L A++ +G++L+Q + ++SR LND + Y+ IEK L + ++
Sbjct: 505 EKKFVLTTDASNLALGAVLSQNG-----HPISFISRTLNDHELNYSAIEKELLAIVWATK 559
Query: 955 KLKYYIKPIDVMVFSHFDIIK--HMLSKPILHSRIGKWSLALTEYSLTYAPLKTIKGQAV 1012
++Y+ ++ S ++ H L +P +++ +W + L+EY +K K +V
Sbjct: 560 TFRHYLLGRQFLIASDHQPLRWLHNLKEP--GAKLERWRVRLSEYQFKIDYIKG-KENSV 616
Query: 1013 ADFLA 1017
AD L+
Sbjct: 617 ADALS 621
>RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protein
type 1
Length = 1333
Score = 188 bits (478), Expect = 9e-47
Identities = 134/472 (28%), Positives = 246/472 (51%), Gaps = 32/472 (6%)
Query: 547 ISSLI-DPELRGRMIELLKEFKDCFA*DNDEMPGLSRDLVELQLPIKEDKKLVK*LPRRF 605
+S+++ +PEL ++ KEFKD A N E L + + L+ ++ ++ + LP R
Sbjct: 365 VSNIVKEPELP----DIYKEFKDITAETNTEK--LPKPIKGLEFEVELTQENYR-LPIRN 417
Query: 606 HP---DVLVKIKEEIERLLRCKFIRAAKYVEWLANVVPVI---KKNGKMRVCIDFRDLNA 659
+P + + +EI + L+ IR +K + N PV+ KK G +R+ +D++ LN
Sbjct: 418 YPLPPGKMQAMNDEINQGLKSGIIRESKAI----NACPVMFVPKKEGTLRMVVDYKPLNK 473
Query: 660 ATPKDEYHMPIAEMMVDSAAGHEYLSLLDGYSGYNQIFIAKEDVSKTAFRCSGALGTYEW 719
+ Y +P+ E ++ G + LD S Y+ I + K D K AFRC G +E+
Sbjct: 474 YVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPR--GVFEY 531
Query: 720 VVMPFGLKNAGATYQRVMNTIFHDFIETFMQVYIDDLVVKSPSRDEHLSHLRKSFERMRI 779
+VMP+G+ A A +Q +NTI + E+ + Y+DD+++ S S EH+ H++ ++++
Sbjct: 532 LVMPYGISTAPAHFQYFINTILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKN 591
Query: 780 HGLKMNPLKCAFGVIAGDFLGFVVHKKGIEINKNKAKAILDTSPPTSKKQLQSLLGKVNF 839
L +N KC F F+G+ + +KG + +L P ++K+L+ LG VN+
Sbjct: 592 ANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNY 651
Query: 840 LRRFIDNLSDKTKPFSSLLRLKREDIFRWEA*HQKAFDELKNYLAIPPVMIPPIKGKPMR 899
LR+FI S T P ++L LK++ ++W +A + +K L PPV+ K +
Sbjct: 652 LRKFIPKTSQLTHPLNNL--LKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKIL 709
Query: 900 LYISATDETIGSMLAQEDEDGIERAVFYLSRVLNDAKTRYTMIEKLCLCLYFSCTKLKYY 959
L A+D +G++L+Q+ +D V Y S ++ A+ Y++ +K L + S ++Y
Sbjct: 710 LETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHY 769
Query: 960 ----IKPIDVMVFSHFDIIKHML--SKPILHSRIGKWSLALTEYS--LTYAP 1003
I+P ++ H ++I + S+P + R+ +W L L +++ + Y P
Sbjct: 770 LESTIEPFKILT-DHRNLIGRITNESEP-ENKRLARWQLFLQDFNFEINYRP 819
Score = 91.7 bits (226), Expect = 2e-17
Identities = 75/308 (24%), Positives = 127/308 (40%), Gaps = 20/308 (6%)
Query: 1277 WPTIMKDCIEYAKGCQDCQKHSGIQHVPASELHSIIKSW-PFRGWAIDLIGEIHPALSRQ 1335
W I K EY + C CQ + H P L I S P+ ++D I + S
Sbjct: 943 WKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPE--SSG 1000
Query: 1336 HKYIIVAIDYFIKWVEAIPL-QSVTQETVIEFIQNHIVYRFGLPESLTTDQGTMFVGRKV 1394
+ + V +D F K +P +S+T E ++ FG P+ + D +F +
Sbjct: 1001 YNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTW 1060
Query: 1395 AAFAESWGIKLLTSIPYYAQANGQVEAANKILISLIKKHVGRKPKSWHESLSQVLWAYRN 1454
FA + + S+PY Q +GQ E N+ + L++ P +W + +S V +Y N
Sbjct: 1061 KDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNN 1120
Query: 1455 SPREATGTTPFRLAYGQEAVLPAEVYLQSFRIQRQEDIPSEVYWNMMLDELVNLDEERVL 1514
+ AT TPF + + L + + L SF + E+ + + E +N +
Sbjct: 1121 AIHSATQMTPFEIVHRYSPAL-SPLELPSFSDKTDENSQETIQVFQTVKEHLNTN----- 1174
Query: 1515 ALDVLTRQKDRIAKAYNKKVKN-RSFVTGDYVWKVILPTDKKDRAYGKWAPNWEGPFKVK 1573
++ K ++ K++ F GD V T ++ K AP++ GPF V
Sbjct: 1175 --------NIKMKKYFDMKIQEIEEFQPGDLVMVKRTKTGFLHKS-NKLAPSFAGPFYVL 1225
Query: 1574 KVLSNNAY 1581
+ N Y
Sbjct: 1226 QKSGPNNY 1233
>RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protein
type 3
Length = 1333
Score = 186 bits (471), Expect = 6e-46
Identities = 133/472 (28%), Positives = 246/472 (51%), Gaps = 32/472 (6%)
Query: 547 ISSLI-DPELRGRMIELLKEFKDCFA*DNDEMPGLSRDLVELQLPIKEDKKLVK*LPRRF 605
+S+++ +PEL ++ KEFKD A N E L + + L+ ++ ++ + LP R
Sbjct: 365 VSNIVKEPELP----DIYKEFKDITAETNTEK--LPKPIKGLEFEVELTQENYR-LPIRN 417
Query: 606 HP---DVLVKIKEEIERLLRCKFIRAAKYVEWLANVVPVI---KKNGKMRVCIDFRDLNA 659
+P + + +EI + L+ IR +K + N PV+ KK G +R+ +D++ LN
Sbjct: 418 YPLPPGKMQAMNDEINQGLKSGIIRESKAI----NACPVMFVPKKEGTLRMVVDYKPLNK 473
Query: 660 ATPKDEYHMPIAEMMVDSAAGHEYLSLLDGYSGYNQIFIAKEDVSKTAFRCSGALGTYEW 719
+ Y +P+ E ++ G + LD S Y+ I + K D K AFRC G +E+
Sbjct: 474 YVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPR--GVFEY 531
Query: 720 VVMPFGLKNAGATYQRVMNTIFHDFIETFMQVYIDDLVVKSPSRDEHLSHLRKSFERMRI 779
+VMP+G+ A A +Q +NTI + E+ + Y+D++++ S S EH+ H++ ++++
Sbjct: 532 LVMPYGISIAPAHFQYFINTILGEVKESHVVCYMDNILIHSKSESEHVKHVKDVLQKLKN 591
Query: 780 HGLKMNPLKCAFGVIAGDFLGFVVHKKGIEINKNKAKAILDTSPPTSKKQLQSLLGKVNF 839
L +N KC F F+G+ + +KG + +L P ++K+L+ LG VN+
Sbjct: 592 ANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNY 651
Query: 840 LRRFIDNLSDKTKPFSSLLRLKREDIFRWEA*HQKAFDELKNYLAIPPVMIPPIKGKPMR 899
LR+FI S T P ++L LK++ ++W +A + +K L PPV+ K +
Sbjct: 652 LRKFIPKTSQLTHPLNNL--LKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKIL 709
Query: 900 LYISATDETIGSMLAQEDEDGIERAVFYLSRVLNDAKTRYTMIEKLCLCLYFSCTKLKYY 959
L A+D +G++L+Q+ +D V Y S ++ A+ Y++ +K L + S ++Y
Sbjct: 710 LETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHY 769
Query: 960 ----IKPIDVMVFSHFDIIKHML--SKPILHSRIGKWSLALTEYS--LTYAP 1003
I+P ++ H ++I + S+P + R+ +W L L +++ + Y P
Sbjct: 770 LESTIEPFKILT-DHRNLIGRITNESEP-ENKRLARWQLFLQDFNFEINYRP 819
Score = 91.7 bits (226), Expect = 2e-17
Identities = 75/308 (24%), Positives = 127/308 (40%), Gaps = 20/308 (6%)
Query: 1277 WPTIMKDCIEYAKGCQDCQKHSGIQHVPASELHSIIKSW-PFRGWAIDLIGEIHPALSRQ 1335
W I K EY + C CQ + H P L I S P+ ++D I + S
Sbjct: 943 WKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPE--SSG 1000
Query: 1336 HKYIIVAIDYFIKWVEAIPL-QSVTQETVIEFIQNHIVYRFGLPESLTTDQGTMFVGRKV 1394
+ + V +D F K +P +S+T E ++ FG P+ + D +F +
Sbjct: 1001 YNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTW 1060
Query: 1395 AAFAESWGIKLLTSIPYYAQANGQVEAANKILISLIKKHVGRKPKSWHESLSQVLWAYRN 1454
FA + + S+PY Q +GQ E N+ + L++ P +W + +S V +Y N
Sbjct: 1061 KDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNN 1120
Query: 1455 SPREATGTTPFRLAYGQEAVLPAEVYLQSFRIQRQEDIPSEVYWNMMLDELVNLDEERVL 1514
+ AT TPF + + L + + L SF + E+ + + E +N +
Sbjct: 1121 AIHSATQMTPFEIVHRYSPAL-SPLELPSFSDKTDENSQETIQVFQTVKEHLNTN----- 1174
Query: 1515 ALDVLTRQKDRIAKAYNKKVKN-RSFVTGDYVWKVILPTDKKDRAYGKWAPNWEGPFKVK 1573
++ K ++ K++ F GD V T ++ K AP++ GPF V
Sbjct: 1175 --------NIKMKKYFDMKIQEIEEFQPGDLVMVKRTKTGFLHKS-NKLAPSFAGPFYVL 1225
Query: 1574 KVLSNNAY 1581
+ N Y
Sbjct: 1226 QKSGPNNY 1233
>RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protein
type 2
Length = 1333
Score = 186 bits (471), Expect = 6e-46
Identities = 133/472 (28%), Positives = 246/472 (51%), Gaps = 32/472 (6%)
Query: 547 ISSLI-DPELRGRMIELLKEFKDCFA*DNDEMPGLSRDLVELQLPIKEDKKLVK*LPRRF 605
+S+++ +PEL ++ KEFKD A N E L + + L+ ++ ++ + LP R
Sbjct: 365 VSNIVKEPELP----DIYKEFKDITAETNTEK--LPKPIKGLEFEVELTQENYR-LPIRN 417
Query: 606 HP---DVLVKIKEEIERLLRCKFIRAAKYVEWLANVVPVI---KKNGKMRVCIDFRDLNA 659
+P + + +EI + L+ IR +K + N PV+ KK G +R+ +D++ LN
Sbjct: 418 YPLPPGKMQAMNDEINQGLKSGIIRESKAI----NACPVMFVPKKEGTLRMVVDYKPLNK 473
Query: 660 ATPKDEYHMPIAEMMVDSAAGHEYLSLLDGYSGYNQIFIAKEDVSKTAFRCSGALGTYEW 719
+ Y +P+ E ++ G + LD S Y+ I + K D K AFRC G +E+
Sbjct: 474 YVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPR--GVFEY 531
Query: 720 VVMPFGLKNAGATYQRVMNTIFHDFIETFMQVYIDDLVVKSPSRDEHLSHLRKSFERMRI 779
+VMP+G+ A A +Q +NTI + E+ + Y+D++++ S S EH+ H++ ++++
Sbjct: 532 LVMPYGISIAPAHFQYFINTILGEVKESHVVCYMDNILIHSKSESEHVKHVKDVLQKLKN 591
Query: 780 HGLKMNPLKCAFGVIAGDFLGFVVHKKGIEINKNKAKAILDTSPPTSKKQLQSLLGKVNF 839
L +N KC F F+G+ + +KG + +L P ++K+L+ LG VN+
Sbjct: 592 ANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNY 651
Query: 840 LRRFIDNLSDKTKPFSSLLRLKREDIFRWEA*HQKAFDELKNYLAIPPVMIPPIKGKPMR 899
LR+FI S T P ++L LK++ ++W +A + +K L PPV+ K +
Sbjct: 652 LRKFIPKTSQLTHPLNNL--LKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKIL 709
Query: 900 LYISATDETIGSMLAQEDEDGIERAVFYLSRVLNDAKTRYTMIEKLCLCLYFSCTKLKYY 959
L A+D +G++L+Q+ +D V Y S ++ A+ Y++ +K L + S ++Y
Sbjct: 710 LETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHY 769
Query: 960 ----IKPIDVMVFSHFDIIKHML--SKPILHSRIGKWSLALTEYS--LTYAP 1003
I+P ++ H ++I + S+P + R+ +W L L +++ + Y P
Sbjct: 770 LESTIEPFKILT-DHRNLIGRITNESEP-ENKRLARWQLFLQDFNFEINYRP 819
Score = 91.7 bits (226), Expect = 2e-17
Identities = 75/308 (24%), Positives = 127/308 (40%), Gaps = 20/308 (6%)
Query: 1277 WPTIMKDCIEYAKGCQDCQKHSGIQHVPASELHSIIKSW-PFRGWAIDLIGEIHPALSRQ 1335
W I K EY + C CQ + H P L I S P+ ++D I + S
Sbjct: 943 WKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPE--SSG 1000
Query: 1336 HKYIIVAIDYFIKWVEAIPL-QSVTQETVIEFIQNHIVYRFGLPESLTTDQGTMFVGRKV 1394
+ + V +D F K +P +S+T E ++ FG P+ + D +F +
Sbjct: 1001 YNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTW 1060
Query: 1395 AAFAESWGIKLLTSIPYYAQANGQVEAANKILISLIKKHVGRKPKSWHESLSQVLWAYRN 1454
FA + + S+PY Q +GQ E N+ + L++ P +W + +S V +Y N
Sbjct: 1061 KDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNN 1120
Query: 1455 SPREATGTTPFRLAYGQEAVLPAEVYLQSFRIQRQEDIPSEVYWNMMLDELVNLDEERVL 1514
+ AT TPF + + L + + L SF + E+ + + E +N +
Sbjct: 1121 AIHSATQMTPFEIVHRYSPAL-SPLELPSFSDKTDENSQETIQVFQTVKEHLNTN----- 1174
Query: 1515 ALDVLTRQKDRIAKAYNKKVKN-RSFVTGDYVWKVILPTDKKDRAYGKWAPNWEGPFKVK 1573
++ K ++ K++ F GD V T ++ K AP++ GPF V
Sbjct: 1175 --------NIKMKKYFDMKIQEIEEFQPGDLVMVKRTKTGFLHKS-NKLAPSFAGPFYVL 1225
Query: 1574 KVLSNNAY 1581
+ N Y
Sbjct: 1226 QKSGPNNY 1233
>POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from
transposon opus [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1003
Score = 184 bits (468), Expect = 1e-45
Identities = 127/422 (30%), Positives = 215/422 (50%), Gaps = 22/422 (5%)
Query: 612 KIKEEIERLLRCKFIRAAKYVE----WLANVVPVIKKNGKMRVCIDFRDLNAATPKDEYH 667
+++ +I+ LL+ IR + W+ P + R+ +DF+ LN T D Y
Sbjct: 138 EVERQIDELLQDGIIRPSNSPYNSPIWIVPKKPKPNGEKQYRMVVDFKRLNTVTIPDTYP 197
Query: 668 MPIAEMMVDSAAGHEYLSLLDGYSGYNQIFIAKEDVSKTAFRCSGALGTYEWVVMPFGLK 727
+P + S +Y + LD SG++QI + + D+ KTAF S G YE++ +PFGLK
Sbjct: 198 IPDINATLASLGNAKYFTTLDLTSGFHQIHMKESDIPKTAF--STLNGKYEFLRLPFGLK 255
Query: 728 NAGATYQRVMNTIFHDFIETFMQVYIDDLVVKSPSRDEHLSHLRKSFERMRIHGLKMNPL 787
NA A +QR+++ I + I VYIDD++V S D H +LR + L++N
Sbjct: 256 NAPAIFQRMIDDILREHIGKVCYVYIDDIIVFSEDYDTHWKNLRLVLASLSKANLQVNLE 315
Query: 788 KCAFGVIAGDFLGFVVHKKGIEINKNKAKAILDTSPPTSKKQLQSLLGKVNFLRRFIDNL 847
K F +FLG++V GI+ + K +AI + PPTS K+L+ LG ++ R+FI +
Sbjct: 316 KSHFLDTQVEFLGYIVTADGIKADPKKVRAISEMPPPTSVKELKRFLGMTSYYRKFIQDY 375
Query: 848 SDKTKPFSSLLRLKREDI---------FRWEA*HQKAFDELKNYLAIPPVMIPPIKGKPM 898
+ KP ++L R +I + ++F++LK+ L ++ P KP
Sbjct: 376 AKVAKPLTNLTRGLYANIKSSQSSKVPITLDETALQSFNDLKSILCSSEILAFPCFTKPF 435
Query: 899 RLYISATDETIGSMLAQEDEDGIERAVFYLSRVLNDAKTRYTMIEKLCLCLYFSCTKLKY 958
L A++ IG++L+Q+D+ G +R + Y+SR LN + Y IEK L + +S L+
Sbjct: 436 HLTTDASNWAIGAVLSQDDQ-GRDRPIAYISRSLNKTEENYATIEKEMLAIIWSLDNLRA 494
Query: 959 YIKPI-DVMVFSHFDIIKHMLSKPILHSRIGKWSLALTEYS--LTYAPLKTIKGQAVADF 1015
Y+ + V++ + L ++++ +W + EY+ L Y P K+ VAD
Sbjct: 495 YLYGAGTIKVYTDHQPLTFALGNRNFNAKLKRWKARIEEYNCELIYKPGKS---NVVADA 551
Query: 1016 LA 1017
L+
Sbjct: 552 LS 553
Score = 33.1 bits (74), Expect = 6.3
Identities = 65/333 (19%), Positives = 122/333 (36%), Gaps = 44/333 (13%)
Query: 1259 AHQAGAKMKWILFRQGMYWPTIMKDCIEYAKGCQDCQKHSGIQHVPASELHSI-IKSWPF 1317
AH+ +++ L + Y+P + CQ C+ + +H L I ++P
Sbjct: 704 AHRGPTEIRLQLLEK-YYFPRMSSTIRLQTSSCQCCKLYKYERHPNKPNLQPTPIPNYPC 762
Query: 1318 RGWAIDLIGEIHPALSRQHKYIIVAIDYFIKWVEAIPLQSVTQETVIEFIQNHIVYRFGL 1377
ID+ + + + + ID F K+ + LQS + E + + Y F
Sbjct: 763 EILHIDIF-------ALEKRLYLSCIDKFSKFAKLFHLQSKASVHLRETLVEALHY-FTA 814
Query: 1378 PESLTTDQGTMFVGRKVAAFAESWGIKLLTSIPYYAQANGQVEAANKILISLIKKHVGRK 1437
P+ L +D + V + S I L + ++ NGQVE + + + +
Sbjct: 815 PKVLVSDNERGLLCPTVLNYLRSLDIDLYYAPTQKSEVNGQVERFHSTFLEIYRCLKDEL 874
Query: 1438 PKSWHESLSQV-LWAYRNSPREATGTTPFRLAYGQEAVLPAEVYLQSFRIQRQEDIPSEV 1496
P L + + Y S T P + + + + + + L FR Q EDI +
Sbjct: 875 PTFKPVELVHIAVDRYNTSVHSVTNRKPADVFFDRSSRVNYQ-GLTDFRRQTLEDIKGLI 933
Query: 1497 YWNMMLDELVNLDEERVLALDVLTRQKDRIAKAYNKKVKNRSFVTGDYVWKVILPTDKKD 1556
+ + + R K+R + +S+ GD V+ K+
Sbjct: 934 EYKQIRGN--------------MARNKNR--------DEPKSYGPGDEVFVANKQIKTKE 971
Query: 1557 RAYGKWAPNWEGPFKVKKVLSNNAYVIKELSGQ 1589
+A F+ +KV +N +K SG+
Sbjct: 972 KA----------RFRCEKVQEDNKITVKTRSGK 994
>POL_SFV1 (P23074) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
3.1.26.4) (RT); Integrase (IN)]
Length = 1161
Score = 169 bits (427), Expect = 7e-41
Identities = 217/999 (21%), Positives = 399/999 (39%), Gaps = 107/999 (10%)
Query: 638 VVPVIKKNGKMRVCIDFRDLNAATPKDEYHMPIAEMMVDSAAGHEYLSLLDGYSGYNQIF 697
V PV K +GK R+ +D+R++N P + ++ S +Y + LD +G+
Sbjct: 214 VYPVPKPDGKWRMVLDYREVNKTIPLIAAQNQHSAGILSSIYRGKYKTTLDLTNGFWAHP 273
Query: 698 IAKEDVSKTAFRCSGALGTYEWVVMPFGLKNAGATYQRVMNTIFHDFIETFMQVYIDDLV 757
I E TAF G Y W +P G N+ A + + + + +Q Y+DD+
Sbjct: 274 ITPESYWLTAFTWQGK--QYCWTRLPQGFLNSPALFTADVVDLLKEIPN--VQAYVDDIY 329
Query: 758 VKSPSRDEHLSHLRKSFERMRIHGLKMNPLKCAFGVIAGDFLGFVVHKKGIEINKNKAKA 817
+ EHL L K F + G ++ K +FLGF + K+G + +
Sbjct: 330 ISHDDPQEHLEQLEKIFSILLNAGYVVSLKKSEIAQREVEFLGFNITKEGRGLTDTFKQK 389
Query: 818 ILDTSPPTSKKQLQSLLGKVNFLRRFIDNLSDKTKPFSSLLRLKREDIFRWEA*HQKAFD 877
+L+ +PP KQLQS+LG +NF R FI N S+ KP +++ W + +
Sbjct: 390 LLNITPPKDLKQLQSILGLLNFARNFIPNYSELVKPLYTIVANANGKFISWT---EDNSN 446
Query: 878 ELKNYLAIPPVMIPPIKGKPMRLYISATDETIGSMLAQEDEDGIERAVFYLSRVLNDAKT 937
+L++ +++ + P I + + + + +G +R + Y++ + + A+
Sbjct: 447 QLQHIISVLNQADNLEERNPETRLIIKVNSSPSAGYIRYYNEGSKRPIMYVNYIFSKAEA 506
Query: 938 RYTMIEKLCLCLYFSCTKLKYYIKPIDVMVFSHFDIIKHMLSKPI-----LHSRIGKWSL 992
++T EKL ++ K +++V+S + + P+ L R W
Sbjct: 507 KFTQTEKLLTTMHKGLIKAMDLAMGQEILVYSPIVSMTKIQRTPLPERKALPVRWITWMT 566
Query: 993 ALTE------YSLTYAPLKTIKGQAVADFLADHTLPEEIAYVGLQPWKLFFDDSSH---- 1042
L + Y + L+ I D +A P E A V F+ D S
Sbjct: 567 YLEDPRIQFHYDKSLPELQQIP-NVTEDVIAKTKHPSEFAMV-------FYTDGSAIKHP 618
Query: 1043 --KEGSGIGMFIVSPQGIPTKFMFRIRESCSNNESEYEALVSGLEILLALGAKNVEVKGD 1100
+ GM I Q IP ++I S ++ A ++ + + K +++ G
Sbjct: 619 DVNKSHSAGMGIAQVQFIPE---YKIVHQWSIPLGDHTAQLAEIAAVEFACKKALKISGP 675
Query: 1101 SELVVKQLTKEYKCISKNLAKYYVKAMSLLANFDQAGVSYIPRVSN-QEANELAQIASGY 1159
+V Y S N Y K+ L N + + VS + E Q+
Sbjct: 676 VLIVTDSF---YVAESANKELPYWKSNGFLNNKKKP----LRHVSKWKSIAECLQLKPDI 728
Query: 1160 MIDKQK---------LKELIRIKEKLSPLDLEVMVIDNLTPNDWRKPIVEYLQN--PVGS 1208
+I +K E + +KL+ +V N TP+ + + LQ P G
Sbjct: 729 IIMHEKGHQQPMTTLHTEGNNLADKLATQG-SYVVHCNTTPS-LDAELDQLLQGHYPPGY 786
Query: 1209 TDIKVKYRALNYTIIGNELFKKNVDGTLLKCLSEVDAFIAVSAAHGGLCGAHQAGAKMKW 1268
+ YT+ N+L + +G + + D +S AH G +
Sbjct: 787 P------KQYKYTLEENKLIVERPNGIRI-VPPKADREKIISTAH----NIAHTGRDATF 835
Query: 1269 ILFRQGMYWPTIMKDCIEYAKGCQDCQKHSGIQHVPASELHSIIKSWPFRGWAIDLIGEI 1328
+ +WP + KD ++ + C+ C + L + PF + ID IG +
Sbjct: 836 LKVSSKYWWPNLRKDVVKSIRQCKQCLVTNATNLTSPPILRPVKPLKPFDKFYIDYIGPL 895
Query: 1329 HPALSRQHKYIIVAIDYFIKWVEAIPLQSVTQETVIEFIQNHIVYRFGLPESLTTDQGTM 1388
P S + +++V +D +V P ++ + ++ + +++ +P+ L +DQG
Sbjct: 896 PP--SNGYLHVLVVVDSMTGFVWLYPTKAPSTSATVKAL--NMLTSIAIPKVLHSDQGAA 951
Query: 1389 FVGRKVAAFAESWGIKLLTSIPYYAQANGQVEAANKILISLIKKHVGRKPKSWHESLSQV 1448
F A +A+ GI+L S PY+ Q++G+VE N + L+ K + +P W++ L V
Sbjct: 952 FTSSTFADWAKEKGIQLEFSTPYHPQSSGKVERKNSDIKRLLTKLLIGRPAKWYDLLPVV 1011
Query: 1449 LWAYRNSPREATGTTPFRLAYGQEAVLPAEVYLQSFRIQRQEDIPSEVYWNMMLDELVNL 1508
A NS ++ TP +L +G ++ N L
Sbjct: 1012 QLALNNSYSPSSKYTPHQLLFGVDS-------------------------NTPFANSDTL 1046
Query: 1509 DEERVLALDVLTRQKDRIAKAYNKKVKNRSF--VTGDYVW-KVILPTDKKDRAYGKWAPN 1565
D R L +L + + + + +RS+ G V +V P + P
Sbjct: 1047 DLSREEELSLLQEIRSSLHQPTSPPASSRSWSPSVGQLVQERVARPASLR--------PR 1098
Query: 1566 WEGPFKVKKVLSNNAYVIKELSGQRQFVTINDKYLKAYK 1604
W P + +V++ +I + G R+ V++++ L AY+
Sbjct: 1099 WHKPTAILEVVNPRTVIILDHLGNRRTVSVDNLKLTAYQ 1137
>POLY_DROME (P10401) Retrovirus-related Pol polyprotein from
transposon gypsy [Contains: Reverse transcriptase (EC
2.7.7.49); Endonuclease]
Length = 1035
Score = 151 bits (381), Expect = 2e-35
Identities = 111/426 (26%), Positives = 212/426 (49%), Gaps = 26/426 (6%)
Query: 613 IKEEIERLLRCKFIRAAKYVEWLANVVPVIKK------NGKMRVCIDFRDLNAATPKDEY 666
+ E+++LL+ IR ++ + + V KK N R+ IDFR LN T D Y
Sbjct: 197 VNNEVKQLLKDGIIRPSRS-PYNSPTWVVDKKGTDAFGNPNKRLVIDFRKLNEKTIPDRY 255
Query: 667 HMPIAEMMVDSAAGHEYLSLLDGYSGYNQIFIAKEDVSKTAFRCSGALGTYEWVVMPFGL 726
MP M++ + ++ + LD SGY+QI++A+ D KT+F +G G YE+ +PFGL
Sbjct: 256 PMPSIPMILANLGKAKFFTTLDLKSGYHQIYLAEHDREKTSFSVNG--GKYEFCRLPFGL 313
Query: 727 KNAGATYQRVMNTIFHDFIETFMQVYIDDLVVKSPSRDEHLSHLRKSFERMRIHGLKMNP 786
+NA + +QR ++ + + I VY+DD+++ S + +H+ H+ + + ++++
Sbjct: 314 RNASSIFQRALDDVLREQIGKICYVYVDDVIIFSENESDHVRHIDTVLKCLIDANMRVSQ 373
Query: 787 LKCAFGVIAGDFLGFVVHKKGIEINKNKAKAILDTSPPTSKKQLQSLLGKVNFLRRFIDN 846
K F + ++LGF+V K G + + K KAI + P +++S LG ++ R FI +
Sbjct: 374 EKTRFFKESVEYLGFIVSKDGTKSDPEKVKAIQEYPEPDCVYKVRSFLGLASYYRVFIKD 433
Query: 847 LSDKTKPFSSLLR---------LKREDIFRWEA*HQKAFDELKNYLAIPPVMIP-PIKGK 896
+ +P + +L+ + ++ + + AF L+N LA V++ P K
Sbjct: 434 FAAIARPITDILKGENGSVSKHMSKKIPVEFNETQRNAFQRLRNILASEDVILKYPDFKK 493
Query: 897 PMRLYISATDETIGSMLAQEDEDGIERAVFYLSRVLNDAKTRYTMIEKLCLCLYFSCTKL 956
P L A+ IG++L+QE R + +SR L + Y E+ L + ++ KL
Sbjct: 494 PFDLTTDASASGIGAVLSQEG-----RPITMISRTLKQPEQNYATNERELLAIVWALGKL 548
Query: 957 KYYI-KPIDVMVFSHFDIIKHMLSKPILHSRIGKWSLALTEYSLTYAPLKTIKGQAVADF 1015
+ ++ ++ +F+ + ++ +++I +W + +++ K K VAD
Sbjct: 549 QNFLYGSREINIFTDHQPLTFAVADRNTNAKIKRWKSYIDQHNAKVF-YKPGKENFVADA 607
Query: 1016 LADHTL 1021
L+ L
Sbjct: 608 LSRQNL 613
Score = 35.0 bits (79), Expect = 1.7
Identities = 46/217 (21%), Positives = 88/217 (40%), Gaps = 18/217 (8%)
Query: 1259 AHQAGAK-MKWILFRQGMYWPTIMKDCIEYAKGCQDCQKHSGIQHVPASEL-HSIIKSWP 1316
AH+A + +K +L + Y+P + E C+ C + +H EL + I S+
Sbjct: 749 AHRAAQENIKQVL--RDYYFPKMGSLAKEVVANCRVCTQAKYDRHPKKQELGETPIPSYT 806
Query: 1317 FRGWAIDLIGEIHPALSRQHKYIIVAIDYFIKWVEAIPLQSVTQETVIEFIQN--HIVYR 1374
ID+ S K + ID F K+ +Q V T+++ I+
Sbjct: 807 GEMVHIDIF-------STDRKLFLTCIDKFSKYAI---VQPVVSRTIVDITAPLLQIINL 856
Query: 1375 FGLPESLTTDQGTMFVGRKVAAFAE-SWGIKLLTSIPYYAQANGQVEAANKILISLIK-K 1432
F +++ D F V + + S+GI ++ + P ++ +NGQVE + L + +
Sbjct: 857 FPNIKTVYCDNEPAFNSETVTSMLKNSFGIDIVNAPPLHSSSNGQVERFHSTLAEIARCL 916
Query: 1433 HVGRKPKSWHESLSQVLWAYRNSPREATGTTPFRLAY 1469
+ +K E + + Y + T P + +
Sbjct: 917 KLDKKTNDTVELILRATIEYNKTVHSVTRERPIEVVH 953
>POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 139 bits (350), Expect = 6e-32
Identities = 130/512 (25%), Positives = 232/512 (44%), Gaps = 28/512 (5%)
Query: 520 VDATKMEAQDPLEKVDLGDGSKKRPTYISSLIDPELRGRMIELLKEFKDCFA*DNDEMPG 579
V+ + + ++PLE++ + S+ R L + R + IE L E K C +N P
Sbjct: 174 VNISTNKIENPLEEIAIL--SEGRRLSEEKLFITQQRMQKIEELLE-KVCS--ENPLDPN 228
Query: 580 LSRDLVELQLPIKEDKKLVK*LPRRFHPDVLVKIKEEIERLLRCKFIRAAKYVEWLANVV 639
++ ++ + + + K +K P ++ P + ++I+ LL K I+ +K +
Sbjct: 229 KTKQWMKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFL 288
Query: 640 ---PVIKKNGKMRVCIDFRDLNAATPKDEYHMPIAEMMVDSAAGHEYLSLLDGYSGYNQI 696
K+ GK R+ ++++ +N AT D Y++P + ++ G + S D SG+ Q+
Sbjct: 289 VNNEAEKRRGKKRMVVNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQV 348
Query: 697 FIAKEDVSKTAFRCSGALGTYEWVVMPFGLKNAGATYQRVMNTIFHDFIETFMQVYIDDL 756
+ +E TAF C G YEW V+PFGLK A + +QR M+ F F F VY+DD+
Sbjct: 349 LLDQESRPLTAFTC--PQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDI 405
Query: 757 VVKSPSRDEHLSHLRKSFERMRIHGLKMNPLKCAFGVIAGDFLGFVVHKKGIEINKNKAK 816
+V S + ++HL H+ ++ HG+ ++ K +FLG + +G +
Sbjct: 406 LVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEI-DEGTHKPQGHIL 464
Query: 817 AILDTSPPT--SKKQLQSLLGKVNFLRRFIDNLSDKTKPFSSLLRLKREDIFRWEA*HQK 874
++ P T KKQLQ LG + + +I L+ KP + +LK +RW
Sbjct: 465 EHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQA--KLKENVPWRWTKEDTL 522
Query: 875 AFDELKNYLAIPPVMIPPIKGKPMRLYISATDETIGSMLAQ---EDEDGIERAVFYLSRV 931
++K L P + P+ + + + A+D+ G ML + E Y S
Sbjct: 523 YMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGS 582
Query: 932 LNDAKTRYTMIEKLCLCLYFSCTKLKYYIKPIDVMV---FSHFDIIKHMLSKPILHSRIG 988
A+ Y +K L + + K Y+ P+ ++ +HF ++ K S++G
Sbjct: 583 FKAAEKNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKG--DSKLG 640
Query: 989 ---KWSLALTEYSLTYAPLKTIKGQAVADFLA 1017
+W L+ YS +K ADFL+
Sbjct: 641 RNIRWQAWLSHYSFDVEHIKGTDNH-FADFLS 671
>POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 137 bits (346), Expect = 2e-31
Identities = 129/512 (25%), Positives = 232/512 (45%), Gaps = 28/512 (5%)
Query: 520 VDATKMEAQDPLEKVDLGDGSKKRPTYISSLIDPELRGRMIELLKEFKDCFA*DNDEMPG 579
V+ + + ++PLE++ + S+ R L + R + IE L E K C +N P
Sbjct: 174 VNISTNKIENPLEEIAIL--SEGRRLSEEKLFITQQRMQKIEELLE-KVCS--ENPLDPN 228
Query: 580 LSRDLVELQLPIKEDKKLVK*LPRRFHPDVLVKIKEEIERLLRCKFIRAAKYVEWLANVV 639
++ ++ + + + K +K P ++ P + ++I+ LL K I+ +K +
Sbjct: 229 KTKQWMKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFL 288
Query: 640 ---PVIKKNGKMRVCIDFRDLNAATPKDEYHMPIAEMMVDSAAGHEYLSLLDGYSGYNQI 696
K+ GK R+ ++++ +N AT D Y++P + ++ G + S D SG+ Q+
Sbjct: 289 VNNEAEKRRGKKRMVVNYKAMNKATIGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQV 348
Query: 697 FIAKEDVSKTAFRCSGALGTYEWVVMPFGLKNAGATYQRVMNTIFHDFIETFMQVYIDDL 756
+ +E TAF C G YEW V+PFGLK A + +QR M+ F F F VY+DD+
Sbjct: 349 LLDQESRPLTAFTC--PQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDI 405
Query: 757 VVKSPSRDEHLSHLRKSFERMRIHGLKMNPLKCAFGVIAGDFLGFVVHKKGIEINKNKAK 816
+V S + ++HL H+ ++ HG+ ++ K +FLG + +G +
Sbjct: 406 LVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEI-DEGTHKPQGHIL 464
Query: 817 AILDTSPPT--SKKQLQSLLGKVNFLRRFIDNLSDKTKPFSSLLRLKREDIFRWEA*HQK 874
++ P T KKQLQ LG + + +I L+ KP + +LK ++W
Sbjct: 465 EHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQA--KLKENVPWKWTKEDTL 522
Query: 875 AFDELKNYLAIPPVMIPPIKGKPMRLYISATDETIGSMLAQ---EDEDGIERAVFYLSRV 931
++K L P + P+ + + + A+D+ G ML + E Y S
Sbjct: 523 YMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGS 582
Query: 932 LNDAKTRYTMIEKLCLCLYFSCTKLKYYIKPIDVMV---FSHFDIIKHMLSKPILHSRIG 988
A+ Y +K L + + K Y+ P+ ++ +HF ++ K S++G
Sbjct: 583 FKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKG--DSKLG 640
Query: 989 ---KWSLALTEYSLTYAPLKTIKGQAVADFLA 1017
+W L+ YS +K ADFL+
Sbjct: 641 RNIRWQAWLSHYSFDVEHIKGTDNH-FADFLS 671
>POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 679
Score = 136 bits (342), Expect = 5e-31
Identities = 128/512 (25%), Positives = 232/512 (45%), Gaps = 28/512 (5%)
Query: 520 VDATKMEAQDPLEKVDLGDGSKKRPTYISSLIDPELRGRMIELLKEFKDCFA*DNDEMPG 579
V+ + + ++PL+++ + S+ R L + R + IE L E K C +N P
Sbjct: 174 VNISTNKIENPLKEIAIL--SEGRRLSEEKLFITQQRMQKIEELLE-KVCS--ENPLDPN 228
Query: 580 LSRDLVELQLPIKEDKKLVK*LPRRFHPDVLVKIKEEIERLLRCKFIRAAKYVEWLANVV 639
++ ++ + + + K +K P ++ P + ++I+ LL K I+ +K +
Sbjct: 229 KTKQWMKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFL 288
Query: 640 ---PVIKKNGKMRVCIDFRDLNAATPKDEYHMPIAEMMVDSAAGHEYLSLLDGYSGYNQI 696
K+ GK R+ ++++ +N AT D Y++P + ++ G + S D SG+ Q+
Sbjct: 289 VNNEAEKRRGKKRMVVNYKAMNKATIGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQV 348
Query: 697 FIAKEDVSKTAFRCSGALGTYEWVVMPFGLKNAGATYQRVMNTIFHDFIETFMQVYIDDL 756
+ +E TAF C G YEW V+PFGLK A + +QR M+ F F F VY+DD+
Sbjct: 349 LLDQESRPLTAFTC--PQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDI 405
Query: 757 VVKSPSRDEHLSHLRKSFERMRIHGLKMNPLKCAFGVIAGDFLGFVVHKKGIEINKNKAK 816
+V S + ++HL H+ ++ HG+ ++ K +FLG + +G +
Sbjct: 406 LVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEI-DEGTHKPQGHIL 464
Query: 817 AILDTSPPT--SKKQLQSLLGKVNFLRRFIDNLSDKTKPFSSLLRLKREDIFRWEA*HQK 874
++ P T KKQLQ LG + + +I L+ KP + +LK ++W
Sbjct: 465 EHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQA--KLKENVPWKWTKEDTL 522
Query: 875 AFDELKNYLAIPPVMIPPIKGKPMRLYISATDETIGSMLAQ---EDEDGIERAVFYLSRV 931
++K L P + P+ + + + A+D+ G ML + E Y S
Sbjct: 523 YMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYASGS 582
Query: 932 LNDAKTRYTMIEKLCLCLYFSCTKLKYYIKPIDVMV---FSHFDIIKHMLSKPILHSRIG 988
A+ Y +K L + + K Y+ P+ ++ +HF ++ K S++G
Sbjct: 583 FKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKG--DSKLG 640
Query: 989 ---KWSLALTEYSLTYAPLKTIKGQAVADFLA 1017
+W L+ YS +K ADFL+
Sbjct: 641 RNIRWQAWLSHYSFDVEHIKGTDNH-FADFLS 671
>POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 680
Score = 134 bits (338), Expect = 2e-30
Identities = 130/512 (25%), Positives = 234/512 (45%), Gaps = 28/512 (5%)
Query: 520 VDATKMEAQDPLEKVDLGDGSKKRPTYISSLIDPELRGRMIELLKEFKDCFA*DNDEMPG 579
V+ + + ++PLE++ + S+ R L + R + E L E K C +N P
Sbjct: 175 VNISTNKIENPLEEIAIL--SEGRRLSEEKLFITQQRMQKTEELLE-KVCS--ENPLDPN 229
Query: 580 LSRDLVELQLPIKEDKKLVK*LPRRFHPDVLVKIKEEIERLLRCKFIRAAKYVEWL-ANV 638
++ ++ + + + K +K P ++ P + ++I+ LL K I+ +K A +
Sbjct: 230 KTKQWMKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFL 289
Query: 639 VPVIKKNGK--MRVCIDFRDLNAATPKDEYHMPIAEMMVDSAAGHEYLSLLDGYSGYNQI 696
V +NG+ R+ ++++ +N AT D Y++P + ++ G + S D SG+ Q+
Sbjct: 290 VNNEAENGRGNKRMVVNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQV 349
Query: 697 FIAKEDVSKTAFRCSGALGTYEWVVMPFGLKNAGATYQRVMNTIFHDFIETFMQVYIDDL 756
+ +E TAF C G YEW V+PFGLK A + +QR M+ F F F VY+DD+
Sbjct: 350 LLDQESRPLTAFTC--PQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVF-RKFCCVYVDDI 406
Query: 757 VVKSPSRDEHLSHLRKSFERMRIHGLKMNPLKCAFGVIAGDFLGFVVHKKGIEINKNKAK 816
VV S + ++HL H+ ++ HG+ ++ K +FLG + +G +
Sbjct: 407 VVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEI-DEGTHKPQGHIL 465
Query: 817 AILDTSPPT--SKKQLQSLLGKVNFLRRFIDNLSDKTKPFSSLLRLKREDIFRWEA*HQK 874
++ P T KKQLQ LG + + +I NL+ +P + +LK ++W
Sbjct: 466 EHINKFPDTLEDKKQLQRFLGILTYASDYIPNLAQMRQPLQA--KLKENVPWKWTKEDTL 523
Query: 875 AFDELKNYLAIPPVMIPPIKGKPMRLYISATDETIGSMLAQ---EDEDGIERAVFYLSRV 931
++K L P + P+ + + + A+D+ G ML + E Y S
Sbjct: 524 YMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYRSGS 583
Query: 932 LNDAKTRYTMIEKLCLCLYFSCTKLKYYIKPIDVMV---FSHFDIIKHMLSKPILHSRIG 988
A+ Y +K L + + K Y+ P+ ++ +HF ++ K S++G
Sbjct: 584 FKAAERNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKG--DSKLG 641
Query: 989 ---KWSLALTEYSLTYAPLKTIKGQAVADFLA 1017
+W L+ YS +K ADFL+
Sbjct: 642 RNIRWQAWLSHYSFDVEHIKGTDNH-FADFLS 672
>POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 674
Score = 134 bits (337), Expect = 2e-30
Identities = 116/472 (24%), Positives = 212/472 (44%), Gaps = 23/472 (4%)
Query: 560 IELLKEFKDCFA*DNDEMPGLSRDLVELQLPIKEDKKLVK*LPRRFHPDVLVKIKEEIER 619
++ ++E + +N P ++ ++ + + + K +K P ++ P + ++I+
Sbjct: 204 MQKIEELLEKVCSENPLDPNKTKQWMKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKE 263
Query: 620 LLRCKFIRAAKYVEWLANVV---PVIKKNGKMRVCIDFRDLNAATPKDEYHMPIAEMMVD 676
LL K I+ +K + K+ GK R+ ++++ +N AT D Y+ P + ++
Sbjct: 264 LLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATVGDAYNPPNKDELLT 323
Query: 677 SAAGHEYLSLLDGYSGYNQIFIAKEDVSKTAFRCSGALGTYEWVVMPFGLKNAGATYQRV 736
G + S D SG+ Q+ + +E TAF C G YEW V+PFGLK A + +QR
Sbjct: 324 LIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTC--PQGHYEWNVVPFGLKQAPSIFQRH 381
Query: 737 MNTIFHDFIETFMQVYIDDLVVKSPSRDEHLSHLRKSFERMRIHGLKMNPLKCAFGVIAG 796
M+ F F F VY+DD++V S + ++HL H+ ++ HG+ ++ K
Sbjct: 382 MDEAFRVF-RKFCCVYVDDILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKI 440
Query: 797 DFLGFVVHKKGIEINKNKAKAILDTSPPT--SKKQLQSLLGKVNFLRRFIDNLSDKTKPF 854
+FLG + +G + ++ P T KKQLQ LG + + +I L+ KP
Sbjct: 441 NFLGLEI-DEGTHKPQGHILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPL 499
Query: 855 SSLLRLKREDIFRWEA*HQKAFDELKNYLAIPPVMIPPIKGKPMRLYISATDETIGSMLA 914
+ +LK ++W ++K L P + P+ + + + A+D+ G ML
Sbjct: 500 QA--KLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKLIIETDASDDYWGGMLK 557
Query: 915 Q---EDEDGIERAVFYLSRVLNDAKTRYTMIEKLCLCLYFSCTKLKYYIKPIDVMV---F 968
+ E Y S A+ Y +K L + + K Y+ P+ ++
Sbjct: 558 AIKINEGTNTELICRYASGSFKAAEKNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDN 617
Query: 969 SHFDIIKHMLSKPILHSRIG---KWSLALTEYSLTYAPLKTIKGQAVADFLA 1017
+HF ++ K S++G +W L+ YS +K ADFL+
Sbjct: 618 THFKSFVNLNYKG--DSKLGRNIRWQAWLSHYSFDVEHIKGTDNH-FADFLS 666
>POL_FMVD (P09523) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 666
Score = 127 bits (318), Expect = 3e-28
Identities = 114/483 (23%), Positives = 213/483 (43%), Gaps = 24/483 (4%)
Query: 549 SLIDPELRGRMI----ELLKEFKDCFA*DNDEMPGLSRDLVELQLPIKEDKKLVK*LPRR 604
++I+PE R +I + +++ D +N P S+ ++ + + + K+++ P
Sbjct: 187 NIINPEERYFLITEKYQKIEQLLDKVCSENPIDPIKSKQWMKASIKLIDPLKVIRVKPMS 246
Query: 605 FHPDVLVKIKEEIERLLRCKFIRAAKYVEWLANVV---PVIKKNGKMRVCIDFRDLNAAT 661
+ P ++I+ LL I +K + ++ GK R+ ++++ +N AT
Sbjct: 247 YSPQDREGFAKQIKELLDLGLIIPSKSQHMSPAFLVENEAERRRGKKRMVVNYKAINQAT 306
Query: 662 PKDEYHMPIAEMMVDSAAGHEYLSLLDGYSGYNQIFIAKEDVSKTAFRCSGALGTYEWVV 721
D +++P + ++ G S D SG+ Q+ + +E TAF C G ++W V
Sbjct: 307 IGDSHNLPNMQELLTLLRGKSIFSSFDCKSGFWQVVLDEESQKLTAFTC--PQGHFQWKV 364
Query: 722 MPFGLKNAGATYQRVMNTIFHDFIETFMQVYIDDLVVKSPSRDEHLSHLRKSFERMRIHG 781
+PFGLK A + +QR M T + + F VY+DD++V S S +H +H+ + + +G
Sbjct: 365 VPFGLKQAPSIFQRHMQTALNG-ADKFCMVYVDDIIVFSNSELDHYNHVYAVLKIVEKYG 423
Query: 782 LKMNPLKCAFGVIAGDFLGFVVHKKGIEINKNKAKAILDTSPP--TSKKQLQSLLGKVNF 839
+ ++ K +FLG + KG +N + P KK LQ LG + +
Sbjct: 424 IILSKKKANLFKEKINFLGLEI-DKGTHCPQNHILENIHKFPDRLEDKKHLQRFLGVLTY 482
Query: 840 LRRFIDNLSDKTKPFSSLLRLKREDIFRWEA*HQKAFDELKNYLAIPPVMIPPIKGKPMR 899
+I L++ KP ++LK++ + W ++K L P + P +
Sbjct: 483 AETYIPKLAEIRKPLQ--VKLKKDVTWNWTQSDSDYVKKIKKNLGSFPKLYLPKPEDHLI 540
Query: 900 LYISATDETIGSMLAQEDEDGIERAVFYLSRVLNDAKTRYTMIEKLCLCLYFSCTKLKYY 959
+ A+D G +L DG+E Y S A+ Y +K L + TK Y
Sbjct: 541 IETDASDSFWGGVLKARALDGVELICRYSSGSFKQAEKNYHSNDKELLAVKQVITKFSAY 600
Query: 960 IKPIDVMV------FSHFDIIKHMLSKPILHSRIGKWSLALTEYSLTYAPLKTIKGQAVA 1013
+ P+ V F++F ++ L R+ +W ++Y L+ +K +A
Sbjct: 601 LTPVRFTVRTDNKNFTYF--LRINLKGDSKQGRLVRWQNWFSKYQFDVEHLEGVK-NVLA 657
Query: 1014 DFL 1016
D L
Sbjct: 658 DCL 660
>POL_CERV (P05400) Enzymatic polyprotein [Contains: Aspartic protease
(EC 3.4.23.-); Endonuclease; Reverse transcriptase (EC
2.7.7.49)]
Length = 659
Score = 123 bits (309), Expect = 4e-27
Identities = 132/566 (23%), Positives = 235/566 (41%), Gaps = 65/566 (11%)
Query: 470 HGVGAVPSTLHQRISIWKPDGVVENVQADQSYYLAEAG-YVDKKNFEKSPLVDATKMEA- 527
+GV ++ ++ + +P+ + N+ ++Q +L E G +VD+ +E + +K A
Sbjct: 141 YGVKGFLESMKKKSKVNRPEPI--NITSNQHLFLEEGGNHVDEMLYE----IQISKFSAI 194
Query: 528 QDPLEKVDLGDGSKKRPTYISSLIDPELRGRMIELLKEFKDCFA*DNDEMPGLSRDLVEL 587
++ LE+V S + P IDPE S+ +
Sbjct: 195 EEMLERV-----SSENP------IDPEK-------------------------SKQWMTA 218
Query: 588 QLPIKEDKKLVK*LPRRFHPDVLVKIKEEIERLLRCKFIRAAKYVEWLANVV---PVIKK 644
+ + + K +VK P + P + +I+ LL K I+ +K + ++
Sbjct: 219 TIELIDPKTVVKVKPMSYSPSDREEFDRQIKELLELKVIKPSKSTHMSPAFLVENEAERR 278
Query: 645 NGKMRVCIDFRDLNAATPKDEYHMPIAEMMVDSAAGHEYLSLLDGYSGYNQIFIAKEDVS 704
GK R+ ++++ +N AT D +++P + ++ G + S D SG Q+ + KE
Sbjct: 279 RGKKRMVVNYKAMNKATKGDAHNLPNKDELLTLVRGKKIYSSFDCKSGLWQVLLDKESQL 338
Query: 705 KTAFRCSGALGTYEWVVMPFGLKNAGATYQRVMNTIFHDFIETFMQVYIDD-LVVKSPSR 763
TAF C G Y+W V+PFGLK A + + + + + VY+DD LV + R
Sbjct: 339 LTAFTCPQ--GHYQWNVVPFGLKQAPSIFPKTYANSHSNQYSKYCCVYVDDILVFSNTGR 396
Query: 764 DEHLSHLRKSFERMRIHGLKMNPLKCAFGVIAGDFLGFVVHKKGIEINKNKAKAILDTSP 823
EH H+ R G+ ++ K +FLG + +G +N + P
Sbjct: 397 KEHYIHVLNILRRCEKLGIILSKKKAQLFKEKINFLGLEI-DQGTHCPQNHILEHIHKFP 455
Query: 824 P--TSKKQLQSLLGKVNFLRRFIDNLSDKTKPFSSLLRLKREDIFRWEA*HQKAFDELKN 881
KKQLQ LG + + +I L+ KP S +LK + + W + ++K
Sbjct: 456 DRIEDKKQLQRFLGILTYASDYIPKLASIRKPLQS--KLKEDSTWTWNDTDSQYMAKIKK 513
Query: 882 YLAIPPVMIPPIKGKPMRLYISATDETIGSMLAQEDEDGIERAVFYLSRVLNDAKTRYTM 941
L P + P + + A++E G +L + + E Y S A+ Y
Sbjct: 514 NLKSFPKLYHPEPNDKLVIETDASEEFWGGIL-KAIHNSHEYICRYASGSFKAAERNYHS 572
Query: 942 IEKLCLCLYFSCTKLKYYIKPIDVMV------FSHFDIIKHMLSKPILHSRIGKWSLALT 995
EK L + K Y+ P ++ F+HF + L R+ +W + L+
Sbjct: 573 NEKELLAVIRVIKKFSIYLTPSRFLIRTDNKNFTHF--VNINLKGDRKQGRLVRWQMWLS 630
Query: 996 EYSLTYAPLKTIKGQAVADFLADHTL 1021
+Y + K ADFL ++TL
Sbjct: 631 QYDFDVEHIAGTK-NVFADFLQENTL 655
>POL_COYMV (P19199) Putative polyprotein [Contains: Coat protein;
Protease (EC 3.4.23.-); Reverse transcriptase (EC
2.7.7.49); Ribonuclease H (EC 3.1.26.4)]
Length = 1886
Score = 120 bits (301), Expect = 3e-26
Identities = 100/360 (27%), Positives = 172/360 (47%), Gaps = 23/360 (6%)
Query: 549 SLIDPELRGRMIELLKEFKDCFA*DNDEMPGLSRDLVELQLPI-KEDKKLVK*LPRRFHP 607
S +D E + +LLKE K+ + M + ++ +L I D K++ + P
Sbjct: 1353 SFLDQEFARKNKDLLKEMKEMKYIGENPMEFWKNNKIKCKLNIINPDIKIMGRPIKHVTP 1412
Query: 608 DVLVKIKEEIERLLRCKFIRAAK--------YVEWLANVVPVI--KKNGKMRVCIDFRDL 657
+ +I LL+ K IR ++ V + P+ +K GK R+ +++ L
Sbjct: 1413 GDEEAMTRQINLLLQMKVIRPSESKHRSTAFIVRSGTEIDPITGKEKKGKERMVFNYKLL 1472
Query: 658 NAATPKDEYHMPIAEMMVDSAAGHEYLSLLDGYSGYNQIFIAKEDVSKTAFRCSGALGTY 717
N T D+Y +P ++ + S D SG+ Q+ + +E V TAF L Y
Sbjct: 1473 NENTESDQYSLPGINTIISKVGRSKIYSKFDLKSGFWQVAMEEESVPWTAFLAGNKL--Y 1530
Query: 718 EWVVMPFGLKNAGATYQRVMNTIFHDFIETFMQVYIDDLVVKSPSRDEHLSHLRKSFERM 777
EW+VMPFGLKNA A +QR M+ +F E F+ VYIDD++V S + ++H HL +
Sbjct: 1531 EWLVMPFGLKNAPAIFQRKMDNVFKG-TEKFIAVYIDDILVFSETAEQHSQHLYTMLQLC 1589
Query: 778 RIHGLKMNPLKCAFGVIAGDFLGFVVHKKGIEINKNKAKAILDTSPP--TSKKQLQSLLG 835
+ +GL ++P K G DFLG + I++ + I D S + + ++S LG
Sbjct: 1590 KENGLILSPTKMKIGTPEIDFLGASLGCTKIKLQPHIISKICDFSDEKLATPEGMRSWLG 1649
Query: 836 KVNFLRRFIDNLSDKTKPFSSLLRL---KREDIFRWEA*HQKAFDELKNYLAIPPVMIPP 892
+++ R +I ++ +P + KR + W+ Q +++KN +P + +PP
Sbjct: 1650 ILSYARNYIQDIGKLVQPLRQKMAPTGDKRMNPETWKMVRQ-IKEKVKN---LPDLQLPP 1705
>YRD6_CAEEL (Q09575) Hypothetical protein K02A2.6 in chromosome II
Length = 1268
Score = 115 bits (289), Expect = 7e-25
Identities = 84/241 (34%), Positives = 131/241 (53%), Gaps = 11/241 (4%)
Query: 610 LVKIKEEIERLLRCKFIRAAKYVEWLANVVPVIKKN-GKMRVCIDFR--DLNAATPKDEY 666
L ++ E+ RL I Y +W A +V + KK GK+RVC DF+ LNAA KDE+
Sbjct: 453 LEAVETELNRLQEMGVIVPITYAKWAAPIVVIKKKGTGKIRVCADFKCSGLNAAL-KDEF 511
Query: 667 H-MPIAEMMVDSAAGHEYLSLLDGYSGYNQIFIAKEDVSKTAFRCSGALGTYEWVVMPFG 725
H +P +E + G Y S +D Y Q+ + E+ K A + G ++++ M FG
Sbjct: 512 HPLPTSEDIFSRLKGTVY-SQIDLKDAYLQVEL-DEEAQKLAV-INTHRGIFKYLRMTFG 568
Query: 726 LKNAGATYQRVMNTIFHDFIETFMQVYIDDLVVKSPSRDEHLSHLRKSFERMRIHGLKMN 785
LK A A++Q++M+ + T + VY DD+++ + S +EH LR+ FER + +G +++
Sbjct: 569 LKPAPASFQKIMDKMVSGL--TGVAVYWDDIIISASSIEEHEKILRELFERFKEYGFRVS 626
Query: 786 PLKCAFGVIAGDFLGFVVHKKGIEINKNKAKAILDTSPPTSKKQLQSLLGKVNFLRRFID 845
KCAF FLGF V + G + K +AI PT +KQL S LG ++L R +
Sbjct: 627 AEKCAFAQKQVTFLGF-VDEHGRRPDSKKTEAIRSMKAPTDQKQLASFLGAADWLSRMMQ 685
Query: 846 N 846
+
Sbjct: 686 D 686
Score = 70.5 bits (171), Expect = 4e-11
Identities = 79/301 (26%), Positives = 130/301 (42%), Gaps = 34/301 (11%)
Query: 1177 SPLDLEVMVIDNLTPND-WRKPIVEYLQNPVGSTDIK---VKYRALNYTIIGNELFKKNV 1232
S D EV + L ND W+ P ST+I+ ++YR I G L V
Sbjct: 726 SQKDHEVSSVVKLVRNDSWK---------PKPSTEIEKHWIRYRDRLKLIHGCLLLDDRV 776
Query: 1233 DGTLLKCLSEVDAFIAVSAAHGGLCGAHQAGAKMKWILFRQGMYWPTIMKDCIEYAKGCQ 1292
+ K L + I + H G G Q K + +F W + D + C
Sbjct: 777 --IVPKSLQK----IVLKQLHEGHPGIVQMKQKARSFVF-----WRGLDSDIENMVRHCN 825
Query: 1293 DCQKHSGIQHVPASELHSIIKSWPFRGWAIDLIGEIHPALSRQHKYIIVAIDYFIKWVEA 1352
+CQ++S + V + ++ P++ ID G ++ Y++V +D K+ E
Sbjct: 826 NCQENSKMPRVVPLNPWPVPEA-PWKRIHIDFAGPLNGC------YLLVVVDAKTKYAEV 878
Query: 1353 IPLQSVTQETVIEFIQNHIVYRFGLPESLTTDQGTMFVGRKVAAFAESWGIKLLTSIPYY 1412
+S++ T I+ ++ I G PE++ +D GT A +S GI+ TS YY
Sbjct: 879 KLTRSISAVTTIDLLEE-IFSIHGYPETIISDNGTQLTSHLFAQMCQSHGIEHKTSAVYY 937
Query: 1413 AQANGQVEAANKILISLIKKHVGRKPKSWHESLSQVLWAYRNSPREA-TGTTPFRLAYGQ 1471
++NG E L I K G + + L++ L +YRN+P A G+TP +G+
Sbjct: 938 PRSNGAAERFVDTLKRGIAKIKGEGSVN-QQILNKFLISYRNTPHSALNGSTPAECHFGR 996
Query: 1472 E 1472
+
Sbjct: 997 K 997
>POL_FOAMV (P14350) Pol polyprotein [Contains: Reverse
transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
(RT); Integrase (IN)]
Length = 886
Score = 110 bits (276), Expect = 2e-23
Identities = 84/335 (25%), Positives = 152/335 (45%), Gaps = 13/335 (3%)
Query: 638 VVPVIKKNGKMRVCIDFRDLNAATPKDEYHMPIAEMMVDSAAGHEYLSLLDGYSGYNQIF 697
V PV K +G+ R+ +D+R++N P + ++ + +Y + LD +G+
Sbjct: 5 VYPVPKPDGRWRMVLDYREVNKTIPLTAAQNQHSAGILATIVRQKYKTTLDLANGFWAHP 64
Query: 698 IAKEDVSKTAFRCSGALGTYEWVVMPFGLKNAGATYQRVMNTIFHDFIETFMQVYIDDLV 757
I E TAF G Y W +P G N+ A + + + + +QVY+DD+
Sbjct: 65 ITPESYWLTAFTWQGK--QYCWTRLPQGFLNSPALFTADVVDLLKEIPN--VQVYVDDIY 120
Query: 758 VKSPSRDEHLSHLRKSFERMRIHGLKMNPLKCAFGVIAGDFLGFVVHKKGIEINKNKAKA 817
+ EH+ L K F+ + G ++ K G +FLGF + K+G +
Sbjct: 121 LSHDDPKEHVQQLEKVFQILLQAGYVVSLKKSEIGQKTVEFLGFNITKEGRGLTDTFKTK 180
Query: 818 ILDTSPPTSKKQLQSLLGKVNFLRRFIDNLSDKTKPFSSLLRLKREDIFRWEA*HQKAFD 877
+L+ +PP KQLQS+LG +NF R FI N ++ +P +L+ + W + K +
Sbjct: 181 LLNITPPKDLKQLQSILGLLNFARNFIPNFAELVQPLYNLIASAKGKYIEWSEENTKQLN 240
Query: 878 ---ELKNYLAIPPVMIPPIKGKPMRLYISATDETIGSMLAQEDEDGIERAVFYLSRVLND 934
E N + +P RL I + +E G ++ + YL+ V +
Sbjct: 241 MVIEALNTASNLEERLP-----EQRLVIKVNTSPSAGYVRYYNETG-KKPIMYLNYVFSK 294
Query: 935 AKTRYTMIEKLCLCLYFSCTKLKYYIKPIDVMVFS 969
A+ +++M+EKL ++ + K +++V+S
Sbjct: 295 AELKFSMLEKLLTTMHKALIKAMDLAMGQEILVYS 329
Score = 79.3 bits (194), Expect = 8e-14
Identities = 50/201 (24%), Positives = 92/201 (44%), Gaps = 4/201 (1%)
Query: 1276 YWPTIMKDCIEYAKGCQDCQKHSGIQHVPASELHSIIKSWPFRGWAIDLIGEIHPALSRQ 1335
+WP + KD ++ CQ C + L PF + ID IG + P S+
Sbjct: 635 WWPNMRKDVVKQLGRCQQCLITNASNKASGPILRPDRPQKPFDKFFIDYIGPLPP--SQG 692
Query: 1336 HKYIIVAIDYFIKWVEAIPLQSVTQETVIEFIQNHIVYRFGLPESLTTDQGTMFVGRKVA 1395
+ Y++V +D + P ++ + ++ + +++ +P+ + +DQG F A
Sbjct: 693 YLYVLVVVDGMTGFTWLYPTKAPSTSATVKSL--NVLTSIAIPKVIHSDQGAAFTSSTFA 750
Query: 1396 AFAESWGIKLLTSIPYYAQANGQVEAANKILISLIKKHVGRKPKSWHESLSQVLWAYRNS 1455
+A+ GI L S PY+ Q+ +VE N + L+ K + +P W++ L V A N+
Sbjct: 751 EWAKERGIHLEFSTPYHPQSGSKVERKNSDIKRLLTKLLVGRPTKWYDLLPVVQLALNNT 810
Query: 1456 PREATGTTPFRLAYGQEAVLP 1476
TP +L +G ++ P
Sbjct: 811 YSPVLKYTPHQLLFGIDSNTP 831
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.325 0.139 0.420
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 180,306,220
Number of Sequences: 164201
Number of extensions: 7603447
Number of successful extensions: 25031
Number of sequences better than 10.0: 144
Number of HSP's better than 10.0 without gapping: 99
Number of HSP's successfully gapped in prelim test: 45
Number of HSP's that attempted gapping in prelim test: 24637
Number of HSP's gapped (non-prelim): 274
length of query: 1610
length of database: 59,974,054
effective HSP length: 124
effective length of query: 1486
effective length of database: 39,613,130
effective search space: 58865111180
effective search space used: 58865111180
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 73 (32.7 bits)
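
The derived figures in the statistics block above can be cross-checked with a short calculation. The sketch below is not BLAST's own code. It assumes the standard Karlin-Altschul relations: effective length = length minus the effective HSP length (taken once per database sequence), bit score = (lambda * raw - ln K) / ln 2, and E = search space * 2^(-bits). The function names are hypothetical, and the inputs are the rounded gapped Lambda and K printed in the report, so the results only approximately match the reported values.

```python
import math


def effective_lengths(query_len, db_len, n_seqs, hsp_len):
    """Return (effective query length, effective db length, search space).

    Assumes the simple length correction: the effective HSP length is
    subtracted once from the query and once per database sequence.
    """
    eff_query = query_len - hsp_len
    eff_db = db_len - n_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to bits using Lambda and K."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect(bits, search_space):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    # Values copied from the report: query 1610 letters, database
    # 59,974,054 letters in 164,201 sequences, effective HSP length 124.
    eff_q, eff_db, space = effective_lengths(1610, 59_974_054, 164_201, 124)
    print(eff_q, eff_db, space)

    # Gapped Lambda and K as printed (rounded) in the report.
    lam, k = 0.267, 0.0410
    print(round(bit_score(73, lam, k), 1))   # S2 raw score 73
    bits = bit_score(79, lam, k)             # the HSP reported as 35.0 bits (79)
    print(round(bits, 1), round(expect(bits, space), 1))
```

With the numbers in this report, the three lengths agree with the "effective length of query", "effective length of database" and "effective search space" lines, and the raw score 79 comes out close to the reported 35.0 bits and Expect = 1.7.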