Medicago
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= AC147000.7 - phase: 0 
         (1185 letters)

Database: sprot 
           164,201 sequences; 59,974,054 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

POLX_TOBAC (P10978) Retrovirus-related Pol polyprotein from tran...   688  0.0
COPI_DROME (P04146) Copia protein (Gag-int-pol protein) [Contain...   334  1e-90
YCH4_YEAST (P25600) Transposon Ty5-1 34.5 kDa hypothetical protein    171  1e-41
M810_ARATH (P92519) Hypothetical mitochondrial protein AtMg00810...   142  4e-33
YCB9_YEAST (P25384) Transposon Ty2 protein B (Ty1-17 protein B)       132  4e-30
YJL3_YEAST (P47024) Transposon Ty4 207.7 kDa hypothetical protein     130  2e-29
YMT5_YEAST (Q04214) Transposon Ty1 protein B                          128  8e-29
YMU0_YEAST (Q04670) Transposon Ty1 protein B                          128  1e-28
YJZ9_YEAST (P47100) Transposon Ty1 protein B                          128  1e-28
YME4_YEAST (Q04711) Transposon Ty1 protein B                          125  5e-28
YMD9_YEAST (Q03434) Transposon Ty1 protein B                          125  7e-28
YJZ7_YEAST (P47098) Transposon Ty1 protein B                          124  2e-27
M820_ARATH (P92520) Hypothetical mitochondrial protein AtMg00820...   100  4e-20
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran...    61  2e-08
POL_MLVFF (P26809) Pol polyprotein [Contains: Protease (EC 3.4.2...    61  2e-08
POL_MLVMO (P03355) Pol polyprotein [Contains: Protease (EC 3.4.2...    59  6e-08
POL3_MOUSE (P11367) Retrovirus-related Pol polyprotein (Endonucl...    59  6e-08
POL_MLVFP (P26808) Pol polyprotein [Contains: Protease (EC 3.4.2...    59  8e-08
POL_MLVF5 (P26810) Pol polyprotein [Contains: Protease (EC 3.4.2...    59  8e-08
POL_FENV1 (P31792) Pol polyprotein [Contains: Reverse transcript...    58  1e-07

>POLX_TOBAC (P10978) Retrovirus-related Pol polyprotein from
            transposon TNT 1-94 [Contains: Protease (EC 3.4.23.-);
            Reverse transcriptase (EC 2.7.7.49); Endonuclease]
          Length = 1328

 Score =  688 bits (1775), Expect = 0.0
 Identities = 404/1217 (33%), Positives = 649/1217 (53%), Gaps = 89/1217 (7%)

Query: 25   GMALPEQFQIAVIIDKLPPAWKDFKSLLRHKTKEFSLESLITRLRIEEEARKQEQNEEVF 84
            G+ + E+ +  ++++ LP ++ +  + + H      L+ + + L + E+ RK+ +N+   
Sbjct: 136  GVKIEEEDKAILLLNSLPSSYDNLATTILHGKTTIELKDVTSALLLNEKMRKKPENQGQA 195

Query: 85   VVSNNNTKKKFVGAVLKPAGKPFKNQNRPMNKNSNRNKTGNNSRPQIQQPPKNDAAPPFN 144
            +++                G+ ++  +    ++  R K+ N S+ +++           N
Sbjct: 196  LITEGR-------------GRSYQRSSNNYGRSGARGKSKNRSKSRVR-----------N 231

Query: 145  CYNCGQADHMARKCRNRTNRPAQA-------HMATDAAPDEPYVAMITE----INMIAGS 193
            CYNC Q  H  R C N      +        + A     ++  V  I E    +++    
Sbjct: 232  CYNCNQPGHFKRDCPNPRKGKGETSGQKNDDNTAAMVQNNDNVVLFINEEEECMHLSGPE 291

Query: 194  DGWWVDTGASRHVCYDRDMFKTYTACDDQKVLLGDSHSTDVVGIGDIELKFTSEKTLILK 253
              W VDT AS H    RD+F  Y A D   V +G++  + + GIGDI +K     TL+LK
Sbjct: 292  SEWVVDTAASHHATPVRDLFCRYVAGDFGTVKMGNTSYSKIAGIGDICIKTNVGCTLVLK 351

Query: 254  DVLHTPKIRKNLVSGFLLNKAGFTQSIGADLYTITKNGIFVGKGYATDGMFKLNIDMNKI 313
            DV H P +R NL+SG  L++ G+        + +TK  + + KG A   +++ N ++ + 
Sbjct: 352  DVRHVPDLRMNLISGIALDRDGYESYFANQKWRLTKGSLVIAKGVARGTLYRTNAEICQG 411

Query: 314  S-SSAYMLCDFNIWHSRLCHVNKRIISNMSGLGLIPKISLNDFEKCQFCSQAKINKESHK 372
              ++A      ++WH R+ H++++ +  ++   LI        + C +C   K ++ S +
Sbjct: 412  ELNAAQDEISVDLWHKRMGHMSEKGLQILAKKSLISYAKGTTVKPCDYCLFGKQHRVSFQ 471

Query: 373  -SVTRITEPFELIHSDLCELDGNLTRNGKRYFITFIDDCSDYTHVYLMRNKNEALDIFKQ 431
             S  R     +L++SD+C      +  G +YF+TFIDD S    VY+++ K++   +F++
Sbjct: 472  TSSERKLNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTKDQVFQVFQK 531

Query: 432  YVKEIENQFNIRIKRFRSDRGTEYGSHIFNEYYKELGIIHETTAPYSPEMNGKAERKNRT 491
            +   +E +   ++KR RSD G EY S  F EY    GI HE T P +P+ NG AER NRT
Sbjct: 532  FHALVERETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQHNGVAERMNRT 591

Query: 492  FTELVVATMLNSGAAPHWWGEILLTVCYVLNRVPKTKNKIS-PYEILKKRQPNLSYFRTW 550
              E V + +  +     +WGE + T CY++NR P        P  +   ++ + S+ + +
Sbjct: 592  IVEKVRSMLRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWTNKEVSYSHLKVF 651

Query: 551  GCLAYVRKPDPKRVKLASRAYECAFIGYALNSKAYRFYDLKSKTIIESNDVDFYENKFP- 609
            GC A+   P  +R KL  ++  C FIGY      YR +D   K +I S DV F E++   
Sbjct: 652  GCRAFAHVPKEQRTKLDDKSIPCIFIGYGDEEFGYRLWDPVKKKVIRSRDVVFRESEVRT 711

Query: 610  ---------------FKSGDSGGNSGGTDNSVLD-------QPSEIITSNENIERDVIEP 647
                           F +  S  N+  +  S  D       QP E+I   E ++  V E 
Sbjct: 712  AADMSEKVKNGIIPNFVTIPSTSNNPTSAESTTDEVSEQGEQPGEVIEQGEQLDEGVEEV 771

Query: 648  G-------------RGKRARIAKEYGP--EYVAYTIEEDPSSIKEALSSIDADLWQEAIN 692
                          R +R R+     P  EYV  + + +P S+KE LS  + +   +A+ 
Sbjct: 772  EHPTQGEEQHQPLRRSERPRVESRRYPSTEYVLISDDREPESLKEVLSHPEKNQLMKAMQ 831

Query: 693  DEMDSLMSNETWHLTDLPPGCKTIGCKWILKKKLKPDGSIDKYKARLVAKGFRQRENVDF 752
            +EM+SL  N T+ L +LP G + + CKW+ K K   D  + +YKARLV KGF Q++ +DF
Sbjct: 832  EEMESLQKNGTYKLVELPKGKRPLKCKWVFKLKKDGDCKLVRYKARLVVKGFEQKKGIDF 891

Query: 753  FDTYSPVTRITSIRVLISLAAIHNLIVHQMDVKTAFLNGELEEEIYMDQPEGFVIHGQEN 812
             + +SPV ++TSIR ++SLAA  +L V Q+DVKTAFL+G+LEEEIYM+QPEGF + G+++
Sbjct: 892  DEIFSPVVKMTSIRTILSLAASLDLEVEQLDVKTAFLHGDLEEEIYMEQPEGFEVAGKKH 951

Query: 813  KVCKLDKSLYGLKQAPKQWHEKFDNLMIENEFKVNESDKCIYSK-YENNTCTIICLYVDD 871
             VCKL+KSLYGLKQAP+QW+ KFD+ M    +    SD C+Y K +  N   I+ LYVDD
Sbjct: 952  MVCKLNKSLYGLKQAPRQWYMKFDSFMKSQTYLKTYSDPCVYFKRFSENNFIILLLYVDD 1011

Query: 872  LLIFGSNLNAIKDVKSLLCHNFDMKDLGKADVILGIKIT--RTDNGISLNQSHYVEKILR 929
            +LI G +   I  +K  L  +FDMKDLG A  ILG+KI   RT   + L+Q  Y+E++L 
Sbjct: 1012 MLIVGKDKGLIAKLKGDLSKSFDMKDLGPAQQILGMKIVRERTSRKLWLSQEKYIERVLE 1071

Query: 930  KYNYFYCKPASTPCDPSVKLFK-------NTGDSVRQTEYASIIGSLRYATDCTRPDISY 982
            ++N    KP STP    +KL K           ++ +  Y+S +GSL YA  CTRPDI++
Sbjct: 1072 RFNMKNAKPVSTPLAGHLKLSKKMCPTTVEEKGNMAKVPYSSAVGSLMYAMVCTRPDIAH 1131

Query: 983  AVGLLCKFTSRPSMEHWQAIERVMRYLKKTMTLGLHYQRYPAVLEGYSDADWNNLSDDSK 1042
            AVG++ +F   P  EHW+A++ ++RYL+ T    L +     +L+GY+DAD     D+ K
Sbjct: 1132 AVGVVSRFLENPGKEHWEAVKWILRYLRGTTGDCLCFGGSDPILKGYTDADMAGDIDNRK 1191

Query: 1043 ATSGYIFSIAGGAVSWKSKKQTILAQSTMESEMIALAAASEEASWLRCLLSEIPLWERPL 1102
            +++GY+F+ +GGA+SW+SK Q  +A ST E+E IA     +E  WL+  L E+ L ++  
Sbjct: 1192 SSTGYLFTFSGGAISWQSKLQKCVALSTTEAEYIAATETGKEMIWLKRFLQELGLHQK-- 1249

Query: 1103 PAVLIHCDSTAAIAKIENRYYNGKRRQIRRKHSTIREYLSNGTVRVDFVRTNENLADPLT 1162
               +++CDS +AI   +N  Y+ + + I  ++  IRE + + +++V  + TNEN AD LT
Sbjct: 1250 -EYVVYCDSQSAIDLSKNSMYHARTKHIDVRYHWIREMVDDESLKVLKISTNENPADMLT 1308

Query: 1163 KGLNREKVANTSSRMGL 1179
            K + R K       +G+
Sbjct: 1309 KVVPRNKFELCKELVGM 1325


>COPI_DROME (P04146) Copia protein (Gag-int-pol protein) [Contains:
            Copia VLP protein; Copia protease (EC 3.4.23.-)]
          Length = 1409

 Score =  334 bits (856), Expect = 1e-90
 Identities = 195/527 (37%), Positives = 297/527 (56%), Gaps = 13/527 (2%)

Query: 665  AYTIEED-PSSIKEALSSIDADLWQEAINDEMDSLMSNETWHLTDLPPGCKTIGCKWILK 723
            A+TI  D P+S  E     D   W+EAIN E+++   N TW +T  P     +  +W+  
Sbjct: 883  AHTIFNDVPNSFDEIQYRDDKSSWEEAINTELNAHKINNTWTITKRPENKNIVDSRWVFS 942

Query: 724  KKLKPDGSIDKYKARLVAKGFRQRENVDFFDTYSPVTRITSIRVLISLAAIHNLIVHQMD 783
             K    G+  +YKARLVA+GF Q+  +D+ +T++PV RI+S R ++SL   +NL VHQMD
Sbjct: 943  VKYNELGNPIRYKARLVARGFTQKYQIDYEETFAPVARISSFRFILSLVIQYNLKVHQMD 1002

Query: 784  VKTAFLNGELEEEIYMDQPEGFVIHGQENKVCKLDKSLYGLKQAPKQWHEKFDNLMIENE 843
            VKTAFLNG L+EEIYM  P+G  I    + VCKL+K++YGLKQA + W E F+  + E E
Sbjct: 1003 VKTAFLNGTLKEEIYMRLPQG--ISCNSDNVCKLNKAIYGLKQAARCWFEVFEQALKECE 1060

Query: 844  FKVNESDKCIY--SKYENNTCTIICLYVDDLLIFGSNLNAIKDVKSLLCHNFDMKDLGKA 901
            F  +  D+CIY   K   N    + LYVDD++I   ++  + + K  L   F M DL + 
Sbjct: 1061 FVNSSVDRCIYILDKGNINENIYVLLYVDDVVIATGDMTRMNNFKRYLMEKFRMTDLNEI 1120

Query: 902  DVILGIKITRTDNGISLNQSHYVEKILRKYNYFYCKPASTPCDPSVKLFKNTGDSVRQTE 961
               +GI+I   ++ I L+QS YV+KIL K+N   C   STP    +       D    T 
Sbjct: 1121 KHFIGIRIEMQEDKIYLSQSAYVKKILSKFNMENCNAVSTPLPSKINYELLNSDEDCNTP 1180

Query: 962  YASIIGSLRYATDCTRPDISYAVGLLCKFTSRPSMEHWQAIERVMRYLKKTMTLGLHYQR 1021
              S+IG L Y   CTRPD++ AV +L +++S+ + E WQ ++RV+RYLK T+ + L +++
Sbjct: 1181 CRSLIGCLMYIMLCTRPDLTTAVNILSRYSSKNNSELWQNLKRVLRYLKGTIDMKLIFKK 1240

Query: 1022 ---YPAVLEGYSDADWNNLSDDSKATSGYIFSIAG-GAVSWKSKKQTILAQSTMESEMIA 1077
               +   + GY D+DW     D K+T+GY+F +     + W +K+Q  +A S+ E+E +A
Sbjct: 1241 NLAFENKIIGYVDSDWAGSEIDRKSTTGYLFKMFDFNLICWNTKRQNSVAASSTEAEYMA 1300

Query: 1078 LAAASEEASWLRCLLSEIPL-WERPLPAVLIHCDSTAAIAKIENRYYNGKRRQIRRKHST 1136
            L  A  EA WL+ LL+ I +  E P   + I+ D+   I+   N   + + + I  K+  
Sbjct: 1301 LFEAVREALWLKFLLTSINIKLENP---IKIYEDNQGCISIANNPSCHKRAKHIDIKYHF 1357

Query: 1137 IREYLSNGTVRVDFVRTNENLADPLTKGLNREKVANTSSRMGLMPID 1183
             RE + N  + ++++ T   LAD  TK L   +      ++GL+  D
Sbjct: 1358 AREQVQNNVICLEYIPTENQLADIFTKPLPAARFVELRDKLGLLQDD 1404



 Score =  198 bits (504), Expect = 6e-50
 Identities = 148/623 (23%), Positives = 285/623 (44%), Gaps = 51/623 (8%)

Query: 1   MSDDKSVEAQSHELQQIAHEIIAEGMALPEQFQIAVIIDKLPPAWKDFKSLLRHKTKEFS 60
           +S + S+ +  H   ++  E++A G  + E  +I+ ++  LP  +    + +   ++E  
Sbjct: 108 LSSEMSLLSHFHIFDELISELLAAGAKIEEMDKISHLLITLPSCYDGIITAIETLSEENL 167

Query: 61  LESLITRLRIEEEARKQEQNEEVFVVSNNNTKKKFVGAVLKPAGKPFKNQNRPMNKNSNR 120
             + +    +++E + +  +        N+T KK + A++              N N+ +
Sbjct: 168 TLAFVKNRLLDQEIKIKNDH--------NDTSKKVMNAIVHN------------NNNTYK 207

Query: 121 NKTGNNSRPQIQQPPKNDAAPPFNCYNCGQADHMARKC----RNRTNRPAQAHMATDAAP 176
           N    N   + ++  K ++     C++CG+  H+ + C    R   N+  +       A 
Sbjct: 208 NNLFKNRVTKPKKIFKGNSKYKVKCHHCGREGHIKKDCFHYKRILNNKNKENEKQVQTAT 267

Query: 177 DEPYVAMITEINMIAGSD--GWWVDTGASRHVCYDRDMFKTYTACDDQKVLLGDSHSTDV 234
                 M+ E+N  +  D  G+ +D+GAS H+  D  ++           +        +
Sbjct: 268 SHGIAFMVKEVNNTSVMDNCGFVLDSGASDHLINDESLYTDSVEVVPPLKIAVAKQGEFI 327

Query: 235 VGIGDIELKFTSEKTLILKDVLHTPKIRKNLVSGFLLNKAGFTQSIGADLYTITKNGIFV 294
                  ++  ++  + L+DVL   +   NL+S   L +AG +        TI+KNG+ V
Sbjct: 328 YATKRGIVRLRNDHEITLEDVLFCKEAAGNLMSVKRLQEAGMSIEFDKSGVTISKNGLMV 387

Query: 295 GKGYATDGMFKLN----IDMNKISSSAYMLCDFNIWHSRLCHVN---------KRIISNM 341
            K     GM  LN    I+    S +A    +F +WH R  H++         K + S+ 
Sbjct: 388 VKN---SGM--LNNVPVINFQAYSINAKHKNNFRLWHERFGHISDGKLLEIKRKNMFSDQ 442

Query: 342 SGLGLIPKISLNDFEKCQFCSQAKINKESHKSVTRITEPFELIHSDLCELDGNLTRNGKR 401
           S L  + ++S    E C    QA++  +  K  T I  P  ++HSD+C     +T + K 
Sbjct: 443 SLLNNL-ELSCEICEPCLNGKQARLPFKQLKDKTHIKRPLFVVHSDVCGPITPVTLDDKN 501

Query: 402 YFITFIDDCSDYTHVYLMRNKNEALDIFKQYVKEIENQFNIRIKRFRSDRGTEYGSHIFN 461
           YF+ F+D  + Y   YL++ K++   +F+ +V + E  FN+++     D G EY S+   
Sbjct: 502 YFVIFVDQFTHYCVTYLIKYKSDVFSMFQDFVAKSEAHFNLKVVYLYIDNGREYLSNEMR 561

Query: 462 EYYKELGIIHETTAPYSPEMNGKAERKNRTFTELVVATMLNSGAAPHWWGEILLTVCYVL 521
           ++  + GI +  T P++P++NG +ER  RT TE     +  +     +WGE +LT  Y++
Sbjct: 562 QFCVKKGISYHLTVPHTPQLNGVSERMIRTITEKARTMVSGAKLDKSFWGEAVLTATYLI 621

Query: 522 NRVPK---TKNKISPYEILKKRQPNLSYFRTWGCLAYVRKPDPKRVKLASRAYECAFIGY 578
           NR+P      +  +PYE+   ++P L + R +G   YV   + K+ K   ++++  F+GY
Sbjct: 622 NRIPSRALVDSSKTPYEMWHNKKPYLKHLRVFGATVYVHIKN-KQGKFDDKSFKSIFVGY 680

Query: 579 ALNSKAYRFYDLKSKTIIESNDV 601
             N   ++ +D  ++  I + DV
Sbjct: 681 EPN--GFKLWDAVNEKFIVARDV 701


>YCH4_YEAST (P25600) Transposon Ty5-1 34.5 kDa hypothetical protein
          Length = 308

 Score =  171 bits (432), Expect = 1e-41
 Identities = 109/299 (36%), Positives = 159/299 (52%), Gaps = 4/299 (1%)

Query: 782  MDVKTAFLNGELEEEIYMDQPEGFVIHGQENKVCKLDKSLYGLKQAPKQWHEKFDNLMIE 841
            MDV TAFLN  ++E IY+ QP GFV     + V +L   +YGLKQAP  W+E  +N + +
Sbjct: 1    MDVDTAFLNSTMDEPIYVKQPPGFVNERNPDYVWELYGGMYGLKQAPLLWNEHINNTLKK 60

Query: 842  NEFKVNESDKCIYSKYENNTCTIICLYVDDLLIFGSNLNAIKDVKSLLCHNFDMKDLGKA 901
              F  +E +  +Y +  ++    I +YVDDLL+   +      VK  L   + MKDLGK 
Sbjct: 61   IGFCRHEGEHGLYFRSTSDGPIYIGVYVDDLLVAAPSPKIYDRVKQELTKLYSMKDLGKV 120

Query: 902  DVILGIKITRTDNG-ISLNQSHYVEKILRKYNYFYCKPASTPCDPSVKLFKNTGDSVRQ- 959
            D  LG+ I ++ NG I+L+   Y+ K   +      K   TP   S  LF+ T   ++  
Sbjct: 121  DKFLGLNIHQSTNGDITLSLQDYIAKAASESEINTFKLTQTPLCNSKPLFETTSPHLKDI 180

Query: 960  TEYASIIGSLRYATDCTRPDISYAVGLLCKFTSRPSMEHWQAIERVMRYLKKTMTLGLHY 1019
            T Y SI+G L +  +  RPDISY V LL +F   P   H ++  RV+RYL  T ++ L Y
Sbjct: 181  TPYQSIVGQLLFCANTGRPDISYPVSLLSRFLREPRAIHLESARRVLRYLYTTRSMCLKY 240

Query: 1020 QRYPAV-LEGYSDADWNNLSDDSKATSGYIFSIAGGAVSWKSKK-QTILAQSTMESEMI 1076
            +    V L  Y DA    + D   +T GY+  +AG  V+W SKK + ++   + E+E I
Sbjct: 241  RSGSQVALTVYCDASHGAIHDLPHSTGGYVTLLAGAPVTWSSKKLKGVIPVPSTEAEYI 299


>M810_ARATH (P92519) Hypothetical mitochondrial protein AtMg00810
            (ORF240b)
          Length = 240

 Score =  142 bits (359), Expect = 4e-33
 Identities = 77/233 (33%), Positives = 131/233 (56%), Gaps = 2/233 (0%)

Query: 865  ICLYVDDLLIFGSNLNAIKDVKSLLCHNFDMKDLGKADVILGIKITRTDNGISLNQSHYV 924
            + LYVDD+L+ GS+   +  +   L   F MKDLG     LGI+I    +G+ L+Q+ Y 
Sbjct: 3    LLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTKYA 62

Query: 925  EKILRKYNYFYCKPASTPCDPSVKLFKNTGDSVRQTEYASIIGSLRYATDCTRPDISYAV 984
            E+IL       CKP STP    +    +T      +++ SI+G+L+Y T  TRPDISYAV
Sbjct: 63   EQILNNAGMLDCKPMSTPLPLKLNSSVSTAKYPDPSDFRSIVGALQYLT-LTRPDISYAV 121

Query: 985  GLLCKFTSRPSMEHWQAIERVMRYLKKTMTLGLHYQRYPAV-LEGYSDADWNNLSDDSKA 1043
             ++C+    P++  +  ++RV+RY+K T+  GL+  +   + ++ + D+DW   +   ++
Sbjct: 122  NIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTSTRRS 181

Query: 1044 TSGYIFSIAGGAVSWKSKKQTILAQSTMESEMIALAAASEEASWLRCLLSEIP 1096
            T+G+   +    +SW +K+Q  +++S+ E+E  ALA  + E +W     S  P
Sbjct: 182  TTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTWSSASRSRDP 234


>YCB9_YEAST (P25384) Transposon Ty2 protein B (Ty1-17 protein B)
          Length = 1770

 Score =  132 bits (333), Expect = 4e-30
 Identities = 147/580 (25%), Positives = 257/580 (43%), Gaps = 59/580 (10%)

Query: 623  DNSVLDQPSEIITSNENIERDVIEPGRGKR-----ARIAKEYGPEYVAYTIEEDPSSIKE 677
            DN    + S    +N+N+    +EP R K+     A I      + V  T+  D  +I  
Sbjct: 1199 DNETEIEVSRDTWNNKNMRS--LEPPRSKKRINLIAAIKGVKSIKPVRTTLRYD-EAITY 1255

Query: 678  ALSSIDADLWQEAINDEMDSLMSNETWHLT------DLPPGCKTIGCKWILKKKLKPDGS 731
               + + D + EA + E+  L+   TW         D+ P  K I   +I  KK   DG+
Sbjct: 1256 NKDNKEKDRYVEAYHKEISQLLKMNTWDTNKYYDRNDIDPK-KVINSMFIFNKKR--DGT 1312

Query: 732  IDKYKARLVAKGFRQRENVDFFDTYSPVTRITSIRVLISLAAIHNLIVHQMDVKTAFLNG 791
               +KAR VA+G  Q  +    D  S      ++   +S+A  ++  + Q+D+ +A+L  
Sbjct: 1313 ---HKARFVARGDIQHPDTYDSDMQSNTVHHYALMTSLSIALDNDYYITQLDISSAYLYA 1369

Query: 792  ELEEEIYMDQPEGFVIHGQENKVCKLDKSLYGLKQAPKQWHEKFDNLMIE-NEFKVNESD 850
            +++EE+Y+  P      G  +K+ +L KSLYGLKQ+   W+E   + +I   + +     
Sbjct: 1370 DIKEELYIRPPPHL---GLNDKLLRLRKSLYGLKQSGANWYETIKSYLINCCDMQEVRGW 1426

Query: 851  KCIYSKYENNTCTIICLYVDDLLIFGSNLNAIKDVKSLLCHNFDMK--DLGKADVILGIK 908
             C++     N+   ICL+VDD+++F  +LNA K + + L   +D K  +LG++D  +   
Sbjct: 1427 SCVF----KNSQVTICLFVDDMILFSKDLNANKKIITTLKKQYDTKIINLGESDNEIQYD 1482

Query: 909  ITRTDNGISLNQSHYVEKILRKYNYFYCKPASTPCDPSVKLFKNTGD----------SVR 958
            I   +  I   +S Y++  + K         + P +P  K  +  G            + 
Sbjct: 1483 ILGLE--IKYQRSKYMKLGMEKSLTEKLPKLNVPLNPKGKKLRAPGQPGHYIDQDELEID 1540

Query: 959  QTEY-------ASIIGSLRYATDCTRPDISYAVGLLCKFTSRPSMEHWQAIERVMRYLKK 1011
            + EY         +IG   Y     R D+ Y +  L +    PS +       +++++  
Sbjct: 1541 EDEYKEKVHEMQKLIGLASYVGYKFRFDLLYYINTLAQHILFPSRQVLDMTYELIQFMWD 1600

Query: 1012 TMTLGLHYQRYPAV-----LEGYSDADWNNLSDDSKATSGYIFSIAGGAVSWKSKKQTIL 1066
            T    L + +         L   SDA + N     K+  G IF + G  +  KS K ++ 
Sbjct: 1601 TRDKQLIWHKNKPTKPDNKLVAISDASYGN-QPYYKSQIGNIFLLNGKVIGGKSTKASLT 1659

Query: 1067 AQSTMESEMIALAAASEEASWLRCLLSEIPLWERP-LPAVLIHCDSTAAIAKIENRYYNG 1125
              ST E+E+ A++ A    + L  L+ E  L ++P +  +L    ST +I K  N     
Sbjct: 1660 CTSTTEAEIHAVSEAIPLLNNLSHLVQE--LNKKPIIKGLLTDSRSTISIIKSTNE-EKF 1716

Query: 1126 KRRQIRRKHSTIREYLSNGTVRVDFVRTNENLADPLTKGL 1165
            + R    K   +R+ +S   + V ++ T +N+AD +TK L
Sbjct: 1717 RNRFFGTKAMRLRDEVSGNNLYVYYIETKKNIADVMTKPL 1756



 Score = 99.0 bits (245), Expect = 7e-20
 Identities = 108/442 (24%), Positives = 184/442 (41%), Gaps = 41/442 (9%)

Query: 198 VDTGASRHVCYDRDMFKTYTACDDQKVLLGDSHSTDVV--GIGDIELKFTSEKTLILKDV 255
           +D+GAS+ +   R     + A  + ++ + D+   D+    IG++   F +     +K  
Sbjct: 456 IDSGASQTLV--RSAHYLHHATPNSEINIVDAQKQDIPINAIGNLHFNFQNGTKTSIK-A 512

Query: 256 LHTPKIRKNLVSGFLLNKAGFT---------QSIGADLYTITKNGIF--VGKGYATDG-M 303
           LHTP I  +L+S   L     T         +S G  L  I K+G F  + K Y     +
Sbjct: 513 LHTPNIAYDLLSLSELANQNITACFTRNTLERSDGTVLAPIVKHGDFYWLSKKYLIPSHI 572

Query: 304 FKLNIDMNKISSSAYMLCDFNIWHSRLCHVNKRIISNMSGLGLIPKISLNDFE------- 356
            KL I+ N   S +     + + H  L H N R I        +  +  +D E       
Sbjct: 573 SKLTIN-NVNKSKSVNKYPYPLIHRMLGHANFRSIQKSLKKNAVTYLKESDIEWSNASTY 631

Query: 357 KCQFCSQAKINKESHKSVTRIT-----EPFELIHSDLCELDGNLTRNGKRYFITFIDDCS 411
           +C  C   K  K  H   +R+      EPF+ +H+D+     +L ++   YFI+F D+ +
Sbjct: 632 QCPDCLIGKSTKHRHVKGSRLKYQESYEPFQYLHTDIFGPVHHLPKSAPSYFISFTDEKT 691

Query: 412 DYTHVYLMRNKNEA--LDIFKQYVKEIENQFNIRIKRFRSDRGTEYGSHIFNEYYKELGI 469
            +  VY + ++ E   L++F   +  I+NQFN R+   + DRG+EY +   ++++   GI
Sbjct: 692 RFQWVYPLHDRREESILNVFTSILAFIKNQFNARVLVIQMDRGSEYTNKTLHKFFTNRGI 751

Query: 470 IHETTAPYSPEMNGKAERKNRTFTELVVATMLNSGAAPHWWGEILLTVCYVLNRVPKTKN 529
               T       +G AER NRT        +  SG   H W   +     + N +   KN
Sbjct: 752 TACYTTTADSRAHGVAERLNRTLLNDCRTLLHCSGLPNHLWFSAVEFSTIIRNSLVSPKN 811

Query: 530 KISPYEILKKRQPNLSYFRTWGCLAYVRKPDPKRVKLASRAYECAFIGYAL----NSKAY 585
             S  +       +++    +G    V   +P      S+ +     GYAL    NS  Y
Sbjct: 812 DKSARQHAGLAGLDITTILPFGQPVIVNNHNPD-----SKIHPRGIPGYALHPSRNSYGY 866

Query: 586 RFYDLKSKTIIESNDVDFYENK 607
             Y    K  +++ +    ++K
Sbjct: 867 IIYLPSLKKTVDTTNYVILQDK 888


>YJL3_YEAST (P47024) Transposon Ty4 207.7 kDa hypothetical protein
          Length = 1803

 Score =  130 bits (327), Expect = 2e-29
 Identities = 120/454 (26%), Positives = 204/454 (44%), Gaps = 41/454 (9%)

Query: 735  YKARLVAKGFRQRENVDFFDTYSPVTRIT----SIRVLISLAAIHNLIVHQMDVKTAFLN 790
            YKAR+V +G  Q       DTYS +T  +     I++ + +A   N+ +  +D+  AFL 
Sbjct: 1337 YKARIVCRGDTQSP-----DTYSVITTESLNHNHIKIFLMIANNRNMFMKTLDINHAFLY 1391

Query: 791  GELEEEIYMDQPEGFVIHGQENKVCKLDKSLYGLKQAPKQWHEKFDNLMIENEFKVNESD 850
             +LEEEIY+  P           V KL+K+LYGLKQ+PK+W++     +     K N   
Sbjct: 1392 AKLEEEIYIPHPH------DRRCVVKLNKALYGLKQSPKEWNDHLRQYLNGIGLKDNSYT 1445

Query: 851  KCIYSKYENNTCTIICLYVDDLLIFGSNLNAIKDVKSLLCHNFDMKDLGKA-DVILGIKI 909
              +Y   + N   +I +YVDD +I  SN   + +  + L  NF++K  G   D +L   I
Sbjct: 1446 PGLYQTEDKNL--MIAVYVDDCVIAASNEQRLDEFINKLKSNFELKITGTLIDDVLDTDI 1503

Query: 910  TRTD-------NGISLNQSHYVEKILRKYNYFYCK--PASTP------CDPSVKLFKNTG 954
               D         I L    ++ ++ +KYN    K   +S P       DP   + + + 
Sbjct: 1504 LGMDLVYNKRLGTIDLTLKSFINRMDKKYNEELKKIRKSSIPHMSTYKIDPKKDVLQMSE 1563

Query: 955  DSVRQ--TEYASIIGSLRYATDCTRPDISYAVGLLCKFTSRPSMEHWQAIERVMRYLKKT 1012
            +  RQ   +   ++G L Y     R DI +AV  + +  + P    +  I ++++YL + 
Sbjct: 1564 EEFRQGVLKLQQLLGELNYVRHKCRYDIEFAVKKVARLVNYPHERVFYMIYKIIQYLVRY 1623

Query: 1013 MTLGLHYQR---YPAVLEGYSDADWNNLSDDSKATSGYIFSIAGGAVSWKSKKQTILAQS 1069
              +G+HY R       +   +DA   +   D+++  G I        +  S K T    S
Sbjct: 1624 KDIGIHYDRDCNKDKKVIAITDASVGS-EYDAQSRIGVILWYGMNIFNVYSNKSTNRCVS 1682

Query: 1070 TMESEMIALAAASEEASWLRCLLSEIPLWERPLPAVLIHCDSTAAIAKIENRYYNGKRRQ 1129
            + E+E+ A+     ++  L+  L E  L E     +++  DS  AI  +   Y   K + 
Sbjct: 1683 STEAELHAIYEGYADSETLKVTLKE--LGEGDNNDIVMITDSKPAIQGLNRSYQQPKEKF 1740

Query: 1130 IRRKHSTIREYLSNGTVRVDFVRTNENLADPLTK 1163
               K   I+E +   ++++  +    N+AD LTK
Sbjct: 1741 TWIKTEIIKEKIKEKSIKLLKITGKGNIADLLTK 1774



 Score = 72.8 bits (177), Expect = 5e-12
 Identities = 103/446 (23%), Positives = 182/446 (40%), Gaps = 57/446 (12%)

Query: 198 VDTGASRHVCYDRDMFKTYTACDDQKVL--LGDSHSTDVVGIGDIELKF----TSEKTLI 251
           +DTG+  ++  D+ +   Y   +       +G + S  V G G I++K     T  K L+
Sbjct: 414 IDTGSGVNITNDKTLLHNYEDSNRSTRFFGIGKNSSVSVKGYGYIKIKNGHNNTDNKCLL 473

Query: 252 LKDVLHTPKIRKNLVSGFLLNKAGFTQSIGADLYTITKNGIFVGKGYATDGMFKLNIDMN 311
                + P+    ++S + L K   T+ + +  YT   N I   K    +G+  +++ MN
Sbjct: 474 ---TYYVPEEESTIISCYDLAKK--TKMVLSRKYTRLGNKIIKIKTKIVNGV--IHVKMN 526

Query: 312 KI----------------SSSAYMLCDFNIW----HSRLCHVNKRIISNM-------SGL 344
           ++                SS  + L   +I     H R+ H   + I N          L
Sbjct: 527 ELIERPSDDSKINAIKPTSSPGFKLNKRSITLEDAHKRMGHTGIQQIENSIKHNHYEESL 586

Query: 345 GLIPKISLNDFEKCQFCSQAKINKESHK--SVTRITEPFELIHSDLCELDGNLTRNG--- 399
            LI +   N+F  CQ C  +K  K +H   S+   +   E   S   ++ G ++ +    
Sbjct: 587 DLIKEP--NEFW-CQTCKISKATKRNHYTGSMNNHSTDHEPGSSWCMDIFGPVSSSNADT 643

Query: 400 KRYFITFIDDCSDY--THVYLMRNKNEALDIFKQYVKEIENQFNIRIKRFRSDRGTEYGS 457
           KRY +  +D+ + Y  T  +  +N    L   ++ ++ +E QF+ +++   SDRGTE+ +
Sbjct: 644 KRYMLIMVDNNTRYCMTSTHFNKNAETILAQVRKNIQYVETQFDRKVREINSDRGTEFTN 703

Query: 458 HIFNEYYKELGIIHETTAPYSPEMNGKAERKNRTFTELVVATMLNSGAAPHWWGEILLTV 517
               EY+   GI H  T+      NG+AER  RT        +  S     +W   + + 
Sbjct: 704 DQIEEYFISKGIHHILTSTQDHAANGRAERYIRTIITDATTLLRQSNLRVKFWEYAVTSA 763

Query: 518 CYVLNRVPKTKNKISPYEILKKRQP---NLSYFRTWGCLAYVRKPDPKRVKLASRAYECA 574
             + N +        P + +  RQP    L  F  +G    +   + K  KL        
Sbjct: 764 TNIRNYLEHKSTGKLPLKAI-SRQPVTVRLMSFLPFGEKGIIWNHNHK--KLKPSGLPSI 820

Query: 575 FIGYALNSKAYRFYDLKSKTIIESND 600
            +    NS  Y+F+ + SK  I ++D
Sbjct: 821 ILCKDPNSYGYKFF-IPSKNKIVTSD 845


>YMT5_YEAST (Q04214) Transposon Ty1 protein B
          Length = 1328

 Score =  128 bits (322), Expect = 8e-29
 Identities = 128/504 (25%), Positives = 231/504 (45%), Gaps = 41/504 (8%)

Query: 689  EAINDEMDSLMSNETWHLTDLPPGCKTIGCKWILKKKL----KPDGSIDKYKARLVAKGF 744
            EA + E++ L+  +TW  TD     K I  K ++        K DG+   +KAR VA+G 
Sbjct: 825  EAYHKEVNQLLKMKTWD-TDKYYDRKEIDPKRVINSMFIFNRKRDGT---HKARFVARGD 880

Query: 745  RQRENVDFFDTYSPVTRITSIRVLISLAAIHNLIVHQMDVKTAFLNGELEEEIYMDQPEG 804
             Q  +       S      ++   +SLA  +N  + Q+D+ +A+L  +++EE+Y+  P  
Sbjct: 881  IQHPDTYDSGMQSNTVHHYALMTSLSLALDNNYYITQLDISSAYLYADIKEELYIRPPPH 940

Query: 805  FVIHGQENKVCKLDKSLYGLKQAPKQWHEKFDNLMIEN-EFKVNESDKCIYSKYENNTCT 863
                G  +K+ +L KSLYGLKQ+   W+E   + +I+    +      C+   +EN+  T
Sbjct: 941  L---GMNDKLIRLKKSLYGLKQSGANWYETIKSYLIKQCGMEEVRGWSCV---FENSQVT 994

Query: 864  IICLYVDDLLIFGSNLNAIKDVKSLLCHNFDMK--DLGKADV-----ILGIKIT-RTDNG 915
             ICL+VDD+++F  NLN+ K +   L   +D K  +LG++D      ILG++I  +    
Sbjct: 995  -ICLFVDDMVLFSKNLNSNKRIIDKLKMQYDTKIINLGESDEEIQYDILGLEIKYQRGKY 1053

Query: 916  ISLNQSHYVEKILRKYNYFY---CKPASTPCDPSVKL------FKNTGDSVRQTEYASII 966
            + L   + + + + K N       +  S P  P + +       +     ++  E   +I
Sbjct: 1054 MKLGMENSLTEKIPKLNVPLNPKGRKLSAPGQPGLYIDQQELELEEDDYKMKVHEMQKLI 1113

Query: 967  GSLRYATDCTRPDISYAVGLLCKFTSRPSMEHWQAIERVMRYLKKTMTLGLHYQRYPAV- 1025
            G   Y     R D+ Y +  L +    PS +       +++++  T    L + +   V 
Sbjct: 1114 GLASYVGYKFRFDLLYYINTLAQHILFPSKQVLDMTYELIQFIWNTRDKQLIWHKSKPVK 1173

Query: 1026 ----LEGYSDADWNNLSDDSKATSGYIFSIAGGAVSWKSKKQTILAQSTMESEMIALAAA 1081
                L   SDA + N     K+  G I+ + G  +  KS K ++   ST E+E+ A++ +
Sbjct: 1174 PTNKLVVISDASYGN-QPYYKSQIGNIYLLNGKVIGGKSTKASLTCTSTTEAEIHAISES 1232

Query: 1082 SEEASWLRCLLSEIPLWERPLPAVLIHCDSTAAIAKIENRYYNGKRRQIRRKHSTIREYL 1141
                + L  L+ E+   ++P+   L+    +     I N     + R    K   +R+ +
Sbjct: 1233 VPLLNNLSYLIQELD--KKPITKGLLTDSKSTISIIISNNEEKFRNRFFGTKAMRLRDEV 1290

Query: 1142 SNGTVRVDFVRTNENLADPLTKGL 1165
            S   + V ++ T +N+AD +TK L
Sbjct: 1291 SGNHLHVCYIETKKNIADVMTKPL 1314



 Score =  106 bits (264), Expect = 4e-22
 Identities = 110/442 (24%), Positives = 182/442 (40%), Gaps = 41/442 (9%)

Query: 198 VDTGASRHVCYDRDMFKTYTACDDQKVLLGDSHSTDVVGIGDIELKFTSEKTLILKDVLH 257
           +D+GASR +        + ++  D  V+     +  +  IGD++  F       +K VLH
Sbjct: 33  LDSGASRTLIRSAHHIHSASSNPDINVVDAQKRNIPINAIGDLQFHFQDNTKTSIK-VLH 91

Query: 258 TPKIRKNLVSGFLLNK-------AGFTQSI-----GADLYTITKNGIF--VGKGYATDGM 303
           TP I  +L+S   LN+       A FT+++     G  L  I K G F  V K Y     
Sbjct: 92  TPNIAYDLLS---LNELAAVDITACFTKNVLERSDGTVLAPIVKYGDFYWVSKKYLLPSN 148

Query: 304 FKLNIDMNKISSSAYMLCDFNIWHSRLCHVNKRIISNMSGLGLIPKISLNDFE------- 356
             +    N  +S +     +   H  L H N + I        I   + +D +       
Sbjct: 149 ISVPTINNVHTSESTRKYPYPFIHRMLAHANAQTIRYSLKNNTITYFNESDVDWSSAIDY 208

Query: 357 KCQFCSQAKINKESHKSVTRIT-----EPFELIHSDLCELDGNLTRNGKRYFITFIDDCS 411
           +C  C   K  K  H   +R+      EPF+ +H+D+     NL ++   YFI+F D+ +
Sbjct: 209 QCPDCLIGKSTKHRHIKGSRLKYQNSYEPFQYLHTDIFGPVHNLPKSAPSYFISFTDETT 268

Query: 412 DYTHVYLMRNKNE--ALDIFKQYVKEIENQFNIRIKRFRSDRGTEYGSHIFNEYYKELGI 469
            +  VY + ++ E   LD+F   +  I+NQF   +   + DRG+EY +   +++ ++ GI
Sbjct: 269 KFRWVYPLHDRREDSILDVFTTILAFIKNQFQASVLVIQMDRGSEYTNRTLHKFLEKNGI 328

Query: 470 IHETTAPYSPEMNGKAERKNRTFTELVVATMLNSGAAPHWWGEILLTVCYVLNRVPKTKN 529
               T       +G AER NRT  +     +  SG   H W   +     V N +   K+
Sbjct: 329 TPCYTTTADSRAHGVAERLNRTLLDDCRTQLQCSGLPNHLWFSAIEFSTIVRNSLASPKS 388

Query: 530 KISPYEILKKRQPNLSYFRTWGCLAYVRKPDPKRVKLASRAYECAFIGYAL----NSKAY 585
           K S  +       ++S    +G    V   +P      S+ +     GYAL    NS  Y
Sbjct: 389 KKSARQHAGLAGLDISTLLPFGQPVIVNDHNPN-----SKIHPRGIPGYALHPSRNSYGY 443

Query: 586 RFYDLKSKTIIESNDVDFYENK 607
             Y    K  +++ +    + K
Sbjct: 444 IIYLPSLKKTVDTTNYVILQGK 465


>YMU0_YEAST (Q04670) Transposon Ty1 protein B
          Length = 1328

 Score =  128 bits (321), Expect = 1e-28
 Identities = 128/509 (25%), Positives = 234/509 (45%), Gaps = 51/509 (10%)

Query: 689  EAINDEMDSLMSNETWHLTDLPPGCKTIGCKWILKKKL----KPDGSIDKYKARLVAKGF 744
            +A + E++ L+  +TW  TD     K I  K ++        K DG+   +KAR VA+G 
Sbjct: 825  QAYHKEVNQLLKMKTWD-TDRYYDRKEIDPKRVINSMFIFNRKRDGT---HKARFVARG- 879

Query: 745  RQRENVDFFDTYSPVTRITSIR-----VLISLAAIHNLIVHQMDVKTAFLNGELEEEIYM 799
                ++   DTY P  +  ++        +SLA  +N  + Q+D+ +A+L  +++EE+Y+
Sbjct: 880  ----DIQHPDTYDPGMQSNTVHHYALMTSLSLALDNNYYITQLDISSAYLYADIKEELYI 935

Query: 800  DQPEGFVIHGQENKVCKLDKSLYGLKQAPKQWHEKFDNLMIEN-EFKVNESDKCIYSKYE 858
              P      G  +K+ +L KSLYGLKQ+   W+E   + +I+    +      C++    
Sbjct: 936  RPPPHL---GMNDKLIRLKKSLYGLKQSGANWYETIKSYLIKQCGMEEVRGWSCVFK--- 989

Query: 859  NNTCTIICLYVDDLLIFGSNLNAIKDVKSLLCHNFDMK--DLGKAD-----VILGIKIT- 910
             N+   ICL+VDD+++F  +LNA K + + L   +D K  +LG++D      ILG++I  
Sbjct: 990  -NSQVTICLFVDDMILFSKDLNANKKIITTLKKQYDTKIINLGESDNEIQYDILGLEIKY 1048

Query: 911  RTDNGISLNQSHYVEKILRKYNYFY---CKPASTPCDPSVKL------FKNTGDSVRQTE 961
            +    + L   + + + + K N       +  S P  P + +       +     ++  E
Sbjct: 1049 QRGKYMKLGMENSLTEKIPKLNVPLNPKGRKLSAPGQPGLYIDQQELELEEDDYKMKVHE 1108

Query: 962  YASIIGSLRYATDCTRPDISYAVGLLCKFTSRPSMEHWQAIERVMRYLKKTMTLGLHYQR 1021
               +IG   Y     R D+ Y +  L +    PS +       +++++  T    L + +
Sbjct: 1109 MQKLIGLASYVGYKFRFDLLYYINTLAQHILFPSKQVLDMTYELIQFIWNTRDKQLIWHK 1168

Query: 1022 YPAV-----LEGYSDADWNNLSDDSKATSGYIFSIAGGAVSWKSKKQTILAQSTMESEMI 1076
               V     L   SDA + N     K+  G I+ + G  +  KS K ++   ST E+E+ 
Sbjct: 1169 SKPVKPTNKLVVISDASYGN-QPYYKSQIGNIYLLNGKVIGGKSTKASLTCTSTTEAEIH 1227

Query: 1077 ALAAASEEASWLRCLLSEIPLWERPLPAVLIHCDSTAAIAKIENRYYNGKRRQIRRKHST 1136
            A++ +    + L  L+ E  L ++P+   L+    +     I N     + R    K   
Sbjct: 1228 AISESVPLLNNLSHLVQE--LNKKPITKGLLTDSKSTISIIISNNEEKFRNRFFGTKAMR 1285

Query: 1137 IREYLSNGTVRVDFVRTNENLADPLTKGL 1165
            +R+ +S   + V ++ T +N+AD +TK L
Sbjct: 1286 LRDEVSGNHLHVCYIETKKNIADVMTKPL 1314



 Score =  106 bits (264), Expect = 4e-22
 Identities = 110/442 (24%), Positives = 182/442 (40%), Gaps = 41/442 (9%)

Query: 198 VDTGASRHVCYDRDMFKTYTACDDQKVLLGDSHSTDVVGIGDIELKFTSEKTLILKDVLH 257
           +D+GASR +        + ++  D  V+     +  +  IGD++  F       +K VLH
Sbjct: 33  LDSGASRTLIRSAHHIHSASSNPDINVVDAQKRNIPINAIGDLQFHFQDNTKTSIK-VLH 91

Query: 258 TPKIRKNLVSGFLLNK-------AGFTQSI-----GADLYTITKNGIF--VGKGYATDGM 303
           TP I  +L+S   LN+       A FT+++     G  L  I K G F  V K Y     
Sbjct: 92  TPNIAYDLLS---LNELAAVDITACFTKNVLERSDGTVLAPIVKYGDFYWVSKKYLLPSN 148

Query: 304 FKLNIDMNKISSSAYMLCDFNIWHSRLCHVNKRIISNMSGLGLIPKISLNDFE------- 356
             +    N  +S +     +   H  L H N + I        I   + +D +       
Sbjct: 149 ISVPTINNVHTSESTRKYPYPFIHRMLAHANAQTIRYSLKNNTITYFNESDVDWSSAIDY 208

Query: 357 KCQFCSQAKINKESHKSVTRIT-----EPFELIHSDLCELDGNLTRNGKRYFITFIDDCS 411
           +C  C   K  K  H   +R+      EPF+ +H+D+     NL ++   YFI+F D+ +
Sbjct: 209 QCPDCLIGKSTKHRHIKGSRLKYQNSYEPFQYLHTDIFGPVHNLPKSAPSYFISFTDETT 268

Query: 412 DYTHVYLMRNKNE--ALDIFKQYVKEIENQFNIRIKRFRSDRGTEYGSHIFNEYYKELGI 469
            +  VY + ++ E   LD+F   +  I+NQF   +   + DRG+EY +   +++ ++ GI
Sbjct: 269 KFRWVYPLHDRREDSILDVFTTILAFIKNQFQASVLVIQMDRGSEYTNRTLHKFLEKNGI 328

Query: 470 IHETTAPYSPEMNGKAERKNRTFTELVVATMLNSGAAPHWWGEILLTVCYVLNRVPKTKN 529
               T       +G AER NRT  +     +  SG   H W   +     V N +   K+
Sbjct: 329 TPCYTTTADSRAHGVAERLNRTLLDDCRTQLQCSGLPNHLWFSAIEFSTIVRNSLASPKS 388

Query: 530 KISPYEILKKRQPNLSYFRTWGCLAYVRKPDPKRVKLASRAYECAFIGYAL----NSKAY 585
           K S  +       ++S    +G    V   +P      S+ +     GYAL    NS  Y
Sbjct: 389 KKSARQHAGLAGLDISTLLPFGQPVIVNDHNPN-----SKIHPRGIPGYALHPSRNSYGY 443

Query: 586 RFYDLKSKTIIESNDVDFYENK 607
             Y    K  +++ +    + K
Sbjct: 444 IIYLPSLKKTVDTTNYVILQGK 465


>YJZ9_YEAST (P47100) Transposon Ty1 protein B
          Length = 1755

 Score =  128 bits (321), Expect = 1e-28
 Identities = 126/504 (25%), Positives = 229/504 (45%), Gaps = 41/504 (8%)

Query: 689  EAINDEMDSLMSNETWHLTDLPPGCKTIGCKWILKKKL----KPDGSIDKYKARLVAKGF 744
            EA + E++ L+  +TW  TD     K I  K ++        K DG+   +KAR VA+G 
Sbjct: 1252 EAYHKEVNQLLKMKTWD-TDEYYDRKEIDPKRVINSMFIFNKKRDGT---HKARFVARGD 1307

Query: 745  RQRENVDFFDTYSPVTRITSIRVLISLAAIHNLIVHQMDVKTAFLNGELEEEIYMDQPEG 804
             Q  +       S      ++   +SLA  +N  + Q+D+ +A+L  +++EE+Y+  P  
Sbjct: 1308 IQHPDTYDSGMQSNTVHHYALMTSLSLALDNNYYITQLDISSAYLYADIKEELYIRPPPH 1367

Query: 805  FVIHGQENKVCKLDKSLYGLKQAPKQWHEKFDNLMIEN-EFKVNESDKCIYSKYENNTCT 863
                G  +K+ +L KSLYGLKQ+   W+E   + +I+    +      C++     N+  
Sbjct: 1368 L---GMNDKLIRLKKSLYGLKQSGANWYETIKSYLIQQCGMEEVRGWSCVF----KNSQV 1420

Query: 864  IICLYVDDLLIFGSNLNAIKDVKSLLCHNFDMK--DLGKADV-----ILGIKIT-RTDNG 915
             ICL+VDD+++F  NLN+ K +   L   +D K  +LG++D      ILG++I  +    
Sbjct: 1421 TICLFVDDMVLFSKNLNSNKRIIEKLKMQYDTKIINLGESDEEIQYDILGLEIKYQRGKY 1480

Query: 916  ISLNQSHYVEKILRKYNYFY---CKPASTPCDPSVKL------FKNTGDSVRQTEYASII 966
            + L   + + + + K N       +  S P  P + +       +     ++  E   +I
Sbjct: 1481 MKLGMENSLTEKIPKLNVPLNPKGRKLSAPGQPGLYIDQQELELEEDDYKMKVHEMQKLI 1540

Query: 967  GSLRYATDCTRPDISYAVGLLCKFTSRPSMEHWQAIERVMRYLKKTMTLGLHYQRYPAV- 1025
            G   Y     R D+ Y +  L +    PS +       +++++  T    L + +   V 
Sbjct: 1541 GLASYVGYKFRFDLLYYINTLAQHILFPSKQVLDMTYELIQFIWNTRDKQLIWHKSKPVK 1600

Query: 1026 ----LEGYSDADWNNLSDDSKATSGYIFSIAGGAVSWKSKKQTILAQSTMESEMIALAAA 1081
                L   SDA + N     K+  G I+ + G  +  KS K ++   ST E+E+ A++ +
Sbjct: 1601 PTNKLVVISDASYGN-QPYYKSQIGNIYLLNGKVIGGKSTKASLTCTSTTEAEIHAISES 1659

Query: 1082 SEEASWLRCLLSEIPLWERPLPAVLIHCDSTAAIAKIENRYYNGKRRQIRRKHSTIREYL 1141
                + L  L+ E+   ++P+   L+    +     I N     + R    K   +R+ +
Sbjct: 1660 VPLLNNLSYLIQELD--KKPITKGLLTDSKSTISIIISNNEEKFRNRFFGTKAMRLRDEV 1717

Query: 1142 SNGTVRVDFVRTNENLADPLTKGL 1165
            S   + V ++ T +N+AD +TK L
Sbjct: 1718 SGNHLHVCYIETKKNIADVMTKPL 1741



 Score =  107 bits (268), Expect = 1e-22
 Identities = 128/546 (23%), Positives = 219/546 (39%), Gaps = 53/546 (9%)

Query: 105 KPFKNQNRPMNKNSNRNKTGNNSRPQI----QQPPKNDAAPPFNCYNCGQADHMARKCRN 160
           KP   +N    KN +R+ T N ++P++     Q   N  +     +N   +++      +
Sbjct: 357 KPNYRRNPSDEKNDSRSYT-NTTKPKVIARNPQKTNNSKSKTARAHNVSTSNNSPSTDND 415

Query: 161 RTNRPAQAHMATDAAPDEPYVAMITE--INMIAGSDG-----WWVDTGASRHVCYDRDMF 213
             ++     +  +   D      +TE  +N    SD        +D+GASR +       
Sbjct: 416 SISKSTTEPIQLNNKHDLILGQKLTESTVNHTNHSDDELPGHLLLDSGASRTLIRSAHHI 475

Query: 214 KTYTACDDQKVLLGDSHSTDVVGIGDIELKFTSEKTLILKDVLHTPKIRKNLVSGFLLNK 273
            + ++  D  V+     +  +  IGD++  F       +K VLHTP I  +L+S   LN+
Sbjct: 476 HSASSNPDINVVDAQKRNIPINAIGDLQFHFQDNTKTSIK-VLHTPNIAYDLLS---LNE 531

Query: 274 -------AGFTQSI-----GADLYTITKNGIF--VGKGYATDGMFKLNIDMNKISSSAYM 319
                  A FT+++     G  L  I K G F  V K Y       +    N  +S +  
Sbjct: 532 LAAVDITACFTKNVLERSDGTVLAPIVKYGDFYWVSKKYLLPSNISVPTINNVHTSESTR 591

Query: 320 LCDFNIWHSRLCHVNKRIISNMSGLGLIPKISLNDFE-------KCQFCSQAKINKESHK 372
              +   H  L H N + I        I   + +D +       +C  C   K  K  H 
Sbjct: 592 KYPYPFIHRMLAHANAQTIRYSLKNNTITYFNESDVDWSSAIDYQCPDCLIGKSTKHRHI 651

Query: 373 SVTRIT-----EPFELIHSDLCELDGNLTRNGKRYFITFIDDCSDYTHVYLMRNKNE--A 425
             +R+      EPF+ +H+D+     NL ++   YFI+F D+ + +  VY + ++ E   
Sbjct: 652 KGSRLKYQNSYEPFQYLHTDIFGPVHNLPKSAPSYFISFTDETTKFRWVYPLHDRREDSI 711

Query: 426 LDIFKQYVKEIENQFNIRIKRFRSDRGTEYGSHIFNEYYKELGIIHETTAPYSPEMNGKA 485
           LD+F   +  I+NQF   +   + DRG+EY +   +++ ++ GI    T       +G A
Sbjct: 712 LDVFTTILAFIKNQFQASVLVIQMDRGSEYTNRTLHKFLEKNGITPCYTTTADSRAHGVA 771

Query: 486 ERKNRTFTELVVATMLNSGAAPHWWGEILLTVCYVLNRVPKTKNKISPYEILKKRQPNLS 545
           ER NRT  +     +  SG   H W   +     V N +   K+K S  +       ++S
Sbjct: 772 ERLNRTLLDDCRTQLQCSGLPNHLWFSAIEFSTIVRNSLASPKSKKSARQHAGLAGLDIS 831

Query: 546 YFRTWGCLAYVRKPDPKRVKLASRAYECAFIGYAL----NSKAYRFYDLKSKTIIESNDV 601
               +G    V   +P      S+ +     GYAL    NS  Y  Y    K  +++ + 
Sbjct: 832 TLLPFGQPVIVNDHNPN-----SKIHPRGIPGYALHPSRNSYGYIIYLPSLKKTVDTTNY 886

Query: 602 DFYENK 607
              + K
Sbjct: 887 VILQGK 892


>YME4_YEAST (Q04711) Transposon Ty1 protein B
          Length = 1328

 Score =  125 bits (315), Expect = 5e-28
 Identities = 127/504 (25%), Positives = 230/504 (45%), Gaps = 41/504 (8%)

Query: 689  EAINDEMDSLMSNETWHLTDLPPGCKTIGCKWILKKKL----KPDGSIDKYKARLVAKGF 744
            EA + E++ L+   TW  TD     K I  K ++        K DG+   +KAR VA+G 
Sbjct: 825  EAYHKEVNQLLKMNTWD-TDKYYDRKEIDPKRVINSMFIFNRKRDGT---HKARFVARGD 880

Query: 745  RQRENVDFFDTYSPVTRITSIRVLISLAAIHNLIVHQMDVKTAFLNGELEEEIYMDQPEG 804
             Q  +       S      ++   +SLA  +N  + Q+D+ +A+L  +++EE+Y+  P  
Sbjct: 881  IQHPDTYDSGMQSNTVHHYALMTSLSLALDNNYYITQLDISSAYLYADIKEELYIRPPPH 940

Query: 805  FVIHGQENKVCKLDKSLYGLKQAPKQWHEKFDNLMIEN-EFKVNESDKCIYSKYENNTCT 863
                G  +K+ +L KSLYGLKQ+   W+E   + +I+    +      C++     N+  
Sbjct: 941  L---GMNDKLIRLKKSLYGLKQSGANWYETIKSYLIKQCGMEEVRGWSCVF----KNSQV 993

Query: 864  IICLYVDDLLIFGSNLNAIKDVKSLLCHNFDMK--DLGKAD-----VILGIKIT-RTDNG 915
             ICL+VDD+++F  +LNA K + + L   +D K  +LG++D      ILG++I  +    
Sbjct: 994  TICLFVDDMILFSKDLNANKKIITTLKKQYDTKIINLGESDNEIQYDILGLEIKYQRGKY 1053

Query: 916  ISLNQSHYVEKILRKYNYFY---CKPASTPCDPSVKLFKN----TGDSVRQT--EYASII 966
            + L   + + + + K N       +  S P  P + + ++      D  ++   E   +I
Sbjct: 1054 MKLGMENSLTEKIPKLNVPLNPKGRKLSAPGQPGLYIDQDELEIDEDEYKEKVHEMQKLI 1113

Query: 967  GSLRYATDCTRPDISYAVGLLCKFTSRPSMEHWQAIERVMRYLKKTMTLGLHYQRYPAV- 1025
            G   Y     R D+ Y +  L +    PS +       +++++  T    L + +     
Sbjct: 1114 GLASYVGYKFRFDLLYYINTLAQHILFPSRQVLDMTYELIQFMWDTRDKQLIWHKNKPTE 1173

Query: 1026 ----LEGYSDADWNNLSDDSKATSGYIFSIAGGAVSWKSKKQTILAQSTMESEMIALAAA 1081
                L   SDA + N     K+  G I+ + G  +  KS K ++   ST E+E+ A++ +
Sbjct: 1174 PDNKLVAISDASYGN-QPYYKSQIGNIYLLNGKVIGGKSTKASLTCTSTTEAEIHAISES 1232

Query: 1082 SEEASWLRCLLSEIPLWERPLPAVLIHCDSTAAIAKIENRYYNGKRRQIRRKHSTIREYL 1141
                + L  L+ E  L ++P+   L+    +     I N     + R    K   +R+ +
Sbjct: 1233 VPLLNNLSHLVQE--LNKKPITKGLLTDSKSTISIIISNNEEKFRNRFFGTKAMRLRDEV 1290

Query: 1142 SNGTVRVDFVRTNENLADPLTKGL 1165
            S   + V ++ T +N+AD +TK L
Sbjct: 1291 SGNHLHVCYIETKKNIADVMTKPL 1314



 Score =  106 bits (264), Expect = 4e-22
 Identities = 110/442 (24%), Positives = 182/442 (40%), Gaps = 41/442 (9%)

Query: 198 VDTGASRHVCYDRDMFKTYTACDDQKVLLGDSHSTDVVGIGDIELKFTSEKTLILKDVLH 257
           +D+GASR +        + ++  D  V+     +  +  IGD++  F       +K VLH
Sbjct: 33  LDSGASRTLIRSAHHIHSASSNPDINVVDAQKRNIPINAIGDLQFHFQDNTKTSIK-VLH 91

Query: 258 TPKIRKNLVSGFLLNK-------AGFTQSI-----GADLYTITKNGIF--VGKGYATDGM 303
           TP I  +L+S   LN+       A FT+++     G  L  I K G F  V K Y     
Sbjct: 92  TPNIAYDLLS---LNELAAVDITACFTKNVLERSDGTVLAPIVKYGDFYWVSKKYLLPSN 148

Query: 304 FKLNIDMNKISSSAYMLCDFNIWHSRLCHVNKRIISNMSGLGLIPKISLNDFE------- 356
             +    N  +S +     +   H  L H N + I        I   + +D +       
Sbjct: 149 ISVPTINNVHTSESTRKYPYPFIHRMLAHANAQTIRYSLKNNTITYFNESDVDWSSAIDY 208

Query: 357 KCQFCSQAKINKESHKSVTRIT-----EPFELIHSDLCELDGNLTRNGKRYFITFIDDCS 411
           +C  C   K  K  H   +R+      EPF+ +H+D+     NL ++   YFI+F D+ +
Sbjct: 209 QCPDCLIGKSTKHRHIKGSRLKYQNSYEPFQYLHTDIFGPVHNLPKSAPSYFISFTDETT 268

Query: 412 DYTHVYLMRNKNE--ALDIFKQYVKEIENQFNIRIKRFRSDRGTEYGSHIFNEYYKELGI 469
            +  VY + ++ E   LD+F   +  I+NQF   +   + DRG+EY +   +++ ++ GI
Sbjct: 269 KFRWVYPLHDRREDSILDVFTTILAFIKNQFQASVLVIQMDRGSEYTNRTLHKFLEKNGI 328

Query: 470 IHETTAPYSPEMNGKAERKNRTFTELVVATMLNSGAAPHWWGEILLTVCYVLNRVPKTKN 529
               T       +G AER NRT  +     +  SG   H W   +     V N +   K+
Sbjct: 329 TPCYTTTADSRAHGVAERLNRTLLDDCRTQLQCSGLPNHLWFSAIEFSTIVRNSLASPKS 388

Query: 530 KISPYEILKKRQPNLSYFRTWGCLAYVRKPDPKRVKLASRAYECAFIGYAL----NSKAY 585
           K S  +       ++S    +G    V   +P      S+ +     GYAL    NS  Y
Sbjct: 389 KKSARQHAGLAGLDISTLLPFGQPVIVNDHNPN-----SKIHPRGIPGYALHPSRNSYGY 443

Query: 586 RFYDLKSKTIIESNDVDFYENK 607
             Y    K  +++ +    + K
Sbjct: 444 IIYLPSLKKTVDTTNYVILQGK 465


>YMD9_YEAST (Q03434) Transposon Ty1 protein B
          Length = 1328

 Score =  125 bits (314), Expect = 7e-28
 Identities = 130/505 (25%), Positives = 234/505 (45%), Gaps = 43/505 (8%)

Query: 689  EAINDEMDSLMSNETWHLTDLPPGCKTIGCKWILKKKL----KPDGSIDKYKARLVAKGF 744
            EA + E++ L+  +TW  TD     K I  K ++        K DG+   +KAR VA+G 
Sbjct: 825  EAYHKEVNQLLKMKTWD-TDEYYDRKEIDPKRVINSMFIFNKKRDGT---HKARFVARGD 880

Query: 745  RQRENVDFFDTYSPVTRITSIRVLISLAAIHNLIVHQMDVKTAFLNGELEEEIYMDQPEG 804
             Q  +       S      ++   +SLA  +N  + Q+D+ +A+L  +++EE+Y+  P  
Sbjct: 881  IQHPDTYDSGMQSNTVHHYALMTSLSLALDNNYYITQLDISSAYLYADIKEELYIRPPPH 940

Query: 805  FVIHGQENKVCKLDKSLYGLKQAPKQWHEKFDNLMIEN-EFKVNESDKCIYSKYENNTCT 863
                G  +K+ +L KSLYGLKQ+   W+E   + +I+    +      C++     N+  
Sbjct: 941  L---GMNDKLIRLKKSLYGLKQSGANWYETIKSYLIKQCGMEEVRGWSCVF----KNSQV 993

Query: 864  IICLYVDDLLIFGSNLNAIKDVKSLLCHNFDMK--DLGKAD-----VILGIKIT-RTDNG 915
             ICL+VDD+++F  +LNA K + + L   +D K  +LG++D      ILG++I  +    
Sbjct: 994  TICLFVDDMILFSKDLNANKKIITTLKKQYDTKIINLGESDNEIQYDILGLEIKYQRGKY 1053

Query: 916  ISLNQSHYVEKILRKYNYFY---CKPASTPCDPSVKLFKN----TGDSVRQT--EYASII 966
            + L   + + + + K N       +  S P  P + + ++      D  ++   E   +I
Sbjct: 1054 MKLGMENSLTEKIPKLNVPLNPKGRKLSAPGQPGLYIDQDELEIDEDEYKEKVHEMQKLI 1113

Query: 967  GSLRYATDCTRPDISYAVGLLCKFTSRPSMEHWQAIERVMRYLKKTMTLGLHYQRYPAV- 1025
            G   Y     R D+ Y +  L +    PS +       +++++  T    L + +     
Sbjct: 1114 GLASYVGYKFRFDLLYYINTLAQHILFPSRQVLDMTYELIQFMWDTRDKQLIWHKNKPTE 1173

Query: 1026 ----LEGYSDADWNNLSDDSKATSGYIFSIAGGAVSWKSKKQTILAQSTMESEMIALAAA 1081
                L   SDA + N     K+  G I+ + G  +  KS K ++   ST E+E+ A++ +
Sbjct: 1174 PDNKLVAISDASYGN-QPYYKSQIGNIYLLNGKVIGGKSTKASLTCTSTTEAEIHAISES 1232

Query: 1082 SEEASWLRCLLSEIPLWERP-LPAVLIHCDSTAAIAKIENRYYNGKRRQIRRKHSTIREY 1140
                + L  L+ E  L ++P +  +L    ST +I K  N     + R    K   +R+ 
Sbjct: 1233 VPLLNNLSYLIQE--LNKKPIIKGLLTDSRSTISIIKSTNE-EKFRNRFFGTKAMRLRDE 1289

Query: 1141 LSNGTVRVDFVRTNENLADPLTKGL 1165
            +S   + V ++ T +N+AD +TK L
Sbjct: 1290 VSGNNLYVYYIETKKNIADVMTKPL 1314



 Score =  106 bits (265), Expect = 3e-22
 Identities = 110/442 (24%), Positives = 182/442 (40%), Gaps = 41/442 (9%)

Query: 198 VDTGASRHVCYDRDMFKTYTACDDQKVLLGDSHSTDVVGIGDIELKFTSEKTLILKDVLH 257
           +D+GASR +        + ++  D  V+     +  +  IGD++  F       +K VLH
Sbjct: 33  LDSGASRTLIRSAHHIHSASSNPDINVVDAQKRNIPINAIGDLQFHFQDNTKTSIK-VLH 91

Query: 258 TPKIRKNLVSGFLLNK-------AGFTQSI-----GADLYTITKNGIF--VGKGYATDGM 303
           TP I  +L+S   LN+       A FT+++     G  L  I K G F  V K Y     
Sbjct: 92  TPNIAYDLLS---LNELAAVDITACFTKNVLERSDGTVLAPIVKYGDFYWVSKKYLLPSN 148

Query: 304 FKLNIDMNKISSSAYMLCDFNIWHSRLCHVNKRIISNMSGLGLIPKISLNDFEK------ 357
             +    N  +S +     +   H  L H N + I        I   + +D ++      
Sbjct: 149 ISVPTINNVHTSESTRKYPYPFIHRMLAHANAQTIRYSLKNNTITYFNESDVDRSSAIDY 208

Query: 358 -CQFCSQAKINKESHKSVTRIT-----EPFELIHSDLCELDGNLTRNGKRYFITFIDDCS 411
            C  C   K  K  H   +R+      EPF+ +H+D+     NL ++   YFI+F D+ +
Sbjct: 209 QCPDCLIGKSTKHRHIKGSRLKYQNSYEPFQYLHTDIFGPVHNLPKSAPSYFISFTDETT 268

Query: 412 DYTHVYLMRNKNE--ALDIFKQYVKEIENQFNIRIKRFRSDRGTEYGSHIFNEYYKELGI 469
            +  VY + ++ E   LD+F   +  I+NQF   +   + DRG+EY +   +++ ++ GI
Sbjct: 269 KFRWVYPLHDRREDSILDVFTTILAFIKNQFQASVLVIQMDRGSEYTNRTLHKFLEKNGI 328

Query: 470 IHETTAPYSPEMNGKAERKNRTFTELVVATMLNSGAAPHWWGEILLTVCYVLNRVPKTKN 529
               T       +G AER NRT  +     +  SG   H W   +     V N +   K+
Sbjct: 329 TPCYTTTADSRAHGVAERLNRTLLDDCRTQLQCSGLPNHLWFSAIEFSTIVRNSLASPKS 388

Query: 530 KISPYEILKKRQPNLSYFRTWGCLAYVRKPDPKRVKLASRAYECAFIGYAL----NSKAY 585
           K S  +       ++S    +G    V   +P      S+ +     GYAL    NS  Y
Sbjct: 389 KKSARQHAGLAGLDISTLLPFGQPVIVNDHNPN-----SKIHPRGIPGYALHPSRNSYGY 443

Query: 586 RFYDLKSKTIIESNDVDFYENK 607
             Y    K  +++ +    + K
Sbjct: 444 IIYLPSLKKTVDTTNYVILQGK 465


>YJZ7_YEAST (P47098) Transposon Ty1 protein B
          Length = 1755

 Score =  124 bits (311), Expect = 2e-27
 Identities = 132/507 (26%), Positives = 237/507 (46%), Gaps = 47/507 (9%)

Query: 689  EAINDEMDSLMSNETWHLTDLPPGCKTIGCKWILKKKL----KPDGSIDKYKARLVAKGF 744
            EA + E++ L+  +TW  TD     K I  K ++        K DG+   +KAR VA+G 
Sbjct: 1252 EAYHKEVNQLLKMKTWD-TDEYYDRKEIDPKRVINSMFIFNKKRDGT---HKARFVARG- 1306

Query: 745  RQRENVDFFDT--YSPVTRITSIRVLISLAAIHNLIVHQMDVKTAFLNGELEEEIYMDQP 802
               ++ D +DT   S      ++   +SLA  +N  + Q+D+ +A+L  +++EE+Y+  P
Sbjct: 1307 -DIQHPDTYDTGMQSNTVHHYALMTSLSLALDNNYYITQLDISSAYLYADIKEELYIRPP 1365

Query: 803  EGFVIHGQENKVCKLDKSLYGLKQAPKQWHEKFDNLMIEN-EFKVNESDKCIYSKYENNT 861
                  G  +K+ +L KS YGLKQ+   W+E   + +I+    +      C++     N+
Sbjct: 1366 PHL---GMNDKLIRLKKSHYGLKQSGANWYETIKSYLIKQCGMEEVRGWSCVF----KNS 1418

Query: 862  CTIICLYVDDLLIFGSNLNAIKDVKSLLCHNFDMK--DLGKAD-----VILGIKIT-RTD 913
               ICL+VDD+++F  +LNA K + + L   +D K  +LG++D      ILG++I  +  
Sbjct: 1419 QVTICLFVDDMILFSKDLNANKKIITTLKKQYDTKIINLGESDNEIQYDILGLEIKYQRG 1478

Query: 914  NGISLNQSHYVEKILRKYNYFY---CKPASTPCDPSVKLFKN----TGDSVRQT--EYAS 964
              + L   + + + + K N       +  S P  P + + ++      D  ++   E   
Sbjct: 1479 KYMKLGMENSLTEKIPKLNVPLNPKGRKLSAPGQPGLYIDQDELEIDEDEYKEKVHEMQK 1538

Query: 965  IIGSLRYATDCTRPDISYAVGLLCKFTSRPSMEHWQAIERVMRYLKKTMTLGLHYQRYPA 1024
            +IG   Y     R D+ Y +  L +    PS +       +++++  T    L + +   
Sbjct: 1539 LIGLASYVGYKFRFDLLYYINTLAQHILFPSRQVLDMTYELIQFMWDTRDKQLIWHKNKP 1598

Query: 1025 V-----LEGYSDADWNNLSDDSKATSGYIFSIAGGAVSWKSKKQTILAQSTMESEMIALA 1079
                  L   SDA + N     K+  G IF + G  +  KS K ++   ST E+E+ A++
Sbjct: 1599 TEPDNKLVAISDASYGN-QPYYKSQIGNIFLLNGKVIGGKSTKASLTCTSTTEAEIHAIS 1657

Query: 1080 AASEEASWLRCLLSEIPLWERP-LPAVLIHCDSTAAIAKIENRYYNGKRRQIRRKHSTIR 1138
             +    + L  L+ E  L ++P +  +L    ST +I K  N     + R    K   +R
Sbjct: 1658 ESVPLLNNLSYLIQE--LNKKPIIKGLLTDSRSTISIIKSTNE-EKFRNRFFGTKAMRLR 1714

Query: 1139 EYLSNGTVRVDFVRTNENLADPLTKGL 1165
            + +S   + V ++ T +N+AD +TK L
Sbjct: 1715 DEVSGNNLYVYYIETKKNIADVMTKPL 1741



 Score =  106 bits (265), Expect = 3e-22
 Identities = 127/546 (23%), Positives = 219/546 (39%), Gaps = 53/546 (9%)

Query: 105 KPFKNQNRPMNKNSNRNKTGNNSRPQI----QQPPKNDAAPPFNCYNCGQADHMARKCRN 160
           KP   +N    KN +R+ T N ++P++     Q   N  +     +N   +++      +
Sbjct: 357 KPNYRRNLSDEKNDSRSYT-NTTKPKVIARNPQKTNNSKSKTARAHNVSTSNNSPSTDND 415

Query: 161 RTNRPAQAHMATDAAPDEPYVAMITE--INMIAGSDG-----WWVDTGASRHVCYDRDMF 213
             ++     +  +   D      +TE  +N    SD        +D+GASR +       
Sbjct: 416 SISKSTTEPIQLNNKHDLTLGQELTESTVNHTNHSDDELPGHLLLDSGASRTLIRSAHHI 475

Query: 214 KTYTACDDQKVLLGDSHSTDVVGIGDIELKFTSEKTLILKDVLHTPKIRKNLVSGFLLNK 273
            + ++  D  V+     +  +  IGD++  F       +K VLHTP I  +L+S   LN+
Sbjct: 476 HSASSNPDINVVDAQKRNIPINAIGDLQFHFQDNTKTSIK-VLHTPNIAYDLLS---LNE 531

Query: 274 -------AGFTQSI-----GADLYTITKNGIF--VGKGYATDGMFKLNIDMNKISSSAYM 319
                  A FT+++     G  L  I + G F  V K Y       +    N  +S +  
Sbjct: 532 LAAVDITACFTKNVLERSDGTVLAPIVQYGDFYWVSKRYLLPSNISVPTINNVHTSESTR 591

Query: 320 LCDFNIWHSRLCHVNKRIISNMSGLGLIPKISLNDFE-------KCQFCSQAKINKESHK 372
              +   H  L H N + I        I   + +D +       +C  C   K  K  H 
Sbjct: 592 KYPYPFIHRMLAHANAQTIRYSLKNNTITYFNESDVDWSSAIDYQCPDCLIGKSTKHRHI 651

Query: 373 SVTRIT-----EPFELIHSDLCELDGNLTRNGKRYFITFIDDCSDYTHVYLMRNKNE--A 425
             +R+      EPF+ +H+D+     NL ++   YFI+F D+ + +  VY + ++ E   
Sbjct: 652 KGSRLKYQNSYEPFQYLHTDIFGPVHNLPKSAPSYFISFTDETTKFRWVYPLHDRREDSI 711

Query: 426 LDIFKQYVKEIENQFNIRIKRFRSDRGTEYGSHIFNEYYKELGIIHETTAPYSPEMNGKA 485
           LD+F   +  I+NQF   +   + DRG+EY +   +++ ++ GI    T       +G A
Sbjct: 712 LDVFTTILAFIKNQFQASVLVIQMDRGSEYTNRTLHKFLEKNGITPCYTTTADSRAHGVA 771

Query: 486 ERKNRTFTELVVATMLNSGAAPHWWGEILLTVCYVLNRVPKTKNKISPYEILKKRQPNLS 545
           ER NRT  +     +  SG   H W   +     V N +   K+K S  +       ++S
Sbjct: 772 ERLNRTLLDDCRTQLQCSGLPNHLWFSAIEFSTIVRNSLASPKSKKSARQHAGLAGLDIS 831

Query: 546 YFRTWGCLAYVRKPDPKRVKLASRAYECAFIGYAL----NSKAYRFYDLKSKTIIESNDV 601
               +G    V   +P      S+ +     GYAL    NS  Y  Y    K  +++ + 
Sbjct: 832 TLLPFGQPVIVNDHNPN-----SKIHPRGIPGYALHPSRNSYGYIIYLPSLKKTVDTTNY 886

Query: 602 DFYENK 607
              + K
Sbjct: 887 VILQGK 892


>M820_ARATH (P92520) Hypothetical mitochondrial protein AtMg00820
           (ORF170)
          Length = 170

 Score = 99.8 bits (247), Expect = 4e-20
 Identities = 49/115 (42%), Positives = 73/115 (62%), Gaps = 6/115 (5%)

Query: 661 PEY---VAYTIEEDPSSIKEALSSIDADLWQEAINDEMDSLMSNETWHLTDLPPGCKTIG 717
           P+Y   +  TI+++P S+  AL       W +A+ +E+D+L  N+TW L   P     +G
Sbjct: 14  PKYSLTITTTIKKEPKSVIFALKDPG---WCQAMQEELDALSRNKTWILVPPPVNQNILG 70

Query: 718 CKWILKKKLKPDGSIDKYKARLVAKGFRQRENVDFFDTYSPVTRITSIRVLISLA 772
           CKW+ K KL  DG++D+ KARLVAKGF Q E + F +TYSPV R  +IR ++++A
Sbjct: 71  CKWVFKTKLHSDGTLDRLKARLVAKGFHQEEGIYFVETYSPVVRTATIRTILNVA 125


>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
            transposon 412 [Contains: Protease (EC 3.4.23.-); Reverse
            transcriptase (EC 2.7.7.49); Endonuclease]
          Length = 1237

 Score = 61.2 bits (147), Expect = 2e-08
 Identities = 52/193 (26%), Positives = 79/193 (39%), Gaps = 13/193 (6%)

Query: 357  KCQFCSQAKINKESHKSVTRITEPFELIHSDLCELDGNLTR--NGKRYFITFIDDCSDYT 414
            KCQ C +AK  K +   +T    P       + +  G L +  NG  Y +T I D + Y 
Sbjct: 938  KCQKCQKAKTTKHTKTPMTITETPEHAFDRVVVDTIGPLPKSENGNEYAVTLICDLTKYL 997

Query: 415  HVYLMRNKNEALDIFKQYVKEIENQFNIR---IKRFRSDRGTEYGSHIFNEYYKELGIIH 471
                + NK+      K   K I   F ++   +K F +D GTEY + I  +  K L I +
Sbjct: 998  VAIPIANKSA-----KTVAKAIFESFILKYGPMKTFITDMGTEYKNSIITDLCKYLKIKN 1052

Query: 472  ETTAPYSPEMNGKAERKNRTFTELVVATMLNSGAAPHWWGEILLTVCYVLNRVPKTKNKI 531
             T+  +  +  G  ER +RT  E + + +         W   L    Y  N      +  
Sbjct: 1053 ITSTAHHHQTVGVVERSHRTLNEYIRSYISTDKTD---WDVWLQYFVYCFNTTQSMVHNY 1109

Query: 532  SPYEILKKRQPNL 544
             PYE++  R  NL
Sbjct: 1110 CPYELVFGRTSNL 1122


>POL_MLVFF (P26809) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
            Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
            3.1.26.4) (RT); Integrase (IN)]
          Length = 1204

 Score = 60.8 bits (146), Expect = 2e-08
 Identities = 54/211 (25%), Positives = 89/211 (41%), Gaps = 16/211 (7%)

Query: 356  EKCQFCSQAKINKESHKSVTRITEPFELIHSDLCELDGNLTRNGKRYFITFIDDCSDYTH 415
            E CQ C+Q   +K + K  TR+       H ++   +      G +Y + FID  S +  
Sbjct: 888  ETCQACAQVNASKSAVKQGTRVRGHRPGTHWEIDFTEVKPGLYGYKYLLVFIDTFSGWVE 947

Query: 416  VYLMRNKNEALDIFKQYVKEIENQFNIRIKRFRSDRGTEYGSHIFNEYYKELGIIHETTA 475
             +  + K  A  + K+ ++EI  +F +  +   +D G  + S +       LG+  +   
Sbjct: 948  AFPTK-KETAKVVTKKLLEEIFPRFGMP-QVLGTDNGPAFVSKVSQTVADLLGVDWKLHC 1005

Query: 476  PYSPEMNGKAERKNRTFTELVVATMLNSGAAPHWWGEILLTVCYVLNRVPKTKNKISPYE 535
             Y P+ +G+ ER NRT  E +    L +G+    W  +L    Y     P   + ++PYE
Sbjct: 1006 AYRPQSSGQVERMNRTIKETLTKLTLATGSRD--WVLLLPLALYRARNTP-GPHGLTPYE 1062

Query: 536  ILKKRQPNLSYFRTWGCLAYVRKPDPKRVKL 566
            IL    P L  F           PDP   K+
Sbjct: 1063 ILYGAPPPLVNF-----------PDPDMAKV 1082


>POL_MLVMO (P03355) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
            Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
            3.1.26.4) (RT); Integrase (IN)]
          Length = 1199

 Score = 59.3 bits (142), Expect = 6e-08
 Identities = 54/197 (27%), Positives = 86/197 (43%), Gaps = 9/197 (4%)

Query: 353  NDFEKCQFCSQAKINKESHKSVTRIT--EPFELIHSDLCELDGNLTRNGKRYFITFIDDC 410
            N  E C+ C+Q   +K + K  TR+    P      D  E+   L   G +Y + FID  
Sbjct: 880  NITETCKACAQVNASKSAVKQGTRVRGHRPGTHWEIDFTEIKPGLY--GYKYLLVFIDTF 937

Query: 411  SDYTHVYLMRNKNEALDIFKQYVKEIENQFNIRIKRFRSDRGTEYGSHIFNEYYKELGII 470
            S +   +  + K  A  + K+ ++EI  +F +  +   +D G  + S +       LGI 
Sbjct: 938  SGWIEAFPTK-KETAKVVTKKLLEEIFPRFGMP-QVLGTDNGPAFVSKVSQTVADLLGID 995

Query: 471  HETTAPYSPEMNGKAERKNRTFTELVVATMLNSGAAPHWWGEILLTVCYVLNRVPKTKNK 530
             +    Y P+ +G+ ER NRT  E +    L +G+    W  +L    Y     P   + 
Sbjct: 996  WKLHCAYRPQSSGQVERMNRTIKETLTKLTLATGSRD--WVLLLPLALYRARNTP-GPHG 1052

Query: 531  ISPYEILKKRQPNLSYF 547
            ++PYEIL    P L  F
Sbjct: 1053 LTPYEILYGAPPPLVNF 1069


>POL3_MOUSE (P11367) Retrovirus-related Pol polyprotein
           (Endonuclease) (Fragment)
          Length = 390

 Score = 59.3 bits (142), Expect = 6e-08
 Identities = 48/192 (25%), Positives = 84/192 (43%), Gaps = 5/192 (2%)

Query: 356 EKCQFCSQAKINKESHKSVTRITEPFELIHSDLCELDGNLTRNGKRYFITFIDDCSDYTH 415
           E CQ C Q   +K   ++ TR+       H ++   +      G +Y + F+D  S +  
Sbjct: 92  ESCQACVQVNASKTKIRAGTRVRGHRLGTHWEIDFTEVKPGLYGYKYLLVFVDTFSGWVE 151

Query: 416 VYLMRNKNEALDIFKQYVKEIENQFNIRIKRFRSDRGTEYGSHIFNEYYKELGIIHETTA 475
            +  +++   + + K+ ++EI  +F +  +   +D G  + S +     K LGI  +   
Sbjct: 152 AFPTKHETAKI-VTKKLLEEIFPRFGMP-QVLGTDNGPAFVSQVSQSVAKLLGIDWKLHC 209

Query: 476 PYSPEMNGKAERKNRTFTELVVATMLNSGAAPHWWGEILLTVCYVLNRVPKTKNKISPYE 535
            Y P+ +G+ ER NRT  E +    L +G     W  +L    Y     P   + ++PYE
Sbjct: 210 AYRPQSSGQVERMNRTIKETLTKLTLATGTRD--WVLLLPLALYRARNTP-GPHGLTPYE 266

Query: 536 ILKKRQPNLSYF 547
           IL    P L  F
Sbjct: 267 ILYGAPPPLVNF 278


>POL_MLVFP (P26808) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
            Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
            3.1.26.4) (RT); Integrase (IN)]
          Length = 1204

 Score = 58.9 bits (141), Expect = 8e-08
 Identities = 52/211 (24%), Positives = 89/211 (41%), Gaps = 16/211 (7%)

Query: 356  EKCQFCSQAKINKESHKSVTRITEPFELIHSDLCELDGNLTRNGKRYFITFIDDCSDYTH 415
            E C+ C+Q   +K + K  TR+       H ++   +      G +Y + F+D  S +  
Sbjct: 888  ETCKACAQVNASKSAVKQGTRVRGHRPGTHWEIDFTEVKPGLYGYKYLLVFVDTFSGWVE 947

Query: 416  VYLMRNKNEALDIFKQYVKEIENQFNIRIKRFRSDRGTEYGSHIFNEYYKELGIIHETTA 475
             +  + K  A  + K+ ++EI  +F +  +   +D G  + S +       LG+  +   
Sbjct: 948  AFPTK-KETAKVVTKKLLEEIFPRFGMP-QVLGTDNGPAFVSKVSQTVADLLGVDWKLHC 1005

Query: 476  PYSPEMNGKAERKNRTFTELVVATMLNSGAAPHWWGEILLTVCYVLNRVPKTKNKISPYE 535
             Y P+ +G+ ER NRT  E +    L +G+    W  +L    Y     P   + ++PYE
Sbjct: 1006 AYRPQSSGQVERMNRTIKETLTKLTLATGSRD--WVLLLPLALYRARNTP-GPHGLTPYE 1062

Query: 536  ILKKRQPNLSYFRTWGCLAYVRKPDPKRVKL 566
            IL    P L  F           PDP   K+
Sbjct: 1063 ILYGAPPPLVNF-----------PDPDMAKV 1082


>POL_MLVF5 (P26810) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
            Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
            3.1.26.4) (RT); Integrase (IN)]
          Length = 1204

 Score = 58.9 bits (141), Expect = 8e-08
 Identities = 52/211 (24%), Positives = 89/211 (41%), Gaps = 16/211 (7%)

Query: 356  EKCQFCSQAKINKESHKSVTRITEPFELIHSDLCELDGNLTRNGKRYFITFIDDCSDYTH 415
            E C+ C+Q   +K + K  TR+       H ++   +      G +Y + F+D  S +  
Sbjct: 888  ETCKACAQVNASKSAVKQGTRVRGHRPGTHWEIDFTEVKPGLYGYKYLLVFVDTFSGWVE 947

Query: 416  VYLMRNKNEALDIFKQYVKEIENQFNIRIKRFRSDRGTEYGSHIFNEYYKELGIIHETTA 475
             +  + K  A  + K+ ++EI  +F +  +   +D G  + S +       LG+  +   
Sbjct: 948  AFPTK-KETAKVVTKKLLEEIFPRFGMP-QVLGTDNGPAFVSKVSQTVADLLGVDWKLHC 1005

Query: 476  PYSPEMNGKAERKNRTFTELVVATMLNSGAAPHWWGEILLTVCYVLNRVPKTKNKISPYE 535
             Y P+ +G+ ER NRT  E +    L +G+    W  +L    Y     P   + ++PYE
Sbjct: 1006 AYRPQSSGQVERMNRTIKETLTKLTLATGSRD--WVLLLPLALYRARNTP-GPHGLTPYE 1062

Query: 536  ILKKRQPNLSYFRTWGCLAYVRKPDPKRVKL 566
            IL    P L  F           PDP   K+
Sbjct: 1063 ILYGAPPPLVNF-----------PDPDMAKV 1082


>POL_FENV1 (P31792) Pol polyprotein [Contains: Reverse
           transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
           (RT); Integrase (IN)] (Fragment)
          Length = 1046

 Score = 58.2 bits (139), Expect = 1e-07
 Identities = 53/192 (27%), Positives = 82/192 (42%), Gaps = 13/192 (6%)

Query: 358 CQFCSQ--AKINKESHKSVTRITEPFELIHSDLCELDGNLTRNGKRYFITFIDDCSDYTH 415
           C+ C Q  A   +      TR   P      D  E+  +    G +Y + F+D  S +  
Sbjct: 737 CKVCQQVNAGATRVPEGKRTRGNRPGVYWEIDFTEVKPHYA--GYKYLLVFVDTFSGWVE 794

Query: 416 VYLMRNKNEALDIFKQYVKEIENQFNIRIKRFRSDRGTEYGSHIFNEYYKELGIIHETTA 475
            Y  R +   + + K+ ++EI  +F +  K   SD G  + S +     + LGI  +   
Sbjct: 795 AYPTRQETAHM-VAKKILEEIFPRFGLP-KVIGSDNGPAFVSQVSQGLARTLGINWKLHC 852

Query: 476 PYSPEMNGKAERKNRTFTELVVATMLNSGAAPHWWGEILLTVCYVLNRVPKTKNK--ISP 533
            Y P+ +G+ ER NRT  E +    L +G     W  +L      L R   T N+  ++P
Sbjct: 853 AYRPQSSGQVERMNRTIKETLTKLTLETGLKD--WRRLL---SLALLRARNTPNRFGLTP 907

Query: 534 YEILKKRQPNLS 545
           YEIL    P LS
Sbjct: 908 YEILYGGPPPLS 919


  Database: sprot
    Posted date:  Nov 25, 2004 10:54 AM
  Number of letters in database: 59,974,054
  Number of sequences in database:  164,201
  
Lambda     K      H
   0.318    0.134    0.400 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 143,125,711
Number of Sequences: 164201
Number of extensions: 6371422
Number of successful extensions: 17607
Number of sequences better than 10.0: 94
Number of HSP's better than 10.0 without gapping: 62
Number of HSP's successfully gapped in prelim test: 32
Number of HSP's that attempted gapping in prelim test: 17417
Number of HSP's gapped (non-prelim): 139
length of query: 1185
length of database: 59,974,054
effective HSP length: 121
effective length of query: 1064
effective length of database: 40,105,733
effective search space: 42672499912
effective search space used: 42672499912
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 72 (32.3 bits)


Medicago: description of AC147000.7