Lotus
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= TM0151.11
         (1562 letters)

Database: sprot 
           164,201 sequences; 59,974,054 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

POLX_TOBAC (P10978) Retrovirus-related Pol polyprotein from tran...   462  e-129
COPI_DROME (P04146) Copia protein (Gag-int-pol protein) [Contain...   301  7e-81
YCH4_YEAST (P25600) Transposon Ty5-1 34.5 kDa hypothetical protein    159  4e-38
M810_ARATH (P92519) Hypothetical mitochondrial protein AtMg00810...   150  3e-35
YCB9_YEAST (P25384) Transposon Ty2 protein B (Ty1-17 protein B)       111  1e-23
YJZ9_YEAST (P47100) Transposon Ty1 protein B                          105  1e-21
YMU0_YEAST (Q04670) Transposon Ty1 protein B                          102  8e-21
YJL3_YEAST (P47024) Transposon Ty4 207.7 kDa hypothetical protein     102  8e-21
YMT5_YEAST (Q04214) Transposon Ty1 protein B                          101  1e-20
YME4_YEAST (Q04711) Transposon Ty1 protein B                          101  1e-20
YMD9_YEAST (Q03434) Transposon Ty1 protein B                          101  2e-20
YJZ7_YEAST (P47098) Transposon Ty1 protein B                           99  7e-20
M820_ARATH (P92520) Hypothetical mitochondrial protein AtMg00820...    99  9e-20
M240_ARATH (P93290) Hypothetical mitochondrial protein AtMg00240...    66  7e-10
POL_BAEVM (P10272) Pol polyprotein [Contains: Protease (EC 3.4.2...    61  2e-08
M710_ARATH (P92512) Hypothetical mitochondrial protein AtMg00710...    58  2e-07
POL_FENV1 (P31792) Pol polyprotein [Contains: Reverse transcript...    55  1e-06
POL_MLVAV (P03356) Pol polyprotein [Contains: Protease (EC 3.4.2...    52  1e-05
ATRX_CAEEL (Q9U7E0) Transcriptional regulator ATRX homolog (X-li...    52  1e-05
POL_MLVMO (P03355) Pol polyprotein [Contains: Protease (EC 3.4.2...    51  3e-05
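
The hit table above can be read programmatically. The following is a minimal sketch, not part of the BLAST output, and it assumes each row has the layout shown here: entry name, accession in parentheses, a free-text description, then the bit score and E-value as the last two fields. The names `HIT_ROW`, `parse_evalue` and `parse_hit_row` are hypothetical helpers introduced only for illustration.

```python
import re

# One row of the "Sequences producing significant alignments" table.
# Assumed layout: NAME (ACCESSION) description ... BITS EVALUE
HIT_ROW = re.compile(
    r"^(?P<name>\S+)\s+\((?P<acc>[^)]+)\)\s+(?P<desc>.*?)\s+"
    r"(?P<bits>\d+(?:\.\d+)?)\s+(?P<evalue>\S+)\s*$"
)


def parse_evalue(text):
    """Convert a BLAST E-value string to float.

    This report abbreviates 1e-129 as "e-129", so a leading "1" is
    restored when the mantissa is missing.
    """
    return float("1" + text if text.startswith("e") else text)


def parse_hit_row(line):
    """Return a dict for one summary row, or None if the line does not match."""
    m = HIT_ROW.match(line)
    if m is None:
        return None
    return {
        "name": m.group("name"),
        "accession": m.group("acc"),
        "description": m.group("desc"),
        "bits": float(m.group("bits")),
        "evalue": parse_evalue(m.group("evalue")),
    }


if __name__ == "__main__":
    row = "YCH4_YEAST (P25600) Transposon Ty5-1 34.5 kDa hypothetical protein    159  4e-38"
    print(parse_hit_row(row))
```

The description is matched lazily and the score and E-value are anchored to the end of the line, so numbers inside a description (such as "34.5 kDa") are not mistaken for the score.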

>POLX_TOBAC (P10978) Retrovirus-related Pol polyprotein from
            transposon TNT 1-94 [Contains: Protease (EC 3.4.23.-);
            Reverse transcriptase (EC 2.7.7.49); Endonuclease]
          Length = 1328

 Score =  462 bits (1190), Expect = e-129
 Identities = 307/950 (32%), Positives = 493/950 (51%), Gaps = 54/950 (5%)

Query: 609  WYLDSGCSRHMTGESRMFQELKLKPGGEVGFGGNEKGKIVGTGTICVDSSP----CIDNV 664
            W +D+  S H T    +F        G V  G     KI G G IC+ ++      + +V
Sbjct: 294  WVVDTAASHHATPVRDLFCRYVAGDFGTVKMGNTSYSKIAGIGDICIKTNVGCTLVLKDV 353

Query: 665  LLVDGLTHNLLSISQLADKGYDVIFNQKSCRAVSQIDGS-VLFNSKRKNNIYKIR--LSE 721
              V  L  NL+S   L   GY+  F  +  R      GS V+     +  +Y+    + +
Sbjct: 354  RHVPDLRMNLISGIALDRDGYESYFANQKWRLTK---GSLVIAKGVARGTLYRTNAEICQ 410

Query: 722  LEAQNVKCLLSVNEEQWVWHRRLGHASMRKISQLSKLNLVRGLPNLKFASDALCEACQKG 781
             E    +  +SV+    +WH+R+GH S + +  L+K +L+      K  +   C+ C  G
Sbjct: 411  GELNAAQDEISVD----LWHKRMGHMSEKGLQILAKKSLIS---YAKGTTVKPCDYCLFG 463

Query: 782  KFTKVPFKAKNVVSTSRPLELLHIDLFGPVKTESIGGKRYGMVIVDDYSRWTWVKFLTRK 841
            K  +V F+  +    +  L+L++ D+ GP++ ES+GG +Y +  +DD SR  WV  L  K
Sbjct: 464  KQHRVSFQTSSERKLNI-LDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTK 522

Query: 842  DESHVVFSTFIAQVQNEKACRIVRVRSDHGGE-----FESLFDSYGIAHDFSCPRTPQQN 896
            D+   VF  F A V+ E   ++ R+RSD+GGE     FE    S+GI H+ + P TPQ N
Sbjct: 523  DQVFQVFQKFHALVERETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQHN 582

Query: 897  GVVERKNRTLQEMARTMLQETGMAKHFLAEAVNTACYIQNRISVRPILNKTPYELWKNIK 956
            GV ER NRT+ E  R+ML+   + K F  EAV TACY+ NR    P+  + P  +W N +
Sbjct: 583  GVAERMNRTIVEKVRSMLRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWTNKE 642

Query: 957  PNISYFHPFGCVCYVLNTKDRLHKFDAKSSKCLLLGYSERSKGFRFYNTDAKTIEESIHV 1016
             + S+   FGC  +    K++  K D KS  C+ +GY +   G+R ++   K +  S  V
Sbjct: 643  VSYSHLKVFGCRAFAHVPKEQRTKLDDKSIPCIFIGYGDEEFGYRLWDPVKKKVIRSRDV 702

Query: 1017 RFDD---KLDSDQSKLVEK-FADLSINVSDKGKAPEEAEPEEDEPEEEAGP-----SDSQ 1067
             F +   +  +D S+ V+       + +      P  AE   DE  E+           +
Sbjct: 703  VFRESEVRTAADMSEKVKNGIIPNFVTIPSTSNNPTSAESTTDEVSEQGEQPGEVIEQGE 762

Query: 1068 TLKKSRITAAHPKELILGNKDEPVRTRSAFRPYEETLLSLKGLVSLI----EPKSIDEAL 1123
             L +      HP +     +++    R + RP  E+         LI    EP+S+ E L
Sbjct: 763  QLDEGVEEVEHPTQ----GEEQHQPLRRSERPRVESRRYPSTEYVLISDDREPESLKEVL 818

Query: 1124 Q--DKDWIL-AMEEELNQFSKNDVWSLVKKPESVHVIGTKWVFRNKLNEKGDVVRNKARL 1180
               +K+ ++ AM+EE+    KN  + LV+ P+    +  KWVF+ K +    +VR KARL
Sbjct: 819  SHPEKNQLMKAMQEEMESLQKNGTYKLVELPKGKRPLKCKWVFKLKKDGDCKLVRYKARL 878

Query: 1181 VAQGYSQQEGIDYTETFAPVARLEAIRLLISFSVNHNIVLHQMDVKSAFLNGYISEEVYV 1240
            V +G+ Q++GID+ E F+PV ++ +IR ++S + + ++ + Q+DVK+AFL+G + EE+Y+
Sbjct: 879  VVKGFEQKKGIDFDEIFSPVVKMTSIRTILSLAASLDLEVEQLDVKTAFLHGDLEEEIYM 938

Query: 1241 HQPLGFEDEKKPDHVFKLKKSLYGLKQAPRAWYERLSSFLLENEFVRGKVDTTLFCKTY- 1299
             QP GFE   K   V KL KSLYGLKQAPR WY +  SF+    +++   D  ++ K + 
Sbjct: 939  EQPEGFEVAGKKHMVCKLNKSLYGLKQAPRQWYMKFDSFMKSQTYLKTYSDPCVYFKRFS 998

Query: 1300 KDDILIVQIYVDDIIFGSANQSLCKEFSEMMQAEFEMSMMGELKYFLGIQV--DQTPEGT 1357
            +++ +I+ +YVDD++    ++ L  +    +   F+M  +G  +  LG+++  ++T    
Sbjct: 999  ENNFIILLLYVDDMLIVGKDKGLIAKLKGDLSKSFDMKDLGPAQQILGMKIVRERTSRKL 1058

Query: 1358 YIHQSKYTKELLKKFNMLESTVAKTPMHPTCILEKE------DKSGKVCQKLYRGMIGSL 1411
            ++ Q KY + +L++FNM  +    TP+     L K+      ++ G + +  Y   +GSL
Sbjct: 1059 WLSQEKYIERVLERFNMKNAKPVSTPLAGHLKLSKKMCPTTVEEKGNMAKVPYSSAVGSL 1118

Query: 1412 LY-LTASRPDILFSVHLCARFQSDPRETHLTAIKRILRYLKGTTNLGLMYKKTSEYKLSG 1470
            +Y +  +RPDI  +V + +RF  +P + H  A+K ILRYL+GTT   L +   S+  L G
Sbjct: 1119 MYAMVCTRPDIAHAVGVVSRFLENPGKEHWEAVKWILRYLRGTTGDCLCF-GGSDPILKG 1177

Query: 1471 YCDAHYAGDRTERKSTSGNCQFLGSNLVSWASKRQSTIALSTAEAEYISA 1520
            Y DA  AGD   RKS++G         +SW SK Q  +ALST EAEYI+A
Sbjct: 1178 YTDADMAGDIDNRKSSTGYLFTFSGGAISWQSKLQKCVALSTTEAEYIAA 1227



 Score = 53.9 bits (128), Expect = 3e-06
 Identities = 68/306 (22%), Positives = 120/306 (38%), Gaps = 60/306 (19%)

Query: 17  GQRFEYWKDRMESFFLGFDADLWDIIVD-GYERPVDADGKKIPRSEMTADQKKLYSQHHK 75
           G ++E  K   ++ F  +   + D+++  G  + +D D KK P +    D   L     +
Sbjct: 3   GVKYEVAKFNGDNGFSTWQRRMRDLLIQQGLHKVLDVDSKK-PDTMKAEDWADL---DER 58

Query: 76  ARAILLSAISYEEYQKITDREFAKGIF---ESLKMSHEGNKKVKESKALSLIQKYESFIM 132
           A + +   +S +    I D + A+GI+   ESL MS     K+   K L  +       M
Sbjct: 59  AASAIRLHLSDDVVNNIIDEDTARGIWTRLESLYMSKTLTNKLYLKKQLYALH------M 112

Query: 133 EPNESIEEMFSRFQLLVAGIRPLNKSYTTKDHVIRVIRCLPESWMPLVTSIELTRDVENM 192
               +     + F  L+  +  L      +D  I ++  LP S+  L T+I         
Sbjct: 113 SEGTNFLSHLNVFNGLITQLANLGVKIEEEDKAILLLNSLPSSYDNLATTI--------- 163

Query: 193 SLEELISILKCHELKRSEMQDLRKKSIALKSKSEKAKAEKSKALQAEEEESEEASEDSDE 252
                      H     E++D+   ++ L  K  K    + +AL                
Sbjct: 164 ----------LHGKTTIELKDVTS-ALLLNEKMRKKPENQGQAL---------------- 196

Query: 253 DELTLISKRLNRIWKHRQSKYKGSGKAKGKSESSGQKKSSLKEVTCFECKESGHYKSDCP 312
                I++   R ++   + Y  SG A+GKS++    +S  +   C+ C + GH+K DCP
Sbjct: 197 -----ITEGRGRSYQRSSNNYGRSG-ARGKSKN----RSKSRVRNCYNCNQPGHFKRDCP 246

Query: 313 KLKKDK 318
             +K K
Sbjct: 247 NPRKGK 252


>COPI_DROME (P04146) Copia protein (Gag-int-pol protein) [Contains:
            Copia VLP protein; Copia protease (EC 3.4.23.-)]
          Length = 1409

 Score =  301 bits (772), Expect = 7e-81
 Identities = 178/506 (35%), Positives = 284/506 (55%), Gaps = 31/506 (6%)

Query: 1041 SDKGKAPEEAEPEEDEPEEEAGPSDSQTLKKSRITAAHPKEL------------ILGNKD 1088
            S+K    E  + + D+   E+  S +    +   TA H KE+            I+  + 
Sbjct: 799  SNKYFLNESKKRKRDDHLNESKGSGNPNESRESETAEHLKEIGIDNPTKNDGIEIINRRS 858

Query: 1089 EPVRTRSAFRPYEETLLSLKGLVSLIE------PKSIDEALQDKD---WILAMEEELNQF 1139
            E ++T+     Y E   SL  +V          P S DE     D   W  A+  ELN  
Sbjct: 859  ERLKTKPQIS-YNEEDNSLNKVVLNAHTIFNDVPNSFDEIQYRDDKSSWEEAINTELNAH 917

Query: 1140 SKNDVWSLVKKPESVHVIGTKWVFRNKLNEKGDVVRNKARLVAQGYSQQEGIDYTETFAP 1199
              N+ W++ K+PE+ +++ ++WVF  K NE G+ +R KARLVA+G++Q+  IDY ETFAP
Sbjct: 918  KINNTWTITKRPENKNIVDSRWVFSVKYNELGNPIRYKARLVARGFTQKYQIDYEETFAP 977

Query: 1200 VARLEAIRLLISFSVNHNIVLHQMDVKSAFLNGYISEEVYVHQPLGFEDEKKPDHVFKLK 1259
            VAR+ + R ++S  + +N+ +HQMDVK+AFLNG + EE+Y+  P G       D+V KL 
Sbjct: 978  VARISSFRFILSLVIQYNLKVHQMDVKTAFLNGTLKEEIYMRLPQGISCNS--DNVCKLN 1035

Query: 1260 KSLYGLKQAPRAWYERLSSFLLENEFVRGKVDTTLFC--KTYKDDILIVQIYVDDIIFGS 1317
            K++YGLKQA R W+E     L E EFV   VD  ++   K   ++ + V +YVDD++  +
Sbjct: 1036 KAIYGLKQAARCWFEVFEQALKECEFVNSSVDRCIYILDKGNINENIYVLLYVDDVVIAT 1095

Query: 1318 ANQSLCKEFSEMMQAEFEMSMMGELKYFLGIQVDQTPEGTYIHQSKYTKELLKKFNMLES 1377
             + +    F   +  +F M+ + E+K+F+GI+++   +  Y+ QS Y K++L KFNM   
Sbjct: 1096 GDMTRMNNFKRYLMEKFRMTDLNEIKHFIGIRIEMQEDKIYLSQSAYVKKILSKFNMENC 1155

Query: 1378 TVAKTPMHPTCILEKEDKSGKVCQKLYRGMIGSLLY-LTASRPDILFSVHLCARFQSDPR 1436
                TP+ P+ I  +   S + C    R +IG L+Y +  +RPD+  +V++ +R+ S   
Sbjct: 1156 NAVSTPL-PSKINYELLNSDEDCNTPCRSLIGCLMYIMLCTRPDLTTAVNILSRYSSKNN 1214

Query: 1437 ETHLTAIKRILRYLKGTTNLGLMYKK--TSEYKLSGYCDAHYAGDRTERKSTSGNC-QFL 1493
                  +KR+LRYLKGT ++ L++KK    E K+ GY D+ +AG   +RKST+G   +  
Sbjct: 1215 SELWQNLKRVLRYLKGTIDMKLIFKKNLAFENKIIGYVDSDWAGSEIDRKSTTGYLFKMF 1274

Query: 1494 GSNLVSWASKRQSTIALSTAEAEYIS 1519
              NL+ W +KRQ+++A S+ EAEY++
Sbjct: 1275 DFNLICWNTKRQNSVAASSTEAEYMA 1300



 Score =  172 bits (437), Expect = 5e-42
 Identities = 123/411 (29%), Positives = 210/411 (50%), Gaps = 29/411 (7%)

Query: 611  LDSGCSRHMTGESRMFQE-LKLKPGGEVGFGGNEKGKIV-----GTGTICVDSSPCIDNV 664
            LDSG S H+  +  ++ + +++ P  ++     ++G+ +     G   +  D    +++V
Sbjct: 291  LDSGASDHLINDESLYTDSVEVVPPLKIAVA--KQGEFIYATKRGIVRLRNDHEITLEDV 348

Query: 665  LLVDGLTHNLLSISQLADKGYDVIFNQKSCRAVSQIDGSVLFNSKRKNNIYKIRLSELEA 724
            L       NL+S+ +L + G  + F+ KS   +S+    V+ NS   NN+  I     +A
Sbjct: 349  LFCKEAAGNLMSVKRLQEAGMSIEFD-KSGVTISKNGLMVVKNSGMLNNVPVINF---QA 404

Query: 725  QNVKCLLSVNEEQWVWHRRLGHASMRKISQLSKLNLVRG---LPNLKFASDALCEACQKG 781
             ++      N    +WH R GH S  K+ ++ + N+      L NL+ + + +CE C  G
Sbjct: 405  YSINAKHKNNFR--LWHERFGHISDGKLLEIKRKNMFSDQSLLNNLELSCE-ICEPCLNG 461

Query: 782  KFTKVPFKA-KNVVSTSRPLELLHIDLFGPVKTESIGGKRYGMVIVDDYSRWTWVKFLTR 840
            K  ++PFK  K+     RPL ++H D+ GP+   ++  K Y ++ VD ++ +     +  
Sbjct: 462  KQARLPFKQLKDKTHIKRPLFVVHSDVCGPITPVTLDDKNYFVIFVDQFTHYCVTYLIKY 521

Query: 841  KDESHVVFSTFIAQVQNEKACRIVRVRSDHGGEFES-----LFDSYGIAHDFSCPRTPQQ 895
            K +   +F  F+A+ +     ++V +  D+G E+ S          GI++  + P TPQ 
Sbjct: 522  KSDVFSMFQDFVAKSEAHFNLKVVYLYIDNGREYLSNEMRQFCVKKGISYHLTVPHTPQL 581

Query: 896  NGVVERKNRTLQEMARTMLQETGMAKHFLAEAVNTACYIQNRISVRPIL--NKTPYELWK 953
            NGV ER  RT+ E ARTM+    + K F  EAV TA Y+ NRI  R ++  +KTPYE+W 
Sbjct: 582  NGVSERMIRTITEKARTMVSGAKLDKSFWGEAVLTATYLINRIPSRALVDSSKTPYEMWH 641

Query: 954  NIKPNISYFHPFGCVCYVLNTKDRLHKFDAKSSKCLLLGYSERSKGFRFYN 1004
            N KP + +   FG   YV + K++  KFD KS K + +GY     GF+ ++
Sbjct: 642  NKKPYLKHLRVFGATVYV-HIKNKQGKFDDKSFKSIFVGY--EPNGFKLWD 689



 Score = 43.9 bits (102), Expect = 0.003
 Identities = 69/332 (20%), Positives = 124/332 (36%), Gaps = 71/332 (21%)

Query: 9   NAKPPMFDGQRFEYWKDRMESFFLGFDADLWDIIVDGYERPVDADGKKIPRSEMTADQKK 68
           N KP  FDG+++  WK R+ +     + D+  ++       VD   KK  R         
Sbjct: 7   NIKP--FDGEKYAIWKFRIRALLA--EQDVLKVVDGLMPNEVDDSWKKAERC-------- 54

Query: 69  LYSQHHKARAILLSAISYEEYQKITDREFAKGIFESLKMSHEGNKKVKESKALSLIQKYE 128
                  A++ ++  +S       T    A+ I E+L   +E      +   L+L ++  
Sbjct: 55  -------AKSTIIEYLSDSFLNFATSDITARQILENLDAVYERKSLASQ---LALRKRLL 104

Query: 129 SFIMEPNESIEEMFSRFQLLVAGIRPLNKSYTTKDHVIRVIRCLPESWMPLVTSIELTRD 188
           S  +    S+   F  F  L++ +          D +  ++  LP  +  ++T+IE T  
Sbjct: 105 SLKLSSEMSLLSHFHIFDELISELLAAGAKIEEMDKISHLLITLPSCYDGIITAIE-TLS 163

Query: 189 VENMSLEELISILKCHELK-RSEMQDLRKKSIALKSKSEKAKAEKSKALQAEEEESEEAS 247
            EN++L  + + L   E+K +++  D  KK +               A+      + +  
Sbjct: 164 EENLTLAFVKNRLLDQEIKIKNDHNDTSKKVM--------------NAIVHNNNNTYK-- 207

Query: 248 EDSDEDELTLISKRLNRIWKHRQSKYKGSGKAKGKSESSGQKKSSLKEVTCFECKESGHY 307
                          N ++K+R +K K   K   K            +V C  C   GH 
Sbjct: 208 ---------------NNLFKNRVTKPKKIFKGNSK-----------YKVKCHHCGREGHI 241

Query: 308 KSDCPKLK-----KDKKPKKHFKTKKSLMVTF 334
           K DC   K     K+K+ +K  +T  S  + F
Sbjct: 242 KKDCFHYKRILNNKNKENEKQVQTATSHGIAF 273


>YCH4_YEAST (P25600) Transposon Ty5-1 34.5 kDa hypothetical protein
          Length = 308

 Score =  159 bits (403), Expect = 4e-38
 Identities = 96/309 (31%), Positives = 166/309 (53%), Gaps = 17/309 (5%)

Query: 1223 MDVKSAFLNGYISEEVYVHQPLGFEDEKKPDHVFKLKKSLYGLKQAPRAWYERLSSFLLE 1282
            MDV +AFLN  + E +YV QP GF +E+ PD+V++L   +YGLKQAP  W E +++ L +
Sbjct: 1    MDVDTAFLNSTMDEPIYVKQPPGFVNERNPDYVWELYGGMYGLKQAPLLWNEHINNTLKK 60

Query: 1283 NEFVRGKVDTTLFCKTYKDDILIVQIYVDDIIFGSANQSLCKEFSEMMQAEFEMSMMGEL 1342
              F R + +  L+ ++  D  + + +YVDD++  + +  +     + +   + M  +G++
Sbjct: 61   IGFCRHEGEHGLYFRSTSDGPIYIGVYVDDLLVAAPSPKIYDRVKQELTKLYSMKDLGKV 120

Query: 1343 KYFLGIQVDQTPEG-------TYIHQSKYTKELLKKFNMLESTVAKT-PMHPTCILEKED 1394
              FLG+ + Q+  G        YI ++    E +  F + ++ +  + P+  T     +D
Sbjct: 121  DKFLGLNIHQSTNGDITLSLQDYIAKAASESE-INTFKLTQTPLCNSKPLFETTSPHLKD 179

Query: 1395 KSGKVCQKLYRGMIGSLLY-LTASRPDILFSVHLCARFQSDPRETHLTAIKRILRYLKGT 1453
             +       Y+ ++G LL+     RPDI + V L +RF  +PR  HL + +R+LRYL  T
Sbjct: 180  ITP------YQSIVGQLLFCANTGRPDISYPVSLLSRFLREPRAIHLESARRVLRYLYTT 233

Query: 1454 TNLGLMYKKTSEYKLSGYCDAHYAGDRTERKSTSGNCQFLGSNLVSWASKR-QSTIALST 1512
             ++ L Y+  S+  L+ YCDA +        ST G    L    V+W+SK+ +  I + +
Sbjct: 234  RSMCLKYRSGSQVALTVYCDASHGAIHDLPHSTGGYVTLLAGAPVTWSSKKLKGVIPVPS 293

Query: 1513 AEAEYISAA 1521
             EAEYI+A+
Sbjct: 294  TEAEYITAS 302


>M810_ARATH (P92519) Hypothetical mitochondrial protein AtMg00810
            (ORF240b)
          Length = 240

 Score =  150 bits (378), Expect = 3e-35
 Identities = 74/215 (34%), Positives = 125/215 (57%), Gaps = 1/215 (0%)

Query: 1308 IYVDDIIFGSANQSLCKEFSEMMQAEFEMSMMGELKYFLGIQVDQTPEGTYIHQSKYTKE 1367
            +YVDDI+   ++ +L       + + F M  +G + YFLGIQ+   P G ++ Q+KY ++
Sbjct: 5    LYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTKYAEQ 64

Query: 1368 LLKKFNMLESTVAKTPMHPTCILEKEDKSGKVCQKLYRGMIGSLLYLTASRPDILFSVHL 1427
            +L    ML+     TP+ P  +      +       +R ++G+L YLT +RPDI ++V++
Sbjct: 65   ILNNAGMLDCKPMSTPL-PLKLNSSVSTAKYPDPSDFRSIVGALQYLTLTRPDISYAVNI 123

Query: 1428 CARFQSDPRETHLTAIKRILRYLKGTTNLGLMYKKTSEYKLSGYCDAHYAGDRTERKSTS 1487
              +   +P       +KR+LRY+KGT   GL   K S+  +  +CD+ +AG  + R+ST+
Sbjct: 124  VCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTSTRRSTT 183

Query: 1488 GNCQFLGSNLVSWASKRQSTIALSTAEAEYISAAI 1522
            G C FLG N++SW++KRQ T++ S+ E EY + A+
Sbjct: 184  GFCTFLGCNIISWSAKRQPTVSRSSTETEYRALAL 218


>YCB9_YEAST (P25384) Transposon Ty2 protein B (Ty1-17 protein B)
          Length = 1770

 Score =  111 bits (278), Expect = 1e-23
 Identities = 112/388 (28%), Positives = 179/388 (45%), Gaps = 41/388 (10%)

Query: 670  LTHNLLSISQLADKGYDVIFNQKSCRAVSQIDGSVLFNSKRKNNIYKI--------RLSE 721
            + ++LLS+S+LA++     F + +   + + DG+VL    +  + Y +         +S+
Sbjct: 518  IAYDLLSLSELANQNITACFTRNT---LERSDGTVLAPIVKHGDFYWLSKKYLIPSHISK 574

Query: 722  LEAQNVKCLLSVNEEQW-VWHRRLGHASMRKISQLSKLNLVRGLPNLKF----ASDALCE 776
            L   NV    SVN+  + + HR LGHA+ R I +  K N V  L         AS   C 
Sbjct: 575  LTINNVNKSKSVNKYPYPLIHRMLGHANFRSIQKSLKKNAVTYLKESDIEWSNASTYQCP 634

Query: 777  ACQKGKFTK---VPFKAKNVVSTSRPLELLHIDLFGPVKTESIGGKRYGMVIVDDYSRWT 833
             C  GK TK   V         +  P + LH D+FGPV         Y +   D+ +R+ 
Sbjct: 635  DCLIGKSTKHRHVKGSRLKYQESYEPFQYLHTDIFGPVHHLPKSAPSYFISFTDEKTRFQ 694

Query: 834  WVKFL-TRKDESHV-VFSTFIAQVQNEKACRIVRVRSDHGGEFES-----LFDSYGIAHD 886
            WV  L  R++ES + VF++ +A ++N+   R++ ++ D G E+ +      F + GI   
Sbjct: 695  WVYPLHDRREESILNVFTSILAFIKNQFNARVLVIQMDRGSEYTNKTLHKFFTNRGITAC 754

Query: 887  FSCPRTPQQNGVVERKNRTLQEMARTMLQETGMAKHFLAEAVNTACYIQNRISVRPILNK 946
            ++     + +GV ER NRTL    RT+L  +G+  H    AV  +  I+N + V P  +K
Sbjct: 755  YTTTADSRAHGVAERLNRTLLNDCRTLLHCSGLPNHLWFSAVEFSTIIRNSL-VSPKNDK 813

Query: 947  TPYELWKNIKPNISYFHPFGCVCYVLNTKDRLHKFDAKSSKCLLLGY----SERSKGFRF 1002
            +  +       +I+   PFG    V N     H  D+K     + GY    S  S G+  
Sbjct: 814  SARQHAGLAGLDITTILPFGQPVIVNN-----HNPDSKIHPRGIPGYALHPSRNSYGYII 868

Query: 1003 YNTD-AKTIEESIHVRFDDKLDSDQSKL 1029
            Y     KT++ + +V   DK    QSKL
Sbjct: 869  YLPSLKKTVDTTNYVILQDK----QSKL 892



 Score = 99.8 bits (247), Expect = 5e-20
 Identities = 113/469 (24%), Positives = 222/469 (47%), Gaps = 62/469 (13%)

Query: 1089 EPVRTRSAFRPYEETLLSLKGLVSLIEPKSI---DEAL------QDKD-WILAMEEELNQ 1138
            EP R++         + ++KG+ S+   ++    DEA+      ++KD ++ A  +E++Q
Sbjct: 1220 EPPRSKKRIN----LIAAIKGVKSIKPVRTTLRYDEAITYNKDNKEKDRYVEAYHKEISQ 1275

Query: 1139 FSKNDVWSLVK-----KPESVHVIGTKWVFRNKLNEKGDVVRNKARLVAQGYSQQEGIDY 1193
              K + W   K       +   VI + ++F    N+K D   +KAR VA+G  Q      
Sbjct: 1276 LLKMNTWDTNKYYDRNDIDPKKVINSMFIF----NKKRDGT-HKARFVARGDIQHPDTYD 1330

Query: 1194 TETFAPVARLEAIRLLISFSVNHNIVLHQMDVKSAFLNGYISEEVYVHQP--LGFEDEKK 1251
            ++  +      A+   +S +++++  + Q+D+ SA+L   I EE+Y+  P  LG  D+  
Sbjct: 1331 SDMQSNTVHHYALMTSLSIALDNDYYITQLDISSAYLYADIKEELYIRPPPHLGLNDK-- 1388

Query: 1252 PDHVFKLKKSLYGLKQAPRAWYERLSSFLL---ENEFVRGKVDTTLFCKTYKDDILIVQI 1308
               + +L+KSLYGLKQ+   WYE + S+L+   + + VRG      +   +K+  + + +
Sbjct: 1389 ---LLRLRKSLYGLKQSGANWYETIKSYLINCCDMQEVRG------WSCVFKNSQVTICL 1439

Query: 1309 YVDDIIFGSANQSLCKEFSEMMQAEFEMSMM------GELKY-FLGIQVD-QTPEGTYIH 1360
            +VDD+I  S + +  K+    ++ +++  ++       E++Y  LG+++  Q  +   + 
Sbjct: 1440 FVDDMILFSKDLNANKKIITTLKKQYDTKIINLGESDNEIQYDILGLEIKYQRSKYMKLG 1499

Query: 1361 QSKYTKELLKKFNMLESTVAK---TPMHPTCILEKED---KSGKVCQKLY--RGMIGSLL 1412
              K   E L K N+  +   K    P  P   +++++      +  +K++  + +IG   
Sbjct: 1500 MEKSLTEKLPKLNVPLNPKGKKLRAPGQPGHYIDQDELEIDEDEYKEKVHEMQKLIGLAS 1559

Query: 1413 YLTAS-RPDILFSVHLCARFQSDPRETHLTAIKRILRYLKGTTNLGLMYKKTS----EYK 1467
            Y+    R D+L+ ++  A+    P    L     +++++  T +  L++ K      + K
Sbjct: 1560 YVGYKFRFDLLYYINTLAQHILFPSRQVLDMTYELIQFMWDTRDKQLIWHKNKPTKPDNK 1619

Query: 1468 LSGYCDAHYAGDRTERKSTSGNCQFLGSNLVSWASKRQSTIALSTAEAE 1516
            L    DA Y G++   KS  GN   L   ++   S + S    ST EAE
Sbjct: 1620 LVAISDASY-GNQPYYKSQIGNIFLLNGKVIGGKSTKASLTCTSTTEAE 1667


>YJZ9_YEAST (P47100) Transposon Ty1 protein B
          Length = 1755

 Score =  105 bits (261), Expect = 1e-21
 Identities = 109/425 (25%), Positives = 196/425 (45%), Gaps = 50/425 (11%)

Query: 1124 QDKDWILAMEEELNQFSKNDVWSLV-----KKPESVHVIGTKWVFRNKLNEKGDVVRNKA 1178
            + + +I A  +E+NQ  K   W        K+ +   VI + ++F    N+K D   +KA
Sbjct: 1246 EKEKYIEAYHKEVNQLLKMKTWDTDEYYDRKEIDPKRVINSMFIF----NKKRDGT-HKA 1300

Query: 1179 RLVAQGYSQQEGIDYTETFAPVARLEAIRLLISFSVNHNIVLHQMDVKSAFLNGYISEEV 1238
            R VA+G  Q      +   +      A+   +S ++++N  + Q+D+ SA+L   I EE+
Sbjct: 1301 RFVARGDIQHPDTYDSGMQSNTVHHYALMTSLSLALDNNYYITQLDISSAYLYADIKEEL 1360

Query: 1239 YVHQP--LGFEDEKKPDHVFKLKKSLYGLKQAPRAWYERLSSFLLEN---EFVRGKVDTT 1293
            Y+  P  LG  D+     + +LKKSLYGLKQ+   WYE + S+L++    E VRG     
Sbjct: 1361 YIRPPPHLGMNDK-----LIRLKKSLYGLKQSGANWYETIKSYLIQQCGMEEVRG----- 1410

Query: 1294 LFCKTYKDDILIVQIYVDDIIFGSANQSLCKEFSEMMQAEFEMSMMG------ELKY-FL 1346
             +   +K+  + + ++VDD++  S N +  K   E ++ +++  ++       E++Y  L
Sbjct: 1411 -WSCVFKNSQVTICLFVDDMVLFSKNLNSNKRIIEKLKMQYDTKIINLGESDEEIQYDIL 1469

Query: 1347 GIQVDQTPEGTY--IHQSKYTKELLKKFNM---LESTVAKTPMHPTCI-----LEKEDKS 1396
            G+++ +   G Y  +       E + K N+    +      P  P        LE E+  
Sbjct: 1470 GLEI-KYQRGKYMKLGMENSLTEKIPKLNVPLNPKGRKLSAPGQPGLYIDQQELELEEDD 1528

Query: 1397 GKVCQKLYRGMIGSLLYLTAS-RPDILFSVHLCARFQSDPRETHLTAIKRILRYLKGTTN 1455
             K+     + +IG   Y+    R D+L+ ++  A+    P +  L     +++++  T +
Sbjct: 1529 YKMKVHEMQKLIGLASYVGYKFRFDLLYYINTLAQHILFPSKQVLDMTYELIQFIWNTRD 1588

Query: 1456 LGLMYKKTSEY----KLSGYCDAHYAGDRTERKSTSGNCQFLGSNLVSWASKRQSTIALS 1511
              L++ K+       KL    DA Y G++   KS  GN   L   ++   S + S    S
Sbjct: 1589 KQLIWHKSKPVKPTNKLVVISDASY-GNQPYYKSQIGNIYLLNGKVIGGKSTKASLTCTS 1647

Query: 1512 TAEAE 1516
            T EAE
Sbjct: 1648 TTEAE 1652



 Score = 90.1 bits (222), Expect = 4e-17
 Identities = 121/538 (22%), Positives = 208/538 (38%), Gaps = 75/538 (13%)

Query: 611  LDSGCSRHMTGESRMFQELKLKPGGEVGFGGNEKGKIVGTGTICV---DSSPCIDNVLLV 667
            LDSG SR +   +         P   V         I   G +     D++     VL  
Sbjct: 460  LDSGASRTLIRSAHHIHSASSNPDINVVDAQKRNIPINAIGDLQFHFQDNTKTSIKVLHT 519

Query: 668  DGLTHNLLSISQLADKGYDVIFNQKSCRAVSQIDGSVLFNSKRKNNIYKI--------RL 719
              + ++LLS+++LA       F +     + + DG+VL    +  + Y +         +
Sbjct: 520  PNIAYDLLSLNELAAVDITACFTKN---VLERSDGTVLAPIVKYGDFYWVSKKYLLPSNI 576

Query: 720  SELEAQNVKCLLSVNEEQWVW-HRRLGHASMRKISQLSKLNLVRGLP----NLKFASDAL 774
            S     NV    S  +  + + HR L HA+ + I    K N +        +   A D  
Sbjct: 577  SVPTINNVHTSESTRKYPYPFIHRMLAHANAQTIRYSLKNNTITYFNESDVDWSSAIDYQ 636

Query: 775  CEACQKGKFTK---VPFKAKNVVSTSRPLELLHIDLFGPVKTESIGGKRYGMVIVDDYSR 831
            C  C  GK TK   +        ++  P + LH D+FGPV         Y +   D+ ++
Sbjct: 637  CPDCLIGKSTKHRHIKGSRLKYQNSYEPFQYLHTDIFGPVHNLPKSAPSYFISFTDETTK 696

Query: 832  WTWVKFLTRKDESHV--VFSTFIAQVQNEKACRIVRVRSDHGGEFES-----LFDSYGIA 884
            + WV  L  + E  +  VF+T +A ++N+    ++ ++ D G E+ +       +  GI 
Sbjct: 697  FRWVYPLHDRREDSILDVFTTILAFIKNQFQASVLVIQMDRGSEYTNRTLHKFLEKNGIT 756

Query: 885  HDFSCPRTPQQNGVVERKNRTLQEMARTMLQETGMAKHFLAEAVNTACYIQNRIS----- 939
              ++     + +GV ER NRTL +  RT LQ +G+  H    A+  +  ++N ++     
Sbjct: 757  PCYTTTADSRAHGVAERLNRTLLDDCRTQLQCSGLPNHLWFSAIEFSTIVRNSLASPKSK 816

Query: 940  -------------VRPILNKTPYELWKNIKPNISYFHPFGCVCYVLNTK----------D 976
                         +  +L      +  +  PN S  HP G   Y L+             
Sbjct: 817  KSARQHAGLAGLDISTLLPFGQPVIVNDHNPN-SKIHPRGIPGYALHPSRNSYGYIIYLP 875

Query: 977  RLHKFDAKSSKCLLLGYSERSKGFRFYNTDAKTIEESIHVRFDDKLDSDQSKLVEKFADL 1036
             L K    ++  +L G   R   F   N DA T +E ++           S  +++  DL
Sbjct: 876  SLKKTVDTTNYVILQGKESRLDQF---NYDALTFDEDLNRLTASYQSFIASNEIQQSDDL 932

Query: 1037 SINVSDKGKAPEEAEPEED--------EPEEEAGPS----DSQTLKKSRITAAHPKEL 1082
            +I      ++  E  PE+          P +   PS    DS+ + K+ I A  P+E+
Sbjct: 933  NIESDHDFQSDIELHPEQPRNVLSKAVSPTDSTPPSTHTEDSKRVSKTNIRA--PREV 988


>YMU0_YEAST (Q04670) Transposon Ty1 protein B
          Length = 1328

 Score =  102 bits (254), Expect = 8e-21
 Identities = 108/430 (25%), Positives = 198/430 (45%), Gaps = 60/430 (13%)

Query: 1124 QDKDWILAMEEELNQFSKNDVWSLV-----KKPESVHVIGTKWVFRNKLNEKGDVVRNKA 1178
            + + +I A  +E+NQ  K   W        K+ +   VI + ++F  K +       +KA
Sbjct: 819  EKEKYIQAYHKEVNQLLKMKTWDTDRYYDRKEIDPKRVINSMFIFNRKRDGT-----HKA 873

Query: 1179 RLVAQGYSQQEGIDYTETFAPVARLE-----AIRLLISFSVNHNIVLHQMDVKSAFLNGY 1233
            R VA+G      I + +T+ P  +       A+   +S ++++N  + Q+D+ SA+L   
Sbjct: 874  RFVARG-----DIQHPDTYDPGMQSNTVHHYALMTSLSLALDNNYYITQLDISSAYLYAD 928

Query: 1234 ISEEVYVHQP--LGFEDEKKPDHVFKLKKSLYGLKQAPRAWYERLSSFLLEN---EFVRG 1288
            I EE+Y+  P  LG  D+     + +LKKSLYGLKQ+   WYE + S+L++    E VRG
Sbjct: 929  IKEELYIRPPPHLGMNDK-----LIRLKKSLYGLKQSGANWYETIKSYLIKQCGMEEVRG 983

Query: 1289 KVDTTLFCKTYKDDILIVQIYVDDIIFGSANQSLCKEFSEMMQAEFEMSMM------GEL 1342
                  +   +K+  + + ++VDD+I  S + +  K+    ++ +++  ++       E+
Sbjct: 984  ------WSCVFKNSQVTICLFVDDMILFSKDLNANKKIITTLKKQYDTKIINLGESDNEI 1037

Query: 1343 KY-FLGIQVDQTPEGTY--IHQSKYTKELLKKFNM---LESTVAKTPMHPTCI-----LE 1391
            +Y  LG+++ +   G Y  +       E + K N+    +      P  P        LE
Sbjct: 1038 QYDILGLEI-KYQRGKYMKLGMENSLTEKIPKLNVPLNPKGRKLSAPGQPGLYIDQQELE 1096

Query: 1392 KEDKSGKVCQKLYRGMIGSLLYLTAS-RPDILFSVHLCARFQSDPRETHLTAIKRILRYL 1450
             E+   K+     + +IG   Y+    R D+L+ ++  A+    P +  L     +++++
Sbjct: 1097 LEEDDYKMKVHEMQKLIGLASYVGYKFRFDLLYYINTLAQHILFPSKQVLDMTYELIQFI 1156

Query: 1451 KGTTNLGLMYKKTSEY----KLSGYCDAHYAGDRTERKSTSGNCQFLGSNLVSWASKRQS 1506
              T +  L++ K+       KL    DA Y G++   KS  GN   L   ++   S + S
Sbjct: 1157 WNTRDKQLIWHKSKPVKPTNKLVVISDASY-GNQPYYKSQIGNIYLLNGKVIGGKSTKAS 1215

Query: 1507 TIALSTAEAE 1516
                ST EAE
Sbjct: 1216 LTCTSTTEAE 1225



 Score = 90.1 bits (222), Expect = 4e-17
 Identities = 121/538 (22%), Positives = 208/538 (38%), Gaps = 75/538 (13%)

Query: 611  LDSGCSRHMTGESRMFQELKLKPGGEVGFGGNEKGKIVGTGTICV---DSSPCIDNVLLV 667
            LDSG SR +   +         P   V         I   G +     D++     VL  
Sbjct: 33   LDSGASRTLIRSAHHIHSASSNPDINVVDAQKRNIPINAIGDLQFHFQDNTKTSIKVLHT 92

Query: 668  DGLTHNLLSISQLADKGYDVIFNQKSCRAVSQIDGSVLFNSKRKNNIYKI--------RL 719
              + ++LLS+++LA       F +     + + DG+VL    +  + Y +         +
Sbjct: 93   PNIAYDLLSLNELAAVDITACFTKN---VLERSDGTVLAPIVKYGDFYWVSKKYLLPSNI 149

Query: 720  SELEAQNVKCLLSVNEEQWVW-HRRLGHASMRKISQLSKLNLVRGLP----NLKFASDAL 774
            S     NV    S  +  + + HR L HA+ + I    K N +        +   A D  
Sbjct: 150  SVPTINNVHTSESTRKYPYPFIHRMLAHANAQTIRYSLKNNTITYFNESDVDWSSAIDYQ 209

Query: 775  CEACQKGKFTK---VPFKAKNVVSTSRPLELLHIDLFGPVKTESIGGKRYGMVIVDDYSR 831
            C  C  GK TK   +        ++  P + LH D+FGPV         Y +   D+ ++
Sbjct: 210  CPDCLIGKSTKHRHIKGSRLKYQNSYEPFQYLHTDIFGPVHNLPKSAPSYFISFTDETTK 269

Query: 832  WTWVKFLTRKDESHV--VFSTFIAQVQNEKACRIVRVRSDHGGEFES-----LFDSYGIA 884
            + WV  L  + E  +  VF+T +A ++N+    ++ ++ D G E+ +       +  GI 
Sbjct: 270  FRWVYPLHDRREDSILDVFTTILAFIKNQFQASVLVIQMDRGSEYTNRTLHKFLEKNGIT 329

Query: 885  HDFSCPRTPQQNGVVERKNRTLQEMARTMLQETGMAKHFLAEAVNTACYIQNRIS----- 939
              ++     + +GV ER NRTL +  RT LQ +G+  H    A+  +  ++N ++     
Sbjct: 330  PCYTTTADSRAHGVAERLNRTLLDDCRTQLQCSGLPNHLWFSAIEFSTIVRNSLASPKSK 389

Query: 940  -------------VRPILNKTPYELWKNIKPNISYFHPFGCVCYVLNTK----------D 976
                         +  +L      +  +  PN S  HP G   Y L+             
Sbjct: 390  KSARQHAGLAGLDISTLLPFGQPVIVNDHNPN-SKIHPRGIPGYALHPSRNSYGYIIYLP 448

Query: 977  RLHKFDAKSSKCLLLGYSERSKGFRFYNTDAKTIEESIHVRFDDKLDSDQSKLVEKFADL 1036
             L K    ++  +L G   R   F   N DA T +E ++           S  +++  DL
Sbjct: 449  SLKKTVDTTNYVILQGKESRLDQF---NYDALTFDEDLNRLTASYQSFIASNEIQQSDDL 505

Query: 1037 SINVSDKGKAPEEAEPEED--------EPEEEAGPS----DSQTLKKSRITAAHPKEL 1082
            +I      ++  E  PE+          P +   PS    DS+ + K+ I A  P+E+
Sbjct: 506  NIESDHDFQSDIELHPEQPRNVLSKAVSPTDSTPPSTHTEDSKRVSKTNIRA--PREV 561


>YJL3_YEAST (P47024) Transposon Ty4 207.7 kDa hypothetical protein
          Length = 1803

 Score =  102 bits (254), Expect = 8e-21
 Identities = 103/432 (23%), Positives = 200/432 (45%), Gaps = 55/432 (12%)

Query: 1134 EELNQFSKNDVWSLVKKPESVHVIGTKWVFRNKLNEKGDVVRNKARLVAQGYSQQEGIDY 1193
            +++  F  +  +S  + P+++ ++ T  +F  K N        KAR+V +G +Q     Y
Sbjct: 1301 KDMKVFDVDVKYSRSEIPDNL-IVPTNTIFTKKRNGI-----YKARIVCRGDTQSPDT-Y 1353

Query: 1194 TETFAPVARLEAIRLLISFSVNHNIVLHQMDVKSAFLNGYISEEVYVHQPLGFEDEKKPD 1253
            +           I++ +  + N N+ +  +D+  AFL   + EE+Y+  P    D +   
Sbjct: 1354 SVITTESLNHNHIKIFLMIANNRNMFMKTLDINHAFLYAKLEEEIYIPHP---HDRRC-- 1408

Query: 1254 HVFKLKKSLYGLKQAPRAWYERLSSFL-----LENEFVRGKVDTTLFCKTYKDDILIVQI 1308
             V KL K+LYGLKQ+P+ W + L  +L      +N +  G   T       +D  L++ +
Sbjct: 1409 -VVKLNKALYGLKQSPKEWNDHLRQYLNGIGLKDNSYTPGLYQT-------EDKNLMIAV 1460

Query: 1309 YVDDIIFGSANQSLCKEFSEMMQAEFEMSMMGEL------KYFLGIQ---------VDQT 1353
            YVDD +  ++N+    EF   +++ FE+ + G L         LG+          +D T
Sbjct: 1461 YVDDCVIAASNEQRLDEFINKLKSNFELKITGTLIDDVLDTDILGMDLVYNKRLGTIDLT 1520

Query: 1354 PEGTYIHQ--SKYTKEL--LKKFNMLESTVAKTPMHPTCILEKEDKSGKVCQKLYRGMIG 1409
             + ++I++   KY +EL  ++K ++   +  K       +   E++  +   KL + ++G
Sbjct: 1521 LK-SFINRMDKKYNEELKKIRKSSIPHMSTYKIDPKKDVLQMSEEEFRQGVLKLQQ-LLG 1578

Query: 1410 SLLYLT-ASRPDILFSVHLCARFQSDPRETHLTAIKRILRYLKGTTNLGLMYKK--TSEY 1466
             L Y+    R DI F+V   AR  + P E     I +I++YL    ++G+ Y +    + 
Sbjct: 1579 ELNYVRHKCRYDIEFAVKKVARLVNYPHERVFYMIYKIIQYLVRYKDIGIHYDRDCNKDK 1638

Query: 1467 KLSGYCDAHYAGDRTERKSTSGNCQFLGSNLVSWASKRQSTIALSTAEAEYISAAICSVR 1526
            K+    DA   G   + +S  G   + G N+ +  S + +   +S+ EAE     + ++ 
Sbjct: 1639 KVIAITDAS-VGSEYDAQSRIGVILWYGMNIFNVYSNKSTNRCVSSTEAE-----LHAIY 1692

Query: 1527 PGFSEQVNFRLT 1538
             G+++    ++T
Sbjct: 1693 EGYADSETLKVT 1704



 Score = 73.6 bits (179), Expect = 4e-12
 Identities = 68/276 (24%), Positives = 119/276 (42%), Gaps = 16/276 (5%)

Query: 741  HRRLGHASMRKISQLSKLN-LVRGLPNLKFASDALCEACQKGKFTK---VPFKAKNVVST 796
            H+R+GH  +++I    K N     L  +K  ++  C+ C+  K TK         N  + 
Sbjct: 562  HKRMGHTGIQQIENSIKHNHYEESLDLIKEPNEFWCQTCKISKATKRNHYTGSMNNHSTD 621

Query: 797  SRPLELLHIDLFGPVKTESIGGKRYGMVIVDDYSRWTWVKFLTRKDESHVVFSTF--IAQ 854
              P     +D+FGPV + +   KRY +++VD+ +R+        K+   ++      I  
Sbjct: 622  HEPGSSWCMDIFGPVSSSNADTKRYMLIMVDNNTRYCMTSTHFNKNAETILAQVRKNIQY 681

Query: 855  VQNEKACRIVRVRSDHGGEF-----ESLFDSYGIAHDFSCPRTPQQNGVVERKNRTLQEM 909
            V+ +   ++  + SD G EF     E  F S GI H  +  +    NG  ER  RT+   
Sbjct: 682  VETQFDRKVREINSDRGTEFTNDQIEEYFISKGIHHILTSTQDHAANGRAERYIRTIITD 741

Query: 910  ARTMLQETGMAKHFLAEAVNTACYIQNRISVRPILNKTPYELWKN--IKPNISYFHPFGC 967
            A T+L+++ +   F   AV +A  I+N +  +    K P +      +   +  F PFG 
Sbjct: 742  ATTLLRQSNLRVKFWEYAVTSATNIRNYLEHKS-TGKLPLKAISRQPVTVRLMSFLPFGE 800

Query: 968  VCYVLNTKDRLHKFDAKSSKCLLLGYSERSKGFRFY 1003
               + N   +  K        ++L     S G++F+
Sbjct: 801  KGIIWNHNHK--KLKPSGLPSIILCKDPNSYGYKFF 834


>YMT5_YEAST (Q04214) Transposon Ty1 protein B
          Length = 1328

 Score =  101 bits (252), Expect = 1e-20
 Identities = 105/425 (24%), Positives = 194/425 (44%), Gaps = 50/425 (11%)

Query: 1124 QDKDWILAMEEELNQFSKNDVWSLVK-----KPESVHVIGTKWVFRNKLNEKGDVVRNKA 1178
            + + +I A  +E+NQ  K   W   K     + +   VI + ++F  K +       +KA
Sbjct: 819  EKEKYIEAYHKEVNQLLKMKTWDTDKYYDRKEIDPKRVINSMFIFNRKRDGT-----HKA 873

Query: 1179 RLVAQGYSQQEGIDYTETFAPVARLEAIRLLISFSVNHNIVLHQMDVKSAFLNGYISEEV 1238
            R VA+G  Q      +   +      A+   +S ++++N  + Q+D+ SA+L   I EE+
Sbjct: 874  RFVARGDIQHPDTYDSGMQSNTVHHYALMTSLSLALDNNYYITQLDISSAYLYADIKEEL 933

Query: 1239 YVHQP--LGFEDEKKPDHVFKLKKSLYGLKQAPRAWYERLSSFLLEN---EFVRGKVDTT 1293
            Y+  P  LG  D+     + +LKKSLYGLKQ+   WYE + S+L++    E VRG     
Sbjct: 934  YIRPPPHLGMNDK-----LIRLKKSLYGLKQSGANWYETIKSYLIKQCGMEEVRG----- 983

Query: 1294 LFCKTYKDDILIVQIYVDDIIFGSANQSLCKEFSEMMQAEFEMSMMG------ELKY-FL 1346
             +   +++  + + ++VDD++  S N +  K   + ++ +++  ++       E++Y  L
Sbjct: 984  -WSCVFENSQVTICLFVDDMVLFSKNLNSNKRIIDKLKMQYDTKIINLGESDEEIQYDIL 1042

Query: 1347 GIQVDQTPEGTY--IHQSKYTKELLKKFNM---LESTVAKTPMHPTCI-----LEKEDKS 1396
            G+++ +   G Y  +       E + K N+    +      P  P        LE E+  
Sbjct: 1043 GLEI-KYQRGKYMKLGMENSLTEKIPKLNVPLNPKGRKLSAPGQPGLYIDQQELELEEDD 1101

Query: 1397 GKVCQKLYRGMIGSLLYLTAS-RPDILFSVHLCARFQSDPRETHLTAIKRILRYLKGTTN 1455
             K+     + +IG   Y+    R D+L+ ++  A+    P +  L     +++++  T +
Sbjct: 1102 YKMKVHEMQKLIGLASYVGYKFRFDLLYYINTLAQHILFPSKQVLDMTYELIQFIWNTRD 1161

Query: 1456 LGLMYKKTSEY----KLSGYCDAHYAGDRTERKSTSGNCQFLGSNLVSWASKRQSTIALS 1511
              L++ K+       KL    DA Y G++   KS  GN   L   ++   S + S    S
Sbjct: 1162 KQLIWHKSKPVKPTNKLVVISDASY-GNQPYYKSQIGNIYLLNGKVIGGKSTKASLTCTS 1220

Query: 1512 TAEAE 1516
            T EAE
Sbjct: 1221 TTEAE 1225



 Score = 89.0 bits (219), Expect = 9e-17
 Identities = 121/538 (22%), Positives = 208/538 (38%), Gaps = 75/538 (13%)

Query: 611  LDSGCSRHMTGESRMFQELKLKPGGEVGFGGNEKGKIVGTGTICV---DSSPCIDNVLLV 667
            LDSG SR +   +         P   V         I   G +     D++     VL  
Sbjct: 33   LDSGASRTLIRSAHHIHSASSNPDINVVDAQKRNIPINAIGDLQFHFQDNTKTSIKVLHT 92

Query: 668  DGLTHNLLSISQLADKGYDVIFNQKSCRAVSQIDGSVLFNSKRKNNIYKI--------RL 719
              + ++LLS+++LA       F +     + + DG+VL    +  + Y +         +
Sbjct: 93   PNIAYDLLSLNELAAVDITACFTKN---VLERSDGTVLAPIVKYGDFYWVSKKYLLPSNI 149

Query: 720  SELEAQNVKCLLSVNEEQWVW-HRRLGHASMRKISQLSKLNLVRGLP----NLKFASDAL 774
            S     NV    S  +  + + HR L HA+ + I    K N +        +   A D  
Sbjct: 150  SVPTINNVHTSESTRKYPYPFIHRMLAHANAQTIRYSLKNNTITYFNESDVDWSSAIDYQ 209

Query: 775  CEACQKGKFTK---VPFKAKNVVSTSRPLELLHIDLFGPVKTESIGGKRYGMVIVDDYSR 831
            C  C  GK TK   +        ++  P + LH D+FGPV         Y +   D+ ++
Sbjct: 210  CPDCLIGKSTKHRHIKGSRLKYQNSYEPFQYLHTDIFGPVHNLPKSAPSYFISFTDETTK 269

Query: 832  WTWVKFLTRKDESHV--VFSTFIAQVQNEKACRIVRVRSDHGGEFES-----LFDSYGIA 884
            + WV  L  + E  +  VF+T +A ++N+    ++ ++ D G E+ +       +  GI 
Sbjct: 270  FRWVYPLHDRREDSILDVFTTILAFIKNQFQASVLVIQMDRGSEYTNRTLHKFLEKNGIT 329

Query: 885  HDFSCPRTPQQNGVVERKNRTLQEMARTMLQETGMAKHFLAEAVNTACYIQNRIS----- 939
              ++     + +GV ER NRTL +  RT LQ +G+  H    A+  +  ++N ++     
Sbjct: 330  PCYTTTADSRAHGVAERLNRTLLDDCRTQLQCSGLPNHLWFSAIEFSTIVRNSLASPKSK 389

Query: 940  -------------VRPILNKTPYELWKNIKPNISYFHPFGCVCYVLNTK----------D 976
                         +  +L      +  +  PN S  HP G   Y L+             
Sbjct: 390  KSARQHAGLAGLDISTLLPFGQPVIVNDHNPN-SKIHPRGIPGYALHPSRNSYGYIIYLP 448

Query: 977  RLHKFDAKSSKCLLLGYSERSKGFRFYNTDAKTIEESIHVRFDDKLDSDQSKLVEKFADL 1036
             L K    ++  +L G   R   F   N DA T +E ++           S  +++  DL
Sbjct: 449  SLKKTVDTTNYVILQGKESRLDQF---NYDALTFDEDLNRLTASYHSFIASNEIQQSNDL 505

Query: 1037 SINVSDKGKAPEEAEPEE--------DEPEEEAGPS----DSQTLKKSRITAAHPKEL 1082
            +I      ++  E  PE+          P +   PS    DS+ + K+ I A  P+E+
Sbjct: 506  NIESDHDFQSDIELHPEQLRNVLSKAVSPTDSTPPSTHTEDSKRVSKTNIRA--PREV 561


>YME4_YEAST (Q04711) Transposon Ty1 protein B
          Length = 1328

 Score =  101 bits (252), Expect = 1e-20
 Identities = 103/425 (24%), Positives = 198/425 (46%), Gaps = 50/425 (11%)

Query: 1124 QDKDWILAMEEELNQFSKNDVWSLVK-----KPESVHVIGTKWVFRNKLNEKGDVVRNKA 1178
            + + +I A  +E+NQ  K + W   K     + +   VI + ++F  K +       +KA
Sbjct: 819  EKEKYIEAYHKEVNQLLKMNTWDTDKYYDRKEIDPKRVINSMFIFNRKRDGT-----HKA 873

Query: 1179 RLVAQGYSQQEGIDYTETFAPVARLEAIRLLISFSVNHNIVLHQMDVKSAFLNGYISEEV 1238
            R VA+G  Q      +   +      A+   +S ++++N  + Q+D+ SA+L   I EE+
Sbjct: 874  RFVARGDIQHPDTYDSGMQSNTVHHYALMTSLSLALDNNYYITQLDISSAYLYADIKEEL 933

Query: 1239 YVHQP--LGFEDEKKPDHVFKLKKSLYGLKQAPRAWYERLSSFLLEN---EFVRGKVDTT 1293
            Y+  P  LG  D+     + +LKKSLYGLKQ+   WYE + S+L++    E VRG     
Sbjct: 934  YIRPPPHLGMNDK-----LIRLKKSLYGLKQSGANWYETIKSYLIKQCGMEEVRG----- 983

Query: 1294 LFCKTYKDDILIVQIYVDDIIFGSANQSLCKEFSEMMQAEFEMSMM------GELKY-FL 1346
             +   +K+  + + ++VDD+I  S + +  K+    ++ +++  ++       E++Y  L
Sbjct: 984  -WSCVFKNSQVTICLFVDDMILFSKDLNANKKIITTLKKQYDTKIINLGESDNEIQYDIL 1042

Query: 1347 GIQVDQTPEGTY--IHQSKYTKELLKKFNM---LESTVAKTPMHPTCILEKED---KSGK 1398
            G+++ +   G Y  +       E + K N+    +      P  P   +++++      +
Sbjct: 1043 GLEI-KYQRGKYMKLGMENSLTEKIPKLNVPLNPKGRKLSAPGQPGLYIDQDELEIDEDE 1101

Query: 1399 VCQKLY--RGMIGSLLYLTAS-RPDILFSVHLCARFQSDPRETHLTAIKRILRYLKGTTN 1455
              +K++  + +IG   Y+    R D+L+ ++  A+    P    L     +++++  T +
Sbjct: 1102 YKEKVHEMQKLIGLASYVGYKFRFDLLYYINTLAQHILFPSRQVLDMTYELIQFMWDTRD 1161

Query: 1456 LGLMYKKTS----EYKLSGYCDAHYAGDRTERKSTSGNCQFLGSNLVSWASKRQSTIALS 1511
              L++ K      + KL    DA Y G++   KS  GN   L   ++   S + S    S
Sbjct: 1162 KQLIWHKNKPTEPDNKLVAISDASY-GNQPYYKSQIGNIYLLNGKVIGGKSTKASLTCTS 1220

Query: 1512 TAEAE 1516
            T EAE
Sbjct: 1221 TTEAE 1225



 Score = 90.1 bits (222), Expect = 4e-17
 Identities = 121/538 (22%), Positives = 208/538 (38%), Gaps = 75/538 (13%)

Query: 611  LDSGCSRHMTGESRMFQELKLKPGGEVGFGGNEKGKIVGTGTICV---DSSPCIDNVLLV 667
            LDSG SR +   +         P   V         I   G +     D++     VL  
Sbjct: 33   LDSGASRTLIRSAHHIHSASSNPDINVVDAQKRNIPINAIGDLQFHFQDNTKTSIKVLHT 92

Query: 668  DGLTHNLLSISQLADKGYDVIFNQKSCRAVSQIDGSVLFNSKRKNNIYKI--------RL 719
              + ++LLS+++LA       F +     + + DG+VL    +  + Y +         +
Sbjct: 93   PNIAYDLLSLNELAAVDITACFTKN---VLERSDGTVLAPIVKYGDFYWVSKKYLLPSNI 149

Query: 720  SELEAQNVKCLLSVNEEQWVW-HRRLGHASMRKISQLSKLNLVRGLP----NLKFASDAL 774
            S     NV    S  +  + + HR L HA+ + I    K N +        +   A D  
Sbjct: 150  SVPTINNVHTSESTRKYPYPFIHRMLAHANAQTIRYSLKNNTITYFNESDVDWSSAIDYQ 209

Query: 775  CEACQKGKFTK---VPFKAKNVVSTSRPLELLHIDLFGPVKTESIGGKRYGMVIVDDYSR 831
            C  C  GK TK   +        ++  P + LH D+FGPV         Y +   D+ ++
Sbjct: 210  CPDCLIGKSTKHRHIKGSRLKYQNSYEPFQYLHTDIFGPVHNLPKSAPSYFISFTDETTK 269

Query: 832  WTWVKFLTRKDESHV--VFSTFIAQVQNEKACRIVRVRSDHGGEFES-----LFDSYGIA 884
            + WV  L  + E  +  VF+T +A ++N+    ++ ++ D G E+ +       +  GI 
Sbjct: 270  FRWVYPLHDRREDSILDVFTTILAFIKNQFQASVLVIQMDRGSEYTNRTLHKFLEKNGIT 329

Query: 885  HDFSCPRTPQQNGVVERKNRTLQEMARTMLQETGMAKHFLAEAVNTACYIQNRIS----- 939
              ++     + +GV ER NRTL +  RT LQ +G+  H    A+  +  ++N ++     
Sbjct: 330  PCYTTTADSRAHGVAERLNRTLLDDCRTQLQCSGLPNHLWFSAIEFSTIVRNSLASPKSK 389

Query: 940  -------------VRPILNKTPYELWKNIKPNISYFHPFGCVCYVLNTK----------D 976
                         +  +L      +  +  PN S  HP G   Y L+             
Sbjct: 390  KSARQHAGLAGLDISTLLPFGQPVIVNDHNPN-SKIHPRGIPGYALHPSRNSYGYIIYLP 448

Query: 977  RLHKFDAKSSKCLLLGYSERSKGFRFYNTDAKTIEESIHVRFDDKLDSDQSKLVEKFADL 1036
             L K    ++  +L G   R   F   N DA T +E ++           S  +++  DL
Sbjct: 449  SLKKTVDTTNYVILQGKESRLDQF---NYDALTFDEDLNRLTASYQSFIASNEIQQSDDL 505

Query: 1037 SINVSDKGKAPEEAEPEED--------EPEEEAGPS----DSQTLKKSRITAAHPKEL 1082
            +I      ++  E  PE+          P +   PS    DS+ + K+ I A  P+E+
Sbjct: 506  NIESDHDFQSDIELHPEQPRNVLSKAVSPTDSTPPSTHTEDSKRVSKTNIRA--PREV 561


>YMD9_YEAST (Q03434) Transposon Ty1 protein B
          Length = 1328

 Score =  101 bits (251), Expect = 2e-20
 Identities = 105/425 (24%), Positives = 199/425 (46%), Gaps = 50/425 (11%)

Query: 1124 QDKDWILAMEEELNQFSKNDVWSLV-----KKPESVHVIGTKWVFRNKLNEKGDVVRNKA 1178
            + + +I A  +E+NQ  K   W        K+ +   VI + ++F    N+K D   +KA
Sbjct: 819  EKEKYIEAYHKEVNQLLKMKTWDTDEYYDRKEIDPKRVINSMFIF----NKKRDGT-HKA 873

Query: 1179 RLVAQGYSQQEGIDYTETFAPVARLEAIRLLISFSVNHNIVLHQMDVKSAFLNGYISEEV 1238
            R VA+G  Q      +   +      A+   +S ++++N  + Q+D+ SA+L   I EE+
Sbjct: 874  RFVARGDIQHPDTYDSGMQSNTVHHYALMTSLSLALDNNYYITQLDISSAYLYADIKEEL 933

Query: 1239 YVHQP--LGFEDEKKPDHVFKLKKSLYGLKQAPRAWYERLSSFLLEN---EFVRGKVDTT 1293
            Y+  P  LG  D+     + +LKKSLYGLKQ+   WYE + S+L++    E VRG     
Sbjct: 934  YIRPPPHLGMNDK-----LIRLKKSLYGLKQSGANWYETIKSYLIKQCGMEEVRG----- 983

Query: 1294 LFCKTYKDDILIVQIYVDDIIFGSANQSLCKEFSEMMQAEFEMSMM------GELKY-FL 1346
             +   +K+  + + ++VDD+I  S + +  K+    ++ +++  ++       E++Y  L
Sbjct: 984  -WSCVFKNSQVTICLFVDDMILFSKDLNANKKIITTLKKQYDTKIINLGESDNEIQYDIL 1042

Query: 1347 GIQVDQTPEGTY--IHQSKYTKELLKKFNM---LESTVAKTPMHPTCILEKED---KSGK 1398
            G+++ +   G Y  +       E + K N+    +      P  P   +++++      +
Sbjct: 1043 GLEI-KYQRGKYMKLGMENSLTEKIPKLNVPLNPKGRKLSAPGQPGLYIDQDELEIDEDE 1101

Query: 1399 VCQKLY--RGMIGSLLYLTAS-RPDILFSVHLCARFQSDPRETHLTAIKRILRYLKGTTN 1455
              +K++  + +IG   Y+    R D+L+ ++  A+    P    L     +++++  T +
Sbjct: 1102 YKEKVHEMQKLIGLASYVGYKFRFDLLYYINTLAQHILFPSRQVLDMTYELIQFMWDTRD 1161

Query: 1456 LGLMYKKTS----EYKLSGYCDAHYAGDRTERKSTSGNCQFLGSNLVSWASKRQSTIALS 1511
              L++ K      + KL    DA Y G++   KS  GN   L   ++   S + S    S
Sbjct: 1162 KQLIWHKNKPTEPDNKLVAISDASY-GNQPYYKSQIGNIYLLNGKVIGGKSTKASLTCTS 1220

Query: 1512 TAEAE 1516
            T EAE
Sbjct: 1221 TTEAE 1225



 Score = 90.1 bits (222), Expect = 4e-17
 Identities = 121/538 (22%), Positives = 208/538 (38%), Gaps = 75/538 (13%)

Query: 611  LDSGCSRHMTGESRMFQELKLKPGGEVGFGGNEKGKIVGTGTICV---DSSPCIDNVLLV 667
            LDSG SR +   +         P   V         I   G +     D++     VL  
Sbjct: 33   LDSGASRTLIRSAHHIHSASSNPDINVVDAQKRNIPINAIGDLQFHFQDNTKTSIKVLHT 92

Query: 668  DGLTHNLLSISQLADKGYDVIFNQKSCRAVSQIDGSVLFNSKRKNNIYKI--------RL 719
              + ++LLS+++LA       F +     + + DG+VL    +  + Y +         +
Sbjct: 93   PNIAYDLLSLNELAAVDITACFTKN---VLERSDGTVLAPIVKYGDFYWVSKKYLLPSNI 149

Query: 720  SELEAQNVKCLLSVNEEQWVW-HRRLGHASMRKISQLSKLNLVRGLP----NLKFASDAL 774
            S     NV    S  +  + + HR L HA+ + I    K N +        +   A D  
Sbjct: 150  SVPTINNVHTSESTRKYPYPFIHRMLAHANAQTIRYSLKNNTITYFNESDVDRSSAIDYQ 209

Query: 775  CEACQKGKFTK---VPFKAKNVVSTSRPLELLHIDLFGPVKTESIGGKRYGMVIVDDYSR 831
            C  C  GK TK   +        ++  P + LH D+FGPV         Y +   D+ ++
Sbjct: 210  CPDCLIGKSTKHRHIKGSRLKYQNSYEPFQYLHTDIFGPVHNLPKSAPSYFISFTDETTK 269

Query: 832  WTWVKFLTRKDESHV--VFSTFIAQVQNEKACRIVRVRSDHGGEFES-----LFDSYGIA 884
            + WV  L  + E  +  VF+T +A ++N+    ++ ++ D G E+ +       +  GI 
Sbjct: 270  FRWVYPLHDRREDSILDVFTTILAFIKNQFQASVLVIQMDRGSEYTNRTLHKFLEKNGIT 329

Query: 885  HDFSCPRTPQQNGVVERKNRTLQEMARTMLQETGMAKHFLAEAVNTACYIQNRIS----- 939
              ++     + +GV ER NRTL +  RT LQ +G+  H    A+  +  ++N ++     
Sbjct: 330  PCYTTTADSRAHGVAERLNRTLLDDCRTQLQCSGLPNHLWFSAIEFSTIVRNSLASPKSK 389

Query: 940  -------------VRPILNKTPYELWKNIKPNISYFHPFGCVCYVLNTK----------D 976
                         +  +L      +  +  PN S  HP G   Y L+             
Sbjct: 390  KSARQHAGLAGLDISTLLPFGQPVIVNDHNPN-SKIHPRGIPGYALHPSRNSYGYIIYLP 448

Query: 977  RLHKFDAKSSKCLLLGYSERSKGFRFYNTDAKTIEESIHVRFDDKLDSDQSKLVEKFADL 1036
             L K    ++  +L G   R   F   N DA T +E ++           S  +++  DL
Sbjct: 449  SLKKTVDTTNYVILQGKESRLDQF---NYDALTFDEDLNRLTASYQSFIASNEIQQSDDL 505

Query: 1037 SINVSDKGKAPEEAEPEED--------EPEEEAGPS----DSQTLKKSRITAAHPKEL 1082
            +I      ++  E  PE+          P +   PS    DS+ + K+ I A  P+E+
Sbjct: 506  NIESDHDFQSDIELHPEQPRNVLSKAVSPTDSTPPSTHTEDSKRVSKTNIRA--PREV 561


>YJZ7_YEAST (P47098) Transposon Ty1 protein B
          Length = 1755

 Score = 99.4 bits (246), Expect = 7e-20
 Identities = 105/425 (24%), Positives = 198/425 (45%), Gaps = 50/425 (11%)

Query: 1124 QDKDWILAMEEELNQFSKNDVWSLV-----KKPESVHVIGTKWVFRNKLNEKGDVVRNKA 1178
            + + +I A  +E+NQ  K   W        K+ +   VI + ++F    N+K D   +KA
Sbjct: 1246 EKEKYIEAYHKEVNQLLKMKTWDTDEYYDRKEIDPKRVINSMFIF----NKKRDGT-HKA 1300

Query: 1179 RLVAQGYSQQEGIDYTETFAPVARLEAIRLLISFSVNHNIVLHQMDVKSAFLNGYISEEV 1238
            R VA+G  Q      T   +      A+   +S ++++N  + Q+D+ SA+L   I EE+
Sbjct: 1301 RFVARGDIQHPDTYDTGMQSNTVHHYALMTSLSLALDNNYYITQLDISSAYLYADIKEEL 1360

Query: 1239 YVHQP--LGFEDEKKPDHVFKLKKSLYGLKQAPRAWYERLSSFLLEN---EFVRGKVDTT 1293
            Y+  P  LG  D+     + +LKKS YGLKQ+   WYE + S+L++    E VRG     
Sbjct: 1361 YIRPPPHLGMNDK-----LIRLKKSHYGLKQSGANWYETIKSYLIKQCGMEEVRG----- 1410

Query: 1294 LFCKTYKDDILIVQIYVDDIIFGSANQSLCKEFSEMMQAEFEMSMM------GELKY-FL 1346
             +   +K+  + + ++VDD+I  S + +  K+    ++ +++  ++       E++Y  L
Sbjct: 1411 -WSCVFKNSQVTICLFVDDMILFSKDLNANKKIITTLKKQYDTKIINLGESDNEIQYDIL 1469

Query: 1347 GIQVDQTPEGTY--IHQSKYTKELLKKFNM---LESTVAKTPMHPTCILEKED---KSGK 1398
            G+++ +   G Y  +       E + K N+    +      P  P   +++++      +
Sbjct: 1470 GLEI-KYQRGKYMKLGMENSLTEKIPKLNVPLNPKGRKLSAPGQPGLYIDQDELEIDEDE 1528

Query: 1399 VCQKLY--RGMIGSLLYLTAS-RPDILFSVHLCARFQSDPRETHLTAIKRILRYLKGTTN 1455
              +K++  + +IG   Y+    R D+L+ ++  A+    P    L     +++++  T +
Sbjct: 1529 YKEKVHEMQKLIGLASYVGYKFRFDLLYYINTLAQHILFPSRQVLDMTYELIQFMWDTRD 1588

Query: 1456 LGLMYKKTS----EYKLSGYCDAHYAGDRTERKSTSGNCQFLGSNLVSWASKRQSTIALS 1511
              L++ K      + KL    DA Y G++   KS  GN   L   ++   S + S    S
Sbjct: 1589 KQLIWHKNKPTEPDNKLVAISDASY-GNQPYYKSQIGNIFLLNGKVIGGKSTKASLTCTS 1647

Query: 1512 TAEAE 1516
            T EAE
Sbjct: 1648 TTEAE 1652



 Score = 89.7 bits (221), Expect = 6e-17
 Identities = 121/538 (22%), Positives = 208/538 (38%), Gaps = 75/538 (13%)

Query: 611  LDSGCSRHMTGESRMFQELKLKPGGEVGFGGNEKGKIVGTGTICV---DSSPCIDNVLLV 667
            LDSG SR +   +         P   V         I   G +     D++     VL  
Sbjct: 460  LDSGASRTLIRSAHHIHSASSNPDINVVDAQKRNIPINAIGDLQFHFQDNTKTSIKVLHT 519

Query: 668  DGLTHNLLSISQLADKGYDVIFNQKSCRAVSQIDGSVLFNSKRKNNIYKI--------RL 719
              + ++LLS+++LA       F +     + + DG+VL    +  + Y +         +
Sbjct: 520  PNIAYDLLSLNELAAVDITACFTKN---VLERSDGTVLAPIVQYGDFYWVSKRYLLPSNI 576

Query: 720  SELEAQNVKCLLSVNEEQWVW-HRRLGHASMRKISQLSKLNLVRGLP----NLKFASDAL 774
            S     NV    S  +  + + HR L HA+ + I    K N +        +   A D  
Sbjct: 577  SVPTINNVHTSESTRKYPYPFIHRMLAHANAQTIRYSLKNNTITYFNESDVDWSSAIDYQ 636

Query: 775  CEACQKGKFTK---VPFKAKNVVSTSRPLELLHIDLFGPVKTESIGGKRYGMVIVDDYSR 831
            C  C  GK TK   +        ++  P + LH D+FGPV         Y +   D+ ++
Sbjct: 637  CPDCLIGKSTKHRHIKGSRLKYQNSYEPFQYLHTDIFGPVHNLPKSAPSYFISFTDETTK 696

Query: 832  WTWVKFLTRKDESHV--VFSTFIAQVQNEKACRIVRVRSDHGGEFES-----LFDSYGIA 884
            + WV  L  + E  +  VF+T +A ++N+    ++ ++ D G E+ +       +  GI 
Sbjct: 697  FRWVYPLHDRREDSILDVFTTILAFIKNQFQASVLVIQMDRGSEYTNRTLHKFLEKNGIT 756

Query: 885  HDFSCPRTPQQNGVVERKNRTLQEMARTMLQETGMAKHFLAEAVNTACYIQNRIS----- 939
              ++     + +GV ER NRTL +  RT LQ +G+  H    A+  +  ++N ++     
Sbjct: 757  PCYTTTADSRAHGVAERLNRTLLDDCRTQLQCSGLPNHLWFSAIEFSTIVRNSLASPKSK 816

Query: 940  -------------VRPILNKTPYELWKNIKPNISYFHPFGCVCYVLNTK----------D 976
                         +  +L      +  +  PN S  HP G   Y L+             
Sbjct: 817  KSARQHAGLAGLDISTLLPFGQPVIVNDHNPN-SKIHPRGIPGYALHPSRNSYGYIIYLP 875

Query: 977  RLHKFDAKSSKCLLLGYSERSKGFRFYNTDAKTIEESIHVRFDDKLDSDQSKLVEKFADL 1036
             L K    ++  +L G   R   F   N DA T +E ++           S  +++  DL
Sbjct: 876  SLKKTVDTTNYVILQGKESRLDQF---NYDALTFDEDLNRLTASYQSFIASNEIQESNDL 932

Query: 1037 SINVSDKGKAPEEAEPEED--------EPEEEAGPS----DSQTLKKSRITAAHPKEL 1082
            +I      ++  E  PE+          P +   PS    DS+ + K+ I A  P+E+
Sbjct: 933  NIESDHDFQSDIELHPEQPRNVLSKAVSPTDSTPPSTHTEDSKRVSKTNIRA--PREV 988


>M820_ARATH (P92520) Hypothetical mitochondrial protein AtMg00820
            (ORF170)
          Length = 170

 Score = 99.0 bits (245), Expect = 9e-20
 Identities = 46/97 (47%), Positives = 67/97 (68%)

Query: 1115 EPKSIDEALQDKDWILAMEEELNQFSKNDVWSLVKKPESVHVIGTKWVFRNKLNEKGDVV 1174
            EPKS+  AL+D  W  AM+EEL+  S+N  W LV  P + +++G KWVF+ KL+  G + 
Sbjct: 27   EPKSVIFALKDPGWCQAMQEELDALSRNKTWILVPPPVNQNILGCKWVFKTKLHSDGTLD 86

Query: 1175 RNKARLVAQGYSQQEGIDYTETFAPVARLEAIRLLIS 1211
            R KARLVA+G+ Q+EGI + ET++PV R   IR +++
Sbjct: 87   RLKARLVAKGFHQEEGIYFVETYSPVVRTATIRTILN 123


>M240_ARATH (P93290) Hypothetical mitochondrial protein AtMg00240
            (ORF111a)
          Length = 111

 Score = 66.2 bits (160), Expect = 7e-10
 Identities = 29/82 (35%), Positives = 49/82 (59%)

Query: 1412 LYLTASRPDILFSVHLCARFQSDPRETHLTAIKRILRYLKGTTNLGLMYKKTSEYKLSGY 1471
            +YLT +RPD+ F+V+  ++F S  R   + A+ ++L Y+KGT   GL Y  TS+ +L  +
Sbjct: 1    MYLTITRPDLTFAVNRLSQFSSASRTAQMQAVYKVLHYVKGTVGQGLFYSATSDLQLKAF 60

Query: 1472 CDAHYAGDRTERKSTSGNCQFL 1493
             D+ +A     R+S +G C  +
Sbjct: 61   ADSDWASCPDTRRSVTGFCSLV 82


>POL_BAEVM (P10272) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
            Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
            3.1.26.4) (RT); Integrase (IN)]
          Length = 1189

 Score = 61.2 bits (147), Expect = 2e-08
 Identities = 73/278 (26%), Positives = 120/278 (42%), Gaps = 28/278 (10%)

Query: 690  NQKSCRAVSQIDGSVLFNSKRKNNIYKIRLSELEAQNVKCLLSVNEEQWVWHRRLGHASM 749
            +Q+  RA+   +     N +++    KI L + EA      L++ ++   W   LG+  +
Sbjct: 806  DQEEARAIGATENKDTRNWEKEG---KIVLPQKEA------LAMIQQMHAW-THLGNRKL 855

Query: 750  RKISQLSKLNLVRGLPNLKFASDALCEACQKGKFTKVPFKAKNVVSTSRPLELLHIDLFG 809
            + + + +   + R    ++  + A C+ CQ+         A      +RP     ID F 
Sbjct: 856  KLLIEKTDFLIPRASTLIEQVTSA-CKVCQQVNAGATRVPAGKRTRGNRPGVYWEID-FT 913

Query: 810  PVKTESIGGKRYGMVIVDDYSRWTWVKFLTRKDESHVVFSTFIAQVQNEKACRIVRVRSD 869
             VK    G K Y +V VD +S W    F TR++ +H+V    + ++        V + SD
Sbjct: 914  EVKPHYAGYK-YLLVFVDTFSGWVEA-FPTRQETAHIVAKKILEEIFPRFGLPKV-IGSD 970

Query: 870  HGGEFES-----LFDSYGIAHDFSCPRTPQQNGVVERKNRTLQEMARTMLQETGMA--KH 922
            +G  F S     L    GI     C   PQ +G VER NRT++E    +  ETG+   + 
Sbjct: 971  NGPAFVSQVSQGLARILGINWKLHCAYRPQSSGQVERMNRTIKETLTKLTLETGLKDWRR 1030

Query: 923  FLAEAVNTACYIQNRISVRPILNKTPYELWKNIKPNIS 960
             L+ A+  A    NR  +      TPYE+     P +S
Sbjct: 1031 LLSLALLRARNTPNRFGL------TPYEILYGGPPPLS 1062


>M710_ARATH (P92512) Hypothetical mitochondrial protein AtMg00710
           (ORF120)
          Length = 120

 Score = 58.2 bits (139), Expect = 2e-07
 Identities = 30/69 (43%), Positives = 38/69 (54%)

Query: 903 NRTLQEMARTMLQETGMAKHFLAEAVNTACYIQNRISVRPILNKTPYELWKNIKPNISYF 962
           NRT+ E  R+ML E G+ K F A+A NTA +I N+     I    P E+W    P  SY 
Sbjct: 2   NRTIIEKVRSMLCECGLPKTFRADAANTAVHIINKYPSTAINFHVPDEVWFQSVPTYSYL 61

Query: 963 HPFGCVCYV 971
             FGCV Y+
Sbjct: 62  RRFGCVAYI 70


>POL_FENV1 (P31792) Pol polyprotein [Contains: Reverse
           transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
           (RT); Integrase (IN)] (Fragment)
          Length = 1046

 Score = 55.5 bits (132), Expect = 1e-06
 Identities = 59/195 (30%), Positives = 88/195 (44%), Gaps = 21/195 (10%)

Query: 775 CEACQK--GKFTKVPFKAKNVVSTSRPLELLHIDLFGPVKTESIGGKRYGMVIVDDYSRW 832
           C+ CQ+     T+VP   +     +RP     ID F  VK    G K Y +V VD +S W
Sbjct: 737 CKVCQQVNAGATRVPEGKRT--RGNRPGVYWEID-FTEVKPHYAGYK-YLLVFVDTFSGW 792

Query: 833 TWVKFLTRKDESHVVFSTFIAQVQNEKACRIVRVRSDHGGEFES-----LFDSYGIAHDF 887
               + TR++ +H+V    + ++        V + SD+G  F S     L  + GI    
Sbjct: 793 VEA-YPTRQETAHMVAKKILEEIFPRFGLPKV-IGSDNGPAFVSQVSQGLARTLGINWKL 850

Query: 888 SCPRTPQQNGVVERKNRTLQEMARTMLQETGMA--KHFLAEAVNTACYIQNRISVRPILN 945
            C   PQ +G VER NRT++E    +  ETG+   +  L+ A+  A    NR  +     
Sbjct: 851 HCAYRPQSSGQVERMNRTIKETLTKLTLETGLKDWRRLLSLALLRARNTPNRFGL----- 905

Query: 946 KTPYELWKNIKPNIS 960
            TPYE+     P +S
Sbjct: 906 -TPYEILYGGPPPLS 919


>POL_MLVAV (P03356) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
            Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
            3.1.26.4) (RT); Integrase (IN)]
          Length = 1196

 Score = 52.4 bits (124), Expect = 1e-05
 Identities = 64/231 (27%), Positives = 98/231 (41%), Gaps = 19/231 (8%)

Query: 743  RLGHASMRKISQL-----SKLNLVRGLPNLKFASDALCEACQKGKFTKVPFKAKNVVSTS 797
            RL H   +K+  L     S   ++     L++ +D+ C  C +   +K    A   V   
Sbjct: 849  RLTHLGYQKMKALLDRGESPYYMLNRDKTLQYVADS-CTVCAQVNASKAKIGAGVRVRGH 907

Query: 798  RPLELLHIDLFGPVKTESIGGKRYGMVIVDDYSRWTWVKFLTRKDESHVVFSTFIAQVQN 857
            RP     ID F  VK   + G +Y +V VD +S W    F T+++ + VV    + ++  
Sbjct: 908  RPGSHWEID-FTEVKP-GLYGYKYLLVFVDTFSGWVEA-FPTKRETARVVSKKLLEEIFP 964

Query: 858  EKACRIVRVRSDHGGEF-----ESLFDSYGIAHDFSCPRTPQQNGVVERKNRTLQEMART 912
                  V + SD+G  F     +S+ D  GI     C   PQ +G VER NRT++E    
Sbjct: 965  RFGMPQV-LGSDNGPAFTSQVSQSVADLLGIDWKLHCAYRPQSSGQVERMNRTIKETLTK 1023

Query: 913  MLQETGMAKHFLAEAVNTACYIQNRISVRPILNKTPYELWKNIKPNISYFH 963
            +    G     L   +  A Y + R +  P    TPYE+     P +  FH
Sbjct: 1024 LTLAAGTRDWVL--LLPLALY-RARNTPGP-HGLTPYEILYGAPPPLVNFH 1070


>ATRX_CAEEL (Q9U7E0) Transcriptional regulator ATRX homolog
           (X-linked nuclear protein-1)
          Length = 1359

 Score = 52.0 bits (123), Expect = 1e-05
 Identities = 44/169 (26%), Positives = 72/169 (42%), Gaps = 28/169 (16%)

Query: 207 KRSEMQDLRKKSIALKSKSEKAKAEKSKALQAEEEESEEASEDSDEDELTLISKRLNRIW 266
           ++ E +   K++  LK K E+      K   A++ ++  + ED D++E +          
Sbjct: 25  RQIENERKEKRAQKLKEKREREGKPPPKKRPAKKRKASSSEEDDDDEEES---------- 74

Query: 267 KHRQSKYKGSGKAKGKSESSGQKKSSLKEVTCFECKESGHYKSDCPKLKKDKKPKKHFKT 326
             R+S  K   +AK +SES              E  E    K    K K D+K K+  K 
Sbjct: 75  -PRKSSKKSRKRAKSESESD-------------ESDEEEDRKKSKSKKKVDQKKKEKSKK 120

Query: 327 KKSLMVTFDESESEDVDSDGESKDSWLLSKTKKQSQRELLTLTQNQKEI 375
           K+    T   SE ED D + E K      KTKKQ+  E    ++ ++++
Sbjct: 121 KR----TTSSSEDEDSDEEREQKSKKKSKKTKKQTSSESSEESEEERKV 165



 Score = 48.5 bits (114), Expect = 1e-04
 Identities = 56/222 (25%), Positives = 94/222 (42%), Gaps = 34/222 (15%)

Query: 183 IELTRDVENMSLEELISILKCHELKRSEMQDLRKKSIALKSKS-----------EKAKAE 231
           +E+ R +EN   E+    LK  E +  E +   KK  A K K+           E  +  
Sbjct: 21  LEMARQIENERKEKRAQKLK--EKREREGKPPPKKRPAKKRKASSSEEDDDDEEESPRKS 78

Query: 232 KSKALQAEEEESEEASEDSDEDELTLISKRLNRIWKHRQSKYK---GSGKAKGKSESSGQ 288
             K+ +  + ESE    D +ED     SK+     K  +SK K    S + +   E   Q
Sbjct: 79  SKKSRKRAKSESESDESDEEEDRKKSKSKKKVDQKKKEKSKKKRTTSSSEDEDSDEEREQ 138

Query: 289 K-KSSLKEVTCFECKESGHYKSDCPKLKKDKKPKKHFKTKKSLMVTFDESESED------ 341
           K K   K+       ES     +  K+KK KK K+  K+ K    T +ES+ ++      
Sbjct: 139 KSKKKSKKTKKQTSSESSEESEEERKVKKSKKNKE--KSVKKRAETSEESDEDEKPSKKS 196

Query: 342 ---------VDSDGESKDSWLLSKTKKQSQRELLTLTQNQKE 374
                     +S+ ES+D   + K+KK+S++ +   ++++ E
Sbjct: 197 KKGLKKKAKSESESESEDEKEVKKSKKKSKKVVKKESESEDE 238



 Score = 48.1 bits (113), Expect = 2e-04
 Identities = 42/188 (22%), Positives = 89/188 (47%), Gaps = 14/188 (7%)

Query: 187 RDVENMSLEELISILKCHELKRSEMQDLRKKSIALKSKSEKAKAEKSKALQAEEEESEEA 246
           +   + S EE     K  + K+++ + ++K++   +   E  K  K      +++   E+
Sbjct: 149 KQTSSESSEESEEERKVKKSKKNKEKSVKKRAETSEESDEDEKPSKKSKKGLKKKAKSES 208

Query: 247 SEDSDEDELTLISKRLNRIWKHRQSKYKGSGKAKGKSESSGQKKSSLKEVTCFECKESGH 306
             +S++++    SK+ ++    ++S+ +     K K+E   + K+S +E        S  
Sbjct: 209 ESESEDEKEVKKSKKKSKKVVKKESESEDEAPEKKKTEKRKRSKTSSEE-------SSES 261

Query: 307 YKSDCPKLKKDKKPKKHFKTKKSLMVTFDESESEDVDSDGESKDSWLLSKTKKQSQRELL 366
            KSD  + +K+  PK   K KK L V    S+ E  +SD E     +L + KK+    L+
Sbjct: 262 EKSDEEEEEKESSPKP--KKKKPLAVKKLSSDEESEESDVE-----VLPQKKKRGAVTLI 314

Query: 367 TLTQNQKE 374
           + ++++K+
Sbjct: 315 SDSEDEKD 322



 Score = 47.4 bits (111), Expect = 3e-04
 Identities = 49/174 (28%), Positives = 75/174 (42%), Gaps = 18/174 (10%)

Query: 205 ELKRSEMQDLRKKSIALKSKSEKAKAEKSKALQAEEEES----EEASEDSDEDELTLISK 260
           E K  +     KK  + +S  E  +  K K  +  +E+S     E SE+SDEDE    SK
Sbjct: 137 EQKSKKKSKKTKKQTSSESSEESEEERKVKKSKKNKEKSVKKRAETSEESDEDEKP--SK 194

Query: 261 RLNRIWKHRQSKYKGSGKAKGKSESSGQKKSSLKEVTCFECKESGHYKSDCPKLKKDKKP 320
           +  +  K +      S     K     +KKS  K+V     K+    + + P+ KK +K 
Sbjct: 195 KSKKGLKKKAKSESESESEDEKEVKKSKKKS--KKVV----KKESESEDEAPEKKKTEK- 247

Query: 321 KKHFKTKKSLMVTFDESESEDVDSDGESKDSWLLSKTKKQSQRELLTLTQNQKE 374
           +K  KT        + SESE  D + E K+S    K KK    + L+  +  +E
Sbjct: 248 RKRSKTSSE-----ESSESEKSDEEEEEKESSPKPKKKKPLAVKKLSSDEESEE 296



 Score = 42.7 bits (99), Expect = 0.008
 Identities = 51/202 (25%), Positives = 88/202 (43%), Gaps = 34/202 (16%)

Query: 205 ELKRSEMQDLRKKSIALKSKSEKA-----------KAEKSKALQAEEEESEEASEDSDED 253
           E K++E +   K S    S+SEK+           K +K K L  ++  S+E SE+SD +
Sbjct: 241 EKKKTEKRKRSKTSSEESSESEKSDEEEEEKESSPKPKKKKPLAVKKLSSDEESEESDVE 300

Query: 254 EL---------TLISKRLNRIWKHRQSKYKGSGKAKGKSESSGQKKSSLKEVTCFECKES 304
            L         TLIS   +   +  +S+     +   K ++  Q+ S           ES
Sbjct: 301 VLPQKKKRGAVTLISDSEDEKDQKSESEASDVEEKVSKKKAKKQESS-----------ES 349

Query: 305 GHYKSD--CPKLKKDKKPKKHFKTKKSLMVTFDESESEDVDSD-GESKDSWLLSKTKKQS 361
           G   S+      +K KK +K  K KK +++   + + E +D++  E +    L K +K+ 
Sbjct: 350 GSDSSEGSITVNRKSKKKEKPEKKKKGIIMDSSKLQKETIDAERAEKERRKRLEKKQKEF 409

Query: 362 QRELLTLTQNQKEILTQTMKMR 383
              +L   ++  E+LT T   R
Sbjct: 410 NGIVLEEGEDLTEMLTGTSSQR 431


>POL_MLVMO (P03355) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
            Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
            3.1.26.4) (RT); Integrase (IN)]
          Length = 1199

 Score = 50.8 bits (120), Expect = 3e-05
 Identities = 53/189 (28%), Positives = 82/189 (43%), Gaps = 21/189 (11%)

Query: 743  RLGHASMRKISQLSK--------LNLVRGLPNLKFASDALCEACQKGKFTKVPFKAKNVV 794
            +L H S  K+  L +        LN  R L N+       C+AC +   +K   K    V
Sbjct: 849  QLTHLSFSKMKALLERSHSPYYMLNRDRTLKNIT----ETCKACAQVNASKSAVKQGTRV 904

Query: 795  STSRPLELLHIDLFGPVKTESIGGKRYGMVIVDDYSRWTWVKFLTRKDESHVVFSTFIAQ 854
               RP     ID F  +K   + G +Y +V +D +S W    F T+K+ + VV    + +
Sbjct: 905  RGHRPGTHWEID-FTEIKP-GLYGYKYLLVFIDTFSGWIEA-FPTKKETAKVVTKKLLEE 961

Query: 855  VQNEKACRIVRVRSDHGGEF-----ESLFDSYGIAHDFSCPRTPQQNGVVERKNRTLQEM 909
            +        V + +D+G  F     +++ D  GI     C   PQ +G VER NRT++E 
Sbjct: 962  IFPRFGMPQV-LGTDNGPAFVSKVSQTVADLLGIDWKLHCAYRPQSSGQVERMNRTIKET 1020

Query: 910  ARTMLQETG 918
               +   TG
Sbjct: 1021 LTKLTLATG 1029


  Database: sprot
    Posted date:  Nov 25, 2004 10:54 AM
  Number of letters in database: 59,974,054
  Number of sequences in database:  164,201
  
Lambda     K      H
   0.323    0.137    0.404 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 172,961,635
Number of Sequences: 164201
Number of extensions: 7247375
Number of successful extensions: 32575
Number of sequences better than 10.0: 498
Number of HSP's better than 10.0 without gapping: 87
Number of HSP's successfully gapped in prelim test: 429
Number of HSP's that attempted gapping in prelim test: 29548
Number of HSP's gapped (non-prelim): 2104
length of query: 1562
length of database: 59,974,054
effective HSP length: 124
effective length of query: 1438
effective length of database: 39,613,130
effective search space: 56963680940
effective search space used: 56963680940
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 73 (32.7 bits)
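
The search-space and score figures in the footer above can be cross-checked with a few lines of arithmetic. The sketch below is a minimal illustration, not part of the BLAST output. It assumes the usual Karlin-Altschul relations: effective length = length minus effective HSP length (applied once to the query and once per database sequence), bit score = (lambda * S - ln K) / ln 2, and E = search space * 2^(-bit score). The function names are hypothetical. Lambda and K are the rounded gapped values printed in the footer, so results agree with the report only to about the printed precision.

```python
import math

# Values copied from the report footer.
QUERY_LEN = 1562          # length of query
DB_LETTERS = 59_974_054   # length of database
DB_SEQS = 164_201         # Number of Sequences
EFF_HSP_LEN = 124         # effective HSP length
GAPPED_LAMBDA = 0.267     # gapped Lambda (rounded as printed)
GAPPED_K = 0.0410         # gapped K (rounded as printed)


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space)."""
    eff_query = query_len - hsp_len
    # The HSP-length correction is subtracted once per database sequence.
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to bits."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, search_space):
    """Expected number of chance hits scoring at least raw_score."""
    return search_space * 2.0 ** (-bit_score(raw_score))


if __name__ == "__main__":
    eff_q, eff_db, space = effective_search_space(
        QUERY_LEN, DB_LETTERS, DB_SEQS, EFF_HSP_LEN
    )
    print(eff_q, eff_db, space)
    # Raw score 245 is the M820_ARATH hit (reported as 99.0 bits, 9e-20).
    print(round(bit_score(245), 1), f"{expect_value(245, space):.0e}")
```

With the footer values, the three search-space quantities reproduce the printed 1438, 39,613,130 and 56963680940 exactly. The bit score and E-value for raw score 245 come out close to the reported 99.0 bits and 9e-20.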

