Medicago
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= AC147714.1 - phase: 0 /pseudo
         (805 letters)

Database: sprot 
           164,201 sequences; 59,974,054 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III     97  1e-19
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran...    91  1e-17
POL_SOCMV (P15629) Enzymatic polyprotein [Contains: Aspartic pro...    82  4e-15
POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic pro...    79  4e-14
POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic pro...    79  5e-14
POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic pro...    77  2e-13
POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic pro...    76  3e-13
POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic pro...    75  5e-13
RRPO_OENBE (P31843) RNA-directed DNA polymerase homolog (Reverse...    75  7e-13
POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from tran...    72  6e-12
POL3_DROME (P04323) Retrovirus-related Pol polyprotein from tran...    71  1e-11
RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protei...    71  1e-11
RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protei...    71  1e-11
RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protei...    71  1e-11
POL2_DROME (P20825) Retrovirus-related Pol polyprotein from tran...    70  3e-11
POLY_DROME (P10401) Retrovirus-related Pol polyprotein from tran...    67  1e-10
GIS2_YEAST (P53849) Zinc-finger protein GIS2                           65  5e-10
CNBP_RAT (P62634) Cellular nucleic acid binding protein (CNBP) (...    62  8e-09
CNBP_HUMAN (P62633) Cellular nucleic acid binding protein (CNBP)...    62  8e-09
HEXP_LEIMA (Q04832) DNA-binding protein HEXBP (Hexamer-binding p...    61  1e-08

>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
          Length = 2186

 Score = 97.4 bits (241), Expect = 1e-19
 Identities = 48/131 (36%), Positives = 79/131 (59%), Gaps = 3/131 (2%)

Query: 669  VVCDFPEVFPDEIPDVPPEREVEFSIDLVPGTKPVSMAPYRISAS---ELKKQLEDLLEK 725
            V+  F +VF     ++      E  I+L  G +P+   P  I  +   E++K ++ +L +
Sbjct: 909  VIEQFQDVFAISDDELGRNSGTECVIELKEGAEPIRQKPRPIPLALKPEIRKMIQKMLNQ 968

Query: 726  KFVRPSVSPWGASVLLVKKKDGSMRLCIDYRQLNKVTIKNRYPLPRIDDLMDQLVGARVF 785
            K +R S SPW + V+LVKKKDGS+R+CIDYR++NKV   N +PLP I+  +  L G +++
Sbjct: 969  KVIRESKSPWSSPVVLVKKKDGSIRMCIDYRKVNKVVKNNAHPLPNIEATLQSLAGKKLY 1028

Query: 786  SKIDLRSGYHQ 796
            +  D+ +G+ Q
Sbjct: 1029 TVFDMIAGFWQ 1039



 Score = 39.3 bits (90), Expect = 0.042
 Identities = 29/114 (25%), Positives = 45/114 (39%), Gaps = 12/114 (10%)

Query: 393 NERKGKGQQSRPKPYSAPADKGKQKMVDVRRPKKKDAAEIVCFNCGEKGHKSNACPEEIK 452
           N R      S+    ++     +Q    +  P+K       C +C ++G     C ++ K
Sbjct: 504 NSRNSWNNNSQNNSAASQNISREQSWKTISVPQKHQNPSDRCSDCQQRGWHMFWCSKKSK 563

Query: 453 -----KCVRCGKKGHVVADCNRT-DIVCFNCNGEGHISSQC------TQPKRAP 494
                KC  C + G  +A C +  +  CF CN  GHI+  C      T  K AP
Sbjct: 564 DNASQKCDECQQSGWHMASCFKLKNRACFRCNEMGHIAWNCPKKNENTSEKEAP 617


>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
           transposon 412 [Contains: Protease (EC 3.4.23.-);
           Reverse transcriptase (EC 2.7.7.49); Endonuclease]
          Length = 1237

 Score = 90.9 bits (224), Expect = 1e-17
 Identities = 58/160 (36%), Positives = 90/160 (56%), Gaps = 25/160 (15%)

Query: 660 NQAVIDRLPVVCDFPEVFPDEIPDVPPEREVEFSIDLVPGT--------------KPVSM 705
           N+ V+ +L    +FPE+F  ++ ++  E    F+++  P T              +PV  
Sbjct: 260 NKTVLSQLKK--NFPELFKSQLENICSEYIDIFALESEPITVNNLYKQQLRLKDDEPVYT 317

Query: 706 APYRISAS---ELKKQLEDLLEKKFVRPSVSPWGASVLLVKKKDG------SMRLCIDYR 756
             YR   S   E++ Q++ L++ K V PSVS + + +LLV KK          RL IDYR
Sbjct: 318 KNYRSPHSQVEEIQAQVQKLIKDKIVEPSVSQYNSPLLLVPKKSSPNSDKKKWRLVIDYR 377

Query: 757 QLNKVTIKNRYPLPRIDDLMDQLVGARVFSKIDLRSGYHQ 796
           Q+NK  + +++PLPRIDD++DQL  A+ FS +DL SG+HQ
Sbjct: 378 QINKKLLADKFPLPRIDDILDQLGRAKYFSCLDLMSGFHQ 417


>POL_SOCMV (P15629) Enzymatic polyprotein [Contains: Aspartic
           protease (EC 3.4.23.-); Endonuclease; Reverse
           transcriptase (EC 2.7.7.49)]
          Length = 692

 Score = 82.4 bits (202), Expect = 4e-15
 Identities = 74/293 (25%), Positives = 140/293 (47%), Gaps = 30/293 (10%)

Query: 520 INNTPLVAIIDTGATHCFIAFDCVSALGLDLSDMNGEMVVETPAKGSVTTSLVCLKCPLS 579
           I     +A IDTGAT CF      +    ++     E+++   +K  +  ++  +   L 
Sbjct: 26  IGKRNFLAYIDTGATLCFGKRKISN--NWEILKQPKEIIIADKSKHYIREAISNVF--LK 81

Query: 580 MFGRDFEMDLVCLPLSGMDVILGMNWLE-YNHVLINCFSKSVHFSSVEEESGAEFLSTKQ 638
           +  ++F + ++ L  SG+D+I+G N+L+ Y   +    +  + + ++     ++ +STK 
Sbjct: 82  IENKEFLIPIIYLHDSGLDLIIGNNFLKLYQPFIQRLETIELRWKNLNNPKESQMISTKI 141

Query: 639 LKQ-------LERDGILMFSLMATLSIENQAVIDRLPVVCDFPEVFPDEIPDVPPEREVE 691
           L +        E+  I +   +   +IE Q     L  VC         + +   +  + 
Sbjct: 142 LTKNEVLKLSFEKIHICLEKYLFFKTIEEQ-----LEEVCS-----EHPLDETKNKNGLL 191

Query: 692 FSIDLVPGTKPVSMA---PYRI-SASELKKQLEDLLEKKFVRPSVSPWGASVLLVKK--- 744
             I L    + +++    PY I    E K++ EDLL+K  +R S SP  A    V+    
Sbjct: 192 IEIRLKDPLQEINVTNRIPYTIRDVQEFKEECEDLLKKGLIRESQSPHSAPAFYVENHNE 251

Query: 745 -KDGSMRLCIDYRQLNKVTIKNRYPLPRIDDLMDQLVGARVFSKIDLRSGYHQ 796
            K G  R+ I+Y+++N+ TI + Y LPR D +++++ G+  FS +D +SGY+Q
Sbjct: 252 IKRGKRRMVINYKKMNEATIGDSYKLPRKDFILEKIKGSLWFSSLDAKSGYYQ 304


>POL_CAMVS (P03554) Enzymatic polyprotein [Contains: Aspartic
           protease (EC 3.4.23.-); Endonuclease; Reverse
           transcriptase (EC 2.7.7.49)]
          Length = 679

 Score = 79.3 bits (194), Expect = 4e-14
 Identities = 87/347 (25%), Positives = 155/347 (44%), Gaps = 56/347 (16%)

Query: 502 LTGTQTENEDRL---------IRGTCYINNTPLVAI---IDTGATHCFIAFDCVSALGLD 549
           L  TQT+ E  +         I+G  Y      + +   +DTGA+ C  +   +     +
Sbjct: 5   LLKTQTQTEQVMNVTNPNSIYIKGRLYFKGYKKIELHCFVDTGASLCIASKFVIP----E 60

Query: 550 LSDMNGE--MVVETPAKGSVTTSLVCLKCPLSMFGRDFEMDLVCLPLSGMDVILGMNWLE 607
              +N E  ++V+     S+T S VC    L + G  F +  V    SG+D I+G N+ +
Sbjct: 61  EHWVNAERPIMVKIADGSSITISKVCKDIDLIIAGEIFRIPTVYQQESGIDFIIGNNFCQ 120

Query: 608 YNHVLIN-----CFSKS----VHFSSVEE--ESGAE-FLST--KQLKQLERDGILMFSLM 653
                I       F+K+    VH + +      G E FL +  K+ K  + + + + +  
Sbjct: 121 LYEPFIQFTDRVIFTKNKSYPVHIAKLTRAVRVGTEGFLESMKKRSKTQQPEPVNISTNK 180

Query: 654 ATLSIENQAVID---------------RLPVVCDFPEVFPDEIPDVPPERE--VEFSIDL 696
               +E  A++                R+  + +  E    E P  P + +  ++ SI L
Sbjct: 181 IENPLEEIAILSEGRRLSEEKLFITQQRMQKIEELLEKVCSENPLDPNKTKQWMKASIKL 240

Query: 697 VPGTKPVSMAPYRISA---SELKKQLEDLLEKKFVRPSVSPWGASVLLV----KKKDGSM 749
              +K + + P + S     E  KQ+++LL+ K ++PS SP  A   LV    +K+ G  
Sbjct: 241 SDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKK 300

Query: 750 RLCIDYRQLNKVTIKNRYPLPRIDDLMDQLVGARVFSKIDLRSGYHQ 796
           R+ ++Y+ +NK T+ + Y LP  D+L+  + G ++FS  D +SG+ Q
Sbjct: 301 RMVVNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQ 347


>POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic
           protease (EC 3.4.23.-); Endonuclease; Reverse
           transcriptase (EC 2.7.7.49)]
          Length = 679

 Score = 79.0 bits (193), Expect = 5e-14
 Identities = 83/326 (25%), Positives = 149/326 (45%), Gaps = 47/326 (14%)

Query: 514 IRGTCYINNTPLVAI---IDTGATHCFIAFDCVSALGLDLSDMNGE--MVVETPAKGSVT 568
           I+G  Y      + +   +DTGA+ C  +   +     +   +N E  ++V+     S+T
Sbjct: 26  IKGRLYFKGYKKIELHCFVDTGASLCIASKFVIP----EEHWVNAERPIMVKIADGSSIT 81

Query: 569 TSLVCLKCPLSMFGRDFEMDLVCLPLSGMDVILGMNWLEYNHVLIN-----CFSKS---- 619
            S VC    L + G  F++  V    SG+D I+G N+ +     I       F+K+    
Sbjct: 82  ISKVCKDIDLIIAGEIFKIPTVYQQESGIDFIIGNNFCQLYEPFIQFTDRVIFTKNKSYP 141

Query: 620 VHFSSVEE--ESGAE-FLST--KQLKQLERDGILMFSLMATLSIENQAVID--------- 665
           VH + +      G E FL +  K+ K  + + + + +      +E  A++          
Sbjct: 142 VHITKLTRAVRVGIEGFLESMKKRSKTQQPEPVNISTNKIENPLEEIAILSEGRRLSEEK 201

Query: 666 ------RLPVVCDFPEVFPDEIPDVPPERE--VEFSIDLVPGTKPVSMAPYRISA---SE 714
                 R+  + +  E    E P  P + +  ++ SI L   +K + + P + S     E
Sbjct: 202 LFITQQRMQKIEELLEKVCSENPLDPNKTKQWMKASIKLSDPSKAIKVKPMKYSPMDREE 261

Query: 715 LKKQLEDLLEKKFVRPSVSPWGASVLLV----KKKDGSMRLCIDYRQLNKVTIKNRYPLP 770
             KQ+++LL+ K ++PS SP  A   LV    +K+ G  R+ ++Y+ +NK TI + Y LP
Sbjct: 262 FDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATIGDAYNLP 321

Query: 771 RIDDLMDQLVGARVFSKIDLRSGYHQ 796
             D+L+  + G ++FS  D +SG+ Q
Sbjct: 322 NKDELLTLIRGKKIFSSFDCKSGFWQ 347


>POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic
           protease (EC 3.4.23.-); Endonuclease; Reverse
           transcriptase (EC 2.7.7.49)]
          Length = 674

 Score = 77.0 bits (188), Expect = 2e-13
 Identities = 81/319 (25%), Positives = 144/319 (44%), Gaps = 40/319 (12%)

Query: 514 IRGTCYINNTPLVAI---IDTGATHCFIAFDCVSALGLDLSDMNGE--MVVETPAKGSVT 568
           I+G  Y      + +   +DTGA+ C  +   +     +   +N E  ++V+     S+T
Sbjct: 28  IKGRLYFKGYKKIELHCFVDTGASLCIASKFVIP----EEHWINAERPIMVKIADGSSIT 83

Query: 569 TSLVCLKCPLSMFGRDFEMDLVCLPLSGMDVILGMNWLEYNHVLIN-----CFSKS---- 619
            + VC    L + G  F +  V    SG+D I+G N+ +     I       F+K     
Sbjct: 84  INKVCRDIDLIIAGEIFHIPTVYQQESGIDFIIGNNFCQLYEPFIQFTDRVIFTKDRTYP 143

Query: 620 VHFSSVEE----------ESGAEFLSTKQLK--QLERDGILMFSLMATLSIENQAVID-R 666
           VH + +            ES  +   T+Q +   +  + I + S    LS E   +   R
Sbjct: 144 VHIAKLTRAVRVGTEGFLESMKKRSKTQQPEPVNISTNKIAILSEGRRLSEEKLFITQQR 203

Query: 667 LPVVCDFPEVFPDEIPDVPPERE--VEFSIDLVPGTKPVSMAPYRISA---SELKKQLED 721
           +  + +  E    E P  P + +  ++ SI L   +K + + P + S     E  KQ+++
Sbjct: 204 MQKIEELLEKVCSENPLDPNKTKQWMKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKE 263

Query: 722 LLEKKFVRPSVSPWGASVLLV----KKKDGSMRLCIDYRQLNKVTIKNRYPLPRIDDLMD 777
           LL+ K ++PS SP  A   LV    +K+ G  R+ ++Y+ +NK T+ + Y  P  D+L+ 
Sbjct: 264 LLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATVGDAYNPPNKDELLT 323

Query: 778 QLVGARVFSKIDLRSGYHQ 796
            + G ++FS  D +SG+ Q
Sbjct: 324 LIRGKKIFSSFDCKSGFWQ 342


>POL_CAMVE (Q02964) Enzymatic polyprotein [Contains: Aspartic
           protease (EC 3.4.23.-); Endonuclease; Reverse
           transcriptase (EC 2.7.7.49)]
          Length = 679

 Score = 76.3 bits (186), Expect = 3e-13
 Identities = 88/348 (25%), Positives = 157/348 (44%), Gaps = 58/348 (16%)

Query: 502 LTGTQTENEDRL---------IRGTCYINNTPLVAI---IDTGATHCFIAFDCVSALGLD 549
           L  TQT+ E  +         I+G  Y      + +   +DTGA+ C  +   +     +
Sbjct: 5   LLKTQTQTEQVMNVTNPNSIYIKGRLYFKGYKKIELHCFVDTGASLCIASKFVIP----E 60

Query: 550 LSDMNGE--MVVETPAKGSVTTSLVCLKCPLSMFGRDFEMDLVCLPLSGMDVILGMNWLE 607
              +N E  ++V+     S+T S VC    L +    F++  V    SG+D I+G N+ +
Sbjct: 61  EHWVNAERPIMVKIADGSSITISKVCKDIDLIIAREIFKIPTVYQQESGIDFIIGNNFCQ 120

Query: 608 YNHVLIN-----CFSKS----VHFS---------------SVEEESGAEF-----LSTKQ 638
                I       F+K+    VH +               S+++ S  +      +ST +
Sbjct: 121 LYEPFIQFTDRVIFTKNKSYPVHIAKLTRAVRVGTEGFLESMKKRSKTQQPEPVNISTNK 180

Query: 639 LKQLERDGILMFSLMATLSIENQAVID-RLPVVCDFPEVFPDEIPDVPPERE--VEFSID 695
           ++   ++ I + S    LS E   +   R+  + +  E    E P  P + +  ++ SI 
Sbjct: 181 IENPLKE-IAILSEGRRLSEEKLFITQQRMQKIEELLEKVCSENPLDPNKTKQWMKASIK 239

Query: 696 LVPGTKPVSMAPYRISA---SELKKQLEDLLEKKFVRPSVSPWGASVLLV----KKKDGS 748
           L   +K + + P + S     E  KQ+++LL+ K ++PS SP  A   LV    +K+ G 
Sbjct: 240 LSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGK 299

Query: 749 MRLCIDYRQLNKVTIKNRYPLPRIDDLMDQLVGARVFSKIDLRSGYHQ 796
            R+ ++Y+ +NK TI + Y LP  D+L+  + G ++FS  D +SG+ Q
Sbjct: 300 KRMVVNYKAMNKATIGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQ 347


>POL_CAMVN (Q00962) Enzymatic polyprotein [Contains: Aspartic
           protease (EC 3.4.23.-); Endonuclease; Reverse
           transcriptase (EC 2.7.7.49)]
          Length = 680

 Score = 75.5 bits (184), Expect = 5e-13
 Identities = 86/347 (24%), Positives = 154/347 (43%), Gaps = 56/347 (16%)

Query: 502 LTGTQTENEDRL---------IRGTCYINNTPLVAI---IDTGATHCFIAFDCVSALGLD 549
           L  TQT+ E  +         I+G  Y      + +   +DTGA+ C  +   +     +
Sbjct: 6   LLKTQTQTEQVMNVTNPNSIYIKGRLYFKGYKKIELHCFVDTGASLCIASKFVIP----E 61

Query: 550 LSDMNGE--MVVETPAKGSVTTSLVCLKCPLSMFGRDFEMDLVCLPLSGMDVILGMNWLE 607
              +N E  ++V+     S+T S VC    L + G  F++  V    SG+D I+G N+ +
Sbjct: 62  EHWVNAERPIMVKIADGSSITISKVCKDIDLIIVGVIFKIPTVYQQESGIDFIIGNNFCQ 121

Query: 608 YNHVLIN-----CFSKS----VHFSSVEE--ESGAE-FLST--KQLKQLERDGILMFSLM 653
                I       F+K+    VH + +      G E FL +  K+ K  + + + + +  
Sbjct: 122 LYEPFIQFTDRVIFTKNKSYPVHIAKLTRAVRVGTEGFLESMKKRSKTQQPEPVNISTNK 181

Query: 654 ATLSIENQAVID---------------RLPVVCDFPEVFPDEIPDVPPERE--VEFSIDL 696
               +E  A++                R+    +  E    E P  P + +  ++ SI L
Sbjct: 182 IENPLEEIAILSEGRRLSEEKLFITQQRMQKTEELLEKVCSENPLDPNKTKQWMKASIKL 241

Query: 697 VPGTKPVSMAPYRISA---SELKKQLEDLLEKKFVRPSVSPWGASVLLVKKKD----GSM 749
              +K + + P + S     E  KQ+++LL+ K ++PS SP  A   LV  +     G+ 
Sbjct: 242 SDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAENGRGNK 301

Query: 750 RLCIDYRQLNKVTIKNRYPLPRIDDLMDQLVGARVFSKIDLRSGYHQ 796
           R+ ++Y+ +NK T+ + Y LP  D+L+  + G ++FS  D +SG+ Q
Sbjct: 302 RMVVNYKAMNKATVGDAYNLPNKDELLTLIRGKKIFSSFDCKSGFWQ 348


>RRPO_OENBE (P31843) RNA-directed DNA polymerase homolog (Reverse
           transcriptase homolog)
          Length = 142

 Score = 75.1 bits (183), Expect = 7e-13
 Identities = 33/49 (67%), Positives = 41/49 (83%)

Query: 748 SMRLCIDYRQLNKVTIKNRYPLPRIDDLMDQLVGARVFSKIDLRSGYHQ 796
           S+R+CIDYR L KVTIKN+YP+PR+DDL D+L  A  F+K+DLRSGY Q
Sbjct: 5   SLRMCIDYRALTKVTIKNKYPIPRVDDLFDRLAQATWFTKLDLRSGYWQ 53


>POL5_DROME (Q8I7P9) Retrovirus-related Pol polyprotein from
           transposon opus [Contains: Protease (EC 3.4.23.-);
           Reverse transcriptase (EC 2.7.7.49); Endonuclease]
          Length = 1003

 Score = 72.0 bits (175), Expect = 6e-12
 Identities = 40/132 (30%), Positives = 75/132 (56%), Gaps = 7/132 (5%)

Query: 672 DFPEVFPDEIPDVPPEREVEFSIDLVPGTKPVSMA-PYRISA-SELKKQLEDLLEKKFVR 729
           +FP +F   +  +  E  V+  I         + + PY ++   E+++Q+++LL+   +R
Sbjct: 94  EFPRIFEPPLSGMSVETAVKAEIRTNTQDPIYAKSYPYPVNMRGEVERQIDELLQDGIIR 153

Query: 730 PSVSPWGASVLLVKKK-----DGSMRLCIDYRQLNKVTIKNRYPLPRIDDLMDQLVGARV 784
           PS SP+ + + +V KK     +   R+ +D+++LN VTI + YP+P I+  +  L  A+ 
Sbjct: 154 PSNSPYNSPIWIVPKKPKPNGEKQYRMVVDFKRLNTVTIPDTYPIPDINATLASLGNAKY 213

Query: 785 FSKIDLRSGYHQ 796
           F+ +DL SG+HQ
Sbjct: 214 FTTLDLTSGFHQ 225


>POL3_DROME (P04323) Retrovirus-related Pol polyprotein from
           transposon 17.6 [Contains: Protease (EC 3.4.23.-);
           Reverse transcriptase (EC 2.7.7.49); Endonuclease]
          Length = 1058

 Score = 71.2 bits (173), Expect = 1e-11
 Identities = 33/88 (37%), Positives = 60/88 (67%), Gaps = 5/88 (5%)

Query: 714 ELKKQLEDLLEKKFVRPSVSPWGASVLLV-KKKDGS----MRLCIDYRQLNKVTIKNRYP 768
           E++ Q++D+L +  +R S SP+ + + +V KK+D S     R+ IDYR+LN++T+ +R+P
Sbjct: 222 EVESQIQDMLNQGIIRTSNSPYNSPIWVVPKKQDASGKQKFRIVIDYRKLNEITVGDRHP 281

Query: 769 LPRIDDLMDQLVGARVFSKIDLRSGYHQ 796
           +P +D+++ +L     F+ IDL  G+HQ
Sbjct: 282 IPNMDEILGKLGRCNYFTTIDLAKGFHQ 309


>RT23_SCHPO (Q9UR07) Retrotransposable element Tf2 155 kDa protein
           type 3
          Length = 1333

 Score = 70.9 bits (172), Expect = 1e-11
 Identities = 34/113 (30%), Positives = 66/113 (58%), Gaps = 3/113 (2%)

Query: 686 PEREVEFSIDLVPGTKPVSMAPYRISASELKKQLEDL---LEKKFVRPSVSPWGASVLLV 742
           P + +EF ++L      + +  Y +   +++   +++   L+   +R S +     V+ V
Sbjct: 396 PIKGLEFEVELTQENYRLPIRNYPLPPGKMQAMNDEINQGLKSGIIRESKAINACPVMFV 455

Query: 743 KKKDGSMRLCIDYRQLNKVTIKNRYPLPRIDDLMDQLVGARVFSKIDLRSGYH 795
            KK+G++R+ +DY+ LNK    N YPLP I+ L+ ++ G+ +F+K+DL+S YH
Sbjct: 456 PKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYH 508


>RT22_SCHPO (Q9C0R2) Retrotransposable element Tf2 155 kDa protein
           type 2
          Length = 1333

 Score = 70.9 bits (172), Expect = 1e-11
 Identities = 34/113 (30%), Positives = 66/113 (58%), Gaps = 3/113 (2%)

Query: 686 PEREVEFSIDLVPGTKPVSMAPYRISASELKKQLEDL---LEKKFVRPSVSPWGASVLLV 742
           P + +EF ++L      + +  Y +   +++   +++   L+   +R S +     V+ V
Sbjct: 396 PIKGLEFEVELTQENYRLPIRNYPLPPGKMQAMNDEINQGLKSGIIRESKAINACPVMFV 455

Query: 743 KKKDGSMRLCIDYRQLNKVTIKNRYPLPRIDDLMDQLVGARVFSKIDLRSGYH 795
            KK+G++R+ +DY+ LNK    N YPLP I+ L+ ++ G+ +F+K+DL+S YH
Sbjct: 456 PKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYH 508


>RT21_SCHPO (Q05654) Retrotransposable element Tf2 155 kDa protein
           type 1
          Length = 1333

 Score = 70.9 bits (172), Expect = 1e-11
 Identities = 34/113 (30%), Positives = 66/113 (58%), Gaps = 3/113 (2%)

Query: 686 PEREVEFSIDLVPGTKPVSMAPYRISASELKKQLEDL---LEKKFVRPSVSPWGASVLLV 742
           P + +EF ++L      + +  Y +   +++   +++   L+   +R S +     V+ V
Sbjct: 396 PIKGLEFEVELTQENYRLPIRNYPLPPGKMQAMNDEINQGLKSGIIRESKAINACPVMFV 455

Query: 743 KKKDGSMRLCIDYRQLNKVTIKNRYPLPRIDDLMDQLVGARVFSKIDLRSGYH 795
            KK+G++R+ +DY+ LNK    N YPLP I+ L+ ++ G+ +F+K+DL+S YH
Sbjct: 456 PKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYH 508


>POL2_DROME (P20825) Retrovirus-related Pol polyprotein from
           transposon 297 [Contains: Protease (EC 3.4.23.-);
           Reverse transcriptase (EC 2.7.7.49); Endonuclease]
          Length = 1059

 Score = 69.7 bits (169), Expect = 3e-11
 Identities = 32/88 (36%), Positives = 58/88 (65%), Gaps = 5/88 (5%)

Query: 714 ELKKQLEDLLEKKFVRPSVSPWGASVLLVKKKDGSM-----RLCIDYRQLNKVTIKNRYP 768
           E++ Q++++L +  +R S SP+ +   +V KK  +      R+ IDYR+LN++TI +RYP
Sbjct: 221 EVENQVQEMLNQGLIRESNSPYNSPTWVVPKKPDASGANKYRVVIDYRKLNEITIPDRYP 280

Query: 769 LPRIDDLMDQLVGARVFSKIDLRSGYHQ 796
           +P +D+++ +L   + F+ IDL  G+HQ
Sbjct: 281 IPNMDEILGKLGKCQYFTTIDLAKGFHQ 308


>POLY_DROME (P10401) Retrovirus-related Pol polyprotein from
           transposon gypsy [Contains: Reverse transcriptase (EC
           2.7.7.49); Endonuclease]
          Length = 1035

 Score = 67.4 bits (163), Expect = 1e-10
 Identities = 54/211 (25%), Positives = 99/211 (46%), Gaps = 19/211 (9%)

Query: 594 LSGMDVILGMNWLEYNHVLINCFSKSVHFSSVEEESGAEFLSTKQLKQLERDGILMFSLM 653
           L+  D I+G++ L    V +N    S+ +  + E+    + S   +          F+ +
Sbjct: 85  LNAFDAIIGLDLLTQAGVKLNLAEDSLEYQGIAEK--LHYFSCPSVN---------FTDV 133

Query: 654 ATLSIENQAVIDRLPVVCDFPEVFPDEIPDVPPEREVEFSIDLVPGTKPVSMA-PYRISA 712
             + + +    +    +    + F      +P    V  +I  V      S A P  +  
Sbjct: 134 NDIVVPDSVKKEFKDTIIRRKKAFSTTNEALPFNTAVTATIRTVDNEPVYSRAYPTLMGV 193

Query: 713 SE-LKKQLEDLLEKKFVRPSVSPWGASVLLVKKK------DGSMRLCIDYRQLNKVTIKN 765
           S+ +  +++ LL+   +RPS SP+ +   +V KK      + + RL ID+R+LN+ TI +
Sbjct: 194 SDFVNNEVKQLLKDGIIRPSRSPYNSPTWVVDKKGTDAFGNPNKRLVIDFRKLNEKTIPD 253

Query: 766 RYPLPRIDDLMDQLVGARVFSKIDLRSGYHQ 796
           RYP+P I  ++  L  A+ F+ +DL+SGYHQ
Sbjct: 254 RYPMPSIPMILANLGKAKFFTTLDLKSGYHQ 284


>GIS2_YEAST (P53849) Zinc-finger protein GIS2
          Length = 153

 Score = 65.5 bits (158), Expect = 5e-10
 Identities = 30/74 (40%), Positives = 42/74 (56%), Gaps = 6/74 (8%)

Query: 430 AEIVCFNCGEKGHKSNACPE----EIKKCVRCGKKGHVVADCNRTDIVCFNCNGEGHISS 485
           +E +C+NC + GH    C      E K+C  CG+ GHV ++C  T   CFNCN  GHIS 
Sbjct: 21  SERLCYNCNKPGHVQTDCTMPRTVEFKQCYNCGETGHVRSEC--TVQRCFNCNQTGHISR 78

Query: 486 QCTQPKRAPTTGRV 499
           +C +PK+     +V
Sbjct: 79  ECPEPKKTSRFSKV 92



 Score = 57.4 bits (137), Expect = 1e-07
 Identities = 27/69 (39%), Positives = 38/69 (54%), Gaps = 6/69 (8%)

Query: 424 PKKKDA-AEIVCFNCGEKGHKSNACPEEIK----KCVRCGKKGHVVADCNRTDIVCFNCN 478
           PKK    +++ C+ CG   H +  C +E      KC  CG+ GH+  DC + D +C+NCN
Sbjct: 83  PKKTSRFSKVSCYKCGGPNHMAKDCMKEDGISGLKCYTCGQAGHMSRDC-QNDRLCYNCN 141

Query: 479 GEGHISSQC 487
             GHIS  C
Sbjct: 142 ETGHISKDC 150



 Score = 49.3 bits (116), Expect = 4e-05
 Identities = 18/40 (45%), Positives = 27/40 (67%), Gaps = 1/40 (2%)

Query: 452 KKCVRCGKKGHVVADCNRTDIVCFNCNGEGHISSQCTQPK 491
           K C  CGK GH+  DC+ ++ +C+NCN  GH+ + CT P+
Sbjct: 4   KACYVCGKIGHLAEDCD-SERLCYNCNKPGHVQTDCTMPR 42


>CNBP_RAT (P62634) Cellular nucleic acid binding protein (CNBP)
           (Zinc finger protein 9)
          Length = 177

 Score = 61.6 bits (148), Expect = 8e-09
 Identities = 32/82 (39%), Positives = 41/82 (49%), Gaps = 7/82 (8%)

Query: 425 KKKDAAEIVCFNCGEKGHKSNACPEEIKK----CVRCGKKGHVVADCNRTD-IVCFNCNG 479
           K  D  E  C+NCG  GH +  C E  ++    C  CGK GH+  DC+  D   C++C  
Sbjct: 65  KDCDLQEDACYNCGRGGHIAKDCKEPKREREQCCYNCGKPGHLARDCDHADEQKCYSCGE 124

Query: 480 EGHISSQCTQPK--RAPTTGRV 499
            GHI   CT+ K  R   TG V
Sbjct: 125 FGHIQKDCTKVKCYRCGETGHV 146



 Score = 60.1 bits (144), Expect = 2e-08
 Identities = 25/61 (40%), Positives = 38/61 (61%), Gaps = 3/61 (4%)

Query: 429 AAEIVCFNCGEKGHKSNACPEEIKKCVRCGKKGHVVADCNRT-DIVCFNCNGEGHISSQC 487
           A E  C++CGE GH    C +   KC RCG+ GHV  +C++T ++ C+ C   GH++ +C
Sbjct: 114 ADEQKCYSCGEFGHIQKDCTKV--KCYRCGETGHVAINCSKTSEVNCYRCGESGHLAREC 171

Query: 488 T 488
           T
Sbjct: 172 T 172



 Score = 58.5 bits (140), Expect = 7e-08
 Identities = 25/71 (35%), Positives = 41/71 (57%), Gaps = 5/71 (7%)

Query: 420 DVRRPKKKDAAEIVCFNCGEKGHKSNACPE-EIKKCVRCGKKGHVVADCNRTDIVCFNCN 478
           D + PK++   E  C+NCG+ GH +  C   + +KC  CG+ GH+  DC  T + C+ C 
Sbjct: 86  DCKEPKRE--REQCCYNCGKPGHLARDCDHADEQKCYSCGEFGHIQKDC--TKVKCYRCG 141

Query: 479 GEGHISSQCTQ 489
             GH++  C++
Sbjct: 142 ETGHVAINCSK 152



 Score = 56.6 bits (135), Expect = 3e-07
 Identities = 26/87 (29%), Positives = 35/87 (39%), Gaps = 28/87 (32%)

Query: 434 CFNCGEKGHKSNACPEEIKK----------------------------CVRCGKKGHVVA 465
           CF CG  GH +  CP    +                            C RCG+ GH+  
Sbjct: 6   CFKCGRSGHWARECPTGGGRGRGMRSRGRGGFTSDRGFQFVSSSLPDICYRCGESGHLAK 65

Query: 466 DCNRTDIVCFNCNGEGHISSQCTQPKR 492
           DC+  +  C+NC   GHI+  C +PKR
Sbjct: 66  DCDLQEDACYNCGRGGHIAKDCKEPKR 92



 Score = 56.2 bits (134), Expect = 3e-07
 Identities = 27/96 (28%), Positives = 45/96 (46%), Gaps = 13/96 (13%)

Query: 396 KGKGQQSRPKPYSAPADKGKQKMVDVRRPKKKDAAEIVCFNCGEKGHKSNACPEEIKKCV 455
           +G+G +SR +     +D+G Q +          +   +C+ CGE GH +  C  +   C 
Sbjct: 25  RGRGMRSRGRG-GFTSDRGFQFV--------SSSLPDICYRCGESGHLAKDCDLQEDACY 75

Query: 456 RCGKKGHVVADC----NRTDIVCFNCNGEGHISSQC 487
            CG+ GH+  DC       +  C+NC   GH++  C
Sbjct: 76  NCGRGGHIAKDCKEPKREREQCCYNCGKPGHLARDC 111



 Score = 43.9 bits (102), Expect = 0.002
 Identities = 16/43 (37%), Positives = 26/43 (60%), Gaps = 1/43 (2%)

Query: 426 KKDAAEIVCFNCGEKGHKSNACPEEIK-KCVRCGKKGHVVADC 467
           +KD  ++ C+ CGE GH +  C +  +  C RCG+ GH+  +C
Sbjct: 129 QKDCTKVKCYRCGETGHVAINCSKTSEVNCYRCGESGHLAREC 171


>CNBP_HUMAN (P62633) Cellular nucleic acid binding protein (CNBP)
           (Zinc finger protein 9)
          Length = 177

 Score = 61.6 bits (148), Expect = 8e-09
 Identities = 32/82 (39%), Positives = 41/82 (49%), Gaps = 7/82 (8%)

Query: 425 KKKDAAEIVCFNCGEKGHKSNACPEEIKK----CVRCGKKGHVVADCNRTD-IVCFNCNG 479
           K  D  E  C+NCG  GH +  C E  ++    C  CGK GH+  DC+  D   C++C  
Sbjct: 65  KDCDLQEDACYNCGRGGHIAKDCKEPKREREQCCYNCGKPGHLARDCDHADEQKCYSCGE 124

Query: 480 EGHISSQCTQPK--RAPTTGRV 499
            GHI   CT+ K  R   TG V
Sbjct: 125 FGHIQKDCTKVKCYRCGETGHV 146



 Score = 60.1 bits (144), Expect = 2e-08
 Identities = 25/61 (40%), Positives = 38/61 (61%), Gaps = 3/61 (4%)

Query: 429 AAEIVCFNCGEKGHKSNACPEEIKKCVRCGKKGHVVADCNRT-DIVCFNCNGEGHISSQC 487
           A E  C++CGE GH    C +   KC RCG+ GHV  +C++T ++ C+ C   GH++ +C
Sbjct: 114 ADEQKCYSCGEFGHIQKDCTKV--KCYRCGETGHVAINCSKTSEVNCYRCGESGHLAREC 171

Query: 488 T 488
           T
Sbjct: 172 T 172



 Score = 58.5 bits (140), Expect = 7e-08
 Identities = 25/71 (35%), Positives = 41/71 (57%), Gaps = 5/71 (7%)

Query: 420 DVRRPKKKDAAEIVCFNCGEKGHKSNACPE-EIKKCVRCGKKGHVVADCNRTDIVCFNCN 478
           D + PK++   E  C+NCG+ GH +  C   + +KC  CG+ GH+  DC  T + C+ C 
Sbjct: 86  DCKEPKRE--REQCCYNCGKPGHLARDCDHADEQKCYSCGEFGHIQKDC--TKVKCYRCG 141

Query: 479 GEGHISSQCTQ 489
             GH++  C++
Sbjct: 142 ETGHVAINCSK 152



 Score = 56.6 bits (135), Expect = 3e-07
 Identities = 26/87 (29%), Positives = 35/87 (39%), Gaps = 28/87 (32%)

Query: 434 CFNCGEKGHKSNACPEEIKK----------------------------CVRCGKKGHVVA 465
           CF CG  GH +  CP    +                            C RCG+ GH+  
Sbjct: 6   CFKCGRSGHWARECPTGGGRGRGMRSRGRGGFTSDRGFQFVSSSLPDICYRCGESGHLAK 65

Query: 466 DCNRTDIVCFNCNGEGHISSQCTQPKR 492
           DC+  +  C+NC   GHI+  C +PKR
Sbjct: 66  DCDLQEDACYNCGRGGHIAKDCKEPKR 92



 Score = 56.2 bits (134), Expect = 3e-07
 Identities = 27/96 (28%), Positives = 45/96 (46%), Gaps = 13/96 (13%)

Query: 396 KGKGQQSRPKPYSAPADKGKQKMVDVRRPKKKDAAEIVCFNCGEKGHKSNACPEEIKKCV 455
           +G+G +SR +     +D+G Q +          +   +C+ CGE GH +  C  +   C 
Sbjct: 25  RGRGMRSRGRG-GFTSDRGFQFV--------SSSLPDICYRCGESGHLAKDCDLQEDACY 75

Query: 456 RCGKKGHVVADC----NRTDIVCFNCNGEGHISSQC 487
            CG+ GH+  DC       +  C+NC   GH++  C
Sbjct: 76  NCGRGGHIAKDCKEPKREREQCCYNCGKPGHLARDC 111



 Score = 43.9 bits (102), Expect = 0.002
 Identities = 16/43 (37%), Positives = 26/43 (60%), Gaps = 1/43 (2%)

Query: 426 KKDAAEIVCFNCGEKGHKSNACPEEIK-KCVRCGKKGHVVADC 467
           +KD  ++ C+ CGE GH +  C +  +  C RCG+ GH+  +C
Sbjct: 129 QKDCTKVKCYRCGETGHVAINCSKTSEVNCYRCGESGHLAREC 171


>HEXP_LEIMA (Q04832) DNA-binding protein HEXBP (Hexamer-binding
           protein)
          Length = 271

 Score = 61.2 bits (147), Expect = 1e-08
 Identities = 28/76 (36%), Positives = 35/76 (45%), Gaps = 14/76 (18%)

Query: 426 KKDAAEIVCFNCGEKGHKSNACPEEIKK-------CVRCGKKGHVVADCNRT-------D 471
           K D     CF CGE+GH S  CP E +        C RCG+ GH+  DC  +        
Sbjct: 37  KGDERSTTCFRCGEEGHMSRECPNEARSGAAGAMTCFRCGEAGHMSRDCPNSAKPGAAKG 96

Query: 472 IVCFNCNGEGHISSQC 487
             C+ C  EGH+S  C
Sbjct: 97  FECYKCGQEGHLSRDC 112



 Score = 57.8 bits (138), Expect = 1e-07
 Identities = 28/82 (34%), Positives = 41/82 (49%), Gaps = 16/82 (19%)

Query: 420 DVRRPKKKDAAEIVCFNCGEKGHKSNACPEEIKK-------CVRCGKKGHVVADCNRT-- 470
           DV+RP+ + +    C NCG++GH +  CPE   K       C RCG++GH+  +C     
Sbjct: 6   DVKRPRTESSTS--CRNCGKEGHYARECPEADSKGDERSTTCFRCGEEGHMSRECPNEAR 63

Query: 471 -----DIVCFNCNGEGHISSQC 487
                 + CF C   GH+S  C
Sbjct: 64  SGAAGAMTCFRCGEAGHMSRDC 85



 Score = 53.1 bits (126), Expect = 3e-06
 Identities = 24/75 (32%), Positives = 35/75 (46%), Gaps = 14/75 (18%)

Query: 429 AAEIVCFNCGEKGHKSNACPE--------EIKKCVRCGKKGHVVADC------NRTDIVC 474
           A +  C+ CG+ GH S  CP           +KC +CG+ GH+  +C         D  C
Sbjct: 165 AGDRTCYKCGDAGHISRDCPNGQGGYSGAGDRKCYKCGESGHMSRECPSAGSTGSGDRAC 224

Query: 475 FNCNGEGHISSQCTQ 489
           + C   GHIS +C +
Sbjct: 225 YKCGKPGHISRECPE 239



 Score = 53.1 bits (126), Expect = 3e-06
 Identities = 25/76 (32%), Positives = 32/76 (41%), Gaps = 17/76 (22%)

Query: 429 AAEIVCFNCGEKGHKSNACPEE------IKKCVRCGKKGHVVADCNRT-----------D 471
           A +  C+ CGE GH S  CP         + C +CGK GH+  +C              D
Sbjct: 193 AGDRKCYKCGESGHMSRECPSAGSTGSGDRACYKCGKPGHISRECPEAGGSYGGSRGGGD 252

Query: 472 IVCFNCNGEGHISSQC 487
             C+ C   GHIS  C
Sbjct: 253 RTCYKCGEAGHISRDC 268



 Score = 45.4 bits (106), Expect = 6e-04
 Identities = 23/85 (27%), Positives = 30/85 (35%), Gaps = 31/85 (36%)

Query: 434 CFNCGEKGHKSNACPEE-----------------------IKKCVRCGKKGHVVADC--- 467
           C+ CG++GH S  CP                          + C +CG  GH+  DC   
Sbjct: 99  CYKCGQEGHLSRDCPSSQGGSRGGYGQKRGRSGAQGGYSGDRTCYKCGDAGHISRDCPNG 158

Query: 468 -----NRTDIVCFNCNGEGHISSQC 487
                   D  C+ C   GHIS  C
Sbjct: 159 QGGYSGAGDRTCYKCGDAGHISRDC 183


  Database: sprot
    Posted date:  Nov 25, 2004 10:54 AM
  Number of letters in database: 59,974,054
  Number of sequences in database:  164,201
  
Lambda     K      H
   0.322    0.138    0.420 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 93,643,851
Number of Sequences: 164201
Number of extensions: 4064217
Number of successful extensions: 10730
Number of sequences better than 10.0: 200
Number of HSP's better than 10.0 without gapping: 96
Number of HSP's successfully gapped in prelim test: 104
Number of HSP's that attempted gapping in prelim test: 10066
Number of HSP's gapped (non-prelim): 520
length of query: 805
length of database: 59,974,054
effective HSP length: 118
effective length of query: 687
effective length of database: 40,598,336
effective search space: 27891056832
effective search space used: 27891056832
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 70 (31.6 bits)


Medicago: description of AC147714.1