Medicago
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= AC134322.16 - phase: 0 
         (690 letters)

Database: sprot 
           164,201 sequences; 59,974,054 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

CPSC_HUMAN (Q9UKF6) Cleavage and polyadenylation specificity fac...   709  0.0
CPSC_BOVIN (P79101) Cleavage and polyadenylation specificity fac...   709  0.0
CPSC_MOUSE (Q9QXK7) Cleavage and polyadenylation specificity fac...   706  0.0
Y162_METJA (Q57626) Hypothetical protein MJ0162                       190  1e-47
YC36_METJA (Q58633) Hypothetical protein MJ1236                       189  3e-47
Y047_METJA (Q60355) Hypothetical protein MJ0047                       142  3e-33
CPSB_CAEEL (O17403) Probable cleavage and polyadenylation specif...   137  1e-31
CPSB_ARATH (Q9LKF9) Cleavage and polyadenylation specificity fac...   136  2e-31
CPSB_DROME (Q9V3D6) Probable cleavage and polyadenylation specif...   130  2e-29
CPSB_HUMAN (Q9P2I0) Cleavage and polyadenylation specificity fac...   126  2e-28
CPSB_BOVIN (Q10568) Cleavage and polyadenylation specificity fac...   126  2e-28
CPSB_XENLA (Q9W799) Cleavage and polyadenylation specificity fac...   124  8e-28
CPSB_MOUSE (O35218) Cleavage and polyadenylation specificity fac...   124  1e-27
Y514_SYNY3 (Q55470) Hypothetical protein sll0514                       80  2e-14
YJ70_CORGL (P54122) Hypothetical UPF0036 protein Cgl1970/cg2160        45  6e-04
Y139_MYCGE (P47385) Hypothetical UPF0036 protein MG139                 43  0.003
K2C4_HUMAN (P19013) Keratin, type II cytoskeletal 4 (Cytokeratin...    42  0.004
RNZ_CLOTE (Q892B5) Ribonuclease Z (EC 3.1.26.11) (RNase Z) (tRNA...    42  0.005
Y139_MYCPN (P75497) Hypothetical UPF0036 protein MG139 homolog (...    42  0.007
ROO_DESGI (Q9F0J6) Rubredoxin-oxygen oxidoreductase (EC 1.-.-.-)...    42  0.007
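
The one-line summaries above follow a fixed layout: entry name, accession in parentheses, a possibly truncated description, then the bit score and E value. A minimal sketch of reading one such line, assuming that layout holds for every row; the names `HIT_RE` and `parse_hit` are hypothetical and not part of BLAST:

```python
import re

# Entry name, (accession), description, bit score, E value.
HIT_RE = re.compile(r"^(\S+) \((\w+)\) (.*?)\s+(\d+(?:\.\d+)?)\s+(\S+)\s*$")

def parse_hit(line):
    """Return a dict for one summary line, or None if it does not match."""
    m = HIT_RE.match(line)
    if m is None:
        return None
    entry, accession, description, bits, evalue = m.groups()
    # Some BLAST versions print very small E values as "e-112"; prepend "1".
    if evalue.startswith("e"):
        evalue = "1" + evalue
    return {
        "entry": entry,
        "accession": accession,
        "description": description,
        "bits": float(bits),
        "evalue": float(evalue),
    }

line = "Y162_METJA (Q57626) Hypothetical protein MJ0162                       190  1e-47"
print(parse_hit(line))
```

Descriptions ending in `...` were truncated by BLAST and are returned as printed; the full text is in the corresponding `>` header of each alignment.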

>CPSC_HUMAN (Q9UKF6) Cleavage and polyadenylation specificity
           factor, 73 kDa subunit (CPSF 73 kDa subunit)
          Length = 684

 Score =  709 bits (1830), Expect = 0.0
 Identities = 360/685 (52%), Positives = 479/685 (69%), Gaps = 18/685 (2%)

Query: 14  INRETEDQLIVTPLGAGNEVGRSCVYMTYKGKTVLFDCGIHPGYSGMAALPYFDEIDPST 73
           I  E  DQL++ PLGAG EVGRSC+ + +KG+ ++ DCGIHPG  GM ALPY D IDP+ 
Sbjct: 4   IPAEESDQLLIRPLGAGQEVGRSCIILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAE 63

Query: 74  VDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLSDYVKVSKVSVDDMLY 133
           +D+LLI+HFHLDH  +LP+FL+KT+FKGR FMT+ATKAIY+ LLSDYVKVS +S DDMLY
Sbjct: 64  IDLLLISHFHLDHCGALPWFLQKTSFKGRTFMTHATKAIYRWLLSDYVKVSNISADDMLY 123

Query: 134 DEQDINRSMDKIEVIDFHQTVEVNGIRFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSRE 193
            E D+  SMDKIE I+FH+  EV GI+FWCY AGHVLGAAMFM++IAGV++LYTGD+SR+
Sbjct: 124 TETDLEESMDKIETINFHEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQ 183

Query: 194 EDRHLRAAETPQFSPDVCIIESTYGVQHHQPRHTREKRFTDVIHSTISQGGRVLIPAYAL 253
           EDRHL AAE P   PD+ IIESTYG   H+ R  RE RF + +H  +++GGR LIP +AL
Sbjct: 184 EDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFAL 243

Query: 254 GRAQELLLILDEYWANHPELQNIPIYYASPLAKKCLTVYETYTLSMNDRI--QNAKSNPF 311
           GRAQELLLILDEYW NHPEL +IPIYYAS LAKKC+ VY+TY  +MND+I  Q   +NPF
Sbjct: 244 GRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININNPF 303

Query: 312 AFKHISALSSIDIFKDVGPSVVMASPGGLQSGLSRQLFDMWCSDKKNSCVIPGYVVEGTL 371
            FKHIS L S+D F D+GPSVVMASPG +QSGLSR+LF+ WC+DK+N  +I GY VEGTL
Sbjct: 304 VFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTL 363

Query: 372 AKTILNEPKEVTLMNGLSAPLHMQVHYISFSAHADSAQTSAFLEELNPPNIILVHGAANE 431
           AK I++EP+E+T M+G   PL M V YISFSAH D  QTS F+  L PP++ILVHG  NE
Sbjct: 364 AKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTSEFIRALKPPHVILVHGEQNE 423

Query: 432 MGRLKQKLMTQFADR---NTKILTPKNCQSVEMYFNSQKMAKTIGKLAEKTPEVGETVSG 488
           M RLK  L+ ++ D    + ++  P+N ++V + F  +K+AK +G LA+K PE G+ VSG
Sbjct: 424 MARLKAALIREYEDNDEVHIEVHNPRNTEAVTLNFRGEKLAKVMGFLADKKPEQGQRVSG 483

Query: 489 LLVKKGFTYQIMAPDDLHVFSQLSTANVTQRITIPYSGAFCVIQSRLKQIYESVEPSVDE 548
           +LVK+ F Y I++P DL  ++ L+ + V Q   IPY+G F ++  +L+++   VE    +
Sbjct: 484 ILVKRNFNYHILSPCDLSNYTDLAMSTVKQTQAIPYTGPFNLLCYQLQKLTGDVEELEIQ 543

Query: 549 ESGVPMLLVHDRVTVKHESEKHVSLHWASDPINDMVSDSVVALVLNINRDLPKIVAESDA 608
           E   P L V   +TV  E    V L W ++P NDM +D+V  ++L +  + PKI  +   
Sbjct: 544 EK--PALKVFKNITVIQEPGM-VVLEWLANPSNDMYADTVTTVILEVQSN-PKI-RKGAV 598

Query: 609 TKIEEENEKKT-EKVMQALLNSLFGN--VKVGENGKLIINIDGNVAELNKESGEVESE-- 663
            K+ ++ E     K ++ +L  +FG   V V ++  L + +DG  A LN E+  VE E  
Sbjct: 599 QKVSKKLEMHVYSKRLEIMLQDIFGEDCVSVKDDSILSVTVDGKTANLNLETRTVECEEG 658

Query: 664 ---NEGLKERVRTAFRRIQSSVKPI 685
              +E L+E V  A +R+  ++ P+
Sbjct: 659 SEDDESLREMVELAAQRLYEALTPV 683


>CPSC_BOVIN (P79101) Cleavage and polyadenylation specificity
           factor, 73 kDa subunit (CPSF 73 kDa subunit)
          Length = 684

 Score =  709 bits (1829), Expect = 0.0
 Identities = 359/685 (52%), Positives = 478/685 (69%), Gaps = 18/685 (2%)

Query: 14  INRETEDQLIVTPLGAGNEVGRSCVYMTYKGKTVLFDCGIHPGYSGMAALPYFDEIDPST 73
           I  E  DQL++ PLGAG EVGRSC+ + +KG+ ++ DCGIHPG  GM ALPY D IDP+ 
Sbjct: 4   IPAEESDQLLIRPLGAGQEVGRSCIILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAE 63

Query: 74  VDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLSDYVKVSKVSVDDMLY 133
           +D+LLI+HFHLDH  +LP+FL+KT+FKGR FMT+ATKAIY+ LLSDYVKVS +S DDMLY
Sbjct: 64  IDLLLISHFHLDHCGALPWFLQKTSFKGRTFMTHATKAIYRWLLSDYVKVSNISADDMLY 123

Query: 134 DEQDINRSMDKIEVIDFHQTVEVNGIRFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSRE 193
            E D+  SMDKIE I+FH+  EV GI+FWCY AGHVLGAAMFM++IAGV++LYTGD+SR+
Sbjct: 124 TETDLEESMDKIETINFHEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQ 183

Query: 194 EDRHLRAAETPQFSPDVCIIESTYGVQHHQPRHTREKRFTDVIHSTISQGGRVLIPAYAL 253
           EDRHL AAE P   PD+ IIESTYG   H+ R  RE RF + +H  +++GGR LIP +AL
Sbjct: 184 EDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFAL 243

Query: 254 GRAQELLLILDEYWANHPELQNIPIYYASPLAKKCLTVYETYTLSMNDRI--QNAKSNPF 311
           GRAQELLLILDEYW NHPEL +IPIYYAS LAKKC+ VY+TY  +MND+I  Q   +NPF
Sbjct: 244 GRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININNPF 303

Query: 312 AFKHISALSSIDIFKDVGPSVVMASPGGLQSGLSRQLFDMWCSDKKNSCVIPGYVVEGTL 371
            FKHIS L S+D F D+GPSVVMASPG +QSGLSR+LF+ WC+DK+N  +I GY VEGTL
Sbjct: 304 VFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTL 363

Query: 372 AKTILNEPKEVTLMNGLSAPLHMQVHYISFSAHADSAQTSAFLEELNPPNIILVHGAANE 431
           AK I++EP+E+T M+G   PL M V YISFSAH D  QTS F+  L PP++ILVHG  NE
Sbjct: 364 AKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTSEFIRALKPPHVILVHGEQNE 423

Query: 432 MGRLKQKLMTQFADR---NTKILTPKNCQSVEMYFNSQKMAKTIGKLAEKTPEVGETVSG 488
           M RLK  L+ ++ D    + ++  P+N ++V + F  +K+AK +G LA+K PE G+ VSG
Sbjct: 424 MARLKAALIREYEDNDEVHIEVHNPRNTEAVTLNFRGEKLAKVMGFLADKKPEQGQRVSG 483

Query: 489 LLVKKGFTYQIMAPDDLHVFSQLSTANVTQRITIPYSGAFCVIQSRLKQIYESVEPSVDE 548
           +LVK+ F Y I++P DL  ++ L+ + V Q   IPY+G F ++  +L+++   VE    +
Sbjct: 484 ILVKRNFNYHILSPCDLSNYTDLAMSTVKQTQAIPYTGPFNLLYYQLQKLTGDVEELEIQ 543

Query: 549 ESGVPMLLVHDRVTVKHESEKHVSLHWASDPINDMVSDSVVALVLNINRDLPKIVAESDA 608
           E   P L V   +TV  E    V L W ++P NDM +D+V  ++L +  + PKI  +   
Sbjct: 544 EK--PALKVFKNITVIQEPGM-VVLEWLANPSNDMYADTVTTVILEVQSN-PKI-RKGAV 598

Query: 609 TKIEEENEKKT-EKVMQALLNSLFGN--VKVGENGKLIINIDGNVAELNKESGEVESE-- 663
            K+ ++ E     K ++ +L  +FG   V V +   L + +DG  A +N E+  VE E  
Sbjct: 599 QKVSKKLEMHVYSKRLEIMLQDIFGEDCVSVKDGSILSVTVDGKTANINLETRTVECEEG 658

Query: 664 ---NEGLKERVRTAFRRIQSSVKPI 685
              +E L+E V  A +R+  ++ P+
Sbjct: 659 SEDDESLREMVELAAQRLYEALTPV 683


>CPSC_MOUSE (Q9QXK7) Cleavage and polyadenylation specificity
           factor, 73 kDa subunit (CPSF 73 kDa subunit)
          Length = 684

 Score =  706 bits (1822), Expect = 0.0
 Identities = 357/685 (52%), Positives = 477/685 (69%), Gaps = 18/685 (2%)

Query: 14  INRETEDQLIVTPLGAGNEVGRSCVYMTYKGKTVLFDCGIHPGYSGMAALPYFDEIDPST 73
           I  E  DQL++ PLGAG EVGRSC+ + +KG+ ++ DCGIHPG  GM ALPY D IDP+ 
Sbjct: 4   IPAEESDQLLIRPLGAGQEVGRSCIILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAE 63

Query: 74  VDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLSDYVKVSKVSVDDMLY 133
           +D+LLI+HFHLDH  +LP+FL+KT+FKGR FMT+ATKAIY+ LLSDYVKVS +S DDMLY
Sbjct: 64  IDLLLISHFHLDHCGALPWFLQKTSFKGRTFMTHATKAIYRWLLSDYVKVSNISADDMLY 123

Query: 134 DEQDINRSMDKIEVIDFHQTVEVNGIRFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSRE 193
            E D+  SMDKIE I+FH+  EV GI+FWCY AGHVLGAAMFM++IAGV++LYTGD+SR+
Sbjct: 124 TETDLEESMDKIETINFHEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQ 183

Query: 194 EDRHLRAAETPQFSPDVCIIESTYGVQHHQPRHTREKRFTDVIHSTISQGGRVLIPAYAL 253
           EDRHL AAE P   PD+ IIESTYG   H+ R  RE RF   +H  +++GGR LIP +AL
Sbjct: 184 EDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFWHTVHDIVNRGGRGLIPVFAL 243

Query: 254 GRAQELLLILDEYWANHPELQNIPIYYASPLAKKCLTVYETYTLSMNDRI--QNAKSNPF 311
           GRAQELLLILDEYW NHPEL +IPIYYAS LAKKC+ VY+TY  +MND+I  Q   +NPF
Sbjct: 244 GRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININNPF 303

Query: 312 AFKHISALSSIDIFKDVGPSVVMASPGGLQSGLSRQLFDMWCSDKKNSCVIPGYVVEGTL 371
            FKHIS L S+D F D+GPSVVMASPG +Q+GLSR+LF+ WC+DK+N  +I GY VEGTL
Sbjct: 304 VFKHISNLKSMDHFDDIGPSVVMASPGMIQNGLSRELFESWCTDKRNGVIIAGYCVEGTL 363

Query: 372 AKTILNEPKEVTLMNGLSAPLHMQVHYISFSAHADSAQTSAFLEELNPPNIILVHGAANE 431
           AK I++EP+E+T M+G   PL M V YISFSAH D  QTS F+  L PP++ILVHG  NE
Sbjct: 364 AKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTSEFIRALKPPHVILVHGEQNE 423

Query: 432 MGRLKQKLMTQFADR---NTKILTPKNCQSVEMYFNSQKMAKTIGKLAEKTPEVGETVSG 488
           M RLK  L+ ++ D    + ++  P+N ++V + F  +K+AK +G LA+K PE G+ VSG
Sbjct: 424 MARLKAALIREYEDNDEVHIEVHNPRNTEAVTLNFRGEKLAKVMGFLADKKPEQGQRVSG 483

Query: 489 LLVKKGFTYQIMAPDDLHVFSQLSTANVTQRITIPYSGAFCVIQSRLKQIYESVEPSVDE 548
           +LVK+ F Y I++P DL  ++ L+ + V Q   IPY+G F ++  +L+++   VE    +
Sbjct: 484 ILVKRNFNYHILSPCDLSNYTDLAMSTVKQTQAIPYTGPFYLLYYQLQKLTGDVEELDIQ 543

Query: 549 ESGVPMLLVHDRVTVKHESEKHVSLHWASDPINDMVSDSVVALVLNINRDLPKIVAESDA 608
           E   P L V   +TV  E    V   W ++P NDM +D+V  ++L +  + PKI  +   
Sbjct: 544 EK--PALKVFKSITVVQEPGM-VGSEWLANPSNDMYADTVTTVILEVQSN-PKI-RKGAV 598

Query: 609 TKIEEENEKKT-EKVMQALLNSLFGN--VKVGENGKLIINIDGNVAELNKESGEVESE-- 663
            K+ ++ E     K ++ +L  +FG   V V ++  L + +DG  A +N E+  VE E  
Sbjct: 599 QKVSKKLEMHVYSKRLEVMLQDIFGEDCVSVKDDSVLSVTVDGKTANINLETRAVECEEG 658

Query: 664 ---NEGLKERVRTAFRRIQSSVKPI 685
              +E L+E V  A +R+  ++ P+
Sbjct: 659 SEDDESLREMVELAAQRLYEALTPV 683


>Y162_METJA (Q57626) Hypothetical protein MJ0162
          Length = 421

 Score =  190 bits (482), Expect = 1e-47
 Identities = 123/405 (30%), Positives = 211/405 (51%), Gaps = 27/405 (6%)

Query: 28  GAGNEVGRSCVYMTYKGKTVLFDCGIHPGYSGMAALPYFDEIDPSTVDVLLITHFHLDHA 87
           G   ++G SCV +  +   VL DCG+ P    +       ++D   VD ++++H HLDH 
Sbjct: 8   GGCQQIGMSCVEVETQKGRVLLDCGMSPDTGEIP------KVDDKAVDAVIVSHAHLDHC 61

Query: 88  ASLPYFLEKTTFKGRVFMTYATKAIYKLLLSDYVKVSKVSVDDMLYDEQDINRSMDKIEV 147
            ++P++  K     +++ T+ T  +  +   D + ++K       Y E+DI  +M+ IE 
Sbjct: 62  GAIPFYKFK-----KIYCTHPTADLMFITWRDTLNLTKA------YKEEDIQHAMENIEC 110

Query: 148 IDFHQTVEVN-GIRFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSREEDRHLRAAETPQF 206
           +++++  ++   I+F  Y AGH+LG+A   +++ G ++LYTGD +    R L  A+T   
Sbjct: 111 LNYYEERQITENIKFKFYNAGHILGSASIYLEVDGKKILYTGDINEGVSRTLLPADTDID 170

Query: 207 SPDVCIIESTYG--VQHHQPRHTREKRFTDVIHSTISQGGRVLIPAYALGRAQELLLILD 264
             DV IIESTYG  +     R T E++  + I  TI  GG+V+IP +A+GRAQE+LLI++
Sbjct: 171 EIDVLIIESTYGSPLDIKPARKTLERQLIEEISETIENGGKVIIPVFAIGRAQEILLIIN 230

Query: 265 EYWANHPELQNIPIYYASPLAKKCLTVYETYTLSMNDRIQNAKSNPF-AFKHISALSSID 323
            Y     +L+++PIY    L      VY +Y   +N +I+N   N    F  I       
Sbjct: 231 NY-IRSGKLRDVPIYTDGSLI-HATAVYMSYINWLNPKIKNMVENRINPFGEIKKADESL 288

Query: 324 IFKDVGPSVVMASPGGLQSGLSRQLFDMWCSDKKNSCVIPGYVVEGTLAKTILNEPKEVT 383
           +F +  P +++++ G +Q G   +   +   D KN  ++ GY  EGTL + +    KE+ 
Sbjct: 289 VF-NKEPCIIVSTSGMVQGGPVLKYLKL-LKDPKNKLILTGYQAEGTLGRELEEGAKEIQ 346

Query: 384 LMNGLSAPLHMQVHYISFSAHADSAQTSAFLEEL-NPPNIILVHG 427
                  P+  +V  I FSAH D      +++++  P   I++HG
Sbjct: 347 PFKN-KIPIRGKVVKIEFSAHGDYNSLVRYIKKIPKPEKAIVMHG 390


>YC36_METJA (Q58633) Hypothetical protein MJ1236
          Length = 634

 Score =  189 bits (479), Expect = 3e-47
 Identities = 139/425 (32%), Positives = 216/425 (50%), Gaps = 25/425 (5%)

Query: 24  VTPLGAGNEVGRSCVYMTYKGKTVLFDCGIHPGYSGMAALPYFD--EIDPSTVDVLLITH 81
           V+ LG   EVGRSC+Y+      VL DCGI+      A  P+FD  E     +D +++TH
Sbjct: 182 VSFLGGAREVGRSCLYVQTPDTRVLIDCGINVACEDKA-FPHFDAPEFSIEDLDAVIVTH 240

Query: 82  FHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLSDYVKVSKVSVDDMLYDEQDINRS 141
            HLDH   +P  L +  + G V+ T  T+ +  LL  DY++++K    ++ Y  +DI   
Sbjct: 241 AHLDHCGFIPG-LFRYGYDGPVYCTRPTRDLMTLLQKDYLEIAKKEGKEVPYTSKDIKTC 299

Query: 142 MDKIEVIDFHQTVEVNG-IRFWCYTAGHVLGAAMFMVDIAG--VRVLYTGDYSREEDRHL 198
           +     ID+  T +++  I+   + AGHVLG+A+  + I      + YTGD   E  R L
Sbjct: 300 VKHTIPIDYGVTTDISPTIKLTLHNAGHVLGSAIAHLHIGEGLYNLAYTGDIKFETSRLL 359

Query: 199 RAAETPQFSPDVCIIESTYGVQHH--QPRHTREKRFTDVIHSTISQGGRVLIPAYALGRA 256
             A       +  IIESTYG        R   E+    V+  T  +GG+VLIP + +GRA
Sbjct: 360 EPAVCQFPRLETLIIESTYGAYDDVLPEREEAERELLRVVSETTDRGGKVLIPVFGVGRA 419

Query: 257 QELLLILDEYWANHPELQNIPIYYASPL--AKKCLTVYETY-TLSMNDRIQNAKSNPF-- 311
           QEL+L+L+E +  +  + N P+Y    +  A    T Y  Y +  M  +I +   NPF  
Sbjct: 420 QELMLVLEEGY--NQGIFNAPVYLDGMIWEATAIHTAYPEYLSKEMRQKIFHEGDNPFLS 477

Query: 312 -AFKHISALSS-IDIFKDVGPSVVMASPGGLQSGLSRQLFDMWCSDKKNSCVIPGYVVEG 369
             FK + + +    +     P V++A+ G L  G S +       D+KN+ +  GY  EG
Sbjct: 478 EVFKRVGSTNERRKVIDSDEPCVILATSGMLTGGPSVEYLKHLAPDEKNAIIFVGYQAEG 537

Query: 370 TLAKTILNEPKEVTLM--NG--LSAPLHMQVHYI-SFSAHADSAQTSAFLEEL--NPPNI 422
           TL + + +  KE+ ++  NG   S P+++QV+ I  FS H+D  Q   ++  L  +P  I
Sbjct: 538 TLGRKVQSGWKEIPIITRNGKTKSIPINLQVYTIEGFSGHSDRKQLIKYIRRLKPSPEKI 597

Query: 423 ILVHG 427
           I+VHG
Sbjct: 598 IMVHG 602


>Y047_METJA (Q60355) Hypothetical protein MJ0047
          Length = 428

 Score =  142 bits (358), Expect = 3e-33
 Identities = 108/409 (26%), Positives = 190/409 (46%), Gaps = 26/409 (6%)

Query: 28  GAGNEVGRSCVYMTYKGKTVLFDCGIHPGYSGMAALPYFDEIDPSTVDVLLITHFHLDHA 87
           GA  EVGRSC+ +      +L DCG+  G       P  D      VD + I+H HLDH+
Sbjct: 7   GAALEVGRSCIEIKTDKSKILLDCGVKLGKE--IEYPILDN-SIRDVDKVFISHAHLDHS 63

Query: 88  ASLPYFLEKTTFKGRVFMTYATKAIYKLLLSDYVKVSKVSVDDMLYDEQDINRSMDKIEV 147
            +LP    +      V  T  +K + K+LL D VK+++     + Y+  D+  ++     
Sbjct: 64  GALPVLFHRK-MDVPVITTELSKKLIKVLLKDMVKIAETENKKIPYNNHDVKEAIRHTIP 122

Query: 148 IDFHQTVEVNGIRFWCYTAGHVLGAAMFMVDIAGVR-VLYTGDYSREEDRHLRAAETPQF 206
           ++++         +  ++AGH+ G+A  +++    + +LYTGD    + R  + A+    
Sbjct: 123 LNYNDKKYYKDFSYELFSAGHIPGSASILLNYQNNKTILYTGDVKLRDTRLTKGADLSYT 182

Query: 207 SPDV--CIIESTYGVQHHQPRHTREKRFTDVIHSTISQGGRVLIPAYALGRAQELLLILD 264
             D+   IIESTYG   H  R   E  F + I   + +GG  LIP +A+ RAQE+LLIL+
Sbjct: 183 KDDIDILIIESTYGNSIHPDRKAVELSFIEKIKEILFRGGVALIPVFAVDRAQEILLILN 242

Query: 265 EYWANHPELQNIPIYYASPLAKKCLTVYETYTLSMNDRIQNAKSNPFAFKHISAL-SSID 323
           +Y  + P        Y   +A +   +   Y   +N+  Q  K    A K++  +  S D
Sbjct: 243 DYNIDAP-------IYLDGMAVEVTKLMLNYKHMLNESSQLEK----ALKNVKIIEKSED 291

Query: 324 IFKDV-----GPSVVMASPGGLQSGLSRQLFDMWCSDKKNSCVIPGYVVEGTLAKTILNE 378
             K +        +V+ + G L  G       ++  + KN+ ++ GY V  +  + ++  
Sbjct: 292 RIKAIENLSKNGGIVVTTAGMLDGGPILYYLKLFMHNPKNALLLTGYQVRDSNGRHLIET 351

Query: 379 PKEVTLMNGLSAPLHMQVHYISFSAHADSAQTSAFLEELNPPNIILVHG 427
            K     + +    +++V   +FS HA   +    ++++NP  +I+ HG
Sbjct: 352 GKIFIGKDEIKP--NLEVCMYNFSCHAGMDELHEIIKKVNPELLIIQHG 398


>CPSB_CAEEL (O17403) Probable cleavage and polyadenylation
           specificity factor, 100 kDa subunit (CPSF 100 kDa
           subunit)
          Length = 843

 Score =  137 bits (344), Expect = 1e-31
 Identities = 98/365 (26%), Positives = 176/365 (47%), Gaps = 21/365 (5%)

Query: 28  GAGNEVGRSCVYMTYKGKTVLFDCGIHPGYSGMAALPYFDEIDP--STVDVLLITHFHLD 85
           GA +E G  C  +   G  +L DCG    +     L YF+E+ P    +  +LI+H    
Sbjct: 12  GAKDE-GPLCYLLQVDGDYILLDCG----WDERFGLQYFEELKPFIPKISAVLISHPDPL 66

Query: 86  HAASLPYFLEKTTFKGRVFMTYATKAIYKLLLSDYVKVSKVSVDDML-YDEQDINRSMDK 144
           H   LPY + K      V+ T     + ++ + D V  S + V++   Y   D++ + +K
Sbjct: 67  HLGGLPYLVSKCGLTAPVYATVPVYKMGQMFIYDMV-YSHLDVEEFEHYTLDDVDTAFEK 125

Query: 145 IEVIDFHQTVEV---NGIRFWCYTAGHVLGAAMFMV-DIAGVRVLYTGDYSREEDRHLRA 200
           +E + ++QTV +   +G+ F    AGH+LG +++ +  + G  ++Y  D++ +++RHL  
Sbjct: 126 VEQVKYNQTVVLKGDSGVHFTALPAGHMLGGSIWRICRVTGEDIVYCVDFNHKKERHLNG 185

Query: 201 AETPQFSPDVCIIESTYGVQHHQPRHT-REKRFTDVIHSTISQGGRVLIPAYALGRAQEL 259
                F+    +I   + +   Q R   R+++    I  T+ Q G  +I     GR  EL
Sbjct: 186 CSFDNFNRPHLLITGAHHISLPQMRRKDRDEQLVTKILRTVRQKGDCMIVIDTAGRVLEL 245

Query: 260 LLILDEYWANHPE-LQNIPIYYASPLAKKCLTVYETYTLSMNDRI-----QNAKSNPFAF 313
             +LD+ W+N    L    +   S +A   +   ++    MN+++      +A+ NPF  
Sbjct: 246 AHLLDQLWSNADAGLSTYNLVMMSHVASSVVQFAKSQLEWMNEKLFKYDSSSARYNPFTL 305

Query: 314 KHISALSSI-DIFKDVGPSVVMASPGGLQSGLSRQLFDMWCSDKKNSCVIPGYVVEGTLA 372
           KH++   S  ++ +   P VV+ S   ++SG SR+LF  WCSD +N  ++       TLA
Sbjct: 306 KHVTLCHSHQELMRVRSPKVVLCSSQDMESGFSRELFLDWCSDPRNGVILTARPASFTLA 365

Query: 373 KTILN 377
             ++N
Sbjct: 366 AKLVN 370


>CPSB_ARATH (Q9LKF9) Cleavage and polyadenylation specificity
           factor, 100 kDa subunit (CPSF 100 kDa subunit)
          Length = 739

 Score =  136 bits (342), Expect = 2e-31
 Identities = 106/372 (28%), Positives = 187/372 (49%), Gaps = 24/372 (6%)

Query: 24  VTPL-GAGNEVGRSCVYMTYKGKTVLFDCGIHPGYSGMAALPYFDEIDPSTVDVLLITHF 82
           VTPL G  NE   S + ++  G   L DCG +  +      P       ST+D +L++H 
Sbjct: 7   VTPLCGVYNENPLSYL-VSIDGFNFLIDCGWNDLFDTSLLEPLSRVA--STIDAVLLSHP 63

Query: 83  HLDHAASLPYFLEKTTFKGRVFMTYATKAIYKL-LLSDYVK-VSKVSVDDM-LYDEQDIN 139
              H  +LPY +++      V   YAT+ +++L LL+ Y + +S+  V D  L+   DI+
Sbjct: 64  DTLHIGALPYAMKQLGLSAPV---YATEPVHRLGLLTMYDQFLSRKQVSDFDLFTLDDID 120

Query: 140 RSMDKIEVIDFHQTVEVNG----IRFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSREED 195
            +   +  + + Q   ++G    I    + AGH+LG +++ +   G  V+Y  DY+  ++
Sbjct: 121 SAFQNVIRLTYSQNYHLSGKGEGIVIAPHVAGHMLGGSIWRITKDGEDVIYAVDYNHRKE 180

Query: 196 RHLRAAETPQF-SPDVCIIESTYGVQHHQP-RHTREKRFTDVIHSTISQGGRVLIPAYAL 253
           RHL       F  P V I ++ + +  +Q  R  R+K F D I   +  GG VL+P    
Sbjct: 181 RHLNGTVLQSFVRPAVLITDAYHALYTNQTARQQRDKEFLDTISKHLEVGGNVLLPVDTA 240

Query: 254 GRAQELLLILDEYWANHPELQNIPIYYASPLAKKCLTVYETYTLSMNDRI----QNAKSN 309
           GR  ELLLIL+++W+      + PIY+ + ++   +   +++   M+D I    + ++ N
Sbjct: 241 GRVLELLLILEQHWSQRG--FSFPIYFLTYVSSSTIDYVKSFLEWMSDSISKSFETSRDN 298

Query: 310 PFAFKHISALSSIDIFKDV--GPSVVMASPGGLQSGLSRQLFDMWCSDKKNSCVIPGYVV 367
            F  +H++ L +     +   GP VV+AS   L++G +R++F  W +D +N  +      
Sbjct: 299 AFLLRHVTLLINKTDLDNAPPGPKVVLASMASLEAGFAREIFVEWANDPRNLVLFTETGQ 358

Query: 368 EGTLAKTILNEP 379
            GTLA+ + + P
Sbjct: 359 FGTLARMLQSAP 370


>CPSB_DROME (Q9V3D6) Probable cleavage and polyadenylation
           specificity factor, 100 kDa subunit (CPSF 100 kDa
           subunit)
          Length = 756

 Score =  130 bits (326), Expect = 2e-29
 Identities = 95/357 (26%), Positives = 164/357 (45%), Gaps = 23/357 (6%)

Query: 37  CVYMTYKGKTVLFDCGIHPGYSGMAALPYFDEIDPSTVDVLLITHFHLDHAASLPYFLEK 96
           C  +      +L DCG    +          ++   T+D +L++H    H  +LPY + K
Sbjct: 20  CYILQIDDVRILLDCGWDEKFDANFIKELKRQVH--TLDAVLLSHPDAYHLGALPYLVGK 77

Query: 97  TTFKGRVFMTYATKAIYKLLLSDYVKVSKVSVDDM-LYDEQDINRSMDKIEVIDFHQTVE 155
                 ++ T     + ++ + D + +S  ++ D  L+   D++ + +KI  + ++QTV 
Sbjct: 78  LGLNCPIYATIPVFKMGQMFMYD-LYMSHFNMGDFDLFSLDDVDTAFEKITQLKYNQTVS 136

Query: 156 VN----GIRFWCYTAGHVLGAAMF-MVDIAGVRVLYTGDYSREEDRHLRAAETPQFSPDV 210
           +     GI      AGH++G  ++ +V +    ++Y  D++ +++RHL   E  +     
Sbjct: 137 LKDKGYGISITPLNAGHMIGGTIWKIVKVGEEDIVYATDFNHKKERHLSGCELDRLQRPS 196

Query: 211 CIIESTYGVQHHQPRH-TREKRFTDVIHSTISQGGRVLIPAYALGRAQELLLILDEYWAN 269
            +I   Y  Q+ Q R   R+++    I  T+   G VLI     GR  EL  +LD+ W N
Sbjct: 197 LLITDAYNAQYQQARRRARDEKLMTNILQTVRNNGNVLIAVDTAGRVLELAHMLDQLWKN 256

Query: 270 HPE--------LQNIPIYYASPLAKKCLTVYETYTLSMNDRIQNAKSNPFAFKHISALSS 321
                      L N   Y     AK  +   E  +  +    + A++NPF FKHI    S
Sbjct: 257 KESGLMAYSLALLNNVSYNVIEFAKSQI---EWMSDKLTKAFEGARNNPFQFKHIQLCHS 313

Query: 322 I-DIFK-DVGPSVVMASPGGLQSGLSRQLFDMWCSDKKNSCVIPGYVVEGTLAKTIL 376
           + D++K   GP VV+AS   L+SG +R LF  W S+  NS ++      GTLA  ++
Sbjct: 314 LADVYKLPAGPKVVLASTPDLESGFTRDLFVQWASNANNSIILTTRTSPGTLAMELV 370


>CPSB_HUMAN (Q9P2I0) Cleavage and polyadenylation specificity
           factor, 100 kDa subunit (CPSF 100 kDa subunit)
          Length = 782

 Score =  126 bits (316), Expect = 2e-28
 Identities = 97/378 (25%), Positives = 169/378 (44%), Gaps = 30/378 (7%)

Query: 24  VTPLGAGNEVGRSCVYMTYKGKTVLFDCGIHPGYSGMAALPYFDEIDP-----STVDVLL 78
           +T L    E    C  +       L DCG    +S        D ID        +D +L
Sbjct: 7   LTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFS-------MDIIDSLRKHVHQIDAVL 59

Query: 79  ITHFHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLSDYVKVSKVSVDDMLYDEQDI 138
           ++H    H  +LPY + K      ++ T     + ++ + D  +    + D  L+   D+
Sbjct: 60  LSHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDV 119

Query: 139 NRSMDKIEVIDFHQTVEV----NGIRFWCYTAGHVLGAAMFMVDIAGVR-VLYTGDYSRE 193
           + + DKI+ + F Q V +    +G+      AGH++G  ++ +   G   ++Y  D++ +
Sbjct: 120 DAAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHK 179

Query: 194 EDRHLRAAETPQFSPDVCIIESTYGVQHHQPRHTR--EKRFTDVIHSTISQGGRVLIPAY 251
            + HL        S    +I  ++   + QPR  +  E+  T+V+  T+   G VLI   
Sbjct: 180 REIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLE-TLRGDGNVLIAVD 238

Query: 252 ALGRAQELLLILDEYWANHPELQNIPIYYASPLAKKCLTVYE---TYTLSMNDRI----Q 304
             GR  EL  +LD+ W        + +Y  + L      V E   +    M+D++    +
Sbjct: 239 TAGRVLELAQLLDQIWRTKDA--GLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFE 296

Query: 305 NAKSNPFAFKHISALSSI-DIFKDVGPSVVMASPGGLQSGLSRQLFDMWCSDKKNSCVIP 363
           + ++NPF F+H+S    + D+ +   P VV+AS   L+ G SR LF  WC D KNS ++ 
Sbjct: 297 DKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILT 356

Query: 364 GYVVEGTLAKTILNEPKE 381
                GTLA+ +++ P E
Sbjct: 357 YRTTPGTLARFLIDNPSE 374


>CPSB_BOVIN (Q10568) Cleavage and polyadenylation specificity
           factor, 100 kDa subunit (CPSF 100 kDa subunit)
          Length = 782

 Score =  126 bits (316), Expect = 2e-28
 Identities = 97/378 (25%), Positives = 169/378 (44%), Gaps = 30/378 (7%)

Query: 24  VTPLGAGNEVGRSCVYMTYKGKTVLFDCGIHPGYSGMAALPYFDEIDP-----STVDVLL 78
           +T L    E    C  +       L DCG    +S        D ID        +D +L
Sbjct: 7   LTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFS-------MDIIDSLRKHVHQIDAVL 59

Query: 79  ITHFHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLSDYVKVSKVSVDDMLYDEQDI 138
           ++H    H  +LPY + K      ++ T     + ++ + D  +    + D  L+   D+
Sbjct: 60  LSHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDV 119

Query: 139 NRSMDKIEVIDFHQTVEV----NGIRFWCYTAGHVLGAAMFMVDIAGVR-VLYTGDYSRE 193
           + + DKI+ + F Q V +    +G+      AGH++G  ++ +   G   ++Y  D++ +
Sbjct: 120 DAAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHK 179

Query: 194 EDRHLRAAETPQFSPDVCIIESTYGVQHHQPRHTR--EKRFTDVIHSTISQGGRVLIPAY 251
            + HL        S    +I  ++   + QPR  +  E+  T+V+  T+   G VLI   
Sbjct: 180 REIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLE-TLRGDGNVLIAVD 238

Query: 252 ALGRAQELLLILDEYWANHPELQNIPIYYASPLAKKCLTVYE---TYTLSMNDRI----Q 304
             GR  EL  +LD+ W        + +Y  + L      V E   +    M+D++    +
Sbjct: 239 TAGRVLELAQLLDQIWRTKDA--GLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFE 296

Query: 305 NAKSNPFAFKHISALSSI-DIFKDVGPSVVMASPGGLQSGLSRQLFDMWCSDKKNSCVIP 363
           + ++NPF F+H+S    + D+ +   P VV+AS   L+ G SR LF  WC D KNS ++ 
Sbjct: 297 DKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILT 356

Query: 364 GYVVEGTLAKTILNEPKE 381
                GTLA+ +++ P E
Sbjct: 357 YRTTPGTLARFLIDNPSE 374


>CPSB_XENLA (Q9W799) Cleavage and polyadenylation specificity
           factor, 100 kDa subunit (CPSF 100 kDa subunit)
          Length = 783

 Score =  124 bits (311), Expect = 8e-28
 Identities = 93/375 (24%), Positives = 170/375 (44%), Gaps = 24/375 (6%)

Query: 24  VTPLGAGNEVGRSCVYMTYKGKTVLFDCGIHPGYSGMAALPYFDEIDPST--VDVLLITH 81
           +T L    E    C  +       L DCG    +S    +   D +      VD +L++H
Sbjct: 7   LTTLVGAQEESAVCYLLQVDEFRFLLDCGWDENFS----MDIIDSVKKYVHQVDAVLLSH 62

Query: 82  FHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLSDYVKVSKVSVDDMLYDEQDINRS 141
               H  +LPY + K      ++ T     + ++ + D  +    + D  L+   D++ +
Sbjct: 63  PDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFSLFSLDDVDCA 122

Query: 142 MDKIEVIDFHQTVEV----NGIRFWCYTAGHVLGAAMFMVDIAGVR-VLYTGDYSREEDR 196
            DKI+ + ++Q V +    +G+      AGH++G  ++ +   G   ++Y  D++ + + 
Sbjct: 123 FDKIQQLKYNQIVHLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKREI 182

Query: 197 HLRAAETPQFSPDVCIIESTYGVQHHQPRHTR--EKRFTDVIHSTISQGGRVLIPAYALG 254
           HL        +    +I  ++   + QPR  +  E+  T+V+  T+   G VLI     G
Sbjct: 183 HLNGCSLEMINRPSLLITDSFNATYVQPRRKQRDEQLLTNVLE-TLRGDGNVLIAVDTAG 241

Query: 255 RAQELLLILDEYWANHPELQNIPIYYASPLAKKCLTVYE---TYTLSMNDRI----QNAK 307
           R  EL  +LD+ W        + +Y  + L      V E   +    M+D++    ++ +
Sbjct: 242 RVLELAQLLDQIWRTKDA--GLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKR 299

Query: 308 SNPFAFKHISALSSI-DIFKDVGPSVVMASPGGLQSGLSRQLFDMWCSDKKNSCVIPGYV 366
           +NPF F+H++      D+ +   P VV+AS   L+ G SR+LF  WC D KNS ++    
Sbjct: 300 NNPFQFRHLTLCHGYSDLARVPSPKVVLASQPDLECGFSRELFIQWCQDPKNSVILTYRT 359

Query: 367 VEGTLAKTILNEPKE 381
             GTLA+ +++ P E
Sbjct: 360 TPGTLARFLIDHPSE 374



 Score = 33.9 bits (76), Expect = 1.5
 Identities = 17/72 (23%), Positives = 34/72 (46%), Gaps = 1/72 (1%)

Query: 389 SAPLHMQVHYISFSAHADSAQTSAFLEELNPPNIILVHGAANEMGRLKQKLMTQFADRNT 448
           S  +  +V YI +   +D       + ++ P  +I+VHG  +    L +     F  ++ 
Sbjct: 528 SMEIKARVTYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPPDATQDLAEACRA-FGGKDI 586

Query: 449 KILTPKNCQSVE 460
           K+ TPK  ++V+
Sbjct: 587 KVYTPKLHETVD 598


>CPSB_MOUSE (O35218) Cleavage and polyadenylation specificity
           factor, 100 kDa subunit (CPSF 100 kDa subunit)
          Length = 782

 Score =  124 bits (310), Expect = 1e-27
 Identities = 96/378 (25%), Positives = 169/378 (44%), Gaps = 30/378 (7%)

Query: 24  VTPLGAGNEVGRSCVYMTYKGKTVLFDCGIHPGYSGMAALPYFDEIDP-----STVDVLL 78
           +T L    E    C  +       L DCG    +S        D ID        +D +L
Sbjct: 7   LTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFS-------VDIIDSLRKHVHQIDAVL 59

Query: 79  ITHFHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLSDYVKVSKVSVDDMLYDEQDI 138
           ++H    H  +LP+ + K      ++ T     + ++ + D  +    + D  L+   D+
Sbjct: 60  LSHPDPLHLGALPFAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDV 119

Query: 139 NRSMDKIEVIDFHQTVEV----NGIRFWCYTAGHVLGAAMFMVDIAGVR-VLYTGDYSRE 193
           + + DKI+ + F Q V +    +G+      AGH++G  ++ +   G   ++Y  D++ +
Sbjct: 120 DAAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHK 179

Query: 194 EDRHLRAAETPQFSPDVCIIESTYGVQHHQPRHTR--EKRFTDVIHSTISQGGRVLIPAY 251
            + HL        S    +I  ++   + QPR  +  E+  T+V+  T+   G VLI   
Sbjct: 180 REIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLE-TLRGDGNVLIAVD 238

Query: 252 ALGRAQELLLILDEYWANHPELQNIPIYYASPLAKKCLTVYE---TYTLSMNDRI----Q 304
             GR  EL  +LD+ W        + +Y  + L      V E   +    M+D++    +
Sbjct: 239 TAGRVLELAQLLDQIWRTKDA--GLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFE 296

Query: 305 NAKSNPFAFKHISALSSI-DIFKDVGPSVVMASPGGLQSGLSRQLFDMWCSDKKNSCVIP 363
           + ++NPF F+H+S    + D+ +   P VV+AS   L+ G SR LF  WC D KNS ++ 
Sbjct: 297 DKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILT 356

Query: 364 GYVVEGTLAKTILNEPKE 381
                GTLA+ +++ P E
Sbjct: 357 YRTTPGTLARFLIDNPTE 374


>Y514_SYNY3 (Q55470) Hypothetical protein sll0514
          Length = 554

 Score = 79.7 bits (195), Expect = 2e-14
 Identities = 100/427 (23%), Positives = 173/427 (40%), Gaps = 69/427 (16%)

Query: 26  PLGAGNEVGRSCVYMTYKGKTVLFDCGIHPGYSGMAALPYFDEIDPSTVDVLLITHFHLD 85
           P G G   G  C+ +      +L DCG+      +AA       DP TVD++  +H H D
Sbjct: 19  PYGVGPRDGGICLELHLGPYRILLDCGLEDLTPLLAA-------DPGTVDLVFCSHAHRD 71

Query: 86  HAASLPYFLEKTTFKGRVFMTYATKAIYKLLLSDYVKVSKVSVDDMLYDEQDINRSMDKI 145
           H   L  F ++           A++   +LL  ++        D+ +             
Sbjct: 72  HGLGLWQFHQQFPH----IPILASEVTQRLLPLNWP-------DEFV---------PPFC 111

Query: 146 EVIDFHQTVEV-NGIRFWCYTAGHVLGAAMFMVDIAG----VRVLYTGDYSREEDRHLRA 200
            V+ +    EV  G+      AGH+ GAA+ +++        RV+YTGDY      HL+ 
Sbjct: 112 RVLPWRSPQEVLPGLTVELLPAGHLPGAALILLEYHNGDRLYRVIYTGDYCLS---HLQL 168

Query: 201 AE----TPQ--FSPDVCIIESTYGVQHHQPRHTREKRFTDVIHSTISQGGRVLIPAYALG 254
            +    TP     PDV I+E  YG +    R  +EK+F   I + +++G  +L+P   LG
Sbjct: 169 VDGLALTPLRGLKPDVLILEGHYGNRRLPHRRQQEKQFIQAIETVLAKGRNILLPVPPLG 228

Query: 255 RAQELLLILDEYWANHPEL--QNIPIYYASPLAKKCLTVYETYTLSMNDRIQN-AKSNPF 311
            AQE+L +L      H +   + + ++    +A+ C   Y+     + D ++N A+  P 
Sbjct: 229 LAQEILKLL----RTHHQFTGRQVNLWAGESVARGC-DAYQGIIDHLPDNVRNFAQHQPL 283

Query: 312 -----AFKHISALSSIDIFKDV-GPSVVMASPGGLQSGLSRQLFDMWCSDKKNSCVIPGY 365
                 + H+  L+       +  PS+V+ +           L  +W         +P  
Sbjct: 284 FWDDKVYPHLRPLTDDQGELSLSAPSIVITTTWPAFWPSPAALPGLWTVFMPQLLTLPSC 343

Query: 366 VVEGTLAKTILNEPKEVTLMNGLSAPLHMQVHYISFSAHADSAQTSAFLEELNPPNIILV 425
           +V    A   L E  +  L + L A             H+D   T+  +  L P +++ V
Sbjct: 344 LV--NFAWQDLEEFPKYELEDYLLAD------------HSDGRNTTQLIHNLRPQHLVFV 389

Query: 426 HGAANEM 432
           HG  +++
Sbjct: 390 HGQPSDI 396


>YJ70_CORGL (P54122) Hypothetical UPF0036 protein Cgl1970/cg2160
          Length = 718

 Score = 45.1 bits (105), Expect = 6e-04
 Identities = 26/80 (32%), Positives = 41/80 (50%), Gaps = 6/80 (7%)

Query: 22  LIVTPLGAGNEVGRSCVYMTYKGKTVLFDCGIHPGYSGMAA----LPYFDEIDPST--VD 75
           L +  LG  +E+GR+     Y  + ++ DCG+    SG       LP F  I+     VD
Sbjct: 155 LRIYALGGISEIGRNMTVFEYNNRLLIVDCGVLFPSSGEPGVDLILPDFGPIEDHLHRVD 214

Query: 76  VLLITHFHLDHAASLPYFLE 95
            L++TH H DH  ++P+ L+
Sbjct: 215 ALVVTHGHEDHIGAIPWLLK 234


>Y139_MYCGE (P47385) Hypothetical UPF0036 protein MG139
          Length = 569

 Score = 42.7 bits (99), Expect = 0.003
 Identities = 36/133 (27%), Positives = 61/133 (45%), Gaps = 15/133 (11%)

Query: 28  GAGNEVGRSCVYMTYKGKTVLFDCGIH------PGYSGMAALPYFDEI--DPSTVDVLLI 79
           G   EVG++   + Y  + ++ DCGI        G +G+  +P F+ +  + S V  L I
Sbjct: 22  GGIQEVGKNMYGIEYDDEIIIIDCGIKFASDDLLGINGI--IPSFEHLIENQSKVKALFI 79

Query: 80  THFHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLSDYVKVSKVSVDDMLYDEQDIN 139
           TH H DH   +PY L++      + + YA +    L+L   V   K +  + +    D +
Sbjct: 80  THGHEDHIGGVPYLLKQVD----IPVIYAPRIAASLILKK-VNEHKDAKLNKIVTFDDFS 134

Query: 140 RSMDKIEVIDFHQ 152
               K   IDF++
Sbjct: 135 EFQTKHFKIDFYR 147


>K2C4_HUMAN (P19013) Keratin, type II cytoskeletal 4 (Cytokeratin 4)
           (K4) (CK4)
          Length = 534

 Score = 42.4 bits (98), Expect = 0.004
 Identities = 45/172 (26%), Positives = 77/172 (44%), Gaps = 31/172 (18%)

Query: 531 IQSRLKQIYESVEP---SVDEESGVPMLLVHDRVTVKHESE----KHVSLHWASDPINDM 583
           +QS LK + +SVE      +EE        +D V +K + +      V L    D +ND 
Sbjct: 223 LQSELKTMQDSVEDFKTKYEEEINKRTAAENDFVVLKKDVDAAYLNKVELEAKVDSLNDE 282

Query: 584 ------------------VSDSVVALVLNINR--DLPKIVAESDAT--KIEEENEKKTEK 621
                             VSD+ V L ++ NR  DL  I+AE  A   +I + ++ + E 
Sbjct: 283 INFLKVLYDAELSQMQTHVSDTSVVLSMDNNRNLDLDSIIAEVRAQYEEIAQRSKAEAEA 342

Query: 622 VMQALLNSLFGNVKVGENGKLIINIDGNVAELNKESGEVESENEGLKERVRT 673
           + Q  +  L   + V ++G  + N    +AELN+    + +E E +K++ +T
Sbjct: 343 LYQTKVQQL--QISVDQHGDNLKNTKSEIAELNRMIQRLRAEIENIKKQCQT 392


>RNZ_CLOTE (Q892B5) Ribonuclease Z (EC 3.1.26.11) (RNase Z) (tRNA 3
           endonuclease)
          Length = 312

 Score = 42.0 bits (97), Expect = 0.005
 Identities = 26/84 (30%), Positives = 35/84 (40%), Gaps = 10/84 (11%)

Query: 24  VTPLGAGN-----EVGRSCVYMTYKGKTVLFDCGIHPGYSGMAALPYFDEIDPSTVDVLL 78
           +T LG G      E   S   + YKG+ +L DCG      G        +     +D++ 
Sbjct: 4   ITLLGTGGGMPTPERNLSAAILNYKGRKILIDCG-----EGTQVSMKISKTGFKNIDIIC 58

Query: 79  ITHFHLDHAASLPYFLEKTTFKGR 102
           ITH+H DH   LP  L      GR
Sbjct: 59  ITHWHGDHIVGLPGLLATMGNSGR 82


>Y139_MYCPN (P75497) Hypothetical UPF0036 protein MG139 homolog
           (A65_orf569)
          Length = 569

 Score = 41.6 bits (96), Expect = 0.007
 Identities = 37/136 (27%), Positives = 60/136 (43%), Gaps = 21/136 (15%)

Query: 28  GAGNEVGRSCVYMTYKGKTVLFDCGIHPGYSGMAAL----PYFDEI--DPSTVDVLLITH 81
           G   EVG++   + Y  + ++ DCGI      +  +    P F+ +  + + V  L ITH
Sbjct: 22  GGIQEVGKNMYGIEYDDEIIIIDCGIKFASDDLLGIDGIIPSFEYLIENQAKVKALFITH 81

Query: 82  FHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLSDY-----VKVSKVSVDDMLYDEQ 136
            H DH   +PY L++      V + YA +    L+L         K++KV V D      
Sbjct: 82  GHEDHIGGVPYLLKQVD----VPVIYAPRIAASLILKKVNEHKDAKLNKVVVYD------ 131

Query: 137 DINRSMDKIEVIDFHQ 152
           D +    K   IDF++
Sbjct: 132 DFSNFETKHFKIDFYR 147


>ROO_DESGI (Q9F0J6) Rubredoxin-oxygen oxidoreductase (EC 1.-.-.-)
          (ROO) (Rubredoxin oxidase)
          Length = 402

 Score = 41.6 bits (96), Expect = 0.007
 Identities = 22/57 (38%), Positives = 28/57 (48%), Gaps = 1/57 (1%)

Query: 39 YMTYKGKTVLFDCGIHPGYSGMAALPYFDEIDPSTVDVLLITHFHLDHAASLPYFLE 95
          Y+    KT LFD  +   Y G         IDP  +D L+I H  LDHA +LP  +E
Sbjct: 38 YLVEDEKTTLFDT-VKAEYKGELLCGIASVIDPKKIDYLVIQHLELDHAGALPALIE 93


  Database: sprot
    Posted date:  Nov 25, 2004 10:54 AM
  Number of letters in database: 59,974,054
  Number of sequences in database:  164,201
  
Lambda     K      H
   0.317    0.134    0.383 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 
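
The gapped Lambda and K above relate the raw scores shown in parentheses in each alignment header to the bit scores printed beside them, through the standard Karlin-Altschul normalization S' = (Lambda * S - ln K) / ln 2. A minimal sketch, assuming the rounded values printed in this report; the function name `bit_score` is illustrative:

```python
import math

# Gapped Karlin-Altschul parameters as printed in this report (rounded).
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410

def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Normalize a raw alignment score to bits: (lambda*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)

# Raw scores taken from the alignment headers of this report.
for raw in (1830, 482, 195, 76):
    print(raw, round(bit_score(raw), 1))
```

Because Lambda and K are printed to only three significant figures, the recomputed values can differ from the reported bit scores by a fraction of a bit, most visibly for the highest-scoring hits.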


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 79,049,070
Number of Sequences: 164201
Number of extensions: 3388022
Number of successful extensions: 9803
Number of sequences better than 10.0: 113
Number of HSP's better than 10.0 without gapping: 31
Number of HSP's successfully gapped in prelim test: 82
Number of HSP's that attempted gapping in prelim test: 9640
Number of HSP's gapped (non-prelim): 145
length of query: 690
length of database: 59,974,054
effective HSP length: 117
effective length of query: 573
effective length of database: 40,762,537
effective search space: 23356933701
effective search space used: 23356933701
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 69 (31.2 bits)
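
The effective lengths and search space listed above can be checked by hand: the effective HSP length is subtracted once from the query and once per database sequence, and the expected number of chance hits is the search space times 2 to the power of minus the bit score. A minimal sketch using only numbers printed in this report; the variable and function names are illustrative:

```python
# Values copied from the statistics block of this report.
query_len = 690
db_letters = 59_974_054
db_seqs = 164_201
hsp_len = 117

# Edge correction: an alignment cannot start within hsp_len of a sequence end.
eff_query = query_len - hsp_len
eff_db = db_letters - db_seqs * hsp_len
search_space = eff_query * eff_db

def expect(bits, space=search_space):
    """Expected number of chance alignments scoring at least `bits`."""
    return space * 2.0 ** (-bits)

print(eff_query, eff_db, search_space)
print(expect(79.7), expect(33.9))
```

The three integers agree with the reported effective query length, effective database length, and effective search space. The two E values come out close to the reported 2e-14 (Y514_SYNY3) and 1.5 (second CPSB_XENLA alignment), with small differences due to the rounding of the printed bit scores.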

