Medicago
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= AC148607.14 - phase: 0 
         (359 letters)

Database: nr 
           2,540,612 sequences; 863,360,394 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAM65430.1| unknown [Arabidopsis thaliana] gi|20259567|gb|AAM...   424  e-117
gb|AAC16751.1| Contains similarity to pre-mRNA processing protei...   424  e-117
dbj|BAB11095.1| unnamed protein product [Arabidopsis thaliana]        175  2e-42
ref|NP_199452.2| expressed protein [Arabidopsis thaliana]             175  2e-42
ref|XP_482102.1| unknown protein [Oryza sativa (japonica cultiva...   165  2e-39
gb|AAH88586.1| LOC496864 protein [Xenopus tropicalis]                  99  2e-19
ref|NP_060392.2| PRP39 pre-mRNA processing factor 39 homolog [Ho...    97  7e-19
gb|AAH51886.1| PRPF39 protein [Homo sapiens]                           97  7e-19
ref|XP_537427.1| PREDICTED: similar to PRP39 pre-mRNA processing...    94  5e-18
ref|XP_234238.3| PREDICTED: similar to PRP39 pre-mRNA processing...    92  3e-17
emb|CAD87784.1| novel protein similar to pre-mRNA processing pro...    90  9e-17
ref|XP_392380.2| PREDICTED: similar to novel protein similar to ...    89  2e-16
gb|EAA50799.1| hypothetical protein MG04558.4 [Magnaporthe grise...    88  5e-16
emb|CAG09750.1| unnamed protein product [Tetraodon nigroviridis]       85  3e-15
gb|EAA68105.1| hypothetical protein FG01244.1 [Gibberella zeae P...    84  7e-15
gb|EAA64755.1| hypothetical protein AN1635.2 [Aspergillus nidula...    80  7e-14
gb|EAL89886.1| conserved hypothetical protein [Aspergillus fumig...    77  8e-13
gb|AAO41607.1| CG1646-PC, isoform C [Drosophila melanogaster] gi...    75  2e-12
ref|XP_581849.1| PREDICTED: similar to PRP39 pre-mRNA processing...    75  2e-12
gb|AAO41608.2| CG1646-PD, isoform D [Drosophila melanogaster] gi...    75  2e-12

>gb|AAM65430.1| unknown [Arabidopsis thaliana] gi|20259567|gb|AAM14126.1| unknown
           protein [Arabidopsis thaliana]
           gi|15810565|gb|AAL07170.1| unknown protein [Arabidopsis
           thaliana] gi|18379230|ref|NP_563700.1|
           hydroxyproline-rich glycoprotein family protein
           [Arabidopsis thaliana]
          Length = 768

 Score =  424 bits (1091), Expect = e-117
 Identities = 212/321 (66%), Positives = 254/321 (79%), Gaps = 5/321 (1%)

Query: 12  DQIVKLYERCVIACANYPEYWIRYVLCMEASESMDLANNVLARASQVFVKRQPEIHLFCA 71
           +++VKLYERCV+ CANYPEYWIRYV  MEAS S DLA N LARA+QVFVK+QPEIHLF A
Sbjct: 378 NKVVKLYERCVVTCANYPEYWIRYVTNMEASGSADLAENALARATQVFVKKQPEIHLFAA 437

Query: 72  RFKEQAGDIVGARAAYQLVHTEISPGLLEAIIRHANMEHRLGKLEDAFSLYEQAIAIEKG 131
           R KEQ GDI GARAAYQLVH+EISPGLLEA+I+HANME+RLG L+DAFSLYEQ IA+EKG
Sbjct: 438 RLKEQNGDIAGARAAYQLVHSEISPGLLEAVIKHANMEYRLGNLDDAFSLYEQVIAVEKG 497

Query: 132 KEHSQTLPMLFAQYSRFVYLASGNSEKAREILVGGLENASLSKPLLEALLHFEAIQPQPK 191
           KEHS  LP+L+AQYSRF YL S ++EKAR I+V  L++   SKPL+EAL+HFEAIQP P+
Sbjct: 498 KEHSTILPLLYAQYSRFSYLVSRDAEKARRIIVEALDHVQPSKPLMEALIHFEAIQPPPR 557

Query: 192 RVDIDFLESLVVKFITPNPENPGVASATEREELSNIFLEFLNLFGDVQSIKRAEDRHAKL 251
             +ID+LE LV K I P+ +   +AS+TEREELS I++EFL +FGDV+SIK+AED+H KL
Sbjct: 558 --EIDYLEPLVEKVIKPDADAQNIASSTEREELSLIYIEFLGIFGDVKSIKKAEDQHVKL 615

Query: 252 FLPNRGLSELKKRHAEDFLASDKTKVSRAYSAQSPAQSVAGAYPNGPNQWP-NYGVQPQT 310
           F P+R  SELKKR A+DFLASD+TK+++ Y+   PAQ V+ AYPN   QW   Y  QPQT
Sbjct: 616 FYPHRSTSELKKRSADDFLASDRTKMAKTYNGTPPAQPVSNAYPNAQAQWSGGYAAQPQT 675

Query: 311 WP--ATTQAQGQQWPAGYTQQ 329
           WP      AQ QQW   Y QQ
Sbjct: 676 WPPAQAAPAQPQQWNPAYGQQ 696


>gb|AAC16751.1| Contains similarity to pre-mRNA processing protein PRP39 gb|L29224
           from S. cerevisiae.  ESTs gb|R64908 and gb|T88158,
           gb|N38703 and gb|AA651043 come from this gene.
           [Arabidopsis thaliana] gi|7485869|pir||T00964
           hypothetical protein F20D22.14 - Arabidopsis thaliana
          Length = 1345

 Score =  424 bits (1091), Expect = e-117
 Identities = 212/321 (66%), Positives = 254/321 (79%), Gaps = 5/321 (1%)

Query: 12  DQIVKLYERCVIACANYPEYWIRYVLCMEASESMDLANNVLARASQVFVKRQPEIHLFCA 71
           +++VKLYERCV+ CANYPEYWIRYV  MEAS S DLA N LARA+QVFVK+QPEIHLF A
Sbjct: 378 NKVVKLYERCVVTCANYPEYWIRYVTNMEASGSADLAENALARATQVFVKKQPEIHLFAA 437

Query: 72  RFKEQAGDIVGARAAYQLVHTEISPGLLEAIIRHANMEHRLGKLEDAFSLYEQAIAIEKG 131
           R KEQ GDI GARAAYQLVH+EISPGLLEA+I+HANME+RLG L+DAFSLYEQ IA+EKG
Sbjct: 438 RLKEQNGDIAGARAAYQLVHSEISPGLLEAVIKHANMEYRLGNLDDAFSLYEQVIAVEKG 497

Query: 132 KEHSQTLPMLFAQYSRFVYLASGNSEKAREILVGGLENASLSKPLLEALLHFEAIQPQPK 191
           KEHS  LP+L+AQYSRF YL S ++EKAR I+V  L++   SKPL+EAL+HFEAIQP P+
Sbjct: 498 KEHSTILPLLYAQYSRFSYLVSRDAEKARRIIVEALDHVQPSKPLMEALIHFEAIQPPPR 557

Query: 192 RVDIDFLESLVVKFITPNPENPGVASATEREELSNIFLEFLNLFGDVQSIKRAEDRHAKL 251
             +ID+LE LV K I P+ +   +AS+TEREELS I++EFL +FGDV+SIK+AED+H KL
Sbjct: 558 --EIDYLEPLVEKVIKPDADAQNIASSTEREELSLIYIEFLGIFGDVKSIKKAEDQHVKL 615

Query: 252 FLPNRGLSELKKRHAEDFLASDKTKVSRAYSAQSPAQSVAGAYPNGPNQWP-NYGVQPQT 310
           F P+R  SELKKR A+DFLASD+TK+++ Y+   PAQ V+ AYPN   QW   Y  QPQT
Sbjct: 616 FYPHRSTSELKKRSADDFLASDRTKMAKTYNGTPPAQPVSNAYPNAQAQWSGGYAAQPQT 675

Query: 311 WP--ATTQAQGQQWPAGYTQQ 329
           WP      AQ QQW   Y QQ
Sbjct: 676 WPPAQAAPAQPQQWNPAYGQQ 696


>dbj|BAB11095.1| unnamed protein product [Arabidopsis thaliana]
          Length = 1022

 Score =  175 bits (443), Expect = 2e-42
 Identities = 98/252 (38%), Positives = 150/252 (58%), Gaps = 4/252 (1%)

Query: 12  DQIVKLYERCVIACANYPEYWIRYVLCMEASESMDLANNVLARASQVFVKRQPEIHLFCA 71
           D  + LYERC+I CANY E+W RYV  +E+    +LAN  LARASQ FVK    IHLF A
Sbjct: 315 DWAINLYERCLIPCANYTEFWFRYVDFVESKGGRELANFALARASQTFVKSASVIHLFNA 374

Query: 72  RFKEQAGDIVGARAAYQLVHTEISPGLLEAIIRHANMEHRLGKLEDAFSLYEQAI-AIEK 130
           RFKE  GD   A  A      E+  G +E + + ANME RLG  E A + Y +A+     
Sbjct: 375 RFKEHVGDASAASVALSRCGEELGFGFVENVTKKANMEKRLGNFEAAVTTYREALNKTLI 434

Query: 131 GKEHSQTLPMLFAQYSRFVYLASGNSEKAREILVGGLENASLSKPLLEALLHFEAIQPQP 190
           GKE+ +T   L+ Q+SR  Y+ + +++ A +IL+ G EN    K LLE L+    +    
Sbjct: 435 GKENLETTARLYVQFSRLKYVITNSADDAAQILLEGNENVPHCKLLLEELMRLLMMHGGS 494

Query: 191 KRVDIDFLESLVVKFITPNPENPGVASATEREELSNIFLEFLNLFGDVQSIKRAEDRHAK 250
           ++VD+  L+ ++ K ++   ++    SA ++EE+SN+++EF++L G +  +++A  RH K
Sbjct: 495 RQVDL--LDPIIDKELSHQADSSDGLSAEDKEEISNLYMEFIDLSGTIHDVRKALGRHIK 552

Query: 251 LFLPNRGLSELK 262
           LF P+   ++L+
Sbjct: 553 LF-PHSARAKLR 563


>ref|NP_199452.2| expressed protein [Arabidopsis thaliana]
          Length = 1036

 Score =  175 bits (443), Expect = 2e-42
 Identities = 98/252 (38%), Positives = 150/252 (58%), Gaps = 4/252 (1%)

Query: 12  DQIVKLYERCVIACANYPEYWIRYVLCMEASESMDLANNVLARASQVFVKRQPEIHLFCA 71
           D  + LYERC+I CANY E+W RYV  +E+    +LAN  LARASQ FVK    IHLF A
Sbjct: 315 DWAINLYERCLIPCANYTEFWFRYVDFVESKGGRELANFALARASQTFVKSASVIHLFNA 374

Query: 72  RFKEQAGDIVGARAAYQLVHTEISPGLLEAIIRHANMEHRLGKLEDAFSLYEQAI-AIEK 130
           RFKE  GD   A  A      E+  G +E + + ANME RLG  E A + Y +A+     
Sbjct: 375 RFKEHVGDASAASVALSRCGEELGFGFVENVTKKANMEKRLGNFEAAVTTYREALNKTLI 434

Query: 131 GKEHSQTLPMLFAQYSRFVYLASGNSEKAREILVGGLENASLSKPLLEALLHFEAIQPQP 190
           GKE+ +T   L+ Q+SR  Y+ + +++ A +IL+ G EN    K LLE L+    +    
Sbjct: 435 GKENLETTARLYVQFSRLKYVITNSADDAAQILLEGNENVPHCKLLLEELMRLLMMHGGS 494

Query: 191 KRVDIDFLESLVVKFITPNPENPGVASATEREELSNIFLEFLNLFGDVQSIKRAEDRHAK 250
           ++VD+  L+ ++ K ++   ++    SA ++EE+SN+++EF++L G +  +++A  RH K
Sbjct: 495 RQVDL--LDPIIDKELSHQADSSDGLSAEDKEEISNLYMEFIDLSGTIHDVRKALGRHIK 552

Query: 251 LFLPNRGLSELK 262
           LF P+   ++L+
Sbjct: 553 LF-PHSARAKLR 563


>ref|XP_482102.1| unknown protein [Oryza sativa (japonica cultivar-group)]
           gi|40253684|dbj|BAD05627.1| unknown protein [Oryza
           sativa (japonica cultivar-group)]
           gi|40253455|dbj|BAD05406.1| unknown protein [Oryza
           sativa (japonica cultivar-group)]
          Length = 1161

 Score =  165 bits (418), Expect = 2e-39
 Identities = 91/244 (37%), Positives = 144/244 (58%), Gaps = 4/244 (1%)

Query: 12  DQIVKLYERCVIACANYPEYWIRYVLCMEASESMDLANNVLARASQVFVKRQPEIHLFCA 71
           D  VKLYERC+I CANY E+WIRY   ++A    ++A+  L RAS  FVK  P  H++ A
Sbjct: 278 DWAVKLYERCLIPCANYSEFWIRYAEFVDAKGGREIASYALGRASSYFVKGVPTFHMYYA 337

Query: 72  RFKEQAGDIVGARAAYQLVHTEISPGLLEAIIRHANMEHRLGKLEDAFSLYEQAIAIEKG 131
            FKEQ GD  GAR+ +      ++      I R ANME R+G  + A  +YE AI  +  
Sbjct: 338 MFKEQIGDAQGARSLFIEGSNNLTSNFCANINRLANMEKRMGNTKAASEIYETAIQ-DAM 396

Query: 132 KEHSQTLPMLFAQYSRFVYLASGNSEKAREILVGGLENASLSKPLLEALLHFEAIQPQPK 191
           +++ + LP L+  +++F Y  + N  +A+E+ V G++ A   K L++  + F +    P 
Sbjct: 397 QKNVKILPDLYTNFAQFKYAVNHNISEAKEVFVEGIKQAP-CKALIKGFMQFMSTHGGP- 454

Query: 192 RVDIDFLESLVVKFITPNPENPGVASATEREELSNIFLEFLNLFGDVQSIKRAEDRHAKL 251
             +I  L+S++   + P  +   V S  +RE++S +FLEF++L+GDV+ +++A  RH+KL
Sbjct: 455 -TEIPILDSVISNAVVPGSDISTVLSREDREDISLLFLEFVDLYGDVRDLRKAWARHSKL 513

Query: 252 FLPN 255
           F  N
Sbjct: 514 FPHN 517


>gb|AAH88586.1| LOC496864 protein [Xenopus tropicalis]
          Length = 656

 Score = 99.4 bits (246), Expect = 2e-19
 Identities = 86/277 (31%), Positives = 132/277 (47%), Gaps = 17/277 (6%)

Query: 12  DQIVKLYERCVIACANYPEYWIRYVLCMEASESMDLANNVLARASQVFVKRQPEIHLFCA 71
           ++IV LYERC++ACA Y E+W+ YV  ME   S++ A  +L RA  + +  +P + L+ A
Sbjct: 326 ERIVTLYERCLVACALYEEFWLSYVQYME-PHSIEAARCILQRACCIHLPLKPTLSLYWA 384

Query: 72  RFKEQAGDIVGARAA-YQLVHTEISPGLLEAIIRHANMEHRLGKLEDAFSLYEQAIAIEK 130
            F+E+ G I  AR+  Y L    + PGL    +R  ++E R G LE+A  L E+A+    
Sbjct: 385 AFEEKHGQIDTARSVLYDL--ENLMPGLAMVRLRRVSLERRTGNLEEAEHLLEEAVKSSL 442

Query: 131 GKEHSQTLPMLFAQYSRFVYLASGNSEKAREILVGGLENASLSKPLLEALLHFEAIQPQP 190
           G E +    +  A   R +    GN EKAR++L   LE    +  L   LL  E +  + 
Sbjct: 443 GTELAAFYSIKLA---RLLLKLQGNMEKARKVLTEALEKEPDNPRLHLCLLEIE-VSREG 498

Query: 191 KRVDIDFLESLVVKFITPNPENPGVASATEREELSNIFLEFL-NLFGDVQSIKRAEDRHA 249
            + + D L  L V+    +       S   ++ +S   LEFL +   ++ S+  A D H 
Sbjct: 499 SQGEADAL--LCVERALKSS-----LSDDFKKMISQRRLEFLEDNSSNITSVLSAYDEHQ 551

Query: 250 KLFLPNRGLSELKKRHAEDFLASDKTKVSRAYSAQSP 286
           K FL    L    +  +ED     K K        +P
Sbjct: 552 K-FLKQEELKRQAENGSEDETEEKKPKTETVVHISAP 587


>ref|NP_060392.2| PRP39 pre-mRNA processing factor 39 homolog [Homo sapiens]
          Length = 548

 Score = 97.1 bits (240), Expect = 7e-19
 Identities = 88/306 (28%), Positives = 133/306 (42%), Gaps = 26/306 (8%)

Query: 12  DQIVKLYERCVIACANYPEYWIRYVLCMEASESMDLANNVLARASQVFVKRQPEIHLFCA 71
           +++V L+ERCVI+CA Y E+WI+Y   ME + S++   +V +RA  + + ++P +H+  A
Sbjct: 249 ERVVVLFERCVISCALYEEFWIKYAKYME-NHSIEGVRHVFSRACTIHLPKKPMVHMLWA 307

Query: 72  RFKEQAGDIVGARAAYQLVHTEISPGLLEAIIRHANMEHRLGKLEDAFSLYEQAIAIEKG 131
            F+EQ G+I  AR   +    E   GL    +R  ++E R G LE+A  L + AI   K 
Sbjct: 308 AFEEQQGNINEARNILK-TFEECVLGLAMVRLRRVSLERRHGNLEEAEHLLQDAIKNAKS 366

Query: 132 KEHSQTLPMLFAQYSRFVYLASGNSEKAREILVGGLENASLSKPLLEALLHFEAIQPQPK 191
              S    +  A   R ++    N  K+R++L+  +E    +  L   LL  E      K
Sbjct: 367 NNESSFYAVKLA---RHLFKIQKNLPKSRKVLLEAIERDKENTKLYLNLLEME-YSGDLK 422

Query: 192 RVDIDFLESLVVKFITPNPENPGVASATEREELSNIFLEFLNLFG-DVQSIKRAEDRHAK 250
           + + + L          +    G      R   S   +EFL  FG DV  +  A D H  
Sbjct: 423 QNEENILNCF-------DKAVHGSLPIKMRITFSQRKVEFLEDFGSDVNKLLNAYDEHQT 475

Query: 251 LFLPNRGLS----------ELKKRHAEDFLASDKTKVSRAYSAQSPAQSVAGAYP-NGPN 299
           L      L           E KK H ED  +S    +     A     + +  Y  N  N
Sbjct: 476 LLKEQDSLKRKAENGSEEPEEKKAHTEDTTSSSTQMIDGDLQANQAVYNYSAWYQYNYQN 535

Query: 300 QWPNYG 305
            W NYG
Sbjct: 536 PW-NYG 540


>gb|AAH51886.1| PRPF39 protein [Homo sapiens]
          Length = 479

 Score = 97.1 bits (240), Expect = 7e-19
 Identities = 88/306 (28%), Positives = 133/306 (42%), Gaps = 26/306 (8%)

Query: 12  DQIVKLYERCVIACANYPEYWIRYVLCMEASESMDLANNVLARASQVFVKRQPEIHLFCA 71
           +++V L+ERCVI+CA Y E+WI+Y   ME + S++   +V +RA  + + ++P +H+  A
Sbjct: 180 ERVVVLFERCVISCALYEEFWIKYAKYME-NHSIEGVRHVFSRACTIHLPKKPMVHMLWA 238

Query: 72  RFKEQAGDIVGARAAYQLVHTEISPGLLEAIIRHANMEHRLGKLEDAFSLYEQAIAIEKG 131
            F+EQ G+I  AR   +    E   GL    +R  ++E R G LE+A  L + AI   K 
Sbjct: 239 AFEEQQGNINEARNILK-TFEECVLGLAMVRLRRVSLERRHGNLEEAEHLLQDAIKNAKS 297

Query: 132 KEHSQTLPMLFAQYSRFVYLASGNSEKAREILVGGLENASLSKPLLEALLHFEAIQPQPK 191
              S    +  A   R ++    N  K+R++L+  +E    +  L   LL  E      K
Sbjct: 298 NNESSFYAVKLA---RHLFKIQKNLPKSRKVLLEAIERDKENTKLYLNLLEME-YSGDLK 353

Query: 192 RVDIDFLESLVVKFITPNPENPGVASATEREELSNIFLEFLNLFG-DVQSIKRAEDRHAK 250
           + + + L          +    G      R   S   +EFL  FG DV  +  A D H  
Sbjct: 354 QNEENILNCF-------DKAVHGSLPIKMRITFSQRKVEFLEDFGSDVNKLLNAYDEHQT 406

Query: 251 LFLPNRGLS----------ELKKRHAEDFLASDKTKVSRAYSAQSPAQSVAGAYP-NGPN 299
           L      L           E KK H ED  +S    +     A     + +  Y  N  N
Sbjct: 407 LLKEQDSLKRKAENGSEEPEEKKAHTEDTTSSSTQMIDGDLQANQAVYNYSAWYQYNYQN 466

Query: 300 QWPNYG 305
            W NYG
Sbjct: 467 PW-NYG 471


>ref|XP_537427.1| PREDICTED: similar to PRP39 pre-mRNA processing factor 39 homolog
           [Canis familiaris]
          Length = 712

 Score = 94.4 bits (233), Expect = 5e-18
 Identities = 87/306 (28%), Positives = 133/306 (43%), Gaps = 26/306 (8%)

Query: 12  DQIVKLYERCVIACANYPEYWIRYVLCMEASESMDLANNVLARASQVFVKRQPEIHLFCA 71
           +++V L+ERCVI+CA Y E+WI+Y   ME + S++   +V +RA  + + ++P +H+  A
Sbjct: 413 ERVVVLFERCVISCALYEEFWIKYAKYME-NHSIEGVRHVFSRACTIHLPKKPMVHMLWA 471

Query: 72  RFKEQAGDIVGARAAYQLVHTEISPGLLEAIIRHANMEHRLGKLEDAFSLYEQAIAIEKG 131
            F+EQ G+I  AR   +    E   GL    +R  ++E R   +E+A  L + AI   K 
Sbjct: 472 AFEEQQGNINEARNILR-TFEECVLGLAMVRLRRVSLERRHENMEEAEHLLQDAIKNAKS 530

Query: 132 KEHSQTLPMLFAQYSRFVYLASGNSEKAREILVGGLENASLSKPLLEALLHFEAIQPQPK 191
              S    +  A   R ++    N  K+R++L+  +E    +  L   LL  E      K
Sbjct: 531 NNESSFYAIKLA---RHLFKIQKNLPKSRKVLLEAIERDKENTKLYLNLLEME-YSGDLK 586

Query: 192 RVDIDFLESLVVKFITPNPENPGVASATEREELSNIFLEFLNLFG-DVQSIKRAEDRHAK 250
           + + + L          +    G      R   S   +EFL  FG DV  +  A D H  
Sbjct: 587 QNEENILNCF-------DKAIHGSLPIKMRITFSQRKVEFLEDFGSDVNKLLNAYDEHQT 639

Query: 251 LFLPNRGLS----------ELKKRHAEDFLASDKTKVSRAYSAQSPAQSVAGAYP-NGPN 299
           L      L           E KK H ED  +S    +     A   A + +  Y  N  N
Sbjct: 640 LLKEQDSLKRKAENGSEEPEEKKAHTEDTSSSSTQMIDGDLQANQAAYNYSAWYQYNYQN 699

Query: 300 QWPNYG 305
            W NYG
Sbjct: 700 PW-NYG 704


>ref|XP_234238.3| PREDICTED: similar to PRP39 pre-mRNA processing factor 39 homolog
           [Rattus norvegicus]
          Length = 709

 Score = 91.7 bits (226), Expect = 3e-17
 Identities = 89/306 (29%), Positives = 132/306 (43%), Gaps = 28/306 (9%)

Query: 12  DQIVKLYERCVIACANYPEYWIRYVLCMEASESMDLANNVLARASQVFVKRQPEIHLFCA 71
           +++V L+ERCVI+CA Y E+WI+Y   ME + S++   +V +RA  V + ++P  H+  A
Sbjct: 412 ERVVVLFERCVISCALYEEFWIKYAKYME-NHSIEGVRHVFSRACTVHLPKKPMAHMLWA 470

Query: 72  RFKEQAGDIVGARAAYQLVHTEISPGLLEAIIRHANMEHRLGKLEDAFSLYEQAIAIEKG 131
            F+EQ G+I  AR   +    E   GL    +R  ++E R G +E+A  L + AI   K 
Sbjct: 471 AFEEQQGNINEARIILR-TFEECVLGLAMVRLRRVSLERRHGNMEEAEHLLQDAIRNAKS 529

Query: 132 KEHSQTLPMLFAQYSRFVYLASGNSEKAREILVGGLENASLSKPLLEALLHFEAIQPQPK 191
              S    +  A   R ++    N  K+R++L+  +E    +  L   LL  E       
Sbjct: 530 NNESSFYAIKLA---RHLFKIQKNLPKSRKVLLEAIEKDKENTKLYLNLLEME------Y 580

Query: 192 RVDIDFLESLVVKFITPNPENPGVASATEREELSNIFLEFLNLFG-DVQSIKRAEDRHAK 250
             D+   E  ++     +    G      R   S   +EFL  FG DV  +  A D H  
Sbjct: 581 SCDLKQNEENILNCF--DKAIHGSLPIKMRITFSQRKVEFLEDFGSDVNKLLNAYDEHQT 638

Query: 251 LFLPNRGLS----------ELKKRHAEDFLASDKTKVSRAYSAQSPAQSVAGAYP-NGPN 299
           L      L           E KK H ED   S    +     A   A + +  Y  N  N
Sbjct: 639 LLKEQDTLKRKAENGSEEPEEKKAHTED--VSSAQIIDGDLQANQAAYNYSAWYQYNYQN 696

Query: 300 QWPNYG 305
            W NYG
Sbjct: 697 PW-NYG 701


>emb|CAD87784.1| novel protein similar to pre-mRNA processing proteins [Danio rerio]
           gi|52218898|ref|NP_001004520.1| PRP39 pre-mRNA
           processing factor 39 homolog [Danio rerio]
          Length = 752

 Score = 90.1 bits (222), Expect = 9e-17
 Identities = 78/310 (25%), Positives = 133/310 (42%), Gaps = 32/310 (10%)

Query: 12  DQIVKLYERCVIACANYPEYWIRYVLCMEASESMDLANNVLARASQVFVKRQPEIHLFCA 71
           +++V L+ERC+IACA Y E+WI+Y   +E S S +   ++  +A  V + ++P +HL  A
Sbjct: 445 ERVVVLFERCLIACALYEEFWIKYAKYLE-SYSTEAVRHIYKKACTVHLPKKPNVHLLWA 503

Query: 72  RFKEQAGDIVGARAAYQLVHTEISPGLLEAIIRHANMEHRLGKLEDAFSLYEQAIAIEKG 131
            F+EQ G I  AR+  + V   + PGL    +R  ++E R G +E+A +L + AI   + 
Sbjct: 504 AFEEQQGSIDEARSILKAVEVSV-PGLAMVRLRRVSLERRHGNMEEAEALLQDAITNGRN 562

Query: 132 KEHSQTLPMLFAQYSRFVYLASGNSEKAREILVGGLENASLSKPLLEALLHFE---AIQP 188
              S    +  A+    V  + G   +A+++L+  +E    +  L   LL  E    +Q 
Sbjct: 563 SSESSFYSVKLARQLVKVQKSIG---RAKKVLLEAVEKDETNPKLYLNLLELEYSGDVQQ 619

Query: 189 QPKRVDIDFLESLVVKFITPNPENPGVASATEREELSNIFLEFLNLFG-DVQSIKRAEDR 247
               +   F  +L               +   R   S   ++FL  FG D+ ++  A ++
Sbjct: 620 NEAEIIACFDRAL-----------SSSMALESRITFSQRKVDFLEDFGSDINTLMAAYEQ 668

Query: 248 HAKLFLPNRGLSELKKRHAEDFLA----SDKTKVSRAYSAQSPAQSVAGAYPN------- 296
           H +L           +  +E+  A    +D   V+        A      Y N       
Sbjct: 669 HQRLLAEQESFKRKAENGSEEPDAKRQRTDDQSVASGQMMDMQANHAGYNYNNWYQYNSW 728

Query: 297 -GPNQWPNYG 305
              N W  YG
Sbjct: 729 GSQNSWGQYG 738


>ref|XP_392380.2| PREDICTED: similar to novel protein similar to pre-mRNA processing
           proteins [Apis mellifera]
          Length = 946

 Score = 89.0 bits (219), Expect = 2e-16
 Identities = 79/302 (26%), Positives = 139/302 (45%), Gaps = 28/302 (9%)

Query: 12  DQIVKLYERCVIACANYPEYWIRYVLCMEA--SESMDLANNVLARASQVFVKRQPEIHLF 69
           ++I+ L+ERC+IACA Y E+W+R+V  +E+   ++++   +V  RA  V   ++P +HL 
Sbjct: 601 NRIIILFERCLIACALYDEFWMRFVRYLESLKGDNVEKIRDVYTRACTVHHPKKPNLHLQ 660

Query: 70  CARFKEQAGDIVGARAAYQLVHTEISPGLLEAIIRHANMEHRLGKLEDAFSLYEQAIAIE 129
            A F+E  G+   A    + +   I P +L+   R  N+E R G L+ A +LYE  I+  
Sbjct: 661 WATFEEGQGNFEKAANILENIDNVI-PNMLQVAYRRINLERRRGDLDKACTLYENYISNS 719

Query: 130 KGKEHSQTLPMLFAQYSRFVYLASGNSEKAREILVGGLENASLSKPLLEALLHFEAIQPQ 189
           K +  +     +  +Y+RF+     + +KA ++L+   E     K      L+ + I   
Sbjct: 720 KNRTIANN---IVVKYARFLCKVKNDVDKAIKVLLKATE-----KDKDNPRLYLQLIDLG 771

Query: 190 PKRVDIDFLESLVVKFITPNPENPGVASATEREELSNIFLEFLNLFG-DVQSIKRAEDRH 248
            +R  +D  E +    +    E+   A   +R   +   +EFL  F  D++ I +A ++ 
Sbjct: 772 MQRTPVDTQEIVGYMDMFIEREH---ADLEQRVLFAQRKVEFLEDFSPDIRQILKAHEQF 828

Query: 249 AKLFLPNRGLSELKKRHAED----------FLASDKTKVSRAYSAQSPAQSVAGAYPNGP 298
            K     +   E KK   +D              D+T V    S  S +     + P+GP
Sbjct: 829 QKCI---KQAKERKKTKNDDTKTDTSPPKKVKTGDQTNVPPPPSVSSQSSYQYSSGPSGP 885

Query: 299 NQ 300
            Q
Sbjct: 886 YQ 887


>gb|EAA50799.1| hypothetical protein MG04558.4 [Magnaporthe grisea 70-15]
           gi|39945152|ref|XP_362113.1| hypothetical protein
           MG04558.4 [Magnaporthe grisea 70-15]
          Length = 480

 Score = 87.8 bits (216), Expect = 5e-16
 Identities = 62/229 (27%), Positives = 107/229 (46%), Gaps = 7/229 (3%)

Query: 17  LYERCVIACANYPEYWIRYVLCMEAS-ESMDLANNVLARASQVFVK-RQPEIHLFCARFK 74
           LYERC++ CA Y E+W RY   M A  +  +   N+  RA+ +FV   +P I L  A F+
Sbjct: 193 LYERCLVTCAFYDEFWFRYARWMSAQPDKTEEVRNIYLRAATIFVPISRPGIRLQFAYFE 252

Query: 75  EQAGDIVGARAAYQLVHTEISPGLLEAIIRHANMEHRLGKLEDAFSLYEQAIAIEKGKEH 134
           E  G +  AR  +  +   + PG +E II  AN+E R   ++ A  + +Q   IE  +  
Sbjct: 253 ESCGRVAMAREVHNAILLRL-PGCIEVIISLANLERRHNDIDTAIEVLKQ--QIESPEVD 309

Query: 135 SQTLPMLFAQYSRFVYLASGNSEKAREILVGGLENASLSKPLLEALLHFEAIQPQPKRVD 194
             T  +L  +++  ++   G +E+AR +     +    S+      + FE  QP    ++
Sbjct: 310 IWTKAVLVTEWASLLWTVKGTAEEARAVFQKNAQWYGGSRHFWMQWIQFELEQPTSAELE 369

Query: 195 IDFLESLVVKFITPNPENPGVASATEREELSNIFLEFLNLFGDVQSIKR 243
               E L  + I          S+  ++EL  ++L +L   G   ++K+
Sbjct: 370 AQHSERL--REIIDKIRTESNMSSAAKKELCGVYLAYLQHRGGKDAMKQ 416


>emb|CAG09750.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 509

 Score = 85.1 bits (209), Expect = 3e-15
 Identities = 54/156 (34%), Positives = 85/156 (53%), Gaps = 12/156 (7%)

Query: 17  LYERCVIACANYPEYWIRYVLCMEASESMDLANNVLARASQVFVKRQPEIHLFCARFKEQ 76
           L+ERC+IACA Y E+W RY   +E S +++ A  V  RA ++ + R+P I +  A F+E+
Sbjct: 242 LFERCLIACALYEEFWTRYARYLE-SHNVEEARAVFKRACEIHLTRRPNICMQWATFEER 300

Query: 77  AGDIVGARAAYQLVHTEISPGLLEAIIRHANMEHRLGKLEDAFSLYEQAIAIEKGKEHSQ 136
             ++  AR     + + + PGL    +R   +E R G+L+ A +L ++A+A  K K    
Sbjct: 301 HNNLAEARRVLAAIESRV-PGLAVVRLRRVALERRAGQLDQAVALLQEAVAESKEK---- 355

Query: 137 TLPMLFAQYS----RFVYLASGNSEKAREILVGGLE 168
             P L A YS    R +   + N  KAR++L   LE
Sbjct: 356 --PTLHAFYSIKLARLLLKLARNPSKARKVLQEALE 389


>gb|EAA68105.1| hypothetical protein FG01244.1 [Gibberella zeae PH-1]
           gi|46108724|ref|XP_381420.1| hypothetical protein
           FG01244.1 [Gibberella zeae PH-1]
          Length = 587

 Score = 84.0 bits (206), Expect = 7e-15
 Identities = 63/232 (27%), Positives = 108/232 (46%), Gaps = 7/232 (3%)

Query: 13  QIVKLYERCVIACANYPEYWIRYVLCMEASE-SMDLANNVLARASQVFVK-RQPEIHLFC 70
           +IV LYERC++ CA Y + W RY   M   E   +   N+  RAS +FV   +P I L  
Sbjct: 295 RIVALYERCLVTCAFYDDLWFRYARWMSGQEGKAEEVRNIYVRASTMFVPISRPGIRLQW 354

Query: 71  ARFKEQAGDIVGARAAYQLVHTEISPGLLEAIIRHANMEHRLGKLEDAFSLYEQAIAIEK 130
           A F+E  G +  A   ++ +   + P  +E I+  AN+E R   ++ A  +Y+    I+ 
Sbjct: 355 AYFEESTGRVDVALDIHEAILLRL-PDSVEVIVSWANVERRQNGIDAAIQVYKN--QIDA 411

Query: 131 GKEHSQTLPMLFAQYSRFVYLASGNSEKAREILVGGLENASLSKPLLEALLHFEAIQPQP 190
                 T   L A+++  ++   G++E+ARE+    +     S+   +    FE  QP  
Sbjct: 412 PTVDIYTKAALVAEWALLLWKVKGSTEEAREVFTKNVTWYGDSRLFWDRWFQFELDQPSS 471

Query: 191 KRVDIDFLESLVVKFITPNPENPGVASATEREELSNIFLEFLNLFGDVQSIK 242
              +    E +  K +          SA  +++L+ I+L +L   GD  ++K
Sbjct: 472 AETEAQHGECM--KKVFDELRERSQLSAPVKKDLAQIYLNYLVERGDKDAMK 521


>gb|EAA64755.1| hypothetical protein AN1635.2 [Aspergillus nidulans FGSC A4]
           gi|67522356|ref|XP_659239.1| hypothetical protein
           AN1635_2 [Aspergillus nidulans FGSC A4]
           gi|49087740|ref|XP_405772.1| hypothetical protein
           AN1635.2 [Aspergillus nidulans FGSC A4]
          Length = 588

 Score = 80.5 bits (197), Expect = 7e-14
 Identities = 59/218 (27%), Positives = 103/218 (47%), Gaps = 7/218 (3%)

Query: 17  LYERCVIACANYPEYWIRYVLCMEASESMDL-ANNVLARASQVFVK-RQPEIHLFCARFK 74
           LYERC++ CA+Y E+W RY   M A    +    N+  RAS ++V    P   L  A F+
Sbjct: 299 LYERCLVTCAHYDEFWQRYARWMSAQPGKEEDVRNIYQRASYLYVPIANPATRLQYAYFE 358

Query: 75  EQAGDIVGARAAYQLVHTEISPGLLEAIIRHANMEHRLGKLEDAFSLYEQAIAIEKGKEH 134
           E  G +  A+  ++ +   I P  +E I+  ANM  R G LE A  +Y+    ++  +  
Sbjct: 359 EMCGRVSVAKEIHEAILINI-PNHVETIVSLANMCRRHGGLEAAIEVYKS--QLDSPQCE 415

Query: 135 SQTLPMLFAQYSRFVYLASGNSEKAREILVGGLENASLSKPLLEALLHFEAIQPQPKRVD 194
             T   L A+++R ++   G++E+AR++     +    S+    + L FE  QP     +
Sbjct: 416 MSTKAALVAEWARLLWKIKGSTEEARQVFQKNQQYYLDSQAFWHSYLTFELDQPTSAATE 475

Query: 195 IDFLESLVVKFITPNPENPGVASATEREELSNIFLEFL 232
               E   +K +  +  +    S+    +L  I++ +L
Sbjct: 476 SAQYER--IKQVVEDIRSKSALSSNVARDLVQIYMVYL 511


>gb|EAL89886.1| conserved hypothetical protein [Aspergillus fumigatus Af293]
          Length = 591

 Score = 77.0 bits (188), Expect = 8e-13
 Identities = 54/174 (31%), Positives = 86/174 (49%), Gaps = 5/174 (2%)

Query: 17  LYERCVIACANYPEYWIRYVLCMEASESM-DLANNVLARASQVFVK-RQPEIHLFCARFK 74
           LYERC++ CA+Y E+W RY   M A     +   N+  RAS  +V    P   L  A F+
Sbjct: 299 LYERCLVTCAHYDEFWQRYARWMSAQPGKEEEVRNIYQRASCFYVPIANPATRLQYAYFE 358

Query: 75  EQAGDIVGARAAYQLVHTEISPGLLEAIIRHANMEHRLGKLEDAFSLYEQAIAIEKGKEH 134
           E +G +  A+  +  +   + P  +E II  ANM  R G LE A  +Y+    ++  +  
Sbjct: 359 EMSGRVDVAKDIHDAILATL-PNHVETIISLANMCRRHGGLEAAIEVYKN--QLDSPQCD 415

Query: 135 SQTLPMLFAQYSRFVYLASGNSEKAREILVGGLENASLSKPLLEALLHFEAIQP 188
             T   L A+++R ++   G++E AR++     +    S+P   + L FE  QP
Sbjct: 416 LATKAALVAEWARLLWKIKGSAEDARQVFQKNQQYYLDSRPFWNSYLMFELDQP 469


>gb|AAO41607.1| CG1646-PC, isoform C [Drosophila melanogaster]
           gi|7301703|gb|AAF56816.1| CG1646-PA, isoform A
           [Drosophila melanogaster] gi|28571914|ref|NP_788753.1|
           CG1646-PC, isoform C [Drosophila melanogaster]
           gi|21357975|ref|NP_651634.1| CG1646-PA, isoform A
           [Drosophila melanogaster] gi|15291785|gb|AAK93161.1|
           LD26426p [Drosophila melanogaster]
          Length = 1009

 Score = 75.5 bits (184), Expect = 2e-12
 Identities = 73/290 (25%), Positives = 130/290 (44%), Gaps = 21/290 (7%)

Query: 12  DQIVKLYERCVIACANYPEYWIRYVLCMEASES----MDLANNVLARASQVFVKRQPEIH 67
           ++++ L+ERC+IACA Y E+W++ +  +E+ E     +DL  +V  RA ++    +P +H
Sbjct: 665 ERVLVLFERCLIACALYDEFWLKMLRYLESLEDQSGVVDLVRDVYRRACRIHHPDKPSLH 724

Query: 68  LFCARFKEQAGDIVGARAAYQLVHTEISPGLLEAIIRHANMEHRLGKLEDAFSLYEQAIA 127
           L  A F+E   +   A    Q +  +  P LL+   R  N+E R G L+    LY+  I 
Sbjct: 725 LMWAAFEECQMNFDDAAEILQRI-DQRCPNLLQLSYRRINVERRRGALDKCRELYKHYIE 783

Query: 128 IEKGKEHSQTLPMLFAQYSRFVYLASGNSEKAREILVGGLENASLSKPLLEALLHFEAIQ 187
             K K  + +L +   +Y+RF+     + +     L   LE    +  +  AL   +   
Sbjct: 784 STKNKGIAGSLAI---KYARFLNKICHDLDAGLAALQQALERDPANTRV--ALQMIDLCL 838

Query: 188 PQPKRVDIDFLESLVVKFITPNPENPGVASATEREELSNIFLEFLNLFGDVQSIKRAEDR 247
            +PK VD   +  ++ KF+      P      ++   +   +EFL  FG   + +  +D 
Sbjct: 839 QRPK-VDEQEVVEIMDKFMARADIEP-----DQKVLFAQRKVEFLEDFG--STARGLQDA 890

Query: 248 HAKLFLPNRGLSELKKRHAEDFLASDKTKVSRAYSAQSPAQSVAGAYPNG 297
              L    + L++ K+   +   +  +   S +     P  S A AY NG
Sbjct: 891 QRAL---QQALTKAKEAQKKSDGSPSRKNSSSSKEGPVPTGSAAAAYNNG 937


>ref|XP_581849.1| PREDICTED: similar to PRP39 pre-mRNA processing factor 39 homolog,
           partial [Bos taurus]
          Length = 295

 Score = 75.5 bits (184), Expect = 2e-12
 Identities = 52/173 (30%), Positives = 88/173 (50%), Gaps = 20/173 (11%)

Query: 12  DQIVKLYERCVIACANYPEYWIR----YVLCMEASESMDL------------ANNVLARA 55
           +++V L+ERCVI+CA Y E+WI+    Y L +E   ++DL              +V +RA
Sbjct: 124 ERVVVLFERCVISCALYEEFWIKVSKLYGLKLEYIINIDLYAKYMENHSIEGVRHVFSRA 183

Query: 56  SQVFVKRQPEIHLFCARFKEQAGDIVGARAAYQLVHTEISPGLLEAIIRHANMEHRLGKL 115
             + + ++P +H+  A F+EQ G+I  AR   +    E   GL    +R  ++E R G +
Sbjct: 184 CTIHLPKKPMVHMLWAAFEEQQGNINEARNILR-TFEECVLGLAMVRLRRVSLERRHGNM 242

Query: 116 EDAFSLYEQAIAIEKGKEHSQTLPMLFAQYSRFVYLASGNSEKAREILVGGLE 168
           E+A  L ++AI   K    S    +  A   R ++    N  K+R++L+  +E
Sbjct: 243 EEAERLLQEAIKNAKSNNESSFYAIKLA---RHLFKIQKNLPKSRKVLLEAIE 292


>gb|AAO41608.2| CG1646-PD, isoform D [Drosophila melanogaster]
           gi|45552111|ref|NP_788754.2| CG1646-PD, isoform D
           [Drosophila melanogaster]
          Length = 1026

 Score = 75.5 bits (184), Expect = 2e-12
 Identities = 73/290 (25%), Positives = 130/290 (44%), Gaps = 21/290 (7%)

Query: 12  DQIVKLYERCVIACANYPEYWIRYVLCMEASES----MDLANNVLARASQVFVKRQPEIH 67
           ++++ L+ERC+IACA Y E+W++ +  +E+ E     +DL  +V  RA ++    +P +H
Sbjct: 682 ERVLVLFERCLIACALYDEFWLKMLRYLESLEDQSGVVDLVRDVYRRACRIHHPDKPSLH 741

Query: 68  LFCARFKEQAGDIVGARAAYQLVHTEISPGLLEAIIRHANMEHRLGKLEDAFSLYEQAIA 127
           L  A F+E   +   A    Q +  +  P LL+   R  N+E R G L+    LY+  I 
Sbjct: 742 LMWAAFEECQMNFDDAAEILQRI-DQRCPNLLQLSYRRINVERRRGALDKCRELYKHYIE 800

Query: 128 IEKGKEHSQTLPMLFAQYSRFVYLASGNSEKAREILVGGLENASLSKPLLEALLHFEAIQ 187
             K K  + +L +   +Y+RF+     + +     L   LE    +  +  AL   +   
Sbjct: 801 STKNKGIAGSLAI---KYARFLNKICHDLDAGLAALQQALERDPANTRV--ALQMIDLCL 855

Query: 188 PQPKRVDIDFLESLVVKFITPNPENPGVASATEREELSNIFLEFLNLFGDVQSIKRAEDR 247
            +PK VD   +  ++ KF+      P      ++   +   +EFL  FG   + +  +D 
Sbjct: 856 QRPK-VDEQEVVEIMDKFMARADIEP-----DQKVLFAQRKVEFLEDFG--STARGLQDA 907

Query: 248 HAKLFLPNRGLSELKKRHAEDFLASDKTKVSRAYSAQSPAQSVAGAYPNG 297
              L    + L++ K+   +   +  +   S +     P  S A AY NG
Sbjct: 908 QRAL---QQALTKAKEAQKKSDGSPSRKNSSSSKEGPVPTGSAAAAYNNG 954


  Database: nr
    Posted date:  Jul 5, 2005 12:34 AM
  Number of letters in database: 863,360,394
  Number of sequences in database:  2,540,612
  
Lambda     K      H
   0.321    0.136    0.405 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 588,156,080
Number of Sequences: 2540612
Number of extensions: 23880318
Number of successful extensions: 60357
Number of sequences better than 10.0: 122
Number of HSP's better than 10.0 without gapping: 27
Number of HSP's successfully gapped in prelim test: 96
Number of HSP's that attempted gapping in prelim test: 60190
Number of HSP's gapped (non-prelim): 205
length of query: 359
length of database: 863,360,394
effective HSP length: 129
effective length of query: 230
effective length of database: 535,621,446
effective search space: 123192932580
effective search space used: 123192932580
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 76 (33.9 bits)


Medicago: description of AC148607.14