
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC148607.14 - phase: 0
(359 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
Sequences producing significant alignments:    Score (bits)    E Value
gb|AAM65430.1| unknown [Arabidopsis thaliana] gi|20259567|gb|AAM... 424 e-117
gb|AAC16751.1| Contains similarity to pre-mRNA processing protei... 424 e-117
dbj|BAB11095.1| unnamed protein product [Arabidopsis thaliana] 175 2e-42
ref|NP_199452.2| expressed protein [Arabidopsis thaliana] 175 2e-42
ref|XP_482102.1| unknown protein [Oryza sativa (japonica cultiva... 165 2e-39
gb|AAH88586.1| LOC496864 protein [Xenopus tropicalis] 99 2e-19
ref|NP_060392.2| PRP39 pre-mRNA processing factor 39 homolog [Ho... 97 7e-19
gb|AAH51886.1| PRPF39 protein [Homo sapiens] 97 7e-19
ref|XP_537427.1| PREDICTED: similar to PRP39 pre-mRNA processing... 94 5e-18
ref|XP_234238.3| PREDICTED: similar to PRP39 pre-mRNA processing... 92 3e-17
emb|CAD87784.1| novel protein similar to pre-mRNA processing pro... 90 9e-17
ref|XP_392380.2| PREDICTED: similar to novel protein similar to ... 89 2e-16
gb|EAA50799.1| hypothetical protein MG04558.4 [Magnaporthe grise... 88 5e-16
emb|CAG09750.1| unnamed protein product [Tetraodon nigroviridis] 85 3e-15
gb|EAA68105.1| hypothetical protein FG01244.1 [Gibberella zeae P... 84 7e-15
gb|EAA64755.1| hypothetical protein AN1635.2 [Aspergillus nidula... 80 7e-14
gb|EAL89886.1| conserved hypothetical protein [Aspergillus fumig... 77 8e-13
gb|AAO41607.1| CG1646-PC, isoform C [Drosophila melanogaster] gi... 75 2e-12
ref|XP_581849.1| PREDICTED: similar to PRP39 pre-mRNA processing... 75 2e-12
gb|AAO41608.2| CG1646-PD, isoform D [Drosophila melanogaster] gi... 75 2e-12
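The summary lines above each end with a bit score and an E-value, so they can be split from the right. The following is a minimal sketch, assuming the single-space layout shown in this report; `parse_hit_line` is a hypothetical helper, not part of any BLAST toolkit. BLAST prints very small E-values without a leading mantissa (for example `e-117`), which the helper reads as `1e-117`.

```python
def parse_hit_line(line):
    """Split one hit-list line into (first_id, description, bits, evalue).

    Assumes the last two whitespace-separated fields are the bit score
    and the E-value, as in the hit list of this report.
    """
    description, score, evalue = line.rsplit(None, 2)
    # BLAST abbreviates values such as 1e-117 to "e-117".
    if evalue.startswith("e"):
        evalue = "1" + evalue
    first_id = description.split()[0]
    return first_id, description, float(score), float(evalue)


hit = parse_hit_line(
    "gb|AAM65430.1| unknown [Arabidopsis thaliana] gi|20259567|gb|AAM... 424 e-117"
)
```

Descriptions that BLAST truncated with `...` are returned unchanged; the full definition lines appear only in the alignment section.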
>gb|AAM65430.1| unknown [Arabidopsis thaliana] gi|20259567|gb|AAM14126.1| unknown
protein [Arabidopsis thaliana]
gi|15810565|gb|AAL07170.1| unknown protein [Arabidopsis
thaliana] gi|18379230|ref|NP_563700.1|
hydroxyproline-rich glycoprotein family protein
[Arabidopsis thaliana]
Length = 768
Score = 424 bits (1091), Expect = e-117
Identities = 212/321 (66%), Positives = 254/321 (79%), Gaps = 5/321 (1%)
Query: 12 DQIVKLYERCVIACANYPEYWIRYVLCMEASESMDLANNVLARASQVFVKRQPEIHLFCA 71
+++VKLYERCV+ CANYPEYWIRYV MEAS S DLA N LARA+QVFVK+QPEIHLF A
Sbjct: 378 NKVVKLYERCVVTCANYPEYWIRYVTNMEASGSADLAENALARATQVFVKKQPEIHLFAA 437
Query: 72 RFKEQAGDIVGARAAYQLVHTEISPGLLEAIIRHANMEHRLGKLEDAFSLYEQAIAIEKG 131
R KEQ GDI GARAAYQLVH+EISPGLLEA+I+HANME+RLG L+DAFSLYEQ IA+EKG
Sbjct: 438 RLKEQNGDIAGARAAYQLVHSEISPGLLEAVIKHANMEYRLGNLDDAFSLYEQVIAVEKG 497
Query: 132 KEHSQTLPMLFAQYSRFVYLASGNSEKAREILVGGLENASLSKPLLEALLHFEAIQPQPK 191
KEHS LP+L+AQYSRF YL S ++EKAR I+V L++ SKPL+EAL+HFEAIQP P+
Sbjct: 498 KEHSTILPLLYAQYSRFSYLVSRDAEKARRIIVEALDHVQPSKPLMEALIHFEAIQPPPR 557
Query: 192 RVDIDFLESLVVKFITPNPENPGVASATEREELSNIFLEFLNLFGDVQSIKRAEDRHAKL 251
+ID+LE LV K I P+ + +AS+TEREELS I++EFL +FGDV+SIK+AED+H KL
Sbjct: 558 --EIDYLEPLVEKVIKPDADAQNIASSTEREELSLIYIEFLGIFGDVKSIKKAEDQHVKL 615
Query: 252 FLPNRGLSELKKRHAEDFLASDKTKVSRAYSAQSPAQSVAGAYPNGPNQWP-NYGVQPQT 310
F P+R SELKKR A+DFLASD+TK+++ Y+ PAQ V+ AYPN QW Y QPQT
Sbjct: 616 FYPHRSTSELKKRSADDFLASDRTKMAKTYNGTPPAQPVSNAYPNAQAQWSGGYAAQPQT 675
Query: 311 WP--ATTQAQGQQWPAGYTQQ 329
WP AQ QQW Y QQ
Sbjct: 676 WPPAQAAPAQPQQWNPAYGQQ 696
>gb|AAC16751.1| Contains similarity to pre-mRNA processing protein PRP39 gb|L29224
from S. cerevisiae. ESTs gb|R64908 and gb|T88158,
gb|N38703 and gb|AA651043 come from this gene.
[Arabidopsis thaliana] gi|7485869|pir||T00964
hypothetical protein F20D22.14 - Arabidopsis thaliana
Length = 1345
Score = 424 bits (1091), Expect = e-117
Identities = 212/321 (66%), Positives = 254/321 (79%), Gaps = 5/321 (1%)
Query: 12 DQIVKLYERCVIACANYPEYWIRYVLCMEASESMDLANNVLARASQVFVKRQPEIHLFCA 71
+++VKLYERCV+ CANYPEYWIRYV MEAS S DLA N LARA+QVFVK+QPEIHLF A
Sbjct: 378 NKVVKLYERCVVTCANYPEYWIRYVTNMEASGSADLAENALARATQVFVKKQPEIHLFAA 437
Query: 72 RFKEQAGDIVGARAAYQLVHTEISPGLLEAIIRHANMEHRLGKLEDAFSLYEQAIAIEKG 131
R KEQ GDI GARAAYQLVH+EISPGLLEA+I+HANME+RLG L+DAFSLYEQ IA+EKG
Sbjct: 438 RLKEQNGDIAGARAAYQLVHSEISPGLLEAVIKHANMEYRLGNLDDAFSLYEQVIAVEKG 497
Query: 132 KEHSQTLPMLFAQYSRFVYLASGNSEKAREILVGGLENASLSKPLLEALLHFEAIQPQPK 191
KEHS LP+L+AQYSRF YL S ++EKAR I+V L++ SKPL+EAL+HFEAIQP P+
Sbjct: 498 KEHSTILPLLYAQYSRFSYLVSRDAEKARRIIVEALDHVQPSKPLMEALIHFEAIQPPPR 557
Query: 192 RVDIDFLESLVVKFITPNPENPGVASATEREELSNIFLEFLNLFGDVQSIKRAEDRHAKL 251
+ID+LE LV K I P+ + +AS+TEREELS I++EFL +FGDV+SIK+AED+H KL
Sbjct: 558 --EIDYLEPLVEKVIKPDADAQNIASSTEREELSLIYIEFLGIFGDVKSIKKAEDQHVKL 615
Query: 252 FLPNRGLSELKKRHAEDFLASDKTKVSRAYSAQSPAQSVAGAYPNGPNQWP-NYGVQPQT 310
F P+R SELKKR A+DFLASD+TK+++ Y+ PAQ V+ AYPN QW Y QPQT
Sbjct: 616 FYPHRSTSELKKRSADDFLASDRTKMAKTYNGTPPAQPVSNAYPNAQAQWSGGYAAQPQT 675
Query: 311 WP--ATTQAQGQQWPAGYTQQ 329
WP AQ QQW Y QQ
Sbjct: 676 WPPAQAAPAQPQQWNPAYGQQ 696
>dbj|BAB11095.1| unnamed protein product [Arabidopsis thaliana]
Length = 1022
Score = 175 bits (443), Expect = 2e-42
Identities = 98/252 (38%), Positives = 150/252 (58%), Gaps = 4/252 (1%)
Query: 12 DQIVKLYERCVIACANYPEYWIRYVLCMEASESMDLANNVLARASQVFVKRQPEIHLFCA 71
D + LYERC+I CANY E+W RYV +E+ +LAN LARASQ FVK IHLF A
Sbjct: 315 DWAINLYERCLIPCANYTEFWFRYVDFVESKGGRELANFALARASQTFVKSASVIHLFNA 374
Query: 72 RFKEQAGDIVGARAAYQLVHTEISPGLLEAIIRHANMEHRLGKLEDAFSLYEQAI-AIEK 130
RFKE GD A A E+ G +E + + ANME RLG E A + Y +A+
Sbjct: 375 RFKEHVGDASAASVALSRCGEELGFGFVENVTKKANMEKRLGNFEAAVTTYREALNKTLI 434
Query: 131 GKEHSQTLPMLFAQYSRFVYLASGNSEKAREILVGGLENASLSKPLLEALLHFEAIQPQP 190
GKE+ +T L+ Q+SR Y+ + +++ A +IL+ G EN K LLE L+ +
Sbjct: 435 GKENLETTARLYVQFSRLKYVITNSADDAAQILLEGNENVPHCKLLLEELMRLLMMHGGS 494
Query: 191 KRVDIDFLESLVVKFITPNPENPGVASATEREELSNIFLEFLNLFGDVQSIKRAEDRHAK 250
++VD+ L+ ++ K ++ ++ SA ++EE+SN+++EF++L G + +++A RH K
Sbjct: 495 RQVDL--LDPIIDKELSHQADSSDGLSAEDKEEISNLYMEFIDLSGTIHDVRKALGRHIK 552
Query: 251 LFLPNRGLSELK 262
LF P+ ++L+
Sbjct: 553 LF-PHSARAKLR 563
>ref|NP_199452.2| expressed protein [Arabidopsis thaliana]
Length = 1036
Score = 175 bits (443), Expect = 2e-42
Identities = 98/252 (38%), Positives = 150/252 (58%), Gaps = 4/252 (1%)
Query: 12 DQIVKLYERCVIACANYPEYWIRYVLCMEASESMDLANNVLARASQVFVKRQPEIHLFCA 71
D + LYERC+I CANY E+W RYV +E+ +LAN LARASQ FVK IHLF A
Sbjct: 315 DWAINLYERCLIPCANYTEFWFRYVDFVESKGGRELANFALARASQTFVKSASVIHLFNA 374
Query: 72 RFKEQAGDIVGARAAYQLVHTEISPGLLEAIIRHANMEHRLGKLEDAFSLYEQAI-AIEK 130
RFKE GD A A E+ G +E + + ANME RLG E A + Y +A+
Sbjct: 375 RFKEHVGDASAASVALSRCGEELGFGFVENVTKKANMEKRLGNFEAAVTTYREALNKTLI 434
Query: 131 GKEHSQTLPMLFAQYSRFVYLASGNSEKAREILVGGLENASLSKPLLEALLHFEAIQPQP 190
GKE+ +T L+ Q+SR Y+ + +++ A +IL+ G EN K LLE L+ +
Sbjct: 435 GKENLETTARLYVQFSRLKYVITNSADDAAQILLEGNENVPHCKLLLEELMRLLMMHGGS 494
Query: 191 KRVDIDFLESLVVKFITPNPENPGVASATEREELSNIFLEFLNLFGDVQSIKRAEDRHAK 250
++VD+ L+ ++ K ++ ++ SA ++EE+SN+++EF++L G + +++A RH K
Sbjct: 495 RQVDL--LDPIIDKELSHQADSSDGLSAEDKEEISNLYMEFIDLSGTIHDVRKALGRHIK 552
Query: 251 LFLPNRGLSELK 262
LF P+ ++L+
Sbjct: 553 LF-PHSARAKLR 563
>ref|XP_482102.1| unknown protein [Oryza sativa (japonica cultivar-group)]
gi|40253684|dbj|BAD05627.1| unknown protein [Oryza
sativa (japonica cultivar-group)]
gi|40253455|dbj|BAD05406.1| unknown protein [Oryza
sativa (japonica cultivar-group)]
Length = 1161
Score = 165 bits (418), Expect = 2e-39
Identities = 91/244 (37%), Positives = 144/244 (58%), Gaps = 4/244 (1%)
Query: 12 DQIVKLYERCVIACANYPEYWIRYVLCMEASESMDLANNVLARASQVFVKRQPEIHLFCA 71
D VKLYERC+I CANY E+WIRY ++A ++A+ L RAS FVK P H++ A
Sbjct: 278 DWAVKLYERCLIPCANYSEFWIRYAEFVDAKGGREIASYALGRASSYFVKGVPTFHMYYA 337
Query: 72 RFKEQAGDIVGARAAYQLVHTEISPGLLEAIIRHANMEHRLGKLEDAFSLYEQAIAIEKG 131
FKEQ GD GAR+ + ++ I R ANME R+G + A +YE AI +
Sbjct: 338 MFKEQIGDAQGARSLFIEGSNNLTSNFCANINRLANMEKRMGNTKAASEIYETAIQ-DAM 396
Query: 132 KEHSQTLPMLFAQYSRFVYLASGNSEKAREILVGGLENASLSKPLLEALLHFEAIQPQPK 191
+++ + LP L+ +++F Y + N +A+E+ V G++ A K L++ + F + P
Sbjct: 397 QKNVKILPDLYTNFAQFKYAVNHNISEAKEVFVEGIKQAP-CKALIKGFMQFMSTHGGP- 454
Query: 192 RVDIDFLESLVVKFITPNPENPGVASATEREELSNIFLEFLNLFGDVQSIKRAEDRHAKL 251
+I L+S++ + P + V S +RE++S +FLEF++L+GDV+ +++A RH+KL
Sbjct: 455 -TEIPILDSVISNAVVPGSDISTVLSREDREDISLLFLEFVDLYGDVRDLRKAWARHSKL 513
Query: 252 FLPN 255
F N
Sbjct: 514 FPHN 517
>gb|AAH88586.1| LOC496864 protein [Xenopus tropicalis]
Length = 656
Score = 99.4 bits (246), Expect = 2e-19
Identities = 86/277 (31%), Positives = 132/277 (47%), Gaps = 17/277 (6%)
Query: 12 DQIVKLYERCVIACANYPEYWIRYVLCMEASESMDLANNVLARASQVFVKRQPEIHLFCA 71
++IV LYERC++ACA Y E+W+ YV ME S++ A +L RA + + +P + L+ A
Sbjct: 326 ERIVTLYERCLVACALYEEFWLSYVQYME-PHSIEAARCILQRACCIHLPLKPTLSLYWA 384
Query: 72 RFKEQAGDIVGARAA-YQLVHTEISPGLLEAIIRHANMEHRLGKLEDAFSLYEQAIAIEK 130
F+E+ G I AR+ Y L + PGL +R ++E R G LE+A L E+A+
Sbjct: 385 AFEEKHGQIDTARSVLYDL--ENLMPGLAMVRLRRVSLERRTGNLEEAEHLLEEAVKSSL 442
Query: 131 GKEHSQTLPMLFAQYSRFVYLASGNSEKAREILVGGLENASLSKPLLEALLHFEAIQPQP 190
G E + + A R + GN EKAR++L LE + L LL E + +
Sbjct: 443 GTELAAFYSIKLA---RLLLKLQGNMEKARKVLTEALEKEPDNPRLHLCLLEIE-VSREG 498
Query: 191 KRVDIDFLESLVVKFITPNPENPGVASATEREELSNIFLEFL-NLFGDVQSIKRAEDRHA 249
+ + D L L V+ + S ++ +S LEFL + ++ S+ A D H
Sbjct: 499 SQGEADAL--LCVERALKSS-----LSDDFKKMISQRRLEFLEDNSSNITSVLSAYDEHQ 551
Query: 250 KLFLPNRGLSELKKRHAEDFLASDKTKVSRAYSAQSP 286
K FL L + +ED K K +P
Sbjct: 552 K-FLKQEELKRQAENGSEDETEEKKPKTETVVHISAP 587
>ref|NP_060392.2| PRP39 pre-mRNA processing factor 39 homolog [Homo sapiens]
Length = 548
Score = 97.1 bits (240), Expect = 7e-19
Identities = 88/306 (28%), Positives = 133/306 (42%), Gaps = 26/306 (8%)
Query: 12 DQIVKLYERCVIACANYPEYWIRYVLCMEASESMDLANNVLARASQVFVKRQPEIHLFCA 71
+++V L+ERCVI+CA Y E+WI+Y ME + S++ +V +RA + + ++P +H+ A
Sbjct: 249 ERVVVLFERCVISCALYEEFWIKYAKYME-NHSIEGVRHVFSRACTIHLPKKPMVHMLWA 307
Query: 72 RFKEQAGDIVGARAAYQLVHTEISPGLLEAIIRHANMEHRLGKLEDAFSLYEQAIAIEKG 131
F+EQ G+I AR + E GL +R ++E R G LE+A L + AI K
Sbjct: 308 AFEEQQGNINEARNILK-TFEECVLGLAMVRLRRVSLERRHGNLEEAEHLLQDAIKNAKS 366
Query: 132 KEHSQTLPMLFAQYSRFVYLASGNSEKAREILVGGLENASLSKPLLEALLHFEAIQPQPK 191
S + A R ++ N K+R++L+ +E + L LL E K
Sbjct: 367 NNESSFYAVKLA---RHLFKIQKNLPKSRKVLLEAIERDKENTKLYLNLLEME-YSGDLK 422
Query: 192 RVDIDFLESLVVKFITPNPENPGVASATEREELSNIFLEFLNLFG-DVQSIKRAEDRHAK 250
+ + + L + G R S +EFL FG DV + A D H
Sbjct: 423 QNEENILNCF-------DKAVHGSLPIKMRITFSQRKVEFLEDFGSDVNKLLNAYDEHQT 475
Query: 251 LFLPNRGLS----------ELKKRHAEDFLASDKTKVSRAYSAQSPAQSVAGAYP-NGPN 299
L L E KK H ED +S + A + + Y N N
Sbjct: 476 LLKEQDSLKRKAENGSEEPEEKKAHTEDTTSSSTQMIDGDLQANQAVYNYSAWYQYNYQN 535
Query: 300 QWPNYG 305
W NYG
Sbjct: 536 PW-NYG 540
>gb|AAH51886.1| PRPF39 protein [Homo sapiens]
Length = 479
Score = 97.1 bits (240), Expect = 7e-19
Identities = 88/306 (28%), Positives = 133/306 (42%), Gaps = 26/306 (8%)
Query: 12 DQIVKLYERCVIACANYPEYWIRYVLCMEASESMDLANNVLARASQVFVKRQPEIHLFCA 71
+++V L+ERCVI+CA Y E+WI+Y ME + S++ +V +RA + + ++P +H+ A
Sbjct: 180 ERVVVLFERCVISCALYEEFWIKYAKYME-NHSIEGVRHVFSRACTIHLPKKPMVHMLWA 238
Query: 72 RFKEQAGDIVGARAAYQLVHTEISPGLLEAIIRHANMEHRLGKLEDAFSLYEQAIAIEKG 131
F+EQ G+I AR + E GL +R ++E R G LE+A L + AI K
Sbjct: 239 AFEEQQGNINEARNILK-TFEECVLGLAMVRLRRVSLERRHGNLEEAEHLLQDAIKNAKS 297
Query: 132 KEHSQTLPMLFAQYSRFVYLASGNSEKAREILVGGLENASLSKPLLEALLHFEAIQPQPK 191
S + A R ++ N K+R++L+ +E + L LL E K
Sbjct: 298 NNESSFYAVKLA---RHLFKIQKNLPKSRKVLLEAIERDKENTKLYLNLLEME-YSGDLK 353
Query: 192 RVDIDFLESLVVKFITPNPENPGVASATEREELSNIFLEFLNLFG-DVQSIKRAEDRHAK 250
+ + + L + G R S +EFL FG DV + A D H
Sbjct: 354 QNEENILNCF-------DKAVHGSLPIKMRITFSQRKVEFLEDFGSDVNKLLNAYDEHQT 406
Query: 251 LFLPNRGLS----------ELKKRHAEDFLASDKTKVSRAYSAQSPAQSVAGAYP-NGPN 299
L L E KK H ED +S + A + + Y N N
Sbjct: 407 LLKEQDSLKRKAENGSEEPEEKKAHTEDTTSSSTQMIDGDLQANQAVYNYSAWYQYNYQN 466
Query: 300 QWPNYG 305
W NYG
Sbjct: 467 PW-NYG 471
>ref|XP_537427.1| PREDICTED: similar to PRP39 pre-mRNA processing factor 39 homolog
[Canis familiaris]
Length = 712
Score = 94.4 bits (233), Expect = 5e-18
Identities = 87/306 (28%), Positives = 133/306 (43%), Gaps = 26/306 (8%)
Query: 12 DQIVKLYERCVIACANYPEYWIRYVLCMEASESMDLANNVLARASQVFVKRQPEIHLFCA 71
+++V L+ERCVI+CA Y E+WI+Y ME + S++ +V +RA + + ++P +H+ A
Sbjct: 413 ERVVVLFERCVISCALYEEFWIKYAKYME-NHSIEGVRHVFSRACTIHLPKKPMVHMLWA 471
Query: 72 RFKEQAGDIVGARAAYQLVHTEISPGLLEAIIRHANMEHRLGKLEDAFSLYEQAIAIEKG 131
F+EQ G+I AR + E GL +R ++E R +E+A L + AI K
Sbjct: 472 AFEEQQGNINEARNILR-TFEECVLGLAMVRLRRVSLERRHENMEEAEHLLQDAIKNAKS 530
Query: 132 KEHSQTLPMLFAQYSRFVYLASGNSEKAREILVGGLENASLSKPLLEALLHFEAIQPQPK 191
S + A R ++ N K+R++L+ +E + L LL E K
Sbjct: 531 NNESSFYAIKLA---RHLFKIQKNLPKSRKVLLEAIERDKENTKLYLNLLEME-YSGDLK 586
Query: 192 RVDIDFLESLVVKFITPNPENPGVASATEREELSNIFLEFLNLFG-DVQSIKRAEDRHAK 250
+ + + L + G R S +EFL FG DV + A D H
Sbjct: 587 QNEENILNCF-------DKAIHGSLPIKMRITFSQRKVEFLEDFGSDVNKLLNAYDEHQT 639
Query: 251 LFLPNRGLS----------ELKKRHAEDFLASDKTKVSRAYSAQSPAQSVAGAYP-NGPN 299
L L E KK H ED +S + A A + + Y N N
Sbjct: 640 LLKEQDSLKRKAENGSEEPEEKKAHTEDTSSSSTQMIDGDLQANQAAYNYSAWYQYNYQN 699
Query: 300 QWPNYG 305
W NYG
Sbjct: 700 PW-NYG 704
>ref|XP_234238.3| PREDICTED: similar to PRP39 pre-mRNA processing factor 39 homolog
[Rattus norvegicus]
Length = 709
Score = 91.7 bits (226), Expect = 3e-17
Identities = 89/306 (29%), Positives = 132/306 (43%), Gaps = 28/306 (9%)
Query: 12 DQIVKLYERCVIACANYPEYWIRYVLCMEASESMDLANNVLARASQVFVKRQPEIHLFCA 71
+++V L+ERCVI+CA Y E+WI+Y ME + S++ +V +RA V + ++P H+ A
Sbjct: 412 ERVVVLFERCVISCALYEEFWIKYAKYME-NHSIEGVRHVFSRACTVHLPKKPMAHMLWA 470
Query: 72 RFKEQAGDIVGARAAYQLVHTEISPGLLEAIIRHANMEHRLGKLEDAFSLYEQAIAIEKG 131
F+EQ G+I AR + E GL +R ++E R G +E+A L + AI K
Sbjct: 471 AFEEQQGNINEARIILR-TFEECVLGLAMVRLRRVSLERRHGNMEEAEHLLQDAIRNAKS 529
Query: 132 KEHSQTLPMLFAQYSRFVYLASGNSEKAREILVGGLENASLSKPLLEALLHFEAIQPQPK 191
S + A R ++ N K+R++L+ +E + L LL E
Sbjct: 530 NNESSFYAIKLA---RHLFKIQKNLPKSRKVLLEAIEKDKENTKLYLNLLEME------Y 580
Query: 192 RVDIDFLESLVVKFITPNPENPGVASATEREELSNIFLEFLNLFG-DVQSIKRAEDRHAK 250
D+ E ++ + G R S +EFL FG DV + A D H
Sbjct: 581 SCDLKQNEENILNCF--DKAIHGSLPIKMRITFSQRKVEFLEDFGSDVNKLLNAYDEHQT 638
Query: 251 LFLPNRGLS----------ELKKRHAEDFLASDKTKVSRAYSAQSPAQSVAGAYP-NGPN 299
L L E KK H ED S + A A + + Y N N
Sbjct: 639 LLKEQDTLKRKAENGSEEPEEKKAHTED--VSSAQIIDGDLQANQAAYNYSAWYQYNYQN 696
Query: 300 QWPNYG 305
W NYG
Sbjct: 697 PW-NYG 701
>emb|CAD87784.1| novel protein similar to pre-mRNA processing proteins [Danio rerio]
gi|52218898|ref|NP_001004520.1| PRP39 pre-mRNA
processing factor 39 homolog [Danio rerio]
Length = 752
Score = 90.1 bits (222), Expect = 9e-17
Identities = 78/310 (25%), Positives = 133/310 (42%), Gaps = 32/310 (10%)
Query: 12 DQIVKLYERCVIACANYPEYWIRYVLCMEASESMDLANNVLARASQVFVKRQPEIHLFCA 71
+++V L+ERC+IACA Y E+WI+Y +E S S + ++ +A V + ++P +HL A
Sbjct: 445 ERVVVLFERCLIACALYEEFWIKYAKYLE-SYSTEAVRHIYKKACTVHLPKKPNVHLLWA 503
Query: 72 RFKEQAGDIVGARAAYQLVHTEISPGLLEAIIRHANMEHRLGKLEDAFSLYEQAIAIEKG 131
F+EQ G I AR+ + V + PGL +R ++E R G +E+A +L + AI +
Sbjct: 504 AFEEQQGSIDEARSILKAVEVSV-PGLAMVRLRRVSLERRHGNMEEAEALLQDAITNGRN 562
Query: 132 KEHSQTLPMLFAQYSRFVYLASGNSEKAREILVGGLENASLSKPLLEALLHFE---AIQP 188
S + A+ V + G +A+++L+ +E + L LL E +Q
Sbjct: 563 SSESSFYSVKLARQLVKVQKSIG---RAKKVLLEAVEKDETNPKLYLNLLELEYSGDVQQ 619
Query: 189 QPKRVDIDFLESLVVKFITPNPENPGVASATEREELSNIFLEFLNLFG-DVQSIKRAEDR 247
+ F +L + R S ++FL FG D+ ++ A ++
Sbjct: 620 NEAEIIACFDRAL-----------SSSMALESRITFSQRKVDFLEDFGSDINTLMAAYEQ 668
Query: 248 HAKLFLPNRGLSELKKRHAEDFLA----SDKTKVSRAYSAQSPAQSVAGAYPN------- 296
H +L + +E+ A +D V+ A Y N
Sbjct: 669 HQRLLAEQESFKRKAENGSEEPDAKRQRTDDQSVASGQMMDMQANHAGYNYNNWYQYNSW 728
Query: 297 -GPNQWPNYG 305
N W YG
Sbjct: 729 GSQNSWGQYG 738
>ref|XP_392380.2| PREDICTED: similar to novel protein similar to pre-mRNA processing
proteins [Apis mellifera]
Length = 946
Score = 89.0 bits (219), Expect = 2e-16
Identities = 79/302 (26%), Positives = 139/302 (45%), Gaps = 28/302 (9%)
Query: 12 DQIVKLYERCVIACANYPEYWIRYVLCMEA--SESMDLANNVLARASQVFVKRQPEIHLF 69
++I+ L+ERC+IACA Y E+W+R+V +E+ ++++ +V RA V ++P +HL
Sbjct: 601 NRIIILFERCLIACALYDEFWMRFVRYLESLKGDNVEKIRDVYTRACTVHHPKKPNLHLQ 660
Query: 70 CARFKEQAGDIVGARAAYQLVHTEISPGLLEAIIRHANMEHRLGKLEDAFSLYEQAIAIE 129
A F+E G+ A + + I P +L+ R N+E R G L+ A +LYE I+
Sbjct: 661 WATFEEGQGNFEKAANILENIDNVI-PNMLQVAYRRINLERRRGDLDKACTLYENYISNS 719
Query: 130 KGKEHSQTLPMLFAQYSRFVYLASGNSEKAREILVGGLENASLSKPLLEALLHFEAIQPQ 189
K + + + +Y+RF+ + +KA ++L+ E K L+ + I
Sbjct: 720 KNRTIANN---IVVKYARFLCKVKNDVDKAIKVLLKATE-----KDKDNPRLYLQLIDLG 771
Query: 190 PKRVDIDFLESLVVKFITPNPENPGVASATEREELSNIFLEFLNLFG-DVQSIKRAEDRH 248
+R +D E + + E+ A +R + +EFL F D++ I +A ++
Sbjct: 772 MQRTPVDTQEIVGYMDMFIEREH---ADLEQRVLFAQRKVEFLEDFSPDIRQILKAHEQF 828
Query: 249 AKLFLPNRGLSELKKRHAED----------FLASDKTKVSRAYSAQSPAQSVAGAYPNGP 298
K + E KK +D D+T V S S + + P+GP
Sbjct: 829 QKCI---KQAKERKKTKNDDTKTDTSPPKKVKTGDQTNVPPPPSVSSQSSYQYSSGPSGP 885
Query: 299 NQ 300
Q
Sbjct: 886 YQ 887
>gb|EAA50799.1| hypothetical protein MG04558.4 [Magnaporthe grisea 70-15]
gi|39945152|ref|XP_362113.1| hypothetical protein
MG04558.4 [Magnaporthe grisea 70-15]
Length = 480
Score = 87.8 bits (216), Expect = 5e-16
Identities = 62/229 (27%), Positives = 107/229 (46%), Gaps = 7/229 (3%)
Query: 17 LYERCVIACANYPEYWIRYVLCMEAS-ESMDLANNVLARASQVFVK-RQPEIHLFCARFK 74
LYERC++ CA Y E+W RY M A + + N+ RA+ +FV +P I L A F+
Sbjct: 193 LYERCLVTCAFYDEFWFRYARWMSAQPDKTEEVRNIYLRAATIFVPISRPGIRLQFAYFE 252
Query: 75 EQAGDIVGARAAYQLVHTEISPGLLEAIIRHANMEHRLGKLEDAFSLYEQAIAIEKGKEH 134
E G + AR + + + PG +E II AN+E R ++ A + +Q IE +
Sbjct: 253 ESCGRVAMAREVHNAILLRL-PGCIEVIISLANLERRHNDIDTAIEVLKQ--QIESPEVD 309
Query: 135 SQTLPMLFAQYSRFVYLASGNSEKAREILVGGLENASLSKPLLEALLHFEAIQPQPKRVD 194
T +L +++ ++ G +E+AR + + S+ + FE QP ++
Sbjct: 310 IWTKAVLVTEWASLLWTVKGTAEEARAVFQKNAQWYGGSRHFWMQWIQFELEQPTSAELE 369
Query: 195 IDFLESLVVKFITPNPENPGVASATEREELSNIFLEFLNLFGDVQSIKR 243
E L + I S+ ++EL ++L +L G ++K+
Sbjct: 370 AQHSERL--REIIDKIRTESNMSSAAKKELCGVYLAYLQHRGGKDAMKQ 416
>emb|CAG09750.1| unnamed protein product [Tetraodon nigroviridis]
Length = 509
Score = 85.1 bits (209), Expect = 3e-15
Identities = 54/156 (34%), Positives = 85/156 (53%), Gaps = 12/156 (7%)
Query: 17 LYERCVIACANYPEYWIRYVLCMEASESMDLANNVLARASQVFVKRQPEIHLFCARFKEQ 76
L+ERC+IACA Y E+W RY +E S +++ A V RA ++ + R+P I + A F+E+
Sbjct: 242 LFERCLIACALYEEFWTRYARYLE-SHNVEEARAVFKRACEIHLTRRPNICMQWATFEER 300
Query: 77 AGDIVGARAAYQLVHTEISPGLLEAIIRHANMEHRLGKLEDAFSLYEQAIAIEKGKEHSQ 136
++ AR + + + PGL +R +E R G+L+ A +L ++A+A K K
Sbjct: 301 HNNLAEARRVLAAIESRV-PGLAVVRLRRVALERRAGQLDQAVALLQEAVAESKEK---- 355
Query: 137 TLPMLFAQYS----RFVYLASGNSEKAREILVGGLE 168
P L A YS R + + N KAR++L LE
Sbjct: 356 --PTLHAFYSIKLARLLLKLARNPSKARKVLQEALE 389
>gb|EAA68105.1| hypothetical protein FG01244.1 [Gibberella zeae PH-1]
gi|46108724|ref|XP_381420.1| hypothetical protein
FG01244.1 [Gibberella zeae PH-1]
Length = 587
Score = 84.0 bits (206), Expect = 7e-15
Identities = 63/232 (27%), Positives = 108/232 (46%), Gaps = 7/232 (3%)
Query: 13 QIVKLYERCVIACANYPEYWIRYVLCMEASE-SMDLANNVLARASQVFVK-RQPEIHLFC 70
+IV LYERC++ CA Y + W RY M E + N+ RAS +FV +P I L
Sbjct: 295 RIVALYERCLVTCAFYDDLWFRYARWMSGQEGKAEEVRNIYVRASTMFVPISRPGIRLQW 354
Query: 71 ARFKEQAGDIVGARAAYQLVHTEISPGLLEAIIRHANMEHRLGKLEDAFSLYEQAIAIEK 130
A F+E G + A ++ + + P +E I+ AN+E R ++ A +Y+ I+
Sbjct: 355 AYFEESTGRVDVALDIHEAILLRL-PDSVEVIVSWANVERRQNGIDAAIQVYKN--QIDA 411
Query: 131 GKEHSQTLPMLFAQYSRFVYLASGNSEKAREILVGGLENASLSKPLLEALLHFEAIQPQP 190
T L A+++ ++ G++E+ARE+ + S+ + FE QP
Sbjct: 412 PTVDIYTKAALVAEWALLLWKVKGSTEEAREVFTKNVTWYGDSRLFWDRWFQFELDQPSS 471
Query: 191 KRVDIDFLESLVVKFITPNPENPGVASATEREELSNIFLEFLNLFGDVQSIK 242
+ E + K + SA +++L+ I+L +L GD ++K
Sbjct: 472 AETEAQHGECM--KKVFDELRERSQLSAPVKKDLAQIYLNYLVERGDKDAMK 521
>gb|EAA64755.1| hypothetical protein AN1635.2 [Aspergillus nidulans FGSC A4]
gi|67522356|ref|XP_659239.1| hypothetical protein
AN1635_2 [Aspergillus nidulans FGSC A4]
gi|49087740|ref|XP_405772.1| hypothetical protein
AN1635.2 [Aspergillus nidulans FGSC A4]
Length = 588
Score = 80.5 bits (197), Expect = 7e-14
Identities = 59/218 (27%), Positives = 103/218 (47%), Gaps = 7/218 (3%)
Query: 17 LYERCVIACANYPEYWIRYVLCMEASESMDL-ANNVLARASQVFVK-RQPEIHLFCARFK 74
LYERC++ CA+Y E+W RY M A + N+ RAS ++V P L A F+
Sbjct: 299 LYERCLVTCAHYDEFWQRYARWMSAQPGKEEDVRNIYQRASYLYVPIANPATRLQYAYFE 358
Query: 75 EQAGDIVGARAAYQLVHTEISPGLLEAIIRHANMEHRLGKLEDAFSLYEQAIAIEKGKEH 134
E G + A+ ++ + I P +E I+ ANM R G LE A +Y+ ++ +
Sbjct: 359 EMCGRVSVAKEIHEAILINI-PNHVETIVSLANMCRRHGGLEAAIEVYKS--QLDSPQCE 415
Query: 135 SQTLPMLFAQYSRFVYLASGNSEKAREILVGGLENASLSKPLLEALLHFEAIQPQPKRVD 194
T L A+++R ++ G++E+AR++ + S+ + L FE QP +
Sbjct: 416 MSTKAALVAEWARLLWKIKGSTEEARQVFQKNQQYYLDSQAFWHSYLTFELDQPTSAATE 475
Query: 195 IDFLESLVVKFITPNPENPGVASATEREELSNIFLEFL 232
E +K + + + S+ +L I++ +L
Sbjct: 476 SAQYER--IKQVVEDIRSKSALSSNVARDLVQIYMVYL 511
>gb|EAL89886.1| conserved hypothetical protein [Aspergillus fumigatus Af293]
Length = 591
Score = 77.0 bits (188), Expect = 8e-13
Identities = 54/174 (31%), Positives = 86/174 (49%), Gaps = 5/174 (2%)
Query: 17 LYERCVIACANYPEYWIRYVLCMEASESM-DLANNVLARASQVFVK-RQPEIHLFCARFK 74
LYERC++ CA+Y E+W RY M A + N+ RAS +V P L A F+
Sbjct: 299 LYERCLVTCAHYDEFWQRYARWMSAQPGKEEEVRNIYQRASCFYVPIANPATRLQYAYFE 358
Query: 75 EQAGDIVGARAAYQLVHTEISPGLLEAIIRHANMEHRLGKLEDAFSLYEQAIAIEKGKEH 134
E +G + A+ + + + P +E II ANM R G LE A +Y+ ++ +
Sbjct: 359 EMSGRVDVAKDIHDAILATL-PNHVETIISLANMCRRHGGLEAAIEVYKN--QLDSPQCD 415
Query: 135 SQTLPMLFAQYSRFVYLASGNSEKAREILVGGLENASLSKPLLEALLHFEAIQP 188
T L A+++R ++ G++E AR++ + S+P + L FE QP
Sbjct: 416 LATKAALVAEWARLLWKIKGSAEDARQVFQKNQQYYLDSRPFWNSYLMFELDQP 469
>gb|AAO41607.1| CG1646-PC, isoform C [Drosophila melanogaster]
gi|7301703|gb|AAF56816.1| CG1646-PA, isoform A
[Drosophila melanogaster] gi|28571914|ref|NP_788753.1|
CG1646-PC, isoform C [Drosophila melanogaster]
gi|21357975|ref|NP_651634.1| CG1646-PA, isoform A
[Drosophila melanogaster] gi|15291785|gb|AAK93161.1|
LD26426p [Drosophila melanogaster]
Length = 1009
Score = 75.5 bits (184), Expect = 2e-12
Identities = 73/290 (25%), Positives = 130/290 (44%), Gaps = 21/290 (7%)
Query: 12 DQIVKLYERCVIACANYPEYWIRYVLCMEASES----MDLANNVLARASQVFVKRQPEIH 67
++++ L+ERC+IACA Y E+W++ + +E+ E +DL +V RA ++ +P +H
Sbjct: 665 ERVLVLFERCLIACALYDEFWLKMLRYLESLEDQSGVVDLVRDVYRRACRIHHPDKPSLH 724
Query: 68 LFCARFKEQAGDIVGARAAYQLVHTEISPGLLEAIIRHANMEHRLGKLEDAFSLYEQAIA 127
L A F+E + A Q + + P LL+ R N+E R G L+ LY+ I
Sbjct: 725 LMWAAFEECQMNFDDAAEILQRI-DQRCPNLLQLSYRRINVERRRGALDKCRELYKHYIE 783
Query: 128 IEKGKEHSQTLPMLFAQYSRFVYLASGNSEKAREILVGGLENASLSKPLLEALLHFEAIQ 187
K K + +L + +Y+RF+ + + L LE + + AL +
Sbjct: 784 STKNKGIAGSLAI---KYARFLNKICHDLDAGLAALQQALERDPANTRV--ALQMIDLCL 838
Query: 188 PQPKRVDIDFLESLVVKFITPNPENPGVASATEREELSNIFLEFLNLFGDVQSIKRAEDR 247
+PK VD + ++ KF+ P ++ + +EFL FG + + +D
Sbjct: 839 QRPK-VDEQEVVEIMDKFMARADIEP-----DQKVLFAQRKVEFLEDFG--STARGLQDA 890
Query: 248 HAKLFLPNRGLSELKKRHAEDFLASDKTKVSRAYSAQSPAQSVAGAYPNG 297
L + L++ K+ + + + S + P S A AY NG
Sbjct: 891 QRAL---QQALTKAKEAQKKSDGSPSRKNSSSSKEGPVPTGSAAAAYNNG 937
>ref|XP_581849.1| PREDICTED: similar to PRP39 pre-mRNA processing factor 39 homolog,
partial [Bos taurus]
Length = 295
Score = 75.5 bits (184), Expect = 2e-12
Identities = 52/173 (30%), Positives = 88/173 (50%), Gaps = 20/173 (11%)
Query: 12 DQIVKLYERCVIACANYPEYWIR----YVLCMEASESMDL------------ANNVLARA 55
+++V L+ERCVI+CA Y E+WI+ Y L +E ++DL +V +RA
Sbjct: 124 ERVVVLFERCVISCALYEEFWIKVSKLYGLKLEYIINIDLYAKYMENHSIEGVRHVFSRA 183
Query: 56 SQVFVKRQPEIHLFCARFKEQAGDIVGARAAYQLVHTEISPGLLEAIIRHANMEHRLGKL 115
+ + ++P +H+ A F+EQ G+I AR + E GL +R ++E R G +
Sbjct: 184 CTIHLPKKPMVHMLWAAFEEQQGNINEARNILR-TFEECVLGLAMVRLRRVSLERRHGNM 242
Query: 116 EDAFSLYEQAIAIEKGKEHSQTLPMLFAQYSRFVYLASGNSEKAREILVGGLE 168
E+A L ++AI K S + A R ++ N K+R++L+ +E
Sbjct: 243 EEAERLLQEAIKNAKSNNESSFYAIKLA---RHLFKIQKNLPKSRKVLLEAIE 292
>gb|AAO41608.2| CG1646-PD, isoform D [Drosophila melanogaster]
gi|45552111|ref|NP_788754.2| CG1646-PD, isoform D
[Drosophila melanogaster]
Length = 1026
Score = 75.5 bits (184), Expect = 2e-12
Identities = 73/290 (25%), Positives = 130/290 (44%), Gaps = 21/290 (7%)
Query: 12 DQIVKLYERCVIACANYPEYWIRYVLCMEASES----MDLANNVLARASQVFVKRQPEIH 67
++++ L+ERC+IACA Y E+W++ + +E+ E +DL +V RA ++ +P +H
Sbjct: 682 ERVLVLFERCLIACALYDEFWLKMLRYLESLEDQSGVVDLVRDVYRRACRIHHPDKPSLH 741
Query: 68 LFCARFKEQAGDIVGARAAYQLVHTEISPGLLEAIIRHANMEHRLGKLEDAFSLYEQAIA 127
L A F+E + A Q + + P LL+ R N+E R G L+ LY+ I
Sbjct: 742 LMWAAFEECQMNFDDAAEILQRI-DQRCPNLLQLSYRRINVERRRGALDKCRELYKHYIE 800
Query: 128 IEKGKEHSQTLPMLFAQYSRFVYLASGNSEKAREILVGGLENASLSKPLLEALLHFEAIQ 187
K K + +L + +Y+RF+ + + L LE + + AL +
Sbjct: 801 STKNKGIAGSLAI---KYARFLNKICHDLDAGLAALQQALERDPANTRV--ALQMIDLCL 855
Query: 188 PQPKRVDIDFLESLVVKFITPNPENPGVASATEREELSNIFLEFLNLFGDVQSIKRAEDR 247
+PK VD + ++ KF+ P ++ + +EFL FG + + +D
Sbjct: 856 QRPK-VDEQEVVEIMDKFMARADIEP-----DQKVLFAQRKVEFLEDFG--STARGLQDA 907
Query: 248 HAKLFLPNRGLSELKKRHAEDFLASDKTKVSRAYSAQSPAQSVAGAYPNG 297
L + L++ K+ + + + S + P S A AY NG
Sbjct: 908 QRAL---QQALTKAKEAQKKSDGSPSRKNSSSSKEGPVPTGSAAAAYNNG 954
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.321 0.136 0.405
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 588,156,080
Number of Sequences: 2540612
Number of extensions: 23880318
Number of successful extensions: 60357
Number of sequences better than 10.0: 122
Number of HSP's better than 10.0 without gapping: 27
Number of HSP's successfully gapped in prelim test: 96
Number of HSP's that attempted gapping in prelim test: 60190
Number of HSP's gapped (non-prelim): 205
length of query: 359
length of database: 863,360,394
effective HSP length: 129
effective length of query: 230
effective length of database: 535,621,446
effective search space: 123192932580
effective search space used: 123192932580
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 76 (33.9 bits)
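The statistics above are related by simple arithmetic, which the sketch below reproduces from the printed values. It assumes the standard Karlin-Altschul relations: effective lengths are the actual lengths minus the effective HSP length (once per database sequence), a raw score S maps to bits as (lambda * S - ln K) / ln 2, a drop-off X maps to bits as lambda * X / ln 2, and the expected number of chance hits is the search space times 2^(-bits). The function names are illustrative only. Because the report prints lambda and K rounded, results computed this way can differ slightly from the reported figures.

```python
import math

LN2 = math.log(2.0)


def effective_lengths(query_len, db_len, num_seqs, hsp_len):
    """Return (effective query length, effective db length, search space)."""
    eff_query = query_len - hsp_len
    eff_db = db_len - num_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def raw_to_bits(raw_score, lam, k):
    """Convert a raw alignment score to a bit score."""
    return (lam * raw_score - math.log(k)) / LN2


def dropoff_to_bits(x_raw, lam):
    """Convert an X drop-off value to bits (no K term)."""
    return lam * x_raw / LN2


def expect_from_bits(bits, search_space):
    """Approximate E-value for a bit score in a given search space."""
    return search_space * 2.0 ** (-bits)


# Values copied from the report: query 359 letters, effective HSP length 129.
eff_query, eff_db, space = effective_lengths(359, 863_360_394, 2_540_612, 129)

# S2 uses the gapped parameters, S1 and X1 the ungapped ones.
s2_bits = raw_to_bits(76, 0.267, 0.0410)
s1_bits = raw_to_bits(41, 0.321, 0.136)
x1_bits = dropoff_to_bits(16, 0.321)
```

With these inputs the search space comes out to 123192932580, matching the "effective search space" line, and `expect_from_bits(175, space)` is on the order of 1e-42, in line with the E-value reported for the 175-bit hits.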
Medicago: description of AC148607.14