
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC146817.2 + phase: 0
(145 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
Sequences producing significant alignments:    Score (bits)    E Value
ref|NP_180907.2| hydroxyproline-rich glycoprotein family protein... 65 4e-10
emb|CAD40549.1| OSJNBa0072K14.9 [Oryza sativa (japonica cultivar... 48 6e-05
dbj|BAC42812.1| unknown protein [Arabidopsis thaliana] gi|306937... 47 7e-05
dbj|BAB09712.1| unnamed protein product [Arabidopsis thaliana] 47 7e-05
ref|NP_851112.1| expressed protein [Arabidopsis thaliana] 47 7e-05
gb|AAN15514.1| unknown protein [Arabidopsis thaliana] gi|2253102... 45 3e-04
dbj|BAB01238.1| unnamed protein product [Arabidopsis thaliana] 45 3e-04
dbj|BAD38129.1| hydroxyproline-rich glycoprotein-like [Oryza sat... 42 0.003
dbj|BAD38128.1| hydroxyproline-rich glycoprotein-like [Oryza sat... 42 0.003
ref|ZP_00339903.1| COG0086: DNA-directed RNA polymerase, beta' s... 37 0.13
ref|XP_143396.3| PREDICTED: similar to ifapsoriasin [Mus musculus] 36 0.22
ref|NP_701136.1| hypothetical protein PF11_0276 [Plasmodium falc... 35 0.29
ref|XP_637675.1| hypothetical protein DDB0218841 [Dictyostelium ... 35 0.29
ref|NP_703418.1| hypothetical protein [Plasmodium falciparum 3D7... 34 0.84
ref|NP_220532.1| DNA-DIRECTED RNA POLYMERASE BETA PRIME CHAIN (r... 34 0.84
emb|CAG98603.1| unnamed protein product [Kluyveromyces lactis NR... 34 0.84
ref|YP_067097.1| DNA-directed RNA polymerase beta prime subunit;... 34 0.84
gb|EAK86244.1| hypothetical protein UM04789.1 [Ustilago maydis 5... 34 0.84
emb|CAI04661.1| conserved hypothetical protein [Plasmodium berghei] 34 0.84
ref|NP_359819.1| DNA-directed RNA polymerase beta prime chain [E... 33 1.1
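The summary lines above follow a simple pattern: identifier, truncated description, bit score, E-value. A minimal sketch of how they could be split into fields is given below. The helper name `parse_hit_line` and the regular expression are illustrative assumptions, not part of BLAST, and the pattern is only known to fit the lines shown in this report.

```python
import re

# Hypothetical pattern (an assumption, not a BLAST API): identifier,
# free-text description, integer bit score, E-value at the end of the line.
HIT_LINE = re.compile(r"^(\S+)\s+(.*?)\s+(\d+)\s+(\d\S*)$")

def parse_hit_line(line):
    """Return (id, description, bits, evalue), or None if the line does not fit."""
    m = HIT_LINE.match(line.strip())
    if m is None:
        return None
    seq_id, description, bits, evalue = m.groups()
    return seq_id, description, int(bits), float(evalue)

# Three summary lines copied from the table above.
lines = [
    "ref|NP_180907.2| hydroxyproline-rich glycoprotein family protein... 65 4e-10",
    "dbj|BAD38129.1| hydroxyproline-rich glycoprotein-like [Oryza sat... 42 0.003",
    "ref|NP_359819.1| DNA-directed RNA polymerase beta prime chain [E... 33 1.1",
]
hits = [parse_hit_line(line) for line in lines]
for seq_id, description, bits, evalue in hits:
    print(seq_id, bits, evalue)
```

Lines that do not end in two numeric fields, such as the column header, return None and can be skipped.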
>ref|NP_180907.2| hydroxyproline-rich glycoprotein family protein [Arabidopsis
thaliana]
Length = 623
Score = 64.7 bits (156), Expect = 4e-10
Identities = 31/61 (50%), Positives = 42/61 (68%), Gaps = 2/61 (3%)
Query: 44 NSLKVKEKDNMKHHCDEKREVYEYMIIQPKEKGKSNSGKGEHITSRPLQAAHDEYEEEAT 103
N L++ E+ M+ CDEKR VYE M+ + +EKG+S GKGE + + LQ AHD+YE E T
Sbjct: 124 NELRIVEE--MQRLCDEKRNVYEGMLTRQREKGRSKGGKGETFSPQQLQEAHDDYENETT 181
Query: 104 L 104
L
Sbjct: 182 L 182
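The Identities and Gaps counts reported for this alignment can be recomputed from the Query and Sbjct rows alone. The sketch below does so; the function name `alignment_counts` is a hypothetical helper. The percentages are truncated with integer division, which is an assumption that happens to agree with the figures printed in this report. Positives are not recomputed because that would require the BLOSUM62 matrix.

```python
def alignment_counts(query, sbjct):
    """Count identical columns and gap columns in two aligned rows."""
    if len(query) != len(sbjct):
        raise ValueError("aligned rows must have equal length")
    identities = sum(q == s and q != "-" for q, s in zip(query, sbjct))
    gaps = sum(q == "-" or s == "-" for q, s in zip(query, sbjct))
    return identities, gaps, len(query)

def percent(count, length):
    # Truncating division; assumed to mirror how the report rounds.
    return count * 100 // length

# Both rows of the alignment above, with the wrapped final column appended.
query = "NSLKVKEKDNMKHHCDEKREVYEYMIIQPKEKGKSNSGKGEHITSRPLQAAHDEYEEEAT" + "L"
sbjct = "NELRIVEE--MQRLCDEKRNVYEGMLTRQREKGRSKGGKGETFSPQQLQEAHDDYENETT" + "L"

identities, gaps, length = alignment_counts(query, sbjct)
print(f"Identities = {identities}/{length} ({percent(identities, length)}%), "
      f"Gaps = {gaps}/{length} ({percent(gaps, length)}%)")
# prints: Identities = 31/61 (50%), Gaps = 2/61 (3%)
```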
>emb|CAD40549.1| OSJNBa0072K14.9 [Oryza sativa (japonica cultivar-group)]
gi|50923911|ref|XP_472316.1| OSJNBa0072K14.9 [Oryza
sativa (japonica cultivar-group)]
Length = 624
Score = 47.8 bits (112), Expect = 6e-05
Identities = 28/70 (40%), Positives = 40/70 (57%), Gaps = 2/70 (2%)
Query: 37 NQIFEDSNSL--KVKEKDNMKHHCDEKREVYEYMIIQPKEKGKSNSGKGEHITSRPLQAA 94
N I S SL +++ + MK CD KR+ YE M +EKG+S K E ++S LQA
Sbjct: 114 NTITNPSESLLKELQVVEEMKELCDHKRQEYEAMRAAYREKGRSRHSKTETLSSEQLQAY 173
Query: 95 HDEYEEEATL 104
+Y+E+A L
Sbjct: 174 FLDYQEDAAL 183
>dbj|BAC42812.1| unknown protein [Arabidopsis thaliana] gi|30693745|ref|NP_198926.2|
expressed protein [Arabidopsis thaliana]
Length = 582
Score = 47.4 bits (111), Expect = 7e-05
Identities = 22/54 (40%), Positives = 37/54 (67%), Gaps = 1/54 (1%)
Query: 52 DNMKHHCDEKREVYEYMIIQP-KEKGKSNSGKGEHITSRPLQAAHDEYEEEATL 104
++MK C+EKR+V ++M+++ K+K + KGE + R L+ A DE ++EATL
Sbjct: 131 EDMKQQCEEKRDVVKHMLMEHVKDKVQVKGTKGERLIRRQLETARDELQDEATL 184
>dbj|BAB09712.1| unnamed protein product [Arabidopsis thaliana]
Length = 534
Score = 47.4 bits (111), Expect = 7e-05
Identities = 22/54 (40%), Positives = 37/54 (67%), Gaps = 1/54 (1%)
Query: 52 DNMKHHCDEKREVYEYMIIQP-KEKGKSNSGKGEHITSRPLQAAHDEYEEEATL 104
++MK C+EKR+V ++M+++ K+K + KGE + R L+ A DE ++EATL
Sbjct: 60 EDMKQQCEEKRDVVKHMLMEHVKDKVQVKGTKGERLIRRQLETARDELQDEATL 113
>ref|NP_851112.1| expressed protein [Arabidopsis thaliana]
Length = 586
Score = 47.4 bits (111), Expect = 7e-05
Identities = 22/54 (40%), Positives = 37/54 (67%), Gaps = 1/54 (1%)
Query: 52 DNMKHHCDEKREVYEYMIIQP-KEKGKSNSGKGEHITSRPLQAAHDEYEEEATL 104
++MK C+EKR+V ++M+++ K+K + KGE + R L+ A DE ++EATL
Sbjct: 131 EDMKQQCEEKRDVVKHMLMEHVKDKVQVKGTKGERLIRRQLETARDELQDEATL 184
>gb|AAN15514.1| unknown protein [Arabidopsis thaliana] gi|22531026|gb|AAM97017.1|
unknown protein [Arabidopsis thaliana]
gi|30688552|ref|NP_189326.2| hydroxyproline-rich
glycoprotein family protein [Arabidopsis thaliana]
Length = 608
Score = 45.4 bits (106), Expect = 3e-04
Identities = 22/53 (41%), Positives = 32/53 (59%), Gaps = 2/53 (3%)
Query: 52 DNMKHHCDEKREVYEYMIIQPKEKGKSNSGKGEHITSRPLQAAHDEYEEEATL 104
++MK CD KR VYE ++ KEKG+ S KGE + A+ E+ +EAT+
Sbjct: 132 EDMKQQCDGKRNVYEMSLV--KEKGRPKSSKGERHIPPESRPAYSEFHDEATM 182
>dbj|BAB01238.1| unnamed protein product [Arabidopsis thaliana]
Length = 621
Score = 45.4 bits (106), Expect = 3e-04
Identities = 22/53 (41%), Positives = 32/53 (59%), Gaps = 2/53 (3%)
Query: 52 DNMKHHCDEKREVYEYMIIQPKEKGKSNSGKGEHITSRPLQAAHDEYEEEATL 104
++MK CD KR VYE ++ KEKG+ S KGE + A+ E+ +EAT+
Sbjct: 132 EDMKQQCDGKRNVYEMSLV--KEKGRPKSSKGERHIPPESRPAYSEFHDEATM 182
>dbj|BAD38129.1| hydroxyproline-rich glycoprotein-like [Oryza sativa (japonica
cultivar-group)]
Length = 320
Score = 42.0 bits (97), Expect = 0.003
Identities = 20/53 (37%), Positives = 29/53 (53%)
Query: 52 DNMKHHCDEKREVYEYMIIQPKEKGKSNSGKGEHITSRPLQAAHDEYEEEATL 104
+ MK CD KR+ YE M +KG S K E ++ L A+ EY+E++ L
Sbjct: 136 EEMKQQCDMKRDAYETMRASYSDKGGSRHSKTESFSTEQLDASFLEYQEDSAL 188
>dbj|BAD38128.1| hydroxyproline-rich glycoprotein-like [Oryza sativa (japonica
cultivar-group)]
Length = 623
Score = 42.0 bits (97), Expect = 0.003
Identities = 20/53 (37%), Positives = 29/53 (53%)
Query: 52 DNMKHHCDEKREVYEYMIIQPKEKGKSNSGKGEHITSRPLQAAHDEYEEEATL 104
+ MK CD KR+ YE M +KG S K E ++ L A+ EY+E++ L
Sbjct: 136 EEMKQQCDMKRDAYETMRASYSDKGGSRHSKTESFSTEQLDASFLEYQEDSAL 188
>ref|ZP_00339903.1| COG0086: DNA-directed RNA polymerase, beta' subunit/160 kD subunit
[Rickettsia akari str. Hartford]
Length = 1372
Score = 36.6 bits (83), Expect = 0.13
Identities = 24/65 (36%), Positives = 32/65 (48%), Gaps = 3/65 (4%)
Query: 67 YMIIQPKEKGKSNSGKGEHITSRPLQAAHDEYEEEATLAKQGCPHIAKLGKGRRIKPLKH 126
Y++I P G S KGE +T LQ A D+Y E+A A G I ++ K LKH
Sbjct: 143 YVVIDP---GLSILQKGELLTEEELQKAKDKYGEDAFTASIGAEVIQQMLKELDFAKLKH 199
Query: 127 SNFNE 131
+ E
Sbjct: 200 ELYEE 204
>ref|XP_143396.3| PREDICTED: similar to ifapsoriasin [Mus musculus]
Length = 704
Score = 35.8 bits (81), Expect = 0.22
Identities = 26/89 (29%), Positives = 38/89 (42%), Gaps = 8/89 (8%)
Query: 24 GKFGPIRVPRGVNNQIFEDSNSLKVKEKDNMKHHCDEKREVYEYMIIQPKEKGKSNS--- 80
G GP + RG N++ E + L E+ KHH ++ + KEK S S
Sbjct: 145 GSRGPAKHRRGSNSKRLERQDELSSSEESRKKHH----GSIFGHSWSSNKEKDGSRSEEL 200
Query: 81 -GKGEHITSRPLQAAHDEYEEEATLAKQG 108
KG+ P + + +EYE L QG
Sbjct: 201 GEKGDKSYDSPSRESEEEYESGYRLNHQG 229
>ref|NP_701136.1| hypothetical protein PF11_0276 [Plasmodium falciparum 3D7]
gi|23496201|gb|AAN35860.1| hypothetical protein
[Plasmodium falciparum 3D7]
Length = 682
Score = 35.4 bits (80), Expect = 0.29
Identities = 18/57 (31%), Positives = 29/57 (50%), Gaps = 1/57 (1%)
Query: 11 ALTSKVDKNIKKNGKFGPIRVPRGVNNQIFEDSNSLKVKEKDNMKHHCDEKREVYEY 67
A + D NIK+ G G ++ +++ F D+ LK K N+ +CD K Y+Y
Sbjct: 358 ATQEREDANIKRGGTLGCDKIKEKESSEYFVDNKKLKNK-SSNLTDNCDNKINKYDY 413
>ref|XP_637675.1| hypothetical protein DDB0218841 [Dictyostelium discoideum]
gi|60466108|gb|EAL64174.1| hypothetical protein
DDB0218841 [Dictyostelium discoideum]
Length = 4592
Score = 35.4 bits (80), Expect = 0.29
Identities = 27/87 (31%), Positives = 38/87 (43%), Gaps = 2/87 (2%)
Query: 15 KVDKNIKKNGKFGPIRVPRGVNNQIFEDSNSLKVKEKDNMKHHCDEKREVYEYMIIQPKE 74
K DK N I NN +D N K KEKD K EK++ E + ++ K+
Sbjct: 450 KDDKKDDINSSSSSIGSSNSSNNTPTKDKN--KEKEKDKEKEKEKEKKKEKEKLKLEEKK 507
Query: 75 KGKSNSGKGEHITSRPLQAAHDEYEEE 101
K K GK + S+ + E+E E
Sbjct: 508 KKKEEKGKSKSKDSKKNKIKGIEFEVE 534
>ref|NP_703418.1| hypothetical protein [Plasmodium falciparum 3D7]
gi|23504558|emb|CAD51438.1| hypothetical protein
[Plasmodium falciparum 3D7]
Length = 893
Score = 33.9 bits (76), Expect = 0.84
Identities = 26/84 (30%), Positives = 38/84 (44%), Gaps = 7/84 (8%)
Query: 1 MRPSKLHQIAALTSKV----DKNIKKNGKFGPIRVPRGVNNQIFEDSNSLKVKEKDNMKH 56
M+ + H+I +TS D N N F + NN I+ N K K K+++K
Sbjct: 333 MKKLREHKIIIVTSSGKIYDDDNNNDNNNFYNDNI---YNNNIYNVHNDDKEKIKNHIKK 389
Query: 57 HCDEKREVYEYMIIQPKEKGKSNS 80
+ +YEY Q E+ KSNS
Sbjct: 390 KNKQNNYLYEYQRTQKDEEQKSNS 413
>ref|NP_220532.1| DNA-DIRECTED RNA POLYMERASE BETA PRIME CHAIN (rpoC) [Rickettsia
prowazekii str. Madrid E] gi|3860708|emb|CAA14609.1|
DNA-DIRECTED RNA POLYMERASE BETA PRIME CHAIN (rpoC)
[Rickettsia prowazekii] gi|7434730|pir||B71724
dna-directed RNA polymerase beta prime chain (rpoC)
RP141 - Rickettsia prowazekii
gi|6226044|sp|Q9ZE20|RPOC_RICPR DNA-directed RNA
polymerase beta' chain (RNAP beta' subunit)
(Transcriptase beta' chain) (RNA polymerase beta'
subunit)
Length = 1372
Score = 33.9 bits (76), Expect = 0.84
Identities = 22/65 (33%), Positives = 32/65 (48%), Gaps = 3/65 (4%)
Query: 67 YMIIQPKEKGKSNSGKGEHITSRPLQAAHDEYEEEATLAKQGCPHIAKLGKGRRIKPLKH 126
Y+++ P G S KGE +T LQ A D+Y E+A A G I ++ K LK
Sbjct: 143 YVVVDP---GLSILQKGELLTEEELQKAKDKYGEDAFTASIGAEVIQQMLKELDFSKLKQ 199
Query: 127 SNFNE 131
++E
Sbjct: 200 ELYDE 204
>emb|CAG98603.1| unnamed protein product [Kluyveromyces lactis NRRL Y-1140]
gi|50312991|ref|XP_455895.1| unnamed protein product
[Kluyveromyces lactis]
Length = 326
Score = 33.9 bits (76), Expect = 0.84
Identities = 24/87 (27%), Positives = 37/87 (41%), Gaps = 4/87 (4%)
Query: 15 KVDKNIKKNGKFGPIRVPRGVNNQIFEDSNSLKVKEKDNMKHHCDEKREVYEYMIIQPKE 74
K DK KK K + + + ED + K +K N KH D++ E + + K
Sbjct: 197 KEDKKTKKEKK----KAKKEEEKKQKEDKKNKKHSDKKNKKHDDDDEYSEAEPSVTEEKR 252
Query: 75 KGKSNSGKGEHITSRPLQAAHDEYEEE 101
S + K H TS L+ + E + E
Sbjct: 253 VETSFTKKPHHTTSAALKEGYKEVKNE 279
>ref|YP_067097.1| DNA-directed RNA polymerase beta prime subunit; RNA
nucleotidyltransferase (DNA-directed).; RNA polymerase
I.; RNA polymerase II.; RNA polymerase III. [Rickettsia
typhi str. Wilmington] gi|51459652|gb|AAU03615.1|
DNA-directed RNA polymerase beta prime subunit; RNA
nucleotidyltransferase (DNA-directed).; RNA polymerase
I.; RNA polymerase II.; RNA polymerase III. [Rickettsia
typhi str. Wilmington]
Length = 1372
Score = 33.9 bits (76), Expect = 0.84
Identities = 22/65 (33%), Positives = 32/65 (48%), Gaps = 3/65 (4%)
Query: 67 YMIIQPKEKGKSNSGKGEHITSRPLQAAHDEYEEEATLAKQGCPHIAKLGKGRRIKPLKH 126
Y+++ P G S KGE +T LQ A D+Y E+A A G I ++ K LK
Sbjct: 143 YVVVDP---GLSILQKGELLTEEELQKAKDKYGEDAFTASIGAEVIQQMLKELDFSKLKQ 199
Query: 127 SNFNE 131
++E
Sbjct: 200 ELYDE 204
>gb|EAK86244.1| hypothetical protein UM04789.1 [Ustilago maydis 521]
gi|49076976|ref|XP_402404.1| hypothetical protein
UM04789.1 [Ustilago maydis 521]
Length = 426
Score = 33.9 bits (76), Expect = 0.84
Identities = 31/118 (26%), Positives = 47/118 (39%), Gaps = 14/118 (11%)
Query: 28 PIRVPRGVNNQIFEDSNSLKVKEKDNMKHHCDEKREVYEYMIIQPKEKGKSNSGKGEHIT 87
P + + N+ E + KEK K + E+ E + P + + EHI
Sbjct: 132 PTKKEKKEKNEKKEKKEKKEKKEKKEKKEQKTQSEEIAEQVPESPSDTEIDSDDAEEHIQ 191
Query: 88 SRPLQAAHDE---------YEEEATLAKQ---GCPHIAKLGKGRRIKPLKH--SNFNE 131
P+QAA E E EA KQ G +I+++ G ++H SNF E
Sbjct: 192 DDPMQAADQEEKIIKPLSTTELEAFKKKQRKLGIVYISRIPPGMTPAKVRHILSNFGE 249
>emb|CAI04661.1| conserved hypothetical protein [Plasmodium berghei]
Length = 469
Score = 33.9 bits (76), Expect = 0.84
Identities = 31/126 (24%), Positives = 54/126 (42%), Gaps = 9/126 (7%)
Query: 4 SKLHQIAALTSKVDKNIKKNGKFGPIRVPRGVNNQIFEDSNSLKVKEKDNMKHHCDEKRE 63
+K + ++ + K K I KN K+ + +P G ++I +L +K D K + E +
Sbjct: 118 NKNNILSCIDEKKKKKIDKNSKYKLVVLPNGKKHKI-----NLNIKLSDFQKLYISEDNK 172
Query: 64 VYEYMIIQPKEKGKSNSGKGEH-ITSRPLQAAHDEYEEEATLAKQGCPHIAKLGKGRRIK 122
+EY++ K K N K + I R +Y EE T C + +
Sbjct: 173 SFEYLLRNMKNK---NIEKNMYSIIKRNEYNMKMDYIEECTKNGIKCDMLHMNKSENELY 229
Query: 123 PLKHSN 128
P++ SN
Sbjct: 230 PMRISN 235
>ref|NP_359819.1| DNA-directed RNA polymerase beta prime chain [EC:2.7.7.6]
[Rickettsia conorii str. Malish 7]
gi|15619230|gb|AAL02720.1| DNA-directed RNA polymerase
beta prime chain [EC:2.7.7.6] [Rickettsia conorii str.
Malish 7] gi|25288418|pir||F97722 hypothetical protein
rpoC [imported] - Rickettsia conorii (strain Malish 7)
gi|14916697|sp|Q9RH40|RPOC_RICCN DNA-directed RNA
polymerase beta' chain (RNAP beta' subunit)
(Transcriptase beta' chain) (RNA polymerase beta'
subunit)
Length = 1372
Score = 33.5 bits (75), Expect = 1.1
Identities = 22/65 (33%), Positives = 31/65 (46%), Gaps = 3/65 (4%)
Query: 67 YMIIQPKEKGKSNSGKGEHITSRPLQAAHDEYEEEATLAKQGCPHIAKLGKGRRIKPLKH 126
Y+++ P G S KGE +T LQ A D+Y E+A A G I ++ K LK
Sbjct: 143 YVVVDP---GLSILQKGELLTEEELQKAKDKYGEDAFTASIGAEVIQQMLKELDFSKLKQ 199
Query: 127 SNFNE 131
+ E
Sbjct: 200 ELYEE 204
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.315 0.133 0.393
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
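The gapped Lambda and K values above relate the raw scores (in parentheses on each Score line) to the bit scores through the standard Karlin-Altschul normalization, S' = (lambda * S - ln K) / ln 2. The sketch below applies that formula with the gapped parameters from this report. The function name `bit_score` is illustrative, and because Lambda and K are printed rounded, the results agree with the report only to about one decimal place.

```python
import math

# Gapped statistical parameters as printed in this report.
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410

def bit_score(raw, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score to a bit score."""
    return (lam * raw - math.log(k)) / math.log(2)

# Raw scores taken from three Score lines in the report.
for raw in (156, 112, 97):
    print(raw, round(bit_score(raw), 1))
```

For the top hit, a raw score of 156 comes out close to the reported 64.7 bits.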
Number of Hits to DB: 256,384,344
Number of Sequences: 2540612
Number of extensions: 10755808
Number of successful extensions: 20719
Number of sequences better than 10.0: 90
Number of HSP's better than 10.0 without gapping: 27
Number of HSP's successfully gapped in prelim test: 63
Number of HSP's that attempted gapping in prelim test: 20661
Number of HSP's gapped (non-prelim): 108
length of query: 145
length of database: 863,360,394
effective HSP length: 121
effective length of query: 24
effective length of database: 555,946,342
effective search space: 13342712208
effective search space used: 13342712208
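The effective lengths and the search space above follow from the query length, the database size, and the effective HSP length by simple arithmetic. The sketch below reproduces those figures and then estimates the E-value of the top hit as search space times 2 to the power of minus the bit score. Variable names are illustrative, and the E-value is approximate because the bit score in the report is already rounded.

```python
# Figures copied from the report.
query_len = 145
db_letters = 863_360_394
db_sequences = 2_540_612
hsp_len = 121  # effective HSP length

# Each sequence is shortened by the effective HSP length.
eff_query = query_len - hsp_len
eff_db = db_letters - db_sequences * hsp_len
search_space = eff_query * eff_db

def expect(bits, space=search_space):
    """Expected number of chance hits with at least this bit score."""
    return space * 2.0 ** (-bits)

print(eff_query, eff_db, search_space)
# prints: 24 555946342 13342712208
print(f"{expect(64.7):.0e}")
```

With the 64.7 bits of the top hit, the estimate falls in the same range as the reported Expect value of 4e-10.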
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 67 (30.4 bits)
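The bit values in parentheses after X1, X2, and X3 appear to be the raw drop-off values scaled by lambda / ln 2. The numbers in this report are consistent with X1 using the ungapped Lambda (0.315) and X2 and X3 using the gapped Lambda (0.267); that pairing is an inference from the printed values, not something the report states. The helper name `dropoff_bits` is hypothetical.

```python
import math

# Lambda values as printed in this report.
LAMBDA_UNGAPPED = 0.315
LAMBDA_GAPPED = 0.267

def dropoff_bits(raw, lam):
    """Scale a raw X drop-off value to bits."""
    return lam * raw / math.log(2)

# Assumed pairing: X1 with ungapped Lambda, X2 and X3 with gapped Lambda.
x1 = dropoff_bits(16, LAMBDA_UNGAPPED)
x2 = dropoff_bits(38, LAMBDA_GAPPED)
x3 = dropoff_bits(64, LAMBDA_GAPPED)
print(round(x1, 1), round(x2, 1), round(x3, 1))
```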
Medicago: description of AC146817.2