Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC015163A_C01 KMC015163A_c01
(983 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gb|AAM64310.1| unknown [Arabidopsis thaliana] 382 e-105
gb|AAK96864.1| Unknown protein [Arabidopsis thaliana] gi|1837747... 380 e-104
ref|NP_566447.1| expressed protein; protein id: At3g13200.1, sup... 379 e-104
dbj|BAA90629.1| ESTs AU032852(S15362),AU070591(S5037) correspond... 374 e-103
ref|NP_075642.1| RIKEN cDNA 0610040D20 [Mus musculus] gi|8926741... 206 6e-52
>gb|AAM64310.1| unknown [Arabidopsis thaliana]
Length = 230
Score = 382 bits (980), Expect = e-105
Identities = 184/230 (80%), Positives = 208/230 (90%), Gaps = 2/230 (0%)
Frame = -2
Query: 913 MTTAARPTWAPAKGGNEQGGTRIFGPSQKYSSRDIASHTTLKPRRDGQDTQDELKRRNLR 734
MTTAARPTWAPAKGGNEQGG RIFGPSQKYSSRD+A+HTTLKPRR+GQ TQ+EL++ NLR
Sbjct: 1 MTTAARPTWAPAKGGNEQGGARIFGPSQKYSSRDVAAHTTLKPRREGQHTQEELQKINLR 60
Query: 733 DELEERERRHFSSKNKSYSDDRDHGKSSHLFLEGTKRDVEDHIVARSVDADDSDVEVKSD 554
DELEERERRHFSSK+KSY+DDRD + S L LEG+KRD E+ I+ RSVDADDSDV++KSD
Sbjct: 61 DELEERERRHFSSKDKSYNDDRDRRRGSQLLLEGSKRDPEERIIPRSVDADDSDVDIKSD 120
Query: 553 EESDD--DDDDEDDTEALLAELEQIKKERAEEKLRKERLEQEEELKVKEAELLRGNPLLN 380
++SDD DDDDEDDTEAL+AEL+QIKKER EE+LRKE+ +Q EEL KE ELL+GNPLLN
Sbjct: 121 DDSDDESDDDDEDDTEALMAELDQIKKERVEERLRKEKEQQMEELNAKEEELLKGNPLLN 180
Query: 379 NPASFNVKRRWDDDVVFKNQARGEAKAPKRFINDTIRNDFHRKFLHKYMK 230
P SF+VKRRWDDDVVFKNQARGE KAPKRFINDTIRNDFHRKFLH+YMK
Sbjct: 181 TPTSFSVKRRWDDDVVFKNQARGEMKAPKRFINDTIRNDFHRKFLHRYMK 230
>gb|AAK96864.1| Unknown protein [Arabidopsis thaliana] gi|18377472|gb|AAL66902.1|
unknown protein [Arabidopsis thaliana]
Length = 230
Score = 380 bits (977), Expect = e-104
Identities = 184/230 (80%), Positives = 208/230 (90%), Gaps = 2/230 (0%)
Frame = -2
Query: 913 MTTAARPTWAPAKGGNEQGGTRIFGPSQKYSSRDIASHTTLKPRRDGQDTQDELKRRNLR 734
MTTAARPTWAPAKGGNEQGG RIFGPSQKYSSRD+A+HTTLKPRR+GQ TQ+EL++ NLR
Sbjct: 1 MTTAARPTWAPAKGGNEQGGARIFGPSQKYSSRDVAAHTTLKPRREGQHTQEELQKINLR 60
Query: 733 DELEERERRHFSSKNKSYSDDRDHGKSSHLFLEGTKRDVEDHIVARSVDADDSDVEVKSD 554
DELEERERRHFSSK+KSY+DDRD + S L LE +KRD E+ I+ RSVDADDSDV++KSD
Sbjct: 61 DELEERERRHFSSKDKSYNDDRDRRRGSQLLLEDSKRDPEERIIPRSVDADDSDVDIKSD 120
Query: 553 EESDD--DDDDEDDTEALLAELEQIKKERAEEKLRKERLEQEEELKVKEAELLRGNPLLN 380
++SDD DDDDEDDTEAL+AEL+QIKKER EE+LRKE+ +Q EEL KE ELL+GNPLLN
Sbjct: 121 DDSDDESDDDDEDDTEALMAELDQIKKERVEERLRKEKEQQMEELNAKEEELLKGNPLLN 180
Query: 379 NPASFNVKRRWDDDVVFKNQARGEAKAPKRFINDTIRNDFHRKFLHKYMK 230
PASF+VKRRWDDDVVFKNQARGE KAPKRFINDTIRNDFHRKFLH+YMK
Sbjct: 181 TPASFSVKRRWDDDVVFKNQARGEMKAPKRFINDTIRNDFHRKFLHRYMK 230
>ref|NP_566447.1| expressed protein; protein id: At3g13200.1, supported by cDNA:
22807., supported by cDNA: gi_15451185, supported by
cDNA: gi_18377471 [Arabidopsis thaliana]
gi|10172608|dbj|BAB01412.1|
dbj|BAA90629.1~gene_id:MJG19.16~similar to unknown
protein [Arabidopsis thaliana]
Length = 230
Score = 379 bits (973), Expect = e-104
Identities = 183/230 (79%), Positives = 207/230 (89%), Gaps = 2/230 (0%)
Frame = -2
Query: 913 MTTAARPTWAPAKGGNEQGGTRIFGPSQKYSSRDIASHTTLKPRRDGQDTQDELKRRNLR 734
MTTAARPTWAPAKGGNEQGG RIFGPSQKYSSRD+A+HTTLKPRR+GQ TQ+EL++ NLR
Sbjct: 1 MTTAARPTWAPAKGGNEQGGARIFGPSQKYSSRDVAAHTTLKPRREGQHTQEELQKINLR 60
Query: 733 DELEERERRHFSSKNKSYSDDRDHGKSSHLFLEGTKRDVEDHIVARSVDADDSDVEVKSD 554
DELEERERRHFSSK+KSY+DDRD + S L LE +KRD E+ I+ RSVDADDSDV++KSD
Sbjct: 61 DELEERERRHFSSKDKSYNDDRDRRRGSQLLLEDSKRDPEERIIPRSVDADDSDVDIKSD 120
Query: 553 EESDD--DDDDEDDTEALLAELEQIKKERAEEKLRKERLEQEEELKVKEAELLRGNPLLN 380
++SDD DDDDEDDTEAL+AEL+QIKKER EE+LRKE+ +Q EEL KE ELL+GNPLLN
Sbjct: 121 DDSDDESDDDDEDDTEALMAELDQIKKERVEERLRKEKEQQMEELNAKEEELLKGNPLLN 180
Query: 379 NPASFNVKRRWDDDVVFKNQARGEAKAPKRFINDTIRNDFHRKFLHKYMK 230
P SF+VKRRWDDDVVFKNQARGE KAPKRFINDTIRNDFHRKFLH+YMK
Sbjct: 181 TPTSFSVKRRWDDDVVFKNQARGEMKAPKRFINDTIRNDFHRKFLHRYMK 230
>dbj|BAA90629.1| ESTs AU032852(S15362),AU070591(S5037) correspond to a region of the
predicted gene.~Similar to Caenorhabditis elegans cosmid
T10C6. (Z93388) [Oryza sativa (japonica cultivar-group)]
Length = 230
Score = 374 bits (961), Expect = e-103
Identities = 179/230 (77%), Positives = 209/230 (90%), Gaps = 2/230 (0%)
Frame = -2
Query: 913 MTTAARPTWAPAKGGNEQGGTRIFGPSQKYSSRDIASHTTLKPRRDGQDTQDELKRRNLR 734
MTTAARPTWAPAKGGNEQGGTRIFGPSQKYSSRD+A+HTTLKPR++GQ TQ+EL++RNLR
Sbjct: 1 MTTAARPTWAPAKGGNEQGGTRIFGPSQKYSSRDLAAHTTLKPRKEGQHTQEELQKRNLR 60
Query: 733 DELEERERRHFSSKNKSYSDDRDHGKSSHLFLEGTKRDVEDHIVARSVDADDSDVEVKSD 554
DELEERER+H+SSK+KSY+++RD KS+ L LEG++R+ ED IV R +DADDSDVE +SD
Sbjct: 61 DELEERERKHYSSKDKSYAEERDRRKSTSLLLEGSRREAEDKIVPREIDADDSDVEPRSD 120
Query: 553 EESDDDDDDEDDTEALLAELEQIKKERAEEKLRKERLEQEEELKVKEAELLRGNPL--LN 380
+ESD+DDDD+DDTEAL+AELE+IKKERAEE LRKER + EEE K+KEAEL+RGNPL +N
Sbjct: 121 DESDEDDDDDDDTEALMAELERIKKERAEENLRKERQQAEEEAKMKEAELMRGNPLININ 180
Query: 379 NPASFNVKRRWDDDVVFKNQARGEAKAPKRFINDTIRNDFHRKFLHKYMK 230
N SFNVKRRWDDDVVFKNQARGE K PKRFINDTIR+DFHRKFL +YMK
Sbjct: 181 NAGSFNVKRRWDDDVVFKNQARGETKTPKRFINDTIRSDFHRKFLQRYMK 230
>ref|NP_075642.1| RIKEN cDNA 0610040D20 [Mus musculus] gi|8926741|emb|CAB96547.1|
hypothetical protein [Mus musculus]
gi|12833160|dbj|BAB22414.1| unnamed protein product [Mus
musculus] gi|12833722|dbj|BAB22638.1| unnamed protein
product [Mus musculus] gi|12834742|dbj|BAB23026.1|
unnamed protein product [Mus musculus]
gi|13435729|gb|AAH04726.1| RIKEN cDNA 0610040D20 gene
[Mus musculus]
Length = 229
Score = 206 bits (523), Expect = 6e-52
Identities = 117/238 (49%), Positives = 161/238 (67%), Gaps = 10/238 (4%)
Frame = -2
Query: 913 MTTAARPTWAPAKGGNEQGGTRIFGPSQKYSSRDIASHTTLKPRRDGQDTQDELKRRNLR 734
MTTAARPT+ PA+GG +G + S++YSSRD+ SHT +K R+ QD +E++ R+ R
Sbjct: 1 MTTAARPTFEPARGGRGKGEGDLSQLSKQYSSRDLPSHTKIKYRQTTQDAPEEVRNRDFR 60
Query: 733 DELEERERRHFSSKNKSYSDDRDHGKSSHLFLEGTKRDVEDHIVARSVDADDSDVEVKSD 554
ELEERER KN+ R+H SS + +K+ D I A ++DADD +D
Sbjct: 61 RELEERERAAARDKNRD-RPTREHTTSSSV----SKKPRLDQIPAANLDADDP----LTD 111
Query: 553 EESDD--DDDDEDDTEALLAELEQIKKERAEEKLRKERLEQEEELKVKEAELLRGNPLLN 380
EE +D ++ D+DDT ALLAELE+IKKERAEE+ RKE+ ++ EE +++ +L GNPLLN
Sbjct: 112 EEDEDFEEESDDDDTAALLAELEKIKKERAEEQARKEQEQKAEEERIRMENILSGNPLLN 171
Query: 379 ------NPASFNVKRRWDDDVVFKNQARG--EAKAPKRFINDTIRNDFHRKFLHKYMK 230
A+F VKRRWDDDVVFKN A+G + K KRF+NDT+R++FH+KF+ KY+K
Sbjct: 172 LTGPSQPQANFKVKRRWDDDVVFKNCAKGIDDQKKDKRFVNDTLRSEFHKKFMEKYIK 229
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 857,407,650
Number of Sequences: 1393205
Number of extensions: 20722814
Number of successful extensions: 304246
Number of sequences better than 10.0: 4775
Number of HSP's better than 10.0 without gapping: 110480
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 195922
length of database: 448,689,247
effective HSP length: 123
effective length of database: 277,325,032
effective search space used: 56574306528
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)