Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC000521A_C01 KMC000521A_c01
(1460 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gb|AAM13089.1| unknown protein [Arabidopsis thaliana] 316 7e-85
ref|NP_198447.1| protein kinase-like; protein id: At5g35980.1 [A... 171 2e-41
ref|NP_198448.1| unknown protein; protein id: At5g35990.1 [Arabi... 151 2e-35
ref|NP_508292.1| Putative nuclear protein family member, nematod... 47 8e-04
gb|AAL92314.2|AC115598_1 hypothetical protein [Dictyostelium dis... 46 0.002
>gb|AAM13089.1| unknown protein [Arabidopsis thaliana]
Length = 956
Score = 316 bits (809), Expect = 7e-85
Identities = 191/380 (50%), Positives = 231/380 (60%), Gaps = 18/380 (4%)
Frame = -1
Query: 1460 LGNSPDARRRVKYQPG----NGLGVSPSAGNFAPLPLGASPSQFTPPNSYSQVSVGSPGH 1293
LG SPDARRRV P NGLG SPSAGNFAPLPLG SPSQFTP N+ +Q GSPGH
Sbjct: 584 LGTSPDARRRVMQYPHGNGPNGLGTSPSAGNFAPLPLGTSPSQFTP-NTNNQFLAGSPGH 642
Query: 1292 YGPTSPARGTTHGSPLGKTAAASQFNRRKNWGHSGSPQTQETTFS-SHWHGQYPDSSSHA 1116
+GPTSP R + HGSPLGK AA SQ NRR + G+SG Q+Q+++ S + HG D+
Sbjct: 643 HGPTSPVRNSCHGSPLGKMAAFSQINRRMSAGYSGGSQSQDSSLSQAQGHGM--DNFYQN 700
Query: 1115 EGTSQALGSSPSYLQPNTNPGNWKQRGSG-----GISANQNITCSMMPSSNRNSQLTELV 951
EG S SPS Q ++ N KQ G G S + N S+ S+ N T
Sbjct: 701 EGYSGQFSGSPSRRQLDSGVKNRKQTQGGTTLSTGYSTHNNANSSLR-SNMYNPSSTAHH 759
Query: 950 CDDVETGISLPDPGDWDPNYSDELLLQEDGSDESSLTTEFG-SMNLGSTEPWSGFGRFNP 774
++ +T +S+PDPGDWDPNYSD+LLL+ED +DESSL F M LGST+ S RFN
Sbjct: 760 LENPDTALSVPDPGDWDPNYSDDLLLEEDSADESSLANAFSRGMQLGSTDASSYSRRFNS 819
Query: 773 -VSNSSTPIFVQRRNGPGQTFSNVEMGSPPTHDLHA---SYIPSMSKPFHAMPHISQNTP 606
S SS+ QRR P Q FS VE GSPP++D HA +IP +PH+SQN+P
Sbjct: 820 NASTSSSNPTTQRRYAPNQAFSQVETGSPPSNDPHARFGQHIPGS----QYIPHVSQNSP 875
Query: 605 SRLGHQSVQRFTHGRPPPGG--DWNQIKLQAPPSGFNSVG-PRSPRNTSFTNSMSWGRRM 435
SRLG Q QR+ HGRP G D N + Q PPS NS G RSPR++S+TN + WGRR
Sbjct: 876 SRLGQQPPQRYNHGRPNAGRTMDRNHMNAQLPPSNTNSGGQQRSPRSSSYTNGVPWGRRT 935
Query: 434 NPPVSNIPPASRTRKDYARI 375
N V N+P S R DY I
Sbjct: 936 NNHVPNVPSTSHGRVDYGSI 955
>ref|NP_198447.1| protein kinase-like; protein id: At5g35980.1 [Arabidopsis thaliana]
gi|9758801|dbj|BAB09254.1| protein kinase-like
[Arabidopsis thaliana]
Length = 787
Score = 171 bits (434), Expect = 2e-41
Identities = 101/200 (50%), Positives = 123/200 (61%), Gaps = 10/200 (5%)
Frame = -1
Query: 1460 LGNSPDARRRVKYQPG----NGLGVSPSAGNFAPLPLGASPSQFTPPNSYSQVSVGSPGH 1293
LG SPDARRRV P NGLG SPSAGNFAPLPLG SPSQFTP N+ +Q GSPGH
Sbjct: 584 LGTSPDARRRVMQYPHGNGPNGLGTSPSAGNFAPLPLGTSPSQFTP-NTNNQFLAGSPGH 642
Query: 1292 YGPTSPARGTTHGSPLGKTAAASQFNRRKNWGHSGSPQTQETTFS-SHWHGQYPDSSSHA 1116
+GPTSP R + HGSPLGK AA SQ NRR + G+SG Q+Q+++ S + HG D+
Sbjct: 643 HGPTSPVRNSCHGSPLGKMAAFSQINRRMSAGYSGGSQSQDSSLSQAQGHGM--DNFYQN 700
Query: 1115 EGTSQALGSSPSYLQPNTNPGNWKQRGSG-----GISANQNITCSMMPSSNRNSQLTELV 951
EG S SPS Q ++ N KQ G G S + N S+ S+ N T
Sbjct: 701 EGYSGQFSGSPSRRQLDSGVKNRKQTQGGTTLSTGYSTHNNANSSLR-SNMYNPSSTAHH 759
Query: 950 CDDVETGISLPDPGDWDPNY 891
++ +T +S+PDPGDWDPNY
Sbjct: 760 LENPDTALSVPDPGDWDPNY 779
>ref|NP_198448.1| unknown protein; protein id: At5g35990.1 [Arabidopsis thaliana]
gi|9758802|dbj|BAB09255.1| gene_id:MEE13.10~unknown
protein [Arabidopsis thaliana]
Length = 179
Score = 151 bits (382), Expect = 2e-35
Identities = 91/181 (50%), Positives = 109/181 (59%), Gaps = 8/181 (4%)
Frame = -1
Query: 893 YSDELLLQEDGSDESSLTTEFG-SMNLGSTEPWSGFGRFNP-VSNSSTPIFVQRRNGPGQ 720
YSD+LLL+ED +DESSL F M LGST+ S RFN S SS+ QRR P Q
Sbjct: 2 YSDDLLLEEDSADESSLANAFSRGMQLGSTDASSYSRRFNSNASTSSSNPTTQRRYAPNQ 61
Query: 719 TFSNVEMGSPPTHDLHASY---IPSMSKPFHAMPHISQNTPSRLGHQSVQRFTHGRPPPG 549
FS VE GSPP++D HA + IP +PH+SQN+PSRLG Q QR+ HGRP G
Sbjct: 62 AFSQVETGSPPSNDPHARFGQHIPGSQY----IPHVSQNSPSRLGQQPPQRYNHGRPNAG 117
Query: 548 G--DWNQIKLQAPPSGFNSVGP-RSPRNTSFTNSMSWGRRMNPPVSNIPPASRTRKDYAR 378
D N + Q PPS NS G RSPR++S+TN + WGRR N V N+P S R DY
Sbjct: 118 RTMDRNHMNAQLPPSNTNSGGQQRSPRSSSYTNGVPWGRRTNNHVPNVPSTSHGRVDYGS 177
Query: 377 I 375
I
Sbjct: 178 I 178
>ref|NP_508292.1| Putative nuclear protein family member, nematode specific
[Caenorhabditis elegans] gi|7505357|pir||T34434
hypothetical protein K06A9.1a - Caenorhabditis elegans
gi|3834294|gb|AAC70890.1| Hypothetical protein K06A9.1b
[Caenorhabditis elegans]
Length = 2232
Score = 47.0 bits (110), Expect = 8e-04
Identities = 93/368 (25%), Positives = 131/368 (35%), Gaps = 45/368 (12%)
Frame = -1
Query: 1418 PGNGL-GVSPSAGNFAPLPLGASPSQFTPP-NSYSQVSVGSPGHYGPTSPARGTTHGSP- 1248
PG L +SPS + + G+S +P ++ SQ S +PG G T T GS
Sbjct: 1014 PGTTLTSISPSPSPSSTI--GSSQGSTSPVVSTISQGSTETPGSTGSTVTKPSTVSGSAS 1071
Query: 1247 ------LGKTAAASQF---NRRKNWGHSGSPQTQETTFSSHWHGQYPDSSSHAEGTSQAL 1095
+G T A+S + N S SP T T S G S S + S +
Sbjct: 1072 SGSTATMGSTEASSTSGGSSTSPNPSQSTSPSTSGATSSPGSSGTTLTSISPSPSQSSTI 1131
Query: 1094 GSSPSYLQP--NTNPGNWKQR------------------GSGGISANQNITCSMMPSSNR 975
GSS P +T G+ + GSG S + IT + R
Sbjct: 1132 GSSQGSTSPVVSTTSGDMTSQGSTQIPGSTGSTVTQPSTGSGSTSTSGEITSQGSTQTPR 1191
Query: 974 NSQLTE-LVCDDVETGISLPDPGDWDPNYSDELLLQEDGSDESSLTTEFGSMNLGSTEPW 798
+S T + + +S PG + ++ S S++TT GSTE
Sbjct: 1192 SSLSTSPAISTSTQQSVSTNSPGS---TVTQPSTVRGSTSSGSTVTT-------GSTEGS 1241
Query: 797 SGFGRFNPVSNSSTPIFVQRRNGPGQTFSNVEMGSPPTHDLHASYIPSMSKPFHAM-PHI 621
S G + S SS+ P + S S PT + S P +S M H
Sbjct: 1242 STSGSSSATSLSSSSPVPSTSQSPNPSTSG---SSTPTPNPSQSTSPVVSTTTGEMTSHG 1298
Query: 620 SQNTPSRLGHQSVQRFTHGRPPPGGDWNQI----------KLQAPPSGFNSVGPRSP-RN 474
S TPS +G Q T G I + PS + V SP +
Sbjct: 1299 STQTPSTIGSTVTQPSTVSGSNSSGSTVTIGSSEASTSGSSFKTSPSSISPVPTSSPIPS 1358
Query: 473 TSFTNSMS 450
T+F +S S
Sbjct: 1359 TTFASSTS 1366
Score = 42.4 bits (98), Expect = 0.019
Identities = 65/279 (23%), Positives = 103/279 (36%), Gaps = 7/279 (2%)
Frame = -1
Query: 1400 VSPSAGNFAPLPLGASPSQFTPPNSYSQVSVGSPGHYGPTSPARGTTHGSPLGKTAAASQ 1221
+S S G+ + G+S T +S S SPG +P +T+GS +++S
Sbjct: 351 ISGSTGSTVTVVPGSSS---TFASSTPIASSSSPGSTVTVAPGSSSTYGSSTPSASSSSS 407
Query: 1220 FNRRKNWGHSGSPQTQETTFSSHWHGQYPDSSSHAEGTSQAL--GSSPSYLQPNTNPGNW 1047
N G +GS T SS + P +SS + G++ + GSS +Y + +
Sbjct: 408 GTMSTNSGSTGSTVTVAPVSSSTFGSSTPIASSSSSGSTVTVVSGSSSTYGSSTPSASSS 467
Query: 1046 KQRGSGGISANQNITCSMMPSSNRNSQLTELVCDDVETGISLPDPGDWDPNYSDELLLQE 867
+ IS + T +++P S+ + + G G +
Sbjct: 468 SAGTASTISGSTGSTATIVPGSSSSVGSSTQSASPSSPGTMSTVSGPTGSTVTVVPGSST 527
Query: 866 DGSDESSLTTEFGSMNLGSTEPWSGFGRF--NPVSNSSTPIFVQRRNGPGQT---FSNVE 702
+ SS + GST SG + VS S+ V G Q+ S
Sbjct: 528 SPAPSSSPNPSSSPASTGSTITISGSSSIIVSTVSGST----VSGSTGTSQSTLASSTAT 583
Query: 701 MGSPPTHDLHASYIPSMSKPFHAMPHISQNTPSRLGHQS 585
GS T +S PS P P+ TPS+ QS
Sbjct: 584 PGSSSTVPSSSSPQPSSQSP---APNTGSTTPSQTSSQS 619
Score = 38.9 bits (89), Expect = 0.21
Identities = 69/333 (20%), Positives = 115/333 (33%), Gaps = 14/333 (4%)
Frame = -1
Query: 1400 VSPSAGNFAPLPLGASPSQFTPPNSYSQVSVGSPGHYGPTSPARGTTHGSPLGKTAAASQ 1221
+SPS + L SPS + S ++ SP TS +T G+ +A +
Sbjct: 689 LSPSTSGMSTLTSEPSPSSTQSSGAQSTLTTPSPNPSQSTSSLESSTSGATTSSGSAGTT 748
Query: 1220 FNRRKNWGHSGSPQTQETTFSSHWHGQYPDSSS-----HAEGTSQALGSSPSYLQPNTNP 1056
GS Q + +S G+ S + TS A+ +S +P
Sbjct: 749 MTSPSQSSSVGSSQGSTSPAASTTSGEMTSQGSTQTPGSSVSTSAAILTSTQQSVSTNSP 808
Query: 1055 GNWKQRG---SGGISANQNITCSMMPSSNRNSQLTELVCDDVETGISLPDPG-DWDPNYS 888
G+ R SG S+ +T +S S + S P P +PN S
Sbjct: 809 GSTVTRPSTVSGSTSSGSTVTVGSTEASTSGSSVAS----------SSPAPSTSQNPNPS 858
Query: 887 ----DELLLQEDGSDESSLTTEFGSMNLGSTEPWSGFGRFNPVSNSSTPI-FVQRRNGPG 723
++ Q +S+ E S P + +P + ST I Q PG
Sbjct: 859 TSSGSSMITQSPYPSQSTSPVE-SSTTPSPGSPGTTLTSTSPSPSQSTTIGSTQGSTSPG 917
Query: 722 QTFSNVEMGSPPTHDLHASYIPSMSKPFHAMPHISQNTPSRLGHQSVQRFTHGRPPPGGD 543
+ ++ EM S + S ++++P S + +G P P
Sbjct: 918 ISTTSEEMTSQGSTQTPGSTGSTVTQPSTVSDSTSSGSTVTVGSTE----GSSSPIPSTS 973
Query: 542 WNQIKLQAPPSGFNSVGPRSPRNTSFTNSMSWG 444
N + S ++ P+S ++TS S + G
Sbjct: 974 QNTNPSTSSGSSMSTQTPQSSQSTSPVESSTSG 1006
>gb|AAL92314.2|AC115598_1 hypothetical protein [Dictyostelium discoideum]
Length = 1033
Score = 45.8 bits (107), Expect = 0.002
Identities = 50/219 (22%), Positives = 89/219 (39%), Gaps = 3/219 (1%)
Frame = -1
Query: 1382 NFAPLPLGASPSQFTPPNSYSQVSVGSPG--HYGPTSPARGTTHGSPLGKTAAASQFNRR 1209
+ P+ P P Q +V SP Y T T H SP T + N
Sbjct: 688 SLTPIQAIKLPEYTISPEYEPQTTVDSPSTSSYSSTPGGANTAH-SPA--TLSTPNLNNS 744
Query: 1208 KNWGHSGSPQTQETTFSSHWHGQYPDSSSHAEGTSQALGSSPSYLQPNTNPGNWKQRGSG 1029
N +S + +Q + H H + +++++ +S + GSS S N++ G GSG
Sbjct: 745 NNSVNSNNTPSQ---YHHHHHHHHHHNNNNSSSSSSSSGSSSSTNGNNSSGGGGG--GSG 799
Query: 1028 GISANQNITCSMMPSSNRNSQLTELVCDDVETGISLPDPGDWDPNYSDELLLQEDGSD-E 852
S + +I + PSSN + + ++ + + T P PNY D ++ + + +
Sbjct: 800 SSSNSVDIIQASTPSSNNSGGINIIIPNPMSTNTIPP------PNYGDSMMYNDPFFERK 853
Query: 851 SSLTTEFGSMNLGSTEPWSGFGRFNPVSNSSTPIFVQRR 735
+S+ S N +P+ +S P F +R+
Sbjct: 854 NSIQNGLFSFN-------------DPLRKNSNPFFSERK 879
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,399,770,256
Number of Sequences: 1393205
Number of extensions: 35726198
Number of successful extensions: 132496
Number of sequences better than 10.0: 298
Number of HSP's better than 10.0 without gapping: 112591
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 130770
length of database: 448,689,247
effective HSP length: 127
effective length of database: 271,752,212
effective search space used: 97559044108
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)