Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC004591A_C01 KMC004591A_c01
(470 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_174461.1| unknown protein; protein id: At1g31810.1 [Arabi... 40 0.016
gb|AAL31655.1|AC079179_10 Unknown protein [Oryza sativa] gi|2004... 39 0.021
emb|CAD57687.1| Hypothetical protein C31C9.6 [Caenorhabditis ele... 39 0.036
gb|EAA35389.1| hypothetical protein [Neurospora crassa] 38 0.047
gb|AAA59875.1| mucin 38 0.047
>ref|NP_174461.1| unknown protein; protein id: At1g31810.1 [Arabidopsis thaliana]
gi|25403124|pir||G86441 unknown protein [imported] -
Arabidopsis thaliana
gi|12321294|gb|AAG50715.1|AC079041_8 unknown protein
[Arabidopsis thaliana]
Length = 1201
Score = 39.7 bits (91), Expect = 0.016
Identities = 23/57 (40%), Positives = 25/57 (43%)
Frame = +3
Query: 249 PLPPNTTITSEPPPPPPAPNITTAQL*RSKPTPPLCVSLEV*PPP*QLWRLREQAQP 419
PLPP++T PPPPPP P T P PP V PPP L R P
Sbjct: 703 PLPPSSTRLGAPPPPPPPPLSKT-----PAPPPPPLSKTPVPPPPPGLGRGTSSGPP 754
Score = 32.0 bits (71), Expect = 3.3
Identities = 19/47 (40%), Positives = 25/47 (52%), Gaps = 2/47 (4%)
Frame = +3
Query: 249 PLPPNTTITS--EPPPPPPAPNITTAQL*RSKPTPPLCVSLEV*PPP 383
P PP I++ +PP PPP P +T +L P PP +S PPP
Sbjct: 685 PPPPKANISNAPKPPAPPPLPPSST-RLGAPPPPPPPPLSKTPAPPP 730
Score = 31.2 bits (69), Expect = 5.7
Identities = 17/42 (40%), Positives = 20/42 (47%), Gaps = 1/42 (2%)
Frame = +3
Query: 261 NTTITSEPPPPPPAPNIT-TAQL*RSKPTPPLCVSLEV*PPP 383
+ T+ PPPPPP P T T S+P PP PPP
Sbjct: 477 HVTLLPPPPPPPPPPLFTSTTSFSPSQPPPP--------PPP 510
Score = 30.8 bits (68), Expect = 7.4
Identities = 20/46 (43%), Positives = 23/46 (49%), Gaps = 3/46 (6%)
Frame = +3
Query: 255 PPNTTITSEPPPPPPAP--NITTA-QL*RSKPTPPLCVSLEV*PPP 383
PP+T PPPPPP P NI+ A + P PP L PPP
Sbjct: 676 PPSTP----PPPPPPPPKANISNAPKPPAPPPLPPSSTRLGAPPPP 717
Score = 30.4 bits (67), Expect = 9.7
Identities = 18/50 (36%), Positives = 22/50 (44%), Gaps = 5/50 (10%)
Frame = +3
Query: 249 PLPPNTTIT-----SEPPPPPPAPNITTAQL*RSKPTPPLCVSLEV*PPP 383
P PP+ T ++PPPPPP P T + P PP PPP
Sbjct: 623 PPPPSFGSTGNKRQAQPPPPPPPPPPTRIPAAKCAPPPP--------PPP 664
Score = 30.4 bits (67), Expect = 9.7
Identities = 19/52 (36%), Positives = 23/52 (43%), Gaps = 7/52 (13%)
Frame = +3
Query: 249 PLPPNTTITSE-------PPPPPPAPNITTAQL*RSKPTPPLCVSLEV*PPP 383
P PP T T+ PPPPPP ++T S+P PP PPP
Sbjct: 488 PPPPLFTSTTSFSPSQPPPPPPPPPLFMSTTSFSPSQPPPP--------PPP 531
>gb|AAL31655.1|AC079179_10 Unknown protein [Oryza sativa] gi|20042881|gb|AAM08709.1|AC116601_2
Unknown protein [Oryza sativa]
Length = 1269
Score = 39.3 bits (90), Expect = 0.021
Identities = 18/45 (40%), Positives = 23/45 (51%)
Frame = +3
Query: 249 PLPPNTTITSEPPPPPPAPNITTAQL*RSKPTPPLCVSLEV*PPP 383
P+ PN ++ PPPPPP PN + P PP + V PPP
Sbjct: 610 PILPNRSVPPPPPPPPPLPNHSVLPPPPPPPPPPSLPNRLVPPPP 654
Score = 38.5 bits (88), Expect = 0.036
Identities = 18/45 (40%), Positives = 22/45 (48%)
Frame = +3
Query: 249 PLPPNTTITSEPPPPPPAPNITTAQL*RSKPTPPLCVSLEV*PPP 383
P PN + S PPPPPP P + + P PP + V PPP
Sbjct: 592 PPLPNCLVPSPPPPPPPPPILPNRSVPPPPPPPPPLPNHSVLPPP 636
Score = 38.1 bits (87), Expect = 0.047
Identities = 19/48 (39%), Positives = 24/48 (49%), Gaps = 3/48 (6%)
Frame = +3
Query: 249 PLPPNTTITSEPPP---PPPAPNITTAQL*RSKPTPPLCVSLEV*PPP 383
PLP + +S+PPP PPP PN P PP+ + V PPP
Sbjct: 574 PLPQSNYASSQPPPPPPPPPLPNCLVPSPPPPPPPPPILPNRSVPPPP 621
Score = 30.8 bits (68), Expect = 7.4
Identities = 14/34 (41%), Positives = 17/34 (49%)
Frame = +3
Query: 282 PPPPPPAPNITTAQL*RSKPTPPLCVSLEV*PPP 383
PPPPPP P + + S+P PP PPP
Sbjct: 566 PPPPPPPPPLPQSNYASSQPPPP--------PPP 591
Score = 30.4 bits (67), Expect = 9.7
Identities = 16/45 (35%), Positives = 21/45 (46%)
Frame = +3
Query: 249 PLPPNTTITSEPPPPPPAPNITTAQL*RSKPTPPLCVSLEV*PPP 383
PLP ++ + PPPPPP P +L P P + PPP
Sbjct: 626 PLPNHSVLP--PPPPPPPPPSLPNRLVPPPPAPGIGNKFPAPPPP 668
Score = 30.4 bits (67), Expect = 9.7
Identities = 16/46 (34%), Positives = 17/46 (36%)
Frame = +3
Query: 246 IPLPPNTTITSEPPPPPPAPNITTAQL*RSKPTPPLCVSLEV*PPP 383
+P P T PPPPPP P P PP PPP
Sbjct: 534 LPSPSPTAAAPPPPPPPPPPPSGNKPAFSPPPPPP--------PPP 571
>emb|CAD57687.1| Hypothetical protein C31C9.6 [Caenorhabditis elegans]
Length = 965
Score = 38.5 bits (88), Expect = 0.036
Identities = 17/45 (37%), Positives = 21/45 (45%)
Frame = +3
Query: 249 PLPPNTTITSEPPPPPPAPNITTAQL*RSKPTPPLCVSLEV*PPP 383
P P ++ PPPPPP P I T P PP + E+ PP
Sbjct: 478 PSPQPAEVSKSPPPPPPLPPIATPSSVPPPPPPPPALEQEISGPP 522
>gb|EAA35389.1| hypothetical protein [Neurospora crassa]
Length = 636
Score = 38.1 bits (87), Expect = 0.047
Identities = 20/45 (44%), Positives = 22/45 (48%)
Frame = +3
Query: 249 PLPPNTTITSEPPPPPPAPNITTAQL*RSKPTPPLCVSLEV*PPP 383
P PP TS PPPPPPAP + P PPL + PPP
Sbjct: 460 PPPPPLPATSAPPPPPPAP--------PAPPAPPLPAAHAPPPPP 496
Score = 31.6 bits (70), Expect = 4.4
Identities = 19/53 (35%), Positives = 24/53 (44%), Gaps = 8/53 (15%)
Frame = +3
Query: 249 PLPPNTTITSEPPP--------PPPAPNITTAQL*RSKPTPPLCVSLEV*PPP 383
P PP +S PPP PP AP + T+ + P PPL + PPP
Sbjct: 415 PAPPLPPASSRPPPMLPTRSPAPPQAPPLPTS---NAPPPPPLPATQAPPPPP 464
>gb|AAA59875.1| mucin
Length = 573
Score = 38.1 bits (87), Expect = 0.047
Identities = 19/45 (42%), Positives = 22/45 (48%)
Frame = +3
Query: 249 PLPPNTTITSEPPPPPPAPNITTAQL*RSKPTPPLCVSLEV*PPP 383
P PP TT T+ PP P+P ITT TP +S PPP
Sbjct: 80 PSPPTTTTTTPPPTTTPSPPITTTTTPPPTTTPSPPISTTTTPPP 124
Score = 34.3 bits (77), Expect = 0.67
Identities = 18/45 (40%), Positives = 23/45 (51%)
Frame = +3
Query: 249 PLPPNTTITSEPPPPPPAPNITTAQL*RSKPTPPLCVSLEV*PPP 383
P PP TT T+ PP P+P TT P+PP+ + PPP
Sbjct: 299 PSPPPTTTTTPPPTTTPSPPTTTT------PSPPITTTTT--PPP 335
Score = 34.3 bits (77), Expect = 0.67
Identities = 17/45 (37%), Positives = 23/45 (50%)
Frame = +3
Query: 249 PLPPNTTITSEPPPPPPAPNITTAQL*RSKPTPPLCVSLEV*PPP 383
P PP +T T+ PP P+P TT + P+PP + PPP
Sbjct: 112 PSPPISTTTTPPPTTTPSPPTTTPSPPTTTPSPPTTTTTT--PPP 154
Score = 33.1 bits (74), Expect = 1.5
Identities = 18/45 (40%), Positives = 22/45 (48%)
Frame = +3
Query: 249 PLPPNTTITSEPPPPPPAPNITTAQL*RSKPTPPLCVSLEV*PPP 383
P PP TT T+ PP P+P TT P+PP + PPP
Sbjct: 220 PSPPPTTTTTPPPTTTPSPPTTTT------PSPPTITTTT--PPP 256
Score = 32.0 bits (71), Expect = 3.3
Identities = 16/45 (35%), Positives = 21/45 (46%)
Frame = +3
Query: 249 PLPPNTTITSEPPPPPPAPNITTAQL*RSKPTPPLCVSLEV*PPP 383
P PP T+ T+ PP P+P TT TP ++ PPP
Sbjct: 64 PSPPPTSTTTLPPTTTPSPPTTTTTTPPPTTTPSPPITTTTTPPP 108
Score = 30.8 bits (68), Expect = 7.4
Identities = 16/36 (44%), Positives = 20/36 (55%), Gaps = 2/36 (5%)
Frame = +3
Query: 249 PLPPNTTITSEPPPPPPAPNI--TTAQL*RSKPTPP 350
P PP TT T+ PP P+P I TT + P+PP
Sbjct: 96 PSPPITTTTTPPPTTTPSPPISTTTTPPPTTTPSPP 131
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 446,301,090
Number of Sequences: 1393205
Number of extensions: 12032911
Number of successful extensions: 129747
Number of sequences better than 10.0: 619
Number of HSP's better than 10.0 without gapping: 66848
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 104741
length of database: 448,689,247
effective HSP length: 113
effective length of database: 291,257,082
effective search space used: 12524054526
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)