KMC004654A_c01
Fasta Sequence
>KMC004654A_C01 KMC004654A_c01
aaaaaataagcctctactggtatatatgcctttcagggaattcatgctttagtgacaggg
gctcacatataacAGCAACAGCTAAATTGAAATCACGCTAAGGATGTTTAGCAAATTCAA
AAACAAAGCCGAAAACAATTCACCAAGCAAGGTCCAGTTCTACTATATAAGTTCAGCAAT
GTTCAAAAAACCACACCAGTAATGAACCTGAATATTCTACATTCATCCATAGTATATGTG
AAATTTACGCAGACTAGAGGTTTTGGTAAATGCCCTGATAAATTCAAGGAGTTGTACAGA
AGCCAGCAACACATTGCAAACGGCCTGATGCCTCATCTTCCATGTCTGGGTGACAAATCA
GCAGATCTTGATCTAGATCGAAGTGGTGATCTTGATTCTCTGCGGGATGACTTCAACTGG
CCTCGATCAGCCTCAGACTTGTATACAGATTTCTCAGTATAGCCATTGCCTTGGTTCTGA
TCAGCACCATTGTCATAAGGTGGAGAGTATGACCTGCGCTTATGATCACCATCTCGCTCC
CTAGAAGGGCTTCTAGGTGATCTAGGGCTTCTAGGTGATCTAGGGTGCTCTGCTTGCCTT
CTAGGGGAAACAGAGTAATCATCACGCCGTCTTGGAG
Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC004654A_C01 KMC004654A_c01
(637 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value
emb|CAC03602.1| SC35-like splicing factor SCL30, 30 kD [Arabidop...    57 2e-07
ref|NP_567021.1| arginine/serine-rich protein SCL30; protein id:...    57 2e-07
pir||T47685 probable RNA binding protein - Arabidopsis thaliana ...    51 1e-05
ref|NP_651272.1| CG13625-PA [Drosophila melanogaster] gi|7301183...    45 8e-04
ref|NP_498134.1| Putative nuclear protein, with a coiled coil do...    42 0.007
>emb|CAC03602.1| SC35-like splicing factor SCL30, 30 kD [Arabidopsis thaliana]
Length = 262
Score = 57.4 bits (137), Expect = 2e-07
Identities = 42/103 (40%), Positives = 53/103 (50%), Gaps = 6/103 (5%)
Frame = -3
Query: 635 PRRRDDYSVSPRRQAEHPRSPRSPRSPRSPSRERDGDHKRRSYSPPYDNGA------DQN 474
PRR D S S R + +PR P P E D ++ RRSYSP Y+ A D+N
Sbjct: 166 PRRPSD-SRSRYRSRSYSPAPRRRGGP--PRGEEDENYSRRSYSPGYEGAAAAAPDRDRN 222
Query: 473 QGNGYTEKSVYKSEADRGQLKSSRRESRSPLRSRSRSADLSPR 345
N EK Y++E R + R SRSP SRSRS ++SPR
Sbjct: 223 GDNEIREKPGYEAEDRR---RGGRAVSRSPSGSRSRSVEVSPR 262
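The "Frame = -3" label and the descending query coordinates (635 down to 345) in the alignment above mean the match lies on the reverse-complement strand, read from the third offset. A minimal sketch of that translation follows, assuming the standard genetic code; the function name `translate_minus_frame` and the constant `TAIL` are illustrative and not part of BLAST. `TAIL` is the last line of the FASTA record (positions 601-637). Because it shares its 3' end with the full contig, its reverse frame -3 coincides with that of the whole sequence.

```python
# Standard genetic code with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def translate_minus_frame(seq, frame):
    """Translate seq in a reverse frame (-1, -2 or -3).

    The sequence is reverse-complemented, then read in codons starting
    at offset abs(frame) - 1. Codons with ambiguous bases become 'X'.
    """
    if frame not in (-1, -2, -3):
        raise ValueError("frame must be -1, -2 or -3")
    rc = seq.upper().translate(COMPLEMENT)[::-1]
    start = abs(frame) - 1
    codons = (rc[i:i + 3] for i in range(start, len(rc) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# Last line of the FASTA record above (contig positions 601-637).
TAIL = "CTAGGGGAAACAGAGTAATCATCACGCCGTCTTGGAG"

print(translate_minus_frame(TAIL, -3))  # PRRRDDYSVSP
```

The printed peptide matches the first eleven residues of the Query line that starts at position 635.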
>ref|NP_567021.1| arginine/serine-rich protein SCL30; protein id: At3g55460.1,
supported by cDNA: 21694., supported by cDNA:
gi_20466365 [Arabidopsis thaliana]
gi|20466366|gb|AAM20500.1| putative RNA binding protein
[Arabidopsis thaliana] gi|21554261|gb|AAM63336.1|
putative RNA binding protein [Arabidopsis thaliana]
gi|22136316|gb|AAM91236.1| putative RNA binding protein
[Arabidopsis thaliana]
Length = 262
Score = 57.4 bits (137), Expect = 2e-07
Identities = 42/103 (40%), Positives = 53/103 (50%), Gaps = 6/103 (5%)
Frame = -3
Query: 635 PRRRDDYSVSPRRQAEHPRSPRSPRSPRSPSRERDGDHKRRSYSPPYDNGA------DQN 474
PRR D S S R + +PR P P E D ++ RRSYSP Y+ A D+N
Sbjct: 166 PRRPSD-SRSRYRSRSYSPAPRRRGGP--PRGEEDENYSRRSYSPGYEGAAAAAPDRDRN 222
Query: 473 QGNGYTEKSVYKSEADRGQLKSSRRESRSPLRSRSRSADLSPR 345
N EK Y++E R + R SRSP SRSRS ++SPR
Sbjct: 223 GDNEIREKPGYEAEDRR---RGGRAVSRSPSGSRSRSVEVSPR 262
>pir||T47685 probable RNA binding protein - Arabidopsis thaliana
gi|7076789|emb|CAB75904.1| putative RNA binding protein
[Arabidopsis thaliana]
Length = 309
Score = 50.8 bits (120), Expect = 1e-05
Identities = 39/99 (39%), Positives = 49/99 (49%), Gaps = 6/99 (6%)
Frame = -3
Query: 635 PRRRDDYSVSPRRQAEHPRSPRSPRSPRSPSRERDGDHKRRSYSPPYDNGA------DQN 474
PRR D S S R + +PR P P E D ++ RRSYSP Y+ A D+N
Sbjct: 196 PRRPSD-SRSRYRSRSYSPAPRRRGGP--PRGEEDENYSRRSYSPGYEGAAAAAPDRDRN 252
Query: 473 QGNGYTEKSVYKSEADRGQLKSSRRESRSPLRSRSRSAD 357
N EK Y++E R + R SRSP SRSRS +
Sbjct: 253 GDNEIREKPGYEAEDRR---RGGRAVSRSPSGSRSRSVE 288
>ref|NP_651272.1| CG13625-PA [Drosophila melanogaster] gi|7301183|gb|AAF56315.1|
CG13625-PA [Drosophila melanogaster]
gi|15010486|gb|AAK77291.1| GH07383p [Drosophila
melanogaster]
Length = 647
Score = 45.1 bits (105), Expect = 8e-04
Identities = 36/105 (34%), Positives = 51/105 (48%), Gaps = 8/105 (7%)
Frame = -3
Query: 632 RRRDDYSVSPRRQAEHPRSPRSPRSPRS-----PSRERDGDH---KRRSYSPPYDNGADQ 477
RRRDD PRR+ +SP PR P+ P R+RD D ++R SP + +DQ
Sbjct: 335 RRRDDKQTPPRRRRNSDQSP--PRRPKDVDQSPPRRKRDFDQSPTRKRDKSPRRRHDSDQ 392
Query: 476 NQGNGYTEKSVYKSEADRGQLKSSRRESRSPLRSRSRSADLSPRH 342
+ + +S +S R + K R+ SR S S+SA P H
Sbjct: 393 SPARNH--RSRERSPPPRNRFKEERK-SRWAKASPSKSASPPPTH 434
Score = 34.3 bits (77), Expect = 1.4
Identities = 25/74 (33%), Positives = 35/74 (46%), Gaps = 7/74 (9%)
Frame = -3
Query: 635 PRRRDDYSVS-PRRQAEHPRSPRSPRSPRSPSRERDGD------HKRRSYSPPYDNGADQ 477
PRR D S PRR+ + +SP R +SP R D D H+ R SPP N +
Sbjct: 355 PRRPKDVDQSPPRRKRDFDQSPTRKRD-KSPRRRHDSDQSPARNHRSRERSPPPRNRFKE 413
Query: 476 NQGNGYTEKSVYKS 435
+ + + + S KS
Sbjct: 414 ERKSRWAKASPSKS 427
Score = 34.3 bits (77), Expect = 1.4
Identities = 31/103 (30%), Positives = 40/103 (38%), Gaps = 4/103 (3%)
Frame = -3
Query: 632 RRRDDYSVSPRRQAEHPRSPRSPRSPRSPSRERDGDHKRR----SYSPPYDNGADQNQGN 465
RRRD PRR R S +SP R++D RR SPP AD
Sbjct: 245 RRRDSDQSPPRR-----RRSNSDQSPPRRRRDKDSTPPRRRKDSDQSPPRRRKADDQSPV 299
Query: 464 GYTEKSVYKSEADRGQLKSSRRESRSPLRSRSRSADLSPRHGR 336
K ++D+ + R +SP+R R S PR R
Sbjct: 300 RRERK-----DSDQSPPRKRRDNDQSPVRRRRDSDQSPPRRRR 337
Score = 31.6 bits (70), Expect = 9.2
Identities = 32/102 (31%), Positives = 45/102 (43%), Gaps = 7/102 (6%)
Frame = -3
Query: 632 RRRDDYSVSPRRQAEHPRSP--RSPRSPRSPSRERDGDHKRRSYSPP---YDNGADQNQG 468
RRRD S PRR+ + +SP R +SP R + K SPP DN +
Sbjct: 268 RRRDKDSTPPRRRKDSDQSPPRRRKADDQSPVRR---ERKDSDQSPPRKRRDNDQSPVRR 324
Query: 467 NGYTEKSVYKSEADRGQLKSSRRES--RSPLRSRSRSADLSP 348
+++S + D Q RR + +SP R R + D SP
Sbjct: 325 RRDSDQSPPRRRRDDKQTPPRRRRNSDQSPPR-RPKDVDQSP 365
>ref|NP_498134.1| Putative nuclear protein, with a coiled coil domain, of bilaterial
origin (52.8 kD) [Caenorhabditis elegans]
gi|7503937|pir||T16418 hypothetical protein F52C9.7 -
Caenorhabditis elegans gi|1055054|gb|AAA81056.1|
Hypothetical protein F52C9.7 [Caenorhabditis elegans]
Length = 451
Score = 42.0 bits (97), Expect = 0.007
Identities = 31/101 (30%), Positives = 44/101 (42%), Gaps = 1/101 (0%)
Frame = -3
Query: 635 PRRRDDYSVSPRRQAEHPRSPRSPRSPRSPSRERDGDHKRRSYSPPYDNGADQNQGNGYT 456
P+RR S PR+++ PR R PRSP R R+ + ++ S P +
Sbjct: 160 PKRRRRPSEKPRKRSRSPRRER----PRSPKRSRE-ESRKLSRKPSSSRSKSPRRSREDP 214
Query: 455 EKSVYKSEADRGQ-LKSSRRESRSPLRSRSRSADLSPRHGR 336
K K R + + RR+SRSP RSR +S R
Sbjct: 215 RKVARKPSRSRSRSAERLRRKSRSPRRSREEPRKMSRSRSR 255
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 506,076,454
Number of Sequences: 1393205
Number of extensions: 10626481
Number of successful extensions: 41612
Number of sequences better than 10.0: 537
Number of HSP's better than 10.0 without gapping: 34282
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 39641
length of database: 448,689,247
effective HSP length: 118
effective length of database: 284,291,057
effective search space used: 26439068301
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
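The statistics block above can be cross-checked against the per-hit scores. The sketch below assumes the usual Karlin-Altschul conversion from raw score S to bit score, (lambda * S - ln K) / ln 2, using the gapped lambda and K printed in the report, and assumes the effective database length is the database length minus one effective HSP length per sequence. The function names are illustrative. The effective query length of 93 is not printed in the report; it is inferred here by dividing the reported search space by the effective database length.

```python
import math

# Values copied from the report's statistics block.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
DB_LETTERS = 448_689_247
DB_SEQUENCES = 1_393_205
EFFECTIVE_HSP_LENGTH = 118
REPORTED_EFFECTIVE_DB = 284_291_057
REPORTED_SEARCH_SPACE = 26_439_068_301

# (raw score, reported bit score) pairs taken from the alignments.
REPORTED_SCORES = [(137, 57.4), (120, 50.8), (105, 45.1),
                   (97, 42.0), (77, 34.3), (70, 31.6)]


def bit_score(raw, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to bits."""
    return (lam * raw - math.log(k)) / math.log(2)


def effective_db_length(letters, sequences, hsp_length):
    """Subtract one effective HSP length per database sequence."""
    return letters - sequences * hsp_length


for raw, reported in REPORTED_SCORES:
    print(f"raw {raw:>3}: computed {bit_score(raw):.2f} bits, reported {reported}")

eff_db = effective_db_length(DB_LETTERS, DB_SEQUENCES, EFFECTIVE_HSP_LENGTH)
print("effective database length:", eff_db)

# Inferred, not printed in the report: search space / effective db length.
eff_query, remainder = divmod(REPORTED_SEARCH_SPACE, eff_db)
print("inferred effective query length:", eff_query, "remainder:", remainder)
```

Because lambda and K are printed to only three significant figures, the computed bit scores agree with the reported ones to within about 0.1 bit rather than exactly.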
EST assemble image
    | clone       | accession | position
1   | MR073f10_f  | BP081631  | 1-354
2   | SPD064b01_f | BP049079  | 74-637
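The two ESTs listed above tile the 637-letter contig. A small sketch of the arithmetic follows, assuming the two numbers in the position column are 1-based inclusive start and end coordinates on the contig. The `Est` type and helper names are illustrative.

```python
from typing import NamedTuple, Optional, Tuple


class Est(NamedTuple):
    clone: str
    accession: str
    start: int  # assumed 1-based, inclusive
    end: int    # assumed 1-based, inclusive


ESTS = [
    Est("MR073f10_f", "BP081631", 1, 354),
    Est("SPD064b01_f", "BP049079", 74, 637),
]


def overlap(a: Est, b: Est) -> Optional[Tuple[int, int]]:
    """Return the shared interval of two ESTs, or None if they are disjoint."""
    lo, hi = max(a.start, b.start), min(a.end, b.end)
    return (lo, hi) if lo <= hi else None


def covered_length(ests) -> int:
    """Count contig positions covered by at least one EST."""
    total, reach = 0, 0
    for est in sorted(ests, key=lambda e: e.start):
        first_new = max(est.start, reach + 1)
        if est.end >= first_new:
            total += est.end - first_new + 1
            reach = est.end
    return total


shared = overlap(ESTS[0], ESTS[1])
print("overlap:", shared, "length:", shared[1] - shared[0] + 1)
print("covered length:", covered_length(ESTS))
```

Under that assumption the clones overlap on positions 74-354 (281 letters) and together cover all 637 letters of the query.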
Lotus japonicus
Kazusa DNA Research Institute