KMC003301A_c01

Fasta Sequence
>KMC003301A_C01 KMC003301A_c01
ttccaaTCCAATCCCTCACTCTCACTCCCAACCACAACACTAGTGACTCCAACCAAACAC
AAACTAAGAGTACTCTGTTTTTCTCTTCAACACTGTTCTTTTCTCTTTTCATCTTGTCTT
CAAATGGCGGAAAATTTAGCCAGCATTGAACCATGGATGTACCGACCCTCCTCCGTCGCG
GACTCATGGCTCGCCGATTACATAGCACGTGACGCCGAAACTCTCACTAAGGCGCTTCAG
AAATCACTCTCCAACGACGTTGACGCGGCGATTTCTCCTCTTTTCAACCTTGTCAGGACC
GACGCCGCTGTTTCCCCTGCTCTCCCGGCGACTCCGACCGTCTCCAGCCTCTCCGGCTCC
GACCAGGACTCCCAGCAGCCGAAGCGCAACCGCGTCTCCGGCGGGAGAGTCTCCAAGCGG
AAGTCACGCGCGTCGAAGCGGTCGCAGACGACTTTCATCACGGCGGACCCGGCGAATTTC
CGGCAGATGGTGCAGCAGATCACCGGCGCGAGGTTTACCGGCTCGTTGCCGGCGCCAATG
GCGCCGGTGGTTCGACCGGAGCCACTGAAGGCGGCGGTCGGTGCCGGAGGGAGGAT
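
The BLASTX hits reported for this contig are mostly in frame +1, and the best alignment starts at query nucleotide 127. As a quick sanity check, the minimal Python sketch below translates nucleotides 127-180 of the sequence above using the standard genetic code. The names `CODON_TABLE` and `translate` are illustrative helpers introduced here, not part of BLAST or of any Kazusa tool, and the use of the standard code (translation table 1) is an assumption.

```python
from itertools import product

# Standard genetic code (assumed: translation table 1), codons in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(nt):
    """Translate a nucleotide string codon by codon; 'X' marks unknown codons."""
    nt = nt.upper()
    return "".join(CODON_TABLE.get(nt[i:i + 3], "X")
                   for i in range(0, len(nt) - 2, 3))

# Third sequence line of the record, i.e. nucleotides 121-180 of the contig.
line_121_180 = "CAAATGGCGGAAAATTTAGCCAGCATTGAACCATGGATGTACCGACCCTCCTCCGTCGCG"

# Position 127 is in frame +1 because (127 - 1) is a multiple of 3.
segment = line_121_180[127 - 121:]
print(translate(segment))  # AENLASIEPWMYRPSSVA
```

The printed peptide matches the beginning of the first Query line in the BLASTX alignments (gaps removed).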


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC003301A_C01 KMC003301A_c01
         (596 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_191247.1| putative protein; protein id: At3g56880.1, supp...   116  2e-25
ref|NP_181634.1| unknown protein; protein id: At2g41010.1, suppo...   100  2e-20
dbj|BAA89548.1| unnamed protein product [Oryza sativa (japonica ...    60  2e-08
dbj|BAC10346.1| contains EST AU183230(E51202)~unknown protein [O...    60  3e-08
ref|NP_492875.2| pre-mRNA splicing SR protein related RSR-1 (68....    56  4e-07

>ref|NP_191247.1| putative protein; protein id: At3g56880.1, supported by cDNA:
           39584., supported by cDNA: gi_15028354 [Arabidopsis
           thaliana] gi|11289614|pir||T51276 hypothetical protein
           T8M16_210 - Arabidopsis thaliana
           gi|9663007|emb|CAC00751.1| putative protein [Arabidopsis
           thaliana] gi|15028355|gb|AAK76654.1| unknown protein
           [Arabidopsis thaliana] gi|24030447|gb|AAN41377.1|
           unknown protein [Arabidopsis thaliana]
          Length = 245

 Score =  116 bits (290), Expect = 2e-25
 Identities = 75/163 (46%), Positives = 106/163 (65%), Gaps = 13/163 (7%)
 Frame = +1

Query: 127 AENLASIEPWMYRPSSVADSWL-ADYIARDAETLTKALQKSLSNDVDAA-ISPLFNLVRT 300
           +E LAS++PW +R +   DSWL +D  + D++ L KAL +S+S   +++ +SP      +
Sbjct: 4   SEGLASVDPWSFRQNFNIDSWLLSDSFSHDSDILAKALHRSISTSTESSPLSPSSFFDSS 63

Query: 301 DAAVSPALPATPTVSSLS-GSDQDSQQP------KRNR---VSGGRVSKRKSRAS-KRSQ 447
            AAV  + P T  +S++S GSD +          KR R   VSGG+ +KR+SR S K+SQ
Sbjct: 64  TAAVDFSPPQT--LSNVSFGSDPEIPAASALGLGKRKRGPGVSGGKQTKRRSRVSNKKSQ 121

Query: 448 TTFITADPANFRQMVQQITGARFTGSLPAPMAPVVRPEPLKAA 576
           TTFITAD ANFRQMVQQ+TGA+F GS  +  AP+V+PEP + A
Sbjct: 122 TTFITADAANFRQMVQQVTGAKFLGSSNSIFAPIVKPEPHRLA 164

>ref|NP_181634.1| unknown protein; protein id: At2g41010.1, supported by cDNA:
           gi_13272432 [Arabidopsis thaliana]
           gi|7487625|pir||T02118 hypothetical protein At2g41010
           [imported] - Arabidopsis thaliana
           gi|3402716|gb|AAD12010.1| unknown protein [Arabidopsis
           thaliana] gi|13272433|gb|AAK17155.1|AF325087_1 unknown
           protein [Arabidopsis thaliana]
           gi|26449325|dbj|BAC41790.1| unknown protein [Arabidopsis
           thaliana]
          Length = 238

 Score = 99.8 bits (247), Expect = 2e-20
 Identities = 62/156 (39%), Positives = 92/156 (58%), Gaps = 8/156 (5%)
 Frame = +1

Query: 127 AENLASIEPWMYRPSSVADSWL-ADYIARDAETLTKALQKSLSNDVDAAISPLFNLVRTD 303
           +E LAS++ W+YR     DSWL +D  + D + L +AL  +++      ++P      + 
Sbjct: 4   SEGLASVDSWLYRQGFNVDSWLLSDTFSHDNDLLARALHTTVT--APHTLTPSSAFFDSS 61

Query: 304 AAVSPALPAT--PTVSSLSGSDQDSQQPKRNR---VSGGRVSKRKSRASKRSQTTFITAD 468
           A   P+   T   TVS  S  +      KR R   ++ G+ +KR++RASK+SQTTFITAD
Sbjct: 62  AVSHPSSTNTLSSTVSGASDPEIIGGGAKRKRNCLLTDGKAAKRRARASKKSQTTFITAD 121

Query: 469 PANFRQMVQQITGARF--TGSLPAPMAPVVRPEPLK 570
           P+NFRQMVQQ+TGA++    S      P+V+PEPL+
Sbjct: 122 PSNFRQMVQQVTGAKYIDDSSSFGIFDPIVKPEPLR 157

>dbj|BAA89548.1| unnamed protein product [Oryza sativa (japonica cultivar-group)]
          Length = 362

 Score = 60.5 bits (145), Expect = 2e-08
 Identities = 38/104 (36%), Positives = 59/104 (56%), Gaps = 3/104 (2%)
 Frame = +1

Query: 223 LTKALQKSLSNDVDAAISPLFNLVRTDAAVSPALPATPTVSSLSGSDQDSQQPKRNRVS- 399
           L+ A   +++  + A+++P        +A SP    + T +S S +  +       R + 
Sbjct: 155 LSPAADDAITAALWASMAPSSASSYCGSAASPTPSTSTTTTSSSAASAEILAGGGARAAT 214

Query: 400 --GGRVSKRKSRASKRSQTTFITADPANFRQMVQQITGARFTGS 525
              GRVSKRK R S+R+ TT+ITADPA+FR+MVQ+ITG    G+
Sbjct: 215 RPSGRVSKRKPRPSRRAHTTYITADPADFRRMVQEITGFPVPGA 258

>dbj|BAC10346.1| contains EST AU183230(E51202)~unknown protein [Oryza sativa
           (japonica cultivar-group)]
          Length = 218

 Score = 59.7 bits (143), Expect = 3e-08
 Identities = 41/101 (40%), Positives = 50/101 (48%)
 Frame = +1

Query: 289 LVRTDAAVSPALPATPTVSSLSGSDQDSQQPKRNRVSGGRVSKRKSRASKRSQTTFITAD 468
           LV   A  SPA PA           +  QQ      +GGR  KR+SRASKR+ TT+I+ D
Sbjct: 80  LVADSARPSPAGPAR----------RHQQQQLGLGPAGGRAGKRRSRASKRAPTTYISTD 129

Query: 469 PANFRQMVQQITGARFTGSLPAPMAPVVRPEPLKAAVGAGG 591
           PANFR MVQ +TG              V+ +P   A GA G
Sbjct: 130 PANFRLMVQHVTG--------------VQADPASLADGAAG 156

>ref|NP_492875.2| pre-mRNA splicing SR protein related RSR-1 (68.2 kD) (rsr-1)
           [Caenorhabditis elegans] gi|19571645|emb|CAB04214.3| C.
           elegans RSR-1 protein (corresponding sequence F28D9.1)
           [Caenorhabditis elegans]
          Length = 601

 Score = 55.8 bits (133), Expect = 4e-07
 Identities = 49/142 (34%), Positives = 66/142 (45%), Gaps = 2/142 (1%)
 Frame = -2

Query: 586 RHRPPPSVAPVEPPAPL-APATSR*TSRR*SAAPSAGNSPGPP**KSSATASTRVTSAWR 410
           R R  PS +   PPAP  A + S+    R   +PSA  SP  P  + S + S        
Sbjct: 346 RRRRSPSASKSPPPAPKRAKSRSKSPPARRRRSPSASKSPPAPRRRRSPSKSRSPAPKRE 405

Query: 409 LS-RRRRGCASAAGSPGRSRRGWRRSESPGEQGKQRRRS*QG*KEEKSPRQRRWRVISEA 233
           +   RRR   SA+ SP   +R   RS+SP    ++R  S     + KSP  RR R  S++
Sbjct: 406 IPPARRRRSPSASKSPPAPKRAKSRSKSPPAPRRRRSPS-----QSKSPAPRRRRSPSKS 460

Query: 232 P**EFRRHVLCNRRAMSPRRRR 167
           P    RR      ++ SPRRRR
Sbjct: 461 PQAPRRRRSPSGSKSRSPRRRR 482

 Score = 40.0 bits (92), Expect = 0.023
 Identities = 38/115 (33%), Positives = 49/115 (42%), Gaps = 3/115 (2%)
 Frame = +2

Query: 170 PPSRTHGSPIT*HVTPKLSLRRFRNHSPTTLTRRFLLFSTLSGPTPLFPLLSRRLRPS-- 343
           PP+R   SP      P     + R+ SP    RR     + S      P   RR  PS  
Sbjct: 407 PPARRRRSPSASKSPPAPKRAKSRSKSPPAPRRRRSPSQSKS------PAPRRRRSPSKS 460

Query: 344 PASPAPTRTPSSRSATASPAGESPSGSHARR-SGRRRLSSRRTRRISGRWCSRSP 505
           P +P   R+PS   + +     SP+ +  RR S +RR S RR R  S    SRSP
Sbjct: 461 PQAPRRRRSPSGSKSRSPRRRRSPAAAPRRRQSPQRRRSPRRRRSPSSSSRSRSP 515

 Score = 35.4 bits (80), Expect = 0.56
 Identities = 31/80 (38%), Positives = 36/80 (44%), Gaps = 16/80 (20%)
 Frame = +2

Query: 356 APTRTPSSRSATASPAGESPSGSHARRSG-----RRRLSSRR-----------TRRISGR 487
           +P +TP  R+A  SP G+   GS A R G     RRR   RR           TRR   R
Sbjct: 180 SPRKTPPRRNA--SPGGDGGGGSPAARRGGSAGNRRRSPPRRGSPRRGSPRRDTRRSPPR 237

Query: 488 WCSRSPARGLPARCRRQWRR 547
                PARG   R RR+ RR
Sbjct: 238 RRGSPPARGGDRRDRREDRR 257

 Score = 35.0 bits (79), Expect = 0.73
 Identities = 49/150 (32%), Positives = 59/150 (38%), Gaps = 13/150 (8%)
 Frame = +2

Query: 170 PPSRTHGSPIT*HVTPKLSLRRFRNHSPTTLTRRFLLFSTLSGPTPLFPLLSRRLRPSPA 349
           PP+R   SP      P    RR  + S +   +R +            P   RR  PS +
Sbjct: 371 PPARRRRSPSASKSPPAPRRRRSPSKSRSPAPKREI------------PPARRRRSPSAS 418

Query: 350 S--PAPTRTPS-SRSATASPAGESPSGSHARRSGRRRLSSR-----RTRRISGRWCSRSP 505
              PAP R  S S+S  A     SPS S +    RRR  S+     R RR      SRSP
Sbjct: 419 KSPPAPKRAKSRSKSPPAPRRRRSPSQSKSPAPRRRRSPSKSPQAPRRRRSPSGSKSRSP 478

Query: 506 AR-----GLPARCRRQWRRWFDRSH*RRRS 580
            R       P R +   RR   RS  RRRS
Sbjct: 479 RRRRSPAAAPRRRQSPQRR---RSPRRRRS 505

 Score = 33.1 bits (74), Expect = 2.8
 Identities = 34/124 (27%), Positives = 47/124 (37%)
 Frame = +2

Query: 167 PPPSRTHGSPIT*HVTPKLSLRRFRNHSPTTLTRRFLLFSTLSGPTPLFPLLSRRLRPSP 346
           PPP+R   SP     +P     + R+ SP    RR    S    P P             
Sbjct: 314 PPPARRRRSPSQ-SKSPAPKRAKSRSKSPPAPARRRRSPSASKSPPP------------- 359

Query: 347 ASPAPTRTPSSRSATASPAGESPSGSHARRSGRRRLSSRRTRRISGRWCSRSPARGLPAR 526
              AP R  S   +  +    SPS S +  + RRR S  ++R       S +P R +P  
Sbjct: 360 ---APKRAKSRSKSPPARRRRSPSASKSPPAPRRRRSPSKSR-------SPAPKREIPPA 409

Query: 527 CRRQ 538
            RR+
Sbjct: 410 RRRR 413

 Score = 32.0 bits (71), Expect = 6.2
 Identities = 30/106 (28%), Positives = 45/106 (42%), Gaps = 10/106 (9%)
 Frame = -2

Query: 454 KSSATASTRVTSAWRLSRRRRGCASAAGSPGRSRRGWRRSESPGEQ----------GKQR 305
           K+ A+ + R  S     RRR   AS +  P R RR   +S+SP  +             R
Sbjct: 288 KAIASVAARAKSG-SPRRRRSPSASKSPPPARRRRSPSQSKSPAPKRAKSRSKSPPAPAR 346

Query: 304 RRS*QG*KEEKSPRQRRWRVISEAP**EFRRHVLCNRRAMSPRRRR 167
           RR      +   P  +R +  S++P    RR    ++   +PRRRR
Sbjct: 347 RRRSPSASKSPPPAPKRAKSRSKSPPARRRRSPSASKSPPAPRRRR 392

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.316    0.127    0.365 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 568,793,798
Number of Sequences: 1393205
Number of extensions: 13971231
Number of successful extensions: 119652
Number of sequences better than 10.0: 619
Number of HSP's better than 10.0 without gapping: 79444
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 114180
length of database: 448,689,247
effective HSP length: 117
effective length of database: 285,684,262
effective search space used: 23140425222
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
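
The footer values above are enough to recompute the effective search space and, approximately, the bit scores and E-values of the hits. The following Python sketch uses the usual Karlin-Altschul relations, S' = (Lambda*S - ln K) / ln 2 and E = (search space) * 2^(-S'). It assumes that the effective query length is the translated query length (596 // 3 = 198 residues) minus the effective HSP length; this reproduces the reported search space exactly, but it is an inference from the numbers, not a statement about the BLAST source code. Because Lambda and K are printed with only three significant digits, the results agree with the report only to within rounding. The function names are illustrative.

```python
import math

# Values copied from the report header and footer.
query_nt = 596
db_letters = 448_689_247
db_sequences = 1_393_205
hsp_len = 117                 # "effective HSP length"
lam, k = 0.267, 0.0410        # gapped Lambda and K

# Assumed length corrections (see the lead-in).
eff_query = query_nt // 3 - hsp_len           # 198 - 117 = 81
eff_db = db_letters - db_sequences * hsp_len  # 285,684,262
search_space = eff_query * eff_db             # 23,140,425,222

def bit_score(raw):
    """Convert a raw alignment score to bits."""
    return (lam * raw - math.log(k)) / math.log(2)

def expect(raw):
    """Expected number of chance hits with at least this raw score."""
    return search_space * 2.0 ** (-bit_score(raw))

# Raw scores 290 and 247 belong to the two Arabidopsis hits.
for raw in (290, 247):
    print(raw, round(bit_score(raw), 1), f"{expect(raw):.0e}")
```

For the two Arabidopsis hits this gives roughly 116 and 99.8 bits, with E-values of about 2e-25 and 2e-20, in line with the reported figures.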


EST assemble image


No.  Clone        Accession  Start  End
1    MWM126c02_f  AV766745       1  596
2    GNf033c07    BP069756       6  441
3    GNf067g07    BP072363       7  438
4    GNf051e09    BP071161      51  525
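
The clone positions listed above can be used to estimate how many ESTs support each part of the contig. The sketch below assumes the two numbers per clone are 1-based, inclusive start and end coordinates on the 596-nt contig; the record does not state this explicitly. The names `clones` and `depth_at` are illustrative.

```python
# (clone, accession, start, end) as listed in the record; coordinates are
# assumed to be 1-based and inclusive.
clones = [
    ("MWM126c02_f", "AV766745", 1, 596),
    ("GNf033c07", "BP069756", 6, 441),
    ("GNf067g07", "BP072363", 7, 438),
    ("GNf051e09", "BP071161", 51, 525),
]

def depth_at(pos):
    """Number of clones whose interval covers contig position pos."""
    return sum(start <= pos <= end for _, _, start, end in clones)

contig_len = max(end for _, _, _, end in clones)
depths = [depth_at(p) for p in range(1, contig_len + 1)]

print(contig_len)               # 596
print(min(depths), max(depths)) # 1 4
print(depth_at(127))            # 4
```

Under that assumption, the start of the best BLASTX alignment (nucleotide 127) is covered by all four clones, while the two ends of the contig are supported by a single clone.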




Lotus japonicus
Kazusa DNA Research Institute