KMC000784A_c02

Fasta Sequence
>KMC000784A_C02 KMC000784A_c02
GATAAAAAAAATGAATGACTAATTCTCAGATAAATATCATTACACCGGTACGAGCACTGT
GCATTTTTCTATTTAGATCACCCTCAGTGTACATTCGCACAAATTACGAGGCATAAAAGC
CAACAGAACAAGAGATGGACTTTTTCCAACGAATAAATGTAATTACTCTAGCAAACACAT
TGTGTATAATTTTTTCCCTCCAGTTGTTTGCATTCAAAAATGTTGATTGTATCCCTTCAC
CCCTAGAGCCATTAAGTGAAAAAATGAAGCCTCTCTCTGGCCCTAGAAGAAGCCTTACTT
CAATAGTTCAACATATTGCATCTTCAAATTTAACAAAAACTTGGATAAAACTCAGATTCC
TTCTTGAAAGCTGCTGGATCAGTGATGACCTGCTCAGATTTGGAAAGTGCTTCAGCATCT
TTCAAGTCTGTGTTGCAACCCCATACACGAACAAGTAGCCTCCTACATTTCGGAGATGAT
GGTTTCAAATACGTCTTGTACCACTCTACAACATCATTCTTGCTTATGTTCCTCAATTCT
TCTGCTTCCTTCTCTGAAACGTCAAAAATGTACCTTTTATCAACAATCTGATTCCACAAT
CTGTGGCTTTCATATGTGAGTGAAGGATCTTTCTCCAGCAGCT
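
The BLASTX alignments in this record report Frame = -3 with query coordinates that start at 641, so the aligned peptide is read from the reverse complement of the sequence above. The sketch below is an illustration only and is not part of the Kazusa pipeline. It assumes the standard genetic code and the usual convention that frame -3 begins two bases in from the 3' end (643 - 2 = 641). To stay short it uses only the last 43 letters of the contig; the helper names are hypothetical.

```python
from itertools import product

# Standard genetic code, built in TCAG order (assumed; not stated in the record).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def reverse_complement(seq):
    """Return the reverse complement of an uppercase A/C/G/T string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate_minus_frame(seq, frame):
    """Translate reverse-strand frame 1, 2 or 3 (BLAST frames -1, -2, -3)."""
    rc = reverse_complement(seq)
    start = frame - 1  # frame -3 skips the first two bases of the reverse strand
    codons = (rc[i:i + 3] for i in range(start, len(rc) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# Final line of the FASTA record above (positions 601-643 of the contig).
TAIL = "CTGTGGCTTTCATATGTGAGTGAAGGATCTTTCTCCAGCAGCT"

peptide = translate_minus_frame(TAIL, 3)
print(peptide)  # LLEKDPSLTYESH
```

The printed residues match the start of the query line in the first alignment (LLEKDPSLTYESH...).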


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC000784A_C02 KMC000784A_c02
         (643 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

pir||G86203 probable N-arginine dibasic convertase [imported] - ...   147  2e-34
ref|NP_172173.1| unknown protein; protein id: At1g06900.1 [Arabi...   147  2e-34
ref|NP_195764.1| putative protein; protein id: At5g01440.1 [Arab...    70  2e-11
gb|EAA26930.1| hypothetical protein [Neurospora crassa]                53  4e-06
emb|CAA63696.1| NRD2 convertase [Rattus sp.]                           49  7e-05

>pir||G86203 probable N-arginine dibasic convertase [imported] - Arabidopsis
            thaliana gi|7523693|gb|AAF63132.1|AC011001_2 Putative
            N-arginine dibasic convertase [Arabidopsis thaliana]
          Length = 1039

 Score =  147 bits (370), Expect = 2e-34
 Identities = 67/103 (65%), Positives = 81/103 (78%)
 Frame = -3

Query: 641  LLEKDPSLTYESHRLWNQIVDKRYIFDVSEKEAEELRNISKNDVVEWYKTYLKPSSPKCR 462
            LLEKDPSL  E++ LW+QIVDKRY+FD S KEAEELR+I K DV+ WYKTY + SSPKCR
Sbjct: 937  LLEKDPSLLSETNDLWSQIVDKRYMFDFSHKEAEELRSIQKKDVISWYKTYFRESSPKCR 996

Query: 461  RLLVRVWGCNTDLKDAEALSKSEQVITDPAAFKKESEFYPSFC 333
            RL VRVWGC+T++K+ +   K+ QVI D  AFK  S+FYPS C
Sbjct: 997  RLAVRVWGCDTNMKETQTDQKAVQVIADAVAFKSTSKFYPSLC 1039

>ref|NP_172173.1| unknown protein; protein id: At1g06900.1 [Arabidopsis thaliana]
          Length = 1023

 Score =  147 bits (370), Expect = 2e-34
 Identities = 67/103 (65%), Positives = 81/103 (78%)
 Frame = -3

Query: 641  LLEKDPSLTYESHRLWNQIVDKRYIFDVSEKEAEELRNISKNDVVEWYKTYLKPSSPKCR 462
            LLEKDPSL  E++ LW+QIVDKRY+FD S KEAEELR+I K DV+ WYKTY + SSPKCR
Sbjct: 921  LLEKDPSLLSETNDLWSQIVDKRYMFDFSHKEAEELRSIQKKDVISWYKTYFRESSPKCR 980

Query: 461  RLLVRVWGCNTDLKDAEALSKSEQVITDPAAFKKESEFYPSFC 333
            RL VRVWGC+T++K+ +   K+ QVI D  AFK  S+FYPS C
Sbjct: 981  RLAVRVWGCDTNMKETQTDQKAVQVIADAVAFKSTSKFYPSLC 1023

>ref|NP_195764.1| putative protein; protein id: At5g01440.1 [Arabidopsis thaliana]
           gi|11290050|pir||T48166 hypothetical protein T10O8.150 -
           Arabidopsis thaliana gi|7320722|emb|CAB81927.1| putative
           protein [Arabidopsis thaliana]
          Length = 307

 Score = 70.5 bits (171), Expect = 2e-11
 Identities = 29/56 (51%), Positives = 41/56 (72%)
 Frame = -3

Query: 596 WNQIVDKRYIFDVSEKEAEELRNISKNDVVEWYKTYLKPSSPKCRRLLVRVWGCNT 429
           W++IV +  IFD   +E +EL  I+KND++EWYK Y++ SSPKC   +V +WGCNT
Sbjct: 142 WSEIVRESCIFDFYSEEKKELSLITKNDLIEWYKRYVRLSSPKCCSFVVSIWGCNT 197

>gb|EAA26930.1| hypothetical protein [Neurospora crassa]
          Length = 1082

 Score = 52.8 bits (125), Expect = 4e-06
 Identities = 21/63 (33%), Positives = 43/63 (67%)
 Frame = -3

Query: 638  LEKDPSLTYESHRLWNQIVDKRYIFDVSEKEAEELRNISKNDVVEWYKTYLKPSSPKCRR 459
            LEK   L  E+++ W+QI  + Y F++S+++A  ++ ++K +++E++K Y+ PSSP   +
Sbjct: 885  LEKPKFLDQETNKQWSQIHSEYYDFEISQRDAAHVKPLTKEELIEFFKHYIHPSSPSRAK 944

Query: 458  LLV 450
            L +
Sbjct: 945  LAI 947

>emb|CAA63696.1| NRD2 convertase [Rattus sp.]
          Length = 1229

 Score = 48.5 bits (114), Expect = 7e-05
 Identities = 21/53 (39%), Positives = 33/53 (61%)
 Frame = -3

Query: 632  KDPSLTYESHRLWNQIVDKRYIFDVSEKEAEELRNISKNDVVEWYKTYLKPSS 474
            +D  L  E  R WN++V ++Y+FD    E E L++ SK+D+V W+K +  P S
Sbjct: 1107 EDTHLGEEVDRNWNEVVTQQYLFDRLAHEIEALKSFSKSDLVSWFKAHRGPGS 1159

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 521,517,735
Number of Sequences: 1393205
Number of extensions: 10796087
Number of successful extensions: 29751
Number of sequences better than 10.0: 52
Number of HSP's better than 10.0 without gapping: 28816
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 29741
length of database: 448,689,247
effective HSP length: 118
effective length of database: 284,291,057
effective search space used: 27007650415
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
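
The length figures in the statistics above can be cross-checked with simple arithmetic. The sketch below assumes the effective database length is the raw letter count minus one effective HSP length per database sequence. That is an assumption about how this BLAST version derived the value, tested only against the numbers printed in this report. The function name is hypothetical.

```python
# Values copied from the BLASTX statistics above.
DB_LETTERS = 448_689_247
DB_SEQUENCES = 1_393_205
EFFECTIVE_HSP_LENGTH = 118
REPORTED_EFFECTIVE_DB_LENGTH = 284_291_057
REPORTED_SEARCH_SPACE = 27_007_650_415


def effective_db_length(letters, sequences, hsp_length):
    """Assumed correction: subtract one HSP length per database sequence."""
    return letters - sequences * hsp_length


eff_db = effective_db_length(DB_LETTERS, DB_SEQUENCES, EFFECTIVE_HSP_LENGTH)

# If the search space is (effective query length) x (effective database length),
# the quotient is the effective query length implied by the report.
implied_query_length, remainder = divmod(REPORTED_SEARCH_SPACE, eff_db)

print(eff_db == REPORTED_EFFECTIVE_DB_LENGTH)
print(implied_query_length, remainder)
```

Under that assumption the reported effective database length is reproduced exactly, and the search space divides evenly into an implied effective query length of 95 residues. Adding back the 118-residue HSP length gives 213, close to the 643-letter query divided by three.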


EST assemble image


No.  clone         accession  start  end
1    MPDL007e12_f  AV776872       1  593
2    SPDL072c05_f  BP056446       1  364
3    SPDL020b05_f  BP053225      18  353
4    GNLf004h08    BP075047      57  336
5    SPDL044a08_f  BP054744     147  644
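
The clone table above can stand in for the missing assembly image. The sketch below parses the five rows and counts how many ESTs cover each contig position. It assumes the last two numbers on each row are 1-based, inclusive start and end coordinates on the contig, which the record does not state explicitly. The function names are hypothetical.

```python
# Rows copied from the clone table above: index, clone, accession, start, end.
ROWS = """\
1 MPDL007e12_f AV776872 1 593
2 SPDL072c05_f BP056446 1 364
3 SPDL020b05_f BP053225 18 353
4 GNLf004h08 BP075047 57 336
5 SPDL044a08_f BP054744 147 644
"""


def parse_rows(text):
    """Return (clone, accession, start, end) tuples, skipping malformed lines."""
    records = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) != 5:
            continue
        _, clone, accession, start, end = fields
        records.append((clone, accession, int(start), int(end)))
    return records


def depth_profile(records):
    """Map each covered position to the number of ESTs spanning it."""
    lo = min(start for _, _, start, _ in records)
    hi = max(end for _, _, _, end in records)
    depth = {pos: 0 for pos in range(lo, hi + 1)}
    for _, _, start, end in records:
        for pos in range(start, end + 1):  # coordinates assumed inclusive
            depth[pos] += 1
    return depth


records = parse_rows(ROWS)
depth = depth_profile(records)
print(min(depth), max(depth), max(depth.values()))
```

With these coordinates the ESTs span positions 1 to 644, all five overlap between positions 147 and 336, and only SPDL044a08_f covers the region past position 593.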




Lotus japonicus
Kazusa DNA Research Institute