KMC003639A_c01

Fasta Sequence
>KMC003639A_C01 KMC003639A_c01
gctttccaaatCCAAAAAACTAAAAGTGCTTAATATTCCACAGTAGAATGTAAGAGCATC
TTAGTACATGGTATAATCATCACTCTTAAAGCTACTTTAGCACACTCCATGATGCTTAGC
TCATTGGATTACCCTCCCGGGCCCCGATTCCCTCTCTCTCTCTCTCTCTCTCTCTCTCTC
TCTTGTGACATAGAATGGCCCCCATAGTTCTTTCTTTCTCTTCCTCTTACAGCAGCAGTA
AAGCCAGGGTGGACACAGCAATATGAAATCCAAATGACCCAATCATCACCTGTCCTTGAT
GGCCAGAGGTTGGGAGGGGGCGTATGGTAGCAGAATCAACTTCACCAGTAGGCAGAGGAA
CTGCAGGGTTGCTGTTAATGAAGGGCATGCTAGTGCTAGTGCTACCCCCTTTACCTGGTA
CCAGTGGAACTGTCTCCGCCGGTGAACTTGCTGCTGGTGCTGGACCAGGTGAAGCTGCTG
CTTGGTGAGGACGGAAAGGCCATGGAAATGAAGGAGAAGATGGCACATCTCCACCTGAAG
AAGGAGCTGGAGTAGTTACCACTGGAGTAGGTTCTGGAGTCACTTCTGAACCTTGTGGTG
GTGCTGCTGCTGCTGAAGAAACCTTTATAGCTAGCTTCTGAGAATTTTGGCATGTTTTGA
GCGAGCCATTGTTGAAGGTGAAGTAGAAGAAACCAGTGCGAGATGGATGCCACGTGTAGG
AGGTAGTGGTGGGATTGGTGAGAAGAGTGGCTTGGGTGAGATTGCAAATATTGAAGGCAT
TCTGGTTCTTGAAAATGTAGAGGCTGTGATTTTGCTTGTGCTTGAAAATGATAGAATCAC
CAATGTGAACGGTTGGGTTCTTCCACTCAGAAAATCCATCTACTACAATTGTGGTGgaat
gtgagagcttgaacagtgaaagtagaagaaagaagaagaagatggggaaggacattatga
taatgtgctttgtttt


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC003639A_C01 KMC003639A_c01
         (976 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_564135.1| expressed protein; protein id: At1g21090.1, sup...   233  3e-60
gb|AAM61317.1| unknown [Arabidopsis thaliana]                         230  2e-59
pir||A86344 protein T22I11.8 [imported] - Arabidopsis thaliana g...   223  3e-57
pir||T04605 hypothetical protein F20O9.30 - Arabidopsis thaliana       58  2e-07
ref|NP_567806.1| copper-binding protein-like; protein id: At4g28...    58  2e-07
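
The Score and E Value columns above are tied together by the size of the search space. The following is a minimal sketch, assuming the standard Karlin-Altschul relation E = N * 2^(-S), where S is the bit score and N is the "effective search space used" value printed in the statistics of this report (55742331432). The function name is illustrative and not part of BLAST, and because the printed bit scores are rounded, the result only approximates the printed E values.

```python
# Assumption: E = N * 2**(-S), with S the bit score and N the effective
# search space copied from the statistics block of this report.
SEARCH_SPACE = 55_742_331_432


def expect_from_bits(bit_score, search_space=SEARCH_SPACE):
    """Approximate E value for a hit with the given bit score."""
    return search_space * 2.0 ** (-bit_score)


if __name__ == "__main__":
    # Bit scores copied from the summary table (first and fourth hits).
    for name, bits in [("NP_564135.1", 233), ("T04605", 58.2)]:
        print(f"{name}: {bits} bits -> E ~ {expect_from_bits(bits):.0e}")
```

For the two 58.2-bit hits this gives about 2e-07, in line with the table. For the 233-bit hit it gives a value of the same order of magnitude as the reported 3e-60; the remaining difference is consistent with rounding of the printed bit score.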

>ref|NP_564135.1| expressed protein; protein id: At1g21090.1, supported by cDNA:
           116544. [Arabidopsis thaliana]
          Length = 242

 Score =  233 bits (595), Expect = 3e-60
 Identities = 133/252 (52%), Positives = 169/252 (66%), Gaps = 6/252 (2%)
 Frame = -1

Query: 970 KHIIIMSFPIFFFFLLLSLF-KLSHSTTIVVDGFSEWKNPTVHIGDSIIFKHKQNHSLYI 794
           KH+  M F  F+FF  LSLF + S S T +VDG S WK+PTVH GDS+IF+HK  + LYI
Sbjct: 6   KHLTSMLF--FYFFCFLSLFSRPSLSATFLVDGVSVWKSPTVHTGDSVIFRHKYGYDLYI 63

Query: 793 FKNQNAFNICNLTQATLLTNPTTTSYTWHPSRTGFFYFTF-NNGSL-KTCQNSQKLAIKV 620
           F+N++AFN+CN TQATLLT P +TS+TW+PSRTG +YF+F NN SL KTCQ +QKL ++V
Sbjct: 64  FRNKDAFNVCNFTQATLLTKPNSTSFTWYPSRTGSYYFSFTNNTSLPKTCQLNQKLTVQV 123

Query: 619 SSAAAAPPQGSEVTPEPTPVVTTPAPSSGGDVPSSP-SFPWPFRPHQAAA-SPGPAPAAS 446
             AAA+PP          P  T P P S G V SSP S+PWP  P + +A SPGP+P+  
Sbjct: 124 ILAAASPPS--------QPPATAPVPVSEGGVISSPSSYPWPLGPREGSAFSPGPSPSEI 175

Query: 445 SPAETVPLVPGKGGSTSTSMPFINSNPAVPLPTGEVDSATIRPLPTSGHQG-QVMIGSFG 269
               T   VPGK G     +PFINSNPAVPLPTG+VDS +I PLPTS +   QVM+ +  
Sbjct: 176 ----TSVTVPGKDG-----VPFINSNPAVPLPTGDVDSTSINPLPTSTNSAHQVMMMTLT 226

Query: 268 FHIAVSTLALLL 233
             + +  +A+ L
Sbjct: 227 VKLGLCCVAMFL 238
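
The descending query coordinates (970 down to 233) and "Frame = -1" in the alignment above both describe a translation of the reverse strand. The sketch below shows one way to recover the frame from the printed coordinates, assuming the usual BLASTX convention that frames -1, -2 and -3 are counted from the last base of the query. The helper names are hypothetical; the query length (976) comes from the report header.

```python
QUERY_LENGTH = 976  # "(976 letters)" in the report header


def blastx_frame(start, end, query_len=QUERY_LENGTH):
    """Frame implied by 1-based query coordinates as printed by BLASTX.

    Assumes plus-strand frames are counted from the first base and
    minus-strand frames from the last base of the query.
    """
    if start <= end:
        return (start - 1) % 3 + 1
    return -((query_len - start) % 3 + 1)


def codons_spanned(start, end):
    """Number of whole codons between two query coordinates, inclusive."""
    return (abs(start - end) + 1) // 3


if __name__ == "__main__":
    # Coordinates of the first alignment: Query 970 ... 233.
    print(blastx_frame(970, 233), codons_spanned(970, 233))
```

Under these assumptions the first alignment spans 738 nucleotides, that is 246 codons, in frame -1, which agrees with the frame printed for this hit.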

>gb|AAM61317.1| unknown [Arabidopsis thaliana]
          Length = 232

 Score =  230 bits (587), Expect = 2e-59
 Identities = 129/242 (53%), Positives = 163/242 (67%), Gaps = 6/242 (2%)
 Frame = -1

Query: 940 FFFFLLLSLFKL-SHSTTIVVDGFSEWKNPTVHIGDSIIFKHKQNHSLYIFKNQNAFNIC 764
           F+FF  LSLF   S S T +VDG S WK+PTVH GDS+IF+HK  + LYIF+N++AFN+C
Sbjct: 4   FYFFCFLSLFSCPSLSATFLVDGVSVWKSPTVHTGDSVIFRHKYGYDLYIFRNKDAFNVC 63

Query: 763 NLTQATLLTNPTTTSYTWHPSRTGFFYFTF-NNGSL-KTCQNSQKLAIKVSSAAAAPPQG 590
           N TQATLLT P +TS+TW+PSRTG +YF+F NN SL KTCQ +QKL ++V  AAA+PP  
Sbjct: 64  NFTQATLLTKPNSTSFTWYPSRTGSYYFSFTNNTSLPKTCQLNQKLTVQVILAAASPPS- 122

Query: 589 SEVTPEPTPVVTTPAPSSGGDVPSSP-SFPWPFRPHQAAA-SPGPAPAASSPAETVPLVP 416
                   P  T P P S G V SSP S+PWP  P + +A SPGP+P+      T   VP
Sbjct: 123 -------QPPATAPVPVSEGGVISSPSSYPWPLGPREGSAFSPGPSPSEI----TSVTVP 171

Query: 415 GKGGSTSTSMPFINSNPAVPLPTGEVDSATIRPLPTSGHQG-QVMIGSFGFHIAVSTLAL 239
           GK G     +PFINSNPAVPLPTG+VDS +I PLPTS +   QVM+ +    + +  +A+
Sbjct: 172 GKDG-----VPFINSNPAVPLPTGDVDSTSINPLPTSTNSAHQVMMMTLTVKLGLCCVAM 226

Query: 238 LL 233
            L
Sbjct: 227 FL 228

>pir||A86344 protein T22I11.8 [imported] - Arabidopsis thaliana
           gi|8886992|gb|AAF80652.1|AC012190_8 T22I11.8
           [Arabidopsis thaliana]
          Length = 233

 Score =  223 bits (568), Expect = 3e-57
 Identities = 130/237 (54%), Positives = 159/237 (66%), Gaps = 8/237 (3%)
 Frame = -1

Query: 970 KHIIIMSFPIFFFFLLLSLF-KLSHSTTIVVDGFSEWKNPTVHIGDSIIFKHKQNHSLYI 794
           KH+  M F  F+FF  LSLF + S S T +VDG S WK+PTVH GDS+  KHK  + LYI
Sbjct: 6   KHLTSMLF--FYFFCFLSLFSRPSLSATFLVDGVSVWKSPTVHTGDSVS-KHKYGYDLYI 62

Query: 793 FKNQNAFNICNLTQATLLTNPTTTSYTWHPSRTGFFYFTF-NNGSL-KTCQNSQKLAIKV 620
           F+N++AFN+CN TQATLLT P +TS+TW+PSRTG +YF+F NN SL KTCQ +QKL ++V
Sbjct: 63  FRNKDAFNVCNFTQATLLTKPNSTSFTWYPSRTGSYYFSFTNNTSLPKTCQLNQKLTVQV 122

Query: 619 SSAAAAPPQGSEVTPEPTPVVTTPAPSSGGDVPSSP-SFPWPFRPHQAAA-SPGPAPAAS 446
             AAA+PP          P  T P P S G V SSP S+PWP  P + +A SPGP+P+  
Sbjct: 123 ILAAASPPS--------QPPATAPVPVSEGGVISSPSSYPWPLGPREGSAFSPGPSPSEI 174

Query: 445 SPAETVPLVPGKGGSTSTSMPFINSNPAVPLPTGEVDSATIRPLPT---SGHQGQVM 284
               T   VPGK G     +PFINSNPAVPLPTG+VDS +I PLPT   S HQ  +M
Sbjct: 175 ----TSVTVPGKDG-----VPFINSNPAVPLPTGDVDSTSINPLPTSTNSAHQVMMM 222

>pir||T04605 hypothetical protein F20O9.30 - Arabidopsis thaliana
          Length = 508

 Score = 58.2 bits (139), Expect = 2e-07
 Identities = 49/190 (25%), Positives = 86/190 (44%), Gaps = 17/190 (8%)
 Frame = -1

Query: 961 IIMSFPIFFFFLLLSL--FKLSHSTTIVVDG-----------FSEWKNPT-VHIGDSIIF 824
           ++M F ++  F++L    F +S+     V G           +S W +     + D++ F
Sbjct: 3   LVMRFDLYLMFVMLMGLGFTISNGYKFYVGGKDGWVPTPSEDYSHWSHRNRFQVNDTLHF 62

Query: 823 KHKQNHSLYIFKNQNAFNICNLTQATLLTNPTTTSYTWHPSRTGFFYFTFNNGSLKTCQN 644
           K+ +     +   +  +N CN T    LT+ +     +  S +G ++F   +G+ + C  
Sbjct: 63  KYAKGKDSVLEVTEQEYNTCNTTHP--LTSLSDGDSLFLLSHSGSYFFI--SGNSQNCLK 118

Query: 643 SQKLAIKVSSAAAAPPQGSEVTPEPTPV---VTTPAPSSGGDVPSSPSFPWPFRPHQAAA 473
            QKLA+KV S           +P P+PV   +++P PS G + PSS S       +    
Sbjct: 119 GQKLAVKVLSTVHHSHSPRHTSPSPSPVHQELSSPGPSPGVE-PSSDS-------NSRVP 170

Query: 472 SPGPAPAASS 443
           +PGPA A +S
Sbjct: 171 APGPATAPNS 180

>ref|NP_567806.1| copper-binding protein-like; protein id: At4g28365.1, supported by
           cDNA: gi_15810214 [Arabidopsis thaliana]
          Length = 199

 Score = 58.2 bits (139), Expect = 2e-07
 Identities = 49/190 (25%), Positives = 86/190 (44%), Gaps = 17/190 (8%)
 Frame = -1

Query: 961 IIMSFPIFFFFLLLSL--FKLSHSTTIVVDG-----------FSEWKNPT-VHIGDSIIF 824
           ++M F ++  F++L    F +S+     V G           +S W +     + D++ F
Sbjct: 3   LVMRFDLYLMFVMLMGLGFTISNGYKFYVGGKDGWVPTPSEDYSHWSHRNRFQVNDTLHF 62

Query: 823 KHKQNHSLYIFKNQNAFNICNLTQATLLTNPTTTSYTWHPSRTGFFYFTFNNGSLKTCQN 644
           K+ +     +   +  +N CN T    LT+ +     +  S +G ++F   +G+ + C  
Sbjct: 63  KYAKGKDSVLEVTEQEYNTCNTTHP--LTSLSDGDSLFLLSHSGSYFFI--SGNSQNCLK 118

Query: 643 SQKLAIKVSSAAAAPPQGSEVTPEPTPV---VTTPAPSSGGDVPSSPSFPWPFRPHQAAA 473
            QKLA+KV S           +P P+PV   +++P PS G + PSS S       +    
Sbjct: 119 GQKLAVKVLSTVHHSHSPRHTSPSPSPVHQELSSPGPSPGVE-PSSDS-------NSRVP 170

Query: 472 SPGPAPAASS 443
           +PGPA A +S
Sbjct: 171 APGPATAPNS 180

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 945,762,059
Number of Sequences: 1393205
Number of extensions: 24505244
Number of successful extensions: 197546
Number of sequences better than 10.0: 2851
Number of HSP's better than 10.0 without gapping: 113104
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 168093
length of database: 448,689,247
effective HSP length: 123
effective length of database: 277,325,032
effective search space used: 55742331432
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
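
Several of the statistics above can be cross-checked against each other. The sketch below rests on two assumptions that are not stated in the report itself: that the effective database length is the total letters minus one effective HSP length per sequence, and that bit scores follow S' = (lambda * S - ln K) / ln 2. The effective query length is inferred by division and is not printed anywhere in the report. All names are illustrative.

```python
import math

# Values copied from the statistics block of this report.
DB_LETTERS = 448_689_247
DB_SEQUENCES = 1_393_205
EFF_HSP_LEN = 123
SEARCH_SPACE = 55_742_331_432


def effective_db_length(letters, sequences, hsp_len):
    """Assumed rule: remove one effective HSP length per database sequence."""
    return letters - sequences * hsp_len


def bit_score(raw, lam, k):
    """Convert a raw score to bits with the given lambda and K."""
    return (lam * raw - math.log(k)) / math.log(2)


if __name__ == "__main__":
    eff_db = effective_db_length(DB_LETTERS, DB_SEQUENCES, EFF_HSP_LEN)
    print("effective database length:", eff_db)
    # Inferred, not printed in the report: search space / database length.
    print("implied effective query length:", SEARCH_SPACE // eff_db)
    # Gapped parameters (0.267, 0.0410) applied to raw score 139.
    print("raw 139 ->", round(bit_score(139, 0.267, 0.0410), 1), "bits")
    # Ungapped parameters (0.318, 0.135) applied to S1 = 41.
    print("S1 41 ->", round(bit_score(41, 0.318, 0.135), 1), "bits")
```

With these inputs the effective database length comes out at 277,325,032, matching the reported value, and the search space divides evenly into an effective query length of 201 residues. The gapped parameters reproduce the 58.2 bits shown for raw score 139, and the ungapped parameters reproduce the 21.7 bits shown for S1.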


EST assemble image


no.  clone        accession  start  end
1    GNf057a06    BP075956       1  407
2    MPD093f08_f  AV776118      12  357
3    MPD065e01_f  AV774333      27  478
4    MFB062c08_f  BP038494     417  978
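
The clone positions listed above can stand in for the assembly image to some degree. The following is a small sketch, assuming the two numbers after each accession are 1-based inclusive start and end coordinates on the contig. It merges the clone spans into covered regions and reports the deepest overlap. Function names are illustrative.

```python
# Clone spans copied from the listing above: (clone, start, end).
CLONES = [
    ("GNf057a06", 1, 407),
    ("MPD093f08_f", 12, 357),
    ("MPD065e01_f", 27, 478),
    ("MFB062c08_f", 417, 978),
]


def merged_spans(clones):
    """Merge overlapping or adjacent spans into contiguous covered regions."""
    merged = []
    for _, start, end in sorted(clones, key=lambda c: c[1]):
        if merged and start <= merged[-1][1] + 1:
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(span) for span in merged]


def max_depth(clones):
    """Largest number of clones covering any single position."""
    events = []
    for _, start, end in clones:
        events.append((start, 1))     # coverage begins at start
        events.append((end + 1, -1))  # and stops after the inclusive end
    depth = best = 0
    for _, delta in sorted(events):
        depth += delta
        best = max(best, depth)
    return best


if __name__ == "__main__":
    print("covered regions:", merged_spans(CLONES))
    print("maximum depth:", max_depth(CLONES))
```

Under that reading of the coordinates, the four clones form a single region from 1 to 978 with no gap, and at most three clones overlap at any position.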




Lotus japonicus
Kazusa DNA Research Institute