KMC004746A_c02
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC004746A_C02 KMC004746A_c02
atggtattagattcaaaatAAAAACACAACAGAGGGGGGTGTAATAGAAATCACTTTCTC
ATACATATATTGAATAACACTTTATGTACAGAAAAAATGGATTCAGATCAAAGCAAATAG
CAGCAAATGTAGTAGATACATCACCATAAATCTAAATTAAATATTCAATAGATCATTCAT
TCCCCATTACAACATAGCCAGCACACCAAAAGTTGCAAGGGAGCAAGGTTGCCCCAGATT
CCATAGCTGAGGATCTTGTAAGAAGATGAACCATCAGGTGATGATGGAGAAGATGCTTTG
CTGTCAGCACTTTTTTCAATGGGAGAATCTGCAGCTGGTGCAATTTCAGGAGCTGGTGCA
GGAGCAGGTGTTGGAGGAATATCAGTGCCAAAAACGGCCTCAGGAAGGAGAACCTTACCA
ACCTCATATATTGCAACAGGATCTGTGGAATGAACAGCACTAGTAACCTTGGTCTTTGAC
CATCCTGAATTGATATGCACAGTTCCAGAATCATCAGTGAAATTCAAGGAGTAACTTCCA
CCAGCAAATGTGGGAGTGGAACCAGTTTGGCTAAGGTTTTTGAAGTCAGCAAGAGAGTAG
TATTTTGGCAAGGCATGGAAGAGGATCACTTGCTTTAACTGG


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC004746A_C02 KMC004746A_c02
         (642 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

pir||T09841 hypothetical protein - upland cotton gi|606942|gb|AA...   175  5e-43
ref|NP_565313.1| fasciclin-like arabinogalactan-protein (FLA7); ...   172  4e-42
gb|AAM61109.1| unknown [Arabidopsis thaliana]                         171  1e-41
pir||S52995 arabinogalactan-like protein - loblolly pine gi|6077...    86  5e-16
ref|NP_565475.1| fasciclin-like arabinogalactan-protein (FLA6); ...    83  3e-15

>pir||T09841 hypothetical protein - upland cotton gi|606942|gb|AAA79366.1|
           unknown
          Length = 263

 Score =  175 bits (443), Expect = 5e-43
 Identities = 90/151 (59%), Positives = 114/151 (74%), Gaps = 2/151 (1%)
 Frame = -2

Query: 641 QLKQVILFHALPKYYSLADFKNLSQTGSTPTFAGGSYSLNFTDDSGTVHINSGWSKTKVT 462
           Q K V+L+HALP+YY+LADF +LS+ G   T AGG Y+L F D+SGTV ++SGWSKTKVT
Sbjct: 110 QFKSVLLYHALPRYYALADFNDLSEKGPISTLAGGQYTLQFNDESGTVRLDSGWSKTKVT 169

Query: 461 SAVHSTDPVAIYEVGKVLLPEAVFGTDIPPTPAPAPAPEIAPAADSP-IEKSADSKASSP 285
           SAVH++ PVA+Y++ KVLLPEA+FGTDIPPTPAPAPA  I P+AD+P   KS ++ +SS 
Sbjct: 170 SAVHTSKPVAVYQIDKVLLPEAIFGTDIPPTPAPAPALGIGPSADTPSAAKSEETGSSSK 229

Query: 284 SSPDGSSSYKI-LSYGIWGNLAPLQLLVCWL 195
            S  GSSS +I ++ GIW  L  L  L  WL
Sbjct: 230 PSFSGSSSPRIMMNSGIWTQLV-LAFLGGWL 259

>ref|NP_565313.1| fasciclin-like arabinogalactan-protein (FLA7); protein id:
           At2g04780.1, supported by cDNA: 11114., supported by
           cDNA: gi_13377781, supported by cDNA: gi_20453157
           [Arabidopsis thaliana] gi|25350182|pir||D84461
           hypothetical protein At2g04780 [imported] - Arabidopsis
           thaliana gi|4544419|gb|AAD22328.1| expressed protein
           [Arabidopsis thaliana]
           gi|13377782|gb|AAK20860.1|AF333973_1 fasciclin-like
           arabinogalactan-protein 7 [Arabidopsis thaliana]
           gi|20453158|gb|AAM19820.1| At2g04780/F28I8.18
           [Arabidopsis thaliana] gi|24417404|gb|AAN60312.1|
           unknown [Arabidopsis thaliana]
           gi|24797004|gb|AAN64514.1| At2g04780/F28I8.18
           [Arabidopsis thaliana]
          Length = 254

 Score =  172 bits (435), Expect = 4e-42
 Identities = 89/150 (59%), Positives = 114/150 (75%), Gaps = 1/150 (0%)
 Frame = -2

Query: 641 QLKQVILFHALPKYYSLADFKNLSQTGSTPTFAGGSYSLNFTDDSGTVHINSGWSKTKVT 462
           QLKQ++LFHALP YYSL++FKNLSQ+G   TFAGG YSL FTD SGTV I+S W++TKV+
Sbjct: 109 QLKQLVLFHALPHYYSLSEFKNLSQSGPVSTFAGGQYSLKFTDVSGTVRIDSLWTRTKVS 168

Query: 461 SAVHSTDPVAIYEVGKVLLPEAVFGTDIPPTPAPAPAPEIAPAADSPIEKSADSK-ASSP 285
           S+V STDPVA+Y+V +VLLPEA+FGTD+PP PAPAPAP ++  +DSP    ADS+ ASSP
Sbjct: 169 SSVFSTDPVAVYQVNRVLLPEAIFGTDVPPMPAPAPAPIVSAPSDSP--SVADSEGASSP 226

Query: 284 SSPDGSSSYKILSYGIWGNLAPLQLLVCWL 195
            S   +S  K+L       LAP+ +++  L
Sbjct: 227 KSSHKNSGQKLL-------LAPISMVISGL 249

>gb|AAM61109.1| unknown [Arabidopsis thaliana]
          Length = 251

 Score =  171 bits (432), Expect = 1e-41
 Identities = 88/150 (58%), Positives = 114/150 (75%), Gaps = 1/150 (0%)
 Frame = -2

Query: 641 QLKQVILFHALPKYYSLADFKNLSQTGSTPTFAGGSYSLNFTDDSGTVHINSGWSKTKVT 462
           QLKQ++LFHALP YYSL++FKNLSQ+G   TFAGG YSL FTD SGTV I+S W++TKV+
Sbjct: 106 QLKQLVLFHALPHYYSLSEFKNLSQSGPVSTFAGGQYSLKFTDVSGTVRIDSLWTRTKVS 165

Query: 461 SAVHSTDPVAIYEVGKVLLPEAVFGTDIPPTPAPAPAPEIAPAADSPIEKSADSK-ASSP 285
           S+V STDPVA+Y++ +VLLPEA+FGTD+PP PAPAPAP ++  +DSP    ADS+ ASSP
Sbjct: 166 SSVFSTDPVAVYQLNRVLLPEAIFGTDVPPMPAPAPAPIVSAPSDSP--SVADSEGASSP 223

Query: 284 SSPDGSSSYKILSYGIWGNLAPLQLLVCWL 195
            S   +S  K+L       LAP+ +++  L
Sbjct: 224 KSSHKNSGQKLL-------LAPISMVISGL 246

>pir||S52995 arabinogalactan-like protein - loblolly pine
           gi|607774|gb|AAA74420.1| arabinogalactan-like protein
          Length = 264

 Score = 85.5 bits (210), Expect = 5e-16
 Identities = 47/133 (35%), Positives = 71/133 (53%), Gaps = 1/133 (0%)
 Frame = -2

Query: 629 VILFHALPKYYSLADFKNLSQTGSTPTFA-GGSYSLNFTDDSGTVHINSGWSKTKVTSAV 453
           ++ +HALP YY+ + F+ +S    T     GG + +N T    +V++++G   T V SAV
Sbjct: 120 LLQYHALPSYYTFSQFQTVSNPVRTMASGNGGPFGVNVTAFGNSVNVSTGLVNTPVNSAV 179

Query: 452 HSTDPVAIYEVGKVLLPEAVFGTDIPPTPAPAPAPEIAPAADSPIEKSADSKASSPSSPD 273
           +S  PVA+Y+V KVLLPE +FG      PA AP PE      SP    + +  S  S+  
Sbjct: 180 YSQSPVAVYQVDKVLLPEEIFGV---KPPAAAPTPEPGAPVSSPAVSPSGASGSGASTSS 236

Query: 272 GSSSYKILSYGIW 234
              S   L+ G++
Sbjct: 237 ACGSMAALAKGLY 249

>ref|NP_565475.1| fasciclin-like arabinogalactan-protein (FLA6); protein id:
           At2g20520.1, supported by cDNA: gi_13377779 [Arabidopsis
           thaliana] gi|13377780|gb|AAK20859.1|AF333972_1
           fasciclin-like arabinogalactan-protein 6 [Arabidopsis
           thaliana] gi|20198085|gb|AAD25652.2| putative surface
           protein [Arabidopsis thaliana]
          Length = 247

 Score = 83.2 bits (204), Expect = 3e-15
 Identities = 52/136 (38%), Positives = 71/136 (51%), Gaps = 4/136 (2%)
 Frame = -2

Query: 632 QVILFHALPKYYSLADFKNLSQTGSTPTFA--GGSYSLNFTDD--SGTVHINSGWSKTKV 465
           Q++L+H +PKYYSL+D    S    T      GG + LNFT    S  V++++G  +T++
Sbjct: 104 QLMLYHIIPKYYSLSDLLLASNPVRTQATGQDGGVFGLNFTGQAQSNQVNVSTGVVETRI 163

Query: 464 TSAVHSTDPVAIYEVGKVLLPEAVFGTDIPPTPAPAPAPEIAPAADSPIEKSADSKASSP 285
            +A+    P+A+Y V  VLLPE +FGT   PT APAP         S     ADS A+  
Sbjct: 164 NNALRQQFPLAVYVVDSVLLPEELFGTKTTPTGAPAP-------KSSTSSSDADSPAADD 216

Query: 284 SSPDGSSSYKILSYGI 237
                 SS K  S GI
Sbjct: 217 EHKSAGSSVKRTSLGI 232

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 564,251,193
Number of Sequences: 1393205
Number of extensions: 12859595
Number of successful extensions: 101874
Number of sequences better than 10.0: 542
Number of HSP's better than 10.0 without gapping: 60646
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 90096
length of database: 448,689,247
effective HSP length: 118
effective length of database: 284,291,057
effective search space used: 27007650415
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 MPD026f05_f AV771794 1 349
2 MR090f12_f BP082947 20 398
3 SPD001c09_f BP044072 41 583
4 MPDL073h03_f AV780276 42 526
5 SPD043b11_f BP047407 43 603
6 SPD089c01_f BP051092 44 567
7 SPD003a07_f BP044207 46 528
8 SPD023g08_f BP045845 46 605
9 SPD015d01_f BP045179 47 621
10 MPD049a01_f AV773286 89 625
11 MWM127f09_f AV766766 89 646
12 SPD092a04_f BP051313 112 648




Lotus japonicus
Kazusa DNA Research Institute