KMC002847A_c01

Fasta Sequence
>KMC002847A_C01 KMC002847A_c01
tgggtcgggccccccagaatgaaactatcactccacgacgcggcgtagaaatggaacctg
ggaagcaaggggtgatGGTGGCGAAGATATTACCACAGCAGCTTCTCAACCCAATAGAGC
AGCTTCAAACTCGCTTCAAGGAAGTCGAATCCGGCTTCAAACTCTGGCTCTCCAAGCAAT
CCATCGCCGTCGAAGCCGCCGTCGTCACCACCACCAGCGCCGCCCAAGGTGCCGCCATCG
GCGCCTGTCATGGGTACCCTCACTGGGGACGCTTCCTCCCCGTTTCCTACTCCGCCACCT
AATGCCTCTCTTAACCCTCAGGCTATGGCTTCTCTTAATCAAGCTCAGGCTCTTGCTGGA
GGCCCTTTAGTTCAAGCTCGTAATTTTGCTGTCATGACTGGTGTGAATGCTGGTATTACA
AGTGTATTGACAAGGATAAGGGGGAAGG
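
The record above is plain FASTA: one header line starting with `>`, followed by wrapped sequence lines. As a minimal sketch of reading such a record (the helper names `read_fasta` and `base_composition` are illustrative and not part of the original page, and only the first and last sequence lines are reproduced in the sample to keep it short), one might write:

```python
def read_fasta(text):
    """Parse FASTA text into a dict mapping each record ID to its sequence."""
    records = {}
    current_id = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # The ID is the first whitespace-delimited token after '>'.
            current_id = line[1:].split()[0]
            records[current_id] = []
        elif current_id is not None:
            records[current_id].append(line)
    return {rid: "".join(parts) for rid, parts in records.items()}


def base_composition(seq):
    """Count A, C, G and T, ignoring case and any other symbols."""
    seq = seq.upper()
    return {base: seq.count(base) for base in "ACGT"}


# Shortened sample: header plus the first and last sequence lines only.
sample = """>KMC002847A_C01 KMC002847A_c01
tgggtcgggccccccagaatgaaactatcactccacgacgcggcgtagaaatggaacctg
AGTGTATTGACAAGGATAAGGGGGAAGG
"""

if __name__ == "__main__":
    for record_id, sequence in read_fasta(sample).items():
        print(record_id, len(sequence), base_composition(sequence))
```

Run on the complete record, the sequence length should agree with the "(448 letters)" reported in the BLASTX query line.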


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC002847A_C01 KMC002847A_c01
         (448 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAM65866.1| unknown [Arabidopsis thaliana]                          79  1e-14
ref|NP_197853.1| putative protein; protein id: At5g24650.1, supp...    79  1e-14
ref|NP_190525.1| putative protein; protein id: At3g49560.1, supp...    78  3e-14
pir||T07907 hydroxyproline-rich glycoprotein GAS28 precursor - C...    51  3e-06
gb|AAF18323.1|AF134858_2 espin [Mus musculus]                          50  6e-06

>gb|AAM65866.1| unknown [Arabidopsis thaliana]
          Length = 246

 Score = 79.0 bits (193), Expect = 1e-14
 Identities = 40/67 (59%), Positives = 44/67 (64%)
 Frame = +1

Query: 247 VMGTLTGDASSPFPTPPPNASLNPQAMASLNQAQALAGGPLVQARNFAVMTGVNAGITSV 426
           V G   G          P A ++PQAMASL Q QAL GGPLVQARNFA +TGVNAGI  V
Sbjct: 50  VQGAFIGGLMGTLSPEMPQAGIDPQAMASLKQTQALVGGPLVQARNFAAITGVNAGIACV 109

Query: 427 LTRIRGK 447
           + RIRGK
Sbjct: 110 MKRIRGK 116
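
In BLASTX output the Query coordinates are nucleotide positions, while the Sbjct coordinates are amino-acid positions. The sketch below shows how the reported frame and the number of aligned codons can be recovered from the query coordinates. It handles forward-strand hits only, because those are the only ones in this report; the function names are illustrative and not part of BLAST.

```python
def forward_frame(query_start):
    """Return the forward reading frame (1, 2 or 3) for a 1-based query start."""
    if query_start < 1:
        raise ValueError("query coordinates are 1-based")
    return (query_start - 1) % 3 + 1


def codons_spanned(query_start, query_end):
    """Number of complete codons covered by an inclusive nucleotide range."""
    if query_end < query_start:
        raise ValueError("this sketch handles forward-strand ranges only")
    return (query_end - query_start + 1) // 3


if __name__ == "__main__":
    # First HSP of the top hit: Query 247..426 followed by 427..447.
    print(forward_frame(247))
    print(codons_spanned(247, 426) + codons_spanned(427, 447))
    # Second HSP of the top hit starts at query position 75.
    print(forward_frame(75))
```

For the first HSP this gives frame 1 and 67 codons, consistent with "Frame = +1" and the alignment length in "Identities = 40/67"; a start at position 75 gives frame 3, consistent with "Frame = +3".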

 Score = 65.1 bits (157), Expect = 2e-10
 Identities = 32/60 (53%), Positives = 38/60 (63%)
 Frame = +3

Query: 75  MVAKILPQQLLNPIEQLQTRFKEVESGFKLWLSKQSIAVEAAVVTTTSAAQGAAIGACHG 254
           M    L +   NPI+Q Q +FKE+E+GFK WLSKQ + VEAAVVT     QGA IG   G
Sbjct: 1   MAVMSLMKDQQNPIQQFQVKFKEIETGFKSWLSKQKLPVEAAVVTAMGGVQGAFIGGLMG 60

>ref|NP_197853.1| putative protein; protein id: At5g24650.1, supported by cDNA:
           5684., supported by cDNA: gi_17979478 [Arabidopsis
           thaliana] gi|10177865|dbj|BAB11217.1|
           emb|CAB62460.1~gene_id:K18P6.19~similar to unknown
           protein [Arabidopsis thaliana]
           gi|17979479|gb|AAL50076.1| AT5g24650/K18P6_19
           [Arabidopsis thaliana] gi|22655440|gb|AAM98312.1|
           At5g24650/K18P6_19 [Arabidopsis thaliana]
          Length = 259

 Score = 79.0 bits (193), Expect = 1e-14
 Identities = 40/67 (59%), Positives = 44/67 (64%)
 Frame = +1

Query: 247 VMGTLTGDASSPFPTPPPNASLNPQAMASLNQAQALAGGPLVQARNFAVMTGVNAGITSV 426
           V G   G          P A ++PQAMASL Q QAL GGPLVQARNFA +TGVNAGI  V
Sbjct: 63  VQGAFIGGLMGTLSPEMPQAGIDPQAMASLKQTQALVGGPLVQARNFAAITGVNAGIACV 122

Query: 427 LTRIRGK 447
           + RIRGK
Sbjct: 123 MKRIRGK 129

 Score = 66.6 bits (161), Expect = 6e-11
 Identities = 33/64 (51%), Positives = 40/64 (61%)
 Frame = +3

Query: 63  KQGVMVAKILPQQLLNPIEQLQTRFKEVESGFKLWLSKQSIAVEAAVVTTTSAAQGAAIG 242
           K+  M    L +   NPI+Q Q +FKE+E+GFK WLSKQ + VEAAVVT     QGA IG
Sbjct: 10  KRETMAVMSLMKDQQNPIQQFQVKFKEIETGFKSWLSKQKLPVEAAVVTAMGGVQGAFIG 69

Query: 243 ACHG 254
              G
Sbjct: 70  GLMG 73

>ref|NP_190525.1| putative protein; protein id: At3g49560.1, supported by cDNA:
           15698., supported by cDNA: gi_13430557 [Arabidopsis
           thaliana] gi|11285728|pir||T46233 hypothetical protein
           T9C5.150 - Arabidopsis thaliana
           gi|6561956|emb|CAB62460.1| putative protein [Arabidopsis
           thaliana] gi|13430558|gb|AAK25901.1|AF360191_1 unknown
           protein [Arabidopsis thaliana]
           gi|21553682|gb|AAM62775.1| unknown [Arabidopsis
           thaliana] gi|25054947|gb|AAN71950.1| unknown protein
           [Arabidopsis thaliana]
          Length = 261

 Score = 77.8 bits (190), Expect = 3e-14
 Identities = 42/81 (51%), Positives = 49/81 (59%)
 Frame = +1

Query: 205 SPPPAPPKVPPSAPVMGTLTGDASSPFPTPPPNASLNPQAMASLNQAQALAGGPLVQARN 384
           S P     V   + V G   G          P A ++PQA+AS+ QAQAL GGP VQARN
Sbjct: 54  SIPVEAAVVSTMSGVQGAFIGGLMGTLSPEMPQAGVDPQAIASMKQAQALVGGPWVQARN 113

Query: 385 FAVMTGVNAGITSVLTRIRGK 447
           FA +TGVNAGI SV+ RIRGK
Sbjct: 114 FAAITGVNAGIASVMKRIRGK 134

 Score = 67.8 bits (164), Expect = 3e-11
 Identities = 37/65 (56%), Positives = 43/65 (65%)
 Frame = +3

Query: 60  GKQGVMVAKILPQQLLNPIEQLQTRFKEVESGFKLWLSKQSIAVEAAVVTTTSAAQGAAI 239
           G+   M +    QQ  NPI+Q Q +FKEVE+ FK WLSKQSI VEAAVV+T S  QGA I
Sbjct: 16  GEMMAMASLFNDQQ--NPIQQFQVKFKEVETNFKTWLSKQSIPVEAAVVSTMSGVQGAFI 73

Query: 240 GACHG 254
           G   G
Sbjct: 74  GGLMG 78

>pir||T07907 hydroxyproline-rich glycoprotein GAS28 precursor - Chlamydomonas
           reinhardtii gi|2384728|gb|AAB69862.1|
           hydroxyproline-rich glycoprotein gas28p precursor
           [Chlamydomonas reinhardtii]
          Length = 446

 Score = 51.2 bits (121), Expect = 3e-06
 Identities = 35/98 (35%), Positives = 45/98 (45%), Gaps = 2/98 (2%)
 Frame = +1

Query: 151 PASNSGSPSNPSPSKPPSSPPPAPPKVPPSAPVMGTLTGDASSPFPTPPPNASLNPQAMA 330
           P+ +  SP +PSP  PPS PPP PP   P  P +     DA +    PPP +   P++  
Sbjct: 226 PSPSPPSPPSPSPPPPPSPPPPPPPTPSPPPPELPPAQPDAPARKRPPPPASPPPPRSDF 285

Query: 331 SLNQAQALAGGPLVQARNFAVMTGVNAGITSV--LTRI 438
              Q Q         AR   +MT  +  IT V  LTRI
Sbjct: 286 PFCQCQR-------NARGSRLMTTASNNITVVNGLTRI 316

 Score = 34.3 bits (77), Expect = 0.36
 Identities = 20/47 (42%), Positives = 22/47 (46%)
 Frame = +1

Query: 160 NSGSPSNPSPSKPPSSPPPAPPKVPPSAPVMGTLTGDASSPFPTPPP 300
           N+  P  PSPS P  SPP   P  PPS P           P P+PPP
Sbjct: 219 NAPPPHVPSPSPP--SPPSPSPPPPPSPP-------PPPPPTPSPPP 256

>gb|AAF18323.1|AF134858_2 espin [Mus musculus]
          Length = 532

 Score = 50.1 bits (118), Expect = 6e-06
 Identities = 36/114 (31%), Positives = 46/114 (39%), Gaps = 10/114 (8%)
 Frame = +1

Query: 4   VGPPRMKLSLHDAA*KWNLGSKG*WWRRYYHSSFSTQ*SSFK--LASRKSNPASNSGSPS 177
           +G P    S +D+    +  SKG   +R    + +    S+   L   KS P    G PS
Sbjct: 40  LGSPTSTFSNYDSCSSSHSSSKG---QRSNRGARAADLQSYMDMLNPEKSLPRGKLGKPS 96

Query: 178 NPSPS--------KPPSSPPPAPPKVPPSAPVMGTLTGDASSPFPTPPPNASLN 315
            P PS         PP  PPP PP  PP  P  GT     S  +P P P   L+
Sbjct: 97  PPPPSTTTTTKLPSPPPPPPPPPPSFPPPPPPTGTQPPPPSPGYPAPNPPVGLH 150

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 453,040,708
Number of Sequences: 1393205
Number of extensions: 12125994
Number of successful extensions: 295141
Number of sequences better than 10.0: 9530
Number of HSP's better than 10.0 without gapping: 99272
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 202240
length of database: 448,689,247
effective HSP length: 124
effective length of database: 275,931,827
effective search space used: 6622363848
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
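
The statistics above are sufficient to reproduce the bit scores and E-values shown for each HSP. The sketch below applies the standard Karlin-Altschul relations, S' = (lambda * S - ln K) / ln 2 and E = N * 2^(-S'), using the gapped lambda and K and the effective search space printed in this report. The function names are illustrative, and the result is an approximation of what BLAST prints, not its exact code path.

```python
import math

# Values copied from the report footer.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
SEARCH_SPACE = 6622363848      # "effective search space used"
DB_LETTERS = 448689247         # "length of database"
DB_SEQUENCES = 1393205         # "Number of Sequences"
HSP_LENGTH = 124               # "effective HSP length"


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space=SEARCH_SPACE):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


def effective_db_length(db_letters, n_sequences, hsp_length):
    """Database length after subtracting the HSP length from each sequence."""
    return db_letters - n_sequences * hsp_length


if __name__ == "__main__":
    print(round(bit_score(193), 1), expect_value(79.0))
    print(effective_db_length(DB_LETTERS, DB_SEQUENCES, HSP_LENGTH))
```

With a raw score of 193, `bit_score` comes out close to the 79.0 bits of the top hit, and `expect_value(79.0)` is about 1e-14. `effective_db_length` returns 275,931,827, the "effective length of database" above; dividing the search space by that figure gives 24, which is presumably the effective query length in residues.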


EST assemble image


No.  clone        accession  start  end
1    MPD035f11_f  AV772417   1      424
2    GNf001b03    BP067434   77     448
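
The two EST positions in the clone table describe how the 448-base contig is assembled. As a rough sketch, assuming the positions are 1-based and inclusive (the page does not state this) and using an illustrative helper name, the per-base coverage depth can be computed as follows:

```python
# (clone, accession, start, end) rows taken from the table above.
ESTS = [
    ("MPD035f11_f", "AV772417", 1, 424),
    ("GNf001b03", "BP067434", 77, 448),
]
CONTIG_LENGTH = 448  # from "(448 letters)" in the BLASTX query line


def coverage_depth(ests, contig_length):
    """Return a list giving the number of ESTs covering each contig base."""
    depth = [0] * contig_length
    for _clone, _accession, start, end in ests:
        # Assumed convention: 1-based, inclusive coordinates.
        for pos in range(start, end + 1):
            depth[pos - 1] += 1
    return depth


if __name__ == "__main__":
    depth = coverage_depth(ESTS, CONTIG_LENGTH)
    print("bases covered by both ESTs:", depth.count(2))
    print("bases covered by one EST:", depth.count(1))
    print("uncovered bases:", depth.count(0))
```

Under that assumption the two ESTs overlap across positions 77 to 424, and every base of the contig is covered by at least one clone.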




Lotus japonicus
Kazusa DNA Research Institute