KMC005155A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC005155A_C01 KMC005155A_c01
agtagtaatagtatcagtatcacactatcacgtGACAATAAAAATACTACATTATTTGCA
TTAATGAAGAAACTTAAGTATCTGAATAAAATTACTAGCAGCACCACAACCCCACAAACA
AGAGAAAAGGCCAACAAAAGAAATAGAAAAATCTGATTTTACATTATAGACGGAAGAAGT
AGAAGAAGTGATATAATGAGATAATCAGATATCAAGAGCATTAGGACCAGTACAGAAATT
CAGGACGTGGCTCCACACCCACTGAACCAAGAACCTCCGTCGGATTCCCTTCATGTTAAA
CGTGGCACCATCTCCACCGTCCAACACCTGCCCGGTGTATGACCCACCTCCACCGGTGCC
GTAAATCCCCTCGCACAGATCGGCGATCTCCACCGGAAATGTCGGGTCCTGACCCGCGTA
CCAAGCATTCGCCAACGGGTTCGTTGCCAGCTCCGCAATCTCGTGCCCGATCACGCTTAT
CATCCCATCCACCCCAACGTCGCCGTTTGGTGACTTGAATGGTTTCCGGTTCGGAATGAA
CGCCGGCACGGCGAACGGGTACGCGCACTGCCCCGGGCACAGTTTCGCCGAGTTACCCAC
CCAAGCGTAGGGTAAGGTGTACCCGACCAGCGACGGGAACGTGAAGTAGTGGAACCCACA
CACCGACGTGCAGAAATCCTGAACGAACACGTCATCGGCGGTGAGTAGCAGGTAGAGCCC
GCTTTTCGGGTTGATCGG


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC005155A_C01 KMC005155A_c01
         (738 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_199968.1| putative protein; protein id: At5g51550.1, supp...   328  6e-89
gb|AAL24269.1| AT5g51550/K17N15_10 [Arabidopsis thaliana] gi|216...   328  6e-89
gb|AAM64823.1| unknown [Arabidopsis thaliana]                         326  2e-88
dbj|BAB19386.1| contains ESTs C27834(C53187),AU161017(C53187)~un...   268  8e-71
ref|NP_565409.1| expressed protein; protein id: At2g17230.1, sup...   253  3e-66

>ref|NP_199968.1| putative protein; protein id: At5g51550.1, supported by cDNA:
           33455., supported by cDNA: gi_16226761, supported by
           cDNA: gi_16604526 [Arabidopsis thaliana]
           gi|9758197|dbj|BAB08671.1|
           gb|AAD25141.1~gene_id:K17N15.10~similar to unknown
           protein [Arabidopsis thaliana]
           gi|16226762|gb|AAL16255.1|AF428325_1 AT5g51550/K17N15_10
           [Arabidopsis thaliana]
          Length = 337

 Score =  328 bits (840), Expect = 6e-89
 Identities = 147/176 (83%), Positives = 161/176 (90%)
 Frame = -1

Query: 738 PINPKSGLYLLLTADDVFVQDFCTSVCGFHYFTFPSLVGYTLPYAWVGNSAKLCPGQCAY 559
           P+NPKSGLYLLLTADDV+VQDFC  VCGFHYFTFPS+VG+TLPYAWVGNSAKLCPG CAY
Sbjct: 162 PVNPKSGLYLLLTADDVYVQDFCGQVCGFHYFTFPSIVGFTLPYAWVGNSAKLCPGVCAY 221

Query: 558 PFAVPAFIPNRKPFKSPNGDVGVDGMISVIGHEIAELATNPLANAWYAGQDPTFPVEIAD 379
           PFAVPAFIP  KP KSPNGDVGVDGMISVI HEIAELATNPL NAWYAG DP  PVEIAD
Sbjct: 222 PFAVPAFIPGLKPVKSPNGDVGVDGMISVIAHEIAELATNPLVNAWYAGPDPVAPVEIAD 281

Query: 378 LCEGIYGTGGGGSYTGQVLDGGDGATFNMKGIRRRFLVQWVWSHVLNFCTGPNALD 211
           LCEGIYGTGGGGSYTGQ+L+   GAT+N+ GIRRR+L+QW+WSHV+++CTGPNALD
Sbjct: 282 LCEGIYGTGGGGSYTGQMLNDHSGATYNVNGIRRRYLIQWLWSHVVSYCTGPNALD 337

>gb|AAL24269.1| AT5g51550/K17N15_10 [Arabidopsis thaliana]
           gi|21655293|gb|AAM65358.1| AT5g51550/K17N15_10
           [Arabidopsis thaliana]
          Length = 337

 Score =  328 bits (840), Expect = 6e-89
 Identities = 147/176 (83%), Positives = 161/176 (90%)
 Frame = -1

Query: 738 PINPKSGLYLLLTADDVFVQDFCTSVCGFHYFTFPSLVGYTLPYAWVGNSAKLCPGQCAY 559
           P+NPKSGLYLLLTADDV+VQDFC  VCGFHYFTFPS+VG+TLPYAWVGNSAKLCPG CAY
Sbjct: 162 PVNPKSGLYLLLTADDVYVQDFCGQVCGFHYFTFPSIVGFTLPYAWVGNSAKLCPGVCAY 221

Query: 558 PFAVPAFIPNRKPFKSPNGDVGVDGMISVIGHEIAELATNPLANAWYAGQDPTFPVEIAD 379
           PFAVPAFIP  KP KSPNGDVGVDGMISVI HEIAELATNPL NAWYAG DP  PVEIAD
Sbjct: 222 PFAVPAFIPGLKPVKSPNGDVGVDGMISVIAHEIAELATNPLVNAWYAGPDPVAPVEIAD 281

Query: 378 LCEGIYGTGGGGSYTGQVLDGGDGATFNMKGIRRRFLVQWVWSHVLNFCTGPNALD 211
           LCEGIYGTGGGGSYTGQ+L+   GAT+N+ GIRRR+L+QW+WSHV+++CTGPNALD
Sbjct: 282 LCEGIYGTGGGGSYTGQMLNDHSGATYNVNGIRRRYLIQWLWSHVVSYCTGPNALD 337

>gb|AAM64823.1| unknown [Arabidopsis thaliana]
          Length = 337

 Score =  326 bits (836), Expect = 2e-88
 Identities = 146/176 (82%), Positives = 160/176 (89%)
 Frame = -1

Query: 738 PINPKSGLYLLLTADDVFVQDFCTSVCGFHYFTFPSLVGYTLPYAWVGNSAKLCPGQCAY 559
           P+NPKSGLYLLLTADDV+VQDFC  VCGFHYFTFPS+VG+TLPYAWVGNSAKLCPG CAY
Sbjct: 162 PVNPKSGLYLLLTADDVYVQDFCGQVCGFHYFTFPSIVGFTLPYAWVGNSAKLCPGVCAY 221

Query: 558 PFAVPAFIPNRKPFKSPNGDVGVDGMISVIGHEIAELATNPLANAWYAGQDPTFPVEIAD 379
           PF VPAFIP  KP KSPNGDVGVDGMISVI HEIAELATNPL NAWYAG DP  PVEIAD
Sbjct: 222 PFXVPAFIPGLKPVKSPNGDVGVDGMISVIAHEIAELATNPLVNAWYAGPDPVAPVEIAD 281

Query: 378 LCEGIYGTGGGGSYTGQVLDGGDGATFNMKGIRRRFLVQWVWSHVLNFCTGPNALD 211
           LCEGIYGTGGGGSYTGQ+L+   GAT+N+ GIRRR+L+QW+WSHV+++CTGPNALD
Sbjct: 282 LCEGIYGTGGGGSYTGQMLNDHSGATYNVNGIRRRYLIQWLWSHVVSYCTGPNALD 337

>dbj|BAB19386.1| contains ESTs C27834(C53187),AU161017(C53187)~unknown protein
           [Oryza sativa (japonica cultivar-group)]
           gi|28190675|gb|AAO33153.1| unknown [Oryza sativa
           (japonica cultivar-group)]
          Length = 348

 Score =  268 bits (684), Expect = 8e-71
 Identities = 121/174 (69%), Positives = 150/174 (85%), Gaps = 3/174 (1%)
 Frame = -1

Query: 723 SGLYLLLTADDVFVQDFCTSVCGFHYFTFPSLVGYTLPYAWVGNSAKLCPGQCAYPFAVP 544
           SG+YL+LT+ +V V++FC  VCGFHYFTFPS+VGYTLPYAWVGNSA  CP  CAYPFA+P
Sbjct: 174 SGVYLVLTSPEVVVENFCGQVCGFHYFTFPSVVGYTLPYAWVGNSAARCPEVCAYPFAIP 233

Query: 543 AFIPN-RKPFKSPNGDVGVDGMISVIGHEIAELATNPLANAWYAGQDPTFPVEIADLCEG 367
           +++   R+    PNGDVGVDGM+SVI HE+AELA+NPLANAWYAG+DP+FP EIADLCEG
Sbjct: 234 SYVGGGRRAEAPPNGDVGVDGMVSVIAHELAELASNPLANAWYAGEDPSFPTEIADLCEG 293

Query: 366 IYGTGGGGSYTGQVL-DGGDGATFNMKGI-RRRFLVQWVWSHVLNFCTGPNALD 211
           IYGTGGGG+YTGQ+L DG  GA++N+ G+  R+FLVQWVW+ +L++C+GPNALD
Sbjct: 294 IYGTGGGGAYTGQLLTDGRSGASYNVNGVGGRKFLVQWVWNPILSYCSGPNALD 347

>ref|NP_565409.1| expressed protein; protein id: At2g17230.1, supported by cDNA:
           641., supported by cDNA: gi_16604329, supported by cDNA:
           gi_19699195 [Arabidopsis thaliana]
           gi|25371899|pir||F84549 hypothetical protein At2g17230
           [imported] - Arabidopsis thaliana
           gi|4584346|gb|AAD25141.1| expressed protein [Arabidopsis
           thaliana] gi|16604330|gb|AAL24171.1| At2g17230/T23A1.9
           [Arabidopsis thaliana] gi|19699196|gb|AAL90964.1|
           At2g17230/T23A1.9 [Arabidopsis thaliana]
          Length = 363

 Score =  253 bits (645), Expect = 3e-66
 Identities = 111/179 (62%), Positives = 142/179 (79%), Gaps = 3/179 (1%)
 Frame = -1

Query: 738 PINPKSGLYLLLTADDVFVQDFCTSVCGFHYFTFPSLVGYTLPYAWVGNSAKLCPGQCAY 559
           P++ K+G+YL+LT+ DV +QDFC +VCGFHYFTFPS+VGYT+PYAWVG S K CP  CAY
Sbjct: 185 PVDHKNGMYLVLTSHDVTMQDFCRAVCGFHYFTFPSMVGYTMPYAWVGQSGKQCPEVCAY 244

Query: 558 PFAVPAFIPNRKP--FKSPNGDVGVDGMISVIGHEIAELATNPLANAWYAGQDPTFPVEI 385
           PFA+P ++ +  P   + PNG+ GVDGM+SVIGHE+AE+ +NPL NAWYAG+DPT P EI
Sbjct: 245 PFALPGYMGHGGPGELRPPNGETGVDGMVSVIGHELAEVVSNPLINAWYAGEDPTAPTEI 304

Query: 384 ADLCEGIYGTGGGGSYTGQVLDGGDGATFNMKGI-RRRFLVQWVWSHVLNFCTGPNALD 211
            DLCEG+YG+GGGG Y GQV+   +G TFNM G   R+FLVQW+W+  L  C+GPN++D
Sbjct: 305 GDLCEGLYGSGGGGGYIGQVMRDREGKTFNMNGKGGRKFLVQWIWNPNLKACSGPNSVD 363

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 729,778,288
Number of Sequences: 1393205
Number of extensions: 19471440
Number of successful extensions: 95226
Number of sequences better than 10.0: 309
Number of HSP's better than 10.0 without gapping: 73073
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 92038
length of database: 448,689,247
effective HSP length: 120
effective length of database: 281,504,647
effective search space used: 35188080875
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 MF018f07_f BP029209 1 473
2 MFB040a10_f BP036899 34 514
3 SPD014g12_f BP045146 35 567
4 SPD038e05_f BP047027 40 592
5 SPD023e03_f BP045819 105 651
6 MPD045d08_f AV773052 167 738
7 MPD020g06_f AV771408 168 728




Lotus japonicus
Kazusa DNA Research Institute