KMC000629A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC000629A_C01 KMC000629A_c01
aataatggAGAAATCGATCAATATCCAAATCAAATTAGAAATTGGTATTATACAAATTAT
TCATATATATGCTAGATGTACTATTTAACTAGAAATGGGTGATCCACTATTGGGCTCTGT
GTTCTCCTTGACAGCCCATTTGTTAACACGATTTCGAGCTATCACTACCTGATTTTCCCA
ATCTGTTTTAAGTGTGATTGTAATTAGCATTACGGTTTGAACAAATGTTCCAAACAACAT
TCCAATCCAAACACCCTTTTCTTGCAAATGGATAAGGTTACCCAGCACCACTCCAACCGG
AATACCAATGAGGTAATAACAGCCTATGTTTACATATGCTACAACGCTTGGCCATCCAGC
TCCAACAGAAACCCCAGAGAGAACAGGTGGGACACTGTTCAACAATATGGAGATTGACAA
TAAAGGTGATAAATCCCCAACTGCTTCAGCTACCTCTGAGTCTAGAGTGAAAACATAAGC
AAGTCTTTCCCTCAAAAACAG


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC000629A_C01 KMC000629A_c01
         (501 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_174585.1| hypothetical protein; protein id: At1g33090.1 [...   171  6e-42
ref|NP_174587.1| unknown protein; protein id: At1g33110.1, suppo...   170  7e-42
ref|NP_174586.1| unknown protein; protein id: At1g33100.1 [Arabi...   165  2e-40
ref|NP_174584.2| hypothetical protein; protein id: At1g33080.1, ...   160  1e-38
pir||B86455 T9L6.1 protein - Arabidopsis thaliana gi|9665160|gb|...   157  5e-38

>ref|NP_174585.1| hypothetical protein; protein id: At1g33090.1 [Arabidopsis
           thaliana]
          Length = 494

 Score =  171 bits (432), Expect = 6e-42
 Identities = 80/131 (61%), Positives = 100/131 (76%)
 Frame = -1

Query: 501 LFLRERLAYVFTLDSEVAEAVGDLSPLLSISILLNSVPPVLSGVSVGAGWPSVVAYVNIG 322
           LFLR R++Y+FT    VA  V DLSPLL+ SILLNSV PVLSGV+VGAGW   VAY+N+ 
Sbjct: 358 LFLRGRISYIFTTSEAVAAEVADLSPLLAFSILLNSVQPVLSGVAVGAGWQGYVAYINLA 417

Query: 321 CYYLIGIPVGVVLGNLIHLQEKGVWIGMLFGTFVQTVMLITITLKTDWENQVVIARNRVN 142
           CYYL+GIPVG+VLG ++ LQ KGVWIGMLFG FVQT +L  +TL+TDW+ QV  +   +N
Sbjct: 418 CYYLLGIPVGLVLGYVVGLQVKGVWIGMLFGIFVQTCVLTIMTLRTDWDQQVSTSLKNIN 477

Query: 141 KWAVKENTEPN 109
           +W V E+ + N
Sbjct: 478 RWVVPESRDAN 488

>ref|NP_174587.1| unknown protein; protein id: At1g33110.1, supported by cDNA:
           gi_17065359 [Arabidopsis thaliana]
           gi|17065360|gb|AAL32834.1| Unknown protein [Arabidopsis
           thaliana] gi|21387205|gb|AAM48006.1| unknown protein
           [Arabidopsis thaliana]
          Length = 494

 Score =  170 bits (431), Expect = 7e-42
 Identities = 79/134 (58%), Positives = 101/134 (74%)
 Frame = -1

Query: 501 LFLRERLAYVFTLDSEVAEAVGDLSPLLSISILLNSVPPVLSGVSVGAGWPSVVAYVNIG 322
           LFLR R++Y+FT    VA  V DLSPLL+ SIL+NSV PVLSGV+VGAGW   V YVN+ 
Sbjct: 358 LFLRGRVSYIFTTSEAVAAEVADLSPLLAFSILMNSVQPVLSGVAVGAGWQGYVTYVNLA 417

Query: 321 CYYLIGIPVGVVLGNLIHLQEKGVWIGMLFGTFVQTVMLITITLKTDWENQVVIARNRVN 142
           CYYL+GIP+G++LG ++ LQ KGVWIGMLFG FVQT +L  +TL+TDW+ QV  +  R+N
Sbjct: 418 CYYLVGIPIGIILGYVVGLQVKGVWIGMLFGIFVQTCVLTVMTLRTDWDQQVSTSLRRLN 477

Query: 141 KWAVKENTEPNSGS 100
           +W V E+ + N  S
Sbjct: 478 RWVVPESRDVNQVS 491

>ref|NP_174586.1| unknown protein; protein id: At1g33100.1 [Arabidopsis thaliana]
          Length = 475

 Score =  165 bits (418), Expect = 2e-40
 Identities = 75/126 (59%), Positives = 96/126 (75%)
 Frame = -1

Query: 486 RLAYVFTLDSEVAEAVGDLSPLLSISILLNSVPPVLSGVSVGAGWPSVVAYVNIGCYYLI 307
           R++Y+FT    VA  V DLSPLL+ SILLNSV PVLSGV++GAGW   VAYVN+ CYYL+
Sbjct: 344 RISYIFTTSEAVAAEVADLSPLLAFSILLNSVQPVLSGVAIGAGWQGYVAYVNLACYYLV 403

Query: 306 GIPVGVVLGNLIHLQEKGVWIGMLFGTFVQTVMLITITLKTDWENQVVIARNRVNKWAVK 127
           GIP+GV+LG ++ LQ KGVWIGMLFG FVQT +L  +TL+TDW+ QV  +   +N+W V 
Sbjct: 404 GIPIGVILGYVVGLQVKGVWIGMLFGIFVQTCVLTVMTLRTDWDQQVSTSLRNINRWVVP 463

Query: 126 ENTEPN 109
           E+ + N
Sbjct: 464 ESRDAN 469

>ref|NP_174584.2| hypothetical protein; protein id: At1g33080.1, supported by cDNA:
           gi_19423993 [Arabidopsis thaliana]
           gi|19423994|gb|AAL87319.1| unknown protein [Arabidopsis
           thaliana] gi|22136880|gb|AAM91784.1| unknown protein
           [Arabidopsis thaliana]
          Length = 494

 Score =  160 bits (404), Expect = 1e-38
 Identities = 74/131 (56%), Positives = 96/131 (72%)
 Frame = -1

Query: 501 LFLRERLAYVFTLDSEVAEAVGDLSPLLSISILLNSVPPVLSGVSVGAGWPSVVAYVNIG 322
           LFLRER++Y+FT    VA  V DLSPLL+ SILLNS+ PVLSGV+VGAGW   V  VN+ 
Sbjct: 358 LFLRERVSYIFTTSEAVATQVADLSPLLAFSILLNSIQPVLSGVAVGAGWQKYVTVVNLA 417

Query: 321 CYYLIGIPVGVVLGNLIHLQEKGVWIGMLFGTFVQTVMLITITLKTDWENQVVIARNRVN 142
           CYYL+GIP G+ LG ++ LQ KGVW+GM+FG FVQT +L  +T++TDW+ QV  +  R+N
Sbjct: 418 CYYLVGIPSGLFLGYVVGLQVKGVWLGMIFGIFVQTCVLTVMTMRTDWDQQVSSSLKRLN 477

Query: 141 KWAVKENTEPN 109
           +W   E+   N
Sbjct: 478 RWVEPESPSRN 488

>pir||B86455 T9L6.1 protein - Arabidopsis thaliana
           gi|9665160|gb|AAF97344.1|AC021045_1 Hypothetical Protein
           [Arabidopsis thaliana]
          Length = 424

 Score =  157 bits (398), Expect = 5e-38
 Identities = 72/113 (63%), Positives = 90/113 (78%)
 Frame = -1

Query: 501 LFLRERLAYVFTLDSEVAEAVGDLSPLLSISILLNSVPPVLSGVSVGAGWPSVVAYVNIG 322
           LFLR R++Y+FT    VA  V DLSPLL+ SIL+NSV PVLSGV+VGAGW   V YVN+ 
Sbjct: 302 LFLRGRVSYIFTTSEAVAAEVADLSPLLAFSILMNSVQPVLSGVAVGAGWQGYVTYVNLA 361

Query: 321 CYYLIGIPVGVVLGNLIHLQEKGVWIGMLFGTFVQTVMLITITLKTDWENQVV 163
           CYYL+GIP+G++LG ++ LQ KGVWIGMLFG FVQT +L  +TL+TDW+ QV+
Sbjct: 362 CYYLVGIPIGIILGYVVGLQVKGVWIGMLFGIFVQTCVLTVMTLRTDWDQQVI 414

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 456,901,259
Number of Sequences: 1393205
Number of extensions: 10640989
Number of successful extensions: 27829
Number of sequences better than 10.0: 190
Number of HSP's better than 10.0 without gapping: 26534
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 27681
length of database: 448,689,247
effective HSP length: 114
effective length of database: 289,863,877
effective search space used: 15072921604
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 GENLf059g09 BP065524 1 501
2 GENLf027d04 BP063768 9 440




Lotus japonicus
Kazusa DNA Research Institute