KMC016196A_c01

Fasta Sequence
>KMC016196A_C01 KMC016196A_c01
cttaccatatccagcatAATCAAAGTAAACTTTGGTATTACAGATGTACACACCATTGTT
CTAGGTTCTCACAGATACATTTAGAAGAAAACAGGTGGGAAGTGAAAAGAGTAAAAGCAT
GAACAAATCATAAAGAAATGATACAAACAAAATCAACACAAGAACAGAAGAGAATTAAAA
GAACTTATCATACTGGCTAGTGAGGGAGTCTCTCAGATTTGAATAAAGGCCATAGAGTGA
ATACTGGCGAAGGTTTTTTACTTCATACCAAGAGTTGAGAGCCGAAAGATGTTTATCCGT
GCCAGAGAACGCCAATTGTATCCCTAAGCTGCAATCCACAGTTCCTCCATGGCTCTTGCA
GCTGGATGTTTGCAGTGCACAATCCTGATTGTTGAGACAGACAGCTTTGGAGTTACTTGA
GCATTTAGAACACCCATCTTTTTTCCAATACAAGTTTTGTAGCCTTCCCTTCTTAAACTC
AAGGACAAGAGTGAAACTGGTTACGGTATACGTGCTGTTTGCAACAAATGCAGGTGGAGA
CCTTGCAGCATATTTCCGACCAGCAAATGCAACCATATAGCCATACAAGTCCGCAACGAA
G
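
The record above is in FASTA format: one header line starting with ">" followed by wrapped sequence lines. As a minimal sketch (the function name read_fasta and the toy input are illustrative and not part of this record), the sequence can be rejoined and its length checked against the "(601 letters)" figure in the BLASTX report like this:

```python
def read_fasta(text):
    """Parse FASTA text into a dict mapping the first header word to the sequence.

    Letter case is preserved exactly as given in the input.
    """
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            # The identifier is the first whitespace-delimited word of the header.
            name = line[1:].split()[0]
            records[name] = []
        elif name is not None:
            records[name].append(line)
    # Join the wrapped sequence lines of each record into one string.
    return {key: "".join(parts) for key, parts in records.items()}


# Toy input in the same layout as the record above (not the real sequence).
example = ">demo_contig demo\nacgtACGT\nACG\n"
sequences = read_fasta(example)
print(len(sequences["demo_contig"]))
```

Applied to the full record, len() of the parsed sequence should equal the query length that BLASTX reports.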


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC016196A_C01 KMC016196A_c01
         (601 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_190001.2| putative protein; protein id: At3g44150.1, supp...   229  2e-59
pir||T49131 hypothetical protein F26G5.100 - Arabidopsis thalian...   213  1e-54
ref|NP_566401.1| expressed protein; protein id: At3g11800.1, sup...   213  2e-54
gb|AAF23196.1|AC016795_9 unknown protein [Arabidopsis thaliana]       192  4e-48
ref|NP_190432.1| hypothetical protein; protein id: At3g48630.1 [...   163  2e-39

>ref|NP_190001.2| putative protein; protein id: At3g44150.1, supported by cDNA:
           gi_17529171 [Arabidopsis thaliana]
           gi|17529172|gb|AAL38812.1| unknown protein [Arabidopsis
           thaliana] gi|23297558|gb|AAN12895.1| unknown protein
           [Arabidopsis thaliana]
          Length = 246

 Score =  229 bits (585), Expect = 2e-59
 Identities = 107/142 (75%), Positives = 122/142 (85%), Gaps = 1/142 (0%)
 Frame = -2

Query: 600 FVADLYG-YMVAFAGRKYAARSPPAFVANSTYTVTSFTLVLEFKKGRLQNLYWKKDGCSK 424
           F  D YG YMVAFAGRKYAARS PAF+ANST+ VTSFTLV+EF+KGRLQNLYWK+DGC+ 
Sbjct: 105 FFPDNYGGYMVAFAGRKYAARSIPAFIANSTFIVTSFTLVMEFQKGRLQNLYWKRDGCAS 164

Query: 423 CSSNSKAVCLNNQDCALQTSSCKSHGGTVDCSLGIQLAFSGTDKHLSALNSWYEVKNLRQ 244
           C  N   VCLN QDCA++T SCK  GG VDCSLGIQLAFSGTDKHL+ LNSWYEV+NL+Q
Sbjct: 165 CKGNQNFVCLNKQDCAIRTPSCKGRGGAVDCSLGIQLAFSGTDKHLAVLNSWYEVENLKQ 224

Query: 243 YSLYGLYSNLRDSLTSQYDKFF 178
           YSLYGLYSNL+ SLT+Q++ FF
Sbjct: 225 YSLYGLYSNLKSSLTNQFNNFF 246

>pir||T49131 hypothetical protein F26G5.100 - Arabidopsis thaliana
           gi|7635460|emb|CAB88423.1| putative protein [Arabidopsis
           thaliana]
          Length = 240

 Score =  213 bits (543), Expect = 1e-54
 Identities = 101/142 (71%), Positives = 117/142 (82%), Gaps = 1/142 (0%)
 Frame = -2

Query: 600 FVADLYG-YMVAFAGRKYAARSPPAFVANSTYTVTSFTLVLEFKKGRLQNLYWKKDGCSK 424
           F  D YG YMVAFAGRKYAARS PAF+ANST+      +V+EF+KGRLQNLYWK+DGC+ 
Sbjct: 105 FFPDNYGGYMVAFAGRKYAARSIPAFIANSTF------IVMEFQKGRLQNLYWKRDGCAS 158

Query: 423 CSSNSKAVCLNNQDCALQTSSCKSHGGTVDCSLGIQLAFSGTDKHLSALNSWYEVKNLRQ 244
           C  N   VCLN QDCA++T SCK  GG VDCSLGIQLAFSGTDKHL+ LNSWYEV+NL+Q
Sbjct: 159 CKGNQNFVCLNKQDCAIRTPSCKGRGGAVDCSLGIQLAFSGTDKHLAVLNSWYEVENLKQ 218

Query: 243 YSLYGLYSNLRDSLTSQYDKFF 178
           YSLYGLYSNL+ SLT+Q++ FF
Sbjct: 219 YSLYGLYSNLKSSLTNQFNNFF 240

>ref|NP_566401.1| expressed protein; protein id: At3g11800.1, supported by cDNA:
           gi_15028124, supported by cDNA: gi_19310798 [Arabidopsis
           thaliana] gi|15028125|gb|AAK76686.1| unknown protein
           [Arabidopsis thaliana] gi|19310799|gb|AAL85130.1|
           unknown protein [Arabidopsis thaliana]
          Length = 246

 Score =  213 bits (542), Expect = 2e-54
 Identities = 98/135 (72%), Positives = 116/135 (85%)
 Frame = -2

Query: 582 GYMVAFAGRKYAARSPPAFVANSTYTVTSFTLVLEFKKGRLQNLYWKKDGCSKCSSNSKA 403
           GYMVAFAG KYAARS P  VA+S + VTSFTLVLEF+KGRL+N++WKKDGCSKCS +SK 
Sbjct: 112 GYMVAFAGAKYAARSLPIMVADSNHIVTSFTLVLEFQKGRLENMFWKKDGCSKCSGDSKF 171

Query: 402 VCLNNQDCALQTSSCKSHGGTVDCSLGIQLAFSGTDKHLSALNSWYEVKNLRQYSLYGLY 223
           VCLN ++CA++  +CK+ GG VDCSLGIQLAFSGTDKH +ALNSWYEV NL+QYSLYGLY
Sbjct: 172 VCLNKEECAIKPQNCKNQGGQVDCSLGIQLAFSGTDKHYTALNSWYEVANLKQYSLYGLY 231

Query: 222 SNLRDSLTSQYDKFF 178
           SNL+DSLT+ +   F
Sbjct: 232 SNLKDSLTNPFKNIF 246

>gb|AAF23196.1|AC016795_9 unknown protein [Arabidopsis thaliana]
          Length = 127

 Score =  192 bits (487), Expect = 4e-48
 Identities = 90/133 (67%), Positives = 109/133 (81%)
 Frame = -2

Query: 576 MVAFAGRKYAARSPPAFVANSTYTVTSFTLVLEFKKGRLQNLYWKKDGCSKCSSNSKAVC 397
           MVAFAG KYAARS P  VA+S +      +VLEF+KGRL+N++WKKDGCSKCS +SK VC
Sbjct: 1   MVAFAGAKYAARSLPIMVADSNH------IVLEFQKGRLENMFWKKDGCSKCSGDSKFVC 54

Query: 396 LNNQDCALQTSSCKSHGGTVDCSLGIQLAFSGTDKHLSALNSWYEVKNLRQYSLYGLYSN 217
           LN ++CA++  +CK+ GG VDCSLGIQLAFSGTDKH +ALNSWYEV NL+QYSLYGLYSN
Sbjct: 55  LNKEECAIKPQNCKNQGGQVDCSLGIQLAFSGTDKHYTALNSWYEVANLKQYSLYGLYSN 114

Query: 216 LRDSLTSQYDKFF 178
           L+DSLT+ +   F
Sbjct: 115 LKDSLTNPFKNIF 127

>ref|NP_190432.1| hypothetical protein; protein id: At3g48630.1 [Arabidopsis
           thaliana] gi|11285723|pir||T46207 hypothetical protein
           T8P19.140 - Arabidopsis thaliana
           gi|6523094|emb|CAB62352.1| hypothetical protein
           [Arabidopsis thaliana]
          Length = 122

 Score =  163 bits (412), Expect = 2e-39
 Identities = 75/112 (66%), Positives = 87/112 (76%)
 Frame = -2

Query: 513 TYTVTSFTLVLEFKKGRLQNLYWKKDGCSKCSSNSKAVCLNNQDCALQTSSCKSHGGTVD 334
           TY V     V+EF+KGRLQNLYWK+D C+ C  N   VCL  Q CA++T SCK  GG+V 
Sbjct: 11  TYGVVMTFKVMEFQKGRLQNLYWKRDVCASCKGNQNFVCLKKQVCAIRTPSCKGRGGSVG 70

Query: 333 CSLGIQLAFSGTDKHLSALNSWYEVKNLRQYSLYGLYSNLRDSLTSQYDKFF 178
           CSLGIQLAFSGTDKHL+ LNSWYEV NL+QYSLYGLYSNL+ SLT+Q + FF
Sbjct: 71  CSLGIQLAFSGTDKHLAVLNSWYEVDNLKQYSLYGLYSNLKSSLTNQLNNFF 122
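
Every alignment above reports Frame = -2 with query coordinates that run from high to low, because BLASTX translated the reverse strand of the 601-letter query. The sketch below shows how the frame and the number of translated codons follow from those coordinates, assuming the usual BLAST convention that frame -1 begins at the last nucleotide of the query. The function names are illustrative only.

```python
def blastx_frame(query_len, start, end):
    """Return the reading frame implied by 1-based query coordinates.

    start > end means the hit lies on the reverse strand (negative frame).
    """
    if start <= end:
        return (start - 1) % 3 + 1
    return -((query_len - start) % 3 + 1)


def aligned_codons(start, end):
    """Return how many query codons the nucleotide span start..end covers."""
    span = abs(start - end) + 1
    if span % 3 != 0:
        raise ValueError("span is not a whole number of codons")
    return span // 3


# First alignment above: Query 600 .. 178 on a 601-letter query.
print(blastx_frame(601, 600, 178))   # -2
print(aligned_codons(600, 178))      # 141
```

The 141 codons plus the single gap in the query row account for the 142 alignment columns reported for the first hit.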

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 515,415,217
Number of Sequences: 1393205
Number of extensions: 10596312
Number of successful extensions: 32026
Number of sequences better than 10.0: 22
Number of HSP's better than 10.0 without gapping: 30631
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 31957
length of database: 448,689,247
effective HSP length: 117
effective length of database: 285,684,262
effective search space used: 23426109484
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
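
The bit values in parentheses above, and the bit scores and E-values of the hits, are tied to raw scores by the Lambda and K parameters printed in this section. A minimal sketch, assuming the standard Karlin-Altschul relations bits = (Lambda * S - ln K) / ln 2 and E = search_space * 2^(-bits); the function names are illustrative, and because the printed parameters and bit scores are rounded, results agree with the report only approximately:

```python
import math


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to bits."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def drop_off_bits(raw_drop, lam):
    """Convert an X drop-off value to bits (no K term is involved)."""
    return lam * raw_drop / math.log(2)


def expect_value(bits, search_space):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


# Ungapped parameters (Lambda 0.318, K 0.135) apply to S1 and X1.
print(round(bit_score(41, 0.318, 0.135), 1))   # 21.7
print(round(drop_off_bits(16, 0.318), 1))      # 7.3

# Gapped parameters (Lambda 0.267, K 0.0410) apply to X2, X3 and the hit scores.
print(round(drop_off_bits(38, 0.267), 1))      # 14.6
print(bit_score(585, 0.267, 0.0410))           # close to the reported 229 bits

# E-value of the top hit from its reported bit score and the effective search space.
print(expect_value(229, 23426109484))          # same order of magnitude as 2e-59
```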


EST assemble image


no.  clone        accession  position (start-end)
1    SPD035a08_f  BP046755   1-601
2    MWM225f02_f  AV768177   18-483




Lotus japonicus
Kazusa DNA Research Institute