KMC018960A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC018960A_C01 KMC018960A_c01
atgtaagatatacagttaaaaggggatgaGAAAAGAATTACATTCAAGAATGCCAGATTC
CACTCAGCATACTGCGGAACCGGTTGGTGGCTAGCTTCCTAACAAGTCTCTGGGCATCAG
AAGAAACTTCAGCTTCTGCAAGTCTCTCTATATTTTGGGCATATCCAGAACTAACAATCT
TCTTCTTCCCACTGTTGCAGCTGGTTAATGACATTAAGATGGAGATCAAAAACTTTTGGT
TATTTGAGTTTCCCTCATTTGGTTCCAGCAATTGCAGAAGAAGAGCAATACTTTGATCAT
CTTGCACAAACCTTTTTCTATTTTTGGGCACCATAACCATGCCAGAGAGTGCCTCAGCTG
CCATTTCGCGAACTTCAAATGATTTGGCAGTTAGAAACTTAACAAGCTCTGCCATGAATC
CCGCATCTCCCAACGCTTTCTTAGCCTCCTCTGATGTTCCACACAGTCTAATGGCCACTT
TCAACGCCAGCTCTTGAATGGAAACTTCTCCATTTCTAACATAATATAGTAATTGATCCA
CAAAGCCATAAGTTATCAAAAT


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC018960A_C01 KMC018960A_c01
         (562 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_564774.1| expressed protein; protein id: At1g61350.1, sup...   202  3e-51
gb|AAL14389.1| At1g61350/T1F9_16 [Arabidopsis thaliana]               201  7e-51
ref|NP_182096.1| unknown protein; protein id: At2g45720.1 [Arabi...    93  2e-18
gb|AAM92297.1| putative arm repeat containing protein [Oryza sat...    90  2e-17
ref|NP_199903.1| putative protein; protein id: At5g50900.1, supp...    79  4e-14

>ref|NP_564774.1| expressed protein; protein id: At1g61350.1, supported by cDNA:
           gi_16209657, supported by cDNA: gi_18700124 [Arabidopsis
           thaliana] gi|25404883|pir||B96639 protein T1F9.16
           [imported] - Arabidopsis thaliana
           gi|3056595|gb|AAC13906.1|AAC13906 T1F9.16 [Arabidopsis
           thaliana] gi|18700125|gb|AAL77674.1| At1g61350/T1F9_16
           [Arabidopsis thaliana]
          Length = 573

 Score =  202 bits (513), Expect = 3e-51
 Identities = 105/171 (61%), Positives = 134/171 (77%), Gaps = 5/171 (2%)
 Frame = -1

Query: 544 FVDQLLYYVRNGEVSIQELALKVAIRLCGTSEEAKKALGDAGFMAELVKFLTAKSFEVRE 365
           F+D LL  +RNGE+S+QE ALKV  RLC   EE K+ +G+AGFM ELVKFL AKS +VRE
Sbjct: 404 FLDHLLNLLRNGEISVQESALKVTSRLCSLQEEVKRIMGEAGFMPELVKFLDAKSIDVRE 463

Query: 364 MAAEALSGMVMVPKNRKRFVQDDQSIALLLQLLEPNEG-----NSNNQKFLISILMSLTS 200
           MA+ AL  ++ VP+NRK+F QDD +I+ +LQLL+  +G     +S N KFLISILMSLTS
Sbjct: 464 MASVALYCLISVPRNRKKFAQDDFNISYILQLLDHEDGSNVSSDSGNTKFLISILMSLTS 523

Query: 199 CNSGKKKIVSSGYAQNIERLAEAEVSSDAQRLVRKLATNRFRSMLSGIWHS 47
           CNS ++KI SSGY ++IE+LAE E  SDA++LV+KL+ NRFRS+LSGIWHS
Sbjct: 524 CNSARRKIASSGYLKSIEKLAETE-GSDAKKLVKKLSMNRFRSILSGIWHS 573

>gb|AAL14389.1| At1g61350/T1F9_16 [Arabidopsis thaliana]
          Length = 573

 Score =  201 bits (510), Expect = 7e-51
 Identities = 104/171 (60%), Positives = 134/171 (77%), Gaps = 5/171 (2%)
 Frame = -1

Query: 544 FVDQLLYYVRNGEVSIQELALKVAIRLCGTSEEAKKALGDAGFMAELVKFLTAKSFEVRE 365
           F+D LL  +RNGE+S+QE ALKV  RLC   EE K+ +G+AGFM ELVKFL AKS +VR+
Sbjct: 404 FLDHLLNLLRNGEISVQESALKVTSRLCSLQEEVKRIMGEAGFMPELVKFLDAKSIDVRQ 463

Query: 364 MAAEALSGMVMVPKNRKRFVQDDQSIALLLQLLEPNEG-----NSNNQKFLISILMSLTS 200
           MA+ AL  ++ VP+NRK+F QDD +I+ +LQLL+  +G     +S N KFLISILMSLTS
Sbjct: 464 MASVALYCLISVPRNRKKFAQDDFNISYILQLLDHEDGSNVSSDSGNTKFLISILMSLTS 523

Query: 199 CNSGKKKIVSSGYAQNIERLAEAEVSSDAQRLVRKLATNRFRSMLSGIWHS 47
           CNS ++KI SSGY ++IE+LAE E  SDA++LV+KL+ NRFRS+LSGIWHS
Sbjct: 524 CNSARRKIASSGYLKSIEKLAETE-GSDAKKLVKKLSMNRFRSILSGIWHS 573

>ref|NP_182096.1| unknown protein; protein id: At2g45720.1 [Arabidopsis thaliana]
           gi|7485635|pir||T02475 hypothetical protein At2g45720
           [imported] - Arabidopsis thaliana
           gi|3386623|gb|AAC28553.1| unknown protein [Arabidopsis
           thaliana] gi|20197052|gb|AAM14897.1| unknown protein
           [Arabidopsis thaliana]
          Length = 553

 Score = 93.2 bits (230), Expect = 2e-18
 Identities = 51/163 (31%), Positives = 99/163 (60%)
 Frame = -1

Query: 550 YGFVDQLLYYVRNGEVSIQELALKVAIRLCGTSEEAKKALGDAGFMAELVKFLTAKSFEV 371
           +  +  L++ +++G +  Q+ A     R+  TS E K+ +G++G +  L++ L AK+   
Sbjct: 392 FKIIPSLVHVLKSGSIGAQQAAASTICRIA-TSNETKRMIGESGCIPLLIRMLEAKASGA 450

Query: 370 REMAAEALSGMVMVPKNRKRFVQDDQSIALLLQLLEPNEGNSNNQKFLISILMSLTSCNS 191
           RE+AA+A++ +V VP+N +   +D++S+  L+ LLEP+ GNS  +K+ +S L +L S   
Sbjct: 451 REVAAQAIASLVTVPRNCREVKRDEKSVTSLVMLLEPSPGNS-AKKYAVSGLAALCSSRK 509

Query: 190 GKKKIVSSGYAQNIERLAEAEVSSDAQRLVRKLATNRFRSMLS 62
            KK +VS G    +++L+E EV   +++L+ ++   + +S  S
Sbjct: 510 CKKLMVSHGAVGYLKKLSELEVPG-SKKLLERIEKGKLKSFFS 551

>gb|AAM92297.1| putative arm repeat containing protein [Oryza sativa (japonica
           cultivar-group)] gi|27311271|gb|AAO00697.1| putative
           armadillo repeat containing protein [Oryza sativa
           (japonica cultivar-group)]
          Length = 575

 Score = 90.1 bits (222), Expect = 2e-17
 Identities = 54/166 (32%), Positives = 102/166 (60%)
 Frame = -1

Query: 559 LITYGFVDQLLYYVRNGEVSIQELALKVAIRLCGTSEEAKKALGDAGFMAELVKFLTAKS 380
           L++ G + +L++ +R G V  Q+ A     R+  +S E K+ +G+ G M  LV+ L AKS
Sbjct: 411 LVSLGVLPRLVHVLREGSVGAQQAAAAAICRV-SSSSEMKRLVGEHGCMPLLVRLLEAKS 469

Query: 379 FEVREMAAEALSGMVMVPKNRKRFVQDDQSIALLLQLLEPNEGNSNNQKFLISILMSLTS 200
              RE+AA+A++ ++    N +   +D++S+  L+QLLEP+  N+  +K+ IS L++L++
Sbjct: 470 NGAREVAAQAVASLMSCLANARDIKKDEKSVPNLVQLLEPSPQNT-AKKYAISCLLTLSA 528

Query: 199 CNSGKKKIVSSGYAQNIERLAEAEVSSDAQRLVRKLATNRFRSMLS 62
               KK ++S G    +++L+E +V+  A++L+ KL   + R++ S
Sbjct: 529 SKRCKKLMISHGAIGYLKKLSEMDVAG-AKKLLEKLERGKLRNLFS 573

 Score = 33.1 bits (74), Expect = 2.4
 Identities = 33/134 (24%), Positives = 65/134 (47%), Gaps = 1/134 (0%)
 Frame = -1

Query: 562 ILITYGFVDQLLYYVRNGEVSIQELALKVAIRLCGTSEEAKKALGDAGFMAELVKFLTAK 383
           +L++ G +  L+  V +G +  +E A+    RL  + + A+  +G +G    +    T  
Sbjct: 246 LLVSEGALPPLIRLVESGSLVGREKAVITLQRLSMSPDIARAIVGHSGVRPLIDICQTGD 305

Query: 382 SFEVREMAAEALSGMVMVPKNRKRFVQDDQSIALLLQLLEPNEGNSNNQKFLISILMSLT 203
           S   +  AA AL  +  VP+ R+   ++   + +++ LL+        +++    L SLT
Sbjct: 306 SIS-QSAAAGALKNLSAVPEVRQALAEEG-IVRVMVNLLDCGVV-LGCKEYAAECLQSLT 362

Query: 202 SCNSG-KKKIVSSG 164
           S N G ++ +VS G
Sbjct: 363 SSNDGLRRAVVSEG 376

>ref|NP_199903.1| putative protein; protein id: At5g50900.1, supported by cDNA:
           gi_14532769 [Arabidopsis thaliana]
           gi|9758237|dbj|BAB08736.1|
           gene_id:K3K7.4~pir||T02475~similar to unknown protein
           [Arabidopsis thaliana]
          Length = 555

 Score = 79.0 bits (193), Expect = 4e-14
 Identities = 45/170 (26%), Positives = 99/170 (57%)
 Frame = -1

Query: 562 ILITYGFVDQLLYYVRNGEVSIQELALKVAIRLCGTSEEAKKALGDAGFMAELVKFLTAK 383
           ++I+ GF+ +L+  +  G + ++ +A   A+   G S +++K +G++G +  L+  L  K
Sbjct: 390 VVISEGFIPRLVPVLSCGVLGVR-IAAAEAVSSLGFSSKSRKEMGESGCIVPLIDMLDGK 448

Query: 382 SFEVREMAAEALSGMVMVPKNRKRFVQDDQSIALLLQLLEPNEGNSNNQKFLISILMSLT 203
           + E +E A++ALS +++   NRK F + D+ +  L+QLL+P +    ++++ +S L  L 
Sbjct: 449 AIEEKEAASKALSTLLVCTSNRKIFKKSDKGVVSLVQLLDP-KIKKLDKRYTVSALELLV 507

Query: 202 SCNSGKKKIVSSGYAQNIERLAEAEVSSDAQRLVRKLATNRFRSMLSGIW 53
           +    +K++V++G   ++++L + +         +KLA N  RS + G++
Sbjct: 508 TSKKCRKQVVAAGACLHLQKLVDMDTEG-----AKKLAENLSRSKIWGVF 552

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 485,898,634
Number of Sequences: 1393205
Number of extensions: 9995187
Number of successful extensions: 26245
Number of sequences better than 10.0: 45
Number of HSP's better than 10.0 without gapping: 25320
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 26222
length of database: 448,689,247
effective HSP length: 116
effective length of database: 287,077,467
effective search space used: 20095422690
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 SPD089a11_f BP051082 1 562
2 MFB097e09_f BP041069 30 129




Lotus japonicus
Kazusa DNA Research Institute