KMC019112A_c01

Fasta Sequence
>KMC019112A_C01 KMC019112A_c01
tttcaaaaaagcaatttcctgagtgattgctggtttagagcatcccgcccgaccaaggtc
ttctgtttgaaacaagagcaAAATTTCAAACAGATGCCCATGCTAATTGATCAAAGGAGA
AGGTTGATTTGATGTTTGCTTTGCAACTTAAAGGAGGAAGTGTTAGAGACTCTCTGCCTT
GGTAATAGGTACACTTGAGGTTGAAAAATGAGCAAACTAGGTAATGTTTGGATTGAAAAC
CAGAGAAAAAATAAGTTACAGACGAAGTTATAAGCAACTTCAACTCACTCTCAAAATCTC
AAACTTCAAATGAAATATTTGTTTGCTTGTGTGTTAGCAGGTGCCCTCTATAGAACATGA
CCCTCTTTTCTTGTAGTTTTGATCAATGCTCTGGTGATGACATGGGTAAGAATCCCAATG
GGGCAAAAGAACAAGCAAAATGAAACTGAATGCCGAGTTTCAACCTGATTCTGCAGTCCA
TCTTGGAAAATATGCCTTGCAGCAAAAAGATCAACAACCAAGAGGTGAATCCAGGCAGAA
GCTAAAGTCAATTCGTTAGAGAACATTTTTGCTATACTAGTCAGCTCTGGTAGCAAGTGT
TTACTTGCAAAAATCAAACGAATTGTTTCAGGTGTCCAAGAAAGGTACAATAAATATGCA
TACAGAACACCAAGCACTACATAGGGTAAAGTACTTTCCACAGACTTCTTGGTTAGCTCA
GATTTTGGAGCTAGAACCATGAGTGTGTAAAATGGGAGCACTGCAACTGTTCCC


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC019112A_C01 KMC019112A_c01
         (774 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_564889.1| expressed protein; protein id: At1g67080.1, sup...   212  4e-54
pir||F96694 hypothetical protein F1O19.13 [imported] - Arabidops...   212  4e-54
dbj|BAB03378.1| unnamed protein product [Oryza sativa (japonica ...   200  2e-50
ref|ZP_00071078.1| hypothetical protein [Trichodesmium erythraeu...   104  2e-21
ref|NP_487431.1| hypothetical protein [Nostoc sp. PCC 7120] gi|2...   100  3e-20

>ref|NP_564889.1| expressed protein; protein id: At1g67080.1, supported by cDNA:
           11578. [Arabidopsis thaliana] gi|21536950|gb|AAM61291.1|
           unknown [Arabidopsis thaliana]
          Length = 220

 Score =  212 bits (540), Expect = 4e-54
 Identities = 94/136 (69%), Positives = 122/136 (89%)
 Frame = -2

Query: 773 GTVAVLPFYTLMVLAPKSELTKKSVESTLPYVVLGVLYAYLLYLSWTPETIRLIFASKHL 594
           GT AVLPFYTLMV+APK+E+TKK +ES++PY++LGVLY YLLY+SWTPET++ +F+SK++
Sbjct: 85  GTTAVLPFYTLMVVAPKAEITKKCMESSVPYIILGVLYVYLLYISWTPETLKYMFSSKYM 144

Query: 593 LPELTSIAKMFSNELTLASAWIHLLVVDLFAARHIFQDGLQNQVETRHSVSFCLFFCPIG 414
           LPEL+ IAKMFS+E+TLASAWIHLLVVDLFAAR ++ DGL+NQ+ETRHSVS CL FCP+G
Sbjct: 145 LPELSGIAKMFSSEMTLASAWIHLLVVDLFAARQVYNDGLENQIETRHSVSLCLLFCPVG 204

Query: 413 ILTHVITRALIKTTRK 366
           I++H +T+A+I    K
Sbjct: 205 IVSHFVTKAIINNQYK 220

>pir||F96694 hypothetical protein F1O19.13 [imported] - Arabidopsis thaliana
           gi|9755455|gb|AAF98216.1|AC007152_12 Unknown protein
           [Arabidopsis thaliana]
          Length = 203

 Score =  212 bits (540), Expect = 4e-54
 Identities = 94/136 (69%), Positives = 122/136 (89%)
 Frame = -2

Query: 773 GTVAVLPFYTLMVLAPKSELTKKSVESTLPYVVLGVLYAYLLYLSWTPETIRLIFASKHL 594
           GT AVLPFYTLMV+APK+E+TKK +ES++PY++LGVLY YLLY+SWTPET++ +F+SK++
Sbjct: 68  GTTAVLPFYTLMVVAPKAEITKKCMESSVPYIILGVLYVYLLYISWTPETLKYMFSSKYM 127

Query: 593 LPELTSIAKMFSNELTLASAWIHLLVVDLFAARHIFQDGLQNQVETRHSVSFCLFFCPIG 414
           LPEL+ IAKMFS+E+TLASAWIHLLVVDLFAAR ++ DGL+NQ+ETRHSVS CL FCP+G
Sbjct: 128 LPELSGIAKMFSSEMTLASAWIHLLVVDLFAARQVYNDGLENQIETRHSVSLCLLFCPVG 187

Query: 413 ILTHVITRALIKTTRK 366
           I++H +T+A+I    K
Sbjct: 188 IVSHFVTKAIINNQYK 203

>dbj|BAB03378.1| unnamed protein product [Oryza sativa (japonica cultivar-group)]
          Length = 222

 Score =  200 bits (508), Expect = 2e-50
 Identities = 89/131 (67%), Positives = 115/131 (86%)
 Frame = -2

Query: 773 GTVAVLPFYTLMVLAPKSELTKKSVESTLPYVVLGVLYAYLLYLSWTPETIRLIFASKHL 594
           GT+AVLPFYTLMV+AP +++TK++V+S+ PYV LG+LYAYLLYLSWTP+T+R +FASK+ 
Sbjct: 91  GTIAVLPFYTLMVVAPNADVTKRAVDSSAPYVALGILYAYLLYLSWTPDTLRAMFASKYW 150

Query: 593 LPELTSIAKMFSNELTLASAWIHLLVVDLFAARHIFQDGLQNQVETRHSVSFCLFFCPIG 414
           LPELT I +MF++E+T+ASAWIHLL VDLFAAR ++ DG++N +ETRHSVS CL FCPIG
Sbjct: 151 LPELTGIVRMFASEMTVASAWIHLLAVDLFAARQVYHDGIKNNIETRHSVSLCLLFCPIG 210

Query: 413 ILTHVITRALI 381
           I THV+T+  I
Sbjct: 211 IATHVLTKVHI 221

>ref|ZP_00071078.1| hypothetical protein [Trichodesmium erythraeum IMS101]
          Length = 155

 Score =  104 bits (259), Expect = 2e-21
 Identities = 52/123 (42%), Positives = 78/123 (63%)
 Frame = -2

Query: 761 VLPFYTLMVLAPKSELTKKSVESTLPYVVLGVLYAYLLYLSWTPETIRLIFASKHLLPEL 582
           VLPF+ L++  P  ++T+K +ES +P+++L V+Y YLL  S TPE+     A+    P L
Sbjct: 14  VLPFWALIIFLPNWKVTRKIMESFIPFILLVVVYLYLLISSLTPES-----AAALSNPTL 68

Query: 581 TSIAKMFSNELTLASAWIHLLVVDLFAARHIFQDGLQNQVETRHSVSFCLFFCPIGILTH 402
           + IAK F  E   A+ W+H LV+DLF  R I+ +G +  V T HS+  CLF  P G+L+H
Sbjct: 69  SDIAKFFGEESAAATGWVHFLVMDLFVGRWIYWEGQRTGVWTFHSIILCLFAGPFGLLSH 128

Query: 401 VIT 393
           ++T
Sbjct: 129 ILT 131

>ref|NP_487431.1| hypothetical protein [Nostoc sp. PCC 7120] gi|25534615|pir||AH2229
           hypothetical protein all3391 [imported] - Nostoc sp.
           (strain PCC 7120) gi|17132486|dbj|BAB75090.1|
           ORF_ID:all3391~hypothetical protein [Nostoc sp. PCC
           7120]
          Length = 156

 Score =  100 bits (248), Expect = 3e-20
 Identities = 51/130 (39%), Positives = 79/130 (60%)
 Frame = -2

Query: 767 VAVLPFYTLMVLAPKSELTKKSVESTLPYVVLGVLYAYLLYLSWTPETIRLIFASKHLLP 588
           V VLPF+ LM+L P  ++T++ +ES L ++ L   Y YL   S TPE  + +       P
Sbjct: 12  VFVLPFWALMILLPNWKVTRRVMESYLIFLPLAGAYLYLFITSITPENAQALSN-----P 66

Query: 587 ELTSIAKMFSNELTLASAWIHLLVVDLFAARHIFQDGLQNQVETRHSVSFCLFFCPIGIL 408
           +L  IA+ F++E   A+ WIH LV+DLF  R I+ +G +  + T HS++ CLF  P+G+L
Sbjct: 67  QLADIARFFADETAAATGWIHFLVMDLFVGRWIYWEGQKTGIWTIHSLTLCLFAGPLGVL 126

Query: 407 THVITRALIK 378
           +H+ T  + K
Sbjct: 127 SHIFTYWITK 136
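
Note on "Frame = -2": every alignment above is reported on the reverse strand, with the query starting at nucleotide 773 of the 774-letter contig. The sketch below is an illustration, not the BLASTX implementation. It assumes the standard genetic code; the helper names reverse_complement and translate_frame are hypothetical. It reverse-complements the final line of the FASTA record, skips one base, and translates, which should reproduce the first residues of the query line.

```python
# Standard genetic code, built from the conventional TCAG codon ordering
# (assumption: the report used the standard table).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {a + b + c: AMINO[16 * i + 4 * j + k]
               for i, a in enumerate(BASES)
               for j, b in enumerate(BASES)
               for k, c in enumerate(BASES)}


def reverse_complement(seq):
    """Return the reverse complement of an A/C/G/T sequence (hypothetical helper)."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate_frame(seq, frame):
    """Translate seq in frame +1..+3 or -1..-3; unknown codons become 'X'."""
    strand = seq.upper() if frame > 0 else reverse_complement(seq)
    offset = abs(frame) - 1  # frame -2 skips one base of the reverse strand
    codons = (strand[i:i + 3] for i in range(offset, len(strand) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# Last 54 bases of the contig, copied from the final line of the FASTA record.
tail = "GATTTTGGAGCTAGAACCATGAGTGTGTAAAATGGGAGCACTGCAACTGTTCCC"
print(translate_frame(tail, -2))
```

Skipping one base on the reverse strand places the first codon at forward-strand positions 773 to 771, which is why the alignments begin at query position 773 rather than 774.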

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 623,553,821
Number of Sequences: 1393205
Number of extensions: 12595826
Number of successful extensions: 28207
Number of sequences better than 10.0: 21
Number of HSP's better than 10.0 without gapping: 27408
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 28199
length of database: 448,689,247
effective HSP length: 121
effective length of database: 280,111,442
effective search space used: 38095156112
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
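
Note on the statistics: the figures above can be cross-checked with the standard Karlin-Altschul relations, bit score = (Lambda * S - ln K) / ln 2 and E = search space * 2^(-bit score). The sketch below is a hedged recomputation, not the BLAST source. It assumes the gapped Lambda and K apply to every hit and that the printed parameters are exact, although they are rounded in the report, so the recomputed E-values should agree with the listed ones only approximately. The function names are hypothetical.

```python
import math

# Values copied from the report footer (gapped parameters).
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
SEARCH_SPACE = 38_095_156_112
DB_LETTERS = 448_689_247
DB_SEQUENCES = 1_393_205
EFFECTIVE_HSP_LENGTH = 121


def bit_score(raw_score):
    """Convert a raw alignment score to bits (hypothetical helper)."""
    return (LAMBDA_GAPPED * raw_score - math.log(K_GAPPED)) / math.log(2)


def expect(raw_score):
    """Expected number of chance hits with at least this score."""
    return SEARCH_SPACE * 2.0 ** (-bit_score(raw_score))


# Each database sequence loses one effective HSP length.
effective_db_length = DB_LETTERS - DB_SEQUENCES * EFFECTIVE_HSP_LENGTH
# Effective query length implied by the reported search space.
effective_query_length = SEARCH_SPACE // effective_db_length

print(effective_db_length, effective_query_length)
for raw in (540, 508, 259, 248):  # raw scores of the five hits (540 occurs twice)
    print(raw, int(bit_score(raw)), f"{expect(raw):.1e}")
```

The effective database length computed this way equals the 280,111,442 reported above, and dividing the search space by it gives an effective query length of 136 residues.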


EST assemble image


No.  Clone        Accession  Position (start-end)
1    MFB043a01_f  BP037105   1-526
2    MFB021d06_f  BP035495   238-774
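
Note on the assembly: the two positions above can be read as intervals on the contig. The sketch below assumes the coordinates are 1-based and inclusive, which the table does not state but which is consistent with the 774-letter query. It computes the span covered by both ESTs and the region they share. The helper names are hypothetical.

```python
# (clone, accession, start, end) copied from the table above.
# Assumption: coordinates are 1-based and inclusive.
ests = [("MFB043a01_f", "BP037105", 1, 526),
        ("MFB021d06_f", "BP035495", 238, 774)]


def span_length(start, end):
    """Length of an inclusive interval."""
    return end - start + 1


def overlap(a, b):
    """Return the (start, end) shared by two inclusive intervals, or None."""
    start, end = max(a[0], b[0]), min(a[1], b[1])
    return (start, end) if start <= end else None


contig_length = max(e[3] for e in ests) - min(e[2] for e in ests) + 1
shared = overlap(ests[0][2:], ests[1][2:])
print(contig_length, shared, span_length(*shared))
```

Under that assumption the two clones together span 774 bases and share positions 238 to 526, a 289-base overlap.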




Lotus japonicus
Kazusa DNA Research Institute