KMC005365A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC005365A_C01 KMC005365A_c01
aataataaattcaatgccattaccaatttgatatcatttgaaggattcatttggtgacaa
caaaacatgAAAAACTAATTTGATGTTGCATATATCTTTAGACCAAAATAGACACATACT
CCAAACAATGTAAAGGTTTTCTGAAGACAATCTTAAACATGCCATGTTTTCTACTAAGAG
GAAAGTTGGTCTATGTTCATACCAATCAACATTTCCTGACAAATTCATATAATTATTTAA
CAACGATGAAGCATGACGAGAACCAGTCTATTCTTCCATGTTTAAGATGTTTTAGACTTG
ACTGGGGCAGGAAGCAACTGATAGCAAGATATTAGAGATGAAACAAATCCAAATGCTCCA
GTCACACGAGGAGTGACTTTCTTGGGTGCCAATTGAAGCAGTCCAACTGCTACCACTATA
TCAATGCTTGCTTTGACAAGTGCTAGTGTCCTCTCATTTGACTTCTGTAGCTTGGCGCGG
TATTGCTCATTATCATATTTGTTAGTGTTTTTGAGTTCTTTCTCTAACTTCTTCATTGAT
GTAGAGAGTCTTCCGAGCTCCCCGAGCTCAACCAAGGTGGTGCACGCTGAGGAACCCAGC
CAACAGAAAAGAGAAATCCGGCCAAGTAGTTCGGTACGTTCTTTGTTCTCGATGATGCCG
GTCCTACCAAGCCACACAAATTGATCAAGAAACAAGAAAGTGGAGAGTAATGCATTCTTG
GACTTTCCCAACAGAATCAAGGGGAGGGGAGTACCCTGTGGTGTTGGACTTATCAGACCA
TGCAGGTCATTGACAAACTTAAATAGACGGAAAACTTTTCGTGCCAAGCTGGTTGATTTG
TCAACATTCTGGGCAGTACCAGGTTCACCATTGCTCAAAAATTTGGAACCATACTGTATT
GCTCTGCAAATCTTGTCCCTTGCTTCAGCCTTGTTCAAATATAAGACAGAAGaccagttc
tgccctagttgtatccacgcactcattttcggtgctaattagtaatcacgaacgaagtga
cgaaggagaatg


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC005365A_C01 KMC005365A_c01
         (1032 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAF75750.1|AF261140_1 unknown [Lycopersicon esculentum]            375  e-103
ref|NP_566055.1| expressed protein; protein id: At2g45740.1, sup...   372  e-102
ref|NP_563636.1| expressed protein; protein id: At1g01820.1, sup...   369  e-101
pir||T02473 hypothetical protein At2g45740 [imported] - Arabidop...   363  2e-99
gb|AAO22773.1| unknown protein [Arabidopsis thaliana] gi|2839405...   360  2e-98

>gb|AAF75750.1|AF261140_1 unknown [Lycopersicon esculentum]
          Length = 235

 Score =  375 bits (963), Expect = e-103
 Identities = 187/222 (84%), Positives = 206/222 (92%)
 Frame = -2

Query: 950 SVLYLNKAEARDKICRAIQYGSKFLSNGEPGTAQNVDKSTSLARKVFRLFKFVNDLHGLI 771
           +VLYLNKAEARDKICRAIQYG+KFLS+G+PGTAQNVDKSTSLARK+FRLFKF+NDLH LI
Sbjct: 14  AVLYLNKAEARDKICRAIQYGAKFLSDGQPGTAQNVDKSTSLARKLFRLFKFINDLHALI 73

Query: 770 SPTPQGTPLPLILLGKSKNALLSTFLFLDQFVWLGRTGIIENKERTELLGRISLFCWLGS 591
           SP   GTPLPLILLGKSKNALLST+LFLDQFVWLGR+GI +NKE+TEL+GRIS F W+GS
Sbjct: 74  SPNAPGTPLPLILLGKSKNALLSTYLFLDQFVWLGRSGIYKNKEQTELIGRISFFSWMGS 133

Query: 590 SACTTLVELGELGRLSTSMKKLEKELKNTNKYDNEQYRAKLQKSNERTLALVKASIDIVV 411
           S CT LVE+GELGRLS+SMKKLEKELKNT+KY NEQYR+KLQKSNER+LAL+KA  DIVV
Sbjct: 134 SICTALVEIGELGRLSSSMKKLEKELKNTDKYMNEQYRSKLQKSNERSLALIKAGTDIVV 193

Query: 410 AVGLLQLAPKKVTPRVTGAFGFVSSLISCYQLLPAPVKSKTS 285
           AVGLLQLAPKKVTPRVTGAFGFVSSLISCYQLLP+  K K S
Sbjct: 194 AVGLLQLAPKKVTPRVTGAFGFVSSLISCYQLLPSSPKDKAS 235

>ref|NP_566055.1| expressed protein; protein id: At2g45740.1, supported by cDNA:
           12250., supported by cDNA: gi_15450879 [Arabidopsis
           thaliana] gi|15450880|gb|AAK96711.1| Unknown protein
           [Arabidopsis thaliana] gi|20197204|gb|AAC28551.2|
           expressed protein [Arabidopsis thaliana]
           gi|21537163|gb|AAM61504.1| unknown [Arabidopsis
           thaliana]
          Length = 236

 Score =  372 bits (955), Expect = e-102
 Identities = 181/220 (82%), Positives = 204/220 (92%)
 Frame = -2

Query: 947 VLYLNKAEARDKICRAIQYGSKFLSNGEPGTAQNVDKSTSLARKVFRLFKFVNDLHGLIS 768
           V+YLNKAEARDK+CRAIQYGSKFLS G+PGTAQNVDKSTSLARKVFRLFKFVNDLHGLIS
Sbjct: 16  VMYLNKAEARDKLCRAIQYGSKFLSGGQPGTAQNVDKSTSLARKVFRLFKFVNDLHGLIS 75

Query: 767 PTPQGTPLPLILLGKSKNALLSTFLFLDQFVWLGRTGIIENKERTELLGRISLFCWLGSS 588
           P P+GTPLPL+LLGKSKNALLSTFLFLDQ VWLGR+GI +NKER ELLGRISLFCW+GSS
Sbjct: 76  PVPKGTPLPLVLLGKSKNALLSTFLFLDQIVWLGRSGIYKNKERAELLGRISLFCWMGSS 135

Query: 587 ACTTLVELGELGRLSTSMKKLEKELKNTNKYDNEQYRAKLQKSNERTLALVKASIDIVVA 408
            CTTLVE+GE+GRLS+SMKK+EK LKN NKY +E YRAKL+KSNER+LAL+K+++DIVVA
Sbjct: 136 VCTTLVEVGEMGRLSSSMKKIEKGLKNGNKYQDEDYRAKLKKSNERSLALIKSAMDIVVA 195

Query: 407 VGLLQLAPKKVTPRVTGAFGFVSSLISCYQLLPAPVKSKT 288
            GLLQLAP K+TPRVTGAFGF++S+ISCYQLLP   K KT
Sbjct: 196 AGLLQLAPTKITPRVTGAFGFITSIISCYQLLPTRPKIKT 235

>ref|NP_563636.1| expressed protein; protein id: At1g01820.1, supported by cDNA:
           28475., supported by cDNA: gi_12083289, supported by
           cDNA: gi_17381254, supported by cDNA: gi_20453366
           [Arabidopsis thaliana] gi|25511713|pir||A86150 T1N6.24
           protein - Arabidopsis thaliana
           gi|8671852|gb|AAF78415.1|AC009273_21 Contains similarity
           to an unknown protein F4I18.28 gi|7486466 from
           Arabidopsis thaliana BAC F4I18 gb|AC004665.  ESTs
           gb|F14309, gb|AI998750, gb|995247, gb|T14224 and
           gb|AI995247 come from this gene
           gi|12083290|gb|AAG48804.1|AF332441_1 unknown protein
           [Arabidopsis thaliana] gi|17381255|gb|AAL36046.1|
           At1g01820/T1N6_18 [Arabidopsis thaliana]
           gi|20453367|gb|AAM19922.1| At1g01820/T1N6_18
           [Arabidopsis thaliana] gi|21555588|gb|AAM63892.1|
           unknown [Arabidopsis thaliana]
          Length = 235

 Score =  369 bits (948), Expect = e-101
 Identities = 181/219 (82%), Positives = 204/219 (92%)
 Frame = -2

Query: 947 VLYLNKAEARDKICRAIQYGSKFLSNGEPGTAQNVDKSTSLARKVFRLFKFVNDLHGLIS 768
           V+YLNKAEARDKICRAIQYGSKFLS+G+PGTAQNVDK+TSLARKVFRLFKFVNDLH LIS
Sbjct: 15  VVYLNKAEARDKICRAIQYGSKFLSDGQPGTAQNVDKNTSLARKVFRLFKFVNDLHALIS 74

Query: 767 PTPQGTPLPLILLGKSKNALLSTFLFLDQFVWLGRTGIIENKERTELLGRISLFCWLGSS 588
           P P+GTPLPL+LLGKSKNALLSTFLFLDQ VWLGRTGI ++KER E+LGRISLFCW+GSS
Sbjct: 75  PVPKGTPLPLVLLGKSKNALLSTFLFLDQIVWLGRTGIYKDKERAEILGRISLFCWMGSS 134

Query: 587 ACTTLVELGELGRLSTSMKKLEKELKNTNKYDNEQYRAKLQKSNERTLALVKASIDIVVA 408
            CT+LVE+GELGRLS S+KKLEKE+ N +K+ NEQYRAK++KSNER+LAL+KA +D+VVA
Sbjct: 135 VCTSLVEVGELGRLSASIKKLEKEIGNKDKHQNEQYRAKVEKSNERSLALIKAGMDVVVA 194

Query: 407 VGLLQLAPKKVTPRVTGAFGFVSSLISCYQLLPAPVKSK 291
            GLLQLAPKKVTPRVTGAFGF SSLISCYQLLP+  KSK
Sbjct: 195 FGLLQLAPKKVTPRVTGAFGFASSLISCYQLLPSHPKSK 233

>pir||T02473 hypothetical protein At2g45740 [imported] - Arabidopsis thaliana
          Length = 239

 Score =  363 bits (932), Expect = 2e-99
 Identities = 175/211 (82%), Positives = 199/211 (93%)
 Frame = -2

Query: 947 VLYLNKAEARDKICRAIQYGSKFLSNGEPGTAQNVDKSTSLARKVFRLFKFVNDLHGLIS 768
           V+YLNKAEARDK+CRAIQYGSKFLS G+PGTAQNVDKSTSLARKVFRLFKFVNDLHGLIS
Sbjct: 16  VMYLNKAEARDKLCRAIQYGSKFLSGGQPGTAQNVDKSTSLARKVFRLFKFVNDLHGLIS 75

Query: 767 PTPQGTPLPLILLGKSKNALLSTFLFLDQFVWLGRTGIIENKERTELLGRISLFCWLGSS 588
           P P+GTPLPL+LLGKSKNALLSTFLFLDQ VWLGR+GI +NKER ELLGRISLFCW+GSS
Sbjct: 76  PVPKGTPLPLVLLGKSKNALLSTFLFLDQIVWLGRSGIYKNKERAELLGRISLFCWMGSS 135

Query: 587 ACTTLVELGELGRLSTSMKKLEKELKNTNKYDNEQYRAKLQKSNERTLALVKASIDIVVA 408
            CTTLVE+GE+GRLS+SMKK+EK LKN NKY +E YRAKL+KSNER+LAL+K+++DIVVA
Sbjct: 136 VCTTLVEVGEMGRLSSSMKKIEKGLKNGNKYQDEDYRAKLKKSNERSLALIKSAMDIVVA 195

Query: 407 VGLLQLAPKKVTPRVTGAFGFVSSLISCYQL 315
            GLLQLAP K+TPRVTGAFGF++S+ISCYQ+
Sbjct: 196 AGLLQLAPTKITPRVTGAFGFITSIISCYQV 226

>gb|AAO22773.1| unknown protein [Arabidopsis thaliana] gi|28394051|gb|AAO42433.1|
           unknown protein [Arabidopsis thaliana]
          Length = 231

 Score =  360 bits (925), Expect = 2e-98
 Identities = 180/220 (81%), Positives = 201/220 (90%)
 Frame = -2

Query: 947 VLYLNKAEARDKICRAIQYGSKFLSNGEPGTAQNVDKSTSLARKVFRLFKFVNDLHGLIS 768
           VLYLNKAEARDKICRAIQYGSKFLS G+PGTAQ VDK+TSLARKVFRLFKFVND HGLIS
Sbjct: 15  VLYLNKAEARDKICRAIQYGSKFLSGGQPGTAQTVDKNTSLARKVFRLFKFVNDFHGLIS 74

Query: 767 PTPQGTPLPLILLGKSKNALLSTFLFLDQFVWLGRTGIIENKERTELLGRISLFCWLGSS 588
           P P+GTPLPL+LLGKSKNALLSTFLFLDQ VWLGR+GI +NKERTELLGRISLFCWLGSS
Sbjct: 75  PVPKGTPLPLVLLGKSKNALLSTFLFLDQIVWLGRSGIYKNKERTELLGRISLFCWLGSS 134

Query: 587 ACTTLVELGELGRLSTSMKKLEKELKNTNKYDNEQYRAKLQKSNERTLALVKASIDIVVA 408
            CT+ VE+GELGRLS+SMKK+EKEL    K D+E YRAKLQKSN+RTLAL+K+S+DI+VA
Sbjct: 135 VCTSAVEIGELGRLSSSMKKMEKEL----KADDELYRAKLQKSNDRTLALIKSSMDIIVA 190

Query: 407 VGLLQLAPKKVTPRVTGAFGFVSSLISCYQLLPAPVKSKT 288
           +GLLQLAPK ++PRVTGAFGF +SLISCYQLLP+  K KT
Sbjct: 191 IGLLQLAPKTISPRVTGAFGFTTSLISCYQLLPSRPKLKT 230

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 877,474,586
Number of Sequences: 1393205
Number of extensions: 19595625
Number of successful extensions: 55025
Number of sequences better than 10.0: 32
Number of HSP's better than 10.0 without gapping: 52638
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 54998
length of database: 448,689,247
effective HSP length: 124
effective length of database: 275,931,827
effective search space used: 60429070113
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 MWM073c04_f AV765881 1 582
2 MPD070a04_f AV774603 69 538
3 SPDL025a09_f BP053526 75 585
4 SPD077h07_f BP050196 84 636
5 MFB085h05_f BP040241 118 640
6 SPD015h08_f BP045221 118 601
7 MFB021a10_f BP035470 131 622
8 SPD083c11_f BP050618 548 1040




Lotus japonicus
Kazusa DNA Research Institute