KMC019086A_c01

Fasta Sequence
>KMC019086A_C01 KMC019086A_c01
tagaatggtgtgtaaatctcttattttagaagaatgaaaacagagcaatgaaatAACGGC
ATAAAGTGGATTCAATAGTTAATATATATGAAAGACTTTCTCATTACATACATCTCCAGT
TTTCATCTTTTCATTTCCTGTATATTTACAGTTTCTTTACAGATGAAACAAATAACATAA
GAAACTAATAAAATATAGGAGAATCAGTGTTATACATGGATCAAATATAGTTTTAAGGCT
TCTGAGCTGTGAACATAATGAAAGACTGCTGAATTTTGCATGAGTAGTTGGTTAGACCAC
ATGAGGTGCAAAGGTCTTCAATTTCTTCCGCTGATAAATAGCCATAGCCTGGAAAAGTCC
TCTCTCTGAAGAGGCGTACAAGCCAGGGAGTTGACGAATTGTAACGCAGAAAAGTGGTTC
CAACAAATACTCCACCGCTTCTTAGTACCCGGGTGATTTCAGCAACAGCATTGGAGGGAG
ATGGCCAGCAATGTAAAGCTGCACCAGCATGGACTGCATCAACTGAACCTGATGAAAAGG
GAAGCCTAGAAACATCCGCCCTTACAAGTGCAAGATTAGTGGTTGAAAGTGTGTCATCTT
TCTTAATGAAATCATAACTCTGGCGAAGCATATTTTCAGAAA
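
Every BLASTX alignment reported for this contig is in Frame = -3, meaning the query is reverse-complemented and then translated starting at its third base. The following is a minimal sketch of that translation, assuming the standard genetic code. The function names reverse_complement and translate_frame are illustrative and are not part of any tool used to build this record. Only the last 42 bases of the sequence above are used, which is enough to reproduce the first residues of the aligned query.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
# Assumption: no alternative translation table applies to this transcript.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(seq):
    """Return the reverse complement of a DNA string (A, C, G, T only)."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate_frame(seq, frame):
    """Translate seq in a BLAST-style frame (+1..+3 or -1..-3).

    Negative frames are read on the reverse complement. Unknown codons
    become 'X'; a trailing partial codon is ignored.
    """
    seq = seq.upper()
    if frame < 0:
        seq = reverse_complement(seq)
    offset = abs(frame) - 1
    codons = (seq[i:i + 3] for i in range(offset, len(seq) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# Last line of the FASTA record above (contig positions 601-642).
tail = "TCTTAATGAAATCATAACTCTGGCGAAGCATATTTTCAGAAA"
print(translate_frame(tail, -3))  # prints SENMLRQSYDFIK
```

The printed peptide matches the start of the query line "Query: 640 SENMLRQSYDFIKKDD..." in the first alignment of the Nr search section.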


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC019086A_C01 KMC019086A_c01
         (642 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_565170.1| expressed protein; protein id: At1g78140.1, sup...   141  6e-33
pir||F96810 hypothetical protein T11I11.8 [imported] - Arabidops...   126  3e-28
ref|NP_181637.1| hypothetical protein; protein id: At2g41040.1 [...    97  2e-19
gb|ZP_00043721.1| hypothetical protein [Magnetococcus sp. MC-1]        49  6e-05
ref|NP_378029.1| 230aa long conserved hypothetical protein [Sulf...    49  7e-05

>ref|NP_565170.1| expressed protein; protein id: At1g78140.1, supported by cDNA:
           30065. [Arabidopsis thaliana] gi|21592590|gb|AAM64539.1|
           unknown [Arabidopsis thaliana]
           gi|28393453|gb|AAO42148.1| unknown protein [Arabidopsis
           thaliana] gi|28827348|gb|AAO50518.1| unknown protein
           [Arabidopsis thaliana]
          Length = 355

 Score =  141 bits (356), Expect = 6e-33
 Identities = 78/141 (55%), Positives = 100/141 (70%), Gaps = 6/141 (4%)
 Frame = -3

Query: 640 SENMLRQSYDFIKKDDTL-STTNLALVRADVSRLPFSSGSVDAVHAGAALHCWPSPSNAV 464
           SENMLRQ Y+ + K++   +   L LVRAD++RLPF SGSVDAVHAGAALHCWPSPS+AV
Sbjct: 215 SENMLRQCYELLNKEENFPNKEKLVLVRADIARLPFLSGSVDAVHAGAALHCWPSPSSAV 274

Query: 463 AEITRVLRSGGVFVGTTFLRYN---SSTPWLVRLFRE--RTFPGYGYLSAEEIEDLCTSC 299
           AEI+RVLR GGVFV TTF+ Y+   S  P+L  L +E  R    + +L+  E+ED+C +C
Sbjct: 275 AEISRVLRPGGVFVATTFI-YDGPFSFIPFLKNLRQEIMRYSGSHIFLNERELEDICKAC 333

Query: 298 GLTNYSCKIQQSFIMFTAQKP 236
           GL N++      FIM +A KP
Sbjct: 334 GLVNFTRVRNGPFIMLSATKP 354

>pir||F96810 hypothetical protein T11I11.8 [imported] - Arabidopsis thaliana
           gi|12324257|gb|AAG52104.1|AC012680_15 hypothetical
           protein; 38642-36701 [Arabidopsis thaliana]
          Length = 317

 Score =  126 bits (316), Expect = 3e-28
 Identities = 69/118 (58%), Positives = 86/118 (72%), Gaps = 5/118 (4%)
 Frame = -3

Query: 574 LALVRADVSRLPFSSGSVDAVHAGAALHCWPSPSNAVAEITRVLRSGGVFVGTTFLRYN- 398
           L LVRAD++RLPF SGSVDAVHAGAALHCWPSPS+AVAEI+RVLR GGVFV TTF+ Y+ 
Sbjct: 200 LVLVRADIARLPFLSGSVDAVHAGAALHCWPSPSSAVAEISRVLRPGGVFVATTFI-YDG 258

Query: 397 --SSTPWLVRLFRE--RTFPGYGYLSAEEIEDLCTSCGLTNYSCKIQQSFIMFTAQKP 236
             S  P+L  L +E  R    + +L+  E+ED+C +CGL N++      FIM +A KP
Sbjct: 259 PFSFIPFLKNLRQEIMRYSGSHIFLNERELEDICKACGLVNFTRVRNGPFIMLSATKP 316

>ref|NP_181637.1| hypothetical protein; protein id: At2g41040.1 [Arabidopsis
           thaliana] gi|7487623|pir||T02115 hypothetical protein
           At2g41040 [imported] - Arabidopsis thaliana
           gi|3402713|gb|AAD12007.1| hypothetical protein
           [Arabidopsis thaliana]
          Length = 262

 Score = 97.1 bits (240), Expect = 2e-19
 Identities = 47/61 (77%), Positives = 55/61 (90%), Gaps = 1/61 (1%)
 Frame = -3

Query: 640 SENMLRQSYDFIKKDDTL-STTNLALVRADVSRLPFSSGSVDAVHAGAALHCWPSPSNAV 464
           SENMLRQ  +FIK D+T  ++TN+A+VRADVSRLPF SGSVDAVHAGAALHCWPSP+NAV
Sbjct: 201 SENMLRQCKEFIKNDNTFDNSTNIAVVRADVSRLPFPSGSVDAVHAGAALHCWPSPTNAV 260

Query: 463 A 461
           +
Sbjct: 261 S 261

>gb|ZP_00043721.1| hypothetical protein [Magnetococcus sp. MC-1]
          Length = 301

 Score = 48.9 bits (115), Expect = 6e-05
 Identities = 33/104 (31%), Positives = 49/104 (46%), Gaps = 6/104 (5%)
 Frame = -3

Query: 583 TTNLALVRADVSRLPFSSGSVDAVHAGAALHCWPSPSNAVAEITRVLRSGGVFVGT---- 416
           TT+   + AD++ LP++  S D V +   LH  P PS  +AEI RVLR  G  + +    
Sbjct: 106 TTHAPYLCADLTELPYADSSFDGVISNLTLHWSPDPSRTLAEIRRVLRGNGFLLSSQPGA 165

Query: 415 -TFLRYNSSTPWLVRLFRERTFPGYGY-LSAEEIEDLCTSCGLT 290
             F    S+   L +    R FP     +  +++ DL  S G T
Sbjct: 166 DNFRELRSALAQLDQTHYGRIFPRLPRGVDIQQVGDLLASSGYT 209

>ref|NP_378029.1| 230aa long conserved hypothetical protein [Sulfolobus tokodaii]
           gi|15623149|dbj|BAB67138.1| 230aa long conserved
           hypothetical protein [Sulfolobus tokodaii]
          Length = 230

 Score = 48.5 bits (114), Expect = 7e-05
 Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 2/86 (2%)
 Frame = -3

Query: 619 SYDFIK--KDDTLSTTNLALVRADVSRLPFSSGSVDAVHAGAALHCWPSPSNAVAEITRV 446
           SY F+K  KD      N+  VR +  +LPF+  S+D + A   LH +PS   AV EI RV
Sbjct: 111 SYKFLKILKD---KRPNVVAVRGNALKLPFADESIDGISAMFVLHMFPSVLVAVREINRV 167

Query: 445 LRSGGVFVGTTFLRYNSSTPWLVRLF 368
           L+ G   V T   + N  + +L  ++
Sbjct: 168 LKHGKKCVATILTKNNMISQFLATIW 193

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 515,219,408
Number of Sequences: 1393205
Number of extensions: 10570072
Number of successful extensions: 25312
Number of sequences better than 10.0: 284
Number of HSP's better than 10.0 without gapping: 24501
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 25294
length of database: 448,689,247
effective HSP length: 118
effective length of database: 284,291,057
effective search space used: 27007650415
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
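
The bit scores and Expect values in the report can be related to the gapped Lambda and K values and the search-space figures listed above. The following is a minimal sketch, assuming the standard Karlin-Altschul relations S' = (Lambda * S - ln K) / ln 2 and E = N * 2^(-S'), where S is the raw score shown in parentheses in each alignment header and N is the effective search space. The function names are illustrative, and because Lambda and K are printed rounded, the recomputed values agree with the report only approximately.

```python
import math

# Values copied from the statistics block above.
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
DB_LETTERS = 448_689_247
DB_SEQUENCES = 1_393_205
EFFECTIVE_HSP_LENGTH = 118
SEARCH_SPACE = 27_007_650_415


def effective_db_length():
    """Database length after removing one effective HSP length per sequence."""
    return DB_LETTERS - DB_SEQUENCES * EFFECTIVE_HSP_LENGTH


def bit_score(raw_score):
    """Convert a raw alignment score to a normalized bit score."""
    return (LAMBDA_GAPPED * raw_score - math.log(K_GAPPED)) / math.log(2)


def expect(raw_score):
    """Expected number of chance hits scoring at least raw_score."""
    return SEARCH_SPACE * 2.0 ** (-bit_score(raw_score))


print(effective_db_length())  # prints 284291057
print(SEARCH_SPACE // effective_db_length())  # prints 95

# Raw scores of the five reported hits.
for raw in (356, 316, 240, 115, 114):
    print(raw, round(bit_score(raw), 1), f"{expect(raw):.0e}")
```

The first printed value reproduces the "effective length of database" line exactly, and the second shows that the search space corresponds to an effective query length of 95 residues.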


EST assemble image


no. clone accession start end
1 MFB022d01_f BP035574 1 542
2 MFB004e07_f BP034192 52 643
3 MFB069h03_f BP039046 55 432
4 MFB024f05_f BP035747 55 500
5 MFB078h11_f BP039739 56 517
6 MFB065h10_f BP038754 59 591
7 MFB093f07_f BP040796 60 536
8 MFB009d09_f BP034555 61 429
9 MFB094f09_f BP040868 70 612
10 MFB047b01_f BP037398 71 489
11 MFB083b01_f BP040047 73 618
12 MFB098a02_f BP041104 74 546
13 MFB099b01_f BP041177 76 481
14 MFB040b10_f BP036906 80 490
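
The two numeric columns in the table above give each EST's position on the contig. The following is a small sketch of how the overall span and the read depth at a given position could be tallied from them. The depth_at helper is illustrative only, and the coordinates are assumed to be 1-based and inclusive.

```python
# (start, end) pairs copied from the clone table above, in row order.
INTERVALS = [
    (1, 542), (52, 643), (55, 432), (55, 500), (56, 517), (59, 591),
    (60, 536), (61, 429), (70, 612), (71, 489), (73, 618), (74, 546),
    (76, 481), (80, 490),
]


def depth_at(pos, intervals=INTERVALS):
    """Count the ESTs whose interval covers contig position pos."""
    return sum(start <= pos <= end for start, end in intervals)


# Smallest start and largest end over all ESTs.
span = (min(s for s, _ in INTERVALS), max(e for _, e in INTERVALS))
print(span)           # prints (1, 643)
print(depth_at(1))    # prints 1
print(depth_at(100))  # prints 14
print(depth_at(600))  # prints 3
```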




Lotus japonicus
Kazusa DNA Research Institute