KMC003309A_c01

Fasta Sequence
>KMC003309A_C01 KMC003309A_c01
ATATCAACCACCAAACTTTAACAAAAAGAGAGCAGTACACAGATTTTACTAGCAAATCAC
AACAACAAGGGTGGCTTATTTAGTATTTAGCGCCCAACCACAAAGCACTGGGATGTTCAT
AGCATCCTACATAAAAACATACATAAAACAACACAACACACAAGATAATGGGACTATAAT
CCAAATTACACCATAATCATCAAAGAGTTCAGCAGTTTAGTAAGCAATTTAGTCTCCCCA
CCATTGATGGACATAGACACCATGGCTCCTAAGACCCTGGTAAAGATCAATGCAGCTTTC
CTTTTGGGTCATGGTTTTATCATAATTTCCTTTCTCAGGACACCTGAGAAACAACTCATC
AACAAAGCATATAGCTCCAGTCTCAAAGATATCCGAGAGGAACTTGAGTTCAACTTTCCC
TGCATTCATCTTCAGCACAACAAAATCTGCATACGGCACAGTTTCTTTGAACCAGGCAAC
AAAATCAAACTCATCCTCCCCGAGATAAGGATCCAAATCTCCATCTGTAGCATATGCCTC
CGAAACCTCACCAGCCAATCCAnGGTGATAGACAAAGGTCACACCGGGTCTCTTAACATA
GGACAAGA
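
Every BLASTX hit reported for this contig aligns in frame -3, that is, against a translation of the reverse-complement strand starting at query position 606. The following is a minimal Python sketch of that translation, not part of the original record. It assumes the standard genetic code; the function names are hypothetical, and ambiguous bases such as the n in the sequence are rendered as X. For brevity it is run on the last 17 bases of the contig only.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def reverse_complement(seq):
    """Return the reverse complement; unknown bases become N."""
    comp = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(comp.get(b, "N") for b in reversed(seq.upper()))


def translate_minus_frame(seq, frame):
    """Translate reverse frame -1, -2 or -3 (pass frame as 1, 2 or 3).

    Frame -1 starts at the last base of seq, frame -3 two bases earlier.
    Codons containing an ambiguous base are translated as X.
    """
    rc = reverse_complement(seq)
    start = frame - 1
    codons = (rc[i:i + 3] for i in range(start, len(rc) - 2, 3))
    return "".join(CODON_TABLE.get(c, "X") for c in codons)


# Last 17 bases of the contig (positions 592-608).
tail = "CTTAACATAGGACAAGA"
print(translate_minus_frame(tail, 3))  # LSYVK
```

The result, LSYVK, matches the first five query residues of the top alignment (Query: 606 LSYVK...). Passing the full 608-base sequence to the same function yields the complete frame -3 translation.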


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC003309A_C01 KMC003309A_c01
         (608 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_195939.1| putative protein; protein id: At5g03190.1 [Arab...   136  2e-31
ref|NP_190908.1| putative protein; protein id: At3g53400.1, supp...   134  7e-31
ref|NP_195791.1| putative protein; protein id: At5g01710.1, supp...    79  5e-14
gb|AAG50697.1|AC079604_4 hypothetical protein [Arabidopsis thali...    59  5e-08
ref|NP_176109.1| hypothetical protein; protein id: At1g58120.1 [...    59  5e-08

>ref|NP_195939.1| putative protein; protein id: At5g03190.1 [Arabidopsis thaliana]
           gi|11282325|pir||T48340 hypothetical protein F15A17.220
           - Arabidopsis thaliana gi|7413596|emb|CAB86086.1|
           putative protein [Arabidopsis thaliana]
           gi|9757770|dbj|BAB08379.1| gene_id:MOK16.10~unknown
           protein [Arabidopsis thaliana]
           gi|27311561|gb|AAO00746.1| putative protein [Arabidopsis
           thaliana]
          Length = 451

 Score =  136 bits (343), Expect = 2e-31
 Identities = 67/127 (52%), Positives = 84/127 (65%), Gaps = 2/127 (1%)
 Frame = -3

Query: 606 LSYVKRPGVTFVYHXGLA--GEVSEAYATDGDLDPYLGEDEFDFVAWFKETVPYADFVVL 433
           LSYVK+PGVTFVYH  LA      +       L+P+  ++ FDF+AWF+ET  YADFVVL
Sbjct: 334 LSYVKKPGVTFVYHPDLAENNSTGKKITPLEQLEPFPEDERFDFLAWFEETAKYADFVVL 393

Query: 432 KMNAGKVELKFLSDIFETGAICFVDELFLRCPEKGNYDKTMTQKESCIDLYQGLRSHGVY 253
           KMN  +VE+KFL+ + ETG IC+VDELFLRC            K  CI++ Q LR+ GV+
Sbjct: 394 KMNTNQVEMKFLTVLLETGVICYVDELFLRC---------SNHKSDCINMLQTLRARGVF 444

Query: 252 VHQWWGD 232
           VHQWW D
Sbjct: 445 VHQWWED 451

>ref|NP_190908.1| putative protein; protein id: At3g53400.1, supported by cDNA:
           gi_17528947 [Arabidopsis thaliana]
           gi|11282324|pir||T45880 hypothetical protein F4P12.100 -
           Arabidopsis thaliana gi|6729491|emb|CAB67647.1| putative
           protein [Arabidopsis thaliana]
          Length = 466

 Score =  134 bits (338), Expect = 7e-31
 Identities = 65/124 (52%), Positives = 80/124 (64%)
 Frame = -3

Query: 603 SYVKRPGVTFVYHXGLAGEVSEAYATDGDLDPYLGEDEFDFVAWFKETVPYADFVVLKMN 424
           SYVK PGVTF+YH GLA   +    T    +P++ +D FDF+AWFKET  +ADFVVLKMN
Sbjct: 353 SYVKSPGVTFIYHPGLAATKTTIANTGDHEEPFVEDDSFDFLAWFKETASFADFVVLKMN 412

Query: 423 AGKVELKFLSDIFETGAICFVDELFLRCPEKGNYDKTMTQKESCIDLYQGLRSHGVYVHQ 244
               ELKFLS++ +TGAIC VDELFL C          T    C  + + LR+ GV+VHQ
Sbjct: 413 TSDTELKFLSELIKTGAICSVDELFLHC----------TGYSDCTGIIKSLRNSGVFVHQ 462

Query: 243 WWGD 232
           WW D
Sbjct: 463 WWED 466

>ref|NP_195791.1| putative protein; protein id: At5g01710.1, supported by cDNA:
           gi_15810368 [Arabidopsis thaliana]
           gi|11357829|pir||T48192 hypothetical protein F7A7.230 -
           Arabidopsis thaliana gi|7327830|emb|CAB82287.1| putative
           protein [Arabidopsis thaliana]
           gi|15810369|gb|AAL07072.1| unknown protein [Arabidopsis
           thaliana] gi|23296924|gb|AAN13203.1| unknown protein
           [Arabidopsis thaliana] gi|24417484|gb|AAN60352.1|
           unknown [Arabidopsis thaliana]
          Length = 513

 Score = 79.0 bits (193), Expect = 5e-14
 Identities = 39/95 (41%), Positives = 51/95 (53%), Gaps = 11/95 (11%)
 Frame = -3

Query: 489 FDFVAWFKETVPYADFVVLKMNAGKVELKFLSDIFETGAICFVDELFLRC---------- 340
           FDF  W K++V   DFVV+KM+    E   +  + +TGAIC +DELFL C          
Sbjct: 422 FDFADWLKKSVRERDFVVMKMDVEGTEFDLIPRLIKTGAICLIDELFLECHYNRWQRCCP 481

Query: 339 -PEKGNYDKTMTQKESCIDLYQGLRSHGVYVHQWW 238
                 Y+KT  Q   C++L+  LR  GV VHQWW
Sbjct: 482 GQRSQKYNKTYNQ---CLELFNSLRQRGVLVHQWW 513

>gb|AAG50697.1|AC079604_4 hypothetical protein [Arabidopsis thaliana]
           gi|26451877|dbj|BAC43031.1| unknown protein [Arabidopsis
           thaliana]
          Length = 420

 Score = 58.9 bits (141), Expect = 5e-08
 Identities = 36/97 (37%), Positives = 47/97 (48%), Gaps = 7/97 (7%)
 Frame = -3

Query: 504 LGEDEFDFVAWFKETVPYADFVVLKMNAGKVELKFLSDIFETGAICFVDELFLRCPEKGN 325
           L  ++     W KE V   ++VV+K  A  VE     ++  + +I  VDELFL C  KG 
Sbjct: 329 LESEKMGMTEWLKENVKEEEYVVMKAEAEMVE-----EMMRSKSIKMVDELFLECKPKGL 383

Query: 324 --YDKTMTQKES-----CIDLYQGLRSHGVYVHQWWG 235
               + M  K       C+ LY  LR  GV VHQWWG
Sbjct: 384 GLRGRKMQSKSGRAYWECLALYGKLRDEGVAVHQWWG 420

>ref|NP_176109.1| hypothetical protein; protein id: At1g58120.1 [Arabidopsis
           thaliana] gi|25404192|pir||E96614 hypothetical protein
           T18I24.4 [imported] - Arabidopsis thaliana
           gi|12321385|gb|AAG50763.1|AC079131_8 hypothetical
           protein [Arabidopsis thaliana]
          Length = 420

 Score = 58.9 bits (141), Expect = 5e-08
 Identities = 36/97 (37%), Positives = 47/97 (48%), Gaps = 7/97 (7%)
 Frame = -3

Query: 504 LGEDEFDFVAWFKETVPYADFVVLKMNAGKVELKFLSDIFETGAICFVDELFLRCPEKGN 325
           L  ++     W KE V   ++VV+K  A  VE     ++  + +I  VDELFL C  KG 
Sbjct: 329 LESEKMGMTEWLKENVKEEEYVVMKAEAEMVE-----EMMRSKSIKMVDELFLECKPKGL 383

Query: 324 --YDKTMTQKES-----CIDLYQGLRSHGVYVHQWWG 235
               + M  K       C+ LY  LR  GV VHQWWG
Sbjct: 384 GLRGRKMQSKSGRAYWECLALYGKLRDEGVAVHQWWG 420

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 538,590,144
Number of Sequences: 1393205
Number of extensions: 11768760
Number of successful extensions: 30982
Number of sequences better than 10.0: 23
Number of HSP's better than 10.0 without gapping: 29091
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 30803
length of database: 448,689,247
effective HSP length: 117
effective length of database: 285,684,262
effective search space used: 24283162270
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
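
The statistics above are enough to recompute the reported scores. The sketch below, which is not part of the BLAST output, applies the usual Karlin-Altschul relations: bit score S' = (lambda * S - ln K) / ln 2 and E = search space * 2^(-S'). The function names are hypothetical. Taking the effective query length as 608 // 3 minus the effective HSP length is an assumption, chosen because it reproduces the reported search space exactly. Because lambda and K are printed rounded, the results only approximate the reported values.

```python
import math


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to a bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect(bits, search_space):
    """Expected number of chance hits with at least this bit score."""
    return search_space * 2.0 ** (-bits)


# Values copied from the report above.
query_len_aa = 608 // 3          # translated query length (assumption)
hsp_len = 117                    # effective HSP length
db_letters = 448_689_247
db_seqs = 1_393_205

eff_db = db_letters - db_seqs * hsp_len      # effective length of database
search_space = (query_len_aa - hsp_len) * eff_db

# Top hit: raw score 343, gapped lambda = 0.267, K = 0.0410.
bits = bit_score(343, 0.267, 0.0410)
print(eff_db, search_space)
print(round(bits, 1), f"{expect(bits, search_space):.1e}")
```

The effective database length and search space come out as 285,684,262 and 24,283,162,270, as reported. The top hit evaluates to roughly 136.7 bits with an E-value of the order of 2e-31, consistent with the reported 136 bits and 2e-31.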


EST assemble image


No.  Clone         Accession  Start  End
1    MFB081c05_f   BP039918       1  483
2    MFBL001b03_f  BP041307       1  506
3    MFBL040c04_f  BP043276      20  479
4    GNf033g10     BP069799      20  521
5    MR083h10_f    BP082428      20  344
6    SPDL042g06_f  BP054667      48  535
7    SPDL032h08_f  BP054021      48  446
8    SPDL026g12_f  BP053634      73  527
9    SPD028c04_f   BP046202      89  610
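
The clone list above can stand in for the missing assembly image. As a minimal sketch, not part of the original record, the Python below parses the nine rows and counts how many ESTs cover a given position. It assumes the two numbers in each row are 1-based, inclusive start and end coordinates on the assembly; the function names are hypothetical.

```python
# Clone name, accession, start, end (copied from the list above).
CLONES = """\
MFB081c05_f BP039918 1 483
MFBL001b03_f BP041307 1 506
MFBL040c04_f BP043276 20 479
GNf033g10 BP069799 20 521
MR083h10_f BP082428 20 344
SPDL042g06_f BP054667 48 535
SPDL032h08_f BP054021 48 446
SPDL026g12_f BP053634 73 527
SPD028c04_f BP046202 89 610
"""


def parse_clones(text):
    """Return a list of (clone, accession, start, end) tuples."""
    rows = []
    for line in text.strip().splitlines():
        clone, accession, start, end = line.split()
        rows.append((clone, accession, int(start), int(end)))
    return rows


def depth_at(rows, pos):
    """Number of ESTs whose interval contains position pos."""
    return sum(1 for _, _, start, end in rows if start <= pos <= end)


rows = parse_clones(CLONES)
print(depth_at(rows, 100), depth_at(rows, 600))  # 9 1
```

Under that assumption all nine ESTs overlap position 100, while only SPD028c04_f reaches position 600, so the 3' end of the assembly rests on a single read.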




Lotus japonicus
Kazusa DNA Research Institute