KMC002245A_c01

Fasta Sequence
>KMC002245A_C01 KMC002245A_c01
AATAGTAACTTAACTCTTTATAAACTATAAATGCATCCCAATTGTTGTCTAACTGGATGT
GAAAGTGATGATTCTTCCAGACTTGTTTTCCAACTCATCTGTACTAGTACAATACAATAG
GCAAATACACACTATTTTTTTCTTTCTCAATTCCTAGTTTCTGTGTGTACAAGTTTCTTT
TTCTTTTTATATATATCCTTTTCTCCTATTTCTGGTAGCTGGAATGCGCCTACCTGGGCA
AACAACTCATGAAGAACAAGAAAGCATACATACGCCTGGTCGATATGCCCACGCTGGGGG
CAAGCGAGCATACTTTTCCATTCCAGGTTGCTCATCAAAAGGATGCTCCATTAATTTGAG
CAACCTACGGACTTCTCCAAAATCACCAATTTCAGCAGCATCAATTGCAGTCTGGCAAAG
ATAGTTCCTCAGAATATATTTAGGATTTACCAAATCCATTGAGGTCTTCCTCTCTTCATC
AGAAATACCACCGGTGGACAGCTCATGTATGTAGGTTTTTAACCAACTGGTCCAAGCTTC
CTTACGCTCCTTGCCCATATCTAAGAGCACCGACTTTAGTGGGACTAGCAATTCATCATC
TGGAATGCTTGTGTCTGCTTTAATATTTGATAGTGTACGAAAGAAGTTTGTATAATCAAC
TTTGTCAACAGCCATGTTACTAAGAAGTTTACCAATCAGCTGCTTATTGTACTTAGGGAG
GCCAAGCTTTTTGGTCATTATAGCTTGATAATCAT


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC002245A_C01 KMC002245A_c01
         (755 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

pir||T49917 hypothetical protein T24H18.200 - Arabidopsis thalia...   264  9e-70
gb|AAN41282.1| unknown protein [Arabidopsis thaliana]                 264  9e-70
gb|AAK25868.1|AF360158_1 unknown protein [Arabidopsis thaliana]       264  9e-70
ref|NP_196808.1| putative protein; protein id: At5g13040.1 [Arab...   264  9e-70
ref|ZP_00083048.1| hypothetical protein [Pseudomonas fluorescens...   109  4e-23

>pir||T49917 hypothetical protein T24H18.200 - Arabidopsis thaliana
           gi|7630059|emb|CAB88267.1| putative protein [Arabidopsis
           thaliana]
          Length = 554

 Score =  264 bits (675), Expect = 9e-70
 Identities = 122/168 (72%), Positives = 147/168 (86%)
 Frame = -3

Query: 753 DYQAIMTKKLGLPKYNKQLIGKLLSNMAVDKVDYTNFFRTLSNIKADTSIPDDELLVPLK 574
           +YQAIM+KKLGL KYNK++I KLL+NM+VDKVDYTNFFR L+N+KA+ + P++ELL PLK
Sbjct: 387 EYQAIMSKKLGLTKYNKEVISKLLNNMSVDKVDYTNFFRLLANVKANPNTPENELLKPLK 446

Query: 573 SVLLDMGKERKEAWTSWLKTYIHELSTGGISDEERKTSMDLVNPKYILRNYLCQTAIDAA 394
           +VLLD+GKERKEAW  W+++YI E+    +SDEERK  MD VNPKYILRNYLCQ+AIDAA
Sbjct: 447 AVLLDIGKERKEAWIKWMRSYIQEVGGSEVSDEERKARMDSVNPKYILRNYLCQSAIDAA 506

Query: 393 EIGDFGEVRRLLKLMEHPFDEQPGMEKYARLPPAWAYRPGVCMLSCSS 250
           E GDF EV  L++LM+ P++EQPGMEKYARLPPAWAYRPGVCMLSCSS
Sbjct: 507 EQGDFSEVNNLIRLMKRPYEEQPGMEKYARLPPAWAYRPGVCMLSCSS 554

>gb|AAN41282.1| unknown protein [Arabidopsis thaliana]
          Length = 633

 Score =  264 bits (675), Expect = 9e-70
 Identities = 122/168 (72%), Positives = 147/168 (86%)
 Frame = -3

Query: 753 DYQAIMTKKLGLPKYNKQLIGKLLSNMAVDKVDYTNFFRTLSNIKADTSIPDDELLVPLK 574
           +YQAIM+KKLGL KYNK++I KLL+NM+VDKVDYTNFFR L+N+KA+ + P++ELL PLK
Sbjct: 466 EYQAIMSKKLGLTKYNKEVISKLLNNMSVDKVDYTNFFRLLANVKANPNTPENELLKPLK 525

Query: 573 SVLLDMGKERKEAWTSWLKTYIHELSTGGISDEERKTSMDLVNPKYILRNYLCQTAIDAA 394
           +VLLD+GKERKEAW  W+++YI E+    +SDEERK  MD VNPKYILRNYLCQ+AIDAA
Sbjct: 526 AVLLDIGKERKEAWIKWMRSYIQEVGGSEVSDEERKARMDSVNPKYILRNYLCQSAIDAA 585

Query: 393 EIGDFGEVRRLLKLMEHPFDEQPGMEKYARLPPAWAYRPGVCMLSCSS 250
           E GDF EV  L++LM+ P++EQPGMEKYARLPPAWAYRPGVCMLSCSS
Sbjct: 586 EQGDFSEVNNLIRLMKRPYEEQPGMEKYARLPPAWAYRPGVCMLSCSS 633

>gb|AAK25868.1|AF360158_1 unknown protein [Arabidopsis thaliana]
          Length = 585

 Score =  264 bits (675), Expect = 9e-70
 Identities = 122/168 (72%), Positives = 147/168 (86%)
 Frame = -3

Query: 753 DYQAIMTKKLGLPKYNKQLIGKLLSNMAVDKVDYTNFFRTLSNIKADTSIPDDELLVPLK 574
           +YQAIM+KKLGL KYNK++I KLL+NM+VDKVDYTNFFR L+N+KA+ + P++ELL PLK
Sbjct: 418 EYQAIMSKKLGLTKYNKEVISKLLNNMSVDKVDYTNFFRLLANVKANPNTPENELLKPLK 477

Query: 573 SVLLDMGKERKEAWTSWLKTYIHELSTGGISDEERKTSMDLVNPKYILRNYLCQTAIDAA 394
           +VLLD+GKERKEAW  W+++YI E+    +SDEERK  MD VNPKYILRNYLCQ+AIDAA
Sbjct: 478 AVLLDIGKERKEAWIKWMRSYIQEVGGSEVSDEERKARMDSVNPKYILRNYLCQSAIDAA 537

Query: 393 EIGDFGEVRRLLKLMEHPFDEQPGMEKYARLPPAWAYRPGVCMLSCSS 250
           E GDF EV  L++LM+ P++EQPGMEKYARLPPAWAYRPGVCMLSCSS
Sbjct: 538 EQGDFSEVNNLIRLMKRPYEEQPGMEKYARLPPAWAYRPGVCMLSCSS 585

>ref|NP_196808.1| putative protein; protein id: At5g13040.1 [Arabidopsis thaliana]
          Length = 305

 Score =  264 bits (675), Expect = 9e-70
 Identities = 122/168 (72%), Positives = 147/168 (86%)
 Frame = -3

Query: 753 DYQAIMTKKLGLPKYNKQLIGKLLSNMAVDKVDYTNFFRTLSNIKADTSIPDDELLVPLK 574
           +YQAIM+KKLGL KYNK++I KLL+NM+VDKVDYTNFFR L+N+KA+ + P++ELL PLK
Sbjct: 138 EYQAIMSKKLGLTKYNKEVISKLLNNMSVDKVDYTNFFRLLANVKANPNTPENELLKPLK 197

Query: 573 SVLLDMGKERKEAWTSWLKTYIHELSTGGISDEERKTSMDLVNPKYILRNYLCQTAIDAA 394
           +VLLD+GKERKEAW  W+++YI E+    +SDEERK  MD VNPKYILRNYLCQ+AIDAA
Sbjct: 198 AVLLDIGKERKEAWIKWMRSYIQEVGGSEVSDEERKARMDSVNPKYILRNYLCQSAIDAA 257

Query: 393 EIGDFGEVRRLLKLMEHPFDEQPGMEKYARLPPAWAYRPGVCMLSCSS 250
           E GDF EV  L++LM+ P++EQPGMEKYARLPPAWAYRPGVCMLSCSS
Sbjct: 258 EQGDFSEVNNLIRLMKRPYEEQPGMEKYARLPPAWAYRPGVCMLSCSS 305

>ref|ZP_00083048.1| hypothetical protein [Pseudomonas fluorescens PfO-1]
          Length = 464

 Score =  109 bits (273), Expect = 4e-23
 Identities = 67/171 (39%), Positives = 96/171 (55%), Gaps = 4/171 (2%)
 Frame = -3

Query: 750 YQAIMTKKLGLPKY---NKQLIGKLLSNMAVDKVDYTNFFRTLSNIKADTSIPDDELLVP 580
           Y  +M ++LG       ++ L+ +LL  M    VDYT FFR L    A+ ++        
Sbjct: 308 YLDLMRRRLGFTTAEDDDQMLLEQLLQLMQNSGVDYTLFFRRLGEESAEQAV------AR 361

Query: 579 LKSVLLDMGKERKEAWTSWLKTYIHELSTGGISDEE-RKTSMDLVNPKYILRNYLCQTAI 403
           L+   +D+     + + +W + Y+  ++  G +D+E R+  M  VNP YILRNYL Q AI
Sbjct: 362 LRDDFVDI-----KGFDAWGERYVARVARDGATDQEQRRARMHAVNPLYILRNYLAQKAI 416

Query: 402 DAAEIGDFGEVRRLLKLMEHPFDEQPGMEKYARLPPAWAYRPGVCMLSCSS 250
           DAAE GD+ EVRRL  ++ +PF+EQPGME YA  PP W        +SCSS
Sbjct: 417 DAAEQGDYSEVRRLHAVLSNPFEEQPGMESYAERPPEWGKH---LEISCSS 464

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 653,877,057
Number of Sequences: 1393205
Number of extensions: 14890699
Number of successful extensions: 37941
Number of sequences better than 10.0: 69
Number of HSP's better than 10.0 without gapping: 36293
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 37839
length of database: 448,689,247
effective HSP length: 120
effective length of database: 281,504,647
effective search space used: 36877108757
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


No.  Clone          Accession   Position (start, end)
1    SPDL033g08_f   BP054082      1   547
2    SPD061h03_f    BP048889     30   522
3    SPDL084e10_f   BP057265     30   439
4    MR086e05_f     BP082621     30   414
5    GENf048c09     BP060372     30   394
6    MF024b02_f     BP029516     30   458
7    MFBL050b05_f   BP043802     45   550
8    GNf030h04      BP069577     96   523
9    MWL075g07_f    AV769939    226   791




Lotus japonicus
Kazusa DNA Research Institute