KMC000162A_c01

FASTA Sequence
>KMC000162A_C01 KMC000162A_c01
gaattGAAATTAAAGGAGGATATATACTACTGGTATTAATTTCCAATTGCCAATTACAAA
ATCAATCACAGACATAATAATCTCACTACAGAGGAACAAGAACCACCATACCAACTCCAT
TATCCAAATTCAAAGAAAAAAGAAAACCCATATATACACAATTGCAAGAACACAAAATTG
GGCAATTGGAGGAGGAAATTGATTGATTGATCAATCAATTAATTTGATGCCAGTCTCCAT
GAATGAAGTCATGGTGAACCCTAAGGCGAGAGCAGTGGTACGGAAGCGTCTCATTGGTAG
AGCTGAAGTACACGTAGTATCCCTTCACGCCGGTTAAGCCGATGATCTTCCCGTACCACC
ACCCGTCGTTGTCGAACACGTTCACAGCCTGGTACATTTCGTAGTGGCCACCGCGTGTCC
GGACACACGGCGGCACAGGGCGGAGGTCCTTTGGAAGCACCGTCTCTTTCAGCGGAATCT
TGGTCTCTTCGTCCACGATCAGCGTC
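
The BLASTX alignments in this record are all reported with Frame = -2, and the longest ones start at query position 505. The Python sketch below shows one way such a reverse-strand frame could be translated. It assumes the standard genetic code; the function names are illustrative and are not part of any BLAST tool. Only the last 26 nucleotides of the contig are used as input.

```python
# Sketch: translate a nucleotide sequence in a BLASTX-style reading frame.
# Assumptions: standard genetic code (NCBI table 1); function names are hypothetical.

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Build the codon table from the conventional TCAG ordering.
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (case-insensitive)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def translate_frame(seq: str, frame: int) -> str:
    """Translate seq in frame 1, 2, 3 (forward) or -1, -2, -3 (reverse strand)."""
    if frame not in (1, 2, 3, -1, -2, -3):
        raise ValueError("frame must be one of 1, 2, 3, -1, -2, -3")
    strand = seq.upper() if frame > 0 else reverse_complement(seq)
    offset = abs(frame) - 1
    codons = (strand[i:i + 3] for i in range(offset, len(strand) - 2, 3))
    # Codons containing anything other than A, C, G, T are shown as X.
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# Last line of the contig sequence (positions 481-506).
contig_tail = "TGGTCTCTTCGTCCACGATCAGCGTC"
print(translate_frame(contig_tail, -2))  # prints TLIVDEET
```

The printed residues agree with the start of the query line in the top alignment (Query: 505 TLIVDEET...).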


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC000162A_C01 KMC000162A_c01
         (506 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_187304.1| hypothetical protein; protein id: At3g06520.1 [...    67  2e-10
pir||C86226 protein T31J12.4 [imported] - Arabidopsis thaliana g...    65  4e-10
ref|NP_172403.1| unknown protein; protein id: At1g09320.1 [Arabi...    65  4e-10
ref|NP_172123.1| hypothetical protein; protein id: At1g06340.1 [...    62  3e-09
ref|NP_182245.1| unknown protein; protein id: At2g47230.1 [Arabi...    48  6e-05

>ref|NP_187304.1| hypothetical protein; protein id: At3g06520.1 [Arabidopsis
           thaliana] gi|12322681|gb|AAG51333.1|AC020580_13
           hypothetical protein; 66083-64412 [Arabidopsis thaliana]
          Length = 466

 Score = 66.6 bits (161), Expect = 2e-10
 Identities = 34/96 (35%), Positives = 57/96 (58%), Gaps = 1/96 (1%)
 Frame = -2

Query: 505 TLIVDEETKIPLKETVLPKDLRPVPPCVRTRGGHYEMYQAVNVFDNDGWWYGKIIGLTGV 326
           TL  D+E ++ LKE     D+RP PP +  +G  YE+Y+ V+ + N+GWW G++  +   
Sbjct: 372 TLKTDDEREL-LKEEARGSDIRPPPPPLIPKGYRYELYELVDAWYNEGWWSGRVYKINNN 430

Query: 325 KGYY-VYFSSTNETLPYHCSRLRVHHDFIHGDWHQI 221
           K  Y VYF +T+E+L +  + LR    + +G W ++
Sbjct: 431 KTRYGVYFQTTDESLEFAYNDLRPCQVWRNGKWSRV 466

 Score = 53.9 bits (128), Expect = 1e-06
 Identities = 33/91 (36%), Positives = 45/91 (49%), Gaps = 3/91 (3%)
 Frame = -2

Query: 493 DEETKIPLKETVLPKDLRPVPPCVRTRGGHYEMYQAVNVFDNDGWW---YGKIIGLTGVK 323
           D+   IPL++ V  KD+RPVPP   +    YE    V+ + N  WW     K++G  G  
Sbjct: 212 DDGESIPLRDVVEAKDIRPVPPSELSPVVCYEPGVIVDAWFNKRWWTSRVSKVLG-GGSN 270

Query: 322 GYYVYFSSTNETLPYHCSRLRVHHDFIHGDW 230
            Y V+  ST E        LR H D+I+G W
Sbjct: 271 KYSVFIISTGEETTILNFNLRPHKDWINGQW 301

 Score = 38.5 bits (88), Expect = 0.045
 Identities = 28/95 (29%), Positives = 43/95 (44%), Gaps = 3/95 (3%)
 Frame = -2

Query: 505 TLIVDEETKIPLKETVLPKDLRPVPPCVRTRGGHYEMYQAVNVF-DNDGWWY-GKIIGLT 332
           +L V     + +KE V P  LRP PP  R     ++    V+VF D++G W  G +  + 
Sbjct: 47  SLTVGGSVSVRMKEYVTPTRLRPSPP--RELNRRFKADDEVDVFRDSEGCWVRGNVTTVL 104

Query: 331 GVKGYYVYFSSTNE-TLPYHCSRLRVHHDFIHGDW 230
               Y V F   N   +      LR+H +++ G W
Sbjct: 105 EDSRYIVEFKGENRPEIEVDQFNLRLHREWLDGGW 139

>pir||C86226 protein T31J12.4 [imported] - Arabidopsis thaliana
           gi|4337176|gb|AAD18097.1| T31J12.4 [Arabidopsis
           thaliana]
          Length = 514

 Score = 65.1 bits (157), Expect = 4e-10
 Identities = 33/87 (37%), Positives = 46/87 (51%)
 Frame = -2

Query: 490 EETKIPLKETVLPKDLRPVPPCVRTRGGHYEMYQAVNVFDNDGWWYGKIIGLTGVKGYYV 311
           E+ K PL+E V    +RP+P         +E +  VN   NDGWW G I  +     Y V
Sbjct: 421 EDGKEPLREEVNVSRIRPLP-LESVMVSPFERHDKVNALYNDGWWVGVIRKVLAKSSYLV 479

Query: 310 YFSSTNETLPYHCSRLRVHHDFIHGDW 230
            F +T E L +H S+LR+H ++I G W
Sbjct: 480 LFKNTQELLKFHHSQLRLHQEWIDGKW 506

 Score = 54.7 bits (130), Expect = 6e-07
 Identities = 30/94 (31%), Positives = 46/94 (48%), Gaps = 2/94 (2%)
 Frame = -2

Query: 505 TLIVDEETKIPLKETVLPKDLRPVPPCVRT--RGGHYEMYQAVNVFDNDGWWYGKIIGLT 332
           TL  D+E   PLKE V    LRP  P +    +     + + V+ F NDGWW G +  + 
Sbjct: 58  TLFFDKEGTKPLKEVVDMSQLRPPAPPMSEIEKKKKIVVGEEVDAFYNDGWWEGDVTEVL 117

Query: 331 GVKGYYVYFSSTNETLPYHCSRLRVHHDFIHGDW 230
               + V+F S+ E + +    LR H +++ G W
Sbjct: 118 DDGKFSVFFRSSKEQIRFRKDELRFHREWVDGAW 151

 Score = 50.8 bits (120), Expect = 9e-06
 Identities = 27/82 (32%), Positives = 39/82 (46%)
 Frame = -2

Query: 475 PLKETVLPKDLRPVPPCVRTRGGHYEMYQAVNVFDNDGWWYGKIIGLTGVKGYYVYFSST 296
           PLKE      +RP PP  R     + +   +N F NDGWW G +I         +YF  +
Sbjct: 267 PLKEETDFLHIRPPPP--RDEDIDFAVGDKINAFYNDGWWVGVVIDGMKHGTVGIYFRQS 324

Query: 295 NETLPYHCSRLRVHHDFIHGDW 230
            E + +    LR+H D++ G W
Sbjct: 325 QEKMRFGRQGLRLHKDWVDGTW 346

>ref|NP_172403.1| unknown protein; protein id: At1g09320.1 [Arabidopsis thaliana]
          Length = 491

 Score = 65.1 bits (157), Expect = 4e-10
 Identities = 33/87 (37%), Positives = 46/87 (51%)
 Frame = -2

Query: 490 EETKIPLKETVLPKDLRPVPPCVRTRGGHYEMYQAVNVFDNDGWWYGKIIGLTGVKGYYV 311
           E+ K PL+E V    +RP+P         +E +  VN   NDGWW G I  +     Y V
Sbjct: 398 EDGKEPLREEVNVSRIRPLP-LESVMVSPFERHDKVNALYNDGWWVGVIRKVLAKSSYLV 456

Query: 310 YFSSTNETLPYHCSRLRVHHDFIHGDW 230
            F +T E L +H S+LR+H ++I G W
Sbjct: 457 LFKNTQELLKFHHSQLRLHQEWIDGKW 483

 Score = 54.7 bits (130), Expect = 6e-07
 Identities = 30/94 (31%), Positives = 46/94 (48%), Gaps = 2/94 (2%)
 Frame = -2

Query: 505 TLIVDEETKIPLKETVLPKDLRPVPPCVRT--RGGHYEMYQAVNVFDNDGWWYGKIIGLT 332
           TL  D+E   PLKE V    LRP  P +    +     + + V+ F NDGWW G +  + 
Sbjct: 58  TLFFDKEGTKPLKEVVDMSQLRPPAPPMSEIEKKKKIVVGEEVDAFYNDGWWEGDVTEVL 117

Query: 331 GVKGYYVYFSSTNETLPYHCSRLRVHHDFIHGDW 230
               + V+F S+ E + +    LR H +++ G W
Sbjct: 118 DDGKFSVFFRSSKEQIRFRKDELRFHREWVDGAW 151

 Score = 50.8 bits (120), Expect = 9e-06
 Identities = 27/82 (32%), Positives = 39/82 (46%)
 Frame = -2

Query: 475 PLKETVLPKDLRPVPPCVRTRGGHYEMYQAVNVFDNDGWWYGKIIGLTGVKGYYVYFSST 296
           PLKE      +RP PP  R     + +   +N F NDGWW G +I         +YF  +
Sbjct: 244 PLKEETDFLHIRPPPP--RDEDIDFAVGDKINAFYNDGWWVGVVIDGMKHGTVGIYFRQS 301

Query: 295 NETLPYHCSRLRVHHDFIHGDW 230
            E + +    LR+H D++ G W
Sbjct: 302 QEKMRFGRQGLRLHKDWVDGTW 323

>ref|NP_172123.1| hypothetical protein; protein id: At1g06340.1 [Arabidopsis
           thaliana] gi|25406917|pir||B86199 hypothetical protein
           [imported] - Arabidopsis thaliana
           gi|8927671|gb|AAF82162.1|AC068143_4 Contains similarity
           to a hypothetical protein T31J12.4 gi|4337176 from
           Arabidopsis thaliana BAC T31J12 gb|AC006416
          Length = 134

 Score = 62.4 bits (150), Expect = 3e-09
 Identities = 37/96 (38%), Positives = 52/96 (53%), Gaps = 5/96 (5%)
 Frame = -2

Query: 502 LIVDEETKIPLKETVLPKDLRPVPPC---VRTRGGHYEMYQAVNVFDNDGWWYGKIIGLT 332
           L+ D +    L E +   +LRP+PP    V  R G       V+ FD DGWW G++   T
Sbjct: 44  LVSDTDQSKRLVEVISADELRPMPPKSLHVLIRCG-----DKVDAFDKDGWWVGEV---T 95

Query: 331 GVKG--YYVYFSSTNETLPYHCSRLRVHHDFIHGDW 230
            V+   Y VYFS+T+E L Y    LR HH++++G W
Sbjct: 96  AVRRNIYSVYFSTTDEELEYPLYSLRKHHEWVNGSW 131

>ref|NP_182245.1| unknown protein; protein id: At2g47230.1 [Arabidopsis thaliana]
           gi|25364476|pir||F84912 hypothetical protein At2g47230
           [imported] - Arabidopsis thaliana
           gi|2275201|gb|AAB63823.1| unknown protein [Arabidopsis
           thaliana]
          Length = 701

 Score = 48.1 bits (113), Expect = 6e-05
 Identities = 25/82 (30%), Positives = 42/82 (50%)
 Frame = -2

Query: 499 IVDEETKIPLKETVLPKDLRPVPPCVRTRGGHYEMYQAVNVFDNDGWWYGKIIGLTGVKG 320
           +++++   PL E + P+ +RPVPP     G   E    V+    DGWW G II       
Sbjct: 48  LLNDDALSPLIENIEPRFIRPVPPENEYNGIVLEEGTVVDADHKDGWWTGVIIKKLENGK 107

Query: 319 YYVYFSSTNETLPYHCSRLRVH 254
           ++VY+ S  + + +  ++LR H
Sbjct: 108 FWVYYDSPPDIIEFERNQLRPH 129

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 
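
The bit scores listed in the alignments can be related to the raw scores shown in parentheses through the gapped Lambda and K values above, using the usual Karlin-Altschul normalisation S' = (Lambda * S - ln K) / ln 2. The following Python sketch applies that formula to three raw scores from this report; the function and constant names are illustrative.

```python
# Sketch: convert BLAST raw scores to bit scores with the gapped
# Lambda and K printed in this report. Names are hypothetical.
import math

GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def bit_score(raw_score: float, lam: float = GAPPED_LAMBDA, k: float = GAPPED_K) -> float:
    """Normalised score in bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Raw scores taken from the three HSPs of the first hit (NP_187304.1).
for raw in (161, 128, 88):
    print(raw, round(bit_score(raw), 1))
# prints:
# 161 66.6
# 128 53.9
# 88 38.5
```

These values agree with the "Score = 66.6 bits (161)", "53.9 bits (128)" and "38.5 bits (88)" lines reported for that hit.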

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 482,553,037
Number of Sequences: 1393205
Number of extensions: 10974667
Number of successful extensions: 38195
Number of sequences better than 10.0: 41
Number of HSP's better than 10.0 without gapping: 36447
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 38138
length of database: 448,689,247
effective HSP length: 114
effective length of database: 289,863,877
effective search space used: 15652649358
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
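
The effective lengths in the statistics above are mutually consistent, which the Python sketch below checks with plain arithmetic. It assumes that the effective HSP length is subtracted once per database sequence and once from the translated query length (506 nucleotides, so 168 residues). That assumption reproduces the figures in this report but is not a general description of how BLAST derives them. The E-value function uses E = (search space) * K * exp(-Lambda * S) with the gapped parameters and only approximates the reported values. All names are illustrative.

```python
# Sketch: reproduce the effective lengths and search space printed in this
# report, and estimate an E-value from a raw score. Names are hypothetical.
import math

DB_LETTERS = 448_689_247
DB_SEQUENCES = 1_393_205
QUERY_NUCLEOTIDES = 506
EFFECTIVE_HSP_LENGTH = 114
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def effective_db_length() -> int:
    """Database letters minus one effective HSP length per sequence."""
    return DB_LETTERS - EFFECTIVE_HSP_LENGTH * DB_SEQUENCES


def effective_query_length() -> int:
    """Translated query length (nucleotides // 3) minus the effective HSP length."""
    return QUERY_NUCLEOTIDES // 3 - EFFECTIVE_HSP_LENGTH


def search_space() -> int:
    return effective_query_length() * effective_db_length()


def expect(raw_score: float) -> float:
    """Approximate E-value: search space * K * exp(-lambda * S)."""
    return search_space() * GAPPED_K * math.exp(-GAPPED_LAMBDA * raw_score)


print(effective_db_length())   # prints 289863877
print(search_space())          # prints 15652649358
print(f"{expect(157):.0e}")    # approximate E-value for raw score 157
```

For raw score 157 the estimate is close to the reported Expect = 4e-10. For several other HSPs in this report the same formula gives a value somewhat below the printed one, so it should be read as an approximation only.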


EST assemble image


No.  Clone        Accession  Start  End
1    MFB054a08_f  BP037891       1  489
2    GENLf006c08  BP062653       6  495
3    GENLf087h11  BP067119       8  507
4    GENLf086d10  BP067035      13  502




Lotus japonicus
Kazusa DNA Research Institute