KMC003865A_c01

Fasta Sequence
>KMC003865A_C01 KMC003865A_c01
ttgtgaataaaaaagacCGTAGATTCACCAAAATGATATCAAAGTCTTATTTTCTTCTTC
TTAAAAAAAAGGCATAATTTAACCGCTAAATATATTAAGCCATAACATTTGGTGAAAAGT
CCACTGCAAAACTGGTCTTATGCTAAAATCACCCAGCAGTCAAATTGAGATACATAGCTG
AAGCCTTCATCTAAGATAGGTCGATGAAACCCATAAAAAAACATAACTTAACACTGACGA
GATCACTTCAAATGCGCATGGTCCCTTAAGTACTCGAATAGGAGGAATCACTGACACCAT
CAAGAAGGTTCCCAACAGCCACGAAAAAATAAATGCTCCAACCCCATAAAGGAAAGCTCG
TATCTGGCTCTTCAGACGATCATGTATAAAATATATTGTAGCGATCAGAGATAGCGCTAC
TTGAAGTGTTGGCCCTTCTTCAGTCGGAAAGAGAACTGTCAGAACTCCAAGTAATAAGAA
TACTAGTGAAGTTTTTATAATGAATTTTGTGGATGGAGTCTGAAATCTTCCTCTGATAGC
CTGAAGAAATTTAGATTGATTGACT
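
As a quick consistency check, the record above can be parsed to confirm the query length quoted in the BLASTX report (565 letters). This is a minimal sketch and not part of the original record: the helper name `read_fasta` is hypothetical, and the sequence is copied verbatim from the FASTA record above.

```python
# Minimal single-record FASTA reader (hypothetical helper, for illustration only).
RECORD = """>KMC003865A_C01 KMC003865A_c01
ttgtgaataaaaaagacCGTAGATTCACCAAAATGATATCAAAGTCTTATTTTCTTCTTC
TTAAAAAAAAGGCATAATTTAACCGCTAAATATATTAAGCCATAACATTTGGTGAAAAGT
CCACTGCAAAACTGGTCTTATGCTAAAATCACCCAGCAGTCAAATTGAGATACATAGCTG
AAGCCTTCATCTAAGATAGGTCGATGAAACCCATAAAAAAACATAACTTAACACTGACGA
GATCACTTCAAATGCGCATGGTCCCTTAAGTACTCGAATAGGAGGAATCACTGACACCAT
CAAGAAGGTTCCCAACAGCCACGAAAAAATAAATGCTCCAACCCCATAAAGGAAAGCTCG
TATCTGGCTCTTCAGACGATCATGTATAAAATATATTGTAGCGATCAGAGATAGCGCTAC
TTGAAGTGTTGGCCCTTCTTCAGTCGGAAAGAGAACTGTCAGAACTCCAAGTAATAAGAA
TACTAGTGAAGTTTTTATAATGAATTTTGTGGATGGAGTCTGAAATCTTCCTCTGATAGC
CTGAAGAAATTTAGATTGATTGACT
"""


def read_fasta(text):
    """Return (header, sequence) for a FASTA string holding one record."""
    lines = [line.strip() for line in text.strip().splitlines()]
    header = lines[0][1:]          # drop the leading '>'
    sequence = "".join(lines[1:])  # join the wrapped sequence lines
    return header, sequence


header, seq = read_fasta(RECORD)
lowercase_count = sum(1 for base in seq if base.islower())
print(header.split()[0], len(seq), lowercase_count)
```

The length should agree with the `(565 letters)` line of the report. The record itself does not say what the leading lowercase bases denote.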


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC003865A_C01 KMC003865A_c01
         (565 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_566946.1| putative protein; protein id: At3g51140.1, supp...   168  5e-41
pir||T45745 hypothetical protein F24M12.180 - Arabidopsis thalia...   168  5e-41
ref|NP_197695.1| putative protein; protein id: At5g23040.1 [Arab...    75  6e-13
ref|NP_487299.1| hypothetical protein [Nostoc sp. PCC 7120] gi|2...    41  0.012
ref|NP_440462.1| unknown protein [Synechocystis sp. PCC 6803] gi...    37  0.22

>ref|NP_566946.1| putative protein; protein id: At3g51140.1, supported by cDNA:
           266414., supported by cDNA: gi_14334851, supported by
           cDNA: gi_17104692 [Arabidopsis thaliana]
           gi|14334852|gb|AAK59604.1| unknown protein [Arabidopsis
           thaliana] gi|17104693|gb|AAL34235.1| unknown protein
           [Arabidopsis thaliana] gi|21554869|gb|AAM63714.1|
           unknown [Arabidopsis thaliana]
          Length = 278

 Score =  168 bits (425), Expect = 5e-41
 Identities = 84/125 (67%), Positives = 100/125 (79%)
 Frame = -2

Query: 564 VNQSKFLQAIRGRFQTPSTKFIIKTSLVFLLLGVLTVLFPTEEGPTLQVALSLIATIYFI 385
           V QSK +  +  RFQTP    ++KT++ F +LGVLTVLFPTEEGPTLQV LSLIAT YFI
Sbjct: 156 VRQSKVVNFVFERFQTPPNAVLVKTAVTFAVLGVLTVLFPTEEGPTLQVLLSLIATFYFI 215

Query: 384 HDRLKSQIRAFLYGVGAFIFSWLLGTFLMVSVIPPIRVLKGPCAFEVISSVLSYVFLWVS 205
           H RL+ ++  FLYG GAFIFSWL+GTFLMVSVIPP   +KGP  FEV+SS+LSYV LWV+
Sbjct: 216 HQRLQKKLWTFLYGAGAFIFSWLVGTFLMVSVIPPF--IKGPRGFEVMSSLLSYVLLWVA 273

Query: 204 STYLR 190
           S+YLR
Sbjct: 274 SSYLR 278
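
`Frame = -2` means the query was translated from its reverse complement, starting one base in from the 3' end, which is why the query coordinates run downward from 564. The following is a minimal sketch of that step, assuming the standard genetic code and using only the last 25 bases of the contig (positions 541-565, copied from the FASTA record). The function names are illustrative and are not part of BLAST.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(dna):
    """Reverse-complement an unambiguous DNA string."""
    return dna.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate_frame(dna, frame):
    """Translate dna in frame 1, 2, 3 (forward) or -1, -2, -3 (reverse)."""
    strand = dna.upper() if frame > 0 else reverse_complement(dna)
    offset = abs(frame) - 1
    codons = (strand[i:i + 3] for i in range(offset, len(strand) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# Last 25 bases of the contig; negative frames are counted from the 3' end,
# so frame -2 of this tail starts at the same base (564) as in the full query.
TAIL = "CTGAAGAAATTTAGATTGATTGACT"
peptide = translate_frame(TAIL, -2)
print(peptide)  # VNQSKFLQ
```

The eight residues match the start of the first `Query:` line in the alignment above.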

>pir||T45745 hypothetical protein F24M12.180 - Arabidopsis thaliana
           gi|6562266|emb|CAB62636.1| putative protein [Arabidopsis
           thaliana]
          Length = 247

 Score =  168 bits (425), Expect = 5e-41
 Identities = 84/125 (67%), Positives = 100/125 (79%)
 Frame = -2

Query: 564 VNQSKFLQAIRGRFQTPSTKFIIKTSLVFLLLGVLTVLFPTEEGPTLQVALSLIATIYFI 385
           V QSK +  +  RFQTP    ++KT++ F +LGVLTVLFPTEEGPTLQV LSLIAT YFI
Sbjct: 125 VRQSKVVNFVFERFQTPPNAVLVKTAVTFAVLGVLTVLFPTEEGPTLQVLLSLIATFYFI 184

Query: 384 HDRLKSQIRAFLYGVGAFIFSWLLGTFLMVSVIPPIRVLKGPCAFEVISSVLSYVFLWVS 205
           H RL+ ++  FLYG GAFIFSWL+GTFLMVSVIPP   +KGP  FEV+SS+LSYV LWV+
Sbjct: 185 HQRLQKKLWTFLYGAGAFIFSWLVGTFLMVSVIPPF--IKGPRGFEVMSSLLSYVLLWVA 242

Query: 204 STYLR 190
           S+YLR
Sbjct: 243 SSYLR 247

>ref|NP_197695.1| putative protein; protein id: At5g23040.1 [Arabidopsis thaliana]
           gi|9759362|dbj|BAB09821.1|
           emb|CAB62636.1~gene_id:MYJ24.3~similar to unknown
           protein [Arabidopsis thaliana]
           gi|21928168|gb|AAM78111.1| AT5g23040/MYJ24_3
           [Arabidopsis thaliana] gi|23505829|gb|AAN28774.1|
           At5g23040/MYJ24_3 [Arabidopsis thaliana]
          Length = 258

 Score = 75.1 bits (183), Expect = 6e-13
 Identities = 39/120 (32%), Positives = 74/120 (61%)
 Frame = -2

Query: 549 FLQAIRGRFQTPSTKFIIKTSLVFLLLGVLTVLFPTEEGPTLQVALSLIATIYFIHDRLK 370
           +L+A+    + P    I +   +F  +G  +++   E GP  QVA+SL A +YF++++ K
Sbjct: 141 WLKALLDFVEMPPMDTIFRRLFLFAFMGGWSIMNSAEGGPAFQVAVSLAACVYFLNEKTK 200

Query: 369 SQIRAFLYGVGAFIFSWLLGTFLMVSVIPPIRVLKGPCAFEVISSVLSYVFLWVSSTYLR 190
           S  RA L G+GA +  W  G+ L++ +IP   +++     E+++S+++YVFL++S T+L+
Sbjct: 201 SLGRACLIGIGALVAGWFCGS-LIIPMIPTF-LIQPTWTLELLTSLVAYVFLFLSCTFLK 258

>ref|NP_487299.1| hypothetical protein [Nostoc sp. PCC 7120] gi|25381750|pir||AD2213
           hypothetical protein all3259 [imported] - Nostoc sp.
           (strain PCC 7120) gi|17132354|dbj|BAB74958.1|
           ORF_ID:all3259~hypothetical protein [Nostoc sp. PCC
           7120]
          Length = 208

 Score = 40.8 bits (94), Expect = 0.012
 Identities = 29/111 (26%), Positives = 55/111 (49%), Gaps = 2/111 (1%)
 Frame = -2

Query: 516 PSTKFIIKTSLVFLLLGVLTVLFPTEEGPTLQVALSL-IATIYFIHDRLKSQI-RAFLYG 343
           PS   ++   + FL L  ++V +P      LQ+AL + + T  F  +R + +  RA L+ 
Sbjct: 100 PSGTDVLLPGVWFLGLSAISVFYPAAGDQVLQLALVIGVGTSIFFLNRKEGRFGRAVLFT 159

Query: 342 VGAFIFSWLLGTFLMVSVIPPIRVLKGPCAFEVISSVLSYVFLWVSSTYLR 190
           +   I   + G  +   ++P I  +         S+VL+++ LW+ S++LR
Sbjct: 160 LVGLIIGLITGGLIAGLLLPQIPAISFTA--NQFSTVLTFILLWLISSFLR 208

>ref|NP_440462.1| unknown protein [Synechocystis sp. PCC 6803] gi|7470542|pir||S75228
           hypothetical protein slr1918 - Synechocystis sp. (strain
           PCC 6803) gi|1652218|dbj|BAA17142.1|
           ORF_ID:slr1918~unknown protein [Synechocystis sp. PCC
           6803]
          Length = 228

 Score = 36.6 bits (83), Expect = 0.22
 Identities = 20/80 (25%), Positives = 46/80 (57%)
 Frame = -2

Query: 429 TLQVALSLIATIYFIHDRLKSQIRAFLYGVGAFIFSWLLGTFLMVSVIPPIRVLKGPCAF 250
           +L + + +   I+F++ + +   +A L  +GA +   +LGT ++  ++    V  GP   
Sbjct: 151 SLLLVVGVFGNIFFLNRKQRKFGKALLLSLGALLVGIILGT-VLGQLLLGANVAIGP-NL 208

Query: 249 EVISSVLSYVFLWVSSTYLR 190
           E IS+ ++++ LW+ S+++R
Sbjct: 209 EQISATIAFIILWLISSFVR 228

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 429,798,130
Number of Sequences: 1393205
Number of extensions: 8454951
Number of successful extensions: 17811
Number of sequences better than 10.0: 18
Number of HSP's better than 10.0 without gapping: 17498
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 17800
length of database: 448,689,247
effective HSP length: 116
effective length of database: 287,077,467
effective search space used: 20382500157
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
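
The statistics above are enough to relate the raw scores shown in parentheses in each alignment to the reported bit scores. The sketch below assumes the usual Karlin-Altschul conversion S' = (lambda * S - ln K) / ln 2 with the gapped Lambda and K from this report, and E ~ N * 2^(-S') with N taken as the reported effective search space. The function names are illustrative. The E-values it produces agree with the report only approximately (same order of magnitude), so treat that part as a rough check rather than a reproduction of BLAST's own calculation.

```python
import math

# Values copied from the report's "Gapped" statistics block and footer.
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
SEARCH_SPACE = 20382500157  # "effective search space used"


def bit_score(raw_score):
    """Convert a raw alignment score to a bit score."""
    return (LAMBDA_GAPPED * raw_score - math.log(K_GAPPED)) / math.log(2)


def expect(bits, search_space=SEARCH_SPACE):
    """Approximate E-value for a bit score in the given search space."""
    return search_space * 2.0 ** (-bits)


# Raw scores of the reported hits: 425, 183, 94, 83.
for raw in (425, 183, 94, 83):
    bits = bit_score(raw)
    print(f"raw {raw:3d} -> {bits:6.1f} bits, E ~ {expect(bits):.1e}")
```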


EST assemble image


 No.  Clone         Accession  Start  End
   1  GNf080d10     BP073274       1  503
   2  MPD005c01_f   AV770315      17  541
   3  MPD005g09_f   AV770353      26  523
   4  MR040h04_f    BP079122      26  474
   5  MFB077b03_f   BP039600      26  443
   6  GNf098c02     BP074614      43  542
   7  MWM095a12_f   AV766282      57  303
   8  GNf084g05     BP073586      78  218
   9  SPD010d09_f   BP044792      78  581
  10  MPD019e09_f   AV771321      80  254
  11  MF072f12_f    BP032136      83  408
  12  MF013b08_f    BP028908      87  544
  13  MPD072h08_f   AV774757      88  258
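
The clone table can be turned into a per-position depth count. This sketch assumes the two numbers in each row are 1-based, inclusive start and end coordinates on the assembly, which the record does not state explicitly. The rows are copied from the table above, and the names `CLONES` and `depth_at` are illustrative.

```python
# (clone, accession, start, end) rows copied from the EST assembly table.
CLONES = [
    ("GNf080d10", "BP073274", 1, 503),
    ("MPD005c01_f", "AV770315", 17, 541),
    ("MPD005g09_f", "AV770353", 26, 523),
    ("MR040h04_f", "BP079122", 26, 474),
    ("MFB077b03_f", "BP039600", 26, 443),
    ("GNf098c02", "BP074614", 43, 542),
    ("MWM095a12_f", "AV766282", 57, 303),
    ("GNf084g05", "BP073586", 78, 218),
    ("SPD010d09_f", "BP044792", 78, 581),
    ("MPD019e09_f", "AV771321", 80, 254),
    ("MF072f12_f", "BP032136", 83, 408),
    ("MF013b08_f", "BP028908", 87, 544),
    ("MPD072h08_f", "AV774757", 88, 258),
]


def depth_at(position, clones=CLONES):
    """Number of clones whose [start, end] interval covers the position."""
    return sum(1 for _, _, start, end in clones if start <= position <= end)


span_start = min(start for _, _, start, _ in CLONES)
span_end = max(end for _, _, _, end in CLONES)
print(len(CLONES), span_start, span_end, depth_at(100))
```

Under that assumption the largest end coordinate (581) is greater than the 565-letter contig length, so the table coordinates may not map one-to-one onto the FASTA sequence positions.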




Lotus japonicus
Kazusa DNA Research Institute