KMC018849A_c01

Fasta Sequence
>KMC018849A_C01 KMC018849A_c01
CGTACAAAATCCAATACACATTTATTCACTAGAAGTGATTTTAGAGCATTTTACAGTGGA
ACCAAACATGCTATGAAGGTTCCATGGATACATTCAACCCCACCATGGGTGCACTGCAAC
TCCCTCATCCCTCAGCCTTCCATACAAAGCCAAGCACTCCCAATATGCTCTCCTACTCTT
CTTTCTCTTCCCCTTCTTCTGCCACCATTCATTCTTACACTCTAGGAAGAGTTCATCCAC
CAAATAGATTGTCCTTTCCTTTATCATCTCCTCCACAACCTCTGCTTCTGCCTTCATCAC
AACATACTCTTCCTCCTTCACATGCTTAGATAACCAAGCAGAAACATCATTATCATCTTC
TGGTACAAACAGAAGGCTGTGAATTTCAAATTTGGTATCCTTCTTCGGGTAATTTCGCTC
GAACCATTCAATCACACCCTTATTTTCCTTTGACAACAAGCCTGCACTAATGAAGAGTCT
TCGATTGTAACCTTCAAGAGAATCACCCAACAACTCAGGTAGATACTTGATCTTCTTCAA
AT
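
The BLASTX alignments in the Nr search section are all reported with "Frame = -3", meaning the query was reverse-complemented and read starting from the third base of that strand. The following is a minimal sketch of that translation, assuming the standard genetic code (NCBI table 1); the helper names `reverse_complement` and `translate_frame` are illustrative and are not part of BLAST.

```python
# Contig KMC018849A_c01, copied from the FASTA record above (542 nt).
SEQ = (
    "CGTACAAAATCCAATACACATTTATTCACTAGAAGTGATTTTAGAGCATTTTACAGTGGA"
    "ACCAAACATGCTATGAAGGTTCCATGGATACATTCAACCCCACCATGGGTGCACTGCAAC"
    "TCCCTCATCCCTCAGCCTTCCATACAAAGCCAAGCACTCCCAATATGCTCTCCTACTCTT"
    "CTTTCTCTTCCCCTTCTTCTGCCACCATTCATTCTTACACTCTAGGAAGAGTTCATCCAC"
    "CAAATAGATTGTCCTTTCCTTTATCATCTCCTCCACAACCTCTGCTTCTGCCTTCATCAC"
    "AACATACTCTTCCTCCTTCACATGCTTAGATAACCAAGCAGAAACATCATTATCATCTTC"
    "TGGTACAAACAGAAGGCTGTGAATTTCAAATTTGGTATCCTTCTTCGGGTAATTTCGCTC"
    "GAACCATTCAATCACACCCTTATTTTCCTTTGACAACAAGCCTGCACTAATGAAGAGTCT"
    "TCGATTGTAACCTTCAAGAGAATCACCCAACAACTCAGGTAGATACTTGATCTTCTTCAA"
    "AT"
)

# Standard genetic code (assumed), built in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(dna):
    """Return the reverse complement of an uppercase ACGT string."""
    return dna.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate_frame(dna, frame):
    """Translate in a BLAST-style frame: +1..+3 forward, -1..-3 reverse."""
    strand = dna if frame > 0 else reverse_complement(dna)
    start = abs(frame) - 1
    codons = (strand[i:i + 3] for i in range(start, len(strand) - 2, 3))
    # Codons with non-ACGT characters are shown as X.
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


protein = translate_frame(SEQ, -3)
print(protein)
```

Under these assumptions, the frame -3 translation should contain the C-terminal stretch VAVHPWWG that the alignments place at query positions 120 to 97, followed by a stop codon.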


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC018849A_C01 KMC018849A_c01
         (542 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAG50697.1|AC079604_4 hypothetical protein [Arabidopsis thali...   161  5e-39
ref|NP_176109.1| hypothetical protein; protein id: At1g58120.1 [...   161  5e-39
dbj|BAC10179.1| contains ESTs C72105(E1001),AU101116(E1001)~simi...   124  7e-28
ref|NP_195791.1| putative protein; protein id: At5g01710.1, supp...    75  4e-13
ref|NP_190908.1| putative protein; protein id: At3g53400.1, supp...    53  3e-06

>gb|AAG50697.1|AC079604_4 hypothetical protein [Arabidopsis thaliana]
           gi|26451877|dbj|BAC43031.1| unknown protein [Arabidopsis
           thaliana]
          Length = 420

 Score =  161 bits (407), Expect = 5e-39
 Identities = 81/157 (51%), Positives = 114/157 (72%), Gaps = 10/157 (6%)
 Frame = -3

Query: 537 KKIKYLPELLGDSL--EGYNRRLFISAGLLSKENKGVIEWFERNYPKKDTKFEIHSLLFV 364
           K+ +YLP+L+GD+L  E Y+RR+FI  G  + +    +EWF  NYP ++ KFE++ +  V
Sbjct: 266 KRTRYLPDLMGDNLDLESYSRRVFIDVG--NGKGSSGMEWFVENYPTRNQKFEMYKIETV 323

Query: 363 PEDDN------DVSAWLSKHVKEEEYVVMKAEAEVVEEMIKERTIYLVDELFLECKNEWW 202
            ++ +       ++ WL ++VKEEEYVVMKAEAE+VEEM++ ++I +VDELFLECK +  
Sbjct: 324 NDEMSLESEKMGMTEWLKENVKEEEYVVMKAEAEMVEEMMRSKSIKMVDELFLECKPKGL 383

Query: 201 QKKGKR--KKSRRAYWECLALYGRLRDEGVAVHPWWG 97
             +G++   KS RAYWECLALYG+LRDEGVAVH WWG
Sbjct: 384 GLRGRKMQSKSGRAYWECLALYGKLRDEGVAVHQWWG 420

>ref|NP_176109.1| hypothetical protein; protein id: At1g58120.1 [Arabidopsis
           thaliana] gi|25404192|pir||E96614 hypothetical protein
           T18I24.4 [imported] - Arabidopsis thaliana
           gi|12321385|gb|AAG50763.1|AC079131_8 hypothetical
           protein [Arabidopsis thaliana]
          Length = 420

 Score =  161 bits (407), Expect = 5e-39
 Identities = 81/157 (51%), Positives = 114/157 (72%), Gaps = 10/157 (6%)
 Frame = -3

Query: 537 KKIKYLPELLGDSL--EGYNRRLFISAGLLSKENKGVIEWFERNYPKKDTKFEIHSLLFV 364
           K+ +YLP+L+GD+L  E Y+RR+FI  G  + +    +EWF  NYP ++ KFE++ +  V
Sbjct: 266 KRTRYLPDLMGDNLDLESYSRRVFIDVG--NGKGSSGMEWFVENYPTRNQKFEMYKIETV 323

Query: 363 PEDDN------DVSAWLSKHVKEEEYVVMKAEAEVVEEMIKERTIYLVDELFLECKNEWW 202
            ++ +       ++ WL ++VKEEEYVVMKAEAE+VEEM++ ++I +VDELFLECK +  
Sbjct: 324 NDEMSLESEKMGMTEWLKENVKEEEYVVMKAEAEMVEEMMRSKSIKMVDELFLECKPKGL 383

Query: 201 QKKGKR--KKSRRAYWECLALYGRLRDEGVAVHPWWG 97
             +G++   KS RAYWECLALYG+LRDEGVAVH WWG
Sbjct: 384 GLRGRKMQSKSGRAYWECLALYGKLRDEGVAVHQWWG 420

>dbj|BAC10179.1| contains ESTs C72105(E1001),AU101116(E1001)~similarto Arabidopsis
           thaliana chromosome 1 T15M6.13~unknown protein [Oryza
           sativa (japonica cultivar-group)]
          Length = 414

 Score =  124 bits (311), Expect = 7e-28
 Identities = 68/151 (45%), Positives = 87/151 (57%), Gaps = 8/151 (5%)
 Frame = -3

Query: 528 KYLPELLGDSLEGYNRRLFISAGLLSKENKGVIEWFERNYPKKDTKFEIHSLLFVPEDDN 349
           +YLPEL GDSLEGY RR FI          G   WF+++YP+    F++  L      + 
Sbjct: 229 RYLPELTGDSLEGYRRRTFIDVA--PSRGGGAASWFKKHYPRGKRVFDMVRLDAADATEP 286

Query: 348 DVSA-------WLSKHVKEEEYVVMKAEAEVVEEMIKERT-IYLVDELFLECKNEWWQKK 193
             S+       WL  +V+EE+YVV+KA  E VEE+++ R  +  VDELFL+C        
Sbjct: 287 AASSSAAGIAEWLEGNVREEDYVVVKAGVEAVEEILRRRAAVRRVDELFLDC-----DAG 341

Query: 192 GKRKKSRRAYWECLALYGRLRDEGVAVHPWW 100
                +RR YWECLALYGRLRD GVAVH WW
Sbjct: 342 AGADAARRPYWECLALYGRLRDHGVAVHQWW 372

>ref|NP_195791.1| putative protein; protein id: At5g01710.1, supported by cDNA:
           gi_15810368 [Arabidopsis thaliana]
           gi|11357829|pir||T48192 hypothetical protein F7A7.230 -
           Arabidopsis thaliana gi|7327830|emb|CAB82287.1| putative
           protein [Arabidopsis thaliana]
           gi|15810369|gb|AAL07072.1| unknown protein [Arabidopsis
           thaliana] gi|23296924|gb|AAN13203.1| unknown protein
           [Arabidopsis thaliana] gi|24417484|gb|AAN60352.1|
           unknown [Arabidopsis thaliana]
          Length = 513

 Score = 75.5 bits (184), Expect = 4e-13
 Identities = 39/91 (42%), Positives = 54/91 (58%), Gaps = 8/91 (8%)
 Frame = -3

Query: 348 DVSAWLSKHVKEEEYVVMKAEAE-----VVEEMIKERTIYLVDELFLECKNEWWQK--KG 190
           D + WL K V+E ++VVMK + E     ++  +IK   I L+DELFLEC    WQ+   G
Sbjct: 423 DFADWLKKSVRERDFVVMKMDVEGTEFDLIPRLIKTGAICLIDELFLECHYNRWQRCCPG 482

Query: 189 KR-KKSRRAYWECLALYGRLRDEGVAVHPWW 100
           +R +K  + Y +CL L+  LR  GV VH WW
Sbjct: 483 QRSQKYNKTYNQCLELFNSLRQRGVLVHQWW 513

 Score = 32.7 bits (73), Expect = 2.9
 Identities = 14/56 (25%), Positives = 32/56 (57%)
 Frame = -3

Query: 540 LKKIKYLPELLGDSLEGYNRRLFISAGLLSKENKGVIEWFERNYPKKDTKFEIHSL 373
           +K IKY+P ++   +   +R +++  G  S     +  WF++ YPK++  F++ ++
Sbjct: 295 IKNIKYVPSMV--DIRFKSRYVYVDVGARSY-GSSIGSWFKKEYPKQNKTFDVFAI 347

>ref|NP_190908.1| putative protein; protein id: At3g53400.1, supported by cDNA:
           gi_17528947 [Arabidopsis thaliana]
           gi|11282324|pir||T45880 hypothetical protein F4P12.100 -
           Arabidopsis thaliana gi|6729491|emb|CAB67647.1| putative
           protein [Arabidopsis thaliana]
          Length = 466

 Score = 52.8 bits (125), Expect = 3e-06
 Identities = 32/99 (32%), Positives = 43/99 (43%), Gaps = 5/99 (5%)
 Frame = -3

Query: 381 HSLLFVPEDDNDVSAWLSKHVKEEEYVVMK-----AEAEVVEEMIKERTIYLVDELFLEC 217
           H   FV +D  D  AW  +     ++VV+K      E + + E+IK   I  VDELFL C
Sbjct: 381 HEEPFVEDDSFDFLAWFKETASFADFVVLKMNTSDTELKFLSELIKTGAICSVDELFLHC 440

Query: 216 KNEWWQKKGKRKKSRRAYWECLALYGRLRDEGVAVHPWW 100
                            Y +C  +   LR+ GV VH WW
Sbjct: 441 ---------------TGYSDCTGIIKSLRNSGVFVHQWW 464

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 503,389,333
Number of Sequences: 1393205
Number of extensions: 12049958
Number of successful extensions: 46500
Number of sequences better than 10.0: 109
Number of HSP's better than 10.0 without gapping: 41614
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 45564
length of database: 448,689,247
effective HSP length: 115
effective length of database: 288,470,672
effective search space used: 18750593680
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
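
The bit scores and E-values in the hit list can be approximately recovered from the raw scores (the numbers in parentheses, e.g. 407) and the gapped statistical parameters listed above. The sketch below assumes the usual Karlin-Altschul relations, bit score S' = (Lambda * S - ln K) / ln 2 and E = (effective search space) * 2^(-S'), with the gapped Lambda and K and the "effective search space used" value taken from this report. BLAST applies rounding and further internal adjustments, so the E-values computed this way are only expected to be close to the reported ones, not identical.

```python
import math

# Values copied from the statistics block of this report.
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
EFFECTIVE_SEARCH_SPACE = 18750593680


def bit_score(raw_score):
    """Convert a raw alignment score to a normalized bit score."""
    return (LAMBDA_GAPPED * raw_score - math.log(K_GAPPED)) / math.log(2)


def expect_value(raw_score):
    """Approximate E-value for a single HSP with the given raw score."""
    return EFFECTIVE_SEARCH_SPACE * 2.0 ** (-bit_score(raw_score))


# Raw scores of the best HSP for each reported hit.
for raw in (407, 311, 184, 125):
    print(f"raw={raw:4d}  bits={bit_score(raw):6.1f}  E~{expect_value(raw):.1e}")
```

For the top hit (raw score 407) this gives a bit score of about 161 and an E-value on the order of 1e-39, in line with the reported 161 bits and 5e-39.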


EST assemble image


No.  Clone        Accession  Start  End
1    MF029c09_f   BP029790   1      435
2    SPD079b03_f  BP050294   1      542




Lotus japonicus
Kazusa DNA Research Institute