KMC005465A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC005465A_C01 KMC005465A_c01
aaagtggaaacagttgaattttatattcaaacacaataaaaatatattacatgattactg
atcctatgctcccacaagtaTGGGAGAAGCTGGTTGATATTAGACAAAGGTGACAAACAA
TAGAGTCCTGTCTTTACAACTCAGAAAAATAAGACCTATTTCATCAAGGATGAACACCAT
CTAGAACCAGAAAGGTAACTTGCTAATTACTGCTTATAAAAAAAAACACTTGCTAATTAC
TATCTACACGTAAATGATGGGATACAAAACTATAATCCAAATATGGGGTCATTTGCACCA
AATATAGAAGAACCAAACTGTCTCGTATTCAAATCATTGCCATCCAAAGTGGACCAAAAC
GATCCAGCCACTGGGTTGGACTTGAAGTTTCATTCTCCTCCACCCACTCAGAAGTCATGT
CAATGTCATCAAACTGAAGCAGATCAACTTCTAAAGACTTTGCATTCATCTGCCTAGCTA
ACTTGAGATTGTAGTTTATATAAACAAGGTCGTTCAATGTGTTCTCTATCAATCTTGTTC
CGCTTCTCTGAGTGAATCTGCCTGAAGGTGCTCCACTGCCTCTGAAATGACAAGGTGCTG
CAAACTTGACTTAATATTCTAATGGCAACTCGTTGCAACCCCGGCGCAGAGTCGCCGTAT
TGTTCCCACCAAAGCCATGGCGCAACTGTGCTTCTTGCCTCCTTTGCTAAGCTACAACCA
AACATCCCATGTGCCTTTG


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC005465A_C01 KMC005465A_c01
         (739 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_178092.1| hypothetical protein; protein id: At1g79740.1 [...   130  2e-29
dbj|BAA94530.2| unnamed protein product [Oryza sativa (japonica ...   125  5e-28
ref|NP_680299.1| hypothetical protein; protein id: At5g33406.1 [...    70  7e-15
ref|NP_188861.1| hypothetical protein; protein id: At3g22220.1 [...    60  3e-13
gb|AAO18451.1| hypothetical protein [Oryza sativa (japonica cult...    64  5e-13

>ref|NP_178092.1| hypothetical protein; protein id: At1g79740.1 [Arabidopsis
           thaliana] gi|25406602|pir||D96828 hypothetical protein
           F19K16.28 [imported] - Arabidopsis thaliana
           gi|7715599|gb|AAF68117.1|AC010793_12 F20B17.17
           [Arabidopsis thaliana]
           gi|12324578|gb|AAG52239.1|AC011717_7 hypothetical
           protein; 97951-99813 [Arabidopsis thaliana]
          Length = 518

 Score =  130 bits (327), Expect = 2e-29
 Identities = 79/164 (48%), Positives = 97/164 (58%), Gaps = 9/164 (5%)
 Frame = -3

Query: 737 KAHGMFGCSLAKEARSTVAPWLWWEQYGDSAPGLQRVAIRILSQVCSTLSFQRQWSTFRQ 558
           +A GMFGC+LA EAR +V+P LWWEQ+GDSAP LQRVAIRILSQVCS  + +RQWSTF+Q
Sbjct: 366 RAKGMFGCNLAMEARDSVSPGLWWEQFGDSAPVLQRVAIRILSQVCSGYNLERQWSTFQQ 425

Query: 557 IHSEKRNKIDREHIERPCL----YKLQSQVS*ADECKVFRS*SASV**H*HDF*VGGGE* 390
           +H E+RNKIDRE + +        KL   ++   +                D  +     
Sbjct: 426 MHWERRNKIDREILNKLAYVNQNLKLGRMITLETDPIAL-----------EDIDMMSEWV 474

Query: 389 NFKSNPVAGS----FWSTLDGNDLNTRQFGSSIFGAND-PIFGL 273
               NP        F + LDG DLNTRQFG +IF AND  IFGL
Sbjct: 475 EEAENPSPAQWLDRFGTALDGGDLNTRQFGGAIFSANDHNIFGL 518

 Score = 68.2 bits (165), Expect = 1e-10
 Identities = 33/55 (60%), Positives = 38/55 (69%)
 Frame = -1

Query: 517 LNDLVYINYNLKLARQMNAKSLEVDLLQFDDIDMTSEWVEENETSSPTQWLDRFG 353
           LN L Y+N NLKL R +   +LE D +  +DIDM SEWVEE E  SP QWLDRFG
Sbjct: 439 LNKLAYVNQNLKLGRMI---TLETDPIALEDIDMMSEWVEEAENPSPAQWLDRFG 490

>dbj|BAA94530.2| unnamed protein product [Oryza sativa (japonica cultivar-group)]
          Length = 521

 Score =  125 bits (315), Expect = 5e-28
 Identities = 74/157 (47%), Positives = 93/157 (59%), Gaps = 2/157 (1%)
 Frame = -3

Query: 737 KAHGMFGCSLAKEARSTVAPWLWWEQYGDSAPGLQRVAIRILSQVCSTLSFQRQWSTFRQ 558
           KA GMFG ++AKEAR+  +P +WWEQYGDSAP LQ  A+RI+SQVCSTL+FQR WS   +
Sbjct: 368 KAQGMFGSNIAKEARNNTSPGMWWEQYGDSAPSLQHAAVRIVSQVCSTLTFQRDWSIIVR 427

Query: 557 IHSEKRNKIDREHIERPCLYKLQSQVS*ADECKVFRS*SASV**H*HDF*VGGGE*NFKS 378
            HSEKRNK+D+E +           +    + K+ +     +     D      E +   
Sbjct: 428 NHSEKRNKLDKEALADQAYVHYNFMLH--SDSKMKKGDGDPIALDAIDMTSPWVEDSDSP 485

Query: 377 NPV--AGSFWSTLDGNDLNTRQFGSSIFGANDPIFGL 273
           N       F S LDG DLNTRQFG SIFG ND +FGL
Sbjct: 486 NLAQWLDRFPSALDG-DLNTRQFGGSIFGTNDTLFGL 521

 Score = 52.0 bits (123), Expect = 9e-06
 Identities = 24/60 (40%), Positives = 35/60 (58%)
 Frame = -1

Query: 535 RLIENTLNDLVYINYNLKLARQMNAKSLEVDLLQFDDIDMTSEWVEENETSSPTQWLDRF 356
           +L +  L D  Y++YN  L      K  + D +  D IDMTS WVE++++ +  QWLDRF
Sbjct: 435 KLDKEALADQAYVHYNFMLHSDSKMKKGDGDPIALDAIDMTSPWVEDSDSPNLAQWLDRF 494

>ref|NP_680299.1| hypothetical protein; protein id: At5g33406.1 [Arabidopsis
           thaliana]
          Length = 485

 Score = 69.7 bits (169), Expect(2) = 7e-15
 Identities = 28/69 (40%), Positives = 43/69 (61%)
 Frame = -3

Query: 737 KAHGMFGCSLAKEARSTVAPWLWWEQYGDSAPGLQRVAIRILSQVCSTLSFQRQWSTFRQ 558
           KA G+FG  +A   R+ ++P  WW  YG S P LQ  AI++LS  CS    +R W  F+ 
Sbjct: 195 KATGLFGIPMAIRLRTKMSPAEWWSAYGSSTPNLQNFAIKVLSLTCSATGCERNWGVFQL 254

Query: 557 IHSEKRNKI 531
           +H+++RN++
Sbjct: 255 LHTKRRNRL 263

 Score = 32.7 bits (73), Expect(2) = 7e-15
 Identities = 18/57 (31%), Positives = 32/57 (55%), Gaps = 2/57 (3%)
 Frame = -1

Query: 535 RLIENTLNDLVYINYNLKLARQMNAKSLEVDLLQFDDIDMTSEWV--EENETSSPTQ 371
           RL +  LND++++ YN  L R+   ++   D +  ++ID  +EW+     E SS T+
Sbjct: 262 RLTQCRLNDMIFVKYNRALQRRYK-RNDTFDPILLNEIDQCNEWLTGRMEENSSDTE 317

>ref|NP_188861.1| hypothetical protein; protein id: At3g22220.1 [Arabidopsis
           thaliana]
          Length = 759

 Score = 60.1 bits (144), Expect(2) = 3e-13
 Identities = 33/74 (44%), Positives = 46/74 (61%), Gaps = 1/74 (1%)
 Frame = -3

Query: 734 AHGMFGCSLAKEARSTVAPWLWWEQYGDSAPGLQRVAIRILSQVC-STLSFQRQWSTFRQ 558
           A G+FG +LA  AR T+ P  WW  YG+S   L R AIRILSQ C S++   R  ++  Q
Sbjct: 584 AVGIFGRNLAIRARDTMLPAEWWSTYGESCLNLSRFAIRILSQTCSSSIGSVRNLTSISQ 643

Query: 557 IHSEKRNKIDREHI 516
           I+ E +N I+R+ +
Sbjct: 644 IY-ESKNSIERQRL 656

 Score = 37.0 bits (84), Expect(2) = 3e-13
 Identities = 15/43 (34%), Positives = 27/43 (61%)
 Frame = -1

Query: 517 LNDLVYINYNLKLARQMNAKSLEVDLLQFDDIDMTSEWVEENE 389
           LNDLV++ YN++L R  ++    VD L   ++++  +WV  N+
Sbjct: 656 LNDLVFVQYNMRLRRIESSGDDTVDPLSHSNMEVLEDWVSRNQ 698

>gb|AAO18451.1| hypothetical protein [Oryza sativa (japonica cultivar-group)]
          Length = 779

 Score = 63.5 bits (153), Expect(2) = 5e-13
 Identities = 30/74 (40%), Positives = 43/74 (57%)
 Frame = -3

Query: 737 KAHGMFGCSLAKEARSTVAPWLWWEQYGDSAPGLQRVAIRILSQVCSTLSFQRQWSTFRQ 558
           +A G F   +A  AR T+ P  WW  YG + P L R+A+RILSQ CS     R+  +F Q
Sbjct: 607 EAAGDFRRQMAIRARHTLPPAEWWYTYGGACPNLTRLAVRILSQTCSAKGCDRRHISFEQ 666

Query: 557 IHSEKRNKIDREHI 516
           IH ++ N  +R+ +
Sbjct: 667 IHDQRMNLFERQRM 680

 Score = 32.7 bits (73), Expect(2) = 5e-13
 Identities = 12/39 (30%), Positives = 23/39 (58%)
 Frame = -1

Query: 517 LNDLVYINYNLKLARQMNAKSLEVDLLQFDDIDMTSEWV 401
           ++ L ++ YNL+L  +   K+   D +  D+ID+  +WV
Sbjct: 680 MHHLTFVQYNLRLQHRQQHKTKAFDPVSVDNIDIVDDWV 718

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 633,450,412
Number of Sequences: 1393205
Number of extensions: 13418804
Number of successful extensions: 32709
Number of sequences better than 10.0: 39
Number of HSP's better than 10.0 without gapping: 31251
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 32693
length of database: 448,689,247
effective HSP length: 120
effective length of database: 281,504,647
effective search space used: 35188080875
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 MPDL068d11_f AV779971 1 516
2 MFBL048e11_f BP043719 82 535
3 MPDL007f04_f AV776873 112 740




Lotus japonicus
Kazusa DNA Research Institute