KMC020375A_c01

FASTA Sequence
>KMC020375A_C01 KMC020375A_c01
tcgtctcctctaccacgttctctttacgcaacgcgctcacaccttcttcactctcatcaa
catgtctttttctcttccttGAAATTTTTCCATTTTCCATACACATTTTAATCAACAGAA
ATCTCATTTTGAGTTGACAAATTAAGATGCAGTGACAAACACTTTGTGAGTGAGTGGGAG
ACACACACACACAGAAAGAGAGGTGTCTGTCAGTGACTCCCCTTTGTGCTTGATTCCTCA
TCTGGAATAGGGTAGAGATTTTGATAGTGATTTAATAGTTAATAAGAATGTTGGGTCGGT
CTGCATTTTCCAGAGCTGGAAGTTTCAGGCCAGAAAATTTAGGCCAGAATACCATGGCCA
TGATTGGGAATGTTTGCTTTTCTGTGTTTGTTGTTGGGGTTTTGGTTTTTACCATTATGG
CTGCTACTTATGAACCTGAGGATCCTTTGTTTAACCCTTCAACCAAGATATCTACATTCC
TCACTTCCAAATCCAATGCCACTTTCAAGTCTGATGACAGTGTTGTTAGGACTGGTGAGG
ATTTCATGGCTGCCAATGAGACTGCTTTTGGCACCATCATTAACATGGCTGATGTTGATA
ACTCGGCTAGCGCTTCAACCGCCGAGGCTGAGGCTGAGGGGTCTC


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC020375A_C01 KMC020375A_c01
         (645 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_193006.1| putative protein; protein id: At4g12700.1 [Arab...   166  3e-40
ref|NP_565310.1| expressed protein; protein id: At2g04280.1, sup...   154  1e-36
ref|NP_192621.1| putative protein; protein id: At4g08810.1, supp...    79  7e-14
dbj|BAC07112.1| contains ESTs C73312(E3729),AU094248(E3729)~simi...    65  8e-10
ref|NP_704954.1| hypothetical protein [Plasmodium falciparum 3D7...    35  1.1

>ref|NP_193006.1| putative protein; protein id: At4g12700.1 [Arabidopsis thaliana]
           gi|7487274|pir||T06628 hypothetical protein T20K18.50 -
           Arabidopsis thaliana gi|4586246|emb|CAB40987.1| putative
           protein [Arabidopsis thaliana]
           gi|7267971|emb|CAB78312.1| putative protein [Arabidopsis
           thaliana]
          Length = 561

 Score =  166 bits (419), Expect = 3e-40
 Identities = 80/110 (72%), Positives = 96/110 (86%)
 Frame = +3

Query: 288 MLGRSAFSRAGSFRPENLGQNTMAMIGNVCFSVFVVGVLVFTIMAATYEPEDPLFNPSTK 467
           M GRSAFSR G FRPENLGQN +++IG++ FSV V+GV+VFTI+AATYEPEDPLF+PS K
Sbjct: 1   MSGRSAFSRTGGFRPENLGQNAVSLIGSIGFSVLVIGVVVFTIIAATYEPEDPLFHPSDK 60

Query: 468 ISTFLTSKSNATFKSDDSVVRTGEDFMAANETAFGTIINMADVDNSASAS 617
           I+TFLTS SNAT KSDDS+V+TGEDFMAAN+TAFG  IN+ADV+ S + S
Sbjct: 61  ITTFLTSNSNATLKSDDSIVKTGEDFMAANQTAFGGFINIADVETSENDS 110

>ref|NP_565310.1| expressed protein; protein id: At2g04280.1, supported by cDNA:
           gi_15810336 [Arabidopsis thaliana]
           gi|25365943|pir||G84455 hypothetical protein At2g04280
           [imported] - Arabidopsis thaliana
           gi|4689474|gb|AAD27910.1| expressed protein [Arabidopsis
           thaliana] gi|15810337|gb|AAL07056.1| unknown protein
           [Arabidopsis thaliana]
          Length = 568

 Score =  154 bits (389), Expect = 1e-36
 Identities = 77/115 (66%), Positives = 92/115 (79%)
 Frame = +3

Query: 288 MLGRSAFSRAGSFRPENLGQNTMAMIGNVCFSVFVVGVLVFTIMAATYEPEDPLFNPSTK 467
           M GRSA  R G FR ENLGQN + +IGN+ FS+FV GVL+FTI+AATYEPEDPLF+PS K
Sbjct: 1   MFGRSAI-RGGGFRAENLGQNALTLIGNIGFSLFVFGVLIFTIIAATYEPEDPLFHPSDK 59

Query: 468 ISTFLTSKSNATFKSDDSVVRTGEDFMAANETAFGTIINMADVDNSASASTAEAE 632
           I+TFLTS SNAT +SDDSVV+TGEDFM AN+TAF   IN+ DV+ S + +T E E
Sbjct: 60  ITTFLTSTSNATLRSDDSVVKTGEDFMLANQTAFAEFININDVEASTNETTTEEE 114

>ref|NP_192621.1| putative protein; protein id: At4g08810.1, supported by cDNA:
           gi_15912228 [Arabidopsis thaliana]
           gi|25365941|pir||F85088 hypothetical protein AT4g08810
           [imported] - Arabidopsis thaliana
           gi|7267523|emb|CAB78006.1| putative protein [Arabidopsis
           thaliana] gi|7321070|emb|CAB82117.1| putative protein
           [Arabidopsis thaliana] gi|15912229|gb|AAL08248.1|
           AT4g08810/T32A17_120 [Arabidopsis thaliana]
           gi|27363304|gb|AAO11571.1| At4g08810/T32A17_120
           [Arabidopsis thaliana]
          Length = 552

 Score = 78.6 bits (192), Expect = 7e-14
 Identities = 35/74 (47%), Positives = 50/74 (67%)
 Frame = +3

Query: 333 ENLGQNTMAMIGNVCFSVFVVGVLVFTIMAATYEPEDPLFNPSTKISTFLTSKSNATFKS 512
           E + QN + +I NVCFSVFV  VL+FT++A TY+P DP    +  ++  LT   NATFK 
Sbjct: 9   EPIAQNLIKLISNVCFSVFVFTVLIFTVIAVTYQPPDPWLESAPALTKLLTETENATFKI 68

Query: 513 DDSVVRTGEDFMAA 554
           D S+++TGED  ++
Sbjct: 69  DGSILKTGEDLASS 82

>dbj|BAC07112.1| contains ESTs C73312(E3729),AU094248(E3729)~similar to Arabidopsis
           thaliana chromosome 4, At4g08810~unknown protein [Oryza
           sativa (japonica cultivar-group)]
           gi|22296387|dbj|BAC10156.1| P0519E12.32 [Oryza sativa
           (japonica cultivar-group)]
          Length = 585

 Score = 65.1 bits (157), Expect = 8e-10
 Identities = 39/99 (39%), Positives = 59/99 (59%), Gaps = 2/99 (2%)
 Frame = +3

Query: 339 LGQNTMAMIGNVCFSVFVVGVLVFTIMAATYEPEDPLFNPSTKISTFLTS-KSNATF-KS 512
           + Q+ +    NVCFS+FV+ VLV T++A TY+P DP    S  I+T L+    N+TF   
Sbjct: 41  VAQSIIKAASNVCFSLFVLAVLVVTVVAVTYQPPDPWLQSSAAITTSLSRVLPNSTFLLP 100

Query: 513 DDSVVRTGEDFMAANETAFGTIINMADVDNSASASTAEA 629
           DDS++ TGEDF +++ T   +     D D + + +TA A
Sbjct: 101 DDSLLPTGEDFNSSSSTP--SAPRRDDPDQATATATAAA 137

>ref|NP_704954.1| hypothetical protein [Plasmodium falciparum 3D7]
           gi|23615199|emb|CAD52189.1| hypothetical protein
           [Plasmodium falciparum 3D7]
          Length = 771

 Score = 34.7 bits (78), Expect = 1.1
 Identities = 14/32 (43%), Positives = 23/32 (71%)
 Frame = -2

Query: 452 KQRILRFISSSHNGKNQNPNNKHRKANIPNHG 357
           K++++ +I+S+HN K QN NN++    I NHG
Sbjct: 355 KEQLICYINSTHNNKKQNMNNQNNILIICNHG 386
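
The Frame = +3 alignments above start at query position 288. As a rough cross-check, the sketch below translates the first 30 nucleotides from that position (copied by hand from the FASTA record) and derives the forward frame from the start coordinate. The standard genetic code is assumed, and the helper names `translate` and `forward_frame` are illustrative only; they are not part of BLAST.

```python
# Standard genetic code, indexed by base order T, C, A, G
# (assumption: the standard nuclear code applies to this query).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(dna):
    """Translate a DNA string codon by codon, ignoring a trailing partial codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        a, b, c = (BASES.index(x) for x in dna[i:i + 3])
        protein.append(AMINO[16 * a + 4 * b + c])
    return "".join(protein)


def forward_frame(start):
    """Forward reading frame (1, 2 or 3) for a 1-based start position."""
    return (start - 1) % 3 + 1


# Query positions 288-317, copied from the FASTA record above.
segment = "ATGTTGGGTCGGTCTGCATTTTCCAGAGCT"
print(forward_frame(288), translate(segment))
```

The translated residues can be compared with the start of the first Query line (MLGRSAFSRA...), and the computed frame with the reported Frame = +3.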

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 560,565,832
Number of Sequences: 1393205
Number of extensions: 12523006
Number of successful extensions: 43997
Number of sequences better than 10.0: 22
Number of HSP's better than 10.0 without gapping: 37083
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 43109
length of database: 448,689,247
effective HSP length: 118
effective length of database: 284,291,057
effective search space used: 27291941472
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
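
The figures in this statistics block can be tied back to the reported scores with the usual Karlin-Altschul relations, bit score = (lambda * raw - ln K) / ln 2 and E = search space * 2^(-bit score). The sketch below is only an approximate check under that assumption: it uses the rounded gapped Lambda and K printed above, so results agree with the report to display precision at best. The function names are illustrative, not BLAST's own.

```python
import math

# Values copied from the report above.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
SEARCH_SPACE = 27291941472   # "effective search space used"
DB_LETTERS = 448689247       # "length of database"
DB_SEQUENCES = 1393205       # "Number of Sequences"
EFFECTIVE_HSP = 118          # "effective HSP length"


def bit_score(raw, lam, k):
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw - math.log(k)) / math.log(2)


def expect(bits, search_space):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


# Top hit: "Score = 166 bits (419), Expect = 3e-40".
bits = bit_score(419, GAPPED_LAMBDA, GAPPED_K)
e_value = expect(bits, SEARCH_SPACE)

# Effective database length: one effective HSP length removed per sequence.
effective_db = DB_LETTERS - DB_SEQUENCES * EFFECTIVE_HSP

print(round(bits), f"{e_value:.0e}", effective_db)
```

With these inputs the bit score rounds to 166 and the effective database length comes out at 284,291,057, both as reported. The same `bit_score` call can be repeated for the other hits by substituting their raw scores.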


EST assembly


no.  clone        accession  start  end
1    MFB093f05_f  BP040795   1      536
2    MF093b05_f   BP033149   100    645




Lotus japonicus
Kazusa DNA Research Institute