KMC005453A_c01

Fasta Sequence
>KMC005453A_C01 KMC005453A_c01
ctgcaatctagtggtctcacaagaggttctgttgtggattatccacagccgcttgctaca
ttgcatccttatcaatctccACCTGTGAGGAACTTTCTTGGACATAACACATCTTGGTTA
TCCCAAGCCTCCATTCGTGGACCTTGGATTAGCTCTCCAACTCCTGCGCCCAGCACCCAT
CTTTCTGCATCACCTGTCTTTGATACAATTAAGTTAGGTTCTGGCAAGGGATCTTCTCTG
CCTCCTTCCTCGAGCATAAAGAATGTTACTCCTGGTCCTCCAGCTTCCAGTGCAGGTTTG
CAAGGTATCTTTGTTGGGACTGCTTCTCTTTTGGACGTAATCAATGTGGCAGTATCACCT
GCGCAGCATTCTTCAGATCCCAAGCCCAAGAAAAGGAAAAAAGGTGTGGTATCTGAAGAT
CTTGGCCAGAAGGCTCTGCAGTCATTGACTCCAGCAGTCAGTAACCATACATCTACTTCT
TTTGCCGTTTTGACTCCTCTTGGCAATGTACCAGTTACTGCTGTTGAAAAATCAATTGTG
TCTGTCTCTCCTCTAGATAATCAACCTGAAAATGATGGAAATGTTGAGAAGAGGATTCTA
TCAGATGAGTCTCTTATGAAAGTGAAGGAGGCTAAGGTATATGCAGAAGAAGCTTCTGCT
CTTTCTGGTGCTGCTGTGAATCATAGCCTAGAGCTATGGAATCAGTTGGATAAGTATAAA
AATTCTAGATCGATGCCAGATGTTGAGGCCAAATTGGCTTCGGCAGCAGTTGCAGTTGCT
GCTGCTGCAGCTGATGCAAAGGCAGCGG
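
As a quick check on the Frame = +1 alignments in the Nr search, the start of the contig can be translated with the standard genetic code. The following is a minimal sketch, not part of the original record: the function name `translate` and the way the codon table is built are illustrative assumptions, and only the first 60 nucleotides of the sequence above are used.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna, frame=1):
    """Translate a DNA string in forward frame 1, 2 or 3 (hypothetical helper)."""
    seq = dna.upper()[frame - 1:]
    usable = len(seq) - len(seq) % 3  # drop a trailing partial codon
    # Codons with ambiguous bases are reported as X.
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 60 nucleotides of KMC005453A_c01, copied from the FASTA record.
first_60 = "ctgcaatctagtggtctcacaagaggttctgttgtggattatccacagccgcttgctaca"
print(translate(first_60))  # LQSSGLTRGSVVDYPQPLAT
```

The printed peptide agrees with the first 20 residues of the Query line in the top-scoring alignment.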


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC005453A_C01 KMC005453A_c01
         (808 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

pir||E71442 hypothetical protein - Arabidopsis thaliana gi|22450...   145  9e-34
ref|NP_193464.2| G2484-1 protein; protein id: At4g17330.1, suppo...   145  9e-34
emb|CAA10906.1| G2484-1 [Arabidopsis thaliana]                        126  3e-28
gb|AAL79800.1|AC079874_23 unknown protein [Oryza sativa]               67  4e-10
ref|NP_493947.1| Fes CIP4 homology domain (108.5 kD) [Caenorhabd...    45  0.002

>pir||E71442 hypothetical protein - Arabidopsis thaliana
            gi|2245092|emb|CAB10514.1| hypothetical protein
            [Arabidopsis thaliana] gi|7268485|emb|CAB78736.1|
            hypothetical protein [Arabidopsis thaliana]
          Length = 1732

 Score =  145 bits (365), Expect = 9e-34
 Identities = 100/293 (34%), Positives = 155/293 (52%), Gaps = 24/293 (8%)
 Frame = +1

Query: 1    LQSSGLTRGSVVDYPQPLATLHPYQSPPVRNFLGHNTSWLSQASIRGPWISSPTPAP--- 171
            LQSS + RGS   +   L+  H +Q+PP +N +GHNT W+S    R  W++S   +    
Sbjct: 1088 LQSSSVQRGSAATHQPLLSASHAHQTPPTQNIVGHNTPWMSPLPFRNAWLASQQTSGFDV 1147

Query: 172  STHLSASPVFDTIKLGSGKGSSLPPSSSIKNVTPGPPASSAGLQGIFVGTASLLDVINVA 351
             +     P+ D +KL   K SS+  S + K+V  G  ++ + +          L+  +  
Sbjct: 1148 GSRFPVYPITDPVKLTPMKESSMTLSGA-KHVQSGTSSNVSKV-------TPTLEPTSTV 1199

Query: 352  VSPAQHSSDPKPKKRKKGVVSEDLGQKALQSL----------TPAVSNHTSTSFAVLTPL 501
            V+PAQHS+  K +KRKK  VS + G   L SL           P      +  +   T  
Sbjct: 1200 VAPAQHSTRVKSRKRKKMPVSVESGPNILNSLKQTELAASPLVPFTPTPANLGYNAGTLP 1259

Query: 502  GNVPVTAVEKSIVSVSPLDN------QPENDGNV-----EKRILSDESLMKVKEAKVYAE 648
              V +TAV   +VS  P          P   GN+     ++ +LS++++ K+KEAK++AE
Sbjct: 1260 SVVSMTAVPMDLVSTFPGKKIKSSFPSPIFGGNLVREVKQRSVLSEDTIEKLKEAKMHAE 1319

Query: 649  EASALSGAAVNHSLELWNQLDKYKNSRSMPDVEAKLASAAVAVAAAAADAKAA 807
            +ASAL+ AAV+HS  +W Q+++  ++   P+ + +LASAAVA+AAAAA AKAA
Sbjct: 1320 DASALATAAVSHSEYVWKQIEQQSHAGLQPETQDRLASAAVAIAAAAAVAKAA 1372
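
The coordinates printed at the ends of each alignment row can be reproduced from the row itself: query positions are counted in nucleotides (three per aligned residue for a translated search), subject positions in amino acids, and gap characters consume no positions. The sketch below illustrates that bookkeeping for the first block of the alignment above; the helper name `block_end` is an assumption introduced here.

```python
def block_end(start, aligned_row, step):
    """Return the end coordinate of one alignment row.

    start       -- coordinate printed at the left of the row
    aligned_row -- the residues of the row, '-' marking gaps
    step        -- positions consumed per residue (3 for the translated
                   nucleotide query, 1 for the protein subject)
    """
    residues = sum(1 for ch in aligned_row if ch != "-")
    return start + step * residues - 1


# First block of the alignment to pir||E71442, copied from the report.
query_row = "LQSSGLTRGSVVDYPQPLATLHPYQSPPVRNFLGHNTSWLSQASIRGPWISSPTPAP---"
sbjct_row = "LQSSSVQRGSAATHQPLLSASHAHQTPPTQNIVGHNTPWMSPLPFRNAWLASQQTSGFDV"

print(block_end(1, query_row, 3))     # 171
print(block_end(1088, sbjct_row, 1))  # 1147
```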

>ref|NP_193464.2| G2484-1 protein; protein id: At4g17330.1, supported by cDNA:
           gi_20466527 [Arabidopsis thaliana]
           gi|20466528|gb|AAM20581.1| G2484-1 protein [Arabidopsis
           thaliana]
          Length = 1058

 Score =  145 bits (365), Expect = 9e-34
 Identities = 100/293 (34%), Positives = 155/293 (52%), Gaps = 24/293 (8%)
 Frame = +1

Query: 1   LQSSGLTRGSVVDYPQPLATLHPYQSPPVRNFLGHNTSWLSQASIRGPWISSPTPAP--- 171
           LQSS + RGS   +   L+  H +Q+PP +N +GHNT W+S    R  W++S   +    
Sbjct: 81  LQSSSVQRGSAATHQPLLSASHAHQTPPTQNIVGHNTPWMSPLPFRNAWLASQQTSGFDV 140

Query: 172 STHLSASPVFDTIKLGSGKGSSLPPSSSIKNVTPGPPASSAGLQGIFVGTASLLDVINVA 351
            +     P+ D +KL   K SS+  S + K+V  G  ++ + +          L+  +  
Sbjct: 141 GSRFPVYPITDPVKLTPMKESSMTLSGA-KHVQSGTSSNVSKV-------TPTLEPTSTV 192

Query: 352 VSPAQHSSDPKPKKRKKGVVSEDLGQKALQSL----------TPAVSNHTSTSFAVLTPL 501
           V+PAQHS+  K +KRKK  VS + G   L SL           P      +  +   T  
Sbjct: 193 VAPAQHSTRVKSRKRKKMPVSVESGPNILNSLKQTELAASPLVPFTPTPANLGYNAGTLP 252

Query: 502 GNVPVTAVEKSIVSVSPLDN------QPENDGNV-----EKRILSDESLMKVKEAKVYAE 648
             V +TAV   +VS  P          P   GN+     ++ +LS++++ K+KEAK++AE
Sbjct: 253 SVVSMTAVPMDLVSTFPGKKIKSSFPSPIFGGNLVREVKQRSVLSEDTIEKLKEAKMHAE 312

Query: 649 EASALSGAAVNHSLELWNQLDKYKNSRSMPDVEAKLASAAVAVAAAAADAKAA 807
           +ASAL+ AAV+HS  +W Q+++  ++   P+ + +LASAAVA+AAAAA AKAA
Sbjct: 313 DASALATAAVSHSEYVWKQIEQQSHAGLQPETQDRLASAAVAIAAAAAVAKAA 365

>emb|CAA10906.1| G2484-1 [Arabidopsis thaliana]
          Length = 954

 Score =  126 bits (317), Expect = 3e-28
 Identities = 89/269 (33%), Positives = 140/269 (51%), Gaps = 24/269 (8%)
 Frame = +1

Query: 73  QSPPVRNFLGHNTSWLSQASIRGPWISSPTPAP---STHLSASPVFDTIKLGSGKGSSLP 243
           Q+PP +N +GHNT W+S    R  W++S   +     +     P+ D +KL   K SS+ 
Sbjct: 1   QTPPTQNIVGHNTPWMSPLPFRNAWLASQQTSGFDVGSRFPVYPITDPVKLTPMKESSMT 60

Query: 244 PSSSIKNVTPGPPASSAGLQGIFVGTASLLDVINVAVSPAQHSSDPKPKKRKKGVVSEDL 423
            S + K+V  G  ++ + +          L+  +  V+PAQHS+  K +KRKK  VS + 
Sbjct: 61  LSGA-KHVQSGTSSNVSKV-------TPTLEPTSTVVAPAQHSTRVKSRKRKKMPVSVES 112

Query: 424 GQKALQSL----------TPAVSNHTSTSFAVLTPLGNVPVTAVEKSIVSVSPLDN---- 561
           G   L SL           P      +  +   T    V +TAV   +VS  P       
Sbjct: 113 GPNILNSLKQTELAASPLVPFTPTPANLGYNAGTLPSVVSMTAVPMDLVSTFPGKKIKSS 172

Query: 562 --QPENDGNV-----EKRILSDESLMKVKEAKVYAEEASALSGAAVNHSLELWNQLDKYK 720
              P   GN+     ++ +LS++++ K+KEAK++AE+ASAL+ AAV+HS  +W Q+++  
Sbjct: 173 FPSPIFGGNLVREVKQRSVLSEDTIEKLKEAKMHAEDASALATAAVSHSEYVWKQIEQQS 232

Query: 721 NSRSMPDVEAKLASAAVAVAAAAADAKAA 807
           ++   P+ + +L SAAVA+A AAA AKAA
Sbjct: 233 HAGLQPETQDRLGSAAVAIAGAAAVAKAA 261

>gb|AAL79800.1|AC079874_23 unknown protein [Oryza sativa]
          Length = 2036

 Score = 66.6 bits (161), Expect = 4e-10
 Identities = 76/286 (26%), Positives = 125/286 (43%), Gaps = 22/286 (7%)
 Frame = +1

Query: 16   LTRGSVVDYPQPLATLHPYQSPPVRNFLGHNTSWLSQA--SIRGPWISSP-------TPA 168
            L RG+ +D+ Q ++ + PY S   R       SW  Q+      PW+  P       +  
Sbjct: 1065 LPRGTHLDFGQAVSPVFPYSSQ-TRQPTSGVASWFPQSPGGRAAPWLVQPQNLIFDSSMK 1123

Query: 169  PSTHLSASPVFDTIKLGSGKGSSLPPSSSIKNVTPGPPASSAGLQGIFVGTASLLDVI-- 342
            P    SA+   +T K  S K  S+  + S     P    S          T S L VI  
Sbjct: 1124 PPVPASAN---ETAKGASSKNISISQAVSPVAFPPNQAPS----------TISPLAVIPE 1170

Query: 343  ---NVAVSPAQHSSDP-KPKKRKKGVVSED------LGQKALQSLTPAVSNHTSTSFAVL 492
                 +VS ++  + P K +KRKK   S +      L +  + S+TPA  +    + +  
Sbjct: 1171 EKQKASVSTSKRGATPQKSRKRKKAPASPEQPIIAPLLKTDIASVTPATQHTPGFTLSTH 1230

Query: 493  TPLGNVPVTAVEKSIVSVSPLDN-QPENDGNVEKRILSDESLMKVKEAKVYAEEASALSG 669
            +P  N+  + +  +   V+P+ N Q     + E+RI S++    ++++   A+ A   + 
Sbjct: 1231 SP-SNILASGLVSNTGLVTPVPNYQITGIKDAEQRIFSEQISGAIEQSMGQAKGAGVHAM 1289

Query: 670  AAVNHSLELWNQLDKYKNSRSMPDVEAKLASAAVAVAAAAADAKAA 807
             AV H+  +W+ L      +   +VE KL SAA A +AA + AKAA
Sbjct: 1290 DAVRHAEGIWSHLSTNSKGKLPAEVEEKLTSAAAAASAAVSVAKAA 1335

>ref|NP_493947.1| Fes CIP4 homology domain (108.5 kD) [Caenorhabditis elegans]
           gi|5701573|gb|AAD47126.1| Hypothetical protein F56D12.6a
           [Caenorhabditis elegans]
          Length = 968

 Score = 44.7 bits (104), Expect = 0.002
 Identities = 48/186 (25%), Positives = 81/186 (42%), Gaps = 10/186 (5%)
 Frame = +1

Query: 163 PAPSTHLSASPVFDTIKLGSGKG--SSLPPSSSIKNVTPGPPASSAGL-------QGIFV 315
           PA S+ ++ +PV D + + SG    SS   S  +++  P P  ++  L       +GI V
Sbjct: 281 PASSSSMNLNPVRDLVDIMSGNSMPSSCSSSGILQDQAPPPHPTTVDLLMMDPIGEGIPV 340

Query: 316 GTASLLDVINVAVSPAQHSSDPKPKKRKKGVVSEDLGQKALQSLTPAVSNHT-STSFAVL 492
             +S+    N +  P  ++S P+  K+    +SE  G K L    P     T STS    
Sbjct: 341 VDSSINS--NYSTPPIINNSIPESIKKSSEDLSEKKGGKKLSMFIPKRRTKTVSTSSIDE 398

Query: 493 TPLGNVPVTAVEKSIVSVSPLDNQPENDGNVEKRILSDESLMKVKEAKVYAEEASALSGA 672
           TP    P +A      +     ++ EN+ N+   +  D++      +K    +   L+G+
Sbjct: 399 TPTTAEPFSASGLFKFTREKRRSKKENEANLRASVCMDDTHSTASSSK---SDDKMLNGS 455

Query: 673 AVNHSL 690
           A  HSL
Sbjct: 456 APAHSL 461

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.307    0.124    0.343 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 723,139,091
Number of Sequences: 1393205
Number of extensions: 17178237
Number of successful extensions: 52537
Number of sequences better than 10.0: 137
Number of HSP's better than 10.0 without gapping: 47261
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 51870
length of database: 448,689,247
effective HSP length: 121
effective length of database: 280,111,442
effective search space used: 41176381974
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
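
The bit scores in this report can be recomputed from the raw scores with the gapped Lambda and K listed above, using the standard Karlin-Altschul relations S' = (lambda * S - ln K) / ln 2 and E = N * 2^(-S'), where N is the effective search space. The sketch below is a hedged illustration only: the constant and function names are assumptions, and the report does not state whether this BLAST build applies further corrections to E, so the computed expectations should only be expected to land near the reported ones. Subtracting one effective HSP length per database sequence is likewise an assumption, although it reproduces the reported effective database length.

```python
import math

# Values copied from the statistics block of this report.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
SEARCH_SPACE = 41176381974   # effective search space used
DB_LETTERS = 448689247       # length of database
DB_SEQUENCES = 1393205       # number of sequences
HSP_LENGTH = 121             # effective HSP length


def bit_score(raw_score):
    """Normalised score in bits from a raw alignment score."""
    return (GAPPED_LAMBDA * raw_score - math.log(GAPPED_K)) / math.log(2)


def expect(raw_score):
    """Uncorrected expectation value for a single HSP."""
    return SEARCH_SPACE * 2.0 ** (-bit_score(raw_score))


def effective_db_length():
    """Database length minus one effective HSP length per sequence."""
    return DB_LETTERS - HSP_LENGTH * DB_SEQUENCES


# Raw scores of the five reported hits (365 occurs twice).
for raw in (365, 317, 161, 104):
    print(raw, round(bit_score(raw), 1), f"{expect(raw):.1e}")

print(effective_db_length())  # 280111442
```

For the raw scores 161 and 104 the sketch gives 66.6 and 44.7 bits, as printed in the alignment headers; the uncorrected expectations fall within the same order of magnitude as the reported E values.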


EST assemble image


no.  clone         accession  position (start  end)
1    MPDL004d02_f  AV776714       1  613
2    MPDL073b11_f  AV780239     254  808
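
The two clone positions can be checked against the 808-letter contig length reported in the Nr search. The sketch below assumes the positions are 1-based and inclusive (an assumption; the record does not say so), and the helper names `overlap` and `covered_length` are introduced here for illustration.

```python
# Clone positions on the contig, copied from the table above.
clones = {
    "MPDL004d02_f": (1, 613),
    "MPDL073b11_f": (254, 808),
}


def overlap(a, b):
    """Number of positions shared by two inclusive intervals."""
    start = max(a[0], b[0])
    end = min(a[1], b[1])
    return max(0, end - start + 1)


def covered_length(intervals):
    """Number of distinct positions covered by a set of inclusive intervals."""
    total = 0
    current_end = 0
    for start, end in sorted(intervals):
        if end > current_end:
            # Count only the part that extends past what is already covered.
            total += end - max(start, current_end + 1) + 1
            current_end = end
    return total


print(overlap(clones["MPDL004d02_f"], clones["MPDL073b11_f"]))  # 360
print(covered_length(clones.values()))                          # 808
```

Under that assumption the two ESTs share 360 positions and together span all 808 positions of the contig.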




Lotus japonicus
Kazusa DNA Research Institute