KMC000091A_c01

Fasta Sequence
>KMC000091A_C01 KMC000091A_c01
gaGGAAAAGTACTTACAGAAAAGCTTTTATTCAAAGAACTGCAGAGATTACACACATAGA
GAGAGATAGGTAAGAGAGAGAGAGAGAGAGGTAGGTCAAAAACCCGATGCCTCTTACTCT
TACTAAGCTACCTATTTATACAAGTTCTGACCATGGTTACTTTCGAAAGTAACTCTGGTT
TACATAAAAGTAACTACAGTTAATGAAAAGTAATTGAAGCTTCCTCCTTACCATACTCTG
CCCTCCAGTATCCAAAGAACTGAAATAATGTGTCAATGCCTCAACTTCTGGTTGCATATC
TCCATGGTTCATGCCGCAAATATGGTTGCTGATAGTATCTGCAACAGGTTCAGGTACATG
CTCCCAACTCAGTCTCTTCCAGAATGGATAAATnGCCTCATATGAAATTCTTTCATAACC
TGCAAGTTCACAACCGCAAGGCAGTTCATGAGTCTCTCTCAACAAGCAATCGCATCGGCC
GTAGGACTTCATTCTTATATGTTCAGCATCAAGGAGCTGCAGGCATTTGTTTGACACAAA
TCCTCTAATATTTGTGTAAAATGGGCACATGAAAATGTGATCAATTCTGTGAATACTGCG
CTCAAACGATGCTAATATTTCAGTATGTCGATTACATGTCAAACTATGCGACGCATCCCA
AGAAGTGGCTAGGTCACCCTTGCAATCCCGCAACATCTTCTTCAAACTGGCATGTGCACC
CTCAGCCCTGTTACTTGTTGTTGTTCCAAAGTG


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC000091A_C01 KMC000091A_c01
         (753 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|EAA09761.1| ebiP481 [Anopheles gambiae str. PEST]                   34  2.6
ref|XP_221644.1| similar to chloride channel [Mus musculus] [Rat...    32  7.5
ref|NP_497809.1| Heavy chain, Unconventional Myosin IA HUM-5 (11...    32  7.5
emb|CAA93276.1| cysteine proteinase [Haemonchus contortus]             32  9.8
ref|NP_702477.1| hypothetical protein [Plasmodium falciparum 3D7...    32  9.8

>gb|EAA09761.1| ebiP481 [Anopheles gambiae str. PEST]
          Length = 975

 Score = 33.9 bits (76), Expect = 2.6
 Identities = 20/74 (27%), Positives = 35/74 (47%), Gaps = 12/74 (16%)
 Frame = -1

Query: 378 KRLSWEHVPEPVADTISNHICGMN------------HGDMQPEVEALTHYFSSLDTGGQS 235
           K +  EH P P+ DT+ N + G+N            H ++ P+   +T + S     G+ 
Sbjct: 472 KAIQREHYPLPIIDTLFNKLKGVNIFSKLDITSAYYHVELNPDSREITTFMS-----GKG 526

Query: 234 MVRRKLQLLFINCS 193
           ++R K  +  INC+
Sbjct: 527 LMRFKRLMFRINCA 540

>ref|XP_221644.1| similar to chloride channel [Mus musculus] [Rattus norvegicus]
          Length = 898

 Score = 32.3 bits (72), Expect = 7.5
 Identities = 14/56 (25%), Positives = 26/56 (46%)
 Frame = +3

Query: 357 HAPNSVSSRMDKXPHMKFFHNLQVHNRKAVHESLSTSNRIGRRTSFLYVQHQGAAG 524
           HA +  + RM    HM+    ++ H     H ++S   +  R+   ++ +H  AAG
Sbjct: 695 HAYSDAAKRMKPTVHMRTHRYMRTHRYMRTHSNISAVTKNTRKVVMIHSEHTHAAG 750

>ref|NP_497809.1| Heavy chain, Unconventional Myosin IA HUM-5 (116.6 kD) (hum-5)
           [Caenorhabditis elegans] gi|7511498|pir||T24349 myosin
           IA - Caenorhabditis elegans gi|414640|emb|CAA53244.1|
           myosin IA [Caenorhabditis elegans]
           gi|3879326|emb|CAA84673.1| C. elegans HUM-5 protein
           (corresponding sequence T02C12.1) [Caenorhabditis
           elegans]
          Length = 1017

 Score = 32.3 bits (72), Expect = 7.5
 Identities = 19/74 (25%), Positives = 34/74 (45%)
 Frame = -1

Query: 381 WKRLSWEHVPEPVADTISNHICGMNHGDMQPEVEALTHYFSSLDTGGQSMVRRKLQLLFI 202
           +  +++++  +PV   ISN++   +    Q E E   H F  L  GG   + R+  L   
Sbjct: 167 YMHINFDYDGDPVGGNISNYLLEKSRVVRQQEGERNFHVFYQLVNGGDDGLLRQFGLTKD 226

Query: 201 NCSYFYVNQSYFRK 160
              Y+++NQ    K
Sbjct: 227 AKQYYFLNQGQSHK 240

>emb|CAA93276.1| cysteine proteinase [Haemonchus contortus]
          Length = 350

 Score = 32.0 bits (71), Expect = 9.8
 Identities = 19/76 (25%), Positives = 37/76 (48%), Gaps = 7/76 (9%)
 Frame = -1

Query: 507 AEHIRMKSYGRCDCLLRETHELPC-------GCELAGYERISYEAIYPFWKRLSWEHVPE 349
           ++ I +++ G+   +L +T  L C       GCE  GY+ +++E +  F       +  +
Sbjct: 132 SDRICVQTKGKLQTILSDTDILSCCGRMCGDGCE-GGYDHLAWEWVQRFGVVTGGPYQQK 190

Query: 348 PVADTISNHICGMNHG 301
            V    + H CG++HG
Sbjct: 191 GVCRPYAFHPCGLHHG 206

>ref|NP_702477.1| hypothetical protein [Plasmodium falciparum 3D7]
           gi|23497661|gb|AAN37201.1|AE014825_60 hypothetical
           protein [Plasmodium falciparum 3D7]
          Length = 2753

 Score = 32.0 bits (71), Expect = 9.8
 Identities = 22/52 (42%), Positives = 30/52 (57%), Gaps = 2/52 (3%)
 Frame = +3

Query: 363 PNSVSSRMDKXPHMKFFHNLQVHNRKA-VHESLSTSNRIGRRTS-FLYVQHQ 512
           PNS SS   K  + K+ HNL+ +NRK+   E  ST N I   TS  LY+ ++
Sbjct: 671 PNS-SSNNHKKYNWKYLHNLEEYNRKSYCEEEKSTDNYIQHSTSDILYLDNK 721

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 618,452,067
Number of Sequences: 1393205
Number of extensions: 12952126
Number of successful extensions: 36732
Number of sequences better than 10.0: 11
Number of HSP's better than 10.0 without gapping: 35276
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 36712
length of database: 448,689,247
effective HSP length: 120
effective length of database: 281,504,647
effective search space used: 36595604110
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
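
The summary statistics above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original report: it assumes the standard Karlin-Altschul conversion from raw score to bit score, S' = (lambda * S - ln K) / ln 2, using the gapped Lambda and K printed above, and it assumes the effective database length is the total letter count minus one effective HSP length per sequence. The function names are hypothetical and chosen only for illustration.

```python
import math

# Values copied from the report above (gapped statistics, BLOSUM62, 11/1).
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
DB_LETTERS = 448_689_247
DB_SEQUENCES = 1_393_205
EFFECTIVE_HSP_LENGTH = 120
REPORTED_SEARCH_SPACE = 36_595_604_110


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def effective_db_length(letters=DB_LETTERS, sequences=DB_SEQUENCES,
                        hsp_length=EFFECTIVE_HSP_LENGTH):
    """Subtract one effective HSP length per database sequence (assumed rule)."""
    return letters - sequences * hsp_length


if __name__ == "__main__":
    # Raw scores 76, 72 and 71 are the values in parentheses on the Score lines.
    for raw in (76, 72, 71):
        print(raw, round(bit_score(raw), 1))

    db_eff = effective_db_length()
    print("effective database length:", db_eff)

    # Effective query length implied by the reported search space.
    print("implied effective query length:", REPORTED_SEARCH_SPACE / db_eff)
```

With the rounded parameters shown in the report, the three raw scores come out at 33.9, 32.3 and 32.0 bits, and the effective database length at 281,504,647, in agreement with the printed values. The E-values themselves are not recomputed here, because the displayed Lambda and K are rounded and small differences are expected.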


EST assemble image


No.  clone        accession  start  end
1    GENLf050g03  BP065034   1      516
2    GENLf034b10  BP064113   3      482
3    GENLf003h04  BP062538   197    753




Lotus japonicus
Kazusa DNA Research Institute