KMC004787A_c01

Fasta Sequence
>KMC004787A_C01 KMC004787A_c01
aatccaaaaagtcgtctttataagtggtagtgtggctaaagtttcaatgagttaAAAAGA
TAGTTATTTAACACATCAAGATCGTTGAGAAAACATGAATAGCCACTGATATCAACTAGC
TGAAATTTAAATGGGAGAGCATAAACTTTCACTAAACAGCAATACAAAACAGAGCCATAA
AGCCCGGTCTCCAAAACTATAAGCATAACAAGTTCTCTCTCCAACATCACCAGGACGGCG
TCAAACTGGATAAGTAGGCTTGGGTAAATGCCATGGAATAATCAGACACCGTTAATGCAA
AAGCAAAAGAACAAAAAAGAGCCAAGTGTCCCAATTAGCAACCCATCCACCTCCCAATGC
AGTAATTAGTACGCCGCCGGCTACCGCGTCTCAGTAAGCAACCTCCGGGATCACCTCCTA
ATGCTACCTGACTTCATATGGCTGTACATACTTCCTTGCCACAACAAGACATAATCTTCA
GCATTTAAGTATTATCCTTCCCATTTGCAGCTTCAAGAAACTTTTCAATCTTGGCCACTA
TCTTCATCAGCTTCCTT


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC004787A_C01 KMC004787A_c01
         (557 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|ZP_00071334.1| hypothetical protein [Trichodesmium erythraeu...    33  3.1
gb|EAA36982.1| GLP_514_15116_17125 [Giardia lamblia ATCC 50803]        32  4.0
ref|NP_113933.1| restin (Reed-Steinberg cell-espressed intermedi...    32  6.9
gb|EAA30411.1| hypothetical protein [Neurospora crassa]                31  9.0
ref|NP_509663.1| Putative protein, nematode specific [Caenorhabd...    31  9.0

>ref|ZP_00071334.1| hypothetical protein [Trichodesmium erythraeum IMS101]
          Length = 285

 Score = 32.7 bits (73), Expect = 3.1
 Identities = 21/71 (29%), Positives = 36/71 (50%)
 Frame = -1

Query: 326 LGSFLFFCFCINGV*LFHGIYPSLLIQFDAVLVMLERELVMLIVLETGLYGSVLYCCLVK 147
           +G FL+    + G+ L + I+ +L +QFD+++  L   L  L  L+  L G +    L  
Sbjct: 54  VGIFLYISLLLPGISLINTIFNNLSLQFDSLIYQLPTWLKFLSYLDNAL-GLLFKLILFT 112

Query: 146 VYALPFKFQLV 114
           ++ L   F LV
Sbjct: 113 IFLLIIGFLLV 123
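
Each hit in this report lists a bit score followed by a raw score in parentheses, for example 32.7 bits (73). The sketch below shows how the two can be related using the standard Karlin-Altschul normalization, bits = (lambda * S - ln K) / ln 2, with the gapped Lambda and K values printed in the report footer. The function name `bit_score` is hypothetical, and the choice of the gapped parameters is an assumption that happens to reproduce the printed values.

```python
import math

# Gapped Karlin-Altschul parameters copied from the "Gapped Lambda K H"
# lines in the footer of this report.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score S to bits: (lambda*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Raw scores of the hits listed in this report.
for raw in (73, 72, 70, 69):
    print(raw, round(bit_score(raw), 1))
```

Rounded to one decimal place, the four raw scores give 32.7, 32.3, 31.6 and 31.2 bits, matching the Score lines of the individual alignments.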

>gb|EAA36982.1| GLP_514_15116_17125 [Giardia lamblia ATCC 50803]
          Length = 669

 Score = 32.3 bits (72), Expect = 4.0
 Identities = 14/42 (33%), Positives = 24/42 (56%)
 Frame = +1

Query: 295 MQKQKNKKEPSVPISNPSTSQCSN*YAAGYRVSVSNLRDHLL 420
           MQ     K+P +P+++  T+  SN  A   + S +++ DHLL
Sbjct: 237 MQDNSKHKKPHIPVASDKTASSSNGVARRQQTSTADIVDHLL 278
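
The Query coordinates in a BLASTX alignment are nucleotide positions, and the Frame line says which of the six reading frames was translated. As a minimal sketch of that translation, the code below builds the standard genetic code table and translates the first 30 nucleotides of the query region 295-420 (copied from the Fasta sequence above) in frame +1. The helper `translate` is hypothetical and only handles plain A/C/G/T input; any other codon is reported as `X`.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def translate(dna, frame=1):
    """Translate dna in frame +1..+3 (forward) or -1..-3 (reverse complement)."""
    seq = dna.upper()
    if frame < 0:
        seq = seq.translate(COMPLEMENT)[::-1]
    start = abs(frame) - 1
    codons = (seq[i:i + 3] for i in range(start, len(seq) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# Query positions 295-324 of KMC004787A_c01. Position 295 is the first base
# of a frame +1 codon, since (295 - 1) is divisible by 3.
excerpt = "ATGCAAAAGCAAAAGAACAAAAAAGAGCCA"
print(translate(excerpt))  # MQKQKNKKEP
```

The result agrees with the first ten residues of the Query line in the alignment above. Stop codons are written as `*`, as in the report.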

>ref|NP_113933.1| restin (Reed-Steinberg cell-espressed intermediate filament-asso;
           restin (Reed-Steinberg cell-espressed intermediate
           filament-associated protein) [Rattus norvegicus]
           gi|8247352|emb|CAB92974.1| CLIP-170 [Rattus norvegicus]
          Length = 1320

 Score = 31.6 bits (70), Expect = 6.9
 Identities = 16/37 (43%), Positives = 21/37 (56%)
 Frame = +1

Query: 151 TKQQYKTEP*SPVSKTISITSSLSNITRTASNWISRL 261
           T      +P  PV+K  S T  +SN+T+TAS  IS L
Sbjct: 162 TPSNIPQKPSQPVAKETSATPQISNLTKTASESISNL 198

>gb|EAA30411.1| hypothetical protein [Neurospora crassa]
          Length = 1786

 Score = 31.2 bits (69), Expect = 9.0
 Identities = 14/58 (24%), Positives = 31/58 (53%)
 Frame = -2

Query: 556  RKLMKIVAKIEKFLEAANGKDNT*MLKIMSCCGKEVCTAI*SQVALGGDPGGCLLRRG 383
            +K+ ++VA +++ ++     D+  +++ + C  KE   +I  +   GG  G CL + G
Sbjct: 1539 KKMQELVAALDQEIDTMQHLDHVNIVQYLGCERKETSISIFLEYISGGSIGSCLRKHG 1596

>ref|NP_509663.1| Putative protein, nematode specific [Caenorhabditis elegans]
            gi|7499603|pir||T21214 hypothetical protein F21G4.6 -
            Caenorhabditis elegans gi|3876204|emb|CAB02662.1|
            Hypothetical protein F21G4.6 [Caenorhabditis elegans]
          Length = 1721

 Score = 31.2 bits (69), Expect = 9.0
 Identities = 18/60 (30%), Positives = 30/60 (50%)
 Frame = -1

Query: 221  ERELVMLIVLETGLYGSVLYCCLVKVYALPFKFQLVDISGYSCFLNDLDVLNNYLFNSLK 42
            E E + L ++ +GL   +     V++Y L     LV    Y  F++ +D+L+ YL   LK
Sbjct: 1429 EPESIALKLVSSGLSNDIK---TVRIYTLAGVLYLVQSDSYESFISSIDILSAYLEKYLK 1485

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 460,881,506
Number of Sequences: 1393205
Number of extensions: 9763408
Number of successful extensions: 23586
Number of sequences better than 10.0: 11
Number of HSP's better than 10.0 without gapping: 22777
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 23558
length of database: 448,689,247
effective HSP length: 116
effective length of database: 287,077,467
effective search space used: 19808345223
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
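
The length figures in the statistics above are consistent with one another, which the following sketch checks by plain arithmetic. It assumes that the effective database length is the database length minus one effective HSP length per sequence, and that the effective query length is the translated query length (557 nucleotides divided by 3, rounded down) minus the effective HSP length. These formulas are an assumption that reproduces the printed numbers; the report itself does not state how they were derived, and the function names are hypothetical.

```python
# Values copied from the report.
QUERY_NT = 557              # query length in nucleotides
DB_LETTERS = 448_689_247    # length of database
DB_SEQUENCES = 1_393_205    # number of sequences in database
HSP_LEN = 116               # effective HSP length


def effective_db_length(letters, sequences, hsp_len):
    """Database length minus one effective HSP length per sequence."""
    return letters - sequences * hsp_len


def effective_search_space(query_nt, letters, sequences, hsp_len):
    """Effective query length (in residues) times effective database length."""
    effective_query = query_nt // 3 - hsp_len
    return effective_query * effective_db_length(letters, sequences, hsp_len)


print(effective_db_length(DB_LETTERS, DB_SEQUENCES, HSP_LEN))
print(effective_search_space(QUERY_NT, DB_LETTERS, DB_SEQUENCES, HSP_LEN))
```

Under these assumptions the two printed values are 287077467 and 19808345223, the same as the "effective length of database" and "effective search space used" lines.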


EST assemble image


     clone         accession  position
1    MRL001b02_f   BP083706   1-557
2    MPDL091f03_f  AV781259   52-549




Lotus japonicus
Kazusa DNA Research Institute