KMC001707A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC001707A_C01 KMC001707A_c01
AAAATGCCATAAATTTATATATGAGCATGTCTTACAATGTAAATTTTATGGGTCTGTCAC
TGCACATGAGTCTCCAACGTGAGAGGATCTGACCCCGTTTTAGTCTCAGATTAATGAGTT
AAAGCCTAGATAATTCATCTACTACCCTCTTCTCTAAATGTGAACAAGTTTTGGGAACAT
GATGAATTCTGCTTCTATCTTCTGTTTGGGCATTTGCATTCATCCAGGTATTGCTGGATA
CACGTACGATATAAAACCCCAAACGGGCCAACACCATAGAAATGACTTTTATCCCATGCA
TTTTGTCTTGAAAGACCATTACAGCTAATATAGGAACAATATGCTGACCCAAAACGCTTA
TAGCATGGGAGAAGAGGGAAGAGACCTCGAAAACCAGTCCCGTACAACCGAGAGTAACGA
GTTGCGAAATTATGGCTGTAAAAGCAAGATTCAATATATAGGCAGCTTTACCTGTCTCAT
ACCCCTCAATTTCTTTCTTCAAACCTTTCCA


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC001707A_C01 KMC001707A_c01
         (511 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_193556.1| putative protein; protein id: At4g18220.1 [Arab...   111  4e-24
pir||T04923 hypothetical protein T9A21.60 - Arabidopsis thaliana...   107  8e-23
ref|NP_193555.2| putative protein; protein id: At4g18210.1, supp...   107  8e-23
ref|NP_193554.1| putative protein; protein id: At4g18200.1 [Arab...    91  8e-18
gb|AAL69512.1| unknown protein [Arabidopsis thaliana] gi|2046549...    91  8e-18

>ref|NP_193556.1| putative protein; protein id: At4g18220.1 [Arabidopsis thaliana]
           gi|7487850|pir||T04924 hypothetical protein T9A21.70 -
           Arabidopsis thaliana gi|2832696|emb|CAA16794.1| putative
           protein [Arabidopsis thaliana]
           gi|7268615|emb|CAB78824.1| putative protein [Arabidopsis
           thaliana]
          Length = 344

 Score =  111 bits (278), Expect = 4e-24
 Identities = 51/86 (59%), Positives = 70/86 (81%)
 Frame = -1

Query: 511 WKGLKKEIEGYETGKAAYILNLAFTAIISQLVTLGCTGLVFEVSSLFSHAISVLGQHIVP 332
           WK L  E+E Y+ GK +Y++NL +TA+  Q+ ++GCTGL+FE+SSLFS+AIS LG  +VP
Sbjct: 220 WKTLSSEMENYKLGKVSYVMNLVWTAVTWQVFSIGCTGLIFELSSLFSNAISALGLPVVP 279

Query: 331 ILAVMVFQDKMHGIKVISMVLARLGF 254
           ILAV++F DKM+G+KVISM+LA  GF
Sbjct: 280 ILAVIIFHDKMNGLKVISMILAIWGF 305

>pir||T04923 hypothetical protein T9A21.60 - Arabidopsis thaliana
           gi|2832695|emb|CAA16793.1| putative protein [Arabidopsis
           thaliana] gi|7268614|emb|CAB78823.1| putative protein
           [Arabidopsis thaliana]
          Length = 348

 Score =  107 bits (267), Expect = 8e-23
 Identities = 51/86 (59%), Positives = 70/86 (81%)
 Frame = -1

Query: 511 WKGLKKEIEGYETGKAAYILNLAFTAIISQLVTLGCTGLVFEVSSLFSHAISVLGQHIVP 332
           WK L  E++ Y+ GK +YI+NL +TA+  Q+ ++G TGL+FE+SSLFS+AISVLG  +VP
Sbjct: 224 WKTLSSEMDNYKHGKVSYIMNLVWTAVTWQVFSIGGTGLIFELSSLFSNAISVLGLPVVP 283

Query: 331 ILAVMVFQDKMHGIKVISMVLARLGF 254
           ILAV++F DKM+G+KVISM+LA  GF
Sbjct: 284 ILAVIIFHDKMNGLKVISMILAIWGF 309

>ref|NP_193555.2| putative protein; protein id: At4g18210.1, supported by cDNA:
           gi_13877726 [Arabidopsis thaliana]
           gi|13877727|gb|AAK43941.1|AF370622_1 putative protein
           [Arabidopsis thaliana]
          Length = 149

 Score =  107 bits (267), Expect = 8e-23
 Identities = 51/86 (59%), Positives = 70/86 (81%)
 Frame = -1

Query: 511 WKGLKKEIEGYETGKAAYILNLAFTAIISQLVTLGCTGLVFEVSSLFSHAISVLGQHIVP 332
           WK L  E++ Y+ GK +YI+NL +TA+  Q+ ++G TGL+FE+SSLFS+AISVLG  +VP
Sbjct: 25  WKTLSSEMDNYKHGKVSYIMNLVWTAVTWQVFSIGGTGLIFELSSLFSNAISVLGLPVVP 84

Query: 331 ILAVMVFQDKMHGIKVISMVLARLGF 254
           ILAV++F DKM+G+KVISM+LA  GF
Sbjct: 85  ILAVIIFHDKMNGLKVISMILAIWGF 110

>ref|NP_193554.1| putative protein; protein id: At4g18200.1 [Arabidopsis thaliana]
            gi|7487848|pir||T04922 hypothetical protein T9A21.50 -
            Arabidopsis thaliana gi|2832694|emb|CAA16792.1| putative
            protein [Arabidopsis thaliana] gi|7268613|emb|CAB78822.1|
            putative protein [Arabidopsis thaliana]
          Length = 1128

 Score = 90.9 bits (224), Expect = 8e-18
 Identities = 43/86 (50%), Positives = 61/86 (70%)
 Frame = -1

Query: 511  WKGLKKEIEGYETGKAAYILNLAFTAIISQLVTLGCTGLVFEVSSLFSHAISVLGQHIVP 332
            W+ L  E+  Y+ GK +YIL LA  AI  Q+ T+GC GL+FE SS+FS++I+ +G  IVP
Sbjct: 1014 WRTLPSEMRNYKLGKVSYILTLASAAIFWQVYTVGCVGLIFESSSVFSNSITAVGLPIVP 1073

Query: 331  ILAVMVFQDKMHGIKVISMVLARLGF 254
            ++AV+VF DKM   K+ S++LA  GF
Sbjct: 1074 VVAVIVFHDKMDASKIFSIILAIWGF 1099

 Score = 88.2 bits (217), Expect = 5e-17
 Identities = 42/86 (48%), Positives = 61/86 (70%)
 Frame = -1

Query: 511 WKGLKKEIEGYETGKAAYILNLAFTAIISQLVTLGCTGLVFEVSSLFSHAISVLGQHIVP 332
           WK L  E+E Y+ GK  Y++ LA  AI  Q+ T+G  GL+FE SS+FS++I+ +G  IVP
Sbjct: 278 WKTLTSEMENYKLGKVPYVMTLASIAISWQVYTIGVVGLIFESSSVFSNSITAVGLPIVP 337

Query: 331 ILAVMVFQDKMHGIKVISMVLARLGF 254
           ++AV+VF DKM+  K+ S++LA  GF
Sbjct: 338 VVAVIVFHDKMNASKIFSIILAIWGF 363

 Score = 85.1 bits (209), Expect = 4e-16
 Identities = 41/86 (47%), Positives = 59/86 (67%)
 Frame = -1

Query: 511 WKGLKKEIEGYETGKAAYILNLAFTAIISQLVTLGCTGLVFEVSSLFSHAISVLGQHIVP 332
           W+ L  E+  Y+ GK +Y+L LA  AI  Q+ TLG  GL+FE SS+FS++I+ +G  IVP
Sbjct: 639 WETLPSEMRNYKLGKVSYVLTLASAAISWQVYTLGLVGLIFESSSVFSNSITAVGLPIVP 698

Query: 331 ILAVMVFQDKMHGIKVISMVLARLGF 254
           + AV+VF D+M   K+ S++LA  GF
Sbjct: 699 VAAVIVFHDRMDASKIFSIILAICGF 724

>gb|AAL69512.1| unknown protein [Arabidopsis thaliana] gi|20465497|gb|AAM20208.1|
           putative protein [Arabidopsis thaliana]
          Length = 377

 Score = 90.9 bits (224), Expect = 8e-18
 Identities = 43/86 (50%), Positives = 61/86 (70%)
 Frame = -1

Query: 511 WKGLKKEIEGYETGKAAYILNLAFTAIISQLVTLGCTGLVFEVSSLFSHAISVLGQHIVP 332
           W+ L  E+  Y+ GK +YIL LA  AI  Q+ T+GC GL+FE SS+FS++I+ +G  IVP
Sbjct: 263 WRTLPSEMRNYKLGKVSYILTLASAAIFWQVYTVGCVGLIFESSSVFSNSITAVGLPIVP 322

Query: 331 ILAVMVFQDKMHGIKVISMVLARLGF 254
           ++AV+VF DKM   K+ S++LA  GF
Sbjct: 323 VVAVIVFHDKMDASKIFSIILAIWGF 348

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 441,089,272
Number of Sequences: 1393205
Number of extensions: 9489071
Number of successful extensions: 23263
Number of sequences better than 10.0: 43
Number of HSP's better than 10.0 without gapping: 22699
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 23260
length of database: 448,689,247
effective HSP length: 114
effective length of database: 289,863,877
effective search space used: 15942513235
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 GENf010g10 BP058784 1 377
2 GNf053d08 BP071313 1 414
3 GNf087f07 BP073802 1 422
4 GNf058b02 BP071659 1 414
5 GENf020h01 BP059230 1 370
6 GENf091f08 BP062178 3 535
7 GNf009f03 BP068028 6 286
8 GENf026d06 BP059436 7 477
9 GENf023d11 BP059327 8 385




Lotus japonicus
Kazusa DNA Research Institute