KMC018038A_c01

Fasta Sequence
>KMC018038A_C01 KMC018038A_c01
GAAGAATAACAATTTCAATGAAATCCAATCTGTGCTATCTTAGTCAGAGCAACTTCAGTT
AAATTAGACAAAACCAAGATTATAAAAAAAAATTATAAATTTTTCAGATTTTGAACATCA
AGATGTAGAAGAAGTAGAAGACCCAATAATGAAAGGTGAAGATGGAGGACGAGCACGTCT
GGTTCTACTCCTTCCCCTTCTCAATCCGGAGCTAGAAGAAGTAGGGGAAGATGAAGAGGA
TGTTGAATTTGAGACAGTGAGCTTGTTTGGTTCACTTGTTAATGCTTGCTTCTCTCCACA
TTGCTGCTTTTCTTCTATTTTATGATGCCTAACTTTTGTTGAATTCAAAGTCACCGGAAG
AACACAATGGCACAAAAAACCAATCCTGGCAAGACGATTGACCCAACTGGGTATTGGATT
CTCAGTAAGCCTAACACAAGCAGCATTGCAGAAATGGTTACAGTTCTTTGTGATGAGATT
GTAAGCGTTTCCTCTATACTCCGCCGCAAGTTCTTCCATCACCGCCCTCACCTCACCGGG
ACCCATGCTCGTTTTCCCGATCAGAATCGT
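
Every hit in the Nr search section is reported with Frame = -1 and a query start of 570, so the aligned protein is read from the reverse complement of this contig, starting at its last base. The following is a minimal sketch of that translation, assuming the standard genetic code; the function names are illustrative and are not part of BLAST or of the Kazusa tooling. It uses only the final 30 bases of the sequence above.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of an unambiguous DNA string."""
    return dna.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate_frame_minus1(dna: str) -> str:
    """Translate frame -1: reverse-complement, then read codons from the first base."""
    rc = reverse_complement(dna)
    # Codons containing anything other than A, C, G, T are reported as 'X'.
    return "".join(CODON_TABLE.get(rc[i:i + 3], "X") for i in range(0, len(rc) - 2, 3))


# Last 30 bases of KMC018038A_c01 (query positions 541 to 570).
tail = "ACCCATGCTCGTTTTCCCGATCAGAATCGT"
print(translate_frame_minus1(tail))  # TILIGKTSMG
```

The printed residues match the start of the Query line in each alignment (TILIGKTSMG...).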


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC018038A_C01 KMC018038A_c01
         (570 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_565243.1| expressed protein; protein id: At1g80690.1, sup...   166  2e-40
pir||C96839 hypothetical protein F23A5.4 [imported] - Arabidopsi...   160  1e-38
ref|NP_568467.1| putative protein; protein id: At5g25170.1, supp...   135  5e-31
ref|NP_565588.1| expressed protein; protein id: At2g25190.1, sup...   123  2e-27
gb|AAL38722.1| unknown protein [Arabidopsis thaliana] gi|2025907...   118  6e-26

>ref|NP_565243.1| expressed protein; protein id: At1g80690.1, supported by cDNA:
           40037. [Arabidopsis thaliana] gi|21593549|gb|AAM65516.1|
           unknown [Arabidopsis thaliana]
          Length = 227

 Score =  166 bits (419), Expect = 2e-40
 Identities = 82/151 (54%), Positives = 110/151 (72%), Gaps = 5/151 (3%)
 Frame = -1

Query: 570 TILIGKTSMGPGEVRAVMEELAAEYRGNAYNLITKNCNHFCNAACVRLTENPIPSWVNRL 391
           +ILIGKT +GP EVRA ME+LA  Y+G++YNLITKNCNHFC+  C++LT NPIPSWVNRL
Sbjct: 78  SILIGKTDLGPLEVRATMEQLADNYKGSSYNLITKNCNHFCDETCIKLTGNPIPSWVNRL 137

Query: 390 ARIGFLCHCVLPVTLNSTKVRHHKIEEKQQC---GEKQALT--SEPNKLTVSNSTSSSSS 226
           ARIGF+C+CVLP T+N+T+  ++++ + + C    EK+ LT  S   + T++  +SSSSS
Sbjct: 138 ARIGFMCNCVLPATINATRFGNNRVNQDKSCEAENEKKKLTSVSSRERSTIATPSSSSSS 197

Query: 225 PTSSSSGLRRGRSRTRRARPPSSPFIIGSST 133
           P+    G  R R R  RA  PSSP  +GSS+
Sbjct: 198 PSVQVRG--RSRKRRPRALQPSSPLTLGSSS 226

>pir||C96839 hypothetical protein F23A5.4 [imported] - Arabidopsis thaliana
           gi|6503281|gb|AAF14657.1|AC011713_5 Contains similarity
           to gb|AF151904 CGI-146 protein from Homo sapiens.  EST
           gb|T44446 comes from this gene. [Arabidopsis thaliana]
          Length = 231

 Score =  160 bits (404), Expect = 1e-38
 Identities = 82/155 (52%), Positives = 110/155 (70%), Gaps = 9/155 (5%)
 Frame = -1

Query: 570 TILIGKTSMGPGEVRAVMEELAAEYRGNAYNLITKNCNHFCNAACVRLTENPIPSWVNRL 391
           +ILIGKT +GP EVRA ME+LA  Y+G++YNLITKNCNHFC+  C++LT NPIPSWVNRL
Sbjct: 78  SILIGKTDLGPLEVRATMEQLADNYKGSSYNLITKNCNHFCDETCIKLTGNPIPSWVNRL 137

Query: 390 ARI----GFLCHCVLPVTLNSTKVRHHKIEEKQQC---GEKQALT--SEPNKLTVSNSTS 238
           ARI    GF+C+CVLP T+N+T+  ++++ + + C    EK+ LT  S   + T++  +S
Sbjct: 138 ARIGKFSGFMCNCVLPATINATRFGNNRVNQDKSCEAENEKKKLTSVSSRERSTIATPSS 197

Query: 237 SSSSPTSSSSGLRRGRSRTRRARPPSSPFIIGSST 133
           SSSSP+    G  R R R  RA  PSSP  +GSS+
Sbjct: 198 SSSSPSVQVRG--RSRKRRPRALQPSSPLTLGSSS 230

>ref|NP_568467.1| putative protein; protein id: At5g25170.1, supported by cDNA:
           263500. [Arabidopsis thaliana]
          Length = 218

 Score =  135 bits (339), Expect = 5e-31
 Identities = 74/138 (53%), Positives = 86/138 (61%), Gaps = 1/138 (0%)
 Frame = -1

Query: 570 TILIGKTSMGPGEVRAVMEELAAEYRGNAYNLITKNCNHFCNAACVRLTENPIPSWVNRL 391
           +ILIG+T + P  VR  ME+LA EY GN+Y+LITKNCNHFCN  CV+LT   IPSWVNRL
Sbjct: 81  SILIGRTDLDPENVRVFMEKLAEEYSGNSYHLITKNCNHFCNDVCVQLTRRSIPSWVNRL 140

Query: 390 ARIGFLCHCVLPVTLNSTKVRH-HKIEEKQQCGEKQALTSEPNKLTVSNSTSSSSSPTSS 214
           AR G  C+CVLP  LN TKVR     EEK    EK+ L S  ++     S SSS S   S
Sbjct: 141 ARFGLFCNCVLPAELNETKVRQVRSKEEKIPEVEKKKLRSRSSRFPPGPSLSSSGSLNRS 200

Query: 213 SSGLRRGRSRTRRARPPS 160
             G RR     R+  PPS
Sbjct: 201 RRGERR-----RQCLPPS 213

>ref|NP_565588.1| expressed protein; protein id: At2g25190.1, supported by cDNA:
           14105., supported by cDNA: gi_13877550, supported by
           cDNA: gi_20148724 [Arabidopsis thaliana]
           gi|25354456|pir||D84645 hypothetical protein At2g25190
           [imported] - Arabidopsis thaliana
           gi|4567258|gb|AAD23672.1| expressed protein [Arabidopsis
           thaliana] gi|13877551|gb|AAK43853.1|AF370476_1 Unknown
           protein [Arabidopsis thaliana]
           gi|20148725|gb|AAM10253.1| unknown protein [Arabidopsis
           thaliana] gi|21553378|gb|AAM62471.1| unknown
           [Arabidopsis thaliana]
          Length = 240

 Score =  123 bits (308), Expect = 2e-27
 Identities = 70/157 (44%), Positives = 91/157 (57%), Gaps = 16/157 (10%)
 Frame = -1

Query: 570 TILIGKTSMGPGEVRAVMEELAAEYRGNAYNLITKNCNHFCNAACVRLTENPIPSWVNRL 391
           +IL+GKT +   EVR  ME+LA EY+GN Y+LIT+NCNHFCN  C++L +  IP WVNRL
Sbjct: 80  SILVGKTDLVAKEVRVFMEKLAEEYQGNKYHLITRNCNHFCNEVCLKLAQKSIPRWVNRL 139

Query: 390 ARIGFLCHCVLPVTLNSTKVRHHKIEEKQQCGEKQ---------ALTSEPNKLTVSNSTS 238
           AR+G LC+CVLP  LN  KVR     E  +  +K+          L+S P+  T  N  S
Sbjct: 140 ARLGVLCNCVLPPRLNEAKVRRVGKGELSESEKKKLRNRSRSDPLLSSSPSSSTPDNHRS 199

Query: 237 -----SSSSPTSSSSGLRRGRSRTRR--ARPPSSPFI 148
                SS +  SSSS    G  + RR  A+   SP +
Sbjct: 200 HIRAKSSGNHPSSSSSSSSGSKKNRRPKAQDQKSPSV 236

>gb|AAL38722.1| unknown protein [Arabidopsis thaliana] gi|20259079|gb|AAM14255.1|
           unknown protein [Arabidopsis thaliana]
          Length = 249

 Score =  118 bits (295), Expect = 6e-26
 Identities = 64/141 (45%), Positives = 86/141 (60%), Gaps = 3/141 (2%)
 Frame = -1

Query: 570 TILIGKTSMGPGEVRAVMEELAAEYRGNAYNLITKNCNHFCNAACVRLTENPIPSWVNRL 391
           +IL+G+T M   EVR+ ME+L+ EY+GN Y+LIT+NCNHFCN   ++LT   IPSWVNRL
Sbjct: 80  SILVGETEMKAKEVRSFMEKLSEEYQGNKYHLITRNCNHFCNHVSLKLTHKSIPSWVNRL 139

Query: 390 ARIGFLCHCVLPVTLNSTKVRHHKIEEKQQCGEKQALTSEPNKLTVSNSTSSSSSPTSSS 211
           AR+GFLC+CVLP  LN TKV+    + K    E +    +  K  +  S S   S +SS+
Sbjct: 140 ARLGFLCNCVLPACLNETKVKRVGKDGKLLL-EGENTKKKKRKKKIRRSRSGPLSSSSSN 198

Query: 210 SGLRRGRSRTR---RARPPSS 157
           + L    +  R      PP S
Sbjct: 199 ARLDNTPTHNRSISTGNPPLS 219

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 482,737,451
Number of Sequences: 1393205
Number of extensions: 10236685
Number of successful extensions: 64517
Number of sequences better than 10.0: 327
Number of HSP's better than 10.0 without gapping: 44043
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 55915
length of database: 448,689,247
effective HSP length: 116
effective length of database: 287,077,467
effective search space used: 20956655091
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
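
The bit scores in the hit list can be cross-checked against the gapped Lambda and K values and the effective search space printed above. The sketch below assumes the usual Karlin-Altschul relations, bits = (Lambda * S - ln K) / ln 2 and E = search space * 2^(-bits); the constant and function names are illustrative. Because Lambda and K are printed to only three significant figures, the recomputed E-values agree with the report only approximately.

```python
import math

# Values copied from the statistics block of this report (gapped parameters).
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
SEARCH_SPACE = 20956655091  # "effective search space used"


def bit_score(raw_score: int) -> float:
    """Convert a raw alignment score to a bit score."""
    return (GAPPED_LAMBDA * raw_score - math.log(GAPPED_K)) / math.log(2)


def expect(raw_score: int) -> float:
    """Approximate E-value for a raw score over the reported search space."""
    return SEARCH_SPACE * 2.0 ** (-bit_score(raw_score))


# Raw scores are the parenthesised values on the "Score =" lines.
for raw in (419, 404, 339, 308, 295):
    print(raw, round(bit_score(raw)), f"{expect(raw):.1e}")
```

Rounded to whole bits, the five raw scores give 166, 160, 135, 123 and 118, matching the reported values.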


EST assemble image


No.  Clone        Accession  Position
1    SPD077d04_f  BP050157     1  618
2    SPD016a05_f  BP045225    49  570




Lotus japonicus
Kazusa DNA Research Institute