KMC014822A_c01

Fasta Sequence
>KMC014822A_C01 KMC014822A_c01
tggggtggcatttcttctgaggagcatgatgaagcagttatgcttgaggcagcaatgttg
gtggaatccccgaagggcgtCAGTATCCCTATGCTTTTGCACCACATGAGTTCATGCAGA
ACAGGGGTATTTATCCTCGGCCAATGCCTCGTCCACCTTCGCCATCATTGACAGCTCAGC
GTTTAATAAGGGAACAACAGGTATGGTCGGAAATGTGGATTGTAGGTTGTAGGAGATGGC
CTTACCTAACAGCTGCTATTGTTTCTTATATTTATTTAATGCAGGATGATGAATATCTTG
CATCACTACAAGCAGATAGAGAAAAAGAATTGAAAGCCATCGAAGAAGCCGAGGCTGCTC
GTGAAGAAGAAAAGCGGAGAGAGGAAGAAGCTCGCAGGAAATTACAGGAAAGAGCAGGAA
TTGGAAACACAGTTGGCAGCAAAAGAAGCTTCCCTTCCATCCGAACCATCCTCTACTGAT
GAGAATGCTGTTACCTTGCTAGTAAGAATGCCAGATGGTAGCCGCCGCGGACGCCGATTC
CTTAGATCTGATAAGCTACAGTCTCTTTTCGACTTCATAGATATTGCTAGGGTGGTGAAA
CCTGGAAGTTACAGACTGGTGAGACCTTATCCTAGGCGTGCTTTTGGTAATGAAGAAAGT
GCATCGATACTGGAAGAGCTTGGCCTAACCAACAAGCAAGAAGCCTTGTTTTTGGAGTTA
GTCTAATGCCCTGAAGCATATTTCTGGAAAACCATTAATTGAAGCTTTTAGTTATTAAAT
TATGGATCCTTACACATGATGATATGGAATTTCCACCTGGTTtgtgttgtttgggtaaag
atatttataatgcacccgccagtctaggattacaacaaaatgaaattaaggttcgtgatt
tt
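
The BLASTX alignments below compare conceptual translations of this sequence against protein entries. As a minimal sketch of how the Frame = +1 and Frame = +3 query lines arise, the following translates the first 120 bases with the standard genetic code. The `translate` helper and the `SEQ` constant are illustrative names, not part of the record, and the standard codon table is an assumption about how the search was run.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# First 120 bases of KMC014822A_c01 (first two sequence lines of the record).
SEQ = (
    "tggggtggcatttcttctgaggagcatgatgaagcagttatgcttgaggcagcaatgttg"
    "gtggaatccccgaagggcgtCAGTATCCCTATGCTTTTGCACCACATGAGTTCATGCAGA"
)


def translate(seq: str, frame: int = 1) -> str:
    """Translate a forward frame (1, 2 or 3); unknown codons become 'X'."""
    s = seq.upper()[frame - 1:]
    codons = (s[i:i + 3] for i in range(0, len(s) - 2, 3))
    return "".join(CODON_TABLE.get(c, "X") for c in codons)


frame1 = translate(SEQ, 1)
frame3 = translate(SEQ, 3)
print(frame1[:25])    # query residues for bases 1-75 in frame +1
print(frame3[19:32])  # frame +3 residues starting at base 60
```

Base 60 falls on a frame +3 codon boundary because 60 = 3 + 19 x 3, which is why the slice starts at residue index 19.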


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC014822A_C01 KMC014822A_c01
         (902 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_567380.1| putative protein; protein id: At4g11740.1, supp...   166  6e-40
ref|NP_567675.1| putative protein; protein id: At4g23040.1, supp...   159  7e-38
pir||T05136 hypothetical protein F7H19.230 - Arabidopsis thalian...   159  7e-38
pir||T04221 hypothetical protein T5C23.170 - Arabidopsis thalian...   126  5e-28
ref|NP_680549.1| unknown protein; protein id: At4g00752.1 [Arabi...   125  7e-28

>ref|NP_567380.1| putative protein; protein id: At4g11740.1, supported by cDNA:
           gi_14596000, supported by cDNA: gi_17978734 [Arabidopsis
           thaliana] gi|14596001|gb|AAK68728.1| Unknown protein
           [Arabidopsis thaliana] gi|17978735|gb|AAL47361.1|
           unknown protein [Arabidopsis thaliana]
          Length = 390

 Score =  166 bits (419), Expect = 6e-40
 Identities = 84/120 (70%), Positives = 101/120 (84%)
 Frame = +1

Query: 364 KKKSGERKKLAGNYRKEQELETQLAAKEASLPSEPSSTDENAVTLLVRMPDGSRRGRRFL 543
           KK+   ++KL     +EQELE QL AKEASLP EP + +ENA+TLL+RMPDG+RRGRRFL
Sbjct: 275 KKEEEAQRKL----EEEQELERQLDAKEASLPKEPQADEENAITLLIRMPDGTRRGRRFL 330

Query: 544 RSDKLQSLFDFIDIARVVKPGSYRLVRPYPRRAFGNEESASILEELGLTNKQEALFLELV 723
           +SDKLQ+LF+FIDIARVVKP +YRLVRPYPR AFG+ ES S L +LGLT+KQEALFLEL+
Sbjct: 331 KSDKLQTLFNFIDIARVVKPNTYRLVRPYPRHAFGDGESESTLNDLGLTSKQEALFLELI 390

 Score = 87.8 bits (216), Expect(2) = 4e-23
 Identities = 58/141 (41%), Positives = 73/141 (51%), Gaps = 11/141 (7%)
 Frame = +3

Query: 60  GGIPEGRQYPYAFAPHEFMQNRGIYPRPMPRPPSPSLTAQRLIREQQVWSEMWIVGCRRW 239
           GGIPE       F P +        PR  PRPPSPSLTAQRLIREQQ             
Sbjct: 195 GGIPETGYNHLPFLPPQ--------PRAQPRPPSPSLTAQRLIREQQ------------- 233

Query: 240 PYLTAAIVSYIYLMQDDEYLASLQADREKELKAIEEAEAAR-----------EEEKRREE 386
                          DDEY+ASLQADR+KE+K+I +AEA +           EEEK++EE
Sbjct: 234 ---------------DDEYVASLQADRDKEMKSIRDAEARQLEEETARKAFLEEEKKKEE 278

Query: 387 EARRKLQERAGIGNTVGSKRS 449
           EA+RKL+E   +   + +K +
Sbjct: 279 EAQRKLEEEQELERQLDAKEA 299

 Score = 43.1 bits (100), Expect(2) = 4e-23
 Identities = 20/25 (80%), Positives = 21/25 (84%)
 Frame = +1

Query: 1   WGGISSEEHDEAVMLEAAMLVESPK 75
           WGGISSEEHDEAVMLEAAM    P+
Sbjct: 175 WGGISSEEHDEAVMLEAAMFGGIPE 199

>ref|NP_567675.1| putative protein; protein id: At4g23040.1, supported by cDNA:
           gi_13430703 [Arabidopsis thaliana]
           gi|13430704|gb|AAK25974.1|AF360264_1 unknown protein
           [Arabidopsis thaliana] gi|23296844|gb|AAN13184.1|
           unknown protein [Arabidopsis thaliana]
          Length = 525

 Score =  159 bits (401), Expect = 7e-38
 Identities = 79/120 (65%), Positives = 98/120 (80%)
 Frame = +1

Query: 364 KKKSGERKKLAGNYRKEQELETQLAAKEASLPSEPSSTDENAVTLLVRMPDGSRRGRRFL 543
           +K+   R+K+     +EQELE QL +KEASLP EP + +ENA+TL VR+PDG+R GRRF 
Sbjct: 410 RKEEEARRKV----EEEQELERQLVSKEASLPQEPPAGEENAITLQVRLPDGTRHGRRFF 465

Query: 544 RSDKLQSLFDFIDIARVVKPGSYRLVRPYPRRAFGNEESASILEELGLTNKQEALFLELV 723
           +SDKLQSLFDFIDI RVVKP +YRLVRPYPRRAFG+ E +S L ++GLT+KQEALFLEL+
Sbjct: 466 KSDKLQSLFDFIDICRVVKPNTYRLVRPYPRRAFGDGECSSTLNDIGLTSKQEALFLELI 525

 Score = 74.3 bits (181), Expect = 2e-12
 Identities = 59/136 (43%), Positives = 69/136 (50%), Gaps = 6/136 (4%)
 Frame = +3

Query: 60  GGIPEGRQ-YPYAFAPHEFMQNRGIYPRPMPRPPSPSLTAQRLIREQQVWSEMWIVGCRR 236
           GGI E     PYA            YP+   RPPSPSLTAQRLIREQ             
Sbjct: 339 GGISESEYGVPYAH-----------YPQRTQRPPSPSLTAQRLIREQ------------- 374

Query: 237 WPYLTAAIVSYIYLMQDDEYLASLQADREK-ELKAIEEAEAAR----EEEKRREEEARRK 401
                          QDDEYLASL+ADR K E + +EE EAAR    EE KR+EEEARRK
Sbjct: 375 ---------------QDDEYLASLEADRVKAEARRLEE-EAARVEAIEEAKRKEEEARRK 418

Query: 402 LQERAGIGNTVGSKRS 449
           ++E   +   + SK +
Sbjct: 419 VEEEQELERQLVSKEA 434

 Score = 43.1 bits (100), Expect = 0.006
 Identities = 22/30 (73%), Positives = 24/30 (79%), Gaps = 3/30 (10%)
 Frame = +1

Query: 1   WGGISSEEHDEAVMLEAAM---LVESPKGV 81
           WGGISSEEHDEA+MLEAAM   + ES  GV
Sbjct: 319 WGGISSEEHDEAIMLEAAMFGGISESEYGV 348

>pir||T05136 hypothetical protein F7H19.230 - Arabidopsis thaliana
           gi|3292830|emb|CAA19820.1| putative protein [Arabidopsis
           thaliana] gi|7269151|emb|CAB79259.1| putative protein
           [Arabidopsis thaliana]
          Length = 577

 Score =  159 bits (401), Expect = 7e-38
 Identities = 79/120 (65%), Positives = 98/120 (80%)
 Frame = +1

Query: 364 KKKSGERKKLAGNYRKEQELETQLAAKEASLPSEPSSTDENAVTLLVRMPDGSRRGRRFL 543
           +K+   R+K+     +EQELE QL +KEASLP EP + +ENA+TL VR+PDG+R GRRF 
Sbjct: 462 RKEEEARRKV----EEEQELERQLVSKEASLPQEPPAGEENAITLQVRLPDGTRHGRRFF 517

Query: 544 RSDKLQSLFDFIDIARVVKPGSYRLVRPYPRRAFGNEESASILEELGLTNKQEALFLELV 723
           +SDKLQSLFDFIDI RVVKP +YRLVRPYPRRAFG+ E +S L ++GLT+KQEALFLEL+
Sbjct: 518 KSDKLQSLFDFIDICRVVKPNTYRLVRPYPRRAFGDGECSSTLNDIGLTSKQEALFLELI 577

 Score = 82.8 bits (203), Expect = 6e-15
 Identities = 62/139 (44%), Positives = 79/139 (56%), Gaps = 9/139 (6%)
 Frame = +3

Query: 60  GGIPEGRQ-YPYAFAPHEFMQNRGIYPRPMPRPPSPSLTAQRLIREQQVWSE---MWIVG 227
           GGI E     PYA            YP+   RPPSPSLTAQRLIREQQ   +   ++++ 
Sbjct: 374 GGISESEYGVPYAH-----------YPQRTQRPPSPSLTAQRLIREQQDTDDDEFLFLLK 422

Query: 228 CRRWPYLTAAIVSYIYLMQDDEYLASLQADREK-ELKAIEEAEAAR----EEEKRREEEA 392
           C+              L+QDDEYLASL+ADR K E + +EE EAAR    EE KR+EEEA
Sbjct: 423 CK--------------LVQDDEYLASLEADRVKAEARRLEE-EAARVEAIEEAKRKEEEA 467

Query: 393 RRKLQERAGIGNTVGSKRS 449
           RRK++E   +   + SK +
Sbjct: 468 RRKVEEEQELERQLVSKEA 486

 Score = 43.1 bits (100), Expect = 0.006
 Identities = 22/30 (73%), Positives = 24/30 (79%), Gaps = 3/30 (10%)
 Frame = +1

Query: 1   WGGISSEEHDEAVMLEAAM---LVESPKGV 81
           WGGISSEEHDEA+MLEAAM   + ES  GV
Sbjct: 354 WGGISSEEHDEAIMLEAAMFGGISESEYGV 383

>pir||T04221 hypothetical protein T5C23.170 - Arabidopsis thaliana
           gi|4539465|emb|CAB39945.1| putative protein [Arabidopsis
           thaliana] gi|7267874|emb|CAB78217.1| putative protein
           [Arabidopsis thaliana]
          Length = 511

 Score =  126 bits (316), Expect = 5e-28
 Identities = 73/129 (56%), Positives = 89/129 (68%), Gaps = 9/129 (6%)
 Frame = +1

Query: 364 KKKSGERKKLAGNYRKEQELETQLAAKEASLPSEPSSTDENAVTLLVRMPDGSRRGRRFL 543
           KK+   ++KL     +EQELE QL AKEASLP EP + +ENA+TLL+RMPDG+RRGRRFL
Sbjct: 389 KKEEEAQRKL----EEEQELERQLDAKEASLPKEPQADEENAITLLIRMPDGTRRGRRFL 444

Query: 544 RSDKLQSLFDFIDIARVVKPG---------SYRLVRPYPRRAFGNEESASILEELGLTNK 696
           +SDKLQ   D   + R  + G         S   VRPYPR AFG+ ES S L +LGLT+K
Sbjct: 445 KSDKLQ--VDPFQLYRHCQSGETQHLQTGKSSTYVRPYPRHAFGDGESESTLNDLGLTSK 502

Query: 697 QEALFLELV 723
           QEALFLEL+
Sbjct: 503 QEALFLELI 511

 Score = 88.2 bits (217), Expect(2) = 3e-23
 Identities = 75/215 (34%), Positives = 100/215 (45%), Gaps = 26/215 (12%)
 Frame = +3

Query: 60  GGIPEGRQYPYAFAPHEFMQNRGIYPRPMPRPPSPSLTAQRLIREQQVWSEMWIVGCRRW 239
           GGIPE       F P +        PR  PRPPSPSLTAQRLIREQQ             
Sbjct: 309 GGIPETGYNHLPFLPPQ--------PRAQPRPPSPSLTAQRLIREQQ------------- 347

Query: 240 PYLTAAIVSYIYLMQDDEYLASLQADREKELKAIEEAEAAR-----------EEEKRREE 386
                          DDEY+ASLQADR+KE+K+I +AEA +           EEEK++EE
Sbjct: 348 ---------------DDEYVASLQADRDKEMKSIRDAEARQLEEETARKAFLEEEKKKEE 392

Query: 387 EARRKLQERAGIGNTVGSKR-SFP---------SIRTILY**ECC-----YLASKNARW* 521
           EA+RKL+E   +   + +K  S P         +I  ++   +       +L S   +  
Sbjct: 393 EAQRKLEEEQELERQLDAKEASLPKEPQADEENAITLLIRMPDGTRRGRRFLKSDKLQVD 452

Query: 522 PPRTPIP*I**ATVSFRLHRYC*GGETWKLQTGET 626
           P              F+L+R+C  GET  LQTG++
Sbjct: 453 P--------------FQLYRHCQSGETQHLQTGKS 473

 Score = 43.1 bits (100), Expect(2) = 3e-23
 Identities = 20/25 (80%), Positives = 21/25 (84%)
 Frame = +1

Query: 1   WGGISSEEHDEAVMLEAAMLVESPK 75
           WGGISSEEHDEAVMLEAAM    P+
Sbjct: 289 WGGISSEEHDEAVMLEAAMFGGIPE 313

>ref|NP_680549.1| unknown protein; protein id: At4g00752.1 [Arabidopsis thaliana]
          Length = 493

 Score =  125 bits (315), Expect = 7e-28
 Identities = 70/138 (50%), Positives = 89/138 (63%), Gaps = 17/138 (12%)
 Frame = +1

Query: 361 VKKKSGERKKLAGNYRKEQELETQLAAKE----ASLPSEPSSTDENAVTLLVRMPDGSRR 528
           V  +S       G +++ Q    ++  +E    A+LP EPS  +E+A+TLLVRMPD SR 
Sbjct: 355 VANESASHNNQLGIHKESQSPNQRVVREEPERKAALPIEPSGENEDAITLLVRMPDSSRH 414

Query: 529 GRRFLRSDKL-------------QSLFDFIDIARVVKPGSYRLVRPYPRRAFGNEESASI 669
           GRRFL+SDKL             + LFDFID A +VKPG+YR+VRPYPRRAF  ++ A  
Sbjct: 415 GRRFLKSDKLKVKFVSLNSQVISEYLFDFIDAAGLVKPGTYRVVRPYPRRAFSIQDGALT 474

Query: 670 LEELGLTNKQEALFLELV 723
            EEL LTNKQEALFLEL+
Sbjct: 475 FEELSLTNKQEALFLELL 492

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 792,563,030
Number of Sequences: 1393205
Number of extensions: 18228633
Number of successful extensions: 93248
Number of sequences better than 10.0: 295
Number of HSP's better than 10.0 without gapping: 67664
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 86142
length of database: 448,689,247
effective HSP length: 123
effective length of database: 277,325,032
effective search space used: 49086530664
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
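
The search-space figures above can be reproduced from the other numbers in this report. The sketch below assumes the usual Karlin-Altschul relation E = (search space) x 2^(-bit score), and assumes that effective lengths are obtained by subtracting the effective HSP length from the translated query length and from every database sequence. The variable names are illustrative. Because the report prints bit scores rounded, the computed E-values agree with the reported ones only approximately, and the formula does not apply to the Expect(2) lines, which combine several HSPs.

```python
# Figures copied from the report above.
QUERY_LETTERS = 902            # nucleotides in the query
DB_LETTERS = 448_689_247
DB_SEQUENCES = 1_393_205
EFFECTIVE_HSP_LENGTH = 123

# Assumption: a BLASTX query is measured in amino acids (nucleotides // 3).
effective_query = QUERY_LETTERS // 3 - EFFECTIVE_HSP_LENGTH
effective_db = DB_LETTERS - DB_SEQUENCES * EFFECTIVE_HSP_LENGTH
search_space = effective_query * effective_db


def expect_value(bit_score: float, space: int = search_space) -> float:
    """Approximate E-value for a single HSP from its bit score."""
    return space * 2.0 ** (-bit_score)


print(effective_db)    # compare with "effective length of database"
print(search_space)    # compare with "effective search space used"
for bits in (166, 159, 126, 125):
    print(bits, f"{expect_value(bits):.1e}")
```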


EST assembly


    clone        accession  position
1   MFB035d06_f  BP036565   1-463
2   MWL080c11_f  AV770020   331-902
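
As a small check on the assembly table, the sketch below computes the region shared by the two ESTs and confirms that together they span the whole 902-base contig. It assumes the two position values are 1-based, inclusive start and end coordinates on the contig; the `CLONES` list and variable names are illustrative.

```python
# (clone, accession, start, end) copied from the table above.
CLONES = [
    ("MFB035d06_f", "BP036565", 1, 463),
    ("MWL080c11_f", "AV770020", 331, 902),
]
CONTIG_LENGTH = 902

(_, _, start_a, end_a), (_, _, start_b, end_b) = CLONES

# Overlap of two closed intervals; empty if the start exceeds the end.
overlap_start = max(start_a, start_b)
overlap_end = min(end_a, end_b)
overlap_length = max(0, overlap_end - overlap_start + 1)

# The contig is fully covered if the clones overlap and reach both ends.
covers_contig = (
    overlap_length > 0
    and min(start_a, start_b) == 1
    and max(end_a, end_b) == CONTIG_LENGTH
)

print(overlap_start, overlap_end, overlap_length, covers_contig)
```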




Lotus japonicus
Kazusa DNA Research Institute