KMC003347A_c01

Fasta Sequence
>KMC003347A_C01 KMC003347A_c01
agcaagacataattgtgattatctacatcaataacaacaatacacttataatacCTCCAA
GGTTTGAATTTTAATTTAATTTTTTATAATCAGTAATTTCTTGGGTATACTTTATAATCG
GTAATAGAAAAGTAGTAAGGGAATGGAACACTTTTTCCTATTTATACGGTAAGCCAGTTC
AGAGTTCTCCATGGTATACAAACACTCAAATCTTGCAGCTTGCATCATCAGCCGAAATAG
AGAGCAAAAAAAAACGCATATTGAACTTTCTGGTATCACAGTAAACAGGCCCTGAACCCC
TTTTCCAGTTCCTCAGCATCCAAGTTTTTCCCTATAAAAACAATTTTGTTTGTCCTTGGT
TCATCTGGCCCCCACAACCTTTCAGGTGATCCTTGAAAGATGTCATGAACTCCCTGAAAG
ACAAATCTCTCATCCATTCCTTGAACAGATAGAAGACCTTTCATCCTATAAATATCTTCA
CTCCGTTCCATCAACAAGGTACCAAGCCAAAAGTTAGCCTTCTCAAGGTCTAAGCTTCCT
TCACAAACTATGCTGACAGATGAAACACCAGGATCGTGAGTGTGATCATGAGAGTGGTGA
TCATGGTGATCATGCTTGTGGTCAAGTGAATCCTCATGATGATGGTGGTGATGATCATG


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC003347A_C01 KMC003347A_c01
         (659 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_178163.1| hypothetical protein; protein id: At1g80480.1 [...   225  4e-58
ref|NP_173025.1| PRLI-interacting factor L, putative; protein id...   213  2e-54
gb|AAG31652.1| PRLI-interacting factor L [Arabidopsis thaliana]       212  4e-54
ref|NP_485791.1| hypothetical protein [Nostoc sp. PCC 7120] gi|2...   120  2e-26
ref|NP_419140.1| conserved hypothetical protein [Caulobacter cre...   119  4e-26

>ref|NP_178163.1| hypothetical protein; protein id: At1g80480.1 [Arabidopsis
           thaliana] gi|25406668|pir||F96836 hypothetical protein
           T21F11.27 [imported] - Arabidopsis thaliana
           gi|6730739|gb|AAF27129.1|AC018849_17 hypothetical
           protein; 58060-60358 [Arabidopsis thaliana]
          Length = 444

 Score =  225 bits (574), Expect = 4e-58
 Identities = 106/129 (82%), Positives = 113/129 (87%), Gaps = 2/129 (1%)
 Frame = -1

Query: 659 HDHHHHHHEDSLDHKHDHHDHH--SHDHTHDPGVSSVSIVCEGSLDLEKANFWLGTLLME 486
           HDHHH H+ D   H HD HDHH  SHDHTHDPGVSSVSIVCEGSLDLEKAN WLGTLLME
Sbjct: 316 HDHHHDHNHDHDHHHHDGHDHHHHSHDHTHDPGVSSVSIVCEGSLDLEKANMWLGTLLME 375

Query: 485 RSEDIYRMKGLLSVQGMDERFVFQGVHDIFQGSPERLWGPDEPRTNKIVFIGKNLDAEEL 306
           RSEDIYRMKGLLSV  M+ERFVFQGVHDIFQGSP+RLWG +E R NKIVFIGKNL+ EEL
Sbjct: 376 RSEDIYRMKGLLSVHTMEERFVFQGVHDIFQGSPDRLWGREEERVNKIVFIGKNLNREEL 435

Query: 305 EKGFRACLL 279
           EKGF+ACL+
Sbjct: 436 EKGFKACLI 444
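
"Frame = -1" means the query residues in this alignment come from translating the reverse complement of the nucleotide sequence, starting at its first base; this is why the Query coordinates run downward from 659. The following is a minimal sketch of that translation in Python, assuming the standard genetic code. The function names are hypothetical, and the input is only the last FASTA line of the record (59 nt), not the full contig.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an unambiguous DNA string."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def translate_frame_minus1(seq: str) -> str:
    """Translate frame -1: reverse-complement, then read codons from base 1."""
    rc = reverse_complement(seq)
    usable = len(rc) - len(rc) % 3  # drop a trailing partial codon
    return "".join(CODON_TABLE[rc[i:i + 3]] for i in range(0, usable, 3))


# Last line of the FASTA record above (contig positions 601-659).
TAIL = "TCATGGTGATCATGCTTGTGGTCAAGTGAATCCTCATGATGATGGTGGTGATGATCATG"
protein_start = translate_frame_minus1(TAIL)
print(protein_start)
```

The printed peptide can be compared with the start of the first Query line (position 659 downward) in the alignment above.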

>ref|NP_173025.1| PRLI-interacting factor L, putative; protein id: At1g15730.1,
           supported by cDNA: gi_14194110, supported by cDNA:
           gi_20334729 [Arabidopsis thaliana]
           gi|25518551|pir||E86291 hypothetical protein F7H2.7
           [imported] - Arabidopsis thaliana
           gi|8927652|gb|AAF82143.1|AC034256_7 Contains similarity
           to COBW-like protein from Homo sapiens gb|AF257330 and
           contains a Viral (Superfamily 1) RNA helicase PF|01443
           domain.  EST gb|AI997977 comes from this genes.
           [Arabidopsis thaliana]
           gi|14194111|gb|AAK56250.1|AF367261_1 At1g15730/F7H2_7
           [Arabidopsis thaliana] gi|20334730|gb|AAM16226.1|
           At1g15730/F7H2_7 [Arabidopsis thaliana]
           gi|23397243|gb|AAN31903.1| putative PRLI-interacting
           factor L [Arabidopsis thaliana]
          Length = 448

 Score =  213 bits (542), Expect = 2e-54
 Identities = 98/130 (75%), Positives = 109/130 (83%), Gaps = 4/130 (3%)
 Frame = -1

Query: 656 DHHH----HHHEDSLDHKHDHHDHHSHDHTHDPGVSSVSIVCEGSLDLEKANFWLGTLLM 489
           DHHH    H H +  +H+H+H  HHSHDHTHDPGV SVSIVCEG LDLEKAN WLG LL 
Sbjct: 319 DHHHGHDCHDHHNEHEHEHEHEHHHSHDHTHDPGVGSVSIVCEGDLDLEKANMWLGALLY 378

Query: 488 ERSEDIYRMKGLLSVQGMDERFVFQGVHDIFQGSPERLWGPDEPRTNKIVFIGKNLDAEE 309
           +RSEDIYRMKG+LSVQ MDERFVFQGVH+IF+GSP+RLW  DE RTNKIVFIGKNL+ EE
Sbjct: 379 QRSEDIYRMKGILSVQDMDERFVFQGVHEIFEGSPDRLWRKDETRTNKIVFIGKNLNREE 438

Query: 308 LEKGFRACLL 279
           LE GFRACL+
Sbjct: 439 LEMGFRACLI 448

>gb|AAG31652.1| PRLI-interacting factor L [Arabidopsis thaliana]
          Length = 245

 Score =  212 bits (539), Expect = 4e-54
 Identities = 97/130 (74%), Positives = 109/130 (83%), Gaps = 4/130 (3%)
 Frame = -1

Query: 656 DHHH----HHHEDSLDHKHDHHDHHSHDHTHDPGVSSVSIVCEGSLDLEKANFWLGTLLM 489
           DHHH    H H +  +H+H+H  HHSHDHTHDPGV SVSIVCEG LDLEKAN WLG LL 
Sbjct: 116 DHHHGHDCHDHHNEHEHEHEHEHHHSHDHTHDPGVGSVSIVCEGDLDLEKANMWLGALLY 175

Query: 488 ERSEDIYRMKGLLSVQGMDERFVFQGVHDIFQGSPERLWGPDEPRTNKIVFIGKNLDAEE 309
           +R+EDIYRMKG+LSVQ MDERFVFQGVH+IF+GSP+RLW  DE RTNKIVFIGKNL+ EE
Sbjct: 176 QRNEDIYRMKGILSVQDMDERFVFQGVHEIFEGSPDRLWRKDETRTNKIVFIGKNLNREE 235

Query: 308 LEKGFRACLL 279
           LE GFRACL+
Sbjct: 236 LEMGFRACLI 245

>ref|NP_485791.1| hypothetical protein [Nostoc sp. PCC 7120] gi|25318042|pir||AI2024
           hypothetical protein all1751 [imported] - Nostoc sp.
           (strain PCC 7120) gi|17130841|dbj|BAB73450.1|
           ORF_ID:all1751~hypothetical protein [Nostoc sp. PCC
           7120]
          Length = 323

 Score =  120 bits (301), Expect = 2e-26
 Identities = 56/100 (56%), Positives = 72/100 (72%)
 Frame = -1

Query: 584 HTHDPGVSSVSIVCEGSLDLEKANFWLGTLLMERSEDIYRMKGLLSVQGMDERFVFQGVH 405
           H HD  V SV++V EG LD EK N W+  LL  +  DI+RMKG+L++ G D RFVFQGVH
Sbjct: 222 HEHDDTVFSVALVQEGELDGEKLNAWISELLRTQGTDIFRMKGILNIAGEDNRFVFQGVH 281

Query: 404 DIFQGSPERLWGPDEPRTNKIVFIGKNLDAEELEKGFRAC 285
            IF G P+RLW P+E R N++VFIG+NLD  +L++ F AC
Sbjct: 282 MIFDGRPDRLWKPNEKRKNELVFIGRNLDEAQLKQDFLAC 321

>ref|NP_419140.1| conserved hypothetical protein [Caulobacter crescentus CB15]
           gi|25400532|pir||H87288 conserved hypothetical protein
           CC0321 [imported] - Caulobacter crescentus
           gi|13421466|gb|AAK22308.1| conserved hypothetical
           protein [Caulobacter crescentus CB15]
          Length = 365

 Score =  119 bits (298), Expect = 4e-26
 Identities = 56/128 (43%), Positives = 75/128 (57%), Gaps = 5/128 (3%)
 Frame = -1

Query: 656 DHHHHHHEDSLDHKHDHHDHHSHDH-----THDPGVSSVSIVCEGSLDLEKANFWLGTLL 492
           DHHHHH  D +  +H  HDHH H H      HD GV  +S+  +  +D +K   WL  LL
Sbjct: 235 DHHHHHDHDHVHDEHCGHDHHHHHHDHKSDVHDDGVKGISLTLDKPVDGQKITAWLNDLL 294

Query: 491 MERSEDIYRMKGLLSVQGMDERFVFQGVHDIFQGSPERLWGPDEPRTNKIVFIGKNLDAE 312
             R  DI R KG++ V+G D+R VFQ VH I +G  +R W   + R +++VFIG++LD  
Sbjct: 295 ARRGPDILRAKGIIDVKGEDKRLVFQAVHMILEGDFQRPWTDKDKRYSRMVFIGRDLDEA 354

Query: 311 ELEKGFRA 288
           EL  GF A
Sbjct: 355 ELRAGFEA 362

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 563,452,050
Number of Sequences: 1393205
Number of extensions: 12564069
Number of successful extensions: 80613
Number of sequences better than 10.0: 1171
Number of HSP's better than 10.0 without gapping: 35390
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 57013
length of database: 448,689,247
effective HSP length: 119
effective length of database: 282,897,852
effective search space used: 28289785200
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
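
The length and search-space figures in the statistics block above can be cross-checked from the other values in the report. The sketch below assumes the usual Karlin-Altschul relations (bit score S' = (lambda*S - ln K) / ln 2 and E = K*m*n*exp(-lambda*S)) and assumes that the BLASTX query length in residues is the nucleotide length divided by three. Because lambda and K are printed to only three significant figures, recomputed bit scores and E-values should be expected to agree with the report only approximately. All function names are illustrative.

```python
import math

# Values copied from the BLASTX report above.
QUERY_NT = 659
DB_LETTERS = 448_689_247
DB_SEQS = 1_393_205
EFF_HSP_LEN = 119
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410


def effective_db_length(db_letters: int, db_seqs: int, hsp_len: int) -> int:
    """Database length after removing one effective HSP length per sequence."""
    return db_letters - db_seqs * hsp_len


def effective_query_length(query_nt: int, hsp_len: int) -> int:
    """Translated query length (nt // 3, an assumption) minus the HSP length."""
    return query_nt // 3 - hsp_len


def bit_score(raw: int) -> float:
    """Convert a raw score to bits using the gapped lambda and K."""
    return (LAMBDA_GAPPED * raw - math.log(K_GAPPED)) / math.log(2)


def expect(raw: int, search_space: int) -> float:
    """Approximate E-value for a raw score over the given search space."""
    return K_GAPPED * search_space * math.exp(-LAMBDA_GAPPED * raw)


db_eff = effective_db_length(DB_LETTERS, DB_SEQS, EFF_HSP_LEN)
search_space = effective_query_length(QUERY_NT, EFF_HSP_LEN) * db_eff
print(db_eff, search_space)
print(round(bit_score(574), 1), expect(574, search_space))
```

With these inputs the effective database length and search space reproduce the reported 282,897,852 and 28289785200 exactly; the bit score and E-value for the top hit (raw score 574) come out close to, but not identical with, the reported 225 bits and 4e-58.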


EST assemble image


No.  Clone          Accession  Start  End
1    MFB093c10_f    BP040775       1  492
2    MPD076f02_f    AV775001      52  515
3    MR090g02_f     BP082948      65  438
4    GNf035h12      BP069949      66  456
5    MF072h03_f     BP032142      69  484
6    MPDL081b03_f   AV780690     133  625
7    MPDL067f08_f   AV779928     135  644
8    MR007h08_f     BP076516     138  523
9    MWM245a10_f    AV768473     138  660
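
Since the assembly image itself is not reproduced here, the EST layout can be summarised numerically from the table instead. The sketch below computes per-position coverage depth from the listed positions; it assumes the two position values are 1-based, inclusive start and end coordinates on the contig, which the record does not state explicitly. The function name is hypothetical.

```python
# (clone, accession, start, end) rows copied from the table above.
ESTS = [
    ("MFB093c10_f", "BP040775", 1, 492),
    ("MPD076f02_f", "AV775001", 52, 515),
    ("MR090g02_f", "BP082948", 65, 438),
    ("GNf035h12", "BP069949", 66, 456),
    ("MF072h03_f", "BP032142", 69, 484),
    ("MPDL081b03_f", "AV780690", 133, 625),
    ("MPDL067f08_f", "AV779928", 135, 644),
    ("MR007h08_f", "BP076516", 138, 523),
    ("MWM245a10_f", "AV768473", 138, 660),
]


def coverage_depth(ests):
    """Return a list where element i is the number of ESTs covering position i+1."""
    length = max(end for _, _, _, end in ests)
    delta = [0] * (length + 2)  # difference array, indexed by 1-based position
    for _, _, start, end in ests:
        delta[start] += 1
        delta[end + 1] -= 1
    depth, running = [], 0
    for pos in range(1, length + 1):
        running += delta[pos]
        depth.append(running)
    return depth


depth = coverage_depth(ESTS)
print(len(depth), max(depth), min(depth))
```

A difference array keeps the computation linear in contig length plus number of ESTs, which is more than enough for a nine-member assembly but also scales to larger contigs.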




Lotus japonicus
Kazusa DNA Research Institute