KMC003829A_c01

Fasta Sequence
>KMC003829A_C01 KMC003829A_c01
gtttctggatcacaacgacaggcatcgggcggCTAAAGGTGTAAATAGGTAAAACTTGAG
GGAATGTAATATGCATTTAACACCATATATGTATATGAAATAATGAAAGTAGCAATAAAT
CACAACATCCCATAAGGCTGAGTCATAAAACCCCAAAAGAGGAACTATGCAGCCTTAACA
TATTAAAAAAGTATATAGAACATGGTGGCCATGAGCAGGACATATGAATGAGAATCTCCT
CAATATTTACACACTCTGCTAAATGATCAAGTTTGCCAACTTGTTCCGCCATTTTCTGTC
CTTGGTTAAACAATATGTATGAGTTTAGAAAACCCACCAGGAGATGGGTTACAAACACCT
TTGGCCTTTTTGCTTTAGAATCTTGTTTGTAGAAACTAGAAGGAATGCCTTGACTTTGCA
TTGTGCCTTCCACTCTTAAATATTATTTCCCCTCGCCTAATTGGGTACATAAAATACAAG
AAGCAGTTTGCTCATACAGTAAAAACACATATCCATCTGAATACATGATATATTTTGCGA
CAATAAATGATCTTTGGGAATTCACTGACTTAGTTGCTATGGATCTTAAGTCAAGAATCT
TTTAAGTTCTGAAAAAGCTCTGGAAGATGGGCATTAGTCAGCTTGGTCCTCTTGGTGATT
TCATTCTTCTTCTTCTTAGGTTTAGGCTTGGAAGAAATGCCTTGCATTTGCGCTGCACTC
CTTCTCTTTGCTGTGTTAGTAGCAGGCTTCATTCCGTTTTTTTGGAGATCCCTCTCAATA
TCAATCATATCACGAGGCAATTCAATCCCCCGGAAAAGCTGTTTTAGGTCATCATCCACC
TTAATGGGAACTTTAGGATCATTAGGGTAGGCAATGTCTTCCTGGGAATC


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC003829A_C01 KMC003829A_c01
         (890 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAM01137.1|AC108884_19 Putative Transcription initiation fact...   161  1e-38
ref|NP_193766.1| putative protein; protein id: At4g20330.1, supp...   141  2e-32
ref|NP_193833.1| putative protein; protein id: At4g21010.1 [Arab...   106  4e-22
pir||T49705 related to transcription initiation factor IIE, beta...    38  0.18
emb|CAB91686.2| related to transcription initiation factor IIE, ...    38  0.18

>gb|AAM01137.1|AC108884_19 Putative Transcription initiation factor IIE, beta subunit [Oryza
           sativa (japonica cultivar-group)]
          Length = 279

 Score =  161 bits (407), Expect = 1e-38
 Identities = 76/96 (79%), Positives = 86/96 (89%)
 Frame = -1

Query: 890 DSQEDIAYPNDPKVPIKVDDDLKQLFRGIELPRDMIDIERDLQKNGMKPATNTAKRRSAA 711
           DSQEDI YPNDPK  IKVDDDLKQLFR +ELPRDM+DIE++LQKNG+KP TNTAKRR+AA
Sbjct: 182 DSQEDIVYPNDPKAKIKVDDDLKQLFREMELPRDMVDIEKELQKNGIKPMTNTAKRRAAA 241

Query: 710 QMQGISSKPKPKKKKNEITKRTKLTNAHLPELFQNL 603
           Q+ G+  K KPKKK+ EIT+RTKLTNAHLPELFQNL
Sbjct: 242 QINGVQPKAKPKKKQREITRRTKLTNAHLPELFQNL 277

>ref|NP_193766.1| putative protein; protein id: At4g20330.1, supported by cDNA:
           gi_14326522, supported by cDNA: gi_18700217, supported
           by cDNA: gi_20466493 [Arabidopsis thaliana]
           gi|25407575|pir||G85230 hypothetical protein AT4g20330
           [imported] - Arabidopsis thaliana
           gi|5738378|emb|CAB52821.1| putative protein [Arabidopsis
           thaliana] gi|7268828|emb|CAB79033.1| putative protein
           [Arabidopsis thaliana]
           gi|14326523|gb|AAK60306.1|AF385715_1 AT4g20330/F9F13_2
           [Arabidopsis thaliana] gi|18700218|gb|AAL77719.1|
           AT4g20330/F9F13_2 [Arabidopsis thaliana]
           gi|20466494|gb|AAM20564.1| putative protein [Arabidopsis
           thaliana]
          Length = 286

 Score =  141 bits (355), Expect = 2e-32
 Identities = 71/100 (71%), Positives = 83/100 (83%), Gaps = 1/100 (1%)
 Frame = -1

Query: 890 DSQEDIAYPNDPKVPIKVDDDLKQLFRGIELPRDMIDIERDLQKNGMKPATNTAKRRSAA 711
           +SQEDIAYPND K  IKVDD+ K LFR I +P DM+D+E++L K G+KPATNTA+RR+AA
Sbjct: 183 NSQEDIAYPNDFKCEIKVDDEFKALFRDINIPNDMLDVEKELLKIGLKPATNTAERRAAA 242

Query: 710 QMQGISSKPK-PKKKKNEITKRTKLTNAHLPELFQNLKDS 594
           Q  GIS+KPK  KKKK EI+KRTKLTNAHLPELFQNL  S
Sbjct: 243 QTHGISNKPKDKKKKKQEISKRTKLTNAHLPELFQNLNGS 282

>ref|NP_193833.1| putative protein; protein id: At4g21010.1 [Arabidopsis thaliana]
           gi|7486972|pir||T10643 hypothetical protein T13K14.170 -
           Arabidopsis thaliana gi|5262791|emb|CAB45896.1| putative
           protein [Arabidopsis thaliana]
           gi|7268898|emb|CAB79101.1| putative protein [Arabidopsis
           thaliana] gi|22136644|gb|AAM91641.1| unknown protein
           [Arabidopsis thaliana]
          Length = 275

 Score =  106 bits (265), Expect = 4e-22
 Identities = 55/96 (57%), Positives = 75/96 (77%), Gaps = 1/96 (1%)
 Frame = -1

Query: 890 DSQEDIAYPNDPKVPIKVDDDLKQLFRGIELPRDMIDIERDLQKNGMKPATNTAKRRSAA 711
           DS+E   Y N+ + P K+DD+LK LFR I +P DM+++E++L K G+KPATN A+RR+A 
Sbjct: 182 DSKEGTVYRNNMEYP-KIDDELKALFRDI-IPSDMLEVEKELLKIGLKPATNIAERRAAE 239

Query: 710 QMQGISSKPK-PKKKKNEITKRTKLTNAHLPELFQN 606
           Q+ G+S+KPK  KKKK EIT RTKLTN+H+ ELFQ+
Sbjct: 240 QLHGVSNKPKDKKKKKKEITNRTKLTNSHMLELFQS 275

>pir||T49705 related to transcription initiation factor IIE, beta subunit
           [imported] - Neurospora crassa
           gi|28918641|gb|EAA28312.1| related to transcription
           initiation factor IIE, beta subunit [MIPS] [Neurospora
           crassa]
          Length = 390

 Score = 38.1 bits (87), Expect = 0.18
 Identities = 24/86 (27%), Positives = 44/86 (50%)
 Frame = -1

Query: 863 NDPKVPIKVDDDLKQLFRGIELPRDMIDIERDLQKNGMKPATNTAKRRSAAQMQGISSKP 684
           +DP +  +VD +LK +++ +E+P     ++R L+    KPA+   + +  A        P
Sbjct: 307 DDPSLFHEVDPELKVMWQKVEVPGTDTIVQR-LKAASQKPASEDPRDKMTAA-------P 358

Query: 683 KPKKKKNEITKRTKLTNAHLPELFQN 606
           K +KKK    +  K TN H+  L ++
Sbjct: 359 KAEKKKRAQRRTGKATNTHMEHLLKD 384

>emb|CAB91686.2| related to transcription initiation factor IIE, beta subunit
           [Neurospora crassa]
          Length = 302

 Score = 38.1 bits (87), Expect = 0.18
 Identities = 24/86 (27%), Positives = 44/86 (50%)
 Frame = -1

Query: 863 NDPKVPIKVDDDLKQLFRGIELPRDMIDIERDLQKNGMKPATNTAKRRSAAQMQGISSKP 684
           +DP +  +VD +LK +++ +E+P     ++R L+    KPA+   + +  A        P
Sbjct: 219 DDPSLFHEVDPELKVMWQKVEVPGTDTIVQR-LKAASQKPASEDPRDKMTAA-------P 270

Query: 683 KPKKKKNEITKRTKLTNAHLPELFQN 606
           K +KKK    +  K TN H+  L ++
Sbjct: 271 KAEKKKRAQRRTGKATNTHMEHLLKD 296
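
Every alignment above is reported in Frame = -1, meaning the query was reverse-complemented and translated starting from its last base (position 890). The sketch below is only an illustration of that step, not the BLASTX implementation: it takes the final 50 nt of the FASTA record, reverse-complements them, and translates with the standard genetic code. The helper names `reverse_complement` and `translate` are hypothetical.

```python
# Minimal sketch: reproduce the start of the frame -1 translation of the query.
# TAIL is the last line (final 50 nt) of the FASTA sequence in this record.
TAIL = "TTAATGGGAACTTTAGGATCATTAGGGTAGGCAATGTCTTCCTGGGAATC"

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def reverse_complement(seq):
    """Return the reverse complement of an upper- or lower-case DNA string."""
    pairs = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(pairs[base] for base in reversed(seq.upper()))

def translate(seq):
    """Translate complete codons from the first base; a trailing partial codon is dropped."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))

peptide = translate(reverse_complement(TAIL))
print(peptide)
```

The printed peptide can be compared with the first residues of the "Query: 890" lines in the alignments above.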

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 779,206,813
Number of Sequences: 1393205
Number of extensions: 17595140
Number of successful extensions: 41775
Number of sequences better than 10.0: 24
Number of HSP's better than 10.0 without gapping: 39870
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 41662
length of database: 448,689,247
effective HSP length: 122
effective length of database: 278,718,237
effective search space used: 48496973238
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
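
The bit scores in the alignments can be related to the raw scores shown in parentheses through the gapped Lambda and K values listed above, using the usual Karlin-Altschul conversion S' = (Lambda * S - ln K) / ln 2. The following is a minimal sketch under that assumption; the function name `bit_score` is hypothetical, and the parameters are the rounded values printed in this report, so small differences in the last digit are possible.

```python
import math

# Gapped Karlin-Altschul parameters as printed in this report (rounded values).
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410

def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score into a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)

# Raw scores taken from the "Score = ... bits (raw)" lines of the report.
for raw in (407, 355, 265, 87):
    print(raw, round(bit_score(raw), 2))
```

The computed values fall within one bit of the reported 161, 141, 106 and 38.1 bits. The E-values are not recomputed here because they also depend on the search space that the program actually used for each subject sequence.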


EST assemble image


no.  clone        accession  start  end
1    MPD075d04_f  AV774920   1      422
2    MR090b11_f   BP082903   29     371
3    GNf083a11    BP073463   111    230
4    MPD078f01_f  AV775130   212    683
5    MWL036e08_f  AV769184   331    702
6    MR062h04_f   BP080785   350    836
7    MR038f06_f   BP078960   373    891
8    GNf077g03    BP073082   388    759
9    MR050a03_f   BP079831   394    554
10   MF089a09_f   BP032955   429    871
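
Since the assembly image itself is not reproduced here, the clone table can be summarized numerically instead. The sketch below is an illustration only: it assumes the two position columns are 1-based, inclusive start and end coordinates on the contig, and the variable names are hypothetical. It reports the span covered by the ESTs and the depth of coverage at each position.

```python
# Rows copied from the clone table: index, clone, accession, start, end.
ROWS = """
1 MPD075d04_f AV774920 1 422
2 MR090b11_f BP082903 29 371
3 GNf083a11 BP073463 111 230
4 MPD078f01_f AV775130 212 683
5 MWL036e08_f AV769184 331 702
6 MR062h04_f BP080785 350 836
7 MR038f06_f BP078960 373 891
8 GNf077g03 BP073082 388 759
9 MR050a03_f BP079831 394 554
10 MF089a09_f BP032955 429 871
"""

def parse_rows(text):
    """Parse whitespace-separated table rows into (clone, accession, start, end)."""
    ests = []
    for line in text.strip().splitlines():
        _, clone, accession, start, end = line.split()
        ests.append((clone, accession, int(start), int(end)))
    return ests

ests = parse_rows(ROWS)
first = min(start for _, _, start, _ in ests)
last = max(end for _, _, _, end in ests)

# Number of ESTs overlapping each position, assuming inclusive coordinates.
depth = [
    sum(start <= pos <= end for _, _, start, end in ests)
    for pos in range(first, last + 1)
]

print("span:", first, "-", last)
print("minimum depth:", min(depth))
print("maximum depth:", max(depth))
```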




Lotus japonicus
Kazusa DNA Research Institute