KMC017971A_c01

Fasta Sequence
>KMC017971A_C01 KMC017971A_c01
taatggatatactGAACCGGTTAGTAATGAATTAAATACAGATTTTTTTATAAAAATGTG
AAAGTTATTTTCTAAACTGTAACCATTACACAAATCAAAGCTGCTACGGGTAAACACCAA
TAAAGACGATACACCATTACATTAGATAGATTAAAAATTGTCGAAGATATATGATTCCTG
CCCTTCTCGTACCATAAGTTGAAGAAACCTCTATCTATTTCTTGGCATAAGTTATCAACA
TAGGGTTTTGTGGGGTTATCTTGAAACCTTGCAGGGCCTGCATGGCCACTGTGGATTGCA
TCTCATCTCCATATTCCACAAAGGCAATACCTGGCTTTGTTTCAACCATTCTAACTTCCT
TGAATCCAGGATATTGAAGAAAGAGCATTTGCAGCATCATGGGAGTTGTCTCATTGGGAA
GATTCTGAATGAAGAGAATATTATTGGGAGGCGCAGGAGCCTCAGGTACCATAGATTTTA
AACCACCTGGATAAGGTATAGGTGTTGCACCATAAGCACCAGGATAAGCAGGATTAAGAC
CCATCCCAGCTAAATTAGCATCATTTTGGTCC


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC017971A_C01 KMC017971A_c01
         (572 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

pir||S59117 small nuclear ribonucleoprotein U1A - potato gi|1050...   203  1e-51
ref|NP_182280.1| small nuclear ribonucleoprotein U1A; protein id...   178  4e-44
gb|AAK70905.1|AC087551_4 putative small nuclear ribonucleoprotei...   158  4e-38
gb|AAM64950.1| putative small nuclear ribonucleoprotein U2B [Ara...   136  2e-31
ref|NP_180585.1| small nuclear ribonucleoprotein U2B, putative; ...   136  2e-31

>pir||S59117 small nuclear ribonucleoprotein U1A - potato
           gi|1050840|emb|CAA90282.1| U1snRNP-specific protein, U1A
           [Solanum tuberosum]
          Length = 253

 Score =  203 bits (517), Expect = 1e-51
 Identities = 98/122 (80%), Positives = 108/122 (88%), Gaps = 4/122 (3%)
 Frame = -2

Query: 571 DQNDANLAGMGLNPAYPGAYGATP----IPYPGGLKSMVPEAPAPPNNILFIQNLPNETT 404
           DQ D+N AGMGLNPAY GAYGA P    IPY GG K+ VPEAPAPPN+ILF+QNLP+++T
Sbjct: 132 DQQDSNQAGMGLNPAYAGAYGAAPPFSQIPYMGGAKAAVPEAPAPPNSILFVQNLPHQST 191

Query: 403 PMMLQMLFLQYPGFKEVRMVETKPGIAFVEYGDEMQSTVAMQALQGFKITPQNPMLITYA 224
           PMMLQMLF QYPGFKEVRM+E KPGIAF+EYGDEMQSTVAMQALQGFKIT +NPMLITYA
Sbjct: 192 PMMLQMLFCQYPGFKEVRMIEAKPGIAFIEYGDEMQSTVAMQALQGFKITAENPMLITYA 251

Query: 223 KK 218
           KK
Sbjct: 252 KK 253
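
The "Frame = -2" hits in this report read the query on its reverse strand, starting one base in from the 3' end (query position 571 of 572). The following is a minimal Python sketch of that translation, applied only to the last 32 bases of the contig. The helper names are illustrative and not part of BLAST, and the standard genetic code is assumed.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(dna):
    """Return the reverse complement of an unambiguous DNA string."""
    return dna.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate_frame(dna, frame):
    """Translate dna in reading frame 1..3 (forward) or -1..-3 (reverse)."""
    seq = dna.upper() if frame > 0 else reverse_complement(dna)
    offset = abs(frame) - 1  # frame -2 skips one base of the reverse strand
    codons = (seq[i:i + 3] for i in range(offset, len(seq) - 2, 3))
    # Codons containing anything other than A/C/G/T are reported as X.
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# Last 32 bases of KMC017971A_c01 (final line of the FASTA record).
tail = "CCATCCCAGCTAAATTAGCATCATTTTGGTCC"
print(translate_frame(tail, -2))  # DQNDANLAGM
```

The printed peptide matches the first ten residues of the query line that starts at position 571.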

>ref|NP_182280.1| small nuclear ribonucleoprotein U1A; protein id: At2g47580.1,
           supported by cDNA: gi_15450590, supported by cDNA:
           gi_16649010, supported by cDNA: gi_20259985 [Arabidopsis
           thaliana] gi|2119046|pir||S59118 small nuclear
           ribonucleoprotein U1A [imported] - Arabidopsis thaliana
           gi|1050430|emb|CAA90283.1| U1snRNP-specific protein
           [Arabidopsis thaliana] gi|2529669|gb|AAC62852.1| small
           nuclear ribonucleoprotein U1A [Arabidopsis thaliana]
           gi|15450591|gb|AAK96567.1| At2g47580/T30B22.12
           [Arabidopsis thaliana] gi|16649011|gb|AAL24357.1| small
           nuclear ribonucleoprotein U1A [Arabidopsis thaliana]
           gi|20259986|gb|AAM13340.1| small nuclear
           ribonucleoprotein U1A [Arabidopsis thaliana]
           gi|22655484|gb|AAM98334.1| At2g47580/T30B22.12
           [Arabidopsis thaliana]
          Length = 250

 Score =  178 bits (452), Expect = 4e-44
 Identities = 88/120 (73%), Positives = 99/120 (82%), Gaps = 4/120 (3%)
 Frame = -2

Query: 565 NDANLAGMGLNPAYPGAYGATP----IPYPGGLKSMVPEAPAPPNNILFIQNLPNETTPM 398
           +D+   GM +N AYPG YGA P    +PYPGG+K  +PEAPAPPNNILF+QNLP+ETTPM
Sbjct: 132 HDSTQMGMPMNSAYPGVYGAAPPLSQVPYPGGMKPNMPEAPAPPNNILFVQNLPHETTPM 191

Query: 397 MLQMLFLQYPGFKEVRMVETKPGIAFVEYGDEMQSTVAMQALQGFKITPQNPMLITYAKK 218
           +LQMLF QY GFKEVRM+E KPGIAFVE+ DEMQSTVAMQ LQGFKI  QN MLITYAKK
Sbjct: 192 VLQMLFCQYQGFKEVRMIEAKPGIAFVEFADEMQSTVAMQGLQGFKI-QQNQMLITYAKK 250

>gb|AAK70905.1|AC087551_4 putative small nuclear ribonucleoprotein U1A [Oryza sativa]
          Length = 253

 Score =  158 bits (400), Expect = 4e-38
 Identities = 82/119 (68%), Positives = 93/119 (77%), Gaps = 3/119 (2%)
 Frame = -2

Query: 565 NDANLAGMGLNPAYPGAYGATPI---PYPGGLKSMVPEAPAPPNNILFIQNLPNETTPMM 395
           +D +  G+G+N AYPG YGA P+   P+ G  K M+PE   P NNILF+QNLP+ETTPMM
Sbjct: 137 HDVSQVGLGVN-AYPGVYGAPPLSQLPFAGAQKVMMPEIIVP-NNILFVQNLPHETTPMM 194

Query: 394 LQMLFLQYPGFKEVRMVETKPGIAFVEYGDEMQSTVAMQALQGFKITPQNPMLITYAKK 218
           LQMLF QYPGFKEVRMVE KPGIAFVEYGDE Q+T AM  LQGFKIT  N MLI+YAKK
Sbjct: 195 LQMLFCQYPGFKEVRMVEAKPGIAFVEYGDEGQATAAMNHLQGFKITKDNQMLISYAKK 253

>gb|AAM64950.1| putative small nuclear ribonucleoprotein U2B [Arabidopsis thaliana]
          Length = 232

 Score =  136 bits (343), Expect = 2e-31
 Identities = 67/104 (64%), Positives = 82/104 (78%), Gaps = 2/104 (1%)
 Frame = -2

Query: 523 PGAYGATPIP--YPGGLKSMVPEAPAPPNNILFIQNLPNETTPMMLQMLFLQYPGFKEVR 350
           P A    P P   P G ++M      PPNNILFIQNLP+ETT MMLQ+LF QYPGFKE+R
Sbjct: 135 PSANNGVPAPSFQPSGQETM------PPNNILFIQNLPHETTSMMLQLLFEQYPGFKEIR 188

Query: 349 MVETKPGIAFVEYGDEMQSTVAMQALQGFKITPQNPMLITYAKK 218
           M++ KPGIAFVEY D++Q+++AMQ LQGFKITPQNPM+I++AKK
Sbjct: 189 MIDAKPGIAFVEYEDDVQASIAMQPLQGFKITPQNPMVISFAKK 232

 Score = 32.3 bits (72), Expect = 4.3
 Identities = 26/86 (30%), Positives = 41/86 (47%), Gaps = 6/86 (6%)
 Frame = -2

Query: 460 APAPPNNILFIQNL----PNETTPMMLQMLFLQYPGFKEVRMVETKP--GIAFVEYGDEM 299
           A  PPN+ ++IQNL      E     L  LF Q+    +V  ++T    G A+V + +  
Sbjct: 4   ADIPPNHSIYIQNLNERIKKEELKRSLYCLFSQFGRILDVVALKTPKLRGQAWVTFSEVT 63

Query: 298 QSTVAMQALQGFKITPQNPMLITYAK 221
            +  A++ +Q F      PM + YAK
Sbjct: 64  AAGHAVRQMQNFP-XYDKPMRLQYAK 88

>ref|NP_180585.1| small nuclear ribonucleoprotein U2B, putative; protein id:
           At2g30260.1, supported by cDNA: 34995. [Arabidopsis
           thaliana] gi|25294326|pir||C84706 probable small nuclear
           ribonucleoprotein U2B [imported] - Arabidopsis thaliana
           gi|2347192|gb|AAC16931.1| putative small nuclear
           ribonucleoprotein U2B [Arabidopsis thaliana]
           gi|27765024|gb|AAO23633.1| At2g30260 [Arabidopsis
           thaliana]
          Length = 232

 Score =  136 bits (343), Expect = 2e-31
 Identities = 67/104 (64%), Positives = 82/104 (78%), Gaps = 2/104 (1%)
 Frame = -2

Query: 523 PGAYGATPIP--YPGGLKSMVPEAPAPPNNILFIQNLPNETTPMMLQMLFLQYPGFKEVR 350
           P A    P P   P G ++M      PPNNILFIQNLP+ETT MMLQ+LF QYPGFKE+R
Sbjct: 135 PSANNGVPAPSFQPSGQETM------PPNNILFIQNLPHETTSMMLQLLFEQYPGFKEIR 188

Query: 349 MVETKPGIAFVEYGDEMQSTVAMQALQGFKITPQNPMLITYAKK 218
           M++ KPGIAFVEY D++Q+++AMQ LQGFKITPQNPM+I++AKK
Sbjct: 189 MIDAKPGIAFVEYEDDVQASIAMQPLQGFKITPQNPMVISFAKK 232

 Score = 32.3 bits (72), Expect = 4.3
 Identities = 26/86 (30%), Positives = 40/86 (46%), Gaps = 6/86 (6%)
 Frame = -2

Query: 460 APAPPNNILFIQNL----PNETTPMMLQMLFLQYPGFKEVRMVETKP--GIAFVEYGDEM 299
           A  PPN  ++IQNL      E     L  LF Q+    +V  ++T    G A+V + +  
Sbjct: 4   ADIPPNQSIYIQNLNERIKKEELKRSLYCLFSQFGRILDVVALKTPKLRGQAWVTFSEVT 63

Query: 298 QSTVAMQALQGFKITPQNPMLITYAK 221
            +  A++ +Q F      PM + YAK
Sbjct: 64  AAGHAVRQMQNFPFY-DKPMRLQYAK 88

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 517,231,519
Number of Sequences: 1393205
Number of extensions: 11693493
Number of successful extensions: 30391
Number of sequences better than 10.0: 196
Number of HSP's better than 10.0 without gapping: 28600
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 30323
length of database: 448,689,247
effective HSP length: 116
effective length of database: 287,077,467
effective search space used: 21243732558
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
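
The search-space figures above can be cross-checked with simple arithmetic. The sketch below assumes the simplified relations "effective length = actual length minus effective HSP length" and "E = search space x 2^(-bit score)"; the real BLAST code applies further adjustments, so this is a consistency check rather than a reimplementation. All function names are illustrative.

```python
# Values copied from the report above.
QUERY_NT = 572              # query length in nucleotides
DB_LETTERS = 448_689_247    # length of database
DB_SEQS = 1_393_205         # number of sequences in database
HSP_LEN = 116               # effective HSP length
REPORTED_SPACE = 21_243_732_558


def effective_search_space(query_nt, db_letters, db_seqs, hsp_len):
    """Simplified search space for a translated (BLASTX) query."""
    query_aa = query_nt // 3                 # translated query length
    eff_query = query_aa - hsp_len           # effective query length
    eff_db = db_letters - db_seqs * hsp_len  # effective database length
    return eff_query * eff_db


def expect_from_bits(bits, space):
    """Approximate E-value from a bit score and a search space."""
    return space * 2.0 ** (-bits)


space = effective_search_space(QUERY_NT, DB_LETTERS, DB_SEQS, HSP_LEN)
print(space == REPORTED_SPACE)
print(expect_from_bits(32.3, space))
```

Because the report rounds bit scores, E-values recomputed this way agree with the listed ones only approximately.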


EST assemble image


    clone        accession  position
1   SPD047h09_f  BP047781     1  177
2   MFB046c08_f  BP037346    14  525
3   SPD097b05_f  BP051723    23  572
4   SPD054h09_f  BP048345    33  538
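
The clone table above can be turned into a per-base coverage profile of the contig. This is a small sketch that assumes the two position numbers are 1-based, inclusive start and end coordinates on the 572-letter contig; that reading is an assumption, not something the record states.

```python
CONTIG_LEN = 572

# (clone, accession, start, end) as listed in the table above.
MEMBERS = [
    ("SPD047h09_f", "BP047781", 1, 177),
    ("MFB046c08_f", "BP037346", 14, 525),
    ("SPD097b05_f", "BP051723", 23, 572),
    ("SPD054h09_f", "BP048345", 33, 538),
]


def coverage_depth(members, contig_len):
    """Return a list where index p (1-based) holds the number of ESTs covering p."""
    depth = [0] * (contig_len + 1)  # index 0 is unused
    for _clone, _accession, start, end in members:
        for pos in range(start, end + 1):
            depth[pos] += 1
    return depth


depth = coverage_depth(MEMBERS, CONTIG_LEN)
covered = sum(1 for d in depth[1:] if d > 0)
print(covered, min(depth[1:]), max(depth[1:]))  # 572 1 4
```

Under that assumption every position of the contig is covered by at least one EST, and the central region is covered by all four.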




Lotus japonicus
Kazusa DNA Research Institute