KMC005012A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC005012A_C01 KMC005012A_c01
GGCACTAAATTCCTTTCATCAATAATTTTGCCTACGTAGTTGACTGATCTGTGGGTTGAT
AATCTCAATATACCAACCCATAGCCCAACAAATTAATACAGATGGAGTTAGGGGGCATAA
CTCAAGCTTGACAAGCAGGATACAAATCCGTTAAAAAATAGACCACGAAAATAACAAATA
ATAATCATGTAACTACTCTATTTAAAAAGGTAACTACTCTATCAAATAAATCTCTCTTAA
AAAGACTCAGTTCTCTAGCCAGTCTAAGGCAATGATAATCAGGTACTAACTCAATCCTAG
TTCTTAAGATAACAAAAGGAAGAAACTAATTGAAATTACGTTCAATTAGTGATCATTTGA
GACTGGAATCTGGAAAGCTGCTGAGGAATGGTGGATATAACTGGCCCGTATCTCTGAACA
TAGTCATAGAGTTCAAACAATGGAACAGCAAGCAGTTTCAAGTGTTGGGGCACGGCAAAG
TATTCCCTCTCAGACAAGTGAACAAGGAAGAGCTTCTTGCACTCCTTGGGTTTTGTTATA
TGAGGAGGGCAGTATGGGTACATTATGGTTTCAAAATTTGGCCTCCACCAGATTGCAACA
CATTCACCTATCTGCCAGTCAGGCACAAGAGCCGGTGAATTAGCACCAAGCTTGCTGGTC
AACTTTCTCTTCAAGCCCTCAATTTCATTCTCTCCTGGCTTGAGACGACCACCAGGGAGT
TTGCAAAATGTGTTTCCnATTTGCAGGAGAAGTATATGAGGATGATTATGTTCTTGGACC
AGTAACATTCCTTCAACGCTGGTCCTCATTCCTTCCTTCATATAGTTGACTTTCATGCGA
GCGAGACGATCGGCGACGGAGGTGTCTTTCTCCATCTTGGGCTCCTTGGTACCGAAGGTG
TAGCTGGAAAGAGGGTATGTGTTTACCACCTGAGATGACACCatcttcttcttcttctcg
agcttcttcttcttcttcttctgttgtgctgagaattgaacgaagaatggaagaagagag
aa


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC005012A_C01 KMC005012A_c01
         (1022 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

dbj|BAC42701.1| unknown protein [Arabidopsis thaliana] gi|289732...   369  e-101
ref|NP_194285.1| putative protein; protein id: At4g25550.1 [Arab...   317  1e-85
pir||C85295 hypothetical protein AT4g25550 [imported] - Arabidop...   313  2e-84
ref|NP_008937.1| cleavage and polyadenylation specific factor 5,...   243  5e-63
ref|NP_080899.1| cleavage and polyadenylation specific factor 5;...   243  5e-63

>dbj|BAC42701.1| unknown protein [Arabidopsis thaliana] gi|28973229|gb|AAO63939.1|
           unknown protein [Arabidopsis thaliana]
          Length = 200

 Score =  369 bits (947), Expect = e-101
 Identities = 172/200 (86%), Positives = 188/200 (94%)
 Frame = -1

Query: 944 MVSSQVVNTYPLSSYTFGTKEPKMEKDTSVADRLARMKVNYMKEGMRTSVEGMLLVQEHN 765
           M  SQVVNTYPLS+Y+FGTKEPK+EKDTSVADRLARMK+NYMKEGMRTSVEG+LLVQEHN
Sbjct: 1   MAMSQVVNTYPLSNYSFGTKEPKLEKDTSVADRLARMKINYMKEGMRTSVEGILLVQEHN 60

Query: 764 HPHILLLQXGNTFCKLPGGRLKPGENEIEGLKRKLTSKLGANSPALVPDWQIGECVAIWW 585
           HPHILLLQ GNTFCKLPGGRLKPGENE +GLKRKLTSKLG NS ALVPDW +GECVA WW
Sbjct: 61  HPHILLLQIGNTFCKLPGGRLKPGENEADGLKRKLTSKLGGNSAALVPDWTVGECVATWW 120

Query: 584 RPNFETIMYPYCPPHITKPKECKKLFLVHLSEREYFAVPQHLKLLAVPLFELYDYVQRYG 405
           RPNFET+MYPYCPPHITKPKECK+L++VHLSE+EYFAVP++LKLLAVPLFELYD VQRYG
Sbjct: 121 RPNFETMMYPYCPPHITKPKECKRLYIVHLSEKEYFAVPKNLKLLAVPLFELYDNVQRYG 180

Query: 404 PVISTIPQQLSRFQSQMITN 345
           PVISTIPQQLSRF   MI++
Sbjct: 181 PVISTIPQQLSRFHFNMISS 200

>ref|NP_194285.1| putative protein; protein id: At4g25550.1 [Arabidopsis thaliana]
           gi|7486820|pir||T05792 hypothetical protein M7J2.80 -
           Arabidopsis thaliana gi|2980795|emb|CAA18171.1| putative
           protein [Arabidopsis thaliana]
          Length = 210

 Score =  317 bits (813), Expect = 1e-85
 Identities = 154/198 (77%), Positives = 168/198 (84%), Gaps = 21/198 (10%)
 Frame = -1

Query: 944 MVSSQVVNTYPLSSYTFGTKEPKMEKDTSVADRLARMKVNYMKEGMRTSVEGMLLVQEHN 765
           M  SQVVNTYPLS+Y+FGTKEPK+EKDTSVADRLARMK+NYMKEGMRTSVEG+LLVQEHN
Sbjct: 1   MAMSQVVNTYPLSNYSFGTKEPKLEKDTSVADRLARMKINYMKEGMRTSVEGILLVQEHN 60

Query: 764 HPHILLLQXGNTFCKLPGGRLKPGEN---------------EIEGLKRKLTSKLGANSPA 630
           HPHILLLQ GNTFCKLPGGRLKPGEN               E +GLKRKLTSKLG NS A
Sbjct: 61  HPHILLLQIGNTFCKLPGGRLKPGENGIQLPPFWVYYVVSAEADGLKRKLTSKLGGNSAA 120

Query: 629 LVPDWQIGECVAIWWRPNFETIMYPYCPPHITKPK------ECKKLFLVHLSEREYFAVP 468
           LVPDW +GECVA WWRPNFET+MYPYCPPHITKPK      ECK+L++VHLSE+EYFAVP
Sbjct: 121 LVPDWTVGECVATWWRPNFETMMYPYCPPHITKPKVVKKHNECKRLYIVHLSEKEYFAVP 180

Query: 467 QHLKLLAVPLFELYDYVQ 414
           ++LKLLAVPLFELYD VQ
Sbjct: 181 KNLKLLAVPLFELYDNVQ 198

>pir||C85295 hypothetical protein AT4g25550 [imported] - Arabidopsis thaliana
           gi|7269405|emb|CAB81365.1| putative protein [Arabidopsis
           thaliana]
          Length = 209

 Score =  313 bits (803), Expect = 2e-84
 Identities = 152/195 (77%), Positives = 166/195 (84%), Gaps = 21/195 (10%)
 Frame = -1

Query: 935 SQVVNTYPLSSYTFGTKEPKMEKDTSVADRLARMKVNYMKEGMRTSVEGMLLVQEHNHPH 756
           SQVVNTYPLS+Y+FGTKEPK+EKDTSVADRLARMK+ YMKEGMRTSVEG+LLVQEHNHPH
Sbjct: 3   SQVVNTYPLSNYSFGTKEPKLEKDTSVADRLARMKIKYMKEGMRTSVEGILLVQEHNHPH 62

Query: 755 ILLLQXGNTFCKLPGGRLKPGEN---------------EIEGLKRKLTSKLGANSPALVP 621
           ILLLQ GNTFCKLPGGRLKPGEN               E +GLKRKLTSKLG NS ALVP
Sbjct: 63  ILLLQIGNTFCKLPGGRLKPGENGIQLPPVWVYYVVSAEADGLKRKLTSKLGGNSAALVP 122

Query: 620 DWQIGECVAIWWRPNFETIMYPYCPPHITKPK------ECKKLFLVHLSEREYFAVPQHL 459
           DW +GECVA WWRPNFET+MYPYCPPHITKPK      ECK+L++VHLSE+EYFAVP++L
Sbjct: 123 DWTVGECVATWWRPNFETMMYPYCPPHITKPKVVKKHNECKRLYIVHLSEKEYFAVPKNL 182

Query: 458 KLLAVPLFELYDYVQ 414
           KLLAVPLFELYD VQ
Sbjct: 183 KLLAVPLFELYDNVQ 197

>ref|NP_008937.1| cleavage and polyadenylation specific factor 5, 25 kD subunit;
           pre-mRNA cleavage factor Im (25kD); pre-mRNA cleavage
           factor Im, 25kD subunit [Homo sapiens]
           gi|2887288|emb|CAA05026.1| mRNA cleavage factor I 25 kDa
           subunit [Homo sapiens]
           gi|12655103|gb|AAH01403.1|AAH01403 pre-mRNA cleavage
           factor Im (25kD) [Homo sapiens]
          Length = 227

 Score =  243 bits (619), Expect = 5e-63
 Identities = 116/201 (57%), Positives = 145/201 (71%)
 Frame = -1

Query: 968 KKLEKKKKMVSSQVVNTYPLSSYTFGTKEPKMEKDTSVADRLARMKVNYMKEGMRTSVEG 789
           K +++ K +   + +N YPL++YTFGTKEP  EKD+SVA R  RM+  + K GMR +VEG
Sbjct: 23  KYIQQTKPLTLERTINLYPLTNYTFGTKEPLYEKDSSVAARFQRMREEFDKIGMRRTVEG 82

Query: 788 MLLVQEHNHPHILLLQXGNTFCKLPGGRLKPGENEIEGLKRKLTSKLGANSPALVPDWQI 609
           +L+V EH  PH+LLLQ G TF KLPGG L PGE+E+EGLKR +T  LG     ++ DW I
Sbjct: 83  VLIVHEHRLPHVLLLQLGTTFFKLPGGELNPGEDEVEGLKRLMTEILG-RQDGVLQDWVI 141

Query: 608 GECVAIWWRPNFETIMYPYCPPHITKPKECKKLFLVHLSEREYFAVPQHLKLLAVPLFEL 429
            +C+  WWRPNFE   YPY P HITKPKE KKLFLV L E+  FAVP++ KL+A PLFEL
Sbjct: 142 DDCIGNWWRPNFEPPQYPYIPAHITKPKEHKKLFLVQLQEKALFAVPKNYKLVAAPLFEL 201

Query: 428 YDYVQRYGPVISTIPQQLSRF 366
           YD    YGP+IS++PQ LSRF
Sbjct: 202 YDNAPGYGPIISSLPQLLSRF 222

>ref|NP_080899.1| cleavage and polyadenylation specific factor 5; RIKEN cDNA
           3110048P04 gene; cleavage and polyadenylation specific
           factor 5, 25 kD subunit [Mus musculus]
           gi|27658562|ref|XP_214640.1| similar to cleavage and
           polyadenylation specific factor 5, 25 kD subunit; RIKEN
           cDNA 3110048P04 gene [Mus musculus] [Rattus norvegicus]
           gi|12847971|dbj|BAB27778.1| unnamed protein product [Mus
           musculus] gi|12859636|dbj|BAB31718.1| unnamed protein
           product [Mus musculus] gi|14198424|gb|AAH08270.1| RIKEN
           cDNA 3110048P04 gene [Mus musculus]
          Length = 227

 Score =  243 bits (619), Expect = 5e-63
 Identities = 116/201 (57%), Positives = 145/201 (71%)
 Frame = -1

Query: 968 KKLEKKKKMVSSQVVNTYPLSSYTFGTKEPKMEKDTSVADRLARMKVNYMKEGMRTSVEG 789
           K +++ K +   + +N YPL++YTFGTKEP  EKD+SVA R  RM+  + K GMR +VEG
Sbjct: 23  KYIQQTKPLTLERTINLYPLTNYTFGTKEPLYEKDSSVAARFQRMREEFDKIGMRRTVEG 82

Query: 788 MLLVQEHNHPHILLLQXGNTFCKLPGGRLKPGENEIEGLKRKLTSKLGANSPALVPDWQI 609
           +L+V EH  PH+LLLQ G TF KLPGG L PGE+E+EGLKR +T  LG     ++ DW I
Sbjct: 83  VLIVHEHRLPHVLLLQLGTTFFKLPGGELNPGEDEVEGLKRLMTEILG-RQDGVLQDWVI 141

Query: 608 GECVAIWWRPNFETIMYPYCPPHITKPKECKKLFLVHLSEREYFAVPQHLKLLAVPLFEL 429
            +C+  WWRPNFE   YPY P HITKPKE KKLFLV L E+  FAVP++ KL+A PLFEL
Sbjct: 142 DDCIGNWWRPNFEPPQYPYIPAHITKPKEHKKLFLVQLQEKALFAVPKNYKLVAAPLFEL 201

Query: 428 YDYVQRYGPVISTIPQQLSRF 366
           YD    YGP+IS++PQ LSRF
Sbjct: 202 YDNAPGYGPIISSLPQLLSRF 222

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 875,744,728
Number of Sequences: 1393205
Number of extensions: 20245676
Number of successful extensions: 113160
Number of sequences better than 10.0: 47
Number of HSP's better than 10.0 without gapping: 69494
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 101425
length of database: 448,689,247
effective HSP length: 124
effective length of database: 275,931,827
effective search space used: 59601274632
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 MPD002h06_f AV770162 1 505
2 MPDL027e07_f AV777856 1 613
3 MFB034g12_f BP036524 17 149
4 MPD019b04_f AV771293 17 588
5 MPD091d09_f AV775975 59 551
6 MPD033c10_f AV772249 477 1023




Lotus japonicus
Kazusa DNA Research Institute