KMC011584A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC011584A_C01 KMC011584A_c01
tcgaatctCTCTCTCTCTCCATGGACCCATCCTCGATCTCCTCACCTTCAGCTTCTGCTC
CGCCCACAGCCACCGTCCCCTTCGCCGCCGACCCCAACAACCACCCTCCTCCTCCGCCCG
CTCCCGCCGATAATCCAGCCCATCCACCCTATGCTGAGATGATATACACAGCAATTGGGG
CTTTGAAGGAGAAAGACGGTTCGAGCAAGAGAGCGATAAACAAGTACATAGAGCAAGTCT
ACAAGGACCAGCTCACTCAGTCGCACGAATCGTTGTTGACTCACCACCTCAAGCGTTTGA
AGACCAACGGAATGCTCGTCATGGACAAGAAATCTTACAAGCTACCTGGATCTGCGCCAC
CGATACTTCCTCCGCCGCCGGAGAATATCGCCGCCGGTGCCGGTGCTGCTCCTTCTCCGG
CATCCAGGCCGAGAGGTCGTCCAAGGAAGGTTCAACCCCCCCAGCACCTGCCCCAGACCC
TAACCCTGGCTGGGGTTCAACTTCAACCACAGCTGCCGCCGCAGCAGCAGGTTCAGCCGG
AGGCGCAACCGCAAACGCAAAACCTGCCGCCTCAGTTCAACAACGTGCAGCAGAATGCTG
CTCCTGCTCAGAATGCCGAGCCTGTAGGGGCT


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC011584A_C01 KMC011584A_c01
         (632 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

emb|CAC69997.1| HMG I/Y like protein [Glycine max]                    146  3e-34
pir||T02029 DNA-binding protein pabf - common tobacco gi|555655|...   145  4e-34
pir||G96525 protein T1N15.25 [imported] - Arabidopsis thaliana g...   128  5e-29
ref|NP_175295.1| unknown protein; protein id: At1g48620.1 [Arabi...   128  5e-29
gb|AAG50847.1|AC074308_3 hypothetical protein, 3' partial [Arabi...   128  5e-29

>emb|CAC69997.1| HMG I/Y like protein [Glycine max]
          Length = 413

 Score =  146 bits (368), Expect = 3e-34
 Identities = 103/258 (39%), Positives = 127/258 (48%), Gaps = 55/258 (21%)
 Frame = +3

Query: 21  MDPSSISSPSASAPPTATVPFAADPNNHPPPPPAPADNPAHPPYAEMIYTAIGALKEKDG 200
           MDP+SI  P     P  TVPF  +P+NH  P  A   N  HPPY EMIYTAIGALKEKDG
Sbjct: 1   MDPTSIPPP-----PATTVPFTVEPSNHVTP--ADNTNTNHPPYDEMIYTAIGALKEKDG 53

Query: 201 SSKRAINKYIEQVYKDQLTQSHESLLTHHLKRLKTNGMLVMDKKSYKLPGSAP-PIL--- 368
           SSKRAI KY+EQVYKD L  +H +LLTHHL RLK+ G+L++ KKSYKLPGS P P+L   
Sbjct: 54  SSKRAIGKYMEQVYKD-LPPTHSALLTHHLNRLKSAGLLILVKKSYKLPGSDPLPVLQAQ 112

Query: 369 ------------------------------------PPPPENIAAGAGAAPSP-ASRPRG 437
                                               P  P+ IA   G +P P     RG
Sbjct: 113 KPRGRPPKLKSQPNTELTWPALALNDNPALQSAKRGPGRPKKIAGPVGVSPGPMVPGRRG 172

Query: 438 RP----------RKVQPPQHLPQTLTLAGVQLQPQLPPQQQ----VQPEAQPQTQNLPPQ 575
           RP          R  +PP+    +   +G++ +P  PP+ +    V P A P    LP  
Sbjct: 173 RPPGTGRSKLPKRPGRPPKPKSVSAISSGLKRRPGRPPKAESNVNVIPFAAPVAPGLPTV 232

Query: 576 FNNVQQNAAPAQNAEPVG 629
              V   + P  +  P G
Sbjct: 233 QPIVPTASVPNGSPRPRG 250

>pir||T02029 DNA-binding protein pabf - common tobacco gi|555655|gb|AAA50196.1|
           DNA-binding protein
          Length = 546

 Score =  145 bits (366), Expect = 4e-34
 Identities = 101/217 (46%), Positives = 121/217 (55%), Gaps = 16/217 (7%)
 Frame = +3

Query: 21  MDPSSISSPSASAPPT----ATVPFAADPNNHPPPPPAPADNPAHPPYAEMIYTAIGALK 188
           MDPS +  P+ +  PT      V  A  P    PPPPAP+ +P HPPYAEMI  AI ALK
Sbjct: 1   MDPS-MDLPTTTESPTFNSAQVVNHAPTPTPPQPPPPAPSFSPTHPPYAEMITAAITALK 59

Query: 189 EKDGSSKRAINKYIEQVYKDQLTQSHESLLTHHLKRLKTNGMLVMDKKSYKL---PGSAP 359
           E+DGSS+ AI KYI++VY + L  +H +LLTHHLKRLK +G L M K SY L   PGSAP
Sbjct: 60  ERDGSSRIAIAKYIDRVYTN-LPPNHSALLTHHLKRLKNSGYLAMVKHSYMLAGPPGSAP 118

Query: 360 PILPPPPENIAAGAGAAPSPAS-RPRGRPRKVQP-PQHLPQTLTLAGVQLQPQLPPQQQV 533
           P  PP  +  + G G   S  S R  GRP K++P  Q   Q    A VQ Q Q   Q Q 
Sbjct: 119 P--PPSADADSNGVGTDVSSLSKRKPGRPPKLKPEAQPHAQPQVQAQVQFQDQFQAQLQA 176

Query: 534 QPEAQPQTQ-----NLPPQFNNVQQNA--APAQNAEP 623
           Q +AQ Q Q        PQF  +QQ     P Q  +P
Sbjct: 177 QLQAQLQAQQQQAAQFQPQFQLIQQQPQYLPQQQFQP 213

>pir||G96525 protein T1N15.25 [imported] - Arabidopsis thaliana
           gi|8778700|gb|AAF79708.1|AC020889_16 T1N15.25
           [Arabidopsis thaliana]
          Length = 594

 Score =  128 bits (322), Expect = 5e-29
 Identities = 83/202 (41%), Positives = 109/202 (53%), Gaps = 23/202 (11%)
 Frame = +3

Query: 39  SSPSASAPPTATVPFAADPNNHPPPPPAPAD-NPAHPPYAEMIYTAIGALKEKDGSSKRA 215
           + P+A APP     + A P   P   P P   + +HPPY++MI TAI AL E DGSSK+A
Sbjct: 155 TGPTAVAPPNNIHLYQAAPPQQPQTSPVPPHPSISHPPYSDMICTAIAALNEPDGSSKQA 214

Query: 216 INKYIEQVYKDQLTQSHESLLTHHLKRLKTNGMLVMDKKSYKLPGSAPPILPPPPENIAA 395
           I++YIE++Y   +  +H +LLTHHLK LKT+G+LVM KKSYKL  S PP  PPPP ++A 
Sbjct: 215 ISRYIERIYTG-IPTAHGALLTHHLKTLKTSGILVMVKKSYKL-ASTPP--PPPPTSVAP 270

Query: 396 G---------------------AGAAPSPASRPRGRPRKVQPPQHLPQTLTLAGVQL-QP 509
                                 A + P    R RGRP K +P    PQ LT   +   Q 
Sbjct: 271 SLEPPRSDFIVNENQPLPDPVLASSTPQTIKRGRGRPPKAKPDVVQPQPLTNGKLTWEQS 330

Query: 510 QLPPQQQVQPEAQPQTQNLPPQ 575
           +LP  +  + + QP    L PQ
Sbjct: 331 ELPVSRPEEIQIQPPQLPLQPQ 352

>ref|NP_175295.1| unknown protein; protein id: At1g48620.1 [Arabidopsis thaliana]
          Length = 479

 Score =  128 bits (322), Expect = 5e-29
 Identities = 83/202 (41%), Positives = 109/202 (53%), Gaps = 23/202 (11%)
 Frame = +3

Query: 39  SSPSASAPPTATVPFAADPNNHPPPPPAPAD-NPAHPPYAEMIYTAIGALKEKDGSSKRA 215
           + P+A APP     + A P   P   P P   + +HPPY++MI TAI AL E DGSSK+A
Sbjct: 40  TGPTAVAPPNNIHLYQAAPPQQPQTSPVPPHPSISHPPYSDMICTAIAALNEPDGSSKQA 99

Query: 216 INKYIEQVYKDQLTQSHESLLTHHLKRLKTNGMLVMDKKSYKLPGSAPPILPPPPENIAA 395
           I++YIE++Y   +  +H +LLTHHLK LKT+G+LVM KKSYKL  S PP  PPPP ++A 
Sbjct: 100 ISRYIERIYTG-IPTAHGALLTHHLKTLKTSGILVMVKKSYKL-ASTPP--PPPPTSVAP 155

Query: 396 G---------------------AGAAPSPASRPRGRPRKVQPPQHLPQTLTLAGVQL-QP 509
                                 A + P    R RGRP K +P    PQ LT   +   Q 
Sbjct: 156 SLEPPRSDFIVNENQPLPDPVLASSTPQTIKRGRGRPPKAKPDVVQPQPLTNGKLTWEQS 215

Query: 510 QLPPQQQVQPEAQPQTQNLPPQ 575
           +LP  +  + + QP    L PQ
Sbjct: 216 ELPVSRPEEIQIQPPQLPLQPQ 237

>gb|AAG50847.1|AC074308_3 hypothetical protein, 3' partial [Arabidopsis thaliana]
          Length = 332

 Score =  128 bits (322), Expect = 5e-29
 Identities = 83/202 (41%), Positives = 109/202 (53%), Gaps = 23/202 (11%)
 Frame = +3

Query: 39  SSPSASAPPTATVPFAADPNNHPPPPPAPAD-NPAHPPYAEMIYTAIGALKEKDGSSKRA 215
           + P+A APP     + A P   P   P P   + +HPPY++MI TAI AL E DGSSK+A
Sbjct: 40  TGPTAVAPPNNIHLYQAAPPQQPQTSPVPPHPSISHPPYSDMICTAIAALNEPDGSSKQA 99

Query: 216 INKYIEQVYKDQLTQSHESLLTHHLKRLKTNGMLVMDKKSYKLPGSAPPILPPPPENIAA 395
           I++YIE++Y   +  +H +LLTHHLK LKT+G+LVM KKSYKL  S PP  PPPP ++A 
Sbjct: 100 ISRYIERIYTG-IPTAHGALLTHHLKTLKTSGILVMVKKSYKL-ASTPP--PPPPTSVAP 155

Query: 396 G---------------------AGAAPSPASRPRGRPRKVQPPQHLPQTLTLAGVQL-QP 509
                                 A + P    R RGRP K +P    PQ LT   +   Q 
Sbjct: 156 SLEPPRSDFIVNENQPLPDPVLASSTPQTIKRGRGRPPKAKPDVVQPQPLTNGKLTWEQS 215

Query: 510 QLPPQQQVQPEAQPQTQNLPPQ 575
           +LP  +  + + QP    L PQ
Sbjct: 216 ELPVSRPEEIQIQPPQLPLQPQ 237

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 676,695,841
Number of Sequences: 1393205
Number of extensions: 19811763
Number of successful extensions: 353436
Number of sequences better than 10.0: 4349
Number of HSP's better than 10.0 without gapping: 125790
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 280386
length of database: 448,689,247
effective HSP length: 118
effective length of database: 284,291,057
effective search space used: 26154777244
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 MFBL037b08_f BP043105 1 357
2 SPDL008c01_f BP052459 9 428
3 MPD028g06_f AV771942 95 632




Lotus japonicus
Kazusa DNA Research Institute