Comparison of KIAA cDNA sequences between mouse and human (KIAA1125)

<< Original sequence data >>

mouse  mKIAA1125 (mbg01394)     length:   6180 bp
human   KIAA1125  (hk07594)     length:   4147 bp


<< Aligned sequence information (excluding the stop codon, if present) >>

----------------------------------------------------------
            length    #match  #mismatch   %diff
----------------------------------------------------------
DNA

  CDS1 :     3622      3143      479      13.22
  Total:     3622      3143      479      13.22

  3'UTR:      289       167      122      42.21

amino acid

  CDS1 :     1214      1098      116       9.56
  Total:     1214      1098      116       9.56
----------------------------------------------------------
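The %diff values in the table above are consistent with the number of mismatches divided by the aligned length (#match + #mismatch), expressed as a percentage. The following is a minimal sketch under that assumption; the function name `percent_diff` is hypothetical and is not part of the tool that produced this report.

```python
def percent_diff(n_match: int, n_mismatch: int) -> float:
    """Percentage of aligned positions that differ (assumed definition)."""
    length = n_match + n_mismatch
    if length == 0:
        raise ValueError("no aligned positions")
    return 100.0 * n_mismatch / length


# (#match, #mismatch) pairs copied from the summary table above.
rows = {
    "DNA CDS1": (3143, 479),
    "DNA 3'UTR": (167, 122),
    "amino acid CDS1": (1098, 116),
}

for name, (n_match, n_mismatch) in rows.items():
    length = n_match + n_mismatch
    print(f"{name}: length={length} %diff={percent_diff(n_match, n_mismatch):.2f}")
```

With the table's counts, this reproduces the listed lengths (3622, 289, 1214) and the listed %diff values to two decimal places.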


<< Alignment region (including the stop codon, if present) >>

----------------------------------------------------------
                    cDNA      cDNA original    amino acid
----------------------------------------------------------
  CDS1 : mouse  2138 -  5899   2138 -  5899      1 -  1254
         human   201 -  3815    198 -  3815      2 -  1206
  3'UTR: mouse  5900 -  6180
         human  3816 -  4147
----------------------------------------------------------
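The coordinates in the table above appear to be 1-based and inclusive: the mouse CDS1 range 2138 - 5899 spans 3762 bases, which equals 1254 codons and matches the amino acid range 1 - 1254. The following is a small sketch of that arithmetic, assuming inclusive coordinates; the helper name `span_length` is hypothetical.

```python
def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# cDNA coordinate ranges copied from the alignment region table above.
regions = {
    "CDS1 mouse": (2138, 5899),
    "CDS1 human": (201, 3815),
    "3'UTR mouse": (5900, 6180),
    "3'UTR human": (3816, 4147),
}

for name, (start, end) in regions.items():
    print(f"{name}: {span_length(start, end)} bp")

# For the coding regions, the span in bases is three times the span in codons.
mouse_codons = span_length(2138, 5899) // 3
human_codons = span_length(201, 3815) // 3
print(mouse_codons, human_codons)
```

These spans include the stop codon and ignore alignment gaps, so they are not expected to equal the aligned lengths in the summary table.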


<< Alignment >>

*--[ CDS1 ]--*
             1 ----+----*----+----*----+----*----+----*----+----* 50
             1 F  N  R  L  A  E  E  E  I  K  T  E  Q  E  V  V  E  17
mbg01394  2138 TTTAACCGCTTGGCTGAAGAGGAAATAAAAACTGAGCAGGAGGTGGTGGA 2187
               |||    ||||||||||||||||||||||||| || ||||||||||| ||
hk07594    201 TTTGTTGGCTTGGCTGAAGAGGAAATAAAAACAGAACAGGAGGTGGTAGA 250
             2 F  V  G  L  A  E  E  E  I  K  T  E  Q  E  V  V  E  18

            51 ----+----*----+----*----+----*----+----*----+----* 100
            18  G  M  D  I  S  T  R  S  K  E  P  K  E  D  P  V  S 34
mbg01394  2188 GGGAATGGATATCTCTACTCGCTCCAAAGAACCTAAGGAAGATCCTGTCT 2237
               ||| ||||||||||||||||||||||||            ||||||| ||
hk07594    251 GGGCATGGATATCTCTACTCGCTCCAAA............GATCCTGGCT 288
            19  G  M  D  I  S  T  R  S  K  .  .  .  .  D  P  G  S 31

           101 ----+----*----+----*----+----*----+----*----+----* 150
            35   T  E  K  T  A  P  K  R  K  F  P  S  P  P  H  S   50
mbg01394  2238 CTACAGAGAAAACGGCCCCGAAACGGAAGTTCCCCAGCCCTCCACATTCC 2287
               || |||||| ||| |||| |||| | ||||||||||||||||||||||| 
hk07594    289 CTGCAGAGAGAACAGCCCAGAAAAGAAAGTTCCCCAGCCCTCCACATTCT 338
            32   A  E  R  T  A  Q  K  R  K  F  P  S  P  P  H  S   47

           151 ----+----*----+----*----+----*----+----*----+----* 200
            51 S  N  G  H  S  P  Q  D  S  S  T  S  P  I  K  K  K  67
mbg01394  2288 TCCAATGGCCATTCGCCCCAAGACTCATCCACGAGCCCCATTAAAAAGAA 2337
               ||||||||||| ||||| || ||| |||| || |||||||||||||||||
hk07594    339 TCCAATGGCCACTCGCCGCAGGACACATCAACAAGCCCCATTAAAAAGAA 388
            48 S  N  G  H  S  P  Q  D  T  S  T  S  P  I  K  K  K  64

           201 ----+----*----+----*----+----*----+----*----+----* 250
            68  K  K  P  G  L  L  N  S  S  N  K  E  Q  S  E  L  R 84
mbg01394  2338 AAAGAAACCCGGCTTACTCAACAGTAGCAATAAGGAACAGTCAGAGCTAA 2387
               ||||||||| |||||||| ||||||| ||||||||| |||||||| ||||
hk07594    389 AAAGAAACCTGGCTTACTGAACAGTAACAATAAGGAGCAGTCAGAACTAA 438
            65  K  K  P  G  L  L  N  S  N  N  K  E  Q  S  E  L  R 81

           251 ----+----*----+----*----+----*----+----*----+----* 300
            85   H  G  P  F  Y  Y  M  K  Q  P  L  T  T  D  P  V   100
mbg01394  2388 GACATGGTCCGTTTTACTATATGAAGCAGCCACTCACCACAGACCCTGTT 2437
               ||||||||||||||||||||||||||||||||||||||||||||||||||
hk07594    439 GACATGGTCCGTTTTACTATATGAAGCAGCCACTCACCACAGACCCTGTT 488
            82   H  G  P  F  Y  Y  M  K  Q  P  L  T  T  D  P  V   97

           301 ----+----*----+----*----+----*----+----*----+----* 350
           101 D  V  V  P  Q  D  G  R  N  D  F  Y  C  W  V  C  H  117
mbg01394  2438 GATGTTGTACCGCAGGACGGACGGAATGACTTCTATTGCTGGGTTTGTCA 2487
               ||||||||||||||||| ||||||||||| ||||| ||||||||||||||
hk07594    489 GATGTTGTACCGCAGGATGGACGGAATGATTTCTACTGCTGGGTTTGTCA 538
            98 D  V  V  P  Q  D  G  R  N  D  F  Y  C  W  V  C  H  114

           351 ----+----*----+----*----+----*----+----*----+----* 400
           118  R  E  G  Q  V  L  C  C  E  L  C  P  R  V  Y  H  A 134
mbg01394  2488 CCGGGAAGGACAAGTCCTTTGCTGTGAGCTCTGTCCCCGGGTTTATCACG 2537
               ||||||||| ||||||||||||||||||||||||||||||||||||||||
hk07594    539 CCGGGAAGGCCAAGTCCTTTGCTGTGAGCTCTGTCCCCGGGTTTATCACG 588
           115  R  E  G  Q  V  L  C  C  E  L  C  P  R  V  Y  H  A 131

           401 ----+----*----+----*----+----*----+----*----+----* 450
           135   K  C  L  R  L  T  S  E  P  E  G  D  W  F  C  P   150
mbg01394  2538 CTAAGTGTCTGAGACTGACATCGGAGCCAGAGGGGGACTGGTTTTGTCCT 2587
               ||||||||||||||||||||||||| ||||||||||||||||||||||||
hk07594    589 CTAAGTGTCTGAGACTGACATCGGAACCAGAGGGGGACTGGTTTTGTCCT 638
           132   K  C  L  R  L  T  S  E  P  E  G  D  W  F  C  P   147

           451 ----+----*----+----*----+----*----+----*----+----* 500
           151 E  C  E  K  I  T  V  A  E  C  I  E  T  Q  S  K  A  167
mbg01394  2588 GAATGTGAGAAGATTACAGTAGCAGAATGCATCGAGACGCAGAGCAAAGC 2637
               ||||||||||| |||||||||||||||||||||||||| ||||| |||||
hk07594    639 GAATGTGAGAAAATTACAGTAGCAGAATGCATCGAGACCCAGAGTAAAGC 688
           148 E  C  E  K  I  T  V  A  E  C  I  E  T  Q  S  K  A  164

           501 ----+----*----+----*----+----*----+----*----+----* 550
           168  M  T  M  L  T  I  E  Q  L  S  Y  L  L  K  F  A  I 184
mbg01394  2638 CATGACCATGCTGACCATTGAACAACTGTCCTACCTGCTCAAGTTTGCCA 2687
               |||||| ||||| |||||||||||  | ||||||||||||||||||||||
hk07594    689 CATGACAATGCTCACCATTGAACAGTTATCCTACCTGCTCAAGTTTGCCA 738
           165  M  T  M  L  T  I  E  Q  L  S  Y  L  L  K  F  A  I 181

           551 ----+----*----+----*----+----*----+----*----+----* 600
           185   Q  K  M  K  Q  P  G  T  D  A  F  Q  K  P  V  P   200
mbg01394  2688 TTCAGAAAATGAAGCAGCCAGGGACGGATGCATTCCAGAAGCCTGTTCCA 2737
               ||||||||||||| ||||||||||| ||||||||||||||||| ||||||
hk07594    739 TTCAGAAAATGAAACAGCCAGGGACAGATGCATTCCAGAAGCCCGTTCCA 788
           182   Q  K  M  K  Q  P  G  T  D  A  F  Q  K  P  V  P   197

           601 ----+----*----+----*----+----*----+----*----+----* 650
           201 L  E  Q  H  P  D  Y  A  E  Y  I  F  H  P  M  D  L  217
mbg01394  2738 TTGGAGCAACACCCTGACTATGCAGAATATATTTTCCACCCCATGGACCT 2787
               ||||| || |||||||||||||| ||||| || ||||| || ||||||||
hk07594    789 TTGGAACAGCACCCTGACTATGCGGAATACATCTTCCATCCAATGGACCT 838
           198 L  E  Q  H  P  D  Y  A  E  Y  I  F  H  P  M  D  L  214

           651 ----+----*----+----*----+----*----+----*----+----* 700
           218  C  T  L  E  K  N  A  K  K  K  M  Y  G  C  T  E  A 234
mbg01394  2788 TTGTACATTGGAAAAGAATGCAAAAAAGAAGATGTACGGCTGCACAGAAG 2837
               ||||||||||||||||||||| |||||||| ||||| |||||||||||||
hk07594    839 TTGTACATTGGAAAAGAATGCGAAAAAGAAAATGTATGGCTGCACAGAAG 888
           215  C  T  L  E  K  N  A  K  K  K  M  Y  G  C  T  E  A 231

           701 ----+----*----+----*----+----*----+----*----+----* 750
           235   F  L  A  D  A  K  W  I  L  H  N  C  I  I  Y  N   250
mbg01394  2838 CCTTCCTGGCCGATGCCAAGTGGATCCTGCACAACTGCATTATTTATAAT 2887
               |||||||||| ||||| ||||||||  ||||||||||||| |||||||||
hk07594    889 CCTTCCTGGCTGATGCAAAGTGGATTTTGCACAACTGCATCATTTATAAT 938
           232   F  L  A  D  A  K  W  I  L  H  N  C  I  I  Y  N   247

           751 ----+----*----+----*----+----*----+----*----+----* 800
           251 G  G  N  H  K  L  T  Q  I  A  K  V  V  I  K  I  C  267
mbg01394  2888 GGGGGAAATCACAAGTTGACGCAAATAGCAAAAGTCGTCATCAAAATCTG 2937
               |||||||||||||| |||||||||||||| ||||| ||||||||||||||
hk07594    939 GGGGGAAATCACAAATTGACGCAAATAGCGAAAGTAGTCATCAAAATCTG 988
           248 G  G  N  H  K  L  T  Q  I  A  K  V  V  I  K  I  C  264

           801 ----+----*----+----*----+----*----+----*----+----* 850
           268  E  H  E  M  N  E  I  E  V  C  P  E  C  Y  L  A  A 284
mbg01394  2938 TGAGCACGAGATGAATGAAATCGAAGTCTGTCCAGAATGTTATCTTGCAG 2987
               ||| || |||||||||||||||||||| ||||||||||||||||| || |
hk07594    989 TGAACATGAGATGAATGAAATCGAAGTATGTCCAGAATGTTATCTAGCTG 1038
           265  E  H  E  M  N  E  I  E  V  C  P  E  C  Y  L  A  A 281

           851 ----+----*----+----*----+----*----+----*----+----* 900
           285   C  Q  K  R  D  N  W  F  C  E  P  C  S  N  P  H   300
mbg01394  2988 CTTGCCAAAAACGAGACAACTGGTTCTGTGAGCCCTGTAGCAATCCGCAC 3037
               |||||||||||||||| |||||||| |||||||| ||||||||||| || 
hk07594   1039 CTTGCCAAAAACGAGATAACTGGTTTTGTGAGCCTTGTAGCAATCCACAT 1088
           282   C  Q  K  R  D  N  W  F  C  E  P  C  S  N  P  H   297

           901 ----+----*----+----*----+----*----+----*----+----* 950
           301 P  L  V  W  A  K  L  K  G  F  P  F  W  P  A  K  A  317
mbg01394  3038 CCTTTGGTCTGGGCAAAACTGAAAGGATTTCCATTCTGGCCAGCGAAAGC 3087
               |||||||||||||| |||||||| || |||||||||||||| || |||||
hk07594   1089 CCTTTGGTCTGGGCCAAACTGAAGGGGTTTCCATTCTGGCCTGCAAAAGC 1138
           298 P  L  V  W  A  K  L  K  G  F  P  F  W  P  A  K  A  314

           951 ----+----*----+----*----+----*----+----*----+----* 1000
           318  L  R  D  K  D  G  Q  V  D  A  R  F  F  G  Q  H  D 334
mbg01394  3088 TCTGAGGGACAAAGACGGGCAGGTTGACGCCCGTTTCTTTGGACAACATG 3137
               ||| ||||| |||||||||||||| || ||||| ||||||||||||||||
hk07594   1139 TCTAAGGGATAAAGACGGGCAGGTCGATGCCCGATTCTTTGGACAACATG 1188
           315  L  R  D  K  D  G  Q  V  D  A  R  F  F  G  Q  H  D 331

          1001 ----+----*----+----*----+----*----+----*----+----* 1050
           335   R  A  W  V  P  V  N  N  C  Y  L  M  S  K  E  I   350
mbg01394  3138 ACAGAGCCTGGGTTCCAGTCAATAATTGCTACCTCATGTCTAAAGAAATC 3187
               |||| |||||||||||| | ||||||||||||||||||||||||||||| 
hk07594   1189 ACAGGGCCTGGGTTCCAATAAATAATTGCTACCTCATGTCTAAAGAAATT 1238
           332   R  A  W  V  P  I  N  N  C  Y  L  M  S  K  E  I   347

          1051 ----+----*----+----*----+----*----+----*----+----* 1100
           351 P  F  S  V  K  K  T  K  S  I  F  N  S  A  M  Q  E  367
mbg01394  3188 CCCTTTTCTGTGAAAAAGACTAAAAGTATCTTCAACAGCGCCATGCAAGA 3237
               || |||||||||||||||||||| || ||||||||||| |||||||||||
hk07594   1239 CCTTTTTCTGTGAAAAAGACTAAGAGCATCTTCAACAGTGCCATGCAAGA 1288
           348 P  F  S  V  K  K  T  K  S  I  F  N  S  A  M  Q  E  364

          1101 ----+----*----+----*----+----*----+----*----+----* 1150
           368  M  E  V  Y  V  E  N  I  R  R  K  F  G  V  F  N  Y 384
mbg01394  3238 GATGGAAGTTTACGTGGAGAACATACGGAGGAAGTTTGGGGTTTTTAATT 3287
               |||||| ||||||||||||||||| || ||||||||||||||||||||||
hk07594   1289 GATGGAGGTTTACGTGGAGAACATCCGCAGGAAGTTTGGGGTTTTTAATT 1338
           365  M  E  V  Y  V  E  N  I  R  R  K  F  G  V  F  N  Y 381

          1151 ----+----*----+----*----+----*----+----*----+----* 1200
           385   S  P  F  R  T  P  Y  T  P  N  N  Q  Y  Q  M  L   400
mbg01394  3288 ACTCCCCGTTCAGGACGCCCTACACGCCCAACAACCAGTACCAAATGCTG 3337
               |||| || || ||||| |||||||| ||||||| |||||| |||||||||
hk07594   1339 ACTCTCCATTTAGGACACCCTACACACCCAACAGCCAGTATCAAATGCTG 1388
           382   S  P  F  R  T  P  Y  T  P  N  S  Q  Y  Q  M  L   397

          1201 ----+----*----+----*----+----*----+----*----+----* 1250
           401 L  D  P  S  N  P  S  A  G  T  A  K  T  D  K  Q  E  417
mbg01394  3338 CTGGATCCCAGCAACCCCAGCGCGGGCACAGCCAAGACAGACAAACAGGA 3387
               || ||||||| |||||||||||| ||||| ||||||| |||||| |||||
hk07594   1389 CTCGATCCCACCAACCCCAGCGCCGGCACTGCCAAGATAGACAAGCAGGA 1438
           398 L  D  P  T  N  P  S  A  G  T  A  K  I  D  K  Q  E  414

          1251 ----+----*----+----*----+----*----+----*----+----* 1300
           418  K  V  K  L  N  F  D  M  T  A  S  P  K  I  L  L  S 434
mbg01394  3388 GAAGGTGAAGCTTAATTTTGACATGACAGCGTCCCCCAAGATCCTTCTGA 3437
               |||||| ||||| || ||||||||||| || ||||||||||||||  |||
hk07594   1439 GAAGGTCAAGCTCAACTTTGACATGACGGCATCCCCCAAGATCCTGATGA 1488
           415  K  V  K  L  N  F  D  M  T  A  S  P  K  I  L  M  S 431

          1301 ----+----*----+----*----+----*----+----*----+----* 1350
           435   K  P  L  L  S  G  G  A  G  R  R  I  S  L  S  D   450
mbg01394  3438 GCAAGCCCCTTCTGAGCGGGGGTGCCGGCCGCAGGATCTCCCTGTCCGAC 3487
               |||||||  | ||||| |||||  | |||||| |||| ||| |||| || 
hk07594   1489 GCAAGCCTGTGCTGAGTGGGGGCACAGGCCGCCGGATTTCCTTGTCGGAT 1538
           432   K  P  V  L  S  G  G  T  G  R  R  I  S  L  S  D   447

          1351 ----+----*----+----*----+----*----+----*----+----* 1400
           451 M  P  R  S  P  T  S  T  N  S  S  V  H  T  G  S  D  467
mbg01394  3488 ATGCCTCGCTCCCCCACCAGTACGAACTCTTCCGTGCACACGGGCTCCGA 3537
               ||||| ||||||||||  || || |||||||| |||||||||||||||||
hk07594   1539 ATGCCGCGCTCCCCCATGAGCACAAACTCTTCTGTGCACACGGGCTCCGA 1588
           448 M  P  R  S  P  M  S  T  N  S  S  V  H  T  G  S  D  464

          1401 ----+----*----+----*----+----*----+----*----+----* 1450
           468  V  E  Q  D  P  E  K  K  A  P  S  S  H  F  S  A  S 484
mbg01394  3538 TGTGGAGCAGGACCCCGAGAAGAAGGCCCCGTCCAGCCACTTCAGCGCAA 3587
                |||||||||||  | |||||||||||| |||| ||||||||||| || |
hk07594   1589 CGTGGAGCAGGATGCTGAGAAGAAGGCCACGTCGAGCCACTTCAGTGCGA 1638
           465  V  E  Q  D  A  E  K  K  A  T  S  S  H  F  S  A  S 481

          1451 ----+----*----+----*----+----*----+----*----+----* 1500
           485   E  E  S  M  D  F  L  D  K  S  T  A  S  P  A  S   500
mbg01394  3588 GCGAGGAGTCCATGGACTTCCTTGATAAGAGCACAGCTTCTCCAGCCTCC 3637
               |||||||||||||||||||||| ||||||||||||||||| |||||||||
hk07594   1639 GCGAGGAGTCCATGGACTTCCTGGATAAGAGCACAGCTTCACCAGCCTCC 1688
           482   E  E  S  M  D  F  L  D  K  S  T  A  S  P  A  S   497

          1501 ----+----*----+----*----+----*----+----*----+----* 1550
           501 T  K  T  G  Q  A  G  S  L  S  G  S  P  K  P  F  S  517
mbg01394  3638 ACCAAGACGGGGCAAGCCGGGAGCTTGTCTGGCAGCCCAAAGCCTTTCTC 3687
               ||||||||||| ||||| ||||| || || |||||||||||||| |||||
hk07594   1689 ACCAAGACGGGACAAGCAGGGAGTTTATCCGGCAGCCCAAAGCCCTTCTC 1738
           498 T  K  T  G  Q  A  G  S  L  S  G  S  P  K  P  F  S  514

          1551 ----+----*----+----*----+----*----+----*----+----* 1600
           518  P  Q  A  P  T  P  I  M  T  K  P  D  K  T  S  T  S 534
mbg01394  3688 TCCGCAAGCGCCGACACCCATCATGACAAAACCCGACAAGACTTCCACCT 3737
               ||| |||  | |  | || |||| ||| ||| | ||||| || |||||| 
hk07594   1739 TCCTCAACTGTCAGCTCCTATCACGACGAAAACGGACAAAACCTCCACC. 1787
           515  P  Q  L  S  A  P  I  T  T  K  T  D  K  T  S  T  . 530

          1601 ----+----*----+----*----+----*----+----*----+----* 1650
           535   T  T  G  S  I  L  N  L  N  L  D  R  S  K  A  E   550
mbg01394  3738 CCACCACCGGGAGCATCCTGAACCTGAACTTGGATCGAAGCAAGGCCGAG 3787
                    ||||| ||||||||||| || ||| ||||||||||||| || |||
hk07594   1788 .....ACCGGCAGCATCCTGAATCTTAACCTGGATCGAAGCAAAGCTGAG 1832
           531   .  T  G  S  I  L  N  L  N  L  D  R  S  K  A  E   545

          1651 ----+----*----+----*----+----*----+----*----+----* 1700
           551 M  D  L  K  E  L  S  E  S  V  Q  Q  Q  S  A  P  V  567
mbg01394  3788 ATGGACCTGAAGGAGCTGAGCGAGTCGGTCCAGCAGCAGTCAGCCCCCGT 3837
               |||||  |||||||||||||||||||||||||||| |||||  |||| ||
hk07594   1833 ATGGATTTGAAGGAGCTGAGCGAGTCGGTCCAGCAACAGTCCACCCCTGT 1882
           546 M  D  L  K  E  L  S  E  S  V  Q  Q  Q  S  T  P  V  562

          1701 ----+----*----+----*----+----*----+----*----+----* 1750
           568  P  L  I  S  P  K  R  Q  I  R  S  R  F  Q  L  N  L 584
mbg01394  3838 CCCTCTCATCTCTCCCAAGCGGCAGATTCGAAGCCGGTTCCAGCTCAACC 3887
                |||||||||||||||||||| |||||||| ||| |||||||||| || |
hk07594   1883 TCCTCTCATCTCTCCCAAGCGCCAGATTCGTAGCAGGTTCCAGCTGAATC 1932
           563  P  L  I  S  P  K  R  Q  I  R  S  R  F  Q  L  N  L 579

          1751 ----+----*----+----*----+----*----+----*----+----* 1800
           585   D  K  T  I  E  S  C  K  A  Q  L  G  I  N  E  I   600
mbg01394  3888 TGGACAAGACCATAGAGAGTTGCAAAGCACAGCTAGGCATAAATGAGATC 3937
               | |||||||||||||||||||||||||||||  ||||||||||||| |||
hk07594   1933 TTGACAAGACCATAGAGAGTTGCAAAGCACAATTAGGCATAAATGAAATC 1982
           580   D  K  T  I  E  S  C  K  A  Q  L  G  I  N  E  I   595

          1801 ----+----*----+----*----+----*----+----*----+----* 1850
           601 S  E  D  V  Y  T  A  V  E  H  S  D  S  E  D  S  E  617
mbg01394  3938 TCAGAGGATGTTTATACAGCCGTGGAGCACAGCGATTCCGAGGACTCCGA 3987
               || || ||||| ||||| ||||| |||||||||||||| ||||| || ||
hk07594   1983 TCGGAAGATGTCTATACGGCCGTAGAGCACAGCGATTCGGAGGATTCTGA 2032
           596 S  E  D  V  Y  T  A  V  E  H  S  D  S  E  D  S  E  612

          1851 ----+----*----+----*----+----*----+----*----+----* 1900
           618  K  S  E  S  S  D  S  E  Y  V  S  D  E  E  Q  K  P 634
mbg01394  3988 AAAGTCGGAGAGCAGCGACAGCGAGTACGTCAGCGATGAGGAACAGAAGC 4037
                ||||| || || ||||| || |||||  |||| ||||| || |||||| 
hk07594   2033 GAAGTCAGATAGTAGCGATAGTGAGTATATCAGTGATGATGAGCAGAAGT 2082
           613  K  S  D  S  S  D  S  E  Y  I  S  D  D  E  Q  K  S 629

          1901 ----+----*----+----*----+----*----+----*----+----* 1950
           635   K  N  E  P  E  D  P  E  D  K  E  G  S  R  V  D   650
mbg01394  4038 CCAAGAATGAGCCCGAGGACCCCGAGGACAAAGAGGGGAGTCGGGTGGAC 4087
               | ||||| ||||| || ||| | ||||||||||| ||  ||| | |||||
hk07594   2083 CTAAGAACGAGCCAGAAGACACAGAGGACAAAGAAGGTTGTCAGATGGAC 2132
           630   K  N  E  P  E  D  T  E  D  K  E  G  C  Q  M  D   645

          1951 ----+----*----+----*----+----*----+----*----+----* 2000
           651 K  E  A  P  A  I  K  R  K  P  K  P  T  N  Q  V  E  667
mbg01394  4088 AAAGAGGCCCCTGCCATCAAAAGGAAGCCCAAACCCACAAACCAGGTAGA 4137
               |||||| |  ||||  | ||||  |||||||| || |||||||  || ||
hk07594   2133 AAAGAGCCATCTGCTGTTAAAAAAAAGCCCAAGCCTACAAACCCAGTGGA 2182
           646 K  E  P  S  A  V  K  K  K  P  K  P  T  N  P  V  E  662

          2001 ----+----*----+----*----+----*----+----*----+----* 2050
           668  V  K  E  E  A  K  S  N  S  P  V  S  E  K  P  D  P 684
mbg01394  4138 GGTCAAAGAGGAAGCGAAGAGCAACTCTCCTGTCAGCGAGAAGCCGGACC 4187
               | | ||||||||   ||| ||||  || || | |||||||||| | ||||
hk07594   2183 GATTAAAGAGGAGCTGAAAAGCACGTCACCAGCCAGCGAGAAGGCAGACC 2232
           663  I  K  E  E  L  K  S  T  S  P  A  S  E  K  A  D  P 679

          2051 ----+----*----+----*----+----*----+----*----+----* 2100
           685   T  P  A  K  D  K  A  S  P  E  P  E  K  D  F  V   700
mbg01394  4188 CCACACCCGCCAAGGACAAGGCCAGCCCAGAGCCTGAGAAGGACTTTGTA 4237
               |   | | | |||||||||||||||||| ||||||||||||||||||   
hk07594   2233 CTGGAGCAGTCAAGGACAAGGCCAGCCCTGAGCCTGAGAAGGACTTTTCC 2282
           680   G  A  V  K  D  K  A  S  P  E  P  E  K  D  F  S   695

          2101 ----+----*----+----*----+----*----+----*----+----* 2150
           701 E  K  A  K  P  S  P  H  P  T  K  D  K  L  K  G  K  717
mbg01394  4238 GAGAAAGCAAAGCCATCACCTCATCCCACAAAGGACAAACTGAAAGGAAA 4287
               || || ||||| || |||||||| |||| |||||| |||||||| |||||
hk07594   2283 GAAAAGGCAAAACCTTCACCTCACCCCATAAAGGATAAACTGAAGGGAAA 2332
           696 E  K  A  K  P  S  P  H  P  I  K  D  K  L  K  G  K  712

          2151 ----+----*----+----*----+----*----+----*----+----* 2200
           718  D  E  T  D  S  P  T  V  H  L  G  L  D  S  D  S  E 734
mbg01394  4288 GGATGAAACGGATTCTCCCACAGTGCACTTGGGCTTGGATTCGGACTCGG 4337
                ||||| |||||||| || ||||| || |||||| |||| || || || |
hk07594   2333 AGATGAGACGGATTCCCCAACAGTCCATTTGGGCCTGGACTCTGATTCAG 2382
           713  D  E  T  D  S  P  T  V  H  L  G  L  D  S  D  S  E 729

          2201 ----+----*----+----*----+----*----+----*----+----* 2250
           735   S  E  L  V  I  D  L  G  E  D  P  S  G  R  E  G   750
mbg01394  4338 AGAGCGAACTTGTCATAGACTTAGGAGAGGATCCTTCTGGGAGGGAGGGT 4387
               ||||||||||||||||||| |||||||| || | ||||||| ||||||||
hk07594   2383 AGAGCGAACTTGTCATAGATTTAGGAGAAGACCATTCTGGGCGGGAGGGT 2432
           730   S  E  L  V  I  D  L  G  E  D  H  S  G  R  E  G   745

          2251 ----+----*----+----*----+----*----+----*----+----* 2300
           751 R  K  N  K  K  D  P  K  V  P  S  P  K  Q  D  A  I  767
mbg01394  4388 CGAAAAAACAAGAAAGATCCCAAGGTGCCGTCGCCTAAGCAAGACGCTAT 4437
               |||||||| ||||| || ||||| |  || || || || || || | | |
hk07594   2433 CGAAAAAATAAGAAGGAACCCAAAGAACCATCTCCCAAACAGGATGTTGT 2482
           746 R  K  N  K  K  E  P  K  E  P  S  P  K  Q  D  V  V  762

          2301 ----+----*----+----*----+----*----+----*----+----* 2350
           768  G  K  P  P  P  S  S  T  S  A  G  N  Q  S  P  P  E 784
mbg01394  4438 AGGTAAACCGCCACCGTCGTCCACTTCGGCGGGCAACCAGTCTCCCCCAG 4487
               ||||||| | ||||| ||  | ||    | ||||| ||| |||||||| |
hk07594   2483 AGGTAAAACTCCACCATCCACGACG...GTGGGCAGCCATTCTCCCCCGG 2529
           763  G  K  T  P  P  S  T  T  .  V  G  S  H  S  P  P  E 778

          2351 ----+----*----+----*----+----*----+----*----+----* 2400
           785   T  P  V  L  T  R  S  A  T  Q  A  P  A  A  G  V   800
mbg01394  4488 AGACACCGGTACTCACCCGCTCAGCCACCCAAGCACCCGCGGCTGGGGTC 4537
               | |||||||| |||||||||||  || ||||| |  |||||||||| | |
hk07594   2530 AAACACCGGTGCTCACCCGCTCTTCCGCCCAAACTTCCGCGGCTGGCGCC 2579
           779   T  P  V  L  T  R  S  S  A  Q  T  S  A  A  G  A   794

          2401 ----+----*----+----*----+----*----+----*----+----* 2450
           801 T  V  A  A  A  T  T  S  T  M  S  T  V  T  V  T  A  817
mbg01394  4538 ACCGTGGCCGCCGCCACCACCAGCACGATGTCTACCGTCACAGTCACGGC 4587
               ||          |||||||||||||||   || || ||||| ||||||||
hk07594   2580 ACA.........GCCACCACCAGCACGTCCTCCACGGTCACCGTCACGGC 2620
           795 T  .  .  .  A  T  T  S  T  S  S  T  V  T  V  T  A  808

          2451 ----+----*----+----*----+----*----+----*----+----* 2500
           818  P  A  T  A  V  T  G  S  P  V  K  K  Q  R  P  L  L 834
mbg01394  4588 ACCGGCCACCGCCGTCACGGGAAGCCCGGTGAAGAAGCAGAGGCCGCTCT 4637
                |||||| |||||| ||| |||||||| ||||| |||||||||||||| |
hk07594   2621 CCCGGCCCCCGCCGCCACAGGAAGCCCAGTGAAAAAGCAGAGGCCGCTTT 2670
           809  P  A  P  A  A  T  G  S  P  V  K  K  Q  R  P  L  L 825

          2501 ----+----*----+----*----+----*----+----*----+----* 2550
           835   P  K  E  T  V  P  A  V  Q  R  V  V  W  N  A  S   850
mbg01394  4638 TACCGAAGGAGACTGTCCCAGCTGTGCAGCGGGTCGTGTGGAACGCATCA 4687
               ||||||||||||||| ||| || ||||||||||||||||||||| |||||
hk07594   2671 TACCGAAGGAGACTGCCCCGGCCGTGCAGCGGGTCGTGTGGAACTCATCA 2720
           826   P  K  E  T  A  P  A  V  Q  R  V  V  W  N  S  S   841

          2551 ----+----*----+----*----+----*----+----*----+----* 2600
           851 S  K  F  Q  T  S  S  Q  K  W  H  M  Q  K  I  Q  R  867
mbg01394  4688 AGTAAGTTTCAAACGTCCTCCCAAAAGTGGCACATGCAGAAGATACAGCG 4737
               |||||||||||||||||||||||||||||||||||||||||||| |||||
hk07594   2721 AGTAAGTTTCAAACGTCCTCCCAAAAGTGGCACATGCAGAAGATGCAGCG 2770
           842 S  K  F  Q  T  S  S  Q  K  W  H  M  Q  K  M  Q  R  858

          2601 ----+----*----+----*----+----*----+----*----+----* 2650
           868  Q  Q  Q  Q  Q  Q  Q  Q  Q  Q  S  Q  Q  Q  S  Q  Q 884
mbg01394  4738 CCAGCAGCAGCAGCAGCAGCAACAACAGCAGAGCCAACAGCAGAGCCAGC 4787
                |||||||||||||||||||| |||                  | |||||
hk07594   2771 TCAGCAGCAGCAGCAGCAGCAGCAA..................AACCAGC 2802
           859  Q  Q  Q  Q  Q  Q  Q  Q  .  .  .  .  .  .  N  Q  Q 869

          2651 ----+----*----+----*----+----*----+----*----+----* 2700
           885   Q  Q  P  Q  S  S  Q  G  T  R  Y  Q  T  R  Q  A   900
mbg01394  4788 AGCAGCAGCCTCAGTCTTCCCAGGGGACGAGATATCAGACCAGACAGGCT 4837
               ||||||||||||||||||||||||||||||||||||||||||||||||||
hk07594   2803 AGCAGCAGCCTCAGTCTTCCCAGGGGACGAGATATCAGACCAGACAGGCT 2852
           870   Q  Q  P  Q  S  S  Q  G  T  R  Y  Q  T  R  Q  A   885

          2701 ----+----*----+----*----+----*----+----*----+----* 2750
           901 V  K  V  V  Q  Q  K  E  V  T  Q  S  P  S  T  S  T  917
mbg01394  4838 GTGAAAGTTGTCCAGCAGAAGGAGGTCACCCAGAGCCCATCCACGTCCAC 4887
               ||||||| |||||||||||||||| |||| ||||||||||||||||||||
hk07594   2853 GTGAAAGCTGTCCAGCAGAAGGAGATCACACAGAGCCCATCCACGTCCAC 2902
           886 V  K  A  V  Q  Q  K  E  I  T  Q  S  P  S  T  S  T  902

          2751 ----+----*----+----*----+----*----+----*----+----* 2800
           918  I  T  L  V  T  S  T  Q  P  A  A  L  V  S  S  S  G 934
mbg01394  4888 CATCACGCTGGTGACCAGCACACAGCCGGCAGCCCTGGTCAGCAGTTCGG 4937
               |||||| |||||||||||||||||| |  |  ||||||||| ||| ||||
hk07594   2903 CATCACCCTGGTGACCAGCACACAGTCATCGCCCCTGGTCACCAGCTCGG 2952
           903  I  T  L  V  T  S  T  Q  S  S  P  L  V  T  S  S  G 919

          2801 ----+----*----+----*----+----*----+----*----+----* 2850
           935   S  A  S  T  L  A  S  A  I  N  A  D  L  P  I  A   950
mbg01394  4938 GCTCAGCAAGCACCCTGGCGTCTGCAATCAATGCCGACCTTCCCATTGCC 4987
               | ||    |||||||| | |||  || |||| || ||||| ||||| |||
hk07594   2953 GGTCCATGAGCACCCTTGTGTCCTCAGTCAACGCTGACCTGCCCATCGCC 3002
           920   S  M  S  T  L  V  S  S  V  N  A  D  L  P  I  A   935

          2851 ----+----*----+----*----+----*----+----*----+----* 2900
           951 T  A  S  A  D  V  A  A  D  I  A  K  Y  T  S  K  M  967
mbg01394  4988 ACCGCCTCGGCCGACGTGGCCGCAGACATTGCCAAGTACACCAGCAAAAT 5037
               || ||||| || || || ||||| || |||||||||||||| ||||||||
hk07594   3003 ACTGCCTCAGCTGATGTCGCCGCTGATATTGCCAAGTACACTAGCAAAAT 3052
           936 T  A  S  A  D  V  A  A  D  I  A  K  Y  T  S  K  M  952

          2901 ----+----*----+----*----+----*----+----*----+----* 2950
           968  M  D  A  I  K  G  T  M  T  E  I  Y  N  D  L  S  K 984
mbg01394  5038 GATGGATGCCATAAAGGGGACGATGACAGAAATCTACAATGACCTCTCCA 5087
               ||||||||| ||||| || || ||||||||||| ||||| || || || |
hk07594   3053 GATGGATGCAATAAAAGGAACAATGACAGAAATATACAACGATCTTTCTA 3102
           953  M  D  A  I  K  G  T  M  T  E  I  Y  N  D  L  S  K 969

          2951 ----+----*----+----*----+----*----+----*----+----* 3000
           985   N  T  T  G  S  T  I  A  E  I  R  R  L  R  I  E   1000
mbg01394  5088 AGAACACCACTGGGAGCACAATAGCTGAGATTCGAAGGCTGAGGATTGAG 5137
               | ||||| ||||| |||||||||||||||||||| ||||||||||| |||
hk07594   3103 AAAACACTACTGGAAGCACAATAGCTGAGATTCGCAGGCTGAGGATCGAG 3152
           970   N  T  T  G  S  T  I  A  E  I  R  R  L  R  I  E   985

          3001 ----+----*----+----*----+----*----+----*----+----* 3050
          1001 I  E  K  L  Q  W  L  H  Q  Q  E  L  A  E  M  K  H  1017
mbg01394  5138 ATTGAGAAACTGCAGTGGCTGCACCAGCAGGAGCTCGCTGAGATGAAGCA 5187
               || ||||| || ||||||||||||||||| |||||| | || ||||| ||
hk07594   3153 ATAGAGAAGCTCCAGTGGCTGCACCAGCAAGAGCTCTCCGAAATGAAACA 3202
           986 I  E  K  L  Q  W  L  H  Q  Q  E  L  S  E  M  K  H  1002

          3051 ----+----*----+----*----+----*----+----*----+----* 3100
          1018  N  L  E  L  T  M  A  E  M  R  Q  S  L  E  Q  E  R 1034
mbg01394  5188 CAACCTGGAGTTGACCATGGCCGAGATGCGGCAGAGCCTGGAACAGGAGC 5237
               |||| | ||| |||||||||| |||||||||||||||||||| |||||||
hk07594   3203 CAACTTAGAGCTGACCATGGCGGAGATGCGGCAGAGCCTGGAGCAGGAGC 3252
          1003  N  L  E  L  T  M  A  E  M  R  Q  S  L  E  Q  E  R 1019

          3101 ----+----*----+----*----+----*----+----*----+----* 3150
          1035   D  R  L  I  A  E  V  K  K  Q  L  E  L  E  K  Q   1050
mbg01394  5238 GGGATCGGCTCATCGCCGAGGTGAAGAAGCAACTGGAGCTGGAGAAGCAG 5287
               |||| |||||||||||||||||||||||||| |||||| |||||||||||
hk07594   3253 GGGACCGGCTCATCGCCGAGGTGAAGAAGCAGCTGGAGTTGGAGAAGCAG 3302
          1020   D  R  L  I  A  E  V  K  K  Q  L  E  L  E  K  Q   1035

          3151 ----+----*----+----*----+----*----+----*----+----* 3200
          1051 Q  A  V  D  E  T  K  K  K  Q  W  C  A  N  C  K  K  1067
mbg01394  5288 CAGGCGGTGGACGAGACCAAGAAGAAGCAGTGGTGTGCCAACTGCAAGAA 5337
               ||||||||||| ||||||||||||||||||||||| ||||||||||||||
hk07594   3303 CAGGCGGTGGATGAGACCAAGAAGAAGCAGTGGTGCGCCAACTGCAAGAA 3352
          1036 Q  A  V  D  E  T  K  K  K  Q  W  C  A  N  C  K  K  1052

          3201 ----+----*----+----*----+----*----+----*----+----* 3250
          1068  E  A  I  F  Y  C  C  W  N  T  S  Y  C  D  Y  P  C 1084
mbg01394  5338 GGAGGCCATTTTCTACTGCTGCTGGAACACCAGCTACTGTGACTACCCCT 5387
               ||||||||| || |||||||| ||||||||||||||||||||||||||||
hk07594   3353 GGAGGCCATCTTTTACTGCTGTTGGAACACCAGCTACTGTGACTACCCCT 3402
          1053  E  A  I  F  Y  C  C  W  N  T  S  Y  C  D  Y  P  C 1069

          3251 ----+----*----+----*----+----*----+----*----+----* 3300
          1085   Q  Q  A  H  W  P  E  H  M  K  S  C  T  Q  S  A   1100
mbg01394  5388 GTCAGCAGGCCCACTGGCCCGAGCACATGAAGTCCTGTACCCAGTCGGCG 5437
               | ||||| ||||||||||| ||||||||||||||||| |||||||| || 
hk07594   3403 GCCAGCAAGCCCACTGGCCTGAGCACATGAAGTCCTGCACCCAGTCAGCT 3452
          1070   Q  Q  A  H  W  P  E  H  M  K  S  C  T  Q  S  A   1085

          3301 ----+----*----+----*----+----*----+----*----+----* 3350
          1101 T  A  P  Q  Q  E  A  D  A  E  A  S  T  E  T  G  N  1117
mbg01394  5438 ACTGCCCCTCAGCAGGAAGCAGATGCCGAGGCAAGCACAGAAACAGGAAA 5487
               ||||| |||||||||||||| ||||| ||||  | ||||||||||  |||
hk07594   3453 ACTGCTCCTCAGCAGGAAGCGGATGCTGAGGTGAACACAGAAACACTAAA 3502
          1086 T  A  P  Q  Q  E  A  D  A  E  V  N  T  E  T  L  N  1102

          3351 ----+----*----+----*----+----*----+----*----+----* 3400
          1118  K  S  S  Q  G  N  S  S  N  T  Q  S  A  P  S  E  P 1134
mbg01394  5488 TAAGTCATCGCAGGGCAACTCCTCCAACACACAGTCAGCACCTTCAGAAC 5537
               |||||| || ||||| | |||||| | |||||| ||||||||||||||| 
hk07594   3503 TAAGTCCTCCCAGGGGAGCTCCTCGAGCACACAATCAGCACCTTCAGAAA 3552
          1103  K  S  S  Q  G  S  S  S  S  T  Q  S  A  P  S  E  T 1119

          3401 ----+----*----+----*----+----*----+----*----+----* 3450
          1135   A  S  A  P  K  E  K  E  A  P  A  E  K  S  K  D   1150
mbg01394  5538 CGGCCAGCGCCCCCAAAGAGAAAGAGGCGCCAGCGGAGAAGAGCAAGGAC 5587
               ||||||||||| |||||||||| ||| || |||| ||||| |||||||| 
hk07594   3553 CGGCCAGCGCCTCCAAAGAGAAGGAGACGTCAGCTGAGAAAAGCAAGGAG 3602
          1120   A  S  A  S  K  E  K  E  T  S  A  E  K  S  K  E   1135

          3451 ----+----*----+----*----+----*----+----*----+----* 3500
          1151 S  S  N  S  T  L  D  L  S  G  S  R  E  T  P  S  S  1167
mbg01394  5588 AGTAGTAACTCGACCCTGGATCTTTCCGGCTCCAGAGAGACGCCCTCCTC 5637
               ||| |    |||||||| || ||||| |||||||||||||||||||||||
hk07594   3603 AGTGGC...TCGACCCTTGACCTTTCTGGCTCCAGAGAGACGCCCTCCTC 3649
          1136 S  G  .  S  T  L  D  L  S  G  S  R  E  T  P  S  S  1151

          3501 ----+----*----+----*----+----*----+----*----+----* 3550
          1168  M  L  L  G  S  N  Q  S  S  V  S  K  R  C  D  K  Q 1184
mbg01394  5638 CATGCTCTTAGGCTCCAATCAAAGCTCTGTTAGCAAGAGGTGTGACAAGC 5687
               ||| |||||||||||||| ||| |||||               |||    
hk07594   3650 CATTCTCTTAGGCTCCAACCAAGGCTCT...............GAC.... 3680
          1152  I  L  L  G  S  N  Q  G  S  .  .  .  .  .  D  .  . 1161

          3551 ----+----*----+----*----+----*----+----*----+----* 3600
          1185   P  A  Y  T  P  T  T  T  D  H  Q  P  H  P  N  Y   1200
mbg01394  5688 AGCCTGCCTATACCCCAACCACTACAGACCACCAGCCGCACCCCAACTAC 5737
                                                                 
hk07594   3681 .................................................. 3680
          1161   .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .   1161

          3601 ----+----*----+----*----+----*----+----*----+----* 3650
          1201 P  A  Q  K  Y  H  S  R  S  S  K  A  G  L  W  S  S  1217
mbg01394  5738 CCAGCCCAGAAGTACCATTCCCGGAGCAGCAAGGCAGGTTTGTGGAGCAG 5787
                              ||||||||||| |  ||  |  ||   ||||||||
hk07594   3681 ...............CATTCCCGGAGTAATAAATCCAGT...TGGAGCAG 3712
          1162 .  .  .  .  .  H  S  R  S  N  K  S  S  .  W  S  S  1172

          3651 ----+----*----+----*----+----*----+----*----+----* 3700
          1218  S  E  E  K  R  A  S  S  R  S  E  H  S  G  G  T  S 1234
mbg01394  5788 CAGCGAGGAGAAGCGAGCGTCATCCCGCTCTGAGCACAGTGGAGGGACCA 5837
               ||| || |||||| | |  ||  | || || || ||||      | ||||
hk07594   3713 CAGTGATGAGAAGAGGGGATCGACACGTTCCGATCACAACACCAGTACCA 3762
          1173  S  D  E  K  R  G  S  T  R  S  D  H  N  T  S  T  S 1189

          3701 ----+----*----+----*----+----*----+----*----+----* 3750
          1235   T  K  N  L  M  P  K  E  S  R  E  S  R  L  D  A   1250
mbg01394  5838 GCACGAAGAACCTCATGCCCAAAGAGTCCCGGGAGTCTCGGCTAGATGCC 5887
               ||||||||| |||| | || |||||||| |||         || ||  ||
hk07594   3763 GCACGAAGAGCCTCCTCCCGAAAGAGTCTCGG.........CTGGACACC 3803
          1190   T  K  S  L  L  P  K  E  S  R  .  .  .  L  D  T   1202

          3751 ----+----*-- 3762
          1251 F  W  D  *   1254
mbg01394  5888 TTCTGGGACTAG 5899
               ||||||||||||
hk07594   3804 TTCTGGGACTAG 3815
          1203 F  W  D  *   1206


*--[ 3'UTR ]--*
             1 ----+----*----+----*----+----*----+----*----+----* 50
mbg01394  5900 GGGTGCATTGTGAGCCAGAACACCTCCCTATAGGCGGGAGAAAC.....A 5944
                 ||| || | ||  || | ||||  ||     | | || ||||      
hk07594   3816 CAGTGAATCGGGACACAAACCACCCACCCCATTGGGAGAAAAACCCAGAC 3865

            51 ----+----*----+----*----+----*----+----*----+----* 100
mbg01394  5945 GCCCAAACCACAGGAAGCAACAGAAACTGGAGAACCGCCACCTTTAGAAT 5994
               |||   |  | | ||| ||||| |  | ||||||| ||||| || ||| |
hk07594   3866 GCCAGGAAAAGAAGAAACAACAAAGGCAGGAGAACAGCCACTTTCAGACT 3915

           101 ----+----*----+----*----+----*----+----*----+----* 150
mbg01394  5995 TTCCAACACTCAGGCTCTGCCCGTCGGCTTGGCTGCAGGACAGAGACTTG 6044
               |   ||     |  | ||       | ||  ||  | ||   | |   ||
hk07594   3916 TGAAAATGACAAAACCCTCAGTTGAGCCTGAGCCCCCGGCGCGGGGGCTG 3965

           151 ----+----*----+----*----+----*----+----*----+----* 200
mbg01394  6045 ACTCCAT........CCCAGCTCTGGTCTTGGCTG.GGATGGCACTACCC 6085
                  |  |        ||||||   ||  ||| |||  ||  |  |  || 
hk07594   3966 CTACACTACAGGACACCCAGCATCGGCTTTGACTGCAGACTGTTCACCCA 4015

           201 ----+----*----+----*----+----*----+----*----+----* 250
mbg01394  6086 TGTGAGCCAAACACTTTAGGTGTAAGT.AGGTACACTT..GTGACGTCAC 6132
                  |||||     |||| ||||||| | | ||||| ||    || |||| 
hk07594   4016 CACGAGCCCTGTGCTTTTGGTGTAAATAATGTACAATTTGTGGATGTCAT 4065

           251 ----+----*----+----*----+----*----+----*----+----* 300
mbg01394  6133 TG.ACCTAGCGTACCTT.TCCTTTTCATATCTACACTGACCTTAACTTAT 6180
               || | |||| | || ||  |||||| |||| |  | | || |||||||||
hk07594   4066 TGAATCTAGAGGACTTTCCCCTTTTTATATTTGTATTAACTTTAACTTAT 4115

           301 ----+----*----+----*----+----*-- 332
mbg01394  6180 ................................ 6180
                                               
hk07594   4116 TAAAAAAAAAAAAAGAAAAAGAAAAACGATTT 4147