Comparison of KIAA cDNA sequences between mouse and human (KIAA0147)

<< Original sequence data >>

mouse  mKIAA0147 (mbg04389)     length:   5539 bp
human  ha01022s1                length:   5132 bp


<< Aligned sequence information (excl. stop codon, if present) >>

----------------------------------------------------------
            length    #match  #mismatch   %diff
----------------------------------------------------------
DNA

  CDS1 :     4839      3988      851      17.59
  Total:     4839      3988      851      17.59

  3'UTR:      231       198       33      14.29

amino acid

  CDS1 :     1619      1418      201      12.42
  Total:     1619      1418      201      12.42
----------------------------------------------------------
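
The %diff column above appears to be the mismatch count as a percentage of
the aligned length. The sketch below recomputes it from the tabulated
counts under that assumption. The helper name percent_diff is hypothetical,
and how the original tool scores gap positions is not stated in this report.

```python
def percent_diff(length, mismatches):
    """Mismatches as a percentage of aligned length, rounded to 2 places."""
    return round(100.0 * mismatches / length, 2)

# (length, #match, #mismatch, reported %diff), copied from the table above.
rows = {
    "DNA CDS1": (4839, 3988, 851, 17.59),
    "DNA 3'UTR": (231, 198, 33, 14.29),
    "amino acid CDS1": (1619, 1418, 201, 12.42),
}

for name, (length, match, mismatch, reported) in rows.items():
    # Matches and mismatches should account for every aligned position.
    assert match + mismatch == length, name
    computed = percent_diff(length, mismatch)
    # Assumed definition: %diff = 100 * mismatch / length.
    assert abs(computed - reported) < 0.005, name
    print(f"{name:16s} {computed:6.2f}")
```

If the assumed definition holds, each recomputed value agrees with the
reported column to two decimal places.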


<< Alignment region (incl. stop codon, if present) >>

----------------------------------------------------------
                    cDNA      cDNA original    amino acid
----------------------------------------------------------
  CDS1 : mouse   398 -  5311    112 -   426     58 -  1695
         human     3 -  4895      3 -  4895      1 -  1631
  3'UTR: mouse  5312 -  5539
         human  4896 -  5132
----------------------------------------------------------
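
As a consistency check on the table above, the cDNA span of CDS1 should be
three times the amino-acid span if each position in the amino-acid range
corresponds to one codon (stop included). The sketch below tests this for
the "cDNA" and "amino acid" columns only. The function names are
hypothetical, and the one-position-per-codon reading is an assumption, not
something the report states.

```python
def span(start, end):
    """Number of positions in an inclusive 1-based range."""
    return end - start + 1

def codon_count(cdna_start, cdna_end):
    """Whole codons covered by an inclusive cDNA range."""
    nt = span(cdna_start, cdna_end)
    if nt % 3 != 0:
        raise ValueError("CDS span is not a multiple of 3")
    return nt // 3

# ((cDNA start, cDNA end), (amino acid start, amino acid end)) for CDS1.
cds1 = {
    "mouse": ((398, 5311), (58, 1695)),
    "human": ((3, 4895), (1, 1631)),
}

for species, (cdna, aa) in cds1.items():
    # Assumption: the amino-acid range counts one position per codon.
    assert codon_count(*cdna) == span(*aa), species
    print(species, span(*cdna), "nt =", span(*aa), "codons")
```

Under this reading both species are internally consistent: the nucleotide
span of each CDS is exactly three times its amino-acid span.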


<< Alignment >>

*--[ CDS1 ]--*
              1 ----+----*----+----*----+----*----+----*----+----* 50
             58 M  L  K  C  I  P  L  W  R  C  N  R  H  V  E  S  V  74
mbg04389    398 ATGCTGAAGTGCATCCCGCTCTGGCGTTGTAACCGGCACGTGGAGTCGGT 447
                ||||| |||||||||||||| ||||| || ||||||||||||||||||||
ha01022s1     3 ATGCTCAAGTGCATCCCGCTGTGGCGCTGCAACCGGCACGTGGAGTCGGT 52
              1 M  L  K  C  I  P  L  W  R  C  N  R  H  V  E  S  V  17

             51 ----+----*----+----*----+----*----+----*----+----* 100
             75  D  K  R  H  C  S  L  Q  V  V  P  E  E  I  Y  R  Y 91
mbg04389    448 GGATAAGCGGCACTGCTCGCTGCAGGTTGTGCCAGAGGAGATTTACCGCT 497
                ||| ||||||||||| ||||||||||  ||||| |||||||| |||||||
ha01022s1    53 GGACAAGCGGCACTGTTCGCTGCAGGCCGTGCCGGAGGAGATCTACCGCT 102
             18  D  K  R  H  C  S  L  Q  A  V  P  E  E  I  Y  R  Y 34

            101 ----+----*----+----*----+----*----+----*----+----* 150
             92   S  R  S  L  E  E  L  L  L  D  A  N  Q  L  R  E   107
mbg04389    498 ACAGTCGTAGCCTCGAGGAGCTGCTCCTCGACGCCAACCAGCTGCGCGAG 547
                |||| || ||||| ||||||||||| ||||||||||||||||||||||||
ha01022s1   103 ACAGCCGCAGCCTGGAGGAGCTGCTGCTCGACGCCAACCAGCTGCGCGAG 152
             35   S  R  S  L  E  E  L  L  L  D  A  N  Q  L  R  E   50

            151 ----+----*----+----*----+----*----+----*----+----* 200
            108 L  P  K  P  F  F  R  L  L  N  L  R  K  L  G  L  S  124
mbg04389    548 CTACCTAAGCCCTTCTTCCGACTGTTGAACTTGAGAAAGTTGGGCCTCAG 597
                || || ||||| || ||||| ||| |||||||| | ||| ||||||| ||
ha01022s1   153 CTGCCCAAGCCTTTTTTCCGGCTGCTGAACTTGCGCAAGCTGGGCCTGAG 202
             51 L  P  K  P  F  F  R  L  L  N  L  R  K  L  G  L  S  67

            201 ----+----*----+----*----+----*----+----*----+----* 250
            125  D  N  E  I  Q  R  L  P  P  E  V  A  N  F  M  Q  L 141
mbg04389    598 TGACAACGAGATCCAGCGGCTGCCTCCTGAAGTGGCCAACTTCATGCAGC 647
                 |||||||||||||||||| ||||||| || |||||||||||||||||||
ha01022s1   203 CGACAACGAGATCCAGCGGTTGCCTCCCGAGGTGGCCAACTTCATGCAGC 252
             68  D  N  E  I  Q  R  L  P  P  E  V  A  N  F  M  Q  L 84

            251 ----+----*----+----*----+----*----+----*----+----* 300
            142   V  E  L  D  V  S  R  N  D  I  P  E  I  P  E  S   157
mbg04389    648 TGGTGGAGCTAGATGTGTCCCGGAACGATATTCCTGAGATACCTGAGAGC 697
                |||||||||| || ||||||||||||||||| |||||||| || ||||||
ha01022s1   253 TGGTGGAGCTGGACGTGTCCCGGAACGATATCCCTGAGATCCCGGAGAGC 302
             85   V  E  L  D  V  S  R  N  D  I  P  E  I  P  E  S   100

            301 ----+----*----+----*----+----*----+----*----+----* 350
            158 I  K  F  C  K  A  L  E  I  A  D  F  S  G  N  P  L  174
mbg04389    698 ATAAAGTTCTGTAAGGCTCTGGAGATTGCAGACTTCAGTGGGAACCCCCT 747
                || |||||||| |||||||||||||| || |||||||| |||||||||||
ha01022s1   303 ATCAAGTTCTGCAAGGCTCTGGAGATCGCGGACTTCAGCGGGAACCCCCT 352
            101 I  K  F  C  K  A  L  E  I  A  D  F  S  G  N  P  L  117

            351 ----+----*----+----*----+----*----+----*----+----* 400
            175  S  R  L  P  D  G  F  T  Q  L  R  S  L  A  H  L  A 191
mbg04389    748 GTCTAGACTTCCGGATGGCTTCACACAGCTACGCAGCCTGGCTCACCTGG 797
                 || || || || ||||||||||| ||||| |||||||||||||||||||
ha01022s1   353 CTCCAGGCTCCCTGATGGCTTCACTCAGCTGCGCAGCCTGGCTCACCTGG 402
            118  S  R  L  P  D  G  F  T  Q  L  R  S  L  A  H  L  A 134

            401 ----+----*----+----*----+----*----+----*----+----* 450
            192   L  N  D  V  S  L  Q  A  L  P  G  D  V  G  N  L   207
mbg04389    798 CCCTGAATGACGTGTCCCTGCAGGCACTGCCTGGAGATGTGGGCAACCTG 847
                |||||||||| ||||| |||||||||||||| || || ||||||||||| 
ha01022s1   403 CCCTGAATGATGTGTCTCTGCAGGCACTGCCCGGGGACGTGGGCAACCTC 452
            135   L  N  D  V  S  L  Q  A  L  P  G  D  V  G  N  L   150

            451 ----+----*----+----*----+----*----+----*----+----* 500
            208 A  N  L  V  T  L  E  L  R  E  N  L  L  K  S  L  P  224
mbg04389    848 GCCAATCTGGTGACCCTGGAACTCCGGGAGAACCTGCTTAAATCTCTCCC 897
                ||||| |||||||||||||| ||||||||||||||||| || || || ||
ha01022s1   453 GCCAACCTGGTGACCCTGGAGCTCCGGGAGAACCTGCTCAAGTCCCTGCC 502
            151 A  N  L  V  T  L  E  L  R  E  N  L  L  K  S  L  P  167

            501 ----+----*----+----*----+----*----+----*----+----* 550
            225  A  S  L  S  F  L  V  K  L  E  Q  L  D  L  G  G  N 241
mbg04389    898 TGCGTCCCTGTCTTTCCTGGTGAAGCTGGAACAGCTGGATCTGGGAGGCA 947
                 ||||||||||| || ||||| ||||||||||||||||||||||||||||
ha01022s1   503 AGCGTCCCTGTCATTTCTGGTCAAGCTGGAACAGCTGGATCTGGGAGGCA 552
            168  A  S  L  S  F  L  V  K  L  E  Q  L  D  L  G  G  N 184

            551 ----+----*----+----*----+----*----+----*----+----* 600
            242   D  L  E  V  L  P  D  T  L  G  A  L  P  N  L  R   257
mbg04389    948 ACGACCTGGAAGTGCTGCCTGACACCCTGGGGGCTCTGCCTAACCTTCGG 997
                |||| |||||||||||||| ||||| |||||||||||||| || ||||||
ha01022s1   553 ACGATCTGGAAGTGCTGCCAGACACTCTGGGGGCTCTGCCCAATCTTCGG 602
            185   D  L  E  V  L  P  D  T  L  G  A  L  P  N  L  R   200

            601 ----+----*----+----*----+----*----+----*----+----* 650
            258 E  L  W  L  D  R  N  Q  L  S  A  L  P  P  E  L  G  274
mbg04389    998 GAGCTATGGTTAGACCGAAACCAACTGTCAGCGCTGCCCCCGGAGCTAGG 1047
                ||||| ||| | ||||| ||||| |||||||| |||||||||||||| ||
ha01022s1   603 GAGCTGTGGCTTGACCGGAACCAGCTGTCAGCACTGCCCCCGGAGCTCGG 652
            201 E  L  W  L  D  R  N  Q  L  S  A  L  P  P  E  L  G  217

            651 ----+----*----+----*----+----*----+----*----+----* 700
            275  N  L  R  R  L  V  C  L  D  V  S  E  N  R  L  E  E 291
mbg04389   1048 CAATCTGCGGCGGCTGGTGTGCCTGGATGTGTCAGAGAACAGGCTGGAGG 1097
                 || |||||||| |||||||||||||| ||||| || ||| |||||||||
ha01022s1   653 GAACCTGCGGCGCCTGGTGTGCCTGGACGTGTCGGAAAACCGGCTGGAGG 702
            218  N  L  R  R  L  V  C  L  D  V  S  E  N  R  L  E  E 234

            701 ----+----*----+----*----+----*----+----*----+----* 750
            292   L  P  V  E  L  G  G  L  A  L  L  T  D  L  L  L   307
mbg04389   1098 AGCTGCCTGTGGAGTTGGGTGGGCTGGCACTGCTCACGGATCTGCTGCTT 1147
                |||||||||  ||| | || |||||||  |||||||| || |||||||| 
ha01022s1   703 AGCTGCCTGCTGAGCTCGGCGGGCTGGTGCTGCTCACTGACCTGCTGCTG 752
            235   L  P  A  E  L  G  G  L  V  L  L  T  D  L  L  L   250

            751 ----+----*----+----*----+----*----+----*----+----* 800
            308 S  Q  N  L  L  Q  R  L  P  E  G  I  G  Q  L  K  Q  324
mbg04389   1148 TCCCAGAACTTGCTTCAGCGGCTGCCAGAGGGCATCGGTCAGCTGAAGCA 1197
                ||||||||| |||| | | ||||||| || ||||||||||||||||||||
ha01022s1   753 TCCCAGAACCTGCTGCGGAGGCTGCCCGACGGCATCGGTCAGCTGAAGCA 802
            251 S  Q  N  L  L  R  R  L  P  D  G  I  G  Q  L  K  Q  267

            801 ----+----*----+----*----+----*----+----*----+----* 850
            325  L  S  I  L  K  V  D  Q  N  R  L  C  E  V  T  E  A 341
mbg04389   1198 GCTGTCTATCCTGAAGGTGGACCAGAACCGGCTGTGTGAGGTCACTGAGG 1247
                ||| || ||||| ||||| |||||||| |||||||| ||||| |||||||
ha01022s1   803 GCTATCCATCCTAAAGGTAGACCAGAATCGGCTGTGCGAGGTGACTGAGG 852
            268  L  S  I  L  K  V  D  Q  N  R  L  C  E  V  T  E  A 284

            851 ----+----*----+----*----+----*----+----*----+----* 900
            342   I  G  D  C  E  N  L  S  E  L  I  L  T  E  N  L   357
mbg04389   1248 CCATAGGGGACTGTGAGAACCTCTCGGAGCTGATCCTCACGGAGAACCTG 1297
                |||| |||||||||||||||||||| ||||||||||||||||||||||||
ha01022s1   853 CCATCGGGGACTGTGAGAACCTCTCTGAGCTGATCCTCACGGAGAACCTG 902
            285   I  G  D  C  E  N  L  S  E  L  I  L  T  E  N  L   300

            901 ----+----*----+----*----+----*----+----*----+----* 950
            358 L  T  A  L  P  H  S  L  G  K  L  T  K  L  T  N  L  374
mbg04389   1298 TTAACGGCCTTACCCCACTCGCTGGGCAAGCTGACCAAGCTGACTAACCT 1347
                 | | |||| | |||| ||| ||||| |||||||| |||||||| |||||
ha01022s1   903 CTGATGGCCCTGCCCCGCTCCCTGGGAAAGCTGACTAAGCTGACCAACCT 952
            301 L  M  A  L  P  R  S  L  G  K  L  T  K  L  T  N  L  317

            951 ----+----*----+----*----+----*----+----*----+----* 1000
            375  N  V  D  R  N  H  L  E  V  L  P  P  E  I  G  G  C 391
mbg04389   1348 CAATGTGGACCGGAACCATCTTGAGGTGCTGCCTCCTGAAATCGGAGGCT 1397
                ||| |||||||||||||| || |||| |||||| || || ||||| ||||
ha01022s1   953 CAACGTGGACCGGAACCACCTCGAGGCGCTGCCGCCCGAGATCGGGGGCT 1002
            318  N  V  D  R  N  H  L  E  A  L  P  P  E  I  G  G  C 334

           1001 ----+----*----+----*----+----*----+----*----+----* 1050
            392   V  A  L  S  V  L  S  L  R  D  N  R  L  A  V  L   407
mbg04389   1398 GTGTGGCCCTTAGTGTCCTCTCTTTGAGAGACAATCGCCTGGCTGTCCTC 1447
                ||||||| || || |||||||| ||||| ||||| |||||||| ||||| 
ha01022s1  1003 GTGTGGCACTCAGCGTCCTCTCCTTGAGGGACAACCGCCTGGCCGTCCTG 1052
            335   V  A  L  S  V  L  S  L  R  D  N  R  L  A  V  L   350

           1051 ----+----*----+----*----+----*----+----*----+----* 1100
            408 P  P  E  L  A  H  T  A  E  L  H  V  L  D  V  A  G  424
mbg04389   1448 CCTCCTGAGCTTGCCCATACCGCTGAGCTGCACGTGCTTGATGTGGCTGG 1497
                || || ||||| ||||| ||  | |||||||||||||| || ||||| ||
ha01022s1  1053 CCACCAGAGCTGGCCCACACGACAGAGCTGCACGTGCTGGACGTGGCGGG 1102
            351 P  P  E  L  A  H  T  T  E  L  H  V  L  D  V  A  G  367

           1101 ----+----*----+----*----+----*----+----*----+----* 1150
            425  N  R  L  R  S  L  P  F  A  L  T  H  L  N  L  K  A 441
mbg04389   1498 GAACCGGCTGCGGAGTCTGCCGTTTGCGCTCACCCACCTCAACCTCAAGG 1547
                |||||| |||| |||||||||||| ||||||||||||||||| |||||||
ha01022s1  1103 GAACCGCCTGCAGAGTCTGCCGTTCGCGCTCACCCACCTCAATCTCAAGG 1152
            368  N  R  L  Q  S  L  P  F  A  L  T  H  L  N  L  K  A 384

           1151 ----+----*----+----*----+----*----+----*----+----* 1200
            442   L  W  L  A  E  N  Q  A  Q  P  M  L  R  F  Q  T   457
mbg04389   1548 CACTGTGGCTGGCCGAGAACCAGGCACAGCCCATGCTCCGCTTCCAGACT 1597
                | ||||||||||| ||||||||||| |||||||||||||| |||||||| 
ha01022s1  1153 CCCTGTGGCTGGCAGAGAACCAGGCGCAGCCCATGCTCCGGTTCCAGACG 1202
            385   L  W  L  A  E  N  Q  A  Q  P  M  L  R  F  Q  T   400

           1201 ----+----*----+----*----+----*----+----*----+----* 1250
            458 E  D  D  A  Q  T  G  E  K  V  L  T  C  Y  L  L  P  474
mbg04389   1598 GAGGATGATGCCCAGACGGGTGAAAAGGTGCTCACCTGCTACCTGCTGCC 1647
                ||||||||||||| ||| || || |||||||||||||||||| |||||||
ha01022s1  1203 GAGGATGATGCCCGGACCGGCGAGAAGGTGCTCACCTGCTACTTGCTGCC 1252
            401 E  D  D  A  R  T  G  E  K  V  L  T  C  Y  L  L  P  417

           1251 ----+----*----+----*----+----*----+----*----+----* 1300
            475  Q  Q  P  L  P  S  L  E  D  A  G  Q  Q  S  S  P  S 491
mbg04389   1648 CCAGCAGCCCCTGCCAAGTCTCGAAGACGCTGGACAGCAGAGCAGTCCCT 1697
                |||||||||||  |  || ||||| || ||||| |||||| | || | ||
ha01022s1  1253 CCAGCAGCCCCCACTCAGCCTCGAGGATGCTGGGCAGCAGGGGAGCCTCT 1302
            418  Q  Q  P  P  L  S  L  E  D  A  G  Q  Q  G  S  L  S 434

           1301 ----+----*----+----*----+----*----+----*----+----* 1350
            492   E  S  C  S  D  A  P  L  S  R  V  S  V  I  Q  F   507
mbg04389   1698 CAGAGAGCTGTAGTGATGCCCCTCTGAGCCGTGTCAGTGTCATCCAGTTC 1747
                | |||| ||| || |||||||| | |||||| ||||| ||||||||||||
ha01022s1  1303 CGGAGACCTGGAGCGATGCCCCGCCGAGCCGCGTCAGCGTCATCCAGTTC 1352
            435   E  T  W  S  D  A  P  P  S  R  V  S  V  I  Q  F   450

           1351 ----+----*----+----*----+----*----+----*----+----* 1400
            508 E  D  T  L  E  G  E  E  D  A  E  E  A  A  A  E  K  524
mbg04389   1748 GAGGACACCCTGGAAGGTGAAGAGGATGCTGAGGAAGCTGCGGCCGAGAA 1797
                  |||  |||    |||||| ||||| |||||||||||||| || |||||
ha01022s1  1353 CTGGAGGCCCCCATAGGTGATGAGGACGCTGAGGAAGCTGCAGCTGAGAA 1402
            451 L  E  A  P  I  G  D  E  D  A  E  E  A  A  A  E  K  467

           1401 ----+----*----+----*----+----*----+----*----+----* 1450
            525  R  G  L  Q  R  R  A  T  P  H  P  S  E  L  K  V  M 541
mbg04389   1798 GCGGGGCCTACAGCGTAGGGCCACACCTCATCCCAGTGAGCTCAAGGTGA 1847
                |||||||||||||||  ||||||||||||| ||||| |||||||||||||
ha01022s1  1403 GCGGGGCCTACAGCGCCGGGCCACACCTCACCCCAGCGAGCTCAAGGTGA 1452
            468  R  G  L  Q  R  R  A  T  P  H  P  S  E  L  K  V  M 484

           1451 ----+----*----+----*----+----*----+----*----+----* 1500
            542   K  R  G  I  E  E  R  R  N  E  A  F  V  C  K  P   557
mbg04389   1848 TGAAGAGGGGCATTGAAGAGCGCCGGAATGAGGCCTTCGTCTGCAAGCCT 1897
                |||||||| |||| || | ||| ||||  ||||||| |   ||| |||| 
ha01022s1  1453 TGAAGAGGAGCATCGAGGGGCGGCGGAGCGAGGCCTGCCCTTGCCAGCCA 1502
            485   K  R  S  I  E  G  R  R  S  E  A  C  P  C  Q  P   500

           1501 ----+----*----+----*----+----*----+----*----+----* 1550
            558 D  P  S  P  P  S  P  S  E  E  E  K  R  L  S  A  E  574
mbg04389   1898 GACCCCAGCCCACCCTCGCCTTCAGAGGAGGAGAAGAGGCTGAGTGCAGA 1947
                ||| |  |  | |||| |||| |||||||||||||| |||||||||| ||
ha01022s1  1503 GACTCTGGGTCGCCCTTGCCTGCAGAGGAGGAGAAGCGGCTGAGTGCCGA 1552
            501 D  S  G  S  P  L  P  A  E  E  E  K  R  L  S  A  E  517

           1551 ----+----*----+----*----+----*----+----*----+----* 1600
            575  S  A  L  S  G  G  S  V  P  S  A  S  T  A  S  E  G 591
mbg04389   1948 GTCTGCCTTGAGTGGAGGCTCTGTCCCGTCAGCAAGCACAGCCTCGGAAG 1997
                ||||| | |||||| || ||||  ||| || || ||||||| ||| || |
ha01022s1  1553 GTCTGGCCTGAGTGAAGACTCTCGCCCATCTGCCAGCACAGTCTCTGAGG 1602
            518  S  G  L  S  E  D  S  R  P  S  A  S  T  V  S  E  A 534

           1601 ----+----*----+----*----+----*----+----*----+----* 1650
            592   E  P  E  I  L  P  A  E  V  Q  G  L  G  Q  H  E   607
mbg04389   1998 GTGAGCCTGAGATCCTGCCGGCTGAGGTACAAGGGCTAGGCCAGCATGAG 2047
                 |||||| |||  || | ||||||||| ||| ||     ||||||| || 
ha01022s1  1603 CTGAGCCCGAGGGCCCGTCGGCTGAGGCACAGGGTGGGAGCCAGCAGGAA 1652
            535   E  P  E  G  P  S  A  E  A  Q  G  G  S  Q  Q  E   550

           1651 ----+----*----+----*----+----*----+----*----+----* 1700
            608 A  M  P  A  Q  .  E  E  Y  T  E  D  D  Y  N  E  P  623
mbg04389   2048 GCCATGCCTGCACAG...GAGGAGTATACAGAAGACGACTATAACGAGCC 2094
                |||| | ||||       |||||  |  | ||||| |||||  | |||||
ha01022s1  1653 GCCACGACTGCTGGCGGGGAGGAAGACGCCGAAGAGGACTACCAGGAGCC 1702
            551 A  T  T  A  G  G  E  E  D  A  E  E  D  Y  Q  E  P  567

           1701 ----+----*----+----*----+----*----+----*----+----* 1750
            624  T  V  H  F  A  E  D  T  L  I  P  R  E  D  G  E  S 640
mbg04389   2095 CACAGTGCACTTTGCAGAGGACACACTGATCCCTAGGGAAGACGGTGAAA 2144
                ||| ||||| || ||||||||| ||||| | ||  |||| ||| | || |
ha01022s1  1703 CACGGTGCATTTCGCAGAGGACGCACTGCTGCCCGGGGATGACAGGGAGA 1752
            568  T  V  H  F  A  E  D  A  L  L  P  G  D  D  R  E  I 584

           1751 ----+----*----+----*----+----*----+----*----+----* 1800
            641   E  E  G  Q  P  E  A  A  W  P  L  P  S  G  R  Q   656
mbg04389   2145 GTGAAGAGGGACAGCCTGAGGCTGCCTGGCCGCTGCCTAGTGGCCGGCAG 2194
                  || ||||| |||||||||||  ||||| | |||||  | ||  |||||
ha01022s1  1753 TCGAGGAGGGGCAGCCTGAGGCCCCCTGGACCCTGCCAGGCGGGAGGCAG 1802
            585   E  E  G  Q  P  E  A  P  W  T  L  P  G  G  R  Q   600

           1801 ----+----*----+----*----+----*----+----*----+----* 1850
            657 R  L  I  R  K  D  T  P  H  Y  K  K  H  F  K  I  S  673
mbg04389   2195 AGGCTCATCCGCAAAGACACGCCCCACTACAAGAAGCACTTCAAGATCTC 2244
                 ||||||||||||| ||||| || |||||||| |||||||||||||||||
ha01022s1  1803 CGGCTCATCCGCAAGGACACACCTCACTACAAAAAGCACTTCAAGATCTC 1852
            601 R  L  I  R  K  D  T  P  H  Y  K  K  H  F  K  I  S  617

           1851 ----+----*----+----*----+----*----+----*----+----* 1900
            674  K  L  P  Q  P  E  A  V  V  A  L  L  Q  G  V  Q  T 690
mbg04389   2245 CAAGCTCCCACAGCCTGAAGCTGTCGTGGCCCTGCTGCAAGGGGTGCAGA 2294
                |||||| || ||||| || || || ||||| |||||||| ||  ||||| 
ha01022s1  1853 CAAGCTGCCCCAGCCCGAGGCCGTTGTGGCTCTGCTGCAGGGCATGCAGC 1902
            618  K  L  P  Q  P  E  A  V  V  A  L  L  Q  G  M  Q  P 634

           1901 ----+----*----+----*----+----*----+----*----+----* 1950
            691   D  R  E  G  P  T  A  .  .  G  W  H  N  G  P  H   704
mbg04389   2295 CCGACAGGGAGGGCCCCACTGCA......GGTTGGCACAATGGCCCACAC 2338
                | ||  ||||||||||    ||       || |||||||||||||| |||
ha01022s1  1903 CTGATGGGGAGGGCCCTGTGGCTCCCGGGGGCTGGCACAATGGCCCCCAC 1952
            635   D  G  E  G  P  V  A  P  G  G  W  H  N  G  P  H   650

           1951 ----+----*----+----*----+----*----+----*----+----* 2000
            705 T  P  W  A  P  R  A  H  .  .  .  .  .  .  .  .  .  712
mbg04389   2339 ACACCTTGGGCTCCTCGAGCCCAT.......................... 2362
                 |||| ||||||||||| |||||                           
ha01022s1  1953 GCACCCTGGGCTCCTCGGGCCCAGAAGGAGGAGGAGGAGGAGGAAGAGGG 2002
            651 A  P  W  A  P  R  A  Q  K  E  E  E  E  E  E  E  G  667

           2001 ----+----*----+----*----+----*----+----*----+----* 2050
            713  .  .  .  E  E  E  E  E  E  E  E  E  N  R  D  E  E 726
mbg04389   2363 ..........GAGGAGGAAGAGGAGGAGGAGGAGGAGAACAGGGATGAGG 2402
                          |||||||| || |||||||||||||| ||||||| ||| |
ha01022s1  2003 TAGTCCTCAGGAGGAGGAGGAAGAGGAGGAGGAGGAAAACAGGGCTGAAG 2052
            668  S  P  Q  E  E  E  E  E  E  E  E  E  N  R  A  E  E 684

           2051 ----+----*----+----*----+----*----+----*----+----* 2100
            727   E  G  E  A  T  T  E  E  D  D  K  E  E  A  V  A   742
mbg04389   2403 AGGAAGGCGAGGCCACCACGGAAGAAGATGACAAAGAAGAGGCTGTGGCT 2452
                |||||   ||||||| ||| || || || ||||| || | ||| |||| |
ha01022s1  2053 AGGAA...GAGGCCAGCACTGAGGAGGAGGACAAGGAGGGGGCCGTGGTT 2099
            685   E  .  E  A  S  T  E  E  E  D  K  E  G  A  V  V   699

           2101 ----+----*----+----*----+----*----+----*----+----* 2150
            743 S  A  P  S  V  K  G  V  S  F  D  Q  A  N  N  L  L  759
mbg04389   2453 TCTGCTCCCTCTGTCAAGGGGGTGTCGTTTGACCAGGCCAATAACCTGCT 2502
                ||||| |||||||||||||| || ||||||||||||||||||||||||||
ha01022s1  2100 TCTGCGCCCTCTGTCAAGGGAGTTTCGTTTGACCAGGCCAATAACCTGCT 2149
            700 S  A  P  S  V  K  G  V  S  F  D  Q  A  N  N  L  L  716

           2151 ----+----*----+----*----+----*----+----*----+----* 2200
            760  I  E  P  A  R  I  E  E  E  E  L  T  L  T  I  V  R 776
mbg04389   2503 GATAGAGCCTGCTCGCATTGAGGAGGAAGAGTTAACACTCACGATCGTGC 2552
                ||||||||||||||||||||||||||||||| | || ||||| ||| |||
ha01022s1  2150 GATAGAGCCTGCTCGCATTGAGGAGGAAGAGCTGACCCTCACCATCCTGC 2199
            717  I  E  P  A  R  I  E  E  E  E  L  T  L  T  I  L  R 733

           2201 ----+----*----+----*----+----*----+----*----+----* 2250
            777   Q  T  G  G  L  G  I  S  I  A  G  G  K  G  S  T   792
mbg04389   2553 GGCAGACGGGGGGCCTGGGCATCAGTATCGCAGGGGGAAAAGGCTCTACC 2602
                ||||||| ||||||||||||||||| || || || || || ||||| || 
ha01022s1  2200 GGCAGACTGGGGGCCTGGGCATCAGCATTGCGGGCGGCAAGGGCTCCACA 2249
            734   Q  T  G  G  L  G  I  S  I  A  G  G  K  G  S  T   749

           2251 ----+----*----+----*----+----*----+----*----+----* 2300
            793 P  Y  K  G  D  D  E  G  I  F  I  S  R  V  S  E  E  809
mbg04389   2603 CCCTACAAAGGAGATGACGAGGGCATTTTCATCTCTCGAGTGTCTGAAGA 2652
                ||||| || || || ||||||||||| ||||||||||| ||||| || ||
ha01022s1  2250 CCCTATAAGGGGGACGACGAGGGCATATTCATCTCTCGGGTGTCCGAGGA 2299
            750 P  Y  K  G  D  D  E  G  I  F  I  S  R  V  S  E  E  766

           2301 ----+----*----+----*----+----*----+----*----+----* 2350
            810  G  P  A  A  R  A  G  V  R  V  G  D  K  L  L  E  V 826
mbg04389   2653 GGGCCCTGCAGCCCGTGCTGGAGTCCGAGTTGGTGACAAACTTCTTGAGG 2702
                 |||||||| ||||| ||||||||||| || |||||||| || || ||||
ha01022s1  2300 AGGCCCTGCGGCCCGGGCTGGAGTCCGTGTGGGTGACAAGCTCCTGGAGG 2349
            767  G  P  A  A  R  A  G  V  R  V  G  D  K  L  L  E  V 783

           2351 ----+----*----+----*----+----*----+----*----+----* 2400
            827   N  G  V  A  L  Q  D  A  E  H  H  E  A  V  E  A   842
mbg04389   2703 TGAATGGTGTAGCCTTGCAGGACGCAGAGCACCACGAGGCCGTGGAAGCA 2752
                |||||||||| ||  |||||| ||| |||||||||||||||||||| || 
ha01022s1  2350 TGAATGGTGTGGCTCTGCAGGGCGCCGAGCACCACGAGGCCGTGGAGGCG 2399
            784   N  G  V  A  L  Q  G  A  E  H  H  E  A  V  E  A   799

           2401 ----+----*----+----*----+----*----+----*----+----* 2450
            843 L  R  G  A  G  A  A  V  Q  M  R  V  W  R  E  R  M  859
mbg04389   2753 CTTCGGGGGGCAGGTGCTGCTGTGCAGATGCGAGTGTGGAGGGAAAGAAT 2802
                || |||||||| ||  |||| |||||||||||||||||| ||||  | ||
ha01022s1  2400 CTCCGGGGGGCCGGCACTGCCGTGCAGATGCGAGTGTGGCGGGAGCGCAT 2449
            800 L  R  G  A  G  T  A  V  Q  M  R  V  W  R  E  R  M  816

           2451 ----+----*----+----*----+----*----+----*----+----* 2500
            860  V  E  P  E  N  A  V  T  I  T  P  L  R  P  E  D  D 876
mbg04389   2803 GGTGGAGCCGGAGAACGCAGTCACTATTACCCCCTTACGCCCTGAAGATG 2852
                ||||||||| |||||||| ||||| || || ||  | || || || ||||
ha01022s1  2450 GGTGGAGCCTGAGAACGCGGTCACCATCACGCCGCTGCGGCCCGAGGATG 2499
            817  V  E  P  E  N  A  V  T  I  T  P  L  R  P  E  D  D 833

           2501 ----+----*----+----*----+----*----+----*----+----* 2550
            877   Y  S  P  R  E  W  R  G  G  G  L  R  L  P  L  L   892
mbg04389   2853 ACTATAGTCCCCGAGAGTGGCGGGGAGGTGGCCTGCGCCTTCCCCTGCTC 2902
                | || || ||||||||| |||||||||| || |||||||| |||||||||
ha01022s1  2500 ATTACAGCCCCCGAGAGCGGCGGGGAGGGGGGCTGCGCCTGCCCCTGCTC 2549
            834   Y  S  P  R  E  R  R  G  G  G  L  R  L  P  L  L   849

           2551 ----+----*----+----*----+----*----+----*----+----* 2600
            893 Q  P  E  T  P  V  S  L  R  Q  R  H  A  A  C  L  V  909
mbg04389   2903 CAGCCTGAGACTCCTGTATCCCTCCGTCAGCGTCATGCTGCCTGTCTCGT 2952
                | |||||| |  || |   ||||||||||||| || |  ||||| || | 
ha01022s1  2550 CCGCCTGAAAGCCCCGGGCCCCTCCGTCAGCGCCACGTGGCCTGCCTGGC 2599
            850 P  P  E  S  P  G  P  L  R  Q  R  H  V  A  C  L  A  866

           2601 ----+----*----+----*----+----*----+----*----+----* 2650
            910  R  S  E  K  G  L  G  F  S  I  A  G  G  K  G  S  T 926
mbg04389   2953 GCGCAGTGAAAAGGGGCTGGGCTTCAGCATTGCTGGTGGAAAGGGCTCCA 3002
                 ||||| || | ||||||||||||||||||||||||||| || |||||||
ha01022s1  2600 ACGCAGCGAGAGGGGGCTGGGCTTCAGCATTGCTGGTGGGAAAGGCTCCA 2649
            867  R  S  E  R  G  L  G  F  S  I  A  G  G  K  G  S  T 883

           2651 ----+----*----+----*----+----*----+----*----+----* 2700
            927   P  Y  R  A  G  D  G  G  I  F  I  S  R  I  A  E   942
mbg04389   3003 CACCTTACCGGGCTGGTGATGGGGGCATCTTTATATCCCGCATTGCAGAG 3052
                |||| ||| |||||||||||| |||||||||  | ||||||||||| |||
ha01022s1  2650 CACCCTACAGGGCTGGTGATGCGGGCATCTTCGTCTCCCGCATTGCCGAG 2699
            884   P  Y  R  A  G  D  A  G  I  F  V  S  R  I  A  E   899

           2701 ----+----*----+----*----+----*----+----*----+----* 2750
            943 G  G  A  A  H  R  A  G  T  L  Q  V  G  D  R  V  L  959
mbg04389   3053 GGAGGGGCTGCTCACCGGGCAGGCACTCTACAGGTTGGCGACCGTGTCCT 3102
                || || ||||||||||| || ||||| || |||||||||||||| |||||
ha01022s1  2700 GGCGGTGCTGCTCACCGCGCGGGCACACTGCAGGTTGGCGACCGCGTCCT 2749
            900 G  G  A  A  H  R  A  G  T  L  Q  V  G  D  R  V  L  916

           2751 ----+----*----+----*----+----*----+----*----+----* 2800
            960  S  I  N  G  V  D  M  T  E  A  R  H  D  H  A  V  S 976
mbg04389   3103 CTCGATCAATGGAGTGGACATGACTGAGGCCAGGCACGACCATGCTGTCT 3152
                ||| || |||||||||||| |||||||||||||||| ||||| || ||||
ha01022s1  2750 CTCTATTAATGGAGTGGACGTGACTGAGGCCAGGCATGACCACGCCGTCT 2799
            917  S  I  N  G  V  D  V  T  E  A  R  H  D  H  A  V  S 933

           2801 ----+----*----+----*----+----*----+----*----+----* 2850
            977   L  L  T  A  A  S  P  T  I  S  L  L  L  E  R  E   992
mbg04389   3153 CCCTGCTGACTGCTGCTTCCCCTACCATCTCCCTGCTTCTGGAGAGGGAG 3202
                |||||||||| ||||| ||||| |||||| |||||||  ||||| |||||
ha01022s1  2800 CCCTGCTGACCGCTGCCTCCCCCACCATCGCCCTGCTGTTGGAGCGGGAG 2849
            934   L  L  T  A  A  S  P  T  I  A  L  L  L  E  R  E   949

           2851 ----+----*----+----*----+----*----+----*----+----* 2900
            993 T  G  G  T  Y  P  P  S  P  P  P  H  S  S  P  T  P  1009
mbg04389   3203 ACTGGAGGGACTTACCCACCTAGCCCTCCCCCCCATTCCTCCCCAACCCC 3252
                 |||| ||  ||   || || |||||||  || |||||||| ||  || |
ha01022s1  2850 GCTGGGGGCCCTCTTCCTCCCAGCCCTCTGCCACATTCCTCACCCCCCAC 2899
            950 A  G  G  P  L  P  P  S  P  L  P  H  S  S  P  P  T  966

           2901 ----+----*----+----*----+----*----+----*----+----* 2950
           1010  A  A  T  V  A  A  T  V  S  T  A  V  P  G  E  P  L 1026
mbg04389   3253 TGCTGCTACTGTTGCTGCTACCGTGAGCACTGCTGTCCCCGGAGAACCTC 3302
                 ||||||  ||   |  | | | | | ||||||   |||||| |  ||| 
ha01022s1  2900 CGCTGCTGTTGCCACCACCAGCATAACCACTGCCACCCCCGGGGTGCCTG 2949
            967  A  A  V  A  T  T  S  I  T  T  A  T  P  G  V  P  G 983

           2951 ----+----*----+----*----+----*----+----*----+----* 3000
           1027   L  P  R  L  S  P  S  L  L  A  T  A  L  E  G  P   1042
mbg04389   3303 TGCTACCTAGGCTGTCCCCTAGCCTCTTGGCCACTGCCTTGGAAGGACCA 3352
                 | | || || ||| |||| |||||  ||||  | || |||||||| |||
ha01022s1  2950 GGTTGCCGAGCCTGGCCCCCAGCCTGCTGGCTGCCGCGTTGGAAGGGCCA 2999
            984   L  P  S  L  A  P  S  L  L  A  A  A  L  E  G  P   999

           3001 ----+----*----+----*----+----*----+----*----+----* 3050
           1043 Y  P  V  E  E  I  C  L  P  R  A  G  G  P  L  G  L  1059
mbg04389   3353 TACCCTGTGGAGGAAATCTGCCTGCCCAGGGCTGGTGGCCCATTGGGGCT 3402
                ||||| |||||||| ||| | ||||| || ||||| |||||  |||||||
ha01022s1  3000 TACCCAGTGGAGGAGATCCGTCTGCCAAGAGCTGGGGGCCCTCTGGGGCT 3049
           1000 Y  P  V  E  E  I  R  L  P  R  A  G  G  P  L  G  L  1016

           3051 ----+----*----+----*----+----*----+----*----+----* 3100
           1060  S  I  V  G  G  S  D  H  S  S  H  P  F  G  V  Q  D 1076
mbg04389   3403 TAGCATTGTAGGAGGTTCTGATCACTCCAGCCACCCGTTTGGTGTCCAAG 3452
                ||| ||||| ||||| || || || ||||||||||||||||||||||| |
ha01022s1  3050 TAGTATTGTCGGAGGCTCCGACCATTCCAGCCACCCGTTTGGTGTCCAGG 3099
           1017  S  I  V  G  G  S  D  H  S  S  H  P  F  G  V  Q  E 1033

           3101 ----+----*----+----*----+----*----+----*----+----* 3150
           1077   P  G  V  F  I  S  K  V  L  P  R  G  L  A  A  R   1092
mbg04389   3453 ATCCTGGTGTATTCATCTCCAAGGTGCTTCCTCGGGGCCTAGCTGCTCGC 3502
                | |||||||| ||||||||||||||||| || |||||||| || ||||||
ha01022s1  3100 AGCCTGGTGTGTTCATCTCCAAGGTGCTCCCGCGGGGCCTGGCCGCTCGC 3149
           1034   P  G  V  F  I  S  K  V  L  P  R  G  L  A  A  R   1049

           3151 ----+----*----+----*----+----*----+----*----+----* 3200
           1093 C  G  L  R  V  G  D  R  I  L  A  V  N  G  Q  D  V  1109
mbg04389   3503 TGTGGCCTTCGGGTTGGGGACCGCATCCTAGCAGTGAATGGGCAGGATGT 3552
                 | ||||| |||||||||||||||||||| |||||||| ||||| || ||
ha01022s1  3150 AGCGGCCTGCGGGTTGGGGACCGCATCCTGGCAGTGAACGGGCAAGACGT 3199
           1050 S  G  L  R  V  G  D  R  I  L  A  V  N  G  Q  D  V  1066

           3201 ----+----*----+----*----+----*----+----*----+----* 3250
           1110  R  E  A  T  H  Q  E  A  V  S  A  L  L  R  P  C  L 1126
mbg04389   3553 TCGGGAGGCCACACATCAAGAGGCAGTCAGTGCCCTGCTCAGGCCCTGCC 3602
                 ||||| ||||| || ||||| |||||||||||||||||| |||||||||
ha01022s1  3200 GCGGGATGCCACGCACCAAGAAGCAGTCAGTGCCCTGCTCCGGCCCTGCC 3249
           1067  R  D  A  T  H  Q  E  A  V  S  A  L  L  R  P  C  L 1083

           3251 ----+----*----+----*----+----*----+----*----+----* 3300
           1127   E  L  C  L  L  V  R  R  D  P  P  P  P  G  M  R   1142
mbg04389   3603 TGGAGCTGTGTCTGCTTGTGCGGAGGGACCCACCACCCCCGGGCATGCGG 3652
                |||||||||  ||||| ||||||||||||||  ||||||||||| | |||
ha01022s1  3250 TGGAGCTGTCGCTGCTGGTGCGGAGGGACCCGGCACCCCCGGGCCTACGG 3299
           1084   E  L  S  L  L  V  R  R  D  P  A  P  P  G  L  R   1099

           3301 ----+----*----+----*----+----*----+----*----+----* 3350
           1143 E  L  C  I  Q  K  A  P  G  E  K  L  G  I  S  I  R  1159
mbg04389   3653 GAACTCTGCATCCAGAAAGCCCCTGGGGAGAAGCTGGGTATCAGCATCCG 3702
                ||||| ||||||||||| || |||||||||| |||||| |||||||||||
ha01022s1  3300 GAACTGTGCATCCAGAAGGCACCTGGGGAGAGGCTGGGCATCAGCATCCG 3349
           1100 E  L  C  I  Q  K  A  P  G  E  R  L  G  I  S  I  R  1116

           3351 ----+----*----+----*----+----*----+----*----+----* 3400
           1160  G  G  A  K  G  H  A  G  N  P  C  D  P  T  D  E  G 1176
mbg04389   3703 CGGAGGTGCCAAGGGTCACGCGGGGAACCCCTGTGACCCTACAGATGAGG 3752
                ||| ||||||| ||| ||||| || |||||| | ||||| ||||| ||||
ha01022s1  3350 CGGGGGTGCCAGGGGCCACGCTGGCAACCCCCGCGACCCCACAGACGAGG 3399
           1117  G  G  A  R  G  H  A  G  N  P  R  D  P  T  D  E  G 1133

           3401 ----+----*----+----*----+----*----+----*----+----* 3450
           1177   I  F  I  S  K  V  S  P  T  G  A  A  G  R  D  G   1192
mbg04389   3753 GCATCTTTATCTCCAAGGTGAGCCCCACAGGAGCTGCCGGGCGTGATGGG 3802
                ||||||| |||||||||||||||||||| || || |||||||| || || 
ha01022s1  3400 GCATCTTCATCTCCAAGGTGAGCCCCACGGGGGCAGCCGGGCGCGACGGT 3449
           1134   I  F  I  S  K  V  S  P  T  G  A  A  G  R  D  G   1149

           3451 ----+----*----+----*----+----*----+----*----+----* 3500
           1193 R  L  R  V  G  L  R  L  L  E  V  N  Q  Q  S  L  L  1209
mbg04389   3803 CGGCTGCGTGTGGGGCTGCGGCTGCTAGAAGTGAACCAACAGAGCCTGCT 3852
                ||||||||||||||  |||||||| | || |||||||| |||||||||||
ha01022s1  3450 CGGCTGCGTGTGGGTTTGCGGCTGTTGGAGGTGAACCAGCAGAGCCTGCT 3499
           1150 R  L  R  V  G  L  R  L  L  E  V  N  Q  Q  S  L  L  1166

           3501 ----+----*----+----*----+----*----+----*----+----* 3550
           1210  G  L  T  H  A  E  A  V  Q  L  L  R  S  V  G  D  T 1226
mbg04389   3853 GGGCCTTACCCATGCGGAAGCAGTGCAGCTGCTGCGCAGCGTAGGCGACA 3902
                |||||| || || |  || || ||||||||||| ||||| || |||||||
ha01022s1  3500 GGGCCTGACGCACGGCGAGGCGGTGCAGCTGCTCCGCAGTGTGGGCGACA 3549
           1167  G  L  T  H  G  E  A  V  Q  L  L  R  S  V  G  D  T 1183

           3551 ----+----*----+----*----+----*----+----*----+----* 3600
           1227   L  T  V  L  V  C  D  G  F  D  T  S  T  T  T  A   1242
mbg04389   3903 CCCTTACCGTGCTTGTCTGTGATGGTTTTGACACCAGCACCACCACAGCC 3952
                |||| |||||||| |||||||| || || ||  ||||||||  | |||||
ha01022s1  3550 CCCTCACCGTGCTGGTCTGTGACGGCTTCGAGGCCAGCACCGACGCAGCC 3599
           1184   L  T  V  L  V  C  D  G  F  E  A  S  T  D  A  A   1199

           3601 ----+----*----+----*----+----*----+----*----+----* 3650
           1243 L  E  V  S  P  G  V  I  A  N  P  F  A  A  G  L  G  1259
mbg04389   3953 CTGGAGGTGTCCCCAGGTGTCATTGCCAACCCATTTGCAGCAGGCCTTGG 4002
                |||||||||||||||||||||||||||||||| ||||| |||||| | ||
ha01022s1  3600 CTGGAGGTGTCCCCAGGTGTCATTGCCAACCCCTTTGCGGCAGGCATCGG 3649
           1200 L  E  V  S  P  G  V  I  A  N  P  F  A  A  G  I  G  1216

           3651 ----+----*----+----*----+----*----+----*----+----* 3700
           1260  H  R  N  S  L  E  S  I  S  S  I  D  R  E  L  S  P 1276
mbg04389   4003 CCACAGGAACAGTCTGGAAAGCATTTCGTCCATTGACCGGGAACTGAGTC 4052
                |||| ||||||| ||||| ||||| || ||||| |||||||| ||||| |
ha01022s1  3650 CCACCGGAACAGCCTGGAGAGCATCTCTTCCATCGACCGGGAGCTGAGCC 3699
           1217  H  R  N  S  L  E  S  I  S  S  I  D  R  E  L  S  P 1233

           3701 ----+----*----+----*----+----*----+----*----+----* 3750
           1277   E  G  P  G  K  E  K  E  L  A  S  Q  A  L  P  W   1292
mbg04389   4053 CTGAAGGCCCAGGCAAGGAGAAAGAGTTGGCCAGTCAAGCCCTACCCTGG 4102
                |||| ||||||||||||||||| ||| || |  | ||  |||| | ||||
ha01022s1  3700 CTGAGGGCCCAGGCAAGGAGAAGGAGCTGCCTGGACAGACCCTGCACTGG 3749
           1234   E  G  P  G  K  E  K  E  L  P  G  Q  T  L  H  W   1249

           3751 ----+----*----+----*----+----*----+----*----+----* 3800
           1293 E  S  E  S  A  E  T  T  G  R  N  L  E  P  L  K  L  1309
mbg04389   4103 GAGTCTGAGTCTGCAGAGACCACGGGTCGGAATCTAGAGCCCCTGAAGCT 4152
                | | | ||| |  ||||  || | ||||||  |||  |||||||||||||
ha01022s1  3750 GGGCCCGAGGCCACAGAAGCCGCAGGTCGGGGTCTGCAGCCCCTGAAGCT 3799
           1250 G  P  E  A  T  E  A  A  G  R  G  L  Q  P  L  K  L  1266

           3801 ----+----*----+----*----+----*----+----*----+----* 3850
           1310  D  Y  R  A  L  A  A  L  P  S  A  G  S  L  Q  R  G 1326
mbg04389   4153 GGACTACCGTGCCCTGGCCGCCCTGCCCAGTGCTGGGAGTTTACAGAGAG 4202
                ||||||||| |||||||||||| ||||||| ||||| ||  | ||||| |
ha01022s1  3800 GGACTACCGCGCCCTGGCCGCCGTGCCCAGCGCTGGCAGCGTGCAGAGGG 3849
           1267  D  Y  R  A  L  A  A  V  P  S  A  G  S  V  Q  R  V 1283

           3851 ----+----*----+----*----+----*----+----*----+----* 3900
           1327   P  S  A  T  T  G  G  K  T  T  E  A  P  C  S  P   1342
mbg04389   4203 GACCATCTGCGACCACCGGAGGGAAGACGACTGAGGCTCCCTGTTCCCCT 4252
                 ||| ||||   |  | |||||||||| | ||||  ||||||| ||||||
ha01022s1  3850 TACCGTCTGGAGCAGCTGGAGGGAAGATGGCTGAATCTCCCTGCTCCCCT 3899
           1284   P  S  G  A  A  G  G  K  M  A  E  S  P  C  S  P   1299

           3901 ----+----*----+----*----+----*----+----*----+----* 3950
           1343 G  S  Q  Q  P  P  .  .  .  S  P  D  E  L  P  A  N  1356
mbg04389   4253 GGCAGTCAGCAACCACCC.........TCCCCTGATGAACTGCCAGCCAA 4293
                 |  | ||||| || |||         || || ||||| ||||| |||||
ha01022s1  3900 AGTGGCCAGCAGCCGCCCTCCCCGCCTTCTCCGGATGAGCTGCCCGCCAA 3949
           1300 S  G  Q  Q  P  P  S  P  P  S  P  D  E  L  P  A  N  1316

           3951 ----+----*----+----*----+----*----+----*----+----* 4000
           1357  V  K  Q  A  Y  R  A  F  A  A  V  P  T  V  H  P  P 1373
mbg04389   4294 TGTGAAGCAGGCGTATAGGGCCTTTGCAGCTGTGCCCACCGTGCATCCTC 4343
                |||||||||||| || |||||||| || || ||||||||    || || |
ha01022s1  3950 TGTGAAGCAGGCCTACAGGGCCTTCGCGGCCGTGCCCACTTCTCACCCGC 3999
           1317  V  K  Q  A  Y  R  A  F  A  A  V  P  T  S  H  P  P 1333

           4001 ----+----*----+----*----+----*----+----*----+----* 4050
           1374   E  N  S  A  T  Q  P  P  T  P  G  P  A  A  S  P   1389
mbg04389   4344 CTGAGAACAGTGCTACCCAGCCTCCAACACCTGGTCCCGCAGCCTCCCCA 4393
                ||||| |     || ||||||| || || ||||| || ||||||||||| 
ha01022s1  4000 CTGAGGATGCCCCTGCCCAGCCCCCCACGCCTGGGCCTGCAGCCTCCCCG 4049
           1334   E  D  A  P  A  Q  P  P  T  P  G  P  A  A  S  P   1349

           4051 ----+----*----+----*----+----*----+----*----+----* 4100
           1390 E  Q  L  S  F  R  E  R  Q  K  Y  F  E  L  E  V  R  1406
mbg04389   4394 GAGCAGCTGTCCTTCCGTGAACGACAAAAGTACTTCGAGCTGGAGGTACG 4443
                ||||||||||||||||| || || || |||||||| ||||||||||| ||
ha01022s1  4050 GAGCAGCTGTCCTTCCGGGAGCGGCAGAAGTACTTTGAGCTGGAGGTGCG 4099
           1350 E  Q  L  S  F  R  E  R  Q  K  Y  F  E  L  E  V  R  1366

           4101 ----+----*----+----*----+----*----+----*----+----* 4150
           1407  V  P  Q  A  E  G  P  P  K  R  V  S  L  V  G  A  D 1423
mbg04389   4444 GGTGCCTCAAGCAGAGGGTCCCCCAAAGCGTGTGTCCCTGGTGGGTGCTG 4493
                 ||||| || || ||||| ||||| ||||| |||||||||||||||||||
ha01022s1  4100 CGTGCCCCAGGCCGAGGGCCCCCCTAAGCGCGTGTCCCTGGTGGGTGCTG 4149
           1367  V  P  Q  A  E  G  P  P  K  R  V  S  L  V  G  A  D 1383

           4151 ----+----*----+----*----+----*----+----*----+----* 4200
           1424   D  L  R  K  M  Q  E  E  E  A  R  K  L  Q  Q  K   1439
mbg04389   4494 ATGACTTGCGGAAGATGCAAGAGGAGGAAGCTCGCAAGCTGCAGCAGAAG 4543
                | ||| ||||||||||||| |||||||||||  | || || |||||||||
ha01022s1  4150 ACGACCTGCGGAAGATGCAGGAGGAGGAAGCCAGAAAACTACAGCAGAAG 4199
           1384   D  L  R  K  M  Q  E  E  E  A  R  K  L  Q  Q  K   1399

           4201 ----+----*----+----*----+----*----+----*----+----* 4250
           1440 R  A  Q  M  L  R  E  E  A  V  T  S  G  P  D  M  G  1456
mbg04389   4544 AGGGCACAGATGCTTCGGGAAGAAGCAGTAACCTCAGGGCCTGACATGGG 4593
                || || |||||||| |||||    || | |    | ||| | ||   | |
ha01022s1  4200 AGAGCGCAGATGCTGCGGGAG...GCGGCAGAGGCTGGGGCCGAAGCGAG 4246
           1400 R  A  Q  M  L  R  E  .  A  A  E  A  G  A  E  A  R  1415

           4251 ----+----*----+----*----+----*----+----*----+----* 4300
           1457  L  A  S  D  R  E  S  .  P  D  D  Q  Q  E  A  E  Q 1472
mbg04389   4594 GCTCGCCTCAGACAGGGAGTCC...CCAGATGATCAGCAGGAGGCTGAGC 4640
                |||||||   ||| ||||| |       || ||  | ||||||| |||||
ha01022s1  4247 GCTCGCCCTGGACGGGGAGACGCTGGGCGAGGAGGAACAGGAGGATGAGC 4296
           1416  L  A  L  D  G  E  T  L  G  E  E  E  Q  E  D  E  Q 1432

           4301 ----+----*----+----*----+----*----+----*----+----* 4350
           1473   P  .  W  A  V  P  S  H  A  G  G  S  S  P  S  S   1487
mbg04389   4641 AGCCC...TGGGCTGTGCCAAGCCATGCAGGGGGCTCTAGCCCATCATCA 4687
                ||||    |||||    || ||||   |     |    |||||  | || 
ha01022s1  4297 AGCCACCCTGGGCCAGCCCGAGCCCCACCTCAAGGCAGAGCCCGGCGTCC 4346
           1433   P  P  W  A  S  P  S  P  T  S  R  Q  S  P  A  S   1448

           4351 ----+----*----+----*----+----*----+----*----+----* 4400
           1488 P  P  P  L  G  G  N  A  P  V  R  T  A  K  A  E  R  1504
mbg04389   4688 CCCCCACCCCTGGGAGGCAACGCCCCTGTGAGGACAGCCAAAGCTGAGCG 4737
                ||||| |||||||||||   |||||| ||| |||| ||||||||||| ||
ha01022s1  4347 CCCCCGCCCCTGGGAGGTGGCGCCCCGGTGCGGACGGCCAAAGCTGAACG 4396
           1449 P  P  P  L  G  G  G  A  P  V  R  T  A  K  A  E  R  1465

           4401 ----+----*----+----*----+----*----+----*----+----* 4450
           1505  R  H  Q  E  R  L  R  M  Q  S  P  E  L  P  A  P  E 1521
mbg04389   4738 ACGCCATCAGGAACGGTTACGCATGCAGAGCCCCGAGCTTCCTGCCCCGG 4787
                 ||||| ||||| ||| | ||| ||||||| || ||||  || || || |
ha01022s1  4397 GCGCCACCAGGAGCGGCTGCGCGTGCAGAGTCCGGAGCCACCGGCACCCG 4446
           1466  R  H  Q  E  R  L  R  V  Q  S  P  E  P  P  A  P  E 1482

           4451 ----+----*----+----*----+----*----+----*----+----* 4500
           1522   R  A  L  S  P  A  E  R  R  A  L  E  A  E  K  R   1537
mbg04389   4788 AACGGGCCCTGTCTCCTGCAGAGCGACGGGCTCTAGAGGCAGAGAAGCGA 4837
                | || |||||||| ||||| ||||  ||||| || ||||| |||||||| 
ha01022s1  4447 AGCGTGCCCTGTCCCCTGCCGAGCTCCGGGCCCTGGAGGCCGAGAAGCGT 4496
           1483   R  A  L  S  P  A  E  L  R  A  L  E  A  E  K  R   1498

           4501 ----+----*----+----*----+----*----+----*----+----* 4550
           1538 A  L  W  R  A  A  R  M  K  S  L  E  Q  D  A  L  R  1554
mbg04389   4838 GCTCTGTGGAGGGCAGCCAGAATGAAGTCTCTGGAGCAGGATGCTCTCCG 4887
                || ||||||||||||||||| ||||||||  |||| ||||| ||||||||
ha01022s1  4497 GCGCTGTGGAGGGCAGCCAGGATGAAGTCATTGGAACAGGACGCTCTCCG 4546
           1499 A  L  W  R  A  A  R  M  K  S  L  E  Q  D  A  L  R  1515

           4551 ----+----*----+----*----+----*----+----*----+----* 4600
           1555  A  Q  M  V  L  S  K  S  Q  E  G  R  G  K  R  G  P 1571
mbg04389   4888 TGCACAGATGGTCCTTAGCAAGTCCCAGGAAGGCCGTGGCAAGAGAGGAC 4937
                 |||||||||||||| |||| ||||||||||||||| |||| | | || |
ha01022s1  4547 AGCACAGATGGTCCTCAGCAGGTCCCAGGAAGGCCGGGGCACGCGGGGGC 4596
           1516  A  Q  M  V  L  S  R  S  Q  E  G  R  G  T  R  G  P 1532

           4601 ----+----*----+----*----+----*----+----*----+----* 4650
           1572   L  E  R  L  A  E  A  P  S  P  A  P  T  P  S  P   1587
mbg04389   4938 CCTTGGAACGGCTGGCTGAGGCCCCTTCACCTGCCCCCACCCCATCACCC 4987
                || |||| || ||||| ||||||||||| ||||| |||||||| || |||
ha01022s1  4597 CCCTGGAGCGACTGGCCGAGGCCCCTTCCCCTGCGCCCACCCCGTCGCCC 4646
           1533   L  E  R  L  A  E  A  P  S  P  A  P  T  P  S  P   1548

           4651 ----+----*----+----*----+----*----+----*----+----* 4700
           1588 T  P  L  E  D  F  G  L  Q  T  S  A  S  P  G  R  L  1604
mbg04389   4988 ACACCATTGGAAGACTTCGGCCTCCAGACCAGTGCCTCCCCTGGACGCTT 5037
                || ||  |||||||| | |||| |||||||||  ||||||| |||||| |
ha01022s1  4647 ACCCCTGTGGAAGACCTTGGCCCCCAGACCAGCACCTCCCCGGGACGCCT 4696
           1549 T  P  V  E  D  L  G  P  Q  T  S  T  S  P  G  R  L  1565

           4701 ----+----*----+----*----+----*----+----*----+----* 4750
           1605  P  L  S  G  K  K  F  D  Y  R  A  F  A  A  L  P  S 1621
mbg04389   5038 GCCTTTGTCTGGAAAGAAGTTTGACTACAGGGCCTTCGCAGCCCTGCCCT 5087
                |                                                 
ha01022s1  4697 G................................................. 4697
           1565  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  . 1565

           4751 ----+----*----+----*----+----*----+----*----+----* 4800
           1622   S  R  P  V  Y  D  I  Q  S  P  D  F  V  E  E  L   1637
mbg04389   5088 CTTCCAGACCTGTGTATGACATCCAGTCACCAGATTTCGTGGAGGAGCTG 5137
                                          ||||| || || |  |||||| ||
ha01022s1  4698 ..........................TCACCGGACTTTGCTGAGGAGTTG 4721
           1566   .  .  .  .  .  .  .  .  S  P  D  F  A  E  E  L   1573

           4801 ----+----*----+----*----+----*----+----*----+----* 4850
           1638 R  T  L  E  A  S  P  S  P  G  S  Q  E  E  D  G  E  1654
mbg04389   5138 AGGACTTTGGAAGCATCTCCCAGTCCTGGCTCCCAGGAAGAAGATGGAGA 5187
                ||| |  ||||| |||||||||| |||||| | ||||| || ||||||||
ha01022s1  4722 AGGTCCCTGGAACCATCTCCCAGCCCTGGCCCGCAGGAGGAGGATGGAGA 4771
           1574 R  S  L  E  P  S  P  S  P  G  P  Q  E  E  D  G  E  1590

           4851 ----+----*----+----*----+----*----+----*----+----* 4900
           1655  V  A  L  V  L  L  G  R  P  S  P  G  A  V  G  P  E 1671
mbg04389   5188 AGTGGCCTTGGTGCTCCTTGGCAGGCCCTCACCTGGCGCTGTGGGCCCTG 5237
                ||||||  ||||||| || |||||||||||||| ||||||||||||||||
ha01022s1  4772 AGTGGCTCTGGTGCTTCTGGGCAGGCCCTCACCCGGCGCTGTGGGCCCTG 4821
           1591  V  A  L  V  L  L  G  R  P  S  P  G  A  V  G  P  E 1607

           4901 ----+----*----+----*----+----*----+----*----+----* 4950
           1672   D  M  T  L  C  S  S  R  R  S  V  R  P  G  R  R   1687
mbg04389   5238 AAGACATGACGCTGTGCAGCAGCCGTCGGTCTGTGCGGCCGGGACGCCGG 5287
                ||||  || | |||||||||||||| ||  | ||  |||| || ||||| 
ha01022s1  4822 AAGATGTGGCACTGTGCAGCAGCCGCCGCCCCGTAAGGCCTGGGCGCCGT 4871
           1608   D  V  A  L  C  S  S  R  R  P  V  R  P  G  R  R   1623

           4951 ----+----*----+----*---- 4974
           1688 G  L  G  P  V  P  S  *   1695
mbg04389   5288 GGCCTGGGCCCTGTGCCCTCCTAG 5311
                ||||||||||||||||||||||||
ha01022s1  4872 GGCCTGGGCCCTGTGCCCTCCTAG 4895
           1624 G  L  G  P  V  P  S  *   1631


*--[ 3'UTR ]--*
              1 ----+----*----+----*----+----*----+----*----+----* 50
mbg04389   5312 ..GGCCAGGTGCCT.CCCCAGACTTAGGGTGGGAG.CCTGCCAGCTTCAT 5357
                  |  ||||  ||| |||||||||| ||||||| | |||||||||| || 
ha01022s1  4896 AGGAGCAGGCACCTCCCCCAGACTTGGGGTGGGGGCCCTGCCAGCTCCAG 4945

             51 ----+----*----+----*----+----*----+----*----+----* 100
mbg04389   5358 CACCTCCCTTATCCCAAGTCTGTTAGCCTTGGTGTTAGCATTTTAAAGAG 5407
                |||| |||||  ||||||||| ||| ||| ||||||||||||||||||||
ha01022s1  4946 CACCACCCTTGCCCCAAGTCTTTTAACCTGGGTGTTAGCATTTTAAAGAG 4995

            101 ----+----*----+----*----+----*----+----*----+----* 150
mbg04389   5408 ACCCCTCAGGAGTTCTGGCTTCTGATTAACT.......GCCCCGTAGCCG 5450
                ||||| ||||||||||||| | ||| |||||        | ||  |||||
ha01022s1  4996 ACCCCACAGGAGTTCTGGCCTGTGACTAACTAACTGCCCCACCCCAGCCG 5045

            151 ----+----*----+----*----+----*----+----*----+----* 200
mbg04389   5451 GGATCTCGTGGCGAGACTGTAACTAGTGATGTTTGTACAACCAAAGACTC 5500
                 || |||  |||||||||||||||||||||||||||||||||||||||||
ha01022s1  5046 AGACCTC..GGCGAGACTGTAACTAGTGATGTTTGTACAACCAAAGACTC 5093

            201 ----+----*----+----*----+----*----+---- 239
mbg04389   5501 TATTTTGTGGTTTAAGAAGAATAAAGTTACCTACGTTTT 5539
                |||||||||||||||| |||||||||||  |||| ||||
ha01022s1  5094 TATTTTGTGGTTTAAGGAGAATAAAGTTGACTACATTTT 5132