Comparison of KIAA cDNA sequences between mouse and human (KIAA0067)

<< Original sequence data >>

mouse  mKIAA0067 (mbg05662)     length:   5124 bp
human   KIAA0067  (ha01038)     length:   4333 bp


<< Aligned sequence information (excluding the stop codon, if present) >>

----------------------------------------------------------
            length    #match  #mismatch   %diff
----------------------------------------------------------
DNA

  CDS1 :     1780      1589      191      10.73
  CDS2 :     1651      1472      179      10.84
  Total:     3431      3061      370      10.78

  3'UTR:      317       232       85      26.81

amino acid

  CDS1 :      596       561       35       5.87
  CDS2 :      551       500       51       9.26
  Total:     1147      1061       86       7.50
----------------------------------------------------------
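
The %diff column is consistent with #mismatch divided by length, expressed as a percentage and rounded to two decimals. The following Python sketch recomputes it from the rows above. The name pct_diff and the rounding rule are assumptions made for illustration; the report does not state how the column was produced.

```python
def pct_diff(length: int, mismatch: int) -> float:
    """Percent difference, assuming %diff = 100 * #mismatch / length."""
    return round(100.0 * mismatch / length, 2)


# (length, #match, #mismatch, reported %diff), copied from the table above
ROWS = {
    "DNA CDS1": (1780, 1589, 191, 10.73),
    "DNA CDS2": (1651, 1472, 179, 10.84),
    "DNA Total": (3431, 3061, 370, 10.78),
    "DNA 3'UTR": (317, 232, 85, 26.81),
    "aa CDS1": (596, 561, 35, 5.87),
    "aa CDS2": (551, 500, 51, 9.26),
    "aa Total": (1147, 1061, 86, 7.50),
}

for name, (length, match, mismatch, reported) in ROWS.items():
    # Every aligned position is counted as either a match or a mismatch.
    assert match + mismatch == length
    computed = pct_diff(length, mismatch)
    print(f"{name:10s} computed={computed:6.2f} reported={reported:6.2f}")
```

Under this assumption every row reproduces the reported value, and the Total rows equal the sums of the CDS1 and CDS2 rows.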


<< Alignment region (including the stop codon, if present) >>

----------------------------------------------------------
                    cDNA      cDNA original    amino acid
----------------------------------------------------------
  CDS1 : mouse     3 -  1838      3 -  1838      1 -   612
         human   519 -  2300     60 -  3962    154 -   747
  CDS2 : mouse  2925 -  4577   2898 -  4577      1 -   551
         human  2307 -  3962     60 -  3962    750 -  1301
  3'UTR: mouse  4578 -  5124
         human  3963 -  4333
----------------------------------------------------------
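
The amino acid ranges in the table can be related to the cDNA ranges by counting codons from the start of the reading frame. The sketch below assumes that the first number in the "cDNA original" column is the first base of the coding sequence; that reading is an interpretation, not something the report states, and codon_index is a hypothetical helper name.

```python
def codon_index(pos: int, cds_start: int) -> int:
    """1-based number of the codon containing cDNA position `pos`,
    assuming the reading frame begins at `cds_start`."""
    if pos < cds_start:
        raise ValueError("position lies upstream of the assumed CDS start")
    return (pos - cds_start) // 3 + 1


# Human rows: assumed CDS start at position 60.
print(codon_index(519, 60), codon_index(2300, 60))    # CDS1 amino acid range
print(codon_index(2307, 60), codon_index(3962, 60))   # CDS2 amino acid range

# Mouse CDS1 row: assumed CDS start at position 3.
print(codon_index(3, 3), codon_index(1838, 3))

# Mouse CDS2 row: counting from the aligned start (2925) reproduces the
# table; counting from the "cDNA original" start (2898) does not.
print(codon_index(2925, 2925), codon_index(4577, 2925))
print(codon_index(2925, 2898), codon_index(4577, 2898))
```

With these assumptions the human rows and the mouse CDS1 row reproduce the listed amino acid ranges. The mouse CDS2 range (1 - 551) matches only when codons are counted from the aligned start at 2925, so that row appears to use alignment-relative numbering.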


<< Alignment >>

*--[ CDS1 ]--*
             1 ----+----*----+----*----+----*----+----*----+----* 50
             1 P  K  D  Q  K  L  R  E  A  M  A  A  L  R  K  S  A  17
mbg05662     3 CCAAAAGACCAGAAGCTTCGTGAAGCTATGGCTGCCTTAAGAAAATCAGC 52
               ||||||||||||||||| |||||||||||||||||||||||||| |||||
ha01038    519 CCAAAAGACCAGAAGCTCCGTGAAGCTATGGCTGCCTTAAGAAAGTCAGC 568
           154 P  K  D  Q  K  L  R  E  A  M  A  A  L  R  K  S  A  170

            51 ----+----*----+----*----+----*----+----*----+----* 100
            18  Q  D  V  Q  K  F  M  D  A  V  N  K  K  S  S  S  Q 34
mbg05662    53 TCAAGATGTCCAGAAGTTCATGGATGCTGTCAACAAGAAAAGCAGTTCTC 102
               ||||||||| ||||||||||||||||||||||||||||| |||||||| |
ha01038    569 TCAAGATGTTCAGAAGTTCATGGATGCTGTCAACAAGAAGAGCAGTTCCC 618
           171  Q  D  V  Q  K  F  M  D  A  V  N  K  K  S  S  S  Q 187

           101 ----+----*----+----*----+----*----+----*----+----* 150
            35   D  L  H  K  G  T  L  G  Q  V  S  G  E  L  S  K   50
mbg05662   103 AAGATCTACATAAAGGAACCTTGGGTCAGGTGTCTGGAGAACTGAGCAAA 152
               | ||||| ||||||||||||||| ||||| ||||||||||||| ||||||
ha01038    619 AGGATCTGCATAAAGGAACCTTGAGTCAGATGTCTGGAGAACTAAGCAAA 668
           188   D  L  H  K  G  T  L  S  Q  M  S  G  E  L  S  K   203

           151 ----+----*----+----*----+----*----+----*----+----* 200
            51 D  G  D  L  I  V  S  M  R  I  L  G  K  K  R  T  K  67
mbg05662   153 GATGGGGACCTGATAGTCAGCATGCGGATTCTGGGCAAGAAGAGGACTAA 202
               ||||| |||||||||||||||||||| ||||||||||||||||| |||||
ha01038    669 GATGGTGACCTGATAGTCAGCATGCGAATTCTGGGCAAGAAGAGAACTAA 718
           204 D  G  D  L  I  V  S  M  R  I  L  G  K  K  R  T  K  220

           201 ----+----*----+----*----+----*----+----*----+----* 250
            68  T  W  H  K  G  T  L  I  A  I  Q  T  V  G  L  G  K 84
mbg05662   203 GACATGGCACAAAGGCACCCTTATTGCCATCCAGACTGTTGGGCTAGGAA 252
               ||| |||||||||||||||||||||||||||||||| ||||||| ||| |
ha01038    719 GACTTGGCACAAAGGCACCCTTATTGCCATCCAGACAGTTGGGCCAGGGA 768
           221  T  W  H  K  G  T  L  I  A  I  Q  T  V  G  P  G  K 237

           251 ----+----*----+----*----+----*----+----*----+----* 300
            85   K  Y  K  V  K  F  D  N  K  G  K  S  L  L  S  G   100
mbg05662   253 AAAAATACAAAGTGAAATTTGACAACAAAGGAAAGAGTCTGCTATCTGGG 302
               | |||||||| ||||||||||||||||||||||||||||| || || |||
ha01038    769 AGAAATACAAGGTGAAATTTGACAACAAAGGAAAGAGTCTACTGTCGGGG 818
           238   K  Y  K  V  K  F  D  N  K  G  K  S  L  L  S  G   253

           301 ----+----*----+----*----+----*----+----*----+----* 350
           101 N  H  I  A  Y  D  Y  H  P  P  A  D  K  L  F  V  G  117
mbg05662   303 AACCATATTGCCTATGATTACCACCCTCCCGCTGACAAGCTGTTTGTGGG 352
               ||||||||||||||||||||||||||||| ||||||||||||| ||||||
ha01038    819 AACCATATTGCCTATGATTACCACCCTCCTGCTGACAAGCTGTATGTGGG 868
           254 N  H  I  A  Y  D  Y  H  P  P  A  D  K  L  Y  V  G  270

           351 ----+----*----+----*----+----*----+----*----+----* 400
           118  S  R  V  V  A  K  Y  K  D  G  N  Q  V  W  L  Y  A 134
mbg05662   353 CAGTCGAGTGGTGGCCAAGTACAAAGATGGAAATCAGGTCTGGCTTTATG 402
               |||||| ||||| ||||| ||||||||||| |||||||||||||| ||||
ha01038    869 CAGTCGGGTGGTCGCCAAATACAAAGATGGGAATCAGGTCTGGCTCTATG 918
           271  S  R  V  V  A  K  Y  K  D  G  N  Q  V  W  L  Y  A 287

           401 ----+----*----+----*----+----*----+----*----+----* 450
           135   G  I  V  A  E  T  P  N  V  K  N  K  L  R  F  L   150
mbg05662   403 CTGGCATTGTAGCTGAGACCCCTAACGTCAAGAACAAGCTCAGATTTTTA 452
               ||||||||||||||||||| || |||||||| ||||||||||| ||| | 
ha01038    919 CTGGCATTGTAGCTGAGACACCAAACGTCAAAAACAAGCTCAGGTTTCTC 968
           288   G  I  V  A  E  T  P  N  V  K  N  K  L  R  F  L   303

           451 ----+----*----+----*----+----*----+----*----+----* 500
           151 I  F  F  D  D  G  Y  A  S  Y  V  T  Q  S  E  L  Y  167
mbg05662   453 ATTTTTTTTGATGATGGCTATGCTTCCTATGTCACTCAGTCAGAGCTTTA 502
               ||||| ||||||||||||||||||||||||||||| ||||| || || ||
ha01038    969 ATTTTCTTTGATGATGGCTATGCTTCCTATGTCACACAGTCGGAACTGTA 1018
           304 I  F  F  D  D  G  Y  A  S  Y  V  T  Q  S  E  L  Y  320

           501 ----+----*----+----*----+----*----+----*----+----* 550
           168  P  I  C  R  P  L  K  K  T  W  E  D  I  E  D  S  S 184
mbg05662   503 TCCCATTTGCCGACCACTAAAAAAGACTTGGGAGGACATAGAAGATAGCT 552
               |||||||||||| ||||| |||||||||||||||||||||||||| | ||
ha01038   1019 TCCCATTTGCCGGCCACTGAAAAAGACTTGGGAGGACATAGAAGACATCT 1068
           321  P  I  C  R  P  L  K  K  T  W  E  D  I  E  D  I  S 337

           551 ----+----*----+----*----+----*----+----*----+----* 600
           185   C  R  D  F  I  E  E  Y  I  T  A  Y  P  N  R  P   200
mbg05662   553 CCTGCCGAGACTTCATAGAGGAATATATCACTGCCTATCCAAACCGCCCA 602
               ||||||| |||||||||||||| ||| |||||||||| || |||||||| 
ha01038   1069 CCTGCCGTGACTTCATAGAGGAGTATGTCACTGCCTACCCCAACCGCCCC 1118
           338   C  R  D  F  I  E  E  Y  V  T  A  Y  P  N  R  P   353

           601 ----+----*----+----*----+----*----+----*----+----* 650
           201 M  V  L  L  K  S  G  Q  L  I  K  T  E  W  E  G  T  217
mbg05662   603 ATGGTACTTCTCAAGAGTGGGCAGCTTATCAAGACTGAGTGGGAAGGCAC 652
               |||||||| ||||||||||| |||||||||||||||||||||||||||||
ha01038   1119 ATGGTACTGCTCAAGAGTGGCCAGCTTATCAAGACTGAGTGGGAAGGCAC 1168
           354 M  V  L  L  K  S  G  Q  L  I  K  T  E  W  E  G  T  370

           651 ----+----*----+----*----+----*----+----*----+----* 700
           218  W  W  K  S  R  V  E  E  V  D  G  S  L  V  R  I  L 234
mbg05662   653 ATGGTGGAAGTCTCGAGTTGAAGAGGTGGATGGCAGCCTAGTCAGGATCC 702
                ||||||||||| |||||||| ||||||||||||||||||||||||||||
ha01038   1169 GTGGTGGAAGTCCCGAGTTGAGGAGGTGGATGGCAGCCTAGTCAGGATCC 1218
           371  W  W  K  S  R  V  E  E  V  D  G  S  L  V  R  I  L 387

           701 ----+----*----+----*----+----*----+----*----+----* 750
           235   F  L  D  D  K  R  C  E  W  I  Y  R  G  S  T  R   250
mbg05662   703 TCTTTCTGGATGACAAAAGATGTGAGTGGATATATCGAGGCTCTACACGC 752
               |||| |||||||||||||||||||||||||| ||||||||||||||||| 
ha01038   1219 TCTTCCTGGATGACAAAAGATGTGAGTGGATCTATCGAGGCTCTACACGG 1268
           388   F  L  D  D  K  R  C  E  W  I  Y  R  G  S  T  R   403

           751 ----+----*----+----*----+----*----+----*----+----* 800
           251 L  E  P  M  F  S  M  K  T  S  S  A  S  A  M  E  K  267
mbg05662   753 CTGGAACCTATGTTTAGTATGAAGACATCCTCAGCCTCTGCAATGGAGAA 802
               ||||| || ||||| || ||||| |||||||||||||||||| |||||||
ha01038   1269 CTGGAGCCCATGTTCAGCATGAAAACATCCTCAGCCTCTGCACTGGAGAA 1318
           404 L  E  P  M  F  S  M  K  T  S  S  A  S  A  L  E  K  420

           801 ----+----*----+----*----+----*----+----*----+----* 850
           268  K  Q  G  G  Q  L  R  T  R  P  N  M  G  A  V  R  S 284
mbg05662   803 GAAGCAAGGGGGGCAACTCAGAACCCGTCCTAATATGGGTGCTGTGAGGA 852
               |||||||||    || ||||| || ||||| |||||||||||||||||||
ha01038   1319 GAAGCAAGGA...CAGCTCAGGACACGTCCAAATATGGGTGCTGTGAGGA 1365
           421  K  Q  G  .  Q  L  R  T  R  P  N  M  G  A  V  R  S 436

           851 ----+----*----+----*----+----*----+----*----+----* 900
           285   K  G  P  V  V  Q  Y  T  Q  D  L  T  G  T  G  I   300
mbg05662   853 GCAAAGGTCCTGTTGTTCAGTATACACAGGATCTAACTGGTACTGGAATC 902
               ||||||| |||||||| ||||| ||||||||||| || |||||||||| |
ha01038   1366 GCAAAGGCCCTGTTGTCCAGTACACACAGGATCTGACCGGTACTGGAACC 1415
           437   K  G  P  V  V  Q  Y  T  Q  D  L  T  G  T  G  T   452

           901 ----+----*----+----*----+----*----+----*----+----* 950
           301 Q  F  K  P  M  E  P  L  Q  P  I  A  P  P  A  P  .  316
mbg05662   903 CAGTTTAAGCCCATGGAGCCCCTACAGCCTATAGCTCCACCGGCCCCA.. 950
               ||||| |||||  |||| |||| |||||||| ||||||||| ||||||  
ha01038   1416 CAGTTCAAGCCAGTGGAACCCCCACAGCCTACAGCTCCACCTGCCCCACC 1465
           453 Q  F  K  P  V  E  P  P  Q  P  T  A  P  P  A  P  P  469

           951 ----+----*----+----*----+----*----+----*----+----* 1000
           317  .  L  P  I  P  P  L  S  P  Q  A  A  D  T  E  S  L 332
mbg05662   951 ....CTTCCTATACCTCCTCTTTCCCCCCAAGCAGCTGACACTGAAAGCT 996
                   |  |||   || ||||| ||||||||||||| ||||| |||    |
ha01038   1466 TTTCCCACCTGCTCCACCTCTATCCCCCCAAGCAGGTGACAGTGAC...T 1512
           470  F  P  P  A  P  P  L  S  P  Q  A  G  D  S  D  .  L 485

          1001 ----+----*----+----*----+----*----+----*----+----* 1050
           333   E  S  Q  L  A  Q  S  R  K  Q  V  A  K  K  S  T   348
mbg05662   997 TAGAAAGCCAACTTGCACAATCACGGAAACAAGTAGCCAAGAAGAGCACA 1046
               | |||||||| ||||| || |||||||| || |||||||| |||||||| 
ha01038   1513 TGGAAAGCCAGCTTGCCCAGTCACGGAAGCAGGTAGCCAAAAAGAGCACG 1562
           486   E  S  Q  L  A  Q  S  R  K  Q  V  A  K  K  S  T   501

          1051 ----+----*----+----*----+----*----+----*----+----* 1100
           349 S  F  R  P  G  S  V  G  S  G  H  S  S  P  T  S  S  365
mbg05662  1047 TCATTCCGACCAGGATCTGTGGGCTCCGGCCATTCCTCCCCTACTTCATC 1096
               || || |||||||||||||||||||| || |||||||||||||| ||  |
ha01038   1563 TCCTTTCGACCAGGATCTGTGGGCTCTGGTCATTCCTCCCCTACATCTCC 1612
           502 S  F  R  P  G  S  V  G  S  G  H  S  S  P  T  S  P  518

          1101 ----+----*----+----*----+----*----+----*----+----* 1150
           366  T  L  S  E  N  V  S  A  G  K  L  G  I  N  Q  T  Y 382
mbg05662  1097 CACACTCAGTGAAAATGTGTCTGCTGGGAAACTTGGGATAAACCAGACAT 1146
                 |||||||||||||||| |||| |||||||| |||||| ||||||||||
ha01038   1613 TGCACTCAGTGAAAATGTCTCTGGTGGGAAACCTGGGATCAACCAGACAT 1662
           519  A  L  S  E  N  V  S  G  G  K  P  G  I  N  Q  T  Y 535

          1151 ----+----*----+----*----+----*----+----*----+----* 1200
           383   R  S  P  L  A  S  V  T  S  T  P  A  S  A  A  P   398
mbg05662  1147 ATCGGTCACCTTTGGCCTCAGTAACATCTACCCCAGCATCTGCAGCCCCT 1196
               || | |||||||| | |||   | | ||| |||||||| |  |||| |  
ha01038   1663 ATAGATCACCTTTAGGCTCCACAGCCTCTGCCCCAGCACCCTCAGCACTC 1712
           536   R  S  P  L  G  S  T  A  S  A  P  A  P  S  A  L   551

          1201 ----+----*----+----*----+----*----+----*----+----* 1250
           399 P  V  P  P  V  P  P  G  P  P  T  P  P  G  P  P  A  415
mbg05662  1197 CCAGTCCCTCCAGTCCCACCAGGGCCTCCAACCCCTCCAGGGCCTCCAGC 1246
               || | |||||||                                      
ha01038   1713 CCGGCCCCTCCA...................................... 1724
           552 P  A  P  P  .  .  .  .  .  .  .  .  .  .  .  .  .  555

          1251 ----+----*----+----*----+----*----+----*----+----* 1300
           416  P  P  G  P  L  A  P  P  A  F  H  G  M  L  E  R  A 432
mbg05662  1247 TCCTCCAGGGCCTCTAGCTCCTCCAGCCTTCCATGGCATGTTAGAGCGGG 1296
                               || || |||| ||||||||||||| | |||||||
ha01038   1725 ................GCACCCCCAGTCTTCCATGGCATGCTGGAGCGGG 1758
           556  .  .  .  .  .  A  P  P  V  F  H  G  M  L  E  R  A 567

          1301 ----+----*----+----*----+----*----+----*----+----* 1350
           433   P  A  E  P  S  Y  R  A  P  M  E  K  L  F  Y  L   448
mbg05662  1297 CACCAGCTGAGCCCTCCTACCGAGCCCCCATGGAGAAGCTTTTCTATTTA 1346
               | ||||| |||||||||||||| || |||||||||||||||||||| |||
ha01038   1759 CCCCAGCAGAGCCCTCCTACCGTGCTCCCATGGAGAAGCTTTTCTACTTA 1808
           568   P  A  E  P  S  Y  R  A  P  M  E  K  L  F  Y  L   583

          1351 ----+----*----+----*----+----*----+----*----+----* 1400
           449 P  H  V  C  S  Y  T  C  L  S  R  I  R  P  M  R  N  465
mbg05662  1347 CCTCATGTCTGCAGTTACACTTGTTTGTCCCGGATCAGACCCATGAGAAA 1396
               |||||||||||||| || || ||| |||| ||  ||||||| ||||| ||
ha01038   1809 CCTCATGTCTGCAGCTATACCTGTCTGTCTCGAGTCAGACCTATGAGGAA 1858
           584 P  H  V  C  S  Y  T  C  L  S  R  V  R  P  M  R  N  600

          1401 ----+----*----+----*----+----*----+----*----+----* 1450
           466  E  Q  Y  R  G  K  N  P  L  L  V  P  L  L  Y  D  F 482
mbg05662  1397 CGAACAGTATCGGGGCAAGAACCCTCTATTAGTTCCACTTCTGTATGACT 1446
                || ||||| |||||||||||||||||  | || ||  | || |||||||
ha01038   1859 TGAGCAGTACCGGGGCAAGAACCCTCTGCTGGTCCCGTTACTATATGACT 1908
           601  E  Q  Y  R  G  K  N  P  L  L  V  P  L  L  Y  D  F 617

          1451 ----+----*----+----*----+----*----+----*----+----* 1500
           483   R  R  M  T  A  R  R  R  V  N  R  K  M  G  F  H   498
mbg05662  1447 TCCGGAGGATGACAGCACGGCGCAGAGTTAACCGCAAAATGGGCTTTCAT 1496
               ||||| |||||||||| |||||  ||||||||||||| ||||||||||||
ha01038   1909 TCCGGCGGATGACAGCCCGGCGTCGAGTTAACCGCAAGATGGGCTTTCAT 1958
           618   R  R  M  T  A  R  R  R  V  N  R  K  M  G  F  H   633

          1501 ----+----*----+----*----+----*----+----*----+----* 1550
           499 V  I  Y  K  T  P  C  G  L  C  L  R  T  M  Q  E  I  515
mbg05662  1497 GTAATCTATAAGACACCCTGTGGTCTCTGCCTTCGGACGATGCAGGAGAT 1546
               || |||||||||||||| |||||||||||||||||||| |||||||||||
ha01038   1959 GTTATCTATAAGACACCTTGTGGTCTCTGCCTTCGGACAATGCAGGAGAT 2008
           634 V  I  Y  K  T  P  C  G  L  C  L  R  T  M  Q  E  I  650

          1551 ----+----*----+----*----+----*----+----*----+----* 1600
           516  E  R  Y  L  F  E  T  G  C  D  F  L  F  L  E  M  F 532
mbg05662  1547 AGAGCGCTACCTTTTTGAGACTGGCTGTGACTTTCTGTTCCTGGAGATGT 1596
               ||| ||||||||||| ||||||||||||||||| || |||||||||||||
ha01038   2009 AGAACGCTACCTTTTCGAGACTGGCTGTGACTTCCTCTTCCTGGAGATGT 2058
           651  E  R  Y  L  F  E  T  G  C  D  F  L  F  L  E  M  F 667

          1601 ----+----*----+----*----+----*----+----*----+----* 1650
           533   C  L  D  P  Y  V  L  V  D  R  K  F  Q  P  F  K   548
mbg05662  1597 TCTGTTTGGATCCATATGTTCTTGTTGACAGAAAGTTTCAACCCTTTAAG 1646
               ||||||||||||||||||||||||| ||| |||||||||| |||| ||||
ha01038   2059 TCTGTTTGGATCCATATGTTCTTGTGGACCGAAAGTTTCAGCCCTATAAG 2108
           668   C  L  D  P  Y  V  L  V  D  R  K  F  Q  P  Y  K   683

          1651 ----+----*----+----*----+----*----+----*----+----* 1700
           549 P  F  Y  Y  I  L  D  I  T  Y  G  K  E  D  V  P  L  565
mbg05662  1647 CCTTTTTACTATATTTTGGACATCACCTATGGCAAGGAAGATGTTCCCCT 1696
               |||||||||||||||||||||||||| ||||| |||||||||||||||||
ha01038   2109 CCTTTTTACTATATTTTGGACATCACTTATGGGAAGGAAGATGTTCCCCT 2158
           684 P  F  Y  Y  I  L  D  I  T  Y  G  K  E  D  V  P  L  700

          1701 ----+----*----+----*----+----*----+----*----+----* 1750
           566  S  C  V  N  E  I  D  T  T  P  P  P  Q  V  A  Y  S 582
mbg05662  1697 GTCCTGTGTTAATGAGATTGACACAACTCCCCCACCCCAGGTGGCCTACA 1746
                |||||||| ||||||||||||||||| || |||||||||||||||||||
ha01038   2159 ATCCTGTGTCAATGAGATTGACACAACCCCTCCACCCCAGGTGGCCTACA 2208
           701  S  C  V  N  E  I  D  T  T  P  P  P  Q  V  A  Y  S 717

          1751 ----+----*----+----*----+----*----+----*----+----* 1800
           583   K  E  R  I  P  G  K  G  V  F  I  N  T  G  P  E   598
mbg05662  1747 GCAAGGAACGCATTCCTGGCAAGGGTGTTTTCATTAACACAGGCCCTGAA 1796
               |||||||||| || || |||||||||||||||||||||||||||||||||
ha01038   2209 GCAAGGAACGTATCCCGGGCAAGGGTGTTTTCATTAACACAGGCCCTGAA 2258
           718   K  E  R  I  P  G  K  G  V  F  I  N  T  G  P  E   733

          1801 ----+----*----+----*----+----*----+----*-- 1842
           599 F  L  V  G  C  D  C  K  D  G  C  R  D  K   612
mbg05662  1797 TTTCTGGTTGGCTGTGACTGCAAGGATGGGTGTCGGGATAAG 1838
               |||||||||||||||||||||||||||||||||||||| |||
ha01038   2259 TTTCTGGTTGGCTGTGACTGCAAGGATGGGTGTCGGGACAAG 2300
           734 F  L  V  G  C  D  C  K  D  G  C  R  D  K   747
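
The amino acid rows above each codon can be reproduced by translating the nucleotide rows with the standard genetic code. The sketch below does this for the final CDS1 block (mbg05662 1797-1838, ha01038 2259-2300). Use of the standard code is an assumption, and translate is a hypothetical helper, not part of the pipeline that produced this report.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (first base slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame, ungapped DNA string; trailing partial codons
    are ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


mouse = "TTTCTGGTTGGCTGTGACTGCAAGGATGGGTGTCGGGATAAG"  # mbg05662 1797-1838
human = "TTTCTGGTTGGCTGTGACTGCAAGGATGGGTGTCGGGACAAG"  # ha01038  2259-2300

nt_mismatches = sum(a != b for a, b in zip(mouse, human))
print(translate(mouse))
print(translate(human))
print(nt_mismatches)
```

In this block the single nucleotide difference (GAT versus GAC) leaves the encoded residue unchanged, which illustrates why the amino acid %diff in the summary table is lower than the DNA %diff.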



*--[ CDS2 ]--*
             1 ----+----*----+----*----+----*----+----*----+----* 50
             1 C  A  C  H  Q  L  T  I  Q  A  T  A  C  T  P  G  G  17
mbg05662  2925 TGTGCCTGCCACCAGCTAACTATCCAGGCCACAGCCTGTACCCCAGGGGG 2974
               ||||||||||| || |||||||||||||| ||||||||||||||||| ||
ha01038   2307 TGTGCCTGCCATCAACTAACTATCCAGGCTACAGCCTGTACCCCAGGAGG 2356
           750 C  A  C  H  Q  L  T  I  Q  A  T  A  C  T  P  G  G  766

            51 ----+----*----+----*----+----*----+----*----+----* 100
            18  Q  V  N  P  N  S  G  Y  Q  Y  K  R  L  E  E  C  L 34
mbg05662  2975 CCAAGTCAACCCTAACTCTGGCTACCAGTATAAAAGACTAGAAGAGTGTC 3024
               |||| ||||||||||||||||||||||||| || ||||||||||||||||
ha01038   2357 CCAAATCAACCCTAACTCTGGCTACCAGTACAAGAGACTAGAAGAGTGTC 2406
           767  Q  I  N  P  N  S  G  Y  Q  Y  K  R  L  E  E  C  L 783

           101 ----+----*----+----*----+----*----+----*----+----* 150
            35   P  T  G  V  Y  E  C  N  K  R  C  N  C  D  P  N   50
mbg05662  3025 TGCCCACAGGGGTTTATGAGTGTAACAAACGCTGCAATTGTGACCCAAAC 3074
               | ||||||||||| ||||||||||||||||||||||| ||||||||||||
ha01038   2407 TACCCACAGGGGTATATGAGTGTAACAAACGCTGCAAATGTGACCCAAAC 2456
           784   P  T  G  V  Y  E  C  N  K  R  C  K  C  D  P  N   799

           151 ----+----*----+----*----+----*----+----*----+----* 200
            51 M  C  T  N  R  L  V  Q  H  G  L  Q  V  R  L  Q  L  67
mbg05662  3075 ATGTGCACAAATCGGTTGGTGCAGCATGGTCTGCAGGTTCGACTACAGCT 3124
               ||||||||||| ||||||||||| ||||| || || ||||| ||||||||
ha01038   2457 ATGTGCACAAACCGGTTGGTGCAACATGGACTACAAGTTCGGCTACAGCT 2506
           800 M  C  T  N  R  L  V  Q  H  G  L  Q  V  R  L  Q  L  816

           201 ----+----*----+----*----+----*----+----*----+----* 250
            68  F  K  T  Q  N  K  G  W  G  I  R  C  L  D  D  I  A 84
mbg05662  3125 GTTTAAGACACAGAACAAGGGCTGGGGTATCCGCTGCTTGGATGATATTG 3174
                || ||||||||||||||||||||||||||||||||||||||||| ||||
ha01038   2507 ATTCAAGACACAGAACAAGGGCTGGGGTATCCGCTGCTTGGATGACATTG 2556
           817  F  K  T  Q  N  K  G  W  G  I  R  C  L  D  D  I  A 833

           251 ----+----*----+----*----+----*----+----*----+----* 300
            85   K  G  S  F  V  C  I  Y  A  G  K  I  L  T  D  D   100
mbg05662  3175 CCAAAGGCTCTTTTGTCTGCATTTATGCAGGCAAAATCCTGACAGATGAC 3224
               |||||||||||||||| || ||||||||||||||||||||||||||||||
ha01038   2557 CCAAAGGCTCTTTTGTTTGTATTTATGCAGGCAAAATCCTGACAGATGAC 2606
           834   K  G  S  F  V  C  I  Y  A  G  K  I  L  T  D  D   849

           301 ----+----*----+----*----+----*----+----*----+----* 350
           101 F  A  D  K  E  G  L  E  M  G  D  E  Y  F  A  N  L  117
mbg05662  3225 TTTGCAGACAAAGAAGGCCTGGAGATGGGTGATGAGTACTTTGCAAATCT 3274
               ||||||||||| || || ||||| ||||||||||||||||||||||||||
ha01038   2607 TTTGCAGACAAGGAGGGTCTGGAAATGGGTGATGAGTACTTTGCAAATCT 2656
           850 F  A  D  K  E  G  L  E  M  G  D  E  Y  F  A  N  L  866

           351 ----+----*----+----*----+----*----+----*----+----* 400
           118  D  H  I  E  S  V  E  N  F  K  E  G  Y  E  S  D  V 134
mbg05662  3275 GGACCACATTGAAAGTGTGGAGAACTTCAAGGAAGGATATGAGAGTGATG 3324
               |||||| || || || |||||||||||||| |||||||||||||||||||
ha01038   2657 GGACCATATCGAGAGCGTGGAGAACTTCAAAGAAGGATATGAGAGTGATG 2706
           867  D  H  I  E  S  V  E  N  F  K  E  G  Y  E  S  D  A 883

           401 ----+----*----+----*----+----*----+----*----+----* 450
           135   P  T  S  S  D  S  S  G  V  D  M  K  D  Q  E  D   150
mbg05662  3325 TCCCCACTTCCTCTGACAGCAGTGGGGTAGATATGAAGGACCAGGAAGAT 3374
                ||||  |||||||||||||||||| |||||  |||||||||||||||||
ha01038   2707 CCCCCTGTTCCTCTGACAGCAGTGGTGTAGACTTGAAGGACCAGGAAGAT 2756
           884   P  C  S  S  D  S  S  G  V  D  L  K  D  Q  E  D   899

           451 ----+----*----+----*----+----*----+----*----+----* 500
           151 G  N  S  G  S  E  D  P  E  E  S  N  D  D  S  S  D  167
mbg05662  3375 GGCAACAGCGGTTCAGAGGACCCTGAAGAATCCAATGATGACAGCTCTGA 3424
               |||||||||||| |||||||||||||||| ||||||||||| ||||| ||
ha01038   2757 GGCAACAGCGGTACAGAGGACCCTGAAGAGTCCAATGATGATAGCTCAGA 2806
           900 G  N  S  G  T  E  D  P  E  E  S  N  D  D  S  S  D  916

           501 ----+----*----+----*----+----*----+----*----+----* 550
           168  D  N  F  C  K  D  E  D  F  S  T  S  S  V  W  R  S 184
mbg05662  3425 TGATAACTTCTGTAAGGATGAGGACTTCAGCACCAGTTCAGTGTGGCGTA 3474
               |||||||||||||||||||||||||||||||||||||||||||||||| |
ha01038   2807 TGATAACTTCTGTAAGGATGAGGACTTCAGCACCAGTTCAGTGTGGCGGA 2856
           917  D  N  F  C  K  D  E  D  F  S  T  S  S  V  W  R  S 933

           551 ----+----*----+----*----+----*----+----*----+----* 600
           185   Y  A  T  R  R  Q  T  R  G  Q  K  E  N  E  L  S   200
mbg05662  3475 GCTATGCTACCCGGAGGCAGACTCGGGGTCAAAAGGAGAATGAATTGTCT 3524
               |||||||||||||||||||||| ||||| || || ||||| | | | |||
ha01038   2857 GCTATGCTACCCGGAGGCAGACCCGGGGCCAGAAAGAGAACGGACTCTCT 2906
           934   Y  A  T  R  R  Q  T  R  G  Q  K  E  N  G  L  S   949

           601 ----+----*----+----*----+----*----+----*----+----* 650
           201 E  M  T  S  K  D  S  R  P  P  D  L  G  P  P  H  V  217
mbg05662  3525 GAGATGACTTCCAAGGACTCCCGCCCCCCAGACCTCGGGCCTCCACATGT 3574
               ||||  |||||||||||||||| ||||||||| || || || |||||| |
ha01038   2907 GAGACAACTTCCAAGGACTCCCACCCCCCAGATCTTGGACCCCCACATAT 2956
           950 E  T  T  S  K  D  S  H  P  P  D  L  G  P  P  H  I  966

           651 ----+----*----+----*----+----*----+----*----+----* 700
           218  P  I  P  S  S  V  S  V  G  G  C  N  P  P  S  S  E 234
mbg05662  3575 TCCTATCCCTTCCTCAGTATCTGTAGGGGGCTGCAATCCACCTTCCTCTG 3624
               |||| | ||| ||||| |  ||||||| |||||||||||||||||||| |
ha01038   2957 TCCTGTTCCTCCCTCAATCCCTGTAGGTGGCTGCAATCCACCTTCCTCCG 3006
           967  P  V  P  P  S  I  P  V  G  G  C  N  P  P  S  S  E 983

           701 ----+----*----+----*----+----*----+----*----+----* 750
           235   E  T  P  K  N  K  V  A  S  W  L  S  C  N  S  V   250
mbg05662  3625 AAGAGACACCCAAGAACAAGGTGGCCTCGTGGTTGAGTTGCAATAGTGTC 3674
               |||||||||||||||||||||||||||| |||||||| ||||||||||||
ha01038   3007 AAGAGACACCCAAGAACAAGGTGGCCTCATGGTTGAGCTGCAATAGTGTC 3056
           984   E  T  P  K  N  K  V  A  S  W  L  S  C  N  S  V   999

           751 ----+----*----+----*----+----*----+----*----+----* 800
           251 S  E  G  G  F  A  D  S  D  S  R  S  S  F  K  T  S  267
mbg05662  3675 AGTGAAGGTGGATTTGCTGACTCTGACAGCCGTTCTTCCTTCAAGACTAG 3724
               ||||||||||| |||||||||||||| |||| ||| ||||||||||||| 
ha01038   3057 AGTGAAGGTGGTTTTGCTGACTCTGATAGCCATTCATCCTTCAAGACTAA 3106
          1000 S  E  G  G  F  A  D  S  D  S  H  S  S  F  K  T  N  1016

           801 ----+----*----+----*----+----*----+----*----+----* 850
           268  E  G  G  D  G  R  A  G  G  G  R  G  E  A  E  R  A 284
mbg05662  3725 TGAAGGTGGAGATGGCCGTGCTGGGGGAGGCCGGGGAGAGGCTGAAAGGG 3774
               ||||||||| || ||||| ||||||||| ||||    |||||||| | ||
ha01038   3107 TGAAGGTGGGGAGGGCCGGGCTGGGGGAAGCCGAATGGAGGCTGAGAAGG 3156
          1017  E  G  G  E  G  R  A  G  G  S  R  M  E  A  E  K  A 1033

           851 ----+----*----+----*----+----*----+----*----+----* 900
           285   S  T  S  G  L  S  F  K  D  E  G  D  N  K  Q  P   300
mbg05662  3775 CCTCTACCTCAGGATTGAGCTTCAAGGATGAAGGAGACAATAAGCAGCCT 3824
               |||| ||||||||| |  || |||||||||| |||||||  || ||| | 
ha01038   3157 CCTCCACCTCAGGACTAGGCATCAAGGATGAGGGAGACATCAAACAGGCC 3206
          1034   S  T  S  G  L  G  I  K  D  E  G  D  I  K  Q  A   1049

           901 ----+----*----+----*----+----*----+----*----+----* 950
           301 K  K  E  D  P  E  N  R  N  K  M  P  V  V  T  E  G  317
mbg05662  3825 AAAAAAGAGGACCCTGAGAACCGAAACAAGATGCCAGTAGTTACTGAAGG 3874
               || ||||||||| ||||  |||||||||||||| |||||||||||||| |
ha01038   3207 AAGAAAGAGGACACTGACGACCGAAACAAGATGTCAGTAGTTACTGAAAG 3256
          1050 K  K  E  D  T  D  D  R  N  K  M  S  V  V  T  E  S  1066

           951 ----+----*----+----*----+----*----+----*----+----* 1000
           318  S  Q  N  H  G  H  N  P  .  P  M  K  S  E  G  L  R 333
mbg05662  3875 CTCTCAGAATCATGGACATAATCCT...CCCATGAAGTCTGAAGGGCTTC 3921
               |||||  ||| | ||  | ||||||   ||  ||||| ||||||| ||||
ha01038   3257 CTCTCGAAATTACGGTTACAATCCTTCTCCTGTGAAGCCTGAAGGACTTC 3306
          1067  S  R  N  Y  G  Y  N  P  S  P  V  K  P  E  G  L  R 1083

          1001 ----+----*----+----*----+----*----+----*----+----* 1050
           334   R  S  A  S  K  M  S  V  L  Q  S  Q  R  V  V  T   349
mbg05662  3922 GCCGATCAGCTAGTAAAATGTCTGTGCTCCAGAGCCAGCGAGTTGTGACT 3971
               ||||  || ||||||| |    | |||  || ||||   || |  || ||
ha01038   3307 GCCGCCCACCTAGTAAGACTAGTATGCATCAAAGCCGAAGACTCATGGCT 3356
          1084   R  P  P  S  K  T  S  M  H  Q  S  R  R  L  M  A   1099

          1051 ----+----*----+----*----+----*----+----*----+----* 1100
           350 S  T  Q  S  N  P  D  D  I  L  T  L  S  S  S  T  E  366
mbg05662  3972 TCTACTCAGTCAAACCCTGATGACATCCTGACACTGTCCAGCAGCACAGA 4021
               ||| ||||||| |||||||||||  |||||||||||||||||||||||||
ha01038   3357 TCTGCTCAGTCCAACCCTGATGATGTCCTGACACTGTCCAGCAGCACAGA 3406
          1100 S  A  Q  S  N  P  D  D  V  L  T  L  S  S  S  T  E  1116

          1101 ----+----*----+----*----+----*----+----*----+----* 1150
           367  S  E  G  E  S  G  T  S  R  K  P  T  A  G  H  T  S 383
mbg05662  4022 GAGTGAGGGGGAAAGTGGAACCAGCCGAAAGCCCACTGCTGGTCACACTT 4071
                ||||||||||||||||| |||||||||||||||||||||||||| ||||
ha01038   3407 AAGTGAGGGGGAAAGTGGGACCAGCCGAAAGCCCACTGCTGGTCAGACTT 3456
          1117  S  E  G  E  S  G  T  S  R  K  P  T  A  G  Q  T  S 1133

          1151 ----+----*----+----*----+----*----+----*----+----* 1200
           384   A  T  A  V  D  S  D  D  I  Q  T  I  S  S  G  S   399
mbg05662  4072 CAGCCACAGCTGTTGATAGTGATGACATCCAGACCATCTCTTCTGGCTCT 4121
               | || ||||| ||||| |||||||| ||||||||||| || |||||||||
ha01038   3457 CGGCTACAGCGGTTGACAGTGATGATATCCAGACCATATCCTCTGGCTCT 3506
          1134   A  T  A  V  D  S  D  D  I  Q  T  I  S  S  G  S   1149

          1201 ----+----*----+----*----+----*----+----*----+----* 1250
           400 D  G  D  D  F  E  D  K  K  N  L  S  G  P  T  K  R  416
mbg05662  4122 GACGGTGATGACTTTGAGGACAAGAAGAACTTGTCAGGACCAACAAAGCG 4171
               || || |||||||||||||||||||||||| || | || ||||  |||||
ha01038   3507 GAAGGGGATGACTTTGAGGACAAGAAGAACATGACTGGTCCAATGAAGCG 3556
          1150 E  G  D  D  F  E  D  K  K  N  M  T  G  P  M  K  R  1166

          1251 ----+----*----+----*----+----*----+----*----+----* 1300
           417  Q  V  A  V  K  S  T  R  G  F  A  L  K  S  T  H  G 433
mbg05662  4172 CCAGGTGGCAGTAAAATCAACCCGAGGCTTTGCTCTTAAATCAACCCATG 4221
                || ||||||||||||||||||||||||||||||||||||||||||||||
ha01038   3557 TCAAGTGGCAGTAAAATCAACCCGAGGCTTTGCTCTTAAATCAACCCATG 3606
          1167  Q  V  A  V  K  S  T  R  G  F  A  L  K  S  T  H  G 1183

          1301 ----+----*----+----*----+----*----+----*----+----* 1350
           434   I  A  I  K  S  T  N  M  A  S  V  D  K  G  E  S   449
mbg05662  4222 GTATTGCCATTAAATCAACCAACATGGCTTCCGTGGACAAGGGGGAGAGT 4271
               | ||||| |||||||||||||||||||| || ||||||||||||||||| 
ha01038   3607 GGATTGCAATTAAATCAACCAACATGGCCTCTGTGGACAAGGGGGAGAGC 3656
          1184   I  A  I  K  S  T  N  M  A  S  V  D  K  G  E  S   1199

          1351 ----+----*----+----*----+----*----+----*----+----* 1400
           450 A  P  V  R  K  N  T  R  Q  F  Y  D  G  E  E  S  C  466
mbg05662  4272 GCACCAGTTCGTAAGAACACACGCCAGTTCTATGATGGTGAAGAGTCTTG 4321
               ||||| |||||||||||||||||||| ||||||||||| || ||||||||
ha01038   3657 GCACCTGTTCGTAAGAACACACGCCAATTCTATGATGGCGAGGAGTCTTG 3706
          1200 A  P  V  R  K  N  T  R  Q  F  Y  D  G  E  E  S  C  1216

          1401 ----+----*----+----*----+----*----+----*----+----* 1450
           467  Y  I  I  D  A  K  L  E  G  N  L  G  R  Y  L  N  H 483
mbg05662  4322 CTACATCATTGATGCCAAACTTGAAGGCAACCTAGGCCGCTACCTCAATC 4371
               |||||||||||||||||| |||||||||||||| |||||||||||||| |
ha01038   3707 CTACATCATTGATGCCAAGCTTGAAGGCAACCTGGGCCGCTACCTCAACC 3756
          1217  Y  I  I  D  A  K  L  E  G  N  L  G  R  Y  L  N  H 1233

          1451 ----+----*----+----*----+----*----+----*----+----* 1500
           484   S  C  S  P  N  L  F  V  Q  N  V  F  V  D  T  H   499
mbg05662  4372 ACAGTTGCAGCCCCAACCTGTTTGTCCAGAATGTGTTTGTGGATACCCAT 4421
               |||||||||||||||||||||||||||||||||| || ||||||||||||
ha01038   3757 ACAGTTGCAGCCCCAACCTGTTTGTCCAGAATGTCTTCGTGGATACCCAT 3806
          1234   S  C  S  P  N  L  F  V  Q  N  V  F  V  D  T  H   1249

          1501 ----+----*----+----*----+----*----+----*----+----* 1550
           500 D  L  R  F  P  W  V  A  F  F  A  S  K  R  I  R  A  516
mbg05662  4422 GATCTTCGCTTCCCTTGGGTGGCCTTCTTTGCCAGCAAGAGAATCCGGGC 4471
               |||||||||||||| ||||||||||||||||||||||| |||||||||||
ha01038   3807 GATCTTCGCTTCCCCTGGGTGGCCTTCTTTGCCAGCAAAAGAATCCGGGC 3856
          1250 D  L  R  F  P  W  V  A  F  F  A  S  K  R  I  R  A  1266

          1551 ----+----*----+----*----+----*----+----*----+----* 1600
           517  G  T  E  L  T  W  D  Y  N  Y  E  V  G  S  V  E  G 533
mbg05662  4472 TGGAACAGAACTCACTTGGGACTACAACTACGAAGTGGGCAGTGTGGAAG 4521
               ||| |||||||| |||||||||||||||||||| ||||||||||||||||
ha01038   3857 TGGGACAGAACTTACTTGGGACTACAACTACGAGGTGGGCAGTGTGGAAG 3906
          1267  G  T  E  L  T  W  D  Y  N  Y  E  V  G  S  V  E  G 1283

          1601 ----+----*----+----*----+----*----+----*----+----* 1650
           534   K  E  L  L  C  C  C  G  A  I  E  C  R  G  R  L   549
mbg05662  4522 GCAAGGAGCTGCTGTGCTGCTGTGGGGCCATTGAATGCAGAGGGAGACTT 4571
               |||||||||| || || ||||||||||||||||||||||||||  | |||
ha01038   3907 GCAAGGAGCTACTCTGTTGCTGTGGGGCCATTGAATGCAGAGGACGTCTT 3956
          1284   K  E  L  L  C  C  C  G  A  I  E  C  R  G  R  L   1299

          1651 ----+- 1656
           550 L  *   551
mbg05662  4572 CTTTAG 4577
               ||||||
ha01038   3957 CTTTAG 3962
          1300 L  *   1301
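
The line of "|" characters between the two sequences marks identical bases, and "." marks a gap. The sketch below rebuilds that line for one CDS2 block (mbg05662 3875-3921, ha01038 3257-3306) and tallies the columns. How the report scores gapped columns in its summary counts is not stated, so counting them separately here is an assumption; match_line and tally are hypothetical helper names.

```python
def match_line(a: str, b: str, gap: str = ".") -> str:
    """Return '|' for identical non-gap columns and ' ' otherwise."""
    if len(a) != len(b):
        raise ValueError("aligned rows must have equal length")
    return "".join("|" if x == y and x != gap else " " for x, y in zip(a, b))


def tally(a: str, b: str, gap: str = ".") -> tuple[int, int, int]:
    """Count (matches, mismatches, gapped columns) for two aligned rows."""
    matches = mismatches = gaps = 0
    for x, y in zip(a, b):
        if gap in (x, y):
            gaps += 1
        elif x == y:
            matches += 1
        else:
            mismatches += 1
    return matches, mismatches, gaps


mouse = "CTCTCAGAATCATGGACATAATCCT...CCCATGAAGTCTGAAGGGCTTC"  # mbg05662 3875-3921
human = "CTCTCGAAATTACGGTTACAATCCTTCTCCTGTGAAGCCTGAAGGACTTC"  # ha01038  3257-3306

print(match_line(mouse, human))
print(tally(mouse, human))
```

For this block the rebuilt line is identical to the one printed in the report, with 36 matching columns, 11 mismatches and 3 gapped columns.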


*--[ 3'UTR ]--*
             1 ----+----*----+----*----+----*----+----*----+----* 50
mbg05662  4578 .AGGAAGACTTCCTCACTTTGAGAACGCTTGAACTATCCTTTTCCCCAGG 4626
                 |  || |||| || |    |     |||||||| | | ||||| ||||
ha01038   3963 AGGACAGCCTTCTTCCC...AACCCTTCTTGAACTGT.CGTTTCCTCAGG 4008

            51 ----+----*----+----*----+----*----+----*----+----* 100
mbg05662  4627 AACTGGGTCTTCCTGACTGTTGAACTCTGACCCCAAGTCTCTGATCTAGC 4676
               |||||||||||||||| |||||||| ||||||| |||||||||  |||||
ha01038   4009 AACTGGGTCTTCCTGATTGTTGAACCCTGACCCGAAGTCTCTGGGCTAGC 4058

           101 ----+----*----+----*----+----*----+----*----+----* 150
mbg05662  4677 T..TCTTCCCAGCTCCTAGTAGATAGAGATGGGGATTCTCAATCA...GG 4721
               |  ||  ||||||||||||| |||||| |||||| ||||  | ||   | 
ha01038   4059 TACTCCCCCCAGCTCCTAGTTGATAGAAATGGGGGTTCTGGACCAGATGA 4108

           151 ----+----*----+----*----+----*----+----*----+----* 200
mbg05662  4722 ACTTTCCCAGCGTGGTGCTAGCAGGCA....................... 4748
                |  | |||  ||||||||||||||||                       
ha01038   4109 TCCCTTCCAATGTGGTGCTAGCAGGCAGGATCCCTTCTCCACCTCCAAAG 4158

           201 ----+----*----+----*----+----*----+----*----+----* 250
mbg05662  4749 .....CTAGGGTGGGTAGACATGACCACTCTAGCATCAGCCTGAGGTCCT 4793
                      |||||||| ||| || ||||||||| | || ||||||  ||| 
ha01038   4159 GCCCTAAAGGGTGGGGAGAGATCACCACTCTAACCTCGGCCTGACATCCC 4208

           251 ----+----*----+----*----+----*----+----*----+----* 300
mbg05662  4794 TCTCATCTTGTATGCATTCATGATT......................... 4818
               || ||||   |||   | || |  |                         
ha01038   4209 TCCCATCCCATATTTGTCCAAGTGTTCCTGCTTCTAACAGACTTTGTTCT 4258

           301 ----+----*----+----*----+----*----+----*----+----* 350
mbg05662  4819 ...CACATGGGCTGTGTATCTGCCATCCCTGATTTGTATGGTTTCTTGAA 4865
                   |    | |||||||||| | ||| |   |||||||  |||||||||
ha01038   4259 TAGAATGGAGCCTGTGTATCTACTATCTCCAGTTTGTATTATTTCTTGAA 4308

           351 ----+----*----+----*----+----*----+----*----+----* 400
mbg05662  4866 AGTCTTCTAACAAGATGGTAAAGTAGAGATTGTGGTTTGTCTTATCTCCT 4915
               |||||| |||||| ||| ||||                            
ha01038   4309 AGTCTTTTAACAATATGATAAAACT......................... 4333

           401 ----+----*----+----*----+----*----+----*----+----* 450
mbg05662  4916 GCCTTGATCTTCCATGTCATTTCCCAGTGGGCAACTATCATTAGTTTATA 4965
                                                                 
ha01038   4333 .................................................. 4333

           451 ----+----*----+----*----+----*----+----*----+----* 500
mbg05662  4966 CCTCGATTTTATTTGCCCTGTGTAATTTTTAACTAAAAATGTTACATAAC 5015
                                                                 
ha01038   4333 .................................................. 4333

           501 ----+----*----+----*----+----*----+----*----+----* 550
mbg05662  5016 CACAAGGGAGGCCTAAGAAAAAAACTAACCCTTTCTCAAGTCTTAATAGT 5065
                                                                 
ha01038   4333 .................................................. 4333

           551 ----+----*----+----*----+----*----+----*----+----* 600
mbg05662  5066 CTGACACAGTCATGTTTTATTTAACCTTTAATATACTTTAATTAAAAAAA 5115
                                                                 
ha01038   4333 .................................................. 4333

           601 ----+---- 609
mbg05662  5116 ATTAACAGT 5124
                        
ha01038   4333 ......... 4333