Comparison of KIAA cDNA sequences between mouse and human (KIAA0233)

<< Original sequence data >>

mouse  mKIAA0233 (mbg19574)     length:   5959 bp
human   KIAA0233  (ha04602)     length:   6368 bp


<< Aligned sequence information (excluding the stop codon, if present) >>

----------------------------------------------------------
            length    #match  #mismatch   %diff
----------------------------------------------------------
DNA

  CDS1 :      860       745      115      13.37
  CDS2 :     2420      1919      501      20.70
  CDS3 :      432       386       46      10.65
  CDS4 :      638       556       82      12.85
  Total:     4350      3606      744      17.10

  3'UTR:      247       163       84      34.01

amino acid

  CDS1 :      288       260       28       9.72
  CDS2 :      814       631      183      22.48
  CDS3 :      144       138        6       4.17
  CDS4 :      214       195       19       8.88
  Total:     1460      1224      236      16.16
----------------------------------------------------------
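
The %diff column above appears to be the mismatch count divided by the aligned length, expressed as a percentage; every row in the table is consistent with that reading. A minimal Python sketch under that assumption follows. The names pct_diff, totals and DNA_ROWS are illustrative and not part of the original report.

```python
# Assumption: %diff = 100 * #mismatch / length, rounded to two decimals.
# Rows copied from the DNA part of the table: (label, length, #match, #mismatch).
DNA_ROWS = [
    ("CDS1", 860, 745, 115),
    ("CDS2", 2420, 1919, 501),
    ("CDS3", 432, 386, 46),
    ("CDS4", 638, 556, 82),
]


def pct_diff(mismatch: int, length: int) -> float:
    """Percentage of aligned positions that differ."""
    return round(100.0 * mismatch / length, 2)


def totals(rows):
    """Sum length, #match and #mismatch over all rows."""
    length = sum(r[1] for r in rows)
    match = sum(r[2] for r in rows)
    mismatch = sum(r[3] for r in rows)
    return length, match, mismatch


if __name__ == "__main__":
    for label, length, match, mismatch in DNA_ROWS:
        # Each row should satisfy #match + #mismatch == length.
        assert match + mismatch == length
        print(f"{label}: {pct_diff(mismatch, length):.2f}")
    t_len, t_match, t_mis = totals(DNA_ROWS)
    print(f"Total: {t_len} {t_match} {t_mis} {pct_diff(t_mis, t_len):.2f}")
```

The same functions can be applied to the amino acid rows by substituting their length, #match and #mismatch values.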


<< Alignment region (including the stop codon, if present) >>

----------------------------------------------------------
                    cDNA      cDNA original    amino acid
----------------------------------------------------------
  CDS1 : mouse    97 -   954     82 -  1269      6 -   291
         human  1701 -  2573      3 -  6110    567 -   857
  CDS2 : mouse   984 -  3473    954 -  3533      1 -   830
         human  2610 -  5039      3 -  6110    870 -  1679
  CDS3 : mouse  4531 -  4962   4489 -  4977      1 -   144
         human  5040 -  5471      3 -  6110   1680 -  1823
  CDS4 : mouse  5028 -  5696   5010 -  5696      1 -   223
         human  5472 -  6110      3 -  6110   1824 -  2036
  3'UTR: mouse  5697 -  5959
         human  6111 -  6368
----------------------------------------------------------
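
The "cDNA" and "amino acid" ranges in the table above can be cross-checked against each other: if the coordinates are 1-based and inclusive, each nucleotide span should be three times the corresponding amino acid span. The sketch below is a hedged illustration of that check. The inclusive-coordinate reading and the names REGIONS, span and codon_consistent are assumptions, and the "cDNA original" column is deliberately left out because its meaning is not stated in the report.

```python
# Assumption: ranges are 1-based and inclusive, so a..b covers b - a + 1 positions.
# Rows copied from the "cDNA" and "amino acid" columns:
# (region, species, nt_start, nt_end, aa_start, aa_end)
REGIONS = [
    ("CDS1", "mouse", 97, 954, 6, 291),
    ("CDS1", "human", 1701, 2573, 567, 857),
    ("CDS2", "mouse", 984, 3473, 1, 830),
    ("CDS2", "human", 2610, 5039, 870, 1679),
    ("CDS3", "mouse", 4531, 4962, 1, 144),
    ("CDS3", "human", 5040, 5471, 1680, 1823),
    ("CDS4", "mouse", 5028, 5696, 1, 223),
    ("CDS4", "human", 5472, 6110, 1824, 2036),
]


def span(start: int, end: int) -> int:
    """Number of positions in an inclusive range."""
    return end - start + 1


def codon_consistent(region) -> bool:
    """True if the nucleotide span equals three times the amino acid span."""
    _, _, nt_start, nt_end, aa_start, aa_end = region
    return span(nt_start, nt_end) == 3 * span(aa_start, aa_end)


if __name__ == "__main__":
    for region in REGIONS:
        name, species = region[0], region[1]
        print(name, species, span(region[2], region[3]), codon_consistent(region))
```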


<< Alignment >>

*--[ CDS1 ]--*
             1 ----+----*----+----*----+----*----+----*----+----* 50
             6 F  L  M  C  .  .  .  .  P  .  L  S  T  D  Y  P  W  17
mbg19574    97 TTTCTGATGTGC............CCC...CTGTCCACAGACTATCCATG 131
               |  ||| |||||            ||    |||| ||  || ||||| ||
ha04602   1701 TACCTGCTGTGCCTGGGGATGCCCCCGGCCCTGTGCATTGATTATCCCTG 1750
           567 Y  L  L  C  L  G  M  P  P  A  L  C  I  D  Y  P  W  583

            51 ----+----*----+----*----+----*----+----*----+----* 100
            18  R  W  S  K  A  I  P  M  N  S  A  L  I  K  W  L  Y 34
mbg19574   132 GCGCTGGAGCAAGGCCATCCCCATGAATTCCGCCCTCATCAAGTGGCTGT 181
               ||||||||||  |||| |||||||||| ||||| ||||||||||||||||
ha04602   1751 GCGCTGGAGCCGGGCCGTCCCCATGAACTCCGCACTCATCAAGTGGCTGT 1800
           584  R  W  S  R  A  V  P  M  N  S  A  L  I  K  W  L  Y 600

           101 ----+----*----+----*----+----*----+----*----+----* 150
            35   L  P  D  F  F  R  A  P  N  S  T  N  L  I  S  D   50
mbg19574   182 ACCTACCTGACTTCTTCAGAGCCCCCAACTCCACCAACCTTATCAGTGAC 231
               |||| ||||| |||||| | |||||||||||||||||||| ||||| |||
ha04602   1801 ACCTGCCTGATTTCTTCCGGGCCCCCAACTCCACCAACCTCATCAGCGAC 1850
           601   L  P  D  F  F  R  A  P  N  S  T  N  L  I  S  D   616

           151 ----+----*----+----*----+----*----+----*----+----* 200
            51 F  L  L  L  L  C  A  S  Q  Q  W  Q  V  F  S  A  E  67
mbg19574   232 TTCCTCCTGCTGCTTTGCGCCTCCCAGCAGTGGCAGGTCTTCTCAGCGGA 281
               || ||||||||||| ||||||||||||||||||||||| |||||||| ||
ha04602   1851 TTTCTCCTGCTGCTGTGCGCCTCCCAGCAGTGGCAGGTGTTCTCAGCTGA 1900
           617 F  L  L  L  L  C  A  S  Q  Q  W  Q  V  F  S  A  E  633

           201 ----+----*----+----*----+----*----+----*----+----* 250
            68  R  T  E  E  W  Q  R  M  A  G  I  N  T  D  H  L  E 84
mbg19574   282 GCGAACGGAGGAGTGGCAACGCATGGCGGGCATCAACACTGACCACCTGG 331
               ||| || ||||||||||| |||||||| ||| ||||||| |||| |||||
ha04602   1901 GCGCACAGAGGAGTGGCAGCGCATGGCTGGCGTCAACACCGACCGCCTGG 1950
           634  R  T  E  E  W  Q  R  M  A  G  V  N  T  D  R  L  E 650

           251 ----+----*----+----*----+----*----+----*----+----* 300
            85   P  L  R  G  E  P  N  P  I  P  N  F  I  H  C  R   100
mbg19574   332 AGCCCCTGCGTGGGGAGCCCAACCCTATACCCAACTTCATCCACTGCAGG 381
               |||| ||||| ||||||||||||||  | |||||||| ||||||||||||
ha04602   1951 AGCCGCTGCGGGGGGAGCCCAACCCCGTGCCCAACTTTATCCACTGCAGG 2000
           651   P  L  R  G  E  P  N  P  V  P  N  F  I  H  C  R   666

           301 ----+----*----+----*----+----*----+----*----+----* 350
           101 S  Y  L  D  M  L  K  V  A  V  F  R  Y  L  F  W  L  117
mbg19574   382 TCCTATCTGGATATGCTGAAGGTGGCCGTCTTCCGCTACCTGTTCTGGCT 431
               ||||| || || ||||||||||||||||||||||| ||||||||||||||
ha04602   2001 TCCTACCTTGACATGCTGAAGGTGGCCGTCTTCCGATACCTGTTCTGGCT 2050
           667 S  Y  L  D  M  L  K  V  A  V  F  R  Y  L  F  W  L  683

           351 ----+----*----+----*----+----*----+----*----+----* 400
           118  V  L  V  V  V  F  V  A  G  A  T  R  I  S  I  F  G 134
mbg19574   432 GGTGCTCGTTGTGGTGTTTGTTGCGGGGGCCACCCGCATAAGCATCTTCG 481
               |||||| || |||||||||||  |||||||||||||||| ||||||||||
ha04602   2051 GGTGCTGGTGGTGGTGTTTGTCACGGGGGCCACCCGCATCAGCATCTTCG 2100
           684  V  L  V  V  V  F  V  T  G  A  T  R  I  S  I  F  G 700

           401 ----+----*----+----*----+----*----+----*----+----* 450
           135   L  G  Y  L  L  A  C  F  Y  L  L  L  F  G  T  T   150
mbg19574   482 GGCTGGGGTACCTGCTAGCCTGCTTCTACCTGCTGCTGTTTGGCACTACC 531
               ||||||| |||||||| |||||||||||||||||||| || |||||  ||
ha04602   2101 GGCTGGGCTACCTGCTGGCCTGCTTCTACCTGCTGCTCTTCGGCACGGCC 2150
           701   L  G  Y  L  L  A  C  F  Y  L  L  L  F  G  T  A   716

           451 ----+----*----+----*----+----*----+----*----+----* 500
           151 L  L  Q  K  D  T  R  A  Q  L  V  L  W  D  C  L  I  167
mbg19574   532 CTGCTGCAGAAGGACACGCGAGCCCAGCTCGTGCTGTGGGACTGCCTCAT 581
               |||||||||| |||||| || ||||  |||||||||||||||||||||||
ha04602   2151 CTGCTGCAGAGGGACACACGGGCCCGCCTCGTGCTGTGGGACTGCCTCAT 2200
           717 L  L  Q  R  D  T  R  A  R  L  V  L  W  D  C  L  I  733

           501 ----+----*----+----*----+----*----+----*----+----* 550
           168  L  Y  N  V  T  V  I  I  S  K  N  M  L  S  L  L  S 184
mbg19574   582 CCTCTATAATGTCACTGTCATCATCTCTAAGAATATGCTGTCGCTCCTGT 631
                || || || ||||| ||||||||||| ||||| ||||||||||||||| 
ha04602   2201 TCTGTACAACGTCACCGTCATCATCTCCAAGAACATGCTGTCGCTCCTGG 2250
           734  L  Y  N  V  T  V  I  I  S  K  N  M  L  S  L  L  A 750

           551 ----+----*----+----*----+----*----+----*----+----* 600
           185   C  V  F  V  E  Q  M  Q  S  N  F  C  W  V  I  Q   200
mbg19574   632 CCTGTGTCTTCGTGGAGCAAATGCAGAGCAACTTCTGCTGGGTCATCCAG 681
               |||| |||||||||||||| ||||||| |  |||||||||||||||||||
ha04602   2251 CCTGCGTCTTCGTGGAGCAGATGCAGACCGGCTTCTGCTGGGTCATCCAG 2300
           751   C  V  F  V  E  Q  M  Q  T  G  F  C  W  V  I  Q   766

           601 ----+----*----+----*----+----*----+----*----+----* 650
           201 L  F  S  L  V  C  T  V  K  G  Y  Y  D  P  K  E  M  217
mbg19574   682 CTCTTCAGCCTCGTGTGCACAGTCAAAGGCTACTATGATCCCAAAGAGAT 731
               ||||||||||| || ||||| ||||| ||||||||||| ||||| |||||
ha04602   2301 CTCTTCAGCCTTGTATGCACCGTCAAGGGCTACTATGACCCCAAGGAGAT 2350
           767 L  F  S  L  V  C  T  V  K  G  Y  Y  D  P  K  E  M  783

           651 ----+----*----+----*----+----*----+----*----+----* 700
           218  M  T  R  D  R  D  C  L  L  P  V  E  E  A  G  I  I 234
mbg19574   732 GATGACCAGGGACCGGGACTGCCTGCTGCCTGTGGAGGAGGCCGGGATCA 781
               ||||  ||| |||| ||||||||||||||||||||||||||| || ||||
ha04602   2351 GATGGACAGAGACCAGGACTGCCTGCTGCCTGTGGAGGAGGCTGGCATCA 2400
           784  M  D  R  D  Q  D  C  L  L  P  V  E  E  A  G  I  I 800

           701 ----+----*----+----*----+----*----+----*----+----* 750
           235   W  D  S  I  C  F  F  F  L  L  L  Q  R  R  I  F   250
mbg19574   782 TCTGGGACAGTATCTGCTTCTTCTTCCTGCTCTTGCAACGGCGCATCTTT 831
               ||||||||||  |||||||||||||||||||  |||| || ||| |||| 
ha04602   2401 TCTGGGACAGCGTCTGCTTCTTCTTCCTGCTGCTGCAGCGCCGCGTCTTC 2450
           801   W  D  S  V  C  F  F  F  L  L  L  Q  R  R  V  F   816

           751 ----+----*----+----*----+----*----+----*----+----* 800
           251 L  S  H  Y  F  L  H  V  S  A  D  L  K  A  T  A  L  267
mbg19574   832 CTCAGCCACTACTTCCTGCATGTCAGCGCTGACCTGAAAGCCACAGCCCT 881
               || ||||| |||| |||||| ||||| || |||||  | ||||| |||||
ha04602   2451 CTTAGCCATTACTACCTGCACGTCAGGGCCGACCTCCAGGCCACCGCCCT 2500
           817 L  S  H  Y  Y  L  H  V  R  A  D  L  Q  A  T  A  L  833

           801 ----+----*----+----*----+----*----+----*----+----* 850
           268  Q  A  S  R  G  F  A  L  Y  N  A  A  N  L  K  S  I 284
mbg19574   882 GCAGGCATCCAGGGGCTTTGCCCTCTACAATGCAGCCAACCTGAAGAGCA 931
               ||  || ||||||||||| ||||||||||| || |||||||| |||||||
ha04602   2501 GCTAGCCTCCAGGGGCTTCGCCCTCTACAACGCTGCCAACCTCAAGAGCA 2550
           834  L  A  S  R  G  F  A  L  Y  N  A  A  N  L  K  S  I 850

           851 ----+----*----+----*--- 873
           285   N  F  H  R  Q  I  E   291
mbg19574   932 TCAACTTCCATCGCCAGATTGAG 954
               |  ||||||| |||  ||| |||
ha04602   2551 TTGACTTCCACCGCAGGATAGAG 2573
           851   D  F  H  R  R  I  E   857
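

Each alignment block pairs a mouse row (mbg19574) with a human row (ha04602) and places a line of "|" marks between them wherever the two nucleotides are identical; "." marks a gap. The sketch below shows one way such a match line could be derived from two aligned rows. It is an illustration only, not the program that produced this report, and the names match_line, MOUSE_ROW and HUMAN_ROW are hypothetical. The example rows are copied from the CDS1 block covering mouse 132 - 181 and human 1751 - 1800.

```python
GAP = "."  # gap character as used in the alignment rows of this report

# Example rows copied from the CDS1 block (mouse 132-181, human 1751-1800).
MOUSE_ROW = "GCGCTGGAGCAAGGCCATCCCCATGAATTCCGCCCTCATCAAGTGGCTGT"
HUMAN_ROW = "GCGCTGGAGCCGGGCCGTCCCCATGAACTCCGCACTCATCAAGTGGCTGT"


def match_line(seq_a: str, seq_b: str) -> str:
    """Return '|' under identical, non-gap columns and ' ' elsewhere."""
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned rows must have the same length")
    return "".join(
        "|" if a == b and a != GAP else " "
        for a, b in zip(seq_a, seq_b)
    )


if __name__ == "__main__":
    print(MOUSE_ROW)
    print(match_line(MOUSE_ROW, HUMAN_ROW))
    print(HUMAN_ROW)
```

How gap columns contribute to the #match and #mismatch totals is not stated in the report, so the sketch only reproduces the match line and does not attempt to recompute those totals.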



*--[ CDS2 ]--*
             1 ----+----*----+----*----+----*----+----*----+----* 50
             1 R  I  R  A  K  Q  E  K  Y  R  Q  S  Q  A  S  R  G  17
mbg19574   984 CGCATCCGTGCCAAACAGGAGAAGTACAGGCAGAGCCAGGCAAGTCGTGG 1033
               || ||||||||||| ||||||||| |||||||| ||| ||     ||  |
ha04602   2610 CGTATCCGTGCCAAGCAGGAGAAGCACAGGCAGGGCCGGGTGGACCGCAG 2659
           870 R  I  R  A  K  Q  E  K  H  R  Q  G  R  V  D  R  S  886

            51 ----+----*----+----*----+----*----+----*----+----* 100
            18  Q  L  Q  .  S  K  D  P  Q  D  P  S  Q  E  P  G  P 33
mbg19574  1034 CCAACTCCAG...TCCAAAGACCCTCAGGATCCCAGCCAGGAGCCAGGGC 1080
                |  | ||||    ||   | |||  |||| ||| ||| |||||||||||
ha04602   2660 TCGCCCCCAGGACACCCTGGGCCCCAAGGACCCCGGCCTGGAGCCAGGGC 2709
           887  R  P  Q  D  T  L  G  P  K  D  P  G  L  E  P  G  P 903

           101 ----+----*----+----*----+----*----+----*----+----* 150
            34   D  S  P  G  G  S  S  P  P  R  R  Q  W  W  R  P   49
mbg19574  1081 CTGACAGCCCAGGGGGCTCCTCCCCGCCACGGAGACAGTGGTGGCGCCCC 1130
               | ||||| |||||||||||||||||||||||||| ||||||||||| |||
ha04602   2710 CCGACAGTCCAGGGGGCTCCTCCCCGCCACGGAGGCAGTGGTGGCGGCCC 2759
           904   D  S  P  G  G  S  S  P  P  R  R  Q  W  W  R  P   919

           151 ----+----*----+----*----+----*----+----*----+----* 200
            50 W  L  D  H  A  T  V  I  H  S  G  D  Y  F  L  F  E  66
mbg19574  1131 TGGCTGGACCACGCCACAGTCATCCACTCTGGCGACTACTTCCTGTTTGA 1180
               ||||||||||||||||||||||||||||| || |||||||||||||||||
ha04602   2760 TGGCTGGACCACGCCACAGTCATCCACTCCGGGGACTACTTCCTGTTTGA 2809
           920 W  L  D  H  A  T  V  I  H  S  G  D  Y  F  L  F  E  936

           201 ----+----*----+----*----+----*----+----*----+----* 250
            67  S  D  S  E  E  E  E  E  A  L  P  E  D  P  R  P  A 83
mbg19574  1181 GTCAGATAGCGAGGAGGAAGAGGAGGCCCTACCTGAGGACCCCAGGCCTG 1230
               ||| || || ||||| || ||||||||  | ||||| ||||| |||||  
ha04602   2810 GTCCGACAGTGAGGAAGAGGAGGAGGCTGTTCCTGAAGACCCGAGGCCGT 2859
           937  S  D  S  E  E  E  E  E  A  V  P  E  D  P  R  P  S 953

           251 ----+----*----+----*----+----*----+----*----+----* 300
            84   A  Q  S  A  F  Q  M  A  Y  Q  A  W  V  T  N  A   99
mbg19574  1231 CAGCTCAGAGTGCCTTCCAGATGGCATACCAGGCATGGGTAACCAATGCC 1280
               | || ||||||||||||||| |||| |||||||||||||| ||||| |||
ha04602   2860 CGGCACAGAGTGCCTTCCAGCTGGCGTACCAGGCATGGGTGACCAACGCC 2909
           954   A  Q  S  A  F  Q  L  A  Y  Q  A  W  V  T  N  A   969

           301 ----+----*----+----*----+----*----+----*----+----* 350
           100 Q  T  V  L  R  Q  R  .  .  .  R  E  R  A  R  Q  E  113
mbg19574  1281 CAGACAGTGCTGAGGCAGCGT.........CGGGAGCGGGCACGGCAGGA 1321
               ||| | |||||||||| |||          | ||||| |||| |||||||
ha04602   2910 CAGGCGGTGCTGAGGCGGCGGCAGCAGGAGCAGGAGCAGGCAAGGCAGGA 2959
           970 Q  A  V  L  R  R  R  Q  Q  E  Q  E  Q  A  R  Q  E  986

           351 ----+----*----+----*----+----*----+----*----+----* 400
           114  R  A  E  Q  L  A  S  G  G  D  L  N  P  D  V  E  P 130
mbg19574  1322 GCGGGCAGAGCAGCTGGCTTCTGGAGGTGACTTGAACCCAGATGTGGAAC 1371
                | |||||  |||||  |  | |||||||     | ||  || ||||| |
ha04602   2960 ACAGGCAGGACAGCTACCCACAGGAGGTGGTCCCAGCCAGGAGGTGGAGC 3009
           987  Q  A  G  Q  L  P  T  G  G  G  P  S  Q  E  V  E  P 1003

           401 ----+----*----+----*----+----*----+----*----+----* 450
           131   V  D  V  P  E  D  E  M  A  G  R  S  H  M  M  Q   146
mbg19574  1372 CAGTAGATGTCCCAGAAGATGAGATGGCAGGCCGTAGCCACATGATGCAG 1421
               ||| ||| | ||| || || |    ||||||||| |||||  || |||||
ha04602   3010 CAGCAGAGGGCCCCGAGGAGGCAGCGGCAGGCCGGAGCCATGTGGTGCAG 3059
          1004   A  E  G  P  E  E  A  A  A  G  R  S  H  V  V  Q   1019

           451 ----+----*----+----*----+----*----+----*----+----* 500
           147 R  V  L  S  T  M  Q  F  L  W  V  L  G  Q  A  T  V  163
mbg19574  1422 CGTGTGCTAAGCACCATGCAGTTCCTGTGGGTGCTGGGCCAGGCCACGGT 1471
                | ||||| |||||   ||||||||||||| ||||||| |||||    ||
ha04602   3060 AGGGTGCTGAGCACGGCGCAGTTCCTGTGGATGCTGGGGCAGGCGCTAGT 3109
          1020 R  V  L  S  T  A  Q  F  L  W  M  L  G  Q  A  L  V  1036

           501 ----+----*----+----*----+----*----+----*----+----* 550
           164  D  G  L  T  R  W  L  R  A  F  T  K  H  H  R  T  M 180
mbg19574  1472 AGACGGGCTGACGCGCTGGCTGCGTGCATTCACGAAGCACCACCGCACCA 1521
                || | |||||| ||||||||||  |  |||||   ||||||| ||||||
ha04602   3110 GGATGAGCTGACACGCTGGCTGCAGGAGTTCACCCGGCACCACGGCACCA 3159
          1037  D  E  L  T  R  W  L  Q  E  F  T  R  H  H  G  T  M 1053

           551 ----+----*----+----*----+----*----+----*----+----* 600
           181   S  D  V  L  C  A  E  R  Y  L  L  T  Q  E  L  L   196
mbg19574  1522 TGAGCGATGTGCTGTGCGCAGAGCGCTACCTGCTCACCCAGGAGCTTCTT 1571
               ||||||| |||||| | |||||||||||||| ||||| |||||||| || 
ha04602   3160 TGAGCGACGTGCTGCGGGCAGAGCGCTACCTCCTCACACAGGAGCTCCTG 3209
          1054   S  D  V  L  R  A  E  R  Y  L  L  T  Q  E  L  L   1069

           601 ----+----*----+----*----+----*----+----*----+----* 650
           197 R  V  G  E  V  R  R  G  V  L  D  Q  L  Y  V  G  E  213
mbg19574  1572 CGGGTTGGAGAGGTACGCCGAGGTGTGCTGGACCAGCTTTATGTGGGTGA 1621
               | ||  || || || | | | || |||||||| ||||| ||     |  |
ha04602   3210 CAGGGCGGCGAAGTGCACAGGGGCGTGCTGGATCAGCTGTACACAAGCCA 3259
          1070 Q  G  G  E  V  H  R  G  V  L  D  Q  L  Y  T  S  Q  1086

           651 ----+----*----+----*----+----*----+----*----+----* 700
           214  D  E  A  T  L  S  G  P  V  E  T  R  D  G  P  S  T 230
mbg19574  1622 AGATGAGGCCACATTGTCAGGTCCCGTGGAGACCCGGGATGGACCCAGCA 1671
                |  ||||||||  || |||| |||   ||| |||   |||  || ||||
ha04602   3260 GGCCGAGGCCACGCTGCCAGGCCCCACCGAGGCCCCCAATGCCCCAAGCA 3309
          1087  A  E  A  T  L  P  G  P  T  E  A  P  N  A  P  S  T 1103

           701 ----+----*----+----*----+----*----+----*----+----* 750
           231   A  S  S  G  L  G  A  E  E  P  L  S  S  M  T  D   246
mbg19574  1672 CAGCCTCAAGTGGGCTGGGAGCCGAAGAGCCTTTGAGTAGCATGACAGAC 1721
               | |  || ||||||||||| || || |||||  | || ||||||||||||
ha04602   3310 CCGTGTCCAGTGGGCTGGGCGCGGAGGAGCCACTCAGCAGCATGACAGAC 3359
          1104   V  S  S  G  L  G  A  E  E  P  L  S  S  M  T  D   1119

           751 ----+----*----+----*----+----*----+----*----+----* 800
           247 D  T  S  S  P  L  S  T  G  Y  N  T  R  S  G  S  E  263
mbg19574  1722 GACACCAGCAGCCCCCTGAGCACAGGCTATAACACCCGCAGTGGCAGTGA 1771
               ||||   |||||||||||||||| |||||  |||| ||||||||||||||
ha04602   3360 GACATGGGCAGCCCCCTGAGCACCGGCTACCACACGCGCAGTGGCAGTGA 3409
          1120 D  M  G  S  P  L  S  T  G  Y  H  T  R  S  G  S  E  1136

           801 ----+----*----+----*----+----*----+----*----+----* 850
           264  E  I  V  T  D  A  G  D  L  Q  A  G  T  S  L  H  G 280
mbg19574  1772 GGAGATTGTCACCGACGCTGGGGACCTCCAGGCTGGGACCTCCCTGCACG 1821
               ||||   ||||||||| | ||||| |   |||||||  |||| ||| || 
ha04602   3410 GGAGGCAGTCACCGACCCCGGGGAGCGTGAGGCTGGTGCCTCTCTGTAC. 3458
          1137  E  A  V  T  D  P  G  E  R  E  A  G  A  S  L  Y  . 1152

           851 ----+----*----+----*----+----*----+----*----+----* 900
           281   S  Q  E  L  L  A  N  A  R  T  R  M  R  T  A  S   296
mbg19574  1822 GCTCCCAAGAGCTTTTAGCCAATGCTCGTACCCGGATGCGCACGGCCAGC 1871
                    || |  ||                      ||||| |||||||||
ha04602   3459 .....CAGGGACTG.....................ATGCGGACGGCCAGC 3482
          1153   .  Q  G  L  .  .  .  .  .  .  .  M  R  T  A  S   1160

           901 ----+----*----+----*----+----*----+----*----+----* 950
           297 E  L  L  L  D  R  R  L  H  I  P  E  L  E  E  A  E  313
mbg19574  1872 GAGCTGCTACTGGATAGGCGCCTGCATATCCCTGAGCTGGAGGAGGCCGA 1921
               |||||||| ||||| ||||||||||  ||||| |||||||||||||| ||
ha04602   3483 GAGCTGCTCCTGGACAGGCGCCTGCGCATCCCAGAGCTGGAGGAGGCAGA 3532
          1161 E  L  L  L  D  R  R  L  R  I  P  E  L  E  E  A  E  1177

           951 ----+----*----+----*----+----*----+----*----+----* 1000
           314  R  F  E  A  Q  Q  G  R  T  L  R  L  L  R  A  G  Y 330
mbg19574  1922 GCGGTTTGAGGCACAGCAGGGCCGGACTCTGCGGCTGCTCAGGGCTGGGT 1971
               || ||||| ||    |||||||||| | |||||||||||  |||| | ||
ha04602   3533 GCTGTTTGCGGAGGGGCAGGGCCGGGCGCTGCGGCTGCTGCGGGCCGTGT 3582
          1178  L  F  A  E  G  Q  G  R  A  L  R  L  L  R  A  V  Y 1194

          1001 ----+----*----+----*----+----*----+----*----+----* 1050
           331   Q  C  V  A  A  H  S  E  L  L  C  Y  F  I  I  I   346
mbg19574  1972 ACCAGTGCGTGGCGGCACACTCGGAGCTGCTCTGTTACTTCATCATCATC 2021
               ||||||| ||||| || ||||||||||||||||| |||||||||||||||
ha04602   3583 ACCAGTGTGTGGCCGCCCACTCGGAGCTGCTCTGCTACTTCATCATCATC 3632
          1195   Q  C  V  A  A  H  S  E  L  L  C  Y  F  I  I  I   1210

          1051 ----+----*----+----*----+----*----+----*----+----* 1100
           347 L  N  H  M  V  T  A  S  A  A  S  L  V  L  P  V  L  363
mbg19574  2022 CTTAACCACATGGTGACAGCCTCGGCTGCCTCCCTGGTGCTGCCCGTGCT 2071
               || ||||||||||| || ||||| || | |||||||||||||||||||||
ha04602   3633 CTCAACCACATGGTCACGGCCTCCGCCGGCTCCCTGGTGCTGCCCGTGCT 3682
          1211 L  N  H  M  V  T  A  S  A  G  S  L  V  L  P  V  L  1227

          1101 ----+----*----+----*----+----*----+----*----+----* 1150
           364  V  F  L  W  A  M  L  T  I  P  R  P  S  K  R  F  W 380
mbg19574  2072 TGTGTTCCTGTGGGCCATGCTGACCATCCCGAGGCCTAGCAAGCGCTTTT 2121
                || |||||||||||||||||| | ||||||||||| ||||||||||| |
ha04602   3683 CGTCTTCCTGTGGGCCATGCTGTCGATCCCGAGGCCCAGCAAGCGCTTCT 3732
          1228  V  F  L  W  A  M  L  S  I  P  R  P  S  K  R  F  W 1244

          1151 ----+----*----+----*----+----*----+----*----+----* 1200
           381   M  T  A  I  V  F  T  E  V  M  V  V  T  K  Y  L   396
mbg19574  2122 GGATGACAGCTATCGTCTTCACTGAGGTCATGGTGGTCACCAAATACCTG 2171
               ||||||| || ||||||||||| ||| ||  |||||||  ||| ||||||
ha04602   3733 GGATGACGGCCATCGTCTTCACCGAGATCGCGGTGGTCGTCAAGTACCTG 3782
          1245   M  T  A  I  V  F  T  E  I  A  V  V  V  K  Y  L   1260

          1201 ----+----*----+----*----+----*----+----*----+----* 1250
           397 F  Q  F  G  F  F  P  W  N  S  Y  V  V  L  R  R  Y  413
mbg19574  2172 TTCCAGTTCGGCTTCTTCCCCTGGAACAGCTACGTTGTGCTGCGGCGCTA 2221
               |||||||| || |||||||||||||||||| |||| ||||||||||||||
ha04602   3783 TTCCAGTTTGGGTTCTTCCCCTGGAACAGCCACGTGGTGCTGCGGCGCTA 3832
          1261 F  Q  F  G  F  F  P  W  N  S  H  V  V  L  R  R  Y  1277

          1251 ----+----*----+----*----+----*----+----*----+----* 1300
           414  E  N  K  P  Y  F  P  P  R  I  L  G  L  E  K  T  D 430
mbg19574  2222 TGAGAACAAGCCCTACTTCCCTCCGCGAATCCTGGGCCTTGAGAAAACGG 2271
                |||||||||||||||||||| || || ||||||||||| ||||| || |
ha04602   3833 CGAGAACAAGCCCTACTTCCCGCCCCGCATCCTGGGCCTGGAGAAGACTG 3882
          1278  E  N  K  P  Y  F  P  P  R  I  L  G  L  E  K  T  D 1294

          1301 ----+----*----+----*----+----*----+----*----+----* 1350
           431   S  Y  I  K  Y  D  L  V  Q  L  M  A  L  F  F  H   446
mbg19574  2272 ACAGCTACATCAAGTATGACCTGGTGCAGCTCATGGCCCTCTTCTTCCAC 2321
               || ||||||||||||| ||||||||||||||||||||||| |||||||||
ha04602   3883 ACGGCTACATCAAGTACGACCTGGTGCAGCTCATGGCCCTTTTCTTCCAC 3932
          1295   G  Y  I  K  Y  D  L  V  Q  L  M  A  L  F  F  H   1310

          1351 ----+----*----+----*----+----*----+----*----+----* 1400
           447 R  S  Q  L  L  C  Y  G  L  W  D  H  E  E  D  R  Y  463
mbg19574  2322 CGCTCGCAGCTACTGTGTTATGGCCTCTGGGACCATGAGGAGGATCGCTA 2371
               ||||| ||||| ||||| ||||||||||||||||||||||||||      
ha04602   3933 CGCTCCCAGCTGCTGTGCTATGGCCTCTGGGACCATGAGGAGGACTCACC 3982
          1311 R  S  Q  L  L  C  Y  G  L  W  D  H  E  E  D  S  P  1327

          1401 ----+----*----+----*----+----*----+----*----+----* 1450
           464  P  K  D  H  C  R  S  S  V  K  D  R  E  A  K  E  E 480
mbg19574  2372 TCCCAAGGACCATTGCAGGAGTAGTGTGAAGGACCGGGAGGCCAAGGAAG 2421
                 ||||||| |||                  |||  |   | | |||| |
ha04602   3983 ATCCAAGGAGCAT..................GACAAGAGCGGCGAGGAGG 4014
          1328  S  K  E  H  .  .  .  .  .  .  D  K  S  G  E  E  E 1338

          1451 ----+----*----+----*----+----*----+----*----+----* 1500
           481   P  E  A  K  L  E  S  Q  S  E  T  G  T  G  H  P   496
mbg19574  2422 AGCCAGAAGCTAAGCTGGAATCGCAGTCTGAGACAGGCACTGGGCATCCC 2471
               |||  | |||                   |||   ||  | |||   || 
ha04602   4015 AGCAGGGAGCC..................GAGGAGGGGCCAGGGGTGCCT 4046
          1339   Q  G  A  .  .  .  .  .  .  E  E  G  P  G  V  P   1348

          1501 ----+----*----+----*----+----*----+----*----+----* 1550
           497 K  E  P  V  L  A  G  T  P  R  D  H  I  Q  G  K  G  513
mbg19574  2472 AAGGAGCCAGTGTTGGCCGGTACTCCCAGGGACCACATCCAAGGGAAAGG 2521
                              || |  ||  ||   |||||||| || | | ||| 
ha04602   4047 ...............GCGGCCACCACCGAAGACCACATTCAGGTGGAAGC 4081
          1349 .  .  .  .  .  A  A  T  T  E  D  H  I  Q  V  E  A  1360

          1551 ----+----*----+----*----+----*----+----*----+----* 1600
           514  S  I  R  S  K  D  V  I  Q  D  P  P  E  D  L  K  P 530
mbg19574  2522 AAGTATTAGATCCAAGGATGTTATCCAAGATCCCCCAGAGGACCTTAAGC 2571
                ||  |  || ||| ||| |  | || ||| |||| || ||| || | ||
ha04602   4082 GAGGGTCGGACCCACGGACGGGACCCCAGAACCCCAAGTGGAGCTCAGGC 4131
          1361  R  V  G  P  T  D  G  T  P  E  P  Q  V  E  L  R  P 1377

          1601 ----+----*----+----*----+----*----+----*----+----* 1650
           531   R  H  T  R  H  I  S  I  R  F  R  R  R  K  .  E   545
mbg19574  2572 CCCGGCACACGAGGCACATCAGCATACGCTTCAGGAGGCGCAAG...GAG 2618
               ||||  | ||||||| ||||||  |||| || || ||  | |||   |||
ha04602   4132 CCCGTGATACGAGGCGCATCAGTCTACGTTTTAGAAGAAGGAAGAAGGAG 4181
          1378   R  D  T  R  R  I  S  L  R  F  R  R  R  K  K  E   1393

          1651 ----+----*----+----*----+----*----+----*----+----* 1700
           546 T  P  G  P  K  G  T  A  V  M  E  T  E  .  H  E  E  561
mbg19574  2619 ACTCCAGGACCCAAAGGAACAGCAGTCATGGAGACTGAG...CACGAGGA 2665
                  |||| ||  |||||| | |||| ||| ||  |||||      |||||
ha04602   4182 GGCCCAGCACGGAAAGGAGCGGCAGCCATCGAAGCTGAGGACAGGGAGGA 4231
          1394 G  P  A  R  K  G  A  A  A  I  E  A  E  D  R  E  E  1410

          1701 ----+----*----+----*----+----*----+----*----+----* 1750
           562  G  E  G  K  E  T  T  E  R  K  R  P  R  H  T  Q  E 578
mbg19574  2666 GGGAGAAGGAAAAGAAACTACAGAGAGAAAGAGGCCGCGTCACACTCAAG 2715
                | ||| ||  | |||   | ||||      | |  | |  |     |  
ha04602   4232 AGAAGAGGGGGAGGAAGAGAAAGAGGCCCCCACGGGGAGAGAG...AAGA 4278
          1411  E  E  G  E  E  E  K  E  A  P  T  G  R  E  .  K  R 1426

          1751 ----+----*----+----*----+----*----+----*----+----* 1800
           579   K  S  K  F  R  E  R  M  K  A  A  G  R  R  L  Q   594
mbg19574  2716 AAAAATCGAAGTTTCGGGAGAGAATGAAGGCAGCTGGGCGCCGGCTGCAG 2765
                   |      | | | |  ||| | | ||| || ||||| |||||||||
ha04602   4279 GGCCAAGCCGCTCTGGAGGAAGAGTAAGGGCGGCCGGGCGGCGGCTGCAG 4328
          1427   P  S  R  S  G  G  R  V  R  A  A  G  R  R  L  Q   1442

          1801 ----+----*----+----*----+----*----+----*----+----* 1850
           595 S  F  C  V  S  L  A  Q  S  F  Y  Q  P  L  Q  R  F  611
mbg19574  2766 AGCTTCTGTGTGTCACTGGCCCAGAGCTTCTACCAACCCTTGCAGCGTTT 2815
                |||||||  |||| ||||||||| ||   || |  ||  | | ||| ||
ha04602   4329 GGCTTCTGCCTGTCCCTGGCCCAGGGCACATATCGGCCGCTACGGCGCTT 4378
          1443 G  F  C  L  S  L  A  Q  G  T  Y  R  P  L  R  R  F  1459

          1851 ----+----*----+----*----+----*----+----*----+----* 1900
           612  F  H  D  I  L  H  T  K  Y  R  A  A  T  D  V  Y  A 628
mbg19574  2816 CTTCCATGACATTCTGCACACAAAGTACCGGGCGGCCACCGACGTCTACG 2865
               |||||| ||||| |||||||| |||||||| || |||||||||||||| |
ha04602   4379 CTTCCACGACATCCTGCACACCAAGTACCGCGCAGCCACCGACGTCTATG 4428
          1460  F  H  D  I  L  H  T  K  Y  R  A  A  T  D  V  Y  A 1476

          1901 ----+----*----+----*----+----*----+----*----+----* 1950
           629   L  M  F  L  A  D  I  V  D  I  I  I  I  I  F  G   644
mbg19574  2866 CCCTCATGTTCCTGGCCGATATTGTCGACATCATCATCATCATCTTTGGT 2915
               |||||||||||||||| ||| |||||||| ||||||||||||| ||||| 
ha04602   4429 CCCTCATGTTCCTGGCTGATGTTGTCGACTTCATCATCATCATTTTTGGC 4478
          1477   L  M  F  L  A  D  V  V  D  F  I  I  I  I  F  G   1492

          1951 ----+----*----+----*----+----*----+----*----+----* 2000
           645 F  W  A  F  G  K  H  S  A  A  T  D  I  A  S  S  L  661
mbg19574  2916 TTTTGGGCTTTTGGGAAGCACTCTGCAGCCACAGACATTGCATCCTCGCT 2965
               || ||||| |||||||||||||| || |||||||||||  | ||||| ||
ha04602   4479 TTCTGGGCCTTTGGGAAGCACTCGGCGGCCACAGACATCACGTCCTCCCT 4528
          1493 F  W  A  F  G  K  H  S  A  A  T  D  I  T  S  S  L  1509

          2001 ----+----*----+----*----+----*----+----*----+----* 2050
           662  S  D  D  Q  V  P  Q  A  F  L  F  M  L  L  V  Q  F 678
mbg19574  2966 GTCAGATGACCAGGTGCCACAGGCCTTCCTGTTCATGCTGCTGGTCCAGT 3015
                ||||| |||||||| ||  |||| |||||| ||||||||||| ||||||
ha04602   4529 ATCAGACGACCAGGTACCCGAGGCTTTCCTGGTCATGCTGCTGATCCAGT 4578
          1510  S  D  D  Q  V  P  E  A  F  L  V  M  L  L  I  Q  F 1526

          2051 ----+----*----+----*----+----*----+----*----+----* 2100
           679   G  T  M  V  I  D  R  A  L  Y  L  R  K  T  V  L   694
mbg19574  3016 TTGGCACCATGGTCATCGACCGTGCCCTCTACCTGCGCAAGACTGTCCTG 3065
               |  | ||||||||  | ||||| |||||||||||||||||||| || |||
ha04602   4579 TCAGTACCATGGTGGTTGACCGCGCCCTCTACCTGCGCAAGACCGTGCTG 4628
          1527   S  T  M  V  V  D  R  A  L  Y  L  R  K  T  V  L   1542

          2101 ----+----*----+----*----+----*----+----*----+----* 2150
           695 G  K  L  A  F  Q  V  V  L  V  V  A  I  H  I  W  M  711
mbg19574  3066 GGAAAGCTGGCCTTTCAGGTGGTCCTGGTGGTGGCGATTCACATCTGGAT 3115
               || ||||||||||| |||||||  |||||| |||| || ||| | |||||
ha04602   4629 GGCAAGCTGGCCTTCCAGGTGGCGCTGGTGCTGGCCATCCACCTATGGAT 4678
          1543 G  K  L  A  F  Q  V  A  L  V  L  A  I  H  L  W  M  1559

          2151 ----+----*----+----*----+----*----+----*----+----* 2200
           712  F  F  I  L  P  A  V  T  E  R  M  F  S  Q  N  A  V 728
mbg19574  3116 GTTCTTTATCTTACCGGCTGTCACTGAGAGGATGTTCAGCCAGAATGCGG 3165
               |||||| ||| | || || ||||||||||||||||||| |||||||| ||
ha04602   4679 GTTCTTCATCCTGCCCGCCGTCACTGAGAGGATGTTCAACCAGAATGTGG 4728
          1560  F  F  I  L  P  A  V  T  E  R  M  F  N  Q  N  V  V 1576

          2201 ----+----*----+----*----+----*----+----*----+----* 2250
           729   A  Q  L  W  Y  F  V  K  C  I  Y  F  A  L  S  A   744
mbg19574  3166 TGGCACAGCTGTGGTACTTCGTCAAGTGCATTTACTTTGCCCTGTCCGCC 3215
               |||| ||||| ||||||||||| |||||||| ||||| ||||||||||||
ha04602   4729 TGGCCCAGCTCTGGTACTTCGTGAAGTGCATCTACTTCGCCCTGTCCGCC 4778
          1577   A  Q  L  W  Y  F  V  K  C  I  Y  F  A  L  S  A   1592

          2251 ----+----*----+----*----+----*----+----*----+----* 2300
           745 Y  Q  I  R  C  G  Y  P  T  R  I  L  G  N  F  L  T  761
mbg19574  3216 TACCAGATCCGCTGTGGCTACCCCACCCGTATCTTGGGCAACTTCCTCAC 3265
               |||||||||||||| |||||||||||||| ||| | ||||||||||||||
ha04602   4779 TACCAGATCCGCTGCGGCTACCCCACCCGCATCCTCGGCAACTTCCTCAC 4828
          1593 Y  Q  I  R  C  G  Y  P  T  R  I  L  G  N  F  L  T  1609

          2301 ----+----*----+----*----+----*----+----*----+----* 2350
           762  K  K  Y  N  H  L  N  L  F  L  F  Q  G  F  R  L  V 778
mbg19574  3266 CAAGAAATACAACCATCTAAACCTCTTCCTTTTCCAGGGGTTCCGTCTAG 3315
               |||||| ||||| ||||| ||||||||||| |||||||||||||| || |
ha04602   4829 CAAGAAGTACAATCATCTCAACCTCTTCCTCTTCCAGGGGTTCCGGCTGG 4878
          1610  K  K  Y  N  H  L  N  L  F  L  F  Q  G  F  R  L  V 1626

          2351 ----+----*----+----*----+----*----+----*----+----* 2400
           779   P  F  L  V  E  L  R  A  V  M  D  W  V  W  T  D   794
mbg19574  3316 TGCCGTTCCTGGTGGAGCTGCGGGCCGTCATGGACTGGGTGTGGACCGAC 3365
               ||||||||||||||||||||||||| || ||||||||||||||||| |||
ha04602   4879 TGCCGTTCCTGGTGGAGCTGCGGGCAGTGATGGACTGGGTGTGGACGGAC 4928
          1627   P  F  L  V  E  L  R  A  V  M  D  W  V  W  T  D   1642

          2401 ----+----*----+----*----+----*----+----*----+----* 2450
           795 T  T  L  S  L  S  N  W  M  C  V  E  D  I  Y  A  N  811
mbg19574  3366 ACCACGCTGTCCCTGTCCAACTGGATGTGTGTGGAAGACATCTATGCCAA 3415
               ||||||||||||||||||| ||||||||||||||| ||||||||||||||
ha04602   4929 ACCACGCTGTCCCTGTCCAGCTGGATGTGTGTGGAGGACATCTATGCCAA 4978
          1643 T  T  L  S  L  S  S  W  M  C  V  E  D  I  Y  A  N  1659

          2451 ----+----*----+----*----+----*----+----*----+----* 2500
           812  I  F  I  I  K  C  S  R  E  T  E  K  .  V  P  G  Q 827
mbg19574  3416 CATCTTTATCATCAAGTGCAGCCGAGAGACAGAGAAG...GTGCCAGGGC 3462
               |||||| |||||||| |||||||||||||||||||||      ||   ||
ha04602   4979 CATCTTCATCATCAAATGCAGCCGAGAGACAGAGAAGAAATACCCGCAGC 5028
          1660  I  F  I  I  K  C  S  R  E  T  E  K  K  Y  P  Q  P 1676

          2501 ----+----*- 2511
           828   R  G  R   830
mbg19574  3463 AGAGAGGACGG 3473
                 | ||| | |
ha04602   5029 CCAAAGGGCAG 5039
          1677   K  G  Q   1679



*--[ CDS3 ]--*
             1 ----+----*----+----*----+----*----+----*----+----* 50
             1 K  K  K  K  I  V  K  Y  G  M  G  G  L  I  I  L  F  17
mbg19574  4531 AAGAAGAAGAAAATTGTCAAGTATGGTATGGGAGGCCTCATTATCCTCTT 4580
               ||||||||||| || |||||||| || ||||| |||||||| ||||||||
ha04602   5040 AAGAAGAAGAAGATCGTCAAGTACGGCATGGGTGGCCTCATCATCCTCTT 5089
          1680 K  K  K  K  I  V  K  Y  G  M  G  G  L  I  I  L  F  1696

            51 ----+----*----+----*----+----*----+----*----+----* 100
            18  L  I  A  I  I  W  F  P  L  L  F  M  S  L  I  R  S 34
mbg19574  4581 CCTCATCGCCATCATCTGGTTCCCTCTGCTCTTCATGTCACTGATCCGCT 4630
               |||||||||||||||||||||||| |||||||||||||| ||| | ||||
ha04602   5090 CCTCATCGCCATCATCTGGTTCCCGCTGCTCTTCATGTCGCTGGTGCGCT 5139
          1697  L  I  A  I  I  W  F  P  L  L  F  M  S  L  V  R  S 1713

           101 ----+----*----+----*----+----*----+----*----+----* 150
            35   V  V  G  V  V  N  Q  P  I  D  V  T  V  T  L  K   50
mbg19574  4631 CTGTGGTCGGGGTCGTCAACCAGCCCATTGATGTCACCGTCACCCTCAAG 4680
               | ||||| ||||| |||||||||||||| |||||||||||||||||||||
ha04602   5140 CCGTGGTTGGGGTTGTCAACCAGCCCATCGATGTCACCGTCACCCTCAAG 5189
          1714   V  V  G  V  V  N  Q  P  I  D  V  T  V  T  L  K   1729

           151 ----+----*----+----*----+----*----+----*----+----* 200
            51 L  G  G  Y  E  P  L  F  T  M  S  A  Q  Q  P  S  I  67
mbg19574  4681 CTAGGCGGCTATGAGCCACTGTTCACCATGAGCGCCCAGCAGCCATCCAT 4730
               || |||||||||||||| |||||||||||||||||||||||||| |||||
ha04602   5190 CTGGGCGGCTATGAGCCGCTGTTCACCATGAGCGCCCAGCAGCCGTCCAT 5239
          1730 L  G  G  Y  E  P  L  F  T  M  S  A  Q  Q  P  S  I  1746

           201 ----+----*----+----*----+----*----+----*----+----* 250
            68  V  P  F  T  P  Q  A  Y  E  E  L  S  Q  Q  F  D  P 84
mbg19574  4731 TGTGCCATTCACACCCCAGGCCTACGAGGAGCTGTCCCAGCAGTTTGACC 4780
                 | || |||||  |||||||||| ||||||||||||| |||||||||||
ha04602   5240 CATCCCCTTCACGGCCCAGGCCTATGAGGAGCTGTCCCGGCAGTTTGACC 5289
          1747  I  P  F  T  A  Q  A  Y  E  E  L  S  R  Q  F  D  P 1763

           251 ----+----*----+----*----+----*----+----*----+----* 300
            85   Y  P  L  A  M  Q  F  I  S  Q  Y  S  P  E  D  I   100
mbg19574  4781 CCTATCCACTAGCCATGCAGTTCATTAGCCAGTACAGTCCTGAGGACATC 4830
               || | || || |||||||||||||| ||||||||||| ||||||||||||
ha04602   5290 CCCAGCCGCTGGCCATGCAGTTCATCAGCCAGTACAGCCCTGAGGACATC 5339
          1764   Q  P  L  A  M  Q  F  I  S  Q  Y  S  P  E  D  I   1779

           301 ----+----*----+----*----+----*----+----*----+----* 350
           101 V  T  A  Q  I  E  G  S  S  G  A  L  W  R  I  S  P  117
mbg19574  4831 GTCACTGCACAGATCGAGGGCAGCTCGGGGGCGCTGTGGCGCATCAGCCC 4880
               ||||| || ||||| ||||||||||| |||||||||||||||||||| ||
ha04602   5340 GTCACGGCGCAGATTGAGGGCAGCTCCGGGGCGCTGTGGCGCATCAGTCC 5389
          1780 V  T  A  Q  I  E  G  S  S  G  A  L  W  R  I  S  P  1796

           351 ----+----*----+----*----+----*----+----*----+----* 400
           118  P  S  R  A  Q  M  K  Q  E  L  Y  N  G  T  A  D  I 134
mbg19574  4881 ACCCAGCCGAGCCCAGATGAAGCAGGAGCTGTACAACGGCACAGCCGACA 4930
                |||||||| ||||||||||||| |||||| ||||||||||| |||||||
ha04602   5390 CCCCAGCCGTGCCCAGATGAAGCGGGAGCTCTACAACGGCACGGCCGACA 5439
          1797  P  S  R  A  Q  M  K  R  E  L  Y  N  G  T  A  D  I 1813

           401 ----+----*----+----*----+----*-- 432
           135   T  L  R  F  T  W  N  F  Q  R   144
mbg19574  4931 TTACACTGCGCTTTACCTGGAATTTCCAAAGG 4962
               | || |||||||| |||||||| ||||| |||
ha04602   5440 TCACCCTGCGCTTCACCTGGAACTTCCAGAGG 5471
          1814   T  L  R  F  T  W  N  F  Q  R   1823



*--[ CDS4 ]--*
             1 ----+----*----+----*----+----*----+----*----+----* 50
             1 D  L  A  K  G  G  T  V  E  Y  T  N  E  K  H  T  L  17
mbg19574  5028 GACCTGGCCAAGGGTGGCACTGTGGAGTATACTAATGAGAAGCACACCTT 5077
               |||||||| ||||| ||||||||||||||| | || ||||||||||   |
ha04602   5472 GACCTGGCGAAGGGAGGCACTGTGGAGTATGCCAACGAGAAGCACATGCT 5521
          1824 D  L  A  K  G  G  T  V  E  Y  A  N  E  K  H  M  L  1840

            51 ----+----*----+----*----+----*----+----*----+----* 100
            18  E  L  A  P  N  S  T  A  R  R  Q  L  A  Q  L  L  E 34
mbg19574  5078 GGAGCTGGCCCCCAACAGTACGGCACGAAGGCAGCTGGCCCAACTGCTCG 5127
               ||  |||||||||||||| || |||||  |||||||||||   |||||||
ha04602   5522 GGCCCTGGCCCCCAACAGCACTGCACGGCGGCAGCTGGCCAGCCTGCTCG 5571
          1841  A  L  A  P  N  S  T  A  R  R  Q  L  A  S  L  L  E 1857

           101 ----+----*----+----*----+----*----+----*----+----* 150
            35   G  R  P  D  Q  S  V  V  I  P  H  L  F  P  K  Y   50
mbg19574  5128 AGGGCAGACCTGACCAGTCAGTGGTCATTCCCCATCTCTTCCCCAAGTAC 5177
               ||||||   | |||||||| |||||||| ||| |||||||||||||||||
ha04602   5572 AGGGCACCTCGGACCAGTCTGTGGTCATCCCCAATCTCTTCCCCAAGTAC 5621
          1858   G  T  S  D  Q  S  V  V  I  P  N  L  F  P  K  Y   1873

           151 ----+----*----+----*----+----*----+----*----+----* 200
            51 I  R  A  P  N  G  P  E  A  N  P  V  K  Q  L  Q  P  67
mbg19574  5178 ATTCGTGCTCCCAATGGGCCTGAAGCCAACCCTGTGAAGCAGCTGCAGCC 5227
               || ||||| ||||| ||||| |||||||||||||||||||||||||||||
ha04602   5622 ATCCGTGCCCCCAACGGGCCCGAAGCCAACCCTGTGAAGCAGCTGCAGCC 5671
          1874 I  R  A  P  N  G  P  E  A  N  P  V  K  Q  L  Q  P  1890

           201 ----+----*----+----*----+----*----+----*----+----* 250
            68  D  E  E  E  D  Y  L  G  V  R  I  Q  L  R  R  E  Q 84
mbg19574  5228 AGATGAGGAAGAGGACTACCTTGGTGTGCGCATCCAGCTGCGGAGGGAGC 5277
                 ||||||| |  |||||||| || ||||| |||||||||||||||||||
ha04602   5672 CAATGAGGAGGCCGACTACCTCGGCGTGCGTATCCAGCTGCGGAGGGAGC 5721
          1891  N  E  E  A  D  Y  L  G  V  R  I  Q  L  R  R  E  Q 1907

           251 ----+----*----+----*----+----*----+----*----+----* 300
            85   V  G  T  G  A  S  G  E  Q  A  G  T  K  A  S  D   100
mbg19574  5278 AAGTGGGCACAGGGGCCTCTGGGGAGCAAGCGGGCACCAAGGCCTCCGAC 5327
               |    ||  | |||||| | ||                            
ha04602   5722 AG...GGTGCGGGGGCCACCGGC........................... 5741
          1908   .  G  A  G  A  T  G  .  .  .  .  .  .  .  .  .   1913

           301 ----+----*----+----*----+----*----+----*----+----* 350
           101 F  L  E  W  W  V  I  E  L  Q  D  C  K  A  D  C  N  117
mbg19574  5328 TTCCTCGAGTGGTGGGTCATCGAGCTGCAGGACTGCAAGGCTGACTGCAA 5377
               |||||||| ||||||||||||||||||||||| |||  | | ||||||||
ha04602   5742 TTCCTCGAATGGTGGGTCATCGAGCTGCAGGAGTGCCGGACCGACTGCAA 5791
          1914 F  L  E  W  W  V  I  E  L  Q  E  C  R  T  D  C  N  1930

           351 ----+----*----+----*----+----*----+----*----+----* 400
           118  L  L  P  M  V  I  F  S  D  K  V  S  P  P  S  L  G 134
mbg19574  5378 CCTGCTGCCCATGGTCATCTTCAGTGACAAGGTCAGCCCACCTAGCCTGG 5427
               |||||||||||||||||| ||||||||||||||||||||||| ||||| |
ha04602   5792 CCTGCTGCCCATGGTCATTTTCAGTGACAAGGTCAGCCCACCGAGCCTCG 5841
          1931  L  L  P  M  V  I  F  S  D  K  V  S  P  P  S  L  G 1947

           401 ----+----*----+----*----+----*----+----*----+----* 450
           135   F  L  A  G  Y  G  I  V  G  L  Y  V  S  I  V  L   150
mbg19574  5428 GCTTCCTGGCCGGCTACGGGATTGTGGGGCTGTACGTCTCCATCGTGCTG 5477
               |||||||||| |||||||| ||  ||||||||||||| ||||||||||||
ha04602   5842 GCTTCCTGGCTGGCTACGGCATCATGGGGCTGTACGTGTCCATCGTGCTG 5891
          1948   F  L  A  G  Y  G  I  M  G  L  Y  V  S  I  V  L   1963

           451 ----+----*----+----*----+----*----+----*----+----* 500
           151 V  V  G  K  F  V  R  G  F  F  S  E  I  S  H  S  I  167
mbg19574  5478 GTGGTTGGCAAGTTTGTGCGGGGCTTCTTCAGCGAGATCTCTCACTCCAT 5527
               ||  | |||||||| ||||| || ||||||||||||||||| ||||||||
ha04602   5892 GTCATCGGCAAGTTCGTGCGCGGATTCTTCAGCGAGATCTCGCACTCCAT 5941
          1964 V  I  G  K  F  V  R  G  F  F  S  E  I  S  H  S  I  1980

           501 ----+----*----+----*----+----*----+----*----+----* 550
           168  M  F  E  E  L  P  C  V  D  R  I  L  K  L  C  Q  D 184
mbg19574  5528 CATGTTCGAGGAACTGCCGTGTGTGGACCGCATCCTCAAGCTGTGCCAGG 5577
                ||||||||||| |||||||| |||||||||||||||||||| |||||||
ha04602   5942 TATGTTCGAGGAGCTGCCGTGCGTGGACCGCATCCTCAAGCTCTGCCAGG 5991
          1981  M  F  E  E  L  P  C  V  D  R  I  L  K  L  C  Q  D 1997

           551 ----+----*----+----*----+----*----+----*----+----* 600
           185   I  F  L  V  R  E  T  R  E  L  E  L  E  E  E  L   200
mbg19574  5578 ACATCTTCTTGGTGCGCGAGACCCGGGAGCTGGAGCTGGAGGAGGAGCTA 5627
               |||||||| ||||||| ||||| |||||||||||||||||||||||| | 
ha04602   5992 ACATCTTCCTGGTGCGGGAGACTCGGGAGCTGGAGCTGGAGGAGGAGTTG 6041
          1998   I  F  L  V  R  E  T  R  E  L  E  L  E  E  E  L   2013

           601 ----+----*----+----*----+----*----+----*----+----* 650
           201 Y  A  K  L  I  F  L  Y  R  S  P  E  T  M  I  K  W  217
mbg19574  5628 TACGCCAAGCTCATCTTCCTGTACCGATCTCCAGAGACCATGATTAAGTG 5677
               |||||||||||||||||||| ||||| || || ||||||||||| |||||
ha04602   6042 TACGCCAAGCTCATCTTCCTCTACCGCTCACCGGAGACCATGATCAAGTG 6091
          2014 Y  A  K  L  I  F  L  Y  R  S  P  E  T  M  I  K  W  2030

           651 ----+----*----+---- 669
           218  T  R  E  R  E  *   223
mbg19574  5678 GACACGTGAGAGGGAGTAG 5696
               ||| ||||||| |||||||
ha04602   6092 GACTCGTGAGAAGGAGTAG 6110
          2031  T  R  E  K  E  *   2036


*--[ 3'UTR ]--*
             1 ----+----*----+----*----+----*----+----*----+----* 50
mbg19574  5697 GAGCCCAGGCCCTGGGCACCGGGAATGAAGGAGCTG..CCTCAGGGCAGC 5744
                 |  | |   ||||   ||| ||  |||||||| |  |  | |||||||
ha04602   6111 ..GAGCTGCTGCTGGCGCCCGAGAGGGAAGGAGCCGGCCTGCTGGGCAGC 6158

            51 ----+----*----+----*----+----*----+----*----+----* 100
mbg19574  5745 ATGACCACCAGGGCTGGCAGAGCT....CCTTGGGTTGGAGTGTAGCTGC 5790
                || |||| ||||  ||||   ||    ||  |||    | ||   |  |
ha04602   6159 GTGGCCACAAGGGGCGGCACTCCTCAGGCCGGGGGAGCCACTGCCCCGTC 6208

           101 ----+----*----+----*----+----*----+----*----+----* 150
mbg19574  5791 CCAAGGCACGAGCTGTGACAGGTCCT...GGCCTACGTAAGCCCTGATGA 5837
               | | | | | ||||||||    ||||   ||||| | | |||||||||| 
ha04602   6209 CAAGGCCGCCAGCTGTGATGCATCCTCCCGGCCTGCCTGAGCCCTGATGC 6258

           151 ----+----*----+----*----+----*----+----*----+----* 200
mbg19574  5838 TGCTGCTGTCAAAGGGACGCTGTGTCCCTACCCATTGCGTCCCTGCCAGT 5887
               ||||| |   | | |||| ||| ||||| ||    |||||  |       
ha04602   6259 TGCTG.TCAGAGAAGGACACTGCGTCCCCACGGCCTGCGTGGC....... 6300

           201 ----+----*----+----*----+----*----+----*----+----* 250
mbg19574  5888 TGGTCTGGGGGCCTGGCTCTCCCTTCCCACAAGTACTGTAGGG....TTT 5933
                              |||  |    |||||  ||||||||| |    |||
ha04602   6301 ...............GCTGCCGTCCCCCACGTGTACTGTAGAGTTTTTTT 6335

           251 ----+----*----+----*----+----*--- 283
mbg19574  5934 TCTAAGTAAAACAT.TTTTATTTATAC...... 5959
               | ||| ||||| || ||||||||||||      
ha04602   6336 TTTAATTAAAAAATGTTTTATTTATACAAATGG 6368