Comparison of KIAA cDNA sequences between mouse and human (KIAA0003)

<<Original sequence data>>

mouse  mKIAA0003 (msh01094)     length:   4073 bp, CDS:   265 -  1830
human   KIAA0003  (hk04939)     length:   4211 bp, CDS:   274 -  1848

In this page, the longest coding region predicted by GeneMark
was assigned as CDS for each of mouse and human KIAA cDNAs.
They were colored in green.  When the CDS positions were not identical
on the aligned sequences between mouse and human cDNAs, mouse cDNA
sequence was translated based on the human CDS information.
The amino acid sequence produced here may not be identical to the
protein sequence deduced (see Description).


<<Aligned sequence information>>

----------------------------------------------------------
            region      #match  #mismatch  %diff
----------------------------------------------------------
DNA

5'UTR:     16 -   284     232       31      11.8
  CDS:    285 -  1853    1435      132      8.4
3'UTR:   1854 -  4256    1511      702      31.7

amino acid

  CDS:    285 -  1853     503       20      3.8
----------------------------------------------------------


<<Alignment>>

             1 ----+----*----+----*----+----*----+----*----+----* 50
msh01094     1 ...............GAGGAAAGGGGCTAGAATATGTACTCGCAGCTGAC 35
                              |||||||||| ||||||||||||| ||||||||||
hk04939      1 GCAGCAGATAGGGTAGAGGAAAGGGTCTAGAATATGTACACGCAGCTGAC 50

            51 ----+----*----+----*----+----*----+----*----+----* 100
msh01094    36 GCGGGCAGGCTCCACGCTGAACGGTTACACAGAGAGGAAACAATAAATCT 85
                | ||||||||||| |||||||||| ||||||||||||||||||||||||
hk04939     51 TCAGGCAGGCTCCATGCTGAACGGTCACACAGAGAGGAAACAATAAATCT 100

           101 ----+----*----+----*----+----*----+----*----+----* 150
msh01094    86 AAGCTACTATTGCAATAAATATCTCAAGTTTTAACGAAGGAAACTATCAT 135
                |||||||| ||||||||||||||||||||||||||||| |||  |||||
hk04939    101 CAGCTACTA.TGCAATAAATATCTCAAGTTTTAACGAAGAAAAACATCAT 149

           151 ----+----*----+----*----+----*----+----*----+----* 200
msh01094   136 TACAGTTAAAATTTTTTAAAGTAACGCTTTTTTAGAATAAAGCTAACAAA 185
               | |||| |||    |  ||| |       |||||||| ||||||||||||
hk04939    150 TGCAGTGAAA....TAAAAAATTTTAAAATTTTAGAACAAAGCTAACAAA 195

           201 ----+----*----+----*----+----*----+----*----+----* 250
msh01094   186 TGGCTAGTTTTCTGTGGATCTTCTTCAAACGCTTTCTTTAACGGGGAAAG 235
               ||||||||||||| ||  ||||||||||||||||||||| | ||||||||
hk04939    196 TGGCTAGTTTTCTATGATTCTTCTTCAAACGCTTTCTTTGAGGGGGAAAG 245

           251 ----+----*----+----*----+----*----+----*----+----* 300
                                           N  K  E  L  -  L  K  V
msh01094   236 AGT....CAAACAAGCAGTTTTACCTGAAATAAAGAACTAG.TTTAAAGG 280
               |||    |||||||||||||||||||||||||||||||||| |||| |||
hk04939    246 AGTCAAACAAACAAGCAGTTTTACCTGAAATAAAGAACTAGTTTTAGAGG 295
                                           N  K  E  L  V  L  E  V

           301 ----+----*----+----*----+----*----+----*----+----* 350
                 R  R  E  E  Q  A  L  Q  E  A  R  K  A  S  A  G  
msh01094   281 TCAGAAGAGAAGAGCAAGCTTTGCAGGAGGCACGGAAGGCAAGCGCTGGC 330
               |||||||| | ||||||| |||||  |||||||||||||   | ||||||
hk04939    296 TCAGAAGAAAGGAGCAAGTTTTGCGAGAGGCACGGAAGGAGTGTGCTGGC 345
                 R  R  K  E  Q  V  L  R  E  A  R  K  E  C  A  G  

           351 ----+----*----+----*----+----*----+----*----+----* 400
               S  T  M  T  V  F  L  S  F  A  F  F  A  A  I  L  T 
msh01094   331 AGTACAATGACAGTTTTCCTTTCCTTTGCATTCTTCGCTGCCATTCTGAC 380
               ||||||||||||||||||||||||||||| ||| ||||||||||||||||
hk04939    346 AGTACAATGACAGTTTTCCTTTCCTTTGCTTTCCTCGCTGCCATTCTGAC 395
               S  T  M  T  V  F  L  S  F  A  F  L  A  A  I  L  T 

           401 ----+----*----+----*----+----*----+----*----+----* 450
                H  I  G  C  S  N  Q  R  R  N  P  E  N  G  G  R  R
msh01094   381 TCACATAGGGTGCAGCAACCAGCGCCGAAATCCAGAAAACGGAGGGAGAA 430
               |||||||||||||||||| |||||||||| |||||||||| | |||||||
hk04939    396 TCACATAGGGTGCAGCAATCAGCGCCGAAGTCCAGAAAACAGTGGGAGAA 445
                H  I  G  C  S  N  Q  R  R  S  P  E  N  S  G  R  R

           451 ----+----*----+----*----+----*----+----*----+----* 500
                 Y  N  R  I  Q  H  G  Q  C  A  Y  T  F  I  L  P  
msh01094   431 GATATAACCGGATTCAACATGGGCAATGTGCCTACACTTTCATTCTTCCA 480
               ||||||||||||||||||||||||||||||||||||||||||||||||||
hk04939    446 GATATAACCGGATTCAACATGGGCAATGTGCCTACACTTTCATTCTTCCA 495
                 Y  N  R  I  Q  H  G  Q  C  A  Y  T  F  I  L  P  

           501 ----+----*----+----*----+----*----+----*----+----* 550
               E  H  D  G  N  C  R  E  S  A  T  E  Q  Y  N  T  N 
msh01094   481 GAACACGACGGGAACTGCCGTGAGAGTGCGACAGAGCAGTACAACACCAA 530
               |||||||| || ||||| ||||||||| ||||||| ||||||||||| ||
hk04939    496 GAACACGATGGCAACTGTCGTGAGAGTACGACAGACCAGTACAACACAAA 545
               E  H  D  G  N  C  R  E  S  T  T  D  Q  Y  N  T  N 

           551 ----+----*----+----*----+----*----+----*----+----* 600
                A  L  Q  R  D  A  P  H  V  E  P  D  F  S  S  Q  K
msh01094   531 CGCTCTGCAAAGGGATGCTCCACACGTGGAGCCGGATTTCTCTTCCCAGA 580
               ||||||||| || ||||||||||||||||| |||||||||||||||||||
hk04939    546 CGCTCTGCAGAGAGATGCTCCACACGTGGAACCGGATTTCTCTTCCCAGA 595
                A  L  Q  R  D  A  P  H  V  E  P  D  F  S  S  Q  K

           601 ----+----*----+----*----+----*----+----*----+----* 650
                 L  Q  H  L  E  H  V  M  E  N  Y  T  Q  W  L  Q  
msh01094   581 AACTTCAGCATCTGGAGCATGTGATGGAAAATTATACTCAGTGGCTGCAA 630
               ||||||| |||||||| |||||||||||||||||||||||||||||||||
hk04939    596 AACTTCAACATCTGGAACATGTGATGGAAAATTATACTCAGTGGCTGCAA 645
                 L  Q  H  L  E  H  V  M  E  N  Y  T  Q  W  L  Q  

           651 ----+----*----+----*----+----*----+----*----+----* 700
               K  L  E  N  Y  I  V  E  N  M  K  S  E  M  A  Q  I 
msh01094   631 AAACTTGAGAATTACATTGTGGAAAATATGAAGTCGGAGATGGCCCAGAT 680
               |||||||||||||||||||||||||| |||||||||||||||||||||||
hk04939    646 AAACTTGAGAATTACATTGTGGAAAACATGAAGTCGGAGATGGCCCAGAT 695
               K  L  E  N  Y  I  V  E  N  M  K  S  E  M  A  Q  I 

           701 ----+----*----+----*----+----*----+----*----+----* 750
                Q  Q  N  A  V  Q  N  H  T  A  T  M  L  E  I  G  T
msh01094   681 ACAACAGAATGCTGTTCAAAACCACACGGCCACCATGCTTGAGATAGGAA 730
               ||| |||||||| ||||| ||||||||||| |||||||| ||||||||||
hk04939    696 ACAGCAGAATGCAGTTCAGAACCACACGGCTACCATGCTGGAGATAGGAA 745
                Q  Q  N  A  V  Q  N  H  T  A  T  M  L  E  I  G  T

           751 ----+----*----+----*----+----*----+----*----+----* 800
                 S  L  L  S  Q  T  A  E  Q  T  R  K  L  T  D  V  
msh01094   731 CCAGTCTCTTATCTCAGACTGCAGAGCAGACCCGAAAGCTGACAGATGTT 780
               |||| ||| | ||||||||||||||||||||| |||||||||||||||||
hk04939    746 CCAGCCTCCTCTCTCAGACTGCAGAGCAGACCAGAAAGCTGACAGATGTT 795
                 S  L  L  S  Q  T  A  E  Q  T  R  K  L  T  D  V  

           801 ----+----*----+----*----+----*----+----*----+----* 850
               E  T  Q  V  L  N  Q  T  S  R  L  E  I  Q  L  L  E 
msh01094   781 GAGACCCAGGTACTAAATCAAACATCCCGACTTGAAATACAACTGCTAGA 830
               ||||||||||||||||||||||| || |||||||| ||||| ||||| ||
hk04939    796 GAGACCCAGGTACTAAATCAAACTTCTCGACTTGAGATACAGCTGCTGGA 845
               E  T  Q  V  L  N  Q  T  S  R  L  E  I  Q  L  L  E 

           851 ----+----*----+----*----+----*----+----*----+----* 900
                N  S  L  S  T  Y  K  L  E  K  Q  L  L  Q  Q  T  N
msh01094   831 GAATTCATTATCAACATACAAGCTAGAGAAGCAACTTCTCCAACAGACAA 880
               |||||||||||| || ||||||||||||||||||||||| ||||||||||
hk04939    846 GAATTCATTATCCACCTACAAGCTAGAGAAGCAACTTCTTCAACAGACAA 895
                N  S  L  S  T  Y  K  L  E  K  Q  L  L  Q  Q  T  N

           901 ----+----*----+----*----+----*----+----*----+----* 950
                 E  I  L  K  I  H  E  K  N  S  L  L  E  H  K  I  
msh01094   881 ATGAAATTCTGAAGATTCACGAAAAAAACAGTTTACTAGAGCACAAAATC 930
               |||||||  ||||||| || ||||||||||||||| |||| || ||||||
hk04939    896 ATGAAATCTTGAAGATCCATGAAAAAAACAGTTTATTAGAACATAAAATC 945
                 E  I  L  K  I  H  E  K  N  S  L  L  E  H  K  I  

           951 ----+----*----+----*----+----*----+----*----+----* 1000
               L  E  M  E  G  K  H  K  E  E  L  D  T  L  K  E  E 
msh01094   931 TTAGAAATGGAGGGAAAACACAAAGAAGAATTGGACACCTTGAAGGAGGA 980
               ||||||||||| ||||||||||| ||||| ||||||||||| ||||| ||
hk04939    946 TTAGAAATGGAAGGAAAACACAAGGAAGAGTTGGACACCTTAAAGGAAGA 995
               L  E  M  E  G  K  H  K  E  E  L  D  T  L  K  E  E 

          1001 ----+----*----+----*----+----*----+----*----+----* 1050
                K  E  N  L  Q  G  L  V  S  R  Q  T  F  I  I  Q  E
msh01094   981 GAAAGAAAACCTTCAAGGCTTGGTTTCTCGTCAGACATTCATCATCCAGG 1030
               |||||| |||||||||||||||||| ||||||| ||||  || |||||||
hk04939    996 GAAAGAGAACCTTCAAGGCTTGGTTACTCGTCAAACATATATAATCCAGG 1045
                K  E  N  L  Q  G  L  V  T  R  Q  T  Y  I  I  Q  E

          1051 ----+----*----+----*----+----*----+----*----+----* 1100
                 L  E  K  Q  L  S  R  A  T  N  N  N  S  I  L  Q  
msh01094  1031 AGTTGGAGAAGCAACTTAGTAGAGCTACCAACAACAACAGCATCCTGCAG 1080
               || |||| |||||| | |  |||||||||| |||||||||  |||| |||
hk04939   1046 AGCTGGAAAAGCAATTAAACAGAGCTACCACCAACAACAGTGTCCTTCAG 1095
                 L  E  K  Q  L  N  R  A  T  T  N  N  S  V  L  Q  

          1101 ----+----*----+----*----+----*----+----*----+----* 1150
               K  Q  Q  L  E  L  M  D  T  V  H  N  L  V  S  L  C 
msh01094  1081 AAGCAACAACTGGAGCTCATGGACACAGTTCATAACCTTGTCAGCCTTTG 1130
               ||||| ||||||||||| ||||||||||| || ||||||||||  |||||
hk04939   1096 AAGCAGCAACTGGAGCTGATGGACACAGTCCACAACCTTGTCAATCTTTG 1145
               K  Q  Q  L  E  L  M  D  T  V  H  N  L  V  N  L  C 

          1151 ----+----*----+----*----+----*----+----*----+----* 1200
                T  K  E     V  L  L  K  G  G  K  R  E  E  E  K  P
msh01094  1131 CACTAAAGAA...GTTTTGCTAAAGGGAGGAAAAAGAGAAGAAGAGAAAC 1177
               ||||||||||   ||||| |||||||||||||||||||| ||||||||||
hk04939   1146 CACTAAAGAAGGTGTTTTACTAAAGGGAGGAAAAAGAGAGGAAGAGAAAC 1195
                T  K  E  G  V  L  L  K  G  G  K  R  E  E  E  K  P

          1201 ----+----*----+----*----+----*----+----*----+----* 1250
                 F  R  D  C  A  D  V  Y  Q  A  G  F  N  K  S  G  
msh01094  1178 CATTTCGAGACTGTGCAGATGTATATCAAGCTGGTTTTAATAAAAGTGGA 1227
               ||||| ||||||||||||||||||||||||||||||||||||||||||||
hk04939   1196 CATTTAGAGACTGTGCAGATGTATATCAAGCTGGTTTTAATAAAAGTGGA 1245
                 F  R  D  C  A  D  V  Y  Q  A  G  F  N  K  S  G  

          1251 ----+----*----+----*----+----*----+----*----+----* 1300
               I  Y  T  I  Y  F  N  N  M  P  E  P  K  K  V  F  C 
msh01094  1228 ATCTACACTATTTATTTTAATAATATGCCAGAACCCAAAAAGGTATTTTG 1277
               ||||||||||||||| |||||||||||||||||||||||||||| |||||
hk04939   1246 ATCTACACTATTTATATTAATAATATGCCAGAACCCAAAAAGGTGTTTTG 1295
               I  Y  T  I  Y  I  N  N  M  P  E  P  K  K  V  F  C 

          1301 ----+----*----+----*----+----*----+----*----+----* 1350
                N  M  D  V  N  G  G  G  W  T  V  I  Q  H  R  E  D
msh01094  1278 CAATATGGATGTGAATGGGGGAGGTTGGACAGTAATACAACACCGGGAAG 1327
               |||||||||||| ||||||||||||||||| ||||||||||| || ||||
hk04939   1296 CAATATGGATGTCAATGGGGGAGGTTGGACTGTAATACAACATCGTGAAG 1345
                N  M  D  V  N  G  G  G  W  T  V  I  Q  H  R  E  D

          1351 ----+----*----+----*----+----*----+----*----+----* 1400
                 G  S  L  D  F  Q  R  G  W  K  E  Y  K  M  G  F  
msh01094  1328 ATGGAAGCCTGGATTTCCAGAGGGGCTGGAAGGAGTATAAAATGGGTTTT 1377
               ||||||| || |||||||| || ||||||||||| |||||||||||||||
hk04939   1346 ATGGAAGTCTAGATTTCCAAAGAGGCTGGAAGGAATATAAAATGGGTTTT 1395
                 G  S  L  D  F  Q  R  G  W  K  E  Y  K  M  G  F  

          1401 ----+----*----+----*----+----*----+----*----+----* 1450
               G  N  P  S  G  E  Y  W  L  G  N  E  F  I  F  A  I 
msh01094  1378 GGGAATCCCTCTGGTGAATATTGGCTTGGGAACGAGTTCATTTTTGCAAT 1427
               || |||||||| |||||||||||||| ||||| ||||| |||||||| ||
hk04939   1396 GGAAATCCCTCCGGTGAATATTGGCTGGGGAATGAGTTTATTTTTGCCAT 1445
               G  N  P  S  G  E  Y  W  L  G  N  E  F  I  F  A  I 

          1451 ----+----*----+----*----+----*----+----*----+----* 1500
                T  S  Q  R  Q  Y  M  L  R  I  E  L  M  D  W  E  G
msh01094  1428 AACCAGTCAGAGGCAGTACATGCTGAGGATTGAGCTGATGGACTGGGAAG 1477
                ||||||||||||||||||||||| || |||||| | |||||||||||||
hk04939   1446 TACCAGTCAGAGGCAGTACATGCTAAGAATTGAGTTAATGGACTGGGAAG 1495
                T  S  Q  R  Q  Y  M  L  R  I  E  L  M  D  W  E  G

          1501 ----+----*----+----*----+----*----+----*----+----* 1550
                 N  R  A  Y  S  Q  Y  D  R  F  H  I  G  N  E  K  
msh01094  1478 GGAACCGAGCCTACTCACAGTACGACAGATTCCACATAGGAAATGAAAAG 1527
               ||||||||||||| |||||||| |||||||||||||||||||||||||||
hk04939   1496 GGAACCGAGCCTATTCACAGTATGACAGATTCCACATAGGAAATGAAAAG 1545
                 N  R  A  Y  S  Q  Y  D  R  F  H  I  G  N  E  K  

          1551 ----+----*----+----*----+----*----+----*----+----* 1600
               Q  N  Y  R  L  Y  L  K  G  H  T  G  T  A  G  K  Q 
msh01094  1528 CAGAACTATAGGTTATATTTAAAAGGTCACACAGGGACAGCAGGCAAACA 1577
               || ||||||||||| ||||||||||||||||| ||||||||||| |||||
hk04939   1546 CAAAACTATAGGTTGTATTTAAAAGGTCACACTGGGACAGCAGGAAAACA 1595
               Q  N  Y  R  L  Y  L  K  G  H  T  G  T  A  G  K  Q 

          1601 ----+----*----+----*----+----*----+----*----+----* 1650
                S  S  L  I  L  H  G  A  D  F  S  T  K  D  A  D  N
msh01094  1578 GAGCAGCTTGATCTTACACGGTGCTGATTTCAGCACGAAGGATGCTGATA 1627
               ||||||| |||||||||||||||||||||||||||| || ||||||||||
hk04939   1596 GAGCAGCCTGATCTTACACGGTGCTGATTTCAGCACTAAAGATGCTGATA 1645
                S  S  L  I  L  H  G  A  D  F  S  T  K  D  A  D  N

          1651 ----+----*----+----*----+----*----+----*----+----* 1700
                 D  N  C  M  C  K  C  A  L  M  L  T  G  G  W  W  
msh01094  1628 ACGACAACTGTATGTGCAAATGCGCTCTCATGCTAACAGGAGGTTGGTGG 1677
               | |||||||||||||||||||| || |||||| |||||||||| ||||||
hk04939   1646 ATGACAACTGTATGTGCAAATGTGCCCTCATGTTAACAGGAGGATGGTGG 1695
                 D  N  C  M  C  K  C  A  L  M  L  T  G  G  W  W  

          1701 ----+----*----+----*----+----*----+----*----+----* 1750
               F  D  A  C  G  P  S  N  L  N  G  M  F  Y  T  A  G 
msh01094  1678 TTCGATGCCTGTGGCCCTTCCAATCTAAATGGAATGTTCTACACTGCGGG 1727
               || ||||| |||||||| ||||||||||||||||||||||| ||||||||
hk04939   1696 TTTGATGCTTGTGGCCCCTCCAATCTAAATGGAATGTTCTATACTGCGGG 1745
               F  D  A  C  G  P  S  N  L  N  G  M  F  Y  T  A  G 

          1751 ----+----*----+----*----+----*----+----*----+----* 1800
                Q  N  H  G  K  L  N  G  I  K  W  H  Y  F  K  G  P
msh01094  1728 ACAAAATCATGGAAAACTGAATGGGATAAAGTGGCACTACTTCAAAGGGC 1777
               |||||| |||||||||||||||||||||||||||||||||||||||||||
hk04939   1746 ACAAAACCATGGAAAACTGAATGGGATAAAGTGGCACTACTTCAAAGGGC 1795
                Q  N  H  G  K  L  N  G  I  K  W  H  Y  F  K  G  P

          1801 ----+----*----+----*----+----*----+----*----+----* 1850
                 S  Y  S  L  R  S  T  T  M  M  I  R  P  L  D  F  
msh01094  1778 CCAGTTACTCCTTACGTTCCACCACCATGATGATCCGGCCCTTGGACTTT 1827
               |||||||||||||||||||||| || |||||||| || || || || |||
hk04939   1796 CCAGTTACTCCTTACGTTCCACAACTATGATGATTCGACCTTTAGATTTT 1845
                 S  Y  S  L  R  S  T  T  M  M  I  R  P  L  D  F  

          1851 ----+----*----+----*----+----*----+----*----+----* 1900
               *                                                 
msh01094  1828 TGAAGGTGCTCTGCCAGTA.....TTAGAAAGCTGCAAAGAAAGCTGGGC 1872
               |||| | ||  || ||| |     |  ||||||  |||||||| | ||  
hk04939   1846 TGAAAGCGCAATGTCAGAAGCGATTATGAAAGCAACAAAGAAATCCGGAG 1895
               *                                                 

          1901 ----+----*----+----*----+----*----+----*----+----* 1950
msh01094  1873 ATGTTCCCAGATGAGAAGCTAGTCAGAGGCTTCAGAAACAACCAACATTG 1922
               | | | |||| |||||| ||  |   |  |||||||| ||| ||| ||||
hk04939   1896 AAGCTGCCAGGTGAGAAACTGTTTGAAAACTTCAGAAGCAAACAATATTG 1945

          1951 ----+----*----+----*----+----*----+----*----+----* 2000
msh01094  1923 TCTCCATTCCAGCAGCAAGTG...GTTATGTCATGTCACCTGGGTT.... 1965
               ||||| ||||||||  |||||   ||||||| | ||||||  ||||    
hk04939   1946 TCTCCCTTCCAGCAATAAGTGGTAGTTATGTGAAGTCACCAAGGTTCTTG 1995

          2001 ----+----*----+----*----+----*----+----*----+----* 2050
msh01094  1966 ..........TGGAGCCTTCTGAGGTCAACAGAATCGCCACTTGGG.... 2001
                         ||||||| | |||| |||  ||| || | |||||||    
hk04939   1996 ACCGTGAATCTGGAGCCGTTTGAGTTCACAAGAGTCTCTACTTGGGGTGA 2045

          2051 ----+----*----+----*----+----*----+----*----+----* 2100
msh01094  2002 ....................TCCAGAGAATGCCACTCACAATCATG..TT 2029
                                   |  ||| ||  ||||| ||  ||  |  ||
hk04939   2046 CAGTGCTCACGTGGCTCGACTATAGAAAACTCCACTGACTGTCGGGCTTT 2095

          2101 ----+----*----+----*----+----*----+----*----+----* 2150
msh01094  2030 TAAAAGGGAAGAAACTTCTCAGCTTGCTGCACTTCAAAGTGCTACTGGAT 2079
                ||||||||||||||| || |||||||||  ||||||| | |||||||| 
hk04939   2096 AAAAAGGGAAGAAACTGCTGAGCTTGCTGTGCTTCAAACTACTACTGGAC 2145

          2151 ----+----*----+----*----+----*----+----*----+----* 2200
msh01094  2080 CACATTCTGAACTTATAACATCCTGATGCTGAATGCAACTTGTTTCATGT 2129
               |  ||| || |  |||   | || |||| | |||     |  ||||||||
hk04939   2146 CTTATTTTGGAACTATGGTAGCCAGATGATAAATATGGTTAATTTCATGT 2195

          2201 ----+----*----+----*----+----*----+----*----+----* 2250
msh01094  2130 AAAA.............GCAAAAGAAGAAGAAACAGCA..AATGGGAACA 2164
               ||||               ||||  |||| | |||  |  ||| | ||||
hk04939   2196 AAAACAGAAAAAAAGAGTGAAAAAGAGAATATACATGAAGAATAGAAACA 2245

          2251 ----+----*----+----*----+----*----+----*----+----* 2300
msh01094  2165 GGCTTTCCAGAATCTGTTG...AAGATGGATTGTGGAGGTGACCTGGTAT 2211
                || | ||| ||||  |||   |||||| ||| |    ||||   ||| |
hk04939   2246 AGCCTGCCATAATCCTTTGGAAAAGATGTATTATACCAGTGAAAAGGTGT 2295

          2301 ----+----*----+----*----+----*----+----*----+----* 2350
msh01094  2212 CA.CTGTAGGAAATCCTGCTAACAA..TACATCACTGCCCAA....AAGA 2254
                |  | || | || ||| |||||||  || |    ||| |||     | |
hk04939   2296 TATATCTATGCAAACCTACTAACAAATTATACTGTTGCACAATTTTGATA 2345

          2351 ----+----*----+----*----+----*----+----*----+----* 2400
msh01094  2255 GACATAAAGAAAAGTTTTGTCTACTGAGTTGGCTAAAAGTTAGTGGAGTT 2304
                | ||  |||| ||  |||||  ||||||||| |||| |||| |||| ||
hk04939   2346 AAAATTTAGAACAGCATTGTCCTCTGAGTTGGTTAAATGTTAATGGATTT 2395

          2401 ----+----*----+----*----+----*----+----*----+----* 2450
msh01094  2305 CACCTGCCCATTTCCAGTATCATATTTACTAGCTGATTTCAGGTTTCCTG 2354
               ||   ||| | ||||||||||||| ||||||| ||||||| | || ||  
hk04939   2396 CAGAAGCCTAATTCCAGTATCATACTTACTAGTTGATTTCTGCTTACCCA 2445

          2451 ----+----*----+----*----+----*----+----*----+----* 2500
msh01094  2355 TGTTCAAATGTAAACTCTGTTCTTGTAAGCCATGATACAATATAGTACAT 2404
               | |||||||| ||| ||  || ||||||||||| ||  | | ||||||||
hk04939   2446 TCTTCAAATGAAAATTCCATTTTTGTAAGCCATAATGAACTGTAGTACAT 2495

          2501 ----+----*----+----*----+----*----+----*----+----* 2550
msh01094  2405 GGAGGATAAGAGTTGGGGGTAGAAGGTGCCTAAAGACTCTTGAGTTTCTG 2454
               |||  ||||| ||  | |||||||     ||  |      ||| |||   
hk04939   2496 GGACAATAAGTGT..GTGGTAGAAACAAACTCCATTACTCTGATTTT... 2540

          2551 ----+----*----+----*----+----*----+----*----+----* 2600
msh01094  2455 GAGATTCAGTTTTCAAAAGATATAA..AATATAATCAAGAATGGATAAAA 2502
                 ||| ||||||||| || |   ||  || ||||||||| | ||||  | 
hk04939   2541 .TGATACAGTTTTCAGAAAAAGAAATGAACATAATCAAGTAAGGATGTAT 2589

          2601 ----+----*----+----*----+----*----+----*----+----* 2650
msh01094  2503 CAGGTGAAA....ATCACACTCATGCTACAGTGTTCCTTTAC.ATGAAAT 2547
                 |||||||    | ||| | ||| |||  || ||| |||||  | ||| 
hk04939   2590 GTGGTGAAAACTTACCACCCCCATACTATGGTTTTCATTTACTCTAAAAA 2639

          2651 ----+----*----+----*----+----*----+----*----+----* 2700
msh01094  2548 TTGATTAACTGATCCACAAGAATGTTTAGAGCCTGAGTATA.TATAAAGA 2596
                ||||| | ||||  | ||  || |||| |||||||||| | |  |||||
hk04939   2640 CTGATTGAATGATATATAAATATATTTATAGCCTGAGTAAAGTTAAAAGA 2689

          2701 ----+----*----+----*----+----*----+----*----+----* 2750
msh01094  2597 CTGGAAGTGTTATCACCCAGTTCTCAAAACAATAAGCAGGCAGTTAACAT 2646
                || ||    ||||| | |||||| |||| ||||  || ||| |||| ||
hk04939   2690 ATGTAAAATATATCATCAAGTTCTTAAAATAATATACATGCATTTAATAT 2739

          2751 ----+----*----+----*----+----*----+----*----+----* 2800
msh01094  2647 TCTCATTGACAGTATGTAGGAGAGCAATATGTGG................ 2680
               |  | |||| | |||  |||| |||||||| | |                
hk04939   2740 TTCCTTTGATATTATACAGGAAAGCAATATTTTGGAGTATGTTAAGTTGA 2789

          2801 ----+----*----+----*----+----*----+----*----+----* 2850
msh01094  2681 ......AGTACTTGAGTTGGAACAGCCCATTGTACAG....ATCTTGCAT 2720
                     || |  |    |||| |||  |||| |||||      |||||||
hk04939   2790 AGTAAAAGCAAGTACTCTGGAGCAGTTCATTTTACAGTATCTACTTGCAT 2839

          2851 ----+----*----+----*----+----*----+----*----+----* 2900
msh01094  2721 GTATTTGCATATGTATGGCATTATTATTTTTAAAGTGTTCGTAGGCCTTC 2770
               || | | ||||  | |  | | |||||||| ||| | ||  |||  || |
hk04939   2840 GTGTATACATACATGTAACTTCATTATTTTAAAAATATTTTTAGAACTCC 2889

          2901 ----+----*----+----*----+----*----+----*----+----* 2950
msh01094  2771 AATTCTTCATACAGATTTTTCATGCTAATTTAATTTTTGTTAATTAACTG 2820
               ||| | |||  | | | | || ||||||||||| ||||| ||||||||||
hk04939   2890 AATAC.TCACCCTGTTATGTCTTGCTAATTTAAATTTTGCTAATTAACTG 2938

          2951 ----+----*----+----*----+----*----+----*----+----* 3000
msh01094  2821 CAATGTACTTACTAAATATATCCTACTCCAGT.TTTTTATGAGTTATACT 2869
                ||  | ||||| | ||  |  ||  |||||| | | ||  ||  | |||
hk04939   2939 AAACATGCTTACCAGATTCACACTGTTCCAGTGTCTATAAAAGAAACACT 2988

          3001 ----+----*----+----*----+----*----+----*----+----* 3050
msh01094  2870 TTAAAGTCTACAAATAATAGAAGAATTTTAAATATCATTGTACATAATAT 2919
               || ||||||| ||| |||| || |||| ||||||||||||||||||  ||
hk04939   2989 TTGAAGTCTATAAAAAATAAAATAATTATAAATATCATTGTACATAGCAT 3038

          3051 ----+----*----+----*----+----*----+----*----+----* 3100
msh01094  2920 .CTTATACCTGTCCATGCTAAACTCAATAATTGTTTAGTCTGGAATATAT 2968
                 ||||| |||   |    ||||  ||||  |  ||| ||||||||||  
hk04939   3039 GTTTATATCTGCAAA....AAACCTAATAGCTAATTAATCTGGAATATGC 3084

          3101 ----+----*----+----*----+----*----+----*----+----* 3150
msh01094  2969 GATGCTGTCCACAACTGATGACTATAAATATGATTGTTTAAAGACAGTTA 3018
                |   |||||  || ||||| | |  || |  | || | ||||| |  ||
hk04939   3085 AACATTGTCCTTAATTGATG.CAAATAACACAAATGCTCAAAGAAATCTA 3133

          3151 ----+----*----+----*----+----*----+----*----+----* 3200
msh01094  3019 CCATA.CTATTGATTAAATATATTACTCTGCATAGTTTTTCTCCTCCAGG 3067
               | ||| |  || || ||||| || | ||| |||| | |||||||| ||| 
hk04939   3134 CTATATCCCTTAATGAAATACATCATTCTTCATA.TATTTCTCCTTCAG. 3181

          3201 ----+----*----+----*----+----*----+----*----+----* 3250
msh01094  3068 ATCTGTTTCTTCAAGCAATTTCTACCTTGTAAAA..TAATGGTAGTAGAG 3115
                ||  || | | | ||||||| ||  || |||||  || |   ||  |||
hk04939   3182 .TCCATTCCCTTAGGCAATTTTTAATTTTTAAAAATTATTATCAGGGGAG 3230

          3251 ----+----*----+----*----+----*----+----*----+----* 3300
msh01094  3116 .AAAATTGACATAACTCCT...TGTACAAAAGAATTATAGAAAA...... 3155
                ||||||| || ||||  |   ||||    | |  |||| ||||      
hk04939   3231 AAAAATTGGCAAAACTATTATATGTAAGGGAAATATATACAAAAAGAAAA 3280

          3301 ----+----*----+----*----+----*----+----*----+----* 3350
msh01094  3156 ..AATTACAGTCATTTGACTAGGAAGTTTCTGATTGTTAGCTGCTATAAG 3203
                 ||| | |||||  |||||| |||  |||||| || ||| ||| |||| 
hk04939   3281 TTAATCATAGTCACCTGACTAAGAA.ATTCTGACTGCTAGTTGCCATAAA 3329

          3351 ----+----*----+----*----+----*----+----*----+----* 3400
msh01094  3204 TGCCTTAGTTAAGATGCCCCTGTGTTATAATATGTAGTAAATGAAGTTTT 3253
               |  || | |  | ||   ||| ||  ||||| | |  ||| |||| ||||
hk04939   3330 TAACTCAATGGAAATATTCCTATGGGATAATGTATTTTAAGTGAATTTTT 3379

          3401 ----+----*----+----*----+----*----+----*----+----* 3450
msh01094  3254 GGACACAGGATTCTGTGATAACCTGATGTGACTGCAGTATTCTATC...A 3300
               |               |    | ||| || |||||| |||| ||||   |
hk04939   3380 G...............GGGTGCTTGAAGTTACTGCATTATTTTATCAAGA 3414

          3451 ----+----*----+----*----+----*----+----*----+----* 3500
msh01094  3301 AGTTCTCTTTGTTGTTAAATGTTCAAGGTTATAGTAGAAAAAAAACATTC 3350
               |||  ||| ||    ||| ||| |||||||||   ||  ||| |   || 
hk04939   3415 AGTCTTCTCTGCCTGTAAGTGTCCAAGGTTATGACAG.TAAACAGTTTTT 3463

          3501 ----+----*----+----*----+----*----+----*----+----* 3550
msh01094  3351 AATCAAACACAATTTGCCATGAAAGGAGAGAACTAAATGTAGGCACCAGT 3400
               | | |||||  | |  | |||  | |||| || | ||   | ||  | | 
hk04939   3464 ATTAAAACATGAGTCACTATGGGATGAGAAAATTGAAATAAAGCTACTGG 3513

          3551 ----+----*----+----*----+----*----+----*----+----* 3600
msh01094  3401 TCTGTTTTCTCAGAGAAGGAGAAGACTTTCTGGGACGTACATGTACCAAA 3450
                |  |  ||||| | ||| |||    | |  |  | |||    ||||  |
hk04939   3514 GC.CTCCTCTCATAAAAG.AGACAGTTGTTGGCAAGGTAGCAATACC..A 3559

          3601 ----+----*----+----*----+----*----+----*----+----* 3650
msh01094  3451 ATATAAATCTTGATAACCGCAGCCACAAAGCCTTAGTGACTTTCCTCTAC 3500
                | | || |||| | ||   | |||| | |||||| ||  ||||||| | 
hk04939   3560 GTTTCAAACTTGGTGACTTGATCCACTATGCCTTAATG.GTTTCCTCCAT 3608

          3651 ----+----*----+----*----+----*----+----*----+----* 3700
msh01094  3501 CTGGTAAGACAGAGCTCTTCATGCTTTTAAGAAAA..GATTCTGAATGCT 3548
                ||  || | | |||| ||||   | |||||||||    || | || | |
hk04939   3609 TTGAGAAAATAAAGCTATTCACATTGTTAAGAAAAATACTTTTTAAAGTT 3658

          3701 ----+----*----+----*----+----*----+----*----+----* 3750
msh01094  3549 TCCCACCACATCTTTCTTATATTTATATGTGTTCATAAAGTACTATTTTG 3598
               | ||| ||  ||||| |||||||||| ||| |  ||      |  |||||
hk04939   3659 TACCATCAAGTCTTTTTTATATTTATGTGTCTGTATTCTACCCCTTTTTG 3708

          3751 ----+----*----+----*----+----*----+----*----+----* 3800
msh01094  3599 CCTTACAAGAGGTATGTGCCGACATTACAGGATTTTTCTACT.ATAGTGA 3647
               ||||||||| | ||| ||| |  |||| |  ||||||||| |  | ||| 
hk04939   3709 CCTTACAAGTGATATTTGCAGGTATTATACCATTTTTCTATTCTTGGTGG 3758

          3801 ----+----*----+----*----+----*----+----*----+----* 3850
msh01094  3648 CTCCTTCACAGCTTTCTTAAGCCTAGCCCTCTAAAAGCTTC......CTT 3691
               || ||||| |||     |||||||  || ||||||| ||||       ||
hk04939   3759 CTTCTTCATAGC..AGGTAAGCCTCTCCTTCTAAAAACTTCTCAACTGTT 3806

          3851 ----+----*----+----*----+----*----+----*----+----* 3900
msh01094  3692 CTCATTTAGATGAAAGAAAATGAGTATTTTTGTGATTCTGGTGATTGTGG 3741
                |||||||   ||||||||||||||||||                     
hk04939   3807 TTCATTTAAGGGAAAGAAAATGAGTATTT..................... 3835

          3901 ----+----*----+----*----+----*----+----*----+----* 3950
msh01094  3742 TGGTTGTTGTTGTTGTTGTTGTTGTTCCCACAGA...TGTTCGAAAACTC 3788
                           |     ||  |||||| |||||   | |   ||| |  
hk04939   3836 ............TGTCCTTTTGTGTTCCTACAGACACTTTCTTAAACCAG 3873

          3951 ----+----*----+----*----+----*----+----*----+----* 4000
msh01094  3789 ATCTTGGGTAA.....ATTGTTTTTCAATCCACATTACAAAAATAAAGCG 3833
                | |||| |||     | | |||   ||  || |||||||||| |||   
hk04939   3874 TTTTTGGATAAAGAATACTATTTCCAAACTCATATTACAAAAACAAAATA 3923

          4001 ----+----*----+----*----+----*----+----*----+----* 4050
msh01094  3834 AAACAAGGAGAAAAAAAAGCATGGAATTTACTGATTTGTTATGTGGGTTT 3883
               ||| ||  | |||| ||||||||  |||||||| |||||| | |||||||
hk04939   3924 AAATAATAAAAAAAGAAAGCATGATATTTACTGTTTTGTTGTCTGGGTTT 3973

          4051 ----+----*----+----*----+----*----+----*----+----* 4100
msh01094  3884 GAAAAATAAGATATTGTTTTCAGTTATTTATAATAAAGCAGTATAA.... 3929
               || |||| | ||||||||| || |||||||||||||| ||||||||    
hk04939   3974 GAGAAATGAAATATTGTTTCCAATTATTTATAATAAATCAGTATAAAATG 4023

          4101 ----+----*----+----*----+----*----+----*----+----* 4150
msh01094  3930 ............TGTGTACATTGTATAATGCCAACATGTGTGTAGCAATT 3967
                           | |||  ||| | |||| |  |||||| | | ||||||
hk04939   4024 TTTTATGATTGTTATGTGTATTATGTAATACGTACATGTTTATGGCAATT 4073

          4151 ----+----*----+----*----+----*----+----*----+----* 4200
msh01094  3968 TGATACGCATAGCTTTTTGCATTTAATTAATGCAGGGCAGAAAAATTAGA 4017
               | | | |      | | | | |||||||  | |||   || | |||||| 
hk04939   4074 TAACATG......TGTATTCTTTTAATTGTTTCAGAATAGGATAATTAGG 4117

          4201 ----+----*----+----*----+----*----+----*----+----* 4250
msh01094  4018 TAACTCGAACTTTGTCTTGAAGTTTCTATTTCAATAAAAGCTGTGTCATT 4067
               | | ||||| |||||||| ||  |||   |    |  | ||   ||  ||
hk04939   4118 T.ATTCGAATTTTGTCTTTAAAATTCATGTGGTTTCTATGCAAAGTTCTT 4166

          4251 ----+----*----+----*----+----*----+----*----+ 4295
msh01094  4068 TCTATG....................................... 4073
                 |||                                        
hk04939   4167 CATATCATCACAACATTATTTGATTTAAATAAAATTGAAAGTAAT 4211