Comparison of KIAA cDNA sequences between mouse and human (FLJ00067)

<<Original sequence data>>

mouse  mFLJ00067 (msh44296)     length:   3275 bp, CDS:     2 -  2677
human     (as00067)     length:   4394 bp, CDS:  1210 -  2934

In this page, the longest coding region predicted by GeneMark
was assigned as CDS for each of mouse and human KIAA cDNAs.
They were colored in green.  When the CDS positions were not identical
on the aligned sequences between mouse and human cDNAs, mouse cDNA
sequence was translated based on the human CDS information.
The amino acid sequence produced here may not be identical to the
protein sequence deduced (see Description).


<<Aligned sequence information>>

----------------------------------------------------------
            region      #match  #mismatch  %diff
----------------------------------------------------------
DNA

5'UTR:    672 -  1230     328       44      11.8
  CDS:   1231 -  2935    1362      292      17.7
3'UTR:   2936 -  4429     569      671      54.1

amino acid

  CDS:   1231 -  2935     449      119      21.0
----------------------------------------------------------


<<Alignment>>

             1 ----+----*----+----*----+----*----+----*----+----* 50
msh44296     1 .................................................. 1
                                                                 
as00067      1 CCCACGCGTCCGGCCCCAGGCTCTTTGCATAATCCTGTGGCTTCGCTGTC 50

            51 ----+----*----+----*----+----*----+----*----+----* 100
msh44296     1 .................................................. 1
                                                                 
as00067     51 TTCACCCAGCACCAGCGGACAGGGAAGGGCAGAGAAGGCCACCATGGCGA 100

           101 ----+----*----+----*----+----*----+----*----+----* 150
msh44296     1 .................................................. 1
                                                                 
as00067    101 CACTCCTCTCCCATCCGCAGCAGCGCCCTCCCTTCTTGCGCCAGGCCATC 150

           151 ----+----*----+----*----+----*----+----*----+----* 200
msh44296     1 .................................................. 1
                                                                 
as00067    151 AAGATAAGGCGCCGCAGAGTCAGAGATCTACAGGATCCCCCGCCCCAAAT 200

           201 ----+----*----+----*----+----*----+----*----+----* 250
msh44296     1 .................................................. 1
                                                                 
as00067    201 GGCCCCGGAGATCCAGCCTCCATCCCACCACTTCTCCCCCGAGCAGCGGG 250

           251 ----+----*----+----*----+----*----+----*----+----* 300
msh44296     1 .................................................. 1
                                                                 
as00067    251 CCCTGCTCTACGAGGACGCACTCTACACTGTCTTGCACCGCCTGGGTCAT 300

           301 ----+----*----+----*----+----*----+----*----+----* 350
msh44296     1 .................................................. 1
                                                                 
as00067    301 CCTGAGCCCAACCATGTGACGGAGGCCTCTGAGCTGCTGCGATACCTGCA 350

           351 ----+----*----+----*----+----*----+----*----+----* 400
msh44296     1 .................................................. 1
                                                                 
as00067    351 GGAGGCCTTCCACGTGGAGCCCGAGGAGCACCAGCAGACACTGCAGCGGG 400

           401 ----+----*----+----*----+----*----+----*----+----* 450
msh44296     1 .................................................. 1
                                                                 
as00067    401 TCAGGGAGCTTGAGAAGCCAATATTTTGTCTGAAGGCAACAGTGAAACAG 450

           451 ----+----*----+----*----+----*----+----*----+----* 500
msh44296     1 .................................................. 1
                                                                 
as00067    451 GCCAAGGGCATTCTGGGCAAAGATGTCAGTGGGTTCAGCGACCCCTACTG 500

           501 ----+----*----+----*----+----*----+----*----+----* 550
msh44296     1 .................................................. 1
                                                                 
as00067    501 CCTGCTGGGCATTGAGCAGGGGGTAGGTGTGCCAGGGGGCAGCCCCGGGT 550

           551 ----+----*----+----*----+----*----+----*----+----* 600
msh44296     1 .................................................. 1
                                                                 
as00067    551 CCCGGCATCGGCAGAAGGCTGTGGTGAGGCACACCATCCCCGAGGAGGAG 600

           601 ----+----*----+----*----+----*----+----*----+----* 650
msh44296     1 .................................................. 1
                                                                 
as00067    601 ACCCACCGCACGCAGGTCATCACCCAGACACTCAACCCCGTCTGGGACGA 650

           651 ----+----*----+----*----+----*----+----*----+----* 700
msh44296     1 .....................GGACATAGCCAATGCAAGCTTTCACCTGG 29
                                    ||||||  ||||||| |||||||| ||||
as00067    651 GACCTTCATCCTGGAGTTTGAGGACATCACCAATGCGAGCTTTCATCTGG 700

           701 ----+----*----+----*----+----*----+----*----+----* 750
msh44296    30 ACATGTGGGACCTGGACACTGTGGAGTCTGTCAGGCAGAAGCTCGGGGAG 79
               |||||||||||||||||||||||||||||||| | |||||||| ||||||
as00067    701 ACATGTGGGACCTGGACACTGTGGAGTCTGTCCGACAGAAGCTTGGGGAG 750

           751 ----+----*----+----*----+----*----+----*----+----* 800
msh44296    80 CTCACGGACCTGCACGGGCTCCGGAGGATCTTTAAAGAAGCTCGGAAGGA 129
               |||||||| ||||| ||||| || |||||||||||||| || ||||||||
as00067    751 CTCACGGATCTGCATGGGCTTCGCAGGATCTTTAAAGAGGCCCGGAAGGA 800

           801 ----+----*----+----*----+----*----+----*----+----* 850
msh44296   130 TAAAGGCCAGGACGACTTTCTGGGGAATGTGGTTCTGAGGTTGCAGGACC 179
                |||||||||||||||||||||||||| |||||||||||| |||||||||
as00067    801 CAAAGGCCAGGACGACTTTCTGGGGAACGTGGTTCTGAGGCTGCAGGACC 850

           851 ----+----*----+----*----+----*----+----*----+----* 900
msh44296   180 TGCGCTGCCGAGAGGACCAGTGGTTCCCGCTAGAGCCCTGCACAGAGACC 229
               |||||||||||||||||||||||| ||| || || ||| |||| ||||||
as00067    851 TGCGCTGCCGAGAGGACCAGTGGTACCCCCTGGAACCCCGCACTGAGACC 900

           901 ----+----*----+----*----+----*----+----*----+----* 950
msh44296   230 TACCCAGACCGCGGCCAGTGCCACCTTCAGTTCCAGTTCATTCACAAGAG 279
               ||||||||||| |||||||||||||| ||||||||  |||| || ||| |
as00067    901 TACCCAGACCGAGGCCAGTGCCACCTCCAGTTCCAACTCATCCATAAGCG 950

           951 ----+----*----+----*----+----*----+----*----+----* 1000
msh44296   280 .................................................. 280
                                                                 
as00067    951 GGTAGGTCGGGTACTGGGCCAGTGGCCATGCCCAGCTCTTGCGGCGGTCT 1000

          1001 ----+----*----+----*----+----*----+----*----+----* 1050
msh44296   280 .................................................. 280
                                                                 
as00067   1001 GCTGGGTTGCGGGGCTGGCTGCACCCAGTGTGAGGCCCTGTCTGCTCACA 1050

          1051 ----+----*----+----*----+----*----+----*----+----* 1100
msh44296   280 ..............GAGAGCCACGGCGGCCAGCCGCTCTCAGCCCAGCTA 315
                             |||||||||  ||||||||||||| ||||| |||||
as00067   1051 GAGGCCTCATTGCAGAGAGCCACTTCGGCCAGCCGCTCGCAGCCGAGCTA 1100

          1101 ----+----*----+----*----+----*----+----*----+----* 1150
msh44296   316 CACTGTACACTTTCACCTACTGCAGCAGCTGGTGTCCCATGAAGTCACAC 365
               ||| || ||| | ||||| ||||||||||| |||||||| || ||||| |
as00067   1101 CACCGTGCACCTCCACCTCCTGCAGCAGCTTGTGTCCCACGAGGTCACCC 1150

          1151 ----+----*----+----*----+----*----+----*----+----* 1200
msh44296   366 AGCAC............................................. 370
               |||||                                             
as00067   1151 AGCACGAGGTATTGCCCTCCTGGGGCTGGGCGTAGCCGGGGCTGCCCTTC 1200

          1201 ----+----*----+----*----+----*----+----*----+----* 1250
                                             Q  A  G  S  T  S  W 
msh44296   371 ..............................CAGGCCGGCAGTACCTCCTG 390
                                             ||||| || || ||||||||
as00067   1201 AGGTCCTGACACTCCTCCACCTGCTCCCCTCAGGCGGGAAGCACCTCCTG 1250
                        H  S  S  T  C  S  P  Q  A  G  S  T  S  W 

          1251 ----+----*----+----*----+----*----+----*----+----* 1300
                D  A  S  L  S  P  Q  A  V  T  I  L  F  L  H  A  T
msh44296   391 GGACGCATCACTGAGTCCCCAGGCTGTCACCATCCTCTTTCTCCACGCCA 440
               |||||  || |||||||||||||||| |||| |||||||||| |||||||
as00067   1251 GGACGGGTCGCTGAGTCCCCAGGCTGCCACCGTCCTCTTTCTGCACGCCA 1300
                D  G  S  L  S  P  Q  A  A  T  V  L  F  L  H  A  T

          1301 ----+----*----+----*----+----*----+----*----+----* 1350
                 Q  K  D  L  S  D  F  H  Q  S  M  A  Q  W  L  A  
msh44296   441 CTCAGAAGGACCTGTCGGACTTCCACCAGTCCATGGCGCAGTGGTTGGCC 490
               | ||||||||||| || ||||||||||||||||||||||||||| |||||
as00067   1301 CACAGAAGGACCTATCCGACTTCCACCAGTCCATGGCGCAGTGGCTGGCC 1350
                 Q  K  D  L  S  D  F  H  Q  S  M  A  Q  W  L  A  

          1351 ----+----*----+----*----+----*----+----*----+----* 1400
               Y  S  R  L  Y  Q  S  L  E  F  P  S  S  C  L  L  H 
msh44296   491 TACAGCCGCCTCTACCAGAGCCTGGAGTTCCCCAGCAGCTGCCTCCTGCA 540
               ||||||||||||||||||||||||||||||||||||||||||||||||||
as00067   1351 TACAGCCGCCTCTACCAGAGCCTGGAGTTCCCCAGCAGCTGCCTCCTGCA 1400
               Y  S  R  L  Y  Q  S  L  E  F  P  S  S  C  L  L  H 

          1401 ----+----*----+----*----+----*----+----*----+----* 1450
                P  I  T  S  I  E  Y  Q  W  I  Q  G  R  L  K  A  E
msh44296   541 CCCCATCACCAGCATAGAGTACCAGTGGATCCAGGGCCGACTCAAAGCAG 590
               ||||||||||||||| |||||||||||||||||||| || ||||| ||||
as00067   1401 CCCCATCACCAGCATCGAGTACCAGTGGATCCAGGGTCGGCTCAAGGCAG 1450
                P  I  T  S  I  E  Y  Q  W  I  Q  G  R  L  K  A  E

          1451 ----+----*----+----*----+----*----+----*----+----* 1500
                 Q  R  E  E  L  A  T  S  F  T  S  L  L  A  Y  G  
msh44296   591 AACAGCGGGAGGAGCTGGCCACCTCCTTCACATCCCTGTTGGCCTATGGC 640
               |||||| ||||||||||||| |||| ||||  |||||| || |||| |||
as00067   1451 AACAGCAGGAGGAGCTGGCCGCCTCATTCAGCTCCCTGCTGACCTACGGC 1500
                 Q  Q  E  E  L  A  A  S  F  S  S  L  L  T  Y  G  

          1501 ----+----*----+----*----+----*----+----*----+----* 1550
               L  S  L  I  R  K  F  R  S  V  F  P  L  S  V  S  D 
msh44296   641 CTCTCCCTTATCCGGAAGTTCCGCTCCGTCTTTCCCCTGTCTGTCTCTGA 690
               |||||||| ||||||| ||||||||| ||||| ||||| |||||||| ||
as00067   1501 CTCTCCCTCATCCGGAGGTTCCGCTCTGTCTTCCCCCTCTCTGTCTCGGA 1550
               L  S  L  I  R  R  F  R  S  V  F  P  L  S  V  S  D 

          1551 ----+----*----+----*----+----*----+----*----+----* 1600
                S  P  S  R  L  Q  S  L  L  R  V  L  V  Q  M  C  K
msh44296   691 CTCCCCATCCAGGCTGCAGTCCCTCCTCAGAGTCTTGGTCCAGATGTGCA 740
               ||||||| || |||||||||| || ||||| ||| |||| ||||||||||
as00067   1551 CTCCCCAGCCCGGCTGCAGTCTCTTCTCAGGGTCCTGGTACAGATGTGCA 1600
                S  P  A  R  L  Q  S  L  L  R  V  L  V  Q  M  C  K

          1601 ----+----*----+----*----+----*----+----*----+----* 1650
                 M  K  A  F  G  E  L  C  P  D  S  A  P  L  S  Q  
msh44296   741 AAATGAAGGCCTTTGGAGAACTGTGCCCAGACAGCGCTCCACTGTCCCAG 790
               | ||||||||||||||||||||||||||  ||| ||| ||| || |||||
as00067   1601 AGATGAAGGCCTTTGGAGAACTGTGCCCCAACACCGCCCCATTGCCCCAG 1650
                 M  K  A  F  G  E  L  C  P  N  T  A  P  L  P  Q  

          1651 ----+----*----+----*----+----*----+----*----+----* 1700
               L  V  S  E  A  L  R  M  G  T  V  E  W  F  H  L  M 
msh44296   791 CTGGTTTCTGAAGCTCTGCGGATGGGCACAGTTGAGTGGTTTCACCTGAT 840
               |||||  |||| || |||| ||  |||||   ||| ||||| ||||||| 
as00067   1651 CTGGTGACTGAGGCCCTGCAGACTGGCACCACTGAATGGTTCCACCTGAA 1700
               L  V  T  E  A  L  Q  T  G  T  T  E  W  F  H  L  K 

          1701 ----+----*----+----*----+----*----+----*----+----* 1750
                Q  Q  H  H  Q  P  -        G  I  L  E  A  G  K  A
msh44296   841 GCAGCAACACCATCAGCCCAT......GGGCATCCTGGAGGCTGGCAAGG 884
               |||||| |||||||| |||||      |||||||| |||||| |||||||
as00067   1701 GCAGCAGCACCATCAACCCATGGTGCAGGGCATCCCGGAGGCAGGCAAGG 1750
                Q  Q  H  H  Q  P  M  V  Q  G  I  P  E  A  G  K  A

          1751 ----+----*----+----*----+----*----+----*----+----* 1800
                 L  L  N  L  V  Q  D  V  M  G  D  L  Y  Q  C  R  
msh44296   885 CCTTGCTAAATCTGGTACAGGACGTCATGGGTGATCTGTACCAGTGTCGT 934
               |||||||    ||||||||||| ||||| || || ||| ||||||| |  
as00067   1751 CCTTGCTGGGCCTGGTACAGGATGTCATTGGCGACCTGCACCAGTGCCAG 1800
                 L  L  G  L  V  Q  D  V  I  G  D  L  H  Q  C  Q  

          1801 ----+----*----+----*----+----*----+----*----+----* 1850
               R  T  W  N  K  I  F  H  N  V  L  K  I  D  L  F  S 
msh44296   935 CGCACATGGAACAAGATTTTCCACAATGTCCTCAAGATAGACCTGTTCTC 984
               ||||||||| ||||||| |||||||||  |||||||||  |||| |||||
as00067   1801 CGCACATGGGACAAGATCTTCCACAATACCCTCAAGATCCACCTCTTCTC 1850
               R  T  W  D  K  I  F  H  N  T  L  K  I  H  L  F  S 

          1851 ----+----*----+----*----+----*----+----*----+----* 1900
                M  A  F  L  E  L  Q  W  L  V  A  K  R  V  Q  D  H
msh44296   985 CATGGCCTTCCTGGAACTGCAGTGGCTGGTGGCCAAGAGGGTACAGGACC 1034
               |||||| |||| ||| ||||||||||||||||||||| |||| |||||||
as00067   1851 CATGGCTTTCCGGGAGCTGCAGTGGCTGGTGGCCAAGCGGGTGCAGGACC 1900
                M  A  F  R  E  L  Q  W  L  V  A  K  R  V  Q  D  H

          1901 ----+----*----+----*----+----*----+----*----+----* 1950
                 T  V  A  A  G  N  L  V  S  P  D  I  G  E  S  L  
msh44296  1035 ACACGGTGGCGGCTGGCAACCTTGTTTCTCCAGATATTGGAGAGAGTCTG 1084
               |||||  ||  |  ||  |  | || || ||||| || || |||||||||
as00067   1901 ACACGACGGTTGTGGGTGATGTAGTGTCCCCAGAGATGGGCGAGAGTCTG 1950
                 T  T  V  V  G  D  V  V  S  P  E  M  G  E  S  L  

          1951 ----+----*----+----*----+----*----+----*----+----* 2000
               F  Q  L  Y  V  S  L  K  E  L  C  Q  L  G  P  V  P 
msh44296  1085 TTTCAGCTGTATGTCAGCCTGAAGGAGCTCTGCCAGCTGGGCCCTGTCCC 1134
               || ||||| ||  ||||||| |||||||||||||||||| ||     | |
as00067   1951 TTCCAGCTCTACATCAGCCTCAAGGAGCTCTGCCAGCTGCGCATGAGCTC 2000
               F  Q  L  Y  I  S  L  K  E  L  C  Q  L  R  M  S  S 

          2001 ----+----*----+----*----+----*----+----*----+----* 2050
                S  D  S  R  E  V  L  A  L  D  G  F  H  R  W  F  Q
msh44296  1135 CTCAGACAGCCGTGAAGTCCTGGCCCTGGATGGCTTCCACCGCTGGTTTC 1184
               |||||| ||   || ||||||||||||||||   |||||||||||||| |
as00067   2001 CTCAGAGAGGGATGGAGTCCTGGCCCTGGATAATTTCCACCGCTGGTTCC 2050
                S  E  R  D  G  V  L  A  L  D  N  F  H  R  W  F  Q

          2051 ----+----*----+----*----+----*----+----*----+----* 2100
                 P  A  I  P  S  W  L  Q  K  T  Y  S  V  A  L  E  
msh44296  1185 AGCCAGCCATCCCTTCCTGGCTGCAGAAGACTTACAGTGTCGCTCTGGAG 1234
               |||| |||||||| ||||||||||||||||| ||||  |  || |||| |
as00067   2051 AGCCGGCCATCCCCTCCTGGCTGCAGAAGACGTACAACGAGGCCCTGGCG 2100
                 P  A  I  P  S  W  L  Q  K  T  Y  N  E  A  L  A  

          2101 ----+----*----+----*----+----*----+----*----+----* 2150
               R  V  Q  R  A  V  Q  M  D  T  L  V  P  L  G  E  L 
msh44296  1235 CGGGTGCAGCGTGCCGTACAGATGGACACGCTGGTACCCCTGGGCGAACT 1284
               ||||||||||| || || ||||||||   |||||| |||||||| |||||
as00067   2101 CGGGTGCAGCGCGCTGTGCAGATGGATGAGCTGGTGCCCCTGGGTGAACT 2150
               R  V  Q  R  A  V  Q  M  D  E  L  V  P  L  G  E  L 

          2151 ----+----*----+----*----+----*----+----*----+----* 2200
                T  K  H  S  T  S  A  V  D  L  S  T  C  F  A  Q  I
msh44296  1285 GACCAAGCACAGCACTTCTGCCGTGGATCTGTCTACCTGCTTTGCCCAAA 1334
               ||||||||||||||| || || |||||||| || |||||||||||||| |
as00067   2151 GACCAAGCACAGCACATCAGCGGTGGATCTATCCACCTGCTTTGCCCAGA 2200
                T  K  H  S  T  S  A  V  D  L  S  T  C  F  A  Q  I

          2201 ----+----*----+----*----+----*----+----*----+----* 2250
                 S  H  T  A  R  Q  L  D  W  P  D  P  E  E  A  F  
msh44296  1335 TTAGCCACACTGCCCGGCAGCTGGACTGGCCAGACCCAGAGGAGGCCTTC 1384
               | ||||||||||||||||||||||||||||||||||||||||||||||||
as00067   2201 TCAGCCACACTGCCCGGCAGCTGGACTGGCCAGACCCAGAGGAGGCCTTC 2250
                 S  H  T  A  R  Q  L  D  W  P  D  P  E  E  A  F  

          2251 ----+----*----+----*----+----*----+----*----+----* 2300
               M  I  T  V  K  F  V  E  D  T  C  R  L  A  L  V  Y 
msh44296  1385 ATGATCACTGTCAAGTTCGTGGAGGACACGTGCCGGCTGGCCCTGGTCTA 1434
               ||||| || |||||||| ||||||||||| || || ||||||||||| ||
as00067   2251 ATGATTACCGTCAAGTTTGTGGAGGACACCTGTCGCCTGGCCCTGGTGTA 2300
               M  I  T  V  K  F  V  E  D  T  C  R  L  A  L  V  Y 

          2301 ----+----*----+----*----+----*----+----*----+----* 2350
                C  S  L  I  K  A  R  A  R  E  L  S  A  V  Q  K  D
msh44296  1435 CTGTAGCCTTATAAAGGCCCGGGCCCGAGAGCTGTCTGCAGTTCAGAAGG 1484
               ||| ||||||||||||||||||||||| ||||| ||| |||  |||||||
as00067   2301 CTGCAGCCTTATAAAGGCCCGGGCCCGCGAGCTCTCTTCAGGCCAGAAGG 2350
                C  S  L  I  K  A  R  A  R  E  L  S  S  G  Q  K  D

          2351 ----+----*----+----*----+----*----+----*----+----* 2400
                 Q  S  Q  A  A  D  M  L  C  V  V  V  N  N  M  E  
msh44296  1485 ACCAGAGCCAGGCAGCTGACATGCTGTGTGTGGTGGTAAATAACATGGAG 1534
               ||||  ||||||||||  ||||||||||||||||||| ||| ||||||||
as00067   2351 ACCAAGGCCAGGCAGCCAACATGCTGTGTGTGGTGGTGAATGACATGGAG 2400
                 Q  G  Q  A  A  N  M  L  C  V  V  V  N  D  M  E  

          2401 ----+----*----+----*----+----*----+----*----+----* 2450
               Q  L  R  L  I  I  D  K  L  P  T  Q  L  A  W  E  A 
msh44296  1535 CAACTACGGTTGATCATCGACAAGCTACCCACTCAGCTGGCATGGGAGGC 1584
               || || ||| || | |||| |||| | ||| | |||||||||||||||||
as00067   2401 CAGCTGCGGCTGGTGATCGGCAAGTTGCCCGCCCAGCTGGCATGGGAGGC 2450
               Q  L  R  L  V  I  G  K  L  P  A  Q  L  A  W  E  A 

          2451 ----+----*----+----*----+----*----+----*----+----* 2500
                L  E  Q  R  V  G  A  V  L  E  E  G  Q  L  Q  N  T
msh44296  1585 ATTGGAGCAGCGGGTCGGGGCCGTGTTGGAGGAGGGGCAGCTGCAGAACA 1634
                 ||||||||||||| ||||||||| ||||| ||||||||||||||||||
as00067   2451 CCTGGAGCAGCGGGTAGGGGCCGTGCTGGAGCAGGGGCAGCTGCAGAACA 2500
                L  E  Q  R  V  G  A  V  L  E  Q  G  Q  L  Q  N  T

          2501 ----+----*----+----*----+----*----+----*----+----* 2550
                 L  H  A  Q  L  Q  G  A  L  A  G  L  G  H  E  I  
msh44296  1635 CGTTACATGCTCAGCTGCAGGGCGCCTTGGCGGGGCTGGGCCATGAGATC 1684
               || | ||||| ||||||||| ||||  |||| ||||||||||||||||||
as00067   2501 CGCTGCATGCCCAGCTGCAGAGCGCGCTGGCCGGGCTGGGCCATGAGATC 2550
                 L  H  A  Q  L  Q  S  A  L  A  G  L  G  H  E  I  

          2551 ----+----*----+----*----+----*----+----*----+----* 2600
               R  T  G  V  R  T  L  A  E  Q  L  E  V  G  I  A  T 
msh44296  1685 CGTACTGGTGTCCGTACCCTGGCAGAGCAGTTGGAGGTGGGCATTGCCAC 1734
               || ||||| ||||| |||||||| |||||||||||||||||||| |||| 
as00067   2551 CGCACTGGCGTCCGCACCCTGGCCGAGCAGTTGGAGGTGGGCATCGCCAA 2600
               R  T  G  V  R  T  L  A  E  Q  L  E  V  G  I  A  K 

          2601 ----+----*----+----*----+----*----+----*----+----* 2650
                H  I  Q  K  L  I  G  V  K  E  S  V  L  P  E  D  A
msh44296  1735 ACACATCCAGAAACTCATTGGCGTCAAGGAGTCTGTTCTGCCCGAGGATG 1784
                ||||||||||||||  | ||||||| ||||||||| ||||| |||||||
as00067   2601 GCACATCCAGAAACTGGTGGGCGTCAGGGAGTCTGTCCTGCCTGAGGATG 2650
                H  I  Q  K  L  V  G  V  R  E  S  V  L  P  E  D  A

          2651 ----+----*----+----*----+----*----+----*----+----* 2700
                 I  L  P  L  M  K  F  L  E  V  K  L  C  Y  M  N  
msh44296  1785 CCATTCTGCCCCTGATGAAATTCTTGGAGGTGAAGCTTTGCTACATGAAC 1834
               ||||||||||||||||||| ||| |||||||| |||||||||||||||||
as00067   2651 CCATTCTGCCCCTGATGAAGTTCCTGGAGGTGGAGCTTTGCTACATGAAC 2700
                 I  L  P  L  M  K  F  L  E  V  E  L  C  Y  M  N  

          2701 ----+----*----+----*----+----*----+----*----+----* 2750
               T  N  L  V  Q  E  N  F  S  S  L        D  S  V  V 
msh44296  1835 ACCAACCTGGTCCAGGAGAACTTCAGCAGCCTT....CTGACTCTGTTGT 1880
               |||||| |||| |||||||||||||||||   |    | | | |||   |
as00067   2701 ACCAACTTGGTGCAGGAGAACTTCAGCAGGTCTGTAGCAGCCCCTGCCCT 2750
               T  N  L  V  Q  E  N  F  S  R  S  V  A  A  P  A  L 

          2751 ----+----*----+----*----+----*----+----*----+----* 2800
                D  P  H  A  Y  -                                 
msh44296  1881 GGACCCACACGCTTACT................................. 1897
               || | | | |||   ||                                 
as00067   2751 GGGCGCGCTCGCCACCTCCCCCTCTCCTCCCACGGCCCCTCCTTCCTGGG 2800
                G  A  L  A  T  S  P  S  P  P  T  A  P  P  S  W  A

          2801 ----+----*----+----*----+----*----+----*----+----* 2850
                          V  L  V  E  V  A  S  S  Q  R  S  S  S  
msh44296  1898 ...........GTGCTGGTGGAGGTGGCTTCTTCCCAGCGTAGCTCGTCC 1936
                          || |||  ||  || ||  |  |    |   || ||  |
as00067   2801 CCCCAGCATCCGTTCTGAGGGGTGTAGCAGCCCCTGCCCTGGGCGCGCTC 2850
                 P  A  S  V  L  R  G  V  A  A  P  A  L  G  A  L  

          2851 ----+----*----+----*----+----*----+----*----+----* 2900
               L  A  S  G  R  L  K  V  A  L  R   T  W  R  S  A  S
msh44296  1937 CTGGCTTCTGGCAGGCTGAAGGTCGCCCTTCAGAACCTGGAGGTCTGCTT 1986
                   | ||   |   |        |||| ||    |||||   || || |
as00067   2851 GCCACCTCCCCCTCTCCTCCCACGGCCCCTC.TTTCCTGGGCCTCAGCAT 2899
               A  T  S  P  S  P  P  T  A  P  L   S  W  A  S  A  S

          2901 ----+----*----+----*----+----*----+----*----+----* 2950
                 T  L  R  A     G  L  P  P  E  A                 
msh44296  1987 CCACGCTGAGGGCT.GTGGTCTGCCACCAGAGGCCCTGCACACAGACACC 2035
               ||   ||||||| | | |||    | |    |   |  | |  |      
as00067   2900 CCGTTCTGAGGGGTGGAGGTGAATCCCTTTGGTGACCTCCCTTATGGGTG 2949
                 V  L  R  G  G  G  E  S  L  W  *                 

          2951 ----+----*----+----*----+----*----+----*----+----* 3000
msh44296  2036 TTCCAGGCTCTGC.AGAACGACCTGGAGCTGCAGGCGGCCTCCAGCCGGG 2084
                  |||||||||| |   |   ||| ||   | ||  || | |  |    
as00067   2950 CAACAGGCTCTGCTAAGGCTCGCTGCAGGGACGGGGAGCTTGCCACTTCT 2999

          3001 ----+----*----+----*----+----*----+----*----+----* 3050
msh44296  2085 AGCTTATCCAGAAGTACTTCTGCAGCCGAATCCAGCAGCAGGCCGAAACC 2134
                     |||  |  | || ||||       |    | |   ||    |  
as00067   3000 CAGGCCTCCCCATTTCCTGCTGC......CTAGCTCTGAGTGCTTTGAGT 3043

          3051 ----+----*----+----*----+----*----+----*----+----* 3100
msh44296  2135 ACTTCTGAGAGGCTGGGCGCAGTCACCGTCAAGGTCTCCTACCGCGCCTC 2184
                ||||        | |||  || ||   || |  | ||||    | ||| 
as00067   3044 TCTTC......CTTAGGCTGAGCCAAAATCTACCTGTCCT..TTCTCCTT 3085

          3101 ----+----*----+----*----+----*----+----*----+----* 3150
msh44296  2185 TGAGCAGAGGCTTCGCGTGGAACTGCTCAGTGCTTCTAGCCTGCTGCCCC 2234
               |   |  |||| |  | |           |  || |  ||||||| | | 
as00067   3086 TAGTCCCAGGCCTGTCTT........CAGGGCCTCCCTGCCTGCTCCTCT 3127

          3151 ----+----*----+----*----+----*----+----*----+----* 3200
msh44296  2235 TGGACTCCAATGGTTCCAGTGACCCCTTTGTTCAGTTGACACTGGAACCC 2284
                   |    | | | |  |||  ||||     | ||  |    | |   |
as00067   3128 GCCCCCATTAGGTTACAGGTGGTCCCTGCCCACGGTGCAGGGAGAACTTC 3177

          3201 ----+----*----+----*----+----*----+----*----+----* 3250
msh44296  2285 AGACATGAATTCCCTGAAGTGGCCCCCCGGGAGACCCAGAAGCACAAGAA 2334
                  |  |  |  | |  ||  |   ||| | |  ||  | | ||   |  
as00067   3178 TAGCTAGGGTGTCTTATAGCAGGGACCCAGCATGCCAGGCAACAGGCGGT 3227

          3251 ----+----*----+----*----+----*----+----*----+----* 3300
msh44296  2335 GGAACTTCACCCACTCTTTGATGAGACCTTTGAATTCCTGGTGCCTGCTG 2384
               || | | || |   |      ||   |   ||  |||    | |    | 
as00067   3228 GG.AGTCCAGCACATGGGCTCTGGACCACCTGGGTTCAGACTCCGGCTTC 3276

          3301 ----+----*----+----*----+----*----+----*----+----* 3350
msh44296  2385 AGCCTTGCCAAAAAGCCTGGGCATGCCTCCTGCTCACTGTGCTGGACCAC 2434
               | |  |||  |  ||||| ||       |  | || | |||| |      
as00067   3277 ATCACTGCTTAGCAGCCTTGG......GCAAGTTCCCGGTGCAGATGAGA 3320

          3351 ----+----*----+----*----+----*----+----*----+----* 3400
msh44296  2435 GACAGACTGGGAGCAGACGACCTGGAGGGAGAGGCCTTCTTACCGCTCTG 2484
               |     ||||   | |  ||     |    | | | |       |||  |
as00067   3321 GTAGTTCTGGCTTCTGTTGAGGATTA.AATGGGTCATGTGAGGTGCTGAG 3369

          3401 ----+----*----+----*----+----*----+----*----+----* 3450
msh44296  2485 CAGGGTACCTGGACTGACGGACTGTGCAGAGCC.....GGGCGAAGCACC 2529
               |  |||  | | ||  | | | ||  ||| | |     | |  |   || 
as00067   3370 CGTGGTGTCAGTACAAAGGAAGTGGTCAGTGACTATTAGCGGTAGTGACG 3419

          3451 ----+----*----+----*----+----*----+----*----+----* 3500
msh44296  2530 TCAAATGCGCCTGCCTCTCACATACC........................ 2555
                  |    | | ||  | ||| | ||                        
as00067   3420 ATGATGATGACAGCGCCCCACCTCCCTGTGGAATCTGGGCCCTGAAGAGG 3469

          3501 ----+----*----+----*----+----*----+----*----+----* 3550
msh44296  2556 .....................CTGCCCCCAACGGGGACCCAATTCTGCGG 2584
                                     | || ||  | | |    ||  | | | 
as00067   3470 CCGCAGCCCTCCTAGAGTTCTGTTCCACCCCCAGAGGTATAAGCCAGGGC 3519

          3551 ----+----*----+----*----+----*----+----*----+----* 3600
msh44296  2585 CTGTTGGAGAGCCGGAAGGGGGATCGCGAGGCCCAGGCCTTTGTAAAGCT 2634
               |  | ||||  | |        ||  |     ||| |  | | ||    |
as00067   3520 CCCTCGGAGCCCGGCCGTTTTTATTTCTTAAACCATGTATATATATTTTT 3569

          3601 ----+----*----+----*----+----*----+----*----+----* 3650
msh44296  2635 GAGGAGGCAGAGAGCCAAGCAGGC.....CTCCCAACATGCCCCGTGACA 2679
                |       ||||   |  |  ||     | ||||   ||    |     
as00067   3570 TATTTTTTTGAGACGGAGTCTCGCTCTGTCGCCCAGGCTGGAGTGCAGTG 3619

          3651 ----+----*----+----*----+----*----+----*----+----* 3700
msh44296  2680 GGGCGGTGGCTGCTGAAGCAAACGTTTGGGGTCTTTCTTGTCACAGGGAA 2729
               | ||| |  | ||| |    ||| |  |    | | |  ||   || || 
as00067   3620 GCGCGATCTCAGCTCACTGCAACCTCCG....CCTCCGGGTTCAAGTGAT 3665

          3701 ----+----*----+----*----+----*----+----*----+----* 3750
msh44296  2730 GCCCTTCATC......CTGTGTAACACAGGGAGAACCAGCAGGGGGGGA. 2772
                |||| | ||      ||| |||||   |     |   ||| |     | 
as00067   3666 TCCCTGCCTCAGCCTGCTGAGTAACTGGGACTACAGGTGCATGCCATTAT 3715

          3751 ----+----*----+----*----+----*----+----*----+----* 3800
msh44296  2773 GGGGGCCTCAGAGGTCTTGACAGAACAGCTCTTGGTGATGACCGTGGCCA 2822
               |   | || |  | | ||   |||     ||||| |      |  |||||
as00067   3716 GCCTGGCTAATTGTTTTTTTGAGATGGAGTCTTGCTCTGTTGCCAGGCCA 3765

          3801 ----+----*----+----*----+----*----+----*----+----* 3850
msh44296  2823 G.GTTTTATCCTTTCTTGTGCCCTGGCTGTGCTCTCTGCCATATTGTAGC 2871
               | ||    |  |    | |   ||  |||    ||| |||   | |   |
as00067   3766 GAGTGCAGTGGTGCAATCTTGGCTCACTGCAACCTCCGCCTCCTGGGTTC 3815

          3851 ----+----*----+----*----+----*----+----*----+----* 3900
msh44296  2872 ACAGAGGAACATCTCCTGTCTCTGCTCCC..................... 2900
                   | |    ||||||| ||| ||  ||                     
as00067   3816 ....AAGCGATTCTCCTGCCTCAGCCTCCCAAGTAGCTGGGACTACAGGC 3861

          3901 ----+----*----+----*----+----*----+----*----+----* 3950
msh44296  2901 .TGCACCCCCGTCCCCCCAGAAACGCTGCTTTGGTAGGATAG........ 2941
                 |  || |  | ||     ||    ||  ||  ||| | ||        
as00067   3862 ACGTGCCACAATGCCTGGCTAATTTTTGTATTTTTAGTAGAGACGGGGTT 3911

          3951 ----+----*----+----*----+----*----+----*----+----* 4000
msh44296  2942 ....CATAGCGTCTAGGGCCATCTGCCTTTTGGAACTGCCAGTTCCAGCT 2987
                   |||   | | |||    |||   | |    ||  || | |||| | 
as00067   3912 TCACCATGTTGGCCAGGATGGTCTCAATCTCTTGACCTCCTGATCCACCC 3961

          4001 ----+----*----+----*----+----*----+----*----+----* 4050
msh44296  2988 ACTGGGGTGGGGCCAGACCACTGGGATCAGAGCTGTG.GCCACAACTTCA 3036
               ||    |     ||| |   ||||||| | ||  ||| ||||| ||  | 
as00067   3962 AC.CTCGACCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCACGCCC 4010

          4051 ----+----*----+----*----+----*----+----*----+----* 4100
msh44296  3037 GAGAGAGAGGACTGTCAAA............................... 3055
               |     |     | | |||                               
as00067   4011 GGCCTGGTTCTGTCTTAAATCATGGTTGTCACTGGGGGCCTGGCCTCCTC 4060

          4101 ----+----*----+----*----+----*----+----*----+----* 4150
msh44296  3056 ....................GGCATCCTGGTGCAGGACGGGGGTGG..CC 3083
                                    |||| | | |   |  | |||| ||   |
as00067   4061 CCTGTCTCCAGCCTTGTTTGTGCATTCCGTTAGTGTGCTGGGGAGGGTTC 4110

          4151 ----+----*----+----*----+----*----+----*----+----* 4200
msh44296  3084 ATCACTGAGG........................................ 3093
                |||||||||                                        
as00067   4111 CTCACTGAGGTTGAGAGGTGTGTTGGATAGGACTGATCCCACCTGCCCCT 4160

          4201 ----+----*----+----*----+----*----+----*----+----* 4250
msh44296  3094 .....................................GAGGTGGGAAGAA 3106
                                                    | ||||||| |  
as00067   4161 TGCTGGTCCTGTACCCACCCTCTCCCCAGCCTCACCTGGGGTGGGAGGCG 4210

          4251 ----+----*----+----*----+----*----+----*----+----* 4300
msh44296  3107 GAGACTCTGGCT..CTGAGGTGAGAGCTGGCAGGAAGCTGGGTAGGCACA 3154
               |||     || |    | ||   |||||||   |  |||||||       
as00067   4211 GAGGGGGAGGTTGGGGGTGGGAGGAGCTGGGGTGGGGCTGGGTCACTGAG 4260

          4301 ----+----*----+----*----+----*----+----*----+----* 4350
msh44296  3155 GATGGACAGCCGCAGAAAGGGAGTTTGGCT.......GTCTACAGAATAG 3197
               |  |  |        | || |  ||||  |         | || |    |
as00067   4261 GCCGCCCCTTTCTCAAGAGCGTCTTTGCTTCCTCCTCCGCCACCGCCCTG 4310

          4351 ----+----*----+----*----+----*----+----*----+----* 4400
msh44296  3198 CCTTCCTCCCTCAG.ATAGCACCTGCCACCTAGAAATGTTCTCAGAGAAA 3246
               || | ||   || | ||  | || ||| |     |   | || |      
as00067   4311 CCGTGCTGTGTCCGCATCCCTCCGGCCCCGCCCCAGCCTCCTGACCCTGC 4360

          4401 ----+----*----+----*----+----*----+-- 4437
msh44296  3247 ATAAAAATAAAACATTATCCACTGCCCTC........ 3275
                    |   | ||| |   ||| |  ||         
as00067   4361 TCTGGACCCACACACT...CACAGTGCTGGTGGAGGC 4394