Comparison of KIAA cDNA sequences between mouse and human (FLJ00087)

<<Original sequence data>>

mouse  mFLJ00087 (msh08123)     length:   3219 bp, CDS:     1 -  3165
human     (as00087)     length:   4287 bp, CDS:  1339 -  3363

In this page, the longest coding region predicted by GeneMark
was assigned as CDS for each of mouse and human KIAA cDNAs.
They were colored in green.  When the CDS positions were not identical
on the aligned sequences between mouse and human cDNAs, mouse cDNA
sequence was translated based on the human CDS information.
The amino acid sequence produced here may not be identical to the
protein sequence deduced (see Description).


<<Aligned sequence information>>

----------------------------------------------------------
            region      #match  #mismatch  %diff
----------------------------------------------------------
DNA

5'UTR:      3 -  1509     733      190      20.6
  CDS:   1510 -  3456    1598      340      17.5
3'UTR:   3457 -  4281     182       83      31.3

amino acid

  CDS:   1510 -  3456     548      101      15.6
----------------------------------------------------------


<<Alignment>>

             1 ----+----*----+----*----+----*----+----*----+----* 50
msh08123     1 ..CTGGCCAGGAAATCAAGCAGAGACTCCTCC...GCCATGGACCCACCG 45
                 |||| ||||||| |||| |||  | || ||    ||||||||||||||
as00087      1 GGCTGGTCAGGAAACCAAGGAGACCCCCCCCCCCAACCATGGACCCACCG 50

            51 ----+----*----+----*----+----*----+----*----+----* 100
msh08123    46 TTGCAGAGTGAAGAAGATTCCCAAACGCAGCCCTCGCTGCCCTCTCCGCT 95
               | ||  ||          |||||||| |||||| |     ||||||||||
as00087     51 TCGCCAAG...CCGGACCTCCCAAACCCAGCCCACAGCCACCTCTCCGCT 97

           101 ----+----*----+----*----+----*----+----*----+----* 150
msh08123    96 GACATCCTACCGCTGGCACACTGGAGGCAGTGGGGAAAAGGCTGCTGGAG 145
               ||| ||||||||||||||||| || ||| ||||||| ||||| |||||||
as00087     98 GACTTCCTACCGCTGGCACACAGGGGGCGGTGGGGAGAAGGCGGCTGGAG 147

           151 ----+----*----+----*----+----*----+----*----+----* 200
msh08123   146 GGTTCCGCTGGGGCCGCTTTGCAGGCTGGGGAAGGGCCCTGAGCCATCAG 195
               |||||||||||||||||||||| |||||||| |||||||||||||| |||
as00087    148 GGTTCCGCTGGGGCCGCTTTGCTGGCTGGGGCAGGGCCCTGAGCCACCAG 197

           201 ----+----*----+----*----+----*----+----*----+----* 250
msh08123   196 GAGCCCATGGTGAACAGCCAGCCAGCCCCCAGATCTCTGTTTCGTCGGGT 245
               ||||||||||| | || ||||||||||||  | ||  | || ||||||||
as00087    198 GAGCCCATGGTCAGCACCCAGCCAGCCCCTCGCTCGATATTCCGTCGGGT 247

           251 ----+----*----+----*----+----*----+----*----+----* 300
msh08123   246 CCTCTCTGCACCCCCTAAGGAGTCACGGTCCAATCGCCTCCGGTTTTCCA 295
               ||| ||||| || || |||||||||||| ||| |||||| ||  | ||||
as00087    248 CCTATCTGCGCCTCCCAAGGAGTCACGGACCAGTCGCCTTCGACTCTCCA 297

           301 ----+----*----+----*----+----*----+----*----+----* 350
msh08123   296 AGACCCTGTGGGGAAGGCACAAGAACGTGGCACCACTGGAACCCAAGCCA 345
               || |||| ||||| ||||| ||||||    |||| | ||| ||  | || 
as00087    298 AGGCCCTCTGGGGGAGGCATAAGAAC...CCACCGCCGGAGCCAGACCCG 344

           351 ----+----*----+----*----+----*----+----*----+----* 400
msh08123   346 AATCCGAAGGCCCCAGAAAGGAGCAAGCAAGCCATGGTTCCTGGAGTCAA 395
                | ||| ||                                         
as00087    345 GAGCCGGAG......................................... 353

           401 ----+----*----+----*----+----*----+----*----+----* 450
msh08123   396 AGGACAACTTTCAGGAGTCAGTTCTCTCCTTCAATTACAACCAGAGCTGG 445
                          |||||| |                     ||||||||||
as00087    354 ...........CAGGAGGC....................CCCAGAGCTGG 372

           451 ----+----*----+----*----+----*----+----*----+----* 500
msh08123   446 AGTTGGTAGCAGACCCAGATCTACCCGTTGCACAGATCCCTGAGCCTCCC 495
               ||  ||   |||| |  || |  ||     |||||||||||||| | |||
as00087    373 AGCCGGAGCCAGAGCTGGAGCCCCCTACCCCACAGATCCCTGAGGCCCCC 422

           501 ----+----*----+----*----+----*----+----*----+----* 550
msh08123   496 ACCCCGGACATGCCTGTTTGGAACATTGATGGCTTTACCCTTCTCGAAGG 545
               || ||  || ||||||| ||| ||||||  ||||| ||||| || || ||
as00087    423 ACACCCAACGTGCCTGTCTGGGACATTGGGGGCTTCACCCTGCTTGATGG 472

           551 ----+----*----+----*----+----*----+----*----+----* 600
msh08123   546 AAAGCTGGTGATGCT...CGGTGAGGAGGAGGGTCCTCGCCAAATCCGAG 592
                ||||||||| ||||    || |||||||||||||||||      ||| |
as00087    473 GAAGCTGGTGCTGCTTGGAGGAGAGGAGGAGGGTCCTCGAAGGCCCCGGG 522

           601 ----+----*----+----*----+----*----+----*----+----* 650
msh08123   593 TGGGAAGCGCCAGTTCAGAGAACAGCATGCAGGCAGCTTTGGGGAACCTC 642
               ||||||| || || || |||  |||||| || |  ||  |||||||| ||
as00087    523 TGGGAAGTGCTAGCTCCGAGGGCAGCATCCACGTGGCCATGGGGAACTTC 572

           651 ----+----*----+----*----+----*----+----*----+----* 700
msh08123   643 AAGGATGCAGTTCGGACCCCTGGAAAAACTGAGCCGGAAGCTGCTGGTTC 692
               | |||| ||| |||||  ||||||||||| || |||||  |||||||| |
as00087    573 AGGGATCCAGATCGGATGCCTGGAAAAACAGAACCGGAGACTGCTGGTCC 622

           701 ----+----*----+----*----+----*----+----*----+----* 750
msh08123   693 CAACCAGGTCCACAATGTTCGGAAGTTACTCAAGAGGCTGAAGGAGAAGA 742
               ||||||||||||||| ||||||  ||| |||||||||||||| |||||||
as00087    623 CAACCAGGTCCACAACGTTCGGGGGTTGCTCAAGAGGCTGAAAGAGAAGA 672

           751 ----+----*----+----*----+----*----+----*----+----* 800
msh08123   743 AAAGAGCCAAGTCAGAACTGGGAGCCTACACCCCTCGAGATGGACCTCCC 792
               |||  |||| ||  ||                || || |||||||| |||
as00087    673 AAAAGGCCAGGTTGGA...............GCCCCGGGATGGACCCCCC 707

           801 ----+----*----+----*----+----*----+----*----+----* 850
msh08123   793 AGTGCCCTCGGTTCCAGAGAATCGCTGGCCACACTCTCTGAGCTGGACCT 842
               ||||| || || || || || |||||||||||||||||||| ||||||||
as00087    708 AGTGCTCTGGGCTCTAGGGAGTCGCTGGCCACACTCTCTGAACTGGACCT 757

           851 ----+----*----+----*----+----*----+----*----+----* 900
msh08123   843 AGGTGCTGAGAGGGACGTGAGAGTCTGGCCACTGCATCCTAGCCTCTTGG 892
                ||||| ||| |||| ||| |  ||||||||||||| || |||||| |||
as00087    758 GGGTGCCGAGCGGGATGTGCGGATCTGGCCACTGCACCCCAGCCTCCTGG 807

           901 ----+----*----+----*----+----*----+----*----+----* 950
msh08123   893 GGGAGCCCTACTGCTTCCAGGTAACCTGGGCAGGTGGGAGCCTCTGTTTC 942
               |||||||| ||||||| |||||||||||| | ||||| |||| ||| |||
as00087    808 GGGAGCCCCACTGCTTTCAGGTAACCTGGACGGGTGGAAGCCGCTGCTTC 857

           951 ----+----*----+----*----+----*----+----*----+----* 1000
msh08123   943 TCATGTCGCTCGAGCGCTGAGAGAGACCGTTGGATTGAGGACCTTCGTCG 992
               || |||||||||  ||||||||||||||| ||||| ||||||||||||||
as00087    858 TCTTGTCGCTCGGCCGCTGAGAGAGACCGCTGGATCGAGGACCTTCGTCG 907

          1001 ----+----*----+----*----+----*----+----*----+----* 1050
msh08123   993 GCAGTTCCAGCCGAGC.................................. 1008
                || |||||||| | |                                  
as00087    908 CCAATTCCAGCCCACCCAGGTCATTGCCCTCAGAAAAGCGGGACACGGAG 957

          1051 ----+----*----+----*----+----*----+----*----+----* 1100
msh08123  1009 .................................................. 1009
                                                                 
as00087    958 ACCCCGCCCACAGGCCTGGCCCGCCCCTTCCACTCTACCCAAAGCCAGCT 1007

          1101 ----+----*----+----*----+----*----+----*----+----* 1150
msh08123  1009 .................................................. 1009
                                                                 
as00087   1008 CGCTCCATAACCCGCGCCCCCAACACACCTGCCCCGCCTGCATCGTTGCA 1057

          1151 ----+----*----+----*----+----*----+----*----+----* 1200
msh08123  1009 .................................................. 1009
                                                                 
as00087   1058 ACGGCTCCAAGACTCCCCTTCCATCTATTACCAGCCCCGGGAATATCAGA 1107

          1201 ----+----*----+----*----+----*----+----*----+----* 1250
msh08123  1009 .................................................. 1009
                                                                 
as00087   1108 CCCGCCCCTTCTGTCCACTCCCGGCCCCATATGCACCCAAAGCCCTGCCT 1157

          1251 ----+----*----+----*----+----*----+----*----+----* 1300
msh08123  1009 .................................................. 1009
                                                                 
as00087   1158 GCCAGGCAATCGCGAAGTCTTTACTCCAGTACTACTCTTGTATGCCCTGG 1207

          1301 ----+----*----+----*----+----*----+----*----+----* 1350
msh08123  1009 .................................................. 1009
                                                                 
as00087   1208 AGCCCCCAACTCTCCCGGAGAGCCCAGAAAAGCTAGAAGACTTCCGCGTA 1257

          1351 ----+----*----+----*----+----*----+----*----+----* 1400
msh08123  1009 .................................................. 1009
                                                                 
as00087   1258 ACCGCCTCCGGACTCCCGTGGCTCCACTTCTTTCCCCAAAGGCCCAGCAA 1307

          1401 ----+----*----+----*----+----*----+----*----+----* 1450
msh08123  1009 .................................................. 1009
                                                                 
as00087   1308 TCTCCCTAGGGACCCGACGCAACAGGTTTGAGGCTCCACGCTTCACTTGC 1357
                                              G  S  T  L  H  L  R

          1451 ----+----*----+----*----+----*----+----*----+----* 1500
msh08123  1009 .................................................. 1009
                                                                 
as00087   1358 GGTGCCTTCTTTGGGGTGCCAACAAACCAAAACGCTTCCCCCCATCCCAC 1407
                 C  L  L  W  G  A  N  K  P  K  R  F  P  P  S  H  

          1501 ----+----*----+----*----+----*----+----*----+----* 1550
                        Q  D  N  V  E  R  Q  E  M  W  L  T  V  W 
msh08123  1009 .........CAGGACAACGTGGAGCGGCAAGAGATGTGGCTGACGGTGTG 1049
                        |||||||||||||||||| ||||||  |||||||  |||||
as00087   1408 TCCCATCTGCAGGACAACGTGGAGCGGGAAGAGACATGGCTGAGCGTGTG 1457
               S  H  L  Q  D  N  V  E  R  E  E  T  W  L  S  V  W 

          1551 ----+----*----+----*----+----*----+----*----+----* 1600
                V  H  E  A  K  G  L  P  R  A  -        V  P  G  V
msh08123  1050 GGTGCACGAGGCCAAAGGGCTACCCCGAGCAAC......TGTACCCGGAG 1093
               ||||||||| || || ||||| ||||||||| |       | |||||| |
as00087   1458 GGTGCACGAAGCGAAGGGGCTTCCCCGAGCAGCGGCGGGGGCACCCGGCG 1507
                V  H  E  A  K  G  L  P  R  A  A  A  G  A  P  G  V

          1601 ----+----*----+----*----+----*----+----*----+----* 1650
                 R  A  E  L  W  L  D  G  A  L  L  A  R  T  A  P  
msh08123  1094 TGCGTGCGGAGCTGTGGTTGGATGGCGCGCTATTGGCGCGCACTGCCCCC 1143
               |||| || ||||||||| |||||||||||||  |||| ||||| || || 
as00087   1508 TGCGCGCCGAGCTGTGGCTGGATGGCGCGCTGCTGGCACGCACGGCGCCT 1557
                 R  A  E  L  W  L  D  G  A  L  L  A  R  T  A  P  

          1651 ----+----*----+----*----+----*----+----*----+----* 1700
               R  A  G  P  G  Q  L  F  W  A  E  R  F  H  F  E  A 
msh08123  1144 CGTGCCGGCCCAGGCCAACTCTTCTGGGCTGAACGCTTCCACTTCGAGGC 1193
               || |||||||||||||| ||||||||||| || |||||||||||||||||
as00087   1558 CGGGCCGGCCCAGGCCAGCTCTTCTGGGCCGAGCGCTTCCACTTCGAGGC 1607
               R  A  G  P  G  Q  L  F  W  A  E  R  F  H  F  E  A 

          1701 ----+----*----+----*----+----*----+----*----+----* 1750
                L  P  P  A  R  R  L  S  L  R  L  R  S  A  G  P  A
msh08123  1194 GTTGCCACCTGCCCGTCGCCTGTCGCTGCGGTTGCGCAGCGCAGGCCCGG 1243
               | ||||||| || |||||||||||||||||| ||||| ||   |||||||
as00087   1608 GCTGCCACCGGCACGTCGCCTGTCGCTGCGGCTGCGCGGCTTGGGCCCGG 1657
                L  P  P  A  R  R  L  S  L  R  L  R  G  L  G  P  G

          1751 ----+----*----+----*----+----*----+----*----+----* 1800
                 G  A  T  V  G  R  V  V  L  E  L  D  E  V  S  I  
msh08123  1244 CAGGGGCAACCGTGGGCAGGGTGGTGCTGGAGCTGGACGAGGTGAGCATC 1293
                | | ||     ||||| | ||||  |||| |||||| ||| ||  |  |
as00087   1658 GAAGCGCGGTGCTGGGCCGCGTGGCCCTGGCGCTGGAGGAGCTGGACGCC 1707
                 S  A  V  L  G  R  V  A  L  A  L  E  E  L  D  A  

          1801 ----+----*----+----*----+----*----+----*----+----* 1850
               P  R  A  P  A  A  G  L  E  R  W  F  P  V  L  G  A 
msh08123  1294 CCCCGCGCGCCTGCCGCGGGCCTAGAGCGCTGGTTCCCGGTGCTTGGGGC 1343
               || |||||||||||||| || || ||||||||||||||| |||| |||||
as00087   1708 CCACGCGCGCCTGCCGCCGGTCTGGAGCGCTGGTTCCCGCTGCTCGGGGC 1757
               P  R  A  P  A  A  G  L  E  R  W  F  P  L  L  G  A 

          1851 ----+----*----+----*----+----*----+----*----+----* 1900
                P  A  G  A  V  L  R  A  R  I  R  V  R  C  L  R  V
msh08123  1344 GCCGGCAGGTGCGGTGCTACGAGCGCGGATCAGGGTGCGTTGTCTGCGCG 1393
               |||||| || || | ||| || ||||||||  ||| |||| | |||||||
as00087   1758 GCCGGCGGGCGCAGCGCTGCGGGCGCGGATTCGGGCGCGTCGCCTGCGCG 1807
                P  A  G  A  A  L  R  A  R  I  R  A  R  R  L  R  V

          1901 ----+----*----+----*----+----*----+----*----+----* 1950
                 L  P  S  E  R  Y  K  E  L  A  E  F  L  T  F  H  
msh08123  1394 TGCTGCCGTCGGAGCGCTATAAGGAGCTGGCCGAGTTCCTCACCTTCCAC 1443
               |||||||||| |||||||| ||||||||||| ||||||||||||||||||
as00087   1808 TGCTGCCGTCCGAGCGCTACAAGGAGCTGGCGGAGTTCCTCACCTTCCAC 1857
                 L  P  S  E  R  Y  K  E  L  A  E  F  L  T  F  H  

          1951 ----+----*----+----*----+----*----+----*----+----* 2000
               Y  A  R  L  C  G  A  L  E  P  A  L  S  A  Q  A  K 
msh08123  1444 TACGCGCGCCTGTGCGGGGCGCTGGAGCCGGCGCTGTCTGCTCAGGCCAA 1493
               || |||||||| |||||||| |||||||| |||||| |||| ||||||||
as00087   1858 TATGCGCGCCTCTGCGGGGCCCTGGAGCCCGCGCTGCCTGCGCAGGCCAA 1907
               Y  A  R  L  C  G  A  L  E  P  A  L  P  A  Q  A  K 

          2001 ----+----*----+----*----+----*----+----*----+----* 2050
                E  E  L  A  A  A  M  V  R  V  L  R  A  T  G  R  A
msh08123  1494 AGAGGAGCTGGCAGCAGCCATGGTGCGCGTGCTACGAGCCACTGGCCGAG 1543
                ||||||||||| |||||||||||||||||||| || ||||| ||||| |
as00087   1908 GGAGGAGCTGGCGGCAGCCATGGTGCGCGTGCTGCGGGCCACCGGCCGGG 1957
                E  E  L  A  A  A  M  V  R  V  L  R  A  T  G  R  A

          2051 ----+----*----+----*----+----*----+----*----+----* 2100
                 Q  A  L  V  T  D  L  G  T  A  E  L  A  R  C  G  
msh08123  1544 CGCAGGCATTGGTGACAGACCTAGGAACCGCAGAGCTGGCACGCTGTGGA 1593
               |||||||  ||||||| ||||| || || || |||||||| |||||||||
as00087   1958 CGCAGGCGCTGGTGACTGACCTGGGCACTGCGGAGCTGGCGCGCTGTGGA 2007
                 Q  A  L  V  T  D  L  G  T  A  E  L  A  R  C  G  

          2101 ----+----*----+----*----+----*----+----*----+----* 2150
               G  R  E  A  L  L  F  R  E  N  T  L  A  T  K  A  I 
msh08123  1594 GGCCGTGAAGCACTGCTATTCCGAGAAAATACCTTAGCCACAAAGGCCAT 1643
               |||||||| || ||||| ||||| ||||| || || ||||| ||||| ||
as00087   2008 GGCCGTGAGGCGCTGCTGTTCCGGGAAAACACATTGGCCACCAAGGCTAT 2057
               G  R  E  A  L  L  F  R  E  N  T  L  A  T  K  A  I 

          2151 ----+----*----+----*----+----*----+----*----+----* 2200
                D  E  Y  M  K  L  V  A  Q  E  Y  L  Q  D  T  L  G
msh08123  1644 TGATGAATACATGAAACTGGTGGCACAGGAGTACCTCCAGGACACCCTGG 1693
                ||||| |||||||| || ||||||||||| ||||||||||| |||||||
as00087   2058 CGATGAGTACATGAAGCTCGTGGCACAGGATTACCTCCAGGAGACCCTGG 2107
                D  E  Y  M  K  L  V  A  Q  D  Y  L  Q  E  T  L  G

          2201 ----+----*----+----*----+----*----+----*----+----* 2250
                 Q  V  V  R  C  L  C  A  S  T  E  D  C  E  V  D  
msh08123  1694 GGCAGGTTGTGCGGTGTCTCTGTGCCTCCACTGAGGACTGTGAAGTGGAC 1743
               | |||||||||||| |||||||||| || |||||||||||||||||||||
as00087   2108 GACAGGTTGTGCGGCGTCTCTGTGCTTCTACTGAGGACTGTGAAGTGGAC 2157
                 Q  V  V  R  R  L  C  A  S  T  E  D  C  E  V  D  

          2251 ----+----*----+----*----+----*----+----*----+----* 2300
               P  S  K  C  P  T  P  E  L  P  K  H  Q  A  R  L  R 
msh08123  1744 CCCAGCAAGTGCCCAACACCAGAGCTGCCAAAACACCAAGCTAGACTGCG 1793
               |||||||| || ||| |  | ||||||||| | ||||| || ||||| ||
as00087   2158 CCCAGCAAATGTCCAGCCTCGGAGCTGCCAGAGCACCAGGCCAGACTTCG 2207
               P  S  K  C  P  A  S  E  L  P  E  H  Q  A  R  L  R 

          2301 ----+----*----+----*----+----*----+----*----+----* 2350
                D  S  C  E  E  V  F  E  N  I  I  H  S  Y  N  C  F
msh08123  1794 AGACAGCTGTGAGGAGGTTTTTGAAAACATCATCCATTCTTACAACTGTT 1843
                 ||||||| |||||||| || |||| ||| |||||||| ||| |||| |
as00087   2208 GAACAGCTGCGAGGAGGTCTTCGAAACCATTATCCATTCCTACGACTGGT 2257
                N  S  C  E  E  V  F  E  T  I  I  H  S  Y  D  W  F

          2351 ----+----*----+----*----+----*----+----*----+----* 2400
                 P  A  E  L  G  S  V  F  S  S  W  R  E  A  C  K  
msh08123  1844 TCCCAGCAGAGCTGGGCTCCGTGTTCTCAAGTTGGCGTGAAGCATGCAAA 1893
               |||| || |||||||||  |||||||||||| ||||| |||||||| |||
as00087   2258 TCCCTGCGGAGCTGGGCATCGTGTTCTCAAGCTGGCGAGAAGCATGTAAA 2307
                 P  A  E  L  G  I  V  F  S  S  W  R  E  A  C  K  

          2401 ----+----*----+----*----+----*----+----*----+----* 2450
               A  R  G  S  E  A  L  G  P  R  L  V  C  A  S  L  F 
msh08123  1894 GCACGAGGCTCTGAGGCCCTGGGCCCCCGGCTGGTGTGTGCTTCCCTCTT 1943
               | ||| ||||||||||  ||||||||||| |||||||| || ||||||||
as00087   2308 GAACGTGGCTCTGAGGTGCTGGGCCCCCGACTGGTGTGCGCCTCCCTCTT 2357
               E  R  G  S  E  V  L  G  P  R  L  V  C  A  S  L  F 

          2451 ----+----*----+----*----+----*----+----*----+----* 2500
                L  R  L  L  C  P  A  I  L  A  P  S  L  F  G  L  A
msh08123  1944 CCTGAGGCTTTTATGCCCTGCCATCTTGGCACCTAGCCTCTTTGGCCTGG 1993
               |||| ||||  | |||||||||||| ||||||| |||||||||||  |||
as00087   2358 CCTGCGGCTCCTGTGCCCTGCCATCCTGGCACCCAGCCTCTTTGGTTTGG 2407
                L  R  L  L  C  P  A  I  L  A  P  S  L  F  G  L  A

          2501 ----+----*----+----*----+----*----+----*----+----* 2550
                 P  E  H  P  A  P  G  P  A  R  T  L  T  L  I  A  
msh08123  1994 CACCAGAGCACCCAGCCCCAGGCCCAGCCAGAACTCTTACACTCATCGCC 2043
               ||||||| || ||||| || ||||||||| | || || ||||| || |||
as00087   2408 CACCAGACCATCCAGCACCCGGCCCAGCCCGCACCCTCACACTGATTGCC 2457
                 P  D  H  P  A  P  G  P  A  R  T  L  T  L  I  A  

          2551 ----+----*----+----*----+----*----+----*----+----* 2600
               K  V  I  Q  N  L  A  N  C  A  P  F  G  E  K  E  A 
msh08123  2044 AAGGTCATCCAGAACCTCGCCAACTGTGCCCCGTTTGGTGAGAAGGAAGC 2093
               |||||||||||||||||||||||| |||||||||| ||||||||||| ||
as00087   2458 AAGGTCATCCAGAACCTCGCCAACCGTGCCCCGTTCGGTGAGAAGGAGGC 2507
               K  V  I  Q  N  L  A  N  R  A  P  F  G  E  K  E  A 

          2601 ----+----*----+----*----+----*----+----*----+----* 2650
                Y  M  A  F  M  N  S  F  L  E  D  H  G  P  A  M  Q
msh08123  2094 CTACATGGCGTTTATGAATAGCTTCCTGGAAGATCATGGACCAGCCATGC 2143
               ||||||||  || ||||||||||||||||| || ||||||||||||||||
as00087   2508 CTACATGGGCTTCATGAATAGCTTCCTGGAGGAACATGGACCAGCCATGC 2557
                Y  M  G  F  M  N  S  F  L  E  E  H  G  P  A  M  Q

          2651 ----+----*----+----*----+----*----+----*----+----* 2700
                 H  F  L  D  Q  V  A  T  V  D  A  D  T  T  P  S  
msh08123  2144 AACACTTCCTGGACCAGGTAGCTACGGTGGATGCAGATACCACACCCAGT 2193
               ||  |||||||||||||||||| | ||||||||  ||| |  | ||||||
as00087   2558 AATGCTTCCTGGACCAGGTAGCCATGGTGGATGTGGATGCTGCCCCCAGT 2607
                 C  F  L  D  Q  V  A  M  V  D  V  D  A  A  P  S  

          2701 ----+----*----+----*----+----*----+----*----+----* 2750
               G  Y  Q  G  S  G  D  L  A  L  Q  L  A  V  L  H  V 
msh08123  2194 GGCTACCAGGGAAGTGGAGACCTGGCCCTTCAGTTAGCAGTTCTTCATGT 2243
               || |||||||| ||||| || |||||||| |||||||| || || |||| 
as00087   2608 GGTTACCAGGGCAGTGGTGATCTGGCCCTCCAGTTAGCTGTCCTGCATGC 2657
               G  Y  Q  G  S  G  D  L  A  L  Q  L  A  V  L  H  A 

          2751 ----+----*----+----*----+----*----+----*----+----* 2800
                Q  L  C  T  I  F  A  E  L  D  Q  K  T  Q  D  S  L
msh08123  2244 CCAGCTCTGCACAATCTTTGCCGAACTTGACCAGAAAACCCAAGATAGCT 2293
               ||||||||| ||||| ||||| || |||||||||| ||||| ||| | | 
as00087   2658 CCAGCTCTGTACAATTTTTGCTGAGCTTGACCAGACAACCCGAGACACCC 2707
                Q  L  C  T  I  F  A  E  L  D  Q  T  T  R  D  T  L

          2801 ----+----*----+----*----+----*----+----*----+----* 2850
                 E  P  L  P  T  I  L  R  A  I  E  E  G  R  P  V  
msh08123  2294 TGGAACCACTACCCACCATCCTTCGAGCCATTGAGGAAGGCCGGCCTGTC 2343
               |||||||||| ||||||||||| |||||||||||||| |||| |||||| 
as00087   2708 TGGAACCACTGCCCACCATCCTGCGAGCCATTGAGGAGGGCCAGCCTGTG 2757
                 E  P  L  P  T  I  L  R  A  I  E  E  G  Q  P  V  

          2851 ----+----*----+----*----+----*----+----*----+----* 2900
               P  V  S  V  P  M  R  L  P  R  I  S  T  Q  V  Q  S 
msh08123  2344 CCAGTGTCTGTACCAATGCGTCTTCCCCGGATCTCCACTCAGGTCCAATC 2393
               |  ||||| || ||||||||||| || | |  | |  | |||||||| ||
as00087   2758 CTTGTGTCAGTGCCAATGCGTCTCCCACTGCCCCCGGCCCAGGTCCACTC 2807
               L  V  S  V  P  M  R  L  P  L  P  P  A  Q  V  H  S 

          2901 ----+----*----+----*----+----*----+----*----+----* 2950
                S  F  F  S  G  E  K  P  G  F  L  A  P  R  D  L  P
msh08123  2394 CAGCTTCTTTTCAGGGGAGAAACCCGGCTTCCTGGCACCCCGAGACCTCC 2443
               |||| |||   |||||||||| |||||||||||||| ||||| |||||||
as00087   2808 CAGCCTCTCCGCAGGGGAGAAGCCCGGCTTCCTGGCCCCCCGGGACCTCC 2857
                S  L  S  A  G  E  K  P  G  F  L  A  P  R  D  L  P

          2951 ----+----*----+----*----+----*----+----*----+----* 3000
                 K  H  T  P  L  I  S  K  S  Q  S  L  R  S  F  Q  
msh08123  2444 CCAAGCACACTCCCCTCATTTCCAAGAGTCAATCTCTGCGCAGCTTTCAA 2493
               |||||||||| || ||||| |||||||| || |||||||||||| |||  
as00087   2858 CCAAGCACACCCCTCTCATCTCCAAGAGCCAGTCTCTGCGCAGCGTTCGC 2907
                 K  H  T  P  L  I  S  K  S  Q  S  L  R  S  V  R  

          3001 ----+----*----+----*----+----*----+----*----+----* 3050
               G  A  G  S  W  A  S  R  R  P  D  E  E  R  P  Q  R 
msh08123  2494 GGGGCAGGAAGCTGGGCCAGTCGGCGGCCAGATGAGGAGCGGCCCCAGAG 2543
                |  |||  || |||||| | |  ||||| || || |||||||||| | |
as00087   2908 CGCTCAGAGAGTTGGGCCCGGCCACGGCCGGACGAAGAGCGGCCCCTGCG 2957
               R  S  E  S  W  A  R  P  R  P  D  E  E  R  P  L  R 

          3051 ----+----*----+----*----+----*----+----*----+----* 3100
                R  P  R  P  V  L  R  T  Q  S  V  P  A  R  R  P  T
msh08123  2544 GCGGCCGCGGCCAGTGCTGCGCACACAGAGCGTCCCTGCCAGACGTCCTA 2593
               |||||| ||||| |||| |||||| ||||| ||||| | | | |||||| 
as00087   2958 GCGGCCCCGGCCGGTGCAGCGCACGCAGAGTGTCCCGGTCCGGCGTCCTG 3007
                R  P  R  P  V  Q  R  T  Q  S  V  P  V  R  R  P  A

          3101 ----+----*----+----*----+----*----+----*----+----* 3150
                 H  R  R  P  S  A  G  S  K  P  R  P  K  G  S  L  
msh08123  2594 CCCACCGTCGCCCTTCTGCAGGTTCCAAGCCGCGGCCCAAAGGCTCCCTA 2643
               ||| ||| ||||  ||||| ||  ||  |||||| |||||||||||||| 
as00087   3008 CCCGCCGCCGCCAATCTGCGGGGCCCTGGCCGCGACCCAAAGGCTCCCTG 3057
                 R  R  R  Q  S  A  G  P  W  P  R  P  K  G  S  L  

          3151 ----+----*----+----*----+----*----+----*----+----* 3200
               R  M  G  P  A  P  C  G  R  A  W  T  R  A  S  A  S 
msh08123  2644 CGCATGGGCCCCGCGCCTTGCGGACGGGCCTGGACTAGGGCTTCTGCCTC 2693
                ||||||| || |||||  |||  ||| | |||||  |||  || |||||
as00087   3058 AGCATGGGACCAGCGCCCCGCGCCCGGCCTTGGACCCGGGACTCCGCCTC 3107
               S  M  G  P  A  P  R  A  R  P  W  T  R  D  S  A  S 

          3201 ----+----*----+----*----+----*----+----*----+----* 3250
                L  P  R  K  P  S  V  P  W  Q  R  Q  M  D  Q  P  G
msh08123  2694 GCTACCTCGGAAGCCATCGGTACCATGGCAGCGCCAAATGGACCAGCCGG 2743
               ||| ||||||||||| |||||||| |||||||||||||||||||||||| 
as00087   3108 GCTGCCTCGGAAGCCGTCGGTACCCTGGCAGCGCCAAATGGACCAGCCGC 3157
                L  P  R  K  P  S  V  P  W  Q  R  Q  M  D  Q  P  Q

          3251 ----+----*----+----*----+----*----+----*----+----* 3300
                 D  R  Y  Q  T  T  G  T  H  R  P  V  G  K  L  A  
msh08123  2744 GGGACCGATACCAAACTACGGGAACGCACCGACCAGTGGGCAAGCTAGCA 2793
                 |||||| ||||  |   ||| ||||||||||| |||  |||| | |||
as00087   3158 AAGACCGAAACCAGGCACTGGGCACGCACCGACCTGTGAACAAGTTGGCA 3207
                 D  R  N  Q  A  L  G  T  H  R  P  V  N  K  L  A  

          3301 ----+----*----+----*----+----*----+----*----+----* 3350
               E  I  Q  C  E  V  A  I  F  R  E  A  Q  K  A  L  S 
msh08123  2794 GAGATACAATGTGAGGTGGCTATATTCCGCGAGGCACAGAAAGCTCTGTC 2843
               ||| | || || ||||||||     | || ||||  |||||||  |||||
as00087   3208 GAGCTGCAGTGCGAGGTGGCCGCTCTGCGTGAGGAGCAGAAAGTGCTGTC 3257
               E  L  Q  C  E  V  A  A  L  R  E  E  Q  K  V  L  S 

          3351 ----+----*----+----*----+----*----+----*----+----* 3400
                L  L  V  E  S  L  S  T  Q  V  Q  A  L  K  E  Q  Q
msh08123  2844 CCTTCTGGTGGAGTCGTTGAGTACCCAAGTCCAAGCCTTGAAGGAGCAGC 2893
               ||  || ||||||||| |||| |||||| |||  ||||||| ||||||||
as00087   3258 CCGCCTCGTGGAGTCGCTGAGCACCCAAATCCGGGCCTTGACGGAGCAGC 3307
                R  L  V  E  S  L  S  T  Q  I  R  A  L  T  E  Q  Q

          3401 ----+----*----+----*----+----*----+----*----+----* 3450
                 E  H  F  R  C  Q  L  Q  D  L  Y  S  R  L  G  A  
msh08123  2894 AGGAGCACTTCCGCTGTCAGCTGCAGGATCTGTACTCCAGACTGGGAGCT 2943
               |||||||  | ||  | ||||||||||||||| ||||||| ||  | |||
as00087   3308 AGGAGCAGCTGCGGGGCCAGCTGCAGGATCTGGACTCCAGGCTCCGTGCT 3357
                 E  Q  L  R  G  Q  L  Q  D  L  D  S  R  L  R  A  

          3451 ----+----*----+----*----+----*----+----*----+----* 3500
               -                                                 
msh08123  2944 G................................................. 2944
               |                                                 
as00087   3358 GGGTGAGCCCAGCCCTCTCCATTAGCTCCGCCCCCAGGTGGGCCCGATCC 3407
               G  *                                              

          3501 ----+----*----+----*----+----*----+----*----+----* 3550
msh08123  2945 .................................................. 2945
                                                                 
as00087   3408 AATTAGATCTGACCAGGCCCTCCCCCCCCCCCCCCCAAACCACGCCTAGC 3457

          3551 ----+----*----+----*----+----*----+----*----+----* 3600
msh08123  2945 .................................................. 2945
                                                                 
as00087   3458 ACGGCCAAGCCCCACCCTGGCCCCGCCCCAATAAGTCTGTCCAGGCCCTG 3507

          3601 ----+----*----+----*----+----*----+----*----+----* 3650
msh08123  2945 .................................................. 2945
                                                                 
as00087   3508 CCTCTTGGCAAGCCCCGCCCCCAGGTGGACCCACCCAACGAAGGTCGGCC 3557

          3651 ----+----*----+----*----+----*----+----*----+----* 3700
msh08123  2945 .................................................. 2945
                                                                 
as00087   3558 TAGGACCCTGGCAGATTCTCTCATCCAGGCCCCACCCCTCCATCAGCCCC 3607

          3701 ----+----*----+----*----+----*----+----*----+----* 3750
msh08123  2945 .................................................. 2945
                                                                 
as00087   3608 TCCCAACCTGGCTGGGCCGGGCACTGAGATATCTCAGCCTGGGCACTCCC 3657

          3751 ----+----*----+----*----+----*----+----*----+----* 3800
msh08123  2945 .................................................. 2945
                                                                 
as00087   3658 TCCAGCTCAGTCACGACCCCACCCAGGACTGGCCAAACCCCACCTCCAAG 3707

          3801 ----+----*----+----*----+----*----+----*----+----* 3850
msh08123  2945 .................................................. 2945
                                                                 
as00087   3708 GGATGATTGATAGGTCCTGTCATTCCAATCATTCCCGTCTCTGGTCCGAC 3757

          3851 ----+----*----+----*----+----*----+----*----+----* 3900
msh08123  2945 .................................................. 2945
                                                                 
as00087   3758 CCTTCTTCAAGCATATTTAGGCCCCCGCCCCTCAGTCTGGACAAGCTCCG 3807

          3901 ----+----*----+----*----+----*----+----*----+----* 3950
msh08123  2945 .................................................. 2945
                                                                 
as00087   3808 CCTCCACTCCAAGCTCCGCCCCCGGAGTCCCTGGCACTCCCACCAGGAGC 3857

          3951 ----+----*----+----*----+----*----+----*----+----* 4000
msh08123  2945 .................................................. 2945
                                                                 
as00087   3858 TGGAAGGGACTTCCTTAGCCTCTGCCCTTCGCCCCAATCCCCCGACCCCC 3907

          4001 ----+----*----+----*----+----*----+----*----+----* 4050
msh08123  2945 ...GTATCTCAAAGTTGGATTCTAAGGGTGGTCTCCCAAGCAACGGAAGC 2991
                  | | |||| |||| |||||  ||      ||  ||||||| | | | 
as00087   3908 ACAGGAGCTCAGAGTTTGATTCAGAGCACAACCTAACAAGCAATGAAGGG 3957

          4051 ----+----*----+----*----+----*----+----*----+----* 4100
msh08123  2992 CACAGGCTGAAAAGTCTGGAACAGCGCCTGACTGAGATGGAATGTTCTCA 3041
               ||||| |||||||  ||||| || ||||| | |||||||||  |  ||||
as00087   3958 CACAGTCTGAAAAACCTGGAGCACCGCCTAAATGAGATGGAGAGAACTCA 4007

          4101 ----+----*----+----*----+----*----+----*----+----* 4150
msh08123  3042 GGACCAGCTGAGGGATAGCCTCCAAAGCCTGCAACTTCTTTCAAAAACAC 3091
               ||  ||||||||||||    |||| |||||||| |||  | |||  || |
as00087   4008 GGCTCAGCTGAGGGATGCTGTCCAGAGCCTGCAGCTTTCTCCAAGGACGC 4057

          4151 ----+----*----+----*----+----*----+----*----+----* 4200
msh08123  3092 CAGGATCTCGGAGCCAGCCCCTGCCTCTCAAAGCACCATGTGTCAATGGA 3141
                 || ||| |||| || |||| ||| ||||||||||| ||  ||||||||
as00087   4058 GGGGGTCTTGGAGTCAACCCCAGCCCCTCAAAGCACCCTGCCTCAATGGA 4107

          4201 ----+----*----+----*----+----*----+----*----+----* 4250
msh08123  3142 GCTGATCTGTCAATGGGCACTTGAGCATTTCACC...CCACTCCATATAT 3188
               |                ||| |||||    || |   || |  || |  |
as00087   4108 G............ACACCACCTGAGCTGCCCATCCTGCCTCATCACACGT 4145

          4251 ----+----*----+----*----+----*----+----*----+----* 4300
msh08123  3189 GGTCTGGAAGCCAAGAGAGCCACTACAGAGG................... 3219
               ||||||| |||  |||||    |  |  |||                   
as00087   4146 GGTCTGGGAGCAGAGAGATAGCCATCTTAGGGGGGGTGTCTGACTTTGCC 4195

          4301 ----+----*----+----*----+----*----+----*----+----* 4350
msh08123  3220 .................................................. 3220
                                                                 
as00087   4196 TTAGCCCTACTTGGCCTACAGTGGGGAGTGGAGCTGCTGGTCCCAACCAC 4245

          4351 ----+----*----+----*----+----*----+----*-- 4392
msh08123  3220 .......................................... 3220
                                                         
as00087   4246 TCTGGCAGTATGAAGTTGCCCAGTAAAATCTTGATTTCAGTG 4287