Comparison of KIAA cDNA sequences between mouse and human (FLJ00293)

<<Original sequence data>>

mouse  mFLJ00293 (msh04332)     length:   3197 bp, CDS:     2 -  3070
human     (sh03709)     length:   5391 bp, CDS:  2880 -  5318

In this page, the longest coding region predicted by GeneMark
was assigned as CDS for each of mouse and human KIAA cDNAs.
They were colored in green.  When the CDS positions were not identical
on the aligned sequences between mouse and human cDNAs, mouse cDNA
sequence was translated based on the human CDS information.
The amino acid sequence produced here may not be identical to the
protein sequence deduced (see Description).


<<Aligned sequence information>>

----------------------------------------------------------
            region      #match  #mismatch  %diff
----------------------------------------------------------
DNA

5'UTR:    178 -  2879     325      321      49.7
  CDS:   2880 -  5318    2081      358      14.7
3'UTR:   5319 -  5437      41       35      46.1

amino acid

  CDS:   2880 -  5318     728       85      10.5
----------------------------------------------------------


<<Alignment>>

             1 ----+----*----+----*----+----*----+----*----+----* 50
msh04332     1 .................................................. 1
                                                                 
sh03709      1 GCTTGTGACCCTTCCATCACTACCCCAATTCCCCCTCTCTGCCCACTCAA 50

            51 ----+----*----+----*----+----*----+----*----+----* 100
msh04332     1 .................................................. 1
                                                                 
sh03709     51 CCCCACCTAGCCCATCGTCCTCACTCCCCACCACACTCACCCCAACCAGT 100

           101 ----+----*----+----*----+----*----+----*----+----* 150
msh04332     1 .................................................. 1
                                                                 
sh03709    101 ACCCCACACCCCACAAACAAGAGGGTGTGACTTGCAGGGAACAAAAGCAG 150

           151 ----+----*----+----*----+----*----+----*----+----* 200
msh04332     1 ...........................TGCAGTGGGCAGGATGGAGGACG 23
                                           |  | ||||    |||    | 
sh03709    151 GATGTCCCCTTTATCTGAGACACCTCCAGATGAGGGCTCTCTGGGACCCA 200

           201 ----+----*----+----*----+----*----+----*----+----* 250
msh04332    24 AAGAAGGCCCTGAGTATGGGAAAC.......................... 47
                |   | | ||| | |    || |                          
sh03709    201 TAAGTGTCTCTGTGAACAATAAGCTCCCCACGAGGTGGGCTTCCCCTGGG 250

           251 ----+----*----+----*----+----*----+----*----+----* 300
msh04332    48 ......................................CAGACTTCGTGC 59
                                                     | ||  | ||||
sh03709    251 ATTACAGGCAGTGTCTCTAGTGGTATCAGAGCTCAACACTGAAGTGGTGC 300

           301 ----+----*----+----*----+----*----+----*----+----* 350
msh04332    60 TTTTGGATCAACTGACCATG.....GAGGACTTCATGAAGAACCTAGAGC 104
               |  |||    || | |  ||     ||| || ||    |||     | ||
sh03709    301 TCCTGGGGACACGGGCATTGACTCTGAGCACCTCTGTGAGATAGGCGCGC 350

           351 ----+----*----+----*----+----*----+----*----+----* 400
msh04332   105 TCAGG............................................. 109
                ||||                                             
sh03709    351 CCAGGGACTGTCTCCACAAGCTTTGACACCCCACTGAGGTGGGAGCTCCC 400

           401 ----+----*----+----*----+----*----+----*----+----* 450
msh04332   110 .................................................. 110
                                                                 
sh03709    401 TAGAAACAAACAGTGTCTCTGCAGTCTCAGAGCTTGACACTGAAGTGGAG 450

           451 ----+----*----+----*----+----*----+----*----+----* 500
msh04332   110 .................................................. 110
                                                                 
sh03709    451 CTCCCTGGCACGCCAGCGGTGTGTTTGCAAACTGTGGCCTGCCCACTGTG 500

           501 ----+----*----+----*----+----*----+----*----+----* 550
msh04332   110 .................................................. 110
                                                                 
sh03709    501 TCCTCTGGTTCACAGACAGTGTTTCCACAGACTCCGAACTCCCCCACTGA 550

           551 ----+----*----+----*----+----*----+----*----+----* 600
msh04332   110 .............TTTGAGAAGGGCCGTATCTAT................ 130
                            | |||   | || || ||| |                
sh03709    551 ACTGGGAGCACCCTATGACTGGAGCAGTGTCTCTAGGGACTTGGAGCTCC 600

           601 ----+----*----+----*----+----*----+----*----+----* 650
msh04332   131 .................................................. 131
                                                                 
sh03709    601 CCACTGAGTTAGGAGTTCCCTGCAGCACAGACATGGTCCCCACCAACTCT 650

           651 ----+----*----+----*----+----*----+----*----+----* 700
msh04332   131 .................................................. 131
                                                                 
sh03709    651 CAGCTCCCCAGTGCAGTGGGAGCTCCCTGGAACACAGCAGTTTCTCCCTG 700

           701 ----+----*----+----*----+----*----+----*----+----* 750
msh04332   131 .................................................. 131
                                                                 
sh03709    701 GTACACAGGCGGTGTTTCCATGGATTCTGAGCTCTTTACTGAGGTGGGAG 750

           751 ----+----*----+----*----+----*----+----*----+----* 800
msh04332   131 .................................................. 131
                                                                 
sh03709    751 CTCCCTGAGACAAGCACAGTGTCTCACAGACTCTGTGCTCCCCATGAGAC 800

           801 ----+----*----+----*----+----*----+----*----+----* 850
msh04332   131 .................................................. 131
                                                                 
sh03709    801 AGGAGCTCCCTGGGTAGCGGGCAGGGTCTCCAGGTTCTATGAACCGCCCG 850

           851 ----+----*----+----*----+----*----+----*----+----* 900
msh04332   131 ................................................AC 132
                                                               | 
sh03709    851 CTGAGGTGGGAGCCTCCTGGAATGCAAGGAGTGCCTTTAAGGACTCTGAG 900

           901 ----+----*----+----*----+----*----+----*----+----* 950
msh04332   133 CTACATTGGTGAGGTGCTC............................... 151
               || |  ||||| |||||||                               
sh03709    901 CTTCCATGGTGTGGTGCTCCCTGGGACACGGCAGTGTCTTCATGACTCTG 950

           951 ----+----*----+----*----+----*----+----*----+----* 1000
msh04332   152 .................................................. 152
                                                                 
sh03709    951 ACTTCCCACTGAAGTCACACTCTCTGGGACACAGCAGTGTCTGTAAGGAC 1000

          1001 ----+----*----+----*----+----*----+----*----+----* 1050
msh04332   152 .................................................. 152
                                                                 
sh03709   1001 TCTGAACTCCACTGAGGTGGGAGCCCCCTGGGCTGTCTTTCCAGTGAGTC 1050

          1051 ----+----*----+----*----+----*----+----*----+----* 1100
msh04332   152 ....................................GTATCCGTGAACCC 165
                                                   || || ||| || |
sh03709   1051 TGCGCTGAGGTGGGAGCTCACTGGGAAACCAGTAGTGTCTCTGTGGACTC 1100

          1101 ----+----*----+----*----+----*----+----*----+----* 1150
msh04332   166 CTACCAGGAACTGCCATTGTATGGGCCAGAGGCCAT.........TGCCA 206
                 |||   |  ||   |   |    || ||| |  |         | |  
sh03709   1101 TGACCTTTATTTGAGGTAAAAGCTCCCTGAGACACTGGCAGTGTCTCCAC 1150

          1151 ----+----*----+----*----+----*----+----*----+----* 1200
msh04332   207 AGTACCAGGGCCGCGAGCTCTATGAGCGACCACCTCATCTTTACGCCGTG 256
               ||   | | ||| |   ||    |     ||     |     | | ||| 
sh03709   1151 AGACTCTGAGCCTCCCCCTGCTGGGAGCCCCTGGGAACACAGAGGGCGTC 1200

          1201 ----+----*----+----*----+----*----+----*----+----* 1250
msh04332   257 GCCAATGCTGCTTACAAGGCAATGAAGCGCAGAT.CCAGGGACACCTGCA 305
                |||  |   || |     || ||||  | || | || ||||| || |||
sh03709   1201 TCCACAGACTCTGAGCTCTCACTGAAATGAAGCTCCCTGGGACTCCGGCA 1250

          1251 ----+----*----+----*----+----*----+----*----+----* 1300
msh04332   306 TCGTCATCTCA....................................... 316
                  ||  | ||                                       
sh03709   1251 GGATCTCCACAGACTCTTACCTCCCCACTGAAATGGGAGATTTTGGGACA 1300

          1301 ----+----*----+----*----+----*----+----*----+----* 1350
msh04332   317 .................................................. 317
                                                                 
sh03709   1301 TGGGCAGGGTCTCCATGGACTCAGAGTTCCCCCATTGAAGGGGGAGTTCG 1350

          1351 ----+----*----+----*----+----*----+----*----+----* 1400
msh04332   317 .................................................. 317
                                                                 
sh03709   1351 GTGGCACATGGGCATTGTCTCTAATGACTCTGAGCTCCTGGCTGAGGGGA 1400

          1401 ----+----*----+----*----+----*----+----*----+----* 1450
msh04332   317 .................................................. 317
                                                                 
sh03709   1401 TTGCTCCCTGGGACGTTGCCAGGGTCTCCACAGACTCTGAGCTCCCCATG 1450

          1451 ----+----*----+----*----+----*----+----*----+----* 1500
msh04332   317 .................................................. 317
                                                                 
sh03709   1451 AGGTGGGAGCTCCACAGGATTTGGGCAGGGTCTCCATGAACTATAAACTC 1500

          1501 ----+----*----+----*----+----*----+----*----+----* 1550
msh04332   317 ......GGGGAGAGTGGGGCAGGGAAGACAGAAGCCAGCAAGCACATCAT 360
                     | || | | |   |  || | |||||          || |   |
sh03709   1501 TCCACTGAGGTGGGAGCTCCCTGGGACACAGAGAGTTTTTTTCAGACTCT 1550

          1551 ----+----*----+----*----+----*----+----*----+----* 1600
msh04332   361 GCAGTACATTG....................................... 371
               | | | |  ||                                       
sh03709   1551 GAACTCCCATGAAATGGAGCTCTTTGGAGCATGGACAGTGTCTTTAAGGA 1600

          1601 ----+----*----+----*----+----*----+----*----+----* 1650
msh04332   372 ...............................................CTG 374
                                                               ||
sh03709   1601 CTCTGAGCTACCCACCGAGGTGGGAGCTCAGTGGGACATGTGAAGCAGTG 1650

          1651 ----+----*----+----*----+----*----+----*----+----* 1700
msh04332   375 CTGTCACCAACCCAAGCCAGAGGGCTGAGGTGGAGAGGGTGAAGAA.... 420
                   |||  || |  | | |    || ||||||   | ||   | |    
sh03709   1651 TCTCCACGGACTCTGGGCTGCCCACTCAGGTGGGCTGTGTCTTGGACACA 1700

          1701 ----+----*----+----*----+----*----+----*----+----* 1750
msh04332   421 .................................................. 421
                                                                 
sh03709   1701 GGCAGTGTCTCTATCAACTTTGGGCTCTGCTCTGAGGTAAGAGCCCCCTG 1750

          1751 ----+----*----+----*----+----*----+----*----+----* 1800
msh04332   421 .................................................. 421
                                                                 
sh03709   1751 GGACAGGGGCACTATCTCCTTGGACTTTGAACTCCCACCAAAGTGGAGCT 1800

          1801 ----+----*----+----*----+----*----+----*----+----* 1850
msh04332   421 .................................................. 421
                                                                 
sh03709   1801 CACTGGGACAGTGGCAACGTCTCCACGGATTCTGAGCTCCCGCTGAAGTG 1850

          1851 ----+----*----+----*----+----*----+----*----+----* 1900
msh04332   421 .................................................. 421
                                                                 
sh03709   1851 GAGCTCCCTGGGAATCAGTCAGTGTCCAAGAACTCTGAGCTTCCCTGAGG 1900

          1901 ----+----*----+----*----+----*----+----*----+----* 1950
msh04332   421 .................................................. 421
                                                                 
sh03709   1901 TGGGAGCCCTCTGGAACACAAGGAGTGTCCACGCAGACTCTGAGCTTCCA 1950

          1951 ----+----*----+----*----+----*----+----*----+----* 2000
msh04332   421 ............................................TGTGCT 426
                                                           || |||
sh03709   1951 TGAAGTAGGGCTGTCTGGGACACTGGCGGTGTCTCTATGGTCTCTGAGCT 2000

          2001 ----+----*----+----*----+----*----+----*----+----* 2050
msh04332   427 CCTCAAGTCCACCTGTGTGCTCGAAGCCTTTGGCAATGCCCGCACCAA.. 474
               ||||| |       | |  | |   | | | ||| ||| |  ||| ||  
sh03709   2001 CCTCACGGAGCTGGGAGCTCCCTCGGACATGGGCGATGTCTCCACAAACT 2050

          2051 ----+----*----+----*----+----*----+----*----+----* 2100
msh04332   475 .................................................. 475
                                                                 
sh03709   2051 CTCCATTCCCCTATGAGGTGGAAGCTCTCTGCGACATGACCATTGTCTCT 2100

          2101 ----+----*----+----*----+----*----+----*----+----* 2150
msh04332   475 .................................................. 475
                                                                 
sh03709   2101 GTGGAATCTGAGCTTCCCACTAGTGGGAGCCCCTTGGGACACAGGGCAGG 2150

          2151 ----+----*----+----*----+----*----+----*----+----* 2200
msh04332   475 .................................................. 475
                                                                 
sh03709   2151 GTCTCCACAGACTCTAAGCTCCCACTGAAGTGGAGCTCTCTGGAATAACA 2200

          2201 ----+----*----+----*----+----*----+----*----+----* 2250
msh04332   475 .................................................. 475
                                                                 
sh03709   2201 GCAGGATCTCCACAGACTCTTACCACCTCCAGTGAAGTGGGAACTTTTGG 2250

          2251 ----+----*----+----*----+----*----+----*----+----* 2300
msh04332   475 .................................................. 475
                                                                 
sh03709   2251 GACACAGGCAGTGTCTCCATGGACTCTGAGCTCCCTGCTGAGGTGGGAGT 2300

          2301 ----+----*----+----*----+----*----+----*----+----* 2350
msh04332   475 .................................................. 475
                                                                 
sh03709   2301 TCATGTGGCACCAGTTGTGTCTCCACAGACGCTGAGCTCCCTAGGACATG 2350

          2351 ----+----*----+----*----+----*----+----*----+----* 2400
msh04332   475 .................................................. 475
                                                                 
sh03709   2351 GGCTCTCCCTGGGTGGAGAGCAATGTCTCCACGGAACCTGTGCTCCCTAC 2400

          2401 ----+----*----+----*----+----*----+----*----+----* 2450
msh04332   475 .................................................. 475
                                                                 
sh03709   2401 TGAGGTGGGAGCTTCCTGGGACAGGGGCAGGGTCTCCAGTAACTCAGCTC 2450

          2451 ----+----*----+----*----+----*----+----*----+----* 2500
msh04332   475 ...........TCGCAACCACAACTCCAGCCGCTTTGGCAAGTACATGGA 513
                            ||  ||     | | || | ||   ||   ||    |
sh03709   2451 CCACTGAAGTGGAGCTCCCTAGGATTCTGCAGTTTGTCCACAAACTCTTA 2500

          2501 ----+----*----+----*----+----*----+----*----+----* 2550
msh04332   514 CATCAACTTCGACTTCAAGG....GGGATCCTGTTGGTG....GACACAT 555
               | ||  |    | |   |||    |||| || | | |||     ||| | 
sh03709   2501 CCTCCTCACTCAGTGGGAGGTTTTGGGACCCGGGTAGTGTCTCCACATAC 2550

          2551 ----+----*----+----*----+----*----+----*----+----* 2600
msh04332   556 CCACAGCTACCTGCTG.................................. 571
                |  |||||||  |||                                  
sh03709   2551 TCTGAGCTACCCACTGAGGTGGGATCCCCCTGGGTCGAGGGCAATGTCTC 2600

          2601 ----+----*----+----*----+----*----+----*----+----* 2650
msh04332   572 GAGAAGTCTAGGGTTCTCAAGCAACATGTAGGCGAGAGGAACTTCCATGC 621
                | | |    | | || | | ||| |||  | |  |   ||   |  || 
sh03709   2601 CACAGGGACTGAGCTCCCTACCAAGATGAGGTCTGGCACAAGAACAGTGA 2650

          2651 ----+----*----+----*----+----*----+----*----+----* 2700
msh04332   622 CT................................................ 623
               ||                                                
sh03709   2651 CTTCATGGTGGAGTCATAGCTCCCACTGAAGTGGAGCTCCCTGACCCACT 2700

          2701 ----+----*----+----*----+----*----+----*----+----* 2750
msh04332   624 .................................................. 624
                                                                 
sh03709   2701 GAACAGCTGAATGGGGAGCTTGCTGGCACACAGAAACTGTTCACACTGTC 2750

          2751 ----+----*----+----*----+----*----+----*----+----* 2800
msh04332   624 .................................................. 624
                                                                 
sh03709   2751 TGACCTTCCCCCACGTTTGTTCGTTCAATACTGGAAGGAAGCGCAAGTCT 2800

          2801 ----+----*----+----*----+----*----+----*----+----* 2850
msh04332   624 .................................................. 624
                                                                 
sh03709   2801 GTTTGTTCTCCTGCCTGTGTGTGTGTACAGGGGGACATCCTGGGAGCTTA 2850

          2851 ----+----*----+----*----+----*----+----*----+----* 2900
                                            L  L  R  G  S  E  D  
msh04332   624 .....................TCTACCAGTTGCTTCGAGGCAGTGAGGAC 652
                                    ||  | |||||||  ||||||||||||||
sh03709   2851 TTTGTCTAAAGGATGCCTGTGTCCTCTAGTTGCTGAGAGGCAGTGAGGAC 2900
                                            L  L  R  G  S  E  D  

          2901 ----+----*----+----*----+----*----+----*----+----* 2950
               Q  E  L  Q  G  L  H  L  E  R  N  P  A  V  Y  N  F 
msh04332   653 CAAGAGCTGCAAGGACTGCATCTGGAAAGAAATCCTGCTGTGTATAATTT 702
                |  ||||||| | ||||||  |||| ||||| |||||||| || |||||
sh03709   2901 AAGCAGCTGCATGAACTGCACTTGGAGAGAAACCCTGCTGTATACAATTT 2950
               K  Q  L  H  E  L  H  L  E  R  N  P  A  V  Y  N  F 

          2951 ----+----*----+----*----+----*----+----*----+----* 3000
                T  R  Q  G  A  G  L  N  M  G  V  H  N  A  L  D  S
msh04332   703 CACGCGTCAGGGAGCTGGGCTCAACATGGGTGTGCACAATGCCTTGGACA 752
               ||| |  |||||||| || |||||||||  |||||||| |||||||||||
sh03709   2951 CACACACCAGGGAGCAGGACTCAACATGACTGTGCACAGTGCCTTGGACA 3000
                T  H  Q  G  A  G  L  N  M  T  V  H  S  A  L  D  S

          3001 ----+----*----+----*----+----*----+----*----+----* 3050
                 D  E  K  S  H  Q  G  V  M  E  A  M  R  I  I  G  
msh04332   753 GTGATGAGAAGAGCCACCAAGGAGTGATGGAGGCCATGAGGATCATCGGC 802
               |||||||| |||||||||| | |||||  |||||||||||| ||||||||
sh03709   3001 GTGATGAGCAGAGCCACCAGGCAGTGACCGAGGCCATGAGGGTCATCGGC 3050
                 D  E  Q  S  H  Q  A  V  T  E  A  M  R  V  I  G  

          3051 ----+----*----+----*----+----*----+----*----+----* 3100
               F  S  P  D  E  V  E  S  I  H  R  I  L  A  A  I  L 
msh04332   803 TTCAGTCCTGACGAGGTGGAGTCCATCCATCGCATCCTTGCCGCCATATT 852
               ||||||||||| |||||||||||  | ||||||||||| || ||||||||
sh03709   3051 TTCAGTCCTGAAGAGGTGGAGTCTGTGCATCGCATCCTGGCTGCCATATT 3100
               F  S  P  E  E  V  E  S  V  H  R  I  L  A  A  I  L 

          3101 ----+----*----+----*----+----*----+----*----+----* 3150
                H  L  G  N  I  E  F  V  E  T  E  E  N  G  P  Q  K
msh04332   853 ACACCTGGGAAACATCGAGTTTGTGGAGACAGAGGAAAATGGACCACAGA 902
                ||||||||||||||||||||||||||||| |||||   ||| |  ||||
sh03709   3101 GCACCTGGGAAACATCGAGTTTGTGGAGACGGAGGAGGGTGGGCTGCAGA 3150
                H  L  G  N  I  E  F  V  E  T  E  E  G  G  L  Q  K

          3151 ----+----*----+----*----+----*----+----*----+----* 3200
                 G  G  L  E  V  A  D  E  A  L  V  G  Y  V  A  K  
msh04332   903 AAGGAGGCCTGGAAGTGGCTGATGAGGCCCTGGTAGGATATGTGGCCAAG 952
               | |  ||||||| |||||| || ||||| ||||| |   |||||||  ||
sh03709   3151 AGGAGGGCCTGGCAGTGGCCGAGGAGGCACTGGTGGACCATGTGGCTGAG 3200
                 E  G  L  A  V  A  E  E  A  L  V  D  H  V  A  E  

          3201 ----+----*----+----*----+----*----+----*----+----* 3250
               L  T  A  T  P  R  D  L  V  L  R  T  L  L  A  R  T 
msh04332   953 CTGACAGCCACTCCCAGAGACCTTGTTCTGCGAACCCTGCTGGCTCGAAC 1002
               ||||| ||||| ||| | ||||| || || ||  ||||||||||||| ||
sh03709   3201 CTGACGGCCACACCCCGGGACCTCGTGCTCCGCTCCCTGCTGGCTCGCAC 3250
               L  T  A  T  P  R  D  L  V  L  R  S  L  L  A  R  T 

          3251 ----+----*----+----*----+----*----+----*----+----* 3300
                V  A  S  G  G  R  E  V  I  E  K  S  H  T  V  A  E
msh04332  1003 AGTGGCTTCAGGAGGCCGAGAAGTCATTGAGAAGAGCCACACCGTGGCTG 1052
               ||| || || |||||| | ||| |||| |||||| ||||||| |  ||||
sh03709   3251 AGTTGCCTCGGGAGGCAGGGAACTCATAGAGAAGGGCCACACTGCAGCTG 3300
                V  A  S  G  G  R  E  L  I  E  K  G  H  T  A  A  E

          3301 ----+----*----+----*----+----*----+----*----+----* 3350
                 A  S  Y  A  R  D  A  C  A  K  A  M  Y  Q  R  L  
msh04332  1053 AGGCCAGCTATGCCCGGGATGCCTGTGCAAAGGCGATGTACCAGCGACTG 1102
               |||||||||||||||||||||||||||| |||||  |||||||||| |||
sh03709   3301 AGGCCAGCTATGCCCGGGATGCCTGTGCCAAGGCAGTGTACCAGCGGCTG 3350
                 A  S  Y  A  R  D  A  C  A  K  A  V  Y  Q  R  L  

          3351 ----+----*----+----*----+----*----+----*----+----* 3400
               F  E  W  V  V  N  K  I  N  S  I  M  E  P  R  N  R 
msh04332  1103 TTTGAGTGGGTTGTGAACAAGATCAATAGCATCATGGAACCCCGAAACCG 1152
               ||||||||||| ||||||| |||||| ||  |||||||||||||   |||
sh03709   3351 TTTGAGTGGGTGGTGAACAGGATCAACAGTGTCATGGAACCCCGGGGCCG 3400
               F  E  W  V  V  N  R  I  N  S  V  M  E  P  R  G  R 

          3401 ----+----*----+----*----+----*----+----*----+----* 3450
                D  P  R  C  D  G  K  D  T  V  I  G  V  L  D  I  Y
msh04332  1153 AGACCCTCGGTGTGATGGCAAGGATACTGTCATTGGGGTGCTGGACATTT 1202
                || |||||| ||||||||||||| || |||||||| ||||||||||| |
sh03709   3401 GGATCCTCGGCGTGATGGCAAGGACACAGTCATTGGCGTGCTGGACATCT 3450
                D  P  R  R  D  G  K  D  T  V  I  G  V  L  D  I  Y

          3451 ----+----*----+----*----+----*----+----*----+----* 3500
                 G  F  E  V  F  P  V  N  S  F  E  Q  F  C  I  N  
msh04332  1203 ACGGCTTTGAGGTGTTCCCTGTCAACAGCTTTGAGCAGTTCTGCATCAAC 1252
               | ||||| |||||||| || |||||||| || ||||||||||||||||||
sh03709   3451 ATGGCTTCGAGGTGTTTCCCGTCAACAGTTTCGAGCAGTTCTGCATCAAC 3500
                 G  F  E  V  F  P  V  N  S  F  E  Q  F  C  I  N  

          3501 ----+----*----+----*----+----*----+----*----+----* 3550
               Y  C  N  E  K  L  Q  Q  L  F  I  Q  L  I  L  K  Q 
msh04332  1253 TACTGCAACGAGAAGCTGCAGCAGCTCTTTATCCAGCTTATCCTGAAGCA 1302
               |||||||||||||||||||||||||| || |||||||| |||||||||||
sh03709   3501 TACTGCAACGAGAAGCTGCAGCAGCTATTCATCCAGCTCATCCTGAAGCA 3550
               Y  C  N  E  K  L  Q  Q  L  F  I  Q  L  I  L  K  Q 

          3551 ----+----*----+----*----+----*----+----*----+----* 3600
                E  Q  E  E  Y  E  R  E  G  I  A  W  Q  T  I  E  Y
msh04332  1303 AGAGCAGGAGGAGTATGAGCGAGAGGGCATCGCCTGGCAGACCATCGAGT 1352
                || ||||| ||||| ||||| ||||||||| ||||||||| | | ||||
sh03709   3551 GGAACAGGAAGAGTACGAGCGCGAGGGCATCACCTGGCAGAGCGTTGAGT 3600
                E  Q  E  E  Y  E  R  E  G  I  T  W  Q  S  V  E  Y

          3601 ----+----*----+----*----+----*----+----*----+----* 3650
                 F  N  N  A  T  I  V  E  L  V  E  Q  P  R  R  G  
msh04332  1353 ACTTCAACAACGCTACCATTGTGGAACTTGTAGAGCAGCCCCGCAGAGGC 1402
               | ||||||||||| ||||||||||| || || |||| ||||| | | |||
sh03709   3601 ATTTCAACAACGCCACCATTGTGGATCTGGTGGAGCGGCCCCACCGTGGC 3650
                 F  N  N  A  T  I  V  D  L  V  E  R  P  H  R  G  

          3651 ----+----*----+----*----+----*----+----*----+----* 3700
               I  L  A  V  L  D  E  A  C  S  T  A  G  P  I  T  D 
msh04332  1403 ATCCTGGCTGTGTTAGATGAAGCCTGCAGCACGGCAGGCCCCATCACTGA 1452
               |||||||| ||| | || || ||||||||| | || ||| ||||||||||
sh03709   3651 ATCCTGGCCGTGCTGGACGAGGCCTGCAGCTCTGCTGGCACCATCACTGA 3700
               I  L  A  V  L  D  E  A  C  S  S  A  G  T  I  T  D 

          3701 ----+----*----+----*----+----*----+----*----+----* 3750
                R  I  F  L  Q  T  L  D  T  H  H  R  H  H  P  H  Y
msh04332  1453 CCGAATCTTCCTGCAGACCCTGGACACACACCACCGCCACCACCCACACT 1502
               ||||||||||||||||||||||||||| ||||||||||| |||| |||||
sh03709   3701 CCGAATCTTCCTGCAGACCCTGGACACGCACCACCGCCATCACCTACACT 3750
                R  I  F  L  Q  T  L  D  T  H  H  R  H  H  L  H  Y

          3751 ----+----*----+----*----+----*----+----*----+----* 3800
                 S  S  R  Q  L  C  P  T  D  K  T  M  E  F  G  R  
msh04332  1503 ATTCCAGCCGCCAGCTTTGCCCTACGGACAAGACCATGGAGTTTGGCCGA 1552
               |  ||||||||||||| ||||| || ||||||||||||||||||||||||
sh03709   3751 ACACCAGCCGCCAGCTCTGCCCCACAGACAAGACCATGGAGTTTGGCCGA 3800
                 T  S  R  Q  L  C  P  T  D  K  T  M  E  F  G  R  

          3801 ----+----*----+----*----+----*----+----*----+----* 3850
               D  F  Q  I  K  H  Y  A  G  D  V  T  Y  S  V  E  G 
msh04332  1553 GACTTCCAGATCAAACACTATGCAGGCGATGTCACGTACTCTGTGGAAGG 1602
               ||||||| |||||| ||||||||||| || ||||||||||| ||||||||
sh03709   3801 GACTTCCGGATCAAGCACTATGCAGGGGACGTCACGTACTCCGTGGAAGG 3850
               D  F  R  I  K  H  Y  A  G  D  V  T  Y  S  V  E  G 

          3851 ----+----*----+----*----+----*----+----*----+----* 3900
                F  I  D  K  N  R  D  S  L  F  Q  D  F  K  R  L  L
msh04332  1603 CTTCATTGACAAGAATAGAGACTCTCTCTTCCAGGACTTCAAACGGCTGC 1652
               |||||| |||||||| ||||| |  ||||||||||||||||| |||||||
sh03709   3851 CTTCATCGACAAGAACAGAGATTTCCTCTTCCAGGACTTCAAGCGGCTGC 3900
                F  I  D  K  N  R  D  F  L  F  Q  D  F  K  R  L  L

          3901 ----+----*----+----*----+----*----+----*----+----* 3950
                 Y  N  S  V  D  P  T  L  R  A  M  W  P  D  G  Q  
msh04332  1653 TGTACAATAGTGTGGATCCCACCTTGCGAGCCATGTGGCCTGACGGGCAA 1702
               ||||||| ||   ||| |||||  | || ||||||||||| |||||||| 
sh03709   3901 TGTACAACAGCACGGACCCCACTCTACGGGCCATGTGGCCGGACGGGCAG 3950
                 Y  N  S  T  D  P  T  L  R  A  M  W  P  D  G  Q  

          3951 ----+----*----+----*----+----*----+----*----+----* 4000
               Q  D  I  T  E  V  T  K  R  P  L  T  A  G  T  L  F 
msh04332  1703 CAGGACATCACGGAAGTGACCAAGCGTCCCCTGACAGCCGGCACACTCTT 1752
               ||||||||||| || ||||||||||| |||||||| || |||||||||||
sh03709   3951 CAGGACATCACAGAGGTGACCAAGCGCCCCCTGACGGCTGGCACACTCTT 4000
               Q  D  I  T  E  V  T  K  R  P  L  T  A  G  T  L  F 

          4001 ----+----*----+----*----+----*----+----*----+----* 4050
                K  N  S  M  V  A  L  V  E  N  L  A  S  K  E  P  F
msh04332  1753 TAAGAATTCCATGGTTGCCCTGGTGGAAAACTTGGCTTCCAAGGAACCCT 1802
                ||||| |||||||| ||||||||||| ||| | || |||||||| ||||
sh03709   4001 CAAGAACTCCATGGTGGCCCTGGTGGAGAACCTTGCCTCCAAGGAGCCCT 4050
                K  N  S  M  V  A  L  V  E  N  L  A  S  K  E  P  F

          4051 ----+----*----+----*----+----*----+----*----+----* 4100
                 Y  V  R  C  I  K  P  N  E  D  K  V  A  G  R  L  
msh04332  1803 TCTATGTCCGCTGCATCAAACCCAACGAAGACAAGGTGGCTGGGCGGCTC 1852
               |||| |||||||||||||| ||||| || |||||||| ||||||  ||| 
sh03709   4051 TCTACGTCCGCTGCATCAAGCCCAATGAGGACAAGGTAGCTGGGAAGCTG 4100
                 Y  V  R  C  I  K  P  N  E  D  K  V  A  G  K  L  

          4101 ----+----*----+----*----+----*----+----*----+----* 4150
               D  E  A  H  C  R  H  Q  V  E  Y  L  G  L  L  E  N 
msh04332  1853 GATGAAGCCCACTGTCGTCACCAGGTCGAATACCTGGGACTATTGGAGAA 1902
               |||||   ||||||||| |||||||||| ||||||||| ||  |||||||
sh03709   4101 GATGAGAACCACTGTCGCCACCAGGTCGCATACCTGGGGCTGCTGGAGAA 4150
               D  E  N  H  C  R  H  Q  V  A  Y  L  G  L  L  E  N 

          4151 ----+----*----+----*----+----*----+----*----+----* 4200
                V  R  V  R  R  A  G  F  A  S  R  Q  P  Y  P  R  F
msh04332  1903 TGTGAGGGTCCGCAGGGCTGGCTTTGCTTCCCGGCAGCCCTACCCTCGAT 1952
               |||||||||||||||||||||||| |||||||| ||||||||| ||||||
sh03709   4151 TGTGAGGGTCCGCAGGGCTGGCTTCGCTTCCCGCCAGCCCTACTCTCGAT 4200
                V  R  V  R  R  A  G  F  A  S  R  Q  P  Y  S  R  F

          4201 ----+----*----+----*----+----*----+----*----+----* 4250
                 L  L  R  Y  K  M  T  C  E  Y  T  W  P  N  H  L  
msh04332  1953 TCCTGCTCAGGTACAAGATGACCTGTGAGTACACGTGGCCCAACCACCTG 2002
               |||||||||||||||||||||||||||| ||||| |||||||||||||||
sh03709   4201 TCCTGCTCAGGTACAAGATGACCTGTGAATACACATGGCCCAACCACCTG 4250
                 L  L  R  Y  K  M  T  C  E  Y  T  W  P  N  H  L  

          4251 ----+----*----+----*----+----*----+----*----+----* 4300
               L  G  S  D  R  D  A  V  S  A  L  L  E  Q  H  G  L 
msh04332  2003 CTGGGCTCTGACCGGGATGCGGTGAGCGCCCTGCTGGAACAGCATGGGCT 2052
               |||||||| |||  ||  || |||||||| || ||||| |||||||||||
sh03709   4251 CTGGGCTCCGACAAGGCAGCCGTGAGCGCTCTCCTGGAGCAGCATGGGCT 4300
               L  G  S  D  K  A  A  V  S  A  L  L  E  Q  H  G  L 

          4301 ----+----*----+----*----+----*----+----*----+----* 4350
                Q  G  D  V  A  F  G  H  S  K  L  F  I  R  S  P  R
msh04332  2053 GCAAGGGGATGTGGCCTTTGGCCACAGCAAGCTGTTCATCCGATCCCCAA 2102
               ||| ||||| |||||||||||||||||||||||||||||||| || ||  
sh03709   4301 GCAGGGGGACGTGGCCTTTGGCCACAGCAAGCTGTTCATCCGCTCACCCC 4350
                Q  G  D  V  A  F  G  H  S  K  L  F  I  R  S  P  R

          4351 ----+----*----+----*----+----*----+----*----+----* 4400
                 T  L  V  T  L  E  Q  S  R  A  R  L  I  P  I  I  
msh04332  2103 GGACGCTGGTCACTCTGGAGCAGAGCCGAGCTCGCCTGATTCCCATCATT 2152
               |||| |||||||| ||||||||||||||||| ||||| || |||||||||
sh03709   4351 GGACACTGGTCACACTGGAGCAGAGCCGAGCCCGCCTCATCCCCATCATT 4400
                 T  L  V  T  L  E  Q  S  R  A  R  L  I  P  I  I  

          4401 ----+----*----+----*----+----*----+----*----+----* 4450
               V  L  L  L  Q  K  A  W  R  G  T  L  A  R  W  H  C 
msh04332  2153 GTGTTATTGCTGCAGAAGGCTTGGCGGGGCACCCTGGCTAGGTGGCACTG 2202
               ||| |  |  |||||||||| |||||||||||| |||| ||||||| |||
sh03709   4401 GTGCTGCTATTGCAGAAGGCATGGCGGGGCACCTTGGCGAGGTGGCGCTG 4450
               V  L  L  L  Q  K  A  W  R  G  T  L  A  R  W  R  C 

          4451 ----+----*----+----*----+----*----+----*----+----* 4500
                R  R  L  R  A  I  Y  T  I  M  R  W  F  R  R  H  K
msh04332  2203 CCGGCGACTAAGGGCCATCTACACCATCATGCGCTGGTTCCGGAGGCACA 2252
               |||| | || ||||| ||||||||||||||||||||||||||||| ||||
sh03709   4451 CCGGAGGCTGAGGGCTATCTACACCATCATGCGCTGGTTCCGGAGACACA 4500
                R  R  L  R  A  I  Y  T  I  M  R  W  F  R  R  H  K

          4501 ----+----*----+----*----+----*----+----*----+----* 4550
                 V  R  A  H  L  I  E  L  Q  R  R  F  Q  A  A  R  
msh04332  2253 AGGTGCGTGCTCACCTGATTGAACTACAGCGCCGGTTCCAGGCTGCACGG 2302
               ||||||| |||||||||  ||| || ||||| || |||||||||||| ||
sh03709   4501 AGGTGCGGGCTCACCTGGCTGAGCTGCAGCGGCGATTCCAGGCTGCAAGG 4550
                 V  R  A  H  L  A  E  L  Q  R  R  F  Q  A  A  R  

          4551 ----+----*----+----*----+----*----+----*----+----* 4600
               Q  P  P  L  Y  G  R  D  L  V  W  P  T  P  P  A  V 
msh04332  2303 CAGCCCCCACTCTATGGCCGTGACCTTGTGTGGCCCACACCTCCTGCTGT 2352
               ||||| |||||||| || |||||||||||||||||    || ||||||||
sh03709   4551 CAGCCGCCACTCTACGGGCGTGACCTTGTGTGGCCGCTGCCCCCTGCTGT 4600
               Q  P  P  L  Y  G  R  D  L  V  W  P  L  P  P  A  V 

          4601 ----+----*----+----*----+----*----+----*----+----* 4650
                L  Q  P  F  Q  D  T  C  R  V  L  F  S  R  W  R  A
msh04332  2353 GCTGCAGCCCTTCCAGGACACTTGCCGTGTTCTCTTCAGCAGGTGGCGGG 2402
               ||||||||||||||||||||| ||||  |  |||||| ||||||||||||
sh03709   4601 GCTGCAGCCCTTCCAGGACACCTGCCACGCACTCTTCTGCAGGTGGCGGG 4650
                L  Q  P  F  Q  D  T  C  H  A  L  F  C  R  W  R  A

          4651 ----+----*----+----*----+----*----+----*----+----* 4700
                 R  Q  L  V  K  N  I  P  P  S  D  M  T  Q  I  K  
msh04332  2403 CACGGCAGTTAGTGAAGAACATCCCTCCTTCAGACATGACCCAGATCAAG 2452
               | |||||| | |||||||||||||| |||||||||||| |||||||||||
sh03709   4651 CCCGGCAGCTGGTGAAGAACATCCCCCCTTCAGACATGCCCCAGATCAAG 4700
                 R  Q  L  V  K  N  I  P  P  S  D  M  P  Q  I  K  

          4701 ----+----*----+----*----+----*----+----*----+----* 4750
               A  K  V  A  A  M  G  A  L  Q  G  L  R  Q  D  W  G 
msh04332  2453 GCCAAGGTGGCTGCTATGGGGGCCTTGCAAGGATTGCGGCAGGACTGGGG 2502
               ||||||||||| || ||||||||| |||||||  | || |||||||||||
sh03709   4701 GCCAAGGTGGCCGCCATGGGGGCCCTGCAAGGGCTTCGTCAGGACTGGGG 4750
               A  K  V  A  A  M  G  A  L  Q  G  L  R  Q  D  W  G 

          4751 ----+----*----+----*----+----*----+----*----+----* 4800
                C  Q  R  A  W  A  R  D  Y  L  S  S  D  T  D  N  P
msh04332  2503 TTGCCAGCGGGCCTGGGCCCGAGACTACCTGTCCTCTGACACTGACAACC 2552
                ||||  ||||||||||||||||||||||||||||||| ||||||||| |
sh03709   4751 CTGCCGACGGGCCTGGGCCCGAGACTACCTGTCCTCTGCCACTGACAATC 4800
                C  R  R  A  W  A  R  D  Y  L  S  S  A  T  D  N  P

          4801 ----+----*----+----*----+----*----+----*----+----* 4850
                 T  A  S  H  L  F  A  E  Q  L  K  A  L  R  E  K  
msh04332  2553 CCACAGCTTCCCATCTGTTTGCTGAGCAACTAAAGGCACTTCGGGAGAAA 2602
               ||||||| ||    ||||||||| ||| ||||||| |||||||||| |||
sh03709   4801 CCACAGCATCAAGCCTGTTTGCTCAGCGACTAAAGACACTTCGGGACAAA 4850
                 T  A  S  S  L  F  A  Q  R  L  K  T  L  R  D  K  

          4851 ----+----*----+----*----+----*----+----*----+----* 4900
               D  G  F  G  S  V  L  F  S  S  H  V  R  K  V  N  R 
msh04332  2603 GATGGCTTTGGCTCTGTGCTTTTCTCCAGCCATGTGCGCAAGGTGAATCG 2652
               |||||||| ||  ||||||| || || |||||||| ||||||||||| ||
sh03709   4851 GATGGCTTCGGGGCTGTGCTCTTTTCAAGCCATGTCCGCAAGGTGAACCG 4900
               D  G  F  G  A  V  L  F  S  S  H  V  R  K  V  N  R 

          4901 ----+----*----+----*----+----*----+----*----+----* 4950
                F  R  K  S  R  D  R  A  L  L  L  T  D  R  Y  L  Y
msh04332  2653 CTTCCGCAAGAGCCGGGACCGGGCCCTTCTGCTCACAGATCGGTATCTGT 2702
               ||||| ||||| |||| |||||||||| ||||||||||| | | | || |
sh03709   4901 CTTCCACAAGATCCGGAACCGGGCCCTCCTGCTCACAGACCAGCACCTCT 4950
                F  H  K  I  R  N  R  A  L  L  L  T  D  Q  H  L  Y

          4951 ----+----*----+----*----+----*----+----*----+----* 5000
                 K  L  E  P  G  R  Q  Y  R  V  M  R  A  V  P  L  
msh04332  2703 ACAAGCTGGAGCCTGGACGACAGTACCGGGTGATGCGGGCTGTGCCTCTG 2752
               |||||||||| ||||  || |||||||||||||||||||| ||||| || 
sh03709   4951 ACAAGCTGGACCCTGACCGGCAGTACCGGGTGATGCGGGCCGTGCCCCTT 5000
                 K  L  D  P  D  R  Q  Y  R  V  M  R  A  V  P  L  

          5001 ----+----*----+----*----+----*----+----*----+----* 5050
               E  A  V  T  G  L  S  V  T  S  G  R  D  Q  L  V  V 
msh04332  2753 GAGGCGGTGACAGGGCTGAGTGTGACCAGTGGAAGAGATCAGCTGGTGGT 2802
               ||||||||||| |||||||| |||||||| ||| |||| |||||||||||
sh03709   5001 GAGGCGGTGACGGGGCTGAGCGTGACCAGCGGAGGAGACCAGCTGGTGGT 5050
               E  A  V  T  G  L  S  V  T  S  G  G  D  Q  L  V  V 

          5051 ----+----*----+----*----+----*----+----*----+----* 5100
                L  H  A  Q  G  Y  D  D  L  V  V  C  L  H  R  S  Q
msh04332  2803 GTTACATGCCCAAGGCTATGATGATCTTGTAGTGTGTCTACACCGTTCCC 2852
               | | || ||||  ||| | || || || || ||||| || ||||| ||||
sh03709   5051 GCTGCACGCCCGCGGCCAGGACGACCTCGTGGTGTGCCTGCACCGCTCCC 5100
                L  H  A  R  G  Q  D  D  L  V  V  C  L  H  R  S  R

          5101 ----+----*----+----*----+----*----+----*----+----* 5150
                 P  P  L  D  N  R  I  G  E  L  V  G  M  L  A  A  
msh04332  2853 AACCACCACTGGACAATCGAATTGGGGAGCTGGTGGGCATGCTGGCTGCA 2902
                 || ||| ||||||| ||  ||||||||||||||||| ||||||| |||
sh03709   5101 GGCCGCCATTGGACAACCGCGTTGGGGAGCTGGTGGGCGTGCTGGCCGCA 5150
                 P  P  L  D  N  R  V  G  E  L  V  G  V  L  A  A  

          5151 ----+----*----+----*----+----*----+----*----+----* 5200
               H  C  Q  G  E  G  R  T  L  E  V  R  V  S  D  C  I 
msh04332  2903 CACTGCCAGGGAGAGGGACGAACTCTGGAGGTCCGTGTCTCTGACTGCAT 2952
               ||||||||||| ||||| || || |||||||| || ||||| ||||||||
sh03709   5151 CACTGCCAGGGGGAGGGCCGCACCCTGGAGGTTCGCGTCTCCGACTGCAT 5200
               H  C  Q  G  E  G  R  T  L  E  V  R  V  S  D  C  I 

          5201 ----+----*----+----*----+----*----+----*----+----* 5250
                P  L  S  Q  R  G  A  R  R  L  I  S  V  E  P  R  P
msh04332  2953 CCCACTGAGCCAGCGTGGTGCCCGGCGCCTCATTTCTGTGGAGCCCAGGC 3002
               |||||| ||||| || || | |||||||||||| || |||||||||||||
sh03709   5201 CCCACTAAGCCATCGCGGGGTCCGGCGCCTCATCTCCGTGGAGCCCAGGC 5250
                P  L  S  H  R  G  V  R  R  L  I  S  V  E  P  R  P

          5251 ----+----*----+----*----+----*----+----*----+----* 5300
                 E  Q  P  E  P  D  F  Q  S  S  R  S  T  F  T  L  
msh04332  3003 CAGAGCAGCCTGAGCCAGATTTCCAAAGCAGCCGTAGCACCTTTACCCTC 3052
               | |||||||| ||||| |||||||   ||   ||  || |||| ||||| 
sh03709   5251 CGGAGCAGCCAGAGCCCGATTTCCGCTGCGCTCGCGGCTCCTTCACCCTG 5300
                 E  Q  P  E  P  D  F  R  C  A  R  G  S  F  T  L  

          5301 ----+----*----+----*----+----*----+----*----+----* 5350
               L  W  P  S  H  *                                  
msh04332  3053 CTCTGGCCAAGCCACTGAGCAAGGCCAAACCAGTTTAACCTGGTTTCCAC 3102
               |||||||| |||| ||||||  | ||  ||| |    |||  |       
sh03709   5301 CTCTGGCCCAGCCGCTGAGC..GCCCGCACCCGCCGCACCCCG....... 5341
               L  W  P  S  R  *                                  

          5351 ----+----*----+----*----+----*----+----*----+----* 5400
msh04332  3103 GCTGTTCTAAATACCTGGTCTTGACTGCACTAAAGGCCAGGACTTGTCTA 3152
                                                |||||   | |||||  
sh03709   5342 .................................AGGCCGCCAATTGTCCG 5358

          5401 ----+----*----+----*----+----*----+----*----+ 5445
msh04332  3153 CCCTTACATGGAACATTGCAAAATAAAGATGTTATATTTGTTTTC 3197
               |||   |    | |  ||||||  ||   | | |           
sh03709   5359 CCCCGCC....AGCGCTGCAAATAAACCTTCTGAGTC........ 5391