GENSCAN 1.0 Date run: 4-Nov-116 Time: 04:44:15 Sequence gi568815597f:43172723_43373379 : 200657 bp : 47.64% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 32 188 157 1 1 56 98 192 0.988 17.07 1.02 Term + 490 602 113 2 2 70 41 70 0.903 -0.78 1.03 PlyA + 1547 1552 6 1.05 2.00 Prom + 2600 2639 40 -3.06 2.01 Init + 6215 6347 133 1 1 78 66 79 0.815 5.00 2.02 Intr + 8812 9128 317 2 2 136 82 264 0.995 26.38 2.03 Intr + 10154 10309 156 1 0 59 54 108 0.644 4.71 2.04 Intr + 10869 11155 287 2 2 99 80 162 0.997 12.64 2.05 Intr + 12427 12634 208 1 1 112 86 123 0.998 13.58 2.06 Intr + 13985 14137 153 1 0 60 113 176 0.980 17.57 2.07 Intr + 24831 24970 140 2 2 71 78 86 0.864 5.16 2.08 Intr + 25759 25924 166 1 1 54 83 60 0.989 2.06 2.09 Intr + 26668 26781 114 0 0 108 115 61 0.987 11.44 2.10 Intr + 33998 34210 213 1 0 101 97 191 0.914 20.21 2.11 Intr + 37021 37194 174 0 0 65 105 317 0.993 31.24 2.12 Intr + 42533 42694 162 1 0 101 116 225 0.993 26.77 2.13 Intr + 44989 45237 249 0 0 61 73 95 0.581 2.93 2.14 Intr + 46660 46815 156 0 0 90 62 151 0.990 12.91 2.15 Intr + 48650 48743 94 1 1 53 80 79 0.974 3.14 2.16 Intr + 49383 49573 191 1 2 116 63 325 0.993 32.10 2.17 Intr + 50102 50275 174 1 0 31 64 383 0.967 30.34 2.18 Intr + 51324 51482 159 2 0 83 56 234 0.995 19.88 2.19 Intr + 54261 54404 144 2 0 110 94 163 0.977 19.58 2.20 Intr + 55500 55523 24 2 0 94 95 13 0.630 0.82 2.21 Intr + 59786 59902 117 1 0 117 70 109 0.988 12.66 2.22 Intr + 61557 61691 135 2 0 28 103 218 0.999 18.06 2.23 Intr + 61773 61916 144 2 0 95 82 324 0.999 32.98 2.24 Intr + 68743 68940 198 0 0 121 49 28 0.718 1.75 2.25 Intr + 70505 70637 133 1 1 121 90 160 0.992 19.72 2.26 Intr + 84809 84968 160 0 1 84 67 64 0.150 2.95 2.27 Term + 94892 95099 208 2 1 100 39 140 0.019 7.11 2.28 PlyA + 95295 95300 6 1.05 3.00 Prom + 95858 95897 40 -8.36 3.01 Sngl + 100001 100660 660 1 0 63 49 855 0.926 75.28 3.02 PlyA + 101247 101252 6 1.05 4.07 PlyA - 102650 102645 6 1.05 4.06 Term - 103260 103198 63 0 0 99 40 63 0.503 0.49 4.05 Intr - 104331 104174 158 1 2 87 83 70 0.670 6.13 4.04 Intr - 110385 110089 297 1 0 120 31 224 0.532 16.75 4.03 Intr - 112885 112722 164 0 2 64 61 61 0.292 0.62 4.02 Intr - 113219 113011 209 2 2 72 60 135 0.593 6.98 4.01 Init - 114575 114573 3 2 0 77 101 0 0.593 0.20 4.00 Prom - 124402 124363 40 -5.26 5.00 Prom + 125313 125352 40 -6.16 5.01 Init + 128350 128407 58 0 1 42 106 69 0.868 3.59 5.02 Intr + 132129 132443 315 1 0 42 105 491 0.997 42.24 5.03 Intr + 132511 132621 111 2 0 85 55 93 0.953 6.05 5.04 Intr + 134118 134273 156 1 0 91 24 271 0.458 20.98 5.05 Intr + 134420 134551 132 0 0 31 86 66 0.543 1.32 5.06 Intr + 134710 134850 141 2 0 68 91 26 0.410 1.22 5.07 Intr + 135074 135202 129 0 0 95 75 59 0.840 5.97 5.08 Intr + 136264 136409 146 2 2 78 53 188 0.911 14.30 5.09 Intr + 136666 136810 145 0 1 108 86 198 0.997 21.46 5.10 Intr + 138949 139107 159 2 0 27 92 197 0.997 13.96 5.11 Intr + 139272 139409 138 1 0 39 85 73 0.852 2.54 5.12 Intr + 139583 139879 297 0 0 61 68 179 0.998 10.05 5.13 Intr + 140413 140703 291 2 0 100 93 404 0.983 39.21 5.14 Intr + 141056 141246 191 0 2 88 69 251 0.978 22.50 5.15 Intr + 144477 144687 211 2 1 51 31 395 0.998 28.59 5.16 Intr + 144842 144952 111 0 0 36 83 110 0.884 5.65 5.17 Intr + 145160 145350 191 0 2 128 89 288 0.855 32.20 5.18 Intr + 146513 146626 114 1 0 75 101 155 0.999 16.14 5.19 Intr + 146737 146807 71 0 2 96 56 110 0.790 6.58 5.20 Intr + 148547 148587 41 2 2 70 97 -29 0.577 -5.93 5.21 Intr + 148674 148770 97 1 1 87 65 214 0.653 18.07 5.22 Intr + 148894 148993 100 1 1 121 100 129 0.999 17.51 5.23 Term + 149929 150000 72 0 0 93 49 146 0.999 9.11 5.24 PlyA + 150364 150369 6 1.05 6.00 Prom + 155215 155254 40 -5.56 6.01 Init + 165127 165205 79 0 1 77 100 133 0.995 12.63 6.02 Intr + 165377 165509 133 0 1 18 78 164 0.741 8.10 6.03 Intr + 165796 165998 203 1 2 16 82 168 0.890 7.93 6.04 Intr + 166549 166839 291 2 0 111 19 154 0.579 7.91 6.05 Intr + 167285 167404 120 0 0 17 109 128 0.580 8.27 6.06 Intr + 167665 167791 127 2 1 66 94 31 0.988 1.24 6.07 Intr + 173723 173907 185 2 2 65 99 180 0.973 16.23 6.08 Intr + 173941 174212 272 2 2 62 86 87 0.742 3.16 6.09 Intr + 176121 176280 160 2 1 75 93 138 0.719 12.56 6.10 Intr + 176541 176637 97 1 1 85 86 70 0.688 5.57 6.11 Intr + 179494 179581 88 1 1 129 117 -2 0.989 6.77 6.12 Intr + 186455 186674 220 1 1 38 65 229 0.075 13.47 6.13 Intr + 186768 186916 149 1 2 110 95 202 0.999 23.05 6.14 Intr + 187003 187099 97 0 1 51 90 88 0.725 4.88 6.15 Intr + 187247 187375 129 0 0 72 87 171 0.999 16.07 6.16 Intr + 187471 187667 197 2 2 72 94 155 0.963 13.63 6.17 Intr + 187777 187871 95 0 2 68 82 106 0.995 6.76 6.18 Intr + 188011 188239 229 1 1 105 113 153 0.991 17.37 6.19 Intr + 188398 188523 126 0 0 100 89 73 0.990 9.48 6.20 Intr + 189473 189590 118 1 1 98 115 100 0.997 13.74 6.21 Term + 190229 190407 179 0 2 77 49 150 0.998 7.85 6.22 PlyA + 190445 190450 6 1.05 7.08 PlyA - 190474 190469 6 -0.45 7.07 Term - 191415 191194 222 0 0 116 54 262 0.932 22.52 7.06 Intr - 191738 191602 137 0 2 122 81 61 0.993 9.09 7.05 Intr - 191925 191820 106 0 1 109 100 -48 0.910 -1.71 7.04 Intr - 192072 192016 57 0 0 53 79 88 0.844 3.48 7.03 Intr - 192286 192206 81 1 0 21 110 96 0.967 4.93 7.02 Intr - 192654 192464 191 1 2 85 64 189 0.998 15.50 7.01 Init - 192887 192842 46 2 1 99 110 48 0.908 8.95 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 186494 186674 181 1 1 64 65 231 0.872 17.82 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:43172723_43373379|GENSCAN_predicted_peptide_1|89_aa MSAVVAQTLHVFGLRSHVANNIFYFDEQIIIFPSGNHCVKYNVDQKWQKFIPEPLFELSN SFPEHTTFIISPFESKYDEDTVNFIPSRK >gi568815597f:43172723_43373379|GENSCAN_predicted_CDS_1|270_bp atgtcagccgtggtagctcagacgctgcatgtttttggtcttcgatcccacgtggccaac aatatcttctacttcgatgaacagatcattatatttccttcaggaaatcactgtgtgaag tacaatgtggatcagaaatggcaaaaattcattccagagccactttttgagttgtctaac agttttcctgagcacaccaccttcattatatcaccgtttgaatccaagtatgatgaagac actgtaaactttatcccaagtcgcaaataa >gi568815597f:43172723_43373379|GENSCAN_predicted_peptide_2|1502_aa MEYYAAIKKDEFMPFIGIQMKLETIILSKLSQGQKTKHRMFSLIGSEKSQGMLALSISPN RRYLAISETVQEKPAITIYELSSIPCRKRKVLNNFDFQVQKFISMAFSPDSKYLLAQTSP PESNLVYWLWEKQKVMAIVRIDTQNNPVYQVVPNLLWFNLKYSSLQWYKSDMHSGERDQV IRFHDAGQQLQPQPPVSRSITRVSFSPQDNTQVCVTGNGMFKLLRFAEGTLKQTSFQRGE PQNYLAHTWVADDKIVVGTDTGKLFLFESGDQRWETSIMVKEPTNGSKSLDVIQESESLI EFPPVSSPLPSYEQMVAASSHSQMSMPQVFAIAAYSKGFACSAGPGRVLLFEKMEEKDFY RESREIRIPVDPQSNDPSQSDKQDVLCLCFSPSEETLVASTSKNQLYSITMSLTEISKGE PAHFEYLMYPLHSAPITGLATCIRKPLIATCSLDRSIRLWNYETNTLELFKEYQEEAYSI SLHPSGHFIVVGFADKLRLMNLLIDDIRSFKEYSVRGCGECSFSNGGHLFAAVNGNVIHV YTTTSLENISSLKGHTGKIRSIVWNADDSKLISGGTDGAVYEWNLSTGKRETECVLKSCS YNCVTVSPDAKIIFAVGSDHTLKEIADSLILREISAFDVTYTAIVISHSGRMMFVGTSVG TIRAMKYPLPLQKEFNEYQAHAGPITKMLLTFDDQFLLTAAEDGCLFTWKVFDKDGRGIK REREVGFAEEVLVTKTDMEEKLPPTHPSNSSSCLWSFLNLVRAAYRHWRDLYNTKVSLIG FDEDLWKPSKIKSKLPSMVYRAPDGLTQSWLPLPIHPTCAISHAAQVMLELKTRVEELKM ENEYQLRLKDMNYSEKIKELTDKFIQEMESLKTKNQVLRTEKEKQDVYHHEHIEDLLDKQ SRELQDMECCNNQKLLLEYEKYQELQLKSQRMQEEYEKQLRDNDETKSQALEELTEFYEA KLQEKTTLLEEAQEDVRQQLREFEETKKQIEEDEDREIQDIKTKYEKKLRDEKESNLRLK GETGIMRKKFSSLQKEIEERTNDIETLKGEQMKLQGVIKSLEKDIQGLKREIQERDETIQ DKEKRIYDLKKKNQELGKFKFVLDYKIKELKKQIEPRENEIRVMKEQIQEPTDFLTVPME AELENFHKQNTQLELNITELWQKLRATDQEMRRERQKERDLEALVKRFKTDLHNCVAYIQ EPRLLKEKVRGLFEKYVQRADMVEIAGLNTDLQQEYTRQREHLERNLATLKKKVVKEGEL HRTDYVRIMQVPPHSRGPFVTAQGEQCLRPSCSLGSDSVATTREIMSQVEAQFVPGTGPF YVDLACNCLFWYTDPQENVSLIKEINELRRELKFTRSQVYDLEAALKLTKKVRPQEVSET EFWHQLWPSSVGYLAEAPLVSCYLNTEKPALQKVLASLSCFLYKIWPAIAGPRSTPIHHL LVDLRVVRPHLPLCAHGHLLLLTFTVTNLNYPNNLCFGDRVQQPCEEPAFPLDSKLLDKT QN >gi568815597f:43172723_43373379|GENSCAN_predicted_CDS_2|4509_bp atggaatactatgcagccataaaaaaggatgagttcatgccctttatagggatacagatg aagctggaaaccatcattctgagcaaactgtcgcaaggacaaaaaaccaaacaccgcatg ttctcactcataggctcagagaagagtcagggcatgttggccttgtccatcagtcccaat cggcggtacctcgctatctctgagactgtgcaagaaaaacctgccatcaccatttatgaa ttgtcatccatcccttgccggaagcgcaaagttcttaataattttgacttccaagttcag aaatttattagcatggctttttctccagactccaaatacctattggctcagacgtcacct ccagagtcaaatcttgtctactggctgtgggaaaaacagaaagtaatggccattgttaga atcgacactcagaacaaccctgtctaccaggtagtccccaacttactatggttcaactta aaatattcgtctttacaatggtacaagagtgatatgcactcaggagaaagagaccaagta atacgctttcatgatgcagggcagcagctgcagccacagcccccggtcagccgctcaatc actagggtgagcttcagtccacaggataacactcaggtgtgtgtcactggaaatgggatg tttaagcttctccgttttgctgagggaaccctgaagcaaaccagctttcagaggggagaa ccccaaaactatctagctcacacctgggtggctgatgacaagattgtcgttggcactgac acaggcaaactcttcctctttgaatctggagatcagcgttgggagaccagcataatggtc aaggaacctaccaatggctcaaagagcctggatgtcattcaggaatcagagagcctgatt gaatttccaccagtcagttctccactcccttcctatgaacagatggtggcggccagtagc catagccagatgtccatgccccaggtgtttgccattgcagcctattcaaagggatttgcc tgttctgctgggccagggagagttctgctgtttgagaagatggaagaaaaggatttttac cgtgagagcagagaaatcaggattcctgtggacccgcagagcaatgatccaagtcagtct gacaaacaggacgttctctgcctgtgcttcagcccctcagaggaaactctggttgccagc accagtaagaaccaactctacagcatcaccatgtccctgacagagatcagcaagggggag cctgctcactttgagtatttgatgtatccattgcactcagcacccatcaccggtctagct acctgcatccgcaaaccccttatagccacctgttctctggatcgatccatccgcctttgg aattatgaaacaaacaccctggaactatttaaggaataccaagaagaggcatattccatc agccttcatccatctggacacttcattgtagtagggtttgctgacaaactacgcctcatg aatctactcattgatgatatacgttctttcaaagaatactctgttagaggatgcggagag tgttcctttagcaatggaggtcacctgtttgctgcagtcaatggaaatgtgattcacgtt tacaccaccacgagcctagagaacatctcaagcctgaaaggacacacagggaagattcgc tcaattgtgtggaatgcagatgatagcaaactgatttctggtggcacagatggtgctgtg tatgaatggaatctgtccacaggaaagagagagacagaatgcgtgctcaagtcttgcagc tacaactgtgttactgtctcccccgatgccaaaattatctttgctgttggatcagaccac accctcaaggagattgcagattccttgatccttcgagagatatcggcgtttgatgtcacc tacaccgccattgtcatctcgcattctggacgcatgatgtttgtgggcacctcggtggga accattcgtgccatgaagtaccctctgcctctgcagaaggaattcaatgagtaccaggcc catgccggtcctatcaccaagatgttgcttacctttgatgatcagttcctgctgactgct gctgaggatggctgcctgttcacctggaaggtctttgataaggatggccggggaatcaag cgagagagggaggtgggctttgccgaagaggtgcttgtgactaaaacagacatggaagaa aagttgccgcccacccaccccagcaactcatcatcctgcctttggtccttccttaatcta gtccgtgctgcgtaccgccattggagagatctttataacaccaaagtcagcctgatcggg ttcgatgaggacctttggaagccctctaagataaaatccaaacttcctagcatggtctac cgggcccctgatggtctgacgcagtcttggctgccccttcctatacatccaacctgtgcc atatcacatgcagctcaggttatgttggagctaaagactcgtgtggaggaattaaaaatg gagaatgagtatcaactccgactaaaggacatgaactattctgagaagattaaggagcta acagacaagttcatccaggaaatggagtccttgaaaacaaaaaaccaggtcttaagaaca gaaaaagagaagcaggatgtttatcaccatgagcacatagaagacctcctagacaagcaa agccgggaactgcaggacatggaatgttgcaacaaccaaaagttgcttctagaatatgag aagtaccaggagctgcagctcaagtcccagaggatgcaggaagagtatgaaaaacagctc cgggataacgatgagaccaagagccaggccctggaggagctgactgagttttacgaggca aaactgcaggagaaaaccacccttctggaagaggcacaggaagacgtcaggcagcagctg cgggagtttgaagagaccaagaagcagattgaggaagatgaagaccgagaaatccaagat atcaaaaccaagtatgagaaaaagcttcgggatgaaaaggaatcaaacctgcggctcaag ggagaaacaggcatcatgaggaagaagttcagcagcctacagaaggagattgaagaacga accaatgacatcgagaccctaaaaggagagcagatgaagctgcaaggagtcattaagtct ctggagaaggacatccaaggcctcaagcgagagatccaggaaagagacgagactattcaa gacaaggagaagcgaatttatgatctgaaaaagaaaaatcaagaactagggaaattcaag tttgtgcttgactacaaaataaaggagctgaagaagcaaatagaacctcgagagaatgag atcagggtgatgaaggaacagattcaggagcccactgactttctgacagttcctatggaa gctgaactggagaatttccataagcagaacactcaactggagctgaacatcacagaattg tggcagaaactgagagccaccgatcaggagatgcgcagagagagacagaaggagcgagac ttggaagcgctggtcaaaaggtttaaaacagacctccacaactgcgtagcctatattcag gaaccgcggctgctgaaggagaaggttcgaggtctctttgagaagtacgtgcagcgagca gacatggtggagatcgcagggctgaacacagacctgcagcaggagtacacccggcagcgg gagcacctggagaggaacctggccactctcaagaagaaggtggtcaaggagggcgagctg caccgcacagactacgtccgcatcatgcaggtacctccccacagcagaggaccgtttgtc acagctcagggagaacagtgcttaagaccatcctgttctctgggctctgactccgtagcc accactcgtgagataatgtcccaagtagaagcacagtttgttcctggcacaggtccattt tatgtagatttggcctgcaactgccttttctggtatacagaccctcaggaaaatgtctct ctgatcaaggaaattaatgagctccgcagggagctgaagttcactcggtcccaagtctat gaccttgaagcagctctgaaactgaccaagaaagtccgaccacaagaagtttcagagaca gagttttggcatcagctgtggccaagcagtgtgggctacctagctgaggctcccctggtg tcctgctatctcaacactgaaaagcctgccctacagaaggttctagcatcactgtcctgc ttcttatacaaaatatggccagccattgctggtcccaggagtactcccattcaccacctt cttgtggatctgcgggtggttcgtccgcaccttccgctttgtgcccacggccacctcctc ctcctgaccttcaccgtcaccaatctgaattaccccaacaacctctgctttggggacaga gttcaacagccctgtgaagagcctgcctttccacttgactccaagttacttgataagacc cagaactag >gi568815597f:43172723_43373379|GENSCAN_predicted_peptide_3|219_aa MSEQEAQAPGGRGLPPDMLAEQVELWWSQQPRRSALCFVVAVGLVAGCGAGGVALLSTTS SRSGEWRLATGTVLCLLALLVLVKQLMSSAVQDMNCIRQAHHVALLRSGGGADALVVLLS GLVLLVTGLTLAGLAAAPAPARPLAAMLSVGIALAALGSLLLLGLLLYQVGVSGHCPSIC MATPSTHSGHGGHGSIFSISGQLSAGRRHETTSSIASLI >gi568815597f:43172723_43373379|GENSCAN_predicted_CDS_3|660_bp atgtctgaacaggaggctcaagccccagggggccgggggctgcccccggacatgctggca gagcaggtggagctgtggtggtcccagcagccgcggcgctcggcgctctgcttcgtcgtg gccgtgggcctcgtggcaggctgtggcgcgggcggcgtggcactgctgtcaaccaccagc agccgctcaggtgaatggcggctagcaacgggcactgtgctctgtttgctggctctgctg gttctggtgaaacagctgatgagctcggctgtgcaggacatgaactgcatccgccaggcc caccatgtggccctgctgcgcagtggtggaggggccgacgccctcgtggtgctgctcagt ggcctcgtgctgctggtcaccggcctgaccctggccgggctggccgccgcccctgcccct gctcggccgctggccgccatgctgtctgtgggcattgctctggctgccttgggctcgctt ttgctgctgggcctgctgctgtatcaagtgggtgtgagcggacactgcccctccatctgt atggccactccctccacccacagtggccatggcggccatggcagcatcttcagcatctca ggacagttgtctgctggccggcgtcacgagaccacatccagcattgccagcctcatctga >gi568815597f:43172723_43373379|GENSCAN_predicted_peptide_4|297_aa MACKAPSSTLDFVPVTGRSVVYQPEHLPAEVSAPLGPAGRFSGSQLYKPQALGGEGEVYT HKVLGKSLGERFCRPQPEAYSADCAQFLKVGTEEKARGGRGGGTQALHSTQGASGSRKEA THQSWALVGPSELPTASAVAPGPGTGARAWPVLVGFVLGAVVLSLLIALAAKCHLCRRYH ASYRHRPLPETGRGGRPQVAEDEDDDGFIEDNYIQPGTGELGTEGQSLAPHRSQQVPNPI LATVPYSPGESLSMFSSVLRTKSKFLCWAAKAIGQVGESLVLTHSPTAAAQSDAYAR >gi568815597f:43172723_43373379|GENSCAN_predicted_CDS_4|894_bp atggcctgcaaagctccctcttcaactctggactttgtaccagtcaccggacgttcagtt gtttatcaacctgagcacctgcctgctgaggtgtctgcccctctagggcccgctggacgc ttctctggatcccagctgtacaaaccgcaagcattggggggtgagggtgaggtctacacc cataaagttcttggcaaaagcttgggggagaggttctgccggccccagccggaagcttac tccgcggattgtgcacagttcttgaaggtgggaacagaagagaaggcccgggggggccgg ggagggggtacccaggctctgcacagtacccaaggggcttctggcagcaggaaggaagct acacatcagagttgggcacttgttgggccttcggagctccccacagcgtctgctgtggcc cctggcccaggcactggggctcgggcatggcctgtgctggtaggatttgtgctgggggct gtggtcctctcgctcctcattgcacttgctgccaaatgccacctctgccgccgataccat gccagctaccggcaccgcccactgcctgagacaggaaggggaggccgcccacaggtggct gaagatgaggatgatgatggcttcatcgaggacaattacattcagcctgggactggcgag ctggggacagagggccaatccctggctccacaccgcagccagcaagtcccaaatcctatt ctggccacagtaccctactcccctggcgaaagtctttccatgttttccagtgtcctgagg acaaagtccaaattcctctgctgggctgccaaggccattggacaagtaggagagagcctg gtgctcacccactcacccacggcagctgctcagtcagatgcttatgccaggtaa >gi568815597f:43172723_43373379|GENSCAN_predicted_peptide_5|1138_aa MVWRVPPFLLPILFLASHVGAAVDLTLLANLRLTDPQRFFLTCVSGEAGAGRGSDAWGPP LLLEKDDRIVRTPPGPPLRLARNGSHQVTLRGFSKPSDLVGVFSCVGGAGARRTRVIYVH NSPGAHLLPDKVTHTVNKGDTAVLSARVHKEKQTDVIWKSNGSYFYTLDWHEAQDGRFLL QLPNVQPPSSGIYSATYLEASPLGSAFFRLIVRGCGAGRWGPGCTKECPGCLHGGVCHDH DGECVCPPGFTGTRCEQACREGRFGQSCQEQCPGISGCRGLTFCLPDPYGCSCGSGWRGS QCQEACAPGHFGADCRLQCQCQNGGTCDRFSGCVCPSGWHGVHCEKSDRIPQILNMASEL EFNLETMPRINCAAAGNPFPVRGSIELRKPDGTVLLSTKAIVEPEKTTAEFEVPRLVLAD SGFWECRVSTSGGQDSRRFKVNVKVPPVPLAAPRLLTKQSRQLVVSPLVSFSGDGPISTV RLHYRPQDSTMDWSTIVVDPSENVTLMNLRPKTGYSVRVQLSRPGEGGEGAWGPPTLMTT DCPEPLLQPWLEGWHVEGTDRLRVSWSLPLVPGPLVGDGFLLRLWDGTRGQERRENVSSP QARTALLTGLTPGTHYQLDVQLYHCTLLGPASPPAHVLLPPSGPPAPRHLHAQALSDSEI QLTWKHPEALPGPISKYVVEVQVAGGAGDPLWIDVDRPEETSTIIRGLNASTRYLFRMRA SIQGLGDWSNTVEESTLGNGLQAEGPVQESRAAEEGLDQQLILAVVGSVSATCLTILAAL LTLVCIRRSCLHRRRTFTYQSGSGEETILQFSSGTLTLTRRPKLQPEPLSYPVLEWEDIT FEDLIGEGNFGQVIRAMIKKDGLKMNAAIKMLKEYASENDHRDFAGELEVLCKLGHHPNI INLLGACKNRGYLYIAIEYAPYGNLLDFLRKSRVLETDPAFAREHGTASTLSSRQLLRFA SDAANGMQYLSEKQFIHRDLAARNVLVGENLASKIADFGLSRGEEVYVKKTMGRLPVRWM AIESLNYSVYTTKSDVWSFGVLLWEIVSLGGTPYCGMTCAELYEKLPQGYRMEQPRNCDD EVYELMRQCWRDRPYERPPFAQIALQLGRMLEARKAYVNMSLFENFTYAGIDATAEEA >gi568815597f:43172723_43373379|GENSCAN_predicted_CDS_5|3417_bp atggtctggcgggtgccccctttcttgctccccatcctcttcttggcttctcatgtgggc gcggcggtggacctgacgctgctggccaacctgcggctcacggacccccagcgcttcttc ctgacttgcgtgtctggggaggccggggcggggaggggctcggacgcctggggcccgccc ctgctgctggagaaggacgaccgtatcgtgcgcaccccgcccgggccacccctgcgcctg gcgcgcaacggttcgcaccaggtcacgcttcgcggcttctccaagccctcggacctcgtg ggcgtcttctcctgcgtgggcggtgctggggcgcggcgcacgcgcgtcatctacgtgcac aacagccctggagcccacctgcttccagacaaggtcacacacactgtgaacaaaggtgac accgctgtactttctgcacgtgtgcacaaggagaagcagacagacgtgatctggaagagc aacggatcctacttctacaccctggactggcatgaagcccaggatgggcggttcctgctg cagctcccaaatgtgcagccaccatcgagcggcatctacagtgccacttacctggaagcc agccccctgggcagcgccttctttcggctcatcgtgcggggttgtggggctgggcgctgg gggccaggctgtaccaaggagtgcccaggttgcctacatggaggtgtctgccacgaccat gacggcgaatgtgtatgcccccctggcttcactggcacccgctgtgaacaggcctgcaga gagggccgttttgggcagagctgccaggagcagtgcccaggcatatcaggctgccggggc ctcaccttctgcctcccagacccctatggctgctcttgtggatctggctggagaggaagc cagtgccaagaagcttgtgcccctggtcattttggggctgattgccgactccagtgccag tgtcagaatggtggcacttgtgaccggttcagtggttgtgtctgcccctctgggtggcat ggagtgcactgtgagaagtcagaccggatcccccagatcctcaacatggcctcagaactg gagttcaacttagagacgatgccccggatcaactgtgcagctgcagggaaccccttcccc gtgcggggcagcatagagctacgcaagccagacggcactgtgctcctgtccaccaaggcc attgtggagccagagaagaccacagctgagttcgaggtgccccgcttggttcttgcggac agtgggttctgggagtgccgtgtgtccacatctggcggccaagacagccggcgcttcaag gtcaatgtgaaagtgccccccgtgcccctggctgcacctcggctcctgaccaagcagagc cgccagcttgtggtctccccgctggtctcgttctctggggatggacccatctccactgtc cgcctgcactaccggccccaggacagtaccatggactggtcgaccattgtggtggacccc agtgagaacgtgacgttaatgaacctgaggccaaagacaggatacagtgttcgtgtgcag ctgagccggccaggggaaggaggagagggggcctgggggcctcccaccctcatgaccaca gactgtcctgagcctttgttgcagccgtggttggagggctggcatgtggaaggcactgac cggctgcgagtgagctggtccttgcccttggtgcccgggccactggtgggcgacggtttc ctgctgcgcctgtgggacgggacacgggggcaggagcggcgggagaacgtctcatccccc caggcccgcactgccctcctgacgggactcacgcctggcacccactaccagctggatgtg cagctctaccactgcaccctcctgggcccggcctcgccccctgcacacgtgcttctgccc cccagtgggcctccagccccccgacacctccacgcccaggccctctcagactccgagatc cagctgacatggaagcacccggaggctctgcctgggccaatatccaagtacgttgtggag gtgcaggtggctgggggtgcaggagacccactgtggatagacgtggacaggcctgaggag acaagcaccatcatccgtggcctcaacgccagcacgcgctacctcttccgcatgcgggcc agcattcaggggctcggggactggagcaacacagtagaagagtccaccctgggcaacggg ctgcaggctgagggcccagtccaagagagccgggcagctgaagagggcctggatcagcag ctgatcctggcggtggtgggctccgtgtctgccacctgcctcaccatcctggctgccctt ttaaccctggtgtgcatccgcagaagctgcctgcatcggagacgcaccttcacctaccag tcaggctcgggcgaggagaccatcctgcagttcagctcagggaccttgacacttacccgg cggccaaaactgcagcccgagcccctgagctacccagtgctagagtgggaggacatcacc tttgaggacctcatcggggaggggaacttcggccaggtcatccgggccatgatcaagaag gacgggctgaagatgaacgcagccatcaaaatgctgaaagagtatgcctctgaaaatgac catcgtgactttgcgggagaactggaagttctgtgcaaattggggcatcaccccaacatc atcaacctcctgggggcctgtaagaaccgaggttacttgtatatcgctattgaatatgcc ccctacgggaacctgctagattttctgcggaaaagccgggtcctagagactgacccagct tttgctcgagagcatgggacagcctctacccttagctcccggcagctgctgcgtttcgcc agtgatgcggccaatggcatgcagtacctgagtgagaagcagttcatccacagggacctg gctgcccggaatgtgctggtcggagagaacctagcctccaagattgcagacttcggcctt tctcggggagaggaggtttatgtgaagaagacgatggggcgtctccctgtgcgctggatg gccattgagtccctgaactacagtgtctataccaccaagagtgatgtctggtcctttgga gtccttctttgggagatagtgagccttggaggtacaccctactgtggcatgacctgtgcc gagctctatgaaaagctgccccagggctaccgcatggagcagcctcgaaactgtgacgat gaagtgtacgagctgatgcgtcagtgctggcgggaccgtccctatgagcgaccccccttt gcccagattgcgctacagctaggccgcatgctggaagccaggaaggcctatgtgaacatg tcgctgtttgagaacttcacttacgcgggcattgatgccacagctgaggaggcctga >gi568815597f:43172723_43373379|GENSCAN_predicted_peptide_6|1097_aa MPSWALFMVTSCLLLAPQNLAQVSSQDVSLLASDSEPLKCFSRTFEDLTCFWDEEEAAPS GTYQLLYAYPRRDLFYANREKPRACPLSSQSMPHFGTRYVCQFPDQEEVRLFFPLHLWVK NVFLNQTRTQRVLFVDSVGLPAPPSIIKAMGGSQPGELQISWEEPAPEISDFLRYELRYG PRDPKNSTGPTVIQLIATETCCPALQRPHSASALDQSPCAQPTMPWQDGPKQTSPRLQPG NSYWLQLRSEPDGISLGGSWGSWSLPVTVDLPGDAVALGLQCFTLDLKNVTCQWQQQDHA SSQGFFYHSRARCCPRDRYPIWENCEEEEKTNPGLQTPQFSRCHFKSRNDSIIHILVEVT TAPGTVHSYLGSPFWIHQAVPTPTESDPVPRIPNSDPSDRWLWWHNALCTEGLKLLPADI PVVRLPTPNLHWREISSGHLELEWQHPSSWAAQETCYQLRYTGEGHQDWKVLEPPLGARG GTLELRPRSRYRLQLRARLNGPTYQGPWSSWSDPTRVETATETAWISLVTALHLVLGLSA VLGLLLLRWQFPAHYRRLRHALWPSLPDLHRVLGQYLRDTAALSPAPTARTPPPAGAPMA QFAFESDLHSLLQLDAPIPNAPPARWQRKAKEAAGPAPSPMRAANRSHSAGRTPGRTPGK SSSKVQTTPSKPGGDRYIPHRSAAQMEVASFLLSKENQPENSQTPTKKEHQKAWALNLNG FDVEEAKILRLSGKPQNAPEGYQNRLKVLYSQKATPGSSRKTCRYIPSLPDRILDAPEIR NDYYLNLVDWSSGNVLAVALDNSVYLWSASSGDILQLLQMEQPGEYISSVAWIKEGNYLA VGTSSAEVQLWDVQQQKRLRNMTSHSARVGSLSWNSYILSSGSRSGHIHHHDVRVAEHHV ATLSGHSQEVCGLRWAPDGRHLASGGNDNLVNVWPSAPGEGGWVPLQTFTQHQGAVKAVA WCPWQSNVLATGGGTSDRHIRIWNVCSGACLSAVDAHSQVCSILWSPHYKELISGHGFAQ NQLVIWKYPTMAKVAELKGHTSRVLSLTMSPDGATVASAAADETLRLWRCFELDPARRRE REKASAAKSSLIHQGIR >gi568815597f:43172723_43373379|GENSCAN_predicted_CDS_6|3294_bp atgccctcctgggccctcttcatggtcacctcctgcctcctcctggcccctcaaaacctg gcccaagtcagcagccaagatgtctccttgctggcatcagactcagagcccctgaagtgt ttctcccgaacatttgaggacctcacttgcttctgggatgaggaagaggcagcgcccagt gggacataccagctgctgtatgcctacccgcggagggacctcttctatgccaacagggag aagccccgtgcttgccccctgagttcccagagcatgccccactttggaacccgatacgtg tgccagtttccagaccaggaggaagtgcgtctcttctttccgctgcacctctgggtgaag aatgtgttcctaaaccagactcggactcagcgagtcctctttgtggacagtgtaggcctg ccggctccccccagtatcatcaaggccatgggtgggagccagccaggggaacttcagatc agctgggaggagccagctccagaaatcagtgatttcctgaggtacgaactccgctatggc cccagagatcccaagaactccactggtcccacggtcatacagctgattgccacagaaacc tgctgccctgctctgcagaggcctcactcagcctctgctctggaccagtctccatgtgct cagcccacaatgccctggcaagatggaccaaagcagacctccccaagactccagcctggc aactcctactggctgcagctgcgcagcgaacctgatgggatctccctcggtggctcctgg ggatcctggtccctccctgtgactgtggacctgcctggagatgcagtggcacttggactg caatgctttaccttggacctgaagaatgttacctgtcaatggcagcaacaggaccatgct agctcccaaggcttcttctaccacagcagggcacggtgctgccccagagacaggtacccc atctgggagaactgcgaagaggaagagaaaacaaatccaggactacagaccccacagttc tctcgctgccacttcaagtcacgaaatgacagcattattcacatccttgtggaggtgacc acagccccgggtactgttcacagctacctgggctcccctttctggatccaccaggctgtt cccacccccactgaatctgaccctgtgcccaggatccccaactctgacccttctgaccga tggctctggtggcacaatgccttgtgcacagaaggacttaagctgctccctgctgacatc cctgtagtgcgcctccccaccccaaacttgcactggagggagatctccagtgggcatctg gaattggagtggcagcacccatcgtcctgggcagcccaagagacctgttatcaactccga tacacaggagaaggccatcaggactggaaggtgctggagccgcctctcggggcccgagga gggaccctggagctgcgcccgcgatctcgctaccgtttacagctgcgcgccaggctcaac ggccccacctaccaaggtccctggagctcgtggtcggacccaactagggtggagaccgcc accgagaccgcctggatctccttggtgaccgctctgcatctagtgctgggcctcagcgcc gtcctgggcctgctgctgctgaggtggcagtttcctgcacactacaggagactgaggcat gccctgtggccctcacttccagacctgcaccgggtcctaggccagtaccttagggacact gcagccctgagcccggcaccaactgcaaggacccctccccctgcgggcgctcccatggca cagttcgcgttcgagagtgacctgcactcgctgcttcagctggatgcacccatccccaat gcaccccctgcgcgctggcagcgcaaagccaaggaagccgcaggcccggccccctcaccc atgcgggccgccaaccgatcccacagcgccggcaggactccgggccgaactcctggcaaa tccagttccaaggttcagaccactcctagcaaacctggcggtgaccgctatatcccccat cgcagtgctgcccagatggaggtggccagcttcctcctgagcaaggagaaccagcctgaa aacagccagacgcccaccaagaaggaacatcagaaagcctgggctttgaacctgaacggt tttgatgtagaggaagccaagatccttcggctcagtggaaaaccacaaaatgcgccagag ggttatcagaacagactgaaagtactctacagccaaaaggccactcctggctccagccgg aagacctgccgttacattccttccctgccagaccgtatcctggatgcgcctgaaatccga aatgactattacctgaaccttgtggattggagttctgggaatgtactggccgtggcactg gacaacagtgtgtacctgtggagtgcaagctctggtgacatcctgcagcttttgcaaatg gagcagcctggggaatatatatcctctgtggcctggatcaaagagggcaactacttggct gtgggcaccagcagtgctgaggtgcagctatgggatgtgcagcagcagaaacggcttcga aatatgaccagtcactctgcccgagtgggctccctaagctggaacagctatatcctgtcc agtggttcacgttctggccacatccaccaccatgatgttcgggtagcagaacaccatgtg gccacactgagtggccacagccaggaagtgtgtgggctgcgctgggccccagatggacga catttggccagtggtggtaatgataacttggtcaatgtgtggcctagtgctcctggagag ggtggctgggttcctctgcagacattcacccagcatcaaggggctgtcaaggccgtagca tggtgtccctggcagtccaatgtcctggcaacaggagggggcaccagtgatcgacacatt cgcatctggaatgtgtgctctggggcctgtctgagtgccgtggatgcccattcccaggtg tgctccatcctctggtctccccattacaaggagctcatctcaggccatggctttgcacag aaccagctagttatttggaagtacccaaccatggccaaggtggctgaactcaaaggtcac acatcccgggtcctgagtctgaccatgagcccagatggggccacagtggcatccgcagca gcagatgagaccctgaggctatggcgctgttttgagttggaccctgcgcggcggcgggag cgggagaaggccagtgcagccaaaagcagcctcatccaccaaggcatccgctga >gi568815597f:43172723_43373379|GENSCAN_predicted_peptide_7|279_aa MEAVVNLYQEVMKHADPRIQGYPLMGSPLLMTSILLTYVYFVLSLGPRIMANRKPFQLRG FMIVYNFSLVALSLYIVYEFLMSGWLSTYTWRCDPVDYSNSPEALRMVRVAWLFLFSKFI ELMDTVIFILRKKDGQVTFLHVFHHSVLPWSWWWGVKIAPGGMGSFHAMINSSVHVIMYL YYGLSAFGPVAQPYLWWKKHMTAIQLIQFVLVSLHISQYYFMSSCNYQYPVIIHLIWMYG TIFFMLFSNFWYHSYTKGKRLPRALQQNGAPGIAKVKAN >gi568815597f:43172723_43373379|GENSCAN_predicted_CDS_7|840_bp atggaggctgttgtgaacttgtaccaagaggtgatgaagcacgcagatccccggatccag ggctaccctctgatggggtcccccttgctaatgacctccattctcctgacctacgtgtac ttcgttctctcacttgggcctcgcatcatggctaatcggaagcccttccagctccgtggc ttcatgattgtctacaacttctcactggtggcactctccctctacattgtctatgagttc ctgatgtcgggctggctgagcacctatacctggcgctgtgaccctgtggactattccaac agccctgaggcacttaggatggttcgggtggcctggctcttcctcttctccaagttcatt gagctgatggacacagtgatctttattctccgaaagaaagacgggcaggtgaccttccta catgtcttccatcactctgtgcttccctggagctggtggtggggggtaaagattgccccg ggaggaatgggctctttccatgccatgataaactcttccgtgcatgtcataatgtacctg tactacggattatctgcctttggccctgtggcacaaccctacctttggtggaaaaagcac atgacagccattcagctgatccagtttgtcctggtctcactgcacatctcccagtactac tttatgtccagctgtaactaccagtacccagtcattattcacctcatctggatgtatggc accatcttcttcatgctgttctccaacttctggtatcactcttataccaagggcaagcgg ctgccccgtgcacttcagcaaaatggagctccaggtattgccaaggtcaaggccaactga