GENSCAN 1.0 Date run: 4-Nov-116 Time: 20:30:41

Sequence gi568815579f:40651409_40865164 : 213756 bp : 51.20% C+G : Isochore 3 (51 - 57 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.11 PlyA -    519    514    6                               1.05
 1.10 Term -  16730  16060  671  0  2  160   41   538 0.961  50.68
 1.09 Intr -  18612  18490  123  1  0  121  103   113 0.999  17.16
 1.08 Intr -  22241  21936  306  0  0   72   99   316 0.614  28.07
 1.07 Intr -  26013  25824  190  0  1  130   98   268 0.993  31.68
 1.06 Intr -  29649  29509  141  0  0   82   75   166 0.988  15.76
 1.05 Intr -  31394  31320   75  2  0  131   97   143 0.999  19.71
 1.04 Intr -  31560  31486   75  0  0   41  101   112 0.900   8.01
 1.03 Intr -  33148  33009  140  2  2   80   96   145 0.143  15.19
 1.02 Intr -  35587  35503   85  1  1   97   99    47 0.160   6.59
 1.01 Init -  35903  35826   78  2  0   52   45    33 0.044  -4.64
 1.00 Prom -  37047  37008   40                              -0.01

 2.15 PlyA -  38433  38428    6                               1.05
 2.14 Term -  40965  40627  339  0  0   78   48   455 0.988  35.39
 2.13 Intr -  41629  41543   87  1  0  121   81    81 0.994  11.36
 2.12 Intr -  44646  44581   66  0  0   97  105    74 0.983   9.59
 2.11 Intr -  48766  48659  108  1  0   81   77   152 0.915  14.38
 2.10 Intr -  49043  48902  142  1  1   86   85   243 0.999  24.66
 2.09 Intr -  51285  51192   94  1  1   92   99   144 0.999  15.42
 2.08 Intr -  52214  52133   82  2  1  131   76   117 0.999  14.51
 2.07 Intr -  52447  52307  141  1  0   75   94   192 0.999  19.56
 2.06 Intr -  53867  53688  180  2  0  129  105   186 0.768  24.98
 2.05 Intr -  54039  53942   98  1  2   85   58    92 0.765   6.13
 2.04 Intr -  58728  58651   78  1  0  109  109    34 0.959   7.62
 2.03 Intr -  62725  62659   67  1  1   84   61   100 0.548   5.87
 2.02 Intr -  63227  63123  105  2  0  128   86    14 0.970   6.11
 2.01 Init -  64552  64511   42  1  0   87   92    54 0.992   6.06
 2.00 Prom -  64995  64956   40                              -7.40

 3.00 Prom +  65063  65102   40                              -4.51
 3.01 Init +  65728  66882 1155  0  0   86   84   543 0.983  45.68
 3.02 Intr +  73932  74031  100  2  1  143  119    80 0.994  16.78
 3.03 Intr +  77794  78007  214  2  1   80   75   286 0.980  24.80
 3.04 Intr +  81752  81956  205  2  1   75  100   289 0.999  28.53
 3.05 Intr +  85578  85679  102  2  0   84  105   202 0.989  22.37
 3.06 Intr +  86290  86442  153  0  0   53   48   129 0.731   6.18
 3.07 Term +  87949  88152  204  0  0  129   49   293 0.995  27.19
 3.08 PlyA +  89429  89434    6                               1.05

 4.06 PlyA -  89471  89466    6                               1.05
 4.05 Term -  91362  91025  338  1  2  142   37   382 0.379  33.59
 4.04 Intr -  92649  92497  153  1  0   71   94    46 0.400   4.06
 4.03 Intr -  93275  93151  125  1  2   89   99   110 0.978  12.93
 4.02 Intr -  98199  98073  127  1  1   86   70   153 0.556  13.54
 4.01 Init -  99107  99062   46  2  1   36   40    21 0.272  -6.79
 4.00 Prom -  99334  99295   40                              -4.71

 5.00 Prom +  99723  99762   40                              -8.58
 5.01 Init + 100001 100073   73  1  1   55  101   104 0.826   9.58
 5.02 Intr + 105924 106096  173  1  2   66   89   245 0.764  22.58
 5.03 Intr + 108023 108202  180  1  0   76  119   324 0.998  34.88
 5.04 Intr + 111493 111666  174  0  0  156   76   168 0.987  23.05
 5.05 Intr + 112179 112267   89  2  2  115   89   118 0.999  13.87
 5.06 Term + 113600 113759  160  2  1  110   42   263 0.996  21.63
 5.07 PlyA + 115639 115644    6                               1.05

 6.00 Prom + 117391 117430   40                              -0.81
 6.01 Init + 124135 124261  127  0  1   99   60   164 0.613  14.44
 6.02 Intr + 124344 124477  134  1  2   97   75   192 0.979  19.67
 6.03 Intr + 125561 125671  111  1  0  102   86   175 0.922  19.78
 6.04 Intr + 126869 126983  115  1  1  -74   89   143 0.728  -1.28
 6.05 Intr + 128611 128742  132  2  0   49   42    88 0.447   1.42
 6.06 Intr + 128977 129091  115  2  1   94   86   173 0.980  17.71
 6.07 Intr + 132370 132432   63  1  0  107   94   123 0.999  13.12
 6.08 Intr + 132513 132667  155  0  2  128   77   260 0.998  29.13
 6.09 Intr + 134341 134457  117  2  0   77   77    68 0.955   5.54
 6.10 Intr + 135257 135352   96  0  0   86   94   189 0.999  19.78
 6.11 Term + 135440 135555  116  0  2  128   49   122 0.999  11.34
 6.12 PlyA + 137138 137143    6                               1.05

 7.00 Prom + 138764 138803   40                              -1.91
 7.01 Init + 147788 147854   67  1  1   61   85    71 0.397   5.29
 7.02 Intr + 149319 150007  689  1  2   73  116   779 0.390  71.10
 7.03 Intr + 155147 155266  120  1  0  104   97    95 0.871  13.19
 7.04 Intr + 155730 155897  168  2  0   88   47   254 0.647  21.96
 7.05 Term + 156401 156556  156  1  0  117   38   110 0.708   7.15
 7.06 PlyA + 157000 157005    6                               1.05

 8.17 PlyA - 157060 157055    6                               1.05
 8.16 Term - 157228 157119  110  2  2   40   51    -4 0.564  -9.93
 8.15 Intr - 157584 157378  207  1  0    6   61   201 0.774   8.77
 8.14 Intr - 158219 158039  181  2  1  -24   75   231 0.857  10.56
 8.13 Intr - 158711 158550  162  2  0   54   82    95 0.965   6.19
 8.12 Intr - 159087 158929  159  0  0   89   78   146 0.997  14.40
 8.11 Intr - 159402 159221  182  1  2  131   46    44 0.501   4.60
 8.10 Intr - 159982 159820  163  1  1   38   51   150 0.428   6.36
 8.09 Intr - 160506 160475   32  1  2   73   81    45 0.264   0.63
 8.08 Intr - 167491 167360  132  2  0   93   22   199 0.123  14.92
 8.07 Intr - 170217 170076  142  0  1  105   81   231 0.994  24.54
 8.06 Intr - 170991 170804  188  1  2  122   91   308 0.999  34.43
 8.05 Intr - 173853 173712  142  0  1   70  100   220 0.995  21.84
 8.04 Intr - 174614 174483  132  2  0   90   59    58 0.953   4.45
 8.03 Intr - 174864 174688  177  0  0   92   74   279 0.999  27.53
 8.02 Intr - 175363 175203  161  2  2  110   60   278 0.999  27.42
 8.01 Init - 179215 179152   64  1  1   85   54     6 0.140  -1.84
 8.00 Prom - 187997 187958   40                              -2.71

 9.10 PlyA - 188405 188400    6                               1.05
 9.09 Term - 192569 192388  182  0  2  137   53   257 0.999  25.19
 9.08 Intr - 193364 193223  142  2  1  117  103   107 0.999  15.54
 9.07 Intr - 194073 193886  188  1  2  105   43   296 0.999  26.73
 9.06 Intr - 194689 194548  142  1  1   75   86   271 0.999  26.04
 9.05 Intr - 195643 195467  177  1  0   81   71   280 0.997  26.23
 9.04 Intr - 196971 196811  161  1  2  113   93   240 0.998  27.22
 9.03 Intr - 197355 197206  150  1  0  106   89   403 0.996  42.85
 9.02 Intr - 198572 198410  163  2  1   91   77   216 0.998  20.86
 9.01 Init - 199018 198839  180  1  0   95   32   290 0.997  21.26
 9.00 Prom - 199071 199032   40                              -0.21

10.02 PlyA - 199432 199427    6                               1.05
10.01 Term - 201186 201010  177  0  0  125   54   102 0.643   8.50

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init - 140139 140053   87  0  0   84  107    22 0.919   4.32
S.002 Term - 167491 167310  182  2  2   93   42   218 0.869  15.79

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815579f:40651409_40865164|GENSCAN_predicted_peptide_1|627_aa
MSWLRMCVALHMHLEGKATLTASPWAGGPRRPERHLPPAPCGAPGPPETCRTEPDGAGTM
NKLRQSLRRRKPAYVPEASRPHQWQADEDAVRKGTCSFPVRYLGHVEVEESRGMHVCEDA
VKKLKAMGRKSVKSVLWVSADGLRVVDDKTKDLLVDQTIEKVSFCAPDRNLDKAFSYICR
DGTTRRWICHCFLALKDSGERLSHAVGCAFAACLERKQRREKECGVTAAFDASRTSFARE
GSFRLSGGGRPAEREAPDKKKAEAAAAPTVAPGPAQPGHVSPTPATTSPGEKGEAGTPVA
AGTTAAAIPRRHAPLEQLVRQGSFRGFPALSQKNSPFKRQLSLRLNELPSTLQRRTDFQV
KGTVPEMEPPGAGDSDSINALCTQISSSFASAGAPAPGPPPATTGTSAWGEPSVPPAAAF
QPGHKRTPSEAERWLEEVSQVAKAQQQQQQQQQQQQQQQQQQQQAASVAPVPTMPPALQP
FPAPVGPFDAAPAQVAVFLPPPHMQPPFVPAYPGLGYPPMPRVPVVGITPSQMVANAFCS
AAQLQPQPATLLGKAGAFPPPAIPSAPGSQARPRPNGAPWPPEPAPAPAPELDPFEAQWA
ALEGKATVEKPSNPFSGDLQKTFEIEL

>gi568815579f:40651409_40865164|GENSCAN_predicted_CDS_1|1884_bp
atgagttggctgcgtatgtgtgtagccctgcacatgcatctggagggcaaggccactctc
acagcatccccttgggcaggcggaccccggaggcctgagcggcacctgcccccagccccc
tgtggggccccggggcccccagaaacctgcaggacggagccagacggggcgggcaccatg
aacaagttacggcagagcctgcggcggaggaagccagcctacgtgcccgaggcgtcgcgc
ccgcaccagtggcaggcagacgaggacgcggtgcggaagggcacgtgcagcttcccggtc
aggtacctgggtcacgtggaggtagaggagtcccggggaatgcacgtgtgtgaagatgcg
gtgaagaagctgaaggcgatgggccgaaagtccgtgaagtctgtcctgtgggtgtcagcc
gatgggctccgagtggtggacgacaaaaccaaggatcttctggtcgaccagaccatcgaa
aaggtctccttttgtgctcctgaccgcaacctggacaaggctttctcctatatctgtcgt
gacgggactacccgccgctggatctgccactgttttctggcactgaaggactccggcgag
aggctgagccacgctgtgggctgtgcttttgccgcctgcctggagcgaaaacagcgacgg
gagaaggaatgtggggtcacggccgccttcgatgccagccgcaccagcttcgcccgcgag
ggctccttccgcctgtctgggggtgggcggcctgctgagcgagaggccccggacaagaag
aaagcagaggcagcagctgcccccactgtggctcctggccctgcccagcctgggcacgtg
tccccgacaccagccaccacatcccctggtgagaagggtgaggcaggcacccctgtggct
gcaggcaccactgcggccgccatcccccggcgccatgcacccctggagcagctggttcgc
cagggctccttccgtgggttcccagcactcagccagaagaactcgcctttcaaacggcag ctgagcctacggctgaatgagctgccatccacgctgcagcgccgcactgacttccaggtg aagggcacagtgcctgagatggagcctcctggtgccggcgacagtgacagcatcaacgct ctgtgcacacagatcagttcatcttttgccagtgctggagcgccagcaccagggccacca cctgccacaacagggacttctgcctggggtgagccctccgtgccccctgcagctgccttc cagcctgggcacaagcggacaccttcagaggctgagcgatggctggaggaggtgtcacag gtggccaaggcccagcagcagcagcagcagcaacagcaacagcagcagcagcagcagcag caacagcagcaagcagcctcagtggccccagtgcccaccatgcctcctgccctgcagcct ttccccgcccccgtggggccctttgacgctgcacctgcccaagtggccgtgttcctgcca cccccacacatgcagcccccttttgtgcccgcctacccgggcttgggctacccaccgatg ccccgggtgcccgtggtgggcatcacaccctcacagatggtggcaaacgccttctgctca gccgcccagctccagcctcagcctgccactctgcttgggaaagctggggccttcccgccc cctgccatacccagtgcccctgggagccaggcccgccctcgccccaatggggccccctgg ccccctgagccagcgcctgccccagctccagagttggacccctttgaggcccagtgggcg gcattagaaggcaaagccactgtagagaaaccctccaaccccttttctggcgacctgcaa aagacattcgagattgaactgtag >gi568815579f:40651409_40865164|GENSCAN_predicted_peptide_2|542_aa MPQPDASIPDCEKWAMWLKVGGLLRGTGGQLGQTVGWPCGALGPGPHRWLSDRSRERKVP ASRISRLANFGGLAVGLGLGVLAEMAKKSMPGGRLQSEGGSGLDSSPFLSEANAERIVQT LCTVRGAALKPYPPTGPNFRYSHGVPPHLFAYFPPGSTVSQDNSFISPQLQHIFERVRQS ADFMPRWQMLRVLEEELGRDWQAKVASLEEVPFAAASIGQVHQGLLRDGTEVAVKIQYPG IAQSIQSDVQNLLAVLKMSAALPAGLFAEQSLQALQQELAWECDYRREAACAQNFRQLLA NDPFFRVPAVVKELCTTRVLGMELAGGVPLDQCQGLSQDLRNQICFQLLTLCLRELFEFR FMQTDPNWANFLYDASSHQVTLLDFGASREFGTEFTDHYIEVVKAAADGDRDCVLQKSRD LKFLTGFETKAFSDAHVEAVMILGEPFATQGPYDFGSGETARRIQDLIPVLLRHRLCPPP EETYALHRKLAGAFLACAHLRAHIACRDLFQDTYHRYWASRQPDAATAGSLPTKGDSWVD PS >gi568815579f:40651409_40865164|GENSCAN_predicted_CDS_2|1629_bp atgccccaaccagatgcctctattccagactgtgagaagtgggcaatgtggctgaaggtg gggggcctacttcgggggaccggtggacagctgggccagactgttggttggccttgtggg gccctggggcctgggccccaccgctggctgagtgaccgctctcgagaacgcaaggtgcct gcctcccgcatcagccgcttggccaactttgggggactggctgtgggcttggggctagga gtactggccgagatggctaagaagtccatgccaggaggtcgtctgcagtcagagggtggt tctgggctggactccagccccttcctgtcggaggccaatgccgagcggattgtgcagacc 
ttatgtacagttcgaggggccgccctcaagccctacccaccaacaggccccaacttcaga tactcacacggggtccctcctcacctcttcgcttacttcccccctggttccactgtctcc caagacaacagcttcatcagccctcagctgcagcacatctttgagcgggtccgccagagc gccgacttcatgccccgctggcagatgctgagagttcttgaagaggagctcggcagggac tggcaggccaaggtggcctccttggaggaggtgccctttgccgctgcctcaattgggcag gtgcaccagggcctgctgagggacgggacggaggtggccgtgaagatccagtaccccggc atagcccagagcattcagagcgatgtccagaacctgctggcggtactcaagatgagcgcg gccctgcccgcgggcctgtttgccgagcagagcctgcaggccttgcagcaggagctggct tgggagtgtgactaccgtcgtgaggcggcttgtgcccagaatttcaggcagctgctggca aatgaccccttcttccgggtcccagccgtggttaaggagctgtgcacgacacgggtgctg ggcatggagctggctggaggggtccccctggaccagtgccagggcctaagccaggacctg cggaaccagatttgcttccagctcctgacgctgtgtctgcgggagctgtttgagttccga ttcatgcagactgaccccaactgggccaacttcctgtatgatgcctccagccaccaggtg accctgctggactttggtgcaagccgggagtttgggacagagttcacagaccattacatc gaggtggtgaaggctgcagctgatggagacagagactgtgtcctgcagaagtccagggac ctcaaattcctcacaggctttgaaaccaaggcattctccgacgcccacgtggaggcagtg atgatcctgggggagcctttcgccacccagggcccttatgactttgggtcgggggaaacg gcccgccgcatacaggacctcatcccggtgctgctgcggcaccggctgtgtcccccaccc gaggagacctatgccctgcaccgcaagctggcaggggctttcctggcctgtgcccacctc cgagcccacatcgcctgcagggacctcttccaggacacctaccaccgctactgggccagt cgccagccagacgcagccactgccggcagcctccccaccaaaggggactcctgggtggat ccctcatga >gi568815579f:40651409_40865164|GENSCAN_predicted_peptide_3|710_aa MRRCPCRGSLNEAEAGALPAAARMGLEAPRGGRRRQPGQQRPGPGAGAPAGRPEGGGPWA RTEGSSLHSEPERAGLGPAPGTESPQAEFWTDGQTEPAAAGLGVETERPKQKTEPDRSSL RTHLEWSWSELETTCLWTETGTDGLWTDPHRSDLQFQPEEASPWTQPGVHGPWTELETHG SQTQPERVKSWADNLWTHQNSSSLQTHPEGACPSKEPSADGSWKELYTDGSRTQQDIEGP WTEPYTDGSQKKQDTEAARKQPGTGGFQIQQDTDGSWTQPSTDGSQTAPGTDCLLGEPED GPLEEPEPGELLTHLYSHLKCSPLCPVPRLIITPETPEPEAQPVGPPSRVEGGSGGFSSA SSFDESEDDVVAGGGGASDPEDRSGSKPWKKLKTVLKYSPFVVSFRKHYPWVQLSGHAGN FQAGEDGRILKRFCQCEQRSLEQLMKDPLRPFVPAYYGMVLQDGQTFNQMEDLLADFEGP SIMDCKMGSRTYLEEELVKARERPRPRKDMYEKMVAVDPGAPTPEEHAQGAVTKPRYMQW RETMSSTSTLGFRIEGIKKADGTCNTNFKKTQALEQVTKVLEDFVDGDHVILQKYVACLE 
ELREALEISPFFKTHEVRALASMGGCMGVGVQVHGGETIDELQVVGSSLLFVHDHTGLAK VWMIDFGKTVALPDHQTLSHRLPWAEGNREDGYLWGLDNMICLLQGLAQS >gi568815579f:40651409_40865164|GENSCAN_predicted_CDS_3|2133_bp atgaggcgctgcccgtgccgtgggagcctgaacgaggcggaggccggggcgctgcccgcg gcggcccgcatgggactggaggcgccgcgaggagggcggcggcggcagccgggacagcag cgacctgggcccggcgcaggggccccggcggggcggccggaggggggcgggccctgggcc cggacagaggggtccagcctccacagcgagcctgagagggccggcctcgggcctgcgccg gggacagagagtccgcaggcagaattctggacagacggacagactgagcccgcggcagct ggccttggagtagagaccgagaggcccaagcaaaagacggagccagacaggtccagcctc cggacgcatctagaatggagctggtcagagctggagacgacttgtctttggacggagacc gggacagatggcctttggactgatccgcacaggtccgacctccagtttcagcccgaggag gccagcccctggacacagccaggggttcatgggccctggacagagctggaaacgcatggg tcacagactcagccagagagggtcaagtcctgggctgataacctctggacccaccagaac agttccagcctccagactcacccagaaggagcctgtccctcaaaagagccaagtgctgat ggctcctggaaagaattgtatactgatggctccaggacacaacaggatattgaaggtccc tggacagagccatatactgatggctcccagaaaaaacaggatactgaagcagccaggaaa cagcctggcactggtggtttccaaatacaacaggatactgatggctcctggacacaacct agcactgacggttcccagacagcacctgggacagactgcctcttgggagagcctgaggat ggcccattagaggaaccagagcctggagaattgctgactcacctgtactctcacctgaag tgtagccccctgtgccctgtgccccgcctcatcattacccctgagacccctgagcctgag gcccagccagtgggacccccctcccgggttgaggggggcagcggcggcttctcctctgcc tcttctttcgacgagtctgaggatgacgtggtggccgggggcggaggtgccagcgatccc gaggacaggtctgggagcaaaccctggaagaagctgaagacagttctgaagtattcaccc tttgtggtctccttccgaaaacactacccttgggtccagctttctggacatgctgggaac ttccaggcaggagaggatggtcggattctgaaacgtttctgtcagtgtgagcagcgcagc ctggagcagctgatgaaagacccgctgcgacctttcgtgcctgcctactatggcatggtg ctgcaggatggccagaccttcaaccagatggaagacctcctggctgactttgagggcccc tccattatggactgcaagatgggcagcaggacctatctggaagaggagctagtgaaggca cgggaacgtccccgtccccggaaggacatgtatgagaagatggtggctgtggaccctggg gcccctacccctgaggagcatgcccagggtgcagtcaccaagccccgctacatgcagtgg agggaaaccatgagctccacctctaccctgggcttccggatcgagggcatcaagaaggca gatgggacctgtaacaccaacttcaagaagacgcaggcactggagcaggtgacaaaagtg 
ctggaggacttcgtggatggagaccacgtcatcctgcaaaagtacgtggcatgcctagaa gaacttcgtgaagctctggagatctcccccttcttcaagacccacgaggtgcgagccctg gcttccatgggtggatgtatgggtgtcggggtccaggtgcatggaggggaaaccattgat gagttacaggtggtaggcagctccctcctcttcgtgcacgaccacaccggcctggccaag gtctggatgatagacttcggcaagacggtggccttgcccgaccaccagacgctcagccac aggctgccctgggctgagggcaaccgtgaggacggctacctctggggcctggacaacatg atctgcctcctgcaggggctggcacagagctga >gi568815579f:40651409_40865164|GENSCAN_predicted_peptide_4|262_aa MGKAVCGGRHFPGVGGPAPRHVFGLEKSQLLKEAFDKAGPVPKGREDVKRLLKLHKDRCG LVALWMAGTLLSPPSGVPLERLIRVATERGYTAQGEMFSGPGREEGRGGKGWVPGPHHRF YLNASTSYDEDFNHEPCQRKGHKAHWAVSAGVLLGVRAVPSLGYTEDPELPGLFHPVLGT PCQPPSLPEEGSPGAVYLLSKQGKSWHYQLWDYDQVRESNLQLTDFSPSRATDGRVYVVP VGGVRAGLCGQALLLTPQDCSH >gi568815579f:40651409_40865164|GENSCAN_predicted_CDS_4|789_bp atgggaaaggcagtgtgcgggggccgccattttccgggagtgggagggcctgctccccgt catgtcttcggcctggagaagagccagctcctgaaggaggcctttgataaggccggcccg gtccccaagggcagagaagatgtgaagaggcttctgaaactacacaaggaccgatgcggg ctggtggccttgtggatggcaggtactctcctgtcgccccccagtggcgtccccctggag agactcatacgggtggccacggaaagaggctacacggcccagggagagatgttctcaggg cctgggagagaggagggcaggggtggcaaagggtgggtcccagggccacaccatcgtttt taccttaatgccagcaccagctacgacgaggacttcaaccatgagccgtgtcagaggaag ggccacaaggcacactgggcggtgagtgcaggggtcctgctgggtgttcgggctgtgccc agtctcggctacactgaggaccctgagctgccgggcctgttccacccagtgctgggcacg ccctgccaaccaccatccctgccagaggagggctccccgggagctgtctacctgctgtcc aagcagggcaagagttggcactatcagctgtgggactacgaccaggtccgggagagcaac ctgcagctgacggacttctcgccctcacgggccactgacggccgggtgtacgtggtgccc gtgggtggggtacgggctggcctctgtggccaggccctgctcctcacaccacaggactgc agccattag >gi568815579f:40651409_40865164|GENSCAN_predicted_peptide_5|282_aa MAVPETRPNHTIYINNLNEKIKKDELKKSLYAIFSQFGQILDILVSRSLKMRGQAFVIFK EVSSATNALRSMQGFPFYDKPMRIQYAKTDSDIIAKMKGTFVERDRKREKRKPKSQETPA TKKAVQGGGATPVVGAVQGPVPGMPPMTQAPRIMHHMPGQPPYMPPPGMIPPPGLAPGQI PPGAMPPQQLMPGQMPPAQPLSENPPNHILFLTNLPEETNELMLSMLFNQFPGFKEVRLV PGRHDIAFVEFDNEVQAGAARDALQGFKITQNNAMKISFAKK 
>gi568815579f:40651409_40865164|GENSCAN_predicted_CDS_5|849_bp atggcagttcccgagacccgccctaaccacactatttatatcaacaacctcaatgagaag atcaagaaggatgagctaaaaaagtccctgtacgccatcttctcccagtttggccagatc ctggatatcctggtatcacggagcctgaagatgaggggccaggcctttgtcatcttcaag gaggtcagcagcgccaccaacgccctgcgctccatgcagggtttccctttctatgacaaa cctatgcgtatccagtatgccaagaccgactcagatatcattgccaagatgaaaggcacc ttcgtggagcgggaccgcaagcgggagaagaggaagcccaagagccaggagaccccggcc accaagaaggctgtgcaaggcgggggagccacccccgtggtgggggctgtccaggggcct gtcccgggcatgccgccgatgactcaggcgccccgcattatgcaccacatgccgggccag ccgccctacatgccgccccctggtatgatccccccgccaggccttgcacctggccagatc ccaccaggggccatgcccccgcagcagcttatgccaggacagatgccccctgcccagcct ctttctgagaatccaccgaatcacatcttgttcctcaccaacctgccagaggagaccaac gagctcatgctgtccatgcttttcaatcagttccctggcttcaaggaggtccgtctggta cccgggcggcatgacatcgccttcgtggagtttgacaatgaggtacaggcaggggcagct cgcgatgccctgcagggctttaagatcacgcagaacaacgccatgaagatctcctttgcc aagaagtag >gi568815579f:40651409_40865164|GENSCAN_predicted_peptide_6|426_aa MARSLVCLGVIILLSAFSGPGVRGGPMPKLADRKLCADQECSHPISMAVALQDYMAPDCR FLTIHRGQVVYVFSKLKGRGRLFWGGSVQGDYYGDLAARLGYFPSSIVREDQTLKPGKVD VKTDEGAGAVAGVERLPAEEQARPRRHIAALSGRDRVMAETYDFLFKFLVIGSAGTGKSC LLHQFIENKCEFPAVVLGTLSCVYAHVKQDSNHTIGVEFGSRVVNVGGKTVKLQIWDTAG QERFRSVTRSYYRGAAGALLVYDITSRETYNSLAAWLTDARTLASPNIVVILCGNKKDLD PEREVTFLEASRFAQENDGARDTVLETALGPTFMELTVQWVRVEGQTHFQTVTPQEELMF LETSALTGENVEEAFLKCARTILNKIDSGELDPERMGSGIQYGDASLRQLRQPRSAQAVA PQPCGC >gi568815579f:40651409_40865164|GENSCAN_predicted_CDS_6|1281_bp atggcccggtccctggtgtgccttggtgtcatcatcttgctgtctgccttctccggacct ggtgtcaggggtggtcctatgcccaagctggctgaccggaagctgtgtgcggaccaggag tgcagccaccctatctccatggctgtggcccttcaggactacatggcccccgactgccga ttcctgaccattcaccggggccaagtggtgtatgtcttctccaagctgaagggccgtggg cggctcttctggggaggcagcgttcagggagattactatggagatctggctgctcgcctg ggctatttccccagtagcattgtccgagaggaccagaccctgaaacctggcaaagtcgat gtgaagacagacgaaggagccggggctgtagccggagtggagcggctgccagccgaggag caggcgcggccgcggcgccatattgcggccctcagcggccgcgaccgagtcatggctgag 
acctacgacttcctcttcaaattcctggtgattggcagtgcaggaactggcaaatcatgt ctccttcatcagttcattgagaataagtgtgagtttcccgcagtggtcctgggaaccctg agctgtgtttatgcgcatgtcaaacaggactccaaccacacaatcggcgtggagtttgga tcccgggtggtcaacgtgggtgggaagactgtgaagctacagatttgggacacggctggc caggagcggtttcggtcagtgacgcggagttattaccgaggggcggctggagccctgctg gtgtacgacatcaccagccgggagacatacaactcactggctgcctggctgacggatgcc cgcaccctggccagccccaacatcgtggtcatcctctgtggcaacaagaaggacctggac cctgagcgggaggtcactttcctggaggcctcccgctttgcccaggagaatgatggtgct agagacacagtgttggagacagccctaggcccaaccttcatggagctcacagttcagtgg gtgagggtggaaggacagacccacttccagacagtgacacctcaggaagagctgatgttc ctggagaccagcgctctcacaggcgagaacgtggaggaggcgttcctcaagtgtgcccgc actatcctcaacaagattgactcaggcgagctagacccggagaggatgggctctggcatt cagtacggggatgcgtccctccgccagcttcggcagcctcggagtgcccaggccgtggcc cctcagccgtgtggctgctga >gi568815579f:40651409_40865164|GENSCAN_predicted_peptide_7|399_aa MARCAAQGGWHKRRRRGRRKKLGVPSEASAGSGTPRATATSTTASPLRDGFGGQDGGELR PLQSEGAAALVTKGCQRLAAQGARPEAPKRKWAEDGGDAPSPSKRPWARQENQEAEREGG MSCSCSSGSGEASAGLMEEALPSAPERLALDYIVPCMRYYGICVKDSFLGAALGGRVLAE VEALKRGGRLRDGQLVSQRAIPPRSIRGDQIAWVEGHEPGCRSIGALMAHVDAVIRHCAG RLGSYVINGRTKAMVACYPGNGLGYVRHVDNPHGDGRCITCIYYLNQNWDVKVHGGLLQI FPEGRPVVANIEPLFDRLLIFWSDRRNPHEVKPAYATRYDLYFWRRTQHQDRKVSKYLYH SRLRPPSGQSQSRMADSLNDFRRALGLCWLLLPCHRCCF >gi568815579f:40651409_40865164|GENSCAN_predicted_CDS_7|1200_bp atggcgcgctgtgcggcgcagggcggctggcacaaacggcggcgccggggccggaggaaa aagctcggagtgcctagtgaggcctcggcagggagtgggacccccagagccacagccacc tctaccactgccagccctcttcgggacggttttggcgggcaggatggtggtgagctgcgg ccgctgcagagtgaaggcgctgcagcgctggtcaccaaggggtgccagcgattggcagcc cagggcgcacggcctgaggcccccaaacggaaatgggccgaggatggtggggatgcccct tcacccagcaaacggccctgggccaggcaagagaaccaggaggcagagcgggagggtggc atgagctgcagctgcagcagtggcagtggtgaggccagtgctgggctgatggaggaggcg ctgccctctgcgcccgagcgcctggccctggactatatcgtgccctgcatgcggtactac ggcatctgcgtcaaggacagcttcctgggggcagcactgggcggtcgcgtgctggccgag gtggaggccctcaaacggggtgggcgcctgcgagacgggcagctagtgagccagagggcg 
atcccgccgcgcagcatccgtggggaccagattgcctgggtggaaggccatgaaccaggc tgtcgaagcattggtgccctcatggcccatgtggacgccgtcatccgccactgcgcaggg cggctgggcagctatgtcatcaacgggcgcaccaaggccatggtggcgtgttacccaggc aacgggctcgggtacgtaaggcacgttgacaatccccacggcgatgggcgctgcatcacc tgtatctattacctgaatcagaactgggacgttaaggtgcatggcggcctgctgcagatc ttccctgagggccggcccgtggtagccaacatcgagccactctttgaccggttgctcatt ttctggtctgaccggcggaacccccacgaggtgaagccagcctatgccaccaggtatgac ctgtacttctggagacgcacccagcatcaggacagaaaggtgtccaagtacctgtatcac agccgcctacgcccacctagtggccagtcccagagccgcatggcagacagcttaaatgac ttcaggagagccctgggcctgtgctggctgctccttccctgccaccgctgctgcttctga >gi568815579f:40651409_40865164|GENSCAN_predicted_peptide_8|777_aa MGGTVYRGHPSKRQVMVESGDGEPFDPTFVLSRSRSNIICSVLFGSRFDYDDERLLTIIR LINDNFQIMSSPWGELYDIFPSLLNWVPGPHQRIFQNFKCLRDLIAHSVHDHQASLDPRS PRDFIHCFLTKMAEGPQMGQGWGTFSLEKLKHLAQLGWSQEPLAATANLQLIKKSSKEKK EDPLSHFHMDTLLMTTHNLLFGGTETVGTTLRHAFLAFMKYPKVQAHVQEEINLVVGHVR LPALKDRAAMPYTDMVIHEVQRFADIIPMNLPHRITRDTAFHGFLIPKGTDVITLLNTVH YDPSQFLTPQEFNPEHFLDANQSFKKSPAFMPFSAGHRLCLGESLARMELFLYLTAILQS FSLQPLGAPEDIDLTPLSSGMGERNSGFVKLSGRWGRVFTVRLGPRPAVGLCGYAALRDA LVLQADAVSGRGSMAVFERFTRGNRILFSNRPCWWTLRNFALGALKKFGLGTRTVEARVL EEAACLLDEFQATIASAWPGERKGLEPRLTPCGYWIMLYPMLSVLVFGNRYRYGDPEFLR LLNLFSDNFCIISSRWGEMYICLSLMDWLPGPHHRIFRNFSELRVISEQIQRHWQMRQPA EPRDFIDCLTRWVQELDPVVGWRPAPSLDYRVCLPYANAVLLEIQCFISVVPLGLPRTLT LDTHLHSHCLPKASWMEVVQVLTKPAGNPSCMYPQGTFVIPLLVTAHRDPTQFKDPDCFN PTNFLDKGKFQGNDAFMPFASEVLPAPCGTPWHHQPHLQCTGLGSVPPDFQLQPVAC >gi568815579f:40651409_40865164|GENSCAN_predicted_CDS_8|2334_bp atgggtggtacagtttatagaggccacccttctaaaagacaggtcatggtggagagtgga gatggcgagccctttgaccccacgtttgtgctgagtcgctcacggtccaacattatctgt tccgtgctcttcggcagccgcttcgactatgatgatgagcgtctgctcaccattatccgc cttatcaatgacaactttcaaatcatgagcagcccctggggcgagttgtacgacatcttc ccgagcctcctgaactgggtgcctgggccgcaccaacgcatcttccagaacttcaagtgc ctgagagacctcatcgcccacagcgtccacgaccaccaggcctcgctagaccccagatct ccccgggacttcatccactgcttcctcaccaagatggcagagggaccccagatggggcag 
ggttgggggaccttctccctggagaagctgaagcatctggcccagctgggctggagccag gagccactcgcagccaccgccaacttacagctcattaagaagtcatcaaaggagaagaag gaggacccgctgagccacttccacatggataccctgctgatgaccacacataacctgctc tttggcggcaccgagacggtgggcaccacgctgcgccacgccttcctggcattcatgaag tacccgaaagttcaagcccacgtgcaggaggagatcaaccttgtggtgggacacgtgcgg ctgccagcgctgaaggaccgcgcggccatgccttacacagacatggtgatccacgaggtg cagcgctttgcagacatcatccccatgaacttgccgcaccgcatcactagggacacggcc tttcacggcttcctgatacccaagggcaccgatgtcatcaccctccttaacaccgtccac tacgaccccagccagttcctgacgccccaggagttcaaccccgagcattttttggatgcc aatcagtccttcaagaagagtccagccttcatgcccttctcagctgggcaccgtctgtgc ctgggagagtcgctggcgcgcatggagctctttctgtacctcaccgccatcctgcagagc ttttcgctgcagccgctgggtgcgcccgaggacatcgacctgaccccgctcagctcagga atgggggaacggaattctggattcgtgaagctctccggccgctggggccgggtgttcaca gtgcggctgggcccgcgccctgcggtggggctgtgcggctacgcagcgctgcgggacgcg ttagtgctacaggcggatgcggtctctggccgcgggtccatggcagtcttcgaacgcttc acacgcggaaacagaatcttgttttctaaccggccgtgctggtggacactgcgcaatttt gcacttggagcgcttaagaagttcgggttgggtacgcggaccgtcgaggcgcgcgtcctg gaggaggcggcttgtctgctagacgaatttcaagccaccattgcttcggcctggccggga gagcggaagggcctggagccccgtttgaccccgtgcggctactggataatgctgtatcca atgttatctgttcttgtcttcgggaaccgctatcgctatggggacccggagttcctgagg ctcctgaacctcttcagtgacaacttctgcatcattagttccagatggggcgagatgtac atttgcctgtccctcatggactggctcccgggcccgcaccaccgaatcttccgaaacttt tcggagctgcgggtcatctctgagcaaattcaacgacactggcagatgcggcagccagcg gagccccgcgatttcattgattgcttgaccagatgggtgcaggagctggaccctgtggta gggtggaggcccgccccaagcctggactatcgcgtgtgcctgccctacgccaacgcagtg ctgctcgagatccagtgcttcatcagcgtggtgcccctggggctgccgcgcaccctcacc ctcgacacccacctgcacagccactgtctgcccaaagcgtcttggatggaggtggtgcag gtgctcaccaagcccgcaggtaacccaagttgcatgtatccccagggcacttttgtgatt cccctgcttgtgactgcacaccgggaccccactcaattcaaagacccagactgcttcaac cctaccaacttcctggacaagggcaagttccagggcaatgatgctttcatgccctttgcc tcagaggttctgcctgctccctgtggtacgccctggcaccatcaacctcacctgcagtgc actggcctgggcagtgtccccccagacttccagctccagccagtggcctgctga 
>gi568815579f:40651409_40865164|GENSCAN_predicted_peptide_9|494_aa MLASGMLLVALLVCLTVMVLMSVWQQRKSKGKLPPGPTPLPFIGNYLQLNTEQMYNSLMK ISERYGPVFTIHLGPRRVVVLCGHDAVREALVDQAEEFSGRGEQATFDWVFKGYGVVFSN GERAKQLRRFSIATLRDFGVGKRGIEERIQEEAGFLIDALRGTGGANIDPTFFLSRTVSN VISSIVFGDRFDYKDKEFLSLLRMMLGIFQFTSTSTGQLYEMFSSVMKHLPGPQQQAFQL LQGLEDFIAKKVEHNQRTLDPNSPRDFIDSFLIRMQEEEKNPNTEFYLKNLVMTTLNLFI GGTETVSTTLRYGFLLLMKHPEVEAKVHEEIDRVIGKNRQPKFEDRAKMPYMEAVIHEIQ RFGDVIPMSLARRVKKDTKFRDFFLPKGTEVYPMLGSVLRDPSFFSNPQDFNPQHFLNEK GQFKKSDAFVPFSIGKRNCFGEGLARMELFLFFTTVMQNFRLKSSQSPKDIDVSPKHVGF ATIPRNYTMSFLPR >gi568815579f:40651409_40865164|GENSCAN_predicted_CDS_9|1485_bp atgctggcctcagggatgcttctggtggccttgctggtctgcctgactgtaatggtcttg atgtctgtttggcagcagaggaagagcaaggggaagctgcctccgggacccaccccattg cccttcattggaaactacctgcagctgaacacagagcagatgtacaactccctcatgaag atcagtgagcgctatggccccgtgttcaccattcacttggggccccggcgggtcgtggtg ctgtgtggacatgatgccgtcagggaggctctggtggaccaggctgaggagttcagcggg cgaggcgagcaagccaccttcgactgggtcttcaaaggctatggcgtggtattcagcaac ggggagcgcgccaagcagctccggcgcttctccatcgccaccctgcgggacttcggggtg ggcaagcgaggcatcgaggagcgcatccaggaggaggcgggcttcctcatcgacgccctc cggggcactggcggcgccaatatcgatcccaccttcttcctgagccgcacagtctccaat gtcatcagctccattgtctttggggaccgctttgactataaggacaaagagttcctgtca ctgttgcgcatgatgctaggaatcttccagttcacgtcaacctccacggggcagctctat gagatgttctcttcggtgatgaaacacctgccaggaccacagcaacaggcctttcagttg ctgcaagggctggaggacttcatagccaagaaggtggagcacaaccagcgcacgctggat cccaattccccacgggacttcattgactcctttctcatccgcatgcaggaggaggagaag aaccccaacacggagttctacttgaaaaacctggtgatgaccacgttgaacctcttcatt gggggcaccgagaccgtcagcaccaccctgcgctatggcttcttgctgctcatgaagcac ccagaggtggaggccaaggtccatgaggagattgacagagtgatcggcaagaaccggcag cccaagtttgaggaccgggccaagatgccctacatggaggcagtgatccacgagatccaa agatttggagacgtgatccccatgagtttggcccgcagagtcaaaaaggacaccaagttt cgggatttcttcctccctaagggcaccgaagtgtaccctatgctgggctctgtgctgaga gaccccagtttcttctccaacccccaggacttcaatccccagcacttcctgaatgagaag gggcagtttaagaagagtgatgcttttgtgcccttttccatcggaaagcggaactgtttc 
ggagaaggcctggccagaatggagctctttctcttcttcaccaccgtcatgcagaacttc cgcctcaagtcctcccagtcacctaaggacattgacgtgtcccccaaacacgtgggcttt gccacgatcccacgaaactacaccatgagcttcctgccccgctga >gi568815579f:40651409_40865164|GENSCAN_predicted_peptide_10|58_aa VSELSILEQFEQDVSPGGANLTHLHLSKVGDPEDRCNPKWSNHHSLSCGHLGPLSGSG >gi568815579f:40651409_40865164|GENSCAN_predicted_CDS_10|177_bp gtctcagagctcagcatcctagaacagtttgagcaagatgtcagtcctgggggtgccaac ctcacccacctccacttgtccaaagttggtgacccagaggacaggtgcaaccctaaatgg tccaaccaccacagcctgtcctgtggccatctggggcccctctcagggtctggctga