GENSCAN 1.0 Date run: 2-Nov-116 Time: 19:16:30 Sequence gi568815583r:34756795_35069613 : 312819 bp : 39.93% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 2785 2883 99 2 0 63 66 69 0.058 1.36 1.02 Term + 21844 21914 71 2 2 97 45 82 0.041 1.92 1.03 PlyA + 26401 26406 6 1.05 2.15 PlyA - 31145 31140 6 1.05 2.14 Term - 33761 33618 144 2 0 77 44 93 0.880 0.73 2.13 Intr - 34501 34320 182 2 2 72 98 235 0.985 21.57 2.12 Intr - 35487 35296 192 1 0 83 97 295 0.997 28.44 2.11 Intr - 35775 35614 162 1 0 74 105 284 0.988 27.73 2.10 Intr - 36775 36451 325 1 1 97 85 367 0.999 31.72 2.09 Intr - 37158 37048 111 0 0 57 100 45 0.824 2.26 2.08 Intr - 38024 37886 139 1 1 5 97 196 0.969 11.75 2.07 Intr - 39286 38712 575 1 2 58 113 219 0.609 12.01 2.06 Intr - 42700 42591 110 2 2 62 82 88 0.688 4.68 2.05 Intr - 44015 43955 61 2 1 113 90 55 0.941 5.59 2.04 Intr - 63029 62769 261 2 0 117 74 110 0.556 9.26 2.03 Intr - 76437 76306 132 0 0 24 31 135 0.152 1.32 2.02 Intr - 81800 81523 278 0 2 -26 19 481 0.090 26.11 2.01 Init - 83016 82911 106 0 1 69 -12 85 0.342 -2.96 2.00 Prom - 87681 87642 40 -5.05 3.00 Prom + 88998 89037 40 -6.95 3.01 Sngl + 93128 93541 414 1 0 86 53 350 0.934 27.34 3.02 PlyA + 93875 93880 6 1.05 4.26 PlyA - 93921 93916 6 1.05 4.25 Term - 100312 99998 315 1 0 63 42 394 0.997 26.26 4.24 Intr - 103361 103248 114 2 0 121 109 -27 0.841 2.42 4.23 Intr - 106247 106073 175 1 1 53 97 113 0.988 7.72 4.22 Intr - 110815 110730 86 1 2 111 94 72 0.945 7.80 4.21 Intr - 114128 113958 171 2 0 64 75 135 0.996 9.02 4.20 Intr - 117205 117034 172 0 1 -4 110 135 0.999 5.52 4.19 Intr - 118070 117883 188 2 2 90 93 209 0.998 19.17 4.18 Intr - 119212 119141 72 1 0 71 110 33 0.848 2.48 4.17 Intr - 125845 125708 138 1 0 115 98 159 0.999 19.34 4.16 Intr - 127940 127731 210 2 0 103 94 116 0.999 11.89 4.15 Intr - 129867 129732 136 2 1 37 110 96 0.911 6.35 4.14 Intr - 133530 133421 110 0 2 87 99 190 0.996 18.16 4.13 Intr - 136979 136869 111 2 0 97 99 78 0.993 9.46 4.12 Intr - 140172 140103 70 2 1 67 95 87 0.999 5.57 4.11 Intr - 140911 140765 147 0 0 29 43 185 0.988 6.53 4.10 Intr - 144069 143828 242 0 2 85 103 187 0.980 15.33 4.09 Intr - 147711 147542 170 1 2 66 86 37 0.952 0.04 4.08 Intr - 147988 147860 129 2 0 34 92 101 0.950 4.85 4.07 Intr - 149918 149751 168 0 0 98 56 200 0.988 16.80 4.06 Intr - 153519 153341 179 2 2 87 115 168 0.997 18.04 4.05 Intr - 158385 158244 142 1 1 106 49 112 0.981 7.59 4.04 Intr - 161584 161464 121 1 1 105 86 154 0.954 16.05 4.03 Intr - 173577 173464 114 0 0 61 91 87 0.983 6.02 4.02 Intr - 175640 175524 117 2 0 82 68 73 0.952 4.44 4.01 Init - 177830 177777 54 2 0 82 94 39 0.780 5.23 4.00 Prom - 182659 182620 40 -6.55 5.00 Prom + 184007 184046 40 -4.25 5.01 Sngl + 186286 186603 318 0 0 87 42 200 0.981 11.02 5.02 PlyA + 186627 186632 6 1.05 6.10 PlyA - 188861 188856 6 1.05 6.09 Term - 190321 190227 95 2 2 82 49 76 0.635 0.11 6.08 Intr - 190507 190417 91 1 1 45 74 36 0.003 -3.45 6.07 Intr - 207496 207440 57 1 0 84 115 19 0.385 2.36 6.06 Intr - 213000 212745 256 2 1 29 99 300 0.878 21.72 6.05 Intr - 226389 224569 1821 2 0 39 -52 645 0.000 31.97 6.04 Intr - 236600 236473 128 2 2 32 93 125 0.233 5.96 6.03 Intr - 243985 243880 106 0 1 33 113 91 0.184 5.40 6.02 Intr - 258808 258694 115 2 1 130 78 76 0.446 9.39 6.01 Init - 259936 259891 46 1 1 64 98 -21 0.440 -2.59 6.00 Prom - 260846 260807 40 -9.45 7.04 PlyA - 261095 261090 6 1.05 7.03 Term - 262840 262679 162 1 0 5 37 253 0.545 8.85 7.02 Intr - 282499 282415 85 0 1 37 100 99 0.043 4.90 7.01 Init - 287573 287518 56 2 2 113 100 13 0.023 3.82 7.00 Prom - 295723 295684 40 -4.95 8.00 Prom + 300816 300855 40 -7.65 8.01 Init + 302413 302496 84 0 0 71 59 101 0.919 6.37 8.02 Term + 304866 305057 192 2 0 80 38 155 0.992 6.14 8.03 PlyA + 305916 305921 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 52686 52736 51 2 0 70 116 40 0.812 6.21 S.002 Init - 215087 215059 29 2 2 79 55 42 0.843 -0.88 S.003 Sngl - 288238 288074 165 1 0 75 48 185 0.910 7.63 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583r:34756795_35069613|GENSCAN_predicted_peptide_1|56_aa XRGPARQAIAGLPAECLSSLYERDEEIAGEMHIGASGYSNSVWTLVDGNFNPTFDD >gi568815583r:34756795_35069613|GENSCAN_predicted_CDS_1|171_bp nccagaggcccagcaaggcaggctatagcagggcttcctgctgagtgcctttctagtctg tatgaaagagatgaggaaattgcaggggaaatgcacatcggggcctctggttactctaat tctgtgtggaccctggttgatggaaacttcaatccaacctttgacgattga >gi568815583r:34756795_35069613|GENSCAN_predicted_peptide_2|925_aa MGKGSQGPVPTGQYYMLGVTASGVKENKKRLENVMGNHDGDHDGDDDGEGEGEGGDNDED GGGGEGDCDDDDDDGDGDGNDGDGDDGEGGDDDSDGVGSCDYDGDCGSDDDGGGDNNVMM VMVMVMIMVEESTLAPSAKNAARGTIYESMSESSIDTESAGVFILDFPTSRIVPWQSSNY APSVPVSILLFVVPKLDIVTIAMAPVALLSSWQLAQASNLLWFQSESLPLDNQIAQNLME ITNGNLHTQYNRNRLTLTGVRLRSDIVMVGLSRKDGFAPVCPARLPHGQSTPKRASLALK SPGDPPQLCHPTGIYKPLRCAAAVSFVLVAALEDEKPLLLPCCPGPAVRTPQQRAGRQAF PLHVAYCPQGWQGCGGPNPQTIQGAPTPQKGGGVGWRHLVFPCPLPFSACPSPAPYLAIP LTAPSPSLHGLGAPWLILSPALGSMNGLGSPSGCEGDQIRQGGRPGPPPLPPAAPTDPVH QRSIKRPSWSQPPRARCRRSRADPPRRRCAKMCDDEETTALVCDNGSGLVKAGFAGDDAP RAVFPSIVGRPRHQEKYNVSLIQSFQNAVIYIPVDMGDELNNLMKAYDIFKGVMVGMGQK DSYVGDEAQSKRGILTLKYPIEHGIITNWDDMEKIWHHTFYNELRVAPEEHPTLLTEAPL NPKANREKMTQIMFETFNVPAMYVAIQAVLSLYASGRTTGIVLDSGDGVTHNVPIYEGYA LPHAIMRLDLAGRDLTDYLMKILTERGYSFVTTAEREIVRDIKEKLCYVALDFENEMATA ASSSSLEKSYELPDGQVITIGNERFRCPETLFQPSFIGMESAGIHETTYNSIMKCDIDIR KDLYANNVLSGGTTMYPGIADRMQKEITALAPSTMKIKIIAPPERKYSVWIGGSILASLS TFQQMWISKQEYDEAGPSIVHRKCF >gi568815583r:34756795_35069613|GENSCAN_predicted_CDS_2|2778_bp atggggaaaggaagccagggaccagtcccaacagggcagtattacatgttgggtgtcaca gcatctggagtcaaggagaacaagaagagattggaaaatgtaatgggtaatcatgatggt gatcatgatggtgatgacgatggtgaaggtgagggtgagggtggtgataatgatgaggat ggtggtggcggtgagggtgattgtgatgatgatgatgatgatggtgatggagatggtaat gatggagatggtgatgatggtgagggtggtgatgatgacagtgatggtgttggtagttgt gattatgatggtgattgtggtagtgatgatgatggtggtggtgacaataatgtaatgatg gtgatggtgatggtgatgattatggttgaagagagcaccctagccccttcagccaagaat gcagcaaggggcaccatctatgaatctatgagtgagtcttctatagacactgaatcagct ggtgtcttcatcttggacttcccaacctccagaattgttccctggcagtcttctaactat gccccctctgtaccagttagcattctgctctttgttgtcccaaaactagacattgtgaca attgcaatggcacctgtggccctcctttcatcgtggcagctagctcaagccagcaactta ctttggttccaatcagaatctttgcctttagataatcaaattgcacaaaacctaatggaa ataaccaatggtaatctgcatacacaatacaataggaacaggcttacccttactggggtc aggcttagatcagacattgtcatggttgggctctccaggaaagatggatttgctccagtt tgccctgcacgtctccctcatggccagtccacgccgaaaagggcttcactggccctcaag agccctggggacccgccccagctctgccatcccactgggatctacaagccactcagatgt gctgctgcggtgtcctttgtgctggtggcagccctggaagatgagaagccgctgttgctc ccctgctgccctggcccagctgtcaggacccctcagcagagggcagggcgccaagccttc ccactgcatgtggcttattgtccccaaggctggcagggctgcggaggaccgaatccacag accatccagggagcacccacaccccagaaagggggaggggtgggctggcgtcacttagtc ttcccctgccccctacccttcagcgcctgcccctccccagctccctatttggccatcccc ctgactgccccctccccttccttacatggtctgggggctccctggctgatcctctcccct gcccttggctccatgaatggcctcggcagtcctagcgggtgcgaaggggaccaaataagg caaggtggcagaccgggccccccacccctgcccccggctgctccaactgaccctgtccat cagcgttctataaagcggccctcctggagccagccacccagagcccgctgccgccggagc cgagccgacccgccccgccgacgctgtgccaagatgtgtgacgacgaggagaccaccgcc ctggtgtgcgacaacggctctgggctggtgaaggccggctttgcgggcgatgacgcgccc cgcgctgtcttcccgtccatcgtgggccgcccgcggcaccaggagaaatacaatgtgtca ttaattcagtcattccaaaatgcagtcatctatattccagttgacatgggtgatgagctt aataacttaatgaaggcatatgatatttttaagggagttatggtgggtatgggtcagaag gactcctacgtaggtgatgaagcccagagcaagagaggcatcctgaccctgaagtatccc atcgagcatggtatcatcaccaactgggacgacatggagaagatctggcaccacaccttc tacaatgagctccgtgtggctcccgaggagcaccccaccctgctcacagaggccccgctg aaccccaaggccaaccgggagaagatgactcagatcatgtttgagaccttcaatgtccct gccatgtacgtggccatccaggcagtgctatccctgtatgcttctggccgtaccacaggc attgttctggactctggggatggtgtaactcacaatgtccccatctatgagggctacgct ttgccccatgccatcatgcgtctggatctggctggtcgggacctcactgactacctcatg aagatcctcactgagcgtggctactcctttgtcaccactgctgaacgtgaaattgtccgt gacattaaagagaagctgtgctatgtcgccctggattttgagaatgagatggccacagct gcctcttcctcctccctggagaagagctatgaactgcctgatggccaagtcatcactatt ggcaatgagcgcttccgctgtcctgagacactcttccagccctccttcattggtatggaa tctgctggcatccatgaaacaacttacaatagcatcatgaagtgtgacattgatatccgc aaggacctgtatgccaacaatgtcttatctggaggcaccactatgtaccctggtattgct gatcgtatgcagaaggaaatcactgctctggctcctagcaccatgaagattaagattatt gctccccctgagcgtaaatactctgtctggattgggggctccatcctggcctctctgtcc accttccagcaaatgtggattagcaagcaagagtacgatgaggcaggcccatccattgtc caccgcaaatgcttctaa >gi568815583r:34756795_35069613|GENSCAN_predicted_peptide_3|137_aa MKRQTFYQQGGEPRTSSPTKAMENQEALKTCALVGQLENLVQHEASDLLANSIVPLGIVI GSIFLACDELLRVEELVVDGNVNFVNECGLQVYKHCPGLMLASTCLTEDVKGVISPSGLV TWHLAIGLHAVFQAAEL >gi568815583r:34756795_35069613|GENSCAN_predicted_CDS_3|414_bp atgaagagacagacattctatcagcagggaggtgaacccagaaccagttccccaaccaaa gctatggaaaaccaagaagccctgaagacttgtgcactggttggccagcttgagaatttg gtccaacatgaggccagtgatctcctcgccaatagtatagtgcccttgggcatagttatt ggcagcatcttccttgcctgtgatgagctgctcagggtggaagagctagtggtagatggc aacgtgaattttgtcaatgaatgtgggctccaggtctacaaacactgccctgggcttatg cttgccagcacctgtctcaccgaagatgttaaaggagtcatctccccgagtggtcttgtc acttggcacctggccatcgggctgcatgccgtgttccaggccgcagagctctga >gi568815583r:34756795_35069613|GENSCAN_predicted_peptide_4|1216_aa MDKVHYCERFIELMIDLEALLPTRRWFNTILDDSHLLVHCYLSNLVRREEDGHLFSQLLD MLKFYTGFEINDQTGNALTENEMTTIHYDRITSLQVSRHERRISQIQQLNQMPLYPTEKI IWDENIVPTEYYSGEGCLALPKLNLQFLTLHDYLLRNFNLFRLESTYEIRQDIEDSVSRM KPWQSEYGGVVFGGWARMAQPIVAFTVVEVAKPNIGENWPTRVRADVTINLNVRDHIKDE WEGLRKHDVCFLITVRPTKPYGTKFDRRRPFIEQVGLVYVRGCEIQGMLDDKGRVIEDVE CVYKLLQVMVLHRKHYMTAMEGNSKNLEKEENTFSILEFEVGPEPRPNLRGESRTFRVFL DPNQYQQDMTNTIQNGAEDVYETFNIIMRRKPKENNFKAVLETIRNLMNTDCVVPDWLHD IILGYGDPSSAHYSKMPNQIATLDFNDTFLSIEHLKASFPGHNVKVTVEDPALQIPPFRI TFPVRSGKGKKRKDADVEDEDTEEAKTLIVEPHVIPNRGPYPYNQPKRNTIQFTHTQIEA IRAGMQPGLTMVVGPPGTGKTDVAVQIISNIYHNFPEQRTLIVTHSNQALNQLFEKIMAL DIDERHLLRLGHGEEELETEKDFSRYGRVNYVLARRIELLEEVKRLQKSLGVPGDASYTC ETAGYFFLYQVMSRWEEYISKVKNKGSTLPDVTEVSTFFPFHEYFANAPQPIFKGRSYEE DMEIAEGCFRHIKKIFTQLEEFRASELLRSGLDRSKYLLVKEAKIIAMTCTHAALKRHDL VKLGFKYDNILMEEAAQILEIETFIPLLLQNPQDGFSRLKRWIMIGDHHQLPPVIKNMAF QKYSNMEQSLFTRFVRVGVPTVDLDAQGRARASLCNLYNWRYKNLGNLPHVQLLPEFSTA NAGLLYDFQLINVEDFQGVGESEPNPYFYQNLGEAEYVVALFMYMCLLGYPADKISILTT YNGQKHLIRDIINRRCGNNPLIGRPNKVTTVDRFQGQQNDYILLSLVRTRAVGHLRDVRR LVVAMSRARLGLYIFARVSLFQNCFELTPAFSQLTARPLHLHIIPTEPFPTTRKNGERPS HEVQIIKNMPQMANFVYNMYMHLIQTTHHYHQTLLQLPPAMVEEGEEVQNQETELETEEE AMTVQADIIPSPTDTSCRQETPAFQTDTTPSETGATSTPEAIPALSETTPTVVGAVSAPA EANTPQDATSAPEETK >gi568815583r:34756795_35069613|GENSCAN_predicted_CDS_4|3651_bp atggacaaagttcattactgtgaaagattcattgaacttatgattgatctagaggccctg ctacccacaaggcgctggtttaataccatcctggatgattcccaccttctggttcactgt tacctttccaatcttgttcgtagagaagaggatggccatcttttttcccagcttttggac atgcttaaattctatactggttttgaaattaatgaccaaactggaaatgctctgacagag aatgagatgaccacaattcactatgatagaattacttctctacaggtatctcgtcatgaa cgtcgaatttctcagattcagcagttgaaccagatgcctttgtatccaactgagaaaatt atatgggatgaaaatattgtcccaactgagtactattctggagaaggttgtcttgctctt cccaaattgaatttgcagtttttgactcttcatgactacctgctaaggaactttaacctc ttccgcttagaatcaacttatgaaattcgtcaggacattgaagatagtgtcagcagaatg aagccatggcaatctgaatatggcggtgtagtgtttggtggttgggcgcgaatggcccag cccattgtggctttcactgtcgttgaagtggccaaacccaacataggtgaaaactggcca acccgagttcgtgcagatgttaccataaatctcaatgtcagagatcacatcaaagatgaa tgggaaggtcttcgtaagcatgatgtatgctttttaattaccgtacgtcccacaaaacct tatggcactaagtttgaccggaggagaccttttattgagcaggttggcctggtttatgtc agaggctgtgaaattcagggcatgctggatgataaaggacgtgtcattgaagatgttgag tgtgtgtacaaattattgcaagtcatggtgttgcatcgtaaacattacatgactgctatg gaaggaaatagtaagaatttagagaaagaagaaaatactttcagcatcttagaatttgaa gtgggacctgaacccagacccaatcttagaggagaatcaaggacatttagagtgtttttg gatccaaaccagtatcaacaagatatgaccaatactatacaaaatggagcagaggatgtg tatgaaacttttaatataataatgaggagaaaaccaaaggaaaataactttaaggctgtg ctggagactattcggaacctgatgaatactgattgtgtggtacctgactggctgcacgat atcattttaggttatggggacccaagtagtgcacattattcgaaaatgcccaatcagatt gccacccttgatttcaatgatacatttctctccattgagcatttaaaagccagcttccct ggtcataatgttaaagtaactgtagaagaccctgctctacaaataccccctttcaggata acttttccagtaagaagtggaaaagggaagaaaaggaaagatgcggatgtggaagatgaa gacaccgaggaagcaaaaaccttaattgttgagccccatgttattcctaataggggtcct tatccttataatcaacccaaacgtaatacgattcagttcactcatacacagatagaagcc atccgtgctggaatgcagcctgggctgactatggttgtgggcccacctggtacaggcaaa acagatgtggcagttcagatcatatccaacatctaccacaacttcccagaacagaggact ctaattgttactcattccaatcaggccctaaaccagttgtttgagaaaatcatggcatta gacattgatgagcgccacctactgcgtcttggtcatggagaagaagagctggagacagag aaagatttcagcaggtatggaagagttaattatgttctggctcgaagaatagaactttta gaagaagtcaaacgattgcaaaagagtctaggggttccaggagatgcctcatatacctgt gaaactgcaggctatttcttcttataccaggtaatgtctcgctgggaagagtatatcagc aaagtgaaaaataaaggtagtacattgccagatgttacggaagtctccactttcttccct ttccatgaatactttgcaaatgctcctcaacccatttttaaaggaagatcttatgaagaa gacatggaaattgctgaaggatgtttcaggcatattaagaaaatctttacgcagcttgag gaattcagagcctctgaattgcttcgaagtggactggacagatctaaataccttttagtg aaagaagccaaaattattgctatgacctgtactcatgctgccttaaaacgacatgacttg gtcaagctaggtttcaagtatgacaacattttgatggaagaggctgctcagattctggag atagaaacttttatccctcttcttctacagaatcctcaggatggatttagccgactaaaa cgatggattatgattggcgatcatcaccagttacctccagttattaagaacatggccttt caaaagtactcaaacatggagcagtctctcttcactcgctttgttcgcgttggagttccg actgttgaccttgatgctcaagggagagccagagcaagcttgtgcaacctctacaactgg cgatacaagaatctaggaaacttaccccatgtgcagctcttgccagagtttagtacagca aatgctggcttactgtatgacttccagctcattaatgttgaagattttcaaggagtggga gaatctgaacctaatccttacttctatcagaatcttggagaggcagaatatgtagtagca ctttttatgtacatgtgtttacttggttaccctgctgacaaaatcagtattctaacaaca tataatggccaaaagcatcttattcgcgacatcatcaatagacgatgtggaaacaatcca ttgattggaagaccaaacaaggtgacaactgttgatagatttcaaggtcaacagaatgac tatattcttctttctctggtacgaaccagggcagtgggccatctgagggatgtccgtcgc ttggtagtggccatgtctagagccagacttggactttatatcttcgccagagtatccctc ttccaaaactgttttgaactgactccagctttcagtcagctcacagctcgcccccttcat ttgcatataattccaacagaacctttcccaactactagaaagaatggagagagaccatct catgaagtacaaataataaaaaatatgccccagatggcaaactttgtatacaacatgtac atgcatttgatacagactacacatcattatcatcagactttattacaactaccacctgct atggtagaagagggtgaggaagttcaaaatcaagaaacagaattggaaacagaagaagag gccatgactgttcaagctgacatcatacccagtccaacagacaccagctgccgtcaagaa actccagcctttcaaactgacaccacccccagtgagacaggagccacttccactccagaa gccatccctgctttatctgagaccacccctactgtggtaggagctgtatctgcaccggca gaagctaacacacctcaggatgccacatctgccccggaagagaccaagtag >gi568815583r:34756795_35069613|GENSCAN_predicted_peptide_5|105_aa MVNVPKTRRTFCKKCGKHQPHKVTQYKKGKDSLYAQGKRRYDRKQSGYGGQTKPIFWKKA KTTKIVLRLECIEPNCRSKRMLAIKRCKHFELGGDKKRKGQVIQF >gi568815583r:34756795_35069613|GENSCAN_predicted_CDS_5|318_bp atggttaacgtccctaaaacccgccggactttctgtaagaagtgtggcaagcaccaaccc cataaagtgacacagtacaagaagggcaaggattctctgtatgcccagggaaagcggcgt tatgacaggaagcagagtggctatggtgggcaaactaagccaattttctggaaaaaggct aaaactacaaagattgtgctaaggcttgagtgcattgagcccaactgcagatctaagaga atgctggctattaaaagatgcaagcattttgaactgggaggagataagaagagaaagggc caagtgattcagttctaa >gi568815583r:34756795_35069613|GENSCAN_predicted_peptide_6|904_aa MWENMSGVCMWSQSWGVDYTAQNVVKGQLCMVFGVVDRAWICGVESEHLQVSVSHGAKHV SGPGLGEAKGEHLAGVEGTVGWMLPKASKAVGKVDPLSGYKGCIQWCREVGLDSKDEQVR SKPYGGFLPASSICQRHFKNLKTFVKHQQLHNETYQNNVKQVRRLLEAKQEKSMYGVYNT FTTEERWALHPCSKSDPMYSMKRRKNIHACTICGKMFPSQSKLDRHVLIHTGQRPFKCVL CTKSFRQSTHLKIHQLTHSEERPFQCCFCQKGFKIQSKLLKHKQIHTRNKAFRALLLKKR RTESRPLPNKLNANQGGFENGEIGESEENNPLDVHSIYIVPFQCPKCEKCFESEQILNEH SCFAARSGKIPSRFKRSYNYKTIVKKILAKLKRARSKKLDNFQSEKKVFKKSFLRNCDLI SGEQSSEQTQRTFVGSLGKHGTYKTIGNRKKKTLTLPFSWQNMGKNLKGILTTENILSID NSVNKKDLSICGSSGEEFFNNCEVLQCGFSVPRENIRTRHKICPCDKCEKVFPSISKLKR HYLIHTGQRPFGCNICGKSFRQSAHLKRHEQTHNEKSPYASLCQVEFGNFNNLSNHSGNN VNYNASQQCQAPGVQKYEVSESDQMSGVKAESQDFIPGSTGQPCLPNVLLESEQSNPFCS YSEHQEKNDVFLYRCSVCAKSFRSPSKLERHYLIHAGQKPFECSVCGKTFRQAPHWKRHQ LTHFKERPQGKVVALDSVMNAWAVVRVPYGTLKAEDIRPGAFVRRGVKRPGSNLREVDQG CRCPERSFSFHWSGGKSAAMAAPAQPKKIVAPTVSQINAEFVTQLACKYWAPHIKKKSPF DIKHLFNKAHLAPPLIHSTLSGYSTCFTEHRVGDTATIRFLNLFPTFPLFLFYKTAIVIM ARSQ >gi568815583r:34756795_35069613|GENSCAN_predicted_CDS_6|2715_bp atgtgggagaatatgagtggagtatgtatgtggtcccaaagttggggggtggattatact gcccaaaatgtggtcaaggggcagctgtgtatggtgtttggagttgtggatagagcttgg atatgtggggtagagtctgagcacttacaagtctctgtgagtcatggagctaaacatgtt tctggcccgggacttggagaagctaaaggagaacatctggcaggagttgaaggaactgtg ggctggatgttgcccaaggccagcaaggcagtgggcaaggtggaccctttgagtggttac aaggggtgcattcagtggtgtcgagaggttggtcttgactctaaggatgaacaagtacgg tcaaaaccctatgggggatttctgcctgcttccagtatttgtcagcgtcactttaaaaat ctgaagacatttgtgaagcaccaacaacttcacaatgaaacctatcagaataatgttaaa caggtcagaagattgctggaggccaagcaagaaaagtcaatgtatggagtgtataatact tttaccacagaggaaagatgggcattacacccgtgctctaagtctgatcccatgtatagc atgaaaagaagaaagaatattcatgcatgtacaatctgtggcaagatgtttccatcacag tcaaaacttgataggcatgtacttattcatactggtcagaggccttttaaatgtgtcttg tgtactaaatcttttcgacagtcaactcacttaaaaatccaccaacttacacattcagaa gaaagaccttttcaatgttgtttttgtcaaaaaggatttaagattcaaagcaaacttctg aagcataaacaaatccatactaggaataaggcttttcgggctcttttattaaagaagagg cgtacagaatctcgccccctgcctaataagttaaatgcaaatcagggtggttttgaaaat ggtgagattggtgaatctgaggagaataatccacttgatgtccactcaatttatattgtc ccttttcaatgtccaaagtgtgaaaagtgttttgaatcagagcagattctcaatgaacac agctgttttgctgctagaagtggcaaaattccaagcaggttcaaaagaagctacaactat aaaaccattgttaaaaaaatcttggccaagcttaagcgtgctaggagtaaaaaattagat aactttcaatctgagaaaaaagtatttaaaaagagtttcttgagaaattgtgatcttatt tctggtgagcagagctctgaacaaacccagagaacatttgtgggttctcttggcaaacat ggaacatataaaacaattggcaatagaaagaagaaaacattgactttgccattttcttgg caaaatatgggaaaaaatttgaaaggcatccttacgacagaaaacatattaagcattgat aattcagtgaataagaaagacttgtcaatctgtggttcatcaggtgaggaattctttaat aactgtgaggtacttcagtgtggtttttcagttccaagggaaaacatacgtactagacat aagatatgtccttgtgacaaatgtgagaaggtatttccttctatatccaaactaaaaaga cactatttaattcatactggacagaggccctttggctgtaatatttgtgggaaatctttt agacagtcagctcacttaaaaagacatgaacagactcataatgaaaagagtccttatgca tctctttgccaagtagaatttggaaacttcaacaatctttctaatcattcaggtaataat gttaactataatgcttcccaacaatgtcaggctcctggtgttcaaaaatacgaggtctca gagtcagatcaaatgtcaggagttaaggcagagtcacaggattttattcctggtagcacc gggcaaccctgtcttcctaatgtacttttggaatcagagcaaagcaatcctttttgcagt tattcagagcatcaggagaaaaatgatgtcttcctgtaccgatgcagtgtttgtgctaaa agtttccgatctccatctaaactggaaagacactacctaattcatgcagggcagaaacca tttgaatgctcagtttgtggcaaaacattcagacaggctcctcactggaagagacatcag cttactcactttaaagaacgaccacaagggaaagtggttgccttagattcggttatgaac gcttgggctgttgtacgggttccctacgggactctgaaggctgaggatatccggcccgga gcgtttgtgcggcgcggagttaagcgccccggcagtaacttaagggaagtggaccagggt tgccgctgcccagagcggtcctttagtttccactggagtggagggaagagtgctgccatg gcagcccctgcgcagcccaagaagatcgtggcccctacggtgtcccaaatcaatgcggag ttcgtgacccagttagcatgtaaatactgggctccccacatcaagaagaaatcacctttt gatataaagcatctgtttaacaaagcacatcttgcaccgcccttaatccattcaaccctg agtggatacagcacatgtttcacagagcacagggttggggacacggcaaccatccgattt ctcaatcttttccccacctttcccctctttctattctacaaaaccgccattgtcatcatg gcccgttctcaatga >gi568815583r:34756795_35069613|GENSCAN_predicted_peptide_7|100_aa MAHASNLSTLGGQGGRITRPKTTEVADGIEITTITFLFGREFCVTDKLLQLEEEEEEQEE EQEEEQEKKEEEKEEEEKKEEEEGGGGGGGRGRRRKGRKK >gi568815583r:34756795_35069613|GENSCAN_predicted_CDS_7|303_bp atggctcacgcctctaatctcagcactttgggaggccagggtgggcggatcacaagacca aaaacaacggaagtagcagatggaattgagattaccactattacatttctgtttggccgt gaattttgtgtaacagataagttactccagttggaggaggaggaggaggagcaggaggag gagcaggaggaggagcaggagaagaaggaggaggagaaggaggaggaggagaagaaggag gaggaggagggagggggaggaggaggagggagggggaggaggaggaaggggaggaagaaa taa >gi568815583r:34756795_35069613|GENSCAN_predicted_peptide_8|91_aa MALGKYEEKEKQQALFDCTEAVCADFSEELTQRRRTVSTRCDFISDLTNQQLPTLQPPTH QIILKNPDSRVFRETDLSTNKIPVSHTAGSA >gi568815583r:34756795_35069613|GENSCAN_predicted_CDS_8|276_bp atggctcttggcaaatatgaggaaaaggaaaagcaacaagccttatttgactgcactgag gctgtctgtgcagacttttcagaggaactgactcagcgcaggaggacagtttcaactcgc tgtgatttcatctctgacctgaccaatcagcagctccccactctccaaccccctacccac caaattatccttaaaaaccctgattcccgagttttcagggagactgatttgagtactaat aaaattccagtctcccatacagccggctctgcgtga