GENSCAN 1.0 Date run: 6-Nov-116 Time: 23:50:05 Sequence gi568815584f:87910848_88111858 : 201011 bp : 38.18% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 779 774 6 1.05 1.02 Term - 5439 5205 235 2 1 73 49 231 0.312 12.51 1.01 Init - 17063 16891 173 2 2 68 106 110 0.702 9.83 1.00 Prom - 22023 21984 40 -6.55 2.10 PlyA - 22582 22577 6 1.05 2.09 Term - 24031 23885 147 1 0 77 37 113 0.217 2.02 2.08 Intr - 25383 25296 88 2 1 68 56 66 0.134 0.45 2.07 Intr - 31465 31336 130 2 1 57 37 146 0.136 5.23 2.06 Intr - 32137 32106 32 0 2 45 94 35 0.402 -3.34 2.05 Intr - 34886 34706 181 0 1 13 99 175 0.942 9.10 2.04 Intr - 36407 36300 108 0 0 97 80 12 0.597 0.64 2.03 Intr - 37031 36881 151 2 1 71 97 195 0.996 17.51 2.02 Intr - 39084 38998 87 0 0 121 26 69 0.590 3.25 2.01 Init - 39618 39559 60 0 0 56 20 19 0.605 -6.80 2.00 Prom - 40172 40133 40 -9.55 3.00 Prom + 40652 40691 40 -2.55 3.01 Init + 42458 42917 460 1 1 101 29 257 0.835 17.06 3.02 Intr + 43166 43374 209 0 2 42 28 139 0.169 1.17 3.03 Term + 44027 44356 330 1 0 -2 54 302 0.239 11.47 3.04 PlyA + 44422 44427 6 1.05 4.11 PlyA - 44457 44452 6 1.05 4.10 Term - 49290 49270 21 0 0 125 41 3 0.279 -3.37 4.09 Intr - 52664 52537 128 0 2 68 52 166 0.954 10.48 4.08 Intr - 54782 54658 125 1 2 42 80 71 0.968 1.01 4.07 Intr - 57643 57488 156 0 0 132 88 105 0.987 13.10 4.06 Intr - 65641 65511 131 1 2 47 86 141 0.642 8.37 4.05 Intr - 71396 71358 39 2 0 112 93 13 0.587 1.70 4.04 Intr - 73686 73547 140 1 2 139 7 183 0.779 14.56 4.03 Intr - 75755 75642 114 0 0 87 91 91 0.984 8.80 4.02 Intr - 81807 81616 192 1 0 -7 90 158 0.475 5.04 4.01 Init - 82269 82074 196 0 1 49 8 361 0.238 21.34 4.00 Prom - 86997 86958 40 -5.45 5.00 Prom + 87143 87182 40 -6.95 5.01 Sngl + 100001 101014 1014 1 0 68 42 458 0.977 35.96 5.02 PlyA + 101145 101150 6 1.05 6.00 Prom + 103472 103511 40 -3.75 6.01 Init + 107548 107562 15 0 0 95 115 17 0.362 4.55 6.02 Intr + 110439 110645 207 2 0 -18 115 143 0.260 4.75 6.03 Term + 113242 113250 9 0 0 114 43 0 0.051 -4.68 6.04 PlyA + 114884 114889 6 1.05 7.04 PlyA - 115414 115409 6 1.05 7.03 Term - 119748 119638 111 0 0 65 37 103 0.418 0.48 7.02 Intr - 120283 120197 87 1 0 38 63 94 0.305 1.15 7.01 Init - 126746 126624 123 2 0 85 84 83 0.719 7.82 7.00 Prom - 132668 132629 40 -1.45 8.03 PlyA - 133173 133168 6 1.05 8.02 Term - 133882 133663 220 0 1 87 43 149 0.577 5.83 8.01 Init - 145942 145680 263 1 2 74 63 151 0.013 7.78 8.00 Prom - 163115 163076 40 -6.05 9.00 Prom + 166770 166809 40 -6.05 9.01 Init + 168344 168521 178 1 1 74 95 100 0.202 8.77 9.02 Intr + 173190 173308 119 1 2 116 26 43 0.688 0.16 9.03 Intr + 175981 176176 196 0 1 36 60 117 0.516 1.77 9.04 Term + 178852 178916 65 2 2 7 55 217 0.561 7.37 9.05 PlyA + 179113 179118 6 1.05 10.03 PlyA - 179361 179356 6 1.05 10.02 Term - 184497 184291 207 0 0 24 34 198 0.920 4.36 10.01 Init - 185286 185197 90 0 0 74 93 27 0.883 2.39 10.00 Prom - 190744 190705 40 -3.75 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 19704 20141 438 2 0 61 44 218 0.805 10.71 S.002 Sngl - 142934 142659 276 2 0 64 37 223 0.924 9.93 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584f:87910848_88111858|GENSCAN_predicted_peptide_1|135_aa MLCAGAGKWETVFWGMCQTPGEAPPKALLKLTRKPPFVVLVKLSGKPPVGLLVKIPANAP APCYHHHQCNCVHSHRQGPHPRSRAVSIAAVNAYIEAGTLAPASTLLQSMSVCSAGLLLP ATGTCKRGWMWLLPS >gi568815584f:87910848_88111858|GENSCAN_predicted_CDS_1|408_bp atgctgtgtgctggggctggcaagtgggaaactgtcttctggggtatgtgtcagactcct ggggaagctcctcctaaggcgctgctgaaactcactaggaaaccaccctttgtggtgctg gtgaaactttctgggaaaccacctgtaggattgttggtgaaaatcccggcaaacgcccca gccccgtgctaccaccaccaccagtgcaactgtgtgcacagtcaccggcagggaccccat ccccggagtcgtgctgtctccatcgctgctgtgaatgcctacatagaggctggcaccctg gcacctgctagcaccctgttgcagtccatgagcgtgtgctctgctgggctgctgctgcct gctactggcacatgcaaacgaggatggatgtggctgctaccgtcctag >gi568815584f:87910848_88111858|GENSCAN_predicted_peptide_2|327_aa MLCDFEEHLQKSLARKLFLVSEIPELQVWYTKLGKTSERFLFKQLDSLWLLDSDGSFTLS LHEDELFTLTTLTTGRKGSYPLPPKSQPFPSTYKDDFNVGRSVLKLLEKFLVIGDLGKPL GKKLEDMVIVVIFLVNYPFFSEAPNFADQTGVFEYFTNIEDPGEHHFTLRQVLNQRPITW AADASNTISIIGDYNCTFKLLMNQKGDRYFIGENGPPSGIPGRAASASPGNLLDMHIQGA APDVLNHAVGEVRKDLTEWVVLECRPECWEEQFEQTKWQGHFTSGMLNDKSLWTDIPVNF PKNGWAAIGTHSFEFAQFDNFLVEATR >gi568815584f:87910848_88111858|GENSCAN_predicted_CDS_2|984_bp atgctctgtgactttgaggaacacttacaaaagtcacttgcaagaaagttgttcttggtg agtgaaataccagagctacaggtatggtataccaaacttggaaaaacatccgaaagattt ctttttaagcagctggattctctatggctccttgacagcgatggcagtttcacactgagc ctgcatgaagatgagctgttcacactcaccactctcaccactggtcgcaaaggcagctac ccgcttcctccaaaatcccagcccttcccaagtacctataaggatgatttcaatgttgga agatctgtcctaaagttgttagaaaagtttttggttataggtgatttaggaaaaccactt ggaaaaaagttagaagacatggtcattgtagttatattcctagtgaattacccatttttt agtgaagctccaaactttgctgatcaaactggtgtatttgaatattttacaaatattgaa gaccctggcgagcatcacttcacgctacgccaagttctcaaccagagacccattacatgg gctgccgatgcatccaacacaatcagtattataggagactacaactgcactttcaaactt ctgatgaaccagaaaggagacagatacttcataggggaaaatggtcctccaagtggcatt cctggacgagcagcatcagcatcacctgggaacttgttagacatgcacattcaaggggct gccccagatgtactgaatcacgctgtgggggaggtgaggaaagacctcactgagtgggta gtacttgagtgcagacctgaatgctgggaagaacaatttgagcagaccaagtggcagggt catttcacctctggcatgctgaatgacaagtctctgtggacagacatccctgtgaatttt ccaaagaatggctgggctgcaattggaactcactcctttgaatttgcacagtttgacaac tttcttgtggaagccacacgctaa >gi568815584f:87910848_88111858|GENSCAN_predicted_peptide_3|332_aa METEPKASYGKIRISEENSIQLDDFTEAYESGQNQAYSLEHFSPVFPKTENSHIHINSDK GPEENTGSQELFSSEDELPPNEIRIELCSSGILCSQLNTFHKSAIKRSCTSEDKVGQSEA LSRVLQVAKKMKLISNAGDSVVEMDQRNVSEFKDVVIHEDQWVGETVLQSTFSSQLLNLG SYSSIQPEEYSSVVSDVVLQDLLAQVSSKHSYLRDLPPRQPQRKIYYRPALMTVVDGRHD VYIRVDSKLIEKILLNIFADCLNRVIVTSSEITYGMVMADLLHSLSAVSAEPCVSKIQSL FVLDESSYPLQQDFSLLDFYPDTVKHGADALF >gi568815584f:87910848_88111858|GENSCAN_predicted_CDS_3|999_bp atggagactgaaccaaaggcaagttacgggaagataagaatatctgaagagaattccatt caacttgatgattttacagaagcatatgaaagtggacaaaaccaagcatattctcttgaa cattttagtcctgtttttcctaaaacagaaaatagccacattcacataaactctgacaaa ggtcctgaagaaaatacaggatctcaagaacttttcagttctgaagatgaactgccacca aatgagatacgtattgagttgtgtagctcaggaatactgtgttcccaactaaacaccttc cacaaaagtgctattaaaagaagctgcacctctgaagataaagtgggccagtctgaagct ctatctagagtccttcaagtagctaagaaaatgaagttgatttctaatgcaggagattct gttgtagaaatggatcagaggaatgtgtctgaatttaaggatgttgttattcatgaggac caatgggttggcgagacagtactacaatcaacatttagcagtcagttattaaatcttggg agttattcatctattcagcctgaagaatattccagtgtagttagtgatgttgtacttcaa gacctactggcacaggtgtcctcaaaacattcctacctcagagatcttcctccgaggcag cctcagaggaaaatatactataggccagctttaatgactgtagttgatggaagacatgat gtttacatccgtgtagactcaaagctgatagagaagattcttctcaacatttttgcagac tgcctcaacagagtgatagttacttcctcagagatcacctatgggatggtcatggcagac ctgttacactccttgtcggcagtcagcgcagaaccttgtgtatcaaagattcagagcctt tttgtgttagatgaaagcagctatccattacaacaagatttctccctcctggatttttat cctgacactgtaaagcatggagccgatgcccttttctga >gi568815584f:87910848_88111858|GENSCAN_predicted_peptide_4|413_aa MTAAAGSAGRAAVPLLLCALLAPGGAYVLDDSDGLGREFDGIGAVSGGGVSGKLRGYAGG GKSPPRSGAQAVFIAEMAVGERRSRKGQGVRERPCAKENAPVGCFGCVSPFWRSWWVVES LSPVQLLVQDGTEPSHMHYALDENYFRGYEWWLMKEAKKRNPNITLIGLPWSFPGWLGKG FDWPYVNLQLTAYYVVTWIVGAKRYHDLDIDYIGIWNERSYNANYIKILRKMLNYQGLQR VKIIASDNLWESISASMLLDAELFKVVDVIGAHYPGTHSAKDAKLTGKKLWSSEDFSTLN SDMGAGCWGRILNQNYINGYMTSTIAWNLVASYYEQLPYGRCGLMTAQEPWSGHYVVESP VWVSAHTTQFTQPGWYYLKTVGHLEKGGSYVALTDGLGNLTIIIETMPHPSVW >gi568815584f:87910848_88111858|GENSCAN_predicted_CDS_4|1242_bp atgactgcggccgcgggttcggcgggccgcgccgcggtgcccttgctgctgtgtgcgctg ctggcgcccggcggcgcgtacgtgctcgacgactccgacgggctgggccgggagttcgac ggcatcggcgcggtcagcggcggcggggtgagcggcaagctgcggggatacgcggggggc ggcaagagcccgccccgcagtggggcacaggctgtcttcatcgcggagatggcagtcggg gagcgtcgctccagaaagggccagggagtgcgggagcgtccgtgtgcgaaagagaacgcc ccagtgggatgctttggctgtgtctctcccttttggaggagttggtgggtggtggagtcg ttaagtcccgtgcagttgttagtgcaggacggcactgagccctcccacatgcattatgca ctagatgagaattatttccgaggatacgagtggtggttgatgaaagaagctaagaagagg aatcccaatattacactcattgggttgccatggtcattccctggatggctgggaaaaggt ttcgactggccttatgtcaatcttcagctgactgcctattatgtcgtgacctggattgtg ggcgccaagcgttaccatgatttggacattgattatattggaatttggaatgagaggtca tataatgccaattatattaagatattaagaaaaatgctgaattatcaaggtctccagcga gtgaaaatcatagcaagtgataatctctgggagtccatctctgcatccatgctccttgat gccgaactcttcaaggtggttgatgttataggggctcattatcctggaacccattcagca aaagatgcaaagttgactgggaagaagctttggtcttctgaagactttagcactttaaat agtgacatgggtgcaggctgctggggtcgcattttaaatcagaattatatcaatggctat atgacttccacaatcgcatggaatttagtggctagttactatgaacagttgccttatggg agatgcgggttgatgacggcccaggagccatggagtgggcactacgtggtagaatctcct gtctgggtatcagctcataccactcagtttactcaacctggctggtattacctgaagaca gttggccatttagagaaaggaggaagctacgtagctctgactgatggcttagggaacctc accatcatcattgaaaccatgccccaccccagtgtctggtaa >gi568815584f:87910848_88111858|GENSCAN_predicted_peptide_5|337_aa MNSTCIEEQHDLDHYLFPIVYIFVIIVSIPANIGSLCVSFLQAKKESELGIYLFSLSLSD LLYALTLPLWIDYTWNKDNWTFSPALCKGSAFLMYMNFYSSTAFLTCIAVDRYLAVVYPL KFFFLRTRRFALMVSLSIWILETIFNAVMLWEDETVVEYCDAEKSNFTLCYDKYPLEKWQ INLNLFRTCTGYAIPLVTILICNRKVYQAVRHNKATENKEKKRIIKLLVSITVTFVLCFT PFHVMLLIRCILEHAVNFEDHSNSGKRTYTMYRITVALTSLNCVADPILYCFVTETGRYD MWNILKFCTGRCNTSQRQRKRILSVSTKDTMELEVLE >gi568815584f:87910848_88111858|GENSCAN_predicted_CDS_5|1014_bp atgaacagcacatgtattgaagaacagcatgacctggatcactatttgtttcccattgtt tacatctttgtgattatagtcagcattccagccaatattggatctctgtgtgtgtctttc ctgcaagcaaagaaggaaagtgaactaggaatttacctcttcagtttgtcactatcagat ttactctatgcattaactctccctttatggattgattatacctggaataaagacaactgg actttctctcctgccttgtgcaaagggagtgcttttctcatgtacatgaatttttacagc agcacagcattcctcacctgcattgccgttgatcggtatttggctgttgtctaccctttg aagttttttttcctaaggacaagaagatttgcactcatggtcagcctgtccatctggata ttggaaaccatcttcaatgctgtcatgttgtgggaagatgaaacagttgttgaatattgc gatgccgaaaagtctaattttactttatgctatgacaaataccctttagagaaatggcaa atcaacctcaacttgttcaggacgtgtacaggctatgcaatacctttggtcaccatcctg atctgcaaccggaaagtctaccaagctgtgcggcacaataaagccacggaaaacaaggaa aagaagagaatcataaaactacttgtcagcatcacagttacttttgtcttatgctttact ccctttcatgtgatgttgctgattcgctgcattttagagcatgctgtgaacttcgaagac cacagcaattctgggaagcgaacttacacaatgtatagaatcacggttgcattaacaagt ttaaattgtgttgctgatccaattctgtactgttttgtaaccgaaacaggaagatatgat atgtggaatatattaaaattctgcactgggaggtgtaatacatcacaaagacaaagaaaa cgcatactttctgtgtctacaaaagatactatggaattagaggtccttgagtag >gi568815584f:87910848_88111858|GENSCAN_predicted_peptide_6|76_aa MGGAKQEPRQGSITGGDTVDPQQTTRSPIILTVVSEASARGMPNGQTGHHLKGQRGPEDS MRSSRLTSSQQRGQAT >gi568815584f:87910848_88111858|GENSCAN_predicted_CDS_6|231_bp atgggaggagctaagcaagaaccaagacaagggagtataactggaggggacacagtagac ccccagcagaccaccagatcaccaattatactgactgtggtctcagaagcttctgccagg gggatgcctaatggacagacaggacatcatctcaaaggccagaggggaccagaagacagt atgaggagcagcaggttgacatcctcacagcaaagaggacaggctacgtga >gi568815584f:87910848_88111858|GENSCAN_predicted_peptide_7|106_aa MEPHRECAIGEIQQDEFLTFRLPIVLIYITPTLHRESEFQGPSAERHGITLVYCIDDVMV IGNVNTLGDPLLACYRIFVEIECLIITSQDTKCELFLSPFSIAIKE >gi568815584f:87910848_88111858|GENSCAN_predicted_CDS_7|321_bp atggagccacatcgtgaatgtgccattggagaaatacaacaggatgaattcttaactttc aggctccctattgtcctcatatatattactccaactctccacagggaatctgagttccaa gggccttctgcagaacgtcatggtataacacttgtctactgtattgatgatgtcatggtt attggaaatgttaataccctaggtgacccactcctggcctgctaccggatatttgtagag attgaatgcctaattataacatcacaagacaccaagtgtgagctgtttctgagtccattt agtattgctataaaggaatag >gi568815584f:87910848_88111858|GENSCAN_predicted_peptide_8|160_aa MGPNQDKICELSEKEFRRLITKLIKEAPEKGEVQLKEIKNMIQDMKGKIFGEIDGINKKQ SHFWKSRTHLDKCKMHWKVSAMVSNKEKNLNVSCKRDEVFLAPGVQGGQRSKRGLLSPKA FPFGNHLPPTQNKATVVGDEAKTMHAPSQPIQWLWSKCSV >gi568815584f:87910848_88111858|GENSCAN_predicted_CDS_8|483_bp atgggtccaaaccaagacaaaatctgtgaattgtcagaaaaagaattcagaagattaatt actaagctaatcaaggaggcaccagagaaaggtgaagtacaacttaaggaaataaaaaac atgatacaggatatgaaaggaaaaatctttggtgaaatagatggcataaataaaaaacaa tcacacttctggaaatcaaggacacacttagataaatgcaaaatgcactggaaagtctca gcaatggtatcgaacaaggagaagaatctcaatgtttcttgtaaaagagatgaagttttt ctggctcctggtgtacaaggagggcagaggtcaaaacgaggacttctatctcccaaagct tttccttttggaaaccacctgccaccaacacagaacaaagccacagttgtgggagatgaa gctaaaaccatgcatgctccatcccagcccattcaatggctttggagcaaatgttctgtc tga >gi568815584f:87910848_88111858|GENSCAN_predicted_peptide_9|185_aa MLLSSASIQRESEIQPPLLEAPTSKLCDSFLGTLLCGSGDDNVTALPSKPKFPHTDELLG SSICRRSPTFCYYYVLQMSRREWGGREQEEERKNNVLKLFSKDNGLQLRQCSCKGHDADF EDASFHIYSLKETNSANQLRELSLSCLKMRSQSAQHHDLGLVTPEEEEEEEEEEEEEEEG GGGEI >gi568815584f:87910848_88111858|GENSCAN_predicted_CDS_9|558_bp atgcttctgagctcagccagcatccagagagagagtgagatacagcctcctcttcttgaa gcacccacttctaaactctgtgacagttttcttggaactctgttatgtggatctggagat gacaatgtcacagcactcccctctaagcctaaattcccccacaccgatgaactattagga tccagtatctgccggaggagccccaccttctgctactattatgttcttcagatgagtaga agagagtggggagggagagagcaggaagaagaaaggaagaataatgtcttgaaattgttt tctaaggataatggtctccagcttcgtcaatgttcctgcaaaggacatgatgctgacttt gaagatgcaagcttccacatctacagcctcaaggaaacaaattcagccaaccaactgagg gagctttccttgtcctgcctcaagatgagatcacagtctgcccaacaccatgatttaggc cttgtgacacctgaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagga ggaggaggagagatttga >gi568815584f:87910848_88111858|GENSCAN_predicted_peptide_10|98_aa MTDGSSAALWLSLLELNLYIWILNRPGAILKRRETEGRWLCEDTEAQGEHHMKKKTETGV MLLQAGNTKNCQPSPEARRGMEQMLPGSPRKEPTLLTS >gi568815584f:87910848_88111858|GENSCAN_predicted_CDS_10|297_bp atgactgatggcagctcagctgccctctggctctcactcctggagcttaacctttacata tggatactaaacaggcctggtgcaatcctgaaaaggagagaaacagagggaagatggcta tgtgaagacacagaggcacaaggagaacaccatatgaagaagaagacagaaactggagtg atgcttctacaagcagggaacaccaagaattgccagccatcaccagaagctaggagaggc atggaacagatgctccctgggagccctcggaaagaaccaacgctgctgacatcttga