GENSCAN 1.0    Date run: 4-Nov-116    Time: 16:20:41

Sequence gi568815594r:100087680_100289983 : 202304 bp : 37.93% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   5510   5637  128  0  2   57   81    94 0.154   5.08
 1.02 Term +   7686   7790  105  2  0   64   48    64 0.106  -2.57
 1.03 PlyA +   8955   8960    6                               1.05

 2.04 PlyA -   9739   9734    6                               1.05
 2.03 Term -  10582  10325  258  1  0   62   46   166 0.563   4.37
 2.02 Intr -  15673  15552  122  2  2   53   96   115 0.428   8.09
 2.01 Init -  35429  35381   49  2  1   46   81    87 0.567   4.96
 2.00 Prom -  35562  35523   40                              -2.95

 3.03 PlyA -  35673  35668    6                               1.05
 3.02 Term -  41541  41313  229  2  1   52   47   148 0.576   2.22
 3.01 Init -  56282  56188   95  2  2   38  110    86 0.162   5.70
 3.00 Prom -  65161  65122   40                              -4.75

 4.02 PlyA -  65605  65600    6                               1.05
 4.01 Sngl -  67479  67144  336  0  0   73   44   207 0.848  10.58
 4.00 Prom -  67533  67494   40                              -4.25

 5.03 PlyA -  67650  67645    6                               1.05
 5.02 Term -  69530  68927  604  1  1   14   35   246 0.188   4.90
 5.01 Init -  83773  83622  152  1  2   77   30   138 0.072   6.46
 5.00 Prom -  84664  84625   40                              -4.85

 6.06 PlyA -  84991  84986    6                               1.05
 6.05 Term -  91899  91666  234  0  0  -47   38   266 0.050   3.44
 6.04 Intr - 100488 100232  257  1  2   58   48   176 0.074   6.84
 6.03 Intr - 102175 102111   65  0  2  100   35    15 0.359  -5.16
 6.02 Intr - 102506 102214  293  2  2   56   89   198 0.601  11.71
 6.01 Init - 103217 102624  594  2  0   42   99   271 0.193  18.90
 6.00 Prom - 106112 106073   40                             -11.14

 7.00 Prom + 106660 106699   40                              -8.15
 7.01 Init + 107765 107864  100  1  1   60   68   139 0.601   8.68
 7.02 Intr + 107891 107992  102  0  0   50   68   118 0.962   5.23
 7.03 Intr + 108019 108123  105  2  0   50   70    89 0.813   2.57
 7.04 Intr + 108755 108939  185  0  2   56   27   230 0.711  12.29
 7.05 Intr + 113476 113551   76  0  1   31   92    67 0.437  -0.43
 7.06 Term + 123674 123744   71  0  2  128   48    63 0.545   3.42
 7.07 PlyA + 125778 125783    6                               1.05

 8.05 PlyA - 126642 126637    6                               1.05
 8.04 Term - 135513 135317  197  1  2   75   48   137 0.422   4.99
 8.03 Intr - 142949 142828  122  1  2  101  111    14 0.166   4.22
 8.02 Intr - 153958 153791  168  0  0   74   81   103 0.298   6.34
 8.01 Init - 169041 168992   50  0  2   64   97    78 0.664   6.77
 8.00 Prom - 171885 171846   40                              -3.65

 9.03 PlyA - 172191 172186    6                               1.05
 9.02 Term - 172420 172256  165  1  0   72   44   118 0.461   2.73
 9.01 Init - 177290 176838  453  2  0   43   61   161 0.179   4.81

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815594r:100087680_100289983|GENSCAN_predicted_peptide_1|77_aa
XLRSGTLYAVTETAVPMKIRAHGGHANASLSVFSLLALPSAFLCLKIDTTDKLVEPFLKT
SMATFKGKQSKERKKKI

>gi568815594r:100087680_100289983|GENSCAN_predicted_CDS_1|234_bp
natctaagaagtgggaccttgtatgcagtcactgagacagctgttcccatgaaaatcagg
gctcatggaggacatgcaaatgcttctctgtcagtcttcagcctgcttgctctgccttct
gctttcctgtgccttaaaatagatacaacggataaactagtagaaccctttttaaaaaca
tcaatggccactttcaaaggaaaacaaagtaaggaaaggaaaaagaagatctga

>gi568815594r:100087680_100289983|GENSCAN_predicted_peptide_2|142_aa
MQYFCKIDDGAAAADDGNLQEKLIPIDIELGNRMSRADNPARTGEAADPGAIHLATQDGL
DFLTASRTRSKTAKADSAKPLNGKALKQQSFTSATSYWLKPVTKPVEIQKKRDRTHLLIE
KSCTCIKKGEDLTAIFEEYLPH

>gi568815594r:100087680_100289983|GENSCAN_predicted_CDS_2|429_bp
atgcagtacttttgtaaaattgatgatggtgctgctgctgctgatgatggtaacctgcaa
gagaagctaatccccattgacatagagctaggaaacagaatgtccagagcagacaatcca
gcaagaactggagaagctgcagacccaggtgctatccacctggccacccaggatggccta
gatttccttacggcatcgaggacgagatccaagacagcaaaagcagactctgccaagcct
cttaatggcaaggccctgaagcagcagagcttcacttctgccacctcctattggttaaag
cctgtcacaaagcctgtcgagattcagaaaaagagagatagaacccacctcctgatagaa
aaaagctgcacatgcataaagaaaggagaggatttgacagctatctttgaagagtatctg
ccccattaa

>gi568815594r:100087680_100289983|GENSCAN_predicted_peptide_3|107_aa
MVLEAEKAKIKGLASGESLLAALSHGRDAKRGTVHNSKDVETIQKPIKDRLDKENVAIYT
MEYYAAIKKDEFMPFAETWLKLEAIILSKLTQEQKTKHHMFSLISGS

>gi568815594r:100087680_100289983|GENSCAN_predicted_CDS_3|324_bp
atggttttggaggctgagaaggccaagatcaaggggctggcatctggtgagagccttctt
gctgcattatcccatggcagagatgcaaagagaggcactgttcacaatagcaaagacgtg
gaaacaatccaaaagcccatcaaggatagactggataaagaaaatgtggcaatatacacc
atggaatactatgcagccataaaaaaggatgagttcatgccctttgcggagacatggttg
aagctggaagccatcattctcagcaaactaacacaggaacagaaaaccaaacaccacatg
ttctcactcataagtgggagttga

>gi568815594r:100087680_100289983|GENSCAN_predicted_peptide_4|111_aa
MDKDFMTKTPKAMAIKAKIDKWDLIKLKIFCTAKDIIRVNRQPTELKKIFAIYPSDKGLI
SGIYKELKQIYKKKTKNPIKKWVKDMIRHFSKENIYAANKHTKKKLLITGH

>gi568815594r:100087680_100289983|GENSCAN_predicted_CDS_4|336_bp
atggacaaagacttcatgactaaaacaccaaaagcaatggcaataaaagccaaaattgac
aaatgggatctaattaaactaaagatcttctgcacagcaaaagatatcatcagagtgaac
aggcaacctacagaattgaagaaaatttttgcaatctatccatctgacaaagggctaata
tccggaatctacaaggaacttaaacaaatttacaagaaaaaaacaaagaaccccatcaaa
aagtgggtgaaggatatgatcagacacttttcaaaagaaaacatttatgcagccaacaaa
cataccaaaaaaaagctcctcatcactggtcattag

>gi568815594r:100087680_100289983|GENSCAN_predicted_peptide_5|251_aa
MDRIAAIKLEQMRCSLLWMSKEWFLEMESTRGEDAVNIAEMTTKDLEYSIKLFYGICGDI
NKCGVQLKELEKQEQTNSKASRRQEITKIRAELKEIGTQKTFQEINESRSWLFEKINKID
RLLARLIKKKREKNQIDTIKTDKGDITTDPTEIQTTIREYYKHLYANKLENLEEIDKFLD
TYTLPSLNQEEVEFPNTPITSSEIEAVINSLPTKKSPRPDGFRAKFYQRYKEELIPFPLK
IFQTTEKEGLL

>gi568815594r:100087680_100289983|GENSCAN_predicted_CDS_5|756_bp
atggaccgaattgctgcaataaaacttgaacagatgaggtgtagcctcttatggatgagc
aaagaatggtttcttgagatggaatctactcgtggtgaagatgctgtgaacattgctgaa
atgacaacaaaggatttagaatattccataaagcttttctatgggatctgtggtgatatt
aataaatgtggtgtccaattaaaagaactagagaagcaagagcaaacaaattcaaaagcc
agcagaagacaagaaataactaagataagagcagaactgaaggagatagggacacagaaa
acctttcaagaaattaatgaatccaggagctggctttttgaaaagattaacaaaatagat
agattgctagccagactaataaagaagaaaagagagaagaatcaaatagacacaataaaa
actgataaaggggatatcaccactgatcccacagaaatacaaactaccatcagagaatac
tataaacacctgtatgcaaataaactagaaaatctagaagaaattgataaattcctggac
acatacaccctcccaagtctaaaccaggaagaagtcgaattcccaaatacaccaataaca
agttctgaaattgaggcagtaattaacagcctaccaaccaaaaaaagcccacgaccagac
ggattcagagccaaattctaccagaggtacaaagaggagcttataccattccctctgaaa
atattccaaacaacagaaaaagagggactcctctaa

>gi568815594r:100087680_100289983|GENSCAN_predicted_peptide_6|480_aa
MLSGRCNRFKPSVQDDTAKPQDKEMHRSLKASRCREQPECESADARGGVALNNQVQALTS
RAPGIAVPVARDLPWKQQVSALPQAYPQLPVARLGLPGSVEVERGPDSGGEGRGAGRGGA
VQVGGPEQGGDWTYKVLAAPEAMLLRGAGAGWPGRGEQGAAGCWGLQVTESPGEGTPKKP
PACVYPAASAQAPANSWQGFVKLSCRGRFHPFGKGVPPLVRSRDAGHFRSLCTTRLLLQI
RLFSPHFFQRSGKAVAPREGNRAVDHGCNWQFEQQEPGQHFRIAGLWLSPREPAKWSVPL
DLLGGRAFDFLPYLYVGDFDYWDYVVPEPNLNEVIFEESTCQNLVKMLENCLSKSKQTKL
GCSKVLVPEKLTQRIAQDVLRLSSTEPCGLRGCVMHVNLEIENPFEGNIEEHFAVFYQQQ
HKDIEPTATVSENLLGSYHLIQSELAIADTVKSEGSTSSIRTDKFNVKGSESGVKELSRV

>gi568815594r:100087680_100289983|GENSCAN_predicted_CDS_6|1443_bp
atgctcagtggacgttgcaacagatttaagccaagtgttcaggatgacacagcaaagccc
caggacaaggaaatgcacagatcgcttaaagcttcccgctgccgggagcagcctgagtgt
gaaagcgcggacgctcggggaggggtcgcactgaacaatcaggttcaggctttgacgagt
cgtgcacccggaattgcggtcccagtggcccgagatctaccctggaagcaacaggtgtcc
gctctaccacaggcttatcctcaactccctgtggctaggctcgggctccctggaagtgta
gaggttgaaaggggaccagactctggaggggaggggcggggcgctggccggggaggggcg
gtgcaggtgggcgggcctgagcagggtggggactggacctataaagttctcgctgccccg
gaggctatgctactgcgaggagccggcgcagggtggcccgggaggggtgagcagggtgcc
gctggctgctggggtctgcaggtcaccgagtccccaggagaggggactcctaagaagcca
cctgcctgtgtttacccggcagcgagcgcgcaggcccccgcgaactcctggcagggcttt
gtcaaattgagttgcagaggccggttccacccctttggcaaaggagttcctccgctagtt
cgctcccgggacgccgggcattttaggagcctctgcacgactcgcctgcttttgcaaatc
cgtctcttctcgcctcattttttccagcgctcaggaaaggccgttgcgcctcgcgaagga
aacagagccgttgaccatggttgcaactggcagtttgagcagcaagaacccggccagcat
ttcagaattgctggactgtggctatcacccagagagcctgctaagtggtcggtgcctctg
gatctattgggtgggcgggctttcgacttcctcccctacctctatgtaggagattttgac
tactgggattatgttgttcctgaacccaacctcaacgaggtaatatttgaggaatcaact
tgccagaatttggttaaaatgctggagaactgtctgtccaaatcaaagcaaactaaactt
ggttgctcaaaggtccttgtccctgagaaactgacccagagaattgctcaagatgtcctg
cggctttcctcaacggagccctgcggcttgcgaggttgtgttatgcacgtgaacttggaa
attgaaaatccatttgaagggaatatagaagagcattttgcagtattctatcagcaacag
cataaagatattgagccaacggctaccgtgtcagagaatcttctgggtagctaccaccta
attcaaagtgaacttgctatcgcagacactgtgaagtctgaaggaagtaccagcagcatc
aggacagacaagtttaacgtgaagggaagtgaatctggtgtcaaagaactctcaagggtc
taa

>gi568815594r:100087680_100289983|GENSCAN_predicted_peptide_7|212_aa
MKPRTLAVSVTALKVVCLEFVPSDVRMSEFLPSGMKQQTFEVSVTALKVACLEFVPSDVR
MSEFLPSGMKPQTFEVSVTALKVACLVFVPSDVRICSEFLPSDSGGQLASPSGSRTRAAG
GAACQSRTVRPHSSALGWSIGLGAVEQGAALIGEARAAQEPMEGSVGQGEQKFKRSFGKR
SRSAPSNPTGHLSLTMEGIVEVKRNILRADNS

>gi568815594r:100087680_100289983|GENSCAN_predicted_CDS_7|639_bp
atgaagccgcggaccctcgcagtgagtgttacagctcttaaggtggtgtgtctggagttt
gttccttctgatgttcggatgtcggagtttcttccttctggaatgaagcagcagaccttc
gaagtgagtgttacagctcttaaggtggcgtgtctggagtttgttccttctgatgttcgg
atgtcggagtttcttccttctggaatgaagccgcagaccttcgaagtgagtgttacagct
cttaaggtggcgtgtctggtgtttgttccttctgatgttcggatatgttcggagtttctt
ccttctgattcaggaggccagctggcttcacccagtggatcccgaaccagggctgcaggt
ggagctgcctgccagtcccgcaccgtgcgcccacactcctcagcccttgggtggtcgata
ggactgggcgccgtggagcagggggcggcgctcatcggcgaggctcgggctgcacaggag
cccatggaggggagtgtagggcagggagagcagaagttcaaacgcagctttggaaaacgt
tctaggtcagctccatccaatccgacaggccaccttagtctgacaatggaaggtatagta
gaagttaagcgaaacattctcagggctgataacagttga

>gi568815594r:100087680_100289983|GENSCAN_predicted_peptide_8|178_aa
MPQLSLAPEADLGRADKVKQKSTLSRDSMADFYRPPDAAPPFCDLGITNDQHSSLVREHQ
PWGGYGQSTEDACWLTLMENRTYVEILGAGSPDNWQANMVFFFLSACENPIPFAGIYLHT
ALVGLSTGITSSVKPPLTLVQTTQEPEDCPTTATAIAHTTSAAQKPEDMPTHQTHFCH

>gi568815594r:100087680_100289983|GENSCAN_predicted_CDS_8|537_bp
atgccacagttgtcccttgcaccagaagcagatcttggaagagcagataaggttaaacag
aaatcaaccctttcaagagactccatggctgatttctaccgaccacctgatgctgcccct
cccttttgtgatttaggcataacaaatgaccagcattcctccctggtaagagagcaccaa
ccatggggtgggtatggccagtctacagaggatgcatgctggctgacacttatggaaaat
agaacctacgttgaaatattgggggcgggttcccctgataactggcaagccaacatggtt
ttctttttcctaagtgcatgtgagaacccaattccctttgctggcatctatctgcacaca
gctcttgtgggcctatcaactggcataaccagctcagtgaagccaccactcacactggtg
cagaccactcaggagccagaggattgtcccaccactgctactgccattgcccatactaca
tctgctgcccagaagcctgaggacatgcccacccatcaaacccacttctgccactga

>gi568815594r:100087680_100289983|GENSCAN_predicted_peptide_9|205_aa
MNVEVKILNKILANQIQQHIKKLIHHDPVSFIPGMQGWFNIHKSINEIHHIKRTNDKNHM
IISIETEKFFSKIQQPFMLKTLNKPGIDGTFLKIIRAIYDKPTPNIILNGQKLEGFPLKT
GTRQGGPLSPLLFNIVLEVLARAIRQEKEIKVLLAVFYKTQIKRWTKATELINDRVEMRI
RFSQMLKLKLFQLSSLQDAWHCKGA

>gi568815594r:100087680_100289983|GENSCAN_predicted_CDS_9|618_bp
atgaacgtcgaagtgaaaatcctcaataaaatactggcaaaccaaattcagcagcacatc
aaaaagcttatccaccacgatccagttagcttcatccctgggatgcaaggctggttcaac
atacacaaatcaataaatgaaatccatcacataaagagaaccaatgacaaaaaccacatg
attatctcaatagaaacagaaaagttcttcagtaaaattcaacagcccttcatgctaaaa
actctcaataaaccaggtattgatggaacatttctcaaaataataagagctatttatgac
aaacccacacccaatatcatactgaatgggcaaaaactggaaggattccctttgaaaacc
ggcacaagacaaggaggccctctctcaccactcctgttcaacatagtattggaagttctg
gccagggcaattaggcaagagaaagaaataaaggtgttactggctgtgttttataagact
cagattaagaggtggacaaaggccacagagctaataaatgacagagtggagatgcgaatt
cggttctctcagatgctaaagctcaagctctttcaactaagctcactacaagatgcctgg
cattgcaaaggtgcctga