GENSCAN 1.0 Date run: 5-Nov-116 Time: 05:06:08 Sequence gi568815586f:118146084_118346962 : 200879 bp : 39.58% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 92 150 59 0 2 102 95 30 0.745 2.78 1.02 Term + 3493 3666 174 0 0 59 43 158 0.962 5.18 1.03 PlyA + 4386 4391 6 1.05 2.17 PlyA - 4408 4403 6 1.05 2.16 Term - 5075 4914 162 2 0 69 50 203 0.988 11.55 2.15 Intr - 6326 6144 183 2 0 106 116 216 0.999 25.26 2.14 Intr - 14275 14063 213 1 0 88 115 153 0.997 15.99 2.13 Intr - 15944 15705 240 2 0 87 116 317 0.909 31.22 2.12 Intr - 26577 26374 204 0 0 84 53 238 0.970 18.37 2.11 Intr - 31246 31118 129 1 0 51 94 136 0.990 10.47 2.10 Intr - 35524 35288 237 1 0 73 78 387 0.983 33.09 2.09 Intr - 43858 43724 135 1 0 105 99 113 0.989 13.94 2.08 Intr - 53174 52941 234 2 0 90 83 202 0.638 16.86 2.07 Intr - 55380 55213 168 0 0 74 50 133 0.928 7.32 2.06 Intr - 66544 66409 136 0 1 82 40 27 0.147 -3.05 2.05 Intr - 68027 67934 94 0 1 112 95 87 0.786 9.90 2.04 Intr - 87682 87591 92 0 2 68 92 81 0.930 5.22 2.03 Intr - 89588 89475 114 1 0 112 84 47 0.934 5.34 2.02 Intr - 92086 91990 97 2 1 113 82 51 0.810 5.15 2.01 Init - 93183 93144 40 0 1 48 116 -30 0.302 -3.80 2.00 Prom - 93358 93319 40 -4.65 3.00 Prom + 95717 95756 40 -7.05 3.01 Sngl + 100001 100882 882 1 0 48 46 871 0.598 74.67 3.02 PlyA + 100900 100905 6 1.05 4.03 PlyA - 102271 102266 6 1.05 4.02 Term - 103475 103452 24 2 0 126 44 -12 0.086 -4.55 4.01 Init - 109484 109365 120 2 0 90 78 135 0.978 12.94 4.00 Prom - 133461 133422 40 -1.65 5.03 PlyA - 133681 133676 6 1.05 5.02 Term - 136837 136598 240 1 0 66 47 205 0.823 9.24 5.01 Init - 149159 149157 3 2 0 98 95 0 0.067 1.75 5.00 Prom - 194102 194063 40 -1.75 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586f:118146084_118346962|GENSCAN_predicted_peptide_1|77_aa XLFSLKKLGNVALLEAHKVDDRKVQLTTCPHRMVQHYLQPHRRTWKTELYLKGLNGSSWS DRGYGEQFQEGGCPSEL >gi568815586f:118146084_118346962|GENSCAN_predicted_CDS_1|234_bp ntacttttttctctgaaaaaacttggaaatgttgccttgctggaagcacataaggtagat gacagaaaggtgcagttgaccacgtgcccacatcgaatggtgcaacactatcttcagccc catagaagaacttggaagacagagctttacctgaaaggcttgaatggctcttcttggagc gaccgaggctatggagaacagtttcaggagggtggctgtccctctgagttgtga >gi568815586f:118146084_118346962|GENSCAN_predicted_peptide_2|825_aa MEYCLGSASDLLEVHKKPLQEVEIAAITHGALHGLAYLHSHALIHRDIKAGNILLTEPGQ VKLADFGSASMASPANSFVGTPYWMAPEVILAMDEGQYDGKVDIWSLGITCIELAERKPP LFNMNAMSALYHIAQNDSPTLQSNECHHCDVPGPNICININPVVSGQSWLVVQTPTLGFS VRLLPNPQAVKHDFVRRDRPLRVLIDLIQRTKDAVRELDNLQYRKMKKILFQETRNGPLN ESQEDEEDSEHGTSLNREMDSLGSNHSIPSMSVSTGSQSSSVNSMQEVMDESSSELVMMH DDESTINSSSSVVHKKVGFLVPSTEDHVFIRDEAGHGDPRPEPRPTQSVQSQALHYRNRE RFATIKSASLVTRQIHEHEQENELREQMSGYKRMRRQHQKQLIALENKLKAEMDEHRLKL QKEVETHANNSSIELEKLAKKQVAIIEKEAKVAAADEKKFQQQILAQQKKDLTTFLESQK KQYKICKEKIKEEMNEDHSTPKKEKQERISKHKENLQHTQAEEEAHLLTQQRLYYDKNCR FFKRKIMIKRHEVEQQNIREELNKKRTQKEMEHAMLIRHDESTRELEYRQLHTLQKLRMD LIRLQHQTELENQLEYNKRRERELHRKHVMELRQQPKNLKAMEMQIKKQFQDTCKVQTKQ YKALKNHQLEVTPKNEHKTILKTLKDEQTRKLAILAEQYEQSINEMMASQALRLDEAQEA ECQALRLQLQQEMELLNAYQSKIKMQTEAQHERELQKLEQRVSLRRAHLEQKIEEELAAL QKERSERIKNLLERQEREIETFDMESLRMGFGNLVTLDFPKEDYR >gi568815586f:118146084_118346962|GENSCAN_predicted_CDS_2|2478_bp atggaatattgcttaggctcagcctctgatttattagaagttcataaaaaaccacttcag gaagtggagatcgctgccattactcatggagccttgcatggactagcctacctacattct catgcattgattcatagggatattaaagcaggaaatattcttctaacagagccaggtcag gtaaaactagctgattttggatctgcttcaatggcttctcctgccaactccttcgtgggc acaccttactggatggctccagaggtgatcttagctatggatgaaggacagtatgatggg aaagttgatatttggtcacttggcatcacttgtattgaattggcggaacggaagccgccc cttttcaacatgaatgcaatgagtgccttatatcacattgcccagaatgactccccaacg ttacagtctaatgaatgtcatcattgtgatgttcctgggcctaacatctgcataaacatt aatccagtggtctcaggacagagctggctggtggttcaaacaccaactctgggattttca gttaggttgttgccaaatcctcaggcagtgaagcatgactttgttcgacgagaccggcca ctacgtgtcctcattgacctcatacagaggacaaaagatgcagttcgtgagctagataac ctacagtaccgaaaaatgaaaaaaatacttttccaagagacacggaatggacccttgaat gagtcacaggaggatgaggaagacagtgaacatggaaccagcctgaacagggaaatggac agcctgggcagcaaccattccattccaagcatgtccgtgagcacaggcagccagagcagc agtgtgaacagcatgcaggaagtcatggacgagagcagttccgaacttgtcatgatgcac gatgacgaaagcacaatcaattccagctcctccgtcgtgcataagaaagtaggtttcttg gtaccctccacagaggatcatgtattcataagggatgaggcgggccacggcgatcccagg cctgagccgcggcctacccagtcagttcagagccaggccctccactaccggaacagagag cgctttgccacgatcaaatcagcatctttggttacacgacagatccatgagcatgagcag gagaacgagttgcgggaacagatgtcaggttataagcggatgcggcgccagcaccagaag cagctgatcgccctggagaacaagctgaaggctgagatggacgagcaccgcctcaagcta cagaaggaggtggagacgcatgccaacaactcgtccatcgagctggagaagctggccaag aagcaagtggctatcatagaaaaggaggcaaaggtagctgcagcagatgagaagaagttc cagcaacagatcttggcccagcagaagaaagatttgacaactttcttagaaagtcagaag aagcagtataagatttgtaaggaaaaaataaaagaggaaatgaatgaggaccatagcaca cccaagaaagagaagcaagagcggatctccaaacataaagagaacttgcagcacacacag gctgaagaggaagcccaccttctcactcaacagagactgtactacgacaaaaattgtcgt ttcttcaagcggaaaataatgatcaagcggcacgaggtggagcagcagaacattcgggag gaactaaataaaaagaggacccagaaggagatggagcatgccatgctaatccggcacgac gagtccacccgagagctagagtacaggcagctgcacacgttacagaagctacgcatggat ctgatccgtttacagcaccagacggaactggaaaaccagctggagtacaataagaggcga gaaagagaactgcacagaaagcatgtcatggaacttcggcaacagccaaaaaacttaaag gccatggaaatgcaaattaaaaaacagtttcaggacacttgcaaagtacagaccaaacag tataaagcactcaagaatcaccagttggaagttactccaaagaatgagcacaaaacaatc ttaaagacactgaaagatgagcagacaagaaaacttgccattttggcagagcagtatgaa cagagtataaatgaaatgatggcctctcaagcgttacggctagatgaggctcaagaagca gaatgccaggccttgaggctacagctccagcaggaaatggagctgctcaacgcctaccag agcaaaatcaagatgcaaacagaggcacaacatgaacgtgagctccagaagctagagcag agagtgtctctgcgcagagcacaccttgagcagaagattgaagaggagctggctgccctt cagaaggaacgcagcgagagaataaagaacctattggaaaggcaagagcgagagattgaa acttttgacatggagagcctcagaatgggatttgggaatttggttacattagattttcct aaggaggactacagatga >gi568815586f:118146084_118346962|GENSCAN_predicted_peptide_3|293_aa MADDAGAAGGHGGPGGPGMGNRGGFRGGFGSGIRGRGRGRGRGRGRGRGARGGKAEDKEW MPVTKLGRLVKDMKIKSLEEIYLFSLPIKESEIIDFFLGASLKDEVLKIMPVQKQTRASQ RTRFKAFVAIGDYNGHVGLGVKCSKEVATAIRGAIILAKLSIVPVRRGYWGNKIGKPHTV PCKVTGRCGSVLVRLIPAPRGTGIVSAPVPKKLLMMAGIDDCYTSARGCTATLGNFAKAT FDAISKTYSYLTPDLWKETVFTKSPYQEFTDHLVKTHTRVSVQRTQDPAVATT >gi568815586f:118146084_118346962|GENSCAN_predicted_CDS_3|882_bp atggcggatgacgccggtgcagcgggggggcacggaggccctggtggccctgggatgggg aaccgcggtggcttccgcggaggtttcggcagtggcatccggggccggggtcgcggccgt ggacggggccggggccgaggccgcggagctcgcggaggcaaggccgaggataaggagtgg atgcccgtcaccaagttgggccgcttggtcaaggacatgaagatcaagtccctggaggag atctatctcttctccctgcccattaaggaatcagagatcattgatttcttcctgggggcc tctctcaaggatgaggttttgaagattatgccagtgcagaagcagacccgtgccagccag cgcaccaggttcaaggcgtttgttgctatcggggactacaatggccacgtcggtctgggt gttaagtgctccaaggaggtggccaccgccatccgtggggccatcatcctggccaagctc tccattgtccccgtgcgcagaggctactgggggaacaagatcggcaagccccacactgtc ccttgcaaggtgacaggccgctgcggctctgtgctggtgcgcctcatccctgcacccagg ggcactggcatcgtctccgcacctgtgcctaagaagctgctcatgatggctggtatcgat gactgctacacctcagcccggggctgcactgccaccctgggcaacttcgccaaggccacc tttgatgccatttctaagacctacagctacctgacccccgacctctggaaggagactgta tttaccaagtctccctatcaggaattcactgaccacctcgttaagacccacaccagagtc tccgtgcagcggactcaggatccagctgtggctacaacatag >gi568815586f:118146084_118346962|GENSCAN_predicted_peptide_4|47_aa MRKGVLKDPEIADLFYKDDPEELFIGLHEIGHGSFGAVYFTKSHSVT >gi568815586f:118146084_118346962|GENSCAN_predicted_CDS_4|144_bp atgcgtaaaggggtgctgaaggacccagagattgccgatctattctacaaagatgatcct gaggaactttttattggtttgcatgaaattggacatggaagttttggagcagtttatttt acaaaatctcactctgtcacctag >gi568815586f:118146084_118346962|GENSCAN_predicted_peptide_5|80_aa MVEASVLPSAVDTAKGVQVDIPSSVAWQLSGGHVTLVNQILPPRTLNLASSGHSRDDCGS CEDDKSPVEQNRMMMASTAS >gi568815586f:118146084_118346962|GENSCAN_predicted_CDS_5|243_bp atggtggaagccagtgttcttccttctgctgtggatacagcgaaaggagtacaggtcgat attccatcttcagttgcctggcagctttcaggcgggcatgtgactttggtcaatcagata ctcccacccagaactttgaatcttgcaagctcagggcacagcagagatgattgtggcagt tgtgaagatgataagagtccagtagagcagaatcgaatgatgatggcatcaacagcatcc taa