GENSCAN 1.0 Date run: 3-Nov-116 Time: 03:12:22

Sequence gi568815595r:196256688_196532399 : 275712 bp : 44.29% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   3654   3694   41  2  2   86   87    11 0.042   0.46
 1.02 Intr +  17247  17412  166  0  1   98   45    94 0.554   6.06
 1.03 Term +  24370  24522  153  0  0  114   48    52 0.813   1.72
 1.04 PlyA +  25532  25537    6                               1.05

 2.00 Prom +  26111  26150   40                              -3.56
 2.01 Sngl +  31232  31564  333  1  0   64   46   224 0.563  10.56
 2.02 PlyA +  32936  32941    6                               1.05

 3.03 PlyA -  33761  33756    6                               1.05
 3.02 Term -  50325  50252   74  1  2  112   48    85 0.588   4.97
 3.01 Init -  61465  61309  157  1  1   98   10   181 0.826  11.38
 3.00 Prom -  65780  65741   40                              -4.26

 4.05 PlyA -  65845  65840    6                              -0.45
 4.04 Term -  67310  67130  181  1  1   57   48   178 0.987   7.78
 4.03 Intr -  67753  67584  170  1  2  134   67    28 0.955   4.04
 4.02 Intr -  70345  70268   78  1  0  135   68    36 0.947   6.05
 4.01 Init -  82114  82085   30  1  0   84  116    48 0.224   6.93
 4.00 Prom -  89141  89102   40                              -5.36

 5.07 PlyA -  90998  90993    6                               1.05
 5.06 Term - 100159  99998  162  1  0  110   41   114 0.995   6.84
 5.05 Intr - 105236 105157   80  0  2   79   94    73 0.998   6.27
 5.04 Intr - 106000 105607  394  1  1   58  115   332 0.998  27.13
 5.03 Intr - 111468 111341  128  1  2   58   89    69 0.904   4.40
 5.02 Intr - 112824 112734   91  0  1   67  110    -9 0.828  -1.23
 5.01 Init - 115337 115209  129  2  0   49   94    87 0.134   5.55
 5.00 Prom - 116242 116203   40                              -6.46

 6.00 Prom + 117545 117584   40                              -4.06
 6.01 Init + 124837 125146  310  0  1   99  -54   251 0.006   9.48
 6.02 Term + 127450 128285  836  2  2   53   36   341 0.008  18.55
 6.03 PlyA + 129684 129689    6                               1.05

 7.06 PlyA - 131317 131312    6                               1.05
 7.05 Term - 132541 132473   69  1  0   58   29    50 0.053  -5.96
 7.04 Intr - 135238 135126  113  2  2   72   98    92 0.207   8.70
 7.03 Intr - 146332 146265   68  0  2   84  103    48 0.185   4.45
 7.02 Intr - 175202 175097  106  0  1  120   80    32 0.085   4.87
 7.01 Init - 175853 175640  214  2  1   62  111   288 0.946  25.41
 7.00 Prom - 189415 189376   40                              -5.16

 8.14 PlyA - 191678 191673    6                               1.05
 8.13 Term - 202387 202298   90  1  0   83   39    84 0.487   0.72
 8.12 Intr - 207225 207085  141  0  0  103   52    18 0.252   0.25
 8.11 Intr - 216085 215213  873  1  0  132   63   298 0.442  23.23
 8.10 Intr - 218625 218544   82  2  1   67   95    35 0.511   1.74
 8.09 Intr - 227204 227083  122  2  2   60  100    22 0.479  -0.11
 8.08 Intr - 230891 230712  180  2  0   14   87   164 0.958   8.96
 8.07 Intr - 231996 231920   77  1  2   44  100    83 0.416   4.33
 8.06 Intr - 246470 246186  285  0  0   29   92   287 0.100  20.51
 8.05 Intr - 250158 250122   37  0  1   81   46    58 0.091  -1.26
 8.04 Intr - 251644 251284  361  0  1   52   91   192 0.152  11.12
 8.03 Intr - 252982 252833  150  0  0  120   81   119 0.992  13.68
 8.02 Intr - 260022 259945   78  2  0   83   38    88 0.734   1.97
 8.01 Init - 261435 261311  125  0  2   88   66    51 0.495   2.55
 8.00 Prom - 265364 265325   40                              -4.56

 9.02 PlyA - 267420 267415    6                               1.05
 9.01 Term - 272360 272173  188  0  2   65   52   146 0.154   6.35

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr - 115355 115209  147  2  0   82   94   103 0.848  10.73
S.002 Sngl + 124837 125175  339  0  0   99   34   266 0.950  18.33

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595r:196256688_196532399|GENSCAN_predicted_peptide_1|119_aa
MAMSKKTGNNKQSCAQAIHEGAPAGLNGAALSSALASPPVLLDTQNPEGAKAEGGWHVST
APSTRKPSQAYIIFPTKTQCPVFFYQHINAQFQSRVRKQTLHLGCFSASTVVVLNELVL

>gi568815595r:196256688_196532399|GENSCAN_predicted_CDS_1|360_bp
atggctatgagcaaaaagacaggcaataacaaacagtcttgcgcccaggctattcacgaa
ggggcgcctgcaggcctgaatggagctgctctcagctccgccttggcgtcccctcctgtg
ctgctggacacccaaaatccagagggggccaaggcagaagggggctggcatgtcagcact
gcccctagcacacgcaaacccagccaggcctacattatcttccccactaaaacccaatgc
ccagtatttttctatcagcatatcaatgcccagttccaaagccgagtgcgcaagcagact
ttacatttgggatgtttctctgcttctaccgtggtggtgcttaatgagctggtgctctaa

>gi568815595r:196256688_196532399|GENSCAN_predicted_peptide_2|110_aa
MQRRRLLEVGYGCEPAQVLAGPSRRAARLLPCPTRAVSAQAGLCAEVLPLHLRAYRSPHP
RTRAANVQEARAPSAETPRPRLPRRSVSFYLRGEVRGICDRQPSPLKTCG

>gi568815595r:196256688_196532399|GENSCAN_predicted_CDS_2|333_bp
atgcagcgaagacgtctccttgaagtgggatacggatgtgaaccggcccaagtgctcgcc
ggtccgtcaagacgcgctgcccggctgctcccgtgtccaactcgggctgtgtccgcccag
gcgggcctgtgcgcggaggtcctaccgctgcacctccgcgcctaccgcagcccgcacccc
cgcacccgggcagccaacgtgcaggaggcccgggcgccttcagcggagacgccccgaccg
cggctgcctcgccgcagcgttagcttttacctacgtggggaagtaaggggaatttgcgac
cgccagcccagtccgctgaaaacctgtggctga

>gi568815595r:196256688_196532399|GENSCAN_predicted_peptide_3|76_aa
MATSIGVSFSVGDGVPEAEKNAGEPENTYILRPVFQQRRVEDGLWRGDLERAEMGFDRYK
MVVQVVIGEQRGEGVL

>gi568815595r:196256688_196532399|GENSCAN_predicted_CDS_3|231_bp
atggccacgtccatcggagtgtccttctcggtgggcgacggggtgcctgaggctgagaag
aacgcaggggagcccgagaacacctatattctgcggcctgttttccagcagaggcgggta
gaggacgggctgtggcggggcgacctcgagcgcgctgaaatgggatttgaccgatacaaa
atggtggtgcaagtagtgattggagaacaaagaggtgaaggagtattgtga

>gi568815595r:196256688_196532399|GENSCAN_predicted_peptide_4|152_aa
MQHIESAKGKVLTAAILISLMGWRYGCFSKSGLCRSVLTALLSGGLALLGALICFVTSGV
ALKDGPFCMFDVSSFNQTQAWKYGYPFKDLHSRNYLYDRSLWNSVCLEPSAAVVWHVSLF
SALLCISLLQLLLVVVHVINSLLGLFCSLCEK

>gi568815595r:196256688_196532399|GENSCAN_predicted_CDS_4|459_bp
atgcaacacatcgagtcagccaagggcaaggtactcactgcagctatcctcatctccttg
atgggctggagatacggctgcttcagtaagagtgggctctgtcgaagcgtgcttactgct
ctgttgtcaggtggcctggctttacttggagccctgatttgctttgtcacttctggagtt
gctctgaaagatggtcctttttgcatgtttgatgtttcatccttcaatcagacacaagct
tggaaatatggttacccattcaaagacctgcatagtaggaattatctgtatgaccgttcg
ctctggaactccgtctgcctggagccctctgcagctgttgtctggcacgtgtccctcttc
tccgcccttctgtgcatcagcctgctccagcttctcctggtggtcgttcatgtcatcaac
agcctcctgggccttttctgcagcctctgcgagaagtga

>gi568815595r:196256688_196532399|GENSCAN_predicted_peptide_5|327_aa
MQNKWLMINIQNVQDFACQCLNRDVWSNEAVKNIIREHFIFWQVYHDSEEGQRYIQFYKL
GDFPYVSILDPRTGQKLVEWHQLDVSSFLDQVTGFLGEHGQLDGLSSSPPKKCARSESLI
DASEDSQLEAAIRASLQETHFDSTQTKQDSRSDEESESELFSGSEEFISVCGSDEEEEVE
NLAKSRKSPHKDLGHRKEENRRPLTEPPVRTDPGTATNHQGLPAVDSEILEMPPEKADGV
VEGIDVNGPKAQLMLRYPDGKREQITLPEQAKLLALVKHVQSKGYPNERFELLTNFPRRK
LSHLDYDITLQEAGLCPQETVFVQERN

>gi568815595r:196256688_196532399|GENSCAN_predicted_CDS_5|984_bp
atgcaaaataagtggctgatgataaacattcaaaatgttcaagactttgcatgtcagtgc
ctcaaccgcgatgtgtggagcaacgaagctgtgaagaatattatccgggaacatttcatt
ttctggcaggtttatcatgacagtgaggaaggtcagagatacatacagttttataagtta
ggggatttcccctatgtttccatattggacccacggacaggtcagaagctagtagaatgg
caccagttagatgtatcttctttcttggaccaagtgacgggatttctgggtgaacatgga
caactggatggactttctagcagtccccccaaaaaatgtgcccgttcagagagccttata
gatgcaagtgaagacagccagctagaagctgccatcagagcctccttacaagaaacacat
tttgattcaacacagacaaaacaggatagccgctcagatgaagaatctgaatctgaactt
ttttctggcagtgaggagttcatatccgtttgtggctctgatgaagaagaagaggtagag
aatcttgccaagtccagaaagtctccccacaaagatttggggcatagaaaagaggagaat
agaaggccgctgactgagccaccagtcagaactgatcctggaacagccacaaaccaccaa
ggattgccagctgtggattcagagatactggagatgccacctgaaaaagcagatggagta
gtggaggggatagatgtaaatggaccaaaagcacagctgatgttgcggtatccagatgga
aaaagggaacagatcactcttccagagcaagctaaactgctagctttggtgaagcacgtg
cagtctaaaggatacccaaatgaacgttttgaacttctcaccaactttcctcgaaggaaa
ttatctcatctggactatgatattacattgcaagaggcaggcctttgtcctcaagagact
gtctttgtacaggaaagaaattaa

>gi568815595r:196256688_196532399|GENSCAN_predicted_peptide_6|381_aa
MGRNQSRKDENSKNQSASSPPKDRNSLPATEQRWMENDFDKLTEVVFRRSVITNFSKLKE
DVRTHRKEAKHLEKRLDELLTRINSVEKNLNDLMELKTTARELQIQTTTREYYKHLYANK
LENLEEMGKFLDTYTLPRLNQEEVESLNRPITGSEIEAIINSLTTKKSPGPDGFTAKFYQ
RYKEELVPFLLKLFQSTEKKGILPNSFYEANIILIPKPGRDTTKKENFRPISLMNIDAKI
LNKILANRIQQHMKKLIHHDQVGFIPGMQGWFNICKSINIIHHINRTNDKNHMIISIDAE
KTFNKIQQPFTLKILNKLGIDGTYLKIIRDIYGKPTANTILNGQKLEAFPLKTGTRQGCP
LSPLLFNILLEVLARAIRQEK

>gi568815595r:196256688_196532399|GENSCAN_predicted_CDS_6|1146_bp
atggggagaaaccagagcagaaaagatgaaaattctaaaaatcagagcgcctcttctcct
ccaaaggatcgcaactccttgccagcaacggaacaacgctggatggagaatgactttgac
aagttgacagaagtagtcttcagaaggtcggtaataacaaacttctccaagctaaaggag
gatgttcgaacccatcgcaaggaagctaaacaccttgaaaaaagattagatgaattgcta
actagaataaacagtgtagagaagaacttaaatgacctgatggagctgaaaaccacagca
cgagaacttcaaatacaaactaccaccagagaatactataaacacctctatgcaaataaa
ctagaaaatctagaagaaatgggcaaattcctggacacatacaccctcccaagactaaac
caggaagaagttgaatctctgaacagaccaataacaggctctgaaattgaggcaataatt
aacagcctaacaaccaaaaaaagtccaggaccagacggattcacagccaaattctaccag
aggtacaaagaggagctggtaccattccttctgaaactattccaatcaacagaaaaaaag
ggaatcctccctaactcattttatgaggccaacatcatcctgataccaaagcctggcaga
gacacaacaaaaaaggagaattttagaccaatatccctgatgaacattgatgcgaaaatc
ctcaataaaatactggcaaaccgaatccagcagcacatgaaaaagcttatccaccatgat
caagtcggcttcatccctgggatgcaaggttggttcaacatatgcaaatcaataaacata
atccatcatatcaacagaaccaacgacaaaaaccacatgattatctcaatagatgcagaa
aagaccttcaacaaaattcaacagcccttcacgctaaaaattctcaataaactaggtatt
gatggaacgtatctcaaaataataagagatatttatggcaaacccacagccaataccata
ctgaatgggcaaaaactggaagcattccctttgaaaaccggcacaagacaaggatgccct
ctctcaccactcctattcaacatactgttggaagttctggccagggcaatcaggcaagag
aaataa

>gi568815595r:196256688_196532399|GENSCAN_predicted_peptide_7|189_aa
MRLRRPLGTGGAGLEGRSMRLRTPHGEVGEGSPCVSVLLFGGGGGGKMAAHGGSAASSAL
KGLIQQFTTITGHSAPVPGLPFPFLGPSEPSPWPWRVRALPPSRQGREEVRAPIPQKQEI
LVEPEPLFGVRQEQELRNGGAIDKKLTTLADLFRPPIDLMHKGSFETYSGDTTAIERENS
SNIKGRKGS

>gi568815595r:196256688_196532399|GENSCAN_predicted_CDS_7|570_bp
atgcgcctgcgcagaccgctggggacgggaggggcggggctcgaggggcggtcaatgcgc
ctgcgcacaccgcacggcgaagtgggggagggcagtccgtgtgtgtctgtgttgttgttc
ggcggcggcggcggcggtaagatggctgcccacgggggctccgcggcgtcctcggcgctg
aaggggttaattcaacagttcaccaccattaccggtcattctgccccggtccccggcctg
cccttcccctttctgggcccctcggagccatcgccgtggccctggcgggttcgggccctc
ccgccttcacggcaaggccgagaagaagttcgtgccccaattcctcaaaagcaggaaata
ctggtggaaccagaaccattatttggtgttcggcaagaacaagaattaagaaatggagga
gctatcgataagaaattaactacccttgcagatctattccggccacccattgatttgatg
cataaaggcagctttgaaacatattctggtgatactacagccattgagagagagaatagc
agcaatattaaaggcagaaaaggatcttag

>gi568815595r:196256688_196532399|GENSCAN_predicted_peptide_8|866_aa
MPVHVTSTWLAAMISAHVATKKKQQELPYWVNLALSHQHLHIMFLEDKDFALLITILPVP
GIVLLIHSVDHKLQALETQFKELDFTKDNLMQKFEHHSKALASQAAQDEMWTAVRALQLT
SMELNILYSYVIEVLICLHTRVLEKLPDLVRGLPTLASVLRRKVKNKRVRVVWESILEEC
GLQEGDITALCTFFIARGNKAEHYTAKVRQMYIRDVTFLITNMVKNQALQDSLLRAVQSW
DNNSELIKFGNAIPSLSECQCGICMEILVEPVTLPCNHTLCKPCFQSTVEKASLCCPFCR
RRVSSWTRYHTRRNSLVNVELWTIIQKHYPRECKLRASGQESEEVADDYQPVRLLSKPGE
LRREYEEEISKVAAERRASEEEENKASEEYIQRLLAEEEEEEKRQAEKRRRAMEEQLKSD
EELARKLSIDINNFCEGSISASPLNSRKSDPVTPKSEKKSKNKQRNTGDIQKYLTPKSQF
GSASHSEAVQEVRKDSVSKDIDSSDRKSPTGQDTEIEDMPTLSPQISLGVGEQGADSSIE
SPMPWLCACGAEWYHEGNVKTRPSNHGKELCVLSHERPKTRVPYSKETAVMPCGRTESGC
APTSGVTQTNGNNTGETENEESCLLISKEISKRKNQESSFEAVKDPCFSAKRRKVSPESS
PDQEETEINFTQKLIDLEHLLFERHKQEEQDRLLALQLQKEVDKEQMVPNRQKGSPDEYH
LRATSSPPDKVLNGQRKNPKDGNFKRQTHTKHPTPERGSRDKNRQVSLKMQLKQSVNRRK
MPNSTRDHCKMYPGSQKTLAPEETDSQKYLVTYSSSHTSSAYVVWIKIWKTDTKNLEEIK
PSDIITKFPELLEGYNPEIIEHQDKI

>gi568815595r:196256688_196532399|GENSCAN_predicted_CDS_8|2601_bp
atgcctgtacatgttacttctacctggctggctgccatgatatctgcacatgtggcaaca
aagaaaaagcagcaggaactgccatattgggtgaacctggctcttagccaccagcattta
catattatgttccttgaggacaaggatttcgccttacttatcactatattaccagtgcct
ggaatagtgctgctcatccattcagtagaccacaaactccaagcgttagaaacacagttc
aaagaactagacttcaccaaggataacctgatgcagaaattcgaacatcatagtaaggct
ttggcaagccaagcagcccaagatgagatgtggacagcagttcgggcactccagctcact
tcaatggaattgaatattttatacagctacgtcattgaagtacttatctgcttgcatact
cgtgtgcttgagaagctgccagacctggtgagaggtcttccaaccttagcctctgtactc
agaagaaaagttaagaacaagcgcgttagagttgtatgggagtccatactggaggagtgt
gggctgcaagaaggagacatcacagcactttgtaccttctttattgcacgtggtaacaag
gcagaacactatactgctaaagtgaggcagatgtacatcagggatgtcacgttcctaatt
actaacatggtaaagaaccaggctctgcaggacagtttgctgagggctgtgcagtcatgg
gacaacaactcagaactcatcaaattcgggaacgccatcccctcgctgtccgagtgccag
tgcgggatctgcatggaaatcctcgtggagcccgtcaccctcccgtgtaaccacacgctg
tgtaaaccgtgcttccagtcgaccgtcgaaaaggcgagtttatgctgtcccttctgtcgc
cgccgggtatcgtcgtggactcggtaccatacccgaagaaattctctcgtcaacgtggaa
ctgtggacgataattcaaaaacactatcccagggagtgcaagcttagagcgtctggccaa
gaatcagaggaagtggctgatgactatcagccagttcgtctgctcagtaaacctggggaa
ctgagaagagaatatgaagaggaaataagcaaggtggcggcagagcgacgggccagcgag
gaagaagaaaacaaagccagtgaagaatacatacagaggttgttggcagaggaggaagaa
gaggaaaaaagacaggcagaaaaaaggcgaagagcgatggaagaacaactgaaaagtgat
gaggaactggcaagaaagctaagcattgatattaacaatttctgtgagggaagtatctcg
gcttctcccttgaattccagaaaatctgatccagttacacccaagtctgaaaagaaaagt
aagaacaaacaaagaaacactggagatattcagaagtatttgacaccgaaatctcagttt
gggtcagcctcacactctgaagctgtacaagaagtcaggaaagactccgtatctaaggac
attgacagtagtgataggaaaagcccaacagggcaagacacagaaatagaagatatgccg
acactttctccacagatatcccttggagttggagaacaaggtgcagattcttcaatagag
tcccctatgccatggttatgtgcctgtggtgccgaatggtaccatgaaggaaacgtcaaa
acaagaccaagcaatcatgggaaagagttatgtgtcttaagtcacgagcgacctaaaacc
agagttccctactcgaaagaaactgcagttatgccttgtggcagaacagaaagtgggtgc
gcccccacatcaggggtgacacagacaaatggaaacaacacaggtgagacagaaaatgaa
gagtcgtgcctactgatcagtaaggagatttccaaaagaaaaaaccaagaatcttccttt
gaagcagtcaaggatccatgcttttctgcaaaaagaagaaaagtgtcccccgaatcttcc
ccagatcaagaggaaacagaaataaactttacccaaaaactgatagatttggagcatcta
ctgtttgagagacataaacaagaagaacaggacaggttattggcattacaacttcagaag
gaggtggataaagagcaaatggtgccaaaccggcaaaaaggatccccagatgagtatcac
ttacgcgctacatcctcccctccagacaaagtgctaaatggacagaggaagaatcccaaa
gatgggaacttcaaaaggcaaactcacacaaagcatccaacaccagagagaggctcaagg
gacaaaaataggcaagtgtctttaaagatgcagttgaagcagtcagttaatagaagaaag
atgccaaattctactagagatcactgtaagatgtatcctgggtcacaaaagaccttggct
ccagaagaaactgattctcagaaataccttgtgacttactcaagttctcatactagctct
gcgtatgtggtgtggataaaaatttggaaaacagacactaaaaacttggaggaaattaag
ccatcagacatcatcactaaattcccagagctcctagaaggatacaatccagaaataata
gaacaccaggataagatttga

>gi568815595r:196256688_196532399|GENSCAN_predicted_peptide_9|62_aa
XLHSGDQCSELGRHRLTLSDGRGREPPADGGSGRQRPLEARAGSTPAAPATAQLPAEAPR
PV

>gi568815595r:196256688_196532399|GENSCAN_predicted_CDS_9|189_bp
nctctccattccggggaccagtgttctgagctcggccgccatcgcctcacgctcagcgac
gggcgtggccgggagccgcctgcggatggaggaagcggccgccagcgccccctggaggct
cgggccggctccactcccgcggcccccgccaccgcccagctccccgcggaggcgcccagg
ccggtctga