GENSCAN 1.0    Date run: 4-Nov-116    Time: 10:07:21

Sequence gi568815595r:149866492_150070856 : 204365 bp : 38.06% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   5538   5663  126  2  0   70   86    54 0.329   3.36
 1.02 Intr +  28982  29069   88  1  1  141   98   106 0.984  15.52
 1.03 Intr +  35581  35671   91  2  1   43  110    56 0.761   1.43
 1.04 Intr +  45487  45592  106  1  1  115   78   -18 0.597  -0.80
 1.05 Intr +  47069  47179  111  1  0  118   68    -1 0.414   0.56
 1.06 Intr +  54643  54736   94  0  1  118  115    31 0.846   7.42
 1.07 Term +  58550  58659  110  0  2   81   50    71 0.596   0.29
 1.08 PlyA +  61561  61566    6                               1.05

 2.03 PlyA -  62004  61999    6                               1.05
 2.02 Term -  70871  70723  149  0  2   58   39   109 0.221   0.08
 2.01 Init -  73288  72847  442  1  1   82   32   442 0.165  34.27
 2.00 Prom -  77811  77772   40                              -3.65

 3.00 Prom +  78494  78533   40                             -11.04
 3.01 Sngl +  79469  79762  294  1  0   88   54   297 0.911  21.75
 3.02 PlyA +  82112  82117    6                              -1.75

 4.00 Prom +  82124  82163   40                              -3.45
 4.01 Init +  92996  93038   43  1  1    9  115     3 0.410  -4.24
 4.02 Intr +  93565  93645   81  2  0   86   60    68 0.845   2.49
 4.03 Term +  94249  94613  365  2  2   66   38   445 0.971  31.04
 4.04 PlyA +  95380  95385    6                               1.05

 5.06 PlyA -  96871  96866    6                               1.05
 5.05 Term -  99773  99676   98  0  2   81   47    82 0.710   0.55
 5.04 Intr - 102059 101867  193  2  1   33  105   195 0.964  13.84
 5.03 Intr - 104520 104234  287  1  2   99   89   155 0.351  12.84
 5.02 Intr - 105095 104867  229  2  1    7   41   139 0.112  -2.38
 5.01 Init - 105268 105116  153  1  0   75   -1   164 0.239   6.23
 5.00 Prom - 108697 108658   40                              -6.55

 6.02 PlyA - 108895 108890    6                               1.05
 6.01 Sngl - 116766 115687 1080  0  0  101   33  1171 0.859 109.73
 6.00 Prom - 117109 117070   40                             -14.06

 7.00 Prom + 117743 117782   40                              -6.15
 7.01 Sngl + 117995 118306  312  1  0    3   38   326 0.488  14.78
 7.02 PlyA + 118495 118500    6                               1.05

 8.00 Prom + 122928 122967   40                              -5.25
 8.01 Init + 126066 126115   50  2  2   54   89    46 0.329   1.77
 8.02 Term + 133837 134068  232  1  1   49   49   245 0.477  11.66
 8.03 PlyA + 134531 134536    6                               1.05

 9.00 Prom + 143709 143748   40                              -4.25
 9.01 Init + 150241 150442  202  0  1   93   82   115 0.171  10.49
 9.02 Term + 157160 157512  353  0  2   -3   38   284 0.031   7.96
 9.03 PlyA + 158590 158595    6                               1.05

10.00 Prom + 162178 162217   40                              -4.15
10.01 Init + 163120 163221  102  0  0   78   83   115 0.977  10.29
10.02 Intr + 165645 165724   80  2  2   92   45    97 0.706   3.23
10.03 Intr + 168958 169133  176  1  2  100   11   163 0.686   8.46
10.04 Intr + 172452 172570  119  1  2  109   30    76 0.194   3.16
10.05 Intr + 173296 173379   84  0  0   84   86    53 0.513   3.80
10.06 Intr + 175019 175079   61  1  1   79   59    50 0.172  -1.41
10.07 Intr + 180306 180450  145  1  1   74   90    48 0.351   2.02
10.08 Term + 184410 184983  574  0  1  -55   49   404 0.273  15.14
10.09 PlyA + 185431 185436    6                               1.05

11.05 PlyA - 186696 186691    6                               1.05
11.04 Term - 187171 187007  165  1  0   56   36   124 0.268   0.93
11.03 Intr - 190746 190594  153  0  0   41   92   148 0.952   9.85
11.02 Intr - 200723 200562  162  2  0   56   63    75 0.011   1.05
11.01 Intr - 203943 203862   82  2  1  114   61    40 0.616   2.62

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl + 157180 157512  333  0  0   78   38   261 0.913  15.87

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595r:149866492_150070856|GENSCAN_predicted_peptide_1|241_aa
GFLINSKPENACEPIVPPPVKDNSSGTFIVLIRRLDCNFDIKVLNAQRAGYKAAIVHNVD
SDDLISMGSNDIEVLKKIDIPSVFIGESSANSLKDEFTYEKGGHLILVPEFSLPLEYYLI
PFLIIVGICLILIVIFMDQIQDTTLYFIILSSLWSMTILLFLFFMALIILSTSQITKFVQ
DRHRARRNRLRKDQLKKLPVHKFKKGRKSPTEGEDRGGGIGAFKNRGIKYYYSRRVGGGM
D

>gi568815595r:149866492_150070856|GENSCAN_predicted_CDS_1|726_bp
ggttttttgattaactcaaaaccagagaatgcctgtgaacccatagtgcctccaccagta
aaagacaattcatctggcactttcatcgtgttaattagaagacttgattgtaattttgat
ataaaggttttaaatgcacagagagcaggatacaaggcagccatagttcacaatgttgat
tctgatgacctcattagcatgggatccaacgacattgaggtactaaagaaaattgacatt
ccatctgtctttattggtgaatcatcagctaattctctgaaagatgaattcacatatgaa
aaagggggccaccttatcttagttccagaatttagtcttcctttggaatactacctaatt
cccttccttatcatagtgggcatctgtctcatcttgatagtcattttcatggatcaaatc
caggataccacattgtatttcattatcctgtcttccctctggtctatgacaattttgctg
ttcttgtttttcatggccttgataattttgagtactagccagatcacaaaatttgtccag
gatagacatagagctagaagaaacagacttcgtaaagatcaacttaagaaacttcctgta
cataaattcaagaaaggtaggaaatcaccaactgagggtgaggacagaggaggaggtatt
ggagcttttaagaacagaggtataaaatactattattccaggagagtaggaggaggaatg
gactag

>gi568815595r:149866492_150070856|GENSCAN_predicted_peptide_2|196_aa
MDMSPLRPQNYLFSCQQKADKDYHFKVNNDENEHQLSLRTVISGTGAKDKLHIVETEAMN
YEGSPIKVTLATLKMSVQPMVSGGLEITPLVILRLKCASEPVHISGQHLVAVEEDAESED
EEEEYVKLKFIWKAICPRRWQKSSTEKIIRHWIKKSPKTYGSYPGETETKVDSQFPYRLG
LLCRKAVPASTHRKHC

>gi568815595r:149866492_150070856|GENSCAN_predicted_CDS_2|591_bp
atggacatgagccctctgaggccccagaactatcttttcagttgtcaacaaaaggccgac
aaagattatcactttaaggtgaataatgatgaaaatgagcaccagttatctttaagaacg
gtgatttcagggactggagcaaaggataaattgcatattgttgaaacagaggcaatgaat
tatgaaggcagtccgattaaagtaacactggcaactttgaaaatgtctgtacagccaatg
gtttctgggggccttgaaataacaccactggtgatcttacggttgaagtgtgcttcagag
ccagtgcatattagtggacagcacttagtagctgtggaggaagatgcagagtcagaagat
gaagaagaggagtatgtgaaactcaagtttatctggaaagcgatctgcccccggaggtgg
cagaaaagttccacagaaaaaattattaggcactggataaaaaagtctcccaagacttac
ggttcctatcctggagaaactgagactaaggtagacagccagtttccctaccgtctaggg
ttactatgcaggaaagctgttcctgcctcaacccacaggaagcattgctag

>gi568815595r:149866492_150070856|GENSCAN_predicted_peptide_3|97_aa
MGKKQSRKAENSKNQSASPPPKECSSSPATEQSWMENDFDKLREEGFRRSNFSELKEEVR
THPKESKNLEKRLDKWLTRITSVAKSLNDLMELKTMA

>gi568815595r:149866492_150070856|GENSCAN_predicted_CDS_3|294_bp
atggggaaaaaacagagcagaaaagctgaaaattctaaaaatcagagcgcctctccccct
ccaaaggaatgcagctcctcaccagcaacggaacaaagctggatggagaatgactttgac
aagttgagagaagaaggcttcagacgatcaaatttctccgagctaaaggaggaagtgcga
acccatcccaaagaatctaaaaaccttgagaaaagattagacaaatggctaactagaata
accagtgtagcaaagtccttaaatgacctgatggagctgaaaaccatggcatga

>gi568815595r:149866492_150070856|GENSCAN_predicted_peptide_4|162_aa
MCLALCSFHPYNLQGDEYDVCAICLDEYEDGDKLRILPCSHAYHCKCVDPWLTKTKKTCP
VCKQKVVPSQGDSDSDTDSSQEENEVTEHTPLLRPLASVSAQSFGALSESRSHQNMTESS
DYEEDDNEDTDSSDAENEINEHDVVVQLQPNGERDYNIANTV

>gi568815595r:149866492_150070856|GENSCAN_predicted_CDS_4|489_bp
atgtgcctggccctttgctcatttcatccttataacctgcaaggagatgagtatgatgta
tgtgccatttgtttggatgagtatgaagatggagacaaactcagaatccttccctgttcc
catgcttatcactgcaagtgtgtagacccttggctaactaaaaccaaaaaaacctgtcca
gtgtgcaagcaaaaagttgttccttctcaaggcgattcagactctgacacagacagtagt
caagaagaaaatgaagtgacagaacatacccctttactgagacctttagcttctgtcagt
gcccagtcatttggggctttatcggaatcccgctcacatcagaacatgacagaatcttca
gactatgaggaagacgacaatgaagatactgacagtagtgatgcagaaaatgaaattaat
gaacatgatgtcgtggtccagttgcagcctaatggtgaacgggattacaacatagcaaat
actgtttga

>gi568815595r:149866492_150070856|GENSCAN_predicted_peptide_5|319_aa
MKDRRSSARFLSATDACSDFRLPARARGSAFMDEAQEGSRKDGALRDSEISLPLPPAAPT
TKRRKRSASRHQDNSVTRHRGSRFRIPGSEHRRKAHRGPAAGLQGVRAAAVPGVPRATRG
PPSPPARAPRGGGRSVPAQSRLLPAPAPPPPAPPPPATAAAAAAPAAPRRPRCSAKGSKM
AGWQSYVDNLMCDGCCQEAAIVGYCDAKYVWAATAGGVFQSITPIEIDMIVGKDREGFFT
NGLTLGAKKCSVIRDSLYVDGDCTMDIRTKSQGGEPTYNVAVGRAGRALVIVMGKEGVHG
GTLNKKAYELALYLRRSDV

>gi568815595r:149866492_150070856|GENSCAN_predicted_CDS_5|960_bp
atgaaagatcggcgaagctcagcaaggttcctgagcgccacggacgcctgctctgatttc
aggttgccggcgcgcgcccgcgggtctgcctttatggacgaggcgcaagagggatccagg
aaagacggtgcccttagagactcggagattagcctgcccctgccaccagccgcccccaca
acaaaaaggaggaaacgctccgcatcccgtcaccaggacaactctgtaacccgacaccga
ggctcaagattccgcatcccgggcagcgaacacaggagaaaagcacatcgcggcccggcc
gcggggctccaaggcgtccgtgccgccgctgttcctggggtgccgagagcgacgcggggt
ccaccctctccgccggcccgagctcctcggggaggggggcggtcggtgcctgcgcagagc
cgcctcctccccgcccccgccccgcctccccccgcgccgccgccgcccgctaccgccgcc
gccgccgctgcgcctgctgctcctcgccgtccgcgctgcagtgcgaagggctcgaagatg
gccggttggcagagctacgtggataacctgatgtgcgatggctgctgccaggaggccgcc
attgtcggctactgcgacgccaaatacgtctgggcagccacggccgggggcgtctttcag
agcattacgccaatagaaatagatatgattgtaggaaaagaccgggaaggtttctttacc
aacggtttgactcttggcgcgaagaaatgctcagtgatcagagatagtctatacgtcgat
ggtgactgcacaatggacatccggacaaagagtcaaggtggggagccaacatacaatgtg
gctgtcggcagagctggtagagcattggttatagtcatgggaaaggaaggtgtccacgga
ggcacacttaacaagaaagcatatgaactcgctttatacctgaggaggtctgatgtgtaa

>gi568815595r:149866492_150070856|GENSCAN_predicted_peptide_6|359_aa
MPKRGKRLKFRAHDACSGRVTVADYADSDLAVVRSGRVKKAVANAVRQEVKSLCGLEASQ
VPAEEALSGAGEPYDIIDSSDEMDAQEENIHERTVSRKKKSKRHKEELDGAGGEEYPMDI
WLLLASYIRPEDIVNFSLICKNAWTVTCTAAFWTRLYRRHYTLDASLPLRLRPESMEKLH
CLRACVIRSLYHMYEPFAARISKNPAIPESTPSTLKNSKCLLFWCRKIVGNRQEPMWEFN
FKFKKQSPRLKSKCTGGLQPPVQYEDVHTNPDQDCCLLQVTTLNFIFIPIVMGMIFTLFT
INVSTDMRHHRVRLVFQDSPVHGGRKLRSEQGVQVILDPVHSVRLFDWWHPQYPFSLRA

>gi568815595r:149866492_150070856|GENSCAN_predicted_CDS_6|1080_bp
atgcccaagagaggaaagcgactcaagttccgggcccacgacgcctgctccggccgagtg
accgtggcggattacgccgactcggatctggcggtcgtgaggtctggacgagtcaagaaa
gccgtagccaacgctgttcggcaggaagtaaaatctctttgtggcttggaagcctctcag
gttcctgcagaggaagctctttctggggctggtgagccctatgacatcatcgacagcagt
gatgagatggatgcccaggaggaaaacatccatgagagaactgtctccagaaaaaagaaa
agcaagagacacaaagaagaactggacggggctggaggagaagagtatcccatggatatt
tggctattgctggcctcctatatccgtcctgaggacattgtgaatttttccctgatttgt
aagaatgcctggactgtcacttgcactgctgccttttggaccaggttgtaccgaaggcac
tacacgctggatgcttccctgcctttgcgtctgcgaccagagtcaatggagaagctgcac
tgtctccgggcttgtgtgatccgatctctgtaccatatgtatgagccatttgctgctcga
atctccaagaatccagccattccagaaagcacccccagcacattaaagaattccaaatgc
ttacttttctggtgcagaaagattgttgggaacagacaggaaccaatgtgggaattcaac
ttcaagttcaaaaaacagtcccctaggttaaagagcaagtgtacaggaggattgcagcct
cccgttcagtacgaagatgttcataccaatccagaccaggactgctgcctactgcaggtc
accaccctcaatttcatctttattccgattgtcatgggaatgatatttactctgtttact
atcaatgtgagcacggacatgcggcatcatcgagtgagactggtgttccaagattcccct
gtccatggtggtcggaaactgcgcagtgaacagggtgtgcaagtcatcctggacccagtg
cacagcgttcggctctttgactggtggcatcctcagtacccattctccctgagagcgtag

>gi568815595r:149866492_150070856|GENSCAN_predicted_peptide_7|103_aa
MCLGGDFTCHNGTGGKSIYAEKFDDESVILKYTGSGILSMANAVLNTNGSHFFICTAKTE
WLVGKHVVFGKVKAGTNKVEAMECFGSRNGKTSKKIIADCGKR

>gi568815595r:149866492_150070856|GENSCAN_predicted_CDS_7|312_bp
atgtgtctgggtggtgatttcacatgccataatggcactggtggaaagtccatctatgcg
gagaaatttgatgatgagagcgtcatcctgaagtatacaggttctggcatcttgtccatg
gcaaatgctgtactcaacacaaatggttcccattttttcatctgcactgcaaagactgag
tggttggtaggcaagcatgtggtcttcggcaaggtaaaagcaggcactaataaggtggaa
gccatggaatgctttgggtccaggaatggcaagaccagcaagaagatcattgctgactgt
ggaaaacgctaa

>gi568815595r:149866492_150070856|GENSCAN_predicted_peptide_8|93_aa
MTGSVGSLQRLLNVGKEEEEALVTLEEEAEATEVVNRVMETRAVAMAGLAPMTAITMEEA
EAALMLAVEAILEVAEATMILAITTVSLQILNP

>gi568815595r:149866492_150070856|GENSCAN_predicted_CDS_8|282_bp
atgacaggatcagttgggagtctgcagaggctcctcaatgtggggaaagaggaggaggaa
gccctggttactctggaggaggaagcagaggctacagaagtggtgaacagggttatggaa
accagggcagtggctatggcaggattggcacctatgactgctataacaatggaggaggca
gaggcagcattaatgctagcagtggaagcaattttggaggtggcagaagctacaatgatt
ttggcaattacaactgtcagtcttcaaattttgaatccatga

>gi568815595r:149866492_150070856|GENSCAN_predicted_peptide_9|184_aa
MGSRTISPKMVFKLGSWLRSSGACLLSDESDREVIVKGMGSRAQVDGESLETLHPEKRKG
GEFYIDADHTSSPAMDPNQDRISELPKKEFRIKLIQEAPEKDKIQLKEMKNMIQDMKGKF
FSETDSINKKQSQLMQIKDTFREMQSALESLRNTIEQAEETTPELGDKAFELTQSIKDKE
KGIF
>gi568815595r:149866492_150070856|GENSCAN_predicted_CDS_9|555_bp
atgggttccaggaccatcagcccaaagatggtattcaagctgggatcctggttgagatca
tctggagcatgtttgctttctgatgaaagtgatagagaagtcattgttaaggggatggga
tccagagctcaagtggacggggaatctctggagacacttcatccagagaaaagaaaaggt
ggagaattttacatagatgctgatcataccagctcaccagcaatggatccaaaccaagac
agaatctctgaactgccaaaaaaagaattcagaattaagctaatacaggaggcaccagag
aaagataaaatccaacttaaagaaatgaaaaacatgatacaggatatgaaaggaaaattc
ttcagtgaaacagatagcataaataaaaaacaatcgcaacttatgcaaatcaaggacaca
tttagagaaatgcaaagtgcactggaaagtctccgcaatacaatcgaacaagcagaagaa
acaactccagagcttggagacaaggcttttgaattaacacaatccatcaaagacaaagag
aaaggaattttttaa

>gi568815595r:149866492_150070856|GENSCAN_predicted_peptide_10|446_aa
MKFTRVIHTGNINLGVISVQMVFKAKRLAEITKGEGRIEVAADPYGFAGGNTQLIWMHIK
MVVVDLAMLCGRPQGIGVSSFVDKKQFRENTGPPVGRIYLKTKGWPSSGDLMAAALRVSS
YQMFAQECNLCNFAAASDCRPPLHLHEVTTKWPMGNLQRVSDFVKLHANLQTFLSGGDAN
DFCLREKALSKLQHKEEEPDLAQPSCQRLIVEICLSIKKQKQPAQQLEERDCPLKIQASQ
KNGSIFFLNSYFLNHRRSRSKTGGKKFRKEKNSREDKRERTATEEKQEEIKKRLKEPEEP
KVLTPEEQLADKLGLNKLQEESDLELAKETFGVNSTVYGIDAMNPSSRDDFTEFGKLLKD
KITQNEKSLYDVFFFLEILVRDVCISLEIDNLKKITNSLTVLCRKNRSKKNQKEEERCGS
WRGIKATMKDDLADYGGYDGGICTRL

>gi568815595r:149866492_150070856|GENSCAN_predicted_CDS_10|1341_bp
atgaagtttacgagagttattcacactggaaatataaatttgggtgtcatcagcgtacag
atggtgtttaaagccaagagacttgcagagatcactaagggagagggaagaattgaggtt
gctgcagacccatatggattcgctggtggtaacactcagttgatttggatgcacattaaa
atggtagtggtagaccttgcaatgctttgtggccgtccccaggggataggggtaagctca
tttgttgacaagaaacaatttagagaaaacactgggccacctgtgggcagaatctattta
aaaaccaaaggctggcccagcagtggagatctcatggctgctgctctacgcgtgagctcc
tatcagatgtttgcacaggagtgtaacctttgtaacttcgctgcagcctctgattgcagg
ccgccacttcatttacatgaggtgaccaccaagtggccaatgggaaacctgcagagggtc
tcagactttgtgaaattacatgctaacttgcagacatttttgtctggtggagatgcaaat
gatttctgtctgagagaaaaggctctttctaagctgcagcacaaggaagaggaacccgac
ttggcccaaccatcatgccaaaggttaatagtggaaatatgtttatccatcaagaaacaa
aaacagcctgcccaacaactggaagagagggattgtcctctcaagattcaggctagccaa
aagaatggaagtatctttttcttaaacagctatttcctcaaccacagaagaagcagaagt
aaaaccggaggtaaaaagttcagaaaagaaaaaaacagcagagaagataaaagagaaaga
acggcaacagaagaaaagcaagaagaaattaaaaagaggttaaaagaacctgaagaacct
aaagtgctaacaccagaagaacaattagcagataaactggggctaaataaattacaggaa
gagtcagacctcgaattagcaaaggaaacttttggtgttaatagtacagtttatggaata
gatgctatgaacccatcttcaagagatgattttacagagtttggaaagttactaaaagat
aaaattacacaaaatgaaaagtcactatatgatgtctttttttttttggaaatcttagtt
cgagatgtgtgtatttcattggaaattgataacttgaaaaagattaccaattcactgact
gtactttgccgtaaaaacagaagcaagaaaaaccaaaaagaagaagaaaggtgtggttcc
tggagagggataaaagccaccatgaaagatgatctggcagattatggtggttatgatgga
ggtatatgtacaagactatga

>gi568815595r:149866492_150070856|GENSCAN_predicted_peptide_11|187_aa
XSPLSNRKLEGKRLNAAHTSRLPGTQHRDNWEPWSVLGTSKDQTRCELMQLSSSISKSES
NGKTKMAMGEQKECQPLPRQGRESPEQKSTQGDNILTIYLLNIFTGLALPCVQFVAGAAA
SKKKPPIQEDCAQYSRSTYTVPDIILEAENTAENKTGKAPAILKVTCEWHENNREIIVNK
DDNFRYP

>gi568815595r:149866492_150070856|GENSCAN_predicted_CDS_11|564_bp
nngtccccattatccaataggaagttggagggcaaaagactcaatgcagcccataccagt
cggcttcctgggacacagcacagggataactgggagccatggagcgtcttaggtacgagt
aaagaccagaccagatgtgaactgatgcagctctctagcagcatcagtaagtcagaaagc
aatggaaaaaccaagatggcaatgggggagcaaaaggaatgccagcctctgccaaggcag
ggcagagaaagccctgaacaaaaatcaacccaaggtgacaacattctcaccatttacctt
ctgaacatctttactggccttgcactgccctgtgttcagtttgtggcaggggctgctgct
tccaaaaagaagcccccaatccaggaggactgcgctcagtattcacggagcacctacact
gtaccagacatcattctagaagcagaaaatacagcagagaacaagacaggtaaagcccca
gctatcctgaaggttacctgtgaatggcatgaaaataatagagaaataatagttaataaa
gatgataatttcagatatccataa