GENSCAN 1.0 Date run: 5-Nov-116 Time: 01:15:49 Sequence gi568815577f:33303544_33536959 : 233416 bp : 43.90% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 21513 21588 76 2 1 49 94 190 0.501 14.95 1.02 Intr + 21679 21905 227 2 2 -12 30 182 0.671 0.10 1.03 Intr + 31175 31390 216 1 0 61 33 153 0.348 5.80 1.04 Intr + 34453 34538 86 0 2 24 102 56 0.368 -0.78 1.05 Intr + 37456 37631 176 1 2 85 116 29 0.761 4.98 1.06 Intr + 39725 39879 155 0 2 81 74 42 0.417 1.89 1.07 Term + 51773 52006 234 1 0 54 47 176 0.684 6.42 1.08 PlyA + 56250 56255 6 1.05 2.04 PlyA - 57325 57320 6 1.05 2.03 Term - 65326 65179 148 0 1 68 53 134 0.603 5.27 2.02 Intr - 79842 79629 214 1 1 70 69 130 0.243 7.07 2.01 Init - 82427 82382 46 2 1 86 58 48 0.623 0.45 2.00 Prom - 96713 96674 40 -1.96 3.00 Prom + 98797 98836 40 -4.16 3.01 Init + 100001 100073 73 1 1 78 99 271 0.974 26.43 3.02 Intr + 104383 104465 83 2 2 8 59 68 0.013 -4.74 3.03 Intr + 107902 108028 127 0 1 41 76 106 0.301 4.95 3.04 Intr + 111345 111477 133 1 1 103 66 97 0.981 8.70 3.05 Intr + 117937 118142 206 1 2 99 95 125 0.939 13.14 3.06 Intr + 129171 129328 158 1 2 78 94 97 0.589 9.03 3.07 Term + 133285 133419 135 0 0 27 46 116 0.126 -0.68 3.08 PlyA + 133653 133658 6 -1.75 4.08 PlyA - 133734 133729 6 -0.45 4.07 Term - 133823 133757 67 1 1 62 42 73 0.302 -2.49 4.06 Intr - 135754 135671 84 0 0 95 115 101 0.360 12.44 4.05 Intr - 156605 156536 70 0 1 92 43 11 0.023 -4.76 4.04 Intr - 156962 156870 93 0 0 85 106 4 0.455 1.84 4.03 Intr - 161866 161799 68 0 2 93 103 38 0.691 4.35 4.02 Intr - 163569 163467 103 1 1 66 103 12 0.559 -0.27 4.01 Init - 165342 165252 91 0 1 76 76 42 0.583 2.45 4.00 Prom - 166164 166125 40 -3.16 5.00 Prom + 167670 167709 40 -3.76 5.01 Init + 175276 175342 67 0 1 63 91 60 0.425 5.03 5.02 Term + 176339 176439 101 0 2 35 54 81 0.275 -2.31 5.03 PlyA + 177701 177706 6 1.05 6.02 PlyA - 178018 178013 6 1.05 6.01 Sngl - 178570 178037 534 1 0 43 34 793 0.988 65.47 6.00 Prom - 178741 178702 40 -3.66 7.02 PlyA - 178978 178973 6 1.05 7.01 Sngl - 185829 184684 1146 0 0 80 41 559 0.553 46.78 7.00 Prom - 198486 198447 40 -3.46 8.18 PlyA - 199649 199644 6 1.05 8.17 Term - 200772 200581 192 0 0 87 48 151 0.998 8.42 8.16 Intr - 200984 200869 116 0 2 113 68 -16 0.988 -0.93 8.15 Intr - 202159 202018 142 1 1 95 103 72 0.994 9.33 8.14 Intr - 202561 202431 131 2 2 126 89 64 0.999 10.71 8.13 Intr - 207915 207709 207 1 0 69 75 120 0.915 7.85 8.12 Intr - 213535 213446 90 2 0 75 82 55 0.759 3.57 8.11 Intr - 214065 213814 252 1 0 91 65 167 0.929 12.11 8.10 Intr - 217019 216821 199 2 1 55 56 62 0.670 -1.38 8.09 Intr - 217472 217363 110 0 2 60 84 58 0.697 2.60 8.08 Intr - 218739 218645 95 2 2 90 82 88 0.992 8.01 8.07 Intr - 221457 221226 232 1 1 74 98 136 0.994 10.03 8.06 Intr - 224701 224624 78 2 0 62 109 92 0.782 8.22 8.05 Intr - 225394 225307 88 1 1 99 96 28 0.938 4.24 8.04 Intr - 227341 227216 126 1 0 88 80 25 0.855 2.58 8.03 Intr - 228913 228802 112 0 1 82 96 110 0.972 11.58 8.02 Intr - 231210 231036 175 1 1 67 92 36 0.943 0.90 8.01 Intr - 231777 231682 96 1 0 102 63 64 0.958 5.28 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815577f:33303544_33536959|GENSCAN_predicted_peptide_1|389_aa MMVVLLGATTLVLVAVAPWVLSAAADAQSGKPSVHFAAPKIKPDLGSQINQEKVVFWVLS CRLPVAVYGSSGAPGSHPREMAVPELCVEFDSFRESTAAPLCQVMRRVIQVCEGQLDVQT EGTGAISGYPTTQFMTQVVIQGDITSYDAVNTEGKAAEIHYTFPPLQWEMGQGKLFAKYA SNKSLISKELKQSTTKPKSIKRTGMDNWIKLSGCQNITSTKCNFSSLKLNVYEEIKLRIR AEKENTSSWYEVDSFTPFRKAQIGPPEVHLEAEDKAIVIHISPGTKDSVMWALDGLSFTY SLVIWKNSSGVEYFSEQPLKNLLLSTSEEQIEKCFIIENISTIATVEETNQTDEDHKKYS SQTSQDSGNYSNEDESESKTSEELQQDFV >gi568815577f:33303544_33536959|GENSCAN_predicted_CDS_1|1170_bp atgatggtcgtcctcctgggcgcgacgaccctagtgctcgtcgccgtggcgccatgggtg ttgtccgcagccgcagacgcccagtctgggaaaccttcggtccactttgccgcgccaaag attaaacccgacctgggctcgcaaatcaaccaggagaaagtggtgttctgggtcctctct tgccgcttgcctgtggccgtgtacgggtcctcgggagcgcccgggtcccacccccgtgaa atggcggtgccagagctttgtgtcgagtttgattctttccgggaaagtaccgcggctccg ctgtgtcaagtgatgcgcagggtgatccaggtgtgtgaggggcagctggatgtccagact gagggcactggtgccatcagtggctatcctaccactcaattcatgacccaggtggtgatc cagggtgatatcaccagttatgatgcagttaacacagaggggaaagctgctgagatacac tatactttcccacccctgcagtgggagatggggcaggggaaactatttgcaaaatatgca tccaacaagagcttaatatccaaggaactcaaacaatcaacaacaaaacccaaatccatc aaaagaactgggatggataattggataaaattgtctgggtgtcagaatattactagtacc aaatgcaacttttcttcactcaagctgaatgtttatgaagaaattaaattgcgtataaga gcagaaaaagaaaacacttcttcatggtatgaggttgactcatttacaccatttcgcaaa gctcagattggtcctccagaagtacatttagaagctgaagataaggcaatagtgatacac atctctcctggaacaaaagatagtgttatgtgggctttggatggtttaagctttacatat agcttagttatctggaaaaactcttcaggtgtagaatatttctctgaacagccattgaag aatcttctgctttcaacttctgaggaacaaatcgaaaaatgtttcataattgaaaatata agcacaattgctacagtagaagaaactaatcaaactgatgaagatcataaaaaatacagt tcccaaactagccaagattcaggaaattattctaatgaagatgaaagcgaaagtaaaaca agtgaagaactacagcaggactttgtatga >gi568815577f:33303544_33536959|GENSCAN_predicted_peptide_2|135_aa MGFLHVEAGLELLTSVWVTWRFLWLTASDFSINFVAKAPCTVLCMVRTVWFVSDSGAFPD AGLSVLKLGKPRVNQDESAALPRALDKPAAGDILTTQPHLHYGLYPVPKLLSKKNEKNED MLTIRRRRMGREEFC >gi568815577f:33303544_33536959|GENSCAN_predicted_CDS_2|408_bp atggggtttctccatgttgaggctggtctcgaactgctgacctcagtctgggtcacgtgg cgcttcctgtggctcactgcctctgacttttcaatcaactttgtggctaaagcaccatgc actgtgctttgcatggtccggactgtctggtttgtcagcgactcaggggcttttccagat gcgggactttcagtgctgaaactgggaaagcccagggtaaaccaggatgagtcagcggcc ctgcctcgtgctctggataagcctgctgctggagacatcctgaccactcagccccacctg cattatggcttgtacccagttcccaagctcttgtccaagaagaatgagaagaatgaggat atgctgacaattcgaaggcggaggatgggcagagaagaattttgttga >gi568815577f:33303544_33536959|GENSCAN_predicted_peptide_3|304_aa MRPTLLWSLLLLLGVFAAAAAAPPVFLEENNFLVMAEESENAFLFLICKVMERWEMVNRR VEGGIMVPKDVHALILRVREPGNVLPDVQKGLRRYPLSQLPAPQHPKIRLYNAEQVLSWE PVALSNSTRPVVYQVQFKYTDSKWFTADIMSIGVNCTQITATECDFTAASPSAGFPMDFN VTLRLRAELGALHSAWVTMPWFQHYRNASTELQQVILISVGTFSLLSVLAGACFFLVLKY RGLIKYWFHTPPSIPLQIEEYLKDPTQPILEALDKDSSPKDDVWDSVSIISFPEKEQEDV LQTL >gi568815577f:33303544_33536959|GENSCAN_predicted_CDS_3|915_bp atgcgaccgacgctgctgtggtcgctgctgctgctgctcggagtcttcgccgccgccgcc gcggccccgccagtatttcttgaagagaacaatttcctggttatggctgaggaatcagaa aatgcctttttatttctcatctgcaaggtgatggagagatgggagatggtgaacaggcgt gtggagggcggaataatggtccccaaagatgtccacgccctcatcctcagagtccgtgaa cctgggaatgtgctgcctgacgtacaaaagggactccgcagataccctctttcccagctg cccgctcctcagcacccgaagattcgcctgtacaacgcagagcaggtcctgagttgggag ccagtggccctgagcaatagcacgaggcctgttgtctaccaagtgcagtttaaatacacc gacagtaaatggttcacggccgacatcatgtccataggggtgaattgtacacagatcaca gcaacagagtgtgacttcactgccgccagtccctcagcaggcttcccaatggatttcaat gtcactctacgccttcgagctgagctgggagcactccattctgcctgggtgacaatgcct tggtttcaacactatcggaatgcctccactgagcttcagcaagtcatcctgatctccgtg ggaacattttcgttgctgtcggtgctggcaggagcctgtttcttcctggtcctgaaatat agaggcctgattaaatactggtttcacactccaccaagcatcccattacagatagaagag tatttaaaagacccaactcagcccatcttagaggccttggacaaggacagctcaccaaag gatgacgtctgggactctgtgtccattatctcgtttccggaaaaggagcaagaagatgtt ctccaaacgctttga >gi568815577f:33303544_33536959|GENSCAN_predicted_peptide_4|191_aa MAGFLDNFRWPECECIDWSERRNAVASVVAGWWIMIDAAVVYPKPEQLNHAFHTCGVFST LAFFMINAVSNAQVRGDSYESGCLGRTGARVWLFIGFMLMFGSLIASMWILFGAYVTQNS QDFMLEAHLLGVAVEKVRNQYRCILHSSLATILLPNRRSSSVSDFLNGLRLCDLPKGLRL ICQSQDDDQVF >gi568815577f:33303544_33536959|GENSCAN_predicted_CDS_4|576_bp atggcaggcttcctagataattttcgttggccagaatgtgaatgtattgactggagtgag agaagaaatgctgtggcatctgttgtcgcaggctggtggataatgattgatgcagctgtg gtgtatcctaagccagaacagttgaaccatgcctttcacacatgtggtgtattttccaca ttggctttcttcatgataaatgctgtatccaatgctcaggtgagaggtgatagctatgaa agcggctgtttaggaagaacaggtgctcgagtttggcttttcattggtttcatgttgatg tttgggtcacttattgcttccatgtggattctttttggtgcatatgttacccaaaacagc caggattttatgctagaggcccatttactgggagttgcagtagaaaaagtgagaaaccag tacagatgtatcctccactcctcgctggccaccatcctgctgcccaacagaagaagctct tctgtctccgatttcctgaacggtctaaggttgtgtgacctgcccaaggggctccggctc atttgccaaagtcaagacgacgaccaggtcttctga >gi568815577f:33303544_33536959|GENSCAN_predicted_peptide_5|55_aa MMKKCDRPESSEPLKCQGIHSAGRRLLRHNPAGVPRLHLSPGSPELGADARIERR >gi568815577f:33303544_33536959|GENSCAN_predicted_CDS_5|168_bp atgatgaaaaagtgtgaccggcctgagagttcagagcctctgaagtgtcaagggatccac agtgcaggaaggagactgctgcgccacaaccctgccggcgtcccgcggctccacctcagc cccgggagcccggagctgggagcagacgcgaggatagagcgccggtga >gi568815577f:33303544_33536959|GENSCAN_predicted_peptide_6|177_aa MQINGISLQDYTAVKEKYAKYLPHSAGWYAAKCFRKAQCPIVEPLTNSMMMHGCNNSNKL MIMCIIKHAFDFIHLLTGENPLQVLVNAIINSGPWEDSTCIGRAGTVRQQAVDMSPLHCV NQVVWLLCTGTREAAFWNIKTIAECLADELINATKASSSSYAINKKDELECGDKSNR >gi568815577f:33303544_33536959|GENSCAN_predicted_CDS_6|534_bp atgcagatcaatggcatttccctgcaggattacactgcagtgaaggagaagtatgccaag tacctgcctcacagtgctgggtggtatgcagccaaatgcttccgcaaagctcagtgcccc attgtggagcccctcactaactccatgatgatgcacggctgcaacaacagcaacaagctc atgatcatgtgcatcatcaagcatgccttcgatttcatccacctgctcacaggcgagaac cctctccaggtcctggtgaacgccatcatcaacagtggtccctgggaggactccacatgc attgggcgagcagggactgtgagacaacaggctgtggacatgtccccactgcactgtgtg aatcaggtcgtctggctgctgtgcacaggcactcgtgaggctgccttctggaacatcaag accattgctgagtgcctggcggatgagctcatcaatgccaccaaggcctcctccagctcc tatgccatcaacaagaaggatgagctggagtgtggggacaagtccaaccgctga >gi568815577f:33303544_33536959|GENSCAN_predicted_peptide_7|381_aa MAQILRSHLIKATVIPNRVKMLPYFGIIRNRMMSTHKSKKKIREYYRLLNVEEGCSADEV RESFHKLAKQYHPDSGSNTADSATFIRIEKAYRKVLSHVIEQTNASQSKGEEEEDVEKFK YKTPQHRHYLSFEGIGFGTPTQREKHYRQFRADRAAEQVMEYQKQKLQSQYFPDSVIVKN IRQSKQQKITQAIERLVEDLIQESMAKGDFDNLSGKGKPLKKFSDCSYIDPMTHNLNRIL IDNGYQPEWILKQKEISDTIEQLREAILVSRKKLGNPMTPTEKKQWNHVCEQFQENIRKL NKRINDFNLIVPILTRQKVHFDAQKEIVRAQKIYETLIKTKEVTDRNPNNLDQGEGEKTP EIKKGFLNWMNLWKFIKIRSF >gi568815577f:33303544_33536959|GENSCAN_predicted_CDS_7|1146_bp atggctcagatcttaagatctcacctgataaaggctacagtgattcctaatcgagtgaaa atgcttccatattttggtatcattagaaatagaatgatgtcaacccataaatccaaaaag aagatcagagaatattatagactgctgaacgtggaggaaggatgctctgcagatgaagtc agggaatcttttcataagcttgccaagcaatatcatcctgacagtggctctaatactgct gattctgcaacatttataaggattgaaaaagcttatagaaaggtgctctcccatgtgata gaacaaacaaatgccagtcagagtaaaggtgaagaagaagaagatgtagaaaaattcaaa tataaaacaccccaacaccgacattatttaagttttgaaggtattggttttgggactcca actcaacgagagaagcattataggcaatttagggcagaccgtgctgctgaacaagtgatg gaatatcaaaagcagaaactacaaagccagtattttcctgatagtgtaattgttaaaaat ataagacagagcaaacagcaaaagataacgcaagctatagaacgtttagtggaggacctc attcaagaatccatggcaaaaggagactttgacaatctcagtgggaaaggaaaacctctg aaaaagttttctgactgttcttacattgatcccatgactcacaacctgaaccgaatactg atcgataatggataccaaccagaatggatccttaagcaaaaggaaataagcgatactatt gagcaactcagagaggcaattttagtgtctaggaaaaaacttgggaatccaatgacacca actgaaaagaaacagtggaaccatgtttgtgagcagtttcaagaaaacatcagaaaatta aacaagcgaattaatgattttaatttaattgttcccatcctgaccaggcaaaaagtccat tttgatgctcagaaagaaattgtcagagcccagaaaatatacgagacccttataaaaaca aaagaagtcacagatagaaacccaaataaccttgatcaaggagaaggagagaaaacacct gaaatcaagaaaggttttttaaactggatgaatctgtggaaatttattaaaatacgatca ttttga >gi568815577f:33303544_33536959|GENSCAN_predicted_peptide_8|813_aa XISISDHTALAQFCKEKKIEFVVVGPEAPLAAGIVGNLRSAGVQCFGPTAEAAQLESSKR FAKEFMDRHGIPTAQWKAFTKPEEACSFILSADFPALVVKASGLAAGKGVIVAKSKEEAC KAVQEIMQCLCFTDGKTVAPMPPAQDHKRLLEGDGGPNTGGMGAYCPAPQVSNDLLLKIK DTVLQRTVDGMQQEGTPYTENHTALTVVMASKGYPGDYTKGVEITGFPEAQALGLEVFHA GTALKNGKVVTHGGRVLAVTAIRENLISALEEAKKGLAAIKFEGAIYRKDVGFRAIAFLQ QPRSLTYKESGVDIAAGNMLVKKIQPLAKATSRSGCKVDLGGFAGLFDLKAAGFKDPLLA SGTDGVGTKLKIAQLCNKHDTIGQDLVAMCVNDILAQGAEPLFFLDYFSCGKLDLSVTEA VVAGIAKACGKAGCALLGGETAEMPDMYPPGEYDLAGFAVGAMERDQKLPHLERITEGDV VVGIASSGLHSNGFSLVRKIVAKSSLQYSSPAPDGCGDQTLGHVKAFAHITGGGLLENIP RVLPEKLGVDLDAQTWRIPRVFSWLQQEGHLSEEEMARTFNCGVGAVLVVSKEQTEQILR DIQQHKEEAWVIGSVVARAEGSNLQALIDSTREPNSSAQIDIVISNKAAVAGLDKAERAG IPTRVINHKLYKNRVEFDSAIDLVLEEFSIDIVCLAGFMRILSGPFVQKWNGKMLNIHPS LLPSFKGSNAHEQALETGVTVTGCTVHFVAEDVDAGQIILQEAVPVKRGDTVATLSERVK LAEHKIFPAALQLVASGTVQLGENGKICWVKEE >gi568815577f:33303544_33536959|GENSCAN_predicted_CDS_8|2442_bp nccatctcaatcagtgaccacactgcccttgctcaattctgcaaagagaagaaaattgaa tttgtagttgttggaccagaagcacctctggctgctgggattgttgggaacctgaggtct gcaggagtgcaatgctttggcccaacagcagaagcggctcagttagagtccagcaaaagg tttgccaaagagtttatggacagacatggaatcccaaccgcacaatggaaggctttcacc aaacctgaagaagcctgcagcttcattttgagtgcagacttccctgctttggttgtgaag gccagtggtcttgcagctggaaaaggggtgattgttgcaaagagcaaagaagaggcctgc aaagctgtacaagagatcatgcagtgtctgtgtttcactgatggcaagactgtggccccc atgcccccagcacaggaccataagcgattactggagggagatggtggccctaacacaggg ggaatgggagcctattgtccagcccctcaggtttctaatgatctattactaaaaattaaa gatactgttcttcagaggacagtggatggcatgcagcaagagggtactccatatacagaa aaccacaccgccctaactgttgtcatggcaagtaaaggttatcctggagactacaccaag ggtgtagagataacagggtttcctgaggctcaagctctaggactggaggtgttccatgca ggcactgccctcaaaaatggcaaagtagtaactcatgggggtagagttcttgcagtcaca gccatccgggaaaatctcatatcagcccttgaggaagccaagaaaggactagctgctata aagtttgagggagcaatttataggaaagacgtcggctttcgtgccatagctttcctccag cagcccaggagtttgacttacaaggaatctggagtagatatcgcagctggaaatatgctg gtcaagaaaattcagcctttagcaaaagccacttccagatcaggctgtaaagttgatctt ggaggttttgctggtctttttgatttaaaagcagctggtttcaaagatccccttctggcc tctggaacagatggcgttggaactaaactaaagattgcccagctatgcaataaacatgat accattggtcaagatttggtagcaatgtgtgttaatgatattctggcacaaggagcagag cccctcttcttccttgattacttttcctgtggaaaacttgacctcagtgtaactgaagct gttgttgctggaattgctaaagcttgtggaaaagctggatgtgctctccttggaggtgaa acagcagaaatgcctgacatgtatccccctggagagtatgacctagctgggtttgccgtt ggtgccatggagcgagatcagaaactccctcacctggaaagaatcactgagggtgatgtt gttgttggaatagcttcatctggtcttcatagcaatggatttagccttgtgaggaaaatc gtggcaaaatcttccctccagtactcctctccagcacctgatggttgtggtgaccagact ttaggacatgtcaaagcctttgcccatattactggtggaggattactagagaacatcccc agagtcctccctgagaaacttggggtagatttagatgcccagacctggaggatccccagg gtcttctcatggttgcagcaggaaggacacctctctgaggaagagatggccagaacattt aactgtggggttggcgctgtccttgtggtatcaaaggagcagacagagcagattctgagg gatatccagcagcacaaggaagaagcctgggtgattggcagtgtggttgcacgagctgaa ggatcgaacctgcaagcacttatagacagtactcgggaaccaaatagctctgcacaaatt gatattgttatctccaacaaagccgcagtagctgggttagataaagcggaaagagctggt attcccactagagtaattaatcataaactgtataaaaatcgtgtagaatttgacagtgca attgacctagtccttgaagagttctccatagacatagtctgtcttgcaggattcatgaga attctttctggcccctttgtccaaaagtggaatggaaaaatgctcaatatccacccatcc ttgctcccttcttttaagggttcaaatgcccatgagcaagccctggaaaccggagtcaca gttactgggtgcactgtacactttgtagctgaagatgtggatgctggacagattattttg caagaagctgttcccgtgaagaggggtgatactgtcgcaactctttctgaaagagtaaaa ttagcagaacataaaatatttcctgcagcccttcagctggtggccagtggaactgtacag cttggagaaaatggcaagatctgttgggttaaagaggaatga