GENSCAN 1.0	Date run:  4-Nov-116	Time: 02:45:13

Sequence gi568815585r:77798203_78018573 : 220371 bp : 37.25% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   3630   3723   94  2  1   26   90    78 0.036   0.42
 1.02 Term +  16502  16797  296  0  2   53   55   248 0.966  12.48
 1.03 PlyA +  17211  17216    6                               1.05

 2.05 PlyA -  17349  17344    6                               1.05
 2.04 Term -  22923  22587  337  2  1   67   49   161 0.125   3.16
 2.03 Intr -  28041  27919  123  2  0   25   56   110 0.064   0.28
 2.02 Intr -  29840  29675  166  0  1  115   50    64 0.646   3.40
 2.01 Init -  53828  53729  100  2  1   90  100    70 0.686   8.89
 2.00 Prom -  71844  71805   40                              -5.25

 3.03 PlyA -  73467  73462    6                               1.05
 3.02 Term -  74452  74369   84  1  0   86   48    82 0.022   0.67
 3.01 Init -  75074  75024   51  2  0   76   55    67 0.023   3.57
 3.00 Prom -  82078  82039   40                              -3.65

 4.11 PlyA -  83040  83035    6                               1.05
 4.10 Term -  84724  84659   66  1  0   79   39   109 0.431   2.06
 4.09 Intr - 105158 104954  205  1  1   73   99   175 0.993  15.38
 4.08 Intr - 105405 105293  113  0  2   71   75    70 0.920   2.26
 4.07 Intr - 109078 109043   36  1  0  111   86    23 0.686   1.94
 4.06 Intr - 113008 112952   57  1  0  109   72    35 0.753   2.16
 4.05 Intr - 120422 119889  534  2  0  106   98   496 0.858  44.59
 4.04 Intr - 123251 123208   44  0  2   47   69    29 0.009  -6.06
 4.03 Intr - 126259 126185   75  2  0  103  100    42 0.944   5.47
 4.02 Intr - 127691 127542  150  0  0   78   22   131 0.527   4.71
 4.01 Init - 128438 128303  136  2  1   53   57   113 0.936   5.05
 4.00 Prom - 129755 129716   40                              -5.45

 5.04 PlyA - 129916 129911    6                               1.05
 5.03 Term - 138990 138694  297  0  0  -12   53   245 0.534   5.28
 5.02 Intr - 139578 139544   35  1  2  103  100    41 0.839   3.82
 5.01 Init - 153748 153682   67  1  1   42  106    65 0.918   4.99
 5.00 Prom - 154375 154336   40                              -6.55

 6.00 Prom + 160075 160114   40                              -7.95
 6.01 Init + 161102 161363  262  1  1   70   71   136 0.615   7.37
 6.02 Intr + 163765 164012  248  2  2   43   41   164 0.323   3.46
 6.03 Term + 164829 166205 1377  2  0   20   54   374 0.720  17.51
 6.04 PlyA + 166573 166578    6                               1.05

 7.03 PlyA - 167661 167656    6                               1.05
 7.02 Term - 174841 174645  197  2  2   42   37   171 0.835   3.99
 7.01 Init - 178458 178353  106  0  1   69   90    83 0.897   7.03
 7.00 Prom - 181322 181283   40                              -6.55

 8.00 Prom + 186169 186208   40                              -6.75
 8.01 Init + 188741 188914  174  1  0   51   80   117 0.719   6.60
 8.02 Term + 190342 190545  204  0  0   65   53   137 0.944   4.29
 8.03 PlyA + 191142 191147    6                               1.05

 9.04 PlyA - 193322 193317    6                               1.05
 9.03 Term - 197802 197734   69  0  0  126   54    56 0.929   2.96
 9.02 Intr - 208883 208793   91  1  1   62   51    89 0.019   1.68
 9.01 Init - 219975 219968    8  0  2   88   89    11 0.402   0.97

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init - 123303 123208   96  0  0   67   69    48 0.898   1.16
S.002 Term - 126062 125950  113  0  2   52   48   107 0.806   0.84
S.003 Term - 152318 151765  554  0  2   28   42   182 0.845   1.09

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815585r:77798203_78018573|GENSCAN_predicted_peptide_1|129_aa
VSGTVLYTVNTTRSKGSKVLDLAELTSYWSTDSDSLDKSGKGKLKTFWKGFTILDVTKKI
HDSLEELKISTLKGIWKRFISTLLDDFEGFETSVEEVPADIVEIARELELEVEPEDVTEF
LQSHDKTNG

>gi568815585r:77798203_78018573|GENSCAN_predicted_CDS_1|390_bp
gtatcaggcactgttctgtacactgtaaataccactagaagcaaaggcagcaaagtgctt
gaccttgcggagcttacatcctactggagcacagacagtgattcccttgataagtctggg
aaaggaaaattgaaaaccttctggaaaggattcaccattctagatgtcactaagaaaatt
catgattcattggaggagctcaaaatatcaacattaaaaggaatttggaagaggtttatt
tcaaccctcctggatgactttgaggggtttgagacttcagtggaggaagtacctgcagat
attgtagaaatagcaagagaactagaattagaagtggagcctgaagatgtgactgaattc
ctgcaatctcatgataaaactaatggatga

>gi568815585r:77798203_78018573|GENSCAN_predicted_peptide_2|241_aa
MVLTSTWPPGSLRKLTIMAEGKGGAGKSHGENRSTSYASVPKISRAANVELYPARGPEHA
PSFSRWNILSHFPSSHQPLCLNDFNKSVRRTFSPWEQSDIHFPGLPIAEVEECDLGFANA
LKMETQERAPVAALSTKPHLVIIMSGIINIHLGTQRIFRDPFSLQVGNIIGDPGTLNDLN
YQIVEPLVAGGGLALKSWSPLGYKISLGAKDVLGPVSLFRTHKSGSQEEIEVTTPPPKLF
K

>gi568815585r:77798203_78018573|GENSCAN_predicted_CDS_2|726_bp
atggtgctgacatccacttggcctccagggagcctcaggaagcttacaatcatggcagaa
ggcaaaggaggagcaggcaagtcacatggtgaaaacagaagcacatcatatgcttctgta
cctaaaatctccagggctgcaaatgtggaactctacccagccagaggacctgaacatgcg
ccatcattctctcgctggaatatcctctcccacttcccttcttctcatcagccactgtgc
ctaaatgatttcaacaaatccgtcaggaggactttctccccgtgggagcagtcagatatt
cactttcctggccttcctatagctgaggtcgaggaatgtgacttgggctttgccaacgct
ctcaagatggagactcaggaaagagcaccagtggctgctttatctactaaacctcatctt
gttataatcatgagtggtataattaacattcatcttggaacacaaagaatatttagggat
ccttttagtctacaagtaggaaatataataggagatcctggtacgctaaatgacctaaat
tatcaaattgtggaacccctagtggcagggggtggccttgctttgaaatcctggtctcct
ctaggctataagatctcacttggagctaaggatgtactgggacctgtgtccctcttcagg
actcataaatcaggatctcaagaagaaatagaagtcaccacacctcctccaaaattattc
aaatga

>gi568815585r:77798203_78018573|GENSCAN_predicted_peptide_3|44_aa
MLPLAALVSRSAVAALEDDDDDDDMMAVMKTIIMITKYLIASFY

>gi568815585r:77798203_78018573|GENSCAN_predicted_CDS_3|135_bp
atgcttccactggcagctctggtcagcaggtcggcagtagcagcccttgaggatgatgat
gatgatgatgatatgatggcagtgatgaagacaattatcatgatcacaaaatatttaata
gccagtttttattga

>gi568815585r:77798203_78018573|GENSCAN_predicted_peptide_4|471_aa
MWESLELPRDLSNGFDKNADSDMNNKVQAEVVSDGDEELVGNWSKGYSLASGCFRGPALI
VCVFSRRTVQDVGKSTTLGSGGRWPSSDSSSRRCPRYLAPVTLAMSPSNASFALSAALIR
KNNTELEDKHQRINEALKLRSGHRTPSGAGSSMQPPPSLCGRALVALVLACGLSRIWGEE
RGFPPDRATPLLQTAEIMTPPTKTLWPKGSNASLARSLAPAEVPKGDRTAGSPPRTISPP
PCQGPIEIKETFKYINTVVSCLVFVLGIIGNSTLLRIIYKNKCMRNGPNILIASLALGDL
LHIVIDIPINVYKISPVILEDPAQISELIIGQALKRLDNGHPPRLLAEDWPFGAEMCKLV
PFIQKASVGITVLSLCALSIDRYRAVASWSRIKGIGVPKWTAVEIVLIWVVSVVLAVPEA
IGFDIITMDYKGSYLRICLLHPVQKTAFMQPILYEDDENEDLYDDPLPLIE

>gi568815585r:77798203_78018573|GENSCAN_predicted_CDS_4|1416_bp
atgtgggaaagtttggaacttcctagagacttgtcgaatggctttgacaaaaatgctgat
agtgatatgaataataaggtccaggctgaggtggtctcagatggagatgaggaacttgtt
gggaactggagcaaagggtacagcctcgcctctggctgctttcgtgggccggcattgatt
gtctgcgtcttttccaggcgcacggtgcaagatgtcggtaaatctaccactctggggtct
ggaggacggtggccctcttctgacagctccagtaggcggtgccccaggtacttggcacct
gttaccttggccatgtccccctcaaatgccagctttgcattatctgctgccctcataaga
aaaaataatacagagctggaagacaaacatcagagaatcaatgaggctctgaaactgcgg
agcggccaccggacgccttctggagcaggtagcagcatgcagccgcctccaagtctgtgc
ggacgcgccctggttgcgctggttcttgcctgcggcctgtcgcggatctggggagaggag
agaggcttcccgcctgacagggccactccgcttttgcaaaccgcagagataatgacgcca
cccactaagaccttatggcccaagggttccaacgccagtctggcgcggtcgttggcacct
gcggaggtgcctaaaggagacaggacggcaggatctccgccacgcaccatctcccctccc
ccgtgccaaggacccatcgagatcaaggagactttcaaatacatcaacacggttgtgtcc
tgccttgtgttcgtgctggggatcatcgggaactccacacttctgagaattatctacaag
aacaagtgcatgcgaaacggtcccaatatcttgatcgccagcttggctctgggagacctg
ctgcacatcgtcattgacatccctatcaatgtctacaagatatcaccagtgatcttagaa
gatccagctcagataagtgaactaataataggccaggccctcaagagattggataatggc
cacccacctaggctgctggcagaggactggccatttggagctgagatgtgtaagctggtg
cctttcatacagaaagcctctgtgggaatcactgtgctgagtctatgtgctctgagtatt
gacagatatcgagctgttgcttcttggagtagaattaaaggaattggggttccaaaatgg
acagcagtagaaattgttttgatttgggtggtctctgtggttctggctgtccctgaagcc
ataggttttgatataattacgatggactacaaaggaagttatctgcgaatctgcttgctt
catcccgttcagaagacagctttcatgcagcctattctatatgaagatgatgagaatgaa
gatctttatgatgatccacttccacttattgaatag

>gi568815585r:77798203_78018573|GENSCAN_predicted_peptide_5|132_aa
MDDTEETPNPKEIDCSTDWTTLDPSDLSPHRQAKLKETTQGKDENPAQFRAHLAATLRRF
TALDPERPEGRLILNMHFITQSTPDIRKKLQKLESGPQTPQQELINLAFKVYNNRKQPNG
NAFLSYNYLPLL

>gi568815585r:77798203_78018573|GENSCAN_predicted_CDS_5|399_bp
atggatgatactgaagagaccccgaacccaaaggaaatagactgcagcactgattggacg
actttggacccatctgacctctcccctcatcgccaggctaagcttaaagaaactacccaa
ggtaaagacgaaaacccagcccagttcagggcccacttagcagcaacccttagacgcttt
accgccctagacccagaaaggccagaaggccgccttattcttaatatgcattttatcacc
caatccactcctgacattaggaaaaaacttcaaaaattagaatctggccctcaaacccca
caacaggaattaatcaacctcgccttcaaggtgtacaataataggaagcagccaaacggc
aatgcatttctgagttacaattacttgcctctgctgtga

>gi568815585r:77798203_78018573|GENSCAN_predicted_peptide_6|628_aa
MTPHMARFPSETKLPEERSGSNICCSAIFAVLQPPLLIPRQTGSGVDFQQTPTDLQLRVL
TVRRKTNKQKGHPHQNPICTSPSSKTKEIQTTIREYYKHLYANKLENLEEMDKFLDTYTL
LRLNQEEVKSLNRPITGSEIEAIINSLPTKKRPGPNGFTAEFYQRYKEELENKIPRNPTY
KGCEGPLQGELQTTAQRNKRGHKQMEEHSMLMDKNNQYRENVHTAQVIYRVNAIPINLPM
TFFTELEKTTLKFIWNQKRAHIAKSVLSQKNKAGGIMLPDFKVYYKATVTKTAWYWYQNR
DIDQWNRTEPSEIIPYIYNNLIFDKPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLT
PYTKINSRWIKDLNVRPKTIKTLEEILGNTIQDGGMGKDFMSKTPKAMATKAKVDKWDLI
KLKSFCTAKETTIRVNRQPTEWEKIFATYSSDKGLISRIHNELKQIYKKKTNNPIKNWVK
DMNRHFSEEDIYAANRHMKKCASSLAIREMQIKTTMRYHLTPVTMAIIKKSGNNRCWRGC
GEIGTLLHCWWGCKLVQPLWKTVWPLLKDLELEIPFDPAIPLLGIYPKDYKSCCYKDTCY
KDTYVYCGTIQNIKHLEPTQMSINDRLD

>gi568815585r:77798203_78018573|GENSCAN_predicted_CDS_6|1887_bp
atgacacctcacatggccaggttcccctctgagacgaagcttccagaggaacgatcaggc
agcaacatttgctgttcagcaatattcgctgttctgcagccaccactgctgatacccagg
caaacagggtcgggagtggacttccagcaaactccaacagacctgcagctgagggtcctg
actgttagaaggaagactaacaaacagaaaggacatccacaccaaaaccccatctgtacg
tcaccatcatcaaagaccaaagaaatacaaactaccatcagagaatactataaacacctc
tacgcaaataaactagaaaatctagaagaaatggataaattcctggacacgtacaccctc
ctaagactaaaccaggaagaagttaaatccctgaatagaccaataacaggctctgaaatt
gaggcaataattaatagcctaccaaccaaaaaacgtccaggaccaaatggattcacagcc
gaattctaccagaggtacaaggaggagctggagaataaaatacctaggaatccaacttac
aagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaacgaaataaaaga
ggacacaaacaaatggaagaacattccatgctcatggataagaataatcaatatcgtgaa
aatgtccatactgcccaagtaatttatagagtcaatgccatccccatcaacctaccaatg
actttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagct
cacattgccaagtcagtcctaagccaaaagaacaaagctggaggcattatgctacctgac
ttcaaagtatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacaga
gatatagaccagtggaacagaacagagccctcagaaataataccatatatctacaacaat
ctgatctttgataaacctgagaaaaacaagcaatggggaaaggattccctatttaataaa
tggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttaca
ccttatacaaaaattaattcaagatggattaaagacttaaatgttagacctaaaaccata
aaaaccctagaagaaatcctaggcaataccattcaggacggaggcatgggcaaggatttc
atgtctaaaacaccaaaagcaatggcaacaaaagccaaagttgacaaatgggatctaatt
aaactaaagagcttctgcacagcaaaagaaactaccatcagagtaaacaggcaacctaca
gaatgggagaaaatttttgcaacctactcatctgacaaagggctaatatccagaatccac
aatgaactcaaacaaatttacaagaaaaaaacaaacaaccccatcaaaaactgggtgaag
gatatgaacagacacttctcagaagaagacatttatgcagccaacagacacatgaaaaaa
tgtgcctcatcactggccatcagagaaatgcaaatcaaaaccacaatgagataccatctc
acaccagttacaatggcaatcattaaaaagtcaggaaacaacaggtgctggagaggatgt
ggagaaataggaacacttttacattgttggtggggctgtaaactagttcaaccattgtgg
aagacagtgtggccactcctcaaggatctagaactagaaataccatttgatccagccatc
ccattactgggtatatacccaaaggattataaatcatgctgctataaagacacatgctat
aaagacacatatgtttattgtggtactattcagaatatcaaacacttggaaccaacccaa
atgtccatcaatgatagactggattga

>gi568815585r:77798203_78018573|GENSCAN_predicted_peptide_7|100_aa
MGEEVQQEEVPEKRFSMACFRICKEAGIGGHEWGQEFPVNQHQPEDHVLIKRWKEEKLEP
AWEGPYFVLLTTKAAVHTAKKDGLITPESRKRHPLQSRGP

>gi568815585r:77798203_78018573|GENSCAN_predicted_CDS_7|303_bp
atgggggaagaagttcagcaagaagaagtccctgagaaaaggttcagcatggcatgtttc
aggatctgcaaagaggctggtataggtggacatgagtgggggcaggaatttccagtaaac
cagcaccagcctgaagatcacgttctcatcaaaaggtggaaagaagaaaaactcgagcca
gcctgggaaggaccctactttgtgctgctaaccaccaaggctgctgttcatacagcaaaa
aaggatggactcatcacacctgagtcaagaaagcgccaccccctccagagtcgtgggcca
tag

>gi568815585r:77798203_78018573|GENSCAN_predicted_peptide_8|125_aa
MLQGTNPQPRVFIQGGLDPIRINPSVVARNFLPPTASGHLLNPFLEGFHSPFGAFPGQCT
ATPAKAGQGDDECQLDSSPLPRLNDKALPPGMDHVCSKPQWTVSSSVIAATPSKLPRSLR
KLRRT

>gi568815585r:77798203_78018573|GENSCAN_predicted_CDS_8|378_bp
atgctccagggcacaaaccctcagcctcgggtattcatccagggaggtttggatccgatt
cgcataaatcccagtgtggtagcaaggaatttcctgcctcctactgcttcaggacacctg
ctcaacccatttttagaaggctttcacagcccttttggggcattcccaggacagtgtact
gcaacccctgccaaggcaggacagggggacgatgaatgccagctggattccagtccactc
cccagactgaatgataaggctctaccaccgggaatggatcatgtctgttcaaagccacaa
tggacagtgtcctcttctgtcattgcagctacaccatccaaactccccagatccctaagg
aaactcagaagaacatga

>gi568815585r:77798203_78018573|GENSCAN_predicted_peptide_9|55_aa
MIKNIMRWLVFNLTVLRDAQIAGKTAFPAVTVRAPVCDVPVPASKGSYCSTTTYE

>gi568815585r:77798203_78018573|GENSCAN_predicted_CDS_9|168_bp
atgatcaaaaacataatgcgatggctagttttcaacttgactgtgctacgagatgcccag
atagctggcaaaacagcatttccagctgtaactgtgagggccccagtgtgtgatgttccc
gtccctgcgtccaagggttcttattgttcaactaccacttatgagtga