GENSCAN 1.0	Date run: 5-Nov-116	Time: 12:09:29

Sequence gi568815584f:77718979_78033388 : 314410 bp : 45.40% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.14 Intr -   1850   1733  118  1  1   55   82   154 0.225  11.97
 1.13 Intr -   4326   4203  124  1  1   13   92    69 0.482  -0.56
 1.12 Intr -  12151  12010  142  1  1   42   86    61 0.767   1.23
 1.11 Intr -  13623  13507  117  0  0  101   81   133 0.987  14.56
 1.10 Intr -  16034  15969   66  2  0  122   86    15 0.933   3.80
 1.09 Intr -  17028  16959   70  2  1   91   61    10 0.994  -2.22
 1.08 Intr -  18097  17993  105  0  0   91  108    61 0.989   7.73
 1.07 Intr -  19906  19800  107  1  2   91   76    69 0.998   5.01
 1.06 Intr -  20083  19988   96  1  0   65   78   100 0.932   6.91
 1.05 Intr -  32502  32341  162  0  0   53   84   203 0.977  16.57
 1.04 Intr -  36142  35989  154  0  1   93   73    21 0.669   1.17
 1.03 Intr -  42305  42136  170  2  2   21   94    77 0.002   0.34
 1.02 Intr -  69784  69594  191  2  2   92   72    82 0.338   6.30
 1.01 Init -  81355  81259   97  1  1   72   50    88 0.135   1.88
 1.00 Prom -  86617  86578   40                              -3.86

 2.00 Prom +  97270  97309   40                              -7.26
 2.01 Init + 100001 100135  135  1  0  113   90   171 0.769  17.97
 2.02 Intr + 103457 103540   84  1  0   83  115   109 0.999  13.12
 2.03 Intr + 106898 107050  153  1  0  110  103    38 0.926   7.77
 2.04 Intr + 112135 112249  115  0  1   31   72    45 0.272  -2.78
 2.05 Intr + 115695 115819  125  1  2   40  110    93 0.477   7.00
 2.06 Intr + 140098 140301  204  0  0  113   65   298 0.726  29.40
 2.07 Intr + 141449 141494   46  1  1   84  101    52 0.764   4.08
 2.08 Intr + 145101 145240  140  1  2  109   42    17 0.351  -0.62
 2.09 Intr + 148746 148827   82  2  1  108  106    68 0.717   9.81
 2.10 Intr + 158368 158459   92  2  2   44   84    22 0.025  -2.89
 2.11 Intr + 168113 168271  159  1  0   95   78   148 0.568  14.68
 2.12 Intr + 180122 180280  159  1  0   75  100   191 0.725  19.18
 2.13 Intr + 188825 188941  117  1  0  115   78   192 0.593  21.56
 2.14 Intr + 205479 205628  150  2  0  106   75   179 0.724  18.76
 2.15 Intr + 206786 206983  198  1  0   71   75   234 0.999  19.95
 2.16 Intr + 212540 212733  194  1  2   78   61   311 0.856  25.69
 2.17 Intr + 214242 214371  130  0  1  115  -11   145 0.786   8.00
 2.18 Intr + 214484 214666  183  1  0   89   84    50 0.712   4.68
 2.19 Intr + 217717 217772   56  0  2  110   92     3 0.142   0.68
 2.20 Term + 220955 221009   55  2  1  112   48    46 0.080   0.03
 2.21 PlyA + 223045 223050    6                               1.05

 3.06 PlyA - 223057 223052    6                               1.05
 3.05 Term - 226222 226118  105  1  0   96   42    54 0.282   0.01
 3.04 Intr - 237872 237742  131  0  2   39  115    77 0.770   5.91
 3.03 Intr - 249010 248801  210  2  0   94  103    66 0.950   7.58
 3.02 Intr - 250555 250467   89  0  2   81   99    42 0.935   4.21
 3.01 Init - 259862 259762  101  2  2   78   77    75 0.545   5.19
 3.00 Prom - 290811 290772   40                              -3.66

 4.03 PlyA - 290954 290949    6                               1.05
 4.02 Term - 295398 294885  514  2  1   43   36   244 0.583   8.52
 4.01 Intr - 310522 310379  144  0  0   59   80   126 0.841   8.30

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init - 163333 163240   94  1  1   83   93    69 0.812   7.58

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815584f:77718979_78033388|GENSCAN_predicted_peptide_1|573_aa
MGLSLPAPRAAAQPEQRPTPGVSMPVPSRSRRGDLWQCLETFVIVMTWEMLLASSGQRSS
VLLDILQCTELAFTTKNCPVPNVNSITVEKPWSIYQNTNGTPVQRTTNPRRPFRPERLRI
AHSFLSAPHHLERPAQRCCRSRWKKRKKMALTSFLPAPTQLSQDQLEAEEKARSQRSRQT
SLVSSRREPPPYGYRKGWIPRLLEDFGDGGAFPEIHVAQYPLDMGRKKKMSNALAIQVDS
EGKIKYDAIARQGQSKDKVIYSKYTDLVPKEVMNADDPDLQRPDEEAIKEITEKTRVALE
KSVSQKVAAAMPVRAADKLAPAQYIRYTPSQQGVAFNSGAKQRVIRMVEMQKDPMEPPRF
KINKKIPRGPPSPPAPVMHSPSRKMTVKEQQEWKIPPCISNWKNAKGYTIPLDKRLAADG
RGLQTVHINENFAKLAEALYIADRKAREAVEMRAQVERKMAQKEKEKHEEKLREMAQKAR
ERRAGIKTHVEKVKYFSTVHTEDGEARERDEIRHDRRKERQHDRNLSRAAPDKRSKLQRN
ENRDISEVIALGVPNPRTSNEVQYDQRLFNQSK

>gi568815584f:77718979_78033388|GENSCAN_predicted_CDS_1|1719_bp
atggggctgtccctcccggcgcccagggcggctgcacaaccggagcagaggccgactccc
ggcgtgtccatgccggttccctcgcgctccaggcgcggggatctttggcaatgtctggag
acatttgtgattgtcatgacttgggagatgctactggcatctagtgggcagaggtcaagt
gtgttgttggacatcttacaatgcacagaactggccttcacaacaaagaactgtccagtt
ccaaatgtcaacagtatcactgtggagaaaccctggtcaatctatcagaatacaaacggt
actcctgttcaaagaactacaaatcccagaaggccttttcggccagagcgcctgcgcatc
gcgcactccttcctttccgctcctcatcatctggaaagacccgcccagcggtgctgtcgc
tcgcgctggaagaagcggaagaagatggcgctcaccagctttttacctgcacctactcag
ctatctcaggaccagcttgaggctgaagaaaaggcaagatcccagagatcacggcagacc
tcactggtctcctcccgaagagaacctcccccgtacggataccggaaaggctggatacct
cggttattagaggattttggagatggaggtgcttttccagagatccatgtggcccagtat
ccactggatatgggacgaaagaaaaaaatgtcgaatgcgctggccattcaggtggattct
gaaggaaaaattaaatatgatgcaattgctcgacaaggacagtcaaaagacaaggtcatt
tatagcaaatacactgacctggttccaaaggaggttatgaatgcagatgatccagacctg
caaaggcccgatgaagaagctattaaagagataacagaaaagacaagagtagccttagaa
aaatctgtatcacagaaggtcgccgcagccatgccagttcgagcagctgacaaattggct
cctgctcagtatatccgatacacaccatctcagcaaggagtggcattcaactctggagct
aaacagagggttattcggatggtagaaatgcagaaagatccaatggagcctccaaggttc
aagattaataagaaaattccccggggaccaccttctcctcctgcgcctgtcatgcattct
cctagccgaaagatgactgtaaaggaacaacaagagtggaagattcctccttgtatttct
aactggaaaaatgcaaagggttatacaattccattagacaaacgtctggctgctgatgga
agaggactacagacagtacacataaatgaaaatttcgccaaattggcagaagccctctac
attgctgatcggaaggctcgtgaagctgtggaaatgcgtgcccaagtagagagaaaaatg
gctcagaaagaaaaggaaaaacatgaagagaaacttagagaaatggcccagaaagccagg
gagagaagagctgggatcaaaactcatgtggaaaaagtgaagtacttttcaactgttcac
acagaggatggggaggcacgtgagagggatgaaatccggcatgacaggcgaaaagagaga
cagcatgaccggaatctttccagggcagctcctgataagaggtcgaaacttcagagaaat
gaaaatcgggatatcagtgaagttattgctctcggtgttcctaatcctcggacttccaat
gaagttcagtatgaccaaaggctcttcaaccaatccaag

>gi568815584f:77718979_78033388|GENSCAN_predicted_peptide_2|858_aa
MARKALKLASWTSMALAASGIYFYSNKYLDPNDFGAVRVGRAVATTAVISYDYLTSLKSV
PYGSEEYLQLRSKLTGAMGLGIHWCNILRRACRLGHRGKASALRNLQRRQAMGVKKTVAC
HVSHADMAGQASQLGDCLLEEEGAENQCFADKQIPNREERGRAHSSSRVHVCSSLTYVGC
LDTWVGFLLLQSCPPVAPTTWFYRVHLRSARRLCELCCANRGTFIKVGQHLGALDYLLPE
EYTSTLKVLHSQAPQSSMQEIRQVIREDLGKEGAGTDYWSLNVELAPGELILQLPIVRRD
KRMLQVLCVLDAPSDWSFNENEGERRCRGTVVQRACTLSVPCSRPVSCTLNPGVTMFYVI
LANGAAVSEECCALWMQEQLRVVGKGLWGFVRIHDLFQSFDDTPLGTASLAQVHKAVLHD
GRTVAVKVQHPKVRAQSSKDILLMEVLVLAVKQLFPEFEFMWLVDEAKKNLPLELDFLNE
GRNAEKVSQMLRHFDFLKVPRIHWDLSTERVLLMEFVDGGQVNDRDYMERNKIDVNEISR
HLGKMYSEMIFVNGFVHCDPHPGNVLVRKHPGTGKAEIVLLDHGLYQMLTEEFRLNYCHL
WQSLIWTDMKRVKEYSQRLGAGDLYPLFACMLTARSWDSVNRGISQAPVTATEDLEIRNN
AANYLPQISHLLNHVPRQMLLILKTNDLLRGIEAALGTRASASSFLNMSRCCIRALAEHK
KKNTCSFFRRTQISFSEAFNLWQINLHELILRVKGLKLADRLLHFCHIVARSPRVTVHVT
ILLLLWNPLRTLWPLSQGPQAELWHSSLFFSKKTQQPTFPFLVEGRGILTLAHLVQSLSD
MCLPEKEEMQLMNPTCGH

>gi568815584f:77718979_78033388|GENSCAN_predicted_CDS_2|2577_bp
atggccagaaaggctctcaagcttgcttcgtggaccagcatggctcttgctgcctctggc
atctacttctacagtaacaagtacttggaccctaatgactttggcgctgtcagggtgggc
agagcagttgctacgacggctgtcatcagttacgactacctcacttccctgaagagtgtc
ccttatggctcagaggagtacttgcagctgagatctaagctaaccggtgctatgggcctt
ggcatccactggtgtaacattctacggagagcttgcaggctcggacacaggggcaaggcc
tctgctttacggaatctgcagaggaggcaggcgatgggtgtcaaaaagacagttgcctgt
cacgtgtcacatgcagacatggctggacaagcctctcagttgggggactgcctcttagag
gaggaaggagctgaaaatcaatgctttgcagataagcagatcccaaacagagaggagaga
ggaagggcacacagttcctctcgtgtgcatgtctgcagctccctcacctatgtgggctgc
ctggacacatgggtgggcttcctgctacttcagagctgccctccagttgcccctaccacc
tggttctacagggtgcaccttcgctctgccaggcgtctctgtgagctctgctgtgccaac
cggggcaccttcatcaaggtgggccagcacctgggggctctggactacctgttgccagag
gagtacaccagcacgctgaaggtactgcacagccaggctccacagagcagcatgcaagag
atccgccaggtcatccgagaagatctgggcaaggagggagctggcactgactattggtcc
cttaatgtggagctggccccaggggaacttattttacagctccccatagtacgtagggac
aagcgtatgctgcaggtgctctgtgttcttgatgcaccatctgactggagtttcaatgag
aatgaaggagagagaaggtgcaggggtaccgttgtgcaacgggcctgtaccctgtctgtg
ccctgctcccggcctgtgtcctgcaccttgaatcctggtgtcaccatgttctatgtcatc
ctggcaaatggagctgctgtaagtgaagaatgctgtgctttgtggatgcaggaacaactg
agggttgtcggaaaaggcctttggggttttgtgaggatccatgatttgttccagagcttc
gatgacacccctctggggacggcctccctggcccaggtccacaaggcagtgctgcatgat
gggcggacggtggccgtgaaggtccagcacccaaaggtgcgggctcagagctcgaaggac
attctcctgatggaggtgctcgttctggctgtgaagcagctgttcccagagtttgagttt
atgtggcttgtggatgaagccaagaagaacctgcctttggagctggatttcctcaatgaa
gggaggaatgctgagaaggtgtcccagatgctcaggcattttgacttcttgaaggtcccc
cgaatccactgggacctgtccacggagcgggtcctcctgatggagtttgtggatggcggg
caggtcaatgacagagactacatggagaggaacaagatcgacgtcaatgagatctcacgc
cacctgggcaagatgtatagtgagatgatcttcgtcaatggcttcgtgcactgcgatccc
caccccggcaatgtactggtgcggaagcaccccggcacgggaaaggcggagattgtcctg
ttggaccatgggctttaccagatgctcacggaagaattccgcctgaattactgccacctc
tggcagtctctgatctggactgacatgaagagagtgaaggagtacagccagcgactggga
gccggggatctctaccccttgtttgcctgcatgctgacggcgcgatcgtgggactcggtc
aacagaggcatcagccaagctcccgtcactgccactgaggacttagagattcgcaacaac
gcggccaactacctcccccagatcagccatctcctcaaccacgtgccgcgccagatgctg
ctcatcttgaagaccaacgacctgctgcgtggcattgaggccgccctgggcacccgcgcc
agcgccagctcctttctcaacatgtcacgttgctgcatcagagcgctagctgagcacaag
aagaagaatacctgttcattcttcagaaggacccagatctctttcagcgaggccttcaac
ttatggcagatcaacctccatgagctcatcctgcgtgtgaaggggttgaagctggctgac
cggctgctccatttttgccacatcgtggcccgcagccccagagtcactgtccatgtcacc
atccttctcctcctttggaatcctctccgcacactgtggcccttgtctcagggcccacaa
gctgaactgtggcatagctctctcttcttctccaagaagactcagcagcctacattccca
ttcctggtggagggaagaggaattctgaccctggcccatttggtccagagcctctcagat
atgtgtcttccagagaaggaagaaatgcagctcatgaaccccacctgtgggcattag

>gi568815584f:77718979_78033388|GENSCAN_predicted_peptide_3|211_aa
MTTNFGVLDHLVASGMSLGPGNSGKKEKQEDAERISGLKRQTRNAFIITSTVTSAELRPL
LIPGLWEAFCEVFCEGPSTCHGFSKSLSRICDVSHPVLGAVGLCDGDGSGDSKQSTGCGP
CHQGSSNSDGLVTSWFRNSATVIAVRGSRNRVTFPEEVKQRGMRNITQASTVLEVYKTYE
DFPLDFSRLSQFPDGSRALKDNSCNAQGSFS

>gi568815584f:77718979_78033388|GENSCAN_predicted_CDS_3|636_bp
atgaccacaaattttggagtcctagaccatctggtagcatcaggaatgtccctagggcct
ggaaatagtggcaagaaggagaaacaggaagatgcagaaaggatttcagggctgaaaaga
caaacaaggaatgcgttcataatcaccagcacagtcacctctgccgagctgcgaccactg
ctaattccagggttatgggaagccttttgtgaagtcttctgtgaagggccatcgacatgt
catggtttcagcaagagtttgtccaggatttgtgatgtgtctcatccagtgctaggtgct
gtgggcctgtgtgatggtgatggcagtggggacagcaaacagagcacaggttgtggtcct
tgtcatcaaggatcttccaattcagatggtttggtcacatcatggttcaggaacagtgct
actgtgattgctgtgcgcggaagtcgtaatagagtcaccttccctgaagaggttaagcag
agaggaatgaggaacatcacccaggcaagcacagtgttagaggtttacaagacatatgaa
gattttcctctagacttctccaggctctcacagttcccagatggatccagggcccttaaa
gataatagctgcaatgctcagggctctttcagttaa

>gi568815584f:77718979_78033388|GENSCAN_predicted_peptide_4|219_aa
XTQQKQAIETRKNSSHPYTPEEAIKPSYEPLIPAESSLYLEEKNVLILREVWRERREREP
GLCAALVGQLEFRVGLGLAGPTLRAAGQPCWPQAMRDLAPRPVAAEGVLGPPSSASPPAL
RSISHRALAAFPRGRAPDLQPAMPEPPTYSMGSGAAQASPTSSTPCSTAPSPIDHPRAEE
CERTARDWQAAPPAALLQDPLGEASWDPESGGEVDSLYV

>gi568815584f:77718979_78033388|GENSCAN_predicted_CDS_4|660_bp
nngacacagcagaagcaggccatcgaaactagaaagaattcttctcatccctacacccca
gaagaggccataaaacccagctatgagccactcattccagcagagtcctccctgtacctg
gaagaaaagaatgttcttatcctcagagaggtgtggagggagaggcgcgagcgggaaccg
gggctgtgtgcggcgcttgtgggccagctggagtttcgagtgggcttgggcttggcgggc
cccacactcagagcagccggccagccctgctggccccaggcaatgagggacttagcaccc
aggccagtggctgcggagggtgtactgggtccccccagcagtgccagcccaccggcgctg
cgctcgatttctcaccgagccttagctgccttcccgcggggcagggctccagacctgcag
cctgccatgcctgagcctcccacctactccatgggctccggtgcggcccaagcctccccg
acgagcagcaccccctgctccacagcgcccagtcccatcgaccacccaagggctgaggag
tgcgagcgcacggcgcgggactggcaggcagctccacctgcagccctgttgcaggatcca
ctaggtgaagccagctgggatcctgagtctggtggggaggtggatagtctttatgtctag