GENSCAN 1.0 Date run: 3-Nov-116 Time: 08:37:05

Sequence gi568815592r:36395787_36640202 : 244416 bp : 43.66% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.02 Intr -   4798   4668  131  2  2  100   78   101 0.140  10.71
 1.01 Init -  30179  30128   52  2  1   69  110    18 0.247   3.42
 1.00 Prom -  32833  32794   40                              -3.16

 2.00 Prom +  35391  35430   40                              -3.56
 2.01 Init +  46809  46924  116  2  2   62  102    78 0.398   6.28
 2.02 Intr +  53077  53228  152  1  2   66   73    60 0.211   2.11
 2.03 Intr +  79003  79276  274  2  1   78   88   155 0.655  11.10
 2.04 Intr +  83335  83437  103  1  1   44   70   162 0.915  10.18
 2.05 Intr +  83805  83925  121  2  1   94   80    64 0.999   6.27
 2.06 Intr +  85776  85973  198  1  0  111   19   269 0.988  21.62
 2.07 Intr +  88928  89038  111  0  0  111   93   103 0.999  13.45
 2.08 Term +  91097  91389  293  0  2   88   49   165 0.971   8.11
 2.09 PlyA +  95337  95342    6                               1.05

 3.19 PlyA -  98124  98119    6                               1.05
 3.18 Term - 100128  99998  131  1  2   79   38   146 0.954   7.04
 3.17 Intr - 101019 100925   95  2  2   51   92    89 0.809   5.21
 3.16 Intr - 102700 102577  124  2  1   75   52    39 0.746  -1.36
 3.15 Intr - 104204 104087  118  2  1   87  123   107 0.896  14.14
 3.14 Intr - 105514 105461   54  1  0   91   80    22 0.555   0.88
 3.13 Intr - 110858 110797   62  0  2   88   94    29 0.634   1.95
 3.12 Intr - 111816 111714  103  0  1  110   89   -23 0.230  -0.25
 3.11 Intr - 113277 113158  120  0  0   97   50    45 0.305   2.29
 3.10 Intr - 119706 119552  155  1  2   89   67    99 0.122   7.69
 3.09 Intr - 122054 121931  124  2  1   40   76    31 0.420  -2.74
 3.08 Intr - 126031 125948   84  1  0   79  103    30 0.595   3.62
 3.07 Intr - 128677 128555  123  1  0   48   83    68 0.838   3.08
 3.06 Intr - 129856 129805   52  0  1   99   76    79 0.975   6.71
 3.05 Intr - 144421 144286  136  2  1  117   88    39 0.849   6.43
 3.04 Intr - 150931 150838   94  1  1   34   99    55 0.628   0.74
 3.03 Intr - 151597 151404  194  2  2   96  105    47 0.741   6.41
 3.02 Intr - 153619 153508  112  1  1  -61   -7   276 0.298   3.15
 3.01 Init - 162437 162423   15  2  0   75   99    16 0.220   1.45
 3.00 Prom - 179059 179020   40                              -4.56

 4.00 Prom + 198542 198581   40                              -2.66
 4.01 Init + 200977 201182  206  0  2   62   55   168 0.947   9.22
 4.02 Intr + 203063 203197  135  2  0   94   80    61 0.946   5.58
 4.03 Intr + 204217 204437  221  1  2   55   23   138 0.888   1.95
 4.04 Intr + 205366 205404   39  2  0  127   99    12 0.889   4.40
 4.05 Intr + 217942 218057  116  2  2   66   61   133 0.654   8.57
 4.06 Intr + 222671 222869  199  1  1   55   45    64 0.314  -2.28
 4.07 Term + 226525 226730  206  2  2   37   47   215 0.706   9.83
 4.08 PlyA + 226821 226826    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592r:36395787_36640202|GENSCAN_predicted_peptide_1|61_aa
MTQLVSSQPVPAMSRNPDHNLLSQPKEHSIVQKHHQEEIIHKLAMQLRHIGDNIDHRMVR
E

>gi568815592r:36395787_36640202|GENSCAN_predicted_CDS_1|183_bp
atgacccaacttgtcagctcccagccagttccagccatgtcaaggaacccagatcataat
ctactttctcagcccaaggagcatagcattgttcagaagcatcaccaggaggaaataatt
cacaagttggccatgcagctgagacacattggggacaacattgatcataggatggttcga
gag

>gi568815592r:36395787_36640202|GENSCAN_predicted_peptide_2|455_aa
MYIQGSDFRANAFKQLQYSYYATSNLAPRMGRSGFHIFGGFVVLLTSGVKPQTFSVSVTA
LKGGVSGVVCSSRWVRGLADFRSEAADLCNLSLDYASQPANLQFPHIMPLAEDIKGSCFQ
SGNKRNHEPFIAPERFGNSSVGFGSNSHSQAPEKVTLLVDGTRFVVNPQIFTAHPDTMLG
RMFGPGREYNFTRPNEKGEYEIAEGISATVFRTVLDYYKTGIINCPDGISIPDLRDTCDY
LCINFDFNTIRCQDLSALLHELSNDGAHKQFDHYLEELILPIMVGCAKKGERECHIVVLT
DEDSVDWDEDHPPPMGEEYSQILYSSKLYRFFKYIENRDVAKTVLKERGLKNIRIGIEGY
PTCKEKIKRRPGGRSEVIYNYVQRPFIQMSWEKEEGKSRHVDFQCVRSKSLTNLVAAGDD
VLEDQEILMHHPPQVDELDRLNAPLSQMASNDFQD

>gi568815592r:36395787_36640202|GENSCAN_predicted_CDS_2|1368_bp
atgtatatccagggtagtgactttcgagcaaatgcgtttaagcaactccagtattcttac
tacgctacctccaacctcgctccacgcatggggagatcagggttccacattttcggtggg
ttcgtggtcttgctgacttcaggagtgaagccgcagaccttctcagtgagtgttacagct
cttaaaggtggcgtgtctggagttgtttgttcctcccggtgggttcgtggtcttgctgac
ttcaggagtgaagctgcagacctttgcaacctctcacttgactatgcctctcagccagca
aatcttcagttccctcacataatgccccttgctgaagacatcaaaggttcttgcttccaa
agtgggaataaacggaaccatgaaccttttattgctccagaaagatttggaaacagtagt
gtgggctttggcagtaattcccattcccaagcaccagagaaagtgacgcttcttgtagat
ggcacacgttttgttgtgaatccacagattttcactgctcatccggataccatgctggga
aggatgtttggaccaggaagagagtacaacttcactcggcccaatgagaagggagagtat
gagattgctgaaggcatcagtgcaactgtatttcgcacagtgctggattattacaaaacc
ggtatcatcaattgtcctgatggcatctctatcccagatcttagagatacttgtgattat
ctctgcattaattttgacttcaacactatccgatgtcaagatctgagtgctttactccat
gaactgtctaatgacggtgctcataagcagtttgatcactacctcgaagagctcatcttg
cccatcatggtgggctgtgccaagaaaggagaacgagagtgccacattgttgtgctgacg
gatgaggattctgtggactgggatgaagaccaccctccaccaatgggggaggaatattcc
caaattctttatagctccaagctctacagattcttcaaatatattgagaatagggatgtt
gcaaaaacagtgttaaaggaacggggcctaaaaaacattcgcattggaattgaaggttac
cctacctgtaaagaaaaaattaagagaaggcctggcggccggtctgaagtcatctataat
tatgtacaacgccccttcatccagatgtcatgggaaaaggaagaagggaagagtcgccat
gtggatttccagtgtgttcgaagcaaatccctcacgaatctggtagctgctggagatgat
gtcttggaggaccaggagatattaatgcatcacccaccccaagtggatgaacttgaccgg
ctaaatgccccactttctcagatggcttctaacgactttcaggattag

>gi568815592r:36395787_36640202|GENSCAN_predicted_peptide_3|631_aa
MQKKLKKEKEKEKEKKKKKKKKKKKKKKKKRKKKKKKRLAGHVGTLPVVLTRNRLLPPVS
ESLTRPLPSLARWLPPPGLRQPSSRDYWPKGRLRLSAVPSPASPWALDTGLQISFKSQEK
AGKILKKRVEKQQPEEKVAAMAMTGSTPCSSMSNHTKERVTMTKVTLENFYSNLIAQHEE
REMRQKKLEKVMEEEGLKDEEKRLRRSAHARKETEFLRLKRTRLGLEDFESLKVIGRGAF
GEVRLVQKKDTGHVYAMKILRKADMLEKEQVGHIRAERDILVEADSLWVVKMFYSFQDKL
NLYLIMEFLPGGDMMTLLMKKDTLTEEETQFYIAETVLAIDSIHQLGFIHRDIKPDNLLL
DSKATRRREERRPFQEPRLRGFLSQCCDTPFRALRFLASPSFQGHVKLSDFGLCTGLKKA
HRTEFYRNLNHSLPSDFTFQNMNSKRKAETWKRNRRQLSWYVRVEYWNAENVSLGQAFST
VGTPDYIAPEVFMQTGYNKLCDWWSLGVIMYEMLIGYPPFCSETPQETYKKVMNWKETLT
FPPEVPISEKAKDLILRERPAAISIEIKSIDDTSNFDEFPESDILKPTVATSNHPETDYK
NKDWVFINYTYKRFEGLTARGAIPSYMKAAK

>gi568815592r:36395787_36640202|GENSCAN_predicted_CDS_3|1896_bp
atgcagaagaaactgaagaaggagaaggagaaggagaaggagaagaagaagaagaagaag
aagaagaagaagaagaagaagaagaagaagaggaagaagaagaagaagaaaagattagct
gggcatgttggaacgcttcctgttgtcctcacccgtaaccgcctgttgccccctgtctca
gagtccctcacgcgtcccctcccgtctttggctcgttggctgccgccgccggggcttcgc
cagccttcaagtcgagactactggccgaaggggcgtctgcggctctccgccgtccccagc
cctgcctctccctgggctctggatactgggcttcagataagcttcaaatcacaggaaaag
gcagggaaaattcttaagaagagagtggaaaagcaacagccagaggaaaaagtcgcagcc
atggcaatgacaggctcaacaccttgctcatccatgagtaaccacacaaaggaaagggtg
acaatgaccaaagtgacactggagaatttttatagcaaccttatcgctcaacatgaagaa
cgagaaatgagacaaaagaagttagaaaaggtgatggaagaagaaggcctaaaagatgag
gagaaacgactccggagatcagcacatgctcggaaggaaacagagtttcttcgtttgaag
agaacaagacttggattggaagattttgagtccttaaaagtaataggcagaggagcattt
ggtgaggtacggcttgttcagaagaaagatacgggacatgtgtatgcaatgaaaatactc
cgtaaagcagatatgcttgaaaaagagcaggttggccacattcgtgcggagcgtgacatt
ctagtggaggcagacagtttgtgggttgtgaaaatgttctatagttttcaggataagcta
aacctctacctaatcatggagttcctgcctggaggggacatgatgaccttgttgatgaaa
aaagacactctgacagaagaggagactcagttttatatagcagaaacagtattagccata
gactctattcaccaacttggattcatccacagagacatcaaaccagacaaccttcttttg
gacagcaaggcaacaagaaggagagaagagcggcggcccttccaggagcccagacttagg
ggcttcctgagccagtgctgtgacaccccctttagggctctgcggtttctggcatctcca
agcttccagggccatgtgaaactttctgactttggtctttgcacaggactgaaaaaagca
cataggacagaattttataggaatctgaaccacagcctccccagtgatttcactttccag
aacatgaattccaaaaggaaagcagaaacctggaaaagaaatagacgtcagctatcctgg
tatgttagagtagaatactggaatgcagaaaatgtttctcttggtcaggccttctccaca
gtaggcactcctgactacattgctcctgaggtgttcatgcagaccgggtacaacaagctc
tgtgattggtggtcgcttggggtgatcatgtatgagatgctcatcggctacccacctttc
tgttctgagacccctcaagagacatataagaaggtgatgaactggaaagaaactttgact
tttcctccagaagttcccatctctgagaaagccaaggatctaattttgagagagagacct
gctgcaatatctattgaaatcaaaagcattgatgatacctcaaacttcgatgagtttcca
gaatctgatattcttaagccaacagtggccacaagtaatcatcctgagactgactacaag
aacaaagactgggtcttcatcaattacacgtacaagcgctttgagggcctgactgcaagg
ggggcaataccttcctacatgaaagcagcaaaatag

>gi568815592r:36395787_36640202|GENSCAN_predicted_peptide_4|373_aa
MHRDSCPLDCKVYVGNLGNNGNKTELERAFGYYGPLRSVWVARNPPGFAFVEFEDPRDAA
DAVRELDGRTLCGCRVRVELSNGEKRSRNRGPPPSWGRRPRDDYRRRSPPPRRSLLSNLN
QIGSSHLDRPHIPGQSAQLFIYQMSSQQLQQQPSANKKAGKIHNTPFANQLNPTQHLAKP
FQQILPGHLQEGEASLAAGAEIATATPAFSDHNPDQSAAINTEGRPSTSKVMWLKAYMIN
PEQVPQEQVRGVSPVGKAFLRGSEAVSQERGEAPLSQSCSAHLAAAATHLYAPSWYITIT
TGSHDRNTEVGSSHKEVASSIIKKLIPAHVDGFEEFKTLVEEVMKDVVEIARELKLESGD
VTGSTFSSLAIPT

>gi568815592r:36395787_36640202|GENSCAN_predicted_CDS_4|1122_bp
atgcatcgtgattcctgtccattggactgtaaggtttatgtaggcaatcttggaaacaat
ggcaacaagacggaattggaacgggcttttggctactatggaccactccgaagtgtgtgg
gttgctagaaacccacccggctttgcttttgttgaatttgaagatccccgagatgcagct
gatgcagtccgagagctagatggaagaacactatgtggctgccgtgtaagagtggaactg
tcgaatggtgaaaaaagaagtagaaatcgtggcccacctccctcttggggtcgtcgccct
cgagatgattatcgtaggaggagtcctccacctcgtcgcagccttctctccaaccttaac
caaatcggcagcagccacctcgaccgcccacacattcctggccaatcagctcagctgttt
atttaccaaatgtcttcacaacaactacagcagcagccttcggctaacaaaaaagcagga
aaaatccacaacacccccttcgccaaccaactaaatccaacgcaacatctggcaaaacct
tttcagcaaattcttcctggccatctccaagaaggagaagcttctctcgcagccggagca
gaaattgccacagccactccagccttcagcgaccacaatcctgatcagtcagcagccatc
aacactgagggaagaccctccaccagcaaggttatgtggctgaaggcttatatgattaat
ccagagcaggtcccccaggagcaggttcgtggcgtatcaccagtggggaaggcttttctc
agaggctctgaggcagtgtctcaagaaaggggggaggctccgctttctcagagttgcagt
gctcaccttgctgctgctgccacacatctgtatgcgccttcctggtacatcaccatcacc
acaggctctcatgatagaaacacagaagttggttccagccacaaggaagtggcttcatcc
atcatcaagaagttgattccagcccacgtggatggctttgaggagttcaagactttagtg
gaggaagtgatgaaagatgtggtggaaatagcaagagaactaaaactggagtctggagat
gtgacaggctccactttcagttctcttgctattcccacatga