GENSCAN 1.0 Date run: 6-Nov-116 Time: 03:39:36

Sequence gi568815592f:36370098_36587172 : 217075 bp : 43.52% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.07 Intr -   1463   1233  231  0  0   93  105   201 0.965  20.04
 1.06 Intr -   3481   3356  126  2  0  107  100   121 0.993  15.85
 1.05 Intr -   5938   5774  165  2  0   61   71   181 0.867  13.63
 1.04 Intr -  15572  15437  136  2  1  115   94    32 0.274   6.64
 1.03 Intr -  16715  16478  238  1  1   76  -32   108 0.226  -4.78
 1.02 Intr -  17257  17138  120  0  0  103   48   101 0.925   7.21
 1.01 Init -  18028  17961   68  1  2   90   47    94 0.982   4.10
 1.00 Prom -  19790  19751   40                              -5.46

 2.04 PlyA -  20484  20479    6                               1.05
 2.03 Term -  26441  26135  307  1  1   79   55   117 0.700   2.09
 2.02 Intr -  30487  30322  166  2  1  100   91    85 0.403   9.02
 2.01 Init -  55868  55817   52  2  1   69  110    18 0.280   3.42
 2.00 Prom -  58522  58483   40                              -3.16

 3.00 Prom +  61080  61119   40                              -3.56
 3.01 Init +  72498  72613  116  2  2   62  102    78 0.397   6.28
 3.02 Intr +  78766  78917  152  1  2   66   73    60 0.212   2.11
 3.03 Intr + 104692 104965  274  2  1   78   88   155 0.655  11.10
 3.04 Intr + 109024 109126  103  1  1   44   70   162 0.915  10.18
 3.05 Intr + 109494 109614  121  2  1   94   80    64 0.999   6.27
 3.06 Intr + 111465 111662  198  1  0  111   19   269 0.988  21.62
 3.07 Intr + 114617 114727  111  0  0  111   93   103 0.999  13.45
 3.08 Term + 116786 117078  293  0  2   88   49   165 0.971   8.11
 3.09 PlyA + 121026 121031    6                               1.05

 4.19 PlyA - 123813 123808    6                               1.05
 4.18 Term - 125817 125687  131  1  2   79   38   146 0.954   7.04
 4.17 Intr - 126708 126614   95  2  2   51   92    89 0.809   5.21
 4.16 Intr - 128389 128266  124  2  1   75   52    39 0.746  -1.36
 4.15 Intr - 129893 129776  118  2  1   87  123   107 0.896  14.14
 4.14 Intr - 131203 131150   54  1  0   91   80    22 0.555   0.88
 4.13 Intr - 136547 136486   62  0  2   88   94    29 0.634   1.95
 4.12 Intr - 137505 137403  103  0  1  110   89   -23 0.230  -0.25
 4.11 Intr - 138966 138847  120  0  0   97   50    45 0.305   2.29
 4.10 Intr - 145395 145241  155  1  2   89   67    99 0.122   7.69
 4.09 Intr - 147743 147620  124  2  1   40   76    31 0.420  -2.74
 4.08 Intr - 151720 151637   84  1  0   79  103    30 0.595   3.62
 4.07 Intr - 154366 154244  123  1  0   48   83    68 0.838   3.08
 4.06 Intr - 155545 155494   52  0  1   99   76    79 0.975   6.71
 4.05 Intr - 170110 169975  136  2  1  117   88    39 0.849   6.43
 4.04 Intr - 176620 176527   94  1  1   34   99    55 0.628   0.74
 4.03 Intr - 177286 177093  194  2  2   96  105    47 0.741   6.41
 4.02 Intr - 179308 179197  112  1  1  -61   -7   276 0.298   3.15
 4.01 Init - 188126 188112   15  2  0   75   99    16 0.220   1.45

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592f:36370098_36587172|GENSCAN_predicted_peptide_1|362_aa
MRGPFLSYVRRLTGSFLVLGSGPRGSPPLTLSVSYTALKASRTAPAGPFRSPLTTGRPYT
NQMRQPLRSDYLGNSTKGGPAGNEVRTTRKGVESPFAAAKASGVCFAGSDPNSLSADISP
RIPRDECKAFDNPRVKNDLMELEGELAISPISPVAAMPPLGTHVQARCEAQINLLGEGGI
CKLPGRLRIQPALWSREDVLHWLRWAEQEYSLPCTAEHGFEMNGRALCILTKDDFRHRAP
SSGDVLYELLQYIKTQRRALVCGPFFGGIFRLKTPTQHSPVPPEEVTGPSQMDTRRGHLL
QPPDPGLTSNFGHLDDPGLARWTPGKEESLNLCHCAELGCRTQGVCSFPAMPQAPIDGRI
AX

>gi568815592f:36370098_36587172|GENSCAN_predicted_CDS_1|1086_bp
atgcgaggccccttcctatcctatgtgcggagactcacagggtccttcctggtcctgggc
tcgggaccacgcgggtcaccgcccctgactctctcggtgtcatacacagctctgaaggca
tcacgcacagcccctgcgggccctttccgcagcccccttaccactggtcgcccctacacc
aaccagatgaggcaacccctgaggtctgattacttgggaaacagcaccaaagggggaccg
gctggaaatgaagtgagaacgacccggaaaggtgttgaatcccctttcgcagcagccaag
gcttctggagtttgctttgctggctccgatccaaattctctctcggcagatatctctcct
cggatcccaagagatgaatgtaaagcttttgacaatcctagagtgaaaaatgacttgatg
gagctggagggagaattggctatttctcctataagccctgtggcagccatgcctccccta
ggcacccacgtgcaagccagatgtgaagctcaaattaacctgctgggtgaaggggggatc
tgcaagctgccaggaagactccgcatccagcccgcactgtggagcagggaggacgtgctg
cactggctgcgctgggcagagcaggagtactctctgccatgcaccgcggagcacgggttc
gagatgaacggacgcgccctctgcatcctcaccaaggacgacttccggcaccgtgcgccc
agctcaggtgacgtcctgtatgagctgctccagtacatcaagacccagcggcgagccctg
gtgtgtgggcccttttttggagggatcttcaggctgaagacgcccacccagcactctcca
gtccccccggaagaggtgactggcccctctcagatggacacccgaaggggccacctgctg
cagccaccagacccagggcttaccagcaacttcggccacctggatgaccctggcctggca
aggtggacccctggcaaggaggagtccctcaacttatgtcactgtgcagagctcggctgc
aggacccagggggtctgttccttccccgcgatgccgcaggcccccattgacggcaggatc
gctgnn

>gi568815592f:36370098_36587172|GENSCAN_predicted_peptide_2|174_aa
MTQLVSSQPVPAMSRNPDHNLLSQPKEHSIVQKHHQEEIIHKLAMQLRHIGDNIDHRMVR
EVRLQFPQPGPVRNQSASGAMRDSAPTPAQRLEQALGAERGQAVAADTPEPSGVGVGCFL
GPLRVQAAEMPGSCNWEGGHSGTQGAPAPTQKGLGSYWLHGACGPSCTFLLQPA

>gi568815592f:36370098_36587172|GENSCAN_predicted_CDS_2|525_bp
atgacccaacttgtcagctcccagccagttccagccatgtcaaggaacccagatcataat
ctactttctcagcccaaggagcatagcattgttcagaagcatcaccaggaggaaataatt
cacaagttggccatgcagctgagacacattggggacaacattgatcataggatggttcga
gaggtgaggcttcagtttccccagccagggcctgtcaggaatcaatctgcctccggcgcc
atgcgggactcagccccaacccctgctcagagattggagcaggctctgggagcagagaga
ggccaggcagtggcagcagacacccctgagccttcaggggtaggggtagggtgcttcctg
gggcccctgagggtgcaagctgcagagatgcctgggtcctgcaactgggagggtggccac
agtggcacccagggagctcctgccccaactcagaaggggctgggctcctactggctccat
ggagcatgtggccccagctgcaccttcctgctgcagccagcatga

>gi568815592f:36370098_36587172|GENSCAN_predicted_peptide_3|455_aa
MYIQGSDFRANAFKQLQYSYYATSNLAPRMGRSGFHIFGGFVVLLTSGVKPQTFSVSVTA
LKGGVSGVVCSSRWVRGLADFRSEAADLCNLSLDYASQPANLQFPHIMPLAEDIKGSCFQ
SGNKRNHEPFIAPERFGNSSVGFGSNSHSQAPEKVTLLVDGTRFVVNPQIFTAHPDTMLG
RMFGPGREYNFTRPNEKGEYEIAEGISATVFRTVLDYYKTGIINCPDGISIPDLRDTCDY
LCINFDFNTIRCQDLSALLHELSNDGAHKQFDHYLEELILPIMVGCAKKGERECHIVVLT
DEDSVDWDEDHPPPMGEEYSQILYSSKLYRFFKYIENRDVAKTVLKERGLKNIRIGIEGY
PTCKEKIKRRPGGRSEVIYNYVQRPFIQMSWEKEEGKSRHVDFQCVRSKSLTNLVAAGDD
VLEDQEILMHHPPQVDELDRLNAPLSQMASNDFQD

>gi568815592f:36370098_36587172|GENSCAN_predicted_CDS_3|1368_bp
atgtatatccagggtagtgactttcgagcaaatgcgtttaagcaactccagtattcttac
tacgctacctccaacctcgctccacgcatggggagatcagggttccacattttcggtggg
ttcgtggtcttgctgacttcaggagtgaagccgcagaccttctcagtgagtgttacagct
cttaaaggtggcgtgtctggagttgtttgttcctcccggtgggttcgtggtcttgctgac
ttcaggagtgaagctgcagacctttgcaacctctcacttgactatgcctctcagccagca
aatcttcagttccctcacataatgccccttgctgaagacatcaaaggttcttgcttccaa
agtgggaataaacggaaccatgaaccttttattgctccagaaagatttggaaacagtagt
gtgggctttggcagtaattcccattcccaagcaccagagaaagtgacgcttcttgtagat
ggcacacgttttgttgtgaatccacagattttcactgctcatccggataccatgctggga
aggatgtttggaccaggaagagagtacaacttcactcggcccaatgagaagggagagtat
gagattgctgaaggcatcagtgcaactgtatttcgcacagtgctggattattacaaaacc
ggtatcatcaattgtcctgatggcatctctatcccagatcttagagatacttgtgattat
ctctgcattaattttgacttcaacactatccgatgtcaagatctgagtgctttactccat
gaactgtctaatgacggtgctcataagcagtttgatcactacctcgaagagctcatcttg
cccatcatggtgggctgtgccaagaaaggagaacgagagtgccacattgttgtgctgacg
gatgaggattctgtggactgggatgaagaccaccctccaccaatgggggaggaatattcc
caaattctttatagctccaagctctacagattcttcaaatatattgagaatagggatgtt
gcaaaaacagtgttaaaggaacggggcctaaaaaacattcgcattggaattgaaggttac
cctacctgtaaagaaaaaattaagagaaggcctggcggccggtctgaagtcatctataat
tatgtacaacgccccttcatccagatgtcatgggaaaaggaagaagggaagagtcgccat
gtggatttccagtgtgttcgaagcaaatccctcacgaatctggtagctgctggagatgat
gtcttggaggaccaggagatattaatgcatcacccaccccaagtggatgaacttgaccgg
ctaaatgccccactttctcagatggcttctaacgactttcaggattag

>gi568815592f:36370098_36587172|GENSCAN_predicted_peptide_4|631_aa
MQKKLKKEKEKEKEKKKKKKKKKKKKKKKKRKKKKKKRLAGHVGTLPVVLTRNRLLPPVS
ESLTRPLPSLARWLPPPGLRQPSSRDYWPKGRLRLSAVPSPASPWALDTGLQISFKSQEK
AGKILKKRVEKQQPEEKVAAMAMTGSTPCSSMSNHTKERVTMTKVTLENFYSNLIAQHEE
REMRQKKLEKVMEEEGLKDEEKRLRRSAHARKETEFLRLKRTRLGLEDFESLKVIGRGAF
GEVRLVQKKDTGHVYAMKILRKADMLEKEQVGHIRAERDILVEADSLWVVKMFYSFQDKL
NLYLIMEFLPGGDMMTLLMKKDTLTEEETQFYIAETVLAIDSIHQLGFIHRDIKPDNLLL
DSKATRRREERRPFQEPRLRGFLSQCCDTPFRALRFLASPSFQGHVKLSDFGLCTGLKKA
HRTEFYRNLNHSLPSDFTFQNMNSKRKAETWKRNRRQLSWYVRVEYWNAENVSLGQAFST
VGTPDYIAPEVFMQTGYNKLCDWWSLGVIMYEMLIGYPPFCSETPQETYKKVMNWKETLT
FPPEVPISEKAKDLILRERPAAISIEIKSIDDTSNFDEFPESDILKPTVATSNHPETDYK
NKDWVFINYTYKRFEGLTARGAIPSYMKAAK

>gi568815592f:36370098_36587172|GENSCAN_predicted_CDS_4|1896_bp
atgcagaagaaactgaagaaggagaaggagaaggagaaggagaagaagaagaagaagaag
aagaagaagaagaagaagaagaagaagaagaggaagaagaagaagaagaaaagattagct
gggcatgttggaacgcttcctgttgtcctcacccgtaaccgcctgttgccccctgtctca
gagtccctcacgcgtcccctcccgtctttggctcgttggctgccgccgccggggcttcgc
cagccttcaagtcgagactactggccgaaggggcgtctgcggctctccgccgtccccagc
cctgcctctccctgggctctggatactgggcttcagataagcttcaaatcacaggaaaag
gcagggaaaattcttaagaagagagtggaaaagcaacagccagaggaaaaagtcgcagcc
atggcaatgacaggctcaacaccttgctcatccatgagtaaccacacaaaggaaagggtg
acaatgaccaaagtgacactggagaatttttatagcaaccttatcgctcaacatgaagaa
cgagaaatgagacaaaagaagttagaaaaggtgatggaagaagaaggcctaaaagatgag
gagaaacgactccggagatcagcacatgctcggaaggaaacagagtttcttcgtttgaag
agaacaagacttggattggaagattttgagtccttaaaagtaataggcagaggagcattt
ggtgaggtacggcttgttcagaagaaagatacgggacatgtgtatgcaatgaaaatactc
cgtaaagcagatatgcttgaaaaagagcaggttggccacattcgtgcggagcgtgacatt
ctagtggaggcagacagtttgtgggttgtgaaaatgttctatagttttcaggataagcta
aacctctacctaatcatggagttcctgcctggaggggacatgatgaccttgttgatgaaa
aaagacactctgacagaagaggagactcagttttatatagcagaaacagtattagccata
gactctattcaccaacttggattcatccacagagacatcaaaccagacaaccttcttttg
gacagcaaggcaacaagaaggagagaagagcggcggcccttccaggagcccagacttagg
ggcttcctgagccagtgctgtgacaccccctttagggctctgcggtttctggcatctcca
agcttccagggccatgtgaaactttctgactttggtctttgcacaggactgaaaaaagca
cataggacagaattttataggaatctgaaccacagcctccccagtgatttcactttccag
aacatgaattccaaaaggaaagcagaaacctggaaaagaaatagacgtcagctatcctgg
tatgttagagtagaatactggaatgcagaaaatgtttctcttggtcaggccttctccaca
gtaggcactcctgactacattgctcctgaggtgttcatgcagaccgggtacaacaagctc
tgtgattggtggtcgcttggggtgatcatgtatgagatgctcatcggctacccacctttc
tgttctgagacccctcaagagacatataagaaggtgatgaactggaaagaaactttgact
tttcctccagaagttcccatctctgagaaagccaaggatctaattttgagagagagacct
gctgcaatatctattgaaatcaaaagcattgatgatacctcaaacttcgatgagtttcca
gaatctgatattcttaagccaacagtggccacaagtaatcatcctgagactgactacaag
aacaaagactgggtcttcatcaattacacgtacaagcgctttgagggcctgactgcaagg
ggggcaataccttcctacatgaaagcagcaaaatag