GENSCAN 1.0	Date run: 7-Nov-116	Time: 19:37:57

Sequence gi568815594f:25562501_25776746 : 214246 bp : 45.19% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.00 Prom +  16257  16296   40                              -1.16
 1.01 Init +  50790  50879   90  2  0   92  113    14 0.249   3.60
 1.02 Term +  60775  61101  327  0  0   34   44   262 0.456  11.11
 1.03 PlyA +  61326  61331    6                               1.05

 2.03 PlyA -  61412  61407    6                               1.05
 2.02 Term -  85791  85522  270  0  0  -26   34   318 0.671  10.48
 2.01 Init -  87508  87506    3  1  0  113   81     0 0.544   1.80
 2.00 Prom -  90950  90911   40                              -4.46

 3.00 Prom +  91271  91310   40                              -6.16
 3.01 Init +  93206  93390  185  1  2   90  105    69 0.936   7.43
 3.02 Intr +  99636  99783  148  0  1   79   92    22 0.913   1.94
 3.03 Intr +  99998 100112  115  1  1  118  106    72 0.961  12.02
 3.04 Intr + 100205 100342  138  0  0   95   98   119 0.881  14.04
 3.05 Intr + 101702 101830  129  0  0  100   94    36 0.974   6.07
 3.06 Intr + 103628 103771  144  0  0   92   84   176 0.999  17.85
 3.07 Intr + 105380 105491  112  0  1   87  116    79 0.998  10.04
 3.08 Intr + 107147 107342  196  2  1   86   93   266 0.999  26.32
 3.09 Intr + 108238 108333   96  0  0   83  102    87 0.998   9.81
 3.10 Intr + 109101 109221  121  2  1   54   82   152 0.579  11.27
 3.11 Intr + 110587 110754  168  2  0  103   82   157 0.998  16.52
 3.12 Intr + 111796 111912  117  2  0  116   97    88 0.997  12.94
 3.13 Intr + 112005 112129  125  1  2  119   71   141 0.999  15.80
 3.14 Term + 113635 114249  615  0  0  130   50   894 0.985  84.26
 3.15 PlyA + 115162 115167    6                               1.05

 4.00 Prom + 116296 116335   40                              -5.06
 4.01 Init + 116350 116483  134  0  2   83   25    93 0.678   2.11
 4.02 Term + 119746 119929  184  1  1   55   38   153 0.676   4.02
 4.03 PlyA + 120110 120115    6                               1.05

 5.00 Prom + 124613 124652   40                              -0.56
 5.01 Init + 131900 131962   63  1  0   42   96    58 0.527   3.15
 5.02 Term + 136995 137015   21  2  0  124   48    16 0.530  -0.49
 5.03 PlyA + 137927 137932    6                               1.05

 6.11 PlyA - 138440 138435    6                               1.05
 6.10 Term - 151012 150927   86  2  2   90   37   102 0.883   3.12
 6.09 Intr - 159430 159325  106  1  1   21  106   104 0.682   5.29
 6.08 Intr - 163643 163559   85  1  1   91   93    13 0.126   1.92
 6.07 Intr - 186430 186399   32  1  2  100   52    50 0.002  -0.47
 6.06 Intr - 195290 195188  103  1  1   84   82    28 0.545   1.98
 6.05 Intr - 196568 196441  128  2  2   69  105    57 0.876   4.98
 6.04 Intr - 202935 202826  110  1  2   60   97   160 0.715  14.10
 6.03 Intr - 204336 204165  172  0  1  100   52   -12 0.342  -4.08
 6.02 Intr - 205330 205240   91  0  1   54  100    56 0.772   3.40
 6.01 Intr - 213860 213777   84  1  0  121  103   126 0.993  16.34

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 179873 179940   68  1  2   86   96    43 0.900   3.45
S.002 Term + 180353 180416   64  2  1   89   47    76 0.870   0.96

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815594f:25562501_25776746|GENSCAN_predicted_peptide_1|138_aa
MHSYLLCPEFIPSGEFLVLLTSRMKPRTCTIIQWCTNQKDNPPPPPEADENEEKRTDAIP
AWDQKFLKIDPGTLFEVILAANYLDIKGLLDVPCKTVAYLIKGKAPEEICTNRLPLIPKL
TLLEEAQIPKENQWCEET
>gi568815594f:25562501_25776746|GENSCAN_predicted_CDS_1|417_bp
atgcactcctacctactgtgtccggaatttattccttccggtgagttcttggtcttgctg
acttcaagaatgaagccacggacctgcacgataattcagtggtgcaccaaccaaaaggat
aaccctcctcctcctccagaggctgatgagaatgaagaaaagcgaacagatgctatccct
gcttgggaccaaaaattcctgaaaattgacccaggaacactttttgaagtcattttggct
gcaaactacttagacatcaaaggtttgcttgatgttccatgcaagactgttgcctatttg
atcaaggggaaggctcctgaggagatttgcacaaacaggcttccgttgataccaaaactg
actttactggaggaagcccagatacccaaagagaaccagtggtgtgaagagacatga
>gi568815594f:25562501_25776746|GENSCAN_predicted_peptide_2|90_aa
MLEAASELETASELETVSELETASELEIVSELETASELEIVSELEIVSELETASELEIAS
ELETASELEIASELETASELETASVRTGSH
>gi568815594f:25562501_25776746|GENSCAN_predicted_CDS_2|273_bp
atgctggaggccgcttcagagctggagaccgcttcagagctggagactgtttcagagctg
gagaccgcttcagagctggagattgtttcagagctggagaccgcttcagagctggagatt
gtttcagagctggagatcgtttcagagctggagaccgcttcagagctggagatcgcttca
gagctggagaccgcttcagagctggagatagcttcagagctggagaccgcttcagagctg
gagaccgcttcagtcagaactggaagccactag
>gi568815594f:25562501_25776746|GENSCAN_predicted_peptide_3|802_aa
MGSLRRQVGSAPAAAAAGGPGMCEGRDDSGQPLCATPSPYIPGALRSTWPPPPAQHLRRE
RCAGITGVSHGARCTLLLTEGESQDKNKFTRLKVVHAANSTLYEVVCFSDWTMAPWPELG
DAQPNPDKYLEGAAGQQPTAPDKSKETNKTDNTEAPVTKIELLPSYSTATLIDEPTEVDD
PWNLPTLQDSGIKWSERDTKGKILCFFQGIGRLILLLGFLYFFVCSLDILSSAFQLVGGK
MAGQFFSNSSIMSNPLLGLVIGVLVTVLVQSSSTSTSIVVSMVSSSLLTVRAAIPIIMGA
NIGTSITNTIVALMQVGDRSEFRRAFAGATVHDFFNWLSVLVLLPVEVATHYLEIITQLI
VESFHFKNGEDAPDLLKVITKPFTKLIVQLDKKVISQIAMNDEKAKNKSLVKIWCKTFTN
KTQINVTVPSTANCTSPSLCWTDGIQNWTMKNVTYKENIAKCQHIFVNFHLPDLAVGTIL
LILSLLVLCGCLIMIVKILGSVLKGQVATVIKKTINTDFPFPFAWLTGYLAILVGAGMTF
IVQSSSVFTSALTPLIGIGVITIERAYPLTLGSNIGTTTTAILAALASPGNALRSSLQIA
LCHFFFNISGILLWYPIPFTRLPIRMAKGLGNISAKYRWFAVFYLIIFFFLIPLTVFGLS
LAGWRVLVGVGVPVVFIIILVLCLRLLQSRCPRVLPKKLQNWNFLPLWMRSLKPWDAVVS
KFTGCFQMRCCCCCRVCCRACCLLCDCPKCCRCSKCCEDLEEAQEGQDVPVKAPETFDNI
TISREAQGEVPASDSKTECTAL
>gi568815594f:25562501_25776746|GENSCAN_predicted_CDS_3|2409_bp
atgggttcattaaggcggcaggtaggcagtgccccggcggcggctgcggcaggcggtcct
ggaatgtgcgaggggcgtgatgacagcggccagcctctttgcgcaacaccttcgccatat
atacccggggcgctgcgctccacctggccgccgcctccagcccagcacctgcggagggag
cgctgtgctgggattacaggcgtgagccacggtgcccgttgcaccctgcttttgactgaa
ggagagagtcaagacaaaaacaagtttacgagattaaaggtggtacatgcagctaactca
acactgtacgaggtagtttgctttagcgactggaccatggctccctggcctgaattggga
gatgcccagcccaaccccgataagtacctcgaaggggccgcaggtcagcagcccactgcc
cctgataaaagcaaagagaccaacaaaacagataacactgaggcacctgtaaccaagatt
gaacttctgccgtcctactccacggctacactgatagatgagcccactgaggtggatgac
ccctggaacctacccactcttcaggactcggggatcaagtggtcagagagagacaccaaa
gggaagattctctgtttcttccaagggattgggagattgattttacttctcggatttctc
tactttttcgtgtgctccctggatattcttagtagcgccttccagctggttggaggaaaa
atggcaggacagttcttcagcaacagctctattatgtccaaccctttgttggggctggtg
atcggggtgctggtgaccgtcttggtgcagagctccagcacctcaacgtccatcgttgtc
agcatggtgtcctcttcattgctcactgttcgggctgccatccccattatcatgggggcc
aacattggaacgtcaatcaccaacactattgttgcgctcatgcaggtgggagatcggagt
gagttcagaagagcttttgcaggagccactgtccatgacttcttcaactggctgtccgtg
ttggtgctcttgcccgtggaggtggccacccattacctcgagatcataacccagcttata
gtggagagcttccacttcaagaatggagaagatgccccagatcttctgaaagtcatcact
aagcccttcacaaagctcattgtccagctggataaaaaagttatcagccaaattgcaatg
aacgatgaaaaagcgaaaaacaagagtcttgtcaagatttggtgcaaaacttttaccaac
aagacccagattaacgtcactgttccctcgactgctaactgcacctccccttccctctgt
tggacggatggcatccaaaactggaccatgaagaatgtgacctacaaggagaacatcgcc
aaatgccagcatatctttgtgaatttccacctcccggatcttgctgtgggcaccatcttg
ctcatactctccctgctggtcctctgtggttgcctgatcatgattgtcaagatcctgggc
tctgtgctcaaggggcaggtcgccactgtcatcaagaagaccatcaacactgatttcccc
tttccctttgcatggttgactggctacctggccatcctcgtcggggcaggcatgaccttc
atcgtacagagcagctctgtgttcacgtcggccttgacccccctgattggaatcggcgtg
ataaccattgagagggcttatccactcacgctgggctccaacatcggcaccaccaccacc
gccatcctggccgccttagccagccctggcaatgcattgaggagttcactccagatcgcc
ctgtgccactttttcttcaacatctccggcatcttgctgtggtacccgatcccgttcact
cgcctgcccatccgcatggccaaggggctgggcaacatctctgccaagtatcgctggttc
gccgtcttctacctgatcatcttcttcttcctgatcccgctgacggtgtttggcctctcg
ctggccggctggcgggtgctggttggtgtcggggttcccgtcgtcttcatcatcatcctg
gtactgtgcctccgactcctgcagtctcgctgcccacgcgtcctgccgaagaaactccag
aactggaacttcctgccgctgtggatgcgctcgctgaagccctgggatgccgtcgtctcc
aagttcaccggctgcttccagatgcgctgctgctgctgctgccgcgtgtgctgccgcgcg
tgctgcttgctgtgtgactgccccaagtgctgccgctgcagcaagtgctgcgaggacttg
gaggaggcgcaggaggggcaggatgtccctgtcaaggctcctgagacctttgataacata
accattagcagagaggctcagggtgaggtccctgcctcggactcaaagaccgaatgcacg
gccttgtag
>gi568815594f:25562501_25776746|GENSCAN_predicted_peptide_4|105_aa
MGHQQLYWSHPKKFGQGSRSCRVYSNQHGLIRKYGLNKCRQCFCQILWIRDLEQKQAQHP
ASTVTRLANEATIKDTASILPDWWGQQNNPCQKRPALIADHGVNK
>gi568815594f:25562501_25776746|GENSCAN_predicted_CDS_4|318_bp
atgggtcaccagcagctgtactggagccacccaaaaaaattcggccagggttctcgttct
tgtcgtgtctattcaaaccagcacggtctgatccggaaatatggcctcaataagtgccgc
caatgtttctgtcaaatcttgtggataagagacctggagcagaaacaggcccagcatcca
gccagcactgtcaccagacttgcaaatgaagccaccataaaggatacagccagcatcctg
cctgactggtggggccagcagaacaacccctgccaaaaacgcccagccctgattgctgac
catggagtaaacaaataa
>gi568815594f:25562501_25776746|GENSCAN_predicted_peptide_5|27_aa
MYNTKSEPDVSYALWVIMMCQIQEIIS
>gi568815594f:25562501_25776746|GENSCAN_predicted_CDS_5|84_bp
atgtacaacaccaagagtgaacctgatgtcagctatgctctttgggtgataatgatgtgt
cagatccaggagataatcagctaa
>gi568815594f:25562501_25776746|GENSCAN_predicted_peptide_6|332_aa
XWAKHVAEKNGYLGHVIRKGLNAYLEGSWHEALLYYVLAAETGIEVSQTNLAHICEERPA
IRGTGRRLQDGRRSPSTSTFVSLLWTVSLAVATFPWFQPLPGSSTLFDLSTRLWVSAYLK
MGDLYYYGHQNQSQDLELSVQMYAQAALDGDSQGFFNLALLIEEGTIIPHHILDFLEIDS
TLHSNNISILQELYERCWSHSNEESFSPCSLAWLYLHLRLLWGAILHSALKSVGIHDLFC
RKWMRTDTLTWYFHQRRSNQGDSILNRGKQSHGKKARKHNCANTFEDVHHVTSTEIPLAK
ASHMESGEHNDEDDKIIIIIIIHHHDHHLDGK
>gi568815594f:25562501_25776746|GENSCAN_predicted_CDS_6|999_bp
nnatgggcaaaacatgtagctgagaaaaatggctacttgggccatgtcatccgcaaaggc
ctcaatgcctacctggaaggttcatggcatgaagctttgctgtattatgttttagcagca
gaaactggaattgaagtgtcacagacaaatttagcacacatctgtgaggagaggccagca
ataagaggcactggcagaagattgcaggatgggagaagaagtccaagtacttctaccttt
gtctctctgctttggactgtgtctctggcagtggctacatttccatggttccagcccctg
cctggcagctccaccctctttgatctcagcaccaggctctgggtctctgcatatttgaag
atgggagacctttactactatggccaccaaaaccagtcacaagacctggagttgtctgtg
cagatgtacgcccaagccgccctggatggagactcccagggattttttaacctggccctg
ctaatcgaggaaggtacgataatcccacaccatatcttggatttcttggaaattgactca
actctccattctaataacatctccattctccaggaactgtacgaaaggtgctggagccac
agtaacgaggagtccttcagcccctgctccttggcctggctttacctgcacttgcggctt
ctctggggtgctatcctgcactcagccctgaagagtgttggaatccacgacctcttctgt
aggaagtggatgagaactgatactctgacctggtactttcatcagaggcgttcaaaccag
ggggactccatcttgaataggggcaagcaatctcatggcaaaaaggctcgaaagcacaac
tgtgcaaacacatttgaagacgtccatcatgtcacttccactgagatcccactggcgaaa
gcaagtcacatggagtcaggagaacataatgatgaagatgataaaataatcatcatcatc
atcattcatcatcatgatcatcatcttgatggcaaatag