GENSCAN 1.0    Date run: 4-Nov-116    Time: 21:17:04

Sequence gi568815592f:36496763_36701794 : 205032 bp : 45.47% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.17 PlyA -    171    166    6                               1.05
 1.16 Term -   1113   1061   53  1  2   98   42    41 0.116  -1.91
 1.15 Intr -   3228   3111  118  0  1   87  123   107 0.893  14.14
 1.14 Intr -   4538   4485   54  2  0   91   80    22 0.554   0.88
 1.13 Intr -   9882   9821   62  1  2   88   94    29 0.633   1.95
 1.12 Intr -  10840  10738  103  1  1  110   89   -23 0.230  -0.25
 1.11 Intr -  12301  12182  120  1  0   97   50    45 0.304   2.29
 1.10 Intr -  18730  18576  155  2  2   89   67    99 0.122   7.69
 1.09 Intr -  21078  20955  124  0  1   40   76    31 0.420  -2.74
 1.08 Intr -  25055  24972   84  2  0   79  103    30 0.595   3.62
 1.07 Intr -  27701  27579  123  2  0   48   83    68 0.838   3.08
 1.06 Intr -  28880  28829   52  1  1   99   76    79 0.974   6.71
 1.05 Intr -  43445  43310  136  0  1  117   88    39 0.849   6.43
 1.04 Intr -  49955  49862   94  2  1   34   99    55 0.628   0.74
 1.03 Intr -  50621  50428  194  0  2   96  105    47 0.741   6.41
 1.02 Intr -  52643  52532  112  2  1  -61   -7   276 0.298   3.15
 1.01 Init -  61461  61447   15  0  0   75   99    16 0.220   1.45
 1.00 Prom -  78083  78044   40                              -4.56

 2.00 Prom +  97566  97605   40                              -2.66
 2.01 Init + 100001 100206  206  1  2   62   55   168 0.947   9.22
 2.02 Intr + 102087 102221  135  0  0   94   80    61 0.946   5.58
 2.03 Intr + 103241 103461  221  2  2   55   23   138 0.888   1.95
 2.04 Intr + 104390 104428   39  0  0  127   99    12 0.889   4.40
 2.05 Intr + 116966 117081  116  0  2   66   61   133 0.661   8.57
 2.06 Intr + 121695 121893  199  2  1   55   45    64 0.320  -2.28
 2.07 Term + 125549 125754  206  0  2   37   47   215 0.718   9.83
 2.08 PlyA + 125845 125850    6                               1.05

 3.00 Prom + 128831 128870   40                              -5.16
 3.01 Init + 142845 142902   58  2  1   59  111    32 0.298   4.11
 3.02 Intr + 146736 146850  115  1  1  119   28    47 0.025   1.31
 3.03 Intr + 168099 168205  107  0  2  112   94    37 0.338   6.56
 3.04 Term + 177053 177561  509  0  2   47   45   400 0.003  26.17
 3.05 PlyA + 177636 177641    6                              -5.80

 4.00 Prom + 177879 177918   40                             -13.24
 4.01 Init + 177961 178399  439  0  1   88   64   253 0.511  19.18
 4.02 Intr + 179763 179808   46  1  1   82   97    -6 0.215  -2.83
 4.03 Intr + 181063 181171  109  1  1   28  109    46 0.459   0.99
 4.04 Intr + 181929 182198  270  2  0   14   70   131 0.200   1.64
 4.05 Intr + 185905 185968   64  0  1   43   96    64 0.574   0.99
 4.06 Intr + 187335 187883  549  1  0   90   75   471 0.463  38.84
 4.07 Term + 188989 189038   50  2  2  115   36    81 0.977   3.07
 4.08 PlyA + 190547 190552    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl + 177055 177561  507  0  0  105   45   391 0.991  32.54

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592f:36496763_36701794|GENSCAN_predicted_peptide_1|532_aa
MQKKLKKEKEKEKEKKKKKKKKKKKKKKKKRKKKKKKRLAGHVGTLPVVLTRNRLLPPVS
ESLTRPLPSLARWLPPPGLRQPSSRDYWPKGRLRLSAVPSPASPWALDTGLQISFKSQEK
AGKILKKRVEKQQPEEKVAAMAMTGSTPCSSMSNHTKERVTMTKVTLENFYSNLIAQHEE
REMRQKKLEKVMEEEGLKDEEKRLRRSAHARKETEFLRLKRTRLGLEDFESLKVIGRGAF
GEVRLVQKKDTGHVYAMKILRKADMLEKEQVGHIRAERDILVEADSLWVVKMFYSFQDKL
NLYLIMEFLPGGDMMTLLMKKDTLTEEETQFYIAETVLAIDSIHQLGFIHRDIKPDNLLL
DSKATRRREERRPFQEPRLRGFLSQCCDTPFRALRFLASPSFQGHVKLSDFGLCTGLKKA
HRTEFYRNLNHSLPSDFTFQNMNSKRKAETWKRNRRQLSWYVRVEYWNAENVSLGQAFST
VGTPDYIAPEVFMQTGYNKLCDWWSLGVIMYEMLIGSAVNGNIELELLELRK

>gi568815592f:36496763_36701794|GENSCAN_predicted_CDS_1|1599_bp
atgcagaagaaactgaagaaggagaaggagaaggagaaggagaagaagaagaagaagaag
aagaagaagaagaagaagaagaagaagaagaggaagaagaagaagaagaaaagattagct
gggcatgttggaacgcttcctgttgtcctcacccgtaaccgcctgttgccccctgtctca
gagtccctcacgcgtcccctcccgtctttggctcgttggctgccgccgccggggcttcgc
cagccttcaagtcgagactactggccgaaggggcgtctgcggctctccgccgtccccagc
cctgcctctccctgggctctggatactgggcttcagataagcttcaaatcacaggaaaag
gcagggaaaattcttaagaagagagtggaaaagcaacagccagaggaaaaagtcgcagcc
atggcaatgacaggctcaacaccttgctcatccatgagtaaccacacaaaggaaagggtg
acaatgaccaaagtgacactggagaatttttatagcaaccttatcgctcaacatgaagaa
cgagaaatgagacaaaagaagttagaaaaggtgatggaagaagaaggcctaaaagatgag
gagaaacgactccggagatcagcacatgctcggaaggaaacagagtttcttcgtttgaag
agaacaagacttggattggaagattttgagtccttaaaagtaataggcagaggagcattt
ggtgaggtacggcttgttcagaagaaagatacgggacatgtgtatgcaatgaaaatactc
cgtaaagcagatatgcttgaaaaagagcaggttggccacattcgtgcggagcgtgacatt
ctagtggaggcagacagtttgtgggttgtgaaaatgttctatagttttcaggataagcta
aacctctacctaatcatggagttcctgcctggaggggacatgatgaccttgttgatgaaa
aaagacactctgacagaagaggagactcagttttatatagcagaaacagtattagccata
gactctattcaccaacttggattcatccacagagacatcaaaccagacaaccttcttttg
gacagcaaggcaacaagaaggagagaagagcggcggcccttccaggagcccagacttagg
ggcttcctgagccagtgctgtgacaccccctttagggctctgcggtttctggcatctcca
agcttccagggccatgtgaaactttctgactttggtctttgcacaggactgaaaaaagca
cataggacagaattttataggaatctgaaccacagcctccccagtgatttcactttccag
aacatgaattccaaaaggaaagcagaaacctggaaaagaaatagacgtcagctatcctgg
tatgttagagtagaatactggaatgcagaaaatgtttctcttggtcaggccttctccaca
gtaggcactcctgactacattgctcctgaggtgttcatgcagaccgggtacaacaagctc
tgtgattggtggtcgcttggggtgatcatgtatgagatgctcatcggttctgctgtgaat
gggaacatagaattggagctcctggagttgaggaaataa

>gi568815592f:36496763_36701794|GENSCAN_predicted_peptide_2|373_aa
MHRDSCPLDCKVYVGNLGNNGNKTELERAFGYYGPLRSVWVARNPPGFAFVEFEDPRDAA
DAVRELDGRTLCGCRVRVELSNGEKRSRNRGPPPSWGRRPRDDYRRRSPPPRRSLLSNLN
QIGSSHLDRPHIPGQSAQLFIYQMSSQQLQQQPSANKKAGKIHNTPFANQLNPTQHLAKP
FQQILPGHLQEGEASLAAGAEIATATPAFSDHNPDQSAAINTEGRPSTSKVMWLKAYMIN
PEQVPQEQVRGVSPVGKAFLRGSEAVSQERGEAPLSQSCSAHLAAAATHLYAPSWYITIT
TGSHDRNTEVGSSHKEVASSIIKKLIPAHVDGFEEFKTLVEEVMKDVVEIARELKLESGD
VTGSTFSSLAIPT

>gi568815592f:36496763_36701794|GENSCAN_predicted_CDS_2|1122_bp
atgcatcgtgattcctgtccattggactgtaaggtttatgtaggcaatcttggaaacaat
ggcaacaagacggaattggaacgggcttttggctactatggaccactccgaagtgtgtgg
gttgctagaaacccacccggctttgcttttgttgaatttgaagatccccgagatgcagct
gatgcagtccgagagctagatggaagaacactatgtggctgccgtgtaagagtggaactg
tcgaatggtgaaaaaagaagtagaaatcgtggcccacctccctcttggggtcgtcgccct
cgagatgattatcgtaggaggagtcctccacctcgtcgcagccttctctccaaccttaac
caaatcggcagcagccacctcgaccgcccacacattcctggccaatcagctcagctgttt
atttaccaaatgtcttcacaacaactacagcagcagccttcggctaacaaaaaagcagga
aaaatccacaacacccccttcgccaaccaactaaatccaacgcaacatctggcaaaacct
tttcagcaaattcttcctggccatctccaagaaggagaagcttctctcgcagccggagca
gaaattgccacagccactccagccttcagcgaccacaatcctgatcagtcagcagccatc
aacactgagggaagaccctccaccagcaaggttatgtggctgaaggcttatatgattaat
ccagagcaggtcccccaggagcaggttcgtggcgtatcaccagtggggaaggcttttctc
agaggctctgaggcagtgtctcaagaaaggggggaggctccgctttctcagagttgcagt
gctcaccttgctgctgctgccacacatctgtatgcgccttcctggtacatcaccatcacc
acaggctctcatgatagaaacacagaagttggttccagccacaaggaagtggcttcatcc
atcatcaagaagttgattccagcccacgtggatggctttgaggagttcaagactttagtg
gaggaagtgatgaaagatgtggtggaaatagcaagagaactaaaactggagtctggagat
gtgacaggctccactttcagttctcttgctattcccacatga

>gi568815592f:36496763_36701794|GENSCAN_predicted_peptide_3|262_aa
MDKPRPGKTTFVIIVSPLPGLTTLGNPCWQTCFQAGNLDISRISGIPGGLTIQKLGQVSG
YSGPSSFRTVAPQVSHLPPRDRYFPDLYSATHTDMTKGLVLGIYSKEKEDDVPQFTSAGE
NLDKLIAGKLRETLNISGPPLKAGKTRNFYGLHQDFPSVVLVGLGKKAARIDEQENWQEG
KENIRAAVAAGCRQIQDLELSSVEVDPCRDAQAAEEGAVLGLYEYDDLKQKKKMAMSVKL
YGTGDQEAWQKGVLFASGQNLA

>gi568815592f:36496763_36701794|GENSCAN_predicted_CDS_3|789_bp
atggataagcctcgccctgggaaaaccaccttcgtgatcatagtatctcccctgccaggg
ctgacaacacttggaaacccatgctggcaaacctgcttccaagcagggaacttggacata
agtagaatatcaggaattcctggaggacttacaatccagaaactgggacaggtctcgggt
tattctggtccttcaagttttagaactgtggccccccaggtgtcacatctgccccccaga
gaccgctacttccctgatctttattcagccacacacacagacatgacgaagggccttgtt
ttaggaatctattccaaagaaaaagaagatgatgtgccacagttcacaagtgcaggagag
aatcttgataaattgatagctggaaagctgagagagactttgaacatatctggaccacct
ctgaaggcaggcaagactcgaaacttttatggtctgcatcaggacttccccagcgtggtg
ctagttggcctcggcaaaaaggcagccagaatcgacgaacaggaaaactggcaggaaggc
aaagaaaacatcagagctgctgttgcagcaggatgcaggcagattcaagacctggagctc
tcttccgtggaggtggatccctgtagagatgctcaggctgctgaggagggcgcggtgctt
ggtctctatgaatacgatgacctaaagcaaaaaaagaagatggctatgtcggtgaagctc
tatggaactggggatcaggaggcctggcagaaaggagtcctgtttgcttctgggcagaac
ttggcatga

>gi568815592f:36496763_36701794|GENSCAN_predicted_peptide_4|508_aa
MPSGKANKLGDVVRARNRKTIQVGNTDAEGRLILADALCYVHTFNPKVILNATTLTGVID
VALGSGATGVFTNSSWLWNKLFEASIETGDRVWRMPLFKHCTRQVVDCQLADVNNIGKYR
SAGACTSAAFLKEFVTHPKWAHLDIAGEEGDGRRQETSKDPSVYGLCGEYSGDRQLTRQI
LPFLANKAAATTGISSVQGRAELRQLRCEQLPKSVPCGAGAGRGFAEAPRHSEEVRERRQ
TTGDPGPAAQSRAKRARVCPCVSARMRVRGCVLRSQVFLRQVNDGRGSENPVAAKVVQSL
SPGVKPEQAGAMSEPAGDVRQNPCGSKACRRLFGPVDSEQLSRDCDALMAGCIQEARERW
NFDFVTETPLEGDFAWERVRGLGLPKLYLPTGPRRGRDELGGGRRPGTSPALLQGTAEED
HVDLSLSCTLVPRSGEQAEGSPGGPGDSQGRKRRQTSMTGADMCTEGLCKGPGFSESMVQ
GLTCLVLVQHAPDFYHSKRRLIFSKRKP

>gi568815592f:36496763_36701794|GENSCAN_predicted_CDS_4|1527_bp
atgcccagcggcaaggccaacaagctgggggatgttgttagagccaggaacaggaagacc
atccaggttggtaacactgatgctgaggggaggctcatactggctgatgcgctctgttac
gtgcacacatttaacccgaaggtcatcctcaatgccaccaccttaacaggtgtcatagat
gtagctttggggtcaggtgccactggggtctttaccaattcatcctggctctggaacaag
ctcttcgaggccagcattgaaacaggggaccgtgtctggaggatgcctctcttcaaacat
tgtacaagacaggttgtagattgccagctggctgatgttaacaacattggaaaatataga
tctgcgggagcatgtacatctgcggcattcctgaaagaattcgtgactcatcctaagtgg
gcacatttagacatagcaggtgaggaaggggatggtaggagacaggagacctctaaagac
cccagtgtatacgggctatgtggggagtattcaggagacagacaactcactcgtcaaatc
ctccccttcctggccaacaaagctgctgcaaccacagggatttcttctgttcagggccgc
gctgagctgcgccagctgaggtgtgagcagctgccgaagtcagttccttgtggagccgga
gctgggcgcggattcgccgaggcaccgaggcactcagaggaggtgagagagcggcggcag
acaacaggggaccccgggccggcggcccagagccgagccaagcgtgcccgcgtgtgtccc
tgcgtgtccgcgaggatgcgtgttcgcgggtgtgtgctgcgttcacaggtgtttctgcgg
caggtgaatgacgggcgtgggtcggaaaatccagttgctgccaaggtcgtgcagtcactc
agccctggagtcaagccagagcaggcaggcgccatgtcagaaccggctggggatgtccgt
cagaacccatgcggcagcaaggcctgccgccgcctcttcggcccagtggacagcgagcag
ctgagccgcgactgtgatgcgctaatggcgggctgcatccaggaggcccgtgagcgatgg
aacttcgactttgtcaccgagacaccactggagggtgacttcgcctgggagcgtgtgcgg
ggccttggcctgcccaagctctaccttcccacggggccccggcgaggccgggatgagttg
ggaggaggcaggcggcctggcacctcacctgctctgctgcaggggacagcagaggaagac
catgtggacctgtcactgtcttgtacccttgtgcctcgctcaggggagcaggctgaaggg
tccccaggtggacctggagactctcagggtcgaaaacggcggcagaccagcatgacaggt
gcggacatgtgcacggaaggactttgtaagggaccaggattctcagaatccatggtccaa
gggctgacctgtctggtcctggtccagcatgctccagatttctaccactccaaacgccgg
ctgatcttctccaagaggaagccctaa