GENSCAN 1.0    Date run: 3-Nov-116    Time: 02:11:06

Sequence gi568815596f:64969445_65182702 : 213258 bp : 44.56% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

1.01  Init +   6804   7187  384  2  0   68   61   158 0.244   7.94
1.02  Intr +  16282  16439  158  0  2   75   92    -2 0.215  -2.29
1.03  Intr +  18900  19110  211  0  1   83   78   101 0.714   7.52
1.04  Intr +  19516  19608   93  0  0  126   50    22 0.626   2.36
1.05  Intr +  20179  20726  548  0  2   67   92   762 0.654  66.18
1.06  Intr +  32004  32046   43  0  1   91  103    16 0.740   1.64
1.07  Intr +  34509  34571   63  2  0   56   98    67 0.867   3.41
1.08  Intr +  41153  41319  167  1  2  101   89   158 0.985  15.96
1.09  Intr +  46996  47229  234  1  0  108  109   122 0.949  13.10
1.10  Intr +  48627  48821  195  0  0   74   84   286 0.996  25.33
1.11  Intr +  49101  49235  135  0  0   71   75   154 0.999  12.08
1.12  Term +  51468  51702  235  0  1   95   45   350 0.556  27.19
1.13  PlyA +  58064  58069    6                               1.05

2.00  Prom +  60719  60758   40                              -4.46
2.01  Init +  61338  61437  100  2  1   42   62    65 0.438  -0.28
2.02  Intr +  61502  61815  314  0  2   53  111   157 0.485  10.40
2.03  Term +  74102  74284  183  1  0   74   49   126 0.535   4.84
2.04  PlyA +  74599  74604    6                               1.05

3.00  Prom +  84338  84377   40                              -2.66
3.01  Init + 100001 100357  357  1  0   64   96   141 0.849   9.61
3.02  Intr + 102010 103536 1527  0  0   90   90   159 0.639   6.05
3.03  Intr + 104838 104960  123  2  0   99   57    67 0.688   5.48
3.04  Intr + 108424 108520   97  0  1   26   94   104 0.876   4.38
3.05  Term + 113092 113261  170  2  2   22   37   140 0.434   0.34
3.06  PlyA + 113400 113405    6                              -0.45

4.07  PlyA - 113587 113582    6                               1.05
4.06  Term - 116426 116401   26  0  2   87   44    39 0.377  -2.21
4.05  Intr - 119246 119066  181  2  1   88   49   146 0.816  10.14
4.04  Intr - 119626 119495  132  1  0   56   95    97 0.941   8.04
4.03  Intr - 121634 121539   96  2  0   29  111    59 0.850   2.51
4.02  Intr - 128622 128527   96  0  0  103   92    50 0.950   7.11
4.01  Init - 141645 141613   33  0  0  110   73   -13 0.341  -0.68
4.00  Prom - 144452 144413   40                              -4.86

5.00  Prom + 150845 150884   40                              -2.46
5.01  Init + 159552 159588   37  2  1   69   89    -6 0.551  -2.96
5.02  Intr + 160353 160544  192  1  0  102   56   134 0.632  11.06
5.03  Term + 160615 160697   83  2  2   82   41    67 0.356  -0.74
5.04  PlyA + 161741 161746    6                               1.05

6.00  Prom + 165655 165694   40                              -7.56
6.01  Init + 182398 182454   57  0  0   96   94   112 0.994  13.91
6.02  Term + 187000 187023   24  0  0  110   37    67 0.975   2.02
6.03  PlyA + 187707 187712    6                               1.05

7.03  PlyA - 187739 187734    6                               1.05
7.02  Term - 207341 207274   68  0  2   79   42    59 0.375  -1.50
7.01  Intr - 208281 208114  168  1  0   75   81    69 0.525   4.82

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596f:64969445_65182702|GENSCAN_predicted_peptide_1|821_aa
MPYCYTIQINLYIRLMHHSTNHYSILLHHSTNYAIPLSHSENPYIILLYHSFSKPLHHTA
TSFKKTLNHTAMPVTLLHTAVSFSKPLLYTATPFKKQLHHCYTVQQTSVSYSSIYNAPSN
LQGAEDSRAERAASTQGMLGHTYAGVHMRLCENILMPPLASHPLSPLAKELNGKAQRQWG
RAQAWERKRLKSSGLCFSFGYVAETNWKWWRESSVSSRRALRLKRPYRRQLPGGRPSPIL
MAPGHKPRSAPSSGGGTPLRSKFSRALWPRGLRSRPRERARQLGAACSAMEKSNETNGYL
DSAQAGPAAGPGAPGTAAGRARRCAGFLRRQALVLLTVSGVLAGAGLGAALRGLSLSRTQ
VTYLAFPGEMLLRMLRMIILPLVVCSLVSGAASLDASCLGRLGGIAVAYFGLTTLSASAL
AVALAFIIKPGSGAQTLQSSDLGLEDSGPPPVPKETVDSFLDLARNLFPSNLVVAAFRTY
ATDYKVVTQNSSSGNVTHEKIPIGTEIEGMNILGLVLFALVLGVALKKLGSEGEDLIRFF
NSLNEATMVLVSWIMWYVPVGIMFLVGSKIVEMKDIIVLVTSLGKYIFASILGHVIHGGI
VLPLIYFVFTRKNPFRFLLGLLAPFATAFATCSSSATLPSMMKCIEENNGVDKRISRFIL
PIGATVNMDGAAIFQCVAAVFIAQLNNVELNAGQIFTILVTATASSVGAAGVPAGGVLTI
AIILEAIGLPTHDLPLILAVDWIVDRTTTVVNVEGDALGAGILHHLNQKATKKGEQELAE
VKVEAIPNCKSEEETSPLVTHQNPAGPVASAPELESKESVL

>gi568815596f:64969445_65182702|GENSCAN_predicted_CDS_1|2466_bp
atgccatactgctataccattcaaataaacctgtacatcagactgatgcaccattcaaca
aaccactactccattctgctgcaccattcaacaaactatgccataccactaagccactca
gaaaacccctatatcatactgctgtaccattcattcagcaaacctctacaccatactgct
acatcattcaagaaaacactaaaccatactgctatgccagtcacactactccatactgct
gtctcattcagcaaaccactactctatactgctacaccattcaaaaaacaactgcaccac
tgctacactgttcaacaaacttctgtatcatacagcagtatctacaatgctcccagcaac
ttgcaaggtgctgaggatagcagggctgaaagggcagcatctacccaaggcatgctgggt
cacacgtatgcaggagtccacatgcgattgtgtgaaaacattttgatgcctccgcttgca
tcacatccacttagtccactagccaaggagctaaatggcaaagctcaaaggcaatgggga
agggctcaggcctgggaacggaagaggttaaaatccagtggcttgtgtttcagctttggt
tacgtggcagaaaccaactggaagtggtggcgtgagagctctgtttcttctagaagagct
ttgcgtcttaaacgcccataccgcagacagctgcctggcgggcgacccagccccatcctg
atggccccaggacacaagcccaggagcgccccgtcctcaggaggaggaacgccgctgcgt
tctaagttctcccgggccctctggccccgaggactcagatcccggccgcgggagagggcc
cggcagctcggagcggcgtgtagcgccatggagaagagcaacgagaccaacggctacctt
gacagcgctcaggcggggcctgcggccgggcccggagctccggggaccgcggcgggacgc
gcacggcgttgcgcgggcttcctgcggcgccaagcgctggtgctgctcaccgtgtccggg
gtgctggcgggcgcgggcctgggcgcggcgttgcgcgggctcagcctgagccgcacgcag
gtcacctacctggccttccccggcgagatgctgctccgcatgctgcgcatgatcatcctg
ccgctggtggtctgcagcctggtgtcgggcgccgcctcgctcgatgccagctgcctcggg
cgtctgggcggcatcgctgtcgcctactttggcctcaccacactgagtgcctcggcgctc
gccgtggccttggcgttcatcatcaagccaggatccggtgcgcagacccttcagtccagc
gacctggggctggaggactcggggcctcctcctgtccccaaagagacggtggactctttc
ctcgacctggccagaaacctgtttccctccaatcttgtggttgcagctttccgtacgtat
gcaaccgattataaagtcgtgacccagaacagcagctctggaaatgtaacccatgaaaag
atccccataggcactgagatagaagggatgaacattttaggattggtcctgtttgctctg
gtgttaggagtggccttaaagaaactaggctccgaaggagaagacctcatccgtttcttc
aattccctcaacgaggcgacgatggtgctggtgtcctggattatgtggtacgtacctgtg
ggcatcatgttccttgttggaagcaagatcgtggaaatgaaagacatcatcgtgctggtg
accagcctggggaaatacatcttcgcatctatattgggccatgttattcatggaggaatt
gttctgccacttatttattttgttttcacacgaaaaaacccattcagattcctcctgggc
ctcctcgccccatttgcgacagcatttgctacctgctccagctcagcgacccttccctct
atgatgaagtgcattgaagagaacaatggtgtggacaagaggatcagcaggtttattctc
cccatcggggccaccgtgaacatggacggagcagccatcttccagtgtgtggccgcggtg
ttcattgcgcaactcaacaacgtagagctcaacgcaggacagattttcaccattctagtg
actgccacagcgtccagtgttggagcagcaggcgtgccagctggaggggtcctcaccatt
gccattatcctggaggccattgggctgcctactcatgacctgcctctgatcctggctgtg
gactggattgtggaccggaccaccacggtggtgaatgtggaaggggatgccctgggtgca
ggcattctccaccacctgaatcagaaggcaacaaagaaaggcgagcaggaacttgctgag
gtgaaagtggaagccatccccaactgcaagtctgaggaggagacatcgcccctggtgaca
caccagaaccccgctggccccgtggccagtgccccagaactggaatccaaggagtcggtt
ctgtga

>gi568815596f:64969445_65182702|GENSCAN_predicted_peptide_2|198_aa
MQTFTTCISYSEYSCMLLANASSHGTLYCKLRVGICLLMVPAVKNQASGSARGATKVRRK
CQAGCQNEHLGELDDGTDGKNQLNIRENGGRGQNCEQELEESVAEKDLSQTSRDLEKMMS
KHIFLKPMLSISDLVNFLMQVSKVLVKTAEGIVLQQLPLAFPALHFHAYGNLFPVCSFKH
YIYMIDHPIFISIPDFLT

>gi568815596f:64969445_65182702|GENSCAN_predicted_CDS_2|597_bp
atgcagacttttacaacatgcatcagctacagtgagtattcttgcatgcttttggccaat
gcaagctctcatggcacgctgtactgtaagctccgggtcggcatttgtttgttgatggta
ccagctgtcaaaaaccaagcaagtggcagtgctagaggtgccacaaaagtgcgaaggaag
tgtcaggctggatgtcaaaatgaacaccttggagaactggatgatggaacagacggtaaa
aatcagctaaacatcagagaaaatggaggaagaggtcaaaactgtgaacaggaactagaa
gaaagtgtagcagaaaaagacttgtcacaaacttcgagagatttggagaaaatgatgtca
aaacacatcttcctcaagcccatgctgagtatctctgatttggttaatttcttgatgcaa
gtttccaaagttctggtgaaaacggctgaaggaatagtgttacagcaactgcccttggct
tttccagctttacacttccatgcctatggaaatctcttccctgtctgtagctttaagcat
tacatctatatgatcgaccatcccatcttcatctccatccctgacttcctaacttga

>gi568815596f:64969445_65182702|GENSCAN_predicted_peptide_3|757_aa
MALGEEKAEAEASEDTKAQSYGRGSCRERELDIPGPMSGEQPPRLEAEGGLISPVWGAEG
IPAPTCWIGTDPGGPSRAHQPQASDANREPVAERSEPALSGLPPATMGSGDLLLSGESQV
EKTKLSSSEEFPQTLSLPRTTTICSGHDADTEDDPSLADLPQALDLSQQPHSSGLSCLSQ
WKSVLSPGSAAQPSSCSISASSTGSSLQGHQERAEPRGGSLAKVSSSLEPVVPQEPSSVV
GLGPRPQWSPQPVFSGGDASGLGRRRLSFQAEYWACVLPDSLPPSPDRHSPLWNPNKEYE
DLLDYTYPLRPGPQLPKHLDSRVPADPVLQDSGVDLDSFSVSPASTLKSPTNVSPNCPPA
EATALPFSGPREPSLKQWPSRVPQKQGGMGLASWSQLASTPRAPGSRDARWERREPALRG
AKDRLTIGKHLDMGSPQLRTRDRGWPSPRPEREKRTSQSARRPTCTESRWKSEEEVESDD
EYLALPARLTQVSSLVSYLGSISTLVTLPTGDIKGQSPLEVSDSDGPASFPSSSSQSQLP
PGAALQGSGDPEGQNPCFLRSFVRAHDSAGEGSLGSSQALGVSSGLLKTRPSLPARLDRW
PFSDPDVEGQLPRKGGEQGKESLVQCVKTFCCQLEELICWLYNVADVTDHGTAARSNLTS
LKSSLQLYRQFKKDIDEHQSLTESVLQKGEILLQCLLENTPVLEDVLGRIAKQSGELESH
ADRLYDSILASLDMLAGCTLIPDKKPMAAMEHPCEGV

>gi568815596f:64969445_65182702|GENSCAN_predicted_CDS_3|2274_bp
atggccctgggtgaagaaaaggcagaagcggaagcatctgaagacacaaaggcccagtcc
tatgggagagggagctgcagggagcgggagctggacatcccagggcccatgagtggggag
cagcccccacgcctggaagctgagggagggctcatctcccctgtatggggggcagaaggg
atacctgcccctacttgctggattgggactgaccctggcggcccctctagagcccaccag
ccacaggccagtgatgccaacagagagcccgtagctgagaggtctgagcctgcactcagt
ggcctgcctcctgccaccatggggtctggagaccttctgctctccggggaaagccaggtg
gagaagaccaagctttcttcctccgaggagttccctcagactctgagccttcccagaaca
acaactatttgctcaggacatgatgctgataccgaagatgatccatccctagcagatttg
ccccaggcactggacctcagccagcagcctcacagctcaggtctctcttgcctgtcacag
tggaagtccgtgctgagcccaggttccgcagctcagccttccagctgcagcatctctgct
tcctccacaggcagcagtctccagggtcaccaggagagggcggagcctcgtggtggttct
ctggccaaggtctcctcctccctggagccggtcgtcccccaggaaccttcctctgtggtg
gggctaggacctcggccccagtggtcaccacagcctgtgttctctgggggtgatgcttct
gggctaggcaggagacgcctctccttccaggctgagtactgggcctgtgtgctgccagat
tccctgcctccatcacccgaccgccactcccctctctggaacccaaataaagagtatgaa
gatctgcttgactatacttacccactgaggcccgggcctcagctcccaaagcaccttgat
agccgtgtgccagctgaccctgtcctgcaggactccggggtagacctggatagcttctct
gtctctccagcaagcaccctcaaatcacctactaatgtctcccccaactgcccaccagca
gaggccactgccctgccattttctgggcccagagagccaagccttaagcagtggccctcc
agagtaccccagaaacagggtggcatgggcttggcatcttggagccaacttgcatctacc
cccagagccccaggcagtagggatgctcgttgggagcgcagagagccagccctgaggggt
gcgaaggaccggctgactataggcaagcaccttgatatgggctctccccagctaaggaca
cgggacagagggtggccctcgcccaggccagagagggagaagaggaccagccagagtgcc
cggcgccctacctgcacagagtctaggtggaaatcagaagaggaagtggaaagtgatgac
gagtatcttgccctccccgctcggctgacacaggtttctagcctggtttcgtatctagga
tccatttctaccttggttaccctgcccactggggatatcaaagggcagagccccttggaa
gtgtcagacagtgatgggccagcttccttcccttcaagctccagccaaagccagcttccc
cctggagctgccctccaaggatctggggatcctgagggccagaatccctgtttcctgcgc
tccttcgtccgtgcccacgactccgcaggggaaggcagtctggggagcagccaggccctc
ggggtctcctctggactgctgaaaacacgcccctccttgccagctaggttggaccggtgg
ccattctcagacccagatgttgaagggcagcttcccaggaaaggaggagaacagggaaaa
gaatcactggtgcaatgtgtgaagacattttgctgtcagctggaagagctgatctgctgg
ctgtataatgttgcagatgttactgaccacgggactgcagccaggtccaatcttacaagt
ctcaagtcttctctgcagctttaccggcaatttaagaaagatatagatgaacatcagtct
ctgacggagagtgtcttacagaagggggagattcttcttcagtgcctgttggagaacacc
ccagttttagaggatgtccttgggaggatcgcaaagcagtctggtgagctggagagccac
gcagatcgcctgtatgactctatcttggcctctctggacatgctggctggctgcaccctt
atccctgacaaaaagcccatggcggcaatggagcacccatgtgaaggggtttaa

>gi568815596f:64969445_65182702|GENSCAN_predicted_peptide_4|187_aa
MPGQYFEGYTLDDTYTESYISTIGVDFKIRTIELDGKTIKLQIWDTAGQERFRTITSSYY
RGAHGIIVVYDVTDQESFNNVKQWLQEIDRYASENVNKLLVGNKCDLTTKKVVDYTTAKE
FADSLGIPFLETSAKNATNVEQSFMTMAAEIKKRMGPGATAGGAEKSNVKIQSTPVKQSE
MLLAASY

>gi568815596f:64969445_65182702|GENSCAN_predicted_CDS_4|564_bp
atgcctggccaatattttgagggatatactttggatgatacatatacagaaagctacatc
agcacaattggtgtggatttcaaaataagaactatagagttagacgggaaaacaatcaag
cttcaaatatgggacacagcaggccaggaaagatttcgaacaatcacctccagttattac
agaggagcccatggcatcatagttgtgtatgatgtgacagatcaggagtccttcaataat
gttaaacagtggctgcaggaaatagatcgttatgccagtgaaaatgtcaacaaattgttg
gtagggaacaaatgtgatctgaccacaaagaaagtagtagactacacaacagcgaaggaa
tttgctgattcccttggaattccgtttttggaaaccagtgctaagaatgcaacgaatgta
gaacagtctttcatgacgatggcagctgagattaaaaagcgaatgggtcccggagcaaca
gctggtggtgctgagaagtccaatgttaaaattcagagcactccagtcaagcagtcagaa
atgctgctggcagcatcttactag

>gi568815596f:64969445_65182702|GENSCAN_predicted_peptide_5|103_aa
MEPHSLSASIQAADAPTSAPTLLDPFKIPQAGPRTQPTGALLNSLFGIHAGHVTAAAAAA
TAALAAAAAALTLRATEHNQQPPPLSYRFHPKWPPANQEIGKP

>gi568815596f:64969445_65182702|GENSCAN_predicted_CDS_5|312_bp
atggaacctcattccctatctgcaagcatccaagcagccgatgccccgacttctgcccca
accctcctcgacccctttaagatcccccaagctggcccacggacccagccgaccggtgct
ctcctgaactcactattcgggattcatgctggacatgtcactgcagctgccgccgccgcc
accgccgcccttgctgccgcagccgccgccctgactctccgcgccacggaacacaatcag
cagccgccgccactcagctatcgcttccacccaaaatggccgccggcgaaccaggaaata
gggaagccttag

>gi568815596f:64969445_65182702|GENSCAN_predicted_peptide_6|26_aa
MRPQTATVDEEDPNLEKTQIVDEIYA

>gi568815596f:64969445_65182702|GENSCAN_predicted_CDS_6|81_bp
atgaggccacagacagcaactgtggatgaagaggatccaaatctggagaagacgcagatt
gtggacgagatctatgcatga

>gi568815596f:64969445_65182702|GENSCAN_predicted_peptide_7|78_aa
XCGRLMSMGGRLSGGRRQLDVGLQVLFGMNSLGAMGTMDGRVMAAGGRQAPGQKKAVVCI
PRSSWMQDKNSGPTEWQD

>gi568815596f:64969445_65182702|GENSCAN_predicted_CDS_7|237_bp
nactgtgggcgcctaatgagcatgggagggaggctgagtgggggccgaaggcagcttgat
gtgggcctgcaggtgctctttggcatgaacagcctgggtgccatgggcaccatggatggc
agagtgatggcagcaggaggcagacaggctcctgggcagaaaaaggcagttgtctgcata
cctcgttcttcctggatgcaggacaagaactcaggacccactgaatggcaggactaa