GENSCAN 1.0 Date run: 4-Nov-116 Time: 11:43:45 Sequence gi568815596r:64988496_65204806 : 216311 bp : 44.45% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1149 1675 527 2 2 104 92 758 0.824 72.13 1.02 Intr + 12953 12995 43 2 1 91 103 16 0.753 1.64 1.03 Intr + 15458 15520 63 1 0 56 98 67 0.868 3.41 1.04 Intr + 22102 22268 167 0 2 101 89 158 0.985 15.96 1.05 Intr + 27945 28178 234 0 0 108 109 122 0.949 13.10 1.06 Intr + 29576 29770 195 2 0 74 84 286 0.996 25.33 1.07 Intr + 30050 30184 135 2 0 71 75 154 0.999 12.08 1.08 Term + 32417 32651 235 2 1 95 45 350 0.556 27.19 1.09 PlyA + 39013 39018 6 1.05 2.00 Prom + 41668 41707 40 -4.46 2.01 Init + 42287 42386 100 1 1 42 62 65 0.438 -0.28 2.02 Intr + 42451 42764 314 2 2 53 111 157 0.485 10.40 2.03 Term + 55051 55233 183 0 0 74 49 126 0.535 4.84 2.04 PlyA + 55548 55553 6 1.05 3.00 Prom + 65287 65326 40 -2.66 3.01 Init + 80950 81306 357 0 0 64 96 141 0.849 9.61 3.02 Intr + 82959 84485 1527 2 0 90 90 159 0.639 6.05 3.03 Intr + 85787 85909 123 1 0 99 57 67 0.688 5.48 3.04 Intr + 89373 89469 97 2 1 26 94 104 0.876 4.38 3.05 Term + 94041 94210 170 1 2 22 37 140 0.434 0.34 3.06 PlyA + 94349 94354 6 -0.45 4.07 PlyA - 94536 94531 6 1.05 4.06 Term - 97375 97350 26 2 2 87 44 39 0.377 -2.21 4.05 Intr - 100195 100015 181 1 1 88 49 146 0.816 10.14 4.04 Intr - 100575 100444 132 0 0 56 95 97 0.941 8.04 4.03 Intr - 102583 102488 96 1 0 29 111 59 0.850 2.51 4.02 Intr - 109571 109476 96 2 0 103 92 50 0.950 7.11 4.01 Init - 122594 122562 33 2 0 110 73 -13 0.341 -0.68 4.00 Prom - 125401 125362 40 -4.86 5.00 Prom + 131794 131833 40 -2.46 5.01 Init + 140501 140537 37 1 1 69 89 -6 0.551 -2.96 5.02 Intr + 141302 141493 192 0 0 102 56 134 0.632 11.06 5.03 Term + 141564 141646 83 1 2 82 41 67 0.356 -0.74 5.04 PlyA + 142690 142695 6 1.05 6.00 Prom + 146604 146643 40 -7.56 6.01 Init + 163347 163403 57 2 0 96 94 112 0.994 13.91 6.02 Term + 167949 167972 24 2 0 110 37 67 0.975 2.02 6.03 PlyA + 168656 168661 6 1.05 7.04 PlyA - 168688 168683 6 1.05 7.03 Term - 188290 188223 68 2 2 79 42 59 0.363 -1.50 7.02 Intr - 189230 189063 168 0 0 75 81 69 0.269 4.82 7.01 Intr - 211543 211460 84 2 0 66 94 32 0.021 1.39 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:64988496_65204806|GENSCAN_predicted_peptide_1|532_aa MEKSNETNGYLDSAQAGPAAGPGAPGTAAGRARRCAGFLRRQALVLLTVSGVLAGAGLGA ALRGLSLSRTQVTYLAFPGEMLLRMLRMIILPLVVCSLVSGAASLDASCLGRLGGIAVAY FGLTTLSASALAVALAFIIKPGSGAQTLQSSDLGLEDSGPPPVPKETVDSFLDLARNLFP SNLVVAAFRTYATDYKVVTQNSSSGNVTHEKIPIGTEIEGMNILGLVLFALVLGVALKKL GSEGEDLIRFFNSLNEATMVLVSWIMWYVPVGIMFLVGSKIVEMKDIIVLVTSLGKYIFA SILGHVIHGGIVLPLIYFVFTRKNPFRFLLGLLAPFATAFATCSSSATLPSMMKCIEENN GVDKRISRFILPIGATVNMDGAAIFQCVAAVFIAQLNNVELNAGQIFTILVTATASSVGA AGVPAGGVLTIAIILEAIGLPTHDLPLILAVDWIVDRTTTVVNVEGDALGAGILHHLNQK ATKKGEQELAEVKVEAIPNCKSEEETSPLVTHQNPAGPVASAPELESKESVL >gi568815596r:64988496_65204806|GENSCAN_predicted_CDS_1|1599_bp atggagaagagcaacgagaccaacggctaccttgacagcgctcaggcggggcctgcggcc gggcccggagctccggggaccgcggcgggacgcgcacggcgttgcgcgggcttcctgcgg cgccaagcgctggtgctgctcaccgtgtccggggtgctggcgggcgcgggcctgggcgcg gcgttgcgcgggctcagcctgagccgcacgcaggtcacctacctggccttccccggcgag atgctgctccgcatgctgcgcatgatcatcctgccgctggtggtctgcagcctggtgtcg ggcgccgcctcgctcgatgccagctgcctcgggcgtctgggcggcatcgctgtcgcctac tttggcctcaccacactgagtgcctcggcgctcgccgtggccttggcgttcatcatcaag ccaggatccggtgcgcagacccttcagtccagcgacctggggctggaggactcggggcct cctcctgtccccaaagagacggtggactctttcctcgacctggccagaaacctgtttccc tccaatcttgtggttgcagctttccgtacgtatgcaaccgattataaagtcgtgacccag aacagcagctctggaaatgtaacccatgaaaagatccccataggcactgagatagaaggg atgaacattttaggattggtcctgtttgctctggtgttaggagtggccttaaagaaacta ggctccgaaggagaagacctcatccgtttcttcaattccctcaacgaggcgacgatggtg ctggtgtcctggattatgtggtacgtacctgtgggcatcatgttccttgttggaagcaag atcgtggaaatgaaagacatcatcgtgctggtgaccagcctggggaaatacatcttcgca tctatattgggccatgttattcatggaggaattgttctgccacttatttattttgttttc acacgaaaaaacccattcagattcctcctgggcctcctcgccccatttgcgacagcattt gctacctgctccagctcagcgacccttccctctatgatgaagtgcattgaagagaacaat ggtgtggacaagaggatcagcaggtttattctccccatcggggccaccgtgaacatggac ggagcagccatcttccagtgtgtggccgcggtgttcattgcgcaactcaacaacgtagag ctcaacgcaggacagattttcaccattctagtgactgccacagcgtccagtgttggagca gcaggcgtgccagctggaggggtcctcaccattgccattatcctggaggccattgggctg cctactcatgacctgcctctgatcctggctgtggactggattgtggaccggaccaccacg gtggtgaatgtggaaggggatgccctgggtgcaggcattctccaccacctgaatcagaag gcaacaaagaaaggcgagcaggaacttgctgaggtgaaagtggaagccatccccaactgc aagtctgaggaggagacatcgcccctggtgacacaccagaaccccgctggccccgtggcc agtgccccagaactggaatccaaggagtcggttctgtga >gi568815596r:64988496_65204806|GENSCAN_predicted_peptide_2|198_aa MQTFTTCISYSEYSCMLLANASSHGTLYCKLRVGICLLMVPAVKNQASGSARGATKVRRK CQAGCQNEHLGELDDGTDGKNQLNIRENGGRGQNCEQELEESVAEKDLSQTSRDLEKMMS KHIFLKPMLSISDLVNFLMQVSKVLVKTAEGIVLQQLPLAFPALHFHAYGNLFPVCSFKH YIYMIDHPIFISIPDFLT >gi568815596r:64988496_65204806|GENSCAN_predicted_CDS_2|597_bp atgcagacttttacaacatgcatcagctacagtgagtattcttgcatgcttttggccaat gcaagctctcatggcacgctgtactgtaagctccgggtcggcatttgtttgttgatggta ccagctgtcaaaaaccaagcaagtggcagtgctagaggtgccacaaaagtgcgaaggaag tgtcaggctggatgtcaaaatgaacaccttggagaactggatgatggaacagacggtaaa aatcagctaaacatcagagaaaatggaggaagaggtcaaaactgtgaacaggaactagaa gaaagtgtagcagaaaaagacttgtcacaaacttcgagagatttggagaaaatgatgtca aaacacatcttcctcaagcccatgctgagtatctctgatttggttaatttcttgatgcaa gtttccaaagttctggtgaaaacggctgaaggaatagtgttacagcaactgcccttggct tttccagctttacacttccatgcctatggaaatctcttccctgtctgtagctttaagcat tacatctatatgatcgaccatcccatcttcatctccatccctgacttcctaacttga >gi568815596r:64988496_65204806|GENSCAN_predicted_peptide_3|757_aa MALGEEKAEAEASEDTKAQSYGRGSCRERELDIPGPMSGEQPPRLEAEGGLISPVWGAEG IPAPTCWIGTDPGGPSRAHQPQASDANREPVAERSEPALSGLPPATMGSGDLLLSGESQV EKTKLSSSEEFPQTLSLPRTTTICSGHDADTEDDPSLADLPQALDLSQQPHSSGLSCLSQ WKSVLSPGSAAQPSSCSISASSTGSSLQGHQERAEPRGGSLAKVSSSLEPVVPQEPSSVV GLGPRPQWSPQPVFSGGDASGLGRRRLSFQAEYWACVLPDSLPPSPDRHSPLWNPNKEYE DLLDYTYPLRPGPQLPKHLDSRVPADPVLQDSGVDLDSFSVSPASTLKSPTNVSPNCPPA EATALPFSGPREPSLKQWPSRVPQKQGGMGLASWSQLASTPRAPGSRDARWERREPALRG AKDRLTIGKHLDMGSPQLRTRDRGWPSPRPEREKRTSQSARRPTCTESRWKSEEEVESDD EYLALPARLTQVSSLVSYLGSISTLVTLPTGDIKGQSPLEVSDSDGPASFPSSSSQSQLP PGAALQGSGDPEGQNPCFLRSFVRAHDSAGEGSLGSSQALGVSSGLLKTRPSLPARLDRW PFSDPDVEGQLPRKGGEQGKESLVQCVKTFCCQLEELICWLYNVADVTDHGTAARSNLTS LKSSLQLYRQFKKDIDEHQSLTESVLQKGEILLQCLLENTPVLEDVLGRIAKQSGELESH ADRLYDSILASLDMLAGCTLIPDKKPMAAMEHPCEGV >gi568815596r:64988496_65204806|GENSCAN_predicted_CDS_3|2274_bp atggccctgggtgaagaaaaggcagaagcggaagcatctgaagacacaaaggcccagtcc tatgggagagggagctgcagggagcgggagctggacatcccagggcccatgagtggggag cagcccccacgcctggaagctgagggagggctcatctcccctgtatggggggcagaaggg atacctgcccctacttgctggattgggactgaccctggcggcccctctagagcccaccag ccacaggccagtgatgccaacagagagcccgtagctgagaggtctgagcctgcactcagt ggcctgcctcctgccaccatggggtctggagaccttctgctctccggggaaagccaggtg gagaagaccaagctttcttcctccgaggagttccctcagactctgagccttcccagaaca acaactatttgctcaggacatgatgctgataccgaagatgatccatccctagcagatttg ccccaggcactggacctcagccagcagcctcacagctcaggtctctcttgcctgtcacag tggaagtccgtgctgagcccaggttccgcagctcagccttccagctgcagcatctctgct tcctccacaggcagcagtctccagggtcaccaggagagggcggagcctcgtggtggttct ctggccaaggtctcctcctccctggagccggtcgtcccccaggaaccttcctctgtggtg gggctaggacctcggccccagtggtcaccacagcctgtgttctctgggggtgatgcttct gggctaggcaggagacgcctctccttccaggctgagtactgggcctgtgtgctgccagat tccctgcctccatcacccgaccgccactcccctctctggaacccaaataaagagtatgaa gatctgcttgactatacttacccactgaggcccgggcctcagctcccaaagcaccttgat agccgtgtgccagctgaccctgtcctgcaggactccggggtagacctggatagcttctct gtctctccagcaagcaccctcaaatcacctactaatgtctcccccaactgcccaccagca gaggccactgccctgccattttctgggcccagagagccaagccttaagcagtggccctcc agagtaccccagaaacagggtggcatgggcttggcatcttggagccaacttgcatctacc cccagagccccaggcagtagggatgctcgttgggagcgcagagagccagccctgaggggt gcgaaggaccggctgactataggcaagcaccttgatatgggctctccccagctaaggaca cgggacagagggtggccctcgcccaggccagagagggagaagaggaccagccagagtgcc cggcgccctacctgcacagagtctaggtggaaatcagaagaggaagtggaaagtgatgac gagtatcttgccctccccgctcggctgacacaggtttctagcctggtttcgtatctagga tccatttctaccttggttaccctgcccactggggatatcaaagggcagagccccttggaa gtgtcagacagtgatgggccagcttccttcccttcaagctccagccaaagccagcttccc cctggagctgccctccaaggatctggggatcctgagggccagaatccctgtttcctgcgc tccttcgtccgtgcccacgactccgcaggggaaggcagtctggggagcagccaggccctc ggggtctcctctggactgctgaaaacacgcccctccttgccagctaggttggaccggtgg ccattctcagacccagatgttgaagggcagcttcccaggaaaggaggagaacagggaaaa gaatcactggtgcaatgtgtgaagacattttgctgtcagctggaagagctgatctgctgg ctgtataatgttgcagatgttactgaccacgggactgcagccaggtccaatcttacaagt ctcaagtcttctctgcagctttaccggcaatttaagaaagatatagatgaacatcagtct ctgacggagagtgtcttacagaagggggagattcttcttcagtgcctgttggagaacacc ccagttttagaggatgtccttgggaggatcgcaaagcagtctggtgagctggagagccac gcagatcgcctgtatgactctatcttggcctctctggacatgctggctggctgcaccctt atccctgacaaaaagcccatggcggcaatggagcacccatgtgaaggggtttaa >gi568815596r:64988496_65204806|GENSCAN_predicted_peptide_4|187_aa MPGQYFEGYTLDDTYTESYISTIGVDFKIRTIELDGKTIKLQIWDTAGQERFRTITSSYY RGAHGIIVVYDVTDQESFNNVKQWLQEIDRYASENVNKLLVGNKCDLTTKKVVDYTTAKE FADSLGIPFLETSAKNATNVEQSFMTMAAEIKKRMGPGATAGGAEKSNVKIQSTPVKQSE MLLAASY >gi568815596r:64988496_65204806|GENSCAN_predicted_CDS_4|564_bp atgcctggccaatattttgagggatatactttggatgatacatatacagaaagctacatc agcacaattggtgtggatttcaaaataagaactatagagttagacgggaaaacaatcaag cttcaaatatgggacacagcaggccaggaaagatttcgaacaatcacctccagttattac agaggagcccatggcatcatagttgtgtatgatgtgacagatcaggagtccttcaataat gttaaacagtggctgcaggaaatagatcgttatgccagtgaaaatgtcaacaaattgttg gtagggaacaaatgtgatctgaccacaaagaaagtagtagactacacaacagcgaaggaa tttgctgattcccttggaattccgtttttggaaaccagtgctaagaatgcaacgaatgta gaacagtctttcatgacgatggcagctgagattaaaaagcgaatgggtcccggagcaaca gctggtggtgctgagaagtccaatgttaaaattcagagcactccagtcaagcagtcagaa atgctgctggcagcatcttactag >gi568815596r:64988496_65204806|GENSCAN_predicted_peptide_5|103_aa MEPHSLSASIQAADAPTSAPTLLDPFKIPQAGPRTQPTGALLNSLFGIHAGHVTAAAAAA TAALAAAAAALTLRATEHNQQPPPLSYRFHPKWPPANQEIGKP >gi568815596r:64988496_65204806|GENSCAN_predicted_CDS_5|312_bp atggaacctcattccctatctgcaagcatccaagcagccgatgccccgacttctgcccca accctcctcgacccctttaagatcccccaagctggcccacggacccagccgaccggtgct ctcctgaactcactattcgggattcatgctggacatgtcactgcagctgccgccgccgcc accgccgcccttgctgccgcagccgccgccctgactctccgcgccacggaacacaatcag cagccgccgccactcagctatcgcttccacccaaaatggccgccggcgaaccaggaaata gggaagccttag >gi568815596r:64988496_65204806|GENSCAN_predicted_peptide_6|26_aa MRPQTATVDEEDPNLEKTQIVDEIYA >gi568815596r:64988496_65204806|GENSCAN_predicted_CDS_6|81_bp atgaggccacagacagcaactgtggatgaagaggatccaaatctggagaagacgcagatt gtggacgagatctatgcatga >gi568815596r:64988496_65204806|GENSCAN_predicted_peptide_7|106_aa XPKALQSAAGKSSQACPFLQGSKLPPTPDCGRLMSMGGRLSGGRRQLDVGLQVLFGMNSL GAMGTMDGRVMAAGGRQAPGQKKAVVCIPRSSWMQDKNSGPTEWQD >gi568815596r:64988496_65204806|GENSCAN_predicted_CDS_7|321_bp ngaccaaaggctctacaatcagcagctggcaaatccagccaggcttgtcctttccttcag ggcagcaagctccccccaaccccagactgtgggcgcctaatgagcatgggagggaggctg agtgggggccgaaggcagcttgatgtgggcctgcaggtgctctttggcatgaacagcctg ggtgccatgggcaccatggatggcagagtgatggcagcaggaggcagacaggctcctggg cagaaaaaggcagttgtctgcatacctcgttcttcctggatgcaggacaagaactcagga cccactgaatggcaggactaa