GENSCAN 1.0	Date run: 5-Nov-116	Time: 11:13:22

Sequence gi568815596f:113790280_114057452 : 267173 bp : 40.69% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   4198   4619  422  0  2   55   89   170 0.798   9.71
 1.02 Intr +   5566   5687  122  1  2   73   33    71 0.564  -0.68
 1.03 Term +   5773   5933  161  2  2   70   41   138 0.349   4.42
 1.04 PlyA +   6961   6966    6                               1.05

 2.10 PlyA -   7677   7672    6                               1.05
 2.09 Term -  13816  13713  104  2  2   60   45    82 0.019  -1.44
 2.08 Intr -  26115  25918  198  1  0   69   68   144 0.261   8.90
 2.07 Intr -  32442  32387   56  2  2   93   88    49 0.497   3.10
 2.06 Intr -  35238  34971  268  1  1   65   81   114 0.297   4.07
 2.05 Intr -  41306  41251   56  1  2  127   98    23 0.934   4.90
 2.04 Intr -  48756  48710   47  0  2   70   92    35 0.326  -1.61
 2.03 Intr -  49914  49864   51  0  0   83   78    55 0.446   2.19
 2.02 Intr -  50331  50076  256  2  1   39   72   159 0.387   5.92
 2.01 Init -  65449  65259  191  1  2   55   92   113 0.445   6.93
 2.00 Prom -  70160  70121   40                              -5.65

 3.00 Prom +  72280  72319   40                              -2.95
 3.01 Init +  73823  73825    3  1  0  108   81     0 0.279   1.35
 3.02 Intr +  95196  95272   77  2  2   50   89    77 0.690   1.39
 3.03 Intr +  99294  99579  286  0  1   97   29   358 0.961  27.22
 3.04 Intr +  99826 100044  219  0  0  -39   74   275 0.985  10.98
 3.05 Intr + 100333 100499  167  0  2   51   82   106 0.600   4.14
 3.06 Intr + 122893 122948   56  1  2   83   95    20 0.160  -0.10
 3.07 Intr + 126605 126729  125  0  2   55   95   129 0.189   9.68
 3.08 Intr + 137066 137176  111  1  0   83  111    38 0.973   5.26
 3.09 Intr + 141022 141117   96  0  0   44  115    72 0.964   4.79
 3.10 Intr + 144000 144107  108  2  0   73   79    92 0.985   6.36
 3.11 Intr + 149680 149823  144  0  0   98   93   153 0.999  16.36
 3.12 Intr + 151907 152080  174  1  0   99   76   111 0.971  10.21
 3.13 Intr + 160485 160739  255  2  0   -5   17   217 0.280   2.02
 3.14 Intr + 161441 161566  126  1  0   92   81    95 0.916   9.16
 3.15 Intr + 165344 165427   84  1  0   57  113    51 0.807   3.60
 3.16 Term + 167081 167176   96  1  0   85   38    54 0.636  -2.91
 3.17 PlyA + 167954 167959    6                               1.05

 4.06 PlyA - 169026 169021    6                               1.05
 4.05 Term - 179668 179025  644  2  2   27   50   231 0.403   6.54
 4.04 Intr - 188836 188679  158  0  2   89   97   113 0.696  11.03
 4.03 Intr - 189436 189212  225  0  0  -24   27   276 0.811   6.48
 4.02 Intr - 189817 189438  380  1  2  -17  -12   372 0.054   9.54
 4.01 Init - 192709 192677   33  1  0  125  127     9 0.892   8.44
 4.00 Prom - 196706 196667   40                              -5.05

 5.03 PlyA - 197175 197170    6                               1.05
 5.02 Term - 204804 204682  123  0  0  152   48    67 0.778   6.50
 5.01 Init - 210010 209834  177  1  0   68   47   137 0.545   6.91
 5.00 Prom - 223181 223142   40                              -3.65

 6.02 PlyA - 223944 223939    6                               1.05
 6.01 Sngl - 227815 227222  594  1  0   88   49   553 0.985  47.34
 6.00 Prom - 249184 249145   40                              -2.75

 7.04 PlyA - 249906 249901    6                               1.05
 7.03 Term - 253217 253147   71  0  2   70   49    91 0.070   0.52
 7.02 Intr - 263965 263833  133  1  1   80   39    64 0.175  -0.00
 7.01 Intr - 266232 266035  198  0  0   89   49   114 0.444   6.23

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term - 191743 191576  168  1  0   89   55    62 0.824  -0.10
S.002 Term + 216740 216897  158  0  2   96   54   136 0.834   8.11

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596f:113790280_114057452|GENSCAN_predicted_peptide_1|234_aa
MKVHGRKKGRKEGRKEGRKEGRKEGRKKGRKEGKEGRKEGRKEGRKERKKKKERKKERKK
ERKEKKEKERKREKERKKEGRKEREREKERRKEGRKERKRKRERKKERERKKERGREGKT
VEMEVEERKTCTERKATLQLRVHLRLHYHWLVRDCECTAAQGTAINVSGVDATDLKCSSD
MGRCLQHLQLEAQTGETLLPQIRMADWPSDCDHADEQEGDSCGIPAKFKISVPM

>gi568815596f:113790280_114057452|GENSCAN_predicted_CDS_1|705_bp
atgaaagtccatggaagaaagaaaggaaggaaggaaggaaggaaggaaggaaggaaggaa
ggaaggaaggaaggaaggaagaaaggaaggaaggaagggaaagaaggaaggaaggaagga
aggaaggaaggaaggaaagaaagaaagaaaaagaaagaaagaaagaaagaaagaaagaaa
gaaagaaaagaaaagaaagagaaagaaagaaagagagagaaagaaagaaagaaggaagga
aggaaagaaagagagagagagaaagaaagaaggaaggaaggaaggaaagaaagaaagaga
aagagagaaagaaagaaagaaagagaaagaaagaaagaaagagggagggaggggaaaaca
gttgaaatggaggtcgaggagaggaagacatgtacagagagaaaagctacattgcaatta
cgggtccatttgaggctccattaccactggctagttagagattgtgaatgtacagctgca
cagggcacagcaattaatgtttcgggtgtggatgccacagacctgaaatgctcttctgac
atggggagatgtctgcagcacttgcagctggaagcccaaactggagagactctgctgcca
caaataagaatggcagactggccatctgactgtgaccatgcagatgagcaggagggagat
tcttgtgggattccagctaaattcaaaatttcagtgccaatgtga

>gi568815596f:113790280_114057452|GENSCAN_predicted_peptide_2|408_aa
MGSVQEEVFNKITVRGKVRESIQLARGILSTGHPRAASQAEGHCRCYSSLWQRKKHPLRG
QHPGSDTVYLEIASDSIGLGLSPQNCPFHRHPLQMWASGTSDQPASSWGSYTFSLRSVNL
LEQLTELRKKVYWFVIKGITKDTDGEMHRGRSISGEDLKTRSQKGGGTLDPLPTPDTVGS
VGKSEGVILACFYKNTVAIDRPDPDIKSVLLHGDSEGGVLEERQDRETEYSKGTQLSTLC
SFSLLFNETRHSSPEVFPHFPFLRRGTSFPALKNSQLKQWKVDPFKPKVRNQYKQQLSTV
HLLYRWPSDFRHPLNTVQLQQHRFELSSSTYIRVFVNKYSRPSLSSGSTSTTKRGRKIQC
SNVKPTIQRASLHTVSDIGIAAVKPYHAMVIFRLACEWKSGFKCGIWT

>gi568815596f:113790280_114057452|GENSCAN_predicted_CDS_2|1227_bp
atggggtcagttcaggaagaggtttttaataagataacagtcagagggaaggtgagggag
tccattcagttggcaagaggaattttgagcacaggacatccccgagctgcaagtcaggca
gagggacactgcagatgctactcaagtctgtggcagaggaagaagcatccgctcagagga
cagcaccctggttccgacactgtctacctggagatagcatcagattctatcggcttgggg
cttagtccccaaaactgcccattccacagacacccattgcagatgtgggcctctgggact
tccgaccaaccagcttcaagctggggttcctacaccttctctttgcgttcagttaatttg
ctagagcagctcacagaactcaggaaaaaagtttactggtttgttataaagggcattaca
aaggatacagacggagagatgcacaggggcaggagcatctctggagaggatcttaagacc
cgtagtcagaaaggtgggggaaccctggatcctctccctaccccggacacagtaggttca
gttggaaagagcgagggagtaattctagcctgtttttacaagaacacagtggctatagac
aggccagatcctgatataaaaagcgtattacttcatggagattcagaaggaggggttctt
gaggagaggcaggacagagaaacagaatacagcaaagggactcagctgagcacgctctgt
tccttctccctactcttcaacgagacacggcactcctccccggaagtattcccacacttt
cccttcctgaggagaggaacttctttccctgcattgaaaaactcacagttgaaacaatgg
aaagttgatccatttaagcccaaagtaaggaatcaatacaaacagcagctctccacggtc
catttgctttatcggtggccctcggactttcggcatcccctgaacacagtgcagcttcaa
caacaccggtttgaactatcctcgtccacttatatacgagtttttgtcaataaatatagt
cgaccctccttatcctcaggttccacatccacaaccaaaagaggacgaaaaatacagtgt
tctaatgtgaaacctacgatacagagggccagtcttcatacagtttctgacattggaatt
gcggcagtaaagccttatcacgctatggtgatcttcagactagcatgtgagtggaaatct
ggctttaagtgtggaatctggacataa

>gi568815596f:113790280_114057452|GENSCAN_predicted_peptide_3|708_aa
MASYQIQALQQPYRLNIPVNFIDEETEKAATQVLRMDVCRGRFRLGRFLSGTGGYSWLQE
ALPPFPQPSKEVTETNPRVTTEKPLPASEAELEARIDFSASTLLENPQWISPTSGSEPRQ
ANPPTEPAAFLLLLRLPGYPPDGEGGPAVDGQIALVSRRQSLAPSSTEQTAAAAAAGEEE
DGGTAAGLCGGLWHGRRPKGAGDGRLGGGPRARPIRLSFPSAPVFASRFGDPGAELGDWP
GRGARKEKRSWYTKLGYAGNTEPQFIIPSCIAIKESAKVGDQAQRRVMKGVDDLDFFIGD
EAIEKPTYATKWPIRHGIVEDWDLMERFMEQVIFKYLRAEPEDHYFLLTEPPLNTPENRE
YTAEIMFESFNVPGLYIAVQAVLALAASWTSRQVGERTLTGTVIDSGDGVTHVIPVAEGY
VIGSCIKHIPIAGRDITYFIQQLLRDREVGIPPEQSLETAKAVKERYSYVCPDLVKEFNK
YDTDGSKWIKQYTGINAISKKEFSIDVGYERFLGPEIFFHPELFVLVRFHTANKDIPETG
QFIKERGLIGLMVPRGWGSLTIMAEGKEEQVLSYMMAAGKERMRKTQKQKSLIKPSDLVQ
LIHSMRTNIVLSGGSTMFRDFGRRLQRDLKRTVDARLKLSEELSGGRLKPKPIDVQVITH
HMQRYAVWFGGSMLASTPEFYQVCHTKKDYEEIGPSICRHNPVFGVMS

>gi568815596f:113790280_114057452|GENSCAN_predicted_CDS_3|2127_bp
atggcttcttaccaaattcaagctttacaacaaccctataggttaaatattcctgtgaat
tttatagatgaagaaacagaaaaagcagccactcaagttttaagaatggatgtatgccgc
ggacgttttcgattaggccgttttctctcaggcactggaggatattcgtggctgcaggag
gcgcttccaccctttcctcaacctagcaaagaagtaactgaaactaacccaagggttaca
accgaaaagccccttccagcttcagaagcagaactggaagctcggatagacttctccgcc
tctacactcctggaaaacccgcagtggatttcaccaacttcaggatcggagcccaggcag
gcgaacccgccgaccgagcctgctgctttcttgctactgcttcggcttcccggctacccc
ccggacggtgaaggcggcccagctgtggatggtcagatagcccttgtctcccgccgccaa
tctctggcccctagcagcacggagcagacggcggcagcagcagcagcaggcgaggaggaa
gatggcgggacggctgccggcctgtgtggtggactgtggcacgggcgccgcccaaagggc
gctggggacggtcgtcttgggggtggtccccgggcccgacccatccggctttcctttccc
tccgcgcccgtttttgccagtcggtttggggacccaggggccgagctcggggactggcct
ggcaggggagctagaaaagagaagcgctcctggtatacaaaactaggatatgctggaaat
acagaaccacagtttatcatcccttcctgtattgctattaaggagtcagcaaaagtgggt
gatcaagctcaaaggagggtgatgaaaggtgttgatgacctagacttcttcattggtgat
gaagcaatagaaaaacctacatatgcaacaaagtggccaatccgccatggtatagttgaa
gattgggacttaatggaaaggtttatggagcaagtgatctttaaatatttaagggcagaa
cctgaagaccattattttcttttgactgaacctccattgaatactccagaaaacagggaa
tatactgctgaaataatgtttgagtccttcaatgttccaggcttgtacattgctgtgcag
gctgttcttgccttagctgcatcttggacctcaagacaagtaggagaacggacgttgacc
ggtacggtaatagacagtggagatggtgtcactcatgtcattcctgtggctgaagggtat
gtgattggcagctgtattaaacacattccaatcgcaggacgagatataacatattttatt
cagcaactgctgagagaccgagaagtaggaatccctccagaacaatccttggaaactgct
aaggcagtaaaggagcgctatagttatgtctgcccagatttagtaaaagaatttaacaag
tatgatacagatgggtcaaaatggattaaacagtatactggaatcaatgctatctcaaag
aaagagttttctatcgatgttggttatgagagatttttgggacctgaaatcttttttcat
ccagagctatttgtattagtccgttttcatactgctaataaagacatacctgagactggg
caatttataaaagaaagaggtttaattggacttatggttccacgtggctggggaagcctc
acaatcatggcagaaggcaaggaggagcaagtcctgtcttacatgatggcagcaggcaaa
gagagaatgaggaagacgcaaaagcagaaatccctgataaaaccatcagaccttgtgcaa
cttattcactccatgagaacaaatattgtcctctctggaggttcaaccatgttcagggac
tttggacgtcgcttgcaaagagatttgaaaagaactgtagatgcccggctgaaattaagt
gaggaattgagtggtggtagattgaagccaaaacctattgatgtacaagtcattacacac
cacatgcagcgatatgcagtttggtttggaggatcaatgctggcttccacgcctgagttc
taccaagtatgccacaccaaaaaggattatgaagaaattggacctagcatttgtcgtcac
aatccagtgtttggagtcatgtcgtaa

>gi568815596f:113790280_114057452|GENSCAN_predicted_peptide_4|479_aa
MAVSGCQTSYTLLGSEGLALYRRVSPPHVGPMLSPRRVFEKHGMYMAGQGALRIHRDAPS
AATVTASGHQGAARARCGYTGTRPFSRDSNGLRPPGVGQGALRIHRDAPSAATVTASGHQ
GAARARCGYTGTPPQPRHNGLRPPEGGQGALRIHRDAPSAATVTASGHQGAAHGASNCKR
LRASESGRGAESGSQKPVGARMPALSRDRGQRGKPFQCSDSRKQVCVAGSRNFKHQQAEE
STRVEGSPQGVEIKPPREPADPTLSDPFPREGLGQAGLGRRCRKRPRQEENHPGDLETLF
SISKLHQQTHKIVMYKTVFPNWGNKDSCRGRERYHLTIIFFSRMVLSPSHGPREKFSLSP
NSHPPKKIFPGVWKCLRFTHQLEVWPASELRTEAVMRWGFAGSWGCHSRRELCVLQLWLP
LPGGSSRSEEPRMAPLSSGPNLPPSSLNKSTALGIRLAATSHHREDSKLPNLSPLMEQE

>gi568815596f:113790280_114057452|GENSCAN_predicted_CDS_4|1440_bp
atggcggtctcagggtgtcagacttcttacacgctccttggcagcgagggcctggctctt
tatcgccgtgtgtccccgccccacgtgggtcccatgcttagcccacggcgggtgtttgaa
aagcacggaatgtatatggcgggacagggcgcgctgcggatacaccgggacgccccctca
gccgcgacagtaacggcctccggccaccagggggcagccagggcgcgctgcggatacacc
gggacacgccccttcagccgcgacagtaacggcctccggccaccaggggtcggccagggc
gcgctgcggatacaccgggacgccccctcagccgcgacagtaacggcctccggccaccag
ggggcggccagggcgcgctgcggatacaccgggacgccccctcagccgcgacataacggc
ctccggccaccagagggcggccagggcgcgctgcggatacaccgggacgccccctcagcc
gcgacagtaacggcctccggccaccagggggcggcacacggcgcctcaaactgtaaacgg
ctccgggcttcggaatccgggcggggagcggagagcggttcgcaaaagccagttggtgcc
cggatgccggcattgtcacgggatcgcggccaaagggggaagcctttccagtgttctgat
tccagaaagcaggtgtgtgtggccggttcaaggaatttcaagcatcagcaggcagaggaa
agtaccagagtggaaggcagcccacagggtgtcgaaatcaagccacccagggagcccgca
gatcccaccctctccgacccattccccagagaaggactggggcaggcaggacttggcagg
aggtgtagaaagagacccagacaggaagaaaaccatcctggagacctggagacattgttt
agcatcagcaaactccatcagcagacacataagatagtgatgtataaaacagtgtttccc
aactggggcaataaagacagctgcagagggagagagcgttaccacctcacaataattttc
ttctcaagaatggttctctctccttctcatggtccaagggagaaattcagtttgtccccc
aactcccacccacccaagaaaatatttcctggtgtttggaaatgtctgcgttttacccac
cagttggaggtctggccagcctcagagctgagaaccgaggctgtgatgagatgggggttt
gcaggctcgtggggctgccacagcagaagggagctgtgtgtgcttcagctgtggctgcct
ctcccaggaggttcatcacggtcagaggaacccaggatggctccgttgtcatcagggccc
aatcttccaccatctagccttaataaatctacagcattaggaatcagattggcagcaaca
agtcatcacagagaggacagcaaactccccaacctctcacccttaatggagcaggagtag

>gi568815596f:113790280_114057452|GENSCAN_predicted_peptide_5|99_aa
MESKDTCGGFIARSAWSQTQHQPVRASGAALWGAPGAFAFEDVCDLFPLKNRILPGDSES
WDTHLLLPLDIKTPGSLAFRLCDFHQCPPTQFLRPSALN

>gi568815596f:113790280_114057452|GENSCAN_predicted_CDS_5|300_bp
atggagagtaaggacacctgtggtggatttatagcaagaagtgcttggagccaaacacag
catcagccagtcagagctagtggggcagctctctggggtgcacctggagcgttcgccttt
gaggacgtctgtgacttatttccactaaagaacaggattcttccaggtgacagtgagagc
tgggacacccatcttctcctgcccttggatataaaaactccaggctcgctggccttcagg
ctctgtgactttcatcagtgccccccaacccagttcctcaggccttctgctttgaactaa

>gi568815596f:113790280_114057452|GENSCAN_predicted_peptide_6|197_aa
MGTKQSRKTGNSKKQSTSPPPKEHSSSPAMEQSWTENDFDELREEGFRRSNYSELQEEIQ
TKSKEVKNFEKNLDECITRITNTEKCLKELMELKAKARELREECRSLRSRCDQLEERVSV
MEDEMNEMKREGKFREKRIKRNEQSLQEIWDYVKRPNLRLIGVSESDEKNGTKLENTLQD
IIQENFPSLARQANIQI

>gi568815596f:113790280_114057452|GENSCAN_predicted_CDS_6|594_bp
atggggacaaaacagagcagaaaaactggaaactctaaaaagcagagcacctctcctcct
ccaaaggaacacagttcctcaccagcaatggaacaaagctggacagagaatgactttgat
gagttgagagaagaaggcttcagacgatcaaactactccgagctacaggaggaaattcaa
accaaaagcaaagaagttaaaaactttgaaaaaaatttagatgaatgtataactagaata
accaatacagagaagtgcttaaaggagctgatggagctgaaagccaaggctcgagaacta
cgtgaagaatgcagaagcctcaggagccgatgcgatcaactggaagaaagggtatcagtg
atggaagatgaaatgaatgaaatgaagcgagaagggaagtttagagaaaaaagaataaaa
agaaatgaacaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacgtctg
attggtgtatctgaaagtgacgagaagaatggaaccaagttggaaaacactctgcaggac
attatccaggagaacttccccagtctagcaaggcaggccaacattcagatttag

>gi568815596f:113790280_114057452|GENSCAN_predicted_peptide_7|133_aa
TIKVWYDKDCPKTSLEFTWGFPDKEPVYTEGSSLLTVGDMHSSSASHNNSAPGRRGIMLP
WAGGHQAWKPAMEKRQWLTLPFFSSLLHLLFYSPAAGIVEIWRLCRSATAGITGITELKQ
YKTSCSRRNEDIA

>gi568815596f:113790280_114057452|GENSCAN_predicted_CDS_7|402_bp
accatcaaagtgtggtatgacaaggactgtccgaaaacgtccttggagttcacatggggt
ttcccagacaaggagccagtatacactgagggctcctcccttctcacagtgggggacatg
cactcaagttcagcttcccacaacaatagtgcccctggcagaaggggcatcatgctccca
tgggccggaggtcaccaggcttggaagcctgccatggagaagaggcagtggttaactcta
cctttcttctcttctctgctccacctcctcttctactcaccagctgctggcatagttgag
atctggagattgtgccgttcggccactgcaggaataactggaatcactgagctcaagcag
tacaagacatcctgttctaggaggaatgaggatatagcctga