GENSCAN 1.0 Date run: 4-Nov-116 Time: 01:45:32 Sequence gi568815586r:12376836_12621098 : 244263 bp : 41.96% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 5399 5435 37 1 1 87 95 62 0.741 7.02 1.02 Term + 21921 22048 128 1 2 113 48 33 0.079 -0.74 1.03 PlyA + 22655 22660 6 1.05 2.04 PlyA - 23025 23020 6 1.05 2.03 Term - 26670 26506 165 0 0 87 36 122 0.285 3.83 2.02 Intr - 30360 30130 231 0 0 103 7 126 0.162 3.05 2.01 Init - 36889 36842 48 1 0 79 78 56 0.016 2.72 2.00 Prom - 42694 42655 40 -3.65 3.02 PlyA - 42978 42973 6 -0.45 3.01 Sngl - 43287 42997 291 0 0 71 43 179 0.887 7.10 3.00 Prom - 43341 43302 40 -5.65 4.02 PlyA - 43458 43453 6 1.05 4.01 Sngl - 44652 43999 654 0 0 43 48 333 0.991 20.72 4.00 Prom - 46473 46434 40 -3.45 5.04 PlyA - 46506 46501 6 1.05 5.03 Term - 53042 52877 166 1 1 58 54 162 0.017 6.21 5.02 Intr - 66767 66599 169 0 1 72 2 126 0.067 0.48 5.01 Init - 71089 70999 91 1 1 51 106 74 0.239 6.20 5.00 Prom - 75162 75123 40 -4.75 6.03 PlyA - 75272 75267 6 1.05 6.02 Term - 75675 75452 224 1 2 33 42 251 0.702 11.10 6.01 Init - 77535 77460 76 0 1 62 107 43 0.259 4.90 6.00 Prom - 78041 78002 40 -6.85 7.00 Prom + 78300 78339 40 -6.55 7.01 Init + 80037 80039 3 2 0 61 115 0 0.333 0.05 7.02 Intr + 84737 84936 200 1 2 75 46 154 0.351 7.13 7.03 Intr + 88430 88680 251 2 2 66 70 201 0.549 12.26 7.04 Term + 88727 88941 215 0 2 44 37 388 0.861 25.71 7.05 PlyA + 89281 89286 6 1.05 8.03 PlyA - 90068 90063 6 -1.75 8.02 Term - 91296 90982 315 0 0 66 42 246 0.846 11.76 8.01 Init - 95023 94820 204 1 0 64 26 303 0.348 18.68 8.00 Prom - 95236 95197 40 -7.85 9.07 PlyA - 96151 96146 6 1.05 9.06 Term - 101180 99998 1183 1 1 120 54 1148 0.998 104.84 9.05 Intr - 103511 103388 124 0 1 40 94 71 0.812 1.62 9.04 Intr - 110352 110193 160 0 1 111 111 119 0.984 15.14 9.03 Intr - 123847 123684 164 2 2 82 98 110 0.920 10.17 9.02 Intr - 143165 143027 139 2 1 85 110 33 0.081 4.32 9.01 Init - 144263 144036 228 2 0 79 92 199 0.968 17.82 9.00 Prom - 145259 145220 40 -10.05 10.00 Prom + 146280 146319 40 -7.25 10.01 Init + 147015 147189 175 2 1 70 28 221 0.629 13.87 10.02 Term + 151443 151585 143 1 2 91 54 148 0.899 8.81 10.03 PlyA + 151821 151826 6 1.05 11.03 PlyA - 152451 152446 6 1.05 11.02 Term - 161424 161401 24 0 0 100 42 26 0.476 -3.55 11.01 Init - 166483 166346 138 1 0 74 95 115 0.915 10.99 11.00 Prom - 169692 169653 40 -8.15 12.00 Prom + 171394 171433 40 -4.45 12.01 Init + 173644 173650 7 0 1 59 92 0 0.386 -1.31 12.02 Term + 184670 184830 161 0 2 105 39 159 0.629 9.82 12.03 PlyA + 185811 185816 6 1.05 13.09 PlyA - 186662 186657 6 1.05 13.08 Term - 191580 191366 215 1 2 37 42 258 0.898 12.51 13.07 Intr - 205640 205565 76 2 1 51 75 33 0.298 -3.53 13.06 Intr - 205728 205675 54 0 0 66 63 87 0.408 2.26 13.05 Intr - 219808 219648 161 2 2 126 39 79 0.266 5.59 13.04 Intr - 223415 223297 119 1 2 100 22 81 0.166 1.89 13.03 Intr - 223740 223643 98 0 2 125 40 53 0.110 2.09 13.02 Intr - 235206 235036 171 0 0 85 47 126 0.006 7.42 13.01 Intr - 240012 239915 98 1 2 76 34 112 0.272 3.41 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 36889 36707 183 1 0 79 43 173 0.838 4.70 S.002 Term - 143165 142971 195 2 0 85 47 116 0.825 3.63 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:12376836_12621098|GENSCAN_predicted_peptide_1|54_aa MEEGEGGAGMSHDICAILGIQAAQSIADFTFSYKAENVKISQLFIEISWSYLAT >gi568815586r:12376836_12621098|GENSCAN_predicted_CDS_1|165_bp atggaggaaggtgaaggaggagcaggcatgtcacatgatatctgtgcgatattgggaatt caggcagcccaatcgatagctgattttaccttttcctacaaggcagaaaatgtcaagatt agccaactatttattgaaattagttggagttatctggctacatga >gi568815586r:12376836_12621098|GENSCAN_predicted_peptide_2|147_aa MVGGSAPRPASRPIREDQDYFKIICKKEYIQSEKALTLTVKINITNIHNSDEKIFQQLSL SEGRKPTLSHLPGRMNAILEEQATWVPPTLQQLFPQKQEGAPHSGLGRYGAITERKSLVC PSHGEEQLGLPQKSQQKQQTSNSETAK >gi568815586r:12376836_12621098|GENSCAN_predicted_CDS_2|444_bp atggtgggggggtcagccccccgcccggccagccgccccatccgggaggatcaggactac ttcaaaattatttgcaaaaaggaatatattcaaagtgagaaggccctcactcttactgtg aaaataaacattacaaacattcacaattcagatgagaaaatctttcaacagctctcactg agtgaaggcagaaagcccactttgtcccacttgcctggaagaatgaatgccatcctggaa gaacaggccacttgggtgccccccacacttcagcagctgtttcctcagaagcaggaaggt gccccccacagcggactagggagatatggggcaatcacagaaaggaaatccttggtgtgc ccaagccacggggaagagcagctgggattgccacagaaaagccagcagaagcaacagacc tctaacagtgaaacagccaaataa >gi568815586r:12376836_12621098|GENSCAN_predicted_peptide_3|96_aa MGKDFMSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTEWEKIFAIYPSDKEL MSKIYKEFKQIYKKKTNNPINKWAKDMNRHSSKEDI >gi568815586r:12376836_12621098|GENSCAN_predicted_CDS_3|291_bp atgggcaaggacttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgac aaatgggatctaattaaactaaagagcttctgcacagcaaaagaaactaccatcagagtg aacaggcaacctacagaatgggagaaaatttttgcaatctatccatcagacaaagagcta atgtccaaaatctacaaagaattcaaacaaatttacaagaaaaaaacaaacaaccccatc aacaagtgggcaaaagatatgaacagacactcctcaaaagaagacatctaa >gi568815586r:12376836_12621098|GENSCAN_predicted_peptide_4|217_aa MNIDAKILNKILANRIQEHIKKLTYHDQVGFIPGMQGWFNIHKSINVIHHINRTDHKKHM IISIDAEKAFNKIQQRFMLKTLNKLGIDGTYLKIIRAINDKPTASIIPNGQKLEAFPLKI GTRQECSLSPLLFNTVLEVLARAIRQEKEIKDIQLGNEEVKLSLFADDMIVYLENSIISA PNLLKLISNFSKVSGYKISVQKSQSFLYTNNRQTAKL >gi568815586r:12376836_12621098|GENSCAN_predicted_CDS_4|654_bp atgaacatcgatgcgaaaatcctcaataaaatactggcaaaccgaatccaggagcacatc aaaaagcttacctaccacgatcaagtcggcttcatccctgggatgcaaggctggttcaac atacacaaatcaataaacgtaatccatcacataaacagaaccgaccacaaaaaacacatg attatctcaatagatgcagaaaaggccttcaacaaaattcaacagcgcttcatgctaaaa actctcaataaactgggtattgatggaacgtatctcaaaataataagagctattaatgac aaacccacagccagtatcataccgaatgggcaaaaactggaagcattccctttgaaaatc ggcacaagacaagaatgctccctctcaccactcctattcaacacagtgttggaagttctg gccagggcaatcaggcaagaaaaagaaataaaggatattcaattaggaaatgaggaagtc aaattgtccctgtttgcagatgacatgattgtatatttagaaaactccatcatctcagcc ccaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcagtgtg caaaaatcacaatcattcctatacaccaataacagacaaacagccaaattatga >gi568815586r:12376836_12621098|GENSCAN_predicted_peptide_5|141_aa MQQVDSEGKAPSHATNLDSLFRCPELKRGKRFLAFGRTVLVQRKCGARPKEALCSIPYSP PNQKQPTPPPGFPATQGTSCISYRKERCMRKLSGASEDYWEPESQNEVEEGYRASPSCQV AEAEGPEPNLSSQNKTLTSTS >gi568815586r:12376836_12621098|GENSCAN_predicted_CDS_5|426_bp atgcagcaggtggatagtgaaggaaaggctccaagccacgccactaacctggactccctt ttccgatgtccagaattaaagagaggtaaaagatttctggcttttggcaggacagtgctg gtgcaaaggaagtgtggggccaggcccaaggaagccctgtgcagcataccttacagtccg ccgaaccagaagcagcccactcccccacctggatttcctgccacccaggggaccagttgt atttcttacagaaaagaaaggtgcatgaggaagctctccggtgcttcagaagactattgg gagcctgaatcgcagaatgaagtggaggaaggctacagggccagcccaagctgccaggtg gctgaagcagagggcccagaacccaaccttagctcacaaaacaaaactctcacttctacc tcttga >gi568815586r:12376836_12621098|GENSCAN_predicted_peptide_6|99_aa MAQIKRQTMTNVSEGVKKLEPLCIAVPTRWARPSHDQGDPNLNNHGKPRLSKFYQPYGFV ETLDKCFENVCELDLIFHVDTVHNILAEMMMGEWYWRQT >gi568815586r:12376836_12621098|GENSCAN_predicted_CDS_6|300_bp atggctcaaattaaaaggcagacaatgacaaatgttagtgagggtgtgaagaaactggaa cccctgtgcattgctgtgcctacccggtgggcccgacccagccacgatcaaggcgatcct aatcttaacaatcatgggaaaccgcggctctccaagttctaccagccctatggatttgtg gaaacattagacaaatgttttgaaaatgtttgtgaactggatttgattttccatgtagac acggttcacaatattcttgcagaaatgatgatgggggaatggtattggagacaaacatga >gi568815586r:12376836_12621098|GENSCAN_predicted_peptide_7|222_aa MAWSPLTHKHPHGPRLLLAHDLPTQRCNLHYSHGLHGVHLHFSSSRVEEGTVKKRSNKEC MPAAFPGREIEISGSQQGTNNAHSQNIEPGTVPSQFSSKGGSGTTVDKAVSLKGFLALGA SCKTCQLPLDLRDCVPHRLDTRPCGEDLIKQETLFSFMQERQKRYAKYAEQIQKVNEMSA ILRRIQMGIDQTVPLLDRLNSMLPEGERLEPFSMKPDRELRL >gi568815586r:12376836_12621098|GENSCAN_predicted_CDS_7|669_bp atggcctggtctcctcttacacataaacatcctcatggacccaggctgcttctagctcat gatctgccaactcaaagatgtaaccttcattattctcatggtctacatggggtgcacctg catttcagcagcagcagagtggaggaggggacagtgaagaagagaagcaacaaagagtgc atgccggctgcctttccaggaagggagatagaaatctcaggatctcaacaaggtacaaat aacgcacattcccagaacatagagccagggacagtcccctctcagttctcttctaaaggt ggttctggcacaacagttgataaagctgtgtccctgaagggctttttagccctgggtgct tcctgtaaaacttgtcagcttccactggatctgcgggactgtgtccctcacaggctggac acccggccctgcggagaggacttgattaaacaagaaactctgttcagcttcatgcaggag cgccagaaaagatacgccaagtatgccgagcagatccagaaagtgaacgagatgtccgcc atcctccgccgcatacagatgggcatcgaccagactgtgcccctgctggacaggctcaac agcatgctgcccgagggcgagcggctggagcccttcagcatgaagcccgaccgcgagctc aggctgtag >gi568815586r:12376836_12621098|GENSCAN_predicted_peptide_8|172_aa MNNSGRAALRAVTLTAKVCSFTPEPATPRTHQKEETPNTSEHQKEQTPDTLPLRTVTLTV RVRDFILETTGQAVGKDGEGEQVRLRPASHKLGPAPVSHQSPCWSLIPKPSSLWLQTKKL VLIPGPGSGKAKGRVLREWEEPWAQRLPTPQVQEIHGNVHVMEQGLTYIHFQ >gi568815586r:12376836_12621098|GENSCAN_predicted_CDS_8|519_bp atgaacaactctggacgcgctgccttaagagctgtaacactcaccgcgaaggtctgcagc ttcactcctgagccagcgacaccacgaacccaccagaaggaagaaactccaaacacatcc gaacatcagaaggaacaaactccggacacgctgcctttaagaactgtaacactcactgtg agggtccgcgacttcattcttgaaaccacaggccaggcggttggaaaggatggtgaggga gagcaagtacgactgagacctgcaagccacaagctgggacctgcacctgtctctcatcag agcccgtgctggtctctcatccccaaaccttcatccttgtggctgcagaccaagaagctg gttctcatacccggcccagggtctggaaaagccaaaggcagagttctgcgggagtgggag gagccgtgggcccagcggctgcccacaccacaagtgcaggagatccatggtaacgtgcac gtgatggaacagggcctgacctacatccacttccagtga >gi568815586r:12376836_12621098|GENSCAN_predicted_peptide_9|665_aa MAHEMIGTQIVTERLVALLESGTEKVLLIDSRPFVEYNTSHILEAININCSKLMKRRLQQ DKVLITELIQHSAKHKVDIDCSQKVVVYDQSSQDVASLSSDCFLTVLLGKLEKSFNSVHL LAGGFAEFSRCFPGLCEGKSTLVPTCISQPCLPVANIGPTRILPNLYLGCQRDVLNKELM QQNGIGYVLNASNTCPKPDFIPESHFLRVPVNDSFCEKILPWLDKSVDFIEKAKASNGCV LVHCLAGISRSATIAIAYIMKRMDMSLDEAYRFVKEKRPTISPNFNFLGQLLDYEKKIKN QTGASGPKSKLKLLHLEKPNEPVPAVSEGGQKSETPLSPPCADSATSEAAGQRPVHPASV PSVPSVQPSLLEDSPLVQALSGLHLSADRLEDSNKLKRSFSLDIKSVSYSASMAASLHGF SSSEDALEYYKPSTTLDGTNKLCQFSPVQELSEQTPETSPDKEEASIPKKLQTARPSDSQ SKRLHSVRTSSSGTAQRSLLSPLHRSGSVEDNYHTSFLFGLSTSQQHLTKSAGLGLKGWH SDILAPQTSTPSLTSSWYFATESSHFYSASAIYGGSASYSAYSCSQLPTCGDQVYSVRRR QKPSDRADSRRSWHEESPFEKQFKRRSCQMEFGESIMSENRSREELGKVGSQSSFSGSME IIEVS >gi568815586r:12376836_12621098|GENSCAN_predicted_CDS_9|1998_bp atggcccatgagatgattggaactcaaattgttactgagaggttggtggctctgctggaa agtggaacggaaaaagtgctgctaattgatagccggccatttgtggaatacaatacatcc cacattttggaagccattaatatcaactgctccaagcttatgaagcgaaggttgcaacag gacaaagtgttaattacagagctcatccagcattcagcgaaacataaggttgacattgat tgcagtcagaaggttgtagtttacgatcaaagctcccaagatgttgcctctctctcttca gactgttttctcactgtacttctgggtaaactggagaagagcttcaactctgttcacctg cttgcaggtgggtttgctgagttctctcgttgtttccctggcctctgtgaaggaaaatcc actctagtccctacctgcatttctcagccttgcttacctgttgccaacattgggccaacc cgaattcttcccaatctttatcttggctgccagcgagatgtcctcaacaaggagctgatg cagcagaatgggattggttatgtgttaaatgccagcaatacctgtccaaagcctgacttt atccccgagtctcatttcctgcgtgtgcctgtgaatgacagcttttgtgagaaaattttg ccgtggttggacaaatcagtagatttcattgagaaagcaaaagcctccaatggatgtgtt ctagtgcactgtttagctgggatctcccgctccgccaccatcgctatcgcctacatcatg aagaggatggacatgtctttagatgaagcttacagatttgtgaaagaaaaaagacctact atatctccaaacttcaattttctgggccaactcctggactatgagaagaagattaagaac cagactggagcatcagggccaaagagcaaactcaagctgctgcacctggagaagccaaat gaacctgtccctgctgtctcagagggtggacagaaaagcgagacgcccctcagtccaccc tgtgccgactctgctacctcagaggcagcaggacaaaggcccgtgcatcccgccagcgtg cccagcgtgcccagcgtgcagccgtcgctgttagaggacagcccgctggtacaggcgctc agtgggctgcacctgtccgcagacaggctggaagacagcaataagctcaagcgttccttc tctctggatatcaaatcagtttcatattcagccagcatggcagcatccttacatggcttc tcctcatcagaagatgctttggaatactacaaaccttccactactctggatgggaccaac aagctatgccagttctcccctgttcaggaactatcggagcagactcccgaaaccagtcct gataaggaggaagccagcatccccaagaagctgcagaccgccaggccttcagacagccag agcaagcgattgcattcggtcagaaccagcagcagtggcaccgcccagaggtccctttta tctccactgcatcgaagtgggagcgtggaggacaattaccacaccagcttccttttcggc ctttccaccagccagcagcacctcacgaagtctgctggcctgggccttaagggctggcac tcggatatcttggccccccagacctctaccccttccctgaccagcagctggtattttgcc acagagtcctcacacttctactctgcctcagccatctacggaggcagtgccagttactct gcctacagctgcagccagctgcccacttgcggagaccaagtctattctgtgcgcaggcgg cagaagccaagtgacagagctgactcgcggcggagctggcatgaagagagcccctttgaa aagcagtttaaacgcagaagctgccaaatggaatttggagagagcatcatgtcagagaac aggtcacgggaagagctggggaaagtgggcagtcagtctagcttttcgggcagcatggaa atcattgaggtctcctga >gi568815586r:12376836_12621098|GENSCAN_predicted_peptide_10|105_aa MELDWERMMPIVHEGTQVAQKVEIREQEQVDSNNGTGSLSLIGSSQRDKIRMGQSSYLDT LPENSGPYLLGEGPVYGGGLGRVEMEGSLDEVTGQEEIRPGIGLI >gi568815586r:12376836_12621098|GENSCAN_predicted_CDS_10|318_bp atggaactggactgggaaagaatgatgccaattgtccatgagggtacccaggtggcacag aaagtggaaatccgagaacaagagcaagttgattccaacaatggcactgggtctctgagt ctgattggctcctctcaaagagataaaatccggatgggacaaagcagttatctggacacg ctgcctgaaaactcaggaccctacctactgggagaggggccggtgtatggaggtggtctt gggagggtagaaatggaaggatccctggatgaagttactgggcaagaggaaatacggcca gggattggcttaatctga >gi568815586r:12376836_12621098|GENSCAN_predicted_peptide_11|53_aa MAHYPQAHQQVSEEQGQFPTHSTVVNPSKPDIDLILLTSPHVYFSPVRETENQ >gi568815586r:12376836_12621098|GENSCAN_predicted_CDS_11|162_bp atggcccactatcctcaagcacatcagcaggtctccgaagaacaaggacagtttcctacc cacagtacagttgtcaaccccagtaaacctgacattgatttaatacttctaactagccca cacgtgtatttcagtcctgtaagagagactgaaaatcagtag >gi568815586r:12376836_12621098|GENSCAN_predicted_peptide_12|55_aa MHASVNKEVTRRRAPQRQDPSPCPKIEGFFGWLVEESSTVTTQVASLLTPKANWG >gi568815586r:12376836_12621098|GENSCAN_predicted_CDS_12|168_bp atgcacgctagtgtaaataaagaagtcactaggcgtcgggccccccaacgccaagacccg agcccctgtccgaaaattgagggctttttcggttggttggttgaagaaagcagcaccgtc accacccaggtggcctcgctgcttacaccaaaggcgaactggggataa >gi568815586r:12376836_12621098|GENSCAN_predicted_peptide_13|330_aa XHSHHTSKPQSSSSLTLKKMGRESVTHTECEGKFASLMPESRGRGTSSGMAERTLEAAPN ASLRRRRLLLPLHDSLWVFTDALESRRDRRVRPKYVLVTFLPMASRGRDHNFSSWSPTPP PMLHEKSYLCSFARSPQGSSVPSQSAKMHGKAVLLFPQVPLGGEERVAPLVQVPTLEERE ERQRGSEGKEETEKEGVKEREKERQRERGRDRDKKACCQQQQPSFTGQGVSTAPKKTTQH SAGDPALIPLSPPNHKGHAEGERDCVQKQAYSHGIVRKLKVDKSRKKLLVDQAEAHWPKT KEVCKLHDECLQAKKEEIIKMLSKEEETER >gi568815586r:12376836_12621098|GENSCAN_predicted_CDS_13|993_bp nggcacagccaccatacttccaaaccccaatcctcatcttcccttacactgaagaaaatg ggacgagagagtgtgactcatacagaatgtgagggaaaattcgcttccctgatgccagag tcacggggacgagggaccagttcaggaatggcagagcggaccctggaggcggccccaaac gcctcccttcgccgccgccgcctcctcctccccctccatgacagtctctgggtgtttaca gacgctctggagagccgccgagaccggagggtgaggcccaaatacgtgcttgtaaccttc ctgcccatggcttccagaggacgagatcataacttctcttcatggtcgcccactcctccc cctatgctgcatgagaaaagctacctctgcagttttgccaggtccccgcaaggcagttca gtccccagccaaagtgctaagatgcacggcaaggctgttctcctgtttccacaggtgccc ctgggaggggaagagagagtagctccactcgtgcaagtccctaccctagaggagagagag gagaggcagagaggcagtgagggaaaggaagagacagaaaaagaaggagtcaaagagaga gagaaagagaggcagagagagagaggaagagacagagacaaaaaggcatgttgccagcag cagcagccttctttcacaggacagggagtcagtacagcaccaaagaagaccactcagcac agtgccggagaccctgccctgattcccctttctcctcccaaccacaaaggccatgcagaa ggtgaaagggattgtgttcaaaaacaagcgtattctcatggaattgtccgcaagctgaag gtagacaagtcccgcaagaagctcctggtggaccaggctgaggcccactggcctaagact aaggaagtgtgcaaacttcatgacgagtgcctccaggccaagaaggaggagatcatcaag atgttgtccaaggaggaagagaccgagagataa