GENSCAN 1.0 Date run: 5-Nov-116 Time: 08:20:04 Sequence gi568815578f:16631703_16841002 : 209300 bp : 40.59% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 626 621 6 1.05 1.02 Term - 7045 6878 168 1 0 19 54 152 0.600 1.80 1.01 Init - 7295 7164 132 2 0 56 51 151 0.830 8.33 1.00 Prom - 12154 12115 40 -2.25 2.00 Prom + 14430 14469 40 -9.25 2.01 Sngl + 16146 16433 288 2 0 69 39 217 0.847 10.24 2.02 PlyA + 17233 17238 6 1.05 3.03 PlyA - 17421 17416 6 1.05 3.02 Term - 22878 22868 11 1 2 121 48 6 0.243 -2.72 3.01 Init - 27407 26978 430 2 1 70 81 238 0.943 17.77 3.00 Prom - 31001 30962 40 -10.35 4.02 PlyA - 31352 31347 6 1.05 4.01 Sngl - 32031 31651 381 0 0 68 40 216 0.584 10.73 4.00 Prom - 36804 36765 40 -5.55 5.06 PlyA - 36840 36835 6 -0.45 5.05 Term - 38250 37927 324 0 0 79 40 227 0.678 10.78 5.04 Intr - 49705 49629 77 2 2 64 92 9 0.121 -2.78 5.03 Intr - 52678 52435 244 1 1 36 54 190 0.008 6.55 5.02 Intr - 63475 63399 77 2 2 63 63 71 0.008 0.42 5.01 Init - 69801 69633 169 0 1 98 13 140 0.346 7.24 5.00 Prom - 70966 70927 40 -6.25 6.03 PlyA - 71705 71700 6 1.05 6.02 Term - 74397 74308 90 0 0 97 37 79 0.096 0.44 6.01 Init - 83005 82934 72 1 0 80 31 126 0.152 7.22 6.00 Prom - 88301 88262 40 -2.85 7.00 Prom + 88718 88757 40 -7.65 7.01 Init + 91296 91352 57 2 0 86 55 35 0.373 1.43 7.02 Intr + 91768 91807 40 0 1 55 116 30 0.279 -0.62 7.03 Intr + 98183 98307 125 0 2 110 8 168 0.838 10.38 7.04 Intr + 98435 98512 78 1 0 114 60 48 0.573 3.43 7.05 Intr + 98830 98953 124 0 1 75 13 98 0.483 0.24 7.06 Intr + 99966 100064 99 1 0 93 115 54 0.931 7.76 7.07 Intr + 100462 100634 173 2 2 59 103 66 0.971 3.94 7.08 Intr + 105559 105699 141 0 0 11 110 141 0.957 8.23 7.09 Intr + 108623 108711 89 1 2 90 89 42 0.997 2.35 7.10 Term + 109144 109303 160 1 1 110 38 117 0.994 5.33 7.11 PlyA + 109492 109497 6 1.05 8.00 Prom + 115302 115341 40 -5.85 8.01 Init + 116700 116814 115 2 1 75 56 83 0.868 4.22 8.02 Intr + 117165 117304 140 1 2 108 47 109 0.851 8.06 8.03 Intr + 118201 118308 108 0 0 49 115 125 0.987 10.86 8.04 Intr + 130467 130558 92 2 2 -2 72 120 0.164 -0.73 8.05 Intr + 132620 132700 81 2 0 78 52 117 0.503 4.93 8.06 Term + 133012 133108 97 1 1 87 52 123 0.951 5.06 8.07 PlyA + 133336 133341 6 1.05 9.03 PlyA - 133659 133654 6 1.05 9.02 Term - 136721 136532 190 1 1 91 49 175 0.928 9.84 9.01 Init - 158954 158725 230 2 2 61 72 107 0.082 4.29 9.00 Prom - 159838 159799 40 -6.15 10.06 PlyA - 160114 160109 6 1.05 10.05 Term - 160910 160465 446 0 2 -46 55 237 0.050 1.41 10.04 Intr - 161456 161274 183 0 0 15 71 105 0.023 0.34 10.03 Intr - 168786 168487 300 1 0 17 84 134 0.163 1.68 10.02 Intr - 172081 171946 136 1 1 102 66 81 0.700 6.52 10.01 Init - 175917 175669 249 0 0 83 41 184 0.763 10.71 10.00 Prom - 177840 177801 40 -5.55 11.05 PlyA - 178901 178896 6 1.05 11.04 Term - 181989 181903 87 0 0 71 42 150 0.405 5.38 11.03 Intr - 182264 182165 100 1 1 79 88 -14 0.496 -3.11 11.02 Intr - 184697 184671 27 1 0 118 83 13 0.463 0.11 11.01 Init - 185263 184917 347 1 2 104 46 224 0.904 16.53 11.00 Prom - 187300 187261 40 -8.65 12.00 Prom + 189245 189284 40 -6.45 12.01 Init + 190227 190317 91 2 1 83 103 118 0.862 13.50 12.02 Intr + 191399 191552 154 0 1 25 62 121 0.053 1.41 12.03 Term + 205509 205746 238 0 1 8 50 150 0.004 -2.14 12.04 PlyA + 207046 207051 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 119393 119416 24 1 0 111 38 26 0.805 -2.85 S.002 Term + 191399 191595 197 0 2 25 36 129 0.866 -2.01 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578f:16631703_16841002|GENSCAN_predicted_peptide_1|99_aa MSRRLRILQGLCSCRASTGLWAVLLVTLPRCMKPAEPGAMTPVTGHSEMADTYLDITNNH SSHSSHAYDAAVVANVPTASLSSELSLRDDFGLLQGDAQ >gi568815578f:16631703_16841002|GENSCAN_predicted_CDS_1|300_bp atgtcacgacgactgcgtatcctgcaaggcctttgttcctgcagggccagcacgggactc tgggcagtgctgctggtcactctccccaggtgcatgaagccggcagagcctggagccatg actccagtcacaggtcactctgagatggcagacacatacctggacatcactaacaatcac agctctcattcaagccatgcctatgatgctgcagttgttgccaatgtccctacagcttca ctgagctctgagcttagtctgagagatgactttggcctcctccagggagatgctcagtga >gi568815578f:16631703_16841002|GENSCAN_predicted_peptide_2|95_aa MSLKRQNYSGNLVSVSPRNMGSSSTYLLVTFWNDSQATNASLFANDRDLISGDDRIDVAA AVPLDLEDALFSLLSPGNIAPDAAVNINDPHSCNL >gi568815578f:16631703_16841002|GENSCAN_predicted_CDS_2|288_bp atgtccctgaaaaggcaaaactattcagggaacctggtgagtgtcagcccgagaaatatg ggcagctcatctacatatcttttagtgaccttctggaatgacagccaagccaccaacgct tccctgtttgcaaatgatagagatctcatttctggggacgacaggattgatgtggcagcc gcagtgccattagacctggaagatgctcttttctcattactgtctcctggcaacatagca ccagatgccgccgtgaatattaatgatcctcattcatgtaacctttag >gi568815578f:16631703_16841002|GENSCAN_predicted_peptide_3|146_aa MDKFPDTYAPPRLNQEEAESLNRPRTSFEIEAVRNSLPTKKSPGPDGFTAESYYRYKEEL TPFLLKLFQTIEKQGLLPDSFYESSIILIQKTGRDTTKKENFRPISLININAKILNKILA NQIQQHIKKLIHHDQVSFIQGMQGFS >gi568815578f:16631703_16841002|GENSCAN_predicted_CDS_3|441_bp atggataaattcccggacacatatgcccccccaagactgaaccaggaagaagctgaatcc ctgaatagaccaagaacaagttttgaaattgaggcagtgagaaatagcctaccaaccaaa aaaagcccagggccagatggatttacagctgaatcctactacaggtacaaagaggagctg acaccctttcttctgaaattattccaaacaattgaaaagcagggactcctccctgactca ttctatgagtccagcatcatcctgatacaaaaaactggcagagatacaacaaaaaaagaa aacttcaggccaatatccctgataaacatcaatgcaaaaattctcaataaaatactggca aaccaaatacagcagcacatcaaaaaacttatccaccatgatcaagtcagcttcatccag gggatgcaaggtttctcctga >gi568815578f:16631703_16841002|GENSCAN_predicted_peptide_4|126_aa MWHMLRVQGHLPTGGEDWDSYLVCWPRQQIQPPAPGSERVQNIPPQNMPLWRINYSELKE LEKQQMQKGLSDLTPSFDVKASHKVSHKKGAFSVPGRKEHSYHQRLGVNGKWICTNEPTK ITLIVH >gi568815578f:16631703_16841002|GENSCAN_predicted_CDS_4|381_bp atgtggcacatgctacgggtgcagggccacctccccacgggaggagaggactgggattct tatttggtttgttggccaagacaacaaatccaaccacctgctcctggtagtgaaagagtt cagaacataccaccccaaaatatgccactttggcgtattaattattctgagctaaaggaa cttgagaaacagcagatgcagaaagggctttctgatctcactccctcttttgatgtaaaa gcaagtcataaagtttcccataagaaaggtgccttctctgtaccaggaagaaaagagcat tcttatcaccagagattgggagtcaatggcaaatggatctgtacaaatgaacctactaaa ataacccttattgtccattaa >gi568815578f:16631703_16841002|GENSCAN_predicted_peptide_5|296_aa MWHDGLPTGSTRTQLLDVMITLNPERLQDQPEGQESNGRGKTTRPALSAERCGHNQDSKN GSRRLKEDASASTNFMAGNTLKTRVPYGINKRKARMGRLIHKMTCAFVTFMQVNLEDKGV LAKLVEATKTNYNNKYDEIHHHGGSNVPGPKSMAHIAKLEMAKVLESESPRSRCQQSWFH SEASLCGLQIHLEARPGLVTELQPGNNFNGIYYNMNPGQTLLWNNLYPQEACPLPSYASL FPDAATGVIFHQTGSSTSTFSLESSQHGTHGLFVKAVEVKTFSEPVNGTVILPALT >gi568815578f:16631703_16841002|GENSCAN_predicted_CDS_5|891_bp atgtggcatgatggtctacccacaggtagcaccaggacccaactgctggatgtgatgata acacttaatccagagaggctacaagaccaacctgaaggccaagagtccaatggaagaggc aaaaccactaggcctgccctaagtgcagaaagatgtggtcacaaccaggacagtaagaat gggtcaagaagactcaaggaagatgcttctgcttctaccaacttcatggcaggcaataca ctcaagacaagggtcccttacggcattaacaaaaggaaggcaagaatgggacgtctgatc cacaagatgacctgcgcctttgtcaccttcatgcaggttaacttagaagacaaaggagtt ttggctaagctggtggaagctaccaagaccaattacaacaacaaatatgatgagatccat catcacgggggaagcaatgtcccaggcccaaaatccatggctcacattgccaagctggaa atggcaaaggttctggagtctgagagtccaagatcaaggtgtcagcagagttggtttcat tcagaggcctctctctgtggcttgcagatacaccttgaagcacgtccaggtctagtcact gagctccagccaggaaataatttcaatgggatctattataacatgaacccaggccagacg ctcctgtggaataacctctacccacaggaagcatgccctctaccgtcttatgcttctctg tttcctgatgcagccactggagtcatcttccaccaaacaggaagctccacttccactttc agccttgaatcttctcagcatgggacacatggcctctttgtcaaagctgtggaggttaag accttcagtgaaccagtaaatggtacagtcattctaccagctttgacatag >gi568815578f:16631703_16841002|GENSCAN_predicted_peptide_6|53_aa MDVWPAAPIRSARYGACSAHAGVQAQHHVEAAKVWGLHPLKPWPTLYVGPFQP >gi568815578f:16631703_16841002|GENSCAN_predicted_CDS_6|162_bp atggacgtgtggccagctgctcccatccgatcagctcgttatggagcatgctctgctcat gcaggagtacaggctcaacaccatgtggaagctgccaaggtttggggcttgcatcctctg aagccatggcccacgctctatgttggcccctttcagccatag >gi568815578f:16631703_16841002|GENSCAN_predicted_peptide_7|361_aa MKLPPNVLGMLGLGEPGTYVPNKGKTLKVPNKGQTHAQRDCQPWLPETRKHRSALCSASR LVGARLRSVATLLPLGADEARSWGGYAFGPGLDLGGAGLGSLPFRRLLCDPCPFSRFRFR GSQGFEKRYVLPVTCTSAAPAYFLLSPEEFNTNMDIRPNHTIYINNMNDKIKKEELKRSL YALFSQFGHVVDIVALKTMKMRGQAFVIFKELGSSTNALRQLQGFPFYGKPMRIQYAKTD SDIISKMRGTFADKEKKKEKKKAKTVEQTATTTNKKPGQVPDYPPNYILFLNNLPEETNE MMLSMLFNQFPGFKEVRLVPGRHDIAFVEFENDGQAGAARDALQGFKITPSHAMKITYAK K >gi568815578f:16631703_16841002|GENSCAN_predicted_CDS_7|1086_bp atgaagctcccacctaatgttcttggaatgcttggtctgggggaacctggcacctatgtg cccaataaaggtaaaacactaaaggtgcccaataaagggcagacgcacgcgcagagagat tgccagccctggctgccagagacccggaagcatcgatcggctctgtgctcggcttctaga cttgtcggcgctcgattgaggagcgtggctacgttgcttccgctgggagcggacgaagcg cgaagctggggtgggtacgcgtttggcccgggtttggatctcggaggggcaggcttgggg tcacttccctttcgtcgtctcctttgcgacccctgccccttctccagatttcgatttagg gggtcgcaaggctttgagaagcgttacgttttacctgttacctgcacaagtgcagcccct gcttattttttactgtctcctgaagaatttaacacaaacatggatatcagaccaaatcat acaatttatatcaacaatatgaatgacaaaattaaaaaggaagaattgaagagatcccta tatgccctgttttctcagtttggtcatgtggtggacattgtggctttaaagaccatgaag atgagggggcaggcctttgtcatatttaaggaactgggctcatccacaaatgccttgaga cagctacaaggatttccattttatggtaaaccaatgcgaatacagtatgcaaaaacagat tcggatataatatcaaaaatgcgtggaacttttgctgacaaagaaaagaaaaaagaaaag aaaaaagccaaaactgtggaacagactgcaacaaccacaaacaaaaagcctggccaggtc cctgattaccctccaaactatattttattccttaataacttaccagaagagactaatgag atgatgttatccatgctgtttaatcagttccctggcttcaaggaagtacgtctggtacca gggaggcatgacattgcttttgttgaatttgaaaatgatgggcaggctggagctgccagg gatgctttacagggatttaagatcacaccgtcccatgctatgaagatcacctatgccaag aaataa >gi568815578f:16631703_16841002|GENSCAN_predicted_peptide_8|210_aa MARILLLFLPGLVAVCAVHGIFMDRLASKKLCADDECVYTISLASAQEDYNAPDCRFINV KKGQQIYVYSKLVKENGAGEFWAGSVYGDGQDEMGVVGYFPRNLVKEQRVYQEATKEVPT TWVKAKDVAKQPTSTIQLLTKNNQDPNDNVPRQILSEAAESMEHGKHLGRTPGTQKHRSD PVSPSAMSGSSPRLSPDADTRPGTFQSPES >gi568815578f:16631703_16841002|GENSCAN_predicted_CDS_8|633_bp atggcaagaatattgttacttttcctcccgggtcttgtggctgtatgtgctgtgcatgga atatttatggaccgtctagcttccaagaagctctgtgcagatgatgagtgtgtctatact atttctctggctagtgctcaagaagattataatgccccggactgtagattcattaacgtt aaaaaagggcagcagatctatgtgtactcaaagctggtaaaagaaaatggagctggagaa ttttgggctggcagtgtttatggtgatggccaggacgagatgggagtcgtgggttatttc cccaggaacttggtcaaggaacagcgtgtgtaccaggaagctaccaaggaagttcccacc acgtgggtaaaggccaaggatgttgctaaacaacctacaagcacaatacagctcctcaca aagaataatcaggacccaaacgacaatgtgccaaggcagatcctgtccgaagctgctgaa agcatggagcatgggaagcacctgggaagaacacctggcactcaaaagcacaggagtgac cccgtttcaccttctgccatgagcggaagcagccctaggctctcaccagatgcagacacc cgacctggaactttccagtcaccagaatcatga >gi568815578f:16631703_16841002|GENSCAN_predicted_peptide_9|139_aa MPTTIREYYKHLYTNKLGNLEEMDKSLDTYTLPRLNQEEVESLNRPITGSEIEAMINSLP TKKSPGPDGFTAKFYQSFREERIERLHVCLMPELVVQGCHEVTGTAGQQHEASQAPEIGI LILSGLCIAKIQELTTTKG >gi568815578f:16631703_16841002|GENSCAN_predicted_CDS_9|420_bp atgccaactaccatcagagaatactataaacatctctatacaaataaactaggaaatcta gaagaaatggataaatccctggacacatacaccctcccaagactaaaccaggaagaagtt gaatctctgaatagaccaataacaggatctgaaattgaggcaatgattaatagcctacca accaaaaaaagtccaggaccagacggattcacagccaaattctaccagagcttcagagaa gaaagaatagaaagactccatgtgtgcctgatgcctgaacttgtcgtccagggctgccat gaggttacagggactgcagggcagcagcatgaagcatcacaagcacctgaaataggaatc ttgattctgagtggtctgtgcattgctaaaattcaggaacttactactactaaaggatga >gi568815578f:16631703_16841002|GENSCAN_predicted_peptide_10|437_aa MATEARLMSDDCYGQQSLKPALWKGVDCTQNWKAGEYRHKPYGLAAAAPRGVISPSGGAG ENPHMLKNVSMLRTDCHRHWQTQPSACWCQAAEGRTYRSRKMRRDERRDESQGRMDKNSQ QSKRRLGLGKDLIQTFTKSLLNLKEHILGWASGIISCWNIAEATHQGNYRPRDHPWSSPK FEKPLVSMGPENSVVFWQDASESPCFPTDSANTYAGRCFHLIPMIQTSSIFAVLQHPLLI PSQTGSGVNLQQTPTDLQLRVLTVRRKTNKKKGHPHQNPICTSPSSKTKERVSVIKDQMN EMKQEEKFREKRVKRNEQSLQEIWDYVKRPNLHLIGVPESDGENGTKLENTLQDIIQENY PNLARQSTIQIQEIQRTPQRYSLGRATPRHIIVRFTKIEMKKKMLRAAREKGRVTHKGKP IRLTADVSAETLQARRE >gi568815578f:16631703_16841002|GENSCAN_predicted_CDS_10|1314_bp atggccactgaagcaagattaatgagtgatgactgctatgggcaacagagtctcaagcca gctctgtggaagggtgtggactgcacccagaactggaaggctggggaatatcgccacaaa ccctatgggttggcagctgctgccccaaggggtgtcatttcaccctctgggggtgcaggg gaaaatccccacatgctgaaaaatgtcagcatgctgaggactgactgccacaggcactgg caaactcagccttctgcctgctggtgccaagctgcagaagggaggacttaccgatccagg aagatgaggagagatgagagaagagatgagagccaagggaggatggataaaaacagccag caaagcaaaaggcggctgggactagggaaggatttaatacagacatttacaaagtcactg ctgaacctaaaggagcacattctgggctgggcttcaggaatcatcagctgttggaacatt gcagaagcgacccaccagggaaactaccggcccagagaccacccctggagttctcccaaa ttcgagaagccactggtgtctatggggccagaaaactcagtagtcttttggcaagatgct tcagaatctccttgtttcccaacagattcagccaacacgtatgcaggaagatgctttcac ctcattcctatgatccaaacatcatcaatattcgctgtcctgcagcacccactgctgata cccagtcaaacagggtctggagtgaacctccagcaaactccaacagacctgcagctgagg gtcctgactgttagaaggaaaactaacaaaaagaaaggacatccacaccaaaaccccatc tgtacgtcaccatcatcaaagaccaaagaaagggtatcagtgattaaagatcaaatgaat gaaatgaagcaagaagagaagtttagagaaaaaagagtaaaaagaaatgaacaaagcctc caagaaatatgggactatgtgaaaagaccaaatctacatcttattggtgtacctgaaagt gacggggagaatggaaccaagttagaaaacactctgcaggacattatccaggagaactac cccaacctagcaaggcagagcaccattcaaattcaggaaatacagagaacaccacaaaga tactccttgggaagagcaaccccaagacatataattgtcagattcaccaagattgaaatg aagaaaaaaatgttaagggcagccagagagaagggtcgggttacccacaaagggaagcct atcagactaacagcagatgtctcagcagaaaccctacaagccagaagggagtga >gi568815578f:16631703_16841002|GENSCAN_predicted_peptide_11|186_aa MPPSYAYKNPETLAGRNISSWTLRGTHRRKKTQAAGRQKDIESTPTGTGMWKAIDRQNDA EFGRGGQKRAPAAEQPDSRGKPPSHSLLARSSSGSYFHSIKPCTHSPSPHMIRFFWGLSS GAPKRNVLMRIIGLFVNNYLTKIVQYSPPHLKITRKAKLTQNKQRTAQDPTDLQVWSQPA LQVLAT >gi568815578f:16631703_16841002|GENSCAN_predicted_CDS_11|561_bp atgcccccatcctatgcctataaaaaccctgagaccctagcgggcagaaacataagcagc tggacattgagaggaacacatagacggaagaagacacaagcagctggacgtcaaaaggac atcgagagcacaccaacaggcaccggcatgtggaaggccatcgacaggcagaacgatgcg gagtttggcaggggtggtcagaagagagctccagctgctgagcaaccagactccagggga aaaccaccttcccactcccttcttgctcgctcatcttctgggagctacttccactcaata aagccttgcactcattctccaagcccacatatgatccgattcttctggggtttgagcagc ggggcaccaaagaggaatgtcctaatgcgaataattggattatttgtaaataattattta accaaaattgtgcaatattcccctcctcacctaaagataacaagaaaggcaaagcttact cagaataagcagagaactgctcaagatccaacggacctgcaggtatggagccagccagcc cttcaggtcttagccacctaa >gi568815578f:16631703_16841002|GENSCAN_predicted_peptide_12|160_aa MKNLNEMAPPKDATSPPTTVPNQSGNLEMTETYLTCNDTNRLKVKGWRKIYHANGKRTRA DIVILISDETAIKLTTVKKDKRRWKLNDHLNEIHNKPTEIVNMEDLQALRGYELFDFVSR QNSKEYLPYILGVIMAIDWYHIPKTFHLEQANKYVVIVPI >gi568815578f:16631703_16841002|GENSCAN_predicted_CDS_12|483_bp atgaaaaatctgaatgaaatggcaccaccaaaggatgccactagccctccaacaacggtc cctaaccaaagtggaaacttagagatgacagagacatatctcacatgtaatgacaccaat aggctcaaagtaaagggctggagaaagatctatcatgcaaatggaaaacgaacaagagca gacattgttattcttatttcagatgaaacagcaattaaactaacaacagtaaaaaaggac aaaagaagatggaagctaaatgatcatctcaatgaaatacataacaaacccactgaaatt gtaaatatggaagatttgcaggctttaagagggtatgaactgtttgactttgtgtccagg caaaactcaaaagaataccttccttacatcctgggagtaatcatggccatagattggtac cacatacccaagacatttcatttggagcaagccaataagtacgtggttattgtgcccatt tag