GENSCAN 1.0 Date run: 19-Feb-121 Time: 20:42:16 Sequence gi568815593f:17175217_17375897 : 200681 bp : 42.43% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 135 130 6 1.05 1.02 Term - 2557 2371 187 0 1 64 54 106 0.519 0.78 1.01 Init - 4654 4506 149 1 2 29 72 162 0.805 8.31 1.00 Prom - 11752 11713 40 -3.65 2.00 Prom + 15842 15881 40 -6.15 2.01 Init + 20515 20568 54 0 0 86 59 27 0.099 0.94 2.02 Intr + 27759 27896 138 2 0 45 36 102 0.128 0.44 2.03 Intr + 36828 36966 139 2 1 95 96 23 0.310 2.92 2.04 Intr + 41042 41217 176 0 2 30 115 140 0.462 9.64 2.05 Intr + 41473 41681 209 0 2 98 59 96 0.745 4.65 2.06 Intr + 42357 42594 238 0 1 -58 107 290 0.506 12.99 2.07 Intr + 42677 42753 77 1 2 101 37 70 0.459 0.59 2.08 Intr + 43552 43818 267 1 0 117 -4 248 0.796 14.22 2.09 Intr + 44447 44657 211 2 1 35 24 181 0.037 4.39 2.10 Term + 66559 66786 228 0 0 93 49 204 0.939 12.65 2.11 PlyA + 70335 70340 6 1.05 3.00 Prom + 72154 72193 40 -6.35 3.01 Init + 72520 72553 34 0 1 73 35 49 0.785 -1.58 3.02 Intr + 73958 74032 75 0 0 107 82 68 0.340 6.67 3.03 Intr + 76277 76346 70 0 1 102 75 43 0.274 1.72 3.04 Intr + 94277 94385 109 2 1 100 75 33 0.136 2.57 3.05 Term + 99992 100684 693 1 0 88 48 746 0.460 63.18 3.06 PlyA + 101594 101599 6 1.05 4.03 PlyA - 102166 102161 6 1.05 4.02 Term - 104054 103720 335 0 2 47 41 239 0.685 8.89 4.01 Init - 112628 112460 169 2 1 72 75 76 0.228 4.45 4.00 Prom - 113262 113223 40 -7.85 5.00 Prom + 113390 113429 40 -8.65 5.01 Init + 114624 114684 61 2 1 82 19 85 0.485 2.49 5.02 Intr + 117217 117446 230 2 2 88 77 205 0.706 16.07 5.03 Intr + 125324 125410 87 1 0 108 91 95 0.705 11.05 5.04 Term + 131164 131607 444 0 0 72 54 147 0.149 3.95 5.05 PlyA + 131824 131829 6 1.05 6.02 PlyA - 132122 132117 6 1.05 6.01 Sngl - 136672 136439 234 1 0 97 36 248 0.976 13.44 6.00 Prom - 150043 150004 40 -6.05 7.04 PlyA - 150882 150877 6 1.05 7.03 Term - 153376 153244 133 0 1 89 52 130 0.794 6.08 7.02 Intr - 159754 159668 87 0 0 49 99 48 0.063 0.17 7.01 Init - 167491 167364 128 1 2 81 76 64 0.321 4.18 7.00 Prom - 178455 178416 40 -3.65 8.02 PlyA - 178484 178479 6 -0.45 8.01 Sngl - 179418 178657 762 0 0 78 46 798 0.999 70.46 8.00 Prom - 183521 183482 40 -5.75 9.06 PlyA - 183794 183789 6 1.05 9.05 Term - 184883 184627 257 0 2 20 36 311 0.979 13.86 9.04 Intr - 190351 190177 175 1 1 8 33 89 0.036 -6.01 9.03 Intr - 190857 190744 114 0 0 80 77 158 0.680 13.62 9.02 Intr - 192922 192805 118 0 1 44 82 47 0.597 -0.75 9.01 Intr - 194173 194080 94 2 1 77 55 115 0.497 5.20 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593f:17175217_17375897|GENSCAN_predicted_peptide_1|111_aa MVSEQNQGCVHKEGGGVAAGLVINNEGDTGGKTSIDAGEIRTVGVSVAIGIINMQETTPS VDEKASLLVRGQQMFSIKDKIVNISGFAGCTAFIATTPLCLCSLKAAIDHT >gi568815593f:17175217_17375897|GENSCAN_predicted_CDS_1|336_bp atggtgtctgaacaaaatcagggctgtgttcacaaggaaggaggaggagtggctgctggg ttggtaatcaacaatgaaggagacacaggagggaagaccagcatagacgctggtgagatc agaactgtgggtgtgagtgtggccattggcatcatcaacatgcaggagaccacaccatca gttgatgagaaagccagtctcctagtcagaggtcagcaaatgttttccataaaggacaag atagtaaacatttctggctttgcaggctgtacagccttcattgcaacaactccactctgc ctttgcagtttgaaagcagctatagaccatacatga >gi568815593f:17175217_17375897|GENSCAN_predicted_peptide_2|578_aa MALRPIISYPFKAHFTPRDCEAVGIGGLRLWRRGSPFAYGIKKAADTGTDHHWKEAVFAT CGQQEQVGTAWKKPSLLRRLSPFYHWFVTGKGPNPDPKGGFLDLMQERIQGEHESDPNTA RGRQNAANKSSKGVNWLLREGKIGFCVIKELPLSTMSRYVIFVCPSCQQFREHPPRGDVS RPTSWTSATAPPPKLQPSGLSPAGAGAGAEPPKVRNSFAAAQPRRACSWGSPQPRCQPRK GRGSEREKCRGCSGGGGGSSGSGDDGGGSAPTGSSLRAPPSSRERASASGQAPARRRQRP RPLGGTALTQGLHRHPEPPPGSPFMQRAAGMWARIGRQGPGVPLKIKIAESWIRSHAQCA AANRILFMNIHRTLGQAPYLRPQRHFRLETNSVDSPCPCNSHSAEISTAHMGAFFRTLYR LTTPLSPPRPLALDTTWGKLRHLGVYSNKMAYKNRREENDYLVGILTLSVFSGLRTHCFI KDSKRRMASRWKSDGGRKKAVVGESDLWGQFTGTKQKLQPRAAGFKDKAAKQVRGAGTQT QGFLKHKLQHLIKWGKEGTLNRDTELHCVSTKCECPTF >gi568815593f:17175217_17375897|GENSCAN_predicted_CDS_2|1737_bp atggcgctaagacctatcatcagctaccctttcaaggcacattttacacctagggactgt gaagcagtgggaattggagggctcaggctatggagaaggggaagtccttttgcatatggt attaagaaggcggcggacactgggactgatcatcactggaaagaagctgtttttgccaca tgtggacagcaagaacaggtaggcactgcatggaaaaagccctcgctgcttaggaggcta tcacctttttaccattggtttgttacaggaaagggtcccaatccagaccccaagggagga ttcttggatctcatgcaagaaagaattcagggcgagcatgaaagcgatccaaacacagcc agagggcgccaaaatgccgcaaataaaagttccaaaggcgtcaactggcttttgcgggaa ggtaaaattggcttttgtgtaatcaaagagctaccgttgtcaacgatgtcacgttacgtt atttttgtatgtccctcttgccaacagttccgagagcatcctccgaggggcgacgtctct cgccccaccagctggacctcggcgaccgcgcctcccccgaagctgcagccttcggggctg agccccgcgggtgcgggtgcaggtgcggagccgcccaaggtgcgcaactcgtttgcagcg gcgcagcccagacgcgcctgcagctggggctcaccccaacctcgctgccagccgaggaag gggagggggagcgagcgcgagaaatgcagaggctgcagcggcggcggcggcggcagtagc ggcagcggcgacgacggcggcggcagcgctccaactggctcctcgctccgggctccgccg tcgagccgggagagagcctccgccagcggccaggcaccagccagacgacgccagcgaccc cggcctctcggcggcaccgcgctaactcaggggctgcataggcacccagagccgcccccg ggctctcccttcatgcagcgggcagcagggatgtgggcccggattggacggcaagggcct ggggtccctctaaaaataaaaattgcggaatcctggatccgatcgcacgcacaatgcgcg gctgcaaaccgcattcttttcatgaatattcatcgaacgcttggccaggctccttatctg cgtccccagagacatttccgccttgaaacgaactcggtggacagcccttgcccatgtaat tctcactctgctgaaattagcacagcacacatgggtgcattttttaggaccttgtacagg ttgacaacacctctgtctcccccgcggccactcgcacttgacaccacctggggaaaatta agacatcttggcgtttattcgaacaagatggcctacaaaaaccggagagaagaaaacgat taccttgtcgggatcctaaccctttctgttttctctggactccgaactcattgtttcatt aaggattccaaaaggcgaatggcgtccagatggaaatctgatgggggcagaaagaaggca gtcgttggggagtcagacctgtggggacagttcacaggaacaaaacagaagctccaacca agagctgcaggatttaaggacaaggctgcaaaacaagtacgtggagctggcacgcaaact cagggctttctgaaacacaaacttcagcatttaattaaatgggggaaggaaggaactctc aaccgagatactgaactgcactgtgtctctacaaaatgcgagtgtcccacattttaa >gi568815593f:17175217_17375897|GENSCAN_predicted_peptide_3|326_aa MKCVRVRVLALQPNAGTVPVVGLALMVGLSCVGCSKLLVSVEGSKQRLELQLSLSSSRRS PGCSDQLTVLEWHTLFSLHRKAELYHISSRIAVSSQNSKMGGKLSKKKKGYNVNDEKAKE KDKKAEGAATEEEGTPKESEPQAAAEPAEAKEGKEKPDQDAEGKAEEKEGEKDAAAAKEE APKAEPEKTEGAAEAKAEPPKAPEQEQAAPGPAAGGEAPKAAEAAAAPAESAAPAAGEEP SKEEGEPKKTEAPAAPAAQETKSDGAPASDSKPGSSEAAPSSKETPAATEAPSSTPKAQG PAASAEEPKPVEAPAANSDQTVTVKE >gi568815593f:17175217_17375897|GENSCAN_predicted_CDS_3|981_bp atgaagtgtgttcgtgtccgtgtgctggcgcttcagcccaacgcaggaaccgtccctgtt gttggccttgcattgatggtgggcctgtcctgtgtggggtgcagtaaattgttggtttct gtggaagggtccaaacagcggctggagctccaactgtctttgtctagcagcaggaggagc ccaggttgctctgatcaactcacagtgctcgaatggcacactctcttctctcttcaccga aaagcagaactttatcacatttcttccagaatagctgtttctagccagaactccaagatg ggaggcaagctcagcaagaagaagaagggctacaatgtgaacgacgagaaagccaaggag aaagacaagaaggccgagggcgcggcgacggaagaggaggggaccccgaaggagagtgag ccccaggcggccgcagagcccgccgaggccaaggagggcaaggagaagcccgaccaggac gccgagggcaaggccgaggagaaggagggcgagaaggacgcggcggctgccaaggaggag gccccgaaggcggagcccgagaagacggagggcgcggcagaggccaaggctgagcccccg aaggcgcccgagcaggagcaggcggcccccggccccgctgcgggcggcgaggcccccaaa gctgctgaggccgccgcggccccggccgagagcgcggcccctgccgccggggaggagccc agcaaggaggaaggggaacccaaaaagactgaggcgcccgcagctcctgccgcccaggag accaaaagtgacggggccccagcttcagactcaaaacccggcagctcggaggctgccccc tcttccaaggagacccccgcagccacggaagcgcctagttccacacccaaggcccagggc cccgcagcctctgcagaagagcccaagccggtggaggccccggcagctaattccgaccaa accgtaaccgtgaaagagtga >gi568815593f:17175217_17375897|GENSCAN_predicted_peptide_4|167_aa MEILQGGPASWVCHLDSHKGPHTQSPVLDRMLCSLEILKSLSKAQPFLQSVLPVGPNLRL DSREKEDSPRLGNDIPGVPVPPLPPYCQEQTEDGLLSGLRPGRAILGCMAHASSWGPESI PIMQRRHQLRKRIFKVEGQAWHPCTSSSGFLIVSSLQYAQRHPVNER >gi568815593f:17175217_17375897|GENSCAN_predicted_CDS_4|504_bp atggagatcttgcagggaggaccagcttcatgggtgtgccacctggacagtcacaaagga ccccacactcagagtcctgtgctggatcgaatgctttgctctcttgaaattcttaagagt ttaagcaaggcccagccctttttgcagtcagtcctgcctgtaggacctaacctacgtctt gattccagagagaaagaagattctccaagactggggaatgacattccgggagtgccagta ccacctctgcctccatattgccaggaacagactgaagatggccttctctcagggttgaga ccaggaagggccatcctgggatgcatggcacatgccagctcttggggaccagagtccatt cccatcatgcagaggcggcatcagctacggaaaagaatcttcaaagtggagggacaagcc tggcatccctgtacaagttcctctggattcctcatagtttcctccttgcagtacgcacaa aggcacccagtaaatgaacgttag >gi568815593f:17175217_17375897|GENSCAN_predicted_peptide_5|273_aa MKIRFPQSRGGWRALGLMAKDNDSVLFLEMVSGLREELEESLVWCLRTQEDQGSGAIVSR HDRCLKTSRARKEHTPGEPATQMQKLVTHVSTPSADKVSFIASSDSPSHSLTRGGGVETS GACRSQAILLVRLTSQLFRAVSWAGCSLPWPVSGVLSKHACRPHYSTQPTGPGRAASLPS KGAQLELEEMLVPRKMSISPLESWLTACCLLPRLDAQTPGTAAPAQFYECLPSQMGEGAK QEDEKAWDTTQMQSKNVLKTRRQKMNHHKHRKL >gi568815593f:17175217_17375897|GENSCAN_predicted_CDS_5|822_bp atgaagattcgatttccgcagtcccgtgggggatggcgagccctgggcctaatggctaaa gataacgactccgtgttgttccttgagatggtttcgggtttacgggaagagttagaagag agtcttgtgtggtgtttgaggactcaggaagatcagggttctggggcgatagtttctaga cacgatagatgcctgaaaaccagcagggcccgaaaggagcatacccctggggaaccagca actcaaatgcagaaactggtaacccacgtgagcaccccatcagcagacaaggtctccttc atcgctagctctgattcgccctcacactccctcactaggggtgggggtgtggagacttca ggtgcctgccggagccaggccatactcctggtgcgcctgacttctcagctgttcagggct gtttcctgggcaggctgcagtctgccctggcctgtttctggagtgctgagcaagcatgcc tgcaggccccattacagcacacagccaactggcccaggtagggctgcctcactccccagc aagggggcccagctggagctcgaggagatgctggtccccaggaagatgtccatcagcccc ctggagagctggcttacagcctgctgcctcctgcccagactggatgcccagaccccaggg actgcggctccagcccaattctatgagtgtctccctagccagatgggggaaggggccaag caggaggatgagaaggcctgggatacaactcagatgcagagcaaaaacgtgctgaagacc cgccggcagaagatgaaccaccacaagcaccggaagctgtga >gi568815593f:17175217_17375897|GENSCAN_predicted_peptide_6|77_aa MPEPSPASVGSCAARASPTSATPCSTAPSPIDHPRAEECERMARDWQAAPPAAPVQDPLG EASWAPEFGGALENLYV >gi568815593f:17175217_17375897|GENSCAN_predicted_CDS_6|234_bp atgcctgagccttcccccgcctccgtgggctcctgtgcagcccgagcctccccgacgagc gccaccccctgctccacggcgcccagtcccatcgaccacccaagggctgaggagtgtgag cgcatggcgcgggactggcaggcagctccacctgcagccccggtgcaggatccactgggt gaagccagctgggctcctgagtttggcggggctttggagaacctttatgtctag >gi568815593f:17175217_17375897|GENSCAN_predicted_peptide_7|115_aa MMILLKRWDFGELIIGYKGVAVMNAISALIQETPESCLTSSIGSMSMSHGLKMDSAKNEQ SIKAGFTLHDTSKTYLGGSQIVVLNQQQQHHLGLSNAKSQPLTQDRLTQKLRGWG >gi568815593f:17175217_17375897|GENSCAN_predicted_CDS_7|348_bp atgatgatacttcttaagaggtgggactttggggagctgattataggttataagggtgta gcagtcatgaatgcgattagtgccctgatacaagaaaccccagagagctgcctcacctct tccatcgggtcaatgagtatgtctcatgggcttaaaatggacagtgcaaagaatgagcag tctataaaagcaggttttacactgcatgataccagcaaaacctacctcggtggttctcaa atagtggtcctcaaccagcagcagcagcatcacctgggcctgagcaatgcaaagtctcaa cctctcacccaagaccgactgactcagaaactcaggggttggggctga >gi568815593f:17175217_17375897|GENSCAN_predicted_peptide_8|253_aa MDVLPRESSGFPASTVLGQNLALVPHTGRLPIASPPSPPHRALGLPQGPRRRSSTAQSPP RTQPPLLSRRHDDRVHLTGAPGLPPGLKAAINRQINLELYASCVYLSMSYYFDRDDVALK NFAKYFLHQSHEEREHAEKLMKLQNQGGGRIFLQDIKKPDCDDWESGLNAMECALHLEKN VNQSLLELHKLATDKNDPHLCDFIETHYLNEQVKAIKELGDHMTNLCKMGAPESSLAEYL FDQHTLGDSDNES >gi568815593f:17175217_17375897|GENSCAN_predicted_CDS_8|762_bp atggacgttcttccacgagagtcgtcggggtttcctgcttcaacagtgcttggacagaac ctggcgctcgtcccccacaccggccggctgcccatagccagccctccgtcacctcctcac cgcgccctcggactgccccaaggcccccgccgccgctccagcactgcgcagtcaccaccg cgaacgcagccgcctctccttagtcgccgccatgacgaccgcgtccacctcacaggtgcg ccaggactaccaccaggactaaaggccgccatcaaccgccagatcaacctggagctctac gcctcctgcgtttacctgtccatgtcttactactttgaccgcgatgatgtggctttgaag aactttgccaaatactttcttcaccaatctcatgaggagagggaacatgctgagaaactg atgaagctgcagaaccaaggaggtggccgaatcttccttcaggatatcaagaaaccggac tgtgatgactgggagagcgggctgaatgcgatggagtgtgcattacatttggaaaaaaat gtgaatcagtcactactggaactgcacaaactggccactgacaaaaatgacccccatttg tgtgacttcattgagacacattacctgaatgagcaggtcaaagccatcaaagaattgggt gaccacatgaccaacttgtgcaagatgggagcacccgaatctagcttggcggaatatctc tttgaccagcacaccctgggagacagtgataatgaaagctaa >gi568815593f:17175217_17375897|GENSCAN_predicted_peptide_9|252_aa XHLAFPHINLVLTPHSGDHQLRGKKRERRTMGTSLPQQAVHLWIPLPDANFAHVMSIKIK KMEPITEISTWLGILALFSTCRSELSLRSQLRDAFLDHPIMDREPPAQQSSSKYKQLLEI QTGGPAPNKVAGKLSCSVKSSQRCSHENNSFCKAIGLLPPWALIALPESSSNLAWEKEEK SADQKNNWPEKSDVRKQHAGGAAPAEALENKGRFKITGLDNEERDYDLQKSSVSRKERLR FARCKLPRNHPL >gi568815593f:17175217_17375897|GENSCAN_predicted_CDS_9|759_bp ngccacctggctttcccacacatcaacctcgtgctcacaccccactctggagaccaccag cttcgaggcaaaaagagagaacgacgtacaatggggacttcgcttccccagcaggcagtt cacctctggatacccctccctgatgcaaactttgctcatgtcatgtctatcaagatcaag aaaatggagcccataactgagatcagcacatggcttggcatcctcgccctcttctccaca tgccggtctgagctgtcactcagaagtcagctcagagatgcctttctggaccatccaatc atggacagagagccaccagcacagcagagcagcagcaaatataaacaacttctagaaatc caaacaggagggccagcaccaaacaaggttgctggcaagctttcttgctcagtaaagtcc tctcaacgctgttcccatgaaaacaattcattctgcaaagcaatcggcctcctgcctcct tgggcactaattgctctgcctgaaagttcgagcaacctagcctgggagaaagaggagaaa tccgcagaccagaagaacaactggccagagaaaagtgatgtcagaaagcagcatgctgga ggagcagctccagcagaagccctggagaacaaggggagattcaaaatcactggacttgac aatgaggagagagattatgaccttcaaaaaagcagcgtcagcagaaaggagcggttacgc tttgctcgatgcaagttacccagaaatcatcccctctaa