GENSCAN 1.0 Date run: 4-Nov-116 Time: 18:05:36 Sequence gi568815594f:187902808_188103737 : 200930 bp : 41.98% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 5254 5295 42 0 0 58 90 53 0.721 2.97 1.02 Intr + 8169 8311 143 2 2 90 75 129 0.323 10.03 1.03 Term + 9598 9745 148 1 1 -27 43 225 0.233 2.89 1.04 PlyA + 11545 11550 6 1.05 2.05 PlyA - 12050 12045 6 1.05 2.04 Term - 26077 25620 458 2 2 85 45 284 0.821 18.10 2.03 Intr - 32851 32726 126 2 0 46 -9 157 0.293 1.53 2.02 Intr - 34808 34670 139 2 1 117 25 108 0.554 6.52 2.01 Init - 35168 35106 63 2 0 64 94 49 0.499 4.30 2.00 Prom - 38023 37984 40 -8.05 3.02 PlyA - 38321 38316 6 1.05 3.01 Sngl - 40089 39901 189 0 0 70 37 202 0.628 8.16 3.00 Prom - 45940 45901 40 -3.65 4.00 Prom + 53403 53442 40 -3.65 4.01 Init + 53975 54209 235 1 1 83 26 111 0.283 2.75 4.02 Term + 57733 57896 164 2 2 16 48 185 0.729 4.42 4.03 PlyA + 59503 59508 6 1.05 5.00 Prom + 59530 59569 40 -4.35 5.01 Init + 62054 62123 70 1 1 76 52 52 0.356 1.66 5.02 Intr + 68024 68349 326 0 2 39 81 425 0.676 31.47 5.03 Intr + 73761 73905 145 2 1 40 61 75 0.512 -1.07 5.04 Term + 74033 74196 164 0 2 123 43 93 0.464 5.42 5.05 PlyA + 76917 76922 6 1.05 6.06 PlyA - 77252 77247 6 1.05 6.05 Term - 80132 79992 141 2 0 69 41 138 0.316 4.15 6.04 Intr - 84772 84653 120 1 0 111 -2 87 0.020 1.77 6.03 Intr - 90044 89875 170 0 2 24 50 259 0.855 14.44 6.02 Intr - 90414 90334 81 1 0 72 90 28 0.311 0.09 6.01 Init - 94172 94121 52 2 1 80 101 43 0.298 4.17 6.00 Prom - 94423 94384 40 -11.04 7.00 Prom + 94991 95030 40 -5.45 7.01 Sngl + 100001 100933 933 1 0 82 37 773 0.998 67.80 7.02 PlyA + 102059 102064 6 1.05 8.06 PlyA - 102302 102297 6 1.05 8.05 Term - 109816 109716 101 2 2 114 43 77 0.430 3.11 8.04 Intr - 129114 128965 150 1 0 105 89 110 0.438 12.01 8.03 Intr - 132824 132704 121 2 1 45 83 69 0.460 1.25 8.02 Intr - 133312 133154 159 1 0 26 61 146 0.652 4.96 8.01 Init - 135658 135584 75 1 0 49 103 29 0.462 1.64 8.00 Prom - 137532 137493 40 -3.25 9.00 Prom + 139125 139164 40 -2.65 9.01 Init + 142498 142597 100 0 1 24 91 88 0.629 3.17 9.02 Term + 146737 147005 269 2 2 54 46 170 0.621 4.07 9.03 PlyA + 148817 148822 6 1.05 10.12 PlyA - 150957 150952 6 1.05 10.11 Term - 160229 160077 153 2 0 96 38 95 0.342 2.24 10.10 Intr - 161449 161380 70 0 1 89 105 74 0.322 7.47 10.09 Intr - 165885 165845 41 0 2 73 82 34 0.043 -2.60 10.08 Intr - 174467 174285 183 2 0 20 51 180 0.117 6.56 10.07 Intr - 187724 187692 33 2 0 82 95 48 0.013 2.40 10.06 Intr - 189134 188578 557 0 2 81 -9 366 0.011 17.83 10.05 Intr - 194354 194254 101 1 2 98 89 33 0.950 3.23 10.04 Intr - 194539 194517 23 1 2 100 113 -3 0.967 -1.18 10.03 Intr - 195472 195398 75 1 0 10 115 79 0.732 1.59 10.02 Intr - 196368 196228 141 0 0 9 116 168 0.434 11.33 10.01 Init - 198419 198249 171 2 0 53 94 113 0.344 7.89 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 80935 81156 222 0 0 88 48 164 0.819 7.30 S.002 Term - 84772 84641 132 1 0 111 48 103 0.965 5.71 S.003 Term - 189134 188566 569 0 2 81 33 376 0.969 24.99 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594f:187902808_188103737|GENSCAN_predicted_peptide_1|110_aa MLWIEEVSFQCGRRGGYSGAFSKALPELIEKVFFVNKQRSRNASLPWSLVKVKCPTCGQE RSRKPTEPQKRAATEKQAVKEAGQQLPYRGPHFHTILHAMERSVLHPGEA >gi568815594f:187902808_188103737|GENSCAN_predicted_CDS_1|333_bp atgctgtggatagaagaggtgtcattccagtgtggccggaggggaggatattctggagcc ttttcaaaggctcttccagagctgattgagaaggttttcttcgtcaataaacagaggtca aggaatgcctctttgccatggagtctggtcaaagttaagtgtcccacttgtggacaggaa cggagcagaaaacccacagagcctcagaaacgagcagcaacagaaaaacaggctgttaag gaagcaggtcaacagctcccctaccgtggtcctcatttccacaccatcctgcatgccatg gaacgatccgtcctccaccctggagaagcttag >gi568815594f:187902808_188103737|GENSCAN_predicted_peptide_2|261_aa MNVWDSEQPRKSQGQMTFRLWGLCLCLSQSFILRGLGGDNCEPWRTENTELNRALLMLGY AHDFCSSGNTLFSLFPQMPADQMSLKQAARLRATLTASQLVPGHSSQNSEPTQMRRNQKT NPGNMTKQGSLTPPKITLAHFDSNQGEIPDLPEKEFRRLVIKLIREAPEKGKAQFKEVKE MTQEMRAEMFKEIDSINKKESKLQETMDTLIEMQNALENLSNRLEQAEERTSELEDKAFK LAQSCKGKEKRRKYEQSLQEV >gi568815594f:187902808_188103737|GENSCAN_predicted_CDS_2|786_bp atgaatgtgtgggactcagaacagccacggaaatcccaggggcagatgacgtttcggctc tgggggctctgcctctgcctgtcccagtccttcatcctcagaggccttggaggtgacaac tgcgaaccgtggaggacggagaacactgaactaaatcgtgcgcttctcatgttgggttac gcacacgacttttgttccagtggaaatactttgttcagcctcttccctcagatgccggca gaccagatgtccttgaagcaggcagcaaggcttcgtgcaacgctgacggcctctcagctt gtccctggacactcttctcagaactctgagcctacccaaatgagaaggaaccagaaaacc aaccctggtaatatgacaaaacaaggttctttaacaccccccaaaatcacactagctcac ttcgattcaaaccaaggagaaatccctgatttacctgaaaaagaattcagaagattagtt attaagctaatcagggaggcaccagagaaaggcaaggcccaatttaaggaagtcaaagaa atgacacaagaaatgagggcagaaatgttcaaggaaatagatagcataaataaaaaagaa tcgaaacttcaggaaacaatggacacacttatagaaatgcaaaacgctctggaaaatctc agcaatagactcgaacaagcagaagaaagaacttcagagcttgaagacaaggctttcaaa ttagcccaatcctgcaaaggcaaagaaaaaagaagaaaatatgaacaaagcctccaagaa gtctag >gi568815594f:187902808_188103737|GENSCAN_predicted_peptide_3|62_aa MDRLSLIGTVHLSDNVKQVKEFAKLIWGKEVLDSEKSQCKASAESVPTLSEEIEGQGSGR LS >gi568815594f:187902808_188103737|GENSCAN_predicted_CDS_3|189_bp atggacaggttgagtcttattgggacggtgcatttaagtgacaatgtgaagcaggtgaag gaatttgccaagttgatctggggaaaggaagtcctggacagtgagaagagccagtgcaaa gccagtgcagagagcgtgccaactttgtccgaggagattgaggggcagggcagcggcaga ttaagttaa >gi568815594f:187902808_188103737|GENSCAN_predicted_peptide_4|132_aa MDEAGNHHSQQTITRTENQTSHVLTHRWELNNENTWTQSGEHQTPGPVVGWRERRGIALG EIPNVNDKLMGAANQHGTLMSTDCRIRSQPQNTFTASPDLVFDGITGYQRLAKWICETDH RKSQTTYAKARK >gi568815594f:187902808_188103737|GENSCAN_predicted_CDS_4|399_bp atggatgaagctggaaaccatcattctcagcaaactatcacaaggacagaaaaccaaaca tcacatgttcttactcataggtgggaactgaacaatgagaacacttggacacagagtggg gaacatcaaacaccggggcctgtcgtcgggtggagggagaggagagggatagcattagga gaaatacctaatgtaaatgacaagttaatgggtgcagcaaatcaacatggcacactaatg tcgactgattgtagaattcggtcacagccacagaataccttcacagcatcacctgattta gtgtttgatggaataaccgggtaccagcgactggccaagtggatatgtgaaactgatcat cgcaaaagtcaaacaacctatgcaaaggccagaaaatga >gi568815594f:187902808_188103737|GENSCAN_predicted_peptide_5|234_aa MPGKALAGPGWPERPPRRPIIEVDRVEDRLAWSKDINAYNGEESTEKLPFRIIHDKNRDL FILLGMLDPAEKDEKDMPVTARVVFVIGPDKKPKLSIIYLATTGRNFDEILRVVTSLQLT AEKRVATPVDWKGSPLLSVQVPTEPTVYLSEKKGKLFPGSWLITRRTKGFCAGPCVLACA VLYTKKKMPFLTSVPGPQLKEQSLGWICEGHHLTTTGAAHLKLSQSAHTNKEIL >gi568815594f:187902808_188103737|GENSCAN_predicted_CDS_5|705_bp atgcccggaaaagccctggctggccctggttggcctgaacgcccaccccgtaggccgata attgaggtcgaccgtgttgaggaccgtcttgcctggagcaaggatatcaatgcttacaat ggtgaagaatccacagaaaagttaccttttcgcatcatccatgataagaatcgggacctt ttcatcctgttgggcatgctggatccagcggagaaggatgaaaaggacatgcctgtgaca gctcgtgtggtgtttgttattggtcctgataagaagccgaagctgtctatcatctacctg gctaccactggcaggaactttgatgagattctcagggtagtcacctctctccagctgaca gcagaaaagagggttgccaccccagttgattggaagggaagccccctcctctctgtgcaa gttcccacggagcccaccgtgtacttgtctgaaaagaaggggaaacttttccctgggagc tggttaatcacacgaagaacaaaaggcttctgtgccggaccttgcgtgcttgcctgtgca gtgctttacacgaagaaaaagatgcccttcctcacaagtgtacctggcccacagctcaaa gagcagagcttgggttggatttgtgaagggcaccatttgaccaccaccggggcagcccat ctaaaactctcccagtcagctcatactaataaggaaatcttgtag >gi568815594f:187902808_188103737|GENSCAN_predicted_peptide_6|187_aa MAGRKVGAGARPRRGADGGGSRPWWIRIPGLLGFECGMARSEQEGISVDGRASLTGEGIV MEGPEMSGGECGEAMQAQTLGPRRPWEELRSSVQEPWERNILKESTKCALEQVDLGDVVD IKKPVQPLSSSGMPCCPIPRKSCPFTDRCVPLSPRYSRTMFSYPAPHFDFPPLNAESSPL AYNAAIA >gi568815594f:187902808_188103737|GENSCAN_predicted_CDS_6|564_bp atggcgggcaggaaggtgggggctggggcgaggcccaggcgcggggctgacggcggaggt tctaggccctggtggatacgtattcctggcctcctggggtttgaatgcggaatggccagg tcagaacaggagggaatcagcgtggacggcagagcctccctgacaggagagggaatagtt atggaaggacctgagatgagtggaggcgagtgtggagaggctatgcaggcccagactctg ggaccccgtaggccatgggaagaacttcgatcttcagtccaggagccctgggaacggaat attctgaaagaatccaccaaatgtgccttagaacaggttgacttaggtgatgtagtcgat ataaagaaaccagtgcagcccctcagctcttcaggcatgccgtgctgtcccatccccagg aaaagctgccccttcactgatcgctgcgtgcccctttctccacgctactcaaggacgatg ttcagttatcccgctccccattttgactttccccctctgaacgctgaatcatccccattg gcatacaatgctgcgatagcatga >gi568815594f:187902808_188103737|GENSCAN_predicted_peptide_7|310_aa MSQQLKKRAKTRHQKGLGGRAPSGAKPRQGKSSQDLQAEIEPVSAVWALCDGYVCYEPGP QALGGDDFSDCYIECVIRGEFSQPILEEDSLFESLEYLKKGSEQQLSQKVFEASSLECSL EYMKKGVKKELPQKIVGENSLEYSEYMTGKKLPPGGIPGIDLSDPKQLAEFARKKPPINK EYDSLSAIACPQSGCTRKLRNRAALRKHLLIHGPRDHVCAECGKAFVESSKLKRHFLVHT GEKPFRCTFEGCGKRFSLDFNLRTHVRIHTGEKRFVCPFQGCNRRFIQSNNLKAHILTHA NTNKNEQEGK >gi568815594f:187902808_188103737|GENSCAN_predicted_CDS_7|933_bp atgagccagcaactgaagaaacgggcaaagacaagacaccagaaaggcctgggtggaaga gcccccagtggggctaagcccaggcaaggcaagtcaagccaagacctgcaggcggaaata gaacctgtcagcgcggtgtgggccttatgtgatggctatgtgtgctatgagcctggccct caggctctcggaggggatgatttctcagactgttacatagaatgcgtcataaggggtgag ttttctcaacccatcctggaagaggactcactttttgagtccttggaatacctaaagaaa ggatcagaacaacagctttctcaaaaggttttcgaagcaagctcccttgaatgttctttg gaatacatgaaaaaaggggtaaagaaagagcttccacaaaagatagttggagagaattcg cttgagtattctgagtacatgacaggcaagaagcttccgcctggaggaatacctggcatt gacctatcagatcctaaacagctcgcagaatttgctagaaagaagccccccataaataaa gaatatgacagtctgagcgcaatcgcttgtcctcagagtggatgcactaggaagttgagg aatagagctgccctgagaaagcatctcctcattcatggtccccgagaccacgtctgtgcg gaatgtgggaaagcgttcgttgagagctcaaaactaaagagacatttcctggttcatact ggagagaagccgtttcggtgcacttttgaagggtgcggaaagcgcttctctctggacttt aatttgcgtacgcacgtgcgcatccacacgggggagaaacgtttcgtgtgtccctttcaa ggctgcaacaggaggtttattcagtcaaataacctgaaagcccacatcctaacgcatgca aatacgaacaagaatgaacaagagggaaagtag >gi568815594f:187902808_188103737|GENSCAN_predicted_peptide_8|201_aa MWVKICFSQPEMTRLNLVSYTSSGQGKCTCKNNSIKTPTSKCQVPTEGSQSEELLETVGC MSLTLAEIKKKGNKKTWKRIHLCAFQRVPLSVLNGMGNHKKIQEQALCVGAMRLCAHSGS EEGGTSSSKLMFYTRAFRSCDSTFKTENLSPFHLLSSVLQQESHRHYARPLPVAGLDCAQ EGGLGVVMLLKRQLVSKAAAN >gi568815594f:187902808_188103737|GENSCAN_predicted_CDS_8|606_bp atgtgggtgaaaatctgcttctctcaaccagaaatgacaagattaaacctcgtaagctat acaagcagtggccagggaaaatgtacatgtaaaaacaacagcatcaagacacctacatca aaatgtcaagtccctactgaaggaagccagagtgaagaattactggagactgttggctgc atgtccctcactcttgccgaaataaagaaaaagggcaataagaaaacctggaaaagaatc cacctgtgtgcctttcaacgggtccctttgtctgtgctgaatggaatgggaaaccacaaa aaaatacaggagcaggccctttgtgtaggtgccatgagactctgtgcccacagtggaagc gaggagggtggcacatcatcctcaaaactcatgttctacacccgagccttcagatcctgt gactctaccttcaaaacagagaatctgagcccttttcacctgctcagctcagtcctgcag caagaaagccataggcattatgcaaggcctctgcctgtggctggcttggattgtgcccag gagggtggtcttggggtagtcatgcttcttaaacggcagttggtttccaaggcagcagca aactag >gi568815594f:187902808_188103737|GENSCAN_predicted_peptide_9|122_aa MLIAYAHSLPPFTITNSETASNSSQVNGIKLLGLLNCVSQNAAVFGERILKDVMKLRPLG WALIQSDCCLCKKRKLGHTLGHPCEAAARAKERGLRGEEGPRRQPQLGLPAFRTPRKWIY AG >gi568815594f:187902808_188103737|GENSCAN_predicted_CDS_9|369_bp atgctaattgcctatgcacattctctcccacctttcaccatcacaaattcagaaacagcc agcaactcctcacaagtgaatggtattaagctgttaggtctcctaaactgcgtatctcag aatgcggctgtgtttggagagaggatcttaaaagacgtgatgaagttaaggccattaggg tgggccttaatccaatctgactgctgtctttgtaagaagaggaaattgggacacaccctg ggacacccctgtgaagcagctgcaagagccaaggagaggggcctcagaggagaggaagga ccccgccgacagcctcagcttgggcttccagccttcaggactccgagaaaatggatctat gctggttaa >gi568815594f:187902808_188103737|GENSCAN_predicted_peptide_10|515_aa MIESEYSMRLRLLNEECEQNLQRQQECISDLNLRETLLNQAIKLATELEEMFQEMLQRLG RVGRENMEKLKESEARASEQVRSLLKLIVELEKKCGEGTLALLKCLDLEIRPPELREMNF CCAEATQSMNAKYSLERSKSLLLEHLEPAHITDLSLCHIRGLSSMFRVLQRHLTLDPETA HPCLALSEDLRTMRLRHGQQDGAGNPERLDFSAMVLAAESFTSGRHYWEVDVEKATRWQV GIYHGSADAKGSTARASGEKVLLTGSVMGTEWTLWVFPPLKRLFLEKKLDTVGVFLDCEH GQISFYNVTEMSLIYNFSHCAFQGALRPVFSLCIPNGDTSPDSLTILQHGPSCDATEQFD LLEEARQHFANRDQMFHLINKECIQPDKGITRHTPPQSVLTRGLWGYQTAGLHQLETAIL TDPILLSLRGIWQGHGTVVEGSSTPPVLESSDPNRGRRHCTTDIPTELSAAVTPTPLFFK HTTRTASRGPDTCWPLGLEYPPGNSPTAPTHAASI >gi568815594f:187902808_188103737|GENSCAN_predicted_CDS_10|1548_bp atgattgagtctgagtatagtatgagactccggttgttgaatgaagagtgtgagcagaat ctccagagacagcaggaatgcatatctgacttgaacctgagagaaacccttctgaatcaa gcgatcaagcttgccaccgagctagaggagatgttccaggaaatgctacagagactgggc cgtgtggggagagagaacatggagaaactgaaggagagtgaagccagggcttctgaacag gtccgcagcctcctaaagctcatcgtggagcttgagaaaaagtgtggggaaggcaccttg gcattgctcaagtgccttgatcttgaaattcggcctccagaactgcgagaaatgaatttc tgctgtgcagaagctactcagtctatgaatgcaaaatactctttagaaaggagcaagtca ctgctgcttgagcatctggagcccgctcatatcacagacctgagtttatgccacataaga ggactcagcagcatgttcagagtactccagagacatttaacattggatcctgaaacagct catccctgcctggcactatctgaggacctgagaactatgagattgagacatgggcagcag gatggggctggcaacccagaaagattggatttcagtgccatggtgctggctgcggagagc ttcacctcagggaggcactactgggaggtggacgtggaaaaggcaaccaggtggcaagtg ggcatataccacggctctgcagacgcgaagggcagcacggccagagcttccggagagaaa gtcttgctcacggggtcggtgatggggaccgagtggactctctgggtcttcccccctctg aaaaggctcttcctggaaaagaagttggacacagttggcgttttccttgactgcgaacac gggcagatatcattctacaatgtgaccgagatgtccctcatttacaatttctcccattgc gccttccaaggagctctcaggcctgtgttttccctctgtatcccaaatggagacacaagt ccagactccctcaccatcttacaacatggtccttcttgtgatgctactgaacaatttgat ttattagaagaagcccgacagcactttgccaacagggaccaaatgtttcacctgatcaac aaagagtgcatccagccagataagggcatcaccaggcacactcctccacaatcagtcctc accagaggactctggggctatcaaacagcaggacttcaccagctcgaaacagccatctta acagaccccatcttgctgtcactgagggggatttggcagggtcatgggacagtagtggag ggaagctctaccccacctgtcctggagagctctgacccgaatcgtggacgccgtcactgc accacggacattccgactgaactctctgctgcagtaactccaactcccctgttcttcaaa cacacaacacgcactgccagtcgaggccctgacacatgctggccattaggcctggaatac cctcccggaaacagccccacggctcctacccatgctgcttccatctaa