GENSCAN 1.0 Date run: 7-Nov-116 Time: 17:43:53 Sequence gi568815595f:58232989_58524957 : 291969 bp : 44.35% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 23599 23717 119 0 2 75 99 40 0.147 3.71 1.02 Intr + 27876 28076 201 0 0 16 41 166 0.092 3.10 1.03 Intr + 34201 34357 157 1 1 120 51 97 0.979 9.21 1.04 Intr + 36333 36446 114 2 0 87 115 104 0.964 13.64 1.05 Intr + 37944 38076 133 2 1 85 119 68 0.999 9.82 1.06 Intr + 41670 41827 158 1 2 59 97 208 0.201 18.53 1.07 Intr + 48566 48646 81 1 0 43 98 42 0.001 0.53 1.08 Intr + 52097 52151 55 1 1 77 84 109 0.989 7.95 1.09 Intr + 52365 52465 101 1 2 51 91 123 0.195 8.73 1.10 Term + 60601 60777 177 0 0 73 54 195 0.752 12.29 1.11 PlyA + 61505 61510 6 1.05 2.04 PlyA - 61879 61874 6 1.05 2.03 Term - 65186 65096 91 1 1 136 44 57 0.623 3.29 2.02 Intr - 65435 65272 164 2 2 91 91 37 0.128 3.07 2.01 Init - 71928 71926 3 0 0 103 81 0 0.096 0.80 2.00 Prom - 75888 75849 40 -4.26 3.00 Prom + 76255 76294 40 -6.76 3.01 Init + 77342 77418 77 1 2 74 95 61 0.852 5.96 3.02 Intr + 77519 77603 85 2 1 114 64 83 0.600 8.32 3.03 Intr + 99977 100102 126 1 0 98 74 260 0.008 26.48 3.04 Intr + 132886 132936 51 0 0 101 115 29 0.199 6.00 3.05 Intr + 136443 136490 48 2 0 104 99 10 0.120 2.58 3.06 Intr + 149526 149712 187 2 1 88 107 42 0.060 5.36 3.07 Intr + 157594 157671 78 2 0 82 66 58 0.683 2.52 3.08 Intr + 158159 158232 74 0 2 73 94 13 0.802 -0.47 3.09 Intr + 158785 158859 75 0 0 111 97 -3 0.761 2.61 3.10 Intr + 162010 162114 105 0 0 77 77 78 0.983 6.01 3.11 Intr + 162670 162771 102 0 0 86 94 52 0.984 5.97 3.12 Intr + 164051 164212 162 1 0 65 101 90 0.994 8.17 3.13 Intr + 164617 164734 118 0 1 102 52 96 0.569 7.44 3.14 Intr + 166254 166389 136 1 1 9 97 41 0.129 -3.27 3.15 Intr + 170874 170922 49 0 1 90 98 39 0.400 3.78 3.16 Intr + 175936 176013 78 0 0 113 99 39 0.365 7.25 3.17 Intr + 177102 177171 70 2 1 85 93 54 0.307 4.35 3.18 Term + 191764 191972 209 2 2 139 48 128 0.973 11.40 3.19 PlyA + 192145 192150 6 1.05 4.11 PlyA - 192191 192186 6 1.05 4.10 Term - 195191 195046 146 0 2 104 29 115 0.935 5.27 4.09 Intr - 195626 195485 142 2 1 65 63 114 0.978 6.53 4.08 Intr - 196811 196720 92 0 2 115 105 -1 0.999 4.01 4.07 Intr - 197250 197140 111 1 0 56 116 43 0.935 4.25 4.06 Intr - 197954 197669 286 2 1 84 76 170 0.984 12.31 4.05 Intr - 198640 198605 36 1 0 82 80 33 0.558 0.36 4.04 Intr - 198805 198743 63 1 0 50 99 51 0.713 1.31 4.03 Intr - 198996 198889 108 0 0 74 88 126 0.985 11.68 4.02 Intr - 200696 200643 54 2 0 115 92 68 0.970 9.08 4.01 Init - 200821 200780 42 1 0 104 80 40 0.704 4.61 4.00 Prom - 204837 204798 40 -1.66 5.00 Prom + 206624 206663 40 -6.06 5.01 Init + 207154 207257 104 0 2 85 115 60 0.457 8.12 5.02 Intr + 228230 228352 123 2 0 44 113 62 0.027 4.00 5.03 Intr + 239329 239472 144 1 0 75 75 60 0.026 2.80 5.04 Intr + 265725 265794 70 0 1 50 101 70 0.333 3.68 5.05 Term + 267958 268644 687 0 0 73 39 497 0.739 36.91 5.06 PlyA + 269077 269082 6 1.05 6.06 PlyA - 269385 269380 6 1.05 6.05 Term - 269787 269782 6 0 0 97 37 0 0.007 -6.33 6.04 Intr - 276037 275905 133 0 1 88 68 156 0.803 14.25 6.03 Intr - 284435 284218 218 2 2 58 117 212 0.953 18.40 6.02 Intr - 289613 289508 106 1 1 89 105 22 0.982 4.22 6.01 Intr - 291617 291438 180 1 0 131 121 60 0.996 12.58 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 52365 52531 167 1 2 51 55 181 0.800 9.18 S.002 Term - 99863 99535 329 0 2 2 47 279 0.867 10.07 S.003 Intr - 100169 99964 206 1 2 58 75 119 0.943 6.44 S.004 Intr - 100651 100422 230 1 2 52 105 161 0.964 10.87 S.005 Init + 188828 188864 37 1 1 52 121 39 0.888 3.80 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:58232989_58524957|GENSCAN_predicted_peptide_1|431_aa MDLDVVNMFVIAGGTLAIPILAFVASFLLWPSALIRIYYWEQEQVSPVGHSKSSEQLSLG LSVPGAWPAHFHNTVDYGCSCRRYQMLQHESTESHTGDAVATADDIRYWRRTLGMQVRYV HHEDYQFCYSFRGRPGHKPSILMLHGFSAHKDMWLSVVKFLPKNLHLVCVDMPGHEGTTR SSLDDLSIDGQVKRIHQFVECLKLNKKPFHLVGTSMGGQVAGVYAAYYPSDVSSLCLVCP AGLQYSTDNQFVQRLKELQGSAAVEKIPLIPSTPEEMSEMLQLCSYVRFKVPQQVRRCLP CFSSPSVGCTHCPTSPSEMNQILQGLVDVRIPHNNFYRKLFLEIVSEKSRYSLHQNMDKI KVPTQIIWGKQDQVLDVSGADMLAKSIANCQVELLENCGHSVVMERPRKTAKLIIDFLAS VHNTDNNKKLD >gi568815595f:58232989_58524957|GENSCAN_predicted_CDS_1|1296_bp atggatcttgatgtggttaacatgtttgtgattgcgggcggcacgctggccatcccaatc ctggcatttgtggcttcatttcttctgtggccttcagcactgataagaatctattattgg gaacaagaacaggtgtccccagtaggccacagcaagtcttcagagcagctctctctgggt ctaagtgtccctggagcatggccagcacacttccacaacaccgtggactatggctgctcc tgcaggaggtaccaaatgcttcaacatgagtcaacagagagccacaccggggatgctgtg gccactgcggatgacatcaggtactggcggaggacattgggcatgcaagtccgctatgtt caccatgaagactatcagttctgttattccttccggggcaggcctgggcacaaaccctcc atcctcatgctccacggattctctgcccacaaggatatgtggctcagtgtggtcaagttc cttccaaagaacctgcacttggtctgcgtggacatgccaggacatgagggcaccacccgc tcctccctggatgacctgtccatagatgggcaagttaagaggatacaccagtttgtagaa tgcctgaagctgaacaaaaaacctttccacctggtaggcacctccatgggtggccaggtg gctggggtgtatgctgcttactacccatcggatgtctccagcctgtgtctcgtgtgtcct gctggcctgcagtactcaactgacaatcaatttgtacaacggctcaaagaactgcagggc tctgccgccgtggagaagattcccttgatcccgtctaccccagaagagatgagtgaaatg cttcagctctgctcctatgtccgcttcaaggtgccccagcaggtgaggcgatgcctgccc tgcttcagctcgccctctgtgggctgcacccactgtccaaccagtcccagtgagatgaac cagatcctgcaaggccttgtcgatgtccgcatccctcataacaacttctaccgaaagttg tttttggaaatcgtcagtgagaagtccagatactctctccatcagaacatggacaagatc aaggttccgacgcagatcatctgggggaaacaagaccaggtgctggatgtgtctggggca gacatgttggccaagtcaattgccaactgccaggtggagcttctggaaaactgtgggcac tcagtagtgatggaaagacccaggaagacagccaagctcataatcgactttttagcttct gtgcacaacacagacaacaacaagaagctggactga >gi568815595f:58232989_58524957|GENSCAN_predicted_peptide_2|85_aa MTRLMMVTRLSSHCQHRQPQVHTLVGTSPREQILGPRALRQLGTPEEPAPGLSPGRKWRN CKNSTQFGVGEGEKADEKVQSVVEG >gi568815595f:58232989_58524957|GENSCAN_predicted_CDS_2|258_bp atgactaggctgatgatggtcacaagactctcctcccactgccagcacaggcaaccccaa gtccacaccctggttggtaccagccccagggagcaaatactgggccccagagccctgcgg cagctggggacaccagaagagccagccccaggactgtcacctggcaggaaatggagaaat tgcaagaacagcacccagtttggggttggggagggagaaaaggctgatgagaaggtccag tctgtggttgaagggtga >gi568815595f:58232989_58524957|GENSCAN_predicted_peptide_3|609_aa MPAPAATYERVVYKNPSEYHYMKVCLEFQDCGVGLNAAQFKQLLISAVKDLFGEAAAAGR PGMAFMEKPPAGKVLLDDTVPLTAAIEASQSLQSHTEYIIRVQRGISVENSWQIVRRYSD FDLLNNSLQIAGLSLPLPPKKLIGNMDREFIAERQKGLQNYLNVITTNHILSNCELVKKF LDPNNYSANYTEIALQQVSMFFRSEPKWEVVEPLKDIGWRIRKKYFLMKIKNQPKERLVL SWADLGPDKYLSDKDFQCLIKLLPSCLHPYIYRVTFATANESSALLIRMFNEKGTLKDLI YKAKPKDPFLKKYCNPKKIQGLELQQIKTYGRQILEVLKFLHDKGFPYGHLHASNVMLDG DTCRLLDLENSLLGLPSFYRSYFSQFRKINTLESVDVHCFGHLLYEMTYGRPPDSVPVDS FPPAPSMAVASGFASVFCGTKMCICSFQVAVLESTLSCEACKNGMPTISRLLQMPLFSDV LLTTSEKPQFKIPTKLKEALRIAKECIEKRLIEEQKQKSKRSALENSEEHSAKYSNSNNS GISALPPPPPPPPPPAAPLPPASTEAPAQLSSQAVNGMSRGALLSSIQNFQKGTLRKAKT CDHSAPKIG >gi568815595f:58232989_58524957|GENSCAN_predicted_CDS_3|1830_bp atgcctgcccctgctgccacatatgaaagagtagtttacaaaaacccttccgagtaccac tacatgaaagtctgcctagaatttcaagattgtggagttggactgaatgctgcacagttc aaacagctgcttatttcggctgtgaaggacctgtttggggaggcggcggcggccgggcgt cccgggatggccttcatggagaagccgccagccggcaaggtgctgctggacgacacggtg ccgctgacagcagccatcgaggcgagccagagcctgcagtcccacacggaatatattatt cgagtgcaaagaggaatttctgtggaaaacagctggcagattgttagaagatacagtgac tttgatttgcttaacaacagcttacagattgcaggcctaagtctacctcttcctcccaaa aaattgattggtaacatggatcgtgaattcatagctgaaaggcagaaaggtcttcagaac tatctcaacgtgatcacaacaaatcatatcttgtctaattgtgagctggttaagaagttt ttagatccaaacaactattccgcaaactatactgagattgccttgcaacaggtttccatg ttcttccgatcagaaccaaagtgggaggtggtggaacctttgaaagacataggttggaga ataaggaagaaatatttcttgatgaagattaaaaatcagccaaaggaacggctagtgtta agctgggctgaccttggcccagacaagtatttgtcagataaagattttcagtgtctaatc aaacttctgccttcttgtttgcacccttacatctatcgggttacctttgccacagctaat gaatcctcagcgttgctaattaggatgtttaacgaaaagggaacattgaaggatctgatc tacaaggcaaaaccaaaagacccatttctaaagaagtactgcaaccctaagaagattcag ggcctggaactccagcaaataaaaacatatggacggcaaatattagaggtactgaagttt cttcatgacaagggattcccttatgggcatcttcacgcctccaatgtgatgctcgatggg gacacttgccggctgctggaccttgagaattccttattgggcctgccttccttctaccga tcttatttttcacaattcaggaaaatcaatacattggaaagtgtggatgtccactgcttt ggccacttactgtatgaaatgacttatggacgaccgccagactcggtgcctgtggactcc ttccctcctgccccgtccatggctgtggcaagtggatttgccagcgtgttctgtggaact aaaatgtgtatctgttcatttcaagtggccgtgttggagtctacgctgtcttgtgaagcc tgtaaaaatggcatgcctaccatctcccggctcttacagatgccattattcagcgatgtt ttactaaccacttctgaaaaaccacagtttaagatccctacaaagttaaaagaggcattg agaattgccaaagaatgtatagagaagagactaattgaggaacagaaacagaagtcaaaa cgatctgctcttgaaaatagtgaagagcattcagcgaagtacagcaactccaataattca gggatatctgcattacctccacctcctccacctccaccaccaccagcagctcccttgcct cctgcgagcaccgaggcacctgcccagctctcgtctcaggctgtgaatggcatgagccga ggggccttgctcagctccatccagaatttccaaaaaggaactttgaggaaagccaaaacc tgtgatcacagtgctccgaagatcggctga >gi568815595f:58232989_58524957|GENSCAN_predicted_peptide_4|359_aa MAAVSGLVRRPLREVSGLLKRRFHWTAPAALQVTVRDAINQGMDEELERDEKVFLLGEEV AQYDGAYKVSRGLWKKYGDKRIIDTPISEMGFAGIAVGAAMAGLRPICEFMTFNFSMQAI DQVINSAAKTYYMSGGLQPVPIVFRGPNGASAGVAAQHSQCFAAWYGHCPGLKVVSPWNS EDAKGLIKSAIRDNNPVVVLENELMYGVPFEFPPEAQSKDFLIPIGKAKIERQGTHITVV SHSRPVGHCLEAAAVLSKEGVECEVINMRTIRPMDMETIEASVMKTNHLVTVEGGWPQFG VGAEICARIMEGPAFNFLDAPAVRVTGADVPMPYAKILEDNSIPQVKDIIFAIKKTLNI >gi568815595f:58232989_58524957|GENSCAN_predicted_CDS_4|1080_bp atggcggcggtgtctggcttggtgcggagaccccttcgggaggtctccgggctgctgaag aggcgctttcactggaccgcgccggctgcgctgcaggtgacagttcgtgatgctataaat cagggtatggatgaggagctggaaagagatgagaaggtatttctgcttggagaagaagtt gcccagtatgatggggcatacaaggttagtcgagggctgtggaagaaatatggagacaag aggattattgacactcccatatcagagatgggctttgctggaattgctgtaggtgcagct atggctgggttgcggcccatttgtgaatttatgaccttcaatttctccatgcaagccatt gaccaggttataaactcagctgccaagacctactacatgtctggtggccttcagcctgtg cctatagtcttcagagggcccaatggtgcctcagcaggtgtagctgcccagcactcacag tgctttgctgcctggtatgggcactgcccaggcttaaaggtggtcagtccctggaattca gaggatgctaaaggacttattaaatcagccattcgggataacaatccagtggtggtgcta gagaatgaattgatgtatggggttccttttgaatttcctccggaagctcagtcaaaagat tttctgattcctattggaaaagccaaaatagaaaggcaaggaacacatataactgtggtt tcccattcaagacctgtgggccactgcttagaagctgcagcagtgctatctaaagaagga gttgaatgtgaggtgataaatatgcgtaccattagaccaatggacatggaaaccatagaa gccagtgtcatgaagacaaatcatcttgtaactgtggaaggaggctggccacagtttgga gtaggagctgaaatctgtgccaggatcatggaaggtcctgcgttcaatttcctggatgct cctgctgttcgtgtcactggtgctgatgtccctatgccttatgcaaagattctagaggac aactctatacctcaggtcaaagacatcatatttgcaataaagaaaacattaaatatttag >gi568815595f:58232989_58524957|GENSCAN_predicted_peptide_5|375_aa MGASLDQEHSGHPAGSRGVEVSSGSATVANSSGGRSIFCPNEVTLGGFLDGGRSPERPSH DEKFGAFSSTPYPLGRLPDSGRYSQSQLPSELGEREKGKYKKTQPCGLTPYSMIQAHSQL CVSSFPETWALEDASLEQMDNGDWGYMMTDPVTLNVGGHLYTTSLTTLTRYPDSMLGAMF GGDFPTARDPQGNYFIDRDGPLFRYVLNFLRTSELTLPLDFKEFDLLRKEADFYQIEPLI QCLNDPKPLYPMDTFEEVVELSSTRKLSKYSNPVAVIITQLTITTKVHSLLEGISNYFTK WNKHMMDTRDCQVSFTFGPCDYHQEVSLRVHLMEYITKQGFTIRNTRVHHMSERANENTV EHNWTFCRLARKTDD >gi568815595f:58232989_58524957|GENSCAN_predicted_CDS_5|1128_bp atgggtgcctcgctggatcaggagcacagcggacaccctgccggatccagaggggtggaa gtcagcagcgggtctgcaacggtggcaaacagcagtggtggacggagcatcttttgtcct aatgaggtgactcttggtgggttcctggatgggggccggtctccagaaagaccaagccat gatgagaagtttggagcttttagctccactccctatcctctgggaaggcttccagactct ggacggtattcccagagtcagctcccttctgaattgggagaaagagagaaaggaaaatac aagaagacccaaccatgtggattaacaccctatagcatgatccaggcccacagccagctc tgtgtttccagtttccctgaaacctgggctcttgaagacgcatcactggagcagatggat aatggagactggggctatatgatgactgacccagtcacattaaatgtaggtggacacttg tatacaacgtctctcaccacattgacgcgttacccggattccatgcttggagctatgttt gggggggacttccccacagctcgagaccctcaaggcaattactttattgatcgagatgga cctcttttccgatatgtcctcaacttcttaagaacttcagaattgaccttaccgttggat tttaaggaatttgatctgcttcggaaagaagcagatttttaccagattgagcccttgatt cagtgtctcaatgatcctaagcctttgtatcccatggatacttttgaagaagttgtggag ctgtctagtactcggaagctttctaagtactccaacccagtggctgtcatcataacgcaa ctaaccatcaccactaaggtccattccttactagaaggcatctcaaattattttaccaag tggaataagcacatgatggacaccagagactgccaggtttcctttacttttggaccctgt gattatcaccaggaagtttctcttagggtccacctgatggaatacattacaaaacaaggt ttcacgatccgcaacacccgggtgcatcacatgagtgagcgggccaatgaaaacacagtg gagcacaactggactttctgtaggctagcccggaagacagacgactga >gi568815595f:58232989_58524957|GENSCAN_predicted_peptide_6|214_aa XFLVKSYLQTQMSPGSTPQRSLSPSVAYLTAPDLARCPAQRAADFLCPELYTTAWAHVAV RLIKDSVQHLQTLTQSGADQHEAWNQTTVIHLQAAKVHCYYVTVKGFTEALEKLENEPAI QQVLKRLCDLHAIHGILTNSGDFLHDAFLSGAQVDMARTAYLDLLRLIRKDAILLTDAFD FTDQCLNSALGCYDGNVYERLFQWAQKSPTNTQK >gi568815595f:58232989_58524957|GENSCAN_predicted_CDS_6|645_bp nngttcctggtgaagagctacctgcagactcagatgtcccctggctccacgccacagaga tctctctctccatctgtcgcatatctcaccgcacctgacctggccaggtgtccagcccag agggcagccgacttcctctgcccggagctctacaccacggcctgggcacatgtggcagta aggctcataaaggactcagtgcagcatttacagaccctgacgcaatccggagctgaccag cacgaggcttggaaccagaccactgtcatacacctccaggctgctaaggtgcactgctac tatgtcactgtgaagggttttacagaagctctggagaaactagaaaatgaaccagcgatt cagcaggtgctcaagcgcctctgtgacctccatgccatacatggaatcttgactaactcg ggtgactttctccatgacgccttcctgtctggtgcccaagtggacatggcaagaacagcc tacctggacctgctccgcctgatccggaaggatgccatcctgttaactgatgcttttgac ttcaccgatcagtgtttaaattcagcacttggctgttatgatggaaacgtctacgaacgc ctgttccagtgggctcagaagtcaccaaccaatactcagaaataa