GENSCAN 1.0 Date run: 8-Nov-116 Time: 16:19:07 Sequence gi568815592f:158563269_158864601 : 301333 bp : 46.32% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1755 1885 131 2 2 86 20 173 0.942 8.00 1.02 Intr + 10152 10255 104 0 2 112 94 206 0.833 23.42 1.03 Intr + 22036 22157 122 2 2 117 84 115 0.987 14.21 1.04 Intr + 26404 26514 111 0 0 77 98 106 0.977 11.08 1.05 Intr + 41999 42079 81 1 0 53 74 102 0.348 5.13 1.06 Intr + 43976 44075 100 1 1 62 64 43 0.581 -1.02 1.07 Intr + 45065 45195 131 0 2 83 99 187 0.806 19.71 1.08 Intr + 46661 46735 75 1 0 55 70 75 0.556 2.11 1.09 Intr + 61836 61938 103 2 1 114 106 61 0.681 10.25 1.10 Intr + 66456 66551 96 1 0 69 103 37 0.904 3.28 1.11 Intr + 68055 68121 67 1 1 65 106 96 0.995 7.06 1.12 Term + 68542 68620 79 1 1 51 52 198 0.999 9.74 1.13 PlyA + 69804 69809 6 1.05 2.08 PlyA - 69853 69848 6 1.05 2.07 Term - 73629 73559 71 1 2 143 43 76 0.995 6.70 2.06 Intr - 74626 74503 124 1 1 91 20 80 0.960 1.66 2.05 Intr - 78092 78051 42 2 0 119 94 33 0.907 5.44 2.04 Intr - 91285 91116 170 2 2 108 1 82 0.014 1.17 2.03 Intr - 92050 91998 53 0 2 72 89 18 0.381 -1.15 2.02 Intr - 93088 92907 182 1 2 73 76 56 0.357 1.57 2.01 Init - 96473 96381 93 2 0 51 61 92 0.367 3.18 2.00 Prom - 98725 98686 40 -8.06 3.00 Prom + 98986 99025 40 -6.56 3.01 Init + 100001 100110 110 1 2 80 78 84 0.963 6.29 3.02 Intr + 102073 102345 273 1 0 62 115 208 0.935 17.55 3.03 Intr + 119657 119721 65 2 2 103 84 71 0.472 6.56 3.04 Term + 127594 127676 83 2 2 69 35 89 0.359 -0.44 3.05 PlyA + 127914 127919 6 1.05 4.00 Prom + 136693 136732 40 -6.46 4.01 Init + 139956 140116 161 2 2 61 28 216 0.511 10.25 4.02 Intr + 145054 145123 70 1 1 84 59 85 0.982 4.38 4.03 Intr + 149802 149939 138 2 0 82 64 61 0.930 3.76 4.04 Intr + 150532 150610 79 0 1 101 106 58 0.814 8.02 4.05 Intr + 154819 154943 125 2 2 101 42 75 0.501 4.50 4.06 Intr + 162235 162369 135 0 0 113 94 -33 0.001 0.56 4.07 Intr + 182212 182390 179 0 2 75 94 136 0.005 11.62 4.08 Intr + 188660 188762 103 2 1 117 75 104 0.965 12.18 4.09 Intr + 193943 194113 171 1 0 124 59 216 0.867 22.44 4.10 Intr + 197372 197477 106 1 1 93 86 63 0.905 6.39 4.11 Term + 201227 201336 110 0 2 111 44 31 0.714 -0.33 4.12 PlyA + 201582 201587 6 1.05 5.17 PlyA - 201953 201948 6 1.05 5.16 Term - 203810 203646 165 2 0 49 42 384 0.999 27.82 5.15 Intr - 204244 203993 252 1 0 38 81 576 0.957 49.53 5.14 Intr - 206150 206058 93 2 0 87 96 128 0.948 13.66 5.13 Intr - 206676 206516 161 1 2 96 62 230 0.998 20.91 5.12 Intr - 207626 207496 131 1 2 80 55 244 0.999 20.64 5.11 Intr - 208139 207976 164 2 2 76 89 238 0.999 21.47 5.10 Intr - 213236 213140 97 1 1 93 95 90 0.983 10.21 5.09 Intr - 220398 220252 147 2 0 69 78 137 0.991 10.15 5.08 Intr - 221459 221376 84 1 0 94 89 72 0.991 6.84 5.07 Intr - 222366 222041 326 0 2 -4 105 438 0.953 30.87 5.06 Intr - 226103 226020 84 2 0 105 77 117 0.965 12.32 5.05 Intr - 232564 232446 119 2 2 104 54 31 0.668 1.48 5.04 Intr - 254898 254814 85 0 1 69 100 156 0.864 14.29 5.03 Intr - 271645 271527 119 2 2 78 57 66 0.262 2.68 5.02 Intr - 272829 272725 105 1 0 82 42 60 0.152 0.99 5.01 Init - 278902 278611 292 1 1 87 4 221 0.163 10.91 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 53654 52611 1044 2 0 43 41 272 0.991 15.15 S.002 Init - 81440 81414 27 2 0 81 76 33 0.857 1.07 S.003 Term - 91285 91041 245 2 2 108 53 99 0.859 4.46 S.004 Sngl + 112710 113363 654 2 0 86 54 179 0.964 10.49 S.005 Init + 182248 182390 143 0 2 81 94 127 0.993 12.21 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592f:158563269_158864601|GENSCAN_predicted_peptide_1|399_aa MKVSLTPLGPLLRMLVLLGSVEVGMLVLASLTSLSLSLQDSNLRLAPMRLYTLSKRHFVL VFVVFFICFGLTIFVGIRETSIKTSFPMTVKVDGVAQDGTTMYIHNKVHNRTRTLTCAGK CAEIIVAHLGYLNYTQYTVIVGFEHLKLPIKGMNFTWKTYNPAFSRLEIWFRFFFVVLTF IVTCLFAHSLRKFSMRDWGIEQKWMSVLLPLLLLYNDPFFPLSFLVNSWLPGMLDDLFQS MFLCALLLFWLCVYHGIRVQVEEYVRTAMRPTDVGKVLQQGFHGQGMKVFFMVVAAVYIL YLLFLIVRACSELRHMPYVAPPAEFLSFYGLLNFYLYTLAFVYSPSKNALYESQLKDNPA FSMLNDSDDDVIYGSDYEEMPLQNGQAIRAKYKEESDSD >gi568815592f:158563269_158864601|GENSCAN_predicted_CDS_1|1200_bp atgaaggtgtctctcacaccccttgggcccctgctgcggatgctggtgttgctgggttct gtagaagtgggaatgctggtgctggcttccctcacatctctgtccctctcgctgcaggac agcaatttgaggctggcgcccatgcggctctacacgctctccaagcgccactttgtcctc gtgtttgtcgtcttcttcatctgctttggcctgaccatcttcgttgggatcagagaaact tctattaagacaagctttcccatgactgttaaagtcgatggtgtagctcaagatggaacc acgatgtacattcataacaaagttcacaaccggacaaggaccctcacatgtgcagggaaa tgtgcggagattattgtggctcaccttggctacctgaactacactcagtatacagtgata gtgggatttgaacacctgaagctccccatcaagggaatgaacttcacatggaagacttat aaccctgccttctcccggttggaaatctggttccggtttttctttgtggtgctcaccttc atcgtcacttgcctgtttgcgcattccctccggaaattttccatgagagactggggcatc gagcagaagtggatgtctgttctcctgcctctgctgctactttacaatgatccgttcttc cccctctccttcctggtcaacagctggctcccagggatgctggatgacctctttcagtcc atgttcctgtgcgccctgctgctcttctggctgtgcgtgtaccacgggattcgtgtccag gtggaggagtacgtcaggactgccatgaggcccaccgacgttgggaaagtacttcagcag gggttccatggccagggaatgaaggtcttcttcatggtggtggcagcggtgtacattctg tacctcttgttcttgatagtgcgggcgtgttccgagctacgtcacatgccttatgtggca ccaccagccgagttcttatctttctatggcctgttgaacttctatctctacaccttggcc tttgtatattctccatcgaagaatgccctctatgagtcccagctgaaagacaatcctgcc ttctccatgctgaatgactcggatgatgatgtgatttatgggagtgactatgaggaaatg ccgctgcagaacggccaggccatccgggccaagtacaaggaggagtcagatagtgactga >gi568815592f:158563269_158864601|GENSCAN_predicted_peptide_2|244_aa MKSQDAVYSFNTLKEIQKDILKMQTTKTIAQANDHMQPPRAAVMLLNGPRGRGRLTGDRK GQDTCLAETPPPGLGEEGSCVRIGKEEKASLGAGSPYSSTAGSKDGQGARRPKSRKLSHC SGIGRDAGSHGENFNLGVSIGLYLVMLYPSGTSASRQTQFGYGIECTAFVVDEVSNIVKE AIESAIGGNAYQHSKVNQWTTNVVEQTLSQLTKLGKPFKYIGSCTVRWENKTMYCIVSAF GLSI >gi568815592f:158563269_158864601|GENSCAN_predicted_CDS_2|735_bp atgaaaagtcaagatgctgtctatagctttaatacactgaaagaaatacagaaagacatc ctcaaaatgcaaacgactaagacaattgcccaggcaaacgaccacatgcaaccacccaga gcagcagtgatgttgctgaatggaccacgaggaagaggaagactgacaggtgacaggaaa ggacaggacacctgcctggctgagacgccaccgccaggcctgggagaagaggggagttgt gtgaggattggaaaggaggagaaggcatctctaggggcaggaagcccatattcaagtaca gcaggaagtaaggacgggcagggggcaagaaggcccaagagtaggaagttaagccactgt tctgggattgggagggacgcgggcagccatggggaaaacttcaatctaggtgtttccatt ggtctttatctggtcatgctgtatcccagtggaacttcggcatccagacaaacacagttt ggttacggcattgagtgtactgcttttgttgttgatgaagtgagcaacattgtaaaagag gctatagaaagcgcaattggtggtaacgcttatcaacacagcaaagtgaaccagtggacc acaaatgtagtagaacaaactttaagccaactcaccaagctgggaaaaccatttaaatac atcgggagctgcactgtgcgatgggagaataagaccatgtactgcatcgtcagtgccttc ggactgtctatttga >gi568815592f:158563269_158864601|GENSCAN_predicted_peptide_3|176_aa MAQEIDLSALKELEREAILQVLYRDQAVQNTEEERTRLGSGVPSNTIFFNSEGCRKLKTH LQHLRWKGAKNTDWEHKEKCCARCQQVLGFLLHRGAVCRGCSHRVCAQCRVFLRGTHAWK CTVCFEDRNVKIKTGEWFYEERAKKFPTGGSSPGQPGIITFCASIHGHLAIRAEGF >gi568815592f:158563269_158864601|GENSCAN_predicted_CDS_3|531_bp atggcccaagaaatagatctgagtgctctcaaggagttagaacgcgaggccattctccag gtcctgtaccgagaccaggcggttcaaaacacagaggaggagaggacacgcctggggtca ggtgttccatctaatactatcttctttaactctgagggctgcaggaaactgaaaacacac ctgcagcatctccggtggaaaggagcgaagaacacggactgggagcacaaagagaagtgc tgtgcgcgctgccagcaggtgctggggttcctgctgcaccggggcgccgtgtgccggggc tgcagccaccgcgtgtgtgcccagtgccgagtgttcctgagggggacccatgcctggaag tgcacggtgtgcttcgaggacaggaatgtcaaaataaaaactggagaatggttctatgag gaacgagccaagaaatttccaactggaggaagctccccaggacaacctggcatcatcacc ttctgtgcctccatccacgggcatttggcaatccgtgcagagggtttttaa >gi568815592f:158563269_158864601|GENSCAN_predicted_peptide_4|458_aa MLRAVPGRHGCQPAMCLRQALAASDLGPGLQLFLSLAKTGIPAKVTRQPDRAGCKISVVP PTPPPVSESQCSRSPGRFQTETGVTFKILLYFLLEISNSPESFQYLRAHDVFIGQMDDWM GGNLQEFGQFRGFNKSVENLFLSLATHVKKLSKSQNDMTSEKHLLATGPRQCVGQTERRS QSDTAVNVTTRKVSAPDILKPLNQEDPKCSTNPILKQQNLPSSPAPSTIFSGGFRHGSLI SIDSTCTEMGNFDNANVTGEIEFAIHYCFKTHSLEICIKACKNLAYGEEKKKKCNPYVKT YLLPDRSSQGKRKTGVQRNTVDPTFQETLKYQVAPAQLVTRQLQVSVWHLGTLARRVFLG EVIIPLATWDFEDSTTQSFRWHPLRAKAEKYEDSVPQSNGELTVRAKLVLPSRPRKLQEA QEEGDTAVGGDACSLSKLQWQKVLSSPNLWTDMTLVLH >gi568815592f:158563269_158864601|GENSCAN_predicted_CDS_4|1377_bp atgctccgcgccgtgcctggcagacacggctgccaaccagccatgtgcctgagacaagcc ctggcagcctcagacctgggccctggcctgcagctgttcctgtccctggcaaagacgggt atccctgctaaagtcactcgccagccagaccgggcagggtgcaaaatttctgtggttcct cctactccacctcctgtcagcgagagccagtgcagccgcagtcctggcaggtttcaaact gagactggggtgaccttcaaaatcctgctctactttctcctggaaatttcaaattctcca gagtcgttccagtacctacgtgctcatgatgtatttattggacagatggatgattggatg ggtggtaatttacaggaatttggtcagtttagaggatttaataagtccgtggaaaatttg tttctgtctcttgctacccacgtgaaaaagctctccaaatcccagaatgatatgacttct gagaagcatcttctcgccacgggccccaggcagtgtgtgggacagacagagagacggagc cagtctgacactgcggtcaacgtcaccaccaggaaggtcagtgcaccagatattctgaaa cctctcaatcaagaggatcccaaatgctctactaaccctattttgaagcaacagaatctc ccatccagtccggcacccagtaccatattctctggaggttttagacacggaagtttaatt agcattgacagcacctgtacagagatgggcaattttgacaatgctaatgtcactggagaa atagaatttgccattcattattgcttcaaaacccattctttagaaatatgcatcaaggcc tgtaagaaccttgcctatggagaagaaaagaagaaaaagtgcaatccgtatgtgaagacc tacctgttgcccgacagatcctcccagggaaagcgcaagactggagtccaaaggaacacc gtggacccgacctttcaggagaccttgaagtatcaggtggcccctgcccagctggtgacc cggcagctgcaggtctcggtgtggcatctgggcacgctggcccggagagtgtttcttgga gaagtgatcattcctctggccacgtgggactttgaagacagcacaacacagtccttccgc tggcatccgctccgggccaaggcggagaaatacgaagacagcgttcctcagagtaatgga gagctcacagtccgggctaagctggttctcccttcacggcccagaaaactccaagaggct caagaagagggagacacagctgttggcggggatgcatgctcactatcgaagctccagtgg cagaaagtcctttccagccccaatctatggacagacatgactcttgtcctgcactga >gi568815592f:158563269_158864601|GENSCAN_predicted_peptide_5|807_aa METLYRVPFLVLECPNLKLKKPPWLQVLSAMIVYALMVVSYFLVTGGIIYDVIVEPPSIG SMTDEHGHQRPVAFLAYRVNEQCIMEGLASSFLFTIGASRMGVVPGPSFSLTAQSAQPSL APAGPVLSYVVEEALEWKQGFLYDSLSGWGCPDGKAMLPGYSHDRQAGPSSWVGTASSLL LDSRVFGDRGYSPETENAETMIPKFGLLKITCGSDSFLGCGCLKSFCGDSDGEADVGSTV INVRVTTMDAELEFAIQPNTTGKQLFDQWAFVFRICVESVACFVQVSAQEVRKENPLQFK FRAKFYPEDVAEELIQDITQKLFFLQVKEGILSDEIYCPPETAVLLGSYAVQAKFGDYNK EVHKSGYLSSERLIPQRVMDQHKLTRDQWEDRIQVWHAEHRGMLKDNAMLEYLKIAQDLE MYGINYFEIKNKKGTDLWLGVDALGLNIYEKDDKLTPKIGFPWSEIRNISFNDKKFVIKP IDKKAPDFVFYAPRLRINKRILQLCMGNHELYMRRRKPDTIEVQQMKAQAREEKHQKQLE RQQLETEKKRRETVEREKEQMMREKEELMLRLQDYEEKTKKAERELSEQIQRALQLEEER KRAQEEAERLEADRMAALRAKEELERQAVDQIKSQEQLAAELAEYTAKIALLEEARRRKE DEVEEWQHRAKEAQDDLVKTKEELHLVMTAPPPPPPPVYEPVSYHVQESLQDEGAEPTGY SAELSSEGIRDDRNEEKRITEAEKNERVQRQLLTLSSELSQARDENKRTHNDIIHNENMR QGRDKYKTLRQIRQGNTKQRIDEFEAL >gi568815592f:158563269_158864601|GENSCAN_predicted_CDS_5|2424_bp atggagactttgtaccgtgtcccattcttagtgctcgaatgtcccaacctgaagctgaag aagccgccctggctgcaagtgctgtcggccatgattgtgtatgctctgatggtggtgtct tacttcctcgtcactggaggaataatttatgatgttattgttgaacctccaagcattggc tctatgactgatgaacacgggcatcagaggccagtagctttcttggcctacagagtaaat gaacaatgtattatggaaggacttgcatccagcttcctgtttacaataggagcttcacgg atgggcgtggttccaggtcctagcttctccctgactgcccagtctgcccagccttccttg gctccagcaggccctgtcctgagctatgtcgttgaagaggccttggagtggaaacaaggg ttcttgtatgactccctgagtggctggggatgtccagatggcaaggctatgctgcccggt tactcacacgacaggcaagctgggccatcgtcctgggttgggacagcgtcttcgctgctg ctggatagtcgtgttttcggggatcgaggatactcaccagaaaccgaaaatgccgaaacc atgattcccaaatttggcctcctcaaaatcacttgtggaagtgacagcttcctgggctgc gggtgtttgaaaagcttctgtggggattctgatggtgaggctgatgtgggaagcactgtg atcaatgtccgagttaccaccatggatgcagagctggagtttgcaatccagccaaataca actggaaaacagctttttgatcagtgggccttcgtcttccggatttgtgtggagagtgtg gcttgtttcgtgcaggtgtctgcccaggaggtcaggaaggagaatcccctccagttcaag ttccgggccaagttctaccctgaagatgtggctgaggagctcatccaggacatcacccag aaacttttcttcctccaagtgaaggaaggaatccttagcgatgagatctactgcccccct gagactgccgtgctcttggggtcctacgctgtgcaggccaagtttggggactacaacaaa gaagtgcacaagtctgggtacctcagctctgagcggctgatccctcaaagagtgatggac cagcacaaacttaccagggaccagtgggaggaccggatccaggtgtggcatgcggaacac cgtgggatgctcaaagataatgctatgttggaatacctgaagattgctcaggacctggaa atgtatggaatcaactatttcgagataaaaaacaagaaaggaacagacctttggcttgga gttgatgcccttggactgaatatttatgagaaagatgataagttaaccccaaagattggc tttccttggagtgaaatcaggaacatctctttcaatgacaaaaagtttgtcattaaaccc atcgacaagaaggcacctgactttgtgttttatgccccacgtctgagaatcaacaagcgg atcctgcagctctgcatgggcaaccatgagttgtatatgcgccgcaggaagcctgacacc atcgaggtgcagcagatgaaggcccaggcccgggaggagaagcatcagaagcagctggag cggcaacagctggaaacagagaagaaaaggagagaaaccgtggagagagagaaagagcag atgatgcgcgagaaggaggagttgatgctgcggctgcaggactatgaggagaagacaaag aaggcagagagagagctctcggagcagattcagagggccctgcagctggaggaggagagg aagcgggcacaggaggaggccgagcgcctagaggctgaccgtatggctgcactgcgggct aaggaggagctggagagacaggcggtggatcagataaagagccaggagcagctggctgcg gagcttgcagaatacactgccaagattgccctcctggaagaggcgcggaggcgcaaggag gatgaagttgaagagtggcagcacagggccaaagaagcccaggatgacctggtgaagacc aaggaggagctgcacctggtgatgacagcacccccgcccccaccaccccccgtgtacgag ccggtgagctaccatgtccaggagagcttgcaggatgagggcgcagagcccacgggctac agcgcggagctgtctagtgagggcatccgggatgaccgcaatgaggagaagcgcatcact gaggcagagaagaacgagcgtgtgcagcggcagctgctgacgctgagcagcgagctgtcc caggcccgagatgagaataagaggacccacaatgacatcatccacaacgagaacatgagg caaggccgggacaagtacaagacgctgcggcagatccggcagggcaacaccaagcagcgc atcgacgagttcgaggccctgtaa