GENSCAN 1.0    Date run: 5-Nov-116    Time: 09:47:47

Sequence gi568815582f:14726291_14865410 : 139120 bp : 46.14% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.03 Intr -   1098    992  107  2  2   93  101   162 0.977  17.86
 1.02 Intr -   3472   3322  151  2  1   99  105   103 0.905  12.32
 1.01 Init -   7057   6961   97  1  1   51   96   243 0.569  19.87
 1.00 Prom -   9463   9424   40                              -7.16

 2.02 PlyA -  11415  11410    6                               1.05
 2.01 Sngl -  16389  15829  561  0  0   84   40   351 0.273  26.04
 2.00 Prom -  18752  18713   40                              -4.16

 3.00 Prom +  19265  19304   40                              -5.46
 3.01 Init +  21776  21895  120  1  0   39  113   143 0.936  11.19
 3.02 Intr +  28902  29030  129  2  0   82   56    48 0.765   1.89
 3.03 Intr +  32657  32756  100  1  1   95   86    37 0.977   3.88
 3.04 Intr +  32920  33064  145  2  1   49   61   140 0.924   6.74
 3.05 Intr +  36022  36129  108  1  0  100   84    84 0.467   8.60
 3.06 Intr +  37318  37353   36  1  0   88  114     7 0.241   0.68
 3.07 Term +  38713  38788   76  1  1   88   42    45 0.177  -2.79
 3.08 PlyA +  39129  39134    6                               1.05

 4.07 PlyA -  39556  39551    6                               1.05
 4.06 Term -  39920  39916    5  0  2   65   40     0 0.291 -10.03
 4.05 Intr -  40218  40112  107  2  2   93  101   162 0.460  17.86
 4.04 Intr -  52978  52901   78  0  0   60   61   136 0.256   6.77
 4.03 Intr -  54069  53998   72  2  0  116   58    56 0.789   3.92
 4.02 Intr -  64793  64766   28  0  1  127   94    -1 0.142   1.57
 4.01 Init -  71068  71011   58  1  1   58   65    46 0.221   0.77
 4.00 Prom -  71685  71646   40                              -2.66

 5.07 PlyA -  71863  71858    6                               1.05
 5.06 Term -  74900  74754  147  2  0   67   28   144 0.441   4.20
 5.05 Intr -  81949  81863   87  1  0   96   60    55 0.365   3.67
 5.04 Intr -  94630  94502  129  1  0   83  103    95 0.978  11.39
 5.03 Intr -  96778  96596  183  1  0  108   25   165 0.705  12.18
 5.02 Intr -  98453  98344  110  0  2  117   84    19 0.729   4.40
 5.01 Init - 102047 102020   28  2  1   83   42    42 0.537  -1.14
 5.00 Prom - 102971 102932   40                              -7.36

 6.00 Prom + 105916 105955   40                              -8.76
 6.01 Init + 107562 107726  165  2  0   79   83   444 0.994  40.63
 6.02 Intr + 112117 112206   90  0  0  136  116    75 0.999  15.29
 6.03 Intr + 112407 112472   66  2  0   70   81    41 0.612   0.70
 6.04 Intr + 115072 115117   46  0  1  110  111   -57 0.927  -3.22
 6.05 Intr + 118384 118484  101  2  2  110   76   115 0.985  12.33
 6.06 Intr + 120287 120393  107  1  2   96   87    62 0.995   5.91
 6.07 Intr + 122258 122414  157  2  1   86   98     5 0.587   1.31
 6.08 Intr + 126140 126292  153  1  0   67   99   115 0.979  10.77
 6.09 Intr + 127219 127314   96  0  0   23   89    66 0.318   0.41
 6.10 Intr + 127647 127736   90  2  0   36   94    52 0.549   0.79
 6.11 Intr + 130927 131032  106  0  1  106   86    93 0.998  10.69
 6.12 Intr + 131215 131365  151  2  1   92   98   146 0.996  15.22
 6.13 Intr + 136723 136897  175  1  1   90   80   139 0.994  13.24
 6.14 Intr + 138295 138436  142  0  1  114   86    33 0.914   5.63
 6.15 Intr + 138734 138865  132  0  0  106   69   219 0.999  22.42

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815582f:14726291_14865410|GENSCAN_predicted_peptide_1|119_aa
MGPLPVCLPIMLLLLLPSLLLLLLLPGPGSGEASRILRVHRRGILELAGTVGCVGPRTPI
AYMKYGCFCGLGGHGQPRDAIDWCCHGHDCCYTRAEEAGCSPKTERYSWQCVNQSVLCX

>gi568815582f:14726291_14865410|GENSCAN_predicted_CDS_1|357_bp
atggggccgctacctgtgtgcctgccaatcatgctgctcctgctactgccgtcgctgctg
ctgctgctgcttctacctggccccgggtccggcgaggcctccaggatattacgtgtgcac
cggcgtgggatcctggaactggcaggaactgtgggttgtgttggtccccgaacccccatc
gcctatatgaaatatggttgcttttgtggcttgggaggccatggccagccccgcgatgcc
attgactggtgctgccatggccacgactgttgttacactcgagctgaggaggccggctgc
agccccaagacagagcgctactcctggcagtgcgtcaatcagagcgtcctgtgcgnn

>gi568815582f:14726291_14865410|GENSCAN_predicted_peptide_2|186_aa
MATCGELSVNCCAADFSEQRRRLERRRRQVEPGPRGPGMGQQPLQPGSPGRGAGRQRASR
QPPCGALTSLQAAPQQPPGSAHTSLQGSPLALHLPPPPRGVNCAVCRPGYADPGSPGPQQ
PDEEPRATARGYEKEQDGAPEKCKSSELGPPCQERLGAEDGEMEMEKRQVGRSGAPPVGS
ACAGGA

>gi568815582f:14726291_14865410|GENSCAN_predicted_CDS_2|561_bp
atggcgacctgcggcgagctgagcgtcaactgctgcgccgccgacttctcggagcagcga
aggcgactcgagagaagacgccgccaagtggaacccgggccccgcggccctgggatgggg
cagcagccactgcagccaggaagccctgggcggggcgctgggcgccaacgagcgtcacgg
caacctccatgcggcgccctcaccagcctacaggcggcaccgcagcagcctccaggcagc
gcccacaccagcctacaggggtcgccgctcgcactgcacctgcctccgccgccccggggt
gtgaactgcgctgtctgtcggcctggctacgctgacccgggcagcccaggcccgcagcag
ccggacgaggagcccagggccactgcccggggttacgagaaggagcaggacggtgcccca
gaaaaatgcaagagctcagagctagggcccccgtgccaggaaaggctaggagcagaagat
ggagagatggagatggagaagcggcaggtgggaaggagcggcgctccaccggtggggtca
gcatgcgctggcggggcttag

>gi568815582f:14726291_14865410|GENSCAN_predicted_peptide_3|237_aa
MVKLSIVLTPRFLSHDQGQLTKELQQHVKSVTCPCEYLRKVINTLADHRHRGTDFGGSPW
LLIITVFLRSYKFAISLCTSYLCVSFLKTIFPSQNGHDGSTDVQQRARRSNRRRQEGIKI
VLEDIFTLWRQVETKVRAKICKMKVTTKVNRHDKINGKRKTAKEHLRKLSMKEREHGEKE
RQVSEAEENGKLDMKEIHTYISPLLQESLFATGSEWRQRSIVILQDCPTGPTSQLKL

>gi568815582f:14726291_14865410|GENSCAN_predicted_CDS_3|714_bp
atggtgaagctctctattgtcctgaccccacggttcctgtcccatgaccagggccagctc
accaaggagctgcagcagcacgtaaagtcagtgacatgcccatgcgagtacctgaggaag
gttatcaatactctggctgaccatcgtcatcgtgggactgactttggtggaagtccttgg
ttacttatcattactgtgtttctgagaagttataaatttgccatctccctctgcacaagt
tacctttgtgtgtctttcctgaagactatcttcccgtctcaaaatggacatgatggatcc
acggatgtacagcagagagccaggaggtccaaccgccgtagacaggaaggaattaaaatt
gtcctggaagacatctttactttatggagacaggtggaaaccaaagttcgagctaaaatc
tgtaagatgaaggtgacaacaaaagtcaaccgtcatgacaaaatcaatggaaagaggaag
accgccaaagaacatctgaggaaactaagcatgaaagaacgtgagcacggagaaaaggag
aggcaggtgtcagaggcagaggaaaacgggaaattggatatgaaagaaatacacacctac
atatcaccccttctgcaagaaagcctctttgcaaccgggtcagaatggcggcagcggagc
atcgtcattcttcaggattgccctactggccctacctcacagctgaaactttaa

>gi568815582f:14726291_14865410|GENSCAN_predicted_peptide_4|115_aa
MVNIECQLDWIEGGKVLILDPRPTWVKNSEEKLPEVSPEADAGIMLLVQSAEPGKKEQGD
LSSEKQLEEEEEEEEDADRCCHGHDCCYTRAEEAGCSPKTERYSWQCVNQSVLCG

>gi568815582f:14726291_14865410|GENSCAN_predicted_CDS_4|348_bp
atggttaatattgagtgtcaacttgattggatcgaagggggcaaagtattgatcctggac
ccgaggcccacctgggtgaagaacagtgaagaaaagcttcctgaggtctcaccagaagca
gatgctggcatcatgcttcttgtacagtctgcagaaccaggaaaaaaggagcagggagac
cttagctctgagaagcaactagaggaggaggaagaggaggaggaagatgctgacaggtgc
tgccatggccacgactgttgttacactcgagctgaggaggccggctgcagccccaagaca
gagcgctactcctggcagtgcgtcaatcagagcgtcctgtgcggctag

>gi568815582f:14726291_14865410|GENSCAN_predicted_peptide_5|227_aa
MLYTHNDIRGSRPSSLPRDLATDRRWDPRRQKAPMAAPAEPCAGQGVWNQTEPEPAATSL
LSLCFLRTAGVWVPPMYLWVLGPIYLLFIHHHGRGYLQMSPLFKAKMSFAVFLIHTERKK
GVQSSGVLFGYWLLCFVLPATNAAQQASGVLKPYSFAPNIIIASSASPLGETLEYSNQRL
TFSMNIIAIHPVTQTKSLNNIFESFSITPYIQPIATFSFIYLLKSRP

>gi568815582f:14726291_14865410|GENSCAN_predicted_CDS_5|684_bp
atgctgtacacccacaacgatattaggggatcgcggccgagcagtctgcccagagactta
gcgacagacagacgctgggacccacgacgacagaaggcgccgatggccgcgcctgctgag
ccctgcgcggggcagggggtctggaaccagacagagcctgaacctgccgccaccagcctg
ctgagcctgtgcttcctgagaacagcaggggtctgggtgccccccatgtacctctgggtc
cttggtcccatctacctcctcttcatccaccaccatggccggggctacctccagatgtcc
ccactcttcaaagccaagatgagcttcgcagtgttcctgattcacaccgagaggaaaaag
ggagtccagtcatctggagtgctgtttggttactggcttctctgctttgtcttgccagct
accaacgctgcccagcaggcctccggagtgctgaagccctattcatttgcacccaacatc
atcattgcgtccagtgcctcaccacttggtgagacccttgaatattcaaaccagcgtctc
actttctccatgaatatcatcgccattcacccagttacccaaaccaagagcctgaacaac
atctttgagtccttctccatcacaccctacattcaaccaattgccaccttctcttttata
tacctccttaagtcacggccttaa

>gi568815582f:14726291_14865410|GENSCAN_predicted_peptide_6|593_aa
MLVGQGAGPLGPAVVTAAVVLLLSGVGPAHGSEDIVVGCGGFVKSDVEINYSLIEIKLYT
KHGTLKYQTDCAPNNGYFMIPLYDKGHLAMSRDVFVVTSVAVRYSHPGDFILKIEPPLGW
SFEPTTVELHVDGVSDICTKGGDINFVFTGFSVNGKVLSKGQPLGPAGVQVSLRNTGTEA
KIQSTVTQPGGNSCLSPLSNPVECSVTGPCCAARIGFMPEILEIKGDVSTPLSDQHMEHP
SWHTASTTVRVTNSNANAASPLIVAGYNVSGSVRSDGEPMKGVKFLLFSSLVTKEPQDES
LVYLCYTVSREDGSFSFYSLPSGGYTVIPFYRGERITFDVAPSRLDFTVEHDSLKIEPVF
HVMGFSVTGRVLNGPEGDGVPEAVVTLNNQIKVKTKADGSFRLENITTGTYTIHAQKEHL
YFETVTIKIAPNTPQLADIIATGFSVCGQISIIRFPDTVKQMNKYKVVLSSQDKDKSLVT
VETDAHGSFCFKAKPGTYKVQVMVPEAETRAGLTLKPQTFPLTVTNRPMMDVAFVQFLAS
VSGKVSCLDTCGDLLVTLQSLSRQGEKRSLQLSGKVNAMTFTFDNVLPGKYKX

>gi568815582f:14726291_14865410|GENSCAN_predicted_CDS_6|1779_bp
atgctggtgggccagggcgcggggccgctggggcccgcggtggtcaccgccgcggtggtg
ctgctgctgagcggcgtggggccggcgcacggctcggaggacatcgtggtgggctgcggt
ggcttcgtcaagtcggacgtggagatcaactactctctcatcgagataaagctgtacacc
aagcatgggactttgaaataccagacagactgtgcccctaataatggttactttatgatc
cctttgtatgataagggccatttggcaatgtctagagacgtttttgttgtgacatctgtg
gctgtgcgctacagtcacccaggggatttcattctgaagattgagcctcccctagggtgg
agttttgagccgacgaccgtggagctccatgtggatggagtcagtgacatctgcacaaag
ggtggggacatcaactttgtcttcactgggttctctgtgaatggcaaggtcctcagcaaa
gggcagcccctgggtcctgcgggagttcaggtgtctctgagaaacactgggaccgaagca
aagatccagtccacagttacacagcctggcggaaattcatgcctgtcacctctgtcaaac
ccagtcgaatgttctgtgactggtccctgctgtgctgcaaggattggttttatgccagaa
atccttgagattaagggagacgtaagcactcctctttccgatcagcacatggagcacccc
tcttggcacacagcaagcaccacagtgcgtgtaaccaactccaatgccaatgcggccagt
cccctcatagttgctggctacaatgtgtctggctctgtccgaagtgatggggagcccatg
aaaggcgtgaagtttcttctcttttcttctttagtaactaaagagccccaagacgagagt
ctggtgtatttgtgctacacggtctccagagaagatggctcgttctctttctattccttg
ccaagtgggggctacactgtgattccgttctatcgaggggagaggattacctttgatgtg
gcgccttccagacttgacttcacagtggagcatgacagcttgaaaatcgagcccgtgttc
cacgtcatgggattctccgtcaccgggagggtcttgaacggacccgaaggagatggtgtt
ccagaagcagtagtcaccctgaataaccaaatcaaagttaaaacaaaagctgatggctca
ttccgccttgagaacataaccacagggacatacaccatccatgctcagaaagagcacctc
tactttgaaacggtcaccatcaaaattgcaccgaacacacctcagctggctgacattatt
gcaacagggttcagtgtctgtggtcagatatcaatcattcgcttccccgacaccgtcaag
cagatgaataaatacaaagttgtcctgtcatctcaagacaaggacaagtctttggtcacc
gtggagacagatgctcatggatcattttgttttaaagcaaaaccagggacttacaaagtg
caggtgatggttcctgaggcagaaaccagagcagggctgacgttgaaaccccagacattt
cctcttactgtgaccaacaggcccatgatggatgtggcctttgtacagttcttggcatca
gtttctgggaaagtctcttgtttggacacctgtggtgacttgctggtgactctacagtcc
ctgagccgccagggtgagaagcggagcctccagctctccggcaaggtcaacgccatgact
ttcacctttgacaacgtgctccctggaaaatacaaaann