GENSCAN 1.0 Date run: 3-Nov-116 Time: 17:48:39 Sequence gi568815578r:860360_1102164 : 241805 bp : 47.27% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 867 862 6 1.05 1.03 Term - 4933 4749 185 2 2 58 45 128 0.441 3.21 1.02 Intr - 8479 8272 208 1 1 61 94 130 0.603 9.55 1.01 Init - 9118 8936 183 1 0 85 92 10 0.771 0.28 1.00 Prom - 10244 10205 40 -6.76 2.12 PlyA - 10482 10477 6 1.05 2.11 Term - 12761 12601 161 0 2 89 43 290 0.998 22.70 2.10 Intr - 14055 13925 131 2 2 78 92 152 0.998 14.94 2.09 Intr - 15130 14984 147 0 0 51 9 142 0.436 1.95 2.08 Intr - 17968 17802 167 1 2 122 94 150 0.987 17.76 2.07 Intr - 19489 19388 102 1 0 98 68 79 0.934 7.27 2.06 Intr - 20927 20812 116 0 2 121 72 120 0.970 13.87 2.05 Intr - 24966 24719 248 2 2 114 100 466 0.991 47.50 2.04 Intr - 28080 27959 122 0 2 48 94 164 0.975 12.29 2.03 Intr - 30009 29854 156 0 0 106 92 240 0.745 26.41 2.02 Intr - 52631 52539 93 2 0 111 80 43 0.830 5.96 2.01 Init - 55837 55547 291 1 0 75 102 157 0.524 11.19 2.00 Prom - 56010 55971 40 -2.46 3.24 PlyA - 59901 59896 6 1.05 3.23 Term - 60323 60193 131 0 2 78 38 29 0.194 -4.76 3.22 Intr - 60583 60481 103 1 1 103 116 -9 0.248 3.15 3.21 Intr - 80167 79867 301 0 1 80 119 93 0.032 8.34 3.20 Intr - 96300 96159 142 1 1 112 30 81 0.138 4.11 3.19 Intr - 96650 96523 128 1 2 50 72 26 0.229 -2.48 3.18 Intr - 99423 99173 251 0 2 69 99 51 0.383 0.54 3.17 Intr - 100107 100031 77 1 2 93 58 67 0.464 3.43 3.16 Intr - 102759 102676 84 1 0 33 91 67 0.478 1.29 3.15 Intr - 103761 103576 186 1 0 112 83 35 0.780 5.06 3.14 Intr - 106955 106815 141 0 0 85 96 122 0.883 13.02 3.13 Intr - 107779 107591 189 2 0 83 68 371 0.995 34.16 3.12 Intr - 120016 119951 66 2 0 101 82 34 0.014 2.98 3.11 Intr - 126088 125921 168 2 0 122 55 36 0.006 3.62 3.10 Intr - 141984 141727 258 1 0 65 82 288 0.992 23.33 3.09 Intr - 147670 147577 94 1 1 41 103 50 0.030 1.34 3.08 Intr - 160403 160349 55 1 1 86 95 49 0.280 4.38 3.07 Intr - 166024 165934 91 2 1 72 84 46 0.075 1.65 3.06 Intr - 181661 181554 108 0 0 58 101 27 0.064 1.26 3.05 Intr - 192587 192394 194 1 2 82 72 123 0.295 9.24 3.04 Intr - 196105 196045 61 2 1 64 116 38 0.628 1.99 3.03 Intr - 198108 198003 106 0 1 81 98 5 0.446 0.59 3.02 Intr - 198969 198871 99 0 0 156 72 11 0.765 6.61 3.01 Init - 207654 207604 51 0 0 85 110 15 0.248 4.56 3.00 Prom - 212562 212523 40 -2.26 4.00 Prom + 216678 216717 40 -1.86 4.01 Init + 232163 232201 39 1 0 111 97 20 0.366 5.39 4.02 Intr + 234806 234931 126 1 0 112 121 -55 0.115 1.08 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 125368 125419 52 1 1 103 42 56 0.821 -0.60 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578r:860360_1102164|GENSCAN_predicted_peptide_1|191_aa MAADKTNLRILGSTRHCRGHGWCGLEPRLGLLGATAHRCQPLGYLHLRVPMENTCHLSLH QSTLGIWASTTKLVYQGQEELPGKASKTTQSLPLALDGSIISEISALFPAKAVSTLGREF PEYTPETTTTGAQMRMCVLRFNTGTYSATTDVSSGPSQQPGITHVPTEDVFGIAFCCFSA AFLLLRPHELA >gi568815578r:860360_1102164|GENSCAN_predicted_CDS_1|576_bp atggctgcagacaagaccaacctgaggatcctgggcagcacaaggcactgtagagggcac ggctggtgtggtctcgagcctagactgggtctcctgggggccacagctcataggtgccag cctctaggatacctgcatctgcgggttcctatggaaaacacttgtcacctctccctgcac cagagcactctgggcatctgggcatctaccaccaaactcgtgtaccaagggcaggaggaa cttcctggaaaggcctccaagaccactcagtctctcccactggcacttgatggatccatc atttctgagatctctgcgttgttcccagcaaaggcagtttccacactggggagagagttc cctgagtacaccccagagacaaccaccacaggtgcccagatgcgcatgtgtgtgctgagg ttcaacactggaacttattcagctacaacagatgtgtcctctgggccctcgcagcaacct ggtataacacacgtgcctacagaggatgtgtttggcatcgcattctgctgcttttctgct gcttttctgcttcttcgaccacatgaattggcttag >gi568815578r:860360_1102164|GENSCAN_predicted_peptide_2|577_aa MLQGSLLLVVATMSVAQQTRQEADRGCETLVVQHGHCSYTFLLPKSEPCPPGPEVSRDSN TLQRESLANPLHLGKLPTQQVKQLEQALQNNTQWLKKCTGEETGQRGKVTCQSHVAVKAN LDSHYKLQLERAIKTILRSKLEQVQQQMAQNQTAPMLELGTSLLNQTTAQIRKLTDMEAQ LLNQTSRMDAQMPETFLSTNKLENQLLLQRQKLQQLQGQNSALEKRLQALETKQQEELAS ILSKKAKLLNTLSRQSAALTNIERGLRGVRHNSSLLQDQQHSLRQLLVLLRHLVQERANA SAPAFIMAGEQVFQDCAEIQRSGASASGVYTIQVSNATKPRKVFCDLQSSGGRWTLIQRR ENGTVNFQRNWKDYKQGFGDPAGEHWLGNEVVHQLTRRAAYSLRVELQDWEGHEAYAQYE HFHLGSENQLYRSHGLHHQAGLDMPTKATEPGKEIEILTWLAADVEAAELGCPGAQIHGG WLSVVGYSGSAGRQSSLVLQNTSFSTLDSDNDHCLCKCAQVMSGGWWFDACGLSNLNGVY YHAPDNKYKMDGIRWHYFKGPSYSLRASRMMIRPLDI >gi568815578r:860360_1102164|GENSCAN_predicted_CDS_2|1734_bp atgctgcagggcagcctcctccttgtggttgccaccatgtctgtggctcaacagacaagg caggaggcggataggggctgcgagacacttgtagtccagcacggccactgtagctacacc ttcttgctgcccaagtctgagccctgccctccggggcctgaggtctccagggactccaac accctccagagagaatcactggccaacccactgcacctggggaagttgcccacccagcag gtgaaacagctggagcaggcactgcagaacaacacgcagtggctgaagaagtgtacaggt gaggaaacaggccagagagggaaagtgacttgccagagccatgtagcagtgaaagccaat ctggattctcactacaaactgcagctagagagggccatcaagacgatcttgaggtcgaag ctggagcaggtccagcagcaaatggcccagaatcagacggcccccatgctagagctgggc accagcctcctgaaccagaccactgcccagatccgcaagctgaccgacatggaggctcag ctcctgaaccagacatcaagaatggatgcccagatgccagagacctttctgtccaccaac aagctggagaaccagctgctgctacagaggcagaagctccagcagcttcagggccaaaac agcgcgctcgagaagcggttgcaggccctggagaccaagcagcaggaggagctggccagc atcctcagcaagaaggcgaagctgctgaacacgctgagccgccagagcgccgccctcacc aacatcgagcgcggcctgcgcggtgtcaggcacaactccagcctcctgcaggaccagcag cacagcctgcgccagctgctggtgttgttgcggcacctggtgcaagaaagggctaacgcc tcggccccggccttcataatggcaggtgagcaggtgttccaggactgtgcagagatccag cgctctggggccagtgccagtggtgtctacaccatccaggtgtccaatgcaacgaagccc aggaaggtgttctgtgacctgcagagcagtggaggcaggtggaccctcatccagcgccgt gagaatggcaccgtgaattttcagcggaactggaaggattacaaacagggcttcggagac ccagctggggagcactggctgggcaatgaagtggtgcaccagctcaccagaagggcagcc tactctctgcgtgtggagctgcaagactgggaaggccacgaggcctatgcccagtacgaa catttccacctgggcagtgagaaccagctatacagatctcatggtcttcatcaccaggca ggcttggacatgcccaccaaggccacagagcctggcaaagaaatagaaatcctcacctgg ctggcagcagatgtcgaagctgcagagctggggtgccctggagcccagatccatgggggc tggctttctgtggtcgggtacagcggctcagcagggcgccagagcagcctggtcctgcag aacaccagctttagcacccttgactcagacaacgaccactgtctctgcaagtgtgcccaa gtgatgtctggagggtggtggtttgacgcctgtggcctgtcaaacctcaacggcgtctac taccacgctcccgacaacaagtacaagatggacggcatccgctggcactacttcaagggc cccagctactcactgcgtgcctctcgcatgatgatacggcctttggacatctaa >gi568815578r:860360_1102164|GENSCAN_predicted_peptide_3|1027_aa MRKNQQREQCQMLLSNQILIPNKHLAPKLCLRVCLHRTQPATQILIPGGQHSQSLWEVSP SSPSEEQPPVLSPGGVDPFPSLSYKVGSDDSTHMQARSQDSVKEKRSSRKVLGHLSLPCM GHGLLPTLLMLTALALGLGSGDRTQQCCQSLMPMLKTMRGGKTKVKLGEKERSPHSSSAA TFFTLTLITHNTWVWFFARDESPPLEWTWMELEAVILSKLMQEQKTKYHKFSLISGSKRE LLMAPSEGGEAEPAKIGNQARLRTRMSMLITLTRHMAGSSSKFDKAGQCPGSANALTRPG GRTDPRLADAPGAPTAAPAPAVPPPGRGALGPSGRLGSASPDPAAQMRAPLCLLLLVAHA VDMLALNRRKKQGRYVMLSHSLLSSLWVDFSPVCWGAEEGMGIETAVTTLMLLWPPQFPV LENSGAITSEVKRTPKKASAQHWELSKASVVGTGLGGNCTGCIICSEENGCSTCQQRLFL FIRREGIRQYGKCLHDCPPGYFGIRGQEVNRCKKCGATCESCFSQDFCIRCKRQFYLYKG KCLPTCPPGTLAHQNTRECQGECELGPWGGWSPCTHNGKTCGSAWGLESRVREAGRAGHE EAATCQVLSESRKCPIQRPCPGDHYTYFQNVSGGYEGPLIYIEERPQGVLERSPGQKKGR KDRRPRKDRKLDRRLDSPEAISTAFSHQPPRGAGWTRLQGFQRPGLDTSPSWPWRVTTWP LGGQPALEGILCKLPATLIGTPPQASLSWVPCPFFPSRISLSHILWRIEGPLWEGDSAVR WASPTPFLQAKLGEVHREAQKQALHVIPTCCHRMTAVNQEQVAARLVKERISETGSCCGS LLAGLEQGGGLDVMVIMGCSLAVKAGGLKDNLVPTPQCGCRWLFRPPHLPPSAFAGAHME CPPGLTVLQTFTGAPQAVHVGNVRKPKRHHAPNTKEVFSLLTKSPTWVPLEQEVLFFQEN RFQHVTLGKWPPSLSISFLMSEMQGFLERHCTPPLVPSDPVRLTCSKAPGLDTGSSLTIH SSSFLWL >gi568815578r:860360_1102164|GENSCAN_predicted_CDS_3|3084_bp atgaggaagaaccagcaaagagaacagtgtcaaatgcttctgagcaatcagatattgatt cctaataaacaccttgcacctaaactctgtcttcgtgtctgcctgcatagaacccaacct gcaacacaaattctgattcctggaggtcagcattcacaatccctttgggaagtatctccc tcatctccctcggaggaacagcccccagttctcagtccaggtggggttgaccccttccca tccctgagctacaaagttggaagtgatgattccacacatatgcaggctagatctcaagac agtgtcaaggaaaaaaggagcagccgcaaggtgctgggccacctgagtctgccatgcatg ggccacgggctcctaccgaccctcctgatgctcacagccctggcactggggttgggcagt ggggacaggacacaacagtgctgccaatccctgatgcccatgctcaagaccatgagagga ggaaagaccaaggtgaaactgggggaaaaggagaggtctccccactcctcttccgcggca acattcttcaccttaactcttatcactcacaacacgtgggtttggttttttgcccgtgat gagtctcctccactggaatggacatggatggagctggaggccgttatccttagcaaacta atgcaggaacagaaaaccaaataccacaagttctcactcataagtgggagcaaaagggag ctgctgatggcaccttcagagggtggtgaagctgaaccggctaagattgggaaccaggca agattgagaacaaggatgtctatgctcatcacacttactcggcatatggctggaagttct agcaagtttgataaagcagggcagtgccccggctccgccaacgccctcactagacctggc ggccggaccgacccgcgcctggcggatgcgcccggcgcgcccacagcagcccccgcgccc gccgtgccgccgccgggacgtggggcccttgggccgtcgggccgcctggggagcgccagc ccggatccggctgcccagatgcgggcgccactctgcctgctcctgctcgtcgcccacgcc gtggacatgctcgccctgaaccgaaggaagaagcaagggagatatgtcatgctgtctcat tctcttctctcctccctctgggttgatttctcccctgtatgttggggagcagaagaagga atggggatagagacagctgtcactaccctgatgctgctgtggcctccacaattcccggtg ttagaaaactctggggccatcaccagtgaggtcaagcggacaccgaagaaggcctcagca cagcactgggaacttagcaaggcgtcagtagtgggcactggcctggggggcaactgcaca ggctgtatcatctgctcagaggagaacggctgttccacctgccagcagaggctcttcctg ttcatccgccgggaaggcatccgccagtacggcaagtgcctgcacgactgtccccctggg tacttcggcatccgcggccaggaggtcaacaggtgcaaaaaatgtggggccacttgtgag agctgcttcagccaggacttctgcatccggtgcaagaggcagttttacttgtacaagggg aagtgtctgcccacctgcccgccgggcactttggcccaccagaacacacgggagtgccag ggggagtgtgaactgggtccctggggcggctggagcccctgcacacacaatggaaagacc tgcggctcggcttggggcctggagagccgggtacgagaggctggccgggctgggcatgag gaggcagccacctgccaggtgctttctgagtcaaggaaatgtcccatccagaggccctgc ccaggagaccactacacgtatttccaaaatgtttctgggggctatgaaggaccactgatt tacatagaagagaggccacagggtgtgcttgagaggagccccggccagaagaagggcagg aaggaccggcgcccacgcaaggacaggaagctggaccgcaggctggactccccagaggcc atttccacagccttcagccaccagccaccccgaggagctggctggacaaggctccagggc ttccagaggcctggcttggacacctcccccagctggccgtggagggtcacaacctggcct ctgggtgggcagccagccctggagggcatcctctgcaagctgcctgccaccctcatcggc actcccccacaggcctccctctcatgggttccatgcccctttttcccaagccggatcagc ctctcccacattctctggaggattgaggggcctctctgggaaggagactctgctgtcagg tgggcatcgcccactcccttcttacaagccaagcttggagaagttcacagagaagctcag aaacaggccctgcatgtcattcccacatgctgccaccggatgacggcagtcaaccaggaa caggtcgcagccaggcttgtcaaggagcgcatttctgaaacggggtcgtgttgcggttct ctcctagcaggactggagcagggtggagggcttgatgtgatggtgatcatgggctgctct ctggcagtcaaggcgggaggactgaaggacaatctggttccaacaccacagtgtggctgc cgttggctcttcaggcctcctcacctcccaccctctgcctttgctggtgcccacatggaa tgtcctcccgggctgaccgtcctgcagacttttacaggtgccccccaagcggtgcatgtg ggtaacgttcgtaagcccaagaggcaccatgcaccaaataccaaagaggttttttcctta ctcaccaaaagcccaacatgggtgcccctggagcaggaagttctgttcttccaggaaaat agattccagcatgtaacactgggcaagtggcctccctctctgagtatcagtttcctcatg agtgaaatgcaaggttttcttgagcgtcactgtaccccacccttagtcccttccgatcct gtgaggttgacctgctccaaggctccaggcctggacacaggatccagcctgaccatccac agtagctccttcctctggctataa >gi568815578r:860360_1102164|GENSCAN_predicted_peptide_4|55_aa MAVTVLGIVFMFKKLQMRGRSLEIVHLMCPSPHFTDGDAEAQGGAVTCYITNLVK >gi568815578r:860360_1102164|GENSCAN_predicted_CDS_4|165_bp atggctgtcactgttcttggcatagtgttcatgttcaagaaacttcaaatgagggggagg agtttagaaattgtccatcttatgtgcccttctccccattttacagatggggacgctgag gcccagggaggagcagtgacctgctatatcacaaaccttgttaag