GENSCAN 1.0 Date run: 5-Nov-116 Time: 10:20:24 Sequence gi568815590r:17130723_17345152 : 214430 bp : 38.67% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 10984 11124 141 0 0 45 48 109 0.016 2.23 1.02 Intr + 19811 19867 57 1 0 23 81 124 0.094 3.36 1.03 Term + 20026 20238 213 0 0 56 38 117 0.096 -0.25 1.04 PlyA + 20349 20354 6 1.05 2.05 PlyA - 21225 21220 6 1.05 2.04 Term - 25337 25110 228 2 0 61 54 177 0.295 7.25 2.03 Intr - 26448 26248 201 0 0 100 52 117 0.252 7.86 2.02 Intr - 28154 28040 115 1 1 47 64 45 0.176 -2.47 2.01 Init - 32675 32311 365 2 2 82 94 72 0.271 3.79 2.00 Prom - 33198 33159 40 -6.05 3.00 Prom + 39172 39211 40 -2.45 3.01 Init + 39475 39529 55 0 1 17 84 42 0.035 -1.80 3.02 Intr + 54067 54093 27 2 0 139 121 35 0.543 9.07 3.03 Intr + 55609 55703 95 2 2 101 100 15 0.469 2.76 3.04 Intr + 64782 64902 121 2 1 71 84 145 0.236 11.55 3.05 Term + 67041 67093 53 1 2 88 54 20 0.031 -4.79 3.06 PlyA + 67739 67744 6 1.05 4.03 PlyA - 67858 67853 6 1.05 4.02 Term - 69083 68703 381 2 0 27 42 559 0.328 39.05 4.01 Init - 82344 82255 90 0 0 69 100 56 0.136 5.44 4.00 Prom - 86847 86808 40 -2.05 5.04 PlyA - 87334 87329 6 1.05 5.03 Term - 100126 99998 129 1 0 82 44 130 0.954 5.20 5.02 Intr - 104138 103994 145 1 1 78 78 173 0.967 14.66 5.01 Init - 106644 106490 155 0 2 62 66 128 0.619 7.50 5.00 Prom - 106796 106757 40 -3.75 6.00 Prom + 107917 107956 40 -5.25 6.01 Init + 111268 111459 192 0 0 77 72 66 0.269 2.91 6.02 Intr + 115468 115680 213 0 0 91 78 218 0.512 19.09 6.03 Intr + 115930 116051 122 0 2 76 16 116 0.372 1.57 6.04 Intr + 116078 116225 148 2 1 19 8 174 0.263 1.82 6.05 Intr + 116357 116686 330 1 0 50 39 264 0.215 12.60 6.06 Intr + 130962 131249 288 2 0 29 105 142 0.280 6.52 6.07 Intr + 137552 137650 99 1 0 85 48 73 0.501 2.39 6.08 Intr + 138134 138234 101 1 2 40 83 73 0.558 -0.01 6.09 Intr + 144011 144245 235 2 1 126 23 181 0.597 12.17 6.10 Intr + 145675 145745 71 0 2 67 92 93 0.995 4.66 6.11 Intr + 149306 149433 128 2 2 113 55 103 0.752 8.90 6.12 Intr + 149517 149575 59 1 2 15 113 19 0.596 -5.22 6.13 Intr + 149653 149721 69 0 0 75 94 56 0.615 3.36 6.14 Intr + 153751 153894 144 0 0 80 45 124 0.518 6.86 6.15 Term + 155625 155705 81 2 0 64 43 84 0.480 -1.79 6.16 PlyA + 157950 157955 6 1.05 7.08 PlyA - 158487 158482 6 1.05 7.07 Term - 163187 162586 602 0 2 83 43 413 0.889 30.00 7.06 Intr - 169502 169211 292 2 1 73 77 182 0.425 11.38 7.05 Intr - 171558 171432 127 2 1 98 63 220 0.999 20.26 7.04 Intr - 175235 175035 201 1 0 91 108 85 0.994 8.28 7.03 Intr - 178604 178555 50 2 2 87 131 32 0.999 3.96 7.02 Intr - 180914 180789 126 2 0 113 101 139 0.999 17.66 7.01 Init - 182656 182570 87 1 0 59 91 63 0.561 4.39 7.00 Prom - 183570 183531 40 -7.65 8.04 PlyA - 184688 184683 6 1.05 8.03 Term - 186684 186556 129 0 0 51 48 176 0.865 7.10 8.02 Intr - 187672 187545 128 2 2 82 28 103 0.535 3.18 8.01 Init - 194679 194553 127 0 1 48 51 118 0.003 4.47 8.00 Prom - 195865 195826 40 -7.45 9.04 PlyA - 196561 196556 6 1.05 9.03 Term - 199736 199525 212 0 2 63 45 173 0.970 6.97 9.02 Intr - 200560 200428 133 1 1 79 92 133 0.314 12.10 9.01 Intr - 210775 210641 135 1 0 98 106 244 0.968 27.04 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590r:17130723_17345152|GENSCAN_predicted_peptide_1|136_aa ASEGMTAAGGLTSRTLTHMATPILCYMALSMGLLMTTGQLTSHGSSDEISELNQRPILPD NDIDGEETEVYIDVIEVHPQGSSRAFRNYWVSLQSDIYIFWNMDSLVAENGFHSHSRHSE KVVLYSLDSSSKEEIS >gi568815590r:17130723_17345152|GENSCAN_predicted_CDS_1|411_bp gcatcagaaggaatgactgcagctggaggactgacttccagaacactcactcacatggcg acccccatcctctgctacatggccctctcgatggggctgctcatgacaacagggcagcta acttctcacggctcaagtgatgaaatttcagaactcaaccagagacctattcttccggat aacgacattgatggagaagagactgaggtctacattgatgtgattgaagtccatccacag ggttcctctagggcctttcgcaattactgggtatcccttcaaagtgatatctacattttc tggaatatggacagtctggttgcagagaatggttttcattctcacagtagacacagcgaa aaggtggttctttactctttggattcctcttcaaaggaagaaatttcctag >gi568815590r:17130723_17345152|GENSCAN_predicted_peptide_2|302_aa MVTKKDQTPELAGPCSLRCGRNHLSPASWTLPRLEHWTRCYQAGGAVVVPVFCGDRLPGS SSFQRLFLRCNVSSASLACPSFLSISKLSSPGSPWIQRVAFMPSFTHKVTGPGQSKSGPS IGPLTVVFNVAGIAPGQADRHYARQTGIAPGRQGQEVIDKRSRGPSAPLGGVAEAAAVSG RRREARRRVRGGGATLTRELGGEGEGRARGRSCPEAAFSGGAGGTRQPAWNPRASTCGSL GARGGRKEPPFTTRAFGGSRNAQRRGGAEPASGMGGAVESEEEAFLAVNPGTSGVRNTSS LL >gi568815590r:17130723_17345152|GENSCAN_predicted_CDS_2|909_bp atggtaacgaaaaaagatcaaacacctgagctcgctggaccatgcagcctgagatgtggg cgcaaccacctcagccctgccagctggacactgcccaggctggaacactggacccgctgt taccaggctgggggagcagtggtggttccagtgttctgtggtgaccggctgccaggcagc agctctttccaaaggctgttcctcaggtgcaacgtcagctccgcttcactcgcctgccct tcgtttctgtccatttctaagctaagctctccaggctcgccgtggattcagagagttgca ttcatgcccagtttcacccacaaggtcacaggccctggacagagtaaaagtggcccgagc atcggaccacttactgttgttttcaatgtggcaggcattgcgccaggccaggcagacagg cattacgccaggcagacaggcattgcgccaggcaggcagggacaagaagtgatagataag cggtcccggggaccttccgcccctctagggggtgtggccgaggcggcggcggtgagcgga aggcggcgggaggctcggcggcgagtgcggggcggaggcgccaccttaacccgagagttg ggaggggagggcgaggggcgtgcgcgtggaagaagctgccccgaggctgcttttagcggc ggagccggcgggacgcggcagcccgcgtggaaccctcgcgcttcaacttgcgggtccctg ggagcccgcgggggccgcaaggagcctccctttactactcgggctttcgggggaagcaga aatgcacagcgaagagggggtgccgagccggcgtctggcatgggaggggctgtggagagc gaggaagaagcatttttggctgtgaatccaggcacctctggagttagaaacacaagttct ttgctctga >gi568815590r:17130723_17345152|GENSCAN_predicted_peptide_3|116_aa MTVKLQKKIVLLSDPILKVSMENTGEQVVCLMAYHLLFAMFVWSYWKTIFTLPMNPSKEF HLSYAEKDLLEREPRGEAHQEVLRRAAKDLPIYTRTMSGALDLGNGVGITRWHSYQ >gi568815590r:17130723_17345152|GENSCAN_predicted_CDS_3|351_bp atgacagtgaagttacagaagaaaatagtgcttctcagtgacccaatactgaaggtgtcc atggaaaacactggcgaacaagttgtgtgcctgatggcctatcatctactttttgcaatg tttgtctggtcatactggaaaactatctttacattaccaatgaatccttcaaaagaattc catctctcttatgcagagaaagatttgttggagagagagccaagaggagaagcccatcag gaagttcttaggcgagcagccaaggatcttcccatctataccaggaccatgtctggagcc cttgatttaggaaatggagttggtataacccgctggcattcataccagtga >gi568815590r:17130723_17345152|GENSCAN_predicted_peptide_4|156_aa MLGPCEEQQDLCLRIVSERKVKKNDIQKAKPGRQSETLSQKKKKRRRKKKDEEGKRRKKE EERRKKEKEEKKKKKEGGGRREEEEEKKKKGRRRRRKNKEEERRRRRQRRRRRRRRQKTK TEDEDEDEEEEEEEEEEEEEEEEVLLKTLGEREHLL >gi568815590r:17130723_17345152|GENSCAN_predicted_CDS_4|471_bp atgcttggtccatgtgaggaacagcaagatctgtgtctgaggattgtaagtgaaaggaaa gtaaaaaagaatgacatccagaaggccaagcctgggcgacagagcgagactctgtctcaa aaaaaaaaaaaaagaagaagaaagaagaaggacgaagaaggaaaaagaagaaagaaggaa gaagaaagaaggaagaaggagaaagaagaaaagaagaagaagaaggagggaggagggagg agggaggaggaggaggagaagaagaagaaaggaagaagaagaagaagaaagaataaagaa gaagaaagaagaagaagaagacaaagacgaagaagacgaagacgaagacagaagacgaag acagaagacgaagacgaagacgaagaagaagaagaagaagaagaagaagaagaagaagaa gaagaagaagtcttattaaagacactgggggagagggaacacttgttgtaa >gi568815590r:17130723_17345152|GENSCAN_predicted_peptide_5|142_aa MYAQDSIELLTTSGIQFKKHEEEGIETQYFAELLMTSGVVLCEGVKWLSFHSGYDFGYLI KILTNSNLPEEELDFFEILRLFFPVIYDVKYLMKSCKNLKMFFEDHIDDAKYCGHLYGLG SGSSYVQNGTGNAYEEEANKQS >gi568815590r:17130723_17345152|GENSCAN_predicted_CDS_5|429_bp atgtatgcccaggactctatagagctactaacaacatctggtatccagtttaaaaaacat gaggaggaaggaattgaaacccagtactttgcagaacttcttatgacttctggagtggtc ctctgtgaaggggtcaaatggttgtcatttcatagcggttacgactttggctacttaatc aaaatcctaaccaactctaacttgcctgaagaagaacttgacttctttgagatccttcga ttgttttttcctgtcatttatgatgtgaagtacctcatgaagagctgcaaaaatctcaaa atgttctttgaagatcatattgatgatgccaaatattgtggtcatttgtatggccttggt tctggttcatcctatgtacagaatggcacagggaatgcatatgaagaggaagccaacaag cagtcatga >gi568815590r:17130723_17345152|GENSCAN_predicted_peptide_6|759_aa MWEPTLVSMSFRGTVHSHRYTYSLTLGQLSHTISPNVHSLGMWEEIRGPGRKSTYTRERA HSTQSQKPRRDTKGLLITGESISVLQFPRETRGRHRGIQAIDNSLPWQSASAKVLITLTG ATLTPKHCRVKQYEEAAADARSTCVAELALLLLLLAIETALGGGGGGGGGSGGGSGEGKG EQELRRTVLRRRFSHVLTGALDDDATRRGGGERPRGRGTQVTAARNSGSAGRTGLEKTRS PALGPRTSHPAPLSLENRRAEPLGEAGQSLPGPPARGPEEDELAFSPDQERLLLRGWVPR WPHQPPAAEAAPDRVPPELTLQVTGRCLSTGGKSRVCLGSNKYLEQCPSLMGELPKGISQ QCGKAGKRFLPMVSVEYISYNMELLNRHSDLVFSLAADRAVSRVEDGSLASPLCLWLSSG IFLPSVLMMLLFPQEKPVISVYPPIRHHLMDKQGVYVTSPLVNNFTMHSDLGKIIQSLLD EFWKNPPVLAPTSTAFPYLYSNPSGMSPYASQGFPFLPPYPPQEANRSITSLSVADTVSS STTSHTTAKPAAPSFGVLSNLPLPIPTVDASIPVGITSQNGFGYKMPDVPDAFPELSELS VSQLTDMNEQEEVLLEQFLTLPQLKQIITDKDDLVKSIEELARKNLLLEPSLEAKRQTVL DKYELLTQMKSTFEKKMQRQHELSESCSASALQARLKVAAHEAEEESDNIAEDFLEGKME IDDFLSSFMEKRTICHCRRAKEEKLQQAIAMHSQFHAPL >gi568815590r:17130723_17345152|GENSCAN_predicted_CDS_6|2280_bp atgtgggaaccaaccctggtcagcatgtcgttccgtggcacagtgcactcacacagatac acctactcactcacactgggacaacttagccacaccatttcacctaacgtgcacagcctt gggatgtgggaggaaatcagaggacccgggagaaaatccacatatacacgagaacgtgca cactccactcagtcccagaagccccgaagagacaccaaaggccttctcattactggagaa agcatctccgtactgcagtttcccagagaaacgagaggacggcaccgaggtattcaagcc atagataactctcttccctggcaaagcgccagtgcgaaggtgctgataacactgactgga gctacactgactcccaagcactgccgcgtgaaacaatacgaagaggctgcagcggatgca agatctacttgtgtcgctgagctcgcgctcctcctcctcctcctcgccatagagacagca ctcggcggcggtggcggtggcggtggcggtagcggcggcggcagcggcgaagggaaaggc gagcaggagctgcgccgcaccgtgctgcgccgtcgcttttcgcacgtcctgacgggggcg ctagatgatgacgcgacacgcagagggggcggagagcgcccccgggggcggggcacgcaa gtgacggcggcgcggaactcggggagcgcaggcaggacaggcttagagaagacgcggtcc ccagcgcttgggccacggacgtcccaccccgctcctctgtcgctggagaaccgccgggcc gagccactgggagaagcaggccagagccttccagggcctccggcccgtggacccgaggag gatgagctggctttttcccctgaccaagagcgcctcctcctccgcggctgggtcccccgg tggcctcaccagcctccagcagcagaagcagcgcctgatcgagtccctccggaactcaca ctccaggtgactggtcgctgcctctccaccggaggaaaaagtagggtttgccttggctct aataagtacctggagcagtgcccatctctaatgggggagctcccaaaggggatatcccaa cagtgtgggaaggctggcaaacggttcctgcccatggtttctgtggaatatatctcctac aacatggagctgctgaataggcactctgatttggtgttttctttggctgcagacagggca gtttccagggttgaggatggtagtcttgcctctcccctttgtctctggctgtcctcagga atatttctcccttcagtcctcatgatgcttctgtttcctcaggaaaaaccagtgatcagt gtttatccaccaatacgacatcacttaatggataaacaaggagtgtatgttacctctcca ttagtaaacaattttacaatgcactcagatcttggaaaaattattcagagtctgttggat gagttttggaagaatcctccagttttagctcctacttcaacagcatttccttatctatac agtaacccaagtgggatgtctccttatgcttctcagggttttccatttcttcctccatat cctccacaagaagcaaacaggagtatcacttctttatctgttgctgacactgtttcttct tcaacaacaagtcataccacagccaagcctgccgctccttcatttggtgtcctttcaaat ctgccattacccattcccacagtggatgcttcaataccggttggtatcacaagccaaaat ggttttgggtacaagatgccagatgtccctgatgcatttccagaactctcagaactaagt gtgtcacaactcacagatatgaatgaacaagaggaggtattactagaacagtttctgact ttgcctcaactaaaacaaattattaccgacaaagatgacttagtaaaaagtattgaggaa ctagcaagaaaaaatctccttttggagcccagcttggaagccaaaagacaaactgtttta gataagtatgaattacttacacagatgaagtccactttcgaaaagaagatgcaaaggcag catgaacttagtgagagctgtagtgcaagtgcccttcaggcaagattgaaagtagctgca catgaagctgaggaagaatctgataatattgcagaagacttcttggagggaaagatggaa atagatgattttctcagtagcttcatggaaaagagaacaatttgccactgtagaagagcc aaggaagagaaacttcagcaggcgatagcaatgcacagccaatttcatgctccactatag >gi568815590r:17130723_17345152|GENSCAN_predicted_peptide_7|494_aa MSDFLWGLENSGWLRHIKAIMDAGIFIAKAVSEEGASVLVHCSDGWDRTAQVCSVASLLL DPHYRTLKGFMVLIEKDWISFGHKFNHRYGNLDGDPKEISPVIDQFIECVWQLMEQFPCA FEFNERFLIHIQHHIYSCQFGNFLCNSQKERRELKFWSGMYNRFEKGMQPRQSVTDYLMA VKEETQQLEEELEALEERLEKIQKVQLNCTKVKSKQSEPSKHSGFSTSDNSIANTPQDYS GNMKSFPSRSPSQGDEDSALILTQDNLKSSDPDLSANSDQESGVEDLSCRSPSGGRHLPA GVDRQLIQESSGWHFAGAPLGQSFQRKEQAAIFAVLQPPLVIPRQTGCGVDPQQTPADLQ KRGLLERQLTESNSININKNDDHAKIPSEGHQQQRTKVDKSMKMRKNQCKKAENSKNQKA SSPPKDHNSSPAREQNWTENEFDELTEVGFRRWVINSLELKEHVLTQCKEAKNLDKRLEE LLTRIISLEKNINK >gi568815590r:17130723_17345152|GENSCAN_predicted_CDS_7|1485_bp atgagtgatttcctgtggggtctggagaactctggctggttaaggcacattaaagccata atggatgcaggaatcttcattgcaaaggcagtgtcagaggaaggggcaagtgtgcttgtt cactgttctgatggctgggacaggaccgctcaggtgtgctcggtggcaagcctgctgctg gaccctcactaccggactctgaagggcttcatggtattaattgaaaaggactggatttcc tttggtcataagtttaatcaccgatatggcaatctagatggtgacccaaaagaaatctct ccagttattgaccagttcattgagtgtgtttggcagttaatggaacaatttccctgtgcc tttgagttcaatgagaggtttttgattcacattcaacatcacatttattcctgccagttt ggaaacttcctatgtaacagccaaaaggagagacgagaactcaagttttggagtggaatg tataaccgctttgaaaaggggatgcagccccgacagtcagttacagattacctaatggca gtgaaggaagaaactcagcagctagaggaagaactagaggccctggaagaaaggctggaa aaaattcaaaaggtccagttaaattgcactaaggtgaagagtaagcaaagtgagcccagc aagcactcagggttttctacctcagacaacagcatagccaacactccccaggattacagt gggaatatgaaatcatttccatcccggagcccttcacaaggcgatgaagattctgctctg attctaacccaagacaatctgaaaagttcagatccagatctgtcagccaacagtgaccaa gagtccggggtggaggatttgagctgtcggtctccaagtggtggcagacacctcccagca ggggttgacagacagctcatacaggagagctctggctggcactttgcaggtgcccctctg ggacaaagcttccagaggaaggagcaggcagcaatctttgctgttctgcagcctccgctg gtgatacccaggcaaacaggatgtggagtagacccccagcaaactccagcagacctgcag aagaggggcttgttagaaagacaactaacagaaagcaatagcatcaacatcaacaaaaat gacgaccatgcaaaaattccatccgaaggtcaccaacaacaaagaacaaaggtagataaa tccatgaagatgaggaaaaaccagtgcaaaaaggctgaaaattcaaaaaaccagaaagcc tcttctcctccaaaggatcacaactcctcgccagcaagggaacaaaactggactgagaat gagtttgatgaattgacagaagtaggcttcagaaggtgggtaataaactccttagagcta aaggagcatgttctaactcaatgcaaggaagctaagaaccttgataaaaggttagaagaa ttgctaactagaataatcagtttagagaagaacataaataaatga >gi568815590r:17130723_17345152|GENSCAN_predicted_peptide_8|127_aa MIPDLFFHAKVLRPDGLKSSFGVERMVGGRALGVVVGGTIVVAIIMNAVHWCAVNLSSAQ LFLWTPQNCGSAAPKQHLVYVCWVSDTGIQRDALRCSEDSMPGPTNIEDKGLADVTLAKS PIAKLGS >gi568815590r:17130723_17345152|GENSCAN_predicted_CDS_8|384_bp atgatacccgatctgttcttccacgcaaaggttttgaggcctgatggattgaaatcttca ttcggtgtggaaaggatggttggaggcagagcacttggagttgtcgtgggaggcaccatc gtggtggccattatcatgaacgctgttcactggtgtgctgtaaacttgtcttcagcccag ctttttctgtggactccacagaactgtggaagcgctgctcctaagcagcatttggtgtac gtctgctgggtgtctgacactggcatccaacgtgatgcactcaggtgttcagaagattcc atgccggggcctacaaacatcgaagacaaaggactcgcagatgttacattggccaaatcc cccatagcaaagcttggcagctaa >gi568815590r:17130723_17345152|GENSCAN_predicted_peptide_9|159_aa ASICRSSQPLSGFSARCLEDEQMLQAIRKANPGSDFVYVVDTRPKLNAMANRAAGKGYEN EDNYSNIKFQFIGIENIHVMRNSLQKMLEVTPRGICRCSLPLPALRTGKRQRMQKPWPCA LAVHALSALLGKALESWSWEPGKWAVLMGNAGSGMKRDK >gi568815590r:17130723_17345152|GENSCAN_predicted_CDS_9|480_bp gcctccatctgccggagcagccagcccctgtccggcttcagtgcccggtgcctagaggac gagcagatgctccaggccattaggaaagccaatccaggaagtgacttcgtttatgtcgtt gacacccggcctaaacttaatgcaatggcaaatcgtgctgcagggaaaggctatgagaat gaagacaattattccaatatcaagtttcagtttatcgggatagagaacatccatgtcatg aggaacagtctgcagaaaatgctggaagtgaccccaagaggaatctgtcgctgtagtcta ccacttccggctctgcgaactgggaaaagacagaggatgcagaaaccatggccttgtgcc ttggctgtccatgcactgtctgcgcttctggggaaagccctggagagctggagctgggag cccgggaagtgggctgtcttgatgggaaatgcaggctctggaatgaagagggacaaataa