GENSCAN 1.0 Date run: 4-Nov-116 Time: 05:15:42 Sequence gi568815575f:78860521_79061537 : 201017 bp : 36.75% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 PlyA - 3957 3952 6 1.05 1.01 Sngl - 4441 4085 357 1 0 71 44 164 0.772 6.11 1.00 Prom - 4495 4456 40 -5.65 2.06 PlyA - 4611 4606 6 1.05 2.05 Term - 6223 5769 455 2 2 29 54 302 0.364 15.23 2.04 Intr - 13832 13742 91 2 1 111 54 59 0.681 3.45 2.03 Intr - 14091 13945 147 0 0 35 94 86 0.655 3.41 2.02 Intr - 15264 15044 221 1 2 93 54 104 0.283 4.60 2.01 Init - 37199 37061 139 2 1 65 103 69 0.036 6.45 2.00 Prom - 40498 40459 40 -3.55 3.00 Prom + 44846 44885 40 -3.75 3.01 Init + 49805 49903 99 1 0 64 109 19 0.112 1.92 3.02 Intr + 52798 53061 264 0 0 71 51 116 0.007 2.99 3.03 Term + 68209 68385 177 0 0 83 38 157 0.049 6.90 3.04 PlyA + 70197 70202 6 1.05 4.02 PlyA - 70324 70319 6 1.05 4.01 Sngl - 75506 75315 192 2 0 79 44 247 0.870 14.29 4.00 Prom - 77138 77099 40 -4.95 5.00 Prom + 77151 77190 40 -8.55 5.01 Init + 84615 84674 60 2 0 67 72 79 0.928 5.50 5.02 Intr + 84826 84975 150 0 0 15 110 71 0.671 1.44 5.03 Term + 85690 85809 120 0 0 92 54 126 0.958 7.09 5.04 PlyA + 86345 86350 6 1.05 6.00 Prom + 90502 90541 40 -6.05 6.01 Sngl + 100001 101020 1020 1 0 53 38 464 0.518 34.69 6.02 PlyA + 101292 101297 6 1.05 7.04 PlyA - 101961 101956 6 1.05 7.03 Term - 105419 104881 539 0 2 -19 44 208 0.147 -0.88 7.02 Intr - 111637 111406 232 1 1 76 92 140 0.200 9.62 7.01 Init - 115464 115036 429 0 0 82 86 162 0.680 11.70 7.00 Prom - 118887 118848 40 -5.15 8.00 Prom + 123622 123661 40 -4.55 8.01 Init + 129251 129595 345 1 0 35 58 196 0.249 8.56 8.02 Intr + 131610 131697 88 2 1 26 61 67 0.213 -3.58 8.03 Intr + 152128 152368 241 2 1 83 63 138 0.672 6.49 8.04 Term + 152909 153029 121 2 1 63 37 104 0.347 -0.23 8.05 PlyA + 155739 155744 6 1.05 9.02 PlyA - 158104 158099 6 1.05 9.01 Sngl - 167150 166815 336 2 0 71 48 162 0.815 6.28 9.00 Prom - 167204 167165 40 -5.65 10.02 PlyA - 167321 167316 6 1.05 10.01 Sngl - 168836 168366 471 2 0 70 39 270 0.760 16.17 10.00 Prom - 175161 175122 40 -5.35 11.00 Prom + 186563 186602 40 -4.35 11.01 Sngl + 195465 195962 498 2 0 59 38 232 0.773 11.09 11.02 PlyA + 198034 198039 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 54569 54751 183 1 0 99 42 177 0.834 8.99 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575f:78860521_79061537|GENSCAN_predicted_peptide_1|118_aa MGKDFMSKTPKAMATKAKIDKWDLIQLKSFCTAKETTIRVNKQPTKWEKIFTTYSSDKGL ISRIYNELKQIYKEKTNNPIKKWAKDMNRHFSKEDIYAAKKHMKKCSPSLTIREMQSK >gi568815575f:78860521_79061537|GENSCAN_predicted_CDS_1|357_bp atgggcaaggacttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgac aaatgggatctaattcaactaaagagcttctgcacagcaaaagaaactaccatcagagtg aacaagcaacctacaaaatgggagaaaattttcacaacctactcatctgacaaagggcta atatccagaatctacaatgaactcaaacaaatttacaaggaaaaaacaaacaaccccatc aaaaagtgggcgaaggatatgaacagacacttctcaaaagaagacatctatgcagccaaa aaacacatgaaaaaatgttcaccatctctgactatcagagaaatgcaaagcaaatga >gi568815575f:78860521_79061537|GENSCAN_predicted_peptide_2|350_aa MKDKIKFKKEPTDVIELKKSLQEFDNIIGSRKNKIDQDEEIISELNVRPFFCRAAMVFWG PTLDSIYLVSSNTQRCHQWRLQNRKCVCLLLLLGTPPKKGTNLMPTGTLLYEVSGDPCWE CLLKWLPTQVAATSAQLFAWDPRPWWCGFMRVPPDLRVAKIHGKSVVYQGCDNYLVTSNE RTRVPQLKMQNSFAILILLGKQGLEWTSSKLQQTCSCGSCLLEGKLTNRKDIHTKNPSVH HHHQRPKVDKTTKMGKKQSRKTGNSKKQSASPPPKECSSSPATEQSWTENDFDELREEGF RRSNYSELQEEIQTKGKEVENFEKNLDEYITRITNTEKCLKELMELKAKA >gi568815575f:78860521_79061537|GENSCAN_predicted_CDS_2|1053_bp atgaaggacaaaataaagtttaagaaagaaccgactgatgtgatagagctaaaaaagtca ctacaagaatttgataatataataggaagtaggaaaaacaaaatagaccaagatgaagaa ataatttcagagcttaatgtcaggcccttcttctgtagggctgctatggttttctggggg cccactctggactctatttacctggtttcctccaacacccagaggtgtcaccagtggagg ctgcagaacagaaaatgtgtctgcctgctccttcttctaggaactccacccaagaagggc accaacctgatgccaaccggaacgctcctgtatgaggtgtctggagacccctgttgggag tgcctgctcaagtggttgcccactcaagtggccgccacgagtgcacagctctttgcttgg gacccaaggccttggtggtgtgggttcatgagggtacctcctgatctgcgggttgcaaag atccatgggaaaagtgtggtttaccagggttgtgacaactacctagtcacatccaatgag agaacccgtgtacctcagttgaagatgcagaattcatttgccattttaattcttcttggc aaacagggtctagagtggacctctagcaaactccaacagacctgcagctgtgggtcctgt ctgttagaaggaaaactaacaaacagaaaggacatccacaccaaaaacccatctgtacat caccatcatcaaagaccaaaagtagataaaaccacaaagatggggaaaaaacagagcaga aaaactggcaactctaaaaagcagagcgcctctcctcctccaaaggaatgcagctcctca ccagcaacagaacaaagctggacggaaaatgactttgacgagttgagagaagaaggcttc agacgatcaaactactccgagctacaggaggaaattcaaaccaaaggcaaagaagttgaa aactttgaaaaaaatttagacgaatatataactagaataaccaatacagagaagtgctta aaggagctgatggagctgaaagccaaggcttga >gi568815575f:78860521_79061537|GENSCAN_predicted_peptide_3|179_aa MVRLPRTHTLTTGMGDYPPARASPSASFMEKCQTKELHHVATANTGPQGIWPGYHECSLK VQGLFSQLLVDVVRPETHPSGQWATFSPRADPEMLSQSLGLELRTPRHSWCSMPCSLAVP KVTLMQEVGSHVLGELHPYGFAEYSLPPGCLYWLVLSVCSFYRCTVQAVSGSTILRSGG >gi568815575f:78860521_79061537|GENSCAN_predicted_CDS_3|540_bp atggtgaggcttcccagaactcacactctgactactggaatgggtgattaccctccggct agggccagtccaagtgcttccttcatggaaaaatgtcagacaaaggagcttcaccacgtg gccactgccaacacaggcccacagggaatatggccaggatatcatgaatgttcacttaag gtccaagggctctttagtcagcttctggtggatgttgtcaggcctgagactcacccttca ggacaatgggctaccttctcacccagggcagatccagaaatgctatcccagagccttggc ctggaattgcggaccccgagacattcttggtgctctatgccctgttctctagctgtacct aaggtcacactgatgcaagaggtgggctcccatgtccttggggagctccacccctatggc tttgcagagtacagccttcctcctggatgcctttattggctggtattgagtgtctgcagc ttttacagatgcacagtgcaagctgtcagtggatctacgattctgcggtctggaggatga >gi568815575f:78860521_79061537|GENSCAN_predicted_peptide_4|63_aa MWNSSYGGTQTTGMLLTDLQREGTESGQREDTEAGLKGEKVGNSMQDITSGIIAAPAGLQ ENQ >gi568815575f:78860521_79061537|GENSCAN_predicted_CDS_4|192_bp atgtggaacagctcctatggagggacccagacgactggcatgttgctaacagatcttcag agagaaggcactgagagtggacagagggaagatacagaagctgggttgaagggggagaaa gttgggaactctatgcaggacattacatctggaatcatcgctgcccccgcagggctccaa gagaaccaatga >gi568815575f:78860521_79061537|GENSCAN_predicted_peptide_5|109_aa MEPTGISENLDMSVVKEGDRVSVVRHQINSTSSFNNKCVSYQQDPCRQSKAFYPLLPAKK QECLSQLSSRDRKGESIPDDFRVWSEWFLDPLKANFDRSSALLDSKAHE >gi568815575f:78860521_79061537|GENSCAN_predicted_CDS_5|330_bp atggaacctactggaatcagtgaaaatctagatatgtcggttgtcaaggaaggagatagg gtttctgtggtgagacaccagataaactcaacttcctctttcaacaacaaatgtgtcagt tatcagcaggatccatgccgccagagtaaagctttctaccctttactccctgcaaagaaa caagagtgcttatcccagctaagctccagggataggaaaggagaaagtatcccagatgac ttcagagtctggtcagagtggttccttgatcctttgaaagctaattttgaccgaagttca gccttactggatagcaaagcccatgagtga >gi568815575f:78860521_79061537|GENSCAN_predicted_peptide_6|339_aa MANLDKYTETFKMGSNSTSTAEIYCNVTNVKFQYSLYATTYILIFIPGLLANSAALWVLC RFISKKNKAIIFMINLSVADLAHVLSLPLRIYYYISHHWPFQRALCLLCFYLKYLNMYAS ICFLTCISLQRCFFLLKPFRARDWKRRYDVGISAAIWIVVGTACLPFPILRSTDLNNNKS CFADLGYKQMNAVALVGMITVAELAGFVIPVIIIAWCTWKTTISLRQPPMAFQGISERQK ALRMVFMCAAVFFICFTPYHINFIFYTMVKETIISSCPVVRIALYFHPFCLCLASLCCLL DPILYYFMASEFRDQLSRHGSSVTRSRLMSKESGSSMIG >gi568815575f:78860521_79061537|GENSCAN_predicted_CDS_6|1020_bp atggctaaccttgacaaatacactgaaacattcaagatgggtagcaacagtaccagcact gctgagatttactgtaatgtcactaatgtgaaatttcaatactccctctatgcaaccacc tatatcctcatattcattcctggtcttctggctaacagtgcagccttgtgggttctgtgc cgcttcatcagcaagaaaaataaagccatcattttcatgatcaacctctctgtggctgac cttgctcatgtattatctttacccctccggatttactattacatcagccaccactggcct ttccagagagccctttgcctgctctgcttctacctgaagtatctcaacatgtatgccagc atttgtttcctgacgtgcatcagtcttcaaaggtgcttttttctcctcaagcccttcagg gccagagactggaagcgtaggtacgatgtgggcatcagtgctgccatctggatcgttgtg gggactgcctgtttgccatttcccatcctgagaagcacagacttaaacaacaacaagtcc tgctttgctgatcttggatacaagcaaatgaatgcagttgcgttggtcgggatgattaca gttgctgagcttgcaggatttgtgatcccagtgatcatcatcgcatggtgtacctggaaa actactatatccttgagacagccaccaatggctttccaagggatcagtgagaggcagaaa gcactgcggatggtgttcatgtgtgctgcagtcttcttcatctgcttcactccctatcat attaactttattttttacaccatggtaaaggaaaccatcattagcagttgtcccgttgtc cgaatcgcactgtatttccaccctttttgcctgtgccttgcaagtctctgctgccttttg gatccaattctttattactttatggcttcagagtttcgtgaccaactatcccgccatggc agttctgtgacccgctcccgcctcatgagcaaggagagtggttcatcaatgattggctaa >gi568815575f:78860521_79061537|GENSCAN_predicted_peptide_7|399_aa MAIRKEKEKNGIQLGKEEVTLSLFANDMIVYLENPIISAQNLLKLISNFSKVSGYKINLQ KSQAFLYTHKRQTETQIMSELPFTIATKRIKYLGIQLTRDVKDLFKENYKPLLKEIKEDT DKWKNIPCSWIGRINMKMAILPKALRPKRKNCLHRQHSGPGCPTQYQDTVPHIPAIPAPA SARSGPDTVWAVAPGVKGCHNPWRLSCGVKPIINEQCASVAWVTEQDSDKKERKERKKER KKERKKERKKERKKERKRKKRKKKERERKKERKKERKKERKKERKERKERRKEGRKEGKK GRKEGKKERRKEGEKKKEREREEGREEGKEGREKERKTERQKERKKERKKERKKERKKEK REGGRKGRKEGRKKERQKERRKERKKERRKKKKKKERKK >gi568815575f:78860521_79061537|GENSCAN_predicted_CDS_7|1200_bp atggcaatcaggaaagagaaagaaaaaaatggtattcaattaggaaaagaggaagtcaca ttgtccctgtttgcaaatgacatgattgtatatttagaaaaccccatcatctcagcccaa aatcttcttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaatctgcaa aaatcacaagcattcttatatacccataaaagacaaacagagacccaaatcatgagtgaa ctcccattcacaattgctacaaagagaataaaatatctaggaatccaacttacaagggat gtgaaggacctcttcaaggagaactacaaaccactgctcaaggaaataaaagaggacaca gacaaatggaagaacattccatgctcatggataggaagaataaatatgaaaatggccata ctgcccaaggccctgaggcccaagaggaaaaattgtcttcatcggcaacattcagggcct ggctgccctacacagtatcaggacactgttccccacatcccagccattccagctccagcg tcagctcgatcaggcccagatacagtttgggctgtagctccgggggtcaaaggctgccat aatccttggaggctatcatgtggtgttaagcctatcatcaatgaacagtgtgcaagtgtg gcatgggtgacagagcaagactctgacaagaaagaaagaaaagaaagaaagaaagaaaga aagaaagaaagaaagaaagaaagaaagaaagaaagaaagaaagaaaggaaaagaaagaaa agaaagaaaaaagaaagagaaagaaagaaagaaagaaagaaagaaagaaagaaagaaaga aagaaagaaaggaaagaaaggaaggaaagaaggaaagaaggaaggaaggaaggaaagaaa ggaaggaaagaaggaaagaaggaaagaaggaaggaaggagaaaagaaaaaagaaagagag agagaggaagggagggaggaagggaaggaaggaagggagaaagaaagaaagacagaaaga cagaaagaaagaaagaaagaaagaaagaaagaaagaaagaaagaaagaaagaaagaaaag agagagggagggaggaagggaaggaaggaagggagaaagaaagaaaggcagaaagaaaga aggaaagaaagaaagaaagaaagaagaaagaagaagaagaagaaagaaagaaagaaatga >gi568815575f:78860521_79061537|GENSCAN_predicted_peptide_8|264_aa MYGNTWMSRQKLASRAGPSWITSGWTVWKENMGSELPHRVPTGAPPSGAVKRRSPSSTPK NGRSTDSLYCEHGKAADTQCQFVKSARREAEPCKATEAVLSKVTGTHLLHHHDLDDGEKG IRQLNSCISGTRDGSTCSSNKTTSGPSAAGLLQFVGGPLQTLSACVSPAEAAEQQRFLPV PSSGSFVSEGNSPDSIQSSPEMTVDPCWEVSPNQEARGSGTHLRRGVNGSVLLAFQVPLG FEKKLLQLARCLLSFVLEVQGLVA >gi568815575f:78860521_79061537|GENSCAN_predicted_CDS_8|795_bp atgtatggaaacacatggatgtccaggcagaagttagcatcaagggcagggccgtcatgg ataacgtctggttggacagtatggaaggaaaacatgggatcagagctcccacacagagtt cctactggggcaccacctagtggagctgtaaaaagaagatcaccatcctccactcccaag aatggtagatctactgacagcttgtactgtgaacatggaaaagctgcagatactcaatgc cagtttgtaaaatcagccaggagggaggctgaaccttgcaaagccacagaggctgtcctg tccaaggtcacaggaacccacctcttgcatcaccatgacctggatgatggagaaaaaggt attcgccagctaaatagctgcatatcaggtaccagggatgggtctacttgctcaagcaac aaaaccacatcaggcccttctgctgcaggtttgctgcagtttgttggaggtccactccag accctgtctgcctgtgtatcaccagcagaggctgcagaacagcaaagatttctgcctgtt ccttcttctggaagcttcgtctcagaggggaactcaccagattccatccagagctctcct gagatgactgtcgacccctgctgggaagtgtctcccaatcaggaggcaagggggtcaggg acccacttgaggaggggagtgaacggttctgtcttgcttgcattccaggtgccactgggg tttgaaaagaaactcctgcagctagctcggtgtctcctcagttttgtgctggaagtccag ggcctggtagcgtaa >gi568815575f:78860521_79061537|GENSCAN_predicted_peptide_9|111_aa MGKDFMSKTPKAMATKAKIDKWDLIQLKSFCTAKETTIRVKRQPTEWEKIFAIYSSDKGL ISRIYKELKQIYKKKTNNPIKKWVKDKNRHFSKEDIYVSNRHMKNAHHHWP >gi568815575f:78860521_79061537|GENSCAN_predicted_CDS_9|336_bp atgggcaaggacttcatgtctaaaacaccaaaagcaatggcgacaaaagccaaaattgac aaatgggatctaattcaactaaagagcttctgcacagcaaaagaaactaccatcagagtg aagaggcaacctacagaatgggagaaaatttttgcaatctactcatctgacaaagggcta atatccagaatctacaaagaactcaaacaaatttacaagaaaaaaacaaacaaccccatc aaaaagtgggtgaaggataagaacagacacttctcaaaagaagacatttatgtgtccaac agacacatgaaaaatgctcatcatcactggccataa >gi568815575f:78860521_79061537|GENSCAN_predicted_peptide_10|156_aa MDKFLNTYTLPRLNQEEVESLNRPITGSEIETITNSLPTKKSPGPDRFTAEFYQTYKEEL VQLLRKLFQSKGKEGIPPKSFYEAGIILIPKPGRDTTKKKENFRPISLMNIDAKILNKIL ANRIQQHIKQLIHHDQVGFIPGMQDWFNIHKSINVI >gi568815575f:78860521_79061537|GENSCAN_predicted_CDS_10|471_bp atggataaattcctcaacacatacactctcccaagactaaaccaggaagaagttgaatcc ctgaatagaccaataacaggctctgaaattgagacaataactaatagcctaccaaccaaa aaaagtccaggaccagacagattcacagccgaattctaccagacgtacaaggaggagctg gtacaattacttcggaaactattccaatcaaaaggaaaagagggaatcccccctaaatca ttttatgaggctggcatcatcctgataccaaagcctggaagagacacaacaaaaaaaaaa gagaattttagaccaatatccctcatgaacattgatgcaaaaatcctcaataagatactg gcaaaccgaatccagcagcacatcaaacagcttatccaccatgatcaagtgggcttcatc cctgggatgcaagactggttcaacatacacaaatcaatcaacgtaatctag >gi568815575f:78860521_79061537|GENSCAN_predicted_peptide_11|165_aa MEDQMNEMKWEEKFGEKRIKRNEQSLQEIWDYVKRPNLHLIGVPESERENGTKLENTLQD IIQENFPNLARQANIQIQEIQRMPQRYSSRRATPRHIIVRFTKVEMKEKMLRAAREKGLV THKGKPIRLTVDLSAETLQARREWGPIFNILKEKNFPCTCLCPEW >gi568815575f:78860521_79061537|GENSCAN_predicted_CDS_11|498_bp atggaagatcaaatgaatgaaatgaagtgggaagagaagtttggagaaaaaagaataaaa agaaatgagcaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacatctg attggtgtacctgaaagtgaaagggagaatggaaccaagttggaaaacactctgcaggat attatccaagagaacttccccaatctagcaaggcaggccaacattcagattcaggaaata cagagaatgccacaaagatactcctcgagaagagcaactccaagacacataattgtcaga ttcaccaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaaggtctggtt acccacaaagggaagcccatcagactaacagtggatctctcagcagaaactctacaagcc agaagagagtgggggccaatattcaacattcttaaagaaaagaattttccttgcacatgc ctatgtccggaatggtaa