GENSCAN 1.0 Date run: 5-Nov-116 Time: 10:14:25 Sequence gi568815597f:170564219_170836183 : 271965 bp : 38.86% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 6059 6295 237 1 0 77 54 178 0.862 8.48 1.02 PlyA + 6377 6382 6 1.05 2.00 Prom + 7416 7455 40 -6.15 2.01 Sngl + 9727 10428 702 0 0 74 35 227 0.516 11.96 2.02 PlyA + 10770 10775 6 1.05 3.06 PlyA - 11179 11174 6 1.05 3.05 Term - 35419 35336 84 1 0 80 42 103 0.859 1.57 3.04 Intr - 37641 37287 355 2 1 5 39 297 0.581 10.87 3.03 Intr - 37788 37689 100 1 1 12 76 49 0.551 -5.65 3.02 Intr - 38435 37848 588 0 0 10 47 361 0.381 15.87 3.01 Init - 39472 39433 40 1 1 73 105 71 0.388 7.70 3.00 Prom - 44503 44464 40 -4.15 4.03 PlyA - 45261 45256 6 1.05 4.02 Term - 46944 46383 562 2 1 18 41 236 0.761 4.76 4.01 Init - 49436 49291 146 2 2 68 98 152 0.758 13.84 4.00 Prom - 51945 51906 40 -8.55 5.00 Prom + 58229 58268 40 -2.55 5.01 Init + 61822 61840 19 0 1 73 84 41 0.575 0.61 5.02 Intr + 66792 66932 141 1 0 49 70 170 0.423 10.70 5.03 Intr + 67580 67711 132 0 0 74 82 35 0.251 1.20 5.04 Intr + 72737 72911 175 0 1 53 10 131 0.095 -0.12 5.05 Intr + 95563 95728 166 1 1 66 51 111 0.201 4.24 5.06 Intr + 96368 96446 79 1 1 84 37 104 0.644 3.11 5.07 Intr + 98050 98210 161 2 2 110 37 58 0.558 1.69 5.08 Intr + 99944 100241 298 1 1 122 79 380 0.707 36.02 5.09 Intr + 100616 100768 153 0 0 -33 75 148 0.477 0.52 5.10 Intr + 101209 101277 69 2 0 90 87 17 0.372 0.04 5.11 Intr + 101414 101529 116 0 2 65 58 46 0.530 -1.45 5.12 Intr + 102518 102619 102 1 0 36 47 105 0.271 0.65 5.13 Term + 104819 105031 213 1 0 45 53 173 0.851 5.75 5.14 PlyA + 105158 105163 6 1.05 6.00 Prom + 108326 108365 40 -5.25 6.01 Init + 119782 119974 193 0 1 75 38 125 0.135 5.38 6.02 Intr + 131817 131909 93 1 0 64 102 25 0.024 0.52 6.03 Term + 136462 136586 125 2 2 50 39 127 0.106 1.57 6.04 PlyA + 136793 136798 6 1.05 7.00 Prom + 145132 145171 40 -2.45 7.01 Init + 154638 154713 76 2 1 57 98 27 0.473 1.90 7.02 Intr + 155508 155683 176 1 2 91 93 176 0.956 17.14 7.03 Intr + 162002 162183 182 1 2 78 68 152 0.758 9.94 7.04 Term + 171830 171968 139 2 1 121 55 80 0.631 4.45 7.05 PlyA + 173429 173434 6 1.05 8.00 Prom + 175664 175703 40 -5.25 8.01 Init + 180170 180315 146 1 2 41 94 105 0.099 6.06 8.02 Intr + 198882 198961 80 0 2 105 99 63 0.017 7.38 8.03 Term + 233032 233198 167 2 2 35 48 161 0.575 3.90 8.04 PlyA + 233340 233345 6 1.05 9.00 Prom + 239737 239776 40 -3.45 9.01 Init + 243205 243280 76 0 1 39 19 137 0.482 3.20 9.02 Intr + 243334 243485 152 2 2 62 73 69 0.852 1.76 9.03 Term + 247624 247860 237 0 0 59 54 190 0.814 7.88 9.04 PlyA + 247947 247952 6 1.05 10.00 Prom + 248977 249016 40 -6.15 10.01 Init + 250749 251323 575 2 2 72 39 276 0.131 15.73 10.02 Intr + 265180 265318 139 1 1 48 42 118 0.027 2.75 10.03 Intr + 269178 269330 153 2 0 77 84 68 0.032 4.65 10.04 Term + 269427 269699 273 2 0 -41 42 195 0.017 -3.51 10.05 PlyA + 270977 270982 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 185823 185712 112 0 1 51 84 99 0.832 6.22 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:170564219_170836183|GENSCAN_predicted_peptide_1|78_aa DQNSSPAREQNWTENEFDKWTEVVFRRWVITNSTELKEHVLTQCKEAKNLDKRLEELLTR ITSLEKNINDLMDLKNTA >gi568815597f:170564219_170836183|GENSCAN_predicted_CDS_1|237_bp gatcaaaactcctcaccagcaagggaacaaaactggacggaaaatgaatttgacaaatgg acagaagtagtcttcagaaggtgggtaataacaaattccaccgagttaaaggagcatgtt ctaacccaatgcaaggaagctaagaaccttgataaaaggctagaggaattgctaactaga ataaccagtttagagaagaacataaatgacctgatggacctgaaaaacacagcatga >gi568815597f:170564219_170836183|GENSCAN_predicted_peptide_2|233_aa MLPDFKLCYKATVTKTACYWHQNRYTDQWNRTETSEITPHIYSHLIFDKPDKNKQWGKDS LFNKWCWENWLAICRKLKLDPLLTPYTKINSRWIKDLNIRPKSIKTLEENLGNTIQDIGM GKDFMTETPKAVATKAKNDKWDPIKLKSFCTAKETIIRVNRQPTEWEKIFAIYPSDKGLI SRICKELQQIYKKKTNNPIKKWVKDLDRRFSKEDIYAANKHLKKCSSLFIREM >gi568815597f:170564219_170836183|GENSCAN_predicted_CDS_2|702_bp atgctacctgacttcaaactatgctacaaggctacagtaaccaaaacagcatgttactgg caccaaaacagatatacagaccaatggaacagaacagaaacctcagaaataacaccacac atctacagccatctgatctttgacaaacctgacaaaaacaagcaatggggaaaggattcg ctatttaacaaatggtgttgggaaaactggctagccatatgcagaaaactgaaactggac cccttgctcacaccttatacaaaaattaactcaagatggattaaagatttaaatataaga cctaaaagcataaaaaccctagaagaaaatctaggcaataccattcaggacataggcatg ggcaaagacttcatgactgaaacaccaaaagcagtggcaacaaaagccaaaaatgacaaa tgggatccaattaaactaaagagcttctgcacagcaaaagaaactatcatcagagtgaac aggcaacctacagaatgggagaaaatttttgcaatctatccatctgacaaagggctaata tccagaatctgcaaagaacttcaacaaatttacaagaaaaaaacaaacaaccccatcaaa aagtgggtgaaggatttggacagacgattttcaaaagaagacatttatgcagccaacaaa catttgaaaaaatgctcatcacttttcattagagaaatgtaa >gi568815597f:170564219_170836183|GENSCAN_predicted_peptide_3|388_aa MKENDAREMGDKQGRGWNSLDGSEEDRKMWESLELPRDLLNGFDQNADNDMDNEIQSEMV SDGDEELVGNWSKGDSCYVLAKRLVAFCPFPRDLWDFGLERDDLGYLVEEISKQQCIQEV TRVLLKAFSFIRETDHKSSENLQPDNAIENKIAFSKKKFKPVAEICISNKEPNVNPQDNG EMSPGHVRDLHGSPSQHRPRGLGGKNGFTAAPAMAEGSNIELWLWLQRVQAPNHGSFHVV LSLVQKMYGNTWMTRQKLAAGAGSSWRTSARAAQKGNVGLEPPSTVPTGVPPSGAVRRRP PSSRPQNGRSTDSLHHAPGKATSTQCQPLKAAMREAVPCKATEAELPKTMGTHLLHQHDL DLIEHLDVEPADMEGHCDAKIMTKTVPN >gi568815597f:170564219_170836183|GENSCAN_predicted_CDS_3|1167_bp atgaaggaaaatgatgctcgagagatgggagacaaacaaggcagaggttggaacagtttg gatggctcagaagaagacaggaaaatgtgggaaagtttggaactccctagagacttgtta aatggctttgatcaaaatgctgataatgatatggacaatgaaatccagtctgagatggtc tcagatggagatgaggaacttgttgggaactggagcaaaggtgactcttgttatgtttta gcaaagagactggtggcattttgtcccttccctagagatttgtgggactttggacttgag cgggatgatttggggtacctggtggaagaaatttctaagcagcaatgcattcaagaggtt actcgtgtgctgttaaaggcattcagttttataagggaaacagaccataaaagttcagaa aatttgcagcctgacaatgccatagaaaataaaatagcattttctaagaagaaattcaag ccagttgcagaaatttgcataagtaataaggagccgaatgttaatccccaagacaatggt gaaatgtctccaggccatgtcagagaccttcatggcagcccctcccaacacaggcccaga ggcctaggaggaaaaaatggtttcacagctgctccagctatggctgaagggtccaacata gagctctggctgtggcttcagagggtgcaagccccaaaccatggcagcttccatgtggtg ttgagcctagttcagaagatgtatggaaacacctggatgaccaggcagaagcttgctgca ggagcaggttcctcatggagaacctctgctagggcagcacagaagggaaatgtggggttg gaacccccaagcacagtccctactggagtaccacccagtggagctgtgagaagaaggcca ccatcctccagaccccagaatggtagatccactgacagcttgcaccatgcacctggaaaa gccacaagcactcaatgccagcccctgaaagcagctatgagggaggcggtaccctgcaaa gccacagaggcagagctgcccaagaccatgggaacccacctcttgcatcagcatgacctg gatttgattgaacacttagatgtagaaccagcagatatggagggacactgtgatgccaaa ataatgaccaaaacagttccaaattga >gi568815597f:170564219_170836183|GENSCAN_predicted_peptide_4|235_aa MRKIQHKKAQNSKNQNASSPSKNHNFLPAKEQNWTENEFDEMTEVGFRRSWFFDKTNKID RLLARPIRKKRQNNQIDTVKNDKGDITTDPTEIQTTIREYYKYFYTNKLENAEEMDNFLD TSTLQRLNQEEDESLNRPITSSEIEAVIDSLSTKIRLGTDGFTTEFYQRYNEELVPFLLK LFQTIEKEGLLSNSFYEASIILIPKPGKDTTNKENFRPISLMNIDAKILNKILAN >gi568815597f:170564219_170836183|GENSCAN_predicted_CDS_4|708_bp atgaggaaaatccagcacaaaaaggctcaaaattccaaaaaccagaatgcctcttctcct tcaaagaatcacaacttcttgccagcaaaggaacaaaactggaccgagaatgagtttgat gaaatgacagaagtgggcttcagaaggagctggttttttgacaagactaacaaaatagac agactgctagccagaccaataaggaagaaaagacagaataatcaaatagacacagtaaaa aatgataaaggggatatcaccactgatcccacagaaatacaaactaccatcagagaatac tataaatacttctacacaaataaactagaaaatgcagaagaaatggataatttcctggac acatctaccctccaaagactaaatcaggaagaagatgaatccctgaatagaccaataaca agttctgaaattgaggcagtaattgatagcctatcaacgaaaatacgcttaggtacagat ggattcacaactgaattctaccagaggtacaatgaggagctggtaccattccttttgaaa ctattccaaacaatagaaaaagagggactcctctctaactcattttatgaggccagcatc atcctgataccaaaacctggcaaagacacaacaaacaaagaaaatttcaggccaatatcc ctgatgaacatcgatgcaaaaatcctcaataaaatactggcaaactga >gi568815597f:170564219_170836183|GENSCAN_predicted_peptide_5|607_aa MLLRLVWAQAQGDPNSVPEALAGVIGDPAGKPRPLRKDGSELGLKMHSDRRPAAPVPVPH PRRPQSPPPYPGPPPPSPPTNVPPPHHQCNSYSYFFIALANDYTDPHLTLPSVNCSLASS TEAKGRMPLPYGNTAEKQHTLLTLHFSGEVNWIATPTWSLEVKERLGAATYGALGRSSRL AYGCPGESITSGSEGFGSAFSSSSTPFVVYERWAPRPCASRAAGAALHRSDKPSSLAARL DAEPHGSLPNSCLWECSCLLQGIWAAGPSGFSVLAWGSNFGLTVEVTKLAGRLVLIRAGR GGWVGSVGETMTSSYGHVLERQPALGGRLDSPGNLDTLQAKKNFSVSHLLDLEEAGDMVA AQADENVGEAGRSLLESPGLTSGSDTPQQDKSQRCLGKQQVKDAESGWKGFSRAAYDGLR EGTRHREQDLALLRLHSPRGHRDMWPDMYRYSLFSSRLQESELSYSQRIDIQIKLHIETI GNRVRAQWWPRGPRYSKPQREPLILQGKRDENCTGCLALSGRHSQEGSSGFVVALEWNFG EIDSPVTPTHFWVKDDLEFEVWGDRPVKSEGLTGSPDPPTLECSEYPPPCFRASLPCVRR RAVLHFG >gi568815597f:170564219_170836183|GENSCAN_predicted_CDS_5|1824_bp atgttgctcaggctggtctgggctcaggcccagggagatccaaattctgtccctgaggct ctggctggagttattggagatcctgcaggaaagccccgcccactgaggaaggatgggtca gaattaggcctgaagatgcactctgaccgcaggccagctgcccccgtccccgtcccccat ccccgtcgtccccagtccccacccccgtaccccggacccccacccccgtccccccccacc aatgtaccacccccacaccaccagtgtaattcttattcttacttctttatagccctagca aatgactacactgatcctcacctgacactaccttctgttaattgttctttggcttcaagc actgaagccaaaggcaggatgccgctaccctatggaaacacagcagaaaaacaacacacg ctccttacccttcacttttcaggagaagtaaactggatagccaccccgacctggagcctg gaggtcaaggagaggctaggagcagcgacctatggcgctctgggaaggagctcccggctt gcctatgggtgccctggggagagcatcacttctggctcggagggtttcgggagcgccttc tccagctcaagcactccttttgtcgtttatgagaggtgggctccccgaccttgcgcttct agagctgcaggagcggcgctgcacaggtctgacaagcccagctcattggcggccaggctg gacgcagaaccgcacggaagcctccccaactcctgcctctgggagtgctcctgcctgctt caaggaatctgggcagcgggaccctctgggtttagcgtcctcgcctggggaagcaacttt ggccttaccgtagaagttacaaaattggcggggcgtttggtgttgattcgagcgggaaga ggggggtgggtgggatcggtgggggagaccatgacctccagctacgggcacgttctggag cggcaaccggcgctgggcggccgcttggacagcccgggcaacctcgacaccctgcaggcg aaaaagaacttctccgtcagtcacctgctagacctggaggaagccggggacatggtggcg gcacaggcggatgagaacgtgggcgaggctggccggagcctgctggagtcgccgggactc accagcggcagcgacaccccgcagcaggacaagagccagcgctgtttgggaaagcagcag gttaaagatgctgagtcgggttggaagggcttctctcgggcagcgtatgacggcttgagg gagggcaccaggcacagggagcaggacttggcgctgctcaggctgcactctccacgcggg caccgtgacatgtggccagatatgtacagatactccctcttctcctcgaggttgcaagag tctgaacttagctactctcaaagaatagacatacaaatcaaactgcacatagaaacaatt ggaaacagggtccgggcacagtggtggccccgcgggccacgctacagcaagccgcagaga gagcctctgatactgcaggggaagagggatgaaaattgtaccggttgtttggctctgagt ggacgccactcccaagaagggagctcgggattcgtcgtggccttagagtggaattttggg gaaattgatagtccagtgactcctacccacttttgggtgaaggacgatttggaatttgaa gtgtggggagacaggcctgtgaagtccgaaggactcactgggtctccagatcccccaact ctggaatgcagtgaatacccacccccatgtttcagagcatctctcccatgtgttagacgt cgtgcagtcctgcattttggctga >gi568815597f:170564219_170836183|GENSCAN_predicted_peptide_6|136_aa MPSTLEDIKEVNGMTSCQSSSWRDGSKIVPHIVHIQHDFNISAPGVIDPVPPLFPITQWS LVLSAVDMSKDQYRKRDRRDQHVPHTCTGHSFKVQAKTKLQVTRMLPGDKTALITHFAAE NPLLTVLPWRMQFKES >gi568815597f:170564219_170836183|GENSCAN_predicted_CDS_6|411_bp atgccaagcactttggaggatattaaagaagtgaatggcatgacctcttgccaatcatcc agctggcgagatggatctaaaattgtgcctcacatagttcatattcagcatgattttaac atatctgctcctggggtcatagatcctgttcctcctcttttccctatcactcaatggtcc ctcgtgctctcagcagtagacatgtccaaagaccagtacagaaaaagagacagaagggat cagcatgtgcctcacacctgcactggtcactcattcaaggtgcaagccaaaaccaaactc caagttaccaggatgctacctggagacaaaactgcccttatcacccactttgctgctgaa aacccgttactgactgtcctaccctggaggatgcagttcaaagagtcttag >gi568815597f:170564219_170836183|GENSCAN_predicted_peptide_7|190_aa MDNLDNYLTTLPSGELRENQGRILTDDQLNSEEKKKRKQRRNRTTFNSSQLQALERVFER THYPDAFVREDLARRVNLTEARVQVWFQNRRAKFRRNERAMLANKNASLLKSYSGDVTAV EQPIVPRPAPRPTDYLSWGTASPYSAMATYSATCANNSPAQGINMANSIANLRLKAKEYS LQRNQVPTVN >gi568815597f:170564219_170836183|GENSCAN_predicted_CDS_7|573_bp atggacaacctggacaattatctcaccacgctaccttcaggggaactcagagaaaaccaa ggaaggatcttgacagatgaccagctgaactcagaagaaaaaaagaagagaaagcagcga aggaataggacaaccttcaatagcagccagctgcaggctttggagcgtgtctttgagcgg acacactatcctgatgcttttgtgcgagaagaccttgcccgccgggtgaacctcaccgag gcgagagtgcaggtgtggtttcagaaccgaagagccaagttccgcaggaatgagagagcc atgctagccaataaaaacgcttccctcctcaaatcctactcaggagacgtgactgctgtg gagcagcccatcgtacctcgtcctgctccgagacccaccgattatctctcctgggggaca gcgtctccgtacagcgccatggctacttattctgccacatgtgccaacaatagccctgca cagggcatcaacatggccaacagcattgccaacctgagactgaaggccaaggaatatagt ttacagaggaaccaggtgccaacagtcaactga >gi568815597f:170564219_170836183|GENSCAN_predicted_peptide_8|130_aa MVAGKRACAGELPFLKPSDLMGLIHYQESSMVETTYMIQVSPFGPLLDRDMDEAENHHSQ QTNTGTEKQTPHVLTQEEMQTYRDAMDVHTPRKDNVWTQQEGRPLQVRETGVRKNQPANT LILDFKPPKL >gi568815597f:170564219_170836183|GENSCAN_predicted_CDS_8|393_bp atggtggcaggcaaaagagcttgtgcaggggaactcccattcttaaaaccatcagatctc atgggacttattcactaccaggagagcagtatggtggaaaccacctacatgattcaagta tctccatttggtcccctccttgacagggacatggatgaagctgaaaaccatcattctcaa caaactaacacaggaacagaaaaacaaacaccgcatgttctcactcaagaagaaatgcag acatacagagatgccatggatgtgcacacaccgaggaaagacaacgtatggacacagcaa gaaggccgccctctgcaagtcagggagacaggcgtcaggaaaaaccaacctgccaacacc ttgatcttggacttcaagcctcccaaactgtga >gi568815597f:170564219_170836183|GENSCAN_predicted_peptide_9|154_aa MIKYSSPMKSTFVDPVDPVAVAVIYSLAAGLVIIRIPFFQLVPLKEEKMVLLNMDYYEMA SLSLVTHRNEAPFSGRDRSSLPATEQSWMENDFDKLTEVGFRRSVITNFSELKEHVLTHH KEAKNLEKRLDKWQTRINSVEKTLNDLMELKTMA >gi568815597f:170564219_170836183|GENSCAN_predicted_CDS_9|465_bp atgattaaatatagcagcccaatgaaaagcacctttgtggaccctgtggatcccgtggca gtggcggtcatctactcacttgcagcaggattggtaatcattcgcattcctttcttccag ttagtacccctaaaggaggaaaaaatggttttgcttaacatggactattatgaaatggca tctctctctctggtgactcacaggaatgaggcgcccttctctggaagggatcgcagctcc ttgccagcaacggaacaaagctggatggagaatgactttgacaagttgacagaagtaggc tttagaaggtcagtaataacaaacttctctgagctaaaggagcatgttctaacccatcac aaggaagctaaaaaccttgaaaaaaggttagacaaatggcaaactagaataaacagtgta gagaagaccttaaatgacctgatggagctgaaaaccatggcgtga >gi568815597f:170564219_170836183|GENSCAN_predicted_peptide_10|379_aa MSLFADDMILSVENPIVSPQNLLKLISNFSKVLGYKINVQKSQAFLYTNNRQRGSQIMSE LPFTIATKRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDKNKWKNIPCSWIGRINIMKMA IQPKVIYRFNAIPIKLPVTFFTELEKTTLRSIWNQKRAHIAKTILSKKNKAGGITLPDFK LYYKATVTKTAWCFSKAVLLPPLNTTAIVSIHDEQWNARRSSLASHKEADATICVTTKAW RRRRQKRFCGLGRGSLCCVQPRDLVPCVLATPVVAESGQCKTQTMASESRMHENTRMPRQ KFAAGAVSSWRTSTRAVWKGNVGSESPHRVHTGALPSGAVRRGPPSSRPQNGRSTNSLHH APGKAADTQCQSMKAASHA >gi568815597f:170564219_170836183|GENSCAN_predicted_CDS_10|1140_bp atgtccctgtttgcagatgacatgattttatctgtagaaaaccccatcgtctcaccccaa aatctccttaagctgataagcaacttcagcaaagtcttaggatacaaaatcaatgtgcaa aaatcacaagcattcctatacaccaataacagacaaagagggagccaaatcatgagtgaa ctcccattcacaattgctacaaagagaataaaatacctgggaatccaacttacaagggat gtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaagaggacaaa aacaaatggaagaacattccatgctcatggataggaagaataaatatcatgaaaatggcc atacagcccaaggtaatttatagattcaatgccatccccatcaaactaccagtgactttc ttcacagaattggaaaaaactactttgaggtccatatggaaccaaaaaagagcccacata gccaagacaatcctaagtaaaaagaacaaagctggaggcatcacgctacctgacttcaaa ctatactacaaggctacagtaaccaaaacagcatggtgttttagcaaagcagtgctgctg ccacctctcaacaccactgctatagtgtctatccacgatgaacaatggaatgctagaagg tcttctcttgctagccacaaagaagcagatgccaccatctgtgtgaccaccaaggcctgg agacgtaggaggcaaaaacggttttgtggactgggccgagggtccctgtgctgtgtgcag cctagggacttggtgccctgtgtcctagccactccagttgtggctgaaagtggccaatgt aaaactcagaccatggcttcagagagtaggatgcatgaaaatacccggatgcctaggcag aagtttgctgcaggggcagtgtcctcatggagaacctccactagggcagtgtggaaggga aatgtgggatcagaatccccacacagagtccatactggggcactgcctagtggagctgtg agaagagggccaccatcttccagacctcagaatggtagatccaccaacagcttgcaccat gcacctggaaaagctgcagacactcaatgccagtccatgaaagcagctagtcatgcttag