GENSCAN 1.0    Date run: 4-Nov-116    Time: 17:59:40

Sequence gi568815585r:94611222_94812049 : 200828 bp : 40.07% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +    353    653  301  1  1   92  -57   248 0.792   6.93
 1.02 Intr +    969   1169  201  1  0   49   99   198 0.743  15.34
 1.03 Intr +   7929   8109  181  1  1  117   93     2 0.505   1.60
 1.04 Intr +   8247   8296   50  0  2   80   82    34 0.400  -0.59
 1.05 Intr +   9883  10014  132  2  0   44   94    57 0.557   1.60
 1.06 Term +  13039  13271  233  2  2   27   39   236 0.724   8.25
 1.07 PlyA +  15726  15731    6                               1.05

 2.03 PlyA -  16505  16500    6                               1.05
 2.02 Term -  19966  19657  310  0  1   75   44   225 0.994  10.35
 2.01 Init -  26651  26503  149  2  2   83   75    50 0.844   2.81
 2.00 Prom -  26828  26789   40                              -4.75

 3.03 PlyA -  27194  27189    6                               1.05
 3.02 Term -  30502  30249  254  2  2   76   49   206 0.792  10.32
 3.01 Init -  44236  44176   61  1  1  104   57    21 0.355   2.07
 3.00 Prom -  59363  59324   40                              -4.65

 4.06 PlyA -  59661  59656    6                               1.05
 4.05 Term -  69941  69838  104  0  2   95   48    86 0.244   2.76
 4.04 Intr -  70764  70644  121  0  1   34  113    16 0.323  -2.15
 4.03 Intr -  77032  76916  117  1  0   81   72    87 0.833   6.14
 4.02 Intr -  79701  79522  180  0  0  -24   62   162 0.339   1.54
 4.01 Init -  81240  81175   66  0  0   62   88     5 0.285  -0.98
 4.00 Prom -  84851  84812   40                              -2.95

 5.03 PlyA -  87054  87049    6                               1.05
 5.02 Term -  88668  88478  191  1  2  129   44   162 0.985  12.53
 5.01 Init -  89381  89324   58  2  1   19   89   104 0.709   5.12
 5.00 Prom -  89751  89712   40                              -8.95

 6.00 Prom +  89762  89801   40                             -13.30
 6.01 Init +  90509  90994  486  1  0   89   27   303 0.457  17.63
 6.02 Intr +  91458  91759  302  2  2   -2   86   293 0.545  14.71
 6.03 Intr +  92246  92364  119  2  2  118  100    88 0.958  12.19
 6.04 Term +  94219  94499  281  2  2    7   48   240 0.040   6.52
 6.05 PlyA +  94761  94766    6                              -0.45

 7.00 Prom +  95307  95346   40                              -4.05
 7.01 Sngl +  96789  97193  405  2  0   73   39   270 0.998  16.63
 7.02 PlyA +  97946  97951    6                               1.05

 8.08 PlyA -  98429  98424    6                               1.05
 8.07 Term - 100919  99998  922  1  1  -66   46  1261 0.588  97.17
 8.06 Intr - 101391 101112  280  1  1  139   50   178 0.630  14.71
 8.05 Intr - 102260 102187   74  1  2  111   68    59 0.905   4.33
 8.04 Intr - 102595 102407  189  0  0   93    6   188 0.817   8.88
 8.03 Intr - 102753 102638  116  0  2   73   96    57 0.610   3.33
 8.02 Intr - 110787 110695   93  0  0   -6  116    80 0.002   0.64
 8.01 Init - 127137 126946  192  0  0   69   91    92 0.648   6.61
 8.00 Prom - 135783 135744   40                              -3.05

 9.09 PlyA - 135819 135814    6                               1.05
 9.08 Term - 147703 147160  544  0  1   -5   49   406 0.352  20.15
 9.07 Intr - 148105 148002  104  1  2  -65   63   106 0.020  -9.15
 9.06 Intr - 152523 152409  115  2  1   81  106    86 0.233   9.23
 9.05 Intr - 187368 187272   97  1  1   27   54   153 0.091   3.95
 9.04 Intr - 188284 188161  124  1  1   23   64    74 0.807  -2.26
 9.03 Intr - 188883 188639  245  1  2  123   68   110 0.567   8.79
 9.02 Intr - 189653 189604   50  1  2  103   87   -25 0.427  -3.69
 9.01 Init - 191336 191266   71  2  2   83   65    91 0.870   6.87

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 114256 114456  201  0  0   50   45   136 0.863   1.91

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815585r:94611222_94812049|GENSCAN_predicted_peptide_1|365_aa
MVARLGTFLKNAWAREPVLVVSFIIGSLAVILPPLSPYTKYFIMIYETMPYNYPLPVRND
GNMTNVPGHPQDPQSSSLEWLKKLSTSTDRGGPSHRSQYELTMNQTEHNLTVSQIPSPQT
WHVFYADKYTCQDDKENSQVEDIPFEMVLLNPDAEGNPFDHFSAGESGLHEFFFLLVLVY
FVIACIYAQSLWQAIKKGGPMHMILKVLTTALLLQAGSALANYIHFSSYSKDGIGVPFMG
SLAEYVILTFESMHGLDNSQNEEVSKQTSPVGFYACIHWHCSIHCHDTALRTSILIRLIL
FRQPTLSRVIGRYSEAAMEGEPDSCFIPSVEHCKAAAAAALRTNFRLLSARQEDVILPLS
FSEGQ
>gi568815585r:94611222_94812049|GENSCAN_predicted_CDS_1|1098_bp
atggtggcgagacttggcactttcctcaagaatgcctgggccagggagccagtgctggtt
gtgtccttcatcattggtagcctagctgtaattctgcccccactcagcccctacaccaag
tacttcatcatgatctacgagaccatgccctacaactacccattgcccgtccgcaatgat
gggaacatgaccaatgtgcccggccacccccaggatccccagagctccagcttggagtgg
ctgaagaaactgagcacctccactgacagaggaggcccctcccatcgctcccaatacgaa
ttgaccatgaaccagaccgaacataatctgacagtgtcccagattccgtctccacaaacg
tggcatgtgttttatgcagacaagtatacatgccaagatgacaaggagaattctcaggtg
gaagatatcccatttgaaatggtgttactaaacccagatgccgaagggaatccatttgat
cattttagtgctggagaatctgggttacatgagttctttttcctcctagtcctagtgtac
tttgtgattgcttgcatttatgctcaatcattgtggcaggctattaagaaaggcggaccc
atgcacatgattttaaaggttctgacaactgcattgctgttacaagctggttcagcttta
gctaattacattcatttctccagttactccaaagatggaataggggtaccatttatggga
agtttggcagaatatgttatacttacttttgagtctatgcatgggttggacaatagtcag
aatgaagaagtctcaaagcagacctctccagtgggattctacgcctgcatccactggcat
tgcagtattcattgtcatgacacagccctgcgaaccagcattctgatcagactgattctg
ttcaggcagcccaccttgtccagagttattggccgttattctgaggcagctatggaagga
gagccagattcctgctttattcctagtgttgagcattgcaaagctgctgctgctgcagcc
ctaagaaccaacttcaggcttctgagcgcaagacaagaagatgtcatcctgcctttgagt
ttttcagagggacagtag
>gi568815585r:94611222_94812049|GENSCAN_predicted_peptide_2|152_aa
MDEAGNHHSQQTASRTENQTPHVLTHRWKLNNENTWTQKGEHHTPRPVVSVNDGVVQQKL
APFVRVKRTPGPGAREAEGAIPAGTGVPVGLAVVPQAKRSQQISNNVDKLQQCLVPCALS
GPSSYKKNADLAATLLLPSISCIKHFLWPTLS
>gi568815585r:94611222_94812049|GENSCAN_predicted_CDS_2|459_bp
atggatgaagctggaaaccatcattctcagcaaactgcctcaaggacagaaaaccaaaca
ccgcatgttctcactcataggtggaaattgaacaatgagaacacttggacacagaaaggg
gaacatcacacaccgagacctgtcgtgagtgtgaatgatggagttgtccagcagaagctg
gcaccctttgtcagagtaaaacgcacacctggcccaggagccagagaagctgaaggagcc
attccggcaggaactggtgttcctgtgggcctggctgttgtcccacaggccaagagaagc
caacagataagcaataatgtggataagctgcaacagtgccttgtgccttgtgccttgtct
ggcccttcctcttacaaaaagaatgcagatctggctgctactttacttctgccttctata
tcttgtatcaaacatttcttatggcctacattgtcctga
>gi568815585r:94611222_94812049|GENSCAN_predicted_peptide_3|104_aa
MGNLPMEHPPCTYTILLPFVVFTRANVFEIMQFRHVTAVEIQHKLELTVVYHQESEKLKQ
HYDKEEKYEKYGEDKGQEILPYRLKAEPWRETSGLEQTVILLAH
>gi568815585r:94611222_94812049|GENSCAN_predicted_CDS_3|315_bp
atggggaacctgcccatggagcatccaccctgcacatataccattttattaccctttgtg
gttttcactagagccaatgttttcgagataatgcagtttagacatgtcacagctgttgaa
attcagcataagcttgagctgacagtggtgtatcatcaagaatcagaaaaactaaaacaa
cattatgataaagaggagaagtatgaaaaatacggagaagacaaagggcaggaaatattg
ccttaccgacttaaagcagagccctggagggaaacatctggtttggagcaaacagtgatt
cttttagcacactga
>gi568815585r:94611222_94812049|GENSCAN_predicted_peptide_4|195_aa
MTSHLTQCKSTSSQCPHALHWKTDNNVNFRSSFYGSPNPPLCDVYPLIRALLLGAVENET
TLRSKQLLKPVIMFPLNLLFLKVSVDIMGPAEYCLSRVEFFSVLKSLAAWIELHQKWGAR
RSDQKQNKRKSTSRHWHKVTFTCKEAVPGDCQSGLSKVPSQAPSVLKPRKAECFPRIVDF
LLVFSVQCVTTAEVE
>gi568815585r:94611222_94812049|GENSCAN_predicted_CDS_4|588_bp
atgacttctcatcttactcagtgcaaatctacatcttcacaatgcccacatgccctacac
tggaagacagacaataatgtgaattttcggtcttccttttacgggagcccaaatccacct
ctctgtgacgtttaccctttaatacgagctctgctcttgggagcagtggagaatgaaacg
actctccgttccaaacagcttttgaagccagttatcatgttccctctcaatcttctcttt
ctaaaggtttctgtggacataatgggtcctgcggagtattgcctctcaagagttgagttt
ttcagtgttctgaagtctctggctgcatggatagaacttcatcagaagtggggtgcaagg
aggtctgaccaaaaacaaaacaaaagaaagtctacctcacggcactggcataaagtcacc
ttcacatgcaaggaggctgtgcctggggactgccagtctggcctttctaaggtgccctca
caagctccttccgtcctgaaacccagaaaagcagaatgctttccacgtattgttgacttt
ctccttgtgttttctgttcagtgtgtgacaacagcagaggtagaatga
>gi568815585r:94611222_94812049|GENSCAN_predicted_peptide_5|82_aa
MFEVNDFRKMSEIHAGKSSALVLREASCSMERPTGQELMYLAKGQQRLEACQQLSEKVTR
SNLEMTTAPGGTLIAALEETLS
>gi568815585r:94611222_94812049|GENSCAN_predicted_CDS_5|249_bp
atgtttgaagtcaatgactttcgaaaaatgagtgaaatccacgcaggaaaatccagcgct
cttgttctaagggaagcaagctgctccatggaaagacccacagggcaagagctgatgtat
ctggccaaaggacagcaaagacttgaggcctgtcagcagctaagtgaaaaagtcacccga
tcaaaccttgagatgactacagccccaggtggcaccttgattgcagctttggaagagacc
ctgagctag
>gi568815585r:94611222_94812049|GENSCAN_predicted_peptide_6|395_aa
MHVFCEALPTFPALLCAPHSPERAVLGILRSRLLRLVVWAEVAAQGGTWGLVGFAVAGVC
RLEEGLKAHGSLIPPSLRLLAETLSCAGVGGHPIPPRPRPVTQCVHMRAERQRPGGHGWG
RVVKLPKPTHTLSFDTSRRFQRRRHGGFRETSRKGDFSKREMVDPQSRPHLERAASQFRL
PAPQRSSGSGEPSRRALMFIHAALSSRAPIHINQPLDLHKSKFTGENLVPEKEEQQSGQA
VGAFVPPAPSPTSPKLGSCCEGGEELCGGVSVGNPGVLRRLLIFRQSRHSSALGVPSAGE
PEVRSSADNDWKVRRLGGPATRRPGEQPSKIKQLLSLAAKASMDASRAAQGTKGHPRSLR
VRALPRGGQEVLCPMRFSQGQGAGRGKGQEPRRET
>gi568815585r:94611222_94812049|GENSCAN_predicted_CDS_6|1188_bp
atgcatgtcttttgcgaagcccttcccaccttcccagcgctgctctgtgccccgcacagc
ccggagcgcgcagtcctggggatactaagaagccgactacttagactcgtggtgtgggcg
gaggtggcagcccaaggtgggacgtggggccttgttggttttgcagtagcaggagtctgc
aggcttgaggaaggccttaaggcccacggaagcctcatcccgccaagccttcgcctcctc
gctgagactctgagctgcgctggggttggcgggcacccgattccgccccggcccagaccg
gtcactcagtgtgtgcatatgagagcggagagacagcgacctggaggccatgggtggggg
cgggtggtgaagctgccgaagcctacacatacacttagctttgacacttctcgtaggttc
caaagacgaagacacggtggcttcagggagacaagtcgcaagggcgacttttccaagcgg
gagatggtggacccgcagtcccgcccgcatctggaacgagctgcttcgcagttccggctc
ccggcgccccagagaagttcggggagcggtgagcctagccgccgcgcgctcatgtttatt
cacgcggccttgagcagccgagctccaatccatattaatcaaccgctcgacctacacaag
tctaagtttacgggagaaaacctagtccccgaaaaggaagaacagcaatccggacaagca
gttggcgcctttgtcccgccagccccttctccaaccagcccaaagttgggcagctgctgc
gaaggaggggaggaactttgcggaggggtcagtgttgggaaccctggggtcctccgccgc
ctcctgatcttccgccagtcccgccactcctccgccctgggcgtgccttccgcaggtgaa
cctgaagttaggagcagcgctgataacgactggaaagttcgacggctgggtgggcccgcg
acccgcaggcctggcgagcaaccatcgaaaataaaacaattgctgtccttggcggcaaag
gcctctatggacgcctccagggcagcgcagggaaccaaagggcaccccagatctctgcgg
gttcgcgccttgcctcgtgggggacaggaggttctctgtcccatgaggttcagtcaagga
caaggagccgggaggggaaaggggcaggagccaaggagagaaacctga
>gi568815585r:94611222_94812049|GENSCAN_predicted_peptide_7|134_aa
MEIEIASQQQLSAQSQRCICGPLSDAYTTHPRAFLYALLPRSRSPRSNPLLLHSLTWQEA
LCRRKTRQLVSKTIGVRMGDKRVCGACGGPQTAAQSVGDMTSVLPLGILTPPLTHLSSPL
LIHQQLRSVSPRKR
>gi568815585r:94611222_94812049|GENSCAN_predicted_CDS_7|405_bp
atggagatagagatcgcgagccagcagcagctctctgcccaatcacagcgctgcatttgt
gggccactttcagacgcttacaccacccacccccgggcctttctctacgccctcttgccc
cggagccgctccccccgctccaaccccttacttctgcacagtctaacttggcaggaggct
ctctgcaggcgcaaaacccgccaacttgttagcaagaccattggagtcagaatgggtgat
aaacgagtctgtggggcctgtggaggtcctcaaacagccgcacaatctgttggcgatatg
acctcggtgctccccctcggcatcctcacccctcctcttacccatctttccagtcctctt
ctcattcatcaacaactccggagcgtttctcccagaaaacgctga
>gi568815585r:94611222_94812049|GENSCAN_predicted_peptide_8|621_aa
MKEGRKEGRKEGRKEGRKEGRKEGRKEGKRKRKKERKKERKKERKKERERKKERERKKEI
YYILLCRTLIRKKTKTPTITNSVQQQTEESRQCSKVHRQCKKMREASVSVASNTWTSKEN
EHFLARNSLKHFQGWRGSPTRARRLLPPGEDSRATTRAAARQAGLGSRDDTRVCQRQTGR
RLPGALRTLHSPKPALKPPDIHTQPRLDMHTQCRLRARTHAGPPPRLPHPHLLLSNEKVT
EENPNTPAAESPLWHLAARGGGLLGSTSRSLRDATFGDALHLRENDRGGGALCPAAATCG
ELTSKRHCRRRTQRTGSPRAANIDFLRAEGEGPGGGGLQPRQGESMSKPVDHVKRPMNAF
MVWSRAQRRKMAQENPKMHNSEISKRLGAEWKLLTESEKRPFIDEAKRLRAMHMKEHPDY
KYRPRRKPKTLLKKDKFAFPVPYGLGGVADAEHPALKAGAGLHAGAGGGLVPESLLANPE
KAAAAAAAAAARVFFPQSAAAAAAAAAAAAAGSPYSLLDLGSKMAEISSSSSGLPYASSL
GYPTAGAGAFHGAAAAAAAAAAAAGGHTHSHPSPGNPGYMIPCNCSAWPSPGLQPPLAYI
LLPGMGKPQLDPYPAAYAAAL
>gi568815585r:94611222_94812049|GENSCAN_predicted_CDS_8|1866_bp
atgaaagaaggaaggaaggaaggaaggaaggaaggaaggaaggaaggaaggaaggaagga
aggaaggaaggaaggaaggaaggaaaaagaaaaagaaagaaagaaagaaagaaagaaaga
aagaaagaaagaaagaaagaaagagaaagaaagaaagaaagggaaagaaagaaagagatt
tactacatcttgctatgccgtacgttgatcaggaaaaagacaaagacacctaccatcacg
aattctgttcagcaacaaactgaagaatctagacagtgcagtaaggtgcatcgccagtgc
aagaaaatgcgagaagcctctgtttctgttgcaagcaacacctggacctccaaggaaaat
gagcactttcttgctaggaacagtctcaaacattttcaaggctggagaggctctcctact
cgtgcccgccgccttcttcctcccggcgaggattccagggccacaaccagggcagctgca
cgccaggccggcctgggtagccgggatgacaccagggtttgccaacgccaaacagggcgc
agactccccggcgcgctccggactctccactcgcctaaacctgctctcaagcccccagat
atacacacacagccgcggctggacatgcacacccagtgccggctccgggcgcgcacacat
gcagggccacctccccgccttccccacccccatctgcttctgtcaaatgagaaagtcacc
gaggagaacccaaacactccagccgctgagagccccctttggcacttggcagcacgcggc
ggcgggctcctcggctcaacttcgaggagtctccgcgacgcaacttttggggacgctttg
catttaagagagaacgaccgaggaggaggagcgctctgcccggccgccgctacctgcggg
gagctcaccagcaaacgccactgcagacgaaggacccaaagaaccgggtctccgcgggca
gccaacattgatttcctccgggccgagggcgagggcccgggaggcggcgggctgcagccg
cggcagggcgagagcatgtccaagccggtggaccacgtcaagcggcccatgaacgccttc
atggtgtggtcgcgggctcagcggcgcaagatggcccaggagaaccccaagatgcacaac
tcggagatcagcaagcgcttgggcgccgagtggaaactgctcacagagtcggagaagcgg
ccgttcatcgacgaggccaagcgtctacgcgccatgcacatgaaggagcaccccgactac
aagtaccggccgcggcgcaagcccaagacgctgctcaagaaggacaagttcgccttcccg
gtgccctacggcctgggcggcgtggcggacgccgagcaccctgcgctcaaggcgggcgcc
gggctgcacgcgggggcgggcggcggcctggtgcctgagtcgctgctcgccaatcccgag
aaggcggccgcggccgccgccgctgccgccgcacgcgtcttcttcccgcagtcggccgct
gccgccgccgctgccgccgccgccgccgccgcgggcagcccctactcgctgctcgacctg
ggctccaaaatggcagagatctcgtcgtcctcgtccggcctcccgtacgcgtcgtcgctg
ggctacccgaccgcgggcgcgggcgccttccacggcgcggcggcggcggctgcagcggcg
gccgccgccgccggggggcacacgcactcgcaccccagcccgggcaacccgggctacatg
atcccgtgcaactgcagcgcgtggcccagccccgggctgcagccgccgctcgcctacatc
ctgctgccgggcatgggcaagccccagctggacccctaccccgcggcctacgctgccgcg
ctatga
>gi568815585r:94611222_94812049|GENSCAN_predicted_peptide_9|449_aa
MHGHTGCDDVVNMMMMVSCSASWRKEHREKCLSKVIMNSIDFLLFSFLENLLQKSASLPF
MGTQTPLKSDPNYGLTLRNMSIHSKPCRNFQKVQEAFETHVRPPIKNLRMFSNDTEQEKI
RQFYADSGGSYRYLRNSYPHVWAVATTKFVLPRAYDWLAPDSHEAEKLTVVHVEVQPSAE
WVFSSSHEEPPLEGVWAWVSHTMVAGNIPSRPVPMYKQLTEPLLALFANDPSAKLRCVPA
PGSGGGHRGQLDRGKKHKERKSDKHFYEGSLAKQEEVEQTPLQEALNQLMRQLQRKDPSA
FFSFPVTDFIAPGYSMIIKHPRDFSTMKEKVKNNDYQSIELKDNFKLMCTNAMIYNKPET
IYYTAAKKLLHWGMTILSQERIQSLKQSIDFMADLQKTRKQKDRTDWGGWRQSGEDEGCW
PRERGLWRCRSTSLPESRQRKLKYRQRYA
>gi568815585r:94611222_94812049|GENSCAN_predicted_CDS_9|1350_bp
atgcacggtcacactggctgcgacgatgtggttaatatgatgatgatggtgtcatgcagt
gcaagctggaggaaggaacatagggagaaatgcctctcaaaagtaatcatgaattctatt
gattttctgctcttctctttcctggaaaatttgttgcaaaagagcgcttctttgccattt
atggggacacaaacccctttgaaatctgatccaaactatggactcactctcagaaacatg
agcatacactcaaaaccttgcagaaattttcagaaggtccaggaagcctttgaaactcat
gtaagacccccaattaagaacctccgaatgttctccaatgacactgaacaagaaaaaata
agacagttttatgctgactctggaggaagttacagatatttgagaaattcttatcctcat
gtttgggcagttgccacaacaaaatttgttctgcctcgtgcctatgattggcttgcccct
gatagccatgaagctgaaaaacttacagtggttcacgtggaagttcagccaagtgctgag
tgggtcttctcttctagccatgaagaaccgccgttagaaggagtgtgggcttgggtttct
catactatggtggctgggaacattccaagcagaccagtcccaatgtacaagcagcttact
gagcctctgcttgcactgtttgccaatgacccatcagccaaactgcgctgcgtgcctgct
cctggctctgggggcgggcaccggggccagctggacaggggcaagaagcacaaggagcgc
aagtcagacaaacacttctatgagggctctttagccaaacaagaagaagtagaacagaca
ccccttcaagaagctttgaatcaactgatgagacaattgcagagaaaagacccaagtgct
ttcttttcatttcctgtgactgattttattgctcctggctactccatgatcattaaacac
ccaagggattttagcactatgaaagaaaaggtcaagaacaatgactaccagtcgatagaa
ctaaaggataacttcaaactaatgtgcactaatgccatgatttacaacaaaccagagacc
atttattatacagctgcaaagaagctgttgcactgggggatgactattcttagccaggaa
agaattcagagcctgaagcagagcatagacttcatggctgacttgcagaaaactcgaaag
cagaaagatagaacagactggggaggatggaggcagagtggggaggacgaaggctgctgg
cccagggagagaggactctggagatgccgaagcacaagccttccagagtcccggcaaaga
aaattaaaatatagacaaagatatgcttga