GENSCAN 1.0 Date run: 4-Nov-116 Time: 03:01:53 Sequence gi568815585r:36748679_36979689 : 231011 bp : 40.24% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1989 2042 54 2 0 72 72 55 0.767 3.63 1.02 Intr + 6623 6746 124 1 1 79 96 169 0.852 16.04 1.03 Intr + 29220 29341 122 1 2 59 79 69 0.099 2.39 1.04 Term + 32829 32933 105 2 0 57 41 120 0.247 1.63 1.05 PlyA + 34400 34405 6 1.05 2.00 Prom + 44836 44875 40 -4.65 2.01 Init + 45806 46055 250 1 1 95 10 223 0.978 12.87 2.02 Term + 46104 46372 269 1 2 73 48 189 0.965 8.07 2.03 PlyA + 48854 48859 6 1.05 3.00 Prom + 60195 60234 40 -4.75 3.01 Init + 70680 71279 600 2 0 93 54 535 0.080 43.93 3.02 Intr + 79339 79482 144 0 0 60 64 72 0.251 1.56 3.03 Intr + 87383 87497 115 1 1 70 29 85 0.052 -0.10 3.04 Intr + 91917 92096 180 1 0 49 66 110 0.008 3.82 3.05 Term + 95134 95291 158 2 2 32 48 129 0.020 0.41 3.06 PlyA + 96577 96582 6 1.05 4.08 PlyA - 96924 96919 6 1.05 4.07 Term - 100141 99998 144 1 0 114 42 123 0.996 7.23 4.06 Intr - 104997 104741 257 1 2 68 116 254 0.223 22.44 4.05 Intr - 117080 116859 222 0 0 98 90 125 0.460 10.88 4.04 Intr - 118705 118595 111 2 0 72 84 37 0.210 1.13 4.03 Intr - 124068 123976 93 1 0 77 52 90 0.264 3.32 4.02 Intr - 130967 130600 368 1 2 84 94 242 0.690 18.26 4.01 Init - 131853 131837 17 0 2 57 69 38 0.712 -1.46 4.00 Prom - 132495 132456 40 -8.45 5.00 Prom + 133277 133316 40 -5.15 5.01 Init + 139067 139252 186 1 0 74 97 95 0.523 8.10 5.02 Term + 145989 146081 93 2 0 155 51 46 0.573 4.45 5.03 PlyA + 147185 147190 6 1.05 6.00 Prom + 152099 152138 40 -6.85 6.01 Init + 159187 159237 51 0 0 33 93 86 0.770 4.81 6.02 Intr + 168052 168252 201 0 0 26 63 179 0.614 7.76 6.03 Intr + 168291 168586 296 2 2 29 96 124 0.412 2.18 6.04 Intr + 170943 171049 107 0 2 84 36 49 0.265 -1.76 6.05 Intr + 171172 171369 198 2 0 74 57 162 0.915 10.10 6.06 Intr + 171459 171589 131 1 2 45 0 278 0.364 14.19 6.07 Term + 171824 172057 234 1 0 91 49 82 0.460 -0.06 6.08 PlyA + 172322 172327 6 1.05 7.02 PlyA - 172927 172922 6 1.05 7.01 Sngl - 178107 177907 201 0 0 65 39 215 0.262 9.23 7.00 Prom - 185524 185485 40 -6.85 8.04 PlyA - 185809 185804 6 1.05 8.03 Term - 189968 189703 266 0 2 -34 53 345 0.128 13.49 8.02 Intr - 190448 190123 326 1 2 66 30 273 0.160 13.79 8.01 Init - 190806 190676 131 0 2 61 47 175 0.988 10.37 8.00 Prom - 193215 193176 40 -7.05 9.02 PlyA - 193533 193528 6 1.05 9.01 Sngl - 195804 195571 234 0 0 52 44 274 0.641 14.25 9.00 Prom - 198354 198315 40 -6.55 10.00 Prom + 201314 201353 40 -3.25 10.01 Init + 206665 206713 49 0 1 45 103 19 0.634 0.26 10.02 Intr + 208963 209183 221 2 2 63 22 208 0.851 8.80 10.03 Intr + 209340 209419 80 2 2 22 56 101 0.730 -2.27 10.04 Term + 209732 209888 157 2 1 86 46 194 0.818 11.42 10.05 PlyA + 210309 210314 6 1.05 11.03 PlyA - 211393 211388 6 1.05 11.02 Term - 217048 216893 156 1 0 89 39 66 0.068 -1.25 11.01 Intr - 223358 223299 60 2 0 89 94 43 0.180 3.01 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 70680 71336 657 2 0 93 42 542 0.887 44.02 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815585r:36748679_36979689|GENSCAN_predicted_peptide_1|134_aa MNHTQNLPILQISDLHDKTQTKPHVQTLGEASAKVKLDMQARDADLPLIEREREKERREG DNNFYSEIGQGGQLEAAAVHNTHRGIKGVSEYSTFNQNNQVEAAKPPSLLHSECLQAQQD VEAAKVYGLCPPEW >gi568815585r:36748679_36979689|GENSCAN_predicted_CDS_1|405_bp atgaatcatacccagaatctccccatcctgcagatcagtgacttacatgacaagactcaa acgaagccccatgtccagaccctcggagaagcaagtgccaaggtgaaattggacatgcaa gcccgtgatgcagatctgcccctgattgaaagagaaagagaaaaagaaagaagggaggga gataataatttctattcagagataggccaaggtggccaattagaagcagctgcggtccac aacactcacagaggaataaaaggggtgagtgaatacagcaccttcaaccaaaataatcag gtggaggcagccaagccgccttcactcttgcactctgaatgtctgcaggctcaacaggat gtagaagctgccaaggtttatggcttgtgccctccagagtggtag >gi568815585r:36748679_36979689|GENSCAN_predicted_peptide_2|172_aa MGALEEGFIVPFSSDSDAHFDAAVGYLEDIIMDEDFQLLQRNFINNYYQEFEDTKENKLT YTPIFNEYISLAGKYIEEQYIEERKYIEEQLLEQILGFTMATFTTLQHHKDEVVGDILQM LLKFTDFLAFKEMFLDCRAEKEGQRLDLSRGLVETSLCKSSSMTASQNNLQQ >gi568815585r:36748679_36979689|GENSCAN_predicted_CDS_2|519_bp atgggtgctttagaagaaggcttcattgtgcccttctcctctgactctgatgcacatttt gatgctgcggttggatatttagaggacattatcatggatgaagatttccagttattacag agaaatttcatcaacaactactaccaggagtttgaggacaccaaagagaataaactcacc tacacacctatttttaatgaatatatttctttggcaggaaagtatatagaagaacagtat atagaagaaagaaagtatatagaagaacagttgctggagcaaattcttggctttaccatg gcaactttcacaacgttacagcaccacaaagatgaagtggttggtgacatactccaaatg ctgctcaaatttacagattttctggcttttaaagaaatgtttctggactgcagagcagaa aaagaaggccagagactggacttaagccgtggtttagtggagacttcattgtgcaaatca tcttctatgacagcttctcagaacaatctgcagcagtag >gi568815585r:36748679_36979689|GENSCAN_predicted_peptide_3|398_aa MEAQGVAEGAGPGAASGVPHPAALAPAAAPTLAPASVAAAASQFTLLVMQPCAGQDEAAA PGGSVGAGKPVRYLCEGAGDGEEEAGEDEADLLDTSDPPGGGESAASLEDLEDEETHSGG EGSSGGARRRGSGGGSMSKTCTYEGCSETTSQVAKQRKPWMCKKHRNKMYKDKYKKKKSD QALNCGGTASTGSAGNVKLESLLLKVMKQLLSVEVIKLIDINLDSSSSVVLIKGQFDCHS DLENHWLLSFTLVAEKDVACLKMYSISTDADFYELFVAWALEVPRVGLVDVVLCGKRNFA EVLKLRILRWRDQPELSGWALNVITRVLIRGTLAYRRTRDVMMEATGPDVDVTRWSQQLV TADSTLLIIGLLVHLSPQNGKLLKSRNFVIVSTVPADA >gi568815585r:36748679_36979689|GENSCAN_predicted_CDS_3|1197_bp atggaggcgcagggtgtagcggagggcgcggggccgggcgccgccagcggcgtgccccac cccgcggccctagccccggctgcggctcccaccttggcgccagcctcggtggcggccgcg gcctctcaattcaccctgctagtgatgcaaccctgtgctgggcaggacgaggctgcggcc cccgggggcagcgttggggcgggcaagcccgttaggtacctgtgcgaaggggccggggat ggcgaagaggaggctggggaggacgaggcggacctgttagacacttcggaccctccgggg ggaggcgagagcgcggctagtttggaggatctagaggacgaggagactcactcggggggc gagggcagcagcgggggcgcccggaggcggggcagcggtgggggcagcatgagcaagacc tgcacctacgaaggctgcagcgagaccacgagccaggtggccaagcagcgcaaaccgtgg atgtgcaagaaacaccgcaacaagatgtacaaggacaagtataaaaagaagaagagcgac caggccctgaactgcggtgggactgcctcgactggcagcgcgggaaacgtcaaactcgag tcattgttactgaaggtaatgaagcagttactttctgtggaagtcataaagttaatagat attaatcttgactcatctagctcagtggttctcatcaagggtcaatttgattgtcatagt gaccttgaaaaccactggcttttaagctttaccctcgtagcagaaaaggatgtagcctgc cttaagatgtattccataagcacagatgctgatttttatgagctatttgtagcttgggct ttagaggttccaagagtaggacttgttgatgttgtcttatgtggcaaaaggaactttgca gaggtgcttaagttgaggattctgagatggagagatcaacctgaattatctggatgggct ctaaatgtaatcacacgtgtccttataagaggaacccttgcttacagaagaacaagagat gtgatgatggaagcaacagggcctgatgtagatgtcactcggtggtcccagcagcttgtc acagctgacagcacactgcttattattggtttacttgttcacctttctccacaaaatggt aagctccttaaaagtagaaactttgtcattgtatctacagtgcctgctgatgcataa >gi568815585r:36748679_36979689|GENSCAN_predicted_peptide_4|403_aa MAVQECPAVKRLLGWKQGDEEEKWAEKAVDSLVKKLKKKKGAMDELERALSCPGQPSKCV TIPRSLDGRLQVSHRKGLPHVIYCRVWRWPDLQSHHELKPLECCEFPFGSKQKEVCINPY HYRRVETPATRSPSPRARPATLTPQEVLLSQRVPINTQVIDTPPLPYHATEASETQSGQP VDATADRHVVLSIPNGDFRPVCYEEPQHWCSVAYYELNNRVGETFQASSRSVLIDGFTDP SNNRNRFCLGLLSNVNRNSTIENTRRHIGKGVHLYYVGGEVYAECVSDSSIFVQSRNCNY QHGFHPATVCKIPSGCSLKVFNNQLFAQLLAQSVHHGFEVVYELTKMCTIRMSFVKGWGA EYHRQDVTSTPCWIEIHLHGPLQWLDKVLTQMGSPHNPISSVS >gi568815585r:36748679_36979689|GENSCAN_predicted_CDS_4|1212_bp atggctgtccaggagtgccccgcagtgaagagactgctaggctggaagcaaggagatgaa gaggaaaagtgggcagagaaggcagtggactctctagtgaagaagttaaagaagaagaag ggagccatggacgagctggagagggctctcagctgcccggggcagcccagcaaatgcgtc acgattccccgctccctggacgggcggctgcaggtgtcccaccgcaagggcctgccccat gtgatttactgtcgcgtgtggcgctggccggatctgcagtcccaccacgagctgaagccg ctggagtgctgtgagttcccatttggctccaagcagaaagaagtgtgcattaacccttac cactaccgccgggtggagactccagccacgcgttctcccagtccccgtgcacggccagct accctcactccccaggaagtccttctgagccagagagtccctatcaacactcaggtcatt gacacaccacccctgccttatcatgccacagaagcctctgagacccagagtggccaacct gtagatgccacagctgatagacatgtagtgctatcgataccaaatggagactttcgacca gtttgttacgaggagccccagcactggtgctcggtcgcctactatgaactgaacaaccga gttggggagacattccaggcttcctcccgaagtgtgctcatagatgggttcaccgaccct tcaaataacaggaacagattctgtcttggacttctttctaatgtaaacagaaactcaacg atagaaaataccaggagacatataggaaagggtgtgcacttgtactacgtcgggggagag gtgtatgccgagtgcgtgagtgacagcagcatctttgtgcagagccggaactgcaactat caacacggcttccacccagctaccgtctgcaagatccccagcggctgcagcctcaaggtc ttcaacaaccagctcttcgctcagctcctggcccagtcagttcaccacggctttgaagtc gtgtatgaactgaccaagatgtgtactatccggatgagttttgttaagggttggggtgct gagtatcatcgccaggatgtcaccagcaccccctgctggattgagattcatcttcatggg ccactgcagtggctggacaaagttctgactcagatgggctctccacataaccccatttct tcagtgtcttaa >gi568815585r:36748679_36979689|GENSCAN_predicted_peptide_5|92_aa MHVAKDRSGTTEGKRSKREQIQAGLSIWSQDDEGVPSENSKVAEKLVRQAAGGVTDLRSK EKVQSPPWSPPDPLPDALYLSYVTYPVWHHAD >gi568815585r:36748679_36979689|GENSCAN_predicted_CDS_5|279_bp atgcatgtggcaaaagatagaagtgggacaacagaagggaagaggagtaaacgggagcag atacaggcaggtttgtctatctggtcacaggatgatgagggtgttccctctgaaaactcc aaggttgcagagaagttagtgagacaggcagcaggaggggtgacagatttgaggagcaag gaaaaggtccagtcacctccatggagtcctcctgatcccctgcccgatgctttgtactta tcatatgtcacttatcctgtgtggcatcatgctgattga >gi568815585r:36748679_36979689|GENSCAN_predicted_peptide_6|405_aa MCHRYLEALEENRIPDKVRVEKAKSHRSSKEDIPGTRVITESPVKAAPEQGIQDEEVIFE DPVGGGEAGKASAWGKGAQERKARHKIGKDREDKPGKVVLGQPNTKGSRHQRRAEYKSGK CGRGVRHGFGVGSVRMKTTKQEEHPVFMGIMTPNANRYLEILTTLVDTSFLRDKIYRNIY FKSYSKRPLRRLSWSLPTPRAPAPTRPPPPPRVEHPLTDRLHQKTREQRPPKAKPHAEEP PRLPRGRTTSPSTQCAHTHAHKQRAAEEGTSYLSPPCHSSLPHPAPAGPGGGGGDRDSGC SSGGGGGGGGGPSRRQSDWSREASSPVLPRAALSVPLRDPRLGPCPACRGGPGVGWEPFV CVGRLASLSPTPPLRFLASTWSPKPEELSAQQEYRITGPSVWRPF >gi568815585r:36748679_36979689|GENSCAN_predicted_CDS_6|1218_bp atgtgccatcggtacctggaagccttggaagaaaacagaatccctgataaggtaagagtt gaaaaggcgaagagccacaggagctcgaaggaggacatacctgggaccagggtcatcaca gaaagtcccgtgaaggcagcgcctgaacagggcatacaagatgaagaagtcatatttgag gatcctgtgggaggtggggaagctgggaaggcctcagcgtggggtaaaggtgcacaggag aggaaagccaggcataaaatagggaaagaccgggaagataagcctggaaaagtggtcttg ggtcagcctaacacaaaaggaagccgacaccaaaggagagcagaatacaagtcaggcaag tgcggccgaggtgtccgtcatggttttggggtaggaagtgtcaggatgaagaccaccaag caagaagaacatccagttttcatgggtataatgactcctaatgctaacagatatcttgaa atcctaacaactttagttgacacaagtttcttaagagacaaaatttaccggaatatttat tttaaaagctacagcaagcgacccctcaggcgcctatcctggtccctgcccacgccccgg gccccagccccgacccgaccgccgcccccgccgcgggtcgaacatcccctgacagacagg ctccaccaaaaaacccgggaacaacgaccgccaaaagccaagcctcacgccgaggagcct cccaggctgccccgaggacgaacaacttcgccctccactcaatgcgctcacacgcacgca cacaagcagcgcgcggccgaggagggcacgtcttacctgtccccgccgtgccactcatcc ctcccccacccagcgcctgcagggcccggcggcggcggcggggaccgagacagcggctgc agcagcggcggcggcggcggcggcggcggcggccccagccggcgtcagtcagactggagc cgcgaagcctcatcgcccgtattaccccgcgctgccctctcggtccccctgcgcgacccc aggctcggcccctgcccggcctgccggggtggcccgggggtggggtgggagccctttgtc tgcgtgggtcgcctcgcgtctctctctcccaccccacctctgagatttcttgccagcacc tggagcccgaaaccagaagagttgtcagcccaacaagaatataggatcaccggcccatca gtctggagacccttctga >gi568815585r:36748679_36979689|GENSCAN_predicted_peptide_7|66_aa MCFAKKHNKKGPKKMQANSAKAMSARAKAIEALVKPKEVKPKIPKGVSCKLDRLAYIALP QAWEVC >gi568815585r:36748679_36979689|GENSCAN_predicted_CDS_7|201_bp atgtgctttgccaagaagcacaacaagaagggcccaaagaagatgcaggccaacagtgcc aaggccatgagtgcacgtgccaaggctatcgaggccctcgtaaagcccaaggaggttaag cccaagatcccaaagggagtcagctgcaagcttgatcgacttgcctatattgccctaccc caagcttgggaagtatgctag >gi568815585r:36748679_36979689|GENSCAN_predicted_peptide_8|240_aa MEPEGIIESNWNEIVDSFDDMSFSESLLCGTYAHGFGKPSDIQQCINMCAKVQKLQMEAP HIIMGTPGRVSDMLNWRYLSPKYIKMFVLDEADEMLSHRFKDQICDIFQMLNSNTQVVLL SATIPSDVLEVTKKFMRDPIRILVKKEELTLKDFTVSAMHGDMDQKERDVVMREYRSGSS RVLITTDLLPRGIDVQQLSLVINYDPPTNRENYIHRIGRGGQFGHKGVAINMVAEDKRTL >gi568815585r:36748679_36979689|GENSCAN_predicted_CDS_8|723_bp atggagcctgagggcatcatcgagagtaactggaatgagattgttgacagctttgatgac atgagcttctcggagtccctcctctgtggcacctatgcccatggttttgggaagccctct gacatccagcagtgcatcaacatgtgtgctaaggtacagaaactgcagatggaagctccc catatcatcatgggtacccctggccgtgtgtctgacatgcttaactggagatatctgtct cctaaatacatcaagatgtttgtactggacgaagctgacgaaatgttaagccatagattc aaggaccagatctgtgacatattccaaatgctcaatagcaacacccaggtagttttgctg tcagctacaattccttctgatgtacttgaggtgaccaagaagttcatgagggaccccatt cggattcttgtcaagaaggaagagttaaccctgaaggattttactgtctctgccatgcat ggagatatggaccaaaaggaacgagatgtggtcatgagggagtatcgttctggctctagc agagttttgattaccactgacctgctgcccagaggcattgatgtgcagcagctttcttta gtcatcaactacgaccctcccacgaacagggaaaactatatccacagaatcggtcgaggt ggacagtttggccataaaggtgtggctattaacatggtggcagaagacaagaggactctt tga >gi568815585r:36748679_36979689|GENSCAN_predicted_peptide_9|77_aa MDVGEIQQVIETTPVELAEDDLMKMSASEPVPDDAEEDIQEAGLQLLKSACDFFYNLDPS TIWALKLKQMVEEGLVL >gi568815585r:36748679_36979689|GENSCAN_predicted_CDS_9|234_bp atggatgttggagaaattcaacaggtaatagagacaacaccagtggagttagcagaagat gacttgatgaagatgagtgcttctgaaccagtgccagatgatgcagaagaagacatacaa gaagcaggactccaattattgaagtctgcttgtgacttcttttacaacttggacccttcc acaatatgggcactgaaactaaagcaaatggtagaagaaggattggtactgtag >gi568815585r:36748679_36979689|GENSCAN_predicted_peptide_10|168_aa MSVKKKHKIRGNRNNAALLGIRNSPQKAGLSPYEMLYRQPLLTNDLVLDQETAKLVADIT SLAKYQQVLKTLQEACPREEGKELFHPGDMETENPGDNASYSCEPLEDLCLLFKRQPIEA VKLQVVLQMEPQMQCMTKIYHGPLDWPASPCSDVDGIEGTPPEEISTA >gi568815585r:36748679_36979689|GENSCAN_predicted_CDS_10|507_bp atgagtgttaaaaagaaacacaaaatcagaggcaacagaaacaatgcagccttactagga atccgaaactctccccaaaaagcgggacttagcccatatgaaatgctgtatagacagcca ctcctaaccaatgaccttgtgcttgaccaagagacggccaagttagttgcagacatcacc tccttagccaaatatcaacaagttcttaaaacattacaggaagcctgtccccgagaagag ggaaaggaactattccaccctggagacatggaaaccgaaaatccaggagacaacgctagc tattcctgtgaacctctagaggatctgtgcctgctctttaagcgacaaccaattgaagct gtaaaactacaagtggttcttcaaatggagccccagatgcagtgcatgactaagatctac catggacccctggactggcctgctagcccatgctccgatgttgatggcatcgaaggcacc cctccagaggaaatctcaactgcatga >gi568815585r:36748679_36979689|GENSCAN_predicted_peptide_11|71_aa NQMAIACGSRAHLEKESIAQRSYFRTLLMYGFHFLVWFLCVKGIRDTQCGFKLFTREAAS RTFSSLHVERW >gi568815585r:36748679_36979689|GENSCAN_predicted_CDS_11|216_bp aatcaaatggctatagcatgtggatctcgagctcatttagaaaaagaatcaattgctcag cgttcttacttccgtactcttctcatgtatgggttccactttctggtgtggttcctttgt gtcaaaggaatcagggacacacagtgtgggttcaaattatttactcgagaagcagcttca cggacgttttcatctctacacgttgaacgatggtag