GENSCAN 1.0    Date run: 5-Nov-116    Time: 18:44:57

Sequence gi568815578f:54375947_54750476 : 374530 bp : 39.36% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   8575   8700  126  0  0   77   66    48 0.189   1.46
 1.02 Intr +   8843   8981  139  1  1   42   48   106 0.276   1.12
 1.03 Term +  22489  22601  113  2  2   69   49   196 0.713  11.54
 1.04 PlyA +  22889  22894    6                               1.05

 2.00 Prom +  34009  34048   40                              -4.65
 2.01 Init +  39149  39237   89  1  2   60   69    44 0.022  -0.14
 2.02 Term +  53609  53807  199  2  1   29   33   236 0.097   8.09
 2.03 PlyA +  54047  54052    6                               1.05

 3.00 Prom +  71135  71174   40                              -3.65
 3.01 Init +  73532  73607   76  1  1   67   37    77 0.594   1.80
 3.02 Intr +  78650  78877  228  0  0   56   53   148 0.177   5.02
 3.03 Term +  81210  81355  146  1  2   57   41    91 0.486  -1.61
 3.04 PlyA +  83014  83019    6                               1.05

 4.08 PlyA -  83199  83194    6                               1.05
 4.07 Term -  86723  86348  376  1  1  130   39   144 0.844   6.93
 4.06 Intr -  92283  92219   65  0  2   76   98    30 0.016  -0.60
 4.05 Intr -  94404  94160  245  1  2   90  100    31 0.007   0.79
 4.04 Intr -  99967  99603  365  0  2   34   23   213 0.315   3.30
 4.03 Intr - 100369 100040  330  0  0   22   32   267 0.358   8.32
 4.02 Intr - 100772 100569  204  1  0  100   59   117 0.628   7.39
 4.01 Init - 138305 138013  293  2  2   47   76   147 0.324   6.07
 4.00 Prom - 152873 152834   40                              -3.05

 5.00 Prom + 153573 153612   40                              -6.35
 5.01 Init + 160624 160629    6  0  0   72   88     0 0.403  -0.67
 5.02 Intr + 178987 179094  108  0  0  108  116    47 0.415   9.06
 5.03 Intr + 179306 179489  184  1  1   47   92    31 0.327  -2.16
 5.04 Intr + 181671 181762   92  1  2  127   80    37 0.807   5.59
 5.05 Intr + 186984 187167  184  2  1   34   16   146 0.002   0.44
 5.06 Intr + 195366 195442   77  1  2  129   35    46 0.000   1.72
 5.07 Intr + 212537 212651  115  1  1   89   72   112 0.884   8.80
 5.08 Intr + 212741 212860  120  0  0   77   68   114 0.731   7.85
 5.09 Intr + 215670 215859  190  1  1   81  105    52 0.791   3.92
 5.10 Intr + 234442 234658  217  1  1   81   49   120 0.044   4.98
 5.11 Term + 250024 250134  111  0  0   63   47    88 0.004  -0.22
 5.12 PlyA + 251234 251239    6                               1.05

 6.00 Prom + 253478 253517   40                              -7.25
 6.01 Init + 253820 253957  138  1  0   63   73   126 0.491   8.79
 6.02 Intr + 258080 258162   83  1  2   75   80    50 0.409   0.42
 6.03 Intr + 267454 267748  295  1  1   76   97   290 0.145  24.79
 6.04 Intr + 278336 278438  103  1  1    7  103    66 0.012  -1.17
 6.05 Term + 285189 285313  125  1  2  102   41   118 0.046   6.07
 6.06 PlyA + 285778 285783    6                               1.05

 7.05 PlyA - 285855 285850    6                              -0.45
 7.04 Term - 286105 285886  220  0  1   16   48   221 0.307   6.43
 7.03 Intr - 305368 305257  112  2  1   95  100    13 0.528   1.72
 7.02 Intr - 306044 305882  163  2  1  107   49    82 0.173   4.83
 7.01 Init - 307363 307211  153  1  0   68   55   142 0.187   8.93
 7.00 Prom - 315898 315859   40                              -5.45

 8.03 PlyA - 317666 317661    6                               1.05
 8.02 Term - 327610 327350  261  1  0   23   38   263 0.889   9.34
 8.01 Init - 341368 341336   33  1  0   84  127     4 0.314   3.87
 8.00 Prom - 348820 348781   40                              -3.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  91719  91649   71  0  2   36  100    85 0.810   5.07
S.002 Init - 195922 195844   79  1  1   57   80    82 0.863   5.57

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815578f:54375947_54750476|GENSCAN_predicted_peptide_1|125_aa
QMPGLTPISQSLSSTNSALTMSTSEDFSPMLKSAQGCSYPAHIFFLMPRAGKYLKQSNKR
HLSFKAEFQKDFARASLQDLQEEEYAWGDKSSGSTKCGYPEKAVTLALALAGGGQLPHMM
RKGTN

>gi568815578f:54375947_54750476|GENSCAN_predicted_CDS_1|378_bp
cagatgcctggtctgacacccatttctcagtctctcagtagtaccaacagtgccctaacc
atgagcacatctgaagatttttcaccaatgttgaagtcagctcaaggatgttcataccca
gcccatattttctttttaatgccgagagcgggaaagtacttaaaacagtcaaataaacgt
cacttgtccttcaaagctgaatttcaaaaggactttgcacgagcatctctacaagatctc
caggaagaagaatatgcctggggagacaagagctcaggatccaccaagtgtgggtaccca
gaaaaggcggtcacactggcccttgcccttgctggtggagggcagctgccccacatgatg
aggaaagggaccaactga

>gi568815578f:54375947_54750476|GENSCAN_predicted_peptide_2|95_aa
MLLYRDPNPSDAASRKIVLFLTLEHVLMSKNLKSYLGQEQSGPCILGMAGRKVQECSEPV
ATAPSRSDFQLTSEGKEPSSEEPLPRDRYQKLQQI

>gi568815578f:54375947_54750476|GENSCAN_predicted_CDS_2|288_bp
atgcttctctatagggaccccaacccatcagatgcagcttccagaaaaatagttctcttt
ctcacattggaacatgttttgatgtcaaaaaacctcaaaagctacctgggtcaagaacag
tcagggccttgcattcttggcatggccggcaggaaggtgcaggagtgttcagagcccgtg
gccactgctccttcccgctcagactttcagctcacctcagagggaaaagagccatcttcg
gaagagccacttcctcgggatcgataccagaaattgcagcaaatttaa

>gi568815578f:54375947_54750476|GENSCAN_predicted_peptide_3|149_aa
MKIQRNKGNYFEEESGSFGEVCLDQTNLQRVKGKFSLSSYNSVKRKTSAKLNVKEFNGVM
NDSQIGQPPESQQIQRDSSAATCWKKIYRQQQKRKWQTEIGKASSSSPSIVMTTTVSPDH
CLLSGKIAPSRTADSETQEMVAQEFQQNV

>gi568815578f:54375947_54750476|GENSCAN_predicted_CDS_3|450_bp
atgaagatccaaagaaacaaaggaaactattttgaggaagagagtggaagttttggagaa
gtgtgtttggatcaaacaaaccttcagagggtaaaggggaagttttctcttagctcctac
aatagtgtgaaaagaaaaacttcagccaaattaaatgtaaaggagtttaatggagtaatg
aatgattcgcaaattgggcagcccccagaatcacagcagattcagagagactccagtgca
gccacatgttggaagaagatttacagacaacagcaaaaaaggaagtggcaaacagaaatt
ggaaaggccagtagctcctctccctccatagtaatgacaaccacagtatctcccgaccac
tgtctcctgagtggcaaaattgcccccagtagaaccgctgattcagagacacaggaaatg
gtggcgcaggaattccaacaaaatgtttga

>gi568815578f:54375947_54750476|GENSCAN_predicted_peptide_4|625_aa
MKEEKNVKLASVKPEDVTSQGRLQKEPCIQSHMSLNWSSDGLGGTNDFVKKSESKTTSDF
PAIPLNCILQCTMSTKSRVQLLNPFRISNLNLKIPDGRTYVGSTCAMPGIRFHVNNPSVD
LQGVFQQQKSTTPASNHGKSRFKYLPKLAPTRGQPAPPLTRHLGQSSNFHGYREIFISSR
WAPGATEAQHAATIRKPSHPRDKDRSITYLDAGLTVTAPETRVSAEKATAQFSGARPGTL
RPPSRDAGSRERQSNRRTATRRGSILTPRRLLRILTARTRSGSGEGGGQGQDKQSVGESW
RGEAASAALGEALEGAHHFLCPPNPDSGRPACSHFRGAQSAARGGEAGAPASADSAAGLP
GGGAAAAEGSSPASSRPRRRRRRRRRRRKEEEERGGRAIEGKKLCTKIQRLLHFSRAGNG
LPSFTKGGCVFIKYHKAPKLKLEWHFSNTLTKNNLSFGPAACCFQWECHRNICSLSSGKH
MRRCTSGLLSDSLESGSIPLRSDQCLGLKPSPVYSYLLPAMINFMCQLDWASRCPDICLN
IILSVSVIALVRFHTADKDIPKTGNKKRFNWTLQFHMAEEASESWQEVKDTSYMAAAREK
WGRGKRAETPDKPIRSHETYSLSQE

>gi568815578f:54375947_54750476|GENSCAN_predicted_CDS_4|1878_bp
atgaaagaggagaaaaatgtcaagttggcatctgtcaaaccagaagatgtgacatcccag
gggagactccaaaaggagccctgtattcagagccacatgtcccttaactggagcagtgat
ggcttaggaggtaccaacgactttgtcaagaaatcagaatccaaaaccacttcggacttc
ccagctattcccttgaattgcatcctgcagtgcacaatgagtacaaaaagtagggtgcag
ctattgaatcccttcagaataagtaatctgaatctcaaaattccagatgggaggacatat
gtagggagcacttgcgctatgccaggtatcaggttccacgtaaataacccttccgtggac
ctccagggtgtattccagcaacagaaatccaccaccccggcttcgaaccacggaaaatcc
agattcaagtatctacccaagttggccccgaccagaggccagcccgcaccgcctttgaca
cgacacctgggacagagctccaacttccatggctaccgagaaatatttatttcgagtaga
tgggctccaggggctactgaagcccaacacgcggccaccatcagaaagccaagtcatcca
agagataaagatagatccattacatacctagacgctggccttacagtgacagccccggag
acgcgggtttcggctgagaaagccaccgcgcaattctccggcgcgcggccggggactctc
cgtccaccctccagggatgctggctcaagagagagacaatcgaaccggcgaacagcaaca
cggagaggatcgatactcaccccgaggcgtctgctccggatcctcacagcgcgcacccga
agtggctctggggagggtggagggcagggtcaagacaagcagtcggtgggagaaagttgg
agaggtgaagctgcttcggcggcgctgggcgaggctttagaaggcgcacatcactttctc
tgtccccccaaccctgacagtgggcggccggcttgcagccactttcggggagcgcagagc
gcggcgcggggaggcgaggctggagcgccggcgtcagctgactccgcggccggcctgcca
ggaggaggagcggcggcggctgagggatccagccctgcctcctcccggccgagaaggagg
agaagaaggaggaggaggaggaggaaggaggaggaggaaagaggaggaagggctattgag
ggcaagaaattatgtacaaaaatccagaggcttctacacttctctagagcaggtaatggg
ctcccaagcttcaccaaaggaggctgtgttttcatcaagtaccacaaagcaccaaagctg
aagctggaatggcacttctccaatacccttaccaaaaataacttgagctttggccctgct
gcctgctgctttcagtgggagtgtcacagaaacatatgcagcctttcctctggaaagcac
atgaggagatgcacatctggccttctgagtgattctcttgaaagcggctccattccttta
agatcagatcagtgtcttggtctaaagccttccccagtctactcctacttgctccctgca
atgattaattttatgtgtcaacttgactgggcctcaaggtgcccagatatttgcttgaac
atcattttgagtgtgtctgtgattgcattagttcgttttcacactgctgataaagacata
cccaaaactgggaacaaaaaaaggtttaattggactttacagttccacatggctgaagag
gcctcagaatcatggcaggaggtgaaagacacttcttacatggcagcggcaagagaaaaa
tggggaagaggcaaaagagcggaaacccctgataaacccatcagatctcatgagacttat
tcactatcacaagaatag

>gi568815578f:54375947_54750476|GENSCAN_predicted_peptide_5|467_aa
MEIYQRCWLVFKKASSKGPKRLEKFSDERAAYFRCYHKLFLLFSSLCYGTYHVVVLPTLK
PREDEDWISLCLRHCLMQKDTHSMLGECGERREGKKQKTGALGSSLSLEDAKHLLPFIPA
LPFSWNALPQDLCNVFKLLWDYRIRVDPSPIAGTLIRRDTDPQGESHVKTGRHWSDTIKS
QRCQGLRQPPDRPIPILLMVKGQHVWPSQLDKAQIQEVTELNNVKNVARLPKSTKKHAIG
IYFNDDTSKTFACESDLEADEWCKVLQMECVGTRINDISLGEPDLLATGVEREQSERFNV
YLMPSPNLDVHGECALQITYEYICLWDVQNPRVKLISWPLSALRRYGRDTTWFTFEAGRM
CETGEGLFIFQTRDGEAIYQKVHSAALAIAEQHERLLQSVKNSMVRLEFLPRVPVPITAE
SNWVIFVLENMEGERCCEEGLLSFLEEGAPLWRQTMIYPKLGFQLAV

>gi568815578f:54375947_54750476|GENSCAN_predicted_CDS_5|1404_bp
atggagatttatcagcgatgctggttagtattcaagaaagcttcaagcaaaggtccaaaa
agactggagaaattttctgatgaacgtgctgcatatttcaggtgttatcataagctgttc
ttgctgttttccagtctctgctatgggacttatcatgttgttgtactccctacactgaag
ccccgagaggatgaggactggatctcattgtgtctcagacactgcctgatgcagaaagac
actcattcaatgcttggggaatgtggggagagaagggaggggaagaagcagaaaacaggt
gccctgggctcctctctttccttggaagatgccaagcacctcctgccctttatacctgct
cttcccttttcctggaatgctctcccccaggatctttgtaatgtattcaagttgctgtgg
gattatcggattagagtggaccccagtcccatagctggtacccttataagaagagacaca
gacccacagggagaaagccatgtgaagacaggcagacactggagtgatacaattaaaagc
caaaggtgtcaaggactgcggcagccaccagaccgtcccattcccatccttctcatggtt
aaaggtcaacatgtgtggccaagccagctggacaaggctcaaatccaggaggttacagaa
ctcaataatgtgaagaacgtagctcgattgccaaaaagcaccaagaaacatgccataggg
atttatttcaatgacgatacctccaagacttttgcttgcgaatcagatcttgaggctgat
gagtggtgcaaagtactccagatggagtgtgtaggaacacggatcaatgacatcagcctt
ggagagcctgacttactggccactggggttgagagagaacagagtgagagattcaatgtg
tatttgatgccatctcctaacttagatgtacatggcgaatgtgccttgcagattacatat
gagtatatctgtctttgggacgtccagaatcccagagtcaaactcatctcttggccgcta
agcgccctgcggcggtatggacgtgatactacgtggttcacttttgaggcagggaggatg
tgtgagactggtgaagggctgtttatctttcagacccgagacggggaggccatctatcag
aaagtccactctgctgccttggccatagccgagcagcacgagcgcttgctacagagtgtg
aaaaactcgatggtacgtttggaatttcttcctcgtgtcccagtgcctatcactgcagaa
agcaattgggtcatttttgtgctggaaaatatggaaggagaacgctgctgcgaggagggt
cttctctcctttttagaggaaggagcaccgttgtggaggcagacaatgatatatccaaag
cttggctttcaactagctgtctga

>gi568815578f:54375947_54750476|GENSCAN_predicted_peptide_6|247_aa
MTGNGHSGKVIPRQTAIPKGQVLQAQEPLSAKALRQSVLCVCMCKETHERSAPAASQCQR
NQMRNWETRNAFGSQATFGPSVADAQLSSLWLQLQMKMSERAASLSTMVPLPRSAYWQHI
TRQHSTGQLYRLQGKRGATCVQGVGQAWEYWGGGSAQLAAAACQLVSALPPKTPVKSQVV
EPSQCLHSSGLPKTLPLSGDAEFWHTDHAGELPDITMDFVNCHGAGGSVAVRTTTGHSHR
HLGFDES

>gi568815578f:54375947_54750476|GENSCAN_predicted_CDS_6|744_bp
atgaccgggaatggtcactctggtaaagtgatacccaggcagacagccattcctaagggc
caggtgcttcaggcccaggagcctttaagtgccaaagccctgaggcagtctgtgctttgc
gtgtgcatgtgcaaagagactcatgaaagaagtgcaccagcagcaagtcagtgtcagcga
aaccaaatgaggaactgggaaaccagaaatgcatttggcagtcaggccactttcggcccc
agtgttgctgacgcacaactttcttccctttggctgcagctccagatgaagatgagtgag
cgggccgcctcgctgagcaccatggtgcccctgcctcgcagcgcctactggcagcacatc
acacggcagcacagcacgggacagctctaccgcttgcaaggtaagcgtggggctacctgt
gtccagggtgtgggccaggcctgggagtactggggagggggctcagctcaactcgcagct
gctgcatgccagctggttagtgcccttcctccaaagacacctgtgaaaagtcaggtggta
gagccaagccagtgcctccattcgagtgggctgcccaagacgcttcccttgtcaggagat
gcagaattctggcacacagaccatgcaggggaacttcctgacattaccatggattttgta
aactgtcatggtgctggtgggagtgtagcagtgaggacgaccacaggtcactctcatcgc
caccttggttttgatgagtcttag

>gi568815578f:54375947_54750476|GENSCAN_predicted_peptide_7|215_aa
MGNIAVQNRQLREPAALANSPFSTACLGSSRCQIITKQASEILRPAGMWYPLLPNLFLHA
EKDDGEKKTYGAQNSPISLSQEKSCRGNRGQAYPASFQKGPDQCLSMLPETILRIIQMFS
LYTVCLATAADINITGLAWHKLKNPVASVLNRGQLLNDWTASFKRRASVLGGSHMKKHKR
DRKSQRTIPSPAQLPSSQGAPSGQGGYGIRRPSVK

>gi568815578f:54375947_54750476|GENSCAN_predicted_CDS_7|648_bp
atgggtaacatcgcagtccagaacagacagctcagagagccagccgctttggcaaacagc
cctttcagtacagcctgtttgggaagttctagatgccagattattaccaagcaggccagt
gagatcctcagacctgcaggtatgtggtacccactcctgcccaacctcttccttcatgct
gagaaggatgacggtgagaaaaaaacatatggtgcccagaacagtcctatttctctttct
caagagaaaagctgcaggggaaacaggggacaggcttatccagcttccttccagaagggg
ccagaccagtgcctgtcgatgcttccagagacaatccttagaataatacagatgttctcc
ttgtatacagtttgcctagcaaccgcagcagacataaacattacgggcttggcgtggcac
aaactgaagaaccctgtggcttctgtcctcaaccgaggtcaattattaaatgactggact
gcgagttttaaaaggcgagcatcagtcctgggtgggagccacatgaagaagcacaaaaga
gacagaaaaagtcaacgaacaatccccagtcccgctcagttgccttcttctcaaggggca
ccttcaggacagggcggctacggaatcaggagacccagtgtaaaatga

>gi568815578f:54375947_54750476|GENSCAN_predicted_peptide_8|97_aa
MIARLHQPCGTDKKCIKGNERAKGRERKKETEKEGDRDKEKQRQKLEKQVEEAQIQYGTE
KDVGAKLTATGRLKQSLLTRWLTIAEHQKAPSNCQVI

>gi568815578f:54375947_54750476|GENSCAN_predicted_CDS_8|294_bp
atgattgcgaggcttcaccagccatgtggaacggataaaaaatgcattaaaggaaatgag
agggcaaaaggaagagagagaaagaaagagacagagaaagagggagacagagacaaagag
aaacaaagacagaaacttgagaaacaggttgaagaagctcaaatacagtatggaacagag
aaagacgttggagctaaattaacagctaccggaagactaaaacaatctcttctaaccaga
tggctgaccattgccgaacatcaaaaggctccttccaactgtcaagttatttaa