GENSCAN 1.0 Date run: 4-Nov-116 Time: 10:45:28 Sequence gi568815574f:2591361_2838256 : 246896 bp : 44.32% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1997 2051 55 1 1 81 36 55 0.097 1.08 1.02 Intr + 14099 14280 182 0 2 111 66 43 0.058 3.99 1.03 Intr + 17951 18143 193 1 1 4 43 137 0.149 -0.13 1.04 Intr + 20754 20858 105 1 0 72 99 58 0.362 5.49 1.05 Intr + 27380 27424 45 0 0 74 73 60 0.363 1.48 1.06 Intr + 28069 28137 69 2 0 53 111 43 0.595 2.25 1.07 Intr + 31330 31374 45 2 0 89 74 61 0.627 3.18 1.08 Intr + 31989 32093 105 1 0 69 111 44 0.612 4.99 1.09 Term + 35237 35274 38 0 2 87 47 19 0.108 -4.80 1.10 PlyA + 36549 36554 6 1.05 2.02 PlyA - 37249 37244 6 1.05 2.01 Sngl - 41292 40801 492 0 0 64 46 177 0.743 7.16 2.00 Prom - 43356 43317 40 -4.16 3.06 PlyA - 44467 44462 6 1.05 3.05 Term - 49720 49545 176 2 2 37 41 124 0.374 0.52 3.04 Intr - 57500 57344 157 2 1 48 107 56 0.040 3.08 3.03 Intr - 64659 64580 80 1 2 99 44 20 0.002 -2.03 3.02 Intr - 85698 85612 87 1 0 106 69 48 0.461 4.64 3.01 Init - 94988 94898 91 2 1 88 80 27 0.254 0.55 3.00 Prom - 95928 95889 40 -4.16 4.00 Prom + 96563 96602 40 -2.86 4.01 Init + 100001 100067 67 1 1 110 88 248 0.995 26.23 4.02 Intr + 123062 123094 33 0 0 114 105 15 0.831 3.99 4.03 Intr + 128301 128345 45 1 0 108 101 57 0.986 7.38 4.04 Intr + 128996 129064 69 0 0 57 111 61 0.947 4.45 4.05 Intr + 131267 131314 48 0 0 116 87 37 0.915 5.05 4.06 Intr + 134900 135013 114 0 0 104 115 95 0.963 14.22 4.07 Intr + 146840 146896 57 0 0 107 99 86 0.851 10.46 4.08 Intr + 149323 149366 44 2 2 80 94 -4 0.825 -2.64 4.09 Intr + 149775 149857 83 2 2 81 121 3 0.839 1.34 4.10 Term + 150264 150375 112 0 1 61 47 135 0.444 4.73 4.11 PlyA + 150520 150525 6 1.05 5.00 Prom + 151730 151769 40 -1.56 5.01 Init + 160915 160975 61 0 1 91 94 44 0.897 4.71 5.02 Intr + 162546 162662 117 1 0 80 110 19 0.826 3.74 5.03 Term + 163267 163379 113 2 2 38 47 80 0.381 -2.38 5.04 PlyA + 163864 163869 6 1.05 6.00 Prom + 169452 169491 40 -3.76 6.01 Init + 171066 171129 64 2 1 52 105 100 0.931 9.41 6.02 Intr + 176192 176312 121 0 1 73 58 50 0.863 0.05 6.03 Intr + 176998 177131 134 1 2 61 80 83 0.898 5.09 6.04 Intr + 178381 178413 33 2 0 91 91 15 0.550 0.29 6.05 Intr + 179190 179231 42 1 0 99 119 69 0.995 9.31 6.06 Intr + 183356 183379 24 0 0 99 109 25 0.719 3.70 6.07 Term + 188206 188228 23 2 2 97 45 22 0.377 -2.73 6.08 PlyA + 190569 190574 6 1.05 7.02 PlyA - 191477 191472 6 1.05 7.01 Sngl - 196243 195629 615 1 0 46 48 392 0.736 27.20 7.00 Prom - 196571 196532 40 -7.56 8.00 Prom + 196737 196776 40 -8.36 8.01 Init + 198491 198579 89 1 2 84 54 60 0.776 0.81 8.02 Term + 198652 198847 196 1 1 23 49 285 0.946 14.98 8.03 PlyA + 199239 199244 6 1.05 9.03 PlyA - 199260 199255 6 1.05 9.02 Term - 215786 215085 702 2 0 15 48 236 0.143 5.83 9.01 Init - 216956 216552 405 2 0 78 80 248 0.150 19.59 9.00 Prom - 217030 216991 40 -8.56 10.03 PlyA - 217064 217059 6 1.05 10.02 Term - 219317 219045 273 2 0 69 37 217 0.865 10.17 10.01 Init - 222303 222298 6 0 0 78 90 0 0.526 0.07 10.00 Prom - 232153 232114 40 -2.36 11.00 Prom + 234554 234593 40 -6.06 11.01 Init + 238945 239090 146 0 2 78 98 127 0.598 12.29 11.02 Term + 239887 240037 151 1 1 29 38 109 0.604 -2.62 11.03 PlyA + 240201 240206 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 216956 216411 546 2 0 78 41 263 0.834 16.71 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815574f:2591361_2838256|GENSCAN_predicted_peptide_1|278_aa MALKNDDACVCADLTKFSGALVMNPMMGEEIYFSLTATLPLRQATGSTSSCSVDVLQLEF ELISSSLPSEWSGIEPIFQSSAGSAPPGPWGRFRPAAFARALHSRTISARLGRAVARGAV LALRLLGLLRRLVAAPVSERRDPGTDDCNSHVLQTGDPAPPNAPKPKPDPNPNRPGFTGA DFDLADAFRDGGNNDPAPLNSPKLKPNANPEQPGFIGDDFDLADALHDKGNCTHNLNSHL LQTDEPAPLNPPKPKPNPNPKQPDSTGDDFDFTDVSSW >gi568815574f:2591361_2838256|GENSCAN_predicted_CDS_1|837_bp atggcgctgaagaatgatgatgcctgtgtgtgtgcggacctgaccaaattctcaggggcc ttagtcatgaacccaatgatgggtgaggaaatatatttctctctcactgctaccctgcca ttacgacaggccacaggcagcacttccagctgctccgtggatgtgctgcaacttgaattt gaacttatttcaagttctttgccttcagaatggtcaggaatagagcccatcttccagtcc tccgcgggctccgccccacccggcccgtgggggaggttccgtcccgccgccttcgcccgc gcgctgcactcgcggacgatctctgctcgccttgggcgcgccgtggcgcgcggggccgtg ctggcgctgcggctcctcggcctgctccgccgtctggtggccgccccggtgagcgagcgg cgggatccgggtactgatgattgcaactctcatgttttacaaacaggtgacccagcacca cctaatgcccccaaaccaaagccagatccaaaccccaaccgacctggtttcactggggct gactttgacttagcagatgcctttcgtgatggaggaaataacgacccagcacctcttaat tcacccaagctgaagccaaatgcgaaccctgagcagcctggattcattggggatgacttt gacttagcagatgccttacatgacaaaggaaactgcactcataatctcaactctcatctt ttacaaacagacgagccagcaccactgaacccacccaaaccaaagccaaatccaaacccc aagcagcctgattccaccggggatgactttgatttcacagatgtttcttcatggtga >gi568815574f:2591361_2838256|GENSCAN_predicted_peptide_2|163_aa MSLRTRKCYNYPYELDGSKCRYEPGNTTIFHMNQKVLQCHYEPGSVTMSLKLRSATIIPM NQEVLQCPCKPGSATIIHMNQEVSQCPYEPGSATIICMNLEMLQCPYEPGSATIIRMNQK VLQCPCKLGSSTIIGMKEEVLQCPCEPGSAIIHMNQEVLQLSG >gi568815574f:2591361_2838256|GENSCAN_predicted_CDS_2|492_bp atgtccctacgaaccaggaagtgctacaattatccctatgaactagacggttccaaatgt cgttatgaaccaggaaatactacaattttccatatgaaccaaaaagtgttgcagtgtcac tatgaaccaggaagtgtcacaatgtccctcaaactaagaagtgctacaattatccctatg aaccaggaagtattgcaatgtccctgcaaaccaggaagtgctacaattatccatatgaac caggaagtgtcacaatgtccctatgaaccaggaagtgctacaattatctgtatgaacctg gaaatgttacaatgtccctatgaaccaggaagtgctacaattatccgtatgaaccagaaa gtgttacaatgtccctgcaaattaggaagttctacaattatcggaatgaaagaggaagtg ttacaatgtccctgtgaaccaggaagtgctattatccatatgaaccaggaagtgctacaa ttatctggatga >gi568815574f:2591361_2838256|GENSCAN_predicted_peptide_3|196_aa MEFHHVGQAGLELLTSTDLPASASRSAGITGSDQDTCTFCLQVLVTGKSMNRILIDVCAD LPSLMIQSPAHSPLWVMRAGDYAANRREFLTRHPQPLTCGLSLLMFSQWWEMRPANGTLY RTTVLNCWEDKARSQIKSGHFQPHNGTVVRTSSTRDTRNTKECGTIQQSRIQRHEWCLLR AFRAANEQEMTAGISQ >gi568815574f:2591361_2838256|GENSCAN_predicted_CDS_3|591_bp atggagtttcaccacgttggccaggctggtctcgaactcctgacctcaactgatctgccg gcctcggcctcccgaagtgctggtattacagggtcagatcaggacacctgcacattctgt ttgcaggttctcgtcaccggcaaatcaatgaaccgaattttaattgatgtgtgtgcagat ttaccctcactgatgatccagtcacctgctcactctcctctttgggtcatgagagctggt gactatgcagccaacaggagggaattcttgacaagacatccccagcccctgacttgtgga ttgagcctgctgatgttcagtcagtggtgggaaatgagacctgctaatgggaccctgtac cgcaccacggtgctgaactgttgggaagataaagcaagaagtcagataaagtctggacat ttccagcctcacaatggcactgtggttagaacttcatcaacaagagacacgcgtaacacc aaggaatgtggcacaattcagcagtcccgaatacagcggcatgaatggtgccttctaaga gcattcagagcagctaacgagcaagagatgacggctggcatttctcaataa >gi568815574f:2591361_2838256|GENSCAN_predicted_peptide_4|223_aa MARGAALALLLFGLLGVLVAAPDGGFDLSDALPGDDFDLGDAVVDGENDDPRPPNPPKPM PNPNPNHPSSSGSFSDADLADGVSGGEADAPGVIPGIVGAVVVAVAGAISSFIAYQKKKL CFKENAEQGEVDMESHRNANAEPAEIKPLAPESCRNCVHNLGCLQITCPSSTSETPGSPD FFLIKIRYSSVSRYSSKSRYSLHPDTVLPKMYKMYRIFPTLKT >gi568815574f:2591361_2838256|GENSCAN_predicted_CDS_4|672_bp atggcccgcggggctgcgctggcgctgctgctcttcggcctgctgggtgttctggtcgcc gccccggatggtggtttcgatttatccgatgcccttcctggggatgactttgacttagga gatgctgttgttgatggagaaaatgacgacccacgaccaccgaacccacccaaaccgatg ccaaatccaaaccccaaccaccctagttcctccggtagcttttcagatgctgaccttgcg gatggcgtttcaggtggagaagccgacgccccaggcgtgatccccgggattgtgggggct gtcgtggtcgccgtggctggagccatctctagcttcattgcttaccagaaaaagaagcta tgcttcaaagaaaatgcagaacaaggggaggtggacatggagagccaccggaatgccaac gcagagccagctgaaataaaaccactggctcctgaaagttgtaggaactgtgtccacaat cttggctgtttacaaatcacgtgtccatcgagcacgtctgaaacccctggtagccccgac ttctttttaattaaaataagatactcctctgtatccagatactcctctaaatccaggtac tccctacatccagatactgtacttcctaagatgtacaagatgtaccgcattttcccaaca ctgaagacttga >gi568815574f:2591361_2838256|GENSCAN_predicted_peptide_5|96_aa MESWWGLPCLAFLCFLMHARVQCSINYVRYATPHYKVDCVLDDLSIRWLMKSVLSTLKAG RLQAEEQGEPVQVPKLKNLESDVPASTMGERCRPED >gi568815574f:2591361_2838256|GENSCAN_predicted_CDS_5|291_bp atggagagctggtggggacttccctgtcttgcgttcctgtgttttctaatgcacgcccga gtacaatgttcaataaattatgtgaggtatgcaacacctcattataaagttgactgtgtg ctagatgatttgtccatccgttggctgatgaaaagtgttctgagcactctgaaggcaggc cgtctgcaggctgaggagcaaggagagccagtccaggttccaaaactgaagaacttggag tctgatgttccagcatccaccatgggagaaagatgtaggccagaagactag >gi568815574f:2591361_2838256|GENSCAN_predicted_peptide_6|146_aa MGSEEQIMHQGGLLSSGVCSPVENKPRESNGVMCNPTLETETPELALSTWDISQPRGLTP DPCLDWNCWQPWRGDVAQIQEADTVPVPQPSLSSLDHDDQERWRRPDTFIRLTAFGSGQR DFDLADALDDPEPTKKPNSDTKIHSC >gi568815574f:2591361_2838256|GENSCAN_predicted_CDS_6|441_bp atgggttccgaggagcagatcatgcaccagggcggcctgctcagctctggtgtctgcagc cccgtggaaaataaaccccgtgaaagcaatggcgtcatgtgcaacccgaccttggagaca gagaccccagagttagcgttgtctacctgggacatctcccagcccaggggtcttactcca gacccgtgtctcgactggaactgctggcagccttggaggggagatgttgcacagatccaa gaggctgacacggttcctgtcccccagcccagtctgagcagcttggaccatgatgatcaa gagcgctggaggaggccagacacattcattagattaactgcttttggatcaggtcaaaga gactttgatttggcagatgcccttgatgaccctgaacccaccaagaagccaaactcagac accaaaatccacagctgctga >gi568815574f:2591361_2838256|GENSCAN_predicted_peptide_7|204_aa MQSYASAMLSVFNSDDYSPAVQENIPALRRSSSFLCTESCNSKYQCETGENSKGNVQDRV KRPMNAFIVWSRDQRRKMALENPRMRNSEISKQLGYQWKMLTEAEKWPFFQEAQKLQAMH REKYPNYKYRPRRKAKMLPKNCSLLPADPASVLCSEVQLDNRLYRDDCTKATHSRMEHQL GHLPPINAASSPQQRDRYSHWTKL >gi568815574f:2591361_2838256|GENSCAN_predicted_CDS_7|615_bp atgcaatcatatgcttctgctatgttaagcgtattcaacagcgatgattacagtccagct gtgcaagagaatattcccgctctccggagaagctcttccttcctttgcactgaaagctgt aactctaagtatcagtgtgaaacgggagaaaacagtaaaggcaacgtccaggatagagtg aagcgacccatgaacgcattcatcgtgtggtctcgcgatcagaggcgcaagatggctcta gagaatcccagaatgcgaaactcagagatcagcaagcagctgggataccagtggaaaatg cttactgaagccgaaaaatggccattcttccaggaggcacagaaattacaggccatgcac agagagaaatacccgaattataagtatcgacctcgtcggaaggcgaagatgctgccgaag aattgcagtttgcttcccgcagatcccgcttcggtactctgcagcgaagtgcaactggac aacaggttgtacagggatgactgtacgaaagccacacactcaagaatggagcaccagcta ggccacttaccgcccatcaacgcagccagctcaccgcagcaacgggaccgctacagccac tggacaaagctgtag >gi568815574f:2591361_2838256|GENSCAN_predicted_peptide_8|94_aa MERYSVQLRPATLQDAAPATLHLIPCGPPRLRGDEVAVPSGLVGYVMVTEQEEVSMGKPD PWRGSGSDNQEEEPLERDFDRFLGTTTSFSRFTL >gi568815574f:2591361_2838256|GENSCAN_predicted_CDS_8|285_bp atggagaggtacagcgtccagttgcgcccagccacactgcaggacgccgcccccgccaca ctacacctgataccctgcggcccaccacgtctacggggcgacgaggtggcagtgccgtcc ggcctcgtgggatacgtgatggtgactgaacaggaggaggtgtcgatggggaagccagac ccctggcggggttccgggagtgacaaccaagaagaggaacctctggagcgggactttgac cgcttcctcggaaccactaccagcttcagccgcttcaccctgtag >gi568815574f:2591361_2838256|GENSCAN_predicted_peptide_9|368_aa MDKFLDTYTLPRLNQEEVESLNRPITGSEIEALINSLPTKTSPELDGFTAEFYQWYKEEL IPLLLKLFQSAEKEGILPNSFYKASIILIPKPGRVTIKKENFRPISLVNIDAGILKKILA NQIQQHIQKLIHHDQVIYRFNAIPMKLPMAFFTELEKTTLNSTWNQKRACIAKTILSKKN KAGGIMLPDFKLYYKATITKTAWYWYQNREIDQWNRTEASEIIPHIYNHLIFEKPEKNKK WGKDSLFNKWCWETWLAICRKLKLPPFLTPYTKINSRWIKDLNVRPKTIKTVEENKGNTT QDIGMGKDFMSKTRKAMATKAKIDKWDLIKLKSFCTAKETTIGVNKLATEWEKIFGIYPS DKGLISRI >gi568815574f:2591361_2838256|GENSCAN_predicted_CDS_9|1107_bp atggacaaattcctggacacctacactctcccaagactaaaccaggaagaagttgaatcc ctgaatagaccaataacaggctctgaaattgaggcattaattaatagcctaccaaccaaa acaagtccagaactagatggattcacagctgaattctaccagtggtacaaagaggagctg ataccattacttctgaaactattccaatcagcagaaaaagagggaatcctccctaattca ttttacaaggccagcatcatcctgataccaaagcctggcagagtcacaataaaaaaagag aattttagaccaatatccctggtgaacattgatgcaggaatcctcaagaaaatactggca aaccaaatccagcagcacatccaaaagcttatccatcatgatcaggtaatttatagattc aatgccatccccatgaagctaccaatggctttcttcacagaattggaaaaaactacttta aactccacatggaaccaaaaaagagcctgcattgccaagacaatcctaagcaagaagaac aaagctggaggcatcatgctgcctgacttcaaactatactacaaggctacaataaccaaa acagcatggtactggtaccaaaacagagagatagaccaatggaacagaacagaggcctca gaaataataccacacatctacaaccatctgatctttgagaaacctgagaaaaacaagaaa tggggaaaggattccctatttaataaatggtgctgggaaacctggctagccatatgtaga aagctgaaactgcctcccttccttacaccttatacaaaaattaattcaagatggattaaa gacttaaatgttagacctaaaaccataaaaactgtagaagaaaacaaaggcaataccact caagacataggcatgggcaaagacttcatgagtaaaacacgaaaagcaatggcaacaaaa gccaaaatagacaaatgggatctaattaaactaaagagcttctgcacagcaaaagaaact accatcggagtgaacaagctagctacagaatgggagaaaatttttggaatctacccatct gacaaagggctaatatccagaatctga >gi568815574f:2591361_2838256|GENSCAN_predicted_peptide_10|92_aa MEDCSSLPVMEQSCTENDFDELTEVGFRRLVITNFSELKEDIRTHRKEAKNLEKRLDKWL TRTNSVENSLNDLKELKTMARKICDTCTSFSS >gi568815574f:2591361_2838256|GENSCAN_predicted_CDS_10|279_bp atggaggattgcagctccttaccagtgatggaacaaagctgcacagagaatgactttgat gagttgacagaagtaggcttcagaaggttagtaataacaaacttctctgagctaaaggag gatattcgaacccatcgcaaggaagctaaaaaccttgaaaaaagattagacaaatggcta actagaacaaacagtgtagagaatagcttaaatgacctgaaggagctaaaaaccatggca cgaaaaatatgtgacacatgcacaagcttcagtagctga >gi568815574f:2591361_2838256|GENSCAN_predicted_peptide_11|98_aa MGRNQRKKAENSEKQNASSPPKEHNSSSSREEKWVENEFDELTEVGFRRTALQELLKEAL NMERNNQYQPLQKHANCKDHQHYEETASTNGQNNQLAS >gi568815574f:2591361_2838256|GENSCAN_predicted_CDS_11|297_bp atggggagaaaccagcgcaaaaaggctgaaaattccgaaaagcagaatgcctcttctcct ccaaaggaacacaactcctcatcctcaagggaagaaaaatgggtggagaatgagtttgat gaattgacagaagtaggtttcagaagaactgccttacaagagctcctaaaggaagcacta aacatggaaaggaacaaccagtaccagcccctgcaaaaacatgccaattgtaaagaccat caacactatgaagaaactgcatcaactaatggacaaaataaccagctagcatcataa