GENSCAN 1.0 Date run: 3-Nov-116 Time: 07:27:16

Sequence gi568815575f:134277618_134525327 : 247710 bp : 39.25% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   7921   8041  121  0  1   68   86    56 0.169   3.80
 1.02 Intr +  11057  11157  101  0  2   59   58   112 0.169   4.21
 1.03 Term +  17407  17565  159  0  0  128   38    70 0.324   2.96
 1.04 PlyA +  18380  18385    6                               1.05

 2.00 Prom +  21122  21161   40                              -5.55
 2.01 Init +  25539  25699  161  2  2   59   31   156 0.080   6.34
 2.02 Intr +  32164  32354  191  1  2    2   86   128 0.090   2.31
 2.03 Term +  38265  38383  119  1  2   45   32    90 0.154  -3.18
 2.04 PlyA +  39422  39427    6                               1.05

 3.00 Prom +  48788  48827   40                              -3.25
 3.01 Sngl +  50840  52006 1167  1  0   46   43   310 0.660  18.14
 3.02 PlyA +  52876  52881    6                               1.05

 4.00 Prom +  76094  76133   40                              -3.35
 4.01 Init +  87072  87171  100  2  1   55   34   106 0.623   2.37
 4.02 Intr +  90323  90476  154  0  1  124   38    91 0.019   5.91
 4.03 Intr +  95436  95692  257  0  2   32   77   160 0.009   5.46
 4.04 Term +  95831  95949  119  0  2  103   43    60 0.142   0.72
 4.05 PlyA +  96515  96520    6                               1.05

 5.00 Prom +  97792  97831   40                              -6.85
 5.01 Init + 100001 100138  138  1  0   46  115   101 0.982   8.79
 5.02 Intr + 100388 100489  102  1  0   64  119    47 0.952   4.85
 5.03 Intr + 115884 116017  134  2  2   82  -28    69 0.031  -6.78
 5.04 Intr + 116292 116335   44  0  2  103   83    40 0.092   1.97
 5.05 Intr + 126236 126529  294  0  0   39   86   192 0.680  10.16
 5.06 Intr + 135874 136040  167  2  2   27   92   151 0.554   8.16
 5.07 Intr + 136206 136349  144  2  0   89  103    87 0.999   9.86
 5.08 Intr + 137399 137503  105  1  0  108   26    94 0.962   4.69
 5.09 Intr + 139552 139685  134  0  2   88   71    68 0.900   3.62
 5.10 Term + 147584 147713  130  2  1   59   43   223 0.970  11.57
 5.11 PlyA + 149319 149324    6                               1.05

 6.00 Prom + 165105 165144   40                              -4.05
 6.01 Init + 171390 171392    3  2  0  113   81     0 0.336   1.85
 6.02 Intr + 173344 173509  166  0  1  -16   80   124 0.384  -0.19
 6.03 Intr + 174954 175095  142  1  1  125  -16    70 0.261  -1.21
 6.04 Intr + 176168 176282  115  2  1   54  111   136 0.826  12.03
 6.05 Intr + 182143 182222   80  0  2   53   72    21 0.008  -5.57
 6.06 Intr + 182550 182721  172  0  1   39   77   136 0.005   6.62
 6.07 Intr + 195742 195848  107  0  2   85  111    97 0.998   9.79
 6.08 Intr + 197564 197747  184  2  1   71   56   144 0.978   8.37
 6.09 Intr + 208848 208913   66  2  0   61   95    59 0.547   2.08
 6.10 Intr + 212571 212588   18  2  0  130  100     6 0.538   1.89
 6.11 Intr + 215891 215973   83  1  2   59   95    62 0.658   1.52
 6.12 Intr + 220773 220819   47  0  2   62   93    72 0.801   2.23
 6.13 Intr + 220991 221067   77  0  2   62  101    68 0.910   3.82
 6.14 Term + 227606 227668   63  1  0   61   41    75 0.247  -2.99
 6.15 PlyA + 231249 231254    6                               1.05

 7.03 PlyA - 232147 232142    6                               1.05
 7.02 Term - 234526 234420  107  2  2   96   42    71 0.844   0.89
 7.01 Init - 238291 238180  112  1  1   44   84    94 0.477   5.02

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  90323  90480  158  0  2  124   36   119 0.962   7.41
S.002 Term - 182367 182264  104  1  2   56   41   165 0.874   6.06

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815575f:134277618_134525327|GENSCAN_predicted_peptide_1|126_aa
MTLNEHAAFKHLFNKAHLAPPLIHLTLSGHSTCFREHGVEVPTVEAACGTPDPTAAGSQH
LCQCLELPAPPQQLKCVETSINSNLNSPHTWVTVSVPGILRNYWTVFVADSNSYLSAVFL
LSKFLN

>gi568815575f:134277618_134525327|GENSCAN_predicted_CDS_1|381_bp
atgactcttaacgagcatgctgccttcaagcatctgtttaacaaagcacatcttgcaccg
cccttaatccatttaaccctgagtggacacagcacatgtttcagagagcacggggttgag
gtgcccacagtggaagctgcttgtggtacacctgatccaactgcagcagggagccagcac
ctgtgccagtgcctggagctgcctgccccaccacagcagctgaaatgtgttgaaacatct
ataaactccaacctgaacagcccacacacttgggtaacagtgtctgttcctggcatactc
agaaattactggacagtctttgtagcagattcaaattcctatctttctgcagtcttcctg
ctgtctaaatttcttaactga

>gi568815575f:134277618_134525327|GENSCAN_predicted_peptide_2|156_aa
MTDTAEIQRIISGYYKQLYASKLENLEEMDKFLDTYNIPKLNEEEIQNLNNNNNLRHNCF
KENKIPSNPTYKGCEGPLQGELQTTAQQNKRGHKQMEEHSMLMDRKNQYGENGHTAQEMW
SSIRAQLECPRAARQATTDVPVNRSHWSRPTQKSEQ

>gi568815575f:134277618_134525327|GENSCAN_predicted_CDS_2|471_bp
atgactgatactgcagaaattcagaggatcattagtggctactataagcaactatatgcc
agtaaattggaaaatctagaggaaatggacaaattcctagacacatacaacataccaaaa
ctgaatgaggaagaaatccaaaacctgaacaataataacaacctccggcacaattgcttt
aaagagaataaaatacctagtaatccaacttacaaaggatgtgaaggacctcttcaagga
gaactacaaaccactgctcaacaaaataaaagaggacacaaacaaatggaagaacattcc
atgctcatggataggaagaatcagtatggtgaaaatggccatactgcccaagagatgtgg
tccagcatcagagcacagctagagtgccccagagcagcaaggcaagcaacaactgatgtg
cctgtgaaccgtagtcactggtcccgcccaacccagaaatctgaacagtaa

>gi568815575f:134277618_134525327|GENSCAN_predicted_peptide_3|388_aa
MHSSSAKAGSIPLENRHKTRMLSLTTPIQCSIGCPGQGNQAREINKRHPNRKRGSQTIPL
CRRHDPRINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENY
KPLLSEIKENTKKRKNIPCSWVGRINIVKMAILPKVIYRFNAIPIKLPRTFFTELEKTTL
KFIWNQKRARIAKSILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQNRNIDQCNRTEPS
EITPHIYNYLIFDKPEKNKQWGKDSVFNKWCWENWLVICRNLKLDPFLTPYTKINSRLIK
DLNVRPKTIKTLEENLGITIQDIGMGKDFMSKTPKAMATKAKIDKWDLIKLKSFCTAKET
TIRVNRQPTKWEKSFATYSSDKGLISRI

>gi568815575f:134277618_134525327|GENSCAN_predicted_CDS_3|1167_bp
atgcattcatcctcggcaaaagctggaagtattccccttgaaaacaggcacaagacaagg
atgctctctctcaccactcctattcaatgtagtattggatgtcctggccagggcaatcag
gcaagagaaataaataaacggcatccaaataggaagagaggaagtcaaactatccctctt
tgcagacgacatgatcctagaatcaatgtacaaaaatcacaagcattcttatataccaat
aacagacaaacagagagccaaatcatgagtgaactcccattcacaattgcttcaaagaga
ataaaatacctaggaatccaacttacaagggatgtgaaggacctcttcaaggagaactac
aaaccactgctcagtgaaataaaagagaatacaaagaaacggaagaacattccatgctca
tgggtaggaagaatcaatatcgtgaaaatggccatactgcccaaggtaatttatagattc
aatgccatccccatcaagctaccaaggactttcttcacagaattggaaaaaactacttta
aagttcatatggaaccaaaaaagagcccgcattgccaagtcaatcctaagccaaaagaac
aaagctggaggcatcacgctacctgacttcaaactatactacaaggctacagtaaccaaa
acagcatggtactggtaccaaaacagaaatatagatcaatgcaacagaacagagccctca
gaaataacaccgcatatctacaactatctgatctttgacaaacctgagaaaaacaagcaa
tggggaaaggattccgtatttaataaatggtgctgggaaaactggctagtcatatgtaga
aacctgaaactggatcccttccttacaccttatacaaaaattaattcaagattaattaaa
gacttaaacgttagacctaaaaccataaaaaccctagaagaaaacctaggcattaccatt
caggacataggcatgggcaaggacttcatgtctaaaacaccaaaagcaatggcaacaaaa
gccaaaattgacaaatgggatctaattaaactaaagagcttctgcacagcaaaagaaact
accatcagagtgaacaggcaacctacaaaatgggagaaaagtttcgcaacctactcatct
gacaaaggactaatatccagaatctag

>gi568815575f:134277618_134525327|GENSCAN_predicted_peptide_4|209_aa
MINDEVEMNVGSEVEGSVKNNSLVFGLQNWVDRGPHFSRYLRRAYYEPGTEFYGLRIEQW
TRQNKAPAPIELHSSRETDNKHTNNDIFESLSLSPLILPPNTRRSGGAENRRSAREQEAG
PAAFSRTGGGFCTCAVAVRLQLPPLCGQSGPAARSGRINALWVVCVRGGRLRLWALPVSP
HGTLGPPILSPAGEDKEDGGGQAAAPARC

>gi568815575f:134277618_134525327|GENSCAN_predicted_CDS_4|630_bp
atgattaatgatgaagtggagatgaatgtggggagtgaagtagagggcagtgtcaagaat
aactccctggtctttggcctgcagaactgggtggatagagggcctcacttcagtagatat
ttacgaagagcctactatgaaccaggcacagagttctatgggctgaggatagagcagtgg
acacgacaaaacaaagctcctgctccaatagagcttcattctagtcgagagacagataat
aaacacacgaacaacgatatctttgaatctctttctctatctcctctgatcctgccccca
aacactcggcggagcggcggagcggagaatagaaggtccgcgcgcgaacaggaggcgggg
cctgccgccttcagtcgtacagggggtggtttctgcacatgcgctgtggctgtgcggctt
cagctgccccctctctgcggccaatcagggcccgcggcgcgctcgggacgtatcaacgct
ctgtgggtcgtgtgcgtgcgaggggggcgactccgcctctgggccctgccggtgagtccc
cacggaaccctggggccccccattctctccccggcgggagaagacaaggaggatggcggt
ggccaggccgccgcccctgcccgttgttaa

>gi568815575f:134277618_134525327|GENSCAN_predicted_peptide_5|463_aa
MSSSVEQKKGPTRQRKCGFCKSNRDKECGQLLISENQKVAAHHKCMLFSSALVSSHSDNE
SLGGFSIEDVQKEIKRGTKLMCSLCHCPGATIGCDVKTCHRTYHYHCALHDKAQIREKPS
QGIYMVYCRKHKKTAHNSEAWFTEYFKPTLDTYCSEKKIPFKILLLIDNASGHPRALMEI
YKEMNVVFMHASTAFIPQFLEPVDQGVLWTFKSYYLRNTFHEAVAAIHSDFCDESGQADL
EESFNEHELEPSSPKSKKKSRKGRPRKTNFKGLSEDTRSTSSHGTDEMESSSYRDRSPHR
SSPSDTRPKCGFCHVGEEENEARGKLHIFNAKKAAAHYKCMLFSSGTVQLTTTSRAEFGD
FDIKTVLQEIKRGKRMKCTLCSQPGATIGCEIKACVKTYHYHCGVQDKAKYIENMSRGIY
KLYCKNHSGNDERDEEDEERESKSRGKVEIDQQQLTQQQLNGN

>gi568815575f:134277618_134525327|GENSCAN_predicted_CDS_5|1392_bp
atgtcaagctcagttgaacagaaaaaagggcctacaagacagcgcaaatgtggcttttgt
aagtcaaatagagacaaggaatgtggacagttactaatatctgaaaaccagaaggtggca
gcgcaccataagtgcatgctcttttcatctgctttggtatcatcacactctgataatgaa
agtcttggtggattttctattgaagatgtccaaaaggaaattaaaagaggcacgaagctg
atgtgttctttgtgccattgtcctggagcaacaattggttgtgatgtgaaaacatgtcac
aggacataccactaccactgtgcattgcatgataaagctcaaatacgagagaaaccttca
caaggaatttacatggtctattgccgaaaacacaagaaaactgcacataactccgaagca
tggtttactgaatattttaagcccactcttgacacctactgctcagagaaaaagattcct
ttcaaaatattactgcttattgacaatgcatctggtcacccaagagctctgatggagatc
tacaaggagatgaatgttgtcttcatgcatgctagtacagcattcattccgcagttctta
gagcccgtggatcaaggagtactttggactttcaagtcttattatttaagaaatacattt
catgaggctgtagctgccatacatagtgatttctgtgatgaatctgggcaagctgattta
gaagaaagttttaatgaacatgaactggagccctcatcacctaaaagtaaaaagaaaagt
cgcaaaggaaggccaagaaaaactaattttaaagggctgtcagaagataccaggtccaca
tcctcccatggaacagatgaaatggaaagtagttcctatagagataggtctccacacaga
agcagccctagtgacaccaggcctaaatgtggattttgccatgtaggggaggaagaaaat
gaagcacgaggaaaactgcatatatttaatgccaagaaggcagctgcccattataagtgc
atgttgttttcttctggcacagtccagctcacaacaacatcaagagcagaatttggagac
tttgatattaaaactgtacttcaggagattaaacgaggaaaaagaatgaaatgtacactt
tgcagtcagcctggtgctactattggatgtgaaataaaagcctgtgttaagacttaccat
taccactgtggagtacaagacaaagctaaatacattgaaaatatgtcacgaggaatttac
aaactatactgtaaaaatcatagtggaaatgatgagagagatgaagaagatgaggaacga
gagagtaaaagccgaggaaaagtagaaattgatcagcaacaactaactcagcagcaactt
aatggaaactag

>gi568815575f:134277618_134525327|GENSCAN_predicted_peptide_6|440_aa
MKGQLYKAKKVSTGFGKKAVTQSMMSFANSFVQWWVQKTDHKEFEEKMENEELDRARRYG
LRLFRTLLVGITKGADRGFIQDRGGVFRAKLRSEPMNLYQSPWLWEVRGVFTIVRLAEYV
VYQPAVPMPKPARYRKLPGHQLDSRSQLLQPVKGLGKHWAKSQENGSHSFRRLRRALRRT
SRLSRAAPPLAAPPPPPLLRHRLPPPEQSARAPAGSVMATRSPGVVISDDEPGYDLDLFC
IPNHYAEDLERVFIPHGLIMDRTERLARDVMKEMGGHHIVALCVLKGGYKFFADLLDYIK
ALNRNSDRSIPMTVDFIRLKSYCNDQSTGDIKVIGGDDLSTLTGKNVLIVEDIIDTGKTM
QTLLSLVRQYNPKMVKVASLLVKRTPRSVGYKPDFVGFEIPDKFVVGYALDYNEYFRDLN
MRKLNPREHKKLVQSDISDA

>gi568815575f:134277618_134525327|GENSCAN_predicted_CDS_6|1323_bp
atgaaaggacagttatataaggccaaaaaagtgtccactggatttggcaagaaagcagtc
acgcagtcaatgatgtcttttgccaacagttttgtacaatggtgggtgcagaagacagat
cacaaagagtttgaggagaaaatggaaaatgaggaattggatagagcaaggaggtatggc
cttagactattcagaacgttactggtgggaataacaaagggagctgaccgaggatttata
caggaccgtggaggggtgtttagagccaagttgagatctgagcccatgaatctgtaccag
tctccatggttgtgggaagtgagaggagtattcaccattgtgcgccttgctgaatatgtg
gtgtaccagccagctgtccccatgccaaagcctgcaagatacaggaagctgccaggccac
cagttggactctaggtctcaactgttacaaccagttaagggtttggggaagcactgggcc
aagagtcaggaaaatggaagccacagcttcaggcggctgcgacgagccctcaggcgaacc
tctcggctttcccgcgcggcgccgcctcttgctgcgcctccgcctcctcctctgctccgc
caccggcttcctcctcctgagcagtcagcccgcgcgccggccggctccgttatggcgacc
cgcagccctggcgtcgtgattagtgatgatgaaccaggttatgaccttgatttattttgc
atacctaatcattatgctgaggatttggaaagggtgtttattcctcatggactaattatg
gacaggactgaacgtcttgctcgagatgtgatgaaggagatgggaggccatcacattgta
gccctctgtgtgctcaaggggggctataaattctttgctgacctgctggattacatcaaa
gcactgaatagaaatagtgatagatccattcctatgactgtagattttatcagactgaag
agctattgtaatgaccagtcaacaggggacataaaagtaattggtggagatgatctctca
actttaactggaaagaatgtcttgattgtggaagatataattgacactggcaaaacaatg
cagactttgctttccttggtcaggcagtataatccaaagatggtcaaggtcgcaagcttg
ctggtgaaaaggaccccacgaagtgttggatataagccagactttgttggatttgaaatt
ccagacaagtttgttgtaggatatgcccttgactataatgaatacttcagggatttgaat
atgagaaagctgaatcccagagagcataagaagcttgtccagagtgacatctctgatgca
taa

>gi568815575f:134277618_134525327|GENSCAN_predicted_peptide_7|72_aa
MSLNLGLSDASPRTDSGYAFLEDYHRNYACSQSSIAGGHLIGGLLATVSKTAALATLCAL
SSILKPDNISLT

>gi568815575f:134277618_134525327|GENSCAN_predicted_CDS_7|219_bp
atgtccctcaatttgggtttatctgatgcctccccacgaacagactcaggttatgcattt
ttggaggactaccacagaaattatgcttgttctcagagcagcatagcaggaggtcacctc
atcggaggcctcctggccactgtatctaaaacagcagcccttgccactctctgcgccctt
tcctcaatcctcaaacctgacaatatatcgttaacctga