GENSCAN 1.0 Date run: 6-Nov-116 Time: 05:19:57 Sequence gi568815585f:75449821_75705809 : 255989 bp : 39.15% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 351 346 6 1.05 1.03 Term - 2952 2785 168 0 0 68 38 87 0.028 -1.40 1.02 Intr - 14990 14940 51 2 0 83 78 37 0.183 0.39 1.01 Init - 31773 31087 687 0 0 54 25 406 0.138 24.02 1.00 Prom - 51791 51752 40 -5.55 2.04 PlyA - 52049 52044 6 1.05 2.03 Term - 57777 57272 506 1 2 2 45 192 0.300 -0.08 2.02 Intr - 58738 58661 78 2 0 72 46 136 0.208 6.40 2.01 Init - 62752 62680 73 1 1 62 65 83 0.565 4.68 2.00 Prom - 65647 65608 40 -5.95 3.00 Prom + 66887 66926 40 -8.15 3.01 Init + 70632 70766 135 2 0 73 86 117 0.853 10.19 3.02 Intr + 75435 75538 104 2 2 66 100 23 0.004 -0.65 3.03 Intr + 85575 85685 111 0 0 3 53 157 0.023 2.28 3.04 Term + 87932 88091 160 2 1 102 43 210 0.676 14.33 3.05 PlyA + 88868 88873 6 1.05 4.03 PlyA - 90670 90665 6 1.05 4.02 Term - 91857 91707 151 2 1 98 54 84 0.201 2.40 4.01 Init - 100002 99503 500 0 2 80 5 384 0.195 21.88 4.00 Prom - 110815 110776 40 -5.35 5.00 Prom + 113080 113119 40 -3.25 5.01 Init + 113664 113666 3 2 0 81 64 0 0.522 -3.05 5.02 Intr + 116875 117031 157 0 1 61 27 128 0.904 2.66 5.03 Term + 117407 117516 110 0 2 79 39 136 0.906 5.49 5.04 PlyA + 118042 118047 6 1.05 6.00 Prom + 128935 128974 40 -4.35 6.01 Init + 135001 135240 240 0 0 44 27 148 0.308 2.22 6.02 Intr + 140142 140309 168 2 0 71 81 37 0.245 0.52 6.03 Intr + 145095 145207 113 2 2 103 54 24 0.326 -1.24 6.04 Intr + 150293 150471 179 2 2 40 34 171 0.136 5.54 6.05 Intr + 154949 155007 59 0 2 101 77 30 0.296 0.88 6.06 Term + 155909 155992 84 1 0 122 33 161 0.993 10.67 6.07 PlyA + 157286 157291 6 1.05 7.04 PlyA - 159117 159112 6 1.05 7.03 Term - 160901 160887 15 2 0 124 47 4 0.337 -2.84 7.02 Intr - 167151 166894 258 0 0 42 80 184 0.253 9.74 7.01 Init - 186263 186009 255 2 0 59 99 176 0.608 11.08 7.00 Prom - 190458 190419 40 -5.25 8.02 PlyA - 190838 190833 6 1.05 8.01 Sngl - 210867 210559 309 0 0 91 37 218 0.999 12.65 8.00 Prom - 213195 213156 40 -3.75 9.03 PlyA - 215055 215050 6 1.05 9.02 Term - 221372 221220 153 2 0 -15 41 181 0.212 0.04 9.01 Init - 231418 231245 174 1 0 81 20 155 0.396 7.39 9.00 Prom - 232043 232004 40 -3.45 10.00 Prom + 240871 240910 40 -5.85 10.01 Init + 247275 247355 81 2 0 90 29 74 0.161 2.72 10.02 Term + 254726 254953 228 1 0 81 43 222 0.963 12.65 10.03 PlyA + 255013 255018 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 75857 75699 159 2 0 101 39 108 0.913 4.16 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815585f:75449821_75705809|GENSCAN_predicted_peptide_1|301_aa MAEIRRRSQKPEAGGCGAPAAREVILVLSAPFLRCVPAPGAGASGGTSPSATQPNPAVFI FEHKAQHISRFIHNSHDLTYFAYLIKAQPDDPESQMACHVFRATDPSQARQGLGGAGAGS ALGDRGAGREGRGPHFKDRLRFGCRRARSTWRAQAPAGPFPGPWEWERRRSGAGRAAGAA GRLQQGPGGKRTTVPGWRGSWRVQGARVEERPSREDGGVWGVTGDGLLAPVGYRKDLPRN PGPSQFIITSLLRITHISGIEAFYDLIPYSSTFTSFFVENHLRQFVKTSVEWLCNGMWVN V >gi568815585f:75449821_75705809|GENSCAN_predicted_CDS_1|906_bp atggccgagatccgcaggcgcagccagaagcccgaggcgggcggctgcggggcgccggcg gcccgagaggtgatcctggtgctcagcgcgcccttcctgcgttgcgtccccgcgccgggc gctggggcctcggggggcactagtccgtcggccacgcagcccaacccggcggtattcatc ttcgagcacaaggcgcagcatatctcgcgcttcatccacaacagccacgacctcacctac tttgcctacctgatcaaggcgcagcccgacgaccccgagtcgcagatggcctgccacgtt ttccgcgccacagaccccagccaggcaagacaggggctcggaggcgcgggggcgggctcg gccttgggggatcggggagcagggcgggaggggcggggaccccactttaaggaccgactg aggtttggctgcaggcgcgcgcgctccacgtggcgcgctcaagcgccggctggccccttc cctggaccttgggagtgggagagaaggcgttcgggggctgggcgagcggcgggagcggcc ggccggctacagcagggacccggagggaaaaggacaacggtcccgggatggagggggagc tggcgcgtccaaggagcacgcgtggaggagcgcccgtcccgggaggacggcggtgtctgg ggtgtgaccggagatggcttgcttgccccagtaggctacaggaaagatttgcccaggaat ccaggcccaagccagtttataattacttcactcctccgtatcactcatatatcagggatt gaagccttttatgatttaataccgtatagttcaactttcaccagtttctttgttgaaaat cacttacggcaatttgtaaaaacttctgtggaatggttatgcaatggaatgtgggttaat gtttag >gi568815585f:75449821_75705809|GENSCAN_predicted_peptide_2|218_aa MQARCSRLHQKNVSVQQAQVHLTAVFASVPMMAAAAVWSGRCEDAGCSVGAPTACHIVGD EKETRREELWPFRDPRPRGSPSQSCDTLFGSLQSLLSPSFWMPPHPPCPDAGAHSRNCVQ CIWPNCSLAWSQHLCQHLELPAPTQQLACLAVCSGWTLHLLTHTPLAAPRLAHPGQVWDP SQKCEPIIACWAEWVERAQRVQVILRQEALPAKEVSSW >gi568815585f:75449821_75705809|GENSCAN_predicted_CDS_2|657_bp atgcaagccagatgcagccgtcttcaccagaaaaatgtatctgtccagcaggcccaagta catctgacagctgtttttgccagtgtcccaatgatggcagcagcagctgtctggagtggc cgctgcgaagatgctggctgcagcgtgggagcccccaccgcttgccacattgtgggtgat gagaaggagacaaggagagaagagctgtggcccttccgggatcccagacctaggggctca ccaagccagagctgtgacaccctctttgggtctctgcagtccctgctgtctccaagcttc tggatgccaccacatcccccttgtccagatgcgggtgcccacagcagaaactgtgtgcag tgtatctggcccaactgcagccttgcttggagccagcacctgtgccagcatctggagctg cctgccccaacacagcagctggcatgcctggctgtgtgcagtggctggaccctgcatttg ctcacccacacaccccttgctgctccacgcctggctcaccctgggcaggtgtgggatcca agccagaagtgtgagccgatcatagcctgctgggctgagtgggtggaacgggcccagcgg gtgcaagtaatactcaggcaagaggcactgccagccaaagaagtttccagctggtga >gi568815585f:75449821_75705809|GENSCAN_predicted_peptide_3|169_aa MRENLELPRDLLKCCNQNGDSNMDNEVQVEEVLDEMRNLLETGAKCLSQGLTLGLFAYCL KPRNATLQTGKGKEEGVNPKPPGEAQSLRKLEDPVTRLLRYSRKENGGLDPGGSKRRLEL TSDLASSGGSLDASMGSVWDLRPGLERTPPDQRFRFHVLPLAAWRMGRT >gi568815585f:75449821_75705809|GENSCAN_predicted_CDS_3|510_bp atgagggaaaatttggaacttcctagagatttgttaaaatgttgtaaccaaaatggtgat agtaatatggacaatgaagtccaggttgaggaggtcttggatgagatgaggaacttactg gaaactggagcaaagtgtttgagccagggcctaactttgggtctttttgcatattgcctt aaacctagaaatgctaccctgcaaacagggaaggggaaagaggaaggtgttaatccgaaa ccgccaggtgaggcacaaagtttaaggaagttggaagaccctgttacaaggctactgcgg tattccaggaaagagaatggcggcttggacccaggtggcagcaagaggaggttggaactc acatcggacttagcatccagcggcggctcgctggacgcctccatgggcagcgtctgggac ttgcggcccggactcgagagaacgcccccagaccagcgcttccgcttccacgtcctgccc ctagcggcctggcgcatggggcggacgtag >gi568815585f:75449821_75705809|GENSCAN_predicted_peptide_4|216_aa MAAVPGPPALTAAAFAAAAASAHARLDLGPPNTPPGGGAPKHQKEFDGPLTLLTAPCRAV GQVSYFKLRCCRGEKNKPISYFGRKTVSESTALSPHAWTLPSRALPSARVLRMRLAPVWD RVVTRTFPFLQCQPTALGGVPRSPILRQTPGGYPLPGSKAGGRRAARAWLFTPFTSSDST NHTTLRLMNTAGNMGKNSNTSLQTRIGPSRSTRVLL >gi568815585f:75449821_75705809|GENSCAN_predicted_CDS_4|651_bp atggccgcggtgcccggccctccagctctgacagccgccgccttcgccgccgccgccgct tccgcccacgcgcgccttgacctgggccctcccaacacaccgcccggaggaggagcgcct aaacaccaaaaagagtttgacgggcctctcaccctcttaaccgctccgtgcagggccgta ggtcaagtgtcatattttaaattaagatgctgccgaggagaaaaaaataagccaatttcc tatttcgggagaaaaacagtatctgaatccacagccctctcccctcacgcttggactctt ccgagccgagccttgccgagcgctcgcgttctacgcatgcgccttgcgccggtctgggac cgcgtggtcacccgcaccttcccattcttgcagtgtcaaccgactgcgctaggaggcgtg ccgcggagtcctattctgagacagacgccaggtggctacccattgccaggcagcaaggcc ggaggacgccgtgcagcacgagcatggcttttcaccccttttacttccagtgactctacc aatcacacaactctacgccttatgaacacagcaggcaacatgggcaaaaatagtaacaca tcactccagacccgtattggtccaagcagatcaactagagtccttctctga >gi568815585f:75449821_75705809|GENSCAN_predicted_peptide_5|89_aa MYEVFRTEEEEKIKSQGQDVTSSVYFMKQTISNACGTIGLIHAIANNKDKMHFESGSTLK KFLEESVSMSPEERARYLENYDVGTFFPF >gi568815585f:75449821_75705809|GENSCAN_predicted_CDS_5|270_bp atgtatgaagtattcagaacagaagaggaagaaaaaataaaatctcagggacaagatgtt acatcatcagtatatttcatgaagcaaacaatcagcaatgcctgtggaacaattggactg attcatgctattgcaaacaataaagacaagatgcactttgaatctggatcaaccttgaaa aaattcctggaggaatctgtgtcaatgagccctgaagaacgagccagatacctggagaac tatgatgtcggtaccttctttccgttttga >gi568815585f:75449821_75705809|GENSCAN_predicted_peptide_6|280_aa MPLTDSPVDSTKLRKESVNVKINHWKLPKLTHKGEKNSGKEGEKPHKTNSASNSCGTISS DVICENGIPEGEERWKDMMGPHSSHCRTKASSLCHHASLPWVKRNHVGPAKATSPSLRLR RWRPFLRLPSLGLHSVAPSIDEKVDLHFIALVHVDGHLYELGKNYFNLSWGEKCSVKAER GEEATEERSEASSWFMRFKERSHLHNIKVHGEAAYADGEAASYPEDQAQIFREDGRKPFP INHGETSDETLLEDAIEVCKKFMERDPDELRFNAIALSAA >gi568815585f:75449821_75705809|GENSCAN_predicted_CDS_6|843_bp atgccattaacagattcaccagtagactcaacaaagttgagaaaagaatcggtgaatgtg aagataaatcactggaaattacccaaacttactcacaaaggggaaaaaaatagtggaaaa gagggggaaaaaccacacaaaacaaacagtgcatccaacagctgtgggacaatttcaagt gatgtaatatgtgaaaatggaatcccagaaggagaagagagatggaaagatatgatgggg cctcattctagccactgcagaacgaaggcctcatcattgtgtcaccatgcatccctgccc tgggtcaaacgtaaccatgtaggccctgcaaaggctaccagtccatctctccgtcttcgt cgctggcgacccttcttacgtctaccatctttgggtctccattctgttgcaccaagtata gatgagaaagtagatcttcattttattgcattagttcatgtagatgggcatctctatgaa ttaggtaagaactattttaatttgtcctggggagagaaatgttctgttaaggctgagaga ggtgaggaagctacagaagaaaggtctgaagctagcagttggttcatgaggtttaaggaa agaagtcatctccataacataaaagtgcatggtgaagcagcatatgctgatggagaagct gcaagttatccagaagatcaagctcagatcttccgtgaagatgggcggaagccatttcca attaaccatggtgaaactagtgatgaaactttattagaggatgccatagaagtttgcaag aagtttatggagcgcgaccctgatgaactaagatttaatgcgattgctctttctgcagca tag >gi568815585f:75449821_75705809|GENSCAN_predicted_peptide_7|175_aa MRPRARILPAPARLAPALSRAPESGAPPGSRPPGPLDIPEIAAHNCTFPGQETSPAGERG PPPSPEPRPAEPATRSPAAIRPSRQTPMNIHKHQENPGKHDFTKPNEAPGTKPGRTEICD LSDREFKIAVLRKLKEIQDNTQKEFRILSDKFNKEIEIIKKNQAEILEWGQVNSQ >gi568815585f:75449821_75705809|GENSCAN_predicted_CDS_7|528_bp atgcgcccccgcgcccgcatccttcccgcacccgcccgcctggcccctgccctctcccgg gcccctgagagcggcgcccctcccggctcgcggccgcccggcccgcttgacataccagaa atagctgcacacaattgtacctttcccggccaagaaacctcccccgctggggagcgcggc ccgccgccctcacctgagccccggccagctgagcccgctacccgctctcccgccgcgatc cggcccagccgccagacaccaatgaacatccacaagcatcaagagaatccaggaaaacat gacttcactaaaccaaatgaggcaccagggaccaagcctggaaggactgagatatgtgac ctttcagatagagaattcaaaatagccgttttaaggaaactcaaagaaattcaagataac acacagaaggaattcagaatcctatcagataaatttaacaaggaaattgaaataattaaa aagaatcaagcagaaattctggagtggggccaggtgaactctcagtaa >gi568815585f:75449821_75705809|GENSCAN_predicted_peptide_8|102_aa MKPQASGKQKNLKESRNNKDPVKREKQDAVSSPGIVTLYTEEEALHPISQPIKKLENASV KTKKQEGGFKEKGINDGMEMPFPANISKKGKLIQNWFFEMIT >gi568815585f:75449821_75705809|GENSCAN_predicted_CDS_8|309_bp atgaagccccaagcctctggtaagcagaagaatctgaaagaaagcagaaataacaaagac cccgtgaagagagaaaagcaagatgcagtgagctctcctggtatagtaacactgtacact gaagaagaagcactgcaccccatcagccagccaatcaagaagttggaaaatgcaagtgtg aaaaccaagaaacaagaaggaggatttaaagagaagggcataaatgatggaatggagatg ccattccctgcaaacatatcaaaaaaaggcaaattaatccaaaattggttctttgaaatg atcacttag >gi568815585f:75449821_75705809|GENSCAN_predicted_peptide_9|108_aa MGKDFMAKTSKAIATKVRIGKWDLIKLKRFCTAKEAIIRVNRQPTEWEKSSAIIHLTKCQ ASPSGGIPEEGIGVIGDDSFMCAIAPEGLPVGQDVEVEDSDIDDPNPG >gi568815585f:75449821_75705809|GENSCAN_predicted_CDS_9|327_bp atgggcaaagatttcatggcaaaaacatcgaaagcaattgcaacaaaagtaagaattggc aaatgggatctaattaaactaaagcgcttctgcacagcaaaagaagctatcatcagagtg aacagacaacctacagaatgggagaaaagttctgcaattatccatctgacaaagtgtcag gccagtccctcaggaggtattccagaagaaggcattggtgtcataggagacgacagcttc atgtgtgctattgcccctgagggccttccagtgggacaagacgtggaggtagaagacagt gatatcgatgatcctaaccctgggtag >gi568815585f:75449821_75705809|GENSCAN_predicted_peptide_10|102_aa MNANTHSDWHEEKVEAERASHTLEGGRGFAHKYGGECFWDSRHLLNGCLKEIEKVKREKC LLTRISGHFADNQSGMKWSSRSSQNVWDLSHVDYAIRTGTER >gi568815585f:75449821_75705809|GENSCAN_predicted_CDS_10|309_bp atgaatgccaatacacattcagattggcatgaggagaaggtggaagcagagagagctagc cacactctggaaggaggtcgggggtttgcacacaaatatggtggagagtgcttttgggat tctaggcaccttttaaatggctgcttaaaagaaattgaaaaagttaaacgagagaagtgc ctgctgactcgcattagtgggcattttgctgataaccagtctggtatgaagtggtcttca agaagctcccagaatgtttgggatttaagtcatgtggattatgcgatcagaactggaact gagcgatag