GENSCAN 1.0 Date run: 6-Nov-116 Time: 21:40:36

Sequence gi568815593r:10581059_10861068 : 280010 bp : 45.25% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.03 PlyA -     51     46    6                               1.05
 1.02 Term -   4467   4355  113  1  2   92   49    50 0.625   0.22
 1.01 Init -   8288   8180  109  2  1   65   75   110 0.826   7.78
 1.00 Prom -  22076  22037   40                              -2.06

 2.00 Prom +  23648  23687   40                              -5.56
 2.01 Init +  29684  29755   72  1  0   52   90    42 0.287  -0.13
 2.02 Intr +  37275  37404  130  2  1  147  110   191 0.999  27.47
 2.03 Intr +  54368  54426   59  0  2   42   68    55 0.038  -2.50
 2.04 Intr +  56900  57110  211  1  1  142   91   145 0.668  18.69
 2.05 Intr +  57425  57471   47  0  2   87   86    21 0.579  -0.07
 2.06 Intr +  61555  61669  115  0  1   43   97    57 0.733   2.12
 2.07 Term +  68208  69055  848  1  2  106   55  1207 0.874 112.24
 2.08 PlyA +  69914  69919    6                               1.05

 3.03 PlyA -  70185  70180    6                               1.05
 3.02 Term -  83653  83430  224  2  2    6   41   398 0.821  23.98
 3.01 Init -  86492  86441   52  2  1   56   88    31 0.322   1.22
 3.00 Prom -  87935  87896   40                              -2.66

 4.11 PlyA -  88108  88103    6                               1.05
 4.10 Term -  89301  88810  492  0  0   32   37   822 0.153  66.11
 4.09 Intr -  95634  95410  225  0  0   48  100    91 0.468   4.48
 4.08 Intr -  98556  98421  136  2  1   48   71    46 0.434  -0.53
 4.07 Intr -  98809  98720   90  0  0   74   81    72 0.494   4.21
 4.06 Intr - 100111 100002  110  1  2  111   63    84 0.581   7.28
 4.05 Intr - 100495 100370  126  1  0   91   56    28 0.160   0.78
 4.04 Intr - 102513 102471   43  2  1  119  114    83 0.832  12.24
 4.03 Intr - 118353 118146  208  1  1   40   97    92 0.231   3.44
 4.02 Intr - 132118 131972  147  2  0   91    8    80 0.009   0.51
 4.01 Init - 134876 134747  130  2  1   72  101    34 0.208   3.44
 4.00 Prom - 145120 145081   40                              -2.76

 5.00 Prom + 160576 160615   40                              -0.96
 5.01 Init + 180009 180076   68  2  2   91   46   201 0.639  14.74
 5.02 Intr + 202387 202762  376  1  1  -18   41   254 0.008   5.12
 5.03 Intr + 212272 212399  128  0  2   -1   69   132 0.396   1.88
 5.04 Term + 219885 219993  109  0  1   41   46   122 0.381   1.28
 5.05 PlyA + 220611 220616    6                               1.05

 6.00 Prom + 222960 222999   40                              -6.16
 6.01 Init + 224189 224423  235  1  1   64   22   157 0.708   5.00
 6.02 Intr + 230840 230899   60  0  0  118   66    21 0.256   1.61
 6.03 Intr + 255075 255117   43  1  1  118  107     7 0.478   2.90
 6.04 Intr + 256864 257032  169  1  1   88   37   108 0.525   5.65
 6.05 Term + 257693 257746   54  1  0   93   41    85 0.656   1.86
 6.06 PlyA + 258211 258216    6                               1.05

 7.03 PlyA - 260265 260260    6                               1.05
 7.02 Term - 273610 273543   68  2  2  110   55    74 0.378   4.40
 7.01 Init - 278081 277940  142  2  1   77   89    30 0.310   2.30

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593r:10581059_10861068|GENSCAN_predicted_peptide_1|73_aa
MNQVYYSQVWKEEDSTPGKTKEKRGGAIQDLLLRPAEQAKPHAPQTPSNHMLKLPLSNGN
DCGDSKSDLRRLS

>gi568815593r:10581059_10861068|GENSCAN_predicted_CDS_1|222_bp
atgaatcaagtttactactcacaggtctggaaggaagaggacagcacccctggcaagacc
aaagagaagaggggaggagccatccaggacctgctccttcgaccagcagaacaagcaaag
ccacatgctccacagaccccatctaatcacatgcttaaactccctttgtcaaatggaaat
gactgtggagactcgaagtcagacctaagacgtctaagttaa

>gi568815593r:10581059_10861068|GENSCAN_predicted_peptide_2|493_aa
MLPIKYIAWAQWLTSVIPALWEAETGLIVACYHGFVDTVVALAECPHVDVNWQDSEGNTA
LITAAQAVKDEGEQCGVNESKKHQRLAEGLAGTSTFCGSGNQYTCTLLFTGHAIITNYLL
NYFPGLDLERRNAFGFTALMKAAMQGRTDCIRALMLAERGDPLAVNGKTVLLKEGGMRIG
HVHNRTALGCMLGLEKPSDMLVVIIVIIMIIGADVHARDPRRGMSPQEWATYTGRVDAVR
LMQRLLERPCPEQFWEKYRPELPPPPEAARKPAGSKNCLQRLTDCVLSVLTPRSVRGPED
GGVLDHMVRMTTSLYSPAVAIVCQTVCPESPPSVGKRRLAVQEILAARAARGPQAQEEDE
VGGAGQRGRTGQEDADSREGSPRAGLPPALGSRGPAAPAPRKASLLPLQRLRRRSVRPGV
VVPRVRVSKAPAPTFQPERPARKGSTKDSGHLQIPKWRYKEAKEEKRKAEEAEKKRQAEA
QKERRTAPWKKRT

>gi568815593r:10581059_10861068|GENSCAN_predicted_CDS_2|1482_bp
atgctccctattaaatatatcgcttgggcacagtggctcacgtctgtaatcccggcactt
tgggaggccgagaccggccttattgtcgcctgctaccacggctttgtggataccgtggtg
gccttagcagagtgcccccacgttgacgtcaactggcaggacagcgaggggaacacagcc
ctaatcacagctgcacaggcagtaaaggacgagggagagcaatgtggtgtaaatgagtca
aaaaagcaccagcgactggctgaaggcctggctggaaccagcacattttgtggaagtggg
aaccaatacacgtgtacacttctctttacagggcacgctatcatcactaactacttgttg
aactatttccctggtcttgaccttgaaaggaggaacgcgttcgggttcaccgccctgatg
aaagccgccatgcagggtcgaacggactgcatccgagccctgatgctagcagaaaggggt
gatcccttggctgtaaatggcaagactgtgctgttgaaggaaggtgggatgcggattggg
cacgttcacaacagaacagccctcgggtgcatgctgggcttggagaagccttcggacatg
ttggttgtgattattgttattattatgattattggggcggatgtccacgcgagggacccc
cgccgtgggatgtcgccgcaggagtgggccacttacacgggccgcgtggatgccgtccgt
ctcatgcagaggctgctggagcgcccctgcccggagcagttctgggagaagtaccggccc
gagctgccgccgccccctgaagcggcgcggaagcccgcgggctccaagaactgcctgcag
aggctcacagactgcgtgctgtccgtgctgacgccgcgctccgtgcggggcccggaggac
gggggcgtcctggaccacatggtccggatgaccacgagcctctacagccccgccgtggcc
atcgtgtgccagaccgtgtgccctgagagccctccgagcgtggggaagaggcggctggcg
gtgcaggagatcctggcggcgcgggctgcacggggcccccaggcgcaggaggaggatgag
gtggggggcgcggggcagcgcgggcggaccggacaggaggacgcggactcccgggagggc
tccccgagagccggcctccctcccgccctggggtcccggggccccgcagcgcccgccccg
cggaaggccagcctcctgcccctgcagcgcctgcggcggagaagcgtgcggcccggtgtg
gtggtgccccgggtccgagtcagcaaggcgcccgcgcccaccttccagcccgagcggccg
gcgcggaagggcagcaccaaggacagcggccacctgcagatccccaagtggcggtacaag
gaggccaaggaggagaagaggaaggcagaggaggccgaaaagaagcgccaggccgaggcg
cagaaggagaggcgcactgcgccctggaagaagaggacgtga

>gi568815593r:10581059_10861068|GENSCAN_predicted_peptide_3|91_aa
MDVSGPHGTCYQCSPEPEPPKSTALAAVCQVIGMYDYTAQNDDELAFNKGQIINILNKED
PDWWKGEVNGQVGLFPSNYVKLTTDMDPSQQ

>gi568815593r:10581059_10861068|GENSCAN_predicted_CDS_3|276_bp
atggatgtcagtgggcctcatgggacttgttaccagtgcagcccagagccagagccacct
aagtcaacagcattagcggcagtgtgccaggtgattgggatgtacgactacaccgcgcag
aatgatgatgagctggccttcaacaagggccagatcatcaacatcctcaacaaggaggac
cccgactggtggaaaggagaagtcaatggacaagtggggctcttcccatccaattatgtg
aagttgaccacagacatggacccaagccagcaatga

>gi568815593r:10581059_10861068|GENSCAN_predicted_peptide_4|568_aa
MVYYREVSCQVQFGIIKLIVTCDKLTCKESNKYLSASSLLEFHGIDYGAFGLGRLPVEFI
LRGHVKYQSDVNRDKDVRQIGWLVGWLVGRLVASYTVCEHQMPVSGEEEEDDSDKKRRQA
PMSLQQLDRQEKVGDAPEIVQMLPYFQACLILINTVDRQTDSPPKPTVFISGVIARGHWW
VLAWGFLTHKHHASCRDVDGRWPGRSSHTTAMLPAGTLGDKDFPPAAAQVAHQKPHASMD
KHPSPRTQHIQQPRNRSAAGQQLCREQLPTTSLSGAVLQFTSASCWYKLRVAFRDPDLVG
SAAMEHPSKHLPACLSEQHPGGDAALQQHQGLNNRKHRMWFDAKGKGVKSMLRFQDWMAR
MLAKVLRKASVNGGGINMGSKRPFWADSSVPTWPPSRATVQGFDKHHHHHHCHHCITITI
ITIIITLITIITIITIITTITTITTIITITTIITITTITVTTITIITLIIIAIITIIITI
ITITIITITIIITIIITLTITTFIIIITIITTIITIIAIITIITTTIICHHYHHHHHDHN
HHHHHHYHHHHDHHYHPPPHIYWTLTMC

>gi568815593r:10581059_10861068|GENSCAN_predicted_CDS_4|1707_bp
atggtgtattatcgagaggtttcctgtcaggtccaatttgggatcatcaaattaattgtt
acgtgtgacaagctgacatgcaaagaaagcaataaatacctcagtgccagctccttgttg
gaattccatggaatcgattatggagcctttggcttgggaaggttgcctgtggagttcatc
ctgagagggcatgttaaatatcagtcagatgtcaacagagataaagatgtaaggcagatt
ggttggctggttggttggctggttggtcggctggttgcgtcctatacagtgtgtgaacac
caaatgcctgttagtggagaggaggaagaagatgacagtgataagaagcggagacaagct
cctatgtccctccagcaactggacaggcaggaaaaagtaggagatgctccagagattgtg
caaatgcttccttattttcaggcatgcttgatacttataaacaccgtagacagacagaca
gatagtccacctaaacccactgtgttcatctctggggtcatcgcccggggacactggtgg
gtgctggcctgggggttcctcacccacaaacaccatgcttcctgcagggacgttgatggg
cgctggcctggacgttcctcacatacaaccgccatgcttcctgcagggacgctgggtgac
aaagatttccccccggcggctgcgcaggtggctcaccagaagccgcatgcctccatggac
aagcatccttccccaagaacccagcacatccagcagccacgcaaccgatctgctgcaggt
cagcaactgtgtcgtgagcagctgccaaccaccagcctttctggtgctgttctccagttc
acgtctgccagctgttggtataaactgagggtggcttttagagacccagacttggttggc
agcgctgccatggaacaccccagcaagcacctcccagcctgcctttcggagcagcaccca
ggaggggatgccgcgctccagcaacaccagggactaaataacagaaaacacagaatgtgg
tttgatgcgaagggcaagggggtcaagagcatgctgcggtttcaagactggatggccagg
atgttggcgaaggtgttaagaaaagccagcgtgaatggaggaggaattaacatgggaagc
aagagaccattctgggcagactccagtgtgcccacgtggcctccaagcagagctactgtg
cagggctttgacaagcaccatcatcaccatcattgccatcattgcattaccattaccata
attaccattatcatcaccctcatcaccatcatcaccatcatcaccatcatcaccaccatc
accaccatcaccaccatcatcaccatcaccaccatcatcaccatcaccaccatcaccgta
actaccatcaccatcatcaccctcatcatcattgccatcatcaccatcatcattaccatc
atcaccatcaccataattaccatcaccatcatcattaccattatcatcactctcaccatc
accaccttcatcatcattatcaccatcattaccaccatcattaccatcattgctattatc
accatcattaccaccaccatcatttgtcatcactatcaccaccaccaccatgatcataac
catcaccaccaccaccactatcaccaccatcatgatcatcattatcatcctcctcctcac
atttattggacacttactatgtgctag

>gi568815593r:10581059_10861068|GENSCAN_predicted_peptide_5|226_aa
MTRRGFRGAEAAARFSGRAGVRESWLFEKISKIDRPLARLIKKKREKNQIDAIKNDKGDI
TINHTEIQTTIREYYKHLYANKLENLEEMDEFLDTYTLPRLNQEDVESLNRPITGSEIQA
IINSLPTKKSPGPDGFTAEFYQRYKEELARIEVAAGPYGFASNSDIFHQGNTIIICKSLM
PGSDLPRILFRPPYVELTSSHCQIVPSGPLVVYRKTTPNPSLVADG

>gi568815593r:10581059_10861068|GENSCAN_predicted_CDS_5|681_bp
atgacgcggcggggcttccgcggggccgaggcggcggcgcggttctcgggccgggcgggc
gtgcgcgagagctggctttttgaaaagatcagcaaaattgatagaccactagcaagacta
ataaagaagaaaagagagaagaatcaaatagatgcaataaaaaatgataaaggggatatc
accatcaatcatacagaaatacagactaccatcagagaatactataaacacctctacgca
aataaactagaaaatctagaagaaatggatgaattcctggatacatacaccctcccaaga
ctaaaccaggaagatgttgaatccctgaatagaccaataacaggctctgaaattcaggca
ataattaatagcctaccaaccaaaaaaagtccaggaccagatggattcacagctgaattc
taccagagatacaaagaggagctggcaagaattgaggttgctgcaggcccatatggattc
gccagtaactcagacatcttccaccagggtaacaccatcatcatttgtaaaagtctcatg
ccgggatcagacctgccccgtattctattcaggccaccctatgtggaattaacttcctca
cattgccagattgtaccttctggacctctggtcgtctaccggaaaaccacacccaaccca
agtctagtggcagatgggtga

>gi568815593r:10581059_10861068|GENSCAN_predicted_peptide_6|186_aa
MQGEAGRADGKAAASYLEDLAKTIGEGGYTKQQIFNVDEIAFYRRKMPSWTFTAREKSVL
GFKASKDMLTFLLGANAAGSMFSNSETTQKPTDLCNPTGLELCLVVPSTCGKREVMSVFL
PYECETCDCSNHEKMVEVTVRDQSLKGTTSLYMDLALSQMFTLGTTCSSTSSIVIRLQDN
SFADAT

>gi568815593r:10581059_10861068|GENSCAN_predicted_CDS_6|561_bp
atgcaaggtgaagcaggacgtgctgatggaaaagctgcagcaagctatctagaagatcta
gctaagacaattggtgaaggtggttacactaaacaacagattttcaatgtagatgaaata
gccttctatcggaggaagatgccatcttggactttcacagctagagagaagtcagtgctt
ggtttcaaagcttcaaaggacatgctgacttttttgttaggggccaatgcagctggttct
atgttttcaaactcagaaaccacgcagaaacccacagatctttgcaaccccacaggtttg
gaactatgcttggtcgttcctagtacttgtggaaagagagaagtcatgtctgtgtttctt
ccctatgaatgtgagacttgtgactgctccaatcatgagaaaatggtggaagtgacagta
cgtgaccaaagcttaaagggtacaacctccctttatatggacctggccctctctcagatg
tttacccttggaaccacttgcagctctacctcaagcatcgtcatcagactccaggacaac
agctttgcagatgccacttaa

>gi568815593r:10581059_10861068|GENSCAN_predicted_peptide_7|69_aa
MHDWGPIETHKDEKFLSREAMEELDSCCLHYGRGCAWHADMCFLCGSMLPPLKQNRKSEA
CGSREELTQ

>gi568815593r:10581059_10861068|GENSCAN_predicted_CDS_7|210_bp
atgcatgactggggacctattgagacgcacaaagatgaaaagttcctctccagagaggct
atggaggaattagactcctgttgcttacactacggcaggggatgcgcgtggcatgctgac
atgtgttttctgtgcggctcaatgcttcctccactaaagcaaaaccgcaagtctgaggcc
tgtggcagcagggaagagctgacccaatga