GENSCAN 1.0	Date run: 5-Nov-116	Time: 00:19:56

Sequence gi568815595f:153062694_153263242 : 200549 bp : 40.48% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   4695   4859  165  2  0   26   28   240 0.628  11.04
 1.02 Intr +   5855   5930   76  1  1   90   70    48 0.823   1.37
 1.03 Term +   7627   7760  134  2  2  121   48    55 0.880   2.07
 1.04 PlyA +   8185   8190    6                               1.05

 2.05 PlyA -   8337   8332    6                               1.05
 2.04 Term -  66947  66357  591  2  0   11   48   457 0.007  27.54
 2.03 Intr -  92400  92139  262  2  1  -36  100   252 0.286  10.57
 2.02 Intr -  92768  92657  112  0  1   27   31    62 0.303  -7.08
 2.01 Init -  93171  92967  205  0  1  100   44   163 0.507  10.28
 2.00 Prom -  96648  96609   40                              -6.15

 3.00 Prom +  97309  97348   40                              -8.05
 3.01 Sngl + 100001 100552  552  1  0   81   54   971 0.999  86.86
 3.02 PlyA + 103138 103143    6                               1.05

 4.00 Prom + 104336 104375   40                              -2.75
 4.01 Init + 133985 134118  134  1  2   56   71    98 0.524   4.56
 4.02 Intr + 145174 145353  180  1  0   89   55   172 0.949  12.06
 4.03 Intr + 146966 147029   64  2  1  120   86    54 0.985   6.20
 4.04 Intr + 147626 147708   83  1  2  119   30    62 0.888   1.02
 4.05 Intr + 157856 157926   71  2  2   94   88    56 0.914   4.11
 4.06 Intr + 159294 159448  155  1  2   53   67   128 0.612   6.07
 4.07 Term + 161729 161800   72  1  0   43   48   107 0.604  -0.87
 4.08 PlyA + 162959 162964    6                               1.05

 5.05 PlyA - 163106 163101    6                               1.05
 5.04 Term - 166724 166242  483  2  0   69   33   237 0.752  10.06
 5.03 Intr - 169482 169270  213  0  0   54  102    67 0.465   2.69
 5.02 Intr - 175357 174650  708  1  0  -16   86   286 0.109   8.71
 5.01 Init - 175825 175646  180  1  0   70   41   189 0.275  11.63
 5.00 Prom - 176775 176736   40                              -6.15

 6.02 PlyA - 176944 176939    6                               1.05
 6.01 Sngl - 177828 177172  657  0  0   65   43   357 0.896  24.82
 6.00 Prom - 185255 185216   40                              -3.85

 7.03 PlyA - 186408 186403    6                               1.05
 7.02 Term - 190968 190816  153  0  0  -12   47   205 0.440   3.34
 7.01 Intr - 193527 193419  109  2  1   90  103    97 0.973  10.77

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl -  66980  66357  624  2  0   17   48   493 0.810  32.14
S.002 Sngl + 178906 179241  336  0  0   68   42   269 0.824  14.08

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595f:153062694_153263242|GENSCAN_predicted_peptide_1|124_aa
PEPKPEKAPANKGEKVLKGKKGKADAGKEGNKLAENGDAKENRHRKLKVLQMPSEPGILN
AEVYNFLPSFSKQATSEKQSGELTHLPMHHKIKMLTQSIKSSLESVAQPLIRQFAVQKVV
LLNE

>gi568815595f:153062694_153263242|GENSCAN_predicted_CDS_1|375_bp
ccagagcccaagcctgaaaaggcccctgcaaataagggagagaaggtactcaaagggaaa
aagggaaaagctgatgctggcaaggaggggaataaacttgcagagaatggagatgccaaa
gagaataggcacagaaagctgaaggtgctgcagatgccaagtgaaccaggaattttgaat
gcagaggtctacaattttcttccatcattttctaaacaggctacttctgagaaacagtca
ggggaactaacacatcttccaatgcatcacaagataaagatgctgactcagtctattaag
tcatccctggagtctgtggcacagcccttaataagacagtttgctgtacaaaaagttgtt
ttattaaatgaatga

>gi568815595f:153062694_153263242|GENSCAN_predicted_peptide_2|389_aa
MPEPPTPSVGSCAAGASPRSAAPCSRAPSPIDHPRAEECRRTARDWQAAPPAAPLRDPLG
EASWASESANLVGTWRTFISSSGIVNTPISALSKQTTQLYQSAGCGSVASLLSQRDHEAH
QKEEIPNTSEHQKEQTLDTPPLGTVTLTARIRGFILEVSETKNPLIPDTQGQQSTACLDV
RLKPITPSAAVAPVAGPSRPGATAFWSRDFSEEEQSVVYVQGISTEGNVRSRHMLMSPKA
DVKLKTSRATDASISMESLKGAGDSVDEQSSRRGEIKSASLKDLCLEDKRCIANLIKELA
RVNEEKEVTEERLKAEQESFEKKIRQLEEQNELIIKEREALQLQYRECQELLSLYQKYLS
EEQEKLTMSLSELGAARMQEQQLYNNPYI

>gi568815595f:153062694_153263242|GENSCAN_predicted_CDS_2|1170_bp
atgcctgagcctcccaccccctccgtgggctcgtgtgcggccggagcctcccctaggagc
gccgccccctgctccagggcgcccagtcccatcgaccacccaagggctgaggagtgccgg
cgcacggcgcgggactggcaggcagctccacctgcagccccattgcgcgatccactgggt
gaagccagctgggcttctgagtctgctaatctggtggggacgtggagaacctttatatct
agctcagggattgtaaacacaccaatcagcgccctgtcaaaacagaccactcagctctac
caatcagcaggatgtgggtccgtggcttcactcctgagccagcgagaccacgaagcccac
cagaaggaagaaattccgaacacatccgaacatcagaaggaacaaactctggacacgcca
cctttaggaactgtaacactcaccgcgaggatccgcggcttcattcttgaagtcagtgag
accaagaacccactaattccggacacacagggacagcaatccactgcttgccttgatgta
cgattgaagcctatcaccccttctgcagcagtggctccagtggcgggcccctcccgcccc
ggcgccacggcgttctggagccgggacttttctgaagaagaacaatccgtagtgtacgtt
caaggaatttctactgaaggaaatgtcagatcaagacacatgctgatgagtccaaaagct
gatgttaaacttaagacttccagggcgactgatgcttcaatctccatggagtctttaaaa
ggtgcaggagattcagtagatgaacagagttcccgcaggggagaaataaagagtgcatca
ttgaaggatttatgtcttgaagacaaaagatgcattgcaaacttaattaaagaactggcc
agagtaaatgaggaaaaggaagtgacagaggaaagattgaaagctgagcaggagtcattt
gagaagaagatcaggcagttagaagaacagaatgaactgatcatcaaagaaagggaagct
cttcagctacagtatagagaatgccaagaacttctaagcctgtatcagaaatatttatca
gaagaacaagagaagctcaccatgtctctctcagaacttggtgctgctagaatgcaggaa
cagcagctatataacaatccatacatctga

>gi568815595f:153062694_153263242|GENSCAN_predicted_peptide_3|183_aa
MREYKVVVLGSGGVGKSALTVQFVTGSFIEKYDPTIEDFYRKEIEVDSSPSVLEILDTAG
TEQFASMRDLYIKNGQGFILVYSLVNQQSFQDIKPMRDQIIRVKRYERVPMILVGNKVDL
EGEREVSYGEGKALAEEWSCPFMETSAKNKASVDELFAEIVRQMNYAAQPNGDEGCCSAC
VIL

>gi568815595f:153062694_153263242|GENSCAN_predicted_CDS_3|552_bp
atgagagagtacaaagtggtggtgctgggctcgggcggcgtgggcaagtccgcgctcacc
gtgcagttcgtgacgggctccttcatcgagaagtacgacccgaccatcgaagacttttac
cgcaaggagattgaggtggactcgtcgccgtcggtgctggagatcctggatacggcgggc
accgagcagttcgcgtccatgcgggacctgtacatcaagaacggccagggcttcatcctg
gtctacagcctcgtcaaccagcagagcttccaggacatcaagcccatgcgggaccagatc
atccgcgtgaagcggtacgagcgcgtgcccatgatcctggtgggcaacaaggtggacctg
gagggtgagcgcgaggtctcgtacggggagggcaaggccctggctgaggagtggagctgc
cccttcatggagacgtcggccaaaaacaaagcctcggtagacgagctatttgccgagatc
gtgcggcagatgaactacgcggcgcagcccaacggcgatgagggctgctgctcggcctgc
gtgatcctctga

>gi568815595f:153062694_153263242|GENSCAN_predicted_peptide_4|252_aa
MRYTGEGSSMGIRIESLVLVKVELLISIRVKMFSRLLDIWSSGDRNKQKVTERDNKPVVY
QQAFSTKLLSAQVVSAIITPACKCEDKQGHDRSIWLLTSGISLQRECKSNNKWKTLELSH
PGYDMKISPVGGSLPRNERGSAGLAQLTLIVPSWYSISTPGVHEQMDRRILNAGGCAESR
NAAAEVKPEPSLCVMITALTTYSCMVCTSSGVALPPMEQDTERKALLSPSACRNIEKKGG
QVEELEVTEHYR

>gi568815595f:153062694_153263242|GENSCAN_predicted_CDS_4|759_bp
atgaggtacactggggaaggatcaagtatggggataagaattgagagtttagttctggtt
aaggtggagctgcttattagtattcgagtgaagatgtttagtaggctgttggatatttgg
agttcaggagacagaaataaacagaaggtaactgagagagacaacaagcctgttgtttat
cagcaagcttttagcacaaaattgctgtctgcacaagtggtatcagcaatcattacacct
gcttgcaaatgtgaagacaaacagggccatgatcggtcaatctggttattgacttcaggc
atctcacttcaaagggaatgcaagtcaaacaataagtggaagactcttgagcttagtcac
ccaggttatgacatgaagatttcaccagtaggaggaagcctccctcgaaatgagcgaggc
tcagcaggtctggcacagctaaccttgatcgttcctagctggtattcaataagtactcct
ggagtacatgaacagatggatagaagaattctaaatgccggaggatgtgcagagagcagg
aatgcagcagctgaagtcaaaccggaaccatccctctgcgtcatgataactgctctcacc
acctattcgtgcatggtctgcacatcctcaggggttgctcttccgcctatggagcaagac
actgaaagaaaggctctattatctcctagtgcctgccgaaatattgagaagaagggagga
caggttgaagaattagaagtaacggaacactacagatga

>gi568815595f:153062694_153263242|GENSCAN_predicted_peptide_5|527_aa
MDKFLDTYTLPRLNQEEVESLNGPITGAEIVAIINSLPTKNSPGPVGFTAEFYQRYKEEL
HINRAKDKNHMIISIDAEKAFDKIQQPVMLKTLNKLGIDGTYFKIIRAIYDKPTANIILN
GQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDM
IVYLENPTVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNKRQTESQIMSELPFTTASK
RIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIKS
FIAHTKPVWWSLHTDAHEIWCRDSDRGTSLGRSIPCPPALCSVKKIHLRPQVLRPTSPRN
ISPILNRHALKRLKPVITRLLQHGLLKPINSPYNSPILPVLKPDKPYKLVQDLYLINQIV
LPIHPMVPNPYTLLSSIPPSTTHYSVLDLKHAFFTIPLHPSSQPLFAFTWTDPDTHQAQQ
ITWAVLPQAFTDSPHYFSQAQISSSSVTYLGIILIKTHVLSLLFVFD

>gi568815595f:153062694_153263242|GENSCAN_predicted_CDS_5|1584_bp
atggataaattcctcgacacatacactctcccaagactaaaccaggaagaagttgaatct
ctgaatggaccaataacaggagctgaaattgtggcaataatcaatagtttaccaaccaaa
aacagtccaggaccagttggattcacagccgaattctaccagaggtacaaggaggaactg
catataaacagagccaaagacaaaaaccacatgattatctcaatagatgcagaaaaagcc
tttgacaaaattcaacaacccgtcatgctaaaaactctcaataaattaggtattgatggg
acgtatttcaaaataataagagctatctatgacaaacccacagccaatatcatactgaat
gggcaaaaactggaagcattccctttgaaaactggcacaaggcagggatgccctctctca
ccactcctattcaacatagtgttggaagttctggccagggcaattaggcaggagaaggaa
ataaagggtattcaattaggaaaagaggaagtcaaattgtccctgtttgcagacgacatg
attgtatatctagaaaaccccactgtctcagcccaaaatctccttaagctgataagcaac
ttcagcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattcttatacacc
aacaagagacaaacagagagccaaatcatgagtgaactcccattcacaactgcttcaaag
agaataaaatacctaggaatccaacttacaagggatgtgaaggacctcttcaaggagaac
tacaaaccactgctcaatgaaataaaagaggatacaaacaaatggaagaacattccatgc
tcatgggtaggaagaatcaatatcgtgaaaatggccatactgcccaaggtgataaaaagc
tttattgctcacacaaagcctgtttggtggtctcttcacacggacgcgcatgaaatttgg
tgccgtgactcagatcgggggacctcccttgggagatcaatcccctgtcctcctgctctt
tgctccgtgaaaaagatccacctacgacctcaggtcctcagacccaccagcccaaggaac
atctcaccaattttaaatcggcacgctttaaaaagattaaagcctgttatcactcgcctg
ctacagcatggccttttaaagcctataaactctccttacaattcccccattttacctgtc
ctaaaaccagacaagccttacaagttagttcaggatctgtaccttatcaaccaaattgtt
ttgcctatccaccccatggtgccaaacccatatactctcctatcctcaatacctccctcc
acaacccattattctgttctggatctcaaacatgctttctttactattcctttgcaccct
tcatcccagcctctcttcgctttcacttggactgaccctgacacccatcaggctcagcaa
attacctgggctgtactgccacaagccttcacagatagcccccattacttcagtcaagcc
caaatttcatcctcatctgttacctatctcggcataattctcataaaaacacacgtgctc
tccctgctgttcgtgtttgactaa

>gi568815595f:153062694_153263242|GENSCAN_predicted_peptide_6|218_aa
MEDEMNEMKREGKFREKRIKRNEQSLQEIWDYVKRPNLRLIGVPESDGENGTKLENTLQD
IIQENFPNLARQANVQNQEIQRTPQRYSSRRATPRHIIIRFTKVEMKEKMLRAAREKGRV
TLKGKPIRLTADLSAETLQARRECGPIFNIVKEKNFQPRISYPAKLSFISEGEIKYFTDK
QMLRDFVTTRPALKELLKEALNMERNNQYQPLQNHAKM

>gi568815595f:153062694_153263242|GENSCAN_predicted_CDS_6|657_bp
atggaagatgaaatgaatgaaatgaagcgagaagggaagtttagagaaaaaagaataaaa
agaaatgagcaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacgtctg
attggtgtacctgaaagtgacggggagaatggaaccaagttggaaaacactctgcaggat
attatccaggagaacttccccaatctagcaaggcaggccaacgttcagaatcaggaaata
cagagaacgccacaaagatactcctcgagaagagcaactccaagacacataattatcaga
ttcaccaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaaggtcgggtt
accctcaaaggaaagcccatcagactaacagcggatctctcggcagaaactctacaagcc
agaagagagtgtgggccaatattcaacattgttaaagaaaagaattttcaacccagaatt
tcatatccagccaaactaagcttcataagtgaaggagaaataaaatactttacagacaag
caaatgctgagagattttgtcaccaccaggcctgccctaaaagagctcctgaaggaagcg
ctaaacatggaaaggaacaaccagtaccagccactgcaaaatcatgccaaaatgtaa

>gi568815595f:153062694_153263242|GENSCAN_predicted_peptide_7|87_aa
XNSADEVPGSISGPGDTKIDKTWSLSLNNTLLQRGNHPQAGPSRGIPGEGIVIIGDDSSM
HVIVPEDLPVGQDVAVEYSDIDGPDLV

>gi568815595f:153062694_153263242|GENSCAN_predicted_CDS_7|264_bp
nnaaacagtgctgatgaagtaccaggttctatatcaggtcctggggatacaaaaatcgat
aagacatggtccctttccttaaataatacactcttacagagaggaaaccatcctcaggca
ggtccttccagaggtattccaggagaaggcattgttatcataggagatgacagctccatg
cacgtcattgtccctgaagaccttccagtgggacaagatgtggcagtggagtacagtgat
attgatggtcctgacctcgtgtag