GENSCAN 1.0 Date run: 5-Nov-116 Time: 12:28:45

Sequence gi568815595r:125347782_125620172 : 272391 bp : 41.66% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +   9515   9887  373  2  1  140   55   170 0.228  12.08
 1.02 PlyA +   9913   9918    6                               1.05

 2.22 PlyA -  12680  12675    6                               1.05
 2.21 Term -  27759  27346  414  0  0   92   47   196 0.155  10.18
 2.20 Intr -  41923  41832   92  2  2   94   29    66 0.003   0.09
 2.19 Intr -  49958  49769  190  2  1   31   99   128 0.887   6.44
 2.18 Intr -  52701  52564  138  0  0  103  100   113 0.979  13.74
 2.17 Intr -  55502  55408   95  0  2   96    7   116 0.794   3.06
 2.16 Intr -  69622  69537   86  0  2   69   58    50 0.121  -1.36
 2.15 Intr -  86291  86212   80  2  2   88   94    67 0.278   4.73
 2.14 Intr -  94434  94383   52  2  1   74   80    66 0.624   2.39
 2.13 Intr -  94632  94496  137  0  2  111   82    14 0.532   1.55
 2.12 Intr - 103638 103524  115  2  1   88   97   132 0.971  13.63
 2.11 Intr - 106174 106029  146  1  2  104  103   117 0.989  12.96
 2.10 Intr - 106642 106511  132  1  0   71   35    81 0.660   1.02
 2.09 Intr - 109584 109485  100  2  1   46   84   138 0.742   8.39
 2.08 Intr - 113079 112990   90  2  0   83  115    88 0.949   9.19
 2.07 Intr - 121738 121673   66  0  0   77   75    57 0.303   0.40
 2.06 Intr - 128975 128914   62  2  2   42  115    50 0.342  -0.29
 2.05 Intr - 149607 149560   48  0  0   62  116    19 0.429   0.06
 2.04 Intr - 150202 150053  150  1  0   76   68    55 0.657   1.74
 2.03 Intr - 150413 150278  136  1  1  126   94    32 0.852   7.25
 2.02 Intr - 156963 156842  122  0  2   68   98    92 0.448   6.57
 2.01 Init - 172391 172251  141  2  0  112   89   188 0.743  21.48
 2.00 Prom - 173944 173905   40                              -4.75

 3.15 PlyA - 177682 177677    6                               1.05
 3.14 Term - 180275 180078  198  2  0  -69   40   216 0.131  -2.48
 3.13 Intr - 184233 184080  154  2  1   57   69   149 0.933   9.05
 3.12 Intr - 190852 190670  183  0  0   89   91   121 0.980  10.48
 3.11 Intr - 199811 199625  187  0  1   93  103   125 0.928  12.33
 3.10 Intr - 204898 204400  499  1  1   92  115   301 0.971  24.83
 3.09 Intr - 212738 212598  141  2  0   40   89   157 0.953  10.63
 3.08 Intr - 216062 215917  146  0  2   44   86   172 0.839  11.68
 3.07 Intr - 219814 219613  202  1  1   66  111    97 0.483   7.74
 3.06 Intr - 231258 231179   80  1  2   88   95    69 0.694   5.95
 3.05 Intr - 232259 232084  176  1  2   25   64   131 0.500   3.06
 3.04 Intr - 245434 245305  130  2  1   50   36    97 0.132  -0.47
 3.03 Intr - 247045 246818  228  2  0   14   75   262 0.553  14.42
 3.02 Intr - 247932 247837   96  1  0   81   55    70 0.494   2.06
 3.01 Init - 252997 252943   55  1  1  110   95     1 0.497   4.50
 3.00 Prom - 255285 255246   40                              -7.55

 4.00 Prom + 255487 255526   40                             -12.33
 4.01 Init + 256500 256549   50  2  2   98   57    36 0.763   1.97
 4.02 Intr + 258003 258203  201  0  0   80  109   179 0.718  16.68
 4.03 Intr + 269050 269260  211  1  1    1   87   180 0.625   7.19
 4.04 Intr + 269402 269616  215  1  2   53   76   233 0.950  15.19
 4.05 Intr + 269734 269954  221  1  2   19   27   186 0.948   2.62
 4.06 Term + 270052 270497  446  2  2   72   46   210 0.972   9.61
 4.07 PlyA + 270532 270537    6                              -0.45

 5.02 PlyA - 270734 270729    6                               1.05
 5.01 Sngl - 271219 270977  243  1  0   76   34   191 0.762   7.33

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  49391  49309   83  0  2  110   41    90 0.918   3.38

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595r:125347782_125620172|GENSCAN_predicted_peptide_1|124_aa
XGAHAHRHTHARTHLHTKGSHGRCWSLHTGTLARDTHTPPKKRRAGFARRWAKARPPPLP
GPAHFRERPLAQPRAKSARTQAGGSPPATTQLHFAVPQPPTAQPAQSRPRSWPNRAWAAK
AQGI

>gi568815595r:125347782_125620172|GENSCAN_predicted_CDS_1|375_bp
nnaggggcgcacgcgcacagacacacgcacgcacgcacgcacttacacacaaaaggaagt
catggaaggtgctggtccctgcatacaggcacactcgcgcgggacacacacacacccccc
aaaaagaggcgtgcggggttcgccagacggtgggcaaaagcccgtcctcccccccttcca
gggcctgctcacttcagggagcgcccactcgcccagccacgggccaagagcgcacggacc
caggcgggcggcagcccacccgccaccacgcagctccacttcgctgttccacagccacca
accgcacagccggcacagtcccgcccgcgcagctggcccaatcgggcctgggctgcaaaa
gcccagggaatatga

>gi568815595r:125347782_125620172|GENSCAN_predicted_peptide_2|863_aa
MEQAPPDPERQLQPAPLEPLGSPDAGLGAAVGKEAEGAGEESSGVDTMTHNNFWLKKIEI
SVSEAEKRTGRNAMNMQETYTAYLIETRSVEHTDGQSVLTDSLWRRYSEFELLRSYLLVY
YPHIVVPPLPEKRAEFVWHKLSADNMDPDFVERRRIGLENFLLRIASHPILCRDKIFYLF
LTQEGNWKETVNETGFQLKRVADRLYGVYKVHGNYGRVFSEWSAIEKEMGDGLQSAGHHM
DVYASSIDDILEDEEHYADQLKEYLFYAEALRAVCRKHELMQYDLEMAAQDLASKKQQCE
ELVTGIISGIRFSQGTHNLDPSHAQFTIGFVLLRESNASAADLTGGRAQTVRTFSLKGMT
TKLFGQETPEQREARIKVLEEQINEGEQQLKSKNLEGREFVKNAWADIERFKEQKNRDLK
EALISYAVMQISMCKKRKPLLPVSYASFQKLPIIYRHVHLPQQHTQMGEYSTHFTYLEDR
PSNGNDSDFQFGAITDDGQAVNLQVWYSKCGTGPAESAPPGILLESCTHRPRSGFLAAVL
TAVVMGSSRLRVVGRSMCEDVCKAFRWYLSSSSKLWGNQNVDINLLSQGFRERSAEYRVW
KETSSERFPRVADQRQLCPSEGEGYLESWHLKSNPYLQHKLLIACCNDHFLGPPVPAALD
TVDSLPLPAALLILHNPIPFSGYPSAIPPLSMALFVLTLGNLELIAGICWPSHILPDLLD
AAGESDAQPGGPAARVQLPRPPWAGDTQAPESKPRCVFPARSLRQTRAGAAERFPCAGLP
TMHRAGSGGDRGGALAAACLRLYCSGRRKRRRKRRRRRRGRKRRKAQGWELLPKLPQQKF
SPLPPSPPLKAPRKVGAAAACSR

>gi568815595r:125347782_125620172|GENSCAN_predicted_CDS_2|2592_bp
atggagcaggcacctccggaccccgagcggcagctccagccggcgcccttggagccgctg
ggctccccagacgctgggctgggggctgcggtcggcaaggaagcggagggggccggagaa
gagagctctggggtcgacacgatgacacacaataatttttggttgaagaagatagaaatc
agtgtttcagaagcagaaaaacgaactggaagaaatgccatgaacatgcaagaaacatat
actgcttacctcattgaaacaaggtcagttgaacataccgatggtcagagtgtcctaaca
gactcactatggcggcgatatagtgaatttgagttgttgagaagctaccttttagtttac
tatccacatattgttgtgccacctctgccagaaaaacgggcagaatttgtttggcataaa
ctctctgctgacaacatggatccagattttgtggagaggcgacggattggtttagaaaac
tttctcttgaggattgcttcacatcccatcctttgtagagacaaaatcttctatctgttt
ttaacacaggaaggtaactggaaggagactgtgaatgaaactgggtttcagctgaagaga
gtagcagatcgactctatggtgtatataaagtacatgggaattatggtcgagttttcagt
gaatggagtgccatagaaaaagaaatgggtgatggactgcagagtgctggtcatcatatg
gatgtgtatgcatcttctattgatgatattttggaagatgaagaacattatgcagatcag
ttaaaagagtatcttttttatgcagaagcattgcgggctgtgtgcaggaaacatgaactt
atgcagtatgacttggagatggctgctcaggacttagcatccaagaagcagcagtgtgag
gaactggtaactgggatcatctcaggcattagattttcacaaggaacacacaacctggat
ccctcacatgcgcagttcacaatagggttcgtgctcctaagagaatctaatgcctctgct
gctgatctgacaggaggcagagctcagactgtgagaacattctctttgaagggaatgact
accaagctctttggtcaagaaactccagagcagagagaagccagaataaaggtgctagaa
gaacaaataaatgaaggagaacaacagctaaagtctaaaaatctggaaggcagagaattt
gtgaaaaacgcatgggctgatattgaacgcttcaaagaacaaaagaaccgagacttaaag
gaggccctcataagctatgcagtcatgcagatcagtatgtgcaaaaagaggaaaccactg
ttaccagtttcctatgcatccttccagaaattacctatcatataccgacacgtacacctt
ccccaacagcacacacaaatgggggaatacagtacacacttcacatatctggaagatcgg
cccagtaatggaaatgacagtgatttccagtttggggctatcacagatgatggccaggcg
gtgaatctccaagtgtggtactccaagtgtggtactgggccagcagaatctgctccacct
gggatcttgttagaaagctgcacccacaggccaagatctggattcctggctgctgtattg
acagctgtggttatggggtcaagcagattgagggtagttggtaggtcaatgtgtgaagat
gtctgcaaggccttccgctggtatctgagttcttcttcgaagctctggggaaaccagaat
gtggacataaatcttttaagtcagggctttagagaacgctctgcagaataccgtgtctgg
aaagagacaagttcagagagatttccaagggtagcagaccagaggcagctttgcccttca
gaaggtgaaggctacttggagagctggcatctgaagagtaacccatatcttcaacataag
cttctgattgcctgttgcaatgaccatttcttaggccctcctgtccctgcagcactggac
actgtggattcccttcctcttcctgctgctctcctgatattgcataatccgattcccttc
agtggctaccccagtgccatccctcctttatccatggccctctttgtgctgaccttagga
aaccttgagcttattgcgggtatttgttggccttcccatattctacctgacctgctggat
gctgctggagaaagtgacgcccagcctggggggcctgcggctcgggtccagcttccacgg
ccaccctgggctggtgacacgcaggcgccggaatcgaagccacgctgtgtctttccagcc
aggtctctgcgccagacgagggcgggcgccgcggagcgcttcccgtgtgcggggcttccc
acaatgcaccgggccggcagtggcggcgaccgcggcggcgctctagctgcggcatgtctg
cgtctctactgctctgggaggaggaagagaaggaggaagaggaggaggaggaggaggggg
aggaagaggagaaaggcgcaggggtgggagctgttgccgaagctgccacagcaaaagttc
tcccccctccccccttcccctcctctcaaggcccctagaaaggttggagctgccgccgcc
tgcagtcggtga

>gi568815595r:125347782_125620172|GENSCAN_predicted_peptide_3|824_aa
MALGIYIFTLTPTNWVISASRPATSSQSLRLLQGPAALTQDGARRFGCEGKGQVDFGVKM
QGGEPVSTMKVSESEGKLEGQATAVTPNKNSSCGGGISSSSSSRGGSAKGWQYRYAPFSK
IISRGGGAAGPLSVALRPDLLWGCGWGAAAALAFTFGSGESRESIKLVVRFFVLNNEAGL
LEYFVNEQSRNQKPRGTLQLAGAVISPSDEDSHTFTVNAASGEQYKLRATDAKERQHWVS
RLQICTQHHTEAIGKMMSHAEGQQRDLIRRIECLPTSGHLSSLDQDLLMLKATSMATMNC
LNDCFHILQLQHASHQKGSLPSGTTIEWLEPKISLSNHYKNGADQPFATDQSKPVAVPEE
QPVAESGLLAREPEEINADDEIEDTCDHKEDDLGAVEEQRSVILHLLSQLKLGMDLTRVV
LPTFILEKRSLLEMYADFMSHPDLFIAITNGATAEDRMIRFVEYYLTSFHEGRKGAIAKK
PYNPIIGETFHCSWKMPKSEVASSVFSSSSTQGVTNHAPLSGESLTQVGSDCYTVRFVAE
QVSHHPPVSGFYAECTERKMCVNAHVWTKSKFLGMSIGVTMVGEGILSLLEHGEEYTFSL
PCAYARSILTVPWVELGGKVSVNCAKTGYSASITFHTKPFYGGKLHRVTAEVKHNITNTV
VCRVQGEWNSVLEFTYSNGETKYVDLTKLAVTKKRVRPLEKQDPFESRRLWKNVTDSLRE
SEIDKATEHKHTLEERQRTEERHRTETGTPWKTKYFIKEEAKGTSYMVAARENEEDAKVE
TPDKTVRSRRTYSLPREQYGENCPHDSNYLPQHVGIVEWKYNSR

>gi568815595r:125347782_125620172|GENSCAN_predicted_CDS_3|2475_bp
atggccttaggtatctacattttcacactgacacccacaaactgggtgatttcagccagt
cgcccagctaccagctcccagagcctccgcctccttcagggacccgccgccctgacccaa
gatggcgccagacgcttcggctgtgagggaaaagggcaagtggactttggcgttaagatg
caggggggtgaaccagtgtccacaatgaaagtctcggagagcgaaggaaagctggagggc
caggccacagcggtgaccccgaacaagaacagcagctgtggaggtggaatcagtagcagc
agcagcagccgcggtggcagtgcaaaaggctggcagtacaggtatgctcctttctccaaa
attatttcccgaggtggaggggccgcgggtcccctgtccgtggcgcttcgtcctgacctg
ctatggggctgtgggtggggcgccgccgcggccctggcattcacatttgggtctggagag
agccgggagtctattaagttggttgtgaggttttttgttttaaacaatgaagctgggctg
ttggagtactttgtgaatgaacagtctagaaatcagaaacctagaggaactttgcagctt
gcaggagctgtaatatcacccagtgatgaggattctcacaccttcactgtaaacgctgcc
agtggggaacaatataaactcagagctacagatgcaaaagagcgacagcactgggttagc
agacttcagatatgtacacagcatcatactgaagctattggaaagatgatgtctcatgct
gaaggacaacaaagagacttaattagacgaattgaatgccttcctacttctggccatctt
agttccttggaccaggatctcttaatgctcaaagctacttccatggcaactatgaactgc
ttaaatgactgctttcatattctccagttacagcatgcatcacatcagaagggctcattg
ccttcaggaacgacaatcgagtggttagaaccaaagatatctttatcaaaccactataaa
aatggagctgaccagccctttgcaactgatcagagtaagccggtggcagtcccagaagag
cagcctgttgcagaatctggactattagcgagggagcctgaagaaataaatgcagatgat
gagatagaggatacatgtgaccacaaagaggatgacctgggagctgtagaagaacaacgt
agtgtcatcctacatctcttgtcacagcttaagctgggcatggatttaacaagagtggtg
cttcctacatttatcctagagaagcgttccttgctggaaatgtatgcagactttatgtct
catccagacctatttatagccatcactaatggagccacagctgaggacagaatgattcgc
tttgttgagtactaccttacctcatttcatgaaggccgtaagggagccattgctaaaaaa
ccatacaatcctatcattggagaaacatttcactgttcctggaagatgccaaaaagcgag
gtagcatccagtgtttttagcagttcttccacccagggagtcacaaatcatgctccttta
tcgggggagtctttgacccaggtgggatcagactgttacacagtcagatttgttgctgag
caggtttctcatcatcctccagtctcaggattttatgcagaatgtacagagaggaagatg
tgtgtaaatgcgcatgtctggactaagagcaagttcttaggcatgtcaataggcgtgaca
atggttggagaaggtatccttagtctgttggagcatggagaagagtacacattttctcta
ccctgtgcatatgctcggtcaattttgactgttccttgggtagaactgggtggcaaagtc
agtgtcaactgtgcaaaaactggatattcagccagcatcacttttcataccaagccattt
tatggtggcaaactgcatcgggttacagctgaagtaaagcacaacatcaccaacactgtg
gtatgcagagtgcaaggggaatggaatagtgttcttgagttcacatatagcaatggagag
acaaagtatgtggacttgactaaattggcagtgacgaagaaaagagtgagacctctggag
aagcaggatccatttgaatccaggcgattgtggaaaaatgtgacagactcgctgagagaa
tctgaaattgataaggccacagagcataagcataccctggaagaacgtcagaggactgaa
gaaaggcatcgtactgaaacaggcacaccttggaaaaccaaatattttattaaagaggag
gcgaaaggcacttcttacatggtggcagcaagagaaaatgaggaagatgcaaaagtggaa
acccctgataaaaccgtcagatctcgtaggacttattcactaccacgagaacagtatggg
gaaaactgcccccatgattcaaattacctcccacaacacgtgggaattgtggagtggaag
tacaattcaagatga

>gi568815595r:125347782_125620172|GENSCAN_predicted_peptide_4|447_aa
MLGHYSFKMELDNHTESLRGRARVQVFENASVRATKSDLPRSSLWSRRKTSVSATASVSA
TNLISMVQGQLRVLGQEGNLLLHRTSLKDNGEGKSSQWTELRAVHLVVHFAWKKKWPDVR
LYTDSWAVTSGLAGWSGTWKKHDWKMVTKKCGEEPSLSSPNGPMNKVAMVAGMEVIHGLS
NKDFRLPRLTWLRPLPSAQFASSRDQHRALDMAPFLGVISQLPGGRLPSVDSWNALSTIM
VFQYSIASDQGTNLMAKEVRQWAHARGIHWSYRVPQHPEAAGLIEQWNGLLKSQLQHRLA
SIHRSRNQGVEVKVAPLTITPSDLLAKFLLSVPMTFCSADLEVLVPEGGMLPPRDTTMIP
SNWKLRLPPEHFWLLLPLSQQAKKGVTVLAGVTDYQDEIRLLPHNGGKEEYAWNTGDPLG
CLLVLPCPVIKVNGKVQQPNPGRTTNG

>gi568815595r:125347782_125620172|GENSCAN_predicted_CDS_4|1344_bp
atgcttggccactacagctttaaaatggaattggacaaccatactgagagtctaagagga
agagcaagagtgcaggttttcgaaaatgcatcggtaagggccactaaatctgaccttcct
cggtcctctttgtggtctaggaggaaaactagtgtttctgctactgcttcagtgagcgca
actaatctgatcagcatggtccagggacagttgagagttcttgggcaagaggggaatctg
ctgctgcatcgaacatccctgaaggacaatggtgaggggaaatcatctcagtggacagaa
cttcgagcagtgcacctggttgtgcactttgcatggaagaagaaatggccagatgtgcga
ttatatacagattcatgggctgtaaccagtggtttggctggatggtcagggacttggaag
aagcacgattggaaaatggtaacaaagaaatgtggggaagagccatccctgtcatcgccc
aatgggcccatgaacaaagtggccatggtagcagggatggaggttattcatgggctcagc
aacaaggacttccgcttaccaaggctgacctggctacgaccactgccaagtgcccagttt
gccagcagcagagaccaacaccgagcccttgatatggcacccttcctcggggtgatcagc
cagctacctggtggcagactgccatccgtggactcatggaatgccttatccaccatcatg
gtattccagtacagcattgcctctgaccaaggcactaacttaatggctaaagaagtgcgg
cagtgggctcatgctcgtggaattcactggtcttaccgtgttccccaacatcctgaagca
gctggattgatagaacagtggaatggccttttgaagtcacaattacaacaccgactagcc
agtattcacaggtccaggaatcaaggggtggaagtgaaagtggcaccactcaccatcacc
cctagtgacctgctagcaaaatttttgctttctgttcccatgacattttgttctgctgac
ctagaggtcttagttccagagggaggaatgctgccaccaagagacacaacaatgattcca
tcaaactggaagttaagattgccacctgaacacttttggctcctccttcctctaagtcaa
caggctaagaaaggagttacagtgttggctggggtgactgactatcaggatgaaatcaga
ctactaccccacaatggaggtaaggaagagtatgcgtggaatacaggagatcccttaggg
tgtctcttagtattaccatgccctgtgattaaggtcaacgggaaagtacaacaacccaat
ccaggcaggactacaaatggctga

>gi568815595r:125347782_125620172|GENSCAN_predicted_peptide_5|80_aa
MGERRQPEDSADKAYPTFFTCFILATLAADCMVPIDTEGGSPSPSPPTQMSISSGNTLTD
NPETILYQPSRILQSTQVDT

>gi568815595r:125347782_125620172|GENSCAN_predicted_CDS_5|243_bp
atgggagaaaggaggcagccagaagactcagcagacaaagcttatcccaccttcttcacc
tgcttcattctagccacactggcagcagattgtatggtgcccatcgacactgagggtggg
tctccctctcccagtccaccgacacaaatgtcaatctcctctggcaacaccctcacagac
aacccagaaacaatactttaccagccatctaggatccttcaatctactcaagttgacacc
taa