GENSCAN 1.0 Date run: 8-Nov-116 Time: 01:26:10 Sequence gi568815594r:45940991_46223913 : 282923 bp : 34.67% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 PlyA - 219 214 6 1.05 1.01 Sngl - 20907 20593 315 0 0 54 39 208 0.389 8.20 1.00 Prom - 22360 22321 40 -2.05 2.04 PlyA - 22477 22472 6 1.05 2.03 Term - 25162 25011 152 2 2 -21 53 155 0.365 -1.81 2.02 Intr - 28120 28033 88 1 1 35 83 60 0.121 -1.18 2.01 Init - 32220 32173 48 0 0 103 107 25 0.462 6.90 2.00 Prom - 40330 40291 40 -3.05 3.02 PlyA - 41245 41240 6 1.05 3.01 Sngl - 59561 59007 555 2 0 66 42 168 0.851 5.87 3.00 Prom - 60020 59981 40 -5.05 4.04 PlyA - 60200 60195 6 1.05 4.03 Term - 61358 61122 237 2 0 67 54 161 0.589 5.78 4.02 Intr - 70754 70684 71 0 2 99 103 17 0.112 2.28 4.01 Init - 79463 79403 61 2 1 92 12 71 0.111 1.36 4.00 Prom - 97039 97000 40 -2.15 5.13 PlyA - 97701 97696 6 1.05 5.12 Term - 110648 110588 61 1 1 114 45 65 0.317 1.10 5.11 Intr - 113890 113239 652 2 1 71 39 342 0.238 17.64 5.10 Intr - 117632 117495 138 0 0 85 58 75 0.864 3.71 5.09 Intr - 123533 123451 83 1 2 59 87 48 0.049 0.16 5.08 Intr - 124594 124374 221 1 2 99 85 87 0.079 5.68 5.07 Intr - 132935 132870 66 2 0 79 75 43 0.052 0.28 5.06 Intr - 143063 142996 68 0 2 86 115 72 0.803 7.41 5.05 Intr - 156359 156211 149 1 2 42 99 203 0.957 15.76 5.04 Intr - 183984 183901 84 2 0 68 49 82 0.000 0.32 5.03 Intr - 207261 207163 99 2 0 45 101 75 0.098 2.81 5.02 Intr - 216799 216736 64 2 1 59 97 79 0.086 2.76 5.01 Init - 217358 216998 361 2 1 74 97 106 0.052 7.43 5.00 Prom - 221417 221378 40 -7.05 6.02 PlyA - 221649 221644 6 1.05 6.01 Sngl - 224991 224521 471 0 0 42 54 248 0.937 12.67 6.00 Prom - 225944 225905 40 -5.75 7.00 Prom + 228600 228639 40 -3.25 7.01 Sngl + 238035 238451 417 2 0 53 36 255 0.985 12.85 7.02 PlyA + 240278 240283 6 1.05 8.00 Prom + 242205 242244 40 -3.15 8.01 Init + 251001 251055 55 2 1 85 100 17 0.179 4.11 8.02 Term + 273271 274367 1097 2 2 60 49 362 0.457 20.75 8.03 PlyA + 274422 274427 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 183366 183437 72 2 0 103 79 65 0.925 5.78 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594r:45940991_46223913|GENSCAN_predicted_peptide_1|104_aa MRRGLLSCRPQNGRSTNSFHGLPGKATDPQCQPMKAAVREAVPCKATGVELPKTMGTHLL HQHDLDVRPGIKGNHFEAFKFDCPAEFQICMGPVTPLFWPIYPI >gi568815594r:45940991_46223913|GENSCAN_predicted_CDS_1|315_bp atgagaagagggctactgtcctgcagaccccagaatggtagatccaccaacagctttcat ggattgcctggaaaagccacagaccctcagtgccagcccatgaaagcagctgtgagggag gctgtgccttgcaaagccacaggtgtggagctgccaaagaccatgggaacccacctcttg catcagcatgacctggatgtgagacctggaatcaaaggaaatcattttgaagctttcaaa tttgactgccctgctgaatttcagatttgcatgggtcctgtaacccccttgttttggcca atttaccccatttag >gi568815594r:45940991_46223913|GENSCAN_predicted_peptide_2|95_aa MGTGLWPFRNRATQQEACGNHRDNTAHKIKEKENAEHHRKPPNHRAKWNSSHQGPETTGV LLMDLQREADSGWKEGTEAGLNGEKAGKSARVNEY >gi568815594r:45940991_46223913|GENSCAN_predicted_CDS_2|288_bp atggggactggtctttggccttttaggaaccgggccacacagcaggaggcctgtggtaac cacagagacaatacagcacataaaatcaaagagaaagaaaatgcagagcaccatagaaaa ccaccaaaccacagagctaagtggaatagctcccaccaagggcctgagacgactggtgtg cttctgatggatcttcagagagaagctgatagtggatggaaggaaggcacagaagctggg ctgaatggggaaaaagctgggaaatctgcaagggttaatgagtactga >gi568815594r:45940991_46223913|GENSCAN_predicted_peptide_3|184_aa MYQNLWDTVKAMFREKFIALNPHRRKQERLKINTLTSQLKEKQEQTNSKASRIQEITKIR AELKETETRKILQKINEPRSWFFEKINKIDSPLARLIKMKREKNKIDTIKNDKGDITNDR TEIQSAIREYYKHFYPNKLKNLEEMYEFLDTYTLLRLNQEEVESLNRPIKSFEIEAVINS LPTK >gi568815594r:45940991_46223913|GENSCAN_predicted_CDS_3|555_bp atgtaccagaatctctgggacacagttaaagcaatgtttagagagaaatttatagcacta aatccccacagaagaaagcaggaaagattgaaaatcaacaccctaacatcacaattaaaa gagaagcaagagcaaacaaattcaaaagctagcagaattcaagaaataactaagatcaga gcagaactgaaggagacagagacacgaaaaatccttcaaaaaatcaatgaacccaggagc tggttttttgaaaagattaacaaaatagatagtccgctagccagactaataaagatgaaa agagagaagaataaaatagacacaataaaaaatgataaaggggatatcaccaacgatcgc acagaaatacaaagtgccatcagagaatactataaacacttctacccaaataaactaaaa aatctagaagaaatgtatgaattcctagacacatataccctcttaagactaaaccaggaa gaagtcgaatccctgaatagaccaataaaaagttttgaaattgaggcagtaattaatagc ctaccaaccaaataa >gi568815594r:45940991_46223913|GENSCAN_predicted_peptide_4|122_aa MGLNNGNARYTISFNSSTTEEAARHWLCPNSGVRWTAFPCESSQEHNSSPSREQNWTENE FDKLTEVGFRRWVITNSSKLKEHALTQCKEAKNLEKRLDKLLTRKTSLEKIINDLMELKN IA >gi568815594r:45940991_46223913|GENSCAN_predicted_CDS_4|369_bp atgggattaaataatggcaatgctcgatacacgatttcatttaattccagtactactgaa gaggctgcaaggcactggctgtgccccaattctggcgtgaggtggacagctttcccctgt gagtcctcccaggaacacaactcctcaccatcaagggaacaaaactggacagagaatgag tttgacaaattgacagaagtaggcttcagaaggtgggtgataacaaactcctccaagcta aaggagcatgctctaacccaatgcaaggaagctaagaaccttgaaaaaaggttagacaaa ttgctaactagaaaaaccagtttagagaagatcataaatgacctgatggagctgaaaaac atagcatga >gi568815594r:45940991_46223913|GENSCAN_predicted_peptide_5|681_aa MVGSANYMSVSPLPHVFSCKVSALVRGNAVWNTMMVDKAFSESTDGSLGRNIAFRIGKPI AGVSVYSSDDKPPPFPRWKSSNTINLLVDHPRNGAISRAQCWCLLLANWALSSDISKVSL DIDGKADGSMGQGGDVATTGDRKKCRNLTGKLKESTDPVQEAADYTLLHKKGKKICSSCQ RLSGYHGNRRALLGTQSRASEVSVDKADDEDDEDLTVNKTWVLAPKIHEGDITQILNSLL QGYDNKLRPDIGVRPTVIETDVYVNSIGPVDPINMLPMVNNDPKIGEHSTVRYFEKKEYT IDIIFAQTWFDSRLKFNSTMKVLMLNSNMVGKIWIPDTFFRNSRKSDAHWITTPNRLLRI WNDGRVLYTLRLTINAECYLQLHNFPMDEHSCPLEFSSYGYPKNEIEYKWKKPSVEVADP KYWRLYQFAFVGLRNSTEITHTISVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIV YLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMGELPFTIASKRI KYLGIQLTRDVKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRFN AIPIKLPMTFFTELEKTTLKFIWNQKRARIAKSILSQKNKAGGITLPDFKLYYKATVTKT AWYHYSSDYDNPEYNCQEVFT >gi568815594r:45940991_46223913|GENSCAN_predicted_CDS_5|2046_bp atggtaggatcagcgaattacatgagcgtgagcccactgccacacgtctttagctgtaaa gtgagtgccttggtcagaggcaacgctgtgtggaataccatgatggtggataaggcattt agtgagtccacagatggtagtcttggcagaaacattgcattcaggataggcaaacccata gctggagtaagtgtctattccagtgatgataaacctcctccctttccaagatggaagagt tccaatacaatcaacctgctagttgatcaccccaggaatggtgccatatcgagggctcag tgttggtgtctgctgctggcaaattgggcactcagcagtgacattagcaaggtcagcctt gatatagatggaaaggctgatggcagcatgggtcagggaggggatgttgccactactggg gacaggaagaaatgcaggaacttaacaggaaaactgaaagaatccacagaccctgtgcaa gaagcagcagactataccctgctccataaaaaaggaaaaaaaatttgttccagctgccag aggctttctggttaccatggcaaccgtcgggctctgctaggaactcaaagcagagcctct gaagttagtgttgataaggcagatgatgaagatgatgaggatttaacggtgaacaaaacc tgggtcttggccccaaaaattcatgaaggagatatcacacaaattctgaattcattgctt caaggctatgacaataaacttcgtccagatataggagtgaggcccacagtaattgaaact gatgtttatgtaaacagcattggaccagttgatccaattaatatgttacccatggtcaac aatgatccaaaaataggtgaacacagtacagtaagatattttgaaaagaaggaatataca atagatataatttttgcccaaacctggtttgacagtcgtttaaaattcaatagtaccatg aaagtgcttatgcttaacagtaatatggttggaaaaatttggattcctgacactttcttc agaaactcaagaaaatctgatgctcactggataacaactcctaatcgtctgcttcgaatt tggaatgatggacgagttctgtatactctaagattgacaattaatgcagaatgttatctt cagcttcataactttcccatggatgaacattcctgtccactggaattttcaagctatgga taccctaaaaatgaaattgagtataagtggaaaaagccctccgtagaagtggctgatcct aaatactggagattatatcagtttgcatttgtagggttacggaactcaactgaaatcact cacacgatctctgtgttggaagttctggccagggcaatcaggcaggagaaggaaataaag ggtattcaattaggaaaagaggaagtcaaattgtccctgtttgcagacgacatgattgtt tatctagaaaaccccatcgtctcagcccaaaatctccttaagctgataagcaacttcagc aaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattcttatacaccaacaac agacaaacagagagccaaatcatgggtgaactcccattcacaattgcttcaaagagaata aaatacctaggaatccaacttacaagggatgtgaaggacctcttcaaggagaactacaaa ccactgctcaaggaaataaaagaggacacaaacaaatggaagaacattccatgctcatgg gtaggaagaatcaatatcgtgaaaatggccatactgcccaaggtaatttacagattcaat gccatccccatcaagctaccaatgactttcttcacagaattggaaaaaactactttaaag ttcatatggaaccaaaaaagagcccgcatcgccaagtcaatcctaagccaaaagaacaaa gctggaggcatcacactacctgacttcaaactatactacaaggctacagtaaccaaaaca gcatggtatcactacagttctgactatgacaaccctgagtacaattgccaggaagtcttt acctaa >gi568815594r:45940991_46223913|GENSCAN_predicted_peptide_6|156_aa MIISIDAEKAFDKIQQSFMLKTLNKLGIDGTYLKITRAIYDKPTANIILNGQKLEAFPLK TGTRQGCPLSPLLFNIVLEVLARSVRQEKEIKGIRLGKEEVKLSVFADGMIVYLENPIVS AQNLLKPISNFSKVSGYRLIFVKYFISKARPGFETI >gi568815594r:45940991_46223913|GENSCAN_predicted_CDS_6|471_bp atgattatctcaatagatgcagaaaaggcctttgacaaaattcaacagtccttcatgcta aaaactctcaataaattaggtattgatgggacgtatctcaaaataacaagagctatctat gacaaacccacagccaatatcatactgaatggacaaaaactggaagcattccctttgaaa actggcacaagacagggatgccctctctcaccactcctattcaacatagtgttggaagtt ctggccaggtcagtcaggcaggagaaagaaataaagggtattcgattaggaaaagaggaa gtcaaattgtccgtgtttgcagatggcatgattgtatatttagaaaaccccatcgtctca gcccaaaatctccttaagccgataagcaacttcagcaaagtctcaggatacaggttaata tttgtaaaatatttcatatcgaaagctagacctggttttgagactatttga >gi568815594r:45940991_46223913|GENSCAN_predicted_peptide_7|138_aa MWESLELSRDLLNGFDQNADSDMNNKVEAEVVSDGDEELFGNWSKDDSCYVLAKRLVIFF PCHRDMWNFEFERDDLEYLAEEISQQQSIQDVTWVLLKAFSFLREAEHKSSENLPPDNVI EKNIAFSEEKFKPAQEFV >gi568815594r:45940991_46223913|GENSCAN_predicted_CDS_7|417_bp atgtgggaaagtttagagctttctagagacttgttgaatggctttgaccaaaatgctgat agtgatatgaacaataaagttgaggctgaggtggtctcagatggagatgaggaacttttt gggaactggagcaaagatgactcttgttatgttttagcaaaaagactagtaatatttttc ccctgtcatagagatatgtggaactttgaatttgagagagatgatttagagtatttggct gaagaaatttctcagcagcaaagcattcaagatgtgacttgggtgctgttaaaggcattc agttttctaagggaagcagaacataaaagttcagaaaacttgccacctgacaatgtgata gaaaagaacattgcattttctgaggagaaattcaagccagcacaggaatttgtgtaa >gi568815594r:45940991_46223913|GENSCAN_predicted_peptide_8|383_aa MLPPNLLGENLSVPLSASMLEVLAGAIRQEKEIKDIQLGKEEVKLSLYAYDMIVYLENHI VSAQNLLKLISNFSKVSGYKINVQKSQAFLHTNNRQTETQIMSELPFTIASKRIKYLGIQ LTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCPWVGRINIVKMAILPKVIYRFNAIPIKL PMTFFTELEKTTLKFIWNQKRACIAKSIVSQNNKAGGITLRDFKLYYKATVTKTAWYWYQ NRDIHQWNRTEPSEIMLHIYNHLIFDKPDKNKQWGKDSLFNKWCWENWLAICRKLKLDPF LTPYTKINSRWIKDLNVRPKTIKSLEENLGNTIQDMGMGKDFMSETPKAMATKAKIDKWD LMKLKSFCTAEETTIRVNRQPTE >gi568815594r:45940991_46223913|GENSCAN_predicted_CDS_8|1152_bp atgctccctccaaaccttctaggagagaatctttctgtgcctctttcagcttccatgtta gaagttctggctggggcaatcaggcaggagaaggaaataaaggatattcaattaggaaaa gaggaagtcaaattatccctgtatgcatatgacatgattgtatatctagaaaaccacatc gtctcagcccaaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaa atcaatgtgcaaaaatcacaagcattcttacacaccaataacagacaaacagagacccaa atcatgagtgaactcccattcacaattgcttcgaagagaataaaatacctaggaatccaa cttacaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaata aaagaggatacaaacaaatggaagaacattccatgcccatgggtaggaagaatcaatatc gtgaaaatggccatactgcccaaggtaatttatagattcaatgccattcccatcaagcta ccaatgactttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaa agagcctgcattgccaagtcaattgtaagccaaaataacaaagctggaggcatcacgcta cgtgacttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaa aacagagatatacatcaatggaacagaacagagccctcagaaataatgctgcatatctac aaccatctgatctttgacaaacctgacaaaaacaagcaatggggaaaggattccctattt aataaatggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttc cttacaccttatacaaaaattaattcaagatggattaaagacttaaatgttagacctaaa accataaaatccctagaagaaaacctaggcaataccattcaggacatgggcatgggcaag gacttcatgtctgaaacaccaaaagcaatggcaacaaaggccaaaattgacaaatgggat ctaatgaaactaaagagcttctgtacagcagaagaaactaccatcagagtgaacaggcaa cctacagaatag