GENSCAN 1.0 Date run: 7-Nov-116 Time: 14:29:23 Sequence gi568815575f:23954969_24176782 : 221814 bp : 42.96% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 271 266 6 1.05 1.03 Term - 12420 12079 342 0 0 -16 36 238 0.203 1.73 1.02 Intr - 14562 14349 214 2 1 44 61 141 0.005 4.80 1.01 Init - 19849 19485 365 1 2 -1 53 192 0.075 3.37 1.00 Prom - 26630 26591 40 -4.65 2.03 PlyA - 28776 28771 6 1.05 2.02 Term - 34062 32953 1110 0 0 113 39 870 0.737 75.66 2.01 Init - 51725 51021 705 2 0 38 96 558 0.788 46.58 2.00 Prom - 57685 57646 40 -6.15 3.00 Prom + 71112 71151 40 -4.35 3.01 Init + 71468 71583 116 1 2 28 53 147 0.240 4.93 3.02 Intr + 75734 75869 136 2 1 29 52 151 0.249 5.25 3.03 Term + 79643 79759 117 1 0 101 48 131 0.612 7.96 3.04 PlyA + 79992 79997 6 1.05 4.00 Prom + 85371 85410 40 -4.05 4.01 Init + 88617 88710 94 2 1 83 69 65 0.613 4.69 4.02 Term + 94632 95224 593 1 2 -27 36 336 0.477 10.60 4.03 PlyA + 95737 95742 6 1.05 5.00 Prom + 97989 98028 40 -7.15 5.01 Init + 98166 98259 94 2 1 83 69 65 0.533 4.69 5.02 Intr + 102453 102580 128 1 2 87 121 136 0.999 16.28 5.03 Intr + 102665 102786 122 1 2 32 60 112 0.731 1.17 5.04 Intr + 105120 105214 95 0 2 53 67 95 0.731 2.59 5.05 Intr + 107448 107606 159 1 0 46 116 157 0.848 13.34 5.06 Intr + 109233 109367 135 1 0 77 111 91 0.995 9.92 5.07 Intr + 111030 111124 95 1 2 75 93 103 0.999 8.26 5.08 Intr + 112996 113140 145 0 1 96 87 83 0.999 7.93 5.09 Intr + 116590 116759 170 2 2 55 94 136 0.997 9.64 5.10 Intr + 118123 118295 173 0 2 100 115 108 0.932 12.52 5.11 Term + 121754 121817 64 2 1 77 49 60 0.646 -2.62 5.12 PlyA + 122478 122483 6 1.05 6.00 Prom + 163688 163727 40 -3.25 6.01 Init + 178315 178481 167 0 2 80 25 200 0.957 12.06 6.02 Intr + 178541 178746 206 2 2 97 75 78 0.969 5.32 6.03 Intr + 185401 185570 170 2 2 51 105 78 0.707 4.54 6.04 Term + 185602 185676 75 0 0 71 48 76 0.674 -1.24 6.05 PlyA + 186070 186075 6 1.05 7.00 Prom + 188696 188735 40 -7.55 7.01 Init + 194333 194569 237 1 0 86 2 228 0.800 10.06 7.02 Term + 195573 195755 183 2 0 87 41 121 0.879 3.86 7.03 PlyA + 195832 195837 6 1.05 8.03 PlyA - 196042 196037 6 1.05 8.02 Term - 197576 197494 83 0 2 93 43 41 0.500 -3.02 8.01 Init - 198639 198567 73 0 1 71 94 68 0.831 6.98 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575f:23954969_24176782|GENSCAN_predicted_peptide_1|306_aa MCTRITHGCCGNTRSDSGVQDKARESAFPTGSQVRPLELLGCTAGVSGILQSCCISHWLC RKSVSPSPISPTPASTTARKTMGDPDEAETHFPWWPVGRSWARGRRWLKIPPLSLGRPLD GSALATIIKIDYIAALIVALKNRVRLLKKTHRTRNDKAYKSTLCYKERTHYSKSMKVKLY TKQTTPKGAWRSQPEDAIEKKNSFSEEKLKLAAEMCIINKEPKNPEKTMGKMSPGHVRDL CGSPSHHRPRGLGGKKWFRETRVPCRVQPRDLVSCILAAPALAKRGQGTARAMTSGCKPQ ALAAST >gi568815575f:23954969_24176782|GENSCAN_predicted_CDS_1|921_bp atgtgcacacgaatcacccacggatgttgtggaaatactcgttctgattcaggggtccag gataaggcacgagagtctgcgtttccaacaggctctcaggtgcggcctttggagctgctg ggctgcacagcaggggtctctgggatcctgcagagctgttgcatttcccactggctttgc agaaagtctgtgtcaccaagtcctatatcacccacacctgccagcaccactgctagaaag acaatgggggacccagacgaggcagaaactcatttcccctggtggcctgtgggaaggagc tgggctcgaggcaggaggtggcttaagattcctcctctgtcccttggacggcccttggat ggaagtgcccttgcaacaattataaaaatagactatatagctgcactaatagtggcactt aaaaacagggttcgattgctaaagaagactcataggacccgaaatgataaagcttacaag tctacgctttgttacaaggaaaggacccactatagcaagagcatgaaagtcaagctatac accaagcagacgactccaaaaggggcctggagaagccagcctgaagatgcgatagaaaag aaaaactcattttctgaggagaaattgaagctggctgcagaaatgtgcataattaacaag gagccaaaaaatcccgaaaaaacgatgggtaaaatgtctccaggacatgtcagagacctt tgtggcagcccctcccatcacaggcccagaggcttaggaggaaaaaaatggtttcgtgag accagggtcccctgccgtgtgcagcctagggacttggtgtcctgcatcctagctgctcca gctttggctaaaaggggccaaggtacagctcgggccatgacttcagggtgcaagccccaa gccttggcagcttccacgtag >gi568815575f:23954969_24176782|GENSCAN_predicted_peptide_2|604_aa MAGDVEGFCSSIHDTSVSAGFRALYEEGLLLDVTLVIEDHQFQAHKALLATQSDYFRIMF TADMRERDQDKIHLKGLTATGFSHVLQFMYYGTIELSMNTVHEILQAAMYVQLIEVVKFC CSFLLAKICLENCAEIMRLLDDFGVNIEGVREKLDTFLLDNFVPLMSRPDFLSYLSFEKL MSYLDNDHLSRFPEIELYEAVQSWLRHDRRRWRHTDTIIQNIRFCLMTPTSVFEKVKTSE FYRYSRQLRYEVDQALNYFQNVHQQPLLDMKSSRIRSAKPQTTVFRGMIGHSMVNSKILL LKKPRVWWELEGPQVPLRPDCLAIVNNFVFLLGGEELGPDGEFHASSKVFRYDPRQNSWL QMADMSVPRSEFAVGVIGKFIYAVAGRTRDETFYSTERYDITNDKWEFVDPYPVNKYGHE GTVLNNKLFITGGITSSSTSKQVCVFDPSKEGTIEQRTRRTQVVTNCWENKSKMNYARCF HKMISYNGKLYVFGGVCVILRASFESQGCPSTEVYNPETDQWTILASMPIGRSGHGVTVL DKQIMVLGGLCYNGHYSDSILTFDPDENKWKEDEYPRMPCKLDGLQVCNLHFPDYVLDEV RRCN >gi568815575f:23954969_24176782|GENSCAN_predicted_CDS_2|1815_bp atggcaggggacgtggaaggattctgttcctccatccacgacaccagtgtctctgctggg ttcagagcactgtatgaggagggattgcttcttgatgtcactctggttattgaagatcat cagttccaggcccataaagcactcttggccacccagagtgattacttcagaattatgttt actgcagacatgagggaacgagatcaggacaaaattcatttaaaaggtctaacagctacc ggtttcagccatgtcctgcaatttatgtactatggaactatagagctgagtatgaatacc gttcatgagattcttcaggctgccatgtatgttcaacttatagaagtggtgaagttctgc tgctcttttctcttagcgaagatctgcctagaaaattgtgcagaaattatgagactctta gatgatttcggcgtaaacatcgagggagtcagggagaagttagacacctttctgctagac aactttgtgccactcatgtctaggcctgactttctgtcctatctgagctttgagaagctc atgtcttacttggataatgatcatctgagcaggttcccagagatagagctgtacgaggct gtgcagtcttggctgcggcatgatagaagacgctggagacataccgataccatcattcag aatatccggttttgcttgatgaccccaaccagcgtttttgagaaggttaagacatcagaa ttttatagatactcccgacagctccgttacgaagttgaccaagcattgaattactttcag aatgttcaccagcagcctttgttggatatgaagtcaagccgcatccgttctgcaaaaccg caaactacagtatttcgaggaatgattggacatagcatggttaacagtaaaatacttctc ttaaagaaaccaagagtctggtgggagctagaaggcccacaagtacctctgcgacctgac tgccttgctatcgtcaataattttgtgttcctgttaggcggggaagagctgggcccggat ggtgaattccatgcttcttccaaagtattcaggtatgacccgagacagaactcctggctg cagatggcagatatgtctgtaccacgctctgaatttgctgtaggtgttattgggaagttt atttacgccgtagcaggcagaaccagagatgagactttctattcaactgagagatatgac atcaccaacgataaatgggaatttgtggatccttatccagttaacaaatatggacatgag gggacagtgctcaataacaaattgtttatcaccggtggaatcacctcatcttccacctcc aagcaagtgtgcgtgtttgaccccagcaaagaagggaccatagaacaacggaccaggaga actcaagtggttaccaactgttgggagaataagagcaagatgaattacgcgagatgcttt cacaagatgatttcttacaatggcaagctttatgtcttcggtggtgtctgtgtgatcttg agggcctctttcgaatctcagggatgcccttctacagaagtatacaacccagagactgat cagtggaccatcttggcatccatgccgattggtagaagtggccatggtgtgactgtgctg gacaaacaaataatggttcttggaggcctttgttataatggtcattacagcgattccatc ctcacttttgatccggatgaaaacaagtggaaggaagatgagtaccctcggatgccctgc aagctggatggtttacaagtatgcaacctgcattttccggactatgtactggatgaggtc aggcgttgcaactaa >gi568815575f:23954969_24176782|GENSCAN_predicted_peptide_3|122_aa MASLSGKSVDDGNTQLEARGSTINLKLGHPLPIADPKLSFSYEIIKGLSVQRGLGFVAQQ CEGVIREKIQDCQGDREGVASEIRMTAYRVEEANRSPQCTRWSPTAPAPDGHVAAAGLNE TL >gi568815575f:23954969_24176782|GENSCAN_predicted_CDS_3|369_bp atggcatctctttctgggaaaagtgtagatgacggaaatacacagttggaagcccgtgga tctacaattaaccttaaattagggcatccattgcccatagcagatcctaaactgagcttc agttatgagattatcaaggggctgagtgtacagcgaggactgggctttgtggcgcagcaa tgtgagggagtcatccgggagaagatacaggactgtcaaggagacagagaaggagtagcc agtgagataagaatgacagcttatagagtggaggaggcaaaccggtcccctcaatgtacc agatggtcacctacagcaccagctccagatggccacgtggctgcagctgggctcaatgaa actctgtga >gi568815575f:23954969_24176782|GENSCAN_predicted_peptide_4|228_aa MDGAESHYSQQSNAGAENQIPHVTTYKWELSGALQPTTALWEPLSGLAKAGAHSLSLQGG VEGEARAGTGAACGACGPAGVPGGRGLGGPCTRSSQPALLAPGNEGLSTRASSCGGCTGS PSSASPPALLSISHGALAAFPQDRARDLQPAMPEPPTHSMGSCAAPASPRSTTPCSTAPS PIDHPRAEECERTAQDWQAAPPAAPVPDPLGGASWAPESGGDVESLYV >gi568815575f:23954969_24176782|GENSCAN_predicted_CDS_4|687_bp atggatggagccgaaagccattattctcagcaaagtaacgcaggagcagaaaaccaaata ccgcatgttaccacttataagtgggagctgagtggagcccttcagcccaccactgcactg tgggagcccctttctgggctggccaaggctggagcccactccctcagcttgcaaggaggt gtggagggagaggcccgagcgggaaccggggctgcatgcggcgcttgcgggccagctgga gttccgggtgggcgtgggcttggcgggccctgcactcggagcagtcagccagccctgctg gccccgggcaatgagggacttagcacccgggccagcagctgcggagggtgtactgggtcc cccagcagtgccagcccgccggcgctgctctcgatttctcacggagccttagctgccttc ccgcaggacagggctcgggacctgcagcccgccatgcctgagcctcccacccactccatg ggctcctgtgcggccccagcctccccgaggagcaccaccccctgctccacagcgcccagt cccatcgaccacccaagggctgaggaatgcgagcgcacggcgcaggactggcaggcagct ccacctgcagccccggtgccggatccactgggtggagccagctgggctcctgagtctggt ggggacgtggagagtctttatgtctag >gi568815575f:23954969_24176782|GENSCAN_predicted_peptide_5|459_aa MDGAESHYSQQSNAGAENQIPHVTTYKWELSGTIGHVAHGKSTVVKAISGVHTVRFKNEL ERNITIKLGYANAKIYKLDDPSCPRPECYRSCGSSTPDEFPTDIPGTKGNFKLVRHVSFV DCPGHDILMATMLNGAAVMDAALLLIAGNESCPQPQTSEHLAAIEIMKLKHILILQNKID LVKESQAKEQYEQILAFVQGTVAEGAPIIPISAQLKYNIEVVCEYIVKKIPVPPRDFTSE PRLIVIRSFDVNKPGCEVDDLKGGVAGGSILKGVLKVGQEIEVRPGIVSKDSEGKLMCKP IFSKIVSLFAEHNDLQYAAPGGLIGVGTKIDPTLCRADRMVGQVLGAVGALPEIFTELEI SYFLLRRLLGVRTEGDKKAAKVQKLSKNEVLMVNIGSLSTGGRVSAVKADLGKIVLTNPV CTEVGEKIALSRRVEKHWRLIGWGQIRRGVTIKPTVDDD >gi568815575f:23954969_24176782|GENSCAN_predicted_CDS_5|1380_bp atggatggagccgaaagccattattctcagcaaagtaacgcaggagcagaaaaccaaata ccgcatgttaccacttataagtgggagctgagtggtacaattggtcatgtagctcatggg aaatccacagtcgtcaaagctatttctggagttcatactgtcaggttcaaaaatgaacta gaaagaaatattacaatcaagcttggatatgctaatgctaagatttataagcttgatgac ccaagttgccctcggccagaatgttatagatcttgtgggagcagtacacctgacgagttt cctacggacattccagggaccaaagggaacttcaaattagtcagacatgtttcctttgtt gactgtcctggccacgatattttgatggctactatgctgaacggtgcagcagtgatggat gcagctcttctgttgatagctggtaatgaatcttgccctcagcctcagacatcggaacac ctggctgctatagagatcatgaaactgaagcatattttgattctacaaaataaaattgat ttggtaaaagaaagtcaggctaaagaacaatacgagcagatccttgcatttgtccaaggt acagtagcagagggagctcccattattccaatttcagctcagctgaaatacaatattgaa gttgtttgtgagtacatagtaaagaaaattccagtacccccaagagactttacttcagag ccccggcttattgttattagatcttttgatgtcaacaaacctggctgtgaagttgatgac cttaagggaggtgtagctggtggtagtatcctaaaaggagtattaaaggtgggccaggag atagaagtaagacctggtattgtttccaaagatagtgaaggaaaactcatgtgtaaacca atcttttccaaaattgtatcactttttgcggagcataatgatctgcaatatgctgctcca ggcggtcttattggagttggaacaaaaattgaccccactttgtgccgggctgacagaatg gtggggcaagtacttggtgcagtcggagctttacctgagatattcacagaattggaaatt tcctatttcctgcttagacggcttctaggtgtacgcactgaaggagacaagaaagcagca aaggttcaaaagctgtctaagaatgaagtgctcatggtgaacataggatccctgtcaaca ggagggagagttagtgctgtcaaggccgatttgggtaaaattgttttgaccaatccagtg tgcacagaggtaggagaaaaaattgcccttagccgaagagttgaaaaacactggcgttta attggttggggtcagataagaagaggagtgacaatcaagccaacagtagatgatgactga >gi568815575f:23954969_24176782|GENSCAN_predicted_peptide_6|205_aa MLLKVNLEALHPWWPVPAGTRGCGHLAPQSAPSHRPQAMAVPDRMYQVTGQALDRRDLDM GAQHHTLAHWRRLQPASAVPQIHQVPAAPPRPAPSAQAMLHYQEDQHLLLVAGASHPSVP HINSVICLSVHEQQNLDGAPGVLVTDFGSLTQYALLVAWLQWAAGSLRSPPKQLPTQFRL EPAPTGFLIASEEQSLKFDNCIQIG >gi568815575f:23954969_24176782|GENSCAN_predicted_CDS_6|618_bp atgctccttaaagtgaacttagaggctctacatccctggtggccagtgcctgctgggacc agaggctgtggccatctggccccccagtcagccccaagtcatcgcccccaggcaatggct gtgcctgacagaatgtaccaggtcactggacaggcactcgataggagagacctagacatg ggtgcccagcaccacaccctggcccactggaggagattacagccagcaagtgcagttcca caaatccatcaagttcctgctgctccaccgcgtcctgcaccatcagcacaagccatgctt cactaccaagaggaccaacaccttctgctagttgcaggggcctctcaccccagtgtgccc catataaactcggtgatttgtctttctgtgcacgagcagcagaacctagatggagcccct ggtgttttagtaacagattttggttccctgacccagtatgctttgcttgtggcttggctg cagtgggctgctgggagtctcagaagccctcctaagcagctgcccacccaatttaggctg gagccagccccaacagggttcctgattgcctcggaagaacagtctttgaaatttgacaac tgcatccagataggatga >gi568815575f:23954969_24176782|GENSCAN_predicted_peptide_7|139_aa MVGGGRAACLSAGSADRGPARALRPRRRRQRRVCGGGGRERGGGGKKGGAVTETPAAGDP AAARPLGSPVAGRERRRHGLSCRRRPPPPPGRRDRFTLSAPGPAPSPAPVATRGQSGQVA FRTPGLTRAYGTPGNVIFA >gi568815575f:23954969_24176782|GENSCAN_predicted_CDS_7|420_bp atggtggggggcggccgggccgcctgcctctccgcggggtcggccgaccgcggccccgcg cgggctttacggccgaggcggcggcggcagcggcgcgtgtgtggcggcggcgggcgggag cgcggaggaggtggaaagaaggggggcgctgtcacggagactccggccgccggagacccc gccgcagcgaggccactgggctccccggtcgcggggcgggagcggcgccgacacgggctg agctgccgcaggcggccaccgccgccgcccggacgccgggaccgtttcaccctcagcgcc cctggccctgcgccttcccccgcgcctgtagccacccgagggcagtcggggcaggtggca ttccggacacctgggcttaccagggcatacgggaccccaggaaatgttatttttgcttaa >gi568815575f:23954969_24176782|GENSCAN_predicted_peptide_8|51_aa MLCKFPNLGNTGFKNKWGAKKVKTALSWSTAAHDLLQSRKMEKLKMGIYAT >gi568815575f:23954969_24176782|GENSCAN_predicted_CDS_8|156_bp atgctgtgcaagtttcccaatttgggtaatactggatttaagaataagtggggtgcaaag aaagtgaagacagccctttcatggagcaccgctgctcacgacctgttacagtctagaaaa atggaaaaactgaaaatgggtatctatgccacctaa