GENSCAN 1.0    Date run: 5-Nov-116    Time: 06:21:08

Sequence gi568815582f:46784716_47028994 : 244279 bp : 45.05% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   1758   1884  127  2  1   71   98    48 0.445   4.42
 1.02 Term +   5459   5646  188  0  2   70   54   114 0.824   3.85
 1.03 PlyA +   7202   7207    6                               1.05

 2.05 PlyA -  11993  11988    6                               1.05
 2.04 Term -  18355  18237  119  2  2  112   43   110 0.982   7.60
 2.03 Intr -  21305  21288   18  0  0  103   94    21 0.618   0.78
 2.02 Intr -  46265  46100  166  2  1   -6   48   200 0.010   6.13
 2.01 Init -  46434  46369   66  0  0   59   94   104 0.909   9.17
 2.00 Prom -  49265  49226   40                              -5.16

 3.00 Prom +  55178  55217   40                              -3.46
 3.01 Init +  59167  59561  395  0  2   51   37   264 0.647  13.82
 3.02 Intr +  59626  59743  118  1  1   33    1   159 0.866   2.17
 3.03 Intr +  59763  59918  156  2  0   39   72   155 0.188   9.21
 3.04 Intr +  87625  87748  124  0  1   60  106    48 0.012   3.96
 3.05 Intr +  99209  99297   89  0  2   61   80   145 0.492  10.69
 3.06 Intr +  99409  99493   85  0  1   71   26    67 0.446  -1.81
 3.07 Intr +  99548  99623   76  0  1   87   45    51 0.751  -0.73
 3.08 Intr +  99979 100243  265  1  1   87   88   533 0.968  50.72
 3.09 Intr + 112933 113022   90  0  0  122   99   138 0.999  18.49
 3.10 Intr + 115967 116075  109  1  1  115   68   111 0.994  11.66
 3.11 Intr + 122127 122260  134  1  2  116   46   267 0.571  25.66
 3.12 Intr + 124969 125212  244  0  1   61   46   391 0.402  29.27
 3.13 Intr + 130010 130139  130  0  1   58   58    89 0.551   2.65
 3.14 Intr + 133857 134042  186  0  0   17   53   342 0.259  22.40
 3.15 Intr + 137527 137701  175  1  1   95   72   261 0.927  25.14
 3.16 Intr + 139674 139829  156  2  0   92   41   150 0.997  10.91
 3.17 Intr + 142210 142322  113  0  2   99   80    94 0.706   8.88
 3.18 Term + 144192 144282   91  0  1  107   55   189 0.793  14.69
 3.19 PlyA + 144698 144703    6                               1.05

 4.13 PlyA - 147802 147797    6                               1.05
 4.12 Term - 172505 172314  192  2  0   25   37   147 0.887   0.72
 4.11 Intr - 174415 174288  128  2  2   65   84   122 0.963   9.90
 4.10 Intr - 174704 174560  145  2  1   71   99   102 0.995   9.46
 4.09 Intr - 179852 179766   87  2  0   67  101    68 0.964   6.17
 4.08 Intr - 180092 179896  197  0  2   71   45   141 0.990   7.23
 4.07 Intr - 182931 182798  134  2  2   52   95    75 0.516   4.89
 4.06 Intr - 183449 183369   81  1  0   90  106    17 0.327   2.45
 4.05 Intr - 186857 186634  224  2  2   79    7   123 0.350   0.23
 4.04 Intr - 187240 187181   60  1  0   62  106    39 0.796   2.03
 4.03 Intr - 189116 188780  337  1  1   40   65   378 0.746  26.22
 4.02 Intr - 211411 211335   77  1  2   47   74   118 0.043   4.61
 4.01 Init - 232024 231965   60  1  0   91   70    34 0.440   3.19

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  46265  46044  222  2  0   -6   48   251 0.928   8.62

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815582f:46784716_47028994|GENSCAN_predicted_peptide_1|104_aa
MAENFPRLGKKREIWIQEAQRMPNKKNPKRSTQRHIIIKLTKAKWIRGWCTAWQLRMQTW
LTGCSAPPPRWPEESAPARLHSGGGGDPQAARSSPWAGGEEREK
>gi568815582f:46784716_47028994|GENSCAN_predicted_CDS_1|315_bp
atggctgaaaacttcccaagactggggaagaaaagagaaatctggatccaggaagcccaa
aggatgccaaataagaagaatccaaagagatctacacagagacatattataatcaaacta
acaaaagccaagtggattcgtggctggtgcacagcgtggcagctgaggatgcagacgtgg
ctcacgggctgctcggcgccaccgccacgttggccagaggagtcggcgccagccaggctg
cacagtgggggcggcggggatccccaagcagctcggagtagcccctgggccggcggggaa
gaacgagaaaaatga
>gi568815582f:46784716_47028994|GENSCAN_predicted_peptide_2|122_aa
MSATRAKKVKMATKSCPECDQQILGPAGGGGDPGPVGLHRGVSREKPGAVARQRPPGGGQ
TAKPLLAVSRPLEERAFGLSFEKEKQEKEIDIYANLSDEKAFVFSVALAEINRKIINQRL
IL
>gi568815582f:46784716_47028994|GENSCAN_predicted_CDS_2|369_bp
atgtctgcaactcgagccaagaaagtgaagatggccaccaaatcatgccccgagtgcgac
caacagatcctggggcccgccggcggcggcggggatccggggccggtcgggctgcaccga
ggggtttcccgggaaaagcccggggccgtggcccgccagcgtccacctggcggcggccag
acggccaagccccttcttgctgtgtcccggccactggaggagcgcgcctttggtttatcc
tttgaaaaagagaaacaggaaaaggaaattgacatctatgctaacctgtctgatgaaaag
gctttcgtgttttcagtcgccttggcagaaataaatagaaaaattatcaatcaaagactt
attctctga
>gi568815582f:46784716_47028994|GENSCAN_predicted_peptide_3|911_aa
MNSSFLDLKSFKPGEAEAEAVNASFVCPPVARRWPAGGPRRRPTPGRSILSLCQMPRALG
HARGQEQGRYRLVEEEVAVEELLLLFRVGPAPARVVQARQVADQSLGRPSRDPREAEVEA
VRTASSTQLGASAVPVLKDGAEEILEHVVGLLVGRHAAHGPEYGWPGLSTRREATPGAQL
RVQLGQRLGHWVLVAAQVGKLVLRREASFSAPRLSEKGMCGRSFRSCFHIFGYLFSNAPL
YWYQFTVLVHFHAADKDIPETGKKTLVGSRRHRVALDDETRGDDCTIQQPASSQALQPLN
WDKEPAQPRPRSDERSRLLSGPAAPEQPGELQGPGGSAQARGGPDGGGVSLRKRAMQRAA
ALVRRGCGPRTPSSWGRSQSSAAAEASAVLKVRPERSRRERILTLESMNPQVKAVEYAVR
GPIVLKAGEIELELQRGIKKPFTEVIRANIGDAQAMGQQPITFLRQVMALCTYPNLLDSP
SFPEDAKKRARRILQACGGNSLGSYSASQGVNCIREDVAAYITRRDGGVPADPDNIYLTT
GASDGISTILKILVSGGGKSRTGVMIPIPQYPLYSAVISELDAIQVNYYLDEENCWALNV
NELRRAVQEAKDHCDPKVLCIINPGNPTGQTMSGERPCPCLVELMGASLGRKKLTLHPVA
TANASLFTICCCSLFEDPLVTVPAVPPQVYQDNVYSPDCRFHSFKKVLYEMGPEYSSNVE
LASFHSTSKGYMGECGYRGGYMEVINLHPEIKGQLVKLLSVRLCPPVSGQAAMDIVVNPP
VAGEESFEQFSREKESVLGNLAKKAKLTEDLFNQVPGIHCNPLQGAMYAFPRIFIPAKAV
EAAQAHQMAPDMFYCMKLLEETGICVVPGSGFGQREGTYHFRMTILPPVEKLKTVLQKVK
DFHINFLEKYA
>gi568815582f:46784716_47028994|GENSCAN_predicted_CDS_3|2736_bp
atgaactcgtcgttcctagacttgaagagtttcaaaccgggagaggccgaggcagaagca
gtaaatgcctccttcgtttgcccccctgttgcacggagatggcccgcaggtggtcctcgt
cgccgccccacaccaggaaggtctatactgtccttgtgccagatgccgcgggcgttgggc
catgcccgaggccaggagcaggggcggtacaggctcgtggaagaggaagtggctgtcgag
gagctgctgttgctgttccgcgtcggtcctgctcctgcgcgcgtcgtccaggcccgccag
gtcgccgaccagtctctagggcgtccatcgcgggacccacgggaggcagaagtggaggcc
gtgcgcaccgcgagctcaacacagttgggggccagtgcggtgccggtcctcaaggatggg
gccgaagagatccttgaacacgtcgtaggactcctcgtcggccgccacgcggcccacggc
cctgagtacgggtggcccgggctgtccacgcggcgcgaagccacccctggcgcgcagctc
cgcgttcagctggggcagcgcctcggccactgggtcttggtggccgctcaggtcgggaaa
ctcgtcctgcgccgggaggcgagcttcagcgccccgcggctgtcggagaagggcatgtgc
gggcgctcgttccgaagttgctttcacattttcgggtatcttttcagcaacgccccactc
tactggtaccaatttactgtattagtccattttcatgctgctgataaagacatacctgag
actgggaagaaaactctggtgggcagccgcaggcaccgtgtggccttggatgatgaaact
cggggcgatgactgcaccatccagcagcctgcgtcctcgcaggctctccagcccctcaac
tgggacaaagagcccgcccagccccggccacgttcggacgaacgctcccgcctactgtcc
ggtcccgccgctccggagcagccaggggaactacagggcccaggaggcagcgcacaggcc
cggggcggccccgacggtggtggggtttctctccgcaagcgcgcgatgcagcgggcggcg
gcgctggtccggcggggctgtggtccccggacccccagctcctggggccgcagccagagc
agcgcggccgccgaggcctcggcggtgctcaaggtgcggcccgagcgcagccggcgcgag
cgcatcctcacgctggagtccatgaacccgcaggtgaaggcggtggagtacgccgtgcgg
ggacccatcgtgctcaaggccggcgagatcgagctcgagctgcagcggggtatcaaaaag
ccattcacagaggtcatccgagccaacatcggggacgcccaggctatggggcagcagcca
atcaccttcctccggcaggtgatggcactatgcacctacccaaacctgctggacagcccc
agcttcccagaagatgctaagaaacgtgcccggcggatcctgcaggcttgtggcgggaac
agcctggggtcctacagtgctagccagggtgtcaactgcatccgtgaagatgtggctgcc
tacatcaccaggagggatggcggtgtgcctgcggaccccgacaacatctacctgaccacg
ggagctagtgacggcatttctacgatcctgaagatcctcgtctccgggggcggcaagtca
cggacaggtgtgatgatccccatcccacaatatcccctctattcagctgtcatctctgag
ctcgacgccatccaggtgaattactacctggacgaggagaactgctgggcgctgaatgtg
aatgagctccggcgggcggtgcaggaggccaaagaccactgtgatcctaaggtgctctgc
ataatcaaccctgggaaccccacagggcagacgatgagcggggagaggccttgtccctgc
cttgtggagctcatgggtgcctctctggggagaaagaagctaactcttcacccggtggcc
acagccaatgcgagtctcttcaccatctgctgctgcagcctctttgaggaccctttggtg
accgtccctgccgtgcccccgcaggtgtaccaggacaacgtgtactctccagattgcaga
ttccactccttcaagaaggtgctgtacgagatggggcccgagtactccagcaacgtggag
ctcgcctccttccactccacctccaagggctacatgggcgagtgtggttacagaggaggc
tacatggaggtgatcaacctgcaccctgagatcaagggccagctggtgaagctgctgtcg
gtgcgcctgtgccccccagtgtctgggcaggccgccatggacattgtcgtgaaccccccg
gtggcaggagaggagtcctttgagcaattcagccgagagaaggagtcggtcctgggtaat
ctggccaaaaaagcaaagctgacggaagacctgtttaaccaagtcccaggaattcactgc
aaccccttgcagggggccatgtacgccttccctcggatcttcattcctgccaaagctgtg
gaggctgctcaggcccatcaaatggctccagacatgttctactgcatgaagctcctggag
gagactggcatctgtgtcgtgcccggcagtggctttgggcagagggaaggcacttaccac
ttcaggatgactatcctccctccagtggagaagctgaaaacggtgctgcagaaggtgaaa
gacttccacatcaacttcctggagaagtacgcgtga
>gi568815582f:46784716_47028994|GENSCAN_predicted_peptide_4|573_aa
MDQSTGPDQADVDGWLCGHLLPNSLDTGDRGYNYDPNYLHLADPMGRLDLEENREPGPLF
NTSTPVAAPRALGGGLCTATSRRTVRWVKVQPRPRFPLAVSLGLCRRRRRLWARLRSVCF
ATFSPSTARPAAMANVADTKLYDILGVPPGASENELKKAYRKLAKEYHPDKNPNAGDKFK
EISFAYEVLSNPEKRELYDRYGEQGLREGSGGGGGMDDIFSHIFGGGLFGFMGNQSRSRN
GRRRGEDMMHPLKVSLEDLYNGKTTKLQLSKNVLCSACSGQGGKSGAVQKCSACRGRGVR
IMIRQLAPGMVQQMQSVCSDCNGEGEVINEKDRCKKCEGKKVIKEVKILEVHVDKGMKHG
QRITFTGEADQAPGVEPGDIVLLLQEKEHEKYFLLVRNRLDLFGVPGATLNTGEVKGIMV
FQRDGNDLHMTYKIGLVEALCGFQFTFKHLDGRQIVVKYPPGKVIEPGCVRVVRGEGMPQ
YRNPFEKGDLYIKFDVQFPENNWINPDKLSELEDLLPSRPEVPNIIGETEEVELQEFDST
RGSGGGQRREAYNDSSDEESSSHHGPGVQCAHQ
>gi568815582f:46784716_47028994|GENSCAN_predicted_CDS_4|1722_bp
atggaccagtccaccggacctgaccaggcagacgtggatggatggctgtgtggacacctg
ttgcccaactctctggacacaggtgaccgaggatacaattatgaccccaactacctgcac
ttagcagatcccatgggccgcctggacctggaggaaaacagagaacctggcccactcttc
aatacttcgaccccagtggccgcgcctcgggcgctcgggggcgggctctgcacagcgacg
tcacggcgcaccgtgcgctgggtcaaagttcagccccgcccccgcttccccctcgctgtc
tccctcggcctgtgccgccgccgacgccgcttgtgggcccgactccgctctgtctgcttc
gccaccttctccccgagcactgcccggccggccgccatggctaacgtggctgacacgaag
ctgtacgacatcctgggcgtcccgcccggcgccagcgagaacgagctgaagaaggcatac
agaaagttagccaaggaatatcatcctgataagaatccaaatgcaggagacaaatttaaa
gaaataagttttgcatatgaagtactatcaaatcctgagaagcgtgagttatatgacaga
tacggagagcaaggtcttcgggaaggcagcggcggaggtggtggcatggatgatattttc
tctcacatttttggtgggggattgttcggcttcatgggcaatcagagtagaagtcgaaat
ggcagaagaagaggagaggacatgatgcatccactcaaagtatctttagaagatctgtat
aatggcaagacaaccaaactacaacttagcaagaatgtgctctgtagtgcatgcagtggc
caaggcggaaagtctggagctgtccaaaagtgtagtgcttgtcgaggtcgaggtgtgcgc
atcatgatcagacagctggctccagggatggtacaacagatgcagtctgtgtgctctgat
tgtaatggagaaggagaggtaattaatgaaaaagaccgctgtaaaaaatgtgaagggaag
aaggtgattaaagaagtcaagattcttgaagtccacgtagacaaaggcatgaaacatgga
cagagaattacattcactggggaagcagaccaggccccaggagtggaacccggagacatt
gttcttttgctacaggagaaagaacatgagaagtacttcttgcttgtgagaaataggtta
gatctgtttggagtccctggagcaacactgaacactggtgaagttaagggcatcatggta
tttcagagagatgggaatgatttgcacatgacatataaaataggacttgttgaagctcta
tgtggatttcagttcacatttaagcaccttgatggacgtcagattgtggtgaaatacccc
cctggcaaagtaattgaaccagggtgtgttcgtgtagttcgaggtgaagggatgccgcag
tatcgtaatccctttgaaaaaggtgatctttacataaagtttgatgtgcagtttcctgaa
aacaactggatcaacccagacaagctttctgaactagaagatcttctgccatctagaccg
gaagttcctaacataattggagaaacagaggaggtagagcttcaggaatttgatagcact
cgaggctcaggaggtggtcagaggcgtgaagcctataatgatagctctgatgaagaaagc
agcagccatcatggacctggagtgcagtgtgcccatcagtaa