GENSCAN 1.0    Date run: 8-Nov-116    Time: 03:06:27

Sequence gi568815592f:43670708_43882087 : 211380 bp : 50.07% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +     82    242  161  2  2  104   39   293 0.999  24.10
 1.02 PlyA +   1872   1877    6                               1.05

 2.07 PlyA -   2020   2015    6                               1.05
 2.06 Term -   4312   4284   29  2  2  122   44    12 0.273  -1.56
 2.05 Intr -   4545   4495   51  1  0   57  119    40 0.715   2.88
 2.04 Intr -   4910   4787  124  2  1   94   75   156 0.998  15.06
 2.03 Intr -   7918   7811  108  1  0  112   86   108 0.990  13.48
 2.02 Intr -  10413  10382   32  1  2  121   92    26 0.990   4.15
 2.01 Init -  17072  16961  112  2  1   80  113    80 0.972   8.34
 2.00 Prom -  25399  25360   40                              -1.86

 3.10 PlyA -  27812  27807    6                               1.05
 3.09 Term -  31817  31641  177  2  0   89   54     3 0.127  -5.31
 3.08 Intr -  35425  35334   92  2  2   41   98   115 0.494   7.51
 3.07 Intr -  39826  39757   70  1  1  134   80    -9 0.028   1.65
 3.06 Intr -  46773  46716   58  2  1   55   89    71 0.141   2.79
 3.05 Intr -  46904  46800  105  1  0  -11  105   129 0.042   4.03
 3.04 Intr -  67103  67029   75  1  0   73   99    32 0.313   1.43
 3.03 Intr -  68433  68356   78  2  0   75   92    33 0.300   1.07
 3.02 Intr -  74413  74282  132  0  0  142   75    48 0.774   8.66
 3.01 Init -  91480  91377  104  1  2   78   27   121 0.038   4.71
 3.00 Prom -  92997  92958   40                              -7.66

 4.00 Prom +  97857  97896   40                              -8.56
 4.01 Init + 100540 100605   66  0  0   88  121    71 0.921   9.48
 4.02 Intr + 103634 103685   52  1  1  115  113   -12 0.903   2.48
 4.03 Intr + 106762 106958  197  2  2   66   80   338 0.983  29.93
 4.04 Intr + 107753 107829   77  1  2  147   94    33 0.980   8.11
 4.05 Intr + 108182 108211   30  2  0  114  106     2 0.762   1.85
 4.06 Intr + 110025 110104   80  0  2  124  -23   105 0.166   2.19
 4.07 Term + 113834 114003  170  0  2  124   44    60 0.482   3.24
 4.08 PlyA + 114243 114248    6                               1.05

 5.17 PlyA - 114289 114284    6                              -0.45
 5.16 Term - 114611 114583   29  0  2   31   39    36 0.080  -8.76
 5.15 Intr - 114875 114746  130  2  1   15   94   153 0.238   8.87
 5.14 Intr - 116013 115922   92  1  2   87   98     6 0.124   1.21
 5.13 Intr - 133753 133472  282  2  0   71   92    63 0.188   2.39
 5.12 Intr - 135323 135163  161  1  2   59   42    89 0.285   1.03
 5.11 Intr - 138915 138825   91  1  1   70   68   117 0.783   6.95
 5.10 Intr - 140528 140507   22  2  1   77  113    18 0.584   0.32
 5.09 Intr - 140923 140758  166  0  1   19   98    74 0.158   1.46
 5.08 Intr - 147189 147078  112  1  1   44   91   101 0.359   5.44
 5.07 Intr - 154006 153887  120  2  0   51   51    77 0.013   0.77
 5.06 Intr - 183884 183829   56  1  2   80   91    81 0.876   6.22
 5.05 Intr - 185516 185394  123  1  0  112   97     6 0.848   3.60
 5.04 Intr - 200597 200547   51  1  0   98   87    14 0.090   0.32
 5.03 Intr - 203099 203063   37  0  1  111   84    20 0.397   1.22
 5.02 Intr - 205599 205293  307  0  1   99   80    85 0.383   4.82
 5.01 Intr - 207302 207174  129  2  0   82   49    53 0.524   1.69

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592f:43670708_43882087|GENSCAN_predicted_peptide_1|53_aa
XSWSIQMERGNALVVLRSLLWPGLTFYHAPRTKNYGYVYVGTGEKNMDLPFML

>gi568815592f:43670708_43882087|GENSCAN_predicted_CDS_1|162_bp
nggtcctggagcatccagatggagaggggcaatgccctggtggtgctgcgcagcctgctc
tggccgggcctcaccttctaccatgctccccgcaccaagaactatggctacgtctacgtg
ggcactggcgagaagaacatggacttgcccttcatgctatag

>gi568815592f:43670708_43882087|GENSCAN_predicted_peptide_2|151_aa
MAALKALVSGCGRLLRGLLAGPAATSWSRLPARGFREVVETQEGKTTIIEGRITATPKES
PNPPNPSGQCPICRWNLKHKYNYDDVLLLSQFIRPHGGMLPRKITGLCQEEHRKIEECVK
MAHRAGLGFLKELFRRANPNSTGPLAKAQGG

>gi568815592f:43670708_43882087|GENSCAN_predicted_CDS_2|456_bp
atggcggccctcaaggctctggtgtccggctgtgggcggcttctccgtgggctactagcg
ggcccggcagcgaccagctggtctcggcttccagctcgcgggttcagggaagtggtggag
acccaagaagggaagacaactataattgaaggccgtatcacagcgactcccaaggagagt
ccaaatcctcctaacccctctggccagtgccccatctgccgttggaacctgaagcacaag
tataactatgacgatgttctgctgcttagccagttcatccggcctcatggaggcatgctg
ccccgaaagatcacaggcctatgccaggaagaacaccgcaagatcgaggagtgtgtgaag
atggcccaccgagcaggcctcggcttcctgaaggagttgttccgaagagcaaaccccaac
tcaaccgggcccttagcaaaggcgcaaggagggtag

>gi568815592f:43670708_43882087|GENSCAN_predicted_peptide_3|296_aa
MAKTFNTCYGSGTVHIELTAYAKPDINLNFWMVERSHCPLAKPSPHTVPSSSPQRSGLLR
VWGIPLHVTDIHSYGNVKRHVLSGPPALACTASSEKVQPITKARRLAGQRENFGQDLGLE
FPSKWAVSISEGDDGEGDGGGDGGGNDGDSDGGGGSDVIVIVGMRVQDHEETVGHVTEAY
RNIVKTYWTPAPFTAQQGGGYKVPIWPADLTGEEEHLATGILITVMEEEAGSGVLTKQPS
GSQPWLHIEPFKEPLKNAAAWAPALREFNSTVLFNSGERPGHHLLIPWLMLMSSQE

>gi568815592f:43670708_43882087|GENSCAN_predicted_CDS_3|891_bp
atggccaaaacatttaacacctgctatggctcaggcacagttcacatagagctgacggcg
tatgccaagccagacataaacctcaatttctggatggtggagaggtcccattgcccccta
gccaagccctctcctcatactgtccccagctcatccccacaaaggagcggccttcttcgt
gtttggggcattcctttgcatgtcactgatattcacagctatggcaatgtcaaaagacat
gttctgtctggcccacccgcattggcctgcacagcgtcctcagaaaaggtgcagcccatc
accaaagccagaaggctggctggccagagagagaatttcggacaggatttaggcctggaa
tttccctccaaatgggcggtgtcgatcagtgaaggggatgatggtgaaggtgatggcggt
ggtgatggaggtggcaatgatggagacagcgatggtggtggagggagtgatgtgattgtc
atagtgggaatgagggtacaggaccatgaagaaaccgttggccatgtcactgaggcctac
aggaacatcgtgaaaacttactggacacccgctccattcacagcccagcaaggtggaggt
tataaggtaccaatatggccggctgacctgactggagaggaggaacatctggcaactgga
atcttaataacagtaatggaagaggaagcaggctctggtgtcctcaccaagcagcccagt
ggttctcaaccctggctgcacatagaaccatttaaggagcctttaaaaaacgctgctgcc
tgggctccagccctgagagagttcaattcaactgttctgtttaactccggggagaggcca
ggccatcatcttttaattccctggttgatgctgatgagcagccaggaatga

>gi568815592f:43670708_43882087|GENSCAN_predicted_peptide_4|223_aa
MNFLLSWVHWSLALLLYLHHAKWSQAAPMAEGGGQNHHEVVKFMDVYQRSYCHPIETLVD
IFQEYPDEIEYIFKPSCVPLMRCGGCCNDEGLECVPTEESNITMQIMRIKPHQGQHIGEM
SFLQHNKCECRPKKDRARQEKKSVRGKGKGQKRKRKKSRYKSWSVYVDVTSRGGEPGRRK
EPPSGFREPDLSPGKTDTERSIQKPRCRHHTITIDRTVLNPET

>gi568815592f:43670708_43882087|GENSCAN_predicted_CDS_4|672_bp
atgaactttctgctgtcttgggtgcattggagccttgccttgctgctctacctccaccat
gccaagtggtcccaggctgcacccatggcagaaggaggagggcagaatcatcacgaagtg
gtgaagttcatggatgtctatcagcgcagctactgccatccaatcgagaccctggtggac
atcttccaggagtaccctgatgagatcgagtacatcttcaagccatcctgtgtgcccctg
atgcgatgcgggggctgctgcaatgacgagggcctggagtgtgtgcccactgaggagtcc
aacatcaccatgcagattatgcggatcaaacctcaccaaggccagcacataggagagatg
agcttcctacagcacaacaaatgtgaatgcagaccaaagaaagatagagcaagacaagaa
aaaaaatcagttcgaggaaagggaaaggggcaaaaacgaaagcgcaagaaatcccggtat
aagtcctggagcgtgtacgttgatgtgacaagccgaggcggtgagccgggcaggaggaag
gagcctccctcagggtttcgggaaccagatctctcaccaggaaagactgatacagaacga
tcgatacagaaaccacgctgccgccaccacaccatcaccatcgacagaacagtccttaat
ccagaaacctga

>gi568815592f:43670708_43882087|GENSCAN_predicted_peptide_5|635_aa
GVLQVPLPSISPSPAASGGTAQSPMSVVLAGLSTAVSQLLAMGRLLTRRLHGRLGLIGSK
HWAYFRAGLAAPDATDREKNLQGDTWDPRNAGGQALERWGGIGCPSQTTGHPGGISQYMV
SCSWTITLSVEATHSALEGDPLRCVVNSHSNQREPVKVALIGVCIPQVCNAEKEKQRTGR
LQGTQGQGTRLAGETQGLTVYSGDSSPGSVIRPHASEVLKMTERFSTRILNQGAGCDADS
FGSHVPLLGWGDSHSPACCEKEEVYVTAQRLQALKCFSDITRDSMDVSGVKIILCPSDMG
YQGYNEQMRKTRASHAGQSGTSPPPAPSQPPSLPITHCHLTPPATLPESIQEKAKVFRTR
SLLSGRGLKSSVITRQIHPFTEHMLSTHYMPGPMLVLALNEFFACQSLIPVSTRRQRALS
LPASPVALQPSVAVMEQGVLMQGIATTVPPPCEPGPQTGGLLDQLEGAEDFHLGMTCSSY
HLFLFCNIINNQPLGETPSLHFIPFSPDWPVGEPQSHVVLQSCWALKEEALKFLSQQPGS
RLTLPLTGCMTLELPTFYIKTGWMGSPRTSSAKMEGLCPQHNKEGEQEEDEGESQEGELS
WAASSNNVSLLFAGTSASGLLGNSEAEPVSSSLYL

>gi568815592f:43670708_43882087|GENSCAN_predicted_CDS_5|1908_bp
ggggtgctgcaagttcctctgcccagcatctccccctccccagcagcatctggtggaaca
gcccagtcccctatgtctgtggttctggcaggactgtcaactgctgtgtcccagctcctg
gccatggggcgcctgcttacccgaaggctccatgggaggctgggcctcataggaagcaag
cactgggcatacttcagggctgggttagcagcgccagatgctacagacagggagaagaat
cttcagggggatacctgggaccccaggaatgcaggggggcaggcactggaaagatggggg
gggatcggctgccccagccagacaactggccatccaggtggcatcagccagtacatggtt
tcctgcagctggaccataactctgagtgtggaggccacacactcagcccttgaaggagac
ccacttcgctgtgtggtgaattcacacagcaaccagagagagcctgttaaagtggctctc
ataggagtttgcatccctcaagtatgcaatgcagagaaggaaaaacagaggactggccgg
ctgcagggcacacagggccaaggcaccaggttggcaggagaaactcagggcctgactgtt
tatagtggggatagctccccaggctctgtcatcagacctcacgcaagtgaggtcctcaag
atgactgagcgcttcagcaccaggattcttaaccaaggagcaggctgtgatgctgactcc
tttggctctcacgtccctctcctgggctggggtgacagccacagccctgcttgctgcgag
aaggaggaagtgtatgtgacagcccagcgtctgcaagcgctcaagtgtttttctgacatc
acccgggacagcatggatgtttcaggtgtcaagatcattttatgtccttcagacatgggg
taccaaggatacaatgaacagatgcgtaagacaagagccagccatgcaggacaaagtggt
acctctccacccccagcccccagccagcctccaagcctccccatcacccactgccacctg
acaccacctgccacactgcccgagagcatccaagaaaaggccaaagtcttcagaacaagg
tccctgctgagtgggagggggcttaaatcatctgtcatcacacgtcaaattcacccattc
accgaacacatgctgagcacccactacatgccaggccctatgctggtgctggcccttaac
gagttctttgcctgccaatctctgatcccggtgtctacacgcaggcaacgggccctctca
ctgccagctagtcctgtggctttgcagccctcggtagcagtgatggaacagggggtgctg
atgcaggggatagccactactgtccctcctccttgtgaaccaggaccacaaacaggaggc
cttctagatcagctggaaggagctgaggatttccacttgggcatgacctgttcctcctac
cacctcttcctcttctgcaatatcattaacaaccaacctttaggggaaactccctctctt
cacttcattcccttcagcccagactggccagtgggagagccccagagccatgtggtgctg
caatcctgctgggcactcaaagaagaagccctgaaatttctcagccagcagccaggttcc
cggctcactctgccactcacaggctgcatgaccctggagcttcccaccttctatatcaag
acaggatggatgggttcccccaggaccagctcagccaaaatggaaggactttgcccccag
cacaacaaggaaggggagcaggaagaggatgagggcgagtcccaggaaggggagctgtca
tgggctgcttcttccaacaatgtgtctcttctcttcgccgggacatctgccagtggtctc
ctgggcaactcagaagcagagccggtgtcctcatccctgtacctgtga