GENSCAN 1.0 Date run: 7-Nov-116 Time: 00:21:46 Sequence gi568815587f:117886468_118099638 : 213171 bp : 47.20% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.18 PlyA - 906 901 6 1.05 1.17 Term - 15397 15176 222 1 0 61 50 144 0.929 4.82 1.16 Intr - 17340 17188 153 0 0 115 94 123 0.995 15.87 1.15 Intr - 17634 17492 143 1 2 28 94 97 0.914 4.37 1.14 Intr - 19269 19171 99 1 0 62 95 26 0.564 0.78 1.13 Intr - 22317 22145 173 2 2 112 93 419 0.998 44.39 1.12 Intr - 23501 23339 163 0 1 89 86 114 0.965 10.33 1.11 Intr - 24283 24240 44 0 2 139 68 -11 0.461 -0.12 1.10 Intr - 25393 25301 93 0 0 106 80 25 0.836 2.58 1.09 Intr - 27439 27310 130 2 1 114 76 119 0.999 13.05 1.08 Intr - 28047 27925 123 1 0 130 91 156 0.996 20.66 1.07 Intr - 30807 30703 105 1 0 115 65 137 0.599 14.29 1.06 Intr - 31354 31248 107 0 2 108 59 -29 0.919 -3.94 1.05 Intr - 31979 31942 38 2 2 51 110 47 0.580 0.26 1.04 Intr - 32371 32070 302 2 2 73 76 229 0.877 16.65 1.03 Intr - 39644 39529 116 1 2 111 69 75 0.150 7.99 1.02 Intr - 39818 39703 116 2 2 112 89 -13 0.100 0.45 1.01 Init - 42840 42820 21 0 0 100 84 6 0.052 1.16 1.00 Prom - 44304 44265 40 -6.46 2.00 Prom + 48648 48687 40 -6.36 2.01 Init + 51882 51931 50 2 2 86 57 50 0.318 2.12 2.02 Intr + 55893 55944 52 0 1 112 98 -53 0.248 -2.99 2.03 Intr + 59885 59974 90 1 0 113 79 215 0.851 23.29 2.04 Intr + 63859 64015 157 0 1 34 37 142 0.918 3.28 2.05 Intr + 91186 91263 78 2 0 78 101 23 0.119 2.12 2.06 Term + 91935 92713 779 1 2 75 40 214 0.102 8.83 2.07 PlyA + 94548 94553 6 1.05 3.00 Prom + 98931 98970 40 -4.96 3.01 Init + 100001 100067 67 1 1 72 91 147 0.999 12.63 3.02 Intr + 101915 102035 121 0 1 107 78 52 0.888 5.65 3.03 Intr + 102975 103153 179 0 2 76 44 288 0.987 22.76 3.04 Intr + 106774 106943 170 2 2 33 68 263 0.874 18.47 3.05 Intr + 107532 107682 151 2 1 83 65 39 0.900 0.84 3.06 Intr + 109122 109243 122 1 2 64 105 230 0.953 22.51 3.07 Term + 112248 113174 927 2 0 132 41 624 0.591 54.26 3.08 PlyA + 114993 114998 6 1.05 4.05 PlyA - 117186 117181 6 1.05 4.04 Term - 127413 127314 100 2 1 76 44 176 0.958 9.60 4.03 Intr - 128274 128241 34 1 1 105 117 -2 0.965 1.68 4.02 Intr - 129342 129226 117 1 0 68 96 173 0.969 16.54 4.01 Init - 131281 131272 10 1 1 85 74 16 0.711 -0.37 4.00 Prom - 132624 132585 40 -3.86 5.03 PlyA - 133999 133994 6 1.05 5.02 Term - 150870 150634 237 0 0 90 54 201 0.940 13.07 5.01 Init - 165281 165237 45 2 0 79 91 33 0.113 1.43 5.00 Prom - 168256 168217 40 -3.76 6.00 Prom + 183049 183088 40 -2.66 6.01 Init + 188529 188590 62 2 2 93 103 45 0.249 5.74 6.02 Intr + 199046 199073 28 2 1 71 87 27 0.023 -0.98 6.03 Intr + 208355 208388 34 1 1 142 123 16 0.966 8.30 6.04 Intr + 212518 212631 114 2 0 113 89 115 0.938 14.52 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 69335 69471 137 0 2 52 48 114 0.890 1.98 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587f:117886468_118099638|GENSCAN_predicted_peptide_1|715_aa MERDSHGVCSPHLVLQGPWHLPYASAGVHPSPRPPRTLAPALCPCRCASLTSSSKDPGTC PMPLQVCSLHLVLQGPWYLPCAPAECISSKNTFSWSISSPGISSWDTSRPGISSPGISSP GISSWDTSGPGISSPGISSWYTSRPGISRPGISSPGISSPGISSPGISGSGITFQVLIRQ VIIRQVSTSNQGHQGEPRIQTVPTTTIHTGLDDTWLWRITRSLGPIPQPCPLQGTSLPKF TWREGQKQLPLIGCVLLLIALVVSLIILFQFWQGHTGIRYKEQRESCPKHAVRCDGVVDC KLKSDELGCVRFDWDKSLLKIYSGSSHQWLPICSSNWNDSYSEKTCQQLGFESAHRTTEV AHRDFANSFSILRYNSTIQESLHRSECPSQRYISLQCSHCGLRAMTGRIVGGALASDSKW PWQVSLHFGTTHICGGTLIDAQWVLTAAHCFFVTREKVLEGWKVYAGTSNLHQLPEAASI AEIIINSNYTDEEDDYDIALMRLSKPLTLSAHIHPACLPMHGQTFSLNETCWITGFGKTR ETDDKTSPFLREVQVNLIDFKKCNDYLVYDSYLTPRMMCAGDLRGGRDSCQGDSGGPLVC EQNNRWYLAGVTSWGTGCGQRNKPGVYTKVTEVLPWIYSKMETSPVWADASGPGGAIAVL GWIVGFGGCSFPGPGPSSVKSCSRWLYEHQVLTQTPCWCRGCHQSDCAVANQADN >gi568815587f:117886468_118099638|GENSCAN_predicted_CDS_1|2148_bp atggagagggacagccacggggtgtgtagccctcacctcgtcctccaaggaccctggcac ctgccctatgcctctgcaggtgtgcatccctcacctcgtcctccaaggaccctggcacct gccctatgcccctgcaggtgtgcatccctcacctcgtcctccaaggaccctggcacctgc cctatgcccctgcaggtgtgcagccttcacctcgtcctccaaggaccctggtacctgccc tgtgcccctgcagaatgcatctccagcaagaacaccttcagctggagcatctccagccca ggcatctccagctgggacacctccaggccgggcatctccagcccaggcatctccagccca ggcatctccagctgggacacctccgggccgggcatctccagcccaggcatctccagctgg tacacctccaggccgggcatctccaggccgggcatctccagcccaggcatctccagccca ggcatctccagcccgggcatctccggctctggcatcactttccaggtcctcatccggcag gtcatcatccgccaggtcagcaccagcaaccagggccaccagggagagcccagaattcag actgtccccaccaccacaatacacacaggcctggatgatacgtggctttggaggatcact cgaagtttggggcccatcccacagccttgccctctccagggtacgagcctgcccaagttc acctggcgggagggccagaagcagctaccgctcatcgggtgcgtgctcctcctcattgcc ctggtggtttcgctcatcatcctcttccagttctggcagggccacacagggatcaggtac aaggagcagagggagagctgtcccaagcacgctgttcgctgtgacggggtggtggactgc aagctgaagagtgacgagctgggctgcgtgaggtttgactgggacaagtctctgcttaaa atctactctgggtcctcccatcagtggcttcccatctgtagcagcaactggaatgactcc tactcagagaagacctgccagcagctgggtttcgagagtgctcaccggacaaccgaggtt gcccacagggattttgccaacagcttctcaatcttgagatacaactccaccatccaggaa agcctccacaggtctgaatgcccttcccagcggtatatctctctccagtgttcccactgc ggactgagggccatgaccgggcggatcgtgggaggggcgctggcctcggatagcaagtgg ccttggcaagtgagtctgcacttcggcaccacccacatctgtggaggcacgctcattgac gcccagtgggtgctcactgccgcccactgcttcttcgtgacccgggagaaggtcctggag ggctggaaggtgtacgcgggcaccagcaacctgcaccagttgcctgaggcagcctccatt gccgagatcatcatcaacagcaattacaccgatgaggaggacgactatgacatcgccctc atgcggctgtccaagcccctgaccctgtccgctcacatccaccctgcttgcctccccatg catggacagacctttagcctcaatgagacctgctggatcacaggctttggcaagaccagg gagacagatgacaagacatcccccttcctccgggaggtgcaggtcaatctcatcgacttc aagaaatgcaatgactacttggtctatgacagttaccttaccccaaggatgatgtgtgct ggggaccttcgtgggggcagagactcctgccagggagacagcggggggcctcttgtctgt gagcagaacaaccgctggtacctggcaggtgtcaccagctggggcacaggctgtggccag agaaacaaacctggtgtgtacaccaaagtgacagaagttcttccctggatttacagcaag atggagactagcccagtgtgggcagatgccagcggcccaggtggcgccattgctgtcctg ggatggatcgtgggttttggtggatgcagcttcccagggcctggaccgtcttcggtgaaa agctgctcccgttggctttatgagcatcaagtcctcacccagaccccctgctggtgccgt ggatgtcaccagtcggactgtgctgtggctaaccaggctgacaactga >gi568815587f:117886468_118099638|GENSCAN_predicted_peptide_2|401_aa MAYAQPWVGYSPEVEPCPILPRGQPLRLESLFSKMAHGFIIIIIIIIIIIIIMIRSSLNI LMSKTFLVVTTSGEECRWHQLDRGQDTANNPKMHKTALHENYPDKNVTNVEAEKPSDLTP SCVRNWWVLGLTDFKNEAADPRDSGAQLASPNGSRTGASGGAACQSCAVRSHSSALGWSM GLGALEQGVVLHGEARASEEPMEWVGGSGMAGCRSRALPRGKAAKARQEIEHSAGGPALL GDPVHPPQPLARVLSPSLPGASRAGWLFRVRGPPSPRPPGTPAGPQAPHAAPVAAGASPS TPPCKLREWAPALANPERGSHSAVGGLKGSSNAAKVGAQAGEVPRASEGSEDCQQAVTSH SHLRSHYQLPEHFVSGFTTAQIPILPTANRAKGTGSRVHDW >gi568815587f:117886468_118099638|GENSCAN_predicted_CDS_2|1206_bp atggcctacgcccagccctgggttggctactccccagaagtggaaccttgtcccattctc ccccgaggccagccgttgagactggaatccctcttttccaagatggcccacggattcatc atcatcatcatcatcatcatcatcatcatcatcatcatgatcagatcttcactgaacatc ctaatgtccaagacatttttggttgtcacaactagtggtgaggaatgccgctggcatcaa ttggatcgaggccaggacactgctaacaaccccaaaatgcacaagacagccctccacgag aattacccagacaaaaatgtcacaaatgttgaggctgagaaaccctccgacctcactccc tcttgtgtccggaattggtgggttcttggtctcactgacttcaagaatgaagctgcggac cctcgcgactcaggagcccagctggcttcacctaatggatcccgcaccggggcttcaggt ggagctgcctgccagtcctgcgccgtgcgctcgcattcctcagcccttgggtggtcgatg ggactgggcgccttggagcagggggtggtgctccacggggaggctcgggcctcagaggag cccatggagtgggtgggtggctcaggcatggcgggctgcaggtcccgagccctgccccgc gggaaggcagctaaggcccggcaagaaatcgagcacagcgccggtgggccggcactgctg ggggacccagtacaccctccgcagccactggcccgggtgctaagtccctcattgcccggg gccagcagggctggctggctgttccgagtgcggggcccgccaagcccacgcccacccgga actccagctggcccgcaagcgccgcacgcagccccggttgccgctggagcctctccctcc acacctccctgcaagctgagggagtgggctccagccttggccaacccagaaaggggctcc cacagtgcagtgggggggctgaagggctcctcaaatgccgccaaagtgggagcccaggca ggggaggtgccgagagcaagcgagggctctgaggactgccagcaggctgtcacctctcac tctcacttgagaagccattatcagttaccagaacactttgtgtctggctttacaacagcc cagattccaatcttgccaacagctaatagagccaaagggaccggttccagggtgcacgac tggtag >gi568815587f:117886468_118099638|GENSCAN_predicted_peptide_3|578_aa MLPCLVVLLAALLSLRLGSDAHGTELPSPPSVWFEAEFFHHILHWTPIPNQSESTCYEVA LLRYGIESWNSISNCSQTLSYDLTAVTLDLYHSNGYRARVRAVDGSRHSNWTVTNTRFSV DEVTLTVGSVNLEIHNGFILGKIQLPRPKMAPANDTYESIFSHFREYEIAIRKVPGNFTF THKKVKHENFSLLTSGEVGEFCVQVKPSVASRSNKGMWSKEECISLTRQYFTVTNVIIFF AFVLLLSGALAYCLALQLYVRRRKKLPSVLLFKKPSPFIFISQRPSPETQDTIHPLDEEA FLKVSPELKNLDLHGSTDSGFGSTKPSLQTEEPQFLLPDPHPQADRTLGNREPPVLGDSC SSGSSNSTDSGICLQEPSLSPSTGPTWEQQVGSNSRGQDDSGIDLVQNSEGRAGDTQGGS ALGHHSPPEPEVPGEEDPAAVAFQGYLRQTRCAEEKATKTGCLEEESPLTDGLGPKFGRC LVDEAGLHPPALAKGYLKQDPLEMTLASSGAPTGQWNQPTEEWSLLALSSCSDLGISDWS FAHDLAPLGCVAAPGGLLGSFNSDLVTLPLISSLQSSE >gi568815587f:117886468_118099638|GENSCAN_predicted_CDS_3|1737_bp atgctgccgtgcctcgtagtgctgctggcggcgctcctcagcctccgtcttggctcagac gctcatgggacagagctgcccagccctccgtctgtgtggtttgaagcagaatttttccac cacatcctccactggacacccatcccaaatcagtctgaaagtacctgctatgaagtggcg ctcctgaggtatggaatagagtcctggaactccatctccaactgtagccagaccctgtcc tatgaccttaccgcagtgaccttggacctgtaccacagcaatggctaccgggccagagtg cgggctgtggacggcagccggcactccaactggaccgtcaccaacacccgcttctctgtg gatgaagtgactctgacagttggcagtgtgaacctagagatccacaatggcttcatcctc gggaagattcagctacccaggcccaagatggcccccgcaaatgacacatatgaaagcatc ttcagtcacttccgagagtatgagattgccattcgcaaggtgccgggaaacttcacgttc acacacaagaaagtaaaacatgaaaacttcagcctcctaacctctggagaagtgggagag ttctgtgtccaggtgaaaccatctgtcgcttcccgaagtaacaaggggatgtggtctaaa gaggagtgcatctccctcaccaggcagtatttcaccgtgaccaacgtcatcatcttcttt gcctttgtcctgctgctctccggagccctcgcctactgcctggccctccagctgtatgtg cggcgccgaaagaagctacccagtgtcctgctcttcaagaagcccagccccttcatcttc atcagccagcgtccctccccagagacccaagacaccatccacccgcttgatgaggaggcc tttttgaaggtgtccccagagctgaagaacttggacctgcacggcagcacagacagtggc tttggcagcaccaagccatccctgcagactgaagagccccagttcctcctccctgaccct cacccccaggctgacagaacgctgggaaacagggagccccctgtgctgggggacagctgc agtagtggcagcagcaatagcacagacagcgggatctgcctgcaggagcccagcctgagc cccagcacagggcccacctgggagcaacaggtggggagcaacagcaggggccaggatgac agtggcattgacttagttcaaaactctgagggccgggctggggacacacagggtggctcg gccttgggccaccacagtcccccggagcctgaggtgcctggggaagaagacccagctgct gtggcattccagggttacctgaggcagaccagatgtgctgaagagaaggcaaccaagaca ggctgcctggaggaagaatcgcccttgacagatggccttggccccaaattcgggagatgc ctggttgatgaggcaggcttgcatccaccagccctggccaagggctatttgaaacaggat cctctagaaatgactctggcttcctcaggggccccaacgggacagtggaaccagcccact gaggaatggtcactcctggccttgagcagctgcagtgacctgggaatatctgactggagc tttgcccatgaccttgcccctctaggctgtgtggcagccccaggtggtctcctgggcagc tttaactcagacctggtcaccctgcccctcatctctagcctgcagtcaagtgagtga >gi568815587f:117886468_118099638|GENSCAN_predicted_peptide_4|86_aa MLNCEDSISTLGLILGVGLLLLLVSILGYSLAKWYQRGYCWEGPNFVFNLYQIRNLKDLE MGPPFTISGHISSTDGGYMKFSNGLV >gi568815587f:117886468_118099638|GENSCAN_predicted_CDS_4|261_bp atgctgaactgtgaggactccatcagcaccttgggcctgatccttggcgtggggctcttg ctgctgctcgtgtccatcctcggctacagcctggccaagtggtaccagcgcgggtactgc tgggaggggcctaattttgtcttcaacttatatcaaatccggaacctgaaggatctggag atgggtccacccttcaccatcagtggtcacatcagcagcacagatggtggctacatgaag ttctccaacgggctagtctga >gi568815587f:117886468_118099638|GENSCAN_predicted_peptide_5|93_aa MAGFVILPFLWCVLMNQSCKATNGSSNGAPDAVHGPLDRPASLCSDVNDIEGILLRKSQL HNRYYTPVQQEAVRVVAGQPPQQHLGFPVERGD >gi568815587f:117886468_118099638|GENSCAN_predicted_CDS_5|282_bp atggctggctttgttattctccccttcctgtggtgtgttttgatgaatcaaagctgtaaa gctacaaatggttcttcaaatggagccccagatgcagtccatggacccctggaccggcct gctagcctatgctctgatgttaatgacatcgaaggcatcctcctgagaaaatctcaactg cacaaccgctactacaccccagttcagcaggaagcagttagagtggtcgctggccaacct ccccaacagcacttgggttttcctgttgagaggggggactga >gi568815587f:117886468_118099638|GENSCAN_predicted_peptide_6|80_aa MPSSLHAQTAASFLVHHLSTSLKDNSGQAKDPDSDQPLNSLDVKPLRKPRIPMETFRKVG IPIIIALLSLASIIIVVVLX >gi568815587f:117886468_118099638|GENSCAN_predicted_CDS_6|240_bp atgcccagcagccttcatgcccagacagctgcctctttcctggttcatcacctgagcacc agtttaaaggacaactcaggacaagcaaaagatcctgacagtgatcaacctctgaacagc ctcgatgtcaaacccctgcgcaaaccccgtatccccatggagaccttcagaaaggtgggg atccccatcatcatagcactactgagcctggcgagtatcatcattgtggttgtcctcann