GENSCAN 1.0 Date run: 8-Nov-116 Time: 11:10:27 Sequence gi568815593f:151153217_151367448 : 214232 bp : 44.77% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 3170 3272 103 1 1 86 71 75 0.378 5.35 1.02 Intr + 20527 20750 224 2 2 119 53 118 0.604 9.25 1.03 Intr + 21679 21886 208 0 1 34 49 160 0.353 5.35 1.04 Term + 27096 27211 116 1 2 68 47 61 0.111 -1.27 1.05 PlyA + 28544 28549 6 1.05 2.15 PlyA - 29768 29763 6 -1.95 2.14 Term - 30398 30221 178 1 1 83 46 160 0.616 8.46 2.13 Intr - 31225 31128 98 1 2 62 65 110 0.977 4.91 2.12 Intr - 32325 32206 120 0 0 97 14 173 0.855 11.49 2.11 Intr - 32908 32807 102 1 0 77 96 41 0.906 4.17 2.10 Intr - 34243 34170 74 2 2 83 94 24 0.833 1.63 2.09 Intr - 45868 45781 88 1 1 87 116 85 0.988 10.74 2.08 Intr - 48472 48366 107 2 2 98 111 54 0.996 8.63 2.07 Intr - 52259 52184 76 2 1 92 109 54 0.497 6.99 2.06 Intr - 62470 62412 59 2 2 101 42 34 0.007 -1.30 2.05 Intr - 67521 67400 122 2 2 52 80 84 0.027 4.14 2.04 Intr - 70025 69935 91 0 1 110 77 -15 0.040 -1.35 2.03 Intr - 71004 70707 298 0 1 64 98 164 0.050 11.45 2.02 Intr - 81611 81524 88 1 1 51 68 63 0.006 0.57 2.01 Init - 93287 93121 167 2 2 51 -27 203 0.006 4.21 2.00 Prom - 95159 95120 40 -7.16 3.00 Prom + 98710 98749 40 -6.56 3.01 Init + 100001 100081 81 1 0 47 113 204 0.832 17.77 3.02 Intr + 105781 105861 81 0 0 -8 89 121 0.095 2.43 3.03 Intr + 106539 106700 162 2 0 77 105 103 0.911 11.07 3.04 Intr + 113515 113697 183 0 0 110 100 110 0.999 14.38 3.05 Term + 114080 114235 156 1 0 95 38 164 0.997 10.03 3.06 PlyA + 116003 116008 6 1.05 4.26 PlyA - 119660 119655 6 1.05 4.25 Term - 124445 124177 269 0 2 111 32 283 0.903 20.66 4.24 Intr - 127967 127798 170 1 2 91 76 201 0.998 18.79 4.23 Intr - 130994 130828 167 2 2 122 84 127 0.980 14.46 4.22 Intr - 131495 131397 99 2 0 83 95 73 0.993 7.81 4.21 Intr - 134248 134030 219 1 0 47 58 297 0.976 21.10 4.20 Intr - 135254 135170 85 1 1 155 103 2 0.997 8.22 4.19 Intr - 140243 140148 96 1 0 111 62 81 0.946 6.92 4.18 Intr - 143052 142964 89 0 2 144 55 100 0.998 11.07 4.17 Intr - 145467 145377 91 2 1 123 91 33 0.907 7.10 4.16 Intr - 150138 150011 128 0 2 44 97 61 0.366 2.08 4.15 Intr - 150931 150893 39 1 0 95 86 11 0.188 0.02 4.14 Intr - 156445 156261 185 2 2 49 43 193 0.285 10.41 4.13 Intr - 156950 156888 63 0 0 50 61 87 0.039 0.89 4.12 Intr - 157154 157132 23 1 2 38 117 19 0.032 -3.01 4.11 Intr - 163872 163605 268 1 1 118 97 468 0.949 47.29 4.10 Intr - 168999 168830 170 2 2 68 84 274 0.960 24.59 4.09 Intr - 172236 172070 167 0 2 71 30 172 0.352 8.46 4.08 Intr - 180106 180008 99 1 0 114 79 18 0.847 3.81 4.07 Intr - 182331 182113 219 0 0 126 109 344 0.704 38.80 4.06 Intr - 185928 185844 85 2 1 123 80 17 0.001 4.22 4.05 Intr - 189767 189672 96 1 0 76 86 56 0.469 3.32 4.04 Intr - 190382 190294 89 2 2 129 78 87 0.638 10.57 4.03 Intr - 191051 190961 91 1 1 110 99 99 0.990 13.20 4.02 Intr - 194040 194011 30 2 0 97 90 17 0.507 0.05 4.01 Init - 194244 194081 164 0 2 75 92 165 0.685 14.81 4.00 Prom - 198895 198856 40 -4.66 5.02 PlyA - 200735 200730 6 1.05 5.01 Sngl - 207724 207563 162 1 0 74 32 178 0.839 5.39 5.00 Prom - 208470 208431 40 -5.86 6.02 PlyA - 209818 209813 6 1.05 6.01 Term - 213442 213240 203 2 2 75 47 250 0.382 17.15 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 59548 59673 126 0 0 50 70 141 0.847 8.66 S.002 Init + 105775 105861 87 0 0 53 89 107 0.885 7.94 S.003 Init - 156948 156888 61 0 1 57 61 80 0.923 3.61 S.004 Term + 186802 187152 351 0 0 -37 41 509 0.950 28.19 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593f:151153217_151367448|GENSCAN_predicted_peptide_1|216_aa ASALGVAQGLPARRSGSTGAETGAVIHFTMKGYAGEIPEHGRTPKAQKMRGSKAAGPSEV QLSGAPCQRKPKADGSQKPMQMKTPMEAIASATPSPPPDPGVGKARFPKEYRAIPIACPS RPYHLSFRIKHFSQCSQLLLVNSSGADTGEYSCWAPHCQDGACAECLAYQRKTFIFFTEI NGASYLTLVCESEMHLGQLMDSPFLSPTVGPTTLHQ >gi568815593f:151153217_151367448|GENSCAN_predicted_CDS_1|651_bp gccagtgccctcggcgtggctcagggcctgcctgctaggaggtcagggtccacaggggct gaaactggggcggtcattcacttcaccatgaagggctacgcaggtgaaatcccggagcat ggcagaactccaaaggctcagaaaatgaggggctccaaagcagctggccccagtgaggtg cagctttcaggagccccctgccagagaaaacccaaagcagacgggagccagaagcccatg cagatgaagacccccatggaggctatagcctctgcaaccccaagccccccacctgaccca ggtgtagggaaggcccgattcccgaaggagtatcgggccatccccatcgcctgcccgtcc cgcccctaccatctgtctttcagaatcaagcacttcagccagtgcagccagctgctgctg gtgaactcctctggcgcagacaccggggaatatagctgttgggctccgcactgtcaggac ggcgcatgcgctgaatgcctggcctaccagagaaagactttcatcttcttcacagagata aatggtgcctcttacctgaccctggtgtgtgagtcagagatgcacctggggcagctgatg gacagtccgtttttatcccctaccgttgggccaaccactctacatcagtga >gi568815593f:151153217_151367448|GENSCAN_predicted_peptide_2|555_aa MPSAIKVQGLDWVHLSRIKPAIPEDPDQEPEVSISHYTCEPVEALKFLFKRQPKDDAFGI LTHTLMNYSSRYLTASAQRFLAILHKDSGHLPLVAAPLALRATLRPRSSGIYGGRPPQLR LKTALPGSVSGSRRQPQPAGAERAAGPGSQAPLGAPELGSGVREPPEEDGLQTQQAEQLQ TPEKGALELGQGPGDNSQVSCSETSCPLGSDEPRRVPGMMSELSIPEGCGYSSIVPSRTS EKGLSTAAKAGPQVTGNPALSFTTTLTLSAEEQGMKRQEPEPEQPPRPEPHELGPLNGDT AITVQLCASEEAERHQKDITRILQQHEEEKKKWAQQVEKERELELRDRLDEQQRVLEGKN EEALQVLRASYEQEKEALTHSFREASSTQQETIDRLTSQLEAFQAKMKRVEESILSRNYK KHIQDYGSPSQFWEQELESLHFVIEMKNERIHELDRRLILMETVKEKNLILEEKITTLQQ ENEDLHVRSRNQVVLSRQLSEDLLLTREALEKEVQLRRQLQQEKEELLYRVLGANASPAF PLAPVTPTEVSFLAT >gi568815593f:151153217_151367448|GENSCAN_predicted_CDS_2|1668_bp atgccttctgccatcaaagtacaaggattagattgggtacatctttcaaggatcaagcca gcaataccagaagatccggaccaggaacctgaagtttccatcagccactacacctgtgaa cctgtggaagccctgaagttcctgtttaaaagacagccaaaagatgatgcctttgggatc ctcacccacactctcatgaactattccagtagatacttgactgcttcagcacagcgcttc ctcgccattctccacaaggattctggtcacctgcctctggtggcggcgccgctggccctg cgggctacactgaggccccgctcttcgggtatttacggcgggcggccgccccagctcagg ctcaagacagcgcttccaggttcagtttcagggtctcgccggcagccccagccggcgggc gcggagcgggcagcggggcccggatcgcaggctcctctgggggccccggagttgggaagc ggcgtccgggagcccccggaggaggatgggctgcagacacagcaggctgagcagctgcaa acccccgaaaaaggggccttggagctgggtcagggcccaggggacaattctcaggtttct tgcagtgagaccagctgccccttggggtctgacgagcccagaagagtaccaggaatgatg tctgaactctccattcccgaaggctgtggctatagctccatagttccaagcaggacctct gagaaggggctcagcacagctgccaaagctgggcctcaggtgacaggcaatcccgccctc tcttttacaaccacccttacactgtctgctgaagaacaggggatgaagcgccaagaacca gaaccagaacagccacccagaccagagccccatgaattaggtcccctcaatggggacaca gctataactgtccagctctgtgcatcagaggaggctgagcggcaccagaaggatataacc agaattctccagcaacatgaggaggaaaagaagaaatgggcacaacaggtggagaaggaa agggagctagagcttcgagacagactggatgagcagcaaagggtcctggaaggaaagaat gaagaggccctgcaagtcctccgggcctcatatgaacaggagaaagaagcgcttacccac tctttccgggaggccagttctacccagcaggagaccatagacagactgacctcacagctg gaggctttccaggccaaaatgaagagggtggaggagtccattctgagccgaaactataag aaacatatccaggattatgggagccccagccagttctgggagcaggagctggagagctta cactttgtcatcgagatgaagaatgagcgtattcatgagctggacaggcggctgatcctc atggaaacagtgaaagagaaaaatctgatattggaggaaaaaattacgaccctgcaacag gaaaatgaggacctccatgtccgaagccgcaaccaggtggtcctgtcaaggcagctgtca gaagacctgcttctcacgcgtgaggccctggagaaggaggtgcagctgcggcgacagctc cagcaggagaaggaggagctgttgtaccgggtccttggggccaatgcctcgcctgccttc cctctggcccctgtcactcccactgaggtctctttcctcgccacatag >gi568815593f:151153217_151367448|GENSCAN_predicted_peptide_3|220_aa MQSLMQAPLLIALGLLLAAPAQAHLKKSHADVVVVESGDLGDVEEEQEQLSTGVPSQLSS FSWDNCDEGKDPAVIRSLTLEPDPIIVPGNVTLSVMGSTSVPLSSPLKVDLVLEKEVAGL WIKIPCTDYIGSCTFEHFCDVLDMLIPTGEPCPEPLRTYGLPCHCPFKEGTYSLPKSEFV VPDLELPSWLTTGNYRIESVLSSSGKRLGCIKIAASLKGI >gi568815593f:151153217_151367448|GENSCAN_predicted_CDS_3|663_bp atgcagtccctgatgcaggctcccctcctgatcgccctgggcttgcttctcgcggcccct gcgcaagcccacctgaaaaagtcacatgcagatgtagtggtggtggaatcaggagacctg ggggatgtggaggaggaacaggagcagctcagcacaggggtgccatcccagctcagtagc ttttcctgggataactgtgatgaagggaaggaccctgcggtgatcagaagcctgactctg gagcctgaccccatcatcgttcctggaaatgtgaccctcagtgtcatgggcagcaccagt gtccccctgagttctcctctgaaggtggatttagttttggagaaggaggtggctggcctc tggatcaagatcccatgcacagactacattggcagctgtacctttgaacacttctgtgat gtgcttgacatgttaattcctactggggagccctgcccagagcccctgcgtacctatggg cttccttgccactgtcccttcaaagaaggaacctactcactgcccaagagcgaattcgtt gtgcctgacctggagctgcccagttggctcaccaccgggaactaccgcatagagagcgtc ctgagcagcagtgggaagcgtctgggctgcatcaagatcgctgcctctctaaagggcata taa >gi568815593f:151153217_151367448|GENSCAN_predicted_peptide_4|1066_aa MSVTKSTEGPQGAVAIKLDLMSPPESAKKLENKDSTFLDESPSESAGLKKTKGITLKLAL AKTLRVFQALIHLVKGNMGTGILGLPLAVKNAGILMGPLSLLVMGFIACHCMHILVKCAQ RFCKRLNKPFMDYGDTVMHGLEANPNAWLQNHAHWGRHIVSFFLIITQLGFCCVYIVFLA DNLKQVVEAVNSTTNNCYSNETVILTPTMDSRLYMLSFLPFLVLLVLIRNLRILTIFSML ANISMLVSLVIIIQYITQEIPDPSRLPLVASWKTYPLFFGTAIFSFESIGVVLPLENKMK NARHFPAILSLGMSIVTSLYIGMAALGYLRFGDDIKASISLNLPNCWLYQSVKLLYIAGI LCTYALQFYVPAEIIIPFAISRVSTRWALPLDLSIRLVMVCLTCLLAILIPRLDLVISLV GSVSGTALALIIPPLLEVTTFYSEGMSPLTIFKDALISILGFVGFVVGTYQALDELLKSE DSHPFSNSTTFVRSEAADLPGMKLQTFAESVTAHKGSVDPKNSGAQLASPSGSHTGDAGG ATCQSCAVGPHSSALGWSMGLGAVEQGAALIGEALAAQEPMKRPLQIVNSMRTEIVMSLL GRDYNSELNSLDNGPQSPSESSSSITSENVHPAGEAGLSMMQTLIHLLKCNIGTGLLGLP LAIKNAGLLVGPVSLLAIGVLTVHCMVILLNCAQHLSQRLQKTFVNYGEATMYGLETCPN TWLRAHAVWGRYTVSFLLVITQLGFCSVYFMFMADNLQQMVEKAHVTSNICQPREILTLT PILDIRFYMLIILPFLILLVFIQNLKVLSVFSTLANITTLGSMALIFEYIMEGIPYPSNL PLMANWKTFLLFFGTAIFTFEGVGMVLPLKNQMKHPQQFSFVLYLGMSIVIILYILLGTL GYMKFGSDTQASITLNLPNCWLYQSVKLMYSIGIFFTYALQFHVPAEIIIPFAISQVSES WALFVDLSVRSALVCLTCVSAILIPRLDLVISLVGSVSSSALALIIPALLEIVIFYSEDM SCVTIAKDIMISIVGLLGCIFGTYQALYELPQPISHSMANSTGVHA >gi568815593f:151153217_151367448|GENSCAN_predicted_CDS_4|3201_bp atgtctgtgacaaaaagtactgagggtccccagggagccgttgccatcaaattggacctt atgtcgcctcctgaaagtgccaagaagttggagaacaaggactctacattcttggatgaa agtccttcagagtcagcaggcttgaagaagaccaagggcataaccctgaagcttgcattg gcaaaaaccctgagagtgttccaggccttgattcacctggtgaaaggcaacatgggcaca gggatcctgggactacccctcgctgtgaagaacgcgggcatcctgatgggcccactcagt ctgctggtgatgggcttcattgcctgccactgtatgcacatcctggtcaagtgtgcccag cgcttctgtaagaggcttaacaagccctttatggactatggggacacggtgatgcatgga ctagaagccaaccccaacgcctggctccagaatcacgctcactggggaaggcatatcgtg agcttcttccttattatcacccaacttggcttctgctgtgtgtacattgtgtttttggct gataatttaaaacaggtagtggaagctgttaatagcacaaccaacaactgctattccaat gagacggtgattctgacccccaccatggactcgcgactctacatgctctccttcctgccc ttcctggtgctgctggtcctcatccggaacctcaggatcttgaccatcttctccatgctg gccaacatcagcatgctggtcagcttggtcatcatcatacagtacattacccaggaaatc ccagaccccagccggttgccactggtagcaagctggaagacctaccctctcttcttcgga acagccattttttcttttgaaagcattggtgtggttctgcctctggaaaacaagatgaag aatgcccgccacttcccagccatcctgtctttgggaatgtccatcgtcacttccctatac attggcatggcggctctgggctacctgcggtttggagatgacatcaaggccagcataagc cttaacctgcctaactgctggctgtaccagtctgtcaagcttctctacattgccggcatc ctgtgcacctatgccctgcagttctacgtccctgcagaaatcatcatcccctttgccatc tcccgggtgtcaacacgctgggcactgcctctggatctgtccattcgcctcgtcatggtc tgcctgacatgcctcctggccatcctcatcccccgcctggacctggtcatctccctggtg ggctccgtgagtggcaccgccctggccctcatcatcccaccgctcctggaggtcaccacg ttctactcagagggcatgagccccctcaccatcttcaaggacgccctgatcagcatcctg ggcttcgtgggctttgtggtggggacctaccaggccctggacgagctgctcaagtcagaa gactctcaccccttttccaactccaccacttttgttcggagtgaagctgcagaccttccc ggaatgaagctgcagaccttcgcagagagtgttacagctcataaaggcagtgtggaccca aaaaactcaggagcccagctggcttcacccagtggatcccacaccggggatgcaggtgga gctacctgccagtcctgcgccgtgggcccgcactcctcagcccttgggtggtcgatggga ctgggcgccgtggagcagggggcagcgctcatcggggaggctctggccgcacaggagccc atgaagcggcccctccagattgtaaattccatgagaacagagattgtaatgtcattgctt ggaagggactacaacagtgagctgaactccttggacaacggacctcagtcaccctcagag agcagcagtagcattacttcagagaatgtccatcctgctggagaagctggactatcgatg atgcaaactttgatccacttgttgaaatgcaacattggcacagggctcctggggcttccc ctggccataaagaatgccggcttgttggtcggtcctgtcagccttctggccatcggggtc ctcaccgtgcactgcatggtcatcctgttgaactgtgctcaacacctcagccagagactg cagaagacttttgtgaactatggagaggccacgatgtacggccttgaaacctgcccgaac acctggctgagggcccatgcagtgtggggaaggtacactgtcagcttcttattagtcatc acccagctgggcttctgcagtgtttattttatgtttatggcagacaatttacaacagatg gtggaaaaagcccacgtgacctccaacatctgccagcccagggagattctgacgctgacc cccatcctggacattcgtttctacatgctgataatcctgcccttcctgatcctgttggtg tttatccagaacctcaaggtgctgtccgtcttctcgacattggccaacatcaccaccctt gggagcatggctctgatctttgagtatatcatggaggggattccatatcccagcaaccta cccttgatggcaaactggaagaccttcttgctgttctttggtacagccatcttcacattt gaaggcgtcggtatggttctgcctctcaaaaaccagatgaagcatccacagcagttttct tttgttctgtacttggggatgtccattgtcatcatcctctatatcttactggggacactg ggctacatgaagtttgggtcagacacccaggccagcatcaccctcaacttgcccaattgc tggttgtaccagtcagtcaagctgatgtactctatcggcatcttcttcacctatgccctc cagttccacgtcccagctgagatcatcatcccgtttgccatctcccaagtgtcagagagc tgggcactgtttgtagacctgtctgtccgctcagccttggtctgtctaacctgtgtctca gccatcctcatcccccgcctggacttggtcatctccctggtaggctccgtgagcagcagc gccctggctctcatcatcccagccctcctggagatcgtcatcttttactctgaggacatg agctgtgtcaccattgccaaggacatcatgattagcatcgtgggccttttagggtgtata tttgggacataccaagccctctatgagttgccccaacccatcagccattccatggccaac tccacaggtgtccatgcataa >gi568815593f:151153217_151367448|GENSCAN_predicted_peptide_5|53_aa MGSEGFQDMDLGEIQEQIDTMPEELTKDDWMEMSASQPVSAEEEEEAAAVSEN >gi568815593f:151153217_151367448|GENSCAN_predicted_CDS_5|162_bp atggggagtgaagggtttcaagatatggatcttggagaaattcaagagcaaatagacacc atgccagaggaattaacaaaagatgactggatggagatgagtgcttctcaaccggtgtca gctgaggaggaagaagaagcagcagcagtgtcagaaaactaa >gi568815593f:151153217_151367448|GENSCAN_predicted_peptide_6|67_aa XSLAILIPHLGMVLTQMSLVSSSTLPLIILCLLEMTTYYSECMSSLIITKDALISILGFV GFLVGTF >gi568815593f:151153217_151367448|GENSCAN_predicted_CDS_6|204_bp ngttccctggccatcctcatcccccatctgggcatggtccttacccagatgagcttggtg agcagcagcaccctgcccctcatcatcctgtgcctcctggagatgactacctactactca gagtgcatgagctccctcatcatcaccaaggacgccctgatcagcatcctgggctttgtg ggatttttggtggggaccttctag