GENSCAN 1.0    Date run: 7-Nov-116    Time: 16:26:06

Sequence gi568815593r:171944885_172288042 : 343158 bp : 45.16% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  34906  35136  231  0  0   80   93    74 0.192   5.07
 1.02 Term +  42098  42160   63  1  0   65   55    52 0.295  -2.51
 1.03 PlyA +  43588  43593    6                               1.05

 2.03 PlyA -  52608  52603    6                               1.05
 2.02 Term -  57073  56978   96  1  0   74   40    74 0.332  -0.83
 2.01 Init -  61618  61574   45  1  0  120  101   126 0.882  17.78
 2.00 Prom -  80287  80248   40                              -4.16

 3.03 PlyA -  80297  80292    6                               1.05
 3.02 Term -  83537  83350  188  0  2   91   38    19 0.130  -5.15
 3.01 Init -  87695  87560  136  2  1   73   57   171 0.545  12.80
 3.00 Prom -  90388  90349   40                              -5.76

 4.16 PlyA -  91663  91658    6                               1.05
 4.15 Term - 100138  99998  141  1  0  103   42   190 0.985  13.83
 4.14 Intr - 108158 108045  114  2  0   72   88   152 0.813  14.24
 4.13 Intr - 109810 109685  126  1  0   83   86   312 0.961  31.38
 4.12 Intr - 110892 110704  189  0  0  100   75   443 0.999  43.98
 4.11 Intr - 112589 112465  125  0  2   91  100   242 0.915  26.00
 4.10 Intr - 116384 116255  130  2  1  147   59   254 0.482  28.67
 4.09 Intr - 119372 119349   24  2  0  103   80    20 0.595   0.92
 4.08 Intr - 119928 119836   93  0  0  116   70   164 0.985  17.56
 4.07 Intr - 137621 137442  180  2  0   95  105   270 0.993  29.46
 4.06 Intr - 138200 138077  124  1  1   78   92   164 0.999  16.39
 4.05 Intr - 141432 141242  191  0  2  104  103    35 0.748   4.98
 4.04 Intr - 145478 145320  159  2  0   63   37   209 0.726  13.48
 4.03 Intr - 149076 148528  549  0  0   82   87   533 0.999  45.67
 4.02 Intr - 151676 151542  135  2  0   82   66   321 0.999  30.06
 4.01 Init - 152525 152487   39  2  0   64   86    14 0.420  -2.63
 4.00 Prom - 152943 152904   40                              -3.16

 5.00 Prom + 152947 152986   40                              -9.26
 5.01 Init + 153958 154442  485  0  2   56   55  1031 0.570  91.38
 5.02 Intr + 154730 154794   65  2  2   74  100    72 0.549   5.36
 5.03 Term + 155424 155563  140  1  2   66   43    70 0.606  -1.57
 5.04 PlyA + 155909 155914    6                               1.05

 6.17 PlyA - 156615 156610    6                               1.05
 6.16 Term - 158380 158378    3  1  0  114   54     0 0.123  -3.90
 6.15 Intr - 160853 160772   82  1  1  119  105   126 0.889  17.04
 6.14 Intr - 161930 161736  195  1  0   60   65   404 0.810  33.83
 6.13 Intr - 162968 162896   73  0  1   30  105    99 0.742   4.26
 6.12 Intr - 172746 172597  150  1  0  109   82   235 0.995  25.13
 6.11 Intr - 178508 178374  135  0  0   71   28    78 0.337   0.64
 6.10 Intr - 182537 182489   49  2  1  115   92    66 0.716   7.95
 6.09 Intr - 184290 184164  127  2  1   62   49    78 0.127   1.98
 6.08 Intr - 201394 201307   88  2  1   71   21    80 0.010  -1.37
 6.07 Intr - 203391 203259  133  0  1   98   26    62 0.031   1.22
 6.06 Intr - 211904 211740  165  2  0  141   82   326 0.957  37.46
 6.05 Intr - 221526 221418  109  2  1   46   92    62 0.001   2.69
 6.04 Intr - 225497 225448   50  2  2   74   98    64 0.000   3.48
 6.03 Intr - 240907 240762  146  2  2  115   37    96 0.032   7.20
 6.02 Intr - 242606 242447  160  2  1  -13   98    74 0.298  -2.14
 6.01 Init - 243158 243003  156  2  0   99  100   372 0.514  39.41
 6.00 Prom - 246537 246498   40                              -4.46

 7.00 Prom + 248946 248985   40                              -2.16
 7.01 Init + 249289 249424  136  0  1   51   61    99 0.907   3.81
 7.02 Intr + 254499 254623  125  1  2   65   58   183 0.728  13.30
 7.03 Intr + 255682 255858  177  0  0   79  101   145 0.953  15.02
 7.04 Term + 258345 258461  117  2  0   72   37    70 0.589  -1.16
 7.05 PlyA + 258546 258551    6                               1.05

 8.07 PlyA - 261029 261024    6                               1.05
 8.06 Term - 267343 266946  398  2  2   64   43   301 0.176  18.44
 8.05 Intr - 289474 289238  237  2  0   96   78   155 0.921  12.89
 8.04 Intr - 307577 307571    7  2  1  114   91     0 0.001  -3.79
 8.03 Intr - 310176 309982  195  0  0   96   21    94 0.031   3.11
 8.02 Intr - 310645 310356  290  2  2   49   89   189 0.843  12.06
 8.01 Init - 338781 338712   70  0  1   88   92   106 0.660  12.21

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init - 212035 211958   78  1  0   56   86    -9 0.831  -3.08
S.002 Init + 272757 272764    8  2  2  104   91     0 0.802   2.40
S.003 Term + 272946 273207  262  0  1   33   47   154 0.810   0.80
S.004 Init + 302150 302220   71  1  2   75   95    97 0.800   9.62

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593r:171944885_172288042|GENSCAN_predicted_peptide_1|97_aa
KGYTRQQLSHESTLNKRDRVIYNLTCPPYCRVQFQLAGTGPHILYLSRLASNLELFKRGK
GRGEQRKKEVTCGMLRKMISPGIRFSQKQEPYCELRM

>gi568815593r:171944885_172288042|GENSCAN_predicted_CDS_1|294_bp
aaagggtacactcgccagcagcttagccacgagagtacactgaacaaacgagacagggtc
atttataacctgacatgtccaccctactgccgtgtccagtttcaattggctggaacagga
cctcacattttgtatttgtcccgattggctagcaacttagaactttttaaaagaggcaaa
ggcagaggagaacaaaggaagaaggaagtaacttgtggaatgctgagaaagatgatcagc
ccaggcattagattctcacagaagcaggaaccctactgtgaactgcgcatgtga

>gi568815593r:171944885_172288042|GENSCAN_predicted_peptide_2|46_aa
MEPDSVIEDKTIELMHMEDHGPPDSAHTFPLSALIGMDLSPFIEGY

>gi568815593r:171944885_172288042|GENSCAN_predicted_CDS_2|141_bp
atggagcccgactcggtgattgaggacaagaccatcgagctcatgcacatggaggaccat
ggtcctccagattctgctcacacgtttcctttgtcagcactgataggaatggacctaagc
ccatttattgaaggctactga

>gi568815593r:171944885_172288042|GENSCAN_predicted_peptide_3|107_aa
MGESLELPRDLLNGFDQNADNDMDNGVQAEVVSDEDEKCVGNGNKGPTASSLAKGKITEL
RFDLIPRRRTLQFESNHVNCLFKRNKTKTKKANEHSSEKQQNPVSTT

>gi568815593r:171944885_172288042|GENSCAN_predicted_CDS_3|324_bp
atgggggaaagtttggaacttcctagagacctgttgaatggttttgaccaaaatgctgat
aatgatatggacaatggagtccaagctgaggtggtctccgatgaagatgagaaatgtgtt
gggaacgggaataaagggccaactgcaagtagcttggctaagggtaaaataactgaactg
agatttgacctgattcccaggagacggactttgcaatttgaatccaaccatgttaactgc
ctgttcaaacgaaacaaaacaaaaaccaagaaagccaacgaacactcttcagagaaacaa
cagaatccagtctctacaacatga

>gi568815593r:171944885_172288042|GENSCAN_predicted_peptide_4|772_aa
MEMFCILSWVAVTHPFVSSITSNKALRELVAEAKAEVMEEIEDGRDEGEEEDAVDAASTL
ENHTQNSSEVSPPSLNADKPLEESPSTPLAPSQSQDSVNEPCSQPSGDRSLQTTSPPVVA
PGNENGLAVPVPLRKSRPVSMDARIQVAQEKQVAEQGGDLSPAANRSQKASQSRPNSSAL
ETLGGEKLANGSLEPPAQAAPGPSKRDSDCSSLCTSESMDYGTNLSTDLSLNKEMGSLSI
KDPKLYKKTLKRTRKFVVDGVEVSITTSKIISEDEKKDEEMRFLRQAWEGPPMVKSRGSD
SLPSVAVESASGFSQRQGSHCLFFLGRIRPVAIYRVLVMCHVGTLPVSSHLSFKGPNRRQ
ELRELRLLQKEEHRNQTQLSNKHELQLEQMHKRFEQEINAKKKFFDTELENLERQQKQQV
EKMEQDHAVRRREEARRIRLEQDRDYTRFQEQLKLMKKEVKNEVEKLPRQQRKESMKQKM
EEHTQKKQLLSFVTSRSMDRDFVAKQKEDLELAMKRLTTDNRREICDKERECLMKKQELL
RDREAALWEMEEHQLQERHQLVKQQLKDQYFLQRHELLRKHEKEREQMQRYNQRMIEQLK
VRQQQEKARLPKIQRSEGKTRMAMYKKSLHINGGGSAAEQREKIKQFSQQEEKRQKSERL
QQQQKHENQMRDMLAQCESNMSELQQLQNEKCHLLVEHETQKLKALDESHNQNLKEWRDK
LRPRKKALEEDLNQKKREQEMFFKLSEEAECPNPSTPSKAAKFFPYSSADAS

>gi568815593r:171944885_172288042|GENSCAN_predicted_CDS_4|2319_bp
atggaaatgttctgtatcttgagctgggtggcagttacgcatcccttcgtcagcagcatc
accagtaacaaggctctgcgggagctggtggctgaggccaaggccgaggtgatggaagag
atcgaagacggccgggatgagggggaagaggaggacgccgtggatgccgcctccaccctg
gagaaccatactcagaactcctctgaggtgagtccgccaagcctcaatgctgacaagcct
ctcgaggagtcaccttccaccccgctggcacccagccagtctcaggacagtgtgaatgag
ccctgcagccagccctctggggacagatccctccaaaccaccagtcccccagtcgtggcc
cctggaaatgagaacggcctggcagtgcctgtgcccctgcggaagtcccgacccgtgtca
atggatgccagaattcaggtagcccaggagaagcaagttgctgagcagggtggggacctc
agcccagcagccaacagatctcaaaaggccagccagagccggcccaacagcagcgccctg
gagaccttgggtggggagaagctggccaatggcagcctggagccacctgcccaggcagct
ccagggccttccaagagggactcggactgcagcagcctctgcacctctgagagcatggac
tatggtaccaatctctccactgacctgtcgctgaacaaagagatgggctctctgtccatc
aaggacccgaaactgtacaaaaaaaccctcaagcggacacgcaaatttgtggtggatggt
gtggaggtgagcatcaccacctccaagatcatcagcgaagatgagaagaaggatgaggag
atgagatttctcaggcaagcctgggaagggccaccaatggtgaaaagcagaggctcagac
agtcttccctctgtggctgtagaaagcgcctctggcttttcccagaggcagggcagccat
tgcctcttcttccttggtagaattcgaccagttgccatttacagagtgctggtcatgtgc
cacgtgggcactttacctgtgtcatctcatttatccttcaaaggacccaacaggcgccag
gaactccgagagcttcggctgctccagaaagaagagcatcggaaccagacccagctgagt
aacaagcatgagctgcagctggagcaaatgcataaacgttttgaacaggaaatcaacgcc
aagaagaagttctttgacacggaattagagaacctggagcgtcagcaaaagcagcaagtg
gagaagatggagcaagaccatgccgtgcgccgccgggaggaggccaggcggatccgcctg
gagcaggatcgggactacaccaggttccaagagcagctcaaactgatgaagaaagaggtg
aagaacgaggtggagaagctcccccgacagcagcggaaggaaagcatgaagcagaagatg
gaggagcacacgcagaaaaagcagcttctttccttcgtcacctccagaagcatggaccgg
gactttgtagccaagcagaaggaggacctggagctggccatgaagaggctcaccaccgac
aacaggcgggagatctgtgacaaggagcgcgagtgcctcatgaagaagcaggagctcctt
cgagaccgggaagcagccctgtgggagatggaagagcaccagctgcaggagaggcaccag
ctggtgaagcagcagctcaaagaccagtacttcctccagcggcacgagctgctgcgcaag
catgagaaggagcgggagcagatgcagcgctacaaccagcgcatgatagagcagctgaag
gtgcggcagcaacaggaaaaggcgcggctgcccaagatccagaggagtgagggcaagacg
cgcatggccatgtacaagaagagcctccacatcaacggcgggggcagcgcagctgagcag
cgtgagaagatcaagcagttctcccagcaggaggagaagaggcagaagtcggagcggctg
cagcaacagcagaaacacgagaaccagatgcgggacatgctggcgcagtgtgagagcaac
atgagcgagctgcagcagctgcagaatgaaaagtgccacctcctggtagagcacgaaacc
cagaaactgaaggccctggatgagagccataaccagaacctgaaggaatggcgggacaag
cttcggccgcgcaagaaggctctggaagaggatctgaaccagaagaagcgggagcaggag
atgttcttcaagctgagcgaggaggcggagtgcccaaacccctccaccccaagcaaggcc
gccaagttcttcccctacagttctgcggatgcttcttaa

>gi568815593r:171944885_172288042|GENSCAN_predicted_peptide_5|229_aa
MAAIIIIITTTTIITFTTITTTITTIITITIITTTTIITFTTITTIITITTIITITIIII
TTITFTTITTIITITIIINIIITTITITTITITIITTIIPITITITPITTIITITMTITI
TTITPVSSRSSSTIRKARYGPAQWLTSVIPALWEAKAVDHLRAQLFGTKGENQDPKGENQ
DPKVRGTGWLFSAIFAEKGNRQLLSSQGLPCHLNRCPFQELKPTFPLCA

>gi568815593r:171944885_172288042|GENSCAN_predicted_CDS_5|690_bp
atggcagccatcatcatcatcatcaccaccaccaccatcatcaccttcaccaccatcacc
accaccatcaccaccatcattaccattaccatcatcaccaccaccaccatcatcaccttc
accaccatcaccaccatcattaccatcaccaccatcattaccattaccatcatcatcatc
accaccatcaccttcaccaccatcaccaccatcattaccatcaccatcattatcaacatc
atcatcacgaccattaccatcaccaccatcactatcaccatcattaccaccatcatcccc
atcaccatcaccattacacccatcaccactatcatcaccatcaccatgaccattaccatc
accaccatcacccccgtcagcagcaggagcagcagcacaatcagaaaagcccgttatggc
ccggcacagtggctcacatctgtaatcccagcactttgggaggccaaggcggtggatcac
ctgagagcacagctttttggaaccaaaggtgagaaccaagatcccaaaggtgagaaccaa
gatcccaaagtaagaggcaccggctggctgttcagtgccatctttgctgaaaaaggcaac
cggcagctgctctctagccagggcctcccctgccatctaaacagatgtcctttccaagaa
ctcaagcctacttttcctctatgtgcttag

>gi568815593r:171944885_172288042|GENSCAN_predicted_peptide_6|606_aa
MAFANFRRILRLSTFEKRKSREYEHVRRDLDPNEVWEIVGELGDGAFGKVYKRPQMLWEY
RGDVDRETRLRLDGLGARDTSEDEPETGKWGGRGVAHGLRGGAWKELHLLPGGTRERSSA
LQLSLDTASVLDCLRQSLLLQLCSDYCEGSWTPVDSAVYILAQSKALAPASPRERELISF
LVANGRNPKGNLSAHVTEKSTGRVRLQAKNKETGALAAAKVIETKSEEELEDYIVEIEIL
ATCDHPYIVKLLGAYYHDGKLWLIMHRVGLESACTLSCCVTPDTPLRASELQASHLDVGD
DSADHHNRVAMIHSVIHSLTNTCFPGEHRLWARQFSPAYTPSVAQPWGPSSEQQHQQPHF
TAEDTVSEDGSDLSKATLIMIEFCPGGAVDAIMLVLLVVLTAPALGSFPPVGGSSERSIL
GMMMCKEKLKWGQLTSSEPELDRGLTEPQIQVVCRQMLEALNFLHSKRIIHRDLKAGNVL
MTLEGDIRLADFGVSAKNLKTLQKRDSFIGTPYWMAPEVVMCETMKDTPYDYKADIWSLG
ITLIEMAQIEPPHHELNPMRVLLKIAKSDPPTLLTPSKWSVEFRDFLKIALDKNPETRPS
AAQLLE

>gi568815593r:171944885_172288042|GENSCAN_predicted_CDS_6|1821_bp
atggcttttgccaatttccgccgcatcctgcgcctgtctaccttcgagaagagaaagtcc
cgcgaatatgagcacgtccgccgcgacctggaccccaacgaggtgtgggagatcgtgggc
gagctgggcgacggcgccttcggcaaggtttacaagagaccacagatgctgtgggaatat
cgaggcgatgtggaccgagaaactcggctgcgcttggacgggttgggagctagggacaca
tctgaagatgagccagagacagggaagtggggtgggcgaggggtggcccatggactgaga
ggtggggcttggaaagagctgcatcttcttcctggtggcaccagagaacgatcctcagcc
ttgcaactgtctctggacactgccagtgtccttgactgccttagacagagcctgctcctg
cagctgtgctctgactactgtgagggcagctggacccctgtggactcagctgtgtacatc
ctggcccagtcgaaggctttggctcctgcaagccccagagagagggaactcatctccttc
ttggtggccaatggcaggaatccaaaagggaatttatcggctcatgtaactgagaagtct
acgggtagagtccgccttcaggccaagaataaggagacgggtgctttggctgcggccaaa
gtcattgaaaccaagagtgaggaggagctggaggactacatcgtggagattgagatcctg
gccacctgcgaccacccctacattgtgaagctcctgggagcctactatcacgacgggaag
ctgtggttgatcatgcacagagtgggattggaatctgcctgcaccctgtcctgctgtgtg
accccagacacgcctcttcgtgcctctgagctccaggcttctcatctggacgtcggggat
gatagtgctgatcaccacaacagggtagcaatgattcactcagtcattcattcactcacc
aacacatgcttccccggggagcaccggctgtgggcccggcagtttagcccggcttatacc
cctagtgtagcccagccatggggtcccagttcagagcaacagcatcagcagccccatttt
acagctgaggacacagtctcagaggacggaagtgacctgtccaaggccacactaatcatg
attgagttctgtccagggggagccgtggacgccatcatgctggttctcctcgtggtcctg
actgctccagccctaggcagttttccccctgttggtggcagcagtgaaaggagcattttg
ggaatgatgatgtgcaaagagaagctgaaatggggacagctgacaagttcagagccagag
ctggacagaggcctcacggagccccagatacaggtggtttgccgccagatgctagaagcc
ctcaacttcctgcacagcaagaggatcatccaccgagatctgaaagctggcaacgtgctg
atgaccctcgagggagacatcaggctggctgactttggtgtgtctgccaagaatctgaag
actctacagaaacgagattccttcatcggcacgccttactggatggcccccgaggtggtc
atgtgtgagaccatgaaagacacgccctacgactacaaagccgacatctggtccctgggc
atcacgctgattgagatggcccagatcgagccgccacaccacgagctcaaccccatgcgg
gtcctgctaaagatcgccaagtcggaccctcccacgctgctcacgccctctaagtggtct
gtagagttccgtgacttcctgaagatagccctggataagaacccagaaacccgacccagt
gccgcgcagctgctggagtga

>gi568815593r:171944885_172288042|GENSCAN_predicted_peptide_7|184_aa
MRLKQGSFLWYLYLDKIYCLLSVRNVKALAEYFHILDVHGKNTLNDVLFYHFLHHVTDLK
KAQINIVFDMLDWNAVGEIDFEKFYMLNHLEGQFMYRHSRPVFDLLDLKGDLRIGAKNFE
MYRFLFNIQKQELKDLFRDFDITGDNEFKLYTIIYTDKLQKRQKTEEKEKGERKRSLYSK
CHIK

>gi568815593r:171944885_172288042|GENSCAN_predicted_CDS_7|555_bp
atgagactgaagcaaggatcgtttctgtggtacctctatctggacaaaatatactgctta
ttatccgtgagaaacgtgaaggctttggcagaatattttcatattctggacgtgcacggc
aagaacaccttgaatgatgtgctgttctatcacttccttcatcatgtgactgacttgaaa
aaggcacagatcaacattgtgtttgacatgctggactggaacgctgtgggcgagatcgac
tttgagaagttctacatgctgaaccatttggaaggacagtttatgtatcgtcattcccgg
cctgtctttgacctgcttgacctgaaaggggatctgagaattggtgcaaaaaacttcgaa
atgtacagatttctcttcaatattcaaaaacaggaactcaaagatctcttccgtgacttt
gacattacaggtgacaatgaatttaagctgtatacaatcatctacactgacaaattacag
aagaggcagaaaacagaggagaaagaaaaaggagagagaaagagaagtctctactcaaaa
tgtcacatcaagtag

>gi568815593r:171944885_172288042|GENSCAN_predicted_peptide_8|398_aa
MGGCVGAQHDSSGSLNENSEGTGGSPDLGLYFSWAASLTTPLKAKHQSTLEGLKECSCLT
QFLLSKRPVDPVSVSYSSNYMESMKPNKYGVIYSTQLPDEFFQTLEGLWHGIQMEPVDFM
MMAAALSWHRIWSLGILLVIQPVVVQPIPFMYMSHLQEPLMVSLRRRRRKRSSMEEMKNS
SSSMQGAVALGRNQPLKKEKPKWKSDYPMTDGQLRSKRDEFWDTAPAFEGRKEIWDALKA
AAHAFESNDHELAQAIIDGANITLPHGALTECYDELGNRYQLPVYCLAPPINMIEEKSDI
ETLDIPEPPPNSGYECQLRLRLSTGKDLKLVVRSTDTVFHMKRRLHAAEGVEPGSQRWFF
SGRPLTDKMKFEELKIPKDYVVQVIVSQPVQNPTPVEN

>gi568815593r:171944885_172288042|GENSCAN_predicted_CDS_8|1197_bp
atgggcgggtgtgtgggcgcccagcacgactcctcgggcagcctcaacgagaactcggag
ggcaccggaggcagcccggatctgggcctttatttctcgtgggcggcgtccctaaccaca
cctctcaaagccaaacaccagagcaccctagaaggtttaaaagaatgctcatgtttgacc
cagttcctgttaagcaagaggcccgtggaccctgtctcggtgtcatactcatctaattac
atggaatccatgaagcccaacaagtatggggtcatctactccacacaattgcctgatgag
ttctttcagaccctagaaggcctgtggcatggaatacagatggagccagtggacttcatg
atgatggcagctgccctctcctggcacagaatatggagcctggggatcttgctggtcatc
cagccggtggtggtgcagcccatcccctttatgtacatgagtcacctccaggagcctctc
atggtctccttgaggaggaggaggaggaagaggtcttccatggaggagatgaaaaattcc
agtagtagcatgcaaggagcagttgctctaggtcgtaaccagcctttgaaaaaggagaaa
ccaaaatggaaaagcgattatcctatgacagatggacaactacgcagcaagagggatgaa
ttttgggatacagcaccagcttttgaaggccggaaagagatttgggatgccttgaaggct
gctgcacatgcttttgagagcaatgatcatgaactggcacaagcaatcattgatggtgca
aacataacattaccacatggtgcacttacagagtgctacgatgaactggggaacagatat
cagcttccagtgtattgcttggcaccgccaatcaacatgatagaggaaaagagcgacata
gagactctggatattcctgagccaccacccaattctggatatgaatgtcagcttcgtttg
cgcctttccacaggcaaagacctcaagcttgtggttcgcagcacagacacagtattccac
atgaagagacggttgcatgcagcagagggagtggaaccaggtagtcagcggtggtttttt
tctggcagacctctcactgacaaaatgaagttcgaagagctgaagatcccaaaggactat
gttgtacaggttatagtgagccaacctgtgcagaacccaacaccagtggagaactga