GENSCAN 1.0    Date run: 7-Nov-116    Time: 18:53:36

Sequence gi568815597f:10330518_10542402 : 211885 bp : 46.10% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   4003   4121  119  1  2  101   96   187 0.997  20.91
 1.02 Intr +   6140   6225   86  0  2   83   94    80 0.999   7.64
 1.03 Intr +   6557   6686  130  1  1   82   71    77 0.993   5.67
 1.04 Intr +   6854   7016  163  0  1   62   82   113 0.976   7.13
 1.05 Intr +   9252   9342   91  0  1   22  101    67 0.639   1.40
 1.06 Intr +  11533  11651  119  0  2   66   36    78 0.593  -0.34
 1.07 Intr +  12715  12770   56  1  2  105   99     5 0.932   1.92
 1.08 Intr +  13573  13757  185  2  2   88   42   175 0.644  12.41
 1.09 Intr +  18132  18216   85  2  1   74   95    77 0.976   6.39
 1.10 Intr +  22114  22282  169  2  1  107   19   206 0.506  14.60
 1.11 Intr +  30412  30526  115  1  1   91   78   149 0.999  14.65
 1.12 Intr +  31175  31308  134  1  2   94   88   124 0.800  12.44
 1.13 Intr +  32766  32827   62  0  2   76  115     4 0.920   0.28
 1.14 Intr +  34583  34782  200  0  2   86    3   154 0.514   5.77
 1.15 Intr +  35024  35131  108  1  0   83   70   196 0.589  17.78
 1.16 Intr +  37950  38021   72  2  0  105  105    69 0.995   9.90
 1.17 Intr +  40624  40745  122  0  2   77  100    34 0.982   2.79
 1.18 Intr +  43799  43948  150  2  0   79  113    98 0.995  10.68
 1.19 Intr +  44337  44529  193  0  1   85   73   156 0.959  13.29
 1.20 Intr +  44738  44856  119  1  2   87  107   164 0.962  17.46
 1.21 Term +  46028  46070   43  2  1  130   42    29 0.828  -0.87
 1.22 PlyA +  46303  46308    6                               -0.45

 2.00 Prom +  50443  50482   40                               -2.86
 2.01 Init +  68457  68608  152  2  2   69   81    95 0.761   6.53
 2.02 Intr +  69112  69187   76  1  1  114   91   190 0.995  21.42
 2.03 Intr +  69876  70055  180  2  0  102   73   203 0.987  20.26
 2.04 Intr +  72554  72619   66  1  0   77   98    79 0.987   6.90
 2.05 Intr +  73644  73762  119  2  2   76   89    46 0.754   2.76
 2.06 Intr +  77554  77593   40  1  1  123    0    59 0.695  -1.17
 2.07 Intr +  80901  81035  135  2  0  107   94   247 0.993  27.96
 2.08 Intr +  82545  82734  190  2  1   76   23   240 0.973  15.46
 2.09 Intr +  86470  86600  131  2  2   75   81   160 0.977  14.41
 2.10 Intr +  86751  86834   84  2  0   62   44    70 0.502   0.02
 2.11 Intr +  86859  86992  134  2  2   45  115   127 0.991  10.54
 2.12 Intr +  88309  88408  100  1  1   75   91   140 0.997  13.11
 2.13 Intr +  88900  89022  123  0  0   90  115    59 0.999   9.58
 2.14 Term +  89113  89232  120  0  0  117   43   115 0.999   8.37
 2.15 PlyA +  89606  89611    6                                1.05

 3.00 Prom +  92601  92640   40                               -3.16
 3.01 Init +  95834  95941  108  1  0   82   35   205 0.908  12.83
 3.02 Intr +  99986 100051   66  1  0   61   92    66 0.247   3.40
 3.03 Intr + 103325 103448  124  1  1   73   48   154 0.911  10.06
 3.04 Intr + 104140 104173   34  2  1  121   98    37 0.722   5.28
 3.05 Term + 109715 109853  139  2  1   58   41    64 0.138  -3.86
 3.06 PlyA + 110573 110578    6                                1.05

 4.00 Prom + 111718 111757   40                               -5.16
 4.01 Init + 119707 119805   99  0  0   65  105   209 0.987  18.57
 4.02 Term + 123468 123500   33  2  0   82   55    19 0.273  -4.41
 4.03 PlyA + 127386 127391    6                                1.05

 5.08 PlyA - 129428 129423    6                                1.05
 5.07 Term - 131185 130973  213  1  0   53   43   231 0.994  12.33
 5.06 Intr - 132692 132541  152  0  2  118   64   143 0.999  14.78
 5.05 Intr - 133103 132914  190  2  1   53   97   139 0.982  10.46
 5.04 Intr - 136815 136673  143  1  2   63  103   138 0.995  12.87
 5.03 Intr - 138821 138660  162  0  0   61  110   152 0.997  14.65
 5.02 Intr - 141407 141348   60  0  0   69   92    71 0.949   4.31
 5.01 Init - 141941 141761  181  2  1   69   76   203 0.905  16.55
 5.00 Prom - 142542 142503   40                               -5.66

 6.00 Prom + 142834 142873   40                               -6.36
 6.01 Init + 144450 144485   36  2  0   91  110    30 0.407   5.55
 6.02 Intr + 164757 164804   48  2  0  101  119    25 0.718   5.78
 6.03 Intr + 165172 165204   33  0  0   83  115     3 0.477   0.92
 6.04 Intr + 202710 202808   99  2  0   48   42    83 0.001   0.01
 6.05 Intr + 205696 205780   85  0  1   88   87    45 0.043   3.79
 6.06 Intr + 209602 209723  122  2  2   58   88    55 0.100   2.71
 6.07 Term + 210943 211071  129  0  0  112   53    80 0.074   5.08
 6.08 PlyA + 211564 211569    6                               -0.45

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init - 116628 116524  105  0  0   31   56   125 0.807   3.82

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597f:10330518_10542402|GENSCAN_predicted_peptide_1|840_aa
XAFVYLSNLLYPVPLIHRVAIVSEKGEVRGFLRVAVQAIAADEEAPDYGSGIRQSGTAKI
SFDNEYFNQSDFSSVAMTRSGLSLEELRIVEGQGQSSEVITPPEEISRINDLDLKSSTLL
DGKMVMEGFSEEIGNHLKLGSAFTFRVTVLQASGILPEYADIFCQFNFLHRHDEAFSTEP
LKNNGRGSPLAFYHVQNIAVEITESFVDYIKTKPIVFEVFGHYQQHPLHLQGQELNSPPQ
PCRRFFPPPMPLSKPDHERKIELIRFLVSGYVNVHMLDTFSEHASVLASSAIATFQDEAD
PLPPFLFLPVPSCQSERGIQRRITVTIIHEKGSELHWKDVRELVVGRIRNKPEVDEAAVD
AILSLNIISAKYLKSSHNSSRWDTQSSVKKSTLAGVNWYTVRTFYRFEAVWDSSLHNSLL
LNRVTPYGEKIYMTLSAYLELDHCIQPAVITKDVCMVFYSRDAKISPPRSLRSLFGSGYS
KSPDSNRVTGIYELSLCKMSDTGSPGMQRRRRKILDTSVAYVRGEENLAGWRPRGDSLIL
EHQWELEKLELLHEVSRGRVVQMQELSDKIAKISTTTFESAITPSESSGYDSGDIESLVD
REKELATKCLQLLTHTFNREFSQVHGSVSDCKLSDISPIGRDPSESSFSSATLTPSSTCP
SLVDSRSNSLDQKTPEANSRASSPCPEFEQFQIVPAVETPYLARAGKNEFLNLVPDIEEI
RPSSVVSKKGYLHFKEPLYSNWAKHFVVVRRPYVFIYNSDKDPVERGIINLSTAQVEYSE
DQQAMVKTPNTFAVCTKHRGVLLQALNDKDMNDWLYAFNPLLAGTIRSKLSRRCPSQSKY

>gi568815597f:10330518_10542402|GENSCAN_predicted_CDS_1|2523_bp
nnggcatttgtttacctgagcaatctgctgtatcccgtgcccctgatccacagggtggcc
atcgtcagtgagaaaggtgaagtgcggggatttctgcgtgtggctgtacaggccatcgca
gcggatgaagaagctcctgattatggctctggaattcgacagtcaggaacagctaaaata
tcttttgataatgaatactttaatcagagtgacttttcgtctgttgcaatgactcgttct
ggtctgtccttggaggagttgaggattgtggaaggacagggtcagagttctgaggtcatc
actcctccagaagaaatcagtcgaattaatgacttggatttgaagtcaagcactttgctg
gatggtaagatggtaatggaagggttttctgaagagattggcaaccacctgaaactgggc
agtgccttcactttccgagtaacagtgttgcaggccagtggaatcctcccagagtatgca
gatatcttctgtcagttcaactttttgcatcgccatgatgaagcattctccacggagccc
ctcaaaaacaatggcagaggaagtcccctggccttttatcatgtgcagaatattgcagtg
gagatcactgaatcatttgtggattacatcaaaaccaagcctattgtatttgaagtcttt
gggcattatcagcagcacccacttcatctgcaaggacaggagcttaacagtccgcctcag
ccgtgccgccgattcttccctccacccatgccactgtccaagccagaccatgagagaaag
attgagctgattcggtttctcgtgtctggctatgtgaatgtgcatatgttggacacattt
tctgagcacgccagtgtgcttgcatcctccgctattgccaccttccaggatgaggctgac
ccattgccgccttttctctttctgccagttcccagctgccagagcgagaggggcatccag
cgaaggatcacagtgaccattatccatgagaaggggagcgagctccattggaaagatgtt
cgtgaactggtggtaggtcgtattcggaataagcctgaggtggatgaagctgcagttgat
gccatcctctccctaaatattatttctgccaagtacctgaagtcttcccacaactctagc
aggtgggacacccagagcagtgtgaagaagtccacacttgcaggcgttaattggtacacc
gttaggaccttctaccgctttgaggctgtgtgggatagctctctgcataactcccttctt
ctgaaccgagtgacaccctatggagaaaagatctacatgaccttgtcggcctacctagag
ctggatcattgcatccagccggctgtcatcaccaaggatgtgtgcatggtcttctactcc
cgagatgccaagatctcaccaccacgctctctgcgtagcctctttggcagcggctactca
aagtcaccagattcgaatcgagtcactggcatttacgaactcagcttatgcaaaatgtca
gacacaggtagtccaggtatgcagagaaggagaagaaaaatcttagatacgtcagtggca
tatgtgcggggagaagagaacttagcaggctggcggccccgtggagacagcctcatcctt
gagcaccagtgggagctggagaagctggagctcctacatgaggtatccaggggcagggtt
gttcagatgcaagaactctcggacaagattgccaaaatctcaaccactacctttgaaagc
gccatcacacctagcgagagcagtggctatgattcaggagacatcgaaagcctggtggac
cgagagaaagagctggctaccaagtgcctgcaacttctcacccacactttcaacagagaa
ttcagccaggtgcacggcagcgtcagtgactgtaagttgtctgatatctctccaattgga
cgggatccctctgagtccagtttcagcagtgccaccctcactccctcctccacctgtccc
tctctggtagactctaggagcaactctctggatcagaagaccccagaagccaattcccgg
gcctctagtccctgcccagaatttgaacagtttcagattgtcccagctgtggaaacacca
tatttggcccgagcaggaaaaaacgaatttctcaatcttgttccagatattgaagaaatt
agaccaagctcagtggtctctaagaaaggataccttcatttcaaggagcctctttacagt
aactgggctaaacattttgttgtcgtccgtcggccttatgtcttcatctataacagtgac
aaagaccctgtggagcgtggaatcattaacctgtccacagcacaggtggagtacagtgag
gaccagcaggccatggtgaagacaccaaacacctttgctgtctgcacaaagcaccgtggg
gtccttttgcaggccctcaatgacaaagacatgaacgactggttgtatgccttcaaccca
cttctagctggcacaatacggtcaaagctttcccgcagatgcccgagccagtcgaaatac
taa

>gi568815597f:10330518_10542402|GENSCAN_predicted_peptide_2|549_aa
MAPPPSIRLAGAEKPGVSGRSFWREPLRVFPSLVLRASPLFGSALSAAMAQADIALIGLA
VMGQNLILNMNDHGFVVCAFNRTVSKVDDFLANEAKGTKVVGAQSLKEMVSKLKKPRRII
LLVKAGQAVDDFIEKLVPLLDTGDIIIDGGNSEYRDTTRRCRDLKAKGILFVGSGVSGGE
EGARYGPSLMPGGNKEAWPHIKTIFQGIAAKVGDEGAGHFVKMVHNGIEYGDMQLICEAY
HLMKDVLGMAQDEMAQAFEDWNKTELDSFLIEITANILKFQDTDGKHLLPKIRDSAGQKG
TGKWTAISALEYGVPVTLIGEAVFARCLSSLKDERIQASKKLKGPQKFQFDGDKKSFLED
IRKNIGPSGISTADENKTGRHKAVTLLMAILALYASKIISYAQGFMLLRQAATEFGWTLN
YGGIALMWRGGCIIRSVFLGKIKDAFDRNPELQNLLLDDFFKSAVENCQDSWRRAVSTGV
QAGIPMPCFTTALSFYDGYRHEMLPASLIQAQRDYFGAHTYELLAKPGQFIHTNWTGHGG
TVSSSSYNA

>gi568815597f:10330518_10542402|GENSCAN_predicted_CDS_2|1650_bp
atggctccacccccttccattcgattggccggcgccgaaaagccgggcgtgagcggccgc
agtttctggagggagccgctgcgggtctttccctcactcgtcctccgcgcgtcgccgctc
ttcggttctgctctgtccgccgccatggcccaagctgacatcgcgctgatcggattggcc
gtcatgggccagaacttaattctgaacatgaatgaccacggctttgtggtctgtgctttt
aataggactgtctccaaagttgatgatttcttggccaatgaggcaaagggaaccaaagtg
gtgggtgcccagtccctgaaagagatggtctccaagctgaagaagccccggcggatcatc
ctcctggtgaaggctgggcaagctgtggatgatttcatcgagaaattggtaccattgttg
gatactggtgacatcatcattgacggaggaaattctgaatatagggacaccacaagacgg
tgccgagacctcaaggccaagggaattttatttgtggggagcggagtcagtggtggagag
gaaggggcccggtatggcccatcgctcatgccaggagggaacaaagaagcgtggccccac
atcaagaccatcttccaaggcattgctgcaaaagtgggagatgagggagcaggccacttc
gtgaagatggtgcacaacgggatagagtatggggacatgcagctgatctgtgaggcatac
cacctgatgaaagacgtgctgggcatggcgcaggacgagatggcccaggcctttgaggat
tggaataagacagagctagactcattcctgattgaaatcacagccaatattctcaagttc
caagacaccgatggcaaacacctgctgccaaagatcagggacagcgcggggcagaagggc
acagggaagtggaccgccatctccgccctggaatacggcgtacccgtcaccctcattgga
gaagctgtctttgctcggtgcttatcatctctgaaggatgagagaattcaagctagcaaa
aagctgaagggtccccagaagttccagtttgatggtgataagaaatcattcctggaggac
attcggaagaatattggcccttctgggatctccactgctgatgagaataagactggtaga
cataaggcggtcactctcctaatggcaatcctagcactctacgcttccaagatcatctct
tacgctcaaggctttatgctgctaaggcaggcagccaccgagtttggctggactctcaat
tatggtggcatcgccctgatgtggagagggggctgcatcattagaagtgtattcctagga
aagataaaggatgcatttgatcgaaacccggaacttcagaacctcctactggacgacttc
tttaagtcagctgttgaaaactgccaggactcctggcggcgggcagtcagcactggggtc
caggctggcattcccatgccctgttttaccactgccctctccttctatgacgggtacaga
catgagatgcttccagccagcctcatccaggctcagcgggattacttcggggctcacacc
tatgaactcttggccaaaccagggcagtttatccacaccaactggacaggccatggtggc
accgtgtcatcctcgtcatacaatgcctga

>gi568815597f:10330518_10542402|GENSCAN_predicted_peptide_3|156_aa
MRPHSPALGWSMGLGAVEQGATLIGEARATQEPTEWGRPAVMEEEAETEEQQRFSYQQRL
KAAVHYTVGCLCEEVALDKEMQFSKQTIAAISELTFRQCENFAKDLEMFARMGYLEIPAS
LKSDRELLCFIKTLNNTKHFGDLLLPFLQTCEKNHN

>gi568815597f:10330518_10542402|GENSCAN_predicted_CDS_3|471_bp
atgcgcccacactccccagcccttgggtggtcgatgggactgggcgccgtggagcagggg
gccacgctcatcggggaggctcgggccacacaggagcccacggagtggggtcggcccgca
gtgatggaggaggaggcggagaccgaggagcagcagcgattctcttaccaacagaggcta
aaggcagcagttcactatactgtgggttgtctttgcgaggaagttgcattggacaaagag
atgcagttcagcaaacagaccattgcggccatttcggagctgactttccgacagtgtgaa
aattttgccaaagaccttgaaatgtttgcaaggatgggctacttagagatccctgctagt
ttgaagtctgaccgtgaacttctgtgtttcatcaaaactttaaataataccaaacatttt
ggtgacttgcttctgccctttctgcagacatgcgaaaagaaccacaattaa

>gi568815597f:10330518_10542402|GENSCAN_predicted_peptide_4|43_aa
MPLSPGLLLLLLSGATATAALPLEGGPTGRDSEDLAPEQLPNC

>gi568815597f:10330518_10542402|GENSCAN_predicted_CDS_4|132_bp
atgccattgtcccccggcctcctgctgctgctgctctccggggccacggccaccgctgcc
ctgcccctggagggtggccccaccggccgagacagcgaggatttggcaccagaacagctc
cctaactgctga

>gi568815597f:10330518_10542402|GENSCAN_predicted_peptide_5|366_aa
MEVTGDAGVPESGEIRTLKPCLLRRNYSREQHGVAASCLEDLRSKGWLGTPGEGVGEPGP
GTYAGKRELLLTISGGPDKEPCDILAIDKSLTPVTLVLAEDGTIVDDDDYFLCLPSNTKF
VALASNEKWAYNNSDGGTAWISQESFDVDETDSGAGLKWKNVARQLKEDLSSIILLSEED
LQMLVDAPCSDLAQELRQSCATVQRLQHTLQQVLDQREEVRQSKQLLQLYLQALEKEGSL
LSKQEESKAAFGEEVDAVDTGISRETSSDVALASHILTALREKQAPELSLSSQDLELVTK
EDPKALAVALNWDIKKTETVQEACERELALRLQQTQSLHSLRSISASKASPPGDLQNPKR
ARQDPT

>gi568815597f:10330518_10542402|GENSCAN_predicted_CDS_5|1101_bp
atggaggtgaccggggacgccggggtaccagaatctggcgagatccggactctaaagccg
tgtctgctgcgccgcaactacagccgcgaacagcacggcgtggccgcctcctgcctcgaa
gacctgaggagcaagggttggctcgggaccccgggcgagggtgtgggggagccagggccg
ggaacctatgcaggaaagagggagctgctactgaccatttctggaggtccagacaaggaa
ccctgtgacattctggccattgataagtccctgacaccagtcaccctggtcctggcagag
gatggcaccatagtggatgatgacgattactttctgtgtctaccttccaatactaagttt
gtggcattggctagtaatgagaaatgggcatacaacaattcagatggaggtacagcttgg
atttcccaagagtcctttgatgtagatgaaacagacagcggggcagggttgaagtggaag
aatgtggccaggcagctgaaagaagatctgtccagcatcatcctcctatcagaggaggac
ctccagatgcttgttgacgctccctgctcagacctggctcaggaactacgtcagagttgt
gccaccgtccagcggctgcagcacacactccaacaggtgcttgaccaaagagaggaagtg
cgtcagtccaagcagctcctgcagctgtacctccaggctttggagaaagagggcagcctc
ttgtcaaagcaggaagagtccaaagctgcctttggtgaggaggtggatgcagtagacacg
ggtatcagcagagagacctcctcggacgttgcgctggcgagccacatccttactgcactg
agggagaagcaggctccagagctgagcttatctagtcaggatttggagttggttaccaag
gaagaccccaaagcactggctgttgccttgaactgggacataaagaagacggagactgtt
caggaggcctgtgagcgggagctcgccctgcgcctgcagcagacgcagagcttgcattct
ctccggagcatctcagcaagcaaggcctcaccacctggtgacctgcagaatcctaagcga
gccagacaggatcccacatag

>gi568815597f:10330518_10542402|GENSCAN_predicted_peptide_6|183_aa
MASSEQAEQPSQPSSTPGSENVLPREPLGASASVVFKEQCSDVDAVKYVICLSPAFSDKD
QEEHKQAPNILYIATAVKFLQNSRVRQSPLATRRAFLKKKGETRSYPSTVPGSRNPGSKT
ASLLEMNLKKPGGPNLGSGLEPCPSVGPVQSLWGRGRALQVSEELLVRSRQGCVAAALVP
EDD

>gi568815597f:10330518_10542402|GENSCAN_predicted_CDS_6|552_bp
atggcgtcctcggagcaggcagagcagccgagccagccaagctctactccaggaagtgaa
aatgtgctgcctcgagagccgctgggtgcgagcgcctcagtggtctttaaagaacagtgt
agtgatgtcgatgctgttaaatatgtgatttgcctgtcgcctgcattttcagacaaagat
caagaggaacataaacaagccccaaatatactttatattgccacggcagtgaagtttcta
cagaattcccgggtccgccagagcccacttgcaaccaggagagcattcctaaagaagaaa
ggtgaaactagatcttatccttctactgttcctggaagtagaaaccccggtagcaaaacc
gcctccttgctggaaatgaacttgaagaaacctggcggtcccaaccttggtagtgggcta
gagccctgccctagtgtgggtccagtgcagagcctctgggggcggggccgtgccctgcag
gtgagcgaggagctgctcgtgcggagcaggcagggctgtgtggcagccgcgcttgttcct
gaggatgattga