GENSCAN 1.0 Date run: 7-Nov-116 Time: 19:49:59 Sequence gi568815594f:158580466_158808524 : 228059 bp : 39.26% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 6608 6707 100 1 1 42 70 100 0.269 4.07 1.02 Intr + 12936 13034 99 1 0 91 81 36 0.270 2.36 1.03 Intr + 18861 18966 106 1 1 64 101 72 0.141 4.45 1.04 Intr + 27507 27578 72 0 0 73 93 61 0.073 2.70 1.05 Intr + 52940 53011 72 2 0 81 50 91 0.327 2.20 1.06 Intr + 64444 64673 230 1 2 75 68 131 0.694 6.39 1.07 Intr + 66605 66736 132 0 0 53 91 44 0.281 0.90 1.08 Intr + 71292 71441 150 1 0 112 77 27 0.716 3.21 1.09 Intr + 76904 77086 183 0 0 71 55 100 0.220 3.84 1.10 Intr + 77805 77958 154 1 1 -10 81 133 0.313 1.01 1.11 Intr + 78199 78360 162 1 0 46 52 134 0.322 3.77 1.12 Intr + 79583 79742 160 2 1 34 23 128 0.018 -0.03 1.13 Intr + 80757 80896 140 2 2 45 68 130 0.226 5.04 1.14 Intr + 81181 82175 995 1 2 65 69 584 0.376 43.74 1.15 Intr + 84039 84290 252 1 0 95 -6 235 0.853 11.28 1.16 Intr + 84409 84591 183 2 0 71 80 88 0.208 5.14 1.17 Intr + 87495 87545 51 1 0 68 92 49 0.021 1.26 1.18 Term + 91031 91521 491 0 2 32 47 338 0.014 17.93 1.19 PlyA + 94582 94587 6 1.05 2.00 Prom + 97934 97973 40 -6.45 2.01 Init + 101738 101959 222 1 0 85 43 303 0.909 24.10 2.02 Intr + 104127 104213 87 2 0 107 75 41 0.541 3.95 2.03 Intr + 109883 109960 78 1 0 100 92 79 0.951 8.33 2.04 Intr + 112840 112944 105 0 0 86 119 -19 0.580 0.49 2.05 Intr + 115032 115178 147 2 0 95 53 106 0.994 7.31 2.06 Intr + 117094 117234 141 0 0 107 103 79 0.994 10.93 2.07 Intr + 118522 118665 144 0 0 17 93 83 0.693 1.26 2.08 Intr + 122958 123126 169 2 1 68 82 57 0.818 1.70 2.09 Intr + 126164 126385 222 0 0 16 90 133 0.900 3.48 2.10 Term + 127899 128062 164 1 2 75 37 219 0.999 12.62 2.11 PlyA + 128183 128188 6 1.05 3.10 PlyA - 128399 128394 6 1.05 3.09 Term - 129359 129271 89 0 2 86 43 66 0.850 -1.26 3.08 Intr - 130205 130163 43 2 1 65 115 47 0.782 1.99 3.07 Intr - 132854 132654 201 2 0 9 107 109 0.836 3.46 3.06 Intr - 135219 135097 123 0 0 12 76 151 0.954 6.16 3.05 Intr - 136735 136547 189 1 0 70 106 160 0.987 14.86 3.04 Intr - 138821 138715 107 0 2 82 98 92 0.995 8.61 3.03 Intr - 141018 140878 141 1 0 83 47 103 0.364 5.10 3.02 Intr - 142801 142739 63 2 0 18 83 103 0.079 0.57 3.01 Init - 148772 148634 139 2 1 82 95 102 0.586 10.66 3.00 Prom - 150473 150434 40 -4.55 4.00 Prom + 156295 156334 40 -5.75 4.01 Init + 159118 159244 127 0 1 29 35 145 0.522 3.67 4.02 Intr + 162195 162313 119 1 2 76 37 78 0.005 0.76 4.03 Intr + 186295 186336 42 0 0 89 96 31 0.000 1.52 4.04 Intr + 188079 188246 168 2 0 33 -12 217 0.000 5.42 4.05 Intr + 188718 188854 137 2 2 -1 66 171 0.000 4.45 4.06 Intr + 215445 215559 115 0 1 46 75 129 0.322 6.93 4.07 Intr + 222204 222348 145 2 1 60 31 112 0.072 1.63 4.08 Term + 225689 225849 161 0 2 49 48 123 0.262 1.52 4.09 PlyA + 226551 226556 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 82839 82914 76 2 1 101 100 40 0.823 7.80 S.002 Term - 89303 89148 156 2 0 90 34 172 0.947 8.95 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594f:158580466_158808524|GENSCAN_predicted_peptide_1|1243_aa MLSLHLISITDPEVGGCGSNRNMNVVHRVKRGPGDNNGWSLQFDKYFASYYKMTSQYPFE AETPECLVGSVPVQCLCQGLELDCDETNLRAVPSVSSNVTAMYLQNNKITSISIYAFRGL NSLTKLDLGSNKIENLPPLIFKDLKELSQLYFKKFQYCGYAPHVRSCKPNTDGISSLENL LASIIQRVFVWVVSAVTCFGNIFVICMRPYIRSENKLYAMSIISLCLAFIPLSNKEFFKN YYGTNGVCFPLHSEDTESIGAQIYSVAIFLGTITSWVVIFILPINSALNPILYTLTTRPF KEMIHRFWYNYRQRKSMDSKASVSTLAALEEPFSPPMRSRGPSLGLAEARAGSLCSQGGV EGEMREGAGAACSACGTSWVLDTPSLRTVTLTASVCDFILEVSETKNPPIPDTFWRPRGD YRLSSSGETIVKRVGSIGLPGTSFRFSCTSGLSRASTERKAIQLRGADNKLVDPAAMSGT LKGMSPNLPVAPLTINDNPPLISPAQKEISKEISKGPQIPPDHQLCPLQAVGGGEFGPTW VCLTIEGQEIDFLLDIGMAFSVLISCPGRLSSRSVTIRGILGQPVTRQYPLRPEAHKGLQ DIIKHLKAQGLLRKCGSPCNTPILGVQKPNGQWRLVQDLRLINEAVIPLYPVVPNPYTLL SQITEEAEWFTVLDLKDAFFCIPLHSDSQFLFAFQDPTDHTSQLTWTVLPKGFRDSSHLF GQALAQDVGHFSSPGTLVLQYVDDLLLAVSLEASCQQATLDLLNFLANQGYKVSRSKAQL CLQQVKYLGLILARGTRALSKEQIQPILAYPHPKTLKQLQGFLGITGFWRLWIPGYSEIA GPLYTIIKETQRANTHLVEWEPGAETAFKTLKQAIVQAPALSLPTGQNFSLYITERAGIA LGVLTQTRGTTPQPVAYLSPVAAMLLLLAFGPCIFNLLVKFVSSRIEAIKLQMVLQMEPQ MSSTNNFYRGPLDRPAGPSTGLKSSPLEDTTAGPLLRPYPAGTSVSTLDALEEPFSLPLH CGGPSLGLAEAGAGSLCSRGGVEGEAWVGAGAVHGACGPAWVPVVTSGDSGEGGTGQDQT EGPVVPGHGGWGTDSLASRAPPPPVFPEAPRGDAVPTPHSRLRSAAPLPPAGPPLGRCSC SELASPNSLGRPGKMLQRRLKRREKREAAGAAKKPATPQGRPWGRVGTAESDPRSRTDTN CLLTTRAQSRKAPPPNARNPAQSPRGSQPRTTKTFHFSGQPII >gi568815594f:158580466_158808524|GENSCAN_predicted_CDS_1|3732_bp atgctgtcacttcacctaatttccatcactgatcctgaggtaggtggatgtggcagtaac aggaacatgaacgtggtgcacagagtgaagcgaggccctggagacaacaatggatggtct ctgcaatttgacaaatattttgccagttactacaaaatgacttcccaatatccttttgag gcagaaacacctgaatgtttggtcggttctgtgccagtgcaatgtctttgccaaggtctg gagcttgactgtgatgaaaccaatttacgagctgttccatcggtttcttcaaatgtgact gcaatgtacctgcaaaacaataagattacatccatctccatctatgctttcagaggactg aatagccttactaaactggatttaggaagtaataagattgaaaatcttccaccgcttata ttcaaggacctgaaggagctgtcacaattatattttaagaaattccagtactgtgggtat gcaccacatgttcgcagctgtaaaccaaacactgatggaatttcatctctagagaatctc ttggcaagcattattcagagagtatttgtctgggttgtatctgcagttacctgctttgga aacatttttgtcatttgcatgcgaccttatatcaggtctgagaacaagctgtatgccatg tcaatcatttctctctgcttggctttcattccattgagcaataaggaatttttcaaaaac tactatggcaccaatggagtatgcttccctcttcattcagaagatacagaaagtattgga gcccagatttattcagtggcaatttttcttggtaccataacctcttgggtagtgattttt attctgcccattaacagtgctttgaacccaattctctatactctgaccacaagaccattt aaagaaatgattcatcggttttggtataactacagacaaagaaaatctatggacagcaaa gcctcagtgtccactctggctgcactggaggagcccttcagcccgccaatgcgcagtagg ggcccctctctggggctggctgaggccagagctggctccctctgctcgcagggaggtgtg gagggagagatgcgggagggagctggggctgcatgcagtgcttgtgggacgtcgtgggtt ctggacacaccatctttaagaactgtaacactcactgcgagtgtctgcgacttcattctt gaagtcagcgagacaaagaacccaccaattccggacacattttggcgaccacgaggggac tatcgcctatcgtcaagtggtgagactatcgtcaagcgagttgggagcattggtttgcct ggaaccagcttccgcttttcctgtacttctgggctgagccgagcatcgacagagaggaaa gccattcagctccggggtgccgacaacaagttggttgacccagcagccatgagcggaact ctcaaaggcatgtcgcccaacctccctgtagctccccttactattaatgataatcctcct ctaatctcccctgctcagaaagaaataagcaaagaaatctccaaaggaccacaaatcccc ccggaccatcagttatgtccccttcaagctgtagggggaggggaatttggcccaacctgg gtatgtttaaccattgagggccaggaaattgacttcctcctggacattggcatggccttc tcagtgttaatctcctgtcctggacgactgtcctcaaggtctgttaccatccgaggaatc ctgggacagcctgtaaccaggcaatatcccttaaggcctgaagctcataaaggtttacag gatattattaaacatttaaaagctcaaggcttactaaggaaatgtggcagtccctgcaac accccaattctaggagtacaaaaaccaaacggtcagtggagactagtgcaagatcttaga ctcatcaatgaggcagtaattcctctatatccagttgtacccaacccctataccttgctc tctcaaataacagaggaagcagaatggttcactgttctggacctcaaggatgccttcttc tgtattcccctgcactctgactcccagtttctctttgccttccaggatcccacagaccac acatcccaacttacgtggacagtcttgcccaaagggtttagggatagctctcatctattt ggtcaggcactggcccaagatgtaggtcacttctcaagtccaggcactctggtccttcag tatgtggatgatttacttttggctgtcagtttggaagcctcatgccagcaggctactcta gatctcttgaactttctagctaatcaagggtacaaggtgtctaggtcgaaggcccagctt tgcctacagcaggtcaaatatctgggcctaatcttagccagagggaccagggccctcagc aaggaacaaatacagcctatactggcttatcctcaccctaagacattaaaacagttgcag gggttccttggaatcaccggcttttggcgactatggatccccggatacagcgagatagct gggcccctctatactataatcaaggagacccagagggcaaatactcatctagtagaatgg gaaccaggggcagaaacagccttcaaaaccttaaagcaggctatagtacaagctccagct ttaagccttcccacaggacaaaacttttctttatacatcacagagagagcagggatagct cttggagtccttactcagactcgtgggacaaccccacaaccagtggcctacctaagtccc gtggcagccatgttgctgttactcgcctttggaccctgtatttttaaccttcttgtcaaa tttgtttcctctagaatcgaggccatcaagctacagatggtcttacaaatggaaccccaa atgagctcaactaacaacttctaccgaggacccctggaccgacctgctggcccttccact ggcctaaagagttcccctctggaggacacaactgcagggccacttcttcgcccctatcca gcaggaacctcggtgtccactctggatgcactggaggagcccttcagcctgccgctgcac tgtgggggcccctctctggggctggccgaggccggagccggctccctctgttcacgggga ggtgtggagggagaggcatgggtgggagctggggctgtgcacggtgcttgcgggccggcg tgggttccagtggttacctctggggactctggggagggaggaacgggacaggatcaaaca gaagggccggtcgtcccgggtcatgggggttgggggaccgattccctagcctcgcgagcc ccgccgccgccggtcttcccagaggcgccgaggggggacgcggtcccgacaccacactca cgtctccgatctgcagctccacttcctccagctggtccaccgttgggccgctgctcctgc tcggaactggccagcccaaactcactgggccgcccggggaagatgctgcagaggcgtctg aagaggagggagaagagggaggcggcgggggcggcgaagaaacctgcaactcctcagggt cggccatggggaagggttgggaccgcggaatccgacccgagaagccgaaccgacaccaac tgtcttttaaccactcgcgcccaaagccgaaaggccccgcctcctaacgccagaaatccc gcccagtctcctcgtgggtctcagccccggaccacaaagacgtttcatttttcgggacag ccaatcatatga >gi568815594f:158580466_158808524|GENSCAN_predicted_peptide_2|492_aa MERFAEEADVVIVGAGPAGLSAAVRLKQLAVAHEKDIRVCLVEKAAQIGAHTLSGACLDP GAFKELFPDWKEKGAPLNTPVTEDRFGILTEKYRIPVPILPGKVLFHDDGSVKGIATNDV GIQKDGAPKPSLGSYLQYASYHKNSDLDIFKLQRLSNLAGVRHLATFERGLELHAKVTIF AEGCHGHLAKQLYKKFDLRANCEPQTYGIGLKELWVIDEKNWKPGRVDHTVGWPLDRHTY GGSFLYHLNEGEPLVALGLVVGLDYQNPYLSPFREFQRWKHHPSIRPTLEGGKRIAYGAR ALNEGGFQSIPKLTFPGGLLIGCSPGFMNVPKIKGTHTAMKSGILAAESIFNQLTSENLQ SKTIGSDFERLKPAKDCTPIEYPKPDGQISFDLLSSVALSGTNHEHDQPAHLTLRDDSIP VNRNLSIYDGPEQRFCPAGVYEFVPVEQGDGFRLQINAQNCVHCKTCDIKDPSQNINWVV PEGGGGPAYNGM >gi568815594f:158580466_158808524|GENSCAN_predicted_CDS_2|1479_bp atggaaaggtttgcagaagaagcagatgttgtaatagttggtgcaggccctgcagggctc tctgcagctgttcgtctaaaacagttggctgtggcacatgaaaaggacatccgtgtgtgt ctagtggagaaagctgcccagataggagctcatactctctcaggggcttgccttgatcca ggtgcttttaaagaactcttcccagactggaaagagaagggggctccacttaacactcct gtaacagaagacagatttggaattttaacagagaaatacagaattcctgtgccaattctt ccaggtaaggtcctttttcatgatgatggtagtgtaaaaggaattgccactaacgatgta gggatacaaaaggatggtgcaccaaagccttctcttggttcctatctccaatatgcatct tatcataagaattctgatttggacatttttaagttacagaggttaagtaacttggctggg gtcagacacctggcaacatttgagagaggactggaactacatgctaaagtcacaattttt gcagaaggttgccatggacatctagccaagcaactatataagaagtttgatttgagagca aattgtgaacctcaaacctacgggattggactgaaggagttatgggttattgatgaaaag aactggaaacctgggagagtagatcacactgttggttggcccttggacagacatacctat ggaggatctttcctctatcatttgaatgaaggtgaacccctagtagctcttggtcttgtg gttggtctagactatcagaatccatacctgagtccatttagagagttccaaaggtggaaa caccatcctagcattcggccaaccttggaaggtggaaaaaggattgcatacggagccaga gctctcaatgaaggtggctttcagtctataccaaaactcacctttcctggtggtttacta attggttgtagtcctggttttatgaatgttcccaagatcaaaggtactcacacagcaatg aaaagtggaattttagcagcagaatctatttttaatcaactaactagtgaaaatctccaa tcaaagacaataggttctgactttgaacggctcaagccagccaaggattgcacacctatt gagtatccaaaacccgatggacagatcagttttgacctcttgtcatctgtggctctgagt ggtactaatcatgaacatgaccagccggcacacttaaccttaagggatgacagtatacct gtaaatagaaatctgtcgatatatgatgggcccgagcagcgattctgtcctgcaggagtt tatgaatttgtacctgtggaacaaggtgatggatttcggttacagataaatgctcagaac tgtgtacattgtaaaacatgtgatattaaagatccaagtcagaatattaactgggtggta cctgaaggtggaggaggacctgcttacaatggaatgtaa >gi568815594f:158580466_158808524|GENSCAN_predicted_peptide_3|364_aa MNLVLFLGPVAFLGQFRNQGTRPSAPGFHKPIYGGLVASQVCDGDHAKPSNPSNPRVFFD VDIGGERVGRIVLELFADIVPKTAENFRALCTGEKGIGHTTGKPLHFKGCPFHRIIKKFM IQGGDFSNQNGTGGESIYGEKFEDENFHYKHDREGLLSMANAGRNTNGSQFFITTVPTPH LDGKHVVFGQVIKGIGVARILENVEVKGEKPAKLCVIAECGELKEGDDGGIFPKDGSGDS HPDFPEDADIDLKDRRVIPVYVVFHTAFINAFCRYVDSSKAVIETADRAKLQPIALSCVL NIGACKLKMSNWQGAIDSCLEADLKKAQGIAPEDKAIQAELLKVKQKIKAQKDKEKAVYA KMFA >gi568815594f:158580466_158808524|GENSCAN_predicted_CDS_3|1095_bp atgaatcttgttctctttctggggcctgttgccttcctgggacagttcaggaatcaagga acacgtccctctgctcctggatttcacaaacccatctatggaggtttggtggcttcccag gtatgtgacggggaccatgccaagccctccaaccccagtaaccctcgagtcttctttgac gtggacatcggaggggagcgagttggtcgaattgtcttagaattgtttgcagatatcgta cccaaaactgcggaaaattttcgtgcactgtgtacaggagaaaaaggcattggacacacg actgggaaacctctccatttcaaaggatgcccttttcatcgaattattaagaaatttatg attcagggtggagacttctcaaatcagaatgggacaggtggagaaagtatttatggtgaa aaatttgaagatgaaaatttccattacaagcatgatcgggagggtttactgagcatggca aatgcaggccgcaacacaaacggttctcagttttttatcacaacagttccaactcctcat ttggatgggaaacatgtggtgtttggccaagtaattaaaggaataggagtggcaaggata ttggaaaatgtggaagtgaaaggtgaaaaacctgctaaattgtgcgttattgcagaatgt ggagaattgaaggaaggagatgacgggggaatattcccaaaagatggctctggcgacagt catccagatttccctgaggatgcggatatagatttaaaagatagaagagtaattcctgtt tatgtggtctttcatactgcttttattaatgcattctgcagatacgtggacagttcaaag gctgttattgagacagcagatagagccaagctgcaacctatagctttaagctgtgtactg aatattggtgcttgtaaactgaagatgtcaaattggcagggagcaattgacagttgttta gaggctgatcttaagaaagctcaggggatagcaccagaagataaagctatccaggcagaa ttgctgaaagtcaaacaaaagataaaggcacagaaagataaagagaaggcagtatatgca aaaatgtttgcttag >gi568815594f:158580466_158808524|GENSCAN_predicted_peptide_4|337_aa MIVMSEKVHDCDSASGKGHRLLALIEECEGELLCAEIKSERGGECSFKEGGWERTKPEGA EEGSGKNSPGRTPNTKAQVENSLRTRTASWYIFQDRPTEAATARTKPSPLAYAFCERALV PEQSEDPSSGALSLKNKDRPSCMLAVRKNPGSRHGRSCGGGIMAPTLLQKLFNKRGSSGS SAAASAQGRAPKEGPAFSPWKNCLPRSQSLVLEKVGDRCPKPWLVAVCRLGDCCSKAWEW AFSQVSNLGLEVGPGSTASEGFGEESGHHRRQCDCLEILATGKRECSSHNKGTSGFVSKS MFMVRINQKNIKPFRRCAAAQRTPQTSQRAGKTLPDV >gi568815594f:158580466_158808524|GENSCAN_predicted_CDS_4|1014_bp atgattgtgatgtctgaaaaagttcatgattgtgattctgcatctggtaagggccacagg ctgcttgcactcatagaagaatgtgaaggggagctgctgtgtgcagagatcaagagcgaa agaggaggggaatgtagttttaaagagggtggttgggaaaggacaaaacctgaaggagct gaggaaggttcagggaagaacagcccaggaagaacaccaaatactaaggcccaagttgag aacagtctgcgtacaagaacagccagctggtatattttccaagataggcccaccgaagcg gccaccgcgcgtaccaagccttcccccctagcatacgccttctgtgaacgggctcttgta cctgaacaaagcgaggacccaagttcaggggctttgtctctaaagaacaaagatcggcct agctgcatgctcgctgtgcgaaagaacccaggctcccgccacggccggagctgcggcggc ggcatcatggccccgaccctgctccagaagctcttcaacaaaaggggcagcagcggcagc tccgcggcggcgtctgcccagggcagggctcctaaggaaggacccgcctttagtccatgg aaaaattgtcttccacgaagccagtccctggtcctggaaaaggttggggaccgctgccct aaaccgtggttggtggcagtgtgtagactcggtgactgttgttccaaggcatgggaatgg gctttttctcaggtgtcaaatcttggattggaggtgggtcctggcagcactgcatccgaa ggttttggagaggagtctgggcatcacaggagacaatgtgattgtttagagattttggcc acggggaaaagagaatgttcctcacataacaaaggaacttcaggatttgtctccaagtcc atgtttatggtcaggataaaccagaagaacattaaaccgttcaggagatgtgcggcggca cagcgaacaccacaaaccagccagagagctggcaagactctgccagatgtgtga