GENSCAN 1.0 Date run: 5-Nov-116 Time: 04:09:58 Sequence gi568815594r:158609739_158823288 : 213550 bp : 39.59% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 18904 18971 68 0 2 63 87 37 0.555 1.70 1.02 Intr + 23667 23738 72 0 0 81 50 91 0.353 2.20 1.03 Intr + 35171 35400 230 2 2 75 68 131 0.696 6.39 1.04 Intr + 37332 37463 132 1 0 53 91 44 0.281 0.90 1.05 Intr + 42019 42168 150 2 0 112 77 27 0.715 3.21 1.06 Intr + 47631 47813 183 1 0 71 55 100 0.220 3.84 1.07 Intr + 48532 48685 154 2 1 -10 81 133 0.313 1.01 1.08 Intr + 48926 49087 162 2 0 46 52 134 0.322 3.77 1.09 Intr + 50310 50469 160 0 1 34 23 128 0.018 -0.03 1.10 Intr + 51484 51623 140 0 2 45 68 130 0.226 5.04 1.11 Intr + 51908 52902 995 2 2 65 69 584 0.376 43.74 1.12 Intr + 54766 55017 252 2 0 95 -6 235 0.853 11.28 1.13 Intr + 55136 55318 183 0 0 71 80 88 0.208 5.14 1.14 Intr + 58222 58272 51 2 0 68 92 49 0.021 1.26 1.15 Term + 61758 62248 491 1 2 32 47 338 0.014 17.93 1.16 PlyA + 65309 65314 6 1.05 2.00 Prom + 68661 68700 40 -6.45 2.01 Init + 72465 72686 222 2 0 85 43 303 0.909 24.10 2.02 Intr + 74854 74940 87 0 0 107 75 41 0.541 3.95 2.03 Intr + 80610 80687 78 2 0 100 92 79 0.951 8.33 2.04 Intr + 83567 83671 105 1 0 86 119 -19 0.580 0.49 2.05 Intr + 85759 85905 147 0 0 95 53 106 0.994 7.31 2.06 Intr + 87821 87961 141 1 0 107 103 79 0.994 10.93 2.07 Intr + 89249 89392 144 1 0 17 93 83 0.693 1.26 2.08 Intr + 93685 93853 169 0 1 68 82 57 0.818 1.70 2.09 Intr + 96891 97112 222 1 0 16 90 133 0.900 3.48 2.10 Term + 98626 98789 164 2 2 75 37 219 0.999 12.62 2.11 PlyA + 98910 98915 6 1.05 3.10 PlyA - 99126 99121 6 1.05 3.09 Term - 100086 99998 89 1 2 86 43 66 0.850 -1.26 3.08 Intr - 100932 100890 43 0 1 65 115 47 0.782 1.99 3.07 Intr - 103581 103381 201 0 0 9 107 109 0.836 3.46 3.06 Intr - 105946 105824 123 1 0 12 76 151 0.954 6.16 3.05 Intr - 107462 107274 189 2 0 70 106 160 0.987 14.86 3.04 Intr - 109548 109442 107 1 2 82 98 92 0.995 8.61 3.03 Intr - 111745 111605 141 2 0 83 47 103 0.364 5.10 3.02 Intr - 113528 113466 63 0 0 18 83 103 0.079 0.57 3.01 Init - 119499 119361 139 0 1 82 95 102 0.586 10.66 3.00 Prom - 121200 121161 40 -4.55 4.00 Prom + 127022 127061 40 -5.75 4.01 Init + 129845 129971 127 1 1 29 35 145 0.522 3.67 4.02 Intr + 132922 133040 119 2 2 76 37 78 0.005 0.76 4.03 Intr + 157022 157063 42 1 0 89 96 31 0.000 1.52 4.04 Intr + 158806 158973 168 0 0 33 -12 217 0.000 5.42 4.05 Intr + 159445 159581 137 0 2 -1 66 171 0.000 4.45 4.06 Intr + 186172 186286 115 1 1 46 75 129 0.324 6.93 4.07 Intr + 192931 193075 145 0 1 60 31 112 0.072 1.63 4.08 Intr + 196416 196572 157 1 1 49 56 114 0.061 2.45 4.09 Intr + 201123 201202 80 0 2 48 89 56 0.542 -0.02 4.10 Intr + 204379 204497 119 2 2 104 54 57 0.241 3.16 4.11 Term + 209058 209186 129 2 0 90 36 60 0.181 -1.80 4.12 PlyA + 209543 209548 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 53566 53641 76 0 1 101 100 40 0.823 7.80 S.002 Term - 60030 59875 156 0 0 90 34 172 0.947 8.95 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594r:158609739_158823288|GENSCAN_predicted_peptide_1|1140_aa MRKNKINHLNENTFAPLQKLDELDLGSNKIENLPPLIFKDLKELSQLYFKKFQYCGYAPH VRSCKPNTDGISSLENLLASIIQRVFVWVVSAVTCFGNIFVICMRPYIRSENKLYAMSII SLCLAFIPLSNKEFFKNYYGTNGVCFPLHSEDTESIGAQIYSVAIFLGTITSWVVIFILP INSALNPILYTLTTRPFKEMIHRFWYNYRQRKSMDSKASVSTLAALEEPFSPPMRSRGPS LGLAEARAGSLCSQGGVEGEMREGAGAACSACGTSWVLDTPSLRTVTLTASVCDFILEVS ETKNPPIPDTFWRPRGDYRLSSSGETIVKRVGSIGLPGTSFRFSCTSGLSRASTERKAIQ LRGADNKLVDPAAMSGTLKGMSPNLPVAPLTINDNPPLISPAQKEISKEISKGPQIPPDH QLCPLQAVGGGEFGPTWVCLTIEGQEIDFLLDIGMAFSVLISCPGRLSSRSVTIRGILGQ PVTRQYPLRPEAHKGLQDIIKHLKAQGLLRKCGSPCNTPILGVQKPNGQWRLVQDLRLIN EAVIPLYPVVPNPYTLLSQITEEAEWFTVLDLKDAFFCIPLHSDSQFLFAFQDPTDHTSQ LTWTVLPKGFRDSSHLFGQALAQDVGHFSSPGTLVLQYVDDLLLAVSLEASCQQATLDLL NFLANQGYKVSRSKAQLCLQQVKYLGLILARGTRALSKEQIQPILAYPHPKTLKQLQGFL GITGFWRLWIPGYSEIAGPLYTIIKETQRANTHLVEWEPGAETAFKTLKQAIVQAPALSL PTGQNFSLYITERAGIALGVLTQTRGTTPQPVAYLSPVAAMLLLLAFGPCIFNLLVKFVS SRIEAIKLQMVLQMEPQMSSTNNFYRGPLDRPAGPSTGLKSSPLEDTTAGPLLRPYPAGT SVSTLDALEEPFSLPLHCGGPSLGLAEAGAGSLCSRGGVEGEAWVGAGAVHGACGPAWVP VVTSGDSGEGGTGQDQTEGPVVPGHGGWGTDSLASRAPPPPVFPEAPRGDAVPTPHSRLR SAAPLPPAGPPLGRCSCSELASPNSLGRPGKMLQRRLKRREKREAAGAAKKPATPQGRPW GRVGTAESDPRSRTDTNCLLTTRAQSRKAPPPNARNPAQSPRGSQPRTTKTFHFSGQPII >gi568815594r:158609739_158823288|GENSCAN_predicted_CDS_1|3423_bp atgaggaaaaacaaaattaatcacttaaatgaaaatacttttgcacctctccagaaactg gatgaattggatttaggaagtaataagattgaaaatcttccaccgcttatattcaaggac ctgaaggagctgtcacaattatattttaagaaattccagtactgtgggtatgcaccacat gttcgcagctgtaaaccaaacactgatggaatttcatctctagagaatctcttggcaagc attattcagagagtatttgtctgggttgtatctgcagttacctgctttggaaacattttt gtcatttgcatgcgaccttatatcaggtctgagaacaagctgtatgccatgtcaatcatt tctctctgcttggctttcattccattgagcaataaggaatttttcaaaaactactatggc accaatggagtatgcttccctcttcattcagaagatacagaaagtattggagcccagatt tattcagtggcaatttttcttggtaccataacctcttgggtagtgatttttattctgccc attaacagtgctttgaacccaattctctatactctgaccacaagaccatttaaagaaatg attcatcggttttggtataactacagacaaagaaaatctatggacagcaaagcctcagtg tccactctggctgcactggaggagcccttcagcccgccaatgcgcagtaggggcccctct ctggggctggctgaggccagagctggctccctctgctcgcagggaggtgtggagggagag atgcgggagggagctggggctgcatgcagtgcttgtgggacgtcgtgggttctggacaca ccatctttaagaactgtaacactcactgcgagtgtctgcgacttcattcttgaagtcagc gagacaaagaacccaccaattccggacacattttggcgaccacgaggggactatcgccta tcgtcaagtggtgagactatcgtcaagcgagttgggagcattggtttgcctggaaccagc ttccgcttttcctgtacttctgggctgagccgagcatcgacagagaggaaagccattcag ctccggggtgccgacaacaagttggttgacccagcagccatgagcggaactctcaaaggc atgtcgcccaacctccctgtagctccccttactattaatgataatcctcctctaatctcc cctgctcagaaagaaataagcaaagaaatctccaaaggaccacaaatccccccggaccat cagttatgtccccttcaagctgtagggggaggggaatttggcccaacctgggtatgttta accattgagggccaggaaattgacttcctcctggacattggcatggccttctcagtgtta atctcctgtcctggacgactgtcctcaaggtctgttaccatccgaggaatcctgggacag cctgtaaccaggcaatatcccttaaggcctgaagctcataaaggtttacaggatattatt aaacatttaaaagctcaaggcttactaaggaaatgtggcagtccctgcaacaccccaatt ctaggagtacaaaaaccaaacggtcagtggagactagtgcaagatcttagactcatcaat gaggcagtaattcctctatatccagttgtacccaacccctataccttgctctctcaaata acagaggaagcagaatggttcactgttctggacctcaaggatgccttcttctgtattccc ctgcactctgactcccagtttctctttgccttccaggatcccacagaccacacatcccaa cttacgtggacagtcttgcccaaagggtttagggatagctctcatctatttggtcaggca ctggcccaagatgtaggtcacttctcaagtccaggcactctggtccttcagtatgtggat gatttacttttggctgtcagtttggaagcctcatgccagcaggctactctagatctcttg aactttctagctaatcaagggtacaaggtgtctaggtcgaaggcccagctttgcctacag caggtcaaatatctgggcctaatcttagccagagggaccagggccctcagcaaggaacaa atacagcctatactggcttatcctcaccctaagacattaaaacagttgcaggggttcctt ggaatcaccggcttttggcgactatggatccccggatacagcgagatagctgggcccctc tatactataatcaaggagacccagagggcaaatactcatctagtagaatgggaaccaggg gcagaaacagccttcaaaaccttaaagcaggctatagtacaagctccagctttaagcctt cccacaggacaaaacttttctttatacatcacagagagagcagggatagctcttggagtc cttactcagactcgtgggacaaccccacaaccagtggcctacctaagtcccgtggcagcc atgttgctgttactcgcctttggaccctgtatttttaaccttcttgtcaaatttgtttcc tctagaatcgaggccatcaagctacagatggtcttacaaatggaaccccaaatgagctca actaacaacttctaccgaggacccctggaccgacctgctggcccttccactggcctaaag agttcccctctggaggacacaactgcagggccacttcttcgcccctatccagcaggaacc tcggtgtccactctggatgcactggaggagcccttcagcctgccgctgcactgtgggggc ccctctctggggctggccgaggccggagccggctccctctgttcacggggaggtgtggag ggagaggcatgggtgggagctggggctgtgcacggtgcttgcgggccggcgtgggttcca gtggttacctctggggactctggggagggaggaacgggacaggatcaaacagaagggccg gtcgtcccgggtcatgggggttgggggaccgattccctagcctcgcgagccccgccgccg ccggtcttcccagaggcgccgaggggggacgcggtcccgacaccacactcacgtctccga tctgcagctccacttcctccagctggtccaccgttgggccgctgctcctgctcggaactg gccagcccaaactcactgggccgcccggggaagatgctgcagaggcgtctgaagaggagg gagaagagggaggcggcgggggcggcgaagaaacctgcaactcctcagggtcggccatgg ggaagggttgggaccgcggaatccgacccgagaagccgaaccgacaccaactgtctttta accactcgcgcccaaagccgaaaggccccgcctcctaacgccagaaatcccgcccagtct cctcgtgggtctcagccccggaccacaaagacgtttcatttttcgggacagccaatcata tga >gi568815594r:158609739_158823288|GENSCAN_predicted_peptide_2|492_aa MERFAEEADVVIVGAGPAGLSAAVRLKQLAVAHEKDIRVCLVEKAAQIGAHTLSGACLDP GAFKELFPDWKEKGAPLNTPVTEDRFGILTEKYRIPVPILPGKVLFHDDGSVKGIATNDV GIQKDGAPKPSLGSYLQYASYHKNSDLDIFKLQRLSNLAGVRHLATFERGLELHAKVTIF AEGCHGHLAKQLYKKFDLRANCEPQTYGIGLKELWVIDEKNWKPGRVDHTVGWPLDRHTY GGSFLYHLNEGEPLVALGLVVGLDYQNPYLSPFREFQRWKHHPSIRPTLEGGKRIAYGAR ALNEGGFQSIPKLTFPGGLLIGCSPGFMNVPKIKGTHTAMKSGILAAESIFNQLTSENLQ SKTIGSDFERLKPAKDCTPIEYPKPDGQISFDLLSSVALSGTNHEHDQPAHLTLRDDSIP VNRNLSIYDGPEQRFCPAGVYEFVPVEQGDGFRLQINAQNCVHCKTCDIKDPSQNINWVV PEGGGGPAYNGM >gi568815594r:158609739_158823288|GENSCAN_predicted_CDS_2|1479_bp atggaaaggtttgcagaagaagcagatgttgtaatagttggtgcaggccctgcagggctc tctgcagctgttcgtctaaaacagttggctgtggcacatgaaaaggacatccgtgtgtgt ctagtggagaaagctgcccagataggagctcatactctctcaggggcttgccttgatcca ggtgcttttaaagaactcttcccagactggaaagagaagggggctccacttaacactcct gtaacagaagacagatttggaattttaacagagaaatacagaattcctgtgccaattctt ccaggtaaggtcctttttcatgatgatggtagtgtaaaaggaattgccactaacgatgta gggatacaaaaggatggtgcaccaaagccttctcttggttcctatctccaatatgcatct tatcataagaattctgatttggacatttttaagttacagaggttaagtaacttggctggg gtcagacacctggcaacatttgagagaggactggaactacatgctaaagtcacaattttt gcagaaggttgccatggacatctagccaagcaactatataagaagtttgatttgagagca aattgtgaacctcaaacctacgggattggactgaaggagttatgggttattgatgaaaag aactggaaacctgggagagtagatcacactgttggttggcccttggacagacatacctat ggaggatctttcctctatcatttgaatgaaggtgaacccctagtagctcttggtcttgtg gttggtctagactatcagaatccatacctgagtccatttagagagttccaaaggtggaaa caccatcctagcattcggccaaccttggaaggtggaaaaaggattgcatacggagccaga gctctcaatgaaggtggctttcagtctataccaaaactcacctttcctggtggtttacta attggttgtagtcctggttttatgaatgttcccaagatcaaaggtactcacacagcaatg aaaagtggaattttagcagcagaatctatttttaatcaactaactagtgaaaatctccaa tcaaagacaataggttctgactttgaacggctcaagccagccaaggattgcacacctatt gagtatccaaaacccgatggacagatcagttttgacctcttgtcatctgtggctctgagt ggtactaatcatgaacatgaccagccggcacacttaaccttaagggatgacagtatacct gtaaatagaaatctgtcgatatatgatgggcccgagcagcgattctgtcctgcaggagtt tatgaatttgtacctgtggaacaaggtgatggatttcggttacagataaatgctcagaac tgtgtacattgtaaaacatgtgatattaaagatccaagtcagaatattaactgggtggta cctgaaggtggaggaggacctgcttacaatggaatgtaa >gi568815594r:158609739_158823288|GENSCAN_predicted_peptide_3|364_aa MNLVLFLGPVAFLGQFRNQGTRPSAPGFHKPIYGGLVASQVCDGDHAKPSNPSNPRVFFD VDIGGERVGRIVLELFADIVPKTAENFRALCTGEKGIGHTTGKPLHFKGCPFHRIIKKFM IQGGDFSNQNGTGGESIYGEKFEDENFHYKHDREGLLSMANAGRNTNGSQFFITTVPTPH LDGKHVVFGQVIKGIGVARILENVEVKGEKPAKLCVIAECGELKEGDDGGIFPKDGSGDS HPDFPEDADIDLKDRRVIPVYVVFHTAFINAFCRYVDSSKAVIETADRAKLQPIALSCVL NIGACKLKMSNWQGAIDSCLEADLKKAQGIAPEDKAIQAELLKVKQKIKAQKDKEKAVYA KMFA >gi568815594r:158609739_158823288|GENSCAN_predicted_CDS_3|1095_bp atgaatcttgttctctttctggggcctgttgccttcctgggacagttcaggaatcaagga acacgtccctctgctcctggatttcacaaacccatctatggaggtttggtggcttcccag gtatgtgacggggaccatgccaagccctccaaccccagtaaccctcgagtcttctttgac gtggacatcggaggggagcgagttggtcgaattgtcttagaattgtttgcagatatcgta cccaaaactgcggaaaattttcgtgcactgtgtacaggagaaaaaggcattggacacacg actgggaaacctctccatttcaaaggatgcccttttcatcgaattattaagaaatttatg attcagggtggagacttctcaaatcagaatgggacaggtggagaaagtatttatggtgaa aaatttgaagatgaaaatttccattacaagcatgatcgggagggtttactgagcatggca aatgcaggccgcaacacaaacggttctcagttttttatcacaacagttccaactcctcat ttggatgggaaacatgtggtgtttggccaagtaattaaaggaataggagtggcaaggata ttggaaaatgtggaagtgaaaggtgaaaaacctgctaaattgtgcgttattgcagaatgt ggagaattgaaggaaggagatgacgggggaatattcccaaaagatggctctggcgacagt catccagatttccctgaggatgcggatatagatttaaaagatagaagagtaattcctgtt tatgtggtctttcatactgcttttattaatgcattctgcagatacgtggacagttcaaag gctgttattgagacagcagatagagccaagctgcaacctatagctttaagctgtgtactg aatattggtgcttgtaaactgaagatgtcaaattggcagggagcaattgacagttgttta gaggctgatcttaagaaagctcaggggatagcaccagaagataaagctatccaggcagaa ttgctgaaagtcaaacaaaagataaaggcacagaaagataaagagaaggcagtatatgca aaaatgtttgcttag >gi568815594r:158609739_158823288|GENSCAN_predicted_peptide_4|445_aa MIVMSEKVHDCDSASGKGHRLLALIEECEGELLCAEIKSERGGECSFKEGGWERTKPEGA EEGSGKNSPGRTPNTKAQVENSLRTRTASWYIFQDRPTEAATARTKPSPLAYAFCERALV PEQSEDPSSGALSLKNKDRPSCMLAVRKNPGSRHGRSCGGGIMAPTLLQKLFNKRGSSGS SAAASAQGRAPKEGPAFSPWKNCLPRSQSLVLEKVGDRCPKPWLVAVCRLGDCCSKAWEW AFSQVSNLGLEVGPGSTASEGFGEESGHHRRQCDCLEILATGKRECSSHNKGTSGFVSKS MFMVRINQKNIKPFRRCAAAQRTPQTSQRAGKTLPDVSAQLCAEAECWLNWSPHPWFSCY DSCEMEPHSAQAGVQVFTPQKDMQRKVLEGRECYPNLFATGIKPCEVDKADAFWESSALG LEHSKIIWVEGSRARVRSPVYPGES >gi568815594r:158609739_158823288|GENSCAN_predicted_CDS_4|1338_bp atgattgtgatgtctgaaaaagttcatgattgtgattctgcatctggtaagggccacagg ctgcttgcactcatagaagaatgtgaaggggagctgctgtgtgcagagatcaagagcgaa agaggaggggaatgtagttttaaagagggtggttgggaaaggacaaaacctgaaggagct gaggaaggttcagggaagaacagcccaggaagaacaccaaatactaaggcccaagttgag aacagtctgcgtacaagaacagccagctggtatattttccaagataggcccaccgaagcg gccaccgcgcgtaccaagccttcccccctagcatacgccttctgtgaacgggctcttgta cctgaacaaagcgaggacccaagttcaggggctttgtctctaaagaacaaagatcggcct agctgcatgctcgctgtgcgaaagaacccaggctcccgccacggccggagctgcggcggc ggcatcatggccccgaccctgctccagaagctcttcaacaaaaggggcagcagcggcagc tccgcggcggcgtctgcccagggcagggctcctaaggaaggacccgcctttagtccatgg aaaaattgtcttccacgaagccagtccctggtcctggaaaaggttggggaccgctgccct aaaccgtggttggtggcagtgtgtagactcggtgactgttgttccaaggcatgggaatgg gctttttctcaggtgtcaaatcttggattggaggtgggtcctggcagcactgcatccgaa ggttttggagaggagtctgggcatcacaggagacaatgtgattgtttagagattttggcc acggggaaaagagaatgttcctcacataacaaaggaacttcaggatttgtctccaagtcc atgtttatggtcaggataaaccagaagaacattaaaccgttcaggagatgtgcggcggca cagcgaacaccacaaaccagccagagagctggcaagactctgccagatgtttctgctcag ctctgtgctgaagcggaatgctggctgaactggtccccacatccctggttctcctgttat gactcctgtgagatggagcctcactctgcccaagctggagtgcaggtttttacacctcag aaggacatgcagaggaaggtcctggaaggaagagaatgttatcctaacttgtttgctact ggtatcaagccttgtgaggtagataaggcagatgcattttgggaaagctcagcgctaggg ttggagcattctaaaatcatctgggttgaaggcagcagagctagagttagaagccctgtt taccctggtgaatcctaa