GENSCAN 1.0 Date run: 4-Nov-116 Time: 15:42:19 Sequence gi568815575f:41627027_41828034 : 201008 bp : 39.00% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 PlyA - 382 377 6 1.05 1.01 Sngl - 17185 16256 930 1 0 43 39 412 0.855 27.98 1.00 Prom - 20417 20378 40 -7.75 2.00 Prom + 20575 20614 40 -3.05 2.01 Init + 28415 28779 365 1 2 39 116 314 0.585 25.87 2.02 Term + 29836 29890 55 1 1 97 41 34 0.598 -4.35 2.03 PlyA + 30150 30155 6 1.05 3.04 PlyA - 33093 33088 6 1.05 3.03 Term - 33535 33359 177 1 0 110 48 140 0.867 8.90 3.02 Intr - 38426 38251 176 0 2 36 90 114 0.962 5.14 3.01 Init - 40095 39972 124 0 1 77 82 77 0.985 6.38 3.00 Prom - 46183 46144 40 -6.15 4.02 PlyA - 46849 46844 6 1.05 4.01 Sngl - 49468 48731 738 1 0 82 41 695 0.879 60.16 4.00 Prom - 65239 65200 40 -5.75 5.00 Prom + 65303 65342 40 -6.15 5.01 Sngl + 68629 69753 1125 0 0 75 51 141 0.527 5.50 5.02 PlyA + 70226 70231 6 1.05 6.00 Prom + 72568 72607 40 -4.15 6.01 Init + 80272 80335 64 0 1 65 94 43 0.132 3.99 6.02 Intr + 84852 84962 111 1 0 54 87 41 0.010 0.03 6.03 Intr + 87913 88075 163 2 1 83 116 138 0.069 14.21 6.04 Intr + 92271 92307 37 0 1 60 67 54 0.757 -2.15 6.05 Term + 92564 92812 249 1 0 74 49 245 0.926 13.92 6.06 PlyA + 92968 92973 6 1.05 7.08 PlyA - 93003 92998 6 1.05 7.07 Term - 95344 95195 150 1 0 38 47 134 0.008 1.23 7.06 Intr - 112430 112358 73 1 1 82 115 54 0.881 5.99 7.05 Intr - 118575 118498 78 2 0 69 91 108 0.169 6.95 7.04 Intr - 121712 121602 111 1 0 118 78 9 0.038 1.48 7.03 Intr - 160257 160152 106 1 1 96 68 75 0.493 4.65 7.02 Intr - 162405 162304 102 1 0 103 81 9 0.512 0.93 7.01 Init - 167913 167907 7 0 1 77 79 0 0.591 -0.99 7.00 Prom - 169454 169415 40 -2.95 8.00 Prom + 172928 172967 40 -4.25 8.01 Init + 182876 183312 437 1 2 58 77 386 0.341 30.18 8.02 Intr + 184376 184660 285 2 0 -4 63 165 0.140 0.33 8.03 Term + 185241 186318 1078 0 1 3 37 448 0.289 21.79 8.04 PlyA + 186481 186486 6 1.05 9.02 PlyA - 187278 187273 6 1.05 9.01 Term - 197490 197294 197 1 2 44 33 158 0.202 2.49 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575f:41627027_41828034|GENSCAN_predicted_peptide_1|309_aa MNINAKILNKILVNRIQQHIKKLIHHDQVSFIPGMQGWFNICKSINIIHHINRTNNKNHM ITSIDAEKAFDKIQQRFMLKTLNKLGIDGTYLKIVRAIYDKPTANIILNGQKLEAFPLKT GTRQECPLSPLLFNIVLEVLARAIRQEKEIKGIRLGKEEVKLSLFADDMIVYLENPIISA QNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQIESQIMSELPFTIASKRIKYLGIQLTK DVKDLFKESYKPLLNEINEDTNKWKNIPCLWIGRINIMKMAILPKVIYRFNAIPIKLPMT FFTELEKLL >gi568815575f:41627027_41828034|GENSCAN_predicted_CDS_1|930_bp atgaacatcaatgcaaaaatcctcaataaaatactggtaaaccgaatacagcagcacatc aaaaagcttatccaccacgatcaagtcagcttcatccctgggatgcaaggctggttcaac atatgcaaatcaataaacataatccatcatataaacagaaccaacaacaaaaaccacatg attacctcaatagatgcagaaaaggcctttgacaaaattcaacagcgcttcatgctaaaa actctcaataaactaggtattgatggaacttatctcaaaatagtaagagctatttatgac aaacccacagccaatatcatactgaatgggcaaaaactggaagcattccctttgaaaact ggcacaagacaggaatgccctctctcaccactcctattcaacatagtgttggaagttctg gccagggcaatcaggcaagagaaagaaataaaaggtattcgattaggaaaagaggaagtc aaattgtccctgtttgcagatgacatgattgtatatttagaaaaccccatcatctcagcc caaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtg caaaaatcacaagcattcctatacaccaataacagacaaatagagagccaaatcatgagt gaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaacttacaaag gatgtgaaggacctcttcaaggagagctacaaaccactgctcaatgaaataaatgaggac acaaacaaatggaagaatattccatgcttatggataggaagaatcaatatcatgaaaatg gccatactgcccaaagtaatttatagattcaatgccatccccatcaagctaccaatgact ttcttcacagaattggaaaaactactttaa >gi568815575f:41627027_41828034|GENSCAN_predicted_peptide_2|139_aa MTEGHKMILTPEIPMLSHMMSEKHSNGDGSAQNSSIIKWKWFMQEHPLREVQGGNILKQG ASFPLGLTLELCEELWDSTDTWTGPREQLSTDQRRAAWCVDDSSKVNRQHPVWKDATLDQ RSGSRTRAITTSVRTRDDS >gi568815575f:41627027_41828034|GENSCAN_predicted_CDS_2|420_bp atgactgaaggacataaaatgatcttgacacctgaaatacccatgctttctcacatgatg tcagagaaacattcaaatggggatggcagcgcccagaatagttccataataaaatggaaa tggtttatgcaggaacatcctctccgggaagtgcaaggaggaaacatcctcaagcaggga gcctcttttcccctaggactgactctggaactgtgtgaggaactgtgggattctactgac acttggacagggccccgtgaacagctctcaactgaccaacgaagagctgcttggtgtgtg gatgacagttccaaggtgaatcgacagcatcctgtttggaaggatgccactcttgatcaa agaagtggttctaggaccagagccataactaccagtgttagaaccagggatgattcctaa >gi568815575f:41627027_41828034|GENSCAN_predicted_peptide_3|158_aa MAGTGHGRSDYSQVKAPILSGLAQEVVGVKLSFKASSKLSAGRVGTPHFMAPEVVKREPY GKPVDVWGCGVILFILLSGCLPFYGTKERLFEGIIKGKYKMNPRQWSHISESAKDLVRRM LMLDPAERITVYEALNHPWLKVHEHSQVTFYTLSEHFQ >gi568815575f:41627027_41828034|GENSCAN_predicted_CDS_3|477_bp atggctgggactggccatggtcgctctgattattcccaagttaaggcccccattttgagt ggcttggcccaggaagtggtaggagtcaagttgagtttcaaagcatcatccaaactgtca gcaggacgtgttggaacacctcattttatggcaccagaagtggtcaaaagagagccttac ggaaagcctgtagacgtctgggggtgcggtgtgatcctttttatcctgctcagtggttgt ttgcctttttacggaaccaaggaaagattgtttgaaggcattattaaaggaaaatataag atgaatccaaggcagtggagccatatctctgaaagtgccaaagacctagtacgtcgcatg ctgatgctggatccagctgaaaggatcactgtttatgaagcactgaatcacccatggctt aaggtacatgagcattcacaggtcaccttctacactttgtctgaacacttccagtga >gi568815575f:41627027_41828034|GENSCAN_predicted_peptide_4|245_aa MDKNELVQKAKLAEQAERYDDMAACMKSVTEQGAELSNEERNLLSVAYKNVLGARRSSWR VVSSIEQKTEGAEKKQQMAREYREKVDTELRDICNDVLFLLEKFLIPSASQAESKVFSLK MKGDYYHYLAEVATGDDKKGIVDQSQQAYQEAFEISKKEMQPTHPIRPGLALNFSVFYYE ILNSPEKACSLAKTAFDEAIAELDTLIEESYKDSMLIMQLLRDNLTLWTSDTQGDEAEAG EGGEN >gi568815575f:41627027_41828034|GENSCAN_predicted_CDS_4|738_bp atggataaaaatgagctggttcagaaggccaaactggccgagcaggctgagcgatatgat gacatggcagcctgcatgaagtctgtaactgagcaaggagctgaattatccaatgaggag aggaatcttctctcagttgcttataaaaatgttttaggagcccgtaggtcatcttggagg gtcgtctcaagtattgaacaaaagacggaaggtgctgagaaaaaacagcagatggctcga gaatacagagagaaagttgatacggagctaagagatatctgcaatgatgtactgtttctt ttggaaaagttcttgatccccagtgcttcacaagcagagagcaaagtcttctctttgaaa atgaaaggagattactaccattacttggcagaggttgccactggtgatgacaagaaaggg attgtggatcagtcacaacaagcataccaagaagcttttgaaatcagcaaaaaggaaatg caaccaacacatcctatcagaccaggtctggcccttaacttctctgtgttctattatgag attctgaactccccagagaaagcctgctctcttgcaaagacagcttttgatgaagccatt gctgaacttgatacattaattgaagagtcatacaaagacagcatgctaataatgcaatta ctgagagacaacttgacattgtggacatcagatacccaaggagacgaagctgaagcagga gaaggaggggaaaattaa >gi568815575f:41627027_41828034|GENSCAN_predicted_peptide_5|374_aa MTTTSVSSWPYSSHRMRFITNHSDQPPQNFSATPNVTTCPMDEKLLSTVLTTSYSVIFIV GLVGNIIALYVFLGIHRKRNSIQIYLLNVAIADLLLIFCLPFRIMYHINQNKWTLGVILC KVVGTLFYMNMYISIILLGFISLDRYIKINRSIQQRKAITTKQSIYVCCIVWMLALGGFL TMIILTLKKGGHNSTMCFHYRDKHNAKGEAIFNFILVVMFWLIFLLIILSYIKIGKNLLR ISKRRSKFPNSGKYATTARNSFIVLIIFTICFVPYHAFRFIYISSQLNVSSCYWKEIVHK TNEIMLVLSSFNSCLDPVMYFLMSSNIRKIMCQLLFRRFQGEPSRSESTSEFKPGYSLHD TSVAVKIQSSSKST >gi568815575f:41627027_41828034|GENSCAN_predicted_CDS_5|1125_bp atgacgacaacttcagtcagcagctggccttactcctcccacagaatgcgctttataacc aatcatagcgaccaaccgccacaaaacttctcagcaacaccaaatgttactacctgtccc atggatgaaaaattgctatctactgtgttaaccacatcctactctgttattttcatcgtg ggactggttgggaacataatcgccctctatgtatttctgggtattcaccgtaaaagaaat tccattcaaatttatctacttaacgtagccattgcagacctcctactcatcttctgcctc cctttccgaataatgtatcatattaaccaaaacaagtggacactaggtgtgattctgtgc aaggttgtgggaacactgttttatatgaacatgtacattagcattattttgcttggattc atcagtttggatcgctatataaaaattaatcggtctatacagcaacggaaggcaataaca accaaacaaagtatttatgtctgttgtatagtatggatgcttgctcttggtggattccta actatgattattttaacacttaagaaaggagggcataattccacaatgtgtttccattac agagataagcataacgcaaaaggagaagccatttttaacttcattcttgtggtaatgttc tggctaattttcttactaataatcctttcatatattaagattgggaagaatctattgagg atttctaaaaggaggtcaaaatttcctaattctggtaaatatgccactacagctcgtaac tcctttattgtacttatcatttttactatatgttttgttccctatcatgcctttcgattc atctacatttcttcacagctaaatgtatcatcttgctactggaaagaaattgttcacaaa accaatgagatcatgctggttctctcatctttcaatagttgcttagatccagtcatgtat ttcctgatgtccagtaacattcgcaaaataatgtgccaacttctttttagacgatttcaa ggtgaaccaagtaggagtgaaagcacttcagaatttaaaccaggatactccctgcatgat acatctgtggcagtgaaaatacagtctagttctaaaagtacttga >gi568815575f:41627027_41828034|GENSCAN_predicted_peptide_6|207_aa MKDHMERGAQPAQLFQVMLQTPVLLLNKILPYPSSPFKLSAYPHSSWIRTGAWELLNAGL TLELCEELRDSTETWTGPRKQLSTDQQRVAWCVNDSSKVNNLVWKDATLDQRSLEQQLCC NRLLKKGYTRQQFCHKSTLNKGDRVIYKLTCPPYCCVRFPLAGTGPRILYLSRLASNLEL LKKGKGRGEQKKEEVTCGMLRKVKTCK >gi568815575f:41627027_41828034|GENSCAN_predicted_CDS_6|624_bp atgaaagatcacatggagagaggggcacagcctgctcaactcttccaggtgatgctccag acacctgttctgctgctcaataaaattcttccctacccatcctcacccttcaaattgtca gcgtatcctcattcttcgtggataaggacaggagcttgggaactgctgaacgcaggactg actctggaactgtgtgaggaactgcgggattctactgagacttggacagggccccgtaaa cagctctcaactgaccaacaaagagttgcttggtgtgtaaatgacagttccaaggtgaac aatcttgtttggaaggatgccactcttgatcaaagaagtttggagcagcagctgtgctgc aatcggttactgaagaaagggtacactcgccagcagttttgccacaagagtacactgaac aaaggagacagggtcatttataagctgacgtgtccaccctactgctgtgtccggtttcca ttggctggaacgggacctcgcattctgtatctgtcccgattggctagcaacttagaactt cttaaaaaaggcaaaggcagaggagaacaaaagaaggaggaagtaacttgtggaatgctg agaaaagtaaaaacctgcaaataa >gi568815575f:41627027_41828034|GENSCAN_predicted_peptide_7|208_aa MGVNTEPPALPCWAPNIQLLIRILTLPSPTDLVDSADLKREASICHMLKHPHIVELLETY SSDGMLYMVFELLRLFPHPFWVPPESPLALRTSPKFFLETPESIEAFYFMDGADLCFEIV KRADAGFVYSEAVASHYMRQILEALRYCHDNNIIHRDVKPLKSTVSLLQYSIGHTEPALH QCGTGKYQEVMITGSHLGNWLSQMSINR >gi568815575f:41627027_41828034|GENSCAN_predicted_CDS_7|627_bp atgggtgtaaacactgagcccccagccctgccatgctgggctccaaacatccagcttctc attagaatcttaacactgccctctccaactgatttggtagactctgcagatctaaagcgg gaagccagtatctgtcatatgctgaaacatccacacattgtagagttattggagacatat agctcagatggaatgctttacatggttttcgaattgctccgtcttttccctcatcccttc tgggtccctccagaaagtcctctagctctcagaacatctcccaaattctttttggaaaca ccagagtccattgaggcattttattttatggatggagcagatctgtgttttgaaatcgta aagcgagctgacgctggttttgtgtacagtgaagctgtagccagccattatatgagacag atactggaagctctacgctactgccatgataataacataattcacagggatgtgaagcct ctgaagtcaacagtgtcacttctgcagtattctattggtcacacagagccagccctgcat cagtgtgggacgggtaaatatcaggaggtgatgatcactgggagccatcttggaaactgg ctatcacagatgtcaatcaaccgttga >gi568815575f:41627027_41828034|GENSCAN_predicted_peptide_8|599_aa MELKIMARELRDKCTSFSSRFDQLEERVSVTEDQMNEMKREEKFREKRVKRNEQSLQEIW DYVKRPNLRLIGVPESDGESGTKLENTLQDIIQENFPNLARQANIQIQEIQRPPQRYSSR RATPRYITVRFTKVEGKNAKGSQRESAIKLEFRIKKLTQNRSTTRKLNNLLLNDYWVHNE MKAEIKMFFETNENKDTTYQNLWDTFKVVCRGKFIALNAHKRKQERSKIDTLISQLKELE KLAEAQQKKGNSRPISLMNIDAKILNKMLANRIQQHIKKLIHHDQVGFIPGMQGWFNISK SINVIHHINRTKDKNHMIISIDAEKAFDKIQQPFMLKTLNKLGIDGTYLKIIRAIYDKPT ANIILNGQKLEAFPVKTGTRQGCPLSPHLFNIVLEVLARVIRQEKEIKGNQLGKEEVKLS LFADDMIVCLENPTVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELP FTITSKRIKYLGIQLTKDVKDLFKENYKPLLKEIKEDTNKWKNIPCSWIGRINIVKMAIL PKVIYRFNGIPIKLPMTFFTKLEKTTLKFIWNQKRARIAKSNLSKKNKAGGITLPDFKL >gi568815575f:41627027_41828034|GENSCAN_predicted_CDS_8|1800_bp atggagctgaaaatcatggcacgagaactacgtgacaaatgcacaagcttcagtagccga tttgatcaactggaagaaagggtatcagtgactgaagatcaaatgaatgaaatgaagcga gaagagaagtttagagaaaaaagagtaaaaagaaacgaacaaagccttcaagaaatatgg gactatgtgaaaagaccaaatctacgtctgattggtgtacctgaaagtgatggggagagt ggaaccaagttggaaaacactctgcaggatattatccaggagaacttccccaacctagca aggcaggccaacattcaaattcaggaaatacagagaccaccacaaagatactcctcgcga agagcaactccaagatacataactgtcagattcaccaaagttgaaggaaaaaatgctaag ggcagccagagagaaagtgcaatcaaactagaattcaggattaagaaactcactcaaaac cgctcaactacacggaaactgaacaacctgctcctgaatgactactgggtacataacgaa atgaaggcagaaataaagatgttctttgaaaccaatgagaacaaagacacaacataccag aatctctgggacacatttaaagtagtgtgtagagggaaatttatagcactaaatgcccac aagagaaagcaggagagatctaaaattgacaccctaatatcacaattaaaagaactagag aagctggcagaggcacaacaaaaaaaagggaattctagaccaatatccctgatgaacatc gatgcaaaaatcctcaataaaatgctggcaaaccgaatccagcagcacatcaaaaagctt atccaccatgatcaagtgggcttcatccctgggatgcaaggctggttcaacatatccaaa tcaataaatgtaatccatcatataaacagaaccaaagacaaaaaccacatgattatctca atagatgcagaaaaggcctttgacaaaattcaacagcccttcatgctaaaaactctcaat aaattaggtattgatgggacgtatctcaaaataataagagctatctatgacaaacccaca gccaatatcatactgaatgggcaaaaactggaagcattccctgtgaaaactggcacaaga cagggatgccctctctcaccacacctattcaacatagtgttggaagttctggccagggta atcaggcaggagaaagaaataaagggtaaccaattaggaaaagaggaagtcaaattgtcc ctgtttgcagatgacatgattgtatgtttagaaaaccccactgtctcagcccaaaatctc cttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtgcaaaaatca caagcattcttatacaccaataacagacaaacagagagccaaatcatgagtgaactccca ttcacaattacttcaaagagaataaaatacctaggaatccaacttacaaaggatgtgaag gacctcttcaaggagaactacaaaccactgctcaaggaaataaaagaggacacaaacaaa tggaagaacattccatgctcatggataggaagaatcaatatcgtgaaaatggccatactg cccaaggtaatttatagattcaatggcatccccatcaagctaccaatgactttcttcacg aaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcccgtattgccaag tcaaacctaagcaaaaagaacaaagctggaggcatcacactacctgacttcaagctataa >gi568815575f:41627027_41828034|GENSCAN_predicted_peptide_9|65_aa XKLHTLQEPDEKNTGTGTTEPFLLQCPSSALSGQSLMLCQLEKYSQSISSSAEQAMKCGF GADRQ >gi568815575f:41627027_41828034|GENSCAN_predicted_CDS_9|198_bp nataaactgcacacactgcaggagcctgatgagaaaaacactggaactgggacaacagaa ccctttctcctccagtgcccctccagtgcccttagtggacaaagcttaatgttgtgccag ttggaaaaatattcacagtcaatctccagtagtgcagagcaggcaatgaagtgtggattt ggagctgacagacaatag