GENSCAN 1.0 Date run: 4-Nov-116 Time: 02:26:40

Sequence gi568815575r:87603346_87804193 : 200848 bp : 36.44% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  10532  10699  168  2  0  101   97   121 0.725  12.44
 1.02 Intr +  11089  11225  137  1  2   88  111   137 0.930  15.29
 1.03 Intr +  14587  14783  197  2  2   55   53   210 0.975  12.41
 1.04 Intr +  18866  19078  213  1  0   58   80    75 0.591   1.69
 1.05 Intr +  22265  22451  187  1  1   59   93   124 0.946   8.34
 1.06 Intr +  28865  29089  225  0  0   47   82   149 0.165   7.23
 1.07 Intr +  30404  30566  163  0  1   79   89   126 0.969   9.91
 1.08 Intr +  32218  32430  213  1  0   73  127   139 0.904  13.21
 1.09 Term +  40532  40919  388  2  1   10   47   236 0.203   5.13
 1.10 PlyA +  40969  40974    6                               1.05

 2.00 Prom +  51494  51533   40                              -4.15
 2.01 Init +  53963  54012   50  1  2   98   68    72 0.779   4.78
 2.02 Intr +  61419  61590  172  0  1   73   51   201 0.976  13.92
 2.03 Term +  65984  66049   66  1  0  105   32    60 0.581  -0.94
 2.04 PlyA +  66686  66691    6                               1.05

 3.00 Prom +  74404  74443   40                              -0.95
 3.01 Init +  78875  78932   58  1  1   36  119    18 0.785   1.22
 3.02 Intr +  79062  79267  206  1  2   83   19   152 0.715   5.80
 3.03 Intr +  81052  81094   43  0  1   15  121    66 0.179  -0.51
 3.04 Intr +  82907  83179  273  0  0   69   54   113 0.250   2.59
 3.05 Term +  90462  90631  170  1  2   73   42    79 0.123  -1.14
 3.06 PlyA +  91870  91875    6                               1.05

 4.02 PlyA -  92672  92667    6                               1.05
 4.01 Sngl - 100885  99998  888  1  0   60   43   976 0.669  86.12
 4.00 Prom - 101752 101713   40                              -6.65

 5.00 Prom + 104698 104737   40                              -5.35
 5.01 Sngl + 105075 105515  441  2  0   70   55   140 0.652   4.90
 5.02 PlyA + 105531 105536    6                               1.05

 6.00 Prom + 106716 106755   40                              -1.65
 6.01 Init + 119438 119474   37  1  1   92   77    29 0.490   2.45
 6.02 Intr + 120502 120591   90  2  0   99  101    32 0.407   4.65
 6.03 Term + 127804 127913  110  2  2  101   38    41 0.083  -1.91
 6.04 PlyA + 128308 128313    6                               1.05

 7.05 PlyA - 128474 128469    6                               1.05
 7.04 Term - 147859 147784   76  0  1  128   53    50 0.689   1.83
 7.03 Intr - 150165 149926  240  2  0   49   99    84 0.475   1.34
 7.02 Intr - 151428 151339   90  2  0   98   67   126 0.995   9.69
 7.01 Init - 153821 153772   50  2  2   62   76    57 0.988   2.37
 7.00 Prom - 155293 155254   40                              -1.95

 8.02 PlyA - 155473 155468    6                               1.05
 8.01 Sngl - 161089 160763  327  1  0   57   48   171 0.607   5.78
 8.00 Prom - 161798 161759   40                              -5.05

 9.00 Prom + 166890 166929   40                              -4.85
 9.01 Init + 167359 167368   10  0  1   90   68     1 0.140  -0.85
 9.02 Intr + 175602 176085  484  1  1   58   70   265 0.617  12.85
 9.03 Intr + 178499 178643  145  2  1   44   41    63 0.539  -3.44
 9.04 Term + 178886 179590  705  1  0   31   43   254 0.591   7.72
 9.05 PlyA + 179976 179981    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  28924  29089  166  0  1  101   82   124 0.833  12.94

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815575r:87603346_87804193|GENSCAN_predicted_peptide_1|630_aa
XLDTQHSEDMNATRSEEQFHVINHAEQTLRKMENYLKEKQLCDVLLIAGHLRIPAHRLVL
SAVSDYFAAMFTNDVLEAKQEEVRMEGVDPNALNSLVQYAYTGVLQLKEDTIESLLAAAC
LLQLTQVIDVCSNFLIKQLHPSNCLGIRSFGDAQGCTELLNVAHKYTMEHFIEVIKNQEF
LLLPANEISKLLCSDDINVPDEETIFHALMQWVGHDVQNRQGELGMLLSYIRLPLLPPQL
LADLETSSMFTGDLECQKLLMEAMKYHLLPERRSMMQSPRTKPRKSTVGALYAVGGMDAM
KGTTTIEKYDLRTNSWLHIGTMNGRRLQFGVAVIDNKLYVVGGRDGLKTLNTVECFNPVG
KIWTVMPPMSTHRHGLGVATLEGPMYAVGGHDGWSYLNTVERWDPEGRQWNYVASMSTPR
STVGVVALNNKLYAIGGRDGSSCLKSMEYFDPHTNKWSLCAPMSKRRGGVGVATYNGFLY
VVGGHDAPASNHCSRLSDCVERISIQGTYLNVIKTIYDKPTANTILNVEKLKTFPLRTGT
RQGCPLSALLFNMVLEVIARAIRQEKGIKGIQISKKEVKLSLFADDMIVHIENPKDSSRK
LLELIKEFSKVSGYKINVHKPVALLYTNSN

>gi568815575r:87603346_87804193|GENSCAN_predicted_CDS_1|1893_bp
nnattagatacacaacactctgaagacatgaatgccaccagatctgaagagcagttccat
gttataaaccacgcagagcaaactcttcgtaaaatggagaactacttgaaagagaaacaa
ctatgtgatgtgctactgattgcaggacacctccgcatcccagcccataggttggttctc
agcgcagtgtctgattattttgctgcaatgtttactaatgatgtgcttgaagccaaacaa
gaagaggtcaggatggaaggagtagatccaaatgcactaaattccttggtgcagtatgct
tacacaggagtcctgcaattgaaagaagataccattgaaagtttgctggctgcagcttgt
cttctgcagctgactcaggtcattgatgtttgctccaattttctcataaagcagctccat
ccttcaaactgcttagggattcgatcatttggagatgcccaaggctgtacagaacttctg
aacgtggcacacaaatacactatggaacacttcattgaggtaataaaaaaccaagaattc
ctcctgcttccagctaatgaaatttcaaaacttctgtgcagtgatgacattaatgtgcct
gatgaagagaccatttttcatgctctaatgcagtgggtggggcatgatgtgcagaatagg
caaggagaactggggatgctgctttcttacatcagactgccattactcccaccacagtta
ctggcagatcttgaaaccagttccatgtttactggtgatcttgagtgtcagaagctcctg
atggaagctatgaagtatcatcttttgcctgagagaagatccatgatgcaaagccctcgg
acaaagcctagaaaatcaactgtgggggcactttatgctgtaggaggcatggatgctatg
aaaggtactactactattgaaaaatatgacctcaggaccaacagttggctacatattggc
accatgaatggccgtaggcttcaatttggagtcgcagttattgataataagctctatgtc
gtgggaggaagagacggtttaaaaactttgaatacagtggaatgttttaatccagttggc
aaaatctggactgtgatgcctcccatgtcaacacatcggcacggcttaggtgtagccact
cttgaaggaccaatgtatgctgtaggtggtcatgatggatggagctatctaaatactgta
gaaagatgggaccctgagggacgacagtggaattacgtagccagtatgtcaactcctaga
agcacagttggtgttgttgcattaaacaacaaattatatgctattggtggacgtgatgga
agttcctgcctcaaatcaatggaatactttgacccacacactaacaagtggagtttgtgt
gctccaatgtccaaaagacgtggaggtgtgggagttgccacatacaatggattcttatat
gttgtaggggggcatgatgcccctgcttccaaccattgctccaggctttctgactgtgtg
gaacgaatcagcatacaagggacataccttaatgtaataaaaaccatctatgacaaaccc
acagccaacacaatactgaatgtggaaaagttgaaaacattccctctgagaactggaaca
agacaaggctgcccactgtctgcactcctcttcaacatggtactggaagtcatagccaga
gcaatcagacaagagaaaggaataaagggcatccaaatcagtaaaaaggaagtcaaactc
tcactgtttgctgatgatatgatcgttcacattgaaaaccctaaagactcctcaagaaag
ctcctagaactgataaaagaattcagcaaagtttctggatataagattaatgtacacaaa
ccagtagctcttctatacaccaacagcaactaa

>gi568815575r:87603346_87804193|GENSCAN_predicted_peptide_2|95_aa
MKRVKLGRAGPAGATLRYDPKGDSWSTVAPLSVPRDAVAVCPLGDKLYVVGGYDGHTYLN
TVESYDAQRNEWKESMQELLQNFYTTQKLKETLGH

>gi568815575r:87603346_87804193|GENSCAN_predicted_CDS_2|288_bp
atgaagagggtgaagttgggcagagctggaccagctggggccaccctcaggtatgatcca
aaaggtgattcatggtcaactgtggcacctctgagtgttcctcgagatgctgttgctgtg
tgccctcttggagacaaactctacgtggttggaggatatgacggacatacttatttgaac
acagttgagtcatatgatgcacagagaaatgaatggaaagagagcatgcaagaacttcta
caaaacttctataccacacagaagctgaaagagactctggggcactaa

>gi568815575r:87603346_87804193|GENSCAN_predicted_peptide_3|249_aa
MTSGPQTDQPKKHLINFKSVTKETRFIHGPGTLAPVTDWEGSLPLVFNHCRDASLIIHPC
FKGVRPHRDACLGPSPLAASPAFLGKGQTITDSEFPVTLTVEDIIPQFSLPTSIQSDNRR
AFISQISQAVFQALSIQGNLYIPYGPPSSRKVEWTKGLLKTHLTKLSHQLKRTGQYFYHF
PFSEFRPVLGMLQGKSLMYDGLTYDFSTVESDTHSVLSLIYEGFLLDRTIINQGASVYHY
HKIINTHLQ

>gi568815575r:87603346_87804193|GENSCAN_predicted_CDS_3|750_bp
atgacctcaggtcctcagaccgaccagcccaagaaacatctcatcaatttcaaatccgtg
acaaaggagacacgttttatccatggacccggaactctggcgccggtcacggactgggaa
ggcagccttcccttggtgtttaatcattgcagggacgcctctctgattatacacccatgt
ttcaagggtgtcagaccacacagggatgcctgccttggtccttcacccttagcggcaagt
cccgcttttctggggaaggggcaaaccatcacagactccgagtttccggtaactctcaca
gtggaagacataattcctcagtttagccttcccacctcaatacagtctgataacagacga
gcctttattagtcaaatcagccaagcagtttttcaggctcttagtattcagggaaacctt
tatatcccttacggtcctccatcttcaagaaaagtagaatggactaaaggtctcttaaaa
acacacctcaccaagctcagccaccaacttaaaaggactggacaatacttttaccacttt
cccttctcagaattcaggcctgtcctcggaatgctacaggggaaatcgctgatgtatgat
ggcttgacttatgatttttcaactgtggaaagtgatacacattcagtactctccttaatt
tatgaggggtttcttctggatagaaccatcataaatcaaggagcatctgtatatcattat
cataagataataaatacacatttgcagtag

>gi568815575r:87603346_87804193|GENSCAN_predicted_peptide_4|295_aa
MSGALDVLQMKEEDVLKFLAAGTHLGGTNLDFQMEQYIYKRKSDGIYIINLKRTWEKLLL
TARAIVAIENPADVSVISSRNTGQRAVLKFAAATGATPIAGRFTPGTFTNQIQAAFREPR
LLVVSDPRADHQPLTEASYVNLPTIALCNTDSPLHYVDIAIPCNNKGTHSVGLMWWMLAR
EVLRMRGTISREHPWEVMPDLYFYRDPEEIEKEEQAAAEKAMTREELQGEWTAPAPEFTA
TQPEVADWSEGVQVPSVPIQQFPTEDWSTQRATEDWSAAPTAQATEWVGATTDWS

>gi568815575r:87603346_87804193|GENSCAN_predicted_CDS_4|888_bp
atgtccggagcccttgatgtcctgcaaatgaaggaggaggatgtccttaagttccttgca
gcaggaacccacttaggtggcaccaatcttgacttccagatggaacagtacatctataaa
aggaaaagtgatggcatctatatcataaatctgaagaggacctgggagaagcttctgctg
acagctcgtgctattgttgccattgaaaaccctgctgatgtcagtgttatatcctccagg
aatactggccagagggctgtgctgaagtttgctgctgccactggagccactccaattgct
ggccgcttcactcctggaaccttcactaaccagatccaggcagccttccgggagccacgg
cttcttgtggttagtgaccccagggctgaccaccagcctctcacggaggcatcttatgtt
aacctacctaccattgctctgtgtaacacagattctcctctgcactatgtggacattgcc
atcccatgcaacaacaagggaactcactcagtgggtttgatgtggtggatgctggctcgg
gaagttctgcgcatgcgtggcaccatttcccgtgaacacccatgggaggtcatgcctgat
ctctacttctacagagatcctgaagagattgaaaaagaagagcaggctgctgctgaaaag
gcaatgaccagggaggaacttcagggtgaatggactgctccagctcctgagttcactgct
actcagcctgaggttgcagactggtctgaaggtgtacaggtgccctctgtgcctattcag
cagttccctactgaagactggagcactcagcgtgccacggaagactggtctgcagctccc
actgctcaggccactgaatgggtaggagcaaccactgactggtcttaa

>gi568815575r:87603346_87804193|GENSCAN_predicted_peptide_5|146_aa
MGLQTLGKYTHSKWEELAKVKGIQAPGNSKLQQHSQILNSNMISIDFMSHIEVTLMQDVS
SHSLGQLHPSGFAGYSPHPGCFHGLVLSVCGFSRHTVQTVGGSTILGSGGSWPSSDSSTR
MCPSGDFVWGLQPHISLLHHPSGDSP

>gi568815575r:87603346_87804193|GENSCAN_predicted_CDS_5|441_bp
atggggttacagacactgggtaaatacactcattccaaatgggaggaattggccaaagtg
aaggggatacaggccccaggcaattccaaactccagcagcacagtcaaatcttaaactct
aacatgatctccattgacttcatgtctcacattgaggtcacactgatgcaagacgtgagc
tcccacagccttgggcagctccacccctctggctttgcagggtacagcccccatcctggc
tgctttcatgggcttgtgttgagtgtctgtggcttttccaggcacacagtgcaaactgtc
ggtggatctaccattctggggtctggaggaagttggccctcatctgacagctctaccagg
atgtgccccagtggagacttcgtgtgggggcttcaaccccacatttcccttctgcaccac
cctagtggagattctccctga

>gi568815575r:87603346_87804193|GENSCAN_predicted_peptide_6|78_aa
MGSSVARTLVVLGIAAGLFVVQLKIGFFDPGPQKFRLADILNEYKHTQLLTDNYRVKKVQ
KSLLGERMGYFLKNTPKF

>gi568815575r:87603346_87804193|GENSCAN_predicted_CDS_6|237_bp
atggggtcttctgtagctaggactctggtggtccttggtattgctgcaggactttttgta
gttcagctaaaaatagggttctttgacccagggccacaaaaattcaggcttgcagacatt
ttgaatgaatacaagcacacgcaactattaactgacaactacagagtaaagaaagtgcag
aaatcattgttaggtgaacgtatgggttattttcttaaaaacactccaaaattttaa

>gi568815575r:87603346_87804193|GENSCAN_predicted_peptide_7|151_aa
MRTIDTRASLNRERSRIPLESEAPISHYICEPEEDLKYLFKRQPKDRTRNNGLGTGARIL
ATSLSYYQRLSENLSDSLEDITQSTVIIQNKMESLAVVTLQNRRGLDLLLKKVAYVFFLE
VCCFYVRRNLRIVHLLSIRRAKPAAKSHIGC
>gi568815575r:87603346_87804193|GENSCAN_predicted_CDS_7|456_bp
atgcgaacaatagacaccagggcctccttgaacagggaacgaagcaggataccattggaa
tctgaagctcccatcagccactacatctgtgaacctgaagaagacctgaagtacctgttt
aaaagacagccaaaagataggactaggaataacggactaggaacaggagctcgcatcctt
gcaacttccctatcctattaccaacgcttgtccgagaatctttcagacagcttggaagac
attacccaaagtactgtcatcatacaaaataaaatggagtccttggcagtagtcacttta
caaaatagaaggggactggatctccttctgaaaaaggtggcttatgtctttttcctagaa
gtatgctgtttttatgtcaggagaaatctcaggattgtccaccttctttcaatccgcaga
gccaagccagctgctaagagccatataggctgctga

>gi568815575r:87603346_87804193|GENSCAN_predicted_peptide_8|108_aa
MLPCVPATPPPALAKKGQGAAQAMASEGASPKPWQSLSGVGPVGTQKARVEIGNLDFTGC
METPGCPHRSLLWRWSPYGKPLLGQRGGHMWVWSPHTESPLGHCLVEL

>gi568815575r:87603346_87804193|GENSCAN_predicted_CDS_8|327_bp
atgttgccctgtgtcccagccactccacctccagctctggctaaaaagggccaaggtgca
gctcaggccatggcttcagagggtgcaagccccaagccttggcagtctctaagtggtgtt
gggcctgtgggtacacagaaggcaagagttgagattgggaacctagatttcacagggtgt
atggaaacacctggatgtccacacagaagtctgctgtggaggtggagcccttatggaaaa
cctctactaggacaacgtggagggcacatgtgggtttggagccctcatacagagtcccca
ctggggcactgcctagtagagctgtga

>gi568815575r:87603346_87804193|GENSCAN_predicted_peptide_9|447_aa
MGSWTEHLGEGGGCGHSFRRLKRSFLPSLKRAADLPAQHSSSAKGQTASSTGSLTPMSPD
WEMSPGRSRETPHIGDLWLASGGCISATKLPEEEIGSNLCCSAASTGDTQANRVLSGPPE
NSSRPAAEAPVRRKANKQKGIASTSTKRTSTQKPHLKIINFKDQRLNQEEVKSLNIPITS
SEIEAVINSLPTKKSPGSDGFTAEFYQRFKEELEASLVQHTQISKHNTSHKRTNDKNHTI
ISIDAEKAFNKIQHRFMLKTLNKLGIDGTYLKIIRAIYDKLTANIILNGQKLEAFPLKTG
TTQGFPLSPLLFNIILEFLARAISQEKEIKHIQIGKDEVKLSLFADDMIVYLEIPIVSAQ
NLLKLISNFSKVSGYKINVQKSQAFLYINNRQTEGQIMSELPFTIATKRIKYLGIQLTRD
VKDLFKENYKPMLKEIREDTNKWKAIT

>gi568815575r:87603346_87804193|GENSCAN_predicted_CDS_9|1344_bp
atggggagctggacggagcacctgggggaagggggcggctgtgggcacagcttcagaaga
cttaaacgttccttcctgccatctctgaagagagcagcggatctgccagcacagcactcg
agctctgctaagggacagactgcctcctctactgggtccctgaccccaatgtctcctgac
tgggagatgtctcctggcagaagtcgagagacacctcatataggggacctctggctggca
tctggtgggtgcatctctgcgacaaagcttccagaggaagaaataggcagcaatctttgc
tgttctgcagcctccactggtgatacccaggcaaacagggtcttgagtggacctccagaa
aactccagcagacctgcagcagaggcgcctgttagaaggaaagctaacaaacagaaagga
atagcatcaacatccacaaaaaggacatccacacaaaaaccccatctgaagatcatcaac
ttcaaagaccaaagactaaaccaggaagaagtcaaatccctgaatataccaataacaagt
tctgaaattgaggcagtaattaatagcctaccaacaaaaaaaagcccaggatcagatgga
ttcacagctgaattctaccagaggttcaaagaggagctggaagcaagcctggttcaacat
acacaaatcagtaaacataatacatcacataaaagaaccaatgacaaaaaccacacgatt
atctcaatagatgcagaaaaggccttcaataaaattcaacaccgcttcatgctaaaaact
ctcaataaactaggtattgatggaacgtatctcaaaataataagagctatttatgacaaa
ctcacagccaatatcatactgaatgggcaaaagctggaagcattccctttgaaaaccggc
acaacacaaggattccctctctcaccactcctattcaacataatattggaatttctggcc
agggcaatcagccaagagaaagaaataaagcatattcaaataggaaaagacgaagtcaaa
ttgtctctgtttgcagatgacatgattgtatatttagaaattcccattgtctcagcccaa
aatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtgcaa
aaatcacaagcattcctatacatcaataatagacaaacagagggccaaattatgagtgaa
ctcccattcacaattgctacgaagagaataaaatacctaggaatacaacttacaagggat
gtgaaggacctcttcaaggagaactacaaaccaatgcttaaggaaataagagaggacaca
aacaaatggaaagccattacatga