GENSCAN 1.0 Date run: 4-Nov-116 Time: 08:16:27 Sequence gi568815589r:18956346_19202651 : 246306 bp : 43.09% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 12436 12484 49 0 1 86 89 34 0.142 2.42 1.02 Term + 41724 42052 329 1 2 67 54 189 0.664 8.27 1.03 PlyA + 42532 42537 6 1.05 2.00 Prom + 48049 48088 40 -2.46 2.01 Init + 70703 71009 307 1 1 79 39 465 0.350 38.16 2.02 Intr + 71023 71206 184 2 1 -50 17 197 0.274 -2.35 2.03 Intr + 71211 71469 259 0 1 19 86 472 0.671 37.57 2.04 Intr + 71524 71590 67 0 1 42 37 83 0.992 -2.92 2.05 Term + 71635 71789 155 2 2 22 49 303 0.990 17.88 2.06 PlyA + 71830 71835 6 1.05 3.00 Prom + 81940 81979 40 -2.46 3.01 Init + 84892 85215 324 0 0 74 46 86 0.081 0.33 3.02 Term + 92955 94256 1302 2 0 17 42 1337 0.227 113.60 3.03 PlyA + 94657 94662 6 1.05 4.08 PlyA - 95651 95646 6 1.05 4.07 Term - 102656 101605 1052 0 2 53 36 240 0.494 7.60 4.06 Intr - 103878 103743 136 0 1 120 80 17 0.913 4.24 4.05 Intr - 106848 106663 186 0 0 102 78 47 0.744 4.99 4.04 Intr - 126698 126528 171 2 0 96 115 32 0.957 6.84 4.03 Intr - 130437 130389 49 2 1 61 100 51 0.263 2.28 4.02 Intr - 146326 146179 148 2 1 115 56 163 0.452 15.09 4.01 Init - 146702 146696 7 2 1 80 80 0 0.868 -0.41 4.00 Prom - 149209 149170 40 -4.46 5.10 PlyA - 150296 150291 6 1.05 5.09 Term - 152402 152268 135 2 0 85 36 121 0.916 4.62 5.08 Intr - 160304 160014 291 2 0 42 82 297 0.519 21.83 5.07 Intr - 162110 161976 135 2 0 61 94 110 0.998 9.66 5.06 Intr - 163486 163305 182 2 2 79 105 33 0.972 3.69 5.05 Intr - 164820 164535 286 0 1 92 61 271 0.911 21.71 5.04 Intr - 167302 167220 83 2 2 55 99 29 0.521 0.06 5.03 Intr - 169964 169769 196 2 1 64 87 277 0.994 24.19 5.02 Intr - 171126 171074 53 1 2 27 109 81 0.541 2.73 5.01 Init - 244701 244542 160 0 1 74 32 241 0.290 17.09 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589r:18956346_19202651|GENSCAN_predicted_peptide_1|125_aa MGLPHVGQAGVELLTSGTQLLTREQNWMENDFDKLTKVGFRRLVITNFSQLKEYVLNHHK EAKNLEKRLDEWLTRITYVEKSLNDLVDLKTTVRELHEAYTSLNSQFDQAEERISAIKDQ INQIK >gi568815589r:18956346_19202651|GENSCAN_predicted_CDS_1|378_bp atggggcttccccatgttggccaggctggtgtcgaactcctgacctcaggaacacaactc ctcaccagggaacaaaactggatggagaatgactttgacaagttgacaaaagtaggcttc agaaggttggtaataacaaacttctcccagctaaaggagtatgttctaaaccatcacaag gaagctaaaaaccttgaaaaaaggttagatgaatggctaactagaataacctatgtagag aagagcttaaatgacctcgtggatctgaaaaccacagtaagagaactgcatgaagcatac acaagcctcaatagccaatttgatcaagcagaagaaagaatatcagcaattaaagatcaa attaatcaaataaagtga >gi568815589r:18956346_19202651|GENSCAN_predicted_peptide_2|323_aa MKDKIKENSEKINVNKTLVYLVSNIIELLDVDPNDQEEDGPNIDLDSQRKDKCAVIKTST RQTYFLPVIGLVDVEKLKPGDLVGVNKDSYPILETLPRVRLAEVDERPTEQYSDTGGLDK QIQELVEATVLSMNHKEKFENLRIQHPKGVLAPRDGEDPPGLGLAQTKATFLKLTGPQLV QMFTGDGAKLVRDAFALAKEKAPSIIFTDELDAIGTKHFDSEKSGDREVQRTMLELLNQL DGFQPNTQVKGGLDRKIECLGQNHADPLPKDEHDFNGAQCKAVSVEAGMIALRRGATELT HEDYMEGILEVQAKKKANLQYYA >gi568815589r:18956346_19202651|GENSCAN_predicted_CDS_2|972_bp atgaaggacaagatcaaagagaacagtgagaaaatcaatgtgaacaaaaccctggtgtac cttgtctccaacatcatcgagctcctggatgttgatcccaatgatcaagaggaggatggt ccaaatattgacctggactcccagaggaaggacaagtgtgctgtgatcaaaacctctaca cgacagacatacttcctacctgtgattggattggtggatgttgaaaagctaaagccagga gacctggtgggtgtgaacaaagactcctatccaatcctggagacactgcccagagtacga ctcgcggaggtggatgagagacccacggagcaatacagtgacactgggggcttggacaag cagatccaggaactggtggaggccactgtcttgtcaatgaaccacaaggagaagtttgag aacttgcggatccaacatccaaaaggggtgctggcccccagggacggagaagaccctcct ggcctgggcctggcacagactaaggccaccttcctaaagctgactggcccccagctggtg cagatgttcactggagatggtgccaagctagtccgggatgccttcgccctggccaaggag aaagcgccctccatcatcttcactgatgagctggatgccataggcaccaagcactttgac agcgagaagtccggggaccgagaggtgcagaggacgatgctggagcttctgaaccagctg gatggcttccagcccaacacccaagttaagggcggcctggaccgcaagatcgagtgcctg ggccagaatcatgcagatccactcccaaaagatgaacatgacttcaacggggcccagtgc aaggctgtgtctgtggaggcgggtatgatcgcactgcgcaggggtgccacggagctcacc cacgaggactacatggaaggcatcctggaggtccaggccaagaagaaagccaacctacag tactacgcctag >gi568815589r:18956346_19202651|GENSCAN_predicted_peptide_3|541_aa MEVKELYNENYKTLMKETEEDTKKWKNIPCTWIGKINFVKMFMLPKAIYRFNEILMKVPI TFCAEIEKTILKCIWNHKRLRIAKAILSKKNKIGGITLPDLQLYYRVMVSLEQRLFKARR ARRAERGSSREIAAGSGWWRGRSGSLPESERISRAVATDSRGRSHLSKSQPVRGLSSPGF ALSTLLPAQLRGGAERGAACESPAAPDPSLGAACRPRPQAQSAVPRRVMPNTAMKKKVLL MGKSGSGKTSMRSIIFANYIARDTRRLGATIDVEHSHVRFLGNLVLNLWDCGGQDTFMEN YFTSQRDNIFRNVEVLIYVFDVESRELEKDMHYYQSCLEAILQNSPDAKIFCLVHKMDLV QEDQRDLIFKEREEDLRRLSRPLECACFRTSIWDETLYKAWSSIVYQLIPNVQQLEMNLR NFAQIIEADEVLLFERATFLVISHYQCKEQRDVHRFEKISNIIKQFKLSCSKLAASFQSM EVRNSNFAAFIDIFTSNTYVMVVMSDPSIPSAATLINIRNARKHFEKLERVDGPKHSLLM R >gi568815589r:18956346_19202651|GENSCAN_predicted_CDS_3|1626_bp atggaagtgaaagagctctacaatgaaaactataaaacactgatgaaagaaacagaagag gacacaaaaaagtggaaaaatattccatgtacatggattggaaaaatcaattttgttaaa atgttcatgctacccaaagcaatctacagatttaatgaaatccttatgaaagtaccaata acattctgcgcagaaatagaaaaaacaatcctaaaatgtatatggaaccacaaaagattg agaatagccaaagctatcctgagcaaaaagaacaaaattggaggaatcacattacctgac ctccagttatactacagagttatggtttcattagagcagcggcttttcaaggctcgtcgg gcacggagggcggagcgagggagctctcgcgagatcgccgccggaagtgggtggtggcgg ggacgcagcggctccctcccggaaagcgagcgtatctcccgagccgttgccactgacagc cgcgggcgctcccatctgagtaagagccagcccgtccgcggcctctccagccccgggttc gcgctctcgactctcctgcctgcccagctgcgcggcggagcggagcgaggcgcggcctgc gagtccccggcagcccccgacccctccctcggcgctgcgtgtaggccgcgccctcaggcc cagtccgcggtgccccggcgggtgatgccaaatacagccatgaagaaaaaggtgctgctg atggggaagagcgggtcggggaagaccagcatgaggtcgataatcttcgccaattacatt gctcgcgacacccggcgcctgggggccaccattgacgtggaacactcccacgtccgattc ctagggaacctggtgctgaacctgtgggactgtggcggtcaggacaccttcatggaaaat tacttcaccagccagcgagacaatatcttccgtaacgtggaagttttgatttacgtgttt gacgtggagagccgcgaactggaaaaggacatgcattattaccagtcgtgtctggaggcc atcctccagaactctcctgacgccaaaatcttctgcctggtgcacaaaatggatctggtt caggaggatcagcgtgacctgatttttaaagagcgagaggaagacctgaggcgtctgtct cgcccgctggagtgtgcttgttttcgaacgtccatctgggatgagacgctctacaaagcc tggtccagcatcgtctaccagctgattcccaacgttcagcagctggagatgaacctcagg aattttgcccaaatcattgaggccgatgaagttctgctgttcgaaagagctacattcttg gttatttcccactaccagtgcaaagagcagcgcgacgtccaccggtttgagaagatcagc aacatcatcaaacagttcaagctgagctgcagtaaattggccgcttccttccagagcatg gaagttaggaattccaacttcgctgctttcatcgacatcttcacctcaaatacgtacgtg atggtggtcatgtcagatccgtcgatcccttctgcggccactctgatcaacattcgcaat gcccggaaacactttgagaagctggagagagtggatggccccaagcacagtctccttatg cgttga >gi568815589r:18956346_19202651|GENSCAN_predicted_peptide_4|582_aa MPATVPTARMSSASVTAFEKEHLWMYLQALGFEPGPATIACGKIVSHTHLGVMEPYDDHS NMEEKIQKVRSLWASVNETLMFLEKEREVVSSVLSLVNQYALDGTNVAINIPRLLLDKIE KQMFQDTKMGTPKEKNEAISKKIPEFEVENSPLSDVAKNTESSAFGGSLPAKKSDPFQKE QDHLVEEVARAVLSDSPQLSEGKEIKLEELIDSLGSNPFLTRNQIPRTPENLITEIRSSW RKAIEMEENRTKEPIQMDAEHREVLPESLPVLHNQREFSMADFLLETTVSDFGQSHLTEE KVISDCECVPQKHVLTSHIDEPPTQNQSDLLNKKVICKQDLECLAFTKLSETSRMETFSP AVGNRIDVMGGSEEEFMKILDHLEVSCNKPSTNKTMLWNSFQISSGISSKSFKDNDFGIL HETLPEEVGHLSFNSSSSSEANFKLEPNSPMHGGTLLEDVVGGRQTTPESDFNLQALRSR YEALKKSLSKKREESYLSNSQTPERHKPELSPTPQNVQTDDTLNFLDTCDLHTEHIKPSL RTSIGERKRSLSPLIKFSPVEQRLRTTIACSLGELPNLKGKV >gi568815589r:18956346_19202651|GENSCAN_predicted_CDS_4|1749_bp atgcccgccaccgtgcctaccgcgaggatgagctcggcctcggtcaccgctttcgagaag gagcatctctggatgtatctgcaggcgctcggcttcgagccaggcccggcaaccattgcc tgcggaaagatcgtgtcgcacacgcacctcggagtaatggaaccctatgatgaccacagt aatatggaagaaaaaattcaaaaggttcggtctttgtgggcttcagtgaatgaaacgctc atgtttttggaaaaagagagagaagttgttagttcggtccttagtcttgttaaccaatat gctttagatggaactaatgttgctattaatattccaaggctcttacttgacaaaattgag aaacaaatgtttcaggacacaaagatgggaactcccaaagaaaaaaatgaagcaatttct aagaaaataccagaatttgaagtggaaaattctccattatcagatgttgcaaagaataca gagagtagtgcatttggagggtctttgccagctaaaaaaagtgatccatttcaaaaagag caagatcatctggtagaagaggttgccagagcagttttatctgattcaccacagctctct gaaggaaaagaaataaaattagaggaactaattgactctctgggttctaaccccttctta acaaggaatcagattccccgtactccagaaaacttgataactgaaattaggagctcatgg agaaaagctattgaaatggaagaaaacagaactaaagaaccaattcaaatggatgctgaa catagagaagtattgccagaatcattacctgtgttgcacaatcaaagagaatttagcatg gctgattttctcttagaaaccactgtatcagattttggccagtctcatttgactgaagag aaagttatttcagattgcgagtgtgtgcctcagaaacatgtgctgaccagtcacatagat gaaccaccaacacaaaatcagtcagatttgttaaataagaaagtaatttgcaagcaagat ttggaatgtttagccttcaccaagctttcagaaactagccgaatggagacattctcccct gctgtcggcaataggatagatgtgatgggtggcagtgaagaggagtttatgaaaatattg gaccacttagaagtttcttgtaacaaaccttccacaaataaaactatgttgtggaattct tttcagatatcaagtggaattagttctaagagttttaaagataatgattttggcatatta cacgaaactctcccggaagaagttggtcatctaagttttaatagttccagtagttcagag gccaattttaaactggagccaaatagtcctatgcatggtggcactcttctagaagatgtt gtgggagggagacagactactccagaatcagactttaatttacaggctcttcgcagtaga tacgaggctctgaagaaatctctttccaagaaaagggaagaatcttacctctcgaattcc caaacacccgaaagacacaaaccagaattgagccctactccccaaaatgtacaaacagat gatacgcttaactttttggacacctgtgatttgcatactgagcatataaagccatcttta cgcacgtccatcggtgaaagaaaacggtctctttcaccactaattaagttttctccagtg gaacaaagattgagaaccacaatagcatgtagtcttggagaactacctaatttaaagggt aaggtttaa >gi568815589r:18956346_19202651|GENSCAN_predicted_peptide_5|506_aa MKLNISFPATGCQKLIEMDDECKLRTFYEKRTATKVAADTLGEEWKGYVVRISAGVVFGT RLLFAFRCSPSSVVTRVVNLPLVSSTYDLMSSAYLSTKDQYPYLKSVCEMAENGVKTITS VAMTSALPIIQKLEPQIAVANTYACKGLDRIEERLPILNQPSTQIVANAKGAVTGAKDAV TTTVTGAKDSVASTITGVMDKTKGAVTGSVEKTKSVVSGSINTVLGSRMMQLVSSGVENA LTKSELLVEQYLPLTEEELEKEAKKVEGFDLVQKPSYYVRLGSLSTKLHSRAYQQALSRV KEAKQKSQQTISQLHSTVHLIEFARKNVYSANQKIQDAQDKLYLSWVEWKRSIGYDDTDE SHCAEHIESRTLAIARNLTQQLQTTCHTLLSNIQGVPQNIQDQAKHMGVMAGDIYSVFRN AASFKEVSDSLLTSSKGQLQKMKESLDDVMDYLVNNTPLNWLVFDFTTIDLTSETDEIPD IIALEEENGSNNSHANGPVLSGQDVE >gi568815589r:18956346_19202651|GENSCAN_predicted_CDS_5|1521_bp atgaagctgaacatctccttcccagccactggctgccagaaactcattgaaatggatgat gaatgcaaacttcgtactttttatgagaaacgtacggccacaaaagttgctgctgacaca ctgggtgaagaatggaagggttatgtggtccgaatcagtgccggagtcgtcttcgggacg cgcctgctcttcgcctttcgctgcagtccgtcgagtgtggtgactcgggtggtcaacctg cccttggtgagctccacgtatgacctcatgtcctcagcctatctcagtacaaaggaccag tatccctacctgaagtctgtgtgtgagatggcagagaacggtgtgaagaccatcacctcc gtggccatgaccagtgctctgcccatcatccagaagctagagccgcaaattgcagttgcc aatacctatgcctgtaaggggctagacaggattgaggagagactgcctattctgaatcag ccatcaactcagattgttgccaatgccaaaggcgctgtgactggggcaaaagatgctgtg acgactactgtgactggggccaaggattctgtggccagcacgatcacaggggtgatggac aagaccaaaggggcagtgactggcagtgtggagaagaccaagtctgtggtcagtggcagc attaacacagtcttggggagtcggatgatgcagctcgtgagcagtggcgtagaaaatgca ctcaccaaatcagagctgttggtagaacagtacctccctctcactgaggaagaactagaa aaagaagcaaaaaaagttgaaggatttgatctggttcagaagccaagttattatgttaga ctgggatccctgtctaccaagcttcactcccgtgcctaccagcaggctctcagcagggtt aaagaagctaagcaaaaaagccaacagaccatttctcagctccattctactgttcacctg attgaatttgccaggaagaatgtgtatagtgccaatcagaaaattcaggatgctcaggat aagctctacctctcatgggtagagtggaaaaggagcattggatatgatgatactgatgag tcccactgtgctgagcacattgagtcacgtactcttgcaattgcccgcaacctgactcag cagctccagaccacgtgccacaccctcctgtccaacatccaaggtgtaccacagaacatc caagatcaagccaagcacatgggggtgatggcaggcgacatctactcagtgttccgcaat gctgcctcctttaaagaagtgtctgacagcctcctcacttctagcaaggggcagctgcag aaaatgaaggaatctttagatgacgtgatggattatcttgttaacaacacgcccctcaac tggctggtgtttgatttcaccaccatagacttgacatctgagactgatgaaattccagat attatagctttggaagaggagaacggatcaaataattcacatgctaatggccctgtcctc tcagggcaagatgttgaataa