GENSCAN 1.0 Date run: 3-Nov-116 Time: 07:44:55 Sequence gi568815586r:110353037_110568203 : 215167 bp : 44.37% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 1820 2005 186 1 0 85 39 144 0.667 4.28 1.02 PlyA + 2049 2054 6 1.05 2.35 PlyA - 6817 6812 6 1.05 2.34 Term - 17387 17188 200 0 2 -22 37 168 0.509 -1.74 2.33 Intr - 18120 17896 225 1 0 112 71 80 0.601 6.66 2.32 Intr - 19161 19043 119 2 2 76 70 25 0.166 -0.39 2.31 Intr - 21297 21112 186 2 0 110 45 316 0.298 28.30 2.30 Intr - 23180 23030 151 0 1 107 102 103 0.999 12.82 2.29 Intr - 24581 24357 225 0 0 105 83 101 0.959 9.26 2.28 Intr - 28912 28716 197 0 2 108 116 41 0.989 7.96 2.27 Intr - 29924 29807 118 0 1 124 51 94 0.566 8.82 2.26 Intr - 33433 33291 143 0 2 -2 115 69 0.917 0.60 2.25 Intr - 34856 34703 154 0 1 87 85 56 0.659 4.33 2.24 Intr - 35587 35476 112 1 1 66 48 70 0.729 0.75 2.23 Intr - 42184 42065 120 1 0 46 106 52 0.913 3.49 2.22 Intr - 43394 43230 165 2 0 73 91 84 0.547 7.36 2.21 Intr - 48422 48317 106 1 1 57 81 44 0.512 0.82 2.20 Intr - 50784 50491 294 2 0 29 86 234 0.587 13.42 2.19 Intr - 52572 52440 133 1 1 84 74 40 0.625 1.90 2.18 Intr - 52935 52825 111 1 0 79 91 34 0.677 3.15 2.17 Intr - 55499 55418 82 2 1 75 115 66 0.944 7.21 2.16 Intr - 83168 83074 95 0 2 60 116 30 0.906 2.68 2.15 Intr - 83647 83521 127 1 1 77 95 41 0.980 3.95 2.14 Intr - 84116 84048 69 2 0 120 86 -20 0.562 0.38 2.13 Intr - 87352 87276 77 2 2 77 94 140 0.995 12.73 2.12 Intr - 92515 92416 100 1 1 77 89 21 0.518 0.78 2.11 Intr - 100835 100707 129 2 0 38 86 139 0.995 9.59 2.10 Intr - 102646 102550 97 0 1 70 107 61 0.997 6.21 2.09 Intr - 102894 102779 116 0 2 96 81 104 0.991 9.75 2.08 Intr - 104598 104474 125 1 2 47 47 74 0.890 -0.50 2.07 Intr - 106826 106659 168 0 0 76 68 210 0.929 17.72 2.06 Intr - 112178 112070 109 2 1 83 119 163 0.619 18.76 2.05 Intr - 115951 115691 261 1 0 58 52 225 0.455 13.58 2.04 Intr - 139086 138999 88 2 1 104 9 66 0.008 0.27 2.03 Intr - 140195 139960 236 2 2 60 92 121 0.911 6.19 2.02 Intr - 143167 142976 192 1 0 23 49 151 0.135 4.39 2.01 Init - 149312 149253 60 2 0 81 49 86 0.059 5.48 2.00 Prom - 151627 151588 40 -2.06 3.00 Prom + 152256 152295 40 -3.86 3.01 Init + 165180 165187 8 2 2 40 103 0 0.317 -2.87 3.02 Intr + 166758 166880 123 0 0 69 116 36 0.793 4.20 3.03 Intr + 169141 169375 235 1 1 73 93 67 0.825 3.39 3.04 Term + 174336 174485 150 2 0 103 53 77 0.759 3.61 3.05 PlyA + 174607 174612 6 1.05 4.06 PlyA - 177951 177946 6 1.05 4.05 Term - 184059 184001 59 1 2 97 37 61 0.700 -0.25 4.04 Intr - 185237 185108 130 2 1 72 99 114 0.997 11.17 4.03 Intr - 186909 186786 124 2 1 106 86 119 0.779 14.09 4.02 Intr - 193042 192844 199 2 1 59 89 177 0.997 13.31 4.01 Intr - 198932 198753 180 0 0 101 123 22 0.887 6.84 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 75805 75961 157 0 1 80 53 130 0.805 8.85 S.002 Term - 82181 82119 63 2 0 72 49 42 0.834 -3.41 S.003 Init - 107344 107317 28 1 1 34 89 1 0.855 -5.44 S.004 Sngl + 107952 108200 249 2 0 75 41 356 0.997 24.68 S.005 Term - 111503 111493 11 0 2 108 40 12 0.925 -3.24 S.006 Intr + 133289 133390 102 1 0 95 32 101 0.836 5.57 S.007 Term + 133498 133668 171 0 0 58 54 93 0.825 0.73 S.008 Term - 139086 138969 118 2 1 104 42 71 0.966 2.11 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:110353037_110568203|GENSCAN_predicted_peptide_1|61_aa MADSRDGTENVQYVPEEYQSTRKEGLKNSNQRCGYVKGTQEPTEKATTAEECEQQNKAVL D >gi568815586r:110353037_110568203|GENSCAN_predicted_CDS_1|186_bp atggctgattctagagatgggacagaaaacgtacagtacgtgcctgaagaataccaaagc accagaaaggaagggctcaaaaacagcaatcaaagatgtggttatgtgaaaggaacacag gagccaactgaaaaagccacaacagccgaagaatgtgagcaacaaaataaagcagtattg gactag >gi568815586r:110353037_110568203|GENSCAN_predicted_peptide_2|1629_aa MAAIRPKKQARKARLSSTAQLVLVLGDLHIPHRCNSLPAKFKKLLVPGKIQHILCTGNLC TKESYDYLKTLAGDVHIVRGDFDENLNYPEQKVVTVGQFKIGLIHGHQVIPWGDMASLAL LQRQFDVDILISGHTHKFEAFEHENKFYINPGSATGAYNALETNIIPSFVLMDIQASTVV TYVYQLIGDDVKRAALQKSYAPSSRTPDPGRASPRRRDREPCAGAAVPASLPRQHACVRL LVFRRGGSDAGFSPLRSRSDTARTASGERGGQIRDPGVTSTYCATMVQHCEALNRSVQVV NLDPAAEHFNYSVMADIRELIEVDDVMEDDSLRFGPNGGLVFCMEYFANNFDWLENCLGH VEDDYILFDCPGQIELYTHLPVMKQLVQQLEQWEFRVCGVFLVDSQFMVESFKFISGILA ALSAMISLEIPQVNIMTKMDLLSKKAKKEIEKFLDPDMYSLLEDSTSDLRSKKFKKLTKA ICGLIDDYSMVRFLPYDQSDEESMNIVLQHIDFAIQYGEDLEFKEPKAYHSSLMDPDTKL IGNMALLPIRSQFKGPAPRETKDTDIVDEAIYYFKANVFFKNYEIKNEADRTLIYITLYI SECLKKLQKCNSKSQGEKEMYTLGITNFPIPGEPGFPLNAIYAKPANKQEDEVMRAYLQQ LRQETGLRLCEKVFDPQNDKPSKWLKTDTARSPRKPTGPSQTLWVTLTVEGKVEWTNGLL KTHLTKLSLQLKKDSVKDRAPKLTNQAGSHTPPPIPLEAALKNIAHYLSIPPPKIFAAAT LYDYFVLFFLLTTTTGAGTRVSFYSGTSVVTSCLGVVPRVDPMDPGDAAILESSLRILYR LFESVLPPLPAALQSRMNVIDHVRDMAAAGLHSNVRLLSSLLLTMSNNNPTLGYSHFSTL RITHQLKAQWLCAISTHMLIECVEEKYQLLVYHADSLFHDKEYRNAVSKYTMALQQKKAL SKTSKVRPSTGNSASTPQSQCLPSEIEVKYKMAECYTMLKQDKDAIAILDGIPSRQRTPK INMMLANLYKKAGQERPSVTSYKEVLRQCPLALDAILGLLSLSVKGAEVASMTMNVIQTV PNLDWLSVWIKAYAFVHTGDNSRAISTICSLEKKSLLRDNVDLLGSLADLYFRAGDNKNS VLKFEQAQMLDPYLIKGMDVYGYLLAREGRLEDVENLGCRLFNISDQHAEPWVVSGCHSF YSKRYSRALYLGAKAIQLNSNSVQALLLKGAALRNMGRVQEAIIHFREAIRLAPCRLDCY EGLIECYLASNSIREAMVMANNVYKTLGANAQTLTLLATVCLEDPVTQEKAKTLLDKALT QRPDYIKAVVKKAELLSREQKYEDGIALLRNALANQSDCVLHRILGDFLVAVNEYQEAMD QYSIALSLDPNDQKSLEGMQKMEKEESPTDATQEEDVDDMEGSGEEGDLEGSDSEAAQWA DQEQWFGMHPAESPCALQAPAAQHLLGRILILCSSSPRVTARLEEAAEGVPWKSIEKPAR GELEVPNSCRPDPRGPLGVNSAAVRGWEEPASKGLTCGFLRSQPYTGLPVAGQGLRGAAS QSLGGTTTVENSLAIQKFKQYYYPRIQCSTSGCTPEEVKAGTPKDCTSMSTQHCPQQQKV ETTHVSLDE >gi568815586r:110353037_110568203|GENSCAN_predicted_CDS_2|4890_bp atggctgccatccgccctaaaaagcaggctcggaaggcgcgcctcagctcgacggcccag ttggtgttggtattaggagatctgcacatcccacaccggtgcaacagtttgccagctaaa ttcaaaaaactcctggtgccaggaaaaattcagcacattctctgcacaggaaacctttgc accaaagagagttatgactatctcaagactctggctggtgatgttcatattgtgagagga gacttcgatgagaatctgaattatccagaacagaaagttgtgactgttggacagttcaaa attggtctgatccatggacatcaagttattccatggggagatatggccagcttagccctg ttgcagaggcaatttgatgtggacattcttatctcgggacacacacacaaatttgaagca tttgagcatgaaaataaattctacattaatccaggttctgccactggggcatataatgcc ttggaaacaaacattattccatcatttgtgttgatggatatccaggcttctacagtggtc acctatgtgtatcagctaattggagatgatgtgaaacgggcggctctgcagaagagctac gctccgtccagtcggaccccggaccctggccgggcatctccgcggcgccgagaccgcgag ccgtgtgcgggagcagctgtcccagcatccctcccccgacagcacgcgtgcgtcaggctg ctggtattccggagaggcggctcggatgctggcttctcgcccctccgaagccgttcggac actgcccgaacagcttcgggagagcggggcgggcagatccgcgaccccggagttacgagc acctactgtgccaccatggtccagcactgtgaagccctcaaccggtctgtccaagttgta aacctggatccagcagcagaacacttcaactactccgtgatggctgacatccgggaactg atcgaggtggatgatgtaatggaggatgattctctgcgattcggtcccaacggaggattg gtattttgcatggagtactttgccaataattttgactggctggagaactgtcttggccat gtagaggacgactatatcctttttgattgtccaggtcagattgagttgtacactcacctg cctgtgatgaaacagctggtccagcagctcgagcagtgggagttccgagtctgtggagtt tttcttgttgattctcagttcatggtggagtcattcaagtttatttctggcatcttggca gccctgagtgccatgatctctctagaaattccgcaagtcaacatcatgacaaaaatggat ctgctgagtaaaaaagcaaaaaaggaaattgagaaatttttagatccagacatgtattct ttattagaagattctacaagtgacttaagaagcaaaaaattcaagaaactgactaaagct atatgtggactgattgatgactacagcatggttcgatttttaccttacgatcagtcagat gaagaaagcatgaacattgtattgcagcatattgattttgccattcaatatggagaagac ctagaatttaaagaaccaaaggcttaccactcttctctcatggatcctgataccaaactc atcggaaacatggcactgttgcctatcagaagtcaattcaaaggacctgcccccagagag acaaaagatacagatattgtggatgaagccatctattacttcaaggccaatgtcttcttc aaaaactatgaaattaagaatgaagctgataggaccttgatatatataactctctacatt tctgaatgtctgaagaaactgcaaaagtgcaattccaaaagccaaggtgagaaagaaatg tatacgctgggaatcactaattttcccattcctggagagcctggttttccacttaacgca atttatgccaaacctgcaaacaaacaggaagatgaagtgatgagagcctatttacaacag ctaaggcaagagactggactgagactttgtgagaaagttttcgaccctcagaatgataaa cccagcaagtggctgaagactgacactgcccgatcacctcggaagcctacaggaccatca cagacgctttgggtaactcttacagtggaaggaaaggtagaatggactaatggtctttta aaaacacacctcaccaagctcagcctccaacttaaaaaggactctgtcaaggatagagcc ccaaaactcaccaaccaagcaggttcccacacaccacccccaatcccacttgaagcagcc ctgaaaaacatcgcccattatctctccataccacccccaaaaattttcgctgccgcaaca ctttacgactatttcgttttattcttcttgttaacgaccacgacaggagcagggactcga gtttctttttattcgggcacttcagtagttacctcttgcttgggggtggtgccgcgtgtg gacccgatggaccccggcgacgccgccattttggagtcttccctaaggatcctctaccgg cttttcgagtcagtgctgccgccgctgcccgcggctttgcagagcaggatgaatgtgata gaccacgtgcgggacatggcggccgcggggctgcactccaacgtgcggctcctcagcagc ttgttacttacaatgagtaataacaacccgactctcgggtactcccacttttctaccctt cgcatcacccaccaattgaaagcgcagtggctgtgcgcaataagcactcacatgcttatt gagtgtgttgaggaaaagtaccagcttttggtgtatcatgcagattctctctttcatgat aaggaatatcggaatgctgtgagtaagtataccatggctttacagcagaagaaagcgcta agtaaaacttcaaaagtgagaccttcaactggaaattctgcatctactccacaaagtcag tgtcttccatctgaaattgaagtgaaatacaaaatggctgaatgttatacaatgctaaaa caagataaagatgccattgctatacttgatgggatcccttcaagacaaagaactcccaaa ataaacatgatgctggcaaacctgtacaagaaggctggtcaggagcgcccttcagtcacc agctataaggaggtgctgaggcagtgcccattagcccttgatgccattctaggcttgttg tccctttctgtaaaaggggcagaggtggcatccatgacaatgaatgtgatccaaaccgtg cctaacttggactggctctctgtgtggatcaaagcgtatgcttttgtgcacactggtgac aactcaagagcaatcagtaccatctgttcactagagaaaaaatccttattgcgagataac gtggacctattgggaagcttggcagatctgtacttcagagctggagacaataaaaactct gtcctcaagtttgaacaggcacagatgttggatccttatctgataaaaggaatggatgta tatggctacctactggcacgagaagggcggctagaggatgttgagaaccttggatgccgc cttttcaatatctctgatcagcatgcagaaccgtgggtggtttctggctgtcacagcttc tatagcaaacgctactcccgggccctctatttaggagccaaggccattcagctgaacagt aatagtgttcaagctctgctacttaagggagcagcacttaggaacatgggcagagtccaa gaagcaataatccactttcgggaggccatacggctcgcaccttgtcgcttagattgttat gaaggtcttatcgaatgttacttagcctccaacagtattcgagaagcaatggtaatggct aacaacgtttacaaaactctgggagcaaatgcacagacccttacccttttagccaccgtt tgtcttgaagacccagtgacacaggagaaagccaaaacattattagataaagccctgacc caaaggccagattacattaaggctgtggtgaaaaaagcagaactacttagcagagaacag aaatatgaagatggaattgctttgctgaggaacgcactggctaatcagagtgactgtgtc ctgcatcggatcctaggagatttccttgtagctgtcaatgagtatcaggaggcaatggac cagtatagtatagcactaagtttggaccccaatgaccagaagtctctagaggggatgcag aagatggagaaggaggagagtcccacggatgccactcaggaggaggatgtggacgacatg gaagggagtggggaagaaggggacctggagggcagcgacagtgaggcggcccagtgggct gaccaggagcagtggttcggcatgcaccctgctgagagcccctgtgctctccaggctcct gcagcccagcacctgctaggacggatcctcattctctgcagctccagccccagggtgaca gccaggttggaagaggcggctgaaggtgttccctggaaaagtattgagaaacctgcgaga ggggagctcgaagttcccaacagctgcaggcctgaccccagagggcccttgggagttaac tctgcagctgtccgtgggtgggaggaaccggcctccaaaggcctcacctgtggcttcctg cgttcccagccctacactggcttacctgtggcaggccagggcctgcgcggagctgcttcc cagagcttaggtggcacaaccactgtggagaacagtctggcgattcagaaattcaaacag tactactaccctaggatccagtgttccacctccgggtgtacacccgaagaagtgaaagct ggaacgccaaaagattgcacgtccatgtccacgcagcactgtccacagcagcaaaaggtg gaaacaacccacgtgtccttagatgagtag >gi568815586r:110353037_110568203|GENSCAN_predicted_peptide_3|171_aa MCIPLALSIDDMLVEANFILATLADEQSRASSPQSLCLSQKRKRSDLIEKKAGKNVTGQA LECISKKAAPRRLYPKETLTNISALENCGSPAMKRVDGDVSEVSESSVSNTEEVPGSLCL RKCDITSSEVHGLSTWTSGNHFPACHSQYHGYLESRFNSCKASLAEGIFTF >gi568815586r:110353037_110568203|GENSCAN_predicted_CDS_3|516_bp atgtgcatacctctggctttgagtattgatgatatgttagtggaagctaactttattttg gccacattagctgatgaacaaagtagagcatcttcaccacagtcactgtgtctttcacag aaacgaaaaaggtcagatctgattgaaaaaaaggctggcaaaaatgtaactggccaggcc ctggaatgtatttcaaaaaaagcagcaccaagaaggctttatcctaaggagactctcaca aacatatctgcattggaaaactgtggcagccctgcaatgaaaagagtggatggagatgtc agtgaagtatcagaaagcagtgtcagcaacacagaggaagtgccagggtctctgtgtctc agaaagtgtgacataacatcttcagaggttcatgggcttagtacatggacctctgggaac cattttcctgcctgccacagccagtatcacggctatctggagagcaggttcaacagctgc aaggcaagccttgcagagggcatcttcaccttctga >gi568815586r:110353037_110568203|GENSCAN_predicted_peptide_4|230_aa XVADGVGGWRDYGVDPSQFSGTLMRTCERLVKEGRFVPSNPIGILTTSYCELLQNKVPLL GSSTACIVVLDRTSHRLHTANLGDSGFLVVRGGEVVHRSDEQQHYFNTPFQLSIAPPEAE GVVLSDSPDAADSTSFDVQLGDIILTATDGLFDNMPDYMILQELKKLKNSNYESIQQTAR SIAEQAHELAYDPNYMSPFAQFACDNGLNVRGGKPDDITVLLSIVAEYTD >gi568815586r:110353037_110568203|GENSCAN_predicted_CDS_4|693_bp ngggttgcagatggtgtaggaggctggagagactatggagttgatccatctcaattctca gggactttaatgcggacgtgtgaacgtttagtaaaagaaggacggttcgtacctagtaat cccattggaattctcaccacaagctactgtgagttgctgcaaaataaagtccctttgctc ggtagcagcaccgcctgcattgtggtgctggacagaaccagccaccgcttacacacagca aacctgggcgattcaggcttcctggttgtcaggggtggtgaagtcgtgcaccgatcagat gagcagcagcattacttcaacactccattccagctctcaatcgctccccctgaagccgag ggagtcgtcttgagcgacagtccggatgctgctgatagcacgtctttcgatgtccagcta ggagacattatcctgacggcaacagatggactctttgacaacatgcctgattatatgatt cttcaggagctaaaaaagttaaagaattcaaattatgagagtatacaacagactgccaga agcattgctgagcaagctcatgagctggcctatgacccaaattatatgtcaccttttgca cagtttgcatgtgacaatggattgaatgtgagaggtggaaagccagatgacatcaccgtc cttctttcaatagtggctgagtatacagactag