GENSCAN 1.0 Date run: 5-Nov-116 Time: 11:32:51 Sequence gi568815597r:67313210_67530300 : 217091 bp : 42.01% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 8393 8680 288 0 0 77 77 123 0.210 6.39 1.02 Intr + 13526 13640 115 0 1 58 79 53 0.237 -0.01 1.03 Intr + 14991 15175 185 0 2 96 56 108 0.246 6.91 1.04 Intr + 25415 25494 80 0 2 66 113 54 0.073 4.05 1.05 Intr + 37661 37880 220 1 1 76 111 209 0.873 18.95 1.06 Intr + 54616 54816 201 2 0 56 82 186 0.718 13.14 1.07 Intr + 59227 59325 99 2 0 92 113 89 0.997 10.96 1.08 Intr + 66777 66914 138 1 0 124 100 79 0.696 12.21 1.09 Intr + 73370 73460 91 0 1 109 87 68 0.007 6.93 1.10 Intr + 76820 76919 100 2 1 75 94 71 0.042 5.59 1.11 Term + 82338 82880 543 2 0 105 48 393 0.589 30.28 1.12 PlyA + 83772 83777 6 -0.45 2.09 PlyA - 83972 83967 6 1.05 2.08 Term - 84494 84190 305 0 2 65 53 116 0.244 0.15 2.07 Intr - 85480 85396 85 1 1 73 106 42 0.205 2.97 2.06 Intr - 102130 101957 174 1 0 72 86 153 0.926 12.71 2.05 Intr - 106977 106800 178 2 1 50 83 207 0.998 15.40 2.04 Intr - 111113 110991 123 1 0 91 97 125 0.996 12.48 2.03 Intr - 112014 111874 141 2 0 42 97 233 0.999 18.15 2.02 Intr - 113076 112926 151 1 1 79 97 233 0.988 21.60 2.01 Init - 117280 116608 673 1 1 76 13 685 0.902 53.19 2.00 Prom - 145402 145363 40 -4.55 3.07 PlyA - 145894 145889 6 1.05 3.06 Term - 148315 148164 152 2 2 51 39 234 0.705 11.89 3.05 Intr - 150602 150480 123 0 0 39 -6 199 0.966 5.24 3.04 Intr - 151061 150915 147 0 0 21 110 149 0.986 9.69 3.03 Intr - 152588 152355 234 0 0 88 67 73 0.447 1.94 3.02 Intr - 155235 155055 181 0 1 -9 44 173 0.700 1.72 3.01 Init - 155472 155443 30 0 0 45 88 70 0.808 2.62 3.00 Prom - 156836 156797 40 -9.95 4.00 Prom + 156880 156919 40 -1.75 4.01 Init + 162777 162889 113 2 2 53 110 25 0.652 0.93 4.02 Intr + 165982 166144 163 1 1 77 78 129 0.883 9.86 4.03 Intr + 171388 171482 95 0 2 61 44 42 0.268 -5.06 4.04 Intr + 172063 172107 45 1 0 121 105 -4 0.262 1.21 4.05 Intr + 174815 174938 124 2 1 61 45 145 0.056 7.17 4.06 Term + 181626 181769 144 2 0 37 42 148 0.859 2.03 4.07 PlyA + 182361 182366 6 1.05 5.04 PlyA - 183827 183822 6 1.05 5.03 Term - 198464 198396 69 2 0 84 36 135 0.725 4.86 5.02 Intr - 210500 210340 161 0 2 66 42 96 0.221 1.59 5.01 Init - 211955 211898 58 2 1 87 68 32 0.180 2.63 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 70838 70666 173 0 2 29 53 151 0.838 2.71 S.002 Intr - 71666 71356 311 1 2 17 63 256 0.836 11.03 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:67313210_67530300|GENSCAN_predicted_peptide_1|686_aa XACKRGDVTVKPSHVILLGSTVNITCSLKPRQGCFHYSRRNKLILYKFDRRINFHHGHSL NSQVTGLPLGTTLFVCKLACINSDEIQICGAEIFVGVAPEQPQNLSCIQKGEQGTVACTW ERGRDTHLYTEYTLQLSGPKNLTWQKQCKDIYCDYLDFGINLTPESPESNFTAKVTAVNS LGSSSSLPSTFTFLDIEPTGMLDVWYMKRHIDYSRQQISLFWKNLSVSEARGKILHYQVT LQELTGGKAMTQNITGHTSWTTVIPRTGNWAVAVSAANSKGSSLPTRINIMNLCEAGLLA PRQVSANSEGMDNILVTWQPPRKDPSAVQEYVVEWRELHPGGDTQVPLNWLRSRPYNVSA LISENIKSYICYEIRVYALSGDQGGCSSILGNSKHKEIPYRVSQNSHPINSLQPRVTYVL WMTALTAAGESSHGNEREFCLQGKANWMAFVAPSICIAIIMVGIFSTHYFQQKVFVLLAA LRPQWCSREIPDPANSTCAKKYPIAEEKTQLPLDRLLIDWPTPEDPEPLVISEVLHQVTP VFRHPPCSNWPQREKGIQGHQASEKDMMHSASSPPPPRALQAESRQLVDLYKVLESRGSD PKPENPACPWTVLPAGDLPTHDGYLPSNIDDLPSHEAPLADSLEELEPQHISLSVFPSSS LHPLTFSCGDKLTLDQLKMRCDSLML >gi568815597r:67313210_67530300|GENSCAN_predicted_CDS_1|2061_bp natgcgtgcaagagaggcgatgtgactgtgaagccttcccatgtaattttacttggatcc actgtcaatattacatgctctttgaagcccagacaaggctgctttcactattccagacgt aacaagttaatcctgtacaagtttgacagaagaatcaattttcaccatggccactccctc aattctcaagtcacaggtcttccccttggtacaaccttgtttgtctgcaaactggcctgt atcaatagtgatgaaattcaaatatgtggagcagagatcttcgttggtgttgctccagaa cagcctcaaaatttatcctgcatacagaagggagaacaggggactgtggcctgcacctgg gaaagaggacgagacacccacttatacactgagtatactctacagctaagtggaccaaaa aatttaacctggcagaagcaatgtaaagacatttattgtgactatttggactttggaatc aacctcacccctgaatcacctgaatccaatttcacagccaaggttactgctgtcaatagt cttggaagctcctcttcacttccatccacattcacattcttggacatagagcctactggg atgttagatgtctggtacatgaaacggcacattgactacagtagacaacagatttctctt ttctggaagaatctgagtgtctcagaggcaagaggaaaaattctccactatcaggtgacc ttgcaggagctgacaggagggaaagccatgacacagaacatcacaggacacacctcctgg accacagtcattcctagaaccggaaattgggctgtggctgtgtctgcagcaaattcaaaa ggcagttctctgcccactcgtattaacataatgaacctgtgtgaggcagggttgctggct cctcgccaggtctctgcaaactcagagggcatggacaacattctggtgacttggcagcct cccaggaaagatccctctgctgttcaggagtacgtggtggaatggagagagctccatcca gggggtgacacacaggtccctctaaactggctacggagtcgaccctacaatgtgtctgct ctgatttcagagaacataaaatcctacatctgttatgaaatccgtgtgtatgcactctca ggggatcaaggaggatgcagctccatcctgggtaactctaagcacaaagaaattccctac agagtctcccaaaattcacatccaataaacagcctgcagccccgagtgacatatgtcctg tggatgacagctctgacagctgctggtgaaagttcccacggaaatgagagggaattttgt ctgcaaggtaaagccaattggatggcgtttgtggcaccaagcatttgcattgctatcatc atggtgggcattttctcaacgcattacttccagcaaaaggtgtttgttctcctagcagcc ctcagacctcagtggtgtagcagagaaattccagatccagcaaatagcacttgcgctaag aaatatcccattgcagaggagaagacacagctgcccttggacaggctcctgatagactgg cccacgcctgaagatcctgaaccgctggtcatcagtgaagtccttcatcaagtgacccca gttttcagacatcccccctgctccaactggccacaaagggaaaaaggaatccaaggtcat caggcctctgagaaagacatgatgcacagtgcctcaagcccaccacctccaagagctctc caagctgagagcagacaactggtggatctgtacaaggtgctggagagcaggggctccgac ccaaagcccgaaaacccagcctgtccctggacggtgctcccagcaggtgaccttcccacc catgatggctacttaccctccaacatagatgacctcccctcacatgaggcacctctcgct gactctctggaagaactggagcctcagcacatctccctttctgttttcccctcaagttct cttcacccactcaccttctcctgtggtgataagctgactctggatcagttaaagatgagg tgtgactccctcatgctctga >gi568815597r:67313210_67530300|GENSCAN_predicted_peptide_2|609_aa MTLAYIPHSALRRRPLPDVCLAPEEKTAPLSRPGHLVGRAEAGALGSARPAAIRGGTRRR ATIMPGHLQEGFGCVVTNRFDQLFDDESDPFEVLKAAENKKKEAGGGGVGGPGAKSAAQA AAQTNSNAAGKQLRKESQKDRKNPLPPSVGVVDKKEETQPPVALKKEGKAGAECGGDWDG GKERGGCRLPLVSGAPRVRGRLCRLYPHVSSRDKSPVLRRCQERGIRRVGRRPDQQLQGE GKIIDRRPERRPPRERRFEKPLEEKGEGGEFSVDRPIIDRPIRGRGGLGRGRGGRGRGMG RGDGFDSRGKREFDRHSGSDRSESPKYIQKQISYNYSDLDQSNVTEETPEGEEHHPVADT ENKENEVEEVKEEGPKEMTLDEWKAIQNKDRAKVEFNIRKPNEGADGQWKKGFVLHKSKS EEAHAEDSVMDHHFRKPANDITSQLEINFGDLGRPGRGGRGGRGGRGRGGRPNRGSRTDK LLWTRHDAGAGETLPKFLLSSCLQSNRERKSIISSLDQDFSTSLAEVPMVKRQRQQWARE GGKPGATALKSKTWIMFQVHVVEKRVIFQAQLELAEDQLNIFSGNESCAFGNRGQTTSED NLRGWGLRP >gi568815597r:67313210_67530300|GENSCAN_predicted_CDS_2|1830_bp atgactctggcctacataccccacagtgccttgcggcgcaggccgctcccggatgtgtgc ctggcgccggaagagaagacggcccccctctctcggcccggccatcttgtgggaagagct gaagcaggcgctcttggctcggcgcggcccgctgcaatccgtggaggaacgcgccgccga gccaccatcatgcctgggcacttacaggaaggcttcggctgcgtggtcaccaaccgattc gaccagttatttgacgacgaatcggaccccttcgaggtgctgaaggcagcagagaacaag aaaaaagaagccggcgggggcggcgttgggggccctggggccaagagcgcagctcaggcc gcggcccagaccaactccaacgcggcaggcaaacagctgcgcaaggagtcccagaaagac cgcaagaacccgctgccccccagcgttggcgtggttgacaagaaagaggagacgcagccg cccgtggcgcttaagaaagaaggtaaagcaggggcagaatgtgggggagactgggatgga gggaaggagcgagggggctgccggctgccacttgtttctggagctccgagggtccgcggg agactttgccgcctatatcctcacgtttcttcacgcgacaaaagtccggtcctgcgaagg tgccaggagcgtggaataagacgagttggaagaagacctgatcaacaacttcagggtgaa gggaaaataattgatagaagaccagaaaggcgaccacctcgtgaacgaagattcgaaaag ccacttgaagaaaagggtgaaggaggcgaattttcagttgatagaccgattattgaccga cctattcgaggtcgtggtggtcttggaagaggtcgagggggccgtggacgtggaatgggc cgaggagatggatttgattctcgtggcaaacgtgaatttgataggcatagtggaagtgat agatcagagtcccccaaatacattcagaaacaaatatcttataattacagtgacttggat caatcaaatgtgactgaggaaacacctgaaggtgaagaacatcatccagtggcagacact gaaaataaggagaatgaagttgaagaggtaaaagaggagggtccaaaagagatgactttg gatgagtggaaggctattcaaaataaggaccgggcaaaagtagaatttaatatccgaaaa ccaaatgaaggtgctgatgggcagtggaagaagggatttgttcttcataaatcaaagagt gaagaggctcatgctgaagattcggttatggaccatcatttccggaagccagcaaatgat ataacgtctcagctggagatcaattttggagaccttggccgcccaggacgtggcggcagg ggaggacgaggtggacgtgggcgtggtgggcgcccaaaccgtggcagcaggaccgacaag ctactatggaccaggcatgatgctggtgctggggaaacactgccaaagttcctgctttca agttgtctgcagtccaacagagaaaggaagagcattattagcagtttagaccaagacttc tccactagtttggcagaagtgcctatggtgaagagacagaggcagcaatgggctagggag ggtggcaagccaggggccacagctttaaagagcaaaacatggatcatgttccaagtccat gttgtggaaaagagagttatcttccaggctcaacttgagctggcagaggaccagctcaat atcttctccgggaatgaaagctgtgcctttgggaacaggggacagaccacttctgaggac aacctcagaggttggggtcttagaccctga >gi568815597r:67313210_67530300|GENSCAN_predicted_peptide_3|288_aa MGLQIRVQSQEARAAAAWGAKTSPWAALTQLSLPSPLHAVQMKSGENESGHQGAGGIEGP EGHLAAWGEGGGPLKLLVGIEAPRAGWVGMMYTCVFPYWLLKKPVFTCFHHQLRAYYNPF NFLVILVQGLKHPERQSLEMIVIPGLVPYDWVDEVSFTKIEITGKKVPEGGQKLVCGYVD EDIHLEPVELEVTVGHPACRCQCNWIKLICNEKQLAVVSCTYGAWLREAVPDSVALLRAT HLTAEANTSCGLIPESPQAVIAEKLSQEETDSQPKATRSGPLSLQALG >gi568815597r:67313210_67530300|GENSCAN_predicted_CDS_3|867_bp atgggcctgcagatccgagtgcagagccaggaagccagggcagcagctgcctggggtgca aagacatctccgtgggctgccctgacccagttgtcccttccctcacccctacatgctgtc cagatgaaatccggtgaaaatgagtcaggacaccaaggagcagggggaatagagggccca gagggacatttggcagcctggggagagggtggtgggcctctaaagctgttggtggggata gaggctccccgagcagggtgggttgggatgatgtacacctgtgttttcccttattggctg ctcaagaagccagtgtttacctgttttcatcatcaactcagggcttactataatcccttc aattttttagtcattcttgttcaaggtctcaagcatccggagaggcagtccttggaaatg attgtcattcctggcctggttccctatgactgggtggatgaagtgtcattcactaagata gagatcactggaaagaaagtaccagaaggagggcagaagctggtttgtggatatgtggat gaggacatccacctggaacctgttgagttagaggtgacagtgggacatccagcctgtcgc tgtcagtgcaattggatcaagctgatttgcaatgaaaagcagctggcagttgtgtcgtgt acctatggagcttggctgcgggaagcagttccagattccgtggctctgctcagagcaact catttgaccgcagaagcaaacacaagctgtggcctaatcccagagtccccacaagcagta attgcagagaagttgagccaagaggagactgactcgcagcccaaagctacccgctcaggt cccctgagcctgcaggctttgggctaa >gi568815597r:67313210_67530300|GENSCAN_predicted_peptide_4|227_aa MFWKKPNLSLCRFSQWRNFPSVRCRVMKAREPKMRWVRPIKENDSESPEQVKPGNHVLPS GSLTPWRGGAVHCHRDSNSESGVREAYVPTGAGSVGVEGFGGEHIQTANLLPAPNRVSVF FLARNEDFGAPKLVHFKNPGIQEQDLVFEVPPLEIAFPGAASFKLYKAYDIIWNQSGKVM NPSSRCICGRVKVRVELNGNKGGVTQVTCVASVGLANAEPALTMAAA >gi568815597r:67313210_67530300|GENSCAN_predicted_CDS_4|684_bp atgttttggaaaaaacccaacttaagcctttgcaggttttcacagtggaggaactttcca tcagtacgctgcagggtgatgaaggctagggagccaaaaatgagatgggtcagacccata aaagaaaatgattcagaaagtcctgagcaagtaaaacctggaaaccacgtgcttccatct ggtagcctcacaccgtggaggggaggggcagtccactgccaccgggactccaacagtgag agtggagtgagggaggcctatgtccccactggagccggctctgtgggggtggagggtttt ggaggagaacacattcaaacagcaaacttgctccctgcccccaaccgtgtttcagtattt ttcttggcaaggaatgaagacttcggagctccaaagctagtgcattttaaaaacccaggt atccaagaacaggacctcgtcttcgaggtaccaccactagagatagccttcccaggggct gccagctttaaactctacaaagcctatgacatcatttggaatcaaagtggaaaagtcatg aacccatcatccagatgtatttgcggaagagtcaaagttagagtcgagttaaatgggaat aaaggtggagtcacccaggtcacctgtgttgctagtgtaggtctggccaatgcagaacca gcactgacaatggcagctgcctga >gi568815597r:67313210_67530300|GENSCAN_predicted_peptide_5|95_aa MAREYCKEMVRCMWLWLEAELISGVNSGDQSCWPCKKTLPGGAVPAGDHFTNKCSASLTV GIMVLRTEKQPCASTQGEDNKDEDLYDDTLKLTEQ >gi568815597r:67313210_67530300|GENSCAN_predicted_CDS_5|288_bp atggccagggagtactgcaaggaaatggtgaggtgcatgtggctgtggctagaggcagaa ctgatcagtggggtaaacagtggtgaccagtcttgctggccctgcaaaaagacgctgcct ggtggagctgttcctgcaggtgaccacttcaccaacaagtgtagtgcttctctaacagtg ggcataatggtcttaagaacagaaaagcagccgtgtgcatctactcaaggtgaagacaac aaggatgaagacctttatgatgatacacttaaacttactgagcagtaa