GENSCAN 1.0    Date run: 7-Nov-116    Time: 02:43:53

Sequence gi568815595r:167584311_167820157 : 235847 bp : 36.42% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.00 Prom +    816    855   40                              -2.15
 1.01 Init +  10970  11854  885  1  0   86   42   213 0.145  10.96
 1.02 Term +  12261  12272   12  2  0   93   44     6 0.079  -6.07
 1.03 PlyA +  12561  12566    6                               1.05

 2.08 PlyA -  12675  12670    6                              -0.45
 2.07 Term -  12989  12859  131  0  2  125   54    65 0.029   4.16
 2.06 Intr -  20158  19991  168  2  0   43   74    82 0.011   1.30
 2.05 Intr -  25076  25026   51  0  0  115   47    39 0.047   0.46
 2.04 Intr -  36293  36119  175  2  1   39   67    84 0.247  -0.01
 2.03 Intr -  37333  37157  177  1  0   79  115   118 0.986  12.79
 2.02 Intr -  42982  42542  441  1  0   68   87   293 0.180  19.93
 2.01 Init -  69115  68951  165  1  0   65  107   118 0.702  11.08
 2.00 Prom -  72833  72794   40                              -5.95

 3.09 PlyA -  72959  72954    6                               1.05
 3.08 Term -  74016  73963   54  0  0  125   44    62 0.036   2.08
 3.07 Intr - 103383 103305   79  2  1  111   83    85 0.435   8.93
 3.06 Intr - 106526 106496   31  0  1   69   33    50 0.397  -6.13
 3.05 Intr - 110331 110213  119  2  2  111   64   151 0.965  14.19
 3.04 Intr - 111412 111286  127  2  1   48   82   121 0.995   6.32
 3.03 Intr - 112816 112699  118  1  1   73   92   112 0.875   9.22
 3.02 Intr - 120585 120532   54  0  0   63  100    60 0.049   2.96
 3.01 Init - 135841 135752   90  1  0   51  105    68 0.107   5.34
 3.00 Prom - 140210 140171   40                              -4.55

 4.05 PlyA - 143263 143258    6                               1.05
 4.04 Term - 150775 150636  140  2  2   62   46   138 0.646   4.14
 4.03 Intr - 151351 150847  505  1  1   99   94   305 0.613  23.82
 4.02 Intr - 181160 181032  129  2  0   44   76    84 0.028   2.77
 4.01 Init - 181807 181169  639  1  0   31  -11   296 0.047   9.38
 4.00 Prom - 181895 181856   40                             -10.55

 5.00 Prom + 182060 182099   40                              -8.65
 5.01 Init + 182563 182731  169  0  1   35   56   195 0.720  10.74
 5.02 Intr + 183556 183695  140  2  2    3   54    89 0.239  -3.74
 5.03 Intr + 183838 183905   68  0  2   55  116    96 0.295   5.88
 5.04 Term + 202861 202951   91  1  1   80   54    92 0.312   1.21
 5.05 PlyA + 203555 203560    6                               1.05

 6.00 Prom + 204101 204140   40                              -2.15
 6.01 Init + 204819 205068  250  2  1   60   95   279 0.886  21.47
 6.02 Intr + 206062 206292  231  2  0  121   45   195 0.993  15.32
 6.03 Intr + 208280 208474  195  0  0   80   95   108 0.992   9.16
 6.04 Intr + 210310 210514  205  2  1   95   80   208 0.983  18.04
 6.05 Intr + 221863 221946   84  1  0   77   10   143 0.306   3.42
 6.06 Intr + 222934 223031   98  1  2  119   99   102 0.679  13.13
 6.07 Intr + 229386 229501  116  1  2   75   92    25 0.021   0.85

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 126802 127033  232  1  1   43   38   237 0.877   9.16
S.002 Sngl - 181807 181133  675  1  0   31   42   291 0.915  14.73

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595r:167584311_167820157|GENSCAN_predicted_peptide_1|298_aa
MAILPKIIYRFNAIPIKLPMTFLTELEKTTLKFIWNQKRARIAKSVLSQKNKSGGITLPD
FKLYYKATVTKTTWYWYQNRDIDQWNRTKPSEITLHIYNYLIFDKPEKNKQWGKDSLFNK
WCWGNWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGITIQDIGMGKDF
MSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTKWEKIFATYSSDKGLISRIY
NELKQIYKKKTNNPIKKWAKDMNRHFSKEDIYAAKKHMKKCSPSLAIREMQIKTTGHG

>gi568815595r:167584311_167820157|GENSCAN_predicted_CDS_1|897_bp
atggccatactgcccaagataatttacagattcaatgccatccccatcaagctaccaatg
actttcctcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcc
cgcattgccaagtcagtcctaagccaaaagaacaaatctggaggcatcacactacctgac
ttcaaactatactacaaggctacagtaaccaaaacaacatggtactggtaccaaaacaga
gatatagatcaatggaacagaacaaagccctcagaaataacattgcatatctacaactat
ctgatctttgacaaacctgagaaaaacaagcaatggggaaaggattccctatttaataaa
tggtgctggggaaattggctagccatatgtagaaagctgaaactggatcccttccttaca
ccttatacaaaaatcaattcaagatggattaaagacttaaacgtcagacctaaaaccata
aaaaccctagaagaaaacctaggcattaccattcaggacataggcatgggcaaggacttc
atgtctaaaacaccaaaagcaatggcaacaaaagccaagattgacaaatgggatctaatt
aaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacctaca
aaatgggagaaaattttcgcaacctactcatctgacaaagggctaatatccagaatctac
aatgaactcaaacaaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggcgaag
gacatgaacagacacttctcaaaagaagacatttatgcagccaaaaaacacatgaaaaaa
tgctcaccatcactggccatcagagaaatgcaaatcaaaaccacaggacatggatga

>gi568815595r:167584311_167820157|GENSCAN_predicted_peptide_2|435_aa
MSCQKAVLELNIGSQLGPKSPERTEGVTAFEDYGTGLLENQLSVGDFVKIQKAFESPQPR
KIICMSREDFTQKMTEIVGWGTKEEYGELFDKVDVAQDGFINWDKLTSFILLELYEQDER
AKATVVPQWKDLEFLPVKHKDTIQKVIFLKNSSHYLTISKEGLLAIWGEHLKLQETFPIT
SDATKLKHLWVTSLVSLENVNKIAVAFTSKEVCFYDLLSKEEFACQYKLQGLKGTPICMD
YWYDPLDANESILSFGDITGKVQAIAFTAALISLFERPASACEDGEATMTINWAELLSGC
HKCCHILEHKLHQGDWVRQVPREALSTTQLLVGVEKVTYNASLDAIISSTTSNTNSVVMA
WREKSKKRLNMTSFNIAQGIHAFDYHSRLNLIGSTPCGSCQCLELAPFEVTALAVPWPLL
AKAGAGAAGMQGTIP

>gi568815595r:167584311_167820157|GENSCAN_predicted_CDS_2|1308_bp
atgagttgccagaaagctgtacttgagttaaacatagggtcacagcttgggccaaagagt
cctgagagaacagaaggtgtaactgcatttgaagactatggcacaggcctgcttgaaaac
caactcagcgtgggtgactttgtaaaaatacagaaggcctttgagtccccacagccaagg
aaaatcatttgtatgtccagagaagacttcacgcagaagatgacagagattgttggttgg
ggcacgaaggaagaatatggggagctctttgacaaagtggatgtggcccaagatggcttc
attaattgggacaagctgacttcatttatactgctagagctttatgagcaagatgaacga
gcaaaggcaactgtggtgccccagtggaaggaccttgaattcctcccagtaaaacacaag
gacaccattcaaaaagtaattttcttaaaaaattcaagtcattatctgacaattagtaaa
gaaggtttattggcaatctggggagagcatttaaagctgcaagaaacattccccatcact
tcagatgccaccaagctcaaacacctgtgggtgacaagtctggtttctctggaaaatgta
aacaagatagcagtggcttttacaagtaaagaggtttgtttctatgatctgctgtccaaa
gaagaatttgcttgccaatacaaactccaaggcctgaaaggaacaccaatttgcatggat
tattggtatgatcctcttgatgccaatgaatcaattctttcttttggggatataactgga
aaggttcaagcaattgctttcaccgcagccttgatttccctgtttgaacggcctgctagt
gcatgtgaagatggagaagccactatgaccattaactgggcagagctgctctctggatgt
cacaaatgttgccatatattagagcataaacttcatcaaggagattgggtcaggcaagtt
cccagagaagctctcagcaccacgcagttgctggtaggggtggagaaggttacttacaat
gcatctttagacgctatcatttccagtacaaccagcaatacaaatagtgtggtgatggct
tggagagagaaatcaaaaaagcgtcttaatatgacatccttcaacattgcccagggcatt
catgcttttgattatcactctcggctcaatttaattggctcaacaccatgtggaagctgc
caatgcttggagcttgcaccctttgaagtaacagccctagctgtaccttggcccctttta
gctaaggctggagctggagcagctgggatgcagggtaccattccctga

>gi568815595r:167584311_167820157|GENSCAN_predicted_peptide_3|223_aa
MTMEEMKNEAETTSMVSMPLYAVMYPVFNELERVNLSAAQTLRAAFIKAEKENPGLTQDI
IMKILEKKSVEVNFTESLLRMAADDVEEYMIERPEPEFQDLNEKARALKQILSKIPDEIN
DRVRFLQTIKMPKRTVSSADGTVREDPKRRLMWLSAKPALSVVEIEAPKVCASENEGNDL
DIASAIKELLDTVNNVFKKYQYQNRRNSASPSDSMPPIPRTLA

>gi568815595r:167584311_167820157|GENSCAN_predicted_CDS_3|672_bp
atgacaatggaagagatgaagaatgaagctgagaccacatccatggtttctatgcccctc
tatgcagtcatgtatcctgtgtttaatgagctagaacgagtaaatctgtctgcagcccag
acactgagagccgctttcatcaaggctgaaaaagaaaatccaggtctcacacaagacatc
attatgaaaattttagagaaaaaaagcgtggaagttaacttcacggagtcccttcttcgt
atggcagctgatgatgtagaagagtatatgattgaacgaccagagccagaattccaagac
ctaaacgaaaaggcacgagcacttaaacaaattctcagtaagatcccagatgagatcaat
gacagagtgaggtttctgcagacaatcaagatgcccaagaggacggtcagctcagctgat
gggacagtgagggaagatcccaaaaggagattgatgtggttgtcagctaaacctgctctt
tcagtagtggaaattgaagccccaaaagtttgtgcctctgaaaatgagggtaatgacttg
gatatagctagtgcaataaaagaacttcttgatacagtgaataatgtcttcaagaaatat
caataccagaaccgcaggaattctgcttctccttctgattcaatgccgcctattcccagg
acccttgcttga

>gi568815595r:167584311_167820157|GENSCAN_predicted_peptide_4|470_aa
MWKRLWNWVTGRGWKSLEGSEEDREVWEILELPRDLLNCFDKNADSDMNNKVQAGVVSDG
DELIGNCSKGDSCYVLAKRLAAFCPCPRDLWNFELDRDDLGYLAEEISKQQSIQKLTWVL
LKALSFIREVEHKSSENLQPDNAIEKKILFFEEKFKPAEEICISNKELNVIPKTIGKMSP
GHVRDFCGSSSHYRPRGLGGKSGFMGWAQEPRAPGDLVHCVLATPAVAERGQCRAWAVAS
EGASFKPWQLPYGVEPLANLAAVAPPPAVSAAARTGGSGKQLPWYLSAPCTSAAALPLPL
PWGSLSPPLWLTPGAASCPTQASLLGAPAQPILPRPVCPDARCAWGLAALREAAPLPSAS
PHTKPSLHLGDFPEASSPPTRHTQPDSPRIPTTGPSPPSFLCSVSPPPQWVRRLRGLLRT
LIGAGREFSYYWLEVSPPIASGSPAWPAGKCSLPIAGIADNAGERDYESP

>gi568815595r:167584311_167820157|GENSCAN_predicted_CDS_4|1413_bp
atgtggaagcgactttggaactgggtaacaggcagaggttggaaaagtttggagggctca
gaagaagacagggaagtgtgggaaattttggagcttcctagagacttgttgaattgcttt
gacaaaaatgctgatagtgatatgaacaataaggtccaggctggggtggtctcagatgga
gatgagcttattgggaactgcagcaaaggtgactcttgttatgttttagcaaagagactg
gcagcattttgcccctgccctagagatctgtggaactttgaacttgacagagatgattta
gggtatctggcagaggaaatttctaagcagcaaagcattcaaaaattgacttgggtgctg
ttaaaggcattgagttttataagagaagtagagcataaaagttcagaaaatttgcagcct
gacaatgctatagaaaagaaaattctattttttgaggagaaattcaagccagctgaagaa
atctgcataagtaacaaggagctgaatgttatccccaagacaatagggaaaatgtctcca
ggacatgtcagagacttctgtggcagctcctcccattacaggcccagaggcctaggagga
aaaagtggtttcatgggctgggcccaggaaccccgtgctcctggggacttggtgcactgc
gtcctagccactccagctgtggctgaaaggggacaatgtagagcttgggctgtggcttca
gagggtgcaagcttcaagccttggcagctaccatatggtgttgagcctttggcgaatttg
gcagctgttgctcctccccctgctgtttctgcagctgcccggaccggaggcagcggaaag
cagctgccctggtacctttccgctccctgcacttctgcagctgctctgccacttccactc
ccgtgggggtccctctcccctcctctgtggctcacacctggagcggcctcctgccccacc
caggcgtctctgctgggagccccggctcagcctatccttccccgacctgtgtgtccggat
gcccggtgtgcctggggcttagctgctctcagggaagctgcccctctgcccagtgcaagt
ccccacaccaagccctcccttcacctgggcgatttccccgaggcttcctctccacccaca
cggcatactcagcccgacagccccaggatccccaccacaggcccctcacccccttccttc
ctctgctctgtcagcccgccccctcaatgggtgaggcgcctgcggggcctactccgcacc
ctgattggcgcaggcagggaattctcctactattggctagaagtgtcaccacccatcgcc
tcaggatcccccgcatggcctgccgggaagtgtagtcttccgattgcaggaattgcggac
aacgccggcgagcgggactacgagtccccataa

>gi568815595r:167584311_167820157|GENSCAN_predicted_peptide_5|155_aa
MRFKKGSHLRNIKAYGEAASADGEAAASYPEDLAKMTDEGGYTKQHIFTVDETPLYWMRH
CFLWMSKGSVFLEVESTGEDAVNIVEITRKNLDYYKNSVDKAMILLQPPKPSATASLISE
QPSTSRWSSGAITPMELSSPVITMPSSGIPPEGSA

>gi568815595r:167584311_167820157|GENSCAN_predicted_CDS_5|468_bp
atgaggtttaagaaaggaagccatctccgtaacataaaagcgtatggagaagcagcaagt
gctgatggagaagctgcagcaagttatccagaagatctagctaagatgactgatgaaggt
ggctacactaaacaacatattttcactgtcgatgaaacacccttatactggatgaggcat
tgcttcttatggatgagcaaaggaagtgtttttttggaggtggaatctactggtgaagat
gctgtgaacattgttgaaattacaagaaagaatttagactattacaaaaactcagttgat
aaagcaatgatattgctacaaccacccaaaccttcagcaaccgccagccttatcagtgag
cagccatcaacatcaagatggtcttcaggggcaataacacccatggagctgtcatctcct
gtgataacaatgccttcttctggaatacctcctgaaggatctgcctga

>gi568815595r:167584311_167820157|GENSCAN_predicted_peptide_6|393_aa
MAFLGLFSLLVLQSMATGATFPEEAIADLSVNMYNRLRATGEDENILFSPLSIALAMGMM
ELGAQGSTQKEIRHSMGYDSLKNGEEFSFLKEFSNMVTAKESQYVMKIANSLFVQNGFHV
NEEFLQMMKKYFNAAVNHVDFSQNVAVANYINKWVENNTNNLVKDLVSPRDFDAATYLAL
INAVYFKGNWKSQFRPENTRTFSFTKDDESEVQIPMMYQQGEFYYGEFSDGSNEAGGIYQ
VLEIPYEGDEISMMLVLSRQEVPLATLEPLVKAQLVEEWANSVKKQKVEVYLPRDMDEAG
NDHPQQTNTGAENQAPDVLTHKFTVEQEIDLKDVLKALGITEIFIKDANLTGLSDYLFTY
LSLSSELFKSIDIFLSPEHRREFVVGDRVDTEW

>gi568815595r:167584311_167820157|GENSCAN_predicted_CDS_6|1179_bp
atggctttccttggactcttctctttgctggttctgcaaagtatggctacaggggccact
ttccctgaggaagccattgctgacttgtcagtgaatatgtataatcgtcttagagccact
ggtgaagatgaaaatattctcttctctccattgagtattgctcttgcaatgggaatgatg
gaacttggggcccaaggatctacccagaaagaaatccgccactcaatgggatatgacagc
ctaaaaaatggtgaagaattttctttcttgaaggagttttcaaacatggtaactgctaaa
gagagccaatatgtgatgaaaattgccaattccttgtttgtgcaaaatggatttcatgtc
aatgaggagtttttgcaaatgatgaaaaaatattttaatgcagcagtaaatcatgtggac
ttcagtcaaaatgtagccgtggccaactacatcaataagtgggtggagaataacacaaac
aatctggtgaaagatttggtatccccaagggattttgatgctgccacttatctggccctc
attaatgctgtctatttcaaggggaactggaagtcgcagtttaggcctgaaaatactaga
accttttctttcactaaagatgatgaaagtgaagtccaaattccaatgatgtatcagcaa
ggagaattttattatggggaatttagtgatggctccaatgaagctggtggtatctaccaa
gtcctagaaataccatatgaaggagatgaaataagcatgatgctggtgctgtccagacag
gaagttcctcttgctactctggagccattagtcaaagcacagctggttgaagaatgggca
aactctgtgaagaagcaaaaagtagaagtatacctgcccagggacatggatgaagctgga
aacgatcatcctcagcaaactaacacaggagcagaaaaccaagcaccggatgttctcact
cataagttcacagtggaacaggaaattgatttaaaagatgttttgaaggctcttggaata
actgaaattttcatcaaagatgcaaatttgacaggcctctctgactatttgtttacatac
ttgtctctgagctcagagctattcaaaagcatagacatctttctgtccccagaacacaga
agagaatttgtggtgggggacagggtagataccgagtgg