GENSCAN 1.0 Date run: 8-Nov-116 Time: 05:04:16 Sequence gi568815586r:32647207_32855874 : 208668 bp : 40.59% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 7692 7764 73 2 1 52 47 127 0.219 6.28 1.02 Intr + 12872 12924 53 0 2 75 115 32 0.223 2.31 1.03 Intr + 31996 32259 264 0 0 105 94 169 0.165 15.99 1.04 Intr + 54209 54356 148 1 1 91 54 112 0.924 6.99 1.05 Intr + 60161 60207 47 0 2 93 84 30 0.989 0.31 1.06 Intr + 60947 61018 72 1 0 21 102 139 0.990 7.28 1.07 Intr + 63723 63809 87 2 0 68 116 55 0.951 5.55 1.08 Intr + 66003 66165 163 2 1 73 91 113 0.994 8.73 1.09 Intr + 71437 71557 121 2 1 86 71 145 0.999 11.23 1.10 Intr + 73458 73589 132 0 0 67 91 116 0.995 8.64 1.11 Term + 75221 75431 211 2 1 102 42 162 0.955 8.78 1.12 PlyA + 75802 75807 6 1.05 2.03 PlyA - 75964 75959 6 1.05 2.02 Term - 80141 79507 635 0 2 -16 50 611 0.250 40.36 2.01 Init - 80791 80632 160 1 1 68 66 58 0.594 1.63 2.00 Prom - 81241 81202 40 -11.04 3.00 Prom + 82708 82747 40 -8.95 3.01 Init + 83719 83928 210 0 0 48 74 -1 0.476 -6.67 3.02 Intr + 84150 84305 156 2 0 47 97 127 0.923 8.79 3.03 Intr + 84648 84737 90 2 0 57 89 74 0.922 3.67 3.04 Intr + 86509 86601 93 0 0 84 84 63 0.957 4.74 3.05 Intr + 89899 89955 57 0 0 102 93 30 0.873 3.06 3.06 Intr + 92858 93034 177 1 0 38 91 188 0.927 13.29 3.07 Intr + 93203 93312 110 1 2 63 92 127 0.999 8.76 3.08 Intr + 95383 95542 160 1 1 81 67 130 0.947 9.27 3.09 Term + 96148 96204 57 0 0 81 49 66 0.901 -1.29 3.10 PlyA + 96387 96392 6 1.05 4.06 PlyA - 97180 97175 6 1.05 4.05 Term - 99673 99658 16 0 1 106 48 14 0.523 -4.07 4.04 Intr - 102901 102731 171 0 0 81 90 124 0.709 10.04 4.03 Intr - 103668 103513 156 2 0 83 62 256 0.997 20.70 4.02 Intr - 106879 106712 168 0 0 57 113 137 0.997 11.24 4.01 Init - 108668 107890 779 2 2 104 108 612 0.999 59.00 4.00 Prom - 111716 111677 40 -5.45 5.12 PlyA - 111889 111884 6 1.05 5.11 Term - 126139 125951 189 1 0 29 49 155 0.503 2.17 5.10 Intr - 145525 145438 88 0 1 112 116 74 0.996 11.65 5.09 Intr - 149092 148903 190 2 1 66 85 139 0.931 9.12 5.08 Intr - 155350 155197 154 1 1 22 97 192 0.471 12.22 5.07 Intr - 158842 158713 130 0 1 63 53 115 0.182 5.28 5.06 Intr - 162546 162452 95 0 2 93 86 67 0.244 4.84 5.05 Intr - 174323 174150 174 2 0 47 81 232 0.969 17.61 5.04 Intr - 175425 175261 165 0 0 87 61 79 0.497 4.34 5.03 Intr - 176956 176839 118 0 1 79 86 132 0.217 11.65 5.02 Intr - 193999 193822 178 2 1 107 90 161 0.916 16.26 5.01 Intr - 203767 203560 208 1 1 71 72 229 0.696 17.33 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:32647207_32855874|GENSCAN_predicted_peptide_1|456_aa MVNIETLTTNADKDVEQLELSLMVAHPPAVHPAPQASLRGSQRMACREGAGSRRARSNGC RLPRRGRRRGGRRRTVGPGPIHCRGRRALGPRVFRVMEALIPVINKLQDVFNTVGADIIQ LPQIVVVGTQSSGKSSVLESLVGRDLLPRGTGIVTRRPLILQLVHVSQEDKRKTTGEENG VEAEEWGKFLHTKNKLYTDFDEIRQEIENETERISGNNKGVSPEPIHLKIFSPNVVNLTL VDLPGMTKVPVGDQPKDIELQIRELILRFISNPNSIILAVTAANTDMATSEALKISREVD PDGRRTLAVITKLDLMDAGTDAMDVLMGRVIPVKLGIIGVVNRSQLDINNKKSVTDSIRD EYAFLQKKYPSLANRNGTKYLARTLNRLLMHHIRDCLPELKTRINVLAAQYQSLLNSYGE PVDDKSATLLQLITKFATEYCNTIEGTAKYIETSEL >gi568815586r:32647207_32855874|GENSCAN_predicted_CDS_1|1371_bp atggtcaacatcgagacactgaccacaaatgctgacaaggatgtggagcaactggaactc tcgctcatggtagcacaccctcctgctgtccacccagctccacaggcttcactgagaggc tctcagcgcatggcctgccgggagggggcaggtagccggcgggcccggtccaatgggtgc cggcttccgaggagagggcggaggagaggaggaaggaggcgaactgtgggccccggcccc attcattgccgtggccggcgggcactggggccccgtgttttcagagtcatggaggcgcta attcctgtcataaacaagctccaggacgtcttcaacacggtgggcgccgacatcatccag ctgcctcaaatcgtcgtagtgggaacgcagagcagcggaaagagctcagtgctagaaagc ctggtggggagggacctgcttcccagaggtactggaattgtcacccggagacctctcatt ctgcaactggtccatgtttcacaagaagataaacggaaaacaacaggagaagaaaatggg gtggaagcagaagaatggggtaaatttcttcacaccaaaaataagctttacacggatttt gatgaaattcgacaagaaattgaaaatgaaacagaaagaatttcaggaaataataaggga gtaagccctgaaccaattcatcttaagattttttcacccaacgttgtcaatttgacactt gtggatttgccaggaatgaccaaggtgcctgtaggtgatcaacctaaggatattgagctt caaatcagagagctcattcttcggttcatcagtaatcctaattccattatcctcgctgtc actgctgctaatacagatatggcaacatcagaggcacttaaaatttcaagagaggtagat ccagatggtcgcagaaccctagctgtaatcactaaacttgatctcatggatgcgggtact gatgccatggatgtattgatgggaagggttattccagtcaaacttggaataattggagta gttaacaggagccagctagatattaacaacaagaagagtgtaactgattcaatccgtgat gagtatgcttttcttcaaaagaaatatccatctctggccaatagaaatggaacaaagtat cttgctaggactctaaacaggttactgatgcatcacatcagagattgtttaccagagttg aaaacaagaataaatgttctagctgctcagtatcagtctcttctaaatagctacggtgaa cccgtggatgataaaagtgctactttactccaacttattaccaaatttgccacagaatat tgtaacactattgaaggaactgcaaaatatattgaaacttcggagctgtaa >gi568815586r:32647207_32855874|GENSCAN_predicted_peptide_2|264_aa MDVLAGSKTVQYANNMQYVMKDSKDVISFNLINLLLQIYPQCLLGHKAPKTDAARVVKRG VNALKNLQVKGAQIEAKFYEEVHDLERKYAVLYQPLFDKQSEITNTIYEPTEEECEWKPD EEDEISEELKEKAKTEDEKNDEEKEDPKGIPEFWLPVFKNVDLLSDMLQKHDEPILKHVK DTKVKFSDAGLPMSFVLEFQFEPSKYFTNEVLTKTYRMRLEPEPDDSDHFSFDGQEIMGC TGGQIDWKKGKNVILKTIKQQQKH >gi568815586r:32647207_32855874|GENSCAN_predicted_CDS_2|795_bp atggatgttcttgctggtagtaaaacagtacaatatgctaacaatatgcaatatgtaatg aaagattctaaggatgttatatcctttaatctaataaacttacttctacaaatttatcct cagtgtttacttggacacaaagccccaaagacagatgcagctagggtagttaaaagagga gtgaatgctctcaaaaacctgcaagttaaaggtgcacagatagaagccaaattctatgag gaagttcacgatcttgaaaggaagtatgctgttctctatcagcctctatttgataagcaa tctgagattactaatacaatttatgaacctacagaggaagaatgtgaatggaaaccagat gaagaagatgagatttcagaggagttgaaagaaaaggccaagactgaagatgagaaaaac gatgaagaaaaagaagaccccaaaggaattcctgaattttggttacctgtttttaagaat gttgacttgctcagtgatatgcttcagaaacatgatgaacctattctgaagcatgtgaaa gataccaaagtgaagttctcagatgctggcctgcctatgagttttgtcttagaatttcaa tttgaacccagtaaatattttacaaatgaagtgctgacaaagacatataggatgaggtta gaaccagaaccagatgattctgatcacttttcctttgatggacaagaaattatgggttgt acagggggccagatagattggaaaaaaggaaagaatgtcattttgaaaaccattaagcag cagcagaaacactag >gi568815586r:32647207_32855874|GENSCAN_predicted_peptide_3|369_aa MSLKLLERLLTCIFFPFLQCQKPYTSLPFRCGGARICYIFHETFGRTLESVDPLGGLNTI DILTAIRNATGPRPALFVPEVSFELLVKRQIKRLEEPSLRCVELVHEEMQRIIQHCSNYS TQELLRFPKLHDAIVEVVTCLLRKRLPVTNEMVHNLVAIELAYINTKHPDFADACGLMNN NIEEQRRNRLARELPSAVSRDKVASGGGGVGDGVQEPTTGNWRGMLKTSKAEELLAEEKS KPIPIMPASPQKGHAVNLLDVPVPVARKLSAREQRDCEVIERLIKSYFLIVRKNIQDSVP KAVMHFLVNHVKDTLQSELVGQLYKSSLLDDLLTESEDMAQRRKEAADMLKALQGASQII AEIRETHLW >gi568815586r:32647207_32855874|GENSCAN_predicted_CDS_3|1110_bp atgagcttaaagcttctagaaagacttctaacttgtatcttctttccctttttgcaatgc cagaaaccatatacttcattgcctttcagatgcggtggtgctagaatttgttatattttc catgagacttttgggcgaaccttagaatctgttgatccacttggtggccttaacactatt gacattttgactgccattagaaatgctactggtcctcgtcctgctttatttgtgcctgag gtttcatttgagttactggtgaagcggcaaatcaaacgtctagaagagcccagcctccgc tgtgtggaactggttcatgaggaaatgcaaaggatcattcagcactgtagcaattacagt acacaggaattgttacgatttcctaaacttcatgatgccatagttgaagtggtgacttgt cttcttcgtaaaaggttgcctgttacaaatgaaatggtccataacttagtggcaattgaa ctggcttatatcaacacaaaacatccagactttgctgatgcttgtgggctaatgaacaat aatatagaggaacaaaggagaaacaggctagccagagaattaccttcagctgtatcacga gacaaggttgcatctggaggtggtggggttggagatggtgttcaagaaccaaccacaggc aactggagaggaatgctgaaaacttcaaaagctgaagagttattagcagaagaaaaatca aaacccattccaattatgccagccagtccacaaaaaggtcatgccgtgaacctgctagat gtgccagttcctgttgcacgaaaactatctgctcgggaacagcgagattgtgaggttatt gaacgactcattaaatcatattttctcattgtcagaaagaatattcaagacagtgtgcca aaggcagtaatgcattttttggttaatcatgtgaaagacactcttcagagtgagctagta ggccagctgtataaatcatccttattggatgatcttctgacagaatctgaggacatggca cagcgcaggaaagaagcagctgatatgctaaaggcattacaaggagccagtcaaattatt gctgaaatccgggagactcatctttggtga >gi568815586r:32647207_32855874|GENSCAN_predicted_peptide_4|429_aa MAAPILRSFSWGRWSGTLNLSVLLPLGLRKAHSGAQGLLAAQKARGLFKDFFPETGTKIE LPELFDRGTASFPQTIYCGFDPTADSLHVGHLLALLGLFHLQRAGHNVIALVGGATARLG DPSGRTKEREALETERVRANARALRLGLEALAANHQQLFTDGRSWGSFTVLDNSAWYQKQ HLVDFLAAVGGHFRMGTLLSRQSVQLRLKSPEGMSLAEFFYQVLQAYDFYYLFQRYGCRV QLGGSDQLGNIMSGYEFINKLTGEDVFGITVPLITSTTGAKLGKSAGNAVWLNRDKTSPF ELYQFFVRQPDDSVERYLKLFTFLPLPEIDHIMQLHVKEPERRGPQKRLAAEVTKLVHGR EGLDSAKRCTQALYHSSIDALEVMSDQELKELFKEAPFSEFFLDPGTSVLDTCRKANAIP DGPRGDEEQ >gi568815586r:32647207_32855874|GENSCAN_predicted_CDS_4|1290_bp atggcggcgcccatcttgcggtccttttcctggggccggtggtctggtaccctaaatctc tcagtattgttgcccttggggctgcgtaaggcccactcgggcgctcaggggttactggca gcgcagaaggctcgaggtctgttcaaggacttcttcccggagacggggacgaaaatagag ctcccagagctcttcgaccgtggcacggcgagttttccccaaaccatttactgtggcttc gaccccacggcagactcgcttcatgtgggtcatctacttgcgctgctgggcctgtttcat ttgcagcgagcgggccacaacgtgatcgcgctggtgggaggcgccacggcgcgcctggga gacccgagcggccgtaccaaggaacgcgaggcgctggagacagagcgcgtgcgagccaac gcgcgagctctgcgcctagggcttgaggccctggcggctaatcaccagcagcttttcact gatgggcgctcctggggcagcttcactgtgctggacaactcggcctggtaccagaagcag cacctggtggacttcctggcggcagtggggggtcacttccgcatggggacgctgctgagc cggcagagcgtgcagctgcggctcaagagccccgagggcatgagcttggccgagttcttt taccaggtgctccaggcctatgacttctattacctcttccagcgttatggatgcagggtc cagctgggcggatctgatcaactaggcaacatcatgtccggatatgagttcatcaacaag ttgactggagaagatgtatttggaatcaccgttcctctaattacaagtacaactggagca aagctgggaaagtctgctggcaacgctgtttggctaaacagagataagacatctccattt gaattgtatcaattctttgtcaggcaaccggacgattcagtggaaaggtacctgaagctg ttcactttcctgccccttccagagattgatcatatcatgcagctgcatgtcaaagagcca gaaaggcggggtcctcagaaacgactggcagcagaagtaacaaagcttgttcatggacga gaaggattggattctgctaaaaggtgtacacaagccctttatcacagtagcatagatgca ctggaggtcatgtctgatcaggagttaaaagagttgtttaaagaagctccattttctgaa ttttttctcgatcctggaacaagtgtcctagatacttgccgcaaagcaaatgccattcca gatggtccccgaggtgatgaggaacaatga >gi568815586r:32647207_32855874|GENSCAN_predicted_peptide_5|562_aa VNQLRGILKLLQLLKVQNEDVQRAVCGALRNLVFEDNDNKLEVAELNGVPRLLQVLKQTR DLETKKQITGLLWNLSSNDKLKNLMITEALLTLTENIIIPFSGWPEGDYPKANGLLDFDI FYNVTGCLRNMSSAGADGRKAMRRCDGLIDSLVHYVRGTIADYQPDDKATENCVCILHNL SYQLEAELPEKYSQNIYIQNRNIQTDNNKSIGCFGSRSRKVKEQYQDVPMPEEKSNPKGV EWLWHSIVIRMYLSLIAKSVRNYTQEASLGALQNLTAGSGPIFATCGSGDPLMSPRYQGL GSNKQSCAESQHRMQGWFNIRKSINVIHHINRTTDKNHMISIDAEKAFDKIRHRFMMPTS VAQTVVQKESGLQHTRKMLHVGDPSVKKTAISLLRNLSRNLSLQNEIAKETLPDLVSIIP DTVPSTDLLIETTASACYTLNNIIQNSYQNARDLLNTGGIQKIMAISAGDAYASNKASKA ASVLLYSLWAHTELHHAYKKPISYPHYTAPPIQRDSHPNPLRPVKLQSRTRASRVGMDQL SEKGDMWLAGLSPAAQRSKCKC >gi568815586r:32647207_32855874|GENSCAN_predicted_CDS_5|1689_bp gttaaccagcttcgtggcatcctcaagcttctgcagctcctaaaagttcagaatgaagac gttcagcgagctgtgtgtggggccttgagaaacttagtatttgaagacaatgacaacaaa ttggaggtggctgaactaaatggggtacctcggctgctccaggtgctgaagcaaaccaga gacttggagactaaaaaacaaataacaggtttgctgtggaatttgtcatctaatgacaaa ctcaagaatctcatgataacagaagcattgcttacgctgacggagaatatcatcatcccc ttttctgggtggcctgaaggagactacccaaaagcaaatggtttgctcgattttgacata ttctacaacgtcactggatgcctaagaaacatgagttctgctggcgctgatgggagaaaa gcgatgagaagatgtgacggactcattgactcactggtccattatgtcagaggaaccatt gcagattaccagccagatgacaaggccacggagaattgtgtgtgcattcttcataacctc tcctaccagctggaggcagagctcccagagaaatattcccagaatatctatattcaaaac cggaatatccagactgacaacaacaaaagtattggatgttttggcagtcgaagcaggaaa gtaaaagagcaataccaggacgtgccgatgccggaggaaaagagcaaccccaagggcgtg gagtggctgtggcattccattgttataaggatgtatctgtccttgatcgccaaaagtgtc cgcaactacacacaagaagcatccttaggagctctgcagaacctcacggccggaagtgga ccaatctttgcaacttgtggatcaggagatcccctcatgagcccacgctaccagggcctt gggtccaataaacagagctgtgcggagtctcagcacaggatgcaaggttggttcaacatt cgcaaatcaataaacgtgattcatcacataaacagaactacagacaaaaaccacatgatc tcaatagatgcagaaaaggcctttgataaaattcgacatcgcttcatgatgccgacatca gtggctcagacagttgtccagaaggaaagtggcctgcagcacacccgaaagatgctgcat gttggtgacccaagtgtgaaaaagacagccatctcgctgctgaggaatctgtcccggaat ctttctctgcagaatgaaattgccaaagaaactctccctgatttggtttccatcattcct gacacagtcccgagtactgaccttctcattgaaactacagcctctgcctgttacacattg aacaacataatccaaaacagttaccagaatgcacgcgaccttctaaacaccgggggcatc cagaaaattatggccattagtgcaggcgatgcctatgcctccaacaaagcaagtaaagct gcttccgtccttctgtattctctgtgggcacacacggaactgcatcatgcctacaagaag cccatttcttacccgcactatacagctcccccgattcagagggactctcatccgaaccct ctcaggccagtcaagctgcagagcaggacaagggcttcaagggttggcatggaccagctc tcagagaaaggagacatgtggttagcaggactttcaccagcagcgcagagatcaaaatgc aagtgttga