GENSCAN 1.0 Date run: 5-Nov-116 Time: 10:11:35 Sequence gi568815575r:107614522_107875419 : 260898 bp : 44.32% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 14108 14229 122 1 2 94 95 174 0.999 18.36 1.02 Intr + 24774 24957 184 0 1 85 97 169 0.999 17.29 1.03 Intr + 26381 26513 133 1 1 83 81 29 0.807 1.92 1.04 Intr + 30673 30829 157 2 1 40 108 288 0.576 25.07 1.05 Intr + 33085 33244 160 1 1 70 94 106 0.933 9.39 1.06 Term + 35419 35511 93 0 0 145 38 27 0.952 1.23 1.07 PlyA + 35538 35543 6 1.05 2.00 Prom + 40150 40189 40 -5.76 2.01 Init + 44476 44554 79 0 1 73 110 28 0.964 4.72 2.02 Term + 45349 45497 149 2 2 124 38 68 0.932 3.46 2.03 PlyA + 45693 45698 6 1.05 3.18 PlyA - 47656 47651 6 1.05 3.17 Term - 59321 59179 143 0 2 83 55 79 0.395 2.19 3.16 Intr - 62487 62313 175 0 1 25 105 77 0.096 2.61 3.15 Intr - 79665 79502 164 1 2 50 89 31 0.018 -0.91 3.14 Intr - 96891 96743 149 2 2 64 48 84 0.237 1.88 3.13 Intr - 100228 100002 227 1 2 67 78 300 0.740 23.68 3.12 Intr - 101429 101378 52 1 1 86 101 83 0.975 8.31 3.11 Intr - 102408 102169 240 2 0 60 91 166 0.021 10.76 3.10 Intr - 109427 109250 178 0 1 97 25 77 0.007 1.28 3.09 Intr - 129430 129374 57 2 0 98 81 51 0.602 4.26 3.08 Intr - 140993 140935 59 1 2 98 18 77 0.079 0.23 3.07 Intr - 145172 144975 198 1 0 -30 80 203 0.036 6.17 3.06 Intr - 146107 146044 64 2 1 107 62 61 0.025 3.18 3.05 Intr - 146789 146692 98 1 2 96 47 16 0.014 -2.05 3.04 Intr - 161004 160579 426 2 0 109 94 691 0.817 64.71 3.03 Intr - 161258 161048 211 0 1 143 34 34 0.838 1.47 3.02 Intr - 161947 161823 125 0 2 82 78 90 0.663 7.63 3.01 Init - 162779 162751 29 2 2 48 101 15 0.318 -4.03 3.00 Prom - 167656 167617 40 -3.86 4.00 Prom + 167854 167893 40 -4.16 4.01 Init + 171299 171337 39 1 0 75 97 24 0.663 0.96 4.02 Intr + 179628 180092 465 2 0 80 110 219 0.668 16.62 4.03 Intr + 199234 199360 127 0 1 84 70 33 0.011 1.35 4.04 Intr + 202280 202306 27 0 0 63 84 51 0.032 0.29 4.05 Intr + 208596 208624 29 1 2 124 85 4 0.136 1.53 4.06 Intr + 211742 211994 253 1 1 -2 55 159 0.015 0.61 4.07 Intr + 216753 216906 154 1 1 88 20 119 0.012 4.23 4.08 Intr + 226288 226864 577 1 1 86 116 533 0.142 48.62 4.09 Intr + 240088 240183 96 0 0 32 87 138 0.214 8.31 4.10 Term + 241583 241606 24 1 0 120 42 21 0.828 -1.08 4.11 PlyA + 241879 241884 6 1.05 5.06 PlyA - 241982 241977 6 1.05 5.05 Term - 248151 248057 95 1 2 77 36 60 0.105 -2.31 5.04 Intr - 249850 249782 69 2 0 74 102 38 0.145 2.95 5.03 Intr - 250708 250654 55 1 1 92 5 56 0.137 -3.75 5.02 Intr - 251895 251763 133 2 1 78 101 52 0.304 6.15 5.01 Intr - 259690 259537 154 2 1 73 91 38 0.327 1.73 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 145165 144975 191 1 2 55 80 189 0.813 13.29 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575r:107614522_107875419|GENSCAN_predicted_peptide_1|282_aa MPNIKIFSGSSHQDLSQKIADRLGLELGKVVTKKFSNQETCVEIGESVRGEDVYIVQSGC GEINDNLMELLIMINACKIASASRVTAVIPCFPYARQDKKDKSRAPISAKLVANMLSVAG ADHIITMDLHASQIQVSVEAKYWCLKDRLNVDFALIHKERKKANEVDRMVLVGDVKDRVA ILVDDMADTCGTICHAADKLLSAGATRVYAILTHGIFSGPAISRINNACFEAVVVTNTIP QEDKMKHCSKIQVIDISMILAEAIRRTHNGESVSYLFSHVPL >gi568815575r:107614522_107875419|GENSCAN_predicted_CDS_1|849_bp atgccgaatatcaaaatcttcagcggcagctcccaccaggacttatctcagaaaattgct gaccgcctgggcctggagctaggcaaggtggtgactaagaaattcagcaaccaggagacc tgtgtggaaattggtgaaagtgtacgtggagaggatgtctacattgttcagagtggttgt ggcgaaatcaatgacaatttaatggagcttttgatcatgattaatgcctgcaagattgct tcagccagccgggttactgcagtcatcccatgcttcccttatgcccggcaggataagaaa gataagagccgggcgccaatctcagccaagcttgttgcaaatatgctatctgtagcaggt gcagatcatattatcaccatggacctacatgcttctcaaattcaggtatcagtggaagct aaatattggtgtttgaaagacaggctgaatgtggactttgccttgattcacaaagaacgg aagaaggccaatgaagtggaccgcatggtgcttgtgggagatgtgaaggatcgggtggcc atccttgtggatgacatggctgacacttgtggcacaatctgccatgcagctgacaaactt ctctcagctggcgccaccagagtttatgccatcttgactcatggaatcttctccggtcct gctatttctcgcatcaacaacgcatgctttgaggcagtagtagtcaccaataccatacct caggaggacaagatgaagcattgctccaaaatacaggtgattgacatctctatgatcctt gcagaagccatcaggagaactcacaatggagaatccgtttcttacctattcagccatgtc cctttataa >gi568815575r:107614522_107875419|GENSCAN_predicted_peptide_2|75_aa MKRGRNYVVKDEVNSGAWHISQPSNKEKFATASRFMKLPFSLQTAEGLISTNSMSLANIW EGKFAKVLEAQQNFD >gi568815575r:107614522_107875419|GENSCAN_predicted_CDS_2|228_bp atgaagagagggaggaattacgtggtgaaggatgaagtgaacagcggtgcttggcatata agccagccctcaaacaaggagaaatttgctacagcctctagatttatgaaattgcccttc tctctgcagacagcagagggcctcattagcacaaactcaatgtcactggcaaatatatgg gaaggaaagtttgccaaggttctagaagcacagcaaaactttgattaa >gi568815575r:107614522_107875419|GENSCAN_predicted_peptide_3|864_aa MRGWGVLTEIGNRSPCTPPPQKKGGFGGRQFKATTLGALEGVVSSVGGPIRGPPQSPLTL GKLGTALGPRVAPQGGGPRPQEEGEEKEEEEEVSPKDPLGAVCPAVSIRTRSSTARRGPS RSLSFPGTLESPQASSVGAHLPAAPDLAGQATSEPEKMAQSKLDCRSPVGLDCCNCCLDL AHRSGLQRGSSGENNNPGSPTVSNFRQLQEKLVFENLNTDKLNSIMRQDSLEPVLRDPCY LINEGICNRNIDQTMLSILLFFHRGAGSSSPCGELGWASLPSQGPGEATKEQRTLARLVT TPCTSTCILDGDTEAQNCCEMKVERLEEPQLVEEPQLGEDPELGEEAQFLEELQLEVVES GSDLDPEDLGSSPDFATNLMLELEAHLVDFTKASPGEKQKDDEGYTLGPQVEYYYIWTGK EQENLCHFKDHPFYSEAHNGGRCSCPLHKPLVCCTPIPNKILLVAVFQALPIAPKSFTCY SNLRAGLQRRAQPSAACQQPPSRPAAQPRTKPGQSFLAARAMNTEMYQTPMEVAVYQLHN FSISFFSSLLGGDVVSVKLDNSASGASVVAIDNKIEQAMDLVKNHLMYAVREEVEILKEQ IRELVEKNSQLERENTLLKTLASPEQLEKFQSCLSPEEPAPESPQVPEAPGGSAVLPMPD LQLNEKKQMKKGNLENSPTTPQTSLVIKPFSTRKGKKLAPQSHSDHNGIKQEINNRKIVE NPENTWRLNNTLPSNTWIKEVSTEILKYFELNENENIGQAASKPMLISITCLWPPSPVTG STCRPTRPRLGERKACGPPCGRQTRAGPGRFALPGAPEPLKQGFPFSASTRHLHIPLAIA LTARFSGLCTFIVIISIVISATTD >gi568815575r:107614522_107875419|GENSCAN_predicted_CDS_3|2595_bp atgcgggggtggggagtactgactgaaataggcaaccgctctccttgcacgccccccccc cagaaaaaggggggctttggagggcggcagtttaaggctacaacacttggggctctcgag ggcgtcgtctcttctgtagggggacccattcgaggtccgccccaatccccgctcacactt gggaaacttgggactgcgctggggccgcgtgtggcacctcaggggggcggcccccggcct caagaggagggggaggagaaggaggaagaggaggaagtgagcccgaaggatccgctcgga gctgtttgtccagctgtttctattcgcacccggagcagtacagccagaagggggccgagc cgaagcttgtcttttccaggcaccctggagtcccctcaggccagctcggtgggcgcgcac ctgccagccgcccctgacctcgcaggccaggcgacctccgagcctgagaagatggcccag tccaagctcgattgccgctcacctgtcggcctcgactgctgcaactgctgcctggacctg gcccatcggagtgggctccagcgaggcagcagcggggagaacaacaacccgggcagccct acagtgagcaactttcggcagctgcaggaaaagctggtctttgagaacctcaataccgac aagctcaacagcataatgcggcaggattcgctagagccggtgctgcgggacccctgctac ctgatcaacgagggcatctgcaaccgcaacatcgaccagaccatgctctccatcctgctc ttcttccacagaggggcaggaagttctagcccttgtggggagctaggatgggcttctctg ccctcacaaggccctggagaggccaccaaggagcagaggacattagccaggttggtgacc acaccttgtacctccacgtgcatcttagatggagacacagaggctcagaactgctgtgag atgaaggtggaaaggctggaggagccccagctcgtggaggaaccccaactcggggaggac ccagagcttggggaggaggcacagttcctggaggagctacagctcgaggtggtagaaagt gggtcagatttggatccagaagatttgggttcgagtccagactttgccaccaatctgatg cttgagcttgaggctcaccttgtggatttcacgaaggcaagccctggagagaagcagaaa gacgatgaaggctacacacttgggcctcaagtggagtattactacatctggactgggaag gagcaagagaacctctgccacttcaaagatcacccattctatagcgaggcccacaatgga ggcaggtgctcctgccctcttcacaagcccctcgtctgctgcacccccatccccaacaag atcctcctggttgctgtcttccaagcattgccaatagcccccaagtccttcacatgctac tcaaacctcagggccggactccagcgcagagcccagcccagcgcagcctgccagcagcca cccagccgcccagccgcccagccccgcacgaaacccggccagagcttcctagcagcccga gccatgaacaccgaaatgtatcagacccccatggaggtggcggtctaccagctgcacaat ttctccatctccttcttctcttctctgcttggaggggatgtggtttccgttaagctggac aacagtgcctccggagccagcgtggtggccatagacaacaagatcgaacaggccatggat ctggtgaagaatcatctgatgtatgctgtgagagaggaggtggagatcctgaaggagcag atccgagagctggtggagaagaactcccagctagagcgtgagaacaccctgttgaagacc ctggcaagcccagagcagctggagaagttccagtcctgtctgagccctgaagagccagct cccgaatccccacaagtgcccgaggcccctggtggttctgcggtacttcccatgccagac ttgcagctgaatgagaaaaagcagatgaaaaaaggaaacctagaaaatagcccaacaacg ccccaaacctctctggtcatcaagcctttctctacaagaaaggggaaaaaactggcccca cagtcacatagtgaccacaatggaattaaacaagaaatcaataacaggaagatagttgaa aatcctgaaaatacttggagattaaacaacacacttccaagtaacacatggatcaaagaa gtctcaacagaaattttaaaatattttgaactaaatgaaaatgaaaatataggccaggcg gcttccaagcccatgctgatcagcatcacttgcctctggccaccctcccctgtgaccggc agcacatgtcgccccacgcggccccggcttggggagcgtaaggcatgtgggcctccatgc ggccgccagacgagggcgggaccgggacgcttcgccctgcctggggctcctgagcccctg aaacagggctttcctttctctgccagcactaggcatctccacattcctctggccattgct ctgactgctaggttctcgggcctctgcaccttcatcgttattattagcatcgttatatca gctaccactgattga >gi568815575r:107614522_107875419|GENSCAN_predicted_peptide_4|596_aa MLLTVFLSELRIQVFSGGVTAAKLDRKRPSACCPTSTMSKDLKILCKDPALELSCYRDHQ FSGRKFQQEKLLKESSTLNMGNLSFYTTEEKIHELFSRSDIRNIFMGLDKIKKTACGFCF VECHNRADAENAMRFLTGTCLDEWIICTDWDVGFREGQQYGRGKSGGQAWSKFLAWSYRF RCHQDEGKNGSIHILGADDEGGSHKGQSKNTNISTYYEPGSLLVIVFEKRQRRRRRAAAY SGSGGGGDRGPGARAGARANPLRRQTRAQRPRASGGDGWHLGTLWVKRAPSPRPWRSSVV ALAGLVSGSGRFSGQTPLVDERHPQSLDHRLTETEQSDNKPIIQSEIVLKHEIPVFAICL DRSIILCFSCAHRILVSSCSSGESIEPITAFQCPTCRYVISLNHRGLDGLKRNVTLQNII DRFQKASVSGPNSPSESRRERTYRPTTAMSSERIACQFCEQDPPRDAVKTCITCEVSYCD RCLRATHPNKKPFTSHRLVEPVPDTHLRGITCLDHENEKVNMYCVSDDQLICALCKLVGR HRDHQVASLNDRFEKLKQTLEMNLTNLVKRNSELENQMAKLIQICQQVEVSQVMML >gi568815575r:107614522_107875419|GENSCAN_predicted_CDS_4|1791_bp atgctgcttactgtcttcctctctgagctgagaatccaggttttcagtggtggtgtcact gctgccaagctggaccgaaaacggccatcggcatgctgcccaaccagcaccatgtccaaa gacctgaaaattctatgtaaagaccctgctttggagctgagctgctaccgggaccatcag ttcagtggccgtaaatttcagcaggaaaaattactgaaggaaagctccacattgaatatg gggaatctttccttttatacaaccgaagagaaaatacatgaactctttagtagatctgat atcaggaatatctttatgggcctggataaaataaagaaaacagcatgtggtttttgcttt gtagaatgccataacagagctgatgctgaaaatgccatgcggtttctaactgggacctgc ctagatgaatggattatctgcactgattgggatgtcggttttagagagggtcaacagtat ggtcgcggtaaatctgggggtcaggcttggagtaaatttctggcctggagctatagattc aggtgtcatcaggatgagggtaaaaatggaagcatccacattttaggggctgatgatgaa ggaggatcacacaaaggacaaagtaagaacacaaatatcagcacctactatgagccaggc tccttgctcgtaattgtttttgaaaagcggcagcggcggcgaaggcgggcggcggcctac agtggtagcggcggcggcggcgaccggggcccgggagctcgcgccggagcccgagccaac ccgctgcggaggcagacgagagcccagcgccctcgagcgagcggaggagatggctggcac ctgggaacgctatgggtaaaacgcgccccctcgccgcggccctggaggtcgagcgtcgta gccctcgcagggctagtctccggctccggccgcttttcaggtcagactccgctggttgat gaacggcatccccagtctctggatcatcgtcttactgagaccgaacaatctgataacaaa cccatcatacaatctgaaatcgtcctgaaacatgaaatcccagtatttgcaatttgcctg gataggagcataatcctctgcttcagctgtgcccatcgcattttggtatcaagctgcagc tctggtgaatccattgaacccattactgctttccagtgtcctacctgcaggtatgttatc tcgctgaaccaccggggcctggatggcctcaagaggaatgtgactctgcagaacattatt gatcgcttccagaaggcttcagtcagtgggcccaattcccctagtgagagccgccgggaa aggacttacaggcccaccactgccatgtctagcgagcgaattgcttgccaattctgtgag caggacccgccaagggatgcagtaaaaacatgcatcacctgtgaggtctcctactgtgac cgttgcctgcgggccacgcaccccaacaagaaacctttcaccagccaccgcctggtggaa ccagtgccagacacacatcttcgagggatcacctgcctggaccatgagaatgagaaagtg aacatgtactgtgtatctgatgaccaattgatctgtgccttatgcaaactggtgggtcgt caccgagaccatcaggtcgcatccctgaatgatcgatttgagaaactcaagcaaactctg gagatgaacctcaccaacctggttaagcgcaacagcgaactagaaaatcaaatggccaaa ctaatacagatctgccagcaggttgaggtatcacaggtcatgatgctgtga >gi568815575r:107614522_107875419|GENSCAN_predicted_peptide_5|168_aa XEAGKLEHDCEQLVVQTYVTREDLKETLLENPNWILFTDRSSFVEQGIHKASGFAADILF LTEMEYESMKVFNFALWPADNMQFAYSSVIYFYFLNTISTIISDLQNTAHGAMGCNSLSS VPHEGLSEYGELLWAIKGSNQEEFGQDNGQIYLSINTKNLLDILAEGQ >gi568815575r:107614522_107875419|GENSCAN_predicted_CDS_5|507_bp naggaagctgggaagcttgaacatgactgtgaacagctagtagtacagacctatgtgacc agagaggatctcaaggaaaccctcttagagaacccaaactggattctctttacagacaga agttcttttgtagaacaagggatacataaggcaagtggctttgctgctgatatcctgttt ttgacagaaatggagtacgaaagcatgaaggtgtttaattttgctctctggcctgcagac aatatgcaatttgcgtattcatctgtgatatatttttacttcttaaataccataagcaca ataatcagcgacttgcaaaacactgcacatggagccatgggttgcaactccctctcctct gtaccccatgaagggctgtcagaatatggagagctgctctgggctattaaaggcagtaat caagaagaatttgggcaggataatgggcaaatttacttaagtatcaacacgaagaatctt ttggatattcttgcagaagggcagtaa