GENSCAN 1.0 Date run: 4-Nov-116 Time: 13:02:39 Sequence gi568815591f:76202713_76404170 : 201458 bp : 51.31% C+G : Isochore 3 (51 - 57 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 695 782 88 2 1 75 92 50 0.774 4.57 1.02 Term + 4138 4206 69 0 0 96 55 66 0.933 2.33 1.03 PlyA + 4426 4431 6 1.05 2.00 Prom + 5209 5248 40 -3.51 2.01 Init + 7422 7505 84 2 0 79 66 44 0.197 0.32 2.02 Intr + 32316 32587 272 2 2 113 72 409 0.402 38.68 2.03 Intr + 45476 45577 102 2 0 133 84 132 0.220 17.09 2.04 Intr + 57084 57170 87 0 0 87 77 34 0.729 1.78 2.05 Intr + 57194 57322 129 2 0 53 76 257 0.668 21.31 2.06 Intr + 57405 57485 81 0 0 140 97 64 0.999 11.75 2.07 Intr + 58640 58702 63 2 0 87 80 120 0.927 9.42 2.08 Intr + 62053 62103 51 1 0 113 99 56 0.817 7.71 2.09 Intr + 64546 64771 226 1 1 57 40 183 0.440 8.92 2.10 Intr + 71372 71488 117 1 0 69 77 71 0.142 5.27 2.11 Intr + 78729 79090 362 2 2 111 101 140 0.331 12.18 2.12 Intr + 79936 80160 225 1 0 101 117 142 0.998 16.03 2.13 Intr + 80252 80389 138 2 0 96 80 168 0.984 16.89 2.14 Term + 82903 83131 229 1 1 97 52 236 0.906 17.33 2.15 PlyA + 84554 84559 6 1.05 3.00 Prom + 99926 99965 40 -4.11 3.01 Init + 100001 100364 364 1 1 102 105 742 0.632 72.53 3.02 Intr + 101090 101153 64 0 1 118 94 144 0.999 16.27 3.03 Term + 101272 101461 190 1 1 94 42 288 0.972 22.04 3.04 PlyA + 101559 101564 6 1.05 4.02 PlyA - 102510 102505 6 1.05 4.01 Sngl - 106882 106643 240 1 0 97 36 163 0.489 5.41 4.00 Prom - 108442 108403 40 -4.71 5.05 PlyA - 114207 114202 6 1.05 5.04 Term - 127521 126865 657 0 0 113 48 1256 0.999 118.48 5.03 Intr - 156112 156010 103 0 1 100 92 291 0.697 31.38 5.02 Intr - 172772 172665 108 1 0 40 47 109 0.241 1.90 5.01 Init - 180722 180580 143 2 2 83 70 65 0.417 3.77 5.00 Prom - 184574 184535 40 -3.71 6.10 PlyA - 186650 186645 6 1.05 6.09 Term - 187663 187347 317 2 2 116 46 277 0.999 21.75 6.08 Intr - 189329 189252 78 0 0 88 90 27 0.874 2.92 6.07 Intr - 191004 190675 330 1 0 111 100 563 0.577 55.95 6.06 Intr - 191192 191118 75 0 0 75 106 14 0.696 1.88 6.05 Intr - 192618 192541 78 1 0 134 60 -5 0.518 1.32 6.04 Intr - 195120 194806 315 1 0 95 80 506 0.893 47.09 6.03 Intr - 196085 196008 78 0 0 99 68 47 0.708 3.82 6.02 Intr - 197879 197574 306 0 0 78 101 411 0.818 38.37 6.01 Intr - 200449 200363 87 2 0 44 80 51 0.075 0.34 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:76202713_76404170|GENSCAN_predicted_peptide_1|52_aa XCSVSNVYFAVERNNLDKEMGDYFYGIKCWRLLCRRKLTDGATAPLASCQGL >gi568815591f:76202713_76404170|GENSCAN_predicted_CDS_1|159_bp nngtgtagtgttagcaatgtctactttgctgttgaaagaaataatctggataaggagatg ggagactacttttacggtataaaatgctggaggctgctgtgccggcggaagctgacggac ggtgccacggctcccttagcatcctgccagggcctctga >gi568815591f:76202713_76404170|GENSCAN_predicted_peptide_2|721_aa MPLGAGQPAPYTNLQLMAQPLVSDRAPQGQRPRPAAPGPATMSSTVNNGAASMQSTPDAA NGFPQPSSSSGTWPRAEEELRAAEPGLVKRAHREILDHERKRRVELKCMELQEMMEEQGY SEEEIRQKVGTFRQMLMEKEGVLTREDRPGGHIRVAARRLAGLGQAPCAEKRGEEKSSRD WCVAETPRLTEGAEPGLEYAPFDDDDGPVDCDCPASCYRGHRGYRTKHWSSSSASPPPKK KKKKKGGHRRSRCGSSSPLRKKKKSVKKHRRDRSRSSKCKRKEKNKEKKRLSPKHRDEGR KTGSQRSSGSRSPSPSGGSGWGSPQRNGGSGQRSGAHGGRPGSAHSPPDVRTLRFAEGSR AAGCAPPGLRSPRWPSGPFRAKGSDRGASAGPPGLQILEEVLAKKPSSPSPRVRDKAAAA APTPPARGKESPSPRSAPSSQGRGGRAAGGAGRRRRRRRRRRRSRSSASAPRRRGRRRPR PAPPRGSSRSLSRARSSSDSGSGRGAPGPGPEPGSERGHGGHGKRAKERPPRARPASTSP SPGAHGRRGGPEGKSSSRSPGPHPRSWSSSRSPSKSRSRSAEKRPHSPSRSPSPKKPLSR DKDGEGRARHSEAEATRARRRSRSYSPIRKRRRDSPSFMEPRRITSARKRPIPYYRPSPS SSSSCLSSDYSTRSHSRSPSPGHSHGSYSSRSHGTRSRTRSPSRTPSPSYHSRSSSESGG F >gi568815591f:76202713_76404170|GENSCAN_predicted_CDS_2|2166_bp atgcctctgggagctgggcaacctgccccctacaccaatctccagctgatggcacagccc ttggtatcagacagagcaccccagggccagcggcccaggccagcggctccagggccagcc acgatgtcctccaccgtgaacaacggggcggccagcatgcagtccacacccgacgccgcg aacggcttcccgcagcccagctcctcctcggggacctggccgcgggcggaagaggagctg cgcgccgcggagccgggcctggtgaagcgcgcgcaccgcgagatcctggaccacgagcgc aagcggcgggtggagctcaagtgcatggagctgcaggagatgatggaggagcaggggtat tcggaggaggagattcggcagaaagtggggacattccggcagatgctgatggagaaggag ggagtgctcaccagggaggaccggcctgggggccacatccgggtagcagcccgcagactt gcggggctaggtcaggccccctgcgccgagaaaaggggtgaagaaaagtcgtcccgagac tggtgtgtggcggagaccccgcggctgaccgagggcgctgagccgggcctggagtacgcg ccctttgacgatgacgacggcccagtggactgtgactgcccggcctcctgctaccgcggc caccgcgggtacaggaccaagcattggtctagcagctcggcatcgccccctcccaagaaa aagaagaaaaagaaaggcggccaccggagaagccgctgtgggagctcctcacccctccgc aagaagaagaagagtgtgaagaagcatcgccgagacagatctcgaagctccaagtgcaaa agaaaagagaagaacaaagagaagaagaggctgagccccaagcaccgagacgaagggcga aagacgggcagccagcggtccagcggaagccggtcgccttccccgtcgggcggcagcgga tgggggtcgccccagcggaacggcggcagcgggcagcggagcggagcgcacgggggccgc cccggctcggcgcacagcccgcccgatgtacgtacgcttcgctttgcggagggttcccgc gccgcgggctgcgcccccccgggacttcggtcaccccgctggccttcagggccctttcgg gcaaaaggcagtgacagaggagccagtgcggggcctcctggccttcagatcctggaggaa gttctggccaagaagcccagctcgccctcgcccagggtccgtgacaaggcggcggccgcc gcacccacgccgcccgcgcgggggaaggagagcccgagcccgcgctcggcgccgtcgtcc caaggtcgcggaggccgcgcggcgggcggggcgggcaggcggcggcggcggcggcgtagg cggcggcgctcgcggtcctcggcgtccgcgccccgccgcaggggtcgccggcgcccccgg cccgcgcccccccggggctcgtcgcgctcgctcagcagggcccgctccagcagcgactcc ggcagcggccgcggcgcccccggccccgggcccgagcccggctctgagcgaggccacggc ggacacgggaaacgggccaaggagcggcccccgcgcgcgcggcccgccagcacctctccg tccccgggcgcgcacggccggcgcggcggcccagaagggaagagctcgtcgcgcagcccc ggcccgcacccccgctcctggagctccagccgctcgccctccaaatctcgctcgcgctct gcggagaagcggccccacagccccagccgctcgccgtcgcccaagaagcccctcagccgg gacaaggacggcgagggccgcgcaaggcactctgaggccgaggccacccgcgcccggcgc cgctcccgcagctactcgcccatccgcaagcggcgccgggactcgccaagcttcatggag ccgcggcgcatcaccagcgcccgcaagcgtcctattccatactaccggcccagcccctct tcctcctccagctgcttgagcagcgactactcgacccggagccacagccgcagccccagc cccggccacagccacgggagctacagcagtcgcagccatgggacccgcagccggacacgc agcccctcgaggacccccagtcccagctaccacagccggagcagctctgagagcgggggc ttctga >gi568815591f:76202713_76404170|GENSCAN_predicted_peptide_3|205_aa MTERRVPFSLLRGPSWDPFRDWYPHSRLFDQAFGLPRLPEEWSQWLGGSSWPGYVRPLPP AAIESPAVAAPAYSRALSRQLSSGVSEIRHTADRWRVSLDVNHFAPDELTVKTKDGVVEI TGKHEERQDEHGYISRCFTRKYTLPPGVDPTQVSSSLSPEGTLTVEAPMPKLATQSNEIT IPVTFESRAQLGGPEAAKSDETAAK >gi568815591f:76202713_76404170|GENSCAN_predicted_CDS_3|618_bp atgaccgagcgccgcgtccccttctcgctcctgcggggccccagctgggaccccttccgc gactggtacccgcatagccgcctcttcgaccaggccttcgggctgccccggctgccggag gagtggtcgcagtggttaggcggcagcagctggccaggctacgtgcgccccctgcccccc gccgccatcgagagccccgcagtggccgcgcccgcctacagccgcgcgctcagccggcaa ctcagcagcggggtctcggagatccggcacactgcggaccgctggcgcgtgtccctggat gtcaaccacttcgccccggacgagctgacggtcaagaccaaggatggcgtggtggagatc accggcaagcacgaggagcggcaggacgagcatggctacatctcccggtgcttcacgcgg aaatacacgctgccccccggtgtggaccccacccaagtttcctcctccctgtcccctgag ggcacactgaccgtggaggcccccatgcccaagctagccacgcagtccaacgagatcacc atcccagtcaccttcgagtcgcgggcccagcttgggggcccagaagctgcaaaatccgat gagactgccgccaagtaa >gi568815591f:76202713_76404170|GENSCAN_predicted_peptide_4|79_aa MPEPPHPAVGSCAARASLTSTAPCSTAPSPIDHPRAEECGRTARDWQAAPPAAPIQDSLG EANWAPESGGEPLCQAKGL >gi568815591f:76202713_76404170|GENSCAN_predicted_CDS_4|240_bp atgcctgagcctccccaccccgctgtgggctcctgcgcggcccgagcctccctgacaagc accgccccctgctccacggcgcccagtcccatcgaccacccaagggctgaggagtgcggg cgcacagcacgggactggcaagcagctccacctgcggcccccatacaggattcactgggt gaagccaactgggctcctgagtctggtggagaacctttatgtcaagctaagggattgtaa >gi568815591f:76202713_76404170|GENSCAN_predicted_peptide_5|336_aa MAMNLLAMKRFGKCYSRSFNGNLAWPAEEPRSVVIGAKKNVLTNQRSRVTPLGFLQQFVQ MEEMPQLPGLVASTFPEQLSFLRRPSPAKMVDREQLVQKARLAEQAERYDDMAAAMKNVT ELNEPLSNEERNLLSVAYKNVVGARRSSWRVISSIEQKTSADGNEKKIEMVRAYREKIEK ELEAVCQDVLSLLDNYLIKNCSETQYESKVFYLKMKGDYYRYLAEVATGEKRATVVESSE KAYSEAHEISKEHMQPTHPIRLGLALNYSVFYYEIQNAPEQACHLAKTAFDDAIAELDTL NEDSYKDSTLIMQLLRDNLTLWTSDQQDDDGGEGNN >gi568815591f:76202713_76404170|GENSCAN_predicted_CDS_5|1011_bp atggccatgaaccttttggctatgaaacggtttgggaagtgctattccaggtccttcaac ggaaacttagcctggcctgcagaagaaccacggagcgttgtgattggagcaaagaagaat gtgctaacgaatcagagaagccgggtcaccccgctgggctttctgcagcagtttgtgcag atggaagagatgccccagctacctggcctggtggcctccaccttccctgagcagctgtct ttcttgcgcagacccagccccgcgaagatggtggaccgcgagcaactggtgcagaaagcc cggctggccgagcaggcggagcgctacgacgacatggccgcggccatgaagaacgtgaca gagctgaatgagccactgtcgaatgaggaacgaaaccttctgtctgtggcctacaagaac gttgtgggggcacgccgctcttcctggagggtcatcagtagcattgagcagaagacatct gcagacggcaatgagaagaagattgagatggtccgtgcgtaccgggagaagatagagaag gagttggaggctgtgtgccaggatgtgctgagcctgctggataactacctgatcaagaat tgcagcgagacccagtacgagagcaaagtgttctacctgaagatgaaaggggactactac cgctacctggctgaagtggccaccggagagaaaagggcgacggtggtggagtcctccgag aaggcctacagcgaagcccacgagatcagcaaagagcacatgcagcccacccaccccatc cgattaggcctggctcttaactactccgtcttctactatgagatccagaacgccccagag caagcgtgccacttggccaagaccgcgttcgacgacgccatcgccgagcttgacaccctc aacgaggactcctacaaggactccacgctcatcatgcagctcctccgcgacaacctcacg ctctggacgagcgaccagcaggacgacgatggcggcgaaggcaacaattaa >gi568815591f:76202713_76404170|GENSCAN_predicted_peptide_6|554_aa XSCSELHDEEECGCLGSLSALPLICYVALKLRLVGGPSRCRGRLEVMHGGSWGSVCDDDW DVVDANVVCRQLGCGLALPVPRPLAFGQGRGPILLDNVECRGQEAALSECGSRGWGVHNC FHYEDVAVLCDEFLPTQPPTRKMLTSRAPPTTLPNGKSEGSVRLVGGANLCQGRVEILHS GLWGTVCDDDWGLPDAAVVCRQLGCGAAMAATTNAFFGYGTGHILLDNVHCEGGEPRLAA CQSLGWGVHNCGHHEDAGALCAGLGPPTLTALPSSATREDWAWQTDPSATGVGPQPSRET ALLTTAAWAAGKKSGRLRLVGGPGPCRGRVEVLHAGGWGTVCDDDWDFADARVACREAGC GPALGATGLGHFGYGRGPVLLDNVGCAGTEARLSDCFHLGWGQHNCGHHEDAGALCAGEA DSEGPEELGLQVQQDGSETTRVPTPRPRDGHLRLVNGAHRCEGRVELYLGQRWGTVCDDA WDLRAAGVLCRQLGCGQALAAPGEAHFGPGRGPILLDNVKCRGEESALLLCSHIRWDAHN CDHSEDASVLCQPS >gi568815591f:76202713_76404170|GENSCAN_predicted_CDS_6|1665_bp ncatcgtgctcagaactgcatgatgaagaagagtgtggctgtctgggtagtctctcagct ctacccttgatttgctacgtggccctgaagctgaggctggtggggggccccagccgctgc cggggccgcctggaagtcatgcacggtggctcctggggcagcgtctgtgatgacgactgg gacgtggtggacgccaacgtagtgtgtcgccagctgggctgtggcctggcactgcccgtg ccacggccccttgcctttggccaaggccgaggccccatcctgctggacaacgtggagtgc cgcgggcaggaagctgcgctgagcgagtgcggcagccgcggctggggcgtccacaattgc tttcactacgaggatgtggctgtcctgtgtgatgaattcttgccaacgcagcccccaaca aggaagatgttaaccagtagagcacctcctacgacactgccgaatggaaaaagtgagggc agcgtacgcctggtagggggcgcgaacctgtgtcagggccgagtggagatcctgcacagt ggcctgtggggcaccgtgtgtgacgacgactgggggctgccggatgccgctgtggtctgt cgtcagctgggctgcggggcggccatggccgccaccaccaacgccttcttcggctatggc accggacacatcctgctggacaacgtgcactgcgaaggcggcgagccccgcctggcagcc tgccagagcctgggctggggtgtgcacaactgcggccaccacgaggacgcgggcgcgctc tgcgcaggcctgggtcccccaacgctcacagcactgccatcctcagccacaagagaggac tgggcttggcagacagatccgtccgctacaggagttggcccccagccttcccgggagaca gcactgctcaccaccgccgcctgggccgcggggaagaaaagtggacggctgcgactggtg ggcggcccgggtccgtgccgcggccgcgtggaggtgttgcacgccgggggctggggcacc gtgtgcgacgatgactgggactttgcggacgcgcgcgtggcctgccgcgaagcgggctgc gggcctgcgctgggcgctacgggactgggccacttcggctacggccgcggccccgtgctg ctggacaacgtgggctgcgccggcaccgaggctcgcctgagcgactgcttccacctgggc tggggccagcacaactgcggccaccacgaggacgcgggagcgctctgcgcaggtgaggct gacagcgaaggcccagaggagctgggactgcaagtccagcaggatggttctgagaccacg cgggtgcccactcctcggcccagggacgggcatctacgtctggtcaatggagcccaccga tgcgagggacgtgtagagctctacctagggcaacggtggggcactgtctgtgatgatgct tgggacctgcgggcagccggtgtcctgtgccgccagctgggctgtggccaggccctcgca gcccctggcgaggctcactttggcccaggccgaggccccattctcctggacaatgtcaag tgccgtggggaagaaagtgctctgctgctctgctctcatatccgctgggatgcccacaac tgtgaccacagcgaggatgccagtgtcctgtgccagccttcatga