GENSCAN 1.0    Date run: 8-Nov-116    Time: 12:31:53

Sequence gi568815581f:68150188_68356966 : 206779 bp : 46.03% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.05 Intr -    458    108  351  2  0   34   56   194 0.579   6.32
 1.04 Intr -   1159    716  444  1  0   29   78   152 0.036   1.90
 1.03 Intr -   4455   4346  110  1  2   52   54    90 0.114   2.00
 1.02 Intr -   8800   8735   66  2  0   49   59    89 0.041   0.98
 1.01 Init -  19287  19110  178  0  1   97   40   109 0.389   4.74
 1.00 Prom -  20761  20722   40                              -2.06

 2.00 Prom +  21506  21545   40                              -9.46
 2.01 Init +  23812  23872   61  0  1   88  105    13 0.304   4.41
 2.02 Intr +  46470  46631  162  1  0   49   66    79 0.360   1.75
 2.03 Intr +  46710  46841  132  1  0  122   81    -3 0.441   3.02
 2.04 Intr +  47273  47409  137  0  2   81   86     5 0.470  -0.11
 2.05 Intr +  48716  48918  203  1  2  106   54   397 0.645  36.18
 2.06 Intr +  48920  49411  492  2  0   65   -5   291 0.369   9.61
 2.07 Intr +  49712  49780   69  2  0   67   89   107 0.402   7.00
 2.08 Intr +  55487  55646  160  2  1   37   80    85 0.187   2.59
 2.09 Intr +  55877  56051  175  1  1  -96   80   337 0.398  14.01
 2.10 Intr +  58525  58785  261  2  0   -6    9   223 0.001   2.46
 2.11 Intr +  77289  77433  145  1  1   42   82    51 0.013  -0.76
 2.12 Intr +  97297  97638  342  1  0   60   -5   294 0.179  11.85
 2.13 Intr +  97730  97758   29  2  2   68   55    -8 0.121  -8.34
 2.14 Intr + 100607 100780  174  0  0   63   68    55 0.455   0.91
 2.15 Intr + 100863 100991  129  1  0   73   21   137 0.043   6.17
 2.16 Intr + 104217 104380  164  1  2  128  115   131 0.992  19.49
 2.17 Intr + 105513 105689  177  2  0   66  106    79 0.990   7.62
 2.18 Term + 106627 106782  156  0  0   71   55    83 0.453   1.23
 2.19 PlyA + 106807 106812    6                               1.05

 3.13 PlyA - 107545 107540    6                               1.05
 3.12 Term - 109045 108848  198  1  0   15   32   173 0.091   1.80
 3.11 Intr - 119159 118915  245  0  2   99   76   153 0.552  12.42
 3.10 Intr - 121467 120652  816  1  0   52   91   531 0.444  41.20
 3.09 Intr - 123883 123740  144  2  0  101  119   144 0.964  19.05
 3.08 Intr - 128027 127902  126  0  0   38   92    43 0.175   0.35
 3.07 Intr - 140357 140181  177  0  0   55  -35   158 0.137   0.09
 3.06 Intr - 142438 142304  135  2  0   50   48   117 0.170   4.44
 3.05 Intr - 142726 142535  192  2  0   73   81   127 0.366   9.96
 3.04 Intr - 151415 151365   51  0  0   68  109    13 0.029   0.28
 3.03 Intr - 158762 158637  126  0  0   98   44    45 0.012   1.75
 3.02 Intr - 160683 160480  204  1  0   28   89   123 0.043   5.57
 3.01 Init - 173716 173668   49  1  1   76   58    63 0.035   1.21
 3.00 Prom - 174652 174613   40                              -4.96

 4.00 Prom + 175258 175297   40                              -7.76
 4.01 Init + 178591 178679   89  0  2   69   65   110 0.201   6.91
 4.02 Intr + 193417 193604  188  1  2  119   61    93 0.709   9.03
 4.03 Intr + 196938 196985   48  1  0   98   83    33 0.515   2.45
 4.04 Intr + 201388 201499  112  2  1   65   90    66 0.671   3.94
 4.05 Intr + 206480 206617  138  2  0   71  100   170 0.912  16.08

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815581f:68150188_68356966|GENSCAN_predicted_peptide_1|383_aa
MPEPSPASVSSCAAGASPRSAAPCSTAPSPIYRPRAEQCERMAQDWQAAPPATPVQDPLA
PDAKNCRITTLISVDVKKDHQESGKIAVEYRPSEEIVDVRWEEELHGLIQVCGDKNSRTQ
QESPAQPLEEMEPSAIQDEAPTEPPGPPIEPELSPSEQKQPAQPSESSGEVESSPAQQDN
PPIPTEQADFSLAQPDLPSPPLCSPEKTESPVQQEATAQTPDPPREAEPSPVQQEFPAEP
PEPPKEAEPSATQQEASSHPLKSTEEQEVPAQIPEPSMEAEPSPTQQEATVQAPEPPKEV
ESSSQQMVPAQLPEPPKEVAAQPPAHYEVTVPTPGQDQAQHWTLPSVTVQPLDLGLTITP
EPTKEVEHSTALKKAIVPPKHPK

>gi568815581f:68150188_68356966|GENSCAN_predicted_CDS_1|1149_bp
atgcctgagccttcccccgcctccgtaagttcctgtgcagctggagcctccccgaggagc
gccgccccctgctccacggcgcccagtcccatctaccgcccgagggctgagcaatgcgag
cgcatggcgcaggactggcaggcagctccacctgcaaccccggtgcaggatccactagct
cctgatgccaagaactgccgaattactactttgatttcagtagatgtcaaaaaggatcat
caggaaagtggtaaaatagctgtggagtacagacccagtgaagagattgtagatgtcaga
tgggaagaagaactacacggcttaattcaagtatgtggagataaaaactcaaggacccag
caggagagcccagctcagcctctagaagagatggaaccttctgcaatccaagatgaggct
ccaactgagcctccaggtcctcctatagagcctgaactttcccccagtgagcagaagcag
ccagctcagccttctgagtcttctggggaggttgaatcttctccagcccagcaggacaac
cctcctattcccactgagcaggctgacttttctctagcccagcctgatctcccttcccca
cctctgtgttctccggaaaagactgaatctccagtccagcaagaggccacagctcagact
ccagatccccctagggaggcagaaccttctccagtccagcaagagttcccagctgagcca
ccagagccccctaaggaggctgaaccatctgcaacccagcaggaagcctcaagtcatcct
ctgaagtccactgaagagcaagaggttccagctcagattccagagccctctatggaggca
gaaccttctccgacccagcaggaggccacagttcaggctccagagccccctaaggaggta
gaatcttcaagccagcagatggtcccagctcagcttccagagccacctaaggaggttgca
gctcaacctccagctcattatgaggtgacagttccaacaccaggtcaggatcaagctcag
cattggacattgcccagtgtcacagttcaacctttggacctgggacttaccattactcca
gaacccactaaggaggttgaacattctacagccctgaagaaggctatagttcctccaaag
caccctaag

>gi568815581f:68150188_68356966|GENSCAN_predicted_peptide_2|1055_aa
MEAQGTHQKSPCIRTRSKHPGSARAASHAKEKVGEGLPSIGLEGSREAPNAHPGVDASVS
CVAWAVDAGDLLATGRGRSRGRVPFLSSPRHRAWWASLRWESRSEFFQARPGCAGKGPAP
VASSGSVQPLRLRPWKPRPGRWSGPNLPARPPPALGLPEARPADKNAAATMPAYVEGDVY
LVVEHPFEYTRKDWRRVAIWPNERYWLLRRSTEHWWHVRREPGGHPFYLPAHTCASCLRW
ANDLPANGNARTRQPATAAPPGPPNSPLGPRAAHLRLPACERGGGSGPRQHPRGAPRPRQ
LPVRPFATRRRDPAQQPGARPARLPVLAACAVPGLPGARRRLASRRPPRKERQLQGCSVA
GSWMCPRPLARSDSETVYEAIQDVHGPPREESGEQGNANELDDRLLPQALLGGYVTYSRG
RRGRQPKQPAASAAERCAPAPAPAPAPAPPPPPPKSGPIGGLSSRHREQQGQEEEDGDVE
ETQESEDNEEHEMEEDEADSDYLEELEDDDDASYCTESSFRSHSTYSSTPAVILTATVCS
FTPKASETTNQPGGTNNSRHVALRAVTLTWKVKVCSFTPEASETTNPPEGRNSKHIRAPE
GTNSGHAAFKNCDTQREVLEARSLNSLLLSCALAAGSRGESVPGLFQLLVASRIPWLVAA
SLQSSSIRVSASERQGRHEDAPHTCASGFQPPRPPTNKEGKVLRVTAPGSAAALRQPAAA
ADSFGATLVAFVTPAMPLKPAACRGRSHTGDPSDAGASVALENRAAEPPRAQARNRRLRG
RPHAQGRGMGSLGNTRIISEEYIKWLTGYCKAYFYGLRVKLLEPVPVSVTRCSFRVNENT
HNLQIHAGDILKFLKKKKPEDAFCVVGITMIDLYPRDSWNFVFGQASLTDGVGIFSFARY
GSDFYSMHYKGKVKKLKKTSSSDYSIFDNYYIPEITSVLLLRSCKTLTHEIGHIFGLRHC
QWLACLMQGSNHLEEADRRPLNLCPICLHKLQCAVGFSIVERYKALVRWIDDESSDTPGA
TPEHSHEDNGNLPKPVEAFKEWKEWIIKCLAVLQK

>gi568815581f:68150188_68356966|GENSCAN_predicted_CDS_2|3168_bp
atggaggcccagggaactcaccagaagtccccatgcatcagaacaaggtcgaagcatcct
ggatctgcgcgggccgcatcccacgcgaaggagaaggtaggcgaagggctgccctccatt
gggctggagggcagcagggaggcccccaacgcccacccaggcgttgacgcctctgtatct
tgcgtcgcctgggcggtggacgcaggagacctcctggcgactggtcggggaaggagccgt
ggccgggtccccttcttatccagcccgagacaccgcgcctggtgggcgtccctgcgctgg
gaatcccgctcggagtttttccaagcccggccgggctgtgcggggaaaggccctgctccc
gttgccagctccggttccgtgcagcctctccggcttcgcccctggaagccacgcccaggg
cgatggtcggggcctaacctgccggctcggccccccccggcccttgggctccctgaggcc
cgcccagccgacaaaaacgccgcggccacgatgccggcgtatgtggagggggatgtctac
ctggtggtggagcaccccttcgagtatacccgcaaggactggcgccgcgtggccatctgg
cccaatgagcgctactggctgctgcggcgcagcacggagcactggtggcacgtgcggcgc
gagcctggcggccaccccttctacctgcccgcgcatacgtgcgcgagctgcctgcgctgg
gcaaatgacctccccgcgaacgggaacgcccggacgagacaacctgccactgccgcgccg
ccaggtccccccaacagtcccctcggcccccgagccgctcacctacgactaccggcttgt
gagcgcggcggcggcagcgggccccgacagcacccccgcggagccccgaggccgcgccag
ctccctgtgcggcccttcgcaacgcggcgccgcgacccagcgcagcagcctggcgcccgg
cctgcccgcctgcctgtacttgcggcctgcgcagtccctggactacctggcgcgcgccgc
cgtctcgcctcccgccggccacctcggaaggagcggcagcttcaaggctgcagtgtggcg
ggttcctggatgtgcccgcggcccctggcgcgcagcgactcagagaccgtctacgaggcc
atccaggatgtgcacggcccgccgcgggaggagagcggggaacagggaaatgcaaatgag
ctcgatgaccggctgctgccccaagccctgctgggaggttatgtgacctacagcaggggc
aggcggggcaggcagcccaagcagcctgcggcttccgctgcggagcgctgcgccccggcc
cctgccccggcccctgccccggccccgccgccgccgccgcccaagtccggacccatcggg
gggctcagctcgcggcaccgcgagcagcaggggcaggaggaagaggacggcgacgtcgag
gagacccaggagtctgaggacaacgaggagcatgagatggaggaggacgaggctgattcc
gattatctggaggagctggaagacgacgacgacgccagttactgcacagaaagcagcttc
aggagccatagtacctacagcagcactccagctgtaatactcaccgcgacggtctgcagc
ttcactcctaaagccagcgagaccacgaaccaaccaggaggaacaaacaactccagacac
gtcgccttaagagcggtaacactcacctggaaggtcaaggtctgcagcttcactcctgaa
gccagcgagaccacgaacccaccagaaggaagaaactccaaacacatccgagcaccagaa
ggaacaaactccggacacgccgcctttaagaactgtgacactcagcgcgaggttctagag
gccagaagtctaaactccttactgctcagttgtgcactggctgcgggctccaggggagaa
tctgttcctggtctcttccagcttctggtggccagccgcattccttggctcgtggctgca
tcgctccagtcttcaagcatccgggtttcggcctcggagcgccagggacgccacgaggac
gcgccccacacttgcgcttctgggttccagcccccgcggcccccaacaaataaagaaggg
aaagtgctgagggtgacggccccggggagcgctgcggctctacgtcaacctgcggcggcc
gccgactcatttggggccacgctggttgcattcgtcacgccggcgatgcctctcaaaccc
gcggcctgccgaggacgttcccacacgggagaccccagcgacgcgggcgcatctgtggct
ctcgagaaccgggccgcggagccgccgcgagcgcaagcgaggaatcggcgactgcggggg
cggccgcacgctcaggggcgtggcatgggctctctaggaaacaccagaattatcagtgaa
gaatatattaaatggctcacgggctactgtaaagcatatttctatggcttgagagtaaaa
ctcctagaaccagttcctgtttctgtaacaagatgttcctttagagtcaatgagaacaca
cacaacctacaaattcatgcaggggacatcctgaagttcttgaaaaagaagaaacctgaa
gatgccttctgtgttgtgggaataacaatgattgatctttacccaagagactcgtggaat
tttgtctttggacaggcctctttgacagatggtgtggggatattcagctttgccaggtat
ggcagtgatttttatagcatgcactataaaggcaaagtgaagaagctcaagaaaacatct
tcaagtgactattcaattttcgacaactattatattccagaaataactagtgttttacta
cttcgatcctgtaagactttaacccatgagatcggacacatatttggactgcgacactgc
cagtggcttgcatgcctcatgcaaggctccaaccacttggaagaagctgaccggcgccct
ctaaacctttgccctatctgtttgcacaagttgcagtgtgctgttggcttcagcattgta
gaaagatacaaagcactggtgaggtggattgatgatgaatcttctgacacacctggagca
actccagaacacagtcacgaggataatgggaatttaccgaaacccgtggaagcctttaag
gaatggaaagagtggataataaaatgcctggctgttctccaaaaatga

>gi568815581f:68150188_68356966|GENSCAN_predicted_peptide_3|820_aa
MGFRHVGQAGLELLTSGLKNVFEGLEAEECFYKVHLDGGEEELEWVRPDAETTTHQGRVS
QEKHQALGCKWPKLCGDALCVILSAPSPIDHPKAEECRRRARDWQAAPPAAPVRDPLGEA
SWAPESGQQMKEAAELGVSCMGPDLEKLTLYEVKLRLQGCKAAQRPLGCTLLAIQGTLYQ
RIFSPLTQPELVNGKGWHLTQESLSQNGSLEFLTSEPHSPNPNEGSSRRQSLHTNANNMA
FASEQFPNLPSGSGSRGFPGRARVLHLDCQNCQRLTAGPAQLQGSRAANRRKALVSPSSS
VAREDGFAEEMVFTYGIIKTFGVFFNDLMDSFNESNSRISWIISICVFVLTFSAPLATVL
SNRFGHRLVVMLGGLLVSTGMVAASFSQEVSHMYVAIGIISAIMALKERIGWRYSLLFVG
LLQLNIVIFGALLRPIFIRGPASPKIVIQENRKEAQYMLENEKTRTSIDSIDSGVELTTS
PKNVPTHTNLELEPKADMQQVLVKTSPRPSEKKAPLLDFSILKEKSFICYALFGLFATLG
FFAPSLYIIPLGISLGIDQDRAAFLLSTMAIAEVFGRIGAGFVLNREPIRKIYIELICVI
LLTVSLFAFTFATEFWGLMSCSIFFGFMVGTIGGTHIPLLAEDDVVGIEKMSSAAGVYIF
IQSIAGLAGPPLAGLLVDQSKIYSRAFYSCAAGMALAAVCLALVRPCKMGLCQHHHSGET
KVVSHRGKTLQDIPEDFLEMDLAKNEHRVHVQMEPVVRRRFAAAGPAGLGAAGDSDAFPA
REGPERRAGYSGPAAACFDFSTAAPKREQRGPLSLVGLEM

>gi568815581f:68150188_68356966|GENSCAN_predicted_CDS_3|2463_bp
atggggtttcgccatgttggccaggctggtctcgaactcctgacctcagggctaaaaaat
gtctttgaaggactggaagcagaagagtgtttctacaaagttcaccttgatggaggtgag
gaagaactggagtgggtgagaccagatgcagaaactaccacccaccaagggcgcgtatct
caggaaaagcatcaagctctaggatgcaagtggccaaaactctgtggagatgccctatgc
gtcatcctaagcgcgcccagtcccatcgaccacccaaaggctgaggagtgcaggcgcagg
gcgcgggactggcaggcagctccacctgcagcgccggtgcgggatccactgggtgaagcc
agctgggctcctgagtctgggcagcagatgaaagaagcagctgagttaggagtctcttgc
atgggtccagatctggagaaattaacgctctacgaggttaaattacgcctacaaggttgc
aaagctgctcagcggccccttggctgtacactactggccatacaaggcaccctgtaccaa
cgaatcttctccccacttacacaaccagaacttgtcaacggcaaaggatggcatttaact
caggagagcctatcccaaaatggctctcttgagtttctgacatcagaaccacattctccg
aaccccaatgaaggcagcagccggcgccaaagtctgcatacgaatgcaaataacatggcc
tttgcttccgagcagtttccaaatttgccctcgggatcgggaagccggggatttcccggc
agagcgcgggtcctccacctggactgtcagaactgtcagcggctgacggccggtcccgcg
caacttcaaggttcccgggctgccaaccgcaggaaagcgctggtgtcgccaagcagttcc
gtagcaagggaagatggttttgcagaggaaatggtcttcacctacggcatcatcaagaca
tttggtgtcttctttaatgacttaatggacagttttaatgaatccaatagcaggatctca
tggataatctcaatctgtgtgtttgtcttaacattttcagctcccctcgccacagtcctg
agcaatcgtttcggacaccgtctggtagtgatgttgggggggctacttgtcagcaccggg
atggtggccgcctccttctcacaagaggtttctcatatgtacgtcgccatcggcatcatc
tctgcaatcatggctctgaaggagcgcattggctggagatacagcctcctcttcgtgggc
ctactacagttaaacattgtcatcttcggagcactgctcagacccatctttatcagagga
ccagcgtcaccgaaaatagtcatccaggaaaatcggaaagaagcgcagtatatgcttgaa
aatgagaaaacacgaacctcaatagactccattgactcaggagtagaactaactacctca
cctaaaaatgtgcctactcacactaacctggaactggagccgaaggccgacatgcagcag
gtcctggtgaagaccagccccaggccaagcgaaaagaaagccccgctattagacttctcc
attttgaaagagaaaagttttatttgttatgcattatttggtctctttgcaacactggga
ttctttgcaccttccttgtacatcattcctctgggcattagtctgggcattgaccaggac
cgcgctgcttttttattatctacgatggccattgcagaagttttcggaaggatcggagct
ggttttgtcctcaacagggagcccattcgtaagatttacattgagctcatctgcgtcatc
ttattgactgtgtctctgtttgcctttacttttgctactgaattctggggtctaatgtca
tgcagcatattttttgggtttatggttggaacaataggagggactcacattccactgctt
gctgaggatgatgtcgtgggcattgagaagatgtcttctgcagctggggtctacatcttc
attcagagcatagcaggactggctggaccgccccttgcaggtttgttggtggaccaaagt
aagatctacagcagggccttctactcctgcgcagctggcatggccctggctgctgtgtgc
ctcgccctggtgagaccgtgtaagatgggactgtgccagcatcatcactcaggtgaaaca
aaggtagtgagccatcgtgggaagactttacaggacatacctgaagactttctggaaatg
gatcttgcaaaaaatgagcacagagttcacgtgcaaatggagccggtggtgcgccggcgg
ttcgcagctgctgggcccgccggcctgggcgcagccggggacagcgacgcgtttcctgcc
cgggaagggcccgagcgcagggccggctatagcggtcccgcagctgcctgcttcgatttt
agcactgctgctcccaagagggagcaacgcggccctctgtccctcgtagggcttgaaatg
taa

>gi568815581f:68150188_68356966|GENSCAN_predicted_peptide_4|192_aa
MGPAENPAEEASVDAGYGFPTRLEETGLTRFVDFHAAASTCSPSRASLLTGRLGLRNGVT
RNFAVTSVGGLPLNETTLAEVLQQAGYVTGIIGKWHLGHHGSYHPNFRGFDYYFGIPYSH
DMGCTDTPGYNHPPCPACPQGDGPSRNLQRDCYTDVALPLYENLNIVEQPVNLSSLAQKY
AEKATQFIQRAS

>gi568815581f:68150188_68356966|GENSCAN_predicted_CDS_4|576_bp
atgggacctgcagagaacccagctgaggaagccagtgtcgatgcaggctatggctttccc
accagactggaagaaacaggtcttacgaggtttgtggatttccatgcagctgcctccacc
tgctcaccctcccgggcttccttgctcaccggccggcttggccttcgcaatggagtcaca
cgcaactttgcagtcacttctgtgggaggccttccgctcaacgagaccaccttggcagag
gtgctgcagcaggcgggttacgtcactgggataataggcaaatggcatcttggacaccac
ggctcttatcaccccaacttccgtggttttgattactactttggaatcccatatagccat
gatatgggctgtactgatactccaggctacaaccaccctccttgtccagcgtgtccacag
ggtgatggaccatcaaggaaccttcaaagagactgttacactgacgtggccctccctctt
tatgaaaacctcaacattgtggagcagccggtgaacttgagcagccttgcccagaagtat
gctgagaaagcaacccagttcatccagcgtgcaagn