GENSCAN 1.0 Date run: 5-Nov-116 Time: 14:03:59

Sequence gi568815583f:45487444_45706511 : 219068 bp : 42.99% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.04 Intr -    189     84  106  0  1   73  116    -1 0.015   0.07
 1.03 Intr -   1524   1398  127  2  1   78   85    49 0.032   3.36
 1.02 Intr -   3438   3285  154  1  1   82   75    51 0.017   1.41
 1.01 Init -  34911  34521  391  0  1   85   95   439 0.606  41.28
 1.00 Prom -  47472  47433   40                              -4.85

 2.00 Prom +  57009  57048   40                              -5.75
 2.01 Init +  59870  59973  104  1  2   75   89    95 0.620   8.06
 2.02 Intr +  68030  68492  463  2  1   61   28   289 0.069  12.43
 2.03 Intr +  77263  77420  158  0  2   93   36   111 0.182   4.29
 2.04 Intr +  77991  78184  194  0  2   42   87    58 0.141  -0.69
 2.05 Term +  79449  79633  185  1  2  -27   45   196 0.232   0.52
 2.06 PlyA +  79723  79728    6                               1.05

 3.00 Prom +  80218  80257   40                              -6.45
 3.01 Init +  86383  86461   79  0  1   43   89    86 0.508   5.47
 3.02 Intr +  87228  87312   85  1  1   83   59    59 0.086   0.46
 3.03 Intr +  99657 100082  426  0  0   31   92   342 0.172  20.88
 3.04 Intr + 100253 100353  101  2  2   26  104    11 0.637  -4.67
 3.05 Intr + 104692 104833  142  2  1   59   87   151 0.835  10.59
 3.06 Term + 107613 107721  109  0  1   83   53    61 0.779  -0.90
 3.07 PlyA + 108183 108188    6                               1.05

 4.00 Prom + 110191 110230   40                              -4.35
 4.01 Sngl + 110663 112084 1422  1  0   16   47   419 0.717  26.23
 4.02 PlyA + 112120 112125    6                              -0.45

 5.00 Prom + 112413 112452   40                              -3.45
 5.01 Init + 118036 118071   36  0  0   56   87     8 0.239  -2.44
 5.02 Term + 118952 119071  120  1  0   68   48   165 0.986   7.99
 5.03 PlyA + 121417 121422    6                               1.05

 6.00 Prom + 130088 130127   40                              -4.35
 6.01 Init + 131721 131857  137  2  2   55   66   100 0.553   4.16
 6.02 Intr + 133752 133899  148  0  1   83   34    96 0.477   3.02
 6.03 Intr + 134042 134127   86  1  2   31  115    63 0.466   1.00
 6.04 Term + 134282 134390  109  2  1   50   42   101 0.439  -1.30
 6.05 PlyA + 135038 135043    6                               1.05

 7.00 Prom + 137374 137413   40                              -3.65
 7.01 Init + 147009 147176  168  2  0   73   97   165 0.867  15.56
 7.02 Intr + 158718 158818  101  2  2   50   56    94 0.013   0.39
 7.03 Intr + 161155 161310  156  1  0   22   72   166 0.066   6.60
 7.04 Intr + 171543 171714  172  0  1   66   99   121 0.092  10.02
 7.05 Intr + 174512 174682  171  1  0   56   86   168 0.979  12.62
 7.06 Intr + 182485 182538   54  0  0  109   81    50 0.932   4.66
 7.07 Intr + 184576 184707  132  0  0   67   36    92 0.683   1.82
 7.08 Intr + 186164 186358  195  1  0  100   91   125 0.992  12.69
 7.09 Intr + 188658 188867  210  2  0   69   70   293 0.959  23.79
 7.10 Intr + 195035 195218  184  1  1   57   97   128 0.946   9.04
 7.11 Intr + 196084 196199  116  2  2   74   69    46 0.731   0.55
 7.12 Intr + 201596 201774  179  1  2   11  115   171 0.915   9.90
 7.13 Intr + 203462 203501   40  2  1   81   88   -34 0.513  -6.79
 7.14 Intr + 205877 205945   69  1  0  114  115    59 0.761   9.66
 7.15 Intr + 208757 208823   67  1  1   95   64   123 0.570   8.16
 7.16 Term + 216836 217023  188  0  2   74   43   125 0.177   3.27
 7.17 PlyA + 217800 217805    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl - 144433 144071  363  1  0   43   37   236 0.993   9.83

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815583f:45487444_45706511|GENSCAN_predicted_peptide_1|260_aa
MAGSGAWKRLKSMLRKDDAPLFLNDTSAFDFSDEAGDEGLSRFNKLRVVVADDGSEAPER
PVNGAHPTLQADDDSLLDQDLPLTNSQLSLKVDSCDNCSKQREILKQRKVKARLTIAAVL
YLLFMIGELVEVLSAMISVLLVYILMGFLLYEAVQRTIHMNYEINGDIMLITAAVGVAVN
VIGSGCERNHGQDSLAVRAAFVHALGDLVQSVGVLIAAYIIRFKPEYKIADPICTYVFSL
LVAFTTFRIIWDTVVIILEX
>gi568815583f:45487444_45706511|GENSCAN_predicted_CDS_1|780_bp
atggccggctctggcgcgtggaagcgcctcaaatctatgctaaggaaggatgatgcgccg
ctgtttttaaatgacaccagcgcctttgacttctcggatgaggcgggggacgaggggctt
tctcggttcaacaaacttcgagttgtggtggccgatgacggttccgaagccccggaaagg
cctgttaacggggcgcacccgaccctccaggccgacgatgattccttactggaccaagac
ttacctttgaccaacagtcagctgagtttgaaggtggactcctgtgacaactgcagcaaa
cagagagagatactgaagcagagaaaggtgaaagccaggttgaccattgctgccgttctg
tacttgcttttcatgattggagaacttgtagaggttttgtcagctatgattagtgtgctg
ttggtgtatatacttatgggattcctcttatatgaagctgtgcaaagaactatccatatg
aactatgaaataaatggagatataatgctcatcaccgcagctgttggagttgcagttaat
gtaataggttctgggtgtgaacgtaaccatgggcaggatagcctggcagtgagagctgca
tttgtacatgctttgggagatttggtacagagtgttggtgtgctaatagctgcatacatc
atacgattcaagccagaatacaagattgctgaccccatctgtacatacgtattttcatta
cttgtggcttttacaacatttcgaatcatatgggatacagtagttataatactagaagnn
>gi568815583f:45487444_45706511|GENSCAN_predicted_peptide_2|367_aa
MRASLDQKRRGHPAGSGGGEVNGRSAMAANSSGGCALDFADESGSVSCKDMHLLLWLQKR
IEMHKAEQCEEEEAMTPRPTKARAPLPSAYVPPLSLPPCPRERLKGMLKEIKPRLSRNCR
EDPQGCLLNLLLQSHSRSPERPLQRRERRYLQRRREKLMLARRGITLQKMEMPKQTRHRK
LKVLEMPSEHPIKASSLELLIVSVIDIPYSEQQDLDRTLGVSVTSSQPSGKHSHKLSNCN
TGWGLVGGDLMWVVSHGLAPSPVLSHDRVLMRPDVLKVCGSSPLVLSPSCPAMLRLACFF
FTFRHDLGSQIPYTEQVSKAMLELKALKSSDLTEVVVYGSYLYKLWTKWMLQSVAEWFQL
GLNSAPV
>gi568815583f:45487444_45706511|GENSCAN_predicted_CDS_2|1104_bp
atgcgtgcctcgctggatcagaaacgcagaggacaccctgccggatctggagggggggaa
gtcaatggcaggtctgcgatggcagcgaacagcagtggtggatgtgcacttgattttgca
gatgagtctggaagtgttagttgcaaagatatgcatttacttctctggctacaaaaaaga
atagaaatgcacaaagcagagcagtgtgaagaagaagaggcgatgacccctagaccgacc
aaagcccgtgctccactgcccagtgcctatgtcccaccactgtcgctgccaccctgcccg
agagaaaggctgaaggggatgctaaaggagataaaaccaaggttaagtaggaactgcaga
gaagatccacaaggttgtctgctaaacctgctcctccaaagccacagccgaagcccagaa
aggcctctgcaaagaagagagagaaggtacctacagagaagaagggaaaagctgatgctg
gcaaggaggggaataaccctgcagaaaatggagatgccaaaacagactaggcacagaaag
ctaaaggtgctggagatgccaagtgaacacccaattaaagcctcttccctggaactactc
attgtctcagtgattgacattccatacagtgagcaacaggacctagaccgaacccttggc
gtttctgtaacaagttcacaaccaagtgggaaacatagccacaaacttagtaactgtaat
acagggtggggcctggtgggaggcgatttgatgtgggtggtttctcatggtttagcacca
tccccagtgctgtctcatgatagagttctcatgagacctgatgttttaaaagtgtgtggc
agttccccacttgttctgtctccttcctgccctgccatgttaaggcttgcctgcttcttc
ttcaccttccgccatgatttagggtcacagatcccttacactgagcaggtgagtaaggcc
atgcttgagttgaaggctctgaaatcttcagacctcactgaggtcgtggtttatggctcc
tatttgtacaagctctggaccaagtggatgctgcagtccgtggctgagtggttccagctt
ggtttaaactctgctccagtctga
>gi568815583f:45487444_45706511|GENSCAN_predicted_peptide_3|313_aa
MYADKWRVSKTKKNFTEGQNSSEETRGTTQCLPAVAGILAMATLDESLLLSFLWRQRSSG
LRDPPREQPKPTTVLPQLPPIGSQGLEPHTAESADTASGARRRYQRGVPTTPWGRANSRA
RDRSPQFFRPRHFRVRQSLLSGQPLESLGAALLLTSHTFASSLCSQLEGHECPWAVVSGR
GPDTATLLPGGRGADACAMGGTEVPRGSFVLGVQLKAVDGEAILSLCVDSGLSDTSPDEG
LIEDLTIEDKAVEQLAEGLLSHYLPDLQRSKQALQELTERKPPLPAHQCVQQLGVRGCSE
PWSCHCALAWVTE
>gi568815583f:45487444_45706511|GENSCAN_predicted_CDS_3|942_bp
atgtatgcagacaagtggagggtgagcaagacgaagaagaactttactgagggtcagaac
agctcagaggagacccgcggaactacccagtgcctccctgctgtagctggcatcttggca
atggccactttagatgagtcgctgctgctatcatttctatggagacaacgctcttccggt
cttcgtgacccaccccgtgagcagcccaaaccgaccaccgtcctcccccagctcccacca
atcggatcgcagggactcgagccccacactgctgagtccgctgacactgcgtccggggcc
agacgacgatatcagcgcggggtccccacaacgccatggggcagagccaactctcgagcg
cgtgatcgaagcccgcagttttttcgcccccgtcacttccgggtgcgacaatctcttctg
tccggccagccgctggagtcgttaggtgccgccttgcttctgacgagccacacgtttgct
tcttccctgtgttcccagctggagggacatgagtgtccctgggccgtcgtctccggacgg
ggccctgacacggccaccctactgcctggaggccggggagccgacgcctgcgcaatgggc
ggaactgaagtcccgaggggaagctttgtattgggggtgcaattgaaggcggttgatggt
gaagccattctgagtttatgtgttgactccggtttaagtgacacttctccagatgaaggg
ttaatagaggacttgactatagaagacaaagcagtggagcaactggcagaaggattgctt
tctcattatttgccagatctgcagagatcaaaacaagccctccaggaactcactgagagg
aaacccccgctgccagcacatcagtgtgttcagcagcttggtgttcgaggctgcagtgaa
ccatggtcatgccactgtgccctagcctgggtgacagagtga
>gi568815583f:45487444_45706511|GENSCAN_predicted_peptide_4|473_aa
MQKAFDKIQQPFMLKTLNKLGIDGTYFKIIAIYDKPTANIILNGQKLEAFPLKTGTRQGC
PLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKL
ISNFSKVSGYKINVQKSQAFLYPNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLF
KENYKPLLKEIKEDTNKWKNIPCSWVGRINIMKMAILPKVIYRFNAIPIKLPMTFFTELE
KTTLKFIWNQKRACIAKSILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQNRDINQWNR
TEPSEIMPHIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINS
RWIKDLNVRPKTIKTLEENLGITIQDIGMGKDFMSKTPKAMATKAKIDKWDLIKLKSFCT
AKETTIRVNRQPTTWEKIFATYSSDKGLIPRIYNELKQIYKKKTTPSKSGRRT
>gi568815583f:45487444_45706511|GENSCAN_predicted_CDS_4|1422_bp
atgcaaaaagcctttgacaaaattcaacaacccttcatgctaaaaactctcaataaatta
ggtattgatgggacgtatttcaaaataatagctatctatgacaaacccacagccaatatc
atactgaatgggcaaaaactggaagcattccctttgaaaactggcacaagacagggatgc
cctctctcaccgctcctattcaacatagtgttggaagttctggccagggcaatcaggcag
gagaaggaaataaagggtattcaattaggaaaagaggaagtcaaattgtccctgtttgca
gacgacatgattgtttatctagaaaaccccatcgtctcagcccaaaatctccttaagctg
ataagcaacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattc
ttataccccaacaacagacaaacagagagccaaatcatgagtgaactcccattcacaatt
gcttcaaagagaataaaatacctaggaatccaacttacaagggatgtgaaggacctcttc
aaggagaactacaaaccactgctcaaggaaataaaagaggatacaaacaaatggaagaac
attccatgctcatgggtaggaagaatcaatatcatgaaaatggccatactgcccaaggta
atttacagattcaatgccatccccatcaagctaccaatgactttcttcacagaattggaa
aaaactactttaaagttcatatggaaccaaaaaagagcctgcatcgccaagtcaatccta
agccaaaagaacaaagctggaggcatcacactacctgacttcaaactatactacaaggct
acagtaaccaaaacagcatggtactggtaccaaaacagagatataaatcaatggaacaga
acagagccctcagaaataatgccgcatatctacaactatctgatctttgacaaacctgag
aaaaacaagcaatggggaaaggattccctatttaataaatggtgctgggaaaactggcta
gccatatgtagaaagctgaaactggatcccttccttacaccttatacaaaaatcaattca
agatggattaaagatttaaacgttagacctaaaaccataaaaaccctagaagaaaaccta
ggcattaccattcaggacataggcatgggcaaggacttcatgtccaaaacaccaaaagca
atggcaacaaaagccaaaattgacaaatgggatctaattaaactaaagagcttctgcaca
gcaaaagaaactaccatcagagtgaacaggcaacctacaacatgggagaaaattttcgca
acctactcatctgacaaagggctaatacccagaatctacaatgaactcaaacaaatttac
aagaaaaaaacaaccccatcaaaaagtgggcgaaggacatga
>gi568815583f:45487444_45706511|GENSCAN_predicted_peptide_5|51_aa
MLMLHEKTSKLKKRALKLQQKRQKEELEREQQREKEFEREKQLTARPAKRM
>gi568815583f:45487444_45706511|GENSCAN_predicted_CDS_5|156_bp
atgctgatgcttcatgaaaaaacatcaaagttaaaaaaaagagcacttaaactgcagcag
aagaggcaaaaagaagagttggaaagggagcagcaacgagagaaggagtttgaaagagaa
aagcagttaactgccagaccagccaaaaggatgtga
>gi568815583f:45487444_45706511|GENSCAN_predicted_peptide_6|159_aa
MLLASITGSTNCSGKGLNILDFASHTLFVATAQLSCCSTKATIDDMLCMIGGPDSQREHT
LARDIIKVPLNYKLWLLTGCFVLFVFRHQQAREELIDAATSALEEYDYREFRPLRNESLG
YTTSCPLGRKAHEDYGVAAPCTCVEKYIYTAQGVDCGGH
>gi568815583f:45487444_45706511|GENSCAN_predicted_CDS_6|480_bp
atgcttttagcatctataacgggatcaacaaactgttcaggtaaagggctaaatatttta
gactttgcaagtcatacactctttgtggcaactgctcagctcagctgttgtagcacaaaa
gcaaccatagacgatatgctctgcatgattggaggtcctgattctcaaagggaacacact
cttgccagggatataataaaggtgccactgaactataagttgtggctgctgacagggtgc
tttgtactttttgtgttcaggcaccagcaggcaagagaggagctgatagacgcggcaacc
tcagccttagaagagtatgattatcgtgagttcagacccctcaggaatgaaagtttgggt
tacaccaccagttgccccttaggaaggaaggcccatgaggactatggagtagctgctcct
tgcacctgtgtggagaagtacatctatacggctcaaggggtagattgtggtggccattag
>gi568815583f:45487444_45706511|GENSCAN_predicted_peptide_7|733_aa
MNMGWDTVEASGWKPAYPCLRVLKEGLQRTPKLNLGHLLCEMSAGIPVFLGSCVNPRPAP
LGDSMWGHPEMQDSSVLPYRSREAIPGAGRKPKWEDLVEEEEEFSNSPSTWAPGVSLTCA
VYTVLIRALRQEATEIQTNSSQLGTQQVGPLQLHTGASHAARNHYEVLVLGGGSGGITMA
ARMKRKVGAENVAIVEPSERHFYQPIWTLVGAGAKQLSSSGRPTASVIPSGVEWIKARVT
ELNPDKNCIHTDDDEKISYRYLIIALGIQLDYEKVFLKGAWLPLYGNFSINDSFLKLEKT
NEGDKTSTACFCLFISWIIKGLPEGFAHPKIGSNYSVKTVEKTWKALQDFKEGNAIFTFP
NTPVKCAGAPQKIMYLSEAYFRKTGKRSKANIIFNTSLGAIFGVKKYADALQEIIQERNL
TVNYKKNLIEVRADKQEAVFENLDKPGETQVISYEMLHVTPPMSPPDVLKTSPVADAAGW
VDVDKETLQHRRYPNVFGIGDCTNLPTSKTAAAVGTVFSGCYKTQDLQYGHVLCELDTCG
TDCVTFADYFNLKYDGYTSCPLVTGYNRVILAEFDYKAEPLETFPFDQSKERLSMYLMKA
DLMPFLYWNMMLSILLTEVAPSLLLQGLGRLGDNPVGTLSLDVDKSAGKTFSTGKFPVLQ
IRQCRVVGKSAGLVSGFCDRTVQVMVRGEGARKRLARSWSPGCGRPLRAAKPEGSRSFLA
FYRPCLETLGLTS
>gi568815583f:45487444_45706511|GENSCAN_predicted_CDS_7|2202_bp
atgaatatgggatgggacactgtggaggccagtggctggaagccagcttacccctgcctg
cgcgttttgaaggaagggctccagcgcacaccaaagctgaacctcggtcaccttctctgt
gaaatgagcgcgggaatacctgtgtttctgggcagctgtgtgaacccgaggccggcccct
ctaggagacagcatgtggggccacccagagatgcaggactcttctgttctgccctatcgc
agcagagaggccatccctggagctggaagaaagcccaagtgggaggatttagttgaggag
gaggaggagttttccaacagtccctccacctgggcaccaggtgtaagcctgacctgtgcc
gtgtacacagtgctcattcgtgccctgagacaggaagctacagagatccagaccaacagc
tcacagctgggcactcagcaggtcggcccccttcagctgcacaccggggccagccatgcg
gccaggaaccattatgaggtgctggtgctgggtgggggcagtggcggaatcaccatggct
gcccgcatgaagaggaaagtgggtgcagagaatgtggccattgttgagcccagtgagaga
catttctaccagccaatctggacactggtgggtgctggtgccaaacaattgtcctcatct
ggtcgtcccacggcaagtgtgattccatctggtgtagaatggatcaaagctagagtgact
gagttgaacccagacaagaactgcattcacacagatgacgacgagaagatctcctaccga
tatcttattattgctctcggaatccagctggactatgagaaggttttcctgaaaggagct
tggttaccactttacggcaacttttccatcaacgacagcttcttaaaactagagaaaact
aatgaaggtgataagacatcaactgcttgtttctgcttgtttatctcttggataattaaa
ggcctacctgaaggtttcgctcatcccaaaatagggtcgaattattcagttaagactgta
gagaagacatggaaagctctgcaggacttcaaagagggcaatgccatcttcaccttccca
aatactccagtgaagtgtgctggagcccctcagaagatcatgtacttatcagaagcctac
ttcaggaagacagggaagcgatccaaggccaatatcattttcaacacttctcttggagcc
attttcggggttaagaagtatgcagatgccctgcaggagatcatccaggagcggaacctc
actgttaactacaagaaaaacctcattgaagtccgagccgataaacaagaggctgtattt
gagaacctggacaaaccaggagagacccaagtgatttcatatgaaatgcttcatgtcaca
cctccaatgagcccaccagatgtcctcaagaccagtcctgtggctgatgctgctggttgg
gtggatgtggataaagaaactctgcaacacaggaggtacccaaatgtgtttgggattggg
gactgcaccaaccttcctacgtcaaagaccgctgctgcagtaggaactgtcttctctgga
tgctataaaacacaagaccttcagtatggtcatgttttgtgtgaattagacacctgtgga
acagattgtgttacttttgctgactatttcaacctcaagtatgatggctacacatcatgt
ccactggtgaccggctacaaccgtgtgattcttgctgagtttgactacaaagcagagccg
ctagaaaccttcccctttgatcaaagcaaagagcgcctttccatgtatctcatgaaagct
gacctgatgcctttcctgtattggaatatgatgctaagcatcctcctgactgaagtcgcc
ccatctctcttgctgcagggtctaggaaggcttggggataatccagttgggactctctcc
ctggatgtagacaaaagtgctggcaagactttcagcactgggaaattcccagtgttacag
attcgccaatgtcgggtagtgggtaaaagtgcagggcttgtcagtgggttctgtgacaga
accgtgcaagtaatggtacgcggagagggcgcgcgaaagcgcctggctcgttcctggagc
ccgggctgcggacgccccctgcgtgctgcgaaacctgaggggtctcggtccttcctggcc
ttttataggccgtgcctagaaacgcttggcttgaccagttag