GENSCAN 1.0 Date run: 7-Nov-116 Time: 02:09:46 Sequence gi568815591f:80546741_80774144 : 227404 bp : 36.19% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1628 1693 66 0 0 74 119 15 0.028 1.16 1.02 Intr + 16915 17270 356 2 2 44 54 168 0.054 3.08 1.03 Intr + 17877 18024 148 2 1 18 95 143 0.562 6.89 1.04 Term + 28491 28654 164 1 2 76 48 111 0.136 3.02 1.05 PlyA + 29295 29300 6 1.05 2.00 Prom + 72393 72432 40 -4.15 2.01 Init + 100001 100120 120 1 0 73 82 218 0.992 19.99 2.02 Intr + 109800 109960 161 2 2 82 109 121 0.905 11.46 2.03 Intr + 114323 114470 148 2 1 75 73 156 0.838 12.12 2.04 Intr + 117666 117757 92 2 2 26 115 64 0.106 0.77 2.05 Intr + 124237 124424 188 1 2 92 113 124 0.998 13.71 2.06 Intr + 125182 125300 119 2 2 74 58 94 0.987 4.26 2.07 Intr + 126030 126103 74 2 2 88 70 62 0.334 1.69 2.08 Intr + 129080 129207 128 2 2 25 70 97 0.300 1.00 2.09 Intr + 130728 130887 160 1 1 32 55 108 0.069 0.02 2.10 Intr + 139208 139326 119 2 2 93 49 71 0.001 2.89 2.11 Intr + 141953 142069 117 0 0 14 98 73 0.009 0.42 2.12 Intr + 145584 145815 232 1 1 37 72 178 0.055 7.01 2.13 Term + 148086 148260 175 0 1 74 39 103 0.405 0.25 2.14 PlyA + 148393 148398 6 1.05 3.00 Prom + 156512 156551 40 -3.95 3.01 Init + 161289 161366 78 2 0 73 116 67 0.936 9.11 3.02 Term + 167441 168289 849 1 0 -17 43 577 0.627 34.56 3.03 PlyA + 168517 168522 6 1.05 4.00 Prom + 168686 168725 40 -6.15 4.01 Sngl + 169637 170608 972 1 0 70 35 364 0.444 25.78 4.02 PlyA + 170647 170652 6 1.05 5.05 PlyA - 171952 171947 6 1.05 5.04 Term - 198567 198154 414 0 0 113 34 361 0.986 27.48 5.03 Intr - 202288 202158 131 2 2 76 105 127 0.996 12.69 5.02 Intr - 204596 204520 77 1 2 111 61 42 0.418 2.04 5.01 Init - 210799 210744 56 1 2 75 89 25 0.187 2.11 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 115800 115588 213 0 0 87 38 302 0.857 20.03 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:80546741_80774144|GENSCAN_predicted_peptide_1|244_aa XSKTLCSSTIALAFTKTLVHFSVADARQLEGERVVPRWRLYAVAQDVVLVWDGVLTTCVG GSPVLCAPLAQKQGAGRDGACWLYAHQGSFCNAGRQESEEILHSHVLVGQLKQNPPVQTC ASKVMWGNAMGTGEAAVWGENGSKNVSTAEAVAERLSLASKGSVWGAAELLLAQSLWQKV ARGPDLEDLPGGYGEMLVGLQECGDAEAVDPQGCGDAEAVDPYPKGRMQPGGVWAFKWCC VAVA >gi568815591f:80546741_80774144|GENSCAN_predicted_CDS_1|735_bp nctagtaaaaccctctgtagcagtaccattgcactggcatttaccaagaccctggtacat ttttctgtggcagatgcaaggcagttggagggggagcgggtggtgccccgatggagactg tatgctgttgcacaggatgtggtgttggtttgggatggggtgctgaccacctgcgtggga ggatcccctgttctctgtgcaccattagcacaaaagcagggtgctggcagggatggggct tgctggctgtatgctcaccaaggctccttttgcaatgctggtcgacaggagtcagaggaa atattgcactcccatgtactggtggggcaattaaagcaaaacccgcctgtacaaacttgt gccagcaaagtgatgtggggaaatgccatgggcacaggggaagctgcagtctggggagaa aatggtagcaagaatgtttccactgcagaggcagtggcagagaggctttcacttgcctct aaaggctctgtctggggagctgctgagttgctactggctcagtcactctggcagaaagtg gctagaggcccagacctggaggacctgcctggtggatatggggagatgttagttgggctc caggagtgtggagatgcagaggctgttgacccccaagggtgtggagatgcagaggctgtt gacccctacccaaagggcaggatgcagcctggtggagtctgggctttcaaatggtgctgt gttgcagttgcttag >gi568815591f:80546741_80774144|GENSCAN_predicted_peptide_2|610_aa MGCDRNCGLIAGAVIGAVLAVFGGILMPVGDLLIQKTIKKQVVLEEGTIAFKNWVKTGTE VYRQFWIFDVQNPQEVMMNSSNIQVKQRGPYTYRVRFLAKENVTQDAEDNTVSFLQPNGA IFEPSLSVGTEADNFTVLNLAVAYNNTADGVYKVFNGKDNISKVAIIDTYKGKRSIYAVF ESDVNLKGIPVYRFVLPSKAFASPVENPDNYCFCTEKIISKNCTSYGVLDISKCKEGRPV YISLPHFLYASPDVSEPIDGLNPNEEEHRTYLDIEPITGFTLQFAKRLQVNLLVKPSEKI QGSQENNPMVLLKVVHTPLNGTWSEGCGNAGPGQMHFSYRSLPAARVNVEHYYRKATNQE STCLERSFASLKVLKAYLVPFLLTGKIKEVSVQDINMDMDETGNHYSEQTTARTENQTLH VLTHRWELNNENTWTEVNYKENSKSSNRKVSNRALWQDGQVGTALSAALSKINTEEIQTT IREYYKHLYANKLENLEEMDKFLNTYTLRSLNQEEIESLNRTITRSEIEAIINSLPTKKS PEPDGFTAKFYQRDMDEAGNHHSRQTNTRTENQTPHVLTHKWELNNENNGLRVGNITHQD LSGDGGVGDG >gi568815591f:80546741_80774144|GENSCAN_predicted_CDS_2|1833_bp atgggctgtgaccggaactgtgggctcatcgctggggctgtcattggtgctgtcctggct gtgtttggaggtattctaatgccagttggagacctgcttatccagaagacaattaaaaag caagttgtcctcgaagaaggtacaattgcttttaaaaattgggttaaaacaggcacagaa gtttacagacagttttggatctttgatgtgcaaaatccacaggaagtgatgatgaacagc agcaacattcaagttaagcaaagaggtccttatacgtacagagttcgttttctagccaag gaaaatgtaacccaggacgctgaggacaacacagtctctttcctgcagcccaatggtgcc atcttcgaaccttcactatcagttggaacagaggctgacaacttcacagttctcaatctg gctgtggcatacaacaatactgcagatggagtttataaagttttcaatggaaaagataac ataagtaaagttgccataatcgacacatataaaggtaaaaggtcaatctatgctgtattt gaatccgacgttaatctgaaaggaatccctgtgtatagatttgttcttccatccaaggcc tttgcctctccagttgaaaacccagacaactattgtttctgcacagaaaaaattatctca aaaaattgtacatcatatggtgtgctagacatcagcaaatgcaaagaagggagacctgtg tacatttcacttcctcattttctgtatgcaagtcctgatgtttcagaacctattgatgga ttaaacccaaatgaagaagaacataggacatacttggatattgaacctataactggattc actttacaatttgcaaaacggctgcaggtcaacctattggtcaagccatcagaaaaaatt caggggtcacaagaaaataacccaatggtccttttgaaagtagtacatacacctttaaat ggaacttggtctgaagggtgtggaaatgctggtccagggcagatgcacttcagctaccgt tccttgcctgccgccagagtaaatgttgagcattactacagaaaagccacaaaccaagaa tctacctgtttggaaagatcttttgcatctctgaaggtgcttaaagcatacttagtgcct ttccttttaactgggaagataaaagaagtatctgtccaagatattaatatggacatggat gaaactggaaaccattattctgagcaaactaccgcaaggacagaaaaccagacactgcat gttctcactcataggtgggaactgaataatgagaacacttggacagaggtgaattacaaa gagaattctaaaagcagcaatagaaaagtatcaaatcgggcactttggcaagatggccaa gtaggaacagctctgtctgcggctctcagcaagatcaacacagaagaaatacaaactacc atcagagaatactataaacacctctatgcaaataaactagaaaatctagaagaaatggat aaattcctgaacacatacaccctccgaagtctaaaccaggaagaaattgaatctctaaat agaacaataacacgttcggaaatcgaggcaataattaatagcctaccaaccaaaaaaagt ccagaaccagatggattcacagccaaattctaccagagggacatggatgaagctggaaac catcattctcggcaaactaacacaagaacagaaaaccaaacaccacatgttctcactcac aagtgggagttgaacaatgagaacaatggactcagggtgggaaacatcacacaccaggac ctgtcaggggatggtggggtaggggatggatag >gi568815591f:80546741_80774144|GENSCAN_predicted_peptide_3|308_aa MGLEEAAFPKDLERMPGRGRGEQSQKEDIQTKGKEVENFEKNLEECITRITNTEKCLKEL MELKTKARELCEECRSLRSRCDQLEERVSAMEDEMNERKREGKFREKRIKRNEQSLQEIW DYVKRPNLCLIGVPESDGENGTKLENTLQDIIQENFPNLARQANVQIQEIQRTPQRYSSR SATPRHIIVRFTEVEMKEKMLRAAREKGRVTLKGKPIRLTADLSAETLQARRPWGPIFNI LKEKNFQPRISYPAKLSFISEGEIKYFTDKQMLRDFVTTRPALKELLKEALNMERNNRYQ LLQNHAKM >gi568815591f:80546741_80774144|GENSCAN_predicted_CDS_3|927_bp atggggctggaagaagcagcatttccaaaggacttagaaagaatgccaggaagaggaaga ggagaacagtcacaaaaggaggacattcaaaccaaaggcaaagaagttgaaaacttcgaa aaaaacttagaagaatgtataactagaataaccaatacagagaagtgcttaaaggagctg atggagctgaaaaccaaggctcgagaactatgtgaagaatgcagaagcctcaggagccga tgcgatcaactggaagaaagggtatcagcaatggaagatgaaatgaatgaaaggaagcga gaagggaagtttagagaaaaaagaataaaaagaaatgagcaaagcctccaagaaatatgg gactatgtgaaaagaccaaatctatgtctgatcggtgtacctgaaagtgacggggagaat ggaaccaagttggaaaacactctgcaggatattatccaggagaacttccccaatctagca aggcaggccaacgttcagattcaggaaatacagagaacgccacaaagatactcctcgaga agtgcaactccaagacacataattgtcagattcaccgaagttgaaatgaaggaaaaaatg ttaagggcagccagagagaaaggtcgggttaccctcaaagggaagcccatcagactaaca gcagatctctcagcagaaaccctacaagccagaagaccgtgggggccaatattcaacatt cttaaagaaaagaattttcaacccagaatttcatatccggccaaactaagcttcataagt gaaggagaaataaaatactttacagacaagcaaatgctaagagattttgtcaccaccagg cctgccctaaaagagctcctgaaggaagcgctaaacatggaaaggaacaaccggtaccag ctgctgcaaaatcatgccaaaatgtaa >gi568815591f:80546741_80774144|GENSCAN_predicted_peptide_4|323_aa MDKFLGTYTLPRLNQEEVESLNRPITGSEIVAIIKSLPTKKSPGPDGFTAEFYQRYKEEL VPFLLKLFQSIEKEGILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKILA NRIQQHIKKLIHHDQVGFIPGMQGLFNIHKSINVIQCINRAKDKNHMIISIDAEKAFDKI QQRFMLKTLNKLGIDGTYFKIIRVIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLIF NIVLEVLSRAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPTVSAQNLLKLISNFSKV SGYKINVHKSQAFLYTNNRQTES >gi568815591f:80546741_80774144|GENSCAN_predicted_CDS_4|972_bp atggataaattcctcggcacatacactctcccaagactaaaccaggaagaagttgaatct ctgaatagaccaataacaggatctgaaattgtggcaataatcaaaagcttaccaaccaaa aagagtccaggaccagatggattcacagccgaattctaccagaggtacaaggaggaactg gtaccattccttctgaaactattccaatcaatagaaaaagagggaatcctccctaactct ttttatgaggccagcatcattctgataccaaagcctggcagagacacaaccaaaaaagag aattttagaccaatatctttgatgaacattgatgcaaaaatcctcaataaaatactggca aacagaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggcttcatccct gggatgcaaggcttgttcaatatacacaaatcaataaatgtaatccagtgtataaacaga gccaaagacaaaaaccacatgattatctcaatagatgcagaaaaggcctttgacaaaatt caacaacgcttcatgctaaagactctcaataaattaggtattgatgggacatatttcaaa ataataagagttatttatgacaaacccacagccaatatcatactgaatgggcaaaaactg gaagcattccctttgaaaactggcacaagacagggatgccctctctcaccactcatattc aacatagttttggaagttctgtccagggcaatcaggcaggagaaggaaataaagggtatt caattaggaaaagaggaagtcaaattgtccctctttgcagacgacatgattgtatatcta gaaaaccccactgtctcagcccaaaatctccttaagctgataagcaacttcagcaaagtc tcaggatacaaaatcaatgtacacaaatcacaagcattcttatacaccaataacagacaa acagagagctaa >gi568815591f:80546741_80774144|GENSCAN_predicted_peptide_5|225_aa MDTLICFQNLDDHALSLEWRSRRQDVRHGNPLTQCRGFNLKGIKTYRNAAEIVQYGVKNN TTFLECAPKSPQASIKWLLQKDKDRRKEVKLNERIIATSQGLLIRSVQGSDQGLYHCIAT ENSFKQTIAKINFKVLDSEMVAVVTDKWSPWTWASSVRALPFHPKDIMGAFSHSEMQMIN QYCKDTRQQHQQGDESQKMRGDYGKLKALINSRKSRNRRNQLPES >gi568815591f:80546741_80774144|GENSCAN_predicted_CDS_5|678_bp atggataccttgatttgctttcaaaaccttgatgaccatgctttatccttggagtggagg agccgaagacaagatgtgagacatggaaacccactgactcaatgcagaggatttaatcta aaaggtattaaaacatacagaaatgcagctgaaattgtccagtatggagtaaaaaataac accacttttctggagtgtgcccccaagtctccgcaggcatctatcaagtggctgttacag aaagacaaagacaggaggaaagaggttaagctgaatgaacgaataatagccacttcacag ggactcctgatccgctctgttcagggttctgaccaaggactttatcactgcattgctaca gaaaatagtttcaagcagaccatagccaagatcaacttcaaagttttagattcagaaatg gtggctgttgtgacggacaaatggtccccatggacctgggccagctctgtgagggcttta cccttccacccgaaggacatcatgggggcattcagccactcagaaatgcagatgattaac caatattgcaaagacactcggcagcaacatcagcagggagatgaatcacagaaaatgaga ggggactatggcaagttaaaggccctcatcaatagtcggaaaagtagaaacaggaggaat cagttgccagagtcataa