GENSCAN 1.0	Date run: 6-Nov-116	Time: 04:28:21

Sequence gi568815593f:71619294_71820612 : 201319 bp : 40.77% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   1600   1707  108  2  0   82   47    77 0.197   2.34
 1.02 Intr +   7264   7431  168  2  0   69   43   107 0.191   3.30
 1.03 Intr +  11829  11986  158  1  2   82   78    71 0.396   4.31
 1.04 Intr +  12828  12892   65  2  2  100   67   126 0.955   8.30
 1.05 Intr +  15194  15358  165  2  0   -5   72   183 0.813   5.55
 1.06 Intr +  15650  15749  100  2  1   76   76    55 0.976   2.29
 1.07 Intr +  15858  15953   96  2  0   95   87   148 0.449  14.69
 1.08 Intr +  21710  21782   73  1  1  100   75    52 0.415   3.16
 1.09 Term +  23160  23275  116  1  2   85   36    66 0.464  -1.15
 1.10 PlyA +  23543  23548    6                               1.05

 2.00 Prom +  24782  24821   40                              -3.95
 2.01 Init +  28799  28940  142  1  1   40  109   141 0.894  11.74
 2.02 Intr +  29804  29960  157  0  1   84   82   185 0.989  15.65
 2.03 Intr +  30776  30890  115  2  1  121   67    95 0.997  10.23
 2.04 Intr +  33376  33461   86  0  2   88   81   157 0.993  12.70
 2.05 Term +  37450  37567  118  1  1   95   36    99 0.914   2.43
 2.06 PlyA +  37572  37577    6                               1.05

 3.04 PlyA -  37726  37721    6                               1.05
 3.03 Term -  43361  43285   77  0  2  129   43    55 0.524   2.12
 3.02 Intr -  43888  43687  202  1  1   48   47    67 0.322  -3.46
 3.01 Init -  47524  47372  153  1  0   44   90   172 0.945  13.03
 3.00 Prom -  50609  50570   40                              -2.85

 4.07 PlyA -  50673  50668    6                               1.05
 4.06 Term -  51441  50929  513  0  0   41   43   237 0.168   7.86
 4.05 Intr -  58449  58367   83  1  2   76  110    50 0.573   4.44
 4.04 Intr -  65796  65758   39  1  0  108   93    22 0.002   1.98
 4.03 Intr -  78981  78776  206  2  2  100   37   148 0.314   8.82
 4.02 Intr -  85158  85119   40  1  1   54   86    49 0.184  -2.34
 4.01 Init -  86924  86873   52  2  1   37  110    36 0.337   2.07
 4.00 Prom -  88398  88359   40                              -4.35

 5.00 Prom +  93491  93530   40                              -4.55
 5.01 Init + 100001 100163  163  1  1   89   -9   343 0.906  22.64
 5.02 Intr + 100256 100406  151  0  1   80   61   104 0.838   5.20
 5.03 Intr + 100514 100675  162  2  0   23   76   165 0.094   6.97
 5.04 Intr + 111441 111612  172  0  1   80   68    80 0.001   4.22
 5.05 Intr + 129473 129541   69  1  0   36   95    82 0.001   2.16
 5.06 Intr + 136837 136931   95  0  2   69   90    76 0.016   3.74
 5.07 Intr + 141422 142978 1557  2  0   10   29   664 0.006  40.11
 5.08 Intr + 159631 159688   58  1  1   91   89    14 0.096  -0.23
 5.09 Intr + 163059 163359  301  2  1  104   73   135 0.082   8.98
 5.10 Intr + 164927 164961   35  0  2  134   67    22 0.073   1.72
 5.11 Term + 167611 167736  126  0  0  112   38    69 0.149   1.60
 5.12 PlyA + 168355 168360    6                               1.05

 6.03 PlyA - 168462 168457    6                               1.05
 6.02 Term - 171668 171511  158  0  2  110   51    99 0.065   5.51
 6.01 Init - 175056 174981   76  0  1   65   48    59 0.008   1.22
 6.00 Prom - 176484 176445   40                              -5.65

 7.04 PlyA - 176601 176596    6                               1.05
 7.03 Term - 178614 177748  867  0  0   36   41   407 0.141  22.48
 7.02 Intr - 183979 183734  246  1  0  -21  100   301 0.033  17.13
 7.01 Intr - 190907 190770  138  2  0   10   70   157 0.066   5.84

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593f:71619294_71820612|GENSCAN_predicted_peptide_1|349_aa
XQAHLTGDWVDGSLSVEFFGVHLCPDADAGPCDVCPVFVGFWIKHYRSHCMSCISCVCRV
LGFQIAVVMGSCTAGGAYVPAMADENIIVRKQEFVIVNGIFQHVIEYSKMNKSQFLPPRN
SQHRGDDRHNMHITELHSDTCRVGGVKAATGEEVSAEDLGGADLHCRSSVTYTVYAVTNY
IMKVCRLWLAFITRNSAYGTGDEYLIEAFVSKAPWSSRGRLRKSGVSDHWALDDHHALHL
TRKVVRNLNYQKKLDVTIEPSEEPLFPADELYGIVGANLKRSFDVREVIARIVDGSRFTE
FKAFYGDTLVTGLIRQLVKTVPLWVSLVVWGHVPYSAHPVCAWRSLSGA

>gi568815593f:71619294_71820612|GENSCAN_predicted_CDS_1|1050_bp
nagcaggcccatctcactggtgactgggtggacggtagtctctcagtggagttctttgga
gttcacctctgtcctgatgcagatgcaggtccctgtgatgtctgtcctgtgtttgtggga
ttctggatcaaacactatagaagtcactgcatgagctgcatctcatgtgtttgtcgtgtg
cttggattccagatcgcagtggtcatgggctcctgcaccgcaggaggagcctatgtgcct
gccatggctgatgaaaacatcattgtacgcaagcaggagtttgtgatagtaaatggaatt
tttcagcatgttattgagtattcaaagatgaataaatcacagttcctgccaccaaggaat
tcacagcatagaggagatgacagacataacatgcacattactgaactacattctgacaca
tgccgtgttggaggtgttaaagcggcaactggggaagaagtatctgctgaggatcttgga
ggtgctgatcttcattgcaggagctcagttacgtacacagtttatgccgtaaccaactac
atcatgaaagtgtgccgtctgtggttggccttcatcacacgcaattctgcttacggtaca
ggggatgagtacttaatagaagcctttgtcagcaaggccccttggtctagcagaggaagg
ttaagaaagtctggagtaagtgaccactgggctttggatgatcatcatgcccttcactta
actaggaaggttgtgaggaatctaaattatcagaagaaattggatgtcaccattgaacct
tctgaagagcctttatttcctgctgatgaattgtatggaatagttggtgctaaccttaag
aggagctttgatgtccgagaggtcattgctagaatcgtggatggaagcagattcactgag
ttcaaagccttttatggagacacattagttacaggtttgatcaggcagttggtgaagacc
gtccctctatgggtgtccctagtagtgtggggacatgttccctactcagcgcatcctgtc
tgtgcctggagatccctaagtggggcctag

>gi568815593f:71619294_71820612|GENSCAN_predicted_peptide_2|205_aa
MVPSKGLCCHSCGDCRRHTCSGGTGWVELDDQILEFGKPGIVAVNEAGFMVGREYEAEGI
AKDGAKMVAAVACAQVPKITLIIGGSYGAGNYGMCGRAYSPRFLYIWPNARISVMGGEQA
ANVLATITKDQRAREGKQFSSADEAALKEPIIKKFEEEGNPYYSSARVWDDGIIDPADTR
LVLGLSFSAALNAPIEKTDFGIFRM

>gi568815593f:71619294_71820612|GENSCAN_predicted_CDS_2|618_bp
atggtgccaagtaaagggctgtgctgccattcctgtggtgactgcagaaggcacacgtgc
tctggaggaactggttgggttgaattggatgaccagatactggagtttggcaaaccaggg
atagtggcagtcaatgaggcaggatttatggttggtagagagtatgaagctgaaggaatt
gccaaggatggtgccaagatggtggccgctgtggcctgtgcccaagtgcctaagataacc
ctcatcattgggggctcctatggagccggaaactatgggatgtgtggcagagcatatagc
ccaagatttctctacatttggccaaatgctcgtatctcagtgatgggaggagagcaggca
gccaatgtgttggccacgataacaaaggaccaaagagcccgggaaggaaagcagttctcc
agtgctgatgaagcggctttaaaagagcccatcattaagaagtttgaagaggaaggaaac
ccttactattccagcgcaagggtatgggatgatgggatcattgatccagcagacaccaga
ctggtcttgggtctcagttttagtgcagccctcaacgcaccaatagagaagactgacttc
ggtatcttcaggatgtaa

>gi568815593f:71619294_71820612|GENSCAN_predicted_peptide_3|143_aa
MADNQRNWLGGSTLIQERIKAYAEASSVITEEDLAFDEAESMRFIEQLDVRGSLLATLAK
TVQNRRGVVPKENESTVTKRRQRGTKRLKITLKSLTRYCTRPGATSSYLCVPSRSPTHSS
YSALVAELERNRSTPRPVFSPRP

>gi568815593f:71619294_71820612|GENSCAN_predicted_CDS_3|432_bp
atggccgataaccagagaaactggttagggggctctacactgatccaagagagaataaag
gcctacgctgaggctagttcagtgataacagaagaggacttagcttttgatgaggcagaa
tcaatgagatttatagagcaactggatgtcaggggctctttgcttgccacactggccaag
actgtgcagaacaggagaggggtagttcccaaagagaatgaaagtacagttaccaaaaga
agacagaggggtaccaagaggctgaagataaccctcaagtcactgaccagatactgcacc
aggccaggagccacgtcttcttatctttgtgtaccatcccgaagcccgacacatagttca
tacagcgcgttagtagctgaactagagcgaaaccggtccacaccccgcccagtgttctct
ccacgaccgtga

>gi568815593f:71619294_71820612|GENSCAN_predicted_peptide_4|310_aa
MNSILRGTDSIGAGQLPAICGPESLVPVWSGKHVAWHTPDTAEPFTRHSAALGQEGSDLR
PFSGVEKEAQVGTAKSGKLAPDLATRGALLTAGYLSSTAGKSGISSWFGSIAVIDLGRKQ
GEIEVDLKDNRYSLIRKLPQCKVQRCSFKAVGPWLNGRIKEIDPSSCSASSVLASSSNWQ
QQGFTDPRVTCPHDQIPEKKEIKFSSFSLGQRNVSYNRRSTDIPSCQSLEKRMGSFSPVR
PIWSCGFTGFPRHVDSFVGMNNKRKNNWSSAGEKQEQMLGRHLTDTRSMITLNPKNGNES
RKSNYIHGYF

>gi568815593f:71619294_71820612|GENSCAN_predicted_CDS_4|933_bp
atgaactccatcctcaggggtactgattcaattggtgcagggcagttgccagccatctgt
ggtccagagtccctggtgccagtgtggagtgggaagcatgtggcttggcatacgccagac
accgccgagcccttcacccgccactcagcagctctaggccaggagggctctgatttgcgc
cctttctctggtgtggaaaaggaagctcaggttgggacggcaaaatcaggcaagctggct
ccggatctcgccaccaggggcgctcttctgaccgcagggtacctcagctccaccgcaggg
aaatcaggaatttcttcttggtttggatccattgctgttattgatctagggagaaaacaa
ggggagattgaagtggatcttaaggacaacagatattctcttataaggaaactcccccag
tgtaaagtccagagatgcagcttcaaggcagtgggtccatggctcaatggcaggatcaag
gagatagatccctcatcctgctcggcctcctcagtgttggcttcatcctccaactggcag
caacaaggcttcacagatcccagagtcacatgtccacatgaccagatcccggagaagaaa
gagatcaaattttctagcttttccttggggcaaagaaatgtctcctacaaccgccgctca
acagacatcccttcatgccaatccctggaaaagagaatgggaagtttttcaccagtcagg
cccatctggagctgtgggttcactggcttcccaaggcatgtggattcatttgtaggtatg
aataacaaaagaaaaaacaactggagttctgcaggagagaaacaggaacagatgctgggc
agacacctaacagacacccgctctatgataacactaaatccaaaaaatggcaatgagtcc
aggaaatcaaattatatccacggatacttttga

>gi568815593f:71619294_71820612|GENSCAN_predicted_peptide_5|962_aa
MESSRVRLLPLLGAALLLMLPLLGTRAQEDAELQPRALDIYSAVDDASHEKELVESGRGE
LSATPRHPLPSEERLELTGSWQSVERIPHPGPSEQQGPQRLRDPRAGSAHLGWARSQGGN
FRLRSGVLQIEALQEVLKKLKSKRVPIYEKKYGQVPMVRVTWVYPQGKPYTKSGAPALEE
TFQGHFKPCLQLEAVGQCPLRSWSYLSFERRESHFWPLAQVTQKSNWYGFGPDAGLVKMD
LDGITSTSSECPVSLREGAVIAAPQWIPVWRINKIDRPLARLIKKKREKNQIDTIKNDKG
DITTDPTEIQTTIREYYKHLYANKLENLEEMDKFLDTYILPRLNQEEVESLNRPITGSEI
QAIINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSIEKEGILPNSFYEASIILIP
KPGRDTTKKENFRPIALMNIDAKILNKILANRIQQHIKKLIHHDQVGFIPGMQGWFNICK
SINVIQHINRTKDKNHMIISIDAEKSFDKIQQRFMLKSLNKLSVDGTYLKIIRAIYDKPT
ANIMLNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEFLARAIRQEKEIKGIQLGKEEVKLS
LFADDMIVYLENPIVSAQNLLKLISNFSKVSEYKINVQKSQAFLYTNNRQTESQIMSELP
FTIASKRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWVGRMNIVKMAIL
PKVIYRFNAIPIKLPVTFFTELEKTTLKFIWNQKRACIAKSLLSKKNKAGGITLPDFKLC
YKATVTKTAWSQKACDYVHYVMEERMVFRRNKTKFLGILVHSLVSNPADLFFLVISGSVS
NDAKLPGFMVSMKLNANCGNSPGQAKPWKVKVAVKAKVPPGLSLGRARPQTDPYHEILGT
EVNLGLQLCRLSADSPFSPHDVPFLSLGFGQRAHNLSCTICFFRGQGDPHGPEEMANETN
SP

>gi568815593f:71619294_71820612|GENSCAN_predicted_CDS_5|2889_bp
atggagagctcccgcgtgaggctgctgcccctcctgggcgccgccctgctgctgatgcta
cctctgttgggtacccgtgcccaggaggacgccgagctccagccccgagccctggacatc
tactctgccgtggatgatgcctcccacgagaaggagctggtcgagtcagggcgcggggag
ctgagcgcaacgcccaggcacccactgccatccgaagagcgtctcgagctcacgggctcc
tggcagtctgttgagcgaatccctcatcccggcccctctgagcaacagggaccccagcgg
ctcagagacccgcgggctggaagtgcgcacctgggctgggctcgcagccaaggcggcaac
ttcaggctccgaagcggtgtgttgcagatcgaagcgctgcaagaagtcttgaagaagctc
aagagtaaacgtgttcccatctatgagaagaagtatggccaagtccccatggtaagagtg
acctgggtatatcctcagggaaagccctacacaaagagcggggctcctgcattggaagag
acctttcagggacacttcaaaccatgcctgcagctggaagctgttgggcagtgcccacta
cggagctggtcatatttgagttttgagaggagagaaagccatttctggcctctggcccaa
gtaactcagaaaagtaattggtatggctttggtcctgatgctggacttgttaagatggat
ctggatggtatcacatctacttcctcagaatgtcctgtatcactcagagaaggagcagtt
attgcagccccacagtggatccctgtgtggaggatcaacaaaattgatagaccgctagca
agactaataaagaagaaaagagagaagaatcaaatagatacaataaaaaatgataaaggg
gatattaccaccgatcccacagaaatacaaactaccatcagagaatactacaaacacctc
tatgcaaataaactagaaaatctagaagaaatggataaattcctcgacacatacatcctc
ccaagactaaaccaggaagaagttgaatctctgaatagaccaataacaggctctgaaatt
caggcaataatcaatagcttaccaaccaaaaaaagtccaggaccagatggattcacagcc
gaattctaccagaggtacaaagaggagctggtaccattccttctgaaactattccaatca
atagaaaaagagggaatcctccctaactcattttatgaggccagcatcatcctgatacca
aagcctggcagagacacaaccaaaaaagagaattttagaccaatagccttgatgaacatc
gatgcaaaaatcctcaataaaatactggcaaaccgaatccagcaacacatcaaaaagctt
atccaccatgatcaagtgggcttcatccctgggatgcaaggctggttcaacatatgcaaa
tcaataaatgtaatccagcatataaacagaaccaaagacaaaaaccacatgattatctca
atagatgcagaaaagtcctttgacaaaattcaacaacgcttcatgctaaaatctctcaat
aaattaagtgttgatgggacgtatctcaaaataataagagctatctatgacaaacccaca
gccaatatcatgctgaatgggcaaaaactggaagcattccctttgaaaacaggcacaaga
cagggatgccctctctcaccactcctattcaacatagtgttggaatttctggccagggca
atcaggcaggagaaggaaataaagggtattcaattaggaaaagaggaagtcaaattgtcc
ctgtttgcagatgacatgattgtgtatctagaaaaccccattgtctcagctcaaaatctc
ctcaagctgataagcaacttcagcaaagtctcagaatacaaaatcaatgtacaaaaatca
caagcattcttatacaccaataacagacagacagagagccaaatcatgagtgaactccca
ttcacaattgcttcaaagagaataaaatacctaggaatccaacttacaagggatgtgaag
gacctcttcaaggagaactacaaaccactgctcaatgaaataaaagaggatacaaacaaa
tggaagaacattccatgctcatgggtaggaagaatgaatatcgtgaaaatggccatactg
cccaaggtaatttatagattcaatgccatccccatcaagctaccagttactttcttcaca
gaactggaaaaaactactttaaagttcatatggaaccaaaaaagagcctgcatcgccaag
tcactcctaagcaaaaagaacaaagctggaggcatcacactacctgacttcaaactatgc
tacaaggctacagtaaccaaaacagcgtggagccagaaagcctgtgattatgtgcattat
gtgatggaagaaaggatggtatttagaaggaataagaccaaattcctggggattttggtt
cactccttggtcagcaaccctgcagatttattttttctagtcatttcaggctctgtgtct
aatgatgccaagttacctggctttatggtgagtatgaagctgaacgctaactgcggaaac
tccccgggccaggcaaagccttggaaagtgaaagtggctgtgaaggcaaaggtgccacct
ggtctttccctagggagagcaagaccccaaactgatccttatcatgagattctgggtact
gaggtgaacttgggcttgcagctttgcaggctcagcgcagacagccctttctcaccccat
gatgtccccttcctgtcacttggatttggtcagagagctcacaacctcagctgcaccatc
tgctttttcagaggacaaggagaccctcatggcccagaagaaatggcaaatgaaacaaat
tctccttga

>gi568815593f:71619294_71820612|GENSCAN_predicted_peptide_6|77_aa
MRSTSARPPIIWDVRSTSAQLSRLGCLLEFAGGLPQTLFGWVSPAEAAEQQRLLPPPCSG
SFIPEGHLPDASWSSPV

>gi568815593f:71619294_71820612|GENSCAN_predicted_CDS_6|234_bp
atgaggagcacctctgcccggccgcccatcatctgggatgtgaggagcacctctgcccag
ctgtcccgtctgggatgtctgctggagtttgctggaggtctaccccagactctgtttggc
tgggtatcaccagcggaggctgcagaacagcaaagattgctgcctcctccttgctctgga
agcttcattccagaggggcatctgccagatgccagctggagctctcctgtatga

>gi568815593f:71619294_71820612|GENSCAN_predicted_peptide_7|416_aa
GHKGQVGNGSVRLREGAATLPDHSRGSSGEEIGSTFGHRAAPLSQTPKPNPEQGEEAAEE
KLEASRGWFMRFKERGHLHNIKMQGEAPNADVEAMASYPEDLAKIIDEGGYAKKQIFNAD
ETALGWKKVHKEMKAEIKMFLETNENKDTAHQNLWDTFKAVRRGKFIALNTHKRKQERSK
IDTLTLQLKELEKQEQTHSKASRRQEITKIRTELKEIETQKTLQKINESRSWFFEKINKI
DRLLGGLIKKKREKNQIDTIKNDKGDITTDPTEIQTTIREYYKHLYANKLENLEEMDKFL
DTYILPRLNQEEVESLNRPITDSEIQAIINSLPTKKSPGPDGFTAEFYQRYKEALGPFLL
KLFQSIEKEGILPNSLDEASIILISKPGRDTTKKENFRPIALMNIDAKILNKILAN

>gi568815593f:71619294_71820612|GENSCAN_predicted_CDS_7|1251_bp
ggtcataaaggccaggtggggaatggcagtgtccggctgagggagggagctgccacactg
cctgatcattcgcgtggcagcagcggggaggagatcggcagcacctttggccacagagct
gcgcctctgtcacaaacaccaaaacctaatccagagcaaggggaagaagctgcagaagaa
aaactggaagctagcagaggttggttcatgaggtttaaagaaagaggccatcttcataat
ataaaaatgcaaggtgaagcaccaaatgctgatgtagaagctatggcaagttatccagaa
gatctggctaagatcattgatgaaggtggctacgctaaaaaacagattttcaatgcggat
gaaacagccttagggtggaagaaggtacataaggaaatgaaggcagaaataaagatgttc
cttgaaacgaatgagaacaaagacacagcacaccagaatctctgggacacattcaaagcc
gttcgtagagggaaatttatagcactaaatacccacaagagaaagcaggaaagatctaaa
attgacaccctaacattacaattaaaagaactagaaaagcaagagcaaacacattcaaaa
gctagcagaaggcaagaaataactaagatcagaacagaactgaaggaaatagagacacaa
aaaacccttcaaaaaattaatgaatccaggagctggttttttgaaaagatcaacaaaatt
gatagactgctaggaggattaataaagaagaaaagagagaagaatcaaatagatacaata
aaaaatgataaaggggatatcaccaccgatcccacagaaatacaaactaccatcagagaa
tactacaaacacctctacgcaaataaactagaaaatctagaagaaatggataaattcctc
gacacatacatcctcccaagactaaaccaggaagaagttgaatctctgaatagaccaata
acagactctgaaattcaggcaataatcaatagcttaccaaccaaaaaaagtccaggacca
gatggattcacagccgaattctaccagaggtacaaggaggcactgggaccatttcttctg
aaactattccaatcaatagaaaaagagggaatcctccctaactcattagatgaggccagc
atcatcctgatatcaaagcctggcagagacacaaccaaaaaagagaattttagaccaata
gccttgatgaacattgatgcaaaaatcctcaataaaatactggcaaactga