GENSCAN 1.0 Date run: 4-Nov-116 Time: 14:26:32 Sequence gi568815596r:189461881_189680460 : 218580 bp : 38.23% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1814 1873 60 0 0 89 67 63 0.760 2.09 1.02 Intr + 1966 2081 116 2 2 82 64 73 0.963 3.55 1.03 Intr + 3199 3374 176 0 2 58 52 144 0.982 5.62 1.04 Intr + 4545 4702 158 0 2 77 98 56 0.968 4.23 1.05 Intr + 5588 5768 181 0 1 76 100 75 0.614 5.50 1.06 Intr + 6595 6689 95 1 2 72 66 -2 0.363 -5.31 1.07 Intr + 7464 7559 96 1 0 86 93 66 0.771 5.96 1.08 Intr + 8196 8365 170 1 2 104 93 46 0.952 5.44 1.09 Intr + 8939 8998 60 1 0 90 91 42 0.837 2.71 1.10 Intr + 12306 12452 147 2 0 59 107 83 0.974 6.81 1.11 Intr + 12837 12928 92 2 2 59 88 13 0.942 -3.73 1.12 Term + 13333 13537 205 1 1 42 46 297 0.958 16.76 1.13 PlyA + 13639 13644 6 1.05 2.04 PlyA - 13752 13747 6 1.05 2.03 Term - 21816 21685 132 0 0 62 49 82 0.699 -1.19 2.02 Intr - 22385 22276 110 0 2 7 109 128 0.779 5.88 2.01 Init - 25989 25881 109 0 1 92 47 74 0.656 4.13 2.00 Prom - 33271 33232 40 -4.75 3.06 PlyA - 35203 35198 6 1.05 3.05 Term - 47208 47069 140 1 2 82 55 82 0.233 1.44 3.04 Intr - 58019 57895 125 1 2 82 34 77 0.034 1.01 3.03 Intr - 65309 65083 227 2 2 67 47 188 0.557 8.46 3.02 Intr - 79994 79882 113 0 2 92 80 79 0.053 6.68 3.01 Init - 80841 80823 19 0 1 79 60 10 0.032 -2.45 3.00 Prom - 89764 89725 40 -4.35 4.12 PlyA - 90084 90079 6 1.05 4.11 Term - 100311 99998 314 1 2 45 53 238 0.994 10.18 4.10 Intr - 102345 101704 642 1 0 86 103 482 0.993 40.56 4.09 Intr - 103719 103474 246 1 0 90 92 244 0.976 21.51 4.08 Intr - 109961 109835 127 2 1 105 30 74 0.927 2.63 4.07 Intr - 111081 110966 116 1 2 82 99 69 0.998 6.65 4.06 Intr - 113440 113281 160 1 1 72 113 185 0.928 18.04 4.05 Intr - 118000 117933 68 2 2 98 106 -11 0.016 -0.59 4.04 Intr - 118682 118538 145 2 1 47 89 124 0.008 7.33 4.03 Intr - 119217 119083 135 0 0 92 35 94 0.008 4.34 4.02 Intr - 121608 121454 155 1 2 17 131 59 0.115 1.97 4.01 Init - 126136 126064 73 1 1 85 115 67 0.602 10.38 4.00 Prom - 139975 139936 40 -3.45 5.00 Prom + 146196 146235 40 -2.75 5.01 Init + 164126 164265 140 1 2 68 109 151 0.414 14.86 5.02 Term + 164586 164730 145 0 1 35 37 107 0.312 -3.30 5.03 PlyA + 165306 165311 6 1.05 6.00 Prom + 185631 185670 40 -1.85 6.01 Sngl + 190429 191004 576 0 0 43 38 148 0.375 1.22 6.02 PlyA + 192596 192601 6 1.05 7.00 Prom + 194178 194217 40 -0.95 7.01 Init + 199450 199525 76 0 1 95 91 17 0.312 2.30 7.02 Intr + 199605 199730 126 1 0 46 113 131 0.362 11.13 7.03 Intr + 204728 205716 989 0 2 45 86 776 0.644 62.76 7.04 Intr + 205884 206065 182 2 2 63 86 275 0.999 22.54 7.05 Term + 208561 208846 286 1 1 48 35 260 0.999 10.59 7.06 PlyA + 209375 209380 6 1.05 8.00 Prom + 209389 209428 40 -7.25 8.01 Init + 214611 215211 601 2 1 22 110 229 0.762 13.99 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 81375 81599 225 2 0 66 48 170 0.829 5.69 S.002 Init + 118579 118641 63 0 0 61 34 124 0.951 3.59 S.003 Term + 151166 151386 221 0 2 101 45 124 0.860 5.62 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:189461881_189680460|GENSCAN_predicted_peptide_1|518_aa XIIIIHRNLEASAVIQGLVKDRSIFTGLMIDPRTKALVLNGKPGHLQFYSLQSDKQLYNL DIIQQEYINDYGLIQIELTKAAFGCFGNWLATVEQRQEKETELELQMKLWMYNKKTQGFI LNTKINMPHEDCITALCFCNAEKSEQPTLVTASKDGYFKVWILTDDSDIYKKAVGWTCDF VGSYHKYQATNCCFSEDGSLLAVSFEEIVTIWDSVTWELKCTFCQRAGKIRHLCFGRLTC SKYLLGATENGILCCWNLLSCALEWNAKLNVRVMEPDPNSENIAAISQSSVGSDLFVFKP SEPRPLYIQKGISREKVQWGVFVPRDVPESFTSEAYQWLNRSQFYFLTKSQSLLTFSTKS PEEKLTPTSKQLLAEESLPTTPFYFILGKHRQQQDEKLNETLENELVQLPLTENIPAISE LLHTPAHVLPSAAFLCSMFVNSLLLSKETKSAKEIPEDVDMEEEKESEDSDEENDFTEKV QDTSNTGLGEDIIHQLSKSEEKELRKFRKIDYSWIAAL >gi568815596r:189461881_189680460|GENSCAN_predicted_CDS_1|1557_bp nagataataattattcaccgaaaccttgaagcatccgcagtaattcaaggcctagtgaaa gataggagtatcttcactggtttgatgattgatccaagaactaaagctttggttttgaat ggaaaacctggccacctgcagttttattctctccagagtgataaacagttatacaattta gatattatacagcaagaatatattaatgattatggtctgatccaaattgaactaacaaag gctgcatttggctgctttggtaactggcttgcaacagtggaacagcggcaagaaaaggaa actgagcttgaattgcaaatgaaactgtggatgtataataagaaaacacaagggtttatt cttaacactaaaattaacatgccacacgaagactgcattacagctctctgtttctgtaat gcagaaaaatctgaacagcccaccttggttacagctagcaaagatggttacttcaaagta tggatattaacagatgactctgacatatacaaaaaagctgttggctggacctgtgacttt gttggtagttatcacaagtatcaagcaactaactgttgtttctccgaagatggttcttta ctagcagttagttttgaggaaatagtcacaatatgggattctgtaacatgggaacttaaa tgtacattttgccaacgagctgggaaaataaggcacctttgctttgggagattgacgtgt tcaaagtatctacttggtgctactgaaaatggcattctttgctgttggaatctgctgagc tgtgcattggagtggaatgcaaaattaaatgttagagttatggaacccgatcctaattca gagaatattgctgcaatctctcagtcttcagtgggttcagacttgtttgtatttaaacct agtgagccaaggccattgtatattcaaaagggtatctccagagagaaagtccagtgggga gtgtttgttccacgagatgtccctgaatccttcacctcagaagcttaccagtggctaaat agatcccagttttacttcctaacaaaatcacagagtttattgacattcagtacaaagtct ccagaagaaaaactcacaccaacaagcaaacagctgctagcagaagaaagtcttcccaca accccattttatttcatattgggaaaacacaggcaacagcaggatgaaaaactaaacgaa actttagagaatgagctggtacaactacccttaacagaaaacatacccgcaattagtgag cttcttcacactccagcccatgtcctgccatctgctgctttcctgtgctccatgtttgta aattcattgctgctgtctaaagagactaagagtgctaaggaaattcctgaagatgtagat atggaagaagaaaaagaaagtgaagattcagatgaagaaaatgattttaccgaaaaagtc caggatacaagtaacacaggtttaggagaagacattatacatcagttgtcaaaatctgaa gaaaaagaactgagaaaatttaggaaaatagactacagctggatagctgccctttaa >gi568815596r:189461881_189680460|GENSCAN_predicted_peptide_2|116_aa MLTLLIPGYEIIACSVCKIPTGATGMCSGSDGNRAARANNLGLQAPDDDEGHAVGRDQSQ FQDEEGNPGDEHQPHQNGNVPDPGCCTCRRPEGGGEEALLLILGTTVAKERVLEPN >gi568815596r:189461881_189680460|GENSCAN_predicted_CDS_2|351_bp atgctgacactcctcatacctggctatgagataattgcctgctctgtttgcaagatccca acaggagcaactggaatgtgttcagggagtgatggaaatagggcagctagagcaaataat ctgggacttcaagccccagatgacgatgagggccatgcagtaggaagggatcagagtcag ttccaggatgaggaaggtaatcccggagatgagcaccagcctcatcagaatgggaatgtt ccagacccggggtgctgcacttgtaggaggcctgagggtggtggtgaagaagccctcttg ctgattttggggaccacagtggctaaggaacgagtgctagagccaaattga >gi568815596r:189461881_189680460|GENSCAN_predicted_peptide_3|207_aa MTTEVRGRILGTDTTWHDECGISRPHNNNLINQSLGNLCGSGKETWKIRREKLLNETIRA ENAECTQRQQESPHSLRHSKRREGDWTERGELVLQGGGWSSDVEVQFFNFCKMEEKDEKW KHLWDSPVKPLQPDALKELTHSPEAPLLLARPASRTLCSLCCIFSVTSPIWYRSTVLVRF HAADKDILKTVQFTKERGLIGLTVPCG >gi568815596r:189461881_189680460|GENSCAN_predicted_CDS_3|624_bp atgaccactgaggttagggggaggatcttgggtacagatactacctggcacgatgaatgt ggcattagccgtcctcataacaacaatctgataaatcagtcccttggcaacttgtgtgga tctggcaaagagacgtggaaaatcagaagagagaaactgctcaatgagaccataagagct gaaaacgctgagtgtacccaaagacagcaggaaagcccccactccctccgtcatagcaag agaagggagggtgactggactgagaggggagagctggtccttcaaggaggaggttggagt tccgatgtggaagtccagttcttcaacttctgtaagatggaggaaaaggacgagaaatgg aaacatctctgggacagccctgtcaagccactgcagcctgatgctcttaaagaactcact cacagtcctgaagctcccctgcttcttgctcggcccgcatcgcggaccctctgctccctc tgctgtatcttttcagtaacatcccccatctggtaccgatctactgtattagtccgtttt catgctgctgataaagacatactcaagactgtgcaatttacaaaagaaagaggtttaatt ggactcacagttccatgtggctga >gi568815596r:189461881_189680460|GENSCAN_predicted_peptide_4|726_aa MPVYQKKRAGTDKGKLAASAGGDSEAQSWIGALPVIVGDNPITFFLQSSGIVTILYSAFV LVSTSDMAEETEDAWPPHGRPPRLPSRLPGEQENPGSGTRRGEGPRKPRRVSAAGWTLAP GRCYLQFLALLLTSTSARAAAAIAAAEEPAGSPSVMTRAGDHNRQRGCCGSLADYLTSAK FLLYLGHSLSTWGDRMWHFAVSVFLVELYGNSLLLTAVYGLVVAGSVLVLGAIIGDWVDK NARLKVAQTSLVVQNVSVILCGIILMMVFLHKHELLTMYHGWVLTSCYILIITIANIANL ASTATAITIQRDWIVVVAGEDRSKLANMNATIRRIDQLTNILAPMAVGQIMTFGSPVIGC GFISGWNLVSMCVEYVLLWKVYQKTPALAVKAGLKEEETELKQLNLHKDTEPKPLEGTHL MGVKDSNIHELEHEQEPTCASQMAEPFRTFRDGWVSYYNQPVFLAGMGLAFLYMTVLGFD CITTGYAYTQGLSGSILSILMGASAITGIMGTVAFTWLRRKCGLVRTGLISGLAQLSCLI LCVISVFMPGSPLDLSVSPFEDIRSRFIQGESITPTKIPEITTEIYMSNGSNSANIVPET SPESVPIISVSLLFAGVIAARIGLWSFDLTVTQLLQENVIESERGIINGVQNSMNYLLDL LHFIMVILAPNPEAFGLLVLISVSFVAMGHIMYFRFAQNTLGNKLFACGPDAKEVRKENQ ANTSVV >gi568815596r:189461881_189680460|GENSCAN_predicted_CDS_4|2181_bp atgcctgtgtatcagaaaaaaagggcggggacagacaaggggaagctcgcagccagtgct ggtggagactcagaagcacagtcctggattggagcccttcctgtaatagttggtgataac cctattactttttttcttcagagcagtggaatagtgaccattctttattctgcctttgtc ctggtgagcacatctgatatggcagaggaaactgaggatgcctggccgccccacggccgc cccccgaggttgccctcgcggcttcccggagagcaggaaaacccggggagtggaacgcgt cgaggcgaaggtccccgcaagccgcgcagggtgtctgcggccggttggacgcttgcgccc gggaggtgctatctccagttccttgcactcctgttaacaagcacctcagcgagagcagca gcagcgatagcagccgcagaagagccagcggggtcgcctagtgtcatgaccagggcggga gatcacaaccgccagagaggatgctgtggatccttggccgactacctgacctctgcaaaa ttccttctctaccttggtcattctctctctacttggggagatcggatgtggcactttgcg gtgtctgtgtttctggtagagctctatggaaacagcctccttttgacagcagtctacggg ctggtggtggcagggtctgttctggtcctgggagccatcatcggtgactgggtggacaag aatgctagacttaaagtggcccagacctcgctggtggtacagaatgtttcagtcatcctg tgtggaatcatcctgatgatggttttcttacataaacatgagcttctgaccatgtaccat ggatgggttctcacttcctgctatatcctgatcatcactattgcaaatattgcaaatttg gccagtactgctactgcaatcacaatccaaagggattggattgttgttgttgcaggagaa gacagaagcaaactagcaaatatgaatgccacaatacgaaggattgaccagttaaccaac atcttagcccccatggctgttggccagattatgacatttggctccccagtcatcggctgt ggctttatttcgggatggaacttggtatccatgtgcgtggagtacgttctgctctggaag gtttaccagaaaaccccagctctagctgtgaaagctggtcttaaagaagaggaaactgaa ttgaaacagctgaatttacacaaagatactgagccaaaacccctggagggaactcatcta atgggtgtgaaagactctaacatccatgagcttgaacatgagcaagagcctacttgtgcc tcccagatggctgagcccttccgtaccttccgagatggatgggtctcctactacaaccag cctgtgtttctggctggcatgggtcttgctttcctttatatgactgtcctgggctttgac tgcatcaccacagggtacgcctacactcagggactgagtggttccatcctcagtattttg atgggagcatcagctataactggaataatgggaactgtagcttttacttggctacgtcga aaatgtggtttggttcggacaggtctgatctcaggattggcacagctttcctgtttgatc ttgtgtgtgatctctgtattcatgcctggaagccccctggacttgtccgtttctcctttt gaagatatccgatcaaggttcattcaaggagagtcaattacacctaccaagatacctgaa attacaactgaaatatacatgtctaatgggtctaattctgctaatattgtcccggagaca agtcctgaatctgtgcccataatctctgtcagtctgctgtttgcaggcgtcattgctgct agaatcggtctttggtcctttgatttaactgtgacacagttgctgcaagaaaatgtaatt gaatctgaaagaggcattataaatggtgtacagaactccatgaactatcttcttgatctt ctgcatttcatcatggtcatcctggctccaaatcctgaagcttttggcttgctcgtattg atttcagtctcctttgtggcaatgggccacattatgtatttccgatttgcccaaaatact ctgggaaacaagctctttgcttgcggtcctgatgcaaaagaagttaggaaggaaaatcaa gcaaatacatctgttgtttga >gi568815596r:189461881_189680460|GENSCAN_predicted_peptide_5|94_aa MIDDKMQELSSVHSSTIKVEAPPRDFHALAGALTDLLQRQQQFMYHSSFDDELKGNFLSS ERLSYPISEKSGTGELRGCDCLSGDVWHEQNKDT >gi568815596r:189461881_189680460|GENSCAN_predicted_CDS_5|285_bp atgattgatgacaagatgcaggaattatcctctgtacacagctcaaccatcaaggttgag gcaccaccacgtgattttcatgctctggctggagcactgacagacctcctccagagacaa caacagttcatgtaccacagttcttttgatgatgaacttaaaggaaactttttaagttca gaaaggctgagttacccaataagtgaaaaatctggtactggggaactcaggggatgtgac tgcttgtctggtgatgtttggcatgaacagaacaaagacacctaa >gi568815596r:189461881_189680460|GENSCAN_predicted_peptide_6|191_aa MKQVLGDLARDTDSHTIIMGNFNTLLTILDRSSRQKINKDIQDLNSALDQMDLMDLYRTL HPKPTEYTFFSSSRGTYSKIDHIIGQKTNVIKCKRTEIIPNTLLDHSTIKIGVKIMKITQ NHAITWKLNNMLLNDFWVNNEIKAEIKKFFGNDKNKDTTYQILWDTDKAMLTGKFITLNA QIKKLERSLIT >gi568815596r:189461881_189680460|GENSCAN_predicted_CDS_6|576_bp atgaagcaagttcttggagacctagcaagagacacagactcccacacaataataatggga aattttaacaccctactgacaatattagacagatcatcaaggcagaaaattaacaaagat attcaggacttaaactcagcattggaccaaatggatctaatggacctttacagaactctc cacccaaaaccaacagaatatacatttttctcatcctcacgtggcacatactctaaaatt gaccacataattggacaaaaaacaaatgtcatcaaatgtaaaagaactgaaattatacca aacacactcttagaccatagcacaataaaaataggagtcaagattatgaaaatcactcaa aaccatgcaattacatggaaattaaacaacatgctcctgaatgacttttgggtaaataat gaaattaaggcagaaatcaagaagttctttggaaatgacaagaacaaagatacaacatac cagattctctgggacacagataaggcaatgttaacagggaaattcataacactaaatgcc caaatcaaaaagttagaaagatctctaataacctaa >gi568815596r:189461881_189680460|GENSCAN_predicted_peptide_7|552_aa MRRRETPPQARPGLRMRSSERADAIGLSHPACLALGGNAQPRDATRGQLCADPHRQFDPT QGGSKQQVGTQTSGLANQWQEVPASGLFRIDLKSTVISGCIILQLYPWKYISRENIIEEN VNSLSQISADLPAFVSVVANEAKLYLEKPVVPLNMMLPQAALETHCSNISNVPPTREILQ VFLTDVHMKEVIQQFIDVLSVAVKKRVLCLPRDENLTANEVLKTCDRKANVAILFSGGID SMVIATLADRHIPLDEPIDLLNVAFIAEEKTMPTTFNREGNKQKNKCEIPSEEFSKDVAA AAADSPNKHVSVPDRITGRAGLKELQAVSPSRIWNFVEINVSMEELQKLRRTRICHLIRP LDTVLDDSIGCAVWFASRGIGWLVAQEGVKSYQSNAKVVLTGIGADEQLAGYSRHRVRFQ SHGLEGLNKEIMMELGRISSRNLGRDDRVIGDHGKEARFPFLDENVVSFLNSLPIWEKAN LTLPRGIGEKLLLRLAAVELGLTASALLPKRAMQFGSRIAKMEKINEKASDKCGRLQIMS LENLSIEKETKL >gi568815596r:189461881_189680460|GENSCAN_predicted_CDS_7|1659_bp atgcgcaggcgcgaaactcctccccaggccagacccggcctgcgcatgcgttcaagtgag cgcgcagacgctattgggctgagccatcccgcgtgtcttgcgctcggtggaaatgcccag ccgagggacgcgaccagaggacagctctgtgctgatccccaccgacaattcgaccccaca caaggaggatctaagcagcaagttggcacccaaacatctggattggcaaatcagtggcaa gaagttccagcatctggacttttcagaattgatcttaagtctactgtcatttccggatgc attattttacaactgtatccttggaaatatatttctagggagaatattattgaagaaaat gttaatagcctgagtcaaatttcagcagacttaccagcatttgtatcagtggtagcaaat gaagccaaactgtatcttgaaaaacctgttgttcctttaaatatgatgttgccacaagct gcattggagactcattgcagtaatatttccaatgtgccacctacaagagagatacttcaa gtctttcttactgatgtacacatgaaggaagtaattcagcagttcattgatgtcctgagt gtagcagtcaagaaacgtgtcttgtgtttacctagggatgaaaacctgacagcaaatgaa gttttgaaaacgtgtgataggaaagcaaatgttgcaatcctgttttctgggggcattgat tccatggttattgcaacccttgctgaccgtcatattcctttagatgaaccaattgatctt cttaatgtagctttcatagctgaagaaaagaccatgccaactacctttaacagagaaggg aataaacagaaaaataaatgtgaaataccttcagaagaattctctaaagatgttgctgct gctgctgctgacagtcctaataaacatgtcagtgtaccagatcgaatcacaggaagggcg ggactaaaggaactacaagctgttagcccttcccgaatttggaattttgttgaaattaat gtttctatggaagaactgcagaaattaagaagaactcgaatatgtcacttaattcggcca ttggatacagttttggatgatagcattggctgtgcagtctggtttgcttctagaggaatt ggttggttagtggcccaggaaggagtgaaatcctatcagagcaatgcaaaggtagttctc actggaattggtgcagatgagcaacttgcaggttattctcgtcatcgtgtccgctttcag tcgcatgggctggaaggattgaataaggaaataatgatggaactgggtcgaatttcttct agaaatcttggtcgtgatgacagagttattggtgatcatggaaaagaagcaagatttcct ttcctggatgaaaatgttgtctcctttctaaattctctgccgatttgggaaaaagcaaac ttgactttaccccgaggaattggtgaaaaattacttttacgccttgcagctgtggaactt ggtcttacagcctctgctcttctgcccaaacgggccatgcagtttggatcaagaattgca aaaatggaaaaaattaatgaaaaggcatctgataaatgtggacggctccaaatcatgtcc ttagaaaatctttctattgaaaaggagactaaattgtaa >gi568815596r:189461881_189680460|GENSCAN_predicted_peptide_8|201_aa MLRLPKKGLPRFEQVQDEDTYLENLAIQRNASAFFEKYDRSEIQELLTTALVSWLSAKED VRSQVDLPCGIMSQMNNVGFSTAILLTPVDPTALLDYREVHQMIRELAIGIYCLNQIPSI SLEANYDQSSSCQLPPAYYDTRIGQILINIDYMLKALWHGIYMPKEKRARFSELWRAIMD IDPDGKPQTNKDIFSEFSSAX >gi568815596r:189461881_189680460|GENSCAN_predicted_CDS_8|603_bp atgttaagattgcccaaaaaaggattacctagatttgagcaagttcaggatgaagacacc tacctggaaaatttagcaatacaaagaaatgcatctgctttttttgaaaaatatgatcgg agtgaaatacaagagttactaactactgcactagttagctggttgtctgccaaagaggat gtgcgctctcaagtagacctcccatgtggaattatgagtcaaatgaataacgtaggcttc tccactgcaatcctactgactcccgtggaccctactgccctcttagactatagagaggtc catcaaatgataagagagttggctattggaatttattgcctaaatcaaatcccttccatc agtttagaagctaattatgatcagagttcttcttgtcaattacctccagcttattatgat accagaattgggcaaattctgatcaatattgactacatgctgaaagcactatggcatgga atatatatgcccaaagaaaaacgagctagattctctgaattgtggcgtgccatcatggac attgatcctgatggaaaacctcaaacaaataaagacattttttcagagtttagttcagca gnn