GENSCAN 1.0    Date run: 4-Nov-116    Time: 14:28:29

Sequence gi568815596r:152020577_152250230 : 229654 bp : 41.05% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  11284  11491  208  1  1   61  -13   200 0.002   5.36
 1.02 Intr +  14088  14196  109  2  1  110   74    28 0.019   2.54
 1.03 Intr +  21761  21846   86  0  2  120   47    24 0.021   0.12
 1.04 Intr +  26006  26109  104  1  2   73  115    -6 0.067  -1.35
 1.05 Intr +  28205  28369  165  2  0  110   54    63 0.213   3.25
 1.06 Intr +  38054  38212  159  2  0   26   75   146 0.059   5.28
 1.07 Intr +  45474  45590  117  0  0   -9   75   154 0.375   3.06
 1.08 Term +  47453  47627  175  2  1   54   36   160 0.728   3.65
 1.09 PlyA +  48527  48532    6                               1.05

 2.02 PlyA -  49226  49221    6                               1.05
 2.01 Sngl -  53953  53720  234  1  0   73   45   316 0.483  20.65
 2.00 Prom -  54903  54864   40                              -6.95

 3.19 PlyA -  58854  58849    6                               1.05
 3.18 Term -  62955  62843  113  1  2   80   42   135 0.030   5.84
 3.17 Intr -  65113  64988  126  2  0   42   47    97 0.010   0.73
 3.16 Intr -  77842  77754   89  0  2   15   80   102 0.054   0.80
 3.15 Intr -  78501  78373  129  2  0   92   86   145 0.071  13.59
 3.14 Intr -  80959  80861   99  0  0   36   87    72 0.009   0.21
 3.13 Intr - 103359 103190  170  0  2   54  115    37 0.294   0.82
 3.12 Intr - 105803 105650  154  1  1   35   99    70 0.392   1.95
 3.11 Intr - 111592 111538   55  2  1   62   66    78 0.424   0.12
 3.10 Intr - 112684 112597   88  1  1   68   75    90 0.770   4.32
 3.09 Intr - 112908 112826   83  1  2   49   75   140 0.953   7.24
 3.08 Intr - 115027 114933   95  0  2   50  111   -10 0.621  -3.81
 3.07 Intr - 123437 123251  187  0  1   31  111   188 0.946  13.23
 3.06 Intr - 124381 124312   70  1  1   53   94    91 0.993   4.04
 3.05 Intr - 126732 126586  147  0  0   74   94    72 0.983   5.91
 3.04 Intr - 127546 127448   99  1  0   44   94    60 0.727   1.59
 3.03 Intr - 127724 127649   76  1  1  131   74    21 0.866   3.60
 3.02 Intr - 129653 129569   85  0  1   48   86    76 0.485   1.36
 3.01 Init - 140742 140622  121  0  1   68   74    77 0.514   4.70
 3.00 Prom - 152802 152763   40                              -5.85

 4.00 Prom + 154537 154576   40                              -6.85
 4.01 Init + 154661 154677   17  1  2   78   76    13 0.478  -1.14
 4.02 Intr + 154936 155045  110  1  2   91  -17   170 0.875   5.81
 4.03 Term + 155337 155557  221  1  2   78   29   203 0.810   9.62
 4.04 PlyA + 156794 156799    6                              -0.45

 5.04 PlyA - 156945 156940    6                               1.05
 5.03 Term - 157801 157557  245  2  2   56   44   170 0.897   4.48
 5.02 Intr - 158387 158346   42  0  0  104   98    45 0.556   4.39
 5.01 Init - 159368 158921  448  2  1   54   60   146 0.510   4.57
 5.00 Prom - 159516 159477   40                              -9.75

 6.02 PlyA - 159758 159753    6                              -0.45
 6.01 Sngl - 160208 159936  273  2  0   72   47   324 0.997  21.78
 6.00 Prom - 161051 161012   40                              -6.95

 7.00 Prom + 172549 172588   40                              -5.05
 7.01 Sngl + 172881 173234  354  2  0   36   40   235 0.829   9.30
 7.02 PlyA + 173624 173629    6                               1.05

 8.00 Prom + 174155 174194   40                              -6.15
 8.01 Init + 175419 175852  434  2  2   43   80   203 0.320  10.73
 8.02 Term + 176697 177168  472  0  1   15   39   187 0.273  -0.08
 8.03 PlyA + 178257 178262    6                               1.05

 9.03 PlyA - 178880 178875    6                               1.05
 9.02 Term - 203668 203526  143  2  2   63   48   112 0.423   1.81
 9.01 Intr - 216341 216243   99  0  0   70   98    42 0.270   2.56

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  77764  77871  108  0  0   61   93   181 0.900  14.66
S.002 Term - 169547 169441  107  0  2  101   36    87 0.914   2.39
S.003 Init - 170847 170784   64  0  1   57   96    45 0.939   3.56

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596r:152020577_152250230|GENSCAN_predicted_peptide_1|374_aa
XKLPFLLALGSLKGKHRADSLAKPRVKASQPVACLNEGSIRGQQTPSFHIQARAESNRAL
MIRVSGSFYSMFVILPLKHQGMASDTELLLSTTAPPQCSFKHTNPWGLVLICRSLPGGWT
IFLGASTSSGSCLTKSSIYSALHFDHWKSTPKTVLISALLRCRPKQRRSSCKVKEPNLSP
SLRLLLDLKYYYFRDQLLGRVKERREQMEAEEPKKALEYSQVTVWEAEHKSLENLQPDDV
VEKKNPFSGEKFKLDAEIGISNQESSVNINRMWKMSPGGIKPIQEALQVLRNLSAPHLTP
ASADVATYMQAQPDSNSHGAENQAQELTIKLKDLQRKMKAQSQYVSHVKVRALIGKERIP
RPGIRASGQTSENL
>gi568815596r:152020577_152250230|GENSCAN_predicted_CDS_1|1125_bp
nnaaaactgccctttctgttggcattaggctctctaaaagggaagcaccgtgctgacagt
cttgccaagccaagggtgaaagcctcacagcccgtggcttgcttgaacgaaggctccatc
aggggacagcaaactccctcatttcatattcaggcaagagcggagagcaacagagccttg
atgattcgagtgtctggatccttctatagtatgtttgtaatcctgcctttaaaacaccag
gggatggcatccgatacagagttgctgctttctacaacagcacccccacagtgctctttt
aaacacactaacccctggggtctggttcttatatgtcgtagcttgccagggggttggacc
atatttcttggagcctctacttcttcaggctcctgcttaacaaagagttctatctattca
gcactacactttgaccactggaagtccacacctaagacagtccttatttctgccctttta
agatgtcgccctaagcagaggagaagcagctgtaaagtgaaagagccaaatcttagccct
tctctgaggttgcttctagatctaaagtactactattttagggaccaactgttggggagg
gtaaaagaaagaagagagcagatggaggcagaagaaccaaagaaagcactggaatactct
caggtgactgtatgggaagcagagcataaaagtttggaaaatttgcagcctgatgatgtg
gtagagaagaaaaacccattttctggggagaaattcaagctggatgcagaaattggcata
agtaaccaggagtccagtgttaatatcaacagaatgtggaaaatgtctccaggtggaata
aagcctatacaggaagcccttcaagtacttcgcaacctctcagccccacatcttactcca
gcctctgctgacgtggccacgtacatgcaggctcaaccagattcaaacagccatggggct
gaaaatcaggctcaggaactaactataaagctgaaagacctgcaaaggaaaatgaaggca
cagtctcagtacgtctctcatgtcaaagtcagagccctcataggaaaagagaggattcca
agacctggaatcagggcatctgggcagacatctgagaatctttaa
>gi568815596r:152020577_152250230|GENSCAN_predicted_peptide_2|77_aa
MVVAVAVMVVMAERRVGTVVVIVVIMPRWLGDGYGGGAAGGDDGDVGDGGEEGGNGSSDY
AMMVVVRVLVAAMVVKR
>gi568815596r:152020577_152250230|GENSCAN_predicted_CDS_2|234_bp
atggtggtggcagtagcggtaatggtggtgatggcggagaggagggtggggacagtagta
gtgatagtagtgattatgccacgatggcttggtgatggttatggtggtggtgctgctggt
ggtgatgatggtgatgttggtgatggcggagaggagggtgggaatggtagtagtgattat
gccatgatggtagtggtgagggtgctggtggcggcgatggtggtaaagcggtga
>gi568815596r:152020577_152250230|GENSCAN_predicted_peptide_3|661_aa
MTLNEHAAFKHLFNKAHLAPPLIHSTLSGHSTCFREHRVGEKATNEYNTTEDWSLIMDIC
DKVGSTPNGAKDCLKAIMKRVNHKVPHVALQALTLLGACVANCGKIFHLEVCSRDFATEV
RAVIKNKAHPKVCEKLKSLMVEWSEEFQKDPQFSLISATIKSMKEEGITFPPAGSQTVSA
AAKNGTSSNKNKEDEDIAKAIELSLQEQKQQHTETKSLYPSSEIQLNNKVARKVRALYDF
EAVEDNELTFKHGEIIIVLDDSDANWWKGENHRGIGLFPSNFVTTNLNIETEAAAVDKLN
VIDDDVEEIKKSEPEPVYIDEDKMDRALQVLQSIDPTDSKPDSQDLLDLEDICQQMGPMI
DEKLEEIDRKHSELSELNVKVLEALELYNKLVNEAPVYSVYSKLHPPAHYPPASSGVPMQ
TYPVQSHGGNYMGQSIHQVTVAQSYSLGPDQIGPLRSLPPNVNSSVTAQPAQTSYLRSTQ
SSPTFPDFFLDEPEAKVIKAQVHSVLLWPSPASGASPLRPPHTGWPCGGSERCPPPPTPR
TGPRTGRTPPPRSPGGPRHHNPEEQVEKIRWQHHFDQLHPQTATTATQSQTLECHTQVWV
WELRREQAERKPQELWRPAGTLMVGLSLAPVIVVAVATVIQDLPFIHDPVQLGFLTIVPA
E
>gi568815596r:152020577_152250230|GENSCAN_predicted_CDS_3|1986_bp
atgactcttaacgagcatgctgccttcaagcatctgtttaacaaagcacatcttgcaccg
cccttaatccattcaaccctgagtggacacagcacatgtttcagagagcacagggttggg
gaaaaagccacgaatgagtacaacactacagaagattggagtcttattatggacatatgt
gacaaagttggaagtactcctaatggagcgaaagattgcctaaaagccataatgaaaagg
gtaaatcataaggttccacatgttgctctgcaagcactaactcttcttggggcttgtgtg
gcaaactgtggaaagatatttcatttagaagtatgttcccgtgattttgcaacagaagta
cgtgctgtgattaaaaataaggcacatcctaaagtatgtgaaaaactgaaatctttaatg
gtggagtggtcagaagaatttcagaaggaccctcagtttagtctgatatctgcaactatt
aaatctatgaaagaagaaggaattacttttcctccagcaggttctcagactgtctcagct
gctgccaagaatggtacgtcatcgaacaaaaacaaagaggatgaagacatagctaaagct
attgaattatcgctgcaagaacagaaacagcaacacacagaaacaaaatccttatatcca
tcttcagaaattcagttaaataataaggttgcacggaaagtgagagctttatatgatttt
gaagctgttgaggacaatgaactcacctttaaacatggtgaaataattattgttttggat
gacagtgatgccaattggtggaaaggagaaaatcacagaggaataggacttttcccatcc
aattttgtaacaactaatttaaacatagagactgaggcagcggctgtggacaaattgaat
gtaattgatgatgatgtggaggaaattaagaaatcagagcctgagcctgtttatatagat
gaggataagatggatagagccctgcaggtacttcagagtatagatccaacagattcaaaa
ccagactcccaagaccttttggatttagaagatatctgccaacagatgggtccaatgata
gatgaaaaacttgaagaaattgataggaagcattcagaattgtctgaattgaatgttaaa
gtcctggaagctctggaactatataacaaattggtgaatgaagcaccagtgtactcagtc
tattcaaagctccaccctccagcacattacccacctgcatcatctggggttccaatgcag
acatatccagttcaatcacatggtggaaactatatgggtcagagcattcaccaagtaact
gttgcccaaagctatagcctaggacccgatcaaattggtccactgagatctctgcctcca
aatgtgaattcctcagtgacagcacagcctgctcaaacttcatatttaagaagcactcaa
agcagtcccacgtttcctgacttctttctcgatgagccagaagcaaaggtgatcaaggcc
caagtacactcagtgctgctgtggccaagcccagcctcgggggccagccccctccgccca
ccgcacacgggctggccatgcggcggctctgaacgatgtcctcctcctcctacgccaaga
acgggaccgcggacgggccgcactcccccacctcgcagtccaggtggcccgaggcaccac
aacccggaggagcaggttgaaaagatccgatggcagcaccacttcgaccagcttcatcct
cagacagcaacaacagcaacacaaagccagaccttggagtgccacacccaagtgtgggtt
tgggagctgagaagggaacaagctgaaaggaagccgcaagagctgtggagaccagccggg
accctcatggtgggcctcagtttagcacccgttattgtggtagcagtggccacagtgatt
caggatctcccatttatccacgatcctgtccagttaggcttccttacaatagttccagca
gaataa
>gi568815596r:152020577_152250230|GENSCAN_predicted_peptide_4|115_aa
MGQRRSPRSRYRPLSSRTNSSPGPGTQQSRTGHSTHPRLARRDFCRGQAAATATFPALGP
RLATKPGTFGAGSSPAWNWTTPEAAVPAAARQGARLPVVFRVGRRALRAKFLSRI
>gi568815596r:152020577_152250230|GENSCAN_predicted_CDS_4|348_bp
atggggcaaagacggagtccccggagtcgctaccgcccactttctagccggacaaacagc
agtccagggccaggcacacagcagtccaggaccgggcacagcactcacccacgtcttgct
cgaagggacttttgtaggggtcaggcggccgctacggccacctttccagcgctaggaccc
cggctagccacgaagccggggaccttcggagccggaagtagtcccgcctggaactggacg
accccggaagctgcagtgccagcggcggcgcggcagggggcgcggttacccgtggttttt
cgcgttggacgacgggctctgagggccaagttcttgtcacgaatttag
>gi568815596r:152020577_152250230|GENSCAN_predicted_peptide_5|244_aa
MGKSEIKRTGGFGSTNKQGKAAYWVNQITDKCPTCEITIQGKKFKGLVDTEADISIISLQ
QWPSVWPIQPAQFNIVGVGKAPEVYQSSYILHCEGPDGQPGTIQPIITSAPINLWGRDLL
QQWGAQVLIPEQLYSPRSQHTMHEMGYVPDGSSNGKASYSGSKAMLRLTLRRTPTVVSYT
HQTQPPTWGQTKKLSQMAEENLRKAGQPITTSNLMVATIAVITIAISIPSTRAAETSSVG
ESLT
>gi568815596r:152020577_152250230|GENSCAN_predicted_CDS_5|735_bp
atgggaaaaagtgaaattaaacgaacaggaggatttggaagcacaaataaacaaggcaaa
gcagcttattgggtaaatcaaattactgataaatgtcctacctgtgaaataactattcag
ggaaagaaatttaaaggtttggtagatacagaagcggacatttcaatcatttctctacag
cagtggccgtctgtgtggccaattcaacccgctcaatttaacatagttggagttggtaaa
gcccctgaagtatatcaaagtagttatattttgcattgtgaagggcctgatggacaacct
gggactattcaaccaattataacttctgcacctataaatttatggggaagagatttgtta
caacaatggggagcacaagttctaattccagaacaattatatagccctcgaagtcaacat
acaatgcatgaaatggggtatgtccctgatgggtctagtaatggcaaagcttcttattct
ggatcaaaagccatgttgagactgacactgagaaggaccccaactgtcgtgagctatacc
catcaaacacagccacccacctggggacagaccaagaagctgtcacagatggcagaagaa
aacctgaggaaagcgggacaaccaatcacaacgagtaatttaatggtagctaccatagcg
gtgatcaccattgccataagtattccttcaacaagggctgccgagaccagctcggtcggg
gagagcctaacctag
>gi568815596r:152020577_152250230|GENSCAN_predicted_peptide_6|90_aa
MGQVRGVVRSILELFHADDEEEGEYNEITEEVTEQVCLPPKAKAAKQGEVHPHPSASPPY
YFEENDPPDLSFLEDTGRKVVAPVTVREVP
>gi568815596r:152020577_152250230|GENSCAN_predicted_CDS_6|273_bp
atgggacaagtgcggggtgtggttcgttccatcttggaactttttcacgctgatgatgag
gaggaaggagagtataacgaaataacagaagaggttacagagcaggtttgtttgccacct
aaagctaaagcggcaaagcagggagaggttcatccccacccttctgcatcccctccctat
tattttgaagaaaatgaccctccagatctttctttcctggaagacactgggcgaaaagta
gttgccccagtgactgttcgagaagtgccttaa
>gi568815596r:152020577_152250230|GENSCAN_predicted_peptide_7|117_aa
MKRNEQSLQEIWDYVKRPNLSDGENGTKLENTLQDIIQENFPNLARQTNIQIQEIQRTPQ
RYSSRRATPRHTIVRFTKVEMEEKMLRAAREKGQVTHKGKPVKLTVDLLAETLQARR
>gi568815596r:152020577_152250230|GENSCAN_predicted_CDS_7|354_bp
atgaaaaggaatgaacaaagcctccaagaaatatgggactatgtgaaaagacctaacctc
agtgatggggagaatggaaccaagttggaaaacactcttcaggatattatccaggagaac
ttccccaacttagcaagacagaccaacattcaaattcaggaaatacagagaacaccacaa
agatactcctcgagaagagcaaccccaagacacacaatcgtcagattcaccaaggttgaa
atggaggaaaaaatgttaagggcagccagagagaaaggtcaggttacccacaaagggaag
cctgtcaaactaacagtggatctcttagcagaaaccctgcaagccagaagatag
>gi568815596r:152020577_152250230|GENSCAN_predicted_peptide_8|301_aa
MNIDAKILNKILVNQIQQHIKKLIHHDQVNFISGMQGWFNTHKSISIIHHINRTNDKNHM
IISIDAGKAFNKIQQHFMLKTLNKLGIDGMYLKIITAIYDKPTANIILNEQKLEAFPLKT
GTRQGCPLSPLLFNVVLEVLARAFGWIKDLNVRLKNIKTLEENLDNTIQDIDMGKDFMIE
TPKAIATKTKIDKWNLIKLRSFCTAKETVIRVNRQPTEWEKIFAIYSSDKGLISRIYKEL
KQIYKKKTNNPIKKWAKDMNRHFSKEDIYVANKHMNHKKAHHHWSLEKCKQNHNETPSHA
S
>gi568815596r:152020577_152250230|GENSCAN_predicted_CDS_8|906_bp
atgaacatcgatgcgaaaatcctcaataaaatactggtaaaccaaatccagcagcacatc
aaaaagcttatccaccacgatcaagtcaacttcatctctgggatgcaaggttggttcaac
acgcacaaatcaataagcataatccatcacataaacagaaccaatgacaaaaaccacatg
attatctcaatagatgcaggaaaggccttcaacaaaattcaacagcacttcatgctaaaa
actctcaataaactaggtattgatggaatgtatctcaaaataataacagctatttatgac
aaacccacagccaatatcatactgaatgagcaaaagctagaagcattccctttgaaaacc
ggcacaagacaaggatgccctctctcaccactcctattcaacgtagtattggaagttctg
gccagggcattcggatggattaaggacttaaacgtgagacttaaaaacataaaaacccta
gaagaaaacctagacaataccattcaggacatagacatgggcaaagatttcatgattgaa
acaccaaaagcaatagcaacaaaaaccaaaattgacaaatggaatctaattaaactaagg
agcttttgcacagcaaaagaaactgtcatcagagtgaacaggcaacctactgaatgggag
aaaatttttgcaatctattcatctgacaaagggctaatatccagaatctacaaagaactt
aaacaaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggcaaaggatatgaac
agacacttctcaaaagaagacatttatgtggccaacaaacatatgaatcataaaaaagct
catcatcactggtcattagagaaatgcaaacaaaaccacaatgagacaccatctcatgcc
agttag
>gi568815596r:152020577_152250230|GENSCAN_predicted_peptide_9|80_aa
XSPNSASSITTHTWIAMTLWVARVHLESLPAVEAMTQGPWKGKSLVEEERVGQTTAVMKL
VYSCESGLRPYLGVEGTQLY
>gi568815596r:152020577_152250230|GENSCAN_predicted_CDS_9|243_bp
ngatcccctaattcagcatcatctatcaccacccacacatggatagcaatgactctatgg
gttgccagggttcacttagagagcctacctgctgtggaagccatgacccagggtccctgg
aaaggaaagagcctggtggaggaggaaagagttggccagacaacagccgtaatgaagtta
gtttattcttgtgaatcaggactcaggccatatttgggtgtggagggaacacagctgtat
taa