GENSCAN 1.0 Date run: 6-Nov-116 Time: 14:56:48 Sequence gi568815586f:11979306_12194931 : 215626 bp : 41.80% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 3137 3211 75 1 0 58 99 31 0.238 2.36 1.02 Intr + 28907 28984 78 1 0 76 80 47 0.104 1.53 1.03 Intr + 35971 36199 229 0 1 82 9 222 0.575 10.22 1.04 Intr + 38011 38250 240 2 0 37 77 165 0.483 6.90 1.05 Intr + 38602 38722 121 2 1 14 44 97 0.096 -3.47 1.06 Intr + 43889 44012 124 2 1 -15 87 114 0.056 0.67 1.07 Term + 46021 46167 147 0 0 56 41 155 0.932 4.52 1.08 PlyA + 46800 46805 6 1.05 2.00 Prom + 50598 50637 40 -5.35 2.01 Init + 52166 52196 31 1 1 90 109 42 0.268 6.51 2.02 Term + 53568 53965 398 1 2 0 37 308 0.186 11.15 2.03 PlyA + 54534 54539 6 1.05 3.05 PlyA - 55326 55321 6 1.05 3.04 Term - 59286 59122 165 0 0 -8 36 160 0.023 -1.87 3.03 Intr - 60832 60696 137 2 2 43 89 119 0.026 6.87 3.02 Intr - 79805 79611 195 0 0 1 77 144 0.002 3.06 3.01 Init - 81590 81473 118 2 1 51 109 167 0.034 15.51 3.00 Prom - 84074 84035 40 -6.05 4.00 Prom + 87021 87060 40 -8.75 4.01 Init + 100001 100433 433 1 1 76 102 329 0.997 29.52 4.02 Intr + 106534 106656 123 2 0 109 53 32 0.717 1.44 4.03 Intr + 107908 108081 174 2 0 93 121 76 0.992 10.39 4.04 Intr + 111474 111544 71 1 2 49 86 69 0.870 0.78 4.05 Intr + 115359 115625 267 2 0 93 94 249 0.980 22.81 4.06 Term + 122133 122324 192 2 0 60 43 145 0.351 3.64 4.07 PlyA + 122999 123004 6 1.05 5.00 Prom + 129649 129688 40 -6.15 5.01 Sngl + 131858 132214 357 1 0 81 42 712 0.985 61.71 5.02 PlyA + 132831 132836 6 1.05 6.22 PlyA - 132849 132844 6 1.05 6.21 Term - 142115 141821 295 1 1 86 55 197 0.857 9.99 6.20 Intr - 145357 145260 98 1 2 90 90 10 0.987 -0.71 6.19 Intr - 146127 145991 137 1 2 84 92 123 0.999 11.67 6.18 Intr - 147616 147386 231 2 0 100 109 82 0.985 8.32 6.17 Intr - 151588 151478 111 2 0 69 81 100 0.981 6.83 6.16 Intr - 152752 152516 237 2 0 66 63 212 0.997 13.16 6.15 Intr - 155995 155870 126 2 0 150 84 1 0.975 5.63 6.14 Intr - 159229 159020 210 2 0 70 98 287 0.999 25.86 6.13 Intr - 168251 168061 191 1 2 145 80 113 0.999 14.51 6.12 Intr - 169848 169637 212 0 2 127 75 216 0.999 20.99 6.11 Intr - 171708 171531 178 2 1 100 91 339 0.875 34.40 6.10 Intr - 172167 171948 220 1 1 115 67 34 0.090 0.44 6.09 Intr - 179850 179524 327 1 0 64 91 224 0.707 15.05 6.08 Intr - 180659 180475 185 1 2 31 99 149 0.600 8.81 6.07 Intr - 183114 182888 227 0 2 103 59 254 0.603 19.86 6.06 Intr - 185257 184968 290 2 2 79 77 189 0.991 12.94 6.05 Intr - 185990 185774 217 2 1 26 97 192 0.096 11.05 6.04 Intr - 200676 200505 172 2 1 89 68 183 0.995 15.42 6.03 Intr - 202134 201738 397 1 1 87 86 548 0.965 47.21 6.02 Intr - 204806 204675 132 0 0 47 95 74 0.433 3.70 6.01 Intr - 207814 207618 197 0 2 75 115 52 0.780 4.74 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586f:11979306_12194931|GENSCAN_predicted_peptide_1|337_aa MKELNLVKTIDRTLIPHLLLSRQPQARFPGLIPVLETSAVMFPTSQLFRGWALPSCLPVY FPVTRGTKQQQQQRHHNNCPQVVVGIIFRMARALWLKKRLLVECSKQAPATEHSKWAQDM GVQEQGPDKEGKEESPGMEFQAGSEPRACLSITLKNVAAVPWSGSLVTPGDGSQEHGIEG AWSLSSPEYRSVLAPLPVSSRVSDTQTGELRNQESFGPRLSRCQNEPRAPGEAASFMSLQ HLTVEGPCKTIHFMEMHVPRKKSIEPPNEIFKRKAMYKHMIRFLNATRKHAEVEPKLRME ERDRRLGLLQGLSVWDRVRERQELHRAEIQKSAEETT >gi568815586f:11979306_12194931|GENSCAN_predicted_CDS_1|1014_bp atgaaggagctcaacttggttaaaaccattgacagaacactcatcccccatcttttgctc tctagacaaccgcaggccagattccctgggttaattcctgttcttgagacttcggctgtg atgtttcctacttcacagctctttcggggctgggctctcccgagctgtctgcctgtttat tttcctgtgacgagagggaccaagcagcagcagcagcagcggcatcataataactgccca caagttgtagttggaataattttcagaatggctcgagctctatggttaaagaaacgactg ctggtggagtgtagtaaacaggccccagccacagaacacagcaagtgggcacaggacatg ggagtccaagagcaaggcccggataaggaagggaaagaggagagtcctggcatggagttc caagcaggctcagagccccgggcctgcctttccatcacattaaagaatgtggctgctgtg ccctggagcggttccctagtcacaccaggtgatgggagccaggagcatgggatagaagga gcctggtcactgtccagcccagagtatcgctcagtgttggctcccctgccagtctcttcc agggtatctgacacccaaacaggagaactcagaaatcaggaatcatttggtcccaggctt tccaggtgccagaatgaaccacgagctcccggtgaagccgcaagttttatgagtctgcag cacctcacagtggagggaccctgcaaaacgattcacttcatggaaatgcatgttccccga aagaaatccatagagccccctaatgaaattttcaagagaaaagctatgtacaaacatatg atccgcttcctaaacgccacaagaaagcacgcggaagtggagcccaaactgaggatggag gaaagagaccgaaggttagggctgctgcagggattgagtgtgtgggacagggtacgagag aggcaggagctgcacagagcagaaatccaaaaatctgcagaggagaccacctga >gi568815586f:11979306_12194931|GENSCAN_predicted_peptide_2|142_aa MAVAELEVTLELAGKSGMWTDTDMPCAFFRWRKRVGLGAGGCCLGAEREGRKGAAGSHLT HALLSGPNQDGIKRPWEENKESNKGNEFPIQKGKLPFPLRSDWSEVKDWCALATASAEAL TLELFRPRRITGTFLQPEPATS >gi568815586f:11979306_12194931|GENSCAN_predicted_CDS_2|429_bp atggctgttgcagaactcgaagttactctcgaactggccgggaaatccggcatgtggaca gacacggatatgccgtgtgcatttttccgttggaggaagagggtaggcttgggtgctgga gggtgttgcctgggggcggagcgagaaggaagaaagggcgctgctggttcacacctcacc catgccctcctctcaggccccaatcaggatggaattaaaagaccctgggaggaaaataag gaaagcaacaaaggaaatgagttccccatacagaaagggaaattgcctttcccgttaagg tcagactggtccgaagtaaaggactggtgcgccttagcaaccgcctctgctgaagcccta acgctggagctctttcgccctcgcagaataaccgggactttcttgcagccggaaccagcc acgtcctga >gi568815586f:11979306_12194931|GENSCAN_predicted_peptide_3|204_aa MVYGTSEAIGQRQSSAAKLRRSQSESLRPEFQGLWEWLPVRDRHRSFGSTDKTCLLCLYQ KMKGTEIKRRERLKCGAKIERRKRLRVREVGEESKKRPLTGFEIEERGEAPLYCHKEDEV QAPLVACTRELFTVRRSALFLNTRQRSEEMEGKSNEKQKICSTEHLKPFEIGDTSTVEGA KEDKIMEMKTERLFDGFSKDLLDL >gi568815586f:11979306_12194931|GENSCAN_predicted_CDS_3|615_bp atggtctacgggacttccgaagctatcgggcagcgtcagtcttcagccgctaagctgaga aggagtcagtcagagagccttcggccagagttccaggggctctgggagtggctgccagtc cgtgaccggcaccggagttttgggtctacggataaaacgtgtctcctttgtctctaccag aaaatgaaaggaactgaaattaaaagaagggagagattgaagtgtggcgccaagattgaa aggagaaagaggctgagggttagggaggttggagaagagagtaaaaagaggccgcttact ggatttgaaattgaagagaggggagaggctcctctttactgccacaaggaggacgaagtc caggctcctctcgtggcctgcaccagagaattattcactgtaaggagatcagccttattt ctaaataccaggcagagaagtgaagagatggaagggaagagtaatgagaaacaaaagatt tgttcaacagaacacctgaaaccttttgaaattggggacaccagtacagtggaaggggca aaggaggacaagatcatggagatgaagacagagagattatttgatggtttctcaaaggat cttttagacctctaa >gi568815586f:11979306_12194931|GENSCAN_predicted_peptide_4|419_aa MCSTSGCDLEEIPLDDDDLNTIEFKILAYYTRHHVFKSTPALFSPKLLRTRSLSQRGLGN CSANESWTEVSWPCRNSQSSEKAINLGKKKSSWKAFFGVVEKEDSQSTPAKVSAQGQRTL EYQDSHSQQWSRCLSNVEQCLEHEVLYKAQKCPRSLSTPTFCPKRHSITVFKSCELPNVS LQMLHAVDPKVISIANRVAEIVYSWPPPQATQAGGFKSKEIFVTEGLSFQLQGHVPVASS SKKDEEEQILAKIVELLKYSGDQLERKLKKDKALMGHFQDGLSYSVFKTITDQVLMGVDP RGESEVKAQGFKAALVIDVTAKLTAIDNHPMNRVLGFGTKYLKENFSPWIQQHGGWSTAC KEFLGHAGERGTHAEFTSSLSREDRARSPERSEQLKFASLIKDERERHTKLQRPADISP >gi568815586f:11979306_12194931|GENSCAN_predicted_CDS_4|1260_bp atgtgtagcaccagtgggtgtgacctggaagaaatccccctagatgatgatgacctaaac accatagaattcaaaatcctcgcctactacaccagacatcatgtcttcaagagcacccct gctctcttctcaccaaagctgctgagaacaagaagtttgtcccagaggggcctggggaat tgttcagcaaatgagtcatggacagaggtgtcatggccttgcagaaattcccaatccagt gagaaggccataaaccttggcaagaaaaagtcttcttggaaagcattctttggagtagtg gagaaggaagattcgcagagcacgcctgccaaggtctctgctcagggtcaaaggacgttg gaataccaagattcgcacagccagcagtggtccaggtgtctttctaacgtggagcagtgc ttggagcatgaagtgctgtacaaagcccagaagtgccctaggagtttgagcacaccaacc ttctgtccaaaacgccactccattactgtcttcaagtcctgtgaattacctaatgttagt cttcaaatgttacatgctgtggaccccaaagtcatttccattgccaaccgagtagctgaa attgtttactcctggccaccaccacaagcgacccaggcaggaggcttcaagtccaaagag atttttgtaactgagggtctctccttccagctccaaggccacgtgcctgtagcttcaagt tctaagaaagatgaagaagaacaaatactagccaaaattgttgagctgctgaaatattca ggagatcagttggaaagaaagctgaagaaagataaggctttgatgggccacttccaggat gggctgtcctactctgttttcaagaccatcacagaccaggtcctaatgggtgtggacccc aggggagaatcagaggtcaaagctcagggctttaaggctgcccttgtaatagacgtcacg gccaagctcacagctattgacaaccacccgatgaacagggtcctgggctttggcaccaag tacctgaaagagaacttctcgccatggatccagcagcacggtggatggagtactgcatgc aaagagtttctaggtcatgctggagagaggggaacccacgcagagttcaccagctccctg agtagagaggatagagctaggagtccagaaagatcagagcagctgaagtttgcaagtctg ataaaggatgaaagagagaggcacacaaagctccagagacctgcagatatttccccttag >gi568815586f:11979306_12194931|GENSCAN_predicted_peptide_5|118_aa MSDAAVDTSSEITTEDLKEKKEVVEEAENGRDAPANRNANEENGEPEADNEVDEEEEEGG EEEEEEEGDGEEEDGDEDEGAESATGKRAAEDDEDDDVDTQKQKTDEDDQTAKKEKLN >gi568815586f:11979306_12194931|GENSCAN_predicted_CDS_5|357_bp atgtcagacgcagccgtagacaccagctccgaaatcaccaccgaggacttaaaggagaag aaggaagttgtggaagaggcggaaaatggaagagacgcccctgctaacaggaatgctaat gaggaaaatggggagccggaggctgacaacgaggtagatgaagaagaggaagaaggtggg gaggaagaggaggaggaagaaggtgatggtgaggaagaggacggagatgaagatgaggga gctgagtcagctacgggcaagcgggcagctgaagatgatgaggatgacgatgtcgatacc cagaagcagaagaccgacgaggatgaccagacagcaaaaaaggaaaagttaaactaa >gi568815586f:11979306_12194931|GENSCAN_predicted_peptide_6|1463_aa XQAVVKGSLPHPFALTLFEDILYWTDWSTHSILACNKYTGEGLREIHSDIFSPMDIHAFS QQRQPNATNPCGIDNGGCSHLCLMSPVKPFYQCACPTGVKLLENGKTCKDGATELLLLAR RTDLRRISLDTPDFTDIVLQLEDIRHAIAIDYDPVEGYIYWTDDEVRAIRRSFIDGSGSQ FVVTAQIAHPDGIAVDWVARNLYWTDTGTDRIEVTRLNGTMRKILISEDLEEPRAIVLDP MVGYMYWTDWGEIPKIERAALDGSDRVVLVNTSLGWPNGLALDYDEGKIYWGDAKTDKIE VMNTDGTGRRVLVEDKIPHIFGFTLLGDYVYWTDWQRRSIERVHKRSAEREVIIDQLPDL MGLKATNVHRVIGSNPCAEENGGCSHLCLYRPQGLRCACPIGFELISDMKTCIVPEAFLL FSRRADIRRISLETNNNNVAIPLTGVKEASALDFDVTDNRIYWTDISLKTISRAFMNGSA LEHVVEFGLDYPEGMAVDWLGKNLYWADTGTNRIEVSKLDGQHRQVLVWKDLDSPRALAL DPAEGFMYWTEWGGKPKIDRAAMDGSERTTLVPNVGRANGLTIDYAKRRLYWTDLDTNLI ESSNMLGLNREVIADDLPHPFGLTQYQDYIYWTDWSRRSIERANKTSGQNRTIIQGHLDY VMDILVFHSSRQSGWNECASSNGHCSHLCLAVPVGGFVCGCPAHYSLNADNRTCSDGSCP VKCKISLEVSALIYKSPGFLPPTRQNRLRVIHLVDQENFVYQIPDGNEVFCGISVDKMEK NRVGGYVISQKSAINRMVIDEQQSPDIILPIHSLRNVRAIDYDPLDKQLYWIDSRQNMIR KAQEDGSQGFTVVVSSVPSQNLEIQPYDLSIDIYSRYIYWTCEATNVINVTRLDGRSVGV VLKGEQDRPRAVVVNPEKGYMYFTNLQERSPKIERAALDGTEREVLFFSGLSKPIALALD SRLGKLFWADSDLRRIESSDLSGANRIVLEDSNILQPVGLTVFENWLYWIDKQQQMIEKI DMTGREGRTKVQARIAQLSDIHAVKELNLQEYRQHPCAQDNGGCSHICLVKGDGTTRCSC PMHLVLLQDELSCGEPPTCSPQQFTCFTGEIDCIPVAWRCDGFTECEDHSDELNCPVCSE SQFQCASGQCIDGALRCNGDANCQDKSDEKNCEVLCLIDQFRCANGQCIGKHKKCDHNVD CSDKSDELDCYPTEEPAPQATNTVGSVIGVIVTIFVSGTVYFICQRMLCPRMKGDGETMT NDYVVHGPASVPLGYVPHPSSLSGSLPGMSRGKSMISSLSIMGGSSGPPYDRAHVTGASS SSSSSTKGTYFPAILNPPPSPATERSHYTMEFGYSSNSPSTHRSYSYRPYSYRHFAPPTT PCSTDVCDSDYAPSRRMTSVATAKGYTSDLNYDSEPVPPPPTPRSQYLSAEENYESCPPS PYTERSYSHHLYPPPPSPCTDSS >gi568815586f:11979306_12194931|GENSCAN_predicted_CDS_6|4392_bp nngcaggcagtggttaaaggttcccttccacatccttttgccttgacgttatttgaggac atattgtactggactgactggagcacacactccattttggcttgcaacaagtatactggt gagggtctgcgtgaaatccattctgacatcttctctcccatggatatacatgccttcagc caacagaggcagccaaatgccacaaatccatgtggaattgacaatgggggttgttcccat ttgtgtttgatgtctccagtcaagcctttttatcagtgtgcttgccccactggggtcaaa ctcctggagaatggaaaaacctgcaaagatggtgccacagaattattgcttttagctcga aggacagacttgagacgcatttctttggatacaccagattttacagacattgttctgcag ttagaagacatccgtcatgccattgccatagattacgatcctgtggaaggctacatctac tggactgatgatgaagtgagggccatacgccgttcatttatagatggatctggcagtcag tttgtggtcactgctcaaattgcccatcctgatggtattgctgtggactgggttgcacga aatctttattggacagacactggcactgatcgaatagaagtgacaaggctcaatgggacc atgaggaagatcttgatttcagaggacttagaggaaccccgggctattgtgttagatccc atggttgggtacatgtattggactgactggggagaaattccgaaaattgagcgagcagct ctggatggttctgaccgtgtagtattggttaacacttctcttggttggccaaatggttta gccttggattatgatgaaggcaaaatatactggggagatgccaaaacagacaagattgag gttatgaatactgatggcactgggagacgagtactagtggaagacaaaattcctcacata tttggatttactttgttgggtgactatgtttactggactgactggcagaggcgtagcatt gaaagagttcataaacgaagtgcagagagggaagtgatcatagatcagctgcctgacctc atgggcctaaaggctacaaatgttcatcgagtgattggttccaacccctgtgctgaggaa aacgggggatgtagccatctctgcctctatagacctcagggccttcgctgtgcttgccct attggctttgaactcatcagtgacatgaagacctgcattgtcccagaggctttccttttg ttttcacggagagcagatatcagacgaatttctctggaaacaaacaataataatgtggct attccactcactggtgtcaaagaagcttctgctttggattttgatgtgacagacaaccga atttattggactgatatatcactcaagaccatcagcagagcctttatgaatggcagtgca ctggaacatgtggtagaattcggcttagattatccagaaggcatggcagtagactggctt gggaagaacttgtactgggcagacacaggaacgaatcgaattgaggtgtcaaagttggat gggcagcaccgacaagttttggtgtggaaagacctagatagtcccagagctctcgcgttg gaccctgccgaaggatttatgtattggactgaatggggtggaaaacctaagatagacaga gctgcaatggatggaagtgaacgtactaccttagttccaaatgtggggcgggcaaacggc ctaactattgattatgctaaaaggaggctttattggacagacctggacaccaacttaata gaatcttcaaatatgcttgggctcaaccgtgaagttatagcagatgacttgcctcatcct tttggcttaactcagtaccaagattatatctactggacggactggagccgacgcagcatt gagcgtgccaacaaaaccagtggccaaaaccgcaccatcattcagggccatttggattat gtgatggacatcctcgtctttcactcatctcgacagtcagggtggaatgaatgtgcttcc agcaatgggcactgctcccacctctgcttggctgtgccagttgggggttttgtttgtgga tgccctgcccactactctcttaatgctgacaacaggacttgtagtgatgggtcttgccct gtcaagtgtaaaatcagtctagaagtttctgctttaatttataaaagcccaggcttcctg ccacccaccagacagaataggttaagggttattcacttggtagaccaggagaattttgta tatcaaatacctgatggaaatgaagttttctgtggtatctctgtggacaagatggagaaa aacagggtaggtggctatgtcattagtcaaaagagtgccatcaaccgcatggtgattgat gaacaacagagccccgacatcatccttcccatccacagccttcggaatgtccgggccatt gactatgacccactggacaagcaactctattggattgactcacgacaaaacatgatccga aaggcacaagaagatggcagccagggctttactgtggttgtgagctcagttccgagtcag aacctggaaatacaaccctatgacctcagcattgatatttacagccgctacatctactgg acttgtgaggctaccaatgtcattaatgtgacaagattagatgggagatcagttggagtg gtgctgaaaggcgagcaggacagacctcgagccgttgtggtaaacccagagaaagggtat atgtattttaccaatcttcaggaaaggtctcctaaaattgaacgggctgctttggatggg acagaacgggaggtcctctttttcagtggcttaagtaaaccaattgctttagcccttgat agcaggctgggcaagctcttttgggctgattcagatctccggcgaattgaaagcagtgat ctctcaggtgctaaccggatagtattagaagactccaatatcttgcagcctgtgggactt actgtgtttgaaaactggctctattggattgataaacagcagcaaatgattgaaaaaatt gacatgacaggtcgagagggtagaaccaaagtccaagctcgaattgcccagcttagtgac attcatgcagtaaaggagctgaaccttcaagaatacagacagcacccttgtgctcaggat aatggtggctgttcacatatttgtcttgtaaagggggatggtactacaaggtgttcttgc cccatgcacctggttctacttcaagatgagctatcatgtggagaacctccaacatgttct cctcagcagtttacttgtttcacgggggaaattgactgtatccctgtggcttggcggtgc gatgggtttactgaatgtgaagaccacagtgatgaactcaattgtcctgtatgctcagag tcccagttccagtgtgccagtgggcagtgtattgatggtgccctccgatgcaatggagat gcaaactgccaggacaaatcagatgagaagaactgtgaagtgctttgtttaattgatcag ttccgctgtgccaatggtcagtgcattggaaagcacaagaagtgtgatcataatgtggat tgcagtgacaagtcagatgaactggattgttatccgactgaagaaccagcaccacaggcc accaatacagttggttctgttattggcgtaattgtcaccatttttgtgtctggaactgta tactttatctgccagaggatgttgtgtccacgtatgaagggagatggggaaactatgact aatgactatgtagttcatggaccagcttctgtgcctcttggttatgtgccacacccaagt tctttgtcaggatctcttccaggaatgtctcgaggtaaatcaatgatcagctccctcagt atcatggggggaagcagtggacccccctatgaccgagcccatgttacaggagcatcatca agtagttcttcaagcaccaaaggcacttacttccctgcaattttgaaccctccaccatcc ccagccacagagcgatcacattacactatggaatttggatattcttcaaacagtccttcc actcataggtcatacagctacaggccatatagctaccggcactttgcaccccccaccaca ccctgcagcacagatgtttgtgacagtgactatgctcctagtcggagaatgacctcagtg gcaacagccaagggctataccagtgacttgaactatgattcagaacctgtgcccccacct cccacaccccgaagccaatacttgtcagcagaggagaactatgaaagctgcccaccttct ccatacacagagaggagctattctcatcacctctacccaccgccaccctctccctgtaca gactcctcctga