GENSCAN 1.0	Date run: 7-Nov-116	Time: 22:15:46

Sequence gi568815595f:131281904_131483338 : 201435 bp : 39.79% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.03 PlyA -    854    849    6                               1.05
 1.02 Term -  16290  15697  594  0  0   75   49   169 0.991   5.24
 1.01 Init -  16546  16346  201  1  0   60   86   126 0.978   8.52
 1.00 Prom -  18607  18568   40                              -5.25

 2.10 PlyA -  18724  18719    6                               1.05
 2.09 Term -  22907  22208  700  1  1   55   42   377 0.251  21.82
 2.08 Intr -  28824  28747   78  2  0   89   70    49 0.067   0.95
 2.07 Intr -  40967  40783  185  2  2   52   58   112 0.246   2.26
 2.06 Intr -  47103  46945  159  0  0   65  103   203 0.917  18.76
 2.05 Intr -  48131  48063   69  2  0   89  115    39 0.928   5.16
 2.04 Intr -  49982  49856  127  1  1   65   39    93 0.332   1.86
 2.03 Intr -  50434  50339   96  0  0   27   56   153 0.002   4.21
 2.02 Intr -  53125  53083   43  2  1   51  111    52 0.000   0.18
 2.01 Init -  62536  62401  136  1  1   73   37   166 0.732  10.35
 2.00 Prom -  63926  63887   40                              -6.65

 3.00 Prom +  65632  65671   40                              -8.95
 3.01 Sngl +  67661  67873  213  1  0   85   48   261 0.120  16.73
 3.02 PlyA +  68508  68513    6                               1.05

 4.00 Prom +  77421  77460   40                              -2.35
 4.01 Init +  80089  80165   77  0  2   69   81   130 0.256   9.01
 4.02 Intr +  80253  80538  286  0  1   55   59   445 0.276  34.72
 4.03 Intr +  81287  81397  111  1  0   42   57    93 0.263   1.26
 4.04 Intr +  90444  90546  103  2  1  110   97    30 0.018   4.93
 4.05 Term +  94615  94688   74  2  2   88   42    43 0.010  -3.21
 4.06 PlyA +  95182  95187    6                               1.05

 5.00 Prom +  97298  97337   40                              -2.05
 5.01 Init +  99902 100039  138  1  0  103   85   250 0.969  24.39
 5.02 Intr + 100143 100412  270  2  0  127   59   518 0.999  49.62
 5.03 Term + 101259 101438  180  2  0  135   44   140 0.999  10.93
 5.04 PlyA + 102759 102764    6                               1.05

 6.00 Prom + 108150 108189   40                              -4.55
 6.01 Init + 115367 115388   22  1  1   74  106     9 0.007   0.92
 6.02 Intr + 126258 128427 2170  1  1   46   53   740 0.032  52.67
 6.03 Intr + 139722 139815   94  0  1   96  110    35 0.639   5.55
 6.04 Term + 150789 151103  315  2  0   53   42   189 0.754   4.76
 6.05 PlyA + 151263 151268    6                               1.05

 7.05 PlyA - 151425 151420    6                               1.05
 7.04 Term - 159946 159888   59  2  2  129   49    59 0.135   2.97
 7.03 Intr - 180972 180846  127  0  1   91   78   127 0.418  11.33
 7.02 Intr - 189376 189268  109  0  1   56   59    79 0.412   1.17
 7.01 Init - 195497 195433   65  2  2   10   97    88 0.368   2.57

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  67654  67873  220  1  1   63   48   276 0.878  16.63

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595f:131281904_131483338|GENSCAN_predicted_peptide_1|264_aa
MSEFPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLNEIKENTKKWKNIPCSWVGRINIV
KMAILPKNWKKTTLKFIWNQKRARITKSILSQKNKAGGIRLPDFKLYYKATVTKTAWYWY
QNRDIDQWNRTEPSEITPHIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAICRKPKLDP
FLTPYTKINSRWIKDLNIRPETIKTLEENLGITIQDIGMGKDFMSKTPKAMATKAKIDKW
DLIKLKSFCTAKETTIRVNRKPTK

>gi568815595f:131281904_131483338|GENSCAN_predicted_CDS_1|795_bp
atgagtgaattcccattcacaattgcttcaaagagaataaaatacctaggaatccaactt
acaagggacgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaa
gagaatacaaagaaatggaagaacattccatgctcatgggtaggaagaatcaatatcgtg
aaaatggccatactgcccaagaattggaaaaaaactactttaaagttcatatggaaccaa
aaaagagcccgcatcaccaagtcaatcctaagccaaaagaacaaagctggaggcatcagg
ctacctgacttcaaactatactacaaggctacagtaaccaagacagcatggtactggtac
caaaacagagatatagatcaatggaacagaacagagccctcagaaataacgccgcatatc
tacaactatctgatctttgacaaacccgagaaaaacaagcaatggggaaaggattcccta
tttaataaatggtgctgggaaaactggctagccatatgtagaaagccgaaactggatccc
ttccttacaccttacacaaaaattaattcaagatggattaaagacttaaacattagacct
gaaaccataaaaaccttagaagaaaacctaggcattaccattcaggacataggcatgggc
aaggacttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgg
gatctcattaaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacagg
aaacctacaaaatag

>gi568815595f:131281904_131483338|GENSCAN_predicted_peptide_2|530_aa
MRKSLELLRDWLNGCDQNADSDMNSEVQADKVSDGNEELIGNWSKEFGCESIWSRTLFGC
SGGCRTADFREPRMLLSDRSSGSFVSEEYLAILISDRCASNQRDSVVVGPSEPVVGYNLL
VHRFLSPSEKRSIREPLDSFPASTLTTLQSIVSSAATAIWSENEVCPELLEAPAALLPME
SSEDDSRVPQLLGGDKQPTQCCSADATVCHKERFKQGESKDNEGERGPLHQCSVCDPQGQ
TDNLEKRSQGHHKDALLKPQGNVIPPAALHHIYTLYFFSADVLEAAKKHISLQLTLKRTL
WPAPTIGMLWLADREHLNTPLVQQVPNLKGPENKAQALVPASRVRAHSSAYSINKARCLK
PPCTTIKLPRASKTIKAKNPSTRKQLQRLKEHQPTNMRKNQHKNSGNSKSQSVFLPPNNC
TSSPAMILNLAEMTDVEFRIWIEKIIEIQEKVKTQSKKSKEYNKTIQELKYEMAILRKDQ
TDLIELKNSLPEFHNTIANSNTRIDQAVEKISKLKCWFSKLTQTKIKKKG

>gi568815595f:131281904_131483338|GENSCAN_predicted_CDS_2|1593_bp
atgaggaaaagtttagaacttcttagagactggttaaatggttgtgaccaaaatgctgat
agtgatatgaacagtgaagtccaggctgacaaggtgtcagatggaaatgaggaacttatt
gggaactggagcaaagaattcggctgtgaatccatctggtcccggactctttttggttgc
agcggtggctgcagaacagcggattttcgtgaaccacgaatgctgctgtctgatcgttcc
tctggaagttttgtctcagaggagtacctggccattttgatctcagaccgctgtgctagc
aatcagcgagactctgtggttgtaggaccctccgagccagttgtgggatataatctccta
gtgcaccgttttttaagcccgtcggaaaagcgcagtattagggagccactcgattcgttt
cctgcttctaccctcactactctacagtctattgtcagctcagcagccacggccatctgg
agtgaaaatgaagtctgcccagagctgctggaagccccagctgccctgctgcccatggag
tcctctgaggacgactccagagtccctcaactccttggaggagataagcagccaacacag
tgttgttcagcagatgccaccgtctgccataaggaaagattcaaacaaggagagagcaag
gacaatgaaggggagcgggggcctcttcatcagtgctcagtatgtgacccacagggtcag
actgacaatttagaaaagaggagccaaggccatcacaaggatgctctgctgaagcctcaa
ggaaacgtgatcccaccagcagcattgcatcacatatacacactgtattttttttctgca
gatgttttagaagcagccaagaaacacatctccctgcagctgacactcaaaagaactctt
tggccagcaccaactattggcatgttgtggctagcagaccgggaacacctcaacaccccc
ctagtgcagcaggttcctaacctcaagggtccagagaacaaagctcaggcactggtacca
gcctctagagttagagcacacagctcagcttattccataaacaaagccagatgcctgaaa
ccaccctgtaccacaatcaaactcccaagggcatcaaagacgataaaagcaaaaaaccca
tccacaagaaagcaacttcaaagattgaaggaacatcagcccacaaatatgagaaagaac
cagcacaagaactctggcaactcaaaaagtcagagtgtcttcttacctccaaataactgc
acaagttccccagcaatgattcttaacttggctgaaatgacagacgtagaattcagaata
tggatagaaaagatcattgagattcaggagaaagtcaaaacccaatccaagaaatctaag
gaatacaataaaacaatacaggaactgaaatatgaaatggccattttaagaaaggaccaa
actgatctgatagagctgaaaaactcacttccagaatttcataatacaattgcaaatagt
aataccagaatagaccaagctgtggaaaaaatttcaaagctcaaatgctggttctccaaa
ttaactcagacaaaaataaagaaaaaaggataa

>gi568815595f:131281904_131483338|GENSCAN_predicted_peptide_3|70_aa
MQKLGTEVFEEVYNYLKRARHQNASEAEIRECLEKVVPQASDCFEVDQLLYFEEQLLITM
GKEPTLQNHL

>gi568815595f:131281904_131483338|GENSCAN_predicted_CDS_3|213_bp
atgcagaagctggggacagaagtatttgaagaggtctataattacctcaagagagcaagg
catcagaatgctagcgaagcagagatccgcgagtgtttggaaaaagtggtgcctcaagcc
agcgactgttttgaagtggaccagctcctgtactttgaagagcagttgctgatcacgatg
ggaaaagaacctactctccagaaccatctctag

>gi568815595f:131281904_131483338|GENSCAN_predicted_peptide_4|216_aa
MPAMRSCTRRTLEFSLAASLCATPYCFLLSQMQMRFDGRLGFPGGFVDSQDSSLEDGLNR
GLLELLGEAAAAFRVERPDYRSSHAGSRPRVVAHFYAKSLTLEQLLAVEASATGAKDHGL
ERVEPTGPPLDRSLGLVSAGVLRVVASALERSAKERMEVSEKRNQWVYNILLSSILLLQH
PVIPGCGCREKPVYLSKSVVLNLSSAFKASGKSDGD

>gi568815595f:131281904_131483338|GENSCAN_predicted_CDS_4|651_bp
atgcctgctatgcgctcctgcacgcgccggaccctggaattctctctggccgcctccctc
tgcgctacgccatactgcttcctcctctctcagatgcagatgcgctttgatgggcgcctg
ggcttccctggcggattcgtggactcgcaagacagcagcctggaggacgggctgaaccgt
ggtctgctggaactgctgggcgaggcggcggccgccttccgcgtggagcgccctgactac
cgcagctctcacgccggatcaaggccacgtgttgtggcccacttctatgccaaatctctg
acgctcgagcagctgttggctgtggaggccagcgcaacaggggccaaggaccacgggctg
gagagggttgagcccactgggcctcctttagataggagccttgggcttgtttctgctgga
gtattaagagttgtggcatctgctctagaaagatctgctaaagagaggatggaggtatct
gaaaaacggaaccaatgggtttacaacatcctgttatccagcatcctgttattacaacat
cctgtaattccaggctgtggatgtagggaaaagccagtatatttgtctaaatcagtggtt
ctcaaccttagctctgcattcaaagcatctggaaaatctgatggggattaa

>gi568815595f:131281904_131483338|GENSCAN_predicted_peptide_5|195_aa
MAGARRLELGEALALGSGWRHACHALLYAPDPGMLFGRIPLRYAILMQMRFDGRLGFPGG
FVDTQDRSLEDGLNRELREELGEAAAAFRVERTDYRSSHVGSGPRVVAHFYAKRLTLEEL
LAVEAGATRAKDHGLEVLGLVRVPLYTLRDGVGGLPTFLENSFIGSAREQLLEALQDLGL
LQSGSISGLKIPAHH

>gi568815595f:131281904_131483338|GENSCAN_predicted_CDS_5|588_bp
atggccggagcccgcaggctggagctaggcgaggccctggcgctggggtcgggctggcgt
catgcgtgccacgctctcctctacgcgccggaccctgggatgctcttcggccgcatcccg
ctgcgctacgccatactgatgcagatgcgcttcgatggacgcctgggcttccccggcgga
ttcgtggacacgcaggacagaagcctagaggacgggctgaaccgcgagctgcgcgaggag
ctgggcgaagcggctgccgctttccgcgtggagcgcactgactaccgcagctcccacgtc
gggtcagggccacgcgttgtggcccacttctatgccaagcgtctgacgctcgaggagctg
ttggctgtggaggccggcgcaacacgcgccaaggaccacgggctggaggtgctgggcctg
gtgcgagtgcccctgtataccctgcgggatggtgtaggaggcctgcctaccttcctggag
aattcctttattggctctgcgcgggagcagttacttgaagctctccaggacttgggactg
ctgcagtctggctctatttcaggccttaagattccagctcatcactag

>gi568815595f:131281904_131483338|GENSCAN_predicted_peptide_6|866_aa
MDRKKRVEIQTTIREYYKHLYTNKLENLEEMDTFLDTYTLPRLNEEEVESLNRPITGSEI
VAIINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSIEKEGILPNSFYEASIILIP
KPGRDTTKKENFRPISLMNIDAKILNKILANRIQQHIKKLIHHDQVGFIPGMQGWFNIRK
SINVIQHINRAKDKNHMIISIDAEKAFDKIQQPFMLKTLNKLGIDGTYFKIIRAIYDKPT
ANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLS
LFADDMIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELP
FTIASKRIKYLGIQLTRDVKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGRINIVKMAIL
PKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIAKSILSQKNKAGGITLPDFKLY
YKATVTKTAWYWYQNRDIDQWNRTEPSEIMPHIYNYLIFDKPEKNKQWGKDSLFNKWCWE
NWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGITIQDIGVGKDFMSKT
PKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTTWEKIFATYSSDKGLISRIYNELK
QIYKKKTNNPIKKWAKDMNRHFSKEDIYAAKKHMKKCSSSLAIREMQIKTTMRYHLTPVR
MAIIKKSGNNRLMGSSTATWVFSLQWVSENLCHKFWAWGSGQTSKTSQEGQGQTSPDCKD
YNKYLTLQCPDISEHPQALTIQKNMTSPNILNKAPVTNPRVTKICDFSDRKFKIAVLRKL
NEIQDNTEKHSRILSDKLNKEIEIIF

>gi568815595f:131281904_131483338|GENSCAN_predicted_CDS_6|2601_bp
atggacaggaagaaaagggtggaaatacaaactaccatcagagaatactacaaacacctc
tacacaaataaactagaaaatctagaagaaatggatacattcctcgacacatacactctc
ccaagactaaacgaggaagaagttgaatctctgaatagaccaataacaggctctgaaatt
gtggcaataatcaatagtttaccaaccaaaaagagtccaggaccagatggattcacagcc
gaattctaccagaggtacaaggaggaactggtaccattccttctgaaactattccaatca
atagaaaaagagggaatcctccctaactcattttatgaggccagcatcattctgatacca
aagccgggcagagacacaaccaaaaaagagaattttagaccaatatccttgatgaacatt
gatgcaaaaatcctcaataaaatactggcaaaccgaatccagcagcacatcaaaaagctt
atccaccatgatcaagtgggcttcatccctgggatgcaaggctggttcaatatacgcaaa
tcaataaatgtaatccagcatataaacagagccaaagacaaaaaccacatgattatctca
atagatgcagaaaaagcctttgacaaaattcaacaacccttcatgctaaaaactctcaat
aaattaggtattgatgggacgtatttcaaaataataagagctatctatgacaaacccaca
gccaatatcatactgaatgggcaaaaactggaagcattccctttgaaaactggcacaaga
cagggatgccctctctcaccgctcctattcaacatagtgttggaagttctggccagggca
atcaggcaggagaaggaaataaagggtattcaattaggaaaagaggaagtcaaattgtcc
ctgtttgcagacgacatgattgtttatctagaaaaccccatcgtctcagcccaaaatctc
cttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatca
caagcattcttatacaccaacaacagacaaacagagagccaaatcatgagtgaactccca
ttcacaattgcttcaaagagaataaaatacctaggaatccaacttacaagggatgtgaag
gacctcttcaaggagaactacaaaccactgctcaaggaaataaaagaggacacaaacaaa
tggaagaacattccatgctcatgggtaggaagaatcaatatcgtgaaaatggccatactg
cccaaggtaatttacagattcaatgccatccccatcaagctaccaatgactttcttcaca
gaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcccgcatcgccaag
tcaatcctaagccaaaagaacaaagcgggaggcatcacactacctgacttcaaactatac
tacaaggctacagtaaccaaaacagcatggtactggtaccaaaacagagatatagatcaa
tggaacagaacagagccctcagaaataatgccacatatctacaactatctgatctttgac
aaacctgagaaaaacaagcaatggggaaaggattccctatttaataaatggtgctgggaa
aactggctagccatatgtagaaagctgaaactggatcccttccttacaccttatacaaaa
atcaattcaagatggattaaagatttaaacgttagacctaaaaccataaaaaccctagaa
gaaaacctaggcattaccattcaggacataggcgtgggcaaggacttcatgtccaaaaca
ccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaattaaactaaagagc
ttctgcacagcaaaagaaactaccatcagagtgaacaggcaacctacaacatgggagaaa
attttcgcaacctactcatctgacaaagggctaatatccagaatctacaatgaactcaaa
caaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggcgaaggacatgaacaga
cacttctcaaaagaagacatttatgcagccaaaaaacacatgaagaaatgctcatcatca
ctggccatcagagaaatgcaaatcaaaaccactatgagatatcatctcacaccagttaga
atggcaatcattaaaaagtcaggaaacaacaggttgatgggatcttctacggccacttgg
gtgttctcccttcagtgggtatcagagaatctttgccataaattttgggcatggggtagt
ggacagacatcaaagacttctcaagaaggacaggggcaaacaagcccagactgcaaagac
tacaataaatacctaactcttcaatgcccagacatcagtgaacatccacaagcattgacc
atccagaaaaacatgacctcaccaaacatactaaataaggcaccagtgaccaatcctaga
gtgacaaagatatgtgacttttcagacagaaaattcaaaatagctgttttgaggaagctc
aatgaaattcaagataacacagagaaacattccagaatcctgtcagataaacttaacaaa
gagattgaaataattttttaa

>gi568815595f:131281904_131483338|GENSCAN_predicted_peptide_7|119_aa
MREGEAVCVPVKDEEGYEGKRSIGKGFQGVMKRWGFKGQPATHGQTKTHRRPGAVATGVK
DSKLPAYKDLGKNLPFPTYFPDGDEEELPEDLYDENVCQPGPPNPSTVIAPTDPVAKAP

>gi568815595f:131281904_131483338|GENSCAN_predicted_CDS_7|360_bp
atgagagaaggagaagcagtttgtgttccagtgaaagacgaggagggatatgaagggaaa
agaagtattggtaaaggttttcaaggtgtcatgaaaagatggggatttaaaggccagcct
gctacgcatggtcaaacgaaaacccacaggagacctggagctgttgcaactggtgtcaaa
gattctaaactgcctgcatataaggatctcggtaaaaatctaccattccctacatatttt
cctgatggagatgaagaggaactgccagaagatttgtatgatgaaaacgtgtgtcagccc
gggcctcccaaccccagcactgtgattgcccccacagacccagtagcaaaggctccatag