GENSCAN 1.0 Date run: 6-Nov-116 Time: 04:05:41 Sequence gi568815594f:164940073_165141485 : 201413 bp : 43.51% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 3218 3835 618 1 0 70 87 618 0.093 55.25 1.02 Intr + 8125 9063 939 0 0 29 44 341 0.020 15.17 1.03 Term + 12099 12212 114 2 0 56 36 120 0.664 2.17 1.04 PlyA + 13236 13241 6 1.05 2.07 PlyA - 13411 13406 6 1.05 2.06 Term - 17020 16727 294 1 0 19 49 160 0.435 0.51 2.05 Intr - 17352 17100 253 2 1 18 65 179 0.079 6.04 2.04 Intr - 38118 38038 81 2 0 56 97 98 0.278 6.25 2.03 Intr - 56699 56577 123 1 0 10 61 160 0.028 5.20 2.02 Intr - 57903 57800 104 0 2 49 76 40 0.014 -2.13 2.01 Init - 62635 62558 78 1 0 46 98 11 0.011 -1.04 2.00 Prom - 63466 63427 40 -7.26 3.00 Prom + 65638 65677 40 -3.76 3.01 Init + 66250 66387 138 0 0 50 58 106 0.689 3.94 3.02 Intr + 67307 67478 172 1 1 45 115 137 0.524 11.62 3.03 Intr + 67641 67791 151 1 1 16 105 112 0.571 4.92 3.04 Intr + 69333 69402 70 0 1 55 89 27 0.889 -1.32 3.05 Intr + 70202 70329 128 1 2 20 77 123 0.483 3.88 3.06 Intr + 70557 70773 217 0 1 34 58 112 0.646 1.31 3.07 Term + 71079 71318 240 2 0 57 44 126 0.098 1.13 3.08 PlyA + 72036 72041 6 1.05 4.04 PlyA - 72272 72267 6 1.05 4.03 Term - 73662 73540 123 0 0 87 43 65 0.049 0.28 4.02 Intr - 86844 86324 521 1 2 89 87 155 0.592 8.17 4.01 Init - 87878 87731 148 2 1 61 30 99 0.266 1.76 4.00 Prom - 93966 93927 40 -5.36 5.00 Prom + 94835 94874 40 -5.96 5.01 Sngl + 100001 101416 1416 1 0 62 49 323 0.928 21.74 5.02 PlyA + 101651 101656 6 1.05 6.00 Prom + 117353 117392 40 -6.46 6.01 Sngl + 119076 120482 1407 2 0 53 48 479 0.740 34.40 6.02 PlyA + 123041 123046 6 1.05 7.04 PlyA - 124515 124510 6 1.05 7.03 Term - 139724 139586 139 1 1 79 54 116 0.302 4.74 7.02 Intr - 145648 145514 135 0 0 41 110 114 0.321 8.58 7.01 Init - 154416 154337 80 0 2 86 113 46 0.469 5.44 7.00 Prom - 166821 166782 40 -1.36 8.00 Prom + 169784 169823 40 -5.96 8.01 Init + 175527 175529 3 2 0 108 101 0 0.123 3.30 8.02 Intr + 184722 184842 121 2 1 75 94 25 0.254 1.87 8.03 Term + 186014 186114 101 0 2 47 46 132 0.804 3.19 8.04 PlyA + 189463 189468 6 1.05 9.03 PlyA - 189761 189756 6 1.05 9.02 Term - 191765 191661 105 2 0 55 51 125 0.739 3.91 9.01 Init - 200148 200146 3 0 0 108 81 0 0.418 1.30 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 3218 3865 648 1 0 70 41 630 0.896 52.68 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594f:164940073_165141485|GENSCAN_predicted_peptide_1|556_aa MPCEATETVPATEQELPQPQAETGSGTESDSDESVPEPEEQDSTQTSTQEAQLVAAAEID EEPVSKAKQIWSEKKAQKAMSKLGLRQVTGVTRVAIRKSKNILFVITKPNVCKSPALDTY IVSGEAKIEDLSEQAQLAAAEKFNVQGEAVSNIQENTQTPTVQKESEEEEVDETGVEVKD IELVMSQANVWRAKAVHALKNNSNDICWEDLQDVLPCPVCLRHCPDENLRSNTQLCHMTD MVQQLLTMRSKRKWQEEEPLCGKHSQRLALFCEKGLELLCPQCRVSSDHQYHRLMPIEQA AARNRRKLESYIKPLKKETEHAKMRCEVPILSSLNVKRKMATWRKELQSEFKEIKYFLVK KQAAVHARLLTEEKDDKEKLTENRRQISDHLSTLQNLLNEVTEKCFRADLDVLTGVENTY NTYDNLKTPAVFSYKLKKESLSFPPHYFGLQRIISTFQEDLMLDPETAHPSLIISRDRKN VIFRMRKPHFTDNPPSFSFYPAVWSCEGFDAGRHFWQVEDILNKQKDSKVSQWMELQAIF SELEPLTNDQPNMQFY >gi568815594f:164940073_165141485|GENSCAN_predicted_CDS_1|1671_bp atgccctgtgaagccacagaaaccgtccctgctacagagcaggagttgccgcagccgcag gctgagacagggtctggaacagaatctgacagtgatgaatcagtaccagagcctgaagaa caggattccacccagacgtccacacaagaagcccagctggtggcagcagctgaaattgat gaagagccagtcagtaaagcaaaacagatttggagtgaaaagaaggcacagaaggctatg tccaaactgggtcttcgacaggttacaggagtcactagagtcgctatccggaaatctaag aatatcctttttgtcatcacaaaaccaaatgtctgcaagagccctgctttggatacctac atagtttctggggaagccaagatcgaagatttatctgagcaagcacaactagcagctgct gagaaattcaatgttcaaggtgaagctgtctcaaacattcaagaaaacacacaaactcca actgtacaaaaggagagtgaagaagaagaggtcgatgaaacaggtgtagaagttaaggac atagaattggtcatgtcacaagcaaatgtgtggagagcaaaggcagtccatgccctgaag aacaacagtaatgatatttgctgggaagacctacaagatgtcctcccctgtcctgtctgc ctccgtcactgccctgatgagaacctcaggagcaacacccagttgtgccacatgactgat atggttcagcagcttcttaccatgagaagcaagaggaaatggcaggaagaggagcccctg tgtgggaagcacagtcagcgtctggctctgttctgtgagaagggcctggagctgttgtgt ccccagtgcagggtctcctctgaccaccagtatcaccgcctgatgcccattgagcaagct gcagccagaaacaggagaaagctcgaaagctacattaagccactaaagaaggaaactgaa catgccaagatgcggtgtgaagtcccaattttgagttctcttaatgtgaaaaggaagatg gcaacatggaggaaggaattacaatctgaatttaaagaaattaaatatttcttggtaaag aaacaagctgcagttcatgcaagattacttactgaagagaaggatgataaagaaaaactc actgaaaaccgaagacaaatttcagaccacttatccacactacagaatctcttaaatgaa gtaacagagaagtgttttcgggcagacctggatgtgctgacaggtgttgagaacacctac aacacatatgacaacctgaaaacccctgcagtcttctcatacaaattaaagaaggagagt ttgagtttccctccacattattttggcctgcaaagaattataagcacatttcaagaagat ttgatgctagatccagaaacagcccaccctagtcttattatctcaagagatagaaaaaat gtgatatttaggatgaggaagccacactttactgataatcctccgtcatttagtttttac ccagctgtctggagctgtgagggctttgatgctgggagacacttttggcaagtagaagac attcttaacaagcaaaaggactccaaggtcagtcaatggatggaacttcaggccatcttc tcagaattggagcccctgaccaatgatcagcccaatatgcagttttactaa >gi568815594f:164940073_165141485|GENSCAN_predicted_peptide_2|310_aa MWQGHRIIVVRSADKHVNKGLCIIDKLICIYVNWRIGSPISSDLFLCPINPPSLKLYKLR GAQAYRKADCRRTQDLFVDLSSMTTLERTFTLEKSLGHQMKGLEDFDRGIDPTNASSSLK FRLKCHFLRGWDLMVVLFLTFPGGPSDLEWPGNFGGVTAGMLPLKFLSGHHERLTGHAEK TIRFSQDWWSDRCWTNVVVPGCLSKETLPWRPPGWLSRRWVRGEEDSCPAAPRILPSTVV HGAAPVSNVAPHLHGLTRKSKPRDGAGRTQSLLCRSDFRYGLAQPPGRNAAVLGVLKPLE SLVFCEKVAF >gi568815594f:164940073_165141485|GENSCAN_predicted_CDS_2|933_bp atgtggcagggtcataggataatagtggtgaggtcagcagataaacacgtgaacaaaggt ctctgcatcatagacaagttgatctgcatctatgttaactggaggattggcagccctatt tcttcggacctgttcttgtgtcctatcaatccaccaagtcttaaattgtacaaattgaga ggtgcacaagcttaccgtaaggctgactgtagacgtactcaggatctcttcgtcgacttg tcctcaatgaccacactcgagcgtaccttcaccctagagaaaagccttgggcaccagatg aagggccttgaagactttgaccgtggaattgatccaacgaatgcctcttcatccctcaaa ttccgcctgaaatgtcacttcctcagaggctgggacctgatggtggttctcttcctgact tttcctggtggtccttctgacttggagtggcctggcaatttcggtggcgtgacagccgga atgctgccactaaaattcctctctgggcaccatgagcggctgacagggcacgcggagaaa acgattcgcttctcccaggattggtggagcgaccgctgttggacgaatgtggtcgtccct ggatgtctgagcaaagagacactgccatggcggccgccgggctggctctccaggcgctgg gtccggggagaagaggacagctgccccgccgcaccgcgcatccttccatccactgtggta catggcgctgccccggtctccaacgtcgcgccacaccttcacggtctcacccggaagtcg aagcccagagacggtgccggccgcacccaatcactgctctgccgctccgacttccgctac ggactcgcccagcctcctgggagaaacgctgcagtgcttggggtgctgaagccccttgag agccttgtgttctgtgagaaagttgcattctga >gi568815594f:164940073_165141485|GENSCAN_predicted_peptide_3|371_aa MRILMDKSSPCKGPVAGEFPVLSVEIEVKGEMEIRGYQTVRSFGDMAPQGQGRKSSVDGA LSATLLLVHINLGLSADLSLVQECPRSTLLEVRVQDHSESSAPGLTLQNRDREKMEVAVT GTADIQTVTFTTVAGLVGELMALSPLLYNTVSGSPLKALVSLSGSQLPFQNQYLLLRLLA QLLYDLPQVKAELLSEAEAGDPRGRQRLTSTDRETFRTGSHQCPEGNFRSNTQLGRMIDI AKLLQRARGNDVRQDKMPLWEKHNQPLSVFCKEDLVVLCPLCTQTHDHQGHHVAEMSVMA DVKLLMDVRTVLHRCKACRPQLSTLCSSRRKETGFPCSTRLFRKSYRSLEKLLWILKVHI LICVSLRIRNV >gi568815594f:164940073_165141485|GENSCAN_predicted_CDS_3|1116_bp atgagaattctcatggacaaaagctccccatgcaaaggtcccgtggcaggagagttccct gtgctgagtgtggaaatagaggtgaagggtgagatggagattagggggtaccagactgtt aggtcctttggggacatggctcctcaggggcagggccgcaagtcctcggtggacggggcc ctaagtgcgaccttgctgctggtgcatataaatctgggtctcagcgctgacctcagtctg gtgcaagagtgtcctcggtcgacgctcttggaggtccgtgtccaagaccattctgaatca tctgctccaggattgacactacagaatcgggaccgggagaagatggaggtcgcagttact gggactgcagatatccaaacggtcacatttactacagtcgcagggttagtaggagagctc atggcactctccccgctgctctataacactgtgtcagggagtccgctgaaagctctagta agcctctcaggctctcagcttccatttcagaatcagtacctgcttctaagactacttgcc caactgctctatgacctgccacaggtgaaggcagagctgctctcagaagctgaggctggt gaccccagaggacgtcaacgactgacttcaacagacagggagaccttcaggacaggaagt caccaatgcccagagggaaacttcaggagcaacacccagctgggaaggatgattgacatt gccaagctactccagagagccagaggcaatgacgtcaggcaggacaagatgcccctgtgg gagaagcacaaccagcccctgagtgttttctgcaaggaggacctggtggtgttgtgtccc ctgtgcactcagacccatgaccaccaaggccaccatgtggccgagatgagtgtgatggca gatgtgaaactgctgatggatgtaaggaccgtcctgcacaggtgtaaggcctgcaggccc cagctgtccactctatgcagctccagaaggaaggaaacaggcttcccctgcagtactcgg ctcttcagaaaatcatacagaagtttagagaagttactctggatcctgaaagtgcacatc ctcatctgcgtgtctctgaggataagaaatgtgtga >gi568815594f:164940073_165141485|GENSCAN_predicted_peptide_4|263_aa MKFPAILLHAAQDKITPSFSVSMLQMFPTHESLPGYLNLTVTISQYLDSNPVSTHCGHHF CGSCIHQGCKDLQDVLPCPVCLHHCPDRNLKSNMQLHHMTDIVQQLPTMRNKRKGQEEEP LSEKHSQGLALFLEKGLEPFCPWCRVSDHQDQPLMPTEEAAAMHGRKFKSYLEPLKKQAE VAEMGCEMHISETFEVMGKAEKWRRDIFFEFEQLKYFLRKEHIGDLVIDDPALYFYEIYL SETLSGSRSEYQKNILCASHRGK >gi568815594f:164940073_165141485|GENSCAN_predicted_CDS_4|792_bp atgaaatttcctgccatcctgctccatgctgcccaggacaaaatcaccccttcattcagt gtatccatgctgcagatgttccccacccatgagtcacttcctggctatctcaacctaacc gtcacaatatcacagtacttggattcaaacccagtctccactcactgtgggcaccacttc tgtggctcctgcatccaccagggctgcaaagacctacaggacgtcctcccttgtcctgtc tgcctccaccactgccctgacaggaacctcaagagcaacatgcaattgcaccacatgact gatattgtccagcagcttcccaccatgaggaacaagaggaaagggcaggaagaggagccc ctgtctgagaaacacagtcagggtctggccctgttccttgagaagggcctggagcctttc tgtccttggtgcagggtctctgaccaccaggatcagcccctgatgcccactgaggaagct gcagctatgcatgggaggaagttcaaaagctaccttgagcctctgaaaaagcaagctgaa gttgctgaaatggggtgtgaaatgcacatttcagaaacttttgaagtgatggggaaggcg gaaaagtggaggagggacatattctttgaatttgaacaattaaagtatttcttgagaaaa gagcacattggggaccttgttattgatgacccagccctatatttctatgagatttacctc tcagaaactctatcaggttctcgcagtgaatatcagaaaaacatcctttgtgcttctcac agaggcaagtag >gi568815594f:164940073_165141485|GENSCAN_predicted_peptide_5|471_aa MEFVTALVNLQEESSCPICLEYLKDPVTINCGHNFCRSCLSVSWKDLDDTFPCPVCRFCF PYKSFRRNPQLRNLTEIAKQLQIRRSKRKRQKENAMCEKHNQFLTLFCVKDLEILCTQCS FSTKHQKHYICPIKKAASYHREILEGSLEPLRNNIERVEKVIILQGSKSVELKKKVEYKR EEINSEFEQIRLFLQNEQEMILRQIQDEEMNILAKLNENLVELSDYVSTLKHLLREVEGK SVQSNLELLTQAKSMHHKYQNLKCPELFSFRLTKYGFSLPPQYSGLDRIIKPFQVDVILD LNTAHPQLLVSEDRKAVRYERKKRNICYDPRRFYVCPAVLGSQRFSSGRHYWEVEVGNKP KWILGVCQDCLLRNWQDQPSVLGGFWAIGRYMKSGYVASGPKTTQLLPVVKPSKIGIFLD YELGDLSFYNMNDRSILYTFNDCFTEAVWPYFYTGTDSEPLKICSVSDSER >gi568815594f:164940073_165141485|GENSCAN_predicted_CDS_5|1416_bp atggagtttgtgacagccctggtgaacctccaagaggagtctagctgtcccatctgtctg gagtacttgaaagacccagtgaccatcaactgtgggcacaacttctgtcgctcctgcctc agtgtatcctggaaggatctagatgatacctttccctgtcctgtctgccgtttttgcttt ccatacaagagcttcaggaggaacccccagctccgtaatttgactgaaattgctaaacaa ctccagattaggaggagcaagagaaagaggcagaaagagaatgccatgtgtgaaaaacac aaccagtttctgaccctcttctgtgttaaagatctagagatcttatgtacacagtgcagt ttctccactaaacaccagaagcactacatttgccctattaagaaagctgcctcttatcac agagaaattctagaaggtagccttgagcccttgaggaataatatagaacgagttgaaaaa gtgataattctgcaaggcagcaaatcagtggagctgaaaaagaaggtagaatataagagg gaagaaataaattctgagtttgagcaaataagattgtttttacagaatgaacaagagatg attcttaggcagatacaagatgaagagatgaacattttagcaaaactaaatgaaaacctt gtagaactttcagattatgtttccacattaaaacatctactgagggaggtagagggcaag tctgtgcagtcaaacctggaattactgacacaagctaagagtatgcaccacaagtatcaa aacctaaaatgccctgaactcttttcatttagattaacaaaatatggtttcagtcttcct cctcaatattctggcttggacagaattatcaagccatttcaagtagatgtgattctagat ctcaacacagcacatcctcaacttcttgtctctgaggatagaaaagctgtgcgatatgaa agaaaaaaacgaaacatttgttatgacccaaggagattttatgtctgccctgctgtccta ggctctcagagatttagttctggccgacattactgggaagtagaagtgggaaacaaacct aaatggatattgggtgtgtgtcaagactgtcttcttaggaactggcaggatcagccatca gttctgggcggattctgggcaattgggcgatacatgaagagtggttatgttgcgtcaggt cctaagacaacccagcttctgccagtagtaaaacccagtaaaattggtatttttctggac tatgaattgggtgatctttccttttataatatgaatgataggtctattctctatactttt aacgattgtttcacagaagccgtttggccttatttctatactggaacagattccgaacct cttaaaatctgctcagtatcagattctgaaagataa >gi568815594f:164940073_165141485|GENSCAN_predicted_peptide_6|468_aa MAVAAALTGLQAEAKCSICLDYLSDPVTIECGHNFCRSCIQQSWLDLQELFPCPVCRHQC QEGHFRSNTQLGRMIEIAKLLQSTKSNKRKQEETTLCEKHNQPLSVFCKEDLMVLCPLCT QPPDHQGHHVRPIEKAAIHYRKRFCSYIQPLKKQLADLQKLISTQSKKPLELREMVENQR QELSSEFEHLNQFLDREQQAVLSRLAEEEKDNQQKLSANITAFSNYSATLKSQLSKVVEL SELSELELLSQIKIFYESENESSPSIFSIHLKRDGCSFPPQYSALQRIIKKFKVEIILDP ETAHPNLIVSEDKKRVRFTKRKQKVPGFPKRFTVKPVVLGFPYFHSGRHFWEIEVGDKSE WAIGICKDSLPTKARRPSSAQQECWRIELQDDGYHAPGAFPTPLLLEVKARAIGIFLDYE MGEISFYNMAEKSHICTFTDTFTGPLRPYFYVGPDSQPLRICTGTVCE >gi568815594f:164940073_165141485|GENSCAN_predicted_CDS_6|1407_bp atggcagtggcagcagctctgaccggactccaagcagaagctaagtgctccatctgtctg gattacctgagtgaccccgtcaccatcgaatgtgggcacaacttctgtcgttcctgcatc caacagtcctggctggatctacaggaattgttcccttgccctgtctgtcgtcaccagtgt caagaggggcacttcaggagcaacacccagctgggaaggatgattgaaattgccaagcta ctccagagcaccaagagtaataaaaggaagcaggaagagaccaccttgtgcgagaaacac aaccagcccctgagcgttttctgcaaggaggacctgatggtgttgtgtccgctgtgcact cagccccctgaccaccagggccaccatgtgaggcccatagagaaagctgccattcattat aggaaaagattctgcagttacatccagcccctgaaaaagcaattggcagacctccaaaaa ttaataagcactcaaagcaaaaaacccttagaactgagagagatggtggaaaaccaaagg caggaattatcctctgaatttgagcacctcaaccagtttttagaccgtgagcaacaggca gttctctccagattagctgaagaagagaaggacaatcaacagaaactcagtgcaaacata acagcattttcaaactacagtgccacactcaaaagccagttaagtaaggtagtagagctc agtgagctgtctgaactggaattgctgtcacaaattaaaattttctacgaatctgaaaat gagagtagcccatcgatcttttcaattcatttaaagagagatggctgcagttttcctccc caatattctgctctgcagagaattataaagaaatttaaagtagaaataattctagaccct gaaacagcacaccctaacctgattgtatctgaagataaaaaacgtgtgagatttacaaag agaaaacaaaaggttcctggtttcccaaaaagatttacagtcaagccagttgttctgggt tttccgtattttcattctggcaggcatttctgggagattgaagtgggggataagtcagaa tgggctattggcatttgcaaagattctcttcccacaaaggcgaggagaccctcatcagcc cagcaggaatgttggagaattgagctgcaagatgatggctatcatgcaccaggggctttt ccaacccctctgttgttagaggtgaaagccagggccattggcattttcctggactatgag atgggtgagatctcattctataacatggctgagaaatctcacatctgtactttcactgac acttttactgggcctcttcggccttatttctatgtaggaccagattcacaacctctcaga atctgtacagggacagtttgtgaatga >gi568815594f:164940073_165141485|GENSCAN_predicted_peptide_7|117_aa MGFLPVGQAGLELSTSGATASLESLKRIRSNNDFIILVKIRRFNKAKPEPDILEEEKIYA YPSNITSETGFRTISSLEEIVEKQGDTIEYLKRHNALLSKRLLALTSSDLGCQPSRT >gi568815594f:164940073_165141485|GENSCAN_predicted_CDS_7|354_bp atggggtttctccctgttggtcaggctggtcttgaactttcgacctcaggagccaccgcc agcctagaatctttaaaacgaattcgatctaataatgactttattattttagtgaaaatc cggagatttaataaagctaaaccagagcctgatatacttgaagaagaaaaaatctatgct taccccagcaatattacctcggagactggattcagaactatttcaagcctagaagaaatt gttgaaaagcaaggagacaccattgaatacctgaagcgacacaatgcgctgctgagtaag cgattgttggctctcacttcctcagacctgggctgtcagccaagtagaacgtga >gi568815594f:164940073_165141485|GENSCAN_predicted_peptide_8|74_aa MTKKGVTVYYRVTVLYRVTVLIVQVIDPIYQNEISLLLRSGGRQLRPREKLSSVPVGWHC WRTQYTIRSRWPEC >gi568815594f:164940073_165141485|GENSCAN_predicted_CDS_8|225_bp atgactaagaagggagttacagtgtactacagagttacagtgttgtacagagttacagtg ttgattgtccaggtgattgaccccatctatcaaaatgaaatcagtctactactccgcagt ggaggaaggcagctaaggccccgcgagaaattgagctcagtgccggtgggctggcactgc tggaggacccagtacaccatccgcagccgctggcccgaatgctaa >gi568815594f:164940073_165141485|GENSCAN_predicted_peptide_9|35_aa MRTVYRYTAKASCCTQRSGIREEKEEEDDDDDDDS >gi568815594f:164940073_165141485|GENSCAN_predicted_CDS_9|108_bp atgaggactgtatacaggtatacagcaaaagcttcatgttgcactcaaagatctggcatt agggaggagaaggaggaggaggacgacgacgatgatgatgatagttga