GENSCAN 1.0	Date run: 3-Nov-116	Time: 14:53:50

Sequence gi568815596f:46246871_46484657 : 237787 bp : 44.35% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.05 PlyA -    672    667    6                                 1.05
 1.04 Term -   5410   5291  120  1  0   64   48    75 0.863  -0.43
 1.03 Intr -   6391   6309   83  2  2  143   99    49 0.982  10.86
 1.02 Intr -   6641   6512  130  2  1   44   72    67 0.874   0.97
 1.01 Init -  12208  12173   36  1  0   84   81    19 0.493   0.93
 1.00 Prom -  12429  12390   40                                -2.96

 2.00 Prom +  13467  13506   40                                -5.56
 2.01 Init +  14658  14691   34  2  1   20  100    31 0.861  -2.47
 2.02 Term +  15388  15557  170  2  2   71   42   157 0.829   7.44
 2.03 PlyA +  17301  17306    6                                 1.05

 3.00 Prom +  27917  27956   40                                -2.46
 3.01 Init +  49716  49855  140  2  2   78  119    66 0.853   8.31
 3.02 Intr +  52061  52191  131  2  2   62   64    71 0.761   2.44
 3.03 Term +  52861  53084  224  2  2   42   49   227 0.978  11.28
 3.04 PlyA +  53598  53603    6                                 1.05

 4.00 Prom +  60553  60592   40                                -4.96
 4.01 Init +  64918  64984   67  0  1   81   93    90 0.937  10.03
 4.02 Term +  75373  75440   68  2  2   88   46    58 0.183  -0.30
 4.03 PlyA +  76448  76453    6                                 1.05

 5.00 Prom +  77879  77918   40                                -5.66
 5.01 Init +  81417  81423    7  2  1  104  110     5 0.635   5.12
 5.02 Term +  82436  82566  131  0  2   74   53   114 0.849   4.84
 5.03 PlyA +  84465  84470    6                                 1.05

 6.00 Prom +  95139  95178   40                                -3.36
 6.01 Init +  98598  98663   66  2  0   60   99   -18 0.339  -2.36
 6.02 Intr +  99778  99893  116  0  2   99   66    20 0.493   0.15
 6.03 Intr + 100114 100193   80  1  2   79  110   108 0.002  11.29
 6.04 Intr + 106743 106886  144  1  0   49   69    75 0.666   1.95
 6.05 Intr + 109281 109432  152  1  2   86   61   228 0.999  19.78
 6.06 Intr + 109854 109938   85  2  1   80   23    85 0.614   0.59
 6.07 Intr + 113768 113886  119  0  2   71   97   131 0.643  12.48
 6.08 Intr + 114015 114220  206  2  2   90   91   280 0.990  26.50
 6.09 Intr + 122957 123063  107  2  2   39   74   125 0.613   6.06
 6.10 Intr + 128820 128967  148  1  1  125   99   262 0.999  30.29
 6.11 Intr + 129669 129883  215  0  2   48   72   304 0.999  23.16
 6.12 Intr + 130581 130649   69  1  0   50   50   106 0.815   2.15
 6.13 Intr + 131024 131217  194  0  2  107  101   212 0.990  23.61
 6.14 Intr + 131787 131924  138  2  0  103   61   235 0.549  22.96
 6.15 Intr + 133357 133847  491  0  2  106  115   369 0.999  33.20
 6.16 Intr + 134726 134852  127  2  1   80   84   226 0.999  22.08
 6.17 Intr + 135105 135219  115  2  1   95   89    54 0.756   6.22
 6.18 Intr + 135555 135728  174  1  0   80   94   131 0.758  12.81
 6.19 Term + 137639 137790  152  0  2   76   52   186 0.999  11.87
 6.20 PlyA + 145332 145337    6                                 1.05

 7.00 Prom + 152259 152298   40                                -7.26
 7.01 Init + 156176 156247   72  1  0   88   47    80 0.594   4.97
 7.02 Intr + 163629 163802  174  2  0   68   71    77 0.800   4.14
 7.03 Term + 163858 163929   72  0  0  101   42    87 0.981   3.31
 7.04 PlyA + 165862 165867    6                                 1.05

 8.00 Prom + 175349 175388   40                                -3.06
 8.01 Init + 186231 186287   57  2  0   64   64    61 0.399   2.61
 8.02 Intr + 213454 213574  121  0  1  113   73    50 0.217   6.07
 8.03 Term + 213760 213896  137  2  2    8   47   108 0.106  -3.12
 8.04 PlyA + 216099 216104    6                                 1.05

 9.02 PlyA - 217975 217970    6                                 1.05
 9.01 Sngl - 228794 227850  945  2  0   49   44   286 0.911  16.86
 9.00 Prom - 228887 228848   40                                -4.96

10.00 Prom + 232250 232289   40                                -6.66
10.01 Init + 232785 232832   48  2  0   93   95    10 0.647   3.15
10.02 Intr + 233535 233894  360  2  0   78   68   495 0.987  41.92
10.03 Intr + 237374 237562  189  1  0  108  -10   203 0.544  12.28

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  51042  51067   26  2  2   80  121    56 0.922   7.10
S.002 Intr + 100003 100193  191  1  2   78  110   197 0.996  20.13

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596f:46246871_46484657|GENSCAN_predicted_peptide_1|122_aa
MVEGKGEQACYMGKQQLLFLLGEISTSEGVAQKGIIADEDILGSSQSSVGTDGLHSIAPI
TLTESSGKCEKSLNKELPWGNGQLDAERHMDAHDSLTHGMPAYFSMAPNRQFSMLSFPVN
TS

>gi568815596f:46246871_46484657|GENSCAN_predicted_CDS_1|369_bp
atggtggaaggcaaaggggagcaggcgtgttacatgggcaaacagcagctgcttttcctc
ctgggggaaatcagtacttcagagggagtggctcagaagggcataattgctgatgaggac
attttgggtagctcccagagctctgtggggacagacggccttcacagcatagcacccatc
acactgactgagtcatccggaaaatgtgagaaatcacttaataaagagctgccctgggga
aatggacagctggatgcagaaagacacatggacgcacatgattcattaacgcacggcatg
cctgcctacttctctatggcaccaaatcggcagttctcaatgctgagcttcccagttaat
acttcttga

>gi568815596f:46246871_46484657|GENSCAN_predicted_peptide_2|67_aa
MKDKEGKRTLKGSYTRGMGAEFGQKSFDGGEFYSKLYTGKEKVLDQKPSETYRAGWVANV
ANQTSMT

>gi568815596f:46246871_46484657|GENSCAN_predicted_CDS_2|204_bp
atgaaagacaaagagggcaagagaactcttaaaggttcttacacccgtggtatgggagca
gaatttggacagaaaagctttgatggtggagaattttacagcaagctctatacagggaag
gagaaggttctggaccaaaagccctcagaaacctatagagcaggttgggttgccaatgtg
gccaaccaaaccagcatgacctaa

>gi568815596f:46246871_46484657|GENSCAN_predicted_peptide_3|164_aa
MANSCHSKITLGNQTDFSNSELAPASGGSKGLLCRIPAKTKPPGTSRGLELPAPRLAAAW
RGYAAGSWNPTRGRLNRLSRGKPAAERRYRVLYRIYRETWEQGKNDFDTPSYSTWFVSGL
LEREKKREWEREERCVIRLPIIGESGRFKNHCGDCLRNALEAAV

>gi568815596f:46246871_46484657|GENSCAN_predicted_CDS_3|495_bp
atggccaactcttgtcactccaagatcacactggggaaccagactgacttctccaattct
gaactcgccccggcctcgggcggctcaaagggcctcctctgccgcatccccgccaaaacc
aaaccgcctggcacaagccgtggcctggagcttccagccccgcgcttggccgcggcttgg
cgaggctatgctgcgggaagctggaatccaacgcgcggccggctgaaccgcctgagccgc
gggaaaccagcggcggagcggcggtatagagttttgtaccgtatataccgagaaacttgg
gagcaggggaaaaatgatttcgacacacccagctacagtacctggttcgttagcggactg
ttggaaagagagaagaagcgggaatgggagcgggaagaacgttgcgtaattagactccca
attattggcgagagcggccgctttaagaaccactgtggggactgcctgcgaaacgccttg
gaggccgctgtgtga

>gi568815596f:46246871_46484657|GENSCAN_predicted_peptide_4|44_aa
MGTPEGGVVTVAASQPKALGKPGSYAGSSYFYKKKKLECIEHLL

>gi568815596f:46246871_46484657|GENSCAN_predicted_CDS_4|135_bp
atgggcacacctgagggtggtgtggtcactgtggccgccagccagcccaaggccttggga
aagcctggttcttatgctggcagctcatatttttacaagaaaaagaagttagaatgtatt
gagcacctgctataa

>gi568815596f:46246871_46484657|GENSCAN_predicted_peptide_5|45_aa
MAGLSTYADSPCKTGGERVGYPIYGKGRPMEKQEDGQGKLAAKRQ

>gi568815596f:46246871_46484657|GENSCAN_predicted_CDS_5|138_bp
atggcaggactcagcacctatgccgacagtccttgcaagacaggaggggaacgtgtaggc
tatcctatttatggtaaggggaggcctatggaaaaacaggaagatggtcaagggaagctg
gcagcaaaaaggcaatga

>gi568815596f:46246871_46484657|GENSCAN_predicted_peptide_6|965_aa
MILGPTFPVQSPTSPFTCWRPQGMQEEDFGCEDTNTHKSRANNTKQVVCGSDNWCEPRSV
YVSSHLDKASIMRLAISFLRTHKLLSSAPVQMKILAGKANIHSGIAHTWDIASSLQRADE
LAENRCISRNPKQGSVCSENESEAEADQQMDNLYLKALEGFIAVVTQDGDMIFLSENISK
FMGLTQVELTGHSIFDFTHPCDHEEIRENLSLKNGSGFGKKSKDMSTERDFFMRMKCTVT
NRGRTVNLKSATWKVLHCTGQVKVYNNCPPHNSLCGYKEPLLSCLIIMCEPIQHPSHMDI
PLDSKTFLSRHSMDMKFTYCDDRITELIGYHPEELLGRSAYEFYHALDSENMTKSHQNLC
TKGQVVSGQYRMLAKHGGYVWLETQGTVIYNPRNLQPQCIMCVNYVLSEIEKNDVVFSMD
QTESLFKPHLMAMNSIFDSSGKGAVSEKSNFLFTKLKEEPEELAQLAPTPGDAIISLDFD
GFYSGHVRIAAKENLKRSGPLARNQNFEESSAYGKAILPPSQPWATELRSHSTQSEAGSL
PAFTVPQAAAPGSTTPSATSSSSSCSTPNSPEDYYTSLDNDLKIEVIEKLFAMDTEAKDQ
CSTQVDGCGDQARTDFNELDLETLAPYIPMDGEDFQLSPICPEERLLAENPQSTPQHCFS
AMTNIFQPLAPVAPHSPFLLDKFQQQLESKKTEPEHRPMSSIFFDAGSKASLPPCCGQAS
TPLSSMGGRSNTQWPPDPPLHFGPTKWAVGDQRTEFLGAAPLGPPVSPPHVSTFKTRSAK
GFGARGPDVLSPAMVALSNKLKLKRQLEYEEQAFQDLSGGDPPGGSTSHLMWKRMKNLRG
GSCPLMPDKPLSANVPNDKFTQNPMRGLGHPLRHLPLPQPPSAISPGENSKSRFPPQCYA
TQYQDYSLSSAHKVSGMASRLLGPSFESYLLPELTRYDCEVNVPVLGSSTLLQGGDLLRA
LDQAT

>gi568815596f:46246871_46484657|GENSCAN_predicted_CDS_6|2898_bp
atgattctggggcccaccttcccagtgcaaagcccgacttccccatttacctgctggagg
ccacagggaatgcaagaggaggactttggttgtgaagacaccaacacacacaaaagcaga
gctaacaatacaaagcaggttgtgtgtggctcagacaactggtgtgagcccagatcagtc
tatgtgagctcccatctggacaaggcctccatcatgcgactggcaatcagcttcctgcga
acacacaagctcctctcctcagcccctgttcaaatgaagattttagctgggaaagctaac
attcacagtggcatcgcccacacctgggacattgcatcatcgcttcagcgggctgatgag
ctggcagagaacaggtgtatcagcagaaaccccaagcagggatcagtttgctctgaaaac
gagtccgaagccgaagctgaccagcagatggacaacttgtacctgaaagccttggagggt
ttcattgccgtggtgacccaagatggcgacatgatctttctgtcagaaaacatcagcaag
ttcatgggacttacacaggtggagctaacaggacatagtatctttgacttcactcatccc
tgcgaccatgaggagattcgtgagaacctgagtctcaaaaatggctctggttttgggaaa
aaaagcaaagacatgtccacagagcgggacttcttcatgaggatgaagtgcacggtcacc
aacagaggccgtactgtcaacctcaagtcagccacctggaaggtcttgcactgcacgggc
caggtgaaagtctacaacaactgccctcctcacaatagtctgtgtggctacaaggagccc
ctgctgtcctgcctcatcatcatgtgtgaaccaatccagcacccatcccacatggacatc
cccctggatagcaagaccttcctgagccgccacagcatggacatgaagttcacctactgt
gatgacagaatcacagaactgattggttaccaccctgaggagctgcttggccgctcagcc
tatgaattctaccatgcgctagactccgagaacatgaccaagagtcaccagaacttgtgc
accaagggtcaggtagtaagtggccagtaccggatgctcgcaaagcatgggggctacgtg
tggctggagacccaggggacggtcatctacaaccctcgcaacctgcagccccagtgcatc
atgtgtgtcaactacgtcctgagtgagattgagaagaatgacgtggtgttctccatggac
cagactgaatccctgttcaagccccacctgatggccatgaacagcatctttgatagcagt
ggcaagggggctgtgtctgagaagagtaacttcctattcaccaagctaaaggaggagccc
gaggagctggcccagctggctcccaccccaggagacgccatcatctctctggatttcgat
gggttctacagcggccacgtgaggattgcagctaaagagaatctgaaacgcagcgggccc
ttggccaggaatcagaacttcgaggagtcctcagcctatggcaaggccatcctgcccccg
agccagccatgggccacggagttgaggagccacagcacccagagcgaggctgggagcctg
cctgccttcaccgtgccccaggcagctgccccgggcagcaccacccccagtgccaccagc
agcagcagcagctgctccacgcccaatagccctgaagactattacacatctttggataac
gacctgaagattgaagtgattgagaagctcttcgccatggacacagaggccaaggaccaa
tgcagtacccaggtagatggctgtggagatcaggctaggacggatttcaatgagctggac
ttggagacactggcaccctatatccccatggacggggaagacttccagctaagccccatc
tgccccgaggagcggctcttggcggagaacccacagtccaccccccagcactgcttcagt
gccatgacaaacatcttccagccactggcccctgtagccccgcacagtcccttcctcctg
gacaagtttcagcagcagctggagagcaagaagacagagcccgagcaccggcccatgtcc
tccatcttctttgatgccggaagcaaagcatccctgccaccgtgctgtggccaggccagc
acccctctctcttccatggggggcagatccaatacccagtggcccccagatccaccatta
cattttgggcccacaaagtgggccgtcggggatcagcgcacagagttcttgggagcagcg
ccgttggggccccctgtctctccaccccatgtctccaccttcaagacaaggtctgcaaag
ggttttggggctcgaggcccagacgtgctgagtccggccatggtagccctctccaacaag
ctgaagctgaagcgacagctggagtatgaagagcaagccttccaggacctgagcgggggg
gacccacctggtggcagcacctcacatttgatgtggaaacggatgaagaacctcaggggt
gggagctgccctttgatgccggacaagccactgagcgcaaatgtacccaatgataagttc
acccaaaaccccatgaggggcctgggccatcccctgagacatctgccgctgccacagcct
ccatctgccatcagtcccggggagaacagcaagagcaggttccccccacagtgctacgcc
acccagtaccaggactacagcctgtcgtcagcccacaaggtgtcaggcatggcaagccgg
ctgctcgggccctcatttgagtcctacctgctgcccgaactgaccagatatgactgtgag
gtgaacgtgcccgtgctgggaagctccacgctcctgcaaggaggggacctcctcagagcc
ctggaccaggccacctga

>gi568815596f:46246871_46484657|GENSCAN_predicted_peptide_7|105_aa
MKQLELSDITGGGENWHNHWGKLNLSMCWPIAYAVPRTGAQDQVEPSWSAGKTNAENVSL
PNIPEYLFRASPDPGPEDTDVKKHKGSCVDVATLAPLDEVGQRDG

>gi568815596f:46246871_46484657|GENSCAN_predicted_CDS_7|318_bp
atgaagcaactggagctctcagatattactggtggaggtgaaaattggcacaaccactgg
gggaagctgaacctgagcatgtgttggccaatagcctatgctgtacccagaacgggtgcc
caggatcaagtggaaccttcctggtcagctgggaagacaaatgctgagaacgtttcttta
ccaaacattcctgagtacctcttcagggccagccctgatccaggccctgaggatacagat
gtgaagaaacacaagggcagctgtgttgatgtagccacgctggccccattagatgaagtt
ggacagagggatggatga

>gi568815596f:46246871_46484657|GENSCAN_predicted_peptide_8|104_aa
MSGIKSCGPESGNSAICCLCQRQKSQGTEEEEVAAAGGGNNRPSLRTSLTHGPKTLRKSS
QTEATAIVVYFCYHVDSGESNAICQMVQENTVTRPSKQGAFVHG

>gi568815596f:46246871_46484657|GENSCAN_predicted_CDS_8|315_bp
atgtctggcatcaagtcatgtggacccgagagcggaaacagtgctatctgctgcttgtgc
cagaggcagaagagccagggcactgaagaggaggaggtggcagcagcagggggagggaac
aacaggccatcattgaggacgtcactcactcacggtccaaagaccctgaggaaaagcagt
cagactgaggcaactgccatcgttgtatatttctgttatcacgttgactcgggggagtcc
aacgccatttgtcagatggtgcaggaaaacacagtaaccaggccttcaaaacagggcgcc
tttgtccacggctga

>gi568815596f:46246871_46484657|GENSCAN_predicted_peptide_9|314_aa
MGDFNIPLSTLDRLMRQKVNKDIQELNSALHQADLIDIYRTLHPKSTEYTLFSALHRTYS
KTDHIVGSQALLSKCKRTEIITNCLSDHNAIKLELRIKKLTQNRSATWKLNNLLLNDYWV
HNEMKAEIKMFFETNENKDTTHQNLWHIFKAVCRGKFIALNAHKRKHERSKIDTLTSELK
ELEKQEQTHSKASRRQEINEIRAELKEKETQKTLQKVNESRSWFFEKINKIDRVLVRLIK
KKREKTQIDTIKNDKGATTTDPTEIQTIIREYYKHLYGNKLENLEEMDKFLNTYILPRLN
QEEVESLNRTINRL

>gi568815596f:46246871_46484657|GENSCAN_predicted_CDS_9|945_bp
atgggagactttaacatcccactgtcaactttagacagattaatgagacagaaagttaac
aaggatatccaggaattgaactcagctctgcaccaagcagacttaatagacatctacaga
actctccaccccaaatcaacagaatacactttgttctcagcactacatcgtacctattcc
aaaactgaccacatagttggaagtcaagcactcctcagcaaatgtaaaagaacagaaatt
ataacaaactgtctctcagaccacaatgcaatcaaactagaactcaggattaagaaactc
actcaaaaccgctcagctacatggaaactgaacaacttgctcctgaatgactactgggta
cataatgaaatgaaggcagaaataaagatgttctttgaaaccaatgagaacaaagacaca
acacaccagaatctctggcacatattcaaagcagtgtgtagagggaaatttatagcacta
aatgcccacaagagaaagcacgaaagatctaaaattgacaccctaacatcagaattaaaa
gaactagagaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataaatgag
atcagagcagaactgaaggagaaagagacacaaaaaacccttcaaaaagtcaatgaatcc
aggagctggttttttgaaaagatcaacaaaattgatagagtgctagtaagactaataaag
aagaaaagagagaagactcaaatagacacaataaaaaatgataaaggggctaccaccacc
gatcccacagaaatacaaactatcatcagagagtactataaacacctctatggaaataaa
ctagaaaatctagaagaaatggataaattcctgaacacatatatcctcccaagactaaac
caggaagaagttgaatccctgaatagaacaattaacaggctctga

>gi568815596f:46246871_46484657|GENSCAN_predicted_peptide_10|199_aa
MVPGDSKSEGKPRAYLEAESQKPDSSYDYLEEMEACEDGGCQGPLKSLSPKSCRATKGQA
GDGPKPAELPPTPGTERNPEMELEKVRMEFELTRLKYLHEKNQRQRQHEVVMEQLQRERQ
HEVVMEQLQQEAAPRLFSGGLQNFLLPQNQFAMFLYCFIFIHIIYVTKEMVFFLFAKHYL
FCIAAILLCLIKTFWSYFQ

>gi568815596f:46246871_46484657|GENSCAN_predicted_CDS_10|597_bp
atggtgcctggtgactccaagtctgaagggaagccaagggcttatctggaggcagagtcc
cagaagccagactcctcctatgactacttggaagagatggaagcttgtgaggacggaggc
tgccaagggccgcttaaatcgctgtcccccaagtcctgccgtgctaccaaaggccaggct
ggcgacggacccaaacccgcagagctgcccccgacccctgggactgagcgcaatcccgag
atggagctggagaaggtgcgcatggagttcgagctcacgcggctcaagtacctgcatgag
aagaaccagcggcagcggcagcacgaggtggtgatggagcagctgcagcgggagcggcag
cacgaggtggtgatggagcagctgcagcaagaggcggcgccccgcctgttttcaggaggc
ctccagaacttcctgctgccccagaaccagtttgccatgttcctgtactgcttcatcttc
attcacatcatctatgtcaccaaggagatggtcttctttctcttcgccaagcactaccta
ttctgcattgcagccattttgctctgtttgattaaaactttctggtcatacttccaa