GENSCAN 1.0    Date run: 6-Nov-116    Time: 20:11:28

Sequence gi568815592f:15146540_15620248 : 473709 bp : 43.79% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   1025   1073   49  1  1   89   89    40 0.450   3.31
 1.02 Intr +   2504   2576   73  0  1  103  100   -18 0.289  -0.64
 1.03 Intr +  13111  13207   97  1  1   78  105    89 0.833   9.61
 1.04 Intr +  87803  87940  138  1  0  123   29    52 0.154   3.46
 1.05 Term +  98388  98471   84  2  0   46   32   105 0.029  -1.65
 1.06 PlyA +  99906  99911    6                               1.05

 2.02 PlyA - 101063 101058    6                               1.05
 2.01 Sngl - 101759 101535  225  2  0   49   48   301 0.840  15.34
 2.00 Prom - 107343 107304   40                              -4.36

 3.04 PlyA - 108178 108173    6                               1.05
 3.03 Term - 122598 122525   74  1  2   72   44    94 0.658   1.47
 3.02 Intr - 128728 128531  198  2  0   70   76    65 0.592   2.82
 3.01 Init - 147273 147156  118  0  1   40   95    78 0.530   4.06
 3.00 Prom - 147959 147920   40                              -3.26

 4.03 PlyA - 149023 149018    6                               1.05
 4.02 Term - 167139 166928  212  1  2   83   49   121 0.933   5.16
 4.01 Init - 171194 171047  148  2  1  103   82    54 0.877   6.55
 4.00 Prom - 171723 171684   40                              -7.86

 5.00 Prom + 180931 180970   40                              -2.36
 5.01 Init + 190943 190951    9  1  0  111   90     0 0.590   3.00
 5.02 Intr + 192425 192610  186  1  0  -41   69   150 0.006   0.09
 5.03 Intr + 207522 207590   69  2  0   44   99    52 0.351   1.28
 5.04 Intr + 209168 209215   48  1  0   82   96    15 0.348   0.58
 5.05 Intr + 209569 209631   63  0  0  108   87    10 0.460   1.81
 5.06 Intr + 227578 227713  136  0  1  122  101    87 0.587  13.54
 5.07 Intr + 263685 263826  142  1  1   96   96   139 0.864  14.91
 5.08 Intr + 305467 305681  215  1  2  126   77    26 0.010   3.66
 5.09 Intr + 322003 322179  177  2  0   96   78   235 0.947  23.19
 5.10 Intr + 340768 341003  236  2  2  101  103   126 0.960  12.81
 5.11 Intr + 349593 350631 1039  2  1  118   82  1392 0.982 131.67
 5.12 Intr + 354368 354870  503  0  2   99   97  1015 0.986  96.20
 5.13 Intr + 357961 358053   93  0  0  109   78    33 0.948   4.56
 5.14 Intr + 360597 360715  119  2  2  108   76   155 0.999  15.56
 5.15 Intr + 360807 360877   71  0  2  132  107    40 0.996   9.13
 5.16 Intr + 361801 361915  115  2  1  107   97   -15 0.882   0.81
 5.17 Intr + 364757 364862  106  2  1   61   74   192 0.999  15.32
 5.18 Intr + 365669 365851  183  1  0   79  105   208 0.998  21.58
 5.19 Intr + 366376 366506  131  0  2   73   92    83 0.999   6.69
 5.20 Intr + 366700 366883  184  1  1   66   89   209 0.980  18.59
 5.21 Intr + 370622 370729  108  1  0  134   78   196 0.735  23.68
 5.22 Term + 373530 373712  183  2  0   88   49   116 0.978   5.24
 5.23 PlyA + 375070 375075    6                               1.05

 6.06 PlyA - 375366 375361    6                               1.05
 6.05 Term - 376680 376436  245  1  2  109   32   229 0.966  15.46
 6.04 Intr - 378130 377987  144  2  0   89   63   273 0.577  25.15
 6.03 Intr - 381636 381542   95  2  2   67   92    21 0.298   0.01
 6.02 Intr - 386856 386652  205  1  1  107   -5   316 0.170  22.46
 6.01 Init - 387218 387158   61  2  1   43   61    39 0.321  -1.89
 6.00 Prom - 389098 389059   40                              -4.26

 7.00 Prom + 397292 397331   40                              -5.06
 7.01 Init + 399444 399510   67  2  1   98   36    61 0.737   3.21
 7.02 Term + 405063 405268  206  1  2  117   36    75 0.249   2.73
 7.03 PlyA + 405633 405638    6                               1.05

 8.00 Prom + 407100 407139   40                              -4.46
 8.01 Init + 412507 412795  289  0  1   64  115   380 0.937  35.58
 8.02 Term + 420495 420523   29  1  2  108   54    25 0.617  -0.66
 8.03 PlyA + 421248 421253    6                               1.05

 9.00 Prom + 434093 434132   40                              -2.66
 9.01 Init + 434382 434478   97  2  1   67   81    94 0.910   7.07
 9.02 Intr + 458024 458042   19  0  1  108   99     4 0.177  -0.83
 9.03 Term + 464354 464504  151  2  1   29   48   197 0.858   7.18
 9.04 PlyA + 465706 465711    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term - 203088 202981  108  0  0   81   41    86 0.921   1.71
S.002 Init - 204081 204034   48  0  0   45   57    71 0.894   0.65
S.003 Init + 317080 317128   49  0  1   88   58    77 0.869   3.81

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592f:15146540_15620248|GENSCAN_predicted_peptide_1|146_aa
MGFHRVAQADLELLTSGFVPLGKISSYGKTSLAGERQEVRRQPFLAKAMDWNTYKWFFHV
DLGFLKHGVCVPKDKRIPGYLFWVKGCFGDTNSINWAPGLTFKNINSNTQSAKSITVTNH
ALYTDARRCPEVTPAARHKQQAPPIN

>gi568815592f:15146540_15620248|GENSCAN_predicted_CDS_1|441_bp
atggggtttcaccgtgttgcccaggctgatctcgaactcctgacctcaggcttcgtccct
ttagggaaaatctccagctatgggaaaacctccctagctggggagaggcaggaagtaaga
aggcaacccttcttagctaaggctatggattggaacacctacaagtggtttttccacgtg
gacctgggctttctcaaacatggtgtctgtgttccaaaggataagagaattccaggttac
ctgttttgggtcaaaggatgctttggagatacaaactctatcaactgggcccctggtctc
accttcaaaaatatcaattcaaacactcagagtgccaaaagcataactgtgaccaatcac
gctctgtacacagacgcgaggcgctgtcctgaggtgacaccggctgcacggcacaagcag
caggcgccgccgatcaactag

>gi568815592f:15146540_15620248|GENSCAN_predicted_peptide_2|74_aa
MPPIGRARAPARPAPAQPAATPAGRPFRSRLEDVGNTGSAPGALTRAVPPAAAASHGLAA
GRPDSETRSAERPQ

>gi568815592f:15146540_15620248|GENSCAN_predicted_CDS_2|225_bp
atgccccctataggccgcgcgcgcgcgcccgcccgcccagccccggcccagcctgccgcg
acccccgcgggccgtccctttcgcagccggctcgaggacgtcgggaacaccggctctgct
cccggtgcattaactcgtgccgtgcctcccgcagccgccgccagccacgggcttgcagcc
ggacgcccggactcggagacacgatccgctgaacgcccgcaataa

>gi568815592f:15146540_15620248|GENSCAN_predicted_peptide_3|129_aa
MTYPLKQNNKRAQIDSETGRHIPTESEKSGTGCEDSSSSGQDKVFQLIPVFATDPKSQLF
HVQEGFSTLEPEPRNILQGEIKQLFLLPTHLLHWSEVVTPSHRDGERGNYLTTGKHFEAT
KDMMVQHIF

>gi568815592f:15146540_15620248|GENSCAN_predicted_CDS_3|390_bp
atgacatacccacttaagcagaacaacaaaagagcacaaattgattctgaaacagggaga
cacattcccacagaaagtgagaaatccggcacaggctgtgaggactcttcatcttctggt
caggacaaagttttccagctcattcctgtctttgctactgaccccaagagccaactgttc
catgttcaagagggtttctccactcttgaacctgagcccagaaacattttgcaaggagag
atcaaacagctcttcctgctgcccacacaccttctccactggtccgaggttgtcacccca
agtcacagagatggagaaaggggcaactacctcaccactggcaagcattttgaggccacc
aaagacatgatggttcagcacatcttttga

>gi568815592f:15146540_15620248|GENSCAN_predicted_peptide_4|119_aa
MAFSEIQNSQQEKLQLKLKKDTSEPAPPRRKRVPSGSGLWTHQLHGKHSDLQVKTLMPQE
TYTISESVRHTANSTVLSRTACCRGTNSLRRCEQQPWLGSVLAADFPEKLALFKMGSHQ

>gi568815592f:15146540_15620248|GENSCAN_predicted_CDS_4|360_bp
atggcattttctgagattcaaaattctcaacaagagaaactccaactaaaactcaaaaaa
gatacttcagaaccagcacctccgagacgcaagagggtaccttctggcagtggactatgg
acccatcagttacacggcaagcattcagatttgcaagtcaaaacactaatgccacaagaa
acatacaccatttctgaatctgtacggcacactgcaaattccacagtgctctcaaggaca
gcatgctgccgcggcaccaacagcctgcgtagatgtgaacagcaaccctggctcggctca
gtgttggcagcagacttcccagaaaagctggcactcttcaaaatgggaagtcaccagtga

>gi568815592f:15146540_15620248|GENSCAN_predicted_peptide_5|1371_aa
MGETLTNTEKQAALQAAERFGDELYVTYSIREGGKHYPTGRAVLMDDLKWDPSDEVEDGK
RRHFQVYLSKRDFAAGIDQGSSDEKSIQAETILVPRSPEEVGPQLCFQGSSTLPHEIFFT
AEYPIDDSDGIPWSEERVVRKVLYLSLKEFKNSQKRQHAEGIAGSLKTVNGLLGNDQSKG
LGPASEQSENEKDDASQVSSTSNDVSSSDFEEGPSRKRPRLQAQRKFAQSQPNSPSTTPV
KIVEPLLPPPATQISDLSKRKPKTEDFLTFLCLRGKTLQPSAEVYVESTCSPALPNSMVY
FGSSQDEEEVEEEDDETEDVKTATNNASSSCQSTPRKGKTHKHVHNGHVFNGSSRSTREK
EPVQKHKSKEATPAKEKHSDHRADSRREQASANHPAAAPSTGSSAKGLAATHHHPPLHRS
AQDLRKQVSKVNGVTRMSSLGAGVTSAKKMREVRPSPSKTVKYTATVTKGAVTYTKAKRE
LVKDTKPNHHKPSSAVNHTISGKTESSNAKTRKQVLSLGGASKSTGPAVNGLKVSGRLNP
KSCTKEVGGRQLREGLQLREGLRNSKRRLEEAHQAEKPQSPPKKMKGAAGPAEGPGKKAP
AERGLLNGHVKKEVPERSLERNRPKRATAGKSTPGRQAHGKADSASCENRSTSQPESVHK
PQDSGKAEKGGGKAGWAAMDEIPVLRPSAKEFHDPLIYIESVRAQVEKFGMCRVIPPPDW
RPECKLNDEMRFVTQIQHIHKLGRRWGPNVQRLACIKKHLKSQGITMDELPLIGGCELDL
ACFFRLINEMGGMQQVTDLKKWNKLADMLRIPRTAQDRLAKLQEAYCQYLLSYDSLSPEE
HRRLEKEVLMEKEILEKRKGPLEGHTENDHHKFHPLPRFEPKNGLIHGVAPRNGFRSKLK
EVGQAQLKTGRRRLFAQEKEVVKEEEEDKGVLNDFHKCIYKGRSVSLTTFYRTARNIMSM
CFSKEPAPAEIEQEYWRLVEEKDCHVAVHCGKVDTNTHGSGFPVGKSEPFSRHGWNLTVL
PNNTGSILRHLGAVPGVTIPWLNIGMVFSTSCWSRDQNHLPYIDYLHTGADCIWYCIPAE
EENKLEDVVHTLLQANGTPGLQMLESNVMISPEVLCKEGIKVHRTVQQSGQFVVCFPGSF
VSKVCCGYSVSETVHFATTQWTSMGFETAKEMKRRHIAKPFSMEKLLYQIAQAEAKKENG
PTLSTISALLDELRDTELRQRRQLFEAGLHSSARYGSHDGSSTVADGKKKPRKWLQLETS
ERRCQICQHLCYLSMVVQENENVVFCLECALRHVEKQKSCRGLKLMYRYDEEQIISLVNQ
ICGKVSGKNGSIENCLSKPTPKRGPRKRATVDVPPSRLSASSSSKSASSSS

>gi568815592f:15146540_15620248|GENSCAN_predicted_CDS_5|4116_bp
atgggggagaccttgacgaacactgagaagcaggctgctctgcaagcagcagagagattt
ggggatgagctttacgtcacatacagcatcagggaagggggcaaacattatccaactgga
agagcagtactaatggatgaccttaaatgggatcccagtgacgaggtggaagacgggaag
cggagacactttcaggtgtatctaagtaaaagggattttgcagctgggattgatcaagga
tcatcagacgagaagagtatccaggctgaaacgattctcgtgcctcggtctccagaggag
gtgggaccacagctatgttttcaaggttcatccacgctgccccatgaaatcttttttacg
gctgaatatcctattgatgacagtgatgggattccgtggtcagaagaacgggtggtacgt
aaagtcctttatttgtctctgaaggagttcaagaattcccagaagaggcagcatgcggaa
ggcattgctgggagcctgaaaactgtgaatgggctccttggtaatgaccagtctaaggga
ttaggaccagcatcagaacagtcagagaatgaaaaggacgatgcatcccaagtgtcctcc
actagcaacgatgttagttcttcagattttgaagaagggccgtcgaggaaaaggcccagg
ctgcaagcacaaaggaagtttgctcagtctcagccgaatagtcccagcacaactccagta
aagatagtggagccattgctaccccctccagctactcagatatcagacctctctaaaagg
aagcctaagacagaagattttcttacctttctctgccttcgaggtaagactttgcaacca
tcggcggaggtctacgtggaatctacatgttctcctgcgctgcccaacagcatggtgtat
tttggaagctctcaggatgaggaggaagtcgaggaggaagatgatgagacagaagacgtc
aaaacagccaccaacaatgcttcatcttcatgccagtcgacccccaggaaaggaaaaacc
cacaaacatgttcacaacgggcatgttttcaatggttccagcaggtcaacacgggagaag
gaacctgttcaaaaacacaaaagcaaagaggccactcccgcaaaggagaagcacagcgat
caccgggctgacagccgccgggagcaggcttcagctaaccaccccgcagcggccccctcc
acgggttcctcggccaaggggcttgctgccacccatcaccacccccctctgcatcggtcg
gctcaggacttacggaaacaggtttctaaggtaaacggagtcactcgaatgtcatctctg
ggtgcaggtgtaaccagtgccaaaaagatgcgcgaggtcagaccttcaccatccaaaact
gtgaagtacactgccacggtgacgaagggggctgtcacatacaccaaagccaagagagaa
ctggtcaaggacaccaaacccaatcaccacaagcccagttccgctgtcaaccacacaatc
tcagggaaaactgaaagtagcaatgcaaaaacccgcaaacaggtgctatccctcgggggg
gcgtccaagtccactgggcccgccgtcaatggcctcaaggtcagtggcaggttgaaccca
aagtcatgcactaaggaggtgggggggcggcagctgcgggagggcctgcagctgcgggag
gggctgcggaactccaagaggagactggaagaggcacaccaggcggagaagccgcagtcg
ccccccaagaagatgaaaggggcggctggccccgccgaaggccctggcaagaaggccccg
gccgagagaggtctgctgaacggacacgtgaagaaggaagtgccggagcgcagtctggag
aggaatcggccgaagcgggccacggccgggaagagcacgccaggcagacaagcacatggc
aaggcggacagcgcctcctgtgaaaatcgttctacctcgcaaccggagtccgtgcacaag
ccgcaggactcgggcaaggccgagaagggcggcggcaaggccgggtgggcggccatggac
gagatccccgtcctcaggccctccgccaaggagttccacgatccgctcatctacatcgag
tcggtccgcgctcaggtggagaagttcgggatgtgcagggtgatcccccctccggactgg
cggcccgagtgcaagctcaacgatgagatgcggtttgtcacgcagattcagcacatccac
aagctgggccggcgctggggccccaacgtgcagcggctggcctgcatcaagaagcacctc
aaatctcagggcatcaccatggacgagctcccgctcatagggggctgtgagctcgacctg
gcctgctttttccggctgattaatgagatgggcggcatgcagcaagtgactgacctcaaa
aaatggaacaaactagcagacatgctgcgcatccccagaactgcccaggaccggctggcc
aagctgcaggaggcctactgccagtacctactctcctacgactccctgtccccagaggag
caccggcggctggagaaggaggtgctgatggagaaggagatcctggagaagcgcaagggg
ccgctggaaggccacacagagaacgaccaccacaagttccaccctctgccccgcttcgag
cccaagaatgggctcatccacggcgtggcccccaggaacggcttccgcagcaagctcaag
gaggtgggccaggcccagttgaagactggccggcggcgactcttcgctcaggaaaaagaa
gtggtcaaggaagaggaggaggacaaaggcgtcctcaatgacttccacaagtgcatctat
aagggaaggtctgtttctctaacaactttttatcgaacagcgaggaatatcatgagcatg
tgtttcagcaaggagcctgccccagccgaaatcgagcaagagtactggaggctagtggaa
gagaaggactgccacgtggcagtgcactgcggcaaggtggacaccaacactcacggcagt
ggattcccagtaggaaaatcagaacccttttcgaggcatggatggaacctcaccgtcctc
cccaataacacagggtccatcctgcgtcacctcggtgctgtgcctggagtgactattccc
tggctaaatattggcatggtcttttctacctcatgctggtctcgagaccaaaatcacctt
ccatacattgactacttacacactggtgctgactgcatttggtattgcattcctgctgag
gaggagaacaagctggaagatgtggtccacaccctgctgcaagccaatggcaccccaggg
ctgcagatgctggaaagcaacgtcatgatctccccggaggtgctgtgcaaagaggggatc
aaggtgcacaggaccgtgcagcagagtggccagtttgtcgtctgcttcccgggatccttt
gtgtccaaagtgtgctgtgggtacagcgtgtctgaaaccgtgcactttgctaccacccag
tggacaagtatgggctttgagaccgccaaggaaatgaagcgtcgccatatagctaagcca
ttctccatggagaagttactctaccagattgcacaagcagaagcaaaaaaagaaaacggt
cccactctcagtaccatctcagccctcctggatgagctcagggatacagagctgcggcag
cgcaggcagctgttcgaggctggcctccactcctccgcacgctatggcagccacgatggc
agcagcacggtggcggacgggaagaaaaagcctcgaaagtggctgcagttggagacgtca
gagaggaggtgtcagatctgccagcacctgtgctacctgtccatggtggtacaagagaac
gaaaacgtcgtgttctgtctggagtgtgctctgcgccacgtggagaaacagaagtcctgc
cgagggctgaagttgatgtaccgctacgatgaggaacagattatcagtctggtcaatcag
atctgcggcaaagtgtctggtaaaaacggcagcattgagaactgtctcagtaaacccaca
ccaaaaagaggtccccgcaagagagcgacagtggacgtgcccccctcccgtctgtcagcc
tccagttcatccaaaagtgcttcgagctcatcatga

>gi568815592f:15146540_15620248|GENSCAN_predicted_peptide_6|249_aa
MEQQTPLELVEGRPEKLVLEAELDAEHAQKVLEMEHTQQMKLKERQKFFEEAFQQDMEQY
LSTGYLQIAERRGEWGLGLLVWGLTGCAGCHDSSWGLVRWYKCYSFKFVDEPCLQFERNC
KPIGSMSSMEVNVDMLEQMDLMDISDQEALDVFLNSGGEENTVLSPALGPESSTCQNEIT
LQVPNPSELRAKPPSSSSTCTDSATRDISEGGESPVVQSDEEEVQVDTALATSHTDREAT
PDGGEDSDS

>gi568815592f:15146540_15620248|GENSCAN_predicted_CDS_6|750_bp
atggagcagcagactccactggagttagtagaaggacgacctgagaagctggtcttggaa
gctgaactagatgcagagcacgcccagaaggtcctggaaatggagcacacccagcaaatg
aagctgaaggagcggcagaagttttttgaggaagccttccagcaggacatggagcagtac
ctgtccactggctacctgcagattgcagagcggcgaggcgagtgggggctggggctgctg
gtgtggggactcaccggctgtgcgggttgccatgactcttcttggggtcttgtccgttgg
tataagtgttacagtttcaagtttgttgacgaaccttgtctacagtttgagcggaattgc
aagcccataggcagcatgtcatccatggaagtgaacgtggacatgctggagcagatggac
ctgatggacatatcggaccaggaggccctggacgtcttcctgaactctggaggagaagag
aacactgtgctgtcccccgccttagggcctgaatccagtacctgtcagaatgagattacc
ctccaggttccaaatccctcagaattaagagccaagccaccttcttcttcctccacctgc
accgactcggccacccgggacatcagtgagggtggggagtcccccgttgttcagtccgat
gaggaggaagttcaggtggacactgccctggccacatcacacactgacagagaggccact
ccggatggtggtgaggacagcgactcttaa

>gi568815592f:15146540_15620248|GENSCAN_predicted_peptide_7|90_aa
MHPGGLRFPIARRVELAEYGSEGQGDPGSLSSTLPHSWYLSIATTHLPPIVPLDATFLES
KHCVFQHGSFSACHGVGARQQSDEGTGSNH

>gi568815592f:15146540_15620248|GENSCAN_predicted_CDS_7|273_bp
atgcaccctggaggcctcagatttcccatcgccaggagggtggagctggctgaatatgga
tcggaaggccagggagacccaggctctctttcctccacgctcccacatagctggtacctc
tccattgctaccacacacttgccccctatagttccactggatgccacgttcctcgagagc
aagcactgtgtctttcaacatggaagctttagtgcctgtcatggagttggcgctcggcaa
cagtctgatgaaggtactggtagcaatcactaa

>gi568815592f:15146540_15620248|GENSCAN_predicted_peptide_8|105_aa
MGHKAVETTRNIHNAFGAGTANELTVQWWFKKFCKGDKSLEDEEHGGQPSEVDNDQLKAI
VKADPLTTTREVAKELNINHSVVIQHLKQIGRVKKHDPPNPEEFN

>gi568815592f:15146540_15620248|GENSCAN_predicted_CDS_8|318_bp
atgggtcataaagcagtggagacaactcgcaacatccacaatgcatttggcgcaggaact
gctaatgaactgacagtgcaatggtggttcaagaagttttgcaaaggagacaagagcctt
gaagatgaggagcacggtggccagccatcagaagttgacaacgaccagttgaaagcgatc
gtcaaggctgatcctcttacaactacacgagaagttgccaaagagctcaacatcaaccat
tctgtggtcattcagcatttgaagcaaattggaagggtaaaaaagcatgaccctcccaat
cctgaagagtttaactga

>gi568815592f:15146540_15620248|GENSCAN_predicted_peptide_9|88_aa
MGLDKNDQENKASTISYHEKKRSRNYNSEALEACQSLGSKVQGEAASADVKAAAAYPEDL
ATITDEGGYAKQQIVNRQNRLLLEEDDT

>gi568815592f:15146540_15620248|GENSCAN_predicted_CDS_9|267_bp
atgggtctagacaagaatgaccaggagaataaagcatctacaatcagctaccacgaaaag
aaaaggtcaaggaactacaactctgaagccttagaagcttgccagtctcttggcagcaaa
gtgcaaggtgaagcagcaagtgctgatgtaaaagctgcagcagcttatccagaagatcta
gctacgataactgatgaaggtggctatgctaaacagcagattgtcaacagacaaaataga
cttttattggaagaagatgacacctag