GENSCAN 1.0    Date run: 5-Nov-116    Time: 18:57:25

Sequence gi568815596f:48246215_48475440 : 229226 bp : 39.43% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.06 PlyA -     11      6    6                               1.05
 1.05 Term -    435    344   92  1  2  102   38    55 0.002  -1.20
 1.04 Intr -   8269   8176   94  1  1  104   45    59 0.007   1.82
 1.03 Intr -   9845   9750   96  2  0  111   97    17 0.047   4.19
 1.02 Intr -  24329  24211  119  0  2   59   28   147 0.357   5.06
 1.01 Init -  31713  31653   61  0  1   72  116    70 0.967   9.66
 1.00 Prom -  32115  32076   40                              -4.55

 2.05 PlyA -  32368  32363    6                               1.05
 2.04 Term -  55907  55528  380  0  2    6   41   311 0.150  12.17
 2.03 Intr -  56278  56125  154  1  1   89   94    38 0.100   3.22
 2.02 Intr -  68029  67900  130  0  1  -58   55   185 0.265   0.38
 2.01 Init -  68554  68169  386  1  2   59   66   195 0.426  10.77
 2.00 Prom -  77758  77719   40                              -2.75

 3.00 Prom +  89096  89135   40                              -4.15
 3.01 Init + 100001 100537  537  1  0   35   86   349 0.442  24.31
 3.02 Intr + 112833 112933  101  2  2   68   78    89 0.618   3.89
 3.03 Intr + 115963 115999   37  1  1   93   93    -6 0.397  -2.25
 3.04 Intr + 116939 117013   75  1  0   74   29    93 0.328   0.79
 3.05 Intr + 117349 117379   31  0  1   73  115    14 0.707  -0.61
 3.06 Intr + 127078 127146   69  2  0  105  101    50 0.963   6.24
 3.07 Term + 128706 129229  524  1  2  106   43   602 0.998  50.95
 3.08 PlyA + 131074 131079    6                               1.05

 4.00 Prom + 131283 131322   40                              -5.75
 4.01 Sngl + 138977 139243  267  1  0   72   53   370 0.999  26.89
 4.02 PlyA + 139714 139719    6                               1.05

 5.00 Prom + 140068 140107   40                              -8.55
 5.01 Sngl + 142442 142876  435  1  0   43   51   286 0.433  16.42
 5.02 PlyA + 143218 143223    6                               1.05

 6.00 Prom + 145716 145755   40                              -4.15
 6.01 Init + 149530 149665  136  0  1   59   95   125 0.751  10.65
 6.02 Intr + 156845 156948  104  0  2   96   10    45 0.399  -3.53
 6.03 Intr + 157267 157443  177  0  0   77   49    64 0.269   0.59
 6.04 Intr + 160971 161061   91  2  1   92   86    69 0.468   5.75
 6.05 Term + 164562 164857  296  1  2   16   55   243 0.576   8.28
 6.06 PlyA + 165123 165128    6                               1.05

 7.04 PlyA - 165157 165152    6                               1.05
 7.03 Term - 170208 170102  107  1  2   99   41    75 0.634   1.49
 7.02 Intr - 173136 172890  247  0  1   54   80   183 0.399  10.21
 7.01 Init - 189781 189566  216  1  0   69   83    70 0.655   3.34
 7.00 Prom - 190802 190763   40                              -4.55

 8.00 Prom + 193233 193272   40                              -4.95
 8.01 Init + 197036 197105   70  1  1   94   50   113 0.799   9.36
 8.02 Intr + 199444 199535   92  2  2   67   76    25 0.814  -2.01
 8.03 Intr + 204794 204862   69  1  0   96   86    98 0.901   8.86
 8.04 Intr + 208381 208527  147  0  0   36  100   184 0.911  13.91
 8.05 Intr + 211858 212013  156  0  0   47   52   164 0.910   7.99
 8.06 Intr + 213540 213704  165  2  0    9   74   301 0.862  20.04
 8.07 Intr + 213881 213939   59  1  2   68   80   116 0.961   5.56
 8.08 Intr + 216661 216791  131  1  2  -31   59   186 0.547   3.22
 8.09 Intr + 218723 218775   53  0  2   96  103    54 0.981   5.41
 8.10 Intr + 219279 219428  150  2  0   69   94   114 0.990   9.54
 8.11 Intr + 224873 224974  102  1  0   84   19    96 0.671   1.75
 8.12 Intr + 228416 228605  190  1  1   -6  108   129 0.067   3.74

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  80062  80128   67  0  1   77   72   105 0.929   9.09

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596f:48246215_48475440|GENSCAN_predicted_peptide_1|153_aa
MTQTSETGIKVDQDKDNRDKERAHGKQMAVIQHRMAPGDSTVPKLSSAEAQTKSTVDNRW
SYGLTATVYLGDHCCKSRPALYPYMCALYCHKLRILLHVLLGKQQKALDEKFCYICPPTC
IYTLPFHSGPVMEKAAVMIYEIPSGSLFHCLDK
>gi568815596f:48246215_48475440|GENSCAN_predicted_CDS_1|462_bp
atgacccagacttctgaaactggaatcaaagtggatcaggataaagacaacagagacaaa
gaaagggcacatggaaagcagatggctgtcattcagcacagaatggctccaggtgacagt
actgttcccaagttgagctctgcggaagctcaaaccaagtcgactgtggacaatcgatgg
agttatggtctcactgccacagtctacctgggagaccattgctgcaagtctcgtcctgcc
ctttatccttatatgtgtgccctgtattgccacaagctgaggatcttgcttcatgtctta
ttaggaaagcagcaaaaagccttagatgagaaattctgctacatctgcccacccacctgc
atttacaccttgccttttcattctgggcctgtgatggaaaaggcagcagtgatgatctat
gaaataccttctgggtcacttttccattgtctcgataaatag
>gi568815596f:48246215_48475440|GENSCAN_predicted_peptide_2|349_aa
MAEHRSCSLQLPAVASGKTRQPAGSSSCSGGPGSTTSPSRSLPPPPSPSPLPWQRGAKAI
SPTPPSSPASRDPEEGGSAGPSRFPPTSYRREGPGFRLAAWHGGWGGGEMQGPEGRGSHG
KQGGGAVGRLPQGSTALGACARAEETFLDTPRYVGFRKTFRHCLGFESALGTRAADLPAQ
CSSSAKGQSASSSGSLTPMPPDWETLPSRGGQAPHTREFQLASGHQHQRSKVDKSTKMRK
NQHKNAEDSKNQNGFSPPNDRSYSPAKAQNWTENEFDELTEGGLRRWVITNSSELKERVL
TQCKEAKNLDKRLQELLTRITSLEKNINDLMELRNTARELREAYTSINS
>gi568815596f:48246215_48475440|GENSCAN_predicted_CDS_2|1050_bp
atggcagagcaccgcagctgttccctgcagcttccagcagtcgcctccgggaagacgcgg
caaccagcaggcagcagcagctgcagtggcggccccggcagcaccacatccccctcccgc
tcgcttcctccccctccctccccttccccgctgccatggcaacgcggcgccaaggcgatt
agccccacccctccctccagtcccgcctcccgcgaccccgaggaagggggctcggccggc
ccctcccggttcccgcccacctcctaccgaagggaggggcctgggttccggctggctgcc
tggcacggcggctggggtgggggtgaaatgcagggtccggaagggcggggctcacacgga
aagcagggtggaggcgctgtaggccggctgccgcagggcagtacggccctcggtgcctgt
gcccgggctgaggagaccttcctcgacaccccacgttatgtgggtttccgcaagaccttc
cgccattgcctgggctttgaaagcgctttaggaacgagagcagcagatctcccagcacag
tgctcgagctctgctaagggacagtctgcctcctcaagtgggtccttgacccccatgcct
cctgactgggagacacttcccagcaggggtggacaggcacctcatacaagagagttccag
ctggcctcaggtcatcagcatcaaagatcaaaggtagataaatccacgaagatgaggaaa
aaccagcacaaaaatgctgaagattccaaaaaccagaatggcttttctcctccaaatgat
cgcagctactctccagcaaaggcacaaaactggacggagaatgagtttgatgaactgaca
gaaggaggcctcagaaggtgggtaataacaaactcctctgagctaaaggagcgtgttcta
acccaatgcaaggaagctaagaaccttgataaaaggttgcaagaactgctaactagaata
accagtttagagaagaatataaatgacctgatggagctgagaaacacagcacgagaactt
cgtgaagcatatacaagtatcaatagctga
>gi568815596f:48246215_48475440|GENSCAN_predicted_peptide_3|457_aa
MGPVIGMTPDKRAETPGAEKIAGLSQIYKMGSLPEAVDAARPKATLVDSESADDELTNLN
WLHESTNLLTNFSLGSEGLPIVSPLYDIEGDDVPSFGPACYQNPEKKSATSKPPYSFSLL
IYMAIEHSPNKCLPVKEIYSWILDHFPYFATAPTGWKNSVRHNLSLNKCFQKVERSHGKV
NGKGSLWCVDPEYKPNLIQALKKQPFSSASSQKLSKLSSISKENKTFQWDKNVEAEDSDI
DLDPVYAQVNSNSGPSSFIHESDIDAAAAMMLLNTSIEQGILECEKPLPLKTALQKKRSY
GNAFHHPSAVRLQESDSLATSIDPKEDHNYSASSMAAQRCASRSSVSSLSSVDEVYEFIP
KNSHVGSDGSEGFHSEEDTDVDYEDDPLGDSGYASQPCAKISEKGQSGKKMRKQTCQEID
EELKEAAGSLLHLAGIRTCLGSLISTAKTQNQKQRKK
>gi568815596f:48246215_48475440|GENSCAN_predicted_CDS_3|1374_bp
atgggtccagtaattggaatgactccagataagagagctgaaaccccaggagctgaaaag
attgcaggattaagccagatttacaaaatgggaagcttgcctgaagctgttgatgctgcc
aggccgaaggccactctagtggacagtgagtcagcagatgatgaactcacaaacttgaac
tggcttcatgaaagcactaatcttctaacaaacttcagcctcggaagtgagggtcttcca
attgttagtccattgtatgacatagagggagatgatgtgccatcctttggaccagcttgc
taccagaacccagaaaaaaaatcagcgacttcaaagcccccatactcctttagtcttctc
atttatatggccattgagcactctccaaataaatgtttgcctgtcaaagaaatttatagc
tggattctggaccattttccatattttgctactgcaccaacaggctggaagaattctgtt
cgacataatctgtccctgaataaatgttttcagaaagtggaaagaagccatggcaaggtt
aatggaaaaggttccttatggtgtgttgatccggaatataaacccaatcttatccaggca
ctgaagaagcaacctttttcttcagcatcttcacaaaaactttctaaattgagtagtatt
agcaaagaaaacaagaccttccagtgggacaaaaatgtagaggcagaagacagtgatatt
gatcttgaccctgtgtatgctcaggttaatagcaattccggtccttcaagcttcattcat
gaatctgatattgatgctgctgctgcaatgatgcttttaaatacttctatagaacaagga
attttagaatgtgagaagcctcttcctcttaaaacagcattgcaaaaaaagaggagttac
ggcaatgcatttcatcatcccagtgctgtacgattacaagagagtgattctttagccacc
agcattgatccaaaagaagatcacaattacagtgcaagtagcatggcagcacagcgttgt
gcatccaggtctagcgtgtcttccctgtcttctgtggatgaggtatatgaatttatccca
aagaatagtcacgtgggaagtgatggcagtgaaggatttcacagtgaagaagatacagac
gttgattatgaagatgatcctcttggagacagtggctatgcatcacagccttgtgcaaaa
atctctgaaaaagggcagtcaggcaaaaagatgcgaaaacagacatgtcaagaaattgat
gaggagctcaaagaggcagctggatctctgctccaccttgctggaattcgtacatgttta
ggttccctaataagtactgcaaagacacaaaatcaaaagcaacggaaaaaatag
>gi568815596f:48246215_48475440|GENSCAN_predicted_peptide_4|88_aa
MGQVWALVRSTLEPFHTDDEEEGEYNEVTEQVCLPAKAKVAKEGEVHPYPSAPPPYFEEK
EWPDPPDLSFPEDTGRKVVAPVTVRAAP
>gi568815596f:48246215_48475440|GENSCAN_predicted_CDS_4|267_bp
atgggacaagtgtgggctctggttcgttccaccttggaaccttttcacactgatgatgag
gaggaaggagagtataacgaagtaacagagcaggtttgtttgccagctaaagctaaagtg
gcaaaggagggagaggttcatccctacccttctgcaccccctccttattttgaagaaaaa
gagtggcctgaccctccagatctttcttttccagaggacactgggcgaaaagtagttgcc
ccagtgactgttcgagcagcaccttga
>gi568815596f:48246215_48475440|GENSCAN_predicted_peptide_5|144_aa
MVKNSKHITGIPYNSQGQATVEQMNLSLKQQLQKQKGGNRDYGTAHIQLNLALLTLKFLS
LPKAQMLSAAEQHLQKPAAKTEAEQLVWWRDPITKSWEIGKIITWDRGYACVSPGPNQQL
IWIPSRCLKPYHELDAEEEIPGGS
>gi568815596f:48246215_48475440|GENSCAN_predicted_CDS_5|435_bp
atggtaaagaatagtaaacacattactggcatcccatataattctcaaggacaagccact
gtagaacaaatgaatctctccctgaaacagcagttgcaaaaacagaaagggggaaacagg
gattacgggacagcccatatacaattgaatctagcattattaactttaaagtttttgagc
ctgcctaaagcccagatgttgtcagcagctgaacaacatctacagaaaccagctgcaaag
acagaagcagaacaacttgtttggtggagagatccaataacaaaaagttgggaaataggt
aaaataataacttgggatagaggttatgcttgtgtttctccaggaccgaatcaacagctg
atttggataccatcaagatgcctgaaaccttatcatgaactagatgctgaagaagagatt
ccaggaggatcctga
>gi568815596f:48246215_48475440|GENSCAN_predicted_peptide_6|267_aa
MCRRLQVSAGGASRQEPGKIKEVQPFDVFTGTARVRVTVPGGNCDGVHIGTTFVSMKFRV
FTLNLEIQCKGVYPEKIIRQGGKRGNVRVVLYEKDSPWHCWLEDEPTQPLEAKKGKETHS
HLEPPRGTQQPHQHLDFNPRLAPSLANQWTLRRLRLQVQTSIFLKLVESGEPHKRTPEEG
VKGQNKRVNVKEIAGNPGLKGEVEASSWRRDSVLHKRFWGQLAKPSARERACKGASVTPP
GTNGGSGQCLPGKMASIPPIPILRDPP
>gi568815596f:48246215_48475440|GENSCAN_predicted_CDS_6|804_bp
atgtgcaggagactgcaagtatctgcaggaggagcctctaggcaggaaccagggaagata
aaggaagtgcagccctttgatgtgtttacaggaactgcaagagtaagagttactgtaccc
ggaggaaactgcgatggtgtacacattggtacaacctttgtgtcaatgaaatttcgagtg
ttcacacttaacctagaaattcaatgtaaaggagtttatcccgagaaaataataaggcaa
ggaggaaagagagggaatgtcagagtggtgttatatgagaaagactcaccgtggcattgc
tggcttgaagatgagccgacgcagcctctagaagccaaaaaaggaaaggaaacacactcc
cacttagagcctccaagaggaacacaacagcctcaccaacaccttgatttcaacccccgc
ctagctccaagtctagcaaaccagtggactcttagaagactgaggctccaagttcagact
agtatcttcctgaaactggtagaatcaggagaaccacacaagaggactcctgaggagggg
gtaaaaggccaaaacaagagggttaatgtaaaggagattgctggaaaccctgggttgaag
ggagaagtggaagcttccagctggaggagagacagcgtcctccataaacgattttggggg
caacttgcaaaaccatctgccagagaaagagcctgcaaaggtgcctctgtcacacctcca
ggaactaatggtggctctggacagtgcctgccaggaaagatggcgtccatcccccccatc
cctatcctcagggacccaccttga
>gi568815596f:48246215_48475440|GENSCAN_predicted_peptide_7|189_aa
MENRACQLSYRRLCKIKLSPGAEKGTKTGTIRVLSERLPHFSVEDVPYHKDWILLSHVAG
GWVDRKLHSSGRTNGVVHVPGTGIGAGDGFEELRHSQLMGEAISKKIIMVQQKHFSSFMY
KEDSTNIEGQKPGPSGDGEEEGRKVLSLGKIYRAEKVTQEIQLSPSSTWHELGPGLFAVQ
VEIPRKQPS
>gi568815596f:48246215_48475440|GENSCAN_predicted_CDS_7|570_bp
atggagaatagagcttgccagctctcatataggaggctgtgcaagattaagttgagccct
ggggcagaaaaggggactaagacgggaaccatccgggtcctgtctgaaaggctacctcac
ttttctgtggaggatgttccataccacaaagattggatacttttgagccatgtggcaggt
ggttgggtagacagaaaactgcatagtagtggaaggaccaatggggtggtgcatgtgcca
ggcactgggataggtgctggagatggctttgaagagctcagacatagccagctgatgggg
gaggccattagtaagaaaatcattatggtgcaacagaagcactttagcagcttcatgtac
aaagaggactccactaacatagagggacaaaagcctggaccttctggggatggagaggag
gaaggcaggaaggtgctaagtttagggaagatatacagagcagagaaagtcacacaagaa
atacagctgtctccatcctccacgtggcatgagctggggccagggctctttgcagttcag
gtggaaatccccaggaaacagccctcatag
>gi568815596f:48246215_48475440|GENSCAN_predicted_peptide_8|462_aa
MTATDTMRASGKKSPFRVKGYRRGAAKLQGLGFCRSEFPCMGAGAGERMPFEADLRAQNQ
VLKKGVVDEQANSAALKEQLKMKDQSLRKLQQEMDSLTFRNLQLAKRVELLQDELALSEP
RGKKNKDVSLKDESYVDITYSFHQKSGESSSQLSQEQKSVFDEDLQKKIEENERLHIQFF
EADEQHKHVEAELRSRLATLETEAAQHQAVVDGLTRKYMETIEKLQNDKAKLEVKSQTLE
KEAKECRLRTEEWLSLDWGDVEDLVEGEGADWSFQGDKGERGSHGEDGDAVDRKKREYSQ
YNALNVPLHNRRHQLKMRDIAGQALAFVQDLVTALLNFHTYTEQRIQIFPVDSAIDTISP
LNQKFSQYLHENASYVRPLEEGMLHLFESITEDTVTVLLIENRLGCSLLKNLPIPSLEEE
CESSLCTSALRARNLELSQDMKKMTAVFEKLQTYIALLALPX
>gi568815596f:48246215_48475440|GENSCAN_predicted_CDS_8|1386_bp
atgactgccactgacacaatgagggccagcgggaagaagagcccgttcagagtgaagggc
tataggagaggggcagctaaactgcaagggttggggttttgtcgtagtgagtttccctgc
atgggagcaggagcaggagaaaggatgccatttgaggcagatcttcgggctcagaatcag
gttctgaaaaaaggtgttgtggatgaacaagcaaattctgcagctttaaaggagcaactg
aaaatgaaggatcagtcattgagaaaactacaacaggaaatggacagtttgacatttcga
aatctgcagcttgccaagagggtagaactacttcaagatgaactagctctaagtgaacca
cgaggcaagaaaaacaaggatgtttctctaaaggatgaaagttatgtggacataacttat
tctttccatcagaaaagtggagaatcttcttctcagttgagtcaagagcagaagagtgtc
tttgatgaagatctgcaaaagaagatagaagagaatgaacggttgcatatacaatttttt
gaagctgatgagcagcacaagcatgtggaagcagagctgaggagtcgactggccactctg
gagacagaagcagcccagcaccaagctgtggttgacggtctcacccggaagtacatggaa
accattgagaagctgcagaacgacaaggctaaactagaagtgaaatctcagactctagaa
aaggaagccaaggaatgtcgacttcgaacggaagaatggcttagcctggattggggagat
gtggaagatcttgtggaaggagagggagcagactggtcctttcaaggagacaagggagaa
agaggcagtcatggggaagatggtgatgcagttgaccgaaagaaaagagaatatagtcag
tacaacgctctgaacgttccactccacaataggagacaccagctgaagatgcgagatatt
gctgggcaggccctggcttttgttcaggatcttgtgacggctcttctaaactttcatacc
tacacagaacagaggattcaaatttttcctgttgattctgccattgacactatatctcca
ttgaatcagaagttctcacaataccttcatgaaaatgcgtcctatgtccgccctcttgag
gaaggaatgcttcatttatttgaaagtatcactgaggatactgtgactgtcttgctaatt
gaaaatcgtttgggatgctcacttcttaaaaatctgcctattcctagtttagaagaagaa
tgtgaatcctctctttgcacatctgcgttaagagccaggaatctagagctgtcccaggac
atgaaaaaaatgacagctgtgtttgagaagctgcagacttacatagctcttcttgccttg
ccaann