GENSCAN 1.0 Date run: 5-Nov-116 Time: 00:58:09

Sequence gi568815583r:49027953_49255578 : 227626 bp : 37.74% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.10 Intr -    730    501  230  2  2   68   48   183 0.875   8.97
 1.09 Intr -   5148   5013  136  0  1   60   95   140 0.957  11.12
 1.08 Intr -   7706   7382  325  1  1   63  106   174 0.785  11.55
 1.07 Intr -   9817   9639  179  1  2   62   59   140 0.949   6.30
 1.06 Intr -  18113  17926  188  0  2   65   56    50 0.009  -2.01
 1.05 Intr -  18492  18324  169  0  1   20  101   108 0.231   3.90
 1.04 Intr -  22085  21961  125  0  2   59   87    66 0.422   2.98
 1.03 Intr -  24017  23836  182  1  2   90   14   145 0.090   5.89
 1.02 Intr -  35313  34926  388  1  1  -55   43   348 0.063   8.92
 1.01 Init -  36161  36086   76  2  1   96   84    67 0.986   8.40
 1.00 Prom -  42595  42556   40                              -8.15

 2.00 Prom +  44665  44704   40                              -4.25
 2.01 Sngl +  54199  54492  294  0  0   56   48   191 0.674   7.35
 2.02 PlyA +  54797  54802    6                               1.05

 3.00 Prom +  59018  59057   40                              -2.85
 3.01 Init +  71315  71461  147  1  0   69  106   100 0.438  10.04
 3.02 Term +  83201  83434  234  1  0   23   42   207 0.265   4.94
 3.03 PlyA +  83497  83502    6                               1.05

 4.15 PlyA -  84022  84017    6                               1.05
 4.14 Term - 100142  99998  145  1  1   72   42   116 0.613   1.80
 4.13 Intr - 100808 100750   59  2  2   28   97    56 0.613  -2.74
 4.12 Intr - 101607 101525   83  1  2  107   75    -6 0.663  -1.56
 4.11 Intr - 102864 102767   98  2  2   83   95   113 0.978  10.23
 4.10 Intr - 105859 105807   53  1  2   81  116    16 0.989   0.49
 4.09 Intr - 106156 105978  179  2  2   95   94   143 0.978  14.32
 4.08 Intr - 106562 106379  184  2  1   48  -14   145 0.585  -1.26
 4.07 Intr - 109275 109198   78  0  0   79   70    41 0.528   0.23
 4.06 Intr - 111701 111576  126  2  0  113   66    24 0.800   2.66
 4.05 Intr - 116352 116275   78  0  0  115   91    80 0.997   9.83
 4.04 Intr - 117126 117013  114  0  0  110   10   141 0.996   8.22
 4.03 Intr - 121870 121759  112  0  1   60   89    35 0.593   0.26
 4.02 Intr - 127224 127085  140  0  2   21   70   144 0.532   4.24
 4.01 Init - 127626 127573   54  0  0   84   81   138 0.705  14.03
 4.00 Prom - 133667 133628   40                              -7.75

 5.00 Prom + 134260 134299   40                              -5.75
 5.01 Init + 139594 139642   49  0  1   86   58    41 0.524   0.06
 5.02 Intr + 142043 142423  381  0  0  101   47   160 0.865   7.16
 5.03 Intr + 149622 149881  260  1  2   99   25   204 0.056  11.46
 5.04 Intr + 159244 159266   23  0  2   93   86     9 0.002  -3.38
 5.05 Intr + 173210 173298   89  2  2   88   36    88 0.002   2.30
 5.06 Intr + 189238 189361  124  2  1   79   75    74 0.000   3.92
 5.07 Intr + 197451 197561  111  0  0   76   60    75 0.004   2.08
 5.08 Intr + 207899 207989   91  2  1  123   99    69 0.556  10.58
 5.09 Intr + 211269 211415  147  2  0   91   88    33 0.392   3.11
 5.10 Term + 219357 219560  204  2  0   22   40   133 0.034  -1.71
 5.11 PlyA + 219634 219639    6                               1.05

 6.03 PlyA - 221270 221265    6                               1.05
 6.02 Term - 222165 221890  276  0  0   70   38   262 0.528  13.88
 6.01 Intr - 225928 225875   54  1  0   55   32   111 0.439   0.46

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  10908  10897   12  0  0   96   86    14 0.921   1.29
S.002 Term -  35313  34847  467  1  2  -55   41   361 0.907  11.29
S.003 Sngl - 187814 187470  345  2  0   90   28   160 0.801   5.99

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815583r:49027953_49255578|GENSCAN_predicted_peptide_1|666_aa
MGKEEQERKKEKLDGPGQSSREGSACGKLRKSPKIVKRNVKDEAQDKTIAGRHTSRWTSR
GTRCLKKTQAAGLREDVEGSMLVEEHMTDGTWAGYGLAERGGVWLGQSEESRGCRVAQIQ
GKTISLLALPFAESYFYSIKPCTNSPSPHMIQFFRDFVPCIPAAPAMAKRDQGTARAMAS
EGVSPKIWQLPCGIEPVGEQESRIEFGNLCLDFRGCFQLEAGLSWGKGEVLTWNKVCVTF
TRFHPRQKKKSSSENLKWRSRIGVAASQIGAEVRNPVWTRAGGHGESGPRRCLHRLARAR
CGCRHHGPSPHGAGGGLAVAASPLLGEGVRRLQVVPALLGGGLITSSSLPASPIPFSSGM
CFQGGKAFEGFSPRKTNVKLSAEVEPFIPQKKSPDTFMIPMALPNDNGSVSGVEPTPIPS
YLITCYPFVQENQSNRQFPLYNNDIRWQQPNPNPTGPYFAYPIISAQPPVSTEYTYYQLM
PAPCAQVMGFYHPFPTPYSNTFQAANTVNAITTECTERPSQLGQVFPLSSHRSRNSNRGS
VVPKQQLLQQHIKSKRPLVKNVATQKETNAAGPDSRSKIVLLVDASQQTDFPSDIANKSL
SETTATMLWKSKGRRRRASHPTAESSSEQGASEADIDSDSGYCSPKHSNNQPAAGALRNP
DSGTMN

>gi568815583r:49027953_49255578|GENSCAN_predicted_CDS_1|1998_bp
atggggaaggaagagcaggagagaaagaaggagaaacttgatgggccgggacagagcagc
agagaagggtcagcatgtgggaagcttaggaagagccctaaaatagtcaagaggaatgtg
aaagatgaggcccaggacaagaccatagcaggcaggcacacaagccgctggacgtcgaga
ggaacacgttgtttgaagaagacacaagcagctggactgcgagaggacgtggaggggagc
atgctggtggaagagcacatgacagacggcacgtgggcaggctatggactggcagaacgg
ggaggagtttggctggggcagtccgaggagagccggggctgccgagtggcccaaatccag
gggaaaaccatctcgcttttggctctcccatttgctgagagctacttctactcaataaaa
ccttgcactaattctccaagcccacacatgatccaattcttccgggactttgtgccctgc
atcccagccgctccagccatggctaaaagggaccaaggtacagctcgagccatggcttca
gagggtgtaagccccaagatttggcagcttccatgtggtattgagcctgtgggtgaacag
gagtcaagaattgagtttgggaatctctgcctagatttcagaggatgcttccagcttgaa
gcaggactgagctggggaaaaggagaagttttgacgtggaataaagtttgtgtgaccttc
acgagatttcatccaaggcagaaaaagaaatcatcttcagagaatttgaagtggcgtagc
cgaatcggtgtcgcggccagccagataggggcggaggtccggaacccagtctggacccga
gcggggggccatggagaaagcggcccgaggcgctgtttacaccgactagcgcgggcccgt
tgcggctgcaggcaccatggaccgagcccccacggagcaggcggagggctcgccgttgcc
gcttctcccctgcttggtgaaggggtcagaaggctccaagttgtccctgctctcctggga
ggagggcttataacttctagtagtctccccgcttcccccattcccttcagctctggaatg
tgttttcaaggtggaaaggcgtttgaaggcttctcacccagaaagactaatgtcaagctg
tcagctgaggtggagccatttattccccagaagaagagtcctgatacatttatgatccct
atggctctcccaaatgataatggaagtgtttctggtgtggaaccaactccaattcccagc
tacctgattacttgttacccatttgtgcaggaaaaccagtccaatagacagtttccttta
tataacaatgatatacgatggcaacaacccaatccaaaccctactggaccatactttgcc
tatcccattatatctgctcagccgcctgtttctacagagtatacatattatcagctgatg
ccagcaccatgtgcccaggttatgggtttctatcatccttttcctacaccttactccaac
acctttcaggctgcaaatactgtaaatgctatcaccacagaatgcactgagcgtccaagt
cagcttggacaggtcttcccattgtccagccatcgaagcagaaacagtaacagaggatca
gtggtcccaaaacaacagcttttacaacagcacataaaaagcaaaaggccgctggtgaaa
aatgtagctactcagaaagaaacaaatgcagcaggtcctgatagtcgatcaaaaattgtg
cttctggtagatgcttcacagcaaactgatttcccatcagatatcgctaacaagtctctc
tcagagaccactgcaacaatgctctggaagtccaagggcaggagaagaagagcatcccac
cctactgctgaatcttctagtgagcagggggctagtgaagccgacattgacagtgatagt
ggttactgcagtcccaaacacagcaacaaccagcctgcagcaggggctttgagaaatcct
gattctgggaccatgaat

>gi568815583r:49027953_49255578|GENSCAN_predicted_peptide_2|97_aa
MELRIFHKKVKFKVMPEGQGGVNSIKRLFKEPSAQREQCSRTLRQKGEEVQADQASWNTV
SKGDKIQRGESEQRDNHTGFEATIMVGGCVLRRVGNY

>gi568815583r:49027953_49255578|GENSCAN_predicted_CDS_2|294_bp
atggagctcaggatcttccataagaaagtaaaatttaaggtgatgcctgaaggccaagga
ggagttaactcaataaaaaggctctttaaagagcctagtgcccagagggaacagtgttcc
aggaccctaaggcagaaaggagaggaggtgcaggcagaccaggctagttggaatacagtg
agtaaaggtgataagattcagaggggagagagtgagcagagggacaaccatacagggttt
gaagccacaattatggttggaggctgcgtcctaagaagagtgggaaactattga

>gi568815583r:49027953_49255578|GENSCAN_predicted_peptide_3|126_aa
MKEAVMWLSRGNSIPDSEISKCNSLRRDVPGRLEKLQEDEYGRGEMNKKELNNLRTHQAT
NATAGTQARWRKAQEMTHLKQLTQVPVHAVLGPKDRHVQPSMATTGASGVVYLVSLFPVK
FHQSLH

>gi568815583r:49027953_49255578|GENSCAN_predicted_CDS_3|381_bp
atgaaggaagcagtcatgtggctgtccaggggaaatagtattccagactcagaaatcagc
aagtgcaacagcctgaggagggatgtgcctggcaggttggagaaactgcaagaagacgaa
tatggcaggggcgaaatgaacaagaaagagctcaacaacctgcgcacacaccaggccacc
aatgccactgctggcacccaagcaaggtggcggaaggcccaagaaatgacccacctgaag
caactaacacaagtgccagtgcatgctgtcctggggcccaaggacaggcatgtccagcct
tccatggccaccactggggccagtggagtagtctacctggtgtctctgttcccagtaaaa
tttcaccagagcctccactaa

>gi568815583r:49027953_49255578|GENSCAN_predicted_peptide_4|500_aa
MSDMEDDFMCDDEEDYDLALALPFRAQRSGPPLLHTGTERVRNARRSQKTGVSLLLPGLT
AKRNFVPTYGTAVRQDSESIVNGEYCFPKVHFLGLLLAVNTREYSEDSNSEPNVDLENQY
YNSKALKEDDPKAALSSFQKVLELEGEKGEWGFKALKQMIKINFKLTNFPEMMNRYKQLL
TYIRSAVTRNYSEKSINSILDYISTSKQLGKLYLEREEYGKLQKILRQLHQSCQTDDGED
DLKKGTQLLEIYALEIQMYTAQKNNKKLKALYEQSLHIKSAIPHPLIMGVIRGKFECGGK
MHLREGEFEKAHTDFFEAFKNYDESGSPRRTTCLKYLVLANMLMKSGINPFDSQEAKPYK
NDPEILAMTNLVSAYQNNDITEFEKILKTNHSNIMDDPFIREHIEELLRNIRTQVLIKLI
KPYTRIHIPFISKELNIDVADVESLLVQCILDNTIHGRIDQVNQLLELDHQKRGGARYTA
LDKWTNQLNSLNQAVVSKLA

>gi568815583r:49027953_49255578|GENSCAN_predicted_CDS_4|1503_bp
atgtctgacatggaggatgatttcatgtgcgatgatgaggaggactacgacctggccctg
gctttgccctttcgggcccagcgttccgggcccccacttcttcacactggaactgagcgg
gtgcgaaatgctaggagaagtcagaaaacgggggtgtccttactgctcccaggcctgact
gcgaaacgcaactttgtgccaacatatggtacagctgttagacaagactcagaaagcatt
gttaatggtgagtattgttttcctaaagtacatttcttaggacttctattggcggtaaat
acaagagaatactctgaagatagtaactccgagccaaatgtggatttggaaaatcagtac
tataattccaaagcattaaaagaagatgacccaaaagcggcattaagcagtttccaaaag
gttttggaacttgaaggtgaaaaaggagaatggggatttaaagcactgaaacaaatgatt
aagattaacttcaagttgacaaactttccagaaatgatgaatagatataagcagctattg
acctatattcggagtgcagtcacaagaaattattctgaaaaatccattaattctattctt
gattatatctctacttctaaacagcttggaaaattatatttagaacgagaggaatatgga
aagcttcaaaaaattttacgccagttacatcagtcgtgccagactgatgatggagaagat
gatctgaaaaaaggtacacagttattagaaatatatgctttggaaattcaaatgtacaca
gcacagaaaaataacaaaaaacttaaagcactctatgaacagtcacttcacatcaagtct
gccatccctcatccactgattatgggagttatcagaggcaagtttgaatgtggtggtaaa
atgcacttgagggaaggtgaatttgaaaaggcacacactgatttttttgaagccttcaag
aattatgatgaatctggaagtccaagacgaaccacttgcttaaaatatttggtcttagca
aatatgcttatgaaatcgggaataaatccatttgactcacaggaggccaagccgtacaaa
aatgatccagaaattttagcaatgacgaatttagtaagtgcctatcagaataatgacatc
actgaatttgaaaagattctaaaaacaaatcacagcaacatcatggatgatcctttcata
agagaacacattgaagagcttttgcgaaacatcagaacacaagtgcttataaaattaatt
aagccttacacaagaatacatattccttttatttctaaggagttaaacatagatgtagct
gatgtggagagcttgctggtgcagtgcatattggataacactattcatggccgaattgat
caagtcaaccaactccttgaactggatcatcagaagaggggtggtgcacgatatactgca
ctagataaatggaccaaccaactaaattctctcaaccaggctgtagtcagtaaactggct
taa

>gi568815583r:49027953_49255578|GENSCAN_predicted_peptide_5|492_aa
MGFHHVGQAVLELLTSGRKKKTSLVSFARALRRGSCEALVPPWELRAGRRLYLTAAWREP
WLSPGRRSQRRGRLEAATSAKLQDFRRAGAVYLPEENGSCHRSLVIALGALLRHLKLQEK
EGSSEIWLQRALLRVGSRWQNILGINFTNITYISGNGGVGEYCAFSKMHSENHQECGGTY
DPTKATDLSACRCQPMHHPLLVDWFGDPLGGRIFSDSIMEWMNEDNFKHFRYCLGGVGLL
KLKEMFNSKFGSIPKFYVRAPGRVNIIGEHIDYCGYSVLPMAVEQDVLIAVEPVKTYALQ
LANTNPLYPSAVALDCHGSMNPIVDCMWEGSRLHAPYDNPMLDDLRDFSTSANNIQIDKT
KPLWHNYFLCGLKGIQEHFGLSNLTGMNCLVDGNIPPSSGLSSSSALVCCAGLVTLTVLG
RNLSKRNGKPLEGCELGMARCDLHGLSSFTWEYYTIPLTVAREETIRKKREEKKEKKEGR
QMVNALMELKCI

>gi568815583r:49027953_49255578|GENSCAN_predicted_CDS_5|1479_bp
atggggtttcaccatgttggccaggccgttctcgaactcctgacctcagggcgcaaaaag
aaaacctcactagtcagcttcgcacgtgctctgcgcaggggctcctgcgaggccctagtc
cctccctgggagctaagagcaggacggaggctgtacctgactgctgcttggagggaaccc
tggctgagtccagggaggaggagccagaggaggggccgattggaagcagccacctcagcg
aagctgcaggatttccgcagggcgggagctgtttatctcccggaagaaaacggctcctgt
cacagaagtctcgtgattgctctgggagctttgcttagacacttgaaactacaggagaaa
gaaggatctagcgaaatatggctacagagagccctgctacgcgtcgggtccaggtggcag
aacatcctaggtatcaactttacaaacattacttacattagtggtaacggtggagttgga
gagtattgtgccttttccaagatgcacagtgagaatcaccaggagtgtggtggaacttat
gaccctaccaaggccacggatctttcggcctgcagatgtcagcccatgcatcaccctttg
cttgtggactggtttggtgatccactgggtggcaggattttttctgacagcattatggag
tggatgaatgaggataacttcaaacattttaggtactgccttggtggtgttgggttactg
aagctaaaggagatgtttaactccaagtttggatctattcccaagttttatgttcgagca
ccaggaagagtcaacataataggagagcatatagattattgtggatattctgttcttcct
atggctgtagaacaagatgtgctaatagctgtagaacctgtgaaaacgtacgctctccaa
ctggccaatacaaatcccttgtatccatcagcagtggcattagactgtcatggaagcatg
aaccctattgtggactgcatgtgggagggatctaggttgcatgctccttatgataatcca
atgcttgatgatctgagggacttcagtactagtgctaataacatccagattgataaaacc
aagcctttgtggcacaactatttcttatgtggacttaaaggaattcaggaacactttggt
cttagtaacctgactggaatgaactgcctggtagatggaaatatcccaccaagttctggc
ctctccagctccagtgctttggtctgttgtgctggcttggtgacgctcacagtgctggga
aggaatctatccaagcgtaatggaaaacctttggagggctgtgagctggggatggcaaga
tgtgatttacatggattaagttcttttacctgggagtattataccatacccctgacagtg
gcaagagaggaaacaataagaaagaaaagagaagagaaaaaagaaaagaaggaaggaaga
caaatggtcaatgcccttatggaacttaaatgcatatag

>gi568815583r:49027953_49255578|GENSCAN_predicted_peptide_6|109_aa
FCVVVVDDDDDDDAWCFKKGYTRQQFYHKSTPNKGDRVIYNLTRPPYCYVRFPLASTGPH
ILYLSQLASNLELFKRGTGRREQRKEEVTCGMLRKVKTPPNKEEEQAIT

>gi568815583r:49027953_49255578|GENSCAN_predicted_CDS_6|330_bp
ttttgtgtagttgttgttgatgatgatgatgatgatgatgcttggtgttttaagaaaggg
tacactcgccagcagttttaccacaagagtacaccaaacaaaggagacagagtcatttat
aacctgacgcgtccaccctactgctatgtccggtttccattggctagtacgggacctcac
attctgtatttgtcccaattggctagcaacttagaactttttaaaagaggcacgggcaga
cgagaacaaaggaaggaggaagtaacttgtggaatgctgagaaaggtaaaaacacctcca
aataaggaagaggaacaggctataacctaa