GENSCAN 1.0 Date run: 7-Nov-116 Time: 17:14:00 Sequence gi568815597r:32836809_33064685 : 227877 bp : 47.82% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 16271 16358 88 0 1 122 101 36 0.634 7.33 1.02 Term + 22452 22557 106 0 1 100 40 51 0.209 -0.62 1.03 PlyA + 26285 26290 6 1.05 2.07 PlyA - 26749 26744 6 1.05 2.06 Term - 27028 27002 27 1 0 118 47 40 0.793 0.97 2.05 Intr - 27989 27856 134 0 2 145 100 204 0.999 27.66 2.04 Intr - 31034 30945 90 0 0 68 109 115 0.944 11.57 2.03 Intr - 31580 31415 166 2 1 48 49 263 0.966 17.93 2.02 Intr - 32189 32074 116 0 2 85 95 127 0.984 13.27 2.01 Init - 34082 33845 238 2 1 105 90 222 0.999 20.27 2.00 Prom - 37329 37290 40 -9.16 3.00 Prom + 45442 45481 40 -4.96 3.01 Init + 52091 52468 378 1 0 105 80 880 0.754 85.80 3.02 Intr + 56716 56821 106 0 1 130 89 168 0.999 20.89 3.03 Term + 56957 57054 98 0 2 122 54 184 0.697 16.53 3.04 PlyA + 57265 57270 6 -5.99 4.07 PlyA - 57808 57803 6 -1.95 4.06 Term - 58170 57997 174 0 0 -6 54 195 0.681 4.46 4.05 Intr - 58631 58497 135 2 0 100 115 120 0.970 16.66 4.04 Intr - 58935 58747 189 0 0 55 73 151 0.977 10.08 4.03 Intr - 59161 59102 60 1 0 58 92 69 0.852 3.23 4.02 Intr - 61511 61318 194 0 2 66 44 361 0.990 28.71 4.01 Init - 64430 64415 16 2 1 83 97 34 0.630 3.54 4.00 Prom - 77850 77811 40 -2.96 5.12 PlyA - 78740 78735 6 1.05 5.11 Term - 84523 84368 156 1 0 21 43 187 0.857 5.43 5.10 Intr - 89458 89316 143 2 2 35 115 17 0.066 -0.83 5.09 Intr - 100451 100090 362 1 2 70 50 339 0.376 23.16 5.08 Intr - 101720 101589 132 1 0 103 91 112 0.999 12.76 5.07 Intr - 105651 105444 208 1 1 58 97 124 0.967 8.44 5.06 Intr - 107351 107211 141 0 0 146 98 133 0.999 20.42 5.05 Intr - 108820 108706 115 1 1 91 99 84 0.999 9.82 5.04 Intr - 109756 109594 163 0 1 119 116 142 0.998 20.08 5.03 Intr - 111558 111414 145 1 1 48 110 -13 0.427 -3.76 5.02 Intr - 112966 112761 206 0 2 68 75 156 0.738 11.14 5.01 Init - 127877 127243 635 2 2 95 88 930 0.134 88.52 5.00 Prom - 140049 140010 40 -6.96 6.00 Prom + 141721 141760 40 -7.96 6.01 Sngl + 143137 143664 528 0 0 78 43 670 0.779 57.56 6.02 PlyA + 143694 143699 6 1.05 7.05 PlyA - 147060 147055 6 1.05 7.04 Term - 150588 150206 383 1 2 40 43 190 0.761 4.70 7.03 Intr - 155515 155422 94 1 1 102 75 -11 0.484 -1.46 7.02 Intr - 161021 160903 119 0 2 33 72 94 0.452 2.48 7.01 Init - 161805 161703 103 0 1 66 80 76 0.793 5.00 7.00 Prom - 164624 164585 40 -3.56 8.07 PlyA - 166452 166447 6 1.05 8.06 Term - 176594 176373 222 2 0 112 35 269 0.952 20.92 8.05 Intr - 177786 177714 73 2 1 124 52 104 0.999 9.81 8.04 Intr - 184653 184559 95 0 2 30 63 123 0.998 2.76 8.03 Intr - 184895 184785 111 2 0 90 64 107 0.870 9.08 8.02 Intr - 187759 187634 126 1 0 82 82 58 0.612 5.48 8.01 Init - 200020 199928 93 1 0 105 99 105 0.893 13.68 8.00 Prom - 203161 203122 40 -3.46 9.02 PlyA - 206452 206447 6 1.05 9.01 Term - 227808 227674 135 0 0 81 49 104 0.839 3.82 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 127877 127239 639 2 0 95 53 942 0.866 87.59 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:32836809_33064685|GENSCAN_predicted_peptide_1|64_aa XISGELCALMDQVHHMQHSKWQHPSDLTTRASIDCGRSHAIQEHSLFEGDVRNQNQWYRI DGCS >gi568815597r:32836809_33064685|GENSCAN_predicted_CDS_1|195_bp ngtatctcgggggagctttgtgccttgatggatcaagttcatcatatgcagcactcaaaa tggcagcatccttcggacctcaccacgcgagctagcattgattgtggaagaagccatgct atccaagagcattctctctttgaaggggatgtaagaaaccagaatcagtggtacagaatt gatggttgttcataa >gi568815597r:32836809_33064685|GENSCAN_predicted_peptide_2|256_aa MQAARGGAGRPERPGRPGRGPERERERPPGAGAASPCAAPGLPAGGATIHPGSPSAWPPR ARAALRLWLGCVCFALVQADSPSAPVNVTVRHLKANSAVVSWDVLEDEVVIGFAISQQKK DVRMLRFIQEVNTTTRSCALWDLEEDTEYIVHVQAISIQGQSPASEPVLFKTPHEVTMKE MGRNQQLRTGEVLIIVVVLFMWAGVIALFCRQYDIIKDNEPNNNKEKTKSASETSTPEHQ GGGLLRSKPCKKAIHN >gi568815597r:32836809_33064685|GENSCAN_predicted_CDS_2|771_bp atgcaggcggctcggggcggcgcagggcggccggagcggccgggccggccgggccggggg ccggagcgcgagcgcgagcggccgccgggcgccggagccgcgtccccctgcgccgccccg ggcctgccggccggaggagccaccatacaccccgggtcgccgagcgcctggccgccccgc gcccgcgccgcgctccgcctgtggctgggctgcgtctgcttcgcgctggtgcaggcggac agtccctcagccccagtgaacgtcaccgtcaggcacctcaaggccaactctgcagtggtg agctgggatgttctggaggatgaggttgtcatcggatttgccatctcccagcagaagaag gatgtgcggatgctgcgcttcatccaggaggtgaacaccaccacccgctcatgtgccctc tgggacctggaggaggatacggagtacatagtccacgtgcaggccatctccattcagggc cagagcccagccagcgagcctgtgctcttcaagaccccgcatgaggtaaccatgaaagag atggggaggaaccaacagctgcggacaggcgaggtgctgatcatcgtcgtggtcctgttc atgtgggcaggtgtcattgccctcttctgccgccagtatgacatcatcaaggacaatgaa cccaataacaacaaggaaaaaaccaagagtgcatcagaaaccagcacaccagagcaccag ggcggggggcttctccgcagcaagccttgcaagaaggctattcacaattag >gi568815597r:32836809_33064685|GENSCAN_predicted_peptide_3|193_aa MGKQNSKLRPEMLQDLRENTEFSELELQEWYKGFLKDCPTGILNVDEFKKIYANFFPYGD ASKFAEHVFRTFDTNSDGTIDFREFIIALSVTSRGRLEQKLMWAFSMYDLDGNGYISREE MLEIVQAIYKMVSSVMKMPEDESTPEKRTEKIFRQMDTNNDGKLSLEEFIRGAKSDPSIV RLLQCDPSSASQF >gi568815597r:32836809_33064685|GENSCAN_predicted_CDS_3|582_bp atgggcaagcagaacagcaagctgcggcccgagatgttgcaggacctgcgagagaacaca gagttctcagagctggagctgcaggagtggtacaagggcttcctcaaggactgccccaca ggaatcctcaatgtggatgagttcaagaagatctacgccaacttctttccctatggtgac gcctccaagtttgccgagcacgtcttccgcacctttgacaccaacagcgatggcaccata gactttcgggagttcatcattgcgctgagcgtgacctcgcgcggccgcctggagcagaag ctcatgtgggccttcagcatgtatgacctggacggcaacggctacatcagccgggaggag atgctggagatcgtgcaggccatttacaagatggtttcgtccgtgatgaagatgccggag gacgagtcgaccccggaaaagaggactgagaaaatcttccgccaaatggacacaaacaac gacggcaagctgtccttggaggagttcatccgcggggccaaaagcgacccgtccatcgtg cgtctgctgcagtgcgaccccagcagcgcctcccagttctga >gi568815597r:32836809_33064685|GENSCAN_predicted_peptide_4|255_aa MCLRLGGLSVGDFRKVLMKTGLVLVVLGHVSFITAALFHGTVLRYVGTPQDAVALQYCVV NILSVTSAIVVITSGIAAIVLSRYLPSTPLRWTVFSSSVACALLSLTCALGLLASIAMTF ATQGKALLAACTFGSSELLALAPDCPFDPTRIYSSSLCLWGIALVLCVAENVFAVRCAQL THQLLELRPWWGKSSHHMSLPALTSFHFPDVLEEETQAENQWVWQPPVSLQMRENPELVE GRDLLSCTSSEPLTL >gi568815597r:32836809_33064685|GENSCAN_predicted_CDS_4|768_bp atgtgtctgcgcctcggaggcctgagtgtgggcgacttccggaaggtgctgatgaagaca ggcctggtgctggtggtgctgggccatgtgagcttcatcacagctgccctgttccatggc acagtgctgcgctacgtgggcacccctcaagatgcggtggctctgcagtactgcgtggtc aacatcctctctgtcacttccgccatcgtggtcatcacttcaggcatcgcagccatcgtg ttgtcacgctacctccctagcacccccctgcgctggacagtgtttagctcgagcgtggcc tgtgctctcctttctctgacctgtgccctcggcctcttggcctccatcgccatgaccttt gccacccagggcaaggcactgctggctgcctgcacttttgggagctctgaactactggcc ctcgcacctgactgtcccttcgaccccacacgcatttatagctccagcctgtgcctctgg ggcatcgccctagtgctctgcgtggcggagaacgtgtttgctgtacgctgtgctcagctc acccaccagctgctggagctgaggccctggtgggggaaaagcagccaccacatgagcctt ccagccctcacctccttccatttcccggatgtcctggaggaggagacccaggctgagaac cagtgggtctggcagcctcctgtgtctctgcagatgcgggagaacccagagctggtggag ggccgtgacctgctgagctgcaccagctctgagcctctgaccctctga >gi568815597r:32836809_33064685|GENSCAN_predicted_peptide_5|801_aa MGSEKDSESPRSTSLHAAAPDPKCRSGGRRRRLTLHSVFSASARGRRARAKPQAEPPPPA AQPPPAPAPAAAQGPPPEALPAEPAAEAEAEAAAAAAEPGFDDEEAAEGGGPGAEEVECP LCLVRLPPERAPRLLSCPHRSCRDCLRHYLRLEISESRVPISCPECSERLNPHDIRLLLA DPPLMHKYEEFMLRRYLASDPDCRWCPAPDCGYAVIAYGCASCPKLTCEREGCQTEFCYH CKQIWHPNQTCDMARQQRAQTLRVRTKHTSGLSYGQESGPADDIKPCPRCSAYIIKMNDG SCNHMTCAVCGCEFCWLCMKEISDLHYLSPSGCTFWGKKPWSRKKKILWQLGTLIGAPVG ISLIAGIAIPAMVIGIPVYVGRKIHSRYEGRKTSKHKRNLAITGGVTLSVIASPVIAAVS VGIGVPIMLAYVYGVVPISLCRGGGCGVSTANGKGVKIEFDEDDGPITVADAWRALKNPS IGESSIEGLTSVLSTSGSPTDGLSVMQGPYSETASFAALSGGTLSGGILSSGKGKYSRLE VQADVQKEIFPKDTASLGAISDNASTRAMAGSIISSYNPQDRECNNMEIQVDIEAKPSHY QLVSGSSTEDSLHVHAQMAENEEEGSGGGGSEEDPPCRHQSCEQKDCLASKPWDISLAQP ESIRSDLESSDAQSDDVPDITSDECGSPRSHTAACPSTPRAQDGGLTGEAPPEDVPGPHK TLSSGGERIDHRCVPTKNSTPKSSVSSPGKKARIESTFEIDQSNAFIRQRRKRRHKEEDI KTTRPSAYSSLSRLLRPVDLS >gi568815597r:32836809_33064685|GENSCAN_predicted_CDS_5|2406_bp atgggctccgagaaggactccgagtcgccgcgctccacatcgctacatgcggccgcaccc gaccctaagtgccgcagcggcggccggcgccggcgcctcaccttgcacagcgtcttctct gcctcggcccgcggccgccgcgcccgggccaagccgcaggccgagccgccgcccccggct gcgcagccgccgcccgccccggcccctgccgcggcccagggcccgccgcccgaggcgctg cccgccgagccggccgccgaggccgaggcggaggccgcggcggcggcggcggagcctggg ttcgacgatgaggaggcggcggagggcggtggcccgggcgcggaggaggtggagtgtccg ctgtgcctggtgcggctgccgcctgagcgggccccgcgcctcctcagctgtccgcaccgc tcgtgccgggactgcctccgccactacctgcgcctggagataagcgagagcagggtgccc atcagctgccccgagtgcagcgagcgactcaacccgcacgacatccgcttgctgctcgcc gacccgccgcttatgcacaagtacgaggagttcatgctgcgccgctacctagcctcggac cccgactgccgctggtgcccggccccggactgcggttatgctgttattgcctatggctgt gccagctgcccgaagctaacttgtgagagggaaggttgccagactgagttctgctaccac tgcaagcagatatggcatccaaatcagacatgcgatatggcccgtcaacagagggcccag actttacgagttcggaccaaacacacttcaggtctcagttatgggcaagaatctggacca gcagatgacatcaagccatgcccacgatgcagtgcatacattatcaagatgaatgatgga agctgtaatcacatgacctgtgcagtgtgtggctgtgaattctgttggctttgtatgaaa gagatctcagacttgcattacctcagcccctctggctgtacattctggggcaagaagcca tggagccgtaagaagaaaattctttggcagctgggcacgttgattggtgctccagtgggg atttctctcattgctggcattgccattcctgccatggtcattggcattcctgtttatgtt ggaaggaagattcacagcaggtatgagggaaggaaaacctccaaacacaagaggaatttg gctatcactggaggagtgactttgtcggtcattgcatccccagttattgctgcagttagt gttggtattggtgtccccattatgctggcatatgtttatggggttgtgcccatttctctt tgtcgtggaggcggctgtggagttagcacagccaacggaaaaggagtgaaaattgaattt gatgaagatgatggtccaatcacagtggcagatgcctggagagccctcaagaatcccagc attggggaaagcagcattgaaggcctgactagtgtattgagcactagtggaagccctaca gatggacttagtgttatgcaaggtccttacagcgaaacggccagctttgcagccctctca gggggcacgctgagtggcggcattctctccagtggcaagggaaaatatagcaggttagaa gttcaagccgatgtccaaaaggaaattttccccaaagacacagccagtcttggtgcaatt agtgacaacgcaagcactcgtgctatggccggttccataatcagttcctacaacccacag gacagagaatgcaacaatatggaaatccaagtggacattgaagccaaaccaagccactat cagctggtgagtggaagcagcacggaggactcgctccatgttcatgctcagatggcagag aatgaagaagaaggtagtggtggcggaggcagtgaagaggatcccccctgcagacaccaa agctgtgaacagaaagactgcctggccagcaaaccttgggacatcagcctggcccagcct gaaagcatccgcagtgacctagagagttctgatgcacagtcagacgatgtgccagacatc acctcagatgagtgtggctccccccgctcccatactgcagcctgcccctcgacccccaga gcccaagacggcggacttacaggggaggcgccacctgaagacgttccagggccccataag accctatcttccggaggggaacggatcgaccaccggtgtgtgcccacaaaaaattcaact cctaagtcctcagtttctagtcccgggaagaaagccagaattgaaagcacctttgagatc gaccagtccaacgccttcattcgacaaagaaggaaacgaaggcacaaggaggaagacata aaaaccacgaggccgagtgcctacagctccctcagtcgcctcctgaggccagtggacctg agctga >gi568815597r:32836809_33064685|GENSCAN_predicted_peptide_6|175_aa MKALGTQQEYKVVCHCLPTPKCHTLPLYHMQIFAPNHVVAKFHFWYFLSQLKKMKKSSGE TVNCGQVFEKYPLWVKNFGIWLRYDSRSSTHNMYREYRDLTTMGAVTQCYQDMGTQYRAR ANFIQIMKVEEIAASKCWWPVVKQFHDSKIKFLLPHLVLCHQQKPRFTRRPNTFF >gi568815597r:32836809_33064685|GENSCAN_predicted_CDS_6|528_bp atgaaggccttgggcacacaacaagagtacaaggtggtgtgtcactgcctgcccaccccc aaatgccacacactgcccctctaccacatgcaaatctttgcgcctaatcatgtagtcgcc aagttccacttctggtacttcctatctcaattaaagaagatgaagaagtcttcaggggag actgtcaactgtgggcaggtgtttgagaagtacccactgtgggtgaagaactttggcatc tggctgcgctatgactcccggagcagcacccacaacatgtacagggaataccgggaccta accaccatgggcgctgtcacccagtgctaccaagacatgggcacccagtaccgcgcccgg gccaacttcatccagatcatgaaggtggaggagattgcggccagcaagtgctggtggcca gttgtcaagcaattccacgactccaagatcaagttcctgctgccccacttggtgctgtgc catcagcagaagccacgcttcaccaggagacccaacaccttcttctag >gi568815597r:32836809_33064685|GENSCAN_predicted_peptide_7|232_aa MPKELQGSMERATDCLQCPGWLPYRKDVELLHLPDREGIPELQFEHEVLENQLPEAAQEE CYQVRDDLFQVIWPGLLSGLKRGIEYQVLVFVLLSENFLKIAWERDGRGYLLGALNTQTN VATVVPNGNTHLEPGMLVSAGLLLHGRGLQNLILERCPQEKVNDLRLLDGQGEGLDFLQG LDLHVLDQVAQLGDRHPFLIFILASASSMAQALTPTTIQAPMPLPKPPWKPL >gi568815597r:32836809_33064685|GENSCAN_predicted_CDS_7|699_bp atgcccaaggagctgcagggctccatggagagagcaactgactgcctgcagtgtcctgga tggcttccctataggaaggacgtggagctactacatcttccagacagggaaggcatccct gagttgcagtttgaacatgaggttctagagaaccagcttcctgaagcagcacaggaagag tgttaccaagtacgagatgacttgtttcaggtcatctggccagggctgcttagtggactc aaaagaggtatcgagtatcaggttttggtttttgttctgctatcagaaaattttttgaag attgcctgggagagagatggccgtggctacctcctcggagcacttaacacccagaccaac gtggccactgtagtccccaatggcaacacacaccttgaacctggtatgctggtcagcgca ggtctgcttttgcacgggcgtggtcttcaaaacctcatccttgagagatgcccccaggaa aaagtcaatgatctcagactcctcgatggtcaaggagaagggctagatttcctccaggga cttgatcttcatgtccttgaccaggtagcccaacttggtgacaggcatccattcctcatc ttcatccttgcctctgcaagctccatggcccaggccctgaccccaaccactatccaggcc ccaatgccactgccaaagcctccgtggaaaccactatga >gi568815597r:32836809_33064685|GENSCAN_predicted_peptide_8|239_aa MAPSVPAAEPEYPKGIRAVLLGPPGAGKGTQAPRLAENFCVCHLATGDMLRAMVASGSEL GKKLKATMDAGKLVSDEMVVELIEKNLETPLCKNGFLLDGFPRTVRQAEMLDDLMEKRKE KLDSVIEFSIPDSLLIRRITGRLIHPKSGRSYHEEFNPPKEPMKDDITGEPLIRRSDDNE KALKIRLQAYHTQTTPLIEYYRKRGIHSAIDASQTPDVVFASILAAFSKATCKDLVMFI >gi568815597r:32836809_33064685|GENSCAN_predicted_CDS_8|720_bp atggctcccagcgtgccagcggcagaacccgagtatcctaaaggcatccgggccgtgctg ctggggcctcccggggccggtaaagggacccaggcacccagattggctgaaaacttctgt gtctgccatttagctactggggacatgctgagggccatggtggcttctggctcagagcta ggaaaaaagctgaaggcaactatggatgctgggaaactggtgagtgatgaaatggtagtg gagctcattgagaagaatttggagacccccttgtgcaaaaatggttttcttctggatggc ttccctcggactgtgaggcaggcagaaatgctcgatgacctcatggagaagaggaaagag aagcttgattctgtgattgaattcagcatcccagactctctgctgatccgaagaatcaca ggaaggctgattcaccccaagagtggccgttcctaccacgaggagttcaaccctccaaaa gagcccatgaaagatgacatcaccggggaacccttgatccgtcgatcagatgataatgaa aaggccttgaaaatccgcctgcaagcctaccacactcaaaccaccccactcatagagtac tacaggaaacgggggatccactccgccatcgatgcatcccagacccccgatgtcgtgttc gcaagcatcctagcagccttctccaaagccacatgtaaagacttggttatgtttatctaa >gi568815597r:32836809_33064685|GENSCAN_predicted_peptide_9|44_aa VLPSTQAYHKADCRLTRESLIDLSSMTTLERTFTLEKSPTLGAR >gi568815597r:32836809_33064685|GENSCAN_predicted_CDS_9|135_bp gttcttccgagcacacaagcttaccacaaggctgactgtagacttactcgggaatctctc attgacttgtcctcaatgaccacgctcgagcgtaccttcaccctagagaaaagccccacg ttgggcgccagatga