GENSCAN 1.0 Date run: 4-Nov-116 Time: 16:13:21 Sequence gi568815597r:153443762_153644794 : 201033 bp : 48.05% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 947 942 6 1.05 1.02 Term - 7721 7527 195 2 0 70 47 109 0.651 2.41 1.01 Init - 11063 10194 870 2 0 57 13 188 0.308 2.75 1.00 Prom - 11200 11161 40 -5.46 2.03 PlyA - 14006 14001 6 1.05 2.02 Term - 14209 14045 165 1 0 118 46 195 0.998 16.22 2.01 Init - 15252 15112 141 0 0 86 56 184 0.866 15.13 2.00 Prom - 16927 16888 40 -7.96 3.04 PlyA - 20057 20052 6 1.05 3.03 Term - 21174 20953 222 0 0 57 48 123 0.727 2.12 3.02 Intr - 22309 22178 132 1 0 62 90 107 0.556 9.14 3.01 Init - 30718 30653 66 1 0 62 100 16 0.044 1.27 3.00 Prom - 39259 39220 40 -0.36 4.00 Prom + 44424 44463 40 -3.86 4.01 Init + 56122 56263 142 0 1 25 116 78 0.357 2.83 4.02 Intr + 59558 59672 115 0 1 69 -29 123 0.351 -1.79 4.03 Intr + 60306 60387 82 0 1 90 56 68 0.791 3.44 4.04 Intr + 62791 62881 91 0 1 130 91 35 0.827 7.57 4.05 Term + 63890 64008 119 0 2 118 42 26 0.636 -0.30 4.06 PlyA + 65322 65327 6 1.05 5.04 PlyA - 65351 65346 6 1.05 5.03 Term - 78947 78859 89 0 2 59 43 100 0.349 0.42 5.02 Intr - 82056 81952 105 1 0 105 55 75 0.540 6.09 5.01 Init - 82768 82705 64 1 1 46 47 54 0.288 -1.59 5.00 Prom - 85241 85202 40 -5.76 6.07 PlyA - 88978 88973 6 1.05 6.06 Term - 91069 90935 135 1 0 -13 48 246 0.983 8.52 6.05 Intr - 91599 91441 159 0 0 40 109 248 0.012 22.28 6.04 Intr - 93434 93395 40 1 1 129 53 -26 0.010 -3.77 6.03 Intr - 93675 93539 137 0 2 27 -1 338 0.010 18.17 6.02 Intr - 96444 96293 152 1 2 91 92 97 0.524 10.28 6.01 Init - 97663 97624 40 1 1 57 103 -2 0.355 -2.45 6.00 Prom - 99437 99398 40 -7.76 7.03 PlyA - 99881 99876 6 1.05 7.02 Term - 100162 99998 165 1 0 120 48 280 0.999 25.12 7.01 Init - 101033 100893 141 2 0 93 84 236 0.660 23.83 7.00 Prom - 102080 102041 40 -4.66 8.05 PlyA - 103587 103582 6 1.05 8.04 Term - 104085 103921 165 0 0 77 38 219 0.890 13.72 8.03 Intr - 104729 104584 146 0 2 74 105 254 0.970 25.70 8.02 Intr - 107663 107595 69 0 0 119 33 56 0.768 2.35 8.01 Init - 109548 109485 64 0 1 68 105 66 0.923 5.63 8.00 Prom - 117365 117326 40 -5.96 9.05 PlyA - 117606 117601 6 1.05 9.04 Term - 117830 117678 153 2 0 106 48 240 0.999 19.72 9.03 Intr - 120120 119973 148 2 1 49 84 195 0.501 15.44 9.02 Intr - 123049 122941 109 2 1 60 100 48 0.762 2.54 9.01 Init - 124142 124130 13 2 1 73 105 8 0.598 1.40 9.00 Prom - 141608 141569 40 -4.06 10.09 PlyA - 141616 141611 6 1.05 10.08 Term - 163931 163773 159 2 0 100 44 389 0.999 33.64 10.07 Intr - 164416 164238 179 2 2 95 99 286 0.953 30.04 10.06 Intr - 166955 166895 61 2 1 107 81 15 0.318 1.01 10.05 Intr - 171261 171139 123 0 0 101 -45 143 0.011 3.08 10.04 Intr - 171620 171474 147 2 0 110 72 219 0.992 22.93 10.03 Intr - 172546 172534 13 0 1 85 110 10 0.154 -2.32 10.02 Intr - 175277 175138 140 2 2 114 27 234 0.133 19.16 10.01 Init - 182711 182559 153 2 0 47 102 203 0.459 17.58 10.00 Prom - 183033 182994 40 -12.01 11.00 Prom + 183762 183801 40 -9.16 11.01 Init + 186761 186901 141 1 0 77 50 307 0.793 25.93 11.02 Intr + 187937 188051 115 1 1 134 -70 222 0.044 11.02 11.03 Intr + 190284 190381 98 1 2 25 88 99 0.076 3.33 11.04 Intr + 190507 190582 76 0 1 73 109 4 0.259 0.09 11.05 Intr + 192811 192892 82 2 1 80 77 118 0.290 8.60 11.06 Intr + 194534 194687 154 2 1 16 94 122 0.158 5.67 11.07 Intr + 198482 198503 22 1 1 78 86 11 0.017 -2.98 11.08 Intr + 199466 199603 138 0 0 54 88 68 0.017 3.84 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 91578 91441 138 0 0 87 109 238 0.983 25.94 S.002 Term - 93675 93535 141 0 0 27 47 351 0.934 22.83 S.003 Term - 171261 171124 138 0 0 101 54 147 0.985 10.56 S.004 Term - 175277 175134 144 2 0 114 42 248 0.861 20.71 S.005 Term + 187937 188080 144 1 0 134 53 216 0.892 20.61 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:153443762_153644794|GENSCAN_predicted_peptide_1|354_aa MCHSALEGGHFRARPKGTGISVILIRIQPLFLSQKNKVGGITLPDLKLYYKAAVTKTAWY WYQNTDIDQWNRTEPSEIMPHIYNYLIFDKPDKNKKWGKDSLFNKWCWENWLAICRKLKL DPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGNTIQDIGMGKDFMSKTAKAKATKAKID KWDLIKLKSFCTTKETTIRLNRQPTEWEKIFAIYSSDKGLISRIYNELKQIYKKKTNNPI KKWAKDMNRHFSKDDIYAANRHMKKCSSSLAIREMQIKTTMRYHLTPVRMTNYPNKTYKN MNKNQKIICFEATESYQSDQDLKALVIQEKEDSINALNIKAASLPEKFISCQVV >gi568815597r:153443762_153644794|GENSCAN_predicted_CDS_1|1065_bp atgtgtcattctgcattggaaggaggacattttagagcaaggcctaagggcacaggtatt agtgtcatattgatcagaattcaacctttgttcctaagccaaaagaacaaagttggaggc atcacgctacctgacttaaaactatactacaaggctgcagtgaccaaaacagcatggtac tggtaccaaaacacagatatagaccaatggaacagaacagagccctcagaaataatgccg catatctacaactatctgatctttgacaaacctgacaaaaacaagaaatggggaaaggat tcactatttaataagtggtgctgggaaaactggctagccatatgtagaaagctgaagctg gatcccttccttacaccttatacaaaaattaattcaagatggattaaagacttaaatgtt agacctaaaaccataaaaaccctagaagaaaacctaggcaataccattcaggacataggc atgggcaaggacttcatgtctaaaacagcaaaagcaaaggcaacaaaagccaaaattgac aaatgggatctaattaaactaaagagcttctgcacaacaaaagaaactaccatcagactg aacaggcagcctacagaatgggagaaaatttttgcaatctactcatctgacaaagggcta atatcaagaatctacaatgaactcaaacaaatttacaagaaaaaaacaaacaaccccatc aaaaagtgggcaaaggatatgaacagacacttctcaaaagacgacatttatgcagccaac agacacatgaaaaaatgctcatcatcactggccatcagagaaatgcaaatcaaaactaca atgagataccatctcacaccagttagaatgactaattatcccaataaaacatataaaaat atgaataaaaatcagaaaattatctgcttcgaagcaacagaaagctaccaaagtgaccag gacttgaaggctctggttatccaagagaaagaagactcgatcaatgcactgaacattaaa gctgcttctcttcctgagaaatttatcagttgtcaagtagtttga >gi568815597r:153443762_153644794|GENSCAN_predicted_peptide_2|101_aa MSNTQAERSIIGMIDMFHKYTRRDDKIEKPSLLTMMKENFPNFLSACDKKGTNYLADVFE KKDKNEDKKIDFSEFLSLLGDIATDYHKQSHGAAPCSGGSQ >gi568815597r:153443762_153644794|GENSCAN_predicted_CDS_2|306_bp atgagcaacactcaagctgagaggtccataataggcatgatcgacatgtttcacaaatac accagacgtgatgacaagattgagaagccaagcctgctgacgatgatgaaggagaacttc cccaacttccttagtgcctgtgacaaaaagggcacaaattacctcgccgatgtctttgag aaaaaggacaagaatgaggataagaagattgatttttctgagtttctgtccttgctggga gacatagccacagactaccacaagcagagccatggagcagcgccctgttccgggggcagc cagtga >gi568815597r:153443762_153644794|GENSCAN_predicted_peptide_3|139_aa MIDLFHNNIQEDDTLDKSGLSKELLVKISFLKSCKTAARSGTHNLNKGENRQGRPSHVEP PRNSGENSLSVSLSRSPNCCTSPEMQLANTTQKDPKQNSWLSRNIPVAPSALSLSPWLLE VLYGLLKGRPYKASPGSPT >gi568815597r:153443762_153644794|GENSCAN_predicted_CDS_3|420_bp atgattgatctatttcacaataacatccaagaagatgatacactagataagtcaggtctg tcaaaggagcttttggtgaagatctcttttttaaaaagctgcaagactgctgcccgaagt gggacacacaacctaaataagggcgagaaccggcaaggacggcccagccacgtggaaccg cctcgcaactctggcgagaactctctctctgtctctctgtcccgctcccccaactgctgc acgagtccagagatgcagctggccaacaccacccagaaggaccccaaacagaacagctgg ctgtctcggaacattcccgtggctccctctgccctgtccctctcaccgtggctcctggaa gtgctctatggcctattgaaaggacgcccctacaaagcttcccctggttcccccacctga >gi568815597r:153443762_153644794|GENSCAN_predicted_peptide_4|182_aa MAWALRNVAGPCQEGFPSTSASGGNQPSSNSQTGVSIRLVKPPGKRKGADPVGQQFSKCG PQNGSNNGTWDLTRNTESQATTDLLVTLQGVPTSKKTLTRCSPLTLDFSVSKTDYKGLSK VGMVRVPGIKGQGYRISRACAVTAPFSTQKPSWKQPSPPGPVILSPVVHNSFYKPSCNGM LK >gi568815597r:153443762_153644794|GENSCAN_predicted_CDS_4|549_bp atggcatgggcactgaggaatgtggctggcccctgccaggaagggttccccagcacctct gcttccgggggtaaccagcccagctccaactcccagacaggtgtatctatccgactggtg aagccaccagggaagagaaaaggagcagaccctgtaggacagcagttctcaaaatgtggt ccccagaacggcagcaacaacggcacctgggatcttactagaaatacagagtctcaggcc accacagacctgcttgtaactctgcagggagtccccaccagcaagaagaccctcaccaga tgcagccccttgaccctggacttctcagtctccaaaactgactacaaggggctgagcaag gtaggcatggtcagggtgccaggcatcaagggacagggctatcggatcagccgggcctgt gctgtgacagccccattttccacccagaagccctcctggaagcaaccctcacccccgggc cccgttattctaagtccagttgttcacaattcattctacaagccctcatgtaatggaatg ctaaaatag >gi568815597r:153443762_153644794|GENSCAN_predicted_peptide_5|85_aa MALFCVRMVTSAQQYRQCQGCRLHTSDSSILRPLVHNDLGRLKEVWNCCLLENNGRGVKL QTFAVSVTAHKDSVDPKSEQQQDLL >gi568815597r:153443762_153644794|GENSCAN_predicted_CDS_5|258_bp atggctcttttctgtgtacggatggttacatcggcacagcagtaccgacaatgccaaggt tgcagactccacacctctgacagctccatccttcgaccccttgtgcacaatgacctgggc aggttgaaggaggtttggaactgctgcttgctggaaaacaatggcagaggagtgaagctg cagaccttcgcagtgagtgttacagctcataaagacagtgtggacccaaagagtgagcag cagcaagatttattgtaa >gi568815597r:153443762_153644794|GENSCAN_predicted_peptide_6|220_aa MPAAWILWAHSHSELHTVMETPLEKALTTMVTTFHKYSGREGSKLTLSRKELKELIKKEL CLGEMKESSIDDLMKSLDKNSDQEIDFKEYSVFLTMLCMAYNDFFLEDNKWTLPWEEIKF PSQLQAQPSAMACPLDQAIGLLVAIFHKYSGREGDKHTLSKKELKELIQKELTIGSKLQD AEIARLMEDLDRNKDQEVNFQEYVTFLGALALIYNEALKG >gi568815597r:153443762_153644794|GENSCAN_predicted_CDS_6|663_bp atgcctgctgcttggattctctgggctcactcccacagtgagctgcacactgtgatggag actcctctggagaaggccctgaccactatggtgaccacgtttcacaaatattcggggaga gagggtagcaaactgaccctgagtaggaaggaactcaaggagctgatcaagaaagagctg tgtcttggggagatgaaggagagcagcatcgatgacttgatgaagagcctggacaagaac agcgaccaggagatcgacttcaaggagtactcggtgttcctgaccatgctgtgcatggcc tacaacgacttctttctagaggacaacaaatggacccttccatgggaggaaataaagttt ccatcgcagctccaagcccagccctcagccatggcatgccccctggatcaggccattggc ctcctcgtggccatcttccacaagtactccggcagggagggtgacaagcacaccctgagc aagaaggagctgaaggagctgatccagaaggagctcaccattggctcgaagctgcaggat gctgaaattgcaaggctgatggaagacttggaccggaacaaggaccaggaggtgaacttc caggagtatgtcaccttcctgggggccttggctttgatctacaatgaagccctcaagggc tga >gi568815597r:153443762_153644794|GENSCAN_predicted_peptide_7|101_aa MACPLEKALDVMVSTFHKYSGKEGDKFKLNKSELKELLTRELPSFLGKRTDEAAFQKLMS NLDSNRDNEVDFQEYCVFLSCIAMMCNEFFEGFPDKQPRKK >gi568815597r:153443762_153644794|GENSCAN_predicted_CDS_7|306_bp atggcgtgccctctggagaaggccctggatgtgatggtgtccaccttccacaagtactcg ggcaaagagggtgacaagttcaagctcaacaagtcagaactaaaggagctgctgacccgg gagctgcccagcttcttggggaaaaggacagatgaagctgctttccagaagctgatgagc aacttggacagcaacagggacaacgaggtggacttccaagagtactgtgtcttcctgtcc tgcatcgccatgatgtgtaacgaattctttgaaggcttcccagataagcagcccaggaag aaatga >gi568815597r:153443762_153644794|GENSCAN_predicted_peptide_8|147_aa MVHFSVVSSTSAVGAGEAALEGLSLQGEDGTGKAWVLGAEGEVEVRMARPLEQAVAAIVC TFQEYAGRCGDKYKLCQAELKELLQKELATWTPTEFRECDYNKFMSVLDTNKDCEVDFVE YVRSLACLCLYCHEYFKDCPSEPPCSQ >gi568815597r:153443762_153644794|GENSCAN_predicted_CDS_8|444_bp atggtgcatttctcagtggtcagctccacatctgcagtgggggcgggggaggcggctctg gaagggctgagtctccaaggagaggacggcacaggcaaggcttgggtgctgggggccgag ggcgaggtcgaagtgaggatggccaggcctctggagcaggcggtagctgccatcgtgtgc accttccaggaatacgcagggcgctgtggggacaaatacaagctctgccaggcggagctc aaggagctgctgcagaaggagctggccacctggaccccgactgagtttcgggaatgtgac tacaacaaattcatgagtgttctggacaccaacaaggactgcgaggtggactttgtggag tatgtgcgctcacttgcctgcctctgtctctactgccacgagtacttcaaggactgcccc tcagagcccccctgctcccagtag >gi568815597r:153443762_153644794|GENSCAN_predicted_peptide_9|140_aa MSKTGWAGASTYCSGTAAALALGEEGVPLSHCLGIWAWNLGSMMCSSLEQALAVLVTTFH KYSCQEGDKFKLSKGEMKELLHKELPSFVGEKVDEEGLKKLMGSLDENSDQQVDFQEYAV FLALITVMCNDFFQGCPDRP >gi568815597r:153443762_153644794|GENSCAN_predicted_CDS_9|423_bp atgagtaagaccggctgggctggggcttccacctactgctctgggactgctgctgccctg gccctgggggaggagggtgtgccgctgagtcactgcctgggcatctgggcctggaacctc ggatccatgatgtgcagttctctggagcaggcgctggctgtgctggtcactaccttccac aagtactcctgccaagagggcgacaagttcaagctgagtaagggggaaatgaaggaactt ctgcacaaggagctgcccagctttgtgggggagaaagtggatgaggaggggctgaagaag ctgatgggcagcctggatgagaacagtgaccagcaggtggacttccaggagtatgctgtt ttcctggcactcatcactgtcatgtgcaatgacttcttccagggctgcccagaccgaccc tga >gi568815597r:153443762_153644794|GENSCAN_predicted_peptide_10|324_aa MAAEPLTELEESIETVVTTFFTFARQEGRKDSLSVNEFKELVTQQLPHLLKDVGSLDEKM KSLDVNQDSELKFNEYWRLIGELAKEIRKKKDLKIRKNGCQQDAQEFSDVERAIETLIKN FHQYSVEGGKETLTPSELRDLVTQQLPHLMPSNCGLEEKIANLGSCNDSKLEFRSFWELI GEAAKSVKLERPKKLQKSTCQMVISSTVHFTEGPASRRSREMSDCYTELEKAVIVLVENF YKYVSKYSLVKNKISKSSFREMLQKELNHMLSDTGNRKAADKLIQNLDANHDGRISFDEY WTLIGGITGPIAKLIHEQEQQSSS >gi568815597r:153443762_153644794|GENSCAN_predicted_CDS_10|975_bp atggcagcagaaccactgacagagctagaggagtccattgagaccgtggtcaccaccttc ttcacctttgcaaggcaggagggccggaaggatagcctcagcgtcaacgagttcaaagag ctggttacccagcagttgccccatctgctcaaggatgtgggctctcttgatgagaagatg aagagcttggatgtgaatcaggactcggagctcaagttcaatgagtactggagattgatt ggggagctggccaaggaaatcaggaagaagaaagacctgaagatcaggaagaacggctgc caacaggatgctcaggaattcagtgatgtggagagggccattgagaccctcatcaagaac tttcaccagtactccgtggagggtgggaaggagacgctgaccccttctgagctacgggac ctggtcacccagcagctgccccatctcatgccgagcaactgtggcctggaagagaaaatt gccaacctgggcagctgcaatgactctaaactggagttcaggagtttctgggagctgatt ggagaagcggccaagagtgtgaagctggagaggcctaaaaagctccagaaaagcacttgt cagatggttatctccagcactgtccattttaccgaaggccccgccagcaggcgaagcagg gagatgtcagactgctacacggagctggagaaggcagtcattgtcctggtggaaaacttc tacaaatatgtgtctaagtacagcctggtcaagaacaagatcagcaagagcagcttccgc gagatgctccagaaagagctgaaccacatgctgtcggacacagggaaccggaaggctgcg gataagctcatccagaacctggatgccaatcatgatgggcgcatcagcttcgatgagtac tggaccttgataggcggcatcaccggccccatcgccaaactcatccatgagcaggagcag cagagcagcagctag >gi568815597r:153443762_153644794|GENSCAN_predicted_peptide_11|276_aa MGSELETAMETLINVFHAHSGKEGDKYKLSKKELKELLQTELSGFLDAQKDVDAVDKVMK ELDENGDGEVDFQEYVVLVAALTVASGQTSTGRGGHFVLDTEQELAAAADERQERAGRVR PVPGPPAGAVVAAPGGALASVSFDSRDSKMAAQSAPKVVLKSTTKMSLNERFTNMLKNKQ PTPVNIRASMQQQQQLASARNRRLAQQMENRPSVQAALKLKQKSLKQRLGQNLLRGGRAV APRMGLRRGGVRGRGGPGRGGLGRGAMGRGGIGGRX >gi568815597r:153443762_153644794|GENSCAN_predicted_CDS_11|828_bp atgggctctgagctggagacggcgatggagaccctcatcaacgtgttccacgcccactcg ggcaaagagggggacaagtacaagctgagcaagaaggagctgaaagagctgctgcagacg gagctctctggcttcctggatgcccagaaggatgtggatgctgtggacaaggtgatgaag gagctagacgagaatggagacggggaggtggacttccaggagtatgtggtgcttgtggct gctctcacagtggcctccggtcagacaagcactggacgtggcggccattttgttttggac accgagcaggagctggcggccgctgcagacgaaaggcaggaaagggcaggccgggttcgg ccggtccccggcccgcctgccggagccgtggtggcagccccgggaggagcactggcgtct gtttccttcgattctcgggattcgaagatggctgcacagtcagcgccgaaagttgtgcta aaaagcaccaccaagatgtctctaaatgagcgctttactaatatgctgaagaacaaacag ccgacgccagtgaatattcgggcttcgatgcagcaacaacagcagctagccagtgccaga aacagaagactggcccagcagatggagaatagaccctctgtccaggcagcattaaaactt aagcagaagagcttaaagcagcgcctgggtcaaaacctgctccgaggtggacgagccgta gctccccgaatgggcttaagaagaggtggtgttcgaggtcgtggaggtcctgggagaggg ggcctagggcgtggagctatgggtcgtggcggaatcggtggtagagnn