GENSCAN 1.0 Date run: 5-Nov-116 Time: 00:35:01

Sequence gi568815597r:153434699_153635339 : 200641 bp : 48.29% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.03 PlyA -   1608   1603    6                               1.05
 1.02 Term -   1850   1728  123  2  0   39   37   125 0.801   0.88
 1.01 Init -   3631   3491  141  1  0   84   56   103 0.627   6.83
 1.00 Prom -   7591   7552   40                              -2.46

 2.03 PlyA -  10010  10005    6                               1.05
 2.02 Term -  16784  16590  195  2  0   70   47   109 0.647   2.41
 2.01 Init -  20126  19257  870  2  0   57   13   188 0.307   2.75
 2.00 Prom -  20263  20224   40                              -5.46

 3.03 PlyA -  23069  23064    6                               1.05
 3.02 Term -  23272  23108  165  1  0  118   46   195 0.998  16.22
 3.01 Init -  24315  24175  141  0  0   86   56   184 0.866  15.13
 3.00 Prom -  25990  25951   40                              -7.96

 4.04 PlyA -  29120  29115    6                               1.05
 4.03 Term -  30237  30016  222  0  0   57   48   123 0.727   2.12
 4.02 Intr -  31372  31241  132  1  0   62   90   107 0.556   9.14
 4.01 Init -  39781  39716   66  1  0   62  100    16 0.044   1.27
 4.00 Prom -  48322  48283   40                              -0.36

 5.00 Prom +  53487  53526   40                              -3.86
 5.01 Init +  65185  65326  142  0  1   25  116    78 0.357   2.83
 5.02 Intr +  68621  68735  115  0  1   69  -29   123 0.351  -1.79
 5.03 Intr +  69369  69450   82  0  1   90   56    68 0.791   3.44
 5.04 Intr +  71854  71944   91  0  1  130   91    35 0.827   7.57
 5.05 Term +  72953  73071  119  0  2  118   42    26 0.636  -0.30
 5.06 PlyA +  74385  74390    6                               1.05

 6.04 PlyA -  74414  74409    6                               1.05
 6.03 Term -  88010  87922   89  0  2   59   43   100 0.349   0.42
 6.02 Intr -  91119  91015  105  1  0  105   55    75 0.540   6.09
 6.01 Init -  91831  91768   64  1  1   46   47    54 0.288  -1.59
 6.00 Prom -  94304  94265   40                              -5.76

 7.07 PlyA -  98041  98036    6                               1.05
 7.06 Term - 100132  99998  135  1  0  -13   48   246 0.983   8.52
 7.05 Intr - 100662 100504  159  0  0   40  109   248 0.012  22.28
 7.04 Intr - 102497 102458   40  1  1  129   53   -26 0.010  -3.77
 7.03 Intr - 102738 102602  137  0  2   27   -1   338 0.010  18.17
 7.02 Intr - 105507 105356  152  1  2   91   92    97 0.524  10.28
 7.01 Init - 106726 106687   40  1  1   57  103    -2 0.355  -2.45
 7.00 Prom - 108500 108461   40                              -7.76

 8.03 PlyA - 108944 108939    6                               1.05
 8.02 Term - 109225 109061  165  1  0  120   48   280 0.999  25.12
 8.01 Init - 110096 109956  141  2  0   93   84   236 0.660  23.83
 8.00 Prom - 111143 111104   40                              -4.66

 9.05 PlyA - 112650 112645    6                               1.05
 9.04 Term - 113148 112984  165  0  0   77   38   219 0.890  13.72
 9.03 Intr - 113792 113647  146  0  2   74  105   254 0.970  25.70
 9.02 Intr - 116726 116658   69  0  0  119   33    56 0.768   2.35
 9.01 Init - 118611 118548   64  0  1   68  105    66 0.923   5.63
 9.00 Prom - 126428 126389   40                              -5.96

10.05 PlyA - 126669 126664    6                               1.05
10.04 Term - 126893 126741  153  2  0  106   48   240 0.999  19.72
10.03 Intr - 129183 129036  148  2  1   49   84   195 0.501  15.44
10.02 Intr - 132112 132004  109  2  1   60  100    48 0.762   2.54
10.01 Init - 133205 133193   13  2  1   73  105     8 0.598   1.40
10.00 Prom - 150671 150632   40                              -4.06

11.09 PlyA - 150679 150674    6                               1.05
11.08 Term - 172994 172836  159  2  0  100   44   389 0.999  33.64
11.07 Intr - 173479 173301  179  2  2   95   99   286 0.953  30.04
11.06 Intr - 176018 175958   61  2  1  107   81    15 0.318   1.01
11.05 Intr - 180324 180202  123  0  0  101  -45   143 0.011   3.08
11.04 Intr - 180683 180537  147  2  0  110   72   219 0.992  22.93
11.03 Intr - 181609 181597   13  0  1   85  110    10 0.154  -2.32
11.02 Intr - 184340 184201  140  2  2  114   27   234 0.133  19.16
11.01 Init - 191774 191622  153  2  0   47  102   203 0.459  17.58
11.00 Prom - 192096 192057   40                             -12.01

12.00 Prom + 192825 192864   40                              -9.16
12.01 Init + 195824 195964  141  1  0   77   50   307 0.793  25.93
12.02 Term + 197000 197143  144  1  0  134   53   216 0.999  20.61
12.03 PlyA + 197316 197321    6                              -1.95

13.03 PlyA - 197574 197569    6                              -0.45
13.02 Term - 199055 199038   18  2  0   75   42    29 0.662  -4.68
13.01 Intr - 199412 199260  153  2  0  118   55   110 0.824  10.97

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init - 100641 100504  138  0  0   87  109   238 0.983  25.94
S.002 Term - 102738 102598  141  0  0   27   47   351 0.934  22.83
S.003 Term - 180324 180187  138  0  0  101   54   147 0.985  10.56
S.004 Term - 184340 184197  144  2  0  114   42   248 0.861  20.71

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597r:153434699_153635339|GENSCAN_predicted_peptide_1|87_aa
MNIPLGEKVMLDIVAMFRQYSGDDGRMDMPGLVNLMKENFPNFLSGCKCGLLNASIQLGT
GDAKARQSVLGQWGNPSKTHETGYKSS
>gi568815597r:153434699_153635339|GENSCAN_predicted_CDS_1|264_bp
atgaatatccctctaggtgagaaagtcatgttggacatagtcgcgatgtttcgccaatac
agtggagatgatggtaggatggacatgccaggtctggtgaacttgatgaaggagaacttc
cccaacttcctcagtggctgtaaatgtggtttgctgaatgccagcatccagctgggcact
ggggatgcaaaggcaagacagtcagttttgggccagtgggggaatccttccaaaacacat
gaaactggctacaagtccagctga
>gi568815597r:153434699_153635339|GENSCAN_predicted_peptide_2|354_aa
MCHSALEGGHFRARPKGTGISVILIRIQPLFLSQKNKVGGITLPDLKLYYKAAVTKTAWY
WYQNTDIDQWNRTEPSEIMPHIYNYLIFDKPDKNKKWGKDSLFNKWCWENWLAICRKLKL
DPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGNTIQDIGMGKDFMSKTAKAKATKAKID
KWDLIKLKSFCTTKETTIRLNRQPTEWEKIFAIYSSDKGLISRIYNELKQIYKKKTNNPI
KKWAKDMNRHFSKDDIYAANRHMKKCSSSLAIREMQIKTTMRYHLTPVRMTNYPNKTYKN
MNKNQKIICFEATESYQSDQDLKALVIQEKEDSINALNIKAASLPEKFISCQVV
>gi568815597r:153434699_153635339|GENSCAN_predicted_CDS_2|1065_bp
atgtgtcattctgcattggaaggaggacattttagagcaaggcctaagggcacaggtatt
agtgtcatattgatcagaattcaacctttgttcctaagccaaaagaacaaagttggaggc
atcacgctacctgacttaaaactatactacaaggctgcagtgaccaaaacagcatggtac
tggtaccaaaacacagatatagaccaatggaacagaacagagccctcagaaataatgccg
catatctacaactatctgatctttgacaaacctgacaaaaacaagaaatggggaaaggat
tcactatttaataagtggtgctgggaaaactggctagccatatgtagaaagctgaagctg
gatcccttccttacaccttatacaaaaattaattcaagatggattaaagacttaaatgtt
agacctaaaaccataaaaaccctagaagaaaacctaggcaataccattcaggacataggc
atgggcaaggacttcatgtctaaaacagcaaaagcaaaggcaacaaaagccaaaattgac
aaatgggatctaattaaactaaagagcttctgcacaacaaaagaaactaccatcagactg
aacaggcagcctacagaatgggagaaaatttttgcaatctactcatctgacaaagggcta
atatcaagaatctacaatgaactcaaacaaatttacaagaaaaaaacaaacaaccccatc
aaaaagtgggcaaaggatatgaacagacacttctcaaaagacgacatttatgcagccaac
agacacatgaaaaaatgctcatcatcactggccatcagagaaatgcaaatcaaaactaca
atgagataccatctcacaccagttagaatgactaattatcccaataaaacatataaaaat
atgaataaaaatcagaaaattatctgcttcgaagcaacagaaagctaccaaagtgaccag
gacttgaaggctctggttatccaagagaaagaagactcgatcaatgcactgaacattaaa
gctgcttctcttcctgagaaatttatcagttgtcaagtagtttga
>gi568815597r:153434699_153635339|GENSCAN_predicted_peptide_3|101_aa
MSNTQAERSIIGMIDMFHKYTRRDDKIEKPSLLTMMKENFPNFLSACDKKGTNYLADVFE
KKDKNEDKKIDFSEFLSLLGDIATDYHKQSHGAAPCSGGSQ
>gi568815597r:153434699_153635339|GENSCAN_predicted_CDS_3|306_bp
atgagcaacactcaagctgagaggtccataataggcatgatcgacatgtttcacaaatac
accagacgtgatgacaagattgagaagccaagcctgctgacgatgatgaaggagaacttc
cccaacttccttagtgcctgtgacaaaaagggcacaaattacctcgccgatgtctttgag
aaaaaggacaagaatgaggataagaagattgatttttctgagtttctgtccttgctggga
gacatagccacagactaccacaagcagagccatggagcagcgccctgttccgggggcagc
cagtga
>gi568815597r:153434699_153635339|GENSCAN_predicted_peptide_4|139_aa
MIDLFHNNIQEDDTLDKSGLSKELLVKISFLKSCKTAARSGTHNLNKGENRQGRPSHVEP
PRNSGENSLSVSLSRSPNCCTSPEMQLANTTQKDPKQNSWLSRNIPVAPSALSLSPWLLE
VLYGLLKGRPYKASPGSPT
>gi568815597r:153434699_153635339|GENSCAN_predicted_CDS_4|420_bp
atgattgatctatttcacaataacatccaagaagatgatacactagataagtcaggtctg
tcaaaggagcttttggtgaagatctcttttttaaaaagctgcaagactgctgcccgaagt
gggacacacaacctaaataagggcgagaaccggcaaggacggcccagccacgtggaaccg
cctcgcaactctggcgagaactctctctctgtctctctgtcccgctcccccaactgctgc
acgagtccagagatgcagctggccaacaccacccagaaggaccccaaacagaacagctgg
ctgtctcggaacattcccgtggctccctctgccctgtccctctcaccgtggctcctggaa
gtgctctatggcctattgaaaggacgcccctacaaagcttcccctggttcccccacctga
>gi568815597r:153434699_153635339|GENSCAN_predicted_peptide_5|182_aa
MAWALRNVAGPCQEGFPSTSASGGNQPSSNSQTGVSIRLVKPPGKRKGADPVGQQFSKCG
PQNGSNNGTWDLTRNTESQATTDLLVTLQGVPTSKKTLTRCSPLTLDFSVSKTDYKGLSK
VGMVRVPGIKGQGYRISRACAVTAPFSTQKPSWKQPSPPGPVILSPVVHNSFYKPSCNGM
LK
>gi568815597r:153434699_153635339|GENSCAN_predicted_CDS_5|549_bp
atggcatgggcactgaggaatgtggctggcccctgccaggaagggttccccagcacctct
gcttccgggggtaaccagcccagctccaactcccagacaggtgtatctatccgactggtg
aagccaccagggaagagaaaaggagcagaccctgtaggacagcagttctcaaaatgtggt
ccccagaacggcagcaacaacggcacctgggatcttactagaaatacagagtctcaggcc
accacagacctgcttgtaactctgcagggagtccccaccagcaagaagaccctcaccaga
tgcagccccttgaccctggacttctcagtctccaaaactgactacaaggggctgagcaag
gtaggcatggtcagggtgccaggcatcaagggacagggctatcggatcagccgggcctgt
gctgtgacagccccattttccacccagaagccctcctggaagcaaccctcacccccgggc
cccgttattctaagtccagttgttcacaattcattctacaagccctcatgtaatggaatg
ctaaaatag
>gi568815597r:153434699_153635339|GENSCAN_predicted_peptide_6|85_aa
MALFCVRMVTSAQQYRQCQGCRLHTSDSSILRPLVHNDLGRLKEVWNCCLLENNGRGVKL
QTFAVSVTAHKDSVDPKSEQQQDLL
>gi568815597r:153434699_153635339|GENSCAN_predicted_CDS_6|258_bp
atggctcttttctgtgtacggatggttacatcggcacagcagtaccgacaatgccaaggt
tgcagactccacacctctgacagctccatccttcgaccccttgtgcacaatgacctgggc
aggttgaaggaggtttggaactgctgcttgctggaaaacaatggcagaggagtgaagctg
cagaccttcgcagtgagtgttacagctcataaagacagtgtggacccaaagagtgagcag
cagcaagatttattgtaa
>gi568815597r:153434699_153635339|GENSCAN_predicted_peptide_7|220_aa
MPAAWILWAHSHSELHTVMETPLEKALTTMVTTFHKYSGREGSKLTLSRKELKELIKKEL
CLGEMKESSIDDLMKSLDKNSDQEIDFKEYSVFLTMLCMAYNDFFLEDNKWTLPWEEIKF
PSQLQAQPSAMACPLDQAIGLLVAIFHKYSGREGDKHTLSKKELKELIQKELTIGSKLQD
AEIARLMEDLDRNKDQEVNFQEYVTFLGALALIYNEALKG
>gi568815597r:153434699_153635339|GENSCAN_predicted_CDS_7|663_bp
atgcctgctgcttggattctctgggctcactcccacagtgagctgcacactgtgatggag
actcctctggagaaggccctgaccactatggtgaccacgtttcacaaatattcggggaga
gagggtagcaaactgaccctgagtaggaaggaactcaaggagctgatcaagaaagagctg
tgtcttggggagatgaaggagagcagcatcgatgacttgatgaagagcctggacaagaac
agcgaccaggagatcgacttcaaggagtactcggtgttcctgaccatgctgtgcatggcc
tacaacgacttctttctagaggacaacaaatggacccttccatgggaggaaataaagttt
ccatcgcagctccaagcccagccctcagccatggcatgccccctggatcaggccattggc
ctcctcgtggccatcttccacaagtactccggcagggagggtgacaagcacaccctgagc
aagaaggagctgaaggagctgatccagaaggagctcaccattggctcgaagctgcaggat
gctgaaattgcaaggctgatggaagacttggaccggaacaaggaccaggaggtgaacttc
caggagtatgtcaccttcctgggggccttggctttgatctacaatgaagccctcaagggc
tga
>gi568815597r:153434699_153635339|GENSCAN_predicted_peptide_8|101_aa
MACPLEKALDVMVSTFHKYSGKEGDKFKLNKSELKELLTRELPSFLGKRTDEAAFQKLMS
NLDSNRDNEVDFQEYCVFLSCIAMMCNEFFEGFPDKQPRKK
>gi568815597r:153434699_153635339|GENSCAN_predicted_CDS_8|306_bp
atggcgtgccctctggagaaggccctggatgtgatggtgtccaccttccacaagtactcg
ggcaaagagggtgacaagttcaagctcaacaagtcagaactaaaggagctgctgacccgg
gagctgcccagcttcttggggaaaaggacagatgaagctgctttccagaagctgatgagc
aacttggacagcaacagggacaacgaggtggacttccaagagtactgtgtcttcctgtcc
tgcatcgccatgatgtgtaacgaattctttgaaggcttcccagataagcagcccaggaag
aaatga
>gi568815597r:153434699_153635339|GENSCAN_predicted_peptide_9|147_aa
MVHFSVVSSTSAVGAGEAALEGLSLQGEDGTGKAWVLGAEGEVEVRMARPLEQAVAAIVC
TFQEYAGRCGDKYKLCQAELKELLQKELATWTPTEFRECDYNKFMSVLDTNKDCEVDFVE
YVRSLACLCLYCHEYFKDCPSEPPCSQ
>gi568815597r:153434699_153635339|GENSCAN_predicted_CDS_9|444_bp
atggtgcatttctcagtggtcagctccacatctgcagtgggggcgggggaggcggctctg
gaagggctgagtctccaaggagaggacggcacaggcaaggcttgggtgctgggggccgag
ggcgaggtcgaagtgaggatggccaggcctctggagcaggcggtagctgccatcgtgtgc
accttccaggaatacgcagggcgctgtggggacaaatacaagctctgccaggcggagctc
aaggagctgctgcagaaggagctggccacctggaccccgactgagtttcgggaatgtgac
tacaacaaattcatgagtgttctggacaccaacaaggactgcgaggtggactttgtggag
tatgtgcgctcacttgcctgcctctgtctctactgccacgagtacttcaaggactgcccc
tcagagcccccctgctcccagtag
>gi568815597r:153434699_153635339|GENSCAN_predicted_peptide_10|140_aa
MSKTGWAGASTYCSGTAAALALGEEGVPLSHCLGIWAWNLGSMMCSSLEQALAVLVTTFH
KYSCQEGDKFKLSKGEMKELLHKELPSFVGEKVDEEGLKKLMGSLDENSDQQVDFQEYAV
FLALITVMCNDFFQGCPDRP
>gi568815597r:153434699_153635339|GENSCAN_predicted_CDS_10|423_bp
atgagtaagaccggctgggctggggcttccacctactgctctgggactgctgctgccctg
gccctgggggaggagggtgtgccgctgagtcactgcctgggcatctgggcctggaacctc
ggatccatgatgtgcagttctctggagcaggcgctggctgtgctggtcactaccttccac
aagtactcctgccaagagggcgacaagttcaagctgagtaagggggaaatgaaggaactt
ctgcacaaggagctgcccagctttgtgggggagaaagtggatgaggaggggctgaagaag
ctgatgggcagcctggatgagaacagtgaccagcaggtggacttccaggagtatgctgtt
ttcctggcactcatcactgtcatgtgcaatgacttcttccagggctgcccagaccgaccc
tga
>gi568815597r:153434699_153635339|GENSCAN_predicted_peptide_11|324_aa
MAAEPLTELEESIETVVTTFFTFARQEGRKDSLSVNEFKELVTQQLPHLLKDVGSLDEKM
KSLDVNQDSELKFNEYWRLIGELAKEIRKKKDLKIRKNGCQQDAQEFSDVERAIETLIKN
FHQYSVEGGKETLTPSELRDLVTQQLPHLMPSNCGLEEKIANLGSCNDSKLEFRSFWELI
GEAAKSVKLERPKKLQKSTCQMVISSTVHFTEGPASRRSREMSDCYTELEKAVIVLVENF
YKYVSKYSLVKNKISKSSFREMLQKELNHMLSDTGNRKAADKLIQNLDANHDGRISFDEY
WTLIGGITGPIAKLIHEQEQQSSS
>gi568815597r:153434699_153635339|GENSCAN_predicted_CDS_11|975_bp
atggcagcagaaccactgacagagctagaggagtccattgagaccgtggtcaccaccttc
ttcacctttgcaaggcaggagggccggaaggatagcctcagcgtcaacgagttcaaagag
ctggttacccagcagttgccccatctgctcaaggatgtgggctctcttgatgagaagatg
aagagcttggatgtgaatcaggactcggagctcaagttcaatgagtactggagattgatt
ggggagctggccaaggaaatcaggaagaagaaagacctgaagatcaggaagaacggctgc
caacaggatgctcaggaattcagtgatgtggagagggccattgagaccctcatcaagaac
tttcaccagtactccgtggagggtgggaaggagacgctgaccccttctgagctacgggac
ctggtcacccagcagctgccccatctcatgccgagcaactgtggcctggaagagaaaatt
gccaacctgggcagctgcaatgactctaaactggagttcaggagtttctgggagctgatt
ggagaagcggccaagagtgtgaagctggagaggcctaaaaagctccagaaaagcacttgt
cagatggttatctccagcactgtccattttaccgaaggccccgccagcaggcgaagcagg
gagatgtcagactgctacacggagctggagaaggcagtcattgtcctggtggaaaacttc
tacaaatatgtgtctaagtacagcctggtcaagaacaagatcagcaagagcagcttccgc
gagatgctccagaaagagctgaaccacatgctgtcggacacagggaaccggaaggctgcg
gataagctcatccagaacctggatgccaatcatgatgggcgcatcagcttcgatgagtac
tggaccttgataggcggcatcaccggccccatcgccaaactcatccatgagcaggagcag
cagagcagcagctag
>gi568815597r:153434699_153635339|GENSCAN_predicted_peptide_12|94_aa
MGSELETAMETLINVFHAHSGKEGDKYKLSKKELKELLQTELSGFLDAQKDVDAVDKVMK
ELDENGDGEVDFQEYVVLVAALTVACNNFFWENS
>gi568815597r:153434699_153635339|GENSCAN_predicted_CDS_12|285_bp
atgggctctgagctggagacggcgatggagaccctcatcaacgtgttccacgcccactcg
ggcaaagagggggacaagtacaagctgagcaagaaggagctgaaagagctgctgcagacg
gagctctctggcttcctggatgcccagaaggatgtggatgctgtggacaaggtgatgaag
gagctagacgagaatggagacggggaggtggacttccaggagtatgtggtgcttgtggct
gctctcacagtggcctgtaacaatttcttctgggagaacagttga
>gi568815597r:153434699_153635339|GENSCAN_predicted_peptide_13|56_aa
RPPAPARCPKQNGRHVQCLSDRLKWRLRNYVRRQIRARLEGGKDLSYACAEPRYTV
>gi568815597r:153434699_153635339|GENSCAN_predicted_CDS_13|171_bp
cggccgccagctcctgctcggtgtccaaaacaaaatggccgccacgtccagtgcttgtct
gaccggctaaaatggcgtctacgcaattacgtcaggcgtcagatccgcgcacgactagag
ggcgggaaagatttgagctacgcctgcgcagagccaaggtacaccgtctaa