GENSCAN 1.0	Date run: 2-Nov-116	Time: 18:33:52

Sequence gi568815592f:17181582_17392116 : 210535 bp : 42.06% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.04 PlyA -    316    311    6                               1.05
 1.03 Term -   2233   2162   72  1  0   92   48    43 0.003  -2.37
 1.02 Intr -  19401  18853  549  0  0   34   55   357 0.571  19.14
 1.01 Init -  20174  20079   96  2  0   99   54    28 0.745   0.87
 1.00 Prom -  21112  21073   40                              -7.85

 2.04 PlyA -  21506  21501    6                               1.05
 2.03 Term -  24233  24151   83  0  2   53   48   121 0.718   1.48
 2.02 Intr -  24802  24686  117  2  0   88   76    66 0.676   4.92
 2.01 Init -  45711  45627   85  0  1   57   96    74 0.174   6.13
 2.00 Prom -  47375  47336   40                              -2.05

 3.00 Prom +  50069  50108   40                              -5.65
 3.01 Init +  54255  54335   81  2  0   73   72    71 0.810   5.02
 3.02 Intr +  55960  56058   99  0  0   98   61    63 0.350   3.99
 3.03 Term +  68018  68041   24  1  0  106   48    34 0.227  -1.55
 3.04 PlyA +  68066  68071    6                               1.05

 4.00 Prom +  79247  79286   40                              -5.65
 4.01 Init +  84092  84231  140  1  2   62   97   141 0.621  12.06
 4.02 Intr +  99910 100168  259  1  1    3   78   348 0.063  21.84
 4.03 Intr + 101224 101347  124  0  1   42   94   161 0.941  11.34
 4.04 Intr + 103076 103130   55  0  1   25  114    23 0.454  -4.28
 4.05 Term + 110175 110538  364  0  1  108   41   502 0.589  40.55
 4.06 PlyA + 111426 111431    6                               1.05

 5.00 Prom + 119816 119855   40                              -6.75
 5.01 Init + 125229 125238   10  2  1   91  106     5 0.901   3.22
 5.02 Intr + 127107 127193   87  1  0   88   42    85 0.392   2.92
 5.03 Term + 130058 130203  146  0  2   85   54   111 0.450   4.49
 5.04 PlyA + 131041 131046    6                               1.05

 6.02 PlyA - 134159 134154    6                               1.05
 6.01 Sngl - 144546 143782  765  0  0   83   47   349 0.889  24.14
 6.00 Prom - 148249 148210   40                              -4.75

 7.05 PlyA - 148367 148362    6                               1.05
 7.04 Term - 153367 153196  172  0  1   19   48   136 0.541  -0.98
 7.03 Intr - 161112 160902  211  1  1   76   95    81 0.047   4.65
 7.02 Intr - 167807 167708  100  2  1   62  103    61 0.179   3.76
 7.01 Init - 171654 171649    6  0  0   75   93     0 0.223   0.23
 7.00 Prom - 183626 183587   40                              -3.05

 8.00 Prom + 189184 189223   40                              -4.95
 8.01 Sngl + 202398 202703  306  2  0   71   42   186 0.095   5.95
 8.02 PlyA + 203437 203442    6                               1.05

 9.03 PlyA - 204128 204123    6                               1.05
 9.02 Term - 204330 204220  111  0  0   93   49   130 0.585   7.18
 9.01 Init - 210470 210309  162  2  0   62   80   124 0.505   8.78

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 100001 100168  168  1  0   85   78   249 0.868  23.18

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592f:17181582_17392116|GENSCAN_predicted_peptide_1|238_aa
MGELYLELVDSWFMLTSRMKLHTLVVSQFLNMRDRKVLQLRTRLRSPASFTSDWSSLQGV
WGTQPGQSGSPEEAPPGQPRGKEGKRERRRAAIVANDRAKKEGRFMHGTQPPIKPSRRQP
AAPSAGLAKPAEPAPTQNPCRPASGVRSPSSSPRLSLHTFPPVEGAGFGLGQPQRGALTV
QLRAEGLLERGQSRHRGRGGARGLLACCHLSGSPETSQIYTLYILEDVKNSSHDRELF
>gi568815592f:17181582_17392116|GENSCAN_predicted_CDS_1|717_bp
atgggtgaactgtatctggaattggtggattcttggtttatgttgacctcaagaatgaag
ctgcacaccctggtggtgtcacagttcttaaacatgcgagacagaaaagttctccaactg
cgaacccgtctcagaagcccagcgagcttcacctctgactggagctcgctgcagggagtt
tggggcacccagcctgggcagtccggcagcccagaggaagctcctcccggacaaccaaga
ggaaaagagggaaagcgagaaagacggagagccgccatcgtggccaacgaccgcgcgaag
aaggaagggcgattcatgcacgggacccagcctccaatcaagcccagcaggcgccagccg
gcggcgccgagtgcagggctcgcgaagcccgccgagcctgcgcccacccagaacccgtgc
cggcccgcgagcggtgtgcgcagccccagctccagcccgcgtctctctcttcacactttc
ccgccagtagagggagccggctttggcctcggccagccccagagaggggccctcacagtg
cagctgcgggctgaagggctccttgagcgtggccagagcagacaccgaggccgaggaggc
gcccgagggctgctagcatgttgtcacctctcaggatcacccgagacctctcagatctac
actctgtacattctggaagatgtgaagaatagttcacatgacagagaactattctaa
>gi568815592f:17181582_17392116|GENSCAN_predicted_peptide_2|94_aa
MLPLDTDGGSHRFSVKNAIGVKMQLEATGAGSSRRSQLRSSVGWGLGELLCLARGLSMHQ
SALCLAKEVCSFIPEVSKTTNPPEETNNSKPATF
>gi568815592f:17181582_17392116|GENSCAN_predicted_CDS_2|285_bp
atgctcccacttgacactgatggaggaagtcaccgattttcagttaaaaatgcgatagga
gtgaaaatgcagttagaagctacaggcgcaggatccagtaggcgaagccagctgcgctcc
tcagtcgggtggggacttggagaacttttatgtctagctagaggattgtcaatgcaccaa
tcagcactctgtctagctaaagaggtctgcagcttcattcctgaagtcagcaagaccacg
aacccaccggaagagacgaacaactccaagcctgccaccttttaa
>gi568815592f:17181582_17392116|GENSCAN_predicted_peptide_3|67_aa
MSFLRGKRRDGCRDSGERKGEGIQGGQVGFDTRNQAADTSSLLVRFKMPPRKEKEGAHFL
LGLQYDV
>gi568815592f:17181582_17392116|GENSCAN_predicted_CDS_3|204_bp
atgagctttctgagaggaaagaggagagatggctgcagagactctggagagagaaaaggt
gaaggaattcagggtggccaggtgggttttgacacaagaaatcaggctgccgacacttct
tcactgttagttagatttaaaatgcctccaagaaaagagaaagagggagcacacttcctt
ctagggcttcagtatgatgtttga
>gi568815592f:17181582_17392116|GENSCAN_predicted_peptide_4|313_aa
MRPPKIVSDKIMAKENIPFIQYIDDIMLIGPTEEEVETLVLHMDFRRVRETRSRRRRRSR
CPSRSRSRSPSRGAGAKMHTTQKDTTYTKIFVGGLPYHTTDASLRKYFEVFGEIEEAVVI
TDRQTGKSRGYGFVTMADRAAAERACKDPNPIIDGRKANVNLAYLGAKPRIMQPGFAFGV
QQLHPALIQRPFGIPAHYVYPQAFVQPGVVIPHVQPTAAAASTTPYIDYTGAAYAQYSAA
AAAAAAAAAYDQYPYAASPAAAGYVTAGGYGYAVQQPITAAAPGTAAAAAAAAAAAAAFG
QYQPQQLQTDRMQ
>gi568815592f:17181582_17392116|GENSCAN_predicted_CDS_4|942_bp
atgaggccccccaaaattgtttctgacaaaataatggccaaagagaacatcccatttatc
cagtatattgatgacatcatgctaattggaccaactgaggaagaagtggagaccttggta
ttacacatggacttcagaagggtgcgggagacgcggagccggaggaggaggcgcagccgc
tgcccgagccgcagccgcagccggagcccgagccgcggggcgggtgcgaagatgcacacg
acccagaaggacacgacgtacaccaagatcttcgtcggggggctgccctaccacaccacc
gacgccagcctgcgcaagtacttcgaggtcttcggcgagatcgaggaggcggtggtcatc
accgaccggcagacgggcaagtcccggggctatggatttgtcaccatggctgaccgggct
gctgccgaaagggcctgcaaggatcccaatcccatcattgatggcagaaaggccaacgtg
aacctggcatacttaggagcaaaaccaaggatcatgcaaccaggttttgcctttggtgtt
caacaacttcatccagcccttatacaaagacctttcgggatacctgcccactatgtctat
ccgcaggcttttgtgcagccgggagtggtcattccacacgtccagccgacagcagctgcc
gcctccaccaccccttacattgattacactggagctgcatacgcacaatactcagcagct
gctgctgctgccgccgccgctgctgcctatgaccagtacccctatgcagcctctccagct
gctgcaggatatgttactgctgggggctatggctacgcagtccagcagccaatcaccgca
gcggcacctgggacagctgccgccgccgctgcagcagctgctgccgctgcagcatttggc
cagtaccagcctcagcagctgcagacagaccgaatgcaatag
>gi568815592f:17181582_17392116|GENSCAN_predicted_peptide_5|80_aa
MLQNIEEMLVNTDMDASIKTVERLQEHASASETHLIPLLFTQPEKRHSAFEQQLERQPPS
HCLGRARTWAAIRRHLCPLK
>gi568815592f:17181582_17392116|GENSCAN_predicted_CDS_5|243_bp
atgctacaaaacatagaagagatgttagttaacactgatatggatgcaagcataaaaact
gtagaaaggctacaggaacatgcaagtgcatcagagactcacctgatacctctgctattc
actcagcctgagaaaaggcactctgcctttgagcagcagcttgaacgacagccaccctca
cattgccttgggcgagcacgcacgtgggctgctatacggagacatctgtgcccgctgaag
tga
>gi568815592f:17181582_17392116|GENSCAN_predicted_peptide_6|254_aa
MTFHPMLLPSVFPPGSNHPASDSDSSPSPRHTRSQTQHAQQPAPILPLQEVATAEGIICV
HVPFSLSDLSQIEKCLGFFSSDPNTYIKEFKYLTQSYELTWHDLYIILSSTLLSEEKERV
WLAAQAHADDLHRQDPTKSTGAAAAPQEEPPWKYQPTDPDRASCNHMIICLIIGLNKAAH
KAVNFEKLKEISQRANKNPAEFLSRLTEALQIYTHIDPTSQEGTIVLNTHFISQSAPDIQ
CKLKTAEDGPQTPQ
>gi568815592f:17181582_17392116|GENSCAN_predicted_CDS_6|765_bp
atgaccttccaccccatgctcctgccctctgtctttccccccggctccaaccacccagct
tctgactctgattcatccccatctccacgtcatacccgctctcaaactcagcatgcccaa
caaccagcccccatacttcccctccaagaggtggccacagccgaaggcatcatttgcgtc
cacgttcctttctccctctctgacctctcccaaattgaaaaatgtctcgggttcttttcc
tctgatcccaacacttacatcaaagaatttaaatatctcacccaatcttatgaacttact
tggcatgatctctacattatcctctcttctactctcctttcagaagaaaaggaaagagtg
tggcttgcagcacaggcgcatgcagatgatcttcatcggcaagaccctactaagtccaca
ggggctgctgcagctccccaggaggaacccccctggaagtaccaacccacagaccccgac
cgggcatcttgtaaccatatgattatttgcctcatcataggccttaacaaagcagcccat
aaagctgtaaattttgaaaagctcaaagaaatctcccaaagagccaacaaaaaccctgcc
gaatttctttcccgccttacagaggccctccaaatatatacccacattgaccccacctcc
caggaaggaactattgttcttaatacccatttcatctctcagtctgctcctgacatacaa
tgcaaacttaaaacagctgaagatggcccccaaaccccacaataa
>gi568815592f:17181582_17392116|GENSCAN_predicted_peptide_7|162_aa
MQCVSRTKSQLCTFILEKVIYGGAPETDKEDHRTSVTTPTLPLTIECQHLGPTILESNYY
HCIYVFQLSDYGSHCKVWSTEKSSHRALQGFRGSPYKSISQGIGESAYKATGINKCKKNA
GSTQGPKDPLKGFSEEKELTSTNPSEYKEKSGHKWEPAETMN
>gi568815592f:17181582_17392116|GENSCAN_predicted_CDS_7|489_bp
atgcagtgtgtgagcaggacaaagtctcagctgtgcacatttatccttgagaaagtgata
tatggtggagcaccagagacagacaaagaagaccacagaactagtgtaactacccctaca
ttgcctttgaccattgagtgccagcacttgggacctaccatcttggaatctaattattac
cattgcatttatgtttttcaattaagtgactacggttcccactgtaaggtatggtctaca
gagaagagcagtcacagagctcttcagggattcaggggctctccttacaaatctatttct
caaggcattggtgaaagtgcctataaggcaactgggataaacaagtgcaagaagaatgct
ggttcaacccaaggcccaaaggatcctcttaaaggattctcagaggagaaggagctcaca
agcacaaatccctcagaatataaggaaaaaagtggccataaatgggagccagcggaaaca
atgaattga
>gi568815592f:17181582_17392116|GENSCAN_predicted_peptide_8|101_aa
MPAAQPAMLAILAPWAVLVLSVLTGRIVHSAFPAACSWACLHVHIAVGPPWESSPLLSQS
LRLQFAKCPALMNPWKVEAWGKSQHQGGPQLPSSHHRSTHY
>gi568815592f:17181582_17392116|GENSCAN_predicted_CDS_8|306_bp
atgcccgcagcccagccagccatgctcgccatcctggcaccgtgggctgtcttggttctg
tctgtcctcactggccgcatagtgcattctgcctttcctgctgcctgctcctgggcctgc
ctgcatgtgcacattgctgtggggcccccatgggaatcctcacctcttttgtctcagtca
ctgcgcctccagtttgccaagtgtccagctcttatgaatccctggaaagtagaagcctgg
ggaaagagccaacaccaaggaggcccccagctcccttcttcacaccaccgttccactcac
tattaa
>gi568815592f:17181582_17392116|GENSCAN_predicted_peptide_9|90_aa
MKVESSCGNETLRVDCGGKTLKMMGRDKSGEVIKDGMYEGQSVSLAGLEGCGSQDYLYHH
VTKGENSSPAVSGFLSCRHENLITPGKFLN
>gi568815592f:17181582_17392116|GENSCAN_predicted_CDS_9|273_bp
atgaaagttgagtcttcatgcggaaatgagacattaagagtggattgtggaggaaaaacc
ctcaagatgatgggcagggacaaaagtggagaggtaataaaggatggcatgtatgaagga
cagtctgtgagtttagctggattggagggctgtggaagccaggattatctctatcatcat
gtgactaaaggagaaaatagctccccagctgtcagtggctttctcagctgccgacatgag
aacttgatcacaccggggaaattcctgaactaa