GENSCAN 1.0 Date run: 5-Nov-116 Time: 21:12:07 Sequence gi568815595r:72278388_72546623 : 268236 bp : 44.14% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.13 PlyA - 669 664 6 1.05 1.12 Term - 1164 1051 114 0 0 87 37 53 0.005 -1.33 1.11 Intr - 9306 9169 138 0 0 61 50 72 0.014 1.36 1.10 Intr - 10976 10846 131 0 2 56 59 86 0.019 2.91 1.09 Intr - 17906 17880 27 0 0 93 72 45 0.047 1.49 1.08 Intr - 26368 26288 81 2 0 129 77 7 0.117 3.31 1.07 Intr - 32922 32644 279 1 0 39 59 129 0.009 2.55 1.06 Intr - 41058 40909 150 1 0 47 80 75 0.007 2.73 1.05 Intr - 41174 41120 55 2 1 75 94 32 0.005 1.05 1.04 Intr - 44461 44313 149 2 2 106 19 75 0.007 2.35 1.03 Intr - 50581 50444 138 2 0 77 21 86 0.026 1.24 1.02 Intr - 53052 52868 185 2 2 81 79 25 0.043 0.33 1.01 Init - 59647 59514 134 1 2 95 91 83 0.466 8.92 1.00 Prom - 61821 61782 40 -3.16 2.00 Prom + 75049 75088 40 -5.26 2.01 Init + 84439 84492 54 0 0 50 73 69 0.400 2.89 2.02 Intr + 89042 89125 84 1 0 77 75 34 0.101 1.02 2.03 Term + 92648 92791 144 1 0 67 38 128 0.526 3.61 2.04 PlyA + 93620 93625 6 1.05 3.05 PlyA - 95820 95815 6 1.05 3.04 Term - 100250 99998 253 1 1 103 44 314 0.998 23.61 3.03 Intr - 100751 100653 99 1 0 70 75 62 0.833 2.33 3.02 Intr - 101040 100864 177 2 0 34 108 198 0.947 15.43 3.01 Init - 112746 112721 26 0 2 77 81 42 0.215 1.40 3.00 Prom - 117865 117826 40 -2.86 4.08 PlyA - 119185 119180 6 1.05 4.07 Term - 165698 165540 159 2 0 24 49 179 0.842 5.54 4.06 Intr - 168593 168386 208 1 1 87 59 72 0.487 3.28 4.05 Intr - 169306 169134 173 1 2 83 77 56 0.674 2.74 4.04 Intr - 184453 184418 36 1 0 77 93 39 0.075 1.76 4.03 Intr - 188752 188584 169 0 1 71 99 49 0.044 4.25 4.02 Intr - 193868 193810 59 2 2 81 80 25 0.050 -1.32 4.01 Init - 203534 203475 60 2 0 74 95 59 0.442 6.46 4.00 Prom - 204974 204935 40 -5.16 5.00 Prom + 206429 206468 40 -5.26 5.01 Init + 209291 209411 121 1 1 99 75 117 0.827 11.85 5.02 Intr + 217060 217226 167 2 2 78 105 4 0.095 0.78 5.03 Intr + 225519 225663 145 2 1 41 87 83 0.356 3.36 5.04 Intr + 237064 237174 111 2 0 70 68 49 0.204 1.45 5.05 Intr + 247132 247350 219 2 0 36 31 149 0.022 2.27 5.06 Term + 254288 254292 5 0 2 132 43 0 0.028 -3.03 5.07 PlyA + 256107 256112 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 172174 172100 75 1 0 61 97 34 0.882 2.69 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:72278388_72546623|GENSCAN_predicted_peptide_1|526_aa MGLTAMERGQKTLGTVFQSTTWMPHEGFLRAGTPSALSNTPAVPRPGSESLEEEAGVEAK VAFLCGIKGQAACLAWGKLTFPLDLKLGFGTNLGCATSGCDLGKSPGLGLGSRICISSKS LCDADAVLQEQPLRTAILDAVLQEQPLKTAILGARPGVSSLVRPPESDCESWKQHRPAVL SSCTATFSICAAQYSNHVAAEHVGKLRYREVKTFAQDDTATTTSLKEKQRRWLLSHIQGP VLRKFAEVLPLKEKIRGHDCTSLFVPVRAPASTHGSCKGGPFCGQLVGPPSSEGNLQPCP ECLSGTFDFSLMDVPVSASQAICKLWRWGGGLDTCWGSSDRKDKRTGSLSTSTSLNLPES TCGAQDPTYQRRTPGSHSTTSASLLTNPIFAIMAHGSVTEGFHNSAFVGSLECLPCQGGR TFLRRLPRWALRKSAGGSPSLQEMWNPQMQRADRALLWQPEQMKAVLMIIEAASACTSSP GHPGVTCSLKLSVSKGEIEVMKVLSVTRGEEKLRAGESGMLESWGI >gi568815595r:72278388_72546623|GENSCAN_predicted_CDS_1|1581_bp atggggctgacggccatggagagagggcagaagacccttgggacagttttccagtccacc acctggatgccccatgagggttttctcagagcagggacccccagtgccctttccaacacc cctgctgtccccaggccgggctctgagtctttggaggaagaggctggggtggaagccaaa gtcgcctttctctgtgggataaagggacaagctgcctgcctagcatgggggaagctgaca ttcccactagacttgaagttaggctttggtactaatcttggctgtgccacttctggctgt gaccttggcaagtctccaggtctggggctgggctcaagaatctgcatttccagcaagtcc ttgtgtgatgctgatgctgtgctccaggagcaacctttgagaactgccatcctagatgct gtgctccaggagcaacctttgaaaactgccatcctaggtgcaaggcctggggtgagctcc ttggtccgtccacctgagagcgactgtgagagctggaagcagcaccgcccagcagtgctt tcctcatgcacagccaccttctccatctgtgccgcccagtacagtaaccacgtggctgct gagcacgtagggaaattgaggtacagagaagttaagacatttgcccaagatgacacagct accaccacctccctcaaggagaagcagaggagatggctgctgagtcatatccagggccca gtactcaggaaatttgctgaagtcttaccgttgaaagagaagataagaggacatgactgc acctcactctttgtgcctgttagagccccagcatcaacacatgggagttgcaaaggtggt cctttctgtggacagctggtgggacctcctagctcagaggggaacctccagccatgtccc gagtgcttgtctggcaccttcgacttctccctcatggatgttcctgtttcagcatcgcag gccatctgcaaactgtggagatggggtgggggcctggacacctgctggggtagcagtgat agaaaggacaagaggacagggtccctcagcacttctacctccttgaacctgccagagtcc acctgcggagctcaggatccaacatatcagaggaggactccaggaagtcattctaccacc tctgcctccctgctgacgaatccgatatttgcgataatggcccatggcagcgtcacagag ggttttcataacagcgcctttgtgggctcccttgaatgtctgccatgccaaggaggccgc acattcctccggcggctgcctcgctgggctttgaggaaatctgctggcggcagccccagt ttgcaggagatgtggaacccacagatgcagagggctgaccgtgctttgctatggcagcct gagcaaatgaaagcagtgctaatgatcatagaggccgccagtgcctgcacttcttctcca ggccaccctggagtcacctgcagcttgaagctctctgtgagtaagggggagatagaagtg atgaaagtcctttctgttactagaggagaggagaagctccgtgcaggggaatccgggatg ctggaaagctggggaatttaa >gi568815595r:72278388_72546623|GENSCAN_predicted_peptide_2|93_aa MGGDWIIGVDIPLAVLVIGLLLLEMQTPSRMAGPGMGAPVPQEDSEKRGEPSWGVGIPAS AQQPTKVVPILPASTGHTRHHDTINGINSAGGR >gi568815595r:72278388_72546623|GENSCAN_predicted_CDS_2|282_bp atgggaggtgactggatcatcggggtggacatcccccttgctgttcttgtgataggcctg ctgctgcttgagatgcaaactcccagcaggatggctggccctggaatgggagcacctgtt ccccaagaggacagtgagaagaggggcgagcccagctggggtgttggcattcctgcctct gctcagcagcccaccaaagtggtccccatcttgccggccagcacaggccacacgcgacac cacgacacgattaatggcattaacagcgccggaggccgctga >gi568815595r:72278388_72546623|GENSCAN_predicted_peptide_3|184_aa MEEDNENERKPRINSQLVAQQVAQQYATPPPPKKEKKEKVEKQDKEKPEKDKEISPSVTK KNTNKKTKPKSDILKDPPSEANSIQSANATTKTSETNHTSRPRLKNVDRSTAQQLAVTVG NVTVIITDFKEKTRSSSTSSSTVTSSAGSEQQNQSSSGSESTDKGSSRSSTPKGDMSAVN DESF >gi568815595r:72278388_72546623|GENSCAN_predicted_CDS_3|555_bp atggaggaagacaatgaaaatgaaagaaaacctcggatcaattctcagctggtggcacaa caagtggcacaacagtatgccaccccaccaccccctaaaaaggagaagaaggagaaagtt gaaaagcaggacaaagagaaacctgagaaagacaaggaaattagtcctagtgttaccaag aaaaataccaacaagaaaaccaaaccaaagtctgacattctgaaagatcctcctagtgaa gcaaacagcatacagtctgcaaatgctacaacaaagaccagcgaaacaaatcacacctca aggccccggctgaaaaacgtggacaggagcactgcacagcagttggcagtaactgtgggc aacgtcaccgtcattatcacagactttaaggaaaagactcgctcctcatcgacatcctca tccacagtgacctccagtgcagggtcagaacagcagaaccagagcagctcggggtcagag agcacagacaagggctcctcccgttcctccacgccaaagggcgacatgtcagcagtcaat gatgaatctttctga >gi568815595r:72278388_72546623|GENSCAN_predicted_peptide_4|287_aa MAKDQMTVVTEGVTPTDQEKAIDQYQSMTWGLGTPVLEYRICSLSPKSHLCGCSNQGEPL VAMAYIYPVSMLNDLVLQVPGVQCKTSELVNKGAKLMFGQGDELGSLTGPDREWGFGPAS RPDAGTSEEANTKLRMAEEQRGAGERGPKVDLVGPWALQASRRCERSGPRLPSRPPQPHA QVHKSARFLRPPRPFPGRRRHRLVCEAIRPGPGACRVPVGAPGSLPRSLFQSPERFLIIL ITVTFIYQSTKETTYDLVVGQYDRELYLPFEGRIDIADVIGKILPFV >gi568815595r:72278388_72546623|GENSCAN_predicted_CDS_4|864_bp atggctaaggaccagatgacagtggtgaccgagggcgtaacccccactgaccaggagaag gccatagaccagtaccagtctatgacctgggggttgggaacccctgtcttggagtacaga atttgttcactgtcccccaaatctcatctgtgtggctgctccaaccaaggagaacctctg gttgccatggcatacatttatccggtgagcatgcttaatgatctcgttctccaggtccct ggagtacaatgtaagactagtgagcttgttaacaagggtgcaaagctgatgtttggtcaa ggtgatgaacttgggtcactgacggggcctgacagggagtggggcttcgggccagcgtct cggccggacgcagggacttcagaagaggcaaatacaaagctccgcatggcagaggaacag aggggcgcgggggaacgcggtccaaaggtggaccttgtcggcccctgggcactgcaagcc tcgaggcgctgcgagcgatccggcccgcggctcccctcccggcccccccagccccacgct caagtccacaagtccgcccgcttcctgcgcccgccgcggcccttcccggggcgccgccgc caccgcctggtgtgcgaggctatccggcccgggcccggggcttgcagggtgccagtgggg gcgcccggttcccttcctcgctctctcttccagtccccggaaaggtttttgatcattctc atcactgtcaccttcatctaccagtcaaccaaggaaaccacctatgacttagtggtaggc cagtatgatagagaactatatttgccctttgagggtagaattgatattgctgatgtgatt gggaaaatactgccattcgtctga >gi568815595r:72278388_72546623|GENSCAN_predicted_peptide_5|255_aa MAVATSLTAVAITKGGAADWPHLSDHMASFKDSCSWSLNAESSSSMYDILFNNYSLLNSN GYCETHGRYSPAHRDLPFWWAKKGILKFHLQRIHPKADIKKAFPYESLFYQGGKPFPEAT PPDFPSCPTDQEQAMYSSYKAGCEMSHTDIIIFQLSFHEKLGKLCRKKNKRASPSYHFLG SSFWKHIPLLRETAAKAKDCVVAPNPLPSEPLLRLLGSAHRRHCRRRNECGFEVFAKCVT HGKLPSYESLSFPIV >gi568815595r:72278388_72546623|GENSCAN_predicted_CDS_5|768_bp atggctgtggccaccagcctcacagcagtagccataacaaaaggaggagctgctgactgg cctcatctcagtgaccacatggcatcattcaaagattcgtgctcctggtccctcaacgca gagagcagttcctccatgtatgacattttatttaacaattattccttgttaaattccaac ggatactgtgagacacatggcagatacagccctgcccacagggaccttccattctggtgg gcaaagaaaggaatactgaaatttcatctacagagaatccaccccaaggcagacataaag aaggctttcccctatgagtctctcttttatcaaggaggaaagcctttcccagaagcaact ccaccagactttccttcgtgtccaactgaccaagaacaggccatgtactctagctacaag gcaggctgcgaaatgagccatacagacattatcatcttccagctgtcctttcatgaaaaa cttgggaagctctgcaggaaaaagaacaagagagcctctccttcgtaccattttcttggc agctcattttggaaacatatccccttactgcgagaaacagcagccaaagcaaaggactgt gtggttgctcccaatccccttccttctgagcctctgcttcggctgctgggttcagctcac agaaggcactgtaggcgaaggaacgagtgtggctttgaggtctttgccaagtgcgtgacc catggaaagttaccaagctatgagagcctcagtttccccattgtgtaa