GENSCAN 1.0 Date run: 6-Nov-116 Time: 15:47:28

Sequence gi568815589r:33342046_33547530 : 205485 bp : 46.63% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +    760    809   50  2  2   39   81   104 0.621   3.20
 1.02 Intr +   2075   2143   69  1  0   40  109    85 0.680   5.18
 1.03 Intr +  10601  10674   74  1  2   62   96    45 0.682   0.90
 1.04 Intr +  12041  12142  102  2  0  106   91    96 0.851  10.99
 1.05 Intr +  21965  22063   99  2  0  111   91    46 0.981   6.43
 1.06 Intr +  22663  22729   67  1  1   86    9    87 0.485  -0.49
 1.07 Intr +  24584  24729  146  1  2   92   96   130 0.666  13.28
 1.08 Term +  27861  27933   73  0  1   97   48    43 0.492  -1.42
 1.09 PlyA +  30180  30185    6                              -0.45

 2.11 PlyA -  30320  30315    6                               1.05
 2.10 Term -  32922  32791  132  0  0   78   39    80 0.147   0.19
 2.09 Intr -  43245  43035  211  2  1  102   39   133 0.320   8.72
 2.08 Intr -  43821  43604  218  0  2   58   84   376 0.988  31.50
 2.07 Intr -  43983  43850  134  1  2   90   99    17 0.505   3.36
 2.06 Intr -  44496  44359  138  1  0  131   79   193 0.999  23.14
 2.05 Intr -  45047  44924  124  2  1   67  105   104 0.957  10.16
 2.04 Intr -  53150  53033  118  1  1   30  101   130 0.001   8.97
 2.03 Intr -  59242  59192   51  0  0   84  115    16 0.012   1.92
 2.02 Intr -  85474  85408   67  2  1  135    4   101 0.021   4.36
 2.01 Init -  97113  97041   73  0  1   64   86    40 0.157   2.63
 2.00 Prom -  97372  97333   40                              -8.86

 3.35 PlyA -  99133  99128    6                               1.05
 3.34 Term - 100166  99998  169  1  1  124   48   293 0.995  26.25
 3.33 Intr - 100473 100256  218  0  2   80   94   308 0.999  27.90
 3.32 Intr - 100925 100807  119  0  2   91   80   170 0.999  16.68
 3.31 Intr - 101413 101276  138  2  0   71  103   117 0.999  11.94
 3.30 Intr - 101847 101721  127  0  1  126   95   204 0.999  25.15
 3.29 Intr - 105504 105378  127  2  1   72   89   277 0.316  26.88
 3.28 Intr - 120768 120632  137  0  2  116   28   156 0.125  11.77
 3.27 Intr - 121089 120988  102  0  0  130   75    94 0.808  12.67
 3.26 Intr - 121396 121202  195  1  0  126    7   278 0.971  23.11
 3.25 Intr - 121875 121786   90  0  0   97   98   158 0.999  17.89
 3.24 Intr - 122116 121992  125  2  2   75   26   124 0.913   5.20
 3.23 Intr - 122921 122834   88  2  1   80  119    40 0.641   5.84
 3.22 Intr - 123314 123260   55  1  1  141   65    71 0.948   9.08
 3.21 Intr - 123852 123689  164  0  2   98   90   188 0.999  18.77
 3.20 Intr - 124180 124026  155  2  2   77   72   157 0.973  12.79
 3.19 Intr - 124380 124263  118  0  1  118  111   123 0.993  17.64
 3.18 Intr - 124664 124524  141  2  0  104   78   148 0.996  15.95
 3.17 Intr - 124942 124867   76  0  1  101   94    75 0.999   8.92
 3.16 Intr - 125217 125069  149  0  2  109   74   158 0.999  15.53
 3.15 Intr - 125823 125646  178  2  1   86   78    86 0.987   7.32
 3.14 Intr - 126100 125985  116  1  2   56   73   100 0.981   4.55
 3.13 Intr - 126377 126276  102  2  0   61   81    86 0.970   5.57
 3.12 Intr - 126521 126463   59  0  2   51   97    53 0.888   1.10
 3.11 Intr - 126827 126707  121  2  1  158   68    43 0.930   9.37
 3.10 Intr - 127076 126913  164  0  2  133  113   147 0.999  21.39
 3.09 Intr - 127296 127162  135  1  0   63   89   117 0.997   9.84
 3.08 Intr - 127622 127454  169  2  1  117   69   208 0.998  21.32
 3.07 Intr - 128146 127967  180  1  0  108   86   175 0.996  19.36
 3.06 Intr - 130075 129959  117  1  0  115   99   140 0.986  18.46
 3.05 Intr - 130367 130161  207  2  0   81   84   177 0.980  15.87
 3.04 Intr - 131428 131299  130  0  1   79   42    82 0.583   3.40
 3.03 Intr - 139129 139050   80  1  2   95   94    42 0.623   3.85
 3.02 Intr - 144436 144274  163  0  1   -2   98    96 0.375   1.58
 3.01 Init - 148552 148500   53  1  2   78   61    39 0.625   0.83
 3.00 Prom - 149231 149192   40                              -0.46

 4.00 Prom + 159105 159144   40                              -1.06
 4.01 Init + 182445 182650  206  2  2  112   74   347 0.995  32.02
 4.02 Intr + 186682 186796  115  1  1  106   71    57 0.993   6.25
 4.03 Intr + 186955 187128  174  0  0   17   87   117 0.788   4.64

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  85474  85383   92  2  2  135   42    89 0.888   6.88
S.002 Term - 164470 164455   16  0  1  128   48    11 0.862  -1.19
S.003 Init - 167188 167085  104  1  2  111   63    78 0.949   7.31

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815589r:33342046_33547530|GENSCAN_predicted_peptide_1|226_aa
XVKNLVIVETARHAGKPFPVVLGPLNVPKPALESMSVTIQVELQCECGRRKEMVICSEAS
STYQRIAAISMASKITDMQLGGSVEISKLITKKEVHQARRLAEAFHISEDSDPFNIRSSG
SKFSDSLKEDARKDLKFVSDVEKEMETLVEAVNKGKNSKKSHSFPPMNRDHRRIIHDLAQ
VYGLESVSYDSEPKRNVVVTAIRNPGSSNLQKITKEPIIDYFDVQD

>gi568815589r:33342046_33547530|GENSCAN_predicted_CDS_1|681_bp
ngtgtgaagaaccttgtcatcgtggaaactgccagacatgctggcaagccattccctgtg
gtactaggccccctgaatgtacccaaacctgcgctagagtccatgagtgtgaccatccag
gtagagctacagtgtgaatgtggacgaagaaaagagatggtgatttgctctgaagcatct
agtacttatcaaagaatagctgcaatctccatggcctctaagataacagacatgcagctt
ggaggttcagtggagatcagcaagttaattaccaaaaaggaagttcatcaagccaggaga
ttagcagaggcatttcatatcagtgaggattctgatcctttcaatatacgttcttcaggg
tcaaaattcagtgatagtttgaaagaagatgccaggaaggacttaaagtttgtcagtgac
gttgagaaggaaatggaaaccctcgtggaggccgtgaataagggaaagaatagtaagaaa
agccacagcttccctcccatgaacagagaccaccgccggatcatccatgacttggcccaa
gtttatggcctggagagcgtgagctatgacagtgaaccgaagcgcaatgtggtggtcact
gccatcaggaatcctgggagcagtaatttacagaaaataaccaaggagccaataattgac
tattttgacgtccaggactaa

>gi568815589r:33342046_33547530|GENSCAN_predicted_peptide_2|421_aa
MQGKEARGRNQGNVLPGGSGHVREGDSYEKELEAQKLLMGSAVEEACIYKSERQNMVQAS
GHRRSTRGSKMVSWSVIAKIQEILQRKMVREFLAEFMSTYVMMVFGLGSVAHMVLNKKYG
SYLGVNLGFGFGVTMGVHVAGRISGAHMNAAVTFANCALGRVPWRKFPVYVLGQFLGSFL
AAATIYSLFYRRSQQGVPPDRQDKNSGWRLYRDVSLLVGLGLGHCRGPVAWGGAQAWLTG
MLQLCLFAITDQENNPALPGTEALVIGILVVIIGVSLGMNTGYAINPSRDLPPRIFTFIA
GWGKQVFSNGENWWWVPVVAPLLGAYLGGIIYLVFIGSTIPREPLKLEDSVAYEDHGITV
LPKMGSHEPTISPLTPVSELQEELYKCSNSKAVLRQFQGQFTALVKDTDLRGISDHRLFA
E

>gi568815589r:33342046_33547530|GENSCAN_predicted_CDS_2|1266_bp
atgcagggcaaagaagcacgtggccgcaaccaagggaacgtgttgcctggagggtcaggc
catgtgagagagggggatagttacgagaaagaattagaggcccagaaacttctaatgggg
tcagccgtggaagaagcctgcatctacaaatctgaaagacaaaacatggttcaagcatcc
gggcacaggcggtccacccgtggctccaaaatggtctcctggtccgtgatagcaaagatc
caggaaatactgcagaggaagatggtgcgagagttcctggccgagttcatgagcacatat
gtcatgatggtattcggccttggttccgtggcccatatggttctaaataaaaaatatggg
agctaccttggtgtcaacttgggttttggcttcggagtcaccatgggagtgcacgtggca
ggccgcatctctggagcccacatgaacgcagctgtgacctttgctaactgtgcgctgggc
cgcgtgccctggaggaagtttccggtctatgtgctggggcagttcctgggctccttcctg
gcggctgccaccatctacagtctcttctacagacggagccagcagggagtccctccggat
agacaggacaagaactctggatggagactgtaccgagacgtgtctctgctggtgggcttg
ggtctggggcactgccgaggtcctgtggcttggggaggggcccaggcgtggctgaccggg
atgctccagctgtgtctcttcgccatcacggaccaggagaacaacccagcactgccagga
acagaggcgctggtgataggcatcctcgtggtcatcatcggggtgtcccttggcatgaac
acaggatatgccatcaacccgtcccgggacctgcccccccgcatcttcaccttcattgct
ggttggggcaaacaggtcttcagcaatggggagaactggtggtgggtgccagtggtggca
ccacttctgggtgcctatctaggtggcatcatctacctggtcttcattggctccaccatc
ccacgggagcccctgaaattggaggattctgtggcgtatgaagaccacgggataaccgta
ttgcccaagatgggatctcatgaacccacgatctctcccctcacccccgtctctgaactc
caagaggagctgtacaaatgctccaattccaaggcagttttaagacagtttcaggggcaa
ttcacagctcttgtcaaggacacagacctgagaggaatttcagatcaccggttgtttgct
gagtga

>gi568815589r:33342046_33547530|GENSCAN_predicted_peptide_3|1488_aa
MWKQPKCSSVDEWINNMRHELLHLVKKGTLDSARVEDLACDSLQLPHCGLRTVLSSPENP
SALETSGLVRRTVMDCTTTIPVCGPGDGDPCFNNSLWARICVQFCAPRCSGGHTVPLLER
SSKALWDERHRGADGPTAVGEKVMEPALEGTGKEGKKASSRKRTLAEPPAKGLLQPVKLS
RAELYKEPTNEELNRLRETEILFHSSLLRLQVEELLKEVRLSEKKKDRIDAFLREVNQRV
VRVPSVPETELTDQAWLPAGVRVPLHQVPYAVKGCFRFLPPAQVTVVGSYLLGTCIRPDI
NVDVALTMPREILQDKDGLNQRYFRKRALYLAHLAHHLAQDPLFGSVCFSYTNGCHLKPS
LLLRPRGKDERLVTVRLHPCPPPDFFRPCRLLPTKNNVRSAWYRGQSPAGDGSPEPPTPR
YNTWVLQDTVLESHLQLLSTILSSAQGLKDGVALLKVWLRQRELDKGQGGFTGFLVSMLV
VFLVSTRKIHTTMSGYQVLRSVLQFLATTDLTVNGISLCLSSDPSLPALADFHQAFSVVF
LDSSGHLNLCADVTASTYHQVQHEARLSMMLLDSRADDGFHLLLMTPKPMIRAFDHVLHL
RPLSRLQAACHRLKLWPELQDNGGDYVSAALGPLTTLLEQGLGARLNLLAHSRPPVPEAA
KFRQFWGSRSELRRFQDGAIREAVVWEAASMSQKRLIPHQVVTHLLALHADIPETCVHYV
GGPLDALIQGLKETSSTGEEALVAAVRCYDDLSRLLWGLEGLPLTVSAVQGAHPVLRYTE
VFPPTPVRPAFSFYETLRERSSLLPRLDKPCPAYVEPMTVVCHLEGSGQWPQDAEAVQRV
RAAFQLRLAELLTQQHGLQCRATATHTDVLKDGFVFRIRVAYQREPQILKEVQSPEGMIS
LRDTAASLRLERDTRQLPLLTSALHGLQQQHPAFSGVARLAKRWVGFLRFLFLVSTFDWK
NNPLFVNLNNELTVEEQVEIRSGFLAARAQLPVMVIVTPQDRKNSVWTQDGPSAQILQQL
VVLAAEALPMLEKQLMDPRGPGDIRTVFRPPLDIYDVLIRLSPRHIPRHRQAVDSPAASF
CRGLLSQPGPSSLMPVLGYDPPQLYLTQLREAFGDLALFFYDQHGGEVIGVLWKPTSFQP
QPFKASSTKGRMVMSRGGELVMVPNVEAILEDFAVLGEGLVQTVEARSESAAACPAMGRQ
KELVSRCGEMLHIRYRLLRQALAECLGTLILVMFGCGSVAQVVLSRGTHGGFLTINLAFG
FAVTLGILIAGQVSGAHLNPAVTFAMCFLAREPWIKLPIYTLAQTLGAFLGAGIVFGLYY
DAIWHFADNQLFVSGPNGTAGIFATYPSGHLDMINGFFDQFIGTASLIVCVLAIVDPYNN
PVPRGLEAFTVGLVVLVIGTSMGFNSGYAVNPARDFGPRLFTALAGWGSAVFTTGQHWWW
VPIVSPLLGSIAGVFVYQLMIGCHLEQPPPSNEEENVKLAHVKHKEQI

>gi568815589r:33342046_33547530|GENSCAN_predicted_CDS_3|4467_bp
atgtggaagcaacccaaatgttcatctgtggatgaatggataaacaacatgaggcatgag
ctactgcacctggtcaagaaaggcactttagacagtgcccgagttgaggatttggcctgt
gactcgctgcagttgccacactgtggccttagaacagtcttatccagtcctgaaaacccc
agtgccttggaaacatcaggactggtcagaaggacagtcatggactgtaccacgaccata
ccggtctgtggcccaggggatggagacccctgctttaacaatagcctttgggccagaatt
tgtgtccagttctgtgctccccgctgtagcggagggcatacggtgcctctcctggagcgc
tccagtaaggctctctgggatgagaggcatcgtggagcggatgggcccacggctgtaggg
gaaaaggtgatggaaccagccctggaaggcacaggcaaagaggggaagaaagcatcctcc
aggaagcgtacattggctgaacctccagcgaagggcctcctgcagccagtgaagctcagc
agggcagaactgtacaaggagcctaccaatgaggagcttaatcgccttcgggagactgag
atcttgttccactccagcttgcttcgtttacaggtagaggagctactaaaggaagtaagg
ctgtcagagaagaagaaggatcggattgatgccttcctacgggaggtcaaccagcgggtt
gtgagggtgccctcagtccctgagacagagctcactgaccaggcatggctcccagctggg
gttcgagtgcccctccaccaagtgccctatgccgtgaagggctgtttccgcttcctgccc
ccagcccaggttactgttgtgggcagctaccttctgggcacctgcatccgaccagacatc
aatgtggatgtggcactgaccatgcccagggaaatcctacaggacaaggacgggctgaac
cagcgctacttccgcaagcgtgccctctacctggcccacttggctcaccacctggcccag
gaccccctctttggcagtgtttgcttctcctacacaaatggctgccacctgaaaccctca
ctgttgctgcggccgcgtggaaaggatgagcgcctggtcactgtacgtctgcatccgtgc
cctccacctgacttcttccgcccgtgccgcttgctgccaaccaagaacaatgtgcgctct
gcctggtaccgagggcagagtcctgcaggggatggtagcccagagcctcctaccccccgc
tataacacatgggtcctgcaagatacagttctcgagtcccatttgcagctgctgtcaacc
attctgagttcagcccagggcctgaaggatggcgtggcacttctgaaggtctggctgcgg
cagcgggagctggacaagggccagggtgggtttactgggttccttgtctccatgctggtt
gtcttccttgtgtctacacgcaagatccataccaccatgagtggctaccaggtcctgaga
agtgtcttgcagtttctggccactacagacctgacagtcaacgggatcagtttatgtctc
agctcagatccctctttgccggccctggctgacttccaccaggccttctccgttgtcttc
ctggattcctcaggccatctcaacctctgtgctgatgtcactgcctctacttaccaccag
gtacagcatgaggcacggctgtctatgatgttgctggacagcagagctgacgacgggttc
cacctgctgttgatgactcccaaacccatgatccgggcttttgaccatgtcctgcatctc
cgtccactgagtcgcctgcaggcagcgtgccaccggctgaagctctggccagagctgcag
gacaatggtggggactatgtctcagctgctttgggccccctgaccaccctcctggagcag
ggcctgggggctcggctgaacctgctggctcactctcgacccccagtcccagaggctgct
aaattccgccagttctggggatcccgctcggagcttcggcgtttccaggacggagccatt
cgggaagctgtggtctgggaggcagcctctatgtcccagaagcgccttattccccaccag
gtggtcacccacctcttggcactccatgctgacatcccagaaacctgtgtccactatgtg
gggggccccctggatgcacttatccaaggcctgaaagagacctccagcacaggtgaggag
gccctggtagcggcggtacgttgctacgacgacctcagtcgcctactgtgggggctagag
ggtctcccactgaccgtgtctgctgttcagggagctcacccagtgctgcgctacacagag
gtgttcccaccaactccagtccgtccagccttctccttctatgagactctgcgggagcgg
tcctcactgctgccccggctcgataagccctgtccggcctacgtggagcccatgaccgtg
gtttgtcacctggagggcagtggccagtggccacaggacgctgaggccgtgcagcgggtc
cgagctgccttccagctgcgcctggcagagctgttgacacaacagcatggtctgcagtgc
cgtgccactgccacgcacacggatgtccttaaggatggatttgtgtttcggattcgcgtg
gcctatcagcgggagccccagatcctgaaggaggtgcagagcccagaggggatgatctcg
ctgagggacacagctgcctccctccgccttgagagagacacaaggcagttgccactgctc
accagtgccctgcacggactgcagcagcagcacccagccttctctggtgtggcacggctg
gccaagcggtgggttggcttccttcgattccttttcttggtatcaacgtttgattggaag
aacaaccccctctttgtcaacctcaataatgagctcactgtggaggagcaggtggagatc
cgcagtggcttcctggcagctcgggcacagctccccgtcatggtcattgttaccccccaa
gaccgcaaaaactctgtgtggacacaggatggaccctcagcccagatcctgcagcagctt
gtggtcctggcagctgaagccctgcccatgttagagaagcagctcatggatccccgggga
cctggggacatcaggacagtgttccggccgcccttggacatttacgacgtgctgattcgc
ctgtctcctcgccatatcccgcggcaccgccaggctgtggactcgccagctgcctccttc
tgccggggcctgctcagccagccggggccctcatccctgatgcccgtgctgggctatgat
cctcctcagctctatctgacgcagctcagggaggcctttggggatctggcccttttcttc
tatgaccagcatggtggagaggtgattggtgtcctctggaagcccaccagcttccagccg
cagcccttcaaggcctccagcacaaaggggcgcatggtgatgtctcgaggtggggagcta
gtaatggtgcccaatgttgaagcaatcctggaggactttgctgtgctgggtgaaggcctg
gtgcagactgtggaggcccgaagtgagagcgccgccgcctgccccgccatgggtcgacag
aaggagctggtgtcccgctgcggggagatgctccacatccgctaccggctgctccgacag
gcgctggccgagtgcctggggaccctcatcctggtgatgtttggctgtggctccgtggcc
caggttgtgctcagccggggcacccacggtggtttcctcaccatcaacctggcctttggc
tttgctgtcactctgggcatcctcatcgctggccaggtctctggggcccacctgaaccct
gccgtgacctttgccatgtgcttcctggctcgtgagccctggatcaagctgcccatctac
accctggcacagacgctgggagccttcttgggtgctggaatagtttttgggctgtattat
gatgcaatctggcacttcgccgacaaccagctttttgtttcgggccccaatggcacagcc
ggcatctttgctacctacccctctggacacttggatatgatcaatggcttctttgaccag
ttcataggcacagcctcccttatcgtgtgtgtgctggccattgttgacccctacaacaac
cccgtcccccgaggcctggaggccttcaccgtgggcctggtggtcctggtcattggcacc
tccatgggcttcaactccggctatgccgtcaaccctgcccgggactttggcccccgcctt
tttacagcccttgcgggctggggctctgcagtcttcacgaccggccagcattggtggtgg
gtgcccatcgtgtccccactcctgggctccattgcgggtgtcttcgtgtaccagctgatg
atcggctgccacctggagcagcccccaccctccaacgaggaagagaatgtgaagctggcc
catgtgaagcacaaggagcagatctga

>gi568815589r:33342046_33547530|GENSCAN_predicted_peptide_4|165_aa
MRKLLSFGRRLGQALLSSMDQEYAGRGYHIRDWELRKIHRAAIKGDAAEVEHCLTRRFRD
LDVRDRKDRTVLHLACAHGRVQVVTLLLDRKCQINICDRLNRTPLMKAVHCQEEACAIIL
LKRGANPNIKDIYGNTALHYAVYNEGTSLAERLLSHHANIEALNK

>gi568815589r:33342046_33547530|GENSCAN_predicted_CDS_4|495_bp
atgaggaagctcctcagttttgggagacgcctgggccaggcgctcctgagctccatggac
caagagtatgcgggtcgggggtaccacattcgggactgggaactgcggaagatccacagg
gcggccatcaagggcgacgccgcagaggtggagcactgcctgacgcgcaggttccgggac
ttggacgtccgcgacagaaaagacaggactgttctacatttggcctgtgcccatggccgt
gtgcaagtggtcactctcttgctggacagaaaatgccagatcaacatctgtgacagacta
aacaggacacctttaatgaaggctgtacactgccaggaagaggcttgtgccattattctc
ctgaaacgtggcgccaatccaaacattaaggatatctacggcaacactgctctccattat
gccgtgtataatgaggggacttcactggcagaaagactgctttcccaccatgcaaatatt
gaagcactaaacaag