GENSCAN 1.0	Date run: 8-Nov-116	Time: 11:52:41

Sequence gi568815589r:33285156_33487065 : 201910 bp : 48.13% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   5418   5494   77  2  2   78   52    74 0.465   1.36
 1.02 Intr +   9413  10272  860  2  2  106   97   371 0.639  30.80
 1.03 Intr +  57650  57699   50  0  2   39   81   104 0.368   3.20
 1.04 Intr +  58965  59033   69  2  0   40  109    85 0.526   5.18
 1.05 Intr +  67491  67564   74  2  2   62   96    45 0.656   0.90
 1.06 Intr +  68931  69032  102  0  0  106   91    96 0.842  10.99
 1.07 Intr +  78855  78953   99  0  0  111   91    46 0.981   6.43
 1.08 Intr +  79553  79619   67  2  1   86    9    87 0.485  -0.49
 1.09 Intr +  81474  81619  146  2  2   92   96   130 0.666  13.28
 1.10 Term +  84751  84823   73  1  1   97   48    43 0.492  -1.42
 1.11 PlyA +  87070  87075    6                              -0.45

 2.11 PlyA -  87210  87205    6                               1.05
 2.10 Term -  89812  89681  132  1  0   78   39    80 0.147   0.19
 2.09 Intr - 100135  99925  211  0  1  102   39   133 0.320   8.72
 2.08 Intr - 100711 100494  218  1  2   58   84   376 0.988  31.50
 2.07 Intr - 100873 100740  134  2  2   90   99    17 0.505   3.36
 2.06 Intr - 101386 101249  138  2  0  131   79   193 0.999  23.14
 2.05 Intr - 101937 101814  124  0  1   67  105   104 0.957  10.16
 2.04 Intr - 110040 109923  118  2  1   30  101   130 0.001   8.97
 2.03 Intr - 116132 116082   51  1  0   84  115    16 0.012   1.92
 2.02 Intr - 142364 142298   67  0  1  135    4   101 0.021   4.36
 2.01 Init - 154003 153931   73  1  1   64   86    40 0.157   2.63
 2.00 Prom - 154262 154223   40                              -8.86

 3.34 PlyA - 156023 156018    6                               1.05
 3.33 Term - 157056 156888  169  2  1  124   48   293 0.995  26.25
 3.32 Intr - 157363 157146  218  1  2   80   94   308 0.999  27.90
 3.31 Intr - 157815 157697  119  1  2   91   80   170 0.999  16.68
 3.30 Intr - 158303 158166  138  0  0   71  103   117 0.999  11.94
 3.29 Intr - 158737 158611  127  1  1  126   95   204 0.999  25.15
 3.28 Intr - 162394 162268  127  0  1   72   89   277 0.316  26.88
 3.27 Intr - 177658 177522  137  1  2  116   28   156 0.125  11.77
 3.26 Intr - 177979 177878  102  1  0  130   75    94 0.808  12.67
 3.25 Intr - 178286 178092  195  2  0  126    7   278 0.971  23.11
 3.24 Intr - 178765 178676   90  1  0   97   98   158 0.999  17.89
 3.23 Intr - 179006 178882  125  0  2   75   26   124 0.913   5.20
 3.22 Intr - 179811 179724   88  0  1   80  119    40 0.641   5.84
 3.21 Intr - 180204 180150   55  2  1  141   65    71 0.948   9.08
 3.20 Intr - 180742 180579  164  1  2   98   90   188 0.999  18.77
 3.19 Intr - 181070 180916  155  0  2   77   72   157 0.973  12.79
 3.18 Intr - 181270 181153  118  1  1  118  111   123 0.993  17.64
 3.17 Intr - 181554 181414  141  0  0  104   78   148 0.996  15.95
 3.16 Intr - 181832 181757   76  1  1  101   94    75 0.999   8.92
 3.15 Intr - 182107 181959  149  1  2  109   74   158 0.999  15.53
 3.14 Intr - 182713 182536  178  0  1   86   78    86 0.987   7.32
 3.13 Intr - 182990 182875  116  2  2   56   73   100 0.981   4.55
 3.12 Intr - 183267 183166  102  0  0   61   81    86 0.970   5.57
 3.11 Intr - 183411 183353   59  1  2   51   97    53 0.888   1.10
 3.10 Intr - 183717 183597  121  0  1  158   68    43 0.930   9.37
 3.09 Intr - 183966 183803  164  1  2  133  113   147 0.999  21.39
 3.08 Intr - 184186 184052  135  2  0   63   89   117 0.997   9.84
 3.07 Intr - 184512 184344  169  0  1  117   69   208 0.998  21.32
 3.06 Intr - 185036 184857  180  2  0  108   86   175 0.996  19.36
 3.05 Intr - 186965 186849  117  2  0  115   99   140 0.986  18.46
 3.04 Intr - 187257 187051  207  0  0   81   84   177 0.972  15.87
 3.03 Intr - 188318 188189  130  1  1   79   42    82 0.588   3.40
 3.02 Intr - 196019 195940   80  2  2   95   94    42 0.620   3.85
 3.01 Intr - 201326 201164  163  1  1   -2   98    96 0.184   1.58

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term - 142364 142273   92  0  2  135   42    89 0.888   6.88

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815589r:33285156_33487065|GENSCAN_predicted_peptide_1|538_aa
MAEAPPVSGIVPARAGLGPFQGAEIGQVPYDEISAVHQHSYHPSGSKPKSQQTSFQSSPC
NKSPKSHGLQNQPWQKLRNEKHHIRVKKAQSLAEQTSDTAGLESSTRSESGTDLREHSPS
ESEKEVVGADPRGAKPKKATQFVYSYGRGPKVKGKLKCEWSNRTTPKPEDAGPESTKPVG
VFHPDSSEASSRKGVLDGYGARRNEQRRYPQKRPPWEVEGARPRPGRNPPKQEGHRHTNA
GHRNNMGPIPKDDLNERPAKSTCDSENLAVINKSSRRVDQEKCTVRRQDPQVVSPFSRGK
QNHVLKNVETHTGVKNLVIVETARHAGKPFPVVLGPLNVPKPALESMSVTIQVELQCECG
RRKEMVICSEASSTYQRIAAISMASKITDMQLGGSVEISKLITKKEVHQARRLAEAFHIS
EDSDPFNIRSSGSKFSDSLKEDARKDLKFVSDVEKEMETLVEAVNKGKNSKKSHSFPPMN
RDHRRIIHDLAQVYGLESVSYDSEPKRNVVVTAIRNPGSSNLQKITKEPIIDYFDVQD

>gi568815589r:33285156_33487065|GENSCAN_predicted_CDS_1|1617_bp
atggcggaggcgcctcctgtctcaggtattgtcccggcccgagcgggactgggccccttt
cagggagcggaaattgggcaggtcccttatgatgaaatctctgctgttcatcagcatagt
tatcatccgtcaggaagcaaacctaagagtcagcagacgtctttccagtcctctccttgt
aataaatcgcccaagagccatggccttcagaatcaaccttggcagaaattgaggaatgag
aagcaccatatcagagtcaagaaagcacagagtcttgctgagcagacctcagatacagct
ggattagagagctcgaccagatcagagagtgggacagacctcagagagcatagtccttct
gagagtgagaaggaagttgtgggtgcagatcccaggggagcaaaacccaaaaaagcaaca
cagtttgtatacagctatggtagaggaccaaaagtcaaggggaaactcaaatgtgaatgg
agtaaccgaacaactccaaaaccggaggatgctggacccgaaagtaccaaacctgtgggg
gttttccaccctgactcttcagaggcatcctctagaaaaggagtattggatgggtatgga
gccagacgaaatgagcagagaagatacccacagaaaaggcctccctgggaagtggagggg
gccaggccacgaccaggcagaaatccaccaaaacaggagggccaccgacatacaaacgca
ggacacagaaacaacatgggccccattccaaaggatgacctcaatgaaagaccagcaaaa
tctacctgtgacagtgagaacttggcagtcatcaacaagtcttccaggagggttgaccaa
gagaaatgcactgtacggaggcaggatcctcaagtagtatctcctttctcccgaggcaaa
cagaaccatgtgctaaagaatgtggaaacgcacacaggtgtgaagaaccttgtcatcgtg
gaaactgccagacatgctggcaagccattccctgtggtactaggccccctgaatgtaccc
aaacctgcgctagagtccatgagtgtgaccatccaggtagagctacagtgtgaatgtgga
cgaagaaaagagatggtgatttgctctgaagcatctagtacttatcaaagaatagctgca
atctccatggcctctaagataacagacatgcagcttggaggttcagtggagatcagcaag
ttaattaccaaaaaggaagttcatcaagccaggagattagcagaggcatttcatatcagt
gaggattctgatcctttcaatatacgttcttcagggtcaaaattcagtgatagtttgaaa
gaagatgccaggaaggacttaaagtttgtcagtgacgttgagaaggaaatggaaaccctc
gtggaggccgtgaataagggaaagaatagtaagaaaagccacagcttccctcccatgaac
agagaccaccgccggatcatccatgacttggcccaagtttatggcctggagagcgtgagc
tatgacagtgaaccgaagcgcaatgtggtggtcactgccatcaggaatcctgggagcagt
aatttacagaaaataaccaaggagccaataattgactattttgacgtccaggactaa

>gi568815589r:33285156_33487065|GENSCAN_predicted_peptide_2|421_aa
MQGKEARGRNQGNVLPGGSGHVREGDSYEKELEAQKLLMGSAVEEACIYKSERQNMVQAS
GHRRSTRGSKMVSWSVIAKIQEILQRKMVREFLAEFMSTYVMMVFGLGSVAHMVLNKKYG
SYLGVNLGFGFGVTMGVHVAGRISGAHMNAAVTFANCALGRVPWRKFPVYVLGQFLGSFL
AAATIYSLFYRRSQQGVPPDRQDKNSGWRLYRDVSLLVGLGLGHCRGPVAWGGAQAWLTG
MLQLCLFAITDQENNPALPGTEALVIGILVVIIGVSLGMNTGYAINPSRDLPPRIFTFIA
GWGKQVFSNGENWWWVPVVAPLLGAYLGGIIYLVFIGSTIPREPLKLEDSVAYEDHGITV
LPKMGSHEPTISPLTPVSELQEELYKCSNSKAVLRQFQGQFTALVKDTDLRGISDHRLFA
E

>gi568815589r:33285156_33487065|GENSCAN_predicted_CDS_2|1266_bp
atgcagggcaaagaagcacgtggccgcaaccaagggaacgtgttgcctggagggtcaggc
catgtgagagagggggatagttacgagaaagaattagaggcccagaaacttctaatgggg
tcagccgtggaagaagcctgcatctacaaatctgaaagacaaaacatggttcaagcatcc
gggcacaggcggtccacccgtggctccaaaatggtctcctggtccgtgatagcaaagatc
caggaaatactgcagaggaagatggtgcgagagttcctggccgagttcatgagcacatat
gtcatgatggtattcggccttggttccgtggcccatatggttctaaataaaaaatatggg
agctaccttggtgtcaacttgggttttggcttcggagtcaccatgggagtgcacgtggca
ggccgcatctctggagcccacatgaacgcagctgtgacctttgctaactgtgcgctgggc
cgcgtgccctggaggaagtttccggtctatgtgctggggcagttcctgggctccttcctg
gcggctgccaccatctacagtctcttctacagacggagccagcagggagtccctccggat
agacaggacaagaactctggatggagactgtaccgagacgtgtctctgctggtgggcttg
ggtctggggcactgccgaggtcctgtggcttggggaggggcccaggcgtggctgaccggg
atgctccagctgtgtctcttcgccatcacggaccaggagaacaacccagcactgccagga
acagaggcgctggtgataggcatcctcgtggtcatcatcggggtgtcccttggcatgaac
acaggatatgccatcaacccgtcccgggacctgcccccccgcatcttcaccttcattgct
ggttggggcaaacaggtcttcagcaatggggagaactggtggtgggtgccagtggtggca
ccacttctgggtgcctatctaggtggcatcatctacctggtcttcattggctccaccatc
ccacgggagcccctgaaattggaggattctgtggcgtatgaagaccacgggataaccgta
ttgcccaagatgggatctcatgaacccacgatctctcccctcacccccgtctctgaactc
caagaggagctgtacaaatgctccaattccaaggcagttttaagacagtttcaggggcaa
ttcacagctcttgtcaaggacacagacctgagaggaatttcagatcaccggttgtttgct
gagtga

>gi568815589r:33285156_33487065|GENSCAN_predicted_peptide_3|1471_aa
XHELLHLVKKGTLDSARVEDLACDSLQLPHCGLRTVLSSPENPSALETSGLVRRTVMDCT
TTIPVCGPGDGDPCFNNSLWARICVQFCAPRCSGGHTVPLLERSSKALWDERHRGADGPT
AVGEKVMEPALEGTGKEGKKASSRKRTLAEPPAKGLLQPVKLSRAELYKEPTNEELNRLR
ETEILFHSSLLRLQVEELLKEVRLSEKKKDRIDAFLREVNQRVVRVPSVPETELTDQAWL
PAGVRVPLHQVPYAVKGCFRFLPPAQVTVVGSYLLGTCIRPDINVDVALTMPREILQDKD
GLNQRYFRKRALYLAHLAHHLAQDPLFGSVCFSYTNGCHLKPSLLLRPRGKDERLVTVRL
HPCPPPDFFRPCRLLPTKNNVRSAWYRGQSPAGDGSPEPPTPRYNTWVLQDTVLESHLQL
LSTILSSAQGLKDGVALLKVWLRQRELDKGQGGFTGFLVSMLVVFLVSTRKIHTTMSGYQ
VLRSVLQFLATTDLTVNGISLCLSSDPSLPALADFHQAFSVVFLDSSGHLNLCADVTAST
YHQVQHEARLSMMLLDSRADDGFHLLLMTPKPMIRAFDHVLHLRPLSRLQAACHRLKLWP
ELQDNGGDYVSAALGPLTTLLEQGLGARLNLLAHSRPPVPEAAKFRQFWGSRSELRRFQD
GAIREAVVWEAASMSQKRLIPHQVVTHLLALHADIPETCVHYVGGPLDALIQGLKETSST
GEEALVAAVRCYDDLSRLLWGLEGLPLTVSAVQGAHPVLRYTEVFPPTPVRPAFSFYETL
RERSSLLPRLDKPCPAYVEPMTVVCHLEGSGQWPQDAEAVQRVRAAFQLRLAELLTQQHG
LQCRATATHTDVLKDGFVFRIRVAYQREPQILKEVQSPEGMISLRDTAASLRLERDTRQL
PLLTSALHGLQQQHPAFSGVARLAKRWVGFLRFLFLVSTFDWKNNPLFVNLNNELTVEEQ
VEIRSGFLAARAQLPVMVIVTPQDRKNSVWTQDGPSAQILQQLVVLAAEALPMLEKQLMD
PRGPGDIRTVFRPPLDIYDVLIRLSPRHIPRHRQAVDSPAASFCRGLLSQPGPSSLMPVL
GYDPPQLYLTQLREAFGDLALFFYDQHGGEVIGVLWKPTSFQPQPFKASSTKGRMVMSRG
GELVMVPNVEAILEDFAVLGEGLVQTVEARSESAAACPAMGRQKELVSRCGEMLHIRYRL
LRQALAECLGTLILVMFGCGSVAQVVLSRGTHGGFLTINLAFGFAVTLGILIAGQVSGAH
LNPAVTFAMCFLAREPWIKLPIYTLAQTLGAFLGAGIVFGLYYDAIWHFADNQLFVSGPN
GTAGIFATYPSGHLDMINGFFDQFIGTASLIVCVLAIVDPYNNPVPRGLEAFTVGLVVLV
IGTSMGFNSGYAVNPARDFGPRLFTALAGWGSAVFTTGQHWWWVPIVSPLLGSIAGVFVY
QLMIGCHLEQPPPSNEEENVKLAHVKHKEQI
>gi568815589r:33285156_33487065|GENSCAN_predicted_CDS_3|4416_bp
nngcatgagctactgcacctggtcaagaaaggcactttagacagtgcccgagttgaggat
ttggcctgtgactcgctgcagttgccacactgtggccttagaacagtcttatccagtcct
gaaaaccccagtgccttggaaacatcaggactggtcagaaggacagtcatggactgtacc
acgaccataccggtctgtggcccaggggatggagacccctgctttaacaatagcctttgg
gccagaatttgtgtccagttctgtgctccccgctgtagcggagggcatacggtgcctctc
ctggagcgctccagtaaggctctctgggatgagaggcatcgtggagcggatgggcccacg
gctgtaggggaaaaggtgatggaaccagccctggaaggcacaggcaaagaggggaagaaa
gcatcctccaggaagcgtacattggctgaacctccagcgaagggcctcctgcagccagtg
aagctcagcagggcagaactgtacaaggagcctaccaatgaggagcttaatcgccttcgg
gagactgagatcttgttccactccagcttgcttcgtttacaggtagaggagctactaaag
gaagtaaggctgtcagagaagaagaaggatcggattgatgccttcctacgggaggtcaac
cagcgggttgtgagggtgccctcagtccctgagacagagctcactgaccaggcatggctc
ccagctggggttcgagtgcccctccaccaagtgccctatgccgtgaagggctgtttccgc
ttcctgcccccagcccaggttactgttgtgggcagctaccttctgggcacctgcatccga
ccagacatcaatgtggatgtggcactgaccatgcccagggaaatcctacaggacaaggac
gggctgaaccagcgctacttccgcaagcgtgccctctacctggcccacttggctcaccac
ctggcccaggaccccctctttggcagtgtttgcttctcctacacaaatggctgccacctg
aaaccctcactgttgctgcggccgcgtggaaaggatgagcgcctggtcactgtacgtctg
catccgtgccctccacctgacttcttccgcccgtgccgcttgctgccaaccaagaacaat
gtgcgctctgcctggtaccgagggcagagtcctgcaggggatggtagcccagagcctcct
accccccgctataacacatgggtcctgcaagatacagttctcgagtcccatttgcagctg
ctgtcaaccattctgagttcagcccagggcctgaaggatggcgtggcacttctgaaggtc
tggctgcggcagcgggagctggacaagggccagggtgggtttactgggttccttgtctcc
atgctggttgtcttccttgtgtctacacgcaagatccataccaccatgagtggctaccag
gtcctgagaagtgtcttgcagtttctggccactacagacctgacagtcaacgggatcagt
ttatgtctcagctcagatccctctttgccggccctggctgacttccaccaggccttctcc
gttgtcttcctggattcctcaggccatctcaacctctgtgctgatgtcactgcctctact
taccaccaggtacagcatgaggcacggctgtctatgatgttgctggacagcagagctgac
gacgggttccacctgctgttgatgactcccaaacccatgatccgggcttttgaccatgtc
ctgcatctccgtccactgagtcgcctgcaggcagcgtgccaccggctgaagctctggcca
gagctgcaggacaatggtggggactatgtctcagctgctttgggccccctgaccaccctc
ctggagcagggcctgggggctcggctgaacctgctggctcactctcgacccccagtccca
gaggctgctaaattccgccagttctggggatcccgctcggagcttcggcgtttccaggac
ggagccattcgggaagctgtggtctgggaggcagcctctatgtcccagaagcgccttatt
ccccaccaggtggtcacccacctcttggcactccatgctgacatcccagaaacctgtgtc
cactatgtggggggccccctggatgcacttatccaaggcctgaaagagacctccagcaca
ggtgaggaggccctggtagcggcggtacgttgctacgacgacctcagtcgcctactgtgg
gggctagagggtctcccactgaccgtgtctgctgttcagggagctcacccagtgctgcgc
tacacagaggtgttcccaccaactccagtccgtccagccttctccttctatgagactctg
cgggagcggtcctcactgctgccccggctcgataagccctgtccggcctacgtggagccc
atgaccgtggtttgtcacctggagggcagtggccagtggccacaggacgctgaggccgtg
cagcgggtccgagctgccttccagctgcgcctggcagagctgttgacacaacagcatggt
ctgcagtgccgtgccactgccacgcacacggatgtccttaaggatggatttgtgtttcgg
attcgcgtggcctatcagcgggagccccagatcctgaaggaggtgcagagcccagagggg
atgatctcgctgagggacacagctgcctccctccgccttgagagagacacaaggcagttg
ccactgctcaccagtgccctgcacggactgcagcagcagcacccagccttctctggtgtg
gcacggctggccaagcggtgggttggcttccttcgattccttttcttggtatcaacgttt
gattggaagaacaaccccctctttgtcaacctcaataatgagctcactgtggaggagcag
gtggagatccgcagtggcttcctggcagctcgggcacagctccccgtcatggtcattgtt
accccccaagaccgcaaaaactctgtgtggacacaggatggaccctcagcccagatcctg
cagcagcttgtggtcctggcagctgaagccctgcccatgttagagaagcagctcatggat
ccccggggacctggggacatcaggacagtgttccggccgcccttggacatttacgacgtg
ctgattcgcctgtctcctcgccatatcccgcggcaccgccaggctgtggactcgccagct
gcctccttctgccggggcctgctcagccagccggggccctcatccctgatgcccgtgctg
ggctatgatcctcctcagctctatctgacgcagctcagggaggcctttggggatctggcc
cttttcttctatgaccagcatggtggagaggtgattggtgtcctctggaagcccaccagc
ttccagccgcagcccttcaaggcctccagcacaaaggggcgcatggtgatgtctcgaggt
ggggagctagtaatggtgcccaatgttgaagcaatcctggaggactttgctgtgctgggt
gaaggcctggtgcagactgtggaggcccgaagtgagagcgccgccgcctgccccgccatg
ggtcgacagaaggagctggtgtcccgctgcggggagatgctccacatccgctaccggctg
ctccgacaggcgctggccgagtgcctggggaccctcatcctggtgatgtttggctgtggc
tccgtggcccaggttgtgctcagccggggcacccacggtggtttcctcaccatcaacctg
gcctttggctttgctgtcactctgggcatcctcatcgctggccaggtctctggggcccac
ctgaaccctgccgtgacctttgccatgtgcttcctggctcgtgagccctggatcaagctg
cccatctacaccctggcacagacgctgggagccttcttgggtgctggaatagtttttggg
ctgtattatgatgcaatctggcacttcgccgacaaccagctttttgtttcgggccccaat
ggcacagccggcatctttgctacctacccctctggacacttggatatgatcaatggcttc
tttgaccagttcataggcacagcctcccttatcgtgtgtgtgctggccattgttgacccc
tacaacaaccccgtcccccgaggcctggaggccttcaccgtgggcctggtggtcctggtc
attggcacctccatgggcttcaactccggctatgccgtcaaccctgcccgggactttggc
ccccgcctttttacagcccttgcgggctggggctctgcagtcttcacgaccggccagcat
tggtggtgggtgcccatcgtgtccccactcctgggctccattgcgggtgtcttcgtgtac
cagctgatgatcggctgccacctggagcagcccccaccctccaacgaggaagagaatgtg
aagctggcccatgtgaagcacaaggagcagatctga