GENSCAN 1.0 Date run: 6-Nov-116 Time: 20:58:41 Sequence gi568815588f:35037836_35311800 : 273965 bp : 41.69% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.06 Intr - 1247 1085 163 2 1 39 115 159 0.548 12.33 1.05 Intr - 6841 6731 111 1 0 30 116 67 0.601 3.36 1.04 Intr - 7033 6937 97 0 1 43 92 10 0.708 -4.01 1.03 Intr - 11930 11848 83 2 2 121 87 125 0.939 13.22 1.02 Intr - 16704 16599 106 2 1 88 75 122 0.989 10.20 1.01 Init - 25144 25125 20 1 2 45 121 36 0.528 2.14 1.00 Prom - 29362 29323 40 -10.25 2.00 Prom + 29588 29627 40 -6.85 2.01 Init + 35133 35307 175 2 1 69 63 102 0.681 5.36 2.02 Term + 52269 52915 647 1 2 103 54 203 0.588 11.70 2.03 PlyA + 53109 53114 6 1.05 3.09 PlyA - 53590 53585 6 1.05 3.08 Term - 63225 63093 133 2 1 133 40 116 0.968 7.88 3.07 Intr - 88827 88733 95 0 2 23 68 124 0.033 1.74 3.06 Intr - 89827 89545 283 0 1 47 66 207 0.013 10.90 3.05 Intr - 90570 90498 73 1 1 46 76 15 0.091 -6.35 3.04 Intr - 103785 103621 165 1 0 72 80 88 0.590 5.41 3.03 Intr - 106070 105991 80 1 2 82 82 53 0.608 2.38 3.02 Intr - 106554 106395 160 1 1 109 82 54 0.631 5.02 3.01 Init - 107602 107554 49 1 1 52 96 41 0.401 2.46 3.00 Prom - 109110 109071 40 -7.15 4.00 Prom + 110998 111037 40 -8.25 4.01 Init + 111511 111568 58 0 1 73 115 13 0.666 4.02 4.02 Intr + 111932 112031 100 0 1 92 38 79 0.684 1.55 4.03 Term + 114445 114544 100 1 1 89 51 135 0.520 6.52 4.04 PlyA + 114608 114613 6 1.05 5.00 Prom + 120217 120256 40 -4.35 5.01 Init + 124165 124226 62 0 2 76 80 69 0.164 5.67 5.02 Intr + 129923 129968 46 2 1 27 94 85 0.047 0.49 5.03 Intr + 138018 138164 147 2 0 57 71 97 0.901 4.41 5.04 Intr + 141054 141151 98 2 2 49 97 96 0.942 4.49 5.05 Intr + 141299 141441 143 2 2 55 45 122 0.813 3.68 5.06 Intr + 150365 150553 189 0 0 75 71 142 0.839 9.84 5.07 Intr + 155117 155320 204 0 0 93 115 3 0.016 1.75 5.08 Intr + 163616 163651 36 0 0 75 119 19 0.247 1.02 5.09 Intr + 169060 169216 157 2 1 81 94 219 0.733 19.95 5.10 Intr + 173419 173559 141 1 0 99 -6 144 0.913 4.65 5.11 Term + 173821 173968 148 1 1 102 43 183 0.961 11.59 5.12 PlyA + 174265 174270 6 1.05 6.03 PlyA - 175403 175398 6 1.05 6.02 Term - 176821 176741 81 1 0 102 45 65 0.118 0.31 6.01 Init - 178732 178553 180 1 0 71 52 121 0.331 6.03 6.00 Prom - 179503 179464 40 -0.95 7.10 PlyA - 182080 182075 6 1.05 7.09 Term - 185802 185667 136 2 1 20 43 133 0.017 -1.49 7.08 Intr - 200766 200592 175 1 1 57 90 149 0.204 10.08 7.07 Intr - 220246 220084 163 1 1 64 31 80 0.009 -1.47 7.06 Intr - 227821 227728 94 0 1 80 5 118 0.001 1.75 7.05 Intr - 236823 236689 135 2 0 14 84 91 0.003 0.06 7.04 Intr - 240455 240339 117 1 0 96 82 37 0.014 2.56 7.03 Intr - 242875 242745 131 1 2 101 94 -12 0.034 -0.73 7.02 Intr - 268920 268494 427 2 1 24 80 209 0.709 6.57 7.01 Init - 269306 269206 101 2 2 35 95 124 0.715 7.58 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 89347 89199 149 2 2 34 42 200 0.839 7.08 S.002 Init - 231069 230961 109 0 1 79 80 111 0.937 8.11 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588f:35037836_35311800|GENSCAN_predicted_peptide_1|194_aa MFGICIRYLNTQFIKKNKLTEADLQYGYGGVDMNEPLMEIGELALDMWRKLMVEPLQAIL IRMLLREIKNDRGGEDPNQKVIHGVINSFVHVEQYKKKFPLKFYQEIFESPFLTETGEYY KQEASNLLQESNCSQYMEKVLGRLKDEEIRCRKYLHPSSYTKVIHECQQRMVADHLQFLH AECHNIIRQEKKNX >gi568815588f:35037836_35311800|GENSCAN_predicted_CDS_1|582_bp atgttcggcatttgcataaggtatctcaacacccagtttattaaaaagaataaattaaca gaagcggaccttcagtatggctatggtggtgtagatatgaatgaaccacttatggaaata ggagagctagcattggatatgtggaggaaattgatggttgaaccacttcaggccatcctt atccgaatgctgctccgagaaatcaaaaatgatcgtggtggagaagacccaaaccagaaa gtaatccatggggttattaactcctttgttcatgttgaacagtataagaaaaaattcccc ttaaagttttatcaggaaatttttgagtctccctttctgactgaaacaggagagtattac aaacaagaagcttcaaatttattacaagaatcaaactgctcacagtatatggaaaaggtt ctaggtagattaaaagatgaagaaattcgatgtcgaaaatacctacatccaagttcatat actaaggtgattcatgaatgtcaacaacgaatggtagcagaccacttacagtttttacat gcagaatgtcataatataattcgacaagagaaaaaaaatgnn >gi568815588f:35037836_35311800|GENSCAN_predicted_peptide_2|273_aa MHKENGQRERVGWRPHHQPRTGRGPSVFPDEKRAQGIPGIDSMEKNEKNQRKAKYPKEAP PPINREGKFPGACSPAPRPSAGLPSAEAGQGAGDKGRGRQPEGVKDEAPARKVGGGGGGG GCQLAFPCSQRCHYCAPRPPPRPAAATGPGDAGGGERSPGSPRADTHLSPRGKTAKSKAA AAPRRDRGGREGGKGEKDGPITGSVKQSSSSASGFAQHTPSGLLLPLLPNPPRHPESNGT GRVTASAGSQASCDCERPRKLIFQVIPCWACCQ >gi568815588f:35037836_35311800|GENSCAN_predicted_CDS_2|822_bp atgcataaagagaatgggcaaagggagcgggtaggatggcggccgcaccaccagccacgg actgggagaggcccttctgtgtttccagatgagaagagagctcaaggaatcccaggaata gactccatggaaaagaatgaaaaaaatcagagaaaagccaaatatccaaaagaggccccg ccacccattaaccgggaagggaagtttcccggcgcctgcagcccggctccccgaccctca gcggggttaccttctgcagaagcagggcaaggggcaggggacaaggggagggggaggcag cccgaaggagtgaaagacgaagccccggcgaggaaggtgggcggaggcggcggcggcggc ggctgtcagctcgcgttcccctgctcacagcgctgccattactgtgcgccgaggccgcct ccccggccggccgccgccaccggccccggagacgcggggggtggggagcgaagcccgggg agcccgcgcgccgacactcacctgtctccgcgaggaaagacagcaaagtccaaggccgcc gccgctccccggcgggacaggggcggtagggagggggggaaaggggagaaggacgggccg atcacgggatccgtaaaacaatcgtcctcctctgcctccggctttgcccaacacacaccc tccggtctccttcttccgctgctgccaaatcccccccgccacccggaaagcaatggtaca ggccgcgttacggcaagcgcagggagccaagcttcgtgtgattgcgaaaggccacggaaa ttaatattccaagtgattccttgctgggcttgctgtcaatga >gi568815588f:35037836_35311800|GENSCAN_predicted_peptide_3|345_aa MTKSARECGHMSRILKVKSIIIHSGSKARIVCLYIISQNEINHKKQPHKAKHQSGYHEAL SLREKGRDGSNMLSGTPKASLLSSQESTAHLPTPNQGTSKSVGPELNSSSSSTNFLPEFP ISGNGTPSSNELFKPETPETPSLSFHTASSPDLCFKGKVHPHLPGKLSRKETESRTFWGK TGCVRFRLNCARKVGQRRWGHSNPRSLLLDSFSFSELNRSPEPHGSLREDFPTRPKFARR QAGPGAAERTKRAWGGEPAASGAPSEGPRQKSNDLSARVTAVVRSSGSRGRFLPNAANPK LRIEALSIPDCSKAQVVRNDWEEGRKTQVAKDMDENPWRHGKKTL >gi568815588f:35037836_35311800|GENSCAN_predicted_CDS_3|1038_bp atgaccaagtctgccagagaatgtggtcatatgagtcgtatattgaaggtcaagagtatc atcatccattcaggctccaaagctagaatcgtgtgtctatatataatcagtcagaatgag atcaaccacaaaaagcaaccccacaaagctaaacatcagtcaggataccatgaggcgttg agccttagggagaaaggcagagatggaagcaatatgctatctggaactcccaaagcctct ctgctgtcctctcaagagagtactgctcatcttcccacacccaatcaaggaacctcaaaa tcagtgggtccagaactaaactcctcttcttcctccacaaactttctacccgagttccct atctcaggaaatgggaccccatcatccaacgagttgttcaagccagaaacccctgaaact ccctctctgtcattccacacagcatcatcaccagatctttgtttcaagggaaaggtacac cctcatctgcccggcaaattaagtagaaaggaaactgagtcacgtacgttttgggggaaa acgggctgcgtccgctttcgtctaaactgcgctcgtaaagtggggcagaggcgctggggg cactcaaatccgaggtccttacttctggattccttttccttttccgagttaaaccgctcc cctgagccgcacggctctctcagggaggattttccaactcgtcccaagttcgcgaggagg caggcgggtcccggggcagctgagcgcacgaagcgggcgtggggcggcgagccggcggcc tcgggggcgccgagcgaggggccgcgtcagaaatccaatgacttgtcagccagagtaacg gctgttgtcaggtcttcagggtctcgaggtcggtttcttcctaacgccgccaaccccaag ctaaggatagaggcactctccattcctgattgttccaaggctcaagtggtcagaaatgac tgggaagaagggagaaaaactcaggtggcaaaggacatggacgaaaacccatggcgacat ggaaagaaaaccctgtga >gi568815588f:35037836_35311800|GENSCAN_predicted_peptide_4|85_aa MGIEFNWSQVSGLGSPAGKAANTHSFPAGFLRYSVELYLPSNTFCYTERQAFRASANQEY TMKIIRDLATVELGFGCQAPVLLMD >gi568815588f:35037836_35311800|GENSCAN_predicted_CDS_4|258_bp atgggtattgagtttaactggagccaggtgtctgggctgggcagtccagctggtaaagct gccaacacccactcgttccctgcaggttttctgcggtattctgtagagctgtatctccct agcaacaccttttgttacactgagagacaggcatttagagcttctgctaatcaagaatat actatgaaaatcatcagagacctagcaactgttgagcttggttttggctgtcaggcccct gttctgctgatggattga >gi568815588f:35037836_35311800|GENSCAN_predicted_peptide_5|456_aa MAKIRNQPRCSSVDEWIKKVWRPIEEDYSSGDVEEKVSVAGSGTRRGSPAVTLVQLPSGQ TIHVQGVIQTPQPWVIQSSEIHTVQVAAIAETDESAESEGVIDSHKRREILSRRPSYRKI LNELSSDVPGVPKIEEERSEEEGTPPSIATMAVPTSIYQTSTGQYIAIAQGGTIQISNPG SDGVQGLQALTMTNSGAPPPGATIVQYAAQSADGTQQFFVPGSQVVVQALSFISWDRPCP LSLQGLLHTYRLPDKNTFPEEALPILHIWLRQPSVSSYCTSLGTYHVVLWLSVLLDDEET ELAPSHMAAATGDMPTYQIRAPTAALPQGVVMAASPGSLHSPQQLAEEATRKRELRLMKN REAARECRRKKKEYVKCLENRVAVLENQNKTLIEELKALKDLYCHKVEEAAKECRRRKKE YVKCLESRVAVLEVQNKKLIEELETLKDICSPKTDY >gi568815588f:35037836_35311800|GENSCAN_predicted_CDS_5|1371_bp atggccaagatacggaatcagcctcggtgttcatcagtggatgaatggataaagaaagta tggcgtcctatagaagaggattattcttcaggggatgtggaagaaaaggtttctgtggct ggatcaggcaccagaagaggctccccagctgtaactctagtgcagttaccttcgggccaa actatacatgtccagggagtaattcagacaccacagccatgggttattcagtcatcagaa atacacaccgttcaggtagcagcaattgcagagacagatgaatctgcagaatcagaaggt gtaattgattctcataaacgtagagaaatcctttcacgaagaccctcttataggaaaata ctgaatgaactgtcctctgatgtgcctggtgttcccaagattgaagaagagagatcagag gaagaaggaacaccacctagtattgctaccatggcagtaccaactagcatatatcagact agcacggggcaatacattgctatagcccaaggtggaacaatccagatttctaacccagga tctgatggtgttcagggactgcaggcattaacaatgacaaattcaggagctcctccacca ggtgctacaattgtacagtacgcagcacaatcagctgatggcacacagcagttctttgtc ccaggcagccaggttgttgttcaagctctttctttcatttcctgggataggccatgccct ctctccctccagggccttcttcacacataccgattgcctgataagaatacgttccccgag gaagccttgcccatcctccacatctggcttaggcaaccttctgtgagttcctactgtact tccttaggcacatatcatgttgtgttgtggctctctgtcttgctagatgatgaggaaact gaacttgccccaagtcacatggctgctgccactggtgacatgccaacttaccagatccga gctcctactgctgctttgccacagggagtggtgatggctgcatcgcccggaagtttgcac agtccccagcagctggcagaagaagcaacacgcaaacgagagctgaggctaatgaaaaac agggaagctgcccgggagtgtcgcaggaagaagaaagaatatgtcaaatgtcttgaaaat cgtgtggctgtgcttgaaaaccaaaacaagactctcattgaggaactcaaggccctcaaa gatctttattgccataaagtagaggaagctgccaaagaatgtcgacgtcgaaagaaagaa tatgtaaaatgtctggagagccgagttgcagtgctggaagtccagaacaagaagcttata gaggaacttgaaaccttgaaagacatttgttctcccaaaacagattactag >gi568815588f:35037836_35311800|GENSCAN_predicted_peptide_6|86_aa MRIMVTASTLVGLLGELNVEAPWRRVSTLYTVAVILDCHDYVAVASVELHTHLEESKPRM GKCSVPVPDVPSVYKPLFQKPSFIRH >gi568815588f:35037836_35311800|GENSCAN_predicted_CDS_6|261_bp atgaggataatggtaacagcatcaacacttgttgggctgctaggagaattaaatgtggaa gccccttggcgcagggtgagcactctgtacacggtagctgttattcttgactgccatgac tacgtggctgtagcatctgtggagctacatacacacctggaagaaagcaagcccaggatg ggtaagtgctcagtcccagtgccagacgtccccagtgtctacaagccactcttccagaag cccagttttattcgacattga >gi568815588f:35037836_35311800|GENSCAN_predicted_peptide_7|492_aa MNQEAEAQVASVLPEGTRLVSSSTEALQPSPAARLLCHVENEGGKKEKEKEGRGHVGEGE RACGRRGEDSEQTGEYQKDQLFPNSFCHDACFMLNDPQTSFMCGFALVPSWSPKQGQSSV RRSWNGPKAREGAGGNPGRPPSSGMISGNGNIVQRSKMPGANFHAVLNDKSKMVGQCHIS EAVTKSRVIISMFTFKRFVFWAPTFNISICFELIFVYGVRPTASSVGLCWDMTYYWSVIS IVSFELLTVPNDRMRMGGRDPGHLDIPQNKTLVLYVDDLVLIGPAEEEMASSLEALIKHM PFAGSSFKAGRERHLQALGVGLSLLSPPALNAGCTKQKNSRAEKHNKMKNSLEGFKGRFE QAEGGVSKVKEKTVEIIESKEQKEKRLKKNLQALSLNNLPLSVDVQQTFGKVSLGPYSVG VTDRILKLLYFSGSHSPGNPGPDKASEGPFRPAEPKLTENGKCRFLSQLVPGDETGRASG PQAHAFFPLSTV >gi568815588f:35037836_35311800|GENSCAN_predicted_CDS_7|1479_bp atgaatcaggaagcagaggcccaggttgccagcgtgttacccgagggcacaaggttagtg agcagcagcactgaagcactgcagcccagcccagcagcaaggttgttatgtcatgtagaa aatgaaggaggaaagaaagaaaaggagaaggaggggagagggcatgtgggagaaggggag agggcatgtgggagaaggggtgaggacagtgaacagaccggggagtaccagaaagaccag ctcttcccaaacagcttctgccatgatgcttgcttcatgctaaatgacccacaaacttca ttcatgtgtgggtttgcccttgtgccttcttggtcccccaagcagggacagagttctgtt cggagaagctggaatggaccgaaagccagggaaggagctggtggaaacccaggcaggcct ccctcaagtggcatgatctcaggaaatggaaatattgtacaaagaagcaagatgcctggc gccaacttccatgctgtgctgaatgacaaatccaagatggttggccagtgtcatatctca gaagctgtcactaaatccagagtcattatttctatgtttacttttaagaggtttgtattt tgggctcctacatttaatatttcaatctgttttgagttaatttttgtgtatggtgtgagg cccacggcatcctcagttggactttgctgggacatgacctattactggtctgtaatcagc attgtatcttttgagctcttgacagtccctaatgataggatgaggatggggggaagggac ccgggccatctggacattccacagaataagacattggtcctctatgttgatgacctagtg ctaattggacctgcggaggaagagatggcaagttcattggaggccttgataaaacatatg ccatttgcaggctcctcattcaaagccggccgggagcggcaccttcaggcgcttggcgtg ggcctctcgctgctgtctcccccagctctcaacgctggctgcacgaaacaaaaaaattct agagctgaaaagcacaataaaatgaaaaattcactagagggattcaaaggcagatttgag caggcagaaggaggagtcagtaaagttaaagaaaagacagtggaaattattgagtctaag gaacagaaggaaaaaagattgaagaaaaatttgcaagctttgtctcttaataatctgcct ttgtctgttgatgttcagcaaacctttgggaaggtttcccttggcccctacagtgttggc gtcacagacaggatcctaaagctgctctacttttctggaagccacagtccagggaaccca ggacctgacaaagcatcagaagggccctttagacctgcagaacccaaacttacagagaat gggaaatgccgattcctttcccaactggtccctggggatgagacaggcagggcctctggt ccccaggcccatgcattcttcccactgagcactgtgtaa