GENSCAN 1.0 Date run: 7-Nov-116 Time: 22:28:19 Sequence gi568815575f:134373358_134600074 : 226717 bp : 40.94% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 4261 4398 138 0 0 46 115 101 0.980 8.79 1.02 Intr + 4648 4749 102 0 0 64 119 47 0.956 4.85 1.03 Intr + 20144 20277 134 1 2 82 -28 69 0.031 -6.78 1.04 Intr + 20552 20595 44 2 2 103 83 40 0.093 1.97 1.05 Intr + 30496 30789 294 2 0 39 86 192 0.680 10.16 1.06 Intr + 40134 40300 167 1 2 27 92 151 0.554 8.16 1.07 Intr + 40466 40609 144 1 0 89 103 87 0.999 9.86 1.08 Intr + 41659 41763 105 0 0 108 26 94 0.962 4.69 1.09 Intr + 43812 43945 134 2 2 88 71 68 0.900 3.62 1.10 Term + 51844 51973 130 1 1 59 43 223 0.970 11.57 1.11 PlyA + 53579 53584 6 1.05 2.00 Prom + 69365 69404 40 -4.05 2.01 Init + 75650 75652 3 1 0 113 81 0 0.336 1.85 2.02 Intr + 77604 77769 166 2 1 -16 80 124 0.384 -0.19 2.03 Intr + 79214 79355 142 0 1 125 -16 70 0.261 -1.21 2.04 Intr + 80428 80542 115 1 1 54 111 136 0.826 12.03 2.05 Intr + 86403 86482 80 2 2 53 72 21 0.008 -5.57 2.06 Intr + 86810 86981 172 2 1 39 77 136 0.005 6.62 2.07 Intr + 100002 100108 107 2 2 85 111 97 0.998 9.79 2.08 Intr + 101824 102007 184 1 1 71 56 144 0.978 8.37 2.09 Intr + 113108 113173 66 1 0 61 95 59 0.547 2.08 2.10 Intr + 116831 116848 18 1 0 130 100 6 0.538 1.89 2.11 Intr + 120151 120233 83 0 2 59 95 62 0.658 1.52 2.12 Intr + 125033 125079 47 2 2 62 93 72 0.801 2.23 2.13 Intr + 125251 125327 77 2 2 62 101 68 0.911 3.82 2.14 Term + 156863 157033 171 1 0 94 36 148 0.720 7.04 2.15 PlyA + 158808 158813 6 1.05 3.12 PlyA - 160478 160473 6 -0.45 3.11 Term - 161701 161583 119 2 2 84 42 97 0.019 2.42 3.10 Intr - 168833 168672 162 0 0 72 68 59 0.281 1.33 3.09 Intr - 171484 171357 128 0 2 92 94 112 0.864 11.60 3.08 Intr - 176308 175855 454 2 1 54 46 204 0.086 4.19 3.07 Intr - 176725 176518 208 1 1 71 -48 277 0.432 10.13 3.06 Intr - 182466 182272 195 0 0 39 74 82 0.087 0.59 3.05 Intr - 189797 189726 72 2 0 93 72 51 0.127 2.68 3.04 Intr - 193270 192801 470 2 2 106 32 290 0.007 17.09 3.03 Intr - 211604 211541 64 2 1 84 80 64 0.067 2.57 3.02 Intr - 215221 215044 178 0 1 77 47 55 0.057 -0.70 3.01 Intr - 225041 224842 200 2 2 40 82 133 0.257 5.13 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 86627 86524 104 0 2 56 41 165 0.874 6.06 S.002 Term - 138786 138680 107 1 2 96 42 71 0.818 0.89 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575f:134373358_134600074|GENSCAN_predicted_peptide_1|463_aa MSSSVEQKKGPTRQRKCGFCKSNRDKECGQLLISENQKVAAHHKCMLFSSALVSSHSDNE SLGGFSIEDVQKEIKRGTKLMCSLCHCPGATIGCDVKTCHRTYHYHCALHDKAQIREKPS QGIYMVYCRKHKKTAHNSEAWFTEYFKPTLDTYCSEKKIPFKILLLIDNASGHPRALMEI YKEMNVVFMHASTAFIPQFLEPVDQGVLWTFKSYYLRNTFHEAVAAIHSDFCDESGQADL EESFNEHELEPSSPKSKKKSRKGRPRKTNFKGLSEDTRSTSSHGTDEMESSSYRDRSPHR SSPSDTRPKCGFCHVGEEENEARGKLHIFNAKKAAAHYKCMLFSSGTVQLTTTSRAEFGD FDIKTVLQEIKRGKRMKCTLCSQPGATIGCEIKACVKTYHYHCGVQDKAKYIENMSRGIY KLYCKNHSGNDERDEEDEERESKSRGKVEIDQQQLTQQQLNGN >gi568815575f:134373358_134600074|GENSCAN_predicted_CDS_1|1392_bp atgtcaagctcagttgaacagaaaaaagggcctacaagacagcgcaaatgtggcttttgt aagtcaaatagagacaaggaatgtggacagttactaatatctgaaaaccagaaggtggca gcgcaccataagtgcatgctcttttcatctgctttggtatcatcacactctgataatgaa agtcttggtggattttctattgaagatgtccaaaaggaaattaaaagaggcacgaagctg atgtgttctttgtgccattgtcctggagcaacaattggttgtgatgtgaaaacatgtcac aggacataccactaccactgtgcattgcatgataaagctcaaatacgagagaaaccttca caaggaatttacatggtctattgccgaaaacacaagaaaactgcacataactccgaagca tggtttactgaatattttaagcccactcttgacacctactgctcagagaaaaagattcct ttcaaaatattactgcttattgacaatgcatctggtcacccaagagctctgatggagatc tacaaggagatgaatgttgtcttcatgcatgctagtacagcattcattccgcagttctta gagcccgtggatcaaggagtactttggactttcaagtcttattatttaagaaatacattt catgaggctgtagctgccatacatagtgatttctgtgatgaatctgggcaagctgattta gaagaaagttttaatgaacatgaactggagccctcatcacctaaaagtaaaaagaaaagt cgcaaaggaaggccaagaaaaactaattttaaagggctgtcagaagataccaggtccaca tcctcccatggaacagatgaaatggaaagtagttcctatagagataggtctccacacaga agcagccctagtgacaccaggcctaaatgtggattttgccatgtaggggaggaagaaaat gaagcacgaggaaaactgcatatatttaatgccaagaaggcagctgcccattataagtgc atgttgttttcttctggcacagtccagctcacaacaacatcaagagcagaatttggagac tttgatattaaaactgtacttcaggagattaaacgaggaaaaagaatgaaatgtacactt tgcagtcagcctggtgctactattggatgtgaaataaaagcctgtgttaagacttaccat taccactgtggagtacaagacaaagctaaatacattgaaaatatgtcacgaggaatttac aaactatactgtaaaaatcatagtggaaatgatgagagagatgaagaagatgaggaacga gagagtaaaagccgaggaaaagtagaaattgatcagcaacaactaactcagcagcaactt aatggaaactag >gi568815575f:134373358_134600074|GENSCAN_predicted_peptide_2|476_aa MKGQLYKAKKVSTGFGKKAVTQSMMSFANSFVQWWVQKTDHKEFEEKMENEELDRARRYG LRLFRTLLVGITKGADRGFIQDRGGVFRAKLRSEPMNLYQSPWLWEVRGVFTIVRLAEYV VYQPAVPMPKPARYRKLPGHQLDSRSQLLQPVKGLGKHWAKSQENGSHSFRRLRRALRRT SRLSRAAPPLAAPPPPPLLRHRLPPPEQSARAPAGSVMATRSPGVVISDDEPGYDLDLFC IPNHYAEDLERVFIPHGLIMDRTERLARDVMKEMGGHHIVALCVLKGGYKFFADLLDYIK ALNRNSDRSIPMTVDFIRLKSYCNDQSTGDIKVIGGDDLSTLTGKNVLIVEDIIDTGKTM QTLLSLVRQYNPKMVKVASLLVKRTPRSVGYKPDFVGFEIPDKFVVGYALDYNEYFRDLN VWKPAGGHPTPLANTLLTPLGLWLCMPSGPCVDALMTRLMSDTVLPGCLFREMLTP >gi568815575f:134373358_134600074|GENSCAN_predicted_CDS_2|1431_bp atgaaaggacagttatataaggccaaaaaagtgtccactggatttggcaagaaagcagtc acgcagtcaatgatgtcttttgccaacagttttgtacaatggtgggtgcagaagacagat cacaaagagtttgaggagaaaatggaaaatgaggaattggatagagcaaggaggtatggc cttagactattcagaacgttactggtgggaataacaaagggagctgaccgaggatttata caggaccgtggaggggtgtttagagccaagttgagatctgagcccatgaatctgtaccag tctccatggttgtgggaagtgagaggagtattcaccattgtgcgccttgctgaatatgtg gtgtaccagccagctgtccccatgccaaagcctgcaagatacaggaagctgccaggccac cagttggactctaggtctcaactgttacaaccagttaagggtttggggaagcactgggcc aagagtcaggaaaatggaagccacagcttcaggcggctgcgacgagccctcaggcgaacc tctcggctttcccgcgcggcgccgcctcttgctgcgcctccgcctcctcctctgctccgc caccggcttcctcctcctgagcagtcagcccgcgcgccggccggctccgttatggcgacc cgcagccctggcgtcgtgattagtgatgatgaaccaggttatgaccttgatttattttgc atacctaatcattatgctgaggatttggaaagggtgtttattcctcatggactaattatg gacaggactgaacgtcttgctcgagatgtgatgaaggagatgggaggccatcacattgta gccctctgtgtgctcaaggggggctataaattctttgctgacctgctggattacatcaaa gcactgaatagaaatagtgatagatccattcctatgactgtagattttatcagactgaag agctattgtaatgaccagtcaacaggggacataaaagtaattggtggagatgatctctca actttaactggaaagaatgtcttgattgtggaagatataattgacactggcaaaacaatg cagactttgctttccttggtcaggcagtataatccaaagatggtcaaggtcgcaagcttg ctggtgaaaaggaccccacgaagtgttggatataagccagactttgttggatttgaaatt ccagacaagtttgttgtaggatatgcccttgactataatgaatacttcagggatttgaat gtttggaaacctgcaggaggccaccccacacccctggcaaacaccctccttactccactt gggctctggctttgcatgccgagcggcccctgtgtagatgccctcatgacccggctcatg tctgacaccgtcttgccaggctgcctcttcagggagatgctgactccctag >gi568815575f:134373358_134600074|GENSCAN_predicted_peptide_3|749_aa FLKHPVQFWAHTEDLLVKEQYRLKESVTSSAADELSSGDTLVYSSASTSHKLHEYDPLIQ QRQRIKITLTPHPSWACSPAVDAVIYSPAFCNHQTTTGSCGFPTQGGSNSSTHITEKRVL REAEVAGERRSACVYVSVAEFVGNRKSAGSGQSPMTVLCSIDWFMVTVHPFMLNNDVCVH FHELHLGLGCPPNHVQPHAYQFTYRVTECGIRAKAVSQDMVIYSTEIHYSSKGTPSKFVI PVSCAAPQKSPWLTKPCSMRVASKSRATAQKDEKCYEVFSLSQSSQRPNCDCPPCVFSEE EHTQMRKLRLREVEIVPLGNDITGTQTQRMCSKCQSQMDVCEANMLDMSSTTLGQHQEHQ MVPKPTLEDVSHHLYRQHSTLFSLLPVPFFPSQVLPVSTEQKASLSHLAPSRQLKGHRQR TRERGVVSVPGASGCALRYLVPLALPFAILRFFSPSPRYSSRWRLREPLRRLRRSPPRER SVLAPRSAPLRLRHAALRASAAASAPSTLRPRLSALRPCSSVPSTSRPATSPSPKSRAPL LSLSVAVRVSLARSVSPPPQPFLPPCLPRWLARSLALGRGETASQANPRNLCKKLAMKFK KFFDFGAIFEWSQSFPLSPRPVRFPLEPEGSPVPASLPERPSPRHPASSSGRLQDPGLIL CSLCHYLFIRTDYAVVTRAGNGGECVWLQKFPVKLASLMMSFGLYIGHFLGVPPANPQEA PRTRSFFTPKSVMEPSAVPGTAERVLHPG >gi568815575f:134373358_134600074|GENSCAN_predicted_CDS_3|2250_bp tttcttaaacacccagtgcagttctgggcacatactgaagatctgttagttaaagagcag tacagactcaaggaatctgtcaccagttcagcagctgacgagctcagctcaggggacact ttagtctatagttctgcttcaacatctcacaagctacacgagtatgatccactcattcaa caaagacaacgtattaaaatcaccctcaccccacatccatcctgggcctgctccccagct gtagatgcggtcatttattcaccagcattttgcaatcaccaaacaaccacaggttcctgt gggttccctacacaaggagggtctaactccagcacacacataacagaaaagagggttcta cgggaggcagaggttgcaggagaaagacgttctgcctgtgtttatgtgtcagttgcagaa tttgtgggtaacaggaaatcagccggttcaggacaaagtccaatgactgtgctgtgctcc atagactggttcatggtcacagtgcaccccttcatgctaaacaacgatgtgtgtgtacac tttcatgaactacacttgggcctgggttgccccccaaaccatgttcagccacacgcctac cagttcacctaccgtgttactgaatgtggcatcagggccaaagctgtctctcaggacatg gttatctacagcactgagatacactactcttctaagggcacgccatctaagtttgtgatc ccagtgtcatgtgctgccccccaaaagtccccatggctcaccaagccctgctccatgaga gtagccagcaagagcagggccacagcccagaaggatgagaaatgctacgaggtgttcagc ttgtcacagtccagtcaaaggcccaactgcgattgtccaccttgtgtcttcagtgaagaa gagcatacccagatgaggaaactgaggctcagagaagtggagattgtaccattaggaaat gacatcactggaactcaaacccagaggatgtgttccaagtgtcagtcccagatggatgtt tgtgaagccaacatgttagatatgagctcaacaacccttggtcagcaccaagaacaccaa atggtgcctaagccaactcttgaggatgttagccatcatctttaccgccagcacagcacc ttgttctcacttcttccggttccttttttcccttctcaggtactccctgtaagcaccgag caaaaagcttcgctcagccaccttgcgccgtcccggcagctcaaaggacaccgccagagg acccgtgagcgtggggtggtgtccgtcccaggagcctctgggtgcgcgctgcgctacctc gtgcccttggctttgccgtttgccatcctccgttttttctccccgagccctcgctattcg tctcggtggcggctccgggagcctctccgccggctgcgccgctcgcctccaagggagcgt tccgtgctggcccctcgctcggctccgctccgcctccgccacgccgctctccgagcttcg gcagcagccagcgccccctccacgctgcgaccccggctctcggctctccgtccctgtagc tccgtgcccagcacctcgcgccccgcaacctcgccgtctcccaagtcccgggccccgctt ctgtcgctgtccgtcgctgtccgtgtgtcgctcgctcggtctgtctctccgcctccccag cccttccttcctccctgtctccctcgctggcttgcacgttcgctcgctctcggacgcggc gaaacagcttcgcaggcaaatcccagaaacctttgcaaaaagcttgcaatgaaatttaag aagttcttcgatttcggcgccattttcgagtggagccagagcttccccctttctccccgt cctgtacggttccccctggagccagaaggaagcccggtgccagccagccttcctgaaaga ccaagcccgcgccatccggcttcctccagtggacgcctgcaggacccaggcctcatactg tgttccttatgtcattacctgttcatcagaaccgactatgcagtagtgaccagagcagga aatggaggtgaatgtgtttggttacaaaaattcccggtgaaactggctagtcttatgatg agctttggcctatacattggacacttcctgggtgtgcctcctgcaaatccgcaggaagca ccaagaaccaggtccttcttcacccccaaatctgtcatggagcccagcgctgttcccggc acagcagagcgggtgctccatccaggttga