GENSCAN 1.0 Date run: 4-Nov-116 Time: 22:15:10 Sequence gi568815584r:34985593_35221959 : 236367 bp : 40.64% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 11118 11195 78 2 0 85 87 58 0.949 6.51 1.02 Intr + 13966 14057 92 0 2 114 97 95 0.894 10.87 1.03 Intr + 25917 26067 151 0 1 50 92 166 0.825 12.44 1.04 Intr + 27754 27902 149 0 2 44 85 139 0.803 7.31 1.05 Intr + 28210 28310 101 1 2 108 76 46 0.963 4.23 1.06 Intr + 29152 29238 87 2 0 113 31 113 0.724 7.12 1.07 Intr + 33399 33482 84 1 0 26 116 53 0.250 0.77 1.08 Intr + 37318 37488 171 2 0 44 116 134 0.944 10.79 1.09 Intr + 48606 48751 146 1 2 90 22 88 0.001 1.48 1.10 Intr + 60416 60538 123 1 0 27 86 105 0.008 4.06 1.11 Intr + 60836 61036 201 1 0 37 96 206 0.008 14.86 1.12 Intr + 67686 67859 174 2 0 28 94 186 0.039 12.41 1.13 Intr + 76224 76314 91 2 1 46 39 106 0.021 0.15 1.14 Intr + 93335 93432 98 0 2 70 69 161 0.997 11.21 1.15 Term + 95430 95636 207 2 0 92 38 248 0.977 16.56 1.16 PlyA + 96113 96118 6 1.05 2.08 PlyA - 96320 96315 6 1.05 2.07 Term - 100186 99998 189 1 0 112 39 203 0.997 14.27 2.06 Intr - 105615 105478 138 0 0 104 110 44 0.992 7.94 2.05 Intr - 109592 109456 137 0 2 82 22 119 0.959 4.07 2.04 Intr - 113792 113660 133 2 1 109 98 16 0.524 4.00 2.03 Intr - 125037 124933 105 0 0 40 110 83 0.908 5.19 2.02 Intr - 131145 131018 128 1 2 72 110 46 0.608 4.68 2.01 Init - 131358 131322 37 0 1 57 34 53 0.182 -2.98 2.00 Prom - 134242 134203 40 -6.75 3.00 Prom + 135943 135982 40 -7.55 3.01 Init + 136598 136690 93 1 0 59 13 105 0.080 0.53 3.02 Intr + 136891 137043 153 0 0 66 84 102 0.095 6.95 3.03 Intr + 137105 137175 71 1 2 44 72 86 0.050 -0.34 3.04 Intr + 137219 137318 100 2 1 56 94 84 0.093 4.99 3.05 Intr + 137360 137517 158 1 2 100 56 65 0.068 2.39 3.06 Intr + 137683 138639 957 1 0 32 76 393 0.067 21.75 3.07 Intr + 141887 142019 133 2 1 32 109 154 0.998 11.63 3.08 Intr + 150418 150666 249 0 0 1 18 254 0.318 6.41 3.09 Intr + 154666 154773 108 0 0 75 96 40 0.280 3.06 3.10 Intr + 173067 173162 96 2 0 71 42 78 0.024 0.79 3.11 Term + 176492 176515 24 1 0 135 32 18 0.046 -1.85 3.12 PlyA + 176816 176821 6 1.05 4.03 PlyA - 176904 176899 6 1.05 4.02 Term - 191394 190019 1376 1 2 17 42 398 0.070 18.59 4.01 Init - 193981 193558 424 1 1 95 77 197 0.208 15.85 4.00 Prom - 197530 197491 40 -3.55 5.04 PlyA - 198706 198701 6 1.05 5.03 Term - 214470 214323 148 2 1 120 38 29 0.245 -2.51 5.02 Intr - 221735 221570 166 0 1 106 99 60 0.577 7.00 5.01 Init - 233727 233547 181 0 1 96 54 125 0.338 7.30 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 54271 54177 95 2 2 85 49 80 0.864 0.81 S.002 Intr - 61025 60888 138 0 0 71 119 228 0.916 23.71 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584r:34985593_35221959|GENSCAN_predicted_peptide_1|650_aa MVLADLGRKITSALRSLSNATIINEEVLNAMLKEVCTALLEADVNIKLVKQLRENVNYTE MDPVIIASEGVEKFKNENFEIIIVDTSGRHKQEDSLFEEMLQVANAIQPDNIVYVMDASI GQACEAQAKAFKDKVDVASVIVTKLDGHAKGGGALSAVAATKSPIIFIGTGEHIDDFEPF KTQPFISKLLGMGDIEGLIDKVNELKLDDNEALIEKLKHDFMSKGNEQESMARLKKLMTI MDSMNDQELDSTDGAKVFSKQPGRIQRVARGSGVSTRDVQELLTQYTKFAQMVKKMGGIK GLFKGNSPPPGCFHRLVLSVAFPGAQCKQLVDLPFWRLEDGGPFLTAPLSSAPPLKYNDT AVFILQVQKPRSCGVTAPDAAGDGLRRGRDARLQPGSASDWAGRPRMEVGLPAITLFLTS ASSPVVATTMDQEPVGGVERGEAVAASGAAAAAAFGESAGQMSNERGFENVELGVIGKKK KVPRRVIHFVSGETMEEYSTDEDEVDGLEKKDVLPTVDPNLPLLPYELGKGDQGPGVLSG AMLKVEPPFLCDFLGEKIASVLGISTPKYQYAIDEYYRMKKEEEEEEEENRMSEEAEKQY QQNKLQTDSIVQTDQPETVISSSFVNVNFEMEGDSEVIMESKQNPVSVPP >gi568815584r:34985593_35221959|GENSCAN_predicted_CDS_1|1953_bp atggttctagcagaccttggaagaaaaataacatcagcattacgctcgttgagcaatgcc accattatcaatgaagaggtattgaatgctatgctaaaagaagtctgtaccgctttgttg gaagcagatgttaatattaaactagtgaagcaactaagagaaaatgttaactatacagaa atggatcctgtcatcattgcttctgaaggagtagagaaatttaaaaatgaaaattttgaa attattattgttgatacaagtggccgccacaaacaagaagactctttgtttgaagaaatg cttcaagttgctaatgctatacaacctgataacattgtttatgtgatggatgcctccatt gggcaggcttgtgaagcccaggctaaggcttttaaagataaagtagatgtagcctcagta atagtgacaaaacttgatggccatgcaaaaggaggtggtgcactcagtgcagtcgctgcc acaaaaagtccgattattttcattggtacaggggaacatatagatgactttgaacctttc aaaacacagccttttattagcaaacttcttggtatgggcgacattgaaggactgatagat aaagtcaacgagttgaagttggatgacaatgaagcacttatagagaagttgaaacatgat tttatgagcaaaggaaatgaacaggagtcaatggcaaggctaaagaaattaatgacaata atggatagtatgaatgatcaagaactagacagtacggatggtgccaaagtttttagtaaa caaccaggaagaatccaaagagtagcaagaggatcgggtgtatcaacaagagatgttcaa gaacttttgacacaatataccaagtttgcacagatggtaaaaaagatgggaggtatcaaa ggacttttcaaagggaacagcccccctcccggctgctttcacaggctggtgttgtctgtg gcttttccaggtgcacagtgcaagcagttggtggatctaccattctggcgtctggaggat ggtggcccttttctcacagctccactaagtagtgccccacctttgaagtacaacgacaca gctgtcttcattttacaagtgcaaaaaccgaggtcctgcggggtgactgccccagacgcc gctggtgacgggctgcgccgaggtcgagacgccaggcttcagcctggctcggccagcgac tgggcggggagaccaaggatggaagtgggcttaccggccattaccctctttctcaccagc gccagcagccctgtggtggcgacgacgatggaccaggagccagtgggcggtgtggaacga ggagaagccgtcgcagcctcgggagctgcggccgccgcggcattcggggaatctgcaggg cagatgagtaacgaaagaggctttgaaaatgtagaactgggagtcataggaaaaaagaag aaagtcccaaggagagtcatccactttgttagtggtgaaacaatggaagaatatagcaca gatgaagacgaagttgatggcctggagaagaaagatgttttgcctactgttgatccgaac ttgcctctgctgccttatgaactgggtaaaggcgatcagggccctggtgttctcagtggt gccatgctcaaggtagagcctccattcttgtgtgacttccttggagagaagattgcatct gttttgggtatcagcaccccaaagtaccaatatgccattgatgaatattatcggatgaag aaggaggaagaagaagaagaagaagaaaacaggatgtctgaagaagcagaaaaacaatat caacagaataaattgcagactgattccattgttcagacagatcaaccagagacagtgata tccagctcatttgtgaatgtcaattttgaaatggagggagacagtgaagtaattatggaa agcaagcaaaatccagtctctgtcccaccataa >gi568815584r:34985593_35221959|GENSCAN_predicted_peptide_2|288_aa MVEDVEEAASVEEKKSEQELKDEEMDLFTKYYSEWKGGRKNTNEFYKTIPRFYYRLPAED EVLLQKLREESRAVFLQRKSRELLDNEELQDLENYILELIPTLPQLDGLEKSFYSFYVCT AVRKFFFFLDPLRTGQYLNLDKDHNGMLSKEELSRYGTATMTNVFLDRVFQECLTYDGEM DYKTYLDFVLALENRKEPAALQYIFKLLDIENKGYLNVFSLNYFFRDEIFDMVKPKDPLK ISLQDLINSNQGDTVTTILIDLNGFWTYENREALVANDSENSADLDDT >gi568815584r:34985593_35221959|GENSCAN_predicted_CDS_2|867_bp atggttgaagatgtagaggaggctgcttcagtagaggaaaaaaaaagtgaacaagaatta aaagatgaagaaatggatttatttacaaaatattactccgaatggaaaggaggtagaaaa aacacaaatgaattctataagaccattccccggttttattataggctgcctgctgaagat gaagtcttactacagaaattaagagaggaatcaagagctgtctttctacaaagaaaaagc agagaactgttagataatgaagaattacaggatttagaaaactacatattggaacttatc cctacgttgccacaattagatggtctggaaaaatctttctactccttttatgtttgtaca gcagttaggaagttcttcttctttttagatcctttaagaacaggccagtacttgaatctt gataaagatcacaatggcatgctcagtaaagaagaactctcacgctatggaacagctacc atgaccaatgtcttcttagaccgtgttttccaggagtgtctcacttatgatggagaaatg gactataagacctacttggactttgtccttgcattagaaaacagaaaggaacctgcagct ctacaatatattttcaaactgcttgatattgagaacaaaggatacctgaatgtcttttca cttaattatttctttagggatgaaatctttgacatggtaaaaccaaaggatcctttgaaa atctctcttcaggatttaatcaacagtaatcaaggagacacagtaaccaccattctaatc gatttgaatggcttctggacttacgagaacagagaggctcttgttgcaaatgacagtgaa aactctgcagaccttgatgatacatga >gi568815584r:34985593_35221959|GENSCAN_predicted_peptide_3|713_aa MVLLQRRPKLTKPDFKGHDKPAFSFIVCLWRAERLSPGQSRPSVGAGAVHRGHCSFRRVY GRVKSECRFELLNEVNFICQRSLTRHLLPPRPPTGGVLRSVESYAGLELGVPGYGFGFTP PFQQASRGNLFLFRTAVIHVHELECKRHQRIPALSPGLRLATLDIPGLLFNRENSPAFLL LSSPKSRTLICLFPKLWKSPYLGLGPGHSYVSLFLADRCGIRNQQRLFSLKTMSPQNTKA TNLIAKARYLRKDEGSNKQVYSVPHFFLAGAAKERSQMNSQTEDHALAPVRNTIQLPTQP LNSEEWDKLKEDLKENTGKTSFESWIISQMAGCHSSIDVAKSLLAWVAAKNNGIVSYDLL VKYLYLCVFHMQTSEVIDVFEIMKARYKTLEPRGYSLLIRGLIHSDRWREALLLLEDIKK VITPSKKNYNDCIQGALLHQDVNTAWNLYQELLGHDIVPMLETLKAFFDFGKDIKDDNYS NKLLDILSYLRNNQLYPGESFAHSIKTWFESGQCSGCGKTIESIQLSPEEYECLKGKIMR DVIDGGDQYRKTTPQRDLSGSWLEKGVEKDKIEALELSQTAAEWSRCGMMREMKARIRVE KDQLSFENILGDKNFTIWFGCSDGRRQDDVQVFGLGDGKLPNCFPEWLYHFTFPPAEYEG YSSPTSWLMLSMVKVVHRFTINVVDDELPAITIIYCISKSYLVSTILYGMTII >gi568815584r:34985593_35221959|GENSCAN_predicted_CDS_3|2142_bp atggttctccttcagaggcgacccaagctcaccaagccggattttaaaggccacgataag cccgcgttttcctttatcgtgtgcctttggcgcgcggaacgcctaagtccgggtcagtct cgtccgtcggtcggggctggcgcggtgcatcgtgggcactgtagtttccgccgcgtttat ggccgcgttaagtctgagtgccgctttgagttgttgaatgaagtgaacttcatttgtcag cgttcgctgactcgccacctcctccctcctcgtccccccaccggaggagttttgcggtct gtagagagctatgcagggctggagttgggggtccctggatacggttttggctttacaccc cctttccagcaagcttcccgtgggaatctgttccttttcaggacagctgtgatccacgtt catgaactggaatgtaagaggcaccagaggattcctgctctgtcccctggtttgcggctt gcgacgttggacatccccggattgttgtttaatagagaaaactcacctgccttcttgctt ttaagtagccccaaaagcagaaccttgatttgtctctttccgaagctttggaagagccca taccttgggctaggcccagggcactcttatgtctcgctgtttctggcagaccgctgtggc atcaggaaccagcagaggttgttttctcttaaaacaatgtctccacagaataccaaagca acgaatctgattgccaaggccagatatctcaggaaagatgagggcagtaataagcaagtt tattctgttcctcatttttttttagctggagcagctaaggagagatcacagatgaattct caaactgaagatcatgccttggcacctgtgaggaacactattcaactcccaacacaacct ttgaattcagaggagtgggataaacttaaggaagatttaaaagaaaacaccggaaagacc agtttcgaaagttggatcatttcacagatggctggctgtcatagctctatagatgtggct aaatctctgctggcatgggtagcagccaaaaataatggtattgtaagttacgatttactg gtcaagtatttgtatctctgtgtctttcatatgcagacatctgaagttattgatgtcttt gaaattatgaaagccagatataagactttagaacctagaggttacagtcttctcatccgg ggattgatccattcagacagatggagagaagcattgttgctgttagaggacatcaaaaaa gttataactccttcaaaaaagaactataatgactgtatccagggagctctccttcatcaa gatgtaaacacagcttggaatttatatcaggaattgctaggtcatgatattgttcctatg ttggaaactttaaaagctttctttgattttggaaaagacataaaggatgataactattca aataaactactagatattctttcatatctaagaaataatcagctgtatccaggggagtca tttgcacacagtataaaaacatggtttgagagtggccagtgttcgggctgtggaaaaacc atagagtctattcagctgagtccagaagaatatgaatgtcttaagggaaaaatcatgagg gatgtgatagatggaggtgaccagtacagaaagacaacacctcagagagacctctctggc agctggttggaaaaaggagttgaaaaggataagattgaggcactggagttgagtcagacg gctgctgagtggtccaggtgtgggatgatgagggaaatgaaagccagaataagggtagag aaggaccagctcagctttgagaacattttaggagataaaaatttcaccatttggtttggc tgcagtgatgggagaagacaggatgatgttcaggttttcggcttgggtgatgggaaactg cccaactgttttccagagtggttgtaccattttacatttccaccagcagagtatgagggc tatagtagccccacatcctggctgatgcttagtatggttaaggtagtccacagatttact atcaatgtagttgatgacgagctgccagcaattactattatctactgcatctccaaatcc tatttggtatcaactattctgtatggaatgaccataatttaa >gi568815584r:34985593_35221959|GENSCAN_predicted_peptide_4|599_aa MARELRDECTSFSNRLDQLEERVSVMEDEMNEMKQDEKFREKRGKRNEQSLQEIWDYMKR PNLCLIGVPESDRENGTKLENTAGYYPGELPQPSRQATIQIQEIQRMPQRYSSRRATPRY IIVRFTKVEMKEKMLRAAREKDAEKAFDKIQQPFTLKTLNKLGIDGTYLKIIRAIYDKPT ANVILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARTIRQEKEIKGIQLGKEEVKLS LFADDMIVYLENPIVSAQNLLKLISSFSKVSGYKINVQKSQAFLHTNNRQTESQIMSELP FKIATKRIKYLGIQLTRDVKDLFQENYKPLLNEIKKDTNKWKNIPCSWIGRISIVKMAIL PKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRACIAKTILSQKNKAGGIKLHDCKLY YKVIVTKTAWYWYQNRDIDQWNRTEPSEIMPLIYNYLIFDKPDKNKQWGKDSLFNKWCWE NWLAICRKLKLDPFLTPYTKINSSWIKDLNVRPKTIKTLEENLGNTIQDIGMGKDFMSKT LKAMATKAKIDKWDLIKLKSFCTAKEPTIRVNRQPTEWEKIFAIYSSDKGLISRIYKEL >gi568815584r:34985593_35221959|GENSCAN_predicted_CDS_4|1800_bp atggcacgagaactacgtgacgaatgcacaagcttcagtaaccgactcgatcaactggaa gaaagggtatcagtgatggaagatgaaatgaatgaaatgaagcaagatgagaagtttaga gaaaaaagaggaaaaagaaatgaacaaagcctccaagaaatatgggactatatgaaaaga ccaaatctatgtctgattggtgtacctgaaagtgacagggagaatggaaccaagttggaa aacactgcaggatattatccaggagaacttccccaacctagcaggcaggccaccattcaa attcaggaaatacagagaatgccacaaagatactcctcaagaagagcaactccaagatac ataattgtcagattcactaaagttgaaatgaaggaaaaaatgttaagggcagctagagag aaagatgcagaaaaggcctttgacaaaattcaacagcccttcacgctaaaaactctcaat aaattaggtattgatgggacgtatctcaaaataataagagctatttatgacaaacccaca gccaatgtcatactgaatgggcaaaaactggaagcattccctttgaaaaccggcacaaga cagggatgccctctctcaccacttctattcaacatagtgttggaagttctggccaggaca atcaggcaggagaaagaaataaagggtattcaattaggaaaagaagaagtcaaattgtcc ctgtttgcagatgacatgatcgtatatttagaaaaccccattgtctcagcccaaaatctc cttaagctgataagcagcttcagcaaagtctcaggatacaaaatcaatgtacaaaaatca caagcattcttacacaccaataacagacaaacagagagccaaatcatgagtgaactccca ttcaaaattgctactaagagaataaaatacctaggaatccaacttacaagggatgtgaag gacctcttccaggagaactacaaaccactgctcaacgaaataaaaaaggacacaaacaaa tggaagaacattccatgctcatggataggaagaatcagtattgtgaaaatggccatactg cccaaggtaatttatagattcaatgccatccccatcaagctaccaatgactttcttcaca gaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcctgcattgccaag acaatcctaagccaaaagaacaaagctggaggcatcaagctacatgactgcaaactatac tacaaggttatagtaaccaaaacagcatggtactggtaccaaaacagagatatagaccaa tggaacagaacagagccctcagaaataatgccgcttatctacaactatctgatctttgac aaacctgacaaaaacaagcaatggggaaaggattccctatttaataaatggtgctgggaa aactggctagccatatgtagaaagctgaaactggatcccttccttacaccttatacaaaa attaattcaagttggattaaagacttaaatgttagacctaaaaccataaaaactctagaa gaaaacctaggcaataccattcaggacataggcatgggcaaggacttcatgtctaaaaca ctaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaattaaactcaagagc ttctgcacagcaaaagaacctaccatcagagtgaacaggcaacctacagaatgggagaaa atttttgctatctactcatctgacaaagggctaatatccagaatctacaaagaactctaa >gi568815584r:34985593_35221959|GENSCAN_predicted_peptide_5|164_aa MQRVLRGSATCTMFLTAFLCCSRIPGRQWTGKHQWSQTVWRTLLAGRALSDHRTGVRSQH GSKPLIKQNCKVFLFYIQELNFESKELKRVGNNISACISCTDINICMCPEGSGIARIILV HLKTNGCFHLSSKLKPNKITCYLFPQQMRKFSGGFPDITEAFIT >gi568815584r:34985593_35221959|GENSCAN_predicted_CDS_5|495_bp atgcagagagtcctaaggggctcggccacatgcaccatgttcctgactgccttcctctgc tgcagtcgcattcctggcaggcagtggactgggaaacaccagtggtcgcagactgtatgg agaaccttactggctgggcgtgccctatctgatcacaggacaggagtaaggtcacagcat ggaagtaagcctttaataaagcagaattgtaaggtctttttattttacattcaagaactg aattttgagagtaaagagttaaagcgtgtaggcaataacatttcagcatgtatctcatgt actgatataaatatatgcatgtgcccagagggctccggcatcgccaggattattttggtt catttgaagacgaatggctgtttccatttatcctccaagcttaaacccaataagattacc tgttatctctttcctcaacagatgagaaaattttcagggggttttcctgatataacagag gctttcataacataa