GENSCAN 1.0 Date run: 4-Nov-116 Time: 22:57:00 Sequence gi568815587f:105977711_106196904 : 219194 bp : 37.71% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 4473 4635 163 2 1 56 110 86 0.182 7.54 1.02 Term + 19422 19627 206 1 2 -4 41 468 0.999 29.45 1.03 PlyA + 19820 19825 6 1.05 2.03 PlyA - 19860 19855 6 1.05 2.02 Term - 32400 31825 576 0 0 66 34 776 0.995 63.48 2.01 Init - 33207 32746 462 0 0 59 100 423 0.909 36.34 2.00 Prom - 43088 43049 40 -4.55 3.00 Prom + 57140 57179 40 -3.35 3.01 Init + 58381 58430 50 0 2 55 111 78 0.913 7.27 3.02 Term + 60407 60479 73 2 1 109 52 91 0.998 4.00 3.03 PlyA + 60501 60506 6 1.05 4.04 PlyA - 60529 60524 6 1.05 4.03 Term - 64579 64391 189 1 0 67 32 107 0.024 -0.53 4.02 Intr - 76745 75194 1552 1 1 121 60 633 0.043 52.17 4.01 Init - 79902 79823 80 0 2 60 62 39 0.096 -0.92 4.00 Prom - 87946 87907 40 -5.95 5.00 Prom + 95962 96001 40 -4.15 5.01 Init + 97328 97475 148 1 1 31 35 138 0.578 3.10 5.02 Intr + 98777 99036 260 0 2 -54 -58 345 0.785 1.96 5.03 Intr + 99153 99322 170 2 2 68 96 78 0.985 4.42 5.04 Intr + 99584 99843 260 2 2 84 24 275 0.958 16.88 5.05 Intr + 99888 100183 296 1 2 -41 34 377 0.553 15.20 5.06 Intr + 101757 101982 226 2 1 57 97 195 0.886 13.94 5.07 Intr + 112847 112968 122 0 2 97 10 25 0.063 -5.11 5.08 Intr + 113606 113767 162 1 0 29 94 117 0.929 5.65 5.09 Intr + 116873 116944 72 1 0 105 58 59 0.822 3.28 5.10 Intr + 119033 119183 151 1 1 101 42 81 0.269 3.61 5.11 Term + 123622 123743 122 2 2 41 53 81 0.035 -2.44 5.12 PlyA + 126475 126480 6 1.05 6.03 PlyA - 126689 126684 6 1.05 6.02 Term - 129681 129428 254 1 2 76 41 317 0.790 20.62 6.01 Init - 132218 132188 31 2 1 76 4 47 0.162 -4.95 6.00 Prom - 132685 132646 40 -6.55 7.00 Prom + 133734 133773 40 -5.95 7.01 Init + 137118 137324 207 2 0 66 35 123 0.575 3.68 7.02 Intr + 138266 138431 166 1 1 36 20 219 0.604 8.51 7.03 Term + 145143 145678 536 1 2 67 47 173 0.320 4.52 7.04 PlyA + 145928 145933 6 1.05 8.00 Prom + 146692 146731 40 -3.65 8.01 Init + 151580 151786 207 1 0 9 65 208 0.695 9.47 8.02 Intr + 158213 158357 145 1 1 74 66 87 0.372 4.03 8.03 Intr + 160801 160945 145 2 1 73 81 11 0.019 -2.68 8.04 Intr + 169288 169373 86 1 2 88 100 30 0.186 2.74 8.05 Term + 179309 179406 98 0 2 55 39 141 0.648 3.05 8.06 PlyA + 179637 179642 6 1.05 9.04 PlyA - 179959 179954 6 1.05 9.03 Term - 188438 188064 375 2 0 51 44 216 0.245 7.25 9.02 Intr - 196346 196183 164 0 2 17 9 131 0.047 -3.13 9.01 Init - 199476 199338 139 0 1 30 37 177 0.776 7.27 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587f:105977711_106196904|GENSCAN_predicted_peptide_1|122_aa MVPNLMIIRPMIIQVYDDMHSVQYSISYMRYSTLSNKIGFVLDDFEANISVLNTGGAGEG EGGGGGEAEAEEEEEGEGEEEKEGEEEEEGEEEEGEEEDNKRKRKRKEEEKKEEEEEEKE KK >gi568815587f:105977711_106196904|GENSCAN_predicted_CDS_1|369_bp atggtccccaacttaatgatcattcgacccatgattattcaagtttatgatgatatgcat tcagtacagtactcgataagttacatgagatattcaacacttagtaataaaataggcttt gtgttagatgactttgaggctaatataagcgttctgaacacaggaggagcaggagaagga gaaggaggaggaggaggagaagcagaagcagaagaagaggaggagggggaaggggaagag gaaaaggaaggggaagaggaggaggaaggtgaggaggaagaaggagaagaagaagacaac aagaggaagaggaagaggaaggaggaggagaagaaggaggaggaggaggaggagaaggag aagaaataa >gi568815587f:105977711_106196904|GENSCAN_predicted_peptide_2|345_aa MKQLKRKRKSNFSVQETQTLLKEITKRKEVIFSKQLNTTINVMKRMAWEEIAQCVNAVGE GEQRTGTEVKRRYLDWRALMKRKRMKANIKLVGSGFPLPSSDLDDSLTEEIDEKIGFRND ANFDWQNVADFRDAGGSLTEVKVEEEERDPQSPEFEIEEEEEMLSSVIPDSRRENELPDF PHIDEFFTLNSTPSRSAYDEPHLLVNIEKQKLELEKRRLDIEAERLQVEKERLQIEKERL RHLDMEHERLQLEKERLQIEREKLRLQIVNSEKPSLENELGQGEKSMLQPQDIETEKLKL ERERLQLEKDRLQFLKFESEKLQIEKERLQVEKDRLRIQKEGHLQ >gi568815587f:105977711_106196904|GENSCAN_predicted_CDS_2|1038_bp atgaagcagttgaaaagaaaaaggaaaagcaattttagtgttcaagaaactcagaccctt ttgaaagaaattacgaaaaggaaagaagtcattttttccaagcagctcaatacaacaatt aatgtgatgaagcgaatggcttgggaagagattgcacagtgtgtgaatgctgtaggagaa ggagaacagaggacagggacagaggtgaaaagaaggtaccttgactggcgagcacttatg aagagaaagaggatgaaggccaacattaagctggttggttcaggatttccccttccctcc tctgatttggatgactctctcactgaagagatagatgaaaagattggattccgaaatgat gcaaattttgactggcaaaatgtggcagatttcagggatgcaggtggatccttaactgag gtcaaggtggaagaggaagaaagggatccgcagagtcctgaatttgaaattgaggaggag gaagaaatgttgtcatccgtcataccagattccaggagagaaaatgaacttcccgatttc ccccacattgatgagttttttacccttaactcaacaccatctagatctgcatatgatgag cctcatttgctcgtaaatattgagaaacagaaactagagttggaaaaacgacgactggat atcgaggccgaaaggctgcaggtagaaaaggaacgcctacaaatcgagaaagagaggctg cggcatttagacatggaacatgagcggcttcagctagagaaggagcggctgcagattgaa agagaaaagttgaggttacagatagtcaattcagagaaaccgtccttggaaaatgaactt ggtcaaggagaaaaatccatgcttcaaccacaggacatagaaacagagaagttaaaactt gagcgagaacgcttgcaactggaaaaggataggctgcagtttttgaagtttgaatctgag aagctgcagattgaaaaggaacgcttacaggtagagaaagacagacttcgaattcagaaa gaaggacacttgcagtga >gi568815587f:105977711_106196904|GENSCAN_predicted_peptide_3|40_aa MSRGEEKMPRSTEAADSDWKPHKASPEADAAVLPMKPAEP >gi568815587f:105977711_106196904|GENSCAN_predicted_CDS_3|123_bp atgtcacgtggtgaagaaaagatgccgagaagcactgaggcagcagacagtgattggaag cctcataaggcctccccagaagcagatgctgctgtgcttcctatgaagcctgcagaacca tga >gi568815587f:105977711_106196904|GENSCAN_predicted_peptide_4|606_aa MMTADFHFCEVKGQTHDLAITVLIVSRAMFEVNMKERDDGSVTITNLSSKAVKAFLDYAY TGKTKITDDNVEMFFQLSSFLQVSFLSKACSDFLIKSINLVNCLQLLSISDSYGSTSLFD HALHFVQHHFSLLFKSSDFLEMNFGVLQKCLESDELNVPEEEMVLKVVLSWTKHNLESRQ KYLPHLIEKVRLHQLSEETLQDCLFNEESLLKSTNCFDIIMDAIKCVQGSGGLFPDARPS TTEKYIFIHKTEENGENQYTFCYNIKSDSWKILPQSHLIDLPGSSLSSYGEKIFLTGGCK GKCCRTVRLHIAESYHDATDQTWCYCPVKNDFFLVSTMKTPRTMHTSVMALDRLFVIGGK TRGSRDIKSLLDVESYNPLSKEWISVSPLPRGIYYPEASTCQNVIYVLGSEVEITDAFNP SLDCFFKYNATTDQWSELVAEFGQFFHATLIKAVPVNCTLYICDLSTYKVYSFCPDTCVW KGEGSFECAGFNAGAIGIEDKIYILGGDYAPDEITDEVQVYHSNRSEWEEVSPMPRALTE FYCQLTSLALHMLHLHTSRVMTICVFEKKKVEGYLMESRRDPDLQVLLRTKKKHSNPVAG AQTLLL >gi568815587f:105977711_106196904|GENSCAN_predicted_CDS_4|1821_bp atgatgacagcagatttccatttctgcgaagttaagggtcagacacatgaccttgcaatt actgtgttaattgtatcaagggctatgtttgaagtaaacatgaaagaaagagatgatgga agtgttaccattactaatttgtcctccaaggcagtaaaagcatttctcgattatgcctat actggaaaaacaaaaataacagatgataatgtggaaatgttcttccagttgtcatcattt cttcaagtttccttcctatccaaagcttgcagtgactttttaataaaaagtattaatctt gtcaattgtttacagttattatctatatcagatagctatggctccaccagtttgtttgat catgcattacactttgtacaacatcacttttctttattatttaaatccagtgatttctta gagatgaattttggagtactacagaaatgtctggaatcagatgaattaaatgttcctgaa gaagaaatggtactgaaagttgtccttagttggactaaacataacttagaatcaaggcaa aagtatctgcctcatttgattgaaaaagtgagattacatcagttatctgaggagacactt caggactgtctgttcaatgaagagagtttactcaaaagcacaaactgttttgacataatc atggatgcaattaagtgtgtgcaaggttctggtggactcttccctgatgctcgaccatcc acaactgagaaatacatattcattcacaaaactgaggaaaatggagaaaatcaatataca ttttgctataacattaaatctgattcatggaaaatactgccgcaatcacacctgattgat ttgccaggatctagtctttcgagttacggagagaaaatattcttgacaggtggttgcaaa gggaaatgttgtcgaacggttcgactgcatattgccgagtcatatcatgatgccactgat caaacctggtgctactgtccagtgaaaaatgatttcttcttggtatcaactatgaaaaca ccaagaaccatgcatacatcagttatggctctcgatagattatttgtcataggtggaaaa actagaggatcccgggacattaaaagtctcttagatgttgaatcttacaatcctctttcc aaagaatggatatctgttagcccattacccagaggcatatactatccagaagcaagcaca tgccaaaatgtaatttatgttcttggatcagaggtagagattacagatgcttttaaccca tcacttgattgcttttttaaatacaatgctacaactgatcagtggtctgaactagtagca gagtttgggcaattttttcatgcaacattaattaaagctgtaccagtaaactgtacactg tatatatgtgacctttccacctataaggtttatagtttttgtccagacacttgtgtttgg aaaggcgaaggatcttttgagtgtgcaggctttaatgcaggtgcaattggaattgaagat aaaatttatatattaggtggtgattatgcaccagatgaaatcacagatgaagtgcaggtc taccacagcaacaggtctgaatgggaagaagtttcaccaatgcctagagccttaacagaa ttttactgccagctaacatctctggccttacatatgctacacttacatacgtccagagtt atgacaatctgtgtctttgagaagaaaaaggtggaaggctatcttatggagagtaggagg gatccagacttacaggttctgttgagaacaaagaagaaacactcaaatccagtggctgga gcccagactttgttgttataa >gi568815587f:105977711_106196904|GENSCAN_predicted_peptide_5|662_aa MEEVMSHIGSCVTNVTQTIGGRAGNRIRVPQNPAKKPQISTKQDDHYRLSNCVNYYRWPL ADISGQSEEEGNRLNPAVPHFNRPPGSHLQSSAETNFSLDQNLQGTWIAWAFIRESTTAY TEQLRKQTYINMTLRKCSSPGLAIAKQHLGRFIAPNKQPALAQLLAGSLGSRRGQCCALG IVPLLGLRREGFRPRSSPTCNCSLRSLHPSLPASRGLVAIGLHRCPVNSSASVSGVTVGL LALVPPFARLRSLAYLTSYNAAKRRADLRLRPQLLSPRGVCVATARPLRTDAGKKGVGPR LRPRHQARDSGEVRFQCMVFPAKRFCLVPSMEGVRWAFSCGTWLPSRAEWLLAVRSIQPE EKERIGQFVFARDAKAAMAGRLMIRKLVAEKLNIPWNHIRLQRTAKGKPVLAKDSSNPYP NFNFNISHQGDYAVLAAEPELQVGIDIMKTSFPGRGSIPEFFHIMKRKFTNKEWETIRSF KDEWTQLDMFYRNWALKESFIKAIGVGLGFELQRLEFDLSPLNLDIGQVYKETRLFLDGE EEKEWAFEESKIDEHHFVAVALRKPDGSRHQDVPSQDDSKPTQRQFTILNFNDLMSSAVP MTPEDPSFWDCFCFTEEIPIRNEAFSNRQVKSHKFDAPDSSMSVFGCYLAPFKALHAFSS VA >gi568815587f:105977711_106196904|GENSCAN_predicted_CDS_5|1989_bp atggaagaggttatgtcacacataggatcctgtgtgaccaatgtcacacagacaattggt ggtagagctggaaatagaatacgagtccctcagaatcctgctaagaagccacagatcagc accaagcaagatgatcattacagactgagtaattgtgttaattactaccgctggcctttg gctgatatttccgggcagtcagaggaagaaggaaatcggctgaatcccgcagtaccgcac tttaatcgacctccaggtagtcatttacagtcatctgcagagaccaacttttccctggac caaaacctgcagggcacttggatcgcgtgggctttcatacgagagtcaacaacagcctac acggaacaactaaggaaacagacatacatcaatatgacccttcgcaaatgctctagccca ggcttggcaatagcaaaacagcacctgggtcgcttcatcgccccaaataaacagccagca ttagctcagctgctcgctggcagcctcggatccaggcggggtcagtgttgcgcactgggg atagtgcctctgctcggccttcggagggagggtttcagaccccggagctcacccacctgc aactgcagtctccgaagtctccaccccagtctccctgcctccagaggactggttgcgatt ggcctgcaccgctgtcccgttaactcttccgcaagtgtgagtggcgtaactgtcgggctc ttggccttggtcccgcccttcgctcgcctccgaagcctcgcctacttgacgtcatacaat gccgcaaagcgcagggctgatctccgtctccgcccccagctgctttctccgagaggagtc tgcgtagcgacggcccgtcccctgcgcacggacgccgggaagaagggggtggggccacgt ttgcgtccgcgccatcaggcccgagatagcggcgaggtccgctttcagtgtatggttttc cctgccaaacggttctgcttggtgccatccatggagggcgtgcgctgggccttttcctgc ggcacttggctgccgagccgagccgaatggctgctggcagtgcgatcgattcagcccgag gagaaggagcgcattggccagttcgtctttgcccgggacgctaaggcagccatggctggt cgtctgatgataaggaaattagttgcagagaaattgaatatcccttggaatcatattcgt ttgcaaagaactgcaaaaggaaaaccagttcttgcaaaggactcatcgaatccttacccg aatttcaactttaacatctctcatcaaggagactatgcagtgcttgctgctgaacctgag ctgcaagttggaattgatataatgaagactagttttccaggtcgtggttcaattccagaa ttctttcatattatgaaaagaaagtttaccaacaaagaatgggaaacaatcagaagcttt aaggatgagtggactcagctggatatgttttataggaattgggcacttaaggaaagcttc ataaaagccattggtgttggactaggatttgaattgcagcggcttgaatttgatctatct ccattaaacttggatataggccaagtttataaagaaacacgtttattcctggatggagag gaagaaaaagaatgggcatttgaggaaagcaaaatagatgagcaccattttgttgcagtt gctcttaggaaacccgatggatctagacatcaggatgttccatctcaggatgattccaaa ccaacccagaggcaatttactattctcaactttaatgatttaatgtcatctgccgttccc atgacacctgaagatccttcattttgggactgtttttgcttcacagaagaaattccaata cgaaatgaagccttcagcaataggcaagtgaaatcccataaatttgatgctcctgatagt tccatgagtgttttcggttgttacttagcaccctttaaagctcttcatgctttttccagt gttgcatga >gi568815587f:105977711_106196904|GENSCAN_predicted_peptide_6|94_aa MYYSRVCGDADCGNDVRQRANSSVFLFEFKMGCKAAKTTRNISNTYGPGTANECTVQWWF KKFCKGDESLEDEECTGQPLEVDNDQLRATIEPS >gi568815587f:105977711_106196904|GENSCAN_predicted_CDS_6|285_bp atgtattattcgcgtgtttgtggtgatgctgactgtggaaacgatgttagacaaagagca aattcgagcgttttcttattcgagttcaaaatgggttgtaaagcagcaaagacaactcgc aatatcagcaacacatatggcccaggaactgctaatgaatgtacagtgcagtggtggttc aagaagttttgcaaaggagacgagagccttgaagatgaggagtgtaccggccagccattg gaagttgacaacgaccaattgagagcaaccatcgaaccctcttaa >gi568815587f:105977711_106196904|GENSCAN_predicted_peptide_7|302_aa MSLVCLIGFWRQHIPHLGVLLQPIYRMTQKAASFEWGLEQEKSLQQVQAAVQAALPLGLY DIAEPMVLEAHEQSGLGGRDGRYTWAQQHELPLTKADLATTTAECPICQQQRTTLSPQCV TIPQVLEVLARAIRQEKEIKVIQLGKEEVKLSLFADDMIVCLENPIVSAQNLLKLISNFS KVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTTASKRIKYLGIQLTRDMKDLFKENYK PLLKEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLK FI >gi568815587f:105977711_106196904|GENSCAN_predicted_CDS_7|909_bp atgagcctagtgtgcctaattggattttggaggcaacacattcctcatttgggtgtgtta ctccagcccatttatcgaatgacccaaaaagctgccagttttgagtggggtctagaacag gagaagtctctgcaacaggtccaggctgctgtgcaagctgctctgccacttgggctatat gacatagcagagccaatggtgcttgaggcccatgaacaaagtggccttggtggcagggat ggacgttacacatgggctcagcaacatgaacttccactcaccaaagctgacctggctaca accactgctgagtgcccaatttgccagcagcagagaacaacgctgagtcctcagtgtgtc accattcctcaggtgttggaagttctggccagggcaattaggcaggagaaggaaataaag gttattcaattaggaaaagaggaagtcaaattgtccctgtttgcagatgacatgattgta tgtctagaaaaccccattgtctcagcccaaaatctccttaagctgataagcaacttcagc aaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattcctatacaccaataac agacaaacagagagccaaatcatgagtgaactcccattcacaactgcttcaaagagaata aaatacctaggaatccaacttacaagggacatgaaggacctcttcaaggagaactacaaa ccactgctcaaggaaataaaagaggatacaaacaaatggaagaacattccatgctcatgg gtaggaagaatcaatatcgtgaaaatggccatactgcccaaggtaatttatagattcaat gccatccccatcaagctaccaatgactttcttcacagaattggaaaaaactactttaaag ttcatatga >gi568815587f:105977711_106196904|GENSCAN_predicted_peptide_8|226_aa MWILKPYDELRFLVEKKIQKRDKGKKEEEEEELEVQAAHEQLERVEGARHNLETSRSAQG PPQRAGKHEVKDITKNIQMLHRSVETATILATGAAVNGLQGPTKVLTMILSVGTGILRTW IELEAIIPSKQMHQRKTKYHMFSLRSGRIRVHKENNRLWGLLEGGRTFISMQAGTLLTAA SPETKTMPVILKVPKCWLFLGKRLLAHFINLTAAGRCNAQQQQQQQ >gi568815587f:105977711_106196904|GENSCAN_predicted_CDS_8|681_bp atgtggattttgaagccatatgatgaattaagatttctggtggagaaaaaaatccagaag agagataagggcaagaaggaggaggaggaagaagagcttgaggtgcaggctgcacatgag cagttggagagggtggagggagccagacataacctggagacaagcaggagtgcccaggga ccaccacagagagccgggaagcacgaggtcaaggatataactaagaacatacagatgcta cacagatcagtagagacagccacaattctagccacaggggcagctgtcaatggtcttcag ggtcccaccaaggtgctaacaatgatactatcagttgggacaggaattttgcgcacatgg atagagctggaggccattatccctagcaaacaaatgcaccaacggaaaaccaaataccac atgttctcacttagaagtgggagaatacgtgtacacaaagagaacaatagactctggggc ctacttgagggtggaagaacgtttatctccatgcaagcagggactttattgacggctgcc tccccagagaccaaaacaatgcctgtaatattgaaggtacccaaatgttggcttttcctg gggaagcgtttactagctcacttcataaacctgacagctgcaggcaggtgcaatgcacag cagcagcagcagcagcagtag >gi568815587f:105977711_106196904|GENSCAN_predicted_peptide_9|225_aa MARLVDLLVPVGVLFCQCALDDQWLVSSSADVFLTMLSRFLCLPAGVASQSGQIQNVRSR SSQWKPDTAVGWTREVINHPVSFVQCTLVAKLLTSVPFLQKQCENQLILEICTIESGVLL ERYRKMWKRLWNWVTGRGWNSLEGSEEDRMMWESFELPRDLSNDFDKNADNCVDNEVQAE VVSNGDEELLVSGSRGPSCYALAKRLKAFCPALEISGTLNLREII >gi568815587f:105977711_106196904|GENSCAN_predicted_CDS_9|678_bp atggcccggcttgtcgacctgctagtgcctgttggcgtgctcttctgccagtgtgccctc gatgaccagtggcttgtatcttcctctgccgacgtgttcctcacaatgctcagccgcttt ttgtgtctacctgctggggttgctagccaatcaggacaaatacagaatgtgaggtcccgt tccagccagtggaaaccggacacagcagtagggtggacacgtgaggttataaatcaccct gtctcctttgttcagtgtactcttgtggcaaaactgctgacgagtgtgccgtttctgcag aaacagtgtgagaaccaactaatactggaaatttgtaccatagagagtggggtactgcta gaaagataccgaaaaatgtggaagcgactttggaactgggtaacaggtagaggttggaac agtttggagggctcagaagaagacaggatgatgtgggaaagttttgaacttcctagagac ttgtcgaatgattttgacaaaaatgctgataattgcgtggacaatgaagtccaggctgag gtggtctcaaatggagatgaggaacttcttgtgagtgggagcagaggtccctcttgctat gctttagcaaagaggctgaaggccttttgtcctgccctagagatctctggaactttgaac ttgagagagatcatctga