GENSCAN 1.0	Date run: 7-Nov-116	Time: 00:58:30

Sequence gi568815586r:64951352_65221191 : 269840 bp : 38.27% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   5231   5425  195  0  0   79   64   163 0.028  11.46
 1.02 Intr +  13737  13891  155  1  2  113   36    73 0.001   3.47
 1.03 Intr +  22080  22101   22  2  1   99  100     2 0.582  -1.30
 1.04 Intr +  25370  25511  142  0  1   91   59    69 0.509   2.79
 1.05 Intr +  30269  30407  139  2  1   93    5   118 0.273   3.55
 1.06 Intr +  31707  31802   96  2  0   87  100    12 0.236   1.59
 1.07 Intr +  43017  43160  144  2  0   89   55    93 0.328   5.66
 1.08 Intr +  46195  46304  110  0  2   74  100    13 0.171  -0.74
 1.09 Intr +  48561  48604   44  0  2   56  111    65 0.209   2.57
 1.10 Term +  62860  62897   38  2  2  139   48    49 0.018   2.52
 1.11 PlyA +  63557  63562    6                                1.05

 2.05 PlyA -  63876  63871    6                                1.05
 2.04 Term -  64895  64671  225  2  0   36   38   143 0.195  -0.10
 2.03 Intr -  65822  65714  109  1  1   78   72    79 0.228   4.67
 2.02 Intr -  80629  80483  147  0  0   83   39   109 0.411   3.93
 2.01 Init -  81535  80769  767  1  2   43   78   247 0.750  13.41
 2.00 Prom -  82810  82771   40                               -6.15

 3.03 PlyA -  82979  82974    6                                1.05
 3.02 Term -  83892  83240  653  1  2  -49   43   388 0.744  13.81
 3.01 Init -  84224  83915  310  2  1   88  -16   278 0.944  14.92
 3.00 Prom -  85342  85303   40                               -5.05

 4.09 PlyA -  85470  85465    6                                1.05
 4.08 Term -  88175  88051  125  0  2   41   55   120 0.338   1.57
 4.07 Intr - 101436 101310  127  0  1   18   26   125 0.320  -1.37
 4.06 Intr - 103862 103767   96  2  0   76   87   109 0.766   8.89
 4.05 Intr - 104775 104680   96  0  0   98   94    42 0.425   5.09
 4.04 Intr - 116474 116344  131  0  2   26  116    93 0.652   5.39
 4.03 Intr - 117553 117413  141  2  0  106  121    65 0.991  11.00
 4.02 Intr - 126499 126395  105  2  0   27   94   174 0.147  11.17
 4.01 Init - 138241 138052  190  1  1   69   92    81 0.511   5.85
 4.00 Prom - 138476 138437   40                               -5.15

 5.04 PlyA - 139500 139495    6                                1.05
 5.03 Term - 142438 142349   90  1  0   38   37   144 0.249   1.04
 5.02 Intr - 169205 169066  140  0  2   41  111    93 0.986   6.16
 5.01 Init - 169840 169693  148  1  1   95   82   187 0.996  17.10
 5.00 Prom - 171574 171535   40                               -6.85

 6.00 Prom + 172116 172155   40                               -7.95
 6.01 Init + 173586 173673   88  2  1  107   63    69 0.527   6.95
 6.02 Intr + 174598 174771  174  2  0   52   92    52 0.274   0.99
 6.03 Intr + 189105 189213  109  1  1   44   87    51 0.290  -0.98
 6.04 Intr + 196522 196629  108  1  0  131  102    74 0.866  11.58
 6.05 Intr + 199286 199356   71  2  2  105  111   -18 0.373   0.11
 6.06 Intr + 217729 217880  152  2  2   67   71   150 0.171  10.16
 6.07 Intr + 218186 219767 1582  1  1  -37   92  1553 0.124 130.72
 6.08 Intr + 245312 245488  177  0  0   70   40    99 0.001   2.27
 6.09 Intr + 259575 259612   38  1  2   68  108    22 0.003  -0.74
 6.10 Intr + 264626 264692   67  1  1   79   91    47 0.003   1.66
 6.11 Intr + 267201 267268   68  1  2   59   80    72 0.001   1.21

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -   7458   7358  101  1  2   85   39   132 0.835   5.31

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586r:64951352_65221191|GENSCAN_predicted_peptide_1|361_aa
XLSSYSISYTIAIPSSVLDLTGVFQQCIDIHQQILKLCVVGLQDGVYISSAPTLTLLVMV
ARLPVGHSTAGLLQFAGGLFQTLVALDFPIPGGFTSEGCRTAKMAASSFLWKLCPRGVGI
ITIQVFTSHTKLMSPCLHPSAIGVILKGHCFAIALERKFLDITTDKILANERSLSCRSVG
VAGRLLQTLFAWVSSAEAAEQQILLPDPSSGSFVPEGPLLQTSFSYSMSPPNEVLLHLLL
LTELPRWSTKISDIVWKSIMKIPVLFCSFLITGAYSPQSRTPCLLNRSRPSHADPLRVVI
ISSYSQHILIVALPVGSSVLFPSLIPFKPLINPSRRNESQALAIGADPIGKRRLHLMDES
F

>gi568815586r:64951352_65221191|GENSCAN_predicted_CDS_1|1086_bp
nctttgtcatcatacagcatttcttatactattgctatacctagttcagtgctagatctg
accggagtcttccaacaatgcatcgacatccaccagcagatcttgaagctctgcgtggta
ggactccaagatggcgtctacatttcctcggcccccaccctcacactacttgtcatggtt
gccagactccctgttggccactctactgcagggctgctacagtttgctgggggtctgttc
cagaccctagttgccttggatttccccatacctggaggtttcaccagtgaaggctgcaga
acagcaaagatggcagccagctccttcctctggaagctctgtcccaggggggtgggaatt
attacaattcaagtgtttacttcccatacaaaacttatgtctccatgtctccatccttca
gcaataggggtaattctgaaaggtcattgttttgcaattgcccttgaaaggaaattcttg
gacataacaacagataagattctagccaatgaaaggtccctcagctgcaggtctgttgga
gttgctggacgtctactccagaccctgtttgcctgggtgtcatcagcagaggctgcagaa
cagcaaatattgctgcctgatccttcctctggaagctttgttccagaggggcctcttctg
caaacaagcttctcttactctatgtctccaccaaatgaggtacttctgcatctacttctg
ctcacagaactgcctaggtggtccaccaagatttcagacattgtatggaaaagcatcatg
aaaatccctgtcctgttctgttcctttctgattactggtgcatacagcccccagtcacgt
actccctgcttgctcaatcgatcacgaccctctcatgcagaccctcttagagttgtgata
atcagtagttatagccagcatatattgattgttgctctaccagttggttcctctgtgctc
tttcccagtcttatcccatttaaacctctcatcaacccttcaaggagaaatgagtctcag
gcactggctataggggcagatcctattgggaagaggcgtctacatcttatggatgagagt
ttctaa

>gi568815586r:64951352_65221191|GENSCAN_predicted_peptide_2|415_aa
MNIDAKILSKILTNRIQQHIDRLVHHDQVGFTPGMQGWFNASKSINVIHHINRTKDKNQM
IISTDAEKAFDRIQQRFMLKTLSKLGIDGMYLKIMGAIYDKPTANIILDGQKLEAFPLKT
GTRQGCPLSPLLFNVVLEVLARAIRWEKEIKGIQLGKEEVKLSLFADDMIVCLESPIVLA
LNLLRLMGNFGKVSGCRVSVQKSQAFLYTNNGQTESRVMGELPFTIASGRMKCLGIQLAG
MWGISLGRTADHCSMRIGKNYFEVHMEPKKSPQSQDNPKPKEQIGGITLPDFKQYYKATV
IKTACPSSMSNLKDLKGNYHIHQGLIIITKIDPIWCLQRGRRSQESPAARRKVCLALLIA
CPVTGALQLMPMATLLKSQPKMDMKGERTHKSNSALVNCFMAFKEGKGKQLKIQP

>gi568815586r:64951352_65221191|GENSCAN_predicted_CDS_2|1248_bp
atgaacatcgatgcaaaaatcctcagtaaaatactgacaaatcgaatccagcagcacatc
gacaggcttgtccaccatgatcaagttggcttcacccctgggatgcaaggctggttcaat
gcaagcaaatcaataaacgtaatccatcatataaacagaaccaaagacaaaaaccagatg
attatctcaacagatgcagaaaaggcctttgacagaattcaacagcgcttcatgctaaaa
actctcagtaaactaggtattgatgggatgtatctcaaaataatgggagctatttatgac
aaacccacagccaatatcatactggatgggcaaaaactggaagcattccctttgaaaact
ggcacaagacagggatgccctctctcaccactcctattcaatgtagtgttggaggttctg
gccagggcaatcaggtgggagaaagaaataaagggtattcaattaggaaaagaggaagtc
aaattgtccctatttgcagatgacatgattgtatgtttggaaagccccattgtcttagcc
ctaaatctccttaggctgatgggcaactttggcaaagtctcaggatgcagggtcagtgtg
caaaaatcacaagcattcctatacaccaataatggacaaacagagagccgggtcatgggt
gaactcccattcacaattgcttcagggagaatgaaatgcctgggaatccagcttgcaggg
atgtgggggatctctttggggaggactgcagaccactgctcaatgagaattggaaaaaac
tactttgaagttcatatggaaccaaaaaagagcccacaaagccaagacaatcctaagcca
aaagaacaaattggaggcatcacgttacctgacttcaaacaatactacaaggctacagta
atcaaaacagcatgtcccagttccatgtcaaatctgaaggatcttaaagggaactatcac
attcaccaaggacttatcatcatcaccaagattgatcccatatggtgtctacaaagagga
aggagaagccaggaaagcccagctgcaagaaggaaggtttgtctggctctcctaattgct
tgcccagttaccggagcattacagctcatgccaatggccacactattgaagagccagcca
aaaatggacatgaaaggagaaaggacacataaaagcaattctgctctagtaaactgcttc
atggcgttcaaggaggggaaaggcaaacaactcaaaatacaaccttaa

>gi568815586r:64951352_65221191|GENSCAN_predicted_peptide_3|320_aa
MGRNQSRKADNSKNQSTSSPPKECSSSPATEQSRMENDFDELREEGFRRSVITNFSELKE
DVQTHHKEAKNLEKRLDEWLTRINSLEKTFNDLMEPKTMARELLAKVEERVPVIEDQMNE
MKREEKFREKRVKRNKQNLQEIWDYVKRPNLCLIGVPESDRENGNKLENTLQDIIQENFP
NLARQTNIQIQEIQRMPQRYSSRRATPGHVIVRFTKVEMKEKMLRAAREKGRVTHKGKPI
RLTADLSAETLQARREWGPIFNILKEKNFQPRVSDPAKLGFMGEGEIKSFTDKQMLRDFV
TTRPALEELLKEALNMERNN

>gi568815586r:64951352_65221191|GENSCAN_predicted_CDS_3|963_bp
atggggagaaaccagagcagaaaagctgacaattctaaaaatcagagcacctcttctcct
ccaaaggaatgcagctcctcaccagcaacagaacaaagccggatggagaatgactttgac
gagttgagagaagaaggcttcagacgatcagtaataacaaacttctctgagctaaaggag
gatgttcaaacccatcacaaagaagctaaaaaccttgaaaaaagattagacgaatggcta
actagaataaacagcttagagaagaccttcaatgacctgatggagccgaaaaccatggca
cgagaactactagctaaagtggaagaaagggtaccagtgattgaagatcaaatgaatgaa
atgaagcgagaagagaagtttagagaaaaaagagtaaaaagaaacaaacaaaacctccaa
gaaatatgggactatgtgaaaagaccaaatttatgtctgattggtgtacctgaaagtgac
agggagaatggaaacaagttggaaaacactcttcaggatattatccaggagaacttcccc
aacctagcaagacagaccaacattcaaattcaggaaatacagagaatgccacaaagatac
tcctcgagaagagcaactccaggacacgtaattgtcagattcaccaaagttgaaatgaag
gaaaaaatgttaagggcagccagagagaaaggtcgggttacccacaaagggaagcccatc
agactaacagcagatctctcggcagagactctacaagccagaagagagtgggggccaata
ttcaatattcttaaagagaagaattttcagcccagagtttcagatccagccaagctaggc
ttcatgggtgaaggagaaataaaatcctttacagacaagcaaatgctgagagattttgtc
acaaccaggcctgccttagaagagctcctaaaggaagcactaaacatggaaaggaacaac
tga

>gi568815586r:64951352_65221191|GENSCAN_predicted_peptide_4|336_aa
MWEKRLVGRAREKVVASQSLHCNKKPLRVLSRSSDIFQLAFKKKNWMMVPIEGINLERMS
YQGEYFYEFLSLRSLDKGIMADPTVNVPLLGTVPHKASVVQVGFPCLGKQDGVAAFEVDV
IVMNSEGNTILQTPQNAIFFKTCQQGCPYKLCFSLLKLSAQAGAEMEAFVMKDASASVLM
GSTDLTVRKANAHNPVEMEVNALVKANVSVPKVTRETSVQSLSASLAVVHMEPAMNPTNA
NVKKVGMEDTAIKHPHLEIMLLKASAISVPPSNPSIACQEDDLVLDSNANVFSRRAWESV
RHPVGPEDKAMGPDPTPPTPGLSTQLKGIELRSELK

>gi568815586r:64951352_65221191|GENSCAN_predicted_CDS_4|1011_bp
atgtgggagaagaggctggtgggcagagctagagagaaagttgtagctagccaaagtctt
cattgtaataagaaaccattaagggtgttaagtaggagcagtgatatattccagttggca
tttaagaagaaaaattggatgatggtgccgattgagggaataaatctggagcgcatgagt
taccagggagaatacttctatgaattcctgtccttgcgctccctggataaaggcatcatg
gcagatccaaccgtcaatgtccctctgctgggaacagtgcctcacaaggcatcagttgtt
caagttggtttcccatgtcttggaaaacaggatggggtggcagcatttgaagtggatgtg
attgttatgaattctgaaggcaacaccattctccaaacacctcaaaatgctatcttcttt
aaaacatgtcaacaagggtgtccatataagctttgtttctctctgctgaagctgagtgcc
caggcgggtgccgaaatggaggcttttgtaatgaaagacgcatctgcgagtgtcctgatg
ggttccacggacctcactgtgagaaaggcaaatgcccacaaccctgtcgaaatggaggta
aatgcattggtaaaagcaaatgtaagtgttccaaaggttaccagggagacctctgttcaa
agcctgtctgcgagcctggctgtggtgcacatggaacctgccatgaacccaacaaatgcc
aatgtcaagaaggttggcatggaagacactgcaataaaacacccccacttggagattatg
ctgctgaaagcgtctgccatcagtgtgcctccttcaaatccaagcattgcatgccaagaa
gatgacctggtgcttgattcaaatgcaaatgtgttttccaggagagcctgggagtcagta
cgccaccctgtggggccagaggacaaagccatgggcccagacccaaccccccccaccccg
ggtttgagcacacagctcaagggtattgagctgagatctgagctcaagtga

>gi568815586r:64951352_65221191|GENSCAN_predicted_peptide_5|125_aa
MARRSAFPAAALWLWSILLCLLALRAEAGPPQEESLYLWIDAHQARVLIGFEEDILIVSE
GKMAPFTHDFRKAQQRMPAIPVNIHSMNFTWQAAGQTIREPNPPIHMRELSLITSDQQSS
LKDDS

>gi568815586r:64951352_65221191|GENSCAN_predicted_CDS_5|378_bp
atggcccggaggagcgccttccctgccgccgcgctctggctctggagcatcctcctgtgc
ctgctggcactgcgggcggaggccgggccgccgcaggaggagagcctgtacctatggatc
gatgctcaccaggcaagagtactcataggatttgaagaagatatcctgattgtttcagag
gggaaaatggcaccttttacacatgatttcagaaaagcacaacagagaatgccagctatt
cctgtcaatatccattccatgaattttacctggcaagctgcagggcagaccatccgggaa
ccgaatccacctatccacatgcgggaactttctctcatcacatcagatcaacagtcatca
ctaaaagatgacagttga

>gi568815586r:64951352_65221191|GENSCAN_predicted_peptide_6|878_aa
MQDLQVLTCSMMSLEWEREGAAGDEAQNVACFALLLDHLGFCRLLSLPCISWRRPGGTGK
KKVSEGAEASAHLGSAVVWEPGQHSSREKDTSSSRSALLINKKTLIFSLYHIVTEAEIPY
VEKLEYILVRCIMTSAFYSTGVVVRRTWFSSCAWDAGVGGLILLKHSVNHTFVIYHQTGC
CFERQAKDELKIDNLKSDGKAANLVWPAEFLSVGLGSLGIGNPDVKKTYAINFKGLGRRV
TSGSAELVKHPGEKMAAAAASAPQQLSDEELFSQLRRYGLSPGPVTESTRPVYLKKLKKL
REEEQQQHRSGGRGNKTRNSNNNNTAAATVAAAGPAAAAAAGMGVRPVSGDLSYLRTPGG
LCRISASGPESLLGGPGGASAAPAAGSKVLLGFSSDESDVEASPRDQAGGGGRKDRASLQ
YRGLKAPPAPLAASEVTNSNSAERRKPHSWWGARRPAGPELQTPPGKDGAVEDEEGEGED
GEERDPETEEPLWASRTVNGSRLVPYSCRENYSDSEEEDDDDVASSRQVLKDDSLSRHRP
RRTHSKPLPPLTAKSAGGRLETSVQGGGGLAMNDRAAAAGSLDRSRNLEEAAAAEQGGGC
DQVDSSPVPRYRVNAKKLTPLLPPPLTDMDSTLDSSTGSLLKTNNHIGGGAFSVDSPRIY
SNSLPPSAAVAASSSLRINHANHTGSNHTYLKNTYNKPKLSEPEEELLQQFKREEVSPTG
SFSAHYLSMFLLTAACLFFLILGLTYLGMRGTGVSEDGELSSPVWIGNAHSQENSKQVVK
LVPERDLPMVTSKCAAKSDELKGLSGGTYACYSLLLWVIIVENPFGETFGKIQESEKTLM
MNTLYKLHDRLAQLAGDHECGSSSQRTLSVQEAAAYLK

>gi568815586r:64951352_65221191|GENSCAN_predicted_CDS_6|2634_bp
atgcaggatctgcaggtgctgacctgtagtatgatgagcctggaatgggagagggaggga
gcagctggggatgaagcccaaaatgtagcatgctttgctcttttacttgaccatctaggc
ttctgtaggctcctgagtttgccatgcatatcctggaggagacccggagggacagggaag
aagaaggtatctgagggggctgaagccagtgctcatcttggaagtgcagttgtgtgggaa
ccagggcaacacagctctagagaaaaagataccagctcatctcgttcagctttgcttata
aataaaaagacactgatcttttcattatatcacattgtaacagaagctgaaataccttat
gttgagaaattggaatacattttggtccgctgcattatgacatcggcattttacagcact
ggggtcgttgtgaggaggacttggttttccagttgtgcctgggatgctggagtgggcggg
ttaatcctcctcaaacatagcgttaatcatacatttgttatataccaccaaactggctgt
tgctttgaacgtcaagctaaagacgaactgaaaatagacaacctgaagtcagacgggaaa
gctgccaatctagtctggccagcagaatttctttcagtaggtctaggctccctcggcatt
gggaatccagatgtaaagaaaacttacgccataaacttcaagggtctagggcggcgcgtc
acttccggtagcgcggagcttgtaaaacaccctggagagaaaatggcggcggcagcagct
tcggcgcctcagcagctctcggatgaggagcttttctctcagctccgccgttacggcctg
tctcccggaccagtgacggagagcacccgcccggtctacctcaagaagctgaagaagctt
cgagaggaagagcagcaacagcaccggtcagggggccgcggcaacaagacgcggaacagt
aataacaataacacggcagccgccacggtcgcagccgcgggaccagcggcggcggcggcc
gcggggatgggggtccggccggtctcgggcgacctctcctacttacggactcctgggggc
ctgtgccgaatctcggcctctggcccagagagcctcctgggagggcccgggggcgcctcc
gccgcccccgcggctggcagcaaagtgctgctgggcttcagctcggacgagtcggacgtg
gaggccagtccccgggaccaggccggcggcggcgggaggaaagaccgggcttcgctccag
taccgcgggctcaaagcgccgccggcgcccctggccgccagcgaggtgactaacagcaac
tctgcagagcgaaggaagccccactcgtggtggggggccaggaggccggcgggccccgag
ctgcagaccccgccggggaaagatggagcagtggaggacgaggaaggggagggagaggac
ggtgaggagagggacccggagaccgaggagccgctctgggcgagccggaccgtgaatggc
agccggcttgtcccctacagctgccgggaaaactattcggactcagaggaagaggacgac
gacgacgtggcctccagcagacaggtattaaaggacgactccctttcccggcatcggccc
agacgaacccatagtaagcctctccccccgctgactgctaaatcggccggcggcaggctg
gagacttcagttcagggagggggaggactcgcgatgaatgacagggcggcggctgccggg
agtctagacaggagccgaaacctcgaagaggcggcggccgcggagcagggaggagggtgt
gatcaagtggactccagccccgttcctagataccgtgttaacgctaagaaactgacccct
ctcctgcccccgccacttactgacatggactcaaccttggattcgtcaacaggctccctt
ctgaaaaccaataatcatattggcggtggggccttcagtgtggactcccccaggatttat
tctaacagtctccctcccagtgcggcggtggccgcctctagttcactcaggatcaatcac
gccaatcatacgggctccaatcatacctacctgaaaaacacatacaacaaaccgaagctt
tccgaacccgaagaggaacttctccagcaatttaaacgggaggaggtgtccccaacaggg
agtttcagtgcccactacttgtcgatgtttctcttaactgctgcctgcttatttttccta
atactgggactgacttacctaggaatgagagggacaggagtatctgaggatggagaactc
agcagcccagtgtggatcggaaatgctcatagtcaagagaattcaaaacaagtagttaag
cttgtacctgagagggacctaccaatggtgacctccaaatgcgcagcaaaaagtgatgaa
ctcaaggggctcagtgggggcacttatgcctgttactcattgctcctgtgggttatcatt
gtagaaaacccctttggtgaaacatttggaaaaatacaagaaagtgaaaaaactcttatg
atgaacacattatataagcttcatgatcgattggcacagcttgcaggagatcatgaatgt
ggcagttctagtcaaagaacgctttctgttcaagaggcagctgcgtatttaaaa