GENSCAN 1.0 Date run: 6-Nov-116 Time: 04:13:39 Sequence gi568815593r:15967474_16279575 : 312102 bp : 39.56% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 7497 7622 126 2 0 82 63 81 0.036 5.21 1.02 Intr + 38339 38501 163 1 1 63 105 106 0.435 8.43 1.03 Term + 41422 41639 218 2 2 18 38 287 0.811 13.02 1.04 PlyA + 42081 42086 6 1.05 2.00 Prom + 43807 43846 40 -4.85 2.01 Init + 48282 48452 171 2 0 55 85 128 0.871 8.69 2.02 Intr + 80310 80381 72 2 0 44 96 90 0.167 4.08 2.03 Term + 83891 84082 192 1 0 48 42 178 0.949 5.64 2.04 PlyA + 84148 84153 6 1.05 3.00 Prom + 86863 86902 40 -6.15 3.01 Init + 87833 87898 66 1 0 98 -6 153 0.628 8.02 3.02 Term + 92782 92871 90 0 0 140 38 39 0.190 0.84 3.03 PlyA + 96150 96155 6 1.05 4.09 PlyA - 96355 96350 6 1.05 4.08 Term - 100320 99998 323 1 2 54 48 274 0.990 14.00 4.07 Intr - 101006 100943 64 2 1 64 98 45 0.742 0.47 4.06 Intr - 101394 101220 175 2 1 82 61 40 0.590 -0.28 4.05 Intr - 113056 112927 130 2 1 97 77 104 0.851 9.03 4.04 Intr - 115794 115487 308 2 2 1 76 231 0.086 8.37 4.03 Intr - 122778 122662 117 2 0 83 66 95 0.339 5.46 4.02 Intr - 123532 123416 117 0 0 64 82 69 0.350 2.56 4.01 Init - 126102 125972 131 0 2 65 11 91 0.149 -1.23 4.00 Prom - 131865 131826 40 -5.95 5.00 Prom + 136019 136058 40 -4.95 5.01 Init + 144580 144906 327 0 0 41 39 251 0.229 12.90 5.02 Intr + 145475 145532 58 1 1 111 91 4 0.213 0.54 5.03 Term + 153158 153360 203 0 2 74 50 178 0.861 9.17 5.04 PlyA + 153483 153488 6 1.05 6.12 PlyA - 153836 153831 6 1.05 6.11 Term - 157723 157586 138 1 0 59 49 129 0.772 3.08 6.10 Intr - 207757 207587 171 1 0 17 70 163 0.897 6.62 6.09 Intr - 210408 210253 156 0 0 95 115 53 0.995 7.99 6.08 Intr - 212154 211566 589 2 1 128 110 452 0.917 43.10 6.07 Intr - 213252 213172 81 2 0 82 84 46 0.590 1.43 6.06 Intr - 213408 213278 131 0 2 65 94 48 0.771 1.67 6.05 Intr - 217119 216904 216 0 0 57 77 94 0.423 2.98 6.04 Intr - 220340 220140 201 2 0 -16 95 150 0.452 3.86 6.03 Intr - 223527 223406 122 1 2 86 61 87 0.487 5.09 6.02 Intr - 224938 224831 108 2 0 60 77 71 0.670 2.54 6.01 Init - 225234 225165 70 0 1 70 81 47 0.855 3.46 6.00 Prom - 225582 225543 40 -6.55 7.04 PlyA - 225698 225693 6 1.05 7.03 Term - 228908 228472 437 0 2 14 33 208 0.493 2.36 7.02 Intr - 229832 229275 558 0 0 -18 29 397 0.351 15.07 7.01 Init - 244216 243952 265 1 1 55 37 169 0.026 5.75 7.00 Prom - 252051 252012 40 -4.55 8.00 Prom + 252798 252837 40 -4.55 8.01 Init + 262367 262439 73 1 1 56 56 110 0.283 5.88 8.02 Intr + 267745 267826 82 2 1 122 53 33 0.618 0.98 8.03 Intr + 277885 278006 122 1 2 48 84 76 0.328 2.42 8.04 Intr + 280592 280753 162 0 0 100 103 60 0.575 7.73 8.05 Intr + 295452 295635 184 1 1 84 78 57 0.508 2.22 8.06 Intr + 297741 297922 182 0 2 58 54 124 0.363 4.59 8.07 Term + 309117 309211 95 1 2 82 49 82 0.008 0.71 8.08 PlyA + 310592 310597 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 14042 14250 209 0 2 73 39 160 0.817 6.12 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593r:15967474_16279575|GENSCAN_predicted_peptide_1|168_aa MQQAFSLELTEALKGKKTVKQESYIQQNCPFTVKEKLRHTFSQTPEMKDPETKETSSPGS ISLTSQDESWEAPATSPKPPHSGMQLELCANVAHQGECEGRICEMLIFIISQWLIEKLEE NGTAGMEKQFMTDEKEEEEEKKKKKEKEEEEEGEEEEVWGGKKEPPPQ >gi568815593r:15967474_16279575|GENSCAN_predicted_CDS_1|507_bp atgcagcaggctttctctttagaactcacagaagctctaaaaggaaaaaaaacagtcaag caagaatcttatatccagcaaaactgtcctttcacggtgaaggagaaattaagacataca ttttcacaaacaccagaaatgaaagacccagaaaccaaagaaacatcttctcctggctcc atctccttgacttctcaggatgaatcctgggaagccccagccacatcaccaaagccacca cactcaggcatgcaattggagctctgtgcaaatgtggctcatcagggagaatgtgaaggt aggatttgcgagatgttaatctttataataagccaatggttgattgaaaagttagaagaa aatgggacagcaggtatggaaaagcagtttatgactgatgagaaggaagaagaggaagag aagaagaagaagaaggagaaggaggaggaggaggagggagaggaggaagaggtatggggt gggaagaaagaaccaccaccacagtaa >gi568815593r:15967474_16279575|GENSCAN_predicted_peptide_2|144_aa MTDDLEALYDCAGCKTVGQLVWLSTGPSVPVGQSPGYFQPRSEALLRDGFPLAALSLPQI LGMSTLQDGQLQENKCREEAQNWCYATYESRPQEALTRCLHPLGMLLPLCEEALAEMKRA HGERPSHASAPTESDSANPPAECS >gi568815593r:15967474_16279575|GENSCAN_predicted_CDS_2|435_bp atgactgatgacctggaggctctgtatgactgtgctggctgtaaaactgtggggcaattg gtctggctgagtactgggccctctgtccctgttgggcagagtcctggctactttcaacct cgatctgaagctctcctcagggacgggttcccacttgcagctctctctctgccccagata ttagggatgtccacgctgcaggatggtcagctccaggagaacaaatgcagagaagaggcc cagaattggtgttatgccacttatgagtcaaggccacaagaggccctgacacgttgtctg caccctcttggaatgctgctgccactatgcgaagaagctctggctgagatgaaaagagca catggagagaggcccagtcatgccagtgctccaactgagtcagactcagccaatcctcca gctgaatgcagctga >gi568815593r:15967474_16279575|GENSCAN_predicted_peptide_3|51_aa MAEGEEEAGTSYMADTGGRESEVLVPSMSLSYKVLAAILLLSFGLTHLTAY >gi568815593r:15967474_16279575|GENSCAN_predicted_CDS_3|156_bp atggcggaaggtgaagaagaagccggcacttcctacatggctgacacaggaggacgagag agcgaagtgttggtcccatctatgtccctgtcatataaggttctagcagcaatcttgctc ctaagctttggactcacacatctgactgcctattag >gi568815593r:15967474_16279575|GENSCAN_predicted_peptide_4|454_aa MPYRKWKRMAPQGQCMATRFPGSQETLQPVHPTITSILPNPSIHQCDLAPLVSLQPLCSV AEEGHPFSDLLWNVWFYGSSVHSTVRKRKPIMQHLCGVCEVHRHRDGDQSARGRAVWVAV GRAKTTRIQCGLIRDHLHSEDTFSTGSGDHSCQPPTAPCISELYKQSSCTPFKSVINFHQ KTFITRLLEPKSNHLITTQSPPPLPSLESELLGSLRIAELWKLTEFLGNPSGCGQCAPRF KGLELRLEVGTGDLYLVVETSVCREMLWIFIDLSTNFSGLSASSKRRALPHFTSPERPSP GTTFCEGSRIQKPLDNTLKCKCVKCRLYVFEHVSEHPQGLVSLFAKRGLIVHEGAAVYRV FKRWRAVNLHWDVLNYDKATDIEESSRGESSTSRTLWLPLTALRNRNLVHPTQLTSPRFQ CGYVLLHLFNRMRPHEDLSEDNSSGEVVMRVTSV >gi568815593r:15967474_16279575|GENSCAN_predicted_CDS_4|1365_bp atgccatacagaaaatggaagaggatggcacctcagggtcagtgcatggccactcgattt cctgggtcccaggaaacactacagccagtgcatccaaccatcacttccatcctgccaaat ccaagcattcaccagtgtgacttggctcctctggtcagccttcagcccctatgcagtgtg gcagaggaaggacatcctttttcagatctgctatggaatgtatggttttatggatctagt gtgcatagtactgtgagaaaacggaagccaatcatgcagcatttatgtggcgtgtgtgag gtgcacaggcatcgcgatggtgaccagtcggcgaggggtagagcagtgtgggtggctgtg gggagagccaagaccaccagaatccagtgtggtctgattcgggaccacttgcacagtgaa gatactttctctacgggaagtggagaccacagctgccaaccaccaacagccccgtgcatc tcagagctgtacaagcagagcagctgcacaccattcaaaagtgtcatcaattttcatcaa aagactttcatcacaagacttttggagcccaaatccaaccatttgattaccactcaatca ccgccaccattaccaagcctggaatctgagctcctggggtcactgaggatagccgagctt tggaagttgacagagtttctggggaatccaagtggatgtggccagtgtgcacccagattt aagggtctggaactcaggctagaggttggaactggagatttgtatctggtagttgaaacc agtgtctgcagagaaatgctttggatattcattgatctctctaccaacttcagtggactt tcagcctcctcaaagagaagagctcttcctcacttcacctcacctgagagaccttcccca gggactactttttgtgagggatcacgaattcaaaagcctttggacaacacgctgaaatgt aaatgtgtgaaatgcaggttgtatgtttttgagcacgtttctgaacatccccaaggcttg gtttccttgtttgccaaacggggtctcattgttcatgaaggagctgcagtttacagagtg tttaagcgctggcgagctgtgaatttgcactgggatgtgttaaattatgacaaagccaca gacatcgaagaaagcagccggggagagtcttccacaagtaggactttgtggttgccattg acagctctgcggaacagaaacttggtccacccaactcagttaacctcaccaaggtttcag tgtggctatgtgttattgcacctgttcaatcggatgaggccccatgaagacttatcagaa gataacagctcgggggaggttgtgatgagagtgacttcagtgtga >gi568815593r:15967474_16279575|GENSCAN_predicted_peptide_5|195_aa MFRQKFAAGVWPSWKTSARAVQKGNVGCEPPHGVPTGALSGGAVRREPISSRPQNGKSTD SLHCVPEKAANTQHQPMTAVKSGAIPCKATGKEPTKAMGTHLLHQRDLDPHGTVSPLNLF FVVNYPVSDCKTGHMSGSSSHQELESISPPLESRVGCETGFDQYNVAEVTCHVNKHEITS WRMRGQCGPEISRPT >gi568815593r:15967474_16279575|GENSCAN_predicted_CDS_5|588_bp atgttcaggcagaaatttgctgcaggagtgtggccctcatggaaaacctctgctagggct gtgcagaagggaaatgtggggtgtgagcccccacatggagtccccactggggcactgtct ggtggagctgtgagaagagagccaatatcctccagaccccagaatggtaaatccactgac agcttgcactgtgtacctgaaaaagccgcaaacactcaacaccagcccatgacagcagtc aagagtggggctataccctgcaaagccacagggaaagagccgaccaaggccatgggaacc cacctcttgcatcagcgtgacctggatccacatggaactgtgagtccattaaacctcttt ttcgtcgtaaattacccagtctcagactgcaaaactggccacatgagcggcagctcctcc catcaagagctggaatctatttccccaccccttgaatctagggttggctgtgagactggc tttgaccagtataatgtggcagaggtcacctgccatgtcaacaagcatgaaataacttct tggaggatgagaggccagtgtggaccagagataagccgccccacttga >gi568815593r:15967474_16279575|GENSCAN_predicted_peptide_6|660_aa MPCEAIETAPATEQELLQPRPRQGQMSTRAQLQIPLHFGEAKSKDTSQQVQLPDAEKFKG CLTASLASTHLDVRRTVPLIVTTKNVSRHGEMSAGSDGVKGRQIQDGSRLSPLAPMPVAL RISRAGVDPVHHLPHICQVSVSHQLALCVSKLPRGSLNGGGPHKAPLELLFTHISYVILL KGFSATAQQLMVQAGATMILFWDVLKSGKRVGWEWCVRLVVCRKLGAASSSYLKEDSSKT ESVTSVKIRIDTLPHINPVLYDLTLHGFGNYCMCTILSLVPCRKKTVSSALKMSPTEVLQ GGVTEKLTSGQSPARQPGRQEGGGGTTMSFEGGHGGSRCRGAESGDAEPPPQPPPPPPPT PPPGEPAPVPAAPRYLPPLPASPETPERAAGPSEPLGEVAPRCRGADELPPPPLPLQPAG QEVAAAGDSGEGPRRLPEAAAAKGGPGESEAGAGGERERRGAGDQPETRSVCSSRSSSSG GGDQRAGHQHQHHQPICKICFQGAEQGELLNPCRCDGSVRYTHQLCLLKWISERGSWTCE LCCYRYHVIAIKMKQPCQSQWVDGGQLRCLSLHATSPEAIIIQRLNQIGTSKKACSFDLV LVLAVGWKLSWGCHLPRVTYHSYRGAIELELQITMYGLNTTLLSHNCQESEKYSCEETSD >gi568815593r:15967474_16279575|GENSCAN_predicted_CDS_6|1983_bp atgccctgtgaagccatagaaactgcccctgccacagagcaggagttgctacagcccagg ccaagacaaggccagatgtctacaagagcccagcttcagatacctctacattttggggaa gccaagagcaaagatacatctcagcaagtacagctaccagatgctgagaaattcaaagga tgtttaacagcatccctggcctctacccatcttgatgtcagaagaaccgtcccactaatt gtgacaaccaaaaatgtctccagacatggtgaaatgtccgctgggagtgatggggtaaaa gggaggcagatacaagacggcagcaggctctcgcctctagcaccaatgcctgtggccctc aggatttcccgggcaggagtagacccagttcaccatctacctcacatctgtcaagtgtca gtcagtcaccagctagccctttgtgtgtccaaattgccaagaggctccctgaatggaggg ggtccccacaaagctccactggaattactatttactcacatcagctatgtcattttgttg aagggattttcagcgactgctcagcagctcatggtccaggctggagcaaccatgatcctt ttctgggatgttttaaagtctggaaaaagggtgggttgggagtggtgtgtgaggctggtg gtttgccgtaaacttggagctgcctcctcttcttatctgaaggaagattcttctaagact gaaagcgtcacatcagtgaaaattcggatagacacactgccgcatataaatccggtgctt tatgatctcacactgcacggatttggaaactactgcatgtgcacgattctcagcctagtt ccctgcagaaagaaaactgttagctctgccctgaagatgagtcccacagaggttttgcag ggtggagttaccgagaaattgacttcgggccagagcccggcccgccagccgggccggcag gagggcggcggcggcacaaccatgagctttgagggcggccacggcggcagtcggtgtcgc ggggcggagagcggggacgccgagcctcccccgcaacctcccccgccgccgccgccgacg ccgccgccgggagagccggccccggtccccgcggccccgcgctacctgccgccgctgccc gcgtcccccgagacccccgagcgcgccgcggggccaagcgagccgctaggggaggtggcc ccgcggtgcaggggagcggacgagctgccgcctccgcccctgcccctgcagcccgccggc caggaagtggcggcggccggcgactccggggaaggtccgaggcgcctcccggaggcggca gcagcgaaaggcggccccggggagtctgaggccggcgcgggcggcgagcgcgagcggcgg ggcgccggagaccagcccgagacgcgctcggtgtgcagcagccgcagcagcagcagtggc ggcggcgaccagcgcgctgggcaccagcaccagcaccaccagcccatctgcaagatctgc ttccagggcgcggagcagggtgagttgttgaacccctgccgatgtgatgggtcagttcgg tatacacatcagctgtgcctgctaaaatggatcagtgagagaggttcctggacctgtgaa ctttgctgttatagataccatgttatagccattaaaatgaaacaaccttgccagagtcaa tgggttgatggaggtcagctgagatgtctttctctgcatgcaacatcacctgaagctatc atcatccagaggcttaaccagattggaacatccaagaaggcttgctcatttgacctggtt ttggtgctggctgttggctggaagctcagttggggctgccacctgcctagagtgacatat cacagttatcggggggctattgagttggaattgcaaattacgatgtatggcttaaacacc actttgctgtcacacaactgccaagagtctgaaaaatattcttgcgaagaaacctcagat tga >gi568815593r:15967474_16279575|GENSCAN_predicted_peptide_7|419_aa MLDILRCAGQAQLNCQQKMLMRNSAVDQVVQKKQAAAARPSRSQVESLPPNCIGYKRFII VLVTRDSQSWSTVTRKGIVIHLLMGDWQDTCKCGSDFGTENRQRLEQFGGLKRRQENVKS LEHPIDLLNGFDQNADNDMDNEIQAKVVSDGDEELAGNWSKGDSCYVLAKRLAAFCPFHR DLWNFELERDGLGYLAEEISKQQSIQEVTWVLLKAFHFIREAEHKNSQNLQPDNVIEKKI QFSEEKFKLAAEICINNREPNVNPQDNGENVFRALWKGHVGLEPPHRVPTGAQPSEAVRR WPPSSRPQNGRSTDILHHVPGKATDTQRQAMKAARRGTIPCKSTEVEPPKAMGSHLLHQR NLDVRHGVKGDHFGALRFDCPAGFWTCMGPAAPLLWPISPIWNGCICPMPVSPLYLGSN >gi568815593r:15967474_16279575|GENSCAN_predicted_CDS_7|1260_bp atgctagacatcctacgatgtgcaggacaagcccagcttaactgccaacaaaaaatgtta atgagaaactctgcggtagatcaagtggtccaaaagaaacaggcagcagctgcaaggcct tctcgaagtcaagtggagtcacttccaccaaattgtattggttacaagagattcataatt gtattggttacaagagattcacaaagctggtctacagtcaccaggaagggaattgtaatt cacctcctgatgggtgactggcaagatacctgcaaatgtggaagtgactttggaactgag aacaggcagaggttggagcagtttggagggctcaaaagaagacaggaaaatgtaaaaagt ttggaacaccctatagacttgttgaatggctttgaccaaaatgctgataatgatatggac aatgaaattcaggctaaggtggtctcagatggagatgaggaacttgctgggaactggagc aaaggtgactcttgttatgttttagcaaagagactggctgcattttgccccttccataga gacttgtggaactttgaacttgaaagagatggtttagggtatcttgcagaagaaatttct aagcagcaaagcattcaagaagtgacttgggtgctgttaaaggcattccattttataaga gaagcagagcataaaaattcacaaaatttgcagcctgacaatgtgatagaaaagaaaatc caattttctgaagagaaattcaagctggctgcagaaatttgcataaataacagggagccg aatgttaatccccaagacaatggggaaaacgtcttcagggcattgtggaagggacatgtg gggctggagcccccacacagagtccctactggggcacagcctagtgaagctgtaagaaga tggccaccgtcttccagaccccagaatggtagatccactgacatcttgcaccatgtgcct ggaaaagccacagacactcaacgccaggccatgaaagcagccaggagggggactataccc tgcaaatccacagaggtagagccacccaaggccatgggatcccacctcttgcatcagcgt aacctggatgtgagacatggagtcaaaggagatcattttggagctttaagatttgactgc cctgctggattttggacttgcatggggcctgcagcccctttgctttggccaatttctcca atttggaatggctgtatttgcccaatgcctgtatccccgttgtatctaggaagtaactag >gi568815593r:15967474_16279575|GENSCAN_predicted_peptide_8|299_aa MMSLQGEEETPESSVSPAGTEKRPSHSFSPIVAFLQILKYNEPAPPVGLSNWTLVSGTQP PCCEDVARPQRPGFRPKASVGIPANSHTRVPTLFNLGPVHINACSALDFMSLFVYLLSLP DFKVQEGKDLLFFIFGTHYNYINLAHVLTLKKRKPHSHIGVTKLKSAVTILNLGALPAEL PLCLGLSSPSPAPLPVAHSLSQQQQQVFLIFSEATLAQAPNAQRNITLMGICLPCLAGFE AGSNFTGFLGLLQPHTEWLKTTEMHSYKYTATIRFLNLFPTFPTFLFHKAAIVILARSQ >gi568815593r:15967474_16279575|GENSCAN_predicted_CDS_8|900_bp atgatgtccttacaaggagaggaagagacaccagagagctctgtgtctcctgcaggcaca gagaaaaggccatctcattctttttctcccatagtcgccttccttcagatcctcaaatat aatgaacctgctcctcccgtagggctttcaaactggacactcgtttctggaacccagcca ccatgctgtgaggatgtggcaagaccacagaggccaggtttccggcctaaagcttcagtt ggaatcccagccaatagccatacaagagtaccaacattgtttaatcttggacctgttcat attaacgcatgtagtgcattagattttatgtctttgtttgtgtaccttttatctctacca gactttaaggtacaagagggtaaggaccttctcttttttatttttggaacccactacaac tacatcaacctggcacatgttcttactttaaaaaaaagaaaaccccacagccatattggt gtcaccaagctcaagtcagcagttaccattctcaatcttggtgcacttcccgctgagctt cctctctgtctgggcctctcctcaccttcccctgcccctctacctgtagcacacagcctt agccagcaacagcagcaggttttccttatcttcagcgaagccactcttgctcaagcccct aatgctcagcgtaatattactctaatgggcatttgccttccttgtcttgctggatttgaa gccggaagcaactttactggcttcctagggctgctgcaaccacacactgagtggcttaaa accacagaaatgcactcttacaaatacacggcaaccatccgatttctcaatcttttcccc acctttcccacctttctattccacaaagccgccattgtcatcctggcccgttctcaatga