GENSCAN 1.0 Date run: 5-Nov-116 Time: 09:43:53 Sequence gi568815594r:155729859_155966226 : 236368 bp : 37.12% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 172 373 202 1 1 56 43 171 0.960 5.28 1.02 PlyA + 1044 1049 6 1.05 2.00 Prom + 10835 10874 40 -6.15 2.01 Init + 11828 11934 107 1 2 28 80 87 0.102 1.64 2.02 Intr + 13012 13164 153 1 0 89 78 66 0.005 3.97 2.03 Intr + 19043 19088 46 2 1 99 94 19 0.002 1.09 2.04 Intr + 29205 29285 81 2 0 60 115 61 0.212 4.92 2.05 Intr + 29929 30002 74 0 2 118 108 65 0.253 8.79 2.06 Intr + 45110 45210 101 2 2 113 89 114 0.978 12.83 2.07 Intr + 47666 47784 119 0 2 38 115 90 0.968 5.96 2.08 Intr + 59856 60053 198 2 0 82 87 232 0.669 21.13 2.09 Intr + 63998 64228 231 1 0 73 98 235 0.955 20.05 2.10 Intr + 66519 66652 134 2 2 61 84 78 0.974 3.22 2.11 Intr + 70019 70216 198 2 0 52 76 177 0.967 10.44 2.12 Term + 72355 72742 388 1 1 31 42 440 0.961 27.13 2.13 PlyA + 73200 73205 6 1.05 3.00 Prom + 73286 73325 40 -9.45 3.01 Init + 73859 73906 48 1 0 66 92 59 0.485 5.10 3.02 Intr + 74735 74889 155 1 2 77 94 101 0.899 7.55 3.03 Intr + 75245 75371 127 2 1 113 64 64 0.866 6.26 3.04 Term + 85576 85785 210 0 0 -16 35 192 0.221 -0.19 3.05 PlyA + 87308 87313 6 1.05 4.11 PlyA - 88315 88310 6 1.05 4.10 Term - 100967 100788 180 2 0 25 38 178 0.685 3.13 4.09 Intr - 102057 101988 70 2 1 113 101 27 0.653 4.77 4.08 Intr - 106999 106831 169 2 1 107 91 57 0.967 5.98 4.07 Intr - 110756 110714 43 2 1 24 50 92 0.426 -4.11 4.06 Intr - 113972 113823 150 2 0 38 94 136 0.871 8.64 4.05 Intr - 114440 114288 153 2 0 74 76 137 0.955 10.45 4.04 Intr - 124391 124219 173 0 2 23 92 109 0.719 3.54 4.03 Intr - 125019 124892 128 2 2 77 80 118 0.729 9.30 4.02 Intr - 133896 133590 307 1 1 7 98 222 0.664 9.68 4.01 Init - 136368 136329 40 0 1 50 113 23 0.661 1.40 4.00 Prom - 140102 140063 40 -6.05 5.02 PlyA - 140698 140693 6 1.05 5.01 Sngl - 149163 148828 336 0 0 64 41 242 0.715 12.88 5.00 Prom - 151234 151195 40 -8.15 6.00 Prom + 151659 151698 40 -7.75 6.01 Init + 152984 153000 17 1 2 90 57 8 0.099 -2.22 6.02 Term + 162484 162769 286 1 1 98 43 223 0.706 12.69 6.03 PlyA + 162840 162845 6 1.05 7.00 Prom + 163838 163877 40 -6.25 7.01 Init + 165899 165965 67 1 1 26 86 72 0.004 2.09 7.02 Intr + 174186 174265 80 1 2 21 103 126 0.005 5.75 7.03 Intr + 175209 175299 91 2 1 36 115 44 0.889 0.55 7.04 Intr + 177864 177934 71 1 2 69 103 51 0.855 2.68 7.05 Intr + 179029 179156 128 0 2 115 69 157 0.999 15.06 7.06 Intr + 180167 180353 187 2 1 51 64 145 0.865 7.17 7.07 Intr + 184465 184580 116 0 2 67 68 165 0.792 10.73 7.08 Intr + 187537 187616 80 1 2 119 95 60 0.913 8.08 7.09 Intr + 188291 188381 91 0 1 16 92 71 0.725 -1.57 7.10 Term + 189979 190132 154 1 1 71 38 121 0.766 1.81 7.11 PlyA + 190522 190527 6 1.05 8.11 PlyA - 190811 190806 6 1.05 8.10 Term - 195707 195556 152 0 2 51 44 109 0.473 -0.11 8.09 Intr - 198570 198478 93 1 0 75 103 57 0.749 4.92 8.08 Intr - 199847 199684 164 1 2 63 111 30 0.487 1.50 8.07 Intr - 202809 202657 153 2 0 13 92 102 0.246 1.37 8.06 Intr - 204705 204571 135 2 0 120 -11 80 0.112 0.06 8.05 Intr - 207625 207504 122 1 2 73 103 64 0.983 4.77 8.04 Intr - 209680 209513 168 1 0 77 57 115 0.917 6.52 8.03 Intr - 212598 212459 140 1 2 57 88 44 0.497 0.56 8.02 Intr - 213406 213298 109 1 1 97 75 39 0.542 2.44 8.01 Init - 223989 223855 135 0 0 100 53 273 0.464 23.19 8.00 Prom - 227012 226973 40 -2.65 9.02 PlyA - 229019 229014 6 1.05 9.01 Term - 229830 229711 120 0 0 71 50 99 0.470 1.89 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 165705 166169 465 2 0 10 38 247 0.805 7.79 S.002 Intr + 174160 174265 106 1 1 74 103 131 0.990 12.50 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594r:155729859_155966226|GENSCAN_predicted_peptide_1|67_aa XLLKDCPGFVFTPRSREELPPNFPSEIPGICHFLDAYQQGTNSKPCFQKKDVEDGNANFL GKASGID >gi568815594r:155729859_155966226|GENSCAN_predicted_CDS_1|204_bp nnattactcaaagactgtcctggtttcgtgtttacccctcgatcaagggaggaacttcca ccaaacttccctagtgaaatccccggaatctgccattttctggatgcttaccaacaagga acaaactcaaaaccatgcttccaaaagaaagatgtggaagatggcaatgccaatttttta ggcaaagcatcaggaatagattag >gi568815594r:155729859_155966226|GENSCAN_predicted_peptide_2|609_aa MTHSWRVDITLKWESEGVLLSLPAENKLLLVGLIGRFLNSVTADPKIRPGLQKREITEPF KDASAALTNLFLTVLAKGMSSGHHQCKLNRLERRVCMIQDTEPLPPLPGSLRPYLCVGAA SPAPGADTMYGFVNHALELLVIRNYGPEVWEDIKKEAQLDEEGQFLVRIIYDDSKTYDLV AAASKVLNLNAGEILQMFGKMFFVFCQESGYDTILRVLGSNVREFLQNLDALHDHLATIY PGMRAPSFRCTDAEKGKGLILHYYSEREGLQDIVIGIIKTVAQQIHGTEIDMKVIQQRNE ECDHTQFLIEEKESKEEDFYEDLDRFEENGTQESRISPYTFCKAFPFHIIFDRDLVVTQC GNAIYRVLPQEGLLDVEKLECEDELTGTEISCLRLKGQMIYLPEADSILFLCSPSVMNLD DLTRRGLYLSDIPLHDATRDLVLLGEQFREEYKLTQELEILTDRLQLTLRALEDEKKKTD TGCPARIQAFKVQTTLMLCEKDSRSTKGFPSISYSGFLLIPLNRLLYSVLPPSVANELRH KRPVPAKRYDNVTILFSGIVGFNAFCSKHASGEGAMKIVNLLNDLYTRFDTLTDSRKNPF VYKASVLYR >gi568815594r:155729859_155966226|GENSCAN_predicted_CDS_2|1830_bp atgacacacagctggagagttgatataaccctgaagtgggagagtgaaggtgtattgctg tccttgccagctgaaaacaaactgcttctggtgggccttataggcaggtttcttaattca gtgactgcagatcccaaaataaggcctggcctacagaaaagagaaattacggagcccttc aaagatgccagtgctgctctcacaaatttatttctcacagtactggcgaaaggtatgtct tctggacaccaccagtgtaagctcaataggttagagcgacgagtatgcatgattcaggac actgagccgctgccgcctctgcctgggtcccttcggccgtacctctgcgtgggggctgcc tccccggctcccggtgcagacaccatgtacggatttgtgaatcacgccctggagttgctg gtgatccgcaattacggccccgaggtgtgggaagacatcaaaaaagaggcacagttagat gaagaaggacagtttcttgtcagaataatatatgatgactccaaaacttatgatttggtt gctgctgcaagcaaagtcctcaatctcaatgctggagaaatcctccaaatgtttgggaag atgtttttcgtcttttgccaagaatctggttatgatacaatcttgcgtgtcctgggctct aatgtcagagaatttctacagaaccttgatgctctgcacgaccaccttgctaccatctac ccaggaatgcgtgcaccttcctttaggtgcactgatgcagaaaagggcaaaggactcatt ttgcactactactcagagagagaaggacttcaggatattgtcattggaatcatcaaaaca gtggcacaacaaatccatggcactgaaatagacatgaaggttattcagcaaagaaatgaa gaatgtgatcatactcaatttttaattgaagaaaaagagtcaaaagaagaggatttttat gaagatcttgacagatttgaagaaaatggtacccaggaatcacgcatcagcccatataca ttctgcaaagcttttccttttcatataatatttgaccgggacctagtggtcactcagtgt ggcaatgctatatacagagttctcccccaggaaggattgttggatgtggagaaattagaa tgtgaggatgaactgactgggactgagatcagctgcttacgtctcaagggtcaaatgatc tacttacctgaagcagatagcatactttttctatgttcaccaagtgtcatgaacctggac gatttgacaaggagagggctgtatctaagtgacatccctctgcatgatgccacgcgcgat cttgttcttttgggagaacaatttagagaggaatacaaactcacccaagaactggaaatc ctcactgacaggctacagctcacgttaagagccctggaagatgaaaagaaaaagacagac acaggatgtcctgctagaatccaggcctttaaagtacaaacgacactgatgctgtgtgaa aaggacagcagaagcactaaaggctttcccagtatttcttacagtggctttctgctgatc ccactgaacagattgctgtattctgtccttcctccgtctgttgccaatgagctgcggcac aagcgtccagtgcctgccaaaagatatgacaatgtgaccatcctctttagtggcattgtg ggcttcaatgctttctgtagcaagcatgcatctggagaaggagccatgaagatcgtcaac ctcctcaacgacctctacaccagatttgacacactgactgattcccggaaaaacccattt gtttataaggcaagtgttctttatcgctga >gi568815594r:155729859_155966226|GENSCAN_predicted_peptide_3|179_aa MMEIAGQVQVDGESVQITIGIHTGEVVTGVIGQRMPRYCLFGNTVNLTSRTETTGEKGKI NVSEYTYRCLMSPENSDPQFHLEHRGPVSMKGKKEPMQVWFLSRKNTGTETNGFQSPSFQ LGCSDASHSKISKRLASGVHLVGVLTPDECVSGNKVLVRQAGKECEEDMAGGIGPKLYD >gi568815594r:155729859_155966226|GENSCAN_predicted_CDS_3|540_bp atgatggaaattgctggccaggttcaagtagatggtgaatctgttcagataacaataggg atacacactggagaggtagttacaggtgtcataggacagcggatgcctcgatactgtctt tttgggaatactgtcaacctcacaagccgaacagaaaccacaggagaaaagggaaaaata aatgtgtctgaatatacatacagatgtcttatgtctccagaaaattcagatccacaattc cacttggagcacagaggcccagtgtccatgaagggcaaaaaagaaccaatgcaagtttgg tttctatccagaaaaaatacaggaacagagacaaatggattccagagcccaagcttccaa ctaggatgctcggatgcatctcatagcaaaataagcaagcggctggcaagtggcgtacat ctggtgggcgttttaacaccagatgagtgtgtgtcaggtaacaaagttctagtcaggcaa gcaggcaaggagtgtgaagaagatatggcaggtggtattggccctaaactttacgactaa >gi568815594r:155729859_155966226|GENSCAN_predicted_peptide_4|470_aa MEQTEKSKVYAENGLLEKIKLCLSKKPLPSPTERKKFDHDFAISTSFHGIHNIVQNRSKI RRVLWLVVVLGSVSLVTWQIYIRLLNYFTWPTTTSIEVQYVEKMEFPAVTFCNLNRSHSY LTQYLKKLKEKVTKELEELWSWLLPDTSKLSQQMDCSVVLHLQEITANSTGSREATDFAA SHQNFSIVEFIRNKGFYLNNSTLLDCEFFGKPCSPKRMNLRPPNEKQWKIRDPWRKEFLE FSNRPIGLGYKVPKLVPSTDRLAAAGQEAFTDNPALGFVDAGIIFVIHSPKKVPQFDGLG LLSPVGMHARVTIRQVKLVQRTPKNEQGQLGDHIEFKDLCTVGTHNSSCPVSCEEIEYPA TISYSSFPSQKALKYLSKKLNQSRKYIRENLVKIEINYSDLNYKITQQQKAHCWSNQIQR QKVKEWLSWAEERKDEELEFNAYRVSVREDENILQIDKDYGMAAKQCERT >gi568815594r:155729859_155966226|GENSCAN_predicted_CDS_4|1413_bp atggagcagacagaaaaatcaaaagtatatgctgagaacggactcttagaaaagataaag ctttgcctttcaaagaaaccactgccatctcccactgagcgaaagaagtttgaccatgac tttgccatctccacttcctttcatgggatacacaatattgttcagaaccggagcaaaatt cgcagggtgctctggttggtggtggttctgggctcagtctcacttgtgacatggcagatc tacattcgcttgctcaactacttcacatggccaaccacaacgtccattgaggttcaatat gtggaaaagatggagttcccagctgtgacattttgtaatttgaacaggtcgcactcttac ctaacccagtatttgaagaagctgaaggaaaaggtcacaaaggagctggaggagctgtgg tcctggctgctgcccgatactagcaagttgagtcagcagatggattgcagtgtggtcctc catcttcaagaaattactgccaattccactggctctagagaggctactgattttgctgca agtcaccaaaacttcagcattgtggaatttatcaggaacaaaggtttttatctcaacaat agcactttgttggactgtgagttttttggaaagccatgtagcccaaagcgaatgaacctc agacccccaaatgaaaaacaatggaagatcagggatccctggaggaaagagttcctggag ttcagcaatcgtcctattggtttgggctataaggtgcccaagctggtaccaagcactgat aggctagctgctgcaggccaggaggcattcactgataacccagcccttggtttcgttgat gctgggatcatctttgttatccattcaccaaagaaggtgccacagtttgatgggttaggc ttgttgtcacctgtgggaatgcacgcaagggtaaccatccgccaagtgaagttagttcag cgaacgcccaagaatgaacaaggacagcttggagaccacattgaatttaaggatttatgt acagtaggaacacataactctagctgccccgtttcttgtgaagaaatagaatacccggcc actatttcttattcctcttttccaagtcaaaaagctttgaaatatctttccaagaagttg aatcaaagccggaaatacatcagggagaatcttgtaaaaattgaaattaactatagtgac ctaaactataagataacccagcagcaaaaggcgcactgctggagtaatcagattcagaga cagaaagtaaaagagtggttgtcatgggccgaggagaggaaggatgaggagttagagttt aatgcgtacagagtttcagttagggaagatgaaaacatcctgcaaatagataaagattat ggaatggctgcaaaacaatgtgaacgtacttaa >gi568815594r:155729859_155966226|GENSCAN_predicted_peptide_5|111_aa MRGSSSHRVPTGVLPSGVVRRGPKFSRPLNGRSTDSLHCVPEKAADTQCQPVKAARKGTA PAKPQGQRCPKTMGTGLLHQYNVTVRHESKEIILELYNLTALLDFGLAWGL >gi568815594r:155729859_155966226|GENSCAN_predicted_CDS_5|336_bp atgaggggttccagctcccacagagtccccactggagtattgcctagtggagttgtgaga agagggccaaaattctccagacccctgaatggtagatccactgacagcttgcactgtgta cctgaaaaagctgcagacactcaatgccagcctgtaaaagcagccaggaaggggacagcc cctgcaaagccacaggggcagaggtgccccaagaccatgggaaccggcctcttgcatcag tacaacgtgactgtgagacatgagtcaaaggagatcattttggagctttacaatttgact gccttgctggattttggacttgcatggggcctgtag >gi568815594r:155729859_155966226|GENSCAN_predicted_peptide_6|100_aa MEVKQRGRLTPLMARYLSETKLPEEQLGSNICCSAIFAVLQPPLLISRQTGSGVDLQQTP TDLQLRVLTVRRKTNKQKGHPHQDKNYMMNAQASVANSIN >gi568815594r:155729859_155966226|GENSCAN_predicted_CDS_6|303_bp atggaggtgaagcagaggggcagattgacacctctcatggccaggtacctctctgagaca aagcttccagaggaacaactgggcagcaacatttgctgttcagcaatattcgctgttctg cagcctccgctgctgatatccaggcaaacagggtctggagtggacctccagcaaactcca acagacctgcagctgagggtcctgactgtcagaaggaaaactaacaaacagaaaggacat ccacaccaagataagaactacatgatgaatgcacaagcttcagtagccaattcaatcaac tag >gi568815594r:155729859_155966226|GENSCAN_predicted_peptide_7|354_aa MEEYSMLMDRKTQYRENGHTGQEGSEEDKSQTGVNRASKGGLIYGNYLHLEKVLNAQELQ SETKGNKIHDEHLFIITHQAYELWFKQILWELDSVREIFQNGHVRDERNMLKVVSRMHRV SVILKLLVQQFSILETMTALDFNDFREYLSPASGFQSLQFRLLENKIGVLQNMRVPYNRR HYRDNFKGEENELLLKSEQEKTLLELVEAKEESEEKEEQVAEFQKQKEVLLSLFDEKRHE HLLSKGREEPRFQVPFQLLTSLMDIDSLMTKWRYNHVCMVHRMLGSKAGTGGSSGYHYLR STVSDRYKVFVDLFNLSTYLIPRHWIPKMNPTIHKFLYTAEYCDSSYFSSDESD >gi568815594r:155729859_155966226|GENSCAN_predicted_CDS_7|1065_bp atggaagaatattccatgctcatggacaggaagactcaatatcgtgaaaatggccatact ggccaagaaggcagcgaagaagacaaatcacaaactggtgtgaatagagccagcaaagga ggtcttatctatgggaactacctgcatttggaaaaagttttgaatgcacaagaactgcaa agtgaaacaaaaggaaataaaatccatgatgaacatctttttatcataactcatcaagct tatgaactctggtttaagcaaatcctctgggagttggattctgttcgagagatctttcag aatggccatgtcagagatgaaaggaacatgcttaaggttgtttctcggatgcaccgagtg tcagtgatcctgaaactgctggtgcagcagttttccattctggagacgatgacagccttg gacttcaatgacttcagagagtacttatctccagcatcaggcttccagagtttgcaattc cgactattagaaaacaagataggtgttcttcagaacatgagagtcccttataacagaaga cattatcgtgataacttcaaaggagaagaaaatgaactgctacttaaatctgagcaggaa aagacacttctggaattagtggaggctaaagaagagtctgaagaaaaagaggaacaggtg gctgaatttcagaagcaaaaagaggtgctactgtccttatttgatgagaaacgtcatgaa catctccttagtaaaggcagggaagagcctaggttccaggtgccttttcagttgctgact tctcttatggacatagattcactgatgaccaaatggagatataaccatgtgtgcatggtg cacagaatgctgggcagcaaagctggcaccggtggttcctcaggctatcactacctgcga tcaactgtgagtgataggtacaaggtatttgtagatttatttaatctttcaacatacctg attccccgacactggataccgaagatgaacccaaccattcacaaatttctatatacagca gaatactgtgatagctcctacttcagcagtgatgaatcagattaa >gi568815594r:155729859_155966226|GENSCAN_predicted_peptide_8|456_aa MDVRALPWLPWLLWLLCRGGGDADSRAPFTPTWPRSREREAAAFRESLNRHRYLNSLFPS ENSTAFYGINQFSYLFPEEFKAIYLRSKPSKFPRYSAEVHMSIPNVSLPLRFDWRDKQVV TQVRNQQMCGGCWAFSVVGAVESAYAIKGKPLEDLSVQQVIDCSYNNYGCNGGSTLNALN WLNKMQVKLVKDSEYPFKAQNGLCHYFSGSHSGFSIKGYSAYDFRVQPPSWLLSRSGIEC LWLFQVHSASCQWIYHSGVWRTVALFSQLLAIPSKLVIVVFPDSLVEGASGDGNGDWACF PGSRRKRAQGKNCRISLLQVGDQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQHHCSS GEANHAVLITGFDKTGSTPYWIVRNSWGSSWGVDGYAHVKMGSNVCVSYISSPLRWRIYR KDFKHQLLYLNKWCLPFEQQSDHVEGRKRAKSLQLN >gi568815594r:155729859_155966226|GENSCAN_predicted_CDS_8|1371_bp atggacgtgcgggcgctgccgtggctgccgtggctgctgtggctgctgtgccggggcggc ggcgatgcggactcccgcgcccccttcaccccgacctggccgcggagccgcgagcgtgaa gccgccgccttccgggaaagtcttaatagacatcgatacttgaattctttatttcccagt gaaaactccaccgccttctatggaataaatcagttttcctatttgtttcctgaagagttt aaagccatttatttaagaagcaaaccttccaagtttcccagatactcagcagaagtacat atgtccatccccaatgtgtctttgccgttaagatttgactggagggacaagcaggttgtg acacaagtgagaaaccagcagatgtgtggaggatgctgggccttcagcgtggtgggggca gtggaatctgcttatgcaataaaggggaagcccctggaagacctaagtgtccagcaggtc attgactgttcgtataataattatggctgcaatggaggctctactctcaatgctttgaac tggttaaacaagatgcaagtaaaactggtgaaagattcagaatatccttttaaagcacaa aatggtctgtgccattacttttctggttcacattctggattttcaatcaaaggttattct gcatatgacttcagggtacagcctccctcctggctgctttcacggtctggcattgagtgt ctgtggcttttccaggtgcatagtgcaagctgtcagtggatctaccattctggggtctgg aggacagtggccctcttctcgcagctccttgccatcccatccaagttggtgattgtcgtc ttccctgattcattagttgagggtgcatctggggatggaaatggggactgggcttgcttt cctggctctaggaggaaaagggctcagggaaagaactgcaggatctcattgttgcaagtt ggtgaccaagaagatgaaatggcaaaagcacttcttacctttggccctttggtagtcata gtagatgcagtgagctggcaagattatctgggaggcattatacagcatcactgctctagt ggagaagcaaatcatgcagttctcataactgggtttgataaaacaggaagcactccatat tggattgtgcggaattcctggggaagttcttggggagtagatggttatgcccatgtcaaa atgggaagtaatgtttgtgttagttatattagcagccctctgagatggcgtatctatcgg aaggatttcaaacaccaattgctttacctgaacaaatggtgcttaccctttgaacagcag agtgaccacgtagaaggaaggaaaagggcaaaatcgcttcagttaaactga >gi568815594r:155729859_155966226|GENSCAN_predicted_peptide_9|39_aa LSLTGRPAFFQADPRADPVLEHMCVVHGTPSLGSWGTVL >gi568815594r:155729859_155966226|GENSCAN_predicted_CDS_9|120_bp ctctcactgactggccgtccagccttcttccaggcagacccccgtgctgaccctgtcctt gagcacatgtgtgtggtccatgggacgccctccttgggctcctggggcacagttctctga