GENSCAN 1.0    Date run: 5-Nov-116    Time: 00:23:51

Sequence gi568815585r:21528435_21729015 : 200581 bp : 40.44% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   1239   1269   31  2  1   66  103    24 0.441   1.66
 1.02 Term +   2346   2551  206  1  2   40   38   194 0.536   6.15
 1.03 PlyA +   2755   2760    6                               1.05

 2.00 Prom +   4641   4680   40                              -5.75
 2.01 Init +  26491  27413  923  0  2   70   42   223 0.430   9.38
 2.02 Intr +  36022  36177  156  1  0   32   77   127 0.701   4.20
 2.03 Term +  37754  37880  127  2  1  110   55    52 0.676   0.87
 2.04 PlyA +  37895  37900    6                               1.05

 3.00 Prom +  62609  62648   40                              -4.75
 3.01 Sngl +  65533  65868  336  0  0   81   44   159 0.716   6.58
 3.02 PlyA +  66218  66223    6                               1.05

 4.03 PlyA -  66617  66612    6                               1.05
 4.02 Term -  75443  75225  219  2  0   73   28   198 0.984   8.36
 4.01 Init -  75714  75505  210  0  0  107   92   238 0.999  22.93
 4.00 Prom -  79840  79801   40                              -4.95

 5.04 PlyA -  79953  79948    6                               1.05
 5.03 Term -  82606  82449  158  2  2   53   54   129 0.788   3.11
 5.02 Intr -  83177  83049  129  0  0   56  101   174 0.156  15.25
 5.01 Init -  91859  91793   67  2  1   86   85    61 0.146   6.89
 5.00 Prom -  94500  94461   40                              -6.35

 6.02 PlyA -  94877  94872    6                               1.05
 6.01 Sngl - 100581  99979  603  0  0   53   42   764 0.998  64.24
 6.00 Prom - 102055 102016   40                              -3.65

 7.04 PlyA - 102655 102650    6                               1.05
 7.03 Term - 109322 109090  233  0  2   68   45   127 0.026   2.05
 7.02 Intr - 114081 113759  323  2  2   41   91   114 0.005   1.58
 7.01 Init - 135589 135471  119  1  2   67   28   142 0.520   5.82
 7.00 Prom - 135862 135823   40                              -5.05

 8.03 PlyA - 137610 137605    6                               1.05
 8.02 Term - 141144 140405  740  1  2  -49   37   514 0.852  25.23
 8.01 Init - 142220 142163   58  2  1   81  110   -10 0.772   2.02
 8.00 Prom - 142882 142843   40                             -14.71

 9.00 Prom + 143064 143103   40                              -6.35
 9.01 Init + 143479 143755  277  0  1   39   74   270 0.969  17.89
 9.02 Term + 144858 145066  209  1  2   60   42   220 0.881  11.12
 9.03 PlyA + 147719 147724    6                               1.05

10.00 Prom + 148091 148130   40                              -6.35
10.01 Init + 148162 148198   37  0  1   55   80    77 0.017   4.14
10.02 Intr + 152608 152711  104  2  2  100   89   115 0.096  11.77
10.03 Intr + 161544 161603   60  2  0  104   61    35 0.030   0.41
10.04 Intr + 163112 163273  162  1  0   85  -22   158 0.015   3.75
10.05 Intr + 172756 172962  207  0  0  136   16   118 0.022   7.75
10.06 Intr + 192153 192300  148  2  1   65   19   112 0.222   0.89
10.07 Term + 194118 194248  131  1  2   79   47   106 0.628   2.96
10.08 PlyA + 194991 194996    6                              -0.45

11.03 PlyA - 195677 195672    6                               1.05
11.02 Term - 197762 197465  298  1  1   31   45   382 0.731  21.95
11.01 Init - 198486 198374  113  0  2   43  105    85 0.912   5.44

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  83331  83049  283  0  1   39  101   248 0.834  18.45
S.002 Term + 172756 173001  246  0  0  136   48   138 0.913   9.41

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815585r:21528435_21729015|GENSCAN_predicted_peptide_1|78_aa
MISGTVSEVSSAAPGRDGGRGQASRGAPMDDGFLSLDSPSYVLYSDRAEWADIDLVLQNV
GPNPVVQIIYSDKYTLWK

>gi568815585r:21528435_21729015|GENSCAN_predicted_CDS_1|237_bp
atgatctcaggcactgtgtcagaagtttcatcagcagcaccaggaagagatggcggccga
ggccaggcaagccgtggtgcccccatggacgacgggtttctgagcctggactcaccctcc
tacgtcctgtacagcgacagagcagaatgggctgatatagatctagtgctgcagaatgtt
ggccccaatcctgtggtccagatcatttacagtgacaaatatacactctggaaatga

>gi568815585r:21528435_21729015|GENSCAN_predicted_peptide_2|401_aa
MDKFLDTYTLPRLNQEEVESLNRPITGSEIEAIINILPTKKSPGPDGFTAEFYQRYKEKL
IPFLLKLFQSIEKEGILPNLFYEASIILIPKPGRDTTKKENFRPISLMNTDAKILNKILA
NRIQQHIKKLIHHDQVGFIPGMQGWFNIRKSINVIQHINRTKDKNHMIISIDAEKAFDKI
QQCFMLKTLNKLGIDGMYLKIIIAIYDKPTANIILNGQKLEAFPLKINRLILTSELITSS
GDFNKVTGNHIVFLNPSTLSLFYKAISVNLNTFPPSSLSADNLASQFTEKKRKKGNKQLD
QLPLDVMWGLLYGCGGEVKREVRIVQSYDQLSVFQSDCVTVKGCTLQKRFSVFLFFHSPC
CCIFIESIPVHLGRRQKANKFSQTTSSIWAQRMGSNPLYAK

>gi568815585r:21528435_21729015|GENSCAN_predicted_CDS_2|1206_bp
atggataaattcctcgacacatatactctcccaagactaaatcaggaagaagttgaatct
ctgaatagaccaataacaggctctgaaattgaggcaataattaatatcttaccaaccaaa
aaaagtccaggaccagatggattcacagctgaattctaccagaggtacaaggagaagctg
ataccattccttctgaaactattccaatcaatagaaaaagagggaatcctccctaactta
ttttatgaggccagcatcatcctgataccaaagcctggcagagacacaacaaaaaaagag
aattttagaccaatatccttgatgaacactgatgcaaaaatcctcaataaaatactggca
aaccgaatacagcaacacatcaaaaagcttatccaccatgatcaagtgggcttcatccct
gggatgcaaggctggttcaacatacgaaaatcaataaacgtaatccagcatataaacaga
accaaagacaaaaaccacatgattatctcaatagatgcagaaaaggcctttgacaaaatt
caacaatgcttcatgctaaaaactctcaataaattaggtattgatgggatgtatctcaaa
ataataatagctatctatgacaaacccacagccaatatcatactgaatggacaaaaactg
gaagcattccctttgaaaataaatagacttattttaacttcagaattaataacttcaagt
ggggactttaataaggttactggcaatcatattgtatttctcaatccatcaactctctca
ctgttttataaggctatttctgtaaacctcaataccttccccccctcttcactctcagct
gataacctggcttcccaattcactgagaaaaaaagaaagaaaggtaataaacaacttgac
cagcttccattagatgtcatgtggggcctgctctatggatgtggtggtgaggtgaaaagg
gaggtacgtattgtacagtcctatgatcagctgtcagtctttcagtcggactgtgtgact
gtgaagggatgtacccttcaaaagcggttctcagtttttttgtttttccactccccttgc
tgctgcatatttatagagagtattccagtacacttaggaaggcgtcagaaagccaacaag
ttttcacagactacttcttcaatatgggcacagaggatggggtcaaatccactctatgca
aaatga

>gi568815585r:21528435_21729015|GENSCAN_predicted_peptide_3|111_aa
MGKDFMTKMPKVIATKAKIDKWDLIKVKSFCTAKETIIRVNRQLTEWEKIFAIYPSDKGL
IYRIYKKLKHIYKKKRNNPTKKWAKDMNRHFSKEDIYTANKHEKELIITGH

>gi568815585r:21528435_21729015|GENSCAN_predicted_CDS_3|336_bp
atgggcaaagacttcatgacaaaaatgccaaaagtaattgcaacaaaagccaaaattgac
aaatgggatctaattaaagtaaagagcttctgcacagcaaaggaaactatcatcagggtg
aacaggcaacttacagaatgggagaaaatttttgcaatctacccatctgacaaaggtcta
atatacagaatttacaagaaacttaaacatatttataagaaaaaaagaaacaaccccacc
aaaaagtgggcaaaggatatgaacagacatttctcaaaagaagacatttacacggccaac
aaacatgaaaaagagctcatcatcactggtcattag

>gi568815585r:21528435_21729015|GENSCAN_predicted_peptide_4|142_aa
MAAAAGSCARVAAWGGKLRRGLAVSRQAVRSPGPLAAAVAGAALAGAGAAWHHSRVSVAA
RDGSFTVSAQAGSGLALGGSPGAVGGGAAADRSALRLVGLAPSWTGPRPESPRAFEAGDR
RFCLPWENPADAEHHGPQNWPH

>gi568815585r:21528435_21729015|GENSCAN_predicted_CDS_4|429_bp
atggcggcggctgcgggtagctgcgcgcgggtggcggcctggggcggaaaactgcgacgg
gggctcgctgtcagccgacaggctgtgcggagtcccggccccttggcagcggcagtggcc
ggcgcggccctggcaggagcaggagcggcctggcaccacagccgcgtcagtgttgcggcg
cgggatggcagttttacagtctccgcacaggcggggagtggtttggctctgggcggctct
cccggagccgtgggtggaggtgccgctgcggatcggagcgccctgcgactggtgggactc
gctccttcctggaccggcccccggcctgagtccccgagggccttcgaggctggtgaccgg
agattctgcctcccttgggagaaccccgcggacgctgaacaccacggcccacaaaactgg
ccgcactaa

>gi568815585r:21528435_21729015|GENSCAN_predicted_peptide_5|117_aa
MQLRPGKRQRGPVTCRAGKRQGGPYVRAGRTAHAQHRSDCLLTQRATVSGIGGLSDFKNE
ATDPRARCKSSPSPHRTQKPGRLHLSRPRHLGLHLLDPLAGLLQGPVHTLGVWVLPH

>gi568815585r:21528435_21729015|GENSCAN_predicted_CDS_5|354_bp
atgcagcttaggccaggcaagcgccagcgagggccagtaacctgcagagctggcaagagg
cagggaggcccctatgtgagagcaggtcggaccgcccacgcgcagcaccgcagcgactgc
ctcctcacccagcgtgcgactgtgtccggaattggtgggttgtctgacttcaagaatgaa
gccacagaccctcgcgctagatgtaaaagttctccaagtcctcaccggacccagaagcct
ggaaggcttcacctctcacgaccccggcacctggggctgcacctgctcgacccactggcc
ggcctgctccagggcccagtgcacactcttggggtctgggtgcttcctcattga

>gi568815585r:21528435_21729015|GENSCAN_predicted_peptide_6|200_aa
MFSSSAKIVKPNDEKPDEFESGISQALLELEMNSDLKAQLRELNITAAKEIELGGGRKAI
IIFVPIPQLKSFQKIQVRLVRELEKKFSGKHVVFIVQRRILPKPTRKSRTKNKQKRPRSH
TLTAVHDAILEDLVFPSEIVGKRIRVKLDGSRLIKVHLDKAQQNNVEHKVETFSGVYKKL
TGKDVNFEFPEFQLQTKMTT

>gi568815585r:21528435_21729015|GENSCAN_predicted_CDS_6|603_bp
atgttcagttcgagcgccaagatcgtgaagcccaatgatgagaagccggacgagttcgag
tccggcatctcccaggctcttctggagctggagatgaactcggacctcaaagctcagctc
agggagctgaatattacggcagccaaggaaattgaacttggtggtggtcggaaagctatc
ataatctttgttcccattcctcaactgaaatctttccagaaaattcaagtccggctagta
cgcgaattggagaaaaagttcagtgggaagcatgtcgtctttatcgttcagagaagaatt
ctgcctaagccaactcgaaaaagccgtacaaaaaataagcaaaagcgtcccaggagccat
actctgacagctgtgcacgatgccatccttgaggacttggtcttcccaagcgaaattgtg
ggcaagagaatccgcgtgaaactagatggcagccggctcataaaggttcatttggacaaa
gcacagcagaacaatgtggaacacaaggttgaaactttttctggtgtctataagaagctc
acgggcaaggatgttaattttgaattcccagagtttcaattacaaacaaaaatgactaca
taa
>gi568815585r:21528435_21729015|GENSCAN_predicted_peptide_7|224_aa
METKGKNLKEVVEIKMTIAEMKNAFDEIVSKLDTAEKVISNWHVVLTVLWLPGVTILLMC
LRFSGDQKLHLMPSLHLGTPKLRISFILWIESVWDKKVYEAHWIFFYFSKGKDSVELLKL
RNLLLNPKITCKRDFLCYPTSDLGISAVYSGCLITRCCTNKSHFHIQIWFAESSSQDSHI
GSNAHKGAAMQRGTVAEVPGSSALQTGDCRGERTERPWRPNFRG

>gi568815585r:21528435_21729015|GENSCAN_predicted_CDS_7|675_bp
atggaaactaaagggaagaatctaaaagaagtggtagaaatcaaaatgactatagcagaa
atgaagaatgcctttgatgagattgtcagtaaactggacacagctgaaaaagtgattagt
aattggcacgtcgtcctcacagtcctgtggctgccaggggtcaccattctgctcatgtgc
ttgaggttttctggggaccaaaaactgcacctaatgccatcacttcaccttggtacccct
aaattacggatcagtttcattctatggattgagtcagtgtgggacaagaaggtttatgaa
gcccattggatttttttctatttctccaaaggaaaggattcagttgagcttttgaagttg
aggaatttgcttcttaacccaaagatcacgtgcaaacgtgacttcctgtgctaccctact
tctgatttaggaatcagcgcagtatacagtgggtgcttaataactaggtgttgcacaaat
aaatcccatttccacatccaaatttggtttgctgaaagcagcagccaagacagccacata
ggtagtaacgcgcacaagggagcggcaatgcaaagaggaactgtggctgaggtgcccgga
agctccgctttgcaaaccggggactgcaggggagagaggacggaaaggccgtggcggccg
aacttccgaggctga

>gi568815585r:21528435_21729015|GENSCAN_predicted_peptide_8|265_aa
MAVSLPFFGHTHRSANYVSGLRLKTKGSDEFADFGGAGGASDLERPSPALRGGRRAPARA
RRKERRASGPPLFLSHPPFAAVQGHWVSGAGSVRLAYVFLKHESPAPWYLQQSHPRWYRA
GFNVNCSAADTRSWVLSVTSHKPPESLALISEAASSDGYSRGVPRPVGYSAPGWLLSGEK
SDGRNRDRPRETGRRFIRRETSSTWTLDISSEDKTGTISTSCLQPHSFDLNLDCHPMSKD
HFRFRILQRVTVALKSVAAVEERAK

>gi568815585r:21528435_21729015|GENSCAN_predicted_CDS_8|798_bp
atggcagtttctcttccattcttcggccacacacacagaagcgcaaactatgtttcaggg
cttcgcctgaagacgaaagggagcgatgaatttgcagatttcgggggcgctggaggtgcc
agcgacctggagcgaccttcccctgccctccggggaggaaggcgtgctccggcccgggcg
aggaggaaggagagaagagcgagtggtcctcccctcttcctttcccaccccccctttgcc
gcagtccagggtcattgggtgtccggggctggctcggtgcgtcttgcttacgtgtttctg
aagcatgagagccctgccccgtggtatttacagcagtcacatccgcggtggtacagagcg
ggctttaacgttaactgctcggcggcagatacaagaagctgggtattaagtgtcacttct
cataaaccacccgagtcgctcgcattaatctccgaggcagccagcagtgatgggtacagc
aggggcgttcctcgaccagttgggtacagtgcgccgggatggttacttagtggggaaaaa
agcgatggaagaaacagagacaggcccagagagacagggagacgcttcataagaagagaa
acgagcagcacgtggacattggacatttccagtgaagacaagactgggaccatttccacc
agctgtctccaaccacattctttcgatctgaatttagattgtcatccaatgagcaaagac
catttccgcttccgaatcctgcagcgtgtgactgtcgctttaaaaagcgttgcagcagtg
gaagagagggcgaagtag

>gi568815585r:21528435_21729015|GENSCAN_predicted_peptide_9|161_aa
MAPLGEVGNYFGVQDAVPFGNVPVLPVDSPVLLSDHLGQSEAGGLPRGPAVTDLDHLKGI
LRRRQLYCRTGFHLEIFPNGTIQGTRKDHSRFVPDVEHNLNCMFSFVRLRMMPEGEEDLG
LGWGWSQIHEAQDLLLMLTPEPKPTAEPSSVKLLRPSPLAA

>gi568815585r:21528435_21729015|GENSCAN_predicted_CDS_9|486_bp
atggctcccttaggtgaagttgggaactatttcggtgtgcaggatgcggtaccgtttggg
aatgtgcccgtgttgccggtggacagcccggttttgttaagtgaccacctgggtcagtcc
gaagcaggggggctccccaggggacccgcagtcacggacttggatcatttaaaggggatt
ctcaggcggaggcagctatactgcaggactggatttcacttagaaatcttccccaatggt
actatccagggaaccaggaaagaccacagccgatttgtgccagatgttgaacacaatcta
aactgtatgtttagcttcgtgcggctgcggatgatgccagaaggtgaggaggacttaggc
ctgggctggggatggtcccagatccacgaggcccaggatttgctcctcatgctcacccct
gagcctaagcccacagctgagcctagtagtgtgaaattgctccggccgtctccactcgct
gcttaa

>gi568815585r:21528435_21729015|GENSCAN_predicted_peptide_10|282_aa
MVDLAALVRASLGILEFISIAVGLVSIRGVDSGLYLGMNEKGELYGSNGLFIAWNPGEPL
NQSLSVMLLPEASLLTCGESSPVSISSPSGAVVQRPGPRDALHPYKVEWEPLSWASFGSG
LEKLTQECVFREQFEENWYNTYSSNLYKHVDTGRRYYVALNKDGTPREGTRTKRHQKFTH
FLPRPVDPDKLGSPKSSELDSPPRLMAPIHPSQRCQESTELQLHTSGACCVLEWSRFQNV
RTLKIYALSKFQENDPMLLATGTVQQQSCRTEAPRKLKLCTL

>gi568815585r:21528435_21729015|GENSCAN_predicted_CDS_10|849_bp
atggttgatctggcagccctggttcgagcttctctgggcattctggaatttatcagtata
gcagtgggcctggtcagcattcgaggcgtggacagtggactctacctcgggatgaatgag
aagggggagctgtatggatcaaatggattgttcattgcatggaaccctggagaacccctc
aaccagagcctcagtgtcatgttgctgccagaggcatctttgctgacatgtggagagtcc
tcccctgtcagcatcagcagcccctccggggctgtcgttcagcgtcctggcccgagagat
gccctccatccttacaaagtggagtgggaaccactgtcctgggccagctttggcagtggc
ctggaaaaactaacccaagagtgtgtattcagagaacagttcgaagaaaactggtataat
acgtactcatcaaacctatataagcacgtggacactggaaggcgatactatgttgcatta
aataaagatgggaccccgagagaagggactaggactaaacggcaccagaaattcacacat
tttttacctagaccagtggaccccgacaaattaggctcccccaagtcatctgagctggac
tccccgcctaggctcatggcccccatccatccttcacagagatgccaggagagtacagag
ctgcaacttcatacctctggggcctgctgtgttttggaatggtccagatttcagaatgtg
agaacacttaagatctacgctctcagcaaatttcaagaaaacgatccaatgttgttagct
acaggcaccgtacaacagcagagctgcagaactgaggcacctcgcaaactcaaactttgc
accctttga

>gi568815585r:21528435_21729015|GENSCAN_predicted_peptide_11|136_aa
MGSPLLETSAEDTESRSPSRSRSGNTSEQSTLTVEPERCSISAAGDSDGSGSIGSGSGCD
DGGGGSDGASDGGGDGGGDGDGDGDGSGDGGDGGGGGGDGGGGGGDGGDGDGADGADGGD
GDGGDGDGDGWWWWWW

>gi568815585r:21528435_21729015|GENSCAN_predicted_CDS_11|411_bp
atgggatcccctctcttggaaacctctgcagaagacacagagagcaggagtccttccagg
agcaggagtggaaacaccagcgagcagtcaacactcacagtagagcctgagaggtgctca
ataagtgctgctggtgatagtgatggtagtggtagtattggtagtggtagtggttgtgat
gatggtggcggtggtagtgatggtgctagtgatggtggtggtgatggtggtggtgatggt
gatggtgatggtgatggtagtggtgatggtggtgatggtggtggtggtggtggtgatggt
ggtggtggtggtggtgatggtggtgatggtgatggtgctgatggtgctgatggtggtgat
ggtgatggtggtgatggtgatggtgatggttggtggtggtggtggtggtga