GENSCAN 1.0 Date run: 4-Nov-116 Time: 08:56:30 Sequence gi568815577f:33225056_33455546 : 230491 bp : 44.84% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 4899 5161 263 0 2 73 92 102 0.355 6.23 1.02 Intr + 18618 18659 42 1 0 110 99 22 0.909 3.71 1.03 Intr + 19896 20019 124 1 1 110 73 -17 0.923 -1.36 1.04 Intr + 21663 21835 173 0 2 58 79 132 0.980 8.89 1.05 Intr + 23654 23799 146 0 2 82 115 130 0.996 15.10 1.06 Intr + 27607 27775 169 0 1 95 91 64 0.995 6.92 1.07 Intr + 29620 29852 233 2 2 37 55 148 0.923 3.89 1.08 Intr + 32131 32241 111 0 0 38 93 78 0.641 3.88 1.09 Term + 37738 38445 708 0 0 116 41 331 0.541 24.71 1.10 PlyA + 39125 39130 6 1.05 2.00 Prom + 39981 40020 40 -10.35 2.01 Init + 41411 41459 49 1 1 75 94 67 0.638 5.14 2.02 Intr + 43339 43462 124 2 1 56 98 35 0.537 0.94 2.03 Intr + 51547 51698 152 1 2 48 72 78 0.365 2.01 2.04 Intr + 54697 54863 167 2 2 64 116 99 0.951 9.98 2.05 Intr + 58039 58186 148 0 1 65 115 56 0.775 5.81 2.06 Intr + 63049 63206 158 2 2 46 92 119 0.053 7.83 2.07 Intr + 71129 71252 124 1 1 123 21 99 0.964 6.86 2.08 Intr + 73636 73879 244 2 1 25 80 169 0.531 6.36 2.09 Intr + 74889 74911 23 0 2 121 62 -26 0.262 -4.71 2.10 Intr + 77027 77142 116 0 2 68 72 74 0.629 3.97 2.11 Term + 83130 83267 138 2 0 132 41 48 0.823 2.46 2.12 PlyA + 83815 83820 6 1.05 3.00 Prom + 86204 86243 40 -5.86 3.01 Init + 100001 100076 76 1 1 49 94 190 0.501 14.95 3.02 Intr + 100167 100393 227 1 2 -12 30 182 0.671 0.10 3.03 Intr + 109663 109878 216 0 0 61 33 153 0.348 5.80 3.04 Intr + 112941 113026 86 2 2 24 102 56 0.368 -0.78 3.05 Intr + 115944 116119 176 0 2 85 116 29 0.761 4.98 3.06 Intr + 118213 118367 155 2 2 81 74 42 0.417 1.89 3.07 Term + 130261 130494 234 0 0 54 47 176 0.684 6.42 3.08 PlyA + 134738 134743 6 1.05 4.04 PlyA - 135813 135808 6 1.05 4.03 Term - 143814 143667 148 2 1 68 53 134 0.603 5.27 4.02 Intr - 158330 158117 214 0 1 70 69 130 0.243 7.07 4.01 Init - 160915 160870 46 1 1 86 58 48 0.623 0.45 4.00 Prom - 175201 175162 40 -1.96 5.00 Prom + 177285 177324 40 -4.16 5.01 Init + 178489 178561 73 0 1 78 99 271 0.974 26.43 5.02 Intr + 182871 182953 83 1 2 8 59 68 0.013 -4.74 5.03 Intr + 186390 186516 127 2 1 41 76 106 0.301 4.95 5.04 Intr + 189833 189965 133 0 1 103 66 97 0.981 8.70 5.05 Intr + 196425 196630 206 0 2 99 95 125 0.940 13.14 5.06 Intr + 207659 207816 158 0 2 78 94 97 0.630 9.03 5.07 Term + 211773 211907 135 2 0 27 46 116 0.095 -0.68 5.08 PlyA + 212141 212146 6 -1.75 6.03 PlyA - 212222 212217 6 -0.45 6.02 Term - 212311 212245 67 0 1 62 42 73 0.399 -2.49 6.01 Intr - 214242 214159 84 2 0 95 115 101 0.477 12.44 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 16868 16922 55 1 1 76 106 3 0.888 2.42 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815577f:33225056_33455546|GENSCAN_predicted_peptide_1|656_aa XPGLGARTRTKDASSRRVGIPPASRTVPRAQPADHHPAARAAFVPRPPLLSERPPARRIL TASVGSQSRARLGPEASISREPQGESCKVYISLVFGISYDSPDYTDESCTFKISLRNFRS ILSWELKNHSIVPTHYTLLYTIMSKPEDLKVVKNCANTTRSFCDLTDEWRSTHEAYVTVL EGFSGNTTLFSCSHNFWLAIDMSFEPPEFEIVGFTNHINVMVKFPSIVEEELQFDLSLVI EEQSEGIVKKHKPEIKGNMSGNFTYIIDKLIPNTNYCVSVYLEHSDEQAVIKSPLKCTLL PPGQESVFNQKIFESTYDLEAPPLRIVPPFQIEPVYILHALIDALCLPKMYKTKLHPDQG FSGSPGAVSWAIGHSYLTQTESLQMRQPPFGQGQFLMCPELVPANGFVVSLTSRMEPQTF LNFHNFLAWPFPNLPPLEAMDMVEVIYINRKKKVWDYNYDDESDSDTEAAPRTSGGGYTM HGLTVRPLGQASATSTESQLIDPESEEEPDLPEVDVELPTMPKDSPQQLELLSGPCERRK SPLQDPFPEEDYSSTEGSGGRITFNVDLNSVFLRVLDDEDSDDLEAPLMLSSHLEEMVDP EDPDNVQSNHLLASGEGTQPTFPSPSSEGLWSEDAPSDQSDTSESDVDLGDGYIMR >gi568815577f:33225056_33455546|GENSCAN_predicted_CDS_1|1971_bp nngccggggctcggcgcgcgcacccgcactaaagacgcttcttcccggcgggtaggaatc ccgccggcgagccgaacagttccccgagcgcagcccgcggaccaccacccggccgcacgg gccgcttttgtcccccgcccgccgcttctgtccgagaggccgcccgcgaggcgcatcctg accgcgagcgtcgggtcccagagccgggcgcggctggggcccgaggctagcatctctcgg gagccgcaaggcgagagctgcaaagtgtatatcagcctcgtgtttggtatttcatatgat tcgcctgattacacagatgaatcttgcactttcaagatatcattgcgaaatttccggtcc atcttatcatgggaattaaaaaaccactccattgtaccaactcactatacattgctgtat acaatcatgagtaaaccagaagatttgaaggtggttaagaactgtgcaaataccacaaga tcattttgtgacctcacagatgagtggagaagcacacacgaggcctatgtcaccgtccta gaaggattcagcgggaacacaacgttgttcagttgctcacacaatttctggctggccata gacatgtcttttgaaccaccagagtttgagattgttggttttaccaaccacattaatgtg atggtgaaatttccatctattgttgaggaagaattacagtttgatttatctctcgtcatt gaagaacagtcagagggaattgttaagaagcataaacccgaaataaaaggaaacatgagt ggaaatttcacctatatcattgacaagttaattccaaacacgaactactgtgtatctgtt tatttagagcacagtgatgagcaagcagtaataaagtctcccttaaaatgcaccctcctt ccacctggccaggaatcagtttttaatcagaaaatctttgaatctacctatgacctggaa gcccccccacttcgaattgtcccacctttccagattgaaccagtgtacatcttacatgca ttgattgatgccttatgtctccctaaaatgtataaaaccaagctgcaccctgaccaaggg ttctcgggatctcctggggctgtgtcatgggccattggtcactcatatttgactcagact gaatctcttcaaatgaggcagcccccttttgggcaagggcagttccttatgtgtccggag ttggttcctgccaatgggttcgtggtctcgctgacttcaagaatggagccgcagaccttc ctgaattttcataactttttagcctggccatttcctaacctgccaccgttggaagccatg gatatggtggaggtcatttacatcaacagaaagaagaaagtgtgggattataattatgat gatgaaagtgatagcgatactgaggcagcgcccaggacaagtggcggtggctataccatg catggactgactgtcaggcctctgggtcaggcctctgccacctctacagaatcccagttg atagacccggagtccgaggaggagcctgacctgcctgaggttgatgtggagctccccacg atgccaaaggacagccctcagcagttggaactcttgagtgggccctgtgagaggagaaag agtccactccaggacccttttcccgaagaggactacagctccacggaggggtctgggggc agaattaccttcaatgtggacttaaactctgtgtttttgagagttcttgatgacgaggac agtgacgacttagaagcccctctgatgctatcgtctcatctggaagagatggttgaccca gaggatcctgataatgtgcaatcaaaccatttgctggccagcggggaagggacacagcca acctttcccagcccctcttcagagggcctgtggtccgaagatgctccatctgatcaaagt gacacttctgagtcagatgttgaccttggggatggttatataatgagatga >gi568815577f:33225056_33455546|GENSCAN_predicted_peptide_2|480_aa MAWSLGSWLGGCLLVSALGMVPPPENVRMNSVNFKNILQWESPAFAKGNLTFTAQYLRIF QDKCMNTTLTECDFSSLSKYGDHTLRVRAEFADEHSDWVNITFCPVDDTIIGPPGMQVEV LADSLHMRFLAPKIENEYETWTMKNVYNSWTYNVQYWKNGTDEKFQITPQYDFEVLRNLE PWTTYCVQVRGFLPDRNKAGEWSEPVCEQTTHDETVPSWMVAVILMASVFMVCLALLGCF ALLWCVYKKTKYAFSPRNSLPQHLKEFLGHPHHNTLLFFSFPLSDENDVFDKLSVIAEDS ESGKQNPEKRSYYPGHSTEKQKNGKYEWLGYKPEKSSCKRRLASWKLAQVNAAAVPIEKG YLEAKYIQYGGSSSLLFVVMCAGVMVLARKISFLRGSHLHEASCHIGEVTWLGPEDASSS QQRTGNRGSQSNSNRLSGINFSTSQKHFGPQIPKGTLFFILENTMPTYQNKGQIGAFASK >gi568815577f:33225056_33455546|GENSCAN_predicted_CDS_2|1443_bp atggcgtggagccttgggagctggctgggtggctgcctgctggtgtcagcattgggaatg gtaccacctcccgaaaatgtcagaatgaattctgttaatttcaagaacattctacagtgg gagtcacctgcttttgccaaagggaacctgactttcacagctcagtacctaaggatattc caagataaatgcatgaatactaccttgacggaatgtgatttctcaagtctttccaagtat ggtgaccacaccttgagagtcagggctgaatttgcagatgagcattcagactgggtaaac atcaccttctgtcctgtggatgacaccattattggaccccctggaatgcaagtagaagta cttgctgattctttacatatgcgtttcttagcccctaaaattgagaatgaatacgaaact tggactatgaagaatgtgtataactcatggacttataatgtgcaatactggaaaaacggt actgatgaaaagtttcaaattactccccagtatgactttgaggtcctcagaaacctggag ccatggacaacttattgtgttcaagttcgagggtttcttcctgatcggaacaaagctggg gaatggagtgagcctgtctgtgagcaaacaacccatgacgaaacggtcccctcctggatg gtggccgtcatcctcatggcctcggtcttcatggtctgcctggcactcctcggctgcttc gccttgctgtggtgcgtttacaagaagacaaagtacgccttctcccctaggaattctctt ccacagcacctgaaagagtttttgggccatcctcatcataacacacttctgtttttctcc tttccattgtcggatgagaatgatgtttttgacaagctaagtgtcattgcagaagactct gagagcggcaagcagaatcctgaaaaaagaagttattatccagggcacagtacagagaaa caaaagaatggaaaatatgagtggttaggatacaagcctgaaaaatcaagctgcaagcgt agattagcaagctggaagcttgcacaggtgaatgcggcagctgtgccaatagaaaaggga tacctggaagccaagtacatccaatatggaggttcctcctcccttctctttgtcgtcatg tgtgcaggtgtcatggtgctggccaggaagatttctttcctgaggggttcacacttgcat gaagcaagctgccatattggagaagtcacatggctaggacctgaggatgcctcctccagc caacaacgaacagggaatcgaggttctcaatccaacagcaacaggctctctggcatcaat ttttccacttctcagaaacactttggcccacaaattcccaaaggaaccttgttcttcatt ctggaaaacacgatgccaacatatcagaataaaggacagataggagcttttgcttcaaag tga >gi568815577f:33225056_33455546|GENSCAN_predicted_peptide_3|389_aa MMVVLLGATTLVLVAVAPWVLSAAADAQSGKPSVHFAAPKIKPDLGSQINQEKVVFWVLS CRLPVAVYGSSGAPGSHPREMAVPELCVEFDSFRESTAAPLCQVMRRVIQVCEGQLDVQT EGTGAISGYPTTQFMTQVVIQGDITSYDAVNTEGKAAEIHYTFPPLQWEMGQGKLFAKYA SNKSLISKELKQSTTKPKSIKRTGMDNWIKLSGCQNITSTKCNFSSLKLNVYEEIKLRIR AEKENTSSWYEVDSFTPFRKAQIGPPEVHLEAEDKAIVIHISPGTKDSVMWALDGLSFTY SLVIWKNSSGVEYFSEQPLKNLLLSTSEEQIEKCFIIENISTIATVEETNQTDEDHKKYS SQTSQDSGNYSNEDESESKTSEELQQDFV >gi568815577f:33225056_33455546|GENSCAN_predicted_CDS_3|1170_bp atgatggtcgtcctcctgggcgcgacgaccctagtgctcgtcgccgtggcgccatgggtg ttgtccgcagccgcagacgcccagtctgggaaaccttcggtccactttgccgcgccaaag attaaacccgacctgggctcgcaaatcaaccaggagaaagtggtgttctgggtcctctct tgccgcttgcctgtggccgtgtacgggtcctcgggagcgcccgggtcccacccccgtgaa atggcggtgccagagctttgtgtcgagtttgattctttccgggaaagtaccgcggctccg ctgtgtcaagtgatgcgcagggtgatccaggtgtgtgaggggcagctggatgtccagact gagggcactggtgccatcagtggctatcctaccactcaattcatgacccaggtggtgatc cagggtgatatcaccagttatgatgcagttaacacagaggggaaagctgctgagatacac tatactttcccacccctgcagtgggagatggggcaggggaaactatttgcaaaatatgca tccaacaagagcttaatatccaaggaactcaaacaatcaacaacaaaacccaaatccatc aaaagaactgggatggataattggataaaattgtctgggtgtcagaatattactagtacc aaatgcaacttttcttcactcaagctgaatgtttatgaagaaattaaattgcgtataaga gcagaaaaagaaaacacttcttcatggtatgaggttgactcatttacaccatttcgcaaa gctcagattggtcctccagaagtacatttagaagctgaagataaggcaatagtgatacac atctctcctggaacaaaagatagtgttatgtgggctttggatggtttaagctttacatat agcttagttatctggaaaaactcttcaggtgtagaatatttctctgaacagccattgaag aatcttctgctttcaacttctgaggaacaaatcgaaaaatgtttcataattgaaaatata agcacaattgctacagtagaagaaactaatcaaactgatgaagatcataaaaaatacagt tcccaaactagccaagattcaggaaattattctaatgaagatgaaagcgaaagtaaaaca agtgaagaactacagcaggactttgtatga >gi568815577f:33225056_33455546|GENSCAN_predicted_peptide_4|135_aa MGFLHVEAGLELLTSVWVTWRFLWLTASDFSINFVAKAPCTVLCMVRTVWFVSDSGAFPD AGLSVLKLGKPRVNQDESAALPRALDKPAAGDILTTQPHLHYGLYPVPKLLSKKNEKNED MLTIRRRRMGREEFC >gi568815577f:33225056_33455546|GENSCAN_predicted_CDS_4|408_bp atggggtttctccatgttgaggctggtctcgaactgctgacctcagtctgggtcacgtgg cgcttcctgtggctcactgcctctgacttttcaatcaactttgtggctaaagcaccatgc actgtgctttgcatggtccggactgtctggtttgtcagcgactcaggggcttttccagat gcgggactttcagtgctgaaactgggaaagcccagggtaaaccaggatgagtcagcggcc ctgcctcgtgctctggataagcctgctgctggagacatcctgaccactcagccccacctg cattatggcttgtacccagttcccaagctcttgtccaagaagaatgagaagaatgaggat atgctgacaattcgaaggcggaggatgggcagagaagaattttgttga >gi568815577f:33225056_33455546|GENSCAN_predicted_peptide_5|304_aa MRPTLLWSLLLLLGVFAAAAAAPPVFLEENNFLVMAEESENAFLFLICKVMERWEMVNRR VEGGIMVPKDVHALILRVREPGNVLPDVQKGLRRYPLSQLPAPQHPKIRLYNAEQVLSWE PVALSNSTRPVVYQVQFKYTDSKWFTADIMSIGVNCTQITATECDFTAASPSAGFPMDFN VTLRLRAELGALHSAWVTMPWFQHYRNASTELQQVILISVGTFSLLSVLAGACFFLVLKY RGLIKYWFHTPPSIPLQIEEYLKDPTQPILEALDKDSSPKDDVWDSVSIISFPEKEQEDV LQTL >gi568815577f:33225056_33455546|GENSCAN_predicted_CDS_5|915_bp atgcgaccgacgctgctgtggtcgctgctgctgctgctcggagtcttcgccgccgccgcc gcggccccgccagtatttcttgaagagaacaatttcctggttatggctgaggaatcagaa aatgcctttttatttctcatctgcaaggtgatggagagatgggagatggtgaacaggcgt gtggagggcggaataatggtccccaaagatgtccacgccctcatcctcagagtccgtgaa cctgggaatgtgctgcctgacgtacaaaagggactccgcagataccctctttcccagctg cccgctcctcagcacccgaagattcgcctgtacaacgcagagcaggtcctgagttgggag ccagtggccctgagcaatagcacgaggcctgttgtctaccaagtgcagtttaaatacacc gacagtaaatggttcacggccgacatcatgtccataggggtgaattgtacacagatcaca gcaacagagtgtgacttcactgccgccagtccctcagcaggcttcccaatggatttcaat gtcactctacgccttcgagctgagctgggagcactccattctgcctgggtgacaatgcct tggtttcaacactatcggaatgcctccactgagcttcagcaagtcatcctgatctccgtg ggaacattttcgttgctgtcggtgctggcaggagcctgtttcttcctggtcctgaaatat agaggcctgattaaatactggtttcacactccaccaagcatcccattacagatagaagag tatttaaaagacccaactcagcccatcttagaggccttggacaaggacagctcaccaaag gatgacgtctgggactctgtgtccattatctcgtttccggaaaaggagcaagaagatgtt ctccaaacgctttga >gi568815577f:33225056_33455546|GENSCAN_predicted_peptide_6|50_aa XCILHSSLATILLPNRRSSSVSDFLNGLRLCDLPKGLRLICQSQDDDQVF >gi568815577f:33225056_33455546|GENSCAN_predicted_CDS_6|153_bp nnatgtatcctccactcctcgctggccaccatcctgctgcccaacagaagaagctcttct gtctccgatttcctgaacggtctaaggttgtgtgacctgcccaaggggctccggctcatt tgccaaagtcaagacgacgaccaggtcttctga