GENSCAN 1.0    Date run: 6-Nov-116    Time: 05:17:17

Sequence gi568815597r:32913184_33136828 : 223645 bp : 46.61% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.12 PlyA -   2365   2360    6                               1.05
 1.11 Term -   8148   7993  156  0  0   21   43   187 0.846   5.43
 1.10 Intr -  13083  12941  143  1  2   35  115    17 0.065  -0.83
 1.09 Intr -  24076  23715  362  0  2   70   50   339 0.376  23.16
 1.08 Intr -  25345  25214  132  0  0  103   91   112 0.999  12.76
 1.07 Intr -  29276  29069  208  0  1   58   97   124 0.967   8.44
 1.06 Intr -  30976  30836  141  2  0  146   98   133 0.999  20.42
 1.05 Intr -  32445  32331  115  0  1   91   99    84 0.999   9.82
 1.04 Intr -  33381  33219  163  2  1  119  116   142 0.998  20.08
 1.03 Intr -  35183  35039  145  0  1   48  110   -13 0.427  -3.76
 1.02 Intr -  36591  36386  206  2  2   68   75   156 0.738  11.14
 1.01 Init -  51502  50868  635  1  2   95   88   930 0.134  88.52
 1.00 Prom -  63674  63635   40                              -6.96

 2.00 Prom +  65346  65385   40                              -7.96
 2.01 Sngl +  66762  67289  528  2  0   78   43   670 0.779  57.56
 2.02 PlyA +  67319  67324    6                               1.05

 3.05 PlyA -  70685  70680    6                               1.05
 3.04 Term -  74213  73831  383  0  2   40   43   190 0.761   4.70
 3.03 Intr -  79140  79047   94  0  1  102   75   -11 0.484  -1.46
 3.02 Intr -  84646  84528  119  2  2   33   72    94 0.452   2.48
 3.01 Init -  85430  85328  103  2  1   66   80    76 0.793   5.00
 3.00 Prom -  88249  88210   40                              -3.56

 4.07 PlyA -  90077  90072    6                               1.05
 4.06 Term - 100219  99998  222  1  0  112   35   269 0.952  20.92
 4.05 Intr - 101411 101339   73  1  1  124   52   104 0.999   9.81
 4.04 Intr - 108278 108184   95  2  2   30   63   123 0.998   2.76
 4.03 Intr - 108520 108410  111  1  0   90   64   107 0.870   9.08
 4.02 Intr - 111384 111259  126  0  0   82   82    58 0.612   5.48
 4.01 Init - 123645 123553   93  0  0  105   99   105 0.893  13.68
 4.00 Prom - 126786 126747   40                              -3.46

 5.00 Prom + 129057 129096   40                              -7.66
 5.01 Init + 137987 138230  244  1  1   72   53   134 0.099   6.24
 5.02 Intr + 145154 145224   71  0  2   64   35    62 0.070  -2.60
 5.03 Intr + 151243 151386  144  0  0   96   91    65 0.236   8.08
 5.04 Intr + 157465 157833  369  0  0 -118   78   351 0.004   8.80
 5.05 Intr + 168995 169171  177  1  0  116   89   190 0.924  22.02
 5.06 Intr + 170771 170944  174  1  0   84  105   100 0.912  11.44
 5.07 Intr + 178867 179039  173  0  2   66   82   203 0.426  16.24
 5.08 Intr + 181365 181530  166  0  1   82   76   169 0.728  15.06
 5.09 Intr + 183524 183686  163  1  1   92   73   210 0.995  19.45
 5.10 Intr + 184884 184996  113  1  2   95  105   200 0.986  22.50
 5.11 Term + 186352 186474  123  0  0   79   39    46 0.401  -2.82
 5.12 PlyA + 188379 188384    6                               1.05

 6.00 Prom + 192876 192915   40                              -6.16
 6.01 Init + 193468 193647  180  0  0   87   77   108 0.848   8.78
 6.02 Intr + 204719 204933  215  1  2   75   99   223 0.997  19.61
 6.03 Intr + 206861 206995  135  2  0  100   84   165 0.996  16.98
 6.04 Intr + 207234 207255   22  0  1   99   64     7 0.878  -3.05
 6.05 Term + 207760 207888  129  0  0  118   53   135 0.987  11.18
 6.06 PlyA + 208396 208401    6                               1.05

 7.03 PlyA - 210191 210186    6                               1.05
 7.02 Term - 216640 216571   70  0  1   95   36    84 0.623   1.31
 7.01 Init - 220420 220275  146  1  2   78  115    73 0.519   6.87

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 51502 50864 639 1 0 95 53 942 0.866 87.59 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:32913184_33136828|GENSCAN_predicted_peptide_1|801_aa MGSEKDSESPRSTSLHAAAPDPKCRSGGRRRRLTLHSVFSASARGRRARAKPQAEPPPPA AQPPPAPAPAAAQGPPPEALPAEPAAEAEAEAAAAAAEPGFDDEEAAEGGGPGAEEVECP LCLVRLPPERAPRLLSCPHRSCRDCLRHYLRLEISESRVPISCPECSERLNPHDIRLLLA DPPLMHKYEEFMLRRYLASDPDCRWCPAPDCGYAVIAYGCASCPKLTCEREGCQTEFCYH CKQIWHPNQTCDMARQQRAQTLRVRTKHTSGLSYGQESGPADDIKPCPRCSAYIIKMNDG SCNHMTCAVCGCEFCWLCMKEISDLHYLSPSGCTFWGKKPWSRKKKILWQLGTLIGAPVG ISLIAGIAIPAMVIGIPVYVGRKIHSRYEGRKTSKHKRNLAITGGVTLSVIASPVIAAVS VGIGVPIMLAYVYGVVPISLCRGGGCGVSTANGKGVKIEFDEDDGPITVADAWRALKNPS IGESSIEGLTSVLSTSGSPTDGLSVMQGPYSETASFAALSGGTLSGGILSSGKGKYSRLE VQADVQKEIFPKDTASLGAISDNASTRAMAGSIISSYNPQDRECNNMEIQVDIEAKPSHY QLVSGSSTEDSLHVHAQMAENEEEGSGGGGSEEDPPCRHQSCEQKDCLASKPWDISLAQP ESIRSDLESSDAQSDDVPDITSDECGSPRSHTAACPSTPRAQDGGLTGEAPPEDVPGPHK TLSSGGERIDHRCVPTKNSTPKSSVSSPGKKARIESTFEIDQSNAFIRQRRKRRHKEEDI KTTRPSAYSSLSRLLRPVDLS >gi568815597r:32913184_33136828|GENSCAN_predicted_CDS_1|2406_bp atgggctccgagaaggactccgagtcgccgcgctccacatcgctacatgcggccgcaccc gaccctaagtgccgcagcggcggccggcgccggcgcctcaccttgcacagcgtcttctct gcctcggcccgcggccgccgcgcccgggccaagccgcaggccgagccgccgcccccggct gcgcagccgccgcccgccccggcccctgccgcggcccagggcccgccgcccgaggcgctg cccgccgagccggccgccgaggccgaggcggaggccgcggcggcggcggcggagcctggg ttcgacgatgaggaggcggcggagggcggtggcccgggcgcggaggaggtggagtgtccg ctgtgcctggtgcggctgccgcctgagcgggccccgcgcctcctcagctgtccgcaccgc tcgtgccgggactgcctccgccactacctgcgcctggagataagcgagagcagggtgccc atcagctgccccgagtgcagcgagcgactcaacccgcacgacatccgcttgctgctcgcc gacccgccgcttatgcacaagtacgaggagttcatgctgcgccgctacctagcctcggac cccgactgccgctggtgcccggccccggactgcggttatgctgttattgcctatggctgt gccagctgcccgaagctaacttgtgagagggaaggttgccagactgagttctgctaccac tgcaagcagatatggcatccaaatcagacatgcgatatggcccgtcaacagagggcccag actttacgagttcggaccaaacacacttcaggtctcagttatgggcaagaatctggacca 
gcagatgacatcaagccatgcccacgatgcagtgcatacattatcaagatgaatgatgga agctgtaatcacatgacctgtgcagtgtgtggctgtgaattctgttggctttgtatgaaa gagatctcagacttgcattacctcagcccctctggctgtacattctggggcaagaagcca tggagccgtaagaagaaaattctttggcagctgggcacgttgattggtgctccagtgggg atttctctcattgctggcattgccattcctgccatggtcattggcattcctgtttatgtt ggaaggaagattcacagcaggtatgagggaaggaaaacctccaaacacaagaggaatttg gctatcactggaggagtgactttgtcggtcattgcatccccagttattgctgcagttagt gttggtattggtgtccccattatgctggcatatgtttatggggttgtgcccatttctctt tgtcgtggaggcggctgtggagttagcacagccaacggaaaaggagtgaaaattgaattt gatgaagatgatggtccaatcacagtggcagatgcctggagagccctcaagaatcccagc attggggaaagcagcattgaaggcctgactagtgtattgagcactagtggaagccctaca gatggacttagtgttatgcaaggtccttacagcgaaacggccagctttgcagccctctca gggggcacgctgagtggcggcattctctccagtggcaagggaaaatatagcaggttagaa gttcaagccgatgtccaaaaggaaattttccccaaagacacagccagtcttggtgcaatt agtgacaacgcaagcactcgtgctatggccggttccataatcagttcctacaacccacag gacagagaatgcaacaatatggaaatccaagtggacattgaagccaaaccaagccactat cagctggtgagtggaagcagcacggaggactcgctccatgttcatgctcagatggcagag aatgaagaagaaggtagtggtggcggaggcagtgaagaggatcccccctgcagacaccaa agctgtgaacagaaagactgcctggccagcaaaccttgggacatcagcctggcccagcct gaaagcatccgcagtgacctagagagttctgatgcacagtcagacgatgtgccagacatc acctcagatgagtgtggctccccccgctcccatactgcagcctgcccctcgacccccaga gcccaagacggcggacttacaggggaggcgccacctgaagacgttccagggccccataag accctatcttccggaggggaacggatcgaccaccggtgtgtgcccacaaaaaattcaact cctaagtcctcagtttctagtcccgggaagaaagccagaattgaaagcacctttgagatc gaccagtccaacgccttcattcgacaaagaaggaaacgaaggcacaaggaggaagacata aaaaccacgaggccgagtgcctacagctccctcagtcgcctcctgaggccagtggacctg agctga >gi568815597r:32913184_33136828|GENSCAN_predicted_peptide_2|175_aa MKALGTQQEYKVVCHCLPTPKCHTLPLYHMQIFAPNHVVAKFHFWYFLSQLKKMKKSSGE TVNCGQVFEKYPLWVKNFGIWLRYDSRSSTHNMYREYRDLTTMGAVTQCYQDMGTQYRAR ANFIQIMKVEEIAASKCWWPVVKQFHDSKIKFLLPHLVLCHQQKPRFTRRPNTFF >gi568815597r:32913184_33136828|GENSCAN_predicted_CDS_2|528_bp atgaaggccttgggcacacaacaagagtacaaggtggtgtgtcactgcctgcccaccccc 
aaatgccacacactgcccctctaccacatgcaaatctttgcgcctaatcatgtagtcgcc aagttccacttctggtacttcctatctcaattaaagaagatgaagaagtcttcaggggag actgtcaactgtgggcaggtgtttgagaagtacccactgtgggtgaagaactttggcatc tggctgcgctatgactcccggagcagcacccacaacatgtacagggaataccgggaccta accaccatgggcgctgtcacccagtgctaccaagacatgggcacccagtaccgcgcccgg gccaacttcatccagatcatgaaggtggaggagattgcggccagcaagtgctggtggcca gttgtcaagcaattccacgactccaagatcaagttcctgctgccccacttggtgctgtgc catcagcagaagccacgcttcaccaggagacccaacaccttcttctag >gi568815597r:32913184_33136828|GENSCAN_predicted_peptide_3|232_aa MPKELQGSMERATDCLQCPGWLPYRKDVELLHLPDREGIPELQFEHEVLENQLPEAAQEE CYQVRDDLFQVIWPGLLSGLKRGIEYQVLVFVLLSENFLKIAWERDGRGYLLGALNTQTN VATVVPNGNTHLEPGMLVSAGLLLHGRGLQNLILERCPQEKVNDLRLLDGQGEGLDFLQG LDLHVLDQVAQLGDRHPFLIFILASASSMAQALTPTTIQAPMPLPKPPWKPL >gi568815597r:32913184_33136828|GENSCAN_predicted_CDS_3|699_bp atgcccaaggagctgcagggctccatggagagagcaactgactgcctgcagtgtcctgga tggcttccctataggaaggacgtggagctactacatcttccagacagggaaggcatccct gagttgcagtttgaacatgaggttctagagaaccagcttcctgaagcagcacaggaagag tgttaccaagtacgagatgacttgtttcaggtcatctggccagggctgcttagtggactc aaaagaggtatcgagtatcaggttttggtttttgttctgctatcagaaaattttttgaag attgcctgggagagagatggccgtggctacctcctcggagcacttaacacccagaccaac gtggccactgtagtccccaatggcaacacacaccttgaacctggtatgctggtcagcgca ggtctgcttttgcacgggcgtggtcttcaaaacctcatccttgagagatgcccccaggaa aaagtcaatgatctcagactcctcgatggtcaaggagaagggctagatttcctccaggga cttgatcttcatgtccttgaccaggtagcccaacttggtgacaggcatccattcctcatc ttcatccttgcctctgcaagctccatggcccaggccctgaccccaaccactatccaggcc ccaatgccactgccaaagcctccgtggaaaccactatga >gi568815597r:32913184_33136828|GENSCAN_predicted_peptide_4|239_aa MAPSVPAAEPEYPKGIRAVLLGPPGAGKGTQAPRLAENFCVCHLATGDMLRAMVASGSEL GKKLKATMDAGKLVSDEMVVELIEKNLETPLCKNGFLLDGFPRTVRQAEMLDDLMEKRKE KLDSVIEFSIPDSLLIRRITGRLIHPKSGRSYHEEFNPPKEPMKDDITGEPLIRRSDDNE KALKIRLQAYHTQTTPLIEYYRKRGIHSAIDASQTPDVVFASILAAFSKATCKDLVMFI >gi568815597r:32913184_33136828|GENSCAN_predicted_CDS_4|720_bp atggctcccagcgtgccagcggcagaacccgagtatcctaaaggcatccgggccgtgctg 
ctggggcctcccggggccggtaaagggacccaggcacccagattggctgaaaacttctgt gtctgccatttagctactggggacatgctgagggccatggtggcttctggctcagagcta ggaaaaaagctgaaggcaactatggatgctgggaaactggtgagtgatgaaatggtagtg gagctcattgagaagaatttggagacccccttgtgcaaaaatggttttcttctggatggc ttccctcggactgtgaggcaggcagaaatgctcgatgacctcatggagaagaggaaagag aagcttgattctgtgattgaattcagcatcccagactctctgctgatccgaagaatcaca ggaaggctgattcaccccaagagtggccgttcctaccacgaggagttcaaccctccaaaa gagcccatgaaagatgacatcaccggggaacccttgatccgtcgatcagatgataatgaa aaggccttgaaaatccgcctgcaagcctaccacactcaaaccaccccactcatagagtac tacaggaaacgggggatccactccgccatcgatgcatcccagacccccgatgtcgtgttc gcaagcatcctagcagccttctccaaagccacatgtaaagacttggttatgtttatctaa >gi568815597r:32913184_33136828|GENSCAN_predicted_peptide_5|638_aa MKRSRCRDRPQPPPPDRREDGVQRAAELSQSLPPRRRAPPGRQRLEERTGPAGPEGKEQP PALASQSAEIAASARPPPRLGRLLGFQKACRCWSLNPHILMALLRSLVPPDKKHPQVWRG RPPLHLAPNVGLFSRVKVRSSVVIEDKSMRDSRRGLSQRRRRRKKKKRGSSSKKKKRRKK RKKKKRKKRKRRKNRKKKKKRKNKRKKKRKKEEKKEEEEERRKKEEEDEEGRGRGRRKRK RKKRKKRRSRKKKETAAAAAAGERLGKWWPGECPVECVAYFLRRRLQQRLHPARQLLLQG MAGYLSESDFVMVEEGFSTRDLLKELTLGASQATTDEVAAFFVADLGAIVRKHFCFLKCL PRVRPFYAVKCNSSPGVLKVLAQLGLGFSCANKAEMELVQHIGIPASKIICANPCKQIAQ IKYAAKHGIQLLSFDNEMELAKVVKSHPSANFHIGSGCPDPQAYAQSIADARLVFEMGTE LGHKMHVLDLGGGFPGTEGAKVRFEEIASVINSALDLYFPEGCGVDIFAELGRYYVTSAF TVAVSIIAKKEVLLDQPGREEENGSTSKTIVYHLDEGVYGIFNSVLFDNICPTPILQKYS SSCCMLALTHSTPFIGSSEEEMMVPAHCHAPHRDLCFG >gi568815597r:32913184_33136828|GENSCAN_predicted_CDS_5|1917_bp atgaagcggagccgctgccgtgaccgaccgcagccgccgccgcccgaccgccgggaggat ggagttcagcgggcagcggagctgtctcagtctttgccgccgcgccggcgagcgccgccc gggaggcagcggctggaggagcggacgggccccgcggggcccgagggcaaggagcagccg cctgccttggcctcccaaagtgccgagattgcagcctctgcccggccgccaccccgtctg ggaaggcttctgggattccagaaagcctgcaggtgttggagcctcaaccctcatatcctc atggccctgctgaggtctcttgtcccacctgacaagaaacacccacaggtgtggaggggc aggccaccccttcatctggcgcccaacgtggggcttttctctagggtgaaggtacgctcg agcgtggtcattgaggacaagtcaatgagagattcccgaagaggcctatctcaaagaaga agaaggagaaagaagaagaagagaggtagcagcagcaagaagaagaagaggaggaagaag 
agaaagaagaagaagaggaagaagaggaagaggaggaagaataggaagaagaagaagaag aggaagaacaagaggaagaagaagaggaagaaggaagaaaagaaagaagaggaagaagaa agaagaaagaaggaggaggaagatgaagaaggaagaggaagaggaagaaggaagaggaag aggaagaagaggaagaaaagaagaagcagaaagaagaaagaaacagcagcagcagcagca gctggagaaaggctgggaaaatggtggccaggagagtgtccggtagagtgtgttgcatac tttctaaggcggcggctgcagcagcggctccatccagcccgtcagctcctcctgcaaggc atggctggctacctgagtgaatcggactttgtgatggtggaggagggcttcagtacccga gacctgctgaaggaactcactctgggggcctcacaggccaccacggacgaggtagctgcc ttcttcgtggctgacctgggtgccatagtgaggaagcacttttgctttctgaagtgcctg ccacgagtccggcccttttatgctgtcaagtgcaacagcagcccaggtgtgctgaaggtt ctggcccagctggggctgggctttagctgtgccaacaaggcagagatggagttggtccag catattggaatccctgccagtaagatcatctgcgccaacccctgtaagcaaattgcacag atcaaatatgctgccaagcatgggatccagctgctgagctttgacaatgagatggagctg gcaaaggtggtaaagagccaccccagtgccaattttcacattggcagtggctgtcctgac cctcaggcctatgctcagtccatcgcagacgcccggctcgtgtttgaaatgggcaccgag ctgggtcacaagatgcacgttctggaccttggtggtggcttccctggcacagaaggggcc aaagtgagatttgaagagattgcttccgtgatcaactcagccttggacctgtacttccca gagggctgtggcgtggacatctttgctgagctggggcgctactacgtgacctcggccttc actgtggcagtcagcatcattgccaagaaggaggttctgctagaccagcctggcagggag gaggaaaatggttccacctccaagaccatcgtgtaccaccttgatgagggcgtgtatggg atcttcaactcagtcctgtttgacaacatctgccctacccccatcctgcagaagtactcg tcttcctgttgcatgctggctctcacccactccactcccttcattggctcctcagaagag gagatgatggtcccagcccactgccacgccccccaccgggatctttgctttggctga >gi568815597r:32913184_33136828|GENSCAN_predicted_peptide_6|226_aa MKDKNHMIISVDGEQAFDKIQHPFMIKTFNKLGMEGMYLNMIRAIYDKLTANVILNKDPK KPSTEQPLYSSSLWGPAVDGCDCVAEGLWLPQLHVGDWLVFDNMGAYTVGMGSPFWGTQA CHITYAMSRVAWEALRRQLMAAEQEDDVEGVCKPLSCGWEITDTLCVGPVFTPASIIARG LVSQVGKAVRVKQTPGGRFCFFGQCLLEQGWPRSIAVSSADSPGIP >gi568815597r:32913184_33136828|GENSCAN_predicted_CDS_6|681_bp atgaaggataaaaatcatatgatcatctcagtagatggagaacaagcatttgacaaaatt cagcatcctttcatgataaaaactttcaacaaattaggtatggaaggaatgtacctcaac atgataagggccatatatgacaagctcacagctaatgttatactcaacaaggatcctaag 
aaaccatccacggagcagcccctgtacagcagcagcctgtggggcccggcggttgatggc tgtgattgcgtggctgagggcctgtggctgccgcaactacacgtaggggactggctggtc tttgacaacatgggcgcctacactgtgggcatgggttcccccttttgggggacccaggcc tgccacatcacctatgccatgtcccgggtggcctgggaagcgctgcgaaggcagctgatg gctgcagaacaggaggatgacgtggagggtgtgtgcaagcctctgtcctgcggctgggag atcacagacaccctgtgcgtgggccctgtcttcaccccagcgagcatcattgcaaggggc ctggtcagccaggttggcaaggcagtcagagtaaagcagacacctggtggtcgcttttgc ttctttgggcagtgcctgttagaacagggctggccacggagtattgctgtgtccagtgcc gacagccctggcatcccctga >gi568815597r:32913184_33136828|GENSCAN_predicted_peptide_7|71_aa MGGLQAAVITIPALLSSSTLLVISCINGKRAAGGPRSPAPAHHPHEGPRNSDDLLILRNI TLYCPKDHQNG >gi568815597r:32913184_33136828|GENSCAN_predicted_CDS_7|216_bp atgggtggcttgcaagctgcagtgatcaccatcccggccctgctctcctcatccactctg ctggtcatcagctgtataaatggaaaacgtgccgccggtggtcccaggtcacctgcccca gcccaccatccccatgaggggcccagaaattctgatgacttgctgatacttcggaacatc accctttattgtccaaaagatcaccagaatgggtaa