GENSCAN 1.0 Date run: 8-Nov-116 Time: 11:18:31 Sequence gi568815587r:60072161_60281727 : 209567 bp : 38.77% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 5834 6259 426 1 0 70 41 380 0.982 27.54 1.02 PlyA + 6659 6664 6 1.05 2.00 Prom + 16470 16509 40 -6.25 2.01 Init + 16606 16661 56 0 2 77 96 31 0.961 3.61 2.02 Intr + 17532 17661 130 0 1 86 84 63 0.859 5.48 2.03 Intr + 21240 21398 159 2 0 72 40 96 0.820 2.46 2.04 Intr + 21804 21902 99 2 0 74 71 68 0.836 3.09 2.05 Term + 23398 23496 99 0 0 98 48 47 0.845 -1.15 2.06 PlyA + 24260 24265 6 1.05 3.03 PlyA - 25652 25647 6 1.05 3.02 Term - 30751 30341 411 1 0 65 48 193 0.796 7.26 3.01 Init - 37096 37085 12 1 0 66 108 0 0.207 -0.04 3.00 Prom - 40631 40592 40 -5.75 4.00 Prom + 42653 42692 40 -3.55 4.01 Init + 46833 47072 240 2 0 46 52 155 0.222 5.62 4.02 Intr + 49873 49928 56 0 2 92 113 18 0.661 1.56 4.03 Intr + 52960 53054 95 1 2 102 36 65 0.711 1.39 4.04 Term + 53180 53334 155 0 2 58 55 100 0.389 0.80 4.05 PlyA + 53524 53529 6 1.05 5.00 Prom + 57203 57242 40 -3.35 5.01 Sngl + 62105 62611 507 1 0 50 39 293 0.171 16.39 5.02 PlyA + 63030 63035 6 1.05 6.18 PlyA - 63086 63081 6 1.05 6.17 Term - 72406 72318 89 2 2 94 47 73 0.432 0.64 6.16 Intr - 72907 72840 68 0 2 56 86 54 0.237 -0.37 6.15 Intr - 82404 81456 949 1 1 45 53 239 0.001 5.35 6.14 Intr - 89346 89184 163 0 1 66 84 164 0.714 12.43 6.13 Intr - 97036 96908 129 1 0 43 69 95 0.081 3.07 6.12 Intr - 103451 103242 210 2 0 91 87 52 0.134 3.59 6.11 Intr - 105255 105152 104 1 2 44 44 75 0.079 -2.33 6.10 Intr - 106156 106100 57 2 0 92 97 15 0.411 0.74 6.09 Intr - 107805 107671 135 1 0 121 59 25 0.661 2.52 6.08 Intr - 109581 109339 243 1 0 57 72 161 0.026 7.95 6.07 Intr - 112245 112160 86 2 2 36 80 79 0.001 0.44 6.06 Intr - 113939 113835 105 1 0 91 85 57 0.001 4.11 6.05 Intr - 132798 132730 69 2 0 94 84 55 0.075 3.08 6.04 Intr - 133606 133554 53 1 2 88 44 42 0.043 -3.41 6.03 Intr - 140972 140814 159 2 0 103 101 70 0.658 9.06 6.02 Intr - 143917 143891 27 1 0 92 87 40 0.500 1.69 6.01 Init - 145283 145128 156 2 0 67 105 16 0.373 1.19 6.00 Prom - 146746 146707 40 -9.55 7.00 Prom + 147113 147152 40 -9.95 7.01 Init + 147879 148189 311 2 2 69 107 212 0.877 18.03 7.02 Intr + 148268 148466 199 2 1 18 91 107 0.775 2.43 7.03 Intr + 149254 149435 182 0 2 109 5 147 0.477 6.24 7.04 Term + 149817 150381 565 0 1 8 45 326 0.487 13.18 7.05 PlyA + 151516 151521 6 1.05 8.04 PlyA - 151964 151959 6 1.05 8.03 Term - 161320 161084 237 1 0 53 36 200 0.623 6.48 8.02 Intr - 162095 161957 139 1 1 53 81 55 0.700 0.85 8.01 Init - 163845 163751 95 0 2 42 64 91 0.689 2.00 8.00 Prom - 177104 177065 40 -1.55 9.03 PlyA - 177863 177858 6 1.05 9.02 Term - 186845 186402 444 2 0 83 41 153 0.216 4.35 9.01 Init - 196952 196890 63 2 0 68 86 45 0.575 3.50 9.00 Prom - 197853 197814 40 -2.25 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 13726 13373 354 1 0 44 41 221 0.951 8.80 S.002 Sngl + 82821 83501 681 2 0 42 28 301 0.918 15.43 S.003 Init - 109567 109339 229 1 1 82 72 145 0.847 10.88 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587r:60072161_60281727|GENSCAN_predicted_peptide_1|141_aa MLHFNNNNNNNNNNNNNNNKETTIFTWPEAAAGYDTKGILDQSQQAYQEACEITKKEMQP TDPIRLGMALNFSVFYYELLNSPEKSHSLVKAAFDEALAELDTLSEESYKDRLLMQLLRD NLTLWTLDTQGDETEARGRKS >gi568815587r:60072161_60281727|GENSCAN_predicted_CDS_1|426_bp atgctccatttcaacaacaacaacaacaacaacaacaacaacaacaacaacaacaacaaa gagactactatttttacttggcctgaggctgctgctggttatgacacaaaagggatccta gatcagtcacaacaagcataccaagaagcttgtgaaatcaccaaaaaggaaatgcaacca acagatcctatcagattgggtatggctctaaacttctctgtcttctattatgagcttctg aactccccagagaaatctcattcacttgtaaaggcagcttttgatgaagcccttgctgaa cttgatacattaagtgaagagtcatacaaagatcgcctgctaatgcagttactgagagac aacttgacactgtggacattggatactcaaggagatgaaactgaagcaagaggaagaaaa agttaa >gi568815587r:60072161_60281727|GENSCAN_predicted_peptide_2|180_aa MDTESNRRANLALPQEPSSVPAFEVLEISPQEVSSGRLLKSASSPPLHTWLTVLKKEQEF LGVRGSLGANTASSIAGGTGITILIINLKKSLAYIHIHSCQKFFETKCFMASFSTEIVVM MLFLTILGLGSAVSLTICGAGEELKGNKVPEDRVYEELNIYSATYSELEDPGEMSPPIDL >gi568815587r:60072161_60281727|GENSCAN_predicted_CDS_2|543_bp atggacacagaaagtaataggagagcaaatcttgctctcccacaggagccttccagtgtg cctgcatttgaagtcttggaaatatctccccaggaagtatcttcaggcagactattgaag tcggcctcatccccaccactgcatacatggctgacagttttgaaaaaagagcaggagttc ctgggggtgagaggaagcctgggagcaaacactgccagcagcatagctgggggaacggga attaccatcctgatcatcaacctgaagaagagcttggcctatatccacatccacagttgc cagaaattttttgagaccaagtgctttatggcttccttttccactgaaattgtagtgatg atgctgtttctcaccattctgggacttggtagtgctgtgtcactcacaatctgtggagct ggggaagaactcaaaggaaacaaggttccagaggatcgtgtttatgaagaattaaacata tattcagctacttacagtgagttggaagacccaggggaaatgtctcctcccattgattta taa >gi568815587r:60072161_60281727|GENSCAN_predicted_peptide_3|140_aa MEIQVIKASIIGQYCIARERKEFTHPVGRLSCLRQKLYNGTTETVTSWSSNHTERNPFSK FPKLRTVWTHPESHRDWTAPTGLYWICGHRAFAKLPDESAGSCVIGTIKPSFFLLPIRTG ELLSFPVYASREKKSIAIEN >gi568815587r:60072161_60281727|GENSCAN_predicted_CDS_3|423_bp atggaaattcaagtcataaaagcctcaattattggacaatattgcatagctagagaaaga aaagaattcactcaccccgtaggacgacttagttgtctaagacagaaactgtataatggt accacagaaacagtcacttcgtggagttcaaatcacacagagagaaatccatttagtaaa ttcccaaagttgcgaaccgtttggacccatccagagtcccaccgggactggacagccccc actggattatactggatatgtgggcatagagcttttgccaaattacctgacgagtcggca ggtagttgtgttattggcactattaaaccatctttcttcttactgcccataaggacaggt gaactcctgagcttccctgtctatgcttcccgcgaaaagaaaagcatagctatagaaaat tga >gi568815587r:60072161_60281727|GENSCAN_predicted_peptide_4|181_aa MKENLLGELTYMITRAACKLRSQEASPGPKISEVGKLIMRFQSLAEGLRAPGKSLVLSLR IQKLKNLESDVRGQEVSNKRMGPPSCRKTNMAPTDSTLRVFEQPSPLAMQPGKKPVSGIL HSSVAQSGSEETQHIDFCDQTCGDFSPPKRQAIISGVNTSWLSFNSVETSSKLEIISDSM V >gi568815587r:60072161_60281727|GENSCAN_predicted_CDS_4|546_bp atgaaggaaaatttattaggagaattgacttacatgatcacaagggcagcctgcaagctg aggagccaggaagccagtccgggtcccaaaatctcagaagtagggaagctgataatgcga tttcagtctttggccgaaggcctgcgagcccctggcaaatcactagtgttaagtctaaga atccaaaaactgaagaacttggagtctgatgttcgagggcaggaagtatccaacaaaaga atgggaccacctagttgcaggaaaacaaacatggctcccactgattctacattacgggtc tttgagcagcccagtcccctagccatgcagccaggcaagaaacctgtgtctggcatactc cattcatccgtcgctcagtcagggtctgaggagacacaacatatagacttctgcgaccaa acgtgtggggatttctccccaccaaaaaggcaagcaatcatttctggagtgaacaccagc tggctgtccttcaattctgttgagacatcatctaaactggagataatatcagattccatg gtttga >gi568815587r:60072161_60281727|GENSCAN_predicted_peptide_5|168_aa MVLGLQVHRSQELRLYRNAWMSRQRGVAGLEPSWRTSARAMWKGNVGYEPPHRVPTGALP NGAVRRRPPSSRLQNGRCTDSLHCMPGKAADTQRQPMKAARRVAIHCKATEAELLKAMGA HFLHQHDLDVRHRVKGDHFGTLRFNVYAVGFQTCMRPAALLFWTVSPM >gi568815587r:60072161_60281727|GENSCAN_predicted_CDS_5|507_bp atggtgttgggtctgcaagtgcacagaagtcaagaattgagattgtatagaaatgcctgg atgtccaggcagaggggtgttgcagggctggagccatcatggagaacctctgctagggca atgtggaagggaaatgtggggtatgagcccccacacagagtccccactggagcactgcct aatggagctgtgagaagacggccaccatcctccagactccagaatggtagatgcactgac agcttgcactgtatgcctggaaaagctgcagacactcaacgccagcccatgaaagcagcc aggagggtggccatacactgcaaagccacagaggcagagcttctcaaggccatgggagcc cacttcttgcatcagcatgacctagatgtgagacatagagtcaaaggagatcattttgga actttaaggtttaatgtctatgctgttggatttcagacttgcatgaggcctgcagccctt ttgttttggacagtttctcccatgtag >gi568815587r:60072161_60281727|GENSCAN_predicted_peptide_6|933_aa MGKLFFLAKKRLLRVYKLSVPSFIKVLPHIPIPSCRVPFFFNQCSRFNSRHLYKYQYQEP AVGGSLGKNITSSVLAISGILINAISLTFYSFRYHYCNHDQLSSNCYMTMSILMMQYDGK RLNVDVDSTVWCSGDGQYLEKVYYDILSSWLRIERGLLCTALMRHSTGAIAYLGVLSGSA SLKLAGVPLRCCEGDKDAGHPLETQTALCERGRGARSLVGNTIMTSQPVPNETIIVLPSN VINFSQAEKPEPTNQGQDSLKKHLHAEIKVIGVNLIQNVLERGWGKCQEMIYVLGLDICH YPDLVWHDGIELGDHFGICFLLSKFYPSDFYTVELCLPIHRTLFFYHLWLSINRHREKVN QAFGVDVGDYVRECHTMQLHRDSIWKLWLQKSPKASKEVHSSLVGSILSALSALVGFIIL SVKQATLNPASLQCELDKNNIPTRSYVSYFYHDSLYTTDCYTAKASLAVSHVAPACVVVP DGSCAARMGLGGNMGGGGQVSCEHGEGTEVPQYLDIRGRCSQGKDHKLSLERDHLTTKPV GGLDLALTATEGFEAFSEKAAPGPREKMAILPKVIDRVDAIPIKLPMPFFTELEKTTLKF IWNQKRARIAKSILSQKNKPGGIMLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEI TPHIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDL NIRPKTIKTLEENLGITIQGIGMGKDFMSKTPKAMATEAKIDKWDLIKLKSFCTAKETTI RVNRQPTKWEKIFTTYSSDKGLISRIYNELKQICKKKTNNPIKKWAKDMNRHFSKEDIYA AKRHMKKCSPSLAIREMQIKTTMRYHLTPVRMAIIKKSGNNRKQEEKKEASTYFKCLIQK LSYEEYLLMDKHSFVFLFYKSCEYSDVDGNMFA >gi568815587r:60072161_60281727|GENSCAN_predicted_CDS_6|2802_bp atggggaagctgttttttctggcaaaaaaaaggctcctgagagtttacaagctcagtgtc cccagcttcatcaaggttctaccgcacatccccattccaagttgcagggtcccattcttt ttcaatcaatgctctcgctttaacagtagacacctgtataagtaccagtaccaagagcca gctgttggaggtagtctaggaaagaatatcaccagttcagtcttggctatatcagggatc ttaatcaatgcaataagcttgacgttttattcattccgttaccattactgtaaccacgat cagttgtcaagtaattgttacatgactatgtccattttaatgatgcaatatgatggaaag agactaaatgttgatgtcgacagtactgtatggtgctccggagatggacagtatctggaa aaagtctattatgacattctatcttcttggttgaggattgaaaggggcctgctgtgcact gctctgatgaggcattccactggggcaattgcctacctgggagtgctctcaggatctgct tcactcaagctggctggagtccccctcagatgctgtgagggtgacaaagatgcagggcac ccactggaaacacagacggcactctgcgaaagaggaaggggcgccaggagcttggttggc aacaccatcatgacatcacaacctgttcccaatgagaccatcatagtgctcccatcaaat gtcatcaacttctcccaagcagagaaacccgaacccaccaaccaggggcaggatagcctg aagaaacatctacacgcagaaatcaaagttattggggtaaatctaattcagaacgtgttg gagaggggttgggggaagtgccaagagatgatatatgtcttgggactggacatctgtcac tatccagatcttgtgtggcatgatggtattgagcttggggatcattttggcatctgcttc cttctctccaaattttacccaagtgacttctacactgttgaactctgcttacccattcat aggaccctttttttttatcatctctggctctctatcaatcgccacagagaaaaggttaac caagcttttggggttgatgttggagattatgtgagagaatgtcataccatgcaattgcac cgagactcaatttggaagctctggctacaaaaatctcccaaagccagcaaggaagtgcat agcagcctggttggaagcattctgagtgctctgtctgccctggtgggtttcattatcctg tctgtcaaacaggccaccttaaatcctgcctcactgcagtgtgagttggacaaaaataat ataccaacaagaagttatgtttcttacttttatcatgattcactttataccacggactgc tatacagccaaagccagtctggctgtcagccatgtggccccagcatgtgttgtggtacct gatggtagctgtgctgctaggatgggccttggtggaaacatgggtggtggtggacaggtc tcctgtgagcatggagaaggcactgaagtacctcaatacctggacatcagaggcagatgc agtcaagggaaggatcataaactttctttggaaagagaccatcttactaccaagccagtc gggggcttggatcttgccctaacagcaactgaaggatttgaggctttttctgaaaaggca gccccaggtcctagggagaaaatggccatactgcccaaggtaattgatagagtcgatgcc atccccatcaagctaccaatgcctttcttcacagaactggaaaaaactactttaaagttc atatggaaccaaaaaagagcccgaattgccaagtcaatcctaagccaaaagaacaaacct ggaggcatcatgctacctgacttcaaactatactacaaggctacagtaaccaaaacagca tggtactggtaccaaaacagagatatagatcaatggaacagaacggagccctcagaaata acgccgcatatctacaactatctgatctttgacaaacctgagaaaaacaagcaatgggga aaggattccctatttaataaatggtgctgggaaaactggctagccatatgtagaaagctg aaactggatcccttccttacaccttatacaaaaatcaattcaagatggattaaagactta aacattagacctaaaaccataaaaaccttagaagaaaacctaggcattaccattcagggc ataggcatgggcaaggacttcatgtctaaaacaccaaaagcaatggcaacagaagccaaa attgacaaatgggatctcattaaactaaagagcttctgcacagcaaaagaaactaccatc agagtgaacaggcaacctacaaaatgggagaaaattttcacgacctactcatctgacaaa gggctaatatccagaatctacaatgaactcaaacaaatttgcaagaaaaaaacaaacaac cccatcaaaaagtgggcaaaggacatgaacagacacttctcaaaagaagacatttatgca gccaaaagacacatgaaaaaatgctcaccatcactggccatcagagaaatgcaaatcaaa accacaatgagataccatctcacaccagttagaatggcgatcattaaaaagtcaggaaac aacaggaaacaggaagagaagaaagaagcatccacttattttaagtgtttaattcagaaa ttgtcctatgaagaatatctcctgatggataaacattcttttgtcttccttttctacaaa tcctgtgagtactcggatgtagatggaaacatgtttgcctga >gi568815587r:60072161_60281727|GENSCAN_predicted_peptide_7|418_aa MVEKAKWKPLELPLPRKAVNQQQYPIPEGIADISATIKDLKYARVMIPTTSPFNSPIWPV QKTDGSWRMTVDYHELNQVVTPVAVAVPDVVSLLEQINPSSGTCWQDQQCTFTVLPQGYI NSSLALCHDLVQGDLDHFSLPQNITLVHYIDAIILMGSSEQEVASTLDLLPALMVSWGVP QDQLTEEEKTRALFTDGSARYAGTTSKWSAAALQPLPRTSLKDSGEGKSSQDGGYAWAQQ HGLPFTDLAMATAEHLICQQQRPALSPQYSTIPRNDQPATWWQVDYIRPLPSRKGQRFFL TRIDTYFGYGFAYPAHNASGKTTIQELMECLNHCHGIPHSIASDQGTHLLAKEMRQWAHS HGIHWSYHVSHHPEAAGLIEWWNVLLKSQLQHQPGDNTFQGWGNVLQKAVHALNQRLI >gi568815587r:60072161_60281727|GENSCAN_predicted_CDS_7|1257_bp atggtggaaaaggccaaatggaagccattagagctgcctctacctagaaaagcagtaaat caacaacaatatcccatccctgaagggattgcagacattagtgccaccatcaaggactta aaatatgcaagggtgatgattcccaccacatccccattcaactctcccatttggccagtg cagaagacagatggatcttggagaatgacagtggattatcatgagcttaaccaagtagtg actccagttgcagttgctgtaccagatgtggtttcattgcttgagcaaattaacccatct tctggtacctgctggcaagaccagcaatgtacctttactgtcctacctcaggggtatatc aactcttctctggctttgtgtcatgaccttgttcagggagatcttgatcacttttccctt ccacaaaatatcacactggtccattatattgatgccattatactgatgggatccagtgag caagaagtagcaagcacactggacttattgcctgcactgatggtctcatggggagttccc caggatcaattaacagaggaagagaagacaagggccctgtttacagatggctctgcacga tatgcaggcaccacctcaaagtggtcagctgcagcactacagccccttcctaggacatcc ctgaaggacagcggtgaaggaaaatcgtcccaggatggaggatatgcatgggctcagcaa catggacttccattcactgacctggctatggccactgctgagcacctaatttgccagcag caaagaccagcactgagccctcaatatagcaccattcctcggaatgatcagccagctact tggtggcaagttgattatattagacctcttccatcacggaaagggcagcggtttttcctc accagaatagacacttactttggatatgggtttgcctatcctgcacacaatgcttctggc aagactacaatccaagaactcatggaatgccttaaccactgtcatggtatcccacatagc attgcctctgaccaaggcactcacttgttggctaaagaaatgcggcagtgggctcattct catggaattcactggtcttaccatgtttcccatcatcctgaagcagctggattgatagaa tggtggaatgtccttttgaagtcacaattacaacatcaaccaggtgacaatacttttcag ggctggggcaatgttctccagaaggctgtgcatgctctgaatcagcgtctaatataa >gi568815587r:60072161_60281727|GENSCAN_predicted_peptide_8|156_aa MFEQLQIHMQKNEVGHFLTPNKTAQSGAKTKIIKSKVLSPESSEADMDEIQGMTYLEANV SSCEPVKSKPVICFQNIMVTHKAKAVTLKHTSVTQAHGAGMELRLILGSVHNIIKAPGKE GCFGGLSSGEQVTVEILVPETIGHSSNSNPGEWSAT >gi568815587r:60072161_60281727|GENSCAN_predicted_CDS_8|471_bp atgtttgaacaacttcagattcacatgcaaaagaatgaagttggacactttcttacacca aacaaaacagctcaaagtggagcaaaaaccaaaatcatcaagtctaaagttctaagtcca gagtcatcagaagcagatatggatgagattcaaggtatgacttatctggaggcaaatgtt tccagctgtgagcctgtgaaatcaaaaccagtgatttgtttccaaaatataatggttacc cacaaagctaaggctgtaactctaaagcacacctcagtaactcaagcccatggagcaggg atggaattgagactgattctggggtcagtgcacaacatcatcaaggctccagggaaagag gggtgctttggaggcttgagctctggggagcaggttactgttgaaattttggttccagag acaatcgggcatagcagcaattccaaccctggggaatggagtgccacatag >gi568815587r:60072161_60281727|GENSCAN_predicted_peptide_9|168_aa MDTKKGKTDTGSYWRVEGEKRCPENSHHQYSRAHAQARPPKLPGICHGVMGHELHTVSLL EKALPFCYIDCLFLVGSDEGTAQQGLDTLLIHMQQPSWAINTDKIKRPNNIQMKLIKLLG IIQKAEKPLIAKMIVKILNFSVPPHQKPQKKPQQLVDLFGYWYHSYNT >gi568815587r:60072161_60281727|GENSCAN_predicted_CDS_9|507_bp atggacacaaagaagggaaaaacagacactgggtcctactggagggtagagggtgagaag aggtgccctgagaacagccaccaccaatacagcagagcacatgcacaggcaaggccacct aagctcccaggtatctgccatggagtgatgggccatgagttacacacagtctcactcctg gagaaagctctgcccttctgctatattgattgtctcttcctggtgggctcagatgaaggg actgcccaacaaggactagacaccctgttgatccacatgcagcaacccagctgggccatt aacacagacaagataaaaagacctaataatatccaaatgaaactaataaaactcctgggc ataattcagaaagcagaaaagcctctcattgcaaagatgattgtcaagattttgaacttc tctgtccctccccaccaaaaaccacaaaaaaaaccccagcaattggtagacctatttggc tactggtatcactcctacaatacctga