GENSCAN 1.0 Date run: 6-Nov-116 Time: 06:11:20 Sequence gi568815586r:114571844_114783200 : 211357 bp : 46.15% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 3977 4054 78 2 0 77 116 37 0.421 4.07 1.02 Term + 16428 16686 259 0 1 20 44 194 0.758 3.22 1.03 PlyA + 17264 17269 6 1.05 2.00 Prom + 19471 19510 40 -3.96 2.01 Init + 27919 28054 136 0 1 59 36 117 0.213 3.90 2.02 Intr + 38279 38350 72 0 0 97 69 53 0.182 3.68 2.03 Term + 43207 43286 80 2 2 93 42 30 0.044 -3.17 2.04 PlyA + 44014 44019 6 1.05 3.12 PlyA - 44914 44909 6 1.05 3.11 Term - 59682 59539 144 0 0 65 44 133 0.459 4.51 3.10 Intr - 61490 61429 62 0 2 94 98 35 0.238 3.55 3.09 Intr - 78193 77987 207 2 0 95 36 80 0.059 2.55 3.08 Intr - 95352 95187 166 0 1 7 34 206 0.127 6.63 3.07 Intr - 100459 100049 411 1 0 131 33 650 0.017 58.18 3.06 Intr - 102992 102322 671 0 2 45 80 827 0.990 69.02 3.05 Intr - 104627 104470 158 1 2 65 92 167 0.999 14.45 3.04 Intr - 105813 105737 77 0 2 90 113 94 0.999 10.41 3.03 Intr - 107808 107662 147 0 0 117 92 133 0.999 17.03 3.02 Intr - 109303 109036 268 0 1 97 78 133 0.997 10.73 3.01 Init - 111315 110969 347 0 2 92 121 471 0.948 45.49 3.00 Prom - 111582 111543 40 -7.86 4.00 Prom + 112473 112512 40 -3.06 4.01 Init + 113439 113543 105 2 0 59 86 47 0.508 1.34 4.02 Intr + 120393 120601 209 2 2 31 91 129 0.446 5.38 4.03 Intr + 125328 125432 105 0 0 66 21 168 0.347 7.23 4.04 Term + 164426 164657 232 2 1 73 50 115 0.224 2.15 4.05 PlyA + 165155 165160 6 1.05 5.04 PlyA - 165901 165896 6 1.05 5.03 Term - 169249 168971 279 1 0 35 55 131 0.113 -0.05 5.02 Intr - 175476 175375 102 0 0 71 87 41 0.169 2.67 5.01 Init - 199867 199799 69 1 0 64 22 162 0.904 8.25 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 100459 99998 462 1 0 131 41 687 0.973 63.36 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:114571844_114783200|GENSCAN_predicted_peptide_1|112_aa XHYDCRLEHELWFKPSVVRKTDHFSPRTPDELCFEKGDIIYVTDMNDANWWKGTSKGRTG LTPSNYVAEQAESIDNLLHEVAKRGNLSWLRECLDNRVGVNGLDKAGSTALD >gi568815586r:114571844_114783200|GENSCAN_predicted_CDS_1|339_bp nnccattatgactgccggttagaacatgaattatggtttaagcctagtgttgtgaggaag acagatcatttttccccaagaactccagatgaactatgctttgagaaaggtgatatcatc tacgttactgacatgaatgatgcaaattggtggaaaggcacctccaaaggcaggactgga ctaactccaagcaactatgtggccgagcaggcagaatccattgacaatctattgcatgaa gtagcaaaaagaggcaacttgagttggttgcgagagtgtttggacaaccgagtgggtgtt aatggcttagacaaggctggaagcactgccttagactag >gi568815586r:114571844_114783200|GENSCAN_predicted_peptide_2|95_aa MWESLELPRHLLNGFDQNANKDVDNEVQAELVSDGDEELVGNWSKVDSGLLYWIQYENKE TWVQLYYLPGHSARESLVTLWKVNCWSHYNNYTHG >gi568815586r:114571844_114783200|GENSCAN_predicted_CDS_2|288_bp atgtgggaaagtttggaacttcctagacacttgttgaatggttttgaccaaaatgctaat aaggatgtggacaatgaagtccaggctgagctggtctcagatggagatgaggaacttgtt gggaactggagcaaagtggatagtggcctcctttattggattcagtatgagaacaaggag acctgggttcagctctattacttaccaggacactcagccagagagtcactggtcactctg tggaaggttaattgctggtcccactacaataattacacccacggctag >gi568815586r:114571844_114783200|GENSCAN_predicted_peptide_3|885_aa MAYHPFLPHRAPDFAMSAVLGHQPPFFPALTLPPNGAAALSLPGALAKPIMDQLVGAAET GIPFSSLGPQAHLRPLKTMEPEEEVEDDPKVHLEAKELWDQFHKRGTEMVITKSGRRMFP PFKVRCSGLDKKAKYILLMDIIAADDCRYKFHNSRWMVAGKADPEMPKRMYIHPDSPATG EQWMSKVVTFHKLKLTNNISDKHGFTILNSMHKYQPRFHIVRANDILKLPYSTFRTYLFP ETEFIAVTAYQNDKITQLKIDNNPFAKGFRDTGNGRREKRKQLTLQSMRVFDERHKKENG TSDESSSEQAAFNCFAQASSPAASTVGTSNLKDLCPSEGESDAEAESKEEHGPEACDAAK ISTTTSEEPCRDKGSPAVKAHLFAAERPRDSGRLDKASPDSRHSPATISSSTRGLGAEER RSPVREGTAPAKVEEARALPGKEAFAPLTVQTDAAAAHLAQGPLPGLGFAPGLAGQQFFN GHPLFLHPSQFAMGGAFSSMAAAGMGPLLATVSGASTGVSGLDSTAMASAAAAQGLSGAS AATLPFHLQQHVLASQGLAMSPFGSLFPYPYTYMAAAAAASSAAASSSVHRHPFLNLNTM RPRLRYSPYSIPVPVPDGSSLLTTALPSMAAAAGPLDGKVAALAASPASVAVDSGSELNS RSSTLSSSSMSLSPKLCAEKEAATSELQSIQRLPADRQPPSGRYERLATLHDSALSASLV SNVTSRQPAKKVAVGVQLRGRQRRPGFFCLDPALHGRVAGENNPLFLELTSVHGSGKEVP ENGGWGADPADSRETTKAGIAMSGPSNKGLVVLPFSEGHFVALAKPSLLWLDWEKALKPH AQVLHSKDNWVTECPGEDLEKPSGQFRGACAALSSELTQINVEPP >gi568815586r:114571844_114783200|GENSCAN_predicted_CDS_3|2658_bp atggcctaccatccgttcctacctcaccgggcgccggacttcgccatgagcgcggtgctg ggtcaccagccgccgttcttccccgcgctgacgctgcctcccaacggcgcggcggcgctc tcgctgccgggcgccctggccaagccgatcatggatcaattggtgggggcggccgagacc ggcatcccgttctcctccctggggccccaggcgcatctgaggcctttgaagaccatggag cccgaagaagaggtggaggacgaccccaaggtgcacctggaggctaaagaactttgggat cagtttcacaagcggggcaccgagatggtcattaccaagtcgggaaggcgaatgtttcct ccatttaaagtgagatgttctgggctggataaaaaagccaaatacattttattgatggac attatagctgctgatgactgtcgttataaatttcacaattctcggtggatggtggctggt aaggccgaccccgaaatgccaaagaggatgtacattcacccggacagccccgctactggg gaacagtggatgtccaaagtcgtcactttccacaaactgaaactcaccaacaacatttca gacaaacatggatttactatattgaactccatgcacaaataccagccccggttccacatt gtaagagccaatgacatcttgaaactcccttatagtacatttcggacatacttgttcccc gaaactgaattcatcgctgtgactgcataccagaatgataagataacccagttaaaaata gacaacaacccttttgcaaaaggtttccgggacactggaaatggccgaagagaaaaaaga aaacagctcaccctgcagtccatgagggtgtttgatgaaagacacaaaaaggagaatggg acctctgatgagtcctccagtgaacaagcagctttcaactgcttcgcccaggcttcttct ccagccgcctccactgtagggacatcgaacctcaaagatttatgtcccagcgagggtgag agcgacgccgaggccgagagcaaagaggagcatggccccgaggcctgcgacgcggccaag atctccaccaccacgtcggaggagccctgccgtgacaagggcagccccgcggtcaaggct caccttttcgctgctgagcggccccgggacagcgggcggctggacaaagcgtcgcccgac tcacgccatagccccgccaccatctcgtccagcactcgcggcctgggcgcggaggagcgc aggagcccggttcgcgagggcacagcgccggccaaggtggaagaggcgcgcgcgctcccg ggcaaggaggccttcgcgccgctcacggtgcagacggacgcggccgccgcgcacctggcc cagggccccctgcctggcctcggcttcgccccgggcctggcgggccaacagttcttcaac gggcacccgctcttcctgcaccccagccagtttgccatggggggcgccttctccagcatg gcggccgctggcatgggtcccctcctggccacggtttctggggcctccaccggtgtctcg ggcctggattccacggccatggcctctgccgctgcggcgcagggactgtccggggcgtcc gcggccaccctgcccttccacctccagcagcacgtcctggcctctcagggcctggccatg tcccctttcggaagcctgttcccttacccctacacgtacatggccgcagcggcggccgcc tcctctgcggcagcctccagctcggtgcaccgccaccccttcctcaatctgaacaccatg cgcccgcggctgcgctacagcccctactccatcccggtgccggtcccggacggcagcagt ctgctcaccaccgccctgccctccatggcggcggccgcggggcccctggacggcaaagtc gccgccctggccgccagcccggcctcggtggcagtggactcgggctctgaactcaacagc cgctcctccacgctctcctccagctccatgtccttgtcgcccaaactctgcgcggagaaa gaggcggccaccagcgaactgcagagcatccagcggttgccagcagacagacagccgcct tcaggccgctacgagaggcttgcaacgcttcacgacagcgcgctttctgcctcgctggtt tccaatgtgacctcacggcaaccagccaagaaagtggctgtcggggtgcagctgcgaggc cgccagcggcgacctgggttcttctgcttggaccctgcgctccacggccgggtggctggg gagaacaatccactctttctggaactgacatcagtccatgggagtgggaaggaagtgccc gagaatgggggctggggggcagatcctgctgactccagggagacgacaaaagcaggcatt gcaatgtcagggccctccaataagggccttgttgtccttcccttttccgagggtcatttt gtggccctggctaaaccctccctgctttggctagactgggagaaggccctgaagccccat gcacaagtcctgcacagcaaggacaactgggtcacagagtgcccgggggaggaccttgag aagcccagtggccaatttagaggagcctgtgcggctctgagctcggagctgactcagatc aacgtggagcctccctga >gi568815586r:114571844_114783200|GENSCAN_predicted_peptide_4|216_aa MEDLDKASTARDWVLSLSPPSRCPHLEARLGAQHRASLSTEPEKQLTASDSTLTQGWKGV TGLGSCPKRCTGPQESKEVSDRLSFPSSRSRQGPDRPGNRRTACSSVLDFVRLIDEGDKL AHGEVNTIDADLDETEESRRTSRFSNGLNKTQTMPDVQRTNDLASAYFSGLSFVPARDFP LGTSSPPPPALLRFELWATLPTPSGFQQEKRFGGGG >gi568815586r:114571844_114783200|GENSCAN_predicted_CDS_4|651_bp atggaggatctggacaaagcgagcacagcgagagactgggttctgagcctgtcccctcca agccgctgccctcacctcgaggcccggctgggtgcacagcacagggcctcgctgagcacg gaaccggagaagcagctcacggcctcggattctactctcacccaagggtggaagggggtg acagggctcggatcttgccccaaacgctgcacaggcccgcaggagagcaaagaggtctcg gacaggctgtccttccccagctctcggtcgcgacaaggtcccgacaggcccgggaaccgg cgaaccgcctgcagttctgtcctagacttcgttcggctgatcgatgaaggagacaagctg gcccacggggaggtcaatacaatcgatgcggacctcgacgaaacggaagaatctcgcaga acttcaaggttttccaacggtttgaataaaacccaaacgatgcccgatgtccaacgaacg aatgatctagcctcggcctacttctcaggacttagctttgtccctgcccgcgacttcccg ctgggaacgtcgtccccgccaccaccggctctgcttaggttcgaactctgggccactctt cctactccctctggtttccagcaggagaagcgcttcggaggagggggctag >gi568815586r:114571844_114783200|GENSCAN_predicted_peptide_5|149_aa MKKKEKEEEEEKKEKEKEEKASTPSEDADVINVTCDHHVSRQDPPKPGRQRNKLDPKALT CKAFGLGDPEPTPTLSSTPPRKRRPHTNPSDGETIYFSCGSTCSLSLGRDQEALWVPDKE RVLLQRYSEPPTAITLQKPGSLDFIELQD >gi568815586r:114571844_114783200|GENSCAN_predicted_CDS_5|450_bp atgaagaagaaggagaaggaggaggaggaggagaagaaggagaaggagaaggaggagaag gcaagcacgccctcggaagatgcagacgtgattaatgtcacctgtgaccaccatgtctcc aggcaagacccccccaaacccggcagacagagaaacaaattagatccaaaggcattaaca tgcaaagcttttgggcttggagaccctgagccgacacccaccttgagttccacaccccca agaaaaaggaggccccacacaaatcccagtgatggggagacaatctacttctcttgtgga agcacatgtagcctatccctgggccgggatcaagaggccctttgggtgcctgacaaagaa agggtgcttcttcaaagatacagtgaacctccaactgccatcaccctccagaagcctggg agcctcgatttcatagaacttcaggattga