GENSCAN 1.0 Date run: 5-Nov-116 Time: 21:28:42

Sequence gi568815592r:10298420_10510336 : 211917 bp : 44.99% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.05 Intr -  19605  19456  150  1  0  103   84    62 0.184   7.43
 1.04 Intr -  23719  23640   80  0  2   63   86    27 0.031  -0.81
 1.03 Intr -  31589  31501   89  2  2   48   75    71 0.097   0.57
 1.02 Intr -  32155  32072   84  1  0   66  109    41 0.419   4.02
 1.01 Init -  40156  40136   21  1  0   60  101    70 0.194   3.40
 1.00 Prom -  48498  48459   40                              -3.36

 2.02 PlyA -  48630  48625    6                               1.05
 2.01 Sngl -  67450  67103  348  1  0   58   43   246 0.783  13.14
 2.00 Prom -  82834  82795   40                              -2.36

 3.10 PlyA -  84761  84756    6                               1.05
 3.09 Term -  89746  89587  160  0  1   58   48    85 0.099  -1.09
 3.08 Intr - 100286 100002  285  1  0   89   59   527 0.219  46.36
 3.07 Intr - 102170 102029  142  0  1   82   78    93 0.987   7.11
 3.06 Intr - 104191 104073  119  0  2   60   98    27 0.963   1.01
 3.05 Intr - 106320 106089  232  1  1  101   78   303 0.999  27.43
 3.04 Intr - 108425 108374   52  2  1   75  121    26 0.643   3.08
 3.03 Intr - 111916 111482  435  1  0  127  111   446 0.999  44.58
 3.02 Intr - 116650 116522  129  1  0   80   99    69 0.031   8.09
 3.01 Init - 120118 120029   90  1  0   93   76    75 0.028   7.29
 3.00 Prom - 139217 139178   40                              -3.26

 4.00 Prom + 139844 139883   40                              -1.26
 4.01 Init + 141068 141127   60  1  0   63   81    15 0.316  -0.45
 4.02 Intr + 148164 148215   52  2  1   92  100    32 0.492   3.28
 4.03 Intr + 148562 148703  142  0  1   41   56    50 0.426  -3.49
 4.04 Intr + 152479 152561   83  1  2  107   91    63 0.793   7.78
 4.05 Term + 153484 153497   14  2  2  105   53     2 0.293  -3.24
 4.06 PlyA + 154194 154199    6                               1.05

 5.06 PlyA - 154248 154243    6                               1.05
 5.05 Term - 162146 161487  660  2  0   37   37   468 0.018  30.51
 5.04 Intr - 167910 167808  103  2  1   75   64    60 0.009   2.48
 5.03 Intr - 173616 173454  163  1  1   71   53    78 0.799   1.63
 5.02 Intr - 176174 176081   94  2  1  104   98    63 0.925   8.44
 5.01 Init - 177988 177950   39  1  0   32  119     6 0.239  -1.69
 5.00 Prom - 180038 179999   40                              -4.26

 6.00 Prom + 180616 180655   40                              -6.16
 6.01 Init + 181071 181208  138  2  0   66   83     6 0.086  -1.96
 6.02 Intr + 184430 184640  211  1  1   35   52   164 0.351   5.99
 6.03 Intr + 193330 193397   68  2  2   50  111    54 0.853   2.52
 6.04 Term + 195918 196313  396  2  0   78   36   151 0.756   3.98
 6.05 PlyA + 197545 197550    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl - 162137 161487  651  2  0   86   37   445 0.912  35.38
S.002 Term - 172311 172182  130  2  1   85   29   108 0.865   2.25

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592r:10298420_10510336|GENSCAN_predicted_peptide_1|142_aa
MKPLTLALSHYGEQCISECSLGALKGNLDSEALKQYVRDIYDHHSHRTGEEMGDMEGVNM
LNCRATSEISPEGLFPDQVPTCGKEARSRFLGHMAMRGKQAGTKVISTACDVCYDTGSIG
FDGSAEKTHLTGVRNRCLGMGX

>gi568815592r:10298420_10510336|GENSCAN_predicted_CDS_1|426_bp
atgaagccgctgaccctcgcgctttcccattatggggagcagtgcatttctgaatgttct
ctgggggctcttaaagggaacttagatagtgaagccttaaaacagtatgtcagggacatc
tatgaccaccattcccatcgtacaggtgaagaaatgggggacatggaaggtgtcaatatg
ctcaactgtagagcgacaagtgagatttctccagaaggcctcttcccagatcaggttccc
acttgtggaaaagaagctagaagcagattcctggggcacatggccatgcgagggaagcag
gcaggcacaaaagtaatttcaacagcctgtgacgtgtgctatgacacaggcagcataggc
tttgacgggagtgcagaaaagacgcatctcacaggtgtcaggaacaggtgtctggggatg
ggggnn

>gi568815592r:10298420_10510336|GENSCAN_predicted_peptide_2|115_aa
MLHIVEPYVTRGFPNVKFVRDLILKCRQATVKNKAIPLTDNTVIEEYLGKFGVICLEELI
HEIALSGKYFQEILWFLHPFHLLVACPLPRIEWASSGRWAYLAIRVNASISSSTS

>gi568815592r:10298420_10510336|GENSCAN_predicted_CDS_2|348_bp
atgctgcatatagtggaaccttatgtgacccggggatttccaaatgtgaagtttgtccgg
gatctcattctgaaatgtcgacaagccacagtcaagaataaggccatccctctgacagac
aacacagtgattgaggagtacctggggaaatttggtgtcatttgcttggaagagctcatt
catgaaattgccttgtccgggaagtatttccaggagatcttatggttcttgcaccctttc
cacctcttggtggcctgtccactgccaagaatagaatgggcttcctcagggagatgggct
tacctggctatcagggtgaatgcatcaatcagctcatccaccagctga

>gi568815592r:10298420_10510336|GENSCAN_predicted_peptide_3|547_aa
MVAVKGSACRRFHHQNREALFSPQPRGLVRPALCAHPESSSTWVRDREGHIRSRRSMKML
WKLTDNIKYEDCEDRHDGTSNGTARLPQLGTVGQSPYTSAPPLSHTPNADFQPPYFPPPY
QPIYPQSQDPYSHVNDPYSLNPLHAQPQPQHPGWPGQRQSQESGLLHTHRGLPHQLSGLD
PRRDYRRHEDLLHGPHALSSGLGDLSIHSLPHAIEEVPHVEDPGINIPDQTVIKKGPVSL
SKSNSNAVSAIPINKDNLFGGVVNPNEVFCSVPGRLSLLSSTSKYKVTVAEVQRRLSPPE
CLNASLLGGVLRRAKSKNGGRSLREKLDKIGLNLPAGRRKAANVTLLTSLVEGEAVHLAR
DFGYVCETEFPAKAVAEFLNRQHSDPNEQVTRKNMLLATKQICKEFTDLLAQDRSPLGNS
RPNPILEPGIQSCLTHFNLISHGFGSPAVCAAVTALQNYLTEALKAMDKMYLSNNPNSHT
DNNAKSSDKEEKHRNQLNGSTQDNGSKKGNYFHGISGSMGVLSLNLEDVASSQNDLGDSH
EVQRLGH

>gi568815592r:10298420_10510336|GENSCAN_predicted_CDS_3|1644_bp
atggtggcggtaaaaggctctgcctgtcgccgatttcatcatcagaaccgcgaggccctg
ttttcgccccaaccaagaggcctggtgaggccagcactttgcgctcacccagagagtagc
tccacttgggtgcgagaccgagaggggcatatccgttcacgccgatccatgaaaatgctt
tggaaattgacggataatatcaagtacgaggactgcgaggaccgtcacgacggcaccagc
aacgggacggcacggttgccccagctgggcactgtaggtcaatctccctacacgagcgcc
ccgccgctgtcccacacccccaatgccgacttccagcccccatacttccccccaccctac
cagcctatctacccccagtcgcaagatccttactcccacgtcaacgacccctacagcctg
aaccccctgcacgcccagccgcagccgcagcacccaggctggcccggccagaggcagagc
caggagtctgggctcctgcacacgcaccgggggctgcctcaccagctgtcgggcctggat
cctcgcagggactacaggcggcacgaggacctcctgcacggcccacacgcgctcagctca
ggactcggagacctctcgatccactccttacctcacgccatcgaggaggtcccgcatgta
gaagacccgggtattaacatcccagatcaaactgtaattaagaaaggccccgtgtccctg
tccaagtccaacagcaatgccgtctccgccatccctattaacaaggacaacctcttcggc
ggcgtggtgaaccccaacgaagtcttctgttcagttccgggtcgcctctcgctcctcagc
tccacctcgaagtacaaggtcacggtggcggaagtgcagcggcggctctcaccacccgag
tgtctcaacgcgtcgctgctgggcggagtgctccggagggcgaagtctaaaaatggagga
agatctttaagagaaaaactggacaaaataggattaaatctgcctgcagggagacgtaaa
gctgccaacgttaccctgctcacatcactagtagagggagaagctgtccacctagccagg
gactttgggtacgtgtgcgaaaccgaatttcctgccaaagcagtagctgaatttctcaac
cgacaacattccgatcccaatgagcaagtgacaagaaaaaacatgctcctggctacaaaa
cagatatgcaaagagttcaccgacctgctggctcaggaccgatctcccctggggaactca
cggcccaaccccatcctggagcccggcatccagagctgcttgacccacttcaacctcatc
tcccacggcttcggcagccccgcggtgtgtgccgcggtcacggccctgcagaactatctc
accgaggccctcaaggccatggacaaaatgtacctcagcaacaaccccaacagccacacg
gacaacaacgccaaaagcagtgacaaagaggagaagcacagaaaccagttgaatggaagc
acccaagataatggttccaaaaagggtaactatttccatggaatatcagggagcatgggc
gtgttgagtcttaaccttgaggatgttgcttcaagtcagaatgatcttggagattcccat
gaagttcagagactgggacactga

>gi568815592r:10298420_10510336|GENSCAN_predicted_peptide_4|116_aa
MSCVHYDHWEDEFHPSHDYMLDWVIERDYVLGIVLKEVLKAGKSKINTSADSVSDEGLFP
SSQTAIFSLCPHMVEGARELSAVSLTWVPTLNTEVAFAQLVSKTARAYFAYIGLLC

>gi568815592r:10298420_10510336|GENSCAN_predicted_CDS_4|351_bp
atgagctgtgttcattatgaccactgggaagatgagtttcacccttcacatgactatatg
cttgactgggtcattgagagagactatgttctgggcattgtattgaaagaagttctgaag
gctgggaagtccaagatcaatacctcagctgactcagtgtctgatgagggcctgtttccc
agttcacagacagccatcttctccctgtgtcctcacatggtggaaggggcaagggagctc
tctgcagtctctctgacctgggtccctacgctgaacactgaggtggcttttgctcagcta
gtctccaagacagcacgagcctattttgcctatattggtctgctgtgctga

>gi568815592r:10298420_10510336|GENSCAN_predicted_peptide_5|352_aa
MFSLSLPVEHKYLSSNIELVVQWKEYYQDPDAAQGKKKTKGDTHRHQSMMQVWLNRCLHI
GLVLSKPVAPILQGYPAVEGGPGGQDTMWGKQPWGAEPRIEPKPAGKAEAQLWSSQVQCR
RAKGVCAGGREAALEAMQQQRMSRTLEKVLCLRNNTTFKQGFSLLRLRTSAEKPIYSVGG
ILLSTSRPYKTKPTHAIGKYKHLIKAEEPEKKQKVEVKLINLGTDDEYGVLNIHLPAYDM
TLAESYAQYVHNFCNSLSIKVEESYVMPTKTTEVLWLQDQGSKTFLDSVLTTHERVVHIS
SLSATFAEIFLEIIQSSLPEGVRLSLKEHTEEDFKGRFKARPELEERLAKLN

>gi568815592r:10298420_10510336|GENSCAN_predicted_CDS_5|1059_bp
atgttttctctgagccttcctgtggagcataaatatctgagttcaaacattgaattagtt
gtgcaatggaaggagtattaccaggacccagatgcagcccaaggaaaaaagaaaacaaaa
ggagacacacacagacaccagagcatgatgcaagtgtggcttaatcggtgcttgcacatt
gggcttgttctttccaaacctgttgccccgattctgcaaggctatcctgctgtagaagga
ggccctggaggacaagacaccatgtggggaaaacagccatggggagcagaaccgagaata
gaacccaagccagccggcaaggcagaagcgcagctgtggagctcccaggtgcaatgtcgc
agagcaaagggagtgtgcgctgggggccgagaagcagcactagaggccatgcagcagcaa
aggatgagcagaaccttggaaaaagtgctgtgcctgaggaacaataccacttttaagcaa
ggcttttcactcttaaggcttagaacttcagcagagaagcccatctattctgtaggtggc
attctactaagtactagtcggccctacaagacaaagcccacccacgccattggaaagtac
aagcacctaattaaagcagaagagcccgagaagaagcaaaaagtggaagtgaaactcatt
aatttggggacagatgatgaatatggggttttaaacattcacctgcctgcatatgacatg
accctggcagagagttatgcccagtatgttcacaacttctgcaactctctctccattaaa
gtcgaggaaagttatgtgatgccaaccaaaaccactgaagtgttgtggttgcaggaccaa
ggcagcaaaacgttcctggactcagtgcttaccacccatgagcgagtggttcacatcagc
agtttgagtgctacgtttgcagagattttcttggaaataatccaaagcagtcttcctgaa
ggagtcagattgtcgttgaaggagcacactgaagaagacttcaagggaagattcaaagct
cgaccagaactggaagaacggttggccaagttgaactag

>gi568815592r:10298420_10510336|GENSCAN_predicted_peptide_6|270_aa
MLQNLNFLSTNTMPQVEYSTCENPNTNCFITDLKCYIKLPSGYVYKVPSPAPREVAEARR
EFERGESGPAVLGDPAPPLQLLAWVLNPSLPGAGSADGPSQCGTRARWELALARERRKKL
PLGCQSARIGCCKVPQRLPVWAAYPRILALPLCPNRNQRFEKVWQKVFERSPGHFFATFS
GGSIYLTIDVKSKKSSADICVSKAKRLGLAPRDSVVPAGPASSAESVRTGRLWSCCADSD
VEKACSRGSACFHKERKKVASRITVPLAVK

>gi568815592r:10298420_10510336|GENSCAN_predicted_CDS_6|813_bp
atgctccaaaatctgaactttttgagcaccaacacgatgccacaagtagaatattccaca
tgtgagaaccctaatacaaactgtttcatcacagatttaaaatgttatataaaattaccc
tcaggctatgtgtataaggtcccaagccctgccccgcgggaggtggctgaagcccggaga
gaattcgagcgcggcgagagcgggccagcggtgctgggggacccggcgccccctctgcag
ctgctggcctgggtgctaaacccctcactgcctggggccggcagcgctgacggcccctcc
cagtgcggaacccgcgcccgctgggaactcgcactggcccgcgagcgccggaaaaagctc
cctctgggctgccagagtgcccgaattgggtgctgcaaagtgccccagcgcctgccggtc
tgggcagcctaccccagaatcttggctctccctctctgcccgaatcgaaatcagagattt
gaaaaagtctggcaaaaggtttttgaaaggtcaccaggccatttctttgctacctttagt
ggtggtagcatttatcttaccatagacgtcaagtcaaaaaaaagttccgcagacatttgc
gtcagtaaagcaaagagactgggcctggctccgcgggactcagtggtgcccgctggccct
gccagctctgctgagtctgtgaggactggaaggctttggagctgctgcgccgactccgac
gtggaaaaagcatgtagtcggggcagcgcctgtttccacaaggaacgcaaaaaggtggcg
tcaaggattacggtcccccttgcagtaaagtaa