GENSCAN 1.0 Date run: 24-Oct-119 Time: 21:28:45 Sequence gi568815596r:70563498_70806408 : 242911 bp : 43.78% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 5512 5564 53 2 2 35 109 59 0.055 1.33 1.02 Intr + 18430 18605 176 0 2 91 70 31 0.018 0.34 1.03 Term + 30040 30145 106 1 1 81 34 84 0.042 0.18 1.04 PlyA + 33684 33689 6 1.05 2.05 PlyA - 34505 34500 6 1.05 2.04 Term - 45458 45321 138 2 0 1 33 168 0.077 0.56 2.03 Intr - 46345 46194 152 2 2 45 72 56 0.012 -0.42 2.02 Intr - 54741 54654 88 0 1 91 90 28 0.036 2.84 2.01 Init - 65113 64985 129 1 0 78 89 105 0.943 9.75 2.00 Prom - 67634 67595 40 -7.06 3.05 PlyA - 72685 72680 6 1.05 3.04 Term - 72737 72706 32 0 2 93 52 50 0.087 -0.08 3.03 Intr - 77627 77536 92 1 2 74 84 77 0.136 5.54 3.02 Intr - 84533 84426 108 1 0 116 89 43 0.061 6.60 3.01 Init - 89243 89230 14 2 2 72 63 8 0.020 -3.56 3.00 Prom - 90193 90154 40 -4.66 4.28 PlyA - 91669 91664 6 1.05 4.27 Term - 100238 99928 311 0 2 72 37 410 0.786 29.52 4.26 Intr - 109509 109381 129 1 0 105 96 -4 0.838 2.77 4.25 Intr - 109829 109740 90 0 0 67 75 89 0.919 5.47 4.24 Intr - 111328 111181 148 1 1 44 54 204 0.655 12.41 4.23 Intr - 113388 113299 90 0 0 125 37 80 0.964 6.79 4.22 Intr - 114380 114261 120 2 0 90 115 161 0.587 19.69 4.21 Intr - 114908 114840 69 2 0 84 64 45 0.678 1.08 4.20 Intr - 115464 115207 258 0 0 105 103 451 0.999 45.96 4.19 Intr - 120066 120016 51 0 0 104 80 4 0.522 0.30 4.18 Intr - 120270 120094 177 0 0 86 105 206 0.979 22.22 4.17 Intr - 124625 124527 99 2 0 113 99 42 0.994 8.11 4.16 Intr - 127432 127289 144 1 0 72 77 186 0.997 16.38 4.15 Intr - 129055 128906 150 1 0 95 101 175 0.959 19.86 4.14 Intr - 132304 132224 81 1 0 104 105 68 0.999 9.93 4.13 Intr - 132899 132748 152 0 2 71 101 244 0.977 23.88 4.12 Intr - 133236 133192 45 1 0 81 103 10 0.529 0.18 4.11 Intr - 134295 134152 144 1 0 83 43 131 0.593 8.35 4.10 Intr - 140962 140824 139 1 1 1 27 196 0.984 4.84 4.09 Intr - 142863 142729 135 0 0 101 86 317 0.389 33.56 4.08 Intr - 148538 148396 143 0 2 51 41 123 0.044 3.97 4.07 Intr - 187770 187613 158 2 2 86 100 107 0.339 11.35 4.06 Intr - 203972 203833 140 2 2 31 68 92 0.007 0.76 4.05 Intr - 209871 209802 70 2 1 76 74 28 0.003 -0.62 4.04 Intr - 214174 214140 35 1 2 98 96 20 0.013 0.82 4.03 Intr - 215916 215881 36 0 0 92 98 33 0.026 3.16 4.02 Intr - 224304 224152 153 0 0 121 98 6 0.784 5.17 4.01 Init - 227141 226911 231 2 0 85 76 481 0.979 42.86 4.00 Prom - 230198 230159 40 -4.06 5.00 Prom + 233830 233869 40 -7.46 5.01 Init + 239304 239306 3 2 0 108 81 0 0.744 1.30 5.02 Term + 239627 239929 303 1 0 10 45 344 0.764 17.47 5.03 PlyA + 240720 240725 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 217431 217512 82 2 1 90 86 113 0.933 12.43 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:70563498_70806408|GENSCAN_predicted_peptide_1|111_aa XITKDLAISLFLMPDTSQVIRFPSGDPCSLRSNVSSADHSCQLTYSSSSNPFLCFFLINL HDEEATTGKMCGHEGNLVKHESVSVNPCTGYPSQGALAGKKEKPSANLNRR >gi568815596r:70563498_70806408|GENSCAN_predicted_CDS_1|336_bp natatcactaaagatctggccatcagcctcttcctgatgccagacacaagtcaggtaatt cgtttcccatcaggtgacccttgcagcctacgctcaaatgtgtcttctgcagatcatagt tgccagttgacctactcgtcttcctcaaacccttttttgtgcttctttcttataaacctg catgatgaagaagctacaacaggaaaaatgtgtggacatgaagggaacttggtcaagcat gaatcagtatctgtcaacccatgtaccggatatcctagtcagggggctcttgctggcaaa aaagagaagccttccgccaatctgaatagaagatga >gi568815596r:70563498_70806408|GENSCAN_predicted_peptide_2|168_aa MEYYAAIKRNQIMSFAGTWMELEAIILSKLTQQQKTKYHMFSLVYYIAQAYICVFYFPIM VMLDFIAIYRNGETGLRGIMATGRGSTMACSSPAHPRANGLEEATATLKTSKNILEASSK ANWQREVFAQGYKQIAGDVPLWYPKEMYLETVCQGDQHQALGTGKEIQ >gi568815596r:70563498_70806408|GENSCAN_predicted_CDS_2|507_bp atggaatactacgcagccataaaaaggaaccagatcatgtcctttgcagggacgtggatg gagctggaagccattatccttagcaagctaacgcagcaacagaaaaccaaataccacatg ttctcacttgtatactacatcgcacaggcctatatttgtgtattttactttcccataatg gttatgttggactttattgccatttatagaaatggagaaacagggcttcggggaatcatg gccacaggaaggggcagtaccatggcctgctcatcccctgcccacccaagagcaaatggc ttggaagaagccacagcaacacttaaaacttccaaaaatatactggaagcttctagcaag gccaactggcagcgtgaagtctttgctcagggttacaagcagattgcaggggacgtgcca ctctggtaccccaaagagatgtacctggagaccgtgtgccaaggggaccagcaccaggcc ctggggactgggaaggagattcagtag >gi568815596r:70563498_70806408|GENSCAN_predicted_peptide_3|81_aa MAQERNRSKQDAFLFLTGSPQQYHHKPWSLLVTSKGMKDKCHRWEPQGKEDAEVWEAEED SGPDIASSQRQVLEVPNAAQF >gi568815596r:70563498_70806408|GENSCAN_predicted_CDS_3|246_bp atggcacaggaaagaaaccggtccaagcaggatgccttcctttttctgacaggcagcccc cagcagtaccatcacaaaccttggtctcttcttgtcaccagcaaagggatgaaggataag tgccatcgctgggagccccagggtaaggaggatgctgaggtatgggaagcagaagaggac agtggtcctgacattgcatcttctcagaggcaggttctggaggtgcccaatgctgcccag ttctga >gi568815596r:70563498_70806408|GENSCAN_predicted_peptide_4|1165_aa MDPAPGVLDPRAAPPALLGTPQAEVLEDVLREQFGPLPQLAAVCRLKRLPSGGYSSTENL QLVLERRRVANAKERERIKNLNRGFARLKALVPFLPQSRKPSKVDILKGATEYIQVLSDL LEGAKDSKVVALYDGKTSTTDRFPEVELLSHRTFQGIGSGIISFECGIVMELGLGRCEAG LAGLGASPGASRSLAARRSAAVAGRAGAASAAAYIEIGATGTFLSNKRQLLYTETFPLEF QSVKVEGIKQIFQSNLFIIEIRHMKLRKDKALIQVGCPTIQFNSDMNYPKLKGTQANYYK IKDKVTNMTIPTSGTSHNWGPQGQPYFDRFSEDDPEYMRLRNRAADLRQDFNLMEQKKRV TMILQSPSFREELEGLIQEQMKKGNNSSNIWALRQIADFMASTSHAVFPTSSMNHIITST NGTLDAGHTSSALAFVDHVITSTNGTLDAGYTSYCSKFHRGEVYSCFDSVPIFRVYDVSM MTPINDLHTADSLNLAKGERLMRCKISSVYRLLDLYGWAQLSDTYVTLRVSKEQDHFLIS PKGVSCSEVTASSLIKVNILGEVVEKGSSCFPVDTTGFCLHSAIYAARPDVRCIIHLHTP ATAAVSAMKWGLLPVSHNALLVGDMAYYDFNGEMEQEADRINLQKCLGPTCKILVLRNHG VVALGDTVEEAFYKIFHLQAACEIQVSALSSAGGVENLILLEQEKHRPHEVGSVQWAGST FGPMQKSRLGEHEFEALMRMLDNLASPLRLSPLEPGSTRHQGYRTGYTYRHPFVQEKTKH KSEVEIPATVTAFVFEEDGAPVPALRQHAQKQQKEKTRWLNTPNTYLRVNVADEVQRSMG SPRPKTTGALGKFKGGAFNIGTSDLMQAQEWMKADEVEKSSSGMPIRIENPNQFVPLYTD PQEVLEMRNKIREQNRQDVKSAGPQSQLLASVIAEKSRSPSTESQLMSKGDEDTKDDSEE TVPNPFSQLTDQELEEYKKEVERKKLELDETGQEREPGSGPAVCEFFSVALHIWSNILVG EKETAPEEPGSPAKSAPASPVQSPAKEAETKSPLVSPSKSLEEGTKKTETSKAATTEPET TQPEGVVVNGREEEQTAEEILSKGLSQMTTSADTDVDTSKDKTESVTSGPMSPEGSPSKS PSKKKKKFRTPSFLKKSKKKEKVES >gi568815596r:70563498_70806408|GENSCAN_predicted_CDS_4|3498_bp atggaccccgcgcccggcgtcctagatccccgcgccgcgccgcccgcgctcctgggcacc ccgcaagccgaggtgctggaggacgtgttgcgggagcagttcgggccgctgccccagctg gccgctgtctgccggctcaagcggctgccctcgggcggctactcgtccactgaaaacctc cagttggtgctggagcggcggcgtgtggccaacgccaaggagcgtgagcggataaaaaat ctcaaccgtggttttgccagattgaaggcacttgtgccatttcttccccaaagcaggaag cccagcaaagttgatatccttaaaggtgcgactgaatatatacaggttctcagtgatctt ttggaaggagccaaagactcaaaggtagtggctctatacgatggtaaaacttctaccaca gatagattcccagaagtagaactgctgagtcacagaacttttcaaggcattggcagtggg attatttcttttgaatgtggcattgtaatggaactgggtctcgggcggtgcgaggcgggc ttggcggggctgggcgcgtcccccggggcctcgcgatcgctggctgcgcggcgctcagcc gcagtggccggccgagcaggtgcggcgtcggcagccgcctacattgagatcggggcaacc ggcacgtttctgagtaacaagcgtcaactgttatatactgaaacattccctctggaattc cagagtgtcaaagtggaagggatcaaacagatcttccagtccaacctcttcattatagaa atcagacacatgaagctgagaaaagataaggcactgatccaagttggctgtcctacaatt caattcaattctgacatgaactaccctaagttgaagggcacacaagctaactactacaag ataaaggacaaggtcaccaatatgaccatccccacctcaggcaccagccacaactggggt ccccaggggcagccttactttgaccgcttctcagaggacgaccccgagtacatgcgcctt cgcaaccgggcggcggacctgcggcaggacttcaacctgatggagcagaagaagcgcgtc accatgatcctgcagagtccctctttcagggaggagctggaaggcctcatccaggagcag atgaagaaggggaacaactcctccaacatctgggccctgcgacagatcgcggacttcatg gccagcacctcccacgcagtcttcccgacatcttccatgaaccacatcatcacttccact aatggcacactcgatgcaggacatacatcgtccgccctcgcttttgtagaccacgtcatc acttccactaatggcacactcgatgcaggatatacatcgtactgcagcaagtttcacaga ggtgaggtttattcttgctttgattcagtgcccatcttcagagtttatgatgtctccatg atgacgcctatcaatgacctccacacagctgactccctgaacctggccaaaggggagcgg ctcatgcggtgcaagatcagcagtgtctaccgactcctggacctctatggctgggcccag ctgagtgacacctatgtcacgttgagagtcagcaaggagcaggaccacttcctgatcagc cctaagggagtttcttgcagtgaagtcacagcgtccagcctgatcaaggtgaacattctg ggagaggtggtggagaagggcagcagctgcttcccagtggacaccacaggcttctgtctg cactcggccatctatgcagcgaggcccgacgtgcgctgcatcatccacctgcacacaccg gccacagcagcggtgtcggccatgaagtggggcctcctgcctgtctcccacaatgccctg ctggtgggggacatggcctattatgacttcaatggggaaatggagcaggaagccgatcgg atcaacctgcagaagtgccttggacccacctgcaagatcctggtgctaagaaaccatgga gtggttgctctgggtgacacggtagaggaggcattttacaagatcttccacctgcaggct gcatgtgagatacaggtgtcggctctgtccagtgccgggggagtggagaacctcatcctc ctggagcaggagaagcaccggccccatgaggtgggctccgtgcagtgggccgggagcacc tttgggcctatgcagaagagtcggctgggggagcatgagtttgaggccctcatgaggatg ctggacaacctggccagtccattgaggcttagccccctggaaccaggaagtacaaggcat cagggctacagaacaggttacacgtatcgccacccctttgttcaagagaaaaccaaacac aaaagtgaggtggagattccagccacggtcacagccttcgtgtttgaggaggacggtgcc ccggtgcccgccctgcgacagcatgcccagaagcagcagaaggagaagacccgctggctc aatacgcccaacacctacctgcgggtcaatgtggccgatgaggtccagaggagcatgggc agcccccgacccaagaccacgggggctcttgggaaattcaagggaggagcatttaacatt gggacttcagatctgatgcaagcccaggagtggatgaaggctgacgaggtggagaaatcc agcagtggcatgccgattcgcatcgaaaacccaaaccaatttgtgcctctctatactgac ccccaggaagtactggagatgaggaacaagattcgagaacaaaaccgacaagatgtgaag tcagcggggcctcagtcccagctcctggcgagcgtcattgccgagaagagccgaagcccg tctacagagagccagctgatgtccaagggagacgaggataccaaagacgattcagaggag acggtgcccaaccccttcagccaactcactgaccaggagttggaggagtacaagaaagag gtggagaggaagaaactagaacttgatgagacaggacaggaacgagagccaggctctggt ccggccgtgtgcgagttcttcagcgttgccctccacatctggagtaacatattggtagga gagaaagaaactgccccagaagagcctggctcacctgcaaagtctgcacctgcttctcca gtgcagagcccagcgaaggaggcagagacaaagagccctttagtctctccttccaagtct ttagaggaaggtactaagaagacagaaacaagcaaagccgccaccacagagcccgaaaca acccagccggaaggggtggtggtcaacgggagggaggaggagcagacggcagaggaaatc ctcagcaaaggcctgagccagatgaccaccagtgctgacacggatgttgatacctctaag gacaaaaccgagtcggtcaccagcggccccatgtccccagagggctcaccttccaagtct ccctcaaagaagaaaaagaaattccgaaccccctccttcctgaaaaagagcaaaaagaag gagaaagtggagtcctga >gi568815596r:70563498_70806408|GENSCAN_predicted_peptide_5|101_aa MHLSPTAVTAATLPKRKTEGDVKGDKAKVDTPQRRSTRLSAKPAPPKPEPKPKKAPAKKG EKVPKGKKGKADAGKEGNYPAEKGDAITDQAREAEGDGEAK >gi568815596r:70563498_70806408|GENSCAN_predicted_CDS_5|306_bp atgcatctaagtcccactgctgtcaccgccgccaccttgcccaagaggaagaccgaaggg gatgttaaaggagataaagccaaggtggacacacctcagagaagatccacaaggttgtct gctaaacctgctcctccaaagccagagcccaagcctaaaaaggcccctgcaaagaaggga gagaaggtacctaaagggaaaaagggaaaagctgatgctggcaaggaggggaattaccct gcagaaaaaggagatgccataacagaccaggccagggaagctgaaggtgatggagaggcc aagtga