GENSCAN 1.0 Date run: 2-Nov-116 Time: 20:31:13

Sequence gi568815592f:154673987_154933392 : 259406 bp : 40.42% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +   6115   6213   99  0  0  122   45   105 0.943   6.75
 1.02 PlyA +   8289   8294    6                               1.05

 2.00 Prom +   9830   9869   40                              -3.45
 2.01 Init +  15730  15748   19  0  1   72   93     9 0.018   0.09
 2.02 Intr +  33497  34012  516  0  0   -2  102   405 0.303  24.90
 2.03 Term +  35512  35648  137  2  2   33   44    97 0.323  -3.00
 2.04 PlyA +  35760  35765    6                               1.05

 3.00 Prom +  44464  44503   40                              -5.35
 3.01 Init +  59194  59517  324  0  0   55   52   381 0.978  26.58
 3.02 Term +  59671  59940  270  0  0   85   44   186 0.936   8.40
 3.03 PlyA +  65407  65412    6                               1.05

 4.03 PlyA -  65789  65784    6                               1.05
 4.02 Term -  92197  92135   63  1  0   96   39    63 0.210  -0.89
 4.01 Init -  92641  92375  267  1  0   39   35   294 0.789  16.23
 4.00 Prom -  94739  94700   40                              -7.25

 5.00 Prom +  99296  99335   40                              -5.55
 5.01 Init + 100048 100086   39  0  0   87  113    36 0.818   6.24
 5.02 Intr + 104015 104059   45  1  0   95  106    30 0.883   3.19
 5.03 Intr + 113875 114036  162  0  0   80   91   173 0.999  16.05
 5.04 Intr + 118837 118990  154  0  1   88   91    33 0.988   2.32
 5.05 Intr + 121023 121153  131  1  2   49   99    81 0.884   4.79
 5.06 Intr + 127985 128161  177  1  0   95   70   173 0.832  15.39
 5.07 Intr + 129558 129637   80  2  2  104   56    66 0.969   2.43
 5.08 Intr + 131383 131500  118  1  1  106   10    95 0.955   3.05
 5.09 Intr + 134084 134215  132  1  0   49   98    86 0.886   5.62
 5.10 Intr + 134700 134812  113  2  2   60   13   172 0.047   5.16
 5.11 Intr + 136029 136222  194  0  2  -17  103   105 0.022  -0.29
 5.12 Intr + 142315 142358   44  2  2  100   94    18 0.459   0.64
 5.13 Intr + 144493 144606  114  0  0   55  100   134 0.890  11.02
 5.14 Intr + 146191 146347  157  0  1   60   99    91 0.994   6.06
 5.15 Intr + 148290 148423  134  1  2   -6   92   125 0.871   2.94
 5.16 Intr + 150248 150392  145  1  1   35   95    58 0.799   0.13
 5.17 Intr + 153186 153254   69  1  0   93  103    35 0.764   3.74
 5.18 Intr + 156936 157154  219  1  0   73   99   255 0.980  22.45
 5.19 Term + 157953 159409 1457  1  2  110   37  1120 0.999  98.98
 5.20 PlyA + 160052 160057    6                               1.05

 6.14 PlyA - 160265 160260    6                               1.05
 6.13 Term - 198786 198558  229  2  1   67   42   138 0.832   2.22
 6.12 Intr - 201042 200899  144  2  0   57   30   148 0.816   4.38
 6.11 Intr - 203137 202953  185  1  2   -6   81   119 0.060  -0.54
 6.10 Intr - 209214 209026  189  0  0   42   70   112 0.304   3.66
 6.09 Intr - 209769 209639  131  1  2   84   57    38 0.359  -0.21
 6.08 Intr - 215194 215005  190  1  1   54   87   117 0.427   6.44
 6.07 Intr - 216479 216354  126  2  0   72   84   111 0.891   9.06
 6.06 Intr - 226826 226656  171  2  0   77   73    51 0.222   1.72
 6.05 Intr - 244298 244205   94  1  1   13  105    90 0.332   2.25
 6.04 Intr - 246830 246675  156  1  0   71   89   117 0.407   8.30
 6.03 Intr - 247152 246978  175  1  1   96   80    -6 0.384  -2.52
 6.02 Intr - 251244 251100  145  0  1   28  105   106 0.692   5.23
 6.01 Intr - 254632 254536   97  0  1   83   53   111 0.642   6.19

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  13871  13777   95  0  2   85   49    84 0.829   1.21
S.002 Term -  47968  47812  157  0  1   59   50   141 0.975   3.82
S.003 Term + 134700 134831  132  2  0   60   38   188 0.902   8.11
S.004 Init - 203116 202953  164  1  2   26   81   119 0.833   4.25

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592f:154673987_154933392|GENSCAN_predicted_peptide_1|32_aa
IHPESCIALDPRTLSWGLDRDLFPVTIEQQKL

>gi568815592f:154673987_154933392|GENSCAN_predicted_CDS_1|99_bp
attcaccctgaatcttgtattgcattagatccaagaaccctctcttggggtctggatcgg
gacctctttcctgtaacaatagagcagcaaaaattatga

>gi568815592f:154673987_154933392|GENSCAN_predicted_peptide_2|223_aa
MSWFAQACTKEKDLSLKEIQKKLEAAEERRKSEEVQVLKPLAERREYKQEVFEKALENDT
FSRIVEEKLILKVEKIKENEEANLAATMSEGKGGTPRRCAGTRTRRLNSLAEVEEGTACC
TISKPTAHIIMSHAVSVWEMYDIVYKGLINKAELKLKKKNVEPDAVTHTCNPSTLRGQEL
TLELSESSESSISEKSDSLNESLSNNCSRDVNLTPRNAMFSRI

>gi568815592f:154673987_154933392|GENSCAN_predicted_CDS_2|672_bp
atgagctggtttgcccaagcttgtactaaggagaaagacctgtccctgaaggagatccag
aagaaactggaggctgcagaggaaagaaggaagtctgaggaggtccaggtgctgaagcca
ttggcagagaggagggaatacaagcaagaagtctttgagaaggctttggaaaacgacacc
ttcagcagaatcgtggaggaaaagctgatcctgaaagtggaaaaaattaaggaaaatgag
gaggctaatctagctgctactatgtctgaaggaaaaggaggcacgccgcggaggtgcgca
ggaacaaggacgcgcagactcaacagtctggctgaagtggaggagggtacggcatgctgc
accatcagtaaacccactgcccatattataatgagtcacgcagtatcggtatgggaaatg
tatgacatagtttacaaaggactcattaacaaagcagaattaaaattaaaaaaaaaaaat
gtggagccggatgcagtgactcacacctgtaatcccagcactttgcgaggccaagaacta
acattagaattgtctgaatcatcagaatcatctatttcagaaaaatcggattcactaaat
gaatctttgtccaacaactgttccagagatgtgaacctcactcctaggaatgctatgttt
tctaggatttga

>gi568815592f:154673987_154933392|GENSCAN_predicted_peptide_3|197_aa
MPQRSLPRPRGAGRGRGAGGPGAKARAAEPSRAYGRRRSGPSALDGFRAAAGARTVQSPG
GALAGAADPRTARPDSSPPYWMPQPLLPALPPLSSPSGAGSCGPSGGERLCSLERRSPRP
SGHAGAALPLREPFPASACPSTPPRGRSGPLSRQRPLLAATQQPASRSPHPVQWPPPLPP
PGSGPPQRQHGGREDLQ

>gi568815592f:154673987_154933392|GENSCAN_predicted_CDS_3|594_bp
atgccgcagcgctcactcccgcgtccgcggggcgcggggagggggcgaggcgcgggcggc
cccggagccaaggcgagggcagcggagcccagccgggcctacgggcggcgacgctctggc
cccagcgcgctcgatggtttccgcgcggccgccggggctcgcacggtccagtcgccaggg
ggcgcgctagccggcgccgcggatccccgaactgcgcggcccgactcgagtccgccatat
tggatgccgcagccgctgctgccagcgcttcctcctctgtcttcgccgagcggggctggt
tcctgcggcccgagcggcggggagcgcctctgttccctagaacggcgctccccccgccct
agcggccatgccggtgccgctctgccgctgagggagcccttccccgccagcgcgtgccct
tccactccgccccgaggtcgcagcggcccgctctcccgccagcgccccctcctcgcggcc
acgcagcagcccgcgtctcgctctccccacccagtgcagtggccgccgcctcttccgccg
ccgggctcggggcctccgcagcgacaacatggaggccgtgaagaccttcaatag

>gi568815592f:154673987_154933392|GENSCAN_predicted_peptide_4|109_aa
MNLGEIQELMDTSEELTEDNLMKKSASKPVPDGQEAAEEAVPENKSTLDNLAEGLQLSKT
AFDLFLGHKPSYDTGTKIKANGGRRIGAMPTQHEDKGKDFYDDLLLLNE

>gi568815592f:154673987_154933392|GENSCAN_predicted_CDS_4|330_bp
atgaatcttggagaaattcaagagctaatggacacgtcagaggaattaacagaagacaac
ttgatgaagaagagtgcttccaaaccagtgccagacggtcaggaagcagcagaagaagca
gtgccagaaaacaaatctactttagacaatctggcagaagggttacaattatccaagact
gcttttgatttatttttaggacacaaaccctcctatgacacaggcactaaaattaaagcc
aacggtggaagaagaattggtgccatgcctactcaacatgaagacaagggtaaagatttt
tatgatgacctacttctacttaatgaatag

>gi568815592f:154673987_154933392|GENSCAN_predicted_peptide_5|1227_aa
MTQITKAAIKAIKFYKHVVQSVEKFIQKCKPEYKVPGLYVIDSIVRQSRHQFGQEKDVFA
PRFSNNIISTFQNLYRCPGDDKSKIVRVLNLWQKNNVFKSEIIQPLLDMAAGIPPPVVTP
VLASTTTAMSNTPGTPVTPVTPANVVQGLPDPWVSQITNTDTLAAVAQILQSPQGQQLQQ
LIQTLQIQQQKPQPSILQALDAGLVVQLQALTAQLTAAAAAANTLTPLEQGVSFNKKLMD
RFDFGEDSEHSEEPKKEIPASQLSHVSESVNNSIFHQIAEQLQQQNLEHLRQQLLEQQQP
QKATPQDSQEGTFGSEHSASPSQGSSQQHFLEPEVNLDDSIDIQQQDMDIDEGQDGVEEE
VFEQEAKKVAVRSRSRTHSRSRSRSPRKRRSRSRSGSRKRKHRKRSRSRSRERKRKSSRS
YSSERRAREREKERQKKGLPPIRSKTLSEQMLGKKISMASELLMIPPRGCAYVCMVHRQD
AFRALQKLSSGSYKIGSKVIKIAWALNKGVKTEYKQFWDVDLGVTYIPWEKVKVDDLEGF
AEGGMIDQETVNTEWETVKSSEPVKETVQTTQSPTPVEKETVVTTQAEVFPPPVAMLQIP
VAPAVPTVSLVPPAFPVSMPVPPPGFSPIPPPPFLRASFNPSQPPPGFMPPPVPPPVVPP
PTIPPVVPTSLVQPSLSMTPETVKDVGFGSLVIPGGSVASNLATSALPAGNVFNAPTKQA
EPEEKVPHLIDHQISSGENTRSVIPNDISSNAAILGGQPPNVTSNSGILGVQRPNVSSNS
EILGVRPSNVSSSSGIIAAQPPNILNNSGILGIQPPSVSNSSGLLGVLPPNIPNNSGLVG
VQPPNVPNTPGLLGTQPPAGPQNLPPLSIPNQRMPTMPMLDIRPGLIPQAPGPRFPLIQP
GIPPQRGIPPPSVLDSALHPPPRGPFPPGDIFSQPERPFLAPGRQSVDNVTNPEKRIPLG
NDNIQQEGDRDYRFPPIETRESISRPPPVDVRDVVGRPIDPREGPGRPPLDGRDHFGRPP
VDIRENLVRPGIDHLGRRDHFGFNPEKPWGHRGDFDEREHRVLPVYGGPKGLHEERGRFR
SGNYRFDPRSGPWNRGFGQEVHRDFDDRRRPWERQRDRDDRDFDFCREMNGNRLGRDRIQ
NTWVPPPHARVFDYFEGATSQRKGDNVPQVNGENTERHAQPPPIPVQNDPELYEKLTSSN
EINKEKSDTVADIESEPVVESTETEGT

>gi568815592f:154673987_154933392|GENSCAN_predicted_CDS_5|3684_bp
atgacccaaattactaaggcagccatcaaagctattaagttctataaacatgtggtacag
agtgttgagaagtttattcagaaatgtaaaccagaatacaaagtacctggactttatgtt
attgactccattgtgcgacaatcccgacatcagtttggtcaagaaaaggatgtgtttgca
cccagatttagtaataacatcattagcactttccagaatttatatcgttgccctggggat
gacaagagtaaaatagtgagagtactaaacttatggcagaagaataatgtatttaagagt
gagattattcagccccttttggatatggcagccgggattccgcctccagttgtcacacct
gttttggccagcactaccactgctatgagcaatactccaggaactcctgtgacacctgtt
actccggccaatgtggtccaaggcttacctgatccgtgggtatctcagataacaaataca
gatacacttgcggctgtagctcagatcttgcaaagtcctcaaggccagcagcttcaacaa
ttaatacaaaccttacagatacaacaacagaagccccagccttccattctgcaggcccta
gatgctggtcttgttgttcagttgcaagctcttacggcacaacttacagctgcagctgca
gctgccaacactcttactcccttagaacagggagtctcctttaacaagaagttgatggat
aggtttgattttggggaagactctgagcatagtgaagaacccaaaaaggaaattccagct
tcacaactttctcacgtttcagaatctgtgaacaattccatttttcatcagatagcagaa
caactacaacagcaaaacctagaacatctcagacagcagctcttggagcagcaacagcct
caaaaggccactcctcaggatagtcaggaaggaacctttgggtcagagcattcagcgtca
ccatcacaagggagtagtcagcagcattttcttgaacctgaagtcaatttggatgattcc
atagatattcagcaacaggatatggatatagatgaagggcaagatggagtggaagaggag
gtctttgaacaagaagctaagaaagtggcggttcgctcaagatcaagaacacattcacga
tctcgttcaagatcaccaagaaaacgaaggtctaggtcacggtctggctctagaaagcgt
aaacacagaaagcgatcacgctcccgctcaagagaaagaaagaggaaatcatcacggtcg
tattcaagtgaaaggagagccagagaaagggagaaagaacgacagaaaaagggattacct
ccaattagatctaaaacactaagtgagcaaatgttaggtaagaaaatcagtatggccagc
gaattgctaatgattcctccccggggctgtgcttatgtctgcatggttcatcgacaagat
gcatttcgagctcttcagaaactcagttctggatcatataaaattgggtccaaggtcatt
aagatcgcttgggctttaaacaaaggtgtaaaaacagaatacaaacaattctgggatgtg
gatcttggagttacatatataccatgggaaaaagttaaagtggatgacttggaaggtttt
gcagaaggaggcatgattgatcaggagactgtaaatactgagtgggaaactgtgaaaagc
tcagaacctgttaaagagacggtccagacaactcagagcccaactccagttgaaaaggag
acagtggtcacaacccaggcagaggttttccctcctcctgttgctatgttgcagattcca
gtggcgccagccgtgcctacagttagtttagtcccaccagcatttcctgtgtcgatgccg
gttcctcctcctggattcagtccaatccctccacctccttttttaagagcaagttttaac
ccttcacaaccaccacctggtttcatgccgcctccagttcccccacctgttgtgccaccc
cctacgattccaccagtagtaccaacatctttagtgcagccgtcattatccatgacaccg
gaaactgtgaaagatgttggatttggtagccttgttataccaggcggttctgttgccagc
aatcttgctacttccgctctgccagctggaaatgtttttaatgctccaactaaacaggca
gagcctgaagaaaaagtacctcatcttatagaccaccagatttcttctggtgaaaacacc
agatcagtgattccaaatgatatttcaagtaatgctgcaattttaggaggacagccgcca
aatgtgacaagcaattctggaattctgggagtccaaagaccaaatgtatcaagtaattct
gaaattcttggggtccggccatctaatgtttccagtagttctgggattattgcagcccaa
ccaccaaatattctaaataactctggaatattgggaatacagccacccagtgtgtcaaat
agttctggacttttgggagtgctacccccaaatatacctaacaattctggacttgtagga
gtacagccaccaaatgttccaaatactcctggacttctgggaacacagccaccagctgga
cctcaaaacttaccccctttaagtatccctaatcaaaggatgcccacaatgccaatgtta
gacattcgtccgggactaataccacaggcacctgggccaagattccctttaatacagcct
ggaattccaccccaacggggaatcccacccccatcggtacttgattcagctcttcatcca
ccaccccgtggaccttttcctccaggagatatttttagtcaaccagaaagacctttttta
gctcctggaagacaaagcgtagacaatgttactaacccagaaaaaaggataccacttggg
aatgataacattcaacaggaaggagatagagattaccggtttcctcctatagaaaccagg
gaaagcattagtagacctccccctgtggatgttagagatgtggttgggcggcctatagat
ccaagagaaggtcctggacggcctccactagatggtagggatcattttggaagacctcct
gtagatataagagagaatcttgtgaggccaggtatagatcatcttggtcgaagagaccac
tttggctttaatccagagaagccctgggggcatagaggagattttgatgagagagagcat
cgggttctaccggtctatggtggtccaaaaggcttacatgaagaaagaggtagatttcgg
tctggaaactatcgatttgatcctagaagtggtccttggaaccgaggatttggacaagaa
gttcacagagattttgatgaccgcagaagaccctgggagaggcaaagggatagggatgac
agagattttgatttctgcagagaaatgaatggaaatcgtcttggacgagacagaattcaa
aacacttgggttccccctcctcatgctcgggtttttgattattttgaaggggccacttct
caacgaaaaggtgataatgtgcctcaggttaatggtgaaaatacagagagacatgctcag
ccaccacctataccagtacagaatgatcctgaactttatgaaaaactgacatcttcaaat
gaaataaacaaggagaagagtgacacagttgctgatatagaaagtgaaccagtggtagaa
agcacagaaactgaggggacataa

>gi568815592f:154673987_154933392|GENSCAN_predicted_peptide_6|677_aa
XIAMQSLKILSKIKCQARRELNNSTACKERPLEVAGMLGGHVEASREYQMLSHVGNPDYP
HALATEQATSPSTESGSAGFKVGRGGGIVSETNHLAFNPRTSIPFLWAHFINESRPFAFL
LRSSEGSASTLNQGPCEFHESDVVTAPSSPHKHVDSRKMVEIPNLKSSGSKCRQPSQREG
EAEDTADLITPRAPRGTQRSNPSFALMSTTVMDSPVAIDGPQEAFPVGKDAQSRVSLGHV
SEESPVQIPIFPLLSLTGSSSVIDLKGDERGHHLLTGPVATAPKAKKLKPTPYGVTKDTS
NPSGARRLFQASGLKIPQMLSRHSLGMMTSSLACNSRKLAFSLLENQNIRPLPSIRILSG
LPQFLFGSKFEANLDVLVPREVIFQEDYKGNIATNIEQTHNLCPSHSSSPHSQGFTYCVL
TLNPHIYTLLSIIKSGNYRQDFLAEHRECSRLSPKVGFITCLLPSVFAVEGMLQQLHGRV
LVVTGSLLRRAQILKAISQMSHAAVKCRFKNEKDDKDKALQDTEHTGALRSFELGPRDTI
IWGEEGSHGESWSRQTLIKECEGSTELGASHYNHVAHHLQHTTAFSITLRGIHGCIQHHA
QGKLQDARQVPISLIAFPTPPLSTVHGCISTARCHISAMAPITLSSNYLPDCIVDSSRAE
TLLSFSVCLVPMCLLNK

>gi568815592f:154673987_154933392|GENSCAN_predicted_CDS_6|2034_bp
nngattgcaatgcagtctctgaaaatactctccaaaatcaagtgtcaggcccggagagaa
ctaaacaactctacggcttgcaaggagcgccctcttgaggtagcagggatgcttggggga
catgtagaagcttccagagagtaccagatgttgtcccatgtggggaatccagactacccc
catgcccttgctacagagcaggccaccagcccctctactgagagtggctcagccggcttt
aaagtggggaggggaggaggaattgtttcagagaccaatcatctggcatttaacccaaga
acttccattccgtttttgtgggcacattttataaatgagtcaaggccatttgcattttta
ttaaggtcttctgagggatcggccagcaccctgaaccagggaccctgtgaatttcatgag
tctgacgtggtaactgcaccatcttccccacacaagcatgtggattctcgaaagatggtg
gaaatcccaaacctaaagtcatcaggaagcaagtgcaggcagcccagccaacgtgaagga
gaagctgaggacacagctgatctgattactcccagagcaccgcgcggaacacaaagatca
aacccatcttttgcccttatgagtacaacagtcatggactctccagtggccatagatgga
ccgcaggaggccttcccagtaggtaaggatgcccagtcacgagtctcccttggtcatgtc
tccgaggagagccctgttcagatccccatcttccccctcctttccttgactgggagctcc
tctgttattgatttaaaaggtgatgaaagaggccaccaccttctcacaggtccggtggca
acagcgccaaaggctaagaagctcaaacctacgccctatggtgttaccaaagatacatcc
aatccctcaggtgcacgtcgcttatttcaagccagtggccttaaaatccctcaaatgctc
tccaggcactctctgggaatgatgacaagttcactggcctgtaattctaggaagctagct
ttttctttgcttgaaaatcaaaacattcgcccacttccaagcatcagaatcctctccggc
ttgccacagttcctgtttggcagcaagtttgaggccaacctggatgttctggttcccagg
gaagtcatttttcaagaagattataaaggtaatattgccaccaacattgaacaaacccat
aacctgtgtcccagtcattcttcaagccctcactctcaaggctttacttactgtgtgctc
acactcaatccacacatctacaccttgctcagtataataaagagtggtaattacagacag
gatttccttgcggaacatcgggaatgttccagactttccccaaaagtaggttttatcacc
tgcttgctgccaagtgtttttgctgtggaaggaatgctgcagcagctgcatggaagggtg
ctggtggtcactggcagcctgcttaggagagcacagattttgaaagcaatttcccaaatg
tcacatgcagctgtcaagtgtagatttaaaaatgaaaaagatgacaaagacaaagccttg
caggacacagagcacacaggagctctcaggagctttgaacttggtccaagagacacaatt
atttggggagaagagggctcacatggtgaaagctggtcaagacaaactctaataaaagaa
tgtgaaggttccacagagcttggagcctctcactacaaccatgttgctcatcacctgcag
cacacgactgcattcagcatcacactcaggggtatccacggctgcattcagcatcatgct
caaggcaagctgcaggacgcccgtcaagtgccaatttctctaatcgccttccctactcca
cccctttccacggttcatggatgcatttccacagcacgctgccacatctctgccatggcg
ccaatcactctgtccagtaattaccttcctgactgcattgtggattcctcaagggcagag
accttgctttcttttagcgtgtgcctggtgcctatgtgtttgctgaataaatga