GENSCAN 1.0 Date run: 5-Nov-116 Time: 22:53:21 Sequence gi568815594f:74265309_74484805 : 219497 bp : 37.14% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.05 PlyA - 808 803 6 1.05 1.04 Term - 5174 5018 157 1 1 86 55 88 0.675 1.72 1.03 Intr - 6390 6244 147 2 0 75 39 108 0.621 3.03 1.02 Intr - 6835 6732 104 1 2 80 9 88 0.736 -1.85 1.01 Init - 10108 9794 315 1 0 51 72 178 0.825 9.88 1.00 Prom - 10157 10118 40 -5.85 2.00 Prom + 10254 10293 40 -4.65 2.01 Init + 11761 11809 49 0 1 95 81 44 0.509 5.56 2.02 Term + 16117 16271 155 2 2 99 49 176 0.250 11.90 2.03 PlyA + 16742 16747 6 1.05 3.04 PlyA - 17597 17592 6 1.05 3.03 Term - 18583 18511 73 0 1 85 47 72 0.073 -0.80 3.02 Intr - 24078 23970 109 1 1 63 71 102 0.173 4.42 3.01 Init - 30103 29986 118 1 1 71 98 76 0.920 7.31 3.00 Prom - 37647 37608 40 -2.85 4.00 Prom + 40563 40602 40 -7.95 4.01 Init + 43787 43847 61 1 1 67 88 108 0.805 10.16 4.02 Intr + 46877 46997 121 0 1 93 99 108 0.991 11.03 4.03 Intr + 69563 69858 296 2 2 35 -25 231 0.000 2.23 4.04 Intr + 99962 100067 106 0 1 94 106 136 0.361 14.35 4.05 Intr + 105535 105569 35 1 2 133 44 12 0.095 -1.75 4.06 Intr + 114140 114226 87 0 0 71 87 30 0.264 0.22 4.07 Intr + 115706 115829 124 0 1 75 90 88 0.602 6.42 4.08 Intr + 117337 117486 150 1 0 123 115 83 0.999 12.86 4.09 Term + 119419 119500 82 1 1 73 42 108 0.925 0.89 4.10 PlyA + 121119 121124 6 1.05 5.00 Prom + 123509 123548 40 -6.95 5.01 Init + 127907 127949 43 1 1 99 86 39 0.750 5.43 5.02 Intr + 133207 133322 116 2 2 43 -12 143 0.055 -0.95 5.03 Intr + 148046 148184 139 1 1 -22 19 299 0.473 11.12 5.04 Intr + 151225 151249 25 2 1 82 113 2 0.359 -1.73 5.05 Term + 156415 156658 244 1 1 61 48 231 0.461 10.79 5.06 PlyA + 158004 158009 6 1.05 6.02 PlyA - 158562 158557 6 1.05 6.01 Sngl - 173119 172775 345 1 0 40 49 212 0.294 8.29 6.00 Prom - 177188 177149 40 -5.15 7.00 Prom + 177553 177592 40 -7.95 7.01 Init + 179770 180070 301 0 1 88 -2 303 0.358 17.76 7.02 Intr + 181226 181474 249 0 0 120 105 323 0.999 33.69 7.03 Intr + 183739 183940 202 2 1 80 108 56 0.997 4.22 7.04 Intr + 185072 185224 153 2 0 52 100 175 0.989 13.37 7.05 Intr + 189451 189606 156 1 0 98 93 58 0.516 5.50 7.06 Intr + 212965 213133 169 1 1 47 107 90 0.007 5.83 7.07 Intr + 217988 218146 159 1 0 41 64 136 0.011 5.76 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 50122 50212 91 1 1 83 43 105 0.939 1.71 S.002 Intr - 216873 216737 137 0 2 18 110 97 0.882 3.35 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594f:74265309_74484805|GENSCAN_predicted_peptide_1|240_aa MHTYIYRSAIYNNKDLEPNQMPINDGLDKENVVNIHCGILCSHKRERDHVLCRDMDEAGS HYPQQTSTGTENQTPHVLIHKWELNNENTWTQERQQQTLGPVGGGDALFIASFDSDVDLP IPLYPNQAQIEDIKAMPEFLIGKNYFKVHMEPKKSPHCQDNPKPKEQAGGIMLPDFKLYY KATVTKTAWDMDEAGNHHSQQTVARTKNQTLHVLTHRWELNNENTWTQEGEHRTPGPVVG >gi568815594f:74265309_74484805|GENSCAN_predicted_CDS_1|723_bp atgcacacatatatttatcgcagcgctatttacaataacaaagacttggaaccaaaccaa atgcccatcaatgatggactggataaagaaaatgtggtaaatatacactgtggaatacta tgcagccacaaaagggaacgagatcatgtcctttgcagggacatggatgaagctggaagc cattatcctcagcaaactagcacaggaacagaaaaccaaacaccgcatgttctcattcat aagtgggagttgaacaatgagaacacatggacacaggaaaggcaacaacagacattgggg cctgttgggggtggggatgccctattcatagctagtttcgatagcgacgtagacttacca atacctttgtatccaaatcaagcccaaattgaagatatcaaagcgatgccagaattttta attggaaaaaactactttaaagttcatatggaaccaaaaaagagcccgcactgccaagac aatcctaagccaaaagaacaagctggaggcatcatgctacctgacttcaaactatactac aaggctacagtaaccaaaacagcatgggacatggatgaagctggaaaccatcattctcag caaactgttgcaagaacaaaaaaccaaacactgcatgttctcactcataggtgggaattg aacaatgagaacacttggacacaggaaggggaacatcgcacaccggggcctgttgtgggg tga >gi568815594f:74265309_74484805|GENSCAN_predicted_peptide_2|67_aa MEEQNGSQYLYCLHKNGIPKLITSDMVKEGAAVIDVGINYVHDPVTGKTKLVGDVDFEGN KPISFDR >gi568815594f:74265309_74484805|GENSCAN_predicted_CDS_2|204_bp atggaggagcaaaatgggtcccagtacctctattgcttgcataagaatggtattccaaag ttgattacgtctgatatggttaaagaaggtgctgctgtaattgatgtgggtatcaactat gtccacgatccagtgacaggaaagacaaaattagttggagatgtggacttcgaaggtaat aaaccaatatcttttgataggtga >gi568815594f:74265309_74484805|GENSCAN_predicted_peptide_3|99_aa MGIRERFVKLQKSCKIDGHLSREERERESNADTGKSKSKAHKNRQPGPPQGLAFAAPLAW NAAPSSVHDSLLHLFRLSPDEQLEEQHVKTYHLPAPVGG >gi568815594f:74265309_74484805|GENSCAN_predicted_CDS_3|300_bp atggggatcagagaaagatttgtaaagctgcagaaaagctgcaagattgatggacacctg agcagggaagaaagggaaagagaaagcaatgcagacacagggaagagtaaaagcaaagct cataaaaaccgccagccaggcccccctcaaggtcttgcatttgctgctccacttgcctgg aatgctgctccctcatctgtccacgactcccttcttcatttattcaggttaagtcctgat gaacaacttgaagagcaacatgtcaaaacctatcacttgcctgctcctgtgggtggctga >gi568815594f:74265309_74484805|GENSCAN_predicted_peptide_4|353_aa MTALTEEAAVTVTPPITAQQADNIEGPIALKFSHLCLEDHNSYCINGACAFHHELEKAIC RLQAVFGLKVGFHRGPVPICLGICLPPFAINPSSEEVHLTAVRIQTVTALNCFMLTGCTV LGKRQSDLSEAYQSVSRVMGPGNGTHCPWLHLHDHLEFDAALRQAPAPAPIADDRGEEDG DALCRQGPCAAALPGDKAAQLQGNKISFHLLQAVLSTTVIPSCIPGESSDNCTALVQTED NPRVAQVSITKCSSDMNGYCLHGQCIYLVDMSQNYCRCEVGYTGVRCEHFFLTVHQPLSK EYVALTVILIILFLITVVGSTYYFCRWYRNRKSKEPKKEYERVTSGDPELPQV >gi568815594f:74265309_74484805|GENSCAN_predicted_CDS_4|1062_bp atgacagcactgaccgaagaggcagccgtgactgtaacacctccaatcacagcccagcaa gctgacaacatagaaggacccatagccttgaagttctcacacctttgcctggaagatcat aacagttactgcatcaacggtgcttgtgcattccaccatgagctagagaaagccatctgc aggcttcaggctgtctttggcttgaaggtcggctttcaccggggacctgtccctatctgc ctaggaatctgtctgcctccttttgctatcaatccctcctctgaagaggtacatctaact gctgttaggatacaaacagtaaccgctcttaactgcttcatgctaacagggtgcactgtt ttgggaaaacggcagtcagatctctcagaggcctatcaaagtgtttcccgggtaatggga ccaggaaatgggacccattgcccgtggctccatttgcatgaccatttggagtttgatgcc gccctccgccaagccccagcgcccgctcccatcgccgatgaccgcggggaggaggatgga gatgctctgtgccggcagggtccctgcgctgctgctctgcctggagataaggcagctcag ttgcagggaaacaaaataagtttccatcttctacaggcagtcctcagtacaactgtgatt ccatcatgtatcccaggagagtccagtgataactgcacagctttagttcagacagaagac aatccacgtgtggctcaagtgtcaataacaaagtgtagctctgacatgaatggctattgt ttgcatggacagtgcatctatctggtggacatgagtcaaaactactgcaggtgtgaagtg ggttatactggtgtccgatgtgaacacttctttttaaccgtccaccaacctttaagcaaa gaatatgtggctttgaccgtgattcttattattttgtttcttatcacagtcgtcggttcc acatattatttctgcagatggtacagaaatcgaaaaagtaaagaaccaaagaaggaatat gagagagttacctcaggggatccagagttgccgcaagtctga >gi568815594f:74265309_74484805|GENSCAN_predicted_peptide_5|188_aa MAEAKESLQVKHSKGVKLQTLAVSVVAHKGSVDPKSEQQQDLLQTVKEQCFHSLHGDDDG DGGSDTDDYGDDDSGNGDNGDYNGGEGSDDDENKNGDDDGLLDKCVLSDLDLFYHRHYEE EGEEEKKGEGKEEGEEEGKGKEDGEEKRKKKKEEKEDANGEEEEKEEKEGRREEERKERR TKERKHIH >gi568815594f:74265309_74484805|GENSCAN_predicted_CDS_5|567_bp atggctgaagccaaagaatcacttcaagttaaacactccaaaggagtgaagctgcagacc ttggcggtaagtgttgtagctcataaaggcagtgtggacccaaagagcgagcagcagcaa gatttattgcaaacagtgaaagaacaatgcttccacagcttacatggtgatgatgatggt gatggtggtagtgatactgatgactatggtgatgatgacagtggcaatggtgataatggt gattataatggtggtgagggtagtgatgatgatgagaataagaatggtgatgatgatggc ttactagataagtgtgtgctaagtgatttagacttattttaccatcgtcactatgaagaa gaaggagaggaagaaaaaaaaggagaaggaaaagaggaaggggaagaggaggggaagggg aaggaagatggggaggaaaagagaaagaagaaaaaagaagaaaaggaggatgctaatggg gaggaggaggaaaaggaagagaaagagggaagaagagaggaagaaaggaaggaaagaagg acaaaagaaagaaaacacatacattaa >gi568815594f:74265309_74484805|GENSCAN_predicted_peptide_6|114_aa MVYNYRDLEEGKREQSKEKQQAKIMAAIIGDALNAQKASKGNPKGHKDNANRGSCFKCKE PGHWTKDYTKPLPKPGQKCEGASYDPWHWRCWLPPLPPRSSVRQNSSSAKRGIR >gi568815594f:74265309_74484805|GENSCAN_predicted_CDS_6|345_bp atggtgtataactaccgtgatctggaagaaggaaaaagggaacagagtaaagaaaaacag caagccaaaattatggcagccatcattggtgatgccctgaatgcccaaaaagcatccaag ggaaacccaaagggccacaaagataatgccaacaggggctcttgtttcaagtgcaaggaa cctgggcattggacaaaggactataccaagcctctgccaaaacccggccaaaaatgtgag ggtgccagttatgatccttggcactggaggtgttggctgcctccactcccaccaaggagc tcagtcaggcaaaactccagcagtgctaaaagaggaatcagatga >gi568815594f:74265309_74484805|GENSCAN_predicted_peptide_7|463_aa MGCGPLPAEPIKRQVRAALQTFAHLGASAPEVPGQPEAPRPPPRAPQAFESGAHSRSPLA LPTPARFGGSSCPRDRVAPETETPPLRRTNESPAATAGAGGHYAAGLDLNDTYSGKREPF SGDHSADGFEVTSRSEMSSGSEISPVSEMPSSSEPSSGADYDYSEEYDNEPQIPGYIVDD SVRVEQVVKPPQNKTESENTSDKPKRKKKGGKNGKNRRNRKKKNPCNAEFQNFCIHGECK YIEHLEAVTCKCQQEYFGERCGEKSMKTHSMIDSSLSKIALAAIAAFMSAVILTAVAVIT VQISHWSHCQVIAINDESVLFPVDHKTMDPFCYDGFKLSIVTFYAISVYKGARRYPLSTV QDTETTNISINNFSLTTENLLSTIGIMVKKVDKAPTLKKFTDDCRRNKTERTSYPGILGT EKSHEERVEESRDKNLLCPFYRGQKSSTLREHNAAIAQIAFLQ >gi568815594f:74265309_74484805|GENSCAN_predicted_CDS_7|1389_bp atgggctgcggccccctcccggctgagcctataaagcggcaggtgcgcgccgccctacag acgttcgcacacctgggtgccagcgccccagaggtcccgggacagcccgaggcgccgcgc ccgccgccccgagctccccaagccttcgagagcggcgcacactcccggtctccactcgct cttccaacacccgctcgttttggcggcagctcgtgtcccagagaccgagttgccccagag accgagacgccgccgctgcgaaggaccaatgagagccccgctgctaccgccggcgccggt ggccattatgctgctggattggacctcaatgacacctactctgggaagcgtgaaccattt tctggggaccacagtgctgatggatttgaggttacctcaagaagtgagatgtcttcaggg agtgagatttcccctgtgagtgaaatgccttctagtagtgaaccgtcctcgggagccgac tatgactactcagaagagtatgataacgaaccacaaatacctggctatattgtcgatgat tcagtcagagttgaacaggtagttaagcccccccaaaacaagacggaaagtgaaaatact tcagataaacccaaaagaaagaaaaagggaggcaaaaatggaaaaaatagaagaaacaga aagaagaaaaatccatgtaatgcagaatttcaaaatttctgcattcacggagaatgcaaa tatatagagcacctggaagcagtaacatgcaaatgtcagcaagaatatttcggtgaacgg tgtggggaaaagtccatgaaaactcacagcatgattgacagtagtttatcaaaaattgca ttagcagccatagctgcctttatgtctgctgtgatcctcacagctgttgctgttattaca gtccagatatcacattggagtcactgccaagtcatagccataaatgatgagtcggtcctc tttccagtggatcataagacaatggaccctttttgttatgatggttttaaactttcaatt gtcactttttatgctatttctgtatataaaggtgcacgaaggtatccattgagcactgta caagatactgaaacaaccaacatttcaatcaacaacttttcattgaccactgaaaatctc ctaagtactataggtataatggtgaagaaggtagataaagcccctactcttaagaagttt acagacgattgtagaaggaataaaaccgagagaacaagttatccaggaattctaggtacc gaaaagagtcatgaagaaagggtggaggaaagtagagataagaatctcttatgtcctttt tatcggggacagaaatcttcaacattaagagagcacaatgctgctattgcacagatagcc tttctccag