GENSCAN 1.0 Date run: 3-Nov-116 Time: 20:32:19 Sequence gi568815576r:31226894_31445602 : 218709 bp : 46.88% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 21745 21876 132 0 0 71 109 77 0.634 8.26 1.02 Intr + 26301 26413 113 2 2 80 70 6 0.330 -2.82 1.03 Intr + 31398 31533 136 0 1 94 101 159 0.958 18.37 1.04 Intr + 32228 32337 110 1 2 111 99 89 0.999 11.38 1.05 Intr + 32996 33184 189 2 0 76 89 245 0.999 22.10 1.06 Intr + 35241 35346 106 0 1 71 65 227 0.999 18.92 1.07 Intr + 35702 35898 197 1 2 81 45 155 0.999 8.71 1.08 Intr + 39053 39239 187 2 1 96 105 153 0.972 17.49 1.09 Intr + 40091 40177 87 1 0 82 94 109 0.999 11.07 1.10 Intr + 40883 41014 132 1 0 96 89 224 0.999 24.14 1.11 Intr + 41251 41307 57 0 0 92 101 33 0.904 4.08 1.12 Intr + 44243 44308 66 1 0 89 76 89 0.794 6.90 1.13 Intr + 45637 45811 175 0 1 100 113 334 0.999 36.61 1.14 Intr + 46559 46614 56 0 2 77 83 36 0.997 0.70 1.15 Intr + 48258 48415 158 2 2 82 77 207 0.893 17.81 1.16 Intr + 49896 50243 348 0 0 29 53 636 0.571 48.67 1.17 Term + 51404 51548 145 2 1 131 41 196 0.942 16.58 1.18 PlyA + 52812 52817 6 1.05 2.07 PlyA - 55303 55298 6 1.05 2.06 Term - 56395 56191 205 0 1 90 52 197 0.998 13.14 2.05 Intr - 62500 62422 79 2 1 93 78 151 0.981 13.21 2.04 Intr - 62806 62606 201 2 0 71 51 233 0.989 17.16 2.03 Intr - 64191 64072 120 1 0 110 82 199 0.999 21.97 2.02 Intr - 64403 64287 117 0 0 94 114 142 0.999 17.84 2.01 Init - 65451 65382 70 0 1 74 78 71 0.913 6.08 2.00 Prom - 81722 81683 40 -3.76 3.00 Prom + 87390 87429 40 -6.86 3.01 Init + 90727 90793 67 0 1 74 62 107 0.886 7.93 3.02 Term + 94656 94783 128 1 2 22 53 115 0.584 -0.16 3.03 PlyA + 95562 95567 6 1.05 4.05 PlyA - 95883 95878 6 1.05 4.04 Term - 100416 99998 419 1 2 100 54 386 0.943 31.84 4.03 Intr - 108970 108799 172 1 1 105 74 167 0.703 16.52 4.02 Intr - 116067 116004 64 2 1 108 91 35 0.812 4.52 4.01 Init - 118709 117439 1271 2 2 111 94 1499 0.987 144.58 4.00 Prom - 137497 137458 40 -0.96 5.02 PlyA - 139531 139526 6 1.05 5.01 Sngl - 158698 158333 366 1 0 76 41 169 0.976 7.10 5.00 Prom - 161601 161562 40 -5.96 6.00 Prom + 163108 163147 40 -4.96 6.01 Init + 172791 172832 42 2 0 98 80 33 0.484 3.92 6.02 Intr + 173727 173850 124 2 1 99 76 82 0.969 8.26 6.03 Intr + 176136 176311 176 1 2 82 109 105 0.998 11.66 6.04 Intr + 184119 184188 70 2 1 95 109 47 0.944 6.25 6.05 Intr + 193363 193532 170 2 2 73 93 169 0.999 15.57 6.06 Intr + 196387 196517 131 0 2 107 84 205 0.992 21.49 6.07 Intr + 199722 199889 168 0 0 95 101 115 0.999 12.56 6.08 Intr + 200167 200289 123 1 0 70 92 91 0.939 7.40 6.09 Term + 206979 207078 100 0 1 102 47 50 0.800 -0.10 6.10 PlyA + 208859 208864 6 1.05 7.04 PlyA - 209820 209815 6 1.05 7.03 Term - 213228 212987 242 1 2 110 53 217 0.829 16.59 7.02 Intr - 213975 213811 165 1 0 120 96 10 0.674 4.93 7.01 Intr - 216173 215910 264 0 0 78 42 259 0.251 17.78 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815576r:31226894_31445602|GENSCAN_predicted_peptide_1|797_aa MVAIYSRPEAVTDGFAAEPVYLVWEEDGELLVSPGLLHLQRPVSDSGMTGSDLSFTTSQL GTLLPTARADFDFHLKLPKIERCSECQDSLTNWYYEKDGKLYCPKDYWGKFGEFCHGCSL LMTGPFMVAGEFKYHPECFACMSCKVIIEDGDAYALVQHATLYCGKCHNEVVLAPMFERL STESVQEQLPYSVTLISMPATTEGRRGFSVSVESACSNYATTVQVKEVNRMHISPNNRNA IHPGDRILEINGTPVRTLRVEEVEDAISQTSQTLQLLIEHDPVSQRLDQLRLEARLAPHM QNAGHPHALSTLDTKENLEGTLRRRSLRRSNSISKSPGPSSPKEPLLFSRDISRSESLRC SSSYSQQIFRPCDLIHGEVLGKGFFGQAIKVTHKATGKVMVMKELIRCDEETQKTFLTEV KVMRSLDHPNVLKFIGVLYKDKKLNLLTEYIEGGTLKDFLRSMDPFPWQQKVRFAKGIAS GMAYLHSMCIIHRDLNSHNCLIKLDKTVVVADFGLSRLIVEERKRAPMEKATTKKRTLRK NDRKKRYTVVGNPYWMAPEMLNGKSYDETVDIFSFGIVLCEIIGQVYADPDCLPRTLDFG LNVKLFWEKFVPTDCPPAFFPLAAICCRLEPESRAPPGAAGEGPGCADDEGPVRRQGKVT IKYDPKELRKHLNLEEWILEQLTRLYDCQEEEISELEIDVDELLDMESDDAWASRVKELL VDCYKPTEAFISGLLDKIRAMQKLSTPQKKPAFSKLEDSFEALSLYLGELGIPLPAELEE LDHTVSMQYGLTRDSPP >gi568815576r:31226894_31445602|GENSCAN_predicted_CDS_1|2394_bp atggtggccatctacagccggcctgaggcagtcacagacggatttgcagctgagcctgtc tatctggtgtgggaagaagatggggagttacttgtcagtcccggcttacttcacctccag agacctgtttcggacagtgggatgactggttcagacctcagctttaccacctcccagctg ggtactcttctacctacagccagggcagattttgactttcacttgaaacttccaaaaatt gaaaggtgttcagaatgccaggattccctcaccaactggtactatgagaaggatgggaag ctctactgccccaaggactactgggggaagtttggggagttctgtcatgggtgctccctg ctgatgacagggccttttatggtggctggggagttcaagtaccacccagagtgctttgcc tgtatgagctgcaaggtgatcattgaggatggggatgcatatgcactggtgcagcatgcc accctctactgtgggaagtgccacaatgaggtggtgctggcacccatgtttgagagactc tccacagagtctgttcaggagcagctgccctactctgtcacgctcatctccatgccggcc accactgaaggcaggcggggcttctccgtgtccgtggagagtgcctgctccaactacgcc accactgtgcaagtgaaagaggtcaaccggatgcacatcagtcccaacaatcgaaacgcc atccaccctggggaccgcatcctggagatcaatgggacccccgtccgcacacttcgagtg gaggaggtggaggatgcaattagccagacgagccagacacttcagctgttgattgaacat gaccccgtctcccaacgcctggaccagctgcggctggaggcccggctcgctcctcacatg cagaatgccggacacccccacgccctcagcaccctggacaccaaggagaatctggagggg acactgaggagacgttccctaaggcgcagtaacagtatctccaagtcccctggccccagc tccccaaaggagcccctgctgttcagccgtgacatcagccgctcagaatcccttcgttgt tccagcagctattcacagcagatcttccggccctgtgacctaatccatggggaggtcctg gggaagggcttctttgggcaggctatcaaggtgacacacaaagccacgggcaaagtgatg gtcatgaaagagttaattcgatgtgatgaggagacccagaaaacttttctgactgaggtg aaagtgatgcgcagcctggaccaccccaatgtgctcaagttcattggtgtgctgtacaag gataagaagctgaacctcctgacagagtacattgaggggggcacactgaaggactttctg cgcagtatggatccgttcccctggcagcagaaggtcaggtttgccaaaggaatcgcctcc ggaatggcctatttgcactctatgtgcatcatccaccgggatctgaactcgcacaactgc ctcatcaagttggacaagactgtggtggtggcagactttgggctgtcacggctcatagtg gaagagaggaaaagggcccccatggagaaggccaccaccaagaaacgcaccttgcgcaag aacgaccgcaagaagcgctacacggtggtgggaaacccctactggatggcccctgagatg ctgaacggaaagagctatgatgagacggtggatatcttctcctttgggatcgttctctgt gagatcattgggcaggtgtatgcagatcctgactgccttccccgaacactggactttggc ctcaacgtgaagcttttctgggagaagtttgttcccacagattgtcccccggccttcttc ccgctggccgccatctgctgcagactggagcctgagagcagagccccccccggggccgca ggagagggcccgggctgcgcggatgatgagggcccagtgaggcgccaagggaaggtcacc atcaagtatgaccccaaggagctacggaagcacctcaacctagaggagtggatcctggag cagctcacgcgcctctacgactgccaggaagaggagatctcagaactagagattgacgtg gatgagctcctggacatggagagtgacgatgcctgggcttccagggtcaaggagctgctg gttgactgttacaaacccacagaggccttcatctctggcctgctggacaagatccgggcc atgcagaagctgagcacaccccagaagaaaccagcattctcgaaattggaggactccttt gaggccctctccctgtacctgggggagctgggcatcccgctgcctgcagagctggaggag ttggaccacactgtgagcatgcagtacggcctgacccgggactcacctccctag >gi568815576r:31226894_31445602|GENSCAN_predicted_peptide_2|263_aa MLLAWVQAFLVSNMLLAEAYGSGGCFWDNGHLYREDQTSPAPGLRCLNWLDAQSGLASAP VSGAGNHSYCRNPDEDPRGPWCYVSGEAGVPEKRPCEDLRCPETTSQALPAFTTEIQEAS EGPGADEVQVFAPANALPARSEAAAVQPVIGISQRVRMNSKEKKDLGTLGYVLGITMMVI IIAIGAGIILGYSYKRGKDLKEQHDQKVCEREMQRITLPLSAFTNPTCEIVDEKTVVVHT SQTPVDPQEGTTPLMGQAGTPGA >gi568815576r:31226894_31445602|GENSCAN_predicted_CDS_2|792_bp atgctgttggcctgggtacaagcattcctcgtcagcaacatgctcctagcagaagcctat ggatctggaggctgtttctgggacaacggccacctgtaccgggaggaccagacctccccc gcgccgggcctccgctgcctcaactggctggacgcgcagagcgggctggcctcggccccc gtgtcgggggccggcaatcacagttactgccgaaacccggacgaggacccgcgcgggccc tggtgctacgtcagtggcgaggccggcgtccctgagaaacggccttgcgaggacctgcgc tgtccagagaccacctcccaggccctgccagccttcacgacagaaatccaggaagcgtct gaagggccaggtgcagatgaggtgcaggtgttcgctcctgccaacgccctgcccgctcgg agtgaggcggcagctgtgcagccagtgattgggatcagccagcgggtgcggatgaactcc aaggagaaaaaggacctgggaactctgggctacgtgctgggcattaccatgatggtgatc atcattgccatcggagctggcatcatcttgggctactcctacaagagggggaaggatttg aaagaacagcatgatcagaaagtatgtgagagggagatgcagcgaatcactctgcccttg tctgccttcaccaaccccacctgtgagattgtggatgagaagactgtcgtggtccacacc agccagactccagttgaccctcaggagggcaccaccccccttatgggccaggccgggact cctggggcctga >gi568815576r:31226894_31445602|GENSCAN_predicted_peptide_3|64_aa MAKIQTTNADKDVEQQELSFIAAFSSSFYSLKGPFVSLGDMKRFVTSNVSCYLDLKGKFA HVRL >gi568815576r:31226894_31445602|GENSCAN_predicted_CDS_3|195_bp atggccaaaatccagacaacaaatgctgacaaggatgtggagcaacaggaactttcattc attgctgcattttccagcagcttctacagtctgaaagggccctttgtctcccttggtgac atgaagaggtttgtcacttcaaatgtcagctgttacttggacttgaagggcaagtttgcc catgtccgcctgtga >gi568815576r:31226894_31445602|GENSCAN_predicted_peptide_4|641_aa MERVNDASCGPSGCYTYQVSRHSTEMLHNLNQQRKNGGRFCDVLLRVGDESFPAHRAVLA ACSEYFESVFSAQLGDGGAADGGPADVGGATAAPGGGAGGSRELEMHTISSKVFGDILDF AYTSRIVVRLESFPELMTAAKFLLMRSVIEICQEVIKQSNVQILVPPARADIMLFRPPGT SDLGFPLDMTNGAALAANSNGIAGSMQPEEEAARAAGAAIAGQASLPVLPGVDRLPMVAG PLSPQLLTSPFPSVASSAPPLTGKRGRGRPRKANLLDSMFGSPGGLREAGILPCGLCGKV FTDANRLRQHEAQHGVTSLQLGYIDLPPPRLGENGLPISEDPDGPRKRSRTRKQVACEIC GKIFRDVYHLNRHKLSHSGEKPYSCPVCGLRFKRKDRMSYHVRSHDGSVGKPYICQSCGK GFSRPDHLNGHIKQVHTSERPHKCQTCNASFATRDRLRSHLACHEDKVPCQVCGKYLRAA YMADHLKKHSEGPSNFCSICNREGQKCSHQDPIESSDSYGDLSDASDLKTPEKQSANGSF SCDMAVPKNKMESDGEKKYPCPECGSFFRSKSYLNKHIQKVHVRALGGPLGDLGPALGSP FSPQQNMSLLESFGFQIVQSAFASSLVDPEVDQQPMGPEGK >gi568815576r:31226894_31445602|GENSCAN_predicted_CDS_4|1926_bp atggagcgggtgaacgacgcttcgtgcggcccgtctggctgctacacataccaggtgagc agacacagcacggagatgctgcacaacctgaaccagcagcgcaaaaacggcgggcgcttc tgcgacgtgctcttgcgggtaggcgacgagagcttcccagcgcaccgcgccgtgctggcc gcctgcagcgagtactttgagtcggtgttcagcgcccagttgggcgacggcggagctgcg gacgggggtccggctgatgtagggggcgcgacggcagcaccaggcggcggggccgggggc agccgggagctggagatgcacactatcagctccaaggtatttggggacattctggacttc gcctacacttcccgcatcgtggtgcgcttggagagctttcccgaactcatgacggccgcc aagttcctgctgatgaggtcggttatcgagatctgccaggaagtcatcaaacagtccaac gtacagatcctggtaccccctgcccgcgccgatataatgctctttcgcccccctgggacc tcggacttgggcttccctttggacatgaccaacggggcagccttggcagccaacagcaat ggcatcgccggcagcatgcagccagaggaggaggcagctcgggcggctggtgcagccatt gcaggccaagcctctttgcctgtgttacctggggtggaccgcttgcccatggtggctgga cccctatccccccaactgctgacttccccattccccagtgtggcatccagtgcccctccc ctgactggcaagcgaggccggggccgcccaaggaaggccaacctgctggactcaatgttt gggtccccagggggcctgagggaggcaggcatccttccatgcggtctatgtggtaaggtg ttcactgatgccaaccggctccggcagcacgaggcccagcacggtgtcaccagcctccag ctgggctacatcgaccttcctcctccgaggctgggtgagaatgggctacccatctctgaa gaccccgacggcccccgaaagaggagccggaccaggaagcaggtggcttgtgagatctgc ggcaagatcttccgtgatgtgtatcatcttaaccggcacaagctgtcccactctggggag aagccctactcctgccctgtgtgtgggttgcggttcaagagaaaagaccgcatgtcctac catgtgcggtcccatgatgggtccgtgggcaagccttacatctgccagagctgtgggaaa ggcttctccaggcctgatcacttgaacggacatatcaagcaggtgcacacttctgagcgg cctcacaagtgtcagacctgcaatgcttcttttgccacccgagaccgtctgcgctcccac ctggcctgtcatgaagacaaggtgccctgccaggtgtgtgggaagtacttgcgggcagca tacatggcagaccacctgaagaagcacagcgaggggcccagcaacttctgcagtatctgt aaccgagaaggccagaaatgctcacatcaggatccgattgagagctctgactcctatggt gacctctcagatgccagcgacctgaagacgccagagaagcagagtgccaatggctctttc tcctgcgacatggcagtccccaaaaacaaaatggagtctgatggggagaagaagtaccca tgccctgaatgtgggagcttcttccgctctaagtcctacttgaacaaacacatccagaag gtgcatgtccgggctctcgggggccccctgggggacctgggccctgcccttggctcacct ttctctcctcagcagaacatgtctctcctcgagtcctttgggtttcagattgttcagtcg gcatttgcgtcatctttagtagatcctgaggttgaccagcagcccatggggcctgaaggg aaatga >gi568815576r:31226894_31445602|GENSCAN_predicted_peptide_5|121_aa MDTFLDTYTFPKLKQEEIDSLNRTIMSSKMKSVINSLLTKKSPGPDEVTAEFHQMYKDGI VLFLLKLFQKIEAEGLLPNSFCEPGIILTLKPGRDTPKIENFKPISLMNIDAEILNKILA N >gi568815576r:31226894_31445602|GENSCAN_predicted_CDS_5|366_bp atggatacattcctggacacatacaccttcccaaaactgaaacaagaagaaatagattcc ctgaacagaacaataatgagctccaaaatgaaatcagtaataaatagcctactaacaaag aaaagcccaggaccagatgaagtcacagccgaattccaccagatgtacaaagacgggata gtactatttcttctgaaactatttcaaaaaattgaggcagagggactcctccccaactca ttctgtgagcccggcatcatcctgacactaaaacctggcagagacacacccaaaatagaa aacttcaagccaatatccttgatgaacatcgatgcagaaatcctcaacaaaatacttgca aattga >gi568815576r:31226894_31445602|GENSCAN_predicted_peptide_6|367_aa MSSTLAKIAEIEAEMARTQKNKATAHHLGLLKARLAKLRRELITPKGGGGGGPGEGFDVA KTGDARIGFVGFPSVGKSTLLSNLAGVYSEVAAYEFTTLTTVPGVIRYKGAKIQLLDLPG IIEGAKDGKGRGRQVIAVARTCNLILIVLDVLKPLGHKKIIENELEGFGIRLNSKPPNIG FKKKDKGGINLTATCPQSELDAETVKSILAEYKIHNADVTLRSDATADDLIDVVEGNRVY IPCIYVLNKIDQISIEELDIIYKVPHCVPISAHHRWNFDDLLEKIWDYLKLVRIYTKPKG QLPDYTSPVVLPYSRTTVEDFCMKIHKNLIKEFKYALVWGLSVKHNPQKVGKDHTLEDED VIQIVKK >gi568815576r:31226894_31445602|GENSCAN_predicted_CDS_6|1104_bp atgagcagcaccttagctaagatcgcggagatagaagcagagatggctcggactcaaaag aacaaggccacagcacaccacttagggctgcttaaggctcgtcttgctaagcttcgtcga gaactcattactccaaagggtggtggtggtggaggtccaggagaaggttttgatgtggcc aagacaggtgatgctcgaattggatttgttggttttccatctgtggggaagtcaacactg cttagtaacctggcaggggtatattctgaggtggcagcctatgaattcactactctgacc actgtgcctggtgtcatcagatacaaaggtgccaagatccagctcctggatctcccaggt atcattgaaggtgccaaggatgggaaaggtagaggtcgtcaagtcattgcagtggcccga acctgtaacttgatcttgattgttctggatgtcctgaaacctttgggacataagaagata attgaaaatgagctggaaggctttggcattcgcttgaacagcaaaccccccaacattggc tttaagaagaaggacaagggaggcattaatctcacagccacttgcccccagagtgagctg gatgctgaaactgtgaagagcattctggctgaatacaagattcataatgccgatgtgact ctacgtagtgatgctacagctgatgacctcattgatgtggtggaaggaaacagagtttat atcccctgtatctatgtgttaaataagattgaccaaatctccattgaggaattggatatc atctataaggtgcctcactgtgtacccatctctgcccatcaccgctggaattttgatgac ctattggaaaagatctgggactatctgaaactagtgagaatttacaccaaacccaaaggc cagttaccagattacacatccccagtggtgcttccttactccaggaccacagtggaggat ttctgcatgaagattcacaaaaatcttatcaaagaatttaaatatgctctggtctggggt ctctctgtgaaacacaatcctcagaaagtgggtaaagaccatacgttggaggatgaggat gtcattcaaattgtgaagaagtga >gi568815576r:31226894_31445602|GENSCAN_predicted_peptide_7|223_aa XIRKMYESKEKSKEEPASGKAALGDSKEDTQKASEGTAELLRALLKKVCDGENMGLSADQ PDILYKALGKTTSFSGLWFCVSEKVLCDSVLPPGMDLSHLQGISGPILGQPFYPLPAASH PLLNPRPGTPLHLAMVQQQLQRSVLHPPGSGSHAAAVSVQTTPQNVPSRSGLPHMHSQLE HRPSQRSSSPVGLAKWFGSDVLQQPLPSMPAKVISVDELEYRQ >gi568815576r:31226894_31445602|GENSCAN_predicted_CDS_7|672_bp ntgattcgtaagatgtacgagagcaaagagaaaagcaaggaggagccagcatctggaaaa gcagctcttggtgacagtaaagaggatactcagaaggccagtgaaggtactgcagagctg ttaagggcactgctcaagaaagtatgtgatggagaaaacatgggcctgagcgctgaccag cctgatattctatacaaagccctgggcaagaccactagcttctctggactatggttctgt gtcagtgagaaggttctgtgtgacagtgtgcttcctcctgggatggacttgagtcattta cagggaatatctggccccatcctgggtcagcccttttaccctttacctgctgctagtcac cctctcttaaaccctcgtcctggaacacctctgcatctggcaatggtgcaacagcagcta cagcgctcagttctgcatcctccaggctctggttcccatgcagcagctgtcagcgttcag acaacccctcagaacgtgcccagccggtcaggcctgccccacatgcactcccagctggag catcgccccagccagaggagcagctcccctgtgggccttgccaaatggtttggctcagat gtgctacagcaacccctgccctccatgcccgccaaagttatcagtgtagatgaattggaa taccgacagtga