GENSCAN 1.0 Date run: 5-Nov-116 Time: 09:47:45 Sequence gi568815582f:14611689_14750812 : 139124 bp : 46.88% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.06 PlyA - 3863 3858 6 1.05 1.05 Term - 12265 12261 5 2 2 136 49 0 0.171 -2.03 1.04 Intr - 18031 17909 123 2 0 104 68 154 0.480 15.56 1.03 Intr - 18238 18112 127 1 1 23 12 91 0.246 -4.75 1.02 Intr - 18647 18468 180 2 0 63 39 105 0.176 3.16 1.01 Init - 18851 18804 48 2 0 84 58 14 0.413 -1.04 1.00 Prom - 23730 23691 40 -2.66 2.00 Prom + 28559 28598 40 -5.56 2.01 Init + 32659 32921 263 0 2 68 95 65 0.350 1.74 2.02 Intr + 36700 36904 205 1 1 114 92 59 0.997 8.10 2.03 Intr + 38116 38285 170 0 2 101 100 130 0.984 14.24 2.04 Intr + 43378 43522 145 1 1 93 61 129 0.998 10.98 2.05 Intr + 50204 50377 174 1 0 112 82 191 0.998 21.04 2.06 Intr + 53181 53383 203 2 2 74 100 99 0.907 7.78 2.07 Term + 55947 56139 193 0 1 116 42 52 0.469 0.29 2.08 PlyA + 57174 57179 6 1.05 3.05 PlyA - 57246 57241 6 1.05 3.04 Term - 61061 60919 143 0 2 107 42 131 0.961 8.49 3.03 Intr - 76583 76477 107 1 2 93 101 162 0.981 17.86 3.02 Intr - 78957 78807 151 1 1 99 105 103 0.905 12.32 3.01 Init - 82541 82445 97 2 1 51 96 243 0.569 19.87 3.00 Prom - 84947 84908 40 -7.16 4.02 PlyA - 86899 86894 6 1.05 4.01 Sngl - 91869 91309 561 0 0 84 40 351 0.273 26.04 4.00 Prom - 94232 94193 40 -4.16 5.00 Prom + 94745 94784 40 -5.46 5.01 Init + 97256 97375 120 1 0 39 113 143 0.936 11.19 5.02 Intr + 104380 104508 129 0 0 82 56 48 0.765 1.89 5.03 Intr + 108139 108238 100 0 1 95 86 37 0.977 3.88 5.04 Intr + 108402 108546 145 1 1 49 61 140 0.918 6.74 5.05 Intr + 111504 111611 108 0 0 100 84 84 0.401 8.60 5.06 Intr + 112800 112835 36 0 0 88 114 7 0.200 0.68 5.07 Term + 114195 114270 76 0 1 88 42 45 0.169 -2.79 5.08 PlyA + 114611 114616 6 1.05 6.05 PlyA - 115038 115033 6 1.05 6.04 Term - 115402 115398 5 2 2 65 40 0 0.346 -10.03 6.03 Intr - 115700 115594 107 1 2 93 101 162 0.961 17.86 6.02 Intr - 118074 117924 151 1 1 99 105 103 0.905 12.32 6.01 Init - 121659 121563 97 0 1 51 96 243 0.569 19.87 6.00 Prom - 124065 124026 40 -7.16 7.02 PlyA - 126017 126012 6 1.05 7.01 Sngl - 130991 130431 561 2 0 84 40 351 0.239 26.04 7.00 Prom - 133354 133315 40 -4.16 8.00 Prom + 133867 133906 40 -5.46 8.01 Init + 136378 136497 120 0 0 39 113 143 0.222 11.19 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815582f:14611689_14750812|GENSCAN_predicted_peptide_1|160_aa MGQGARLPVHHCISTPEGKEPSLGPAPAAAQALTQAQQARAVSGSRGRRHRCGCLSYCRS RRGIRRVEPLRRARAREPCTYSSNPGTIRGLVLLRAGHPPGLVPPDPCTPRGLDSPRPGI QATGSEPLFGFSADFKSNLHKVYQAIEEADFFAIDGEFSG >gi568815582f:14611689_14750812|GENSCAN_predicted_CDS_1|483_bp atggggcagggagcacgtcttcctgttcatcactgtatctccacccctgagggaaaggag cccagccttggccccgcccccgccgccgcgcaggcgctgacgcaagcgcagcaggcgcgc gctgtttccggaagtcgcggccggcgtcaccgctgcggctgcctcagctactgccgcagt cgccgcggaattcggcgagtagaaccgctgaggcgggcgcgggcccgggagccttgtact tactcctccaaccccggcaccatccggggccttgtccttctccgggccgggcaccccccg gggcttgtcccccctgacccatgcacccctcgaggacttgattcccctcgccccggaatt caggccactggttctgagcctcttttcggtttctccgcagattttaagagtaatcttcac aaagtgtaccaggccatagaggaggccgacttcttcgccatcgatggggagttttcaggc tga >gi568815582f:14611689_14750812|GENSCAN_predicted_peptide_2|450_aa MEEPQKSYVNTMDLERDEPLKSTGPQISVSEFSCHCCYDILVNPTTLNCGHSFCRHCLAL WWASSKKTECPECREKWEGFPKVSILLRDAIEKLFPDAIRLRFEDIQQNNDIVQSLAAFQ KYGNDQIPLAPNTGRANQQMGGGFFSGVLTALTGVAVVLLVYHWSSRESEHDLLVHKAVA KWTAEEVVLWLEQLGPWASLYRERFLSERVNGRLLLTLTEEEFSKTPYTIENSSHRRAIL MELERVKALGVKPPQNLWEYKAVNPGRSLFLLYALKSSPRLSLLYLYLFDYTDTFLPFIH TICPLQEDSSGEDIVTKLLDLKEPTWKQWREFLVKYSFLPYQLIAEFAWDWLEVHYWTSR FLIINAMLLSVLELFSFWRIWSRSELKTVPQRMWSHFWKVSTQGLFVAMFWPLIPQFVCN CLFYWALYFNPIINIDLVVKELRRLETQVL >gi568815582f:14611689_14750812|GENSCAN_predicted_CDS_2|1353_bp atggaggaacctcagaaaagctatgtgaacacaatggaccttgagagagatgaacctctc aaaagcaccggccctcagatttctgttagtgaattttcttgccactgctgctacgacatc ctggttaaccccaccaccttgaactgtgggcacagcttctgccgtcactgccttgcttta tggtgggcatcttcaaagaaaacagaatgtccagaatgcagagaaaaatgggaaggtttc cccaaagtcagtattctcctcagggatgccattgaaaagttatttcctgatgccattaga ctgagatttgaagacattcagcagaataatgacatagtccaaagtcttgcagcctttcag aaatatgggaatgatcagattcctttagctcctaacacaggccgagcgaatcagcagatg ggagggggattcttttccggtgtgctcacagctttaactggagtggcagtggtcctgctc gtctatcactggagcagcagggaatctgaacacgacctcctggtccacaaggctgtggcc aaatggacggcggaagaagttgtcctctggctggagcagctgggcccttgggcatctctt tacagggaaaggtttttatctgaacgagtaaatggaaggttgcttttaactttgacagag gaagaattttccaagacgccctataccatagaaaacagcagccacaggagagccatcctc atggagctagaacgtgtcaaagcattaggcgtgaagcccccccagaatctctgggaatat aaggctgtgaacccaggcaggtccctgttcctgctatacgccctcaagagctcccccagg ctgagtctgctctacctgtacctgtttgactacaccgacaccttcctacctttcatccac accatctgccctctgcaagaagacagctctggggaggacatcgtcaccaagcttctggat cttaaggagcctacgtggaagcagtggagagagttcctggtcaaatactccttccttcca taccagctgattgctgagtttgcttgggactggttggaggtccattactggacatcacgg tttctcatcatcaatgctatgttactctcagttctggaattattctccttttggagaatc tggtcgagaagtgaactgaagaccgtgcctcagaggatgtggagccatttctggaaagta tcaacgcaggggctttttgtggccatgttctggcccctcatccctcagtttgtttgcaac tgtttgttttactgggccctgtactttaacccaattattaacattgatcttgtggtcaag gaactccggcggctggaaacccaggtgttgtga >gi568815582f:14611689_14750812|GENSCAN_predicted_peptide_3|165_aa MGPLPVCLPIMLLLLLPSLLLLLLLPGPGSGEASRILRVHRRGILELAGTVGCVGPRTPI AYMKYGCFCGLGGHGQPRDAIDWCCHGHDCCYTRAEEAGCSPKTERYSWQCVNQSVLCGP AENKCQELLCKCDQEIANCLAQTEYNLKYLFYPQFLCEPDSPKCD >gi568815582f:14611689_14750812|GENSCAN_predicted_CDS_3|498_bp atggggccgctacctgtgtgcctgccaatcatgctgctcctgctactgccgtcgctgctg ctgctgctgcttctacctggccccgggtccggcgaggcctccaggatattacgtgtgcac cggcgtgggatcctggaactggcaggaactgtgggttgtgttggtccccgaacccccatc gcctatatgaaatatggttgcttttgtggcttgggaggccatggccagccccgcgatgcc attgactggtgctgccatggccacgactgttgttacactcgagctgaggaggccggctgc agccccaagacagagcgctactcctggcagtgcgtcaatcagagcgtcctgtgcggaccg gcagagaacaaatgccaagaactgttgtgcaagtgtgaccaggagattgctaactgctta gcccaaactgagtacaacttaaagtacctcttctacccccagttcctatgtgagccggac tcgcccaagtgtgactga >gi568815582f:14611689_14750812|GENSCAN_predicted_peptide_4|186_aa MATCGELSVNCCAADFSEQRRRLERRRRQVEPGPRGPGMGQQPLQPGSPGRGAGRQRASR QPPCGALTSLQAAPQQPPGSAHTSLQGSPLALHLPPPPRGVNCAVCRPGYADPGSPGPQQ PDEEPRATARGYEKEQDGAPEKCKSSELGPPCQERLGAEDGEMEMEKRQVGRSGAPPVGS ACAGGA >gi568815582f:14611689_14750812|GENSCAN_predicted_CDS_4|561_bp atggcgacctgcggcgagctgagcgtcaactgctgcgccgccgacttctcggagcagcga aggcgactcgagagaagacgccgccaagtggaacccgggccccgcggccctgggatgggg cagcagccactgcagccaggaagccctgggcggggcgctgggcgccaacgagcgtcacgg caacctccatgcggcgccctcaccagcctacaggcggcaccgcagcagcctccaggcagc gcccacaccagcctacaggggtcgccgctcgcactgcacctgcctccgccgccccggggt gtgaactgcgctgtctgtcggcctggctacgctgacccgggcagcccaggcccgcagcag ccggacgaggagcccagggccactgcccggggttacgagaaggagcaggacggtgcccca gaaaaatgcaagagctcagagctagggcccccgtgccaggaaaggctaggagcagaagat ggagagatggagatggagaagcggcaggtgggaaggagcggcgctccaccggtggggtca gcatgcgctggcggggcttag >gi568815582f:14611689_14750812|GENSCAN_predicted_peptide_5|237_aa MVKLSIVLTPRFLSHDQGQLTKELQQHVKSVTCPCEYLRKVINTLADHRHRGTDFGGSPW LLIITVFLRSYKFAISLCTSYLCVSFLKTIFPSQNGHDGSTDVQQRARRSNRRRQEGIKI VLEDIFTLWRQVETKVRAKICKMKVTTKVNRHDKINGKRKTAKEHLRKLSMKEREHGEKE RQVSEAEENGKLDMKEIHTYISPLLQESLFATGSEWRQRSIVILQDCPTGPTSQLKL >gi568815582f:14611689_14750812|GENSCAN_predicted_CDS_5|714_bp atggtgaagctctctattgtcctgaccccacggttcctgtcccatgaccagggccagctc accaaggagctgcagcagcacgtaaagtcagtgacatgcccatgcgagtacctgaggaag gttatcaatactctggctgaccatcgtcatcgtgggactgactttggtggaagtccttgg ttacttatcattactgtgtttctgagaagttataaatttgccatctccctctgcacaagt tacctttgtgtgtctttcctgaagactatcttcccgtctcaaaatggacatgatggatcc acggatgtacagcagagagccaggaggtccaaccgccgtagacaggaaggaattaaaatt gtcctggaagacatctttactttatggagacaggtggaaaccaaagttcgagctaaaatc tgtaagatgaaggtgacaacaaaagtcaaccgtcatgacaaaatcaatggaaagaggaag accgccaaagaacatctgaggaaactaagcatgaaagaacgtgagcacggagaaaaggag aggcaggtgtcagaggcagaggaaaacgggaaattggatatgaaagaaatacacacctac atatcaccccttctgcaagaaagcctctttgcaaccgggtcagaatggcggcagcggagc atcgtcattcttcaggattgccctactggccctacctcacagctgaaactttaa >gi568815582f:14611689_14750812|GENSCAN_predicted_peptide_6|119_aa MGPLPVCLPIMLLLLLPSLLLLLLLPGPGSGEASRILRVHRRGILELAGTVGCVGPRTPI AYMKYGCFCGLGGHGQPRDAIDWCCHGHDCCYTRAEEAGCSPKTERYSWQCVNQSVLCG >gi568815582f:14611689_14750812|GENSCAN_predicted_CDS_6|360_bp atggggccgctacctgtgtgcctgccaatcatgctgctcctgctactgccgtcgctgctg ctgctgctgcttctacctggccccgggtccggcgaggcctccaggatattacgtgtgcac cggcgtgggatcctggaactggcaggaactgtgggttgtgttggtccccgaacccccatc gcctatatgaaatatggttgcttttgtggcttgggaggccatggccagccccgcgatgcc attgactggtgctgccatggccacgactgttgttacactcgagctgaggaggccggctgc agccccaagacagagcgctactcctggcagtgcgtcaatcagagcgtcctgtgcggctag >gi568815582f:14611689_14750812|GENSCAN_predicted_peptide_7|186_aa MATCGELSVNCCAADFSEQRRRLERRRRQVEPGPRGPGMGQQPLQPGSPGRGAGRQRASR QPPCGALTSLQAAPQQPPGSAHTSLQGSPLALHLPPPPRGVNCAVCRPGYADPGSPGPQQ PDEEPRATARGYEKEQDGAPEKCKSSELGPPCQERLGAEDGEMEMEKRQVGRSGAPPVGS ACAGGA >gi568815582f:14611689_14750812|GENSCAN_predicted_CDS_7|561_bp atggcgacctgcggcgagctgagcgtcaactgctgcgccgccgacttctcggagcagcga aggcgactcgagagaagacgccgccaagtggaacccgggccccgcggccctgggatgggg cagcagccactgcagccaggaagccctgggcggggcgctgggcgccaacgagcgtcacgg caacctccatgcggcgccctcaccagcctacaggcggcaccgcagcagcctccaggcagc gcccacaccagcctacaggggtcgccgctcgcactgcacctgcctccgccgccccggggt gtgaactgcgctgtctgtcggcctggctacgctgacccgggcagcccaggcccgcagcag ccggacgaggagcccagggccactgcccggggttacgagaaggagcaggacggtgcccca gaaaaatgcaagagctcagagctagggcccccgtgccaggaaaggctaggagcagaagat ggagagatggagatggagaagcggcaggtgggaaggagcggcgctccaccggtggggtca gcatgcgctggcggggcttag >gi568815582f:14611689_14750812|GENSCAN_predicted_peptide_8|40_aa MVKLSIVLTPRFLSHDQGQLTKELQQHVKSVTCPCEYLRK >gi568815582f:14611689_14750812|GENSCAN_predicted_CDS_8|120_bp atggtgaagctctctattgtcctgaccccacggttcctgtcccatgaccagggccagctc accaaggagctgcagcagcacgtaaagtcagtgacatgcccatgcgagtacctgaggaag