GENSCAN 1.0 Date run: 6-Nov-116 Time: 23:21:23 Sequence gi568815582f:14544347_14767824 : 223478 bp : 45.46% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.21 PlyA - 2090 2085 6 1.05 1.20 Term - 4297 4290 8 2 2 114 46 0 0.234 -3.57 1.19 Intr - 7749 7675 75 1 0 90 111 38 0.624 5.79 1.18 Intr - 9805 9719 87 2 0 66 111 20 0.471 2.04 1.17 Intr - 11363 11308 56 1 2 109 93 28 0.219 4.02 1.16 Intr - 36597 36528 70 1 1 111 108 -26 0.128 -0.06 1.15 Intr - 37945 37835 111 2 0 73 73 180 0.999 15.35 1.14 Intr - 40076 40001 76 2 1 111 91 78 0.983 9.49 1.13 Intr - 49032 48955 78 0 0 50 115 64 0.409 5.05 1.12 Intr - 55614 55558 57 0 0 43 123 25 0.224 0.58 1.11 Intr - 59880 59800 81 0 0 52 91 83 0.360 4.83 1.10 Intr - 62180 62138 43 1 1 122 86 16 0.256 3.04 1.09 Intr - 66463 66298 166 2 1 55 78 115 0.548 6.22 1.08 Intr - 73304 73244 61 2 1 92 95 22 0.133 1.61 1.07 Intr - 81450 81418 33 0 0 117 61 28 0.258 1.42 1.06 Intr - 82841 82760 82 1 1 109 111 -10 0.378 3.04 1.05 Intr - 83905 83845 61 2 1 47 54 40 0.308 -5.71 1.04 Intr - 85373 85251 123 0 0 104 68 154 0.480 15.56 1.03 Intr - 85580 85454 127 2 1 23 12 91 0.246 -4.75 1.02 Intr - 85989 85810 180 0 0 63 39 105 0.176 3.16 1.01 Init - 86193 86146 48 0 0 84 58 14 0.413 -1.04 1.00 Prom - 91072 91033 40 -2.66 2.00 Prom + 95901 95940 40 -5.56 2.01 Init + 100001 100263 263 1 2 68 95 65 0.350 1.74 2.02 Intr + 104042 104246 205 2 1 114 92 59 0.997 8.10 2.03 Intr + 105458 105627 170 1 2 101 100 130 0.984 14.24 2.04 Intr + 110720 110864 145 2 1 93 61 129 0.998 10.98 2.05 Intr + 117546 117719 174 2 0 112 82 191 0.998 21.04 2.06 Intr + 120523 120725 203 0 2 74 100 99 0.907 7.78 2.07 Term + 123289 123481 193 1 1 116 42 52 0.469 0.29 2.08 PlyA + 124516 124521 6 1.05 3.05 PlyA - 124588 124583 6 1.05 3.04 Term - 128403 128261 143 1 2 107 42 131 0.961 8.49 3.03 Intr - 143925 143819 107 2 2 93 101 162 0.981 17.86 3.02 Intr - 146299 146149 151 2 1 99 105 103 0.905 12.32 3.01 Init - 149883 149787 97 0 1 51 96 243 0.569 19.87 3.00 Prom - 152289 152250 40 -7.16 4.02 PlyA - 154241 154236 6 1.05 4.01 Sngl - 159211 158651 561 1 0 84 40 351 0.273 26.04 4.00 Prom - 161574 161535 40 -4.16 5.00 Prom + 162087 162126 40 -5.46 5.01 Init + 164598 164717 120 2 0 39 113 143 0.936 11.19 5.02 Intr + 171722 171850 129 1 0 82 56 48 0.765 1.89 5.03 Intr + 175481 175580 100 1 1 95 86 37 0.977 3.88 5.04 Intr + 175744 175888 145 2 1 49 61 140 0.918 6.74 5.05 Intr + 178846 178953 108 1 0 100 84 84 0.401 8.60 5.06 Intr + 180142 180177 36 1 0 88 114 7 0.200 0.68 5.07 Term + 181537 181612 76 1 1 88 42 45 0.169 -2.79 5.08 PlyA + 181953 181958 6 1.05 6.05 PlyA - 182380 182375 6 1.05 6.04 Term - 182744 182740 5 0 2 65 40 0 0.346 -10.03 6.03 Intr - 183042 182936 107 2 2 93 101 162 0.961 17.86 6.02 Intr - 185416 185266 151 2 1 99 105 103 0.905 12.32 6.01 Init - 189001 188905 97 1 1 51 96 243 0.569 19.87 6.00 Prom - 191407 191368 40 -7.16 7.02 PlyA - 193359 193354 6 1.05 7.01 Sngl - 198333 197773 561 0 0 84 40 351 0.273 26.04 7.00 Prom - 200696 200657 40 -4.16 8.00 Prom + 201209 201248 40 -5.46 8.01 Init + 203720 203839 120 1 0 39 113 143 0.936 11.19 8.02 Intr + 210846 210974 129 2 0 82 56 48 0.765 1.89 8.03 Intr + 214601 214700 100 1 1 95 86 37 0.976 3.88 8.04 Intr + 214864 215008 145 2 1 49 61 140 0.915 6.74 8.05 Intr + 217966 218073 108 1 0 100 84 84 0.412 8.60 8.06 Intr + 219262 219297 36 1 0 88 114 7 0.208 0.68 8.07 Term + 220657 220732 76 1 1 88 42 45 0.169 -2.79 8.08 PlyA + 221073 221078 6 1.05 9.03 PlyA - 221500 221495 6 1.05 9.02 Term - 221864 221860 5 0 2 65 40 0 0.350 -10.03 9.01 Intr - 222162 222056 107 2 2 93 101 162 0.925 17.86 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815582f:14544347_14767824|GENSCAN_predicted_peptide_1|540_aa MGQGARLPVHHCISTPEGKEPSLGPAPAAAQALTQAQQARAVSGSRGRRHRCGCLSYCRS RRGIRRVEPLRRARAREPCTYSSNPGTIRGLVLLRAGHPPGLVPPDPCTPRGLDSPRPGI QATGSEPLFGFSADFKSNLHKVYQAIEEADFFAIDGEFSGISDGPSVSALTNGFDTPEER YITKSFNFYVFPKPFNRSSPDVKFVCQINGNAVMKASRSSSIDFLASQGFDFNKVFRNGI PYLNQEEERQLREQYDEKRSQANGAGALSYVSPNTSKCPVTIPEDQKKFIDQVVYPKGIH VETLETEKKERYIVISKVDEEERKRREQQKHAKEQEELNDAVGFSRVIHAIANSGKLVIG HNMLLDVMHTVHQFYCPLPADIINNTSLAELEKRLKETPFNPPKVESAEGFPSYDTASEQ LHEAGYDAYITGLCFISMANYLGSFLSPPKIHVSARSKLIEPFFNKLFLMRVMDIPYLNL EGPDLQPKRDHVLHVTFPKEWKTSDLYQLFSAFGNIQISWIDDTSAFVSLSQPEQVKIGY >gi568815582f:14544347_14767824|GENSCAN_predicted_CDS_1|1623_bp atggggcagggagcacgtcttcctgttcatcactgtatctccacccctgagggaaaggag cccagccttggccccgcccccgccgccgcgcaggcgctgacgcaagcgcagcaggcgcgc gctgtttccggaagtcgcggccggcgtcaccgctgcggctgcctcagctactgccgcagt cgccgcggaattcggcgagtagaaccgctgaggcgggcgcgggcccgggagccttgtact tactcctccaaccccggcaccatccggggccttgtccttctccgggccgggcaccccccg gggcttgtcccccctgacccatgcacccctcgaggacttgattcccctcgccccggaatt caggccactggttctgagcctcttttcggtttctccgcagattttaagagtaatcttcac aaagtgtaccaggccatagaggaggccgacttcttcgccatcgatggggagttttcagga atcagtgatggaccttcagtctctgcattaacaaatggttttgacactccagaagagagg tatataacgaagtcatttaacttctatgttttcccgaaacccttcaatagatcctcacca gatgtcaaatttgtttgtcagattaatggtaatgcagttatgaaagcctcgaggagctcc agcattgactttctagcaagccagggatttgattttaataaagtttttcgaaatggaatt ccatatttaaatcaggaagaagaaagacagttaagagagcagtatgatgaaaaacgttca caggcgaatggtgcaggagctctgtcctatgtatctcctaacacttcaaaatgtcctgtc acgattcctgaggatcaaaagaagtttattgaccaagtggtgtatccgaaaggcattcat gttgagactttagaaactgaaaagaaggagcgatatatagttatcagcaaagtagatgaa gaagaacgcaaaagaagagagcagcagaaacatgccaaagaacaggaggagctgaatgat gctgtgggattttctagagtcattcacgccattgctaattcgggaaaacttgttattgga cacaatatgctcttggacgtcatgcacacagttcatcagttctactgccctctgcctgcg gatatcattaacaacacatcccttgcggaattggaaaagcggttaaaagagacacctttc aaccctcctaaagttgaaagtgccgaaggttttccaagttatgacacagcctctgaacaa ctccacgaggcaggctacgatgcctacatcacagggctgtgcttcatctccatggccaat tacctaggttcttttctcagccctccaaaaattcatgtgtctgccagatcaaaactcatt gaacctttttttaacaagttatttcttatgagggtcatggatatcccctatctaaacttg gaaggaccagacttgcagcctaaacgtgatcatgttctccatgtgacattccccaaagaa tggaaaaccagcgacctttaccagcttttcagtgcctttggtaacattcagatatcctgg attgatgacacatcagcatttgtttcccttagccagcccgagcaagtaaagattgggtat tga >gi568815582f:14544347_14767824|GENSCAN_predicted_peptide_2|450_aa MEEPQKSYVNTMDLERDEPLKSTGPQISVSEFSCHCCYDILVNPTTLNCGHSFCRHCLAL WWASSKKTECPECREKWEGFPKVSILLRDAIEKLFPDAIRLRFEDIQQNNDIVQSLAAFQ KYGNDQIPLAPNTGRANQQMGGGFFSGVLTALTGVAVVLLVYHWSSRESEHDLLVHKAVA KWTAEEVVLWLEQLGPWASLYRERFLSERVNGRLLLTLTEEEFSKTPYTIENSSHRRAIL MELERVKALGVKPPQNLWEYKAVNPGRSLFLLYALKSSPRLSLLYLYLFDYTDTFLPFIH TICPLQEDSSGEDIVTKLLDLKEPTWKQWREFLVKYSFLPYQLIAEFAWDWLEVHYWTSR FLIINAMLLSVLELFSFWRIWSRSELKTVPQRMWSHFWKVSTQGLFVAMFWPLIPQFVCN CLFYWALYFNPIINIDLVVKELRRLETQVL >gi568815582f:14544347_14767824|GENSCAN_predicted_CDS_2|1353_bp atggaggaacctcagaaaagctatgtgaacacaatggaccttgagagagatgaacctctc aaaagcaccggccctcagatttctgttagtgaattttcttgccactgctgctacgacatc ctggttaaccccaccaccttgaactgtgggcacagcttctgccgtcactgccttgcttta tggtgggcatcttcaaagaaaacagaatgtccagaatgcagagaaaaatgggaaggtttc cccaaagtcagtattctcctcagggatgccattgaaaagttatttcctgatgccattaga ctgagatttgaagacattcagcagaataatgacatagtccaaagtcttgcagcctttcag aaatatgggaatgatcagattcctttagctcctaacacaggccgagcgaatcagcagatg ggagggggattcttttccggtgtgctcacagctttaactggagtggcagtggtcctgctc gtctatcactggagcagcagggaatctgaacacgacctcctggtccacaaggctgtggcc aaatggacggcggaagaagttgtcctctggctggagcagctgggcccttgggcatctctt tacagggaaaggtttttatctgaacgagtaaatggaaggttgcttttaactttgacagag gaagaattttccaagacgccctataccatagaaaacagcagccacaggagagccatcctc atggagctagaacgtgtcaaagcattaggcgtgaagcccccccagaatctctgggaatat aaggctgtgaacccaggcaggtccctgttcctgctatacgccctcaagagctcccccagg ctgagtctgctctacctgtacctgtttgactacaccgacaccttcctacctttcatccac accatctgccctctgcaagaagacagctctggggaggacatcgtcaccaagcttctggat cttaaggagcctacgtggaagcagtggagagagttcctggtcaaatactccttccttcca taccagctgattgctgagtttgcttgggactggttggaggtccattactggacatcacgg tttctcatcatcaatgctatgttactctcagttctggaattattctccttttggagaatc tggtcgagaagtgaactgaagaccgtgcctcagaggatgtggagccatttctggaaagta tcaacgcaggggctttttgtggccatgttctggcccctcatccctcagtttgtttgcaac tgtttgttttactgggccctgtactttaacccaattattaacattgatcttgtggtcaag gaactccggcggctggaaacccaggtgttgtga >gi568815582f:14544347_14767824|GENSCAN_predicted_peptide_3|165_aa MGPLPVCLPIMLLLLLPSLLLLLLLPGPGSGEASRILRVHRRGILELAGTVGCVGPRTPI AYMKYGCFCGLGGHGQPRDAIDWCCHGHDCCYTRAEEAGCSPKTERYSWQCVNQSVLCGP AENKCQELLCKCDQEIANCLAQTEYNLKYLFYPQFLCEPDSPKCD >gi568815582f:14544347_14767824|GENSCAN_predicted_CDS_3|498_bp atggggccgctacctgtgtgcctgccaatcatgctgctcctgctactgccgtcgctgctg ctgctgctgcttctacctggccccgggtccggcgaggcctccaggatattacgtgtgcac cggcgtgggatcctggaactggcaggaactgtgggttgtgttggtccccgaacccccatc gcctatatgaaatatggttgcttttgtggcttgggaggccatggccagccccgcgatgcc attgactggtgctgccatggccacgactgttgttacactcgagctgaggaggccggctgc agccccaagacagagcgctactcctggcagtgcgtcaatcagagcgtcctgtgcggaccg gcagagaacaaatgccaagaactgttgtgcaagtgtgaccaggagattgctaactgctta gcccaaactgagtacaacttaaagtacctcttctacccccagttcctatgtgagccggac tcgcccaagtgtgactga >gi568815582f:14544347_14767824|GENSCAN_predicted_peptide_4|186_aa MATCGELSVNCCAADFSEQRRRLERRRRQVEPGPRGPGMGQQPLQPGSPGRGAGRQRASR QPPCGALTSLQAAPQQPPGSAHTSLQGSPLALHLPPPPRGVNCAVCRPGYADPGSPGPQQ PDEEPRATARGYEKEQDGAPEKCKSSELGPPCQERLGAEDGEMEMEKRQVGRSGAPPVGS ACAGGA >gi568815582f:14544347_14767824|GENSCAN_predicted_CDS_4|561_bp atggcgacctgcggcgagctgagcgtcaactgctgcgccgccgacttctcggagcagcga aggcgactcgagagaagacgccgccaagtggaacccgggccccgcggccctgggatgggg cagcagccactgcagccaggaagccctgggcggggcgctgggcgccaacgagcgtcacgg caacctccatgcggcgccctcaccagcctacaggcggcaccgcagcagcctccaggcagc gcccacaccagcctacaggggtcgccgctcgcactgcacctgcctccgccgccccggggt gtgaactgcgctgtctgtcggcctggctacgctgacccgggcagcccaggcccgcagcag ccggacgaggagcccagggccactgcccggggttacgagaaggagcaggacggtgcccca gaaaaatgcaagagctcagagctagggcccccgtgccaggaaaggctaggagcagaagat ggagagatggagatggagaagcggcaggtgggaaggagcggcgctccaccggtggggtca gcatgcgctggcggggcttag >gi568815582f:14544347_14767824|GENSCAN_predicted_peptide_5|237_aa MVKLSIVLTPRFLSHDQGQLTKELQQHVKSVTCPCEYLRKVINTLADHRHRGTDFGGSPW LLIITVFLRSYKFAISLCTSYLCVSFLKTIFPSQNGHDGSTDVQQRARRSNRRRQEGIKI VLEDIFTLWRQVETKVRAKICKMKVTTKVNRHDKINGKRKTAKEHLRKLSMKEREHGEKE RQVSEAEENGKLDMKEIHTYISPLLQESLFATGSEWRQRSIVILQDCPTGPTSQLKL >gi568815582f:14544347_14767824|GENSCAN_predicted_CDS_5|714_bp atggtgaagctctctattgtcctgaccccacggttcctgtcccatgaccagggccagctc accaaggagctgcagcagcacgtaaagtcagtgacatgcccatgcgagtacctgaggaag gttatcaatactctggctgaccatcgtcatcgtgggactgactttggtggaagtccttgg ttacttatcattactgtgtttctgagaagttataaatttgccatctccctctgcacaagt tacctttgtgtgtctttcctgaagactatcttcccgtctcaaaatggacatgatggatcc acggatgtacagcagagagccaggaggtccaaccgccgtagacaggaaggaattaaaatt gtcctggaagacatctttactttatggagacaggtggaaaccaaagttcgagctaaaatc tgtaagatgaaggtgacaacaaaagtcaaccgtcatgacaaaatcaatggaaagaggaag accgccaaagaacatctgaggaaactaagcatgaaagaacgtgagcacggagaaaaggag aggcaggtgtcagaggcagaggaaaacgggaaattggatatgaaagaaatacacacctac atatcaccccttctgcaagaaagcctctttgcaaccgggtcagaatggcggcagcggagc atcgtcattcttcaggattgccctactggccctacctcacagctgaaactttaa >gi568815582f:14544347_14767824|GENSCAN_predicted_peptide_6|119_aa MGPLPVCLPIMLLLLLPSLLLLLLLPGPGSGEASRILRVHRRGILELAGTVGCVGPRTPI AYMKYGCFCGLGGHGQPRDAIDWCCHGHDCCYTRAEEAGCSPKTERYSWQCVNQSVLCG >gi568815582f:14544347_14767824|GENSCAN_predicted_CDS_6|360_bp atggggccgctacctgtgtgcctgccaatcatgctgctcctgctactgccgtcgctgctg ctgctgctgcttctacctggccccgggtccggcgaggcctccaggatattacgtgtgcac cggcgtgggatcctggaactggcaggaactgtgggttgtgttggtccccgaacccccatc gcctatatgaaatatggttgcttttgtggcttgggaggccatggccagccccgcgatgcc attgactggtgctgccatggccacgactgttgttacactcgagctgaggaggccggctgc agccccaagacagagcgctactcctggcagtgcgtcaatcagagcgtcctgtgcggctag >gi568815582f:14544347_14767824|GENSCAN_predicted_peptide_7|186_aa MATCGELSVNCCAADFSEQRRRLERRRRQVEPGPRGPGMGQQPLQPGSPGRGAGRQRASR QPPCGALTSLQAAPQQPPGSAHTSLQGSPLALHLPPPPRGVNCAVCRPGYADPGSPGPQQ PDEEPRATARGYEKEQDGAPEKCKSSELGPPCQERLGAEDGEMEMEKRQVGRSGAPPVGS ACAGGA >gi568815582f:14544347_14767824|GENSCAN_predicted_CDS_7|561_bp atggcgacctgcggcgagctgagcgtcaactgctgcgccgccgacttctcggagcagcga aggcgactcgagagaagacgccgccaagtggaacccgggccccgcggccctgggatgggg cagcagccactgcagccaggaagccctgggcggggcgctgggcgccaacgagcgtcacgg caacctccatgcggcgccctcaccagcctacaggcggcaccgcagcagcctccaggcagc gcccacaccagcctacaggggtcgccgctcgcactgcacctgcctccgccgccccggggt gtgaactgcgctgtctgtcggcctggctacgctgacccgggcagcccaggcccgcagcag ccggacgaggagcccagggccactgcccggggttacgagaaggagcaggacggtgcccca gaaaaatgcaagagctcagagctagggcccccgtgccaggaaaggctaggagcagaagat ggagagatggagatggagaagcggcaggtgggaaggagcggcgctccaccggtggggtca gcatgcgctggcggggcttag >gi568815582f:14544347_14767824|GENSCAN_predicted_peptide_8|237_aa MVKLSIVLTPRFLSHDQGQLTKELQQHVKSVTCPCEYLRKVINTLADHRHRGTDFGGSPW LLIITVFLRSYKFAISLCTSYLCVSFLKTIFPSQNGHDGSTDVQQRARRSNRRRQEGIKI VLEDIFTLWRQVETKVRAKICKMKVTTKVNRHDKINGKRKTAKEHLRKLSMKEREHGEKE RQVSEAEENGKLDMKEIHTYISPLLQESLFATGSEWRQRSIVILQDCPTGPTSQLKL >gi568815582f:14544347_14767824|GENSCAN_predicted_CDS_8|714_bp atggtgaagctctctattgtcctgaccccacggttcctgtcccatgaccagggccagctc accaaggagctgcagcagcacgtaaagtcagtgacatgcccatgcgagtacctgaggaag gttatcaatactctggctgaccatcgtcatcgtgggactgactttggtggaagtccttgg ttacttatcattactgtgtttctgagaagttataaatttgccatctccctctgcacaagt tacctttgtgtgtctttcctgaagactatcttcccgtctcaaaatggacatgatggatcc acggatgtacagcagagagccaggaggtccaaccgccgtagacaggaaggaattaaaatt gtcctggaagacatctttactttatggagacaggtggaaaccaaagttcgagctaaaatc tgtaagatgaaggtgacaacaaaagtcaaccgtcatgacaaaatcaatggaaagaggaag accgccaaagaacatctgaggaaactaagcatgaaagaacgtgagcacggagaaaaggag aggcaggtgtcagaggcagaggaaaacgggaaattggatatgaaagaaatacacacctac atatcaccccttctgcaagaaagcctctttgcaaccgggtcagaatggcggcagcggagc atcgtcattcttcaggattgccctactggccctacctcacagctgaaactttaa >gi568815582f:14544347_14767824|GENSCAN_predicted_peptide_9|37_aa XCCHGHDCCYTRAEEAGCSPKTERYSWQCVNQSVLCG >gi568815582f:14544347_14767824|GENSCAN_predicted_CDS_9|114_bp nngtgctgccatggccacgactgttgttacactcgagctgaggaggccggctgcagcccc aagacagagcgctactcctggcagtgcgtcaatcagagcgtcctgtgcggctag