GENSCAN 1.0 Date run: 7-Nov-116 Time: 23:48:53 Sequence gi568815585r:27335678_27540448 : 204771 bp : 44.99% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 9240 9279 40 2 1 83 67 29 0.201 0.69 1.02 Intr + 19569 19712 144 1 0 56 81 44 0.043 0.75 1.03 Intr + 29065 29098 34 2 1 97 105 6 0.162 0.48 1.04 Intr + 36478 36584 107 1 2 46 109 78 0.196 5.56 1.05 Intr + 37695 37882 188 1 2 71 46 65 0.157 0.01 1.06 Term + 47930 48121 192 1 0 31 46 217 0.710 9.22 1.07 PlyA + 48248 48253 6 1.05 2.09 PlyA - 50756 50751 6 1.05 2.08 Term - 51847 51684 164 2 2 87 50 76 0.135 1.80 2.07 Intr - 64866 64691 176 2 2 45 106 39 0.182 0.98 2.06 Intr - 69336 69129 208 1 1 37 103 86 0.399 3.14 2.05 Intr - 72720 72567 154 0 1 81 26 88 0.732 1.55 2.04 Intr - 73058 72956 103 1 1 55 62 98 0.573 4.08 2.03 Intr - 75921 75869 53 0 2 85 94 40 0.905 2.01 2.02 Intr - 77818 77684 135 1 0 75 43 51 0.490 0.06 2.01 Init - 85107 85054 54 0 0 86 89 51 0.726 6.29 2.00 Prom - 86018 85979 40 -9.26 3.00 Prom + 87324 87363 40 -5.96 3.01 Init + 88886 89007 122 1 2 68 -9 159 0.389 1.87 3.02 Intr + 89027 89261 235 2 1 18 78 242 0.493 13.89 3.03 Intr + 91415 91515 101 1 2 69 75 68 0.863 2.51 3.04 Intr + 94193 94289 97 2 1 58 92 59 0.870 3.31 3.05 Intr + 94856 94944 89 1 2 51 58 86 0.554 0.67 3.06 Term + 97054 97141 88 1 1 75 38 88 0.580 -0.37 3.07 PlyA + 97533 97538 6 -0.45 4.14 PlyA - 97814 97809 6 1.05 4.13 Term - 100216 99998 219 1 0 106 28 54 0.676 -1.76 4.12 Intr - 101596 101439 158 2 2 51 115 69 0.824 5.63 4.11 Intr - 104611 104312 300 2 0 10 82 247 0.047 13.01 4.10 Intr - 114464 114387 78 0 0 103 75 45 0.580 4.22 4.09 Intr - 114935 114832 104 1 2 66 95 75 0.681 5.82 4.08 Intr - 115593 115430 164 0 2 78 33 145 0.651 6.77 4.07 Intr - 133181 133082 100 1 1 113 89 71 0.570 9.81 4.06 Intr - 136970 136895 76 0 1 69 96 8 0.454 -1.73 4.05 Intr - 138726 138610 117 1 0 44 61 87 0.489 2.04 4.04 Intr - 139353 139208 146 2 2 76 97 45 0.537 4.13 4.03 Intr - 141478 141418 61 2 1 100 75 33 0.394 0.99 4.02 Intr - 158245 158124 122 0 2 89 92 81 0.633 8.74 4.01 Init - 158736 158657 80 0 2 73 31 42 0.221 -2.37 4.00 Prom - 159965 159926 40 -5.06 5.00 Prom + 160755 160794 40 -3.26 5.01 Init + 163520 163637 118 1 1 21 116 83 0.107 4.76 5.02 Term + 174985 175067 83 2 2 77 49 114 0.806 4.26 5.03 PlyA + 175182 175187 6 1.05 6.00 Prom + 185370 185409 40 -4.96 6.01 Init + 192583 192641 59 0 2 79 97 27 0.309 3.50 6.02 Intr + 197472 197759 288 0 0 50 73 196 0.202 10.66 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 104474 104312 163 2 1 91 82 172 0.841 16.79 S.002 Sngl + 107239 107433 195 0 0 92 37 134 0.846 3.86 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815585r:27335678_27540448|GENSCAN_predicted_peptide_1|234_aa MPGGLRKLTIVEQATVMELEAIILSEMTQKVKYCILSLTSGSYTMDTYGQAEWKNRQEMP KGPADFPFRKGLRGQLSGKSAAVSAAGTHTPGGGGGGEVGTKIIYYSTGRSLAEDYGDGL SNKTAWISILVPAPGGLTSLQFRFFTCKIGIRILLEELNKLRSMEYLEQGLTTLADLPHP YPGPIKTRDPSRQTHKPLYVEKNKSMKEDKRLDIERHQGEHAGRRALRQMLAGH >gi568815585r:27335678_27540448|GENSCAN_predicted_CDS_1|705_bp atgcctggaggcctcaggaaacttacaatcgtggagcaagcaacagtgatggaactggag gccattatcctaagtgaaatgactcagaaagttaaatactgcatattgtcacttacaagt gggagctatacaatggatacctatggacaagcagagtggaaaaatagacaggaaatgcca aaaggtcctgctgacttccccttcagaaaaggactgagagggcagctctctgggaagtcg gcagctgtgtctgcagccggcactcacacacctggcggggggggggggggggaggtggga accaagatcatctactacagcacaggaagaagtttagcagaggattatggagatggtctc agcaacaagactgcctggatttccatcctagttcctgccccaggtggcttaacctctctg cagttccgtttcttcacctgtaaaatagggataagaattctgttggaagaattaaataag ctaagaagtatggagtacttggaacaaggtctgaccaccctggccgaccttccacaccct tatcctggacctataaaaacccgagaccctagcaggcagacacacaagccgctgtacgtg gagaaaaacaaatcgatgaaagaagacaagcggctggacatcgagagacatcaaggggag catgccggcagaagagcactcagacagatgctggcaggccattga >gi568815585r:27335678_27540448|GENSCAN_predicted_peptide_2|348_aa MKVNLEGELKQNYGVPAQMFFGHLRVACCASPKAMLRACATEPSSSSLSIKGIETTGASA ILVIRKLGHRVVKRMAQSHSYFCTLNCTIPINPETRCFYVFIGYNYLQASSGNRKASPPC QSVELFKQTNHILPWKPGSTSSSCYYKASSHSPYWFIQCRRCVALHTSHPGANLIFAQQT RRRLPNLSPVPTRASEPADFLPGPAGLAHVLAPGSASLFLLSHGLTCNTHISPVRPALIG AFYKPLGSYRAVISVFYNPAYADWCILQSSCKTEKFSKSPLNPGSPAIFTSQHLASYGHT RLGQRGAASQRRQEMGISTWRSPQPSPPPQRLSPLHTGGATSYQSHKT >gi568815585r:27335678_27540448|GENSCAN_predicted_CDS_2|1047_bp atgaaggtcaacctcgaaggtgaacttaaacagaactatggtgtgcctgctcaaatgttt tttgggcatctccgtgtggcatgctgtgccagccccaaggcgatgctcagggcatgtgcc acagaaccttccagctcctcgctgtctattaaaggcattgagacaactggggccagtgcc atattagtgataaggaaactgggtcaccgagtggttaagcgaatggcccagtcacacagc tatttctgtaccctcaactgtaccattcccatcaatccagagacccggtgcttttacgtc ttcattgggtataactatctccaggcttcttctggaaatagaaaggcttcacctccttgc cagtctgtggaattattcaaacaaaccaatcacatcctcccctggaaaccagggagcacc tcatcctcctgttactacaaagcctcctctcacagcccctactggttcatccagtgccgc cgctgtgtggccctgcacacctcccaccccggcgccaacctgatctttgcccagcagacg cggaggcgcctgcccaacctcagccctgtgcccacgagggcctcagagcccgcagacttc ctgccgggacctgctggccttgcccatgttttagctcctggctctgctagtcttttcctg ctgtctcatggtctcacgtgtaacactcacatctccccagtgagaccagcactgattggt gcattttacaaacctctaggtagctacagagcagtgattagtgtgttttacaatcctgcc tatgctgattggtgcattttacaatcctcttgtaagacggaaaagttctccaagtcccca ctcaacccaggaagtccagctatcttcacctctcaacacctggcctcctatggccacaca cggctgggccagaggggagctgccagccagaggcgacaggaaatgggcattagcacctgg cgcagtcctcaaccctcgcctcctccccagcgcctcagccctttgcacactggtggtgca accagttaccaatcgcataagacttag >gi568815585r:27335678_27540448|GENSCAN_predicted_peptide_3|243_aa MRSSGADAGRCLVTARAPGSVPASREGSAGSRGPGAPVPGTAPGPGLGGAGALDPPAVVA ESVSSLTIADAFIAAGESSAPTPPRPALPRRFICSFPDCSANYSKAWKLDAHLCKHTGER PFVCDYEGCGKAFIRDYHLSRHILTHTGEKPFVCAANGCDQKFNTKSNLKKHFERKHENQ QKQYICSFEDCKKTFKKHQQLKIHQCQHTNEPLFKCTQEGCGKHFASPSKLKRHAKAHEG VYG >gi568815585r:27335678_27540448|GENSCAN_predicted_CDS_3|732_bp atgcgcagcagcggcgccgacgcggggcggtgcctggtgaccgcgcgcgctcccggaagt gtgccggcgtcgcgcgaaggttcagcagggagccgtgggccgggcgcgccggttcccggc accgcgcctggccctgggcttggaggcgccggcgccctggatccgccggccgtggtcgcc gagtcggtgtcgtccttgaccatcgccgacgcgttcattgcagccggcgagagctcagct ccgaccccgccgcgccccgcgcttcccaggaggttcatctgctccttccctgactgcagc gccaattacagcaaagcctggaagcttgacgcgcacctgtgcaagcacacgggggagaga ccatttgtttgtgactatgaagggtgtggcaaggccttcatcagggactaccatctgagc cgccacattctgactcacacaggagaaaagccgtttgtttgtgcagccaatggctgtgat caaaaattcaacacaaaatcaaacttgaagaaacattttgaacgcaaacatgaaaatcaa caaaaacaatatatatgcagttttgaagactgtaagaagacctttaagaaacatcagcag ctgaaaatccatcagtgccagcataccaatgaacctctattcaagtgtacccaggaagga tgtgggaaacactttgcatcacccagcaagctgaaacgacatgccaaggcccacgagggt gtgtacggatag >gi568815585r:27335678_27540448|GENSCAN_predicted_peptide_4|574_aa MGLSTKELGNSRAQGKPELGLGREKALAELSNQHLQMGYRCDREIDPFHLCGSQVGFGPA IAPYPVGTNTRAYGSGQGQVVVMLMELRRKRRPFPGSTLFLLCVPDQGLRCAVAATLSLD VFAGEYPRLVHDGEAVEVDEAAGKLRLLCFRSQHISVSNATVMKNSGLKPQIWKKSKWTQ FGLMTCFAPSKQCTGLGQPLSTQLKPSSLAVIIVSVIGFLCSQQQALNQTPGISESKHSR YEARLGGFTTVDSQSALEEFLIEEEKEGGHGAQTPSTRLSSRLHGSGSGTRGEAPPTTES EGDSPQIRCTCVRYSMSIACPSTGGRRRRLTNNRVGGSRKCSCGLLPGTAFSTAEDTQNE GKKTKKNKTAFSNVGRKISQRVIHLFDEKGNDLGNMHRANVIRLMDERDLRLVQRNTSTE PAEYQLMTGLQILQERQRLREMEKANPKTGPTLRKELILSSNIGQHDLDTKTKQIQQWIK KKHLVQITIKKGKNVDVSENEMEEIFHQILQTMPGIATFSSRPQAVQGGKALMCVLRAFS KNEEKAYKETQETQERDTLNKDHGNDKESNVLHQ >gi568815585r:27335678_27540448|GENSCAN_predicted_CDS_4|1725_bp atgggcttgagcacaaaagaactggggaactcgagggcccagggaaagccagagctgggc ctgggcagagagaaagctttggcagaactttccaatcagcatctgcaaatgggttacagg tgtgaccgagaaattgatcccttccacctttgcggcagccaagtaggatttgggccagcc attgccccatatccagttggcactaacaccagggcttatggctcaggacaaggtcaggtg gtggtaatgctgatggagctgaggaggaagaggaggcccttcccagggagcaccctcttc cttctctgtgtccctgaccaaggcctgaggtgtgccgtggcggccacgctctccctggat gttttcgctggggagtatccgaggctggtccatgacggtgaggcagtagaagtagatgaa gcagcaggaaagctgaggctgctctgtttcagaagccagcacatctctgtatcaaatgcc acggtcatgaaaaactcaggccttaaaccccagatttggaagaaaagcaaatggacacag tttggtttgatgacatgctttgctcccagcaagcagtgcacaggcctgggccagcctctc agcacccaattaaagccttcttctttggcagtaatcatcgtctcagtcattggctttctg tgcagccagcagcaggccctaaaccaaaccccaggcatttcagaatccaaacattccagg tatgaagccagattaggtggcttcacgactgtggactcgcagtctgcccttgaggaattt cttattgaagaagaaaaagaggggggacacggggcccagacccccagcacccggctttcg agcaggctccacgggtccgggtccgggacgcggggtgaagccccgcccactacggagagc gaaggggactcgccgcagatccgctgtacttgcgtccgctacagtatgtcaatcgcttgc cccagcacaggtgggcgtcgccgccgacttaccaacaaccgggtcgggggctcccggaag tgctcttgcggcttactgcctggcacagcctttagtaccgctgaagacacccagaatgaa ggaaaaaagacaaaaaagaataaaacagcttttagtaacgttggaagaaaaattagtcag cgagttattcacttatttgatgagaagggcaatgatttgggaaacatgcaccgagcaaat gtgattagacttatggatgagcgagacctgcgactggttcaaaggaacaccagcacagaa cctgcagagtatcagctcatgacaggattgcagatcctccaggagcggcagaggctgagg gagatggagaaggcgaaccccaaaactggaccaaccctgagaaaggaactgattttgtct tcaaatattggacaacatgatttggacacaaagactaaacagattcagcagtggattaag aaaaaacacctagtccagattaccataaagaaaggaaaaaatgtagacgtgtcagaaaat gaaatggaggagatatttcatcaaatactccagactatgcctggaatagctacattctca tctaggccacaagctgttcaaggaggaaaagctttaatgtgtgttcttcgtgctttcagc aaaaatgaggagaaggcatataaagaaactcaagagacccaggaaagagacactttgaac aaagaccatggaaatgataaggaatcaaatgttctgcatcagtaa >gi568815585r:27335678_27540448|GENSCAN_predicted_peptide_5|66_aa MPPPEKEKTTSKKSHQKVDLIPASKGYSDAAVSLSKSYKVPTYIAGAISHIPPVSFTLTT HQLASP >gi568815585r:27335678_27540448|GENSCAN_predicted_CDS_5|201_bp atgcctcccccagagaaggaaaagacgaccagcaagaaaagtcatcagaaagttgattta attcctgcatccaaaggttattctgacgctgctgtttctctttctaaatcctacaaagtg cccacctacattgctggtgccatctcccacatccccccggtgtccttcacactcaccaca caccagctcgccagtccttga >gi568815585r:27335678_27540448|GENSCAN_predicted_peptide_6|116_aa MGVAHEEALMNGLGHPLGDKYPTLPAAASLTSCTGSHRLQTGCKHRGAASEGGQSIEPAE GAETRPSELGTSRASADHWWCPSSQGSDARLLQSAEKTNRTLHCYFLPDAHESHLX >gi568815585r:27335678_27540448|GENSCAN_predicted_CDS_6|348_bp atgggggtggctcatgaggaggccctcatgaatggcttgggccatccccttggtgataaa tatccaacccttcctgctgcagcttccctgacctcctgcaccgggagccaccggctgcaa acaggctgcaaacaccggggagctgcaagcgagggcggccagtccatcgagccagcagag ggcgctgaaacgcggcccagtgaactagggacgtccagggccagcgctgaccactggtgg tgcccgagctcccagggctctgatgcccggctgctccagtccgcagagaaaactaaccgg acactccactgctactttcttcccgatgctcatgagtctcacctttgn