GENSCAN 1.0    Date run: 7-Nov-116    Time: 23:24:40

Sequence gi568815592r:166829591_167056182 : 226592 bp : 43.96% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   1615   1892  278  2  2  150   59    49 0.489   5.44
 1.02 Term +   7344   7454  111  2  0   97   53    79 0.731   3.86
 1.03 PlyA +   8323   8328    6                               1.05

 2.00 Prom +   9980  10019   40                              -5.66
 2.01 Init +  12770  12932  163  1  1   90   89    62 0.590   6.39
 2.02 Term +  13105  13466  362  2  2   51   53   201 0.247   7.60
 2.03 PlyA +  14420  14425    6                               1.05

 3.00 Prom +  15536  15575   40                              -4.56
 3.01 Init +  16458  16619  162  2  0   45   36   125 0.208   2.73
 3.02 Intr +  21630  21806  177  2  0   20   55   121 0.459   2.12
 3.03 Intr +  23046  23168  123  2  0   84   67   130 0.989  11.28
 3.04 Term +  25744  25929  186  0  0   20   48   538 0.931  40.49
 3.05 PlyA +  26491  26496    6                               1.05

 4.06 PlyA -  27476  27471    6                               1.05
 4.05 Term -  29469  29338  132  0  0   69   45   152 0.192   7.09
 4.04 Intr -  31145  30988  158  0  2   41   75    48 0.101  -1.47
 4.03 Intr -  32870  32518  353  1  2   85  105   171 0.170  13.47
 4.02 Intr -  33103  33020   84  0  0  104   45    38 0.199   0.04
 4.01 Init -  39023  38962   62  2  2   67   64    30 0.544  -0.78
 4.00 Prom -  39428  39389   40                              -3.86

 5.00 Prom +  50841  50880   40                              -1.96
 5.01 Init +  54836  54941  106  1  1   78   -3   113 0.244   1.58
 5.02 Intr +  58488  58788  301  1  1  -17   72   227 0.478   6.39
 5.03 Intr +  61839  62042  204  0  0   41   35   133 0.360   1.62
 5.04 Intr +  65213  65397  185  2  2   42   23   141 0.332   2.43
 5.05 Intr +  67163  67437  275  0  2   68   61   197 0.675  12.26
 5.06 Intr +  71961  72183  223  2  1  -38  111   158 0.830   3.20
 5.07 Intr +  74025  74194  170  1  2  111   29    88 0.320   4.87
 5.08 Intr +  74588  74791  204  1  0   74   20   123 0.336   3.50
 5.09 Intr +  79388  79468   81  1  0  133   23    35 0.130   1.33
 5.10 Intr +  93135  93264  130  2  1  100   81    53 0.634   6.07
 5.11 Term +  95991  96151  161  1  2   56   43   116 0.672   2.00
 5.12 PlyA +  97910  97915    6                               1.05

 6.00 Prom +  97940  97979   40                              -3.06
 6.01 Sngl +  99509  99685  177  1  0   79   41   127 0.234   2.05
 6.02 PlyA +  99903  99908    6                               1.05

 7.09 PlyA -  99934  99929    6                              -0.45
 7.08 Term - 100201  99998  204  1  0   73   44   185 0.985   9.97
 7.07 Intr - 103309 103140  170  2  2   58   80    84 0.612   4.27
 7.06 Intr - 103679 103585   95  1  2   32   64    93 0.427   0.91
 7.05 Intr - 109418 109305  114  1  0   61   94   152 0.906  12.66
 7.04 Intr - 113499 113429   71  0  2   58  119    47 0.133   2.78
 7.03 Intr - 116757 116713   45  0  0   88   62    50 0.071   1.01
 7.02 Intr - 121122 121020  103  2  1   74   58    26 0.065  -1.62
 7.01 Init - 126592 126507   86  1  2  101  101   246 0.721  25.49
 7.00 Prom - 153729 153690   40                              -1.86

 8.00 Prom + 167921 167960   40                              -5.16
 8.01 Init + 169823 169924  102  1  0   99  100   281 0.426  30.64
 8.02 Intr + 173603 173657   55  1  1  108  107     8 0.521   3.25
 8.03 Intr + 185862 185926   65  1  2   69   78    48 0.164   0.34
 8.04 Intr + 192819 193045  227  2  2  111   59   174 0.797  13.58
 8.05 Intr + 195192 195304  113  0  2   66   89    38 0.707   1.72
 8.06 Intr + 196957 197025   69  2  0  127   59     3 0.347   0.45
 8.07 Intr + 203013 203052   40  1  1   46   82    66 0.272  -0.92
 8.08 Intr + 204285 204381   97  0  1   27   86   127 0.309   6.41
 8.09 Intr + 216331 216463  133  0  1   71  131    36 0.201   6.42
 8.10 Intr + 217118 217199   82  0  1   79   90    40 0.415   2.00
 8.11 Intr + 220631 220725   95  2  2   56   74    50 0.069   0.01
 8.12 Term + 224650 224684   35  2  2  116   36    13 0.063  -3.35
 8.13 PlyA + 225764 225769    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 130038 130159  122  1  2   58   50   106 0.846   2.44

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592r:166829591_167056182|GENSCAN_predicted_peptide_1|129_aa
XVGPAHLTKLAQRQVPQRLLHASRTGGLTRDPFSDPYPPCHLSSAQVHSGSVAMMTMELL
YFNLNLPFLQPLCVFLHQHDTFPPAPARQKASACAPSSVPLGHLQLSFSWDLGPGPAQRD
FQYPLHPQR
>gi568815592r:166829591_167056182|GENSCAN_predicted_CDS_1|390_bp
ngtgttggtcctgcgcacctcactaaacttgctcagaggcaggttcctcagaggctgctc
cacgcatccaggaccggaggcctcaccagggatcccttctcagatccttaccctccctgc
catctctcttctgctcaggtccacagtgggtccgtggccatgatgactatggagcttctc
tacttcaacttgaatttgcccttccttcaaccactctgtgtatttctacaccagcatgac
accttcccccctgcccctgccagacagaaagcttcagcttgtgccccgtcttcagtcccg
ctcgggcacctgcagctgagcttctcttgggacctgggacctggcccagcacagcgggat
ttccagtacccgctgcaccctcagcgctga
>gi568815592r:166829591_167056182|GENSCAN_predicted_peptide_2|174_aa
MEVETGERIPPAKEHQRSPAKHRGQGEVGTNSPKESPAAASPVDAVITDFQLQNAPCFLA
QYPELLILKIQTGKMEIKMVDRRQDWLAAPAGTDKAACGETHIVNFCSKNYHSNIPGKPR
ESTDPLKELYRCCRLHETLKNRESACLLSGEGKFSAQVTGCLEIDSVLLGAQWE
>gi568815592r:166829591_167056182|GENSCAN_predicted_CDS_2|525_bp
atggaggtggagactggagagagaatcccaccagccaaggaacaccagaggtcaccagca
aagcacagaggacagggagaggttgggacaaattctcccaaagagtccccagcagcagcc
agccctgtggatgctgtgatcacagacttccagctccagaacgctccgtgcttcttagcc
caatacccagagttacttatcttaaaaatacagacagggaaaatggagattaagatggtg
gataggaggcaggactggcttgcagctcctgctggaacagacaaagcagcatgtggagag
acccacatcgtgaacttttgttccaagaactaccacagcaacataccaggaaagcctaga
gaatccacggaccctttgaaggaactttatcgctgctgcaggctccatgagacgctgaaa
aaccgtgagtctgcctgccttctcagtggggagggcaagttctcagcccaggtcaccggc
tgcctagaaatagactcggtgctgttgggggcacagtgggagtga
>gi568815592r:166829591_167056182|GENSCAN_predicted_peptide_3|215_aa
MQKIFQGYYEHIYEHKLENLEEMDKFLKIYNPPRLNQEDIETLNRPITSSKIEVAGGEPR
HQSSGAQTWEALQTPKLDLIAFSNLLLRTLDTEKTELSLGLLLHDLATTSTQKGTRRGVQ
AATARQLRELGIGGGHADTLRSSSLSAHRPLDVPTKKTKKTKMKMKKKEKKKKEKEEEEE
EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE
>gi568815592r:166829591_167056182|GENSCAN_predicted_CDS_3|648_bp
atgcaaaagatttttcaaggctactatgaacacatttatgagcataaactagaaaatcta
gaggagatggataaattcctcaaaatatacaaccctcctagattaaaccaggaagatata
gaaactctgaacagaccaataacaagcagcaagattgaagtggcaggaggagagccacgc
caccagtcctcaggggctcagacctgggaggccttacagacacccaaactggatctcatc
gccttctccaacctgctgctccgcacactagacacagagaaaacggagctttctttgggt
cttttgcttcatgatttagcaacgacctcaacgcagaaggggactcgtcgaggggtccaa
gcagcaacggctcggcaactccgggagctgggcattggaggaggccacgctgacacgctc
aggagctcatccctgagcgcccacaggcccctggatgtccctacgaagaagacgaagaag
acaaagatgaagatgaagaagaaggaaaagaagaaaaaggagaaggaggaggaggaagag
gaggaggaagaagaagaggaagaagaggaagaagaagaggaagaagaagaggaagaggaa
gaggaagaagaagaagaggaagaagaagaggaagaagaagaagaataa
>gi568815592r:166829591_167056182|GENSCAN_predicted_peptide_4|262_aa
MAIFAHMNSSGIYDAKIFQVSPWLLCSFYGSGILRLNFHPRLEPIKVKYPSGKRQQLPRA
LAWQGGDGARAAILQPPGSLLRPAQAPRSRPPLGAGSVLSIAATRRPLRPLASDIPSGRP
PVPVRASVYFLSYYQYSTSRKGKFKNANRTVAGTMEKDRGGAYGNRVLPIAQLVIVSTLA
CAEAVRCSALTNEGLVQREDSALGSERMGERGPGALQFQLVLMAGWCSSVELSRVCCMAA
GADGWLVFFRGALPCLLHGSRC
>gi568815592r:166829591_167056182|GENSCAN_predicted_CDS_4|789_bp
atggccatctttgctcacatgaattccagtggtatatatgatgccaagatctttcaggtg
agcccttggttgctatgcagtttctatggcagcggcatcctcagactcaactttcaccct
cgtttagagccaattaaggtaaaatacccgagcgggaagcggcagcagctccccagagcc
ctggcttggcagggaggggacggcgcccgagcagccatattgcagccccccgggtccctc
ctccggccggcgcaggccccgcgctcacgacctcctctgggagccggatctgtcctctcc
atcgccgccacccgccggcctctgcgtcccttggcttccgacatcccgtctggccgtccc
cctgtgccggtccgagcctctgtttatttcctttcctactatcaatactcgaccagcaga
aaaggaaagtttaaaaatgccaatcgcacagttgctggaactatggaaaaagatcgaggt
ggagcctatggaaatagagtgcttcctatagcccagctggtcatcgtcagcacattggct
tgtgctgaagctgtaaggtgctcagccctcacgaatgaagggcttgtccagagggaggat
tccgccctgggctcagaaaggatgggagagagggggccaggggctttacaattccagctg
gtgctgatggctggctggtgttcttccgtggagctctcccgtgtctgctgcatggcagcc
ggtgctgatggctggctggtgttcttccgtggagctctcccgtgtctgctgcatggcagc
cggtgctga
>gi568815592r:166829591_167056182|GENSCAN_predicted_peptide_5|679_aa
MGILNFYLVNQKFRKPSTCDWHLKWGEADRAFELWARLIKKKRDAIKNDKEDIIANPIEI
QTTIREYYKHLYANKLENLEEMDKFRDTYTLPRLNQEEVESLNRPITGSEIEAIINSLPT
KKSPGPDGFTAEFYQRAITLTTWPKIPFLGIRETKNPRSENTRLATILEAAHRHFGSSQP
PPWELCEQGPPGNILVTMKGPPKWQVDFGGSGKGKAGKIECLTGLASSAVYKHTLKKIIQ
VEISCLLVHHYVKGITRRPTAPGDEDNEEKIEHNCQQVIPQTCAARGDLLEVPLTDPNLN
LYTDGSSLVEKGLRKVGYAVVSDNRILESNPLTPGTSAQLAELIALTRALELGEGKRGRE
RRENTEGGKCQVNAQGGKRLVNAQRRKPGERTSSRTPSERTRGECCANTHGKHNRASTQW
GERHANAQQEERFALAAQQCPSAKGSRMLFSSSLPATAMGSRIPAGTPPHPRNVNTSGIR
IGRCDPGTGGPPQPGSSHFTACSSLLGPGNMQTLSSKTAQDPTFAFLNSSQGTPMLRAEA
HAGGTPASGQLWVNFRGVRFQWNEELTVEDGRGLAGHVSTEGEVKYFQNTYYAQVAVFST
GAARAAHISGCSPRHMVIKTEEQMVTVLTYATLSASTLYGPPLCHPAGIFYVTQQELEEK
RPWSLGVEAQSHPGLQKNS
>gi568815592r:166829591_167056182|GENSCAN_predicted_CDS_5|2040_bp
atgggaatcctcaatttctacctggtcaatcagaagttcaggaagccctcaacctgtgac
tggcatctgaagtggggggaagcggatcgagcctttgagctgtgggcaagactaataaag
aagaaaagagatgcaataaaaaatgataaagaggatatcatcgccaatcccatagaaata
caaactaccatcagagaatactacaaacacctctatgcaaataaactagaaaatctagaa
gaaatggataaattccgcgacacatacaccctcccaagactaaaccaggaagaagttgaa
tccctgaatagaccaataacaggctctgaaattgaggcaataattaatagcctaccaacc
aaaaaaagtccaggaccagatggattcacagccgaattctaccagagagctataacactc
accacatggcccaagattccattccttggaatccgtgagaccaagaaccccaggtcagag
aacacgaggcttgcaaccatcttggaagcagcccaccgccattttggaagcagccagcca
ccaccttgggagctctgtgagcaaggaccccctggtaacattttggtgaccatgaaggga
cctccaaagtggcaagtggactttggaggctctggaaaaggaaaagctgggaaaatcgaa
tgcctaacagggcttgcttccagtgcggtctacaagcacacgttaaaaaagatcatccaa
gtagaaataagctgcctcctcgtccaccattatgtcaagggaatcactagaaggcccact
gccccaggggatgaagacaatgaagaaaagatagaacataactgtcaacaagtaattcct
caaacctgtgctgctcgaggggaccttttagaggttcccttgactgatcccaacctcaac
ttgtatactgatggaagttcccttgtagaaaaaggacttcgaaaagtgggctatgcggtg
gtcagtgataacagaatacttgaaagtaatcccctcactccaggaactagtgctcagctg
gcagaactaatagccctcactcgggcactagaattgggagaaggaaaaagggggagagaa
cgccgggagaacacagagggaggaaaatgccaggtgaacgcacaagggggaaaacgcctg
gtgaacgcacaaaggagaaagccaggtgaacgcacaagcagcagaacgccaagtgaacgc
acgaggggagaatgctgcgccaacacacacgggaaacacaaccgcgcgagcacgcagtgg
ggagaacggcacgcgaacgctcagcaagaagaacgctttgccttggcagcgcagcagtgc
ccctctgcaaaggggtctcgcatgctgttcagcagctctcttcccgccacagcgatgggc
agccggatcccagctggaactccgccacatcccaggaatgtgaataccagtggaatccgg
ataggacggtgtgacccaggcacaggtgggccaccccaacccggctcctcacacttcacg
gcatgcagcagcctcctgggccctgggaacatgcagactctgagcagcaagactgcgcag
gacccgacatttgcatttctaaacagctcccaggggacgccaatgctgcgggctgaggcc
cacgctggaggaacaccggccagtgggcagctttgggtgaacttcagaggtgttaggttc
cagtggaatgaggaactgacagtggaggatggcagaggcctggcagggcatgtcagcact
gagggggaggttaaatattttcaaaacacctactatgctcaagtagctgtgttcagtact
ggggcagcacgagcagcccacatctcagggtgcagtccaagacatatggtcatcaagaca
gaagagcagatggtgacagtcctcacctatgccacattgtccgcatctacactgtacggc
cctcccctgtgccatccagctggcatcttctatgtaacacaacaagagctggaagagaag
aggccctggtccctgggcgtggaggcgcagagccacccaggactgcaaaagaactcttaa
>gi568815592r:166829591_167056182|GENSCAN_predicted_peptide_6|58_aa
MSWGSGHMEQTQSELETPPRPTRHQRGLLPCVELFVLTGSLRQEGNRLRGENSSKRVP
>gi568815592r:166829591_167056182|GENSCAN_predicted_CDS_6|177_bp
atgtcctggggctctggtcacatggaacagacccagtctgagctagagacaccccccagg
cccaccaggcatcagcgtgggctcctgccatgcgtggagctcttcgtcttgacagggtcc
ctgcgccaagaaggcaacaggctccgaggggaaaactcgagcaagagagtcccctga
>gi568815592r:166829591_167056182|GENSCAN_predicted_peptide_7|295_aa
MRPAALRGALLGCLCLALLCLGGADKRLRASVSRPFSLEHPEPGHCLDLHLRAYPARCVK
LTLAPGLLGQMTDVGIWSDLLPEMRAYWPDVIHSFPNRSRFWKHEWEKHGTCAAQVDALN
SQKKYFGRSLELYRELDLNSCSRDRCVIHKSRDIYDVDLYGKCLQTLALEKGSRKMIACR
ALTVLTWTHVRIDEGHVPAMFAQSSVFRELITGVAKATGATHLLSCFQDEEVQTIGQIEL
CLTKQDQQLQNCTEPGEQPSPKQEVWLANGAAESRGLRVCEDGPVFYPPPKKTKH
>gi568815592r:166829591_167056182|GENSCAN_predicted_CDS_7|888_bp
atgcgccctgcagccctgcgcggggccctgctgggctgcctctgcctggcgttgctttgc
ctgggcggtgcggacaagcgcctgcgtgcgtcagtgtccaggcccttctcattggagcac
ccagagccaggacactgcctggatctgcacctcagggcgtaccctgctcggtgtgtcaag
ctgaccctggccccaggacttctgggtcagatgaccgacgtgggaatctggtcggatctt
ttgccagaaatgagggcatactggcctgacgtaattcactcgtttcccaatcgcagccgc
ttctggaagcatgagtgggaaaagcatgggacctgcgccgcccaggtggatgcgctcaac
tcccagaagaagtactttggcagaagcctggaactctacagggagctggacctcaacagc
tgcagtagggaccgatgtgtcatccacaaatcccgggatatttacgacgtggacctttac
ggaaaatgtttgcagactcttgccttagagaaaggaagccgtaagatgatcgcttgcagg
gccctcaccgtcctcacctggactcatgtgcgaatagatgagggacatgtgcctgccatg
tttgcccagagctcggtgttcagggaactgattacaggggtggcaaaagccacaggggcc
acacatttgctgagctgcttccaggatgaggaagtacagacaattggtcagatagaactg
tgcctcactaagcaagaccagcagctgcaaaactgcaccgagccgggggagcagccgtcc
cccaagcaggaagtctggctggcaaatggggccgccgagagccggggtctgagagtctgt
gaagatggcccagtcttctatcccccacctaaaaagaccaagcattga
>gi568815592r:166829591_167056182|GENSCAN_predicted_peptide_8|370_aa
MAATAAAVVAEEDTELRDLLVQTLENSGVLNRIKNKTPLVNESLKKFLNTKDDAMLHVVV
EVVHSKTPDGAIRMKANDEANQSDTSVSLSEPKSKSSLHLLSHETKIGSFLSNRTLDGKD
KAGLCPDEDDMEGDSFFDDPIPKPEKTYGLRKEPRKQAGSLASLSDAPPLKSGLSSLAGA
PSLKDSESKRGNTVLKDLKLISDKIGSLGLGTGEDDDYVDDFNSTSHRSEKSEISIGEEI
EEDLSVEIDDINTSDKHPYCPCASPAPAPLLPQCQAFSSSSHGFIPAAGIAPPRGSALPA
GMASTVETYVTKARPSQGAPSNDHSASGLSCHTVKKVPCFSFAFCHECKFHEASPAMLNC
RYLNQIIYIK
>gi568815592r:166829591_167056182|GENSCAN_predicted_CDS_8|1113_bp
atggcggcgacggcggccgcagtggtggccgaggaggacacggagctgcgggacctgctg
gtgcagacgctggagaacagcggggtcctgaaccgcatcaagaacaaaactcctttagtt
aatgagagcctgaaaaagtttttaaataccaaagacgatgctatgttgcatgtggtggtt
gaggttgtacacagcaagactccagatggagccatccgcatgaaggccaatgatgaggcc
aatcagagtgatacaagtgtctccttgtcagaacccaagagcaaaagcagccttcactta
ctgtcccatgaaacaaaaattggatcttttctaagcaacagaactttagatggcaaagac
aaagctggcctttgtccagatgaagatgatatggaaggagattctttctttgatgatccc
attcctaagccagagaaaacttacggtttgaggaaggaacctaggaagcaagcaggaagt
ctggcctcgctctcggatgcaccccccttaaaaagtggactcagctccctggcgggagcc
ccttctttaaaagactctgagagtaaaaggggaaatacagttttgaaagatctgaaattg
atcagtgataaaattggatcacttggattaggaactggagaagatgatgactatgttgat
gattttaatagtaccagccatcgctcagagaaaagtgagataagtattggtgaagagata
gaagaagacctttctgtggaaatagatgacatcaataccagtgataagcacccctactgc
ccatgcgcctctcctgccccagcacccctactgccccagtgccaggccttttctagcagc
tctcatgggttcatccctgctgcgggcattgctccaccaaggggcagtgctctccctgcc
gggatggcctccactgtagagacctatgtcaccaaggccaggccctctcagggggctcca
tccaacgaccattctgcctcagggctgtcctgccacactgtgaagaaggtgccttgcttc
tcctttgccttctgccatgagtgtaagtttcatgaggcctccccagccatgctgaactgt
aggtatttaaatcagatcatctacataaaataa