GENSCAN 1.0	Date run: 5-Nov-116	Time: 14:44:13

Sequence gi568815597r:39797375_40001888 : 204514 bp : 46.84% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  20004  20050   47  2  2   64   85    53 0.135   1.53
 1.02 Intr +  22263  22369  107  0  2   38   50   115 0.166   2.56
 1.03 Intr +  28189  28329  141  2  0   83   94    42 0.471   4.62
 1.04 Term +  40503  40600   98  1  2   94   42    53 0.053  -0.57
 1.05 PlyA +  40688  40693    6                               1.05

 2.13 PlyA -  41105  41100    6                               1.05
 2.12 Term -  44539  44370  170  2  2   51   48    97 0.878   0.04
 2.11 Intr -  46844  46727  118  2  1  111   80   111 0.999  12.64
 2.10 Intr -  47266  47157  110  2  2   99   91    32 0.989   4.60
 2.09 Intr -  49923  49846   78  1  0   76   92    20 0.612   0.72
 2.08 Intr -  50286  50174  113  2  2   78   70    70 0.920   4.22
 2.07 Intr -  50717  50612  106  0  1   21   66    89 0.713  -0.73
 2.06 Intr -  52887  52745  143  2  2   56   92    70 0.925   4.20
 2.05 Intr -  55502  55357  146  2  2   21  110   204 0.224  14.98
 2.04 Intr -  56687  56596   92  0  2    9  115    43 0.274  -1.19
 2.03 Intr -  60043  59872  172  1  1  130   60   155 0.459  16.42
 2.02 Intr -  86038  85944   95  2  2   76   80   190 0.052  16.68
 2.01 Init -  96702  96537  166  0  1   75  119    66 0.373   8.23
 2.00 Prom -  98534  98495   40                              -4.96

 3.04 PlyA -  99850  99845    6                               1.05
 3.03 Term - 100596  99998  599  1  2   82   36   679 0.997  56.79
 3.02 Intr - 104069 103565  505  2  1  134   90   594 0.995  56.75
 3.01 Init - 104514 104434   81  0  0   42   78   112 0.919   4.58
 3.00 Prom - 105546 105507   40                              -5.66

 4.05 PlyA - 106745 106740    6                               1.05
 4.04 Term - 110515 110208  308  2  2   45   37   165 0.148   2.38
 4.03 Intr - 139526 139446   81  0  0  104   71   118 0.214  11.31
 4.02 Intr - 143578 143428  151  1  1   79  -23   214 0.249   9.14
 4.01 Init - 152371 152306   66  1  0   71   77    23 0.147   0.60
 4.00 Prom - 154245 154206   40                              -5.66

 5.00 Prom + 154693 154732   40                              -5.96
 5.01 Init + 157919 158011   93  1  0   86  100   113 0.541  10.87
 5.02 Intr + 159713 159847  135  1  0  112  109    75 0.976  12.76
 5.03 Intr + 161327 161451  125  1  2   94  105   106 0.923  12.28
 5.04 Intr + 163289 163388  100  2  1   35   64    66 0.295  -0.99
 5.05 Intr + 164943 164969   27  2  0   89   61    54 0.338   1.11
 5.06 Term + 165335 166030  696  1  0  -69   46   965 0.316  70.15
 5.07 PlyA + 166070 166075    6                              -0.45

 6.00 Prom + 166832 166871   40                              -8.36
 6.01 Init + 167248 167273   26  0  2   81   57    -9 0.350  -5.54
 6.02 Intr + 167837 167960  124  2  1  127  113   184 0.993  25.39
 6.03 Intr + 168097 168175   79  0  1   64   78    77 0.969   3.42
 6.04 Intr + 168483 168640  158  1  2   99   89    41 0.993   5.03
 6.05 Intr + 169227 169317   91  2  1   70   94    86 0.997   6.97
 6.06 Intr + 169437 169558  122  1  2   97  101   192 0.999  21.61
 6.07 Intr + 169712 169795   84  1  0   62  101    63 0.970   5.02
 6.08 Intr + 170254 170337   84  0  0  113   70    61 0.986   6.82
 6.09 Intr + 170430 170542  113  2  2   84   82   132 0.999  11.38
 6.10 Intr + 170960 171103  144  2  0   74   64   176 0.787  13.20
 6.11 Intr + 171195 171371  177  0  0   73  105   278 0.999  27.03
 6.12 Term + 172131 172194   64  0  1  131   49    50 0.993   2.76
 6.13 PlyA + 176511 176516    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597r:39797375_40001888|GENSCAN_predicted_peptide_1|130_aa
MLVVTKSLVKLGERPWCQGAALASRCTTHFAAKPCTRPSSSFPRFRCQSREMNRSHIDSS
EEYPCVTSGTGSLTPARAVQGSPQAKGPSRFSAAELLLARTSDDPASSMQQMRTMLQRME
KQHDTRNLCP

>gi568815597r:39797375_40001888|GENSCAN_predicted_CDS_1|393_bp
atgctggttgtcaccaagtcactggtgaaactgggagagcgcccctggtgccagggcgcg
gctctggcctcccgctgcacaacccactttgccgccaagccctgtacccggcccagcagc
tccttcccgcgcttccgctgccagtcccgggaaatgaatagaagccacatagactcttcc
gaggagtacccatgtgtcacttctgggacaggcagcctcactcctgccagggccgtgcag
ggcagtccacaggccaaaggtccaagtagattcagcgctgctgagttgcttctggctagg
accagtgatgacccagcttcaagcatgcagcagatgaggacaatgctccagaggatggag
aagcaacatgatacgaggaatctgtgtccctga

>gi568815597r:39797375_40001888|GENSCAN_predicted_peptide_2|502_aa
MAMSIMVKEASTNSSLQMGLGLVLPTRIPMEPQYGQKAPLNSSHLQNDPLVTLQPVILGA
TGTGKSTLALQLGQRLGGEIVSADSMQVYEGLDIITNKVSAQEQRICRHHMISFVDPLVT
NYTVVDFRNRATALISLGKAAAGFDIFARDKIPIVVGGTNYYIESLLWKVLVNTKPQEMG
TEKVIDRKVELEKEDGLVLHKRLSQVDPEMAAKLHPHDKRKVARSLQVFEETGISHSEFL
HRQHTEEGGGPLGGPLKFSNPCILWLHADQADERLDKRVDDMLAAGLLEELRDFHRRYNQ
KNVSENSQDYQHGIFQSIGFKEFHEYLITEGKCTLETSNQLLKKGIEALKQVTKRYARKQ
NRWVKNRFLSRPGPIVPPVYGLEVSDVSKWEESVLEPALEIVQSFIQGHKPTATPIKMPY
NEAENKRSYHLCDLCDRIIIGDREWAAHIKSKSHLNQLKKRRRLDSDAVNTIESQSVSPD
HNKEPKEKGSPGQNDQELKCSV

>gi568815597r:39797375_40001888|GENSCAN_predicted_CDS_2|1509_bp
atggccatgagcatcatggtgaaagaagcctccactaactcctccttacagatgggactt
gggttggtattgcccacccgaattcctatggagcctcagtacggtcaaaaagctcctctt
aattcctcacatttacaaaatgaccctcttgtgaccctccaacctgtgattctcggggcc
acgggcaccggcaaatccacgctggcgttgcagctaggccagcggctcggcggtgagatc
gtcagcgctgactccatgcaggtctatgaaggcctagacatcatcaccaacaaggtttct
gcccaagagcagagaatctgccggcaccacatgatcagctttgtggatcctcttgtgacc
aattacacagtggtggacttcagaaatagagcaactgctctgatatccttaggaaaggca
gcagctgggtttgatatatttgcccgagacaaaattcctattgttgtgggaggaaccaat
tattacattgaatctctgctctggaaagttcttgtcaataccaagccccaggagatgggc
actgagaaagtgattgaccgaaaagtggagcttgaaaaggaggatggtcttgtacttcac
aaacgcctaagccaggtggacccagaaatggctgccaagctgcatccacatgacaaacgc
aaagtggccaggagcttgcaagtttttgaagaaacaggaatctctcatagtgaatttctc
catcgtcaacatacggaagaaggtggtggtccccttggaggtcctctgaagttctctaac
ccttgcatcctttggcttcatgctgaccaggcagatgagcgcttggataagagggtggat
gacatgcttgctgctgggctcttggaggaactaagagattttcacagacgctataatcag
aagaatgtttcggaaaatagccaggactatcaacatggtatcttccaatcaattggcttc
aaggaatttcacgagtacctgatcactgagggaaaatgcacactggagactagtaaccag
cttctaaagaaaggtattgaggctctgaaacaagtaactaagagatatgcccggaaacaa
aaccgatgggttaaaaaccgttttttgagcagacctggtcccattgtcccccctgtctat
ggcttagaggtatctgatgtctcgaagtgggaagagtctgttcttgaacctgctcttgaa
atcgtgcaaagtttcatccagggccacaagcctacagccactccaataaagatgccatac
aatgaagctgagaacaagagaagttatcacctgtgtgacctctgtgatcgaatcatcatt
ggggatcgcgaatgggcagcgcacataaaatccaaatcccacttgaaccaactgaagaaa
agaagaagattggactcagatgctgtcaacaccatagaaagtcagagtgtttccccagac
cataacaaagaacctaaagagaagggatccccagggcagaatgatcaagagctgaaatgc
agcgtttaa

>gi568815597r:39797375_40001888|GENSCAN_predicted_peptide_3|394_aa
MCVCAGCRAAPSRRGAGPLQVAGGWSEGADMDYDSYQHYFYDYDCGEDFYRSTAPSEDIW
KKFELVPSPPTSPPWGLGPGAGDPAPGIGPPEPWPGGCTGDEAESRGHSKGWGRNYASII
RRDCMWSGFSARERLERAVSDRLAPGAPRGNPPKASAAPDCTPSLEAGNPAPAAPCPLGE
PKTQACSGSESPSDSENEEIDVVTVEKRQSLGIRKPVTITVRADPLDPCMKHFHISIHQQ
QHNYAARFPPESCSQEEASERGPQEEVLERDAAGEKEDEEDEEIVSPPPVESEAAQSCHP
KPVSSDTEDVTKRKNHNFLERKRRNDLRSRFLALRDQVPTLASCSKAPKVVILSKALEYL
QALVGAEKRMATEKRQLRCRQQQLQKRIAYLTGY

>gi568815597r:39797375_40001888|GENSCAN_predicted_CDS_3|1185_bp
atgtgcgtgtgtgctggctgccgggctgccccgagccggcggggagccggtccgctccag
gtggcgggcggctggagcgagggagcggacatggactacgactcgtaccagcactatttc
tacgactatgactgcggggaggatttctaccgctccacggcgcccagcgaggacatctgg
aagaaattcgagctggtgccatcgccccccacgtcgccgccctggggcttgggtcccggc
gcaggggacccggcccccgggattggtcccccggagccgtggcccggagggtgcaccgga
gacgaagcggaatcccggggccactcgaaaggctggggcaggaactacgcctccatcata
cgccgtgactgcatgtggagcggcttctcggcccgggaacggctggagagagctgtgagc
gaccggctcgctcctggcgcgccccgggggaacccgcccaaggcgtccgccgccccggac
tgcactcccagcctcgaagccggcaacccggcgcccgccgccccctgtccgctgggcgaa
cccaagacccaggcctgctccgggtccgagagcccaagcgactcggagaatgaagaaatt
gatgttgtgacagtagagaagaggcagtctctgggtattcggaagccggtcaccatcacg
gtgcgagcagaccccctggatccctgcatgaagcatttccacatctccatccatcagcaa
cagcacaactatgctgcccgttttcctccagaaagctgctcccaagaagaggcttcagag
aggggtccccaagaagaggttctggagagagatgctgcaggggaaaaggaagatgaggag
gatgaagagattgtgagtcccccacctgtagaaagtgaggctgcccagtcctgccacccc
aaacctgtcagttctgatactgaggatgtgaccaagaggaagaatcacaacttcctggag
cgcaagaggcggaatgacctgcgttcgcgattcttggcgctgagggaccaggtgcccacc
ctggccagctgctccaaggcccccaaagtagtgatcctaagcaaggccttggaatacttg
caagccctggtgggggctgagaagaggatggctacagagaaaagacagctccgatgccgg
cagcagcagttgcagaaaagaattgcatacctcactggctactaa

>gi568815597r:39797375_40001888|GENSCAN_predicted_peptide_4|201_aa
MPITQVMLSAVVKGASSTLAPHTFPRSGSSPEDKDSSTFSERLTAKKTLRMTECEKLRNA
SLGTDSYAIRGQREMVKVKVVVDFLKLRKDSPGHQEKAVASQTINQRVYQRKFQKHKVSK
LLLPKYLFSKQLLEDMLHQSEEVKRERGRNKLQKTGDATQERSRGNFQDDGKGKSQDDNC
EAGQEINKFRLEQEDKGLQPR

>gi568815597r:39797375_40001888|GENSCAN_predicted_CDS_4|606_bp
atgcccattacccaagtcatgctaagtgctgtggttaaaggagcctcttctacactggca
ccccatactttcccacggtcaggatcctccccagaagacaaagacagctctactttttca
gagagactgactgccaagaagacgttaagaatgacagagtgtgagaagctgcgcaacgcc
agccttggcacagacagctacgccatccgaggccaaagagagatggtcaaagtgaaagtg
gtagtagacttcctcaagctgcgtaaggactcccctggccaccaggaaaaagcggtagcc
agccaaaccatcaaccaaagggtataccaaagaaaatttcagaaacacaaagtctcaaaa
cttttacttcccaagtaccttttctcaaagcagctgctggaggatatgctccaccaaagt
gaggaagtaaaacgagaaagaggaagaaacaagctccagaaaacaggggatgcgacacag
gagagaagcagagggaatttccaagatgatggtaaagggaagtcacaggatgacaattgt
gaagcaggccaagagatcaataagttcagactagaacaggaagacaaaggactccaacca
agatga

>gi568815597r:39797375_40001888|GENSCAN_predicted_peptide_5|391_aa
MAKGEGAESGSAAGLLPTSILQSTERPAQVKKEPKKKKQQLSVCNKLCYALGGAPYQVTG
CALGFFLQIYLLDVAQVGPFSASIILFVGRAWDAITDPLVGLCISKSPWTCLGRLMPCSS
CSAAYEQRMECLTKADEELLFPEETESREVEAFGMDATFSAEDKEWMPVTKLGRLVKDMK
IKSLQEIYLFSLPIKGSEIIDFFLGASLKDEVLKIMPVQKQTRAGQRTRFKAFVAIGDYN
GHVGLGVKCSKEVATAIRGAIILAKLSIVPVRRGYWGNKISKPHTVPCKVTGRCGSVLVR
LIPAPRSTGIVSTPVPKKLLMMAGIDDCYTSARGCTATLGSFAKGTFDAISKTYSYLTPD
LWKETVFTKSPDQEFTDHLIKTHQAPAVATT

>gi568815597r:39797375_40001888|GENSCAN_predicted_CDS_5|1176_bp
atggccaaaggagaaggcgccgagagcggctccgcggcggggctgctacccaccagcatc
ctccaaagcactgaacgcccggcccaggtgaagaaagaaccgaaaaagaagaaacaacag
ttgtctgtttgcaacaagctttgctatgcacttgggggagccccctaccaggtgacgggc
tgtgccctgggtttcttccttcagatctacctattggatgtggctcaggtgggccctttc
tctgcctccatcatcctgtttgtgggccgagcctgggatgccatcacagaccccctggtg
ggcctctgcatcagcaaatccccctggacctgcctgggtcgccttatgccctgcagcagc
tgctcagctgcttatgagcagaggatggagtgtttgacaaaggctgatgaggagctcttg
ttccctgaggagacagagtcaagagaggtggaggcctttggcatggatgccaccttctca
gccgaggataaggagtggatgcctgtcaccaagctgggccgcttggtgaaggacatgaag
atcaagtccctgcaggagatctatctcttctctctgcccattaagggatctgagatcatt
gactttttcctgggggcctctctcaaggatgaggttttgaagattatgccagtgcagaag
cagacccgtgccggccagcgcaccaggttcaaggcgtttgttgctatcggagactacaat
ggccatgtcggtctgggtgttaagtgctccaaggaggtggccaccgccatccgtggggcc
atcatcctggccaagctctccattgtccccgtgcgcagaggctactgggggaacaagatc
agcaagccccacaccgtcccttgcaaggtgacaggccgctgcggctctgtgctggtgcgt
ctcatccctgcacccaggagcactggcatcgtctccacacctgtgcccaagaagctgctc
atgatggctggtatcgatgactgctacacctcagcccggggctgcactgccaccctgggc
agctttgccaagggcacctttgatgccatctctaagacctacagctacctgacccccgac
ctctggaaggagactgtattcaccaagtctcccgatcaggaattcactgaccacctcatc
aagacccaccaggctccagctgtggctacaacatag

>gi568815597r:39797375_40001888|GENSCAN_predicted_peptide_6|421_aa
MPASLCVNGIIFSTPLAVIAYFLIWFVPDFPHGQTYWYLLFYCLFETMVTCFHVPYSALT
MFISTEQTERDSATAYRMTVEVLGTVLGTAIQGQIVGQADTPCFQDLNSSTVASQSANHT
HGTTSHRETQKAYLLAAGVIVCIYIICAVILILGVREQREPYEAQQSEPIAYFRGLRLVM
SHGPYIKLITGFLFTSLAFMLVEGNFVLFCTYTLGFRNEFQNLLLAIMLSATLTIPIWQW
FLTRFGKKTAVYVGISSAVPFLILVALMESNLIITYAVAVAAGISVAAAFLLPWSMLPDV
IDDFHLKQPHFHGTEPIFFSFYVFFTKFASGVSLGISTLSLDFAGYQTRGCSQPERVKFT
LNMLVTMAPIVLILLGLLLFKMYPIDEERRRQNKKALQALRDEASSSGCSETDSTELASI
L

>gi568815597r:39797375_40001888|GENSCAN_predicted_CDS_6|1266_bp
atgccagcttccctgtgtgtgaatgggatcatcttctccacgcccctggccgtcattgcc
tacttcctcatctggttcgtgcccgacttcccacacggccagacctattggtacctgctt
ttctattgcctctttgaaacaatggtcacgtgtttccatgttccctactcggctctcacc
atgttcatcagcaccgagcagactgagcgggattctgccaccgcctatcggatgactgtg
gaagtgctgggcacagtgctgggcacggcgatccagggacaaatcgtgggccaagcagac
acgccttgtttccaggacctcaatagctctacagtagcttcacaaagtgccaaccataca
catggcaccacctcacacagggaaacgcaaaaggcatacctgctggcagcgggggtcatt
gtctgtatctatataatctgtgctgtcatcctgatcctgggcgtgcgggagcagagagaa
ccctatgaagcccagcagtctgagccaatcgcctacttccggggcctacggctggtcatg
agccacggcccatacatcaaacttattactggcttcctcttcacctccttggctttcatg
ctggtggaggggaactttgtcttgttttgcacctacaccttgggcttccgcaatgaattc
cagaatctactcctggccatcatgctctcggccactttaaccattcccatctggcagtgg
ttcttgacccggtttggcaagaagacagctgtatatgttgggatctcatcagcagtgcca
tttctcatcttggtggccctcatggagagtaacctcatcattacatatgcggtagctgtg
gcagctggcatcagtgtggcagctgccttcttactaccctggtccatgctgcctgatgtc
attgacgacttccatctgaagcagccccacttccatggaaccgagcccatcttcttctcc
ttctatgtcttcttcaccaagtttgcctctggagtgtcactgggcatttctaccctcagt
ctggactttgcagggtaccagacccgtggctgctcgcagccggaacgtgtcaagtttaca
ctgaacatgctcgtgaccatggctcccatagttctcatcctgctgggcctgctgctcttc
aaaatgtaccccattgatgaggagaggcggcggcagaataagaaggccctgcaggcactg
agggacgaggccagcagctctggctgctcagaaacagactccacagagctggctagcatc
ctctag