GENSCAN 1.0	Date run: 5-Nov-116	Time: 11:26:28

Sequence gi568815595r:196138691_196370531 : 231841 bp : 47.53% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.04 PlyA -   6343   6338    6                               1.05
 1.03 Term -   8769   8691   79  2  1  123   43    65 0.065   2.74
 1.02 Intr -  17624  17542   83  2  2   71   61    46 0.125  -1.36
 1.01 Init -  18544  18128  417  1  0   45   91   215 0.791  13.93
 1.00 Prom -  20507  20468   40                              -6.56

 2.16 PlyA -  23262  23257    6                               1.05
 2.15 Term -  24018  23965   54  0  0  128   41   -10 0.098  -4.14
 2.14 Intr -  25439  25271  169  1  1   81   59    81 0.193   4.45
 2.13 Intr -  35202  35055  148  1  1   70   56    68 0.101   1.09
 2.12 Intr -  41430  41325  106  0  1  126   65    14 0.032   2.69
 2.11 Intr -  47870  47812   59  0  2  106   66     1 0.008  -1.70
 2.10 Intr -  52215  52071  145  0  1   27   94   208 0.922  15.16
 2.09 Intr -  52662  52485  178  2  1   84    2   101 0.199   1.02
 2.08 Intr -  56944  56912   33  0  0  101   85    22 0.265   0.54
 2.07 Intr -  59761  59609  153  0  0   93   65    31 0.297   0.49
 2.06 Intr -  60184  60099   86  1  2   57   89    67 0.560   2.32
 2.05 Intr -  68813  68708  106  1  1   33  107   211 0.900  17.72
 2.04 Intr -  69870  69686  185  0  2   81   16   269 0.971  17.59
 2.03 Intr -  70825  70686  140  2  2   82  105    43 0.882   5.58
 2.02 Intr -  72041  71926  116  1  2   79  100    85 0.877   8.89
 2.01 Init -  72625  72480  146  1  2   84   86    85 0.997   7.49
 2.00 Prom -  74017  73978   40                              -8.16

 3.00 Prom +  74020  74059   40                               0.24
 3.01 Init +  75780  75821   42  2  0   36  110    -1 0.206  -2.68
 3.02 Intr +  77454  77589  136  2  1   64   77    33 0.248  -0.06
 3.03 Intr +  78268  78447  180  2  0   58   64    79 0.660   2.34
 3.04 Intr +  79115  79246  132  0  0   87  105    59 0.871   8.12
 3.05 Intr +  79542  79658  117  1  0   60  113    38 0.816   3.94
 3.06 Intr +  88275  88429  155  1  2  122   89   213 0.990  24.59
 3.07 Intr +  88848  89047  200  2  2   59   80   117 0.704   6.15
 3.08 Intr +  89425  89583  159  1  0  109   94   179 0.999  19.70
 3.09 Intr +  90119  90230  112  2  1   90   79   180 0.973  17.68
 3.10 Intr +  91225  91371  147  0  0  103   93    51 0.956   7.53
 3.11 Intr +  93729  93834  106  2  1  125  105    21 0.907   7.29
 3.12 Term +  94373  94509  137  0  2   57   45    76 0.907  -1.62
 3.13 PlyA +  94664  94669    6                               1.05

 4.07 PlyA -  95703  95698    6                               1.05
 4.06 Term - 100204  99998  207  1  0  108   35   177 0.972  11.74
 4.05 Intr - 101045 100857  189  2  0   40   84   180 0.973  12.58
 4.04 Intr - 103400 103258  143  0  2  -21   92   200 0.802   9.57
 4.03 Intr - 103950 103872   79  0  1   99  109   108 0.955  13.12
 4.02 Intr - 108821 108677  145  1  1  -15   94   317 0.527  22.28
 4.01 Init - 111586 111291  296  1  2   80   31   312 0.191  19.39
 4.00 Prom - 115297 115258   40                              -6.76

 5.05 PlyA - 115662 115657    6                               1.05
 5.04 Term - 118711 118689   23  2  2   65   44    51 0.543  -3.13
 5.03 Intr - 119255 119098  158  1  2   93   89    40 0.408   4.25
 5.02 Intr - 128675 128590   86  2  2   77   92    46 0.444   2.52
 5.01 Init - 131841 131725  117  0  0   67  103    57 0.897   5.30
 5.00 Prom - 141715 141676   40                              -3.56

 6.00 Prom + 144108 144147   40                              -3.56
 6.01 Sngl + 149229 149561  333  2  0   64   46   224 0.563  10.56
 6.02 PlyA + 150933 150938    6                               1.05

 7.03 PlyA - 151758 151753    6                               1.05
 7.02 Term - 168322 168249   74  2  2  112   48    85 0.588   4.97
 7.01 Init - 179462 179306  157  2  1   98   10   181 0.826  11.38
 7.00 Prom - 183777 183738   40                              -4.26

 8.05 PlyA - 183842 183837    6                              -0.45
 8.04 Term - 185307 185127  181  2  1   57   48   178 0.987   7.78
 8.03 Intr - 185750 185581  170  2  2  134   67    28 0.955   4.04
 8.02 Intr - 188342 188265   78  2  0  135   68    36 0.947   6.05
 8.01 Init - 200111 200082   30  2  0   84  116    48 0.224   6.93
 8.00 Prom - 207138 207099   40                              -5.36

 9.05 PlyA - 208995 208990    6                               1.05
 9.04 Term - 218156 217995  162  2  0  110   41   114 0.995   6.84
 9.03 Intr - 223233 223154   80  1  2   79   94    73 0.998   6.27
 9.02 Intr - 223997 223604  394  2  1   58  115   332 0.998  27.13
 9.01 Intr - 229465 229338  128  2  2   58   89    69 0.938   4.40

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  50567  50503   65  0  2  130   51    25 0.841   1.05
S.002 Term + 142367 142519  153  1  0  114   48    52 0.802   1.72

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595r:196138691_196370531|GENSCAN_predicted_peptide_1|192_aa
MQNALESLSNRLKQAEERNSELKDKVFELTQSNRDKEKRTRKYEQRLQGFWDYAKHPNLR
IIGVPEEEEKSNSLENIFEEIIEENVPGLARDLDIQIQEAQGTSGKFIAKRSSPRHIVIR
LSKVKTKEIILRAVRQKHQDARLTHKDPHKLKVKGWKKTFQANGHQKCWNSETNTPEYGA
LTYGTEEALRSL

>gi568815595r:196138691_196370531|GENSCAN_predicted_CDS_1|579_bp
atgcaaaatgctctggaaagtctcagcaatagactcaaacaagcagaagaaagaaattca
gagcttaaagacaaggtctttgaattaacccaatccaacagagacaaagaaaaaagaaca
agaaaatatgaacaaaggctccaaggattctgggattatgctaaacacccaaacctaaga
ataattggtgttcccgaggaagaagagaaatctaacagtttggaaaacatatttgaggaa
ataatcgaggaaaacgtgcccggccttgctagagacctagacatccaaatacaagaagct
caaggaacatctgggaaattcatcgcaaaaagatcatcgcctaggcacatagtcatcagg
ttatctaaagtcaagacgaaggaaataatcctaagagctgtgaggcaaaagcaccaggat
gctcgtctaacacataaggacccgcacaagcttaaggtaaagggctggaaaaagacattt
caggcaaatggacaccaaaagtgctggaactcagaaaccaataccccagaatatggtgct
ttgacatatgggaccgaagaagccttgaggtctctctga

>gi568815595r:196138691_196370531|GENSCAN_predicted_peptide_2|607_aa
MTLLTDATPLVKEPHPLPLVPRPWFLPSLFAAFNVVLLVFFSGLFFAFPWLAQNGEWAFP
VITGSLFVLTFFSLVSLNFSDPGILHQGSAEQGPLTVHVVWVNHGAFRLQWCPKCCFHRP
PRTYHCPWCNICVEDFDHHCKWVNNCIGHRNFRFFMLLVLSLCLYSGAMLVTCLIFLVRT
THLPFSTDKAIAYPPTIVVAVSAAGLLVPLSLLLLIQALSVSSADRTYKGKCRHLQGYNP
FDQGCASNWYLTICAPLGPKYMAEAVQLQRVVGPDWTSMPNLHPPMSPSALNPPAPTSGS
LQSREGTPGACSGRHLLVLASRAVSGCVGSTGRASGELGGQVALLPAGCVALDEAPTSQS
CSDLNVKLKPNSMPGRPHSALPGRQKKKEEEEEEEEEKEEEEKTKKKKKKKKKKGEPSWT
CIPFLRPGPEQESGGNSVHSSIRPQIFRQPRHGTFQFAKGGSTLSGLCSLVWASCWGPRV
ARIMELFVRQHLRESNLAKVKGEKVWGLRSLEGTEHRAQSTGPDAALSSGGPDPLPPDVM
LEAWTHGEVLDPCPVQERQGLNPSPVYGACSFASERGKLERDPGLWVPQEVLETLPLRAS
RTSDQAY

>gi568815595r:196138691_196370531|GENSCAN_predicted_CDS_2|1824_bp
atgacactcttaacggatgccacgccgctggtgaaggagccccatcccctgcctctggtc
ccacgtccctggttcctccctagcctctttgctgccttcaatgtggtgctgctggtcttt
ttcagtggcctcttcttcgcattcccgtggctggctcagaacggggagtgggcctttcct
gttatcacaggctccctctttgtccttaccttcttcagtcttgtttcactcaacttctca
gaccctggcatcttacatcaaggctccgctgagcagggccccttgacggtgcacgtggtg
tgggtgaaccacggggccttccgcctgcaatggtgtccaaagtgctgcttccaccgcccg
ccccggacttaccactgcccctggtgcaacatctgtgtggaggactttgaccaccactgc
aagtgggtcaataactgcatcggtcaccgcaacttccgcttcttcatgctgcttgtcctg
tccctgtgcctctactcgggcgccatgctggtcacctgtctcatcttcctggtgcgcaca
acccacctgcccttctccaccgacaaggccatcgcgtatccgcccaccatcgtggtggcc
gtgtccgccgcgggcctcctggtgccgctgtccctcctgctgctgatccaggcactgtcc
gtgagctcggccgaccgcacctacaagggcaagtgcagacaccttcagggatacaacccc
ttcgaccagggctgtgccagcaactggtatttaacaatttgtgcaccactgggacccaag
tacatggctgaagctgtccagctgcagagagtggtggggcctgactggacatccatgccg
aatctgcaccctccaatgtccccctctgctctcaaccccccagccccaacctctgggtcc
ctacaaagcagggaagggacccccggggcgtgttcaggacggcacctactggtcctcgcc
tccagggcagtaagcggctgtgtagggagcacaggccgggcatccggagagctgggtgga
caggtggctctgctgccagctggatgtgtggccctggatgaagccccgacctctcagagc
tgctccgatctcaacgtgaagttgaaaccaaactccatgcccggccgacctcacagtgcg
cttcctgggcgacagaagaagaaagaagaagaagaagaggaggaggaggaaaaagaagaa
gaagaaaagacgaaaaagaagaagaaaaagaagaaaaagaagggagaaccttcatggact
tgtattccttttcttaggccaggtccagaacaggagagtgggggcaattctgtccattct
tccatccggcctcagattttcaggcagccccgtcatggaaccttccagttcgctaaaggg
ggttccaccctgagtgggctctgcagccttgtctgggccagttgctggggcccccgtgtg
gctcgtataatggaacttttcgttcgccagcacttacgggagtcgaatctagccaaagtc
aaaggagagaaagtctggggccttagaagcctggaggggacagagcacagagcacagagc
actgggccagacgcggccctgtcatcaggagggcccgatccgctacctcctgatgtaatg
ctggaggcctggacccatggggaagtcctcgacccttgcccagttcaggagaggcagggc
ctcaacccttccccagtttatggggcctgcagctttgccagtgaaagaggaaagctggaa
agagacccagggctgtgggtcccgcaggaggtcctggaaacactccctctcagggcctct
cgcacatcagatcaagcatattaa

>gi568815595r:196138691_196370531|GENSCAN_predicted_peptide_3|540_aa
MLLILMLSLSFNEQPVLPSTHQADSTKHGRGLSPESPRCRVPKPEFQQVLQGPLTREHCA
WGEKGRRAGAVEGIRRGNGSIPQQQGARPSVRTHPSSGILLFLSGQSLGCCPTRMERIHA
PMVLSTGWCLARYTADLLEVLKTNYGIPSACFSQPPTAAQLLRDPRAIWGPKMVPKWSKP
HRVSWRAARHPPWTHVTLEGGYALGPVELALTSILTLLALGSIAIFLEDAVYLYKNTLCP
IKRRTLLWKSSAPTLATFKERCLECVGFASCVLAAKGFRSLRVSLPPSPAPAPSEQVVSV
LCCFGLWIPRSLVLVEMTITSFYAVCFYLLMLVMVEGFGGKEAVLRTLRDTPMMVHTGPC
CCCCPCCPRLLLTRKKLQLLMLGPFQYAFLKITLTLVGLFLVPDGIYDPADISEGSTALW
INTFLGVSTLLALWTLGIISRQARLHLGEQNMGAKFALFQVLLILTALQPSIFSVLANGG
QIACSPPYSSKTRSQVMNCHLLILETFLMTVLTRMYYRRKDHKVGYETFSSPDLDLNLKA

>gi568815595r:196138691_196370531|GENSCAN_predicted_CDS_3|1623_bp
atgctgcttatcttaatgctttcactgtccttcaatgagcagcctgtcctcccttccaca
catcaggctgacagcaccaaacacggccgaggcctcagccctgagtctccccgctgcaga
gtgcccaagcctgagttccagcaggttcttcaggggccgctgaccagggaacactgcgcg
tggggagagaaaggtcgcagagcgggggctgtggaaggcattcgcagaggaaatggcagc
atccctcagcagcagggggcccgtccgtctgtaaggactcacccttcctctgggatcctg
ctcttcctctctggtcagtccctgggctgctgcccgacccgcatggagagaatccacgcc
cccatggttctgagcacaggctggtgtctcgccaggtacacagcagatcttctggaggtg
ctgaagaccaattacggcatcccctccgcctgcttctctcagcctcccacagcagcccaa
ctcctgagagaccccagagccatctggggacccaagatggtgcccaagtggtccaagccc
catcgggtctcctggagggctgcacggcatccgccctggacccatgtcactctagagggt
ggttacgccctgggccctgtggaacttgccctcactagcatcctgaccttgctggcgctg
ggctccattgccatcttcctggaggatgccgtctacctgtacaagaacaccctttgcccc
atcaagaggcggactctgctctggaagagctcggcacccacgctggcaacgtttaaggaa
cgctgcctcgaatgtgtaggttttgcctcctgtgtccttgccgcaaaaggcttccggagc
ctcagggtctccctcccgccctcacctgctcctgccccttctgagcaggtggtgtctgtg
ctgtgctgctttggtctctggatccctcgttccctggtgctggtggaaatgaccatcacc
tcgttttatgccgtgtgcttttacctgctgatgctggtcatggtggaaggctttgggggg
aaggaggcagtgctgaggacgctgagggacaccccgatgatggtccacacaggcccctgc
tgctgctgctgcccctgctgtccacggctgctgctcaccaggaagaagcttcagctgctg
atgttgggccctttccaatacgccttcttgaagataacgctgaccctggtgggcctgttt
ctcgtccccgacggcatctatgacccagcagacatttctgaggggagcacagctctatgg
atcaacactttccttggcgtgtccacactgctggctctctggaccctgggcatcatttcc
cgtcaagccaggctacacctgggtgagcagaacatgggagccaaatttgctctgttccag
gttctcctcatcctgactgccctacagccctccatcttctcagtcttggccaacggtggg
cagattgcttgttcgcctccctattcctctaaaaccaggtctcaagtgatgaattgccac
ctcctcatactggagacttttctaatgactgtgctgacacgaatgtactaccgaaggaaa
gaccacaaggttgggtatgaaactttctcttctccagacctggacttgaacctcaaagcc
taa

>gi568815595r:196138691_196370531|GENSCAN_predicted_peptide_4|352_aa
MVYLVLSLGMVYLVLSLGMVYLVLCLGMVYLVLSLGMVYLVLSLGMVYLVLSLGMVYLVL
SLGMVYLVLSLGMVYLVLSLGMVYLVLSLGMVYLILSIVDELTHNFKGFTVMNENERYDA
VQHCRYVDEVVRNAPWTLTPEFLAEHRIDFVAHDDIPYSSAGSDDVYKHIKEAGMFAPTQ
RTEGISTSDIITRIVRDYDVYARRNLQRGYTAKELNVSFINEKKYHLQERVDKVKKKVKD
VEEKSKEFVQKVEEKSIDLIQKWEEKSREFIGSFLEMFGPEGALKHMLKEGKGRMLQAIS
PKQSPSSSPTRERSPSPSFRWPFSGKTSPPCSPANLSRHKAAAYDISEDEED

>gi568815595r:196138691_196370531|GENSCAN_predicted_CDS_4|1059_bp
atggtgtatctggtcctcagcctcggcatggtgtacctggtcctcagcctcggcatggtg
tacctggtcctctgcctcggcatggtgtacctggtcctcagcctcggcatggtgtacctg
gtcctcagcctcggcatggtgtacctggtcctcagcctcggcatggtgtacctggtcctc
agcctcggcatggtgtatctggtcctcagcctcggcatggtgtatctggtcctcagcctc
ggcatggtgtatctggtcctcagcctcggcatggtgtacctgatcctcagcatagttgat
gagctcacacacaacttcaaaggcttcacggtgatgaacgagaatgagcgctatgacgca
gtccagcactgccgctacgtggatgaggtggtgaggaatgcgccctggacgctgacaccc
gagttcctggccgaacaccggattgattttgtagcccatgatgatattccttattcatct
gctggcagtgatgatgtttataagcacatcaaggaggcaggcatgtttgctccaacacag
aggacagaaggtatctccacatcagacatcatcacccgaattgtgcgggattatgatgtg
tatgcgaggcggaacctgcagaggggctacacagcaaaggagctcaatgtcagctttatc
aacgagaagaaataccacttgcaggagagggttgacaaagtaaagaagaaagtgaaagat
gtggaggaaaagtcaaaagaatttgttcagaaggtggaggaaaaaagcattgacctcatt
cagaagtgggaggagaagtcccgagaattcattggaagttttctggaaatgtttggtccg
gaaggagcactgaaacatatgctgaaagaggggaagggccggatgctgcaggccatcagc
ccgaagcagagccccagcagcagccctactcgcgagcgctccccctccccctctttccga
tggcccttctccggcaagacttccccaccttgctccccagcaaatctctccaggcacaag
gctgcagcctatgatatcagtgaggatgaagaagactaa

>gi568815595r:196138691_196370531|GENSCAN_predicted_peptide_5|127_aa
MDAQCSAKVNARKRRKEAPGPNGATEEDGVPSKVQRCAVSLQPLEATNLLSDSVDLPVLD
IHTNDIIRITVYAIELNFLFLHFPFLKGLRQPAPFSDEIEVDFSKPYVRVTMEEASRGTP
SGYDAAD
>gi568815595r:196138691_196370531|GENSCAN_predicted_CDS_5|384_bp
atggatgcacagtgttcagccaaggtcaatgcaaggaagaggagaaaagaggcgcccgga
cccaacggggcaacagaagaagatggggttccttccaaagtgcagcgctgtgcagtgtct
ctccagcctctggaagccactaatctgctttctgattctgtggatttgcctgttctggac
attcatacaaatgacatcatacgtattacagtgtatgccattgagttgaactttcttttc
cttcactttccatttcttaagggcttacggcaaccagctcctttttctgatgaaattgaa
gttgactttagtaagccctatgtcagggtaactatggaagaagccagcagaggaactcct
tctggctatgatgctgccgattga

>gi568815595r:196138691_196370531|GENSCAN_predicted_peptide_6|110_aa
MQRRRLLEVGYGCEPAQVLAGPSRRAARLLPCPTRAVSAQAGLCAEVLPLHLRAYRSPHP
RTRAANVQEARAPSAETPRPRLPRRSVSFYLRGEVRGICDRQPSPLKTCG

>gi568815595r:196138691_196370531|GENSCAN_predicted_CDS_6|333_bp
atgcagcgaagacgtctccttgaagtgggatacggatgtgaaccggcccaagtgctcgcc
ggtccgtcaagacgcgctgcccggctgctcccgtgtccaactcgggctgtgtccgcccag
gcgggcctgtgcgcggaggtcctaccgctgcacctccgcgcctaccgcagcccgcacccc
cgcacccgggcagccaacgtgcaggaggcccgggcgccttcagcggagacgccccgaccg
cggctgcctcgccgcagcgttagcttttacctacgtggggaagtaaggggaatttgcgac
cgccagcccagtccgctgaaaacctgtggctga

>gi568815595r:196138691_196370531|GENSCAN_predicted_peptide_7|76_aa
MATSIGVSFSVGDGVPEAEKNAGEPENTYILRPVFQQRRVEDGLWRGDLERAEMGFDRYK
MVVQVVIGEQRGEGVL

>gi568815595r:196138691_196370531|GENSCAN_predicted_CDS_7|231_bp
atggccacgtccatcggagtgtccttctcggtgggcgacggggtgcctgaggctgagaag
aacgcaggggagcccgagaacacctatattctgcggcctgttttccagcagaggcgggta
gaggacgggctgtggcggggcgacctcgagcgcgctgaaatgggatttgaccgatacaaa
atggtggtgcaagtagtgattggagaacaaagaggtgaaggagtattgtga

>gi568815595r:196138691_196370531|GENSCAN_predicted_peptide_8|152_aa
MQHIESAKGKVLTAAILISLMGWRYGCFSKSGLCRSVLTALLSGGLALLGALICFVTSGV
ALKDGPFCMFDVSSFNQTQAWKYGYPFKDLHSRNYLYDRSLWNSVCLEPSAAVVWHVSLF
SALLCISLLQLLLVVVHVINSLLGLFCSLCEK

>gi568815595r:196138691_196370531|GENSCAN_predicted_CDS_8|459_bp
atgcaacacatcgagtcagccaagggcaaggtactcactgcagctatcctcatctccttg
atgggctggagatacggctgcttcagtaagagtgggctctgtcgaagcgtgcttactgct
ctgttgtcaggtggcctggctttacttggagccctgatttgctttgtcacttctggagtt
gctctgaaagatggtcctttttgcatgtttgatgtttcatccttcaatcagacacaagct
tggaaatatggttacccattcaaagacctgcatagtaggaattatctgtatgaccgttcg
ctctggaactccgtctgcctggagccctctgcagctgttgtctggcacgtgtccctcttc
tccgcccttctgtgcatcagcctgctccagcttctcctggtggtcgttcatgtcatcaac
agcctcctgggccttttctgcagcctctgcgagaagtga

>gi568815595r:196138691_196370531|GENSCAN_predicted_peptide_9|254_aa
XQKLVEWHQLDVSSFLDQVTGFLGEHGQLDGLSSSPPKKCARSESLIDASEDSQLEAAIR
ASLQETHFDSTQTKQDSRSDEESESELFSGSEEFISVCGSDEEEEVENLAKSRKSPHKDL
GHRKEENRRPLTEPPVRTDPGTATNHQGLPAVDSEILEMPPEKADGVVEGIDVNGPKAQL
MLRYPDGKREQITLPEQAKLLALVKHVQSKGYPNERFELLTNFPRRKLSHLDYDITLQEA
GLCPQETVFVQERN

>gi568815595r:196138691_196370531|GENSCAN_predicted_CDS_9|765_bp
ngtcagaagctagtagaatggcaccagttagatgtatcttctttcttggaccaagtgacg
ggatttctgggtgaacatggacaactggatggactttctagcagtccccccaaaaaatgt
gcccgttcagagagccttatagatgcaagtgaagacagccagctagaagctgccatcaga
gcctccttacaagaaacacattttgattcaacacagacaaaacaggatagccgctcagat
gaagaatctgaatctgaacttttttctggcagtgaggagttcatatccgtttgtggctct
gatgaagaagaagaggtagagaatcttgccaagtccagaaagtctccccacaaagatttg
gggcatagaaaagaggagaatagaaggccgctgactgagccaccagtcagaactgatcct
ggaacagccacaaaccaccaaggattgccagctgtggattcagagatactggagatgcca
cctgaaaaagcagatggagtagtggaggggatagatgtaaatggaccaaaagcacagctg
atgttgcggtatccagatggaaaaagggaacagatcactcttccagagcaagctaaactg
ctagctttggtgaagcacgtgcagtctaaaggatacccaaatgaacgttttgaacttctc
accaactttcctcgaaggaaattatctcatctggactatgatattacattgcaagaggca
ggcctttgtcctcaagagactgtctttgtacaggaaagaaattaa