GENSCAN 1.0    Date run: 5-Nov-116    Time: 18:40:37

Sequence gi568815579f:16048751_16257791 : 209041 bp : 50.73% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   3497   3627  131  0  2   15    1   159 0.264   0.31
 1.02 Term +   3784   4029  246  0  0   35   38   221 0.570   7.59
 1.03 PlyA +   5206   5211    6                               1.05

 2.00 Prom +   7072   7111   40                              -0.86
 2.01 Sngl +   8783   8995  213  1  0   98   46    96 0.636   1.64
 2.02 PlyA +  11106  11111    6                               1.05

 3.00 Prom +  15488  15527   40                              -1.76
 3.01 Init +  18875  18988  114  1  0  102   94   229 0.465  25.11
 3.02 Intr +  27297  27422  126  2  0  150   67   261 0.542  31.08
 3.03 Intr +  27585  27947  363  2  0  -16   61   364 0.432  18.68
 3.04 Intr +  33163  33296  134  0  2   73   94   278 0.990  26.34
 3.05 Intr +  37673  37790  118  2  1   51  105   252 0.988  23.67
 3.06 Intr +  39277  39347   71  0  2  122   74    90 0.941   8.98
 3.07 Intr +  40295  40370   76  2  1   58  100    81 0.992   5.82
 3.08 Intr +  44786  44848   63  1  0   89  105    96 0.949  10.31
 3.09 Intr +  44934  45003   70  2  1  104   86    84 0.999   8.55
 3.10 Term +  46514  46596   83  0  2   75   54   174 0.999  10.56
 3.11 PlyA +  46736  46741    6                               1.05

 4.00 Prom +  52584  52623   40                              -4.26
 4.01 Init +  61074  61122   49  2  1   92   58    52 0.777   1.71
 4.02 Intr +  62540  62635   96  0  0  125   40    52 0.770   4.08
 4.03 Intr +  63108  63275  168  1  0   21   65   245 0.770  15.42
 4.04 Intr +  69476  69536   61  0  1  141   95    56 0.654   9.39
 4.05 Intr +  73000  73060   61  1  1  105   54    45 0.548   1.54
 4.06 Intr +  76720  76797   78  0  0  118   78   307 0.609  32.45
 4.07 Intr +  78687  78776   90  2  0   98   86    71 0.988   8.09
 4.08 Intr +  79276  79341   66  0  0  104   82   133 0.891  13.40
 4.09 Intr +  80804  80854   51  1  0   68   97    26 0.532   0.60
 4.10 Intr +  80954  81145  192  1  0   85   54    77 0.807   3.69
 4.11 Intr +  94683  94758   76  2  1   83   48    41 0.117  -1.31
 4.12 Intr +  94882  95024  143  2  2  131   91    49 0.990   9.57
 4.13 Intr +  99995 100125  131  1  2  103   42    73 0.128   3.69
 4.14 Intr + 103802 103891   90  2  0  108   80    10 0.134   1.31
 4.15 Intr + 104293 104458  166  1  1   23   94   232 0.203  17.26
 4.16 Intr + 105649 105741   93  0  0   75   87   149 0.706  13.66
 4.17 Term + 108460 109044  585  0  0  103   44   252 0.935  16.81
 4.18 PlyA + 109168 109173    6                               1.05

 5.06 PlyA - 111814 111809    6                               1.05
 5.05 Term - 112736 112715   22  1  1   97   45    53 0.503  -0.32
 5.04 Intr - 116163 115968  196  1  1   99   64   287 0.742  25.87
 5.03 Intr - 119534 119387  148  2  1  113   74   140 0.999  14.91
 5.02 Intr - 121011 120880  132  0  0   52   91   282 0.796  25.74
 5.01 Init - 124725 124675   51  0  0   93   80    49 0.906   5.81
 5.00 Prom - 127687 127648   40                              -4.76

 6.00 Prom + 130778 130817   40                              -6.26
 6.01 Init + 131382 131444   63  2  0   45   78    64 0.759   2.25
 6.02 Intr + 136594 136766  173  0  2   23   77   202 0.877  11.34
 6.03 Intr + 136874 137015  142  2  1  108   88   223 0.950  24.66
 6.04 Intr + 141770 141823   54  1  0  138  105    26 0.990   8.48
 6.05 Term + 142137 142205   69  2  0   91   42    97 0.968   3.34
 6.06 PlyA + 143271 143276    6                               1.05

 7.00 Prom + 148346 148385   40                              -6.96
 7.01 Init + 149277 149318   42  2  0   90   79   131 0.873  12.82
 7.02 Intr + 151112 151195   84  1  0   84   72    40 0.708   2.02
 7.03 Intr + 151393 151585  193  0  1   46   26    94 0.473  -1.93
 7.04 Intr + 152118 152172   55  1  1  130   80    11 0.739   2.54
 7.05 Term + 152335 152485  151  1  1   98   42   117 0.906   5.48
 7.06 PlyA + 152542 152547    6                               1.05

 8.00 Prom + 152843 152882   40                             -12.68
 8.01 Init + 153157 153201   45  0  0  105   98    37 0.972   5.99
 8.02 Intr + 154709 154913  205  1  1   96   62   261 0.663  23.07
 8.03 Intr + 157591 157658   68  2  2   40  109    49 0.836   0.82
 8.04 Intr + 159269 159399  131  1  2   90   64   424 0.999  39.69
 8.05 Intr + 160280 160427  148  2  1   92   62   306 0.943  28.64
 8.06 Intr + 177671 177797  127  1  1   81   78   360 0.999  34.55
 8.07 Intr + 178798 178940  143  2  2   57   94   269 0.999  24.47
 8.08 Intr + 179387 179458   72  1  0   97  100   158 0.994  17.50
 8.09 Intr + 180020 180178  159  1  0   72  105   286 0.999  28.88
 8.10 Intr + 184743 184868  126  2  0  118   98   229 0.948  27.78
 8.11 Intr + 185362 185524  163  0  1   60  105   171 0.564  15.55
 8.12 Intr + 185831 185919   89  0  2   97   26    35 0.590  -2.11
 8.13 Term + 187583 187732  150  1  0   42   39   159 0.626   4.31
 8.14 PlyA + 193570 193575    6                               1.05

Suboptimal exons with probability >
0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl - 168391 167957  435  1  0   85   52   163 0.948   8.68

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815579f:16048751_16257791|GENSCAN_predicted_peptide_1|125_aa
XIATDTRLSVLVRFFLANLAFVVTCFTSTTIPKMLDVQRDPLCHLCGLLVVASWTFAFRN
ALTHPVLLTRLSLCTYEWVSHVFCNLSQLLKLACSDATLNNVTVQPLHKTQGHECHSATH
SFGTS

>gi568815579f:16048751_16257791|GENSCAN_predicted_CDS_1|378_bp
nccattgccactgacacccggctctctgtgctcgtgcgctttttccttgccaacctggcc
ttcgtggtcacttgcttcacctccaccaccatccccaagatgctggacgtgcaaagagat
cccttgtgtcatctctgtggcctcctggtggtggcctcctggaccttcgcattcaggaat
gccctgacccacccagtgttactgacccgcctctcactctgcacctacgagtgggtcagc
catgtcttctgcaacctcagccagctgctgaagttggcctgctcagacgccactctcaac
aatgtgacggtgcaaccgctacacaaaacacaaggacatgaatgccattcagccacacat
tcttttggcacttcataa

>gi568815579f:16048751_16257791|GENSCAN_predicted_peptide_2|70_aa
MEYLYTLPGRRQYPMLLMHPAQSPKSLAKTQSSSGRYKKDGDLVVKNLTNMQTEMEESTD
KSPQRVGEII

>gi568815579f:16048751_16257791|GENSCAN_predicted_CDS_2|213_bp
atggaatacctttatacccttcctggaagaaggcaatatcccatgttattgatgcatcca
gctcagagtccaaaatctctggctaaaacacagtcttcatcaggcaggtataagaaagat
ggggacttagtggtcaagaacttgaccaatatgcaaacagagatggaagaatccacagat
aaaagtccacaaagagttggagaaatcatctga

>gi568815579f:16048751_16257791|GENSCAN_predicted_peptide_3|405_aa
MEAIKKKMQMLKLDKENAIDRAEQAEADKKAAEDKCKQVEEELTHLQKKLKGTEDELDKY
SEDLKDAQEKLELTEKKASDLWPAVPTSGPPPAVLTSAVRPGDLIAPTAAGPGGGERRGR
PPRRQRLGGPGRGCAALAGAEPSRASAAARAPLRLRAMAGLNSLEAVKRKIQALQQQADE
AEDRAQGLQRELDGERERREKAEGDVAALNRRIQLVEEELDRAQERLATALQKLEEAEKA
ADESERGMKVIENRAMKDEEKMEIQEMQLKEAKHIAEEADRKYEEVARKLVILEGELERA
EERAEVSELKCGDLEEELKNVTNNLKSLEAASEKYSEKEDKYEEEIKLLSDKLKEAETRA
EFAERTVAKLEKTIDDLEDELYAQKLKYKAISEELDHALNDMTSL

>gi568815579f:16048751_16257791|GENSCAN_predicted_CDS_3|1218_bp
atggaggccatcaagaagaaaatgcagatgctgaagttggacaaggagaatgccatcgac
cgcgcggagcaggcggaggcggataagaaagccgctgaggacaagtgcaagcaggtggag
gaggagctgacgcacctccagaagaaactaaaagggacagaggacgagctggataaatat
tccgaggacctgaaggacgcgcaggagaagctggagctcacggagaagaaggcctccgac
ctgtggccagcggtgccgacgtcaggccctcccccagcggtgctgacgtcggcggtccgg
ccgggtgacctcatcgccccgacggcagccggcccggggggcggggagaggcgggggcgg
cccccgcgcaggcaaaggcttggggggccggggcgcggctgtgcagctctcgccggagcc
gagcccagccgagcgtccgccgctgcccgtgcgcctctgcgcctccgcgccatggccggc
ctcaactccctggaggcggtgaaacgcaagatccaggccctgcagcagcaggcggacgag
gcggaagaccgcgcgcagggcctgcagcgggagctggacggcgagcgcgagcggcgcgag
aaagctgaaggtgatgtggccgccctcaaccgacgcatccagctcgttgaggaggagttg
gacagggctcaggaacgactggccacggccctgcagaagctggaggaggcagaaaaagct
gcagatgagagtgagagaggaatgaaggtgatagaaaaccgggccatgaaggatgaggag
aagatggagattcaggagatgcagctcaaagaggccaagcacattgcggaagaggctgac
cgcaaatacgaggaggtagctcgtaagctggtcatcctggagggtgagctggagagggca
gaggagcgtgcggaggtgtctgaactaaaatgtggtgacctggaagaagaactcaagaat
gttactaacaatctgaaatctctggaggctgcatctgaaaagtattctgaaaaggaggac
aaatatgaagaagaaattaaacttctgtctgacaaactgaaagaggctgagacccgtgct
gaatttgcagagagaacggttgcaaaactggaaaagacaattgatgacctggaagatgag
ttatacgctcagaagctcaagtacaaagctatcagcgaggaactggaccacgctctcaac
gacatgacctctctctga

>gi568815579f:16048751_16257791|GENSCAN_predicted_peptide_4|731_aa
MVFYHVGQAGLELLTSGPHCPDSQAADKAESATLLSRARLGAYCACASLERRWPHFPSGR
ECNMAKTYDYLFKLLLIGDSGVGKTCVLFRFSEDAFNSTFISTIGIDFKIRTIELDGKRI
KLQIWDTAGQERFRTITTAYYRGAMGIMLVYDITNEKSFDNIRNWIRNIEEHASADVEKM
ILGNKCDVNDKRQVSKERGEKLALDYGIKFMETSAKANINVENAFFTLARDIKAKMDKKL
GPARPHGTLLGKGEEFAGVAGASASGSASLGTLPKQTRPCPVNPQASREHTAFSIPLKGQ
KDNMIIDARSQGACTCLRVTPVSSKQQAHRLTCLRKQELPRGSQPRPIDVQTLNRNPGSC
RHWHSYSEGLGHPRVSQEAMTEAGKLPLPLPPRLDWFVHTQMGQLAQDGVPEWFHGAISR
EDAENLLESQPLGSFLIRVSHSHVGYTLSYKAQSSCCHFMVKLLDDGTFMIPGEKVAHTS
LDALVTFHQQKPIEPRRELLTQPCRQKDPANVDYEDLFLYSNAVAEEAACPVSAPEEASP
KPVLCHQSKERKPSAEMNRITTKEATSSCPPKSPLGETRQKLWRSLKMLPERGQRVRQQL
KSHLATVNLSSLLDVRRSTVISGPGTGKGSQDHSGDPTSGDRGYTDPCVATSLKSPSQPQ
APKDRKVPTRKAERSVSCIEVTPGDRSWHQMVVRALSSQESKPEHQGLAEPENDQLPEEY
QQPPPFAPGYC

>gi568815579f:16048751_16257791|GENSCAN_predicted_CDS_4|2196_bp
atggtgttttaccatgttggccaggctggtctggaactcctgacctcaggcccccactgc
cccgactcgcaggccgcagacaaggctgagagcgctaccttgctttcccgcgctcgcctc
ggtgcctactgcgcctgcgcgagcttggagaggcgctggccgcacttcccgtcggggaga
gagtgtaatatggcgaagacctacgattacctgttcaagctgctgctgatcggggactcg
ggggtggggaagacctgtgtcctgttccgcttctccgaggacgccttcaactccactttt
atctccaccataggaattgactttaaaattaggaccatagagctcgatggcaagagaatt
aaactgcagatatgggacacagccggtcaggaacggtttcggacgatcacaacggcctac
tacaggggtgcaatgggcatcatgctggtctacgacatcaccaacgagaagtccttcgac
aacatccggaactggattcgcaacattgaggagcacgcctctgcagacgtcgaaaagatg
atactcgggaacaagtgtgatgtgaatgacaagagacaagtttccaaggaacggggagaa
aagctggccctcgactatggaatcaagttcatggagaccagcgcgaaggccaacatcaat
gtggaaaatgcatttttcactctcgccagagatatcaaagcaaaaatggacaaaaaattg
ggcccagccagaccccatgggacattgctgggaaagggggaagagtttgcaggtgtggct
ggggcatcagcatcaggcagtgccagcctggggacgctgcccaagcagacaaggccgtgc
ccggtgaatccccaggctagcagggagcacacggccttttccattccactcaagggccag
aaagacaacatgattatagatgccaggagtcagggggcctgcacctgtctccgtgtcacc
cctgtgtcctccaaacagcaggcacacaggctcacctgcttgaggaaacaggaactgcct
cggggcagccagccccgccccattgacgtgcagaccttgaatcgaaacccaggctcctgc
aggcactggcacagctacagcgagggcctcggccatccaagggtctcccaggaagctatg
acagaggccgggaagctgcccctaccgctacccccacggctggactggtttgtgcacacc
cagatgggccagctggcccaagacggggtccccgagtggttccatggtgcaatctcaaga
gaggatgctgagaacttgctggagtcacagccactgggatcctttctcatcagggtcagt
cacagccatgtgggctacacactctcctacaaagcccaaagcagctgctgccatttcatg
gtgaagctcttggatgatgggactttcatgatccccggggagaaggtggcccacacctcg
ctggacgccctggtcaccttccaccagcagaagccaattgagccgcgcagggagctgctg
acacagccctgcaggcagaaggatcccgcaaacgtggattacgaggatctcttcctctac
tccaacgcagtggccgaggaagctgcctgcccggtgtctgcccctgaggaggcctcccca
aagccagtcctgtgtcaccaatcaaaggaaaggaagccgtcagcagagatgaacagaata
accaccaaggaagccacttcctcctgccccccaaaatcccctcttggagagacccgccag
aaactctggaggagcctcaaaatgctccccgagagaggccagagggtccggcagcagcta
aaaagccacctcgccactgtgaacttgtcgtcactcttggatgtccggagatccacggtg
atctcaggccctgggaccggaaaaggcagccaagatcactcaggggatcccacctcgggg
gacagaggctacacggatccctgtgtggccacatctctcaaaagcccctcacagccccag
gcaccaaaagacagaaaggtccccaccaggaaggccgagaggtcggtcagctgcattgag
gtgaccccaggggacaggagttggcaccaaatggtagtgagagccctatcctcccaggag
tccaagccagagcaccagggcttggcagagcctgagaacgaccagctcccggaggagtac
caacaaccgccaccctttgcccctgggtactgctag

>gi568815579f:16048751_16257791|GENSCAN_predicted_peptide_5|182_aa
MGNKQTVFTHEQLEAYQLFPPHIRLFYRYQDLAPQLVPLDYTTCPDVKVPYELIGSMPEL
KDNPFRQRIAQVFSEDGDGHMTLDNFLDMFSVMSEMAPRDLKAYYAFKIYDFNNDDYICA
WDLEQTVTKLTRGGLSAEEVSLVCEKVLDEADGDHDGRLSLEDFQNMILRAPDFLSTFHI
RI

>gi568815579f:16048751_16257791|GENSCAN_predicted_CDS_5|549_bp
atgggcaacaagcagacagtcttcacacacgagcagctggaagcgtatcagctttttcct
ccccacatcaggctcttctatcgctaccaggacctggccccacagctcgtgcccctcgac
tataccacctgccccgatgtgaaggtgccctacgagctcattggcagcatgcccgagctg
aaggacaaccccttccgccagaggattgcccaggtattctctgaggatggggatggccac
atgaccctggacaactttttggacatgttttccgtgatgagtgaaatggctccccgcgac
ctcaaggcttactatgcttttaaaatttatgattttaacaacgacgactacatttgtgcg
tgggacctggagcagacggtgaccaaactgacgcggggggggctgagtgccgaggaggtg
agcctggtatgtgagaaggtgctggatgaggctgatggagaccatgatgggcggctgtcc
ctggaagatttccagaacatgatcctccgggcaccagacttcctcagcaccttccacatc
cgaatctga

>gi568815579f:16048751_16257791|GENSCAN_predicted_peptide_6|166_aa
MNDNIREIENRSLFASCAGMELSQSKRSEAAIRAEIVFANRKCGTPATEALESVMEAYEQ
VQKGPLKLKGVAELGVTKRKKKKKDKDKAKLLEAMGTSKKNEEEKRRGLDKRTPAQAAFE
KMQEKRQMERILKKASKTHKQRVEDFNRHLDTLTEHYDIPKVSWTK

>gi568815579f:16048751_16257791|GENSCAN_predicted_CDS_6|501_bp
atgaatgacaacatcagagaaatagagaacagatccctgtttgccagctgtgcaggaatg
gagctcagccaatcaaagagaagcgaggcggccattcgggcggaaatagtgtttgcaaac
aggaagtgtggcactccagctaccgaagcactggagagtgtcatggaggcctacgagcag
gtccaaaagggacccctgaagctgaaaggcgtcgcagagctgggagtgaccaagcggaag
aagaaaaagaaggacaaagacaaagcgaaactcctggaagcaatgggaacgagcaaaaag
aacgaggaggagaagcggcgcggcctggacaagcggaccccggcccaggcggccttcgag
aaaatgcaggagaagcggcaaatggaaaggatcctaaagaaggcatccaaaacccacaag
cagagagtggaggacttcaacagacacctggacacactcacggagcattacgacattccc
aaagtcagctggacgaagtag

>gi568815579f:16048751_16257791|GENSCAN_predicted_peptide_7|174_aa
MSASAVYVLDLKGKQTFHEHLACTLPYAGHRVSIGAMAVVLGGSHLEAETLPHSWIQAAT
GAGTRSQVLQLLNLIFSQWKGEVVMKGSLKRATLEVGLEGTIMWKRGLWSGSGLRVPPHA
VTLPSDAITHSAICTRDLDVILASFVSSPLFSLHKCLLSTYYGPGPFLGAGKQL

>gi568815579f:16048751_16257791|GENSCAN_predicted_CDS_7|525_bp
atgtccgccagcgccgtctacgtgctggacctgaagggcaagcaaacatttcatgagcac
ctagcatgcaccctgccctatgccgggcatagggtgtccattggcgccatggctgtggtg
ttaggggggagtcacctagaagcagagacattaccacacagttggatccaggcagccaca
ggggcaggaacaaggtcccaagtcctgcagttgctcaacctcatctttagccagtggaag
ggggaggtggttatgaaaggctccctgaagagggcgacgctggaagtaggtttggaaggc
acgatcatgtggaaaagagggctgtggagcggctcaggcctacgcgtgcctccccatgct
gtcactcttcccagtgatgccatcacccattcagccatctgcacaagagacctggacgtt
atcctcgcctcctttgtctcctcacccctgttttcactccacaaatgtttactgagcacc
tactatggtccaggcccctttctgggtgctggaaaacagctgtga

>gi568815579f:16048751_16257791|GENSCAN_predicted_peptide_8|541_aa
MGTGLGFVLSAGEAGVLICRNYRGDVDMSEVEHFMPILMEKEEEGMLSPILAHGGVRFMW
IKHNNLYRIPLLGVLPGDSCVGVVVATSKKNACVSLVFSFLYKVVQVFSEYFKELEEESI
RDNFVIIYELLDELMDFGYPQTTDSKILQEYITQEGHKLETGAPRPPATVTNAVSWRSEG
IKYRKNEVFLDVIESVNLLVSANGNVLRSEIVGSIKMRVFLSGMPELRLGLNDKVLFDNT
GRGKSKSVELEDVKFHQCVRLSRFENDRTISFIPPDGEFELMSYRLNTHVKPLIWIESVI
EKHSHSRIEYMIKAKSQFKRRSTANNVEIHIPVPNDADSPKFKTTVGSVKWVPENSEIVW
SIKSFPGGKEYLMRAHFGLPSVEAEDKEGKPPISVKFEIPYFTTSGIQPQAAPAGKIKGG
PTGREPAVLRALLPVPQVRYLKIIEKSGYQALPWVRYITQNGGHLLCRRHSSQKPLSQKR
LVFKKSRFFAPETVEMQEMKQTHLAVPIKKQGDPQEPKTKEVDAEDTVGAHTGVALGNHR
V

>gi568815579f:16048751_16257791|GENSCAN_predicted_CDS_8|1626_bp
atggggacgggcttgggctttgttctcagtgccggggaggctggggtgctcatctgccgg
aactaccgtggcgacgtggacatgtcagaggtggagcacttcatgcccatcctgatggag
aaggaggaggaggggatgctgtcgcccatcctggcccacgggggggtccgtttcatgtgg
atcaaacacaacaacctgtatcgtatccctttgctgggggtgctcccaggggactcctgt
gtgggtgttgtggttgccacatccaagaagaacgcgtgcgtgtcgctggtcttttctttc
ctctataaggtggtgcaggtgttttccgagtacttcaaggagctggaggaggagagcatc
cgggacaactttgttatcatctacgagctgctggacgagctcatggacttcggctacccc
cagaccaccgacagcaagatcctgcaggagtacatcactcaggaaggccacaagctggaa
acaggggccccgcggccaccagccaccgtcaccaacgcggtgtcctggcggtccgaaggc
atcaagtatcggaagaatgaggtgttcttggacgtcatcgagtctgtcaacctcttggtc
agcgccaacggcaatgtcctgcgcagcgagatcgtgggctccatcaagatgcgagtcttc
ctctcgggcatgcccgagctgcgcctgggcctcaacgacaaggtcctctttgacaacacg
ggccgcggcaaaagcaaatccgtggagctggaggatgtgaagttccaccagtgtgtgcgg
ctatcacgcttcgagaatgaccgcaccatctccttcatcccacccgacggcgagttcgag
ctcatgtcctaccgtctcaacacccacgtcaagcctttgatatggatcgagtcggtgatc
gagaagcactcccacagccgcatcgagtacatgatcaaggccaaaagccagttcaagcgg
cggtcaacagccaacaacgtggagatccacattcccgtgcccaatgatgccgactcaccc
aagttcaagacgacggtggggagcgttaagtgggtccccgagaacagcgagatcgtgtgg
tccatcaagtccttcccgggcggcaaggagtacctgatgcgggcccacttcggcctgcct
agtgtggaggccgaagacaaggagggcaagcccccgatcagtgtcaagttcgagatccct
tacttcactacctccggcatccagccccaggctgccccagcaggaaagattaagggggga
ccaacagggcgggagcctgcggtgctcagggccctgctacctgtgccccaggtgcgctac
ctgaagatcattgagaagagtgggtaccaggccctgccctgggtgcgttatatcacgcag
aatggaggccatctgctctgccgtcgacactcgtctcagaagcccctttcccagaagagg
ctggtcttcaagaagtctcgtttctttgcccctgaaactgtagaaatgcaggaaatgaag
cagacgcacctagccgtgcccataaaaaagcaaggagacccacaggagcccaagacgaag
gaagtagacgcggaagacaccgtgggggctcacaccggagttgctcttggaaaccaccgg
gtttag