GENSCAN 1.0 Date run: 7-Nov-116 Time: 21:44:28 Sequence gi568815577f:33973701_34242597 : 268897 bp : 43.04% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 14647 14701 55 0 1 87 56 36 0.010 1.75 1.02 Intr + 35922 36058 137 1 2 54 81 80 0.108 4.19 1.03 Intr + 65069 65170 102 1 0 64 115 65 0.868 7.17 1.04 Intr + 81248 81361 114 1 0 48 97 37 0.497 1.24 1.05 Intr + 85860 85986 127 2 1 57 77 98 0.923 5.85 1.06 Intr + 95532 95654 123 1 0 104 17 92 0.809 4.26 1.07 Intr + 99169 99423 255 2 0 69 27 106 0.430 0.02 1.08 Intr + 99837 100045 209 1 2 30 114 245 0.990 20.10 1.09 Term + 108326 108346 21 1 0 94 49 9 0.071 -4.09 1.10 PlyA + 109082 109087 6 1.05 2.00 Prom + 110853 110892 40 -4.86 2.01 Sngl + 121499 123655 2157 1 0 56 43 1386 0.906 125.04 2.02 PlyA + 126487 126492 6 1.05 3.00 Prom + 127582 127621 40 -2.66 3.01 Init + 151683 151780 98 2 2 68 114 12 0.352 1.67 3.02 Intr + 152527 152646 120 1 0 74 92 63 0.408 4.91 3.03 Term + 154221 154320 100 0 1 29 52 126 0.366 0.70 3.04 PlyA + 154850 154855 6 1.05 4.08 PlyA - 160669 160664 6 1.05 4.07 Term - 161877 161732 146 1 2 54 34 287 0.984 17.97 4.06 Intr - 162241 161992 250 1 1 53 -10 404 0.080 24.11 4.05 Intr - 169911 169755 157 2 1 89 62 42 0.006 1.71 4.04 Intr - 180205 180138 68 1 2 104 72 64 0.424 4.10 4.03 Intr - 188115 187972 144 0 0 101 65 38 0.385 3.28 4.02 Intr - 189037 188938 100 0 1 38 64 45 0.222 -2.79 4.01 Init - 193816 193677 140 1 2 40 89 99 0.404 4.81 4.00 Prom - 194973 194934 40 -5.76 5.03 PlyA - 195536 195531 6 1.05 5.02 Term - 207499 207345 155 2 2 43 48 162 0.610 5.78 5.01 Init - 208789 208762 28 1 1 84 100 31 0.711 3.66 5.00 Prom - 210171 210132 40 -3.86 6.06 PlyA - 210602 210597 6 1.05 6.05 Term - 212872 212623 250 0 1 142 47 138 0.924 10.28 6.04 Intr - 217206 217053 154 1 1 34 75 82 0.719 0.63 6.03 Intr - 217553 217463 91 2 1 61 92 42 0.876 1.47 6.02 Intr - 217969 217857 113 2 2 24 75 100 0.584 2.40 6.01 Init - 218430 218346 85 0 1 66 64 77 0.611 4.10 6.00 Prom - 220047 220008 40 -4.86 7.00 Prom + 220169 220208 40 -4.76 7.01 Init + 220951 221092 142 0 1 86 60 91 0.841 6.40 7.02 Term + 224788 224852 65 2 2 98 45 59 0.772 0.65 7.03 PlyA + 225930 225935 6 -0.45 8.06 PlyA - 226312 226307 6 1.05 8.05 Term - 227880 227743 138 0 0 80 40 78 0.560 0.16 8.04 Intr - 229441 229383 59 2 2 118 87 21 0.499 3.60 8.03 Intr - 230653 230497 157 1 1 60 110 41 0.961 3.08 8.02 Intr - 231007 230893 115 0 1 46 49 151 0.991 7.45 8.01 Init - 232340 232273 68 2 2 68 94 51 0.691 4.24 8.00 Prom - 268095 268056 40 -1.46 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815577f:33973701_34242597|GENSCAN_predicted_peptide_1|380_aa MVEGQGRVRHFTWLEQEEAKLTWKPGKHHLHGSAPLTSTGEEALDLTGSVVRKEYGRTDS WLTKTTSVADDMVCSQMLLQSEGTHSPAVRKLTGDGSQEAVRVLRKQKCDLINMLFEKDH SGSPRVEEGRPGSVRQMRKLNHRVQLLVQGYTVIGHRRSGFGSMLLTTKLYCCQLATFEQ RIQGQKGKKEPDTGTKETKENPKSTGSWMAKKRPSGLQRGSRPKVTQCQGTESDPSPRAA AHGLCQRAPRRVGGLVSRVRPESCGWGRELHPPTDPGRECSRTPASSRWPHSPRAGRDPA LRREAASAFKPGLALSDRAFAAWEPSGAAVSRSPLSPPSRPFASREPAGFRAALADPPGM PRYELALILKAMQRPEIQIL >gi568815577f:33973701_34242597|GENSCAN_predicted_CDS_1|1143_bp atggtggaaggccaagggcgagtgaggcacttcacgtggctggagcaggaggaagccaaa ctcacctggaagccaggaaaacaccatctgcatgggtcagccccactgaccagcacaggg gaagaagcactggatctgactggctcagtggtccgtaaggaatacggcaggaccgactcc tggctgacaaagacaacttctgtagcagatgatatggtgtgctctcagatgctgcttcag agtgaaggcactcattccccggctgtcaggaagctgactggtgatgggtcacaggaggca gttagagttttacgtaaacagaaatgtgatctgatcaacatgctgtttgagaaggaccac tctgggagcccgagagtagaagagggaagacccgggtcagtgaggcagatgaggaaactg aaccacagagttcagttacttgttcaagggtacacagtcattggacacaggcggtctggc tttggctccatgctcttgaccaccaagctgtactgctgccaactagccacctttgaacag cgcattcagggacagaaggggaaaaaggagccagacacagggaccaaggagacaaaggaa aaccccaagagtacaggatcatggatggccaagaagagaccctcaggactgcagagaggc agcaggcccaaggtcacgcagtgccaagggactgagtcagacccgagcccgcgcgccgcg gcccacggcctgtgccagcgcgctcctcgccgggtagggggcctggtcagccgcgtcaga cctgagagttgtggctgggggcgagagctgcacccccctaccgaccccgggcgggaatgt tccagaacccctgcgagcagccgttggccgcacagcccccgggccggccgcgaccccgcc ttgcgccgcgaagctgccagcgcctttaagcccgggctcgcgctctcggaccgtgctttc gccgcctgggagccgtccggcgcagcagtttctaggtccccactgtccccgccgtcccgc cccttcgcgtcccgggaaccggctggcttccgagccgcactcgccgatcctccaggcatg ccccgctacgagctggctttaatcctgaaagccatgcagcggccagaaatccagatttta tag >gi568815577f:33973701_34242597|GENSCAN_predicted_peptide_2|718_aa MRAVLDTADIAIVALYFILVMCIGFFAMWKSNRSTVSGYFLAGRSMTWVAIGASLFVSNI GSEHFIGLAGSGAASGFAVGAWEFNALLLLQLLGWVFIPIYIRSGVYTMPEYLSKRFGGH RIQVYFAALSLILYIFTKLSVDLYSGALFIQESLGWNLYVSVILLIGMTALLTVTGGLVA VIYTDTLQALLMIIGALTLMIISIMEIGGFEEVKRRYMLASPDVTSILLTYNLSNTNSCN VSPKKEALKMLRNPTDEDVPWPGFILGQTPASVWYWCADQVIVQRVLAAKNIAHAKGSTL MAGFLKLLPMFIIVVPGMISRILFTDDIACINPEHCMLVCGSRAGCSNIAYPRLVMKLVP VGLRGLMMAVMIAALMSDLDSIFNSASTIFTLDVYKLIRKSASSRELMIVGRIFVAFMVV ISIAWVPIIVEMQGGQMYLYIQEVADYLTPPVAALFLLAIFWKRCNEQGAFYGGMAGFVL GAVRLILAFAYRAPECDQPDNRPGFIKDIHYMYVATGLFWVTGLITVIVSLLTPPPTKEQ IRTTTFWSKKNLVVKENCSPKEEPYKMQEKSILRCSENNETINHIIPNGKSEDSIKGLQP EDVNLLVTCREEGNPVASLGHSEAETPVDAYSNGQAALMGEKERKKETDDGGRYWKFIDW FCGFKSKSLSKRSLRDLMEEEAVCLQMLEETRQVKVILNIGLFAVCSLGIFMFVYFSL >gi568815577f:33973701_34242597|GENSCAN_predicted_CDS_2|2157_bp atgagagctgtactggacacagcagacattgccatagtggccctgtattttatcctggtc atgtgcattggtttttttgccatgtggaaatctaatagaagcaccgtgagtggatacttc ctggcggggcgctctatgacctgggtagcaattggtgcctctctgtttgtgagcaatatt gggagtgagcacttcattgggctggcaggatctggagctgcaagtggatttgcagtgggc gcatgggaattcaatgccttactgcttttacaacttctgggatgggttttcatcccaatt tacatccggtcaggggtatataccatgcctgaatacttgtccaagcgatttggtggccat aggattcaggtctattttgcagccttgtctctgattctctatattttcaccaagctctcg gtggatctgtattcgggtgccctttttatccaggagtctttgggttggaatctttatgtg tctgtcatcctgctcattggcatgactgctttgctgactgtcaccggaggccttgttgca gtgatctacacagacactctgcaggctctgctcatgatcattggggcacttacacttatg attattagcataatggagattggcgggtttgaggaagttaagagaaggtacatgttggcc tcacccgatgtcacttccatcttattgacatacaacctttccaacacaaattcttgtaat gtctcccctaagaaagaagccctgaaaatgctgcggaatccaacagatgaagatgttcct tggcctggattcattcttgggcagaccccagcttcagtatggtactggtgtgctgaccaa gtcatcgtgcagagggtccttgcagccaaaaacattgctcatgccaaaggctctactctt atggctggcttcttaaagctcctgccaatgtttatcatagttgtcccaggaatgatttcc aggatactgtttactgatgatatagcttgcatcaacccagagcactgcatgctggtgtgt ggaagcagagctggttgctccaatattgcttacccacgcctggtgatgaagctggttcct gtgggccttcggggtttaatgatggcagtgatgattgcagctctgatgagtgacttagac tctatctttaacagtgccagtaccatattcaccctcgatgtgtacaaacttatccgcaag agcgcaagctcccgggagttaatgattgtggggaggatatttgtggcatttatggtggtg atcagcatagcatgggtgccaatcatcgtggagatgcaaggaggccagatgtacctttac attcaggaggtagcagattacctgacacccccagtggcagccttgttcctgctggcaatt ttctggaagcgctgcaatgaacaaggggctttctatggtggaatggctggctttgttctt ggagcagtccgtttgatactggcctttgcctaccgtgccccagaatgtgaccaacctgat aataggccgggcttcatcaaagacatccattatatgtatgtggccacaggattgttttgg gtcacgggactcattactgtaattgtgagccttctcacaccacctcccacaaaggaacag attcgaaccaccaccttttggtctaagaagaacctggtggtgaaggagaactgctcccca aaagaggaaccatacaaaatgcaagaaaagagcattctgagatgcagtgagaataatgag accatcaaccacatcattcccaacgggaaatctgaagacagcattaagggccttcagcct gaagatgttaatctgttggtaacctgcagagaggagggcaacccagtggcatccttaggt cattcagaggcagaaacaccagttgacgcttactccaatgggcaagcagctctcatgggt gagaaagagagaaagaaagaaacggatgatggaggtcggtactggaagttcatagactgg ttttgtggctttaaaagtaagagcctcagcaagaggagtctcagagacctgatggaagag gaggctgtttgtttacagatgctagaagagactcggcaagttaaagtaatactaaatatt ggactttttgctgtgtgttcacttggaattttcatgtttgtttatttctccttatga >gi568815577f:33973701_34242597|GENSCAN_predicted_peptide_3|105_aa MDRGAIVRDLENLGERALPYRISAHSQQHNRGGTQVMGLPSALPQGPSKTVSADVLCDEN AACSYVSRAVLCSPRGYDTYPKLRQGADLKPDSGVREPAFGWLME >gi568815577f:33973701_34242597|GENSCAN_predicted_CDS_3|318_bp atggacagaggagcaatagtgagggacttggaaaacctgggtgaacgagcgcttccttat aggatctctgcccacagtcagcagcacaacagaggcggaactcaggttatgggtctccca tctgcattgccacaaggacccagcaagacagtctctgcagatgtgctctgcgacgagaat gctgcttgtagttatgtgagtcgagctgtcttgtgcagccctagaggctatgacacgtat ccgaaattacggcaaggcgctgatttgaaacctgacagtggtgtgcgagagccagctttt ggctggctaatggagtga >gi568815577f:33973701_34242597|GENSCAN_predicted_peptide_4|334_aa MFVGKHRNHTAADSCHHYRTLLCYPTSSAFCRKQHPSENATVAPATCNLQLQLRKNKRDG KCKNLDQYSSSEQLSCKSCQVIVSLYYGTLPSDASSKLERVNFPNLKLVGLGLGSGEGNP EAQHAGKREKSSFLLVGMTIKLHDHFDDAAGDSQNRHFPQHFMCRSSVICDGDRHLVQGT PWEGGRASQVFPISPEETEEEMQDSIAVKEKYAKYLPHNAGPYAAKRFCKAQCPIVESLT NSMIMHGHNNSKKLMTVRIIKYAFHFIHLLTGENPHLQVLVNTIINGTLEGALREIMTIA ECLADELITATKGSSNSFAIKKKDELACVAKSNR >gi568815577f:33973701_34242597|GENSCAN_predicted_CDS_4|1005_bp atgtttgttggaaaacatcgaaaccacactgcagctgattcttgtcaccattaccgcacc ctgctgtgctacccgacgtcaagtgcattctgcaggaagcagcacccctctgaaaatgcc actgtggcaccagctacatgcaatctccagctgcagctcagaaagaacaagagggatggg aaatgtaaaaatctggatcaatattctagttctgagcagttatcctgcaaatcctgccag gtaattgtgtctctgtactatgggaccctcccctcagatgcatcctccaaactggaaaga gttaatttcccaaaccttaaactggttggcttaggactgggttcaggggaaggaaaccca gaagcccaacatgccggcaaaagggaaaaatcaagttttttgctggtgggaatgaccatc aaactgcatgaccattttgatgatgcagcaggagattcccagaaccggcacttcccccaa cattttatgtgcagaagttccgtcatctgtgatggtgacagacaccttgtgcagggcacc ccctgggaagggggaagggctagtcaagtctttccaatcagccccgaggagaccgaggaa gagatgcaggattccattgcagtgaaggagaagtatgccaagtacctgcctcacaatgca gggccttatgctgccaaacgcttctgcaaagctcagtgccccattgtggagtccctcact aactccatgataatgcatggccacaacaacagcaagaagctcatgactgtgcgcatcatc aagtatgccttccatttcatccacctgctcacaggcgagaaccctcacctccaggtcctg gtgaacaccatcatcaatggcactcttgagggtgccctccgggagatcatgaccattgct gagtgcctggcagatgagctcatcactgccaccaagggctcctccaactcctttgccatc aagaagaaggatgagctggcatgtgtggccaagtccaaccgctga >gi568815577f:33973701_34242597|GENSCAN_predicted_peptide_5|60_aa MAGVEDGSEATVTYNPSVKGRNFPMQKQQKVNRGNIAMSKGSSTQYLRLCYKATQRVDEK >gi568815577f:33973701_34242597|GENSCAN_predicted_CDS_5|183_bp atggctggtgtggaagatgggtcagaagcaacggtgacatataatcccagtgttaaagga cggaattttccaatgcagaagcagcagaaagtgaacagaggaaacattgccatgagcaaa ggctcctcaacacagtatctccgactgtgctacaaagccacccagcgtgtggatgaaaag tga >gi568815577f:33973701_34242597|GENSCAN_predicted_peptide_6|230_aa MANGTRAGWLLRTTHGVECDQTMSARIKETKSSAFRSMKTTHYPIREMPVTEMPVEIGSF WLQAELAQVPICCHIFAVSAQPFLFWHEFTGSCRTREQLKMYKHCFSHLPLTFNEILLKY NQDDTSSIFNSSTMALACQSGYREDQSRCVYIDTYSCLANEVQGHLHRLPAHRTAGPLRP GTRAISVTLFPELGTEEVGNWYWSSNQLCSLTLRPAFAIKTAQFCEPYKP >gi568815577f:33973701_34242597|GENSCAN_predicted_CDS_6|693_bp atggccaatggcaccagggctggctggctgctgaggacaactcatggagtggaatgtgac cagacaatgtcagcacgcatcaaagagacaaaaagcagtgcttttcgaagtatgaaaact acccattatcccattagagagatgccggttacagagatgccggtggaaatcggaagcttc tggcttcaggcagaactagcacaagttcccatctgctgccacatctttgcagtatcagct cagcctttcctcttctggcacgagttcacaggctcctgtagaaccagagagcagttgaag atgtataaacattgcttctcccatcttccactcactttcaatgaaatactacttaaatac aatcaggacgacacatcttcaatatttaacagctcgactatggccctggcctgccagagt ggatatcgtgaagaccagagcaggtgtgtgtacatagatacatactcgtgtttagctaat gaagtacaaggtcacctgcacaggctgcctgcccaccggactgctggccccttgaggcca ggaaccagggccatctctgtcaccctcttcccagagcttggcaccgaggaggtgggaaat tggtactggtcctctaatcagctctgctccctaaccctgcgaccagccttcgccatcaag acggctcagttctgtgaaccgtacaaaccatga >gi568815577f:33973701_34242597|GENSCAN_predicted_peptide_7|68_aa MERLGTPYPKCGLWASSTCVIWELVRNGILSRSQTLLHQNLHFNEIPGLSLAAKRLSVFA DLQKGARS >gi568815577f:33973701_34242597|GENSCAN_predicted_CDS_7|207_bp atggagcgtctaggaactccctacccaaagtgtggtctgtgggccagcagcacctgcgtc atctgggaacttgtaagaaatgggattcttagcagatcccagaccctgctgcatcagaac ctgcattttaatgagatcccaggcctttctttagctgctaaaaggctgtcggtctttgct gatttgcagaaaggtgcacgctcatga >gi568815577f:33973701_34242597|GENSCAN_predicted_peptide_8|178_aa MRKWFMYYDEKISKMHCQLKTVRRPFITLWLEDHLQQPLLCAGNTEQHVRGCQGLLATWA WQVCGLPKAKDPCYISGYSQSSQRSREHTASPAINACWLPARIQAKNGVALRPDSLGPTV PHPCPMQMAEETIGRTRNAEGGRKLELRGVDLIVQAEMPTFKLKDAARQDAWLSSVFK >gi568815577f:33973701_34242597|GENSCAN_predicted_CDS_8|537_bp atgaggaaatggtttatgtactatgatgaaaagatctctaagatgcactgtcaactgaaa acagtgaggaggccttttatcaccctgtggctggaggaccatctgcagcagcccctgctg tgcgctggcaacactgagcagcatgtgcgcggctgccaggggctgctggccacctgggcc tggcaagtctgtggcctccccaaagctaaggatccctgttatatctccggatattcacag agctcccagcgcagccgagaacacacagccagccctgcaataaatgcttgttggttgcct gccaggatccaagctaaaaatggagttgccttacggccagatagcttggggcccacagtt ccccacccctgcccgatgcagatggcagaagaaaccattggtcgaacaagaaatgcagaa ggtggcaggaaattagaactaaggggagtggaccttattgtgcaggctgaaatgcccacc ttcaagttaaaagatgcagcaagacaagacgcctggctcagcagcgtgttcaaatag