GENSCAN 1.0 Date run: 6-Nov-116 Time: 12:53:34 Sequence gi568815575r:3512203_3813253 : 301051 bp : 44.05% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 Intr - 388 260 129 1 0 113 105 9 0.321 5.99 1.03 Intr - 1039 943 97 0 1 53 89 37 0.274 0.31 1.02 Intr - 6958 6878 81 0 0 94 81 79 0.465 6.55 1.01 Init - 8203 8115 89 1 2 80 94 42 0.482 4.12 1.00 Prom - 8896 8857 40 -9.95 2.00 Prom + 9461 9500 40 -6.26 2.01 Init + 11018 11047 30 1 0 69 85 51 0.967 0.80 2.02 Term + 12159 12779 621 2 0 20 32 1980 0.992 180.02 2.03 PlyA + 13345 13350 6 1.05 3.03 PlyA - 14675 14670 6 1.05 3.02 Term - 24307 24237 71 2 2 94 48 56 0.004 0.30 3.01 Init - 44151 44115 37 0 1 96 115 44 0.739 8.08 3.00 Prom - 44739 44700 40 -1.36 4.00 Prom + 46020 46059 40 -2.46 4.01 Init + 69976 70134 159 0 0 66 96 69 0.695 5.32 4.02 Intr + 70480 70598 119 0 2 76 103 23 0.680 1.86 4.03 Term + 85741 86029 289 1 1 64 54 118 0.041 0.85 4.04 PlyA + 86996 87001 6 1.05 5.09 PlyA - 90202 90197 6 1.05 5.08 Term - 100123 99998 126 1 0 55 55 185 0.966 10.18 5.07 Intr - 103690 103613 78 1 0 75 82 80 0.956 5.85 5.06 Intr - 109114 109057 58 0 1 68 86 60 0.832 2.69 5.05 Intr - 114312 114217 96 2 0 114 106 63 0.951 9.82 5.04 Intr - 129769 129650 120 0 0 74 98 115 0.337 10.71 5.03 Intr - 143210 142947 264 1 0 135 110 522 0.996 55.73 5.02 Intr - 162564 162396 169 1 1 147 68 260 0.892 28.90 5.01 Init - 164736 164727 10 0 1 80 93 5 0.467 1.00 5.00 Prom - 167632 167593 40 -0.06 6.05 PlyA - 169461 169456 6 -0.45 6.04 Term - 170737 170694 44 2 2 141 43 26 0.044 0.72 6.03 Intr - 193292 193173 120 0 0 101 90 11 0.143 3.07 6.02 Intr - 201123 200886 238 0 1 44 92 334 0.624 26.59 6.01 Init - 201373 201242 132 1 0 63 52 223 0.995 14.34 6.00 Prom - 201812 201773 40 -13.33 7.00 Prom + 202339 202378 40 -6.06 7.01 Init + 202517 202629 113 1 2 64 0 167 0.810 3.77 7.02 Intr + 204907 205323 417 1 0 30 107 344 0.388 23.64 7.03 Intr + 239283 239326 44 0 2 114 87 52 0.231 5.58 7.04 Term + 248812 248864 53 2 2 114 39 53 0.448 0.59 7.05 PlyA + 248907 248912 6 1.05 8.02 PlyA - 251082 251077 6 1.05 8.01 Sngl - 254206 253802 405 1 0 78 38 164 0.588 6.68 8.00 Prom - 268527 268488 40 -3.36 9.00 Prom + 272023 272062 40 -4.46 9.01 Init + 280335 280498 164 2 2 73 52 152 0.899 9.30 9.02 Term + 281305 281443 139 1 1 42 40 104 0.774 -1.56 9.03 PlyA + 281476 281481 6 1.05 10.04 PlyA - 282499 282494 6 1.05 10.03 Term - 289538 289408 131 0 2 96 48 129 0.994 8.04 10.02 Intr - 294524 294444 81 0 0 96 64 24 0.095 0.41 10.01 Intr - 296615 296395 221 1 2 73 21 132 0.090 2.95 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 118786 118726 61 1 1 73 103 55 0.925 6.91 S.002 Term - 140672 140530 143 0 2 113 45 60 0.864 2.29 S.003 Intr - 141166 141027 140 0 2 42 50 82 0.826 -0.09 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575r:3512203_3813253|GENSCAN_predicted_peptide_1|132_aa MVRDGAQREFREAGLEDRSDDDGAARSWRRQALSFQNPAFARAVPQNGKALTNDRVKIPS AQGASEQSNAYDPREALPSHGRSHPLELMGLLMYIGNNNLYITEIWGDRLEMIYVKTGVP TWPVRNWAAQQE >gi568815575r:3512203_3813253|GENSCAN_predicted_CDS_1|396_bp atggtgcgagatggagcacagagagaattcagagaggctggccttgaggatcggagtgat gacgatggagccgccagaagctggaggaggcaagccctgtccttccagaaccccgccttt gccagggctgtgcctcagaacgggaaggctctgaccaatgacagagtgaaaattccatcc gcccagggtgcttctgagcaaagcaatgcttatgaccccagagaagcccttccttcccat ggacgctcccaccctttggagctcatgggtcttttgatgtacattgggaacaataatctt tatattacagagatttggggagacagattagaaatgatatatgttaaaacaggggtcccc acgtggcccgttaggaactgggccgcacagcaggag >gi568815575r:3512203_3813253|GENSCAN_predicted_peptide_2|216_aa MAPRVFSLSLKQIYLSLEAYITIIIITFTIIITITITIIMTIITNTIITITITTITITIT TITITTTITITITTITTITITTITTTITITITTTTITTTTITITTTTITTTTITTTTITI TIITTTITTITITITTITITTTTTITITTTTIIITITTITITITTITITTTITITTITIT ITTITVTIMVITIIMTIITNTIITITITITIAIIII >gi568815575r:3512203_3813253|GENSCAN_predicted_CDS_2|651_bp atggctccccgggtctttagcctttccctgaaacaaatttatctctcattggaagcttat atcaccatcatcatcatcaccttcaccatcatcatcaccatcaccatcaccatcatcatg actatcatcaccaacaccatcatcaccatcaccatcaccaccatcaccatcaccatcacc accatcaccatcaccaccaccatcaccatcaccatcaccaccatcaccaccatcaccatc accaccatcaccaccaccatcaccatcactatcaccactaccaccatcaccaccaccacc atcaccatcaccaccaccaccatcaccaccaccaccatcaccaccaccaccatcaccatc accatcatcaccaccaccatcaccaccatcaccatcaccatcaccaccatcaccatcacc accaccaccaccatcaccatcaccaccaccaccatcatcatcaccatcaccaccatcacc atcaccatcaccaccatcaccatcaccaccaccatcaccatcaccaccatcaccatcacc atcaccaccatcaccgtcaccatcatggtcatcaccatcatcatgactatcatcaccaac accatcatcaccatcaccatcaccatcaccatcgccatcatcatcatctaa >gi568815575r:3512203_3813253|GENSCAN_predicted_peptide_3|35_aa MDAELQVTLTVEDVHVLISETCESFTLHGKQDFPD >gi568815575r:3512203_3813253|GENSCAN_predicted_CDS_3|108_bp atggacgccgagcttcaggtaactctcacagtggaagatgttcatgtcctgatctcagaa acctgtgaatcttttacattacatggcaaacaggactttccagattaa >gi568815575r:3512203_3813253|GENSCAN_predicted_peptide_4|188_aa MSGESFGCHIGGASGIWWVEPRDAAQHPTMHRTKDYPAVMVVAPNWISLEVDPKWYCAIN SILVISITHSTKMFARFNDVAEYATILLLINSCLNQSIDCDLCQPIRIQQASTSQNSVSV NQLEPNKCQTVRAHRGSTHQNSTSVDSSEPKCQTHQNSTSAHQSEPNKCQPIRTQQVSAY QNSMSISP >gi568815575r:3512203_3813253|GENSCAN_predicted_CDS_4|567_bp atgtctggagagagttttggctgccacattgggggtgctagtggcatctggtgggtagag cccagggatgctgctcaacatcctacaatgcacaggacaaaggattatccagctgtaatg gtggtagcaccaaactggataagccttgaagtagatccgaaatggtattgtgctataaat agcattctggttattagtatcactcactcaacaaaaatgtttgcaagattcaatgatgtt gctgaatatgcaacaattttgctgcttatcaattcatgcctgaatcaatcaatagactgt gatctgtgtcaacccatcagaattcaacaagcatcaactagtcagaactcggtaagtgtc aaccagttagaacccaacaagtgtcaaacagtcagagctcatcgggggtcgacccatcag aactcaacaagtgttgactcatcagaacccaagtgccaaactcatcagaactcaacaagt gcccaccagtcagaacccaataagtgtcaacccatcagaactcaacaagtgtccgcctat cagaactcaatgagcatcagcccatga >gi568815575r:3512203_3813253|GENSCAN_predicted_peptide_5|306_aa MAEGTGTFGRVHLVKEKTAKHFFALKVMSIPDVIRLKQEQHVHNEKSVLKEVSHPFLIRL FWTWHDERFLYMLMEYVPGGELFSYLRNRGRFSSTTGLFYSAEIICAIEYLHSKEIVYRD LKPENILLDRDGHIKLTDFGFAKKLVDRTWTLCGTPEYLAPEVIQSKGHGRAVDWWALGI LIFEMLSGFPPFFDDNPFGIYQKILAGKIDFPRHLDFHVKDLIKKLLVVDRTRRLGNMKN GANDVKHHRWFRSVDWEAVPQRKLKPPIVPKIAGDGDTSNFETYPENDWDTAAPVPQKDL EIFKNF >gi568815575r:3512203_3813253|GENSCAN_predicted_CDS_5|921_bp atggcagaaggcactgggacgttcgggcgggtgcacctggtgaaggagaagacagccaag catttcttcgccctcaaggtgatgagcattcccgacgtcatccgcctaaagcaggagcaa cacgtacacaatgagaagtctgtcctgaaggaagtcagccacccgttcctcatcaggctg ttctggacgtggcatgacgagcgcttcctctacatgctcatggagtacgtgccgggcggc gagctcttcagctacctgcgcaaccgggggcgcttctccagcaccacggggctcttctac tctgcagagatcatctgtgccatcgagtacctgcactccaaagagatcgtctacagggac ttgaagccagagaacatcctgctggatagggatggccacattaagctcacggactttggg ttcgccaagaagctggtagacaggacttggaccctctgtggaacacccgagtacctagcc cccgaagtcattcagagcaagggccacggaagggccgtggactggtgggccctcggcatc ctgatattcgagatgctttcggggtttcctccgttttttgatgacaacccgtttggcatt tatcagaaaattcttgcaggcaaaatagatttccccagacatttggatttccatgtaaaa gacctcattaagaaactgctcgtggttgacagaacaaggcgattaggaaacatgaagaac ggggcgaatgatgtgaagcatcatcggtggttccgctccgtggactgggaagctgttccg cagagaaaactgaagcctcccatcgtgcccaagatagctggtgacggcgacacttccaac ttcgaaacttaccctgagaatgactgggacacagccgcgcccgtgccgcagaaggattta gaaatcttcaagaatttctga >gi568815575r:3512203_3813253|GENSCAN_predicted_peptide_6|177_aa MRRAPASPPPARAAAPAFPAVVPHSVPAPRGRRLLPLGAVAACAAASREARLPERSGLAR CPGPECVPMEAPGLAQAAAAESDSRKVAEETPDGAPALCPSPEALSPEPPVYSLQDFDTL ATVGGLPVVLYCGGSSPKTKSGATIIIFVSPTPRFLCQNPNPQGTPVIVFRVHSNFG >gi568815575r:3512203_3813253|GENSCAN_predicted_CDS_6|534_bp atgcgcagggcccccgcctcgccccccccagcccgggccgcggcccccgccttccccgca gtcgtcccgcactcggtgcccgccccccgaggccggcggctgctcccactcggggccgtt gctgcttgtgccgctgcgtcccgggaagcccggctccccgagcgctccggcctggcccgg tgccccggacctgagtgcgtccccatggaggcgcccgggctggcccaggcggccgcggcg gagagcgactcccgcaaggtggcggaggagacccccgacggggcgcccgcgctctgcccc agccctgaggcgctgtcgccggagccgcctgtgtacagcctgcaggactttgacacgctg gccaccgtgggtggcctccccgtggtcctctactgtggtgggtccagtcccaaaaccaag tctggggccaccatcatcatatttgtgtcccccaccccaagattcttatgtcaaaaccct aatccccaaggaacacccgtcattgtgtttagggtccactccaattttggatga >gi568815575r:3512203_3813253|GENSCAN_predicted_peptide_7|208_aa MQTVLNLRAWHGLDAAPLTRDQGGPGCRFADHSLSSKGFPFTAICGEAATKLQIFVKTLR GKTITLEVEPSDAIENVKAKIQDKEGIPPDQQRLIFAGKQLEDGRTLSDCNIQKEPTLHL LLRLCGVAKKRKKSYPTPKKNKHKRKKVKLDVLKCYKVDENDKISALLTNVELECFCHTE CASFLFAFCHDCSNLLGLIKDRTNSYSD >gi568815575r:3512203_3813253|GENSCAN_predicted_CDS_7|627_bp atgcagacagtgcttaacttgcgggcctggcacggcctggacgctgcgcccctgacccgc gaccagggcggcccaggctgccggttcgccgaccacagtttgagcagcaagggcttccct ttcactgccatctgcggtgaagctgccaccaaactgcagattttcgtgaaaaccctaagg gggaagaccatcacccttgaggttgaaccctcagatgcgatagaaaatgtaaaggccaag atccaggataaggaaggaattcctcctgatcagcaaagactgatctttgctggcaagcaa ctggaagatggacgtactttgtctgactgcaacattcaaaaggagcctactcttcatctt ttgctgagactttgtggtgttgctaagaaaaggaagaagtcttaccccactcccaagaag aataagcacaagagaaagaaggtgaagctggatgtcctgaaatgttataaggtggatgag aatgacaaaattagtgcccttctgacgaatgtggagctggagtgtttctgccatacagaa tgtgcctccttcctcttcgccttctgccatgattgttccaacctactgggcttgatcaag gacaggaccaattcctattcagattaa >gi568815575r:3512203_3813253|GENSCAN_predicted_peptide_8|134_aa MEYYTAIKKNEIMPFAATWMELEAIILSELTQEQKINACSYLQVGAKHCVLTDIKMGKVD PGDSYKGERGGARVEKLTIEYYVGDRLNRSPNLSTMQYIHVTNLYMYPLNLKFFLKTHHQ IDFIPTRTASIKVG >gi568815575r:3512203_3813253|GENSCAN_predicted_CDS_8|405_bp atggaatactacacagccataaaaaagaatgaaatcatgccctttgcagcaacatggatg gagctggaggccattattctaagtgaactaacacaggaacagaaaatcaacgcatgttct tacttacaagtgggagctaaacattgtgtactcacagacataaagatgggaaaagtagac cctggggactcctacaagggggagagagggggagcaagggttgaaaaactaaccattgag tactatgtgggtgacaggctcaacagaagcccaaacctcagcaccatgcaatatatccat gtaacaaacctgtacatgtaccccttaaatctaaaatttttcttaaaaacccaccatcag attgacttcatacccactaggacagctagcatcaaggtcggataa >gi568815575r:3512203_3813253|GENSCAN_predicted_peptide_9|100_aa MREKEMKEEEEKEEEEEKKKKLEGEGGEREGTIGGEGGGGIERGGEKERKVGGRSELENK RMNLLKLQYGMELDYDKRSQWKYRDILDLRNAVYLWFDRE >gi568815575r:3512203_3813253|GENSCAN_predicted_CDS_9|303_bp atgagggagaaggagatgaaggaggaggaggaaaaggaggaggaggaggagaaaaagaaa aaattggagggagaaggaggagaaagagaaggaacaataggaggagaaggaggaggagga atagaaagaggaggagaaaaagaaagaaaagttggagggaggagtgaattggaaaacaaa agaatgaacctcctcaaacttcagtatggcatggagttggactatgacaagagaagccag tggaagtatagagacatcttggacttaagaaatgctgtgtacctatggtttgacagagag tag >gi568815575r:3512203_3813253|GENSCAN_predicted_peptide_10|144_aa XSIHNSKDTESTPVPIHVEWIRKMWYIYTMDYYTAIRKNKIMSFVATKMQLEAIILSKLI QEPNMGHILTYKVGRTLSLRVRKQKQVWFSLIMPQDEKAETGYLSGDSGPTLIQDELILR SLTYICITYFTHILQYICEDRISK >gi568815575r:3512203_3813253|GENSCAN_predicted_CDS_10|435_bp nnctctattcacaatagcaaagacacggaatcaaccccggtgcccatccatgtggagtgg ataaggaaaatgtggtacatatacaccatggactactacacagccataagaaagaataaa atcatgtccttcgtagcaacaaagatgcagctggaggccattattctaagcaaattaatt caggaaccaaatatggggcatattctcacttataaagtgggaagaactctctcactgcga gtgagaaagcagaaacaggtgtggttttccttaataatgcctcaggatgagaaagcagaa acaggatacttgtcaggggactcagggcccaccctaatccaggatgagctcatcttgaga tccttaacctacatctgcatcacttacttcacacacatccttcagtacatctgtgaagac cggatttccaaataa