GENSCAN 1.0 Date run: 6-Nov-116 Time: 14:27:18 Sequence gi568815584r:50230077_50495871 : 265795 bp : 40.06% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 887 1183 297 1 0 74 71 132 0.784 7.37 1.02 Intr + 1419 1596 178 2 1 78 89 60 0.746 3.67 1.03 Term + 2397 2554 158 1 2 7 43 150 0.584 -0.49 1.04 PlyA + 2907 2912 6 1.05 2.11 PlyA - 5210 5205 6 1.05 2.10 Term - 17177 16982 196 1 1 122 36 236 0.917 17.70 2.09 Intr - 35413 35282 132 0 0 105 82 60 0.730 5.94 2.08 Intr - 37834 37677 158 1 2 72 60 98 0.764 3.29 2.07 Intr - 39254 39087 168 2 0 77 94 116 0.890 10.32 2.06 Intr - 48478 48444 35 2 2 122 98 9 0.866 2.32 2.05 Intr - 53957 53795 163 2 1 78 103 180 0.953 17.13 2.04 Intr - 64170 64039 132 0 0 58 110 79 0.990 7.02 2.03 Intr - 72092 71941 152 0 2 100 86 110 0.997 10.96 2.02 Intr - 72941 72826 116 1 2 19 77 137 0.927 4.87 2.01 Init - 82074 81935 140 0 2 51 109 58 0.750 2.63 2.00 Prom - 82780 82741 40 -3.45 3.00 Prom + 87112 87151 40 -2.85 3.01 Init + 89158 89225 68 0 2 66 30 83 0.483 0.90 3.02 Intr + 92435 92643 209 2 2 84 28 218 0.602 13.20 3.03 Intr + 93869 94040 172 0 1 89 71 102 0.490 6.68 3.04 Intr + 95533 95569 37 1 1 100 46 -8 0.010 -6.45 3.05 Intr + 101946 102160 215 2 2 70 91 174 0.078 12.49 3.06 Term + 105383 105482 100 2 1 84 41 135 0.560 5.02 3.07 PlyA + 105846 105851 6 -0.45 4.34 PlyA - 105875 105870 6 -3.74 4.33 Term - 106144 105917 228 1 0 39 48 177 0.781 4.45 4.32 Intr - 108953 108871 83 0 2 87 101 84 0.935 8.04 4.31 Intr - 111156 110956 201 1 0 36 106 360 0.704 30.94 4.30 Intr - 112146 112056 91 0 1 56 61 52 0.166 -2.05 4.29 Intr - 112982 112830 153 2 0 28 72 94 0.056 1.15 4.28 Intr - 128068 127948 121 0 1 38 76 64 0.164 -0.22 4.27 Intr - 129073 128952 122 1 2 91 96 75 0.505 6.97 4.26 Intr - 132318 132110 209 1 2 37 52 124 0.593 1.57 4.25 Intr - 132961 132814 148 1 1 58 73 163 0.965 10.69 4.24 Intr - 146343 146261 83 1 2 111 57 2 0.060 -2.16 4.23 Intr - 147653 147465 189 0 0 71 81 155 0.895 11.74 4.22 Intr - 148356 148101 256 0 1 88 103 102 0.633 7.79 4.21 Intr - 150156 150029 128 1 2 47 63 64 0.319 -0.72 4.20 Intr - 160108 160053 56 0 2 81 91 32 0.683 0.50 4.19 Intr - 160376 160201 176 2 2 -5 36 192 0.116 2.52 4.18 Intr - 165848 165625 224 0 2 6 86 276 0.772 16.12 4.17 Intr - 167205 166982 224 2 2 64 37 231 0.648 12.45 4.16 Intr - 167560 167510 51 0 0 91 37 139 0.567 6.20 4.15 Intr - 180572 180370 203 2 2 29 49 239 0.002 11.26 4.14 Intr - 195901 195831 71 2 2 74 99 39 0.896 1.58 4.13 Intr - 198678 198586 93 1 0 80 97 87 0.992 7.82 4.12 Intr - 199184 199116 69 0 0 68 98 46 0.781 1.84 4.11 Intr - 204495 204318 178 0 1 95 111 117 0.998 13.27 4.10 Intr - 204989 204886 104 0 2 93 71 74 0.939 5.17 4.09 Intr - 213960 213859 102 1 0 65 43 97 0.598 2.13 4.08 Intr - 215118 214965 154 0 1 46 52 113 0.542 2.22 4.07 Intr - 216045 216003 43 2 1 67 105 7 0.711 -2.38 4.06 Intr - 217405 217338 68 1 2 106 108 59 0.847 6.48 4.05 Intr - 224174 224115 60 2 0 97 99 11 0.434 1.11 4.04 Intr - 232705 232589 117 1 0 93 94 126 0.979 13.44 4.03 Intr - 237123 237102 22 2 1 58 94 27 0.336 -3.07 4.02 Intr - 238706 238575 132 1 0 67 20 138 0.014 3.74 4.01 Intr - 245073 245001 73 1 1 89 99 52 0.014 3.95 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 133165 133145 21 1 0 60 101 13 0.832 -0.81 S.002 Term + 137322 137499 178 0 1 81 47 163 0.867 7.68 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584r:50230077_50495871|GENSCAN_predicted_peptide_1|210_aa MKSSFMIITKPSFRDQAKKKNLRNVLQNATGNCEPPIASQRETARPRGLEPCGRDDLLPV RSSGGHGPPAGRPAPSTDLPQGRDQQSPPFRTVLLAEELRPQRKRGDPQARATPGGDLLS RRRWPASGAAPPPLSGFAARQGYSWPSPAPTPATAAAGGPSLARAPVRSQNHIVFTKLSR YFRIVFLIVSSKLCRLQVLQNLDRRLGNTH >gi568815584r:50230077_50495871|GENSCAN_predicted_CDS_1|633_bp atgaaatccagttttatgatcataacaaaaccttccttccgggaccaggcaaaaaaaaaa aacttgagaaacgtccttcagaatgcaacaggcaattgtgagccccccattgccagccaa cgagaaacggctcggccccggggactcgagccctgtgggagggacgacctgctccccgtt agaagctcggggggccacgggccacccgccggccgccccgcccctagcactgaccttccg cagggccgagaccaacagtccccgccatttcggactgttctcctcgctgaagaactccgc ccgcagaggaagcgcggcgacccgcaagcccgggcgaccccgggcggcgacctcctttcc cggcggcgctggcccgcttctggggcggctcctcctcctttgtctgggttcgcggcccgt caggggtacagctggcccagccccgccccgactcccgctacggccgcggcgggcggtcct tcacttgcacgggcccctgtgcgttcacaaaatcatatcgtttttacaaaactttcaaga tattttcgaattgttttcttaatcgtgtcatccaaattgtgtaggcttcaggtcctacaa aacctggatcggcgtctaggaaacactcattag >gi568815584r:50230077_50495871|GENSCAN_predicted_peptide_2|463_aa MVPALRYLVGACGRARGLFAGGSPGACGFASGRPRPLCGGSRSASTSSFDIVIVGGGIVG LASARALILRHPSLSIGVLEKEKDLAVHQTGHNSGVIHSGIYYKPESLKAKLCVQGAALL YEYCQQKGISYKQCGKLIVAVEQEEIPRLQALYEKGLQNGVPGLRLIQQEDIKKKEPYCR GLMAIDCPHTGIVDYRQVALSFAQDFQEAGGSVLTNFEVKGIEMAKESPSRSIDGMQYPI VIKNTKGEEIRCQYVVTCAGLYSDRISELSGCTPDPRIVPFRGDYLLLKPEKCYLVKGNI YPVPDSRFPFLGVHFTPRMDGSIWLGPNAVLAFKREGYRPFDFSATDVMDIIINSGLIKL ASQNFSYGVTEMYKACFLGATVKYLQKFIPEITISDILRGPAGVRAQALDRDGNLVEDFV FDAGVGDIGNRILHVRNAPSPAATSSIAISGMIADEVQQRFEL >gi568815584r:50230077_50495871|GENSCAN_predicted_CDS_2|1392_bp atggtgccagcgctgcgttatttggttggtgcctgcggacgggcccgcgggcttttcgcc ggtggctcccctggggcgtgcgggttcgcgtctgggaggccaagaccgctgtgtggaggt agccgcagcgccagcaccagctcatttgatatagtcatcgttggtggcggaattgtgggg cttgcctctgccagagcactcatcctgcgacatccatcactttctattggtgttctggaa aaggagaaagatttagctgttcaccagactggacataacagtggtgtcatacatagtgga atttattataaacctgagtctctgaaagccaaattatgtgtacaaggtgcagccctcctc tatgagtactgtcagcaaaagggaatttcctacaagcagtgtggcaagcttatagtagct gttgaacaagaagaaattcccagacttcaggccctatatgagaaaggcctccagaatggt gtcccgggcctgaggctgatccagcaggaggatataaaaaagaaggagccatattgtagg ggtctaatggctattgattgtccacatactggcattgtggactatcggcaggtggctttg tcatttgcccaggatttccaagaagcaggtggctctgtcttgaccaattttgaagtaaaa ggtattgaaatggctaaagaaagtccttcaagaagtatagatggaatgcaatatccaatt gttataaagaatacaaagggagaggaaattcgatgtcagtatgttgtgacatgtgcagga ctttactcagaccgtatttcagagttgagtggctgcactcctgatcctcgaattgtacca ttccggggagattacctgcttttgaagccagaaaaatgttatcttgtaaaaggaaatatt tatccggtcccagatagccggtttcctttcctaggagttcacttcacaccaaggatggat ggcagtatttggctagggcctaatgcagttcttgcctttaaacgagagggttacagaccc tttgacttcagtgccacagatgttatggatataattatcaatagtggcttgattaaactg gcatcccagaatttttcctatggagttactgaaatgtataaagcatgttttcttggtgca acagtgaagtatcttcaaaaattcatccctgaaattactatcagtgatatacttaggggc ccagctggagtaagagcccaggccctggatagagatggaaatctggtagaagattttgta tttgatgcaggagttggggatattggaaatcgcattcttcatgtgagaaatgcaccttct cctgctgctacttcttccattgcaatttctggaatgattgcagatgaagtacaacaaaga tttgaattataa >gi568815584r:50230077_50495871|GENSCAN_predicted_peptide_3|266_aa MIRGSGAEACSNEDGCSEKGMKGVDYDRIRDVGPDRAASEWLLRCGAMVRYHGQERWQKD YNHLPTGPLDKYKIQAIDATDSCIMSIGFDHMEGLEHVEKIRLCKCHYIEDDCLLRLSQL ENLQKTILEMEIISCGNITDKGIIALRHLRNLKYLLLSDLPGAGNPGPCVASGSSRTRAP LGFRERGLQETDVPGALVDRSYDLSPSAGMRMLELKATTAILKSRFQLSVPGGSLLFKQS PVGHSSISFWPYKAILRQSRAGDNQE >gi568815584r:50230077_50495871|GENSCAN_predicted_CDS_3|801_bp atgataagaggtagtggagcagaggcttgcagtaatgaagacggctgtagtgaaaaaggg atgaaaggggtggattatgatcgcatcagggatgttggccctgacagggcggcatccgag tggttgctgcgctgtggggccatggtgcgctaccatggccaggagaggtggcagaaggac tacaaccaccttccaacaggccctctggacaaatacaagattcaggcgatcgacgccacc gactcttgtatcatgagcattggatttgatcacatggagggcctagagcatgttgaaaaa ataaggctgtgcaagtgtcattatatcgaggatgactgtttgctgagacttagtcaactt gaaaatttacaaaaaaccatattggaaatggaaataatatcctgtgggaatatcacagac aaaggcatcattgctttgcgtcatttaagaaacctcaaatatttgttgttaagtgatctt cctggagctggaaaccctggcccctgtgtggcatcaggaagcagcaggacaagggcacct ttaggattcagggagagggggcttcaggagacagatgtccctggggccctggttgataga tcctatgacctgtctcctagtgctggaatgaggatgctggagctgaaagccacaactgcc atcctcaagtctagattccaactctctgtaccaggtggcagcctgctgtttaaacagtct cctgtggggcattcgtcaatctccttttggccgtataaggcgattttgcgccagtcacgt gctggagacaatcaggaataa >gi568815584r:50230077_50495871|GENSCAN_predicted_peptide_4|1410_aa XDFGVAAKITATIAKRKSFIGTPYWMAPEVAAVEKNGGYNQLCDIWAVGITAIELGELQP PMFDLHPMSPQENGSHHTFVAQPGLSRALAVELLDKVNNPDNHAHYTEADDDDFEANYYS DLCHINWLLPDLDLQGLSSDPNFMLQWNPFVDGANTGKSTSKRAIPPPLPPKPRISSYPE DNFPDEEKASTIKHCPDSESRAPQILRRQSSPSCGPVAETSSIGNGDGISKLMSENTEGS AQAPQLPRKKDKRDFPVVRNPYTGHKYLCGALQSGIVLLQWYEPMQKFMLIKHFDFPLPS PLNVFEMLVIPEQEYPMVCVAISKGTESNQVVQFETINLNSASSWFTEIGAGSQQLDSIH VTQLERDTVLVCLDKFVKIVNLQGKLKSSKKLASELSFDFRIESVVCLQDSVLAFWKHGM QGKSFKSDESNFPPCDSRVSKEEAQLPFPTPSSYGADSSNQTYAHKSQSEREQHKEVRAV LNHCSGLDGGEESGLCGTSTRIDAGTGNGTDDKVTDQHRHRRCLQGTKGNNRGSEVWGLL LQGNVDRSGGAPSAGVLLRRRGYSCALHGLRKFANLAGLLSRQQDSARGVSHHSRLKIHF KKIYSSMMEKYEKIGKIGEGSYGVVFKCRNRDTGQIVAIKKFLESEDDPVIKKIALREIR MLKAPSPYAAEPSLCGMKMVRRGKKEFLPAVAEKVDAPSGVGGQGQDSVTVGSLGRRSTY GRKQEKQVRQREGIIYCYVAVLLRIYYFDQGCVAREEEQFQELVFGPFCHIGSYFTGHRT NVRPYILLLSRPSPFKTAAGTYEAGLVILECSYFLAEQEPYCPTQALQQPHPIIGPWALE GGGVESKEDRHPPPKEAPASCEGFLRSAVPKQAYTPFKTSPDKRLSDCVATPPWAPPTPL IISSGVLVAICSMIDPVPEFHSEGLLAKATSGSAGILVWIFLCNDAFIYGKYILRSGVLW VRGLPGFKSEAAALCHKCYSSADPKREQQQDLLQRVKEQSFHSVNGAQARHKGSPSPHQT QEPSWPHPVDPTPGHRWSCLPVPCRAPALLSPWVVDGTGCCGAGGGSDRGGSAAQEPTQL KHPNLVNLLEVFRRKRRLHLVFEYCDHTVLHELDRYQRGICNIFVCTGRRLGEHTEALSK KKKKGGGGPFLKLRAASCRITLFKNVGCGLETTGDLSLNSGGGAASRGVAAALRALVCGT ELTSSDSPQRCIHRDVKPENILITKHSVIKLCDFGFARLLTGPSDYYTDYVATRWYRSPE LLVGDTQYGPPVDVWAIGCVFAELLSGVPLWPGKSDVDQLYLIRKTLGDLIPRHQQVFST NQYFSGVKIPDPEDMSLCLSVTLTEGGLLASGAVKRSQMGSSVSQATSWPHPDIVAETAE LDDIAMARQTPVMLRFNRQKEQEKYLSYGA >gi568815584r:50230077_50495871|GENSCAN_predicted_CDS_4|4233_bp nctgactttggtgtggctgcaaaaataacagctaccattgcaaaacgaaaatctttcatt ggcaccccttactggatggccccagaagttgcagcagtagagaagaatggtggctacaac caactctgtgatatctgggcagtaggaataacagcaattgaacttggagaacttcagcca cctatgtttgatctccacccaatgagcccacaagaaaatgggtcacaccacacttttgtt gcacagccaggtctctctagagccctagcagttgaactgttagacaaagtgaacaatcca gataaccacgcacattacactgaagcagatgacgatgactttgaggcgaattactattct gatctctgtcacataaactggcttttacctgatcttgaccttcagggattgtcatcagac ccaaatttcatgttacagtggaatccttttgttgatggtgcaaatactggcaaatcaacc tcaaaacgtgcaataccacctcccctacctcctaagccaaggataagcagttaccctgaa gacaactttccggatgaagaaaaagcatcaaccataaaacattgtcctgattcagaaagc agagctccccaaattctcagaagacagagtagcccaagttgtgggcctgtggcagagact tcttctattggaaatggtgatggtatttcaaaactgatgagtgaaaatacagaaggatca gcacaagcaccacagttaccacgaaaaaaggacaaacgagacttccctgtagtcagaaac ccttacacgggacataaatacctctgtggagctttacagtctggaattgttttacttcag tggtatgagccaatgcagaaattcatgttgataaagcactttgattttcctttgccaagt cctttgaatgtttttgaaatgctggtgatacctgaacaggaataccctatggtctgtgta gctattagcaaaggcactgaatcgaatcaggtagttcagtttgagacaatcaatttgaac tctgcatcttcatggtttacagaaattggtgcaggcagccagcagttagattccattcat gtaacacagttggagagagataccgttttagtgtgtttagacaaatttgtgaaaattgta aatctacaaggaaaattaaaatcaagtaagaaactggcctctgagttaagttttgatttt cgcattgaatctgtagtatgccttcaagacagtgtgttggctttctggaaacatgggatg cagggtaaaagcttcaagtcagatgagagtaatttcccgccttgtgactctcgtgtctcc aaagaagaagcccaactccctttcccaacacctagcagctacggtgcagactcctccaac cagacttacgcacacaagtctcagtctgaaagagagcagcacaaggaagtacgtgctgtc ctgaaccattgttctggcttggatggtggagaggagtctggcctttgtgggacaagtacc cgcattgatgctggcactgggaacgggacggatgataaagtcactgatcagcacagacac agaagatgcctccaagggaccaaagggaataaccgcggttctgaggtgtggggactgctg ttgcagggaaatgtggaccgctcaggaggggctcctagcgcaggcgtcttgctccgcaga cgcggttacagctgcgctttgcacggcctgcggaagtttgcaaatctagcaggtctgctc tcccggcagcaggactcggcccgcggcgtgagccaccattctcggctgaagatccatttt aagaagatttattcctctatgatggagaagtatgaaaaaattgggaaaattggagaagga tcctatggagttgttttcaaatgtagaaacagggacacgggtcagattgtggccatcaag aagtttctggaatcagaagatgaccctgtcataaagaaaattgcccttcgggaaatccga atgctcaaggccccaagcccctatgctgctgaaccttctctctgtggaatgaaaatggta agaagaggaaagaaagaatttctgccagcagtggcagaaaaagtagatgctcctagcgga gttggaggtcaaggacaggacagtgtcactgtgggatccctgggcagacgttctacctat ggcagaaaacaagaaaaacaagtgagacaaagagaaggaatcatctactgctatgtggca gttttgcttcggatttactattttgaccagggttgtgttgccagggaggaagagcagttt caggagcttgtgtttggccctttctgccatattggctcttacttcacaggtcatagaacc aatgtgaggccttacattttactactatcacggcccagccccttcaagacggctgctggt acctatgaggcaggtcttgtcattcttgagtgcagttatttcctcgcagagcaagaacct tattgccctactcaggcactgcaacaacctcatcctattataggtccgtgggccctagaa ggaggtggggtagaatctaaggaggacaggcatccccctcctaaggaggcacctgcatca tgtgagggatttttaaggtctgcagtgccgaagcaagcttacaccccattcaaaaccagc cctgacaaaaggcttagtgactgtgttgctacacctccctgggctcctcccaccccactt atcatcagcagtggggtccttgttgccatctgctccatgattgatccagttccagaattc cattctgaaggcctgctagctaaggccacttcaggaagtgctggaatcctggtatggata tttttatgcaatgatgcttttatttatggaaagtatattctcagaagtggagtgctatgg gttcgtggtctccctggcttcaagagtgaagctgcagccctttgccataagtgttacagc agtgcggacccaaagcgtgagcagcagcaagatttattgcaaagagtgaaagaacaaagc ttccacagtgtgaatggtgcccaggctagacataaaggttctccaagtccccaccagact caggagcccagctggcctcacccagtggatcccacaccaggccacaggtggagctgcctg ccagtcccatgccgtgcgcccgcactcctcagtccttgggtggtcgatgggaccgggtgc tgcggagcagggggcggttctgatcgtggaggctcggccgcgcaggagcccacgcaactc aagcatcccaaccttgttaacctcctggaagtcttcaggaggaaacggaggcttcacctg gtgtttgaatattgtgaccacacagttctccatgagttggacagataccaaagaggcata tgcaacatctttgtgtgcactggaagaagacttggtgagcacacagaggccttgtccaag aaaaagaaaaaaggaggagggggtcccttcctgaagttgagggcagcctcttgcaggatc accctgtttaagaatgttggctgtgggctggaaaccacaggtgatttgagtctaaattct ggagggggggccgcatctcgaggggtggctgcagctctcagagcacttgtgtgtgggaca gagctgacatctagcgactctccccagaggtgcatacatagagacgtgaagccagaaaat atcctcatcacgaaacattccgtgattaagctttgtgactttggatttgctcggcttttg actggaccgagtgactactatacagactacgtggctaccaggtggtaccgctcccctgag ctgctggtgggggacacgcagtacggccccccggtggatgtttgggcaattggctgtgtc tttgctgagctgctgtcaggagtgcctctgtggccaggaaaatcggatgtggatcagctg tatctgattaggaagaccttgggggatctcattcctaggcaccagcaagtgtttagcacg aatcagtacttcagtggagtgaaaattccagaccctgaagatatgtccctctgcctgtca gtaaccctgacagagggaggcctgcttgcttctggggctgtgaaacgctcacagatgggc tccagcgtatcacaggcaaccagttggcctcatccagacattgttgctgagacagcagag ctcgatgatatagcaatggcacgtcaaaccccagtgatgctcagattcaaccgacagaaa gaacaagagaaatatttaagctatggagcatga