GENSCAN 1.0 Date run: 5-Nov-116 Time: 00:48:21 Sequence gi568815593r:173132572_173335083 : 202512 bp : 47.27% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 11975 12058 84 1 0 115 84 115 0.940 14.62 1.02 Intr + 14295 14387 93 2 0 70 94 27 0.714 1.66 1.03 Intr + 21751 21842 92 0 2 74 96 73 0.975 5.49 1.04 Intr + 27362 27480 119 2 2 97 71 156 0.999 14.91 1.05 Term + 31154 31350 197 0 2 94 50 290 0.982 23.37 1.06 PlyA + 31765 31770 6 1.05 2.05 PlyA - 31782 31777 6 1.05 2.04 Term - 42896 42782 115 1 1 40 43 97 0.299 -1.56 2.03 Intr - 45032 44925 108 1 0 66 99 45 0.494 2.80 2.02 Intr - 51407 51253 155 2 2 83 102 67 0.172 6.47 2.01 Init - 61126 61052 75 1 0 80 79 31 0.021 2.49 2.00 Prom - 72610 72571 40 -5.36 3.03 PlyA - 73147 73142 6 -0.45 3.02 Term - 73365 73215 151 2 1 84 43 141 0.944 6.58 3.01 Init - 76215 76145 71 0 2 49 78 63 0.707 1.92 3.00 Prom - 76379 76340 40 -2.16 4.00 Prom + 80155 80194 40 -7.76 4.01 Sngl + 81128 81310 183 1 0 97 48 215 0.488 11.66 4.02 PlyA + 81704 81709 6 1.05 5.10 PlyA - 85990 85985 6 1.05 5.09 Term - 93447 93265 183 0 0 62 36 226 0.987 12.34 5.08 Intr - 96408 96016 393 0 0 32 65 135 0.278 0.45 5.07 Intr - 97393 97191 203 2 2 98 60 116 0.941 8.80 5.06 Intr - 99060 98893 168 1 0 -21 65 140 0.079 0.72 5.05 Intr - 100638 100015 624 1 0 103 23 1145 0.124 101.72 5.04 Intr - 102654 102179 476 2 2 80 100 362 0.041 29.31 5.03 Intr - 120032 119939 94 0 1 140 100 20 0.516 7.42 5.02 Intr - 125707 125581 127 1 1 66 79 75 0.491 4.65 5.01 Init - 135148 135065 84 1 0 65 90 52 0.727 3.92 5.00 Prom - 136471 136432 40 -4.16 6.00 Prom + 144714 144753 40 -6.16 6.01 Init + 145224 145272 49 2 1 83 115 22 0.706 5.54 6.02 Intr + 151308 151488 181 1 1 67 54 177 0.161 11.13 6.03 Term + 158278 158467 190 1 1 95 47 58 0.072 -0.68 6.04 PlyA + 158871 158876 6 -0.45 7.07 PlyA - 160576 160571 6 1.05 7.06 Term - 161401 161214 188 2 2 62 37 62 0.379 -3.85 7.05 Intr - 161933 161857 77 1 2 100 131 63 0.966 10.96 7.04 Intr - 162133 161986 148 2 1 50 44 110 0.974 2.09 7.03 Intr - 162737 162603 135 0 0 107 51 82 0.981 6.94 7.02 Intr - 165177 165064 114 1 0 59 76 48 0.286 1.12 7.01 Init - 172551 172488 64 0 1 84 88 62 0.839 7.11 7.00 Prom - 179598 179559 40 -5.56 8.10 PlyA - 181935 181930 6 1.05 8.09 Term - 185678 185276 403 1 1 107 44 396 0.940 31.73 8.08 Intr - 186585 186514 72 2 0 59 53 86 0.613 0.72 8.07 Intr - 190859 190648 212 2 2 108 72 249 0.736 23.01 8.06 Intr - 193439 193297 143 0 2 77 98 126 0.968 12.57 8.05 Intr - 195011 194851 161 1 2 58 16 100 0.483 -0.57 8.04 Intr - 195220 195045 176 1 2 73 61 165 0.822 11.04 8.03 Intr - 195896 195724 173 0 2 40 -17 156 0.754 -0.04 8.02 Intr - 196186 196050 137 0 2 50 57 90 0.810 2.31 8.01 Init - 198930 198887 44 0 2 91 100 50 0.698 6.39 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 100638 99998 641 1 2 103 50 1143 0.826 106.37 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593r:173132572_173335083|GENSCAN_predicted_peptide_1|194_aa MAAPQDVHVRICNQEIVKFDLEVKALIQDIRDCSGPLSALTELNTKVKEKFQQLRHRIQD LEQLAKEQDKESEKQLLLQEVENHKKQMLRKTTKESLAQTSSTITESLMGISRMMAQQVQ QSEEAMQSLVTSSRTILDANEEFKSMSGTIQLGRKLITKYNRRELTDKLLIFLALALFLA TVLYIVKKRLFPFL >gi568815593r:173132572_173335083|GENSCAN_predicted_CDS_1|585_bp atggcggctccccaagacgtccacgtccggatctgtaaccaagagattgtcaaatttgac ctggaggtgaaggcgcttattcaggatatccgtgattgttcaggacccttaagtgctctt actgaactgaatactaaagtaaaagagaaatttcaacagttgcgtcacagaatacaggac ctggagcagttggctaaagagcaagacaaagaatcagagaaacaacttctactccaggaa gtggagaatcacaaaaagcagatgctcaggaaaaccaccaaagagagcctggcccagaca tccagtaccatcactgagagcctcatggggatcagcaggatgatggcccagcaggtccag cagagcgaggaggccatgcagtctctagtcacttcttcacgaacgatcctggatgcaaat gaagaatttaagtccatgtcgggcaccatccagctgggccggaagcttatcacaaaatac aatcgccgggagctgacggacaagcttctcatcttccttgcgctagccctgtttcttgct acggtcctctatattgtgaaaaagcggctctttccatttttgtga >gi568815593r:173132572_173335083|GENSCAN_predicted_peptide_2|150_aa MDTDVDLHSIGGMSKDLQPRFKTTTGGHWRILKEAMACASEREGLTQGTNLNEESPAALG PVRSCQNDPSKAQGGPSHQWPFHMDLLQMTSLAEGPGGRDMTPFSQPSSQSVRNWVALQE VSRQRALPPELYPLSDQQRHEILIGARTLL >gi568815593r:173132572_173335083|GENSCAN_predicted_CDS_2|453_bp atggacacagacgtagacctgcactccataggaggaatgtcaaaggatttgcagccacgt tttaaaaccaccacaggaggccactggagaattctcaaggaggccatggcatgtgcctct gaaagagaggggctgacgcagggcaccaacttgaatgaggaaagtccggctgcattgggc ccagtcagaagctgccagaatgacccaagcaaggcacagggaggcccaagccaccagtgg cctttccacatggacttgctgcaaatgaccagtctggctgagggccctgggggccgggac atgacacccttctctcagccctcatcacagtctgtgaggaactgggtggccctgcaggag gtcagccgccagcgagcattaccgcctgagctctacccactgtcggatcagcagcggcat gagattctcataggagcgagaaccctattgtga >gi568815593r:173132572_173335083|GENSCAN_predicted_peptide_3|73_aa MEQLEEDLRREFKTEGAARAKALRNRRGFVLSCASEERHPLRGHWMPHAFDVKVPFENGP TLYQQVPETPPGY >gi568815593r:173132572_173335083|GENSCAN_predicted_CDS_3|222_bp atggagcagctggaggaagatctgagaagagagtttaagacagagggagcagctcgtgca aaggccctgagaaacaggagaggtttcgtcctgagctgtgcaagtgaggagcgtcaccct ctgagagggcactggatgccacacgctttcgatgtcaaggtcccttttgaaaatggccct actctgtaccaacaagttccagagacaccaccaggctactaa >gi568815593r:173132572_173335083|GENSCAN_predicted_peptide_4|60_aa MPEPAPDAVGSCAAPASPTSAAPCSTAPGPIDCPRAEECRRKHHGTGRQLRLRPGSRTTE >gi568815593r:173132572_173335083|GENSCAN_predicted_CDS_4|183_bp atgcctgagcctgcccctgacgccgtgggctcctgcgcggccccagcctccccgacgagc gccgccccctgctccacggcgcccggtcccatcgactgcccaagggctgaggagtgccgg cgcaagcaccacgggactggcaggcagctccgcctgcggcccgggtccaggaccactgag tga >gi568815593r:173132572_173335083|GENSCAN_predicted_peptide_5|783_aa MSISVKAKHKPHKSLSPKLEVVNGWETEHPHGANQDEPDPRQTTQSKQRTQSLFSTSMAQ QLEISTVSSTGEAPRAAKEREAEKKWAGEPLSGAPRYGGRSSATCCPDTSRAGRRVRGRA AAPCREAARGRGQRRFLPPTWRCETGAATMFPSPALTPTPFSVKDILNLEQQQRSLAAAG ELSARLEATLAPSSCMLAAFKPEAYAGPEAAAPGLPELRAELGRAPSPAKCASAFPAAPA FYPRAYSDPDPAKDPRAEKKELCALQKAVELEKTEADNAERPRARRRRKPRVLFSQAQVY ELERRFKQQRYLSAPERDQLASVLKLTSTQVKIWFQNRRYKCKRQRQDQTLELVGLPPPP PPPARRIAVPVLVRDGKPCLGDSAPYAPAYGVGLNPYGYNAYPAYPGYGGAACSPGYSCT AAYPAGPSPAQPATAAANNNFVNFGVGDLNAVQSPGIPQSNSGVSTLHACFQMEKPRTQA LGKRSQAPQMPPDATSGPHQLPVGLSAEAGPQRGVLRMVLTSSCDRETEEVTWVVKIIQS LSGNGSWETHRTHTGESVPEARARHTVDVSPVKMQTEWLQAALSRTQTLASGDARKPQVS PLTRKHHAVPRGSSTVFPAAPAPRILREAFRAQAQRFYAAGNSLVASKPCRLPYQAELLE KGACGLPLIYHSQSLSRARHCSELFTSEPIESLFIPLGYREETDEGFILNPERRSTLPRV TQLVNSEDKRALAKLVEAIRTNYNDKYDEIHRHWGGSVLGPKSVVHIAKLEKAKAKELAT KLG >gi568815593r:173132572_173335083|GENSCAN_predicted_CDS_5|2352_bp atgagtatttcagtgaaagcaaaacataagccgcacaaatctctgagccccaaactcgaa gttgtaaatggttgggagactgagcatccacacggtgcaaatcaagatgaaccagatccc cggcagaccacacagtccaagcaaaggacgcagtcattgtttagcacatccatggctcag cagttggaaatatccaccgtctccagcacaggagaagccccacgagcagcgaaggaaaga gaagcagaaaaaaagtgggcaggtgaacctttgtcaggggcaccccgctacggaggaagg tcaagcgctacctgctgcccggacacatccagagctggccgacgggtgcgcgggcgggcg gcggcaccatgcagggaagctgccaggggccgtgggcagcgccgctttctgccgcccacc tggcgctgtgagactggcgctgccaccatgttccccagccctgctctcacgcccacgccc ttctcagtcaaagacatcctaaacctggaacagcagcagcgcagcctggctgccgccgga gagctctctgcccgcctggaggcgaccctggcgccctcctcctgcatgctggccgccttc aagccagaggcctacgctgggcccgaggcggctgcgccgggcctcccagagctgcgcgca gagctgggccgcgcgccttcaccggccaagtgtgcgtctgcctttcccgccgcccccgcc ttctatccacgtgcctacagcgaccccgacccagccaaggaccctagagccgaaaagaaa gagctgtgcgcgctgcagaaggcggtggagctggagaagacagaggcggacaacgcggag cggccccgggcgcgacggcggaggaagccgcgcgtgctcttctcgcaggcgcaggtctat gagctggagcggcgcttcaagcagcagcggtacctgtcggcccccgaacgcgaccagctg gccagcgtgctgaaactcacgtccacgcaggtcaagatctggttccagaaccggcgctac aagtgcaagcggcagcggcaggaccagactctggagctggtggggctgcccccgccgccg ccgccgcctgcccgcaggatcgcggtgccagtgctggtgcgcgatggcaagccatgccta ggggactcggcgccctacgcgcctgcctacggcgtgggcctcaatccctacggttataac gcctaccccgcctatccgggttacggcggcgcggcctgcagccctggctacagctgcact gccgcttaccccgccgggccttccccagcgcagccggccactgccgccgccaacaacaac ttcgtgaacttcggcgtcggggacttgaatgcggttcagagccccgggattccgcagagc aactcgggagtgtccacgctgcatgcctgtttccagatggaaaagccaagaacccaagcc cttggcaagcgttctcaggctcctcagatgcccccagatgccacgtcggggcctcatcag ctgcccgtgggactgagtgccgaggctggaccccagagaggtgtcctgcggatggtgctc acctccagctgtgaccgggaaactgaggaggttacgtgggtggtcaagatcatccagtct ctaagcgggaacgggagctgggaaacacacaggacacatactggggagagtgttcctgaa gccagagcccgccacactgtggatgtctctccagtaaagatgcagacagaatggctccag gccgcgctgagcagaacccagactcttgccagtggggacgcgcggaagccgcaggtttca ccgctgactcggaaacatcacgcggtcccgcgcgggagtagcaccgtcttccccgcagcg cccgcccctcgcatcctccgggaagcattccgagctcaggcccagcgcttctacgccgca ggcaacagccttgtggcctctaagccttgccgactcccctaccaggctgagctcctggag aaaggggcctgtggcctgcctttaatttatcactcacagagtttatcgcgtgccaggcac tgttctgagctctttacaagtgaacccattgaatccctttttatacctctgggttatcgt gaggaaacagatgaaggattcattttaaacccagaaagaaggagtacgttgcccagggtc acacagctggttaactcagaagacaaaagagctttggctaagctggtggaagctatcagg accaattacaacgacaaatatgatgagatccaccgtcactggggaggcagtgtcctgggt cccaagtctgtggttcacatcgccaagcttgaaaaggcaaaggctaaagaacttgccacc aaactgggttaa >gi568815593r:173132572_173335083|GENSCAN_predicted_peptide_6|139_aa MLNLCNFFSSTVCGSDAALQEPARRQREGETGRHVLTRRPPEPALPAGCARMPADAELLP PCGLVRGLLLLTGGASRPLIALEQRDDSLIINFTKIEVGLCQQSPLTVTQMRLFALHGGK GQLGKKKKERKKKAGVVPA >gi568815593r:173132572_173335083|GENSCAN_predicted_CDS_6|420_bp atgctcaacctgtgcaacttcttttcaagtacagtgtgtggatcggatgcggcgctgcag gaaccagcccgccgccagcgcgaaggtgagaccgggcgccacgtgcttacccggcggcct ccggaaccagccctgcccgccggctgtgcgcggatgcctgcagacgccgagctgctgccc ccgtgtggcctggtgcgggggctcctcctgctcactgggggcgcgtcccggcccctaatt gctctggagcagagggatgacagcctaattattaacttcacaaaaattgaagttgggctg tgtcaacaaagtccactcactgtcacccagatgcgactctttgcccttcatggtggcaaa gggcagttaggaaaaaaaaagaaagaaagaaagaaaaaggctggcgtggttccagcttaa >gi568815593r:173132572_173335083|GENSCAN_predicted_peptide_7|241_aa MPPSKVEMVIPTLQIKKQTLRGQGTHGGEAAGAGSGGLPEKAVVFDGLHRKQIQDRKLQD EDSDIPETMNLQRPGSIQVSMATLTSFSLAPEPTCPPRHPLYLAEYTRSTDPMQPIQYWK LSTNKAPREGSEEDDSANGHRENRELTAPSQQADPEERSILQTRGLTLRSKATCHNRITA RSVGGGNRRGRLPFITWWLPGHILYKPGPALSLPQSLSVPLPSNSGESTFKTVLDAVASR L >gi568815593r:173132572_173335083|GENSCAN_predicted_CDS_7|726_bp atgccccccagtaaggtggaaatggttattcctactttacagatcaagaaacagactctc agaggccaggggactcacggtggggaggcagctggagctggcagtggaggcctgcccgag aaggcagtggtcttcgatgggcttcacaggaaacagattcaagacagaaaattgcaagat gaggattcagacatcccagagaccatgaacctccagaggcctgggagcatccaggtctca atggccacactgacatccttcagcttggcacctgagcccacttgcccaccccgacatcct ttgtacctggcagagtacacccgctcaactgatccaatgcaaccaatacagtattggaag ctttctacgaacaaggcacccagagagggctcggaggaagatgacagtgccaacgggcac agggagaatcgtgagcttactgcaccttcccagcaggcagaccccgaggaacggtccatt ctacagacaagaggcctgacgctcagaagcaaagcaacttgccacaatcgcatcaccgcc cgctctgtcggtggaggaaatagacgtggaaggcttcccttcatcacctggtggctgccg ggccacatcctgtacaagccggggccagctctaagtctaccccaaagcctcagcgtgcct cttccctctaactcaggcgaaagtaccttcaagacagtcctggacgcggtggcttcacgc ctgtaa >gi568815593r:173132572_173335083|GENSCAN_predicted_peptide_8|506_aa MGSRRQPQKALDDTSNWSLDSLLYCTVLSTIALEHPHTRTHLRPRSHTLIHTHANAWPPP VTPELPGIFEAPSLPLPGTRGAANGLVTCSPESYGRALGNLYPSRGREEEGKGEQKGRGR GEMKQRLTLPAPAPAPESRVRCKARFIPAGTREAQRVPGAGLRPAPGELLFLACRPQCPT SQNFAGETNGIANFPPGGTAFGREREGSAKRTSAVLPPSPAPGQVSKPRNAEIQHCLVNA GDVGCGVFECFENNSCEIRGLHGICMTFLHNAGKFDAQGKSFIKDALKCKAHALRHRFGC ISRKCPAIREMVSQLQRECYLKHDLCAAAQENTRVIVEMIHFKDLLLHETTQDNRNGLRN WLCPPELGEFIAQPYVDLVNLLLTCGEEVKEAITHSVQVQCEQNWGSLCSILSFCTSAIQ KPPTAPPERQPQVDRTKLSRAHHGEAGHHLPEPSSRETGRGAKGERGSKSHPNAHARGRV GGLGAQGPSGSSEWEDEQSEYSDIRR >gi568815593r:173132572_173335083|GENSCAN_predicted_CDS_8|1521_bp atgggctccaggcgtcaaccacagaaggccctggatgacacaagcaactggagcttggat tcgctcttatattgtacagtcctttcgaccattgccctggagcacccgcacacgcgcacg catctccggccgcgctcacacacactcatacacacgcacgcaaacgcgtggccgccgcca gtgacaccagagcttccagggatatttgaggcaccatccctgccattgccgggcactcgc ggcgctgctaacggcctggtcacatgctctccggagagctacgggagggcgctgggtaac ctctatccgagccgcggccgcgaggaggagggaaaaggcgagcaaaaaggaagaggtcga ggggaaatgaagcagcgtctgacgctgccagcgccagcccccgcccccgaatcccgggtc cggtgcaaagcgcgcttcatcccggccggcacgcgggaggcccagagggtccccggagct gggctgcgccctgcgcccggagaacttctcttcctggcctgccggcctcaatgcccgaca tcccagaactttgccggcgagacaaatggcattgcgaactttccccctgggggcactgct ttcggaagggagcgtgagggcagcgcaaagcgtacctccgcggtgctgccgccatcccca gctcctggccaggtcagcaagccaagaaatgcggagatccagcactgtttggtcaacgct ggcgatgtggggtgtggcgtgtttgaatgtttcgagaacaactcttgtgagattcggggc ttacatgggatttgcatgacttttctgcacaacgctggaaaatttgatgcccagggcaag tcattcatcaaagacgccttgaaatgtaaggcccacgctctgcggcacaggttcggctgc ataagccggaagtgcccggccatcagggaaatggtgtcccagttgcagcgggaatgctac ctcaagcacgacctgtgcgcggctgcccaggagaacacccgggtgatagtggagatgatc catttcaaggacttgctgctgcacgaaacaacccaagacaaccggaatggcctgaggaac tggctctgccctccagaactaggcgagttcattgcacaaccctacgtggacctcgtgaac ttgctgctgacctgtggggaggaggtgaaggaggccatcacccacagcgtgcaggttcag tgtgagcagaactggggaagcctgtgctccatcttgagcttctgcacctcggccatccag aagcctcccacggcgccccccgagcgccagccccaggtggacagaaccaagctctccagg gcccaccacggggaagcaggacatcacctcccagagcccagcagtagggagactggccga ggtgccaagggtgagcgaggtagcaagagccacccaaacgcccatgcccgaggcagagtc gggggccttggggctcagggaccttccggaagcagcgagtgggaagacgaacagtctgag tattctgatatccggaggtga