GENSCAN 1.0 Date run: 4-Nov-116 Time: 14:29:58

Sequence gi568815586r:10520147_10734618 : 214472 bp : 37.52% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.02 PlyA -   1977   1972    6                               1.05
 1.01 Sngl -  17680  17231  450  1  0   27   53   303 0.940  16.66
 1.00 Prom -  24070  24031   40                              -6.45

 2.03 PlyA -  24598  24593    6                               1.05
 2.02 Term -  31305  31200  106  2  1   53   45   168 0.532   5.90
 2.01 Init -  42132  42080   53  0  2   68   98     2 0.092  -0.12
 2.00 Prom -  44828  44789   40                              -3.75

 3.00 Prom +  50107  50146   40                              -4.95
 3.01 Sngl +  50954  51124  171  1  0   48   49   211 0.937   7.78
 3.02 PlyA +  52358  52363    6                               1.05

 4.09 PlyA -  53200  53195    6                               1.05
 4.08 Term -  60979  60550  430  0  1   80   44   201 0.033   8.69
 4.07 Intr -  61284  61212   73  1  1   56   80    35 0.010  -2.95
 4.06 Intr -  75781  75545  237  2  0   21   87   104 0.281   0.26
 4.05 Intr -  78063  77897  167  2  2   96   91    76 0.537   7.38
 4.04 Intr -  87790  87708   83  1  2   77  106    56 0.948   3.82
 4.03 Intr -  89795  89685  111  2  0   60   72   134 0.988   8.66
 4.02 Intr -  90534  90476   59  1  2  117  107    29 0.998   5.38
 4.01 Init -  93386  93293   94  2  1   82   93    54 0.977   5.89
 4.00 Prom -  94184  94145   40                              -7.25

 5.10 PlyA -  96517  96512    6                               1.05
 5.09 Term - 100202  99998  205  1  1  106   49   274 0.970  21.16
 5.08 Intr - 104713 104505  209  1  2   85    9    80 0.187  -3.25
 5.07 Intr - 107578 107495   84  1  0  107   99    82 0.977  10.30
 5.06 Intr - 109528 109347  182  2  2  128   88   131 0.991  15.77
 5.05 Intr - 111162 110899  264  1  0   90   93   214 0.999  18.66
 5.04 Intr - 113978 113844  135  0  0   58   98    97 0.983   7.32
 5.03 Intr - 114540 114421  120  1  0   40  103    77 0.159   3.95
 5.02 Intr - 121911 121839   73  0  1   94   61     5 0.028  -3.54
 5.01 Init - 122180 122103   78  2  0   84  110    14 0.412   4.31
 5.00 Prom - 131114 131075   40                              -4.05

 6.03 PlyA - 131136 131131    6                               1.05
 6.02 Term - 141405 141202  204  0  0   49   32   159 0.212   2.79
 6.01 Init - 154101 153808  294  0  0   42   39   228 0.569  10.53
 6.00 Prom - 155668 155629   40                              -5.05

 7.03 PlyA - 157017 157012    6                               1.05
 7.02 Term - 159953 159873   81  2  0   90   34   116 0.375   3.11
 7.01 Init - 176098 175913  186  1  0   65   60   127 0.129   6.71
 7.00 Prom - 179976 179937   40                              -3.95

 8.14 PlyA - 180737 180732    6                               1.05
 8.13 Term - 181207 181142   66  1  0  104   41    37 0.650  -2.44
 8.12 Intr - 181988 181814  175  1  1   94   84   186 0.927  17.82
 8.11 Intr - 184002 183905   98  0  2  109   77   166 0.578  15.49
 8.10 Intr - 186102 186031   72  0  0   56  102    41 0.174   0.98
 8.09 Intr - 189968 189762  207  2  0   59   98   300 0.924  26.45
 8.08 Intr - 193187 193065  123  2  0  103   63   295 0.999  28.36
 8.07 Intr - 195637 195548   90  1  0   99  100   135 0.998  15.07
 8.06 Intr - 197562 197497   66  0  0   61   55    90 0.691   1.18
 8.05 Intr - 202415 201921  495  2  0   56   43   406 0.671  25.16
 8.04 Intr - 202557 202509   49  2  1   49   99    42 0.398  -0.84
 8.03 Intr - 202805 202704  102  1  0   60  109    94 0.118   7.07
 8.02 Intr - 211013 210872  142  0  1  100   91    34 0.089   3.39
 8.01 Intr - 212926 212825  102  2  0  112   58    64 0.625   5.03

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl -  61080  60550  531  0  0   70   44   277 0.872  15.35

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586r:10520147_10734618|GENSCAN_predicted_peptide_1|149_aa
MVQAPSLGSFPHAVESVGVQKSRIKVWESPPGFQRLYGDTWMSKQKFAAGEGPSWRTSAR
AVWKGNVGLKPPHRVSTGALPSGAVRKGPPSSKPQNGRSTNGLHGVPGKAADTQCQPVKA
GGRLYPAKPQGWSYPRPWKPTSCISVTWM

>gi568815586r:10520147_10734618|GENSCAN_predicted_CDS_1|450_bp
atggtgcaagctccaagccttggcagctttccacatgctgttgagtctgtgggtgtacag
aagtcaagaataaaggtttgggaatctccacctggatttcagaggctctatggagacacc
tggatgtccaagcagaagtttgctgcaggggagggaccttcctggagaacctctgctagg
gcagtgtggaagggaaatgtgggattgaagcccccacacagagtctccactggggcacta
cctagtggagctgtgagaaaagggccaccgtcctccaaacctcagaatggtagatccacc
aatggcttgcatggtgtgcctggaaaagctgcagacactcagtgccagcctgtgaaagca
ggagggaggctataccctgcaaagccacaggggtggagctacccaagaccatggaaaccc
acctcttgcatcagtgtgacctggatgtga

>gi568815586r:10520147_10734618|GENSCAN_predicted_peptide_2|52_aa
MVHTQDFTLALLELSNTSGVLEPLHLSPNGIRCAARTRFTGSMGAFVKIVRR

>gi568815586r:10520147_10734618|GENSCAN_predicted_CDS_2|159_bp
atggtacacactcaagattttaccttagcattattggaattgtcaaataccagtggtgtc
ctggagcccctgcacctgtctccaaatggtatccgctgtgccgccaggacgcgcttcact
ggcagcatgggtgcctttgtgaagattgtgaggcgctag

>gi568815586r:10520147_10734618|GENSCAN_predicted_peptide_3|56_aa
MCNKKSRGQETEGCRQLNSIINAIRCLNPSNLLRLDMWTTLWPHIAAIAPDQYSWQ

>gi568815586r:10520147_10734618|GENSCAN_predicted_CDS_3|171_bp
atgtgtaataaaaagtctagaggtcaagagactgagggctgtcggcagctcaacagcatc
atcaatgccattcgctgtctaaatccttctaatcttctgcgtttggacatgtggacaacc
ttgtggccacacatagctgccatagccccagatcagtattcatggcaatga

>gi568815586r:10520147_10734618|GENSCAN_predicted_peptide_4|417_aa
MAVASDFYLRYYVGHKGKFGHEFLEFEFRPDGKLRYANNSNYKNDVMIRKEAYVHKSVME
ELKRIIDDSEITKEDDALWPPPDRVGRQELEIVIGDEHISFTTSKIGSLIDVNQSKNHPI
SGFECKTYLLTLKMNDQGEIYSTLRFLQSPSESQNRLRPDDTQRPGKTDDKVFQCIQEKH
QRQEILRNCSEKYIMQNDNYLKEQILTNKTLKFDVLKNSFQQKKELDSRLIQKNRCHREN
EIVFKVLQNTGLKPQNFAVSVIALKGGPSSVCCSRALIAPFYRVLIGPFLQSADWCVYKP
LARHRVLIGAFLQSADWCIYKPLARHRELIGAFLQSADWCIYKPLARHRELIGAFLHSTD
WCVSKPLARHRVLIGAFLQSTDWCIYKPLARHRELIGAFLQSADWCVYKPLARQSTD

>gi568815586r:10520147_10734618|GENSCAN_predicted_CDS_4|1254_bp
atggctgtggctagcgatttctacctgcgctactacgtagggcacaagggcaagtttggg
cacgagtttctggagttcgaatttcggccggacggaaagcttagatatgccaacaacagc
aattacaaaaatgatgtcatgatcagaaaagaggcttatgtgcacaagagtgtaatggaa
gaactgaagagaattattgatgacagtgaaattacaaaagaagatgatgctttgtggcct
ccccctgatagggttggccgacaggagcttgaaattgtaattggagatgagcacatatct
tttaccacatcaaaaataggttctcttattgatgtaaatcagtcaaaaaaccatcctatt
tcaggatttgaatgcaaaacttaccttcttactctaaagatgaatgatcagggagagatt
tattcaaccctgagatttttgcagtctccttcagagtcacagaatagattaaggcctgat
gatactcaaaggcctgggaaaactgatgacaaagtctttcagtgtattcaagaaaaacat
caacggcaggaaattctaagaaactgtagtgaaaagtacatcatgcaaaatgacaactac
ttaaaagagcagattttgacaaataagactttaaaatttgacgttctcaaaaatagcttt
cagcagaaaaaggaactggattcacgccttatacaaaagaacagatgtcatagagaaaat
gagatcgtttttaaagttttgcaaaatacaggattgaaaccacagaacttcgcagttagt
gttatagctcttaaaggtggcccatccagtgtttgttgctcccgagcactgattgctcca
ttttacagagtgctgattggtccgtttttacagagtgctgattggtgtgtttacaaacct
ttagctagacacagagtgctgattggtgcatttttacagagtgctgattggtgcatttac
aaacctttagctagacacagagagctgattggtgcatttttacagagtgctgattggtgc
atttacaaacctttagctagacacagagagctgattggtgcatttttacacagtactgat
tggtgcgtttccaaacctttagctagacacagagtgctgattggtgcatttttacagagt
actgattggtgcatttacaaacctttagctagacacagagagctgattggtgcattttta
cagagtgccgattggtgcgtttacaaacctttagccagacagagcactgattga

>gi568815586r:10520147_10734618|GENSCAN_predicted_peptide_5|449_aa
MNDRNEIQMEAKLQSLTIIAQEILCRFFITLRRHARFLLTKLGRQGMARSGITHSCAVCI
LCGPSREGDSPVAMGMTRMLLECSLSDKLCVIQEKQYEVIIVPTLLVTIFLILLGVILWL
FIREQRTQQQRSGPQGIAPVPPPRDLSWEAGHGGNVALPLKETSVENFLGATTPALAKLQ
VPREQLSEVLEQICSGSCGPIFRANMNTGDPSKPKSVILKALKEPAGLHEVQDFLGRIQF
HQYLGKHKNLVQLEGCCTEKLPLYMVLEDVAQGDLLSFLWTCRRDVMTMDGLLYDLTEKQ
VYHIGKQVLLALEFLQEKHLFHGDVAARNILMQSDLTAKLCGLGLAYEVYTRGAISSTQT
IPLKWLAPERLLLRPASIRADVYSIMKSCWRWREADRPSPRELRLRLEAAIKTADDEAVL
QVPELVVPELYAAVAGIRVESLFYNYSML

>gi568815586r:10520147_10734618|GENSCAN_predicted_CDS_5|1350_bp
atgaatgataggaatgagattcaaatggaagccaaactccaaagtcttaccattatagca
caggaaattctatgcaggttctttattacccttaggagacatgcacgtttcctgctcact
aaactaggaaggcaaggaatggcaaggtcaggaattactcacagctgtgctgtgtgcatt
ctctgtgggcctagcagggaaggggacagccctgtggcaatgggcatgacacggatgctc
ctggaatgcagtctcagtgacaagttgtgtgtcatccaggagaagcagtatgaagtgatt
atcgtcccaactttgttggttactatcttcctcatccttcttggggtcatcctgtggctt
tttatcagagaacaaagaactcaacagcagcgttctggacctcaaggcattgcccctgtt
cctccacctagggacctaagctgggaagcaggacatggaggaaatgtggctttgccactt
aaggagacatccgtggaaaactttctgggagctaccacacctgccctggctaagctgcag
gtgccgcgggagcaactctctgaagttctggagcagatttgcagtggtagctgtgggccc
atctttcgagccaatatgaacactggggacccttctaagcccaagagtgttattctcaag
gctttaaaagaaccagctgggctccatgaggtacaagatttcttagggcgaatccaattc
catcaatacctggggaaacacaaaaacctggtgcagctggaaggctgctgcactgaaaag
ctgccactctatatggtgttggaggatgtggcccagggggacctgctcagctttctctgg
acctgtcggcgggatgtgatgactatggatggtcttctctatgatctcacagaaaaacaa
gtatatcacatcggaaagcaggtccttttggcgctggaattcctgcaggagaagcatttg
ttccatggggatgtggcagccaggaatattctgatgcaaagtgatctcactgctaagctc
tgtggattaggcctggcttatgaagtttacacccgaggggccatctcctctactcaaacc
atacctctcaagtggcttgccccagaacggcttctcctgagacctgctagcatcagagca
gatgtgtacagtatcatgaagtcctgctggcgctggcgtgaggctgaccgcccctcacct
agagagctgcgcttgcgcctagaagctgccattaaaactgcagatgacgaggctgtgtta
caagtaccagagttggtggtacctgaactgtatgcagctgtggccggcatcagagtggag
agcctcttctacaactatagcatgctttga

>gi568815586r:10520147_10734618|GENSCAN_predicted_peptide_6|165_aa
MSIDQQQVQPLQLQQRFPVARVWRVPETSEPADFPSREPARTPSFAGSHAPQWEGGTEKH
FLEAGTPCPTQPCHYFPSDWAGAIGNRSDQRRDTVVNGVIVGIKTSIKEVCVKDKYQLRN
SRAGWIPKQIASLGLLAGLCSMQMCEESGFCARGGRDDTDQEYNI

>gi568815586r:10520147_10734618|GENSCAN_predicted_CDS_6|498_bp
atgagtatagatcagcaacaagtccagcctttacagttgcagcagaggtttcctgtggct
cgagtgtggcgagtcccggaaacctcggagcccgcagacttcccttcgcgggagcccgcc
cgaactccatcctttgccggcagccacgccccgcagtgggaaggagggactgaaaagcat
ttccttgaggctggcacaccttgccctacccaaccctgtcattatttcccctccgactgg
gccggtgccatcggaaaccggagtgaccagaggagggacacggtagtgaacggggtaata
gtgggaatcaaaaccagtatcaaggaagtttgtgttaaagacaaatatcagttaaggaat
agcagagcagggtggattccaaagcagattgcttcacttggtttattggctggactgtgc
tccatgcagatgtgtgaagagtctgggttttgtgcaagaggaggaagagatgacacggat
caagaatataacatctaa

>gi568815586r:10520147_10734618|GENSCAN_predicted_peptide_7|88_aa
MTGVLVRSNTRDMRAQGGGHLKKLQEEGRPSASHGEASVNQTCQHLDLGLPTSRPEKRNF
CRGEYSLLEETDKETTNEKRSGTEKKEG

>gi568815586r:10520147_10734618|GENSCAN_predicted_CDS_7|267_bp
atgactggtgttcttgtgaggagcaacaccagagacatgcgtgcacaggggggtggccat
ttgaagaagttgcaagaggaaggaaggccatctgcaagccacggagaggcctcagtgaac
caaacctgccagcaccttgatcttggacttccaacctccagacctgagaaaaggaatttc
tgtcgtggagaatatagcctattagaagaaacagacaaggaaactacaaatgagaaacgc
tctggtaccgagaagaaagaaggatag

>gi568815586r:10520147_10734618|GENSCAN_predicted_peptide_8|595_aa
XPSNDWTQLEARAKKPVEAVQSGSFSGAQSREKNGNERDPLITKFTLNRRWSPCGAKGIS
NPNPAFSRQTSNSNSTVAYFCRKPRWGRGPRSHGHRGRRLFSHRRRQRRRGEKSSRGLRV
PSAGRLPCRRSQTGTVADSRGGKCLRLEKKGEAVWERGPWVGMDVGEGPTRGPGALGSLL
GIWERRAFPLGKRRALEGDGEGKIEKLLARKGQVSVPDALPTWAWERRHAPVGHMPAKRC
CNRLRETWVLVPRLPPPLCAHGCKQPFSVHQWLHASLFPDCGNARGVAAEKTLSEVGVLV
KAPLSEQLSIEVNSAEKQITAIKKNNPRKYLRSVGDGETVEFDVVEGEKGAEAANVTGPD
GVPVEGSRYAADRRRYRRGYYGRRRGPPRNYAGEEEEEGSGSSEGFDPPATDRQFSGARN
QLRRPQYRPQYRQRRFPPYHVGQTFDRRSRVLPHPNRIQSYPWSLPYPLPHQQLLKPLNG
QIKAGEIGEMKDGVPEGAQLQGPVHRNPTYRPRYRSRGPPRPRPAPAVGEAEDKENQQAT
SGPNQPSVRRGYRRPYNYRRRPRPPNAPSQDGKEAKAGEAPTENPAPPTQQSSAE

>gi568815586r:10520147_10734618|GENSCAN_predicted_CDS_8|1788_bp
ntgccctccaatgattggacccagctggaagccagagcaaagaagcctgtagaagctgtt
caaagtggttcattctctggggcacagagcagggagaaaaatggaaatgagagggatcct
ctgattaccaaatttactctcaacaggcgttggtccccatgtggagcaaaaggcatcagt
aaccccaatccagctttttctcgacagacttctaactcaaacagcactgtggcctatttc
tgcaggaaaccccggtggggacgcggcccccgcagccacgggcaccgcggccgccgcctc
tttagccaccgccgccggcagcgaagacgcggagaaaaaagttctcggggtctccgggtc
cccagcgctggccgcctcccttgccggcgctcccagacgggcactgttgcggattcgcgt
ggtggaaaatgcctgcgtttggagaagaaaggagaggcagtctgggagaggggaccttgg
gtagggatggatgtaggggaaggaccgacacgtgggcctggcgctttggggtccttgctt
gggatctgggaaaggagagcatttcctttggggaagaggagggcgctagaaggagacggg
gagggaaagatagaaaagcttcttgccaggaagggtcaggtgtctgtccctgacgccctt
cccacatgggcatgggaaagacgccatgctcctgttggccacatgccagcaaagagatgt
tgcaatcgtctccgggaaacttgggtcttagtcccacgtcttcctccccctctctgtgcc
cacggctgcaaacagccattcagtgtccaccaatggctacacgcttccctcttcccggac
tgtgggaacgcacgtggtgtagcagctgaaaaaactttgtcggaagtgggggtattggtt
aaagctcctctgtcagagcagctctcaattgaagtaaatagcgcagagaagcagataact
gccatcaagaagaataacccacggaaatatctgcgcagtgtaggagatggagaaactgta
gagtttgatgtggttgaaggagagaagggtgcagaagctgccaatgtgactggcccggat
ggagttcctgtggaagggagtcgttacgctgcagatcggcgccgttacagacgtggctac
tatggaaggcgccgtggccctccccggaattacgctggggaggaggaggaggaagggagc
ggcagcagtgaaggatttgacccccctgccactgataggcagttctctggggcccggaat
cagctgcgccgcccccagtatcgccctcagtaccggcagcggcggttcccgccttaccac
gtgggacagacctttgaccgtcgctcacgggtcttaccccatcccaacagaatacagagt
tacccctggtctctcccttacccgttacctcaccaacaacttctaaagccattaaatggg
cagatcaaggctggtgagattggagagatgaaggatggagtcccagagggagcacaactt
cagggaccggttcatcgaaatccaacttaccgcccaaggtaccgtagcaggggacctcct
cgcccacgacctgccccagcagttggagaggctgaagataaagaaaatcagcaagccacc
agtggtccaaaccagccgtctgttcgccgtggataccggcgtccctacaattaccggcgt
cgcccgcgtcctcctaacgctccttcacaagatggcaaagaggccaaggcaggtgaagca
ccaactgagaaccctgctccacccacccagcagagcagtgctgagtaa