GENSCAN 1.0 Date run: 4-Nov-116 Time: 08:29:49

Sequence gi568815587r:111254264_111458918 : 204655 bp : 44.01% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   1726   1829  104  2  2   37  101    92 0.549   5.29
 1.02 Intr +  13710  13799   90  2  0  -14  110    88 0.121   0.99
 1.03 Term +  17845  17874   30  0  0   94   52    27 0.517  -2.45
 1.04 PlyA +  18516  18521    6                               1.05

 2.00 Prom +  23071  23110   40                              -0.96
 2.01 Init +  26413  26559  147  0  0   83   95    21 0.570   2.39
 2.02 Intr +  27135  27177   43  2  1   73  109    20 0.803   0.41
 2.03 Intr +  29813  29935  123  0  0   55   76   129 0.494   8.96
 2.04 Intr +  29951  30114  164  0  2   55   59   197 0.435  13.19
 2.05 Term +  31390  31791  402  0  0   89   55   193 0.994  11.25
 2.06 PlyA +  31963  31968    6                               1.05

 3.00 Prom +  34553  34592   40                              -6.26
 3.01 Init +  38039  38163  125  1  2   74   91   135 0.615  12.04
 3.02 Intr +  44563  44663  101  1  2   52   53    51 0.354  -2.25
 3.03 Intr +  46264  46358   95  2  2   77   77   124 0.784   9.88
 3.04 Intr +  50665  50719   55  0  1   88   82    20 0.564  -0.05
 3.05 Intr +  52131  52360  230  1  2  125   51    58 0.852   3.39
 3.06 Intr +  53806  54094  289  0  1   76   -7   208 0.080   6.92
 3.07 Intr +  55480  55626  147  2  0    5   55   166 0.153   5.21
 3.08 Intr +  68856  68915   60  1  0   90  101    28 0.506   3.01
 3.09 Intr +  76474  76497   24  2  0  112   82    11 0.358   0.90
 3.10 Intr +  78714  78863  150  1  0  101   60    16 0.394   0.23
 3.11 Term +  81450  81622  173  1  2   83   55    89 0.734   3.09
 3.12 PlyA +  83125  83130    6                               1.05

 4.00 Prom +  86561  86600   40                              -3.86
 4.01 Init +  90877  90960   84  0  0   84  116    39 0.928   7.12
 4.02 Term +  93265  93279   15  0  0   93   55    18 0.689  -2.76
 4.03 PlyA +  93483  93488    6                               1.05

 5.09 PlyA -  94666  94661    6                               1.05
 5.08 Term - 100312  99998  315  1  0  118   48   220 0.955  15.94
 5.07 Intr - 103447 103182  266  2  2  113   92   239 0.987  24.03
 5.06 Intr - 103574 103532   43  2  1  145   86    44 0.994   7.71
 5.05 Intr - 104655 104525  131  1  2   40   81   118 0.676   6.71
 5.04 Intr - 108805 108693  113  0  2   41   54   100 0.576   1.92
 5.03 Intr - 109019 108916  104  2  2   48   59    41 0.500  -3.83
 5.02 Intr - 109726 109570  157  0  1   57   81   112 0.918   7.41
 5.01 Init - 111145 111108   38  1  2   84   97    62 0.988   6.19
 5.00 Prom - 114978 114939   40                              -6.66

 6.00 Prom + 119202 119241   40                              -2.16
 6.01 Init + 125624 125726  103  1  1   61   13   151 0.644   5.30
 6.02 Intr + 133376 133428   53  0  2   52   70    46 0.019  -2.17
 6.03 Intr + 135993 136068   76  2  1   31  106    91 0.459   4.29
 6.04 Intr + 142105 142181   77  2  2  126   61    -2 0.111   0.13
 6.05 Intr + 144412 144490   79  0  1   81   89    28 0.073   1.32
 6.06 Intr + 149469 149584  116  1  2    8   54   165 0.017   5.27
 6.07 Intr + 162329 162494  166  1  1   97   63    38 0.419   1.73
 6.08 Intr + 164886 165002  117  1  0   66   81    39 0.295   1.44
 6.09 Intr + 176785 176917  133  2  1  116   53    22 0.003   1.20
 6.10 Intr + 194290 194404  115  1  1  110   55    48 0.471   4.15
 6.11 Intr + 196077 196182  106  2  1  121   95    42 0.991   7.99
 6.12 Intr + 196700 196871  172  0  1   60   76   143 0.912   9.30
 6.13 Intr + 196984 197146  163  1  1  113   14   229 0.474  17.98
 6.14 Intr + 199724 199766   43  1  1   81   39    18 0.131  -5.99
 6.15 Intr + 199973 200094  122  0  2  103   78    76 0.909   8.31
 6.16 Intr + 200700 200825  126  2  0   66   80    31 0.635   0.98
 6.17 Intr + 200912 201318  407  1  2   62   71   154 0.338   4.25
 6.18 Intr + 201494 201615  122  2  2  129   99   127 0.999  18.04
 6.19 Intr + 202252 202476  225  2  0   51  105   268 0.863  22.76
 6.20 Intr + 202744 202958  215  2  2   85   44   182 0.690  11.93
 6.21 Term + 203582 203662   81  1  0  114   53    18 0.249  -1.41
 6.22 PlyA + 204399 204404    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815587r:111254264_111458918|GENSCAN_predicted_peptide_1|74_aa
XDYSKRVYQGVRVKHTVKDLLAEKRSGQTSNSRLNKSNTILLLFGILMVVLNNNENAKRR
RSRIQVCDMQFLTF

>gi568815587r:111254264_111458918|GENSCAN_predicted_CDS_1|225_bp
ngagactacagcaaacgagtgtatcaaggagtgagagtgaagcatacagtcaaagatctc
ctggcagaaaaacgatccgggcagacaagtaactcaagacttaataaatctaataccata
ctactgctctttggcattctaatggtggtgctaaacaacaatgaaaatgccaaaagaaga
agaagtcgaatacaggtatgtgacatgcagttcctcacattttga

>gi568815587r:111254264_111458918|GENSCAN_predicted_peptide_2|292_aa
MARKHKSSDTGNLDMPKRSRNVLPLNEKVNVLNLIRKEKKCMLRSDIGMGSVSSSQSPFV
QMPGSPVTSGYYGVRRSFLSDSDFHNSKQFSNDVYTSSVGKPFPWQSHAALLEPYFPQEP
YGDYRPPALTPNAGSLFSASPLPPLLPPPFPGDPAHFLFRDSWEQTLPDGLSQPDPVSAD
ALLTLPPSTSCLSQLESGSIAQHRGSSWGSSLAGAQSYSLHALEDLHHTPGYPTPPPYPF
TPFMTVSNDLPPKVGPLSPDEEADTGSLHDPSPWVKEDGSIAWGSYECRRAY

>gi568815587r:111254264_111458918|GENSCAN_predicted_CDS_2|879_bp
atggcccgaaagcacaagagtagtgacactggcaatttagatatgccaaagagaagccgg
aatgtgcttcctttaaatgaaaaagtaaacgttctcaacttaataaggaaagaaaaaaaa
tgtatgctgaggtcagacattggtatgggcagcgtcagttcctctcagtctccatttgtg
cagatgccaggttcgccggtcacgtcaggttactacggtgtcagaagatctttcttatct
gactcagacttccacaacagtaaacagttttcaaatgacgtctacacctccagcgtgggg
aagccgtttccctggcagagccatgcggctctcctggagccctacttcccccaggagccc
tacggagactaccggcctccggcgctgacgcccaacgcgggctctctgttcagcgcctcg
cccctaccgccgctcctgccaccgcccttccccggagacccagctcacttcctatttagg
gactcatgggagcagacgttgcctgacggtctcagccagcctgaccctgtgtctgccgat
gccctgctgaccttgccacccagcacgagttgcctctcccagcttgagtccgggagcatc
gcccagcacaggggctcaagctgggggtcatccctggctggggctcagtcatactcgctg
catgctctggaagatctgcaccacactccggggtaccctaccccgcctccttaccccttc
acccctttcatgacggtgtcaaatgacctaccgcccaaggtggggccactctccccagat
gaggaagcagacaccggttccctccatgacccttccccttgggtgaaagaagatgggagt
attgcctgggggtcatatgaatgccgcagagcttattga

>gi568815587r:111254264_111458918|GENSCAN_predicted_peptide_3|482_aa
MGKDFMSKTPKAMATKAKIDKWDLIKLKSFYTAKETTIRVNRPPPALPRIHCCVLRHAVR
TLRAQKLLRGELEAHKKPKVYQGVRVKITVKELLQQRRAHQAASGGTRSGGSSVHLSDPV
APSSAGLYFEPEPISSTPNYLQRGEFSSCVSCEENSSCLDQIFDSYLQTEMHPEPLLNST
QSAPHHFPDSFQATPFCFNQSLIPGSPSNSSILSGSLDYSYSPVQLPSYAPENYNSPASL
DTRTCGYPPEDHSYQHLSSHAQYSCFSSATTSICYCASCEAEDLDALQAAEYFYPSTDCI
KAQDLISEVKSEEPLPHLKCQYNGSNTGTFSIGHRYHFLSTSYVPELFISTPEKNPKHLG
SRTTSRDGCRNLNLKGVNTEAHRGHHLPNWTAVKPQLTPLAASSCNFSTLSLVLLSLERQ
CFSPTRISQGFCCTWQRSESDQKHKCTELSFLPGQCPKTLWQDLWFNDFLYFLHHFARQQ
IS

>gi568815587r:111254264_111458918|GENSCAN_predicted_CDS_3|1449_bp
atgggcaaggacttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgac
aaatgggatctaattaaactaaagagcttctacacagcaaaagaaactaccatcagagtg
aacaggcccccgcccgccctcccacgtatccactgttgtgtcctgagacacgcggtgcgg
accctgcgcgcccagaagctgctacggggggagctagaggctcacaaaaaaccgaaggtg
tatcaaggtgtccgagtgaagatcacagtgaaggagctgctgcagcaaagacgggcacac
caggcggcctccgggggaacccggtccggaggcagcagtgtccacctttcagacccagtt
gcaccatcttctgcaggactgtattttgagcctgaaccaatttcttccacgcccaattat
ttgcaacggggagaattttccagttgtgtttcatgtgaagaaaactcaagctgcctcgac
cagatctttgattcctaccttcagacagagatgcacccggagcctttgctcaattccaca
caaagtgctccacaccatttcccagacagcttccaggccacccctttctgctttaaccag
agcctgatcccaggatcaccttcaaattcctccattctctctggctccttagactacagt
tactcgccagtgcagctgccttcatatgctccagagaattacaattcccctgcttctctg
gacaccagaacctgtggctaccccccagaagaccattcctaccaacacttgtcctcacac
gcccagtacagctgcttctcctcggccaccacctccatctgctactgcgcatcgtgtgag
gcagaggacttggatgctctccaggcagcagagtacttctacccgagcacagactgcatt
aaggcacaggacttaatctctgaagtgaagtctgaagagcctttgccacatctcaagtgc
cagtacaatggcagcaatactggtaccttcagcattggccacaggtaccattttctgagc
acgtcctatgtgccagaattatttatttctactcctgagaaaaaccctaagcacttaggg
tctagaaccacatcccgagacggctgcaggaacctgaacttgaaaggtgtaaacactgag
gcccatagaggtcatcacctgcccaactggacagctgtgaagccacagctcacaccactt
gctgcatcctcctgcaatttcagcacactttctcttgtgctgctctccctggaaagacag
tgcttttccccaacacgcataagccaaggtttctgctgcacctggcaaagaagtgaatca
gatcagaagcacaaatgcacggaactgagttttctgcctggacagtgtcctaagactctc
tggcaagatctctggtttaacgacttcctgtactttcttcatcatttcgccagacagcag
atctcctga

>gi568815587r:111254264_111458918|GENSCAN_predicted_peptide_4|32_aa
MLLPLLTYTLVISQLRDKVQKRPQKYQKTGFE

>gi568815587r:111254264_111458918|GENSCAN_predicted_CDS_4|99_bp
atgctgctgcctctgctgacctatacattggttataagtcaactgagggacaaagttcaa
aagagaccacaaaagtaccagaagacgggatttgagtga

>gi568815587r:111254264_111458918|GENSCAN_predicted_peptide_5|388_aa
MALYGYKVYLLFSGIERQYGELLSPKKEISNQGREAHAKVERFSSVMDSWIPTGDLKGPQ
TTGLMLESVTEFLGSWLWTRAGQSGDPWELWESCNVNRKSTSTKRSLFIDLETVGGGKPS
LDKILEVASWSLSSQVCATAPEQAPAPARPYQGVRVKEPVKELLRRKRGHASSGAAPAPT
AVVLPHQPLATYTTVGPSCLDMEGSVSAVTEEAALCAGWLSQPTPATLQPLAPWTPYTEY
VPHEAVSCPYSADMYVQPVCPSYTVVGPSSVLTYASPPLITNVTTRSSATPAVGPPLEGP
EHQAPLTYFPWPQPLSTLPTSTLQYQPPAPALPGPQFVQLPISIPEPVLQDMEDPRRAAS
SLTIDKLLLEEEDSDAYALNHTLSVEGF

>gi568815587r:111254264_111458918|GENSCAN_predicted_CDS_5|1167_bp
atggccttgtacggctacaaggtttacctgttgtttagtggtatagaaaggcaatatggt
gaacttctcagcccgaaaaaggagatctccaatcagggccgagaggctcatgctaaggtg
gaaagattttcctccgtgatggactcttggatacctacgggagacttaaaaggccctcag
accacaggattaatgctggagtcagttacagaattccttggcagctggctgtggacacgt
gccgggcagtcaggagacccgtgggagctgtgggagtcctgcaacgtcaataggaagagc
acctccaccaagcgctctttgttcatcgacttagaaacagtaggaggcgggaagccctct
cttgacaagatcctggaagtggccagctggtccctttccagccaagtgtgtgccacagct
ccggagcaagccccagccccggcccggccataccagggcgtccgtgtgaaggagccagtg
aaggaactgctgaggaggaagcgaggccacgccagcagtggggcagcacctgcacctacg
gcggtggtgctgccccatcagcccctggcgacctacaccacagtgggtccttcctgcctg
gacatggaaggttctgtgtctgcagtgacagaggaggctgccctgtgtgccggctggctc
tcccagcccaccccggccaccctgcagcccctggccccatggacaccttacaccgagtat
gtgccccatgaagctgtcagctgcccctactcagctgacatgtatgtgcagcccgtgtgc
cccagctacacggtggtggggccctcctcagtgttgacctatgcctctccgccactcatc
accaatgtcacgacaagaagctccgccacgcccgcagtggggcccccgctggagggccca
gagcaccaggcacccctcacctatttcccgtggcctcagcccctttccacactacccacc
tccaccctgcagtaccagcctccggccccagccctacctgggccccagtttgtccagctc
cccatctctatcccagagccagtccttcaggacatggaagaccccagaagagccgccagc
tcgttgaccatcgacaagctgcttttggaggaagaggatagcgacgcctatgcgcttaac
cacactctctctgtggaaggcttttag

>gi568815587r:111254264_111458918|GENSCAN_predicted_peptide_6|938_aa
MGALALMSAMPTEKEANPQGRKENMPVLFDWIWACSWQVSMPHFADEKTKDQQIFAECHP
TVIQSVFGIKDGTVNKESLSQVSPISLPNQVYPHAPILLQFQQVMVLLLVWESHSDTFPL
KLGMAVRPGGPKEMAHGMYWLDDDGDDSIDDIGDIDGNDGGDDNGGSNCETPRTSRQRLS
WVGQCEQRARALRLEPLHLYSGSTAYPVTVRKRFDFSQSPFPLESYPEVLGIRKSTYKLG
GEEGGCTIQPMTEVNTDAQRSAMPQSVTKGFDIQKQKLKDLSEGRTYLCTLTQAVKEGYI
VQYCIERFFNPWPGQDACFPCGSEATQPEKGKDTCVCRGPGRVFQDFYLLDQPSPPRAVG
HPCNLDLGQKSVPLYVVKMDGKSREGHLDSSWSLSLNSTHLGDLLPPEPGIRNPTVCLQV
NDTLAFLVTHEHYPEYDLGHFYNTLEQYDWRRFRALAEESQLYEQSLSLFLQQFQKPGIY
VFRLSSNRHRKMNFNVDMNFLGIFCFSGSWEAEEQVDLEWFDTEAFFGDCLRQSLSVTAK
LSHTKEELKLLYLKLLGEARSLQQLWGTRRCLPASANRLLRSVQREQQQHLPNLFHLPGP
KALTYSSHHLWEFLIPFAAAAMPVQNNSGVPKVSGVSPSLLSLVAVAPAGEQAGLSLTQA
AEAAAWAAEEKASRRRHLAGEYAASLRHQLKLLRQNLLGRQEQWASFCSTLREVQQQLKA
QMGSRAVPQLDTVLGHLSQVVLQEGHRLKAWGILGTGTGAELLRPAPAGPPGADDISVNP
VTKLMVPGPNSLMLPASGHAGSIPPGYFIHPDTGRVLPEAGHLGYDLLRATLVPTMDTNA
GLKEAGALKGTNSSLYPCPGGVRTSEAAILPYVPYPTSPTTGSPPATHLPILQPRRTSPL
GALMTDPVTGIKLSYLYWEGCEMLRGTSCCLVTALWSH

>gi568815587r:111254264_111458918|GENSCAN_predicted_CDS_6|2817_bp
atgggggcgctggctctgatgtcggctatgcccacggaaaaagaggccaaccctcaagga
aggaaggagaacatgcccgtgctctttgactggatctgggcttgcagctggcaagtgtcc
atgccccattttgcagatgagaaaaccaaggaccagcaaatatttgctgaatgtcatccc
actgtcatccagtctgtatttggcattaaggatggaacagtgaacaaggaaagcctctcc
caagtcagccccatttcccttcccaaccaagtctacccacatgcccccattctgctgcag
ttccagcaggtgatggtgctgctgctggtctgggaatcacacagtgacaccttcccactt
aaactaggaatggctgtcaggcctggtggtcctaaggaaatggcacatgggatgtactgg
ttagatgatgatggtgatgacagtattgatgatattggtgatattgatggtaatgatggt
ggtgatgataatggtggtagcaattgtgaaacccccaggacatcccgacagcgtctgtcc
tgggtgggccaatgtgagcagagagcacgggctttgaggctggagcccctgcatttgtat
tctggctccaccgcctaccctgtcaccgtgagaaagcggtttgacttctctcaatctcct
tttcccctagagtcatatcctgaggtactggggattaggaagtcaacctataaacttgga
ggagaagaaggaggctgcacaattcagcccatgacagaagttaacactgatgctcagaga
agtgccatgcctcagtcagtgacgaaagggtttgatattcaaaagcagaaattaaaggac
ttatctgaaggcagaacatatttgtgtaccttaacccaggcagtaaaggagggctatatt
gttcagtattgtattgagaggttcttcaacccctggcccggccaggatgcctgtttcccc
tgtggctcagaggccacccagccagagaagggcaaggacacgtgtgtctgccgggggcct
gggcgagtgttccaagatttttaccttctggaccagccttccccaccacgtgctgtgggc
catccctgcaacttggacttagggcagaagtctgtccccctctatgtggtcaagatggat
ggaaagtctagggaaggacatctggactcttcttggtcactgagcctcaactccacacat
cttggggatctgctgcccccagagccaggcatccgaaaccccacagtctgcctccaagtc
aatgataccctggccttcctggtgactcatgagcattacccagagtatgacctgggccac
ttctataacacactggagcaatatgactggcggcgcttccgggccctggctgaggagtcc
cagctctatgagcaaagcctcagcctctttctgcagcagttccaaaagccgggcatttat
gtctttcgtctgagcagcaaccgacatcggaagatgaactttaatgtagatatgaatttc
ctggggatcttctgcttcagtgggagctgggaggctgaggagcaggtagatctggagtgg
tttgacacagaggccttctttggagactgcctcaggcagtctctgtcagtgacagccaaa
ctcagccacacgaaggaagagctcaaactcctctaccttaaacttctgggtgaagcccgt
tctctccagcagctgtggggaacaaggcgctgcctcccagcctccgctaaccggcttctg
aggagcgtgcagagggagcagcagcagcacctccccaacctgttccatctccctggccct
aaagctcttacctacagttcccaccatctctgggaatttctcatcccctttgccgctgct
gccatgcctgtccagaataactctggagttcccaaggtgagtggggtgtccccgagcttg
ctgtccctggtggctgtagcgcctgcaggtgagcaggcagggttgtctctgacccaggct
gctgaggcagctgcctgggctgcagaagagaaggccagccggaggaggcacctggcaggc
gagtacgcagccagccttcgtcaccaactcaagctccttcgccaaaacctcctcgggagg
caggagcagtgggcctctttctgctcgacgctgagggaggttcagcagcagttgaaagca
cagatgggctccagggcagttccccagctggacacggtgctgggccacctgtcccaggtc
gtgctgcaggagggccaccgcctgaaggcctggggcattctgggcaccggcactggggcg
gagctactgaggccagccccagcaggccctcctggtgctgatgacatcagtgtgaacccc
gtcaccaagttgatggtccctggccccaactccttgatgcttccagccagcggccacgca
ggctccataccccctggctacttcatccaccctgacactgggagggtgctgcccgaggct
ggacacctgggatatgatctgctgagagctaccctggtgcccactatggacaccaatgcc
gggctgaaggaggctggtgcattaaagggtaccaacagctccctgtatccctgtccaggt
ggtgtccgaacatcagaagctgccatcctgccctatgtgccctacccaaccagtcccacc
acaggttcccctccagccacgcacctgcccatcctgcagccaagaaggacgtccccgctg
ggggctctcatgacagaccccgtcacaggcatcaagctcagctacttgtactgggagggc
tgcgagatgcttcggggaacctcttgctgcctggtgacagctttgtggagccactga