GENSCAN 1.0 Date run: 5-Nov-116 Time: 18:13:41

Sequence gi568815575r:114905197_115117269 : 212073 bp : 38.05% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +   1393   2219  827  2  2  134   54   616 0.843  55.04
 1.02 PlyA +   2450   2455    6                               1.05

 2.07 PlyA -   2464   2459    6                               1.05
 2.06 Term -   5912   5745  168  2  0   99   40    82 0.400   1.40
 2.05 Intr -  23185  23058  128  2  2   86   77    51 0.104   3.28
 2.04 Intr -  28957  28881   77  0  2   84   91    14 0.029  -0.46
 2.03 Intr -  37638  37166  473  0  2   47   71   240 0.066   9.15
 2.02 Intr -  38011  37871  141  1  0   29   36   126 0.237   1.13
 2.01 Init -  42711  42439  273  0  0   53  105   161 0.957  11.32
 2.00 Prom -  43866  43827   40                              -5.15

 3.03 PlyA -  44716  44711    6                               1.05
 3.02 Term -  46992  46733  260  1  2   56   45   168 0.588   4.03
 3.01 Init -  89512  89419   94  1  1   57   61   103 0.159   5.09
 3.00 Prom -  90504  90465   40                              -5.15

 4.11 PlyA -  92929  92924    6                               1.05
 4.10 Term -  93336  93179  158  1  2   91   39   126 0.147   5.11
 4.09 Intr -  96803  96701  103  2  1   65   38   124 0.165   3.93
 4.08 Intr - 100119 100001  119  1  2   65   95    11 0.155  -1.24
 4.07 Intr - 102880 102736  145  1  1   76   86    46 0.235   2.13
 4.06 Intr - 104470 104325  146  2  2   82   48    72 0.365   1.68
 4.05 Intr - 109378 109225  154  1  1   93   32    81 0.583   1.72
 4.04 Intr - 110625 110474  152  1  2   46  115    86 0.983   6.06
 4.03 Intr - 112106 111980  127  2  1   83   94    60 0.066   5.43
 4.02 Intr - 125638 125555   84  1  0   80  110     9 0.002   1.40
 4.01 Init - 134949 134719  231  0  0   61   52   162 0.317   8.21
 4.00 Prom - 136492 136453   40                              -5.25

 5.00 Prom + 136747 136786   40                              -6.25
 5.01 Init + 139458 139512   55  2  1   72  106    54 0.914   7.10
 5.02 Term + 142819 142886   68  2  2  103   49    51 0.682  -0.18
 5.03 PlyA + 143379 143384    6                               1.05

 6.06 PlyA - 143392 143387    6                               1.05
 6.05 Term - 154806 154661  146  1  2   80   38   106 0.865   1.89
 6.04 Intr - 157961 157773  189  0  0   73   78    77 0.897   3.84
 6.03 Intr - 159038 158760  279  0  0   65   75   140 0.962   6.93
 6.02 Intr - 164110 163998  113  0  2   44   31   132 0.009   2.20
 6.01 Init - 171268 171231   38  1  2   88   86    38 0.162   3.16
 6.00 Prom - 175726 175687   40                              -3.65

 7.02 PlyA - 176492 176487    6                               1.05
 7.01 Sngl - 178005 177604  402  0  0   70   43   230 0.965  12.73
 7.00 Prom - 178960 178921   40                              -5.35

 8.02 PlyA - 179129 179124    6                               1.05
 8.01 Sngl - 180103 179321  783  1  0   49   36   491 0.998  35.71
 8.00 Prom - 184521 184482   40                              -5.55

 9.00 Prom + 187810 187849   40                              -2.05
 9.01 Init + 193485 193629  145  2  1   67   53   107 0.564   5.48
 9.02 Intr + 196705 196858  154  2  1    3    8   197 0.437   1.41
 9.03 Intr + 197179 197627  449  1  2   67   86   265 0.512  16.27
 9.04 Term + 198869 198963   95  0  2   92   47    16 0.265  -5.09
 9.05 PlyA + 199385 199390    6                               1.05

10.02 PlyA - 199749 199744    6                               1.05
10.01 Term - 208139 208020  120  2  0  106   34   131 0.984   6.99

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init - 112073 111980   94  2  1   85   94    57 0.917   6.70
S.002 Term - 128456 128347  110  0  2   36   49   126 0.877   1.19
S.003 Init - 129111 129039   73  0  1   79   89    78 0.911   8.28

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815575r:114905197_115117269|GENSCAN_predicted_peptide_1|275_aa
XVSVPIPVIGLRDEEKVFVNNTTCVLNDPNFVLIGSFVAFFIPLTIMVITYCLTIYVLRR
QALMLLHGHTEEPPGLSLDFLKCCKRNTAEEENSANPNQDQNARRRKKKERRPRGTMQAI
NNERKASKVLGIVFFVFLIMWCPFFITNILSVLCEKSCNQKLMEKLLNVFVWIGYVCSGI
NPLVYTLFNKIYRRAFSNYLRCNYKVEKKPPVRQIPRVAATALSGRELNVNIYRHTNEPV
IEKASDNEPGIEMQVENLELPVNPSSVVSERISSV

>gi568815575r:114905197_115117269|GENSCAN_predicted_CDS_1|828_bp
ngtgtatcagttcctatccctgtgattggactgagggacgaagaaaaggtgttcgtgaac
aacacgacgtgcgtgctcaacgacccaaatttcgttcttattgggtccttcgtagctttc
ttcataccgctgacgattatggtgattacgtattgcctgaccatctacgttctgcgccga
caagctttgatgttactgcacggccacaccgaggaaccgcctggactaagtctggatttc
ctgaagtgctgcaagaggaatacggccgaggaagagaactctgcaaaccctaaccaagac
cagaacgcacgccgaagaaagaagaaggagagacgtcctaggggcaccatgcaggctatc
aacaatgaaagaaaagcttcgaaagtccttgggattgttttctttgtgtttctgatcatg
tggtgcccatttttcattaccaatattctgtctgttctttgtgagaagtcctgtaaccaa
aagctcatggaaaagcttctgaatgtgtttgtttggattggctatgtttgttcaggaatc
aatcctctggtgtatactctgttcaacaaaatttaccgaagggcattctccaactatttg
cgttgcaattataaggtagagaaaaagcctcctgtcaggcagattccaagagttgccgcc
actgctttgtctgggagggagcttaatgttaacatttatcggcataccaatgaaccggtg
atcgagaaagccagtgacaatgagcccggtatagagatgcaagttgagaatttagagtta
ccagtaaatccctccagtgtggttagcgaaaggattagcagtgtgtga

>gi568815575r:114905197_115117269|GENSCAN_predicted_peptide_2|419_aa
MWESLELPRDLLNGFDQNADSDMDNEIQAEVVSDGDEEPIRNWSKGDSCYVLAKRLAAFC
PHPRDLWNFELERDDLGFLVEEIYKQQSIQEWRLEHQQDKWSSSVDPIPTEPSKLRSTGL
KFSLPTQQSEVDLGCSSLSTWGKGSCGHSFSRLKHSCLPPLKRAVDLPAQHLSSAKGQTA
SSSRSLNPVPPDWETPPGMHQQTPHRGELQLVGALLGQSFQRKEQSAIFAVLQPLLVIPR
QTGSGADYQQTPADLQQRGLTVRRKSNKQKGIVSTSTKKTSTPKPHPKVTNIKDQSLSSS
LEERSFWSEVLKPPRAGGGEYDNEITGRGVKKQLISLVGQIRIKELQLKELQRTTLSYAL
GEKMGTDVMPVFSTLTQHQKSIYVERKGQRILSSLVPGMEEESCFSIMSLMILEVSKAS

>gi568815575r:114905197_115117269|GENSCAN_predicted_CDS_2|1260_bp
atgtgggaaagtttggaacttcctagagacttgttgaatggctttgatcaaaatgctgat
agtgatatggacaatgaaatccaggctgaggtggtctcagatggagatgaggaacctatc
aggaactggagcaaaggtgactcttgttatgtgttagcaaagagactggcagcattttgc
ccccaccctagggatttgtggaactttgaacttgagagagatgatttagggtttctggtg
gaagaaatttataagcagcaaagcattcaagagtggcgcctggaacaccagcaagacaag
tggtctagctcagtggaccccatccccacagagcccagcaagctaagatccactggcttg
aaattctcgctgccaacacagcagtctgaagttgacctgggatgctccagcttgagcacc
tggggaaagggcagctgtggtcacagcttcagcagacttaaacattcctgcctgccacct
ctgaagagagcagtggatctcccagcacagcacttgagctctgctaagggacagactgcc
tcctcaagtaggtccctgaacccagtgcctccagactgggagacacctcccggcatgcat
caacagacacctcatagaggagagctccagctggtgggtgcccttctaggacaaagcttc
cagaggaaggaacagtcagcaatctttgctgttctgcagcctctgctggtgatacccagg
caaacagggtctggagcagactaccagcaaactccagcagacctgcagcagaggggcctg
actgttagaaggaaaagtaacaaacagaaaggaatagtatcaacatcaacaaaaaagaca
tccacaccaaaaccccatccaaaggtcaccaacatcaaagaccaaagtctgtcaagttct
ttggaggaaagaagtttttggtcagaggtcctaaaaccaccaagagctgggggcggtgaa
tatgataatgagataacaggcaggggagtaaaaaaacaattgatctccttggtgggtcaa
ataaggataaaggaacttcagctaaaggaacttcagagaacaactttatcctatgctttg
ggagagaagatgggaactgacgtcatgccagtattttctacactcacccagcatcagaag
agcatatatgttgaaagaaaaggtcaacgtattctgtcatccctggttccaggtatggaa
gaagagagttgttttagtatcatgtctctgatgattttggaggtatccaaagccagttga

>gi568815575r:114905197_115117269|GENSCAN_predicted_peptide_3|117_aa
MKICITPPGNKPQPAEVLAEGKGNTEWVVEEDLLRTPGYFSQMVVELAKTQISMVQVEDI
PLARASLNAPSVGDGRILPGVVVCCDSATLNSNAKPHNHFALPLPSTWILSSRGTQR

>gi568815575r:114905197_115117269|GENSCAN_predicted_CDS_3|354_bp
atgaagatttgcatcactccaccaggaaataaaccacaacctgcagaggtgcttgctgaa
ggcaaagggaatacagaatgggtagtagaagaagatttactcaggactccagggtacttt
agtcagatggtggtggagctagccaaaactcagatttctatggttcaagtggaggatatc
cctctagccagggctagtttaaatgctccttctgtaggtgatggcagaattctgcctggt
gttgtggtctgctgtgacagtgcaacactgaattccaatgcaaagccccacaatcacttt
gctctccctctcccaagcacatggattctctcttcgcgtggcactcagaggtga

>gi568815575r:114905197_115117269|GENSCAN_predicted_peptide_4|472_aa
MDEESEGGCSGKVVFPWILPQPLLAKLPSVSTLFCHHWSADVCWCLLVCSSALLDVQLLV
SMSAMVSSFYGHRMGDMGVSLICLIKGGSMPRPHSNKQAGGKEKQRGNIKVLNLGEMAFV
CLAIGCLYTFLISTTFGCTSSSDTEIKVNPPQDFEIVDPGYLGYLYLQWQPPLSLDHFKE
CTVEYELKYRNIGSETWKTIITKNLHYKDGFDLNKGIEAKIHTLLPWQCTNGSEVQSSWA
ETTYWISPQVKPLPPVYLTFTRESSCEIKLKWSIPLGPIPARCFDYEIEIREDDTTLVTA
TVENETYTLKTTNETRQLCFVVRSKVNIYCSDDGIWSEWSDKQCWEGEDLSKKTLLRFWL
PFGFILILVIFVTGLLLRKPNTYPKMNGSGLAMEGKQTTRVVRLEEEALPQAHFGDSVTV
GSFQIATDSVVHGHKGTGALQQFGGHQTVIGVRGAEKQPHLPFPRVFKFFGG

>gi568815575r:114905197_115117269|GENSCAN_predicted_CDS_4|1419_bp
atggatgaggagtcagaaggaggatgtagtgggaaggtggtcttcccctggatactcccc
caaccgctcctggccaaactcccctcggtgtccacgttgttctgccatcactggtctgct
gatgtctgctggtgtctgctggtgtgttcctctgctcttcttgatgtccaactgctcgtg
tccatgtccgctatggtctcaagtttttatgggcacaggatgggggacatgggtgtttcc
ctcatttgtttgatcaaaggtggctcaatgccaagaccacattctaacaagcaggctgga
ggaaaggaaaaacagagaggcaatatcaaggttttaaatctcggagaaatggctttcgtt
tgcttggctatcggatgcttatatacctttctgataagcacaacatttggctgtacttca
tcttcagacaccgagataaaagttaaccctcctcaggattttgagatagtggatcccgga
tacttaggttatctctatttgcaatggcaacccccactgtctctggatcattttaaggaa
tgcacagtggaatatgaactaaaataccgaaacattggtagtgaaacatggaagaccatc
attactaagaatctacattacaaagatgggtttgatcttaacaagggcattgaagcgaag
atacacacgcttttaccatggcaatgcacaaatggatcagaagttcaaagttcctgggca
gaaactacttattggatatcaccacaagttaaacctttgccgccagtctatcttactttt
actcgggagagttcatgtgaaattaagctgaaatggagcatacctttgggacctattcca
gcaaggtgttttgattatgaaattgagatcagagaagatgatactaccttggtgactgct
acagttgaaaatgaaacatacaccttgaaaacaacaaatgaaacccgacaattatgcttt
gtagtaagaagcaaagtgaatatttattgctcagatgacggaatttggagtgagtggagt
gataaacaatgctgggaaggtgaagacctatcgaagaaaactttgctacgtttctggcta
ccatttggtttcatcttaatattagttatatttgtaaccggtctgcttttgcgtaagcca
aacacctacccaaaaatgaatggcagtggcttggcaatggaaggcaaacaaaccacaaga
gttgtcagactggaggaggaagctctgccccaagctcattttggtgatagtgttacagta
ggaagtttccaaattgccactgacagtgttgtccatggtcacaagggcacaggggctctc
caacagtttggtggtcatcagactgtcataggggtgaggggagcagagaagcagccccac
ctaccctttccacgtgtcttcaagttctttgggggttga

>gi568815575r:114905197_115117269|GENSCAN_predicted_peptide_5|40_aa
MVYTKLKRRCKTKFDADLAGCSPDVIVEAIFNHIDNVNIH

>gi568815575r:114905197_115117269|GENSCAN_predicted_CDS_5|123_bp
atggtctacaccaaattgaagagaaggtgtaaaaccaaatttgatgcagacctggcaggc
tgtagcccagatgtaatagtggaagccatatttaaccatattgacaatgtcaacatacac
tag

>gi568815575r:114905197_115117269|GENSCAN_predicted_peptide_6|254_aa
MKDDERNMSALVGVHDGAQERLRVGLQVLLDVHDLVPGEECLWRRLGRQGGTSYVILDTN
HECGECGDKSHKCKQCGKALISLKTVQRNRVTHTLIFSVHFVKHQRTHTGEKAYKCTQCS
KAFSFFSSCRIHERIHTREKPYELVGAQLISAKGKYKFTASIKVFLQSADGCVYNPLPRH
RVQIGAFLQSADWCIYNPLARHRVLTGNKETQGEGDKQENGQLVEQSEHTHLSMKFAVLY
DYGSCHSNVMMITD

>gi568815575r:114905197_115117269|GENSCAN_predicted_CDS_6|765_bp
atgaaggatgatgagagaaacatgtctgcactggtgggggttcatgacggcgctcaagag
cgcctccgagtcggtctccaagtcctcctggacgtgcacgatctggtgcctggtgaagaa
tgcctatggcgccgcctgggtcgccagggaggcacatcatatgtcatactggacacaaac
catgagtgtggggaatgtggagataagtcacataaatgtaaacaatgtgggaaagcctta
atttctctcaagactgttcaaagaaatagagtaacacacactttgattttctcagttcat
tttgtaaaacatcagagaacgcacacaggagagaaagcttataaatgtacacaatgtagt
aaagccttcagtttcttcagtagttgtagaatacatgaaagaattcacactagagaaaaa
ccctatgaattggtaggtgctcaattaatatctgctaaagggaaatacaagttcacagcc
agcattaaggtgtttttacagagtgctgatgggtgcgtttacaatcctttacctagacac
agagtgcagattggtgcatttttacagagtgctgattggtgcatttacaatcctttagct
agacatagagtactgactgggaataaagagacccaaggtgagggagataaacaggagaat
ggtcagttggtggagcagtcagagcacacacatttatccatgaagtttgccgtcttatat
gactatggctcatgtcactccaatgtaatgatgatcacagattaa

>gi568815575r:114905197_115117269|GENSCAN_predicted_peptide_7|133_aa
MDKFLNTYTLPRLNQEEVESLNRPITGSEIVAIINSLPGKKSPGPDGFTAELYQRYKEEL
VPFLLKLFQSIEKEGIHPNSFDEASIILIPKPGRDTTTKENFRPISLMNIDAKILNKILA
KRIQQHTKKAYPP

>gi568815575r:114905197_115117269|GENSCAN_predicted_CDS_7|402_bp
atggataaattcctcaacacatacaccctcccaagactaaaccaggaagaagttgaatct
ctgaatagaccaataacaggctctgaaattgtggcaataatcaatagcttaccaggcaaa
aaaagtccaggaccagatggattcacagccgaattataccagaggtacaaggaggagctg
gtaccattccttctgaaactattccaatcaatagaaaaagagggaatccaccctaactca
tttgatgaggccagcatcatcctgataccaaagccgggcagagacacaaccacaaaagag
aattttagaccaatatccttgatgaacattgatgcaaaaatcctcaataaaatactggca
aaacgaatccagcagcacaccaaaaaagcttatccaccatga

>gi568815575r:114905197_115117269|GENSCAN_predicted_peptide_8|260_aa
MELKAKARELREECRSLRSRCDQLEESVSVMEDEMNEMKQDGKFREKRIKRNEQSLQEIW
DYVKRPNLHLIGVPESDGENGTKLEKTLQDIIQENFPNLARQANIQIQEIQRTPQRYSSR
RATPRHIIVRFTKVEMKEKMLRAAREKGRVTHKGKPIRLTADLSAETLQARREWGPIFNI
LKEKNFQPRISYPAKLSFISEGEIKSFTDKQMLRDFVTTRPALKELLKEALNMERNNRYQ
PLQKHAKMLRPSRLGRNCIN

>gi568815575r:114905197_115117269|GENSCAN_predicted_CDS_8|783_bp
atggagctgaaagccaaggctcgagaactacgtgaagaatgcagaagcctcaggagccga
tgcgatcaactggaagaaagtgtatcagtgatggaagatgaaatgaatgaaatgaagcaa
gacgggaagtttagagaaaaaagaataaaaagaaacgagcaaagcctccaagaaatatgg
gactatgtgaaaagaccaaatctacatctgattggtgtacctgaaagtgacggggagaat
ggaaccaagttggaaaagactctgcaggatattatccaggagaacttccccaatctagca
aggcaggccaacattcagattcaggaaatacagagaacgccacaaagatactcctcgaga
agagcaactccaagacacataattgtcagattcaccaaagttgaaatgaaggaaaaaatg
ttaagggcagccagagagaaaggtcgggttacccacaaagggaagcccatcagactaaca
gctgatctctcggcagaaactctacaagccagaagagagtgggggccgatattcaacatt
cttaaagaaaagaattttcaacccagaatttcctatccagccaaactaagcttcataagt
gaaggagaaataaaatcctttacagacaagcaaatgctgagagattttgtcaccaccagg
cctgccctaaaagagctcctgaaggaagcactaaacatggaaaggaacaaccggtaccag
ccactgcagaaacatgccaaaatgttaagaccatcaaggctaggaagaaactgcatcaac
taa

>gi568815575r:114905197_115117269|GENSCAN_predicted_peptide_9|280_aa
MTELLQGWVATIIVALGCHFSPAGTRETKQFETRRNSPQCSTVVTADHEIQQQQQQQQQQ
QQQQQHFKPITLMNIDAKFLSKILANQIQQHIKKLINTINIRSSVTEIRQEKEIKGIQIE
REEVKLSLFADDMILYLENPIVSAQKLLKISNFSEISRNKINVQNSQAFLYTDNRQAESQ
IMNELPFTIATKRIKHLGVQLTRKGSEGPLQGELQTTAQGNQREHKQMGKHSILTDKKKK
YCENGQTAQGTWMEMEAIILSKHTQEQKTKHHMFLLISGS

>gi568815575r:114905197_115117269|GENSCAN_predicted_CDS_9|843_bp
atgactgagctcctgcagggatgggttgccaccatcattgtagctctaggctgccatttt
tctcctgctggtaccagggagactaaacagtttgaaaccaggaggaattccccacagtgc
agcacagtggttacagcagatcacgagatacaacaacaacaacaacaacaacagcagcag
caacaacaacaacaacacttcaagccaataaccctgatgaacatcgatgcaaaattcctc
agtaaaatactggcaaaccaaatccagcagcacatcaaaaagcttatcaatacgatcaat
attagaagttctgtcacggaaatcaggcaagaaaaagaaataaaaggtattcaaatagaa
agagaggaagtcaaactgtctctttttgcagatgacatgatcctatatttagaaaaccct
attgtctcagcccaaaagcttcttaagataagcaacttcagcgaaatctccagaaacaaa
atcaatgtgcaaaattcacaagcattcctatataccgataacagacaagcagagagccaa
atcatgaatgaactcccattcacaattgctacaaagagaataaaacacctaggagtacag
ctaacaaggaagggaagtgaaggacctcttcaaggagaactacaaaccactgctcaagga
aatcagagagaacacaaacaaatgggaaaacattccatactcacagataagaagaagaaa
tactgtgaaaatggccaaactgcccaagggacatggatggaaatggaagccattatcctc
agcaaacatacgcaggaacagaaaaccaaacaccacatgttcttgcttataagtggaagc
tga

>gi568815575r:114905197_115117269|GENSCAN_predicted_peptide_10|39_aa
ERLCLPHHILEERGLVKVGVTVQALLELPTTKASQLSVA

>gi568815575r:114905197_115117269|GENSCAN_predicted_CDS_10|120_bp
gaaagactttgtttgcctcatcatattttggaagaacgaggacttgtgaaagttggtgtc
acagttcaggcgctccttgaattaccaacaaccaaggcatctcagctttctgtggcttaa