GENSCAN 1.0 Date run: 4-Nov-116 Time: 22:48:27

Sequence gi568815594f:185043373_185246968 : 203596 bp : 42.23% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.06 PlyA -    215    210    6                               1.05
 1.05 Term -   3144   2735  410  1  2   24   35   272 0.292   9.89
 1.04 Intr -   4027   3845  183  2  0   81   -1   152 0.549   4.44
 1.03 Intr -   6296   5565  732  0  0   77   40   258 0.310  10.38
 1.02 Intr -   7379   6665  715  2  1   47   74   519 0.897  36.48
 1.01 Init -   7676   7440  237  2  0   72    2   310 0.851  18.86
 1.00 Prom -   7985   7946   40                              -7.05

 2.00 Prom +   9269   9308   40                              -9.15
 2.01 Init +  10231  10592  362  0  2   72   -7   101 0.073  -4.33
 2.02 Intr +  14701  14861  161  1  2   94   44   112 0.550   6.11
 2.03 Intr +  19876  19962   87  2  0  115   75   110 0.837  11.42
 2.04 Term +  28196  28311  116  0  2  114   33   111 0.621   5.95
 2.05 PlyA +  28824  28829    6                               1.05

 3.08 PlyA -  29844  29839    6                               1.05
 3.07 Term -  37809  37676  134  1  2   85   41    50 0.175  -2.73
 3.06 Intr -  42820  42730   91  1  1   72   55   109 0.428   4.65
 3.05 Intr -  50721  50611  111  0  0   76   26    88 0.042   1.06
 3.04 Intr -  68207  68038  170  0  2   61   23   151 0.040   4.64
 3.03 Intr -  77420  77312  109  2  1   53   82    53 0.032   0.14
 3.02 Intr -  83766  83621  146  1  2   -1   92   128 0.285   3.38
 3.01 Init -  83882  83807   76  2  1   96   89    45 0.303   6.72
 3.00 Prom -  83924  83885   40                              -6.95

 4.03 PlyA -  83940  83935    6                              -0.45
 4.02 Term -  84618  84231  388  2  1   63   35   199 0.611   5.53
 4.01 Init -  85354  85296   59  1  2   66   80   120 0.967   7.83
 4.00 Prom -  93015  92976   40                              -7.25

 5.00 Prom +  97665  97704   40                              -6.95
 5.01 Init + 100001 100111  111  1  0  103   94   103 0.995  10.79
 5.02 Intr + 101392 101878  487  0  1  114   76   485 0.920  41.56
 5.03 Intr + 102387 102527  141  1  0   83   86   162 0.997  14.90
 5.04 Term + 103442 103599  158  0  2   66   35   122 0.813   1.81
 5.05 PlyA + 103880 103885    6                               1.05

 6.05 PlyA - 103915 103910    6                               1.05
 6.04 Term - 119553 119426  128  1  2   82   41   120 0.681   4.16
 6.03 Intr - 120807 120657  151  0  1   31   71    99 0.429   1.31
 6.02 Intr - 132679 132414  266  2  2   94  111   156 0.869  14.81
 6.01 Init - 147824 146771 1054  2  1  103   86   733 0.985  69.00
 6.00 Prom - 149070 149031   40                             -11.04

 7.00 Prom + 152303 152342   40                              -4.15
 7.01 Init + 166455 166883  429  2  0   78   98   307 0.265  25.00
 7.02 Intr + 167169 167289  121  2  1   16   52   167 0.227   5.05
 7.03 Intr + 175156 175198   43  2  1   71   84    24 0.024  -3.32
 7.04 Intr + 183649 183825  177  1  0   85   49   149 0.298   8.81
 7.05 Intr + 190880 190950   71  2  2   75   25   104 0.093   0.71
 7.06 Intr + 196446 196486   41  1  2   81   82    35 0.392  -0.78
 7.07 Intr + 198933 199023   91  2  1   54   84   122 0.936   7.05
 7.08 Term + 199098 199204  107  1  2   44   51   154 0.842   4.89
 7.09 PlyA + 200331 200336    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815594f:185043373_185246968|GENSCAN_predicted_peptide_1|758_aa MGQVWGLVRSTLELFHTEDEEEPEYSEVTEEVTERVYLPAKAKAAKEGKVHPYPSAPPPH YFEEKDPPDLSFPEDAGQKAGIQQARQEGDLEAWQFPVRIHPPDQQGNIIATFEPFPFKL LKEFKQAINQYGPGSPFVMGLLKNVAVSSRMIPTDWDALTRACLTPAQFLQFKTRWADEA SIQAARNAQAQPQINITAHQLLGVGGWAGLHAQLVMGDDAIEQLRGVCIRAWEKITSSGE QYPSFSAIKQRPKEPYVDFIARLQKSLKKMIADSAAQDIVLQLLAFDNANPDCQAALRPI RGKVHLIDYINACDGIGASSASVDLCCTKAVSLLPGEPPQKVPTGVCEPLPVGTIELLLG RSSIGLKGVQIHTGVIDSDSNGEIQIVVSTSVPWKAEPGGRIAQLLIVPYVGMRKSEIKR TGGFGSTNKQGKAAYWVNQITDKRATCEITIQGKKFKGLVDTGADISIISLQHWPSAWPI QPAQFNVAGVGKAAEVYQSSYILHCEGPDGQPGTIQPIITSVPKNLWGRDLLQQWGAQVL IPEQLYSPHNQHTMHEMGYVPGQKAELVAVIEVLTAFDMPINVISDSSYVVHSTQLIENA QLRFHTDEQLMTLFIQLQTAVRRQMFSAAEQHLQKPAAKTEAEQLIWWREPITKSWEIGK MITWGRGYACITPGQNQQLIWIPSRHLEPCHEPDAKEETPGGSRGPPGCSHVKTDAEEDP TCHEQHPLNTATHLGTDQEAVIDGGRKPEESGTTSHNE >gi568815594f:185043373_185246968|GENSCAN_predicted_CDS_1|2277_bp atgggacaggtatggggtctggttcgttccaccttggaactttttcacactgaggacgag gaggaaccagaatatagcgaagtaacagaagaggttacagagcgtgtttatttaccagct aaagctaaagcagcaaaggaaggaaaggttcatccctacccttctgcaccccctcctcat
tattttgaagaaaaagaccctccagatctttcttttccagaggacgctgggcaaaaagca ggaattcagcaagccagacaagagggtgatttagaggcttggcagttccctgttagaata caccccccagatcaacagggaaatattatagctacttttgagccttttccttttaaatta ctcaaagaatttaaacaagctataaatcagtatggaccaggttctccttttgtaatggga ctgttaaagaatgttgctgtttccagtcgaatgattcctactgactgggacgctcttact cgagcttgtctaactcctgctcagttcttacagtttaaaactcggtgggcagatgaagct tccattcaggctgctcgcaatgcccaggcccaacctcaaattaatataactgcacaccaa cttttgggggttggcggctgggctggtttacatgcacaactggtcatgggggatgatgcc atagaacagcttagaggagtgtgcattagagcttgggaaaaaatcacttcaagtggagaa caatacccttcctttagtgctataaaacagcgacccaaagaaccatacgttgattttata gctcggttacagaagtctcttaaaaagatgattgcagattcggctgctcaggatatagtg ttgcagttattagctttcgacaatgctaatcccgattgccaggctgctctgcgacctatc agggggaaagtgcatttaattgattatatcaatgcctgtgatggcatcggagccagtagt gcttcagtagatttatgctgcacaaaagctgtgagccttctgcctggggaacccccgcaa aaggtcccaacaggagtctgcgaacccttgccagtggggacgatagaattacttttagga aggtctagtataggtttaaaaggggtacaaatacatacaggagtcattgattcagattcc aatggggaaattcaaattgttgtatctacttctgttccctggaaagcagagccaggaggg cgtatagcacagctcctgattgtgccgtatgtgggaatgagaaaaagtgaaattaaacga acaggaggatttggaagcacaaataaacaaggcaaagcagcttattgggtaaatcaaatt actgataaacgtgctacctgtgaaataactattcagggaaagaaatttaaaggtttggta gatacaggagcagacatttcaatcatttctctacagcactggccatctgcgtggccaatt caacccgctcaatttaatgtggctggagttggtaaagccgctgaagtatatcaaagtagt tatatcttgcattgtgaagggcctgatggacaacctgggactattcaaccaattataact tctgtacctaaaaatttatggggaagagatttattacaacaatggggagcacaagttcta attccagaacaattatatagccctcataatcaacatacgatgcatgaaatggggtatgtc cctggtcaaaaagcagagcttgtagctgtaattgaggtattgactgcttttgatatgcct attaatgtgatttctgattcttcatacgtggttcattccacgcagttaattgaaaatgct cagttacgatttcatacagatgaacaactgatgactttatttatccaattgcaaacagca gttagaagacagatgttctcagcagctgaacagcatctacagaaaccagctgcaaagaca gaagcagaacaattgatttggtggagagagccaataacaaaaagttgggaaataggtaaa atgataacttggggtagaggttatgcttgtattactccaggccaaaatcaacagctgatt tggataccatcaagacacctggaaccttgtcatgagccagatgccaaggaagagactcca 
ggaggatcccgaggaccccctggttgcagccatgtcaagaccgatgctgaggaggacccc acctgtcacgagcaacacccgttgaacacagccacccacctggggacagatcaagaagct gtcatagatggcggaagaaaacctgaggaaagcgggacaaccagtcacaatgagtaa >gi568815594f:185043373_185246968|GENSCAN_predicted_peptide_2|241_aa MSLRARPQPAFKRKKKATLLHSSPAPVKTSIFSEDQEKTASGTQYGWAGAGPGAVTEGQP SRVKGEVAQLLQDPGDLTQAHSSAPDKEATKTWQPLMRYFCAWANEGLLQEASDEKPAWL LFEVWGGEGTQRSLRWKKPYRCASPPCSVPYKGHPSLQTSSPELLCLCVRARFASLPQEY EGQSSSSCWSHDVTLKAGNSALEELIREDREVSVGGSKEPPSLQLRLRGLRRFIHKATLH K >gi568815594f:185043373_185246968|GENSCAN_predicted_CDS_2|726_bp atgagcctgcgcgcccgaccccagccagcttttaaaaggaagaaaaaagcaaccctcctc cactcctctccagcccccgtgaagaccagcatcttttcagaagaccaggaaaaaacagcc tcgggcactcagtacggatgggcaggtgccgggcctggtgctgtcactgaggggcagccc agcagggttaaaggggaggtggcccagctcctccaggacccgggagacctgacacaggca cactcaagtgcgccggataaagaggctacaaaaacatggcagcccctaatgagatatttt tgcgcttgggcgaacgaagggctgcttcaggaggctagtgacgaaaaacctgcctggctt ttgtttgaagtttggggtggtgagggaactcaaaggtctctgagatggaagaaaccttac agatgtgcttctccgccctgctcagtaccttacaaaggtcatccctccctgcaaacttca tcccctgagctcctgtgcctctgcgttcgagctcgatttgccagcctgccacaagaatac gaaggccagagctccagcagctgttggagccacgatgtgaccctgaaggcaggaaactca gctttggaagaactgatccgagaggacagggaagtcagtgtgggcggctctaaggagcct ccctctctacagctgcggctcagaggattacggcgcttcatccacaaggcaacattgcac aagtag >gi568815594f:185043373_185246968|GENSCAN_predicted_peptide_3|278_aa MDAARGHVLSQLAPGTENQILHVLTYGNNEHLGAKSGKRGRGPTAEKRPIGYHVHYLVNG FNKSPNLSIMRYTQANHLHGAPWLLQAIYAPLSQGCEAGRSLKKGSAGRTGSLVTLLLMF RLIDMGSRVAEGSGTHVKEPTRRKMFRLTDMGSRALCYPGPGTASAYVWGTGPKTALEFS YSLEGLTELTESCDTHSYGLLQGKLPADLHAQTGAVHRLQQEKGLLACGNHLLQHSRDTM VNRQMMKSSPGDDWCLVPSCTKPVFMEFTREVETKDRQ >gi568815594f:185043373_185246968|GENSCAN_predicted_CDS_3|837_bp atggatgcagctagaggccatgtcctaagccaattagcaccaggaacagaaaaccagatc ctgcatgttctcacttatgggaacaatgagcacttgggcgccaaaagtggcaagcggggg agggggccaacagctgaaaaacgacctatcgggtaccatgttcactatttggttaatggg ttcaataaaagcccaaacctcagcatcatgcgatatacccaggccaatcatctccatggg 
gccccgtggctgctacaagccatttacgccccgctgagtcagggctgtgaagctgggaga tctctgaagaagggttcagctggcaggacaggctctctagtgactctcctactgatgttc agactcattgacatgggttccagggtagcagagggctctggaacccatgtcaaggagcct acccgcaggaagatgttcaggctcactgacatgggttccagagccctctgctatcctggt cctggcaccgcctcagcatatgtttgggggaccggtcccaaaaccgcccttgagttcagt tattcactagagggactcacagagctcactgagagctgtgatactcacagttatggttta ttacagggaaagctccctgcagacttgcatgctcagacgggagctgtccaccgcctgcaa caggaaaagggcttgcttgcgtgtggtaaccaccttcttcaacattctagggatacaatg gtgaacaggcaaatgatgaagtcaagtcctggagatgactggtgtttagtccctagttgc actaaacctgttttcatggagtttacacgagaagtggagacaaaagacagacaataa >gi568815594f:185043373_185246968|GENSCAN_predicted_peptide_4|148_aa MRGVRRGPGAEGQAGLATPSTHPGHRTPVLSRGSSKPWFSPDREFRSVRAPFRPPAPSVP PQGPEAALGAQRIPRRLEPQPRRFVAAVQETAGEGARGAETACRGLCSPGASFGLAEPDS QPALTTAQRLARWVSEPPTLKDTDLART >gi568815594f:185043373_185246968|GENSCAN_predicted_CDS_4|447_bp atgcgcggggtgcgtcggggtcccggcgccgagggccaggcggggctggcgacacccagc acacatcccggacacagaacgcccgtactttcccgagggtcttccaagccctggttcagc ccggaccgtgagttccggagcgtccgcgcccccttccgccctcccgcaccctcagtcccg cctcagggccccgaagccgccctgggcgcgcagcgcatccccaggcgactggagccccag ccccgacgcttcgtcgccgcagtccaggagaccgcaggagaaggggctcgtggggcggag acagcctgccggggcctctgcagcccgggagcctcctttggactcgccgaacccgactcc caacccgcccttaccactgcccagaggctcgcgcgctgggtttctgagcctcccaccctg aaggacaccgatctcgcccggacataa >gi568815594f:185043373_185246968|GENSCAN_predicted_peptide_5|298_aa MGDHAWSFLKDFLAGGVAAAVSKTAVAPIERVKLLLQVQHASKQISAEKQYKGIIDCVVR IPKEQGFLSFWRGNLANVIRYFPTQALNFAFKDKYKQLFLGGVDRHKQFWRYFAGNLASG GAAGATSLCFVYPLDFARTRLAADVGKGAAQREFHGLGDCIIKIFKSDGLRGLYQGFNVS VQGIIIYRAAYFGVYDTAKGMLPDPKNVHIFVSWMIAQSVTAVAGLVSYPFDTVRRRMMM QSGRKGADIMYTGTVDCWRKIAKDEGAKAFFKGAWSNVLRGMGGAFVLVLYDEIKKYV >gi568815594f:185043373_185246968|GENSCAN_predicted_CDS_5|897_bp atgggtgatcacgcttggagcttcctaaaggacttcctggccgggggcgtcgccgctgcc gtctccaagaccgcggtcgcccccatcgagagggtcaaactgctgctgcaggtccagcat gccagcaaacagatcagtgctgagaagcagtacaaagggatcattgattgtgtggtgaga 
atccctaaggagcagggcttcctctccttctggaggggtaacctggccaacgtgatccgt tacttccccacccaagctctcaacttcgccttcaaggacaagtacaagcagctcttctta gggggtgtggatcggcataagcagttctggcgctactttgctggtaacctggcgtccggt ggggccgctggggccacctccctttgctttgtctacccgctggactttgctaggaccagg ttggctgctgatgtgggcaagggcgccgcccagcgtgagttccatggtctgggcgactgt atcatcaagatcttcaagtctgatggcctgagggggctctaccagggtttcaacgtctct gtccaaggcatcattatctatagagctgcctacttcggagtctatgatactgccaagggg atgctgcctgaccccaagaacgtgcacatttttgtgagctggatgattgcccagagtgtg acggcagtcgcagggctggtgtcctacccctttgacactgttcgtcgtagaatgatgatg cagtccggccggaaaggggccgatattatgtacacggggacagttgactgctggaggaag attgcaaaagacgaaggagccaaggccttcttcaaaggtgcctggtccaatgtgctgaga ggcatgggcggtgcttttgtattggtgttgtatgatgagatcaaaaaatatgtctaa >gi568815594f:185043373_185246968|GENSCAN_predicted_peptide_6|532_aa MDQFGDILEGEVDHSFFDSDFEEGKKCETNSVFDKQNDDPKERIDKDTKNVNSNTGMQTT ENYLTEKGNERNVKFPPEHPVENDVTQTVSSFSLPASSRSKKLCDVTTGLKIHVSIPNRI PKIVKEGEDDYYTDGEESSDDGKKYHVKSKSAKPSTNVKKSIRKKYCKVSSSSSSSLSSS SSGSGTDCLDAGSDSHLSDSSPSSKSSKKHVSGITLLSPKHKYKSGIKSTETQPSSTTPK CGHYPEESEDTVTDVSPLSTPDISPLQSFELGIANDQKVKIKKQENVSQEIYEDVEDLKN NSKYLKAAKKGKEKHEPDVSSKSSSVLDSSLDHRHKQKVLHDTMDLNHLLKAFLQLDKKG PQKHHFDQPSVAPGKNYSFTREEVRQIDRENQRLLKELSRQAEKPGSKSTIPRSADHPPK LYHSALNRQKEQQRIERENLALLKRLEAVKPTVGMKRSEQLMDYHRNMGYLNSSPLSRRA RSTLGQYSPLRASRTSSATSGLSCRSERSAVDPSSGHPRRRPKPPNVRTAWL >gi568815594f:185043373_185246968|GENSCAN_predicted_CDS_6|1599_bp atggatcagtttggagatatattagaaggtgaagtggaccattctttctttgacagtgac tttgaagaaggaaagaaatgtgaaactaactcagtttttgacaagcaaaatgatgaccca aaggaaagaatagataaagatacaaaaaatgtaaattcgaacactggaatgcaaacaaca gaaaattatcttactgagaagggaaatgaaagaaacgtgaaatttcccccagaacacccc gtagagaatgatgttacacaaactgtaagttctttctcattgccagcctcttcaagatca aaaaaattgtgtgatgttacaacaggacttaaaatacacgtgtccattccaaatagaatt cccaaaattgtaaaagaaggtgaagatgattactacacagatggagaggaaagcagtgat gatgggaagaaataccatgtgaagtccaagtccgctaaaccatctactaacgttaaaaaa agcataaggaaaaagtattgcaaagttagctcctcttcctcctcctctttatcttcctca 
tcttcaggttcaggtacagattgtttagatgcagggtctgatagccatctatctgattcg tctccgtcatctaagtcatctaagaaacatgtatctggtataaccctcctgtcaccaaaa cacaagtataaatcaggaataaaatcgacagaaacacagccttcaagtactacaccaaaa tgtggccactaccctgaggagtctgaagatactgtgactgacgtaagtcccttatcaact ccagacattagccctcttcagtcttttgaactgggcatagcaaatgatcaaaaagtgaaa attaaaaagcaagaaaatgtgagccaagaaatatatgaagatgttgaggatttgaaaaat aattcaaaatatttgaaagcagccaaaaaagggaaagaaaaacatgagcctgatgtctcc tcaaagtcgtcttcagtgttagactccagtttagaccacagacataaacagaaagtctta catgacacaatggatctgaatcatctcttgaaagcttttctgcaattagataaaaaagga ccacaaaaacatcactttgatcagccttcagtagcacccgggaaaaactactctttcaca agagaagaggtgagacagatcgatcgggaaaatcagaggcttttgaaagaactgtcaaga caggcggaaaagccgggaagcaaaagtacaattcctagatcggctgatcatcccccaaag ttatatcacagtgctctcaacagacagaaggaacaacaaaggattgagagagaaaacttg gctttattgaaaaggcttgaggccgtgaaaccaacagttggtatgaaacgttcagaacaa ctgatggactatcatcgcaatatgggctatctcaactcatcaccattgtcaagacgggcc agatccactcttggccaatatagcccattaagagcttccaggacatccagtgctacgagt ggtctcagttgtaggagtgagcgatcagcggttgacccctccagtggccaccctcgaaga agacctaaaccccctaatgtccgtacagcttggttataa >gi568815594f:185043373_185246968|GENSCAN_predicted_peptide_7|359_aa MHPDATDSGGAGPSPARAAGAGGRPVSGFRGERRPESPGDAEAAAAAAPGAPGGRSWWKP VAVAALAAVALSFLGPGSGEAAGAAGLSSVLFRLSLYLSCAAAAFLLGILFALVCRSPRA QPPDFAAAWSRLAATSAARRPPGGVPRPSVSLAVGGEEGNHSVRYVQALRTVLAIRLVLL AVLASQHSAATLGTCQSCPVSRKSLRAVDQVTVSVLGTTATQDDTQKKKVPFWSKENVLR NASGSVPLTSYWPGLGRKQVIHQETLRKEPAMKVKYDGMREGDLAGSQDNSGGKHRLPGD SVRAHRLSEAQSPRALNPPPPTSVFNLLEQLTELRETLTYIYWFIVEDIAKDADEEMLR >gi568815594f:185043373_185246968|GENSCAN_predicted_CDS_7|1080_bp atgcaccccgatgcgaccgacagtggcggcgccggccccagccccgcgcgggccgcaggc gccggcggccgtcctgtctcgggcttcaggggcgagcggcggccggagtccccgggggac gcggaggcagcagcagcggcggcgccgggggccccgggcggccggagctggtggaagccc gtggcggtggccgcactcgccgccgtggccctctccttcctggggcccggcagcggggag gcggcgggggccgcggggctgagctccgtcctgttcaggctcagcctgtacctgagctgc gcggcggccgccttcctgctggggatcctgtttgccctcgtctgccggagcccgcgcgcc 
cagccgcccgacttcgccgccgcctggagccggctggccgcgacctcagccgcccgccgc ccgccggggggtgtccccagaccttcggtgtcgcttgccgtgggaggggaagaagggaat cacagcgttcgctacgtgcaggctctgcgcacggtcctggcgatccgcttggttcttctg gccgttctcgctagccagcactcagcagccactctaggcacttgccagtcctgccccgtg agtcggaagagcttgcgggctgttgaccaggtgactgtatcagttttaggcaccacagcc acgcaggatgatacccagaagaagaaagtccctttttggagcaaggaaaatgtgctcaga aatgcttctggcagtgttcccctcacatcttactggccaggtttgggcagaaagcaggtt attcaccaggaaacattacgaaaggaacctgctatgaaagtaaaatatgacggcatgaga gagggggatttggcagggtcacaggacaatagtggagggaagcatcgtctacctggagac agtgtcagagcccatagattaagtgaagctcagtccccaagagccctcaaccccccaccc ccaacgtcagtctttaatttgctggagcagctcacagaactcagggaaacgcttacttac atttactggtttattgtagaagatattgcaaaggatgcagatgaagaaatgctacggtga