GENSCAN 1.0 Date run: 7-Nov-116 Time: 01:53:44 Sequence gi568815593r:62289013_62503772 : 214760 bp : 39.25% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 3372 3534 163 2 1 52 71 110 0.734 4.33 1.02 Term + 5926 6551 626 2 2 52 42 307 0.801 16.06 1.03 PlyA + 7273 7278 6 1.05 2.00 Prom + 14407 14446 40 -4.15 2.01 Init + 14803 14846 44 0 2 61 80 18 0.238 -1.76 2.02 Intr + 16690 16750 61 1 1 96 95 14 0.294 0.72 2.03 Intr + 17305 17524 220 0 1 108 89 132 0.701 12.25 2.04 Term + 18071 18162 92 0 2 67 49 92 0.793 0.10 2.05 PlyA + 18494 18499 6 1.05 3.00 Prom + 35730 35769 40 -3.35 3.01 Init + 36299 36347 49 1 1 86 58 63 0.328 2.26 3.02 Intr + 44472 44712 241 1 1 47 32 159 0.454 1.89 3.03 Intr + 45636 45841 206 0 2 106 94 125 0.539 12.82 3.04 Intr + 58118 58212 95 0 2 70 94 67 0.039 4.26 3.05 Intr + 59036 59155 120 1 0 36 96 113 0.854 6.67 3.06 Intr + 61054 61108 55 0 1 77 88 33 0.955 -0.17 3.07 Intr + 63576 63698 123 1 0 86 36 95 0.919 3.74 3.08 Intr + 64263 64363 101 1 2 82 79 105 0.969 7.91 3.09 Intr + 66147 66242 96 2 0 93 58 65 0.826 3.29 3.10 Intr + 68679 68733 55 2 1 72 86 -14 0.893 -5.57 3.11 Intr + 69125 69287 163 0 1 64 96 93 0.941 5.81 3.12 Intr + 72230 72320 91 2 1 84 103 67 0.974 6.88 3.13 Intr + 74166 74308 143 2 2 65 99 75 0.997 4.53 3.14 Intr + 74683 74887 205 1 1 31 94 125 0.990 5.68 3.15 Intr + 76231 76341 111 0 0 88 115 85 0.999 10.86 3.16 Intr + 77402 77469 68 1 2 99 52 14 0.597 -4.32 3.17 Intr + 84675 84825 151 0 1 64 82 177 0.936 13.94 3.18 Intr + 88649 88750 102 1 0 65 103 91 0.994 7.75 3.19 Intr + 92106 92241 136 2 1 78 43 176 0.894 11.32 3.20 Term + 96472 96557 86 2 2 81 40 91 0.804 0.44 3.21 PlyA + 97349 97354 6 1.05 4.12 PlyA - 97963 97958 6 1.05 4.11 Term - 101807 101747 61 1 1 49 42 89 0.743 -3.30 4.10 Intr - 101970 101864 107 0 2 52 91 126 0.903 7.39 4.09 Intr - 105035 104943 93 2 0 85 94 84 0.990 7.94 4.08 Intr - 105595 105472 124 0 1 127 106 61 0.997 11.47 4.07 Intr - 107915 107780 136 0 1 -12 35 102 0.227 -6.39 4.06 Intr - 109498 109417 82 1 1 83 82 73 0.252 4.49 4.05 Intr - 109727 109634 94 1 1 91 91 56 0.941 5.25 4.04 Intr - 109869 109808 62 0 2 83 94 39 0.915 0.61 4.03 Intr - 113110 113024 87 1 0 53 70 76 0.719 1.55 4.02 Intr - 114334 114261 74 2 2 105 98 9 0.849 1.81 4.01 Init - 114760 114682 79 1 1 101 64 119 0.446 10.11 4.00 Prom - 119269 119230 40 -4.25 5.04 PlyA - 119294 119289 6 1.05 5.03 Term - 124075 123444 632 2 2 52 37 233 0.490 8.09 5.02 Intr - 138513 138410 104 2 2 101 18 70 0.054 0.20 5.01 Init - 143405 143248 158 2 2 73 47 124 0.232 6.23 5.00 Prom - 144788 144749 40 -5.85 6.00 Prom + 146183 146222 40 -5.75 6.01 Init + 148268 148405 138 1 0 49 119 95 0.504 8.89 6.02 Intr + 153971 154071 101 1 2 82 89 41 0.953 1.59 6.03 Intr + 160915 160987 73 1 1 108 92 63 0.996 7.19 6.04 Intr + 162718 162921 204 0 0 102 95 150 0.879 15.57 6.05 Intr + 176986 177093 108 0 0 49 89 57 0.263 1.46 6.06 Intr + 178119 178251 133 2 1 107 74 21 0.389 1.90 6.07 Intr + 181238 181296 59 0 2 77 115 52 0.394 4.48 6.08 Intr + 194089 194281 193 0 1 94 86 71 0.548 5.64 6.09 Intr + 194998 195150 153 2 0 84 46 64 0.503 0.92 6.10 Intr + 198829 198849 21 2 0 94 116 6 0.280 0.60 6.11 Intr + 200290 200337 48 2 0 73 84 46 0.557 0.43 6.12 Term + 201103 201218 116 2 2 95 38 113 0.958 4.75 6.13 PlyA + 202339 202344 6 1.05 7.03 PlyA - 202597 202592 6 1.05 7.02 Term - 208075 207948 128 2 2 57 48 75 0.180 -2.14 7.01 Init - 213122 213071 52 2 1 58 121 45 0.774 6.17 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 58135 58212 78 0 0 51 94 73 0.899 5.31 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593r:62289013_62503772|GENSCAN_predicted_peptide_1|262_aa PPLRIPRQTWTGLDLQQTPTDLQLRVLTVRRKTNKQKGHPHQKPICTSPSSKTKEIQSTI REYYKHLYVNKLENLEEMDKFLDTYTLPRQNQEEVESLNRPITGSEIEAIINSLPTKKSP GPDGFTAEFYQRYKEELLPFLLKLFQSIEKEGILPNSFYEASIILIPKPGRDTTKKENFT PISLMNITAKMLNEILANRIQQHIKKLIHHDQMGFIPGMQGWFNLCKSVNIIQHINRIND KNHMIISIDAEKVFDKIQQTSC >gi568815593r:62289013_62503772|GENSCAN_predicted_CDS_1|789_bp cctccgctgcggatacccaggcaaacatggactggactggacctccagcaaactccaaca gacctgcagctgagggtcctgactgttagaaggaaaactaacaaacagaaaggacatcca caccaaaaacccatctgtacgtcaccatcatcaaagaccaaagaaatacaatctaccatc agagaatactataaacacctctacgtaaataaactagaaaatctagaagaaatggataaa ttcctcgacacatacacgctcccaagacaaaaccaggaagaagttgaatctctgaataga ccaataacaggctctgaaattgaggcaataattaatagcttaccaaccaaaaaaagtcca ggaccagatggattcacagccgaattctaccagaggtacaaggaggagctgttaccattc cttctgaaactattccaatcaatagaaaaagagggaatcctccctaactcattttatgag gccagcatcatcctgataccaaagcctggcagagacacaacaaaaaaagagaattttaca ccaatatccctgatgaacatcactgcaaaaatgctcaatgaaatattggcaaaccgaatc cagcagcacatcaaaaagcttatccaccatgatcaaatgggcttcatccctgggatgcaa ggctggttcaacctatgcaaatcagtaaacataatccagcatataaacagaatcaatgac aaaaaccacatgattatctcaatagatgcagaaaaggtctttgacaaaattcaacaaact tcatgctaa >gi568815593r:62289013_62503772|GENSCAN_predicted_peptide_2|138_aa MKVLVNTYADVYQKNVTGWETYTKCAWEGCLGQLPLLLAAPLAFTLSPGPARPRATAPPP TPTPPPPRLFRPPVPLPRPAAAAPDEVMATANFGKIQIGIYVEIKRSDGLPGFVVDHAAF FFATSWEKNVENIALSHE >gi568815593r:62289013_62503772|GENSCAN_predicted_CDS_2|417_bp atgaaagtcttggtaaatacatatgctgatgtgtaccaaaaaaacgtaactggatgggaa acctacaccaagtgtgcttgggagggctgcttgggtcagctacctctgctccttgcggcc ccgcttgcgttcacgctgtcgcccgggccggcgcggccgcgggcaaccgctccccctccc acacctaccccgccccctccccgccttttccgccctccggtccccctccctcggcccgct gctgctgctccagatgaggtgatggcaacggccaacttcggcaagatccagatcgggatt tacgtggagatcaagcgcagcgatggcctccccggtttcgtcgtcgaccatgctgctttc ttttttgccacttcttgggagaaaaatgtggagaatatcgcgctgagccatgaataa >gi568815593r:62289013_62503772|GENSCAN_predicted_peptide_3|798_aa MGFRHVGQAGLELLTSALAFNLCKFSEITLQKQCQAEDTVALEEMIMDISSTQLHTQKLQ LPYTRSSRYGGSLPTPYHTDSSPYSPAYLSPPQVSSCPLSLLTGPADARRSQQQLPKQFL PVSPTLSSIVQGVPLETSNLHTQPHTPKSLQQPELPSQACSAQPSGRIHQAMVTSLNEDN ESVTVEWIENGDTKGKEIDLESIFSLNPDLVPDEEIEPSPETPPPPASSAKVNKIVKNRR TVASIKNDPPSRDNRVVGSARARPSQFPEQSSSAQQNGSVSDISPVQAAKKEFGPPSRRK SNCVKEVEKLQEKREKRRLQQQELREKRAQDVDATNPNYEIMCMIRDFRGSLDYRPLTTA DPIDEHRICVCVRKRPLNKKETQMKDLDVITIPSKDVVMVHEPKQKVDLTRYLENQTFRF DYAFDDSAPNEMVYRFTARPLVETIFERGMATCFAYGQTGSGKTHVFDLLNRKTKLRVLE DGKQQVQVVGLQEREVKCVEDVLKLIDIGNSCRTSGQTSANAHSSRSHAVFQIILRRKGK LHGKFSLIDLAGNERGADTSSADRQTRLEGAEINKSLLALKECIRALGRNKPHTPFRASK LTQVLRDSFIGENSRTCMIATISPGMASCENTLNTLRYANRVKELTVDPTAAGDVRPIMH HPPNQIDDLETQWGVGSSPQRDDLKLLCEQNEEEVSPQLFTFHEAVSQMVEMEEQVVEDH RAVFQESIRWLEDEKALLEMTEEVDYDVDSYATQLEAILEQKIDILTELRDKVKSFRAAL QEEEQASKQINPKRPRAL >gi568815593r:62289013_62503772|GENSCAN_predicted_CDS_3|2397_bp atggggtttcgccatgttggccaggctggtctcgaactcctgacctcagccttggctttc aatctgtgcaagtttagtgagatcacgctgcagaagcagtgtcaggctgaggacacagtg gccctcgaggaaatgataatggacatcagctccacccagttacacacccaaaaactgcaa ctgccatacacaaggagctcccgttatggtggttctctgccaacaccctaccacactgac agctctccctatagtcctgcctacttatctcctccccaagtgtccagctgccccctaagt ttgctcacaggtccagccgatgccagaaggtcgcaacagcagctacccaaacagtttttg ccagtgtcacccaccctgtcttccatcgttcagggtgtccccctggagaccagtaatctg cacacccagccacacaccccaaagtctctacagcagccagagctgccctctcaggcctgc tcagcgcagccctcaggccgaatacatcaagcaatggtaacatctttaaatgaagataat gaaagtgtaactgttgaatggatagaaaatggagatacaaaaggcaaagagattgacctg gagagcatcttttcacttaaccctgaccttgttcctgatgaagaaattgaacccagtcca gaaacacctccacctccagcatcctcagccaaagtaaacaaaattgtaaagaatcgacgg actgtagcttctattaagaatgaccctccttcaagagataatagagtggttggttcagca cgtgcacggcccagtcaatttcctgaacagtcttcctctgcacaacagaatggtagtgtt tcagatatatctccagttcaagctgcaaaaaaggaatttggacccccttcacgtagaaaa tctaattgtgtgaaagaagtagaaaaactgcaagaaaaacgagagaaaaggagattgcaa cagcaagaacttagagaaaaaagagcccaggacgttgatgctacaaacccaaattatgaa attatgtgtatgatcagagactttagaggaagtttggattatagaccattaacaacagca gatcctattgatgaacataggatatgtgtgtgtgtaagaaaacgaccactcaataaaaaa gaaactcaaatgaaagatcttgatgtaatcacaattcctagtaaagatgttgtgatggta catgaaccaaaacaaaaagtagatttaacaaggtacctagaaaaccaaacatttcgtttt gattatgcctttgatgactcagctcctaatgaaatggtttacaggtttactgctagacca ctagtggaaactatatttgaaaggggaatggctacatgctttgcttatgggcagactgga agtggaaaaactcatgtgtttgacttgctaaacaggaaaacaaaattaagagttctagaa gatggaaaacagcaggttcaagtggtgggattacaggaacgggaggtcaaatgtgttgaa gatgtactgaaactcattgacataggcaacagttgcagaacatccggtcaaacatctgca aatgcacattcatctcggagccatgcagtgtttcagattattcttagaaggaaaggaaaa ctacatggcaaattttctctcattgatttggctggaaatgaaagaggagctgatacttcc agtgcggacaggcaaactaggcttgaaggtgctgaaattaataaaagccttttagcactc aaggagtgcatcagagccttaggtagaaataaacctcatactcctttccgtgcaagtaaa ctcactcaggtgttaagagattctttcataggtgaaaactctcgtacctgcatgattgcc acaatctctccaggaatggcatcctgtgaaaatactcttaatacattaagatatgcaaat agggtcaaagaattgactgtagatccaactgctgctggtgatgttcgtccaataatgcac catccaccaaaccagattgatgacttagagacacagtggggtgtggggagttcccctcag agagatgatctaaaacttctttgtgaacaaaatgaagaagaagtctctccacagttgttt actttccacgaagctgtttcacaaatggtagaaatggaagaacaagttgtagaagatcac agggcagtgttccaggaatctattcggtggttagaagatgaaaaggccctcttagagatg actgaagaagtagattatgatgtcgattcatatgctacacaacttgaagctattcttgag caaaaaatagacattttaactgaactgcgggataaagtgaaatctttccgtgcagctcta caagaggaggaacaagccagcaagcaaatcaacccgaagagaccccgtgccctttaa >gi568815593r:62289013_62503772|GENSCAN_predicted_peptide_4|332_aa MPKVKSGAIGRRRGRQEQRRELKSAGGLMFNTGIGQHILKNPLIINSIIDKAALRPTDVV LEVGPGTGNMTVKLLEKAKKVVACELDPRLVAELHKRVQGTPVASKLQVLVGDVLKTDLP FFDTCVANLPYQVCDKRLELLPNFERLTVQVVNIAKNESELQRIVMEMKYGFTSVILKTK QSESDAYQETEWLQSKQKQTSQEQRCAILMFQREFALRLVAKPGDKLYCRLSINTQLLAR VDHLMKVGKNNFRPPPKVESSVVRIEPKNPPPPINFQIIPEDFSIADKIQQILTSTGFSD KRARSMDIDDFIRLEILKAFKRYKVLPGRVIA >gi568815593r:62289013_62503772|GENSCAN_predicted_CDS_4|999_bp atgccgaaggtcaagtcgggggccatcggccgccgccgcgggcggcaggagcagcgccgg gagctgaagagcgctggaggactcatgttcaacacggggattgggcagcacattttgaaa aatcctctcattattaacagcattatcgataaggctgccttaagaccaactgatgtagtg ctggaagttggacctggaactggcaacatgactgtaaagttgttagaaaaggcaaaaaag gttgttgcttgtgaacttgacccaaggctagtagctgaacttcacaaaagagttcagggc acgcctgtggccagcaaacttcaagtactggtgggtgatgtgctgaaaacagatttgcca ttctttgatacttgtgtggcaaatttgccttatcaggtttgtgacaaaagactagagtta ctgccaaattttgaaagacttacagttcaggtggtgaacatagccaaaaatgagtcagaa ttacaaagaattgtaatggagatgaaatatggctttaccagtgtgatcctaaagacaaag cagagtgaaagtgatgcctaccaagaaacggagtggctccagtcaaagcaaaagcagacc agtcaagagcaaaggtgtgctatacttatgtttcaaagagaatttgccctccgactggtt gcaaaacctggagataagttatactgcagactctcaattaatacacagctgttggcacgt gtggaccatctaatgaaagtgggaaagaataacttcagaccaccgcccaaggtggaatcc agtgttgtaaggatagaacctaagaatccaccaccacccatcaattttcagataatacca gaagatttcagcatagcagataaaatacagcaaatcctaaccagcacaggttttagtgac aaacgggcccgttccatggacatagatgacttcatcagattggagattttaaaggctttt aagcggtataaggtgctacctgggagagttattgcatag >gi568815593r:62289013_62503772|GENSCAN_predicted_peptide_5|297_aa MGSQPASISHTPQLYAAEILYQANTARGLGLSSSIQAPLLRQKLYHRAAGKEYVAQFLAG HGLTDTGPWPEGWGHLPYDTSCRLSKTGCANSQVCRPPLPRYRYPSSPGSPTFLPPLGTR VPNPLPSTPARSVALGPGPHRLVRSGIARPSRSHFPTRSGVGWSLSPRCQTPRRRLLMPK AASRNTPCLASQLQTTRTRENYVTGDQRACDRQREKAIGERREGEGGALSLNQKARGIQI SCVRLRSALAQYKEPGVRTSRICHHGNHRSGFSDSQGAQLFPALIEASIRTISRFIF >gi568815593r:62289013_62503772|GENSCAN_predicted_CDS_5|894_bp atgggtagccagccagccagtatttctcatacaccccagctatatgctgcagaaattctg taccaggcaaacacagctagaggtctagggctctcttcttccatccaggcaccactcctg cggcagaagctctaccacagggcagcaggcaaagaatatgtggcccagttcctagcaggc cacggactgactgatactggtccatggcctgagggctggggacacctgccctatgacact tcatgcagactgtcaaaaactggttgcgcaaacagccaagtctgcaggccgcccttgcct cgctaccgttatccctcatctcctggtagccccactttcctcccgcctctcgggactcgg gtgcccaacccactcccttccactcccgcgcgctcggtggctctcgggccggggcctcac cgtttggttcggtcgggaattgcccggccctcccgctcccacttcccaacacgaagtggc gtcggttggtccctctccccacgctgccagaccccacgccgccgcctactgatgccgaaa gcggcttctaggaacacgccatgtttggcgtcgcagctccaaacgacgcggacgcgcgaa aactacgtcacaggagaccagcgcgcatgcgaccggcagagagagaaggcgataggcgaa cggcgggaaggggaaggaggggctttgtctttgaaccaaaaggcgcggggtatccaaatc agttgtgtgcgcttgcgcagtgcgcttgcgcagtataaagagccaggagtccggactagc cggatctgtcaccatggaaaccataggtctggtttctccgactcccagggagctcaattg tttcctgcgttgattgaagcttcaatcagaaccatttcacgctttatattctag >gi568815593r:62289013_62503772|GENSCAN_predicted_peptide_6|448_aa MDLNSASTVVLQVLTQATSQDTAVLKPAEEQLKQWETQPGFYSVLLNIFTNHTLDINVRW LAVLYFKHGIDRYWRRVAPHALSEEEKTTLRAGLITNFNEPINQIATQIAVLIAKVARLD CPRQWPELIPTLIESVKVQDDLRQHRALLTFYHVTKTLASKRLAADRKLFYDGSRAENLI PLQNLRGDEKLSHDSLMVQIKRRVGDGLLASGIYNFACSLWNHHTDTFLQEVSSGNEAAI LSSLERTLLSLKVLRKLTVNGFVEPHKNMEVMLLDFLDQHPFSFTPLIQRSLEFSVSYVF TEVGEGVTFERFIVQCMNLIKMIVKNYAYKPSKNFEDSSPETLEAHKIKMAFFTYPTLTE ICRRLVSHYFLLTEEELTMWEEDPEGFKMMQTLQGPTNVEDMNALLIKDAVYNAVGLAAY ELFDSVDFDQWFKNQLLPELQVIHNRQV >gi568815593r:62289013_62503772|GENSCAN_predicted_CDS_6|1347_bp atggatctcaatagtgccagcactgttgttcttcaggtgttaacacaggccaccagtcag gatactgctgtgttaaaaccagctgaggagcagttgaagcagtgggagacacagccaggt ttctattcagtgttgctgaatattttcaccaaccacactttggatataaatgtaaggtgg cttgctgtactgtattttaaacatggaattgatcgctactggagacgtgtagcacctcat gctctctcagaggaggagaaaactactctgcgtgcagggctcatcaccaacttcaatgaa ccaataaaccagattgcaactcagattgcagtgctcattgcaaaagttgctagattggat tgtcccagacagtggcctgaactaattcccactcttatagagtctgttaaagtccaggat gatcttcgacagcacagagcattacttaccttctatcatgttaccaagacactggcatct aaacgacttgctgctgatagaaaactattttatgatgggagtagggcagaaaaccttata cctcttcagaacctgagaggtgatgagaaactgtctcatgatagcctaatggtgcagata aagagacgagtaggagatggtcttttagcttctggaatttataattttgcctgctctctg tggaatcaccacacagacacattcctgcaagaagtttcttctggcaatgaagctgcaatt ttgagttcactagaacgaacactgctatcattgaaagtgctgcgtaagttaactgttaat ggatttgtggaacctcataagaatatggaggtgatgcttttggacttcttggatcagcat cctttttcatttactcctctaattcagagatcactggaattttctgtaagctatgttttt acagaagttggtgaaggcgttacatttgaacgattcattgtccaatgtatgaatcttatt aagatgattgtcaaaaattatgcttataagccatccaaaaattttgaagatagcagccct gaaactcttgaagcccataagattaagatggcattcttcacatatcctactttgacagag atatgtagaagattagtctctcattatttcctattaactgaagaagaactgacaatgtgg gaagaagacccagaaggctttaaaatgatgcaaacacttcaaggacccacaaatgtggaa gatatgaatgcactgttaatcaaagatgctgtgtataatgctgttggattagctgcttat gagctctttgacagtgttgattttgatcagtggtttaaaaaccagcttcttccagaatta caagtcattcacaataggcaagtataa >gi568815593r:62289013_62503772|GENSCAN_predicted_peptide_7|59_aa MNEKKMQVNSTSKYCYKAALRKENADSESQRGIRFIRIYILVTQQKQCSNWRKQLLQLV >gi568815593r:62289013_62503772|GENSCAN_predicted_CDS_7|180_bp atgaatgagaagaaaatgcaggtcaacagtacttccaaatactgctacaaagcagccctt aggaaagagaatgcagattctgaaagccagagaggaattcgattcatcaggatctacata cttgtgacacagcagaagcagtgcagcaactggaggaaacaacttctccagttagtatag