GENSCAN 1.0 Date run: 6-Nov-116 Time: 21:12:33 Sequence gi568815595f:157336934_157542976 : 206043 bp : 38.61% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 3953 4115 163 1 1 69 71 54 0.313 0.43 1.02 Term + 7226 8010 785 0 2 60 38 330 0.269 17.71 1.03 PlyA + 8252 8257 6 1.05 2.07 PlyA - 9858 9853 6 1.05 2.06 Term - 20487 20378 110 1 2 73 48 33 0.207 -4.51 2.05 Intr - 26828 26431 398 1 2 93 115 296 0.985 26.10 2.04 Intr - 27579 27370 210 2 0 76 75 171 0.999 11.71 2.03 Intr - 44443 44223 221 1 2 68 99 196 0.015 14.88 2.02 Intr - 54194 53904 291 2 0 88 25 216 0.724 11.71 2.01 Init - 54599 54591 9 2 0 107 55 26 0.851 0.34 2.00 Prom - 56021 55982 40 -4.35 3.08 PlyA - 56610 56605 6 1.05 3.07 Term - 58757 58153 605 0 2 -54 43 457 0.104 20.69 3.06 Intr - 68145 68063 83 2 2 89 83 75 0.126 5.46 3.05 Intr - 69766 69587 180 0 0 90 42 67 0.100 0.36 3.04 Intr - 75018 74873 146 0 2 53 30 81 0.206 -3.04 3.03 Intr - 77157 76948 210 0 0 140 100 140 0.953 18.59 3.02 Intr - 78780 78710 71 1 2 6 74 107 0.787 -0.92 3.01 Init - 81117 81027 91 0 1 83 27 100 0.776 4.10 3.00 Prom - 83042 83003 40 -5.65 4.00 Prom + 85489 85528 40 -4.65 4.01 Sngl + 88175 88594 420 1 0 86 34 237 0.430 14.15 4.02 PlyA + 89747 89752 6 1.05 5.00 Prom + 94207 94246 40 -3.95 5.01 Init + 100001 100130 130 1 1 67 103 103 0.955 8.40 5.02 Intr + 100580 100981 402 0 0 85 110 361 0.989 31.47 5.03 Term + 105433 106046 614 2 2 103 36 377 0.951 27.65 5.04 PlyA + 106202 106207 6 1.05 6.00 Prom + 106974 107013 40 -6.25 6.01 Init + 110980 111070 91 0 1 41 46 200 0.665 11.80 6.02 Term + 113814 113908 95 1 2 88 38 47 0.423 -3.29 6.03 PlyA + 114514 114519 6 1.05 7.05 PlyA - 115469 115464 6 1.05 7.04 Term - 118805 118711 95 0 2 60 32 70 0.077 -4.39 7.03 Intr - 123422 123248 175 2 1 69 86 114 0.637 7.89 7.02 Intr - 133596 133381 216 0 0 77 91 158 0.974 12.78 7.01 Init - 158416 158279 138 1 0 84 89 111 0.640 11.00 7.00 Prom - 164250 164211 40 -5.15 8.00 Prom + 173980 174019 40 -3.65 8.01 Init + 175932 175982 51 2 0 70 103 54 0.897 6.31 8.02 Term + 186494 186598 105 1 0 43 46 115 0.650 0.23 8.03 PlyA + 188664 188669 6 1.05 9.00 Prom + 190317 190356 40 -3.45 9.01 Init + 193645 193694 50 0 2 66 40 93 0.168 2.77 9.02 Term + 202633 202810 178 1 1 86 54 72 0.037 -0.22 9.03 PlyA + 203996 204001 6 1.05 10.03 PlyA - 204904 204899 6 1.05 10.02 Term - 205502 205266 237 2 0 56 49 222 0.986 10.28 10.01 Intr - 205949 205730 220 1 1 72 49 91 0.527 0.98 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 44395 44223 173 1 2 119 99 186 0.895 21.76 S.002 Term + 162342 162639 298 0 1 114 44 217 0.949 13.65 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:157336934_157542976|GENSCAN_predicted_peptide_1|315_aa PLLLIPRQTGSGVDLQQTPTDLQLRVLTVRRKTNKQEGHPHQNPVCTSPSSKTKVLEVLA RAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIISAQNLLKLISNFSKVSAYKINVQ KSQAFLYTNNRQTESQIMSKLPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDT NKWNIPCSWIGRINIMKMAILPKVIYRSSAIPIKLPMTFFTELEKTTLNFIWNQKRARIA KTVLSQNNKAGGITLPDFKQYYTATITKTSWYWYQNRDIDQRKRTEPSEIMPHIYNHLIF HKPDKNQKWGKDSPI >gi568815595f:157336934_157542976|GENSCAN_predicted_CDS_1|948_bp cctctgctgctgatacctagacaaacagggtctggggtggacctccagcaaactccaaca gacctgcagctgagagtcctgactgttagaaggaaaactaacaaacaggaaggacatcca caccaaaaccccgtctgtacatcaccatcatcaaagaccaaagtgttggaagttctggcc agggcaatcaggcaggagaaagaaataaagggtattcaattaggaaaagaggaagtgaaa ttgtccctgtttgcagatgacatgattgtatatttagaaaaccccatcatctcagcccaa aatctccttaagctgataagcaacttcagcaaagtctcagcatacaaaatcaatgtgcaa aaatcacaagcattcttatacaccaataacagacaaacagagagccaaatcatgagtaaa ctcccattcacaattgcttcaaagagaataaaatacctaggaatccaacttacaagggat gtgaaggacctcttcaaggagaactacaaaccactgctcaacgaaataaaagaggataca aacaaatggaacattccatgctcatggataggaagaatcaatatcatgaaaatggccata ctgcccaaggtaatttatagatccagtgccatccccatcaagctaccaatgactttcttc acagaattggaaaaaactactttaaatttcatatggaaccaaaaaagagcccgcattgcc aagacagtcctaagccaaaataacaaagctggaggcatcacgctacctgacttcaaacaa tactacacggctacaataaccaaaacatcatggtactggtaccaaaacagagatatcgac caacggaaaagaacagaaccctcagaaataatgccacacatctacaaccatctgatcttt cacaaacctgacaaaaaccagaaatggggaaaggattcccctatttaa >gi568815595f:157336934_157542976|GENSCAN_predicted_peptide_2|412_aa MPLVRQAWNGKGCVSDKVQDLAAASQVHQLLWQAAPGISRVASSMQGCGWTRHASSSFRF GRWHLDKRNVVVPKNLEMPATMEPRGDVTACHSSGSGSPEERARSCLTYLVSQLANMEHS FHHILLLEIKSITDTFSSILGPQSRDIFRMSNSFTAIAKLLTRQLENTKAGSGRRKISTE IEFPEKLEETKLIVTENEDHEKLQVKIQAFEDKINAGSNTPGSIRRYSLGQVSKEERKNI RFNRSKSLAFHTMLTKGVGSDDGEDENRGDIPASISLSEIDPLGQGNDKLPFKTDTERSQ LGESSVSYPNIIHIDSENLSETVKENSQEETPETTASPIEYQDKLYLHLKKNLSKVKAYA MEIGKKIPVPDQCTIEDEMNIHLTRKQCKIKLVEHMHTLRTFCMEIQEQLSD >gi568815595f:157336934_157542976|GENSCAN_predicted_CDS_2|1239_bp atgccgctggtgagacaggcgtggaacggcaaggggtgtgtgagtgacaaagtacaagat ctggccgcagcaagccaggtgcaccagctgctgtggcaggcagctccaggcatcagcaga gttgcaagctccatgcaaggctgtggctggaccaggcatgcctcaagcagcttccgcttt gggcgctggcatctggacaagaggaatgtggtggtgcccaaaaacttggagatgccagca accatggagccccgaggggatgttacagcgtgtcacagctctggctcagggagtcctgag gagagagccaggagctgcctgacatacctggtgagccaactggccaacatggagcattcg tttcaccatattctcctgctggagattaaaagcatcaccgacaccttctcctcaatcttg ggccctcagagcagagacatcttccgcatgagcaacagcttcaccgccattgctaaactc cttacccgacaactggaaaataccaaggctggaagtggcaggagaaaaatcagcactgaa attgaattccctgagaaactggaagaaaccaagctcatagtaactgaaaatgaagaccat gaaaaactccaagttaaaatccaggcttttgaagacaagataaatgcagggagcaatacc cctggctctatcagaagatatagtctgggccaagtttctaaagaagaaagaaaaaacatt agatttaacaggtcaaaaagtttggctttccacactatgctcacaaagggtgtgggttca gatgacggcgaagatgaaaacaggggagacataccagccagcatctctctttcagaaata gacccacttggccaaggaaatgacaagctgccgtttaagacagacactgagagatcacag ctgggggagtcttcagtttcatacccaaatattatacatatagactcagagaatttgtca gaaactgttaaagaaaactcccaggaagaaactccagagacaactgcaagtcctatagaa taccaagataagctctacttgcacttaaaaaaaaacctcagcaaagtgaaagcatatgcc atggaaattggaaagaagattccagtccctgatcagtgtaccattgaagatgagatgaat attcatctgactagaaaacagtgcaaaataaaattggttgagcacatgcacactcttagg acattttgtatggagatccaggagcagttatctgactga >gi568815595f:157336934_157542976|GENSCAN_predicted_peptide_3|461_aa MPTVDPIFLRGGLEELAYVDWGVHMQGKARVGTGLREGGPSGQETECGKMDRSQVVQKCI PFLIGHLKDSTHNDIILNILIEIAVYEPVALNSFLPMLKEIGERFPYLTGQMARIYGAVG HVDEASPPTLRITNQHEIWVATQIQTISKANLKENSSFGDTCMTLSGYQEKERPIARICE ESNSMPLSAATAIVCVKSCALCLVFLTSWNMLAKHQGVFITSSLGSESWTLFKGGRSDFP TLVSVREVGACLKKDRKPPLKIFGKRYVDGPEWPKTEDICVPCVHSAKDDLGEEDFNNQV DMMTHSVDTIQALSPTTPVITQWTYEHSCHGVREGGYSWAQQHGRPLTKADLATAIADCP VCQQQRPAMSPLYGTVPWGVQPATCGRLITLDHFHYGRDRICSYWNILTQDMDLPFLHAI LLPRVPSMDLQNCLVDCHGITHSIAFDQGTHSTDRNAAMHS >gi568815595f:157336934_157542976|GENSCAN_predicted_CDS_3|1386_bp atgccgacagttgaccccatattcttgagaggtggcctagaagagcttgcatatgtggac tggggtgttcacatgcagggcaaagccagggtgggaactggactgcgggaaggtggacca agtgggcaagagactgaatgtgggaagatggataggagtcaggtagttcagaagtgtatt cctttcctaattgggcatttgaaggattcaacccataatgacatcatcctaaacatcctc atagagatagcagtctatgagccagtggctttgaacagttttcttccaatgctgaaagag attggtgagagattcccctacctcactggacagatggcaaggatttatggagctgttggg catgtggatgaagcctcacctccaacattgaggattacaaatcaacatgagatttgggtg gcaacacagatccaaaccatatcaaaggctaacctgaaagagaacagttcatttggtgac acttgtatgactctcagtgggtaccaggagaaggagagaccaatagcccgcatctgtgaa gaatcaaatagcatgcccctgagtgcagccactgccatcgtgtgtgttaaatcctgtgcc ctctgccttgtttttctaactagctggaatatgctggccaagcaccagggcgtgttcatt acatcctctctgggctcagagtcatggacacttttcaaaggaggacgatctgactttcct acactggtgtctgtgagggaagtaggtgcctgcctcaagaaagacaggaagccaccactt aaaatttttgggaagagatatgtggatggacctgagtggccaaaaactgaagatatttgt gtcccctgtgtacactcagcaaaggatgaccttggagaggaggattttaataatcaagtg gatatgatgacccactctgttgacaccattcaggctctttccccaaccactcctgtcatc acacaatggacttatgaacacagttgccatggtgtcagagaaggaggttattcatgggct cagcaacatggacgtccactcacaaaggctgatctggctactgccatcgctgactgccct gtctgccagcagcagagaccagcaatgagccctctatatggcactgttccttggggtgtt cagccagctacctgtggcaggttgattacattggaccacttccattatggaagggacagg atttgttcttactggaatattcttactcaggatatggatttgcctttcctacatgcaatt cttctgccaagagtaccatccatggacttgcagaactgccttgtcgactgccatggtatt acacatagcattgcttttgaccaaggaactcactctacagacagaaatgcggcaatgcat tcgtga >gi568815595f:157336934_157542976|GENSCAN_predicted_peptide_4|139_aa MTERGQSTARAMTSEGVNPKPLQLPCGVDPVGAQKSRIEVWEPLPRFQRLYGNAWMSRQK FAAGVGLSWRTSPKAVQKGNVGSEPPHRVPTGAPACESSQEGGCTLKSHRGGAAQDHGNP CISVTRMRDMESMEIISEL >gi568815595f:157336934_157542976|GENSCAN_predicted_CDS_4|420_bp atgactgaaaggggccaaagtacagctcgggccatgacttcagaaggtgtaaaccccaag cctttacaacttccatgtggtgttgatcctgtgggtgcacaaaagtcaagaattgaggtt tgggaacctctgcctagatttcagaggctgtatggaaatgcctggatgtccagacagaag tttgctgcaggggtagggctttcatggagaacctctcctaaggcagtgcagaagggaaac gtggggtcagagcctccacacagagtccctactggggcaccagcctgtgaaagcagccag gagggaggctgtaccttgaaaagtcacaggggtggagctgcccaagaccatgggaaccct tgcatcagtgtgactcggatgcgagacatggagtcaatggagatcatttcggagctttaa >gi568815595f:157336934_157542976|GENSCAN_predicted_peptide_5|381_aa MHLLAILFCALWSAVLAENSDDYDLMYVNLDNEIDNGLHPTEDPTPCACGQEHSEWDKLF IMLENSQMRERMLLQATDDVLRGELQRLREELGRLAESLARPCAPGAPAEARLTSALDEL LQATRDAGRRLARMEGAEAQRPEEAGRALAAVLEELRQTRADLHAVQGWAARSWLPAGCE TAILFPMRSKKIFGSVHPVRPMRLESFSACIWVKATDVLNKTILFSYGTKRNPYEIQLYL SYQSIVFVVGGEENKLVAEAMVSLGRWTHLCGTWNSEEGLTSLWVNGELAATTVEMATGH IVPEGGILQIGQEKNGCCVGGGFDETLAFSGRLTGFNIWDSVLSNEEIRETGGAESCHIR GNIVGWGVTEIQPHGGAQYVS >gi568815595f:157336934_157542976|GENSCAN_predicted_CDS_5|1146_bp atgcatctccttgcgattctgttttgtgctctctggtctgcagtgttggccgagaactcg gatgattatgatctcatgtatgtgaatttggacaacgaaatagacaatggactccatccc actgaggaccccacgccgtgcgcctgcggtcaggagcactcggaatgggacaagctcttc atcatgctggagaactcgcagatgagagagcgcatgctgctgcaagccacggacgacgtc ctgcggggcgagctgcagaggctgcgggaggagctgggccggctcgcggaaagcctggcg aggccgtgcgcgccgggggctcccgcagaggccaggctgaccagtgctctggacgagctg ctgcaggcgacccgcgacgcgggccgcaggctggcgcgtatggagggcgcggaggcgcag cgcccagaggaggcggggcgcgccctggccgcggtgctagaggagctgcggcagacgcga gccgacctgcacgcggtgcagggctgggctgcccggagctggctgccggcaggttgtgaa acagctattttattcccaatgcgttccaagaagatttttggaagcgtgcatccagtgaga ccaatgaggcttgagtcttttagtgcctgcatttgggtcaaagccacagatgtattaaac aaaaccatcctgttttcctatggcacaaagaggaatccatatgaaatccagctgtatctc agctaccaatccatagtgtttgtggtgggtggagaggagaacaaactggttgctgaagcc atggtttccctgggaaggtggacccacctgtgcggcacctggaattcagaggaagggctc acatccttgtgggtaaatggtgaactggcggctaccactgttgagatggccacaggtcac attgttcctgagggaggaatcctgcagattggccaagaaaagaatggctgctgtgtgggt ggtggctttgatgaaacattagccttctctgggagactcacaggcttcaatatctgggat agtgttcttagcaatgaagagataagagagaccggaggagcagagtcttgtcacatccgg gggaatattgttgggtggggagtcacagagatccagccacatggaggagctcagtatgtt tcataa >gi568815595f:157336934_157542976|GENSCAN_predicted_peptide_6|61_aa MDLVHSRYSIGDDKNNQNDDDDDDDDDRNHVFCSVELYRILSRPAFVGETQLGHYAFIKW A >gi568815595f:157336934_157542976|GENSCAN_predicted_CDS_6|186_bp atggacctggtacacagcagatactcaattggtgatgacaaaaacaaccagaacgatgat gatgatgatgatgatgatgatagaaatcatgtcttttgcagtgtggaattatatcgaata ttaagccgtcctgcatttgtgggagaaactcaactaggtcattacgcatttataaagtgg gcttaa >gi568815595f:157336934_157542976|GENSCAN_predicted_peptide_7|207_aa MHQLFRLVLGQKDLSRAGDLFSLDDSEIEDSLTEALEQIKIISSSSDYQTNNNDQAVVEI CITRITTAIRETESIEKHAKALVGLWDSCLEHNLRPFGKDEDTPHAKIASDIMSCILQNY NRPPVMALAIPIAVKFLHRGNKELCRNMSNYLSLAAITKADLLADHTEVIVKSILQGTWM EPEAIILSKLTQEQETKYRMLSLISGS >gi568815595f:157336934_157542976|GENSCAN_predicted_CDS_7|624_bp atgcatcaactgttcagactggttttgggacaaaaagatctttcacgagctggggacctc ttctccttagatgactctgagattgaagacagccttacagaagctttggagcaaattaag ataattagctcatcttcagattaccaaaccaataacaatgaccaggcagtagttgaaatc tgtatcacaagaatcacaacagccatcagagagaccgagtccattgaaaagcatgcaaag gcccttgtggggctctgggactcctgcttggaacataacctgagaccctttgggaaagac gaagacactcctcatgcaaaaatcgcatctgatatcatgagttgcattttacagaattac aaccgacccccagtgatggcattagccatccccattgcagtgaaattcctccacagaggc aacaaggaactgtgcaggaatatgtctaactacctgtctctggctgcaattaccaaggca gatctcctggctgatcacacggaagttatagtaaagagcatactccaaggaacttggatg gagccagaggctattatccttagcaagctaacacaggaacaggaaaccaaataccgcatg ttgtcacttataagtgggagctaa >gi568815595f:157336934_157542976|GENSCAN_predicted_peptide_8|51_aa MDQSSVTHSMASTTLIMLPNVYVCTGVPKELLELLAVHILLSQLLFRSGKE >gi568815595f:157336934_157542976|GENSCAN_predicted_CDS_8|156_bp atggaccagtcctcggtgacccacagtatggcatctacaacactgataatgttgccaaat gtttatgtatgcactggtgtccccaaagaattgctggaacttctggcagtgcatatcctg ctgtcacagctcctttttcgttctggaaaagaatga >gi568815595f:157336934_157542976|GENSCAN_predicted_peptide_9|75_aa MVTQEGSKEQDPEKSAGKETNGGIFFSGRKHMITPDVQNDWKFLYLLGSCQFEILGAIVV MRFLPGIISALKSED >gi568815595f:157336934_157542976|GENSCAN_predicted_CDS_9|228_bp atggtcactcaagaaggctccaaggaacaggatccagagaagagtgctggaaaagagaca aatggtggcatattcttctcaggaagaaaacacatgattacacctgatgtccagaatgac tggaagttcctctacctgctggggagctgccaatttgaaattcttggtgcaattgtagtg atgagatttttacctgggattatttcagcacttaagagtgaggattga >gi568815595f:157336934_157542976|GENSCAN_predicted_peptide_10|152_aa XCGEEGKPGQRMARQRDQTVTPAGDWTITWPRTGPPPLQAAFPLCSHPAAVKALPLTHGL HTHSCSTPNSHTLHGRKKEKDFADVIKDLGIGDFPALAVGPNAITSILRKKGRGPFHASA YKRRRQCEQEAETEVMWPQAKGCQQPPEAGRG >gi568815595f:157336934_157542976|GENSCAN_predicted_CDS_10|459_bp nnatgcggagaagagggcaagccagggcagaggatggcacgacaacgggaccaaacagtg acccctgcaggtgactggacgataacttggcctaggaccggccccccgccactgcaagct gcgttccctctgtgctcacaccctgcagcagtcaaagcactcccactcacacacgggctg cacacgcatagttgcagtacaccgaactcacacacactacacggaaggaaaaaagaaaaa gactttgcagatgtgattaaagatttggggataggagattttccggcgttagcagtagga cccaatgcaatcacaagcatccttagaaagaaaggcagagggcccttccatgcttccgca tataagaggagaaggcaatgtgagcaggaagcagagactgaagtgatgtggccacaagcc aagggatgccagcagccaccagaagctggaagaggctag