GENSCAN 1.0 Date run: 5-Nov-116 Time: 05:02:34 Sequence gi568815597f:110791409_110999192 : 207784 bp : 37.75% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 1796 1791 6 1.05 1.03 Term - 7725 7551 175 2 1 69 40 121 0.550 1.65 1.02 Intr - 10851 10762 90 2 0 84 113 39 0.929 4.19 1.01 Init - 16336 15933 404 1 2 60 39 231 0.817 11.55 1.00 Prom - 30068 30029 40 -3.65 2.03 PlyA - 31659 31654 6 1.05 2.02 Term - 31868 31695 174 2 0 12 40 150 0.143 -0.62 2.01 Init - 51389 51306 84 2 0 60 94 63 0.183 4.97 2.00 Prom - 54728 54689 40 -3.45 3.05 PlyA - 55240 55235 6 1.05 3.04 Term - 57315 56669 647 1 2 27 38 588 0.998 41.00 3.03 Intr - 58561 58386 176 0 2 -40 4 203 0.220 -2.24 3.02 Intr - 62280 62139 142 1 1 89 105 65 0.619 6.79 3.01 Init - 63097 63040 58 1 1 80 96 26 0.610 4.12 3.00 Prom - 66767 66728 40 -3.75 4.00 Prom + 82535 82574 40 -4.05 4.01 Init + 96592 96666 75 0 0 55 94 65 0.576 4.94 4.02 Intr + 100937 101125 189 1 0 67 109 135 0.965 12.36 4.03 Intr + 102178 102323 146 0 2 54 74 43 0.265 -2.34 4.04 Intr + 107908 108091 184 1 1 130 28 134 0.095 10.47 4.05 Intr + 119777 119883 107 1 2 95 13 75 0.115 -1.21 4.06 Intr + 119969 120235 267 2 0 37 84 189 0.939 9.12 4.07 Term + 122449 122635 187 1 1 52 38 129 0.899 0.28 4.08 PlyA + 122897 122902 6 1.05 5.00 Prom + 134243 134282 40 -6.05 5.01 Init + 138307 138459 153 0 0 87 97 41 0.430 4.93 5.02 Term + 143567 143719 153 1 0 33 49 156 0.693 3.14 5.03 PlyA + 143838 143843 6 1.05 6.04 PlyA - 145065 145060 6 1.05 6.03 Term - 156991 156551 441 1 0 21 38 333 0.831 15.87 6.02 Intr - 158715 158443 273 0 0 80 96 263 0.989 23.11 6.01 Init - 161394 159880 1515 0 0 42 86 707 0.846 58.15 6.00 Prom - 164318 164279 40 -4.85 7.04 PlyA - 164666 164661 6 1.05 7.03 Term - 172161 171906 256 2 1 102 41 217 0.564 12.47 7.02 Intr - 172526 172260 267 1 0 71 -22 175 0.285 0.52 7.01 Init - 173209 173109 101 1 2 53 97 74 0.428 4.58 7.00 Prom - 186570 186531 40 -3.75 8.03 PlyA - 187069 187064 6 1.05 8.02 Term - 191602 191010 593 2 2 45 40 155 0.041 0.10 8.01 Init - 200246 200096 151 2 1 60 48 160 0.328 9.45 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 22700 22821 122 0 2 59 44 167 0.970 7.06 S.002 Sngl + 113364 113699 336 2 0 90 42 161 0.919 7.48 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:110791409_110999192|GENSCAN_predicted_peptide_1|222_aa MSEFPFTIASKRIKYLGIHLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPRSWIGRINIV KMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRAHIAKSILSQKNKAGGITLP DFKLYYKATITKTAWWRQEKGFLPRKLDDGESGCSPQTHVFVKKRYCVTYYGHKQDCAEL LEVRLPCLFNFYEYYGLDVVCPTKQENLIATFCSIHPPSYWL >gi568815597f:110791409_110999192|GENSCAN_predicted_CDS_1|669_bp atgagtgaattcccattcacaattgcttcaaagagaataaaatacctaggaatccacctt acaagggacgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaattaaa gaggatacaaacaaatggaagaacattccacgctcatggataggaagaatcaatatcgtg aaaatggccatactgcccaaggtaatttacagattcaatgccatccccatcaagctacca atgactttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaaga gcccacatcgccaagtcaatcctaagccaaaagaacaaagctggaggcatcacactacct gacttcaaactatactacaaggctacaataaccaaaacagcatggtggaggcaggaaaaa ggtttcttgccaaggaaacttgatgatggggagtctggttgttcacctcaaactcacgta tttgtgaaaaaacggtactgtgtgacatactatggacacaaacaagattgtgctgagctt ctggaagtgcgcctgccctgcctctttaacttttatgagtactacggtttggatgtggtt tgccccacaaaacaagaaaacctgatagccacattctgctctattcatccaccaagttat tggctgtag >gi568815597f:110791409_110999192|GENSCAN_predicted_peptide_2|85_aa MAIIKKIKITNAGEDMGKEELLHTVGGKVNVKEKTLKEAKEKEKVVYKGNPIGLTVKLSA ETLQARRDWGPIFSILKEKKFQPRI >gi568815597f:110791409_110999192|GENSCAN_predicted_CDS_2|258_bp atggctattatcaaaaagataaaaataacaaatgctggcgaggatatggggaaagaggaa ctcttacacactgttggtggaaaggtcaatgtgaaagaaaaaaccttaaaggaagctaaa gagaaggagaaagtcgtttacaaagggaaccccatcgggctaacagtgaaactgtcagca gaaaccttacaagccagaagagattgggggcctatattcagcatcctgaaagaaaagaaa tttcaaccaagaatttga >gi568815597f:110791409_110999192|GENSCAN_predicted_peptide_3|340_aa MNQRGEVTKEVISFIGSTEGAMSVFEENGNRGMQRKMEAGMQRGRRTKKERKNHLLYALP ACTVFICFYPEIRHVVHLTKTMSELSLRMNVGGHLSLGLISQPSFFRINLSPSVGKAIIS PDTGAQRWKRAQREERLKAQQNTDKDVAAHFQASHKPSAEDAEGQSPLSQKYSPSTEKCL PEIQGIFDRDPDTLLYLLQQKSEPEEPCIGSKAPKDDKTIIEEQATKIADLKRHVEFLVA ENKRLRKENKQLKAEKARLLKGPIEKELDVDADFVETSELWSLPPHSETATASSTWKKFA ANTGKAKDIPIPNLPPLDFPSPELPLMELSEDILKGFMNN >gi568815597f:110791409_110999192|GENSCAN_predicted_CDS_3|1023_bp atgaaccagaggggtgaagtcacaaaagaagtgatcagtttcatagggtccacagaagga gcaatgtcagtctttgaggaaaacggaaacagaggaatgcagagaaaaatggaagcagga atgcaaaggggaaggaggacaaagaaagagaggaagaatcacctactatatgcccttcct gcatgtactgtgttcatctgcttttatccagaaatacgccacgtggtgcacttgacaaaa accatgtcagagctgtctctccgcatgaacgtaggaggacacttgagtttgggcctcatt tctcaaccttcttttttccgtataaacttgagtccctctgtgggcaaagccatcatctcc cctgatactggtgctcagagatggaaaagggcccagcgtgaagaaagattgaaagcccag cagaacacagacaaggatgtagctgcccattttcaggcatctcacaaaccctctgcagag gatgcagagggccagagtcccctttctcagaagtacagcccttccacagagaaatgcctg cctgagattcaggggatctttgacagggatccagacacactactttatttacttcagcaa aagagtgagccagaagagccatgtattggaagcaaagcccccaaagatgataaaacaatt atagaggagcaggcaaccaaaattgcagatttgaagaggcatgtggaattccttgtggct gagaataaaagattaaggaaagaaaataaacaactgaaggctgaaaaggccagacttcta aaaggtccaatagaaaaggagctggatgtagatgctgattttgtagaaacgtcagagtta tggagcttgccaccacattcagaaactgctacagcctcctcaacctggaagaagtttgca gcaaacaccgggaaagccaaggacattccaatccccaatcttcctcccttggattttcca tctccagaacttcctcttatggagctctctgaggatattctgaaaggatttatgaataat taa >gi568815597f:110791409_110999192|GENSCAN_predicted_peptide_4|384_aa MKQEIRVESSRCDDKSKKLHDARKEICGCCILGFGIYLLIHNNFGVLFHNLPSLTLGNVF VIVGSIIMVVAFLGCMGSIKENKCLLMSCWDYRRKPLRPVCMGQLLNWSQEKEFFKKSLF SDSCQTNGKGWKIADMGSLSYTTREVGVGQAHPISGSKTIFHSLTAAAMSLKVVKLISEH LLDKRGKDKLDLMAQHQRGKKKPGDYYEQFYAHKLENLEEMDEFLEAYNLPRLKKSPGPE RFTAEFFQICKEEVVPILLKLLKNIEEEGLFPNSFYKASIILILKPGKDKKKKENLRPIY LMNTNTKSSTKYEQIESSSKSKSTIHNSKDMESTWMPINSGLDKENVVLIHYGIQHNHKK EQNYVLCSDMDASGVHYPKQTCLG >gi568815597f:110791409_110999192|GENSCAN_predicted_CDS_4|1155_bp atgaagcaggagataagagtggaaagtagcagatgtgatgacaaaagcaagaagttgcac gatgcaagaaaggagatctgtggctgctgcattttgggctttgggatctacctgctgatc cacaacaacttcggagtgctcttccataacctcccctccctcacgctgggcaatgtgttt gtcatcgtgggctctattatcatggtagttgccttcctgggctgcatgggctctatcaag gaaaacaagtgtctgcttatgtcgtgttgggattataggcgtaagccactgcgcccagtc tgtatgggtcaactcttaaactggagccaagagaaagaattttttaaaaagtccctcttc tcagatagttgtcagactaatggcaaaggatggaagatagcagacatggggtccctgtct tatacaaccagagaagtgggtgttggccaggcacatcccatctcaggcagcaagacaatc tttcactcactgacggcagcagccatgtctctcaaagtggtgaaactaatatctgagcat cttttagacaagagaggcaaagacaaactggatttaatggcccaacatcaaaggggaaaa aaaaaacctggagactattatgaacagttctatgcacacaaactggaaaacctagaagaa atggatgaattcttagaagcatacaacctcccaagactgaaaaaaagccctggaccagaa agattcacagctgaattcttccagatatgtaaagaagaggtggtaccaattctactaaaa ttactcaaaaatattgaggaggaggggctttttcctaactcattctacaaggccagcatc attctcatactaaaacctggcaaagataaaaagaaaaaagaaaacttaaggccaatatac ctgatgaacacaaacacaaaatcctcaacaaaatacgagcaaattgaatccagcagcaaa tcaaaaagcactattcacaatagcaaagacatggaatcaacctggatgcccatcaacagt ggactagataaagaaaatgtggtacttatacactatggaatacaacacaaccataaaaaa gaacaaaattatgtcctttgcagcgacatggatgcttctggagtccattatcctaagcag acttgcttaggataa >gi568815597f:110791409_110999192|GENSCAN_predicted_peptide_5|101_aa MDGARGHYPQQTNAGTENQIPHVLTYKWELNDEITWKQRGNNRYWGLPGEKVVQDREKHS ASLEESEEREQEPLPSNPENSGFVQDRQGSTSMSLQEPQNY >gi568815597f:110791409_110999192|GENSCAN_predicted_CDS_5|306_bp atggatggagctagaggccattatcctcagcaaactaacgcaggaacagaaaaccaaata ccacatgttctcacttataaatgggagctaaatgatgagatcacgtggaaacaaagagga aacaacagatactggggcctaccaggagagaaggtggttcaggacagagagaaacactct gcttctttggaagaaagtgaggaaagagaacaggagcctctgcctagtaatccagagaat tctggatttgtccaagaccgtcaaggcagtacctctatgagtctgcaagaaccacagaat tactga >gi568815597f:110791409_110999192|GENSCAN_predicted_peptide_6|742_aa MYQVVQTIGSDGKNLLQLLPIPKSSGNLIPLVQSSVMSDALKGNTGKPVQVTFQTQISSS STSASVQLPIFQPASSSNYFLTRTVDTSEKGRVTSVGTGNFSSSVSKVQSHGVKIDGLTM QTFAVPPSTQKDSSFIVVNTQSLPVTVKSPVLPSGHHLQIPAHAEVKSVPASSLPPSVQQ KILATATTSTSGMVEASQMPTVIYVSPVNTVKNVVTKNFQNIYPKPVTEIAKPVILNTTQ IPKNVATETQLKGGQHSQAAPVKWIFQDNLQPFTPSLVPVKSSNNVASKILKTFVDRKNL GDNTINMPPLSTIDPSGTRSKNMPIKDNALVMFNGKVYLLAKKGTDVLPSQIDQQNSVSP DTPVRKDTLQTVSSSPVTEISREVVNIVLAKSKSSQMETKSLSNTQLASMANLRAEKNKV EKPSPSTTNPHMNQSSNYLKQSKTLFTNPIFPVGFSTGHNAPRKVTAVIYARKGSVLQSI EKISSSVDATTVTSQQCVFRDQEPKIHNEMASTSDKGAQGRNDKKDSQGRSNKALHLKSD AEFKKIFGLTKDLRVCLTRIPDHLTSGEGFDSFSSLVKSGTYKETEFMVKEGERKQQNFD KKRKAKTNKKMDHIKKRKTENAYNAIINGEANVTGSQLLSSILPTSDVSQHNILTSHSKT RQEKRTEMEYYTHEKQEKGTLNSNAAYEQSHFFNKNYTEDIFPVTPPELEETIRDEKIRR LKQVLREKEAALEEMRKKMHQK >gi568815597f:110791409_110999192|GENSCAN_predicted_CDS_6|2229_bp atgtaccaagtagttcagacgattggctcggatggaaaaaatcttctgcaattacttcca attcctaagtcttctggaaatcttataccactagttcaatcttcagtcatgtctgatgct ttgaaagggaatacaggaaaaccagttcaagttacttttcagactcagatttccagctct tccacaagtgcatcagttcaattgcccatttttcagccagccagttcttcaaactatttt cttacaagaacagtagatacatcagaaaaaggtagagttacttctgtgggaactggaaat ttttcttcatcagtttctaaagttcagagtcatggtgtgaaaattgatggactcaccatg caaacatttgctgttcctccctcaacacaaaaagactcatcttttattgtagttaatacc cagagtcttccagtgactgtgaagtctccagttttgccttctgggcatcatttacagatt ccagcccatgctgaagtgaaatctgtaccagcgtcatcattgcctccttcagtgcagcaa aagatacttgcaactgccaccaccagtacctcaggaatggttgaggcctcccaaatgcca accgttatttatgtatctcctgtaaatacagtgaaaaatgtagttaccaagaactttcaa aacatttacccaaaacctgttacagaaatagcaaagccagtaatactaaataccacacaa attccaaagaatgttgctacagagacacaattgaaaggtggtcagcattctcaagctgct ccagtgaaatggattttccaagataatctacagccttttacgccatctcttgttcctgtt aagtcttcaaataatgtggcttcaaagattttaaaaacttttgtagataggaaaaatttg ggagataatactataaatatgccaccattgagtaccatcgatcctagtgggacgcgatcc aaaaatatgcctattaaagataatgctttggttatgtttaatgggaaagtctatctgttg gctaaaaaggggacagatgttctgccatcacaaattgaccaacagaattctgtttctcct gatactccagtaagaaaagacacgttacagacagtgagttcaagtccagtcacagaaata tccagagaggttgtaaatattgttttggctaaaagtaaatcttcccagatggagacaaaa tcactttccaatacccagcttgcttccatggccaatctaagggcagagaagaataaagtg gagaaaccatctccttctaccacaaatccacatatgaaccaatccagtaactacttaaaa cagagtaagactttattcacaaatccaatctttccagttggatttagtacaggacacaat gcccccagaaaagtaacagccgtcatttatgctagaaaaggaagtgtcctccagagcata gagaaaataagttcctctgttgatgcaacaactgttacttcacaacagtgtgttttcaga gaccaagaaccaaagatccataatgagatggcatcaacatcagataaaggtgcccaagga agaaatgacaagaaagattctcaaggaagaagtaataaggcattacatctgaagagtgat gctgaatttaaaaagatatttggccttactaaggatttgagagtgtgccttactcgaatt cctgaccatttgacctctggagaaggtttcgattcctttagcagtttggtaaagagtggt acttacaaagagacagagtttatggtgaaggaaggagagagaaaacagcagaattttgat aagaaaagaaaagcaaaaactaataagaagatggatcacataaagaagagaaaaacagag aatgcttataacgcaatcataaatggggaagctaatgtcaccggttcccaactcctaagc agtattttaccaacttcagatgtgtcacaacataacattctcacgagtcacagcaaaacc agacaagaaaagagaactgagatggaatactatacccatgagaagcaagagaaaggcact ttgaattcaaatgcagcttatgaacaaagtcatttcttcaataaaaattataccgaagat attttcccagtgacaccaccggagttagaagaaaccattcgagatgaaaaaataagaaga cttaagcaggtgctgagagagaaagaagcagctcttgaagaaatgcgtaagaagatgcac caaaaataa >gi568815597f:110791409_110999192|GENSCAN_predicted_peptide_7|207_aa MDVLMQLPDLPSEIKVYMTISDSFDERRLEERVKAVRAQSTARSLPVSARCATSSRSLGI HPLESPLSPSVKVEPLTVAVGNSSPTFPRSGSRPIGKLGPTLPTFLDEVSPFSPLKCQIT YGGDVESAVVSGRCVVERSWSPQSAPPRREALVSAALKAHRALSSGKCKTSLEPDTAAPR DPSVSSLIGGPSRSPREMKTKMADALM >gi568815597f:110791409_110999192|GENSCAN_predicted_CDS_7|624_bp atggatgttctgatgcaactcccagatctcccttcagaaataaaggtctatatgacaata agtgatagtttcgatgaacgtaggctggaggaaagagttaaggctgtgagggcgcagtcc accgccaggagccttccggtttctgcgcggtgcgcgacctcgtcccgaagcctggggata caccctctcgagagcccgctgtcgccctccgttaaggtcgaacccctcacagttgctgtg ggcaactccagcccaacattccctcgctctggttctcgccccattgggaaactcggcccc acgcttcccacttttctggatgaggtgtcccctttctccccactaaaatgtcaaataacc tacggaggggatgtggagtcggccgtcgtttccggacgctgtgtagttgagaggagttgg agcccgcagagtgcgccacccaggcgggaagcgcttgtctccgcggcgcttaaagcccat agagcgctttcctctgggaaatgtaaaacctctttagagcctgacacagcggctccgcgg gaccccagcgtctcaagtttaataggcggaccgagtagatctccacgggagatgaaaacc aagatggccgacgctcttatgtga >gi568815597f:110791409_110999192|GENSCAN_predicted_peptide_8|247_aa MLNITNDQGNANQNHSVIPPYSCKNDHNQNIKKAVDVGVDAVIREHLYPADTCPLNNSHT QCSLLIPLLPMADPNILQPYIAPYEYEPWEDYFLEIVTCLTLDPALLTPACETMSSKPYY CKAALECIKPLWDAKHFLLNATQGRIPPSMSSVSCLATYPPDFLFLCALLMISCDFCLPA TPTMAETPMHPSTFEAGAQEPIPTDTTPINTSTSCLPHTYVTTATPTWYRPRSPTHMYRL PKVQLWP >gi568815597f:110791409_110999192|GENSCAN_predicted_CDS_8|744_bp atgctcaacatcactaatgatcagggaaatgcaaatcaaaaccacagtgtgataccacct tattcctgcaagaatgaccataatcaaaatatcaaaaaagcagtagatgttggtgtggac gcagtgatcagggaacacctgtaccctgctgacacttgccctttaaacaactcccacact cagtgctcactcttgattcccctgctacccatggctgatccaaatatactgcaaccttat attgccccatatgaatatgaaccctgggaagactatttccttgagattgttacttgtttg accctggatcctgccctgctgacacctgcctgtgaaactatgagttcaaagccctactat tgcaaagcagccctagagtgcattaagcctctgtgggatgcaaagcatttcctcttgaat gccactcagggtagaattcctcctagtatgtcttctgtgtcatgcctagccacttaccct ccagattttctattcctatgtgcacttctaatgatttcctgcgatttctgtttgcctgca acaccaaccatggcagaaacacccatgcaccccagcacttttgaggctggtgcccaggaa cctatacccacagataccacacccataaatacctctacttcctgtcttcctcacacttat gtcacaaccgctacaccaacctggtacagaccaagatctccaacacatatgtacaggctg ccaaaggttcagctatggccataa