GENSCAN 1.0 Date run: 8-Nov-116 Time: 04:05:21 Sequence gi568815583f:74519197_74729880 : 210684 bp : 46.98% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 18564 18627 64 2 1 43 91 84 0.318 2.49 1.02 Intr + 21920 22134 215 0 2 94 111 64 0.495 7.73 1.03 Term + 22876 22959 84 0 0 42 38 95 0.564 -2.45 1.04 PlyA + 23343 23348 6 -0.45 2.00 Prom + 23833 23872 40 -3.66 2.01 Init + 24741 25292 552 2 0 95 82 356 0.839 30.65 2.02 Intr + 53666 53737 72 1 0 135 66 31 0.567 5.20 2.03 Intr + 53936 54008 73 1 1 4 84 143 0.909 4.48 2.04 Intr + 70624 70807 184 2 1 47 73 342 0.155 27.45 2.05 Intr + 71955 72238 284 0 2 121 82 254 0.997 25.16 2.06 Intr + 72364 72618 255 2 0 28 72 426 0.973 32.42 2.07 Intr + 73939 74040 102 2 0 85 94 100 0.771 10.45 2.08 Term + 76415 76578 164 0 2 70 45 150 0.437 7.00 2.09 PlyA + 78911 78916 6 1.05 3.10 PlyA - 79746 79741 6 1.05 3.09 Term - 80260 80163 98 2 2 79 46 57 0.352 -1.27 3.08 Intr - 83975 83794 182 1 2 15 105 113 0.529 5.21 3.07 Intr - 84584 84491 94 0 1 37 80 63 0.310 -0.58 3.06 Intr - 85140 85089 52 0 1 114 98 7 0.608 2.78 3.05 Intr - 89292 89018 275 1 2 54 80 112 0.731 4.26 3.04 Intr - 91556 91466 91 2 1 89 65 92 0.951 6.57 3.03 Intr - 92398 92215 184 0 1 80 59 240 0.962 20.09 3.02 Intr - 92601 92484 118 1 1 97 99 9 0.759 2.42 3.01 Init - 93689 93647 43 2 1 84 80 -9 0.419 -1.49 3.00 Prom - 98237 98198 40 -8.36 4.00 Prom + 99858 99897 40 -11.53 4.01 Init + 100001 100152 152 1 2 55 100 163 0.761 13.71 4.02 Intr + 100609 101029 421 1 1 112 109 332 0.507 31.65 4.03 Intr + 101085 101109 25 2 1 61 100 -15 0.319 -5.40 4.04 Intr + 103298 103364 67 0 1 117 99 78 0.801 9.76 4.05 Intr + 105706 105822 117 1 0 54 62 226 0.998 16.18 4.06 Intr + 106606 106772 167 1 2 95 95 168 0.999 17.80 4.07 Intr + 108156 108262 107 1 2 86 74 127 0.716 11.03 4.08 Intr + 108343 108472 130 0 1 103 97 208 0.995 23.47 4.09 Intr + 108774 108856 83 1 2 90 92 90 0.969 8.96 4.10 Intr + 109408 109487 80 0 2 94 110 79 0.999 9.05 4.11 Intr + 109746 109836 91 0 1 110 86 77 0.999 9.70 4.12 Term + 110511 110687 177 2 0 42 48 275 0.969 16.59 4.13 PlyA + 110897 110902 6 1.05 5.07 PlyA - 111384 111379 6 1.05 5.06 Term - 113750 113416 335 0 2 82 44 447 0.947 34.47 5.05 Intr - 116430 116213 218 2 2 105 105 108 0.836 12.35 5.04 Intr - 121423 121270 154 2 1 75 99 82 0.166 7.13 5.03 Intr - 136872 136537 336 1 0 110 100 476 0.923 46.49 5.02 Intr - 152578 152259 320 0 2 85 95 265 0.972 22.50 5.01 Init - 157824 157745 80 0 2 63 89 73 0.716 5.43 5.00 Prom - 196442 196403 40 -3.06 6.07 PlyA - 196948 196943 6 1.05 6.06 Term - 201578 201293 286 1 1 116 43 341 0.928 27.28 6.05 Intr - 201857 201771 87 1 0 138 100 81 0.999 13.39 6.04 Intr - 202126 202003 124 2 1 104 110 108 0.999 14.24 6.03 Intr - 202307 202218 90 0 0 111 82 64 0.996 8.07 6.02 Intr - 202521 202395 127 0 1 80 60 309 0.999 27.55 6.01 Intr - 203930 203056 875 0 2 76 94 974 0.071 87.86 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 53377 53501 125 2 2 98 67 20 0.801 1.20 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583f:74519197_74729880|GENSCAN_predicted_peptide_1|120_aa NSPGLSVVTGKIVAIIEKALHAASGFGPAPPPVAPPRPRPRPWLAVQPRGAASGAGPAPA VAPGCGGVRDAAAAPGPELPPPPPPENPECPAQHSEKMVPQDKKQLFSDWQDLVEKLFRS >gi568815583f:74519197_74729880|GENSCAN_predicted_CDS_1|363_bp aactctccagggctttcagtggtgaccggcaaaattgttgccataattgagaaggccctg catgcagcttccggcttcggtcccgccccgccccctgtggccccgccccgcccccgcccc cgcccctggctggcggtccagccccgcggcgccgcttccggtgcgggccccgccccggct gtggcccccggctgcggaggagtccgagacgcagctgccgcgccggggcctgagctgccg cctcctccgccgcccgaaaacccggagtgccccgcacagcattcagagaagatggtgcca caagacaagaagcagttgttcagtgactggcaggacttggttgagaagttattccgatcg tga >gi568815583f:74519197_74729880|GENSCAN_predicted_peptide_2|561_aa MEPLQQQQQQQQQQQKQPHLAPLQMDAREKQGQQMREAQFLYAQKLVTQPTLLSATAGRP SGSTPLGPLARVPPTAAVAQVFERGNMNSEPEEEDGGLEDEDGDDEVAEVAEKETQAASK YFHVQKVARQDPRVAPMSNLLPAPGLPPHGQQAKEDHTKDASKASPSVSTAGQPNWNLDE QLKQNGGLAWSDDADGGRGREISRDFAKLYELDGDPERKEFLDDLFVFMQKRGTPINRIP IMAKQILDLYMLYKLVTEKGGLVEIINKKIWREITKGLNLPTSITSAAFTLRTQYMKYLY AYECEKKALSSPAELQAAIDGNRREGRRPSYSSSLFGYSPAAATAAAAAGAPALLSPPKI RFPILGLGSSSGTNTSSPRISPATTLRKGDGAPVTTVPVPNRLAVPVTLASQQAGTRTAA LEQLRERLESGEPAEKKASRLSEEEQRLVQQAFQRNFFSMARQLPMKIRINGRAEDRAEA SAAALNLTTSSIGSINMSVDIDGTTYAGVLFAQKPVVHLITGSAPQSLGSSASSSSSSHC SPSPTSSRGTPSAEPSTSWSL >gi568815583f:74519197_74729880|GENSCAN_predicted_CDS_2|1686_bp atggagccacttcagcagcagcagcagcagcagcagcaacaacagaagcagccacacctg gctcctctgcagatggatgccagagagaagcagggccagcagatgagagaagcccagttc ttgtatgcccaaaagctggtcacacagccgactctcctttccgccacagctgggagacct tctggcagcactcccttaggtcccttagccagagttccacccaccgcagcagtggcccaa gtgtttgaacggggcaacatgaactcagagcctgaggaagaggacggaggtttggaagat gaggatggggatgatgaagttgcagaggtggctgagaaagaaacccaggctgcttcaaaa tattttcatgtgcagaaagtagctcgccaagatcccagagtggcacccatgtccaatcta cttccagcaccagggctcccaccacatggacaacaagctaaagaagaccataccaaagat gcttccaaggcctcaccttctgtctccacagcaggacagccgaactggaatctggatgag cagctcaagcagaatggtggtttggcctggagtgatgatgcagatggaggccggggaaga gagatctctcgagattttgccaagctgtatgaactggacggtgatcctgaaaggaaagag ttcctggatgacctcttcgtctttatgcagaagagggggacccccatcaaccgaatcccc atcatggccaaacagatcctggacctgtacatgctgtataagctggtgaccgagaaggga ggcctggtggagatcatcaacaagaagatctggagggagatcaccaaaggcctaaacctg cccacatccatcaccagcgccgccttcaccctcaggacgcagtacatgaagtatctgtat gcctatgagtgtgagaagaaagccttgagttccccagccgagctccaggcagcaattgat ggcaaccgcagggagggccggcggcccagctacagctcctccctctttggctactcacct gctgcggctactgctgctgccgctgccggggcccctgcccttctctccccacccaagatc cgctttcccatccttgggcttggctccagcagtggtaccaataccagtagccctcggata tccccagcaaccactctcaggaaaggtgatggagccccagtgacaacagtgcctgtgcca aatcgtctggctgtgcccgtgaccttggcaagccagcaggctggtactcggaccgccgca ctggagcagctgcgggagcggctggagtcaggggagcctgctgagaagaaggcatcgagg ctgtctgaggaggagcagcgcctggtgcagcaggccttccagcgcaactttttcagcatg gcacggcagctccccatgaagatcaggatcaacggcagggcagaagacagagcagaggcc tcggctgcagcactgaacctgaccacgagtagcattgggagcattaacatgtctgtggac atcgatggcaccacctatgcaggtgtgctgtttgcccagaagcctgtggtccacctcatc acggggtctgctccccagagcctcggcagcagcgccagcagcagcagcagctctcactgt tcaccaagtcctacctcatcccggggcacccccagcgcagagccctccaccagctggtcc ctctga >gi568815583f:74519197_74729880|GENSCAN_predicted_peptide_3|378_aa MESLSLLTGKTEHPGPTEAEPMDLMPSPARCLWNSYFLGALDRVCPPSPTEGCRAPERDW RRAEKDLWEQLAESRQQRAELASRLRAVPEALSEQALVLQRREHKLGLGWSLKQKHATAV LLASTLKLSAAGEQLERRCISGPAGVHKTIITEREWGGGDLGSMLYLDCTISSSTLQVLE VPVPLRKELEAQESHPRLHEEMAEYWKARWHRLAVALKFKEEELQRLQRQSGTWPPQGRP QQRPQASPRSLQMKAAFLPPIPQELWEPRSAKELESAVWTPPEANGDPGKDVEMEGDVRP ALRGSWARQYYPYGERCGTQGKRGSEGSGVQGKCSGLQSPAQLGGEVKQFLKGGVLGTVE GQDISGNSYPAAFVLSSS >gi568815583f:74519197_74729880|GENSCAN_predicted_CDS_3|1137_bp atggaaagcctgtcacttctaactggcaagacagagcacccaggtcccactgaggcagag cccatggacttgatgcccagccctgcccgctgtctctggaactcctatttcctcggggca ttggaccgtgtctgcccaccaagccccactgaagggtgcagggcccccgagcgggactgg aggcgggcagagaaggacctgtgggagcagctggcagaaagcaggcagcagagggccgag ctggcaagccgtcttcgggcagtgccggaggccctgtcggagcaagccctggtgctgcag aggcgggagcacaagctgggacttggctggtccctaaagcagaaacatgccacagcagtc ctgctggcctccaccctgaagctgtcagcagctggagagcagctggagaggcggtgtatc tcgggcccagctggagttcataagacaataatcactgaaagggagtggggtgggggtgat ctggggtccatgctgtacctggactgcaccatttcttccagtacactccaggtcctggag gtaccagtgcccctcaggaaagagctagaggcacaagaatcacatcccaggcttcatgag gagatggctgaatattggaaggcacgctggcaccggctggcagttgccctgaaatttaaa gaggaggaactgcagagactccagaggcagagtgggacctggcctccccagggcagaccg cagcagagaccccaggccagcccccggagtctgcaaatgaaagcagcgtttctgcctccc atcccccaggagctgtgggagcccaggtctgccaaggagctggagtctgcagtgtggact ccaccagaggccaatggggaccctggaaaggatgtggagatggagggcgatgtgaggcct gccctcaggggctcctgggctaggcagtactacccgtatggggagcgctgcgggactcag ggtaaaagagggagtgaaggatcaggggttcagggcaagtgttctggacttcagtcacct gcacagcttggaggggaagtgaagcagtttcttaaaggaggggtccttggtaccgtggag ggacaggacatctctgggaattcgtacccagctgcgttcgtgctgagttcttcatga >gi568815583f:74519197_74729880|GENSCAN_predicted_peptide_4|538_aa MHHCKRYRSPEPDPYLSYRWKRRRSYSREHEGRLRYPSRREPPPRRSRSRSWRMLSLDRA LEGGLSTFAPSPLCPLPTSVDAFSTGCPPLALYSGLTTGLLLANSKDPADQRPHPPFGSH DRLPYQRRYRERRDSDTYRCEERSPSFGEDYYGPSRSRHRRRSRERGPYRTRKHAHHCHK RRTRSCSSASSPSLRLAGPDEIVGNLGEGTFGKVVECLDHARGKSQVALKIIRNVGKYRE AARLEINVLKKIKEKDKENKFLCVLMSDWFNFHGHMCIAFELLGKNTFEFLKENNFQPYP LPHVRHMAYQLCHALRFLHENQLTHTDLKPENILFVNSEFETLYNEHKVLVGSCEEKSVK NTSIRVADFGSATFDHEHHTTIVATRHYRPPEVILELGWAQPCDVWSIGCILFEYYRGFT LFQTHENREHLVMMEKILGPIPSHMIHRTRKQKYFYKGGLVWDENSSDGRYVKENCKPLK SYMLQDSLEHVQLFDLMRRMLEFDPAQRITLAEALLHPFFAGLTPEERSFHTSRNPSR >gi568815583f:74519197_74729880|GENSCAN_predicted_CDS_4|1617_bp atgcatcactgtaagcgataccgctcccctgaaccagacccgtacctgagctaccgatgg aagaggaggaggtcctacagtcgggaacatgaagggagactgcgatacccgtcccgaagg gagcctcccccacgaagatctcggtccagaagctggaggatgctgtcgttggacagggct ctggagggcgggcttagtacctttgcaccctcccctctctgcccacttcccaccagtgtg gatgcttttagcacaggctgtccccctttagccctttacagtggccttaccacaggtctc ttgcttgctaacagcaaggacccagctgaccagcgtccccatcccccttttggcagccat gaccgcctgccctaccagaggaggtaccgggagcgccgtgacagcgatacataccggtgt gaagagcggagcccatcctttggagaggactactatggaccttcacgttctcgtcatcgt cggcgatcgcgggagagggggccataccggacccgcaagcatgcccaccactgccacaaa cgccgcaccaggtcttgtagcagcgcctcctcgccttctctcaggctggctggtccggat gagattgtggggaacctgggtgaaggcacctttggcaaggtggtggagtgcttggaccat gccagagggaagtctcaggttgccctgaagatcatccgcaacgtgggcaagtaccgggag gctgcccggctagaaatcaacgtgctcaaaaaaatcaaggagaaggacaaagaaaacaag ttcctgtgtgtcttgatgtctgactggttcaacttccacggtcacatgtgcatcgccttt gagctcctgggcaagaacacctttgagttcctgaaggagaataacttccagccttacccc ctaccacatgtccggcacatggcctaccagctctgccacgcccttagatttctgcatgag aatcagctgacccatacagacttgaaaccagagaacatcctgtttgtgaattctgagttt gaaaccctctacaatgagcacaaggtattggtggggagctgtgaggagaagtcagtgaag aacaccagcatccgagtggctgactttggcagtgccacatttgaccatgagcaccacacc accattgtggccacccgtcactatcgcccgcctgaggtgatccttgagctgggctgggca cagccctgtgacgtctggagcattggctgcattctctttgagtactaccggggcttcaca ctcttccagacccacgaaaaccgagagcacctggtgatgatggagaagatcctagggccc atcccatcacacatgatccaccgtaccaggaagcagaaatatttctacaaagggggccta gtttgggatgagaacagctctgacggccggtatgtgaaggagaactgcaaacctctgaag agttacatgctccaagactccctggagcacgtgcagctgtttgacctgatgaggaggatg ttagaatttgaccctgcccagcgcatcacactggccgaggccctgctgcaccccttcttt gctggcctgacccctgaggagcggtccttccacaccagccgcaacccaagcagatga >gi568815583f:74519197_74729880|GENSCAN_predicted_peptide_5|480_aa MNDGFCCSTSSSAFGVVSVLDFGHSDRAGDITELKILEIPGPGDNQHFGDLHQTELGPSG AGCQVGINQNGTGKFVKKPASSSSAPQNIPKRTDVKSQDVAVSPQQQQCSKSYVDRHMES LSQSKSFRRRHNSWSSSSRHPNQATPKKSGLKNGQMKNKDDECFGDDIEEIPDTDFDFEG NLALFDKAAVFEEIDTYERRSGTRSRGIPNERPTRYRHDENILESEPIVYRRIIVPHNVS KEFCTDSGLVVPSISYELHKKLLSVAEKHGLTLERRLEMTGVCASQMALTLLGGPNRLNP KNVHQRPTVALLCGPHVKGAQGISCGRHLANHDVQVILFLPNFVKMLESITNELSLFSKT QGQQVSSLKDLPTSPVDLVINCLDCPENVFLRDQPWYKAAVAWANQNRAPVLSIDPPVHE VEQGIDAKWSLALGLPLPLGEHAGRIYLCDIGIPQQVFQEVGINYHSPFGCKFVIPLHSA >gi568815583f:74519197_74729880|GENSCAN_predicted_CDS_5|1443_bp atgaatgacggtttttgttgctccacatcctcgtcagcatttggtgttgtcagtgttttg gatttcggccattctgacagggcaggtgacattacggagttaaaaattctggagatacca ggacctggagacaaccaacattttggagaccttcatcaaacagaattaggcccctctggt gctggctgccaagtgggcatcaatcagaatggcacaggcaagtttgtcaagaagccagcc tcttccagcagtgcccctcagaatatccctaagaggacagatgtgaagagccaggatgtt gccgtttccccgcagcagcaacagtgctcaaagagctatgtcgacaggcacatggaatcc ttgagtcagtccaaaagtttccgtcgtcggcacaactcctggtcatctagtagcaggcac ccaaatcaggcaactcccaagaaaagtggtttaaagaatggccagatgaagaataaagat gacgagtgcttcggggatgatattgaggagatcccagacacagattttgattttgaaggg aacctggctctttttgacaaggcagctgtgtttgaggagattgatacctatgaaaggaga agtggtacccgttcccggggcatcccaaatgaaaggcccactcggtaccgccatgatgag aacatcttggagtccgagcccattgtctatcgacggatcatagtgccccacaacgtgagc aaggagttctgcacggactctggcctggttgtcccaagtatttcctatgagctgcataaa aagctgttgtccgtggctgagaagcatgggctgacccttgagcggagactggagatgaca ggtgtgtgtgccagtcagatggcactgaccctcctcggaggacctaacaggttgaatccc aaaaatgttcaccagaggcctacagtggctctactgtgtggacctcatgtgaagggggct cagggtatcagctgtggaaggcacctagccaaccatgatgtccaggtcatccttttcctg cccaattttgtcaagatgttggaatctatcaccaatgagctgtcgctcttcagcaagacc caaggccaacaagtgtctagcctcaaagatctgcccactagccctgtggacctggtcatc aactgcctggattgccctgagaacgtcttcctgcgcgatcaaccctggtacaaggcagct gtggcctgggccaaccagaaccgggcaccagtactcagcatagaccctcctgtgcatgaa gtcgaacagggcattgatgccaaatggtcactggcactgggcctgcctctgccactgggg gagcacgcaggccgtatctatttgtgcgacattggcattccccagcaggtcttccaggag gtgggcatcaactaccactcgccctttggctgcaagtttgttatcccactgcactctgct tag >gi568815583f:74519197_74729880|GENSCAN_predicted_peptide_6|529_aa XATSKIPTLIMLFPISMSATEFLLASVIFCLVFWVIRASRPQVPKGLKNPPGPWGWPLIG HMLTLGKNPHLALSRMSQQYGDVLQIRIGSTPVVVLSGLDTIRQALVRQGDDFKGRPDLY TFTLISNGQSMSFSPDSGPVWAARRRLAQNGLKSFSIASDPASSTSCYLEEHVSKEAEVL ISTLQELMAGPGHFNPYRYVVVSVTNVICAICFGRRYDHNHQELLSLVNLNNNFGEVVGS GNPADFIPILRYLPNPSLNAFKDLNEKFYSFMQKMVKEHYKTFEKVQSGERQGHIRDITD SLIEHCQEKQLDENANVQLSDEKIINIVLDLFGAGFDTVTTAISWSLMYLVMNPRVQRKI QEELDTVIGRSRRPRLSDRSHLPYMEAFILETFRHSSFVPFTIPHSTTRDTSLKGFYIPK GRCVFVNQWQINHDQKLWVNPSEFLPERFLTPDGAIDKVLSEKVIIFGMGKRKCIGETIA RWEVFLFLAILLQRVEFSVPLGVKVDMTPIYGLTMKHACCEHFQMQLRS >gi568815583f:74519197_74729880|GENSCAN_predicted_CDS_6|1590_bp ncagccacctccaagatccctacactgatcatgcttttcccaatctccatgtcggccacg gagtttcttctggcctctgtcatcttctgtctggtattctgggtaatcagggcctcaaga cctcaggtccccaaaggcctgaagaatccaccagggccatggggctggcctctgattggg cacatgctgaccctgggaaagaacccgcacctggcactgtcaaggatgagccagcagtat ggggacgtgctgcagatccgaattggctccacacccgtggtggtgctgagcggcctggac accatccggcaggccctggtgcggcagggcgatgatttcaagggccggcccgacctctac accttcaccctcatcagtaatggtcagagcatgtccttcagcccagactctggaccagtg tgggctgcccgccggcgcctggcccagaatggcctgaaaagtttctccattgcctctgac ccagcctcctcaacctcctgctacctggaagagcatgtgagcaaggaggctgaggtcctg ataagcacgttgcaggagctgatggcagggcctgggcactttaacccctacaggtatgtg gtggtatcagtgaccaatgtcatctgtgccatttgctttggccggcgctatgaccacaac caccaagaactgcttagcctagtcaacctgaataataatttcggggaggtggttggctct ggaaacccagctgacttcatccctattcttcgctacctacccaacccttccctgaatgcc ttcaaggacctgaatgagaagttctacagcttcatgcagaagatggtcaaggagcactac aaaacctttgagaaggtacagtctggggaaaggcagggccacatccgggacatcacagac agcctgattgagcactgtcaggagaagcagctggatgagaacgccaatgtccagctgtca gatgagaagatcattaacatcgtcttggacctctttggagctgggtttgacacagtcaca actgctatctcctggagcctcatgtatttggtgatgaaccccagggtacagagaaagatc caagaggagctagacacagtgattggcaggtcacggcggccccggctctctgacagatcc catctgccctatatggaggccttcatcctggagaccttccgacactcttccttcgtcccc ttcaccatcccccacagcacaacaagagacacaagtttgaaaggcttttacatccccaag gggcgttgtgtctttgtaaaccagtggcagatcaaccatgaccagaagctatgggtcaac ccatctgagttcctacctgaacggtttctcacccctgatggtgctatcgacaaggtgtta agtgagaaggtgattatctttggcatgggcaagcggaagtgtatcggtgagaccattgcc cgctgggaggtctttctcttcctggctatcctgctgcaacgggtggaattcagcgtgcca ctgggcgtgaaggtggacatgacccccatctatgggctaaccatgaagcatgcctgctgt gagcacttccaaatgcagctgcgctcttag