GENSCAN 1.0 Date run: 4-Nov-116 Time: 17:05:59 Sequence gi568815594r:88162598_88378583 : 215986 bp : 41.42% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 982 1038 57 0 0 86 57 96 0.452 7.66 1.02 Intr + 13909 13984 76 0 1 89 58 41 0.626 -0.63 1.03 Intr + 15132 15283 152 1 2 106 76 88 0.563 8.36 1.04 Intr + 41296 41391 96 0 0 79 88 49 0.100 3.29 1.05 Intr + 44286 44360 75 2 0 46 91 52 0.142 0.09 1.06 Intr + 50753 50800 48 1 0 59 100 46 0.042 0.86 1.07 Term + 58807 59301 495 0 0 -37 42 381 0.011 14.58 1.08 PlyA + 62705 62710 6 1.05 2.04 PlyA - 62934 62929 6 1.05 2.03 Term - 66743 66502 242 0 2 56 49 125 0.065 0.60 2.02 Intr - 82473 82244 230 2 2 76 84 91 0.428 4.09 2.01 Init - 82783 82572 212 1 2 77 100 106 0.729 9.07 2.00 Prom - 90228 90189 40 -6.65 3.10 PlyA - 91400 91395 6 -0.45 3.09 Term - 92785 92631 155 2 2 38 44 169 0.494 4.60 3.08 Intr - 100028 99894 135 0 0 29 34 140 0.262 2.32 3.07 Intr - 101537 101435 103 2 1 54 90 34 0.585 -0.97 3.06 Intr - 102538 102404 135 1 0 37 60 184 0.732 10.34 3.05 Intr - 104892 103999 894 0 0 78 0 613 0.443 42.07 3.04 Intr - 105737 105593 145 1 1 72 116 86 0.784 9.16 3.03 Intr - 106309 106144 166 2 1 41 74 129 0.536 4.90 3.02 Intr - 108661 108498 164 0 2 123 72 95 0.834 10.10 3.01 Init - 115986 115547 440 0 2 20 39 184 0.676 2.33 3.00 Prom - 116314 116275 40 -7.35 4.00 Prom + 118619 118658 40 -4.35 4.01 Init + 120662 120677 16 1 1 45 58 8 0.270 -7.78 4.02 Term + 120849 121168 320 1 2 47 48 372 0.976 23.16 4.03 PlyA + 121749 121754 6 1.05 5.00 Prom + 122743 122782 40 -7.45 5.01 Init + 127562 127627 66 1 0 60 102 -3 0.677 -0.58 5.02 Intr + 128462 128575 114 1 0 48 28 121 0.239 1.82 5.03 Term + 128980 129102 123 0 0 96 44 139 0.536 7.70 5.04 PlyA + 129590 129595 6 1.05 6.00 Prom + 130378 130417 40 -7.45 6.01 Init + 133282 133609 328 0 1 95 60 152 0.622 10.69 6.02 Intr + 134736 134880 145 1 1 34 110 110 0.561 6.22 6.03 Intr + 142626 142693 68 0 2 45 98 -1 0.013 -5.77 6.04 Intr + 149608 149772 165 2 0 43 44 117 0.307 1.81 6.05 Intr + 150488 150625 138 0 0 59 -6 137 0.295 0.91 6.06 Term + 150742 150953 212 2 2 56 55 121 0.704 2.07 6.07 PlyA + 151336 151341 6 1.05 7.00 Prom + 151756 151795 40 -6.05 7.01 Init + 156772 156828 57 0 0 91 92 100 0.957 12.06 7.02 Term + 158170 158313 144 0 0 71 49 128 0.949 4.13 7.03 PlyA + 161344 161349 6 1.05 8.09 PlyA - 162491 162486 6 1.05 8.08 Term - 165956 165795 162 2 0 82 48 85 0.534 0.85 8.07 Intr - 182843 181927 917 0 2 57 103 257 0.008 14.15 8.06 Intr - 185291 185193 99 0 0 73 69 61 0.020 1.86 8.05 Intr - 186219 186038 182 2 2 0 97 140 0.022 4.69 8.04 Intr - 197038 196920 119 1 2 62 75 24 0.005 -3.16 8.03 Intr - 198609 198445 165 0 0 97 81 122 0.613 11.64 8.02 Intr - 199217 198983 235 1 1 83 105 81 0.457 6.07 8.01 Init - 215198 215155 44 2 2 69 71 45 0.126 0.84 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594r:88162598_88378583|GENSCAN_predicted_peptide_1|332_aa MAPAKKSGEKKKGYSAINEGLTVSLRLECDGPVTAHYSPNLLDSVATYHGERNLCVGGGR AQRLWDFALELSAALSQWKEHRAESMLMKTAFRPEITLKYLLSTFVPENLKATAGPKVCQ LQTLKEKLTEKFPKSPPDPEAQPASSLTTTAQSFSDDNLMSYLTEKTEKRLWNWVTGRDW NSLEGSEEDRKMWESLEFPRDLDGLEDRRMWESLELPGDFLNGFDQNADGDMDNKVQAEV VSDGDEELLGNWSKGHSCYATRLVAFCPCPRDLWNFELERDDLGYLVEEIFKQQSIQEEA EHKSLENLQPDDAIEKKTLILGRNSSTLKKFA >gi568815594r:88162598_88378583|GENSCAN_predicted_CDS_1|999_bp atggctcctgcaaagaaaagcggcgagaagaagaagggctattctgccataaacgagggt ctcactgtttcactcaggttggagtgtgatggcccagttacagcacactacagccccaac ctcctggactcagtggccacataccatggagagagaaatctgtgtgttgggggagggaga gcacagcgattatgggacttcgcattggaactcagtgctgccctgtctcagtggaaagaa cacagggcagaatcaatgctgatgaagacagcatttagaccagagattaccctgaaatac ttactctctacttttgtaccagaaaatcttaaagccacagctggaccaaaggtatgccag ctccaaactttaaaagaaaagctaacagaaaagttccccaagtccccacccgacccagaa gcccagccagcttcatctctcaccaccacagcccagtcattctcagatgataaccttatg tcctacttgactgagaaaactgaaaaacgactttggaactgggtaacaggcagagactgg aacagtttggagggctcagaagaagacaggaaaatgtgggaaagtttggaatttcctaga gatttggacggcttagaagacaggaggatgtgggaaagtttggaacttcctggagacttt ttaaatggttttgaccaaaatgctgatggtgatatggacaataaagtgcaggctgaggtg gtctcagatggagatgaggaacttcttgggaactggagtaaaggtcactcttgctatgca acaagactggtggcattttgtccctgccctagagatctgtggaactttgaacttgagagg gatgatttagggtatctggtggaagaaatttttaagcagcaaagcattcaagaggaagca gagcataaaagtttggaaaacttgcagcctgatgatgcaatagaaaagaaaacccttatt ctggggagaaattcaagcacactgaagaaatttgcataa >gi568815594r:88162598_88378583|GENSCAN_predicted_peptide_2|227_aa MRLDQMYCNGLPLQAPVSGRGEQVAPKSSEMPGTAESQKEYYSMSSSDSGSPEVWAPRRR ALSSFLLVPVSSGLLGWPHPTTASCHMGQLPGAHGGQKGYSVTAPLAPATAREPARKVLQ LLLLLPFSGSWVLVLLPGKMELCRQLEVKMLKPFVINQGRAIQVIVRGVCRPGEAWYEHM ELCPELVLSGGFLVSLTSRMKLQTFAASVMAHKGSADPKSEQQQDLL >gi568815594r:88162598_88378583|GENSCAN_predicted_CDS_2|684_bp atgcggctagaccagatgtattgcaatgggcttccactgcaggcaccagtgtctggacga ggggaacaggtggcgcccaaaagctcagagatgccaggaaccgcagagtcccaaaaagag tattacagcatgtcgagctctgactcagggagccctgaggtctgggctcccagaaggaga gccctctcttctttcctcctcgtcccagtaagctcggggctcctgggctggccccacccc accactgcttcctgtcacatgggacagctgcctggtgcccatggagggcagaagggctat agtgttacagcccctttagctcctgccacagctcgggagccagctaggaaagtgttacag ctccttttgctcctgccattcagcgggtcctgggttcttgtcctgcttccaggaaaaatg gagttatgcagacagctagaggtgaagatgctgaagccatttgtcataaatcaaggaagg gctatacaagtgatagtcagaggagtgtgcaggcctggtgaggcctggtatgaacacatg gaactgtgtccagagttggttctttccggtgggttcttggtctcgctgacttcaagaatg aagctgcaaaccttcgcagcaagtgttatggctcacaaaggtagtgcggacccaaagagt gagcagcagcaagatttattatga >gi568815594r:88162598_88378583|GENSCAN_predicted_peptide_3|778_aa MSTAALITLVRSGGNQVRRRVLLSSRLLQDDRRVTPTCHSSTSEPRCSRFDPDGSGSPAT WDNFGIWDNRIDEPILLPPSIKYGKPIPKISLENVGCASQIGKRKENEDRFDFAQLTDEV LYFAVYDGHGGPAAADFCHTHMEKCIMFVMFEHNSWVKFVFASVRVICGRELCLVSSSRP GARGHYMRGFSGKRDLLNFVPATLLTSGTTATVALLRDGIELVVASVGDSRAILCRKGKP MKLTIDHTPERKDEKERIKKCGGFVAWNSLGQPHVNGRLAMTRSIGDLDLKTSGVIAEPE TKRIKLWNSPGEMVLLFFLTCTQSASITCTQSCSITCTQSSSITCTQSASPASNQPASPA PKQSASPAPSQPASPAPNQPASPTPNQPASSAPNQPASPAPNQSASLAPNQPASSAPNQP ASPAPNQPASPAPNQHHLHPISITCTQSASPASSQPASPAPNQPASPAPNQHHLHPISIT CTQSAAPAPSQPAPPAPNQHHLHTASITCTQSANITCTQSANIACTQSSSITCTQSSSIT CTQSASITCIQSASITCTQSASPAPNQLTPAPNQPASPAPNQPASPAPNQHHLHPISQHH LHPLHHADDSFLVLTTDGINFMVNSQEICDFVNQCHDPNEAAHAVTEQGIHEIEEIGPIS WMNTSFLISAACLADICILIHKEALPPVDDGPDYQLGLRVSVQQFFTEHVKKLIRSKRSP NSLDQRTNSPWKEDVTEQGDCLLPSAIRKYLERMTDESCPLAVLLAARERVLAQHSIH >gi568815594r:88162598_88378583|GENSCAN_predicted_CDS_3|2337_bp atgtcaacagctgccttaattactttggtcagaagtggtgggaaccaggtgagaaggaga gtgctgctaagctcccgcctgctgcaggacgacaggcgggtgacacccacgtgccacagc tccacttcagagcctaggtgttctcggtttgacccagatggtagtgggagtccagctacc tgggacaattttgggatctgggataaccgcattgatgagccaattctgctgccacccagc attaagtatggcaagccaattcccaaaatcagcttggaaaatgtggggtgcgcctcacag attggcaaacggaaagagaatgaagatcggtttgacttcgctcagctgacagatgaggtc ctgtactttgcagtgtatgatggacacggtggacctgcagcagctgatttctgtcatacc cacatggagaaatgtattatgtttgtaatgtttgagcacaattcctgggtgaaattcgta tttgcctctgtgcgtgttatttgtggcagagaactgtgcctggtcagcagctctagacct ggagcccgtggacactatatgcggggcttcagtggaaagagagatctgcttaattttgtc ccagcaactcttctgacctctgggactactgcaacagtagccctattgcgagatggtatt gaactggttgtagccagtgttggggacagccgggctattttgtgtagaaaaggaaaaccc atgaagctgaccattgaccatactccagaaagaaaagatgaaaaagaaaggatcaagaaa tgtggtggttttgtagcttggaatagtttggggcagcctcacgtaaatggcaggcttgca atgacaagaagtattggagatttggaccttaagaccagtggtgtcatagcagaacctgaa actaagaggattaagctatggaattctcctggtgaaatggtcctgctgttcttcctcacc tgcacccaatcagccagcatcacctgcactcaatcatgcagcatcacctgcacccagtca tccagcatcacctgcacacaatcagcatcacctgcatccaatcagccagcatcacctgca cccaagcaatcagcatcacctgcacccagtcagccagcatcacctgcacccaatcagcca gcatcacccacacccaatcagccagcatcatctgcacccaatcagccagcatcacctgca cccaatcaatcagcatcacttgcacccaatcagccagcatcatctgcacccaatcagcca gcatcacccgcacccaatcagccagcatcacctgcacccaatcagcatcacctgcatcca atcagcatcacctgcacccaatcagcatcacctgcatccagtcagccagcatcacctgca cccaatcaaccagcatcacctgcacccaatcagcatcacctgcatccaatcagcatcacc tgcacccaatcagcagcacctgcacccagtcaaccagcaccacctgcacccaatcagcat cacctgcacacagccagcatcacctgcacccaatcagccaacatcacctgcacccaatca gccaacatcgcctgcacccagtcatctagcatcacctgcacccagtcatccagcatcacc tgcacccaatcggccagcatcacttgcatccagtcagccagcatcacctgcacccaatca gcatcacctgcacccaatcagctaacacctgcacccaatcagccagcatcacctgcaccc aatcagccagcatcacctgcacccaatcagcatcacctgcacccaatcagccaacatcac ctgcacccattacatcatgctgatgacagcttcctggtcctcaccacagatggaattaac ttcatggtgaatagtcaagagatttgtgactttgtcaatcagtgccatgatcccaacgaa gcagcccatgcggtgactgaacagggtattcatgaaattgaggaaataggcccaatttca tggatgaacacttctttcctaatttcagctgcctgtcttgcagatatttgcattctaatt cacaaagaagctttgcctccagtggacgatgggcctgattaccagctgggacttagagtt tctgtgcaacagtttttcactgagcatgtcaagaaactgataagatcaaaaaggtctcct aactcactagatcagcgcacaaacagtccttggaaagaggacgtgactgaacaaggagac tgtcttctgccatctgccatcaggaaatacctggagaggatgacagatgagagctgtccc ctggcagtgctcctagcagctagagaaagagttctcgcacaacacagcatccactaa >gi568815594r:88162598_88378583|GENSCAN_predicted_peptide_4|111_aa MPPRPVAHSETEGQPHSSALTERTDEALLHVSSGRLRKARGAFETANSLAISTWSLFVTS GEFPHLQKAAAEKDITSYNALSTTALPAQSKTQVFGSKPAGAVHPSRCRTS >gi568815594r:88162598_88378583|GENSCAN_predicted_CDS_4|336_bp atgccaccacgcccggtcgctcattctgaaaccgaaggacagcctcattcctccgccctg acggaaagaaccgatgaggccttgctgcacgttagcagtgggaggctcagaaaagcccgt ggagcttttgagactgcaaattccctggctatttccacctggagcctctttgtcacatcc ggtgaattccctcatctacagaaagcggctgctgaaaaagacatcaccagctacaacgcc ttgagtactacagcgctccctgcgcaaagtaaaacccaagtcttcggatcaaaaccagcc ggggctgtgcatccttcacgatgccggacctcctga >gi568815594r:88162598_88378583|GENSCAN_predicted_peptide_5|100_aa MAINEQYKPGLCFHENWKENLKSLEYQGNIVRDDEKPVCHWLSSNQMHVQTHRLILYRVG TIRSDRKLESGKKENYSKGKTGRGPREGQQAAHMRDLCMA >gi568815594r:88162598_88378583|GENSCAN_predicted_CDS_5|303_bp atggcaataaatgaacaatataaacctggtctctgctttcatgaaaattggaaggaaaat ttgaagagtttggaatatcaaggtaacattgtaagagatgatgaaaagcctgtttgtcac tggttgtcaagcaaccaaatgcacgtacaaacacaccgccttattttgtaccgagtgggt accatacgaagtgataggaagctggaatctgggaagaaggagaattacagcaaaggaaaa acgggaaggggaccaagagagggacagcaggctgcccacatgagggacctctgtatggcc tga >gi568815594r:88162598_88378583|GENSCAN_predicted_peptide_6|351_aa MVSDHWTVRHAPAAAGRGSTGAGTLQVCGWTRYTTSSFHSWHWEHGGTQKLEDARNFRAP KRESQSWIGGLPSLGSPKGCTSSFLLFAHNVVSKGHVSVLFALQLLASFAKTGGSGSPAQ INLCVNESPHGQRPKHPNFLAQLSADPTAGCNFMSDPSSVQLWVTRSWRLSGCLSLGEWT GAKLQTFAVSVTALTVARLELFVPPGGFVVSLASGVKLQTFAVNVTAHKGSVDPKTRHKD SPSPQQTQQPSWLHPVDPALGPQVELPASPALCTRTPQLLGGRQLRPSEKLSTAAASPGA KPLTAQGQRGRPAAWSAGLPSPLHPELALAHKHCAQPQFLPSPLPPHLPAS >gi568815594r:88162598_88378583|GENSCAN_predicted_CDS_6|1056_bp atggtgtccgaccactggacagtcagacatgcccctgctgcagcgggcaggggcagcaca ggtgctggcaccctgcaagtctgtggctggaccaggtacaccacaagcagcttccacagc tggcactgggaacatggtggcactcagaagcttgaagatgccaggaatttcagggcccca aagagggaatcacagtcctggatcggggggctgccaagtctgggctccccaaagggctgc acctcttctttccttctctttgcccacaacgttgtgagcaaggggcatgtctcagtcctg tttgcattacaactcttagcctcatttgccaagacaggaggatcaggaagcccagctcag attaacctctgtgtgaatgaaagcccacatggacagaggcccaagcatcccaatttccta gctcagctctcagctgacccaacagctggatgtaacttcatgagtgatcccagctctgta cagctctgggtgacaaggagctggaggctatctgggtgcttatccttgggagagtggaca ggagcaaagctgcagaccttcgcagtgagtgttacagctctcacggtggcgcgtctggag ttgttcgttcctcccggtgggttcgtggtctcactggcttcaggagtgaagctgcagacc ttcgcggtgaatgttacagctcataaaggcagtgtggacccaaagactagacataaagat tctccaagtccccaacagactcagcagcccagctggcttcacccagtggatcccgcactg gggccgcaggtggagctgcctgccagtcccgcgctgtgcacccgcactcctcagctcttg ggtgggaggcagctaaggcccagcgagaagttgagcacagcagctgccagcccaggtgct aagcccctcactgcccagggccagcggggacggccagccgcttggagtgcagggctgcca agcccactccacccagaactcgcgctggcccacaagcactgcgcgcagccccaattcctg ccttcacctctccctccacacctcccggcaagctga >gi568815594r:88162598_88378583|GENSCAN_predicted_peptide_7|66_aa MEGPGLESRQYKEDTGKDEFTLTLAAKCSLLLGAIAVLFLRASSFHQTVSVPLSSEKCGR PGLAPS >gi568815594r:88162598_88378583|GENSCAN_predicted_CDS_7|201_bp atggaaggacctggactggagtcacgccagtacaaggaggatactggcaaagatgagttc acactcaccctggctgccaagtgctccctgctcctcggggctattgctgtgctgttcttg agagcttcctccttccaccagacagtttctgttccccttagttctgagaaatgtggccgt cctggcctagcaccctcctag >gi568815594r:88162598_88378583|GENSCAN_predicted_peptide_8|640_aa MEAESQLAETTSEFRATPVSGVSQNNQLASGIGGFLVSLTSRMKPRTLVVSVTVLKDGVS RVCSFPCSDVSGVSSFWRVCGWLTSGVKLQTFEDFAAPSLNTPTAQRELVPDNQEEKTVS HKEMETCYPGEEGMVVHTRDSASDQAQQVALHLGTGPEQGGVTHPLRLSGPGENLTSYLK LIIKPLCYLFGGLFTWTRVTFGAVTRIGDLPWEINPLSSCSLLSDKDPPVTSGPQTNQPK EHLTNFKLVSTPGFESFGSSFSTDPSDLSPPPQAAPCQAEPEALKITNCAQLTLYSSHNF QNLFSSLHLMHVLSAPRLLRLYSLFVESPTITIVPGPDFNPASHIIPDTTPDPHDCISLI HLTFTPFPHISFFPVPNPDHTWFIDGSSTRPNRHSPAKEGYAIVSSTSIIEATTLPPSTT SQQAELIALIWALTLAKGLHVNIYIDSKYAFHILHHHAVIWAERGFLAMRGSSIINASLI KTLLKAALLPKEAGVIHCKGHQKVSDPITQGNAYADKVAKESASVPTSVPRGQFFSFSSV TPTYSPTETSTYQSLPTQSKWFLDQGKYLLPASQACFILSSFHNLFHVDSVRIGLEDTQP LSPAELLACLVCGKKLTLIWSQKSSVWIVVNTVRAQKKTL >gi568815594r:88162598_88378583|GENSCAN_predicted_CDS_8|1923_bp atggaagcagaaagccagttagcagagactacttcagagttcagagctactcctgtgtct ggagtttctcaaaataaccagcttgcgtccggaattggtgggttcttggtctcgctgact tcaagaatgaagccgcggaccctggtggtgagtgttacagttcttaaagatggtgtgtct agagtttgttccttcccatgttcggacgtgtctggagtttcttccttctggcgggtttgt ggttggctgacttcaggagtgaagctgcagaccttcgaggactttgcggcacctagcctg aacactccgacagcccagagggagcttgtcccagacaatcaagaggaaaagacggtaagc cataaagagatggagacctgctatcctggcgaagagggaatggtggtccacacacgggat tcagcctcggatcaagcccagcaggtggccttgcaccttgggactgggcctgagcaggga ggagtcactcatccccttaggctttcaggccctggggagaatcttacaagttacctgaaa ctaattatcaagcctctatgctacctgtttggtggtctcttcacatggacgcgtgtgaca tttggtgccgtgactcggattggggacctcccttgggagatcaatcctctgtcctcctgc tctttgctcagtgacaaagatccacctgtgacctcgggtcctcagaccaaccagcccaag gaacatctcaccaattttaaattagtctccaccccaggttttgagtcctttggatcctcc ttttctacggacccatctgacctctcccctcctccccaggctgctccttgccaggctgag ccagaggccctcaaaatcacaaactgtgctcaactcactctctacagctctcataacttc caaaatctattttcttccttacacctgatgcatgtactttctgccccccggctccttcgg ctatactcactctttgttgagtctcccacaattaccattgttcctggcccggacttcaat ccagcctcccacattattcctgataccacacctgacccccatgactgtatctctctgatc cacctgacattcactccatttccccatatttccttctttcctgttcctaaccctgatcac acttggtttattgacggcagttccaccaggccaaatcgccactcaccagcaaaggaaggc tatgctatagtatcttccacatctatcattgaggctaccactctgcccccctccactacc tctcagcaagccgaactcattgccttaatttgggccctcactcttgcaaagggactacat gtcaatatttatattgactctaaatatgccttccatatcctgcatcaccatgctgttata tgggctgaaagaggtttcctcgctatgcgagggtcctccatcattaatgcctctttaata aaaactcttctcaaggccgctttacttccaaaagaagctggagtcattcactgcaagggc catcaaaaggtgtcagatcccatcactcagggcaatgcttatgctgataaggtagctaaa gaatcagctagtgtcccaacttctgtccctcgtggccagtttttctccttctcatcagtc actcccacctactctcctactgaaacttccacctatcaatctcttcccacacaaagcaaa tggttcttggaccaaggaaaatatctccttccagcctcacaggcctgttttattctgtcg tcatttcataacctcttccatgtagatagtgtcagaattggattggaggacactcagccg ctgtccccggcagaattgcttgcttgcctggtgtgtgggaaaaaactcacactcatttgg tcacagaagtcttctgtgtggattgttgttaacacagtgagagcccagaaaaaaacactg tga