GENSCAN 1.0	Date run: 4-Nov-116	Time: 02:26:37

Sequence gi568815586r:68452998_68653845 : 200848 bp : 40.07% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   3116   3172   57  1  0  123   17    74 0.300   1.96
 1.02 Intr +   4045   4185  141  0  0   83   49    70 0.102   2.23
 1.03 Term +  14949  15173  225  2  0   94   42   106 0.052   2.40
 1.04 PlyA +  17963  17968    6                               1.05

 2.00 Prom +  18889  18928   40                              -7.35
 2.01 Init +  20744  21710  967  1  1   97   81   724 0.052  66.14
 2.02 Intr +  22684  22796  113  2  2   79   47     0 0.021  -5.82
 2.03 Intr +  24326  24471  146  1  2   74   80    80 0.030   3.96
 2.04 Intr +  34643  34891  249  2  0   13   94   298 0.151  18.53
 2.05 Intr +  40793  40891   99  2  0   52   90    68 0.101   1.71
 2.06 Intr +  58274  58403  130  2  1   31  103    56 0.010   1.18
 2.07 Intr +  61959  62163  205  2  1   70   96    91 0.020   5.95
 2.08 Intr +  79726  79781   56  2  2   88   67    86 0.077   4.28
 2.09 Term +  91361  91456   96  1  0  103   53    91 0.329   4.09
 2.10 PlyA +  94513  94518    6                               1.05

 3.02 PlyA -  95677  95672    6                               1.05
 3.01 Sngl - 100885  99998  888  1  0   60   43   948 0.946  83.32
 3.00 Prom - 101618 101579   40                              -7.85

 4.00 Prom + 105935 105974   40                              -6.85
 4.01 Init + 107541 107751  211  2  1   91   55   218 0.856  17.79
 4.02 Intr + 121517 121571   55  0  1   64   66    65 0.001  -0.98
 4.03 Intr + 136879 137053  175  1  1   54   97    91 0.028   5.62
 4.04 Intr + 142885 142990  106  0  1   78   36    53 0.272  -2.03
 4.05 Term + 143274 143587  314  1  2   78   36   163 0.582   4.28
 4.06 PlyA + 144252 144257    6                               1.05

 5.03 PlyA - 146793 146788    6                               1.05
 5.02 Term - 158113 157895  219  1  0   74   45   213 0.602  11.66
 5.01 Init - 164968 164966    3  1  0  108   81     0 0.290   1.35
 5.00 Prom - 167532 167493   40                              -6.05

 6.02 PlyA - 167600 167595    6                               1.05
 6.01 Sngl - 174343 173873  471  1  0   54   55   431 0.368  30.27
 6.00 Prom - 179995 179956   40                              -3.35

 7.02 PlyA - 180005 180000    6                               1.05
 7.01 Term - 189944 189517  428  0  2   -5   38   482 0.394  28.38

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl +  20744  21724  981  1  0   97   49   727 0.946  66.25
S.002 Init + 124682 124813  132  1  0   65  105   129 0.979  12.49
S.003 Term + 134413 134540  128  2  2   57   43   145 0.917   4.36

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586r:68452998_68653845|GENSCAN_predicted_peptide_1|140_aa
MIPFTSCPGQVPVPEGHPRSSPFSTTSGPHLQAWQQADPARTREAYAVPLVEHMDTFEVL
PETVFRGWSWERQRETARGFFLSSTLASIGALWPQRMLELTLPIMGAFVGSTEVRPLESS
QKPLDVGNACLLASHSSLWP
>gi568815586r:68452998_68653845|GENSCAN_predicted_CDS_1|423_bp
atgatacccttcacttcctgtccaggtcaggttcctgtaccggagggccaccccaggtct
tctcctttcagcaccacatctggtcctcacctccaagcatggcagcaggctgaccctgct
aggactagagaagcatatgccgtaccccttgttgagcacatggatacctttgaggtcctt
cctgaaacagtgttcaggggctggagctgggaaagacagagagaaacagcaagaggtttc
ttcctgtcaagcactcttgcgagcattggtgctctctggcctcagaggatgttggagctg
acccttcctatcatgggtgcttttgtgggatccacagaggtacgcccactggagtcttct
caaaagccccttgatgtgggcaatgcctgtcttctggcttctcattcctctctgtggccc
tag
>gi568815586r:68452998_68653845|GENSCAN_predicted_peptide_2|686_aa
MADSRGRVLQDYRKKLLEHKIDGRLNELREQLKELTKQYEKSENDLKALQSVGQIVGEVL
QQLTEAKFIVKATNGPRYVVGCRRQLDKSKLKPGTGVALDMTTLTIMRYLPREVDPLVYN
MSHEDPWNVSYSEIGGLSEQIWELREVIELALTNPELFQWVGIMPPKCCLLHGPPGMGKT
LLAQAVASQLDYNFLKVVSSSIVDKYIGESACLIREMFNYPRNHKPCIIFMDEIDAIGGC
PFSEGTSADREIQSTSVKLLNQMDGFDTLHRVKMIMTTNRPDTLDPALLHPGRLDRNIHI
DLPYEQARLDLLKIHAGPITKHAHQLSSMFVYFMCGPRRFFFQCGPGKPKDWTLMQYILK
SISRIQPHFALTSIIVLIQATIISHLDYYNSCLTGLPASTMPLTTKQSETFRRPFAICNL
GVKQHGCSQPRPCGSRASQAHPGCLVTVLSIRAIIVIMSSLLLCPGTAVIIYRTQTHRVL
SGAVREPPKRARKYRQYWKLRLDYFTAVHKQSLNLALRLKERFYRPCHLLTCVTFGAFLT
LSVPHILNCERGDIMMIKRVNTRRAGHMHLDLSCGWLGQSQEARQPCNQITDKGQMSVTE
AKIIQNLDSKGRSKTKRISRMEEPSMYWSQRKSKCQETYPTQSEQQAEKQGSENQGGNVA
FLGDLKGCSKLKNFQELINQSALVHP
>gi568815586r:68452998_68653845|GENSCAN_predicted_CDS_2|2061_bp
atggcggactctagaggtagggtgcttcaggactaccgcaagaagctgcttgagcacaag
atcgacggccgtcttaacgagttaagggaacaattaaaagaacttaccaagcagtatgaa
aagtctgaaaatgatctgaaggccctacaaagtgttgggcagattgtgggtgaagtgctt
caacagttaactgaagcaaaattcattgttaaagctacaaatggaccaagatatgttgtg
ggttgtcgtcgacagcttgacaaaagtaagctgaagccaggaacaggagtcgctttggat
atgactacactaactatcatgagatatttgccaagagaggtggatccactggtttataac
atgtctcatgaggacccctggaatgtttcttattctgagattggagggctatcggaacag
atctgggaattaagagaggtgatagaattagctcttacaaacccagagttatttcagtgg
gtaggaataatgcctccaaaatgctgtttgttacacggaccaccaggtatgggaaaaaca
ctcttggcacaagctgttgctagccagctggactacaatttcttaaaggttgtatctagt
tctattgtagacaagtacattggtgaaagtgcttgtttgatcagagaaatgtttaattat
cccaggaaccataaaccatgcatcatttttatggatgaaatagatgctattggtggttgt
ccgttttctgagggtacttcagctgacagagagattcagagtacttcagtgaagttactg
aatcaaatggatggatttgatactctgcacagagttaaaatgatcatgactacaaacaga
ccggatacactggatcctgctttgctgcatccgggaagattagatagaaacatacatatt
gatttgccatatgaacaagcaagattagacctactgaaaatccatgcaggtcccattaca
aagcatgctcatcagctatcgtcaatgttcgtgtattttatgtgtggcccaagacgattc
ttcttccaatgtggccccgggaagccaaaagattggacactcatgcaatatattctcaag
tctatatccaggatccaaccacattttgcccttacctccatcatcgtcctgatccaggcc
accatcatctctcacctggattattacaacagctgtctaactggcctccctgcctccacc
atgcctctcaccacaaagcagtctgaaactttccgcagacccttcgcgatctgcaatctg
ggggtgaagcaacatggatgcagccagccaaggccctgtggaagtcgtgcttctcaagca
caccctggatgtctggtcactgtcctcagcatccgggccatcattgtcatcatgtcctcc
ttgttgctgtgcccaggcactgcagtcatcatctaccgcacgcagacccatcgggtcctc
agtggggctgtccgcgagcctcccaagagggccagaaaatatcgccagtactggaagcta
agattggactattttactgctgtacataagcaaagccttaatctggctttaaggctcaaa
gaaagattctacaggccctgtcatttacttacctgtgtgacttttggagcttttcttacc
ctctctgtgcctcatatcctcaactgtgaacggggggatataatgatgattaaacgagtt
aacacacgcagagctgggcacatgcatttggatttgagttgtggttggctgggccaaagc
caggaagccaggcagccttgcaatcagatcacggacaaaggccagatgtcagtcactgaa
gcaaaaataatacagaatcttgattccaaaggtagatccaaaactaagaggataagcagg
atggaagagccatccatgtactggagtcaaagaaagtccaaatgccaggaaacttaccca
acccagtctgagcaacaagctgaaaagcagggttctgagaaccagggaggaaacgtagcg
ttccttggagacctgaagggatgcagtaagcttaagaattttcaagagcttatcaatcag
tcagcccttgttcatccctga
>gi568815586r:68452998_68653845|GENSCAN_predicted_peptide_3|295_aa
MSGALDVLQMREEDVLKFLAAGTHLGGTNLDFQMEQYIYERKSDGIYIINLKRTWEKLLL
AAHAIVAIENPADVSVISSRNTGQTAVLKFAAATGATPIAGRFTPGTFTNQIQAAFREPR
LLVVTDPRADHQPLTEASYVNLPTIALCNTDSPLRYVDIAIPCNNKGAYSVGLMWWMLAR
EVLCMLGTISCEHPWEVMPDLYFYRDPEEIEKEEQAAAEKAVTKEEFQGEWTAPAPEFTA
IQPEVADWSEGVQVPSVPIQQFPTEDWSSQPAMEDWSAAPTAQATEWVGATTDWS
>gi568815586r:68452998_68653845|GENSCAN_predicted_CDS_3|888_bp
atgtccggagcccttgatgtcctgcaaatgagggaggaggatgtccttaagttccttgca
gcaggaacccacttaggtggcaccaatcttgacttccagatggaacagtacatctatgaa
aggaaaagtgatggcatctatatcataaatctgaagaggacctgggagaagcttctgctg
gcagctcatgctattgttgccattgaaaaccctgctgatgtcagtgttatatcctccagg
aatactggccagacggctgtgctgaagtttgctgctgccactggagccactccaattgct
gggcgcttcactcctggaaccttcactaaccagatccaggcagccttccgggagccacgg
cttcttgtggttactgaccccagggctgaccaccagcctctcacggaggcatcttatgtt
aacctacctaccattgcgctgtgtaacacagattctcctctgcgctatgtggacattgcc
atcccatgcaacaacaagggagcttactcagtgggtttgatgtggtggatgctggctcgg
gaagttctgtgcatgcttggcaccatttcctgtgaacacccatgggaggtcatgcctgat
ctgtacttctacagagatcctgaagagattgaaaaagaagagcaggctgctgctgaaaag
gcagtgaccaaggaggaatttcagggtgaatggactgctccagctcctgagttcactgct
attcagcctgaggttgcagactggtctgaaggtgtacaggtgccctctgtgcctattcag
caattccctactgaagactggagctctcagcctgccatggaagactggtctgcagctccc
actgctcaggccactgaatgggtaggagcaaccactgactggtcttaa
>gi568815586r:68452998_68653845|GENSCAN_predicted_peptide_4|286_aa
MDKSRGDIAADRWRTLQEEGSHLAGKLPAGKNAGGCLSWEEFPSPLQAQKPGNEWEAAGV
GQASQGKWPRNSWAPQVQAQMRWESMDPSPRGLGRKNGFHGPGPVPCCPEQPHSLGTLLP
ASWLLRLPQWLTGNQVKLTLLLQRVQAGPQSAPSLLRFQDLRPDLLTSLSPRALSTSDTH
LSSSSFFPLAERRLEKKEPPLHSIRDSSSSKVISERAAAPKVKLAPVIVTSGLFERRSMG
FEAKSTYIQNSFERRSMSFEAKSTYIQNSGSLHTQASSEPLSLPLK
>gi568815586r:68452998_68653845|GENSCAN_predicted_CDS_4|861_bp
atggacaagtctagaggcgatatagctgcagacaggtggaggacactgcaagaggaaggc
tcccatttagcaggaaaactgccagctgggaagaatgctggtggctgcctgagttgggaa
gaattcccaagccctctgcaagctcagaagccaggaaatgaatgggaagcagctggagtg
ggccaagcaagtcaaggcaaatggcccagaaattcatgggcccctcaagtccaagcacag
atgcgttgggaatctatggatcccagtcccagaggcctaggaaggaagaatggtttccat
gggccaggtccagtgccctgctgccctgagcagcctcacagcctcgggacactgcttcct
gcatcctggctgctccggctcccacagtggctaacagggaaccaggttaagctcacactg
ctgcttcagagggtgcaagccggtccacaatctgccccgtcactgctcaggtttcaggac
ctccggccagacctgctgacctctctgagccctagggcactgagcactagtgacactcac
ttgtcaagcagcagcttctttcccttagcagagcgcaggctagagaagaaagagccacca
ctacattccatcagagacagcagtagttccaaagtgatttcagaacgtgcagcagctcca
aaagtgaagctagctcctgtgatagtaacaagtggcttgtttgagagaaggagcatgggc
tttgaagccaaatcaacctatattcaaaatagctttgagagaaggagcatgagctttgaa
gccaaatcgacctatattcaaaattctggctctctgcacacacaggcaagttctgagcct
ctttctttgcctttaaaatag
>gi568815586r:68452998_68653845|GENSCAN_predicted_peptide_5|73_aa
MPAPGPARPPPRLPSAATHSLCEPLTLSPGPCSQRRPALPPGRYSRRHGGPAAAAPLRLV
YTPESGRGLAMSP
>gi568815586r:68452998_68653845|GENSCAN_predicted_CDS_5|222_bp
atgccggcgcccggccccgcgcggccgccgccgcggcttccctcagcggccacgcactca
ctctgcgaacctctcacgctgtcaccgggtccctgcagccagcgtcgccccgcgctcccc
ccgggtcgctactctaggcgccacggcggtcctgccgctgccgcgccgctccggctggtt
tacacgcctgaatctgggcgaggtttggcgatgtcgccttga
>gi568815586r:68452998_68653845|GENSCAN_predicted_peptide_6|156_aa
MGGPSVQWLQTAASLVVYAACVLYCMGCSKGPWRQSFQMDVHVSDLALPQCRFQTGMRGA
FGKPQGTVARVHTGQVIISIHTKLQNKEHVIEALRRAKFKFSGRQKIHISKKWGFTKFNA
NEFEDMVTEKRLIPDGCRVKYISNRGPVDKWRALHS
>gi568815586r:68452998_68653845|GENSCAN_predicted_CDS_6|471_bp
atgggggggcccagtgtgcaatggctgcaaacagcagcttccttggtagtgtatgcagcc
tgtgtgttgtattgtatgggttgctctaagggaccctggagacagtcctttcagatggat
gttcatgtttctgaccttgcactaccccagtgtaggttccaaacaggcatgcgaggtgcc
tttgggaagccccagggcactgtggccagggttcacactggccaagttatcatctccatt
cacaccaagctgcagaacaaggagcatgtgattgaggccctgcgcagggccaagttcaag
ttttctggccgccagaagatccacatctcaaagaagtggggcttcaccaagttcaatgcc
aatgagtttgaagacatggtgactgagaagcggctcatcccagatggctgtcgggtcaag
tacatttccaatcgtggccctgtggacaagtggcgggccctgcactcatga
>gi568815586r:68452998_68653845|GENSCAN_predicted_peptide_7|142_aa
XIIPQNQKAIASFLKSWNETLTSRLAALPENPPVIDWAYYKANVAKAGLVDDFKKFNALK
VPVPEDKYTAQVDAEEKDVKSCAEWVSLSKARIVEYEKQMEKMKNLIPFDQMTTEDLNEA
FPETKLDEKKYPYWPHQLIENL
>gi568815586r:68452998_68653845|GENSCAN_predicted_CDS_7|429_bp
nagatcataccccagaaccaaaaggccattgctagtttcctgaaatcctggaatgagacc
ctcacctccaggttggctgctttacctgagaatccaccggttatcgactgggcttactac
aaggccaacgtggccaaggccggtttggtggatgactttaagaagtttaatgccctgaag
gttcccgtgccagaggataaatatactgcgcaggtggatgccgaagaaaaagatgtgaaa
tcttgtgctgagtgggtgtctctctcaaaggccaggattgtagaatatgagaaacagatg
gagaagatgaagaacttaattccatttgatcagatgaccactgaggacttgaacgaagct
ttcccagaaaccaaattagacgagaaaaagtatccttattggcctcaccaactaatcgag
aatttataa