GENSCAN 1.0    Date run: 4-Nov-116    Time: 01:31:15

Sequence gi568815583r:52482157_52713859 : 231703 bp : 38.84% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   6978   7098  121  2  1   13   94   148 0.982   8.30
 1.02 Term +   8236   8372  137  2  2   12   48   115 0.616  -2.90
 1.03 PlyA +   9740   9745    6                               1.05

 2.00 Prom +  17538  17577   40                              -4.35
 2.01 Sngl +  23033  23707  675  1  0   77   38   735 0.509  63.33
 2.02 PlyA +  23733  23738    6                               1.05

 3.00 Prom +  45763  45802   40                              -3.25
 3.01 Init +  46510  46614  105  0  0   68    3   232 0.907  10.97
 3.02 Term +  46900  47283  384  0  0   26   41   233 0.813   6.30
 3.03 PlyA +  48511  48516    6                              -0.45

 4.03 PlyA -  48720  48715    6                               1.05
 4.02 Term -  50477  50332  146  0  2  110   48    78 0.193   3.09
 4.01 Init -  58405  58090  316  1  1  100   31   165 0.199   9.54
 4.00 Prom -  59512  59473   40                              -6.65

 5.13 PlyA -  61331  61326    6                               1.05
 5.12 Term -  69948  69778  171  0  0   82   38   134 0.908   4.64
 5.11 Intr -  75066  74944  123  0  0    2   98   102 0.113   2.46
 5.10 Intr -  86746  86692   55  0  1   84  105    80 0.006   7.36
 5.09 Intr - 102788 102589  200  2  2   58   80   218 0.918  15.23
 5.08 Intr - 111580 111521   60  1  0  100   75    53 0.746   3.21
 5.07 Intr - 123090 122977  114  0  0   80   69    34 0.538   0.42
 5.06 Intr - 128227 126412 1816  0  1   63  111   966 0.950  82.77
 5.05 Intr - 129099 128974  126  2  0   96   91   161 0.999  16.07
 5.04 Intr - 129621 129419  203  0  2   35   98   143 0.944   7.16
 5.03 Intr - 131701 131517  185  2  2   77   94   153 0.972  13.39
 5.02 Intr - 137957 137837  121  2  1   58   63    78 0.527   1.45
 5.01 Init - 142245 142213   33  0  0   72  106    17 0.507   1.82
 5.00 Prom - 146809 146770   40                              -3.75

 6.00 Prom + 155849 155888   40                              -2.25
 6.01 Init + 182912 182971   60  1  0   62   92    64 0.320   5.50
 6.02 Intr + 197378 197771  394  1  1   50   35   213 0.059   5.50
 6.03 Term + 217566 217666  101  1  2   57   36   107 0.132  -0.29
 6.04 PlyA + 218057 218062    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  75063  74944  120  0  0   46   98    90 0.823   6.04
S.002 Init +  90640  90796  157  0  1   58  105    91 0.892   7.92
S.003 Term - 100151  99998  154  1  1   86   44    97 0.891   1.51

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815583r:52482157_52713859|GENSCAN_predicted_peptide_1|85_aa
MLRAKTVRLLEENTGGKLHDTEFGTDFLELTPGAQAIKEKEYYLASKEKEILKDVTTWMN
LEEILKDVTTWMNLEDITPYEISQS

>gi568815583r:52482157_52713859|GENSCAN_predicted_CDS_1|258_bp
atgttaagagctaaaactgtaagactcttagaagaaaatacaggaggaaaacttcatgat
actgaatttggcactgatttcttggagctgacaccaggagcacaggcaataaaagaaaaa
gaatattatttggcctcaaaagagaaggaaattctgaaagatgtcacgacatggatgaac
cttgaggaaattctgaaagatgtcacgacatggatgaaccttgaggacattacgccatat
gaaataagccagtcatga

>gi568815583r:52482157_52713859|GENSCAN_predicted_peptide_2|224_aa
MGFGDLKSPAGLQVLNDYLTDKSYIKGYVPSQADVAVFEAVPRPLPADLCHALCWYNHIK
SYEKEKASLPGVKKALGKYGPADVEDTTGSGATDSKDDDDIDLSGSDDEEESEEAKRLRE
ERLAQYESKKAKKPALVSKSSILLDVKSWDDETDMAKLEGVRSIQADVLVWDSSKLVPVG
YRIKKLQIQCVVEDDKVGTDMLEEQITAFEDYVQPMDVAAFNKI

>gi568815583r:52482157_52713859|GENSCAN_predicted_CDS_2|675_bp
atgggttttggagacctgaaaagccccgccggcctccaggtgctcaatgattacctgaca
gacaagagctacatcaaggggtatgtgccatcacaagcagatgtggcagtatttgaagca
gtgccccgcccactgcctgccgacttgtgtcatgctctatgttggtataatcacatcaag
tcttacgaaaaggaaaaggccagcctgccaggagtgaagaaagctttgggcaagtatggt
cctgcagatgtggaagacactacaggaagtggagctacagatagtaaagatgatgatgac
attgatctctctggatctgatgatgaggaagaaagtgaagaagcaaagaggctaagggaa
gaacgtcttgcacaatatgaatcaaagaaagccaaaaaacctgcacttgtttccaagtct
tccatcttattagatgtgaaatcttgggatgatgagacagatatggcgaaattagagggc
gtcagaagcattcaagcagacgtcttagtctgggactcatctaaactagttccagtggga
tacagaattaagaaacttcaaatacagtgtgtagttgaggatgataaagttggaacagat
atgctggaggagcagatcactgcgtttgaggactatgtgcagcccatggatgtggctgct
ttcaacaagatctaa

>gi568815583r:52482157_52713859|GENSCAN_predicted_peptide_3|162_aa
MALVGLAWARRSRPLPSLTAGGEGRTAPVLDAGRGGSPPPPQPLRRPAMLPPLPPPPPPP
PCALWEREEAALQRCHRGCSGLSRAHAASHRLRAQSIIIISLPAGPAAASRPAEARKPAR
DVSPEGAAHPGRPHPEASPGPSAAEPSSRGSAAMARAVTVSE

>gi568815583r:52482157_52713859|GENSCAN_predicted_CDS_3|489_bp
atggcgctggtggggctcgcctgggcccggcgctcccgccccctccccagcctgacagct
ggcggcgagggccgcacagccccagtcctcgacgccggccgcggggggtcgccgccgccg
ccgcagccacttcggcggcccgcgatgctgcctccgctgccgccgccgccaccgccgccg
ccctgtgcattatgggagcgggaggaggctgcgctccagcgctgccaccgcggctgctct
gggctctctcgggctcacgctgcctcccatcggctacgagcacagagcatcatcatcatc
agcctgccagccggccccgccgccgcctcgcgccctgccgaggcccggaaacccgcccgg
gacgtgagccccgagggcgcggcacaccccgggcgcccccaccccgaagcgagcccgggg
ccctctgctgcggagcctagctcccgcggtagcgcagccatggcacgggctgttactgtg
tccgaataa

>gi568815583r:52482157_52713859|GENSCAN_predicted_peptide_4|153_aa
MAKTGQGTAQAVALESASPKLWQFPVGAQKSRIEVWESPSRFQRMYGNAWMSRQKSAAGA
GPSWRTSARAVQKGNVRSEPPLWVPTGALPTGVVRRGPLSSRPQHASFPHAVPPHVLQPQ
TTSHPYHHLDLSMNYLEFFLADLNVTYAQYLVL

>gi568815583r:52482157_52713859|GENSCAN_predicted_CDS_4|462_bp
atggctaaaacaggtcaaggtacagctcaggctgttgctttagagagtgcaagccccaag
ctttggcaatttcctgtgggtgcacagaagtcaagaatcgaggtttgggaatctccatct
agatttcagaggatgtatggaaatgcctggatgtccaggcagaagtctgctgcaggagca
gggccctcatggagaacctctgctagggcagtgcagaagggaaatgtgaggtcagagccc
ccactctgggtccccactggggctctgcctactggggttgtgagaagagggccactgtcc
tccagaccccagcatgcttcatttcctcatgctgttcccccacatgtccttcaaccacaa
acgacttcccatccatatcaccacttagatctctcaatgaactatttggagttctttctc
gctgacttaaatgtcacctatgcccagtacctagtcctctga

>gi568815583r:52482157_52713859|GENSCAN_predicted_peptide_5|1068_aa
MASVNSGEENKNRWGILRSCYTPEPEEHAKAVSLIIAPRQPGASTILIEVLDTLDEYFEY
DAEEFLVSLALLITEGRTPECSVKGRTESFHCPPAQSCYPVTTKHECSDKLAQCRQARRT
RSEVTLLWKNNLPIMVEMMLLPDCCYSDDGPTTEGIDLNDPAIKQDALLLERWILEPVPR
QNGDRFIEEKTLLLAVRSFVFFSQLSAWLSVSHGAIPRNILYRISAADVDLQWNFSQTPI
EHVFPVPNVSHNVALKVSVQSLPRQSNYPVLTCSIHTNIGLYEKRIQQHKLKTHQHHNPN
EAEQCGTNSSQRLCSKQTWTMAPESVLHAKSGPSPEYTAAVKNIKLYPGTGSKSDHGTSQ
ANILGFSGIGDIKSQETSVRTLKSFSMVDSSISNRQSFWQSAGETNPLIGSLIQERQEII
ARIAQHLIHCDPSTSHVSGRPFNTQESSSLHSKLFRVSQENENVGKGKEAFSMTFGSPEF
SSPEDTNEGKIRLKPETPRSETCISNDFYSHMPVGETNPLIGSLLQERQDVIARIAQHLE
HIDPTASHIPRQSFNMHDSSSVASKVFRSSYEDKNLLKKNKDESSVSISHTKCSLLGDIS
DGKNLVPNKCFTSFKNNSKEKCSLKHQTRNQCQNNPSEIIQSTYQETQNKSSSLSTSSIL
SQHKENNLDLTSRFKEQEMSNGIDKQYSNCTTIDKQICTNKYKEKIINENYNPKFFGNLQ
SDDSKKNDSKIKVTVLEMSEYLNKYESMSSNKDSKRPKTCEQNTQLNSIENYLNKDNEGF
KCKKSDQLKNEQDKQEDPTNEKSQNYSQRRSIKDCLSTCEQPKNTEVLRTTLKHSNVWRK
HNFHSLDGTSTRAFHPQTGLPLLSSPESVLNYRFDPLGIVDGFTAETLFNPNKTVVKMFV
VIYDLRDMPANHQTFLRQRTFSVPVKQEVKRSVNKENIRHTEERLLRYLIHLRGSTMSAE
VPEAASAEEQKEMEDKVTSPEKAEEAKLKARYPHLGQKPGGSDFLRKRLQKGQKYFDSGD
YNMAKAKMKNKQLPTAAPDKTEVTGDHIPTPQDLPQRKPSLVASKLAG

>gi568815583r:52482157_52713859|GENSCAN_predicted_CDS_5|3207_bp
atggcttctgtaaattctggggaagagaacaaaaacaggtggggaattttaagaagctgc
tacactccagaaccagaagagcatgcaaaagcagtcagtctgatcatagctcctagacaa
cctggtgcaagcacaatcctcattgaggtcttagatactctggatgaatattttgaatat
gatgcagaggagttcttggtctctttggccttgctgataacagaaggacgaacacctgaa
tgttctgtaaaaggtcgaacagaaagctttcattgccctccagcacagtcttgttaccca
gtaactaccaaacatgaatgtagtgacaagctggcccagtgccgccaagccagacgaact
aggtctgaggtcacattgttgtggaagaataaccttccaatcatggtggaaatgatgcta
ctaccagactgctgctacagcgatgatgggcccaccacagagggaattgatctaaatgat
cctgcgattaagcaagatgcattattattagaaagatggatcttggagccagttcctcga
cagaatggtgaccgatttattgaagagaagacgcttctgttggctgtccgctcatttgtg
tttttttctcagttaagtgcatggctgagtgtttctcatggtgctattccacgaaatatt
ctctacagaatcagtgctgctgatgtagacctacagtggaatttttcacagactccaatt
gagcatgtgtttcctgttcccaatgtttctcacaatgttgccttgaaagtcagtgttcag
tccttgcccagacaatctaattatccagttttgacgtgcagtattcacactaatattggc
ctttatgagaaaagaattcaacaacataaacttaaaactcatcagcaccataacccaaat
gaagcagaacaatgtggtacaaacagttcacagcgtctgtgtagcaaacaaacttggacc
atggcacctgaaagtgtgttacatgcaaaaagtggcccaagtccagaatatactgcagct
gtcaaaaatatcaaactatatccaggcactggcagtaaatctgaccatgggacatctcaa
gccaatattctaggctttagtggtataggtgatataaaatcacaagaaacatcagtgaga
actttaaaatcattttcaatggttgattccagtatctctaaccgccagagtttctggcag
tcagctggtgagactaaccctttaataggctctttaattcaggagcggcaagaaatcatt
gcaagaattgcccaacatttgattcattgtgatccaagcacttcacatgtttctggacgt
ccatttaatactcaagagtctagttcactccattcaaaacttttccgggtttcacaagaa
aatgagaacgtgggaaaaggtaaagaagctttctccatgacttttggtagtccagagttt
agttccccagaagacaccaatgaggggaaaattcgactaaaaccagaaactcctcgaagt
gaaacttgtatttctaatgacttttattctcatatgcctgttggagagactaatcctttg
ataggctctttactccaggagcggcaagatgttattgcaaggattgctcaacacttggag
cacattgatccaacagcatcacatatcccccggcagtcattcaacatgcatgactccagt
tcggttgcatctaaagtgtttaggagttcatatgaagacaaaaatttgttgaagaaaaat
aaggatgagtcctcagtttccatttctcacacaaaatgttccttgttaggagacatcagt
gatgggaaaaacttagtacctaataaatgttttacttcttttaaaaataatagtaaagaa
aagtgttctttgaaacatcaaacaagaaatcagtgtcagaacaatcctagtgaaatcatc
caaagtacgtatcaggagacacagaacaaaagttctagtttatcaacttcctcaattttg
tctcagcacaaagaaaataacttagatttgacaagcagattcaaggagcaagaaatgagc
aatggaattgataaacagtattcaaattgcaccactattgacaaacagatttgtacaaat
aagtataaggaaaaaataataaatgagaactataatccaaaattctttggcaatcttcag
tctgatgattccaaaaaaaatgactcaaaaataaaagttactgtgttggaaatgtctgaa
tatttgaacaaatatgaaagcatgtcctcaaataaagactcaaaaaggcctaagacatgt
gagcaaaatactcaacttaatagcatagagaattatctcaataaagataatgaaggtttc
aaatgtaaaaagtcagaccaattaaaaaatgaacaagataagcaagaagatccaactaat
gaaaaatcccaaaactattctcagagaagaagtataaaagactgtttgtctacatgtgag
caaccaaaaaatacagaggtattgaggactacactgaaacattcaaatgtgtggcgaaaa
cataattttcattccttggatggaacttcaaccagagcctttcatcctcaaactggattg
cctcttctttcaagccctgaatctgtcttgaactatcgtttcgatcctctcggcattgtt
gatggttttactgccgagaccttatttaatcctaataagactgtggtgaagatgtttgtt
gtgatatatgatttacgagatatgccagccaatcatcagacattcctacgacaaagaact
ttttctgtacctgttaaacaagaagtgaagagaagtgttaataaagagaacatccgacac
acagaagaacggttattacgctacctcatacatctgagagggagcactatgtctgcggaa
gtccccgaggcagcctccgcggaggagcagaaggaaatggaagataaagtgactagtcca
gagaaagcagaagaagcaaaattaaaagcaagatatcctcatctgggacaaaagcctgga
ggttcagatttcttaaggaaacggttgcagaaagggcaaaaatattttgattctggggat
tacaacatggctaaagcaaaaatgaagaacaagcaacttcctactgcagctccggataag
acggaggtcactggtgaccacattcccactccgcaagaccttcctcaacggaagccgtcc
cttgttgctagcaagctggctggctga
>gi568815583r:52482157_52713859|GENSCAN_predicted_peptide_6|184_aa
MDIEVGTIDTGDYEGNGGRQVPGGCEGAGDGGWETPAESRKPETSQALRLRSQPAEARLD
SETRNRSCAIRAQGNWGDVIQRPAPDVEDNYSRRLLLLLLLLLLLLLPPPPPSPPPPPPP
PPPGMLAQSCDRAPSGLQRIMEPGFNVTGTGAGILCKYSEYDIKWECAQVNLQRITMPYC
SIRI

>gi568815583r:52482157_52713859|GENSCAN_predicted_CDS_6|555_bp
atggacatagaagtgggaacaatagacactggggactacgaggggaatggagggaggcag
gtgcccggcgggtgcgagggcgctggagatggcggctgggagacaccagcagagtcccgg
aagcccgagacgtcccaggcacttcggttgcgctcccagccggcagaggcgcgcctggat
tctgagaccaggaatcggagctgcgctattagagcgcagggtaactggggagatgtcatc
caaaggcctgccccagacgtggaggataattatagtcgtcgtctcctcctcctcctcctc
ctcctcctcctcctcctcctcccccctccccctccttccccccctcctcctcctcctcct
cctcctcccgggatgctggcacagagttgtgatcgagccccaagtggattgcagcgcata
atggaacccgggtttaatgtgaccgggacgggggcgggcatcctgtgcaagtactcagaa
tatgatatcaagtgggagtgtgcccaggttaacctgcagaggatcaccatgccttactgt
agcattcggatttag