GENSCAN 1.0 Date run: 3-Nov-116 Time: 19:01:44 Sequence gi568815590r:27187553_27411235 : 223683 bp : 44.60% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.11 PlyA - 659 654 6 1.05 1.10 Term - 14186 14156 31 1 1 107 47 49 0.110 0.03 1.09 Intr - 21658 21540 119 1 2 77 79 14 0.031 -1.34 1.08 Intr - 30128 30024 105 2 0 57 84 63 0.234 3.21 1.07 Intr - 32462 32333 130 1 1 23 91 67 0.198 1.20 1.06 Intr - 49353 49298 56 0 2 158 103 131 0.981 19.38 1.05 Intr - 52610 52419 192 2 0 84 100 260 0.895 26.49 1.04 Intr - 53710 53502 209 2 2 78 94 290 0.906 27.40 1.03 Intr - 54205 54125 81 2 0 99 53 46 0.609 1.81 1.02 Intr - 54940 54845 96 2 0 68 86 137 0.936 11.48 1.01 Init - 57093 56880 214 0 1 77 42 118 0.549 5.01 1.00 Prom - 74606 74567 40 -1.76 2.07 PlyA - 76085 76080 6 1.05 2.06 Term - 100575 99998 578 1 2 79 49 794 0.998 69.23 2.05 Intr - 101728 101610 119 0 2 102 93 194 0.988 21.41 2.04 Intr - 102626 102604 23 2 2 102 131 36 0.998 5.64 2.03 Intr - 106758 106528 231 0 0 155 91 370 0.999 41.97 2.02 Intr - 111007 110912 96 1 0 81 99 185 0.085 19.11 2.01 Init - 123683 123249 435 2 0 84 94 712 0.995 67.57 2.00 Prom - 125616 125577 40 -4.26 3.06 PlyA - 125879 125874 6 1.05 3.05 Term - 129148 129005 144 1 0 52 43 106 0.618 0.41 3.04 Intr - 134889 134815 75 0 0 64 94 80 0.813 5.91 3.03 Intr - 136753 136673 81 1 0 98 75 22 0.371 1.73 3.02 Intr - 139439 139358 82 1 1 94 93 8 0.326 1.54 3.01 Init - 143462 143329 134 2 2 51 116 74 0.765 6.11 3.00 Prom - 155992 155953 40 -2.46 4.06 PlyA - 157686 157681 6 1.05 4.05 Term - 170068 169834 235 0 1 125 55 80 0.479 4.19 4.04 Intr - 177303 177144 160 1 1 -26 81 126 0.396 -0.45 4.03 Intr - 181782 181683 100 0 1 79 121 3 0.013 2.38 4.02 Intr - 197722 197682 41 2 2 103 92 14 0.019 1.24 4.01 Init - 207020 206942 79 2 1 42 93 60 0.476 3.12 4.00 Prom - 207534 207495 40 -4.26 5.00 Prom + 208711 208750 40 -8.36 5.01 Init + 210033 210236 204 2 0 51 114 263 0.929 24.05 5.02 Term + 212215 212247 33 0 0 98 55 14 0.359 -3.31 5.03 PlyA + 212495 212500 6 1.05 6.03 PlyA - 212547 212542 6 -3.24 6.02 Term - 214414 214299 116 2 2 55 43 221 0.978 13.03 6.01 Init - 219122 218987 136 2 1 30 109 118 0.808 8.40 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 110992 110912 81 1 0 94 99 184 0.911 21.07 S.002 Term - 183820 183690 131 2 2 60 48 134 0.829 4.94 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590r:27187553_27411235|GENSCAN_predicted_peptide_1|410_aa MPSCQLLSDSNSSFKALLRCYRHQGHLPGSLANAGGVEPYFCVPQQFLMLISGGVEPYLC VPQQFLTLISGAYKEKMKELPLVSLFCSCFLADPLNKSSYKYEGWCGRQCRRKDESQRKD SADWRERRAQADTVDLNWCVISDMEVIELNKCTSGQSFEVILKPPSFDGVPEFNASLPRR RDPSLEEIQKKLEAAEERRKYQEAELLKHLAEKREHEREVIQKAIEENNNFIKMAKEKLA QKMESNKENREAHLAAMLERLQEKDKHAEEVRKNKELKEEASREQYIIQSGQKTTYNGVG RRWAWNEWTIGKDTERQTEKSGLNLQSSIPFAHTLPTIVFLLMLIFQIPVTLDTVSLPSN LTFGLDNPLETISPYPSFARRGNSCLEREHDSSKIIQITQWVYQLTIIAE >gi568815590r:27187553_27411235|GENSCAN_predicted_CDS_1|1233_bp atgccctcctgccagctcctgtctgattctaactcatccttcaaggcactgctcaggtgc taccgacaccagggacacctcccaggctccctggcaaatgctgggggtgtggagccctat ttctgtgtcccacagcaattcctcatgctgatttctggtggtgtggagccctatctctgt gtcccacagcaattcctcacgctgatttctggtgcctacaaagagaagatgaaggagctc ccgctggtgtccttgttctgctcctgcttcctggccgatcccctgaataagtcgtcctac aaatatgaaggctggtgtgggagacagtgtaggaggaaggatgaaagccagcggaaagac agtgctgactggagagaaagaagagctcaggcagacacggtggacctgaattggtgcgtc atttccgacatggaagtcatcgagctgaacaaatgcacctcgggccaatcctttgaagtc atcctgaagccaccctcctttgatggggttcccgagttcaacgcctccctgccaaggcgg cgagacccatccctggaagagatccagaagaaactagaagcggctgaggagcgaaggaag taccaggaagcggagctcctgaaacacctagcagagaaacgggaacatgagagagaggtg atccaaaaggccattgaggaaaacaacaacttcatcaagatggctaaggaaaaactggcc cagaagatggaatccaacaaggagaacagggaggcccacctcgccgccatgttggaacgg ctgcaagagaaggacaagcacgccgaggaggtgcggaaaaacaaggagctgaaggaagag gcctccagggaacagtacattatccagagtggccagaagaccacgtacaatggggttggt agaagatgggcatggaatgaatggacaattgggaaggacactgaacgtcagactgagaaa tctggacttaatctacagtcatccattccatttgctcacacattgccaaccatcgtcttt ctcctgatgctgatctttcagatcccagtgactcttgacactgtatccttgccttccaac ctgacttttgggctggacaatcccttagagaccattagtccatatccttcatttgccaga cgaggaaattcatgcttagagagagaacatgactcttccaagatcatacagatcacacag tgggtttaccaactaaccatcattgctgaatga >gi568815590r:27187553_27411235|GENSCAN_predicted_peptide_2|493_aa MERSPDVSPGPSRSFKEELLCAVCYDPFRDAVTLRCGHNFCRGCVSRCWEVQVSPTCPVC KDRASPADLRTNHTLNNLVEKLLREEAEGARWTSYRFSRVCRLHRGQLSLFCLEDKELLC CSCQADPRHQGHRVQPVKDTAHDFRAKCRNMEHALREKAKAFWAMRRSYEAIAKHNQVEA AWLEGRIRQEFDKLREFLRVEEQAILDAMAEETRQKQLLADEKMKQLTEETEVLAHEIER LQMEMKEDDVSFLMKHKSRKRRLFCTMEPEPVQPGMLIDVCKYLGSLQYRVWKKMLASVE SVPFSFDPNTAAGWLSVSDDLTSVTNHGYRVQVENPERFSSAPCLLGSRVFSQGSHAWEV ALGGLQSWRVGVVRVRQDSGAEGHSHSCYHDTRSGFWYVCRTQGVEGDHCVTSDPATSPL VLAIPRRLRVELECEEGELSFYDAERHCHLYTFHARFGEVRPYFYLGGARGAGPPEPLRI CPLHISVKEELDG >gi568815590r:27187553_27411235|GENSCAN_predicted_CDS_2|1482_bp atggagcggagtcccgacgtgtcccccgggccttcccgctccttcaaggaggagttgctc tgcgccgtctgctacgaccccttccgcgacgcagtcactctgcgctgcggccacaacttc tgccgcgggtgcgtgagccgctgctgggaggtgcaggtgtcgcccacctgcccagtgtgc aaagaccgcgcgtcacccgccgacctgcgcaccaaccacaccctcaacaacctggtggag aagctgctgcgcgaggaggccgagggcgcgcgctggaccagctaccgcttctcgcgtgtc tgccgcctgcaccgcggacagctcagcctcttctgcctcgaggacaaggagctgctgtgc tgctcctgccaggccgacccccgacaccaggggcaccgcgtgcagccggtgaaggacact gcccacgactttcgggccaagtgcaggaacatggagcatgcactgcgggagaaggccaag gccttctgggccatgcggcgctcctatgaggccatcgccaagcacaatcaggtggaggct gcatggctggaaggccggatccggcaggagtttgataagcttcgcgagttcttgagagtg gaggagcaggccattctggatgccatggccgaggagacaaggcagaagcaacttctggcc gacgagaagatgaagcagctcacagaggagacggaggtgctggcacatgagatcgagcgg ctgcagatggagatgaaggaggacgacgtttcttttctcatgaaacacaagagccgaaaa cgccgactcttctgcaccatggagccagagccagtccagcccggcatgcttatcgatgtc tgcaagtacctgggctccctgcagtaccgcgtctggaagaagatgcttgcatctgtggaa tctgtacccttcagctttgaccccaacaccgcagctggctggctctccgtgtctgacgac ctcaccagcgtcaccaaccatggctaccgcgtgcaggtggagaacccggaacgcttctcc tcggcgccctgcctgctgggctcccgtgtcttctcacagggctcgcacgcctgggaggtg gcccttggggggctgcagagctggagggtgggcgtggtacgtgtgcgccaggactcgggc gctgagggccactcacacagctgctaccacgacacacgctcgggcttctggtatgtctgc cgcacgcagggcgtggagggggaccactgcgtgacctcggacccagccacgtcgcccctg gtcctggccatcccacgccgcctgcgtgtggagctggagtgtgaggagggcgagctgtct ttctatgacgcggagcgccactgccacctgtacaccttccacgcccgctttggggaggtt cgcccctacttctacctggggggtgcacggggcgccgggcctccagagcctttgcgcatc tgccccttgcacatcagtgtcaaggaagaactggatggctga >gi568815590r:27187553_27411235|GENSCAN_predicted_peptide_3|171_aa MLANTAKLGNGKACLPQVLAIADLGFSHTNWLPHKSSNHLSAGKSPLVQGWIVPCTFPWQ PSLNHPVSHTALICHVPSTQHFGGNLGDYTAVSRSEGGQLVFTTVYEVASPIPGLQKGKG SLDKASVKVHRTTVISSKIYLVTLMNKVDYVCSARMPVSEDAAASPVADFI >gi568815590r:27187553_27411235|GENSCAN_predicted_CDS_3|516_bp atgttagcgaacacagccaagctaggaaacgggaaggcttgcctcccgcaggtattggcc attgctgatcttggttttagtcacactaactggctgcctcataaaagcagcaaccacctt agtgctgggaaaagccctctagtgcagggttggatagtaccctgcaccttcccctggcag ccctccctcaaccacccggtatcccacactgccctgatttgtcacgttcccagcactcag cattttggaggtaatcttggtgactacacagcggtcagcaggtcagaaggtggtcagctt gtcttcacaaccgtctatgaagttgctagtcctattcctggtttgcagaaaggaaaagga agcttggataaggcctcagtaaaggtacatcggactacagtgatttcctcaaagatttac ttggtaactctgatgaacaaggtcgattatgtctgcagtgccaggatgcctgtttccgaa gatgctgctgccagccctgtggctgactttatttag >gi568815590r:27187553_27411235|GENSCAN_predicted_peptide_4|204_aa MSCAITFTHTHSQIPRATFGSASSIHSLHAKGLTSEKQQQIETPGSPRPSTGPGHLRSPY MPVKGSCDYKRVEDGGDNEEKGVDEGRCPKVESTGIDSQLEEKRILPTIFSRQLDVSEAT HQVRMRRVDKGERRKPSFTTRDPISFKAGGWEMMLLIAVKGDGRHGVAPCTGDLQENEFP ILFCCQLLVTSLSYKIMRRPLEMM >gi568815590r:27187553_27411235|GENSCAN_predicted_CDS_4|615_bp atgtcctgtgccatcacattcactcacactcactcgcagattccgagagcaactttcggt tctgcaagctccattcactctctccatgccaaagggctcacttctgagaaacagcagcag atagaaacccctggatcccccaggcccagcacagggcctggtcatctcaggtccccctac atgccagtgaagggatcctgtgattataagagggtagaagatgggggcgataatgaggaa aaaggagtagatgaaggaagatgtcccaaggtggaatccacaggaattgacagccagtta gaagaaaagcgaatattgcctacaatcttctcccgtcaattggacgtcagtgaggccact catcaagtcaggatgagaagagttgacaaaggagagagaagaaagcccagtttcacaaca agagacccaatttccttcaaagctggtggttgggaaatgatgctcctgatagcggttaag ggagatggcaggcacggggtggccccctgcactggagatttgcaagaaaatgaatttccc attctcttctgctgtcagttgctggttacctccctatcctacaagatcatgagaagaccc ctggaaatgatgtga >gi568815590r:27187553_27411235|GENSCAN_predicted_peptide_5|78_aa MSGVSEPLSRVKLGTLRRPEGPAEPMVVVPVDVEKEDVRILKVCFYSNSFNPGKNFKLVK CTVQTEIRAKCHLLIANS >gi568815590r:27187553_27411235|GENSCAN_predicted_CDS_5|237_bp atgtctggggtgtccgagcccctgagtcgagtaaagttgggcacgttacgccggcctgaa ggccctgcagagcccatggtggtggtaccagtagatgtggaaaaggaggacgtgcgtatc ctcaaggtctgcttctatagcaacagcttcaatcctgggaaaaacttcaaactggtcaaa tgcactgtccagacggagatccgggcaaaatgccacctgcttattgcaaatagctga >gi568815590r:27187553_27411235|GENSCAN_predicted_peptide_6|83_aa MPSPSPNHRENHDNVPTIKGPYTFPAANFQEVWNGSWGEELTHTFVLAAITTTAIIITTI TTTVIITIIITITITIPMAIILI >gi568815590r:27187553_27411235|GENSCAN_predicted_CDS_6|252_bp atgccaagccccagccccaatcatcgtgagaaccatgacaatgtgccaactataaagggc ccctacacatttcccgctgccaacttccaggaggtgtggaacggctcctggggagaagag ctgactcacacattcgtgctagctgcaatcaccactactgccatcatcatcactaccatc accactactgtcatcatcaccatcatcatcaccatcaccatcactatccccatggctata atcttaatttaa