GENSCAN 1.0 Date run: 4-Nov-116 Time: 19:11:26 Sequence gi568815586f:5394215_5594985 : 200771 bp : 44.61% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.07 PlyA - 105 100 6 1.05 1.06 Term - 37071 36956 116 1 2 93 39 82 0.686 2.53 1.05 Intr - 37325 37231 95 1 2 84 75 94 0.769 7.31 1.04 Intr - 38149 37938 212 1 2 45 62 155 0.784 6.31 1.03 Intr - 39094 38825 270 1 0 36 73 192 0.357 10.24 1.02 Intr - 52744 52643 102 1 0 77 63 79 0.347 4.67 1.01 Init - 57843 57826 18 0 0 76 98 5 0.150 0.34 1.00 Prom - 58307 58268 40 -2.86 2.03 PlyA - 59276 59271 6 1.05 2.02 Term - 61175 61064 112 1 1 73 39 140 0.835 5.63 2.01 Init - 67434 67382 53 0 2 70 79 39 0.217 1.86 2.00 Prom - 72484 72445 40 -1.76 3.00 Prom + 84211 84250 40 -1.36 3.01 Sngl + 91713 92045 333 2 0 91 44 148 0.640 6.73 3.02 PlyA + 94193 94198 6 1.05 4.00 Prom + 94393 94432 40 -4.66 4.01 Init + 96591 96706 116 2 2 109 69 88 0.945 8.68 4.02 Intr + 99253 99370 118 1 1 29 14 101 0.633 -2.73 4.03 Term + 99980 100774 795 1 0 128 38 829 0.908 75.18 4.04 PlyA + 100922 100927 6 1.05 5.15 PlyA - 101545 101540 6 1.05 5.14 Term - 116150 116047 104 0 2 124 43 91 0.695 6.64 5.13 Intr - 124715 124592 124 2 1 39 48 107 0.429 1.96 5.12 Intr - 126742 126582 161 2 2 98 52 78 0.508 4.91 5.11 Intr - 140543 140498 46 2 1 120 39 29 0.086 -0.82 5.10 Intr - 148089 148004 86 1 2 59 85 82 0.559 4.54 5.09 Intr - 151813 151716 98 0 2 21 116 48 0.049 0.55 5.08 Intr - 154188 154075 114 2 0 69 81 34 0.014 0.36 5.07 Intr - 160377 160299 79 1 1 96 69 39 0.014 1.41 5.06 Intr - 169354 169186 169 1 1 119 59 328 0.579 32.52 5.05 Intr - 171449 171344 106 1 1 95 109 74 0.991 10.42 5.04 Intr - 181801 181620 182 1 2 104 103 253 0.983 27.07 5.03 Intr - 183793 183741 53 2 2 107 111 54 0.999 8.23 5.02 Intr - 184304 184152 153 0 0 92 96 198 0.951 21.04 5.01 Init - 184532 184478 55 2 1 100 80 5 0.935 2.35 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586f:5394215_5594985|GENSCAN_predicted_peptide_1|270_aa MTGKCQIIVVSLPGLLSSATIRFSHQVGVAHSKHYVAKLHVSRPPTPPLKVREQAGRYRK EIPWDGRARVPAAPRRGPRCAGCVRSSNICAKAQGGVATGWALAKRSPGREVFSEKTLSG NSNAGTRRVAVPAASPYRGKSNHGIRVAVSRDVDMKRKVDASSVIQSLPHPAAATTQQGK KERKKERREKKVVAPGCVAGRTNTPKDNRSPANSPLPGTRTQDAVSAVKGGQGWGPAANH SPSTHQSARGDSARSRKGEQQTNLFVAPRK >gi568815586f:5394215_5594985|GENSCAN_predicted_CDS_1|813_bp atgactggaaaatgccagatcatagttgtctcactacctggcctgctctcttcggctacc atccggttctcgcatcaggtcggcgtggctcattccaagcattatgttgctaaactacac gtctcccggcctcccacgccaccgctcaaagtccgagaacaggctggacgataccgcaag gagatcccctgggatggccgggcccgggtgcccgccgctccccggaggggtcccagatgt gcgggctgcgtacggagctccaacatctgcgcaaaggcccagggtggggtggccaccggg tgggcgctggccaagaggagcccggggcgcgaggtcttttctgaaaaaaccctttcgggg aactccaacgcaggtacgcggcgcgtggcagtgcccgccgcctccccttaccgtggcaaa agtaaccatggcatccgtgtggccgtttccagggacgtcgacatgaagagaaaggtggac gcgagctcggtcattcaaagtctcccccaccccgccgcagccactacccagcaggggaaa aaggagaggaagaaagaaaggagagagaagaaagtagttgcgccgggctgcgtcgcgggg cgcacgaacacgcccaaggacaatcgctccccggcgaactccccgctcccggggacccgc acccaggacgctgtgtcggcggtcaagggaggtcaaggatggggccctgcagctaatcac tcgcccagcacccaccagtctgcacgcggggacagcgcgagaagcaggaaaggcgagcaa caaacaaacctgttcgtggcgccccgaaagtaa >gi568815586f:5394215_5594985|GENSCAN_predicted_peptide_2|54_aa MKNEVKERQVYEDMAGGCSQHLTQAVMGYAQEAWHKPPRADNDRRQMYAIVKEY >gi568815586f:5394215_5594985|GENSCAN_predicted_CDS_2|165_bp atgaagaatgaagtcaaggagaggcaggtgtatgaggacatggcgggagggtgctcacaa cacctaacccaggcagtcatgggttacgctcaagaagcttggcataagcctcccagagct gacaacgatcgaagacagatgtatgccatcgtcaaagaatactaa >gi568815586f:5394215_5594985|GENSCAN_predicted_peptide_3|110_aa MAATGNRGERHLEEQKTVPAVSLQTPWFSGEGIQTRPPINIAADRQSIYLILVKGGCLCG SRETGGAIFDTSLLAKSKGVSLPSDHTFCSQVLWGPNQYGLSLSPTCCYG >gi568815586f:5394215_5594985|GENSCAN_predicted_CDS_3|333_bp atggcggccacagggaacagaggggagagacacctggaggagcaaaagacagtcccagct gtcagtcttcagacaccatggttttctggagaagggatccagaccagaccgccaatcaat atagccgcagaccgccagtcaatatacctcatccttgtgaaaggcggttgtctgtgtggc agcagggagacaggaggggccatatttgacacaagcctgctggccaagtctaaaggagtc agcctgccatctgaccacactttctgcagccaagtcctctgggggccaaatcagtatggt ttgagtttatccccgacttgctgctatggttga >gi568815586f:5394215_5594985|GENSCAN_predicted_peptide_4|342_aa MASGGKPCGDRVGKDCCEPDVEEGALQTLGPSPVCVKGSAVAELRFVESVPGDGADQLVM RMYQKARQSTLERRPSRGILQVNKVMSILFYVIFLAYLRGIQGNNMDQRSLPEDSLNSLI IKLIQADILKNKLSKQMVDVKENYQSTLPKAEAPREPERGGPAKSAFQPVIAMDTELLRQ QRRYNSPRVLLSDSTPLEPPPLYLMEDYVGSPVVANRTSRRKRYAEHKSHRGEYSVCDSE SLWVTDKSSAIDIRGHQVTVLGEIKTGNSPVKQYFYETRCKEARPVKNGCRGIDDKHWNS QCKTSQTYVRALTSENNKLVGWRWIRIDTSCVCALSRKIGRT >gi568815586f:5394215_5594985|GENSCAN_predicted_CDS_4|1029_bp atggcatctgggggaaagccttgtggggaccgtgttggtaaagattgctgtgagccagat gtagaggagggagctctccagactctgggtccctcgcccgtgtgcgtcaagggcagtgct gtggcagagctgcgttttgtggagagcgtccccggggatggagcagatcagttggtgatg cgtatgtatcagaaagctcggcagagcaccctggaacgtaggccctctcgcggaatctta caggtgaacaaggtgatgtccatcttgttttatgtgatatttctcgcttatctccgtggc atccaaggtaacaacatggatcaaaggagtttgccagaagactcgctcaattccctcatt attaagctgatccaggcagatattttgaaaaacaagctctccaagcagatggtggacgtt aaggaaaattaccagagcaccctgcccaaagctgaggctccccgagagccggagcgggga gggcccgccaagtcagcattccagccggtgattgcaatggacaccgaactgctgcgacaa cagagacgctacaactcaccgcgggtcctgctgagcgacagcacccccttggagcccccg cccttgtatctcatggaggattacgtgggcagccccgtggtggcgaacagaacatcacgg cggaaacggtacgcggagcataagagtcaccgaggggagtactcggtatgtgacagtgag agtctgtgggtgaccgacaagtcatcggccatcgacattcggggacaccaggtcacggtg ctgggggagatcaaaacgggcaactctcccgtcaaacaatatttttatgaaacgcgatgt aaggaagccaggccggtcaaaaacggttgcaggggtattgatgataaacactggaactct cagtgcaaaacatcccaaacctacgtccgagcactgacttcagagaacaataaactcgtg ggctggcggtggatacggatagacacgtcctgtgtgtgtgccttgtcgagaaaaatcgga agaacatga >gi568815586f:5394215_5594985|GENSCAN_predicted_peptide_5|509_aa MPNPELCSQVMALTNRPAVIQFGFVTLFVASFPLAPVFALLNNVIEVRLDAKKFVTELRR PDAVRTKDIGIWFDILSGIGKFSVISNAFVIAITSDFIPRLVYQYSYSHNGTLHGFVNHT LSFFNVSQLKEGTQPENSQFDQEVQFCRFKDYREPPWAPNPYEFSKQYWFILSARLAFVI IFQNLVMFLSVLVDWMIPDIPTDISDQIKKEKSLLVDFFLKEEHEKLKLMDEPALRSPGD TPVKFFSTYVDSRSTALFHVRYVKSRLGHSLLTHSLLFKLLPPTPCCFAWALAASEQSSH SMERDQAVVLVLAPLLKATSGLPGSLCSSHPFIPGKDAEKALGKIQHPFMIKILSTIGIE GTYLKAFGGPQMDIWLQGLKTLAASPWKTDHDLCEDTVPEAPACLRSSQFEEEPTAKHMD LISCLQWNSLQTEKIYFPPFSICSVLPADPFGPHQQAPVPAGMQLVQPMQNASRRLSSYE VLGPVQALERQRLNSHNSHVLLITFPLLY >gi568815586f:5394215_5594985|GENSCAN_predicted_CDS_5|1530_bp atgcctaatccagagctttgctcccaagtgatggcattgaccaacaggcctgcagtcatc cagtttggttttgtcaccctcttcgtggcctcctttcccctggcacctgtgtttgccctc ctcaacaacgtcattgaagtgcggctcgatgcaaagaagtttgttacagagctgagacgg ccggatgctgtaagaaccaaagatatcggaatctggtttgacattctctctggaattggc aagttctctgttatcagcaacgcttttgtcattgcgatcacctccgactttatcccccgc ctggtgtaccagtactcctacagtcacaatgggactctgcacggctttgtcaaccacacc ctctcctttttcaacgtcagccagctgaaggaggggacgcagccagaaaactcacagttt gaccaggaggttcagttctgcaggtttaaggattaccgagagccgccatgggccccgaac ccttatgagttttcgaaacagtactggtttattctgtccgcccgtctggcttttgtcata atcttccagaacctcgtgatgttcctgagcgtcctcgtggactggatgattccagacatc cccacggacatcagcgaccagatcaagaaagagaagagcttattagtggatttcttcctg aaagaggagcatgagaagctcaagctgatggatgagccggctctgaggagcccaggagat acccctgtaaagttcttctccacgtatgtggacagcagaagtacagcacttttccatgta cgttatgtcaaaagcagactcggacacagccttctcacacattccctgctcttcaagctt cttccaccaaccccctgctgctttgcctgggcactggcagcttcagagcagagctcccac tcaatggaaagggatcaagcagtagtactggtgctggctcctctcctcaaggccacatca gggctgcctggtagcctctgctccagtcatcccttcattccagggaaagatgcagaaaaa gcacttggcaaaatccagcatccttttatgattaaaattctcagcacaatcggcatagaa gggacatacctcaaggcgtttggtggccctcagatggacatatggcttcaaggactcaag actcttgcagcctccccatggaaaacagaccatgacttatgtgaagacacagtccccgag gcacctgcttgtctacgaagcagccagtttgaggaagaacctacagccaaacacatggac cttataagctgtttgcagtggaattctctgcagactgagaagatctactttccacccttc tccatctgctcggtgctgccggctgacccatttggaccacaccaacaggccccagtgccc gctggcatgcaattggtccagcccatgcagaatgccagcaggagattgagcagttacgaa gtgctaggccctgtacaagcacttgagcggcaacgactcaattcccacaactcccacgtg ctgctcatcacttttcctctgctctactga