GENSCAN 1.0 Date run: 3-Nov-116 Time: 16:23:13 Sequence gi568815581f:32250286_32469756 : 219471 bp : 45.09% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 8522 8646 125 1 2 63 53 238 0.932 15.47 1.02 Intr + 9298 9334 37 1 1 108 86 30 0.973 3.06 1.03 Intr + 12268 12478 211 0 1 82 65 111 0.180 6.69 1.04 Intr + 15867 16015 149 1 2 -77 86 289 0.461 12.15 1.05 Intr + 17617 17640 24 0 0 81 91 23 0.228 0.12 1.06 Intr + 27489 27674 186 2 0 46 81 57 0.054 0.69 1.07 Intr + 31870 31965 96 0 0 49 81 68 0.724 2.41 1.08 Intr + 34374 34532 159 2 0 110 75 220 0.990 23.08 1.09 Intr + 38507 38743 237 1 0 114 35 377 0.993 32.81 1.10 Intr + 44009 44157 149 1 2 100 96 82 0.871 9.23 1.11 Intr + 47807 47919 113 2 2 93 96 165 0.943 17.82 1.12 Intr + 55056 55156 101 1 2 85 89 78 0.122 7.43 1.13 Intr + 64616 64708 93 1 0 89 65 32 0.027 1.16 1.14 Intr + 65947 66007 61 0 1 100 86 74 0.043 6.71 1.15 Term + 70673 70944 272 0 2 141 55 422 0.997 39.85 1.16 PlyA + 71623 71628 6 1.05 2.10 PlyA - 77544 77539 6 1.05 2.09 Term - 81603 81478 126 0 0 26 43 121 0.518 -0.32 2.08 Intr - 83235 83132 104 1 2 60 86 22 0.785 -0.91 2.07 Intr - 84320 84184 137 1 2 48 99 137 0.898 11.01 2.06 Intr - 84554 84490 65 2 2 85 103 45 0.925 3.22 2.05 Intr - 88066 87961 106 0 1 115 14 86 0.715 4.12 2.04 Intr - 89653 89528 126 0 0 8 106 82 0.697 1.79 2.03 Intr - 90999 90919 81 2 0 55 98 50 0.668 1.45 2.02 Intr - 91998 91715 284 0 2 31 53 228 0.464 9.82 2.01 Init - 93626 93420 207 2 0 40 78 64 0.349 -0.50 2.00 Prom - 94818 94779 40 -4.56 3.00 Prom + 105213 105252 40 -8.16 3.01 Init + 105456 105512 57 2 0 48 64 40 0.462 -1.09 3.02 Intr + 108218 108356 139 1 1 45 94 140 0.972 10.34 3.03 Intr + 110313 110480 168 1 0 57 69 111 0.924 6.02 3.04 Intr + 115045 115202 158 2 2 122 110 41 0.996 9.43 3.05 Intr + 117487 117694 208 0 1 52 100 112 0.612 7.45 3.06 Intr + 122517 122628 112 1 1 48 96 94 0.454 5.64 3.07 Term + 138351 138375 25 0 1 106 47 35 0.338 -1.10 3.08 PlyA + 142668 142673 6 1.05 4.00 Prom + 158547 158586 40 -2.36 4.01 Init + 178702 178837 136 0 1 73 38 156 0.509 9.40 4.02 Intr + 196660 196761 102 2 0 52 111 95 0.993 8.35 4.03 Intr + 204210 204334 125 1 2 95 107 73 0.998 10.20 4.04 Intr + 213764 213835 72 1 0 113 92 17 0.938 4.20 4.05 Intr + 214236 214293 58 2 1 129 106 36 0.995 7.96 4.06 Intr + 218714 218908 195 0 0 92 59 253 0.603 22.19 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 55056 55225 170 1 2 85 36 134 0.806 5.94 S.002 Init + 65959 66007 49 0 1 73 86 119 0.953 9.57 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581f:32250286_32469756|GENSCAN_predicted_peptide_1|670_aa MELQDVLLLLLLMLMLLLRPHHQKACLVDPTSFFAAAPHWSSLLDTLVPGSVQWPFILLC VDFKCLVRNLKRGHGVAHLQNEEQGRSANRPSSVSLLISAKGSPREGPLHGTPPGIDECL PAAAAPQPPPPPDPVSAMGEHPSPGPAVAACAEAERIEELEPEAEERLPAAPEDHWKVLF DQSVPSSVTVKKYYDQRSNKFEECWAEQNQTGFSSLKPCRVLDMVRYDPPRGFSTCRTTE PLFQIFTEQQVGEFPEILGVVNQTDVIHETWHFGLKFDPGNTGYISTGKFRSLLESHSSK LDPHKREVLLALADSHADGQIGYQDFVSLMSNKRSNSFRQAILQGNRRLSSKALLEEKGL SLSQRLIRHVAYETLPREIDRKWYYDSYTCCPPPWFMITVTLLEARTRVAFFLYNGVSLG QFVLQVTHPRYLKNSLVYHPQLRAQVWRYLTYIFMHAGIEHLGLNVVLQLLVGVPLEMVH GATRIGLVYVAGVVAGSLAVSVADMTAPVVGSSGGVYALVSAHLANIVMHGTCRQAKTGF PPKAGPQVAHTSLSVKKGLLNWSGMKCQFKLLRMAVALICMSMEFGRAVWLRFHPSAYPP CPHPSFVAHLGGVAVGITLGVVVLRNYEQRLQDQSLWWIFVAMYTVFVLFAVFWNIFAYT LLDLKLPPPP >gi568815581f:32250286_32469756|GENSCAN_predicted_CDS_1|2013_bp atggagctccaggatgtcttgctgctgctgctgctgatgctgatgctgctgctgcggcct caccatcaaaaagcctgcctcgtggaccccacgtcattcttcgctgcagcccctcactgg agcagtctgttggacactctggtgcctggttctgtgcagtggcccttcatcctgctctgc gttgacttcaagtgtcttgttaggaaccttaaaagaggccatggtgtagcgcacttgcaa aatgaagagcaaggaagaagtgccaataggcccagttccgtgtcacttctcattagcgcc aaaggttctcccagggaggggcctctgcatggcacacctccaggcatcgacgagtgcctg cctgctgcagctgccccgcagccgccgccgcccccggaccccgtctcggccatgggcgag caccccagcccgggccccgcggtggccgcctgcgccgaggcggagcgcatcgaggagctg gaacccgaggccgaggagcggctgcccgcggcgccggaggaccactggaaagtcctgttt gatcagagtgttccatcaagtgttactgtgaaaaaatactacgaccaaaggtcaaataag tttgaagaatgctgggctgaacaaaatcagacaggtttctccagcctaaaaccttgcaga gtccttgatatggtgcgttatgaccctccaagaggtttttccacttgtaggaccacagaa cccttatttcagatctttactgagcaacaagtaggtgagtttcctgaaatacttggtgtt gtgaaccaaacagatgtcatccatgaaacctggcattttggcttgaagtttgaccctggg aacacaggctacattagcacaggcaagttccggagtcttctggagagccacagctccaag ctggacccgcacaaaagggaggtcctcctggctcttgccgacagccacgcggatgggcag atcggctaccaggattttgtcagcctaatgagcaacaagcgttccaacagcttccgccaa gccatcctgcagggcaaccgcaggctaagcagcaaggccctgctggaggagaaggggctg agcctctcgcagcgacttatccgccatgtggcctatgagaccctgccccgggaaattgac cgcaagtggtactatgacagctacacctgctgccccccaccctggttcatgatcacagtc acgctgctggaggcaaggacaagggttgcctttttcctctacaatggggtgtcactaggt caatttgtactgcaggtaactcatccacgttacttgaagaactccctggtttaccaccca cagctgcgagcacaggtttggcgctacctgacatacatcttcatgcatgcagggatagaa cacctgggactcaatgtggtgctgcagctgctggtgggggtgcccctggagatggtgcat ggagccacccgaattgggcttgtctacgtggccggtgttgtggcagggtccttggcagtg tctgtggctgacatgaccgctccagtcgtgggctcttctggaggggtgtatgctctcgtc tctgcccatctggccaacattgtcatgcatggaacatgtcggcaagcgaagacaggcttt ccacctaaagcgggaccgcaggtggcccacacttcactttctgtgaagaaggggttgctg aactggtcaggcatgaagtgccagttcaagctgctgcggatggctgtggcccttatctgt atgagcatggagtttgggcgggccgtgtggctccgcttccacccgtcggcctatcccccg tgccctcacccaagctttgtggcgcacttgggtggcgtggccgtgggcatcaccctgggc gtggtggtcctgaggaactacgagcagaggctccaggaccagtcactgtggtggattttt gtggccatgtacaccgtcttcgtgctgttcgctgtcttctggaacatctttgcctacacc ctgctggacttaaagctgccgcctcccccctga >gi568815581f:32250286_32469756|GENSCAN_predicted_peptide_2|411_aa MSFSTILMTVVTQPLQEVLLLKTKIKYISRDYCRSEMPLSCVVRNQNPLLNVVEMRDKET DNVTEWEIEASPCLRTRGNQRNAGVRGPGAGRALIPTAAREQGSGLAETQGRSEAAAMLP SLQESMDGDEKELESSEEGGSAEERRLEPPSSSHYCLYSYRGSRLAQQRGDSEDGSPSGT NAETPSGDDFSLSLADTNLPSEVEPELRSFIAKRLSRGAVFEGLGNVASVELKIPGYRVG CYYCLFQNEKLLPETVTIDSERNPSEYVALSYTPVEVKESDEKTKRDINRFLSVASLQGL IHEGTMTSLCMAMTEEQHKSVVIDCSSSQPQFCNAGSNRFCEDWMQAFLNGAKGGNPFLF RQVLENFKLKNCGSGDILLKIVKVEHEEMPEAKNVIAVLEEFMKEALDQSF >gi568815581f:32250286_32469756|GENSCAN_predicted_CDS_2|1236_bp atgagtttttctaccatcctcatgactgtggtaactcaacctctccaggaagttcttctg cttaaaactaaaatcaagtatatctcaagagactactgccgatctgaaatgcctctgtca tgtgttgtccgcaatcagaatcctcttttaaatgtagttgaaatgagagataaggaaact gacaatgttactgaatgggagatagaggccagtccttgcctgcgaacccggggaaaccag cggaacgcaggtgtgcgagggccgggagccggacgagccctgataccgacggctgcacgg gagcaggggagcggtttggcggagacacagggccgctcagaggccgccgcaatgctcccc tctttgcaggagtcgatggatggagatgaaaaggaactagagagcagcgaagagggaggc tcagccgaggagcggagactcgagccgccgtccagcagccactactgtctttacagctat cgcggaagcagattggcacagcaacgaggggacagtgaggacggaagcccaagtggcaca aatgcagaaactccctctggtgatgatttcagcctctccttggcagatactaatctacca tccgaagtggagccagagctgcgcagtttcattgctaagcgtctttcaagaggtgcagtc tttgaagggctgggtaatgttgcatctgtggagctaaaaattccaggttaccgagttggt tgttattactgccttttccaaaatgaaaaactgcttcctgaaacagtaacgatagactct gaacgtaacccttcagaatatgtggctttgagttacactcctgttgaagttaaagaatca gatgaaaaaacaaagagagacattaacaggtttctgagtgtggccagtcttcaaggactt attcatgaaggcaccatgacttctttgtgcatggccatgacagaggagcagcataagtct gtggtcatcgattgcagcagctcccagcctcagttctgcaatgcaggaagtaaccggttt tgtgaggattggatgcaagcttttttaaatggtgccaaaggaggtaacccttttcttttc cgacaagtactggagaactttaaactaaagaactgtggtagtggagatatacttttgaag attgttaaagtggaacatgaagaaatgcctgaagccaaaaatgtgatagctgtccttgaa gaattcatgaaagaagctcttgaccaaagtttttga >gi568815581f:32250286_32469756|GENSCAN_predicted_peptide_3|288_aa MKGRKTKSSALEVNDVFQGVHKETIDAVPNAIPGRTDIELEIYGMEGIPEKDMDERRRLL EQKTQESQKKKQQDDSDEYDDDDSAASTSFQPQPVQPQQGYIPPMAQPGLPPVPGAPGMP PGMPPPVPRPGIPPMTQAQAVSAPGILNRPPAPTATVPAPQPPVTKPLFPSAGQAQAAVQ GPVGTDFKPLNSTPATTTEPPKPTFPAYTQSTASTTSTTNSTAAKPAASITSKPATLTTT SATIVDEVESKWMPTALTEDNFWDLIIQAGIVIVIIKEAVRLQLTMGN >gi568815581f:32250286_32469756|GENSCAN_predicted_CDS_3|867_bp atgaaaggtaggaaaaccaagagtagtgccctggaagtcaatgacgtgtttcaaggagta cataaagaaacaatagatgccgtaccaaatgcaatacctggaagaacagacatagagttg gaaatatatggtatggaaggtattccagaaaaagacatggatgaaagacgacgacttctt gaacagaaaacacaagaaagtcaaaaaaagaagcaacaagatgattctgatgaatatgat gatgacgactctgcagcctcaacttcatttcagccacagcctgttcaacctcagcaaggt tatattcctccaatggcacagccaggactgccaccagtaccaggagcaccaggaatgcct ccaggtatgcccccacctgttccacgtcctggaattcctccaatgactcaagcacaggct gtttcagcgccaggtattcttaatagaccacctgcaccaacagcaactgtacctgcccca cagcctccagttactaagcctcttttccccagtgctggacaggctcaggcagctgtccaa ggacctgttggtacagatttcaaacccttaaatagtacccctgcaacaactacagaaccc ccaaagcctacattccctgcttatacacagtctacagcttcaacaactagtacaacaaat agtactgcagctaaaccagcggcttcaataacaagtaagcctgctacacttacaacaact agtgcaaccatagtagatgaagttgagtcaaagtggatgccaactgccctcacagaagac aacttttgggacttgataattcaggcaggcattgtgattgtcataattaaggaagctgta agactgcagctgaccatgggtaactga >gi568815581f:32250286_32469756|GENSCAN_predicted_peptide_4|230_aa MRESLELPRDLLNGCDQNADSDMDNDVQAKVVSDADEELTGNWSEVKRDIQENDEEAVQV KEQSILELGSLLAKTGQAAELGGLLKYVRPFLNSISKAKAARLVRSLLDLFLDMEAATGQ EVELCLECIEWAKSEKRTFLRQALEARLVSLYFDTKRYQEALHLGSQLLRELKKMDDKAL LVEVQLLESKTYHALSNLPKARAALTSARTTANAIYCPPKLQATLDMQSX >gi568815581f:32250286_32469756|GENSCAN_predicted_CDS_4|690_bp atgagggaaagtttggaacttcctagagacttactgaatggttgtgaccaaaatgctgat agtgatatggacaatgatgtccaggccaaggtggtctcagatgcagatgaggaacttact ggaaactggagtgaagtgaagcgtgacattcaggaaaacgatgaagaggcagtgcaagtc aaagagcagagcatcctggaactgggatctctcctggcaaagactggacaagctgcagag cttggaggactcctgaagtatgtacgacccttcttgaattccatcagcaaggctaaagca gctcgcctggtccgatctcttcttgatctgtttcttgatatggaagcagctacagggcag gaggtcgagctgtgtttagagtgcatcgaatgggccaagtcagagaaaagaactttctta cgccaagctttggaggcaagactggtgtctttgtactttgataccaagaggtaccaggaa gcattgcatttgggttctcagctgctgcgggagttgaaaaagatggacgacaaagctctt ttggtggaagtacagcttttagaaagcaaaacataccatgccctgagcaacctgccgaaa gcccgagctgccttaacttctgctcgaaccacagcaaatgccatctactgcccccctaaa ttgcaggccaccttggacatgcagtcggnn