GENSCAN 1.0    Date run: 3-Nov-116    Time: 04:44:54

Sequence gi568815591f:94564609_94765590 : 200982 bp : 36.95% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.02 Intr -    881    778  104  2  2   63  103   114 0.189   8.45
 1.01 Init -  10040   9699  342  2  0   48   29   240 0.162  11.38
 1.00 Prom -  13198  13159   40                              -6.15

 2.10 PlyA -  13669  13664    6                               1.05
 2.09 Term -  19766  19658  109  1  1   77   41    77 0.057  -1.10
 2.08 Intr -  34355  34167  189  1  0   57   92   106 0.332   5.68
 2.07 Intr -  36249  36038  212  0  2   67   67   134 0.952   5.99
 2.06 Intr -  38844  38682  163  2  1   60  111   107 0.787   9.26
 2.05 Intr -  54348  54150  199  1  1   23   86   144 0.159   5.19
 2.04 Intr -  61401  61257  145  0  1   25   87    90 0.876   1.53
 2.03 Intr -  63751  63594  158  2  2   97   76    90 0.977   7.51
 2.02 Intr -  65233  65111  123  2  0  120   84    66 0.984   9.04
 2.01 Init -  74754  74733   22  0  1   77  101    12 0.344   1.43
 2.00 Prom -  80111  80072   40                              -4.45

 3.00 Prom +  85301  85340   40                              -5.05
 3.01 Sngl +  92064  92357  294  2  0   58   45   327 0.613  20.85
 3.02 PlyA +  92719  92724    6                               1.05

 4.00 Prom +  95121  95160   40                              -6.05
 4.01 Init +  98949  99852  904  2  1  102   28  1230 0.797 112.82
 4.02 Term +  99882 101074 1193  1  2   33   36  1136 0.906  93.79
 4.03 PlyA + 101296 101301    6                               1.05

 5.04 PlyA - 103689 103684    6                               1.05
 5.03 Term - 114745 114574  172  0  1   64   47   152 0.914   5.02
 5.02 Intr - 123721 123544  178  2  1   46   59   130 0.611   3.96
 5.01 Init - 124741 124672   70  1  1   25   42   100 0.475   0.36
 5.00 Prom - 129104 129065   40                              -3.15

 6.03 PlyA - 129219 129214    6                               1.05
 6.02 Term - 130573 130417  157  0  1   46   39   215 0.994   8.82
 6.01 Init - 131209 131007  203  1  2   98   81   119 0.762  10.60
 6.00 Prom - 133085 133046   40                              -8.35

 7.00 Prom + 135480 135519   40                              -5.55
 7.01 Init + 135813 135925  113  2  2   55    8    79 0.489  -3.67
 7.02 Term + 140876 141386  511  2  1   12   41   515 0.758  32.06
 7.03 PlyA + 141690 141695    6                               1.05

 8.02 PlyA - 143052 143047    6                               1.05
 8.01 Sngl - 159512 159108  405  2  0   43   28   356 0.727  21.13
 8.00 Prom - 162833 162794   40                              -3.45

 9.03 PlyA - 162898 162893    6                              -0.45
 9.02 Term - 163227 163084  144  0  0   23   43   119 0.445  -2.17
 9.01 Init - 164043 163897  147  0  0   74   68   144 0.894  11.14
 9.00 Prom - 166631 166592   40                              -7.75

10.00 Prom + 170272 170311   40                              -1.95
10.01 Init + 171901 171997   97  0  1   62   82    76 0.684   4.92
10.02 Intr + 174443 174609  167  0  2   29   98    70 0.507   0.86
10.03 Intr + 174646 174860  215  0  2   27   61   179 0.260   5.69
10.04 Term + 179379 179436   58  0  1   92   45    43 0.108  -3.42
10.05 PlyA + 180015 180020    6                               1.05

11.03 PlyA - 181093 181088    6                               1.05
11.02 Term - 181584 181552   33  0  0  108   44    19 0.153  -3.89
11.01 Init - 187365 187237  129  0  0   78   89    89 0.849   8.20
11.00 Prom - 187488 187449   40                              -3.65

12.02 PlyA - 188247 188242    6                               1.05
12.01 Sngl - 192152 191850  303  2  0   72   54   221 0.833  12.68

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl -  10040   9684  357  2  0   48   43   223 0.835   9.61

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591f:94564609_94765590|GENSCAN_predicted_peptide_1|149_aa
MAASWTNKGEKREDPNKTIRNDDGNVTTDHKEIKLTIRNYYEYLYAHKLENVEEMDKFLD
TYILPRLNQKEIDSLNRLITSSKTESVISSLPTKKSPGPDGFTAKFYQMYKEELMEDEKL
KAVNRHKLSFTATGPFRTLTKRNYTFSGX

>gi568815591f:94564609_94765590|GENSCAN_predicted_CDS_1|447_bp
atggctgctagctggactaataaaggagaaaagagagaagatccaaataaaacaatcaga
aatgatgatgggaatgttaccactgaccacaaagaaataaaactaaccatcagaaactac
tacgaatacctctatgcacacaaactagaaaatgtagaagagatggataaattcttggac
acatatatcctcccaagactgaatcagaaagaaattgattccctgaacagactaataaca
agctccaaaactgaatcagtaataagtagcctaccaaccaaaaaaagccctggacctgat
ggattcacagccaaattctaccagatgtacaaagaagagctgatggaggatgagaagctg
aaggcagttaacagacataagctaagttttacagccacaggaccttttaggacgcttaca
aagaggaactataccttcagtgggtgn

>gi568815591f:94564609_94765590|GENSCAN_predicted_peptide_2|439_aa
MAAGMKDVYSIFSKVHSDRNVYPSAGVLFVHVLEREYFKGEFPPYPKPGEISNDPITFNT
NLMGYPDRPGWLRYIQRTPYSDGVLYGSPTAENVGKPTIIEVDVGVGITTLESSTEYLIQ
LSIPVPYYLAVLLLAICPKEVLANALLEADFPLPYQAEFFIKNMNVEEMLASEVLGDFLG
AVKNVWQPERLNAINITSALDRGGRVPLPINDLKEGVYVMVGADVPFSSCLREVENPQNQ
LRCSQEMEPVITCDKKFRTQFYIDWCKISLVDKTKQVSTYQEVIRGEGILPDGGEYKPPS
DSLKSRDYYTDFLITLAVPSAVALVLFLILAYIMCCRREGVIQLVHHSAIQKSTKELRDM
SKNREIAWPLSTLPVFHPVTGEIIPPLHTDNYDSTNMPLMQTQQSNQLEIVNVLHQCHGL
GGMQCCQGSRGPDFGTDTE

>gi568815591f:94564609_94765590|GENSCAN_predicted_CDS_2|1320_bp
atggctgctgggatgaaagatgtgtacagtattttctccaaggtacactccgatcggaat
gtatacccatcagcaggtgtcctctttgttcatgttttggaaagagaatattttaagggg
gaatttccaccttacccaaaacctggcgagattagtaatgatcccataacatttaataca
aatttaatgggttacccagaccgacctggatggcttcgatatatccaaaggacaccatat
agtgatggagtcctatatgggtccccaacagctgaaaatgtggggaagccaacaatcatt
gaggtggatgtgggagttggcatcactactttggaaagcagtactgaatatctcatacag
ttgagtatccctgtaccttattatctggcagtcctacttctagcaatatgccccaaagaa
gtacttgcaaatgccctcctggaggcagacttcccgttgccatatcaagcagaattcttc
attaagaatatgaatgtagaagaaatgttggccagtgaggttcttggagactttcttggc
gcagtgaaaaatgtgtggcagccagagcgcctgaacgccataaacatcacatcggcccta
gacaggggtggcagggtgccacttcccattaatgacctgaaggagggcgtttatgtcatg
gttggtgcagatgtcccgttttcttcttgtttacgagaagttgaaaatccacagaatcaa
ttgagatgtagtcaagaaatggagcctgtaataacatgtgataaaaaatttcgtactcaa
ttttacattgactggtgcaaaatttcattggttgataaaacaaagcaagtgtccacctat
caggaagtgattcgtggagaggggattttacctgatggtggagaatacaaacccccttct
gattctttgaaaagcagagactattacacggatttcctaattacactggctgtgccctcg
gcagtggcactggtcctttttctaatacttgcttatatcatgtgctgccgacgggaaggc
gtcatccaactggtccatcacagtgctattcagaaatctaccaaggagcttcgagacatg
tccaagaatagagagatagcatggcccctgtcaacgcttcctgtgttccaccctgtgact
ggggaaatcatacctcctttacacacagacaactatgatagcacaaacatgccattgatg
caaacgcagcagagtaatcagcttgagattgtaaatgtattgcatcagtgtcatggccta
ggaggaatgcaatgttgtcaaggctccagaggaccagattttggcacagacacagagtaa

>gi568815591f:94564609_94765590|GENSCAN_predicted_peptide_3|97_aa
MSPRKTCHAFTGCTPKQMGKPSLKGLGFQDQVQIRAVDRDELRTVMSCRRSGAIDRFAIC
SSNALPSAMLYKQISASNALQTGASDALQARFTRATN

>gi568815591f:94564609_94765590|GENSCAN_predicted_CDS_3|294_bp
atgtctccacgcaaaacttgtcacgccttcactggctgtaccccaaaacaaatggggaaa
ccaagccttaaaggcttgggtttccaggaccaagtccagatacgggctgtagacagagat
gagctgcggacagtgatgagctgcagacggtcaggagctatagatcgatttgcaatttgc
tcttcaaatgcgctaccaagtgccatgctttacaaacagataagcgcttcaaatgcgcta
caaactggcgcttcagatgcgctacaagcacgcttcacgcgggcgacaaactga

>gi568815591f:94564609_94765590|GENSCAN_predicted_peptide_4|698_aa
MTERRRDELSEEINNLREKVMKQSEENNNLQSQVQKLTEENTTLREQVEPTPEDEDDDIE
LRGAAAAAAPPPPIEEECPEDLPEKFDGNPDMLAPFMAQCQIFMEKSTRDFSVDRVRVCF
VTSMMTGRAARWASAKLERSHYLMHNYPAFMMEMKHVFEDPQRREVAKRKIRRLRQGMGS
VIDYSNAFQMIAQDLDWNEPALIDQYHEGLSDHIQEELSHLEVAKSLSALIGQCIHIERR
LARAAAARKPRSPPRALVLPHIASHHQVDPTEPVGGARMRLTQEEKERRRKLNLCLYCGT
GGLKVFAGGKLPGPAVEGPSATGPEIIRSPQDDASSPHLQVMLQIHLPGRHTLFVRAMID
SGASGNFIDHEYVAQNGIPLRIKDWPILVEAIDGRPIASGPVVHETHDLIVDLGDHREVL
SFDVTQSPFFPVVLGVRWLSTHDPNITWSTRSIVFDSEYCRYHCRMYSPIPPSLPPPAPQ
PPLYYPVDGYRVYQPVRYYYVQNVYTPVDEHVYPDHRLVDPHIEMIPGAHSIPSGHVYSL
SEPEMAALRDFVARNVKDGLITPTIAPNGAQVLQVKRGWKLQVSYDCRAPNNFTIQNQYP
RLSIPNLEDQAHLATYTEFVPQIPGYQTYPTYAAYPTYPVGFAWYPVGRDGQGRSLYVPV
MITWNPHWYRQPPVPQYPPPQPPPPPPPPPPPPSYSTL

>gi568815591f:94564609_94765590|GENSCAN_predicted_CDS_4|2097_bp
atgaccgaacgaagaagggacgagctctctgaagagatcaacaacttaagagagaaggtc
atgaagcagtcggaggagaacaacaacctgcagagccaggtgcagaagctcacagaggag
aacaccacccttcgagagcaagtggaacccacccctgaggatgaggatgatgacatcgag
ctccgcggtgctgcagcagctgctgccccaccccctccaatagaggaagagtgcccagaa
gacctcccagagaagttcgatggcaacccagacatgctggctcctttcatggcccagtgc
cagatcttcatggaaaagagcaccagggatttctcagttgatcgtgtccgtgtctgcttc
gtgacaagcatgatgaccggccgtgctgcccgttgggcctcagcaaagctggagcgctcc
cactacctgatgcacaactacccagctttcatgatggaaatgaagcatgtctttgaagac
cctcagaggcgagaggttgccaaacgcaagatcagacgcctgcgccaaggcatggggtct
gtcatcgactactccaatgctttccagatgattgcccaggacctggattggaacgagcct
gcgctgattgaccagtaccacgagggcctcagcgaccacattcaggaggagctctcccac
ctcgaggtcgccaagtcgctgtctgctctgattgggcagtgcattcacattgagagaagg
ctggccagggctgctgcagctcgcaagccacgctcgccaccccgggcgctggtgttgcct
cacattgcaagccaccaccaggtagatccaaccgagccggtgggaggtgcccgcatgcgc
ctgacgcaggaagaaaaagaaagacgcagaaagctgaacctgtgcctctactgtggaaca
ggaggcctcaaagtcttcgccggcgggaaactccccggccccgctgtagagggaccttca
gcgaccgggccagaaataataaggtccccacaagatgatgcctcatctccacacttgcaa
gtgatgctccagattcatcttccgggcagacacaccctgttcgtccgagccatgatcgat
tctggtgcttctggcaacttcattgatcacgaatatgttgctcaaaatggaattcctcta
agaatcaaggactggccaatacttgtggaagcaattgatgggcgccccatagcatcgggc
ccagttgtccacgaaactcacgacctgatagttgacctgggagatcaccgagaggtgctg
tcatttgatgtgactcagtctccattcttccctgtcgtcctaggggttcgctggctgagc
acacatgatcccaatatcacatggagcactcgatctatcgtctttgattctgaatactgc
cgctaccactgccggatgtattctccaataccaccatcgctcccaccaccagcaccacaa
ccgccactctattatccagtagatggatacagagtttaccaaccagtgaggtattactat
gtccagaatgtgtacactccagtagatgagcacgtctacccagatcaccgcctggttgac
cctcacatagaaatgatacctggagcacacagtattcccagtggacatgtgtattcactg
tccgaacctgaaatggcagctcttcgagattttgtggcaagaaatgtaaaagatgggcta
attactccaacgattgcacctaatggagcccaagttctccaggtgaagagggggtggaaa
ctgcaagtttcttatgattgccgagctccaaacaattttactatccagaatcagtatcct
cgcctatctattccaaatttagaagaccaagcacacctggcaacgtacactgaattcgta
cctcaaatacctggataccaaacataccccacatatgccgcgtacccgacctacccagta
ggattcgcctggtacccagtgggacgagacggacaaggaagatcactatatgtacctgtg
atgatcacttggaatccacactggtaccgccagcctccggtaccacagtacccgccgcca
cagccgccgcctccaccaccaccaccgccgccgcctccatcttacagtaccctgtaa

>gi568815591f:94564609_94765590|GENSCAN_predicted_peptide_5|139_aa
MRTQLEDAKFEERNEPSPDTESAGITLAGPPEGTGGKPCGSICTMPPPQVHRVHELCGCG
YLQLDFKRCSGKPLCPDRELPQRKKKLTVKQPQADPLYSEEGTVIIGDDGFMHVIAPNDL
PVGHDVEAEDNVINAPDPV

>gi568815591f:94564609_94765590|GENSCAN_predicted_CDS_5|420_bp
atgaggacacagctagaagatgctaaatttgaagaaaggaatgagccctcaccagacact
gaatctgctggtataactttggcgggccctccagaaggcacaggtggtaaaccatgtggc
agtatctgcacaatgccacctccacaggtccacagagttcatgaactgtgtggctgtggc
tatctccagctagatttcaaacgatgctctggaaagcctctgtgtccagacagagaactg
ccacagagaaaaaaaaagttaactgtaaagcagcctcaggcagatccattatattcagaa
gaaggcacagtaatcataggagatgacggcttcatgcatgttattgctcctaatgacctt
ccagtgggacatgatgtggaggcagaagacaatgttattaatgctcctgaccctgtgtag

>gi568815591f:94564609_94765590|GENSCAN_predicted_peptide_6|119_aa
MAVGKNKCFMKGSKKGAKKKVIDQFSKKDWYDVKALAMFNIRNIGKTPSPGPKEPKLHLM
VSRVMCLKKVKMLKKPKFELGKLMELHGDSSSSGKAIGDETGAKVERAEGYEPPVQESV

>gi568815591f:94564609_94765590|GENSCAN_predicted_CDS_6|360_bp
atggcagttggcaagaacaagtgctttatgaaaggcagcaaaaagggagccaagaagaaa
gtgattgatcaattttctaagaaagattggtatgatgtgaaagcacttgctatgttcaat
ataagaaatattggaaagacgccatcaccaggacccaaggaaccaaaattgcatctgatg
gtctcaagggtcatgtgtttgaaaaaagtaaaaatgctgaagaagcccaagtttgaattg
ggaaagctcatggagcttcatggtgacagcagtagttctggaaaagccattggggatgag
acaggtgctaaagttgaacgagctgagggatatgaaccaccagtccaagaatctgtttaa

>gi568815591f:94564609_94765590|GENSCAN_predicted_peptide_7|207_aa
MATTKTKTKTSVRRDVEKLESLHTVDGNAKSCSHYGKEEVWRERRQWEPGLREALAGQLE
FRVGVGLAALHSEQPAGPAAPGNEGLSTRVSGCRGCTGSPSSASPPALRSISHRALAAFL
PVRDRDLQPAMPEPPTPSMGSCAAQASPMSTTPCSTAPSPINHPRAEVCKCTARDWQAAP
PAAPVRDPLGEASWAPESGGDVENLHV

>gi568815591f:94564609_94765590|GENSCAN_predicted_CDS_7|624_bp
atggctaccaccaaaacaaaaacaaaaacgagcgttagaagggatgtggagaaattggaa
tccttgcacacagttgatgggaatgcaaaatcatgcagccactatggaaaagaggaggtg
tggagggagaggcgccagtgggaacccgggctgcgcgaggcgcttgcgggccagctggag
ttccgggtgggcgtgggcttggcggccctgcactcggagcagccagctggccctgccgcc
ccgggcaatgaggggcttagcacccgggtcagcggctgcagagggtgtactgggtcgccc
agcagtgccagcccaccagcgctgcgctcaatttctcaccgggccttagctgccttcctg
ccggtcagggatcgggacctgcagcccgccatgcctgagcctcccaccccctccatgggc
tcctgtgcggcccaagcctccccgatgagcaccaccccctgctccacggcgcccagtccc
atcaaccacccaagggctgaggtgtgcaagtgcacggcacgggactggcaggcagctcca
cctgcagccccggtgcgggatccactgggtgaagccagctgggctcctgagtctggtggg
gacgtggagaaccttcatgtctag

>gi568815591f:94564609_94765590|GENSCAN_predicted_peptide_8|134_aa
MLSAAEQHLQKPAAKTEAEQLVWWRDPITKSWEIGKIITWGRGFACVSPGQNRQPIWIPS
RHLKPYHEPDAEEEIPGGSQGPPGCSHVETDAEEDCSCHEQHPLNTATYLGTDQEAVADG
RRKPEESGTTSHNK

>gi568815591f:94564609_94765590|GENSCAN_predicted_CDS_8|405_bp
atgctatcagcagctgaacagcatctacagaaaccagctgcaaagacagaagcagaacaa
ctggtttggtggagagatccaataacaaaaagttgggaaataggtaaaataataacttgg
ggtagaggttttgcttgtgtttcaccaggccaaaaccggcagccaatttggataccatca
agacacctgaaaccttatcatgagccagatgccgaagaagagattccaggaggatcccaa
ggaccccctggttgcagccatgttgagactgatgctgaggaggactgcagctgtcacgag
caacacccattgaacacagccacctacctggggacagatcaagaagctgttgcagatggc
agaagaaaacctgaggaaagcgggacaaccagtcacaataagtaa

>gi568815591f:94564609_94765590|GENSCAN_predicted_peptide_9|96_aa
MEQVWALVHSTLEPLHSNDEEEGKYNEVTEEVTEQVFLPAKAKVAKEGEGPKEPYTGFIA
RLQESLKKVIADSAAQDIVLRLLAFNNANPECQAAL

>gi568815591f:94564609_94765590|GENSCAN_predicted_CDS_9|291_bp
atggaacaagtgtgggctctggttcattccaccttggaacctttacatagtaatgatgag
gaagaaggaaagtataacgaggtaacagaagaggtgacagagcaggtttttttgccagct
aaagctaaagtggcaaaggagggagagggaccaaaagaaccatacacaggttttatagct
cggttacaggagtctcttaaaaaggtgattgcagattcagctgctcaggatatagtgttg
cggttattagctttcaacaatgctaatcctgagtgccaggctgctctgtga

>gi568815591f:94564609_94765590|GENSCAN_predicted_peptide_10|178_aa
MSKGMFGPECRKANEQKNKKQKQKQIMATAKLEEVKQASIKQIQDTIHLEKLFQLLVQKC
HYYFDVQRNNIAMALEVIYWKWLHRVYKNMMHQKEKEHMINWVEKHMLQRISAQQEKETV
AKCIANKSKAALKEGSSTASSVSVSIPTETTETTDRLNESFQLGTNQRKPNVLSNSVA

>gi568815591f:94564609_94765590|GENSCAN_predicted_CDS_10|537_bp
atgagtaaaggcatgtttggcccagagtgtcgaaaagcaaatgaacaaaaaaacaaaaag
caaaaacaaaaacaaatcatggcaactgcaaagctagaagaggtgaagcaggcttccatc
aaacaaatccaggacacgattcatttggagaagttattccagttattggttcagaagtgc
cattactattttgatgtccagaggaataacattgctatggctttggaggttatttactgg
aaatggctgcatagagtatataagaatatgatgcatcaaaaggaaaaagagcacatgatc
aactgggtggagaagcacatgctccagaggatctctgcacagcaggaaaaggagacagtt
gccaagtgcattgcaaacaaatctaaagctgctctcaaagaaggctcaagcacagccagt
tctgtcagtgtatccatcccaactgagacaactgaaacaactgaccgactaaatgaaagc
ttccaactcgggaccaaccagagaaagccaaatgtgctttccaattcagttgcatag

>gi568815591f:94564609_94765590|GENSCAN_predicted_peptide_11|53_aa
MEYYAATKKDEFMSFAGTCMTLETIILSKLMQEQKTKHHMFSLVKGKVFTVLV

>gi568815591f:94564609_94765590|GENSCAN_predicted_CDS_11|162_bp
atggaatactatgcagccacaaaaaaggatgagttcatgtcctttgcagggacgtgtatg
acgctggaaaccatcattctcagcaaactaatgcaagaacagaaaaccaaacatcacatg
ttctcacttgtgaaaggaaaggtattcactgtcctggtctag

>gi568815591f:94564609_94765590|GENSCAN_predicted_peptide_12|100_aa
MGRNQCKKAENSKNQKASSATKDNNSSPAREQNWMENEFDKLTEVGFRTCVITNSSKQKE
HVLTQCKEAKNLDKRLDKLLIRITSLEKNINDLMELKNTA

>gi568815591f:94564609_94765590|GENSCAN_predicted_CDS_12|303_bp
atggggagaaaccagtgcaaaaaggctgaaaattccaaaaaccagaaagcctcttctgct
acaaaggataacaactcctcaccagcaagggaacaaaactggatggagaatgagtttgac
aaattgacagaagtaggcttcagaacatgcgtaataacaaactcctccaagcaaaaggag
catgttctaacccaatgcaaggaagctaagaacctcgataaaaggttagacaagttgcta
attagaataaccagtttagagaagaacataaatgacctgatggagctgaaaaacacagca
tga