GENSCAN 1.0 Date run: 6-Nov-116 Time: 09:39:23 Sequence gi568815591f:123555964_123763205 : 207242 bp : 37.11% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 715 798 84 0 0 114 100 51 0.929 9.77 1.02 Term + 1821 2189 369 2 0 90 44 129 0.746 2.36 1.03 PlyA + 2544 2549 6 1.05 2.05 PlyA - 4048 4043 6 1.05 2.04 Term - 6362 6203 160 1 1 54 48 95 0.061 -1.47 2.03 Intr - 15011 14954 58 0 1 82 119 47 0.247 4.22 2.02 Intr - 26118 25998 121 0 1 122 98 75 0.929 11.05 2.01 Init - 35120 34920 201 2 0 33 86 118 0.602 5.02 2.00 Prom - 35814 35775 40 -10.45 3.02 PlyA - 36118 36113 6 1.05 3.01 Sngl - 36922 36344 579 1 0 51 36 298 0.913 16.82 3.00 Prom - 53527 53488 40 -2.65 4.00 Prom + 54744 54783 40 -6.55 4.01 Init + 58540 58646 107 0 2 74 62 125 0.905 8.24 4.02 Intr + 60258 60310 53 0 2 74 94 22 0.977 -0.97 4.03 Intr + 60401 60532 132 0 0 34 95 126 0.991 7.60 4.04 Intr + 61616 61774 159 0 0 55 27 135 0.276 3.14 4.05 Intr + 68558 68851 294 0 0 30 95 253 0.143 16.26 4.06 Intr + 71147 71318 172 0 1 49 65 192 0.977 11.08 4.07 Intr + 72901 73471 571 1 1 42 117 426 0.751 32.73 4.08 Intr + 88707 88868 162 2 0 64 91 77 0.048 4.85 4.09 Intr + 92481 92517 37 2 1 43 89 51 0.024 -2.38 4.10 Intr + 99972 100273 302 1 2 51 102 393 0.240 32.53 4.11 Intr + 102318 102437 120 2 0 44 85 124 0.936 7.47 4.12 Intr + 105897 107240 1344 2 0 60 114 1476 0.559 134.91 4.13 Term + 123797 123976 180 1 0 71 54 85 0.148 0.03 4.14 PlyA + 125075 125080 6 1.05 5.13 PlyA - 125586 125581 6 1.05 5.12 Term - 128617 128556 62 2 2 79 39 184 0.998 9.59 5.11 Intr - 133187 133079 109 2 1 66 80 94 0.929 5.34 5.10 Intr - 136904 136384 521 0 2 77 84 381 0.929 28.34 5.09 Intr - 138905 138752 154 2 1 50 88 152 0.995 10.12 5.08 Intr - 139902 139860 43 2 1 87 95 18 0.991 -0.28 5.07 Intr - 140784 140616 169 1 1 96 63 113 0.673 7.68 5.06 Intr - 148694 148671 24 0 0 86 103 34 0.767 1.78 5.05 Intr - 150410 150314 97 2 1 20 51 158 0.798 3.96 5.04 Intr - 150863 150777 87 2 0 81 95 16 0.517 0.85 5.03 Intr - 153260 153126 135 2 0 62 113 23 0.566 2.04 5.02 Intr - 157879 157831 49 0 1 44 103 52 0.167 0.06 5.01 Init - 163143 163043 101 0 2 42 100 79 0.171 4.28 5.00 Prom - 187875 187836 40 -3.55 6.06 PlyA - 188728 188723 6 1.05 6.05 Term - 192487 192437 51 1 0 100 49 88 0.913 2.55 6.04 Intr - 192838 192655 184 0 1 20 49 184 0.699 6.57 6.03 Intr - 193420 193255 166 2 1 60 22 185 0.864 7.20 6.02 Intr - 201625 201489 137 0 2 78 94 77 0.934 6.59 6.01 Init - 206661 206576 86 0 2 58 76 75 0.555 3.64 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:123555964_123763205|GENSCAN_predicted_peptide_1|150_aa MEITGELEKSVMEKVSNVSKNAAIRSMKHTRHDSANDSVTHNPLGTILRPAQSTTEVATL SPWKQLSSTPSGSPKEQRLCPAHLKREKLTHARWAYKPKAKLEDDWGKKGFPSCLFLFLP LLTACPKIEGNTPNYQKTPVAVGTCTVLNR >gi568815591f:123555964_123763205|GENSCAN_predicted_CDS_1|453_bp atggagataactggtgaacttgaaaagagcgtgatggaaaaagtgtcaaatgtgtcaaaa aacgctgctattaggtcaatgaagcacacccgccatgacagcgccaacgactcggtgacg cacaaccctttgggaacaattctcagacctgctcaatcgaccaccgaggtcgccactctg tcaccctggaaacagcttagctccaccccttcaggatcgccaaaggagcaaagactctgc cccgcccatcttaaaagagaaaaattgacgcatgcgcgatgggcatacaagcctaaagct aagctagaagacgattggggaaagaaagggtttcccagctgtctgttcctgttcctgcca ctgctgacggcttgcccaaagattgagggaaacactcccaactaccaaaaaactccagtc gctgtaggaacctgcactgtgcttaaccgttga >gi568815591f:123555964_123763205|GENSCAN_predicted_peptide_2|179_aa MSELPFTITTKRIKYLGIHLTGDVKDLFKENYKPLLNKIKEDTNKWKNIPYSSIGRISIV KMAILPKIFYHYNIDGDFHLLIKDVLYSQMEMEQQYTISAPEFMAGADGNIGIVPEELKD SPRPYSWLMQLLKPMLTYHPENPRTLKNDAKSSLPVFYKWNNKVWAKKKEDFFQNITAH >gi568815591f:123555964_123763205|GENSCAN_predicted_CDS_2|540_bp atgagtgaactcccatttacaattactacaaagagaataaaatacctaggaatccacctt acaggggacgtgaaggacctcttcaaggagaactacaaaccactgctcaataaaataaaa gaagacacaaacaaatggaagaacattccatactcatcgataggaagaatcagtatcgtg aaaatggccatactgcccaagatattttaccattacaatatagatggagattttcatcta ttaataaaagatgttctctattcacaaatggaaatggagcaacagtacacaatttctgcc ccagaattcatggctggtgcagatggaaatattggcatagttccagaggagttaaaggac tcacccaggccgtacagctggcttatgcagctgttgaagcccatgctcacttaccatcct gaaaatcctaggaccctgaagaatgatgctaaatctagtctgcctgtgttctataaatgg aacaacaaagtctgggctaagaaaaaagaagatttctttcaaaatattactgctcattga >gi568815591f:123555964_123763205|GENSCAN_predicted_peptide_3|192_aa MEDFNTALSILDRSMRQKINKDIQDLNSDVMHIYRTLHPKSTEYTFFSAPHRTYSEIDQK IGSKTLLSKCKRTEIATNGLSDHSAIKLEIRIKKLTPNCTTAWKLNNLLLSDYWVHNKMK AEMKMFFEINENKDTTYQNLWDTFKAVCGGKFIALNAHKRKQERSKIITLTSQLKELEKQ EKQIQKLAEDKK >gi568815591f:123555964_123763205|GENSCAN_predicted_CDS_3|579_bp atggaagactttaataccgcactgtcaatattggacagatcaatgagacagaaaattaac aaggatatccaggacttgaattcagacgtaatgcacatctacagaactctccaccccaaa tcaacagaatatacattcttctcagcaccacatcgcacttattccgaaattgaccaaaaa attggaagtaaaacactcctcagcaaatgtaaaagaacagaaatcgcaacaaacggtctc tcagaccacagtgcaatcaaattagaaataaggattaagaaactcactccaaactgcaca actgcatggaaactgaacaacctgctcctgagtgactactgggtacataacaaaatgaag gcagaaatgaagatgttctttgaaatcaatgagaacaaagacacaacgtaccagaatctc tgggatacatttaaagcagtgtgtggagggaaatttatagcactaaatgcccacaagaga aagcaggaaagatctaaaataatcaccctaacatcacaattaaaagaactagagaagcaa gagaaacaaattcaaaagctagcagaagacaagaaataa >gi568815591f:123555964_123763205|GENSCAN_predicted_peptide_4|1210_aa MDTNDDPDEDHLTSYDIQLSIQESIEASKTALCPERFVPLSAQNRKLVEAIKQGHIPELQ EYVKYKYAMDEADEKGWFPLHEAVVQPIQQILEIVLDASYKTLWEFKTCDGETPLTLAVK AGLVENVRTLLEKGVWPNTKNDKGETPLLIGICVNILTVVFNNHFQAVKKGSYDMVSTLI KHNTSLDQPCVKRWSAMHEAAKQGRKDIVALLLKHGGNVHLRDGFGVTPLGVAAEYGHCD VLEHLIHKGGDVLALADDGASVLFEAAGGGNPDCISLLLEYGGSGNVPNRAGHLPIHRAA YEGHYLALKYLIPVTSKNAIRKSGLTPIHSAADGQNAQCLELLIENGFDVNTLLADHISQ SYDDERKTALYFGVSNNDVHCTEVLLAAGADPNLDPLNCLLVAVRANNYEIVRLLLSHGA NVNCYFMHVNDTRFPSVIQYALNDEVMLRLLLNNGYQVEMCFDCMHGDIFGNSFVWSEIQ EEVLPGWTSCVIKDNPDLESSKVTVVMTLRREPYQGDRCLGCSPSTSFFLILQRTTLSKL VRPTLGVSGQRKGLSSEDLFETAPSGSDKAGTMSTFGYRRGLSKYESIDEDELLASLSAE ELKELERELEDIEPDRNLPVGLRQKSLTEKTPTGTFSREALMAYWEKESQKLLEKERLGE CGKLSVVARHDEQELADGGCKSNSHNRENLRAAVGWLGEQGSCVAEDKEESEEELIFTES NSEVSEEVYTEEEEEESQEEEEEEDSDEEERTIETAKGINGTVNYDSVNSDNSKPKIFKS QIENINLTNGSNGRNTESPAAIHPCGNPTVIEDALDKIKSNDPDTTEVNLNNIENITTQT LTRFAEALKDNTVVKTFSLANTHADDSAAMAIAEMLKVNEHITNVNVESNFITGKGILAI MRALQHNTVLTELRFHNQRHIMGSQVEMEIVKLLKENTTLLRLGYHFELPGPRMSMTSIL TRNMDKQRQKRLQEQKQQEGYDGGPNLRTKVWQRGTPSSSPYVSPRHSPWSSPKLPKKVQ TVRSRPLSPVATPPPPPPPPPPPPPSSQRLPPPPPPPPPPLPEKKLITRNIAEVIKQQES AQRALQNGQKKKKGKKVKKQPNSILKEIKNSLRSVQEKKMEDSSRPSTPQRSAHENLMEA IRGSSIKQLKRMTEVPCDDQMLGQCRVTAANEPKYNHMHQSQRLAPQNKPAFLVRDGHRM LAASINTQMI >gi568815591f:123555964_123763205|GENSCAN_predicted_CDS_4|3633_bp atggatactaatgatgaccctgatgaagaccatcttacaagttatgatattcagctaagt attcaagaatccattgaagccagcaagactgcactttgtcctgaaagatttgtaccccta agtgctcaaaacagaaaacttgtggaggccataaaacaaggtcacattcctgagctccag gagtatgtaaaatataaatatgcaatggatgaagctgatgaaaaaggatggtttccattg catgaagctgttgttcaacccattcaacaaatacttgagattgttctggatgcatcctat aagacactctgggaattcaagacctgtgatggagaaacacccttgactttggcagtcaaa gctggtctggtggaaaatgtaagaactttattagaaaagggagtgtggcccaacacaaaa aatgataaaggagagaccccccttctgattggtatttgtgttaatatacttactgttgtt tttaacaatcattttcaagctgtgaaaaagggctcctatgacatggtgtcgactctgatc aaacataacactagcctagaccagccctgtgtcaagcgatggtcagcaatgcatgaagca gccaagcaaggccgaaaagatatcgtagctctgctgctgaaacatggaggcaatgtccac ctgagagatggatttggagtcacaccactaggcgtcgctgccgagtatggtcactgtgac gtgttagaacatctaatccacaaaggtggtgatgtgcttgctttggcggatgatggggcg tcggtgctgtttgaggcagcaggaggtggcaatcccgactgcatttccctcctgctggaa tatggaggaagcggaaatgtacctaaccgagcaggacatcttcctatacaccgagctgcc tatgaggggcattatcttgcactgaaatatcttatcccagtaacatctaaaaatgcaatt cggaaaagtgggctaacaccaattcactcagcagcagatggacaaaatgcacagtgtcta gaactgctcattgaaaatggttttgatgtcaacactctacttgctgaccacatttcccag agctatgacgatgagaggaagactgcgctgtattttggcgtttctaataatgacgttcat tgcacagaagtccttctggctgcaggtgcagacccaaacttagatcccctcaactgtcta cttgttgcagtgagggccaataattatgaaattgtcaggctgcttctctcccatggagct aatgtcaattgttattttatgcatgtgaatgacactcgtttccccagtgtcattcaatat gctctaaacgacgaggtaatgctgaggctattgctgaataatggctatcaagtggagatg tgctttgactgcatgcatggtgacatctttggaaattcatttgtgtggtcagagatacag gaagaggtgctgccaggatggacatcttgtgtaataaaagataacccggacctggagtct tctaaagtgacagttgtcatgaccctgaggagggaaccataccaaggagatcgctgtctt gggtgctcaccttccacatcattcttcttaatactacagcgaacaactctgtccaaactt gtcagaccaacactcggggtttcaggacagagaaaaggcctttccagcgaagacctattt gaaacagctccttctgggtctgacaaagcagggaccatgtctacctttggctaccgaaga ggactcagtaaatacgaatccatcgacgaggatgaactcctcgcctccctgtcagccgag gagctgaaggagctagagagagagttggaagacattgaacctgaccgcaaccttcccgtg gggctaaggcaaaagagcctgacagagaaaacccccacagggacattcagcagagaggca ctgatggcctattgggaaaaggagtcccaaaaactcttggagaaggagaggctgggggaa tgtggaaagctgtctgtagttgccaggcatgatgagcaggaacttgctgatggagggtgc aaatccaattcccacaacagggaaaatcttcgagcagctgttgggtggctgggggagcaa ggtagttgcgttgcagaagacaaagaggaaagtgaagaagagcttatctttactgaaagt aacagtgaggtttctgaggaagtgtatacagaggaggaggaggaggagtcccaggaggaa gaggaggaagaagacagtgacgaagaggaaagaacaattgaaactgcaaaagggattaat ggaactgtaaattatgatagtgtcaattctgacaactctaagccaaagatatttaaaagt caaatagagaacataaatttgaccaatggcagcaatgggaggaacacagagtccccagct gccattcacccttgtggaaatcctacagtgattgaggacgctttggacaagattaaaagc aatgaccctgacaccacagaagtcaatttgaacaacattgagaacatcacaacacagacc cttacccgctttgctgaagccctcaaggacaacactgtggtgaagacgttcagtctggcc aacacgcatgccgacgacagtgcagccatggccattgcagagatgctcaaagtcaatgag cacatcaccaacgtaaacgtcgagtccaacttcataacgggaaaggggatcctggccatc atgagagctctccagcacaacacggtgctcacggagctgcgtttccataaccagaggcac atcatgggcagccaggtggaaatggagattgtcaagctgctgaaggagaacacgacgctg ctgaggctgggataccattttgaactcccaggaccaagaatgagcatgacgagcattttg acaagaaatatggataaacagaggcaaaaacgtttgcaggagcaaaaacagcaggaggga tacgatggaggacccaatcttaggaccaaagtctggcaaagaggaacacctagctcttca ccttatgtatctcccaggcactcaccctggtcatccccaaaactccccaaaaaagtccag actgtgaggagccgtcctctgtctcctgtggccacacctcctcctcctccccctcctcct cctcctccccctccttcttcccaaaggctgccaccacctcctcctcctccccctcctcca ctcccagagaaaaagctcattaccagaaacattgcagaagtcatcaaacaacaggagagt gcccaacgggcattacaaaatggacaaaaaaagaaaaaagggaaaaaggtcaagaaacag ccaaacagtattctaaaggaaataaaaaattctctgaggtcagtgcaagagaagaaaatg gaagacagttcccgaccttctaccccacagagatcagctcatgagaatctcatggaagca attcggggaagcagcataaaacagctaaagcggatgacagaagtgccctgtgatgaccaa atgctcgggcaatgtagggtgactgccgccaatgagccaaaatacaatcacatgcatcag agtcagaggctggctcctcaaaacaagccagcttttctggttagagatggacataggatg ttggcggcatctatcaacacccagatgatatga >gi568815591f:123555964_123763205|GENSCAN_predicted_peptide_5|516_aa MIKINFTLKRDDVTSRVLVEHRCGSENVAATTQKTLRYGVLTVFSEEDLHTMSSAVVQLY AADRNCMWSKKCSGVACLVKDNPQRSYFLRIFDIKDGKLLWEQELYNNFVYNSPRGYFHT FAGDTCQVALNFANEEEAKKFRKAVTDLLGRRQRKSEKRRDPPNGPNLPMATVDIKNPEI TTNRFYGPQVNNISHTKEKKKGKAKKKRLTKADIGTPSNFQHIGHVGWDPNTGFDLNNLD PELKNLFDMCGISEAQLKDRETSKVIYDFIEKTGGVEAVKNELRRQAPPPPPPSRGGPPP PPPPPHNSGPPPPPARGRGAPPPPPSRAPTAAPPPPPPSRPSVAVPPPPPNRMYPPPPPA LPSSAPSGPPPPPPSVLGVGPVAPPPPPPPPPPPGPPPPPGLPSDGDHQVPTTAGNKAAL LDQIREGAQLKKVEQNSRPVSCSGRDALLDQIRQGIQLKSVADGQESTPPTPAPTSGIVG ALMEVMQKRSKAIHSSDEDEDEDDEEDFEDDDEWED >gi568815591f:123555964_123763205|GENSCAN_predicted_CDS_5|1551_bp atgatcaaaataaattttaccttaaaaagagatgatgtgacaagtagagttttagtggaa catcgctgtggaagcgaaaatgttgctgccactacacagaaaaccctgcgatatggagtc ctgactgtatttagtgaagaagatctgcatactatgtcttcagcagtggtgcagttatat gcagcagatcggaactgtatgtggtcaaagaagtgcagtggtgttgcttgtcttgttaag gacaatccacagagatcttattttttaagaatatttgacattaaggatgggaaactattg tgggaacaagagctatacaataactttgtatataatagtcctagaggatattttcatacc tttgctggagatacttgtcaagttgctcttaattttgccaatgaagaagaagcaaaaaaa tttcgaaaagcagttacagaccttttgggccgtcgacaaaggaaatctgagaaaagacga gatcccccaaatggtcctaatctacccatggctacagttgatataaaaaatccagaaatc acaacaaatagattttatggtccacaagtcaacaacatctcccataccaaagaaaagaag aagggaaaagctaaaaagaagagattaaccaaggcagatataggaacaccaagcaatttc cagcacattggacatgttggttgggatccaaatacaggctttgatctgaataatttggat ccagaattgaagaatcttttcgatatgtgtggaatctcagaggcacaacttaaagacaga gaaacatcaaaagttatatatgactttattgaaaaaacaggaggtgttgaagctgttaaa aatgaactgcggaggcaagcaccaccacctccaccaccatcaaggggagggccacctcct cctcctccccctccacacaactcaggtcctcctcctcctcctgctaggggaagaggcgct cctcccccaccaccttcaagagctcccacagctgcacctccaccaccgcctccttccagg ccaagtgtagcagtccctccaccaccgccaaataggatgtaccctcctccacctccagcc cttccctcctcagcaccttcagggcctccaccaccacctccatctgtgttgggggtaggg ccagtggcaccacccccaccgcctccacctccacctcctcctgggccaccgcccccgcct ggcctgccttctgatggggaccatcaggttccaactactgcaggaaacaaagcagctctt ttagatcaaattagagagggtgctcagctaaaaaaagtggagcagaacagtcggccagtg tcctgctctggacgagatgcactgttagaccagatacgacagggtatccaactaaaatct gtggctgatggccaagagtctacaccaccaacacctgcacccacttcaggaattgtgggt gcattaatggaagtgatgcagaaaaggagcaaagccattcattcttcagatgaagatgaa gatgaagatgatgaagaagattttgaggatgatgatgagtgggaagactga >gi568815591f:123555964_123763205|GENSCAN_predicted_peptide_6|207_aa MWVGPDKRIKAGCLSWQWQPAQVPFDTESKNKAWWAFLDDNDKKAQLSALFQSHCPMITM LLSLNGTESKRRLSRKAGGAATARSPHPSSLQAAVAGFGYPTDEPRRQRPEYEVGSLRPK PTGIPLSSARGNELSPTRRRRRPWTPNPAGETMSSVQQQPPPPRRVTNVGSLLLTPQENE SLFTFLGKKCVGAGRGGRAPPSRAAGE >gi568815591f:123555964_123763205|GENSCAN_predicted_CDS_6|624_bp atgtgggtggggccagataagagaataaaagcaggctgcctgagctggcagtggcaacct gctcaggtccctttcgacactgaaagcaagaacaaggcttggtgggcctttctggatgat aatgacaagaaggcacagttgagtgccttgtttcaatcacactgtccaatgatcacaatg ctgctatctctcaatggaactgagagcaaaagaagactctctaggaaggccggcggcgca gcgacggcgaggagtccccatccatcttctcttcaagcagcagtagctgggttcgggtac ccaaccgacgagccgagacgccagaggccagagtacgaagttggaagcctgcgccccaag ccaaccgggattccacttagctccgcgcgggggaacgagctctcgcccactcgccggagg agacggccctggactcccaaccccgccggcgaaaccatgagctccgtccagcagcagccg ccgccgccgcggagggtcaccaacgtggggtccctgttgctcaccccgcaggagaacgag tccctcttcactttcctcggcaagaaatgtgtgggtgcgggccgtggcggtcgggcgccg ccttcccgagctgctggagaataa