GENSCAN 1.0 Date run: 4-Nov-116 Time: 15:19:10 Sequence gi568815591f:12588255_12788854 : 200600 bp : 38.65% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.06 PlyA - 1287 1282 6 1.05 1.05 Term - 4348 3266 1083 1 0 61 55 457 0.225 30.81 1.04 Intr - 6568 5489 1080 1 0 63 -18 617 0.132 38.00 1.03 Intr - 6764 6668 97 1 1 83 84 33 0.499 1.49 1.02 Intr - 7157 7024 134 2 2 67 48 116 0.736 4.02 1.01 Init - 8047 7940 108 1 0 99 86 8 0.550 1.97 1.00 Prom - 11253 11214 40 -5.55 2.00 Prom + 11385 11424 40 -5.15 2.01 Init + 12109 12111 3 0 0 61 77 0 0.374 -3.75 2.02 Intr + 16260 16409 150 2 0 86 67 149 0.420 12.04 2.03 Intr + 34547 34639 93 1 0 86 101 119 0.999 12.24 2.04 Intr + 36756 36888 133 2 1 15 93 128 0.954 5.30 2.05 Intr + 37508 37596 89 0 2 72 55 59 0.936 -0.23 2.06 Intr + 38330 38545 216 1 0 62 53 188 0.987 10.48 2.07 Intr + 40847 40968 122 1 2 81 89 102 0.996 7.97 2.08 Intr + 47791 47881 91 1 1 79 47 192 0.985 13.28 2.09 Intr + 49124 49372 249 1 0 92 71 98 0.701 5.21 2.10 Intr + 52093 52263 171 0 0 105 80 115 0.998 11.62 2.11 Intr + 55884 56061 178 2 1 87 89 194 0.999 17.97 2.12 Intr + 56330 56451 122 0 2 55 81 45 0.573 -0.21 2.13 Intr + 61213 61290 78 0 0 75 98 98 0.572 8.33 2.14 Intr + 63587 63647 61 1 1 88 107 68 0.039 6.09 2.15 Term + 72199 72287 89 2 2 121 36 103 0.059 5.24 2.16 PlyA + 72774 72779 6 -0.45 3.03 PlyA - 73335 73330 6 1.05 3.02 Term - 73807 73640 168 1 0 20 49 183 0.817 4.50 3.01 Init - 74923 74876 48 1 0 72 92 63 0.700 6.10 3.00 Prom - 81748 81709 40 -5.65 4.00 Prom + 83468 83507 40 -4.05 4.01 Init + 87424 87482 59 0 2 95 60 123 0.905 11.03 4.02 Intr + 97524 97662 139 0 1 10 6 96 0.000 -6.85 4.03 Intr + 98715 98789 75 2 0 65 99 88 0.003 6.39 4.04 Term + 100058 100603 546 1 0 62 42 312 0.004 17.29 4.05 PlyA + 100904 100909 6 1.05 5.00 Prom + 101033 101072 40 -7.75 5.01 Init + 103397 103445 49 1 1 86 58 57 0.190 1.67 5.02 Intr + 112971 113118 148 1 1 111 94 80 0.699 9.27 5.03 Intr + 138366 138507 142 0 1 103 69 38 0.039 2.83 5.04 Intr + 169935 170062 128 2 2 121 71 137 0.809 13.86 5.05 Term + 183331 183400 70 1 1 91 42 106 0.294 2.73 5.06 PlyA + 185135 185140 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 63587 63766 180 1 0 88 39 173 0.953 9.03 S.002 Sngl + 100001 100603 603 1 0 69 42 318 0.996 21.24 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:12588255_12788854|GENSCAN_predicted_peptide_1|833_aa MVEYDLLGTRHWKKEERLRLACVRLPRHTAYSHLPRAYSKVTRLAYSPTGSMGEVTRNVD KPKLRPLVNSYIETWSQDDMCQPPLPEPAKDPKDLPKFSHCWPPHPPSKPLRGASGSSVS WTPDSAATSSIPYKACKTGAALSRLCRLWSLLPLPSPPPFTMGASQSTPPKTTPLGCLLR NRKALGLRSGIRSKRLIFYCNTAWPQYQLDNGSQWPENGTFDFNILRDLDNFCHRNEKWS EIPYIQAFFILRNHPSLCHSCSTFQILLTSSKPDSSPVTPPTDPADDSSSFDPVDFLPPR QHHDPPPEHHDPPPYVPAPALPLSPTLSNQPTSDSESSLPPPLTRSRAQCAQQPAPLLPL REVAGVEGIVHVHVPFSFYDLLQIEERLGSFSSDPDTYIKEFKYLTQSYELTWHDLYFIL SSTLLPEEKERVWLAAQAYADDLHQQDPTKPVGAAAVPWEEPSWEYQPTDPGQSPLLLLS ASPAPDPFPQYPLPASLINPSVAAHHDPIRIQLKDSSKFPNVPQYPISLTHQKGLQPIVN KLCSCSLLRPTHSPYNTPILPVKKSDGSYRLVHNLRAISQAVLPIHPIVHNPYPLLSLVA TNTTLYTAIDLKDAFFTISLHPDSQNLFAFTWTDPDTLQSQQLTWTVLPQGFRDSPHFFG QALARNLTSLNLSPSHLQYVDDLLLCSPSLKDSQTHTATALLNFLAIKGYRVSPSKAQLS ISMMTYLGIQLSPGAQAMTPARAALIENLPPPSSKSEILSFPGLAGFFRIWISNFALLAH PLYEVAKGPPNEPLNPSHNILPSFHKLQTALVTASAPSLPDISQPFTLYTAES >gi568815591f:12588255_12788854|GENSCAN_predicted_CDS_1|2502_bp atggtggagtatgaccttctggggacacgccactggaaaaaggaagaaagactcaggttg gcatgtgtacgacttcctagacacactgcatattctcacctcccaagggcctacagcaag gtcaccagacttgcttacagccctacgggcagcatgggggaggtcacgagaaacgtggat aaacctaagttacgccctcttgtaaattcctatattgaaacctggtcacaagatgatatg tgccaacctccccttcccgaacctgctaaagaccctaaggatctcccaaaattctcccat tgttggccacctcatccaccatcaaagcctctgagaggggcctctggttcctctgtttcc tggacgcctgactccgcagctacctcctctattccatacaaggcctgcaagacaggcgcc gccctctctcgtctttgtcgtctgtggtcccttcttcctctaccctcacctcctccattc actatgggagcctctcagtctacccctcccaagaccaccccccttgggtgtctcctccgc aatcgcaaggctctcggcctccgttcagggatccgttctaagaggcttatcttttactgc aatactgcatggcctcaataccaattggacaatggctcccaatggcccgaaaatggcact ttcgatttcaatatacttagagacttagacaacttttgccatcgcaatgagaaatggtct gagattccttatattcaggctttcttcattctccgtaaccacccttctctctgccactcc tgctctactttccaaatcctactcaccagctcaaaacctgactcatctcccgtcacccca cccacagacccagccgatgactcctcttcctttgaccccgtggattttctccctcctcga cagcatcatgatcctccaccagagcatcatgatcctccaccgtatgtccctgctccggct ctacccctctcccccactctctccaaccaacccacttctgactctgagtcctctctgcct cctcccctcacccgctctcgggcccaatgtgctcagcaaccagctcccttgcttcctctc cgggaagtagcgggagttgaggggatcgtccatgtccacgtccctttctccttctacgat ctcttacagattgaagaacgtctcgggtccttctcctccgatcctgatacttacatcaaa gaatttaaatatcttactcaatcttatgaactcacttggcatgatctctactttatcctc tcttctaccctccttccagaagagaaggaaagagtgtggctcgcagcacaggcatatgcc gatgaccttcatcagcaggatcctactaagcccgtaggagctgctgctgttccctgggaa gagccttcctgggaataccaacccacagaccctggccaaagtcccctcctgctcctctcc gctagtccggcccctgacccctttccccagtacccacttcctgcctctctcattaaccct tccgtagctgctcaccatgaccccatcagaatccagttaaaagactcttccaaatttccc aatgttcctcaataccccatctctctaacccaccaaaagggcctacagcccatcgtaaac aagctctgctcatgcagtcttcttagaccaacacactctccatataacacccctatcctc cctgttaaaaagtctgatggctcataccgactcgttcacaacctccgagccatcagtcaa gctgtcctccctattcatcccatagtccataacccctatccacttctctctctcgtcgct accaacaccaccctctacactgcaattgacctgaaggatgccttctttaccatttcccta caccctgattcccaaaacctatttgctttcacctggactgaccctgacaccctccagtca caacaactcacatggactgtcctccctcagggcttcagggatagccctcatttctttgga caagctctagcccgaaaccttacttccttaaacctgtctcccagtcatcttcaatatgtg gacgaccttcttctttgcagcccctctctaaaagactctcaaactcacacggccactgct ctcttaaactttctcgctatcaaagggtatagggtctccccctccaaggcccaactctcc atctccatgatgacttacttaggaattcaactttcccctggggcccaggctatgactcca gccagggcagcattaatagaaaatctacccccaccctcctccaaaagcgaaatcctttcc ttcccagggctagcaggcttctttagaatatggatttccaactttgccctcctagctcat cccctctatgaagtggccaaaggccctcccaatgaacccctaaacccctcacataacata ctccccagcttccacaaactccaaactgctcttgtcactgcgtcagctccgtccttacct gatatctcccaacctttcactctctatactgctgaaagctga >gi568815591f:12588255_12788854|GENSCAN_predicted_peptide_2|614_aa MEIYQWCGSSCNKYERLKANQVATGIRYNERKGRSELIVVEEGSEPSELIKVLGEKPELP DGGDDDDIIADISNRKMAKLYMVSDASGSMRVTVVAEENPFSMAMLLSEECFILDHGAAK QIFVWKGKDANPQERKAAMKTAEEFLQQMNYSKNTQIQVLPEGGETPIFKQFFKDWRDKD QSDGFGKVYVTEKVAQIKQIPFDASKLHSSPQMAAQHNMVDDGSGKVEIWRVENNGRIQV DQNSYGEFYGGDCYIILYTYPRGQIIYTWQGANATRDELTTSAFLTVQLDRSLGGQAVQV PESLLKQSPTPKSEGGSGGEGERMNRNGRDKEKRQGERGKKRDRRRGEGREEREIMEEEM RQVRRRGGGREEREIMEEEMRQIRVSQGKEPVHLLSLFKDKPLIIYKNGTSKKGGQAPAP PTRLFQVRRNLASITRIVEVDVDANSLNSNDVFVLKLPQNSGYIWVGKGASQEEEKGAEY VASVLKCKTLRIQEGEEPEEFWNSLGGKKDYQTSPLLETQAEDHPPRLYGCSNKTGRFVI EEIPGEFTQDDLAEDDVMLLDAWEQIFIWIGKDANEVEKKESLKSAETKETHFIRGPKTL APVTDSGRQSSLGV >gi568815591f:12588255_12788854|GENSCAN_predicted_CDS_2|1845_bp atggaaatttatcagtggtgtggttcctcgtgcaacaaatatgaacgtctgaaggcaaac caggtagctactggcattcggtacaatgaaaggaaaggaaggtctgaactaattgtcgtg gaagaaggaagtgaaccctcagaacttataaaggtcttaggggaaaagccagagcttcca gatggaggtgatgatgatgacattatagcagacataagtaacaggaaaatggctaaacta tacatggtttcagatgcaagtggctccatgagagtgactgtggtggcagaagaaaacccc ttctcaatggcaatgctgctgtctgaagaatgctttattttggaccacggggctgccaaa caaattttcgtatggaaaggtaaagatgctaatccccaagagaggaaggctgcaatgaag acagctgaagaatttctacagcaaatgaattattccaagaatacccaaattcaagttctt ccagaaggaggtgaaacaccaatcttcaaacagttttttaaggactggagagataaagat cagagtgatggcttcgggaaagtttatgtcacagagaaagtggctcaaataaaacaaatt ccctttgatgcctcaaaattacacagttctccgcagatggcagcccagcacaatatggtg gatgatggttctggcaaagtggagatttggcgtgtagaaaacaatggtaggatccaagtt gaccaaaactcatatggtgaattctatggtggtgactgctacatcatactctacacctat cccagaggacagattatctacacgtggcaaggagcaaatgccacacgagatgagctgaca acatctgcgttcctgactgttcagttggatcggtcccttggaggacaggctgtgcaggta ccagaatctttactgaaacaaagtccaacacccaaaagtgagggaggaagtggtggagag ggagaaaggatgaacagaaatgggagggacaaagagaaaagacagggagaaaggggaaag aaaagggataggaggagaggagagggaagagaagagagggaaatcatggaggaagaaatg agacaagtaaggaggagaggagggggaagagaagagagggaaatcatggaggaagaaatg agacaaatccgagtctcccaaggcaaagagcctgttcacctactgagtttgttcaaagac aaaccgctcattatttacaagaatggaacatcaaagaaaggaggtcaggcacctgctccc cctacacgcctctttcaagtccggagaaacctggcatctatcaccagaattgtggaggtt gatgttgatgcaaattcactgaattctaacgatgtttttgtcctgaaactgccacaaaat agtggctacatctgggtaggaaaaggtgctagccaggaggaggagaaaggagcagagtat gtagcaagtgtcctaaagtgcaaaaccttaaggatccaagaaggcgaggagccagaggag ttctggaattcccttggagggaaaaaagactaccagacctcaccactactggaaacccag gctgaagaccatccacctcggctttacggctgctctaacaaaactggaagatttgttatt gaagagattccaggagagttcacccaggatgatttagctgaagatgatgtcatgttacta gatgcttgggaacagatatttatttggattggcaaagatgctaatgaagttgagaaaaaa gaatctctgaagtctgcagagacaaaggagacacattttatccgtggacccaaaactctg gcgccagtcacagactcgggaagacagtcttcccttggtgtttaa >gi568815591f:12588255_12788854|GENSCAN_predicted_peptide_3|71_aa MIGKEGVCVFMKNYAEASEAIGQCQSSATKPRRSGKESVTELWPRVPGPLGVAARSDSLI SSGVPHRWDTT >gi568815591f:12588255_12788854|GENSCAN_predicted_CDS_3|216_bp atgatcggcaaggaaggcgtgtgtgtttttatgaagaactacgctgaggcttccgaggcg atcgggcagtgtcagtcttcagccactaagccgagaagatctgggaaggagtcagtcaca gagctttggcccagagttccagggcctctgggagtggctgccaggtcggacagtctgatt tccagtggggtcccgcacagatgggacacgacttag >gi568815591f:12588255_12788854|GENSCAN_predicted_peptide_4|272_aa MAEQEQLQSAAPSVNDVEDSNFGADFSTVQDLEMCARHAKTTTINTEDVKLLARRSNSPL KYIQTKGTGISAPPQRLGETLNEEEGRTVSQSFHIVILGLDCAGKTTVLYRLQFNEFVNT VPTKGFNTEKIKVTLGNSKTVTFHFWDVGGQEKLRPLWKSYTRCTDGIVFVVDSVDVERM EEAKTELHKITRISENQGVPVLIVANKQDLRNSLSLSEIEKLLAMGELSSSTPWHLQPTC AIIGDGLKEGLEKLHDMIIKRRKMLRQQKKKR >gi568815591f:12588255_12788854|GENSCAN_predicted_CDS_4|819_bp atggccgaacaggaacagctccagtctgcagctcccagtgtgaatgacgtagaagacagc aatttcggagctgacttttcgacagtgcaagatctagaaatgtgtgcaagacatgcgaaa accaccaccattaacactgaagatgtgaagctcttagccagaaggagtaattcaccgcta aaatacattcagacaaaaggaacaggtatcagcgcgcctccccagcggctgggagagacg ctgaatgaagaagagggaaggacggttagtcagtctttccacattgttattctgggtttg gactgtgctggaaagacaactgtcttatacaggctgcagttcaatgaatttgtaaatacc gtacctaccaaaggatttaacactgagaaaattaaggtaaccttgggaaattctaaaaca gtcacttttcacttctgggatgtaggtggtcaggagaaattaaggccactgtggaagtca tataccagatgcacagatggcattgtatttgttgtggactctgttgatgtcgaaaggatg gaagaagccaaaactgaacttcacaaaataactaggatatcagaaaatcagggagtccct gtacttatagttgctaacaaacaagatttgaggaactcattgtcactttcagaaattgag aaattgttagcaatgggtgaactgagctcatcaactccttggcatttgcagcctacctgt gcaatcataggagatggcctaaaggaaggacttgagaaactacatgatatgatcattaaa agaagaaaaatgttgcggcaacagaaaaagaaaagatga >gi568815591f:12588255_12788854|GENSCAN_predicted_peptide_5|178_aa MGFRHVGQAVLELLTSGNGAGPLLKWVSYDLQSHKSENFFMASSKTERQGRFLTLEEEEQ MKGGRRVPHCFYPQVIYFKSFKADQLCIQVADFKNDPSLIVSAAAPPTPPQAQGDMLPER KDHASPVYHAVSHSSTSFSDADRWKPEQYLVYTTLSCSEARAVTVHIFRRWSTTPAGK >gi568815591f:12588255_12788854|GENSCAN_predicted_CDS_5|537_bp atggggtttcgccatgttggccaggctgttcttgaactcctgacctcagggaatggggca ggacctcttctgaaatgggtgtcttatgacctacaatcacataagtcagagaatttcttt atggccagctctaagacagaaaggcagggaagatttctgactttggaagaggaggagcag atgaaaggaggcaggagggtgcctcactgcttttatcctcaagtgatttactttaaatca tttaaagcagatcaactttgtatccaagtggctgattttaaaaatgacccatcacttatc gtatctgctgcagctcccccaacccctcctcaagcccagggagatatgcttcctgagaga aaggaccacgcttctcctgtttaccatgctgtctcccactctagcacatccttcagcgat gctgaccggtggaaaccagagcagtacctcgtgtatactacgctatcttgcagcgaggcc agggctgtaaccgttcacatctttagacgatggtctactacgccagctggcaagtga