GENSCAN 1.0    Date run: 7-Nov-116    Time: 21:21:58

Sequence gi568815581f:45508571_45708915 : 200345 bp : 45.41% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.13 PlyA -    404    399    6                               1.05
 1.12 Term -    486    462   25  2  1   41   50    67 0.007  -4.10
 1.11 Intr -   1723   1674   50  1  2   91   94    33 0.030   1.68
 1.10 Intr -   4719   4615  105  0  0  103   79    57 0.974   6.71
 1.09 Intr -   5585   5298  288  2  0    6   78   253 0.578  13.54
 1.08 Intr -   6597   6182  416  1  2   42   94   217 0.593  11.52
 1.07 Intr -   6809   6669  141  0  0   30   38   175 0.674   7.02
 1.06 Intr -   8712   8586  127  0  1  103    5    41 0.742  -2.45
 1.05 Intr -   9578   9291  288  2  0    6   78   253 0.563  13.54
 1.04 Intr -  10590  10433  158  1  2   42   59   118 0.713   4.03
 1.03 Intr -  10802  10662  141  0  0   30   38   175 0.470   7.02
 1.02 Intr -  12855  12737  119  2  2   98   72     5 0.462   0.01
 1.01 Init -  20520  20459   62  0  2  106   36    58 0.626   3.12
 1.00 Prom -  30211  30172   40                              -1.86

 2.03 PlyA -  30908  30903    6                               1.05
 2.02 Term -  41485  39153 2333  2  2  103   48  1197 0.069 102.58
 2.01 Init -  47286  47142  145  0  1   67   66    98 0.048   5.78
 2.00 Prom -  52923  52884   40                              -3.06

 3.00 Prom +  55532  55571   40                              -3.56
 3.01 Init +  77424  78339  916  2  1   75   69   986 0.946  87.63
 3.02 Intr +  78379  78456   78  2  0  129   81    40 0.944   6.92
 3.03 Term +  88430  88476   47  0  2   33   40    91 0.006  -3.83
 3.04 PlyA +  89112  89117    6                               1.05

 4.03 PlyA -  89400  89395    6                               1.05
 4.02 Term -  93316  92299 1018  0  1   69   42  1648 0.730 149.89
 4.01 Init -  93830  93445  386  2  2   82   26   419 0.563  31.41
 4.00 Prom -  96622  96583   40                              -5.66

 5.00 Prom +  99062  99101   40                              -8.56
 5.01 Init + 100001 100344  344  1  2   75   75   382 0.139  32.31
 5.02 Intr + 107469 107569  101  0  2   71   26    46 0.002  -3.55
 5.03 Intr + 124480 124604  125  2  2  109   55    51 0.688   4.20
 5.04 Term + 127713 127874  162  2  0   70   43   158 0.893   7.44
 5.05 PlyA + 129362 129367    6                               1.05

 6.09 PlyA - 129809 129804    6                               1.05
 6.08 Term - 148061 147990   72  2  0   94   41    67 0.159   0.51
 6.07 Intr - 162191 162069  123  2  0   52   86    90 0.353   5.98
 6.06 Intr - 163627 163548   80  2  2  105   64    23 0.196   0.87
 6.05 Intr - 164051 163970   82  2  1  133   68    25 0.162   4.21
 6.04 Intr - 193307 193223   85  1  1   88  105    71 0.796   8.62
 6.03 Intr - 196100 196025   76  0  1  103   91    10 0.360   1.37
 6.02 Intr - 196968 196896   73  0  1  105   49    35 0.274   0.28
 6.01 Init - 197172 197107   66  0  0  110   98    14 0.678   5.67

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  76398  76254  145  0  1   92   94   181 0.991  17.53
S.002 Sngl + 100001 100348  348  1  0   75   49   395 0.839  30.34

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815581f:45508571_45708915|GENSCAN_predicted_peptide_1|639_aa
MPSISIVFEEKSAINLIEDPLILPSHMACCLCQLKNSIEAVCKTVKLHCNSACLTNTIHC
PEEESVGNPEGAFMKMLQARKNYTSTELTVEPEEPSDSSGINLSGFGTVNLDVESMLLPF
IKLPTTGNSLAKIQTVGQNRQKVNRVLMGPMSIQKRHFKEFEIQLTQQLQSLIPNNNVRR
LISHVIRTLKMDCSGAHVQVTCAKLISRTGHLMKLLSGQQEVKASEIEWDTDQWKTENYI
NESTEAQSEQKEKSLELTKEVPGYGYTDKLILALIVTEILMILIILFCLIVVRTIINSAE
EESVGNPEGAFMKMLQARKNYTSTELTVEPEEPSDSSGINLSGFGTVNLDVESMLLPFIK
LPTTGNSLAKIQTVGQNWQKVNRVLMGPMSIQKRHFKEVGRQSIRREQGAQASVENAAEE
KRLGSPAPRELEQPHTQQGPEKLAGNAIYTKPSFSQEHKAAVSVLTPFSKGAPSTSSPAK
ALPQFEIQLTQQLQSLIPNNNVRRLISHVIRTLKMDCSGAHVQVTCAKLISRTGHLMKLL
SGQQEVKASEIEWDTDQWKTENYINESTEAQSEQKEKSLELTKEVPGYGYTDKLILALIV
TEILMILIILFCLIVMCCHRRSLQEDEEGFSSDSQPYED

>gi568815581f:45508571_45708915|GENSCAN_predicted_CDS_1|1920_bp
atgcccagcatctccattgtttttgaagagaaatcagctattaatcttattgaggatccc
ttgatcttacctagccatatggcctgctgcctctgccaacttaaaaacagcattgaggct
gtctgcaagacagtcaagctgcattgcaacagtgcatgtctgacaaacaccatacattgt
cctgaagaagaatctgtagggaatccagaaggagcattcatgaagatgttacaagcccgg
aagaattacacaagcactgagctgactgttgagccggaggagccctcagacagcagtggc
atcaacttgtcaggctttgggacagtaaacctagatgtggaatcaatgttactaccgttc
attaaactgccaaccacaggaaacagcctggcaaagattcaaactgtaggccaaaaccgg
caaaaagtgaatagagtcctcatgggcccaatgagcatccagaaaaggcacttcaaagag
tttgaaattcagctaacccagcagctacagtcccttatccccaacaacaatgtgagaagg
ctcatttctcatgttatccggaccttgaagatggactgctctggggcccatgtgcaagtg
acctgtgccaagctcatctccaggacaggccacctgatgaagcttctcagtgggcagcag
gaagtaaaggcatctgagatagaatgggatacggaccaatggaagactgagaactacatt
aatgagagcacggaagcccagagtgaacagaaagagaagtcgcttgagctcacaaaagaa
gttccaggatacggctatactgacaaactcatcttggcattaattgtgactgaaatacta
atgattttgattatacttttctgcctcattgtggtaaggacaataattaattcagctgaa
gaagaatctgtagggaatccagaaggagcattcatgaagatgttacaagcccggaagaat
tacacaagcactgagctgactgttgagccggaggagccctcagacagcagtggcatcaac
ttgtcaggctttgggacagtaaacctagatgtggaatcaatgttactaccgttcattaaa
ctgccaaccacaggaaacagcctggcaaagattcaaactgtaggccaaaactggcaaaaa
gtgaatagagtcctcatgggcccaatgagcatccagaaaaggcacttcaaagaggtggga
aggcagagcatcaggagggaacagggtgcccaggcatctgtggagaacgctgccgaagaa
aaaaggctcgggagtccagccccaagggagctggaacagcctcacacacagcaggggcct
gagaagttagcgggaaacgccatctacaccaagccttcgttcagccaagagcataaggca
gcagtctctgtgctgacacccttctccaagggcgcgccttctacctccagccctgcaaaa
gccctaccacagtttgaaattcagctaacccagcagctacagtcccttatccccaacaac
aatgtgagaaggctcatttctcatgttatccggaccttgaagatggactgctctggggcc
catgtgcaagtgacctgtgccaagctcatctccaggacaggccacctgatgaagcttctc
agtgggcagcaggaagtaaaggcatctgagatagaatgggatacggaccaatggaagact
gagaactacattaatgagagcacggaagcccagagtgaacagaaagagaagtcgcttgag
ctcacaaaagaagttccaggatacggctatactgacaaactcatcttggcattaattgtg
actgaaatactaatgattttgattatacttttctgcctcattgtgatgtgttgtcaccga
aggtcattacaagaagatgaagaaggattctcaagcgacagccagccgtatgaagactga

>gi568815581f:45508571_45708915|GENSCAN_predicted_peptide_2|825_aa
MITQIGILNPLVWVGFKTVVIPMKGKIAQIAIEKALSDAFQKLLIVVLEMPAPPQESTEN
LVPFLDTWDSAGEQPLEPEQFLASQQDLKDKLSPQERLPVSPKKLKKDPAQRWSLAEIIG
IARQLSTPQSQKQTLQNEYSSTDTPYPSSLPPELRVKSDEPPGPSEQVGPSQFHLEPETQ
NPETLEDIQSSSLQQEAPAQLPQLLEEEPSSMQQEAPALPPESSMESLTLPNHEVSVQPP
GEDQAYYHLPNITVKPADVEVTITSEPTNETESSQAQQETPIQFPEEVEPSATQQEAPIE
PPVSPMEHELSISEQQQPVQPSESSREVESSLTQQETPGQPPEHHEVTVSPPGHHQTHHL
DSPSVSVKPPDVQLTIAAEPSAEVGTSLVHQEATARLSGSGNDVEPPAIQHGGPPLLPES
SEEAGPLAVQQETSFQSPEPINNENPSPTQQEAAAEHPQTAEEGESSLTHQEAPAQTPEF
PNVVVAQPPEHSHLTQATVQPLDLGFTITPESMTEVELSPTMKETPTQPPKKVVPQLRVY
QGVTNPTPGQDQAQHPVSPSVTVQLLDLGLTITPEPTTEVGHSTAPKRTIVSPKHPEVTL
PHPDQVQTQHSHLTRATVQPLDLGFTITPKSMTEVEPSTALMTTAPPPGHPEVTLPPSDK
GQAQHSHLTQATVQPLDLELTITTKPTTEVKPSPTTEETSTQPPDLGLAIIPEPTTETGH
STALEKTTAPHPDRVQSLHRSLTEVTGPPTELEPAQDSLVQSESYTQNKALTAPEEQKAS
TSTNICELCTCGDEMLSCIDLNPEQRLRQVPVPEPNTHNGTFTIL

>gi568815581f:45508571_45708915|GENSCAN_predicted_CDS_2|2478_bp
atgatcactcagataggcatcttaaatccattagtttgggttggatttaagaccgttgtc
attcctatgaaagggaagatagcccagattgctattgagaaggctttgtcagatgcattc
cagaaactgttgattgtggttctagagatgccagccccaccccaggaatcgactgaaaat
ttggttccattcctggacacctgggattcagctggagagcagcccctggagccagagcag
ttcttggcttcacagcaggatttaaaggacaagctgagtccacaggaaagactccctgtt
tcgcccaagaagctgaagaaagatccagctcagcgttggagccttgctgagattattgga
attgcacgccaattatccacacctcagagtcagaaacagactttgcagaatgaatattcc
agtacagatacaccgtatcccagtagcctgcctccagaactccgggtgaagtcagatgag
cctccagggccctctgagcaagttggaccttctcaattccatctagagcccgaaactcaa
aatccagagacccttgaagacatccagtcctcttcactccagcaagaagccccagcacag
cttccacagctccttgaggaagaaccttcttcaatgcagcaggaggccccagctctgcct
ccagagtcctctatggagagtctaactctaccgaatcatgaggtgtcagttcaacctcca
ggtgaggatcaagcttattatcacttgcccaacattacagttaaacctgcagatgtggag
gttaccataacttcagagcctaccaatgagacagaatcttcccaagcccagcaggagacc
ccaattcagtttccagaggaggtggaaccttctgcaacccaacaggaggccccaattgag
cctccagtttctcctatggagcatgaactttccatcagtgagcagcagcagccagttcag
ccttctgagtcttctagggaggtcgaatcttctctgacccagcaggagaccccaggtcag
cctccagaacatcatgaagtcacagtttcacctccaggtcaccatcaaactcatcattta
gattcacccagtgtctctgtgaagcctccagacgtgcagctcaccatagcagcagagcct
agtgcagaggtgggaacttctctagtccaccaggaggctacagctcggctctcagggtca
ggtaatgatgtagaacctcccgccatccagcacgggggcccacctctgcttccagagtca
tcagaagaagctggacctttagcagttcaacaggagacttcatttcaatctccggaacct
attaataatgagaacccctctccaacccagcaggaggctgcagctgagcatccacagacc
gctgaggagggtgagtcttccctaacccatcaggaggccccagctcagactccagagttc
cctaatgtagttgtagctcaacctccagagcattcacacctgactcaagccacagttcaa
cctttggatctggggtttaccatcactccagaatccatgacagaggttgaactttctcca
accatgaaggagaccccaactcagcctcctaagaaagttgtaccccaacttcgagtatat
caaggggtaacaaatccaacaccaggtcaggatcaagctcagcatccagtgtcacccagc
gttacagttcaacttttggacctgggacttaccatcactccagaacccactacggaggtt
ggacattctacagccccgaagaggactatagtttctccaaagcatcctgaggtgacactt
ccacatccagaccaggttcagactcagcattcacacctgactcgagccacagttcaacct
ttggacctggggtttaccatcactccaaaatccatgacagaggttgaaccttctacagcc
ctgatgactacagctcctcctccaggacaccctgaggtgacacttccaccttcagacaag
ggtcaggctcagcattcacacctgactcaagccaccgttcaacctctggacctggagctt
accataactacaaaacctactacagaggttaaaccatctccaaccacagaggagacctca
actcagcctccagacctgggacttgccatcattccagaacccactacagagactggacat
tctacagccctggagaagactacagctcctcatccagaccgggttcagagtctgcatcga
agcctgactgaagtcacaggtccacctactgaactagaacctgctcaggattcactggtg
cagtctgaaagttacacccaaaataaggctttaactgcaccagaggaacagaaggcctcc
acaagcaccaacatatgtgagctctgtacctgcggagatgagatgttgtcatgtattgat
ctcaacccagagcagaggctccgccaagtgcctgtgccagagcccaacacccacaatggc
accttcaccatcttgtaa

>gi568815581f:45508571_45708915|GENSCAN_predicted_peptide_3|346_aa
MAGHPQAGWAARRRLGQRCSSGGCLRKCMSTSYPAVPARGPPLRVPPDDDLQRPEPRLRI
CPLQLAARRAGRHRPLHNHPLRPSCPLLLCRSTGKCELSVDCLPPNLTRTALQPALQPLG
PGLQEARLLPSPGPAPGQIALLKFSSHWTAAMAKKALEEGQPHLCGEQVAVEWLKPELKQ
RLRQQLVGPSLWSPQPDGSQLALARDKLGSQGARATLQLLCQRMKLGSSVFLTKCLGIGP
AGWHRFWYQVVIPGHPVPFSGLIWVVLILDGRDGHEVAKDAVSVRLLQALIESGANLLWS
AGAEAGSPNGHAQPVSGPNPADLGGHYLTPKVESMDEEPADLEGQL

>gi568815581f:45508571_45708915|GENSCAN_predicted_CDS_3|1041_bp
atggcgggccacccccaggctgggtgggcagcccgccgccggctgggtcagaggtgttca
tcgggcggctgcctcaggaagtgtatgagcaccagctatcctgctgttccagcgcgtggg
ccgcctctacgagttccgcctgatgatgaccttcagcggcctgaaccgcggcttcgcata
tgcccgctgcagctcgcggcgcggcgcgcaggccgccatcgcccgctgcacaaccacccg
ctgcggccgtcctgcccgctgcttctgtgccgcagcaccgggaagtgtgagctgagcgtt
gactgcctgccgccgaatctgacccgcaccgcgctgcagcccgcgctgcagccgctgggt
cccggcctgcaggaggcgcggctgctgcccagccccggaccggcgcccgggcagatcgct
ctgctcaaattcagctcgcactggaccgctgccatggccaaaaaggccctggaggaaggg
cagccacacctctgtggagagcaggtggctgtggagtggctcaagccagaactgaagcag
cgacttcgccagcagcttgtgggtccctccttgtggtccccacagccagacggcagccag
ttggccttggcaagggacaagttagggtcccaaggggctcgggctaccctgcagttgctg
tgccaacgaatgaagctgggcagctctgtgttcctcaccaagtgtttgggcataggacct
gctggctggcaccgcttctggtaccaggtggtgattcctgggcatccggtgcccttcagc
ggcctcatctgggttgtgctgatcctagatggccgggatgggcatgaggtggccaaggat
gctgtgtctgtacggctgctgcaggcactcattgagtctggggccaacctcctgtggtct
gctggggctgaggcaggcagcccgaatgggcatgcacagcctgtgtcaggccccaaccca
gcagacctgggtggccactatctgacccccaaagttgaatccatggatgaggaacctgca
gatctggagggccagctgtag

>gi568815581f:45508571_45708915|GENSCAN_predicted_peptide_4|467_aa
MKRIKKERRKEKKKEKKREGSAGGGGAGSRLQAEMLQMDLIDATGDTPGAEDDEEDDDEE
RAARRPGAGPPKAESGQEPASRGQGQSQGQSQGPGSGDTYRPKRPTTLNLFPQVQLSQDT
LNNNSLGKKSRHLHQQPTLLVDEHAQLEPVSLRPCFGDYSDESDSAIVYDNCASVSSPYE
SAIGEEYEEASRPQPPACLSKDSTPDEPDVHFSKKFLNIFMSGRSRSSSAESFGLFSCII
NREEQEQTHRTIFRFVPRHEDEPELEVDDPLLVELQAEDYWYEAYNMRTGARGFFTAYYA
IEVTKEPEHMAALAKNSDWVDQFRVKFLGSVQVPYHKGDVVLSAAMQKIATTRRLTVHFN
PPSSCVLEISVRGVKIGVKADDSQEAKGNKCSHFFQLKNISFRGYHPKNNKYFGFITKHL
ADHRFACHVFVSEDSTKALAESVGRAFQQFHKQFVEYTCPTEDIYLE

>gi568815581f:45508571_45708915|GENSCAN_predicted_CDS_4|1404_bp
atgaagagaataaagaaagaaagaagaaaagaaaagaaaaaagaaaagaaaagggagggc
tctgcgggtggcggcggcgcggggagccggttgcaggccgagatgctgcagatggacctg
atcgacgcgacgggggacactcccggggccgaggacgacgaggaggacgacgacgaggag
cgcgcggcccggcggccgggagcggggccgcccaaggccgagtccggccaggagccggcg
tcccgcggccagggccagagccaaggccagagccagggcccgggcagcggggacacgtac
cggcccaagcggcccaccacgctcaacctctttccgcaggtgcagttgtctcaggacaca
ctgaataataattctctgggcaaaaaatcgaggcacctccaccaacagcccacgctgctg
gtagatgagcacgcgcagctggagccggtgagcctgcggccgtgcttcggagactacagt
gacgagagtgactcggccatcgtctatgacaactgtgcctccgtctcctcgccctatgag
tcagccatcggagaggaatatgaggaggcctcccggccccagcctcctgcctgcctctcc
aaggactccacgcctgacgaacccgacgtccatttctccaagaagttcctgaacatcttc
atgagtggccgctcccgctcctccagtgccgagtccttcgggctgttctcctgcatcatc
aaccgggaggagcaggagcagacccaccggaccatattcaggtttgtgcctcgacacgaa
gacgaacctgagctggaagtggatgaccctctgctagtggagctccaggctgaagactac
tggtacgaggcctacaacatgcgcactggtgcccggggcttctttactgcctattacgcc
atcgaagtcaccaaggagcccgagcacatggcagccctggctaaaaacagtgactgggtg
gaccagttccgggtgaagttcctgggctcagtccaggttccctatcacaagggcgatgtc
gtcctctctgccgctatgcaaaagattgccaccacccgccggctaaccgtgcactttaac
ccgccctccagctgtgtcctggagatcagcgtgcggggtgtgaagataggtgtcaaggcc
gatgactcccaggaggccaaggggaataaatgtagccactttttccagttaaaaaacatc
tctttccgcggatatcatccaaagaacaacaagtactttgggttcatcaccaagcacctc
gccgaccaccggtttgcctgccacgtctttgtgtctgaagactccaccaaagccctggca
gagtccgtggggagagcattccagcagtttcacaagcagtttgtggagtacacctgcccc
acagaagatatctacctggagtag

>gi568815581f:45508571_45708915|GENSCAN_predicted_peptide_5|243_aa
MTKKRRNNGRAKKGRGHVQPIRCTNCARCVPKDKAIKKFVIRNIVEAAAVRDISEVSVFD
AYVLPKLYVKLHYCVSCAIHSKVVRNRSREARKDRTPPPRFRPAGAAPGPPPKPIQDLST
YYLQDSPYVNLYSNSRMDVLLLRELRDRGLIFMVDSNDREQIDEAWEVLTYLLEDDELRN
AVLLVFANKQDLPNTMNAAEITDKLGLHSLRYRNWHIQATCATTGHGLYEGLNWLANQFQ
NQN

>gi568815581f:45508571_45708915|GENSCAN_predicted_CDS_5|732_bp
atgacaaagaaaagaaggaacaatggtcgtgccaaaaagggccgcggccacgtgcagcct
attcgctgcactaactgtgcccgatgcgtgcccaaggacaaggccattaagaaattcgtc
attcgaaacatagtggaggccgcagcagtcagggacatttctgaagtgagcgtcttcgat
gcctatgtgcttcccaaactgtatgtgaagctacattactgtgtgagttgtgcaattcac
agcaaagtagtcaggaatcgatctcgtgaagcccgcaaggaccgaacacccccaccccga
tttagacctgcgggtgctgccccaggtcccccaccaaagcccatacaggatcttagcacc
tactatttgcaagactccccctatgttaacctttacagcaactctcggatggatgtttta
ttattgagagaactgagagacagaggtctgatttttatggttgacagtaatgacagagag
cagattgatgaggcctgggaagtgctaacttacttgttagaggacgatgagctcagaaat
gcagttttattggtatttgccaataaacaagatctccctaatactatgaacgcggcagag
ataacggacaagctcggcctccattccctccgctacagaaactggcacattcaggctact
tgtgccactactggacatgggctttacgaaggcctgaactggctcgccaaccagttccag
aaccagaactga

>gi568815581f:45508571_45708915|GENSCAN_predicted_peptide_6|218_aa
MASVSTSWACSKSPEFPHSLGQVPRLAPDADGAAVVNPRFGSCVAAARWSPLELSRSPNT
SDLPITASVFTRTSSRARVNIGAKHSFAEWMNGSCRSLDLGSHPLSHIPDPAASTCWAPP
AAYAVPTRCTCARTEPKRHSEGQLLELGLEEGQGSCNTHAKVRGSIPEVGETTNPPEGTN
SVHTRAAFSLVEKAKIFPQGKFRLKVQSMDALCGHDFM

>gi568815581f:45508571_45708915|GENSCAN_predicted_CDS_6|657_bp
atggcatcagtaagcaccagctgggcctgctccaagtcccctgagtttccacattccctt
gggcaggtcccaagactggcccctgatgcagatggggcagcagtggtaaatcccaggttt
ggctcctgtgttgctgcagcccgctggagcccactggaactctcccgctccccaaataca
agtgacttgccaattactgcttctgtctttacaaggacgtcttctagggctagagttaac
ataggtgccaaacactcgtttgctgaatggatgaacggctcctgccgatcattggatctg
gggtctcaccctctctcgcacatcccagatcccgccgcctccacgtgctgggcaccacct
gcagcctacgcagtgcctacacgttgcacctgtgcaaggactgagccgaaaaggcacagt
gaggggcagctgctggagctggggttggaagagggccaagggagctgtaacactcatgcg
aaggtccgtggctccattcctgaagttggtgagaccacgaacccaccggaaggaaccaac
tccgtacacactagggctgccttcagcctggtcgaaaaagccaagatctttccacaaggc
aaattccgcttgaaggtccaatccatggatgcactgtgtggacatgacttcatgtga