GENSCAN 1.0 Date run: 6-Nov-116 Time: 16:54:27 Sequence gi568815592f:27039309_27239617 : 200309 bp : 40.94% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 6154 6397 244 0 1 78 44 227 0.294 13.45 1.02 Intr + 6599 6721 123 0 0 53 31 103 0.600 0.74 1.03 Intr + 10844 11002 159 0 0 8 33 212 0.085 6.74 1.04 Term + 29852 30336 485 0 2 56 28 318 0.956 16.52 1.05 PlyA + 30786 30791 6 1.05 2.08 PlyA - 31399 31394 6 1.05 2.07 Term - 36719 36522 198 2 0 47 54 148 0.809 3.72 2.06 Intr - 52768 52580 189 1 0 99 22 185 0.355 11.86 2.05 Intr - 53246 53145 102 2 0 127 54 27 0.528 2.65 2.04 Intr - 58394 58319 76 1 1 104 75 17 0.368 0.60 2.03 Intr - 74496 74391 106 1 1 84 38 57 0.006 -1.35 2.02 Intr - 75818 75556 263 1 2 65 80 101 0.020 3.21 2.01 Init - 93442 93066 377 1 2 36 4 566 0.046 39.45 2.00 Prom - 93522 93483 40 -6.05 3.00 Prom + 93644 93683 40 -15.49 3.01 Init + 93764 94152 389 1 2 48 32 591 0.814 45.82 3.02 Term + 94418 94700 283 2 1 1 49 319 0.733 13.21 3.03 PlyA + 94943 94948 6 1.05 4.00 Prom + 95706 95745 40 -7.55 4.01 Sngl + 100001 100312 312 1 0 28 36 603 0.942 44.78 4.02 PlyA + 101307 101312 6 1.05 5.02 PlyA - 101504 101499 6 1.05 5.01 Sngl - 107490 107110 381 0 0 88 37 634 0.926 54.22 5.00 Prom - 107586 107547 40 -4.65 6.00 Prom + 107765 107804 40 -2.65 6.01 Sngl + 107821 108207 387 0 0 46 49 566 0.999 44.46 6.02 PlyA + 109595 109600 6 -0.45 7.05 PlyA - 109824 109819 6 1.05 7.04 Term - 111825 111748 78 0 0 64 55 87 0.331 -0.22 7.03 Intr - 118516 118372 145 0 1 56 81 106 0.278 6.06 7.02 Intr - 118961 118645 317 2 2 40 87 209 0.575 9.94 7.01 Init - 119813 119778 36 2 0 68 58 46 0.455 -2.32 7.00 Prom - 121262 121223 40 -3.95 8.00 Prom + 123990 124029 40 -6.65 8.01 Init + 137793 137844 52 2 1 26 48 105 0.050 1.67 8.02 Intr + 138239 138400 162 0 0 70 80 116 0.013 8.03 8.03 Intr + 140828 141139 312 0 0 77 36 162 0.016 5.13 8.04 Intr + 141728 142105 378 0 0 85 -17 170 0.363 0.21 8.05 Intr + 142622 142792 171 0 0 79 63 55 0.430 1.09 8.06 Intr + 143026 143237 212 2 2 101 88 60 0.470 5.01 8.07 Intr + 143528 143703 176 1 2 40 29 136 0.054 0.72 8.08 Intr + 148129 148262 134 1 2 96 89 58 0.460 6.07 8.09 Intr + 149151 149393 243 1 0 -18 72 216 0.770 5.95 8.10 Term + 149471 149583 113 0 2 54 49 141 0.985 4.54 8.11 PlyA + 150637 150642 6 1.05 9.00 Prom + 162997 163036 40 -2.55 9.01 Sngl + 172179 172604 426 2 0 93 37 390 0.971 30.44 9.02 PlyA + 176891 176896 6 1.05 10.04 PlyA - 178152 178147 6 1.05 10.03 Term - 185774 185703 72 2 0 84 38 93 0.145 0.83 10.02 Intr - 199349 199230 120 2 0 -31 84 126 0.299 0.07 10.01 Intr - 199851 199726 126 0 0 71 6 194 0.333 9.46 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 93442 93062 381 1 0 36 41 570 0.934 43.02 S.002 Term - 166594 166480 115 0 1 114 40 117 0.913 6.56 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592f:27039309_27239617|GENSCAN_predicted_peptide_1|336_aa ALAAFLWGRARDLQPAMPDPPPHSVGSKPPRRAPPPTPRAPSPIDHPSAEECGCTAQDWQ AAPPAAPVRDPLGKASWAPESANLVGTWRTFVSSSGIVNAPINTLSKRTNQLSVKWTNQQ DVDSSRQTHTRLDVKTNTLAEEHTSTWMWRGCREHADRYRQACRPSTDRPNDLEFEGAAP GTVHQTLLYNGSGSLVDQKEADVSGWELGNDQPPSPWALESSSSTSDRRPQNLLFHCLPL TSSALDSPLPKPKAAPIHSCQVHSPGESVQHASGWLFWQSSTNQGLFIFPQVPVLMGNSH WCHNVNVIISDFAKGKGVLLYNVNPWVQLCPLPEAR >gi568815592f:27039309_27239617|GENSCAN_predicted_CDS_1|1011_bp gccttagctgccttcctgtggggcagggctcgggacctgcagcccgccatgcctgaccct cccccgcactccgtgggctccaagcctccccgacgagcgccgccccctactccacgggca cccagtcccatcgaccacccaagcgctgaggagtgcgggtgcacggcgcaggactggcag gcagctccacctgcggccccagtgcgggatccactgggtaaagccagctgggctcctgag tctgctaatctagtggggacgtggagaacttttgtgtctagctcagggattgtaaacgca ccaatcaacaccctgtcaaaacggaccaatcagctctctgtaaaatggaccaatcagcag gatgtggactctagcaggcagacacacacgcggctggacgtcaagacaaacacattggca gaggaacacacaagcacctggatgtggagagggtgtcgagagcacgctgacaggtaccgg caagcctgcaggccatcgaccgaccggccgaacgacttggagtttgaaggggcagctcca ggcacagtgcatcaaacactgctgtacaatggctctggctcccttgtggaccagaaagaa gcagatgtttctggatgggagttgggaaatgatcagccaccttctccctgggccctggaa tcttcctcttccacaagtgacagaagacctcagaatcttctctttcactgcctacccctg accagctctgccctggacagtccccttccaaagccaaaggcagccccaatccacagttgt caggtacactcacctggggagtctgtgcagcatgcctctgggtggctgttctggcagagc agcaccaaccagggcctgtttatatttccccaagtcccagtgctcatgggcaactcccac tggtgtcacaatgtcaatgtcataatttctgactttgccaaaggaaaaggggtccttctg tacaatgtcaatccctgggtacaactctgccctctccctgaggctcgataa >gi568815592f:27039309_27239617|GENSCAN_predicted_peptide_2|436_aa MPEPAKSAPAPKKGSKKAVTKAQKKDGKKRKRSRKESYSIYVYKVLKQVHPDTGISSKAM GIMNSFVNDIFERIAGEASRLAHYNKRSTITSREIQTAVRLLLPGELAKHAVSEGTKAVT KYTSANSGSCSDSCSGLLLLLNTAPALPVCSTGGASTLLYTAILGCSACGLVVLSAASTL LYTVAVGFSAYGLALLCGAPALLYYAATLGCSMGVSGNFPTALEEKGGGNLKGKVTGLSG LGRISGFSSPTVTFLSSLLPSPLSQARRRFCSHQLDEPGLAIFPRVNSQPWCYILPWEFH TFSKRVIMHSVKHAHARAHELPTVFLSGFEKRQTEVGERSCSRINIPVSALRRSPDSVLA CLDNSTVDVVKNTLDRVICKEKKFILEAGKSKVMGLASGKGFCSASRHGEKATRQENVDE RGKKKQADWFYNNSLS >gi568815592f:27039309_27239617|GENSCAN_predicted_CDS_2|1311_bp atgccagagccagcgaagtctgctcccgccccgaaaaagggctccaagaaggcggtgact aaggcgcagaagaaagacggcaagaagcgcaagcgcagccgcaaggagagctattccatc tatgtgtacaaggttctgaagcaggtccaccctgacaccggcatttcgtccaaggccatg ggcatcatgaattcgtttgtgaacgacattttcgagcgcatcgcaggtgaggcttcccgc ctggcgcattacaacaagcgctcgaccatcacctccagggagatccagacggccgtgcgc ctgctgctgcctggggagttggccaagcacgccgtgtccgagggtactaaggccgtcacc aagtacaccagcgctaattctggctcctgctctgactcctgctctggactgctcctgttg ctgaacactgctcctgcccttcctgtctgttctactggtggagcatccactttgctgtac actgctattctgggctgttctgcctgtgggttagttgtgctttctgcagcatccaccttg ctgtacacagttgctgtaggcttctctgcctatgggttagccttgctttgtggagcaccc gctttgctgtactatgctgctactctgggatgctctatgggagtcagtggtaatttccct actgctcttgaagaaaagggaggtggaaatttgaaaggaaaagtaactggactctctgga cttggaagaatttctgggtttagtagccccaccgtcaccttcctctcctctctattgccc tctcccctttctcaggctcgaaggcgtttttgcagtcatcagttagacgaacctgggcta gcaatattcccaagagtcaatagtcagccctggtgctatatacttccatgggaattccat actttctcaaaacgcgtcatcatgcactcggttaaacacgcacacgcacgcgcgcacgag ctgcccacggtcttcctttcggggtttgagaagagacagacagaggtgggagagcgcagc tgcagccgcatcaacattccggtgtctgcccttagaagaagtccggattcagtactcgcc tgcttagacaattctactgttgatgtggtgaaaaataccttagaccgggtaatttgtaaa gaaaagaagtttattctggaggctgggaagtccaaggtcatgggactggcttctggcaag ggtttttgttctgcatcacgacatggtgaaaaagcaacacgacaagagaatgtggatgaa agagggaaaaagaagcaggctgactggttttataacaactcactctcatga >gi568815592f:27039309_27239617|GENSCAN_predicted_peptide_3|223_aa MSGRGKQGGKARAKAKTRSSRAGLQFPVGRVHRLLRKGNYAERVGAGAPVYLAAVLEYLT AEILELAGNAARDNKKTRIIPRHLQLAIRNDEELNKLLGKVTIAQGGVLPNIQAVLLPKK TESHHKAKGKPLLNFKLNQCVSTEHQWLREFLKHFGPKELSPGLEFSRQLFHAGQGGMKK SRRYVPGTVALRDVRRYQNSELLISKLPLLRELGGDAAARERG >gi568815592f:27039309_27239617|GENSCAN_predicted_CDS_3|672_bp atgtctggacgtggcaagcagggaggcaaagcccgcgctaaggccaagactcgctcttct agggccggtctccagttccccgtgggccgagtgcaccgcctgctccgcaaaggcaactat gccgagcgggtcggggccggcgcgccggtgtatctggcagcggtgctggagtacctgacc gccgagatcctggaactggcgggcaacgcggcccgcgacaacaagaagacccgcatcatc ccgcgtcatctccaactggccatccgcaacgacgaggagctcaacaagctgctgggcaaa gtcaccatcgcacagggcggtgtcctgcccaacattcaggccgtgctactgcccaaaaag actgagagccaccacaaggcgaagggcaagccattacttaattttaaacttaatcagtgt gtgtccacagagcaccagtggcttcgtgagttcttgaagcactttggcccaaaagagtta tcgcccgggctagaatttagcagacagctgttccacgcgggccagggcggcatgaagaag tcccgccgctacgtgcccggcacagtggccctgcgcgacgttcggcgctaccagaactcc gagctgctgatcagcaagctgccgctcctgcgagagctcggcggtgacgccgctgcacga gagcgaggctga >gi568815592f:27039309_27239617|GENSCAN_predicted_peptide_4|103_aa MSGRGKGGKGLGKGGAKRHRKVLRDNIQGITKPAIRRLARRGGVKRISGLIYEETRGVLK VFLENVIRDAVTYTEHAKRKTVTAMDVVYALKRQGRTLYGFGG >gi568815592f:27039309_27239617|GENSCAN_predicted_CDS_4|312_bp atgtcaggacgcggcaaaggaggtaagggcctggggaaagggggtgccaagcgccaccgc aaggtgctgcgcgacaacatccagggtatcaccaagccagccattcggcgccttgctcgc cgcggcggcgtgaagcgcatttctggcctcatctatgaggagacccgcggagtgttgaag gtgttcctggagaacgtgatccgggacgccgtgacctacacggagcacgccaagcgcaag acggtcaccgccatggacgtggtctacgcgctcaagcgccagggccgcaccctctatggc ttcggcggctaa >gi568815592f:27039309_27239617|GENSCAN_predicted_peptide_5|126_aa MPEPAKSAPAPKKGSKKAVTKAQKKDGKKRKRSRKESYSVYVYKVLKQVHPDTGISSKAM GIMNSFVNDIFERIAGEASRLAHYNKRSTITSREIQTAVRLLLPGELAKHAVSEGTKAVT KYTSAK >gi568815592f:27039309_27239617|GENSCAN_predicted_CDS_5|381_bp atgccggaaccagcgaagtccgctcccgcgcccaagaagggctcgaagaaagccgtgact aaggcgcagaagaaggacggcaagaagcgcaagcgcagccgcaaggagagctactccgta tacgtgtacaaggtgctgaagcaggtccaccccgacaccggcatctcctctaaggccatg ggaatcatgaactccttcgtcaacgacatcttcgaacgcatcgcgggtgaggcttcccgc ctggcgcattacaacaagcgctcgaccatcacctccagggagatccagacggccgtgcgc ctgctgctgcccggggagttggccaagcacgccgtgtccgagggcaccaaggccgtcacc aagtacaccagcgctaagtaa >gi568815592f:27039309_27239617|GENSCAN_predicted_peptide_6|128_aa MSGRGKQGGKARAKAKTRSSRAGLQFPVGRVHRLLRKGNYAERVGAGAPVYLAAVLEYLT AEILELAGNAARDNKKTRIIPRHLQLAIRNDEELNKLLGKVTIAQGGVLPNIQAVLLPKK TESHHKAK >gi568815592f:27039309_27239617|GENSCAN_predicted_CDS_6|387_bp atgtctggacgtggcaagcaaggcggtaaagctcgcgccaaggccaagacccgctcttct cgggctgggcttcagttccccgtgggccgagtgcaccgcctgctccgcaagggtaattat gccgagcgggttggagccggcgcgccagtgtacctggctgcggtgctggagtacctgacc gctgagatcctggagctggctggcaatgcggcccgcgacaacaagaagacccgtatcatc ccgcgtcacctccaactggccatccgcaacgacgaggagctcaacaagctgctgggcaaa gtcaccatcgcgcagggtggtgtcttgcccaatatccaggccgtgctgctgcctaagaag actgagagccaccataaggccaaataa >gi568815592f:27039309_27239617|GENSCAN_predicted_peptide_7|191_aa MLAMLVSNSRPQYVEKLRRRQGTAQYVEKLRRHQGTAVTRIRTEVAAATTQSTNHYTITA RHQRPLLVGLFLCLFPRNVCFCHLKNGAVFRPYLHTYPVYFLIHSPSPIFCLWQRSQSLA VWSSDLWQQKEEGAALKWTCQIPFFAKLVRIFGGLAWNLASRQVPENYSLQGMHDNDSIC NRYEDEGEGRN >gi568815592f:27039309_27239617|GENSCAN_predicted_CDS_7|576_bp atgttggccatgctggtctcgaactcccgacctcagtatgttgaaaaattgaggcgacgt caaggtactgctcagtacgttgagaaattgaggcgacatcaaggtactgccgtgactcgg attcgaaccgaggttgctgcggccacaacgcagagtactaaccactatacgatcacggca cgccaccagaggccgctgcttgtggggctcttcctttgcttatttcccagaaacgtctgt ttttgtcatctaaagaatggtgcagtctttcgtccttatttacacacctacccagtttac ttccttatccactccccgtccccaatattttgcttatggcaaagaagccaaagtttagca gtgtggtcctcagatctttggcaacaaaaggaagagggcgctgctttgaagtggacctgc cagatcccattttttgccaaactagtcagaatcttcgggggcttggcatggaatctggca tctcgacaggtgccagaaaattattccttacaaggaatgcatgacaacgactccatctgt aataggtatgaagatgagggggaagggcggaactga >gi568815592f:27039309_27239617|GENSCAN_predicted_peptide_8|650_aa MDRRGYGVRCKTEVYKRGEGLRTNQKAAPSLRSRPVPHGTNRRAAAGSQDRRVRRTRRVR IPLCSLRPFQPGILFLGEERTMICKAAMIAWERKHPPRQNVLAAEHKFPAQDPQWDNNSA AHRENMRDRDMIIKGIQESVPPTQNISQAFNVQQEKDEGPMEFLNRLKEQIRKYADLSCS AEDLTVSGVKGEGFRAKILEETEVKYKNKSVATKFFLIPEAGTNLLGRDLMLRLGTGLYV NQGKLLTSLNVLTTSEESRIHPNVWSKEGNRGWVPPIHVKLKTPGEIVKQKQYPIPLEAR IGDNKDQVTAISVNFLNFLRGQGLRVSKNNIQFIESEVKYLGHLISKGERKIGSERIEEQ PFHLFINVSKGVALGVLTQKHGGHRQPVVFLSKILDPVTRGWPECVQSIAATALLTEESR KITFGGNLFVIEGKRHNGYSIVDRETLTVVESERLPNKWSAQICELFALNQALKSLQNQE GTIYTYSNFPNPERLSSADPTGCQLFGLLDILLKIHWEKGLWGHKLAIKIWEIWSLVRAA LEPFQTNDEAESEEEKEEFDNQDSETPLPSTSQKESPEVIYANPPSLPKPIQKLIQLRVP QEECPEWPPPPQPIHYGEGVIQVRPAVHYSEGAISLKMDDPAPCLGRTVA >gi568815592f:27039309_27239617|GENSCAN_predicted_CDS_8|1953_bp atggataggcgaggctacggagttcgatgtaaaaccgaagtttacaagcgaggtgaagga ctcagaacaaaccaaaaagcagctccaagcttgcgatcgcggccagttcctcacggaacc aaccgcagagccgccgcaggttcgcaggatcggagggttcggaggactcggagggtgcgg attcctctctgctccttgagaccattccagccaggtattctcttcttaggagaggaaaga accatgatctgcaaggctgctatgatagcctgggagcgcaagcacccccctcgtcagaat gtccttgcagcagagcataaattcccggcccaggaccctcaatgggataacaacagtgcg gctcaccgggaaaacatgagagatagggatatgataattaaagggattcaggaatcagtt cctccaacccaaaacatttcccaagcatttaatgtacagcaagagaaagatgaagggccc atggaattcttaaacagacttaaggaacagataagaaaatacgcagatttatcctgttca gccgaagaccttactgtctcaggagttaaaggggaaggattcagagcaaaaattctagag gaaaccgaagtcaaatataagaataagtcggttgctactaagttcttcttaattcctgaa gcaggaactaatttattaggaagggacttaatgttaaggttaggcacaggcctatatgtt aatcaaggaaaactccttacttccttaaacgtactcaccacttcagaagaaagccgcatc catcccaacgtatggtcgaaagaagggaatcgaggatgggttcctccaatccatgtcaaa ttaaaaactcctggggaaatagtgaaacaaaaacaataccctattcccttggaagccagg ataggagataacaaggatcaagtaacagcaatttcagttaacttcctaaatttcctaagg ggacaaggattacgggtctcaaagaacaacatccaattcatagaatctgaggtaaaatat ctaggacacctaatcagtaaaggtgaacgaaagataggatccgaacgaattgaagaacag ccattccatcttttcattaatgtaagcaaaggagtggccttaggggtactcacccaaaaa catggaggccaccggcagcctgtagtctttctgtcaaaaatccttgacccagtaacccgt ggatggcccgaatgtgttcaatccatagcagcaactgccttgctaacagaagaaagcagg aaaataacctttgggggaaacctcttcgtcattgaaggaaaaaggcacaacgggtactcc atagttgatagggaaaccctcacagtggtagagtcagaaagactgccaaataagtggtct gcccaaatatgtgaactctttgcattaaaccaagccttaaaatccctgcagaatcaggaa ggaactatttacacttactccaactttccaaaccctgagagattgagctctgctgaccca acagggtgtcagctgtttgggttgctggacattctcttgaagatacattgggaaaaaggc ttatggggccataaactggcaataaaaatatgggagatttggtcattggtgcgtgccgca ctggagccatttcagacaaatgatgaggctgagtcagaggaggagaaggaggagtttgat aatcaggactctgaaacgcctctaccgagtactagccaaaaggagagtccggaagtaatt tatgccaatccccccagtcttcctaaacctattcagaaactcattcagctcagggttcct caagaggaatgtccggaatggccacctcctcctcagccgattcattatggagagggggta attcaggttcgccctgcagttcattacagtgaaggagcaatttcccttaaaatggatgac ccagcgccctgtctgggtcgaacagtggcctaa >gi568815592f:27039309_27239617|GENSCAN_predicted_peptide_9|141_aa MAFISECGCTPSTSSASTGCCPVLGLTGSKQVWEVPLESPQGTVARILIGQVIMSIRTKL QNKEHVIEALRRVKFKFPGRQKVHISKKGGFTKFDADEFEDMVAEKQLIPDGCGIKYIPD RGPLDKGGLCIHEGFHCAASS >gi568815592f:27039309_27239617|GENSCAN_predicted_CDS_9|426_bp atggctttcatatccgagtgcggctgcaccccttccacgtcatctgcatcaacaggatgt tgtcctgtgctgggactgacaggctccaaacaggtatgggaggtgcctttggaaagcccg cagggcactgtggccaggattctcattggccaagttatcatgtccattcgcaccaagctg cagaacaaggagcatgtgattgaggcccttcgcagggtcaagttcaagttccctggccgc cagaaggtccacatttcaaagaaggggggcttcaccaagttcgatgcagatgaatttgaa gacatggtggctgagaagcagctcatcccagatggctgtgggatcaagtacatccccgat cgtggccctctggacaagggagggctttgcattcatgagggcttccactgtgctgcctcc tcttaa >gi568815592f:27039309_27239617|GENSCAN_predicted_peptide_10|105_aa IFSRVMEASVIVFVLKIEIRETSKWPENLEHSLVDTIVTPEMRRELGEKRYAQSPLLENG DLRAVIKGQLSPPSSHSTTNEEDYDLRVDWKLISGNLPNPLTDPY >gi568815592f:27039309_27239617|GENSCAN_predicted_CDS_10|318_bp atattcagcagggtcatggaggctagcgttatcgtctttgtcctcaaaatagaaatccga gaaacctccaagtggcccgaaaatttagagcactcgctcgtggataccattgtcactcca gaaatgcggcgggaactgggagagaaaaggtatgcacaaagtcctttgttggagaacgga gacctccgagctgttattaaaggacagctttctccaccatcttcccattccaccaccaat gaggaggactatgatctccgggttgactggaagttgatttctggaaatctgcccaaccct cttactgatccttactga