GENSCAN 1.0 Date run: 6-Nov-116 Time: 16:55:01

Sequence gi568815592r:27032373_27232750 : 200378 bp : 40.80% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   6779   7035  257  0  2   56   86   215 0.794  14.34
 1.02 Intr +  13090  13333  244  0  1   78   44   227 0.626  13.45
 1.03 Intr +  13535  13657  123  0  0   53   31   103 0.594   0.74
 1.04 Intr +  17780  17938  159  0  0    8   33   212 0.084   6.74
 1.05 Term +  36788  37272  485  0  2   56   28   318 0.956  16.52
 1.06 PlyA +  37722  37727    6                               1.05

 2.08 PlyA -  38335  38330    6                               1.05
 2.07 Term -  43655  43458  198  2  0   47   54   148 0.809   3.72
 2.06 Intr -  59704  59516  189  1  0   99   22   185 0.355  11.86
 2.05 Intr -  60182  60081  102  2  0  127   54    27 0.528   2.65
 2.04 Intr -  65330  65255   76  1  1  104   75    17 0.368   0.60
 2.03 Intr -  81432  81327  106  1  1   84   38    57 0.006  -1.35
 2.02 Intr -  82754  82492  263  1  2   65   80   101 0.020   3.21
 2.01 Init - 100378 100002  377  1  2   36    4   566 0.046  39.45
 2.00 Prom - 100458 100419   40                              -6.05

 3.00 Prom + 100580 100619   40                             -15.49
 3.01 Init + 100700 101088  389  1  2   48   32   591 0.814  45.82
 3.02 Term + 101354 101636  283  2  1    1   49   319 0.733  13.21
 3.03 PlyA + 101879 101884    6                               1.05

 4.00 Prom + 102642 102681   40                              -7.55
 4.01 Sngl + 106937 107248  312  1  0   28   36   603 0.942  44.78
 4.02 PlyA + 108243 108248    6                               1.05

 5.02 PlyA - 108440 108435    6                               1.05
 5.01 Sngl - 114426 114046  381  0  0   88   37   634 0.926  54.22
 5.00 Prom - 114522 114483   40                              -4.65

 6.00 Prom + 114701 114740   40                              -2.65
 6.01 Sngl + 114757 115143  387  0  0   46   49   566 0.999  44.46
 6.02 PlyA + 116531 116536    6                              -0.45

 7.05 PlyA - 116760 116755    6                               1.05
 7.04 Term - 118761 118684   78  0  0   64   55    87 0.331  -0.22
 7.03 Intr - 125452 125308  145  0  1   56   81   106 0.278   6.06
 7.02 Intr - 125897 125581  317  2  2   40   87   209 0.575   9.94
 7.01 Init - 126749 126714   36  2  0   68   58    46 0.455  -2.32
 7.00 Prom - 128198 128159   40                              -3.95

 8.00 Prom + 130926 130965   40                              -6.65
 8.01 Init + 144729 144780   52  2  1   26   48   105 0.050   1.67
 8.02 Intr + 145175 145336  162  0  0   70   80   116 0.013   8.03
 8.03 Intr + 147764 148075  312  0  0   77   36   162 0.016   5.13
 8.04 Intr + 148664 149041  378  0  0   85  -17   170 0.363   0.21
 8.05 Intr + 149558 149728  171  0  0   79   63    55 0.430   1.09
 8.06 Intr + 149962 150173  212  2  2  101   88    60 0.470   5.01
 8.07 Intr + 150464 150639  176  1  2   40   29   136 0.054   0.72
 8.08 Intr + 155065 155198  134  1  2   96   89    58 0.460   6.07
 8.09 Intr + 156087 156329  243  1  0  -18   72   216 0.770   5.95
 8.10 Term + 156407 156519  113  0  2   54   49   141 0.985   4.54
 8.11 PlyA + 157573 157578    6                               1.05

 9.00 Prom + 169933 169972   40                              -2.55
 9.01 Sngl + 179115 179540  426  2  0   93   37   390 0.971  30.44
 9.02 PlyA + 183827 183832    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl - 100378  99998  381  1  0   36   41   570 0.934  43.02
S.002 Term - 173530 173416  115  0  1  114   40   117 0.913   6.56

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592r:27032373_27232750|GENSCAN_predicted_peptide_1|422_aa
XGAAPGTVHQTLLYNGSGSLVDQKEADVSGWELGNDQPPSPWALAPSSSTSDKGPQTLLF
HCLPLTSSALDSPLPKPKAAPIHSCQALAAFLWGRARDLQPAMPDPPPHSVGSKPPRRAP
PPTPRAPSPIDHPSAEECGCTAQDWQAAPPAAPVRDPLGKASWAPESANLVGTWRTFVSS
SGIVNAPINTLSKRTNQLSVKWTNQQDVDSSRQTHTRLDVKTNTLAEEHTSTWMWRGCRE
HADRYRQACRPSTDRPNDLEFEGAAPGTVHQTLLYNGSGSLVDQKEADVSGWELGNDQPP
SPWALESSSSTSDRRPQNLLFHCLPLTSSALDSPLPKPKAAPIHSCQVHSPGESVQHASG
WLFWQSSTNQGLFIFPQVPVLMGNSHWCHNVNVIISDFAKGKGVLLYNVNPWVQLCPLPE
AR
>gi568815592r:27032373_27232750|GENSCAN_predicted_CDS_1|1269_bp
naaggggcagctccaggcacagtgcatcaaacactgctgtacaatggctctggctccctt
gtggaccagaaagaagcagatgtttctggttgggagttgggaaatgatcagccaccttct
ccctgggccctggcaccttcctcttccacaagtgacaaaggacctcagactcttctcttt
cactgtctacccctgaccagctctgccctggacagtccccttccaaagccaaaggcagcc
ccaatccacagttgtcaggccttagctgccttcctgtggggcagggctcgggacctgcag
cccgccatgcctgaccctcccccgcactccgtgggctccaagcctccccgacgagcgccg
ccccctactccacgggcacccagtcccatcgaccacccaagcgctgaggagtgcgggtgc
acggcgcaggactggcaggcagctccacctgcggccccagtgcgggatccactgggtaaa
gccagctgggctcctgagtctgctaatctagtggggacgtggagaacttttgtgtctagc
tcagggattgtaaacgcaccaatcaacaccctgtcaaaacggaccaatcagctctctgta
aaatggaccaatcagcaggatgtggactctagcaggcagacacacacgcggctggacgtc
aagacaaacacattggcagaggaacacacaagcacctggatgtggagagggtgtcgagag
cacgctgacaggtaccggcaagcctgcaggccatcgaccgaccggccgaacgacttggag
tttgaaggggcagctccaggcacagtgcatcaaacactgctgtacaatggctctggctcc
cttgtggaccagaaagaagcagatgtttctggatgggagttgggaaatgatcagccacct
tctccctgggccctggaatcttcctcttccacaagtgacagaagacctcagaatcttctc
tttcactgcctacccctgaccagctctgccctggacagtccccttccaaagccaaaggca
gccccaatccacagttgtcaggtacactcacctggggagtctgtgcagcatgcctctggg
tggctgttctggcagagcagcaccaaccagggcctgtttatatttccccaagtcccagtg
ctcatgggcaactcccactggtgtcacaatgtcaatgtcataatttctgactttgccaaa
ggaaaaggggtccttctgtacaatgtcaatccctgggtacaactctgccctctccctgag
gctcgataa
>gi568815592r:27032373_27232750|GENSCAN_predicted_peptide_2|436_aa
MPEPAKSAPAPKKGSKKAVTKAQKKDGKKRKRSRKESYSIYVYKVLKQVHPDTGISSKAM
GIMNSFVNDIFERIAGEASRLAHYNKRSTITSREIQTAVRLLLPGELAKHAVSEGTKAVT
KYTSANSGSCSDSCSGLLLLLNTAPALPVCSTGGASTLLYTAILGCSACGLVVLSAASTL
LYTVAVGFSAYGLALLCGAPALLYYAATLGCSMGVSGNFPTALEEKGGGNLKGKVTGLSG
LGRISGFSSPTVTFLSSLLPSPLSQARRRFCSHQLDEPGLAIFPRVNSQPWCYILPWEFH
TFSKRVIMHSVKHAHARAHELPTVFLSGFEKRQTEVGERSCSRINIPVSALRRSPDSVLA
CLDNSTVDVVKNTLDRVICKEKKFILEAGKSKVMGLASGKGFCSASRHGEKATRQENVDE
RGKKKQADWFYNNSLS
>gi568815592r:27032373_27232750|GENSCAN_predicted_CDS_2|1311_bp
atgccagagccagcgaagtctgctcccgccccgaaaaagggctccaagaaggcggtgact
aaggcgcagaagaaagacggcaagaagcgcaagcgcagccgcaaggagagctattccatc
tatgtgtacaaggttctgaagcaggtccaccctgacaccggcatttcgtccaaggccatg
ggcatcatgaattcgtttgtgaacgacattttcgagcgcatcgcaggtgaggcttcccgc
ctggcgcattacaacaagcgctcgaccatcacctccagggagatccagacggccgtgcgc
ctgctgctgcctggggagttggccaagcacgccgtgtccgagggtactaaggccgtcacc
aagtacaccagcgctaattctggctcctgctctgactcctgctctggactgctcctgttg
ctgaacactgctcctgcccttcctgtctgttctactggtggagcatccactttgctgtac
actgctattctgggctgttctgcctgtgggttagttgtgctttctgcagcatccaccttg
ctgtacacagttgctgtaggcttctctgcctatgggttagccttgctttgtggagcaccc
gctttgctgtactatgctgctactctgggatgctctatgggagtcagtggtaatttccct
actgctcttgaagaaaagggaggtggaaatttgaaaggaaaagtaactggactctctgga
cttggaagaatttctgggtttagtagccccaccgtcaccttcctctcctctctattgccc
tctcccctttctcaggctcgaaggcgtttttgcagtcatcagttagacgaacctgggcta
gcaatattcccaagagtcaatagtcagccctggtgctatatacttccatgggaattccat
actttctcaaaacgcgtcatcatgcactcggttaaacacgcacacgcacgcgcgcacgag
ctgcccacggtcttcctttcggggtttgagaagagacagacagaggtgggagagcgcagc
tgcagccgcatcaacattccggtgtctgcccttagaagaagtccggattcagtactcgcc
tgcttagacaattctactgttgatgtggtgaaaaataccttagaccgggtaatttgtaaa
gaaaagaagtttattctggaggctgggaagtccaaggtcatgggactggcttctggcaag
ggtttttgttctgcatcacgacatggtgaaaaagcaacacgacaagagaatgtggatgaa
agagggaaaaagaagcaggctgactggttttataacaactcactctcatga
>gi568815592r:27032373_27232750|GENSCAN_predicted_peptide_3|223_aa
MSGRGKQGGKARAKAKTRSSRAGLQFPVGRVHRLLRKGNYAERVGAGAPVYLAAVLEYLT
AEILELAGNAARDNKKTRIIPRHLQLAIRNDEELNKLLGKVTIAQGGVLPNIQAVLLPKK
TESHHKAKGKPLLNFKLNQCVSTEHQWLREFLKHFGPKELSPGLEFSRQLFHAGQGGMKK
SRRYVPGTVALRDVRRYQNSELLISKLPLLRELGGDAAARERG
>gi568815592r:27032373_27232750|GENSCAN_predicted_CDS_3|672_bp
atgtctggacgtggcaagcagggaggcaaagcccgcgctaaggccaagactcgctcttct
agggccggtctccagttccccgtgggccgagtgcaccgcctgctccgcaaaggcaactat
gccgagcgggtcggggccggcgcgccggtgtatctggcagcggtgctggagtacctgacc
gccgagatcctggaactggcgggcaacgcggcccgcgacaacaagaagacccgcatcatc
ccgcgtcatctccaactggccatccgcaacgacgaggagctcaacaagctgctgggcaaa
gtcaccatcgcacagggcggtgtcctgcccaacattcaggccgtgctactgcccaaaaag
actgagagccaccacaaggcgaagggcaagccattacttaattttaaacttaatcagtgt
gtgtccacagagcaccagtggcttcgtgagttcttgaagcactttggcccaaaagagtta
tcgcccgggctagaatttagcagacagctgttccacgcgggccagggcggcatgaagaag
tcccgccgctacgtgcccggcacagtggccctgcgcgacgttcggcgctaccagaactcc
gagctgctgatcagcaagctgccgctcctgcgagagctcggcggtgacgccgctgcacga
gagcgaggctga
>gi568815592r:27032373_27232750|GENSCAN_predicted_peptide_4|103_aa
MSGRGKGGKGLGKGGAKRHRKVLRDNIQGITKPAIRRLARRGGVKRISGLIYEETRGVLK
VFLENVIRDAVTYTEHAKRKTVTAMDVVYALKRQGRTLYGFGG
>gi568815592r:27032373_27232750|GENSCAN_predicted_CDS_4|312_bp
atgtcaggacgcggcaaaggaggtaagggcctggggaaagggggtgccaagcgccaccgc
aaggtgctgcgcgacaacatccagggtatcaccaagccagccattcggcgccttgctcgc
cgcggcggcgtgaagcgcatttctggcctcatctatgaggagacccgcggagtgttgaag
gtgttcctggagaacgtgatccgggacgccgtgacctacacggagcacgccaagcgcaag
acggtcaccgccatggacgtggtctacgcgctcaagcgccagggccgcaccctctatggc
ttcggcggctaa
>gi568815592r:27032373_27232750|GENSCAN_predicted_peptide_5|126_aa
MPEPAKSAPAPKKGSKKAVTKAQKKDGKKRKRSRKESYSVYVYKVLKQVHPDTGISSKAM
GIMNSFVNDIFERIAGEASRLAHYNKRSTITSREIQTAVRLLLPGELAKHAVSEGTKAVT
KYTSAK
>gi568815592r:27032373_27232750|GENSCAN_predicted_CDS_5|381_bp
atgccggaaccagcgaagtccgctcccgcgcccaagaagggctcgaagaaagccgtgact
aaggcgcagaagaaggacggcaagaagcgcaagcgcagccgcaaggagagctactccgta
tacgtgtacaaggtgctgaagcaggtccaccccgacaccggcatctcctctaaggccatg
ggaatcatgaactccttcgtcaacgacatcttcgaacgcatcgcgggtgaggcttcccgc
ctggcgcattacaacaagcgctcgaccatcacctccagggagatccagacggccgtgcgc
ctgctgctgcccggggagttggccaagcacgccgtgtccgagggcaccaaggccgtcacc
aagtacaccagcgctaagtaa
>gi568815592r:27032373_27232750|GENSCAN_predicted_peptide_6|128_aa
MSGRGKQGGKARAKAKTRSSRAGLQFPVGRVHRLLRKGNYAERVGAGAPVYLAAVLEYLT
AEILELAGNAARDNKKTRIIPRHLQLAIRNDEELNKLLGKVTIAQGGVLPNIQAVLLPKK
TESHHKAK
>gi568815592r:27032373_27232750|GENSCAN_predicted_CDS_6|387_bp
atgtctggacgtggcaagcaaggcggtaaagctcgcgccaaggccaagacccgctcttct
cgggctgggcttcagttccccgtgggccgagtgcaccgcctgctccgcaagggtaattat
gccgagcgggttggagccggcgcgccagtgtacctggctgcggtgctggagtacctgacc
gctgagatcctggagctggctggcaatgcggcccgcgacaacaagaagacccgtatcatc
ccgcgtcacctccaactggccatccgcaacgacgaggagctcaacaagctgctgggcaaa
gtcaccatcgcgcagggtggtgtcttgcccaatatccaggccgtgctgctgcctaagaag
actgagagccaccataaggccaaataa
>gi568815592r:27032373_27232750|GENSCAN_predicted_peptide_7|191_aa
MLAMLVSNSRPQYVEKLRRRQGTAQYVEKLRRHQGTAVTRIRTEVAAATTQSTNHYTITA
RHQRPLLVGLFLCLFPRNVCFCHLKNGAVFRPYLHTYPVYFLIHSPSPIFCLWQRSQSLA
VWSSDLWQQKEEGAALKWTCQIPFFAKLVRIFGGLAWNLASRQVPENYSLQGMHDNDSIC
NRYEDEGEGRN
>gi568815592r:27032373_27232750|GENSCAN_predicted_CDS_7|576_bp
atgttggccatgctggtctcgaactcccgacctcagtatgttgaaaaattgaggcgacgt
caaggtactgctcagtacgttgagaaattgaggcgacatcaaggtactgccgtgactcgg
attcgaaccgaggttgctgcggccacaacgcagagtactaaccactatacgatcacggca
cgccaccagaggccgctgcttgtggggctcttcctttgcttatttcccagaaacgtctgt
ttttgtcatctaaagaatggtgcagtctttcgtccttatttacacacctacccagtttac
ttccttatccactccccgtccccaatattttgcttatggcaaagaagccaaagtttagca
gtgtggtcctcagatctttggcaacaaaaggaagagggcgctgctttgaagtggacctgc
cagatcccattttttgccaaactagtcagaatcttcgggggcttggcatggaatctggca
tctcgacaggtgccagaaaattattccttacaaggaatgcatgacaacgactccatctgt
aataggtatgaagatgagggggaagggcggaactga
>gi568815592r:27032373_27232750|GENSCAN_predicted_peptide_8|650_aa
MDRRGYGVRCKTEVYKRGEGLRTNQKAAPSLRSRPVPHGTNRRAAAGSQDRRVRRTRRVR
IPLCSLRPFQPGILFLGEERTMICKAAMIAWERKHPPRQNVLAAEHKFPAQDPQWDNNSA
AHRENMRDRDMIIKGIQESVPPTQNISQAFNVQQEKDEGPMEFLNRLKEQIRKYADLSCS
AEDLTVSGVKGEGFRAKILEETEVKYKNKSVATKFFLIPEAGTNLLGRDLMLRLGTGLYV
NQGKLLTSLNVLTTSEESRIHPNVWSKEGNRGWVPPIHVKLKTPGEIVKQKQYPIPLEAR
IGDNKDQVTAISVNFLNFLRGQGLRVSKNNIQFIESEVKYLGHLISKGERKIGSERIEEQ
PFHLFINVSKGVALGVLTQKHGGHRQPVVFLSKILDPVTRGWPECVQSIAATALLTEESR
KITFGGNLFVIEGKRHNGYSIVDRETLTVVESERLPNKWSAQICELFALNQALKSLQNQE
GTIYTYSNFPNPERLSSADPTGCQLFGLLDILLKIHWEKGLWGHKLAIKIWEIWSLVRAA
LEPFQTNDEAESEEEKEEFDNQDSETPLPSTSQKESPEVIYANPPSLPKPIQKLIQLRVP
QEECPEWPPPPQPIHYGEGVIQVRPAVHYSEGAISLKMDDPAPCLGRTVA
>gi568815592r:27032373_27232750|GENSCAN_predicted_CDS_8|1953_bp
atggataggcgaggctacggagttcgatgtaaaaccgaagtttacaagcgaggtgaagga
ctcagaacaaaccaaaaagcagctccaagcttgcgatcgcggccagttcctcacggaacc
aaccgcagagccgccgcaggttcgcaggatcggagggttcggaggactcggagggtgcgg
attcctctctgctccttgagaccattccagccaggtattctcttcttaggagaggaaaga
accatgatctgcaaggctgctatgatagcctgggagcgcaagcacccccctcgtcagaat
gtccttgcagcagagcataaattcccggcccaggaccctcaatgggataacaacagtgcg
gctcaccgggaaaacatgagagatagggatatgataattaaagggattcaggaatcagtt
cctccaacccaaaacatttcccaagcatttaatgtacagcaagagaaagatgaagggccc
atggaattcttaaacagacttaaggaacagataagaaaatacgcagatttatcctgttca
gccgaagaccttactgtctcaggagttaaaggggaaggattcagagcaaaaattctagag
gaaaccgaagtcaaatataagaataagtcggttgctactaagttcttcttaattcctgaa
gcaggaactaatttattaggaagggacttaatgttaaggttaggcacaggcctatatgtt
aatcaaggaaaactccttacttccttaaacgtactcaccacttcagaagaaagccgcatc
catcccaacgtatggtcgaaagaagggaatcgaggatgggttcctccaatccatgtcaaa
ttaaaaactcctggggaaatagtgaaacaaaaacaataccctattcccttggaagccagg
ataggagataacaaggatcaagtaacagcaatttcagttaacttcctaaatttcctaagg
ggacaaggattacgggtctcaaagaacaacatccaattcatagaatctgaggtaaaatat
ctaggacacctaatcagtaaaggtgaacgaaagataggatccgaacgaattgaagaacag
ccattccatcttttcattaatgtaagcaaaggagtggccttaggggtactcacccaaaaa
catggaggccaccggcagcctgtagtctttctgtcaaaaatccttgacccagtaacccgt
ggatggcccgaatgtgttcaatccatagcagcaactgccttgctaacagaagaaagcagg
aaaataacctttgggggaaacctcttcgtcattgaaggaaaaaggcacaacgggtactcc
atagttgatagggaaaccctcacagtggtagagtcagaaagactgccaaataagtggtct
gcccaaatatgtgaactctttgcattaaaccaagccttaaaatccctgcagaatcaggaa
ggaactatttacacttactccaactttccaaaccctgagagattgagctctgctgaccca
acagggtgtcagctgtttgggttgctggacattctcttgaagatacattgggaaaaaggc
ttatggggccataaactggcaataaaaatatgggagatttggtcattggtgcgtgccgca
ctggagccatttcagacaaatgatgaggctgagtcagaggaggagaaggaggagtttgat
aatcaggactctgaaacgcctctaccgagtactagccaaaaggagagtccggaagtaatt
tatgccaatccccccagtcttcctaaacctattcagaaactcattcagctcagggttcct
caagaggaatgtccggaatggccacctcctcctcagccgattcattatggagagggggta
attcaggttcgccctgcagttcattacagtgaaggagcaatttcccttaaaatggatgac
ccagcgccctgtctgggtcgaacagtggcctaa
>gi568815592r:27032373_27232750|GENSCAN_predicted_peptide_9|141_aa
MAFISECGCTPSTSSASTGCCPVLGLTGSKQVWEVPLESPQGTVARILIGQVIMSIRTKL
QNKEHVIEALRRVKFKFPGRQKVHISKKGGFTKFDADEFEDMVAEKQLIPDGCGIKYIPD
RGPLDKGGLCIHEGFHCAASS
>gi568815592r:27032373_27232750|GENSCAN_predicted_CDS_9|426_bp
atggctttcatatccgagtgcggctgcaccccttccacgtcatctgcatcaacaggatgt
tgtcctgtgctgggactgacaggctccaaacaggtatgggaggtgcctttggaaagcccg
cagggcactgtggccaggattctcattggccaagttatcatgtccattcgcaccaagctg
cagaacaaggagcatgtgattgaggcccttcgcagggtcaagttcaagttccctggccgc
cagaaggtccacatttcaaagaaggggggcttcaccaagttcgatgcagatgaatttgaa
gacatggtggctgagaagcagctcatcccagatggctgtgggatcaagtacatccccgat
cgtggccctctggacaagggagggctttgcattcatgagggcttccactgtgctgcctcc
tcttaa