GENSCAN 1.0 Date run: 3-Nov-116 Time: 20:32:50 Sequence gi568815592f:27047129_27247512 : 200384 bp : 40.95% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 3024 3182 159 1 0 8 33 212 0.073 6.74 1.02 Term + 22032 22516 485 1 2 56 28 318 0.956 16.52 1.03 PlyA + 22966 22971 6 1.05 2.08 PlyA - 23579 23574 6 1.05 2.07 Term - 28899 28702 198 0 0 47 54 148 0.809 3.72 2.06 Intr - 44948 44760 189 2 0 99 22 185 0.355 11.86 2.05 Intr - 45426 45325 102 0 0 127 54 27 0.528 2.65 2.04 Intr - 50574 50499 76 2 1 104 75 17 0.368 0.60 2.03 Intr - 66676 66571 106 2 1 84 38 57 0.006 -1.35 2.02 Intr - 67998 67736 263 2 2 65 80 101 0.020 3.21 2.01 Init - 85622 85246 377 2 2 36 4 566 0.046 39.45 2.00 Prom - 85702 85663 40 -6.05 3.00 Prom + 85824 85863 40 -15.49 3.01 Init + 85944 86332 389 2 2 48 32 591 0.814 45.82 3.02 Term + 86598 86880 283 0 1 1 49 319 0.733 13.21 3.03 PlyA + 87123 87128 6 1.05 4.00 Prom + 87886 87925 40 -7.55 4.01 Sngl + 92181 92492 312 2 0 28 36 603 0.942 44.78 4.02 PlyA + 93487 93492 6 1.05 5.02 PlyA - 93684 93679 6 1.05 5.01 Sngl - 99670 99290 381 1 0 88 37 634 0.926 54.22 5.00 Prom - 99766 99727 40 -4.65 6.00 Prom + 99945 99984 40 -2.65 6.01 Sngl + 100001 100387 387 1 0 46 49 566 0.999 44.46 6.02 PlyA + 101775 101780 6 -0.45 7.05 PlyA - 102004 101999 6 1.05 7.04 Term - 104005 103928 78 1 0 64 55 87 0.331 -0.22 7.03 Intr - 110696 110552 145 1 1 56 81 106 0.278 6.06 7.02 Intr - 111141 110825 317 0 2 40 87 209 0.575 9.94 7.01 Init - 111993 111958 36 0 0 68 58 46 0.455 -2.32 7.00 Prom - 113442 113403 40 -3.95 8.00 Prom + 116170 116209 40 -6.65 8.01 Init + 129973 130024 52 0 1 26 48 105 0.050 1.67 8.02 Intr + 130419 130580 162 1 0 70 80 116 0.013 8.03 8.03 Intr + 133008 133319 312 1 0 77 36 162 0.016 5.13 8.04 Intr + 133908 134285 378 1 0 85 -17 170 0.363 0.21 8.05 Intr + 134802 134972 171 1 0 79 63 55 0.430 1.09 8.06 Intr + 135206 135417 212 0 2 101 88 60 0.470 5.01 8.07 Intr + 135708 135883 176 2 2 40 29 136 0.054 0.72 8.08 Intr + 140309 140442 134 2 2 96 89 58 0.460 6.07 8.09 Intr + 141331 141573 243 2 0 -18 72 216 0.770 5.95 8.10 Term + 141651 141763 113 1 2 54 49 141 0.985 4.54 8.11 PlyA + 142817 142822 6 1.05 9.00 Prom + 155177 155216 40 -2.55 9.01 Sngl + 164359 164784 426 0 0 93 37 390 0.971 30.44 9.02 PlyA + 169071 169076 6 1.05 10.05 PlyA - 170332 170327 6 1.05 10.04 Term - 177954 177883 72 0 0 84 38 93 0.143 0.83 10.03 Intr - 191529 191410 120 0 0 -31 84 126 0.237 0.07 10.02 Intr - 192031 191906 126 1 0 71 6 194 0.450 9.46 10.01 Init - 193559 193479 81 2 0 97 75 41 0.899 4.72 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 85622 85242 381 2 0 36 41 570 0.934 43.02 S.002 Term - 158774 158660 115 1 1 114 40 117 0.913 6.56 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592f:27047129_27247512|GENSCAN_predicted_peptide_1|214_aa XSSRQTHTRLDVKTNTLAEEHTSTWMWRGCREHADRYRQACRPSTDRPNDLEFEGAAPGT VHQTLLYNGSGSLVDQKEADVSGWELGNDQPPSPWALESSSSTSDRRPQNLLFHCLPLTS SALDSPLPKPKAAPIHSCQVHSPGESVQHASGWLFWQSSTNQGLFIFPQVPVLMGNSHWC HNVNVIISDFAKGKGVLLYNVNPWVQLCPLPEAR >gi568815592f:27047129_27247512|GENSCAN_predicted_CDS_1|645_bp nactctagcaggcagacacacacgcggctggacgtcaagacaaacacattggcagaggaa cacacaagcacctggatgtggagagggtgtcgagagcacgctgacaggtaccggcaagcc tgcaggccatcgaccgaccggccgaacgacttggagtttgaaggggcagctccaggcaca gtgcatcaaacactgctgtacaatggctctggctcccttgtggaccagaaagaagcagat gtttctggatgggagttgggaaatgatcagccaccttctccctgggccctggaatcttcc tcttccacaagtgacagaagacctcagaatcttctctttcactgcctacccctgaccagc tctgccctggacagtccccttccaaagccaaaggcagccccaatccacagttgtcaggta cactcacctggggagtctgtgcagcatgcctctgggtggctgttctggcagagcagcacc aaccagggcctgtttatatttccccaagtcccagtgctcatgggcaactcccactggtgt cacaatgtcaatgtcataatttctgactttgccaaaggaaaaggggtccttctgtacaat gtcaatccctgggtacaactctgccctctccctgaggctcgataa >gi568815592f:27047129_27247512|GENSCAN_predicted_peptide_2|436_aa MPEPAKSAPAPKKGSKKAVTKAQKKDGKKRKRSRKESYSIYVYKVLKQVHPDTGISSKAM GIMNSFVNDIFERIAGEASRLAHYNKRSTITSREIQTAVRLLLPGELAKHAVSEGTKAVT KYTSANSGSCSDSCSGLLLLLNTAPALPVCSTGGASTLLYTAILGCSACGLVVLSAASTL LYTVAVGFSAYGLALLCGAPALLYYAATLGCSMGVSGNFPTALEEKGGGNLKGKVTGLSG LGRISGFSSPTVTFLSSLLPSPLSQARRRFCSHQLDEPGLAIFPRVNSQPWCYILPWEFH TFSKRVIMHSVKHAHARAHELPTVFLSGFEKRQTEVGERSCSRINIPVSALRRSPDSVLA CLDNSTVDVVKNTLDRVICKEKKFILEAGKSKVMGLASGKGFCSASRHGEKATRQENVDE RGKKKQADWFYNNSLS >gi568815592f:27047129_27247512|GENSCAN_predicted_CDS_2|1311_bp atgccagagccagcgaagtctgctcccgccccgaaaaagggctccaagaaggcggtgact aaggcgcagaagaaagacggcaagaagcgcaagcgcagccgcaaggagagctattccatc tatgtgtacaaggttctgaagcaggtccaccctgacaccggcatttcgtccaaggccatg ggcatcatgaattcgtttgtgaacgacattttcgagcgcatcgcaggtgaggcttcccgc ctggcgcattacaacaagcgctcgaccatcacctccagggagatccagacggccgtgcgc ctgctgctgcctggggagttggccaagcacgccgtgtccgagggtactaaggccgtcacc aagtacaccagcgctaattctggctcctgctctgactcctgctctggactgctcctgttg ctgaacactgctcctgcccttcctgtctgttctactggtggagcatccactttgctgtac actgctattctgggctgttctgcctgtgggttagttgtgctttctgcagcatccaccttg ctgtacacagttgctgtaggcttctctgcctatgggttagccttgctttgtggagcaccc gctttgctgtactatgctgctactctgggatgctctatgggagtcagtggtaatttccct actgctcttgaagaaaagggaggtggaaatttgaaaggaaaagtaactggactctctgga cttggaagaatttctgggtttagtagccccaccgtcaccttcctctcctctctattgccc tctcccctttctcaggctcgaaggcgtttttgcagtcatcagttagacgaacctgggcta gcaatattcccaagagtcaatagtcagccctggtgctatatacttccatgggaattccat actttctcaaaacgcgtcatcatgcactcggttaaacacgcacacgcacgcgcgcacgag ctgcccacggtcttcctttcggggtttgagaagagacagacagaggtgggagagcgcagc tgcagccgcatcaacattccggtgtctgcccttagaagaagtccggattcagtactcgcc tgcttagacaattctactgttgatgtggtgaaaaataccttagaccgggtaatttgtaaa gaaaagaagtttattctggaggctgggaagtccaaggtcatgggactggcttctggcaag ggtttttgttctgcatcacgacatggtgaaaaagcaacacgacaagagaatgtggatgaa agagggaaaaagaagcaggctgactggttttataacaactcactctcatga >gi568815592f:27047129_27247512|GENSCAN_predicted_peptide_3|223_aa MSGRGKQGGKARAKAKTRSSRAGLQFPVGRVHRLLRKGNYAERVGAGAPVYLAAVLEYLT AEILELAGNAARDNKKTRIIPRHLQLAIRNDEELNKLLGKVTIAQGGVLPNIQAVLLPKK TESHHKAKGKPLLNFKLNQCVSTEHQWLREFLKHFGPKELSPGLEFSRQLFHAGQGGMKK SRRYVPGTVALRDVRRYQNSELLISKLPLLRELGGDAAARERG >gi568815592f:27047129_27247512|GENSCAN_predicted_CDS_3|672_bp atgtctggacgtggcaagcagggaggcaaagcccgcgctaaggccaagactcgctcttct agggccggtctccagttccccgtgggccgagtgcaccgcctgctccgcaaaggcaactat gccgagcgggtcggggccggcgcgccggtgtatctggcagcggtgctggagtacctgacc gccgagatcctggaactggcgggcaacgcggcccgcgacaacaagaagacccgcatcatc ccgcgtcatctccaactggccatccgcaacgacgaggagctcaacaagctgctgggcaaa gtcaccatcgcacagggcggtgtcctgcccaacattcaggccgtgctactgcccaaaaag actgagagccaccacaaggcgaagggcaagccattacttaattttaaacttaatcagtgt gtgtccacagagcaccagtggcttcgtgagttcttgaagcactttggcccaaaagagtta tcgcccgggctagaatttagcagacagctgttccacgcgggccagggcggcatgaagaag tcccgccgctacgtgcccggcacagtggccctgcgcgacgttcggcgctaccagaactcc gagctgctgatcagcaagctgccgctcctgcgagagctcggcggtgacgccgctgcacga gagcgaggctga >gi568815592f:27047129_27247512|GENSCAN_predicted_peptide_4|103_aa MSGRGKGGKGLGKGGAKRHRKVLRDNIQGITKPAIRRLARRGGVKRISGLIYEETRGVLK VFLENVIRDAVTYTEHAKRKTVTAMDVVYALKRQGRTLYGFGG >gi568815592f:27047129_27247512|GENSCAN_predicted_CDS_4|312_bp atgtcaggacgcggcaaaggaggtaagggcctggggaaagggggtgccaagcgccaccgc aaggtgctgcgcgacaacatccagggtatcaccaagccagccattcggcgccttgctcgc cgcggcggcgtgaagcgcatttctggcctcatctatgaggagacccgcggagtgttgaag gtgttcctggagaacgtgatccgggacgccgtgacctacacggagcacgccaagcgcaag acggtcaccgccatggacgtggtctacgcgctcaagcgccagggccgcaccctctatggc ttcggcggctaa >gi568815592f:27047129_27247512|GENSCAN_predicted_peptide_5|126_aa MPEPAKSAPAPKKGSKKAVTKAQKKDGKKRKRSRKESYSVYVYKVLKQVHPDTGISSKAM GIMNSFVNDIFERIAGEASRLAHYNKRSTITSREIQTAVRLLLPGELAKHAVSEGTKAVT KYTSAK >gi568815592f:27047129_27247512|GENSCAN_predicted_CDS_5|381_bp atgccggaaccagcgaagtccgctcccgcgcccaagaagggctcgaagaaagccgtgact aaggcgcagaagaaggacggcaagaagcgcaagcgcagccgcaaggagagctactccgta tacgtgtacaaggtgctgaagcaggtccaccccgacaccggcatctcctctaaggccatg ggaatcatgaactccttcgtcaacgacatcttcgaacgcatcgcgggtgaggcttcccgc ctggcgcattacaacaagcgctcgaccatcacctccagggagatccagacggccgtgcgc ctgctgctgcccggggagttggccaagcacgccgtgtccgagggcaccaaggccgtcacc aagtacaccagcgctaagtaa >gi568815592f:27047129_27247512|GENSCAN_predicted_peptide_6|128_aa MSGRGKQGGKARAKAKTRSSRAGLQFPVGRVHRLLRKGNYAERVGAGAPVYLAAVLEYLT AEILELAGNAARDNKKTRIIPRHLQLAIRNDEELNKLLGKVTIAQGGVLPNIQAVLLPKK TESHHKAK >gi568815592f:27047129_27247512|GENSCAN_predicted_CDS_6|387_bp atgtctggacgtggcaagcaaggcggtaaagctcgcgccaaggccaagacccgctcttct cgggctgggcttcagttccccgtgggccgagtgcaccgcctgctccgcaagggtaattat gccgagcgggttggagccggcgcgccagtgtacctggctgcggtgctggagtacctgacc gctgagatcctggagctggctggcaatgcggcccgcgacaacaagaagacccgtatcatc ccgcgtcacctccaactggccatccgcaacgacgaggagctcaacaagctgctgggcaaa gtcaccatcgcgcagggtggtgtcttgcccaatatccaggccgtgctgctgcctaagaag actgagagccaccataaggccaaataa >gi568815592f:27047129_27247512|GENSCAN_predicted_peptide_7|191_aa MLAMLVSNSRPQYVEKLRRRQGTAQYVEKLRRHQGTAVTRIRTEVAAATTQSTNHYTITA RHQRPLLVGLFLCLFPRNVCFCHLKNGAVFRPYLHTYPVYFLIHSPSPIFCLWQRSQSLA VWSSDLWQQKEEGAALKWTCQIPFFAKLVRIFGGLAWNLASRQVPENYSLQGMHDNDSIC NRYEDEGEGRN >gi568815592f:27047129_27247512|GENSCAN_predicted_CDS_7|576_bp atgttggccatgctggtctcgaactcccgacctcagtatgttgaaaaattgaggcgacgt caaggtactgctcagtacgttgagaaattgaggcgacatcaaggtactgccgtgactcgg attcgaaccgaggttgctgcggccacaacgcagagtactaaccactatacgatcacggca cgccaccagaggccgctgcttgtggggctcttcctttgcttatttcccagaaacgtctgt ttttgtcatctaaagaatggtgcagtctttcgtccttatttacacacctacccagtttac ttccttatccactccccgtccccaatattttgcttatggcaaagaagccaaagtttagca gtgtggtcctcagatctttggcaacaaaaggaagagggcgctgctttgaagtggacctgc cagatcccattttttgccaaactagtcagaatcttcgggggcttggcatggaatctggca tctcgacaggtgccagaaaattattccttacaaggaatgcatgacaacgactccatctgt aataggtatgaagatgagggggaagggcggaactga >gi568815592f:27047129_27247512|GENSCAN_predicted_peptide_8|650_aa MDRRGYGVRCKTEVYKRGEGLRTNQKAAPSLRSRPVPHGTNRRAAAGSQDRRVRRTRRVR IPLCSLRPFQPGILFLGEERTMICKAAMIAWERKHPPRQNVLAAEHKFPAQDPQWDNNSA AHRENMRDRDMIIKGIQESVPPTQNISQAFNVQQEKDEGPMEFLNRLKEQIRKYADLSCS AEDLTVSGVKGEGFRAKILEETEVKYKNKSVATKFFLIPEAGTNLLGRDLMLRLGTGLYV NQGKLLTSLNVLTTSEESRIHPNVWSKEGNRGWVPPIHVKLKTPGEIVKQKQYPIPLEAR IGDNKDQVTAISVNFLNFLRGQGLRVSKNNIQFIESEVKYLGHLISKGERKIGSERIEEQ PFHLFINVSKGVALGVLTQKHGGHRQPVVFLSKILDPVTRGWPECVQSIAATALLTEESR KITFGGNLFVIEGKRHNGYSIVDRETLTVVESERLPNKWSAQICELFALNQALKSLQNQE GTIYTYSNFPNPERLSSADPTGCQLFGLLDILLKIHWEKGLWGHKLAIKIWEIWSLVRAA LEPFQTNDEAESEEEKEEFDNQDSETPLPSTSQKESPEVIYANPPSLPKPIQKLIQLRVP QEECPEWPPPPQPIHYGEGVIQVRPAVHYSEGAISLKMDDPAPCLGRTVA >gi568815592f:27047129_27247512|GENSCAN_predicted_CDS_8|1953_bp atggataggcgaggctacggagttcgatgtaaaaccgaagtttacaagcgaggtgaagga ctcagaacaaaccaaaaagcagctccaagcttgcgatcgcggccagttcctcacggaacc aaccgcagagccgccgcaggttcgcaggatcggagggttcggaggactcggagggtgcgg attcctctctgctccttgagaccattccagccaggtattctcttcttaggagaggaaaga accatgatctgcaaggctgctatgatagcctgggagcgcaagcacccccctcgtcagaat gtccttgcagcagagcataaattcccggcccaggaccctcaatgggataacaacagtgcg gctcaccgggaaaacatgagagatagggatatgataattaaagggattcaggaatcagtt cctccaacccaaaacatttcccaagcatttaatgtacagcaagagaaagatgaagggccc atggaattcttaaacagacttaaggaacagataagaaaatacgcagatttatcctgttca gccgaagaccttactgtctcaggagttaaaggggaaggattcagagcaaaaattctagag gaaaccgaagtcaaatataagaataagtcggttgctactaagttcttcttaattcctgaa gcaggaactaatttattaggaagggacttaatgttaaggttaggcacaggcctatatgtt aatcaaggaaaactccttacttccttaaacgtactcaccacttcagaagaaagccgcatc catcccaacgtatggtcgaaagaagggaatcgaggatgggttcctccaatccatgtcaaa ttaaaaactcctggggaaatagtgaaacaaaaacaataccctattcccttggaagccagg ataggagataacaaggatcaagtaacagcaatttcagttaacttcctaaatttcctaagg ggacaaggattacgggtctcaaagaacaacatccaattcatagaatctgaggtaaaatat ctaggacacctaatcagtaaaggtgaacgaaagataggatccgaacgaattgaagaacag ccattccatcttttcattaatgtaagcaaaggagtggccttaggggtactcacccaaaaa catggaggccaccggcagcctgtagtctttctgtcaaaaatccttgacccagtaacccgt ggatggcccgaatgtgttcaatccatagcagcaactgccttgctaacagaagaaagcagg aaaataacctttgggggaaacctcttcgtcattgaaggaaaaaggcacaacgggtactcc atagttgatagggaaaccctcacagtggtagagtcagaaagactgccaaataagtggtct gcccaaatatgtgaactctttgcattaaaccaagccttaaaatccctgcagaatcaggaa ggaactatttacacttactccaactttccaaaccctgagagattgagctctgctgaccca acagggtgtcagctgtttgggttgctggacattctcttgaagatacattgggaaaaaggc ttatggggccataaactggcaataaaaatatgggagatttggtcattggtgcgtgccgca ctggagccatttcagacaaatgatgaggctgagtcagaggaggagaaggaggagtttgat aatcaggactctgaaacgcctctaccgagtactagccaaaaggagagtccggaagtaatt tatgccaatccccccagtcttcctaaacctattcagaaactcattcagctcagggttcct caagaggaatgtccggaatggccacctcctcctcagccgattcattatggagagggggta attcaggttcgccctgcagttcattacagtgaaggagcaatttcccttaaaatggatgac ccagcgccctgtctgggtcgaacagtggcctaa >gi568815592f:27047129_27247512|GENSCAN_predicted_peptide_9|141_aa MAFISECGCTPSTSSASTGCCPVLGLTGSKQVWEVPLESPQGTVARILIGQVIMSIRTKL QNKEHVIEALRRVKFKFPGRQKVHISKKGGFTKFDADEFEDMVAEKQLIPDGCGIKYIPD RGPLDKGGLCIHEGFHCAASS >gi568815592f:27047129_27247512|GENSCAN_predicted_CDS_9|426_bp atggctttcatatccgagtgcggctgcaccccttccacgtcatctgcatcaacaggatgt tgtcctgtgctgggactgacaggctccaaacaggtatgggaggtgcctttggaaagcccg cagggcactgtggccaggattctcattggccaagttatcatgtccattcgcaccaagctg cagaacaaggagcatgtgattgaggcccttcgcagggtcaagttcaagttccctggccgc cagaaggtccacatttcaaagaaggggggcttcaccaagttcgatgcagatgaatttgaa gacatggtggctgagaagcagctcatcccagatggctgtgggatcaagtacatccccgat cgtggccctctggacaagggagggctttgcattcatgagggcttccactgtgctgcctcc tcttaa >gi568815592f:27047129_27247512|GENSCAN_predicted_peptide_10|132_aa MAPLRYHQPPYRFFSKIDLSENTPDIKIFSRVMEASVIVFVLKIEIRETSKWPENLEHSL VDTIVTPEMRRELGEKRYAQSPLLENGDLRAVIKGQLSPPSSHSTTNEEDYDLRVDWKLI SGNLPNPLTDPY >gi568815592f:27047129_27247512|GENSCAN_predicted_CDS_10|399_bp atggcaccactgcgctatcatcaaccaccatatagatttttttctaaaattgatctgtct gagaacactcctgatatcaagatattcagcagggtcatggaggctagcgttatcgtcttt gtcctcaaaatagaaatccgagaaacctccaagtggcccgaaaatttagagcactcgctc gtggataccattgtcactccagaaatgcggcgggaactgggagagaaaaggtatgcacaa agtcctttgttggagaacggagacctccgagctgttattaaaggacagctttctccacca tcttcccattccaccaccaatgaggaggactatgatctccgggttgactggaagttgatt tctggaaatctgcccaaccctcttactgatccttactga