GENSCAN 1.0 Date run: 7-Nov-116 Time: 01:10:26 Sequence gi568815592f:27033072_27233461 : 200390 bp : 40.83% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 6080 6336 257 0 2 56 86 215 0.797 14.34 1.02 Intr + 12391 12634 244 0 1 78 44 227 0.626 13.45 1.03 Intr + 12836 12958 123 0 0 53 31 103 0.594 0.74 1.04 Intr + 17081 17239 159 0 0 8 33 212 0.084 6.74 1.05 Term + 36089 36573 485 0 2 56 28 318 0.956 16.52 1.06 PlyA + 37023 37028 6 1.05 2.08 PlyA - 37636 37631 6 1.05 2.07 Term - 42956 42759 198 2 0 47 54 148 0.809 3.72 2.06 Intr - 59005 58817 189 1 0 99 22 185 0.355 11.86 2.05 Intr - 59483 59382 102 2 0 127 54 27 0.528 2.65 2.04 Intr - 64631 64556 76 1 1 104 75 17 0.368 0.60 2.03 Intr - 80733 80628 106 1 1 84 38 57 0.006 -1.35 2.02 Intr - 82055 81793 263 1 2 65 80 101 0.020 3.21 2.01 Init - 99679 99303 377 1 2 36 4 566 0.046 39.45 2.00 Prom - 99759 99720 40 -6.05 3.00 Prom + 99881 99920 40 -15.49 3.01 Init + 100001 100389 389 1 2 48 32 591 0.814 45.82 3.02 Term + 100655 100937 283 2 1 1 49 319 0.733 13.21 3.03 PlyA + 101180 101185 6 1.05 4.00 Prom + 101943 101982 40 -7.55 4.01 Sngl + 106238 106549 312 1 0 28 36 603 0.942 44.78 4.02 PlyA + 107544 107549 6 1.05 5.02 PlyA - 107741 107736 6 1.05 5.01 Sngl - 113727 113347 381 0 0 88 37 634 0.926 54.22 5.00 Prom - 113823 113784 40 -4.65 6.00 Prom + 114002 114041 40 -2.65 6.01 Sngl + 114058 114444 387 0 0 46 49 566 0.999 44.46 6.02 PlyA + 115832 115837 6 -0.45 7.05 PlyA - 116061 116056 6 1.05 7.04 Term - 118062 117985 78 0 0 64 55 87 0.331 -0.22 7.03 Intr - 124753 124609 145 0 1 56 81 106 0.278 6.06 7.02 Intr - 125198 124882 317 2 2 40 87 209 0.575 9.94 7.01 Init - 126050 126015 36 2 0 68 58 46 0.455 -2.32 7.00 Prom - 127499 127460 40 -3.95 8.00 Prom + 130227 130266 40 -6.65 8.01 Init + 144030 144081 52 2 1 26 48 105 0.050 1.67 8.02 Intr + 144476 144637 162 0 0 70 80 116 0.013 8.03 8.03 Intr + 147065 147376 312 0 0 77 36 162 0.016 5.13 8.04 Intr + 147965 148342 378 0 0 85 -17 170 0.363 0.21 8.05 Intr + 148859 149029 171 0 0 79 63 55 0.430 1.09 8.06 Intr + 149263 149474 212 2 2 101 88 60 0.470 5.01 8.07 Intr + 149765 149940 176 1 2 40 29 136 0.054 0.72 8.08 Intr + 154366 154499 134 1 2 96 89 58 0.460 6.07 8.09 Intr + 155388 155630 243 1 0 -18 72 216 0.770 5.95 8.10 Term + 155708 155820 113 0 2 54 49 141 0.985 4.54 8.11 PlyA + 156874 156879 6 1.05 9.00 Prom + 169234 169273 40 -2.55 9.01 Sngl + 178416 178841 426 2 0 93 37 390 0.971 30.44 9.02 PlyA + 183128 183133 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 99679 99299 381 1 0 36 41 570 0.934 43.02 S.002 Term - 172831 172717 115 0 1 114 40 117 0.913 6.56 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592f:27033072_27233461|GENSCAN_predicted_peptide_1|422_aa XGAAPGTVHQTLLYNGSGSLVDQKEADVSGWELGNDQPPSPWALAPSSSTSDKGPQTLLF HCLPLTSSALDSPLPKPKAAPIHSCQALAAFLWGRARDLQPAMPDPPPHSVGSKPPRRAP PPTPRAPSPIDHPSAEECGCTAQDWQAAPPAAPVRDPLGKASWAPESANLVGTWRTFVSS SGIVNAPINTLSKRTNQLSVKWTNQQDVDSSRQTHTRLDVKTNTLAEEHTSTWMWRGCRE HADRYRQACRPSTDRPNDLEFEGAAPGTVHQTLLYNGSGSLVDQKEADVSGWELGNDQPP SPWALESSSSTSDRRPQNLLFHCLPLTSSALDSPLPKPKAAPIHSCQVHSPGESVQHASG WLFWQSSTNQGLFIFPQVPVLMGNSHWCHNVNVIISDFAKGKGVLLYNVNPWVQLCPLPE AR >gi568815592f:27033072_27233461|GENSCAN_predicted_CDS_1|1269_bp naaggggcagctccaggcacagtgcatcaaacactgctgtacaatggctctggctccctt gtggaccagaaagaagcagatgtttctggttgggagttgggaaatgatcagccaccttct ccctgggccctggcaccttcctcttccacaagtgacaaaggacctcagactcttctcttt cactgtctacccctgaccagctctgccctggacagtccccttccaaagccaaaggcagcc ccaatccacagttgtcaggccttagctgccttcctgtggggcagggctcgggacctgcag cccgccatgcctgaccctcccccgcactccgtgggctccaagcctccccgacgagcgccg ccccctactccacgggcacccagtcccatcgaccacccaagcgctgaggagtgcgggtgc acggcgcaggactggcaggcagctccacctgcggccccagtgcgggatccactgggtaaa gccagctgggctcctgagtctgctaatctagtggggacgtggagaacttttgtgtctagc tcagggattgtaaacgcaccaatcaacaccctgtcaaaacggaccaatcagctctctgta aaatggaccaatcagcaggatgtggactctagcaggcagacacacacgcggctggacgtc aagacaaacacattggcagaggaacacacaagcacctggatgtggagagggtgtcgagag cacgctgacaggtaccggcaagcctgcaggccatcgaccgaccggccgaacgacttggag tttgaaggggcagctccaggcacagtgcatcaaacactgctgtacaatggctctggctcc cttgtggaccagaaagaagcagatgtttctggatgggagttgggaaatgatcagccacct tctccctgggccctggaatcttcctcttccacaagtgacagaagacctcagaatcttctc tttcactgcctacccctgaccagctctgccctggacagtccccttccaaagccaaaggca gccccaatccacagttgtcaggtacactcacctggggagtctgtgcagcatgcctctggg tggctgttctggcagagcagcaccaaccagggcctgtttatatttccccaagtcccagtg ctcatgggcaactcccactggtgtcacaatgtcaatgtcataatttctgactttgccaaa ggaaaaggggtccttctgtacaatgtcaatccctgggtacaactctgccctctccctgag gctcgataa >gi568815592f:27033072_27233461|GENSCAN_predicted_peptide_2|436_aa MPEPAKSAPAPKKGSKKAVTKAQKKDGKKRKRSRKESYSIYVYKVLKQVHPDTGISSKAM GIMNSFVNDIFERIAGEASRLAHYNKRSTITSREIQTAVRLLLPGELAKHAVSEGTKAVT KYTSANSGSCSDSCSGLLLLLNTAPALPVCSTGGASTLLYTAILGCSACGLVVLSAASTL LYTVAVGFSAYGLALLCGAPALLYYAATLGCSMGVSGNFPTALEEKGGGNLKGKVTGLSG LGRISGFSSPTVTFLSSLLPSPLSQARRRFCSHQLDEPGLAIFPRVNSQPWCYILPWEFH TFSKRVIMHSVKHAHARAHELPTVFLSGFEKRQTEVGERSCSRINIPVSALRRSPDSVLA CLDNSTVDVVKNTLDRVICKEKKFILEAGKSKVMGLASGKGFCSASRHGEKATRQENVDE RGKKKQADWFYNNSLS >gi568815592f:27033072_27233461|GENSCAN_predicted_CDS_2|1311_bp atgccagagccagcgaagtctgctcccgccccgaaaaagggctccaagaaggcggtgact aaggcgcagaagaaagacggcaagaagcgcaagcgcagccgcaaggagagctattccatc tatgtgtacaaggttctgaagcaggtccaccctgacaccggcatttcgtccaaggccatg ggcatcatgaattcgtttgtgaacgacattttcgagcgcatcgcaggtgaggcttcccgc ctggcgcattacaacaagcgctcgaccatcacctccagggagatccagacggccgtgcgc ctgctgctgcctggggagttggccaagcacgccgtgtccgagggtactaaggccgtcacc aagtacaccagcgctaattctggctcctgctctgactcctgctctggactgctcctgttg ctgaacactgctcctgcccttcctgtctgttctactggtggagcatccactttgctgtac actgctattctgggctgttctgcctgtgggttagttgtgctttctgcagcatccaccttg ctgtacacagttgctgtaggcttctctgcctatgggttagccttgctttgtggagcaccc gctttgctgtactatgctgctactctgggatgctctatgggagtcagtggtaatttccct actgctcttgaagaaaagggaggtggaaatttgaaaggaaaagtaactggactctctgga cttggaagaatttctgggtttagtagccccaccgtcaccttcctctcctctctattgccc tctcccctttctcaggctcgaaggcgtttttgcagtcatcagttagacgaacctgggcta gcaatattcccaagagtcaatagtcagccctggtgctatatacttccatgggaattccat actttctcaaaacgcgtcatcatgcactcggttaaacacgcacacgcacgcgcgcacgag ctgcccacggtcttcctttcggggtttgagaagagacagacagaggtgggagagcgcagc tgcagccgcatcaacattccggtgtctgcccttagaagaagtccggattcagtactcgcc tgcttagacaattctactgttgatgtggtgaaaaataccttagaccgggtaatttgtaaa gaaaagaagtttattctggaggctgggaagtccaaggtcatgggactggcttctggcaag ggtttttgttctgcatcacgacatggtgaaaaagcaacacgacaagagaatgtggatgaa agagggaaaaagaagcaggctgactggttttataacaactcactctcatga >gi568815592f:27033072_27233461|GENSCAN_predicted_peptide_3|223_aa MSGRGKQGGKARAKAKTRSSRAGLQFPVGRVHRLLRKGNYAERVGAGAPVYLAAVLEYLT AEILELAGNAARDNKKTRIIPRHLQLAIRNDEELNKLLGKVTIAQGGVLPNIQAVLLPKK TESHHKAKGKPLLNFKLNQCVSTEHQWLREFLKHFGPKELSPGLEFSRQLFHAGQGGMKK SRRYVPGTVALRDVRRYQNSELLISKLPLLRELGGDAAARERG >gi568815592f:27033072_27233461|GENSCAN_predicted_CDS_3|672_bp atgtctggacgtggcaagcagggaggcaaagcccgcgctaaggccaagactcgctcttct agggccggtctccagttccccgtgggccgagtgcaccgcctgctccgcaaaggcaactat gccgagcgggtcggggccggcgcgccggtgtatctggcagcggtgctggagtacctgacc gccgagatcctggaactggcgggcaacgcggcccgcgacaacaagaagacccgcatcatc ccgcgtcatctccaactggccatccgcaacgacgaggagctcaacaagctgctgggcaaa gtcaccatcgcacagggcggtgtcctgcccaacattcaggccgtgctactgcccaaaaag actgagagccaccacaaggcgaagggcaagccattacttaattttaaacttaatcagtgt gtgtccacagagcaccagtggcttcgtgagttcttgaagcactttggcccaaaagagtta tcgcccgggctagaatttagcagacagctgttccacgcgggccagggcggcatgaagaag tcccgccgctacgtgcccggcacagtggccctgcgcgacgttcggcgctaccagaactcc gagctgctgatcagcaagctgccgctcctgcgagagctcggcggtgacgccgctgcacga gagcgaggctga >gi568815592f:27033072_27233461|GENSCAN_predicted_peptide_4|103_aa MSGRGKGGKGLGKGGAKRHRKVLRDNIQGITKPAIRRLARRGGVKRISGLIYEETRGVLK VFLENVIRDAVTYTEHAKRKTVTAMDVVYALKRQGRTLYGFGG >gi568815592f:27033072_27233461|GENSCAN_predicted_CDS_4|312_bp atgtcaggacgcggcaaaggaggtaagggcctggggaaagggggtgccaagcgccaccgc aaggtgctgcgcgacaacatccagggtatcaccaagccagccattcggcgccttgctcgc cgcggcggcgtgaagcgcatttctggcctcatctatgaggagacccgcggagtgttgaag gtgttcctggagaacgtgatccgggacgccgtgacctacacggagcacgccaagcgcaag acggtcaccgccatggacgtggtctacgcgctcaagcgccagggccgcaccctctatggc ttcggcggctaa >gi568815592f:27033072_27233461|GENSCAN_predicted_peptide_5|126_aa MPEPAKSAPAPKKGSKKAVTKAQKKDGKKRKRSRKESYSVYVYKVLKQVHPDTGISSKAM GIMNSFVNDIFERIAGEASRLAHYNKRSTITSREIQTAVRLLLPGELAKHAVSEGTKAVT KYTSAK >gi568815592f:27033072_27233461|GENSCAN_predicted_CDS_5|381_bp atgccggaaccagcgaagtccgctcccgcgcccaagaagggctcgaagaaagccgtgact aaggcgcagaagaaggacggcaagaagcgcaagcgcagccgcaaggagagctactccgta tacgtgtacaaggtgctgaagcaggtccaccccgacaccggcatctcctctaaggccatg ggaatcatgaactccttcgtcaacgacatcttcgaacgcatcgcgggtgaggcttcccgc ctggcgcattacaacaagcgctcgaccatcacctccagggagatccagacggccgtgcgc ctgctgctgcccggggagttggccaagcacgccgtgtccgagggcaccaaggccgtcacc aagtacaccagcgctaagtaa >gi568815592f:27033072_27233461|GENSCAN_predicted_peptide_6|128_aa MSGRGKQGGKARAKAKTRSSRAGLQFPVGRVHRLLRKGNYAERVGAGAPVYLAAVLEYLT AEILELAGNAARDNKKTRIIPRHLQLAIRNDEELNKLLGKVTIAQGGVLPNIQAVLLPKK TESHHKAK >gi568815592f:27033072_27233461|GENSCAN_predicted_CDS_6|387_bp atgtctggacgtggcaagcaaggcggtaaagctcgcgccaaggccaagacccgctcttct cgggctgggcttcagttccccgtgggccgagtgcaccgcctgctccgcaagggtaattat gccgagcgggttggagccggcgcgccagtgtacctggctgcggtgctggagtacctgacc gctgagatcctggagctggctggcaatgcggcccgcgacaacaagaagacccgtatcatc ccgcgtcacctccaactggccatccgcaacgacgaggagctcaacaagctgctgggcaaa gtcaccatcgcgcagggtggtgtcttgcccaatatccaggccgtgctgctgcctaagaag actgagagccaccataaggccaaataa >gi568815592f:27033072_27233461|GENSCAN_predicted_peptide_7|191_aa MLAMLVSNSRPQYVEKLRRRQGTAQYVEKLRRHQGTAVTRIRTEVAAATTQSTNHYTITA RHQRPLLVGLFLCLFPRNVCFCHLKNGAVFRPYLHTYPVYFLIHSPSPIFCLWQRSQSLA VWSSDLWQQKEEGAALKWTCQIPFFAKLVRIFGGLAWNLASRQVPENYSLQGMHDNDSIC NRYEDEGEGRN >gi568815592f:27033072_27233461|GENSCAN_predicted_CDS_7|576_bp atgttggccatgctggtctcgaactcccgacctcagtatgttgaaaaattgaggcgacgt caaggtactgctcagtacgttgagaaattgaggcgacatcaaggtactgccgtgactcgg attcgaaccgaggttgctgcggccacaacgcagagtactaaccactatacgatcacggca cgccaccagaggccgctgcttgtggggctcttcctttgcttatttcccagaaacgtctgt ttttgtcatctaaagaatggtgcagtctttcgtccttatttacacacctacccagtttac ttccttatccactccccgtccccaatattttgcttatggcaaagaagccaaagtttagca gtgtggtcctcagatctttggcaacaaaaggaagagggcgctgctttgaagtggacctgc cagatcccattttttgccaaactagtcagaatcttcgggggcttggcatggaatctggca tctcgacaggtgccagaaaattattccttacaaggaatgcatgacaacgactccatctgt aataggtatgaagatgagggggaagggcggaactga >gi568815592f:27033072_27233461|GENSCAN_predicted_peptide_8|650_aa MDRRGYGVRCKTEVYKRGEGLRTNQKAAPSLRSRPVPHGTNRRAAAGSQDRRVRRTRRVR IPLCSLRPFQPGILFLGEERTMICKAAMIAWERKHPPRQNVLAAEHKFPAQDPQWDNNSA AHRENMRDRDMIIKGIQESVPPTQNISQAFNVQQEKDEGPMEFLNRLKEQIRKYADLSCS AEDLTVSGVKGEGFRAKILEETEVKYKNKSVATKFFLIPEAGTNLLGRDLMLRLGTGLYV NQGKLLTSLNVLTTSEESRIHPNVWSKEGNRGWVPPIHVKLKTPGEIVKQKQYPIPLEAR IGDNKDQVTAISVNFLNFLRGQGLRVSKNNIQFIESEVKYLGHLISKGERKIGSERIEEQ PFHLFINVSKGVALGVLTQKHGGHRQPVVFLSKILDPVTRGWPECVQSIAATALLTEESR KITFGGNLFVIEGKRHNGYSIVDRETLTVVESERLPNKWSAQICELFALNQALKSLQNQE GTIYTYSNFPNPERLSSADPTGCQLFGLLDILLKIHWEKGLWGHKLAIKIWEIWSLVRAA LEPFQTNDEAESEEEKEEFDNQDSETPLPSTSQKESPEVIYANPPSLPKPIQKLIQLRVP QEECPEWPPPPQPIHYGEGVIQVRPAVHYSEGAISLKMDDPAPCLGRTVA >gi568815592f:27033072_27233461|GENSCAN_predicted_CDS_8|1953_bp atggataggcgaggctacggagttcgatgtaaaaccgaagtttacaagcgaggtgaagga ctcagaacaaaccaaaaagcagctccaagcttgcgatcgcggccagttcctcacggaacc aaccgcagagccgccgcaggttcgcaggatcggagggttcggaggactcggagggtgcgg attcctctctgctccttgagaccattccagccaggtattctcttcttaggagaggaaaga accatgatctgcaaggctgctatgatagcctgggagcgcaagcacccccctcgtcagaat gtccttgcagcagagcataaattcccggcccaggaccctcaatgggataacaacagtgcg gctcaccgggaaaacatgagagatagggatatgataattaaagggattcaggaatcagtt cctccaacccaaaacatttcccaagcatttaatgtacagcaagagaaagatgaagggccc atggaattcttaaacagacttaaggaacagataagaaaatacgcagatttatcctgttca gccgaagaccttactgtctcaggagttaaaggggaaggattcagagcaaaaattctagag gaaaccgaagtcaaatataagaataagtcggttgctactaagttcttcttaattcctgaa gcaggaactaatttattaggaagggacttaatgttaaggttaggcacaggcctatatgtt aatcaaggaaaactccttacttccttaaacgtactcaccacttcagaagaaagccgcatc catcccaacgtatggtcgaaagaagggaatcgaggatgggttcctccaatccatgtcaaa ttaaaaactcctggggaaatagtgaaacaaaaacaataccctattcccttggaagccagg ataggagataacaaggatcaagtaacagcaatttcagttaacttcctaaatttcctaagg ggacaaggattacgggtctcaaagaacaacatccaattcatagaatctgaggtaaaatat ctaggacacctaatcagtaaaggtgaacgaaagataggatccgaacgaattgaagaacag ccattccatcttttcattaatgtaagcaaaggagtggccttaggggtactcacccaaaaa catggaggccaccggcagcctgtagtctttctgtcaaaaatccttgacccagtaacccgt ggatggcccgaatgtgttcaatccatagcagcaactgccttgctaacagaagaaagcagg aaaataacctttgggggaaacctcttcgtcattgaaggaaaaaggcacaacgggtactcc atagttgatagggaaaccctcacagtggtagagtcagaaagactgccaaataagtggtct gcccaaatatgtgaactctttgcattaaaccaagccttaaaatccctgcagaatcaggaa ggaactatttacacttactccaactttccaaaccctgagagattgagctctgctgaccca acagggtgtcagctgtttgggttgctggacattctcttgaagatacattgggaaaaaggc ttatggggccataaactggcaataaaaatatgggagatttggtcattggtgcgtgccgca ctggagccatttcagacaaatgatgaggctgagtcagaggaggagaaggaggagtttgat aatcaggactctgaaacgcctctaccgagtactagccaaaaggagagtccggaagtaatt tatgccaatccccccagtcttcctaaacctattcagaaactcattcagctcagggttcct caagaggaatgtccggaatggccacctcctcctcagccgattcattatggagagggggta attcaggttcgccctgcagttcattacagtgaaggagcaatttcccttaaaatggatgac ccagcgccctgtctgggtcgaacagtggcctaa >gi568815592f:27033072_27233461|GENSCAN_predicted_peptide_9|141_aa MAFISECGCTPSTSSASTGCCPVLGLTGSKQVWEVPLESPQGTVARILIGQVIMSIRTKL QNKEHVIEALRRVKFKFPGRQKVHISKKGGFTKFDADEFEDMVAEKQLIPDGCGIKYIPD RGPLDKGGLCIHEGFHCAASS >gi568815592f:27033072_27233461|GENSCAN_predicted_CDS_9|426_bp atggctttcatatccgagtgcggctgcaccccttccacgtcatctgcatcaacaggatgt tgtcctgtgctgggactgacaggctccaaacaggtatgggaggtgcctttggaaagcccg cagggcactgtggccaggattctcattggccaagttatcatgtccattcgcaccaagctg cagaacaaggagcatgtgattgaggcccttcgcagggtcaagttcaagttccctggccgc cagaaggtccacatttcaaagaaggggggcttcaccaagttcgatgcagatgaatttgaa gacatggtggctgagaagcagctcatcccagatggctgtgggatcaagtacatccccgat cgtggccctctggacaagggagggctttgcattcatgagggcttccactgtgctgcctcc tcttaa