GENSCAN 1.0    Date run: 4-Nov-116    Time: 19:34:45

Sequence gi568815581r:50762983_50964017 : 201035 bp : 49.03% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.06 Intr -    638    415  224  0  2   44  116   100 0.420   6.25
 1.05 Intr -   1266   1183   84  1  0  100   40    57 0.582   1.89
 1.04 Intr -   5503   5406   98  0  2  127   42    34 0.013   2.35
 1.03 Intr -  18539  18245  295  0  1    8   53   295 0.010  14.07
 1.02 Intr -  30263  30127  137  1  2   62   94    75 0.337   5.71
 1.01 Init -  34646  34351  296  2  2   84   39   191 0.298  10.46
 1.00 Prom -  36389  36350   40                             -11.04

 2.00 Prom +  37773  37812   40                              -5.16
 2.01 Init +  37841  37944  104  1  2   78  -14    80 0.426  -3.49
 2.02 Term +  38016  38268  253  0  1   45   42   296 0.552  15.81
 2.03 PlyA +  40005  40010    6                              -0.45

 3.05 PlyA -  40206  40201    6                               1.05
 3.04 Term -  40543  40358  186  1  0   71   46   143 0.263   5.89
 3.03 Intr -  55251  55137  115  2  1   86  115    31 0.779   6.05
 3.02 Intr -  57578  57528   51  1  0   82   63    57 0.398   0.62
 3.01 Init -  69399  69392    8  0  2  108   77     0 0.229   1.42
 3.00 Prom -  70700  70661   40                              -7.56

 4.00 Prom +  72671  72710   40                              -6.26
 4.01 Init +  72956  73165  210  1  0   77  109   309 0.927  28.69
 4.02 Intr +  73327  73422   96  0  0  -12  109   118 0.391   4.11
 4.03 Intr +  73438  73578  141  0  0  -63  109   137 0.552   1.25
 4.04 Intr +  73594  73695  102  0  0  -87  109   151 0.489   0.07
 4.05 Intr +  73711  73812  102  0  0  -87  109   151 0.487   0.07
 4.06 Term +  76517  78037 1521  1  0  111   48  2285 0.977 217.31
 4.07 PlyA +  83966  83971    6                               1.05

 5.02 PlyA -  88125  88120    6                               1.05
 5.01 Sngl - 101035  99998 1038  1  0   74   42   623 0.980  53.43
 5.00 Prom - 101654 101615   40                              -3.66

 6.00 Prom + 101789 101828   40                              -9.65
 6.01 Init + 102059 102061    3  1  0   53   91     0 0.778  -3.20
 6.02 Term + 102298 103203  906  0  0  -39   55   419 0.499  18.20
 6.03 PlyA + 104311 104316    6                               1.05

 7.00 Prom + 106003 106042   40                              -3.66
 7.01 Init + 116068 116129   62  0  2   74  115    18 0.093   3.82
 7.02 Intr + 137249 137398  150  2  0   92   82   160 0.579  15.08
 7.03 Intr + 155096 155214  119  2  2   93  110    41 0.065   6.91
 7.04 Intr + 160778 160839   62  0  2   70   75    57 0.024   1.05
 7.05 Intr + 174323 174463  141  1  0  114  -33   105 0.010   1.55
 7.06 Intr + 179529 179704  176  2  2   98   82    11 0.005   0.24
 7.07 Intr + 187466 187533   68  2  2  119   49    91 0.909   6.85
 7.08 Intr + 187929 188003   75  1  0  153   32    57 0.896   6.09
 7.09 Term + 194301 194584  284  1  2   27   38   208 0.396   5.29
 7.10 PlyA + 195600 195605    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  18539  18241  299  0  2    8   54   308 0.957  14.83
S.002 Init -  21882  21840   43  0  1   76   44    60 0.823   0.98
S.003 Init +  25301  25373   73  1  1  104   75    79 0.960   9.43
S.004 Term +  26285  26304   20  0  2  134   49     0 0.886  -0.82

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815581r:50762983_50964017|GENSCAN_predicted_peptide_1|378_aa
METVLFTFHRFAGDKGYLMKEGLKVLMGKEFLGFLENQKDPLAADITMKDMDQCQDSTLN
FQNLFSLTAGLTTVDNNYFVVPMKQKGTKQAELSNYSALIDTQHTSGSSGGVQIGQAQDN
ILFREGKNRRCKAGSDDSDDAIWPERRRESPSGNHPETQKIEFATALRGRHTRRDRPGLK
TRHGSQRGLTMGTSRSVVALLLRLGGRAPSGGFLVTHTQPGRQEDRRLPGTATLDTQKPG
DFQKTHQHAGKTASAWQPGDPGPVPGLCLAGHVTSGEYKQEGTNPHLPTTDPSDNKKPSD
TCLGILFPLQTVFEFMGDLELVFKVRIQNPKENDFIEIELKRQELSYQNLLNVSCCELGI
KPERVEKIRKLPNTLLRK

>gi568815581r:50762983_50964017|GENSCAN_predicted_CDS_1|1134_bp
atggaaaccgtgttgtttacgtttcacagatttgccggggataaaggctacttaatgaag
gagggcctgaaagtactcatgggaaaggagttccttggatttttggagaatcaaaaagac
cctctggctgcagacataacaatgaaggacatggaccagtgccaagacagcacactgaac
ttccagaacttgttttcactcactgcggggctcaccactgtggacaacaactattttgta
gtacctatgaagcagaagggaacgaagcaggcagaactaagcaattactcagcccttata
gacacacaacacacttccggatccagtggtggggtgcagatagggcaagcacaggacaac
atcctcttcagagaagggaagaacagaagatgcaaagcaggctctgatgacagcgatgat
gccatttggccagagagacggcgcgagagccccagtgggaaccacccggagacccagaag
atcgaattcgccactgcgctgcgcggccgccacactcgccgggatcggccgggtttgaag
acgcgccatggaagccagaggggtctgacgatggggacctctcggagcgtcgtcgccctg
ctactgcgccttggcggccgcgctccatctggaggcttcctcgtgacgcatacgcagcct
ggccgccaagaagaccgtcgcctgcctgggacagccaccctggacacccagaagccagga
gatttccaaaagacacaccagcatgctgggaagacagcatcggcctggcagccaggtgac
ccaggtccagtgccaggcctgtgcctggctggccacgtgactagtggtgaatataaacaa
gaaggcacaaatccacacttgccaacaacggacccaagtgataacaagaaacccagtgac
acctgtctaggtattttgtttcctttacaaactgtatttgaatttatgggtgatttagag
cttgtgtttaaagtcagaattcagaaccccaaagaaaatgacttcattgaaattgaactg
aagagacaagaactgagttaccaaaacctactaaacgtgagttgctgtgaactggggatt
aaaccagaacgagtggagaagatcagaaagctaccaaacacactgctcagaaag

>gi568815581r:50762983_50964017|GENSCAN_predicted_peptide_2|118_aa
MGLSVASDAAKKSSDARTECIIGFGDWEVASVSAASYEGWGAAFLEEEEEEEEEEKEEEE
EEEEGEELLLIKFPWALLAEQELQLAAALGPFTLDGSLSESLPPGSKVLHCSSKEPPL

>gi568815581r:50762983_50964017|GENSCAN_predicted_CDS_2|357_bp
atgggattgtcagtagcatcagatgcagccaagaagtcaagtgatgcaagaactgaatgt
atcattggctttggtgattgggaggtggcctccgtgagtgcagccagttatgaaggctgg
ggagcagccttcttagaggaggaggaggaagaggaggaagaggagaaggaggaagaggag
gaggaggaggagggggaagagctgctcctgatcaagttcccttgggcattgctggctgag
caggagcttcagcttgcagctgctttgggaccattcaccctggatggaagcctgagtgaa
tcattacccccagggtctaaggtgctgcattgtagcagcaaagagcccccattataa

>gi568815581r:50762983_50964017|GENSCAN_predicted_peptide_3|119_aa
MGSVSISKAEEPMTVRIWAKIVTALCFLTSCPRGIFLLILLVSTEKSPLPGDSPNSSQVQ
GAGLEGGGLELLKPTGAGSLTVACEVFRDTPVAAILSFLVSFQSGPVVWFGFDSFEYTS

>gi568815581r:50762983_50964017|GENSCAN_predicted_CDS_3|360_bp
atggggagcgtttccatatcaaaagcagaagaacccatgactgtccgcatttgggccaaa
attgtcactgctctctgctttctgacttcctgtccccggggaattttcctgctcatcctg
ctagtctccactgagaaatcacctcttcctggtgattccccaaattcgagccaggtgcag
ggggcaggactggaaggaggtggcctggagctgctgaagcccacaggggctgggtctcta
acggtggcctgtgaggtcttcagagacaccccagttgctgccattctcagcttcctggtg
tccttccaatcggggcctgttgtttggtttggttttgactcgtttgagtatacttcctag

>gi568815581r:50762983_50964017|GENSCAN_predicted_peptide_4|723_aa
MWAPRCRRFWSRWEQVAALLLLLLLLGVPPRSLALPPIRYSHAGICPNDMNPNLWVDAQS
TCRRECETDQECEMDQVSGIQKPQCEADQVNGVQKPQCEMDQKWECEVDQVSGVQKPVCE
ADQVSGVQKPQCEMDQVSGIQKLECEADQKWEYEVDQVSGVQKPQCEMDQVSGIQKLECE
ADQKWEYEVDQVSGVQKPQCEMDQVSGIQKLECEADQECETYEKCCPNVCGTKSCVAARY
MDVKGKKGPVGMPKEATCDHFMCLQQGSECDIWDGQPVCKCKDRCEKEPSFTCASDGLTY
YNRCYMDAEACSKGITLAVVTCRYHFTWPNTSPPPPETTMHPTTASPETPELDMAAPALL
NNPVHQSVTMGETVSFLCDVVGRPRPEITWEKQLEDRENVVMRPNHVRGNVVVTNIAQLV
IYNAQLQDAGIYTCTARNVAGVLRADFPLSVVRGHQAAATSESSPNGTAFPAAECLKPPD
SEDCGEEQTRWHFDAQANNCLTFTFGHCHRNLNHFETYEACMLACMSGPLAACSLPALQG
PCKAYAPRWAYNSQTGQCQSFVYGGCEGNGNNFESREACEESCPFPRGNQRCRACKPRQK
LVTSFCRSDFVILGRVSELTEEPDSGRALVTVDEVLKDEKMGLKFLGQEPLEVTLLHVDW
ACPCPNVTVSEMPLIIMGEVDGGMAMLRPDSFVGASSARRVRKLREVMHKKTCDVLKEFL
GLH

>gi568815581r:50762983_50964017|GENSCAN_predicted_CDS_4|2172_bp
atgtgggccccaaggtgtcgccggttctggtctcgctgggagcaggtggcagcgctgctg
ctgctgctgctactgctcggggtgcccccgcgaagcctggcgctgccgcccatccgctat
tcccacgccggcatctgccccaacgacatgaatcccaacctctgggtggacgcacagagc
acctgcaggcgggagtgtgagacggaccaggagtgtgagatggaccaggtgagtgggatc
cagaagccacagtgtgaggcagaccaggtgaatggggtccagaagccgcaatgtgagatg
gaccagaagtgggagtgtgaggttgaccaggtgagtggggtccagaagccggtgtgtgag
gcggaccaggtgagtggggtccagaagccacagtgtgagatggaccaggtgagtgggatc
cagaagctggagtgtgaggcggaccagaagtgggagtatgaggtggaccaggtgagtggg
gtccagaagccacagtgtgagatggaccaggtgagtgggatccagaagctggagtgtgag
gcggaccagaagtgggagtatgaggtggaccaggtgagtggggtccagaagccacagtgt
gagatggaccaggtgagtgggatccagaagctggagtgtgaggcggaccaggagtgtgag
acctatgagaagtgctgccccaacgtatgtgggaccaagagctgcgtggcggcccgctac
atggacgtgaaagggaagaagggcccagtgggcatgcccaaggaggccacatgtgaccac
ttcatgtgtctgcagcagggctctgagtgtgacatctgggatggccagcccgtgtgtaag
tgcaaagaccgctgtgagaaggagcccagctttacctgcgcctcggacggcctcacctac
tataaccgctgctacatggatgccgaggcctgctccaaaggcatcacactggccgttgta
acctgccgctatcacttcacctggcccaacaccagccccccaccacctgagaccaccatg
caccccaccacagcctccccagagacccctgagctggacatggcggcccctgcgctgctc
aacaaccctgtgcaccagtcggtcaccatgggtgagacagtgagcttcctctgtgatgtg
gtgggccggccccggcctgagatcacctgggagaagcagttggaggatcgggagaatgtg
gtcatgcggcccaaccatgtgcgtggcaacgtggtggtcaccaacattgcccagctggtc
atctataacgcccagctgcaggatgctgggatctacacctgcacggcccggaacgtggct
ggggtcctgagggctgatttcccgctgtcggtggtcaggggtcatcaggctgcagccacc
tcagagagcagccccaatggcacggctttcccggcggccgagtgcctgaagcccccagac
agtgaggactgtggcgaagagcagacccgctggcacttcgatgcccaggccaacaactgc
ctgaccttcaccttcggccactgccaccgtaacctcaaccactttgagacctatgaggcc
tgcatgctggcctgcatgagcgggccgctggccgcgtgcagcctgcccgccctgcagggg
ccctgcaaagcctacgcgcctcgctgggcttacaacagccagacgggccagtgccagtcc
tttgtctatggtggctgcgagggcaatggcaacaactttgagagccgtgaggcctgtgag
gagtcgtgccccttccccagggggaaccagcgctgtcgggcctgcaagcctcggcagaag
ctcgttaccagcttctgtcgcagcgactttgtcatcctgggccgagtctctgagctgacc
gaggagcctgactcgggccgcgccctggtgactgtggatgaggtcctaaaggatgagaaa
atgggcctcaagttcctgggccaggagccattggaggtcactctgcttcacgtggactgg
gcatgcccctgccccaacgtgaccgtgagcgagatgccgctcatcatcatgggggaggtg
gacggcggcatggccatgctgcgccccgatagctttgtgggcgcatcgagtgcccgccgg
gtcaggaagcttcgtgaggtcatgcacaagaagacctgtgacgtcctcaaggagtttctt
ggcttgcactga

>gi568815581r:50762983_50964017|GENSCAN_predicted_peptide_5|345_aa
MQLEIQVALNFIISYLYNKLPRRRVNIFGEELERLLKKKYEGHWYPEKPYKGSGFRCIHI
GEKVDPVIEQASKESGLDIDDVRGNLPQDLSVWIDPFEVSYQIGEKGPVKVLYVDDNNEN
GCELDKEIKNSFNPEAQVFMPISDPASSVSSSPSPPFGHSAAVSPTFMPRSTQPLTFTTA
TFAATKFGSTKMKNSGRSNKVARTSPINLGLNVNDLLKQKAISSSMHSLYGLGLGSQQQP
QQQQQPAQPPPPPPPPQQQQQQKTSALSPNAKEFIFPNMQGQGSSTNGMFPGDSPLNLSP
LQYSNAFDVFAAYGGLNEKSFVDGLNFSLNNMQYSNQQFQPVMAN

>gi568815581r:50762983_50964017|GENSCAN_predicted_CDS_5|1038_bp
atgcagcttgaaatccaagtagcactaaattttattatttcgtatttgtacaataagctt
cccaggagacgtgtcaacatttttggtgaagaacttgaaagacttcttaagaagaaatat
gaagggcactggtatcctgaaaagccatacaaaggatcggggtttagatgtatacacata
ggggagaaagtggacccagtgattgaacaagcatccaaagagagtggtttggacattgat
gatgttcgtggcaatctgccacaggatcttagtgtttggatcgacccatttgaggtttct
taccaaattggtgaaaagggaccagtgaaggtgctttacgtggatgataataatgaaaat
ggatgtgagttggataaggagatcaaaaacagctttaacccagaggcccaggtttttatg
cccataagtgacccagcctcatcagtgtccagctctccatcgcctccttttggtcactct
gctgctgtaagccctaccttcatgccccggtccactcagcctttaacctttaccactgcc
acttttgctgccaccaagttcggctctaccaaaatgaagaatagtggccgtagcaacaag
gttgcacgtacttctcccatcaacctcggcttgaatgtgaatgacctcttgaagcagaaa
gccatctcttcctcaatgcactctctgtatgggcttggcttgggtagccagcagcagcca
cagcaacagcagcagccagcccagccgccaccgccaccaccaccaccacagcagcaacaa
cagcagaaaacctctgctctttctcctaatgccaaggaatttatttttcctaatatgcag
ggtcaaggtagtagtaccaatggaatgttcccaggtgacagcccccttaacctcagtcct
ctccagtacagtaatgcctttgatgtgtttgcagcctatggaggcctcaatgagaaatct
tttgtagatggcttgaattttagcttaaataacatgcagtattctaaccagcaattccag
cctgttatggctaactaa

>gi568815581r:50762983_50964017|GENSCAN_predicted_peptide_6|302_aa
MPECRQRQLIKPNCLKVQPSEIQARWKVVTKQRLKTGRSVKNYINSQLDLIRFLLTALGA
QALQQQQQQEEVEVEEEESTSLLGEGGGRPGKWKSNLSRLGTQHRAGRGAGSAAPLPRRN
PTRCSPPRGPRPGLLGAGQSLGEHRFLGTIFSLAATGPSNPSVPPRARRSSGHPNPPIPV
SSPRAARPRPTAQDEPDAPAAPRSRSPRVLALPPRRPGPRPRRPRAPEGPGLDAAAAPPP
PPGPGAPVSSPTPPGAPRYLRAASGRLGPQTRGWAGSSAAGRGRGRVGSGELAECKTLPL
RG

>gi568815581r:50762983_50964017|GENSCAN_predicted_CDS_6|909_bp
atgcctgagtgccggcagaggcagctgattaagcccaattgcttgaaggtgcagccgagc
gaaattcaggctcggtggaaggtagttacaaagcaacgccttaagacaggcaggagtgtg
aagaattacataaacagccagttagatctcattagatttctgctcacagccctcggagcc
caagcgctacagcagcagcaacagcaagaggaggtggaggtggaggaggaagagagtacg
agcctcctcggagaaggaggaggaaggcccgggaaatggaagagcaacctctcccggctg
ggcacccagcaccgagcaggtcggggcgccgggtcagccgctcccctcccccgccgtaac
ccgacccgctgctccccgccccgtggcccccgccccgggctcctgggagccggccagagc
ctgggggagcaccggttcctcggcaccattttcagcctcgcggccaccggcccctcgaac
cccagcgttccaccgcgagctcgtcgctcctccgggcaccccaaccccccaattcccgtc
agctcgccccgggccgcccgtccccggccaactgcccaggacgagcccgacgcgccggcg
gcccctcggagccgctcccctcgggtcctggcgctgccccctcggcgcccagggccgcga
ccccgccgtccccgagctcccgagggcccagggctcgacgccgctgcagcccctcctccg
ccgcctgggcccggggctcccgtctcctcgccgaccccgcccggcgccccgagatactta
cgggctgcttcggggcggcttggaccacagacccgcggctgggcggggagcagcgcggca
gggcgggggcgcgggagggtcgggagcggcgagctcgccgagtgcaagacgttgccgctg
cggggctga

>gi568815581r:50762983_50964017|GENSCAN_predicted_peptide_7|378_aa
MEKNWFKIYLQKRHKAKVVLRKKKEWTGSAESGGCVEAARASERSSQCRSTIWDTGGIQG
PSQDIQGMWTRAPRWVALPPQDTGLGPPLQNRFEAKQWHVTVQLSRRPQQGTSGSEILAT
VQGIEVLPENKDPIQDLTFYVVVIVLPPEAPLGCDHFSDFYFDENILMTLTVLRSPGLPS
SYLIVMHCVTAIQTAHFEFPCAACLDGGLSLCPESRLHGKSSGSQFGCLWVRSSCGGSAL
PQAPPGRLHQVCAPAVTPSGHPCITLLDTRSLSPASLTSSTMLLGFIPMPRDKLVRQTKQ
FVPSGAGYMILSIKDLSTFVTIGIWLGMKAASPHNPSCNGRDGPPSKDVTLNIALQGIQA
KSVLVPKDNYNWGLREMR

>gi568815581r:50762983_50964017|GENSCAN_predicted_CDS_7|1137_bp
atggaaaaaaactggtttaagatctacctccaaaaaagacacaaagccaaggtggtttta
cgaaagaaaaaggaatggacagggtcagcagaatctggaggctgcgttgaagctgcccgt
gcttctgaacgaagcagccagtgccgctcgaccatctgggacaccggtggaattcagggc
ccctctcaggacattcagggcatgtggacaagggcaccccgatgggtggctctgccaccc
caggacactggactggggccacccttacagaaccggttcgaagccaaacaatggcacgtc
acagtccagctgagccggaggcctcagcagggcaccagtggatctgagattttggcaaca
gtccagggcattgaggttttgccagaaaataaggatcccatccaggatctcacattctac
gtcgttgtcattgtcttgcctcctgaggctcctcttggctgtgaccacttctcggacttt
tattttgatgaaaatattttgatgaccctgacagttttgagaagtcctggtttgccctct
tcttacttgattgtgatgcattgtgttactgccatccagactgcccatttcgagttcccg
tgtgcagcctgcctggacggaggccttagcctctgccctgagtcccggttgcatgggaag
tcctctgggtctcagtttggatgtttgtgggtgaggagcagctgcggtgggtcggccttg
ccgcaggcaccccctggacggctgcaccaggtctgcgcgccagcggtgacccccagtggt
cacccttgcattactctcctggacactcgtagcctctcacctgcgtcactgacatcctca
acgatgctgctggggttcatcccgatgcctcgggacaaactggtgcgtcaaacaaagcag
tttgtaccaagtggtgcaggctacatgattctctccattaaggatctgagcacgtttgtg
actattgggatctggctgggcatgaaggctgctagtccacacaacccaagttgcaatgga
agagacggcccaccctcaaaggatgtcaccttgaacatagccttacagggcatccaagcc
aagtccgtgcttgtccccaaggacaactacaactggggactcagagagatgagatga