GENSCAN 1.0 Date run: 4-Nov-116 Time: 17:46:08 Sequence gi568815585r:41092206_41294257 : 202052 bp : 40.64% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.05 PlyA - 1206 1201 6 1.05 1.04 Term - 15529 14840 690 1 0 93 43 174 0.214 6.00 1.03 Intr - 40446 38303 2144 1 2 12 65 2000 0.013 176.47 1.02 Intr - 41911 41826 86 0 2 42 105 79 0.009 3.54 1.01 Init - 78836 78565 272 2 2 49 91 282 0.419 21.09 1.00 Prom - 84864 84825 40 -5.95 2.02 PlyA - 84990 84985 6 1.05 2.01 Sngl - 102052 99998 2055 1 0 96 36 2151 0.999 203.67 2.00 Prom - 106902 106863 40 -5.35 3.00 Prom + 107036 107075 40 -5.75 3.01 Init + 121343 121419 77 1 2 1 101 111 0.495 4.31 3.02 Term + 121529 121937 409 2 1 46 45 216 0.738 6.80 3.03 PlyA + 122926 122931 6 1.05 4.12 PlyA - 123403 123398 6 1.05 4.11 Term - 125023 124910 114 1 0 84 36 125 0.963 4.49 4.10 Intr - 131149 131051 99 1 0 76 85 88 0.989 6.69 4.09 Intr - 134363 134227 137 0 2 107 110 70 0.995 10.47 4.08 Intr - 141802 141685 118 1 1 58 115 155 0.999 14.32 4.07 Intr - 148228 148056 173 2 2 110 76 218 0.744 21.54 4.06 Intr - 149617 149491 127 1 1 36 -9 131 0.194 -2.47 4.05 Intr - 155357 155092 266 0 2 33 38 302 0.228 16.01 4.04 Intr - 160547 160440 108 0 0 125 66 112 0.788 12.04 4.03 Intr - 160825 160744 82 1 1 57 78 68 0.945 0.99 4.02 Intr - 162415 162324 92 2 2 93 92 100 0.999 9.69 4.01 Init - 168501 168288 214 0 1 77 56 173 0.973 11.95 4.00 Prom - 178854 178815 40 -5.55 5.04 PlyA - 178982 178977 6 1.05 5.03 Term - 195888 195629 260 1 2 67 44 177 0.489 5.93 5.02 Intr - 196154 196073 82 2 1 69 37 122 0.632 3.49 5.01 Init - 197603 197424 180 2 0 77 16 221 0.691 13.04 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 40306 38282 2025 1 0 96 44 1936 0.951 183.89 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815585r:41092206_41294257|GENSCAN_predicted_peptide_1|1063_aa MRSLGQNPTEAELQDMINEVDADGNGTVDFPEFLTMMARKMKDTDSEEEIRDAFCVFDKD GNGYISATELHHVMTNLGENLTDDEVDEMIRGNGKKSGLIVLTTVDSDERGRQWQGFTGG LNSVNGLVLLSLRRRCYLSVSEGRLRRSQSRVLQRFSPSAPVAISTMQSREDAPRSRRLA SPRGGKRPKKIHKPTVSAFFTGPEELKDTAHSAALLAQLKSFYDARLLCDVTIEVVTPGS GPGTGRLFPCNRNVLAAACPYFKSMFTGGMYESQQASVTMHDVDAESFEVLVDYCYTGRV SLSEANVERLYAASDMLQLEYVREACASFLARRLDLTNCTAILKFADAFGHRKLRSQAQS YIAQNFKQLSHMGSIREETLADLTLAQLLAVLRLDSLDVESEQTVCHVAVQWLEAAPKER GPSAAEVFKCVRWMHFTEEDQDYLEGLLTKPIVKKYCLDVIEGALQMRYGDLLYKSLVPV PNSSSSSSSSNSLVSAAENPPQRLGMCAKEMVIFFGHPRDPFLCCDPYSGDLYKVPSPLT CLAHTRTVTTLAVCISPDHDIYLAAQPRTDLWVYKPAQNSWQQLADRLLCREGMDVAYLN GYIYILGGRDPITGVKLKEVECYNVKRNQWALVAPLPHSFLSFDLMVIRDYLYALNSKRM FCYDPSHNMWLKCVSLKRNDFQEACVFNEEIYCICDIPVMKVYNPVRAEWRQMNNIPLVS ETNNYRIIKHGQKLLLITSRTPQWKKNRVTVYEYDIRGDQWINIGTTLGLLQFDSNFFCL SARVYPSCLEPGQSFLTEEEEIPSESSTEWDLGGFSEPDSESGSSSSLSDDDFWDSIFLL LPFMVLGLGSNPAPRSEWVWGVERGQAVRADTPEPAGTGGGGWWGVGMCRVGLPLVPGSC LFLEPEAQTCSCGCCSCSYTWKGRSYLFPALPRVQGGSDPLLHFGRLQPCPGGWGFCLLG GVEQEAWICRCVFGSCSCTGSSHPNLEGAGLPLAPWSVQPQLHFPAAAGVMVAAPAITTV HMRSLTRLDLGLPSVTCGISDYAPIHLPFHYLFYTRDTVDVLY >gi568815585r:41092206_41294257|GENSCAN_predicted_CDS_1|3192_bp atgaggtctcttgggcagaatcccacagaagcagagttacaggacatgattaatgaagta gatgctgatggtaatggcacagttgacttccctgaatttctaacaatgatggcaagaaaa atgaaagacacagacagtgaagaagaaattagagatgcattctgtgtgtttgataaggat ggcaatggctatattagtgcaacagaacttcaccatgtgatgacaaaccttggagagaat ttaacagatgacgaggttgatgaaatgatcagagggaatgggaagaagagtggcttgatt gttttgactacggtggacagtgatgagagaggaagacagtggcagggtttcacaggaggg ttgaattcagtaaatgggctcgtgctgctgtctcttcggagacgctgctatcttagcgtc agcgagggaaggttgaggaggagccagagccgggtcctgcagcgtttctcgccatcagcg cccgtcgccatctccaccatgcagtcccgggaagacgccccgcgctctcgccgcctagcc agtccccgtggtgggaagcggcccaagaagattcacaaacccacagtttcggcctttttc acgggtccagaggaattaaaggacacggcccattctgcagccctgctggcacagctcaag tccttctacgatgcgcggctgctgtgtgatgtgaccatcgaggtggtgacgcctggcagc gggcctggcacgggtcgcctgttcccctgcaaccgcaatgtgctggccgcggcatgtccc tacttcaagagcatgttcacaggtggcatgtacgagagccagcaggccagcgtgaccatg cacgatgtggacgccgagtccttcgaggtgttggtcgactactgctacacgggtcgtgtg tctctcagtgaggccaacgtggagcgcctgtacgcggcctccgacatgctacagctggaa tatgtgcgggaagcctgtgcctccttcttagcccgacgtcttgacctgaccaactgcacc gccatcctcaagtttgcagatgcctttggccatcgcaagctgcgatcccaggcccagtcc tatatagctcagaacttcaagcaactcagccacatgggttcaattcgggaggagactcta gcagatctgaccctggcccagctgctggctgtcctgcgcttggatagtctggacgtggag agtgagcagacagtgtgccatgtggcagtgcagtggctggaggctgctcccaaagagcgg ggtcccagtgctgcagaagtcttcaagtgcgtgcgctggatgcacttcactgaagaagat caggactacttagaagggctgctgaccaagcccatcgtgaagaagtactgcctggacgtt attgaaggggccctgcagatgcgctatggtgacctgttgtacaagtctctggtgccagtg ccaaacagcagcagcagcagtagcagcagcaactctcttgtatctgcagcagaaaatcca ccccagagactgggtatgtgtgccaaggagatggtgatcttctttggacaccccagagat ccctttctctgctgtgatccatactcgggggacctttacaaagtgccgtcacctttgacc tgtctggctcacactaggactgtcaccactttagctgtctgtatctctcctgaccatgac atctatctagctgctcagcccaggacagacctctgggtgtataaaccagctcagaatagt tggcagcaacttgcagatcgcttgctgtgtcgtgagggcatggatgtggcatatctcaat ggctatatctacattttgggggggcgagaccctattactggagttaagttgaaggaagtg gaatgctacaatgttaagagaaaccagtgggcattggtggctccactgccccattctttt ttatcctttgacctaatggtaattcgagactatctctatgctctcaacagtaagcgcatg ttctgttatgatcctagccacaatatgtggctgaagtgcgtttctctgaagcgcaatgac tttcaggaagcctgcgtcttcaatgaggagatctattgtatctgtgatatcccagtcatg aaggtctacaacccagttagggcagaatggaggcaaatgaataatattcccttggtctca gagaccaacaactacagaattatcaagcatggccaaaaattgttgctcatcacctctcgc accccacagtggaaaaagaaccgggtgactgtgtatgaatatgatattaggggagaccaa tggattaatataggtaccacattaggcctcttgcagtttgattctaactttttttgcctc tctgctcgtgtttatccttcctgccttgaacctggtcagagtttcctcactgaagaagaa gaaataccaagtgagtctagcactgaatgggacttaggtggattcagtgagccagactct gagtcaggaagttcaagttctctttctgatgatgatttttgggactctatcttcctcctg ctgccattcatggtcctggggcttggttccaaccctgctccgagatcagagtgggtgtgg ggagtggagagaggccaggcagtgagagcagacacccctgagcctgcagggacaggtggt gggggttggtggggggtagggatgtgtcgggtggggctcccgcttgtccctggctcctgc ctgttcctggagccggaggcccagacctgcagctgcgggtgctgcagctgcagctacacc tggaagggcagatcctacttgttcccagctctcccaagagtacagggaggctcggatcca ttgctgcattttgggcggctgcagccctgcccaggagggtggggcttctgcctgctcggt ggagtagagcaggaggcctggatctgcaggtgcgttttcggcagctgcagctgcacaggg agctcccatcccaacttagaaggggccgggctcccccttgctccatggagtgtgcaaccc cagctgcacttccctgctgcagccggcgtgatggtagcagcccctgccatcaccaccgta cacatgaggtcattgacccgtctggatttaggtttaccatctgttacttgcggaatttca gactatgctccaatccacctaccttttcattatttgttttacacaagggatactgttgat gtcctctactag >gi568815585r:41092206_41294257|GENSCAN_predicted_peptide_2|684_aa MQSREDVPRSRRLASPRGGRRPKRISKPSVSAFFTGPEELKDTAHSAALLAQLKSFYDAR LLCDVTIEVVTPGSGPGTGRLFSCNRNVLAAACPYFKSMFTGGMYESQQASVTMHDVDAE SFEVLVDYCYTGRVSLSEANVQRLYAASDMLQLEYVREACASFLARRLDLTNCTAILKFA DAFDHHKLRSQAQSYIAHNFKQLSRMGSIREETLADLTLAQLLAVLRLDSLDIESERTVC HVAVQWLEAAAKERGPSAAEVFKCVRWMHFTEEDQDYLEGLLTKPIVKKYCLDVIEGALQ MRYGDLLYKSLVPVPNSSSSSSSSNSLVSAAENPPQRLGMCAKEMVIFFGHPRDPFLCYD PYSGDIYTMPSPLTSFAHTKTVTSSAVCVSPDHDIYLAAQPRKDLWVYKPAQNSWQQLAD RLLCREGMDVAYLNGYIYILGGRDPITGVKLKEVECYSVQRNQWALVAPVPHSFYSFELI VVQNYLYAVNSKRMLCYDPSHNMWLNCASLKRSDFQEACVFNDEIYCICDIPVMKVYNPA RGEWRRISNIPLDSETHNYQIVNHDQKLLLITSTTPQWKKNRVTVYEYDTREDQWINIGT MLGLLQFDSGFICLCARVYPSCLEPGQSFITEEDDARSESSTEWDLDGFSELDSESGSSS SFSDDEVWVQVAPQRNAQDQQGSL >gi568815585r:41092206_41294257|GENSCAN_predicted_CDS_2|2055_bp atgcagtcccgggaagacgtcccgcgctctcgccgcctcgccagtccccgtggtgggagg cggcccaagaggatttccaagccctcggtttcggcctttttcacgggtccagaggagtta aaggacacggcccattctgcagccctgctggcacagctcaagtccttctacgacgcgcgg ctgctgtgtgatgtgaccatcgaggtggtgacgcctggcagcgggcctggcacgggtcgc ctcttttcctgcaatcgcaacgtgctagcagctgcgtgtccctacttcaagagcatgttc acaggtggcatgtacgagagccagcaggccagcgtgaccatgcacgatgtggacgccgag tccttcgaggtgttggtcgactactgctacacgggtcgtgtgtctctcagtgaggccaat gtgcagcgcctgtacgcggcctccgacatgctacagctggaatatgtgcgggaagcctgt gcctccttcttagcccgacgtcttgacctgaccaactgcaccgccatcctcaagtttgca gacgccttcgaccatcacaagcttcgatctcaggcccagtcctacatagctcacaacttc aagcagctcagccgaatgggttcaattcgggaggagactctagcagatctaaccctggcc cagctgctggctgtcctacgcctggatagtctggacatagagagtgagcggactgtatgc catgtagctgtgcagtggctggaggctgctgccaaagagcggggtcccagtgctgcagaa gtcttcaagtgcgtgcgctggatgcacttcactgaagaagatcaggactacttagaaggg ctgctgaccaagcccatcgtgaagaagtactgcctggacgttattgaaggggccctgcag atgcgctatggtgacctgttgtacaagtctctggtgccagtgccaaacagcagcagcagc agtagcagcagcaactctcttgtatctgcagcagaaaatccaccccagagactgggtatg tgtgccaaggagatggtgatcttctttggacatcctagagatccctttctctgctatgac ccttactcgggggacatttacacaatgccatcccctttgaccagctttgctcacactaag actgtcacctcctcagctgtctgtgtgtccccagaccatgacatctatctagctgctcag cccaggaaagacctctgggtgtataaaccagctcagaatagttggcagcaacttgcagat cgcttgctgtgtcgtgagggcatggatgtggcatatctcaatggctacatctacattttg gggggacgagaccctattactggagttaagttgaaggaagtggaatgctacagtgttcag agaaaccagtgggcattggtggctcctgtccctcattccttctattcctttgaactcata gtggttcagaactatctttatgctgtcaacagtaagcgcatgctttgctatgatcctagc cacaatatgtggctgaactgtgcttctcttaaacgtagtgactttcaggaagcatgtgtc ttcaatgatgaaatctattgtatctgtgacatcccagtcatgaaggtctacaacccagct aggggagaatggaggcggattagtaatattcctttggattcagagacccacaactaccag attgtcaatcatgaccaaaagttgcttctcatcacttctacaaccccacaatggaaaaag aaccgagtgacagtgtatgagtatgatactagggaagatcagtggattaatataggtacc atgttaggccttttgcagtttgactctggctttatttgcctttgtgctcgtgtttatcct tcctgccttgaacctggtcagagttttattactgaggaagatgatgcacggagtgagtct agtactgaatgggacttagatggattcagtgagctggactctgagtcaggaagttcaagt tctttttcagatgatgaagtctgggtgcaagtagcacctcagcgaaatgcacaggatcag cagggttctttgtaa >gi568815585r:41092206_41294257|GENSCAN_predicted_peptide_3|161_aa MFAQAVRLYNKQVVKPAKLVSVSSGWLAPKPQDKILHTLPSPFHKQRGLSLWPLPPLTHG EFCQTTTSVHLRPKGSSVSCDKCCQAWNSPLRVVKSPLAQCRSRNAIQEPGPGDPKSPLV LYTMAKLIPEAGMSESHSRSTAYCLCIAVGYSGPMGSLVSR >gi568815585r:41092206_41294257|GENSCAN_predicted_CDS_3|486_bp atgtttgcacaggccgtaaggctctacaataagcaggtggtgaagccagccaagcttgtg tccgtctcttcaggatggctggcacccaaaccacaagacaaaatccttcacactcttccc tcccctttccacaagcagaggggcctctccctatggcctctgccaccactgacccatggc gaattctgtcagaccaccactagtgttcacttaaggcccaagggctcttcagtcagctgt gacaaatgttgccaggcctggaactcacccctcagggtagtgaaatcccctctggcacag tgcaggtccagaaatgctatccaagagcctggacctggggaccccaagagcccgttggtg ctctacaccatggccaagctgatacctgaggctggcatgtccgagtctcactcaaggtcc acagcatactgcctgtgcatcgctgttggttactcagggcccatgggctctttagtcagc aggtga >gi568815585r:41092206_41294257|GENSCAN_predicted_peptide_4|509_aa MLWKHKALQKYMENLSKEYQTLEQCLQHIPVNEENRRSLNRRHAELAPLAAIYQEIQETE QAIEELESMCKSLNKQDEKQLQELALEERQTIDQKINMLYNELFQSLVPKEKYDKNDVIL EVTAGRTTGGDICQQFTREIFDMYQNYSCYKHWQFELLNYTPADYALLFRLYRRQVDPGE MMDYRKLNQAVNPIAAAIPDVVSSVEQINIIPGTWYTAVDVANAFFSIPVNESQQKQFAF IQQGHNTPSLSSVRLPNCCRYRGQQNRVTCTKNMQSAKTDCWKFYRSAAQVLHQIICGLH HAAARISGDGVYKHLKYEGGIHRVQRIPEVGLSSRMQRIHTGTMSVIVLPQPDEVDVKLD PKDLRIDTFRAKGAGGQHVNKTDSAVRLVHIPTGLVVECQQERSQIKNKEIAFRVLRARL YQQIIEKDKRQQQSARKLQVGTRAQSERIRTYNFTQDRVSDHRIAYEVRDIKEFLCGGKG LDQLIQRLLQSADEEAIAELLDEHLKSAK >gi568815585r:41092206_41294257|GENSCAN_predicted_CDS_4|1530_bp atgctctggaagcataaagcactacagaaatatatggagaacctgagtaaggagtaccaa acacttgagcaatgtctgcagcatatccctgtgaatgaggaaaaccgaaggtccttgaac agaaggcatgctgagttggcacctcttgcagccatttaccaagaaattcaggagactgaa caagcaattgaagaattagaatcaatgtgtaaaagcctaaataaacaagatgaaaagcag ttacaagaacttgcactggaagaaaggcaaaccattgatcaaaaaatcaacatgttgtac aatgagcttttccagagccttgtgccaaaggagaaatatgacaaaaatgatgttatttta gaggtgacagctggaaggactactggaggtgacatctgccaacaatttacccgagaaata tttgacatgtaccagaattattcgtgctataaacactggcaatttgaacttctgaattat acaccagcagattatgctctcctatttcgcctgtacagaagacaggtggatcctggagaa atgatggattatcgaaagcttaaccaggcagtgaatccaattgcagctgctataccagat gtggtttcatcagttgagcaaattaacataatccctggtacctggtatacagctgttgat gtggcaaatgcctttttctccatccctgtcaatgaaagccagcagaagcagtttgctttc atccagcaaggccacaatacaccttcattgtcctccgttaggctgcctaattgctgccga tacagagggcagcagaacagggtgacttgcaccaagaatatgcagtctgcaaaaacagac tgttggaaattctacaggtcagcagcccaagttcttcaccagatcatttgtggactacat catgcagccgcccgaatttccggtgacggtgtctataagcatttgaagtatgagggtggg attcaccgagttcagcgcatccccgaggtgggcctgtcctcaaggatgcagcgcattcac acaggaacgatgtcggttattgtccttcctcagccagatgaggtggatgtgaaattggac cccaaggatttgcgaatagatacatttcgagccaaaggagcaggagggcagcatgttaat aaaactgatagtgccgtcagacttgtccacatccccacagggctagtagtagaatgccaa caagaaagatcacagataaaaaataaagaaatagcctttcgtgtgttgagagctagactc taccagcagattattgagaaagacaagcgtcagcaacaaagtgctagaaaactgcaggtg ggaacaagagcccagtcagagcgaattcggacatataatttcacccaggatagagtcagt gaccacaggatagcatatgaagttcgtgatattaaggaatttttatgtggtgggaagggc ctggatcagctaattcagagactgcttcaatcagcagatgaagaagccattgctgaactt ttggatgaacaccttaaatcagcaaaataa >gi568815585r:41092206_41294257|GENSCAN_predicted_peptide_5|173_aa MEKNWALFVDQCRLQALRFSVDLFDLLSILVRCDSFAGIQKAVVDQIGSRPSNSDHDLCL CLTTTARAQRVTQVQAIKCVVVGDGATNIFLICFSLVSLASFENVCAEWHPEMQHHFPNT PIILVGSKLDLNVVKDTIEKLKEKKLTPITYLQFLARNKEIGALKYLKCLALT >gi568815585r:41092206_41294257|GENSCAN_predicted_CDS_5|522_bp atggagaagaattgggccctatttgttgaccaatgccggctgcaggctttgcggttttca gtggatctctttgatttgctgagcatacttgtcagatgtgatagttttgcagggattcag aaagctgtcgtggatcagatcggcagcagaccatcaaacagtgaccatgacctttgtctg tgcctaaccaccactgccagggcccagcgagtgacccaggtgcaagccatcaagtgtgtg gtggttggagatggagctacaaacatatttttaatatgcttctcccttgtgagtcttgca tcatttgaaaatgtctgtgcagagtggcatcctgaaatgcagcaccattttcccaacact cccatcattcttgtgggaagtaaacttgatcttaatgttgttaaagacacgattgagaaa ctgaaggagaagaagctgactcccatcacctatctgcagtttttagccaggaataaggag attggcgctttaaaatacctgaagtgcttggctctcacatag