GENSCAN 1.0 Date run: 7-Nov-116 Time: 18:14:28 Sequence gi568815585r:41030490_41232511 : 202022 bp : 40.43% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 11158 11301 144 0 0 29 93 78 0.011 1.96 1.02 Intr + 24554 24685 132 1 0 73 110 46 0.464 5.22 1.03 Intr + 30418 30672 255 0 0 35 49 315 0.885 19.02 1.04 Intr + 31024 31186 163 0 1 13 84 298 0.977 20.53 1.05 Term + 31330 31526 197 2 2 50 46 144 0.590 2.99 1.06 PlyA + 32018 32023 6 1.05 2.00 Prom + 33310 33349 40 -6.85 2.01 Init + 33817 33855 39 0 0 64 63 6 0.526 -4.06 2.02 Intr + 34675 34798 124 0 1 77 98 148 0.906 13.94 2.03 Intr + 38072 38248 177 0 0 98 107 108 0.997 12.67 2.04 Intr + 41038 41084 47 2 2 22 86 55 0.429 -4.09 2.05 Intr + 42293 42368 76 1 1 69 107 64 0.611 4.57 2.06 Intr + 45555 45748 194 1 2 40 100 147 0.972 9.39 2.07 Intr + 50157 50320 164 2 2 87 89 126 0.994 10.45 2.08 Term + 52215 52425 211 0 1 41 33 182 0.835 3.78 2.09 PlyA + 52560 52565 6 1.05 3.02 PlyA - 52588 52583 6 1.05 3.01 Sngl - 63085 62918 168 1 0 94 43 112 0.613 1.81 3.00 Prom - 66803 66764 40 -1.65 4.05 PlyA - 66920 66915 6 1.05 4.04 Term - 77245 76556 690 1 0 93 43 174 0.215 6.00 4.03 Intr - 102162 100019 2144 1 2 12 65 2000 0.013 176.47 4.02 Intr - 103627 103542 86 0 2 42 105 79 0.009 3.54 4.01 Init - 140552 140281 272 2 2 49 91 282 0.419 21.09 4.00 Prom - 146580 146541 40 -5.95 5.02 PlyA - 146706 146701 6 1.05 5.01 Sngl - 163768 161714 2055 1 0 96 36 2151 0.999 203.67 5.00 Prom - 168618 168579 40 -5.35 6.00 Prom + 168752 168791 40 -5.75 6.01 Init + 183059 183135 77 1 2 1 101 111 0.495 4.31 6.02 Term + 183245 183653 409 2 1 46 45 216 0.738 6.80 6.03 PlyA + 184642 184647 6 1.05 7.04 PlyA - 185119 185114 6 1.05 7.03 Term - 186739 186626 114 1 0 84 36 125 0.963 4.49 7.02 Intr - 192865 192767 99 1 0 76 85 88 0.989 6.69 7.01 Intr - 196079 195943 137 0 2 107 110 70 0.858 10.47 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 16472 15900 573 2 0 43 44 251 0.903 12.11 S.002 Sngl - 102022 99998 2025 1 0 96 44 1936 0.951 183.89 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815585r:41030490_41232511|GENSCAN_predicted_peptide_1|296_aa PEQQSETLFPGGKKKGIQKTSKSQSIDQMHQNHLKMHTLLSPTPKDSTVPSPASSLMIQR ETSQSPCSKVTLLTVVCEREEDMSPDTSSSLQLLLPPPPPPPPPPLLLPTRSRAREQAPS ERVGVVGERRGGAAPLQQQSLRRSLATRSAGPAVSPLFFLTPPEDPDPQVPVAPLVKLGT AERWSRGLSKVSGSEGGSGGHRAAGRARLVPLGKRESEWNPGRLAERLAQSYLQGRFSGR GRNVTPLNALGYGARALGGGRRPVQGAGWTAAGEATQCSLYDAVGGRMHPLGFRSP >gi568815585r:41030490_41232511|GENSCAN_predicted_CDS_1|891_bp cctgagcaacagagtgagaccctttttccagggggaaaaaaaaagggcatccagaagact tcaaaaagccagtccatagatcagatgcaccagaatcatctaaagatgcacacactcctg agtcctacacctaaagattcaactgtgccaagccctgcttcaagtctgatgatacagagg gaaacaagtcaaagcccctgctccaaagttacactcctgacagtggtctgtgagcgggag gaggatatgtccccagacacttccagctctctgcagctgctgctgccgccgccgccgccg ccgccgccgccgccgctgctgctgcccacacgctcccgagctagggaacaagccccatcc gagcgcgtgggcgtcgtaggggagaggaggggcggggccgctccgctgcaacagcaaagt ttgaggcgaagcctagcgactcgcagcgccggtcccgcagtttcacctctttttttttta actccgccagaggaccccgacccacaggtcccggttgctccgctggtcaaattgggaaca gcggaacgctggtcccggggactgagtaaggtgtctggatcggagggaggttcgggtggg catcgggcggctggaagagctcgactcgtcccgctgggaaagcgcgagtctgagtggaac cctggacgacttgcagagcggctggcgcagtcatacctgcagggccggttctcagggcgg ggtagaaatgttaccccgctgaacgccctggggtatggggcacgggctctagggggaggc cggcggccggttcagggggctggttggacggccgcgggtgaagcgacgcagtgctctctc tacgacgctgtcgggggtcgcatgcaccccttgggtttccgaagcccctga >gi568815585r:41030490_41232511|GENSCAN_predicted_peptide_2|343_aa MDTVLLLQTVLQSIKQKSLDKAKEEEKASKEFAAMEAAALKAYQEDLKRLGLESEILEPS ITPVTSTIPPTSTSNQQKEKKEKKKRKKDPSKGRWVEGITSEGYHYYYDLISGASQWEKP EGFQGDLKKTAVKTVWVEGLSEDGFTYYYNTETGESRWEKPDDFIPHTSDLPSSKVNENS LGTLDESKSSDSHSDSDGEQEAEEGGVSTETEKPKIKFKEKNKNSDGGSDPETQKEKSIQ KQNSLGSNEEKSKTLKKSNPYGEWQEIKQEVESHEEVDLELPSTENEYVSTSEADGGGEP KVVFKEKTVTSLGVMADGVAPVFKKRRTENGKSRNLRQRGDDQ >gi568815585r:41030490_41232511|GENSCAN_predicted_CDS_2|1032_bp atggataccgtattgttattgcaaacagtgctgcagtcaattaaacagaaaagcctggat aaggcaaaggaagaagaaaaggcatcaaaggagtttgctgcaatggaggcagctgccctg aaagcataccaagaggatttgaaaagacttggcttagagtcagaaattttggagccaagc ataacaccagtaaccagcactatcccacctacctcgacatcaaatcaacagaaagaaaag aaagaaaagaagaaaagaaaaaaagatccttcaaagggcagatgggtagaaggcataacc tctgagggttaccattactattatgatcttatctcaggagcatctcagtgggagaaacct gaaggatttcaaggagacttaaaaaagacagcagtgaagaccgtttgggtagaaggttta agtgaagatggttttacctattactataatacagaaacaggagaatccagatgggagaaa cctgatgatttcattccacacactagtgatctgccttctagtaaggtcaatgaaaattca cttggcaccctagatgaatccaaatcatcagattcgcatagtgattctgatggggaacag gaagcagaagaaggaggggtctctacagagacagaaaagccaaaaataaagtttaaggaa aaaaataaaaatagtgatggaggaagtgacccagaaacacagaaagaaaaaagtattcag aaacagaattcattaggttcaaatgaagaaaaatcgaaaactcttaagaaatcaaaccca tatggagaatggcaagaaattaaacaagaggttgagtctcatgaggaggtagatttggaa cttccaagcactgaaaatgagtatgtatcaacttcagaagctgatggtggcggagaaccc aaagtggtatttaaagaaaaaacagtcacttctcttggagttatggcagatggagtggcc ccagtcttcaaaaagagaagaactgaaaatggaaaatctagaaatttaaggcaacgaggt gatgatcaatag >gi568815585r:41030490_41232511|GENSCAN_predicted_peptide_3|55_aa MAGEASGNLQSLWKAKGKQAHLTWLEQEEEIEEEVLHTSKQADPMATHSLAQEQQ >gi568815585r:41030490_41232511|GENSCAN_predicted_CDS_3|168_bp atggctggggaagcctcgggaaatttacaatcattgtggaaggcaaaagggaagcaggca catcttacatggttagagcaggaggaagagatagaagaggaggtgctacatacttctaaa caagcagatcctatggcaactcactcactagcacaagaacagcaataa >gi568815585r:41030490_41232511|GENSCAN_predicted_peptide_4|1063_aa MRSLGQNPTEAELQDMINEVDADGNGTVDFPEFLTMMARKMKDTDSEEEIRDAFCVFDKD GNGYISATELHHVMTNLGENLTDDEVDEMIRGNGKKSGLIVLTTVDSDERGRQWQGFTGG LNSVNGLVLLSLRRRCYLSVSEGRLRRSQSRVLQRFSPSAPVAISTMQSREDAPRSRRLA SPRGGKRPKKIHKPTVSAFFTGPEELKDTAHSAALLAQLKSFYDARLLCDVTIEVVTPGS GPGTGRLFPCNRNVLAAACPYFKSMFTGGMYESQQASVTMHDVDAESFEVLVDYCYTGRV SLSEANVERLYAASDMLQLEYVREACASFLARRLDLTNCTAILKFADAFGHRKLRSQAQS YIAQNFKQLSHMGSIREETLADLTLAQLLAVLRLDSLDVESEQTVCHVAVQWLEAAPKER GPSAAEVFKCVRWMHFTEEDQDYLEGLLTKPIVKKYCLDVIEGALQMRYGDLLYKSLVPV PNSSSSSSSSNSLVSAAENPPQRLGMCAKEMVIFFGHPRDPFLCCDPYSGDLYKVPSPLT CLAHTRTVTTLAVCISPDHDIYLAAQPRTDLWVYKPAQNSWQQLADRLLCREGMDVAYLN GYIYILGGRDPITGVKLKEVECYNVKRNQWALVAPLPHSFLSFDLMVIRDYLYALNSKRM FCYDPSHNMWLKCVSLKRNDFQEACVFNEEIYCICDIPVMKVYNPVRAEWRQMNNIPLVS ETNNYRIIKHGQKLLLITSRTPQWKKNRVTVYEYDIRGDQWINIGTTLGLLQFDSNFFCL SARVYPSCLEPGQSFLTEEEEIPSESSTEWDLGGFSEPDSESGSSSSLSDDDFWDSIFLL LPFMVLGLGSNPAPRSEWVWGVERGQAVRADTPEPAGTGGGGWWGVGMCRVGLPLVPGSC LFLEPEAQTCSCGCCSCSYTWKGRSYLFPALPRVQGGSDPLLHFGRLQPCPGGWGFCLLG GVEQEAWICRCVFGSCSCTGSSHPNLEGAGLPLAPWSVQPQLHFPAAAGVMVAAPAITTV HMRSLTRLDLGLPSVTCGISDYAPIHLPFHYLFYTRDTVDVLY >gi568815585r:41030490_41232511|GENSCAN_predicted_CDS_4|3192_bp atgaggtctcttgggcagaatcccacagaagcagagttacaggacatgattaatgaagta gatgctgatggtaatggcacagttgacttccctgaatttctaacaatgatggcaagaaaa atgaaagacacagacagtgaagaagaaattagagatgcattctgtgtgtttgataaggat ggcaatggctatattagtgcaacagaacttcaccatgtgatgacaaaccttggagagaat ttaacagatgacgaggttgatgaaatgatcagagggaatgggaagaagagtggcttgatt gttttgactacggtggacagtgatgagagaggaagacagtggcagggtttcacaggaggg ttgaattcagtaaatgggctcgtgctgctgtctcttcggagacgctgctatcttagcgtc agcgagggaaggttgaggaggagccagagccgggtcctgcagcgtttctcgccatcagcg cccgtcgccatctccaccatgcagtcccgggaagacgccccgcgctctcgccgcctagcc agtccccgtggtgggaagcggcccaagaagattcacaaacccacagtttcggcctttttc acgggtccagaggaattaaaggacacggcccattctgcagccctgctggcacagctcaag tccttctacgatgcgcggctgctgtgtgatgtgaccatcgaggtggtgacgcctggcagc gggcctggcacgggtcgcctgttcccctgcaaccgcaatgtgctggccgcggcatgtccc tacttcaagagcatgttcacaggtggcatgtacgagagccagcaggccagcgtgaccatg cacgatgtggacgccgagtccttcgaggtgttggtcgactactgctacacgggtcgtgtg tctctcagtgaggccaacgtggagcgcctgtacgcggcctccgacatgctacagctggaa tatgtgcgggaagcctgtgcctccttcttagcccgacgtcttgacctgaccaactgcacc gccatcctcaagtttgcagatgcctttggccatcgcaagctgcgatcccaggcccagtcc tatatagctcagaacttcaagcaactcagccacatgggttcaattcgggaggagactcta gcagatctgaccctggcccagctgctggctgtcctgcgcttggatagtctggacgtggag agtgagcagacagtgtgccatgtggcagtgcagtggctggaggctgctcccaaagagcgg ggtcccagtgctgcagaagtcttcaagtgcgtgcgctggatgcacttcactgaagaagat caggactacttagaagggctgctgaccaagcccatcgtgaagaagtactgcctggacgtt attgaaggggccctgcagatgcgctatggtgacctgttgtacaagtctctggtgccagtg ccaaacagcagcagcagcagtagcagcagcaactctcttgtatctgcagcagaaaatcca ccccagagactgggtatgtgtgccaaggagatggtgatcttctttggacaccccagagat ccctttctctgctgtgatccatactcgggggacctttacaaagtgccgtcacctttgacc tgtctggctcacactaggactgtcaccactttagctgtctgtatctctcctgaccatgac atctatctagctgctcagcccaggacagacctctgggtgtataaaccagctcagaatagt tggcagcaacttgcagatcgcttgctgtgtcgtgagggcatggatgtggcatatctcaat ggctatatctacattttgggggggcgagaccctattactggagttaagttgaaggaagtg gaatgctacaatgttaagagaaaccagtgggcattggtggctccactgccccattctttt ttatcctttgacctaatggtaattcgagactatctctatgctctcaacagtaagcgcatg ttctgttatgatcctagccacaatatgtggctgaagtgcgtttctctgaagcgcaatgac tttcaggaagcctgcgtcttcaatgaggagatctattgtatctgtgatatcccagtcatg aaggtctacaacccagttagggcagaatggaggcaaatgaataatattcccttggtctca gagaccaacaactacagaattatcaagcatggccaaaaattgttgctcatcacctctcgc accccacagtggaaaaagaaccgggtgactgtgtatgaatatgatattaggggagaccaa tggattaatataggtaccacattaggcctcttgcagtttgattctaactttttttgcctc tctgctcgtgtttatccttcctgccttgaacctggtcagagtttcctcactgaagaagaa gaaataccaagtgagtctagcactgaatgggacttaggtggattcagtgagccagactct gagtcaggaagttcaagttctctttctgatgatgatttttgggactctatcttcctcctg ctgccattcatggtcctggggcttggttccaaccctgctccgagatcagagtgggtgtgg ggagtggagagaggccaggcagtgagagcagacacccctgagcctgcagggacaggtggt gggggttggtggggggtagggatgtgtcgggtggggctcccgcttgtccctggctcctgc ctgttcctggagccggaggcccagacctgcagctgcgggtgctgcagctgcagctacacc tggaagggcagatcctacttgttcccagctctcccaagagtacagggaggctcggatcca ttgctgcattttgggcggctgcagccctgcccaggagggtggggcttctgcctgctcggt ggagtagagcaggaggcctggatctgcaggtgcgttttcggcagctgcagctgcacaggg agctcccatcccaacttagaaggggccgggctcccccttgctccatggagtgtgcaaccc cagctgcacttccctgctgcagccggcgtgatggtagcagcccctgccatcaccaccgta cacatgaggtcattgacccgtctggatttaggtttaccatctgttacttgcggaatttca gactatgctccaatccacctaccttttcattatttgttttacacaagggatactgttgat gtcctctactag >gi568815585r:41030490_41232511|GENSCAN_predicted_peptide_5|684_aa MQSREDVPRSRRLASPRGGRRPKRISKPSVSAFFTGPEELKDTAHSAALLAQLKSFYDAR LLCDVTIEVVTPGSGPGTGRLFSCNRNVLAAACPYFKSMFTGGMYESQQASVTMHDVDAE SFEVLVDYCYTGRVSLSEANVQRLYAASDMLQLEYVREACASFLARRLDLTNCTAILKFA DAFDHHKLRSQAQSYIAHNFKQLSRMGSIREETLADLTLAQLLAVLRLDSLDIESERTVC HVAVQWLEAAAKERGPSAAEVFKCVRWMHFTEEDQDYLEGLLTKPIVKKYCLDVIEGALQ MRYGDLLYKSLVPVPNSSSSSSSSNSLVSAAENPPQRLGMCAKEMVIFFGHPRDPFLCYD PYSGDIYTMPSPLTSFAHTKTVTSSAVCVSPDHDIYLAAQPRKDLWVYKPAQNSWQQLAD RLLCREGMDVAYLNGYIYILGGRDPITGVKLKEVECYSVQRNQWALVAPVPHSFYSFELI VVQNYLYAVNSKRMLCYDPSHNMWLNCASLKRSDFQEACVFNDEIYCICDIPVMKVYNPA RGEWRRISNIPLDSETHNYQIVNHDQKLLLITSTTPQWKKNRVTVYEYDTREDQWINIGT MLGLLQFDSGFICLCARVYPSCLEPGQSFITEEDDARSESSTEWDLDGFSELDSESGSSS SFSDDEVWVQVAPQRNAQDQQGSL >gi568815585r:41030490_41232511|GENSCAN_predicted_CDS_5|2055_bp atgcagtcccgggaagacgtcccgcgctctcgccgcctcgccagtccccgtggtgggagg cggcccaagaggatttccaagccctcggtttcggcctttttcacgggtccagaggagtta aaggacacggcccattctgcagccctgctggcacagctcaagtccttctacgacgcgcgg ctgctgtgtgatgtgaccatcgaggtggtgacgcctggcagcgggcctggcacgggtcgc ctcttttcctgcaatcgcaacgtgctagcagctgcgtgtccctacttcaagagcatgttc acaggtggcatgtacgagagccagcaggccagcgtgaccatgcacgatgtggacgccgag tccttcgaggtgttggtcgactactgctacacgggtcgtgtgtctctcagtgaggccaat gtgcagcgcctgtacgcggcctccgacatgctacagctggaatatgtgcgggaagcctgt gcctccttcttagcccgacgtcttgacctgaccaactgcaccgccatcctcaagtttgca gacgccttcgaccatcacaagcttcgatctcaggcccagtcctacatagctcacaacttc aagcagctcagccgaatgggttcaattcgggaggagactctagcagatctaaccctggcc cagctgctggctgtcctacgcctggatagtctggacatagagagtgagcggactgtatgc catgtagctgtgcagtggctggaggctgctgccaaagagcggggtcccagtgctgcagaa gtcttcaagtgcgtgcgctggatgcacttcactgaagaagatcaggactacttagaaggg ctgctgaccaagcccatcgtgaagaagtactgcctggacgttattgaaggggccctgcag atgcgctatggtgacctgttgtacaagtctctggtgccagtgccaaacagcagcagcagc agtagcagcagcaactctcttgtatctgcagcagaaaatccaccccagagactgggtatg tgtgccaaggagatggtgatcttctttggacatcctagagatccctttctctgctatgac ccttactcgggggacatttacacaatgccatcccctttgaccagctttgctcacactaag actgtcacctcctcagctgtctgtgtgtccccagaccatgacatctatctagctgctcag cccaggaaagacctctgggtgtataaaccagctcagaatagttggcagcaacttgcagat cgcttgctgtgtcgtgagggcatggatgtggcatatctcaatggctacatctacattttg gggggacgagaccctattactggagttaagttgaaggaagtggaatgctacagtgttcag agaaaccagtgggcattggtggctcctgtccctcattccttctattcctttgaactcata gtggttcagaactatctttatgctgtcaacagtaagcgcatgctttgctatgatcctagc cacaatatgtggctgaactgtgcttctcttaaacgtagtgactttcaggaagcatgtgtc ttcaatgatgaaatctattgtatctgtgacatcccagtcatgaaggtctacaacccagct aggggagaatggaggcggattagtaatattcctttggattcagagacccacaactaccag attgtcaatcatgaccaaaagttgcttctcatcacttctacaaccccacaatggaaaaag aaccgagtgacagtgtatgagtatgatactagggaagatcagtggattaatataggtacc atgttaggccttttgcagtttgactctggctttatttgcctttgtgctcgtgtttatcct tcctgccttgaacctggtcagagttttattactgaggaagatgatgcacggagtgagtct agtactgaatgggacttagatggattcagtgagctggactctgagtcaggaagttcaagt tctttttcagatgatgaagtctgggtgcaagtagcacctcagcgaaatgcacaggatcag cagggttctttgtaa >gi568815585r:41030490_41232511|GENSCAN_predicted_peptide_6|161_aa MFAQAVRLYNKQVVKPAKLVSVSSGWLAPKPQDKILHTLPSPFHKQRGLSLWPLPPLTHG EFCQTTTSVHLRPKGSSVSCDKCCQAWNSPLRVVKSPLAQCRSRNAIQEPGPGDPKSPLV LYTMAKLIPEAGMSESHSRSTAYCLCIAVGYSGPMGSLVSR >gi568815585r:41030490_41232511|GENSCAN_predicted_CDS_6|486_bp atgtttgcacaggccgtaaggctctacaataagcaggtggtgaagccagccaagcttgtg tccgtctcttcaggatggctggcacccaaaccacaagacaaaatccttcacactcttccc tcccctttccacaagcagaggggcctctccctatggcctctgccaccactgacccatggc gaattctgtcagaccaccactagtgttcacttaaggcccaagggctcttcagtcagctgt gacaaatgttgccaggcctggaactcacccctcagggtagtgaaatcccctctggcacag tgcaggtccagaaatgctatccaagagcctggacctggggaccccaagagcccgttggtg ctctacaccatggccaagctgatacctgaggctggcatgtccgagtctcactcaaggtcc acagcatactgcctgtgcatcgctgttggttactcagggcccatgggctctttagtcagc aggtga >gi568815585r:41030490_41232511|GENSCAN_predicted_peptide_7|116_aa XLVVECQQERSQIKNKEIAFRVLRARLYQQIIEKDKRQQQSARKLQVGTRAQSERIRTYN FTQDRVSDHRIAYEVRDIKEFLCGGKGLDQLIQRLLQSADEEAIAELLDEHLKSAK >gi568815585r:41030490_41232511|GENSCAN_predicted_CDS_7|351_bp nggctagtagtagaatgccaacaagaaagatcacagataaaaaataaagaaatagccttt cgtgtgttgagagctagactctaccagcagattattgagaaagacaagcgtcagcaacaa agtgctagaaaactgcaggtgggaacaagagcccagtcagagcgaattcggacatataat ttcacccaggatagagtcagtgaccacaggatagcatatgaagttcgtgatattaaggaa tttttatgtggtgggaagggcctggatcagctaattcagagactgcttcaatcagcagat gaagaagccattgctgaacttttggatgaacaccttaaatcagcaaaataa