GENSCAN 1.0 Date run: 3-Nov-116 Time: 02:36:16 Sequence gi568815597r:219509630_219712062 : 202433 bp : 39.21% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 14888 15055 168 0 0 68 68 83 0.006 3.30 1.02 Term + 19646 19890 245 0 2 52 35 196 0.023 5.78 1.03 PlyA + 20872 20877 6 1.05 2.00 Prom + 23005 23044 40 -6.65 2.01 Init + 23062 23177 116 0 2 73 98 79 0.751 7.13 2.02 Intr + 32537 32576 40 2 1 95 97 14 0.029 0.31 2.03 Intr + 41185 41290 106 0 1 55 65 77 0.259 0.97 2.04 Term + 42462 42613 152 1 2 47 43 138 0.682 2.29 2.05 PlyA + 43806 43811 6 1.05 3.05 PlyA - 44209 44204 6 1.05 3.04 Term - 60931 60790 142 0 1 92 42 101 0.446 2.32 3.03 Intr - 65569 65527 43 2 1 97 97 56 0.089 3.78 3.02 Intr - 66573 66519 55 0 1 35 95 49 0.026 -2.07 3.01 Init - 75950 75741 210 2 0 44 84 121 0.072 6.13 3.00 Prom - 81808 81769 40 -2.85 4.05 PlyA - 81949 81944 6 1.05 4.04 Term - 82844 82761 84 2 0 84 48 133 0.797 5.57 4.03 Intr - 84048 83957 92 1 2 117 47 37 0.585 1.29 4.02 Intr - 90950 90881 70 2 1 93 89 15 0.209 -0.06 4.01 Init - 93739 93731 9 1 0 44 119 14 0.282 -0.16 4.00 Prom - 95438 95399 40 -7.55 5.02 PlyA - 95800 95795 6 1.05 5.01 Sngl - 102433 100016 2418 1 0 86 54 2267 0.977 215.26 5.00 Prom - 118012 117973 40 -5.75 6.04 PlyA - 119797 119792 6 1.05 6.03 Term - 131216 130761 456 2 0 87 45 205 0.017 10.24 6.02 Intr - 131528 131360 169 1 1 42 42 110 0.004 0.83 6.01 Init - 146347 146169 179 1 2 64 42 200 0.261 11.78 6.00 Prom - 147223 147184 40 -9.35 7.00 Prom + 149542 149581 40 -4.25 7.01 Init + 151288 151684 397 0 1 32 29 238 0.452 9.21 7.02 Intr + 156664 156812 149 2 2 87 62 37 0.206 0.03 7.03 Intr + 159576 159696 121 2 1 53 97 58 0.364 2.35 7.04 Intr + 162172 162291 120 2 0 52 9 154 0.211 3.45 7.05 Term + 170015 170190 176 0 2 48 41 162 0.266 4.44 7.06 PlyA + 170240 170245 6 1.05 8.03 PlyA - 170645 170640 6 1.05 8.02 Term - 173123 172980 144 2 0 66 49 133 0.726 4.13 8.01 Init - 183358 183299 60 1 0 82 116 13 0.406 4.80 8.00 Prom - 184504 184465 40 -6.55 9.00 Prom + 189792 189831 40 -6.55 9.01 Init + 196569 196644 76 2 1 89 54 78 0.439 5.80 9.02 Intr + 197532 197647 116 1 2 51 62 74 0.539 0.35 9.03 Intr + 200448 200570 123 2 0 62 95 57 0.526 3.66 9.04 Term + 200806 200970 165 0 0 107 38 74 0.516 1.23 9.05 PlyA + 202077 202082 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 11767 11617 151 1 1 82 93 134 0.929 13.55 S.002 Sngl + 19688 19861 174 1 0 74 42 194 0.915 8.04 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:219509630_219712062|GENSCAN_predicted_peptide_1|137_aa XCLYLQFSLNQAALILSLLKWSVLNLSATVQGFHLTWDFGFNCPNIRNRTFTASVTSPTS TTTTLRIKFQHAFWWGPTISKPQQLVTCRVPVSNNQSHECQDAVRAAAAAAGTTAAQQQP ADALATTMNSPVLGPRP >gi568815597r:219509630_219712062|GENSCAN_predicted_CDS_1|414_bp nattgcttatacctacaattctccctgaaccaagctgctctgattctatccctcttaaag tggtcagtcctcaatctctctgcaactgttcaaggtttccatcttacatgggactttggc ttcaactgccccaacattcgcaacagaacatttactgcctctgtaactagccccacctcc accaccaccacattaagaatcaaatttcaacatgcattttggtggggaccaaccatatcc aaaccacagcagctggtcacttgcagggttccagtgtcaaataaccagtcccatgaatgt caggatgcagttagagcagcagcagcagcagcagggaccacagcagcacagcagcagcct gctgatgccttggccacaaccatgaatagtcctgtgctgggacctcgaccctaa >gi568815597r:219509630_219712062|GENSCAN_predicted_peptide_2|137_aa MEESSTALRWEEQRDSTEASGGQKQGRRETTRDEIKHIRWLEEIENLRPKHLEHKLETPC QLLLVNGSKLLLLQSNEDPALLYLGTISLASLKGDNECISEEEIISEQSGKWRNADQGPE NPLAYIGDEESLIILGS >gi568815597r:219509630_219712062|GENSCAN_predicted_CDS_2|414_bp atggaggagagttccacagccctaaggtgggaagagcagagggacagcacagaggccagt ggaggtcagaaacagggaagaagggagacgacaagagatgagatcaaacacataaggtgg cttgaagaaatagagaatttaaggccaaaacatttggagcataagctggaaacgccctgc cagctgctgcttgtcaatggatctaaacttctacttttacagtctaatgaagacccagcc cttctctatctggggacaatctctctggcatcacttaaaggagacaatgaatgtatttct gaggaagagatcatctcagaacagtcaggcaaatggagaaatgctgaccaaggaccagag aacccacttgcttacattggagatgaagaaagtcttattattctggggagctaa >gi568815597r:219509630_219712062|GENSCAN_predicted_peptide_3|149_aa MKKQGNTTPSKEHMNSPVTDPKAKEIYEMPEKEFKIMFLRKLNKIQENRDSQFNEIRKTI HDLNEKFHKEGCLFFVRYQYENSRSSERSLQRFSGSIQGETQRCRTKSHPKKKLFGFGSL ASGCDSIVLLRSPTLTTVLRFGGGGGGEY >gi568815597r:219509630_219712062|GENSCAN_predicted_CDS_3|450_bp atgaaaaaacaaggaaacacaacaccttcaaaggaacatatgaattctccagtgacagat cccaaagcaaaggaaatctatgaaatgccagaaaaggaatttaaaataatgttcttaagg aaactgaataagatacaagagaacagagatagtcaatttaatgaaatcagaaaaacaatt catgatttgaatgagaaattccacaaagagggatgccttttctttgtccgttatcaatat gaaaacagcagatccagtgagagatcgctccagcgcttctccgggagcatccaaggagag acacaaaggtgccgaacaaagtcccatccaaaaaaaaaattatttggctttggttccttg gcaagtggctgtgactcaattgttctccttcgatctcctacccttactactgtacttcgt tttggagggggtggagggggagagtattaa >gi568815597r:219509630_219712062|GENSCAN_predicted_peptide_4|84_aa MFLKPNISPLSAFPLAVPSAWIALPSAVGPKGYPRNILNAKHCLRSAVGQLNPWQCSDLI IQTYPQREEPDEMRKARFTFKRKK >gi568815597r:219509630_219712062|GENSCAN_predicted_CDS_4|255_bp atgtttctgaagccaaacatttccccacttagtgcctttccattggctgttccatctgcc tggattgctcttccttcagctgttggtcccaagggataccctagaaatatactaaacgct aaacattgtctcaggtctgctgttggacaactcaacccatggcagtgctctgatctaatt atccagacgtatccacagagagaggaaccagatgaaatgcggaaagcacggttcaccttc aagagaaagaagtag >gi568815597r:219509630_219712062|GENSCAN_predicted_peptide_5|805_aa MPNQGEDCYFFFYSTCTKGDSCPFRHCEAALGNETVCTLWQEGRCFRRVCRFRHMEIDKK RSEIPCYWENQPTGCQKLNCVFHHNRGRYVDGLFLPPSKSVLPTVPESPEEEVKASQLSV QQNKLSVQSNTSPQLRSVMKVESSENVPSPKHPPVVINAADDDEDDDDQFSEEGDETKTP TLQPTPEVHNGLRVTSVRKPAVNIKQGECLHFGIKTLEEIKSKKMKEKSEEQGEGSSGVS SLLLHPEPVPGPEKENVRTVVRTVTLSTKQGEEPLVRLGLTETLGKRKFSTGGDSDPPLK RSLAQRLGKKVEAPETNTDETPKKAQVSKSLKERLGMSADPNNEDATDKVNKVGEIHVKT LEEMLLERASQKHGESQTKLKTEGPSKTDDSTSGARSSSTIRIKTFSEVLAEEEHRQQEA ERQKSKKDTTCIKLKTDSEIKKTVVLPPIVASKGQSEEPAGKTKSMQEVHMKTVEEIKLE KALRVQQSSESSTSSPSQHEATPGARLLLRITKRTWRKEEKKLQEGNEVDFLSRVRMEAT EASVETTGVDITKIQVKRCEIMRETRMQKQQEREKSVLTPLQGDVASCNTQVAEKPVLTA VPGITWHLTKQLPTKSSQKVEVETSGIADSLLNVKWSAQTLEKRGEAKPTVNVKQSVVKV VSSPKLAPKRKAVEMHPAVTAAVKPLSSSSVLQEPPAKKAAVDAVVLLDSEDKSVTVPEA ENPRDSLVLPPTQSSSDSSPPEVSGPSSSQMSMKTRRLSSASTGKPPLSVEDDFEKLTWE ISGGKLEAEIDLDPGKDEDDLPLEL >gi568815597r:219509630_219712062|GENSCAN_predicted_CDS_5|2418_bp atgcctaatcaaggagaagactgctatttttttttctattctacatgtaccaaaggtgac agctgcccattccgtcactgtgaagctgcactaggcaatgaaactgtttgcacattatgg caagaagggcgctgttttcgacgggtgtgcaggtttcggcacatggagattgataaaaaa cgcagtgaaattccttgttattgggaaaatcagccaacaggatgtcaaaaattaaactgc gttttccatcacaatagaggacgatatgttgatggccttttcctacctccgagcaaaagt gtgttgcccactgtgcctgagtcaccagaagaggaagtgaaggctagccaactttcagtt cagcagaacaaattgtctgtccagtccaatacttcccctcagctgcggagcgttatgaaa gtagaaagttccgaaaatgttcctagccccaagcatccaccagttgtaattaatgctgca gatgatgatgaagatgatgatgatcagttttctgaggaaggtgatgaaaccaaaacacct accctgcaaccaactcctgaagttcacaatggattacgagtgacttctgtccggaaacct gcagtcaatataaagcaaggtgaatgtttgcattttggaataaaaactcttgaggaaatt aagtcaaagaaaatgaaggaaaaatctgaggagcaaggtgagggttcttcaggagtttcc agtcttttactccaccctgagcctgttccaggtcctgaaaaagaaaatgtcaggactgtg gtgaggacagtaactctctccaccaaacaaggagaagaacccttggttagattgggcctt actgagacactggggaaacgaaaattttcgacaggcggtgacagtgatcctccattaaag cgtagcctggcacagaggctagggaagaaagttgaagctccagaaactaacactgacgaa acaccaaagaaagctcaagtttccaagtctcttaaggagcgattaggcatgtcagctgat ccaaataatgaggacgcaacagataaagttaataaagttggtgagatccatgtgaagaca ttagaagaaatgcttcttgaaagagccagtcagaaacatggggaatcgcaaactaaactc aagacagaaggaccttcaaaaactgatgattctacttcaggagcaagaagctcctccact atccgtatcaaaaccttctctgaggtcctggctgaagaagaacataggcagcaggaagca gagagacaaaaaagcaaaaaggatacaacttgcatcaagctaaagactgatagtgaaatt aaaaaaacagtagttttgccacccattgttgccagcaaaggacaatcagaggagcctgca ggtaaaacaaagtccatgcaggaggtgcacatgaagacggtggaagaaattaaactggag aaggcactgagggtgcagcagagctctgagagcagcaccagctccccgtctcaacatgag gccactccaggggcaaggttgctgctgcgaatcaccaaaagaacatggaggaaagaagag aagaaacttcaggaaggaaatgaagttgattttctgagccgtgttagaatggaagctaca gaggcttcagttgagaccacaggagttgacatcactaaaattcaagtcaagagatgtgag atcatgagagagacgcgcatgcagaaacagcaggagagggaaaaatcagtcttgacacct cttcagggagatgtagcctcttgcaatacccaagtggcagagaaaccagtgctcactgct gtgccaggaatcacatggcacctgaccaagcagcttcccacaaagtcatcccagaaggtg gaggtagaaacctcagggattgcagactcattattgaatgtgaaatggtcagcacagacc ttggaaaaaaggggtgaagctaaacccacagtgaacgtgaagcaatctgtggttaaagtt gtgtcatcccccaaattggccccaaaacgtaaggcagtggagatgcaccctgctgtcact gccgctgtgaagccactcagctccagcagcgtcctacaggaacccccagccaaaaaggca gctgtggatgctgttgtcctgcttgactctgaggacaaatcagtcactgtgcctgaagca gaaaatcctagagacagtcttgtgctgcctccaacccagtcctcttcagattcctcaccc ccggaggtgtctggcccttcctcatcccaaatgagcatgaaaactcgccgactcagctct gcctcaacaggaaagcccccactctctgtggaggatgattttgagaaactaacatgggag atttcaggaggcaaattggaagctgagattgacctggatcctgggaaagatgaagatgac cttccgcttgagctatga >gi568815597r:219509630_219712062|GENSCAN_predicted_peptide_6|267_aa MALEYSTLTLVGMSKIQEITTTATTTTTATKQDLSMGQMDSEKTCSEVTEEFPIQECGPQ TKASTALGLAEGPQLPLPGYYLCSFKAQGLYNQQVVNPARLASFPSGQRVPAIPAQMSPC GHHWLRPTASTTRLLPMFTQGLFSQFVVNAVKHGSLPSGMWAPFWPRKDPEMLSSSQSLK LGMPGIHLVLYPTVAQLVPKLQDKVSFAFHSCLLKHKEFLPIATIARNVLGHNRRQQSFD FSLRACSEYCLATTADYSEPKGSLVSR >gi568815597r:219509630_219712062|GENSCAN_predicted_CDS_6|804_bp atggcattggagtactccacactcacactggttggaatgtcaaagatacaggaaataaca acaacagccacaacaacaacaacagcaacaaagcaggatctttccatgggccaaatggac tctgaaaaaacttgtagtgaagtcactgaggagttccccattcaagaatgtgggccacag accaaagctagcacagcactgggtcttgccgaaggcccacagttaccactgcctggatac tacctatgttcattcaaggcccaagggctctacaatcagcaagtggtgaatccagccagg cttgcatccttcccttcagggcagcgagttcctgcaatcccagcacagatgagtccctgc ggccaccactggctcaggccgacagcgagtaccaccaggctactgccaatgttcacccaa gggctcttcagtcagtttgtggtgaatgctgtcaagcatgggtctctcccttcagggatg tgggcccccttctggcccaggaaagatccagaaatgctgtccagtagccaaagcctgaaa ctggggatgccaggaatccacttggtgctctaccccactgtggctcagctggtacctaag ctgcaagacaaagtgtcctttgcttttcactcttgtttgctcaagcataaggagtttctc cccatagccaccatagctaggaatgtgctgggtcataaccgaagacagcagagctttgat ttttcacttagggcctgcagtgagtactgcctggctaccactgctgattactcagagccc aagggctctttagtcagcaggtga >gi568815597r:219509630_219712062|GENSCAN_predicted_peptide_7|320_aa MALANGTKSRRAKLRECEVGDVAGEEKVILGQDSCKMTDNENENMGSSLESPEFFSSSSA AYIWLSGALCFLADDPFIFPVLHLKEKERSGEVGSNYSSQPVLERPLPCTWELFVLTLQA LTPGCCFPLSTEGKPVSQSQAHFYKCFMWEGTKPISQKADSEMIGSVSFLPPADLLLHGD FPHTVPERAQVTFVLYGATTASSSTLSKLLSITNLVSFIYKEALSEAADHPADKDSEEDT KENTEYGLPLIRVHKRPKELNKERNLDTQTDTKREEGRLKTGAATGVMQLQAQEHHELPA TTRRWEEIRKNSSLEPLEGA >gi568815597r:219509630_219712062|GENSCAN_predicted_CDS_7|963_bp atggctctggccaacgggaccaaatctcggagggccaaattgagggagtgtgaagtgggg gatgtggctggagaagagaaggtgatactaggccaggacagctgcaaaatgactgacaac gagaatgagaatatgggttcctctcttgagtccccagaatttttttcaagttcttcagct gcatacatttggctctctggcgccctttgttttctggctgatgatccttttatttttcct gtcctacatttaaaagagaaggaaagaagcggggaagtcggttccaactactcctcgcag ccagtcctcgagcgccctctgccgtgtacatgggagctctttgtcttaaccttgcaagcg cttactcctgggtgttgctttccgctatcaacagaagggaagcccgtttctcagagccaa gcacatttctataaatgcttcatgtgggaaggaacaaaaccaataagtcagaaagctgac tcagaaatgattggaagtgtttcctttcttcccccagcagatttgttgctccatggtgat tttccgcatactgtgccggaaagagcacaggtcacctttgttctttatggcgctaccact gccagtagctcgaccttgagcaaattactcagcatcacaaatcttgtctccttcatctat aaagaagcactgagtgaggcagctgaccacccagcagacaaggactcagaggaagatacc aaggagaatacagagtatggcttacccctgatcagagtgcacaagagacccaaggagctc aacaaagagagaaatctggacacacagacagacacaaagagggaagaaggccgtttgaag acaggggcagccactggagtgatgcagctacaagcccaggaacaccatgagttgccagca accaccagaaggtgggaagagattaggaagaattcttccctagagcctttggaaggagca tag >gi568815597r:219509630_219712062|GENSCAN_predicted_peptide_8|67_aa MQPNILTETRVSPGKASTLKITKGGDLDEISYINAREGTMKLCMGRKGVFSSFPKLLEDL SDPMDSQ >gi568815597r:219509630_219712062|GENSCAN_predicted_CDS_8|204_bp atgcagccaaacatactgacagagaccagggtatcccctgggaaagccagtaccctgaag atcactaaaggtggggatcttgatgaaattagctatatcaatgcaagagaaggtactatg aagctgtgcatgggaagaaaaggtgtcttttcatccttcccaaagcttttggaggatctc agtgatccaatggattcacaatga >gi568815597r:219509630_219712062|GENSCAN_predicted_peptide_9|159_aa MAIIKKSRNNRCREVAEKKECFYTVETASAIPAFCNHHPDQSAAINIKARPSISKKDYNS LKVQISWTTSLRISFPAWGEARIDNKRNVHKFWKAERSSSHCTLKPILSIHSNFKPTPGA ETTASHRILYQIPQLWRIRIILPLFQIIHCGSASLIEPN >gi568815597r:219509630_219712062|GENSCAN_predicted_CDS_9|480_bp atggcgattattaaaaagtcaagaaacaacagatgccgtgaggttgcagagaagaaggaa tgcttttacacggttgaaactgcctcagccatcccagccttctgcaaccaccaccctgat cagtcagcagccatcaacattaaggcaagaccctccatcagcaaaaaagattacaactcg ctaaaggttcagataagctggactacatctctcagaatttccttccctgcatggggcgag gctaggattgacaacaagagaaatgttcataagttttggaaagcagaacgaagtagcagc cattgcacactgaagcccatcctatccatccacagcaacttcaagcccacaccaggtgca gagaccacagcatcccatagaattctttatcagatcccacagttatggaggatccgtatc atactccccttatttcagatcattcactgtggctctgcttctctgatcgagccaaactaa