GENSCAN 1.0 Date run: 6-Nov-116 Time: 04:30:25

Sequence gi568815596f:100370489_100582312 : 211824 bp : 43.46% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  10717  10805   89  2  2   82   97    51 0.125   5.09
 1.02 Intr +  14886  14961   76  2  1   26  116    33 0.029  -1.01
 1.03 Term +  19680  19699   20  1  2  102   50    27 0.050  -1.22
 1.04 PlyA +  20463  20468    6                               1.05

 2.10 PlyA -  21396  21391    6                               1.05
 2.09 Term -  23294  22757  538  1  1  113   38   557 0.817  46.92
 2.08 Intr -  25126  25021  106  2  1  104   81   196 0.924  19.77
 2.07 Intr -  27654  27420  235  0  1   94   66   245 0.743  20.16
 2.06 Intr -  32167  32076   92  2  2   58   86   107 0.362   7.21
 2.05 Intr -  36219  36088  132  1  0   75   88    70 0.613   6.32
 2.04 Intr -  37404  37248  157  0  1   51   57    67 0.060  -0.52
 2.03 Intr -  44623  44553   71  2  2   90  102    30 0.307   3.50
 2.02 Intr -  47110  47016   95  0  2   21   81    94 0.380   1.61
 2.01 Init -  50724  50711   14  0  2   79  115     2 0.387   1.79
 2.00 Prom -  50879  50840   40                              -7.76

 3.00 Prom +  53527  53566   40                              -3.06
 3.01 Init +  54200  54494  295  1  1   70  -36   227 0.189   5.86
 3.02 Term +  54875  55029  155  0  2   33   50   215 0.294  10.28
 3.03 PlyA +  55217  55222    6                              -5.41

 4.00 Prom +  55231  55270   40                              -7.66
 4.01 Sngl +  55628  56107  480  1  0   59   42   713 0.851  59.89
 4.02 PlyA +  57120  57125    6                               1.05

 5.00 Prom +  57567  57606   40                              -1.96
 5.01 Init +  60659  60731   73  1  1   95   91     1 0.432   2.33
 5.02 Intr +  69561  69706  146  1  2   74   53   134 0.204   8.50
 5.03 Term +  79389  79484   96  2  0  123   46    63 0.686   3.57
 5.04 PlyA +  81855  81860    6                               1.05

 6.00 Prom +  82019  82058   40                              -1.76
 6.01 Init + 100001 100076   76  1  1   63  110    64 0.780   7.36
 6.02 Intr + 102307 102362   56  2  2   79   65    49 0.648   0.40
 6.03 Intr + 106756 106779   24  0  0  111   96    -6 0.599   0.72
 6.04 Intr + 108865 108939   75  0  0  123   53    67 0.982   6.41
 6.05 Intr + 110008 110097   90  0  0  136   35    93 0.782   8.99
 6.06 Intr + 111789 111823   35  2  2  110   50    45 0.351  -0.08
 6.07 Term + 134223 134289   67  0  1  -14   43   212 0.941   3.91
 6.08 PlyA + 134660 134665    6                               1.05

 7.00 Prom + 135831 135870   40                              -5.56
 7.01 Sngl + 138789 139127  339  2  0   55   48   243 0.996  13.03
 7.02 PlyA + 139797 139802    6                               1.05

 8.00 Prom + 142614 142653   40                              -3.06
 8.01 Init + 153799 153918  120  0  0  102   81    58 0.080   6.74
 8.02 Intr + 159297 159437  141  2  0   38  110    40 0.023   1.75
 8.03 Intr + 174747 174793   47  2  2  101  110   -27 0.000  -1.99
 8.04 Intr + 174975 175039   65  0  2   50   91    83 0.000   3.16
 8.05 Intr + 186912 186970   59  1  2  106   94   -13 0.015  -0.30
 8.06 Intr + 196015 196141  127  0  1   94   92   183 0.997  19.55
 8.07 Intr + 198443 198533   91  0  1   16  115   140 0.981   8.55
 8.08 Intr + 199090 199233  144  1  0   89   84   101 0.992   9.20
 8.09 Intr + 201102 201310  209  0  2   86   91   158 0.972  14.62
 8.10 Term + 205866 206008  143  1  2   48   54   161 0.988   6.79
 8.11 PlyA + 206343 206348    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr - 176981 176896   86  0  2   95   93    23 0.928   3.04
S.002 Init - 178055 177980   76  2  1   61   87    58 0.930   4.25

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596f:100370489_100582312|GENSCAN_predicted_peptide_1|61_aa
XKNLTTYVLGLISEQRCFVTVMIVASESTLCYSLNACAPQNSYVETLILSAMVVEGTNYL
L

>gi568815596f:100370489_100582312|GENSCAN_predicted_CDS_1|186_bp
nagaaaaatctgaccacgtatgtcctaggcctaatctccgaacagcggtgttttgtaact
gtcatgattgtagcttcggagagtaccttgtgttatagcctgaatgcttgtgcgccccag
aattcctatgttgaaactctaatcctcagtgcaatggtagttgaaggcaccaactacttg
ctgtag

>gi568815596f:100370489_100582312|GENSCAN_predicted_peptide_2|479_aa
MHGASRSSAPGVFLTSAAAAATSRDPTGRGAEASGQVTRGTKNLSSGETQQGNVSPRLQK
ATLLVTPGNTRSLIHGHLRVVCEPSGENPGSLAGGELPTAGCIQVWPDNYVVGRVSEWLR
CDNMHHQWLLLAACFWVIFMFMVASKFITLTFKDPDVYSAKQEFLFLTTMPEVRKLPEEK
HIPEELKPTGKELPDSQLVQPLVYMERLELIRNVCRDDALKNLSHTPVSKFVLDRIFVCD
KHKILFCQTPKVGNTQWKKVLIVLNGAFSSIEEIPENVVHDHEKNGLPRLSSFSDAEIQK
RLKTYFKFFIVRDPFERLISAFKDKFVHNPRFEPWYRHEIAPGIIRKYRRNRTETRGIQF
EDFVRYLGDPNHRWLDLQFGDHIIHWVTYVELCAPCEIMYSVIGHHETLEDDAPYILKEA
GIDHLVSYPTIPPGITVYNRTKVEHYFLGISKRDIRRLYARFEGDFKLFGYQKPDFLLN

>gi568815596f:100370489_100582312|GENSCAN_predicted_CDS_2|1440_bp
atgcatggggcaagccggtctagcgcgccgggcgtcttccttacttccgctgccgccgcc
gccacatcccgggacccgacgggccgcggcgcggaggcctcggggcaagtgacaagagga
accaagaacctcagttcaggggaaacacagcaaggaaatgtgagccccaggctgcagaag
gcaaccctattggtgaccccaggaaacacacgaagccttatacacgggcacctgagggtg
gtttgcgagccatccggagaaaacccaggcagcttggcaggtggtgagctccccactgca
ggctgtattcaggtgtggccggataactacgtggtgggaagagtcagtgaatggctgcgg
tgtgacaacatgcaccaccagtggcttctgctggccgcatgcttttgggtgattttcatg
ttcatggtggctagcaagttcatcacgttgacctttaaagacccagatgtgtacagtgcc
aaacaggagtttctgttcctgacaaccatgccggaagtgaggaagttgccagaagagaag
cacattcctgaggaactgaagccaactgggaaggagcttccagacagccagctcgttcag
cccctggtctacatggagcgcctggaactcatcagaaacgtctgcagggatgatgccctg
aagaatctctcgcacactcctgtctccaagtttgtcctggaccgaatatttgtctgtgac
aagcacaagattcttttctgccagactcccaaagtgggcaacacccagtggaagaaagtg
ctgattgttctaaatggagcattttcttccattgaggagatccccgaaaacgtggtgcac
gaccacgagaagaacggccttcctcggctctcttccttcagtgatgcagaaattcagaag
cgattgaaaacatacttcaagttttttattgtaagagatcccttcgaaagacttatttct
gcatttaaggataaatttgttcacaatccccggtttgagccttggtacaggcatgagatt
gctcctggcatcatcagaaaatacaggaggaaccggacagagacccgggggatccagttt
gaagatttcgtgcgctacctcggcgatccgaaccacagatggctagaccttcagtttggg
gaccacatcattcactgggtgacgtatgtagagctctgtgctccctgtgagataatgtac
agtgtgattggacaccacgagaccctggaggacgatgccccatacatcttaaaagaggct
ggcattgaccacctggtgtcatacccgactatccctccgggcattaccgtgtataacaga
accaaggtggagcactatttcctgggcatcagcaaacgagacatccgacgcctgtatgcc
cgtttcgaaggggactttaagctctttgggtaccagaaaccagactttttgctaaactaa

>gi568815596f:100370489_100582312|GENSCAN_predicted_peptide_3|149_aa
MTKCFLPPTSSPSEHHRVEHGSGLTQTPSSEEISPTKFPGLYHTGEPSPPHDILHKPPDI
VSDDEKDHGKKKGNFKKKEKRTEGYAAFQEDSSGDEAEKKYGMKCEGIYRVSGIKSKMDE
LKAAYDQEKSTNLEEYDPNTVASLRKQYL

>gi568815596f:100370489_100582312|GENSCAN_predicted_CDS_3|450_bp
atgacaaagtgcttcctgccccccaccagcagccccagtgaacaccacagggtggagcat
ggcagtgggcttacccagacccccagctctgaagagatcagccctactaaatttcctgga
ttgtaccacactggtgagccctcacctccccatgacatcctccacaagcctcctgatata
gtgtctgatgatgagaaagaccatgggaagaaaaaaggaaattttaagaaaaaggaaaag
aggaccgaaggctatgcagcctttcaggaagatagctctggagatgaggcagaaaagaag
tatggcatgaagtgtgaaggcatctacagagtatcaggaattaaatcaaagatggatgag
ctaaaagcagcctatgaccaggagaagtctacaaacttggaagaatatgaccctaacact
gtagccagtttgcggaagcagtatttgtga

>gi568815596f:100370489_100582312|GENSCAN_predicted_peptide_4|159_aa
MNENEEVINILLVQEKEILTKQEELLGMEQFLRRQIASEKEEIECLRAETAEIQSHQQQG
LSETEEYSSESESEDEEELQIILEDLQRQNEELEIKNNHLNQAIHEESEAVIEPRMQPWL
LQLQPDRAKQQAQEDEEPEWRGGAIQTPKNGILEPVAAK

>gi568815596f:100370489_100582312|GENSCAN_predicted_CDS_4|480_bp
atgaatgaaaatgaagaagttataaatattctccttgttcaggagaaagagatcctgact
aaacaggaggagctcctgggcatggagcagtttctgcgccggcagattgcctcagaaaaa
gaagagattgagtgcctcagagccgagactgcggaaattcagagtcaccagcagcagggc
ctaagcgagactgaggagtactcctccgagagcgagagtgaggatgaggaggagctgcag
atcattctggaagatttacagagacagaacgaagagctggaaataaagaacaaccatttg
aatcaagcaattcacgaggagagtgaggccgtcatcgagccgcgcatgcagccctggctg
ctccagttgcagccagacagggccaagcagcaggcgcaggaggacgaggagcccgagtgg
cgcggaggtgccatccagacgcccaagaatggcatcctcgagccagtagcagctaaatag

>gi568815596f:100370489_100582312|GENSCAN_predicted_peptide_5|104_aa
MVLASASGQNLRKLLFMVEGDGGADLGDACDQRERLVPFATELNMYLAQEAENPEEKFSR
TGRAVDQLLPHTEAGIQPINPRFPSSPQFPVSVKTTTLFSALQA

>gi568815596f:100370489_100582312|GENSCAN_predicted_CDS_5|315_bp
atggtgctagcatctgcttctggtcagaacctcaggaaacttttattcatggtagaaggt
gatgggggagcagacttgggtgatgcgtgtgaccagagggaaaggctggtgccctttgcc
acagagctaaacatgtacctggcccaggaggcagagaatcccgaggagaaattcagcagg
actggaagagctgtggatcagctgctgccccacaccgaggcaggaatccaacctatcaac
ccacggtttccctccagccctcagttcccagtctccgtcaagaccaccaccctcttctct
gccttgcaggcctga

>gi568815596f:100370489_100582312|GENSCAN_predicted_peptide_6|140_aa
MKHLRPQFPLILAIYCFCMLQIPSSGFPQPLADPSDGLDIVQLEDNQDIYKRFPPVHPLM
HLAAKLANRRMKRILQRGSGTAAVDFTKKVHKSLGDCDPQRKPYPRRPRNGRNIEDEAHK
NDDDDDDDDDDDDISKESYV

>gi568815596f:100370489_100582312|GENSCAN_predicted_CDS_6|423_bp
atgaaacatcttcgtccccagttccctctcatcttggccatctactgcttctgcatgcta
cagattccctcctcaggatttcctcaacctttagctgatccttcagatggcttggatatt
gtgcagcttgaggataatcaagacatatacaaaaggtttcctccagtgcatcctctaatg
cacctggctgccaagctcgccaacaggcggatgaagagaattctgcagcgaggctcgggg
actgctgcagtggacttcaccaagaaggtacacaagagccttggagactgcgacccccaa
agaaagccctacccgagaaggcccaggaatggaagaaacattgaagatgaggcccataag
aatgatgatgatgatgatgatgatgatgatgatgatgatatcagcaaggaatcctatgtc
tag

>gi568815596f:100370489_100582312|GENSCAN_predicted_peptide_7|112_aa
MSVEVSEAASVEEQKEMEDTVTSPEKVEAAKLKARYPHLGQMPGGSDFLRKRLQKGQKYF
DSGDYNMAKAKMKNEQLPTAALDKMEVTGDHIPTPEDLPQQKPSIVASKLAG

>gi568815596f:100370489_100582312|GENSCAN_predicted_CDS_7|339_bp
atgtctgtggaagtctccgaggcagcctctgtggaggagcagaaggaaatggaagataca
gtgactagtccagagaaagttgaagcagcaaaattaaaagcaagatatcctcatctggga
caaatgcctggaggttcagatttcttaaggaaacgattgcagaaagggcaaaaatatttt
gattctggggattacaacatggctaaagcaaaaatgaagaacgagcaacttcctactgca
gctctggataagatggaggtcacgggtgaccacattcccactccagaggaccttcctcaa
cagaaaccatccattgttgctagcaagctggctggttga

>gi568815596f:100370489_100582312|GENSCAN_predicted_peptide_8|381_aa
MAKGSQNLSLLILAVINVVLGSLEKLVDKRPMLVIKQLNMHLFQDAASGGLCNTPGIILH
PAVSSTFNQHLTWTMPGSRDLGLQGDQNMKNPYLPLRDAVRIRSWKTSNLKIISLMAHMA
ISNAGSKCVQTYQEKLTLLQTLPEDPNADTEWNDILRKKGILPPKESLKELEEEAEEEQR
ILQQSVVKTYEDMTLEELEDHEDEFNEEDERAIEMYRRRRLAEWKATKLKNKFGEVLEIS
GKDYVQEVTKAGEGLWVILHLYKQGIPLCALINQHLSGLARKFPDVKFIKAISTTCIPNY
PDRNLPTIFVYLEGDIKAQFIGPLVFGGMNLTRDELEWKLSESGAIMTDLEENPKKPIED
VLLSSVRRSVLMKRDSDSEGD

>gi568815596f:100370489_100582312|GENSCAN_predicted_CDS_8|1146_bp
atggccaagggcagtcagaatctatccctgctcattcttgctgtgataaatgtggtcctt
gggagtctagagaagttggttgacaagagaccgatgctggtgataaaacagttgaatatg
cacctattccaagatgctgcctcaggaggcttatgcaacacaccaggcatcattttgcac
cctgccgtatcctctacgttcaaccagcacctcacatggacgatgccagggtcccgggat
cttggtcttcagggggaccagaatatgaaaaatccctacctacctcttagagatgcagtg
agaatcagaagttggaagacttcaaacctgaagattatctctctcatggcccacatggcc
atctccaatgcagggagcaaatgtgtgcaaacatatcaagaaaagttgacattgttacag
acactgccagaggaccccaacgcagacactgaatggaatgacatcttacgcaaaaagggt
atcttaccccccaaggaaagtctgaaagaattggaagaggaggcagaagaggagcagcgc
atcctccagcagtcagtggtgaaaacatatgaagatatgactttggaagagctggaggat
catgaagacgagtttaatgaggaggatgaacgtgctattgaaatgtacagacggcggaga
ctggctgagtggaaagcaactaaactgaagaataaattcggagaagttttggagatctca
gggaaggattatgttcaagaagttaccaaagctggcgagggcttgtgggtcatcttgcac
ctttacaaacaaggaattcccctctgtgccctgataaatcagcacctcagtggacttgcc
aggaagtttcctgatgtcaaatttatcaaagccatttcaacaacctgcatacccaattat
cctgataggaatctgcccacgatatttgtttacctggaaggagatatcaaggctcagttt
attggtcctctggtgtttggcggcatgaacctgacaagagatgagttggaatggaaactg
tctgaatctggagcaattatgacagacctggaggaaaaccctaagaagccgattgaagac
gtgttgctgtcctcagtgcggcgctctgtcctcatgaagagggacagcgattccgagggt
gactga