GENSCAN 1.0 Date run: 5-Nov-116 Time: 18:15:22 Sequence gi568815579f:9728338_9930005 : 201668 bp : 49.09% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 PlyA - 33 28 6 -3.64 1.01 Sngl - 1262 642 621 2 0 60 34 257 0.778 13.70 1.00 Prom - 4660 4621 40 -4.96 2.00 Prom + 7148 7187 40 -8.36 2.01 Init + 7789 7831 43 0 1 74 100 40 0.511 4.38 2.02 Intr + 15149 15372 224 0 2 96 88 55 0.660 4.15 2.03 Term + 15525 15965 441 2 0 34 44 212 0.768 6.66 2.04 PlyA + 18117 18122 6 1.05 3.11 PlyA - 20194 20189 6 1.05 3.10 Term - 30427 29138 1290 1 0 74 45 752 0.998 61.10 3.09 Intr - 31605 31523 83 1 2 76 91 55 0.994 3.96 3.08 Intr - 33831 33745 87 1 0 93 96 8 0.704 2.04 3.07 Intr - 35071 34945 127 1 1 -1 66 136 0.869 2.75 3.06 Intr - 40030 39952 79 0 1 84 105 15 0.789 2.35 3.05 Intr - 40651 40398 254 1 2 36 -14 228 0.403 3.63 3.04 Intr - 57742 57589 154 0 1 68 80 114 0.667 8.67 3.03 Intr - 65068 64539 530 1 2 17 85 172 0.018 1.53 3.02 Intr - 68596 68486 111 1 0 97 64 78 0.435 6.88 3.01 Init - 74992 74945 48 1 0 106 57 -15 0.148 -1.84 3.00 Prom - 76113 76074 40 -3.16 4.04 PlyA - 76540 76535 6 1.05 4.03 Term - 83380 82559 822 1 0 113 36 866 0.976 76.99 4.02 Intr - 90280 90208 73 0 1 114 88 112 0.999 13.21 4.01 Init - 90476 90391 86 2 2 103 80 160 0.999 15.05 4.00 Prom - 91523 91484 40 -8.86 5.00 Prom + 91652 91691 40 -1.66 5.01 Sngl + 91882 92121 240 0 0 59 55 465 0.998 35.38 5.02 PlyA + 92160 92165 6 1.05 6.00 Prom + 92216 92255 40 -7.16 6.01 Init + 98493 98529 37 2 1 49 80 5 0.472 -4.02 6.02 Intr + 98959 99095 137 2 2 80 83 90 0.508 7.99 6.03 Term + 99200 99334 135 1 0 85 43 100 0.378 3.22 6.04 PlyA + 101493 101498 6 1.05 7.00 Prom + 102222 102261 40 -4.16 7.01 Init + 107008 107065 58 0 1 101 55 96 0.984 9.07 7.02 Intr + 110099 110311 213 0 0 105 101 395 0.816 41.19 7.03 Intr + 117473 117598 126 0 0 31 80 122 0.621 6.35 7.04 Intr + 119693 119803 111 0 0 111 69 151 0.987 15.85 7.05 Term + 120753 120862 110 1 2 124 55 230 0.962 21.97 7.06 PlyA + 121988 121993 6 1.05 8.09 PlyA - 123567 123562 6 1.05 8.08 Term - 126526 125849 678 1 0 58 52 1242 0.999 111.29 8.07 Intr - 128576 128470 107 0 2 114 100 155 0.999 19.23 8.06 Intr - 129145 128926 220 1 1 41 59 386 0.997 28.87 8.05 Intr - 129524 129378 147 2 0 75 94 231 0.975 22.83 8.04 Intr - 132457 132308 150 1 0 86 100 155 0.505 16.86 8.03 Intr - 135070 135042 29 2 2 119 80 22 0.247 2.33 8.02 Intr - 140837 140738 100 2 1 87 7 108 0.010 2.28 8.01 Init - 185280 185146 135 0 0 88 78 318 0.979 30.94 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 58208 58192 17 2 2 100 93 17 0.910 1.96 S.002 Term - 140837 140730 108 2 0 87 49 119 0.951 6.41 S.003 Term - 155231 155161 71 0 2 62 42 109 0.897 1.80 S.004 Init - 159223 159178 46 1 1 88 116 34 0.817 7.04 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815579f:9728338_9930005|GENSCAN_predicted_peptide_1|206_aa MTGSSSTSRKGTVKEGSCSTHQGKSAPKVLSDPTPEESWQELVPAVPPPYREEGVPTPEP TAPIPLPDSHTPRPPRVDKRGSEAVGETPPLAARLQPKTGIQTPLREQRYTGVDEDGDVV ERHAFVCQPFTSVDLLNWKNNTPSYTKKPQALIDLLQTIMQTHNPTWADCHQLLMYLFNT DEQRRVLQAATKWLEEYVPADYQNPQ >gi568815579f:9728338_9930005|GENSCAN_predicted_CDS_1|621_bp atgacaggcagcagcagtactagtaggaaagggacagttaaggaaggttcttgctccaca caccaagggaaatcagcaccaaaagtcctgtcagacccaacaccagaagaatcatggcag gaattggtaccagcagtaccccctccttatcgagaggaaggggtccccactcctgagccc acagcacctatacctctgccagatagccacactcctagaccacccagagtggacaaaaga ggaagtgaagctgtgggagaaactcctcctttggcagctcgcttacagcccaagactgga atccaaacacccctgagagagcagcgatatactggggtagatgaggatggagacgtggtg gaaaggcatgcctttgtgtgtcaacctttcacctctgttgacctcctcaattggaaaaat aatactccatcttacaccaaaaagcctcaagctttaattgacttgctccaaactattatg cagactcataatcctacttgggctgattgccaccagctactcatgtacctctttaataca gacgaacagcgaagggtgctccaggcagcaactaagtggctagaggagtacgtcccagca gattaccaaaacccccagtaa >gi568815579f:9728338_9930005|GENSCAN_predicted_peptide_2|235_aa MGQGHGPTLEDQEEDKQERVYFLAQSYTDTHWLHEPGLQEGTRAVAREYPHWQYQTDSPG IARRDYMVSCLVEGLKKAAYSAVNYDNLKKKSPNLDSGPQTPQQDLINLTFKVFNNREEL AKQQRISELQLLASTVRQPTTSPAYKTFRTTKTQLPGAPSKPPGGPCFKCQKPGHWASES PQPGFPPKPCPLSVGPHWKLDCPTHIATIPKAPGARTQLSLADSFPDLLDLVAED >gi568815579f:9728338_9930005|GENSCAN_predicted_CDS_2|708_bp atgggccaggggcacgggcccacgctagaggaccaggaggaagacaaacaagaaagagtt tattttctagcccagtcctacacagacacccactggcttcatgagccaggcctccaagag ggcaccagggctgttgcccgagagtatccccattggcaataccagacagactccccaggt atagctaggcgagattacatggtctcctgcctagtcgaggggctgaaaaaggcagcatac agcgctgttaattatgacaacctaaagaaaaaatctccaaatttggattctggccctcaa accccacaacaggatttaatcaatctcaccttcaaggtgttcaataacagagaagaactg gccaaacagcaacgtatctctgagttacagctacttgcctccactgtaagacaacccaca acatctccagcatacaaaaccttcagaacaaccaagacacagctcccaggggctccttca aaacctcctggtggaccttgcttcaaatgccaaaagcctggtcactgggcctcggaaagc ccgcagcccgggtttcctcctaagccatgccctctctctgtgggcccccactggaagttg gactgtccgactcacatcgccaccattcctaaagctcctggagctcgaacccaactttcc ttggcagactccttcccagatctcctcgacttagtggctgaagactga >gi568815579f:9728338_9930005|GENSCAN_predicted_peptide_3|920_aa MAGDTALRRPGEHVPKTLEFNVFSGNQHGRSEQSLLKHKDISYLIPHNHMIEGRKTLTYK NANKVVKRDVSVIYCRITNHQETAHPRTSRMNAEESEQRHFEVPPNRLPPSWSRPHPKPP PGSAPTVPSEAGGTGSNRCRLRGNWDFSPGSWRTRPPTLPPPHASETGSRALSRFRAPRG ESEVRVIRAAPPSQEPQLNQRFPLHQPLRPGGASGAASPRGGRKTRAGTRPPRPGPEPSV PSETLERNQKCAWPQRRLLPFSRRWPNGFPSSSLCDQVRLQSPPPTRPLALGFLVRRRSQ YRQIDLDPGPPGMRENGTFLYACRRGGPSSSLSFTSSSFQSSWGSAPTLNAVRNRKRKCQ PARSYTRKSISAVVEKGPSVLLSPTGRLQCVPHLVTFEDVAVDFTQEEWTLLDQAQRDLY RDVMLENYKNLIILAGSELFKRSLMSGLEQMEELRTGVTGVLQELDLQLKTKGSPLLQDI SAERSPNGVQLERSNTAEKLYDSNHSGKVFNEHPFLMTHMITHIGEKTSEDNQSGKALRK NFPHSFYKKSHAEGKMPKCVKHEKAFNQFPNLTRQNKTHTQEKLCECKDCWRTFLNQSSL KLHIRSHNGDKHYVCKECGKAFSNSSHLIGHGRIHSGEKPYVCKECGKAFTQSTGLKLHI RTHSGEKPYKCKECGKAFTHSSYLTDHTRIHSGKKPYVCMECGKAFTRSTGLILHMRIHT GEKPYECKECGKAFIHSSYLTKHVRIHSGEKLYLCKACGKAFTRSSGLVLHMRTHTGEKP YECKECGKAFNNSSMLSQHVRIHTGEKPYECKECGKAFTQSSGLSTHLRTHTGEKACECK ECGKAFARSTNLNMHMRTHTGEKPYACKECGKAFRYSTYLNVHTRTHTGAKPYECKKCGK NFTQSSALAKHLRTKACEKT >gi568815579f:9728338_9930005|GENSCAN_predicted_CDS_3|2763_bp atggccggtgatacagccctcaggaggccaggagaacatgtgcccaagactctagagttt aacgtcttcagtgggaatcagcatggacgttcagaacagtccctccttaaacacaaagac atttcctacctcatacctcataatcacatgatagaagggaggaagaccctgacctataaa aacgcaaacaaagtagtgaagagagatgtatcagttatctactgccgtataacaaaccac caggaaacagcgcatcctcgtacaagccgcatgaatgcagaggaaagcgagcagaggcat ttcgaggtcccgcccaacagactccctccatcgtggtcccgcccccaccccaagccgccg ccgggttctgcccccacggtgcccagtgaagccggagggacggggtccaaccgttgtcgg ctgcgcggaaactgggacttctccccagggtcgtggaggacgcggccgccgaccctccca ccgcctcacgcctcggagaccgggtccagagccctcagccgcttccgagccccgagaggg gaatcagaagtgcgcgtgatcagagcggcacccccttcccaggaaccgcagctgaaccaa cggtttcccctgcaccagcctttgcgaccaggtggggcttcaggtgcggcttcgccacgt ggaggcaggaagacccgggcagggactcggcctccgaggccgggtccagagccctcagtt ccttctgaaaccctggagcggaatcagaagtgcgcgtggccacagcggcgcctcctgccc tttagccgccgctggcccaacggtttccccagcagtagcctctgcgaccaggtgcggctt cagtccccgcctccaacccggcccttggcgctgggtttcctggtccggcgtcgttcccaa tatcgacagattgatctggatcctggtccacctgggatgcgagaaaacgggacttttctg tacgcgtgtcgaagaggtggccccagctcttctctcagcttcacatcatcctcattccaa agctcttggggctcggcccctacacttaatgccgtgcgcaaccggaagcggaagtgccag ccggcgcgcagctatacaaggaaaagcatctcagctgttgtggaaaagggccccagcgtc ctcttgtcaccgacagggcgattgcagtgcgttccgcacttagtgacctttgaggatgtg gctgtggactttacccaggaggagtggactttgttggatcaagcccagagagatctctac agagatgtgatgttggagaactacaagaatctcattatactagcagggtctgaattattc aaacgtagtctcatgtctggattggaacaaatggaagagctgaggacaggagtgacagga gttctgcaggaattggatttgcaactcaaaaccaaaggctccccactgctgcaagatatt tctgcagaaagatcaccaaatggagtacaattggagagaagcaatactgcagagaaactg tatgactctaaccattctggaaaagtcttcaatgaacacccatttcttatgactcacatg ataactcacattggagagaaaacttctgaggataatcagagtggaaaagccttaagaaag aactttcctcatagtttttacaagaaaagtcatgctgaggggaaaatgcctaagtgtgtt aaacatgaaaaagccttcaaccagtttccaaatcttactaggcagaataaaactcacaca caagagaaattgtgtgaatgcaaagactgttggagaacttttcttaatcagtcatccctt aagttacatataagatctcacaatggagacaaacactatgtatgtaaggaatgtgggaaa gccttcagtaattcctcacaccttataggacatggaagaattcacagtggagagaagccc tatgtctgtaaagaatgtggtaaagctttcactcaatccacaggacttaaattacacatc agaactcacagtggagaaaaaccatataaatgtaaagagtgtgggaaagccttcacccat tcttcataccttactgatcatacaagaatccacagtggaaagaagccctatgtatgtatg gaatgtggaaaagccttcactagatccacaggacttattttacacatgcgaattcacact ggagaaaagccatatgaatgtaaggagtgtggaaaagcttttattcattcctcatacctt acaaaacatgtaaggattcacagtggagagaagctgtatttatgtaaggcatgtgggaaa gcttttactcgttcctcaggacttgttttacacatgagaacacatactggagaaaagccc tatgaatgtaaagaatgtgggaaagcctttaataattcctcaatgcttagtcaacatgta aggattcacactggagagaagccatatgaatgcaaagaatgtgggaaagctttcactcaa tcctcgggccttagtacccatttaagaactcacactggagaaaaggcctgtgaatgtaag gaatgcggtaaagcatttgctcgttccacaaatcttaatatgcacatgcgaacgcacaca ggagaaaagccttatgcatgtaaagaatgtgggaaagccttcaggtattccacatacctt aacgttcacacacgaactcacactggagcaaaaccatatgaatgtaagaaatgtgggaaa aacttcactcaatcttcagcacttgctaaacatctaagaactaaggcatgtgaaaaaacc tga >gi568815579f:9728338_9930005|GENSCAN_predicted_peptide_4|326_aa MATLVELPDSVLLEIFSYLPVRDRIRISRVCHRWKRLVDDRWLWRHVDLTLYTMRPKVMW HLLRRYMASRLHSLRMGGYLFSGSQAPQLSPALLRALGQKCPNLKRLCLHVADLSMVPIT SLPSTLRTLELHSCEISMAWLHKQQDPTVLPLLECIVLDRVPAFRDEHLQGLTRFRALRS LVLGGTYRVTETGLDAGLQELSYLQRLEVLGCTLSADSTLLAISRHLRDVRKIRLTVRGL SAPGLAVLEGMPALESLCLQGPLVTPEMPSPTEILSSCLTMPKLRVLELQGLGWEGQEAE KILCKGLPHCMVIVRACPKESMDWWM >gi568815579f:9728338_9930005|GENSCAN_predicted_CDS_4|981_bp atggcgactttggtcgaactgccggactcggtcctgctcgagatcttctcttacctcccg gtacgggaccggatccgcatctccagggtctgtcaccgctggaagaggctggtggacgac cggtggctgtggcgacatgtcgacctgacgctctacacgatgcgacctaaagtcatgtgg cacctccttcgaaggtacatggcatcccggctccattccctgcggatgggtggctacctg ttctctggctcccaggccccccagttgtcccctgctctgttgagagccctgggccagaag tgccccaacctgaagcgcctctgcctgcacgtggccgacctgagcatggtgcccatcacc agcctgcccagcaccttgaggaccctggagctgcacagctgcgagatctccatggcctgg ctccacaagcagcaggaccccaccgtgctgcccctgcttgaatgcatcgtgctggaccgc gtccccgccttccgtgacgagcacctgcagggcctgacgcgcttccgggccttgcgctcg ctggtgctgggtggtacctaccgtgtgaccgagacagggctggatgctggcctgcaggag ctcagctatctgcagaggcttgaggtgctgggctgcaccctgtctgccgacagcaccctg ctggccatcagccgccacctccgagatgtgcgcaagatccggctgaccgtgaggggcctc tctgcccctggcctggctgtgctggagggaatgccggccctggagagtctgtgcctgcag ggtcccctcgtcaccccagaaatgccctcccccactgaaatcctctcctcctgcctcact atgcccaagctcagagtccttgagctgcaggggctggggtgggagggtcaggaggcggag aagatcctgtgtaaggggctgccccactgtatggtcatcgtcagggcttgccccaaagag tctatggactggtggatgtaa >gi568815579f:9728338_9930005|GENSCAN_predicted_peptide_5|79_aa MSIRTKLQNKEHAIEALRRAKFKFPGRQKIYISKKWGFTKFNADEFEDMVAEKRLIPDGC GVKYIPNRGPLDKWLALHS >gi568815579f:9728338_9930005|GENSCAN_predicted_CDS_5|240_bp atgtccatccgcaccaagctgcagaacaaggagcatgcgattgaggccttgcgcagggcc aagttcaagtttcctggccgccagaagatctacatctcaaagaagtggggcttcaccaag ttcaatgctgatgaatttgaagacatggtggctgagaagcggctcatcccagatggctgt ggggtcaagtacatccccaatcgtggccctctggacaagtggctggccctgcactcatga >gi568815579f:9728338_9930005|GENSCAN_predicted_peptide_6|102_aa MNGGGSQAHRGQRHQEPGTVRHHAVTAAVGQRRVYSRIKLRLLTGVRIPGERDGQREELP LQEASGSGSGAYPYSRDPLASPPCDRVSLQQTMTSNELGAGA >gi568815579f:9728338_9930005|GENSCAN_predicted_CDS_6|309_bp atgaatggaggcggctctcaagcacacaggggccaaagacaccaagagcctggaactgtc cggcaccatgcggtaacagcagctgttgggcagcgtagagtctactctaggatcaagctt cgcctactgactggagtccgtatcccgggggaacgcgatggccagagggaagagttgccg ctgcaagaggcttcgggctcaggatcaggagcttacccctactcccgggaccctctggcg tccccaccatgtgaccgagtctccctgcaacaaacaatgacttcaaacgagctgggcgcc ggcgcctga >gi568815579f:9728338_9930005|GENSCAN_predicted_peptide_7|205_aa MADEEKLPPGWEKRMSRSSGRVYYFNHITNASQWERPSGNSSSGGKNGQGEPARVRCSHL LVKHSQSRRPSSWRQEKITRTKEEALELINGIPVAVSPALAAGGCGKVMLVQGLGLAPQT LSACRQSCFPEVGYIQKIKSGEEDFESLASQFSDCSSAKARGDLGAFSRGQMQKPFEDAS FALRTGEMSGPVFTDSGIHIILRTE >gi568815579f:9728338_9930005|GENSCAN_predicted_CDS_7|618_bp atggcggacgaggagaagctgccgcccggctgggagaagcgcatgagccgcagctcaggc cgagtgtactacttcaaccacatcactaacgccagccagtgggagcggcccagcggcaac agcagcagtggtggcaaaaacgggcagggggagcctgccagggtccgctgctcgcacctg ctggtgaagcacagccagtcacggcggccctcgtcctggcggcaggagaagatcacccgg accaaggaggaggccctggagctgatcaacgggatccctgtggctgtgtccccagccctg gccgctgggggctgtggcaaggtgatgcttgttcagggcctgggcctggctcctcagacg ctcagtgcctgcaggcagtcctgcttccccgaagtaggctacatccagaagatcaagtcg ggagaggaggactttgagtctctggcctcacagttcagcgactgcagctcagccaaggcc aggggagacctgggtgccttcagcagaggtcagatgcagaagccatttgaagacgcctcg tttgcgctgcggacgggggagatgagcgggcccgtgttcacggattccggcatccacatc atcctccgcactgagtga >gi568815579f:9728338_9930005|GENSCAN_predicted_peptide_8|521_aa MSVPLLKIGAVLSTMAMVTNWMSQTLPSLVGLNGTVSRAGASEKIDTENRINDQLSLSEL AAKELGAKLMARPYADCSEQPEVTVNTQTLFQNPEEGWQLYTSAQAPDGKCICTAVIPAQ STCSRDGRSRELRQLMEKVQNVSQSMEVLELRTYRDLQYVRGMETLMRSLDARLRAADGS LSAKSFQELKDRMTELLPLSSVLEQYKADTRTIVRLREEVRNLSGSLAAIQEEMGAYGYE DLQQRVMALEARLHACAQKLGCGKLTGVSNPITVRAMGSRFGSWMTDTMAPSADSRVWYM DGYYKGRRVLEFRTLGDFIKGQNFIQHLLPQPWAGTGHVVYNGSLFYNKYQSNVVVKYHF RSRSVLVQRSLPGAGYNNTFPYSWGGFSDMDFMVDESGLWAVYTTNQNAGNIVVSRLDPH TLEVMRSWDTGYPKRSAGEAFMICGVLYVTNSHLAGAKVYFAYFTNTSSYEYTDVPFHNQ YSHISMLDYNPRERALYTWNNGHQVLYNVTLFHVISTSGDP >gi568815579f:9728338_9930005|GENSCAN_predicted_CDS_8|1566_bp atgtcggtgccgctgctcaagatcggggccgtgctgagcaccatggccatggtcaccaac tggatgtcgcagacgctgccctcgctcgtggggctcaacggcaccgtgtcccgtgcgggc gcctctgagaaaatcgacactgagaaccgcataaatgaccagctttccctttctgagttg gctgctaaggagcttggagccaaacttatggcccggccatatgctgattgttcagagcag ccagaggtcactgtgaacacccaaactctcttccagaacccagaagagggctggcagctg tacacctcagcccaggcccctgacgggaaatgcatctgcacggccgtgatcccagcgcag agtacctgctctcgagatggcaggagtcgggagctgcggcaactgatggagaaggtccag aacgtctcccagtccatggaggtccttgagttgcggacgtatcgcgacctccagtatgta cgcggcatggagaccctcatgcggagcctggatgcgcggctccgggcagctgatgggtcc ctctcggccaagagcttccaggagctgaaggacaggatgacggaactgttgcccctgagc tcggtcctggagcagtacaaggcagacacgcggaccattgtacgcttgcgggaggaggtg aggaatctctccggcagtctggcggccattcaggaggagatgggtgcctacgggtatgag gacctgcagcaacgggtgatggccctggaggcccggctccacgcctgcgcccagaagctg ggctgtgggaagctgaccggggtcagtaaccccatcaccgttcgggccatggggtcccgc ttcggctcctggatgactgacacgatggcccccagtgcggatagccgggtctggtacatg gatggctattacaaaggccgccgggtcctggagttccgtaccctgggagacttcatcaaa ggccagaactttatccagcacctgctgccccagccgtgggcgggcacgggccacgtggtg tacaacggctccctgttctataacaagtaccagagcaacgtggtggtcaaataccacttc cgctcgcgctctgtgctggtgcagaggagcctcccgggcgccggttacaacaacaccttc ccctactcctggggcggcttctccgacatggacttcatggtggacgagagcgggctctgg gctgtgtacaccaccaaccagaacgcgggcaacatcgtggtcagccggctggacccgcac accctcgaggtcatgcggtcctgggacaccggctaccccaagcgcagcgctggcgaggcc ttcatgatctgcggtgtgctctacgtgaccaactcccacctggctggggccaaggtctac ttcgcctattttaccaacacgtccagttacgagtacacggacgtgcccttccacaaccag tattcccacatctcgatgctggattacaacccccgggagcgcgccctctatacctggaac aacggccaccaggtgctctacaatgtcaccctgtttcacgtcatcagcacctctggggac ccctga