GENSCAN 1.0 Date run: 2-Nov-116 Time: 23:12:44 Sequence gi568815579f:9735345_9949196 : 213852 bp : 49.37% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 782 824 43 1 1 74 100 40 0.312 4.38 1.02 Intr + 8142 8365 224 1 2 96 88 55 0.449 4.15 1.03 Term + 8518 8958 441 0 0 34 44 212 0.772 6.66 1.04 PlyA + 11110 11115 6 1.05 2.11 PlyA - 13187 13182 6 1.05 2.10 Term - 23420 22131 1290 2 0 74 45 752 0.998 61.10 2.09 Intr - 24598 24516 83 2 2 76 91 55 0.994 3.96 2.08 Intr - 26824 26738 87 2 0 93 96 8 0.704 2.04 2.07 Intr - 28064 27938 127 2 1 -1 66 136 0.869 2.75 2.06 Intr - 33023 32945 79 1 1 84 105 15 0.789 2.35 2.05 Intr - 33644 33391 254 2 2 36 -14 228 0.403 3.63 2.04 Intr - 50735 50582 154 1 1 68 80 114 0.667 8.67 2.03 Intr - 58061 57532 530 2 2 17 85 172 0.018 1.53 2.02 Intr - 61589 61479 111 2 0 97 64 78 0.435 6.88 2.01 Init - 67985 67938 48 2 0 106 57 -15 0.148 -1.84 2.00 Prom - 69106 69067 40 -3.16 3.04 PlyA - 69533 69528 6 1.05 3.03 Term - 76373 75552 822 2 0 113 36 866 0.976 76.99 3.02 Intr - 83273 83201 73 1 1 114 88 112 0.999 13.21 3.01 Init - 83469 83384 86 0 2 103 80 160 0.999 15.05 3.00 Prom - 84516 84477 40 -8.86 4.00 Prom + 84645 84684 40 -1.66 4.01 Sngl + 84875 85114 240 1 0 59 55 465 0.998 35.38 4.02 PlyA + 85153 85158 6 1.05 5.00 Prom + 85209 85248 40 -7.16 5.01 Init + 91486 91522 37 0 1 49 80 5 0.472 -4.02 5.02 Intr + 91952 92088 137 0 2 80 83 90 0.508 7.99 5.03 Term + 92193 92327 135 2 0 85 43 100 0.378 3.22 5.04 PlyA + 94486 94491 6 1.05 6.00 Prom + 95215 95254 40 -4.16 6.01 Init + 100001 100058 58 1 1 101 55 96 0.984 9.07 6.02 Intr + 103092 103304 213 1 0 105 101 395 0.816 41.19 6.03 Intr + 110466 110591 126 1 0 31 80 122 0.621 6.35 6.04 Intr + 112686 112796 111 1 0 111 69 151 0.987 15.85 6.05 Term + 113746 113855 110 2 2 124 55 230 0.962 21.97 6.06 PlyA + 114981 114986 6 1.05 7.09 PlyA - 116560 116555 6 1.05 7.08 Term - 119519 118842 678 2 0 58 52 1242 0.999 111.29 7.07 Intr - 121569 121463 107 1 2 114 100 155 0.999 19.23 7.06 Intr - 122138 121919 220 2 1 41 59 386 0.997 28.87 7.05 Intr - 122517 122371 147 0 0 75 94 231 0.975 22.83 7.04 Intr - 125450 125301 150 2 0 86 100 155 0.505 16.86 7.03 Intr - 128063 128035 29 0 2 119 80 22 0.247 2.33 7.02 Intr - 133830 133731 100 0 1 87 7 108 0.010 2.28 7.01 Init - 178273 178139 135 1 0 88 78 318 0.979 30.94 7.00 Prom - 189583 189544 40 -3.96 8.03 PlyA - 190848 190843 6 1.05 8.02 Term - 194864 194760 105 2 0 84 42 19 0.101 -4.69 8.01 Init - 201022 200960 63 1 0 68 91 211 0.950 18.55 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 51201 51185 17 0 2 100 93 17 0.910 1.96 S.002 Term - 133830 133723 108 0 0 87 49 119 0.951 6.41 S.003 Term - 148224 148154 71 1 2 62 42 109 0.897 1.80 S.004 Init - 152216 152171 46 2 1 88 116 34 0.817 7.04 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815579f:9735345_9949196|GENSCAN_predicted_peptide_1|235_aa MGQGHGPTLEDQEEDKQERVYFLAQSYTDTHWLHEPGLQEGTRAVAREYPHWQYQTDSPG IARRDYMVSCLVEGLKKAAYSAVNYDNLKKKSPNLDSGPQTPQQDLINLTFKVFNNREEL AKQQRISELQLLASTVRQPTTSPAYKTFRTTKTQLPGAPSKPPGGPCFKCQKPGHWASES PQPGFPPKPCPLSVGPHWKLDCPTHIATIPKAPGARTQLSLADSFPDLLDLVAED >gi568815579f:9735345_9949196|GENSCAN_predicted_CDS_1|708_bp atgggccaggggcacgggcccacgctagaggaccaggaggaagacaaacaagaaagagtt tattttctagcccagtcctacacagacacccactggcttcatgagccaggcctccaagag ggcaccagggctgttgcccgagagtatccccattggcaataccagacagactccccaggt atagctaggcgagattacatggtctcctgcctagtcgaggggctgaaaaaggcagcatac agcgctgttaattatgacaacctaaagaaaaaatctccaaatttggattctggccctcaa accccacaacaggatttaatcaatctcaccttcaaggtgttcaataacagagaagaactg gccaaacagcaacgtatctctgagttacagctacttgcctccactgtaagacaacccaca acatctccagcatacaaaaccttcagaacaaccaagacacagctcccaggggctccttca aaacctcctggtggaccttgcttcaaatgccaaaagcctggtcactgggcctcggaaagc ccgcagcccgggtttcctcctaagccatgccctctctctgtgggcccccactggaagttg gactgtccgactcacatcgccaccattcctaaagctcctggagctcgaacccaactttcc ttggcagactccttcccagatctcctcgacttagtggctgaagactga >gi568815579f:9735345_9949196|GENSCAN_predicted_peptide_2|920_aa MAGDTALRRPGEHVPKTLEFNVFSGNQHGRSEQSLLKHKDISYLIPHNHMIEGRKTLTYK NANKVVKRDVSVIYCRITNHQETAHPRTSRMNAEESEQRHFEVPPNRLPPSWSRPHPKPP PGSAPTVPSEAGGTGSNRCRLRGNWDFSPGSWRTRPPTLPPPHASETGSRALSRFRAPRG ESEVRVIRAAPPSQEPQLNQRFPLHQPLRPGGASGAASPRGGRKTRAGTRPPRPGPEPSV PSETLERNQKCAWPQRRLLPFSRRWPNGFPSSSLCDQVRLQSPPPTRPLALGFLVRRRSQ YRQIDLDPGPPGMRENGTFLYACRRGGPSSSLSFTSSSFQSSWGSAPTLNAVRNRKRKCQ PARSYTRKSISAVVEKGPSVLLSPTGRLQCVPHLVTFEDVAVDFTQEEWTLLDQAQRDLY RDVMLENYKNLIILAGSELFKRSLMSGLEQMEELRTGVTGVLQELDLQLKTKGSPLLQDI SAERSPNGVQLERSNTAEKLYDSNHSGKVFNEHPFLMTHMITHIGEKTSEDNQSGKALRK NFPHSFYKKSHAEGKMPKCVKHEKAFNQFPNLTRQNKTHTQEKLCECKDCWRTFLNQSSL KLHIRSHNGDKHYVCKECGKAFSNSSHLIGHGRIHSGEKPYVCKECGKAFTQSTGLKLHI RTHSGEKPYKCKECGKAFTHSSYLTDHTRIHSGKKPYVCMECGKAFTRSTGLILHMRIHT GEKPYECKECGKAFIHSSYLTKHVRIHSGEKLYLCKACGKAFTRSSGLVLHMRTHTGEKP YECKECGKAFNNSSMLSQHVRIHTGEKPYECKECGKAFTQSSGLSTHLRTHTGEKACECK ECGKAFARSTNLNMHMRTHTGEKPYACKECGKAFRYSTYLNVHTRTHTGAKPYECKKCGK NFTQSSALAKHLRTKACEKT >gi568815579f:9735345_9949196|GENSCAN_predicted_CDS_2|2763_bp atggccggtgatacagccctcaggaggccaggagaacatgtgcccaagactctagagttt aacgtcttcagtgggaatcagcatggacgttcagaacagtccctccttaaacacaaagac atttcctacctcatacctcataatcacatgatagaagggaggaagaccctgacctataaa aacgcaaacaaagtagtgaagagagatgtatcagttatctactgccgtataacaaaccac caggaaacagcgcatcctcgtacaagccgcatgaatgcagaggaaagcgagcagaggcat ttcgaggtcccgcccaacagactccctccatcgtggtcccgcccccaccccaagccgccg ccgggttctgcccccacggtgcccagtgaagccggagggacggggtccaaccgttgtcgg ctgcgcggaaactgggacttctccccagggtcgtggaggacgcggccgccgaccctccca ccgcctcacgcctcggagaccgggtccagagccctcagccgcttccgagccccgagaggg gaatcagaagtgcgcgtgatcagagcggcacccccttcccaggaaccgcagctgaaccaa cggtttcccctgcaccagcctttgcgaccaggtggggcttcaggtgcggcttcgccacgt ggaggcaggaagacccgggcagggactcggcctccgaggccgggtccagagccctcagtt ccttctgaaaccctggagcggaatcagaagtgcgcgtggccacagcggcgcctcctgccc tttagccgccgctggcccaacggtttccccagcagtagcctctgcgaccaggtgcggctt cagtccccgcctccaacccggcccttggcgctgggtttcctggtccggcgtcgttcccaa tatcgacagattgatctggatcctggtccacctgggatgcgagaaaacgggacttttctg tacgcgtgtcgaagaggtggccccagctcttctctcagcttcacatcatcctcattccaa agctcttggggctcggcccctacacttaatgccgtgcgcaaccggaagcggaagtgccag ccggcgcgcagctatacaaggaaaagcatctcagctgttgtggaaaagggccccagcgtc ctcttgtcaccgacagggcgattgcagtgcgttccgcacttagtgacctttgaggatgtg gctgtggactttacccaggaggagtggactttgttggatcaagcccagagagatctctac agagatgtgatgttggagaactacaagaatctcattatactagcagggtctgaattattc aaacgtagtctcatgtctggattggaacaaatggaagagctgaggacaggagtgacagga gttctgcaggaattggatttgcaactcaaaaccaaaggctccccactgctgcaagatatt tctgcagaaagatcaccaaatggagtacaattggagagaagcaatactgcagagaaactg tatgactctaaccattctggaaaagtcttcaatgaacacccatttcttatgactcacatg ataactcacattggagagaaaacttctgaggataatcagagtggaaaagccttaagaaag aactttcctcatagtttttacaagaaaagtcatgctgaggggaaaatgcctaagtgtgtt aaacatgaaaaagccttcaaccagtttccaaatcttactaggcagaataaaactcacaca caagagaaattgtgtgaatgcaaagactgttggagaacttttcttaatcagtcatccctt aagttacatataagatctcacaatggagacaaacactatgtatgtaaggaatgtgggaaa gccttcagtaattcctcacaccttataggacatggaagaattcacagtggagagaagccc tatgtctgtaaagaatgtggtaaagctttcactcaatccacaggacttaaattacacatc agaactcacagtggagaaaaaccatataaatgtaaagagtgtgggaaagccttcacccat tcttcataccttactgatcatacaagaatccacagtggaaagaagccctatgtatgtatg gaatgtggaaaagccttcactagatccacaggacttattttacacatgcgaattcacact ggagaaaagccatatgaatgtaaggagtgtggaaaagcttttattcattcctcatacctt acaaaacatgtaaggattcacagtggagagaagctgtatttatgtaaggcatgtgggaaa gcttttactcgttcctcaggacttgttttacacatgagaacacatactggagaaaagccc tatgaatgtaaagaatgtgggaaagcctttaataattcctcaatgcttagtcaacatgta aggattcacactggagagaagccatatgaatgcaaagaatgtgggaaagctttcactcaa tcctcgggccttagtacccatttaagaactcacactggagaaaaggcctgtgaatgtaag gaatgcggtaaagcatttgctcgttccacaaatcttaatatgcacatgcgaacgcacaca ggagaaaagccttatgcatgtaaagaatgtgggaaagccttcaggtattccacatacctt aacgttcacacacgaactcacactggagcaaaaccatatgaatgtaagaaatgtgggaaa aacttcactcaatcttcagcacttgctaaacatctaagaactaaggcatgtgaaaaaacc tga >gi568815579f:9735345_9949196|GENSCAN_predicted_peptide_3|326_aa MATLVELPDSVLLEIFSYLPVRDRIRISRVCHRWKRLVDDRWLWRHVDLTLYTMRPKVMW HLLRRYMASRLHSLRMGGYLFSGSQAPQLSPALLRALGQKCPNLKRLCLHVADLSMVPIT SLPSTLRTLELHSCEISMAWLHKQQDPTVLPLLECIVLDRVPAFRDEHLQGLTRFRALRS LVLGGTYRVTETGLDAGLQELSYLQRLEVLGCTLSADSTLLAISRHLRDVRKIRLTVRGL SAPGLAVLEGMPALESLCLQGPLVTPEMPSPTEILSSCLTMPKLRVLELQGLGWEGQEAE KILCKGLPHCMVIVRACPKESMDWWM >gi568815579f:9735345_9949196|GENSCAN_predicted_CDS_3|981_bp atggcgactttggtcgaactgccggactcggtcctgctcgagatcttctcttacctcccg gtacgggaccggatccgcatctccagggtctgtcaccgctggaagaggctggtggacgac cggtggctgtggcgacatgtcgacctgacgctctacacgatgcgacctaaagtcatgtgg cacctccttcgaaggtacatggcatcccggctccattccctgcggatgggtggctacctg ttctctggctcccaggccccccagttgtcccctgctctgttgagagccctgggccagaag tgccccaacctgaagcgcctctgcctgcacgtggccgacctgagcatggtgcccatcacc agcctgcccagcaccttgaggaccctggagctgcacagctgcgagatctccatggcctgg ctccacaagcagcaggaccccaccgtgctgcccctgcttgaatgcatcgtgctggaccgc gtccccgccttccgtgacgagcacctgcagggcctgacgcgcttccgggccttgcgctcg ctggtgctgggtggtacctaccgtgtgaccgagacagggctggatgctggcctgcaggag ctcagctatctgcagaggcttgaggtgctgggctgcaccctgtctgccgacagcaccctg ctggccatcagccgccacctccgagatgtgcgcaagatccggctgaccgtgaggggcctc tctgcccctggcctggctgtgctggagggaatgccggccctggagagtctgtgcctgcag ggtcccctcgtcaccccagaaatgccctcccccactgaaatcctctcctcctgcctcact atgcccaagctcagagtccttgagctgcaggggctggggtgggagggtcaggaggcggag aagatcctgtgtaaggggctgccccactgtatggtcatcgtcagggcttgccccaaagag tctatggactggtggatgtaa >gi568815579f:9735345_9949196|GENSCAN_predicted_peptide_4|79_aa MSIRTKLQNKEHAIEALRRAKFKFPGRQKIYISKKWGFTKFNADEFEDMVAEKRLIPDGC GVKYIPNRGPLDKWLALHS >gi568815579f:9735345_9949196|GENSCAN_predicted_CDS_4|240_bp atgtccatccgcaccaagctgcagaacaaggagcatgcgattgaggccttgcgcagggcc aagttcaagtttcctggccgccagaagatctacatctcaaagaagtggggcttcaccaag ttcaatgctgatgaatttgaagacatggtggctgagaagcggctcatcccagatggctgt ggggtcaagtacatccccaatcgtggccctctggacaagtggctggccctgcactcatga >gi568815579f:9735345_9949196|GENSCAN_predicted_peptide_5|102_aa MNGGGSQAHRGQRHQEPGTVRHHAVTAAVGQRRVYSRIKLRLLTGVRIPGERDGQREELP LQEASGSGSGAYPYSRDPLASPPCDRVSLQQTMTSNELGAGA >gi568815579f:9735345_9949196|GENSCAN_predicted_CDS_5|309_bp atgaatggaggcggctctcaagcacacaggggccaaagacaccaagagcctggaactgtc cggcaccatgcggtaacagcagctgttgggcagcgtagagtctactctaggatcaagctt cgcctactgactggagtccgtatcccgggggaacgcgatggccagagggaagagttgccg ctgcaagaggcttcgggctcaggatcaggagcttacccctactcccgggaccctctggcg tccccaccatgtgaccgagtctccctgcaacaaacaatgacttcaaacgagctgggcgcc ggcgcctga >gi568815579f:9735345_9949196|GENSCAN_predicted_peptide_6|205_aa MADEEKLPPGWEKRMSRSSGRVYYFNHITNASQWERPSGNSSSGGKNGQGEPARVRCSHL LVKHSQSRRPSSWRQEKITRTKEEALELINGIPVAVSPALAAGGCGKVMLVQGLGLAPQT LSACRQSCFPEVGYIQKIKSGEEDFESLASQFSDCSSAKARGDLGAFSRGQMQKPFEDAS FALRTGEMSGPVFTDSGIHIILRTE >gi568815579f:9735345_9949196|GENSCAN_predicted_CDS_6|618_bp atggcggacgaggagaagctgccgcccggctgggagaagcgcatgagccgcagctcaggc cgagtgtactacttcaaccacatcactaacgccagccagtgggagcggcccagcggcaac agcagcagtggtggcaaaaacgggcagggggagcctgccagggtccgctgctcgcacctg ctggtgaagcacagccagtcacggcggccctcgtcctggcggcaggagaagatcacccgg accaaggaggaggccctggagctgatcaacgggatccctgtggctgtgtccccagccctg gccgctgggggctgtggcaaggtgatgcttgttcagggcctgggcctggctcctcagacg ctcagtgcctgcaggcagtcctgcttccccgaagtaggctacatccagaagatcaagtcg ggagaggaggactttgagtctctggcctcacagttcagcgactgcagctcagccaaggcc aggggagacctgggtgccttcagcagaggtcagatgcagaagccatttgaagacgcctcg tttgcgctgcggacgggggagatgagcgggcccgtgttcacggattccggcatccacatc atcctccgcactgagtga >gi568815579f:9735345_9949196|GENSCAN_predicted_peptide_7|521_aa MSVPLLKIGAVLSTMAMVTNWMSQTLPSLVGLNGTVSRAGASEKIDTENRINDQLSLSEL AAKELGAKLMARPYADCSEQPEVTVNTQTLFQNPEEGWQLYTSAQAPDGKCICTAVIPAQ STCSRDGRSRELRQLMEKVQNVSQSMEVLELRTYRDLQYVRGMETLMRSLDARLRAADGS LSAKSFQELKDRMTELLPLSSVLEQYKADTRTIVRLREEVRNLSGSLAAIQEEMGAYGYE DLQQRVMALEARLHACAQKLGCGKLTGVSNPITVRAMGSRFGSWMTDTMAPSADSRVWYM DGYYKGRRVLEFRTLGDFIKGQNFIQHLLPQPWAGTGHVVYNGSLFYNKYQSNVVVKYHF RSRSVLVQRSLPGAGYNNTFPYSWGGFSDMDFMVDESGLWAVYTTNQNAGNIVVSRLDPH TLEVMRSWDTGYPKRSAGEAFMICGVLYVTNSHLAGAKVYFAYFTNTSSYEYTDVPFHNQ YSHISMLDYNPRERALYTWNNGHQVLYNVTLFHVISTSGDP >gi568815579f:9735345_9949196|GENSCAN_predicted_CDS_7|1566_bp atgtcggtgccgctgctcaagatcggggccgtgctgagcaccatggccatggtcaccaac tggatgtcgcagacgctgccctcgctcgtggggctcaacggcaccgtgtcccgtgcgggc gcctctgagaaaatcgacactgagaaccgcataaatgaccagctttccctttctgagttg gctgctaaggagcttggagccaaacttatggcccggccatatgctgattgttcagagcag ccagaggtcactgtgaacacccaaactctcttccagaacccagaagagggctggcagctg tacacctcagcccaggcccctgacgggaaatgcatctgcacggccgtgatcccagcgcag agtacctgctctcgagatggcaggagtcgggagctgcggcaactgatggagaaggtccag aacgtctcccagtccatggaggtccttgagttgcggacgtatcgcgacctccagtatgta cgcggcatggagaccctcatgcggagcctggatgcgcggctccgggcagctgatgggtcc ctctcggccaagagcttccaggagctgaaggacaggatgacggaactgttgcccctgagc tcggtcctggagcagtacaaggcagacacgcggaccattgtacgcttgcgggaggaggtg aggaatctctccggcagtctggcggccattcaggaggagatgggtgcctacgggtatgag gacctgcagcaacgggtgatggccctggaggcccggctccacgcctgcgcccagaagctg ggctgtgggaagctgaccggggtcagtaaccccatcaccgttcgggccatggggtcccgc ttcggctcctggatgactgacacgatggcccccagtgcggatagccgggtctggtacatg gatggctattacaaaggccgccgggtcctggagttccgtaccctgggagacttcatcaaa ggccagaactttatccagcacctgctgccccagccgtgggcgggcacgggccacgtggtg tacaacggctccctgttctataacaagtaccagagcaacgtggtggtcaaataccacttc cgctcgcgctctgtgctggtgcagaggagcctcccgggcgccggttacaacaacaccttc ccctactcctggggcggcttctccgacatggacttcatggtggacgagagcgggctctgg gctgtgtacaccaccaaccagaacgcgggcaacatcgtggtcagccggctggacccgcac accctcgaggtcatgcggtcctgggacaccggctaccccaagcgcagcgctggcgaggcc ttcatgatctgcggtgtgctctacgtgaccaactcccacctggctggggccaaggtctac ttcgcctattttaccaacacgtccagttacgagtacacggacgtgcccttccacaaccag tattcccacatctcgatgctggattacaacccccgggagcgcgccctctatacctggaac aacggccaccaggtgctctacaatgtcaccctgtttcacgtcatcagcacctctggggac ccctga >gi568815579f:9735345_9949196|GENSCAN_predicted_peptide_8|55_aa MWPLTVPPPLLLLLCSGLAGQACVCGVMRVCVCLVTVCMFRDHEHRDMLVSSSHQ >gi568815579f:9735345_9949196|GENSCAN_predicted_CDS_8|168_bp atgtggccgctcacggtcccgccgccgctgctgctgctgctgtgctcaggcctggccgga caggcatgtgtctgtggtgtcatgcgtgtatgtgtatgtttggtgactgtgtgcatgttc agggaccatgagcacagggacatgcttgtatcatcatcacaccagtga