GENSCAN 1.0    Date run: 6-Nov-116    Time: 22:31:05

Sequence gi568815589f:122271186_122492541 : 221356 bp : 43.32% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   9258   9413  156  1  0   59   92    74 0.452   4.88
 1.02 Intr +  13984  14102  119  2  2   87   94   111 0.989  11.78
 1.03 Intr +  20564  20655   92  1  2   87  100    75 0.632   7.39
 1.04 Intr +  42042  42201  160  0  1   77   79   199 0.643  17.89
 1.05 Term +  51355  51432   78  0  0  108   48    73 0.884   3.06
 1.06 PlyA +  51856  51861    6                               1.05

 2.05 PlyA -  52416  52411    6                               1.05
 2.04 Term -  70816  70722   95  2  2   15   49   119 0.407  -1.31
 2.03 Intr -  71324  71271   54  0  0   56   80    56 0.405   0.55
 2.02 Intr -  72059  71974   86  1  2   25   89    67 0.571  -0.04
 2.01 Init -  75897  75530  368  0  2   67   43   291 0.556  17.00
 2.00 Prom -  90619  90580   40                              -1.96

 3.00 Prom +  98932  98971   40                              -2.56
 3.01 Init +  99900  99906    7  2  1   90   78     0 0.349   0.39
 3.02 Intr + 100001 100087   87  0  0  138   96    34 0.997   9.14
 3.03 Intr + 106714 106830  117  2  0   98   49   189 0.999  16.44
 3.04 Intr + 107248 107388  141  2  0   93   94   160 0.999  17.42
 3.05 Intr + 107590 107733  144  2  0  100   93   108 0.991  12.75
 3.06 Intr + 110186 110367  182  0  2   38   84   185 0.958  12.69
 3.07 Intr + 110479 110562   84  0  0  123   72   119 0.782  13.82
 3.08 Intr + 112324 112570  247  0  1   58   66   307 0.986  22.53
 3.09 Intr + 115261 115547  287  2  2   83  102   587 0.959  56.66
 3.10 Intr + 119013 119160  148  2  1  124   71   268 0.995  28.51
 3.11 Term + 121004 121359  356  0  2   95   55   342 0.994  26.26
 3.12 PlyA + 122094 122099    6                               1.05

 4.02 PlyA - 124182 124177    6                               1.05
 4.01 Sngl - 135126 134896  231  0  0   97   37   229 0.944  11.81
 4.00 Prom - 137293 137254   40                              -4.56

 5.02 PlyA - 137455 137450    6                               1.05
 5.01 Sngl - 139043 137664 1380  2  0   86   47   263 0.962  18.12
 5.00 Prom - 139286 139247   40                              -8.96

 6.02 PlyA - 139434 139429    6                               1.05
 6.01 Sngl - 141059 139755 1305  2  0   49   48   369 0.495  25.04
 6.00 Prom - 141152 141113   40                              -6.16

 7.02 PlyA - 141321 141316    6                               1.05
 7.01 Sngl - 142380 141547  834  0  0   36   43   367 0.806  22.74
 7.00 Prom - 158563 158524   40                              -4.16

 8.04 PlyA - 158947 158942    6                               1.05
 8.03 Term - 160848 160736  113  1  2   24   41    98 0.303  -2.58
 8.02 Intr - 161443 161305  139  1  1  123   68    41 0.817   5.64
 8.01 Init - 166621 166592   30  1  0   93   45    56 0.332   0.49
 8.00 Prom - 169568 169529   40                              -3.76

 9.04 PlyA - 169993 169988    6                               1.05
 9.03 Term - 170438 170350   89  0  2   93   38    77 0.764   1.02
 9.02 Intr - 193208 193128   81  0  0   65   40    91 0.125   1.61
 9.01 Init - 206741 205826  916  2  1   75   30   417 0.436  28.83

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815589f:122271186_122492541|GENSCAN_predicted_peptide_1|201_aa
XKGKGQSQTRVNINAALVEDIINLEEVNEEMKSVIEALKDNFNKTLNIRTSPGSLDKIAV
VTADGKLALNQISQISMKSPQLILVNMASFPECTAAAIKAIRESGMNLNPEVEGTLIRVP
IPQVTREHREMLVKLAKQNTNKAKDSLRKVRTNSMNKLKKSKDTVSEDTIRLIEKQISQM
ADDTVAELDRHLAVKTKELLG
>gi568815589f:122271186_122492541|GENSCAN_predicted_CDS_1|606_bp
nccaaagggaaaggacagtcccaaaccagagtgaatattaatgctgccttggttgaggat
ataatcaacttggaagaggtgaatgaagaaatgaagtctgtgatagaagctctcaaggat
aatttcaataagactctcaatataaggacctcaccaggatcccttgacaagattgctgtg
gtaactgctgacgggaagcttgctttaaaccagattagccagatctccatgaagtcgcca
cagctgattttggtgaatatggccagcttcccagagtgtacagctgcagctatcaaggct
ataagagaaagtggaatgaatctgaacccagaagtggaagggacgctaattcgggtaccc
attccccaagtaaccagagagcacagagaaatgctggtgaaactggccaaacagaacacc
aacaaggccaaagactctttacggaaggttcgcaccaactcaatgaacaagctgaagaaa
tccaaggatacagtctcagaggacaccattaggctaatagagaaacagatcagccaaatg
gccgatgacacagtggcagaactggacaggcatctggcagtgaagaccaaagaactcctt
ggatga
>gi568815589f:122271186_122492541|GENSCAN_predicted_peptide_2|200_aa
MRASCPPGRAAARSLPALRPPGNDSSPGPPRGYTGSRPGRRTGPPTEGRRARVRGDLATG
KRQRAGGRHVFRRAHKPSAGRTPCNMRVCAMHLSLHCTLHVPHTAALPAPTSTCCFHTWW
LSGGEKVALGFTAGGQEIIVDTKPKRPLLGPEEYAHGPGIPGDHSLEEGGSSSLNHKIEI
QKASVTKKVFHRSDATSDLD
>gi568815589f:122271186_122492541|GENSCAN_predicted_CDS_2|603_bp
atgcgcgcttcctgcccgcccggccgggccgcggcccgctcactccccgccctgcggccg
ccgggaaatgacagcagcccgggaccgccgcgcgggtacacggggtcgcgccccgggaga
cggacggggccgcccaccgagggccgccgggcgagggtgcgaggtgacctagcgacgggc
aagcggcagcgggcaggcggccgtcatgtattcaggcgcgcgcacaagccttctgccggt
cgcaccccttgtaacatgcgtgtctgtgccatgcacctaagtctccactgcacgctgcac
gtgccccacacagccgcattgcctgcgccgacatccacgtgttgcttccacacgtggtgg
ctgagtgggggagagaaggtggcccttggcttcacggctggtggccaggaaattatagtc
gacaccaaaccaaagcggcctcttctagggccagaagaatatgcacatggccccggcatt
cctggagaccacagtctggaggagggaggctcatcatcccttaaccacaagattgaaatt
caaaaagcttcagtgaccaaaaaggttttccacaggtctgatgccacatctgacctggac
taa
>gi568815589f:122271186_122492541|GENSCAN_predicted_peptide_3|599_aa
MSRSLLLWFLLFLLLLPPLPVLLADPGAPTPVNPCCYYPCQHQGICVRFGLDRYQCDCTR
TGYSGPNCTIPGLWTWLRNSLRPSPSFTHFLLTHGRWFWEFVNATFIREMLMRLVLTVRS
NLIPSPPTYNSAHDYISWESFSNVSYYTRILPSVPKDCPTPMGTKGKKQLPDAQLLARRF
LLRRKFIPDPQGTNLMFAFFAQHFTHQFFKTSGKMGPGFTKALGHGVDLGHIYGDNLERQ
YQLRLFKDGKLKYQVLDGEMYPPSVEEAPVLMHYPRGIPPQSQMAVGQEVFGLLPGLMLY
ATLWLREHNRVCDLLKAEHPTWGDEQLFQTTRLILIGETIKIVIEEYVQQLSGYFLQLKF
DPELLFGVQFQYRNRIAMEFNHLYHWHPLMPDSFKVGSQEYSYEQFLFNTSMLVDYGVEA
LVDAFSRQIAGRIGGGRNMDHHILHVAVDVIRESREMRLQPFNEYRKRFGMKPYTSFQEL
VGEKEMAAELEELYGDIDALEFYPGLLLEKCHPNSIFGESMIEIGAPFSLKGLLGNPICS
PEYWKPSTFGGEVGFNIVKTATLKKLVCLNTKTCPYVSFRVPDASQDDGPAVERPSTEL
>gi568815589f:122271186_122492541|GENSCAN_predicted_CDS_3|1800_bp
atgagccggagtctcttgctctggttcttgctgttcctgctcctgctcccgccgctcccc
gtcctgctcgcggacccaggggcgcccacgccagtgaatccctgttgttactatccatgc
cagcaccagggcatctgtgtccgcttcggccttgaccgctaccagtgtgactgcacccgc
acgggctattccggccccaactgcaccatccctggcctgtggacctggctccggaattca
ctgcggcccagcccctctttcacccacttcctgctcactcacgggcgctggttctgggag
tttgtcaatgccaccttcatccgagagatgctcatgcgcctggtactcacagtgcgctcc
aaccttatccccagtccccccacctacaactcagcacatgactacatcagctgggagtct
ttctccaacgtgagctattacactcgtattctgccctctgtgcctaaagattgccccaca
cccatgggaaccaaagggaagaagcagttgccagatgcccagctcctggcccgccgcttc
ctgctcaggaggaagttcatacctgacccccaaggcaccaacctcatgtttgccttcttt
gcacaacacttcacccaccagttcttcaaaacttctggcaagatgggtcctggcttcacc
aaggccttgggccatggggtagacctcggccacatttatggagacaatctggagcgtcag
tatcaactgcggctctttaaggatgggaaactcaagtaccaggtgctggatggagaaatg
tacccgccctcggtagaagaggcgcctgtgttgatgcactacccccgaggcatcccgccc
cagagccagatggctgtgggccaggaggtgtttgggctgcttcctgggctcatgctgtat
gccacgctctggctacgtgagcacaaccgtgtgtgtgacctgctgaaggctgagcacccc
acctggggcgatgagcagcttttccagacgacccgcctcatcctcataggggagaccatc
aagattgtcatcgaggagtacgtgcagcagctgagtggctatttcctgcagctgaaattt
gacccagagctgctgttcggtgtccagttccaataccgcaaccgcattgccatggagttc
aaccatctctaccactggcaccccctcatgcctgactccttcaaggtgggctcccaggag
tacagctacgagcagttcttgttcaacacctccatgttggtggactatggggttgaggcc
ctggtggatgccttctctcgccagattgctggccggatcggtgggggcaggaacatggac
caccacatcctgcatgtggctgtggatgtcatcagggagtctcgggagatgcggctgcag
cccttcaatgagtaccgcaagaggtttggcatgaaaccctacacctccttccaggagctc
gtaggagagaaggagatggcagcagagttggaggaattgtatggagacattgatgcgttg
gagttctaccctggactgcttcttgaaaagtgccatccaaactctatctttggggagagt
atgatagagattggggctcccttttccctcaagggtctcctagggaatcccatctgttct
ccggagtactggaagccgagcacatttggcggcgaggtgggctttaacattgtcaagacg
gccacactgaagaagctggtctgcctcaacaccaagacctgtccctacgtttccttccgt
gtgccggatgccagtcaggatgatgggcctgctgtggagcgaccatccacagagctctga
>gi568815589f:122271186_122492541|GENSCAN_predicted_peptide_4|76_aa
MPEPPRAVGSCAAPASPSSATPCSTAPSPIDHPRAEECRRTARDWQAAPPAAPLRDPLDE
ASWAPASRGDLENLYV
>gi568815589f:122271186_122492541|GENSCAN_predicted_CDS_4|231_bp
atgcctgagccgccccgcgccgtgggctcctgcgccgccccagcctccccgtcgagcgcc
accccctgctctacggcgcccagtcccatcgaccacccaagggctgaggagtgcaggcgc
acggcgcgggactggcaggcagctccacctgcggccccgctgcgggatccactggatgaa
gccagctgggctcctgcgtctcgtggggacttggagaacctttatgtctag
>gi568815589f:122271186_122492541|GENSCAN_predicted_peptide_5|459_aa
MAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIAKLILSQKNKAGGITLPD
FKLYYKATVTKTAWYWYQNRDIDQWNRTELSEIMLHIYNYLIFDKPEKNKQWGKDSLFNK
WCWENWLAICRKLKLDPFLTPYRKINSRWIKDLNVRPKTIKTLEENLGNTIQDIGMGKDF
MSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTEWENIFATYSSDKGLISRIY
NELKQIYKKKTNNPIKKWAKDMNRHFSKEDIYAAKKHMKKCSSSLANREMQIKTTMRYHL
TLVRMAIIKKSGNNRCWRGCGEIGTLLHCWWDCKLVQPLWKSVWRFLRDLELEIPFDPAI
PLLGPKDYKSCCYKGTCTRMFTAALFTIAKTWNQPKCPTTIDWIKKMWHIYTMEYDAAIK
NDEFMSFIRTWMKLETIILSKLSQGQKTKHRMFSLIGGN
>gi568815589f:122271186_122492541|GENSCAN_predicted_CDS_5|1380_bp
atggccatactgcccaaggtaatttatagattcaatgccatccccatcaagctaccaatg
actttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcc
cgcatcgccaagttaatcctaagccaaaagaacaaagctggaggcatcacgctacctgac
ttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacaga
gatatagatcaatggaacagaacagagctctcagaaataatgctgcatatctacaactat
ctgatctttgacaaacctgagaaaaacaagcaatggggaaaggattccctatttaataaa
tggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttaca
ccttatagaaaaattaattcaagatggattaaagacttaaatgttagacctaaaaccata
aaaaccctagaagaaaacctaggcaataccattcaggacataggcatgggcaaggacttc
atgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaatt
aaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaataggcaacctaca
gaatgggagaacatttttgcaacctactcatctgacaaagggctaatatccagaatctac
aatgaactcaaacaaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggcaaag
gatatgaacagacacttctcaaaagaagacatttatgcagccaaaaaacacatgaaaaaa
tgctcatcatcactggccaacagagaaatgcaaatcaaaaccacaatgagataccatctc
acactagttagaatggcgatcattaaaaagtcaggaaacaacaggtgctggagaggatgt
ggagaaataggaacacttttacactgttggtgggactgtaaactagttcaaccattgtgg
aagtcggtgtggcgattcctcagggatctagaactagaaataccatttgacccagccatc
ccattactgggcccaaaggattataaatcatgctgctataaagggacatgcacacgtatg
tttactgcggcactattcacaatagcaaagacttggaaccaacccaaatgtccaacaacg
atagattggattaagaaaatgtggcacatatacaccatggaatacgatgcagccataaaa
aacgatgagttcatgtcctttatacggacatggatgaaactggaaaccatcattctcagc
aaactatcacaaggacaaaaaaccaaacaccgcatgttctcactcataggtgggaactga
>gi568815589f:122271186_122492541|GENSCAN_predicted_peptide_6|434_aa
MGDFNTPLSTLDRSTRQKVNKDTQELNSALHQADLIDIYRTLYPKSTEYTFFSAPHHTYS
KTDHIVGSKALLSKCKRTEIMTNCLSDHSAIKLELRIKKLTQNRSTTWKLNNLLLSDYWV
HNEMKAEIKMFFETNENKDTTYQNLWDTFKAVCRGKFIALNAHKRKQERSKIDTLTSQLK
ELEKQEQTHSKASRRQEITKIRAELKETETQKTLQKINESRSWFFEKINKIDRLLARLIK
KKRGKNQIDAIKNDKGDITTDPTEIQTTIREYYKHLYANKLENLEEMDKFLDTSTLPRLN
QEEVESLNRPITGSEIVAIINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSIEKE
GILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKILANRIQQHIKKLIHHN
QVGFIPGMQGWFNI
>gi568815589f:122271186_122492541|GENSCAN_predicted_CDS_6|1305_bp
atgggagactttaacaccccactgtcaacattagacagatcaacgagacagaaagttaac
aaggatacccaggaattgaactcagctctgcaccaagctgacctaatagacatctacaga
actctctaccccaaatcaacagaatatacattcttttcagcaccacaccacacctattcc
aaaactgaccacatagttggaagtaaagcactcctcagcaaatgtaaaagaacagaaatt
atgacaaactgtctctcagaccacagtgcaataaaactagaactcaggattaagaaactc
actcaaaaccgctcaactacatggaaactgaacaacctgctcctgagtgattactgggta
cataacgaaatgaaggcagaaataaagatgttctttgaaaccaatgaaaacaaagacaca
acataccagaatctctgggacacattcaaagcagtgtgtagagggaaatttatagcacta
aatgcccacaagagaaagcaggaaagatccaaaattgacaccctaacatcacaattaaaa
gaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaaa
atcagagcagaactgaaggaaacagagacacaaaaaacccttcaaaaaatcaatgaatcc
aggagctggttttttgaaaagatcaacaaaattgatagactgttagcaagactaataaag
aagaaaagaggaaagaatcaaatagacgcaataaaaaatgataaaggggatatcaccacc
gatcccacagaaatacaaactaccatcagagaatactacaaacacctctacgcaaataaa
ctagaaaatctagaagaaatggataaattcctcgacacatccaccctcccaagactaaac
caggaagaagttgaatctctgaatagaccaataacaggatctgaaattgtggcaataatc
aatagcttaccaaccaaaaagagtccaggaccagatggattcacagccgaattctaccag
aggtacaaggaagagctggtaccattccttctgaaactattccaatcaatagaaaaagag
ggaatcctccctaactcattttatgaggccagcatcatcctgataccaaagcctggcaga
gacacaaccaaaaaagagaattttagaccgatatccttgatgaacattgatgcaaaaatc
ctcaataaaatactggcaaaccgaatccagcagcacatcaaaaagcttatccaccataac
caagtgggctttatccctgggatgcaaggctggttcaacatatga
>gi568815589f:122271186_122492541|GENSCAN_predicted_peptide_7|277_aa
MAKKLKTLKKKLDEWITRITNADKSLKDLMELKAKARELRKECRSLRSQCNQLEERVSVM
EDEMNEMKQEGKFREKRIKRNEQGLQEIWDYVKRPNLHLIGVPESDGENGTKLENTLQDI
TQKNFPNLAMQANIQIQEIQRTPQRYSSRRATPRHIIVRFTKVEMKEKMLRAAREKGWVT
HKGKPIRLTADLLAEALQARREWGPIFNILKEKNFQPRISYPAKLSFISEGEIKSFTDKQ
MLRDFVTTRPALKELLKEALNMERNNRYQPLQKHAKM
>gi568815589f:122271186_122492541|GENSCAN_predicted_CDS_7|834_bp
atggcaaagaagttaaaaactttgaaaaaaaaattagatgaatggataactagaataacc
aatgcagacaagtccttaaaggacctgatggagctgaaagccaaggctcgagaactacgt
aaagaatgcagaagcctcaggagccaatgcaatcaactggaagaaagggtatcagtgatg
gaagatgaaatgaatgaaatgaagcaagaagggaagtttagagaaaaaagaataaaaaga
aacgaacaaggcctccaagaaatatgggactatgtgaaaagaccaaatctacatctgatt
ggtgtacctgaaagtgacggggagaatggaaccaagttggaaaacactctgcaggacatt
acccagaagaacttccccaatctagcaatgcaggccaatattcagattcaggaaatacag
agaacgccacagagatactcctcaagaagagcaactccaagacacataattgtcagattc
accaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaaggttgggttacc
cacaaagggaagcccatcagactaacagcagatctcttggcagaagctctacaagccaga
agagagtgggggccaatattcaacattcttaaagaaaagaattttcaacccagaatttca
tatccagccaaactaagcttcataagtgaaggagaaataaaatcctttacagacaagcaa
atgctgagagattttgtcaccaccaggcctgccctaaaagagctcctgaaggaagcacta
aacatggaaaggaacaaccggtatcagccactgcaaaaacatgccaaaatgtaa
>gi568815589f:122271186_122492541|GENSCAN_predicted_peptide_8|93_aa
MKVKAKALGLCEEDPCFLFTFHHDCMPSKASQPCFLLSLWNCESIKPLFFVNYPVSAGKP
GMPGVIQSVSKGLKTREATSVRPAVPSPENLEL
>gi568815589f:122271186_122492541|GENSCAN_predicted_CDS_8|282_bp
atgaaggtgaaagctaaagcccttggcctgtgtgaagaagatccctgcttcctcttcacc
ttccaccatgactgtatgccttccaaggcctcccagccatgcttcctgttaagcctgtgg
aactgcgagtcaattaaacctcttttctttgtaaattacccagtctcagctggaaaacca
gggatgcctggtgtaattcagtctgtgtccaaaggcctgaaaacgagggaggccactagt
gtaagacctgcagtcccaagccctgagaacctggagctctga
>gi568815589f:122271186_122492541|GENSCAN_predicted_peptide_9|361_aa
MSPENQSSVSEFLLLGLPIRPEQQAVFFALFLGMYLTTVLGNLLIMLLIQLDSHLHTPMY
FFLSHLALTDISFSSVTVPKMLMNMQTQHLAVFYKGCISQTYFFIFFADLDSFLITSMAY
DRYVAICHPLHYATIMTQSQCVMLVAGSWVIACACALLHTLLLAQLSFCADHIIPHYFCD
LGALLKLSCSDTSLNQLAIFTAALTAIMLPFLCILVSYGHIGVTILQIPSTKGICKALST
CGSHLSVVTIYYRTIIGLYFLPPSSNTNDKNIIASVIYTAVTPMLNPFIYSLRNKDIKGA
LRKLLTERLGACRTFSALLTGYLEINSLLLAGGIYISNIITIDLSAYFQKQIIFCVEVYS
E
>gi568815589f:122271186_122492541|GENSCAN_predicted_CDS_9|1086_bp
atgagccctgagaaccagagcagcgtgtccgagttcctcctcctgggcctccccatccgg
ccagagcagcaggccgtgttcttcgccctgttcctgggcatgtacctgaccacggtgctg
gggaacctgctcatcatgctgctcatccagctagactctcaccttcacacccccatgtac
ttcttccttagccacttggccctcactgacatctccttttcatctgtcactgtccctaag
atgctgatgaacatgcagactcagcacctagccgtcttttacaagggatgcatttcacag
acatattttttcatattttttgctgacttagacagtttccttatcacttcaatggcatat
gacaggtatgtggccatctgtcatcctctacattatgccaccatcatgactcagagccag
tgtgtcatgctggtggctgggtcctgggtcatcgcttgtgcgtgtgctcttttgcatacc
ctcctcctggcccagctttccttctgtgctgaccacatcatccctcactacttctgtgac
cttggtgccctgctcaagttgtcctgctcagacacctccctcaatcagttagcaatcttt
acagcagcattgacagccattatgcttccattcctgtgcatcctggtttcttatggtcac
attggggtcaccatcctccagattccctctaccaagggcatatgcaaagccttgtccact
tgtggatcccacctctcagtggtgactatctattatcggacaattattggtctctatttt
cttcccccatccagcaacaccaatgacaagaacataattgcttcagtgatatacacagca
gtcactcccatgttgaacccattcatttacagtctgagaaataaagacattaagggagcc
ctaagaaaactcttgactgagaggcttggagcctgcagaacattctcagccctgctcacc
ggctacctagaaataaactcgctgctgctggcgggcggtatctatatttccaacatcatt
accatcgatctatctgcctatttccagaagcagatcatcttttgtgttgaagtttattct
gaatga