GENSCAN 1.0 Date run: 4-Nov-116 Time: 21:42:28 Sequence gi568815593f:122675104_122929645 : 254542 bp : 39.45% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 888 1183 296 1 2 47 42 349 0.292 20.68 1.02 PlyA + 1663 1668 6 1.05 2.00 Prom + 7558 7597 40 -5.75 2.01 Sngl + 9391 10779 1389 0 0 34 47 386 0.637 24.91 2.02 PlyA + 11585 11590 6 1.05 3.00 Prom + 12595 12634 40 -4.15 3.01 Init + 18580 18664 85 0 1 45 82 140 0.977 10.13 3.02 Intr + 19833 19939 107 1 2 50 82 71 0.774 1.71 3.03 Intr + 24444 24614 171 2 0 73 67 83 0.329 3.92 3.04 Term + 28349 28468 120 1 0 100 37 42 0.153 -2.21 3.05 PlyA + 29659 29664 6 1.05 4.00 Prom + 33030 33069 40 -3.65 4.01 Init + 67139 67372 234 1 0 85 55 184 0.554 13.01 4.02 Intr + 70656 70792 137 2 2 72 75 87 0.227 4.25 4.03 Term + 93297 93624 328 0 1 31 49 624 0.573 46.00 4.04 PlyA + 94406 94411 6 1.05 5.00 Prom + 95261 95300 40 -3.55 5.01 Init + 100001 100108 108 1 0 106 78 96 0.980 10.67 5.02 Intr + 100261 100539 279 0 0 47 55 224 0.146 11.75 5.03 Intr + 120163 120280 118 0 1 67 91 151 0.984 12.42 5.04 Intr + 124589 124752 164 0 2 59 77 137 0.984 8.47 5.05 Intr + 126766 126832 67 0 1 53 97 115 0.998 6.46 5.06 Intr + 126978 127021 44 1 2 105 105 25 0.966 2.94 5.07 Intr + 141812 141925 114 1 0 64 55 157 0.849 9.72 5.08 Intr + 142177 142270 94 0 1 77 76 98 0.999 6.12 5.09 Intr + 143715 143920 206 1 2 70 80 219 0.974 17.30 5.10 Intr + 150947 151090 144 1 0 63 45 117 0.746 4.46 5.11 Intr + 152472 152543 72 2 0 42 115 47 0.518 1.48 5.12 Intr + 152827 152870 44 0 2 88 92 -15 0.005 -4.98 5.13 Intr + 170512 170590 79 1 1 66 78 177 0.024 13.13 5.14 Term + 193406 193564 159 1 0 4 39 190 0.911 2.66 5.15 PlyA + 193934 193939 6 1.05 6.06 PlyA - 194033 194028 6 1.05 6.05 Term - 202959 202804 156 0 0 60 55 126 0.004 3.45 6.04 Intr - 220577 220349 229 1 1 8 42 193 0.061 3.65 6.03 Intr - 224158 223992 167 1 2 65 59 119 0.401 4.54 6.02 Intr - 227234 227103 132 2 0 84 64 113 0.936 8.42 6.01 Init - 228516 228424 93 0 0 83 82 36 0.687 2.93 6.00 Prom - 232136 232097 40 -2.35 7.05 PlyA - 232678 232673 6 1.05 7.04 Term - 235677 235641 37 2 1 97 37 50 0.415 -3.37 7.03 Intr - 237399 235980 1420 1 1 68 40 389 0.560 19.65 7.02 Intr - 237695 237548 148 2 1 93 40 29 0.773 -2.51 7.01 Init - 239551 239147 405 1 0 70 13 237 0.283 11.04 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 100261 100659 399 0 0 47 43 282 0.807 13.73 S.002 Init + 161419 161518 100 0 1 78 76 101 0.910 8.37 S.003 Term + 169823 170013 191 0 2 96 46 142 0.920 7.43 S.004 Init + 170531 170590 60 1 0 111 78 116 0.897 14.20 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593f:122675104_122929645|GENSCAN_predicted_peptide_1|98_aa XSRVAGTTAIWRRHQERTSFTHQQYEELEALFSQTMFPDRNLQEKLALKRNLLESTGKGL VQELAIQIEAAAAAAAAAAISKASKPDPFIQEECAHLP >gi568815593f:122675104_122929645|GENSCAN_predicted_CDS_1|297_bp ncctcccgagtagctgggactacagcaatatggagaaggcatcaagaacgtacttcattc acccaccaacagtatgaagagctagaagctctgtttagccagactatgttcccagataga aatcttcaggagaaactagctttgaaacgcaacctactggagtcaacaggtaaaggcttg gttcaggaactggcaattcagattgaagcagcagcagcagcagcagcagcagcagcaatc agcaaagcaagcaaaccagatcctttcatccaagaagaatgtgcccacctcccctag >gi568815593f:122675104_122929645|GENSCAN_predicted_peptide_2|462_aa MLKTLNKLGIDEMYLKIIRAIYDKPTANIILNGQKLEAFPFKTGTRQGCPLSPLLFNIVL EVLARAIRQEKEIKGIQLGKGEAKLSLFADDMIVHIENPIVSAQNLLKLISNFSKVSGYK INVQKSQAFLYTNNRQTESQIMSEVPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLNEI KEDTNKWKNILCSWIGRINVVKMATLPQVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQK RAHIARSILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEIIPHIY NYLIFDRPDKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNIRPK TIKTLEENPGNTIQDIGMGKDFMSKIPKAKATKARIDKWDLIKLKSFCTAKETTIRVNRQ PAEWEKIFIIYSSDKGLLSRIYNELNQIYKKKTTPSTSGQRI >gi568815593f:122675104_122929645|GENSCAN_predicted_CDS_2|1389_bp atgctaaaaactctcaataaattaggtattgatgagatgtatctcaaaataataagagca atttatgacaaacccacagccaatatcatactgaatgggcaaaaactggaagcattccct ttcaaaactggcacaagacagggatgccctctctcaccactcctattcaacatagtgttg gaagttctggccagggcaatcaggcaggagaaagaaataaagggtattcaattaggaaaa ggggaagccaaattgtccctgtttgcagatgacatgattgtacatatagaaaaccccatc gtctcagcccaaaatctccttaagctgataagcaacttcagtaaagtctcaggatacaaa atcaatgtgcaaaaatcacaagcattcttatacaccaataacagacaaacagagagccaa atcatgagtgaagtcccgttcacaattgcttcaaagagaataaaatacctaggaatccaa cttacaagggacgtgaaggacctcttcaaggagaactacaagccactgctcaatgaaata aaagaggatacaaacaaatggaagaatattctatgctcatggataggaagaatcaatgtc gtgaaaatggccacactgccccaggtaatttatagattcaatgccatccccatcaagcta ccaatgactttcttcacagaattagaaaaaactactttaaagttcatatggaatcaaaaa agagcccacattgccaggtcaatcctaagccaaaagaacaaagctggaggcatcacgcta cctgacttcaaactatactacaaggctacagtaaccaaaacagcgtggtactggtaccaa aacagagatatagaccaatggaacagaacagagccctcagaaataataccacacatctac aactatctgatctttgacagacctgacaaaaacaagcaatggggaaaggattccctattt aataaatggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttc cttacaccttatacaaaaattaattcaagatggattaaagacttaaatattagacctaaa accataaaaaccctagaagaaaacccaggcaataccattcaggacataggcatgggcaag gacttcatgtctaaaataccaaaagcaaaggcaacaaaagccagaattgacaaatgggat ctaattaaactcaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaa cctgcagaatgggagaaaattttcataatctactcatctgacaaagggctactatccaga atctacaatgaactcaatcaaatttacaagaaaaaaacaaccccatcaacaagtgggcaa aggatatga >gi568815593f:122675104_122929645|GENSCAN_predicted_peptide_3|160_aa MGTDNWEKEKERETLLEEEEEEDMDKEVDIFLNQDKHEEKPENESVFIQVADHTLGPEPN QSLEVSEFRLQEYTLIVIPGKTSFTINESLNTVKITRVVNTFLQDVLTQAPGRQASESSV RAMRVDTSTAPRGQEGSFKKELETDTNLGLLVKSLRRKFV >gi568815593f:122675104_122929645|GENSCAN_predicted_CDS_3|483_bp atgggcacagacaattgggagaaggagaaggagagagagacactactagaagaggaggag gaggaagacatggacaaggaagtagacattttccttaaccaagacaagcacgaggaaaag ccagagaatgagagtgtattcattcaagtagctgaccacacacttggtccagaacccaat cagtctcttgaggtcagtgagtttaggctccaggaatacacactcattgttatcccaggt aaaacatcttttacaatcaatgaatcgttaaacacagtgaagatcactcgcgttgttaat acttttctgcaggatgttcttacacaagccccaggaagacaggcaagcgagagttcagtc cgtgccatgagagttgacactagcactgctccaagggggcaagagggaagtttcaagaag gaactagaaacagacacgaatttgggtcttttagtcaaaagtcttcgcagaaagtttgtg tga >gi568815593f:122675104_122929645|GENSCAN_predicted_peptide_4|232_aa MDIKLKRAWSVGRNPTGHAGPPISSHCLQPDDGAILHKKPAPFITELNTHLTQESEKLQE DAGEGEQLQAQLLCYINEVSLARSVSHAFPEPIIVNGDETAIGGLEQSGLPHSHTCQDAE YVARLGNKSKTLSQKKEEEEGEGEGEGEGEGEGEGEGEGEGEGEEEEEEEEEEEEEEEEE EEEEEEEEKEEEEEEEETVIIKGCISLKHSTAIQALRHRSCAIQWILGQCRD >gi568815593f:122675104_122929645|GENSCAN_predicted_CDS_4|699_bp atggatataaagttgaagagagcctggtcagtgggccggaatccaacagggcacgctgga ccccctatcagttctcattgcctccaaccagatgatggggctatcctacataagaagcca gctcccttcatcacagaactaaacacacacctgacccaggagtcagagaagctgcaggag gatgccggggaaggtgaacagttgcaggcccagcttctgtgctatattaatgaagtctca ttggccagaagtgtgtcacatgcttttcctgaaccaatcattgtcaatggtgatgaaact gctattggtggtttagagcaatcaggattaccacattcccacacctgccaagatgcagaa tatgtggccagactgggtaacaaaagcaaaactctgtctcaaaaaaaggaagaagaagaa ggagaaggagaaggagaaggagaaggagaaggagaaggagaaggagaaggagaaggagaa ggagaaggagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaa gaagaagaagaagaagaagaagaaaaagaagaagaagaagaagaagaagaaacagttatt atcaaaggatgcattagtctgaagcactccactgccattcaggcactgagacataggagt tgcgctatacaatggatacttggacagtgcagggattag >gi568815593f:122675104_122929645|GENSCAN_predicted_peptide_5|563_aa MAAEREPPPLGDGKPTDFEDLEDGEDLFTSTVSTLERDVLGLSAGPPQESSLRPAGADLI LRGARTEPSIAAALSQLSWPSSVSLARPPLQAGEGLALHPSKPPPSSFEGVLAFPFQREG VPLPGERNESSPSSPEPASLPAEDISANSNGPKPTEVVLDDDREDLFAEATEEVSLDSPE REPILSSEPSPAVTPVTPTTLIAPRIESKSMSAPVIFDRSREEIEEEANGDIFDIEIGVS DPEKVGDGMNAYMAYRVTTKLPRAVNTQALSGAGILRMVNKAADAVNKMTIKMNESDAWF EEKQQQFENLDQQLRKLHVSVEALVCHRKELSANTAAFAKSAAMLGNSEDHTALSRALSQ LAEVEEKIDQLHQEQAFADFYMFSELLSDYIRLIAAVKGVFDHRMKCWQKWEDAQITLLK KREAEAKMMVANKPDKIQQAKNEIREKERVKDFKTVIIKYLESLVQTQQQFVFLVGQTCL SKFSYRLAGAAMEVYIPSFRYEESDLERGYTQPARDSKQHPYLPRILQAPLDLLDHMAEV AEQFAWQLGPIAEPIHILGPTQN >gi568815593f:122675104_122929645|GENSCAN_predicted_CDS_5|1692_bp atggcggccgagagggaacctcctccgctgggggacgggaagcccaccgactttgaggat ctggaggacggagaggacctgttcaccagcactgtctccaccctagagagggatgtgctt gggctgagcgcagggcctcctcaggagagcagcctgcggcccgcgggtgccgacctaatc ctcagaggggccagaaccgaaccgagcattgcagcggcgctttcccagctcagctggcct tcctcggtgtctcttgcacgtcctcctcttcaagccggagaagggcttgctctgcatccc agtaaacctccaccgagctcttttgaaggggtgctcgcttttcctttccagagggagggt gtgccgctgccaggggagcgcaatgaatcaagtccatcatctccagaaccagctagtctt cctgcagaagatattagtgcaaactccaatggcccaaaacccacagaagttgtattagat gatgacagagaagatctttttgcagaagccacagaagaagtttctttggacagccctgaa agggaacctatcctatcctcggaaccttctcctgcagtcacacctgtcactcctactaca ctcattgctcctagaattgaatcaaagagtatgtctgctcccgtgatctttgatagatcc agggaagagattgaagaagaagcaaatggagacatttttgacatagaaattggtgtatca gatccagaaaaagttggtgatggcatgaatgcctatatggcatatagagtaacaacaaag ctgcctagagcagttaatacacaggctctgagtggagcaggaatattgaggatggtgaac aaggctgccgacgctgtcaacaaaatgacaatcaagatgaatgaatcggatgcatggttt gaagaaaagcagcagcaatttgagaatctggatcagcaacttaggaaacttcatgtcagt gttgaagccttggtctgtcatagaaaagaactttcagccaacacagctgcctttgctaaa agtgctgccatgttaggtaattctgaggatcatactgctttatctagagctttgtctcag cttgcagaggttgaggagaagatagaccagttacatcaagaacaagcttttgctgacttt tatatgttttcagaactacttagtgactacattcgtcttattgctgcagtgaaaggtgtg tttgaccatcgaatgaagtgctggcagaaatgggaagatgctcaaattactttgctcaaa aaacgtgaagctgaagcaaaaatgatggttgctaacaaaccagataaaatacagcaagct aaaaatgaaataagagagaaagaacgagtgaaggattttaaaaccgttatcatcaagtac ttagaatcactagttcaaacacaacaacagtttgttttccttgttggtcagacctgtttg agtaaattctcataccggctggccggcgcggccatggaggtctacatcccgtcctttcgc tatgaagagagcgacctggagcggggatacacgcaacctgccagagactccaaacaacat ccgtatctgccacggatacttcaggcaccattggatctgctggatcatatggccgaagtg gcagagcagtttgcatggcagcttggacctattgcagagcctatccatattctgggaccc actcaaaactag >gi568815593f:122675104_122929645|GENSCAN_predicted_peptide_6|258_aa MGLHSRNKDELSSVQVPSPQEIIRCIKHDLQPLEPGAIPTGVERGSALHYTYKSDLQAIE SQEAIEIFRKANSEENIYCQNPTNSKGFTSLAPKSPRLAARCMLKTIGVKRAEETVFFRW GTDLTVEAVRRARWPGEAAAAGMRGHTLQTCPLRGMIIHHPPAPSLGASHTTPSFHQKRT QCRCASASNHQDPRLPTPLVQDVGYKMVSHHSLVWIPQQLAPSTPCQSRKSQIQVESQAS SEQSGHPACVVAIFQSMA >gi568815593f:122675104_122929645|GENSCAN_predicted_CDS_6|777_bp atgggtcttcattcacgcaacaaagatgagctgagcagtgtgcaggtgcccagtccacag gagataattagatgcataaagcatgatttgcagcctcttgaacctggtgccatcccaact ggagttgaaaggggttcagcactacactacacctacaagtctgaccttcaggctatagag agccaagaggcaatagaaatattcagaaaggcaaactcagaagagaacatctactgccag aaccctaccaacagcaagggatttactagtctagctccaaagtcaccacgactcgcagcc aggtgtatgttgaagactataggagtgaaaagagcagaagagactgtgttcttcaggtgg ggcacagatctcacagtggaggctgtcaggagggcacgctggcctggagaagcagcagct gctggcatgcgtggacacactttgcagacgtgtcccctgcgggggatgataattcatcac cctccagcccccagcctaggggcctctcacacaaccccatccttccaccagaaaagaaca cagtgccgatgtgcctctgcttccaatcaccaggacccaaggttgcctacacccttggtc caagatgtgggatacaaaatggtcagccatcattcccttgtgtggattcctcaacaactt gctccctcaacaccttgccagtccaggaaaagccagattcaggttgagtctcaagcatca tctgaacagagtggacatccggcatgtgtagtggccatctttcaaagcatggcctga >gi568815593f:122675104_122929645|GENSCAN_predicted_peptide_7|669_aa MDKFLDTYTLPRLNQEEVESLNRPITGSEIVAIINSLPTKKSPGPDGFTAEFYQRYKEEL VPFLLKLFQSIEKEGILPNSFYEASIILIPKPDRDTTRKENCRPISLMNIDAKILNKILA NRIQQHIKKLIHHDQTLPNPPLLETPKNDQLKKKKKTHMIISIDAEKAFDKIQQRFMLKP LNKLVLEVLARAMRQEKEIKGIQLGKEEVKLSRFADDMIVYLENPIVSAQNLLKLISNFS KVSGYKINVQKSQAFLYTNNRQTESQIVSELPFTIASKRIKYLGIQLTRDVKDLFKENYK PLLKEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLK FIWNQKRARIAKSILSQKNKAGSITLPDFKLYYKATVSKTAWYWYQNRDIDQWNRTEPSE ITPLIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSRWIKD LNVRPKTIKTLEENLGITIQDIGMGKDFMSKTPKAMATKAKIDKWDLIKLKSFCTAKETT IRVNRQPTKRKKIFTTYSSDEGLISRIYNELKQIYKKKTNNPIKKWAKDMNRHISKEDIY AAKRHMKKCSSSLAIREMQIKTTMRYHLTPVRMAIIKKSGNNKCWRGCGEIGTLLHCWDM DEIGNHHSQ >gi568815593f:122675104_122929645|GENSCAN_predicted_CDS_7|2010_bp atggataaattcctcgacacatacactctcccaagactaaaccaggaagaagttgaatct ctgaatagaccaataacaggatctgaaattgtggcaataatcaatagcttaccaaccaaa aagagtccaggaccagatggattcacagccgaattctaccagaggtacaaggaggaactg gtaccattccttctgaaactattccaatcaatagaaaaagagggaatcctccctaactca ttttatgaggccagcatcatcctgataccaaagcctgacagagacacaacaagaaaagag aattgtagaccaatatccttgatgaacattgatgcaaaaatcctcaataaaatactggca aaccgaatccagcaacacatcaaaaagcttatccatcatgatcaaaccctgccaaatccc cctctgctagaaacacccaagaatgatcaattaaaaaaaaaaaaaaaaacccacatgatt atctcaatagatgcagaaaaggcctttgacaaaattcaacaacgcttcatgctaaaacct ctcaataaattagtgttggaagttctggccagggcaatgaggcaggagaaggaaataaag ggtattcaattaggaaaagaggaagtcaaattgtcccggtttgcagatgacatgattgta tatctagaaaaccccattgtctcagcccaaaatctccttaagctcataagcaacttcagc aaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattcttatacaccaataac agacaaacagagagccaaatcgtgagtgaactcccattcacaattgcttcaaagagaata aaatacctaggaatccaacttacaagggacgtgaaggacctcttcaaggagaactacaaa ccactgctcaaggaaataaaagaggatacaaacaaatggaagaacattccatgctcatgg gtaggaagaatcaatatcgtgaaaatggccatactgcccaaggtaatttacagattcaat gccatccccatcaagctaccaatgactttcttcacagaattggaaaaaactactttaaag ttcatatggaaccaaaaaagagcccgcatcgccaagtcaatcctaagccaaaagaacaaa gctggaagcatcacgctacctgacttcaaactatactacaaggctacagtaagcaaaaca gcatggtactggtaccaaaacagagatatagatcaatggaacagaacagagccctcagaa ataacgccgcttatctacaactatctgatctttgacaaacctgagaaaaacaagcaatgg ggaaaggattccctatttaataaatggtgctgggaaaactggctagccatatgtagaaag ctgaaactggatcccttccttacaccttatacaaaaattaattcaagatggattaaagac ttaaacgttagacctaaaaccataaaaaccctagaagaaaacctaggcattaccattcag gacataggcatgggcaaagacttcatgtctaaaacaccaaaagcaatggcaacaaaagcc aaaattgacaaatgggatctaattaaactaaagagcttctgcacagcaaaagaaactacc atcagagtgaacaggcaacctacaaaacggaagaaaattttcacaacctactcatctgac gaagggctaatatccagaatctacaatgaactcaaacaaatttacaagaaaaaaacaaac aaccccatcaaaaagtgggcaaaggatatgaacagacacatctcaaaagaagacatttat gcagccaaaagacacatgaaaaaatgctcatcatcactggccatcagagaaatgcaaatc aaaaccacaatgagataccatctcactccagttagaatggcaatcattaaaaagtcagga aacaacaagtgctggagaggatgtggagaaataggaacacttttacactgttgggacatg gatgaaattggaaatcatcattctcagtaa