GENSCAN 1.0 Date run: 8-Nov-116 Time: 07:44:08 Sequence gi568815590f:127315867_127516943 : 201077 bp : 40.80% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 3405 4313 909 2 0 86 39 191 0.762 10.01 1.02 PlyA + 4324 4329 6 -1.95 2.00 Prom + 4521 4560 40 -3.65 2.01 Init + 5183 5263 81 1 0 59 76 44 0.838 1.32 2.02 Term + 5541 5792 252 2 0 35 41 227 0.864 7.35 2.03 PlyA + 8362 8367 6 1.05 3.00 Prom + 15626 15665 40 -5.15 3.01 Init + 18812 18954 143 1 2 81 106 43 0.414 5.05 3.02 Intr + 23415 23569 155 0 2 90 99 28 0.108 2.89 3.03 Intr + 32948 33072 125 0 2 98 7 99 0.050 2.18 3.04 Term + 37978 38124 147 0 0 -30 55 243 0.173 6.12 3.05 PlyA + 38618 38623 6 1.05 4.00 Prom + 54795 54834 40 -4.35 4.01 Init + 63993 65596 1604 2 2 42 53 426 0.824 26.40 4.02 Intr + 66945 67016 72 0 0 72 49 86 0.565 0.70 4.03 Term + 75221 75440 220 2 1 60 36 271 0.997 14.63 4.04 PlyA + 76356 76361 6 1.05 5.06 PlyA - 76438 76433 6 1.05 5.05 Term - 76863 76800 64 2 1 105 53 60 0.011 0.58 5.04 Intr - 85496 85395 102 1 0 97 37 89 0.042 3.07 5.03 Intr - 86246 86131 116 2 2 82 66 73 0.304 2.83 5.02 Intr - 90818 90750 69 2 0 99 84 30 0.343 2.16 5.01 Init - 93204 92797 408 0 0 32 103 157 0.135 8.44 5.00 Prom - 93915 93876 40 -4.85 6.00 Prom + 99484 99523 40 -6.85 6.01 Sngl + 100217 101080 864 1 0 88 51 925 0.664 84.42 6.02 PlyA + 101306 101311 6 1.05 7.00 Prom + 105636 105675 40 -5.05 7.01 Init + 106940 107078 139 1 1 65 5 161 0.494 5.85 7.02 Intr + 111267 111419 153 1 0 72 55 101 0.691 4.32 7.03 Term + 111971 112146 176 0 2 70 49 122 0.660 3.44 7.04 PlyA + 112719 112724 6 1.05 8.11 PlyA - 112845 112840 6 1.05 8.10 Term - 116656 116494 163 0 1 -71 48 230 0.346 -0.47 8.09 Intr - 116851 116675 177 0 0 70 92 88 0.645 5.51 8.08 Intr - 118363 118245 119 1 2 24 109 109 0.697 4.94 8.07 Intr - 132630 132566 65 1 2 92 85 1 0.006 -2.28 8.06 Intr - 134959 134825 135 2 0 29 81 95 0.001 2.52 8.05 Intr - 137931 137800 132 1 0 33 61 87 0.001 0.20 8.04 Intr - 147853 147738 116 0 2 103 67 48 0.049 3.37 8.03 Intr - 163304 163217 88 0 1 138 86 63 0.634 9.21 8.02 Intr - 163940 163838 103 2 1 65 78 66 0.451 2.13 8.01 Init - 166936 166853 84 1 0 76 81 83 0.940 7.27 8.00 Prom - 179717 179678 40 -5.65 9.04 PlyA - 180000 179995 6 1.05 9.03 Term - 180353 180106 248 0 2 46 48 196 0.285 6.37 9.02 Intr - 186275 186124 152 1 2 102 18 109 0.176 4.19 9.01 Init - 187766 187699 68 2 2 45 87 93 0.587 5.50 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 140256 140399 144 2 0 111 43 91 0.953 3.83 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590f:127315867_127516943|GENSCAN_predicted_peptide_1|302_aa MAILPKVIYRFNAIPIKLPMIFLTELEKTTLKFIWNQKRARIAKSILSQKNKAGGITLPD FKLYYKATVTKTAWCWYQNRDIDQWNRTEPSEITRHIYNYLIFDKPEKNKQWGKDSLFNK WCWENWLTICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGITIKDIGVGKDF MSKTPKAMATKDKIDKWDLIKLKSFCTAKETTIRVNRQPTKWEKIFTTYSSDKGLISRIY NELKQIYKKKTNNSIKKWAKDMNRHFSKEDIYAAKKHEKMLIITGHQRNANQNHNEIPSH TS >gi568815590f:127315867_127516943|GENSCAN_predicted_CDS_1|909_bp atggccatactgcccaaggtaatttacagattcaatgccatccccataaagctaccaatg attttcctcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcc cgcatcgccaagtcaatcctaagccaaaagaacaaagctggaggcatcacactacctgac ttcaaactatactacaaggctacagtaaccaaaacagcatggtgctggtaccaaaacaga gatatagatcaatggaacagaacagagccatcagaaataacgcggcatatctacaactat ctgatctttgacaaacctgagaaaaacaagcaatggggaaaggattccctatttaataaa tggtgctgggaaaactggctaaccatatgtagaaagctgaaactggatcccttccttaca ccttatacaaaaatcaattcaagatggattaaagatttaaacgttagacctaaaaccata aaaaccctagaagaaaacctaggcattaccattaaggacataggcgtgggcaaggacttc atgtccaaaacaccaaaagcaatggcaacaaaagacaaaattgacaaatgggatctaatt aaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacctaca aaatgggagaaaattttcacaacctactcatctgacaaagggctaatatccagaatctac aatgaactcaaacaaatttacaagaaaaaaacaaacaactccatcaaaaagtgggcgaag gacatgaacagacacttctcaaaagaagacatttatgcagccaaaaaacatgaaaaaatg cttatcatcactggccatcagagaaatgcaaatcaaaaccacaatgagataccatctcac accagttag >gi568815590f:127315867_127516943|GENSCAN_predicted_peptide_2|110_aa MGVHTIHWLFSIDYKFHQVFTRMIGGRERAVSWRRHHYQGIQEHNAYLKLSLNPNNRKHS APLLSQQQVLAMRGQEHVKKSSYSIREKKRPKPDSGANPEEHHLAHESPV >gi568815590f:127315867_127516943|GENSCAN_predicted_CDS_2|333_bp atgggagttcacactattcactggctcttctcaatagactacaaatttcatcaggtgttc acaagaatgattggaggcagggaaagggcagtatcatggagaaggcaccactatcagggt attcaggaacacaatgcctatctgaaactgagcctcaatccaaacaacagaaaacactct gctcctctcctcagtcagcagcaggttcttgctatgagaggacaagagcatgtgaaaaaa tcatcttatagcatcagagaaaagaaaagacctaaacctgatagtggagcaaatcctgag gaacatcatcttgcacatgagtctccagtctaa >gi568815590f:127315867_127516943|GENSCAN_predicted_peptide_3|189_aa MVSNLESYTLSPMCEEITKTFLDTQVSQNCCLSDSLTQKAVGNHIPSKGAYLANSTLNLS AWKLLMQCPRWRKVRVDPEEPREDVQHGDLGSSQGPYFSVFLVSGLPSVVLRPTAAAAPR ELLERQSPGPHPRPSESETLWKASRMCQRKALELSFLLEKERRRKKKEEEEQEEEEQEEQ DKEEEVFFP >gi568815590f:127315867_127516943|GENSCAN_predicted_CDS_3|570_bp atggtttccaacctagaatcctatacccttagtccaatgtgtgaggagataacaaagaca tttttagacactcaagtttcccaaaactgttgtctatcagacagccttactcagaaagct gttgggaatcatataccatcaaaaggtgcttatttagcaaattcaaccttaaacctgagt gcatggaaactattgatgcagtgtccaaggtggagaaaggtcagagtggatccagaggag ccaagagaagacgtccagcatggtgacctgggctcaagtcaaggtccttatttctctgtc ttcttggtcagtggtcttccaagtgttgtcctcaggcccacagcagcagcagcacccagg gagttgttagaaaggcaaagtcctggtccccaccccagacccagtgaatcagaaactctg tggaaagcatctagaatgtgtcagcgaaaggcactggaattatcattccttcttgagaag gaaagaagaagaaagaagaaggaggaggaggagcaggaggaagaggagcaggaggagcag gacaaggaggaggaagtcttcttcccttga >gi568815590f:127315867_127516943|GENSCAN_predicted_peptide_4|631_aa MIISIDAEKAFDKIQQPFMLKTLNKLGIDGTYFKIIRAIYDKPTANIILNGQKLEAFPLK TGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVS AQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLT RDVKDLFKVNYKPLLKEIKEDTNKWKNIPCSWVGRINSVKMAILPKVIYRFNAIPIKLPM TFFTELEKTTLKFIWNQKRACIAKSILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQNR DIDQWNRTEPSEITPHTYNCLIFDKPEKNKKWGKDSLFNKWCWENWLAICRKLKLDPFLT PYTKINSRWIKDLNVRPKTIKTLEENLGITTQDIGMGKDFMSKTPKAMATKDKIDKWDLI KLKSFCTAKETTIRVNRQPTKWEKIFTTYSSDKGLISRIYNELKQIYKKKTNNPIKKWAK DMNRHFSKEDIYAAKEHMKKCSSSLAIREMQIKTTMRYHLTPVRMAIIKKSGNNRFYIQL CHEEAAFLQKLRNCFSCSPVHGSVSGERDRIRQCTEPYDDIFSQEEEECFIKSPISPRLD CTKILQYEHNVLYCPGSNPFPSGNTFANNDS >gi568815590f:127315867_127516943|GENSCAN_predicted_CDS_4|1896_bp atgattatctcaatagatgcagaaaaagcctttgacaaaattcaacaacccttcatgcta aaaactctcaataaattaggtattgatgggacgtatttcaaaataataagagctatctat gacaaacccacagccaatatcatactgaatgggcaaaaactggaagcattccctttgaaa actggcacaagacagggatgccctctctcaccactcctattcaacatagtgttggaagtt ctggccagggcaattaggcaggagaaggaaataaagggtattcaattaggaaaagaggaa gtcaaattgtccctgtttgcagatgacatgattgtatatctagaaaaccccattgtctca gcccaaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaat gtacaaaaatcacaagcattcttatacaccaacaacagacaaacagagagccaaatcatg agtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaacttaca agggatgtgaaggacctcttcaaggtgaactacaaaccactgctcaaggaaataaaagag gatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatagcgtgaaa atggccatactgcccaaggtaatttacagattcaatgccatccccatcaagctaccaatg actttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcc tgcatcgccaagtcaatcctaagccaaaagaacaaagctggaggcatcacactacctgac ttcaaactttactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacaga gatatagatcaatggaacagaacagagccctcagaaataacaccgcatacctacaactgt ctgatctttgacaaacctgagaaaaacaagaaatggggaaaggattccctatttaataaa tggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttaca ccttatacaaaaatcaattcaagatggattaaagacttaaacgttagacctaaaaccata aaaaccctagaagaaaacctaggcattaccactcaggacataggcatgggcaaggacttc atgtccaaaacaccaaaagcaatggcaacaaaagacaaaattgacaaatgggatctaatt aaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacctaca aaatgggagaaaattttcacaacctactcatctgacaaagggctaatatccagaatctac aatgaactcaaacaaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggcgaag gacatgaacagacacttctcaaaagaagacatttatgcagccaaagaacacatgaaaaaa tgctcatcatcactggccatcagagaaatgcaaatcaaaaccacaatgagataccatctc acaccagttagaatggcaatcattaaaaagtcaggaaacaacaggttctatattcagctc tgccatgaagaagctgcgttcctacagaaactgagaaactgcttctcctgcagccctgtg catgggagtgtgtcaggggaaagggaccggatacggcagtgtaccgagccctacgatgac attttcagccaagaagaagaagaatgcttcatcaaatctcccatcagcccaagactagat tgtaccaagattttgcaatatgaacataatgttctctactgtccaggatctaatcctttt ccatcggggaacacatttgcaaacaatgactcctaa >gi568815590f:127315867_127516943|GENSCAN_predicted_peptide_5|252_aa MEATGWQTPRGPHLLPLGALALRRIPVRVVKAVGEGATDESSRAPQCHCDKSACRGTAAL AIALHIKKARRPGMVHHRDRMHHVPMMSSKLHLPLQSASNVPYKKLLEKQKKRKFTDTIH SLCPPSFSLSELSSLQETDLYGLNQQVLMSLMSTWVQTRSRTGEAEHLQDILGMTFKKLQ LSQLSQEPGKHLEHTLFMPKHITWAVEKREQLWDKEHIVKLQSCTVIAWSTSAQYSACKQ QKSNTDSHNCIH >gi568815590f:127315867_127516943|GENSCAN_predicted_CDS_5|759_bp atggaagcaactggctggcagacaccacgggggcctcaccttttgcctttaggtgccctg gctctgcgaagaattccagtcagagttgtgaaggctgtgggtgagggagccactgatgag tccagcagagccccacagtgccactgtgacaagtctgcgtgcagggggacagcagcactg gccattgccctgcatattaagaaggcccgcaggccaggcatggtgcatcatagggacagg atgcaccatgtccctatgatgtccagtaagttacatcttccactgcagtcagcctcaaac gtgccctataagaaactgctggaaaagcaaaagaaaagaaaattcacagatactatccat tccctctgtccaccctccttttctctctctgaattgtcctcactgcaggagactgacctg tatggactaaatcaacaggttctcatgtctctgatgtctacctgggttcagacaaggagt agaacaggggaagctgaacacctgcaggatatcttgggaatgacattcaaaaagctacag ctgtcccagctgagccaggagcctggcaagcacttggagcacacattatttatgcccaaa catattacctgggcagttgagaaacgagaacagttgtgggacaaagaacacattgttaaa ttgcagagttgcacagttattgcctggagcaccagtgcccagtacagtgcctgcaagcaa cagaaatctaatacagattcccacaactgcatacattga >gi568815590f:127315867_127516943|GENSCAN_predicted_peptide_6|287_aa MAYCGPQVGVGLVPQGGLETSQPESEAGVGVESNSNGASPEPCTVPPGAVKLEKEKLEQN PEKSQDIKALQKELEQFAKLLKQKRITLGYTQADVGLILGVLFGKVFSQKTICRFEALQL SFKNMCKLRPLLQKWVEEADNNENLQEICKAETLMQARKRKRTSIENRVRGNLENLFLQC PKPTLQISHIAQQLGLEKDVVRVWFCNRRQKGKRSSSDYAQREDFEAAGSPFSGGPVSFP PAPGPHFGTPGYGSPHFTALYSSVPFPEGEVFPPVSVITLGSPMHSN >gi568815590f:127315867_127516943|GENSCAN_predicted_CDS_6|864_bp atggcgtactgtgggcctcaggttggagtggggctagtgccccaaggcggcttggagacc tctcagcctgagagcgaagcaggagtcggggtggagagcaactccaatggggcctccccg gaaccctgcaccgtcccccctggtgccgtgaagctggagaaggagaagctagagcaaaac ccggagaagtcccaggacatcaaagctctgcagaaagaactcgagcaatttgccaagctc ctgaagcagaagaggatcaccctgggatatacacaggccgatgtggggctcatcctgggg gttctatttgggaaggtgttcagccaaaagaccatctgccgctttgaggctctgcagctt agcttcaagaacatgtgtaagctgcggcccttgctgcagaagtgggtggaggaagctgac aacaatgaaaatcttcaggagatatgcaaagcagaaaccctcatgcaggcccgaaagaga aagcgaaccagtatcgagaaccgagtgagaggcaacctggagaatttgttcctgcagtgc ccgaaacccacactgcagatcagccacatcgcccagcagcttgggctcgagaaggatgtg gtccgagtgtggttctgtaaccggcgccagaagggcaagcgatcaagcagcgactatgca caacgagaggattttgaggctgctgggtctcctttctcagggggaccagtgtcctttcct ccggccccagggccccattttggtaccccaggctatgggagccctcacttcactgcactg tactcctcagtccctttccctgagggggaagtctttcccccagtctccgtcatcactctg ggctctcccatgcattcaaactga >gi568815590f:127315867_127516943|GENSCAN_predicted_peptide_7|155_aa MTRTTRVDLEVVKVVRLRMHLKMELSGPADGYGMAVKESEESGCLPETSRQLYMSSVGLW AWTRHSGVVKVNVPGVIQTELDSQVFTFQLRDIGRLSTFPSSVNDNSTLGVFQLKNLTVS LDSFLHTHAMPTVDAASSVDIWEGGFQVENSKVKG >gi568815590f:127315867_127516943|GENSCAN_predicted_CDS_7|468_bp atgactcggaccacaagggtagacttggaggtggtgaaagtggtcagattaaggatgcac ttgaaaatggagttgtcaggacctgctgatggctatggcatggctgtgaaggaaagtgaa gaatcaggatgcctcccagaaacctcaaggcagttgtatatgtcgtcagtaggtctatgg gcctggacaaggcattctggtgtagtgaaagtcaacgtccctggagtcattcaaactgag ttggactctcaagtcttcactttccagctcagggatattggccgcttatccaccttccca agctcagtaaatgataattccactctaggagttttccagctcaaaaacctcactgtcagc cttgactccttccttcacacccatgccatgccaactgtagatgcagccagctctgtggat atctgggaaggaggcttccaggtggagaatagcaaggtcaaaggctga >gi568815590f:127315867_127516943|GENSCAN_predicted_peptide_8|393_aa MTGNPSQLPLTTLVGPSATSKDSLDKEMYTPECPACSECSKSVGGSDRIPSPLNTTSVAV DNGTEAGEKSVQQLLSLKSLQPLEEGLEVKKRWLDNLLENILAEGLVHLKTWIALNALFW DILMVNLQDEAFTGDFKHLKEQASWRLVAIFFYIHVIILRAQHESPGRYHFSHLTTKPFN IWYTFGPQIISSNSKVTMAMLLLSAWTQEWALPFKTSTKGKSPQLRALNSITSAKSLSWC KGASEAIRQCQSSAAKPRRSGKESVREPWTSVLGALGVAARKAGLAAKGEGEGVEGYLPL PQKSREGVGVLAPSPEKRDLPLRVKDQGRRPCVWSDTLETGVPARIKHQGKAAFPVRDRS RSFGSTDKTCLLCRYQKMKGIEIKGDIEVWHQD >gi568815590f:127315867_127516943|GENSCAN_predicted_CDS_8|1182_bp atgactggcaacccctcacaactgcctctgacaactcttgtgggtccttctgccacatct aaggactctctggataaggagatgtatacaccagaatgccccgcatgcagcgaatgctca aaaagtgttggtggaagtgacagaattccatcaccattaaacacaaccagtgttgcggtt gacaatggcactgaagcaggtgaaaaatccgtacagcagcttctgtctttgaagagtttg cagcctcttgaagagggtttagaggttaaaaaaaggtggcttgataatctactagagaat attttggctgaaggtttagttcacctaaagacttggatagctcttaatgcactattttgg gatattttgatggtaaatcttcaggatgaagctttcactggggacttcaagcatcttaag gagcaggcaagctggcgacttgtggcaatcttcttttacatccatgtcataattcttagg gcccagcatgaaagtccagggcggtatcatttctcacatctgactacaaagcctttcaac atctggtacacatttgggccacagattatctcatccaattcaaaagtcaccatggcaatg cttctgctcagtgcttggacacaggaatgggccttgccttttaaaacctcaaccaaagga aaatctccccaactcagggcacttaattcaatcacatctgcaaagtcactttcatggtgt aagggggcttccgaggcaatcaggcagtgtcagtcttcagctgctaagccgagaagatct gggaaggagtcagtcagagagccttggaccagcgttctaggggctctgggagtggctgcc aggaaagcaggacttgccgctaagggtgaaggagaaggggttgaggggtacttgcccctg ccccagaaaagcagagaaggggttggggtacttgccccttccccagaaaagcgggacttg ccgctaagggtgaaggaccaaggcaggcgtccctgcgtgtggtctgacacccttgaaaca ggcgtccctgcaaggattaaacaccaagggaaggctgccttcccagtccgtgaccggagc cggagttttgggtccacggataaaacgtgtctcctttgtcgctaccagaaaatgaaagga attgaaattaagggagacattgaagtgtggcaccaagattga >gi568815590f:127315867_127516943|GENSCAN_predicted_peptide_9|155_aa MDPGEPICGQELKAYQEQTKRRLKCVGDFRNMLIYKSQLIEKAGYGDTDKVTAQGASRSV HPFRFQLHLNNHGLGKINLPMEDDENLCFAEVLTNVSLTLIVSCADWLKLMFENKDGDRM MFPRGNQDLIKRNGRIPAVQATTDVCYTVSGRYWL >gi568815590f:127315867_127516943|GENSCAN_predicted_CDS_9|468_bp atggaccctggagagcctatttgtggtcaagagctgaaagcttatcaagaacaaactaaa agaagattgaaatgtgttggagatttcagaaacatgcttatttataagtcgcaactaatt gagaaggctggctatggagacacagacaaagtgaccgcacaaggagcttctaggtctgtt catccctttagattccagcttcacttgaataaccatggcttggggaagataaatttgccc atggaagatgacgagaacttgtgtttcgcagaagttctcacaaatgtctcactgacactg attgtttcatgtgctgattggctaaagctcatgtttgagaataaggatggggacaggatg atgttcccacgtggaaatcaggatctcattaagagaaatggaagaattcctgctgtgcag gcaaccacagatgtctgctatacggtgtctggaagatactggttgtga