GENSCAN 1.0 Date run: 5-Nov-116 Time: 17:24:18 Sequence gi568815591f:78251903_78458611 : 206709 bp : 37.30% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 2532 2527 6 1.05 1.03 Term - 2775 2678 98 1 2 91 39 50 0.382 -2.45 1.02 Intr - 4679 4041 639 0 0 131 94 663 0.782 62.30 1.01 Init - 18465 17829 637 0 1 75 28 180 0.223 6.25 1.00 Prom - 18969 18930 40 -7.75 2.00 Prom + 19201 19240 40 -9.75 2.01 Init + 19364 19483 120 1 0 34 10 171 0.247 4.14 2.02 Intr + 22285 22485 201 0 0 65 24 187 0.214 8.56 2.03 Intr + 25510 25671 162 0 0 70 94 67 0.278 4.75 2.04 Intr + 36555 36584 30 2 0 125 103 -21 0.268 0.51 2.05 Intr + 38627 39165 539 1 2 44 116 211 0.663 10.28 2.06 Term + 39503 41366 1864 2 1 6 48 691 0.789 41.24 2.07 PlyA + 42173 42178 6 1.05 3.00 Prom + 45605 45644 40 -3.65 3.01 Init + 66106 66251 146 0 2 68 98 130 0.241 11.64 3.02 Intr + 66608 66822 215 2 2 7 77 114 0.445 -0.36 3.03 Intr + 68060 68287 228 0 0 45 72 126 0.554 3.62 3.04 Term + 68546 69099 554 0 2 51 48 181 0.324 3.89 3.05 PlyA + 69131 69136 6 1.05 4.07 PlyA - 70419 70414 6 1.05 4.06 Term - 75274 75040 235 0 1 52 43 157 0.409 2.41 4.05 Intr - 77251 77158 94 2 1 76 94 33 0.126 0.80 4.04 Intr - 92058 91876 183 1 0 37 110 204 0.116 16.34 4.03 Intr - 93670 93542 129 2 0 13 88 103 0.082 2.55 4.02 Intr - 94141 94020 122 0 2 84 64 124 0.851 8.82 4.01 Init - 94567 94515 53 1 2 27 64 48 0.366 -3.02 4.00 Prom - 97285 97246 40 -6.55 5.00 Prom + 97974 98013 40 -6.65 5.01 Init + 101022 101071 50 2 2 110 83 40 0.597 6.17 5.02 Intr + 106374 106716 343 0 1 73 32 130 0.395 0.41 5.03 Term + 106738 107157 420 0 0 90 38 212 0.573 10.70 5.04 PlyA + 107532 107537 6 1.05 6.03 PlyA - 107912 107907 6 1.05 6.02 Term - 115472 115294 179 0 2 67 36 148 0.490 4.37 6.01 Init - 130877 130871 7 2 1 38 113 0 0.067 -1.36 6.00 Prom - 131449 131410 40 -3.35 7.03 PlyA - 131582 131577 6 1.05 7.02 Term - 136515 136314 202 2 1 -20 48 336 0.106 14.68 7.01 Init - 144642 144629 14 0 2 68 121 19 0.469 2.74 7.00 Prom - 151582 151543 40 -3.65 8.00 Prom + 164839 164878 40 -4.35 8.01 Sngl + 171608 171913 306 1 0 49 45 200 0.214 7.42 8.02 PlyA + 173819 173824 6 1.05 9.03 PlyA - 175825 175820 6 1.05 9.02 Term - 176704 176630 75 1 0 86 48 102 0.393 2.86 9.01 Init - 186902 186807 96 2 0 61 89 41 0.042 1.86 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 91984 91876 109 1 1 97 110 193 0.880 22.83 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:78251903_78458611|GENSCAN_predicted_peptide_1|457_aa MAILPKIIYRLNAIPIKLPLTFFTELEKTTSNFIQNQKRALTAKTILSKKNKTGDIMLSD FKLYYKATLTQTAWYWYQNRYIDQWNRTEAPEIPPHIYQHLISDKPDKNKQWGKDSLFNK WCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIETLEENLGNTIQDIVMGKDF MTKTPKAMATKAKIDKWDLIKGLLHSKRNYHQGDVIVYINEVCVLGHTHADVVKLFQSVP IGQSVNLVLCRGYPLPFDPEDPANSMVPPLAIMERPPPVMVNGRHNYETYLEYISRTSQS VPDITDRPPHSLHSMPTDGQLDGTYPPPVHDDNVSMASSGATQAELMTLTIVKGAQGFGF TIADSPTGQRVKQILDIQGCPGLCEGDLIVEINQQNVQNLSHTEVVDILKDCPIGSETSL IIHRGGVFPSRKYHIVDSLFGNGQQTPGICEQHSTKS >gi568815591f:78251903_78458611|GENSCAN_predicted_CDS_1|1374_bp atggccatactgcccaagataatttatagactcaatgctatccccatcaagctaccattg actttcttcacagaattggaaaaaactacttcaaatttcatacagaatcaaaaaagagcc ctcacagccaagacaatcctaagcaaaaagaacaaaactggagacatcatgctatcagac ttcaaactatactacaaggctacattaacacaaacagcatggtactggtaccaaaacaga tacatagaccaatggaacagaacagaggccccagaaataccaccacatatctaccagcat ttgatctctgacaaacctgacaaaaacaagcaatggggaaaggattccctatttaataaa tggtgttgggaaaactggctagccatatgcagaaagctgaaactggaccccttccttaca ccttatacaaaaattaattcaagatggattaaagacttaaatgttagacctaaaaccata gaaaccctagaagaaaacctaggcaataccattcaggacatagtcatgggcaaagacttt atgactaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaatt aaaggccttctgcacagcaaaaggaactatcatcagggtgatgtcattgtctatattaat gaagtttgtgtccttggacacactcatgcagatgttgtcaaacttttccagtctgttcct attggtcagagtgtcaacctggtgttgtgtcgtggctaccctttgccctttgatcctgaa gaccctgctaacagcatggtgccaccccttgcaataatggagaggccacctccagtgatg gtcaatggaagacacaactatgaaacatatttggagtacatttctcggacctcacagtca gttccagatataacagatcggccgcctcattctctgcactccatgccaactgatggtcag ctagacggcacgtatccaccgcccgtccatgatgacaatgtgtctatggcttcatctggg gccacccaagctgaacttatgaccttaaccattgtgaaaggtgcccagggcttcggcttc actattgccgacagtcctacaggacagcgggtgaaacaaatacttgacattcagggatgc cctggcctgtgtgaaggcgacctcattgttgagatcaaccagcagaatgtacagaacctg agccatacagaagtagtggatatacttaaggactgtcccattggaagtgaaacttctttg attatccatcgaggaggtgtttttccttcaaggaagtatcacattgttgattccttgttt ggaaacggacagcaaacaccaggcatttgtgagcagcactctacaaagtcttag >gi568815591f:78251903_78458611|GENSCAN_predicted_peptide_2|971_aa MLIEPALHPRDEADLIVEDKLFDVLLDSVCQYFIEDFRIESGSSAAGLLEFAGGPLQTLF AWVSPADAAEQQSLLPVPSSGSFIPEGHLQMPAIALLYEVSVSSYWEGASSTAVTHSKGL QTFAKCKKLVTFQEGMRCPEQSYPKVQQVLENGVSKRVDSEVPCFNSSGNTINETEREQG YPGLELSSAPSRPNRHLQNSPPQINRIYILLSTTLHCTYSKIDHTLGSKALLSKCKITEI ITDSQTTVLSELRIKKLTQKPLSYMETEQLLNDYWVHNEMKAEIKMFFETNENKDTTYQN LWDTFKAVCRGKFIALNAHKRKQERSKIDTLMSQLKELEKQEQTHSKVSRRPIPSSEIEA IINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSIEKEGILPNSFYEASIILIPKP GRDTTEKENFRPISLMNIDAKILNKILANRIQQHIKKLLHHDQVGFIPGMQGWFNICKSI NVIQHINRTSDKNHTIISIDAEKAFNKIQQLFMLKTLNKLGTNGMYLKIVRAIYDKPTAN IILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQEKEIQGIQLGKEEVKLSLF ADDMIVYLENPIVSAQNLLKLISNFSKVSGYKINLQKSQAFLYTNNRQTESQIMSELPFT TASKRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWRNIPCSWVGRINIVKMAILPK VTYRFNAIPIKLPMTFFTELEKTILKFRWNQKRAHIAKTILSQKNKAGGIRLPDFKLFYK ATVTKTARYWYENRDIDQWNRTEPLEIMPHIYNHLIFDKPDKNKQWGKDSLFNKWCWENW LAICRKLKLDPFLTSYTKINSRWIEDLNVRPKTIETLEENLGNTIQDIGMGEDFMTETPK AMATKAQIDKWDLIKLKSLCTAKETTIRVNRQPTEWEKLFIVYPSDKGLISRIYKELKQI YKKKSNHPIEK >gi568815591f:78251903_78458611|GENSCAN_predicted_CDS_2|2916_bp atgttgattgaaccagccttgcatcccagggatgaagccgacttgatcgtggaggataag ctttttgacgtgctgctggattcggtttgccagtattttattgaggattttcgaattgag tcaggatcctctgctgcaggtctcctggagtttgctggaggtccactccagaccctgttt gcctgggtatcaccagcagatgctgcagaacagcaaagtttgttgcctgttccttcctct ggaagcttcatcccagaggggcacctgcagatgccagccatagctctcctgtatgaggtg tctgtcagctcctactgggagggggcatccagtacagcagtgacacacagtaaaggcttg caaacatttgccaaatgcaaaaaattagtaacatttcaagaaggcatgagatgtccagaa cagagctatccaaaggtgcagcaagttttagaaaatggggttagcaagagagtggattct gaggttccttgttttaattcttcaggaaatacgatcaatgagacagaaagagaacaagga tatccaggacttgaactcagctctgcaccaagcagacctaatagacatctacagaactct ccaccccaaatcaacagaatatacattcttctcagcaccacattgcattgcacttattcc aaaattgaccacacacttggaagtaaagcactcctcagcaaatgtaaaataacagaaatt ataacagactctcagaccacagtgctatcagaactcaggattaagaaactcactcaaaaa ccgctcagctacatggaaactgaacagctcctgaatgactactgggtacataatgaaatg aaggcagaaataaagatgttctttgaaaccaatgagaacaaagacacaacataccagaat ctctgggatacatttaaagcagtgtgtagagggaaatttatagcactaaatgcccacaag agaaagcaggaaagatctaaaattgacaccctaatgtcacaattaaaagagctagagaag caagagcaaacacattcaaaagttagcagaagaccaataccaagttctgaaattgaggca ataattaatagcttaccaaccaaaaaaagtccaggaccagatggattcacagccgaattc taccagaggtacaaggaggagctggtaccattccttctgaaactattccaatcaatagaa aaagagggaatccttcctaactcattttatgaggccagcatcattctgataccaaagcct ggcagagacacaacagaaaaagagaattttagaccaatatccctgatgaacatcgatgca aaaatcctcaataaaatactggcaaaccgaatccagcagcacatcaagaagcttctccac catgatcaagtgggcttcatccctgggatgcaaggctggttcaacatatgcaaatcaata aatgtaattcagcatataaacagaaccagtgacaaaaaccatacgattatctcaatagat gcggaaaaggccttcaacaaaattcaacagctgttcatgctaaaaactctcaataaacta ggtaccaatgggatgtatctcaaaatagtaagagctatttatgacaaacccacagccaat atcatactgaatgggcaaaaactggaagcattccctttgaaaactggcacaagacaggga tgccctctctcaccactcctattcaacatagtattggaagttctggccagggcaatcagg caggagaaagaaatacagggtattcaattaggaaaagaggaagtcaaattgtccctgttt gcagatgacatgattgtatacttagaaaaccccattgtctcagcccaaaacctccttaag ctgataagcaacttcagcaaagtctcaggatacaaaatcaatctgcaaaaatcacaagca ttcttatacaccaataacagacaaacagagagtcaaatcatgagtgaactcccattcaca actgcttcaaagagaataaaatacctaggaatccaacttacaagggatgtgaaggacctc ttcaaggagaactacaaaccactgctcaatgaaataaaagaggacacaaacaaatggagg aacattccatgctcatgggtaggaagaatcaatatcgtgaaaatggccatactgcccaag gtaacttatagattcaatgccatccccatcaagctaccaatgactttcttcacagaattg gaaaaaactattttaaagttcagatggaaccaaaaaagagcccacattgccaagacaatc ctaagtcaaaagaacaaagctggaggcatcagactacctgacttcaaactattttacaag gctacagtaacaaaaacagcacggtactggtacgaaaacagagatatagaccaatggaac agaacagagcccctggaaataatgccacacatctacaaccatctgatctttgacaaacct gacaaaaacaagcaatggggaaaggattccctatttaataaatggtgctgggaaaactgg ctagccatatgtagaaagctgaaactggaccccttccttacatcttatacaaaaattaat tcaagatggattgaagacttaaatgttagacctaaaaccatagaaaccctagaagaaaac ctaggcaataccattcaggacataggcatgggcgaagactttatgactgaaacaccaaaa gcaatggcaacaaaagcccaaattgacaaatgggatctaattaaactaaagagcttgtgc actgcaaaagaaactaccatcagagtgaacaggcaacctacagaatgggagaaacttttt atagtctacccatctgacaaaggcctaatatccagaatctacaaagaacttaaacaaatt tacaagaaaaaatcaaaccaccccatcgaaaagtag >gi568815591f:78251903_78458611|GENSCAN_predicted_peptide_3|380_aa MRRNQRKKAENSKNQNASSPPKDHNSSPAREQNWTENEFDKLTEVGFRSDGENGTKLENT LQDIIQENFPNLARQANIQIQEIQTMPQRYSSRRATSGHITIRFTKVEMREKMLRAAKEK EIQTAIREYYKHLYTNKLENLEETDKFLDTCTLPSLNQEEFESLNRSITSSEIEAVIDYQ PKRVQDQMDSQPNFTRGMQDWFNICKSINVIHHISRTNDKKHMIISIDAEKAFDKIQDVL MLKTLNKLGIDGTYIKIMRVIYDKHTVNIILNGKKLEVLPWKTGTRQGCPLSPLLFNIVL EVLARAIRQEKETKGIQIGREEVKLSLLADDMMVYLENPMVSAQNLLKLISNFSKVSGYK INVQKSQAFLYTNNKQTAKS >gi568815591f:78251903_78458611|GENSCAN_predicted_CDS_3|1143_bp atgaggagaaaccagcgcaaaaaggctgaaaattccaaaaaccagaatgcctcttctcct ccaaaggatcacaactcctcaccagcaagggaacaaaactggacagagaatgagtttgac aaattgacagaagtaggcttcagaagtgatggggagaatgggaccaagttggaaaacact cttcaggatattatccaggagaacttccccaacttagcaagacaagccaacattcaaatt caggaaatacagacaatgccacaaagatattcctcaagaagagcaacctcaggacacata accatcaggtttaccaaggttgaaatgagggaaaaaatgttaagggcagccaaagagaaa gaaatacaaactgccatcagagaatactataaacacctctacacaaataaactagaaaat ctagaagaaacggataaattcctggacacatgcactctcccgagtctaaaccaggaagaa ttcgaatccttgaatagatcaataacaagttctgaaattgaggcagtaatagactaccaa ccaaaaagagtccaggaccagatggattcacagccgaattttaccagagggatgcaagac tggttcaacatttgcaaatcaataaatgtaatccatcacataagccgaaccaatgacaaa aaacacatgattatctcaatagatgcagaaaaggcctttgacaaaattcaagacgtcctc atgctaaaaactctcaataaactaggtattgatggaacatatatcaaaataatgcgagtt atttatgacaaacacacagtcaatatcatactgaatgggaaaaaactggaagtactccct tggaaaactggcacaagacaaggatgccctctctcaccactcttattcaacatagtactg gaagttctggccagggcaatcaggcaagagaaagaaacaaagggtattcaaataggaaga gaggaagtcaaattgtctcttcttgcagatgacatgatggtatatttagaaaaccccatg gtctcagcccaaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaa atcaatgtgcaaaaatcacaagcattcctatacaccaataacaaacagacagccaaatca tga >gi568815591f:78251903_78458611|GENSCAN_predicted_peptide_4|271_aa MNLPHIYPVGYGDFAKLSHINRRTQFENPVLEAKRKLQQHNMPHTELGTKPLQAPGFRVD SSSWTLPRPKSIRPRAAPERSRSVNDLSYYQRDPAGRPWLFEKPLFTRDASQLKGTFLST TLKKSNMGFGFTIIGGDEPDEFLQVKSVIPDGPAAQDGKMETAEDLHEVTRVHMVNLECH HLTYYFAGKSNSVRELPAGQGIPSVNRLGSAPMWQELTEVKEVKLEPNITLAGGRMHRSI LQKSQPLDASQLPFTRQAATANIKKHTIATS >gi568815591f:78251903_78458611|GENSCAN_predicted_CDS_4|816_bp atgaatttacctcacatttatcctgttggatatggggactttgcaaagctcagccacata aatagaagaacacagtttgaaaatcctgtcctggaagcaaaaaggaagctacagcaacat aacatgccccacacagaacttggaacaaagcccctgcaggccccaggtttccgagtagac agctcctcatggacattacctaggcccaaatccatccgcccccgtgctgccccggagagg tcgcgcagcgtgaatgacttgtcttactatcagcgcgacccagcagggaggccctggctg ttcgaaaaaccactcttcacccgggatgcatcccagttgaagggaacattcctcagcacc accctaaaaaagagcaacatgggctttggatttaccatcattggtggagacgagcctgat gagtttctgcaggtgaaaagtgtgattccggatgggcctgcagcacaggatggaaaaatg gaaacagctgaggatttgcatgaagtgacaagagtacacatggtaaatcttgagtgtcat catctaacttactactttgctggaaaatcaaactctgtaagagagctgcctgctgggcaa ggaatacccagtgtcaacagactgggcagtgctccaatgtggcaggagctgacagaggtg aaagaggtcaagcttgagccaaatattacactggccggaggaagaatgcacagatccatc ctacaaaagagccagccactggatgccagccagttaccattcaccagacaagcagccaca gccaacataaaaaagcatacaattgcaaccagttaa >gi568815591f:78251903_78458611|GENSCAN_predicted_peptide_5|270_aa MAVCTINHVMKHPGPTRWLLKTAERQNLVLSGQGHLVGHPAAILAKWVLLGRKLVVEHHE GINISGNFWRNKLKYLAFLCKQMSTNPSQGPPAPPFLGPQRIFWWPCRTKWGQAALDHLM VFYGISLPYDKVVRLKPTRKFASLGCLAHEVGWKYQVVTATMGEKRKEKAKMHYRKKKQL MRLQKQATKNVEKTNDKYTEVLRPRDSLFETRKDCLFLMLGLAYPSAIATLGYGGAYPSC ITALGCGGSKGSNIGAMGSLGLKLGAREGP >gi568815591f:78251903_78458611|GENSCAN_predicted_CDS_5|813_bp atggccgtgtgcaccatcaatcatgtcatgaagcacccaggacctacaaggtggctgctg aagacagcagaaagacagaacctggtgctcagtggccaaggccatcttgtgggccacccg gcagccatcttggccaagtgggttctgctgggaaggaaactggtggttgagcaccacgag ggcatcaacatttctggcaatttctggagaaacaaattaaagtatctggccttcctctgc aagcagatgagcaccaacccttcccaaggcccacccgccccgccattcctgggcccccag cgcatcttttggtggccttgcagaaccaagtggggccaggctgccctggaccaccttatg gtgttttatgggatctcactgccctatgacaaggttgtacgactgaagcctacaagaaag tttgcctccttggggtgcctggctcacgaggttggctggaaataccaggtagtgacagcc accatgggggagaagaggaaggagaaggccaagatgcactacaggaagaaaaagcagctc atgcggctgcagaaacaggccacaaagaacgtggagaagacaaatgacaaatacacagag gtcctaagaccgagggactccctgtttgagaccagaaaagactgtttattcctcatgctt ggcctggcctacccttctgccattgcgaccttgggatatgggggagcctacccttcctgc atcaccgccctgggatgtgggggatccaagggcagcaatataggtgccatgggcagcctg ggacttaagctgggggcaagggaagggccttag >gi568815591f:78251903_78458611|GENSCAN_predicted_peptide_6|61_aa MKAPFKLTSNAGEDLTKTFLLCPNHLTERQDTPVSESYASSLLDHRDVCSASEDLLLVDP L >gi568815591f:78251903_78458611|GENSCAN_predicted_CDS_6|186_bp atgaaagctcctttcaagctaacttccaatgcaggagaggatttaactaaaacctttctt ctctgtcctaatcatctaacagagaggcaagacacccctgtgtcagaaagctatgcttca tcccttcttgatcatcgagatgtgtgttctgcaagtgaagatctgctgctggtagatcct ttataa >gi568815591f:78251903_78458611|GENSCAN_predicted_peptide_7|71_aa MVKASDFYLPHTLSLDPMSSSFTIPSSPTIAAIMVTTTTDDDDDDDDDDDDDNSGATVTV LVIIIAVPTLY >gi568815591f:78251903_78458611|GENSCAN_predicted_CDS_7|216_bp atggtgaaggcaagtgatttttacctgccgcacactctaagtttggacccaatgtcttca tcattcactattccatcaagtcctactattgctgctattatggttactaccactaccgat gatgatgatgatgacgatgatgatgatgatgatgataacagtggagcaacagtaacagta ctagtaataatcatagcagtacctactctttattga >gi568815591f:78251903_78458611|GENSCAN_predicted_peptide_8|101_aa MWKQLWNWITGRGWNSLEDSEEDRKTWESLELPRDLLNGFDQKPDNDRDNKVQAEVVSDR DEELVGNWRVKMTLVMFYQRDCRHFGPALEICGALKLREMI >gi568815591f:78251903_78458611|GENSCAN_predicted_CDS_8|306_bp atgtggaagcaactttggaattggataacaggcagaggttggaacagtttggaggactca gaagaagataggaaaacgtgggaaagtttggaacttcctagagacttgttgaatggcttt gaccaaaagcctgataatgacagggacaataaggtccaggctgaggtggtctcagataga gatgaggaacttgttgggaactggagagtaaagatgactcttgttatgttttaccaaaga gactgccggcattttggccctgccctagagatttgtggagccttgaagttgagagagatg atttag >gi568815591f:78251903_78458611|GENSCAN_predicted_peptide_9|56_aa MKRWGKIQALEIHVLWRKSISVPCKAELQVQQVFRDEDLDVEDSSMNFLNILFDKP >gi568815591f:78251903_78458611|GENSCAN_predicted_CDS_9|171_bp atgaaaagatggggcaaaatccaggctttagaaattcatgttttgtggagaaaatccatt tctgtgccttgcaaagcagaactacaggtgcagcaagtattccgggatgaggacctggat gtggaggactctagcatgaattttttaaatatactttttgacaaaccgtga