GENSCAN 1.0 Date run: 6-Nov-116 Time: 12:47:31 Sequence gi568815575f:78654870_78855979 : 201110 bp : 37.48% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 707 702 6 1.05 1.02 Term - 1978 1630 349 0 1 1 36 319 0.683 10.97 1.01 Init - 3551 2128 1424 2 2 71 72 1197 0.757 106.85 1.00 Prom - 11795 11756 40 -3.35 2.07 PlyA - 11957 11952 6 1.05 2.06 Term - 14595 14511 85 2 1 88 38 103 0.443 1.45 2.05 Intr - 19591 19500 92 1 2 66 71 88 0.450 2.77 2.04 Intr - 53908 53834 75 1 0 75 100 42 0.010 2.89 2.03 Intr - 73897 73811 87 1 0 54 92 41 0.049 0.35 2.02 Intr - 81025 80846 180 1 0 34 89 93 0.205 3.14 2.01 Init - 85850 85641 210 2 0 61 27 137 0.414 3.73 2.00 Prom - 89032 88993 40 -6.85 3.00 Prom + 98187 98226 40 -6.45 3.01 Sngl + 100001 101113 1113 1 0 44 43 361 0.959 23.75 3.02 PlyA + 101859 101864 6 1.05 4.04 PlyA - 102076 102071 6 1.05 4.03 Term - 102193 102144 50 2 2 72 48 81 0.235 -1.01 4.02 Intr - 106172 105249 924 0 0 7 54 297 0.022 8.47 4.01 Init - 110887 110767 121 1 1 68 74 86 0.688 5.60 4.00 Prom - 119821 119782 40 -6.75 5.06 PlyA - 120050 120045 6 1.05 5.05 Term - 121065 120776 290 1 2 60 38 170 0.782 3.75 5.04 Intr - 124411 124262 150 2 0 77 32 112 0.700 3.71 5.03 Intr - 124691 124568 124 2 1 38 -3 130 0.829 -1.86 5.02 Intr - 125342 125271 72 2 0 99 108 48 0.937 6.58 5.01 Init - 130189 130088 102 1 0 44 69 71 0.615 1.09 5.00 Prom - 130389 130350 40 -5.85 6.00 Prom + 134842 134881 40 -5.95 6.01 Sngl + 146836 147507 672 0 0 58 32 359 0.473 21.22 6.02 PlyA + 147960 147965 6 1.05 7.04 PlyA - 148757 148752 6 1.05 7.03 Term - 150299 149711 589 1 1 24 42 321 0.015 14.10 7.02 Intr - 156987 156648 340 1 1 101 50 105 0.012 1.61 7.01 Init - 171480 171405 76 0 1 80 87 49 0.097 5.30 7.00 Prom - 177854 177815 40 -2.65 8.04 PlyA - 178319 178314 6 1.05 8.03 Term - 185649 185355 295 2 1 61 47 188 0.309 5.79 8.02 Intr - 196058 196013 46 0 1 74 53 65 0.202 -1.85 8.01 Init - 197641 197581 61 1 1 66 79 38 0.230 2.16 8.00 Prom - 199586 199547 40 -3.65 9.02 PlyA - 199870 199865 6 -0.45 9.01 Sngl - 200887 200075 813 1 0 60 37 252 0.858 12.82 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 62949 63155 207 2 0 68 36 161 0.904 5.26 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575f:78654870_78855979|GENSCAN_predicted_peptide_1|590_aa MVEDLAASYIVLKLENEIRQAQVQWLMEENAALQAQIPELQKSQAAKEYDLLRKSSEAKE PQKLPEHMNPPAAWEAQKTPEFKEPQKPPEPQDLLPWEPPAAWELQEAPAAPESLAPPAT RESQKPPMAHEIPTVLEGQGPANTQDATIAQEPKNSEPQDPPNIEKPQEAPEYQETAAQL EFLELPPPQEPLEPSNAQEFLELSAAQESLEGLIVVETSAASEFPQAPIGLEATDFPLQY TLTFSGDSQKLPEFLVQLYSYMRVRGHLYPTEAALVSFVGNCFSGRAGWWFQLLLDIQSP LLEQCESFIPVLQDTFDNPENMKDANQCIHQLCQGEGHVATHFHLIAQELNWDESTLWIQ FQEGLASSIQDELSHTSPATNLSDLITQCISLEEKPDPNPLGKSSSAEGDGPESPPAENQ PMQAAINCPHISEAEWVRWHKGRLCLYCGYPGHFARDCPVKPHQALQAGNIQACQNSIDH TVVLREKLPIRSNIFPLMLETVDGHPLINGPITKETSPVQVQIGNHVEELQFDIIHAPRY PLIIGIHWLETHDPNIEWSTRTVSFLSRYCHYNCFRHRWNRRNRDEIISG >gi568815575f:78654870_78855979|GENSCAN_predicted_CDS_1|1773_bp atggtggaggacttagcagcctcctatattgttctgaaattggagaatgaaattcggcag gctcaagtgcagtggctgatggaagaaaatgctgctctccaggcccagatcccagagctt cagaagtcccaagcagccaaggagtatgatctactccgaaagtcctcagaggccaaggag ccccagaagctcccagagcacatgaatcccccagcagcctgggaggcccaaaagacccca gagttcaaggaaccccagaagccaccagagccacaggatcttctaccctgggaaccccca gcagcctgggaactccaggaggccccagcagccccagagtccctggcgcctccagcaacc cgggagtcccagaaacctccaatggcccatgagatcccaacagtcttggagggccagggg cctgcaaacacccaggatgccacaatagcccaagagcccaagaattcagaaccccaggat cccccaaatattgagaagccccaggaggcaccagaataccaggaaactgcagcacagcta gagttccttgagcttccacctccccaggagcctctggagccttcaaatgcccaggaattc ctagaactctcagcagcccaggagtccctggagggtctaatagttgtggagacatcagca gcttcagagtttccacaggcccctattgggttagaggctacagatttccccctgcaatac actttaaccttcagtggagattcccagaagcttcctgagttcctggttcagctgtatagt tacatgagagtcagagggcacctgtatcccactgaagcagccctggtgagctttgttggc aattgcttctcaggtagggcaggatggtggttccagctcttactggatatccaaagcccc ttactggagcaatgtgaaagtttcatacctgtgctccaggatacttttgataatccagaa aacatgaaagatgccaaccagtgcatccatcaactctgccaaggggagggtcatgtagca acccacttccacctcattgctcaagagctgaactgggatgaaagcactctctggatccaa tttcaagaaggtcttgccagttctatacaagatgaactgtctcacacaagcccagccacc aacctatctgatctcatcactcagtgcatcagcctggaagagaagcctgatcccaatcca ttagggaaaagttcctctgcagagggagatggacccgagagtccaccagctgaaaaccaa cctatgcaggctgcaatcaactgcccacacatcagtgaagctgaatgggtccgttggcac aaaggccgcttatgcctctactgtggttatcctggtcattttgccagagattgccctgtc aagcctcatcaggccctgcaggcgggaaacatccaggcctgccaaaacagcattgatcat actgtggttcttcgagagaagctgcccatccgcagtaatatcttccctctgatgctggaa actgtcgacggccatccacttattaatggacccataactaaggaaacatcacctgtccaa gttcaaattggaaaccatgttgaagagctccagtttgacattattcatgcaccacgatac cctctgattattggaatccattggcttgagacacatgacccaaacatagaatggagtacc cgcactgtgtcctttctatcacgttattgtcactacaattgcttcaggcacaggtggaat agaagaaatcgtgatgaaataatttctgggtag >gi568815575f:78654870_78855979|GENSCAN_predicted_peptide_2|242_aa MAVTLELGNGQRLEVLEGSEGRKMREHLKLPGDWLNGCDQNSDSHMDFTKSRLLRRSQME IKNCLATEAKSARALGTHKQLPESGESRACVRSSAVLASGLNQQSHSAGDHRGACVIPPP ALGTSEQRDSLCDLMQVTNKLVLQCTHPSNKNNNYTYIIIQVQVTIASMTKFLNSFSWIA VDFPTLCGTKNKTIEAYLSTQLNATVISTIDKIVKDAYKTQKERGLSMVTQKFESRVESG LA >gi568815575f:78654870_78855979|GENSCAN_predicted_CDS_2|729_bp atggcagtgactttggaactgggtaatgggcagagattggaagtgttggaaggctcagag ggcaggaagatgagagaacatttgaaacttcctggagattggttaaatggttgtgaccaa aattcagatagtcatatggacttcaccaagtccaggctgttgaggaggtctcaaatggaa ataaagaactgtttggcaactgaagcaaagagtgccagggccttgggcacacataagcaa ttgccagagagtggtgagagcagggcttgtgtgagatccagtgctgtgctggcttcaggt ctgaaccagcagagtcacagtgctggtgatcacaggggtgcttgtgtcattccaccccca gctttaggtacctcagaacagagagactcactgtgtgaccttatgcaagtcactaacaaa ttggtgcttcaatgtactcatccatcaaataagaataataattatacttacatcataata caagttcaggtgactattgcctctatgaccaaatttctgaactcttttagctggatagct gttgattttccaacattgtgtggaactaagaacaaaacaattgaggcatacctgagcacg cagctgaatgctactgtcatatcaaccatagacaaaattgttaaagacgcatataagact cagaaagaaaggggcttatccatggtcacacagaaatttgagagcagagttgaaagcggc ttagcttaa >gi568815575f:78654870_78855979|GENSCAN_predicted_peptide_3|370_aa MGDRRFIDFQFQDSNSSLRPRLGNATANNTCIVDDSFKYNLNGAVYSVVFILGLITNSVS LFVFCFRMKMRSETAIFITNLAVSDLLFVCTLPFKIFYNFNRHWPFGDTLCKISGTAFLT NIYGSMLFLTCISVDRFLAIVYPFRSRTIRTRRNSAIVCAGVWILVLSGGISASLFSTTN VNNATTTCFEGFSKRVWKTYLSKITIFIEVVGFIIPLILNVSCSSVVLRTLRKPATLSQI GTNKKKVLKMITVHMAVFVVCFVPYNSVLFLYALVRSQAITNCFLERFAKIMYPITLCLA TLNCCFDPFIYYFTLESFQKSFYINAHIRMESLFKTETPLTTKPSLPAIQEEVSDQTTNN GGELMLESTF >gi568815575f:78654870_78855979|GENSCAN_predicted_CDS_3|1113_bp atgggtgacagaagattcattgacttccaattccaagattcaaattcaagcctcagaccc aggttgggcaatgctactgccaataatacttgcattgttgatgattccttcaagtataat ctcaatggtgctgtctacagtgttgtattcatcttgggtctgataaccaacagtgtctct ctgtttgtcttctgtttccgcatgaaaatgagaagtgagactgctatttttatcaccaat ctagctgtctctgatttgctttttgtctgtacactaccttttaaaatattttacaacttc aaccgccactggccttttggtgacaccctctgcaagatctctggaactgcattccttacc aacatctatgggagcatgctctttctcacctgtattagtgtggatcgtttcctggccatt gtctatccttttcgatctcgtactattaggactaggaggaattctgccattgtgtgtgct ggtgtctggatcctagtcctcagtggcggtatttcagcctctttgttttccaccactaat gtcaacaatgcaaccaccacctgctttgaaggcttctccaaacgtgtctggaagacttat ttatccaagatcacaatatttattgaagttgttgggtttatcattcctctaatattgaat gtctcttgctcttctgtggtgctgagaactcttcgcaagcctgctactctgtctcaaatt gggaccaataagaaaaaagtactgaaaatgatcacagtacatatggcagtctttgtggta tgctttgtaccctacaactctgtcctcttcttgtatgccctggtgcgctcccaagctatt actaattgctttttggaaagatttgcaaagatcatgtacccaatcaccttgtgccttgca actctgaactgttgttttgaccctttcatctattacttcacccttgaatcctttcagaag tccttctacatcaatgcccacatcagaatggagtccctgtttaagactgaaacacctttg accacaaagccttcccttccagctattcaagaggaagtgagtgatcaaacaacaaataat ggtggtgaattaatgctagaatccaccttttag >gi568815575f:78654870_78855979|GENSCAN_predicted_peptide_4|364_aa MTLNEHAAFKHLFNKAHLAPPLIHLTLSGHITCFREHRVGVIQLTRDVKDLFKENYKPLL NEIKEDTNKWKNIPCSCVGRINIMKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIW DQKRACITKSILSQKNKARGIMLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTGPSEIMR HIYNHLIFDKPEKNKQWGKDSLFNKWCRENWLAVCRKLKLDPFLTPYTKINSRWIKDLHV RPKTIKTLEENLGNTIQDIGMGKDFMSKTPKAMATKVKIDKWDLIKLKSFCTAKETTIRV KRQPTKWEKIFTTYSSDKGLISRIYNELKQIYKKKTNNLIKKWPKDMGRAQIQQLLAILG SKIR >gi568815575f:78654870_78855979|GENSCAN_predicted_CDS_4|1095_bp atgactcttaatgagcatgctgccttcaagcatctgtttaacaaagcacatcttgcaccg cccttaatccatttaaccctgagtggacacatcacatgtttcagagagcacagggttggg gtaatccaacttacaagggatgtgaaggacctcttcaaggagaactacaaaccactgctc aatgaaataaaagaggatacaaacaaatggaagaacattccatgctcatgtgtaggaaga atcaatatcatgaaaatggccatactgcccaaggtaatttatagattcaatgccatcccc atcaagctaccaatgactttcttcacagaattggaaaaaactactttaaagttcatatgg gaccaaaaaagagcctgcatcaccaagtcaatcctaagccaaaagaacaaagccagaggc atcatgctacctgacttcaaactatactacaaggctacagtaaccaaaacagcatggtac tggtaccaaaacagagatatagatcaatggaacagaacagggccctcagaaataatgcgg catatctacaaccatctgatctttgacaaacctgagaaaaacaagcaatggggaaaggat tccctatttaataaatggtgccgggaaaactggctagccgtatgtagaaagctgaaactg gatcccttccttacaccttacacaaaaattaattcaagatggattaaagacttacatgtt agacctaaaaccataaaaaccctagaagaaaacctaggcaataccattcaggacataggc atgggcaaggacttcatgtctaaaacaccaaaagcaatggcaacaaaagtcaaaattgac aaatgggatctaattaaactaaagagcttctgcacagcaaaagaaactaccatcagagtg aaaaggcaacctacaaaatgggagaaaattttcacaacctactcatctgacaaagggcta atatccagaatctacaatgaactcaaacaaatttacaagaaaaaaacaaacaacctcatc aaaaagtggccgaaagatatgggcagggcacaaattcagcagctcctggctatattaggc tccaaaattcgataa >gi568815575f:78654870_78855979|GENSCAN_predicted_peptide_5|245_aa MDPKSKDTKSTKLFEQVNEQKQDAKPQSFNVRIVLQLQLWGWYCELIKVTKGKSITLQDS EDAPNTVLVSTAERPTDGSHHRTPYRQPPESAQSLVDLLEPTQMRRNQKINSGNVTKQRS LTPPKNYTSSPAMDPNREEIPDLREKNSETLEVLARSLRQEKEIKGIQIIKEEDKLLLFP DDIIVYLGHLKDSSETPLELIKEFSKVSGYKINVYKSGALLYTNSDQAENQIKNSTPFTI AAKIK >gi568815575f:78654870_78855979|GENSCAN_predicted_CDS_5|738_bp atggatcccaaaagcaaggacacaaagtcaactaaattgtttgagcaagtgaatgaacag aagcaagatgctaaaccccagagtttcaatgtgaggatagtgcttcagctgcagctctgg ggctggtactgtgaactaatcaaagtgaccaaaggcaaaagcatcactcttcaagactca gaagatgccccaaatactgtgctggtatccacagctgagagacccacagatggttcacat cacaggactccatacagacagcccccagaatcagctcagagcctggtagacctgctggag cctacccaaatgagaaggaaccagaaaatcaactctggtaatgtgaccaaacaacgttct ttaacaccccccaaaaattacactagctcaccagcaatggatccaaaccgagaagaaata cctgatttacgtgaaaaaaattcagaaacactggaagtcttagccagatcactcagacag gagaaagaaatcaagggcatccaaattattaaagaagaagacaaactgctgctgtttcct gatgatattattgtatacctaggacaccttaaagactcctccgaaacgcccctagaactg ataaaagaattcagcaaagtttctggatacaaaattaatgtatacaaatcaggagctctc ctatacaccaacagtgaccaagctgagaatcaaatcaagaactcaacgccttttacaata gctgcaaaaataaaataa >gi568815575f:78654870_78855979|GENSCAN_predicted_peptide_6|223_aa MDQAQAPRPAVCSLGTPSCILATQAMAKRGKGTAWAVASEGASLKPWQLLSGVEPAAAQK SRIEVWKPPPRFQSIYGNAWMSRQKFTAGADPSWRTSARAVQKGNVGLEPSHKVPTGALP SKAVRRRPPSSKQQNGRSTNSLHYAPGKAADTQHQPMKTEERVAIHCKATGMKLPKAVGA HLLQQCDLDVRDKIKGDLFGTLRFNDCPFGFLTCMGPVALSFF >gi568815575f:78654870_78855979|GENSCAN_predicted_CDS_6|672_bp atggaccaggcccaggccccccggcctgctgtgtgcagcctagggactccatcctgcatc ctagccactcaagccatggctaaaaggggcaaaggtacagcttgggctgtggcttcagag ggtgcaagcctcaagccttggcagcttttaagtggtgttgagcctgcagctgcacagaag tcaagaattgaggtctggaaacctccacctagatttcagagtatatatggaaatgcctgg atgtccaggcagaaatttactgcaggggcagacccctcatggagaacctctgctagggca gtgcagaagggaaatgtggggttggaaccctcacacaaagtccccactggggcactgcct agtaaagctgtgagaagaagaccaccatcctccaaacaacagaatggtagatccaccaac agcttgcactatgcacctggaaaagctgcagacactcaacaccagcccatgaaaacagag gagagggtggctatacactgcaaagccacagggatgaagctgcccaaggctgtgggagcc cacctcttgcaacagtgtgacctggatgtgagagataaaataaaaggagatctttttgga actttaaggtttaatgactgcccttttggatttctgacttgcatggggcctgtagctctt tcttttttttaa >gi568815575f:78654870_78855979|GENSCAN_predicted_peptide_7|334_aa MDRAINMSSINKVKSPSNNVKIQPPGIPEFKYLVTLWCSIDGSGSLTLHRAGGSQEQAGA LPLSEMTGWEHCSPRHSCSCRAMAMNPHIPMILGAQEVPLPPQVQKCLLPLTGLSLLLVP APGWSKVVVKPECCHNPACTLLGLENLEEMDEFLDTYTLPRLNQEEVESLNRPITGSEIV AIINSLPTKKSPGPDGFADKFYQKYKEELVTLLLKLFQSIEKEGLLPNSFYEASIILIPK PGRDTTKKENFRPMSLMNIDAKILNKILANRIQQHIKKLIHHDQVDFIPVMQGWFSIHKS INVIQHINRTKDKNHMIISIDGERPLTKFNNPSC >gi568815575f:78654870_78855979|GENSCAN_predicted_CDS_7|1005_bp atggacagggccatcaacatgagtagcattaacaaggttaaatcccccagtaacaatgtc aaaatccaaccaccaggaattccagaatttaaatatttggtcactttatggtgttccatt gatggcagtggcagcctgacactccacagagctggtgggagccaggaacaggcaggagcc ctgcccctttctgagatgacaggatgggagcactgctctcctaggcacagctgcagctgc agagccatggctatgaacccacacatccctatgatcttaggagcccaggaagtccccctt cccccacaggttcagaagtgcctgctcccactgactggcctctccctgctcctggtgcct gctccagggtggagcaaagttgtggtcaagcctgagtgctgtcacaacccagcttgcaca ctgttgggtcttgaaaatctagaagaaatggatgaattccttgacacatacaccctccca agactaaaccaggaagaagttgaatctctgaatagaccaataacaggatctgaaattgtg gcaataatcaatagcttaccaaccaaaaagagtccaggtccagatggatttgcagacaaa ttctatcagaagtacaaggaggaactggtaacattacttctgaaactattccaatcaata gaaaaagaaggactcctccctaactcattttatgaggccagcatcatcctgataccaaag ccgggcagagacacaacaaaaaaagagaattttagaccaatgtccttgatgaacattgat gcaaaaatcctcaataaaatactggcaaaccgaatccagcagcacatcaaaaagcttatc caccatgatcaagtggacttcatccctgtgatgcaaggctggttcagtatacacaaatca ataaatgtaatccagcatataaacagaaccaaagacaaaaaccacatgattatctcaata gatggagaaaggcctttgacaaaattcaacaacccttcatgctaa >gi568815575f:78654870_78855979|GENSCAN_predicted_peptide_8|133_aa METKKMGLDIGGRRWVAKVSDPLICPLGSEKQQQTRVVISPRSVADLQMPSGNKGHESET LGIYLVHYASVAELSPKPQVKVFPIFSPVSSSIEVYSLSHHHPWLTASTAWLPLMVKQSP RVLLSGWSKCCQP >gi568815575f:78654870_78855979|GENSCAN_predicted_CDS_8|402_bp atggaaaccaaaaagatgggactcgatataggagggagaagatgggtagcaaaagtctca gatccccttatttgtcctctgggctctgaaaagcagcagcagacaagggtggtgatttcc cctcggtccgtggctgatctacagatgccatctgggaacaagggccatgagtctgaaacc ctaggaatctacctggttcactatgctagtgtggctgagctgtcacccaagccacaagtc aaagtctttcctattttctctcctgtttcctcaagcatagaagtttattccctcagtcac catcacccctggcttacagcaagtactgcctggctaccgctgatggtcaaacaaagccca agagttctcctgtcaggttggagtaaatgctgccagccctga >gi568815575f:78654870_78855979|GENSCAN_predicted_peptide_9|270_aa MSELPFTIASKTIKYLGIQLTRDVKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGRINIV KMVILPKGIYRFNAIPIKLPMTFFTELEKTTLKFIWHQKRARFIKSILSQKNKAGGITLP DFKLYCKAKVTKTAWYWYQNRDIDQWNETEPSEITLHIYNYLIFDKPEKNKKWGKDSLFS KWCWENWLARCRKLKLDPFLTPYTKIYSRWIKDLNVRPKTIKILEENLGITIQDIGMGKD FMPKTPKAMATKDKIDKWDLIQLKSFCTAK >gi568815575f:78654870_78855979|GENSCAN_predicted_CDS_9|813_bp atgagtgaactcccattcacaattgcttcaaagacaataaaatacctaggaatccaactt acaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaaggaaataaaa gaggatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcgtg aaaatggtcatactgcccaagggaatttatagattcaatgccatccccatcaagctacca atgactttcttcacagaattggaaaaaactactttaaagttcatatggcaccaaaaaaga gcccgcttcatcaagtcaatcctaagccaaaagaacaaagctggaggcatcacgctacct gacttcaaactatactgcaaggctaaagtaaccaaaacagcatggtactggtaccaaaac agagatatagatcaatggaacgaaacagagccctcagaaataacgctgcatatctacaac tatctgatctttgacaaacctgagaaaaacaagaaatggggaaaggattccctatttagt aaatggtgctgggaaaactggctagccagatgtagaaagctgaaactggatcccttcctt acaccttatacaaaaatctattcaagatggattaaagacttaaatgttagacctaaaacc ataaaaatcctagaagaaaacctaggcattaccattcaggacataggcatgggcaaggac ttcatgcctaaaacaccaaaagcaatggcaacaaaagacaaaattgacaaatgggatcta attcaactaaagagcttctgcacagcaaaataa