GENSCAN 1.0    Date run: 4-Nov-116    Time: 09:00:45

Sequence gi568815585r:27863118_28069006 : 205889 bp : 44.31% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  14350  14370   21  0  0   77  101     9 0.087   0.78
 1.02 Intr +  45023  45198  176  1  2   23   81   100 0.007   1.54
 1.03 Term +  45419  45626  208  2  1   56   39   141 0.008   2.81
 1.04 PlyA +  46902  46907    6                                1.05

 2.00 Prom +  56625  56664   40                               -3.66
 2.01 Init +  57022  57427  406  0  1   99  115   462 0.998  46.75
 2.02 Intr +  61139  61468  330  0  0  117   38   547 0.131  48.20
 2.03 Intr +  62796  62925  130  1  1   31   58    97 0.088   0.75
 2.04 Intr +  65908  65965   58  1  1   66  116    21 0.135   1.59
 2.05 Intr +  66990  67008   19  2  1  146   96    12 0.019   3.98
 2.06 Intr +  68895  68922   28  1  1  106   86    -3 0.009  -1.53
 2.07 Term +  82142  82298  157  2  1   79   33   178 0.142   8.81
 2.08 PlyA +  82430  82435    6                                1.05

 3.07 PlyA -  82878  82873    6                                1.05
 3.06 Term -  91192  91059  134  2  2   15   53   174 0.156   4.85
 3.05 Intr -  91504  91367  138  2  0   66   77    95 0.152   6.64
 3.04 Intr -  98283  98154  130  0  1   44   65   137 0.759   7.27
 3.03 Intr - 100252 100088  165  1  0  102   -4    84 0.174   0.76
 3.02 Intr - 101898 101753  146  1  2  134   75   321 0.875  35.40
 3.01 Init - 105889 105349  541  1  1   96  121   881 0.966  87.24
 3.00 Prom - 107104 107065   40                               -7.66

 4.00 Prom + 110075 110114   40                               -9.36
 4.01 Sngl + 112660 113031  372  0  0   82   43   195 0.592   8.66
 4.02 PlyA + 113078 113083    6                                1.05

 5.05 PlyA - 113738 113733    6                                1.05
 5.04 Term - 115335 114989  347  1  2  106   37   732 0.997  64.56
 5.03 Intr - 116182 116088   95  0  2   61   56    28 0.195  -3.49
 5.02 Intr - 117532 117352  181  2  1   92   80    51 0.143   3.63
 5.01 Init - 125520 125346  175  0  1   54  110   162 0.808  14.51
 5.00 Prom - 128424 128385   40                               -5.86

 6.20 PlyA - 129780 129775    6                                1.05
 6.19 Term - 137517 137472   46  2  1  134   44    90 0.750   5.88
 6.18 Intr - 152139 152040  100  1  1   79  101    83 0.950   7.87
 6.17 Intr - 152584 152473  112  1  1   94  111    74 0.998  10.25
 6.16 Intr - 155472 155350  123  0  0   77   54   140 0.969  10.28
 6.15 Intr - 160360 160233  128  2  2   60   76   105 0.971   6.90
 6.14 Intr - 165171 165061  111  1  0   90   91    69 0.990   7.75
 6.13 Intr - 170874 170770  105  1  0  106   87    63 0.982   8.19
 6.12 Intr - 171097 170965  133  1  1   54   91    93 0.986   6.42
 6.11 Intr - 171290 171172  119  0  2  102   93    16 0.620   3.68
 6.10 Intr - 172556 172378  179  1  2  108   87    16 0.906   3.06
 6.09 Intr - 172926 172818  109  1  1   94   29    81 0.958   2.14
 6.08 Intr - 174171 174068  104  2  2   70  100    54 0.943   4.62
 6.07 Intr - 178182 178075  108  2  0  130   36    21 0.569   0.50
 6.06 Intr - 185326 185158  169  2  1  114   94    59 0.984   8.10
 6.05 Intr - 186420 186267  154  0  1   56   63    72 0.321   1.15
 6.04 Intr - 186657 186518  140  1  2   80   75    29 0.223   0.98
 6.03 Intr - 194345 194230  116  1  2  100  100   -31 0.200  -0.61
 6.02 Intr - 194624 194592   33  1  0   82  100    22 0.490   0.14
 6.01 Intr - 198952 198750  203  1  2   82  111    97 0.964   9.48

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  61139  61584  446  0  2  117   54   589 0.865  53.80

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815585r:27863118_28069006|GENSCAN_predicted_peptide_1|134_aa
MQEQLGNHKYPQSNKQTILVNIYKGATVIKFQGVHSSSNQKFTSRDEKYKLGTDWFQVID
GLWKGSWLDPRMRKQRIQRAKLTFSTCDCSDFGVLGEWDGVLEPIPWGYGESAVPLLICW
KRTCFCRAPNDEEL
>gi568815585r:27863118_28069006|GENSCAN_predicted_CDS_1|405_bp
atgcaagaacagttgggtaatcacaaatatccccagtccaataaacaaacgatccttgtg
aatatttacaagggagcaactgttataaagttccagggagtgcacagtagcagcaaccaa
aaattcacaagtagagatgaaaagtacaagctgggcacagactggttccaggtgatagac
ggactgtggaagggaagttggctggatccgaggatgcggaagcagaggatacagagagcc
aagctcacgttttcaacctgcgactgctcggattttggtgttttgggggagtgggacggg
gtcctggaaccaatcccctggggatatggagagtcagccgtacctttgctcatctgttgg
aaaagaacgtgtttctgcagggcaccaaatgacgaagaactataa
>gi568815585r:27863118_28069006|GENSCAN_predicted_peptide_2|375_aa
MNGEEQYYAATQLYKDPCAFQRGPAPEFSASPPACLYMGRQPPPPPPHPFPGALGALEQG
SPPDISPYEVPPLADDPAVAHLHHHLPAQLALPHPPAGPFPEGAEPGVLEEPNRVQLPFP
WMKSTKAHAWKGQWAGGAYAAEPEENKRTRTAYTRAQLLELEKEFLFNKYISRPRRVELA
VMLNLTERHIKIWFQNRRMKWKKEEDKKRGGGTAVGGGGVAEPEQDCAVTSGEELLALPP
PPPPGGCGRNSCSEVEAVLKTSTVAWCALETNNYSRASMTFTSLEIMKTDRPSWSHKPSI
PLGIFNPKVWFVSEEFSMSLISKSMVAYWRQAGLSYIRYSQICAKVVRDALKTEFKANAK
KTSGNSVKIVKVKKE
>gi568815585r:27863118_28069006|GENSCAN_predicted_CDS_2|1128_bp
atgaacggcgaggagcagtactacgcggccacgcagctttacaaggacccatgcgcgttc
cagcgaggcccggcgccggagttcagcgccagcccccctgcgtgcctgtacatgggccgc
cagcccccgccgccgccgccgcacccgttccctggcgccctgggcgcgctggagcagggc
agccccccggacatctccccgtacgaggtgccccccctcgccgacgaccccgcggtggcg
caccttcaccaccacctcccggctcagctcgcgctcccccacccgcccgccgggcccttc
ccggagggagccgagccgggcgtcctggaggagcccaaccgcgtccagctgcctttccca
tggatgaagtctaccaaagctcacgcgtggaaaggccagtgggcaggcggcgcctacgct
gcggagccggaggagaacaagcggacgcgcacggcctacacgcgcgcacagctgctagag
ctggagaaggagttcctattcaacaagtacatctcacggccgcgccgggtggagctggct
gtcatgttgaacttgaccgagagacacatcaagatctggttccaaaaccgccgcatgaag
tggaaaaaggaggaggacaagaagcgcggcggcgggacagctgtcgggggtggcggggtc
gcggagcctgagcaggactgcgccgtgacctccggcgaggagcttctggcgctgccgccg
ccgccgccccccggaggatgtggacgtaattcctgttccgaggtagaggctgtgctgaag
acaagcacagtggcctggtgcgccttggaaaccaacaactattcacgagccagtatgacc
ttcacatctttagaaattatgaaaacagaccgtccgagctggagccacaagccctccatt
cctcttggaatcttcaaccccaaggtgtggtttgtgtctgaggaattctcaatgagcctg
atttccaaaagcatggtggcctactggagacaggctggactcagctacatccgatactcc
cagatctgtgcaaaagtagtgagagatgcactgaagacagaattcaaagcaaatgccaaa
aagacttctggcaacagcgtaaaaattgtgaaagtaaagaaggaataa
>gi568815585r:27863118_28069006|GENSCAN_predicted_peptide_3|417_aa
MYVSYLLDKDVSMYPSSVRHSGGLNLAPQNFVSPPQYPDYGGYHVAAAAAAAANLDSAQS
PGPSWPAAYGAPLREDWNGYAPGGAAAAANAVAHGLNGGSPAAAMGYSSPADYHPHHHPH
HHPHHPAAAPSCASGLLQTLNPGPPGPAATAAAEQLSPGGQRRNLCEWMRKPAQQSLGSQ
VKTRTKDKYRVVYTDHQRLELEKEFHYSRYITIRRKAELAATLGLSERQVKIWFQNRRAK
ERKINKKKLQQQQQQQPPQPPPPPPQPPQPQPGPLRSVPEPLSPDPGEGGLALRFDLPTN
IREPNQRAHGGALRAVGEDEQSRETARAGARGLVSRCSRPRRPVSACCRSLLGLRRCPGF
PGEIRGVSREVARDAKLFKRCAQLRRGGAGNLGRKRRSHSVKKAWSTGLRPVVEDLG
>gi568815585r:27863118_28069006|GENSCAN_predicted_CDS_3|1254_bp
atgtacgtgagctacctcctggacaaggacgtgagcatgtaccctagctccgtgcgccac
tctggcggcctcaacctggcgccgcagaacttcgtcagccccccgcagtacccggactac
ggcggttaccacgtggcggccgcagctgcagcggcagcgaacttggacagcgcgcagtcc
ccggggccatcctggccggcagcgtatggcgccccactccgggaggactggaatggctac
gcgcccggaggcgccgcggccgccgccaacgccgtggctcacggcctcaacggtggctcc
ccggccgcagccatgggctacagcagccccgcagactaccatccgcaccaccacccgcat
caccacccgcaccacccggccgccgcgccttcctgcgcttctgggctgctgcaaacgctc
aaccccggccctcctgggcccgccgccaccgctgccgccgagcagctgtctcccggcggc
cagcggcggaacctgtgcgagtggatgcggaagccggcgcagcagtccctcggcagccaa
gtgaaaaccaggacgaaagacaaatatcgagtggtgtacacggaccaccagcggctggag
ctggagaaggagtttcactacagtcgctacatcaccatccggaggaaagccgagctagcc
gccacgctggggctctctgagaggcaggttaaaatctggtttcagaaccgcagagcaaag
gagaggaaaatcaacaagaagaagttgcagcagcaacagcagcagcagccaccacagccg
cctccgccgccaccacagcctccccagcctcagccaggtcctctgagaagtgtcccagag
cccttgagtccggacccgggcgaggggggcttagcccttcgtttcgatcttcccaccaac
atccgagagcctaatcagcgcgcccacggaggcgccttaagggcagttggggaagatgag
cagagccgggaaacagcaagagcgggcgcccgggggctcgtgtcccgctgctctcgccca
agacggccggtctcggcctgctgccggtccttgctgggtctgcgccgctgcccgggattc
cctggagagattcgcggcgtctcccgagaggtggcgcgcgacgccaagcttttcaaaagg
tgcgcgcaactacggcgcggaggtgcggggaacctgggccgcaagcgccgaagccactcg
gtgaagaaggcctggagcactggccttcgacccgtcgtcgaagacctgggctga
>gi568815585r:27863118_28069006|GENSCAN_predicted_peptide_4|123_aa
MIAAQARVHAAGAPLPPRAHQRPESGKRLCQAKADRNREHVAICRRELPCQAATGDLPAR
ATASAQSSSRRGESEPPAGLVGALPSPASTPRVLRSPLRLLMLRCFGRGAAQPALCPFSS
SSF
>gi568815585r:27863118_28069006|GENSCAN_predicted_CDS_4|372_bp
atgattgccgcgcaagctagggtccacgcggccggggctcctctcccgcctcgggcgcac
cagaggccagaaagcggaaagcggctctgccaggccaaggccgacagaaaccgagagcat
gtcgctatttgccggagagaactgccctgccaggctgccacaggtgacttgcccgcgcga
gccacggcctctgcgcagagctcctcccgaaggggagagtcagagccacccgcgggtttg
gtcggggctcttcccagcccggcgtccacgccccgcgtcctgcgctcgcccttgcggctg
cttatgctcaggtgtttcggaagaggcgccgcgcagccagctctctgtcccttcagctcc
agctccttctaa
>gi568815585r:27863118_28069006|GENSCAN_predicted_peptide_5|265_aa
MDIEKVNSMDLGEFVDVFGNATERCPLIAAAVWSQRPFSDLEDLEKHFFAFIDALAQSAW
ADPPDSMQRWGGKSPDWAGDLAAALKLRTPEEEARKVQPPKTRTFPSSAARAGYRPGPGP
RRLCKDFLQVNAVLWGLWDLQSEKGNKSRAGQEGILRCHPDLAGSELQRGTLTAESQREQ
SGAGLRSLGADERLRLAELNAQYRARFGFPFVLAARFSDRTAVPRELARRLLCPSAQELR
TALGEVKKIGSLRLADLLRADPAKL
>gi568815585r:27863118_28069006|GENSCAN_predicted_CDS_5|798_bp
atggacattgagaaggtcaactccatggaccttggagaattcgtggatgtgtttgggaat
gccactgagagatgtcctctgattgcagctgctgtttggtcccagcggccattctctgat
ttggaagatttagagaagcacttttttgcctttattgatgcccttgcacagtcagcttgg
gctgatcctccagacagcatgcaacggtggggagggaagtcccctgactgggcgggggac
ctagcggctgctctgaaactccgaacacctgaagaggaggcgcggaaggtccagccgccc
aagactcgcactttcccctcctccgcagcccgggcaggttaccgtcctgggcctgggcct
aggagattatgcaaagatttccttcaagtaaacgctgttctctggggcctctgggatcta
cagtcggagaaggggaataagtcccgggccggccaggagggcatcctgcgctgccacccg
gacctggcgggcagcgagctgcagcggggcacgctcacggccgagtcgcagcgggaacag
agcggcgcaggcctgaggagcctgggcgcggacgagcggctgcggctggccgagctcaac
gcgcagtaccgcgcgcgcttcggtttccccttcgtgctcgccgcgcgcttcagcgaccgg
acggcggtgccgcgcgagctggcgcgccggctgctctgcccgtccgcgcaggagctgcgc
actgctctgggcgaggtgaagaagatcggcagcctgcgcctggccgacctcctccgcgca
gaccccgccaagctgtag
>gi568815585r:27863118_28069006|GENSCAN_predicted_peptide_6|763_aa
VSESPEDLGCALRPQSSGTVYEAAAVEVDVSASITLQVLVDAPGNISCLWVFKHSSLNCQ
PHFDLQNRWEFSILEAQRRGVVSMVILKMTETQAGEYLLFIQSEATNYTILFTVSIRNLN
QTPQTTLPQLFLKVGEPLWIRCKAVHVNHGFGLTWELENKALEEGNYFEMSTYSTNRTMI
RILFAFVSSVARNDTGYYTCSSSKHPSQSALVTIVEKGFINATNSSEDYEIDQYEEFCFS
VRFKAYPQIRCTWTFSRKSFPCEQKGLDNGYRVFGHCPERITLSLLEAHVFLLLGHFHAS
LFHTAFLSISKFCNHKHQPGEYIFHAENDDAQFTKMFTLNIRRKPQVLAEASASQASCFS
DGYPLPSWTWKKCSDKSPNCTEEITEGVWNRKANRKVFGQWVSSSTLNMSEAIKGFLVKC
CAYNSLGTSCETILLNSPGPFPFIQDNISFYATIGVCLLFIVVLTLLICHKYKKVKAKQF
RYESQLQMVQVTGSSDNEYFYVDFREYEYDLKWEFPRENLEFGKVLGSGAFGKVMNATAY
GISKTGVSIQVAVKMLKEKADSSEREALMSELKMMTQLGSHENIVNLLGACTLSDEIEYE
NQKRLEEEEDLNVLTFEDLLCFAYQVAKGMEFLEFKSCVHRDLAARNVLVTHGKVVKICD
FGLARDIMSDSNYVVRGNARLPVKWMAPESLFEGIYTIKSDVWSYGILLWEIFSLGVNPY
PGIPVDANFYKLIQNGFKMDQPFYATEEMSSPVSEPMSDGFLH
>gi568815585r:27863118_28069006|GENSCAN_predicted_CDS_6|2292_bp
gtatcagaatccccggaagacctcgggtgtgcgttgagaccccagagctcagggacagtg
tacgaagctgccgctgtggaagtggatgtatctgcttccatcacactgcaagtgctggtc
gacgccccagggaacatttcctgtctctgggtctttaagcacagctccctgaattgccag
ccacattttgatttacaaaacagatgggagttttcgatcctagaagctcagagaagagga
gttgtttccatggtcattttgaaaatgacagaaacccaagctggagaatacctacttttt
attcagagtgaagctaccaattacacaatattgtttacagtgagtataagaaatctaaat
caaactcctcagaccacattgccacaattatttcttaaagtaggggaacccttatggata
aggtgcaaagctgttcatgtgaaccatggattcgggctcacctgggaattagaaaacaaa
gcactcgaggagggcaactactttgagatgagtacctattcaacaaacagaactatgata
cggattctgtttgcttttgtatcatcagtggcaagaaacgacaccggatactacacttgt
tcctcttcaaagcatcccagtcaatcagctttggttaccatcgtagaaaagggatttata
aatgctaccaattcaagtgaagattatgaaattgaccaatatgaagagttttgtttttct
gtcaggtttaaagcctacccacaaatcagatgtacgtggaccttctctcgaaaatcattt
ccttgtgagcaaaagggtcttgataacggatacagggtctttggccattgtccagagaga
attacactttctctcttagaagctcacgtctttcttctcttgggccacttccacgccagc
ctcttccacactgccttcctgagcatatccaagttttgcaatcataagcaccagccagga
gaatatatattccatgcagaaaatgatgatgcccaatttaccaaaatgttcacgctgaat
ataagaaggaaacctcaagtgctcgcagaagcatcggcaagtcaggcgtcctgtttctcg
gatggatacccattaccatcttggacctggaagaagtgttcagacaagtctcccaactgc
acagaagagatcacagaaggagtctggaatagaaaggctaacagaaaagtgtttggacag
tgggtgtcgagcagtactctaaacatgagtgaagccataaaagggttcctggtcaagtgc
tgtgcatacaattcccttggcacatcttgtgagacgatccttttaaactctccaggcccc
ttccctttcatccaagacaacatctcattctatgcaacaattggtgtttgtctcctcttc
attgtcgttttaaccctgctaatttgtcacaagtacaaaaaggtaaaagcaaagcaattt
aggtatgaaagccagctacagatggtacaggtgaccggctcctcagataatgagtacttc
tacgttgatttcagagaatatgaatatgatctcaaatgggagtttccaagagaaaattta
gagtttgggaaggtactaggatcaggtgcttttggaaaagtgatgaacgcaacagcttat
ggaattagcaaaacaggagtctcaatccaggttgccgtcaaaatgctgaaagaaaaagca
gacagctctgaaagagaggcactcatgtcagaactcaagatgatgacccagctgggaagc
cacgagaatattgtgaacctgctgggggcgtgcacactgtcagatgaaattgaatatgaa
aaccaaaaaaggctggaagaagaggaggacttgaatgtgcttacatttgaagatcttctt
tgctttgcatatcaagttgccaaaggaatggaatttctggaatttaagtcgtgtgttcac
agagacctggccgccaggaacgtgcttgtcacccacgggaaagtggtgaagatatgtgac
tttggattggctcgagatatcatgagtgattccaactatgttgtcaggggcaatgcccgt
ctgcctgtaaaatggatggcccccgaaagcctgtttgaaggcatctacaccattaagagt
gatgtctggtcatatggaatattactgtgggaaatcttctcacttggtgtgaatccttac
cctggcattccggttgatgctaacttctacaaactgattcaaaatggatttaaaatggat
cagccattttatgctacagaagaaatgtcttcgccagtttccgagcccatgagtgacggc
ttcctgcactga