GENSCAN 1.0	Date run:  7-Nov-116	Time: 23:24:09

Sequence gi568815596f:68057935_68274799 : 216865 bp : 38.95% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +    603    755  153  2  0   36   76    85 0.081   1.45
 1.02 Intr +   6669   6831  163  2  1   76   71    99 0.175   5.63
 1.03 Intr +   9140   9367  228  0  0   47   72   137 0.078   4.92
 1.04 Term +  10024  11114 1091  2  2   40   47   239 0.105   6.26
 1.05 PlyA +  11951  11956    6                               1.05

 2.00 Prom +  16790  16829   40                              -4.95
 2.01 Init +  19554  19625   72  2  0  102   67    18 0.566   2.22
 2.02 Intr +  19758  19889  132  2  0   78   89    68 0.548   5.82
 2.03 Intr +  25334  25391   58  1  1  108  101    13 0.154   2.14
 2.04 Intr +  34574  34787  214  0  1  120   43    59 0.352   1.45
 2.05 Intr +  34965  35135  171  0  0   82   76    40 0.298   0.34
 2.06 Intr +  36849  37035  187  0  1   47   98   124 0.571   8.07
 2.07 Term +  37118  37198   81  1  0   95   48    64 0.575  -0.19
 2.08 PlyA +  37884  37889    6                               1.05

 3.20 PlyA -  38179  38174    6                               1.05
 3.19 Term -  45878  45729  150  2  0    1   49   135 0.196  -2.17
 3.18 Intr -  52496  52393  104  0  2   58  100    97 0.353   6.87
 3.17 Intr -  53553  53482   72  1  0  107   73    66 0.536   5.46
 3.16 Intr -  54763  54619  145  1  1   28   60   144 0.539   4.53
 3.15 Intr -  56461  56373   89  2  2   45   52    97 0.296   0.57
 3.14 Intr -  58495  58395  101  0  2   70   69    55 0.433   0.63
 3.13 Intr -  59009  58878  132  1  0    5   56   152 0.302   2.54
 3.12 Intr -  61758  61652  107  0  2   86   50    84 0.105   2.49
 3.11 Intr -  73511  73343  169  1  1   91   93   152 0.314  15.03
 3.10 Intr -  76865  76768   98  2  2   89   86    52 0.966   2.99
 3.09 Intr -  79499  79365  135  2  0  104   94    55 0.991   7.54
 3.08 Intr -  80923  80808  116  2  2   62   55    88 0.988   2.15
 3.07 Intr -  83861  83760  102  0  0   57   27   116 0.731   1.63
 3.06 Intr -  86781  86651  131  2  2   41  101   137 0.944   9.72
 3.05 Intr -  89633  89533  101  2  2   48   95    81 0.547   2.79
 3.04 Intr -  99567  99327  241  2  1   44   90   336 0.019  26.03
 3.03 Intr -  99760  99576  185  1  2   36  -33   221 0.034   2.46
 3.02 Intr - 100223  99862  362  0  2   94   56   245 0.029  15.91
 3.01 Init - 108948 108819  130  0  1   62    4   192 0.121   8.56
 3.00 Prom - 120819 120780   40                              -3.65

 4.12 PlyA - 120942 120937    6                               1.05
 4.11 Term - 123076 123029   48  1  0  119   41    38 0.437  -1.47
 4.10 Intr - 128718 128534  185  1  2   85   87   193 0.932  17.49
 4.09 Intr - 129380 129321   60  0  0   58   78    77 0.737   1.49
 4.08 Intr - 130756 130580  177  2  0   89  116   205 0.998  22.37
 4.07 Intr - 159197 159158   40  2  1  137   93    -4 0.003   1.88
 4.06 Intr - 170067 169972   96  0  0   65   60   116 0.742   5.79
 4.05 Intr - 186940 186852   89  2  2   86   73   112 0.796   8.27
 4.04 Intr - 194292 194191  102  1  0   -4   81   146 0.240   3.93
 4.03 Intr - 194752 194563  190  1  1   42   73    82 0.055   0.34
 4.02 Intr - 195203 195054  150  2  0   81    4   116 0.031   1.94
 4.01 Init - 203296 203255   42  1  0   42  111    31 0.496   1.27

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr + 100446 100595  150  2  0   97   86    80 0.916   8.14
S.002 Intr + 103749 103832   84  2  0   80  105    84 0.919   8.40
S.003 Intr + 104331 104391   61  2  1   67  101    37 0.883   0.29
S.004 Term + 104612 104733  122  0  2   77   42    70 0.842  -1.04

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596f:68057935_68274799|GENSCAN_predicted_peptide_1|544_aa
MAQQKQVTGLVSGKVRQNGTQIEWRRNIPHKSHQWICVGCSLDKIFNTTYKPPMVTPRQT
GSRVDLQQTPTDLQLRVLTVRRKTNKKKRHPHQKPICTSPSSKTKEIQTTIREYYKHLYT
NKLENLEEMDKFLDTYTLPRLNQEEVESLNRPITVSETEAIINSQPKKVQDQTDSQPNST
RDEMIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFT
IASKRIKYLGIHLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWIGTINNVKMAILPK
VIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRAHVAKTILSQKNKAGGITLPDFKLYYK
ATVTKTAWYWYQNRYIDQWNRIGPLEIIPHIYNHLIFDKPEKNKKWGKDSLFNKWCWENW
LAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKPLEENLGNTIQAIGMGKDFMTKTPK
AMATKAKIDKWDLIKLKSFCTAKEIIIRVNRQPTEWEKNFTIYPSDKRPLSRIYKELKQI
YKKK

>gi568815596f:68057935_68274799|GENSCAN_predicted_CDS_1|1635_bp
atggcacaacagaaacaggtcacagggctggtaagtggtaaagttcggcagaatggaact
caaatcgaatggaggaggaacatccctcacaaatcccatcagtggatttgtgtgggatgc
tctctggacaaaatctttaatacaacatataagcctccgatggtgacacccaggcaaaca
gggtctagagtggaccttcagcaaactccaacagacctgcagctgagggtcctgactgtt
agaaggaaaactaacaaaaagaaaagacatccacaccaaaaacccatctgtacgtcacca
tcatcaaagaccaaagaaatacaaactaccatcagagaatactataaacacctctacaca
aataaactagaaaatctagaagaaatggataaattcctcgacacatacaccctcccaaga
ctaaaccaggaagaagttgaatccctgaatagaccaataacagtttctgaaactgaggca
ataattaatagccaaccaaaaaaagtccaggaccagacggattcacagccgaattctacc
agagatgaaatgattgtatatttagaaaaccccatagtctcagcccaaaatctccttaag
ctgataagcaactttagcaaagtctctggatacaaaatcaatgtgcaaaaatcacaagca
ttcctatacaccaataacagacaaacagagagtcaaatcatgagtgaactcccattcaca
attgcttcaaagagaataaaatacctaggaatccatcttacaagggatgtgaaggacctc
ttcaaggagaactacaaaccactgctcaatgaaataaaagaggatacaaacaaatggaag
aatattccatgctcatggataggaacaatcaataatgtgaaaatggccatactgcccaag
gtaatttatagattcaatgccatccccatcaagctaccaatgactttcttcacagaattg
gaaaaaactactctaaagttcatatggaaccaaaaaagagcccacgttgccaagacaatc
ctaagccaaaagaacaaagctggaggcatcacgctacctgacttcaaactatactacaag
gctacagtaaccaaaacagcatggtactggtaccaaaacagatatatagaccaatggaac
agaatagggcccctggaaataataccacacatctacaaccatctgatctttgacaaacct
gagaaaaacaagaaatggggaaaggattctctatttaataaatggtgctgggaaaactgg
ctagccatatgtagaaagctgaaactggatcccttccttacaccttatacaaaaattaat
tcgagatggattaaagacttaaatgttagacctaaaaccataaaacccctagaagaaaac
ctaggcaataccattcaggccataggcatgggcaaagacttcatgactaaaacaccaaaa
gcaatggcaacaaaagccaaaattgacaaatgggatctaattaaactaaagagcttctgc
acagcaaaagaaattatcatcagagtgaacaggcaacctacagaatgggagaaaaatttt
acaatctacccatctgacaaaaggccactatccagaatctacaaagaacttaaacaaatt
tacaagaaaaaatga

>gi568815596f:68057935_68274799|GENSCAN_predicted_peptide_2|304_aa
MTLNSQLTHGPFPRYMSEPSQDQQFCRIEVQDLVAVMLVSDEASLPGLQMATFPPCPPHM
ASPLHMHRMGPSSYRKTISGLSLILRYGKNNVCESLIFKIMSLDIIPVRVVYRKQQKITL
ADLSREKHVLLCCSIAQGISTVLEPGLEKEARQGKLGSSSQFKGNSRYIHLVKPGSHTCI
LAAREAKKCVEFSAPKVGVKQLKKERRFPLRCDKRSCQEPRASSLVSSSRNLRESRRDPG
HWHWQNVAGVAACSTEVAWRVRPIRKSCGWKEFVHKGQQANAHEASDYEVGQIELTQFFS
SLQA

>gi568815596f:68057935_68274799|GENSCAN_predicted_CDS_2|915_bp
atgaccctcaactcacagctgactcatggcccattccctaggtacatgagtgaacccagc
caagatcagcagttctgtaggatagaagttcaagatctagttgccgtcatgttggtttct
gatgaggcctctcttcctggcttgcagatggccaccttcccaccatgtcctcctcacatg
gcctctcctctgcacatgcacagaatgggaccatctagttacaggaaaacaatctcaggg
ctctcactgattctacgttatggtaagaataatgtatgtgagagtcttatattcaagata
atgagtttggatattataccagtcagagttgtttaccgcaagcaacagaaaataactcta
gctgatttaagcagagaaaagcatgtgttgctgtgttgttcaatagctcaaggcatttca
acagtgttggaaccaggcttagaaaaggaggccagacaagggaagctaggcagcagctcc
cagttcaaaggcaatagcaggtacatccatttagtcaagcctgggtcacatacctgcatc
ctagctgcaagggaggctaaaaaatgtgtggagttttcagctcctaaagtgggagtcaag
cagctaaaaaaagagagacggtttccactgagatgcgacaaacggagctgccaagagccc
agggcctcaagccttgtgagcagctccaggaatctcagggaatcgagaagagacccagga
cactggcattggcaaaatgttgctggtgtggccgcgtgtagcactgaagtggcctggagg
gtgcggcccattcggaagagctgtggctggaaagagtttgtacataagggacagcaagca
aatgcacatgaagccagtgactatgaagtcggtcagattgaattgactcagtttttctca
agtcttcaagcgtga

>gi568815596f:68057935_68274799|GENSCAN_predicted_peptide_3|889_aa
MPEELTEDDLMEVSASEAVPGDEEDDVEAAVSENKVTLDNLAEGSQPHTRSPSPQRGGKT
GLFAGLASSVSMRPASPPSPAADSCSACRFFARRPPLRVTWVKPSSALALCVSISDSIPG
NLKALPAETRAQLHHAEASLSQPPLQLRPFPKTSQAGDLQDLGPYVCVRKAVGKGDKQIR
AVVKEHSVRSQERIWHYPGITTANMPGHLGQNTESGRDKLPMFGVWLPFLGALGVAVAVA
EIGCTMSAFEKPQIIAHIQKGFNYTVFDCKWVPCSAKFVTMGNFARGTGVIQLYEIQHGD
LKLLREIEKAKPIKCGTFGATSLQQRYLATGDFGGNLHIWNLEAPEMPVYSVKGHKEIIN
AIDGIGGLGIGEGAPEIVTGSRDGTVKVWDPRQKDDPVANMEPVQGENKRDCWTVAFGNA
YNQEERVVCAGYDNGDIKLFDLRNMALRWETNIKNGVCSLEFDRKDISMNKLVATSLEGK
FHVFDMRTQHPTKGFASVSEKAHKSTVWQVRHLPQNRELFLTAGGAGGLHLWKYEYPIQR
SKKDSEGIEMGVAGSVSLLQNVTLSTQPISSLDWSPDKRGLCVCSSFDQTEPILITAHNK
DCISQSPYQQGVDTVEHEDGPVVQRGSECCNLEVPRDTAFQEAAVTGISTRDFLPGFDLT
YPWELLTQSLSFYMAHPNWKHTGKRILGNVIQPRQVGIYKANSDLWTSHSTFAGLYLQPE
KRAVESLPVKRDKGGVSEVKTEASGNPEGQALTHEACFTPNRQSPSKEAVQIHMPVVPPT
SGKAGFGVELGTYLSGTIYNTHHQQDDFMVHDDCKDLAIKPLRRFEEGIKIEGWKVAHLP
GSNQDSQLDEIKSANIPSRKIWMLMGSEWVVKIPPLELGFPIINERIVG

>gi568815596f:68057935_68274799|GENSCAN_predicted_CDS_3|2670_bp
atgccagaggaattaacagaagacgacttgatggaggtgagtgcttctgaagcagtgccg
ggtgatgaggaagatgatgtagaagcagcagtgtcagaaaacaaggtgacattagacaat
ttggcagaagggtcccagccacataccaggagcccgtccccacagaggggtgggaagacg
ggcctcttcgccggcctggcctcctctgtgtccatgcggcccgcatccccgccctctcct
gctgcggacagctgttcagcctgtcgtttcttcgcccgtcggccacccttgcgggtgacc
tgggtaaagccctcctctgccctggcgctctgcgtttccatttcggattccatccccgga
aatcttaaagcgctgccggctgaaacacgtgcgcagctgcaccacgcagaagccagcctc
agccaaccacccctgcagctccgcccctttcctaaaacgtcacaggccggggacctgcag
gatttggggccgtatgtatgtgtgcgaaaggcagtgggtaaaggcgataaacaaattcgt
gctgttgtgaaggaacattctgtgcgatcgcaggagcgcatctggcattatccggggata
actacagccaacatgccagggcatctggggcagaatacggaaagcgggagggacaaattg
ccaatgtttggagtctggttgccgtttttgggggctctgggtgtggcggttgccgtagct
gaaattggctgcaccatgtcggccttcgagaagcctcagatcatcgcccatatccagaag
ggcttcaactacacggtgtttgactgtaagtgggtgccctgcagcgccaaatttgtgacc
atgggcaacttcgcacggggcaccggcgtcattcagctgtacgagatccagcacggggac
ctgaagctgcttcgggagattgaaaaggccaaacctattaaatgtggaacatttggtgca
acatctttacagcagagatatttagctactggagattttggtggaaaccttcatatatgg
aatttagaagctccagagatgccagtatattctgtaaagggccataaagaaattataaat
gccatagatggcataggtggactaggaattggagaaggagcacctgaaattgtgactggc
agccgagatggaactgtgaaggtgtgggacccaaggcaaaaagatgatcctgttgctaat
atggaacctgtacaaggagaaaacaagagagactgttggactgtggcatttggcaatgct
tataatcaagaagaacgtgttgtttgtgctggctatgacaatggggatatcaaactattt
gatctcagaaatatggcattacggtgggagacaaacatcaaaaatggggtgtgtagcttg
gagtttgacagaaaagacataagtatgaataagttagtagccacatctctggaaggaaag
ttccatgtttttgacatgagaacacagcatccaaccaaaggttttgcctctgtttcagaa
aaggctcataaatctactgtgtggcaggtccgacacctgccgcagaacagggagctcttt
ctgacagctggaggcgccggcggccttcacctctggaagtatgaataccctattcagcgg
tcaaagaaagattctgagggaatagaaatgggagtcgcaggttctgtaagccttctgcag
aatgttacgttgtccacccagcccatttcaagtttggattggagtccagataaaaggggt
ctctgcgtctgtagttcatttgaccaaacggagcccattttaatcacagcacataataaa
gactgtatctcccagtctccctatcagcaaggtgtggacactgtggagcatgaggatggg
cctgtcgtccagaggggctctgaatgctgcaatttagaggttccacgtgacactgccttc
caggaagcagccgtaacaggaatcagcactagggatttcttaccaggatttgatcttacc
tacccatgggagctgctgacacagtctctgagcttctacatggcccaccctaactggaaa
catacaggaaagcgaattctgggaaatgtaattcagcctcgccaggttggcatttataaa
gccaactcagatctttggacaagtcattccacctttgcaggcctctacctgcaacccgaa
aaacgggcggtggagagtctgccagtcaaacgggataagggaggagtctctgaagtcaag
acggaagccagtggaaacccagagggtcaggcactgactcacgaggcctgcttcacaccc
aataggcaaagcccctcaaaagaagccgttcagatccacatgcccgtggtgccccccacc
tcaggaaaagcgggatttggtgtggaactcggaacttacttgtcaggcactatctacaac
acccatcaccagcaggatgatttcatggttcatgatgactgcaaggaccttgccatcaaa
ccactgagaaggtttgaagaaggaataaagatagaagggtggaaagtggcacatcttcca
ggctccaaccaggatagtcaattggatgaaataaaatcagcaaatatcccctctaggaag
atttggatgctgatgggatctgaatgggttgtgaaaatccctcctttggagcttggattc
ccaataattaatgaacgtatagttggctag

>gi568815596f:68057935_68274799|GENSCAN_predicted_peptide_4|392_aa
MYRQISSRKVGTEQMGLRVLHRGDSSPDGPCNSVVDPSRYTDAQIPLTASRRTWIFPVFW
CGIRRSPTRARQSAAALAAQRGAQVPGCAALAEAPPPPRALQHSAFPSLSALPFFRALRL
RVSASRCRPQLGRRRRFLRASLSATLLRASEPASRRPAEQNDTWYMVDVKYYLKEYVKWN
KGKGAYRTLKARSAESSSQCSNAGKEYKGIQIGKKKNGSRLQMGNEASYPLEMCSHFDAD
EIKRLGKRFKKLDLDNSGSLSVEEFMSLPELQQNPLVQRVIDIFDTDGNGEVDFKEFIEG
VSQFSVKGDKEQKLRFAFRIYDMDKDGYISNGELFQVLKMMVGNNLKDTQLQQIVDKTII
NADKDGDGRISFEEFCAVVGGLDIHKKMVVDV

>gi568815596f:68057935_68274799|GENSCAN_predicted_CDS_4|1179_bp
atgtatagacaaatttcctcaagaaaagtgggcactgaacagatgggcctgcgggtgctg
caccggggtgacagcagtcctgatgggccgtgcaattctgtagtggatccaagtcggtac
acggatgcacagatacctttgactgcctctcgcaggacttggatttttcctgttttttgg
tgcggcatccggcgttccccaaccagagctcgccagagcgccgcggcactcgccgcccag
cggggcgcgcaggttcccggatgtgcggcgctcgcggaagccccgcccccgccccgcgcg
ctgcagcactccgctttcccctccctctccgccctccccttttttcgtgccttgaggttg
cgggtcagcgcgagccgctgcaggccgcagctgggccgccgccgccgtttcctgcgagcc
agcctgagcgcaacacttctccgagccagcgagccagcgagccgccgacccgccgagcaa
aatgatacttggtatatggtagatgttaagtactatttgaaggagtatgtgaagtggaat
aaaggaaaaggtgcttatcggacattgaaggcacgtagtgctgaaagttctagtcagtgc
agtaatgcaggaaaagaatataaaggcatacagattggaaagaagaaaaacggatcccgt
ttgcagatgggaaatgaggcaagttatcctttggaaatgtgctcacactttgatgcggat
gaaattaaaaggctaggaaagagatttaagaagcttgatttggacaattctggttctttg
agtgtggaagagttcatgtctctgcctgagttacaacagaatcctttagtacagcgagta
atagatatattcgacacagatgggaatggagaagtagactttaaagaattcattgagggc
gtctctcagttcagtgtcaaaggagataaggagcagaaattgaggtttgctttccgtatc
tatgacatggataaagatggctatatttccaatggggaactcttccaggtattgaagatg
atggtggggaacaatctgaaagatacacagttacagcaaattgtagacaaaaccataata
aatgcagataaggatggagatggaagaatatcctttgaagaattctgtgctgttgtaggt
ggcctagatatccacaaaaagatggtggtagatgtgtga