GENSCAN 1.0 Date run: 5-Nov-116 Time: 17:53:18 Sequence gi568815596r:68080966_68317132 : 236167 bp : 38.96% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 309 350 42 2 0 75 84 66 0.785 3.37 1.02 Intr + 2303 2360 58 1 1 108 101 13 0.234 2.14 1.03 Intr + 11543 11756 214 0 1 120 43 59 0.376 1.45 1.04 Intr + 11934 12104 171 0 0 82 76 40 0.308 0.34 1.05 Intr + 13818 14004 187 0 1 47 98 124 0.588 8.07 1.06 Term + 14087 14167 81 1 0 95 48 64 0.592 -0.19 1.07 PlyA + 14853 14858 6 1.05 2.20 PlyA - 15148 15143 6 1.05 2.19 Term - 22847 22698 150 2 0 1 49 135 0.199 -2.17 2.18 Intr - 29465 29362 104 0 2 58 100 97 0.354 6.87 2.17 Intr - 30522 30451 72 1 0 107 73 66 0.537 5.46 2.16 Intr - 31732 31588 145 1 1 28 60 144 0.540 4.53 2.15 Intr - 33430 33342 89 2 2 45 52 97 0.297 0.57 2.14 Intr - 35464 35364 101 0 2 70 69 55 0.433 0.63 2.13 Intr - 35978 35847 132 1 0 5 56 152 0.302 2.54 2.12 Intr - 38727 38621 107 0 2 86 50 84 0.105 2.49 2.11 Intr - 50480 50312 169 1 1 91 93 152 0.314 15.03 2.10 Intr - 53834 53737 98 2 2 89 86 52 0.966 2.99 2.09 Intr - 56468 56334 135 2 0 104 94 55 0.991 7.54 2.08 Intr - 57892 57777 116 2 2 62 55 88 0.988 2.15 2.07 Intr - 60830 60729 102 0 0 57 27 116 0.731 1.63 2.06 Intr - 63750 63620 131 2 2 41 101 137 0.944 9.72 2.05 Intr - 66602 66502 101 2 2 48 95 81 0.547 2.79 2.04 Intr - 76536 76296 241 2 1 44 90 336 0.019 26.03 2.03 Intr - 76729 76545 185 1 2 36 -33 221 0.034 2.46 2.02 Intr - 77192 76831 362 0 2 94 56 245 0.029 15.91 2.01 Init - 85917 85788 130 0 1 62 4 192 0.121 8.56 2.00 Prom - 97788 97749 40 -3.65 3.12 PlyA - 97911 97906 6 1.05 3.11 Term - 100045 99998 48 1 0 119 41 38 0.437 -1.47 3.10 Intr - 105687 105503 185 1 2 85 87 193 0.932 17.49 3.09 Intr - 106349 106290 60 0 0 58 78 77 0.737 1.49 3.08 Intr - 107725 107549 177 2 0 89 116 205 0.998 22.37 3.07 Intr - 136166 136127 40 2 1 137 93 -4 0.003 1.88 3.06 Intr - 147036 146941 96 0 0 65 60 116 0.742 5.79 3.05 Intr - 163909 163821 89 2 2 86 73 112 0.798 8.27 3.04 Intr - 171261 171160 102 1 0 -4 81 146 0.241 3.93 3.03 Intr - 171721 171532 190 1 1 42 73 82 0.055 0.34 3.02 Intr - 172172 172023 150 2 0 81 4 116 0.031 1.94 3.01 Init - 180265 180224 42 1 0 42 111 31 0.517 1.27 3.00 Prom - 209580 209541 40 -2.45 4.05 PlyA - 209795 209790 6 1.05 4.04 Term - 213061 212897 165 1 0 73 48 104 0.292 1.83 4.03 Intr - 219336 219256 81 0 0 46 85 61 0.222 0.52 4.02 Intr - 225979 225843 137 2 2 -8 110 73 0.166 -0.73 4.01 Init - 230639 230549 91 2 1 82 80 47 0.402 2.00 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 77415 77564 150 2 0 97 86 80 0.916 8.14 S.002 Intr + 80718 80801 84 2 0 80 105 84 0.919 8.40 S.003 Intr + 81300 81360 61 2 1 67 101 37 0.883 0.29 S.004 Term + 81581 81702 122 0 2 77 42 70 0.842 -1.04 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:68080966_68317132|GENSCAN_predicted_peptide_1|250_aa MTSFLPLSCLFLCLMGPSSYRKTISGLSLILRYGKNNVCESLIFKIMSLDIIPVRVVYRK QQKITLADLSREKHVLLCCSIAQGISTVLEPGLEKEARQGKLGSSSQFKGNSRYIHLVKP GSHTCILAAREAKKCVEFSAPKVGVKQLKKERRFPLRCDKRSCQEPRASSLVSSSRNLRE SRRDPGHWHWQNVAGVAACSTEVAWRVRPIRKSCGWKEFVHKGQQANAHEASDYEVGQIE LTQFFSSLQA >gi568815596r:68080966_68317132|GENSCAN_predicted_CDS_1|753_bp atgacctcttttctgcctctctcgtgtctttttctgtgcctgatgggaccatctagttac aggaaaacaatctcagggctctcactgattctacgttatggtaagaataatgtatgtgag agtcttatattcaagataatgagtttggatattataccagtcagagttgtttaccgcaag caacagaaaataactctagctgatttaagcagagaaaagcatgtgttgctgtgttgttca atagctcaaggcatttcaacagtgttggaaccaggcttagaaaaggaggccagacaaggg aagctaggcagcagctcccagttcaaaggcaatagcaggtacatccatttagtcaagcct gggtcacatacctgcatcctagctgcaagggaggctaaaaaatgtgtggagttttcagct cctaaagtgggagtcaagcagctaaaaaaagagagacggtttccactgagatgcgacaaa cggagctgccaagagcccagggcctcaagccttgtgagcagctccaggaatctcagggaa tcgagaagagacccaggacactggcattggcaaaatgttgctggtgtggccgcgtgtagc actgaagtggcctggagggtgcggcccattcggaagagctgtggctggaaagagtttgta cataagggacagcaagcaaatgcacatgaagccagtgactatgaagtcggtcagattgaa ttgactcagtttttctcaagtcttcaagcgtga >gi568815596r:68080966_68317132|GENSCAN_predicted_peptide_2|889_aa MPEELTEDDLMEVSASEAVPGDEEDDVEAAVSENKVTLDNLAEGSQPHTRSPSPQRGGKT GLFAGLASSVSMRPASPPSPAADSCSACRFFARRPPLRVTWVKPSSALALCVSISDSIPG NLKALPAETRAQLHHAEASLSQPPLQLRPFPKTSQAGDLQDLGPYVCVRKAVGKGDKQIR AVVKEHSVRSQERIWHYPGITTANMPGHLGQNTESGRDKLPMFGVWLPFLGALGVAVAVA EIGCTMSAFEKPQIIAHIQKGFNYTVFDCKWVPCSAKFVTMGNFARGTGVIQLYEIQHGD LKLLREIEKAKPIKCGTFGATSLQQRYLATGDFGGNLHIWNLEAPEMPVYSVKGHKEIIN AIDGIGGLGIGEGAPEIVTGSRDGTVKVWDPRQKDDPVANMEPVQGENKRDCWTVAFGNA YNQEERVVCAGYDNGDIKLFDLRNMALRWETNIKNGVCSLEFDRKDISMNKLVATSLEGK FHVFDMRTQHPTKGFASVSEKAHKSTVWQVRHLPQNRELFLTAGGAGGLHLWKYEYPIQR SKKDSEGIEMGVAGSVSLLQNVTLSTQPISSLDWSPDKRGLCVCSSFDQTEPILITAHNK DCISQSPYQQGVDTVEHEDGPVVQRGSECCNLEVPRDTAFQEAAVTGISTRDFLPGFDLT YPWELLTQSLSFYMAHPNWKHTGKRILGNVIQPRQVGIYKANSDLWTSHSTFAGLYLQPE KRAVESLPVKRDKGGVSEVKTEASGNPEGQALTHEACFTPNRQSPSKEAVQIHMPVVPPT SGKAGFGVELGTYLSGTIYNTHHQQDDFMVHDDCKDLAIKPLRRFEEGIKIEGWKVAHLP GSNQDSQLDEIKSANIPSRKIWMLMGSEWVVKIPPLELGFPIINERIVG >gi568815596r:68080966_68317132|GENSCAN_predicted_CDS_2|2670_bp atgccagaggaattaacagaagacgacttgatggaggtgagtgcttctgaagcagtgccg ggtgatgaggaagatgatgtagaagcagcagtgtcagaaaacaaggtgacattagacaat ttggcagaagggtcccagccacataccaggagcccgtccccacagaggggtgggaagacg ggcctcttcgccggcctggcctcctctgtgtccatgcggcccgcatccccgccctctcct gctgcggacagctgttcagcctgtcgtttcttcgcccgtcggccacccttgcgggtgacc tgggtaaagccctcctctgccctggcgctctgcgtttccatttcggattccatccccgga aatcttaaagcgctgccggctgaaacacgtgcgcagctgcaccacgcagaagccagcctc agccaaccacccctgcagctccgcccctttcctaaaacgtcacaggccggggacctgcag gatttggggccgtatgtatgtgtgcgaaaggcagtgggtaaaggcgataaacaaattcgt gctgttgtgaaggaacattctgtgcgatcgcaggagcgcatctggcattatccggggata actacagccaacatgccagggcatctggggcagaatacggaaagcgggagggacaaattg ccaatgtttggagtctggttgccgtttttgggggctctgggtgtggcggttgccgtagct gaaattggctgcaccatgtcggccttcgagaagcctcagatcatcgcccatatccagaag ggcttcaactacacggtgtttgactgtaagtgggtgccctgcagcgccaaatttgtgacc atgggcaacttcgcacggggcaccggcgtcattcagctgtacgagatccagcacggggac ctgaagctgcttcgggagattgaaaaggccaaacctattaaatgtggaacatttggtgca acatctttacagcagagatatttagctactggagattttggtggaaaccttcatatatgg aatttagaagctccagagatgccagtatattctgtaaagggccataaagaaattataaat gccatagatggcataggtggactaggaattggagaaggagcacctgaaattgtgactggc agccgagatggaactgtgaaggtgtgggacccaaggcaaaaagatgatcctgttgctaat atggaacctgtacaaggagaaaacaagagagactgttggactgtggcatttggcaatgct tataatcaagaagaacgtgttgtttgtgctggctatgacaatggggatatcaaactattt gatctcagaaatatggcattacggtgggagacaaacatcaaaaatggggtgtgtagcttg gagtttgacagaaaagacataagtatgaataagttagtagccacatctctggaaggaaag ttccatgtttttgacatgagaacacagcatccaaccaaaggttttgcctctgtttcagaa aaggctcataaatctactgtgtggcaggtccgacacctgccgcagaacagggagctcttt ctgacagctggaggcgccggcggccttcacctctggaagtatgaataccctattcagcgg tcaaagaaagattctgagggaatagaaatgggagtcgcaggttctgtaagccttctgcag aatgttacgttgtccacccagcccatttcaagtttggattggagtccagataaaaggggt ctctgcgtctgtagttcatttgaccaaacggagcccattttaatcacagcacataataaa gactgtatctcccagtctccctatcagcaaggtgtggacactgtggagcatgaggatggg cctgtcgtccagaggggctctgaatgctgcaatttagaggttccacgtgacactgccttc caggaagcagccgtaacaggaatcagcactagggatttcttaccaggatttgatcttacc tacccatgggagctgctgacacagtctctgagcttctacatggcccaccctaactggaaa catacaggaaagcgaattctgggaaatgtaattcagcctcgccaggttggcatttataaa gccaactcagatctttggacaagtcattccacctttgcaggcctctacctgcaacccgaa aaacgggcggtggagagtctgccagtcaaacgggataagggaggagtctctgaagtcaag acggaagccagtggaaacccagagggtcaggcactgactcacgaggcctgcttcacaccc aataggcaaagcccctcaaaagaagccgttcagatccacatgcccgtggtgccccccacc tcaggaaaagcgggatttggtgtggaactcggaacttacttgtcaggcactatctacaac acccatcaccagcaggatgatttcatggttcatgatgactgcaaggaccttgccatcaaa ccactgagaaggtttgaagaaggaataaagatagaagggtggaaagtggcacatcttcca ggctccaaccaggatagtcaattggatgaaataaaatcagcaaatatcccctctaggaag atttggatgctgatgggatctgaatgggttgtgaaaatccctcctttggagcttggattc ccaataattaatgaacgtatagttggctag >gi568815596r:68080966_68317132|GENSCAN_predicted_peptide_3|392_aa MYRQISSRKVGTEQMGLRVLHRGDSSPDGPCNSVVDPSRYTDAQIPLTASRRTWIFPVFW CGIRRSPTRARQSAAALAAQRGAQVPGCAALAEAPPPPRALQHSAFPSLSALPFFRALRL RVSASRCRPQLGRRRRFLRASLSATLLRASEPASRRPAEQNDTWYMVDVKYYLKEYVKWN KGKGAYRTLKARSAESSSQCSNAGKEYKGIQIGKKKNGSRLQMGNEASYPLEMCSHFDAD EIKRLGKRFKKLDLDNSGSLSVEEFMSLPELQQNPLVQRVIDIFDTDGNGEVDFKEFIEG VSQFSVKGDKEQKLRFAFRIYDMDKDGYISNGELFQVLKMMVGNNLKDTQLQQIVDKTII NADKDGDGRISFEEFCAVVGGLDIHKKMVVDV >gi568815596r:68080966_68317132|GENSCAN_predicted_CDS_3|1179_bp atgtatagacaaatttcctcaagaaaagtgggcactgaacagatgggcctgcgggtgctg caccggggtgacagcagtcctgatgggccgtgcaattctgtagtggatccaagtcggtac acggatgcacagatacctttgactgcctctcgcaggacttggatttttcctgttttttgg tgcggcatccggcgttccccaaccagagctcgccagagcgccgcggcactcgccgcccag cggggcgcgcaggttcccggatgtgcggcgctcgcggaagccccgcccccgccccgcgcg ctgcagcactccgctttcccctccctctccgccctccccttttttcgtgccttgaggttg cgggtcagcgcgagccgctgcaggccgcagctgggccgccgccgccgtttcctgcgagcc agcctgagcgcaacacttctccgagccagcgagccagcgagccgccgacccgccgagcaa aatgatacttggtatatggtagatgttaagtactatttgaaggagtatgtgaagtggaat aaaggaaaaggtgcttatcggacattgaaggcacgtagtgctgaaagttctagtcagtgc agtaatgcaggaaaagaatataaaggcatacagattggaaagaagaaaaacggatcccgt ttgcagatgggaaatgaggcaagttatcctttggaaatgtgctcacactttgatgcggat gaaattaaaaggctaggaaagagatttaagaagcttgatttggacaattctggttctttg agtgtggaagagttcatgtctctgcctgagttacaacagaatcctttagtacagcgagta atagatatattcgacacagatgggaatggagaagtagactttaaagaattcattgagggc gtctctcagttcagtgtcaaaggagataaggagcagaaattgaggtttgctttccgtatc tatgacatggataaagatggctatatttccaatggggaactcttccaggtattgaagatg atggtggggaacaatctgaaagatacacagttacagcaaattgtagacaaaaccataata aatgcagataaggatggagatggaagaatatcctttgaagaattctgtgctgttgtaggt ggcctagatatccacaaaaagatggtggtagatgtgtga >gi568815596r:68080966_68317132|GENSCAN_predicted_peptide_4|157_aa MGFCHVGQAGLELLASQDPPTLSSQRAQITAFPSVPNQKGILQVLEWPSVLFGVIVNDSI RESDFLQKIHNAIPLRSPDWLKLKGLAMPSVGEDIEQLKFSLLFTDIGTFETVWQVKFYN YHKRDHCQWGSPFSVIEYECKPNETRSLMWVNKESFL >gi568815596r:68080966_68317132|GENSCAN_predicted_CDS_4|474_bp atgggcttttgccatgttggccaggctggtctcgaactcctggcctcacaagatccgccc accttgtcctctcaaagggctcagattacagccttcccctctgtgcccaaccaaaaagga attctacaggtacttgaatggccttctgtcctgtttggtgtgattgtaaatgacagtatt cgagaatctgactttcttcaaaaaatacataatgcaatacccttaaggtcaccagactgg ctaaaattgaaaggactggcaatgccaagtgttggagaggatatagagcaattgaaattc tcattgctgttcacagacattgggaccttcgagacagtgtggcaagtcaagttctacaat taccacaagcgggatcactgccagtggggaagccccttctctgtcattgagtatgaatgc aagcccaacgagacacgcagtctgatgtgggtgaacaaggagtccttcctctga