GENSCAN 1.0    Date run: 3-Nov-116    Time: 09:13:18

Sequence gi568815587r:47679108_47948313 : 269206 bp : 44.06% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.22 PlyA -    437    432    6                               1.05
 1.21 Term -    966    928   39  0  0  101   45    39 0.316  -1.91
 1.20 Intr -   6942   6786  157  2  1   73  121    61 0.839   7.91
 1.19 Intr -  11751  10969  783  2  0   70  100   523 0.917  42.29
 1.18 Intr -  13149  12996  154  1  1   78   65   114 0.972   7.23
 1.17 Intr -  20446  20339  108  2  0   59   72    65 0.782   2.26
 1.16 Intr -  26810  26757   54  0  0  131   53    46 0.870   4.35
 1.15 Intr -  31404  31270  135  1  0   41   73    84 0.464   2.74
 1.14 Intr -  35240  35177   64  2  1   86   85    48 0.111   2.59
 1.13 Intr -  40979  40827  153  2  0   77   29   164 0.115   9.67
 1.12 Intr -  44209  43869  341  2  2   65   70    86 0.307  -0.31
 1.11 Intr -  45065  44921  145  2  1   88   91    26 0.883   2.76
 1.10 Intr -  45521  45361  161  0  2   84  111   130 0.761  14.61
 1.09 Intr -  52454  52267  188  1  2   66   94   149 0.998  12.63
 1.08 Intr -  53563  53430  134  1  2   99  101    66 0.997   8.44
 1.07 Intr -  55022  54918  105  2  0   61  107    62 0.899   5.81
 1.06 Intr -  57633  57509  125  1  2   77   53   168 0.997  12.50
 1.05 Intr -  65056  64846  211  1  1  109   51   138 0.996  10.69
 1.04 Intr -  67287  66949  339  0  0   96   80   191 0.999  14.77
 1.03 Intr -  75557  75421  137  0  2  104   49   139 0.898  11.89
 1.02 Intr -  86255  86163   93  0  0   90   85    75 0.982   7.34
 1.01 Init -  88181  87956  226  2  1   88   57   265 0.988  20.03
 1.00 Prom -  94173  94134   40                              -4.96

 2.20 PlyA -  99176  99171    6                               1.05
 2.19 Term - 100087  99998   90  1  0  118   34    63 0.969   1.62
 2.18 Intr - 101340 101236  105  0  0   61   92    64 0.851   4.51
 2.17 Intr - 104091 103966  126  0  0   29   80   116 0.919   5.78
 2.16 Intr - 113839 113679  161  2  2   68   48   136 0.502   7.31
 2.15 Intr - 118777 118672  106  1  1  120   84   151 0.989  17.69
 2.14 Intr - 118967 118871   97  1  1  131   76    48 0.661   7.91
 2.13 Intr - 119155 119061   95  1  2   39   91   -11 0.659  -6.94
 2.12 Intr - 119356 119261   96  1  0   70  110   107 0.991  11.31
 2.11 Intr - 122823 122704  120  0  0   80   83    64 0.888   5.79
 2.10 Intr - 133117 132957  161  2  2  113   87    66 0.839   8.71
 2.09 Intr - 133322 133195  128  1  2   47   99    73 0.986   4.62
 2.08 Intr - 133940 133775  166  0  1   83   47   108 0.964   5.22
 2.07 Intr - 136542 136368  175  0  1   76   52   109 0.539   5.61
 2.06 Intr - 140351 140267   85  1  1   83   89    52 0.971   4.62
 2.05 Intr - 142714 142617   98  1  2   80   17   121 0.918   3.01
 2.04 Intr - 156702 156544  159  0  0   69   80    91 0.937   6.58
 2.03 Intr - 157894 157780  115  0  1   85  115    14 0.569   4.25
 2.02 Intr - 168915 168741  175  1  1   71  111    94 0.741   9.00
 2.01 Init - 169313 169112  202  2  1   55   60   208 0.588  11.74
 2.00 Prom - 174217 174178   40                              -3.06

 3.00 Prom + 189388 189427   40                              -3.46
 3.01 Init + 210769 210798   30  0  0   73   86    21 0.114   0.19
 3.02 Intr + 236456 236565  110  1  2    4   76   142 0.243   3.68
 3.03 Intr + 249305 249338   34  2  1   40  102    37 0.010  -1.47
 3.04 Intr + 259365 259439   75  2  0  110   14    55 0.032   0.01
 3.05 Intr + 268313 268441  129  1  0   78   82    96 0.124   8.89

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587r:47679108_47948313|GENSCAN_predicted_peptide_1|1283_aa MGKKSRAVPGRRPILQLSPPGPRGSTPGRDPEPEPDTEPDSTAAVPSQPAPSAATTTTTA VTAAAASDDSPSEGKDEQEAVQEVPRVVQNPPKPVMTTRPTAVKATGGLCLLGAYADSDD DDNDVSEKLAQSKETNGNQSTDIDSTLANFLAEVNEGIQALSNSEEEKKGVAASLLAPLL PEGIKEEEERWRRKVICKEEPVSEVKETSTTVEEATTIVKPQEIMLDNIEDPSQEDLCSV VQSGESEEEEEQDTLELELVLERKKAELRALEEGDGSVSGSSPRSDISQPASQDGMRRLM SKRGKWKMFVRATSPESTSRSSSKTGRDTPENGETAIGAENSEKIDENSDKEMEVEESPE KIKVQTTPKVEEEQDLKFQIGELANTLTSKFEFLGINRQSISNFHVLLLQTETRIADWRE GALNGNYLKRKLQDAAEQLKQYEINATPKGWSCHWDRDHRRYFYVNEQSGESQWEFPDGE EEEEESQAQENRDETLAKQTLKDKTGTDSNSTESSETSTESPPPPPPPPPPAEDGEIQEV EMEDEGSEEPPAPGTEEDTPLKPSAQTTVVTSQSSVDSTISSSSSTKGIKRKATEISTAV VQRSATIGSSPVLYSQSAIATGHQAAGIGNQATGIGHQTIPVSLPAAGMGHQARGMSLQS NYLGLAAAPAIMSYAECSVPIGVTAPSLQPVQARGAVPTATIIEPPPPPPPPPPPPPPAP KMPPPEKTKKGRKDKAKKSKTKMPSLVKKWQSIQRELDEEDNSSSSEEDRESTAQKRIEE WKQQQLTIPDPYEDFMYRHLQYYGYFKAQRGSLPNSATHQHVRKNNPQCLLNGSLGEKDD LIPDTLQKEKLLWPISLSSAVHRQIEAINREWAPPQPEYFYQPKGNEKVPEIVGEKKGTV VYQLDSVPIEGSYFTSSRVGGKRGIVKELAVTLQGPEDNTLLFESRFESGNLQKAVRVDT YEYELTLRTDLYTNKHTQWFYFRVQNTRKDATYRFTIVNLLKPKSLYTVGMKPLLYSQLD ANTRNIGWRREGNEIKYYKNNTDDGQQPFYCLTWTIQFPYDQDTCFFAHFYPYTYTDLQC YLLSVANNPIQSQFCKLQTLCRSLAGNTVYLLTITNPSQTPQEAAAKKAVVLSARVHPGE SNGSWVMKGFLDFILSNSPDAQLLRDIFVFKVLPMLNPDGVIVGNYRCSLAGRDLNRHYK TILKESFPCIWYTRNMIKRLLEEREVLLYCDFHGHSRKNNIFLYGCNNNNRKYWLHERVF PLMLCKNAPDKVIKETPTLPSKI >gi568815587r:47679108_47948313|GENSCAN_predicted_CDS_1|3852_bp atggggaagaagtcccgggcggtacccggccgtaggcccatcctgcaactctctccgccg ggtcctcggggcagcacgccgggccgggacccggagccggaacccgacactgagccggac tcaaccgcggcggtccccagccagcccgccccgtcggcggcgacgaccaccaccaccgcg gtgactgccgccgcggcctcggacgactcgccttcagaaggcaaggatgaacaggaagcg gtgcaggaggttcctagagttgttcagaatcctccaaaaccagtcatgaccactagaccc acagctgttaaagcaacaggcggtctatgcttgcttggtgcttatgctgacagtgatgac gatgacaatgatgtttccgaaaaactagcacaatccaaagagacaaatggaaaccagtca 
actgatattgatagtacattggccaacttcctagcggaagtaaatgaaggaattcaggct ctctcaaatagtgaggaggagaagaaaggggtggcagcatcgctgcttgctcctttattg cctgagggaataaaagaagaagaagagagatggagaagaaaagtaatttgtaaagaggag ccagtttcagaagtaaaagaaacaagtacaacagtagaagaagcaacaacaatagtaaag ccacaggaaattatgttggacaatatagaagacccttctcaggaggatctttgcagtgtt gtccaatctggagaaagtgaggaggaagaggaacaagatacccttgaactggagctagtt ttggaaaggaaaaaagcagagttgcgagccttggaggaaggagatggtagtgtgtcaggg tctagtccacgttctgatatcagccagccagcatctcaagatggaatgcgtaggcttatg tctaaaagaggaaaatggaagatgtttgttcgagctaccagtccagaatctaccagtagg agttctagtaaaactggacgagatactccagaaaatggagaaactgcaattggtgctgaa aattcagaaaaaatagatgagaattcagataaagagatggaagtagaagaatctccagag aaaataaaagtacagacaacaccaaaagtagaagaagaacaggatttgaaatttcagatt ggagaactggcaaataccctgacaagtaaattcgagtttctaggcattaatagacaatcc atctccaactttcatgtgctgctcttacagactgagactcgaattgcagactggcgggaa ggggctcttaatggaaactaccttaaacgaaaacttcaggatgcagcagaacaactaaaa cagtatgaaataaacgccactcctaaaggctggtcctgccactgggacagggatcataga cggtatttctatgtaaacgaacagtcgggcgagtctcagtgggagtttccagatggtgaa gaggaagaagaagaaagccaagcacaagaaaatagagatgagactcttgccaaacagacc ttgaaagacaaaactggcactgattcaaattcaacagaatcctctgaaacttccacagaa tcacctccaccccctcctccaccacctcctcctgcggaagatggtgagatccaggaggta gagatggaggatgagggaagtgaggagccccctgccccaggaacagaggaagatacccct ttgaaaccttcagcacaaaccacagttgtaactagccagagttcagttgattccaccatc tctagttcttcttccactaaaggaataaagaggaaagctacagaaattagcactgcagtg gttcagaggtcagctaccattggcagttctccagttctctatagccagtcagctatagct acaggtcaccaggcagcagggattggaaaccaggcaacaggaattggacatcagacaata ccagttagccttccagcagcaggaatgggtcatcaggccagaggaatgagcctgcagtca aattaccttggactagcggcagcacctgcaattatgagttatgcagaatgttctgtccca attggagtgactgctccctcattgcagccagttcaggcccgaggtgctgtgcctaccgct accattatagaaccaccaccaccacctcctcctcctcctcctccaccaccaccagctccc aaaatgccaccacctgaaaagacaaaaaaaggaaggaaagacaaggcaaagaagagtaag accaaaatgccatctttggtaaaaaagtggcagagtatccagcgtgagttagatgaagag gacaattctagttccagtgaagaggatcgggaatcaactgcacagaagcgaattgaagag 
tggaaacagcagcagctgactattcctgatccttatgaagactttatgtaccgtcacctc caatattatggctactttaaagctcagagaggcagtttaccaaactctgctacgcatcag catgttcggaagaataaccctcaatgcctgttgaatggctctcttggggaaaaagatgat ttgataccagacaccctgcaaaaggagaagcttctatggcctatcagtttatcttcagct gtgcacagacagatagaagccatcaacagagagtgggctccacctcaaccagaatatttc tatcagcctaaaggaaatgaaaaggtaccagagattgtaggagagaaaaaaggaacagtt gtctatcaattagattcagtgcctatagaaggttcctattttaccagttccagagtggga ggcaaacgaggaattgtcaaggaacttgctgtcacgttgcaaggaccagaagataatact ctactgtttgaatcaaggtttgagagtgggaatctgcaaaaagctgtcagagtagacacc tatgagtatgaactcaccttgcgaactgacctctacactaacaaacacactcagtggttt tattttcgtgttcagaacaccagaaaagatgctacctatcgcttcaccattgtcaacttg ctaaaacccaagagtctttatactgtagggatgaagccactcttgtactcccaattggat gccaacacccgcaatattggctggaggagagaaggaaatgaaatcaagtactacaagaac aacacggatgatgggcagcagcccttctactgtctcacgtggaccattcagtttccatat gaccaggacacttgcttctttgcacacttctacccatatacatacactgatttgcaatgc tacctcctgtcagtggcaaacaaccctatccagtctcagttctgcaagctccaaacttta tgcaggagcctagcaggaaataccgtttacttgctcaccatcaccaacccatcccagacc cctcaagaggcagctgcaaagaaagctgtggtcttgagtgccagagttcaccctggagaa agtaatggctcctgggttatgaaaggctttttggacttcatccttagcaactccccagat gcccagctcctcagagatatttttgtcttcaaggtgcttcccatgttaaatccagatggt gtgattgtggggaattatcggtgttccttggccggaagggatttgaacaggcattataaa accattctgaaggagtctttcccttgtatttggtacaccaggaacatgatcaaaagactt cttgaagaaagagaggttctgttgtattgtgatttccatggccacagtcgtaagaataat atcttcctgtatggctgtaataacaacaatcgcaaatactggcttcatgaacgagtcttt cctttaatgttatgcaaaaatgcaccagataaggtaataaaagagacacccactttacca tcgaagatctga >gi568815587r:47679108_47948313|GENSCAN_predicted_peptide_2|819_aa MLHLSAAPPAPPPEVTATARPCLCSVGRRGDGGKMAAAGALERSFVELSGAERERPRHFR EFTVCSIAPLVPVISQKTRVQADHVHFAGTANAVAGAVKYSESAGGFYYVESGKLFSVTR NRFIHWGDQSPSDRPLSLAVHCVEHDAFIFALCQDHKLRMWSYKEQMCLMVADMLEYVPV KKDLRLTAGTGHKLRLAYSPTMGLYLGIYMHAPKRGQETLIDFALTSTDIWALWHDAENQ TVVKYINFEHNVAGQWNPVFMQPLPEEEIVIRDDQDPRLQGSVTEYEFSQEEFRNLQQEF WCKFYACCLQYQEALSHPLALHLNPHTNMVCLLKKVNVDIARDVICLIKCLRLIEESVTV 
DMSVIMEMSCYNLQSPEKAAEQILEDMITIDVENVMEDICSKLQEIRNPIHAIGLLIREM DYETEVEMEKGFNPAQPLNIRMNLTQLYGSNTAGYIVCRGVHKIASTRFLICRDLLILQQ LLMRLGDAALECFCQAASEVGKEEFLDRLIRSEDGEIVSTPRLQYYDKVLRLLDVIGLPE LVIQLATSAITEAGDDWKSQATLRTCIFKHHLDLGHNSQAYEALTQIPDSSRQLDCLRQL VVVLCERSQLQDLVEFPYVNLHNEVVGIIESRARAVDLMTHNYYELLYAFHIYRHNYRKA GTVMFEYGMRLGREVRTLRGLEKQGNCYLAALNCLRLIRPEYAWIVQPVSGAVKVDAAEL LRLYLNYDLLEEAVDLVSEYVDAVLGKGHQYFGIEFPLSATAPMVWLPYSSIDQLLQALG ENSANSHNIALSQKILDKLEDYQQKVDKATRDLLYRRTL >gi568815587r:47679108_47948313|GENSCAN_predicted_CDS_2|2460_bp atgcttcacctgtccgcagctccgcccgccccacccccggaagtgacggcgaccgcgcgg ccctgcctttgttccgttgggcgtcgcggcgacggcgggaagatggcggcggcgggagcc ctggaacggagcttcgtggagctaagcggagctgagcgcgaaaggccgaggcactttcgg gaattcacagtctgcagcattgctcccttagtccctgtcatttctcagaagacgcgtgtg caggctgaccacgttcattttgcagggactgcaaatgccgtggctggcgccgtaaaatac agtgaaagcgcgggaggcttttactacgtggagagtggcaagttgttctccgtaaccaga aacaggttcattcattggggtgaccagtcgccttcagatcgtcccctcagtcttgctgtt cattgtgtggagcatgatgccttcatctttgctttgtgtcaggatcataaactacgaatg tggtcttacaaggagcaaatgtgcctaatggtagctgacatgctggagtatgtccctgtg aagaaagaccttcggcttactgctggaactggacacaaattacggcttgcttattccccc accatgggactctacctggggatatacatgcatgcaccaaaacgaggacaggagacactg attgactttgccttaacttccacggatatctgggccctgtggcatgatgctgagaaccaa acagtagtgaaatacatcaactttgaacataatgttgcaggtcagtggaatccagttttt atgcagcctctgccagaggaagagattgtcatcagagatgatcaagaccccagacttcaa ggaagtgtaacagagtatgaattctcccaggaggagtttcgaaatttacaacaagaattc tggtgcaagttctatgcctgttgtcttcagtatcaagaagccctctctcaccctcttgcc ctacatttgaatccacacacaaacatggtgtgcctgctgaaaaaagtaaatgtggacatc gctcgggatgtcatatgtcttataaaatgcctccggctgattgaagagtcagtaactgtg gatatgtcagttataatggaaatgagttgttataacctacagtctccggaaaaggctgca gagcagattctggaagatatgatcactattgatgtagaaaatgtgatggaggatatttgt agtaaactgcaagagattaggaacccaatccatgcaattggactacttatacgggaaatg gattatgaaacagaagtggaaatggaaaagggattcaatccagctcagcctttgaatatt cgaatgaatcttacccagctctatggtagtaacacagcagggtatattgtgtgcagaggg gtgcataaaatcgccagtactcgtttcctgatctgcagagatcttttgatcttacagcag 
ctgttaatgaggcttggagatgctgctctggaatgtttttgtcaggcagcatctgaagta ggcaaagaggaattcttggatcgcttgattcgctcagaggatggggagatcgtgtctacc cccaggctgcagtattatgacaaggttttacgactactagatgtcattggtttgcctgaa ctggttattcagttggctacatcagccataactgaagcaggtgatgactggaaaagtcag gctactctaaggacatgtattttcaaacatcatttggatttgggtcacaatagccaagca tatgaagccttaacccaaattcctgattccagcaggcaattagattgtttacggcagttg gtggtagttctttgtgaacgctcacagctacaggatcttgtagagtttccctatgtgaat ctgcataatgaggttgtgggaataattgagtcacgtgctagagctgtggaccttatgact cacaattactatgaacttctgtatgcctttcacatctatcgccacaattaccgcaaggct ggcacagtgatgtttgagtatggaatgcggcttggcagagaagttcgaactctccgggga cttgagaaacaaggcaactgttatctggctgctctcaattgtttacgacttattcgtcca gaatatgcgtggattgtgcagccagtgtctggtgcagtgaaggttgatgctgctgaattg cttcgtttatacttaaactatgaccttttagaagaagctgtggatttggtgtcagaatat gtggatgctgtattgggaaaaggacatcaatacttcggaattgagtttccactgtccgca acagccccaatggtgtggcttccatactcctctattgatcagcttctccaagctctggga gagaacagtgccaacagtcacaacatcgcactgtcccagaaaatacttgacaaattggag gactaccagcaaaaagttgataaggcaacacgggatttattatatcgtcggaccttgtga >gi568815587r:47679108_47948313|GENSCAN_predicted_peptide_3|126_aa MARGRYVEDWGDPYGLHQGVPLPSGFRFGEAKGKHAPEIKGQEESEMIPEVIVFFPYKNV NCMRIEISLVHHTSAYSYECLAIVKQLCAWHAQDKVEQDAKGATRIILGLVMLTPDTFTD TAKKKW >gi568815587r:47679108_47948313|GENSCAN_predicted_CDS_3|378_bp atggctcgtggcaggtatgtggaagactggggtgacccgtatggcctgcatcaaggagtt cccttaccctctggcttccggtttggtgaggccaaaggaaagcacgcaccggagatcaaa gggcaggaggagagcgagatgatcccggaagtcatcgtcttttttccttacaagaatgta aactgcatgaggatagaaatttctcttgttcatcacaccagtgcctacagctatgagtgt ttggcaatagtcaaacagctgtgtgcttggcacgctcaagacaaagttgagcaagatgcc aagggagccacacgcattatcctgggcctggtgatgctcactccagacacattcacagat acagcaaagaagaaatgg