GENSCAN 1.0 Date run: 3-Nov-116 Time: 04:37:39

Sequence gi568815587r:47617425_47867288 : 249864 bp : 43.77% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.05 PlyA -    307    302    6                               1.05
 1.04 Term -   8317   8246   72  1  0   84   47    49 0.225  -1.69
 1.03 Intr -  17310  17248   63  0  0   51   94    44 0.337   0.21
 1.02 Intr -  18147  18121   27  0  0  110   87    38 0.754   4.21
 1.01 Init -  25041  24955   87  0  0  117   80   200 0.999  20.74
 1.00 Prom -  36522  36483   40                              -4.16

 2.24 PlyA -  38106  38101    6                               1.05
 2.23 Term -  39227  38991  237  2  0   89   54    30 0.050  -4.13
 2.22 Intr -  51483  51417   67  2  1   69   99    24 0.252   0.51
 2.21 Intr -  59977  59847  131  1  2   59   92   125 0.841   9.49
 2.20 Intr -  68625  68469  157  2  1   73  121    61 0.890   7.91
 2.19 Intr -  73434  72652  783  2  0   70  100   523 0.927  42.29
 2.18 Intr -  74832  74679  154  1  1   78   65   114 0.972   7.23
 2.17 Intr -  82129  82022  108  2  0   59   72    65 0.782   2.26
 2.16 Intr -  88493  88440   54  0  0  131   53    46 0.870   4.35
 2.15 Intr -  93087  92953  135  1  0   41   73    84 0.464   2.74
 2.14 Intr -  96923  96860   64  2  1   86   85    48 0.111   2.59
 2.13 Intr - 102662 102510  153  2  0   77   29   164 0.115   9.67
 2.12 Intr - 105892 105552  341  2  2   65   70    86 0.307  -0.31
 2.11 Intr - 106748 106604  145  2  1   88   91    26 0.883   2.76
 2.10 Intr - 107204 107044  161  0  2   84  111   130 0.761  14.61
 2.09 Intr - 114137 113950  188  1  2   66   94   149 0.998  12.63
 2.08 Intr - 115246 115113  134  1  2   99  101    66 0.997   8.44
 2.07 Intr - 116705 116601  105  2  0   61  107    62 0.899   5.81
 2.06 Intr - 119316 119192  125  1  2   77   53   168 0.997  12.50
 2.05 Intr - 126739 126529  211  1  1  109   51   138 0.996  10.69
 2.04 Intr - 128970 128632  339  0  0   96   80   191 0.999  14.77
 2.03 Intr - 137240 137104  137  0  2  104   49   139 0.898  11.89
 2.02 Intr - 147938 147846   93  0  0   90   85    75 0.982   7.34
 2.01 Init - 149864 149639  226  2  1   88   57   265 0.988  20.03
 2.00 Prom - 155856 155817   40                              -4.96

 3.20 PlyA - 160859 160854    6                               1.05
 3.19 Term - 161770 161681   90  1  0  118   34    63 0.969   1.62
 3.18 Intr - 163023 162919  105  0  0   61   92    64 0.851   4.51
 3.17 Intr - 165774 165649  126  0  0   29   80   116 0.919   5.78
 3.16 Intr - 175522 175362  161  2  2   68   48   136 0.502   7.31
 3.15 Intr - 180460 180355  106  1  1  120   84   151 0.989  17.69
 3.14 Intr - 180650 180554   97  1  1  131   76    48 0.661   7.91
 3.13 Intr - 180838 180744   95  1  2   39   91   -11 0.659  -6.94
 3.12 Intr - 181039 180944   96  1  0   70  110   107 0.991  11.31
 3.11 Intr - 184506 184387  120  0  0   80   83    64 0.888   5.79
 3.10 Intr - 194800 194640  161  2  2  113   87    66 0.839   8.71
 3.09 Intr - 195005 194878  128  1  2   47   99    73 0.986   4.62
 3.08 Intr - 195623 195458  166  0  1   83   47   108 0.964   5.22
 3.07 Intr - 198225 198051  175  0  1   76   52   109 0.539   5.61
 3.06 Intr - 202034 201950   85  1  1   83   89    52 0.971   4.62
 3.05 Intr - 204397 204300   98  1  2   80   17   121 0.918   3.01
 3.04 Intr - 218385 218227  159  0  0   69   80    91 0.937   6.58
 3.03 Intr - 219577 219463  115  0  1   85  115    14 0.569   4.25
 3.02 Intr - 230598 230424  175  1  1   71  111    94 0.741   9.00
 3.01 Init - 230996 230795  202  2  1   55   60   208 0.588  11.74
 3.00 Prom - 235900 235861   40                              -3.06

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815587r:47617425_47867288|GENSCAN_predicted_peptide_1|82_aa
MADAASQVLLGSGLTILSQPLMYVKVLIQHYQESDKGEELGPGNVQKEVSSSFDHVIKEF
FASMLTYPFVLVSNLMAVNNCG

>gi568815587r:47617425_47867288|GENSCAN_predicted_CDS_1|249_bp
atggcggacgcggccagtcaggtgctcctgggctccggtctcaccatcctgtcccagccg
ctcatgtacgtgaaagtgctcatccagcattaccaggagagtgacaagggtgaggagtta
ggacctggaaatgtacagaaagaagtctcatcttcctttgaccacgttatcaaggagttt
tttgcgagtatgttgacctatccctttgtgcttgtctccaatcttatggctgtcaacaac
tgtgggtaa

>gi568815587r:47617425_47867288|GENSCAN_predicted_peptide_2|1415_aa
MGKKSRAVPGRRPILQLSPPGPRGSTPGRDPEPEPDTEPDSTAAVPSQPAPSAATTTTTA
VTAAAASDDSPSEGKDEQEAVQEVPRVVQNPPKPVMTTRPTAVKATGGLCLLGAYADSDD
DDNDVSEKLAQSKETNGNQSTDIDSTLANFLAEVNEGIQALSNSEEEKKGVAASLLAPLL
PEGIKEEEERWRRKVICKEEPVSEVKETSTTVEEATTIVKPQEIMLDNIEDPSQEDLCSV
VQSGESEEEEEQDTLELELVLERKKAELRALEEGDGSVSGSSPRSDISQPASQDGMRRLM
SKRGKWKMFVRATSPESTSRSSSKTGRDTPENGETAIGAENSEKIDENSDKEMEVEESPE
KIKVQTTPKVEEEQDLKFQIGELANTLTSKFEFLGINRQSISNFHVLLLQTETRIADWRE
GALNGNYLKRKLQDAAEQLKQYEINATPKGWSCHWDRDHRRYFYVNEQSGESQWEFPDGE
EEEEESQAQENRDETLAKQTLKDKTGTDSNSTESSETSTESPPPPPPPPPPAEDGEIQEV
EMEDEGSEEPPAPGTEEDTPLKPSAQTTVVTSQSSVDSTISSSSSTKGIKRKATEISTAV
VQRSATIGSSPVLYSQSAIATGHQAAGIGNQATGIGHQTIPVSLPAAGMGHQARGMSLQS
NYLGLAAAPAIMSYAECSVPIGVTAPSLQPVQARGAVPTATIIEPPPPPPPPPPPPPPAP
KMPPPEKTKKGRKDKAKKSKTKMPSLVKKWQSIQRELDEEDNSSSSEEDRESTAQKRIEE
WKQQQLTIPDPYEDFMYRHLQYYGYFKAQRGSLPNSATHQHVRKNNPQCLLNGSLGEKDD
LIPDTLQKEKLLWPISLSSAVHRQIEAINREWAPPQPEYFYQPKGNEKVPEIVGEKKGTV
VYQLDSVPIEGSYFTSSRVGGKRGIVKELAVTLQGPEDNTLLFESRFESGNLQKAVRVDT
YEYELTLRTDLYTNKHTQWFYFRVQNTRKDATYRFTIVNLLKPKSLYTVGMKPLLYSQLD
ANTRNIGWRREGNEIKYYKNNTDDGQQPFYCLTWTIQFPYDQDTCFFAHFYPYTYTDLQC
YLLSVANNPIQSQFCKLQTLCRSLAGNTVYLLTITNPSQTPQEAAAKKAVVLSARVHPGE
SNGSWVMKGFLDFILSNSPDAQLLRDIFVFKVLPMLNPDGVIVGNYRCSLAGRDLNRHYK
TILKESFPCIWYTRNMIKRLLEEREVLLYCDFHGHSRKNNIFLYGCNNNNRKYWLHERVF
PLMLCKNAPDKFTQCLAELKELLRQEIHKKFHELGQDVDLEGSWSDISLSDIESSTSGSD
SSLSDGLPVHLANIADEAAKMASGKYAIKWSWASTQVSPSALEIYLLLLKLKFPHLSPSV
AISHLHFKSLAPSLLPPSSEQTAILVKTGSEARLP

>gi568815587r:47617425_47867288|GENSCAN_predicted_CDS_2|4248_bp
atggggaagaagtcccgggcggtacccggccgtaggcccatcctgcaactctctccgccg
ggtcctcggggcagcacgccgggccgggacccggagccggaacccgacactgagccggac
tcaaccgcggcggtccccagccagcccgccccgtcggcggcgacgaccaccaccaccgcg
gtgactgccgccgcggcctcggacgactcgccttcagaaggcaaggatgaacaggaagcg
gtgcaggaggttcctagagttgttcagaatcctccaaaaccagtcatgaccactagaccc
acagctgttaaagcaacaggcggtctatgcttgcttggtgcttatgctgacagtgatgac
gatgacaatgatgtttccgaaaaactagcacaatccaaagagacaaatggaaaccagtca
actgatattgatagtacattggccaacttcctagcggaagtaaatgaaggaattcaggct
ctctcaaatagtgaggaggagaagaaaggggtggcagcatcgctgcttgctcctttattg
cctgagggaataaaagaagaagaagagagatggagaagaaaagtaatttgtaaagaggag
ccagtttcagaagtaaaagaaacaagtacaacagtagaagaagcaacaacaatagtaaag
ccacaggaaattatgttggacaatatagaagacccttctcaggaggatctttgcagtgtt
gtccaatctggagaaagtgaggaggaagaggaacaagatacccttgaactggagctagtt
ttggaaaggaaaaaagcagagttgcgagccttggaggaaggagatggtagtgtgtcaggg
tctagtccacgttctgatatcagccagccagcatctcaagatggaatgcgtaggcttatg
tctaaaagaggaaaatggaagatgtttgttcgagctaccagtccagaatctaccagtagg
agttctagtaaaactggacgagatactccagaaaatggagaaactgcaattggtgctgaa
aattcagaaaaaatagatgagaattcagataaagagatggaagtagaagaatctccagag
aaaataaaagtacagacaacaccaaaagtagaagaagaacaggatttgaaatttcagatt
ggagaactggcaaataccctgacaagtaaattcgagtttctaggcattaatagacaatcc
atctccaactttcatgtgctgctcttacagactgagactcgaattgcagactggcgggaa
ggggctcttaatggaaactaccttaaacgaaaacttcaggatgcagcagaacaactaaaa
cagtatgaaataaacgccactcctaaaggctggtcctgccactgggacagggatcataga
cggtatttctatgtaaacgaacagtcgggcgagtctcagtgggagtttccagatggtgaa
gaggaagaagaagaaagccaagcacaagaaaatagagatgagactcttgccaaacagacc
ttgaaagacaaaactggcactgattcaaattcaacagaatcctctgaaacttccacagaa
tcacctccaccccctcctccaccacctcctcctgcggaagatggtgagatccaggaggta
gagatggaggatgagggaagtgaggagccccctgccccaggaacagaggaagatacccct
ttgaaaccttcagcacaaaccacagttgtaactagccagagttcagttgattccaccatc
tctagttcttcttccactaaaggaataaagaggaaagctacagaaattagcactgcagtg
gttcagaggtcagctaccattggcagttctccagttctctatagccagtcagctatagct
acaggtcaccaggcagcagggattggaaaccaggcaacaggaattggacatcagacaata
ccagttagccttccagcagcaggaatgggtcatcaggccagaggaatgagcctgcagtca
aattaccttggactagcggcagcacctgcaattatgagttatgcagaatgttctgtccca
attggagtgactgctccctcattgcagccagttcaggcccgaggtgctgtgcctaccgct
accattatagaaccaccaccaccacctcctcctcctcctcctccaccaccaccagctccc
aaaatgccaccacctgaaaagacaaaaaaaggaaggaaagacaaggcaaagaagagtaag
accaaaatgccatctttggtaaaaaagtggcagagtatccagcgtgagttagatgaagag
gacaattctagttccagtgaagaggatcgggaatcaactgcacagaagcgaattgaagag
tggaaacagcagcagctgactattcctgatccttatgaagactttatgtaccgtcacctc
caatattatggctactttaaagctcagagaggcagtttaccaaactctgctacgcatcag
catgttcggaagaataaccctcaatgcctgttgaatggctctcttggggaaaaagatgat
ttgataccagacaccctgcaaaaggagaagcttctatggcctatcagtttatcttcagct
gtgcacagacagatagaagccatcaacagagagtgggctccacctcaaccagaatatttc
tatcagcctaaaggaaatgaaaaggtaccagagattgtaggagagaaaaaaggaacagtt
gtctatcaattagattcagtgcctatagaaggttcctattttaccagttccagagtggga
ggcaaacgaggaattgtcaaggaacttgctgtcacgttgcaaggaccagaagataatact
ctactgtttgaatcaaggtttgagagtgggaatctgcaaaaagctgtcagagtagacacc
tatgagtatgaactcaccttgcgaactgacctctacactaacaaacacactcagtggttt
tattttcgtgttcagaacaccagaaaagatgctacctatcgcttcaccattgtcaacttg
ctaaaacccaagagtctttatactgtagggatgaagccactcttgtactcccaattggat
gccaacacccgcaatattggctggaggagagaaggaaatgaaatcaagtactacaagaac
aacacggatgatgggcagcagcccttctactgtctcacgtggaccattcagtttccatat
gaccaggacacttgcttctttgcacacttctacccatatacatacactgatttgcaatgc
tacctcctgtcagtggcaaacaaccctatccagtctcagttctgcaagctccaaacttta
tgcaggagcctagcaggaaataccgtttacttgctcaccatcaccaacccatcccagacc
cctcaagaggcagctgcaaagaaagctgtggtcttgagtgccagagttcaccctggagaa
agtaatggctcctgggttatgaaaggctttttggacttcatccttagcaactccccagat
gcccagctcctcagagatatttttgtcttcaaggtgcttcccatgttaaatccagatggt
gtgattgtggggaattatcggtgttccttggccggaagggatttgaacaggcattataaa
accattctgaaggagtctttcccttgtatttggtacaccaggaacatgatcaaaagactt
cttgaagaaagagaggttctgttgtattgtgatttccatggccacagtcgtaagaataat
atcttcctgtatggctgtaataacaacaatcgcaaatactggcttcatgaacgagtcttt
cctttaatgttatgcaaaaatgcaccagataagttcactcagtgtctagcagagcttaag
gagcttttacgacaggaaatccacaagaaattccatgaacttggacaagatgtagattta
gaaggaagttggagtgacatctctttgtctgacattgaatccagcaccagtggctctgac
agttctctctcagatggtcttcctgttcacctagcaaacatagcagatgaggctgccaag
atggcttcgggaaaatatgccatcaagtggtcctgggccagcacacaagtatctccttca
gccttggaaatttatcttctgcttctgaaactgaagttcccgcacctttccccctccgtg
gctatttcacacctccatttcaagtctcttgctccctctctgctgccaccttcctccgag
caaacagccatactagtcaagactggttcagaagctaggctcccgtga

>gi568815587r:47617425_47867288|GENSCAN_predicted_peptide_3|819_aa
MLHLSAAPPAPPPEVTATARPCLCSVGRRGDGGKMAAAGALERSFVELSGAERERPRHFR
EFTVCSIAPLVPVISQKTRVQADHVHFAGTANAVAGAVKYSESAGGFYYVESGKLFSVTR
NRFIHWGDQSPSDRPLSLAVHCVEHDAFIFALCQDHKLRMWSYKEQMCLMVADMLEYVPV
KKDLRLTAGTGHKLRLAYSPTMGLYLGIYMHAPKRGQETLIDFALTSTDIWALWHDAENQ
TVVKYINFEHNVAGQWNPVFMQPLPEEEIVIRDDQDPRLQGSVTEYEFSQEEFRNLQQEF
WCKFYACCLQYQEALSHPLALHLNPHTNMVCLLKKVNVDIARDVICLIKCLRLIEESVTV
DMSVIMEMSCYNLQSPEKAAEQILEDMITIDVENVMEDICSKLQEIRNPIHAIGLLIREM
DYETEVEMEKGFNPAQPLNIRMNLTQLYGSNTAGYIVCRGVHKIASTRFLICRDLLILQQ
LLMRLGDAALECFCQAASEVGKEEFLDRLIRSEDGEIVSTPRLQYYDKVLRLLDVIGLPE
LVIQLATSAITEAGDDWKSQATLRTCIFKHHLDLGHNSQAYEALTQIPDSSRQLDCLRQL
VVVLCERSQLQDLVEFPYVNLHNEVVGIIESRARAVDLMTHNYYELLYAFHIYRHNYRKA
GTVMFEYGMRLGREVRTLRGLEKQGNCYLAALNCLRLIRPEYAWIVQPVSGAVKVDAAEL
LRLYLNYDLLEEAVDLVSEYVDAVLGKGHQYFGIEFPLSATAPMVWLPYSSIDQLLQALG
ENSANSHNIALSQKILDKLEDYQQKVDKATRDLLYRRTL

>gi568815587r:47617425_47867288|GENSCAN_predicted_CDS_3|2460_bp
atgcttcacctgtccgcagctccgcccgccccacccccggaagtgacggcgaccgcgcgg
ccctgcctttgttccgttgggcgtcgcggcgacggcgggaagatggcggcggcgggagcc
ctggaacggagcttcgtggagctaagcggagctgagcgcgaaaggccgaggcactttcgg
gaattcacagtctgcagcattgctcccttagtccctgtcatttctcagaagacgcgtgtg
caggctgaccacgttcattttgcagggactgcaaatgccgtggctggcgccgtaaaatac
agtgaaagcgcgggaggcttttactacgtggagagtggcaagttgttctccgtaaccaga
aacaggttcattcattggggtgaccagtcgccttcagatcgtcccctcagtcttgctgtt
cattgtgtggagcatgatgccttcatctttgctttgtgtcaggatcataaactacgaatg
tggtcttacaaggagcaaatgtgcctaatggtagctgacatgctggagtatgtccctgtg
aagaaagaccttcggcttactgctggaactggacacaaattacggcttgcttattccccc
accatgggactctacctggggatatacatgcatgcaccaaaacgaggacaggagacactg
attgactttgccttaacttccacggatatctgggccctgtggcatgatgctgagaaccaa
acagtagtgaaatacatcaactttgaacataatgttgcaggtcagtggaatccagttttt
atgcagcctctgccagaggaagagattgtcatcagagatgatcaagaccccagacttcaa
ggaagtgtaacagagtatgaattctcccaggaggagtttcgaaatttacaacaagaattc
tggtgcaagttctatgcctgttgtcttcagtatcaagaagccctctctcaccctcttgcc
ctacatttgaatccacacacaaacatggtgtgcctgctgaaaaaagtaaatgtggacatc
gctcgggatgtcatatgtcttataaaatgcctccggctgattgaagagtcagtaactgtg
gatatgtcagttataatggaaatgagttgttataacctacagtctccggaaaaggctgca
gagcagattctggaagatatgatcactattgatgtagaaaatgtgatggaggatatttgt
agtaaactgcaagagattaggaacccaatccatgcaattggactacttatacgggaaatg
gattatgaaacagaagtggaaatggaaaagggattcaatccagctcagcctttgaatatt
cgaatgaatcttacccagctctatggtagtaacacagcagggtatattgtgtgcagaggg
gtgcataaaatcgccagtactcgtttcctgatctgcagagatcttttgatcttacagcag
ctgttaatgaggcttggagatgctgctctggaatgtttttgtcaggcagcatctgaagta
ggcaaagaggaattcttggatcgcttgattcgctcagaggatggggagatcgtgtctacc
cccaggctgcagtattatgacaaggttttacgactactagatgtcattggtttgcctgaa
ctggttattcagttggctacatcagccataactgaagcaggtgatgactggaaaagtcag
gctactctaaggacatgtattttcaaacatcatttggatttgggtcacaatagccaagca
tatgaagccttaacccaaattcctgattccagcaggcaattagattgtttacggcagttg
gtggtagttctttgtgaacgctcacagctacaggatcttgtagagtttccctatgtgaat
ctgcataatgaggttgtgggaataattgagtcacgtgctagagctgtggaccttatgact
cacaattactatgaacttctgtatgcctttcacatctatcgccacaattaccgcaaggct
ggcacagtgatgtttgagtatggaatgcggcttggcagagaagttcgaactctccgggga
cttgagaaacaaggcaactgttatctggctgctctcaattgtttacgacttattcgtcca
gaatatgcgtggattgtgcagccagtgtctggtgcagtgaaggttgatgctgctgaattg
cttcgtttatacttaaactatgaccttttagaagaagctgtggatttggtgtcagaatat
gtggatgctgtattgggaaaaggacatcaatacttcggaattgagtttccactgtccgca
acagccccaatggtgtggcttccatactcctctattgatcagcttctccaagctctggga
gagaacagtgccaacagtcacaacatcgcactgtcccagaaaatacttgacaaattggag
gactaccagcaaaaagttgataaggcaacacgggatttattatatcgtcggaccttgtga