GENSCAN 1.0 Date run: 30-Jun-119 Time: 14:57:01 Sequence gi568815588r:70498056_70700902 : 202847 bp : 49.54% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 15830 15910 81 1 0 86 99 128 0.783 12.70 1.02 Intr + 17313 17532 220 2 1 8 44 162 0.347 1.67 1.03 Intr + 25074 25154 81 1 0 99 78 66 0.088 6.31 1.04 Intr + 27940 28081 142 2 1 3 94 153 0.021 6.81 1.05 Intr + 31174 31276 103 1 1 131 63 240 0.995 25.98 1.06 Intr + 31834 32013 180 0 0 93 110 178 0.981 20.56 1.07 Intr + 33235 33399 165 0 0 77 78 253 0.857 23.36 1.08 Intr + 34566 34726 161 2 2 99 81 289 0.977 28.09 1.09 Intr + 34940 35015 76 2 1 105 60 77 0.993 6.12 1.10 Intr + 35867 36018 152 1 2 90 70 142 0.676 11.56 1.11 Intr + 36370 36469 100 1 1 66 105 85 0.998 8.11 1.12 Intr + 36684 36788 105 2 0 75 94 79 0.993 7.61 1.13 Intr + 39756 39851 96 2 0 111 75 145 0.999 15.71 1.14 Intr + 40225 40436 212 0 2 60 86 217 0.536 16.41 1.15 Intr + 40797 40983 187 0 1 58 31 246 0.318 15.59 1.16 Intr + 41037 41192 156 2 0 82 78 159 0.993 14.51 1.17 Intr + 41525 41707 183 1 0 80 78 162 0.974 14.38 1.18 Intr + 43047 43187 141 2 0 99 70 242 0.558 24.05 1.19 Intr + 43408 43479 72 0 0 55 99 89 0.947 6.30 1.20 Intr + 49251 49391 141 2 0 110 99 329 0.998 36.75 1.21 Intr + 57417 57543 127 2 1 50 55 89 0.584 2.05 1.22 Intr + 63939 64079 141 1 0 64 76 50 0.427 1.72 1.23 Intr + 64222 64317 96 2 0 117 60 43 0.579 4.38 1.24 Intr + 64553 64635 83 0 2 73 64 19 0.585 -2.64 1.25 Intr + 66309 66464 156 2 0 138 78 247 0.999 28.91 1.26 Term + 68526 68678 153 2 0 126 50 272 0.933 25.12 1.27 PlyA + 81226 81231 6 1.05 2.05 PlyA - 81301 81296 6 1.05 2.04 Term - 90285 90248 38 1 2 116 50 26 0.213 -0.90 2.03 Intr - 93657 93506 152 2 2 47 44 116 0.193 2.91 2.02 Intr - 101126 100002 1125 1 0 100 69 1306 0.181 118.56 2.01 Init - 102847 102309 539 1 2 63 94 879 0.999 78.34 2.00 Prom - 114531 114492 40 -5.36 3.00 Prom + 117476 117515 40 1.74 3.01 Sngl + 134661 135056 396 2 0 37 44 301 0.971 16.85 3.02 PlyA + 135193 135198 6 1.05 4.00 Prom + 143343 143382 40 -4.06 4.01 Init + 151055 151186 132 1 0 90 110 90 0.911 11.54 4.02 Intr + 152248 152344 97 0 1 59 44 84 0.448 0.68 4.03 Intr + 154527 154576 50 1 2 104 94 -1 0.482 0.50 4.04 Term + 156851 156946 96 1 0 44 48 125 0.439 2.07 4.05 PlyA + 159608 159613 6 1.05 5.00 Prom + 165744 165783 40 -3.16 5.01 Init + 174748 174829 82 0 1 83 96 193 0.999 18.73 5.02 Intr + 176501 176940 440 0 2 83 87 250 0.909 17.63 5.03 Term + 177727 177813 87 0 0 102 45 58 0.366 0.56 5.04 PlyA + 178356 178361 6 1.05 6.06 PlyA - 180064 180059 6 1.05 6.05 Term - 183660 183494 167 1 2 38 48 173 0.693 6.38 6.04 Intr - 191525 191317 209 1 2 95 70 70 0.640 4.62 6.03 Intr - 192775 192630 146 1 2 15 43 88 0.407 -3.92 6.02 Intr - 192902 192789 114 2 0 54 42 87 0.330 1.34 6.01 Init - 195927 195739 189 0 0 67 40 190 0.601 11.12 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 4408 4248 161 2 2 55 42 156 0.879 5.80 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588r:70498056_70700902|GENSCAN_predicted_peptide_1|1169_aa MAGPHPVTQGPLLAAGQALSSREPACQTKQNEESDVMIVVVKEGGCCGPAQPGSALLAAL VESQGLLAIHRWRLGEAPGSSPGPEGSRLWQLAVAGSSVQAVNICVVSGPPLQGTSCSLS LPLEPMRGTPFEGLQGSGTMDSRHSVSIHSFQSTSLHNSKAKSIIPNKVAPVVITYNCKE EFQIHDELLKAHYTLGRLSDNTPEHYLVQGRYFLVRDVTEKMDVLGTVGSCGAPNFRQVQ GGLTVFGMGQPSLSGFRRVLQKLQKDGHRECVIFCVREEPVLFLRADEDFVSYTPRDKQN LHENLQGLGPGVRVESLELAIRKEIHDFAQLSENTYHVYHNTEDLWGEPHAVAIHGEDDL HVTEEVYKRPLFLQPTYRYHRLPLPEQGSPLEAQLDAFVSVLRETPSLLQLRDAHGPPPA LVFSCQMGVGRTNLGMVLGTLILLHRSGTTSQPEAAPTQAKPLPMEQFQVIQSFLRMVPQ GRRMVEEVDRAITACAELHDLKEVVLENQKKLEGIRPESPAQGSGSRHSVWQRALWSLER YFYLILFNYYLHEQYPLAFALSFSRWLCAHPELYRLPVTLSSAGPVAPRDLIARGSLVSM TGSRRSGHGCVLALAPKHWIPVGLGRLLWVPGLVLSPQREDDLVSPDALSTVREMDVANF RRVPRMPIYGTAQPSAKVTGPQGLGPPALGSILAYLTDAKRRLRKVVWVSLREEAVLECD GHTYSLRWPGPPVAPDQLETLEAQLKAHLSEPPPGKEGPLTYRFQTCLTMQEVFSQHRRA CPGLTYHRIPMPDFCAPREEDFDQLLEALRAALSKDPGTGFVFSCLSGQGRTTTAMVVAV LAFWHIQGFPEVGEEELVSVPDAKFTKGEFQVVMKVVQLLPDGHRVKKEVDAALDTVSET MTPMHYHLREIIICTYRQPRPGGNLVVAEKGSVGREAASRYCVTAWVGAELVLRPKELPG VCGLGDGHGVVVPPCWMDERISELSVVHTCLLIPHFAEKSYFHKPGAWPQSSFWMDGWAG FASYHLLCDPGGKDSKGRGDVPGARGMTPVPRPLPGAGRAAVGLGDGAKAAKEAQEMRRL QLRSLQYLERYVCLILFNAYLHLEKADSWQRPFSTWMQEVASKAGIYEILNELGFPELES GEDQPFSRLRYRWQEQSCSLEPSAPEDLL >gi568815588r:70498056_70700902|GENSCAN_predicted_CDS_1|3510_bp atggctggcccacacccggtgacccagggaccgctgctggctgcggggcaggctttgtca tcccgagaacccgcctgccagacaaagcagaatgaggagagtgatgtgatgatcgtggtt gttaaggagggtggctgctgtggcccagcccagcctggctctgccctgctggctgccttg gttgagagccagggactgctggccattcacagatggcgcctgggcgaggcccctggttct agtcctggtcctgaaggttcacgtctgtggcagctggcagttgctgggagttcagtgcag gctgtgaacatctgtgtcgtctctgggccacccttgcagggcaccagctgtagcctgtca ttgcctctggagcccatgagaggcaccccatttgagggcctacagggcagtggcacgatg gacagtcggcactccgtcagcatccactccttccagagcactagcttgcataacagcaag gccaagtccatcatccccaacaaggtggcccctgttgtgatcacgtacaactgcaaggag gagttccagatccatgatgagctgctcaaggctcattacacgttgggccggctctcggac aacacccctgagcactacctggtgcaaggccgctacttcctggtgcgggatgtcactgag aagatggatgtgctgggcaccgtgggaagctgtggggcccccaacttccggcaggtgcag ggtgggctcactgtgttcggcatgggacagcccagcctctcagggttcaggcgggtcctc cagaaactccagaaggacggacatagggagtgtgtcatcttctgtgtgcgggaggaacct gtgcttttcctgcgtgcagatgaggactttgtgtcctacacacctcgagacaagcagaac cttcatgagaacctccagggccttggacccggggtccgggtggagagcctggagctggcc atccggaaagagatccacgactttgcccagctgagcgagaacacataccatgtgtaccat aacaccgaggacctgtggggggagccccatgctgtggccatccatggtgaggacgacttg catgtgacggaggaggtgtacaagcggcccctcttcctgcagcccacctacaggtaccac cgcctgcccctgcccgagcaagggagtcccctggaggcccagttggacgcctttgtcagt gttctccgggagacccccagcctgctgcagctccgtgatgcccacgggcctcccccagcc ctcgtcttcagctgccagatgggcgtgggcaggaccaacctgggcatggtcctgggcacc ctcatcctgcttcaccgcagtgggaccacctcccagccagaggctgcccccacgcaggcc aagcccctgcctatggagcagttccaggtgatccagagctttctccgcatggtgccccag ggaaggaggatggtggaagaggtggacagagccatcactgcctgtgccgagttgcatgac ctgaaagaagtggtcttggaaaaccagaagaagttagaaggtatccgaccggagagccca gcccagggaagcggcagccgacacagcgtctggcagagggcgctgtggagcctggagcga tacttctacctgatcctgtttaactactaccttcatgagcagtacccgctggcctttgcc ctcagtttcagccgctggctgtgtgcccaccctgagctgtaccgcctgcccgtgacgctg agctcagcaggccctgtggctccgagggacctcatcgccaggggctccctagtgagtatg actggcagtcggagaagtgggcatggctgtgtactggccctggcccctaaacactggatt cctgtagggttgggtcggctgctgtgggtcccaggcttggtgctctccccacagcgggag gacgatctggtctccccggacgcgctcagcactgtcagagagatggatgtggccaacttc cggcgggtgccccgcatgcccatctacggcacggcccagcccagcgccaaggtgaccggc cctcagggcctgggtcccccagccctggggagcatcctggcctacctgacggacgccaag aggaggctgcggaaggttgtctgggtgagccttcgggaggaggccgtgttggagtgtgac gggcacacctacagcctgcggtggcctgggccccctgtggctcctgaccagctggagacc ctggaggcccagctgaaggcccatctaagcgagcctcccccaggcaaggagggccccctg acctacaggttccagacctgccttaccatgcaggaggtcttcagccagcaccgcagggcc tgtcctggcctcacctaccaccgcatccccatgccggacttctgtgccccccgagaggag gactttgaccagctgctggaggccctgcgggccgccctctccaaggacccaggcactggc ttcgtgttcagctgcctcagcggccagggccgtaccacaactgcgatggtggtggctgtc ctggccttctggcacatccaaggcttccccgaggtgggtgaggaggagctcgtgagtgtg cctgatgccaagttcactaagggtgaatttcaggtagtaatgaaggtggtgcagctgcta cccgatgggcaccgtgtgaagaaggaggtggacgcagcgctggacactgtcagcgagacc atgacgcccatgcactaccacctgcgggagatcatcatctgcacctaccgccagccacga cccgggggcaatcttgtggtggcagagaagggaagcgttggcagggaagctgccagccgc tactgtgtgactgcctgggttggtgcagagttggtcctgcgaccaaaggagcttccagga gtatgtggcttgggtgatgggcacggtgtagtggtccctccctgctggatggatgagcgg attagtgagctttctgtggtgcacacgtgccttctcatccctcactttgctgagaaatcc tacttccataaacctggggcctggccccagagctccttctggatggatgggtgggcaggc tttgcaagttaccatctactctgcgaccctggaggcaaagacagcaaaggccgtggagat gttcctggtgcccgagggatgacccctgtgccacgtcctcttcctggagcagggagagct gctgtgggtttgggggatggggcgaaggcagcgaaagaggcgcaagaaatgcggaggctg cagctgcggagcctgcagtacttggagcgctatgtctgcctgattctcttcaacgcgtac ctccacctggagaaggccgactcctggcagaggcccttcagcacctggatgcaggaggtg gcatcgaaggctggcatctacgagatccttaacgagctgggcttccccgagctggagagc ggggaggaccagcccttctccaggctgcgctaccggtggcaggagcagagctgcagcctc gagccctctgcccccgaggacttgctgtag >gi568815588r:70498056_70700902|GENSCAN_predicted_peptide_2|617_aa MAARLLLLGILLLLLPLPVPAPCHTAARSECKRSHKFVPGAWLAGEGVDVTSLRRSGSFP VDTQRFLRPDGTCTLCENALQEGTLQRLPLALTNWRAQGSGCQRHVTRAKVSSTEAVARD AARSIRNDWKVGLDVTPKPTSNVHVSVAGSHSQAANFAAQKTHQDQYSFSTDTVECRFYS FHVVHTPPLHPDFKRALGDLPHHFNASTQPAYLRLISNYGTHFIRAVELGGRISALTALR TCELALEGLTDNEVEDCLTVEAQVNIGIHGSISAEAKACEEKKKKHKMTASFHQTYRERH SEVVGGHHTSINDLLFGIQAGPEQYSAWVNSLPGSPGLVDYTLEPLHVLLDSQDPRREAL RRALSQYLTDRARWRDCSRPCPPGRQKSPRDPCQCVCHGSAVTTQDCCPRQRGLAQLEVT FIQAWGLWGDWFTATDAYVKLFFGGQELRTSTVWDNNNPIWSVRLDFGDVLLATGGPLRL QVWDQDSGRDDDLLGTCDQAPKSGSHEVRCNLNHGHLKFRYHARCLPHLGGGTCLDYVPQ MLLGEPPGNRSGAVWQKDHWADSSPCSSFPSLSFREGAMEGPDTAQEKVEVELGQPPLFS VADNRTIFNTVAGKDSL >gi568815588r:70498056_70700902|GENSCAN_predicted_CDS_2|1854_bp atggcagcccgtctgctcctcctgggcatccttctcctgctgctgcccctgcccgtccct gccccgtgccacacagccgcacgctcagagtgcaagcgcagccacaagttcgtgcctggt gcatggctggccggggagggtgtggacgtgaccagcctccgccgctcgggctccttccca gtggacacacaaaggttcctgcggcccgacggcacctgcaccctctgtgaaaatgcccta caggagggcaccctccagcgcctgcctctggcgctcaccaactggcgggcccagggctct ggctgccagcgccatgtaaccagggccaaagtcagctccactgaagctgtggcccgggat gcggctcgtagcatccgcaacgactggaaggtcgggctggacgtgactcctaagcccacc agcaatgtgcatgtgtctgtggccggctcacactcacaggcagccaactttgcagcccag aagacccaccaggaccagtacagcttcagcactgacacggtggagtgccgcttctacagt ttccatgtggtacacactcccccgctgcaccctgacttcaagagggccctcggggacctg ccccaccacttcaacgcctccacccagcccgcctacctcaggcttatctccaactacggc acccacttcatccgggctgtggagctgggtggccgcatatcggccctcactgccctgcgc acctgcgagctggccctggaagggctcacggacaacgaggtggaggactgcctgactgtc gaggcccaggtcaacataggcatccacggcagcatctctgccgaagccaaggcctgtgag gagaagaagaagaagcacaagatgacggcctccttccaccaaacctaccgggagcgccac tcggaagtggttggcggccatcacacctccattaacgacctgctgttcgggatccaggcc gggcccgagcagtactcagcctgggtaaactcgctgcccggcagccctggcctggtggac tacaccctggaacccctgcacgtgctgctggacagccaggacccgcggcgggaggcactg aggagggccctgagtcagtacctgacggacagggctcgctggagggactgcagccggccg tgcccaccagggcggcagaagagcccccgagacccatgccagtgtgtgtgccatggctca gcggtcaccacccaggactgctgccctcggcagaggggcctggcccagctggaggtgacc ttcatccaagcatggggcctgtggggggactggttcactgccacggatgcctatgtgaag ctcttctttggtggccaggagctgaggacgagcaccgtgtgggacaataacaaccccatc tggtcagtgcggctggattttggggatgtgctcctggccacaggggggcccctgaggttg caggtctgggatcaggactctggcagggacgatgacctccttggcacctgtgatcaggct cccaagtctggttcccatgaggtgagatgcaacctgaatcatggccacctaaaattccgc tatcatgccaggtgcttgccccacctgggaggaggcacctgcctggactatgtcccccaa atgcttctgggggagcctccaggaaaccggagtggggccgtgtggcagaaggaccactgg gctgattccagtccatgcagcagcttcccctcactctccttccgtgaaggagccatggaa gggccagacactgcccaggagaaggtggaggtggagctgggccagccgcctctcttctct gtagctgacaacagaaccatcttcaacacagtggctggaaaggactcactttag >gi568815588r:70498056_70700902|GENSCAN_predicted_peptide_3|131_aa MEENQDRENITNIWEDYTIKDVIIATEKAVKAIKPESINSCWRKLCPDVVHDFVGFMTEP IKELKTEIVVMAKRSGDGGEGFQDMDLGEIQELIDTTSEELIEDDLMEMNASKQLQDDEK EDTEQIAPETN >gi568815588r:70498056_70700902|GENSCAN_predicted_CDS_3|396_bp atggaagaaaaccaagacagagagaatatcacgaatatctgggaggattacaccattaaa gatgtcatcattgctacagaaaaagctgtgaaagccatcaagcctgaatcaataaattcc tgctggagaaaactgtgtccagatgttgtgcatgacttcgtaggatttatgacagagcca atcaaagaactcaaaacagagattgtggttatggcaaaaaggtcgggggatggtggtgaa gggtttcaagatatggatcttggagaaattcaagagctaatagacaccacatcagaggaa ttaatagaagatgacttgatggagatgaatgcttccaaacaactgcaagatgatgagaaa gaagacacagaacaaatagcgccagaaacaaattga >gi568815588r:70498056_70700902|GENSCAN_predicted_peptide_4|124_aa MEKVLKQLEAQSIKKEQTFAGRVGWAFLTVQWEVHTLSLRDVVQQQFQKMLEGEKDITAR PSPSWVLQIKDYLLQPGRRRKGQGVPKADRELQPGLQSETLSNNNNNNNNNNNNNKTPKA LEQE >gi568815588r:70498056_70700902|GENSCAN_predicted_CDS_4|375_bp atggagaaggtgctgaagcagctggaagcacagagcatcaagaaggagcaaacctttgct ggcagagttggatgggcatttttgactgtgcaatgggaagtacacaccctgtctctgagg gatgtagtgcagcagcaattccagaaaatgctcgagggggagaaggacattactgcacga cccagtcccagctgggtgctccagatcaaagactacttgctgcagccaggtaggaggagg aagggccaaggagtccccaaagctgatagagagctccagcctgggcttcagagcgagact ctttctaacaacaacaacaacaacaacaacaacaacaacaacaacaaaacccccaaagct ttagagcaggaatga >gi568815588r:70498056_70700902|GENSCAN_predicted_peptide_5|202_aa MAPLRALLSYLLPLHCALCAAAGSRTPELHLSGKLSDYGVTVPCSTDFRGRFLSHVVSGP AAASAGSMVVDTPPTLPRHSSHLRVARSPLHPGGTLWPGRVGRHSLYFNVTVFGKELHLR LRPNRRLVVPGSSVEWQEDFRELFRQPLRQECVYTGGVTGMPGAAVAISNCDGLCAGPCE LAQQPQGPVGQLFPAPETSDVS >gi568815588r:70498056_70700902|GENSCAN_predicted_CDS_5|609_bp atggctccactccgcgcgctgctgtcctacctgctgcctttgcactgtgcgctctgcgcc gccgcgggcagccggaccccagagctgcacctctctggaaagctcagtgactatggtgtg acagtgccctgcagcacagactttcggggacgcttcctctcccacgtggtgtctggccca gcagcagcctctgcagggagcatggtagtggacacgccacccacactaccacgacactcc agtcacctccgggtggctcgcagccctctgcacccaggagggaccctgtggcctggcagg gtggggcgccactccctctacttcaatgtcactgttttcgggaaggaactgcacttgcgc ctgcggcccaatcggaggttggtagtgccaggatcctcagtggagtggcaggaggatttt cgggagctgttccggcagcccttacggcaggagtgtgtgtacactggaggtgtcactgga atgcctggggcagctgttgccatcagcaactgtgacggattgtgtgcaggcccttgtgag ctggcacagcagcctcaaggtcctgttgggcagctcttccctgccccagagacatcagat gtgagctga >gi568815588r:70498056_70700902|GENSCAN_predicted_peptide_6|274_aa MGLKAARLVHNWHALPRTSARHPIPAPSALPQQAKGSKKHQEAESLCQLERVIWDDHSAT VPKPTKLGEGALPGPCLALSPGSSHGSRMKRQEAALKCQQAALSGGAAPCVEALQFERCH SDAHQLTVSSTGQGAAPLAVENRPKRGPHRITSNKNPHNGATQDPFCTRATVPMGLRAFA PAVHPTRNAPFTQLEGHPITAPSFVLCGGGSWQVLYPCPGWNVEPTRSFPGRPQDAYSLI QPVKSSPRGKDRCLAQETKKPQTSLLCTSIELEG >gi568815588r:70498056_70700902|GENSCAN_predicted_CDS_6|825_bp atggggctgaaagctgccagacttgtccacaactggcacgctctgccacgtacatctgca cggcaccccataccagcaccatcagcgctgccacaacaagccaaaggttcaaagaaacac caggaagcagagagcctctgccagctggagagagttatctgggatgaccactcggccaca gttcccaagcccaccaagctcggcgaaggagctctgcctggcccctgcttggccctgagt cctggaagcagccatggctccaggatgaagagacaggaagcagccctcaagtgccagcag gcggccctgagcggaggagcggcaccctgcgtggaggccctgcagtttgaacgctgccac tcagacgcacaccagctgacagtgagcagcactgggcaaggtgctgcacccttggcagtg gagaacaggcccaagagagggccccacagaattaccagcaacaaaaatccccacaatgga gccacacaggacccgttctgtaccagagccacagttcccatgggcctccgcgccttcgcc cctgctgtgcaccccactaggaatgcccctttcacccagctcgaaggccaccccatcact gctccctcctttgtgctctgtggtggtggctcttggcaggtgctctatccctgtccaggg tggaatgtggagccgacccgcagcttcccagggcgcccacaggacgcctactcactcatc cagccagtcaagagcagcccccggggcaaggacaggtgtttggcacaggagaccaaaaag ccacagacaagtctgctgtgcaccagcatcgagctggaaggatga