GENSCAN 1.0 Date run: 5-Nov-116 Time: 12:32:04 Sequence gi568815581r:17283380_17484008 : 200629 bp : 48.97% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 20172 20438 267 2 0 71 49 482 0.048 37.58 1.02 Intr + 23164 23264 101 0 2 64 106 59 0.018 4.21 1.03 Intr + 39806 39866 61 2 1 100 116 94 0.197 12.14 1.04 Term + 39914 40030 117 1 0 104 37 -15 0.090 -6.46 1.05 PlyA + 41108 41113 6 -0.45 2.02 PlyA - 41121 41116 6 -1.75 2.01 Sngl - 42255 41779 477 0 0 80 40 299 0.404 20.33 2.00 Prom - 53848 53809 40 -3.66 3.00 Prom + 58721 58760 40 -5.96 3.01 Init + 61090 61141 52 0 1 84 37 87 0.606 2.54 3.02 Intr + 61365 61547 183 1 0 90 94 156 0.963 16.16 3.03 Intr + 63174 63286 113 1 2 -1 53 138 0.678 1.50 3.04 Term + 63371 63568 198 1 0 59 49 193 0.716 9.90 3.05 PlyA + 64256 64261 6 1.05 4.06 PlyA - 66329 66324 6 1.05 4.05 Term - 67907 67819 89 0 2 129 55 43 0.528 2.92 4.04 Intr - 68905 68789 117 2 0 65 54 103 0.474 5.04 4.03 Intr - 72310 72266 45 2 0 91 99 13 0.303 1.08 4.02 Intr - 78602 78426 177 0 0 -10 57 151 0.129 2.09 4.01 Init - 84791 84758 34 2 1 52 92 36 0.162 0.44 4.00 Prom - 88612 88573 40 -4.36 5.00 Prom + 89581 89620 40 -7.66 5.01 Init + 91394 91507 114 1 0 53 45 129 0.594 5.31 5.02 Term + 97168 97311 144 0 0 67 55 123 0.517 4.81 5.03 PlyA + 98494 98499 6 1.05 6.02 PlyA - 99194 99189 6 1.05 6.01 Sngl - 100633 99998 636 1 0 109 42 1107 0.990 104.39 6.00 Prom - 104105 104066 40 -5.36 7.00 Prom + 106566 106605 40 -7.36 7.01 Init + 108545 108589 45 1 0 66 113 66 0.627 7.59 7.02 Intr + 108764 108866 103 1 1 39 76 79 0.450 1.55 7.03 Intr + 111236 111339 104 0 2 119 20 64 0.512 2.59 7.04 Intr + 113127 113240 114 2 0 87 34 109 0.372 6.04 7.05 Intr + 114902 114935 34 1 1 73 47 53 0.327 -2.50 7.06 Intr + 116643 116757 115 1 1 87 56 102 0.930 6.41 7.07 Intr + 116953 117086 134 1 2 41 66 119 0.347 5.29 7.08 Intr + 138624 138676 53 1 2 78 64 67 0.551 1.93 7.09 Intr + 139595 140026 432 1 0 14 41 357 0.834 17.44 7.10 Term + 140057 140236 180 1 0 30 44 181 0.705 5.51 7.11 PlyA + 141485 141490 6 1.05 8.00 Prom + 149162 149201 40 -1.96 8.01 Init + 154427 154476 50 1 2 116 75 7 0.846 2.62 8.02 Term + 156014 156089 76 2 1 93 54 97 0.951 4.11 8.03 PlyA + 157625 157630 6 1.05 9.07 PlyA - 158294 158289 6 1.05 9.06 Term - 162425 162297 129 2 0 44 55 92 0.615 -0.32 9.05 Intr - 163706 163632 75 2 0 96 94 -7 0.397 0.41 9.04 Intr - 167561 167445 117 2 0 50 105 47 0.281 3.26 9.03 Intr - 170806 170633 174 1 0 72 98 125 0.954 12.04 9.02 Intr - 172064 171995 70 1 1 105 26 33 0.276 -2.02 9.01 Init - 172921 172815 107 1 2 56 72 75 0.279 2.39 9.00 Prom - 174646 174607 40 -6.16 10.00 Prom + 174926 174965 40 -10.15 10.01 Sngl + 176648 177025 378 1 0 85 49 817 0.707 73.66 10.02 PlyA + 177157 177162 6 1.05 11.00 Prom + 185590 185629 40 -4.76 11.01 Init + 193663 193886 224 0 2 109 95 145 0.982 14.85 11.02 Intr + 198681 198852 172 0 1 87 51 36 0.002 -0.25 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 20172 20513 342 2 0 71 43 515 0.941 39.33 S.002 Init - 30401 30290 112 2 1 97 116 27 0.900 5.36 S.003 Intr + 185402 185485 84 1 0 120 66 44 0.908 5.42 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:17283380_17484008|GENSCAN_predicted_peptide_1|181_aa MIRLGGWCARRLCSAAVPAGRRGAAGGLGLAGGRALRVLVDMDGVLADFEGGFLRKFRAR FPDQPFIALEDRRGFWVSEQYGRLRPGLSEKAISIWESKNFFFELEPLPGAVEAVKEMAS LQNTDVFICTSPIKMFKYCPYEKVRGTGLSLPVERGWTLTALPHLCDSGFLLPKDLSSLR L >gi568815581r:17283380_17484008|GENSCAN_predicted_CDS_1|546_bp atgatccggctgggcggctggtgtgcgcggcggctctgcagcgcggcggttcccgcgggg cggcgcggggcggcgggcgggctgggcctggcgggaggccgcgccctacgggtgctggtg gacatggacggcgtgctggctgacttcgagggcggattcctcaggaagttccgcgcgcgc tttcccgaccagcccttcatcgcgctggaggaccggcgcggcttctgggtgtcggagcag tacggccgcctgcggccagggctgagcgagaaggccatcagcatttgggagtcaaagaat ttcttttttgaacttgagcctctgccaggggccgtggaagctgtcaaggagatggccagc ctacaaaacactgacgtcttcatctgcacaagccccatcaagatgttcaagtactgtccc tatgagaaggtgaggggtacggggctcagcctgcctgtggaaaggggatggaccctcaca gcactcccccacctctgtgacagcggctttctgctccccaaggacctctccagcctgagg ctctag >gi568815581r:17283380_17484008|GENSCAN_predicted_peptide_2|158_aa MICSVRALSACMELKGAGSLGGPVSCTGSKCQDAGNEKRQKQTGARHVFEIGPMGHGTHQ QAGSEGLEEDISTAPDFRPEATRQHCHAQGNQHPRDRYIGSECLELRGQSRAEATQVGQL AQRQYLNPQERMNEAMSFISLPQLLVNHHGHPVKLQLK >gi568815581r:17283380_17484008|GENSCAN_predicted_CDS_2|477_bp atgatctgctctgtgagggccttgtcagcctgcatggaactgaaaggagcaggcagttta ggaggccctgtcagctgtactggcagtaaatgccaagatgctggcaatgagaagagacag aagcagacaggtgccaggcacgttttcgaaatcggtcccatgggacatgggactcaccaa caggctggaagtgagggcttagaggaagacatcagcacggcccctgatttccgacctgag gccaccagacagcactgccatgcacaaggaaaccagcaccccagggacagatacatcggg tctgagtgtctggagctcagagggcagtcccgggccgaggcaacgcaggtgggacagctg gcacaaagacagtatctaaatccccaggaacggatgaatgaggccatgtcgttcatctca ctcccacaactgctggtcaaccaccatggccacccagtgaaactccagctgaaatag >gi568815581r:17283380_17484008|GENSCAN_predicted_peptide_3|181_aa MAALCSLCLGYAVQCSLGLMSLSSHHLPLPVCVQYAWVEKYFGPDFLEQIVLTRDKTVVS ADLLIDDRPDITGKWPATVLVVTVTRSQSLLTEHWLRVLTASLPHLVPSSHPLRRRAAAL QVSTAELNAAFPPTGAEPTPSWEHVLFTACHNQHLQLQPPRRRLHSWADDWKAILDSKRP C >gi568815581r:17283380_17484008|GENSCAN_predicted_CDS_3|546_bp atggccgccctctgctccctgtgtttgggttatgctgtgcagtgttcgctgggtctgatg tcactttcttctcaccacctgcccttgcctgtttgcgtccagtatgcctgggtggagaag tactttggccctgactttctggagcagattgtgctgaccagagacaagaccgtggtctct gctgaccttctcatagacgaccggccggacatcacaggcaagtggcctgcgacagtgctg gtggtgacagtgacccgcagtcagtccctgttgactgagcactggctgcgcgtgctgacg gcctccttgccgcatctcgtgccatcctcacacccgctgcgaaggcgggcggctgcgctc caggtctccactgctgagctgaatgccgctttcccacccacaggggccgagccaaccccc agctgggagcatgtcctcttcaccgcctgccacaaccagcacctgcagctgcagcccccc cgccgcaggctgcactcgtgggcggacgactggaaggccattctggacagcaagcggccc tgctga >gi568815581r:17283380_17484008|GENSCAN_predicted_peptide_4|153_aa MNESLAVQHHVELEKPTTLFPEEHPEQSEEPIYDVEIKSSCAASEGPKPKSRPCWDSPCL RISGSSICEVGKSEPGLVQKAPVKTGSAARFAATTTPRYFWALSTTSSATTVGQSSAARS EHGHGGNGTAVDWRRIRSFPHMHGLIWPDDPTG >gi568815581r:17283380_17484008|GENSCAN_predicted_CDS_4|462_bp atgaatgagagcttggccgtccagcaccacgtggaattagaaaaaccaaccacactcttc ccagaagagcaccctgagcagagtgaagagcccatttatgacgtggagattaagagcagc tgtgcagcgtcagagggcccaaagcccaaatcccggccgtgctgggactcaccatgcctg cgcatctcaggttcttccatctgtgaagtggggaagtcagagccagggcttgtgcagaaa gcgccggtaaagacaggctctgctgccaggtttgccgccaccaccacccctcgctatttc tgggccctttctacgacatcatctgcaaccactgtgggccagagctcagctgctcgctca gagcatgggcatggtgggaatggcactgccgtggactggagaaggatcagaagctttcct cacatgcatggcctcatctggcctgacgaccccacggggtga >gi568815581r:17283380_17484008|GENSCAN_predicted_peptide_5|85_aa MEKTTASSPEDLVEITDEGGCTKQQILSEDERALYWKKCSDPASAAWCAPLGKALSLNCK MGSNNSTCSISRPRHCGPWSSSQRY >gi568815581r:17283380_17484008|GENSCAN_predicted_CDS_5|258_bp atggagaaaactacagcaagttctccagaagatctagtggagatcaccgatgaaggtggc tgcactaaacagcagattctcagtgaggatgaaagagcactctattggaagaagtgttca gaccctgcttctgctgcctggtgtgcacccttgggcaaggctctgagcctcaactgcaaa atgggcagcaacaacagcacatgctccatcagccgccctcggcactgtgggccctggtca tcctcccagcgatactga >gi568815581r:17283380_17484008|GENSCAN_predicted_peptide_6|211_aa MTPSRNGMVLKPHFHKDWQRRVATWFNQPARKIRRRKARQAKARRIAPRPASGPIRPIVC CPTVRYHTKVRAGRGFNLEELRVAGIHKKVARTIGISVDPRRRNKSTESLQANVQRLKEY RSKLILFPRKPSAPKKGDSSAEELKLATQLTGPVMPIRNVYKKEKARVITEEEKNFKAFA SLRMARANARLFGIRAKRAKEAAEQDVEKKK >gi568815581r:17283380_17484008|GENSCAN_predicted_CDS_6|636_bp atgacgcccagccggaatggcatggtcttgaagccccacttccacaaggactggcagcgg cgcgtggccacgtggttcaaccagccggcccggaagatccgcagacgtaaggcccggcaa gccaaggcgcgccgcatcgccccgcgccccgcgtcggggcccatccggcccatcgtgtgc tgccccacggttcggtaccacacgaaggtgcgcgccggccgcggcttcaatctggaggag ctcagggtggccggcattcacaagaaggtggcccggaccatcggcatttctgtggatccg aggaggcggaacaagtccacggagtccctgcaggcgaacgtgcagcggctgaaggagtac cgctccaaactcatcctcttccccaggaagccctcggcccccaagaagggagacagttct gctgaagaactgaaactggccacccagctgaccggaccggtcatgcccatccggaatgtc tataagaaggagaaagctcgagtgatcactgaggaagagaagaatttcaaagccttcgct agtctccgtatggcccgtgccaacgcccggctcttcggcatacgggcaaaaagagccaag gaagccgcagaacaggatgttgaaaagaaaaaataa >gi568815581r:17283380_17484008|GENSCAN_predicted_peptide_7|437_aa MAKSAASHLDEVSETVCSFTSETSETTNPPGGMSNFGCATFKSRNTDCEDSPGSSRVGAA IPVSDWSWLPMEVGDPQPPPCSHQWSKVEVMVCDSQGRVTEGIAVPSCIFWILALENASR RVLSKAISDNCALRCALLSDVPAADDHSHSDKEAAKAVAPRDEDEAFLFFPLLNDDASAI YNQRCSYTHTLMGCCFNVTSEQVRCDFALAEPPPPVLLLFVIAIGPLQDPAHHMESCPAA SVITEQLEANLSKNNRALSVLRRIKSGSVVANRANQGKENSENITTPEVFPRLYHLIPDG EITSIKINQVDPSESLSIMLMGGSETPLVRIIQHIYHDGANARDDQLLPGDIILNVNGMD ISKVSHNYALHLLRQPYQKFCSRNSGQALDAYRPQDDSCHVICNKSSPKEQLGIKLVCKV DEPGVFIFNVLDGSVAD >gi568815581r:17283380_17484008|GENSCAN_predicted_CDS_7|1314_bp atggcaaagtctgcagcttcacacctggatgaagtcagcgagacggtctgcagcttcact tctgaaaccagcgagaccacgaacccaccaggaggaatgagcaactttggttgcgctacc tttaagagccggaacactgactgcgaagatagccctggatcctcccgcgttggggccgcc atccctgtatctgactggagctggcttcccatggaagtaggtgacccgcagccacctccc tgcagccaccagtggagtaaggtggaggtgatggtgtgtgactcccaaggccgggtcacc gaaggcattgcagtgccctcctgcatcttttggattctcgccctggagaatgccagccga cgtgttctctccaaggccattagtgacaactgtgccctaaggtgtgccctgctctcagat gtgcctgccgctgacgaccattcccactccgacaaggaggctgcaaaggctgtagcaccc agggatgaggatgaagcatttttattcttcccactgcttaatgacgatgcctctgccatc tataaccagaggtgctcttacacccacaccttgatgggctgctgcttcaacgtaacttct gagcaagtccgatgtgattttgcacttgctgaacctccaccacctgtattgctgctcttt gtcatcgccattgggcctctccaggaccctgcccaccacatggaaagctgtccggcagca tcggtcatcacagaacaattagaagcaaatctttccaaaaataatcgagctttgagtgtt cttcgaaggataaagagtgggagtgtagtcgccaaccgagctaaccagggcaaggaaaat tctgaaaacatcaccacccctgaagtctttccaaggttgtaccacctgatcccagatggt gaaattaccagcatcaagatcaatcaagtggatcccagtgaaagcctctccattatgctg atgggaggcagtgaaaccccactggtccgtatcatccaacacatttatcatgatggggca aatgccagagatgaccagctactaccaggagacatcatcctaaatgtcaatgggatggac attagcaaagtctctcacaactacgctctgcatctcctgcggcagccctaccagaagttc tgcagcaggaacagtggacaggccctggatgcctacagaccccaggatgacagctgccat gtgatttgcaacaaaagtagccccaaggaacagcttggaataaaattggtgtgcaaggtg gatgagcctggggtgtttatcttcaacgtgctagatggtagtgtggctgattga >gi568815581r:17283380_17484008|GENSCAN_predicted_peptide_8|41_aa MADVQYICVELTLNSARSLDKLFQPTANQKIFESTYDLEAS >gi568815581r:17283380_17484008|GENSCAN_predicted_CDS_8|126_bp atggccgacgtgcaatacatatgtgttgaattgacattaaattcagccaggtctttagat aaactctttcaaccaaccgccaatcagaaaatctttgaatccacctatgacctggaagcc tcctga >gi568815581r:17283380_17484008|GENSCAN_predicted_peptide_9|223_aa MEVLARKARGDERRERVARASAATPLGHKNNSALLGTDLTEHLVGAVSSLDPGQLWPHTV QDTVLDTGDAGVNKTDKVSALMELVLPWRREIKNKPTNASDYLMGLRTRKDTGRSTTEMI PTPMRHRAAHRMGHSAEELPSGPFLLPPYAILLFAETSHRALPAASWLDATQPHRMVGMA QRQAGPDMASEHQGLPCLTGQLGTNPYESNMRRKPEPEQEAET >gi568815581r:17283380_17484008|GENSCAN_predicted_CDS_9|672_bp atggaggtgctggcgaggaaggcccgaggtgatgaaaggagggagagagttgctagagct tctgcagcgacccccctgggacataagaacaactctgccctgctggggacggacctcaca gagcacttggttggtgctgtgtccagcctggaccctggccagctgtggccacacacagtg caagacactgttctggacacaggagatgcaggtgtgaacaagacagacaaggtctctgcc cttatggagctggtgctcccatggagaagagagattaaaaacaagccaaccaatgcatca gattatctcatgggactaagaactcggaaagacacagggcggagcaccacagaaatgatc cccactccaatgcgtcacagagcagcccatcgcatgggtcattccgcagaagaactacct tctggacccttccttctgccaccatatgccattctgctatttgcagagacaagccacagg gccctgccagctgcctcctggctggatgcaacccaacctcaccgcatggtggggatggcc cagaggcaggcgggacctgatatggcaagtgaacatcaaggcctgccttgcctcacaggg cagctggggaccaacccatacgaatccaacatgcggaggaagccagagccggagcaggag gcagagacctga >gi568815581r:17283380_17484008|GENSCAN_predicted_peptide_10|125_aa MSIAITTIITITTITIMTIAITTIITITTITITIITITTITIITITITIITTIIIITITT ITITAIAITITIITITITITSPPSPSSLSPSPHHRHHHHHHHYHHHHYHHHHHHHHHHHH DHHHH >gi568815581r:17283380_17484008|GENSCAN_predicted_CDS_10|378_bp atgagcatcgccatcaccaccatcatcaccatcaccaccatcaccatcatgaccatcgcc atcaccaccatcatcaccattaccaccatcaccatcaccatcattaccatcaccaccatc accatcatcaccatcaccatcaccatcatcaccaccatcatcatcatcaccatcaccacc atcaccatcactgccatcgccatcaccatcaccatcatcactatcaccatcaccatcaca tcaccaccatcaccatcatcactatcaccatcgccacatcaccgccatcaccatcatcac catcactatcaccatcatcactatcaccatcaccatcaccatcatcaccatcatcaccat gatcatcatcaccattga >gi568815581r:17283380_17484008|GENSCAN_predicted_peptide_11|132_aa MASAGVAAGRQAEDVLPPTSDQPLPDTKPLPPPQPPPVPAPQPQQSPAPRPQSPARAREE ENYSFLPLVHNIIKCSLAKLCNIPLYERAVVYVNSFLLKDTVIVSAFCCWNQHCSKHLCP CYLGELLVVSVG >gi568815581r:17283380_17484008|GENSCAN_predicted_CDS_11|396_bp atggcctctgctggggtggcagccgggcgacaggcggaggatgtattgccgccaacgtcc gaccagccgctgcctgacaccaagccgctgccgcctcctcagccgccgccggtccctgcg cctcaaccgcagcagtcgccggcgccacggcctcagtcacctgcccgcgcgagggaggaa gagaactactcctttttacctttggttcacaacatcatcaaatgctcccttgcaaagctg tgtaatattccactttatgaacgtgctgtcgtttatgtaaacagtttcttgctcaaggac accgtcattgtttctgcattttgctgttggaatcagcactgcagcaagcatctttgccca tgctatcttggtgaactcttggtagtctctgtgggg