GENSCAN 1.0 Date run: 14-Jul-118 Time: 21:20:05 Sequence gi568815584r:69676285_69897155 : 220871 bp : 44.13% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 8653 8692 40 -2.56 1.01 Init + 14189 14249 61 1 1 51 80 45 0.197 1.41 1.02 Intr + 22669 22792 124 2 1 61 34 87 0.311 0.24 1.03 Intr + 25115 25191 77 2 2 82 83 54 0.757 3.46 1.04 Intr + 27111 27308 198 1 0 115 56 218 0.982 20.62 1.05 Intr + 28320 28458 139 1 1 76 109 105 0.956 10.92 1.06 Intr + 32393 32820 428 2 2 96 101 243 0.617 19.83 1.07 Intr + 33340 33364 25 2 1 67 115 -6 0.035 -2.92 1.08 Intr + 42922 43092 171 1 0 61 100 66 0.019 4.16 1.09 Intr + 48974 49087 114 2 0 29 80 95 0.013 2.36 1.10 Term + 82683 82827 145 0 1 47 54 127 0.185 2.58 1.11 PlyA + 84687 84692 6 -0.45 2.00 Prom + 85488 85527 40 -3.96 2.01 Init + 91873 91998 126 0 0 78 103 109 0.767 11.56 2.02 Intr + 92320 92390 71 0 2 44 100 70 0.756 1.78 2.03 Intr + 92514 92612 99 0 0 74 92 34 0.560 1.63 2.04 Intr + 94711 94821 111 1 0 83 70 41 0.520 1.29 2.05 Term + 94913 95177 265 2 1 59 37 125 0.444 -0.52 2.06 PlyA + 95702 95707 6 1.05 3.06 PlyA - 95969 95964 6 1.05 3.05 Term - 100104 99998 107 1 2 103 41 85 0.990 3.87 3.04 Intr - 102245 102049 197 1 2 48 96 104 0.989 6.26 3.03 Intr - 103076 102898 179 2 2 112 90 129 0.998 14.22 3.02 Intr - 110023 109813 211 0 1 100 116 277 0.686 30.62 3.01 Init - 120871 120516 356 1 2 74 96 517 0.991 47.71 3.00 Prom - 123823 123784 40 -5.06 4.06 PlyA - 124358 124353 6 1.05 4.05 Term - 126959 126769 191 0 2 56 42 148 0.952 4.61 4.04 Intr - 132631 132533 99 2 0 83 53 81 0.331 4.18 4.03 Intr - 146714 146628 87 0 0 75 94 21 0.349 1.34 4.02 Intr - 147630 147542 89 2 2 44 108 41 0.495 1.31 4.01 Init - 148062 147824 239 0 2 81 36 199 0.712 11.49 4.00 Prom - 152938 152899 40 -5.76 5.00 Prom + 154304 154343 40 -4.56 5.01 Init + 157938 157944 7 2 1 40 84 0 0.172 -4.25 5.02 Intr + 166435 166581 147 2 0 43 58 213 0.559 14.01 5.03 Intr + 169471 169616 146 2 2 89 45 41 0.090 -0.10 5.04 Term + 183121 183330 210 0 0 108 42 67 0.013 1.39 5.05 PlyA + 184676 184681 6 1.05 6.00 Prom + 195728 195767 40 -4.76 6.01 Init + 203395 203493 99 0 0 101 110 201 0.913 21.86 6.02 Term + 205491 205607 117 2 0 63 53 31 0.531 -4.36 6.03 PlyA + 206583 206588 6 1.05 7.02 PlyA - 206719 206714 6 1.05 7.01 Sngl - 209856 209056 801 0 0 94 36 910 0.991 82.24 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584r:69676285_69897155|GENSCAN_predicted_peptide_1|493_aa MWEGFKEKACKRSRGIFQEHELSTELPSANLELSENDPGAQIKTVSCLDCGFWKPLNNFF AMSCHGVLSINYATEMNQLDTVQGEPGMCPLPPEPENGGYICHPRPCRDPLTAGSVIEYL CAEGYMLKGDYKYLTCKNGEWKPAMEISCRLNEDKDTHTSLGVPTLSIVASTASSVALIL LLVVLFVLLQPKLKSFHHSRRDQGVSGDQVSIMVDGVQVALPSYEEAVYGSSGHCVPPAD PRVQIVLSEGSGPSGRSVPREQQLPDQGACSSAGGEDEAPGQSGLCEAWGSRASETVMVH QATTSSWVAGSGNRQLAHKETADSENSDIQSLLSLTSEEYTDAEETKAHRRIPLSNVQNQ DTKPFSKMQSQMLSSLSSQDGFWVPGKIRQGQETQAAPVSSLGLASERLPGLWKIVKAAL ANPYIQSQKWELDQGFFPQQEPIQASDRLSDEQITPQKASDIRKYLLEFWKGSKKLQQKL LEGTHSPPLAYVV >gi568815584r:69676285_69897155|GENSCAN_predicted_CDS_1|1482_bp atgtgggaaggattcaaagagaaagcttgtaagaggagcagagggatcttccaggagcac gagttaagtacagagcttccttctgcgaacctggaattgtcagagaatgatcctggtgca cagatcaagacagtttcttgcctagactgtggcttctggaagccactgaacaatttcttt gccatgagctgccatggggtgctcagcattaattatgccacggaaatgaaccaattagac acagtccaaggagagccagggatgtgccccctaccaccggagccagagaatggtggctac atctgccacccccggccctgcagagaccccctgacagcaggcagtgtcatcgaatacctg tgtgctgaaggctacatgttgaagggcgattacaaatacctgacgtgtaagaatggcgag tggaaaccagccatggagattagctgccgtctcaacgaggataaagacacccacacatca cttggggtccccacgctgtctatagtggcttctactgccagctccgtggcgctcattctc ctcctcgtggtgctgtttgtgctgctgcagccaaagctgaagtctttccatcatagcagg cgtgaccagggggtatctggggaccaggtctccatcatggtggatggagtccaggttgca ctaccatcatacgaggaggctgtatatggcagttctggtcactgtgtgccacctgctgac cccagagtacagattgtgctgtcagaagggtctgggcccagtgggaggagcgtgccaagg gagcaacagctgccggaccaaggggcctgctcctctgcaggtggagaagatgaggcccca ggccagtctggactatgtgaagcctggggctctcgggcctcagagactgtgatggtgcat caggcaaccacctcttcctgggtggccggctcagggaaccgccaactggcacacaaagaa actgcagattcagagaacagtgacatacaaagccttttatccctcacgtcagaggagtac acagatgctgaggaaactaaggctcacagaagaatccctctcagcaatgtccaaaaccag gacaccaagccatttagcaaaatgcagtcccagatgctcagcagtctcagcagtcaggat gggttctgggtccctggaaagatcaggcaaggccaggagacccaggcagcccctgtctcc tccctgggccttgcttctgaaaggctgcccggtctatggaagattgtaaaggcagcccta gcaaacccatacattcagtctcagaagtgggagctggaccagggcttcttcccacaacaa gaacccatccaggccagtgatagactcagcgatgagcagataaccccacagaaagccagt gacatcaggaagtatttgctggagttctggaaaggatctaagaagcttcaacagaagctc ctggaagggacgcactctcctcctctggcctacgtggtgtga >gi568815584r:69676285_69897155|GENSCAN_predicted_peptide_2|223_aa MSGCRVFIGRLNPAAREKDVERFFKGYGRIRDIDLKRGFGFVEFEDPRDADDAVYELDGK ELCSERVTIEHARARSRGGRGRGRYSDRFSSRRPRNDRRVVEFASYGDLKNAIEKLSGKE INGRKIKLIEGSKRHRSRSRSRSRTRSSSRSRSRSRSRSRKSYSRSRSRSRSRSRSKSRS VSRSPVPEKSQKRGSSSRSKSPASVDRQRSRSRSRSRSVDSGN >gi568815584r:69676285_69897155|GENSCAN_predicted_CDS_2|672_bp atgagtggctgtcgggtattcatcgggagactaaatccagcggccagggagaaggacgtg gaaagattcttcaagggatatggacggataagagatattgatctgaaaagaggctttggt tttgtggaatttgaggatccaagggatgcagatgatgctgtgtatgagcttgatggaaaa gaactctgtagtgaaagggttactattgaacatgctagggctcggtcacgaggtggaaga ggtagaggacgatactctgaccgttttagtagtcgcagacctcgaaatgatagacgggtg gttgagtttgcctcttatggtgacttaaagaatgctattgaaaaactttctggaaaggaa ataaatgggagaaaaataaaattaattgaaggcagcaaaaggcacaggtcaagaagcagg tctcgatcccggaccagaagttcctctaggtctcgtagccgatcccgttcccgtagtcgc aaatcttacagccggtcaagaagcaggagcaggagccggagccggagcaagtcccgttct gttagtaggtctcccgtgcctgagaagagccagaaacgtggttcttcaagtagatctaag tctccagcatctgtggatcgccagaggtcccggtcccgatcaaggtccagatcagttgac agtggcaattaa >gi568815584r:69676285_69897155|GENSCAN_predicted_peptide_3|349_aa MEAHNASAPFNFTLPPNFGKRPTDLALSVILVFMLFFIMLSLGCTMEFSKIKAHLWKPKG LAIALVAQYGIMPLTAFVLGKVFRLKNIEALAILVCGCSPGGNLSNVFSLAMKGDMNLSI VMTTCSTFCALGMMPLLLYIYSRGIYDGDLKDKVPYKGIVISLVLVLIPCTIGIVLKSKR PQYMRYVIKGGMIIILLCSVAVTVLSAINVGKSIMFAMTPLLIATSSLMPFIGFLLGYVL SALFCLNGRCRRTVSMETGCQNVQLCSTILNVAFPPEVIGPLFFFPLLYMIFQLGEGLLL IAIFWCYEKFKTPKDKTKMIYTAATTEETIPGALGNGTYKGEDCSPCTA >gi568815584r:69676285_69897155|GENSCAN_predicted_CDS_3|1050_bp atggaggcccacaacgcgtctgccccattcaacttcaccctgccacccaactttggcaag cgccccacagacctggcactgagcgtcatcctggtgttcatgttgttcttcatcatgctc tcgctgggctgcaccatggagttcagcaagatcaaggctcacttatggaagcctaaaggg ctggccatcgccctggtggcacagtatggcatcatgcccctcacggcctttgtgctgggc aaggtcttccggctgaagaacattgaggcactggccatcttggtctgtggctgctcacct ggagggaacctgtccaatgtcttcagtctggccatgaagggggacatgaacctcagcatt gtgatgaccacctgctccaccttctgtgcccttggcatgatgcctctcctcctgtacatc tactccagggggatctatgatggggacctgaaggacaaggtgccctataaaggcatcgtg atatcactggtcctggttctcattccttgcaccatagggatcgtcctcaaatccaaacgg ccacaatacatgcgctatgtcatcaagggagggatgatcatcattctcttgtgcagtgtg gccgtcacagttctctctgccatcaatgtggggaagagcatcatgtttgccatgacacca ctcttgattgccacctcctccctgatgccttttattggctttctgctgggttatgttctc tctgctctcttctgcctcaatggacggtgcagacgcactgtcagcatggagactggatgc caaaatgtccaactctgttccaccatcctcaatgtggcctttccacctgaagtcattgga ccacttttcttctttcccctcctctacatgattttccagcttggagaagggcttctcctc attgccatattttggtgctatgagaaattcaagactcccaaggataaaacaaaaatgatc tacacagctgccacaactgaagaaacaattccaggagctctgggaaatggcacctacaaa ggggaggactgctccccttgcacagcctag >gi568815584r:69676285_69897155|GENSCAN_predicted_peptide_4|234_aa MPSGTIWILEARHSHSGVLLWPIYRVTRKAASFEREPEQDKALQQAQSAVQAALPLGLYD PADPMVLEVSVADNDAGPHRHKVGRAQQHPIIQWKSYILDQAQTGPEGTKRAEICPHQVA GLIDLDYQDEISLLIHNRAMQHQIQKFTNVLDGTERVQKYSLSDIAASAIGSHKMPCSFD LSLVEQLLWGHQLPDKKSDYSEITMQMTYPQQPYEDPKQKPSAEPVISQNCERE >gi568815584r:69676285_69897155|GENSCAN_predicted_CDS_4|705_bp atgcctagtgggactatttggattttggaagcaagacattctcattcgggtgtgttactc tggcccatttatcgagtgacccgaaaggctgccagttttgagcgagaaccagaacaggac aaggcgctgcaacaggcccagtctgctgtgcaagctgctctgccacttgggctatatgac ccagcagatccaatggtgcttgaggtgtcagtggcagataatgatgctggcccccataga cataaagtgggtcgtgcacagcagcatcccatcatccaatggaagtcgtatatacttgat caggctcaaacaggtcctgaaggcacaaaaagggcagaaatttgtcctcaccaggtggct gggctgattgacctggactatcaagatgaaattagtctactaatccataacagagctatg cagcaccaaattcagaaatttacaaatgtgcttgatggcacagagagagttcagaagtac tcactgagtgatattgctgcttcagccataggaagtcataagatgccttgcagctttgac ttaagcctcgtggagcaattgctctggggccatcagctaccagataagaagtctgactac tctgagatcacgatgcagatgacttatcctcagcaaccatatgaggaccccaagcaaaag ccatcagctgaacctgtcatatcccagaactgtgaaagagaataa >gi568815584r:69676285_69897155|GENSCAN_predicted_peptide_5|169_aa MQAAKRPSDEWEGKAEGGEEKEKEEEEEEEEEEEEVEEVAVAVQARQSGMECDSPQFFGA SIAPSQSGPNLELKQVAGGSQSGLLIPLAPVIEYVAQPEAVRSERGCLEGPLLWPQKPHS RSPQAPLSMRTVDPKAKEKSKKLGPDAMQLTPKFASKLGVREKLRIPKH >gi568815584r:69676285_69897155|GENSCAN_predicted_CDS_5|510_bp atgcaagctgcaaagaggcccagtgatgaatgggaggggaaagcggaaggtggggaggag aaggagaaggaggaggaggaggaggaggaggaggaggaggaggaggtggaggaggtggca gtggcggtacaagcaaggcagagtgggatggagtgtgacagcccccaattttttggtgca tccattgctccctcacaatcaggccccaacttagagctgaagcaggtggcaggtggaagc cagtcagggcttctcattcctctggctccagtaattgagtatgtggctcagccagaagca gtgaggtcagaaagagggtgccttgaaggcccccttctgtggcctcagaaaccacactcc cgcagtccccaggctcccctcagcatgaggacagtggatccaaaggcaaaggagaagagt aagaagctggggcccgatgccatgcaattaacccccaagtttgcttcaaaacttggtgta agagaaaaactgaggatccccaagcactaa >gi568815584r:69676285_69897155|GENSCAN_predicted_peptide_6|71_aa MLPARCARLLTPHLLLVLVQLSPARGHRTTGPRITIRATTYEHLGWALYMISYSNENKKL GLLKAILQMTK >gi568815584r:69676285_69897155|GENSCAN_predicted_CDS_6|216_bp atgctgcccgcgcgctgcgcccgcctgctcacgccccacttgctgctggtgttggtgcag ctgtcccctgctcgcggccaccgcaccacaggccccaggataacaataagagctaccact tatgaacatctgggctgggcactttacatgatctcttactcgaacgagaacaagaagttg ggtcttttaaaagccattttacagatgacaaagtga >gi568815584r:69676285_69897155|GENSCAN_predicted_peptide_7|266_aa MPKGKKAKGKKVAPAPAVVKKQEAKKVVNPLFEKRPKNFGIGQDIQPKRDLTRFVKWPRY IRLQRQRAILYKRLKVPPAINQFTQALDRQTATQLLKLAHKYRPETKQEKKQRLLARAEK KAAGKGDVPKKRPPVLRAGVNTVTTLVENKKAQLVVISHDMDPIELVVFLPALCRKMGVP YCIIKGKARLGRLVHRKTCTTVAFTQVNSEDKGALAKLVEAIRTNYNDRYDEIRRHWGGN VLGPKSVARIAKRKKAKAKELATKLG >gi568815584r:69676285_69897155|GENSCAN_predicted_CDS_7|801_bp atgccgaaaggaaagaaggccaagggaaagaaggtggctccggccccagctgtcgtgaag aagcaggaggctaagaaagtggtgaatcccctgtttgagaaaaggcctaagaattttggc attggacaggacatccagcccaaaagagacctcacccgctttgtgaaatggccccgctat atcaggttgcagcggcagagagccatcctctataagcggctgaaagtgcctcctgcgatt aaccagttcacccaggccctggaccgccaaacagctactcagctgcttaagctggcccac aagtacaggccagagacaaagcaagagaagaagcagagactgttggcccgggccgagaag aaagctgctggcaaaggggacgtcccgaagaagagaccacctgtccttcgagcaggagtt aacactgtcaccaccttggtggagaacaagaaagctcagctggtggtgatttcacacgac atggatcccatcgagctggttgtcttcttgcctgccctatgtcgtaaaatgggggtccct tactgcattatcaaggggaaggcaagactgggacgtctagtccacaggaagacctgcacc actgtcgccttcacacaggtgaactcagaagacaaaggcgctttggctaagctggtggaa gctatcaggaccaattacaatgacagatacgatgagatccgccgtcactggggtggcaat gtcctgggtcctaagtctgtggctcgtatcgccaagcgcaaaaaggcaaaggctaaagaa cttgccactaaactgggttaa