GENSCAN 1.0 Date run: 4-Nov-116 Time: 20:43:16 Sequence gi568815584f:69668157_69871458 : 203302 bp : 44.27% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.05 PlyA - 1662 1657 6 1.05 1.04 Term - 1911 1714 198 0 0 61 44 78 0.786 -1.90 1.03 Intr - 2635 2536 100 0 1 115 80 87 0.922 10.71 1.02 Intr - 4880 4849 32 2 2 113 100 2 0.811 0.83 1.01 Init - 10374 10315 60 0 0 102 33 52 0.782 2.35 1.00 Prom - 13500 13461 40 -2.46 2.00 Prom + 16781 16820 40 -2.56 2.01 Init + 22317 22377 61 2 1 51 80 45 0.219 1.41 2.02 Intr + 30797 30920 124 0 1 61 34 87 0.329 0.24 2.03 Intr + 33243 33319 77 0 2 82 83 54 0.767 3.46 2.04 Intr + 35239 35436 198 2 0 115 56 218 0.982 20.62 2.05 Intr + 36448 36586 139 2 1 76 109 105 0.956 10.92 2.06 Intr + 40521 40948 428 0 2 96 101 243 0.617 19.83 2.07 Intr + 41468 41492 25 0 1 67 115 -6 0.035 -2.92 2.08 Intr + 51050 51220 171 2 0 61 100 66 0.019 4.16 2.09 Intr + 57102 57215 114 0 0 29 80 95 0.013 2.36 2.10 Term + 90811 90955 145 1 1 47 54 127 0.185 2.58 2.11 PlyA + 92815 92820 6 -0.45 3.00 Prom + 93616 93655 40 -3.96 3.01 Init + 100001 100126 126 1 0 78 103 109 0.767 11.56 3.02 Intr + 100448 100518 71 1 2 44 100 70 0.756 1.78 3.03 Intr + 100642 100740 99 1 0 74 92 34 0.560 1.63 3.04 Intr + 102839 102949 111 2 0 83 70 41 0.520 1.29 3.05 Term + 103041 103305 265 0 1 59 37 125 0.444 -0.52 3.06 PlyA + 103830 103835 6 1.05 4.06 PlyA - 104097 104092 6 1.05 4.05 Term - 108232 108126 107 2 2 103 41 85 0.990 3.87 4.04 Intr - 110373 110177 197 2 2 48 96 104 0.989 6.26 4.03 Intr - 111204 111026 179 0 2 112 90 129 0.998 14.22 4.02 Intr - 118151 117941 211 1 1 100 116 277 0.686 30.62 4.01 Init - 128999 128644 356 2 2 74 96 517 0.991 47.71 4.00 Prom - 131951 131912 40 -5.06 5.06 PlyA - 132486 132481 6 1.05 5.05 Term - 135087 134897 191 1 2 56 42 148 0.952 4.61 5.04 Intr - 140759 140661 99 0 0 83 53 81 0.331 4.18 5.03 Intr - 154842 154756 87 1 0 75 94 21 0.349 1.34 5.02 Intr - 155758 155670 89 0 2 44 108 41 0.495 1.31 5.01 Init - 156190 155952 239 1 2 81 36 199 0.712 11.49 5.00 Prom - 161066 161027 40 -5.76 6.00 Prom + 162432 162471 40 -4.56 6.01 Init + 166066 166072 7 0 1 40 84 0 0.172 -4.25 6.02 Intr + 174563 174709 147 0 0 43 58 213 0.559 14.01 6.03 Intr + 177599 177744 146 0 2 89 45 41 0.090 -0.10 6.04 Term + 184861 184926 66 0 0 104 42 51 0.122 0.04 6.05 PlyA + 186825 186830 6 1.05 7.03 PlyA - 187227 187222 6 1.05 7.02 Term - 191663 191586 78 2 0 76 44 82 0.732 0.36 7.01 Init - 193264 193142 123 1 0 85 65 67 0.717 4.27 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584f:69668157_69871458|GENSCAN_predicted_peptide_1|129_aa MPSRSIVFNSEQLESPIDPSGSRLCAREYKRLPATAGCTCHWQVPHFNNDDDNSNISSSP QPYMQPYQNPGSLGFGARSIAFMYHDVQNKRFKFPKAAITRGHKKLANPFKSKSVKILRV GILLLGKPV >gi568815584f:69668157_69871458|GENSCAN_predicted_CDS_1|390_bp atgcccagccgcagcattgtttttaacagtgaacaactggagtcgcccatagatccatca ggttcaagactctgtgcaagagagtacaaaagactccccgcaacagcgggctgtacctgc cactggcaagtgccacattttaacaacgatgatgataacagcaacatttcatcttcacca caaccctatatgcaaccctaccaaaacccagggtcattagggtttggagcaaggtctata gcgttcatgtaccatgatgtacagaacaaaaggttcaagttccccaaagctgcaattacc agaggtcataaaaagctggccaatcctttcaaatcaaagtcagtcaagattctaagagtt ggaatcctgctcctaggaaaacctgtctga >gi568815584f:69668157_69871458|GENSCAN_predicted_peptide_2|493_aa MWEGFKEKACKRSRGIFQEHELSTELPSANLELSENDPGAQIKTVSCLDCGFWKPLNNFF AMSCHGVLSINYATEMNQLDTVQGEPGMCPLPPEPENGGYICHPRPCRDPLTAGSVIEYL CAEGYMLKGDYKYLTCKNGEWKPAMEISCRLNEDKDTHTSLGVPTLSIVASTASSVALIL LLVVLFVLLQPKLKSFHHSRRDQGVSGDQVSIMVDGVQVALPSYEEAVYGSSGHCVPPAD PRVQIVLSEGSGPSGRSVPREQQLPDQGACSSAGGEDEAPGQSGLCEAWGSRASETVMVH QATTSSWVAGSGNRQLAHKETADSENSDIQSLLSLTSEEYTDAEETKAHRRIPLSNVQNQ DTKPFSKMQSQMLSSLSSQDGFWVPGKIRQGQETQAAPVSSLGLASERLPGLWKIVKAAL ANPYIQSQKWELDQGFFPQQEPIQASDRLSDEQITPQKASDIRKYLLEFWKGSKKLQQKL LEGTHSPPLAYVV >gi568815584f:69668157_69871458|GENSCAN_predicted_CDS_2|1482_bp atgtgggaaggattcaaagagaaagcttgtaagaggagcagagggatcttccaggagcac gagttaagtacagagcttccttctgcgaacctggaattgtcagagaatgatcctggtgca cagatcaagacagtttcttgcctagactgtggcttctggaagccactgaacaatttcttt gccatgagctgccatggggtgctcagcattaattatgccacggaaatgaaccaattagac acagtccaaggagagccagggatgtgccccctaccaccggagccagagaatggtggctac atctgccacccccggccctgcagagaccccctgacagcaggcagtgtcatcgaatacctg tgtgctgaaggctacatgttgaagggcgattacaaatacctgacgtgtaagaatggcgag tggaaaccagccatggagattagctgccgtctcaacgaggataaagacacccacacatca cttggggtccccacgctgtctatagtggcttctactgccagctccgtggcgctcattctc ctcctcgtggtgctgtttgtgctgctgcagccaaagctgaagtctttccatcatagcagg cgtgaccagggggtatctggggaccaggtctccatcatggtggatggagtccaggttgca ctaccatcatacgaggaggctgtatatggcagttctggtcactgtgtgccacctgctgac cccagagtacagattgtgctgtcagaagggtctgggcccagtgggaggagcgtgccaagg gagcaacagctgccggaccaaggggcctgctcctctgcaggtggagaagatgaggcccca ggccagtctggactatgtgaagcctggggctctcgggcctcagagactgtgatggtgcat caggcaaccacctcttcctgggtggccggctcagggaaccgccaactggcacacaaagaa actgcagattcagagaacagtgacatacaaagccttttatccctcacgtcagaggagtac acagatgctgaggaaactaaggctcacagaagaatccctctcagcaatgtccaaaaccag gacaccaagccatttagcaaaatgcagtcccagatgctcagcagtctcagcagtcaggat gggttctgggtccctggaaagatcaggcaaggccaggagacccaggcagcccctgtctcc tccctgggccttgcttctgaaaggctgcccggtctatggaagattgtaaaggcagcccta gcaaacccatacattcagtctcagaagtgggagctggaccagggcttcttcccacaacaa gaacccatccaggccagtgatagactcagcgatgagcagataaccccacagaaagccagt gacatcaggaagtatttgctggagttctggaaaggatctaagaagcttcaacagaagctc ctggaagggacgcactctcctcctctggcctacgtggtgtga >gi568815584f:69668157_69871458|GENSCAN_predicted_peptide_3|223_aa MSGCRVFIGRLNPAAREKDVERFFKGYGRIRDIDLKRGFGFVEFEDPRDADDAVYELDGK ELCSERVTIEHARARSRGGRGRGRYSDRFSSRRPRNDRRVVEFASYGDLKNAIEKLSGKE INGRKIKLIEGSKRHRSRSRSRSRTRSSSRSRSRSRSRSRKSYSRSRSRSRSRSRSKSRS VSRSPVPEKSQKRGSSSRSKSPASVDRQRSRSRSRSRSVDSGN >gi568815584f:69668157_69871458|GENSCAN_predicted_CDS_3|672_bp atgagtggctgtcgggtattcatcgggagactaaatccagcggccagggagaaggacgtg gaaagattcttcaagggatatggacggataagagatattgatctgaaaagaggctttggt tttgtggaatttgaggatccaagggatgcagatgatgctgtgtatgagcttgatggaaaa gaactctgtagtgaaagggttactattgaacatgctagggctcggtcacgaggtggaaga ggtagaggacgatactctgaccgttttagtagtcgcagacctcgaaatgatagacgggtg gttgagtttgcctcttatggtgacttaaagaatgctattgaaaaactttctggaaaggaa ataaatgggagaaaaataaaattaattgaaggcagcaaaaggcacaggtcaagaagcagg tctcgatcccggaccagaagttcctctaggtctcgtagccgatcccgttcccgtagtcgc aaatcttacagccggtcaagaagcaggagcaggagccggagccggagcaagtcccgttct gttagtaggtctcccgtgcctgagaagagccagaaacgtggttcttcaagtagatctaag tctccagcatctgtggatcgccagaggtcccggtcccgatcaaggtccagatcagttgac agtggcaattaa >gi568815584f:69668157_69871458|GENSCAN_predicted_peptide_4|349_aa MEAHNASAPFNFTLPPNFGKRPTDLALSVILVFMLFFIMLSLGCTMEFSKIKAHLWKPKG LAIALVAQYGIMPLTAFVLGKVFRLKNIEALAILVCGCSPGGNLSNVFSLAMKGDMNLSI VMTTCSTFCALGMMPLLLYIYSRGIYDGDLKDKVPYKGIVISLVLVLIPCTIGIVLKSKR PQYMRYVIKGGMIIILLCSVAVTVLSAINVGKSIMFAMTPLLIATSSLMPFIGFLLGYVL SALFCLNGRCRRTVSMETGCQNVQLCSTILNVAFPPEVIGPLFFFPLLYMIFQLGEGLLL IAIFWCYEKFKTPKDKTKMIYTAATTEETIPGALGNGTYKGEDCSPCTA >gi568815584f:69668157_69871458|GENSCAN_predicted_CDS_4|1050_bp atggaggcccacaacgcgtctgccccattcaacttcaccctgccacccaactttggcaag cgccccacagacctggcactgagcgtcatcctggtgttcatgttgttcttcatcatgctc tcgctgggctgcaccatggagttcagcaagatcaaggctcacttatggaagcctaaaggg ctggccatcgccctggtggcacagtatggcatcatgcccctcacggcctttgtgctgggc aaggtcttccggctgaagaacattgaggcactggccatcttggtctgtggctgctcacct ggagggaacctgtccaatgtcttcagtctggccatgaagggggacatgaacctcagcatt gtgatgaccacctgctccaccttctgtgcccttggcatgatgcctctcctcctgtacatc tactccagggggatctatgatggggacctgaaggacaaggtgccctataaaggcatcgtg atatcactggtcctggttctcattccttgcaccatagggatcgtcctcaaatccaaacgg ccacaatacatgcgctatgtcatcaagggagggatgatcatcattctcttgtgcagtgtg gccgtcacagttctctctgccatcaatgtggggaagagcatcatgtttgccatgacacca ctcttgattgccacctcctccctgatgccttttattggctttctgctgggttatgttctc tctgctctcttctgcctcaatggacggtgcagacgcactgtcagcatggagactggatgc caaaatgtccaactctgttccaccatcctcaatgtggcctttccacctgaagtcattgga ccacttttcttctttcccctcctctacatgattttccagcttggagaagggcttctcctc attgccatattttggtgctatgagaaattcaagactcccaaggataaaacaaaaatgatc tacacagctgccacaactgaagaaacaattccaggagctctgggaaatggcacctacaaa ggggaggactgctccccttgcacagcctag >gi568815584f:69668157_69871458|GENSCAN_predicted_peptide_5|234_aa MPSGTIWILEARHSHSGVLLWPIYRVTRKAASFEREPEQDKALQQAQSAVQAALPLGLYD PADPMVLEVSVADNDAGPHRHKVGRAQQHPIIQWKSYILDQAQTGPEGTKRAEICPHQVA GLIDLDYQDEISLLIHNRAMQHQIQKFTNVLDGTERVQKYSLSDIAASAIGSHKMPCSFD LSLVEQLLWGHQLPDKKSDYSEITMQMTYPQQPYEDPKQKPSAEPVISQNCERE >gi568815584f:69668157_69871458|GENSCAN_predicted_CDS_5|705_bp atgcctagtgggactatttggattttggaagcaagacattctcattcgggtgtgttactc tggcccatttatcgagtgacccgaaaggctgccagttttgagcgagaaccagaacaggac aaggcgctgcaacaggcccagtctgctgtgcaagctgctctgccacttgggctatatgac ccagcagatccaatggtgcttgaggtgtcagtggcagataatgatgctggcccccataga cataaagtgggtcgtgcacagcagcatcccatcatccaatggaagtcgtatatacttgat caggctcaaacaggtcctgaaggcacaaaaagggcagaaatttgtcctcaccaggtggct gggctgattgacctggactatcaagatgaaattagtctactaatccataacagagctatg cagcaccaaattcagaaatttacaaatgtgcttgatggcacagagagagttcagaagtac tcactgagtgatattgctgcttcagccataggaagtcataagatgccttgcagctttgac ttaagcctcgtggagcaattgctctggggccatcagctaccagataagaagtctgactac tctgagatcacgatgcagatgacttatcctcagcaaccatatgaggaccccaagcaaaag ccatcagctgaacctgtcatatcccagaactgtgaaagagaataa >gi568815584f:69668157_69871458|GENSCAN_predicted_peptide_6|121_aa MQAAKRPSDEWEGKAEGGEEKEKEEEEEEEEEEEEVEEVAVAVQARQSGMECDSPQFFGA SIAPSQSGPNLELKQVAGGSQSGLLIPLAPVIEYVAQPEAAGTPANCLEGAMTTRHMDTT S >gi568815584f:69668157_69871458|GENSCAN_predicted_CDS_6|366_bp atgcaagctgcaaagaggcccagtgatgaatgggaggggaaagcggaaggtggggaggag aaggagaaggaggaggaggaggaggaggaggaggaggaggaggaggtggaggaggtggca gtggcggtacaagcaaggcagagtgggatggagtgtgacagcccccaattttttggtgca tccattgctccctcacaatcaggccccaacttagagctgaagcaggtggcaggtggaagc cagtcagggcttctcattcctctggctccagtaattgagtatgtggctcagccagaagca gcaggaacacctgcaaactgcctggaaggggccatgaccaccagacacatggataccaca agttga >gi568815584f:69668157_69871458|GENSCAN_predicted_peptide_7|66_aa MEYRAPGRPKSRKHPGRKEEGLTEMKRDQDLHEKGPEHTCRNVRLDTQQLAQNQASAQGA LNTVDE >gi568815584f:69668157_69871458|GENSCAN_predicted_CDS_7|201_bp atggagtacagggccccagggagacctaagtccagaaagcacccaggaagaaaagaggag ggcctaacggagatgaagagggaccaagatctgcatgaaaaagggccggagcacacctgc cggaacgtcagactggatacccagcagctggcacagaaccaggcgtcggctcagggagcc ctgaatactgtcgatgaatga