GENSCAN 1.0 Date run: 3-Nov-116 Time: 03:13:08 Sequence gi568815592f:75502525_75815591 : 313067 bp : 38.66% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 6980 7144 165 1 0 63 43 175 0.750 4.93 1.02 PlyA + 7482 7487 6 1.05 2.06 PlyA - 7830 7825 6 1.05 2.05 Term - 13320 13071 250 2 1 67 38 174 0.525 4.49 2.04 Intr - 15207 15080 128 0 2 49 72 64 0.588 -0.54 2.03 Intr - 15689 15483 207 2 0 56 76 185 0.408 12.45 2.02 Intr - 17661 17421 241 2 1 56 50 113 0.114 0.93 2.01 Init - 40884 40865 20 0 2 61 97 48 0.178 2.55 2.00 Prom - 40933 40894 40 -5.25 3.00 Prom + 41899 41938 40 -4.35 3.01 Init + 47112 47129 18 2 0 105 65 -1 0.119 -0.62 3.02 Term + 58279 58467 189 0 0 -56 45 545 0.904 32.27 3.03 PlyA + 58693 58698 6 -0.45 4.00 Prom + 59893 59932 40 -7.55 4.01 Sngl + 60999 61370 372 2 0 94 43 320 0.967 23.97 4.02 PlyA + 62621 62626 6 1.05 5.02 PlyA - 63492 63487 6 -1.75 5.01 Sngl - 64101 63586 516 0 0 29 32 236 0.343 7.89 5.00 Prom - 75466 75427 40 -4.05 6.00 Prom + 84185 84224 40 -2.75 6.01 Init + 100001 100143 143 1 2 90 55 153 0.069 11.86 6.02 Intr + 117848 117896 49 2 1 50 95 63 0.090 0.96 6.03 Intr + 131057 131202 146 1 2 92 65 152 0.992 11.46 6.04 Intr + 132183 132287 105 0 0 69 116 79 0.970 7.21 6.05 Intr + 138160 138180 21 1 0 64 121 49 0.602 1.54 6.06 Intr + 145207 145277 71 1 2 76 111 27 0.591 1.71 6.07 Intr + 156738 156883 146 1 2 114 107 76 0.988 11.18 6.08 Intr + 160697 160994 298 1 1 65 108 187 0.983 13.92 6.09 Intr + 164188 164417 230 2 2 16 80 228 0.999 11.47 6.10 Intr + 168029 168196 168 1 0 104 87 153 0.950 16.02 6.11 Term + 176059 176172 114 0 0 101 54 99 0.605 5.39 6.12 PlyA + 177244 177249 6 1.05 7.02 PlyA - 181120 181115 6 1.05 7.01 Sngl - 182311 181673 639 1 0 43 41 263 0.981 13.03 7.00 Prom - 183579 183540 40 -6.15 8.02 PlyA - 183748 183743 6 1.05 8.01 Sngl - 185001 184765 237 0 0 88 37 191 0.881 8.84 8.00 Prom - 188508 188469 40 -8.45 9.00 Prom + 189793 189832 40 -6.25 9.01 Init + 190356 190405 50 2 2 111 80 -58 0.874 -3.80 9.02 Intr + 193280 193399 120 2 0 65 91 101 0.954 6.79 9.03 Intr + 194901 194993 93 0 0 86 75 43 0.634 0.96 9.04 Intr + 200121 200548 428 0 2 88 91 259 0.876 18.71 9.05 Intr + 207003 207106 104 1 2 62 98 92 0.717 6.57 9.06 Intr + 208804 208892 89 0 2 47 72 62 0.477 -1.65 9.07 Intr + 210989 211057 69 2 0 79 89 52 0.403 1.78 9.08 Intr + 211151 211301 151 2 1 40 90 53 0.506 -0.06 9.09 Term + 212861 213070 210 1 0 58 47 186 0.990 7.81 9.10 PlyA + 213135 213140 6 1.05 10.03 PlyA - 214338 214333 6 1.05 10.02 Term - 246534 246172 363 0 0 45 50 212 0.556 6.68 10.01 Init - 277337 277293 45 2 0 61 115 32 0.519 3.93 10.00 Prom - 281765 281726 40 -4.95 11.04 PlyA - 281798 281793 6 1.05 11.03 Term - 286321 286227 95 2 2 133 48 61 0.238 3.61 11.02 Intr - 310638 310564 75 1 0 59 92 80 0.481 4.07 11.01 Intr - 311282 311153 130 2 1 27 94 96 0.597 3.45 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592f:75502525_75815591|GENSCAN_predicted_peptide_1|54_aa MQTIEGKRKSNPDRRNSTSEILDVEQNTEPSRNIKTMAEYGREKFGKLHKHPAM >gi568815592f:75502525_75815591|GENSCAN_predicted_CDS_1|165_bp atgcaaactatagaggggaagaggaagagtaatcctgacagaaggaatagcacgtcggaa atcctcgatgtggaacagaacacagaaccttcaagaaacataaagacaatggcagagtat gggagagagaaatttggaaagctacataagcatcctgccatgtag >gi568815592f:75502525_75815591|GENSCAN_predicted_peptide_2|281_aa MPDDKARVSNVKVRALIGKEWDSISGDGHVWEDLDEAGDIESLNSDESSLPVEEVAPSPV EVASPPIVVSAFPLPSEGINPVLPEEMPSPKGPMAFYQGNCALGKGNNRTFWGLLVTGSE LTLITGDPKHLRGPPVRVEAFGSQMINGIVAQVHLTKTDGYWRMTVDYCKLNQVVTLITS AVPDVVLLLKQINTSPGTCLPIQMLIFGIDTLNTDLPSLHEMLLPKLLSTDLQNALSTVM VFYTALLLIKVHFSQKKCGNGPVVEEFTDLTMFTTILKLLA >gi568815592f:75502525_75815591|GENSCAN_predicted_CDS_2|846_bp atgcctgatgacaaagccagggtgtctaatgttaaagtgagggcattaattgggaaagaa tgggattccataagtggagatgggcatgtgtgggaagatcttgatgaagctggggacatt gagtctctaaattctgatgagtctagtttgccagtggaagaggtcgccccatccccagtg gaagtagcttctccacccatagtagtgtcagcctttccactaccatctgaggggattaac cctgtcttgcctgaggaaatgccttccccaaagggacctatggccttttaccagggtaac tgtgcattggggaaaggaaacaatcggacattttgggggctattggtcactggctctgaa ttgacactgattacaggagatccaaaacatctccgtggccctccagtcagagtagaggct tttggaagtcagatgatcaatggaattgtagctcaggtccatctcacgaagacagatgga tattggagaatgacagtagactattgtaagcttaaccaggtagtgactttgattacatct gctgtaccagatgtggttttattgcttaagcaaattaacacatctcctggtacctgccta ccaattcaaatgctaatctttggaatagatactctgaatacagatttgccttccctgcat gaaatgcttctgccaaaactactatccacagacttacagaacgctttatccactgtcatg gtattctatactgcattgcttctgattaaggtacacttctcgcaaaagaaatgtggcaat gggcctgtggtcgaagaattcactgatcttaccatgttcaccaccatcctgaagctgctg gcttaa >gi568815592f:75502525_75815591|GENSCAN_predicted_peptide_3|68_aa MERHISKKKLRKQKKKKEKEKEEEGGEGGEGGEEEEEEEEEEEEEEEEEEEEEEEEEEEE EEEQLLKQ >gi568815592f:75502525_75815591|GENSCAN_predicted_CDS_3|207_bp atggagagacacatttctaaaaagaaacttagaaaacaaaagaagaagaaggagaaggag aaagaagaagaaggaggagaaggaggagaaggaggagaagaagaagaagaagaagaagaa gaagaagaagaagaagaagaagaagaagaagaggaagaggaagaggaagaggaagaggaa gaagaagagcagctgcttaagcagtga >gi568815592f:75502525_75815591|GENSCAN_predicted_peptide_4|123_aa MKNLNVMTPPKDHPSSPAIVPKQNENSEMTDKEFNAWIARKLNDIQDKIENKLKETPKAI QKMKEKMNILKTNQSELLELKNSLKEFQNAVESFVNRLDQEEERISELEARRLVFESSLV RQK >gi568815592f:75502525_75815591|GENSCAN_predicted_CDS_4|372_bp atgaaaaacctgaatgtaatgacaccaccaaaggatcaccctagctctccagcaattgtc cctaaacaaaatgaaaactcagaaatgacagataaggaattcaatgcatggattgcaaga aagctcaatgatatccaagataagattgaaaataaactcaaagaaactcctaaagcaatt cagaaaatgaaagaaaagatgaacatcttaaaaacaaatcaatcagagcttctggaattg aaaaactcacttaaggaatttcaaaatgcagttgaaagctttgtcaatagactagaccaa gaagaagaaagaatttcggagctcgaagctcgaagactagtctttgaatctagcctagtc agacagaaataa >gi568815592f:75502525_75815591|GENSCAN_predicted_peptide_5|171_aa MYENAWKSRQKFAAGVKPSSRTSTRDVWRGNVGLEPPHGVPNGALPSRAMRRKPLPSRPQ NGRYTDNLHHVSGNAIGTQRQPLRTAMGAEPCKATWAELSKVLGAHPLHQCGLDIRHGVK GNNFGALRSYDCPAEFQTCMELVAPLFWLIFAFWNGYVYPMSVLPLYLGSN >gi568815592f:75502525_75815591|GENSCAN_predicted_CDS_5|516_bp atgtatgaaaatgcctggaagtccaggcagaagtttgctgcaggggtgaagccctcatcg agaacctctactagggatgtgtggaggggaaatgtggggttggagcccccacatggagtc cccaatggggcactgcctagtagagccatgagaagaaagccactgccctccagaccccag aatggtagatacactgacaacttgcatcatgtgtctgggaatgccataggcactcaacgc cagcccttgagaacagccatgggagctgagccttgcaaagccacatgggctgagctgtcc aaggtcttgggagcccacccactgcatcagtgtggcctggatataagacatggagtcaaa ggaaacaattttggagctttaagatcttatgactgccctgctgagtttcagacttgcatg gagcttgtagctcctttgttttggctgatttttgccttttggaatgggtatgtttatcca atgtctgtactcccattgtatcttggaagtaactaa >gi568815592f:75502525_75815591|GENSCAN_predicted_peptide_6|496_aa MAAGKSGGSAGEITFLEGTSVSALDGEKGGYTGKRGPQNPGYSVICAGILEAGKLDQEVQ IQYLLNRRSEIVANSSGEFILKTYVRRNKSESFKTLKGNPIGLNMLSNNKKLSENTQNTS LCSGTVVHGRRFHHAHAQIPVVKTAAQSSLDRKERKEYPPHVQKVEINPVRLSRLQGVER IMKKTEESESQVEPEIKRKVQQKRHCSTYQPTPPLSPASKKCLTHLEDLQRNCRQAITLN ESTGPLLRTSIHQNSGGQKSQNTGLTTKKFYGNNVEKVPIDIIVNCDDSKHTYLQTNGKV ILPGAKIPKITNLKERKTSLSDLNDPIILSSDDDDDNDRTNRRESISPQPADSACSSPAP STGKVEAALNENTCRAERELRSIPEDSELNTVTLPRKARMKDQFGNSIINTPLKRRKVFS QEPPDALALSCQSSFDSVILNCRSIRVGTLFRLLIEPVIVSFESKIQLRSKQEFQFFDEE EETGENHTIFIGPVEK >gi568815592f:75502525_75815591|GENSCAN_predicted_CDS_6|1491_bp atggcggccggcaagagcggcggtagcgcaggggagattacttttctggaaggtacgtct gtttctgcccttgacggggagaagggagggtatactggaaaacgtggcccccagaacccc ggatattcagttatctgcgctggaattctggaagctgggaagttagatcaagaggtccag attcagtatctgttaaatcgtcgatctgaaattgttgctaatagctctggtgaattcatc ttgaagacatatgtaagacgaaacaagtctgaaagttttaaaactttgaaaggcaaccca attggacttaacatgttgagcaacaataagaaattgagtgaaaatacgcaaaatacgtca ttatgttctggaactgtagttcatggtagacgttttcatcatgctcatgcacagatacca gtagtaaaaacagcagcccaaagcagtctggaccgaaaagaaaggaaagaatacccacct catgtccaaaaagttgaaattaatcctgtaaggttaagtcggctccaaggtgttgaacgt ataatgaagaaaacagaagagtccgaatcacaagtggagcctgaaattaagaggaaagta caacagaaacgacactgtagtacctatcagcctactcctcctctatctcctgcttcaaaa aaatgtttaacccatttagaggatttgcaaagaaattgcagacaagctattactttgaat gagtctactggaccattattaagaacgtcaattcatcagaattctggaggacagaagtca caaaacacaggattaacaaccaagaagttttatggcaacaatgtggaaaaggttccaatt gatattattgtgaattgtgatgacagtaaacacacttatttacagactaatggaaaagtc attttacctggggcaaaaatacccaaaatcacaaacttgaaagaaaggaaaacaagtttg tcagacctaaatgatccaatcattttgtccagtgatgatgatgatgacaacgacagaact aacagaagagaaagcatatctcctcagcctgctgattcagcatgttcttcccctgcacca tccactggaaaagtagaagcagcgctaaatgaaaatacttgcagagcagagcgtgaacta cgaagcattccagaagactcagagttaaatacagttacattgccaagaaaagcaagaatg aaagaccagtttggcaattctattatcaacacacctctgaaacgtcgtaaagtgttttct caagaacctccagatgctttagctttaagctgccaaagttcctttgacagtgtcatttta aactgtcgaagtatacgagtaggaacactcttccggctgttaatagagcctgtaattgta tcatttgaatctaaaatacaacttagaagcaaacaagaatttcagttttttgatgaagaa gaagaaactggagaaaaccacaccatcttcattggcccagtagaaaagtga >gi568815592f:75502525_75815591|GENSCAN_predicted_peptide_7|212_aa MNIDVKILNKTLANEVQQHIKKLIHHDQVGFIPGMQGWFNICKSINIIHRINRTNDKNHM IISIDAEKAFDKIQHPFMLKTLNKLGIDGKYLKIIRAIYDKPTANIILNGQKLEAFHLKT STRQGCPLSPLLFNIVLEVLARAIRQEKEIKRIQLGNGEVKLSLFADDMIVYLENPIISA QNLLMLISNFSKVSGYKINVQKSQAFLYTINR >gi568815592f:75502525_75815591|GENSCAN_predicted_CDS_7|639_bp atgaacattgatgtgaaaatcctcaataaaacactggcaaacgaagtccagcagcacatc aaaaagcttatccaccatgatcaagtgggcttcatccctgggatgcaaggctggttcaac atatgcaaatcaataaacataatccatcgtataaacagaaccaatgacaaaaaccacatg attatctcaatagatgcagaaaaggcctttgacaaaattcaacatcccttcatgctaaaa actctcaataaactaggtattgatggaaaatatctcaaaataataagagctatttatgac aaacccacagccaatatcatcctgaatgggcaaaaactggaagcattccatttgaaaacc agcacaagacaaggatgccctctctcaccactcctattcaacatagtgttggaagttctg gctagggcaatcaggcaagagaaagaaataaagcgtattcaattaggaaatggggaagtc aaactgtccctgtttgcagatgacatgattgtgtatttagaaaaccccatcatctcagcc caaaatctccttatgctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtg caaaaatcacaagcattcctatacaccattaacagataa >gi568815592f:75502525_75815591|GENSCAN_predicted_peptide_8|78_aa MGRNQSRKAENSKNQSTSSSPKDCSFLPATEQSWTENDFDELTEAGFRRSVITNFSELKE DVRTHRKEAKHHEKRLDE >gi568815592f:75502525_75815591|GENSCAN_predicted_CDS_8|237_bp atggggagaaaccagagcagaaaagctgaaaattctaaaaatcagagcacctcttcttct ccaaaggattgcagcttcttgccagcaacggaacaaagctggacggagaatgactttgac gagttgacagaagcaggcttcaggaggtcagtaataacaaacttctccgagctaaaggag gatgttcgaacccatcgcaaggaagctaaacaccatgaaaaaagattagatgaatag >gi568815592f:75502525_75815591|GENSCAN_predicted_peptide_9|437_aa MAKPHVYPHQKKTKLARYLVLEKLKKEDADRIHIFSSFFYKRLNQRERRNHETTNLSIQQ KRHGRVKTWTRHVDIFEKDFIFVPLNEAAHWFLAVVCFPGLEKPKYEPNPHYHENAVIQK CSTVEDSCISSSASEMESCSQNSSAKPVIKKMLNKKHCIAVIDSNPGQEESDPRYKRNIC SVKYSVKKINHTASENEEFNKGESTSQKVADRTKSENGLQNESLSSTHHTDGLSKIRLNY SDESPEAGKMLEDELVDFSEDQDNQDDSSDDGFLADDNCSSEIGQWHLKPTICKQPCILL MDSLRGPSRSNVVKILREYLEVEWEVKKGSKRSFSKDVMKGSNPKVPQQNNFSDCGVYVL QYVESFFENPILSFELPMNLANWFPPPRMRTKREEIRNIILKLQEDQSKEKRKHKDTYST EAPLGEGTEQYVNSISD >gi568815592f:75502525_75815591|GENSCAN_predicted_CDS_9|1314_bp atggcgaaaccccatgtttacccccaccaaaaaaaaacaaagttagccagatacttggtg cttgaaaaactgaagaaggaagacgctgaccgaattcatatattcagttcttttttctat aaacgccttaatcagagagagaggagaaatcatgaaacaactaatctgtcaatacagcaa aaacggcatgggagagtaaaaacatggacccggcacgtagatatttttgagaaggatttt atttttgtaccccttaatgaagctgcacactggtttttggctgttgtttgtttccccggt ttggaaaaaccaaagtatgaacctaatcctcattaccatgaaaatgctgtcatacagaaa tgttcaactgtagaggacagttgtatttcttcttcagccagtgaaatggagagttgttca caaaactcttctgccaagcctgtaattaagaagatgctaaacaaaaaacattgcatagct gtaattgattccaatcctgggcaggaagaaagtgaccctcgttataagagaaacatatgc agtgtaaaatacagtgtgaaaaaaataaatcatactgcgagtgaaaatgaagaattcaat aaaggagaatctacatcccagaaagttgctgataggactaaaagtgagaatggcctacag aatgaaagtttaagttccacacatcatacagatggcttaagcaaaatcagactaaactat agcgatgaatcacctgaagctggtaaaatgcttgaagatgaactcgtcgacttctcagaa gatcaggataaccaggatgatagcagtgacgatggattcctcgctgatgacaactgcagt tcagaaataggacagtggcatttaaagcctactatctgtaaacaaccttgtatcctactt atggactcactccgaggcccttctcggtcaaatgttgtcaaaattttaagagagtattta gaagtggaatgggaagttaaaaaaggaagcaaaagaagtttttccaaagatgttatgaag ggctctaatccaaaagtaccacagcaaaacaacttcagtgactgtggtgtatatgtattg cagtatgtagagagcttttttgagaatccaattctcagttttgaactacctatgaatttg gcaaactggtttcctccaccaagaatgagaacaaaaagagaagaaatccgaaacataatt ctgaagctacaggaagatcagagcaaagagaaaagaaagcataaggacacttactcaaca gaagcacctttaggcgaaggaacagaacaatatgtcaatagtatctcagattga >gi568815592f:75502525_75815591|GENSCAN_predicted_peptide_10|135_aa MKSVTSYSNKLVKKKSLWGEGRRPAAQRGAAPGQAGSSCSSHGETASFAAVVVSSPGKAG PVRLDTVTVQSTNSENQFKQGESCSEKGRGVRGSARGTQEPGVRAWAAARLRGGCGKNHC SLGPGSGRLGLGCPA >gi568815592f:75502525_75815591|GENSCAN_predicted_CDS_10|408_bp atgaaaagtgtaacaagctacagcaacaaactggttaagaaaaagagcctctggggcgag ggtaggcgccctgcagcgcagagaggggctgcccctggacaagccgggagttcttgttct tcccatggcgaaacggcctcttttgccgcggtagttgtttccagtcccggcaaagcaggt cctgtccggctggatactgtgactgtgcagtctaccaacagcgagaatcagttcaagcag ggagaaagttgttcggagaaggggcgaggagtccggggctccgcgcgggggacccaggaa ccaggtgtccgcgcttgggcggcagcgaggctgcgtggcggttgcggcaagaaccactgc tcactcggccccggatcgggccggcttggactgggctgccctgcatag >gi568815592f:75502525_75815591|GENSCAN_predicted_peptide_11|99_aa AEDNSLIWGEEPSPSRIHHLLTKEPLGPEQSAVVARQYSPRVTGGINPKEKGLMNKKKSS EGTKLTGNSPHSPAFSTAPSSSKYSRFIPSLRFSSNGFF >gi568815592f:75502525_75815591|GENSCAN_predicted_CDS_11|300_bp gcagaagataactcactgatttggggggaagaacccagtcccagcagaattcatcacctg ctgactaaagagcccctggggcctgaacagtcagcagtggtagccaggcaatactcacca cgggtcacaggtggcatcaatccaaaagaaaagggcttaatgaacaagaagaaatcatct gaaggtacaaaactcactggtaatagtccacactctccagccttctcaactgctccaagt tcctccaagtattccaggtttatcccatctctacgctttagctcaaatggcttcttctga