GENSCAN 1.0 Date run: 4-Nov-116 Time: 06:00:34 Sequence gi568815586f:92324681_92525043 : 200363 bp : 39.66% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 14945 15048 104 1 2 85 47 104 0.358 5.76 1.02 Intr + 15925 15986 62 1 2 23 100 65 0.146 -1.34 1.03 Term + 29473 29696 224 2 2 16 48 175 0.032 2.40 1.04 PlyA + 30641 30646 6 1.05 2.03 PlyA - 31336 31331 6 1.05 2.02 Term - 39073 38922 152 2 2 55 55 158 0.952 6.29 2.01 Init - 40110 39987 124 0 1 64 83 48 0.837 2.28 2.00 Prom - 41217 41178 40 -5.95 3.00 Prom + 46758 46797 40 -3.75 3.01 Init + 53243 53394 152 1 2 88 28 106 0.049 4.16 3.02 Intr + 62395 62515 121 1 1 67 113 39 0.094 3.88 3.03 Term + 65160 66173 1014 2 0 32 42 390 0.354 19.91 3.04 PlyA + 66233 66238 6 -3.74 4.00 Prom + 66261 66300 40 -10.45 4.01 Sngl + 66339 67457 1119 2 0 71 55 555 0.874 46.97 4.02 PlyA + 68479 68484 6 -0.45 5.08 PlyA - 68822 68817 6 1.05 5.07 Term - 69053 68965 89 0 2 113 36 107 0.868 4.84 5.06 Intr - 70505 70295 211 2 1 -40 83 182 0.109 2.46 5.05 Intr - 77979 77897 83 1 2 -18 78 192 0.058 6.04 5.04 Intr - 81285 81027 259 0 1 0 33 243 0.051 6.11 5.03 Intr - 81976 81814 163 0 1 71 66 119 0.014 7.06 5.02 Intr - 96493 96334 160 2 1 99 64 131 0.326 9.92 5.01 Init - 103466 103415 52 2 1 39 119 56 0.297 5.17 5.00 Prom - 118926 118887 40 -3.95 6.04 PlyA - 119277 119272 6 1.05 6.03 Term - 121234 121114 121 0 1 109 39 103 0.577 4.47 6.02 Intr - 122742 122657 86 0 2 64 100 30 0.216 -0.50 6.01 Init - 125452 125399 54 1 0 68 98 16 0.238 1.93 6.00 Prom - 127683 127644 40 -4.05 7.06 PlyA - 128410 128405 6 1.05 7.05 Term - 142304 142183 122 0 2 83 48 143 0.402 7.46 7.04 Intr - 144075 143836 240 1 0 53 67 96 0.025 0.60 7.03 Intr - 154227 154119 109 0 1 51 67 99 0.216 3.04 7.02 Intr - 191799 191707 93 0 0 104 94 63 0.768 7.74 7.01 Intr - 193622 193518 105 2 0 2 88 114 0.462 2.29 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586f:92324681_92525043|GENSCAN_predicted_peptide_1|129_aa MGVSLDQKHSRHPARSRGMEVSSGSAMAAVNSGGYGGCKIESLVTRRKMSVLDWQGISVF MPKKLSFGQPRSPSCAHKNPETLVDTHTSGWMLRGAEEHISRHQQASDGGMMHMPRRIWL RAVGGQSSP >gi568815586f:92324681_92525043|GENSCAN_predicted_CDS_1|390_bp atgggtgtctccctggatcagaagcacagcagacaccctgccagatccagaggcatggaa gtcagcagtgggtctgcgatggcagcggtcaacagtggtggatatgggggctgtaagatt gagtctctggttactaggcgaaaaatgtcagtattagattggcaaggcatttctgttttc atgcccaaaaagttgtcttttggccagccacgttccccatcctgtgcccataaaaacccc gagaccctagtggacacacacacaagtggctggatgttgagaggggcagaagaacacatc agcagacaccagcaggccagtgatggtggaatgatgcatatgccgaggagaatatggctg cgggcagttggaggacagtccagcccctag >gi568815586f:92324681_92525043|GENSCAN_predicted_peptide_2|91_aa MSAPVFKSQIRYFQEDTVRRHFWQREESTKAQKHETALYFGASSPIVGKRETEILPVTEH MRMQQLLGVTGFLWVLCEEYAFIPVSGRGEE >gi568815586f:92324681_92525043|GENSCAN_predicted_CDS_2|276_bp atgtctgcaccagtttttaaaagtcaaataagatatttccaggaagacacagtaagaagg catttctggcagagggaagagtccacaaaggcacagaagcatgaaacagctctgtatttt ggagcctcatctccaatcgtggggaagagagaaactgaaattcttcctgtgactgaacac atgagaatgcaacagttacttggggtaacaggtttcctgtgggtcctgtgtgaagagtat gcattcattccagtgtcggggaggggagaagaatga >gi568815586f:92324681_92525043|GENSCAN_predicted_peptide_3|428_aa MMPIPVHNKAAYVVTQIVSRPLNIDPMLFIAVESTKLLFKPTSSCRSEKRVAVTLTAKVR GFILEVSKTTKPPAGTNSRHILGAPLGSHHTVSLTIESQEIDFLLDTGVTFSVLISCPRR LSPRSVTIQGILGQPVTRYFSHLLSCNWESLLFSHAFLVMPESPPSLFGRDILAKAGAII NMNMGSKLPIYCPLLEEGINPEVWVLEGQLGRANNAHPVQIRLKYLTTYPYQRQYPLRPE AHKGLQDIVRQLKVQGLVRKCSSPRNTPILGVQKPNSQWRLEQDLRLISEAVIPLYPVVP NPYILLSQITEEAEWFTVLDLKDAFFCIPLHSDSQFLFAFEDPTDHTSQLTWTVLPQGFG DSPHLFGQALAQDLGHFSSPSTLVLQHVDDLLLATSLEASCQQATLDLLNFLANQGYKVS KSKAQLCL >gi568815586f:92324681_92525043|GENSCAN_predicted_CDS_3|1287_bp atgatgccaattccagttcacaacaaagcagcatacgtagtaacccagattgtctcccgg ccactgaatattgacccaatgctgttcattgctgttgagagcactaaacttctcttcaaa ccaaccagcagctgcaggagtgaaaaaagagtagctgtaacactcactgcaaaggtccgt ggcttcattcttgaagtcagcaagaccacgaaaccaccagcaggaaccaactccagacac atcttgggggctcctctgggatctcaccacacggtaagtttaaccattgagagccaggaa attgacttcctcctagacactggtgtgaccttctcagtgttaatctcctgccccagacgg ctgtccccaaggtctgttactatccaaggaatcctgggacagcctgtaaccaggtatttc tcccacctcctcagttgtaattgggagagtttgctcttttcacatgcctttcttgttatg cctgaaagtcccccgtccttatttggaagggacatactagccaaagctggagctattatc aatatgaatatggggagcaagttacccatttattgtcccctacttgaggagggaatcaac cctgaagtctgggtattggaaggacaattgggaagggcaaacaatgcccacccagtccaa atcaggctaaaatacctcaccacttatccttatcaaaggcaatatcccttaaggcctgaa gctcataaaggattacaggatattgttagacaattaaaagttcaaggcttagtaagaaaa tgcagcagtccccgcaacaccccaattctaggagtacaaaaaccaaacagtcagtggaga ctagagcaagatcttagactcatcagtgaggctgtaattcctctatatccagttgtaccc aacccctatatcctgctctctcaaataacagaggaagcagaatggttcactgttctggac ctcaaggatgccttcttctgcattcccctgcactctgactcccagtttctctttgccttt gaggatcccacagaccacacatcccagcttacatggacggtcttgcctcaagggtttggg gatagtcctcatctgtttggtcaggcactggcccaagatctaggccatttctcaagtcca agcactctggtccttcagcatgtggatgatttacttttggctaccagtttggaagcctca tgccagcaggctactctagatcttttgaactttctagctaatcaagggtacaaggtgtct aaatcgaaggcccagctctgcctataa >gi568815586f:92324681_92525043|GENSCAN_predicted_peptide_4|372_aa MARPLYTLIKETKRANTHLVEWELEAERCFKTLKQALVQDPALSLPTGQNFYLYITERVG IALGVLTQTQGTTSQPVAYLSKEISVAAKGWPHCLWVVAAVAILVSEAIKTIQGKDLTIW TIHEVNGILGAKGNLWLSDNHLLRYQVLLLEEPVLQIHRCDFSPEDGEPINHDCQEVVAQ TYAAQKDLLDVPLANPDLNLYTNGSSFVENGIRRAGYPIVSDVTVLESKPLPPGTSTQLA ELVALTCALELGKEKRINVYTDSKNAYLILHAHAAIWKEGECLTSGGTPIKYHKEIVELL HTVQKPKQVAVLHFQSHQKVEEEKAERNHQADAEAKIAARQNLPLEIPMEGHLVWNNPLQ EIKPQYSPTETE >gi568815586f:92324681_92525043|GENSCAN_predicted_CDS_4|1119_bp atggccaggccactctatactctaatcaaggagaccaagagggcaaatactcatctagta gaatgggaactagaggcagaaagatgcttcaaaaccttaaagcaggccctagtacaagat ccagccttaagccttcccacaggacaaaacttctatttatacatcacagagagagtggga atagctcttggagtccttactcagactcaggggacaacctcacaaccagtggcataccta agtaaggaaatcagtgtagcagcaaaaggctggcctcactgtttgtgggtagttgcggca gtggccattttagtatcagaggctatcaaaacaatacaaggaaaggatctcaccatctgg actattcatgaggtaaatggcatactaggtgccaaaggaaatttatggctatcagacaac cacctgcttagataccaggtgctactccttgaggaaccggtgcttcaaatacacaggtgt gacttttctccagaggatggggagccaatcaatcatgattgccaagaagttgtagcccag acttatgctgcccaaaaggatctcttagatgtccccttagctaatcctgaccttaaccta tataccaatggaagttcatttgtggagaatgggatacgaagggcaggttatcccatagtt agtgatgtaacagtacttgaaagtaagcctcttcccccagggaccagcacccagttagca gaactagtggcacttacctgtgccttagaactgggaaaggaaaaaagaataaatgtgtat acagatagcaagaatgcttatctaatcctacatgcccatgctgcaatatggaaagaaggg gagtgcctaacctctgggggaacccccattaaataccacaaggaaattgtggagttattg cacacagtgcaaaaacccaagcaggtggcagtcttacacttccaaagccatcaaaaagtt gaagaagaaaaggcagaaagaaaccatcaggcagatgctgaggccaaaattgctgccagg cagaacctcccattagaaatacctatggaaggacacttggtatggaacaaccctctccaa gagattaagccccagtattccccgaccgaaacagaatga >gi568815586f:92324681_92525043|GENSCAN_predicted_peptide_5|338_aa MNKLGHNELKECLKTATVALLTSLPPNYEISEGKPRAISAYVRAGKGNVTRRRKKTHLGN DDGKKEAQEKMARDLQTAMPEPPTPSMGSSAARASPMSTAPCSTAPSPINHPRAEECGRK APAAPVCSFTLEASETTNPLGGTNNSRRAALRAVTLTAKVPSFTPELARPRTHQKEDTPN TSEHQKEQTPDTLPLRTVTLTVRVRSFILEVKEEGEKEEEEEKEEEEEKQEHEEEGGGES LQSKDNLRLLKEDSTSHSISYLAKLPKHKPLACDKVLGEADSKERSVVEEALDGQPPADY SLGHDLTEGGHFNCCWELGDDHSSYFLLDGCEEGALLL >gi568815586f:92324681_92525043|GENSCAN_predicted_CDS_5|1017_bp atgaacaaattggggcacaacgaacttaaggaatgccttaagactgccactgtggccttg ctaacaagtttacctccaaactatgagatttcagaaggaaaaccaagagccatctctgca tatgtgagagcgggcaaaggaaatgtgacaaggagaagaaagaagacacacttgggcaat gatgatgggaagaaggaagcacaggagaaaatggctcgggacttgcagaccgccatgcct gagcctcccaccccctccatgggctcctctgcggcccgagcctccccgatgagcaccgcc ccctgctccacggcgcccagtcccatcaaccacccaagggctgaggagtgtgggcgcaag gcgcctgcagccccggtctgcagcttcactcttgaagccagcgagaccacgaacccactg ggaggaacgaacaactccagacgcgctgccttaagagctgtaacactcaccgcgaaggtc cccagcttcactcctgagctggcgagaccacgaacccaccagaaggaagatactccgaac acatcggaacaccagaaggaacaaactccggacacgctgcctttaagaactgtaacactc accgtgagggtccgcagcttcattcttgaagtcaaagaggaaggggaaaaagaagaagag gaagagaaggaggaagaagaagaaaaacaagaacatgaagaagaaggagggggagagtcc cttcagagtaaggacaatttaaggctcctgaaagaggattcaaccagtcactccatttct tatctggccaaactgcctaaacacaaaccccttgcatgtgacaaggtcttgggtgaggca gatagcaaagagagatcagtagtggaggaagcactggatggacagcctcctgcagattat tccctgggacatgacctaacagagggtgggcacttcaactgctgttgggaactgggcgat gaccactctagctacttcctgctggatgggtgtgaagaaggggccctgctgttgtag >gi568815586f:92324681_92525043|GENSCAN_predicted_peptide_6|86_aa MHRKLPGEDILWDAIAQKMTHVKLCKWSHQSVSHFVCIKEETNLSERQSSEYLLPSRFEK VSQSFLLFASSIFKCRRLQLVPKSLL >gi568815586f:92324681_92525043|GENSCAN_predicted_CDS_6|261_bp atgcatagaaaactcccaggtgaggacattctctgggatgctatagcccagaagatgaca catgtcaagctctgcaagtggtcccatcaaagtgtctctcattttgtttgcattaaggaa gaaacaaacttatcggaaaggcagagttctgagtatctgctaccgtcccggtttgaaaaa gtctctcaaagcttcctcctctttgcctcttccattttcaaatgccggagactccaactt gtgccaaagagccttctttga >gi568815586f:92324681_92525043|GENSCAN_predicted_peptide_7|222_aa VYANVGGKTLIGHTPRYIKTLQQMYGNASALGLGEIAAPAPGGKDSLLIKGVTVSAACTE SEYLRERNIEFISPKEAAMPWVSQLQKVQLHGTELLSGNKDAVPNLKKMHLVSILHTCES RRYHPYMHTVLSAGDALSKWQLAIPLFERQSDTMAWGACLVDVLQFLSNSAAEALLGDTP SSGIYFPDSQFSVDTHADEPKNVCLSEINTAFLPAAHRVAGT >gi568815586f:92324681_92525043|GENSCAN_predicted_CDS_7|669_bp gtgtatgctaatgtgggaggaaaaacactgattggccatactccaagatacatcaaaaca ctccagcagatgtatgggaatgcatccgccctcggactgggtgagatagcagccccagct ccaggtggaaaggacagcttactcatcaagggagttacagtgtctgcagcctgtactgag tcagagtatctgagagagagaaacatcgagtttatttcccctaaggaagcagccatgccg tgggtctcacagctgcagaaagttcagcttcatgggactgagctactttcaggaaataag gatgcagttcctaatctgaaaaaaatgcacctagtttctatcttgcatacttgtgagtct cggcgatatcatccatatatgcacacagttctcagtgcaggagatgccctcagcaaatgg caactagccattcctctatttgaaagacagtcagacacaatggcctggggtgcttgcctt gtggatgttctgcagtttctaagcaattctgctgcagaagcactactgggggacaccccc tcttcagggatctacttccccgatagccagttcagcgtggatactcatgccgatgagcct aagaatgtttgcctgtcagagataaacactgcatttctgcctgctgcacacagagttgct ggaacctga