GENSCAN 1.0 Date run: 4-Nov-116 Time: 23:11:56 Sequence gi568815586r:91046061_91278552 : 232492 bp : 34.90% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 18 13 6 1.05 1.02 Term - 5458 5286 173 2 2 93 41 136 0.737 6.41 1.01 Init - 10221 9336 886 0 1 65 106 430 0.492 37.12 1.00 Prom - 13100 13061 40 -3.15 2.00 Prom + 15390 15429 40 -4.95 2.01 Init + 31397 31415 19 1 1 77 110 11 0.933 2.51 2.02 Term + 32339 32499 161 0 2 24 37 322 0.988 17.82 2.03 PlyA + 33351 33356 6 1.05 3.03 PlyA - 33692 33687 6 1.05 3.02 Term - 42655 42560 96 1 0 12 45 105 0.181 -4.41 3.01 Init - 45784 45590 195 1 0 65 76 172 0.788 12.68 3.00 Prom - 56009 55970 40 -6.35 4.03 PlyA - 56846 56841 6 1.05 4.02 Term - 58259 58105 155 0 2 99 34 162 0.990 9.00 4.01 Init - 62919 62058 862 0 1 66 89 568 0.978 49.50 4.00 Prom - 64077 64038 40 -5.45 5.10 PlyA - 64265 64260 6 1.05 5.09 Term - 74735 74629 107 0 2 82 38 61 0.048 -1.91 5.08 Intr - 95529 95390 140 2 2 76 64 115 0.067 7.09 5.07 Intr - 100192 100002 191 1 2 99 -2 120 0.068 1.56 5.06 Intr - 105732 105594 139 2 1 111 95 78 0.991 10.35 5.05 Intr - 107129 107036 94 0 1 88 -3 105 0.949 -0.50 5.04 Intr - 111128 111015 114 0 0 67 54 204 0.980 14.40 5.03 Intr - 112449 112236 214 0 1 53 60 187 0.987 9.77 5.02 Intr - 118657 118545 113 2 2 70 76 130 0.324 9.18 5.01 Init - 132492 132282 211 0 1 64 92 273 0.993 22.29 5.00 Prom - 150423 150384 40 -3.65 6.00 Prom + 160409 160448 40 -4.05 6.01 Init + 187394 188017 624 1 0 35 86 211 0.198 10.99 6.02 Intr + 196961 197258 298 1 1 97 68 158 0.333 10.22 6.03 Intr + 197493 197754 262 1 1 -26 72 224 0.121 5.02 6.04 Intr + 204925 204953 29 1 2 95 103 19 0.197 0.94 6.05 Term + 221014 221036 23 2 2 129 42 30 0.128 -0.00 6.06 PlyA + 221205 221210 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:91046061_91278552|GENSCAN_predicted_peptide_1|352_aa MAGTICFIMWVLFITDTVWSRSVRQVYEVHDSDDWTIHDFECPMECFCPPSFPTALYCEN RGLKEIPAIPSRIWYLYLQNNLIETIPEKPFENATQLRWINLNKNKITNYGIEKGALSQL KKLLFLFLEDNELEEVPSPLPRSLEQLQLARNKVSRIPQGTFSNLENLTLLDLQNNKLVD NAFQRDTFKGLKNLMQLNMAKNALRNMPPRLPANTMQLFLDNNSIEGIPENYFNVIPKVA FLRLNHNKLSDEGLPSRGFDVSSILDLQLSHNQLTKVPRISAHLQHLHLDHNKIKSVNVS VICPSPSMLPAERDSFSYGPHLRYLRLDGNEIKPPIPMALMTCFRLLQAVII >gi568815586r:91046061_91278552|GENSCAN_predicted_CDS_1|1059_bp atggcaggcacaatctgtttcatcatgtgggtgttattcataacagacactgtgtggtct agaagtgtgaggcaggtctatgaagtacatgattcagatgattggactattcatgacttc gagtgtcccatggaatgtttctgcccacccagttttcctactgctttatattgtgaaaat agaggtctcaaagaaattcctgctattccttcaagaatttggtatctttatcttcaaaac aacctgatagaaaccattcctgaaaagccatttgagaatgccacccagctaagatggata aatctaaacaagaacaaaataaccaactacggaattgaaaaaggagccctaagccagctg aagaagttgctcttcttatttctggaagataatgagctagaggaggtaccttctccattg ccaagaagtttagaacaattacaattagctagaaataaggtgtccagaattcctcaaggg acctttagcaatctggagaacctgacccttcttgacctacagaacaacaaattagtggac aatgcctttcaaagagacacttttaaaggactcaagaatctcatgcagctaaacatggcc aagaatgccctgaggaatatgcctccaagattaccagccaatacaatgcagttgttttta gacaacaattccattgaaggaataccagaaaattattttaatgtgattcctaaagtggcc tttttgagactaaatcacaacaaactgtcagatgagggtctcccatcaagaggatttgat gtatcatcaattctagatcttcaactgtcgcacaatcaactcacaaaggttccccgaatc agtgctcatctgcagcaccttcaccttgatcataacaaaattaaaagtgtgaatgtctct gtaatatgtcccagcccatccatgctgcctgcagaacgagattccttcagttatggacct catcttcgctacctccgtctggatggaaatgaaatcaaaccaccaattccaatggcttta atgacctgcttcagacttctgcaggctgtcattatttaa >gi568815586r:91046061_91278552|GENSCAN_predicted_peptide_2|59_aa MRSLKKVLDKDESKDEDEEKGQEEKEEEEKGEKEEEKEEEENKNKNTVKKQPSPPKLTL >gi568815586r:91046061_91278552|GENSCAN_predicted_CDS_2|180_bp atgaggtcactgaaaaaggtccttgataaagatgaaagcaaagatgaagatgaggaaaaa ggacaggaagaaaaggaggaggaggagaagggagagaaagaggaggaaaaggaggaagag gagaataagaataagaatacagtgaagaagcagccatcaccaccaaaactcacactttag >gi568815586r:91046061_91278552|GENSCAN_predicted_peptide_3|96_aa MELPVKKGQLLSLRFGRISHPSLLALEIPNGPEKKGSPTKQHSCSTNKQPECFFNWVPDP VPHYRASIILTPKPGRDTTTTTTTTTTTTKTSGPYP >gi568815586r:91046061_91278552|GENSCAN_predicted_CDS_3|291_bp atggagctcccagtgaagaaggggcagctgctatctctgaggtttggtcgaatcagccat cccagcctgctagctttggagattccaaatggtccagaaaagaaagggtcccccacaaag cagcacagctgctctaccaataaacaaccagagtgcttctttaactgggtccctgatcct gttcctcattacagggccagcatcatcctgacaccaaagcctggcagagatacaacaaca acaacaacaacaacaacaacaacaacaaaaacttcaggcccatatccctga >gi568815586r:91046061_91278552|GENSCAN_predicted_peptide_4|338_aa MSLSAFTLFLALIGGTSGQYYDYDFPLSIYGQSSPNCAPECNCPESYPSAMYCDELKLKS VPMVPPGIKYLYLRNNQIDHIDEKAFENVTDLQWLILDHNLLENSKIKGRVFSKLKQLKK LHINHNNLTESVGPLPKSLEDLQLTHNKITKLGSFEGLVNLTFIHLQHNRLKEDAVSAAF KGLKSLEYLDLSFNQIARLPSGLPVSLLTLYLDNNKISNIPDEYFKRFNALQYLRLSHNE LADSGIPGNSFNVSSLVELDLSYNKLKNIPTVNENLENYYLEVNQLEKFDIKSFCKILGP LSYSKIKHLRLDGNRISETSLPPDMYECLRVANEVTLN >gi568815586r:91046061_91278552|GENSCAN_predicted_CDS_4|1017_bp atgagtctaagtgcatttactctcttcctggcattgattggtggtaccagtggccagtac tatgattatgattttcccctatcaatttatgggcaatcatcaccaaactgtgcaccagaa tgtaactgccctgaaagctacccaagtgccatgtactgtgatgagctgaaattgaaaagt gtaccaatggtgcctcctggaatcaagtatctttaccttaggaataaccagattgaccat attgatgaaaaggcctttgagaatgtaactgatctgcagtggctcattctagatcacaac cttctagaaaactccaagataaaagggagagttttctctaaattgaaacaactgaagaag ctgcatataaaccacaacaacctgacagagtctgtgggcccacttcccaaatctctggag gatctgcagcttactcataacaagatcacaaagctgggctcttttgaaggattggtaaac ctgaccttcatccatctccagcacaatcggctgaaagaggatgctgtttcagctgctttt aaaggtcttaaatcactcgaataccttgacttgagcttcaatcagatagccagactgcct tctggtctccctgtctctcttctaactctctacttagacaacaataagatcagcaacatc cctgatgagtatttcaagcgttttaatgcattgcagtatctgcgtttatctcacaacgaa ctggctgatagtggaatacctggaaattctttcaatgtgtcatccctggttgagctggat ctgtcctataacaagcttaaaaacataccaactgtcaatgaaaaccttgaaaactattac ctggaggtcaatcaacttgagaagtttgacataaagagcttctgcaagatcctggggcca ttatcctactccaagatcaagcatttgcgtttggatggcaatcgcatctcagaaaccagt cttccaccggatatgtatgaatgtctacgtgttgctaacgaagtcactcttaattaa >gi568815586r:91046061_91278552|GENSCAN_predicted_peptide_5|440_aa MKATIILLLLAQVSWAGPFQQRGLFDFMLEDEASGIGPEVPDDRDFEPSLGPVCPFRCQC HLRVVQCSDLGLDKVPKDLPPDTTLLDLQNNKITEIKDGDFKNLKNLHALILVNNKISKV SPGAFTPLVKLERLYLSKNQLKELPEKMPKTLQELRAHENEITKVRKVTFNGLNQMIVIE LGTNPLKSSGIENGAFQGMKKLSYIRIADTNITSIPQGLPPSLTELHLDGNKISRVDAAS LKGLNNLAKLGLSFNSISAVDNGSLANTPHLRELHLDNNKLTRVPGGLAEHKYIQVVYLH NNNISVVGSSDFCPPGHNTKKASYSGVSLFSNPVQYWEIQPSTFRCVYVRSAIQLGNYNV LVAETQQCIMPCPLKQMLQQEETFQDMTIKVEECQAELGNKGCEGEGITERLPGRLSTYY DNLGSQCSYAPDLPVTCQPH >gi568815586r:91046061_91278552|GENSCAN_predicted_CDS_5|1323_bp atgaaggccactatcatcctccttctgcttgcacaagtttcctgggctggaccgtttcaa cagagaggcttatttgactttatgctagaagatgaggcttctgggataggcccagaagtt cctgatgaccgcgacttcgagccctccctaggcccagtgtgccccttccgctgtcaatgc catcttcgagtggtccagtgttctgatttgggtctggacaaagtgccaaaggatcttccc cctgacacaactctgctagacctgcaaaacaacaaaataaccgaaatcaaagatggagac tttaagaacctgaagaaccttcacgcattgattcttgtcaacaataaaattagcaaagtt agtcctggagcatttacacctttggtgaagttggaacgactttatctgtccaagaatcag ctgaaggaattgccagaaaaaatgcccaaaactcttcaggagctgcgtgcccatgagaat gagatcaccaaagtgcgaaaagttactttcaatggactgaaccagatgattgtcatagaa ctgggcaccaatccgctgaagagctcaggaattgaaaatggggctttccagggaatgaag aagctctcctacatccgcattgctgataccaatatcaccagcattcctcaaggtcttcct ccttcccttacggaattacatcttgatggcaacaaaatcagcagagttgatgcagctagc ctgaaaggactgaataatttggctaagttgggattgagtttcaacagcatctctgctgtt gacaatggctctctggccaacacgcctcatctgagggagcttcacttggacaacaacaag cttaccagagtacctggtgggctggcagagcataagtacatccaggttgtctaccttcat aacaacaatatctctgtagttggatcaagtgacttctgcccacctggacacaacaccaaa aaggcttcttattcgggtgtgagtcttttcagcaacccggtccagtactgggagatacag ccatccaccttcagatgtgtctacgtgcgctctgccattcaactcggaaactataatgtt ctagtggcagagacacagcagtgcataatgccttgccctctgaagcagatgttgcagcag gaggagacattccaggacatgacaattaaagtggaggaatgtcaagcagaactggggaat aagggatgtgaaggggagggcattactgaaagattacctgggagactatccacatactac gacaaccttggttcacagtgcagttatgctcctgatcttcctgtaacttgtcagccccat taa >gi568815586r:91046061_91278552|GENSCAN_predicted_peptide_6|411_aa MIKTLNKLGIDGTYLKITRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVL EVLARAIRQEKEIKGTQLGKEEVKLSLFADDMIVYRENPIISAQNLLKLTGNFSKVSGYK INVQKSQAFLYTCNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLSEI KEDTNKWKNIPCSWVGRINIVKMAILPKGISERKAAAPVRDLEIKLPSPWDRAPGGRGNC GRSFSRLKCSCLQPLKRAADLLAQCSSSAKGQTASSSGSLTPVPPDWETPPSRGRQTPHT GEFRLASDHQHQRPKVDKSTKMMKNQCKKAENSKNQNASSPKDHSSSPVREPNWTEKEFD ELTEVGFRRWVITNSSELKEHVLTQCKEAKSLEKSNSTTILDNSCCQEKAL >gi568815586r:91046061_91278552|GENSCAN_predicted_CDS_6|1236_bp atgataaaaactctcaataaattaggtattgatggaacatatctcaaaataacaagagct atctatgacaaacccacagccaatatcatactgaatgggcagaaactggaagcattccct ttgaaaactggcacaagacagggatgccctctctcaccactcctattcaacatagtgttg gaagttctggccagggcaatcaggcaggagaaagaaataaaaggtactcaattaggaaaa gaggaagttaaattatccctgtttgcagatgacatgattgtatatagagaaaaccccatc atctcagcccaaaatctccttaagctgacaggcaacttcagcaaagtctcaggatacaaa atcaatgtgcaaaaatcacaagcattcttatacacctgtaacagacaaacagagagccaa atcatgagtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaa cttacaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcagtgaaata aaagaggatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatt gtgaaaatggccatactgcccaagggtatctctgaaagaaaggcagcagccccagtcagg gacttagagataaaactcccatctccctgggacagagcacctgggggaaggggcaactgt gggcgcagcttcagcagacttaaatgttcctgtcttcagcctctgaagagagcagcagat ctcctagcacagtgctcgagctctgctaagggacagactgcctcctcaagtgggtccctg acccctgtgcctcctgactgggagacacctccaagcagaggtcgacagacacctcataca ggagagttccggctggcatctgatcaccaacatcaaagaccaaaggtagataaatccacc aagatgatgaaaaaccagtgcaaaaaggctgaaaattccaaaaaccagaacgcttcttct ccaaaggatcacagctcatctccagtaagagaaccaaactggacagagaaagagtttgat gaattgacagaagtaggcttcagaaggtgggtaataacaaactcctctgagctaaaggag catgttctaacccaatgcaaggaagctaagagccttgaaaaaagtaattccacgactatt ttagataactcatgctgccaagagaaagctctctaa