GENSCAN 1.0 Date run: 6-Nov-116 Time: 23:14:06 Sequence gi568815586r:91004168_91208979 : 204812 bp : 34.64% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 3291 3358 68 1 2 116 44 67 0.393 2.22 1.02 PlyA + 4685 4690 6 1.05 2.03 PlyA - 4743 4738 6 1.05 2.02 Term - 47351 47179 173 0 2 93 41 136 0.751 6.41 2.01 Init - 52114 51229 886 1 1 65 106 430 0.492 37.12 2.00 Prom - 54993 54954 40 -3.15 3.00 Prom + 57283 57322 40 -4.95 3.01 Init + 73290 73308 19 2 1 77 110 11 0.933 2.51 3.02 Term + 74232 74392 161 1 2 24 37 322 0.988 17.82 3.03 PlyA + 75244 75249 6 1.05 4.03 PlyA - 75585 75580 6 1.05 4.02 Term - 84548 84453 96 2 0 12 45 105 0.181 -4.41 4.01 Init - 87677 87483 195 2 0 65 76 172 0.788 12.68 4.00 Prom - 97902 97863 40 -6.35 5.03 PlyA - 98739 98734 6 1.05 5.02 Term - 100152 99998 155 1 2 99 34 162 0.990 9.00 5.01 Init - 104812 103951 862 1 1 66 89 568 0.978 49.50 5.00 Prom - 105970 105931 40 -5.45 6.10 PlyA - 106158 106153 6 1.05 6.09 Term - 116628 116522 107 1 2 82 38 61 0.048 -1.91 6.08 Intr - 137422 137283 140 0 2 76 64 115 0.067 7.09 6.07 Intr - 142085 141895 191 2 2 99 -2 120 0.068 1.56 6.06 Intr - 147625 147487 139 0 1 111 95 78 0.991 10.35 6.05 Intr - 149022 148929 94 1 1 88 -3 105 0.949 -0.50 6.04 Intr - 153021 152908 114 1 0 67 54 204 0.980 14.40 6.03 Intr - 154342 154129 214 1 1 53 60 187 0.987 9.77 6.02 Intr - 160550 160438 113 0 2 70 76 130 0.324 9.18 6.01 Init - 174385 174175 211 1 1 64 92 273 0.993 22.29 6.00 Prom - 203070 203031 40 -3.35 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:91004168_91208979|GENSCAN_predicted_peptide_1|22_aa XKVSRHCEDSPVLPALAPDFKR >gi568815586r:91004168_91208979|GENSCAN_predicted_CDS_1|69_bp ncaaaggtgagcagacactgtgaggacagccccgtgctgcctgcccttgcccctgacttc aaaaggtga >gi568815586r:91004168_91208979|GENSCAN_predicted_peptide_2|352_aa MAGTICFIMWVLFITDTVWSRSVRQVYEVHDSDDWTIHDFECPMECFCPPSFPTALYCEN RGLKEIPAIPSRIWYLYLQNNLIETIPEKPFENATQLRWINLNKNKITNYGIEKGALSQL KKLLFLFLEDNELEEVPSPLPRSLEQLQLARNKVSRIPQGTFSNLENLTLLDLQNNKLVD NAFQRDTFKGLKNLMQLNMAKNALRNMPPRLPANTMQLFLDNNSIEGIPENYFNVIPKVA FLRLNHNKLSDEGLPSRGFDVSSILDLQLSHNQLTKVPRISAHLQHLHLDHNKIKSVNVS VICPSPSMLPAERDSFSYGPHLRYLRLDGNEIKPPIPMALMTCFRLLQAVII >gi568815586r:91004168_91208979|GENSCAN_predicted_CDS_2|1059_bp atggcaggcacaatctgtttcatcatgtgggtgttattcataacagacactgtgtggtct agaagtgtgaggcaggtctatgaagtacatgattcagatgattggactattcatgacttc gagtgtcccatggaatgtttctgcccacccagttttcctactgctttatattgtgaaaat agaggtctcaaagaaattcctgctattccttcaagaatttggtatctttatcttcaaaac aacctgatagaaaccattcctgaaaagccatttgagaatgccacccagctaagatggata aatctaaacaagaacaaaataaccaactacggaattgaaaaaggagccctaagccagctg aagaagttgctcttcttatttctggaagataatgagctagaggaggtaccttctccattg ccaagaagtttagaacaattacaattagctagaaataaggtgtccagaattcctcaaggg acctttagcaatctggagaacctgacccttcttgacctacagaacaacaaattagtggac aatgcctttcaaagagacacttttaaaggactcaagaatctcatgcagctaaacatggcc aagaatgccctgaggaatatgcctccaagattaccagccaatacaatgcagttgttttta gacaacaattccattgaaggaataccagaaaattattttaatgtgattcctaaagtggcc tttttgagactaaatcacaacaaactgtcagatgagggtctcccatcaagaggatttgat gtatcatcaattctagatcttcaactgtcgcacaatcaactcacaaaggttccccgaatc agtgctcatctgcagcaccttcaccttgatcataacaaaattaaaagtgtgaatgtctct gtaatatgtcccagcccatccatgctgcctgcagaacgagattccttcagttatggacct catcttcgctacctccgtctggatggaaatgaaatcaaaccaccaattccaatggcttta atgacctgcttcagacttctgcaggctgtcattatttaa >gi568815586r:91004168_91208979|GENSCAN_predicted_peptide_3|59_aa MRSLKKVLDKDESKDEDEEKGQEEKEEEEKGEKEEEKEEEENKNKNTVKKQPSPPKLTL >gi568815586r:91004168_91208979|GENSCAN_predicted_CDS_3|180_bp atgaggtcactgaaaaaggtccttgataaagatgaaagcaaagatgaagatgaggaaaaa ggacaggaagaaaaggaggaggaggagaagggagagaaagaggaggaaaaggaggaagag gagaataagaataagaatacagtgaagaagcagccatcaccaccaaaactcacactttag >gi568815586r:91004168_91208979|GENSCAN_predicted_peptide_4|96_aa MELPVKKGQLLSLRFGRISHPSLLALEIPNGPEKKGSPTKQHSCSTNKQPECFFNWVPDP VPHYRASIILTPKPGRDTTTTTTTTTTTTKTSGPYP >gi568815586r:91004168_91208979|GENSCAN_predicted_CDS_4|291_bp atggagctcccagtgaagaaggggcagctgctatctctgaggtttggtcgaatcagccat cccagcctgctagctttggagattccaaatggtccagaaaagaaagggtcccccacaaag cagcacagctgctctaccaataaacaaccagagtgcttctttaactgggtccctgatcct gttcctcattacagggccagcatcatcctgacaccaaagcctggcagagatacaacaaca acaacaacaacaacaacaacaacaacaaaaacttcaggcccatatccctga >gi568815586r:91004168_91208979|GENSCAN_predicted_peptide_5|338_aa MSLSAFTLFLALIGGTSGQYYDYDFPLSIYGQSSPNCAPECNCPESYPSAMYCDELKLKS VPMVPPGIKYLYLRNNQIDHIDEKAFENVTDLQWLILDHNLLENSKIKGRVFSKLKQLKK LHINHNNLTESVGPLPKSLEDLQLTHNKITKLGSFEGLVNLTFIHLQHNRLKEDAVSAAF KGLKSLEYLDLSFNQIARLPSGLPVSLLTLYLDNNKISNIPDEYFKRFNALQYLRLSHNE LADSGIPGNSFNVSSLVELDLSYNKLKNIPTVNENLENYYLEVNQLEKFDIKSFCKILGP LSYSKIKHLRLDGNRISETSLPPDMYECLRVANEVTLN >gi568815586r:91004168_91208979|GENSCAN_predicted_CDS_5|1017_bp atgagtctaagtgcatttactctcttcctggcattgattggtggtaccagtggccagtac tatgattatgattttcccctatcaatttatgggcaatcatcaccaaactgtgcaccagaa tgtaactgccctgaaagctacccaagtgccatgtactgtgatgagctgaaattgaaaagt gtaccaatggtgcctcctggaatcaagtatctttaccttaggaataaccagattgaccat attgatgaaaaggcctttgagaatgtaactgatctgcagtggctcattctagatcacaac cttctagaaaactccaagataaaagggagagttttctctaaattgaaacaactgaagaag ctgcatataaaccacaacaacctgacagagtctgtgggcccacttcccaaatctctggag gatctgcagcttactcataacaagatcacaaagctgggctcttttgaaggattggtaaac ctgaccttcatccatctccagcacaatcggctgaaagaggatgctgtttcagctgctttt aaaggtcttaaatcactcgaataccttgacttgagcttcaatcagatagccagactgcct tctggtctccctgtctctcttctaactctctacttagacaacaataagatcagcaacatc cctgatgagtatttcaagcgttttaatgcattgcagtatctgcgtttatctcacaacgaa ctggctgatagtggaatacctggaaattctttcaatgtgtcatccctggttgagctggat ctgtcctataacaagcttaaaaacataccaactgtcaatgaaaaccttgaaaactattac ctggaggtcaatcaacttgagaagtttgacataaagagcttctgcaagatcctggggcca ttatcctactccaagatcaagcatttgcgtttggatggcaatcgcatctcagaaaccagt cttccaccggatatgtatgaatgtctacgtgttgctaacgaagtcactcttaattaa >gi568815586r:91004168_91208979|GENSCAN_predicted_peptide_6|440_aa MKATIILLLLAQVSWAGPFQQRGLFDFMLEDEASGIGPEVPDDRDFEPSLGPVCPFRCQC HLRVVQCSDLGLDKVPKDLPPDTTLLDLQNNKITEIKDGDFKNLKNLHALILVNNKISKV SPGAFTPLVKLERLYLSKNQLKELPEKMPKTLQELRAHENEITKVRKVTFNGLNQMIVIE LGTNPLKSSGIENGAFQGMKKLSYIRIADTNITSIPQGLPPSLTELHLDGNKISRVDAAS LKGLNNLAKLGLSFNSISAVDNGSLANTPHLRELHLDNNKLTRVPGGLAEHKYIQVVYLH NNNISVVGSSDFCPPGHNTKKASYSGVSLFSNPVQYWEIQPSTFRCVYVRSAIQLGNYNV LVAETQQCIMPCPLKQMLQQEETFQDMTIKVEECQAELGNKGCEGEGITERLPGRLSTYY DNLGSQCSYAPDLPVTCQPH >gi568815586r:91004168_91208979|GENSCAN_predicted_CDS_6|1323_bp atgaaggccactatcatcctccttctgcttgcacaagtttcctgggctggaccgtttcaa cagagaggcttatttgactttatgctagaagatgaggcttctgggataggcccagaagtt cctgatgaccgcgacttcgagccctccctaggcccagtgtgccccttccgctgtcaatgc catcttcgagtggtccagtgttctgatttgggtctggacaaagtgccaaaggatcttccc cctgacacaactctgctagacctgcaaaacaacaaaataaccgaaatcaaagatggagac tttaagaacctgaagaaccttcacgcattgattcttgtcaacaataaaattagcaaagtt agtcctggagcatttacacctttggtgaagttggaacgactttatctgtccaagaatcag ctgaaggaattgccagaaaaaatgcccaaaactcttcaggagctgcgtgcccatgagaat gagatcaccaaagtgcgaaaagttactttcaatggactgaaccagatgattgtcatagaa ctgggcaccaatccgctgaagagctcaggaattgaaaatggggctttccagggaatgaag aagctctcctacatccgcattgctgataccaatatcaccagcattcctcaaggtcttcct ccttcccttacggaattacatcttgatggcaacaaaatcagcagagttgatgcagctagc ctgaaaggactgaataatttggctaagttgggattgagtttcaacagcatctctgctgtt gacaatggctctctggccaacacgcctcatctgagggagcttcacttggacaacaacaag cttaccagagtacctggtgggctggcagagcataagtacatccaggttgtctaccttcat aacaacaatatctctgtagttggatcaagtgacttctgcccacctggacacaacaccaaa aaggcttcttattcgggtgtgagtcttttcagcaacccggtccagtactgggagatacag ccatccaccttcagatgtgtctacgtgcgctctgccattcaactcggaaactataatgtt ctagtggcagagacacagcagtgcataatgccttgccctctgaagcagatgttgcagcag gaggagacattccaggacatgacaattaaagtggaggaatgtcaagcagaactggggaat aagggatgtgaaggggagggcattactgaaagattacctgggagactatccacatactac gacaaccttggttcacagtgcagttatgctcctgatcttcctgtaacttgtcagccccat taa