GENSCAN 1.0    Date run: 5-Nov-116    Time: 13:56:09

Sequence gi568815578r:52052149_52291636 : 239488 bp : 43.06% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.00 Prom +    225    264   40                               -1.26
 1.01 Init +   3033   3035    3  2  0   98   95     0 0.185   1.70
 1.02 Intr +   9095   9150   56  1  2   70  100    45 0.033   1.68
 1.03 Term +  29886  29955   70  0  1  104   48    62 0.206   1.21
 1.04 PlyA +  31200  31205    6                                1.05

 2.07 PlyA -  31886  31881    6                                1.05
 2.06 Term -  33118  32409  710  2  2   49   36   928 0.998  77.37
 2.05 Intr -  36608  36244  365  1  2   58  100   396 0.754  32.63
 2.04 Intr -  40286  40157  130  0  1   65   68    55 0.267   0.95
 2.03 Intr -  45287  45225   63  0  0   87   77    35 0.606   0.99
 2.02 Intr -  46439  46290  150  0  0   34  110   122 0.811   9.13
 2.01 Init -  53089  52984  106  1  1   88  100   124 0.994  14.01
 2.00 Prom -  53927  53888   40                               -6.66

 3.03 PlyA -  55238  55233    6                                1.05
 3.02 Term -  58894  58531  364  0  1   55   37   449 0.887  30.54
 3.01 Init -  96083  96055   29  2  2  101  121   -22 0.025   1.62
 3.00 Prom -  97076  97037   40                               -3.06

 4.07 PlyA -  97517  97512    6                                1.05
 4.06 Term - 101280  99998 1283  1  2   80   39  1469 0.996 132.96
 4.05 Intr - 108226 107975  252  2  0   52  100   419 0.999  36.91
 4.04 Intr - 112609 112547   63  2  0  131  114    68 0.999  12.39
 4.03 Intr - 113871 113716  156  1  0   44  103    76 0.788   4.68
 4.02 Intr - 134917 134684  234  2  0   27   92   337 0.885  25.66
 4.01 Init - 139488 139443   46  0  1   98  106   112 0.910  14.85
 4.00 Prom - 139580 139541   40                               -5.66

 5.06 PlyA - 140852 140847    6                                1.05
 5.05 Term - 148651 148529  123  1  0   48   32    96 0.404  -1.62
 5.04 Intr - 148851 148783   69  0  0   64  100    20 0.223   0.18
 5.03 Intr - 158532 158413  120  0  0   72   94   142 0.913  13.89
 5.02 Intr - 166164 166049  116  1  2   31   71    73 0.039   0.07
 5.01 Init - 169700 169289  412  2  1   60   -8   220 0.033   6.28
 5.00 Prom - 189367 189328   40                               -3.06

 6.00 Prom + 192381 192420   40                               -5.26
 6.01 Init + 196643 196703   61  1  1   98   79    37 0.844   5.21
 6.02 Term + 196941 197167  227  1  2   84   36   103 0.934   1.64
 6.03 PlyA + 197874 197879    6                                1.05

 7.03 PlyA - 198166 198161    6                                1.05
 7.02 Term - 200551 200384  168  1  0   39   44   133 0.702   1.88
 7.01 Init - 234224 234195   30  2  0   99   72    28 0.313   2.04
 7.00 Prom - 237456 237417   40                               -1.86

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl +  80363  80626  264  1  0   51   41   182 0.973   5.00

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815578r:52052149_52291636|GENSCAN_predicted_peptide_1|42_aa
MLFPDSQWKPWPKSFSDILSCITVAELEPLCYNSSGNRADNK

>gi568815578r:52052149_52291636|GENSCAN_predicted_CDS_1|129_bp
atgcttttccctgacagccagtggaagccatggcccaagtcatttagtgacatcttaagc
tgcattacagtggccgagttggaacctctgtgctacaacagcagtggcaacagagctgat
aacaagtag

>gi568815578r:52052149_52291636|GENSCAN_predicted_peptide_2|507_aa
MSRRKQAKPQHLNSEEPRPARRECAEVAPQVAGEPASELDDDVPKANCLSTESTDTPKAP
VITLPSEAREQMATLGERTFNCCYPGCHFKTVHGMKDLDRHLRIHTVPFCFSKVYWNGFQ
LGMISPPKGTMSGDIFDCYNWRRKGSTGICIWTGGGGTPICPGYHLGQIFVAILTKKPLT
LTPYVFEGDKPHKCEFCDKCFSRKDNLTMHMRCHTSVKPHKCHLCDYAAVDSSSLKKHLR
IHSDERPYKCQLCPYASRNSSQLTVHLRSHTGDTPFQCWLCSAKFKISSDLKRHMIVHSG
EKPFKCEFCDVRCTMKANLKSHIRIKHTFKCLHCAFQGRDRADLLEHSRLHQADHPEKCP
ECSYSCSSAAALRVHSRVHCKDRPFKCDFCSFDTKRPSSLAKHVDKVHRDEAKTENRAPL
GKEGLREGSSQHVAKIVTQRAFRCETCGASFVRDDSLRCHKKQHSDQSENKNSDLVTFPP
ESGASGQLSTLVSVGQLEAPLEPSQDL

>gi568815578r:52052149_52291636|GENSCAN_predicted_CDS_2|1524_bp
atgtcgcggcgcaagcaggccaagccccagcacctcaactccgaggagccgcggcctgcg
cgccgggagtgtgcggaggtggccccgcaggtggcgggggagccggcttcagaacttgat
gatgatgttccaaaagcaaactgcctctccactgaaagcactgacactccgaaggcccct
gtcatcactcttccctcagaggcaagggaacaaatggccacccttggagagaggacgttc
aactgttgctacccaggttgccacttcaaaactgtccatggcatgaaagacttggaccgc
catctcagaatccacacggtccccttctgcttttccaaagtctattggaatggttttcaa
ctaggtatgatttcaccccccaagggaacaatgtctggagacatcttcgattgttacaac
tggagaagaaagggttctactggcatctgtatttggacaggaggaggcggcacacccata
tgcccaggctatcatcttggtcaaatctttgtagccatcttgaccaaaaaacctttaact
ttgactccttatgtgtttgaaggagacaaaccgcacaagtgtgagttctgtgacaagtgc
ttcagccggaaggacaacctgaccatgcacatgcggtgccacaccagtgtgaagccacac
aagtgtcacctgtgtgactacgctgccgtggacagcagtagcctcaagaagcacctgcgg
atccactctgatgagcggccgtacaaatgccagctctgcccctatgccagccgcaactcc
agccagctcaccgtccacctgcgatctcacacgggggatacccccttccagtgctggctc
tgtagcgccaagttcaaaatcagctcggacttgaaaaggcacatgatcgtgcactcgggg
gagaagcctttcaagtgcgagttctgcgacgtccgctgcaccatgaaggcgaatctcaaa
tcgcacatccgcatcaagcacaccttcaaatgtctgcactgtgccttccagggccgggac
cgggctgacctcctggagcacagccggctgcaccaggccgaccacccggagaagtgtcca
gagtgcagctactcctgctccagcgcggccgccctgcgcgtgcacagcagagtccactgt
aaggaccgtcccttcaagtgtgacttctgcagcttcgacacgaagcggcccagcagcctg
gccaagcacgtcgacaaggtgcacagggacgaggccaagacggagaaccgggcccctctg
ggcaaggaagggctcagagagggcagctcccagcacgtggccaagatcgtgacgcagagg
gccttccgctgtgagacctgcggcgcctccttcgtcagggatgactctctgagatgccac
aagaagcagcacagtgatcagagtgagaacaagaactcagacttggtcaccttcccaccg
gaaagcggtgcctcgggacagctcagcaccctggtctccgtggggcagctcgaggctccc
ctagagcccagccaagacctctag

>gi568815578r:52052149_52291636|GENSCAN_predicted_peptide_3|130_aa
MTYLIILALSSGLHTKRALPLDTVTFYKVIPKSRFVLVKFNTQYRYGEKQDEFRRLAENS
ASSDDLLVAEVGISDYGDKLNMELSEKYKLLKESYPVFYLSREGDFENPVPCTGAVKVGA
IQRWLKGQGV

>gi568815578r:52052149_52291636|GENSCAN_predicted_CDS_3|393_bp
atgacatatttgattatactagcattaagcagcggcctgcacaccaagcgcgcccttccc
ctggatacggtcactttctacaaggtcattcccaaaagcaggttcgtcttggtgaagttc
aacacccagtaccgctacggtgagaagcaggatgagttcaggcgtcttgctgaaaactcg
gcttccagcgatgatctcttggtggcagaggtggggatctcagattatggtgacaagctg
aacatggagctgagtgagaaatacaagctgctcaaagagagctacccagtcttctacctc
tcccgggagggggactttgagaacccagtcccatgtactggggcagttaaggttggagcc
atccagcgctggctgaaggggcaaggggtctag

>gi568815578r:52052149_52291636|GENSCAN_predicted_peptide_4|677_aa
MNASSEGESFAGSVQSGTTVLVELTPDIHICGICKQQFNNLDAFVAHKQSGCQLTGTSAA
APSTVQFVSEETVPATQTQTTTRTITSETQTITAPEFVFEHGYQTYLPTESNENQTATVI
SLPAKSRTKKPTTPPAQKRLNCCYPGCQFKTAYGMKDMERHLKIHTGDKPHKCEVCGKCF
SRKDKLKTHMRCHTGVKPYKCKTCDYAAADSSSLNKHLRIHSDERPFKCQICPYASRNSS
QLTVHLRSHTGDAPFQCWLCSAKFKISSDLKRHMRVHSGEKPFKCEFCNVRCTMKGNLKS
HIRIKHSGNNFKCPHCDFLGDSKATLRKHSRVHQSEHPEKCSECSYSCSSKAALRIHERI
HCTDRPFKCNYCSFDTKQPSNLSKHMKKFHGDMVKTEALERKDTGRQSSRQVAKLDAKKS
FHCDICDASFMREDSLRSHKRQHSEYSESKNSDVTVLQFQIDPSKQPATPLTVGHLQVPL
QPSQVPQFSEGRVKIIVGHQVPQANTIVQAAAAAVNIVPPALVAQNPEELPGNSRLQILR
QVSLIAPPQSSRCPSEAGAMTQPAVLLTTHEQTDGATLHQTLIPTASGGPQEGSGNQTFI
TSSGITCTDFEGLNALIQEGTAEVTVVSDGGQNIAVATTAPPVFSSSSQQELPKQTYSII
QGAAHPALLCPADSIPD

>gi568815578r:52052149_52291636|GENSCAN_predicted_CDS_4|2034_bp
atgaacgcgagcagcgagggcgagagcttcgcgggctcggtgcaaagtggcacaacggtg
ctggtggagctgactcccgacatccatatctgcggcatctgcaagcagcagtttaacaac
ctggatgcctttgtagctcacaagcaaagtggctgccagctgacaggcacatccgcagca
gcccccagcacggtccagtttgtatcggaggaaacagtgcctgccacccagactcagacc
accaccagaaccatcacctcggagacccagacaatcacagctccagaatttgtttttgaa
catggctatcaaacttacctgcccacggaaagtaatgaaaaccagacagccactgtcatc
tctctccctgccaagtcacgcaccaaaaagcccacaacaccacctgctcagaaaaggctt
aactgttgctatccaggttgccaattcaagactgcttatggcatgaaggacatggagcgg
catttaaaaattcacacgggagacaaaccccataagtgtgaagtctgtggcaagtgcttt
agccggaaagacaagctgaaaactcacatgcggtgccacacgggcgtgaagccctacaag
tgtaagacgtgtgactacgccgctgccgacagcagcagcctcaacaagcacctgaggatc
cactcggacgagcggcccttcaaatgccagatctgcccctacgccagccgcaactccagc
cagctcactgtccacctgcgatcccacacgggggacgcccccttccagtgctggctctgt
agcgccaagttcaaaatcagctcggacttgaaaaggcacatgcgggtgcactcgggggag
aagcctttcaagtgcgagttctgcaatgtccgctgcaccatgaaggggaacctcaagtcg
cacatccgtatcaagcacagcgggaataacttcaagtgtcctcattgcgacttcctgggt
gacagcaaagccaccctccggaagcacagccgcgtgcaccagtcggagcatcctgagaag
tgctcggaatgcagctactcctgctccagcaaggccgccctgcgcatccacgagcgtatc
cactgcaccgaccgccctttcaagtgcaactactgcagcttcgacaccaaacagcccagc
aacctgagcaagcacatgaagaagttccatggggacatggttaagactgaggctctagag
aggaaggacaccggcaggcagagcagccggcaggtggccaagctggatgccaagaagagt
ttccactgcgatatatgcgatgcctccttcatgcgggaggactcgctccgcagccacaag
agacagcacagtgagtacagtgagagtaagaactcggacgtgaccgttctccagtttcag
atcgaccccagcaagcagcccgccacgcccctcactgtgggacacctccaggtgcccctc
cagcccagccaagtgccccagttcagcgagggaagagtcaaaatcatcgttgggcatcag
gtgccccaggcgaacaccatcgtccaggctgccgccgctgcagtgaacatcgtcccgcct
gccttggtggcccagaacccagaggaactcccagggaacagccggctgcagatcctgcgc
caggtcagtctgatcgccccccctcagtcctcgcggtgtccgagcgaggcgggcgcaatg
acccagccggctgtcctgctgaccacccacgagcagacggacggagccactctgcaccag
actctcatccccacggcctcaggtggcccccaggaaggctctggcaatcaaactttcatt
accagttcgggtattacttgcactgactttgaaggcctaaacgccttgattcaggagggg
acagcagaagtgacagtggtgagcgatggaggccagaacatcgcagtggccaccacagcg
ccaccggtcttctcctcctcttcccagcaagaactacccaagcagacctactccatcatt
caaggggcagcccatccagctttgctctgtcccgccgactccattccagattag

>gi568815578r:52052149_52291636|GENSCAN_predicted_peptide_5|279_aa
MWESLGFPRGLLNGFDQNAVSDMDKEVQAEVVSDGDEEVVRNWSKGHSCYALAKRLVAFC
PCPRDLWNFELERNDLRLELMFKMEAEHKSLENLPDDAIEKKNPFSGEKFKLAAEICKNN
KEPNVTRQYNGENVFKACTDSTTMHSQQYLYEKSRNQSRCHSTPEKLKAKNATLKQTQSP
GARRLAPALCVIKTAEAEESWPEDTYPPKTKREAVWGRWQGERQREEVVFSGTTVTRQKE
ARSPQNPGYKKQGSASLTNSRIKGRITEQQPLDQTTSLR

>gi568815578r:52052149_52291636|GENSCAN_predicted_CDS_5|840_bp
atgtgggaaagtttgggatttcctagaggcttgttgaatggttttgaccaaaatgctgtt
agtgacatggacaaggaagtccaggctgaggtggtctcagatggagatgaggaagttgtt
aggaactggagtaaaggtcactcttgctatgctttagcaaagagactggtggcattttgc
ccctgccctagagatctgtggaactttgaacttgagagaaatgatctgagattggaactt
atgtttaaaatggaagcagagcataaaagtttggaaaatttgcctgatgacgcaatagaa
aagaaaaacccattttctggagagaaattcaagctggctgcagaaatttgcaaaaataac
aaggagccaaatgttactcgccaatacaatggagaaaatgtcttcaaggcatgtactgac
tcaacaacaatgcacagtcaacaatacctttatgagaaatctagaaaccaatccagatgt
cacagtaccccagagaagctcaaagccaagaatgccacactgaaacaaacccagtcgcct
ggcgcaagacgacttgcgccagcactgtgcgtcatcaagacagcagaagcggaagagagc
tggccggaagacacgtacccgccgaagaccaagagagaggccgtctggggaaggtggcag
ggagagagacaaagggaggaggtggtgttttcagggaccacagtcacccggcagaaggaa
gctagaagcccccagaaccctggatacaagaaacaagggtcagcgtccctgacaaatagc
agaatcaagggaagaatcacagagcaacagcctctggaccagaccaccagccttcgataa

>gi568815578r:52052149_52291636|GENSCAN_predicted_peptide_6|95_aa
MEDVPFAFCHDCKGFPAMCNILNQLRENRKSQRDGDKNIIFITDKSFIGGCSLREYPHGF
IIAHPGKSYACGRLLPQRLLGSGAEKKGSKLQIAF

>gi568815578r:52052149_52291636|GENSCAN_predicted_CDS_6|288_bp
atggaagatgtgccttttgccttctgccatgattgtaagggcttcccagccatgtgtaac
attttgaatcaattgagagagaaccgcaaaagtcagagagatggtgataaaaacatcatc
ttcattacggataaaagttttatcggaggatgtagcttacgagagtatccacatgggttc
attatagcccacccaggcaaaagttacgcttgtggaagattgttgcctcaaagattgctt
ggttctggggcagagaagaaagggagcaaattgcaaatagccttttag

>gi568815578r:52052149_52291636|GENSCAN_predicted_peptide_7|65_aa
MNKREHEGNKSNKCLQHGIQEKYRFTFTNKMVEFDSLSLETASMEQKWECDFSVVIQTSP
GMMPA

>gi568815578r:52052149_52291636|GENSCAN_predicted_CDS_7|198_bp
atgaacaagagggagcatgaaggaaacaagtccaacaagtgtttacagcacgggatccag
gagaagtacagattcacattcactaacaagatggtagagtttgactccctgtctttggaa
acagccagcatggaacagaagtgggaatgtgatttctcagtagtcatacagacgagccct
ggaatgatgccggcctga