GENSCAN 1.0	Date run: 6-Nov-116	Time: 02:11:22

Sequence gi568815583r:63226288_63481720 : 255433 bp : 44.29% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   7902   7929   28  2  1   84   77    29 0.176   1.17
 1.02 Intr +  18469  18529   61  2  1  102  115   -15 0.031   0.39
 1.03 Intr +  23358  23418   61  0  1   95  101    33 0.040   4.04
 1.04 Intr +  29220  29297   78  2  0   90   87    69 0.853   6.75
 1.05 Intr +  30218  30307   90  1  0   95   94    62 0.973   7.69
 1.06 Intr +  33340  33405   66  0  0   95   94    55 0.889   5.90
 1.07 Term +  43599  43811  213  2  0   96   44    43 0.316  -2.07
 1.08 PlyA +  45282  45287    6                               1.05

 2.00 Prom +  50726  50765   40                              -5.26
 2.01 Init +  50913  50994   82  2  1   99   70   156 0.935  14.03
 2.02 Intr +  51245  51449  205  0  1   52   76   228 0.910  16.26
 2.03 Intr +  52874  53044  171  2  0   99  106    60 0.945   7.96
 2.04 Intr +  60271  60341   71  1  2   70  115    41 0.798   3.83
 2.05 Intr +  76058  76185  128  0  2   72  109    91 0.666  10.00
 2.06 Term +  79327  79494  168  0  0  114   37    44 0.508  -0.22
 2.07 PlyA +  79597  79602    6                               1.05

 3.15 PlyA -  80193  80188    6                               1.05
 3.14 Term -  83449  83198  252  1  0   89   32   126 0.310   2.74
 3.13 Intr -  95429  95230  200  0  2   67   95   108 0.851   8.47
 3.12 Intr - 112658 112532  127  2  1   87  100   117 0.975  13.05
 3.11 Intr - 114158 114001  158  0  2  100   93   199 0.999  21.33
 3.10 Intr - 114496 114433   64  1  1  127  100    60 0.867   9.39
 3.09 Intr - 115810 115715   96  1  0   76   94    97 0.998   9.31
 3.08 Intr - 119332 119190  143  2  2   94  107   255 0.999  28.07
 3.07 Intr - 120422 120243  180  0  0   64   94   221 0.999  20.14
 3.06 Intr - 127388 127272  117  0  0   53   79    54 0.131   1.44
 3.05 Intr - 133651 133577   75  2  0   28   84    92 0.132   2.29
 3.04 Intr - 142083 141965  119  2  2  134   24    63 0.438   4.61
 3.03 Intr - 142539 142368  172  1  1   21   94    95 0.324   2.40
 3.02 Intr - 149391 149371   21  1  0  118  111    -5 0.220   2.32
 3.01 Init - 155433 155349   85  0  1   91   55   148 0.222  10.91
 3.00 Prom - 165170 165131   40                              -3.66

 4.00 Prom + 168275 168314   40                              -5.96
 4.01 Init + 169340 169394   55  1  1  100   96     0 0.374   1.64
 4.02 Term + 186855 187063  209  1  2   84   42   121 0.476   4.60
 4.03 PlyA + 191361 191366    6                               1.05

 5.00 Prom + 201566 201605   40                              -2.06
 5.01 Init + 219179 219241   63  1  0   80   96    35 0.931   2.69
 5.02 Term + 219279 219368   90  2  0  134   53    53 0.461   4.12
 5.03 PlyA + 219789 219794    6                               1.05

 6.05 PlyA - 220591 220586    6                               1.05
 6.04 Term - 230505 230262  244  2  1   62   42   183 0.147   6.47
 6.03 Intr - 234881 234751  131  2  2  113   82    -7 0.067   0.69
 6.02 Intr - 247788 247678  111  0  0   62   26    96 0.275   1.38
 6.01 Intr - 254852 254680  173  0  2  111   64    66 0.195   6.16

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815583r:63226288_63481720|GENSCAN_predicted_peptide_1|198_aa
MHQVTAYAEGIDFKIRTIELDGKKIKLQIWDTAGQERFRTITTAYYRGAMGIMLVYDITN
EKSFDNIKNWIRNIEEHASSDVERMILGNKCDMNDKRQVSKERGEKLAIDYGIKFLETSA
KSSANVEEVGHVLIPGTSECYFIWQKGLCRGDRVKDHEMGDYLDPSGPNVILGVLLMRGQ
EESEESHVMMDAEGEKVL

>gi568815583r:63226288_63481720|GENSCAN_predicted_CDS_1|597_bp
atgcatcaagtcacagcttatgctgaaggaattgattttaaaattagaacgatagaacta
gatggaaagaaaattaagcttcagatatgggacacagcgggtcaggaaagattccgaaca
atcacgacagcgtactacagaggagccatgggcattatgctggtctatgacatcacaaat
gaaaaatcctttgacaatattaaaaattggatcagaaacattgaagagcatgcctcttcc
gatgtcgaaagaatgatcctgggtaacaaatgtgatatgaatgacaaaagacaagtgtca
aaagaaagaggggagaagctagcaattgactatgggattaaattcttggagacaagcgca
aaatccagtgcaaatgtagaagaggttggtcatgtcttaatccccggaacctctgaatgt
tactttatatggcaaaaaggactttgcagaggggatcgagttaaggatcatgagatggga
gattatcttgatccaagtggacccaatgtaatcctaggggtccttctcatgagaggacag
gaagagtcagaagaaagccacgtgatgatggacgcagagggagaaaaggtgctgtga

>gi568815583r:63226288_63481720|GENSCAN_predicted_peptide_2|274_aa
MAGALTSPRTPASGAGAAAWPWRFRTPAPHPEPHPPNPARAGSISRPQRAPGSVSAVAMT
AAVFFGCAFIAFGPALALYVFTIATEPLRIIFLIAGAFFWLVSLLISSLVWFMARVIIDN
KDGPTQKYLLIFGAFVSVYIQEMFRFAYYKLLKKASEGLKSINPGETAPSMRLLAYAFMT
LVIILLHVFWGIVFFDGCEKKKWGILLIVLLTHLLVSAQTFISSYYGINLASAFIILVLM
GTWAFLAAGGSCRSLKLCLLCQDKNFLLYNQRSR

>gi568815583r:63226288_63481720|GENSCAN_predicted_CDS_2|825_bp
atggctggggccctaacgtcgccgaggacgcccgcctcgggagcaggagctgcggcgtgg
ccctggcgcttccggacacccgccccccaccccgagccccacccaccgaaccccgcccgc
gccgggagcatctcgcgtccccaacgggcccccgggtcggtttccgcggtggccatgact
gcggccgtgttcttcggctgcgccttcattgccttcgggcctgcgctcgccctttatgtc
ttcaccatcgccaccgagccgttgcgtatcatcttcctcatcgccggagctttcttctgg
ttggtgtctctactgatttcgtcccttgtttggttcatggcaagagtcattattgacaac
aaagatggaccaacacagaaatatctgctgatctttggagcgtttgtctctgtctatatc
caagaaatgttccgatttgcatattataaactcttaaaaaaagccagtgaaggtttgaag
agtataaacccaggtgagacagcaccctctatgcgactgctggcctatgctttcatgacg
ctggtcattatcttgctgcatgtattctggggcattgtattttttgatggctgtgagaag
aaaaagtggggcatcctccttatcgttctcctgacccacctgctggtgtcagcccagacc
ttcataagttcttattatggaataaacctggcgtcagcatttataatcctggtgctcatg
ggcacctgggcattcttagctgcgggaggcagctgccgaagcctgaaactctgcctgctc
tgccaagacaagaactttcttctttacaaccagcgctccagataa

>gi568815583r:63226288_63481720|GENSCAN_predicted_peptide_3|602_aa
MPRRSLHAAAVLLLVILKEQPSSPAPVNGSKWTYFAPFSHWGSIKPWNRRCSGLAGEEEG
RAVGLSCQGLSGLWLCSDLTILLPAGLEPRAFRGNCWAKGAGTTVERRHHGQSLPGATPP
GQEREIAPATVKGATEELSAENCLDLVCTVVESGELEDGFRGDMKSAVPSESCGLAVFSV
ALATRAGLLDDPQREAGPDGENSWSKKYPSCGGLLQSPIDLHSDILQYDASLTPLEFQGY
NLSANKQFLLTNNGHSVKLNLPSDMHIQGLQSRYSATQLHLHWGNPNDPHGSEHTVSGQH
FAAELHIVHYNSDLYPDASTASNKSEGLAVLAVLIEMGSFNPSYDKIFSHLQHVKYKGQE
AFVPGFNIEELLPERTAEYYRYRGSLTTPPCNPTVLWTVFRNPVQISQEQLLALETALYC
THMDDPSPREMINNFRQVQKFDERLVYTSFSQGVPLPLSRGLPAGYLEQLQPRFRKETGP
LGHRDHLPVRSSRAAIHTGDRPARGPRARIQLSQDVRGKVRSKEPLGPENVSSNNVINIA
CNSLISSQSEQASNGFEYRIIIPYVSCLMARAKLNPLAALSPHQGKEREKDSIGYSESPG
KS

>gi568815583r:63226288_63481720|GENSCAN_predicted_CDS_3|1809_bp
atgccccggcgcagcctgcacgcggcggccgtgctcctgctggtgatcttaaaggaacag
ccttccagcccggccccagtgaacggttccaagtggacttattttgcgcccttttcccac
tggggcagcatcaagccttggaaccgcaggtgcagcgggctggcaggtgaggaggaagga
cgagctgtggggctgtcctgccagggcctttcaggcctctggctgtgctccgacttgacc
attctgctacctgctggtcttgagccccgggcattcaggggcaactgctgggcgaagggt
gcaggcaccacggtggagcgcaggcaccatggccagtctcttccaggggccacaccacca
ggccaagaaagagagatagcacctgctacagtcaaaggagccactgaagaactttcagca
gagaactgtctggatctggtgtgcactgttgtggagagtggagagctggaggatggattt
agaggggacatgaagtctgcggtcccctctgagtcctgtggcttagctgtgttctctgtg
gctctagccacaagagcgggactgttggatgacccccagagagaggcaggtcctgatggg
gagaatagctggtccaagaagtacccgtcgtgtgggggcctgctgcagtcccccatagac
ctgcacagtgacatcctccagtatgacgccagcctcacgcccctcgagttccaaggctac
aatctgtctgccaacaagcagtttctcctgaccaacaatggccattcagtgaagctgaac
ctgccctcggacatgcacatccagggcctccagtctcgctacagtgccacgcagctgcac
ctgcactgggggaacccgaatgacccgcacggctctgagcacaccgtcagcggacagcac
ttcgccgccgagctgcacattgtccattataactcagacctttatcctgacgccagcact
gccagcaacaagtcagaaggcctcgctgtcctggctgttctcattgagatgggctccttc
aatccgtcctatgacaagatcttcagtcaccttcaacatgtaaagtacaaaggccaggaa
gcattcgtcccgggattcaacattgaagagctgcttccggagaggaccgctgaatattac
cgctaccgggggtccctgaccacacccccttgcaaccccactgtgctctggacagttttc
cgaaaccccgtgcaaatttcccaggagcagctgctggctttggagacagccctgtactgc
acacacatggacgacccttcccccagagaaatgatcaacaacttccggcaggtccagaag
ttcgatgagaggctggtatacacctccttctcccaaggagtgcccctcccgttgtcacgt
ggccttcctgctgggtatctggagcagttgcagcctcgcttccggaaggaaacggggcct
ctggggcacagggaccacctgccagtgcgttccagccgggctgccatccacactggagac
cgccccgcacgtggtcctcgggctcggattcagctgtcgcaggatgtgaggggaaaggtt
agaagcaaagagcccttgggacctgagaacgtttcatccaataacgtcatcaacatagcc
tgcaacagcctcatttcaagccagagtgaacaagcatcaaatggttttgagtatcgtatc
ataatcccctatgtatcctgtctgatggcaagggccaagctcaaccctttagctgcactg
tctcctcaccaagggaaagaaagagaaaaggactctataggctactcagaatcaccaggt
aaaagctaa

>gi568815583r:63226288_63481720|GENSCAN_predicted_peptide_4|87_aa
MARAHLDRGGAGVLIPDTARGKAREMQRVPNKDLSCPDLGLMLPLYLALPANGRPPLLKD
KGRGFGHLRPSDISGPSEKLQCRIQRD

>gi568815583r:63226288_63481720|GENSCAN_predicted_CDS_4|264_bp
atggcgagagcacacctggacaggggaggggcaggagttctgattcctgacacagcgagg
ggcaaagccagagaaatgcagagggtgccgaacaaggacctcagctgccctgatctgggg
ttgatgctgcctttgtacttagcactgcctgccaatggcaggccacctctgctgaaagac
aagggcagagggtttggccaccttaggcctagcgacatctctggacccagtgagaaactc
caatgtcgaattcagagggactga

>gi568815583r:63226288_63481720|GENSCAN_predicted_peptide_5|50_aa
MLWGQLLTRLYLIPRCSLGLRVLLSLLYLLMEQYQKQVRKRIKCERGTKL

>gi568815583r:63226288_63481720|GENSCAN_predicted_CDS_5|153_bp
atgctctggggccagcttctcaccaggctgtatttgatcccaagatgcagcctggggttg
agggtcctgctcagcttgctttatctcttgatggagcagtaccagaagcaggttagaaaa
agaatcaagtgtgagagaggaacaaagctctga

>gi568815583r:63226288_63481720|GENSCAN_predicted_peptide_6|219_aa
XSPQTLALWDLIQSQSFTPGSSYEKPGRPSKQRAKRNIPKQPEWALALCPPPLRHQTTAG
EMDRFWRCQNEQKEEEEEAVASSSVNPKSSGSQSKGGPAVTFFYGSDAAEVSTYPKLLQD
VTDETAGSPKDQTLGSHKSIQELDAYMMQSTVAEHLLSLGSQTLPATEQMTGKPSPLAVI
IPERSSGVPAIFRLNIYTRKAHNYHEHKTWECHGVTIEV

>gi568815583r:63226288_63481720|GENSCAN_predicted_CDS_6|660_bp
ngctcccctcagacacttgcactctgggacttgattcaaagccagtcatttaccccaggc
agcagctatgagaaaccaggtagaccctccaagcaacgagctaagagaaacatccccaaa
caaccagaatgggccctcgcactgtgccctcctcctctcagacaccagacaacggcaggt
gaaatggatcggttttggagatgtcaaaatgaacagaaagaggaggaggaggaggcagta
gcatcatcttctgttaaccccaagagcagtggttctcagagcaagggaggccccgcagta
acattcttttatgggtcagatgctgctgaggtttcaacctatcccaaactgttgcaggat
gtaactgacgaaactgcaggctccccaaaggatcagacactagggagccacaaaagtatc
caagagctggatgcctacatgatgcagtccacagtggctgagcaccttctgtccctggga
tctcagactctgcctgccacagagcagatgactggaaaaccctccccacttgctgtcatc
attcctgaaaggtcttcaggtgtgccagcaattttcagactgaatatctacaccagaaaa
gcacataactaccatgagcataagacgtgggagtgccatggagtgaccatagaagtatag