GENSCAN 1.0 Date run: 3-Nov-116 Time: 11:50:25 Sequence gi568815597r:21119958_21390157 : 270200 bp : 48.50% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 2 41 40 -2.76 1.01 Init + 9449 9457 9 1 0 78 111 0 0.857 1.98 1.02 Intr + 9720 9780 61 2 1 68 53 60 0.020 -1.19 1.03 Intr + 29857 29926 70 2 1 5 115 79 0.049 0.54 1.04 Term + 30188 30314 127 2 1 115 38 85 0.930 3.96 1.05 PlyA + 31021 31026 6 1.05 2.24 PlyA - 33411 33406 6 1.05 2.23 Term - 56563 56008 556 0 1 139 48 371 0.258 32.00 2.22 Intr - 57301 57137 165 0 0 50 61 77 0.051 0.28 2.21 Intr - 100174 100002 173 1 2 93 95 202 0.983 20.14 2.20 Intr - 101885 101790 96 2 0 115 109 111 0.999 16.11 2.19 Intr - 105483 105293 191 1 2 118 80 427 0.998 44.20 2.18 Intr - 107269 107202 68 0 2 107 102 81 0.999 9.95 2.17 Intr - 108084 107974 111 2 0 116 32 159 0.992 12.59 2.16 Intr - 113704 113601 104 1 2 105 70 93 0.984 8.17 2.15 Intr - 115970 115893 78 2 0 108 86 130 0.999 14.55 2.14 Intr - 116887 116789 99 1 0 90 100 163 0.999 18.01 2.13 Intr - 118287 118177 111 0 0 83 92 125 0.954 12.98 2.12 Intr - 124527 124370 158 1 2 89 99 53 0.637 6.23 2.11 Intr - 125146 124935 212 0 2 127 47 302 0.652 28.56 2.10 Intr - 127406 127264 143 2 2 128 95 250 0.999 28.85 2.09 Intr - 136181 135990 192 2 0 70 93 414 0.955 39.79 2.08 Intr - 137633 137568 66 2 0 89 121 62 0.989 8.70 2.07 Intr - 138882 138736 147 0 0 142 99 232 0.999 30.13 2.06 Intr - 140435 140314 122 0 2 73 96 104 0.993 9.91 2.05 Intr - 152954 152742 213 0 0 102 111 185 0.833 20.89 2.04 Intr - 159375 159234 142 0 1 142 58 207 0.985 23.03 2.03 Intr - 162814 162746 69 1 0 154 100 -71 0.091 0.08 2.02 Intr - 170199 170113 87 0 0 54 96 281 0.577 25.67 2.01 Init - 170457 170407 51 0 0 72 60 166 0.577 11.40 2.00 Prom - 174330 174291 40 -9.16 3.00 Prom + 175440 175479 40 -4.86 3.01 Init + 180161 180328 168 1 0 92 94 136 0.533 12.17 3.02 Intr + 187301 187358 58 1 1 109 77 -4 0.013 -0.94 3.03 Intr + 200283 200403 121 1 1 58 80 74 0.426 3.15 3.04 Intr + 206162 206276 115 2 1 95 77 -9 0.204 -0.85 3.05 Intr + 207754 207837 84 0 0 105 83 9 0.512 2.12 3.06 Term + 214354 214515 162 0 0 85 47 139 0.926 7.44 3.07 PlyA + 217807 217812 6 1.05 4.00 Prom + 219309 219348 40 -5.36 4.01 Init + 224215 224375 161 0 2 91 84 48 0.668 2.00 4.02 Term + 224954 225659 706 2 1 36 54 243 0.390 8.69 4.03 PlyA + 228403 228408 6 1.05 5.03 PlyA - 229345 229340 6 1.05 5.02 Term - 256162 256037 126 1 0 107 39 60 0.724 1.28 5.01 Init - 257186 256998 189 2 0 62 82 84 0.482 4.27 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:21119958_21390157|GENSCAN_predicted_peptide_1|88_aa MHLDISKIYRCQDEEVTTPHPNSICSPTKLEITIILDMYLSSTDTDRNHSSQVLYYARPC CSSSHFWLSPAAVPDESQDGKDKMAKDF >gi568815597r:21119958_21390157|GENSCAN_predicted_CDS_1|267_bp atgcacttggacatttccaaaatctaccgttgccaggatgaagaagtaacaaccccccac cccaattccatatgttcacccaccaagctagaaatcaccatcattctggacatgtatctg tcatcaactgacacagacaggaaccacagcagccaagtcctatactatgcccgcccctgc tgcagcagcagccatttttggctcagccctgctgcagtcccagatgaaagccaggacggc aaggacaagatggcaaaggacttttaa >gi568815597r:21119958_21390157|GENSCAN_predicted_peptide_2|1117_aa MRGVWPPPVSALLSALGMSTYKRATLDEEDLVDSLSEGDAYPNGLQMALLLALSCLLVAH TMTFPCTFKVNFHSPRSGQRCWAARTQVEKRLVVLVVLLAAGLVACLAALGIQYQTRSPS VCLSEACVSVTSSILSSMDPTVDPCHDFFSYACGGWIKANPVPDGHSRWGTFSNLWEHNQ AIIKHLLENSTASVSEAERKAQVYYRACMNETRIEELRAKPLMELIERLGGWNITGPWAK DNFQDTLQVVTAHYRTSPFFSVYVSADSKNSNSNVIQVDQSGLGLPSRDYYLNKTENEKV LTGYLNYMVQLGKLLGGGDEEAIRPQMQQILDFETALANITIPQEKRRDEELIYHKVTAA ELQTLAPAINWLPFLNTIFYPVEINESEPIVVYDKEYLEQISTLINTTDRCLLNNYMIWN LVRKTSSFLDQRFQDADEKFMEVMYGTKKVSLLLRMPEYGLSAYFMPRCLWEHRKGPQPA GGEPTPSGKFHSHFHLLPKEVAVTLSLMQPTNGYSLNSCHRVPGSKPSTGILRQTCLPRW KFCVSDTENNLGFALGPMFVKATFAEDSKSIATEIILEIKKAFEESLSTLKWMDEETRKS AKEKADAIYNMIGYPNFIMDPKELDKVFNDYTAVPDLYFENAMRFFNFSWRVTADQLRKA PNRDQWSMTPPMVNAYYSPTKNEIVFPAGILQAPFYTRSSPKALNFGGIGVVVGHELTHA FDDQGREYDKDGNLRPWWKNSSVEAFKRQTECMVEQYSNYSVNGEPVNGRHTLGENIADN GGLKAAYRAYQNWVKKNGAEHSLPTLGLTNNQLFFLGFAQVWCSVRTPESSHEGLITDPH SPSRFRVIGSLSNSKEFSEHFRCPPGSPMNPPHKCEVWGFSPRRCLSAPASGAQRTADSG AITIVLRGWAAGEDSRKPDLAFGLSLRPAPAPLLPPAPRSRRLPGSAPIPPPRCPRPLHP GAPGPALGGLVASLCHEFTAETGSEIRHSDCSRNRTRHRSGGGGGSSGTASSPGGGSSGG GGGGGGGGGSGPPSSPEHQGPPDSGAPTVSFSPSCRPFGSRRRSPGGPRSPGDPPAGRHS PARKGTAAQAPASGLGGAFGVPRQQHCGGTPRPSAIL >gi568815597r:21119958_21390157|GENSCAN_predicted_CDS_2|3354_bp atgcggggcgtgtggccgcccccggtgtccgccctgctgtcggcgctggggatgtcgacg tacaagcgggccacgctggacgaggaggacctggtggactcgctctccgagggcgacgca taccccaacggcctgcagatggctttgctccttgctcttagctgtctccttgtcgcccac accatgacatttccctgtacatttaaggtgaacttccacagcccccggagtggccagagg tgctgggctgcacggacccaggtggagaagcggctggtggtgttggtggtacttctggcg gcaggactggtggcctgcttggcagcactgggcatccagtaccagacaagatccccctct gtgtgcctgagcgaagcttgtgtctcagtgaccagctccatcttgagctccatggacccc acagtggacccctgccatgacttcttcagctacgcctgtgggggctggatcaaggccaac ccagtccctgatggccactcacgctgggggaccttcagcaacctctgggaacacaaccaa gcaatcatcaagcacctcctcgaaaactccacggccagcgtgagcgaggcagagagaaag gcgcaagtatactaccgtgcgtgcatgaacgagaccaggatcgaggagctcagggccaaa cctctaatggagttgattgagaggctcgggggctggaacatcacaggtccctgggccaag gacaacttccaggacaccctgcaggtggtcaccgcccactaccgcacctcacccttcttc tctgtctatgtcagtgccgattccaagaactccaacagcaacgtgatccaggtggaccag tctggcctgggcttgccctcgagagactattacctgaacaaaactgaaaacgagaaggtg ctgaccggatatctgaactacatggtccagctggggaagctgctgggcggcggggacgag gaggccatccggccccagatgcagcagatcttggactttgagacggcactggccaacatc accatcccacaggagaagcgccgtgatgaggagctcatctaccacaaagtgacggcagcc gagctgcagaccttggcacccgccatcaactggttgccttttctcaacaccatcttctac cccgtggagatcaatgaatccgagcctattgtggtctatgacaaggaataccttgagcag atctccactctcatcaacaccaccgacagatgcctgctcaacaactacatgatctggaac ctggtgcggaaaacaagctccttccttgaccagcgctttcaggacgccgatgagaagttc atggaagtcatgtacgggaccaagaaggtgagcctgctactgcgtatgcctgaatatggg ctgagcgcctactttatgccaaggtgtctatgggagcacaggaagggacctcagccagct gggggggaacctacccccagtggaaagttccacagccacttccaccttctccccaaggaa gtggctgtcaccctctcactcatgcagccgactaatggttactcattgaactcctgccat cgcgtgccaggctctaagcccagtacgggaatactgagacagacctgtcttcctcgctgg aagttttgcgtgagtgacacagaaaacaacctgggctttgcgttgggccccatgtttgtc aaagcaaccttcgccgaggacagcaagagcatagccaccgagatcatcctggagattaag aaggcatttgaggaaagcctgagcaccctgaagtggatggatgaggaaacccgaaaatca gccaaggaaaaggccgatgccatctacaacatgataggataccccaacttcatcatggat cccaaggagctggacaaagtgtttaatgactacactgcagttccagacctctactttgaa aatgccatgcggtttttcaacttctcatggagggtcactgccgatcagctcaggaaagcc cccaacagagatcagtggagcatgaccccgcccatggtgaacgcctactactcgcccacc aagaatgagattgtgtttccggccgggatcctgcaggcaccattctacacacgctcctca cccaaggccttaaactttggtggcataggtgtcgtcgtgggccatgagctgactcatgct tttgatgatcaaggacgggagtatgacaaggacgggaacctccggccatggtggaagaac tcatccgtggaggccttcaagcgtcagaccgagtgcatggtagagcagtacagcaactac agcgtgaacggggagccggtgaacgggcggcacaccctgggggagaacatcgccgacaac gggggtctcaaggcggcctatcgggcttaccagaactgggtgaagaagaacggggctgag cactcgctccccaccctgggcctcaccaataaccagctcttcttcctgggctttgcacag gtctggtgctccgtccgcacacctgagagctcccacgaaggcctcatcaccgatccccac agcccctctcgcttccgggtcatcggctccctctccaattccaaggagttctcagaacac ttccgctgcccacctggctcacccatgaacccgcctcacaagtgcgaagtctggggtttt tcgcctcgccgctgcctctccgcccctgcgagcggggctcagcgaacagccgactccggg gccattaccattgttcttcgaggctgggccgcaggggaggactctcgcaagccagacctc gcctttggtctttccctccgccccgcacccgcgccgctgctccccccagccccccgctcc cgccgcctccccgggtctgcccccatcccgcccccccgctgcccccggcccctgcacccc ggcgcgcccggtcccgctctgggggggttggtggcttcgctttgccatgagtttaccgca gaaaccggctctgaaatcaggcacagcgactgcagcaggaaccggacccggcaccggagc ggcggcggcggcggcagcagcggtaccgcctcctcacccggcggcggcagcagcggcggc ggcggcggcggcggcggcggcggcggcagcggtcccccctcctcacccgaacatcagggc cctccagactcaggcgccccaacagtatccttttcaccttcctgtcggcccttcggttct cgtcgtcgcagtccagggggaccccgcagcccaggggaccctcctgccggccgccacagc ccagccaggaaagggacagcggctcaagcccctgcatcgggcttgggcggagcttttggg gtgccccggcagcagcactgcggggggacccccagaccctctgcaatactgtga >gi568815597r:21119958_21390157|GENSCAN_predicted_peptide_3|235_aa MAWARMLLTHWRQHMGAVGHAGFPAAGLLAATIPAGQQYPQEAAEKQGEEHQAWSQRLWG AGASISTSWVEDRSQGSSRQQLYGLKQVTGLLIYKMGMVTGPTPQRTATIKWDNESPHPS AAPRSQPTIAASQTQLPHSTSSLSKNTHAGEGGELHSWGPQPPGPQTSTSPWPVRNRAAQ QEPPHRKPNRMPSNHPLNTQGRRKMKCRFLMAIFHQTSPGNKEPQSHRIPSSVAV >gi568815597r:21119958_21390157|GENSCAN_predicted_CDS_3|708_bp atggcatgggccaggatgctgctcacccactggaggcagcacatgggggctgtgggacac gcagggttccctgctgctggcctgctggctgccaccattccagctggacaacagtatccc caggaggcggctgagaagcagggagaagaacaccaggcttggagtcagcgactctggggg gctggagctagcatctccacatcatgggtggaagatagatctcaaggctccagccgtcaa cagctgtatgggctcaaacaggtcaccggacttctcatttataaaatggggatggtgaca gggccaaccccacagaggactgctacgatcaaatgggataatgagagcccccatccttct gccgccccacgttctcagccaacaatagctgcttcccagactcagctgccacacagcacg agctccctgagcaaaaatacccatgcgggggagggtggggagctacacagctggggtccc cagccccccgggcctcagaccagtaccagtccatggcctgttaggaaccgggccgcacag caggagccaccacaccggaagcccaacaggatgccgagtaaccacccactgaacactcag gggcgaaggaagatgaagtgccgctttctcatggccatcttccaccagaccagcccagga aacaaggaaccacaaagccacaggattccctccagcgtggccgtgtga >gi568815597r:21119958_21390157|GENSCAN_predicted_peptide_4|288_aa MGGRVVRGLQLWAQQGLGTTVGLFGFTGGPILLPSRRPRAWQHHHREVEGLDHSAPRRAP DRPQCPGTLRSSCIQAPRNRRHPRARPSPAALGLQAKLSPGTSEQPKTDRSNRPQSEPGT PSPPHPAGTRTKLGAAAPDGTKRSAAGRAALCPGAPRLHAFREPGGGCCCSRRPAPREPG RPRVPPAAPLPATVEAGLDRTRPPRAALTIARVLRPGFAQLPAPGSRFPAPGSLLPAQLL GRLGCLAQAARSAPAGELAASAPSAGGAGPPRGQWQGAGAWVAQPITA >gi568815597r:21119958_21390157|GENSCAN_predicted_CDS_4|867_bp atgggggggcgggttgtaaggggactccagctctgggcacagcaggggctgggaaccact gtgggactcttcggctttacagggggtcccatcctgctcccatccaggcgccctcgggct tggcagcatcatcacagagaagtggaggggctggatcacagtgccccaaggcgagcccca gaccgacctcagtgtcccgggacgctcaggtccagctgcatccaggcacctcgaaatcgc cgccacccgcgagcccgacccagcccggctgccctgggtctccaagccaagctgagtcca gggacttcagaacaacccaaaacagaccgcagcaaccgtccccaaagcgaaccggggacc ccgagccccccacatccggctgggaccagaaccaagctcggcgcggccgcccccgacggg accaagcgcagcgccgccgggagagccgcgctctgcccgggcgctccccgactccacgcc ttccgagagccgggcgggggctgctgctgcagccggcgcccagccccccgcgagccaggg cggccccgggtgccacccgcggcaccgctgcccgcgaccgtcgaggctgggctggaccgg accagacctccgcgcgcagcactcaccatagctcgcgtgctccgccccggcttcgcgcag ctccccgcgcccggctcccgattcccagctccgggttccctgctcccagcccagctgctc ggacggctcggctgcctggcccaggcggcgcgctcagctccagcgggcgagctcgcggct tcggcgccgagtgccggaggggcggggccgccgcggggccaatggcagggcgcgggggcc tgggtggcgcagccaatcacggcctga >gi568815597r:21119958_21390157|GENSCAN_predicted_peptide_5|104_aa MAGEASESWREVKGTSYMVAARENEKDEKQKPLINPSDLMRLIHYHKNSMGETAPMIQMI SHQAQHHVEATKVWGLHPQKPWAELYFGPLQQQLEWLGHKAPSP >gi568815597r:21119958_21390157|GENSCAN_predicted_CDS_5|315_bp atggctggggaggcctcagaatcatggcgggaggtgaagggcacttcttacatggtggca gcaagagaaaatgagaaggatgaaaagcagaaacccctgataaacccatcggatctcatg agacttattcactaccacaagaacagtatgggggaaactgcccccatgattcaaatgatc tcccaccaggctcaacaccacgtggaagccaccaaggtttggggcttgcaccctcagaaa ccatgggccgagctgtactttggcccccttcagcaacagctggagtggctgggacacaag gcaccaagtccctag