GENSCAN 1.0    Date run: 7-Nov-116    Time: 22:16:54

Sequence gi568815592r:108112152_108360921 : 248770 bp : 43.24% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.04 PlyA -    468    463    6                              -0.45
 1.03 Term -   3865   3738  128  2  2   17   53   113 0.210  -0.86
 1.02 Intr -   5968   5764  205  1  1   53   32   144 0.194   4.07
 1.01 Init -   6491   6309  183  2  0   17   69   180 0.414   6.12
 1.00 Prom -  19923  19884   40                              -4.06

 2.00 Prom +  35069  35108   40                              -4.46
 2.01 Init +  43726  43762   37  0  1   65   93    37 0.052   2.10
 2.02 Intr +  54501  54639  139  1  1   83   39    75 0.077   1.62
 2.03 Intr +  56140  56250  111  1  0   50  100    91 0.732   5.99
 2.04 Intr +  57335  57465  131  2  2   79   70    71 0.874   4.74
 2.05 Intr +  58022  58108   87  0  0   64   43    71 0.492   0.14
 2.06 Intr +  59307  59452  146  1  2   97   82   190 0.991  19.30
 2.07 Intr +  62685  62772   88  2  1  125   41    91 0.990   7.64
 2.08 Intr +  64352  64587  236  0  2   69   82   336 0.610  28.51
 2.09 Intr +  65944  66090  147  0  0   81   80   170 0.977  15.93
 2.10 Intr +  68172  68268   97  2  1   45   99     8 0.776  -2.82
 2.11 Intr +  68656  68805  150  2  0   65  119   144 0.984  15.33
 2.12 Intr +  69395  69500  106  0  1   76   60   122 0.463   7.47
 2.13 Term +  75150  75312  163  0  1   90   43    87 0.456   1.81
 2.14 PlyA +  76079  76084    6                               1.05

 3.00 Prom +  79488  79527   40                              -7.46
 3.01 Init +  83204  83254   51  1  0   75   93    72 0.806   6.77
 3.02 Term +  93466  93528   63  0  0  122   40    62 0.262   2.69
 3.03 PlyA +  94753  94758    6                               1.05

 4.06 PlyA -  95332  95327    6                               1.05
 4.05 Term -  95703  95586  118  2  1   55   48    62 0.136  -3.09
 4.04 Intr - 102471 102347  125  0  2   58  108    70 0.509   5.38
 4.03 Intr - 110894 110799   96  2  0   86  116    64 0.972   9.21
 4.02 Intr - 148842 148609  234  0  0    8  100   408 0.652  31.89
 4.01 Init - 149074 148985   90  1  0   39   49   102 0.589  -0.11
 4.00 Prom - 167173 167134   40                              -3.66

 5.00 Prom + 168399 168438   40                              -2.86
 5.01 Init + 182929 183067  139  0  1   89   86    43 0.087   2.51
 5.02 Intr + 211674 211897  224  1  2   57  100   177 0.357  13.65
 5.03 Intr + 234837 234888   52  2  1   60  111     9 0.007  -1.12
 5.04 Intr + 243503 243604  102  0  0   63   80    58 0.144   2.65
 5.05 Intr + 244539 244669  131  1  2   43  103   115 0.615   8.91

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 140045 140235  191  0  2   27   48   226 0.915  10.11

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592r:108112152_108360921|GENSCAN_predicted_peptide_1|171_aa
MRKKRAPSRYLPLRAPQAAGPRQPGFSRRAGRPLRPLRPLPTAPAARRAHGLGGWVLARA
QRNFEGGNCSGAHGLGGGSPSLSLHRVWEGLKPKLLGLRRVRGRKVHARSLLSAAFPRSR
ALQANLCFWVLLGEKWCLSPFECAEAALVDQSVRAPPVDSFGFLDVQHQVD

>gi568815592r:108112152_108360921|GENSCAN_predicted_CDS_1|516_bp
atgcgaaagaaacgcgcccctagccggtacctcccgctccgggccccgcaggcggctggg
cctcgtcagcccgggttcagccgccgcgccggccgccccctgcgccccctgcgccccctg
cccacggcccccgcagcccggcgggcgcacggcctcggcggctgggtcctcgcgcgggcg
cagaggaacttcgagggcgggaactgctccggcgctcatggactcgggggcggcagcccg
agcctctccttgcaccgtgtctgggagggcctgaagcccaaactcctcgggctgagaagg
gtccggggccgcaaggtgcacgcgcggtccttgctgtcagcggcctttccgcgctctcgc
gctctgcaggccaacttgtgcttctgggttctcttgggtgaaaaatggtgcttgagtccc
ttcgagtgtgcagaagcagccctcgttgaccaaagcgtcagggcgcctccagtggacagt
tttgggttcctcgacgtccagcaccaggtggactga

>gi568815592r:108112152_108360921|GENSCAN_predicted_peptide_2|545_aa
MDMLSIATLANKGALQAGGQLESGGARRRGGRCRPGLGQRPPTAPPRDSQHEQASRINKP
PSSLAPGQALAYTPGSKEPRALSSFSGCGAPGALESATGKRTAGVTVLLGPKRGPWSGQR
ARVKQRMVVRYGNQVYRFQIPNPSAFGSSCRISGRPEAAGAQTGGLRIGRILDIPCKVCG
DRSSGKHYGVYACDGCSGFFKRSIRRNRTYVCKSGNQGGCPVDKTHRNQCRACRLKKCLE
VNMNKDAVQHERGPRTSTIRKQVALYFRGHKEENGAAAHFPSAALPAPAFFTAVTQLEPH
GLELAAVSTTPERQTLVSLAQPTPKYPHEVNGTPMYLYEVATESVCESAARLLFMSIKWA
KSVPAFSTLSLQDQLMLLEDAWRELFVLGIAQWAIPVDANTLLAVSGMNGDNTDSQKLNK
IISEIQALQEVVARFRQLRLDATEFACLKCIVTFKAVPTHSGSELRSFRNAAAIAALQDE
AQLTLNSYIHTRYPTQPCRFGKLLLLLPALRSISPSTIEEVFFKKTIGNVPITRLLSDMY
KSSDI

>gi568815592r:108112152_108360921|GENSCAN_predicted_CDS_2|1638_bp
atggacatgctcagcattgccacacttgccaataaaggagccttgcaggctggagggcag
ctggagagcggcggcgcccggcggcgaggcgggcgctgccggccgggactcgggcagcgc
ccaccaaccgctccgccccgggacagccagcatgagcaagccagccggatcaacaagcct
cctagctcgctcgccccgggacaggccctcgcctacacccctggaagtaaggagccccgg
gctctttcgtccttttcggggtgtggagcccctggggcccttgaaagtgccacagggaag
aggacagctggggtgacagtactgctgggccccaaacgtggaccctggagcggtcagagg
gcgcgtgttaagcagaggatggttgtaagatatggaaaccaagtttaccgcttccagatc
cccaacccaagcgctttcggcagcagctgtcgaatttcagggaggcctgaggccgctggg
gcccaaactggaggccttcggataggccgcattttagatatcccctgcaaagtgtgtggc
gaccgcagctcggggaagcactacggggtctacgcctgcgacggctgctcaggttttttc
aaacggagcatccgaaggaataggacctatgtctgcaaatctggaaaccagggaggctgt
ccggtggacaagacgcacagaaaccagtgcagggcgtgtcggctgaagaagtgtttggaa
gtcaacatgaacaaagacgccgtgcagcacgagcgggggcctcggacgtccaccatccgc
aagcaagtggccctctacttccgtggacacaaggaggagaacggggccgccgcgcacttt
ccctcggcggcgctccctgcgccggccttcttcaccgcggtcacgcagctggagccgcac
ggcctggagctggccgcggtgtccaccactccagagcggcagaccctcgtgagcctggct
cagcccacgcccaagtacccccatgaagtgaatgggaccccaatgtatctctatgaagtg
gccacggagtcggtgtgtgaatcagctgccagacttctcttcatgagcatcaagtgggct
aagagtgtgccagccttctccacgctgtctttgcaagaccagctgatgcttttggaagat
gcttggagagaactgtttgttctaggaatagcacaatgggccattccggttgatgctaac
actctactggctgtatctggcatgaacggtgacaacacagattcccagaagctgaacaag
atcatatctgaaatacaggctttacaagaggtggtggctcgatttagacaactccggtta
gatgctactgaatttgcctgtctaaaatgcatcgtcactttcaaagccgttcctacacat
agtggttctgaactgagaagtttccggaatgctgccgccattgcagcccttcaagatgag
gctcagctaacgctcaacagctacatccataccagatatcccactcaaccctgtcgcttt
ggaaaactcctgttgcttttgccagctttacgttctattagcccatcaactatagaagaa
gtgtttttcaaaaaaaccatcggcaatgtgccaattacaagactgctttcagatatgtac
aaatccagtgatatctaa

>gi568815592r:108112152_108360921|GENSCAN_predicted_peptide_3|37_aa
MAAEPDAMHPLRLPAAQPTQHEDEDEDLYGDPLPLNE

>gi568815592r:108112152_108360921|GENSCAN_predicted_CDS_3|114_bp
atggcagctgaacctgatgcaatgcaccctctgcggctccccgcagcccagcctactcaa
catgaagatgaggatgaagacctttacggtgatccacttccacttaatgaatag

>gi568815592r:108112152_108360921|GENSCAN_predicted_peptide_4|220_aa
MTPLLLTPQSAVPLPSLGRASREPGVAFQLQRRRRRLNAEGAEGARGGGSSYSEMAETVA
DTRRLITKPQNLNDAYGPPSNFLEIDVSNPQTVGVGRGRFTTYEIRVKTNLPIFKLKEST
VRRRYSDFEWLRSELERESKVVVPPLPGKAFLRQLPFRGDDGIFDDNFIEERKQGLEQFI
NKVSARNESDPVDSTSNTKHLIVQSANRQERSNTSHEKTH

>gi568815592r:108112152_108360921|GENSCAN_predicted_CDS_4|663_bp
atgacaccgcttctcctcacaccccagtccgcagtgcccctccccagcctcggccgggcc
tcccgggagccgggcgtggcgttccagctacagcggcggcggcggcggctgaacgcggag
ggggcggagggagcccgcggcggcggcagcagctacagcgaaatggcggagaccgtggct
gacacccggcggctgatcaccaagccgcagaacctgaatgacgcctacggaccccccagc
aacttcctcgagatcgatgtgagcaacccgcaaacggtgggggtcggccggggccgcttc
accacttacgaaatcagggtcaagacaaatcttcctattttcaagctgaaagaatctact
gttagaagaagatacagtgactttgaatggctgcgaagtgaattagaaagagagagcaag
gtcgtagttcccccgctccctgggaaagcgtttttgcgtcagcttccttttagaggagat
gatggaatatttgatgacaattttattgaggaaagaaaacaagggctggagcagtttata
aacaaagtgagtgccagaaatgaaagcgacccagtggactcaactagcaacaccaagcac
ttgattgtgcaaagtgctaacagacaggagagatcgaacaccagtcatgaaaagactcac
taa

>gi568815592r:108112152_108360921|GENSCAN_predicted_peptide_5|216_aa
MAASWSLLVTLRPLAQSPLRGRCVGCGAWAAALAPLATAPGKPFWKAYTVQTSESMTPTA
TSETYLKALAVCHGPLDHYDFLIKAHELKDDEHQRRVIQCLQKLHEDLKGYNIEAEGLFS
KLFSRSKPPRGLYVYGDVGTGKTMVMDMFYAYVEMKRKKRVHFHGFMLDVHKRIHRLKQS
LPKRKPGFMAKSYDPIAPIAEEISEEACLLCFDEFQ

>gi568815592r:108112152_108360921|GENSCAN_predicted_CDS_5|648_bp
atggcggcctcctggtcgctcttggttaccctgcgccccttagcacagagcccgctgaga
gggagatgtgttgggtgcggggcctgggccgccgctctcgctcctctggccaccgcccct
gggaagcccttttggaaagcctatacggttcagacatccgagagcatgaccccaactgcc
acttcagagacttatttgaaagctttggccgtttgccatggacctctggaccactatgat
tttctgatcaaagctcatgagctaaaggatgatgaacatcaaagaagagtcatacagtgt
ttgcagaaattacacgaggaccttaaaggatacaatatagaggcagaaggccttttttca
aagcttttttcaaggagcaaacctccaaggggcctgtatgtttatggagatgttggtaca
ggaaaaacaatggtgatggacatgttttatgcttatgtggaaatgaagaggaaaaaacgg
gttcattttcatggtttcatgctagatgtgcacaaaagaatacatcgccttaaacagagt
ttgccaaaaaggaaaccaggattcatggctaaatcatatgacccaatagctcccatagcc
gaagaaatcagcgaagaagcatgtctcctatgttttgatgaatttcag