GENSCAN 1.0 Date run: 8-Nov-116 Time: 00:50:59 Sequence gi568815592r:39215127_39422460 : 207334 bp : 47.62% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 6107 6156 50 1 2 63 108 20 0.263 1.25 1.02 Term + 8123 8222 100 2 1 16 55 143 0.536 1.40 1.03 PlyA + 8896 8901 6 1.05 2.03 PlyA - 9334 9329 6 1.05 2.02 Term - 13381 13142 240 1 0 103 41 94 0.854 2.23 2.01 Init - 13985 13800 186 2 0 101 78 336 0.995 30.96 2.00 Prom - 23468 23429 40 -4.46 3.00 Prom + 26997 27036 40 -2.76 3.01 Init + 27187 27280 94 0 1 67 110 67 0.764 5.34 3.02 Intr + 28483 28580 98 2 2 75 78 35 0.463 0.93 3.03 Intr + 51294 51325 32 2 2 77 115 38 0.766 2.33 3.04 Term + 52179 52350 172 0 1 69 37 105 0.302 0.80 3.05 PlyA + 54045 54050 6 -0.45 4.00 Prom + 54091 54130 40 -0.76 4.01 Init + 57512 57521 10 1 1 95 103 0 0.411 2.90 4.02 Intr + 73059 73186 128 1 2 61 96 73 0.442 5.80 4.03 Intr + 77804 77990 187 1 1 108 67 70 0.803 6.16 4.04 Term + 79664 79773 110 0 2 64 53 75 0.445 0.27 4.05 PlyA + 81466 81471 6 1.05 5.07 PlyA - 83899 83894 6 1.05 5.06 Term - 84159 84013 147 0 0 51 45 75 0.475 -2.60 5.05 Intr - 84611 84484 128 0 2 118 75 66 0.856 8.70 5.04 Intr - 89005 88831 175 1 1 52 77 311 0.960 25.91 5.03 Intr - 89529 89369 161 1 2 93 94 144 0.811 15.21 5.02 Intr - 95881 95767 115 1 1 47 87 116 0.528 7.42 5.01 Init - 99194 98958 237 2 0 56 36 471 0.981 34.71 5.00 Prom - 99533 99494 40 -15.48 6.06 PlyA - 99598 99593 6 1.05 6.05 Term - 101316 101093 224 1 2 99 54 164 0.958 11.18 6.04 Intr - 101821 101656 166 1 1 76 77 172 0.895 14.43 6.03 Intr - 102826 102660 167 2 2 120 85 105 0.999 13.08 6.02 Intr - 104007 103893 115 0 1 96 71 109 0.990 10.02 6.01 Init - 107414 107202 213 2 0 76 72 370 0.878 30.94 6.00 Prom - 108085 108046 40 -5.56 7.12 PlyA - 108571 108566 6 -0.45 7.11 Term - 111162 111047 116 1 2 104 43 118 0.693 7.63 7.10 Intr - 128689 128544 146 0 2 45 41 183 0.001 9.23 7.09 Intr - 130663 130574 90 0 0 104 62 58 0.733 3.91 7.08 Intr - 142248 142151 98 0 2 66 116 10 0.104 0.41 7.07 Intr - 145404 145269 136 2 1 132 80 237 0.781 27.87 7.06 Intr - 147392 147308 85 0 1 108 44 112 0.081 7.68 7.05 Intr - 170546 170460 87 0 0 89 85 45 0.102 4.24 7.04 Intr - 182541 182468 74 2 2 86 103 -20 0.005 -1.65 7.03 Intr - 184830 184729 102 2 0 90 66 60 0.020 3.29 7.02 Intr - 198912 198867 46 1 1 44 103 39 0.003 -1.53 7.01 Intr - 204877 204822 56 0 2 107 99 28 0.104 4.42 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 128689 128583 107 0 2 45 101 220 0.970 18.86 S.002 Init - 147378 147308 71 0 2 99 44 87 0.900 5.92 S.003 Intr + 150608 150788 181 1 1 92 97 118 0.837 12.54 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592r:39215127_39422460|GENSCAN_predicted_peptide_1|49_aa MHISGSHSLWPPQPVAKGHSHNHQTYCLVGDSFSNTTKAPQWQLDKDPY >gi568815592r:39215127_39422460|GENSCAN_predicted_CDS_1|150_bp atgcacatcagtggctcccacagcctgtggcccccacagcctgtggccaaaggccactcc cacaaccatcagacatactgcttggttggtgattccttcagcaacaccaccaaggccccg cagtggcagctggacaaagatccctactga >gi568815592r:39215127_39422460|GENSCAN_predicted_peptide_2|141_aa MVDRGPLLTSAIIFYLAIGAAIFEVLEEPHWKEAKKNYYTQKLHLLKEFPCLGQEGLDKI LEGQCHACQAFLKEREPGRVAGGSLSPSLGKIFRPQQSSGEELEALWGPNPRDEGWCLEC WGRAQASGKAYLAETSFKGLE >gi568815592r:39215127_39422460|GENSCAN_predicted_CDS_2|426_bp atggtggaccggggccctctgctcacctcggccatcatcttctacctggccatcggggcg gcgatcttcgaagtgctggaggagccacactggaaggaggccaagaaaaactactacaca cagaagctgcatctgctcaaggagttcccgtgcctgggtcaggagggcctggacaagatc ctagagggccagtgccacgcctgccaggcttttcttaaggaacgtgaacccggcagagtg gcgggagggtccctcagcccgagcctgggcaaaatcttccgcccccagcagagttccgga gaggagctggaggctctgtggggcccgaacccacgggatgagggctggtgcctggaatgc tggggacgcgctcaggcttccgggaaggcttatctcgcggagacatccttcaagggcttg gagtaa >gi568815592r:39215127_39422460|GENSCAN_predicted_peptide_3|131_aa MQKPSLTSPRALTPAYALLTLPPLCFMRNITGWKTKMIAAMALFRALREKEETSLRQGCD LIPESNFQSCKLHAWQPQVEWKLKTGYISGYCTHSLTAAKDQGEASLGCKKKPDSPGSRD CALSLWGKEVS >gi568815592r:39215127_39422460|GENSCAN_predicted_CDS_3|396_bp atgcagaagccttccctgacttccccaagggccctgaccccagcatatgccctcctgacc cttcctcccttgtgcttcatgagaaatatcacaggctggaaaaccaagatgatagcagct atggctctcttcagagctcttagagagaaagaggaaacttcactgagacaaggatgtgac ctgatcccagagagcaacttccagtcctgcaagctgcatgcatggcagccccaggtggag tggaagctcaaaacgggctacatttctgggtactgcacccactccctgactgctgcaaaa gaccagggagaggcttctcttggctgcaagaagaagccagattctcctgggagcagggac tgtgctctgagcctctgggggaaggaggtcagctag >gi568815592r:39215127_39422460|GENSCAN_predicted_peptide_4|144_aa MGTGRKEKTKTGKKETERKTAKKKRNKERGKKYRQIIVLYLKRCGQEWLTYTQQTLTVPC LLCTHHAHVRCRFPIRKCWGVNTILLQVMRNGIWGIYAPGSLPLRGDNSHWRLKDRQVQG DFLEEKVQLGFMSFLMKEGGEGEG >gi568815592r:39215127_39422460|GENSCAN_predicted_CDS_4|435_bp atggggacaggaaggaaggaaaaaaccaagacgggaaagaaggagacagaaaggaagaca gcaaagaaaaaacgaaataaggaaagagggaaaaaatacagacagattatcgtgctatac ctgaaaaggtgtggccaggaatggttgacatatacgcagcagacgctgacagtgccctgc ctgctctgcacccaccatgctcatgtacgctgcaggtttccaatccgcaaatgctgggga gttaacaccatcctactccaagtgatgaggaatgggatttggggaatatatgcccccggc tctctaccccttagaggggacaactcccattggaggctaaaagacaggcaagtccagggg gacttcttggaggaaaaggtacagttgggcttcatgagcttcctgatgaaagagggtgga gaaggagaaggttga >gi568815592r:39215127_39422460|GENSCAN_predicted_peptide_5|320_aa MYRPRARAAPEGRVRGCAVPSTVLLLLAYLAYLALGTGVFWTLEGRAAQDSSRSFQRDKW ELLQNFTCLDRPALDSLIRDVVQAYKNGASLLSNTTSMGRWELVGSFFFSVSTITTIGYG NLSPNTMAARLFCIFFALVGIPLNLVVLNRLGHLMQQGVNHWASRLGGTWQDPDKARWLA GSGALLSGLLLFLLLPPLLFSHMEGWSYTEGFYFAFITLSTVGFGDYVIGMNPSQRYPLW YKNMVSLWILFGMAWLALIIKLILSQLETPGRGIQRHVLGDMGCDFRVSGQHALLPHFLT LAGCNAADMMAGSSGSHTAP >gi568815592r:39215127_39422460|GENSCAN_predicted_CDS_5|963_bp atgtaccgaccgcgagcccgggcggctcccgagggcagggtccggggctgcgcggtgccc agcaccgtgctcctgctgctcgcctacctggcttacctggcgctgggcaccggcgtgttc tggacgctggagggccgcgcggcgcaggactccagccgcagcttccagcgcgacaagtgg gagctgttgcagaacttcacgtgtctggaccgcccggcgctggactcgctgatccgggat gtcgtccaagcatacaaaaacggagccagcctcctcagcaacaccaccagcatggggcgc tgggagctcgtgggctccttcttcttttctgtgtccaccatcaccaccattggctatggc aacctgagccccaacacgatggctgcccgcctcttctgcatcttctttgcccttgtgggg atcccactcaacctcgtggtgctcaaccgactggggcatctcatgcagcagggagtaaac cactgggccagcaggctggggggcacctggcaggatcctgacaaggcgcggtggctggcg ggctctggcgccctcctctcgggcctcctgctcttcctgctgctgccaccgctgctcttc tcccacatggagggctggagctacacagagggcttctacttcgccttcatcaccctcagc accgtgggcttcggcgactacgtgattggaatgaacccctcccagaggtacccactgtgg tacaagaacatggtgtccctgtggatcctctttgggatggcatggctggccttgatcatc aaactcatcctctcccagctggagacgccagggaggggtatacagagacatgtcctgggt gacatgggatgtgactttcgggtgtcggggcagcatgcccttctcccccacttccttact ttagcgggctgcaatgccgccgatatgatggctgggagctctggcagccatacggcacca tga >gi568815592r:39215127_39422460|GENSCAN_predicted_peptide_6|294_aa MPSAGLCSCWGGRVLPLLLAYVCYLLLGATIFQLLERQAEAQSRDQFQLEKLRFLENYTC LDQWAMEQFVQVIMEAWVKGVNPKGNSTNPSNWDFGSSFFFAGTVVTTIGYGNLAPSTEA GQVFCVFYALLGIPLNVIFLNHLGTGLRAHLAAIERWEDRPRRSQVLQVLGLALFLTLGT LVILIFPPMVFSHVEGWSFSEGFYFAFITLSTIGFGDYVVGTDPSKHYISVYRSLAAIWI LLGLAWLALILPLGPLLLHRCCQLWLLSRGLGVKDGAASDPSGLPRPQKIPISA >gi568815592r:39215127_39422460|GENSCAN_predicted_CDS_6|885_bp atgcccagtgctgggctctgcagctgctggggtggccgggtgctgcccctgctgctggcc tatgtctgctacctgctgctcggtgccactatcttccagctgctagagaggcaggcggag gctcagtccagggaccagtttcagttggagaagctgcgcttcctggagaactacacctgc ctggaccagtgggccatggagcagtttgtgcaggtcatcatggaagcctgggtgaaaggt gtgaaccccaaaggcaactctaccaaccccagcaactgggactttggcagcagtttcttc tttgcaggcacagtcgtcactaccataggatatgggaacctggcacccagcacagaggca ggtcaggtcttctgtgtcttctatgccctgttgggcatcccgcttaacgtgatcttcctc aaccacctgggcacagggctgcgtgcccatctggccgccattgaaagatgggaggaccgt cccaggcgctcccaggtactgcaagtcctgggcctggctctgttcctgaccctggggacg ctggtcattctcatcttcccacccatggtcttcagccatgtggagggctggagcttcagc gagggcttctactttgctttcatcactctcagcaccattggctttggggactatgttgtt ggcacagaccccagcaagcattatatctcagtgtatcggagcctggcagccatctggatc ctcctgggcctggcgtggctggcgctgatcctcccactgggccccctgcttctgcacaga tgctgccagctctggctgctcagtaggggcctcggcgtcaaggatggggcagcctctgac cccagtgggctccccaggcctcagaagatccccatctctgcatga >gi568815592r:39215127_39422460|GENSCAN_predicted_peptide_7|345_aa XFSEAKALGESINEARSKIVIPDPYKNRNGSEKSRGCCLLGTLSSNGAIAGGCYQASKTL WPWRWHLTSPNSLDFLIMHYPQNDQLISTRTEIGHLKEEITQRHIQQVALGRYKENLQEE DEGISENMAVPLMPDQQEEKLRSQLEEEKRRYKTMFTRLKALKVEIEHLQLLMDKAKVKL QKEFEVWWAEEATNLQVNSPAVNSLDHTKPFLQTSDSQHEWSQLLSNKSDVNARKILPSP CPSPHSQKQSSTSTPLEDSIPKRPVSSIPLTGDSQTDSDIIAFIKARQSILQKQCKCTSL GGSSWAAEQPSLPVDCVQTAQRYSVLQSYSVLPEALQTSTDELSK >gi568815592r:39215127_39422460|GENSCAN_predicted_CDS_7|1038_bp nnattttctgaagccaaggccctgggagaaagtataaatgaagcaagaagtaaaattgta attccagatccctacaagaatagaaatggttcagagaaaagcaggggctgctgcctgctg ggcacgctatcctctaatggtgccattgctggtggctgctaccaagcctcaaagaccctg tggccctggcgatggcatctgactagtcctaactccctagatttccttattatgcattat ccccaaaatgaccagcttatatccaccaggactgagatcggtcacctgaaggaagaaatc acccagcggcatatacagcaagtagccctaggtaggtacaaagagaatctgcaggaagag gatgaaggaatctcggaaaacatggccgtgcctctgatgccagaccagcaggaggagaag ctgcgatcacaactggaggaagaaaagagaaggtataaaacaatgttcactcgcctgaaa gccctgaaggtggagatcgagcacttgcagctgctcatggacaaagccaaggtgaagcta cagaaagagtttgaagtctggtgggcagaggaggccaccaacctgcaggtaaattctcca gcagtgaattcactcgatcacacgaagccatttctccagacatctgactcccagcatgaa tggtcccaactcctctctaacaaaagtgatgtgaatgccaggaaaatcctgccctcgcct tgccccagtccacacagccagaaacagagcagcaccagcaccccactggaagacagcatc cccaagaggccagtgtcgtccatccctctcaccggagacagccagacggactcggacatc atcgccttcatcaaggccagacagagcattctgcagaagcaatgtaagtgcacctccctc ggtggctcctcctgggcagcagaacaaccgtcattacctgtggactgtgtccagactgct caaagatactcagtgctacagagttactcagtcctgcctgaagccctgcagacatccacg gatgagctttctaagtga