GENSCAN 1.0 Date run: 7-Nov-116 Time: 15:40:46 Sequence gi568815580r:2439003_2667216 : 228214 bp : 39.20% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 PlyA - 308 303 6 1.05 1.01 Sngl - 1359 1060 300 0 0 95 48 148 0.321 7.05 1.00 Prom - 8269 8230 40 -5.35 2.00 Prom + 13767 13806 40 -3.35 2.01 Init + 28130 28284 155 1 2 79 81 71 0.566 5.00 2.02 Term + 30216 30405 190 0 1 72 54 135 0.502 4.44 2.03 PlyA + 31287 31292 6 1.05 3.00 Prom + 32256 32295 40 -6.05 3.01 Init + 35285 35465 181 1 1 40 62 100 0.408 1.99 3.02 Intr + 39663 39951 289 1 1 94 14 323 0.127 20.88 3.03 Intr + 48121 48278 158 1 2 66 39 96 0.019 1.23 3.04 Intr + 63989 64181 193 0 1 -70 29 320 0.550 7.93 3.05 Term + 69782 69896 115 2 1 83 41 92 0.639 1.06 3.06 PlyA + 72013 72018 6 1.05 4.00 Prom + 73312 73351 40 -3.75 4.01 Init + 77093 77140 48 1 0 89 75 84 0.913 8.20 4.02 Term + 79657 79785 129 0 0 41 39 130 0.614 0.60 4.03 PlyA + 80132 80137 6 1.05 5.08 PlyA - 81127 81122 6 1.05 5.07 Term - 90230 90117 114 2 0 118 41 79 0.101 3.79 5.06 Intr - 96282 96145 138 0 0 31 99 74 0.083 2.54 5.05 Intr - 108527 108353 175 1 1 108 106 27 0.724 5.52 5.04 Intr - 113762 113693 70 0 1 89 97 38 0.804 2.12 5.03 Intr - 116036 115667 370 2 1 76 93 175 0.940 10.55 5.02 Intr - 124857 124795 63 0 0 72 115 28 0.779 1.90 5.01 Init - 128214 127819 396 0 0 31 70 288 0.983 18.05 5.00 Prom - 128295 128256 40 -7.05 6.00 Prom + 129090 129129 40 -9.65 6.01 Init + 129388 129454 67 0 1 68 72 69 0.717 4.59 6.02 Intr + 130668 130858 191 1 2 45 116 38 0.844 0.68 6.03 Intr + 130950 131061 112 2 1 68 67 130 0.949 7.93 6.04 Intr + 132244 132520 277 2 1 44 34 306 0.805 16.45 6.05 Term + 132719 132866 148 2 1 84 40 140 0.801 5.19 6.06 PlyA + 133753 133758 6 1.05 7.00 Prom + 133780 133819 40 -12.13 7.01 Init + 133984 134084 101 0 2 71 92 80 0.381 6.49 7.02 Intr + 138744 138867 124 0 1 47 65 97 0.320 2.97 7.03 Intr + 146111 146200 90 1 0 53 76 75 0.438 2.07 7.04 Intr + 148828 148921 94 0 1 43 80 100 0.849 3.32 7.05 Intr + 150202 150308 107 2 2 57 65 92 0.901 2.81 7.06 Intr + 151016 151160 145 1 1 73 87 109 0.999 8.23 7.07 Intr + 156414 156619 206 1 2 72 80 279 0.561 23.50 7.08 Intr + 160017 160169 153 2 0 108 92 85 0.999 10.25 7.09 Intr + 162394 162483 90 0 0 100 108 77 0.998 10.17 7.10 Intr + 167413 167505 93 0 0 45 116 84 0.302 6.14 7.11 Intr + 169698 169828 131 2 2 51 83 188 0.999 13.17 7.12 Intr + 171757 171859 103 1 1 77 107 62 0.999 6.26 7.13 Term + 177435 177572 138 2 0 79 49 176 0.999 9.78 7.14 PlyA + 177609 177614 6 1.05 8.05 PlyA - 177884 177879 6 1.05 8.04 Term - 186368 186235 134 0 2 99 44 161 0.353 10.07 8.03 Intr - 194197 194059 139 1 1 98 80 98 0.574 9.12 8.02 Intr - 195787 195596 192 1 0 101 95 0 0.099 0.77 8.01 Init - 202888 202766 123 1 0 73 49 106 0.352 5.42 8.00 Prom - 208078 208039 40 -5.25 9.04 PlyA - 208593 208588 6 1.05 9.03 Term - 216410 216234 177 2 0 3 39 194 0.779 2.70 9.02 Intr - 217295 217018 278 0 2 48 69 278 0.882 18.21 9.01 Init - 217405 217336 70 1 1 102 44 42 0.665 2.48 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 31945 31776 170 0 2 88 88 107 0.803 9.37 S.002 Term + 39663 39955 293 1 2 94 50 325 0.873 23.82 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815580r:2439003_2667216|GENSCAN_predicted_peptide_1|99_aa MENVCVVSIDQTSHNISLIQSLIQSKALALFNSVKAEKGEQDAEEKLETSRGLFITKVQD KAASADVEAAASYPDLTKILDEVGHTKPQIFSVNETDLY >gi568815580r:2439003_2667216|GENSCAN_predicted_CDS_1|300_bp atggagaacgtttgtgtggtctcgatagatcaaaccagccataacatttccttaatccaa agtctaattcagagcaaggccctagctcttttcaattctgtgaaagctgagaaaggtgag caagatgcagaagaaaagttggaaactagcagaggtttgttcataacaaaagtgcaagat aaggcagcaagtgctgatgtagaagctgcagcaagttatccagatctaactaagatcctt gatgaagttggccacactaaaccacagattttcagtgtaaatgaaacagatctctattga >gi568815580r:2439003_2667216|GENSCAN_predicted_peptide_2|114_aa MIIQRRQYKETQGEDSHLQAKERGLEQILSLYPSQGTNPASTLILDIQPGEWLKVAPGDI NSSLSGLNQNAGARGTQGVTEKAQEQGAKRLQVHGGGMLVVGEESECNRSVHQH >gi568815580r:2439003_2667216|GENSCAN_predicted_CDS_2|345_bp atgatcatacagagaagacaatataaggagacacagggagaagacagccatctacaggcc aaggagagaggcctggaacagatcctttccttatatccctcacaaggaaccaaccctgcc agcactttgattttggatatccagcctggagaatggttgaaggttgctcctggggacatt aactcttcactctctgggctgaatcagaatgctggggccaggggaactcaaggggtgaca gagaaggctcaagagcagggagcaaaaagacttcaagtgcacggaggaggaatgttggta gtgggagaggaatctgagtgcaacagatctgtccaccagcactga >gi568815580r:2439003_2667216|GENSCAN_predicted_peptide_3|311_aa MNPPRALKRSLMMQRRDWKGERWGLQQVQVPGAARAEEVMLYPPSWVMDVGSASAWLEGL DKLATNKLVLKFTWKGKTQNSQHNTEKENTWRKEEEKEGLGGGGRRRKRRRKKEKEEEED GGGRGRRWRRRRKKIEEEEEEEGGGGRRRRRRAEEGQNMDGAGGYYPQKTIAGTENQISH VLTYRWELNAENLRTQRRKQHTGVYLKVEEEEEEEGEEEEEEEAAVAEEENKKEEEGRKK RKKKEEEEEEEEEQRKKGKKRKKERSLYTFHSSICVAKAWIQCKQVYHSFLQTHDINMVF MKTGGVYAEPL >gi568815580r:2439003_2667216|GENSCAN_predicted_CDS_3|936_bp atgaacccacctcgggctttgaaacggtcactaatgatgcaaagaagagattggaaagga gagaggtggggtttgcagcaagtccaagttcctggagctgcaagagcagaggaggtaatg ctgtatccaccttcttgggtcatggatgttggaagtgcaagtgcatggctggaggggctt gacaaactggctactaacaagctggttctaaagtttacatggaaaggcaagacccaaaat agccagcacaatactgagaaagaaaatacttggaggaaggaggaggagaaagaaggacta ggaggaggaggaagaaggagaaagaggaggaggaagaaggagaaggaggaggaggaagat ggaggaggaagaggaagaagatggaggaggaggaggaagaagatagaggaggaggaggaa gaagagggaggaggaggaagaaggaggaggaggagggcagaagaagggcagaatatggat ggagctggaggctattatcctcagaaaactattgcaggaacagaaaaccaaatatcacat gttctcacttataggtgggagctaaatgctgagaacttacgaacacaaagaaggaaacag cacactggggtttacttgaaggtggaggaagaggaagaggaagaaggagaagaagaagaa gaagaagaagcagcggtggcagaggaggagaataaaaaggaggaagaaggaagaaagaag aggaagaagaaggaagaagaagaggaggaggaggaggagcagagaaaaaaaggaaagaaa agaaaaaaggagagaagtttgtacacatttcactcttccatttgtgtagcaaaagcctgg attcaatgcaagcaggtctaccactccttcttacaaacacatgacatcaacatggtcttt atgaaaactggaggtgtctatgcagagcctctgtaa >gi568815580r:2439003_2667216|GENSCAN_predicted_peptide_4|58_aa MEIGEQQQSLGRNAKQVTVLTQRKVSRVSRNRLAFVSAVQEELSKLQRTVWALKRTLA >gi568815580r:2439003_2667216|GENSCAN_predicted_CDS_4|177_bp atggagattggagagcagcagcagtcacttggccggaatgcaaagcaggtgactgtgctg acacagaggaaagtgtcgagagtttcccgaaacaggttggcctttgttagtgcagtccag gaagaactctccaaactgcagaggactgtctgggctctgaagagaacactggcttaa >gi568815580r:2439003_2667216|GENSCAN_predicted_peptide_5|441_aa MSVVHQLSAGWLLDHLSFINKINYQLHQHHEPCCRKKEFTTSVHFESLQMDSVSSSGVCA AFIASDSSTKPENDDGGNYEMFTRKFVFRPELFDVTKPYITPAVHKECQQSNEKEDLMNG VKKEISISIIGKKRKRCVVFNQGELDAMEYHTKIRELILDGSLQLIQEGLKSGFLYPLFE KQDKGSKPITLPLDACSLSELCEMAKHLPSLNEMEHQTLQLVEEDTSVTEQDLFLRVVEN NSSFTKVITLMGQKYLLPPKSSFLLSDISCMQPLLNYRKTFDVIVIDPPWQNKSVKRSNR YSYLSPLQIQQIPIPKLAAPNCLLVTWVTNRQKHLRFIKEELYPSWSVEVVAEWHWVKLQ ADAGFSPDVIERIIITSKDLPFPEDAVRFTSFDSCHAPLAYHVLVRKLEREDAKRWDDGK GRVYGRARNRWKFLKPIEENA >gi568815580r:2439003_2667216|GENSCAN_predicted_CDS_5|1326_bp atgtctgtggtacaccagttgtcagctgggtggttactggatcatctttcttttatcaac aagataaactatcaacttcaccagcatcatgaaccttgttgccgtaaaaaggagttcact acttctgttcactttgagtctcttcaaatggattctgtgtcctcctctggagtctgtgct gcatttattgcttctgactcttccactaagccagagaatgatgatggaggaaattatgaa atgttcacacgaaaatttgtttttcgacctgaactgtttgatgtcaccaaaccttatata actccagctgttcataaagaatgccagcaaagtaatgaaaaggaagatctgatgaatggt gttaaaaaagaaatctccatttctattattgggaagaagcgtaaaagatgtgttgttttc aatcaaggtgaattggatgctatggaataccatacaaagatcagggagctgattttggat ggatctttacagttgatccaggaaggtctcaaaagtggttttctttatccactttttgaa aaacaggacaagggtagtaagcccattactttaccacttgacgcctgcagtttgtcagaa ttatgtgaaatggcaaagcatttgccttctctgaatgaaatggaacatcagacattacaa ttggtggaagaggatacatctgttacagaacaggatttatttttgcgagttgttgaaaac aactctagctttacaaaagtgattactttaatgggacagaaatacctgctaccaccgaaa agcagttttcttttatctgacatttcttgtatgcaaccacttctaaactataggaaaaca tttgatgtaattgtgatagatccaccatggcagaacaaatcagttaaaagaagtaatagg tacagttatttgtcacccctgcaaatacagcaaatacctatccctaaattggctgctcca aactgtcttcttgttacttgggtgaccaatagacagaagcacctacgttttataaaggaa gaactttatccctcttggtctgtggaggtagttgctgagtggcactgggtaaaacttcag gcagatgctggcttttctcctgatgtgattgagaggattataatcacttcaaaggacctt cccttccctgaagatgctgtccgatttacctcatttgattcatgtcatgcaccactggcc taccatgtcttggtaaggaagttagaaagagaagatgcgaaaagatgggacgatggaaaa ggaagggtatatgggagggcaagaaacagatggaaatttctgaagccaattgaagaaaat gcttaa >gi568815580r:2439003_2667216|GENSCAN_predicted_peptide_6|264_aa MAVTTGSGVGVPLNFQLLEEPEAQIISPLLRSPLRFAFHEEFRAFPISSLPHLYQEMLLP PRQVSFCTVTTQHFPRHFFALTRSHKPFNSGFDMVKRVTRVIQTPRLSGSLVVKPGKRPH EDNFHREKAFSFSKSPWAPPCCCTEKRFQGELSVQGTPRRSSCPQVFRPNFRGRASLRRD GHTGPQTPVPGVTQLGATSTQPSSDEAVAWKLLRASAVLEEESDGANDLNLEVLRRCFGV CLLLGEASRPKLCEALVSRDLLNF >gi568815580r:2439003_2667216|GENSCAN_predicted_CDS_6|795_bp atggcagtcaccacaggctcaggagtaggagtacctctcaatttccaactgttggaagaa cctgaagcacagatcataagccctctgctccgctcccctctgagattcgccttccatgag gaattcagggctttccccatatcttctcttccccacctttatcaagagatgctgctccct ccccgtcaagtctctttttgcactgtcaccacccaacactttccacgacacttctttgct ttgaccagaagccataagccttttaactcaggttttgatatggtgaaaagagtcaccagg gttattcagacaccccggctgtcagggtccttggtggttaaacctgggaaaaggccacat gaagacaacttccacagagaaaaagctttctccttttctaagagtccatgggcgccgcca tgttgctgtacggaaaaacgtttccagggcgagttgagcgttcagggcactcctcggcgc tcaagctgcccacaggtcttccgacctaactttaggggtcgtgccagtcttcgtagagac ggccacactggcccacagaccccagttcctggggtgacgcagctgggcgcgaccagcacg cagccttccagcgacgaggcggtcgcatggaagttactgcgcgcgtcagctgttctggag gaagaaagtgatggcgccaatgacttaaatttagaggtcctgaggcgatgttttggtgtt tgtttgctgttgggagaagcctcgaggccaaaactgtgcgaggcgctggtttcacgggat ctattaaatttctag >gi568815580r:2439003_2667216|GENSCAN_predicted_peptide_7|524_aa MKRSSVSSGGAGRLSMQELRSQDVNKQGLYTPQTTSGHGSRNSQLGIFSSSEKIKDPRPL NDKAFIQQCIRQLCEIHTAMKESSPLFDDGQPWGEETEDGIMHNKLFLDYTIKCYESFMS GADSFDEMNAELQSKLKDLFNVDAFKLESLEAKNRALNEQIARLEQEREKEPNRLESLRK LKASLQGDVQKYQAYMSNLESHSAILDQKLNGLNEEIARVELECETIKQENTRLQNIIDN QKYSVADIERINHERNELQQTINKLTKDLEAEQQKLWNEELKYARGKEAIETQLAEYHKL ARKLKLIPKGAENSKGYDFEIKFNPEAGANCLVKYRAQVYVPLKELLNETEEEINKALNK KMGLEDTLEQLNAMITESKRSVRTLKEEVQKLDDLYQQKIKEAEEEDEKCASELESLEKH KHLLESTVNQGLSEAMNELDAVQREYQLVVQTTTEERRKVGNNLQRLLEMVATHVGSVEK HLEEQIAKVDREYEECMSEDLSENIKEIRDKYEKKATLIKSSEE >gi568815580r:2439003_2667216|GENSCAN_predicted_CDS_7|1575_bp atgaagcgcagttcagtttccagcggtggtgctggccgcctctccatgcaggagttaaga tcccaggatgtaaataaacaaggcctctatacccctcaaacaactagtggacatggatcc cggaatagtcaacttggtatattttccagttctgagaaaatcaaggacccgagaccactt aatgacaaagcattcattcagcagtgtattcgacaactctgtgagatacatactgccatg aaagaaagctcacctttatttgatgatgggcagccttggggagaagaaactgaagatgga attatgcataataagttgtttttggactacaccataaaatgctatgagagttttatgagt ggtgccgacagctttgatgagatgaatgcagagctgcagtcaaaactgaaggatttattt aatgtggatgcttttaagctggaatcattagaagcaaaaaacagagcattgaatgaacag attgcaagattggaacaagaaagagaaaaagaaccgaatcgtctagagtcgttgagaaaa ctgaaggcttccttacaaggagatgttcaaaagtatcaggcatacatgagcaatttggag tctcattcagccattcttgaccagaaattaaatggtctcaatgaggaaattgctagagta gaactagaatgtgaaacaataaaacaggagaacactcgactacagaatatcattgacaac cagaagtactcagttgcagacattgagcgaataaatcatgaaagaaatgaattgcagcag actattaataaattaaccaaggacctggaagctgaacaacagaagttgtggaatgaggag ttaaaatatgccagaggcaaagaagcgattgaaacacaattagcagagtatcacaaattg gctagaaaattaaaacttattcctaaaggtgctgagaattccaaaggttatgactttgaa attaagtttaatcccgaggctggtgccaactgccttgtcaaatacagggctcaagtttat gtacctcttaaggaactcctgaatgaaactgaagaagaaattaataaagccctaaataaa aaaatgggtttggaggatactttagaacaattgaatgcaatgataacagaaagcaagaga agtgtgagaactctgaaagaagaagttcaaaagctggatgatctttaccaacaaaaaatt aaggaagcagaggaagaggatgaaaaatgtgccagtgagcttgagtccttggagaaacac aagcacctgctagaaagtactgttaaccaggggctcagtgaagctatgaatgaattagat gctgttcagcgggaataccaactagttgtgcaaaccacgactgaagaaagacgaaaagtg ggaaataacttgcaacgtctgttagagatggttgctacacatgttgggtctgtagagaaa catcttgaggagcagattgctaaagttgatagagaatatgaagaatgcatgtcagaagat ctctcggaaaatattaaagagattagagataagtatgagaagaaagctactctaattaag tcttctgaagaatga >gi568815580r:2439003_2667216|GENSCAN_predicted_peptide_8|195_aa MLLENNVMLASKRCDYEEMKLIIRKETETVVCEPREALRGGEWKELSMEGYPLPSPPPQR SCHLPLIGKQLRGHPHLFLKPASFWWVESPASEQMISFSQVRGLKAQHHAEAAKVWDFHP LKPQPKLYVGPFQPWLEWLGTGHQVPRLHTAACRGDTTELPGDAAAPLTRADGLDGVSSR PSCGTWSSGGSSGFT >gi568815580r:2439003_2667216|GENSCAN_predicted_CDS_8|588_bp atgctgctggaaaataacgtgatgcttgccagcaagagatgtgactatgaagaaatgaaa ctgattatcagaaaagaaaccgaaactgtggtttgtgagcctagagaagcactcagagga ggggagtggaaagagctgtcgatggagggctacccattaccctcccctcctccacaaagg agctgtcacctccccttaataggcaaacaactcaggggccatccccatttgtttcttaaa cctgcaagtttctggtgggtggagtctcctgcttcagaacaaatgatatctttctcccaa gtcagaggtctcaaggctcaacaccacgcagaagctgccaaggtgtgggacttccaccct ctgaagccacagcccaagctgtacgttggcccctttcagccatggctggagtggctggga acagggcaccaagtccctagactgcacactgcagcctgcagaggagacaccactgagctt cctggtgatgctgcagcccctctcaccagggctgatggccttgatggggtgtcctcaagg ccttcctgtggaacatggtcctctggagggtcttctggctttacatga >gi568815580r:2439003_2667216|GENSCAN_predicted_peptide_9|174_aa MAGTREREAKAAGPGPGDRFIALIYTRIPSSPLRVPDTHTREIPRSPSAPRPAEAGPRAR TPFRADQTSTPSCGRRLRHPPQSPQRPHQARRRPPLPYWGKGEKTATPAAAGLRAEVPRR LTEPSLAGSRGRGGGKLQAGGTRRAQSPPKRARGSLCRENRRYESRRPFFLLIS >gi568815580r:2439003_2667216|GENSCAN_predicted_CDS_9|525_bp atggcgggcacacgggagagggaagccaaggccgcaggaccgggaccgggtgacaggttt attgccctcatctacacgcgcatcccttcctcgccccttcgcgtacctgacacacacacg cgcgaaatcccgcgtagtccgagcgctccccgacctgcagaggccggtccccgagctcgg actccttttcgcgccgatcaaacaagtacaccgtcctgtggccgacgcctccgccatcct cctcagtccccacagaggccccaccaggcccgccgccgtccgccgctgccatattgggga aaaggagaaaagacagcgacgcctgccgccgccgggctcagggctgaggtgccgcgccgg ctgactgagccctctctggctgggtcgcgtggacgcggcggcgggaagcttcaggccggc ggtactcgccgggcgcagagtccaccaaagcgagcacgtgggtccctctgccgagaaaat aggcgctacgagagccggcgcccctttttcctcctcatttcttag