GENSCAN 1.0 Date run: 2-Nov-116 Time: 22:20:21 Sequence gi568815594r:68214243_68438386 : 224144 bp : 37.53% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 2461 2589 129 0 0 46 70 136 0.490 7.57 1.02 Term + 5064 5120 57 2 0 52 48 51 0.233 -5.79 1.03 PlyA + 5337 5342 6 1.05 2.12 PlyA - 5358 5353 6 1.05 2.11 Term - 13830 13669 162 0 0 86 48 121 0.990 4.85 2.10 Intr - 14642 14500 143 0 2 61 91 84 0.733 5.15 2.09 Intr - 15274 15015 260 0 2 79 115 143 0.817 12.28 2.08 Intr - 17138 16961 178 0 1 70 97 50 0.583 2.16 2.07 Intr - 18174 18136 39 1 0 101 93 31 0.570 2.18 2.06 Intr - 20381 20221 161 1 2 59 3 119 0.564 -0.79 2.05 Intr - 21827 21760 68 2 2 115 84 13 0.915 0.38 2.04 Intr - 22024 21909 116 2 2 80 94 57 0.954 4.75 2.03 Intr - 31394 31309 86 1 2 39 103 109 0.186 6.04 2.02 Intr - 38432 38276 157 0 1 54 34 172 0.533 6.55 2.01 Init - 41765 41738 28 2 1 40 109 22 0.435 -0.68 2.00 Prom - 44267 44228 40 -4.95 3.00 Prom + 45020 45059 40 -2.45 3.01 Init + 50376 50433 58 2 1 51 103 32 0.251 2.52 3.02 Intr + 55774 55973 200 2 2 76 37 107 0.344 2.55 3.03 Intr + 64349 64489 141 1 0 9 33 165 0.306 2.73 3.04 Intr + 68574 68633 60 2 0 62 79 93 0.055 3.81 3.05 Intr + 86429 86719 291 1 0 -15 61 214 0.624 4.81 3.06 Intr + 88068 88217 150 2 0 119 92 84 0.983 11.34 3.07 Intr + 90445 90609 165 0 0 -12 105 124 0.719 3.34 3.08 Term + 95191 95202 12 0 0 106 45 7 0.289 -4.57 3.09 PlyA + 97152 97157 6 1.05 4.18 PlyA - 97402 97397 6 1.05 4.17 Term - 100081 99857 225 1 0 119 39 253 0.982 19.30 4.16 Intr - 102206 102072 135 2 0 75 86 82 0.765 6.54 4.15 Intr - 104620 104584 37 0 1 47 111 43 0.759 -0.15 4.14 Intr - 108673 108507 167 1 2 95 72 151 0.991 12.04 4.13 Intr - 115278 115148 131 1 2 78 59 34 0.543 -1.01 4.12 Intr - 116068 115960 109 1 1 56 116 32 0.635 1.74 4.11 Intr - 117955 117861 95 2 2 119 62 53 0.977 4.56 4.10 Intr - 118605 118552 54 1 0 36 64 98 0.538 0.23 4.09 Intr - 119155 119066 90 2 0 80 66 59 0.793 1.95 4.08 Intr - 122298 122206 93 1 0 54 110 43 0.807 2.12 4.07 Intr - 123208 122785 424 1 1 73 79 887 0.986 78.81 4.06 Intr - 123658 123330 329 2 2 59 52 423 0.979 30.29 4.05 Intr - 124142 124041 102 0 0 80 75 142 0.992 11.33 4.04 Intr - 135615 135484 132 1 0 27 96 215 0.989 15.90 4.03 Intr - 158290 158213 78 2 0 85 58 50 0.001 0.30 4.02 Intr - 162226 162011 216 2 0 75 -6 203 0.012 7.15 4.01 Init - 174307 174286 22 1 1 100 101 29 0.442 5.30 4.00 Prom - 176857 176818 40 -5.05 5.00 Prom + 181643 181682 40 -3.15 5.01 Init + 186571 186665 95 0 2 56 64 64 0.550 0.70 5.02 Term + 192223 192436 214 1 1 76 38 243 0.961 13.82 5.03 PlyA + 193636 193641 6 1.05 6.00 Prom + 196995 197034 40 -4.95 6.01 Init + 207077 207369 293 1 2 39 36 286 0.568 15.17 6.02 Intr + 209818 209915 98 1 2 46 40 107 0.042 0.43 6.03 Term + 210214 210788 575 2 2 21 42 257 0.574 8.03 6.04 PlyA + 210849 210854 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594r:68214243_68438386|GENSCAN_predicted_peptide_1|61_aa VRTSQYELEIFQVRTSQCQYDERKGGGALKETISTQHKGDDSLLDTMTTSMKPATSPGPQ R >gi568815594r:68214243_68438386|GENSCAN_predicted_CDS_1|186_bp gttagaacttctcagtatgaacttgaaatctttcaggttagaacttctcagtgtcagtat gatgaaagaaaaggtggaggtgccttaaaggaaacaatatccactcagcataaaggtgat gactccctgctggacaccatgactacctccatgaagccagccaccagtccaggaccacag agatga >gi568815594r:68214243_68438386|GENSCAN_predicted_peptide_2|465_aa MSMRNVKGAVYYTFHMNLKYTPPDEGIVAYNLILESRGLEAALRSEEYTHFTYRDSSKRS GGIVIVIVIFVIIITTVIPDTAITIRTLCTEKTYYYQGDFHISGVTYNDNCENAASQAST NLSKDIETKMLNAFQNSSIYKEYVKSEVIKLLPNANGSNVQLQLKFKFPPAEGVSMRTKI KAKLHQMLKNNMASWNAVPASIKLMEISKAASEMLTNNCCGRQVANSIITGNKIVNGKSS LEGAWPWQASMQWKGRHYCGASLISSRWLLSAAHCFAKKNNSKDWTVNFGIVVNKPYMTR KVQNIIFHENYSSPGLHDDIALVQLAEEVSFTEYIRKICLPEAKMKLSENDNVVVTGWGT LYMNGSFPVILQEDFLKIIDNKICNASYAYSGFVTDTMLCAGFMSGEADACQNDSGGPLA YPDSRNIWHLVGIVSWGDGCGKKNKPGVYTRVTSYRNWITSKTGL >gi568815594r:68214243_68438386|GENSCAN_predicted_CDS_2|1398_bp atgtctatgaggaatgtgaaaggagcagtatactacaccttccatatgaatctgaaatac accccaccagatgaaggcatagtagcatacaatctgattttagagagcagaggtctggaa gcggcactgaggagtgaggagtacacgcacttcacctatagggacagcagtaaaaggagt ggtggcattgtcatcgttatcgttatcttcgtcatcatcattaccaccgttatacctgat actgccataacaatcagaacattatgtacagagaagacttactattatcaaggtgatttt catatttctggagtcacatacaatgataattgtgaaaacgcagcttcacaagccagcaca aatctaagcaaagatattgagactaagatgttaaatgcatttcaaaattccagtatatat aaggaatatgtcaaatctgaggtcatcaaacttctgcctaatgccaatggttcaaatgtg cagttacagctgaaattcaagtttcctccagcagaaggagttagcatgaggactaaaatc aaggctaaattacatcagatgttgaaaaacaacatggcatcctggaatgcagttcctgct tccattaaactcatggaaatcagcaaggctgcttctgaaatgcttaccaacaactgttgt gggagacaagtagccaacagtatcataactggcaacaaaattgtgaatggaaaaagctcc ctggagggggcatggccatggcaggccagcatgcaatggaaaggccgtcactactgtgga gcctctctgatcagcagcaggtggctattatctgcagctcactgctttgctaagaaaaat aattcaaaagattggactgtcaactttggaattgtagtaaataaaccatatatgacacgg aaagtccaaaacattatttttcatgaaaattatagcagtcctgggcttcatgatgatatt gcccttgtgcagcttgctgaagaagtttcttttacagagtacattcgtaagatttgtctt cctgaagccaaaatgaagctctcagaaaatgacaatgttgtagttacaggttggggaaca ctttatatgaatggttcatttccagtgatacttcaagaagactttttgaagattattgac aacaaaatttgcaatgcctcatatgcatactctggctttgtgactgatacaatgttatgt gctggatttatgtcaggagaagctgatgcatgtcagaatgattctggtggaccactagct taccctgattccagaaatatctggcatcttgttggaatagtaagctggggtgatggatgt ggtaaaaagaataagccaggtgtctatactcgagtgacttcttatcgcaattggattaca tccaagactggactctga >gi568815594r:68214243_68438386|GENSCAN_predicted_peptide_3|358_aa MLESASNPYQLGFNNCPVHDAQCKLSVDLPFWGLEDSGLLLTPPLDSAPVGTLCEGSNPM FPLCIVLVEVLYEASTTATDLCMDIQLLVAEEASGNLQSRWKVKKKQAPSSQGSKREKRR GGELPNTLKPSDLISEEEAPKPTGEVSEGYVKKNILWEYSGYTEKFFKDVVLNKKLMTNL QESRSDVVHANAIGPFGELLAELLKISFVYSLHFSPGYTFEKYSGGFLLPPSYGAVILSE LSGSMTFMETENALQLSEIMGKAEMWLIRNYWYLEFPRPLLPNFEFVVRLYCKPVNPLPK FGSTFPDTNFYVAHNSVGTVRCLLQAGSLTLPHTSIDPRELSTVKAHQPSWDFEELLP >gi568815594r:68214243_68438386|GENSCAN_predicted_CDS_3|1077_bp atgttggaatcagctagcaacccatatcagcttggtttcaacaattgcccagttcatgat gcacaatgcaagctgtcagtggatctaccattctggggtctagaggacagtggccttctt ctcacacctccactagacagtgccccagtggggaccctgtgtgagggttccaatcccatg tttcccctctgcattgtattagtagaggttctctatgaggcctccaccactgcaacggac ttatgcatggacatccagttactcgtggctgaggaagcctcaggaaacttacaatcccgg tggaaggtgaagaagaagcaagcaccttcttcacaaggcagcaagagagagaaaaggaga gggggtgaactgccaaacactttaaaaccatcagatcttatctcagaggaagaagcacct aagcccactggggaggtaagtgaaggctacgtgaaaaagaatattctttgggaatattct ggttatactgagaagttctttaaagatgtagttttgaacaagaaacttatgacaaaccta caagaatcaaggtctgatgtcgttcatgcaaatgccattggtccctttggagagctgctg gctgagctattaaaaatatcctttgtgtacagtctccacttctctcctggctacacattt gagaaatacagtggaggatttctacttccaccttcctatggagctgttattctgtcagaa ttaagtggttcgatgacattcatggagacagaaaatgcactacaattatctgagataatg ggaaaagctgaaatgtggctcattcgaaactactggtatttggaatttcctcgcccactc ttacctaattttgaatttgttgtaagactctactgcaaacctgtcaaccccctgcctaag tttgggagtacgtttcctgacacaaatttctatgtggctcacaatagcgtcggaactgtg agatgcctacttcaagctggctccttgactcttccacacacttcgattgaccctcgggaa ctgagtacagtgaaagctcatcaaccttcatgggattttgaggagcttttaccttga >gi568815594r:68214243_68438386|GENSCAN_predicted_peptide_4|812_aa MERQCLKYAALGTLGTAHRTAGAAAFLAGGAFALFALAGAGAGAGVTGGTGAVGIHGELK RRLESGQVVHGLAGTWRRRGCHYTHFRPVSLKVSRRRFDMQTKVRYGLTDGPVLLPRLRR PRESRLSGGGGGSGGNRRGAMAADSREEKDGELNVLDDILTEVPEQDDELYNPESEQDKN EKKGSKRKSDRMESTDTKRQKPSVHSRQLVSKPLSSSVSNNKRIVSTKGKSATEYKNEEY QRSERNKRLDADRKIRLSSSASREPYKNQPEKTCVRKRDPERRAKSPTPDGSERIGLEVD RRASRSSQSSKEEVNSEEYGSDHETGSSGSSDEQGNNTENEEEGVEEDVEEDEEVEEDAE EDEEVDEDGEEEEEEEEEEEEEEEEEEEEYEQDERDQKEEGNDYDTRSEASDSGSESVSF TDGSVRSGSGTDGSVDWLRQGRQEGWGVRYKMEMKTINKILVEKSDEKKKERKRARGISP IVFDRSGSSASESYAGSEKKHEKLSSSVRAVRKDQTSKLKYVLQDARFFLIKSNNHENVS LAKAKGVWSTLPVNEKKLNLAFRSARSVILIFSVRESGKFQGPYCTLKYCTQEAHYLSDL YVAYTNVLPVKLILVKKCLKGDGYFEIELECGTQLCLLFPPDESIDLYQVIHKMRHKRRM HSQPRSRGRPSRREPVRDVGRVFKGSTIPGSGQPPYPGMEQPPHHPYYQHHAPPPQAHPP YSGHHPVPHEARYRDKRVHDYDMRVDDFLRRTQAVVSGRRSRPRERDRERERDRPRDNRR DRERDRGRDRERERERLCDRDRDRGERGRYRR >gi568815594r:68214243_68438386|GENSCAN_predicted_CDS_4|2439_bp atggagcgacaatgtctgaaatatgcagccctgggcacacttggcacagcccacaggaca gcaggagcagcagcttttcttgcaggaggtgcatttgcactctttgcacttgcaggagcc ggcgcaggtgcaggagtcactggcggcacaggagcagttgggatccatggcgagctgaag aggcggctggagtcgggacaggttgtacacgggctcgctgggacttggaggaggcgtgga tgccactacacccacttcagacccgtgagcctcaaagtgtctaggaggcggtttgatatg cagactaaagtgagatacggactgacggacgggcccgtgcttctgccgcggctgcggcgc ccgcgcgagtcgcgtctaagcggcggcggcggtggcagcggcggaaaccgaaggggagcc atggcggctgacagtcgggaggagaaagatggagaacttaatgttctggatgatatttta actgaagtaccagaacaagatgatgaactgtataatccagagagtgaacaagataaaaat gagaaaaagggatcaaaaagaaaaagtgatcgaatggaatctactgataccaaacgacaa aagccttctgtccattctagacaactggtttctaagccactgagctcatctgttagcaat aacaaaagaatagttagtacaaaaggaaagtcagccacagagtataaaaatgaggaatat caaagatctgaaagaaacaagcgtctagatgctgatcggaaaattcgtctatcaagtagt gcctccagagaaccttataagaatcaacctgaaaaaacctgtgtccggaaaagggatcct gaaaggagggccaaatctcctacgccagatggttctgagagaattgggcttgaagtggat agacgtgcaagcagatccagccagtcttctaaggaagaagtgaactctgaagaatatggc tctgaccatgagactggcagcagtggttcttctgatgagcaagggaacaacactgagaat gaggaggaaggagtggaagaagatgtggaggaagatgaagaagtagaagaagatgcagaa gaagatgaagaggtggatgaagatggagaggaggaggaggaagaggaggaggaggaagag gaggaggaggaggaggaagaagaagaatatgaacaggatgagagagaccagaaagaggag ggaaatgattatgacactcgaagtgaggccagtgactctggttctgaatctgtttccttc acagatgggtctgtcagatctggttcaggcacagatggatcagttgactggctgagacag ggtcgacaagagggctggggtgtaaggtataagatggagatgaagactataaataaaatt cttgtggaaaaatcagatgagaaaaagaaggaaaggaagagagctagaggcatatctcca attgtttttgatagaagtggaagctctgcatcagagtcatatgcaggttcagaaaagaag catgagaaattatcatcttccgttcgtgctgtccgaaaagatcaaaccagtaaactcaaa tatgtgcttcaagatgcaagatttttcctcataaagagtaacaaccatgagaatgtgtct cttgccaaagcgaagggtgtatggtccacgctccctgtaaatgagaagaaattaaatctt gcatttagatctgcaaggagtgttatcttaatattttctgtcagagagagtggaaaattt caaggtccttactgcacactgaagtattgtacacaagaagctcattatttaagtgatctt tacgtggcatacacaaatgtattacctgtaaaattaattttagttaaaaagtgtctgaaa ggagatggttactttgaaattgaacttgaatgtggaacccagctttgtcttctgtttccc cccgatgaaagtattgacttgtatcaggtcattcataaaatgcgtcacaagagaagaatg cattctcagccccgatcacgaggacgtccatcccgtcgagaaccagtccgggatgtggga agggtatttaaaggatccacgataccaggaagtggacagcccccttacccaggaatggaa caacctccacaccatccttactatcagcaccatgctccacctcctcaagctcatccccct tactcaggacatcatccagtaccacatgaagcaagatacagagataaacgagtacatgat tatgatatgagggtggatgatttccttcgtcgcacacaagctgttgtcagtggccggaga agtagaccccgtgaaagagaccgggaacgagagcgagaccgccctagagataacagacga gacagagagcgagatagaggacgtgatagagaaagagaaagagagcgattatgtgatcga gacagagaccgaggggagagaggtcgatatagaagataa >gi568815594r:68214243_68438386|GENSCAN_predicted_peptide_5|102_aa MPFPEVTEHPSPAKVSSLLQEMTLPGSYAKSILFLIGFDQLYPTRSNPKEVPNYGTRPLK WLNSQKNDDNNNSNNNYKNGGVVAEKYTSKRKKKRKGFDFDY >gi568815594r:68214243_68438386|GENSCAN_predicted_CDS_5|309_bp atgcctttccctgaagtgacagaacatccttcacctgccaaggtcagcagtttgctacag gagatgacacttccaggatcatatgccaagtctattctttttctcattgggtttgatcaa ctctatccaactagatcaaatccaaaggaagttccaaattatggaacaaggcctctgaag tggctaaattcccagaaaaacgacgacaacaacaacagcaacaacaactacaaaaacggt ggtgtggtagcagagaaatataccagcaaaaggaaaaaaaagaggaaaggctttgatttt gactactaa >gi568815594r:68214243_68438386|GENSCAN_predicted_peptide_6|321_aa MENDFDELKEDGFRRLVITNFSKLQEDVQTHCKEAKNLEKRLDKWLTRINSVEKSLKDLM ELKTMAREVCDACTSFSSRFDQLEEGVSVIEDQMNEMQTKDNNHMIISIDAEKAFDKIQQ PFMLKTLNKLAISNFSKFSGYKFNVQKSQGFLYTNNRQTERQVMSELPFTIASKTIKYQG IQLTRDVKDLFKENYKPLFNEIKEDTNKWKKIPCSWIGRINILKMAILPKVIYRFNAIPI KLPMTFFTELEKPTLKFMWNQKRAHIAKSVLSQKNKAGGIILPDFKLYYKATVTKTAWYW YQNRDIDQWNRIEPPEIIPHI >gi568815594r:68214243_68438386|GENSCAN_predicted_CDS_6|966_bp atggagaatgactttgacgagttgaaagaagacggcttcagacgattggtaataacaaac ttctccaagctacaggaggatgttcaaacccattgcaaagaagctaaaaaccttgaaaaa agactagacaaatggctaactagaataaacagtgtagagaagtccttaaaggacctgatg gagctgaaaaccatggcacgagaagtatgtgacgcatgcacaagcttcagtagccgattc gatcaactggaagaaggggtatcagtgattgaagatcaaatgaatgaaatgcaaaccaaa gacaacaaccacatgattatctcaatagatgcagaaaaggcctttgacaaaattcaacag cccttcatgctaaaaactctcaataaactagcgataagcaacttcagcaaattctcgggt tacaaattcaatgtgcaaaaatcacaaggattcctatacaccaataatagacaaacagag aggcaagtcatgagtgaactcccattcacaattgcttcaaagacaataaaataccaagga atccaacttacaagggatgtgaaggacctcttcaaggagaactacaaaccactgttcaac gaaataaaagaggacacaaacaaatggaagaaaattccatgctcatggataggaagaatc aatattttgaaaatggccatactgcccaaggtaatttatagattcaatgccatccccatc aagctaccaatgactttcttcacagaattggaaaaacctactttaaagttcatgtggaac caaaaaagagcccacattgccaagtcagtcctaagccaaaagaacaaagctggaggcatc atactacctgacttcaaactatactacaaggctacagtaaccaaaacagcatggtactgg taccaaaacagagatatagaccaatggaacagaatagagcccccggaaataataccacac atctag