GENSCAN 1.0 Date run: 5-Nov-116 Time: 07:08:06 Sequence gi568815596r:171685277_171993260 : 307984 bp : 39.30% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 2165 2351 187 1 1 47 111 117 0.330 8.24 1.02 Intr + 2445 2597 153 1 0 46 94 64 0.772 1.92 1.03 Term + 2790 3004 215 1 2 77 42 141 0.959 4.81 1.04 PlyA + 3204 3209 6 -3.64 2.00 Prom + 3354 3393 40 -4.95 2.01 Init + 4880 4987 108 1 0 68 47 126 0.711 6.77 2.02 Intr + 7501 7618 118 0 1 26 87 107 0.599 3.52 2.03 Intr + 40342 40437 96 2 0 124 65 177 0.989 18.06 2.04 Intr + 40643 40805 163 0 1 67 57 124 0.999 5.31 2.05 Intr + 40918 41017 100 1 1 48 98 49 0.687 1.09 2.06 Intr + 41515 41640 126 0 0 19 80 110 0.619 3.26 2.07 Intr + 42545 42691 147 1 0 38 98 75 0.870 3.01 2.08 Intr + 43441 43574 134 0 2 51 97 148 0.990 10.52 2.09 Intr + 44433 44577 145 0 1 25 65 105 0.956 1.26 2.10 Intr + 58773 58913 141 2 0 89 83 88 0.991 8.03 2.11 Intr + 60526 60651 126 0 0 56 64 138 0.979 8.16 2.12 Term + 62500 62613 114 0 0 72 32 243 0.919 14.69 2.13 PlyA + 63114 63119 6 1.05 3.03 PlyA - 63718 63713 6 1.05 3.02 Term - 71978 71848 131 0 2 56 38 118 0.693 0.96 3.01 Init - 72807 72540 268 0 1 64 72 220 0.893 15.19 3.00 Prom - 81504 81465 40 -5.25 4.20 PlyA - 84276 84271 6 1.05 4.19 Term - 100199 99998 202 1 1 101 45 239 0.670 16.78 4.18 Intr - 102385 102295 91 2 1 105 95 86 0.982 9.13 4.17 Intr - 102671 102513 159 0 0 59 96 162 0.991 13.14 4.16 Intr - 106313 106175 139 2 1 64 93 91 0.995 6.32 4.15 Intr - 108491 108351 141 2 0 109 98 175 0.997 20.23 4.14 Intr - 124410 124330 81 0 0 76 75 103 0.516 6.72 4.13 Intr - 125000 124948 53 0 2 99 93 24 0.959 1.71 4.12 Intr - 128221 128063 159 2 0 76 84 112 0.877 8.64 4.11 Intr - 129926 129845 82 2 1 92 105 9 0.796 1.29 4.10 Intr - 141606 141522 85 2 1 51 80 60 0.410 0.40 4.09 Intr - 148780 148687 94 2 1 100 110 69 0.953 8.40 4.08 Intr - 149589 149451 139 0 1 95 90 84 0.993 8.42 4.07 Intr - 151991 151845 147 2 0 95 89 102 0.996 10.51 4.06 Intr - 159232 159093 140 2 2 96 99 96 0.999 10.76 4.05 Intr - 170673 170558 116 2 2 93 87 83 0.925 7.87 4.04 Intr - 183547 183405 143 1 2 55 106 184 0.963 15.13 4.03 Intr - 202379 202248 132 2 0 53 10 146 0.029 3.22 4.02 Intr - 207982 207929 54 1 0 130 99 14 0.083 4.96 4.01 Init - 208938 208927 12 0 0 91 57 31 0.804 -0.05 4.00 Prom - 217158 217119 40 -4.85 5.00 Prom + 219934 219973 40 -3.95 5.01 Sngl + 237141 237503 363 2 0 38 34 316 0.846 17.33 5.02 PlyA + 240663 240668 6 1.05 6.02 PlyA - 240787 240782 6 1.05 6.01 Sngl - 255604 255149 456 1 0 57 44 232 0.539 11.63 6.00 Prom - 257354 257315 40 -7.35 7.00 Prom + 258456 258495 40 -6.15 7.01 Init + 260581 260629 49 0 1 86 89 28 0.688 1.86 7.02 Intr + 261432 261507 76 1 1 53 80 41 0.665 -2.55 7.03 Intr + 267605 267725 121 2 1 90 94 85 0.833 8.88 7.04 Intr + 280062 280241 180 2 0 99 98 53 0.988 6.54 7.05 Intr + 280511 280632 122 1 2 122 84 88 0.999 10.17 7.06 Intr + 281133 281237 105 0 0 100 86 64 0.964 5.81 7.07 Intr + 281567 281673 107 2 2 41 91 75 0.880 2.04 7.08 Intr + 290881 291032 152 2 2 51 63 111 0.949 3.86 7.09 Intr + 293971 294087 117 0 0 86 97 139 0.999 14.34 7.10 Term + 297909 298076 168 2 0 54 43 192 0.997 8.20 7.11 PlyA + 298329 298334 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:171685277_171993260|GENSCAN_predicted_peptide_1|184_aa VFLVAGVCSSSRSCQFVRRGRKLDSWPENIRRSFPVFGKAEILKAPGLTGFFVLSTGFCF TTALAVAAAAHPSLCALESRGAGGGYSWCWGLRRGTLPGSCRREDSSPSSHLTEAAWGAL RLVRRPLASSGRPAVDRALLAVTPGEKVIRFRLAPFYKLRPSLLALIVSPLTLCVMESPR TRRF >gi568815596r:171685277_171993260|GENSCAN_predicted_CDS_1|555_bp gttttccttgttgccggggtttgtagttcttctcgatcgtgtcagtttgtaaggcgaggg cggaagttggattcctggcctgagaatattaggcgtagttttccagtttttggcaaagcg gaaatacttaaggcccctgggttgactgggttctttgttttatctaccggcttctgcttt acgacagctctcgccgtagcagccgccgcccatccctctttgtgtgctttggaaagccgc ggagctggtggtggctacagttggtgttgggggcttaggcgagggacgttaccgggaagt tgcaggcgggaggactcttccccatccagtcacctgacagaggcggcctggggagccttg aggcttgtccggcgcccactggcttcttccggacgccctgccgtggaccgcgccctcttg gccgtgaccccaggggagaaagtgattcgttttcgccttgccccattttacaaattaagg ccatctctacttgccttgatagtgtctcccctcaccttatgtgtgatggaaagcccccga acacggaggttctag >gi568815596r:171685277_171993260|GENSCAN_predicted_peptide_2|505_aa MSDKSELKAELERKKQRLAQIREEKKRKEEERKKKETDQKKEAVAPVQEESDLEKKRREA EALLQSMGLTPESPIDEEEDDDVVAPKPPIEPEEEKTLKKDEENDSKAPPHELTEEEKQQ ILHSEEFLSFFDHSTRIVERALSEQINIFFDYSGRDLEDKEGEIQAGAKLSLNRQFFDER WSKHRVVSCLDWSSQYPELLVASYNNNEDAPHEPDGVALVWNMKYKKTTPEYVFHCQSAV MSATFAKFHPNLVVGGTYSGQIVLWDNRSNKRTPVQRTPLSAAAHTDSMELVHKQSKAVA VTSMSFPVGDVNNFVVGSEEGSVYTACRHGSKAGISEMFEGHQGPITGIHCHAAVGAVDF SHLFVTSSFDWTVKLWTTKNNKPLYSFEDNADYVYDVMWSPTHPALFACVDGMGRLDLWN LNNDTEVPTASISVEGNPALNRVRWTHSGREIAVGDSEGQIVIYDVGEQIAVPRNDEWAR FGRTLAEINANRADAEEEAATRIPA >gi568815596r:171685277_171993260|GENSCAN_predicted_CDS_2|1518_bp atgtcagacaaaagtgaattaaaggctgagttggaacgtaagaagcagcgactggcccaa atcagagaggaaaagaagagaaaagaagaagaaaggaaaaaaaaagaaacagaccagaag aaggaagctgttgctcctgtgcaagaagaatcagatcttgaaaaaaaaaggagagaagct gaagcattgcttcaaagcatggggctaactccagaatcccccattgatgaagaggaagat gatgatgtagtggctcctaaaccacctattgaacctgaagaagagaaaactttaaagaaa gatgaggaaaatgatagtaaagctccccctcatgagctgactgaagaagaaaagcaacaa atcttgcactctgaggaatttttaagtttctttgaccattctacaagaattgtagaaaga gctctttctgagcagattaacatcttctttgactatagtgggagagatttggaagacaaa gaaggagagattcaagcaggtgctaaactgtcattaaatcgacaattttttgacgaacgt tggtcaaagcatcgggtggttagttgtttggattggtcatctcagtatccggagttactc gtggcttcctataacaacaatgaagatgcccctcatgagcctgatggtgtggcccttgta tggaatatgaaatacaaaaaaactaccccagagtatgtgtttcactgccagtcagctgtg atgtctgccacatttgcaaaatttcatccaaatcttgttgttggtggtacatattcaggc caaattgtgctttgggataaccgtagcaataaaagaactccagtgcaaagaactccactg tcagcagctgcacacacagatagcatggagttggttcataaacagtcaaaagcagtagct gtgacatctatgtccttccctgttggagatgtcaacaactttgttgttgggagtgaagaa ggttctgtgtacacagcatgccgccatggcagcaaagctggaatcagtgagatgtttgag gggcatcaaggaccaatcactggcatccattgtcatgcagctgttggagcagtagacttc tcacatctttttgtcacttcatcgtttgactggacagtaaagctttggacaactaagaat aacaagcctttgtattcatttgaagataatgcagactatgtttatgatgttatgtggtca cctacccacccagccctgtttgcctgtgtggatggcatggggagattggatttgtggaat ctcaataatgacacagaggtaccaactgccagcatttctgtggagggtaatcctgctctt aatcgtgtgagatggacccattctggcagagagattgctgtgggtgattctgaaggacag attgttatatacgatgtgggagagcagattgctgttccccgcaatgatgaatgggcacgg tttggccgaacacttgcagaaattaatgcaaaccgagctgatgcagaggaggaagcagct acccgaatacctgcttag >gi568815596r:171685277_171993260|GENSCAN_predicted_peptide_3|132_aa MRNDKGDTATDPTEIQTTIREYYKYLYANKLENLEEMAEFLDTYTLPSLNQEEGESLNRP ITSSEMKAVIVYRPKNVQDQTDSQPNSTRVLEVVARAIRQEKEIKGIQTGKEEELSLFAD DMIVYLENPISA >gi568815596r:171685277_171993260|GENSCAN_predicted_CDS_3|399_bp atgagaaatgacaaaggggatactgccactgatcccacagaaatacaaactaccatcaga gaatactataaatacctctatgcaaataaactagaaaatctagaagaaatggctgaattc ctggacacatacaccctcccaagtctaaaccaggaagaaggtgaatccctgaatagacca ataacaagttctgaaatgaaggcagtaatagtctaccgaccaaaaaacgtccaggatcag acagattcacagccgaattctaccagagtgttggaagttgtggccagggcaatcaggcaa gagaaagaaataaagggtattcagacaggaaaagaggaagaactgtctctgtttgcagat gacatgattgtatatttagaaaaccccatctcagcctaa >gi568815596r:171685277_171993260|GENSCAN_predicted_peptide_4|722_aa MAVKVQTTKRGDPHELRNIFLQLHQQCTVAKGDNRSSEEEGEVGQQEHKSDKFLNEELIL SVPCGKYASTEVDGERYMTPEDFVQRYLGLYNDPNSNPKIVQLLAGVADQTKDGLISYQE FLAFESVLCAPDSMFIVAFQLFDKSGNGEVTFENVKEIFGQTIIHHHIPFNWDCEFIRLH FGHNRKKHLNYTEFTQFLQELQLEHARQAFALKDKSKSGMISGLDFSDIMVTIRSHMLTP FVEENLVSAAGGSISHQVSFSYFNAFNSLLNNMELVRKIYSTLAGTRKDVEVTKEEFAQS AIRYGQVTPLEIDILYQLADLYNASGRLTLADIERIAPLAEGALPYNLAELQRQQSPGLG RPIWLQIAESAYRFTLGSVAGAVGATAVYPIDLVKTRMQNQRGSGSVVGELMYKNSFDCF KKVLRYEGFFGLYRGLIPQLIGVAPEKAIKLTVNDFVRDKFTRRDGSVPLPAEVLAGGCA GGSQVIFTNPLEIVKIRLQVAGEITTGPRVSALNVLRDLGIFGLYKGAKACFLRDIPFSA IYFPVYAHCKLLLADENGHVGGLNLLAAGAMAGVPAASLVTPADVIKTRLQVAARAGQTT YSGVIDCFRKILREEGPSAFWKGTAARVFRSSPQFGVTLVTYELLQRWFYIDFGGLKPAG SEPTPKSRIADLPPANPDHIGGYRLATATFAGIENKFGLYLPKFKSPSVAVVQPKAAVAA TQ >gi568815596r:171685277_171993260|GENSCAN_predicted_CDS_4|2169_bp atggcggtcaaggtgcagacaactaagcgaggggatcctcatgagttaagaaacatattt ctacagctacatcagcagtgtacagtagcaaaaggagacaatagaagcagtgaagaagaa ggtgaggtgggacagcaggaacataagtctgacaaatttctaaatgaagagttaattttg tctgtcccttgtggcaaatatgccagtactgaggttgatggagagcgttatatgacccca gaagactttgttcagcgctatcttggactgtataatgatccaaatagtaacccaaagatc gtgcagctcttggcaggagtagctgatcaaaccaaggatgggttgatctcctatcaagag tttttggcatttgaatctgttttatgtgctccagattccatgttcatagtggctttccag ttgtttgacaagagtggaaatggagaggtgacatttgaaaatgtcaaagaaatttttgga cagactattattcatcatcatatcccttttaactgggattgtgaatttatccgactgcat tttgggcataaccggaagaagcatcttaactacacagaattcacgcagtttctccaggag ctgcaattggaacatgcaagacaagcctttgcactcaaagacaaaagcaaaagtggcatg atttctggtctggatttcagtgacatcatggttaccattagatctcacatgcttactcct tttgtggaggagaacttagtttcagcagctggaggaagtatctcacaccaggttagcttc tcctacttcaatgcatttaactcgttactgaataacatggagcttgttcgtaagatatat agcactctagctggcacaaggaaagatgttgaagtcacaaaggaggaatttgcccagagt gccatacgctatggacaagtcacaccactagaaattgatattctatatcagcttgcagac ttatataatgcttcagggcgcttgactttggcagatattgagagaatagccccattggct gagggggccttaccttacaacctggcagaacttcagagacagcagtctcctgggttaggc aggcctatctggctccagattgccgagtctgcttacagattcactctgggctcagttgct ggagctgtgggagccactgcagtgtatcctatagatctggtgaagacccgaatgcaaaac cagcgtggctctggctctgttgttggggagctaatgtacaaaaacagctttgactgtttt aagaaagtcttgcgttatgagggcttctttggactctacaggggtctgataccacaactt ataggggttgctccagaaaaggccattaaactgactgttaatgattttgttcgggacaaa tttaccagaagagatggctctgttccacttccagcagaagttcttgctggaggctgtgct ggaggctctcaggtcatttttaccaacccattggagatagtgaagattcgtctgcaagta gctggagagatcaccacgggacccagagtcagcgccctgaatgtgctccgggacttggga atttttggtctgtataagggtgccaaagcgtgtttcctccgagacattcccttctctgca atctattttcctgtttatgctcattgcaaactacttctggctgatgaaaatggacacgtg ggaggtttaaatcttcttgcagctggagccatggcaggtgtcccagctgcatctctggtg acccctgctgatgtcatcaagacaagactgcaggtggctgcccgcgctggccagacgaca tacagtggtgtcatcgactgtttcaggaagattctccgggaagaagggccctcagcattt tggaaagggactgcagctcgagtgtttcgatcctctccccagtttggtgttaccttggtc acttatgaacttctccagcggtggttttacattgattttggaggcctcaaacccgctggt tcagaaccaacacctaagtcacgcattgcagaccttcctcctgccaaccctgatcacatc ggtggatacagactcgccacagccacgtttgcaggcatcgaaaacaaatttggcctttat ctcccgaaatttaagtctcctagtgttgctgtggttcagccaaaggcagcagtggcagcc actcagtga >gi568815596r:171685277_171993260|GENSCAN_predicted_peptide_5|120_aa MSDCAVTSGPGARGLIRPSSAAGDRSSEMAGKLPGKVYQGEEAARSGIERRDGGDNARLE LSGCSLGSRRPAWDRKRGTLSPAAPCWRRSAPPPPTAWGSAPSASNPAPTEGRRRQGKAS >gi568815596r:171685277_171993260|GENSCAN_predicted_CDS_5|363_bp atgtcagactgtgcggtcacttccggcccgggagcgcgcgggttgattcgtccttcctca gccgcgggtgatcgtagctcggaaatggcgggtaagttaccgggaaaagtttaccaaggg gaggaggcggccagatcggggatagaacgccgagacggtggtgacaatgcccgcctagaa ctttcgggctgtagcctgggttccagaaggcctgcttgggaccggaagcgcggaactctc agcccagcagctccttgttggcggcggtccgcgcccccgcccccaactgcttggggttct gcgccttctgccagcaatccggcccctaccgagggccgaagacgccagggaaaagcgtca tga >gi568815596r:171685277_171993260|GENSCAN_predicted_peptide_6|151_aa MRQQKTLRLPTKRKLHLKYQESYLNYGFTATGDSHSPSPLCIVHGDQLSNKATKPSKLLS YIETKHPALKDKPLEFSKEKNMNMKNRSNLLKATTSSNASALRAPFLMANQIAKTKKPFT TGEELILPAAKDICRELLGEARIQKVARVPP >gi568815596r:171685277_171993260|GENSCAN_predicted_CDS_6|456_bp atgagacagcagaagactctaagactgccaacaaaaagaaagctgcatttaaaataccaa gagtcctacttaaattatgggttcactgcaacaggtgattcacattctccaagccctctg tgtatagtacatggtgaccagctatccaacaaagccacaaaaccttcaaaactgcttagt tacatcgagaccaagcaccctgcattaaaagacaagcctttggagttttcaaaagaaaaa aacatgaacatgaagaacagaagcaatttattgaaggccaccacttcatcaaatgcgtct gcactgagagctccattcttaatggctaaccaaattgctaaaactaagaagccctttact actggtgaagagttaatcctgcctgctgctaaggacatttgtcgtgaacttttaggagag gctcgaattcaaaaggtggcgcgtgttcctccttag >gi568815596r:171685277_171993260|GENSCAN_predicted_peptide_7|398_aa MGFHHVGQAGLELLTSVRFPEDLENDIRTFFPEYTHQLFGDDETAFGYKGLKILLYYIAG SLSTMFRVEYASKVDENFDCVEADDVEGKIRQIIPPGFCTNTNDFLSLLEKEVDFKPFGT LLHTYSVLSPTGGENFTFQIYKADMTCRGFREYHERLQTFLMWFIETASFIDVDDERWHY FLVFEKYNKDGATLFATVGYMTVYNYYVYPDKTRPRVSQMLILTPFQGQGHGAQLLETVH RYYTEFPTVLDITAEDPSKSYVKLRDFVLVKLCQDLPCFSREKLMQGFNEDMVIEAQQKF KINKQHARRVYEILRLLVTDMSDAEQYRSYRLDIKRRLISPYKKKQRDLAKMRKCLRPEE LTNQMNQIEISMQHEQLEESFQELVEDYRRVIERLAQE >gi568815596r:171685277_171993260|GENSCAN_predicted_CDS_7|1197_bp atggggtttcaccatgttggccaggctggtcttgaactcctgacctcagttcgttttcct gaagatcttgaaaatgacattagaactttctttcctgagtatacccatcaactctttggg gatgatgaaactgcttttggttacaagggtctaaagatcctgttatactatattgctggt agcctgtcaacaatgttccgtgttgaatatgcatctaaagttgatgagaactttgactgt gtagaggcagatgatgttgagggcaaaattagacaaatcattccacctggattttgcaca aacacgaatgatttcctttctttactggaaaaggaagttgatttcaagccattcggaacc ttacttcatacctactcagttctcagtccaacaggaggagaaaactttacctttcagata tataaggctgacatgacatgtagaggctttcgagaatatcatgaaaggcttcagaccttt ttgatgtggtttattgaaactgctagctttattgacgtggatgatgaaagatggcactac tttctagtatttgagaagtataataaggatggagctacgctctttgcgaccgtaggctac atgacagtctataattactatgtgtacccagacaaaacccggccacgtgtaagtcagatg ctgattttgactccatttcaaggtcaaggccatggtgctcaacttcttgaaacagttcat agatactacactgaatttcctacagttcttgatattacagcggaagatccatccaaaagc tatgtgaaattacgagactttgtgcttgtgaagctttgtcaagatttgccctgtttttcc cgggaaaaattaatgcaaggattcaatgaagatatggtgatagaggcacaacagaagttc aaaataaataagcaacacgctagaagggtttatgaaattcttcgactactggtaactgac atgagtgatgccgaacaatacagaagctacagactggatattaaaagaagactaattagc ccatataagaaaaagcagagagatcttgctaagatgagaaaatgtctcagaccagaagaa ctgacaaaccagatgaaccaaatagaaataagcatgcaacatgaacagctggaagagagt tttcaggaactagtggaagattaccggcgtgttattgaacgacttgctcaagagtaa