GENSCAN 1.0 Date run: 4-Nov-116 Time: 04:43:06 Sequence gi568815575r:52847950_53048588 : 200639 bp : 45.17% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 PlyA - 18 13 6 -0.45 1.01 Sngl - 415 83 333 1 0 81 43 217 0.502 12.52 1.00 Prom - 4457 4418 40 -1.96 2.00 Prom + 7135 7174 40 -2.06 2.01 Sngl + 10940 11836 897 1 0 82 47 263 0.570 17.77 2.02 PlyA + 12683 12688 6 1.05 3.00 Prom + 17663 17702 40 -1.86 3.01 Sngl + 30609 31679 1071 2 0 69 32 480 0.463 37.64 3.02 PlyA + 31874 31879 6 1.05 4.00 Prom + 33388 33427 40 -2.46 4.01 Init + 46572 46734 163 2 1 62 9 148 0.353 4.19 4.02 Intr + 49160 49275 116 0 2 109 60 123 0.416 11.77 4.03 Term + 59160 59918 759 2 0 61 53 317 0.112 18.93 4.04 PlyA + 60587 60592 6 1.05 5.06 PlyA - 60710 60705 6 1.05 5.05 Term - 62026 61917 110 2 2 54 42 139 0.233 4.57 5.04 Intr - 68576 68446 131 1 2 109 72 66 0.249 7.44 5.03 Intr - 79206 79118 89 0 2 35 19 91 0.001 -4.33 5.02 Intr - 87020 86840 181 1 1 94 63 71 0.740 5.07 5.01 Init - 87433 87105 329 1 2 66 84 145 0.501 6.70 5.00 Prom - 89394 89355 40 -1.36 6.00 Prom + 89905 89944 40 -6.66 6.01 Init + 90337 90353 17 0 2 30 83 15 0.046 -4.96 6.02 Intr + 91340 91470 131 2 2 109 72 66 0.065 7.44 6.03 Term + 97890 97999 110 1 2 54 42 139 0.115 4.57 6.04 PlyA + 99206 99211 6 1.05 7.04 PlyA - 99329 99324 6 1.05 7.03 Term - 100756 99998 759 1 0 61 53 317 0.112 18.93 7.02 Intr - 110756 110641 116 0 2 109 60 123 0.416 11.77 7.01 Init - 113344 113182 163 1 1 62 9 148 0.353 4.19 7.00 Prom - 113871 113832 40 -6.46 8.00 Prom + 121296 121335 40 -4.46 8.01 Sngl + 123509 124006 498 1 0 60 48 168 0.751 6.05 8.02 PlyA + 124050 124055 6 1.05 9.00 Prom + 124790 124829 40 -2.46 9.01 Init + 124914 125046 133 2 1 78 63 89 0.836 5.70 9.02 Intr + 141554 141608 55 0 1 36 82 72 0.109 -0.56 9.03 Intr + 142807 143113 307 1 1 42 89 164 0.956 8.35 9.04 Intr + 143681 143782 102 1 0 117 110 -2 0.898 5.27 9.05 Intr + 146125 146154 30 0 0 34 116 49 0.564 0.63 9.06 Intr + 147944 148042 99 1 0 36 76 80 0.726 1.91 9.07 Intr + 152606 152677 72 1 0 39 47 146 0.646 5.20 9.08 Intr + 171956 172097 142 1 1 9 63 208 0.749 10.33 9.09 Term + 179585 179769 185 0 2 81 49 137 0.987 6.81 9.10 PlyA + 179780 179785 6 1.05 10.02 PlyA - 181858 181853 6 1.05 10.01 Term - 196744 196588 157 0 1 75 43 115 0.642 3.11 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 86577 86509 69 0 0 69 42 59 0.805 -2.66 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575r:52847950_53048588|GENSCAN_predicted_peptide_1|110_aa MAQELRDTCTSFSSRFDQVEERVMVIEDQINEMKQEEKFREKRVKRNKQSLQEIWDYVKR PNLRLIGVPESDGENGTKLENTLQDIFQENFPNLARQASIQIQEIQRTPQ >gi568815575r:52847950_53048588|GENSCAN_predicted_CDS_1|333_bp atggcacaagaactacgtgacacatgcacaagcttcagtagccgattcgatcaagtggaa gaaagggtaatggtgattgaagatcaaattaatgaaatgaagcaagaagagaagtttaga gaaaaaagagtaaaaagaaacaaacaaagcctccaagaaatatgggactatgtgaaaaga ccaaatctacgtctgattggtgtacctgaaagtgacggggagaatggaaccaagctggaa aacactcttcaggatattttccaggaaaacttccccaacctagcaaggcaggccagcatt caaattcaggaaatacagagaacaccacaatga >gi568815575r:52847950_53048588|GENSCAN_predicted_peptide_2|298_aa MKDLFKENYKPLLNEIKEDTNKWKNTPHSWVGTINIVKMAILPKVIYRFNAIPIKLPMTF FTELEKTTLKFIWNQKRAHIAKSILSQKNKARDIKLPDFKLYYKDTVTKTAWYWYQNRAI DQGNRTEPSEIMPHIYNHLIFDKPDKNKQWGKDSLFNKWCLENWLAICRKLKLEPFLPPY TKINSSWIKDLNVRPENIKTLEENLGNTIQDIGMGKDFMSKTLKAMATKAKIDKWDLIKL KSFCTAKETTIRVNRQPTEWEKIFAIYSSDKGLISRIYNELHQIYKKKTTPSKSGQRI >gi568815575r:52847950_53048588|GENSCAN_predicted_CDS_2|897_bp atgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaagaggataca aacaaatggaagaacactccacactcatgggtaggaacaatcaatattgtgaaaatggcc atactgcccaaggtaatttatagattcaatgccatccccatcaagctaccaatgactttc ttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcccacatc gccaagtcaatcctaagccaaaagaacaaagctagagacatcaagctacctgacttcaaa ctatactacaaggatacagtaaccaaaacagcatggtactggtaccaaaacagagctata gatcaagggaacagaacagagccctcagaaataatgccgcatatctataaccatctgatc tttgacaaacctgacaaaaacaagcaatggggaaaggattccctatttaataaatggtgc ttggaaaactggctagccatatgcagaaagctgaaactggaacccttccttccaccttat acaaaaattaattcaagctggattaaagacttaaatgttagacctgaaaacataaaaact ctggaagaaaacctaggcaataccattcaggacataggcatgggcaaggacttcatgtct aaaacactaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaattaaactc aagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcagcctacagaatgg gagaaaatttttgcaatctactcatctgacaaagggctaatatccagaatctacaatgaa ctccatcaaatttacaagaaaaaaacaaccccatcaaaaagtgggcaaaggatatga >gi568815575r:52847950_53048588|GENSCAN_predicted_peptide_3|356_aa MKAEIKMFFETNENKDTTYQNLWDTFKAVCRGKLIALNAHKRKQERSKIDTLTSQLKELE KQEQTHSKASRRQEITKIRAELKEIETQKTLQKINESRSWFFEKINKIDRPLARIIKKKR EKNQIDAIKNDKGDITIDPTEIQTTIREYYKHLYANKLENLEEMDKFLDTYTLARLNQEE VESLNRPITGPEIEAIINSLPTKKSPGPDGFTAEFYQRHKEELLPFLLKLFQSIEKERTL PNAFCEASIILMLIPKAWQRHNKKKENFRPISLMNIDAKVLNKILANRIQQHIQKLIHHD QVGFIPGMQGWFNIHKSINVIQHINRTKHKNHMIISIDVEKAFDEIHQPFMLKTLN >gi568815575r:52847950_53048588|GENSCAN_predicted_CDS_3|1071_bp atgaaggcagaaataaagatgttctttgaaaccaacgagaacaaagacacaacataccag aatctatgggacacattcaaagcagtgtgtagagggaaattgatagcactaaatgcccac aagagaaagcaggaaagatctaaaattgacaccctaacatcacaattaaaagaactagag aagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaaaatcagagca gaactgaaggaaatagagacacaaaaaacccttcaaaaaatcaatgaatccaggagctgg ttttttgaaaagatcaacaaaattgatagaccgctagcaagaataataaagaagaaaaga gagaagaatcaaatagacgcaataaaaaatgataaaggggatatcaccatcgatcccaca gaaatacaaactaccatcagagaatactataaacacctctatgcaaataaactagaaaac ctagaagaaatggataaattcctcgacacatacaccctcgcaagactaaaccaggaagaa gttgaatctctgaatagaccaataacaggccctgaaattgaggcaataattaatagctta ccgaccaaaaaaagtccaggaccagatggattcacagctgaattctaccagaggcataag gaggagctgctaccattccttctgaaactattccaatcaatagaaaaagagagaaccctc cctaacgcattttgtgaggccagcatcatcctgatgctgataccaaaagcctggcagaga cacaacaaaaaaaaagagaattttagaccaatatccctgatgaacatcgatgcaaaagtc ctcaataaaatactggcaaaccgaatccagcagcacatccaaaagcttatccaccatgat caagtgggcttcatccctgggatgcaaggctggttcaacatacacaaatcaataaacgta atccagcatataaacagaaccaaacacaaaaaccatatgattatctcgatagatgtagaa aaggcctttgacgaaattcaccagcccttcatgctaaaaactctcaattaa >gi568815575r:52847950_53048588|GENSCAN_predicted_peptide_4|345_aa MTRLAVKKGAPLTGTENAWEGVKTGSKMSYELLVVIEEGDEGDWTRELPVVMERIHPSGV GREKPPNRSVFRLRRSAAANQRFSFSAASHPCDSGFCPHTFLATFLVSENLQLHLLFWTR GPGGASSWDQTSMDPLQKRNPASPSKSSPMTAAETSQEGPAPSQPSYSEQPMMGLSNLSP GPGPSQAVPLPEGLLRQRYREEKTLEERRWERLEFLQRKKAFLRHVRRRHRDHMAPYAVG REARISPLGDRSQNRFRCECRYCQSHRPNLSGIPGESNRAPHPSSWETLVQGLSGLTLSL GTNQPGPLPEAALQPQETEEKRQRERQQESKIMFQRLLKQWLEEN >gi568815575r:52847950_53048588|GENSCAN_predicted_CDS_4|1038_bp atgaccagattagcagtcaagaaaggtgccccactcactgggacagagaatgcgtgggaa ggggtcaagactggcagcaagatgagctatgaattgttggtggtcatcgaggagggagat gagggtgactggactagggaacttcctgtggtgatggagaggatccacccgagcggcgtg ggtcgcgagaagccacccaaccgctccgtcttccggctccggcggtccgcggcagccaat caaaggttcagtttctcagcggcgtcgcatccctgtgactctgggttctgcccgcacaca ttcctcgccaccttcttggtgtctgagaacctccagcttcacctccttttctggacccga gggcctggcggagcttccagctgggaccagacctccatggatccactccagaaacggaat ccagcatcgccttccaaatcttccccgatgacagctgcagagacttcccaggaaggtcca gcgccctctcagccttcgtactcagaacagccgatgatgggcctcagtaacctgagcccc ggtcctggccccagccaggccgtgcctctcccagaggggctgctccgccagcggtacaga gaggagaagaccctggaagagcggcggtgggagaggctggagttccttcagaggaagaaa gcattcctgcggcatgtgaggaggagacaccgcgatcacatggccccctatgctgttggg agggaagccagaatctccccattaggtgacagaagtcagaatcgattccgatgtgaatgt cgatactgccagagccacaggccgaatctttctgggatccctggggagagtaacagggcc ccacatccctcctcctgggagacgctggtgcagggcctcagtggcttgactctcagccta ggcaccaaccagcccgggcctctgcctgaagcggcactccagccacaggagacagaggag aagcgccagcgagagaggcagcaggagagcaaaataatgtttcagaggctgctcaagcag tggttagaggaaaactga >gi568815575r:52847950_53048588|GENSCAN_predicted_peptide_5|279_aa MALGLAQVSNAGRLGSRVPQPSAPNRREGSLRRTPMRRADLPGLRRAGPELVNPRPAPPS GLRISGGAGTFCPDAMVPRAELVGVSGAAGTYCAVAVVARAGLGEAFVQGGSLRAWCRKV VEKMEFPLWLVAFGLFRRQVAVLHCGVALLAFRPSLRERKNAAHRFPMEEPFSPGKQQEL SEEAKRLGMVLDVFAPNRDRGYHPGRHPETAPPPPPKESLRSFAMTSFLDKDLQFPTAPG LWPVDVAQKGDRQGEKMETLMIAEMMTVTGSMDLVCLLR >gi568815575r:52847950_53048588|GENSCAN_predicted_CDS_5|840_bp atggctctgggactggcacaggtctcgaacgcggggcggctgggcagtcgtgtccctcag ccgtcagctcctaaccgcagggaaggaagcctgcggcgaaccccgatgagacgagccgac ctccccgggctccgacgcgcaggcccagagctcgtcaatcctaggccagccccgccctcc ggactgcgtatttccggcggcgcaggaactttctgtccggatgctatggttccgagggcg gaactagtgggggtgtctggcgccgcaggaacttattgcgcagttgctgtggttgcgagg gcgggactaggggaggcttttgtgcagggtggatctctcagggcttggtgccgaaaagta gtggagaagatggaatttcctctgtggttggtggctttcggtttatttcgccggcaggtg gcagtgttgcactgtggagtggccctgctcgccttccgcccctcccttcgcgagcgaaaa aacgctgcccatcgttttccgatggaagagccattctctccaggaaaacaacaggaactg tctgaagaagctaaaagacttggaatggttttagatgtgttcgcccccaacagggacagg gggtaccacccaggtagacacccggagactgccccccccccaccgcccaaagagagttta aggagctttgcaatgacgagtttccttgacaaagaccttcagttcccgacggcacctggg ctgtggccagtcgacgtagcccagaagggagacaggcaaggtgagaaaatggaaaccctg atgattgctgaaatgatgactgtcactggcagcatggacctggtgtgcttgctgcgctga >gi568815575r:52847950_53048588|GENSCAN_predicted_peptide_6|85_aa MSVSPRGYHPGRHPETAPPPPPKESLRSFAMTSFLDKDLQFPTAPGLWPVDVAQKGDRQG EKMETLMIAEMMTVTGSMDLVCLLR >gi568815575r:52847950_53048588|GENSCAN_predicted_CDS_6|258_bp atgagcgtgtcaccgagggggtaccacccaggtagacacccggagactgcccccccccca ccgcccaaagagagtttaaggagctttgcaatgacgagtttccttgacaaagaccttcag ttcccgacggcacctgggctgtggccagtcgacgtagcccagaagggagacaggcaaggt gagaaaatggaaaccctgatgattgctgaaatgatgactgtcactggcagcatggacctg gtgtgcttgctgcgctga >gi568815575r:52847950_53048588|GENSCAN_predicted_peptide_7|345_aa MTRLAVKKGAPLTGTENAWEGVKTGSKMSYELLVVIEEGDEGDWTRELPVVMERIHPSGV GREKPPNRSVFRLRRSAAANQRFSFSAASHPCDSGFCPHTFLATFLVSENLQLHLLFWTR GPGGASSWDQTSMDPLQKRNPASPSKSSPMTAAETSQEGPAPSQPSYSEQPMMGLSNLSP GPGPSQAVPLPEGLLRQRYREEKTLEERRWERLEFLQRKKAFLRHVRRRHRDHMAPYAVG REARISPLGDRSQNRFRCECRYCQSHRPNLSGIPGESNRAPHPSSWETLVQGLSGLTLSL GTNQPGPLPEAALQPQETEEKRQRERQQESKIMFQRLLKQWLEEN >gi568815575r:52847950_53048588|GENSCAN_predicted_CDS_7|1038_bp atgaccagattagcagtcaagaaaggtgccccactcactgggacagagaatgcgtgggaa ggggtcaagactggcagcaagatgagctatgaattgttggtggtcatcgaggagggagat gagggtgactggactagggaacttcctgtggtgatggagaggatccacccgagcggcgtg ggtcgcgagaagccacccaaccgctccgtcttccggctccggcggtccgcggcagccaat caaaggttcagtttctcagcggcgtcgcatccctgtgactctgggttctgcccgcacaca ttcctcgccaccttcttggtgtctgagaacctccagcttcacctccttttctggacccga gggcctggcggagcttccagctgggaccagacctccatggatccactccagaaacggaat ccagcatcgccttccaaatcttccccgatgacagctgcagagacttcccaggaaggtcca gcgccctctcagccttcgtactcagaacagccgatgatgggcctcagtaacctgagcccc ggtcctggccccagccaggccgtgcctctcccagaggggctgctccgccagcggtacaga gaggagaagaccctggaagagcggcggtgggagaggctggagttccttcagaggaagaaa gcattcctgcggcatgtgaggaggagacaccgcgatcacatggccccctatgctgttggg agggaagccagaatctccccattaggtgacagaagtcagaatcgattccgatgtgaatgt cgatactgccagagccacaggccgaatctttctgggatccctggggagagtaacagggcc ccacatccctcctcctgggagacgctggtgcagggcctcagtggcttgactctcagccta ggcaccaaccagcccgggcctctgcctgaagcggcactccagccacaggagacagaggag aagcgccagcgagagaggcagcaggagagcaaaataatgtttcagaggctgctcaagcag tggttagaggaaaactga >gi568815575r:52847950_53048588|GENSCAN_predicted_peptide_8|165_aa MSELPSTIASKRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWIGRVNIV KMAILPKVIYRINAIPIKLPMTFFTELEKTTLKFIWNQKRACIAKTTLSQKNKSGGITLP DFKLYILQGYSNQNSMVLVPKQRYRPMEQNRALRNNTTHLQPSDL >gi568815575r:52847950_53048588|GENSCAN_predicted_CDS_8|498_bp atgagtgaactcccatccacaattgcttcaaagagaataaaatatctaggaatccaactt acaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaacgaaataaaa gaggacacaaacaaatggaagaacattccatgctcatggataggaagagtcaatatcgtg aaaatggccatactgcccaaggtaatttatagaatcaatgccatccccatcaagctacca atgactttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagg gcctgcattgccaagacaaccctaagccaaaagaacaaatctggaggcatcacgctacct gacttcaaactatatatactacaaggctatagtaaccaaaacagcatggtactggtacca aaacagagatatagaccaatggaacagaacagagccctcagaaataataccacacatcta caaccatctgatctttga >gi568815575r:52847950_53048588|GENSCAN_predicted_peptide_9|374_aa MEYYAAIKKDEFMSFVGTWMKLETIILRKLSQGQKTKHRMFSLIGEETECRRGQQLGLQL HSRVGDRVRLSRKEKKRKKRKEKKRKEKKRKEKKRKDAGPGQKEEEEEEDQEGLGKGLCS ELGLMRIQKALTCEWVIEVELTVTELGEKSMIHVRWSASVWYRITGGHSLSWRLRSICLP LWPQYGDNDSYHQGLPRKRFLDRDLTAKKDRTVPSRAPLGGANKEQLVDADTHGPFRAGP QRPLCLTVREREDPGTRLYECIPLYSSKTLSPKKIEEEEEEEEEEEEEEKEEEEEEEKES AELDLDAGIRLDFRPTLEPSEELLEPIALWAVPDEQLSIDQQGAAWFTDGSSQGNGNCLV WKAAALKPGHGKMD >gi568815575r:52847950_53048588|GENSCAN_predicted_CDS_9|1125_bp atggaatactatgcagccataaaaaaggatgagttcatgtcctttgtagggacatggatg aagctggaaaccatcattctccgcaaactatcacaaggacaaaaaaccaaacaccgcatg ttctcactcataggtgaggagactgagtgtcgtagagggcagcagctgggcctgcagctg cacagcagagtgggcgatagagtgagactgtcaagaaaagaaaagaaaagaaagaaaaga aaagaaaagaaaagaaaagaaaagaaaagaaaagaaaagaaaagaaaagatgctggccct ggtcagaaggaggaggaggaagaggaggaccaggaaggcctcggcaaaggactgtgctcg gagctggggctgatgagaatacagaaggccctcacctgtgaatgggtgatagaagtagag ttgacagtcacggagttgggtgagaaaagcatgattcatgttaggtggtcagccagtgtc tggtacagaatcacgggtgggcactcactctcttggaggctaaggtccatctgtctacct ctgtggccccagtatggggacaatgacagttaccatcaaggcttgccaaggaagaggttc ctggaccgagacctcacagccaaaaaggatagaactgttccatcccgcgcgccccttggt ggtgcaaacaaggaacagctagtggatgcagatacccatggaccattccgggctggtccc cagaggccactgtgcctgaccgtgcgcgagcgcgaagacccagggacccggctctacgag tgcatcccgctgtactctagcaaaaccctgtctccaaaaaaaatagaagaagaagaagaa gaagaggaagaggaagaggaagaagaaaaggaggaggaagaggaggaggaaaaagagagt gctgagttggaccttgatgctggaataagactagatttcagaccgactttagaaccatct gaggagctactggaaccaattgccctatgggcagtgcccgatgaacagctctcaattgac caacaaggagctgcttggtttacagatggcagttcccaggggaatggaaactgccttgtt tggaaagctgctgcattaaaaccaggacacggaaagatggattga >gi568815575r:52847950_53048588|GENSCAN_predicted_peptide_10|52_aa XDWKSLDAWSKGPQGASAGESDSETRIGWNPCMIIIFTWNCCNLEDINFAQA >gi568815575r:52847950_53048588|GENSCAN_predicted_CDS_10|159_bp nnggattggaaatctctggatgcctggtctaagggtccccagggtgcatcagcaggtgag tccgattccgagacaagaattggctggaatccttgcatgatcatcatctttacatggaac tgctgtaacctggaagatataaactttgcccaggcctaa