GENSCAN 1.0	Date run: 5-Nov-116	Time: 22:45:10

Sequence gi568815586f:4710042_4911628 : 201587 bp : 43.79% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.00 Prom +    943    982   40                              -2.86
 1.01 Init +  10637  10847  211  1  1   60   80    89 0.325   2.80
 1.02 Intr +  16491  16788  298  1  1  109   75   277 0.834  24.43
 1.03 Intr +  29122  29288  167  1  2   56   89   165 0.713  13.00
 1.04 Intr +  34476  34659  184  1  1   14   79   220 0.999  12.55
 1.05 Intr +  35388  35585  198  0  0   71  100   285 0.999  26.47
 1.06 Intr +  36103  36217  115  1  1   73   59   162 0.880  12.25
 1.07 Intr +  50917  51102  186  0  0  114  109   173 0.962  21.89
 1.08 Intr +  53212  53349  138  0  0   47   36   117 0.879   3.06
 1.09 Intr +  53911  54006   96  0  0  117   70    93 0.932  10.61
 1.10 Intr +  55338  55505  168  2  0  130  117    28 0.998  10.04
 1.11 Term +  62404  62556  153  0  0  124   42   120 0.923   8.92
 1.12 PlyA +  64550  64555    6                               1.05

 2.00 Prom +  71207  71246   40                              -2.56
 2.01 Init +  82330  82384   55  0  1  107   64    54 0.822   6.35
 2.02 Intr +  85410  85451   42  1  0   87   59    54 0.442   0.61
 2.03 Intr +  93553  93677  125  2  2   77   31    77 0.286   1.20
 2.04 Intr +  93884  94095  212  1  2  119   81    28 0.537   2.91
 2.05 Term +  96551  96668  118  2  1   73   40    89 0.593   0.61
 2.06 PlyA +  96800  96805    6                               1.05

 3.00 Prom +  97924  97963   40                              -8.06
 3.01 Sngl + 100001 101590 1590  1  0   66   46  2490 0.990 237.97
 3.02 PlyA + 105536 105541    6                               1.05

 4.03 PlyA - 106606 106601    6                               1.05
 4.02 Term - 108336 108257   80  1  2   63   48    69 0.042  -1.67
 4.01 Init - 119313 119256   58  0  1   61  127    30 0.674   5.67
 4.00 Prom - 128808 128769   40                              -3.16

 5.06 PlyA - 129383 129378    6                               1.05
 5.05 Term - 135490 135358  133  0  1   90   38    95 0.222   2.26
 5.04 Intr - 137921 137797  125  2  2   88   66    36 0.144   0.78
 5.03 Intr - 146094 145834  261  0  0   64   60   106 0.586   3.08
 5.02 Intr - 146363 146258  106  1  1   50   96    29 0.677   0.12
 5.01 Init - 146848 146751   98  1  2   65  121    71 0.528   7.88
 5.00 Prom - 153734 153695   40                              -3.86

 6.06 PlyA - 155183 155178    6                               1.05
 6.05 Term - 166177 166166   12  1  0  121   39    12 0.334  -2.20
 6.04 Intr - 169588 169488  101  2  2   58   95    74 0.308   4.93
 6.03 Intr - 170620 170484  137  0  2   74  103    -2 0.609   0.11
 6.02 Intr - 171783 171646  138  2  0   64   94    98 0.920   7.58
 6.01 Init - 181634 181573   62  2  2   66  103    -5 0.136  -0.48
 6.00 Prom - 183423 183384   40                              -5.76

 7.00 Prom + 183498 183537   40                              -5.16
 7.01 Init + 201338 201573  236  1  2  104  -14   547 0.988  43.71

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586f:4710042_4911628|GENSCAN_predicted_peptide_1|637_aa
MMFWRKLPKALFIGLTLAIAVNLLLVFSSKGTLQNLFTGGLHRELPLHLNKRYGAVIKRL
SHLEVELQDLKESMKLALRQQENVNSTLKRAKDEVRPLLKAMETKVNETKKHKTQMKLFP
HSQLFRQWGEDLSEAQQKAAQDLFRKFGYNAYLSNQLPLNRTIPDTRDYRCLRKTYPSQL
PSLSVILIFVNEALSIIQRAITSIINRTPSRLLKEIILVDDFSSNGELKVHLDEKIKLYN
QKYPGLLKIIRHPERKGLAQARNTGWEAATADVVAILDAHIEVNVGWAEPILARIQEDRT
VIVSPVFDNIRFDTFKLDKYELAVDGFNWELWCRYDALPQAWIDLHDVTAPVKSPSIMGI
LAANRHFLGEIGSLDGGMLIYGGENVELSLRVWQCGGKVEILPCSRIAHLERHHKPYALD
LTAALKRNALRVAEIWMDEHKHMVYLAWNIPLQNSGIDFGDVSSRMALREKLKCKTFDWY
LKNVYPLLKPLHTIVGYGRMKNLLDENVCLDQGPVPGNTPIMYYCHEFSSQNVYYHLTGE
LYVGQLIAEASASDRCLTDPGKAEKPTLEPCSKAAKNRLHIYWDFKPGGAVINRDTKRCL
EMKKDLLGSHVLVLQTCSTQVWEIQHTVRDWGQTNSQ

>gi568815586f:4710042_4911628|GENSCAN_predicted_CDS_1|1914_bp
atgatgttttggaggaaactccccaaagccctcttcattgggctgactctggccattgct
gtcaatctccttctggtattttctagcaaggggactttacaaaacctgtttacgggtggt
ctccacagggagcttcctttacatctgaataaacgctacggggcagtgataaagagactc
tcccacttggaggtggaattgcaggatctgaaagaaagtatgaaattagctctgaggcaa
caagaaaatgtgaacagcacactgaagagggcgaaagatgaagtacgccctcttctaaag
gcaatggaaaccaaggtgaatgagacaaagaagcacaaaacccaaatgaaactcttccca
cactcacagcttttcaggcaatggggcgaggatctttctgaggcccagcagaaggcggcc
caggacctcttccggaagtttggttacaacgcgtacctcagcaaccagctgcctctcaat
cgcaccatccccgacacgcgagactacagatgtcttcggaagacatatccttcccaactc
ccatccctcagtgtcattctcatattcgtgaatgaagctctgtccattatacaacgggcc
atcaccagtatcatcaaccggacgccctctcgattgttgaaggaaatcatcttggtggat
gatttcagctcaaatggagaactaaaggtacacttggatgagaagattaagctttacaac
cagaagtatccaggactactgaaaataatacggcatcctgaaaggaaaggtcttgctcaa
gcccgcaacactggctgggaagctgccacagcagacgtggtcgccatcttggatgctcac
attgaagtcaatgttgggtgggcagagccaatcttggctcggattcaggaggaccgcact
gtgattgtgtctcctgtgtttgacaacattcgttttgacaccttcaaactggataagtat
gaactggcagttgatgggtttaactgggaactctggtgccgctacgatgcactgccacaa
gcctggattgatctgcatgatgtcactgccccagtgaagagtccttcaatcatgggcatc
ctggctgctaacaggcacttcctgggagagatcgggtctctggatggtggaatgctcatc
tatggaggagagaacgtggagcttagcctgagggtgtggcagtgtggagggaaggtcgag
attttgccctgttcccggattgcccacctagagagacaccacaagccctacgccttggat
ctcaccgctgccttgaagcgcaatgctctgcgagtggccgaaatctggatggatgagcac
aaacacatggtctacttggcctggaacatacctctccagaactctggaatagattttgga
gacgtttcttccagaatggcactccgggaaaaactgaaatgtaaaacttttgactggtac
ctgaaaaatgtttatccactcttgaagccactccacaccatcgtgggctatggaagaatg
aaaaacctattggatgaaaatgtctgcttggatcagggacccgttccaggcaacaccccc
atcatgtattactgccatgaattcagctcacagaatgtctactatcacctaactggggag
ctctatgtgggacaactgattgcagaggccagtgctagtgatcgctgcctgacagaccct
ggcaaggcggagaagcccaccttagaaccatgctccaaggcagctaagaatagactgcat
atatattgggattttaaaccgggaggagctgtcataaacagagataccaagcggtgtctg
gagatgaagaaggatcttttgggtagccacgtgcttgtgctccagacctgtagcacgcaa
gtgtgggaaatccagcacactgtcagagactggggtcagaccaacagccagtga

>gi568815586f:4710042_4911628|GENSCAN_predicted_peptide_2|183_aa
MAEAEEGREVSVNEDSEGAALAADWMVPAQIEGEAQEEGNGHSASRDEMNRRELQVKALC
YQVLAGNINEFVEPDSMLSSLSSRKSRRRLREGVNMTQTLGKGVNEGTREGSWARQNAVF
GQRSFHLEHAEEPKLEGRALPRVRWGTSLGTVKPFLGHDLMLDVKVITEHQLCLKGGRQP
SQA

>gi568815586f:4710042_4911628|GENSCAN_predicted_CDS_2|552_bp
atggctgaagctgaggaggggagagaagtgtcagtcaatgaggactcagagggagctgcg
ctggcagctgattggatggtgcccgcccagattgagggggaagcccaggaggagggaaat
ggccacagtgcatccagagacgaaatgaatagaagggaacttcaggtcaaagcgttgtgc
tatcaagtgttggcagggaacatcaatgagtttgtggaaccagattcaatgctgtcttct
ctctccagcagaaaaagtaggagaaggttaagagaaggagtaaacatgacccagaccctg
gggaaaggggtgaatgaggggaccagagaagggagttgggccaggcagaacgctgtgttt
ggacagaggagcttccacttggaacacgctgaagaaccaaagctcgagggcagagccttg
cccagagtgagatggggaacctccctgggcacagtgaaacccttcttggggcacgacctc
atgcttgatgtgaaggtcatcacagaacatcagctctgcctgaagggtggccggcagccc
agtcaggcttaa

>gi568815586f:4710042_4911628|GENSCAN_predicted_peptide_3|529_aa
MRSEKSLTLAAPGEVRGPEGEQQDAGDFPEAGGGGGCCSSERLVINISGLRFETQLRTLS
LFPDTLLGDPGRRVRFFDPLRNEYFFDRNRPSFDAILYYYQSGGRLRRPVNVPLDIFLEE
IRFYQLGDEALAAFREDEGCLPEGGEDEKPLPSQPFQRQVWLLFEYPESSGPARGIAIVS
VLVILISIVIFCLETLPQFRVDGRGGNNGGVSRVSPVSRGSQEEEEDEDDSYTFHHGITP
GEMGTGGSSSLSTLGGSFFTDPFFLVETLCIVWFTFELLVRFSACPSKPAFFRNIMNIID
LVAIFPYFITLGTELVQQQEQQPASGGGGQNGQQAMSLAILRVIRLVRVFRIFKLSRHSK
GLQILGKTLQASMRELGLLIFFLFIGVILFSSAVYFAEADDDDSLFPSIPDAFWWAVVTM
TTVGYGDMYPMTVGGKIVGSLCAIAGVLTIALPVPVIVSNFNYFYHRETEQEEQGQYTHV
TCGQPAPDLRATDNGLGKPDFPEANRERRPSYLPTPHRAYAEKRMLTEV

>gi568815586f:4710042_4911628|GENSCAN_predicted_CDS_3|1590_bp
atgagatcggagaaatcccttacgctggcggcgccgggggaggtccgtgggccggaggga
gagcaacaggatgcgggagacttcccggaggccggcgggggcgggggctgctgtagtagc
gagcggctggtgatcaatatctccgggctgcgctttgagacacaattgcgcaccctgtcg
ctgtttccggacacgctgctcggagaccctggccggcgagtccgcttcttcgaccccctg
aggaacgagtacttcttcgaccgcaaccggcccagcttcgacgccatcctctactactac
cagtctgggggccgcctgcggaggccggtcaacgtgcccctggacattttcctggaggag
atccgcttctaccagctgggggacgaggccctggcggccttccgggaggacgagggctgc
ctgcccgaaggtggcgaggacgagaagccgctgccctcccagcccttccagcgccaggtg
tggctgctctttgagtacccagagagctctgggccggccaggggcatcgccatcgtctcc
gtgttggtcattctcatctccatagtcatcttttgcctggagaccttaccccagttccgt
gtagatggtcgaggtggaaacaatggtggtgtgagtcgagtctccccagtttccaggggg
agtcaggaggaagaggaggatgaagacgattcctacacatttcatcatggcatcacccct
ggggaaatggggaccgggggctcctcctcactcagtactcttgggggctccttctttaca
gaccccttctttctggtggagacgctgtgcattgtctggttcacttttgagctcctggtg
cgcttctccgcctgccctagcaagccggccttcttccggaacatcatgaacatcattgac
ttggtggctatcttcccctacttcatcaccctgggcactgagctggtgcagcagcaggag
cagcaaccagccagtggaggaggcggccagaatgggcagcaggccatgtccctggccatc
ctccgagtcatccgcctggtccgggtgttccgcatcttcaagctctcccgccactccaag
gggctgcagatcctgggcaagaccttgcaggcctccatgagggagctggggctgctcatc
ttcttcctcttcatcggggtcatcctcttctccagtgccgtctacttcgcagaggctgac
gatgacgattcgctttttcccagcatcccggatgccttctggtgggcagtggttacaatg
accacggtaggttacggggacatgtaccccatgactgtggggggaaagatcgtgggctcg
ctgtgtgccatcgctggggtcctcaccattgccctgcctgtgcccgtcatcgtctccaac
ttcaactacttctaccaccgggagacggagcaggaggagcaaggccagtatacccacgtc
acttgtgggcagcctgcgccggacctgagggcaactgacaacggacttggcaagcctgac
ttccccgaggctaaccgggaacggagacccagctaccttcctacaccacatcgggcctat
gcagagaaaagaatgctcacggaggtctga

>gi568815586f:4710042_4911628|GENSCAN_predicted_peptide_4|45_aa
MLLDSTGKSAQGGEAPSSDAKEGFEPRRSGSSNYALKPFVKSHDV

>gi568815586f:4710042_4911628|GENSCAN_predicted_CDS_4|138_bp
atgcttctggacagcacaggcaaatcagctcaaggaggggaagcaccctcctcagacgca
aaggagggatttgagccgcgacgatcaggctccagcaactatgctcttaaaccctttgtt
aaatcgcacgatgtctga

>gi568815586f:4710042_4911628|GENSCAN_predicted_peptide_5|240_aa
MKQYLEIKSSEDSVTSRRGGLTEPPELAPAGTRGGNQGPEWKTDTRGKRVAYKVTFHVSP
PHSGSSDQSTLSQSSQGVQPLRGVSRNTSCCFLYDLFLEVAGTHKQGLSKMSTCFHRPGD
REPLRPQLPYIYLESPSSHLYFCTQYLRRSGAEGQGNIKQAVGEYFPGKTKGFQEKKACR
HWRPPVRQPDPSFITLQNRVPNVQHLHNNARNVTAEHPEREKLNNLQRDRSSRTKQSEKR

>gi568815586f:4710042_4911628|GENSCAN_predicted_CDS_5|723_bp
atgaagcagtacttggaaattaaaagcagtgaggactctgtcaccagcaggcgaggtggc
ctcacagagccccctgagctggcaccggctggcacaagagggggaaaccaaggccctgag
tggaagactgacaccagaggaaaaagggttgcatacaaggtcaccttccatgtctcccct
cctcactccgggtccagtgatcagagcacactgagccaaagttcccagggcgtccagcct
ctgcgtggagtttccaggaacacttcctgctgcttcctctacgacctcttcctggaagtt
gcaggcacccataagcaaggactctctaagatgtccacttgctttcacaggccaggagat
cgggagcctttgaggcctcagctcccctacatctacttggaatccccgagcagccacctt
tacttctgcacccaatacctccggcggagtggggcagaagggcagggtaatatcaaacag
gcagttggagaatattttcctggaaaaaccaaaggattccaagaaaaaaaggcctgtaga
cactggaggcccccagtgagacagcctgatcccagcttcatcactctgcagaaccgtgtg
cctaatgtccaacacctccacaacaatgcacgtaacgtgactgcggagcatccagaaaga
gaaaagcttaacaacctgcagagggacagaagttcacgtacaaagcaatcagaaaagcgt
tga

>gi568815586f:4710042_4911628|GENSCAN_predicted_peptide_6|149_aa
MLQNPTLFEHQHDIINGKFPMHLIPGCGSNKTNRDCLTGPLWSGYFPFRCRQPSEHAASK
SSEITAGAEWSTHLPLSQRALVLLCNKASLTTKHVLKWEFVCASSLFACHYTCAPAVAER
AQCRIQAMASEGASIKPWQPPLGVEPPLY

>gi568815586f:4710042_4911628|GENSCAN_predicted_CDS_6|450_bp
atgctccaaaatccaacactttttgaacaccaacatgatatcataaatggaaaattcccc
atgcacctgattcctggatgtggcagcaataaaaccaaccgagattgtttgactggcccc
ctctggagcgggtatttcccattccgatgccgtcagccatctgagcatgctgcttccaag
agctccgaaataacagccggtgctgaatggagcacacaccttcctctcagccagagagca
cttgtcctcctgtgcaataaagcttcactgacgaccaagcatgttttaaaatgggagttt
gtctgtgcaagctctctctttgcctgccactatacatgtgctccagctgtggctgaaagg
gcccaatgtagaattcaggccatggcttcagagggtgcaagcatcaagccttggcagcct
ccacttggtgttgagcctccactctactag

>gi568815586f:4710042_4911628|GENSCAN_predicted_peptide_7|79_aa
MTVMSGENVDEASAAPGHPQDGSYPRQADHDDHECCERVVINISGLRFETQLKTLAQFPN
TLLGNPKKRMRYFDPLRND

>gi568815586f:4710042_4911628|GENSCAN_predicted_CDS_7|237_bp
atgacggtgatgtctggggagaacgtggacgaggcttcggccgccccgggccacccccag
gatggcagctacccccggcaggccgaccacgacgaccacgagtgctgcgagcgcgtggtg
atcaacatctccgggctgcgcttcgagacgcagctcaagaccctggcgcagttccccaac
acgctgctgggcaaccctaagaaacgcatgcgctacttcgaccccctgaggaacgan