GENSCAN 1.0 Date run: 6-Nov-116 Time: 04:20:34 Sequence gi568815580f:31918864_32168381 : 249518 bp : 42.68% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 Intr - 12660 12466 195 1 0 78 68 183 0.354 13.76 1.01 Init - 23901 23577 325 0 1 103 56 170 0.053 12.84 1.00 Prom - 29423 29384 40 -4.55 2.00 Prom + 33410 33449 40 -4.65 2.01 Init + 43315 43721 407 0 2 90 60 378 0.885 31.31 2.02 Intr + 43873 44246 374 1 2 1 14 360 0.852 13.68 2.03 Term + 44320 44816 497 2 2 33 52 283 0.970 12.94 2.04 PlyA + 45412 45417 6 1.05 3.00 Prom + 46739 46778 40 -8.75 3.01 Sngl + 48118 48507 390 0 0 17 42 574 0.957 41.67 3.02 PlyA + 49435 49440 6 1.05 4.04 PlyA - 51303 51298 6 1.05 4.03 Term - 55874 55567 308 0 2 66 36 162 0.237 3.09 4.02 Intr - 59435 59181 255 0 0 46 64 197 0.454 9.59 4.01 Init - 59734 59674 61 1 1 55 116 44 0.208 5.37 4.00 Prom - 71326 71287 40 -3.15 5.00 Prom + 75147 75186 40 -7.45 5.01 Init + 82168 82225 58 0 1 42 114 23 0.918 1.82 5.02 Intr + 83060 83235 176 0 2 135 50 27 0.883 2.34 5.03 Intr + 83429 83536 108 1 0 12 80 120 0.866 3.16 5.04 Intr + 84210 84291 82 2 1 61 110 127 0.958 10.49 5.05 Intr + 85808 85974 167 0 2 51 65 82 0.379 0.96 5.06 Term + 87405 87464 60 2 0 102 45 37 0.337 -2.37 5.07 PlyA + 89153 89158 6 1.05 6.00 Prom + 92348 92387 40 -6.65 6.01 Init + 100001 100164 164 1 2 85 86 260 0.996 22.75 6.02 Term + 100403 100997 595 2 1 57 37 190 0.765 3.81 6.03 PlyA + 102186 102191 6 1.05 7.05 PlyA - 103288 103283 6 1.05 7.04 Term - 108291 108218 74 1 2 96 29 30 0.282 -5.01 7.03 Intr - 108809 108760 50 1 2 66 92 94 0.435 5.01 7.02 Intr - 115508 115435 74 2 2 83 67 115 0.482 6.19 7.01 Init - 119037 118843 195 0 0 42 52 127 0.093 2.64 7.00 Prom - 122509 122470 40 -2.65 8.04 PlyA - 122568 122563 6 1.05 8.03 Term - 129991 129702 290 2 2 77 38 102 0.927 -1.35 8.02 Intr - 130353 130237 117 1 0 71 86 167 0.512 14.32 8.01 Init - 136596 136587 10 0 1 73 74 3 0.288 -1.85 8.00 Prom - 140555 140516 40 -3.05 9.00 Prom + 143758 143797 40 -4.15 9.01 Init + 144072 144125 54 2 0 56 90 48 0.757 1.13 9.02 Intr + 147039 147146 108 2 0 93 92 107 0.997 11.16 9.03 Term + 149435 149521 87 1 0 85 28 101 0.972 0.48 9.04 PlyA + 150633 150638 6 1.05 10.02 PlyA - 150889 150884 6 1.05 10.01 Sngl - 157409 157197 213 2 0 90 40 231 0.996 13.43 10.00 Prom - 160170 160131 40 -6.95 11.03 PlyA - 163559 163554 6 1.05 11.02 Term - 173315 173042 274 1 1 73 45 163 0.172 4.46 11.01 Init - 173915 173866 50 2 2 101 51 64 0.312 4.50 11.00 Prom - 188974 188935 40 -6.05 12.00 Prom + 189304 189343 40 -5.75 12.01 Init + 192919 193056 138 0 0 61 80 58 0.774 2.49 12.02 Intr + 194882 194997 116 1 2 56 115 58 0.769 3.63 12.03 Intr + 204655 204711 57 1 0 71 121 44 0.606 3.08 12.04 Intr + 221383 221585 203 1 2 20 9 262 0.361 9.51 12.05 Term + 221920 222095 176 2 2 -17 44 202 0.590 2.24 12.06 PlyA + 223351 223356 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815580f:31918864_32168381|GENSCAN_predicted_peptide_1|174_aa MAQCVQSVQELIPDSFVPCVAALCSDEAERLTRLNHLSFAELLKPFSRLTSEGMWYPPFS SGLPQSRGREKEVAGGQGREGQCLRPGAGTVASGAVAPRCSGGGWRRDLHMRDPNNQLHV IKNLKIAVSNIVTQPPQPGAIRKLLNDVVSGSQPAEGLVANVITAGDYDLNISX >gi568815580f:31918864_32168381|GENSCAN_predicted_CDS_1|522_bp atggcccagtgtgtacaatcagtgcaggagctaatcccggactccttcgtcccctgtgtc gctgcgctgtgcagcgacgaagccgagcggctcactcgtctcaatcacctcagcttcgcg gagctgcttaagcccttctcccgcctcacttccgagggtatgtggtatcctcccttttcc agtgggctcccgcaaagccgtggtcgggagaaggaagttgcgggcgggcagggaagagaa gggcagtgtttacgtcctggagccggtaccgtggcgtctggggctgtggccccgcggtgc tccgggggcggctggaggagggaccttcacatgagagatcctaataatcaacttcacgta attaaaaatttgaagatagcagtaagcaacattgtcacccagccacctcagcctggagcc atccggaagcttttgaatgatgttgtttctggcagtcagcctgcagaaggattagtagct aatgtgattacagcaggagattatgaccttaacatcagtgnn >gi568815580f:31918864_32168381|GENSCAN_predicted_peptide_2|425_aa MAQADIVLIRLAVMGQNLILTMKDHGFVVCAFNRTVSEVDDFLANEAKGTKVVGAQSLKE MVSKLKKPQRIILLVKAGQAVDYFIKKLIPLLDIGDIIIDGGNSEYRDTTRRCQDLKAKG ILFVGSGVSGGEEGAWIEYGDMQLICEAYHLMKDVLGMAQDKMAQAFEDWNKTELDSFLI EITANILKFQGADGKHLLPKIRDSAGQKGMGKWTTISALEYGIPVTLIGEAVFAWCLSSL KDGRIQASKKLKGPQKFQFDGFMLLRQGATEFGWTLNYGDIALMWRGGCIIRSVFLGKIK DAFDRNPEPQNLLLDDFLSQLLKTTRTLGGGQSALGSRLAFPCPVLSLPSPSMTSTDTRC FQPTSSRLSGIPLGLTPMNSWPNQGSLSTPTGQATVAVYHPRHTMPDDDDAAPVTLHDST DQDIP >gi568815580f:31918864_32168381|GENSCAN_predicted_CDS_2|1278_bp atggcccaagctgacatcgtgctgatcagattggcagtcatgggccagaacttaattctg accatgaaagaccatggctttgtggtctgtgcttttaataggactgtctccgaagtggat gatttcttggccaatgaggcaaagggaaccaaagtggtgggtgcccagtccctgaaagag atggtctccaagctgaagaagccccagcggatcatcctcctggtgaaggctgggcaagct gtggattatttcatcaaaaaactgataccattgttggacattggtgacatcatcattgat ggaggaaattctgaatatagggacaccacaagacggtgccaagacctcaaggccaaggga attttatttgtggggagcggagtcagtggtggagaggaaggggcctggatagagtatggg gacatgcagctgatctgtgaggcataccacctgatgaaagacgtgctgggcatggcacag gacaagatggcccaggcctttgaggattggaataagacagagctagactcattcctgatt gaaatcacagccaatattctcaagttccaaggtgctgatggcaaacacttgctgccaaag atcagggacagtgcagggcagaagggcatggggaagtggaccaccatctctgccctggag tatggcatacccgtcaccctcattggagaagccgtctttgcttggtgcctgtcatctctg aaggatgggagaattcaagctagcaaaaagctgaagggtccccaaaagttccagtttgat ggctttatgctgctaaggcagggagccaccgagtttggctggaccctcaattatggtgat attgccctgatgtggagagggggctgcattattagaagtgtattcctaggaaagataaag gatgcgtttgatcgaaacccagaacctcagaacctcctactggacgacttcttaagtcag ctgttgaaaaccactaggactcttggtggtgggcagtcagcactggggtccaggctggca ttcccatgccctgttttgtcactgccctctccttctatgacaagtacagacacgagatgc ttccagccaacctcatccaggctcagcgggattcctttggggctcacacctatgaactct tggccaaaccagggcagtttatccacaccaactggacaggccacggtggcagtgtatcat cctcgtcatacaatgcctgatgatgatgatgctgctcctgtcaccctccacgattccaca gaccaggacattccatga >gi568815580f:31918864_32168381|GENSCAN_predicted_peptide_3|129_aa MKNIVKNYSEAEIKVQEATSNDPWGPSSSLMTEIADLTYNMVAFSGIMSMMWKRLNDHSK NWQHVYKELLLLDYPIKTGSECMAQQCREIIFAIQTLKDFQYIDQDGKDQGINVHEKSKQ LVALLKDEE >gi568815580f:31918864_32168381|GENSCAN_predicted_CDS_3|390_bp atgaaaaacattgtgaaaaattactcagaggcagaaatcaaagtccaggaggccacctcc aatgacccatggggcccatccagctctctgatgaccgagattgcagacctgacctacaac atggtggccttctcggggatcatgagcatgatgtggaagcggctcaatgaccatagcaag aactggcagcatgtgtacaaggagctgctgctgctggactaccccatcaagacaggctcc gagtgcatggcccagcagtgccgtgagatcatctttgccatccagaccctgaaggacttc cagtacattgaccaagatggcaaggaccagggcatcaacgtgcatgagaagtcaaagcaa ctggtggccctcctcaaagatgaggaatga >gi568815580f:31918864_32168381|GENSCAN_predicted_peptide_4|207_aa MWAERILTYCSNQVTLCTDQGSKSETNSSEKNRVLTSKQSTRVWDPVDHLPLLNAPDPNF PPPQTATTAPDPSLTHVVPPPYDPDFWELSLHEPAPKYPSLKGFQRCCIIPCIHGLVQRL IETALTKTFNSPPPYSDKLLLLENQAEQQSQDMLKSLKRKNYENQEGEVVGNIEFLFKDP LSSIFKAYLPCLLAPSYGKKFPAIPNL >gi568815580f:31918864_32168381|GENSCAN_predicted_CDS_4|624_bp atgtgggcagaaaggatcctcacatactgcagcaaccaggtaactctgtgcacagaccaa ggaagtaagtcagaaaccaactcctcagaaaagaatagggtccttacctccaaacagtcc accagagtatgggatcctgtagaccaccttcccctgcttaatgcccctgatcctaacttc cctccccctcagacagccactactgccccagatccttcccttactcacgttgttcctcct ccttatgaccctgacttttgggaactatcattgcatgagcctgctcctaagtacccttcc ctaaaaggatttcaacgatgctgcatcataccctgtattcatggattggtgcagagactt atagaaacagctctcaccaaaacctttaattctcccccaccttactcagataaactccta cttttagaaaaccaagcagaacagcagagtcaagacatgttaaagagtttgaagaggaag aattatgaaaatcaagagggggaagtagtaggaaacattgagttcctcttcaaagatcca ttgtcctctattttcaaagcctatcttccttgcctccttgcccctagttatggtaaaaaa tttccagccattcccaatctgtaa >gi568815580f:31918864_32168381|GENSCAN_predicted_peptide_5|216_aa MTSGPQTNQPKEHLTNFKSGCCQVPVLPQPLLPNPIILLSPPLPTPRPGLQFRFATSPPP PAQQFPLGEVAGAEGIVRPNLTLTTAGFMSQTSRKALEQFPEKIPNGTIRQIPQWLKTDA ARSPWKPPRPSRMLWVTLTVEERNFLTMQGSSSIINTSLIKTLLKAALLPKEAGVIHCKG HQKASDPITQGNTYADKGHLRQFIVINQNATRWILA >gi568815580f:31918864_32168381|GENSCAN_predicted_CDS_5|651_bp atgacctcgggtcctcagaccaaccaacccaaggaacatctcaccaatttcaaatcgggc tgctgccaggtcccggttcttcctcagcctctgctccccaatcctataatccttctatca cctcccctcccaacacccaggcctggcttacagtttcgttttgcgactagccctccccca cctgcccaacaatttcctcttggagaggtggctggagctgaaggcatagtcaggcccaat ctcacactgacaaccgccggcttcatgagccagacctccaggaaggcattagagcagttc cctgagaagatccccaatggaactatcaggcagattccccagtggctgaagactgacgct gcccgatcgccttggaagccccctagaccatcacggatgctttgggtaactctcacagtg gaggaaagaaatttcctcactatgcaagggtcctcctccatcattaacacttctttaata aaaactcttctcaaagccgctttacttccaaaggaagctggagtcattcactgcaagggc catcagaaggcatcagatcccatcactcagggcaacacttatgctgataagggacacctg agacagttcatagtaatcaatcaaaatgctacccgttggatattggcttga >gi568815580f:31918864_32168381|GENSCAN_predicted_peptide_6|252_aa MGSVLSTDSGKSAPASATARALERRRDPELPVTSFDCAVCLEVLHQPVRTRCGHVSRSPG AELQRRGSRHGSRPEPGLANWGGCSWLENQRSPRHGKETSSSYKAGAPAKRSYPAEHGRT GCASLCPRNPGPFCFAGAVHVFPECPQRLVLSKQGSVQSVFPPPPLLPFPNTDTQPLKGQ VKLELPSDFPQLLEFNNLLGWGLDVSRQAPECTPSKPSHNVTGGGRLPALRTVSPNRRKL QSKITFSKPHST >gi568815580f:31918864_32168381|GENSCAN_predicted_CDS_6|759_bp atgggctccgtgctgagcaccgacagcggcaaatcggcgcccgcctctgccaccgcgcgg gccctggagcgcaggagggacccggagttgcccgtcacgtccttcgactgcgccgtgtgc cttgaggtgttacaccagcctgtccggacccgctgtggccacgtctcccgaagcccaggt gcggaactccagagaaggggatccaggcacgggtcccgcccagagccagggctcgccaac tggggagggtgctcttggctggagaaccagcgctcgcccaggcacggaaaggaaacgagc tccagctacaaggcaggggctcccgcaaagcgcagttaccccgctgaacacggtagaacg ggctgcgcttccctctgccccagaaacccgggtccgttttgttttgctggagctgtgcac gtgttcccagaatgcccacagaggctcgtcttaagcaaacaaggatctgtgcaaagtgtc tttccacccccacccctgctaccctttcccaacacagacacccagcccctcaaagggcaa gtcaaactggagcttccgagtgactttccacagttgttagagtttaacaacctgctaggg tggggcttggacgttagccgccaagcccctgaatgtactccttcaaaacccagccacaac gtaacaggaggtggcaggctacctgctttacgcacggtctctccaaaccggcggaaatta cagagtaaaataactttctcaaaaccgcatagtacttag >gi568815580f:31918864_32168381|GENSCAN_predicted_peptide_7|130_aa MFRFCPIRVQFFQSSLRLATFRILLIGAFYRVLIGALYRALIDAFYRALIGAFYRALIGV FYNPLGLCPAAAAALLKRKTERDAKEITPRANEWDLGTTDEKPAKKGNKTPGDVNPLTQE GKIKEILGKN >gi568815580f:31918864_32168381|GENSCAN_predicted_CDS_7|393_bp atgttccgtttctgtcctatcagagtgcagtttttccaatcctccctgcgattggctact ttcagaatcctgctgattggtgcgttttacagagtgctgattggtgcactttacagagca ctgattgatgcgttttacagagcactgattggtgcattttacagagcgctaattggtgtg ttttacaatcctcttggcttatgtcccgctgcagctgccgccctgctcaagagaaagact gagcgggatgctaaggagataacgccaagagccaatgaatgggatcttggaacaacggat gaaaagccagcaaagaaagggaacaagaccccaggagatgtaaatcctttaactcaggaa ggaaaaataaaagaaatacttggaaaaaattag >gi568815580f:31918864_32168381|GENSCAN_predicted_peptide_8|138_aa MCNCQLSSMTLKTCKIMEDLASWATFNLGQTEQHVDWISAGAAPQGPIVKQLPSPVPPTE QLEQEKPVTIGSAPSTPNKLLLSHSNCQLPRPLRSITNISACFSLELAIDHRGFPHSPGA RSLYSNSVFVTYQLCDLV >gi568815580f:31918864_32168381|GENSCAN_predicted_CDS_8|417_bp atgtgcaactgtcagctgtcttcaatgaccctgaagacctgtaaaatcatggaggaccta gcttcatgggccaccttcaatttgggccaaactgagcagcatgtggactggatttctgct ggagcagctccccagggacctattgtgaaacaactccctagcccagttcctcctacagag caacttgagcaggaaaagcccgtgaccatcggctctgctccctcaacacctaacaagctc ctacttagccattccaattgccaactccctaggccattaagatcaataacaaatatctct gcatgcttttctcttgaacttgctatagaccacaggggtttcccacacagccctggagct agatcactgtattcaaattctgtctttgttacttaccagctgtgtgaccttgtatga >gi568815580f:31918864_32168381|GENSCAN_predicted_peptide_9|82_aa MGRVGWLMLVIPALWEAEFCPLCRLIPDENPSSFSGSLIRHLQVSHTLFYDDFIDFNIIE EALIRRVLDRSLLEYVNHSNTT >gi568815580f:31918864_32168381|GENSCAN_predicted_CDS_9|249_bp atgggccgggtggggtggctcatgcttgtaatcccagcactttgggaggcagagttctgt ccactttgccgtttaatacccgatgagaatccaagcagcttcagtggcagtttaataaga catctgcaagttagtcacactttgttttatgatgatttcatagattttaatataattgag gaagctcttatccgaagagtcttagaccggtcacttcttgaatatgtgaatcactcgaac accacataa >gi568815580f:31918864_32168381|GENSCAN_predicted_peptide_10|70_aa MVRINVLADALKSINNAKKRGKHQVLIGPCSKVIIRFLTVIMKHGYIGESEIIDDQSWED CEPHRQAKQV >gi568815580f:31918864_32168381|GENSCAN_predicted_CDS_10|213_bp atggtgcgcataaatgtcctggctgatgctctcaagagcatcaacaatgccaaaaagaga ggcaaacaccaggttcttattgggccatgctccaaagtcatcatccggtttctaactgtg ataatgaagcatggttacattggcgagtctgaaatcattgatgatcagagctgggaagat tgtgaacctcacaggcaggctaaacaagtgtag >gi568815580f:31918864_32168381|GENSCAN_predicted_peptide_11|107_aa MAGDGETRRWRQRQRPRDPAQRFPRASCPAAAIFAAEAPGQMEGGVRCAYRPLTGPEPRL MVQGDSPAAASSSVLALRCLQPTRPSPNAGAPGIWAEEPGEAGRHGV >gi568815580f:31918864_32168381|GENSCAN_predicted_CDS_11|324_bp atggcgggggatggggaaacaaggcgatggcgacagcggcagcggccgagagacccggcc cagcgctttcctcgagcttcctgtccggccgccgctatcttcgcagccgaggccccgggg caaatggagggcggcgtgcgctgtgcttatcggcccctaacggggccggagccgaggctc atggtgcagggagattcgcccgccgccgccagcagctctgtcttggcattgcgctgccta cagcctacccggccgtcaccgaacgcgggagcgccggggatctgggcggaggagccagga gaggcggggaggcacggggtgtga >gi568815580f:31918864_32168381|GENSCAN_predicted_peptide_12|229_aa MRESGAHCPLCRGNVTRRERACPERALDLENIMRKFSGSCRCCAKQIKFYRMRHHYKSCK KYQDEYGVSSIIPNFQISQDSVGNSNRSETSTSDNTETYQENTRPLRDPLGEASWAPESG GDLENLYVYLRDCKHTNQHPVSSSGIVNTPIGTLYLAQGLQTPISILRLAQEASKTTNPP EGRNSEHIRTSERTNSRRATLRAVTLTARVRSFILEVSETKNPPIPDTA >gi568815580f:31918864_32168381|GENSCAN_predicted_CDS_12|690_bp atgagggaaagcggagcacattgtcccctatgtcgtggaaatgtgactagaagagagaga gcatgtcctgaacgggccttagaccttgaaaatataatgaggaagttttctggtagctgc agatgctgtgcaaaacagattaaattctatcgcatgagacatcattacaaatcttgtaag aagtatcaggatgaatatggtgtttcttctatcattccaaactttcagatctctcaagat tcagtagggaacagcaataggagtgaaacatccacatctgataacacagaaacttaccaa gagaatacaaggcccctgcgggatccactgggtgaagccagctgggctcctgagtctggt ggggacctggagaacctttatgtctatctcagggattgtaaacacaccaatcagcaccct gtgtctagctcagggattgtaaatacaccaatcggcactctgtatctagctcaaggtttg caaacaccaatcagcatcctgcgtctagctcaggaagccagcaagaccacgaacccacca gaaggaagaaactccgaacacatccgaacatcagaaagaacaaactccagacgcgccacc ttaagagctgtaacactcactgcgagggtccgcagcttcattcttgaagtcagtgagacc aagaacccaccaattcctgacacagcatga