GENSCAN 1.0 Date run: 7-Nov-116 Time: 20:17:25 Sequence gi568815588r:80238467_80452601 : 214135 bp : 44.33% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.05 PlyA - 369 364 6 1.05 1.04 Term - 9154 8753 402 1 0 47 36 568 0.061 42.65 1.03 Intr - 14133 13927 207 0 0 -50 80 247 0.723 9.37 1.02 Intr - 14592 14338 255 0 0 -44 53 465 0.520 27.44 1.01 Init - 14917 14849 69 1 0 56 75 104 0.870 6.95 1.00 Prom - 15628 15589 40 -1.46 2.10 PlyA - 19905 19900 6 1.05 2.09 Term - 35417 35315 103 1 1 38 43 90 0.379 -2.75 2.08 Intr - 36187 36054 134 1 2 18 103 251 0.733 19.04 2.07 Intr - 36733 36551 183 1 0 125 115 216 0.999 27.98 2.06 Intr - 38128 37910 219 1 0 71 57 488 0.995 42.50 2.05 Intr - 41850 41707 144 0 0 78 99 160 0.999 16.58 2.04 Intr - 42347 42214 134 0 2 89 70 150 0.623 13.66 2.03 Intr - 45572 45450 123 0 0 116 72 218 0.989 23.56 2.02 Intr - 47123 47046 78 0 0 91 83 91 0.994 8.42 2.01 Init - 50957 50867 91 2 1 69 114 53 0.984 6.65 2.00 Prom - 61275 61236 40 -4.26 3.00 Prom + 62274 62313 40 -4.46 3.01 Sngl + 66603 66884 282 2 0 88 37 153 0.785 5.69 3.02 PlyA + 67486 67491 6 1.05 4.00 Prom + 68012 68051 40 -6.36 4.01 Sngl + 69460 70845 1386 0 0 42 32 458 0.887 31.69 4.02 PlyA + 70954 70959 6 -0.45 5.00 Prom + 71237 71276 40 -2.46 5.01 Init + 71674 71806 133 0 1 78 47 90 0.551 4.20 5.02 Term + 77225 77310 86 0 2 131 55 99 0.992 8.72 5.03 PlyA + 77720 77725 6 1.05 6.00 Prom + 87166 87205 40 -3.16 6.01 Init + 89837 89839 3 1 0 93 95 0 0.190 1.20 6.02 Intr + 112382 112588 207 1 0 73 34 83 0.108 0.67 6.03 Intr + 118131 118235 105 2 0 81 78 84 0.190 7.11 6.04 Intr + 123969 124124 156 2 0 95 85 72 0.214 7.81 6.05 Intr + 124485 124607 123 2 0 96 97 146 0.999 17.08 6.06 Term + 128222 128485 264 1 0 93 39 194 0.857 10.51 6.07 PlyA + 128659 128664 6 1.05 7.06 PlyA - 129753 129748 6 1.05 7.05 Term - 141634 141380 255 1 0 29 42 161 0.050 1.19 7.04 Intr - 153370 153325 46 0 1 95 82 0 0.343 -1.49 7.03 Intr - 153485 153397 89 2 2 69 77 106 0.671 6.37 7.02 Intr - 158745 158565 181 2 1 71 34 123 0.931 5.07 7.01 Init - 162329 162292 38 2 2 82 73 46 0.636 1.98 7.00 Prom - 163710 163671 40 -3.26 8.00 Prom + 170736 170775 40 -6.16 8.01 Init + 182035 182179 145 0 1 95 110 303 0.998 31.48 8.02 Intr + 183951 184042 92 1 2 78 105 16 0.958 2.01 8.03 Intr + 187400 187540 141 1 0 104 93 126 0.992 15.25 8.04 Intr + 188866 189030 165 0 0 69 109 136 0.998 13.96 8.05 Term + 193520 193633 114 1 0 128 38 69 0.575 4.47 8.06 PlyA + 194508 194513 6 1.05 9.03 PlyA - 196265 196260 6 1.05 9.02 Term - 206053 206002 52 0 1 104 49 13 0.114 -4.10 9.01 Init - 213407 213193 215 2 2 59 110 124 0.882 9.92 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588r:80238467_80452601|GENSCAN_predicted_peptide_1|310_aa MEIIEVTEEIHVKVVIEGPDATQEVDEVPEDQQLQEVDEVPEDLQLQGVDEVPEDQQLQE VNEVPEDQQLQKVDEVPEDHQLQEVDEVPEDHQLREVDEVPEDRQLQEELDKAPENNRVE EVVKFSGDSLVQEVAEFPEDSRVEVVEFPEDSPVEEFVEVPENLQMEGVFEFPDNTQCSA LRKNGFVVLKGWPCKIVEMSASKTGKHGHAKVHLVGIDIFTGKKYEDICPSTHNMDVPNI RRNDFQLIGIQDGYLSLLQDSGEVPEDLRLPEGDLGKETEQKYDCGEEILITVLSAMTEE AAVAIKAMAK >gi568815588r:80238467_80452601|GENSCAN_predicted_CDS_1|933_bp atggagattattgaagtaaccgaggagattcatgtgaaggtggttattgaggggccagac gccacccaggaggtggatgaggtcccagaggaccaacagctgcaggaggtggatgaggtc ccagaagacctacagctgcagggtgtggatgaagtcccagaggaccaacagctgcaggag gtgaatgaggtcccggaagaccaacagctgcaaaaggtggatgaggtcccggaggaccac cagctgcaggaggtggatgaggttccggaggaccaccagcttcgagaggtggatgaggtc ccggaggaccgacagctgcaggaggagctggataaggccccagagaacaatcgagtggag gaggtggttaagttttcaggggactctctagtgcaggaggtggctgagttcccagaggac agtcgagtggaggtggttgaattcccagaggactctccagtggaggagtttgttgaggtc ccagaaaaccttcagatggagggagtgtttgagttcccagacaacacccagtgctcagca ttacgtaagaatggctttgtggtgctcaaaggctggccatgtaagatcgtggagatgtct gcttcgaagactggcaagcacggccacgccaaggtccatctggttggtattgacatcttt actgggaagaaatatgaagatatctgcccgtcaactcataatatggatgtccccaacatc agaaggaatgacttccagctgattggcatccaggatgggtacctatcactgctccaggac agcggggaggtaccagaggaccttcgtctccctgagggagaccttggcaaggagactgag cagaagtacgactgtggagaagagatcctgatcacggtgctgtctgccatgacagaggag gcagctgttgcaatcaaggccatggcaaaataa >gi568815588r:80238467_80452601|GENSCAN_predicted_peptide_2|402_aa MNGPVDGLCDHSLSEGVFMFTSESVGEGHPDKICDQISDAVLDAHLKQDPNAKVACETVC KTGMVLLCGEITSMAMVDYQRVVRDTIKHIGYDDSAKGSLCFRLGFDFKTCNVLVALEQQ SPDIAQCVHLDRNEEDVGAGDQGLMFGYATDETEECMPLTIILAHKLNARMADLRRSGLL PWLRPDSKTQVTVQYMQDNGAVIPVRIHTIVISVQHNEDITLEEMRRALKEQVIRAVVPA KYLDEDTVYHLQPSGRFVIGGPQGDAGVTGRKIIVDTYGGWGAHGGGAFSGKDYTKVDRS AAYAARWVAKSLVKAGLCRRVLVQVSYAIGVAEPLSISIFTYGTSQKTERELLDVVHKNF DLRPGVIVRDLDLKKPIYQKTACYGHFGRSEFPWEVPRKLVF >gi568815588r:80238467_80452601|GENSCAN_predicted_CDS_2|1209_bp atgaatggaccggtggatggcttgtgtgaccactctctaagtgaaggagtcttcatgttc acatcggagtctgtgggagagggacacccggataagatctgtgaccagatcagtgatgca gtgctggatgcccatctcaagcaagaccccaatgccaaggtggcctgtgagacagtgtgc aagaccggcatggtgctgctgtgtggtgagatcacctcaatggccatggtggactaccag cgggtggtgagggacaccatcaagcacatcggctacgatgactcagccaagggctcactt tgcttccgcctaggctttgacttcaagacttgcaacgtgctggtggctttggagcagcaa tccccagatattgcccagtgcgtccatctggacagaaatgaggaggatgtgggggcagga gatcagggtttgatgttcggctatgctaccgacgagacagaggagtgcatgcccctcacc atcatccttgctcacaagctcaacgcccggatggcagacctcaggcgctccggcctcctc ccctggctgcggcctgactctaagactcaggtgacagttcagtacatgcaggacaatggc gcagtcatccctgtgcgcatccacaccatcgtcatctctgtgcagcacaacgaagacatc acgctggaggagatgcgcagggccctgaaggagcaagtcatcagggccgtggtgccggcc aagtacctggacgaagacaccgtctaccacctgcagcccagtgggcggtttgtcatcgga ggtccccagggggatgcgggtgtcactggccgtaagattattgtggacacctatggcggc tggggggctcatggtggtggggccttctctgggaaggactacaccaaggtagaccgctca gctgcatatgctgcccgctgggtggccaagtctctggtgaaagcagggctctgccggaga gtgcttgtccaggtttcctatgccattggtgtggccgagccgctgtccatttccatcttc acctacggaacctctcagaagacagagcgagagctgctggatgtggtgcataagaacttc gacctccggccgggcgtcattgtcagggatttggacttgaagaagcccatctaccagaag acagcatgctacggccatttcggaagaagcgagttcccatgggaggttcccaggaagctt gtattttag >gi568815588r:80238467_80452601|GENSCAN_predicted_peptide_3|93_aa MGRNQSRKAENSKKQSAPSSPKDRSSSPAMEQSRTENDFDELTEVGFRKLVINFFELKED VQTHRKEAKNFEKRLNEWLTKINSVEKTLNDLM >gi568815588r:80238467_80452601|GENSCAN_predicted_CDS_3|282_bp atggggagaaaccagagcagaaaagctgaaaattctaaaaaacagagcgccccttcttct ccaaaggatcgcagctcctcgccagcaatggaacaaagcaggacagagaatgattttgat gagttgacagaagtaggcttcagaaagttggtaataaacttctttgagctaaaggaggat gttcaaacccatcgcaaggaagctaaaaactttgaaaaaagattaaatgaatggctaact aaaataaacagtgtagagaagaccttaaatgacctgatgtag >gi568815588r:80238467_80452601|GENSCAN_predicted_peptide_4|461_aa MIISIDAEKAFNKIQQPFIIKVFNKLGIDGTYLKVIRAIYDKPTANIILNGQKLEAFPLK TGTRQGCPLSPLLFNITLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIAYLENPVVS AQNLLKLISNFSKVSAYKINVQKSQAFLDTNNRQTESQIMSELPLTIATKIIKYLGIQLT KDVKDLFKENYKPLFNEIKEDTNKWKNIPCSWTGKINIMKTAILPKVIYRFSAITIKLPM TFFTELEKTTLKFIWSQKRARIAKTILSQKNKAGGIMLPDFKLYYKATVTQTAWYWYQNR DIDQWNRTEASEITPHIYNPLIFDKPDKNKKWGKDSLFNKWCWENWLAICRKLKLDPFLT PYTKINSRWIKDLNVRPKTIKALEENLGKTIQDIDMGKDFMTKTSKAMATKAKIDKWDLI KLKSFCTAKETIIRVNRHPTEWDKSFAIYSSDKRLISRIYK >gi568815588r:80238467_80452601|GENSCAN_predicted_CDS_4|1386_bp atgattatctcaatagatgcagaaaaggccttcaacaaaattcaacagcccttcataata aaagttttcaataaactaggtattgatgggacatatctcaaagtaataagagctatttat gacaaacccacagccaatatcatactgaatgggcaaaaactggaagcattccctttgaaa actggcacaagacaaggatgccctctctcaccactcctattcaacataacgttggaagtt ctggccagggcaatcaggcaggagaaagaaataaagggtattcagttaggaaaagaggaa gtcaaattgtccctgtttgcagatgacatgattgcgtatttagaaaaccccgtcgtctca gcccaaaatctccttaagctgataagcaacttcagcaaagtctcagcatacaaaatcaat gtgcaaaaatcacaggcattcttagacaccaataacagacaaacagagagccaaatcatg agtgaactcccactcacaattgctacaaagataataaaatacctaggaatccaacttaca aaggatgtgaaggacctcttcaaggagaactacaaaccactgttcaatgaaataaaagag gacacaaacaaatggaagaacattccatgctcatggacaggaaaaatcaatatcatgaaa acggccatactgcccaaggtaatttatagattcagtgccatcaccatcaaactaccaatg acattcttcacagaattggaaaaaactactttaaagttcatatggagccaaaaaagagcc cgcattgccaagacaatcctaagccaaaagaacaaagctggaggcatcatgctacctgac ttcaaactatactacaaggctacagtaacccaaacagcatggtactggtaccaaaacaga gatatagaccaatggaacagaacagaggcctcagaaataacaccacacatctacaaccct ctgatctttgacaaacctgacaaaaacaagaaatggggaaaggattccctatttaataaa tggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttaca ccttacacaaaaattaattcaagatggattaaagacttaaatgttagacctaaaaccata aaagccctagaagaaaacctaggcaaaaccattcaggacatagacatgggcaaggacttc atgactaaaacatcaaaagcaatggcaacaaaagccaaaatagacaaatgggatctaatt aaactaaagagcttctgcacagcaaaagaaactatcatcagagtgaacaggcatcctaca gaatgggacaaaagttttgcaatctactcatctgacaaaaggctaatatccagaatctac aaataa >gi568815588r:80238467_80452601|GENSCAN_predicted_peptide_5|72_aa MEYYAAIKKDEFMSFVGTRMKLETIILSKLSQGQKNKHHTSSLIGGLLRSPSLPNEGCSL PQGAQSHQLPKG >gi568815588r:80238467_80452601|GENSCAN_predicted_CDS_5|219_bp atggaatactatgcagccataaagaaggatgagttcatgtcctttgtagggacacggatg aagctggaaaccatcattctgagcaaactatcacaaggacagaaaaacaaacaccatacg tcctcactcataggtgggctcctgcgcagcccgagcctccccaacgagggctgctccctg ccccagggcgcccagtcccatcaactgcccaagggctga >gi568815588r:80238467_80452601|GENSCAN_predicted_peptide_6|285_aa MLVLHDPSLQQFPYQYLQLPNHATPILLQLKGLPAPLLHSDIWAQVKRITSSHNQIKWHD YQYLIFNCYQVSPARRPRPNGQNRLCPPDPAKLGEQIPKTVRAFPAARMETNYLKRCFGN CLAQALAEVAKVRPSDPIEYLAHWLYHYRKTAKAKEENREKKIHLQEEYDSSLKEMEMTE MLKQEEYQIQQNCEKCHKELTSETVSTKKTIFMQEDTNPLEKEALKQEFLPGTSSLIPGM PQQVPPSESAGQIDQNFKMPQEINYKEAFQHEVAHEMPPGSKSPF >gi568815588r:80238467_80452601|GENSCAN_predicted_CDS_6|858_bp atgcttgtactccatgacccatcccttcaacaattcccttaccaatatcttcagctccct aatcatgctaccccaatcctgcttcaattaaaaggtctgcctgctccacttcttcactca gacatctgggcccaagtgaagagaatcacatcatcacataaccaaatcaaatggcacgat tatcagtacttgatcttcaactgctaccaagtgagtcctgctcgacgcccaaggcccaac ggtcagaaccgcctctgcccgccggacccagcgaagctaggggaacaaatcccaaaaaca gttcgggcctttccggctgccaggatggaaactaactacctgaagaggtgctttggaaat tgcctggcccaggcactggcagaggtggcgaaggttcggcccagtgacccaatagaatac ctggctcactggctttatcattacaggaaaacagcaaaagcaaaagaagagaatagggaa aagaagatccacctgcaggaggaatatgacagtagcctcaaggaaatggaaatgacagaa atgctgaaacaggaagagtatcagattcaacagaactgtgaaaagtgtcacaaggaactg acttctgaaactgtttccacgaagaagaccatattcatgcaggaggacacaaaccccctt gagaaggaggccttgaagcaggaattcctgccaggtacttccagtctgattccaggaatg cctcaacaggttcctccttcagagtctgctggccagattgaccagaacttcaaaatgcca caagaaataaattacaaggaggcttttcagcatgaagttgctcatgaaatgcctcctggc tccaaatctcctttttag >gi568815588r:80238467_80452601|GENSCAN_predicted_peptide_7|202_aa MRHKAEHTINKLSYMGIGNFYSSLAIPKGFTLSSSDSLFRESEQEGTADIKCASIPVLAF LLTVSVSDYVLDIIAQRLVQGHHSFPQGDLIRSQMLIVLQQLWERRYPRPYVAPKQSQIF NVDEIAFYKKKMPSKTFMATEEKSMPGFKASKDRLTLLLESNASGDVKLKPTLIYHSENP RGLKNDARSTLPVFISGTTKPG >gi568815588r:80238467_80452601|GENSCAN_predicted_CDS_7|609_bp atgaggcacaaagcagaacatactatcaacaagctgagctatatgggaataggaaacttt tactcctcactcgctattcccaagggcttcactctcagctcctcagattccctgttcagg gaatctgaacaggaaggaacagctgacatcaaatgtgcctctatcccagttctggccttt ctcctgactgtttctgtgtctgactatgtgcttgacattatagcacaaagacttgtccaa ggtcatcattcatttcctcaaggtgacctgattcgcagccagatgctcatcgtgctccag cagctgtgggaaaggaggtacccaaggccctatgttgctcccaaacaaagccagattttc aatgtagatgaaatagccttctataagaagaagatgccgtccaagactttcatggctacg gaggagaagtcaatgcctggcttcaaagcttcaaaggacaggctgactctcttgttagag tctaatgcatctggtgatgttaagttgaagccaacactcatttatcattctgaaaatcct aggggccttaagaatgatgctagatctactctgcctgtgtttatcagtggaacaacaaag cctggatag >gi568815588r:80238467_80452601|GENSCAN_predicted_peptide_8|218_aa MGMWSIGAGALGAAALALLLANTDVFLSKPQKAALEYLEDIDLKTLEKEPRTFKAKELWE KNGAVIMAVRRPGCFLCREEAADLSSLKSMLDQLGVPLYAVVKEHIRTEVKDFQPYFKGE IFLDEKKKFYGPQRRKMMFMGFIRLGVWYNFFRAWNGGFSGNLEGEGFILGGVFVVGSGK QGILLEHREKEFGDKVNLLSVLEAAKMIKPQTLASEKK >gi568815588r:80238467_80452601|GENSCAN_predicted_CDS_8|657_bp atggggatgtggtccattggtgcaggagccctgggggctgctgccttggcattgctgctt gccaacacagacgtgtttctgtccaagccccagaaagcggccctggagtacctggaggat atagacctgaaaacactggagaaggaaccaaggactttcaaagcaaaggagctatgggaa aaaaatggagctgtgattatggccgtgcggaggccaggctgtttcctctgtcgagaggaa gctgcggatctgtcctccctgaaaagcatgttggaccagctgggcgtccccctctatgca gtggtaaaggagcacatcaggactgaagtgaaggatttccagccttatttcaaaggagaa atcttcctggatgaaaagaaaaagttctatggtccacaaaggcggaagatgatgtttatg ggatttatccgtctgggagtgtggtacaacttcttccgagcctggaacggaggcttctct ggaaacctggaaggagaaggcttcatccttgggggagttttcgtggtgggatcaggaaag cagggcattcttcttgagcaccgagaaaaagaatttggagacaaagtaaacctactttct gttctggaagctgctaagatgatcaaaccacagactttggcctcagagaaaaaatga >gi568815588r:80238467_80452601|GENSCAN_predicted_peptide_9|88_aa MLKATGSEKDTQGEGVDAEEERVQEGPRGTKAGKKTEKETGEAEGESGVSQRLRKCFQKA VVSCVRHCLEIRILCAPPPPIFVARSLS >gi568815588r:80238467_80452601|GENSCAN_predicted_CDS_9|267_bp atgctcaaggccacgggatctgagaaggacacacagggtgagggagtagatgcagaagag gagagagttcaggagggacccaggggaaccaaagcaggcaagaagactgaaaaagagact ggggaggcagaaggagaatcaggagtatcccagagactgagaaaatgtttccagaaagca gttgtcagctgtgttagacactgcttagagatcagaatcctctgtgcaccgcccccaccc atctttgtggccagatccctctcctga