GENSCAN 1.0	Date run: 4-Nov-116	Time: 22:18:53

Sequence gi568815591f:107079894_107300302 : 220409 bp : 38.24% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.04 PlyA -    116    111    6                                1.05
 1.03 Term -    385    237  149  2  2   47   42   154 0.001   3.78
 1.02 Intr -   2195   2158   38  1  2   98  107    24 0.001   2.29
 1.01 Init -  11696  11545  152  2  2   82   73   112 0.875   8.66
 1.00 Prom -  14103  14064   40                               -3.65

 2.02 PlyA -  14862  14857    6                                1.05
 2.01 Sngl -  15397  14906  492  1  0   60   48   241 0.716  13.10
 2.00 Prom -  16796  16757   40                               -9.95

 3.02 PlyA -  16850  16845    6                                1.05
 3.01 Sngl -  18745  18257  489  1  0   88   38   361 0.938  26.91
 3.00 Prom -  20123  20084   40                               -6.55

 4.00 Prom +  22499  22538   40                               -3.15
 4.01 Init +  25534  25663  130  0  1   43   57    95 0.004   2.26
 4.02 Intr +  42059  42111   53  0  2   56   75   111 0.241   4.31
 4.03 Intr +  48319  48402   84  0  0   64  111    85 0.775   7.50
 4.04 Intr +  60954  61060  107  2  2   40   97   115 0.219   5.69
 4.05 Intr +  66415  66568  154  1  1   56   97   149 0.945  11.75
 4.06 Intr +  71029  71130  102  0  0  101  107     7 0.904   3.35
 4.07 Intr +  73284  73358   75  2  0  107   80   109 0.979  10.79
 4.08 Intr +  77293  77431  139  0  1   54   63   105 0.992   3.72
 4.09 Term +  79556  79689  134  0  2   62   48   134 0.990   4.07
 4.10 PlyA +  79825  79830    6                                1.05

 5.00 Prom +  84100  84139   40                               -5.15
 5.01 Init +  87545  87591   47  1  2   68   76    33 0.831   0.32
 5.02 Term +  88684  89041  358  1  1  102   48   250 0.933  15.40
 5.03 PlyA +  91629  91634    6                                1.05

 6.00 Prom +  99553  99592   40                               -3.25
 6.01 Init + 100001 100169  169  1  1   68   99   123 0.958  11.15
 6.02 Intr + 102480 102708  229  1  1   92   63   229 0.989  16.81
 6.03 Intr + 105908 106049  142  2  1   50  110    68 0.968   4.63
 6.04 Intr + 109399 109555  157  0  1   58   97    78 0.869   4.36
 6.05 Intr + 110280 110424  145  1  1   69   85   104 0.973   6.62
 6.06 Intr + 115941 116258  318  0  0   75   98   207 0.831  14.55
 6.07 Intr + 120267 120408  142  0  1   54   36   121 0.508   2.93
 6.08 Intr + 122108 122210  103  1  1   11   98    70 0.401  -0.87
 6.09 Term + 124499 124575   77  0  2   91   32   109 0.282   2.62
 6.10 PlyA + 125641 125646    6                                1.05

 7.06 PlyA - 126080 126075    6                                1.05
 7.05 Term - 128807 128693  115  1  1   84   41    52 0.281  -2.84
 7.04 Intr - 130712 130633   80  2  2   70  103    80 0.898   5.13
 7.03 Intr - 131316 131206  111  0  0   98   90    60 0.898   6.86
 7.02 Intr - 140884 140760  125  2  2   29   66    76 0.726  -1.12
 7.01 Init - 141314 141218   97  2  1   55   61   120 0.436   6.52
 7.00 Prom - 142044 142005   40                               -8.75

 8.08 PlyA - 142067 142062    6                               -0.45
 8.07 Term - 143270 143091  180  2  0  113   38   153 0.873   9.43
 8.06 Intr - 156794 156557  238  1  1  107   88   196 0.671  18.19
 8.05 Intr - 168606 168503  104  0  2  124   26   105 0.057   5.95
 8.04 Intr - 178490 178380  111  2  0   45   88    57 0.056   1.06
 8.03 Intr - 201506 201407  100  1  1   89   89    69 0.587   6.29
 8.02 Intr - 203839 203678  162  0  0   53   95   119 0.803   7.27
 8.01 Intr - 218453 218249  205  0  1   84   27   173 0.136   8.14

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  10433  10256  178  1  1   73   46   132 0.983   3.68
S.002 Term - 218453 218230  224  0  2   84   43   179 0.849   9.10

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591f:107079894_107300302|GENSCAN_predicted_peptide_1|112_aa
MQEPSEPAGMADSHSQALTVHLLQGTDSLSGMCTQLLCACALCGIGLVESGLPLSNLLKL
STSDGTTSCTQWDLKDSEGTLWLPTLASGPLHGAPTETSVLLREPRHTDAPS
>gi568815591f:107079894_107300302|GENSCAN_predicted_CDS_1|339_bp
atgcaggaaccaagtgagcctgcaggaatggcagattcacattcacaggctctcactgtc
cacctactgcagggcacagactcactgagtggaatgtgcacgcagctcctttgtgcttgt
gctctctgtggcataggtttagtagaatcaggattaccactcagcaacctgctaaaactc
agcacgtctgacggtaccacttcctgcacacagtgggatctgaaagatagtgagggcacg
ctgtggctccccacactggcatctggaccactccatggtgctcccactgaaacctctgtg
ctcctccgagagccacggcatacagatgccccttcctga
>gi568815591f:107079894_107300302|GENSCAN_predicted_peptide_2|163_aa
MSELPFTIASKRIKYLGIQLTRDVKDFFKENYKPLLNEIKEDTSKWKNISCSWIGRINIM
KMAILPKVIHRLNAIPIKLPMTSFTELEKTTLKFMWSQKRARIAKTILSQKNKAGGIMLP
DFKLSYKATVTKTAWYCYQNRYRSMEQNRALRNNTTHLQPSDL
>gi568815591f:107079894_107300302|GENSCAN_predicted_CDS_2|492_bp
atgagtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaactt
acaagggatgtgaaggacttcttcaaggagaactacaaaccactgctcaacgaaataaaa
gaggacacaagcaaatggaagaacatttcatgctcatggataggaagaatcaatatcatg
aaaatggccatactgcccaaggtaattcacagactcaacgccatcccaatcaagctacca
atgacttccttcacagaattggaaaaaactactttaaagttcatgtggagccaaaaaaga
gcccgtattgccaagacaatcctaagtcaaaagaacaaagctggaggcatcatgctacct
gacttcaaactatcctacaaggccacagtaaccaaaacagcatggtactgttaccaaaac
agatatagatcaatggaacagaacagagccctcagaaataataccacacatctacaacca
tctgatctttga
>gi568815591f:107079894_107300302|GENSCAN_predicted_peptide_3|162_aa
MGRNQSRKANDSKNQSASSPPKEHSSSPPTEQSRKENDFDELREGGFRQSVITNFSKLKE
DVRTHCKEAKNLQKRLDKWLTRINSTDKTLNDLMELKTMAQELRDACTRFSSQFDQVEER
VSVTEDQMNEMKREDKFREKRVKRNEQSLQEIWDYVKDQICV
>gi568815591f:107079894_107300302|GENSCAN_predicted_CDS_3|489_bp
atggggagaaaccagagcagaaaagctaacgattctaaaaatcagagcgcctcttctcct
ccaaaagaacacagctcctcgccaccaacagaacaaagcaggaaagagaatgactttgac
gagttgagagaaggcggcttcagacaatcagtaataacaaacttctccaagctaaaggag
gatgttcgaacccactgcaaagaagctaaaaaccttcaaaaaagattagacaaatggcta
actagaataaacagtacagacaagaccttaaatgacctgatggagctgaaaaccatggca
caagaactacgtgatgcatgcacaagattcagtagccaatttgatcaagtggaagaaagg
gtatcagtgactgaagatcaaatgaatgaaatgaagcgagaagacaagtttagagaaaaa
agagtgaaaagaaatgaacaaagcctccaagaaatatgggactatgtgaaagaccaaatc
tgcgtctga
>gi568815591f:107079894_107300302|GENSCAN_predicted_peptide_4|325_aa
MVQPKEYVNLLIFGWLSIPVMVQDCKDSIPEKWVSTEKAAPLSICAEAYNPDEEEDDAES
RIIHPKTDDQRNRLQEACKDILLFKNLDPEQMSQVLDAMFEKLVKDGEHVIDQGDDGDNF
YVIDRGTFDIYVKCDGVGRCVGNYDNRGSFGELALMYNTPRAATITATSPGALWGLDRVT
FRRIIVKNNAKKRKMYESFIESLPFLKSLEFSERLKVVDVIGTKVYNDGEQIIAQGKSEV
EENGAVEIARCSRGQYFGELALVTNKPRAASAHAIGTVKCLAMDVQAFERLLGPCMEIMK
RNIATYEEQLVALFGTNMDIVEPTA
>gi568815591f:107079894_107300302|GENSCAN_predicted_CDS_4|978_bp
atggtacaaccaaaggagtatgtgaatctcctaatttttggctggttgtcaatcccagtt
atggtacaagactgtaaagacagtatccctgagaaatgggtcagcactgagaaggctgct
cccttgtctatatgtgcagaagcttataatcctgatgaagaagaagatgatgcagagtcc
aggattatacatccaaaaactgatgatcaaagaaataggttgcaagaggcttgcaaagac
atcctgctgtttaagaatctggatccggagcagatgtctcaagtattagatgccatgttt
gaaaaattggtcaaagatggggagcatgtaattgatcaaggtgacgatggtgacaacttt
tatgtaattgatagaggcacatttgatatttatgtgaaatgtgatggtgttggaagatgt
gttggtaactatgataatcgtgggagtttcggcgaactggccttaatgtacaatacaccc
agagcagctacaatcactgctacctctcctggtgctctgtggggtttggacagggtaacc
ttcaggagaataattgtgaaaaacaatgccaaaaagagaaaaatgtatgaaagctttatt
gagtcactgccattccttaaatctttggagttttctgaacgcctgaaagtagtagatgtg
ataggcaccaaagtatacaacgatggagaacaaatcattgctcagggtaaatcagaagtg
gaagagaatggtgcagtagaaatcgctcgatgctcgcggggacagtactttggagagctt
gccctggtaactaacaaacctcgagcagcttctgcccacgccattgggactgtcaaatgt
ttagcaatggatgtgcaagcatttgaaaggcttctgggaccttgcatggaaattatgaaa
aggaacatcgctacctatgaagaacagttagttgccctgtttggaacgaacatggatatt
gttgaacccactgcatga
>gi568815591f:107079894_107300302|GENSCAN_predicted_peptide_5|134_aa
MWGLILDFLICYAAFRHQNKRADTRVPMTLVYYQDCAVRAPLGYVVQLFPGQQSPTAKDQ
GLRLRTHNPFRGQSLAWVPGLSAIVQSEALTLVIDMPSSPVNTTQKSIHARPPRRTTVPP
NGKGEEPNGSSAVQ
>gi568815591f:107079894_107300302|GENSCAN_predicted_CDS_5|405_bp
atgtgggggcttattttggatttcttgatttgttacgcggctttcagacaccaaaacaaa
cgagctgatacacgtgtccctatgactcttgtttactatcaagactgcgctgtgcgagcg
cctctgggatacgtagtccagctcttccccggccagcagtctcccacagccaaggaccaa
ggcttgagactacgaacccacaatccatttcggggacaaagtttggcttgggtccccgga
ctctcggcaatagtccaatccgaagcacttactcttgtgattgacatgccatcttctcca
gtaaacacaacacaaaagtcaatacatgcccgccccccgagacggaccacggtaccaccc
aatgggaaaggggaagaaccgaatggcagttctgcagtccaataa
>gi568815591f:107079894_107300302|GENSCAN_predicted_peptide_6|493_aa
MVWEVKTNQMPNAVQKLLLVMDKRASGMNDSLELLQCNENLPSSPGYNSCDEHMELDDLP
ELQAVQSDPTQSGMYQLSSDVSHQEYPRSSWNQNTSDIPETTYRENEVDWLTELANIATS
PQSPLMQCSFYNRSSPVHIIATSKSLHSYARPPPVSSSSKSEPAFPHHHWKEETPVRHER
GYGSDGLKLLSHEESVSFGESVLKLTFDPGTVEDGLLTVECKLDHPFYVKNKGWSSFYPS
LTVVQHGIPCCEVHIGDVCLPPGHPDAINFDDSGVFDTFKSYDFTPMDSSAVYVLSSMAR
QRRASLSCGGPGGQDFARSGFSKNCGSPGSSQLSSNSLYAKAVKNHSSGTVSATSPNKCK
RPMNAFMLFAKKYRVEYTQMYPGKDNRAISVILGDRWKKMKNEERRMYTLEAKALAEEQK
RLNPDCWKRKRTNSPKSHLLQLPWHAELMHQVAAAIRYYFQWRPSPGQEKVRQSLNQIVC
LSGAKKNGYEINF
>gi568815591f:107079894_107300302|GENSCAN_predicted_CDS_6|1482_bp
atggtgtgggaagtgaagacaaatcagatgcctaatgcagtacagaaactcctgttggtg
atggacaagagagcctcaggaatgaatgactcattggagttgctgcagtgtaatgagaat
ttgccatcttcacctggatataactcctgtgatgaacacatggagcttgatgaccttcct
gaacttcaggcagttcaaagtgatcctacccaatctggcatgtaccagctgagttcagat
gtttcacatcaagaatacccaagatcatcttggaaccaaaatacctcagacataccagaa
actacttaccgtgaaaatgaggtggactggctaacagaattggcaaatatcgcgaccagt
ccacaaagtccactgatgcagtgctcattttacaatagatcatctcctgtacacatcata
gccactagcaaaagtttacattcctatgcacgccctccaccagtgtcctcttcttcgaag
agtgaaccagccttccctcatcaccattggaaggaggaaacaccagtaagacacgaaagg
ggctatggttctgatggtctaaagttgttatcacatgaagaaagtgtatcatttggcgag
tctgtactgaagttgacttttgatcctggtacagtagaagatggtttacttaccgtagag
tgtaagctggaccaccctttctatgttaaaaataaaggttggtcatcattttatccaagc
ttgactgtggtacagcatggcattccatgttgtgaagttcatattggcgatgtatgtcta
cctcctggacaccccgatgccattaattttgatgattcaggtgtttttgatacatttaaa
agctatgacttcacacctatggattcttctgcagtttatgtgttaagtagtatggctcgc
cagcgtcgtgcatctttgtcttgtggaggacctggtggtcaagactttgcaagatctgga
ttcagtaaaaactgtggctcacctggatcatcacagctctcttccaattctttgtatgct
aaagctgtcaaaaaccacagctcagggactgtgagtgccacttctcctaataagtgcaaa
agaccaatgaatgccttcatgctttttgccaaaaaatacagagttgaatatactcagatg
tatccagggaaagataacagagccataagtgtgatccttggtgacaggtggaagaaaatg
aagaatgaagagagaagaatgtacacattagaagcaaaggctttggctgaagaacagaaa
cgtttaaatcctgactgttggaagaggaaaagaaccaattcacctaagagccacctgctg
cagttaccatggcatgctgagttgatgcaccaggtggcagcagccatccgttattatttc
caatggagacctagcccaggccaagaaaaagttcgtcagtccctgaaccagatcgtctgt
ttaagtggagctaaaaagaatggctacgagattaatttttaa
>gi568815591f:107079894_107300302|GENSCAN_predicted_peptide_7|175_aa
MGSNGERENSQDQCTAEHDNQGQQDRSEGMKPVPFQGLGTPDLQYELVTRWRDFLLHFFP
AIAQTREQPRHNEQASEHVASSPALGDVIPFSIIIQFLFTRAPAELKSPFQRAEWSHTRF
SQWLDDHPSEKDRLLLISYEHLTQKWERLPAPVNVPGKICPRERLWIWVRSLGLG
>gi568815591f:107079894_107300302|GENSCAN_predicted_CDS_7|528_bp
atgggctctaatggagaaagggaaaactcacaggaccaatgtactgctgaacacgacaac
cagggacagcaggacagaagtgagggcatgaagccagtcccctttcagggcctgggcaca
ccagacctacaatatgaacttgtgacccggtggagggacttcttattacactttttccct
gcaatagctcaaacacgagagcagcctcgtcacaatgagcaggcaagtgaacatgtagcc
agtagtcctgcattgggggatgtgattccgttcagcatcattattcagtttttgttcacg
agagcacccgctgaactgaaatctcctttccagagggcagagtggtcccacacacgcttc
tctcagtggctggatgaccatccatctgaaaaggacaggctcctcctcatcagctatgaa
catctgacacagaaatgggagaggttaccagccccggtgaatgttcctggtaaaatctgc
cctcgggaaaggctttggatttgggtgaggagtttgggcctgggctaa
>gi568815591f:107079894_107300302|GENSCAN_predicted_peptide_8|366_aa
XSMFLKQAFEGEYPKLLRLYNDLWKRLQQYSQHIQGNFNASGTTDLYVDLQHMEDDAQDI
FIPKKPDYDPEKALKDSLQPYEAAYLSKSLSRLFDPINLVFPPGGRNPPSSDELDGIIKT
IASELNVAAVDTNLTLAVSKNVAKTIQLYSVKSEQLLSTQGDASQVIGPLTEGQRRNVAV
VNSLYKLHQSVTKAIHALMENAVQPLLTSVGDAIEAIIITMHQEDFSGSLSSSGKPDVPC
SLYMKELQGFIARVMSDYFKHFECLDFVFDNTEAIAQRAVELFIRHASLIRPLGEGGKMR
LAADFAQTSVRILSGLAAFPGPSARGGQDGVMASGMEENDAYICLRIPLMLDVDTGAAGF
LPYFTS
>gi568815591f:107079894_107300302|GENSCAN_predicted_CDS_8|1101_bp
ncttcgatgtttttgaagcaggcatttgaaggagaataccctaaattattacgtctttat
aatgacttatggaagcgtcttcaacaatacagtcagcatatccaagggaattttaatgca
agtggaactacagacctctatgttgacctacaacacatggaagatgatgcacaagatata
ttcataccaaaaaagccagattatgatccagaaaaggctttgaaagactcactacaaccc
tatgaggctgcttatctatcaaaatccttatctcgactcttcgatcctatcaacttggtt
tttcccccgggtggtcgtaatcctccttcctctgatgaacttgatggtattattaaaact
atagcaagtgaactaaatgttgctgctgttgatacaaacctcacattagctgtgtcaaaa
aatgtggcaaagaccatccagttatacagtgtaaaatcagagcagcttctctccacacaa
ggagatgcaagtcaggtgattgggcctcttactgaaggacagagaagaaatgtggcagta
gtgaattcattgtataagttgcaccaatcagtaacaaaggctattcatgctcttatggaa
aatgctgtgcaacccttactcacttctgtgggagatgctatagaggccataatcatcacc
atgcatcaagaagacttttctgggtcattatccagctcaggaaaacctgatgttccttgt
tctctgtacatgaaggagctacaaggtttcattgccagagttatgagtgactattttaaa
cactttgaatgcttggattttgtctttgacaacactgaggctattgcccaaagagctgtt
gaactttttatccgccatgccagtctcataagacctcttggtgaaggtgggaaaatgcga
cttgctgctgattttgcacagacctcagtacgcatacttagtggtcttgctgccttccca
ggcccttcagcaagaggaggtcaagatggggtgatggcaagtggcatggaagagaatgat
gcctatatctgtctcaggataccactgatgttagatgtggacacaggagcagcaggattt
ttgccttactttacttcatga