GENSCAN 1.0 Date run: 8-Nov-116 Time: 04:06:36 Sequence gi568815578f:43358034_43561060 : 203027 bp : 45.71% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 5514 5531 18 2 0 68 109 5 0.193 0.79 1.02 Intr + 6109 6251 143 0 2 34 44 106 0.196 -0.05 1.03 Intr + 8850 8928 79 0 1 74 89 47 0.250 2.95 1.04 Term + 10789 11622 834 0 0 -11 48 381 0.216 17.19 1.05 PlyA + 11714 11719 6 1.05 2.00 Prom + 11862 11901 40 -8.96 2.01 Sngl + 11922 12770 849 2 0 60 49 383 0.407 27.39 2.02 PlyA + 13695 13700 6 1.05 3.05 PlyA - 13772 13767 6 1.05 3.04 Term - 20577 20456 122 1 2 103 41 20 0.292 -2.56 3.03 Intr - 24380 24135 246 0 0 54 91 158 0.203 10.13 3.02 Intr - 32516 32363 154 2 1 96 45 6 0.286 -3.25 3.01 Init - 33236 33060 177 2 0 73 1 169 0.285 5.96 3.00 Prom - 38448 38409 40 -2.46 4.03 PlyA - 39646 39641 6 1.05 4.02 Term - 46400 45568 833 0 2 86 43 295 0.950 17.98 4.01 Init - 47814 47766 49 0 1 97 58 12 0.697 -1.79 4.00 Prom - 49375 49336 40 -0.86 5.00 Prom + 52225 52264 40 -4.36 5.01 Init + 71877 71882 6 2 0 92 93 0 0.148 1.75 5.02 Term + 82678 82800 123 0 0 86 54 154 0.947 10.18 5.03 PlyA + 85536 85541 6 1.05 6.00 Prom + 92985 93024 40 -6.46 6.01 Init + 100001 100107 107 1 2 91 92 141 0.959 14.49 6.02 Intr + 100328 100476 149 2 2 90 73 368 0.929 35.38 6.03 Intr + 101738 101862 125 0 2 39 95 56 0.995 1.70 6.04 Term + 102000 102212 213 2 0 98 32 178 0.324 10.43 6.05 PlyA + 102252 102257 6 1.05 7.05 PlyA - 102374 102369 6 -1.75 7.04 Term - 102588 102461 128 1 2 32 45 164 0.316 4.94 7.03 Intr - 102836 102636 201 0 0 92 27 204 0.169 13.96 7.02 Intr - 128517 128492 26 2 2 78 76 28 0.063 -1.73 7.01 Init - 136009 135948 62 1 2 45 103 73 0.545 4.32 7.00 Prom - 136579 136540 40 -2.06 8.00 Prom + 137296 137335 40 -2.86 8.01 Init + 155453 155606 154 1 1 99 109 67 0.277 10.30 8.02 Intr + 155805 156028 224 1 2 66 60 153 0.786 8.15 8.03 Intr + 156602 156743 142 1 1 29 57 163 0.806 7.23 8.04 Intr + 156976 157126 151 2 1 57 91 107 0.997 7.12 8.05 Intr + 157259 157382 124 2 1 67 58 152 0.996 10.69 8.06 Intr + 158060 158144 85 1 1 12 68 64 0.089 -3.81 8.07 Intr + 170624 170712 89 0 2 91 86 95 0.983 9.29 8.08 Intr + 171231 171335 105 2 0 48 83 166 0.827 12.51 8.09 Intr + 172251 172386 136 2 1 47 116 169 0.998 15.74 8.10 Intr + 172765 172856 92 2 2 -36 94 123 0.974 0.21 8.11 Intr + 174740 174891 152 1 2 99 27 293 0.797 23.26 8.12 Intr + 176232 176361 130 0 1 52 34 226 0.402 14.30 8.13 Intr + 176795 176909 115 1 1 76 74 105 0.998 7.92 8.14 Intr + 177804 177903 100 1 1 116 117 -44 0.598 0.47 8.15 Intr + 178064 178261 198 2 0 102 72 134 0.833 11.67 8.16 Intr + 178376 178425 50 2 2 75 89 52 0.990 2.32 8.17 Intr + 182118 182275 158 1 2 91 78 141 0.976 13.13 8.18 Intr + 182720 182782 63 1 0 61 94 67 0.842 3.51 8.19 Intr + 186504 186578 75 2 0 55 72 55 0.468 0.31 8.20 Term + 190078 190212 135 0 0 77 42 201 0.729 12.42 8.21 PlyA + 192897 192902 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 91918 92063 146 2 2 128 41 86 0.811 5.97 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578f:43358034_43561060|GENSCAN_predicted_peptide_1|357_aa MIKRGQGQSEATVPSAFGPTLSVKQELGQTGGNALELSGTYLTTLRQAAVQPSCNVDAVG GHNPKQLNAKQKTKYHMFSLTTSRLIKKKREKNQIDAIKNDKGDITTNPTEIQTTIREYY KHLSANKLENLEEMDKFLDTYTLPRLNQEEVESLNRPITGSEIVAIINSLPTKKSPGPDG FTAEFYQRYKEELVPFLLKLFQPTEKKRILPNSFYEASIILIPKPGTDTTKKENFRPISL RNIDVKILNKILANQIQQHIKKLIHHDQVGFIPGMQGWFNIRKSINVIQHINRTKDKNHM IISIDAEKAFDKIQQPFMLKTFNKLDIDGTYLKIIRAIYDKPTANIILNGQNWKHSL >gi568815578f:43358034_43561060|GENSCAN_predicted_CDS_1|1074_bp atgattaaacgagggcagggacaaagtgaggccacggttccctcggccttcggacccact ctcagcgtcaaacaagagctgggccagacagggggaaatgcactggaactctctggcacc tacctgaccaccctcagacaggcagcagtgcagcccagctgcaacgtggatgcagttgga ggccataatcctaagcaacttaacgcaaaacagaaaaccaaataccacatgttctcactt accactagcagactaataaagaagaaaagagagaagaatcaaatagatgcaataaaaaat gataaaggggatatcaccaccaatcccacagaaatacaaactaccatcagagaatactac aaacacctctccgcaaataaactagaaaatctagaagaaatggataaattcctcgacaca tacactctcccaagactaaaccaggaagaagttgaatctctgaatagaccaataacaggc tctgaaattgtggcaataatcaatagcttaccaaccaaaaagagtccaggaccggatgga ttcacagccgaattctaccagaggtacaaggaggaactggtaccattccttctgaaacta ttccaaccaacagaaaagaagagaatcctccctaactcattctatgaggccagcatcatc ctgataccaaagccgggcacagacacaaccaaaaaagagaattttagaccaatatccttg aggaacattgatgtaaaaatcctcaataaaatactggcaaaccaaatccagcagcacatc aaaaagcttatccaccatgatcaagtgggcttcatccctgggatgcaaggctggttcaat atacgcaaatcaataaatgtaatccagcatataaacagaaccaaagacaaaaaccacatg attatctcaatagatgcagaaaaggcctttgacaaaattcaacaacccttcatgctaaaa actttcaataagttagatattgatgggacatatctcaaaataataagagctatctatgac aaacccacagccaatatcatactgaatgggcaaaactggaagcattccctttga >gi568815578f:43358034_43561060|GENSCAN_predicted_peptide_2|282_aa MSELPFTIASKRIKYLGIQLTRDVKDLVKENYKPLLNEIKEDTNKWKNIPCSWVGRINIV KMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIAKSILSQKNKAGGIMLP DFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEIMPHIYNYLIFDKPDKNKKWGKDSLFH KWCWENWLAIYRKLKLGPFLTPYTKINPRWIKDLNVRPKTIKTLEENLGITIQDIGMGKD FMSKTPKAMATKAKIDKWDLIKLKSFCTAKETTITVNRQPTK >gi568815578f:43358034_43561060|GENSCAN_predicted_CDS_2|849_bp atgagtgaactcccattcacaattgcttcaaagagaataaaatacctaggcatccaactt acaagggacgtgaaggacctcgtcaaggagaactacaaaccactgctcaatgaaataaaa gaggacacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatattgtg aaaatggccatactgcccaaggtaatttatagattcaatgccatccccatcaagttacca atgactttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaaga gcccgcattgccaagtcaatcctaagccaaaagaacaaagctggaggcatcatgctacct gacttcaaactgtactacaaggctacagtaaccaaaacagcatggtactggtaccaaaac agagatatagatcaatggaacagaacagagccctcagaaataatgccgcatatctacaac tatctgatctttgacaaacctgacaaaaacaagaaatggggaaaggattccctatttcat aaatggtgctgggaaaactggctagccatatatagaaagctgaaactgggtcccttcctt acaccttatacaaaaattaatccaagatggattaaagacttaaatgttagacctaaaacc ataaaaaccctagaagaaaacctaggcattaccattcaggacataggcatgggcaaggac ttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctc attaaactaaagagcttctgcacagcaaaagaaactaccatcacagtgaacaggcaacct acaaaatag >gi568815578f:43358034_43561060|GENSCAN_predicted_peptide_3|232_aa MDAGNDCQWQLTAQEETKAESEQTSGFNSNGPNLQLRKSEEHVKLYDRDADNKIQTMGHR PEAGWHQLRSPKASGGGATAPLPAATPRPQRPPPPPPQRPPGFLLFPTTQGHRGYDGLAW AQRPDSCGIVDFVFVVLVVRESQPEVQSRLSFSLLPCVQGGKAEQKELASGVILSPFQVL ALPLLYQNLGKTDATALGSGSQIKHASESPERLVENTGGWAPPPEFDSVDLG >gi568815578f:43358034_43561060|GENSCAN_predicted_CDS_3|699_bp atggatgcaggtaatgattgccaatggcagctgacagcacaagaagagaccaaagctgaa tctgaacaaacctccggattcaactccaacggaccgaatctacaactcagaaaatctgag gaacacgtgaaattatatgacagggatgcagacaacaaaatccagactatgggacatagg ccggaggctggctggcatcagctgcgttccccgaaggcgtcaggaggtggcgccactgct cctctgccggctgcaactccgcgcccccagcgccccccaccgcccccgcctcagaggccg ccagggttcctgctcttccctacaacgcagggtcacaggggatatgatggcttagcttgg gctcagaggcctgacagctgtggcattgtggactttgtcttcgtggtgctcgtggtacga gagagtcagcctgaagtccagtccaggctttccttttccctgctgccttgtgttcaagga ggcaaggcagagcagaaagagctggcctctggtgttatactgagcccatttcaagtccta gctctgccattgctatatcagaacctgggcaaaacagatgccactgctttgggcagtgga tctcaaattaagcatgcatcagagtcacctgaaaggctcgtagaaaacacaggtggctgg gccccacctccagagtttgattcagtggatcttgggtag >gi568815578f:43358034_43561060|GENSCAN_predicted_peptide_4|293_aa MAFHHVGQTGLELLTSGVPELKEIETQKTLQKINESRSWFFEKSNKIHRSLARLIKKKRE VNQIDAIKNDKGDITTDPTEIQTTIRDYYKHLYANKLENLEEMDKFLGTYTFPRLNQKEV ESLNRPITGSETEAIINSLPTKKSPGPDGFTAEFYQRYKEEMVPFLLKLFQLIEKEGILP NSFYEASIILIPKPGRDTTKKENFRLISLMNIDAKILSKILENRIQQHIKKLIHHDQVGF IPGMQGWFNIYKSINVIQHINRTKDKNHTIISIDAEKAFDKIQQRFMLKLSIN >gi568815578f:43358034_43561060|GENSCAN_predicted_CDS_4|882_bp atggcgtttcaccatgttggtcagactggtctcgaactcctgacctcaggtgtacctgaa ctgaaggagatagagacacaaaaaacccttcaaaaaatcaatgaatccaggagctggttt tttgaaaagagcaacaaaattcatagatcgctagcaagactaataaagaagaaaagagag gtgaatcaaatagatgcaataaaaaatgataaaggggatatcaccaccgatcccacagaa atacaaactaccatcagagactactataaacacctctatgcaaataaactagaaaatcta gaagaaatggataaattcctgggcacatacaccttcccaagactaaaccagaaagaagtt gaatccctgaatagaccaataacaggctctgaaactgaggcaataattaatagcctacca accaaaaaaagtccaggaccagacggattcacagccgaattctaccagagatacaaggag gagatggtaccattccttctgaaactattccaattaatagaaaaagagggaatcctccct aactcattttatgaggccagcatcatcctgataccaaaacctggcagagacacaacaaaa aaagagaattttagactaatatccctgatgaacatcgatgcaaaaatcctcagcaaaata ctggaaaaccgaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggcttc atccctgggatgcaaggctggttcaacatatacaaatcaataaatgtaatccagcatata aacagaaccaaagacaaaaaccacacgattatctcaatagatgcagaaaaggcctttgac aaaattcaacagcgcttcatgctaaaactctcaataaattag >gi568815578f:43358034_43561060|GENSCAN_predicted_peptide_5|42_aa MTEKFANSWSTPCQALTMHHHTYFSTRLWKVSYYDVHATDEK >gi568815578f:43358034_43561060|GENSCAN_predicted_CDS_5|129_bp atgactgaaaagtttgccaactcctggtccacaccgtgccaggcactgacgatgcaccac cacacttacttctcaactcgactctggaaggtgtcctactacgatgtccatgctacagat gagaaatga >gi568815578f:43358034_43561060|GENSCAN_predicted_peptide_6|197_aa MPRVYIGRLSYNVREKDIQRFFSGYGRLLEVDLKNGYGFVEFEDSRDADDAVYELNGKEL CGERVIVEHARGPRRDRDGYSYGSRSGGGGYSSRRTSGRDKYGPPVRTEYRLIVENLSSR CSWQDLKDFMRQAGEVTYADAHKERTNEGVIEFRSYSDMKRALDKLDGTEINGRNIRLIE DKPRTSHRRSYSGSRSR >gi568815578f:43358034_43561060|GENSCAN_predicted_CDS_6|594_bp atgccgcgcgtctacataggacgcctgagctacaacgtccgggagaaggacatccagcgc tttttcagtggctatggccgcctcctcgaagtagacctcaaaaatgggtacggcttcgtg gagttcgaggactcccgcgacgccgacgacgccgtttacgagctgaacggcaaggagctc tgcggcgagcgcgtgatcgtagagcacgcccggggcccgcgtcgcgatcgcgacggctac agctacggaagccgcagtggtggaggtggatacagcagtcggagaacatctggcagagac aaatacggaccacctgttcgtacagaatacaggcttattgtagaaaatctttctagtcgg tgcagttggcaagatttaaaggattttatgcgacaagcaggtgaagtaacctatgcggat gcccacaaggaacgaacaaatgagggtgtaattgagtttcgctcctactctgacatgaag cgtgctttggacaaactggatggcacagaaataaatggcagaaatattaggcttattgaa gataagccacgcacaagccataggcgatcttactctggaagcagatccaggtaa >gi568815578f:43358034_43561060|GENSCAN_predicted_peptide_7|138_aa MRMFLKCVPQKLVAAQEAQLMCCGRDLVGGDRDRDLLRDFSYSSLDLLRECEWEPRSDLG LDLLFDLDFLPFDRERDRPLLRDLELDKERPIFSILMTQHNVIYRERLFEILRDLLRLLL RLLLRDRLLDRDLLGEKQ >gi568815578f:43358034_43561060|GENSCAN_predicted_CDS_7|417_bp atgaggatgtttctgaagtgtgtgccccagaagctcgtggcagctcaggaggctcagctc atgtgttgtgggagggacctggtgggaggggatcgggaccgagacctgcttcgagatttc tcatactcatccttagatctgcttcgagaatgtgaatgggagccccgatcagacttgggc ttagatttgctctttgatctagatttcctgccttttgatcgagaacgtgatcgacctttg ctccgcgacctggaactggacaaggaaagaccgattttctcaatacttatgactcaacac aacgttatttaccgggagcgactttttgagatacttcgagatctactgcggctgctcctg cgactcctacttcgtgaccgtcttctagatcgagacctattaggagaaaaacaatga >gi568815578f:43358034_43561060|GENSCAN_predicted_peptide_8|825_aa MEGHAGMEGHAEMEMLRTLKGPSTGEVSMHLVAGDSPGSGPHLPATAFIIPASSATLGLP SSALDVSCFPREPIHVGAPEQVAGCEPVSATVLPQLSAGPASSSTSTVRLLEWTEAAAPP PGGGLRFRISEYKPLNMAGVEQPPSPELRQEGVTEYEDGGAPAGDGEAGPQQAEDHPQNP PEDPNQDPPEDDSTCQCQACGPHQAAGPDLGSSNDGCPQLFQERSVIVENSSGSTSASEL LKPMKKRKRREYQSPSEEESEPEAMEKQEEGKDPEGQPTASTPESEEWSSSQPATGEKKE CWSWESYLEEQKAITAPVSLFQDSQAVTHNKNGFKLGMKLEGIDPQHPSMYFILTVAEVC GYRLRLHFDGYSECHDFWVNANSPDIHPAGWFEKTGHKLQPPKGYKEEEFSWSQYLRSTR AQAAPKHLFVSQSHSPPPLGFQVGMKLEAVDRMNPSLVCVASVTDVVDSRFLVHFDNWDD TYDYWQLSPLQRPPHSFLVNMKLEAVDRRNPALIRVASVEDVEDHRIKIHFDGWSHGYDF WIDADHPDIHPAGWCSKTGHPLQPPLGPREPSSASPGGCPPLSYRSLPHTRTSKYSFHHR KCPTPGCDGSGHVTGKFTAHHCLSGCPLAERNQSRLKAELSDSEASARKKNLSGFSPRKK PRHHGRIGRPPKYRKIPQEDFQTLTPDVVHQSLFMSALSAHPDRSLSVCWEQHCKLLPGV AGISASTVAKWTIDEVFGFVQTLTGCEDQARLFKDESQTARPQTLAFALKNLEPGLWQLP WMIDGEAFLLLTQADIVKIMSVKLGPALKIYNAILMFKNADDTLK >gi568815578f:43358034_43561060|GENSCAN_predicted_CDS_8|2478_bp atggaggggcatgctgggatggaggggcatgctgaaatggagatgctgaggacactgaag gggccttccacaggggaggtcagcatgcacttggtggccggagacagccccggttctggt cctcacctgcccgcaactgccttcatcattccagccagttcggccaccctcggcctgccc agcagtgccctggatgtgtcttgctttccccgggagccaatccatgtgggtgccccggag caagtggccggctgcgaaccagtttctgccaccgtcctgccgcagcttagcgccgggccg gccagctccagcaccagcacagtgcggcttctggaatggacagaggccgcggccccgccc ccagggggcggcctgcggttccggataagcgagtataagccgctgaacatggcgggagtg gagcagcccccgagccccgagctgcggcaggaaggcgtgaccgaatacgaagatggcggg gccccggcgggagatggcgaggcgggcccccaacaggcggaggaccacccccagaatcct ccagaagatcccaatcaggaccccccagaggatgatagcacctgtcagtgccaggcgtgc gggcctcaccaagccgcgggtccagatcttggttcctctaatgatggctgccctcagctg ttccaggagcggtcagtcatagtggagaactcctcaggctctaccagcgcttctgagctc ctcaaacccatgaagaagaggaagcgcagggaataccagagcccatcagaggaggagtcg gagccagaggccatggagaagcaagaagaaggaaaggacccagagggacaacccactgct agcaccccagagagtgaggagtggagcagcagccagcctgcaacaggtgagaagaaggaa tgctggtcgtgggagtcctacctagaggagcagaaggccattactgctccagtcagcctc ttccaggactcccaggcagtcactcacaacaagaatggcttcaaactgggcatgaagttg gaaggcattgaccctcaacacccgtccatgtacttcatcctcaccgtggctgaggtatgt ggctatcgcctacgcctgcactttgatgggtattctgagtgccatgacttctgggtcaat gccaactcccctgacattcaccctgctggctggttcgagaagacgggccacaagctgcag cctcccaaaggttacaaggaggaggagttcagctggagccagtacctgcgcagcacaaga gctcaggctgcccccaagcacctgtttgtgagccagagccacagtcccccacccctgggc ttccaggtgggcatgaagctggaggctgttgaccgcatgaacccgtcccttgtctgcgtg gccagtgtgaccgatgtggtggacagccgcttcctggtgcactttgacaactgggatgat acttatgactactggcagctgtcccctctgcagcgaccccctcacagcttcctggtcaat atgaagctggaggctgtggaccgcaggaacccagccctgattcgcgtggccagcgtggag gatgtggaggaccatcggataaagatccactttgatggctggagtcatggctatgatttc tggatcgacgctgaccacccagacatccaccctgccggctggtgctccaagacaggacat cccctgcagcctcctctcggacccagagagcccagctctgcctcccctgggggctgtccc cctctcagctataggagcctgccccacactaggacctccaaatacagctttcaccaccgg aagtgccccactcctggttgcgacggctctggccatgtcacaggcaagttcacagctcac cattgcctctcaggctgcccactggctgagaggaaccagagccggctgaaagcggagctg tctgactcggaggcctcagcccgcaagaagaacctctcaggcttctccccaaggaagaag cctcgccatcacggccgaattggacgccctccgaagtatcgaaagattccgcaggaagat ttccagaccctcacgcccgatgtcgtgcaccagtccctcttcatgtcagccctgtcggcc caccctgaccgctcactctcagtgtgctgggagcagcactgcaagctcctgccaggagta gcgggcatctcagcctcgacagtcgccaagtggaccatcgatgaggtcttcggctttgtt cagaccctgacaggttgtgaggaccaagcacgcctcttcaaagacgagtctcagactgcc agaccacagactctggcctttgctttgaagaacctggaacctgggctttggcagttaccg tggatgattgacggcgaggccttccttttgctgacacaggcggacattgtgaagatcatg agcgtcaagctgggcccagccttgaagatctataacgccattctcatgttcaaaaacgct gatgacaccttaaagtga