GENSCAN 1.0 Date run: 4-Nov-116 Time: 17:23:34 Sequence gi568815597f:167265470_167515807 : 250338 bp : 41.00% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 7806 8014 209 1 2 81 44 133 0.406 4.72 1.02 PlyA + 8440 8445 6 1.05 2.00 Prom + 28274 28313 40 -5.15 2.01 Init + 34648 34723 76 0 1 95 89 74 0.937 9.50 2.02 Intr + 52402 52609 208 2 1 55 42 140 0.001 3.31 2.03 Intr + 53545 53587 43 1 1 119 96 14 0.006 2.62 2.04 Intr + 57427 57454 28 0 1 73 80 39 0.001 -1.63 2.05 Intr + 67001 67066 66 0 0 90 110 87 0.551 9.06 2.06 Term + 68372 68445 74 0 2 100 49 66 0.978 0.99 2.07 PlyA + 69012 69017 6 1.05 3.00 Prom + 70320 70359 40 -1.65 3.01 Init + 72048 72160 113 2 2 48 -16 178 0.073 3.13 3.02 Intr + 79444 79591 148 1 1 81 72 72 0.029 4.22 3.03 Intr + 91007 91091 85 1 1 78 63 50 0.002 -0.03 3.04 Intr + 99998 100098 101 0 2 86 91 37 0.038 2.71 3.05 Intr + 104692 104745 54 0 0 55 115 28 0.562 0.46 3.06 Intr + 106448 106567 120 1 0 49 98 164 0.999 13.27 3.07 Intr + 108639 108827 189 2 0 59 109 200 0.844 18.06 3.08 Intr + 110560 110686 127 0 1 76 84 162 0.834 13.93 3.09 Intr + 118388 118482 95 0 2 94 82 11 0.778 -0.14 3.10 Intr + 124119 124292 174 2 0 87 91 194 0.992 18.81 3.11 Intr + 130817 130958 142 1 1 82 111 98 0.963 10.51 3.12 Intr + 132525 132664 140 1 2 92 78 142 0.994 12.86 3.13 Intr + 133717 133896 180 0 0 37 99 114 0.986 6.54 3.14 Intr + 135982 136087 106 0 1 40 115 81 0.805 4.87 3.15 Intr + 146598 146835 238 1 1 99 82 244 0.837 20.65 3.16 Intr + 147557 147645 89 2 2 78 86 88 0.991 6.30 3.17 Term + 150031 150341 311 2 2 69 49 157 0.950 4.14 3.18 PlyA + 150409 150414 6 1.05 4.04 PlyA - 150964 150959 6 1.05 4.03 Term - 152213 152133 81 2 0 77 35 40 0.030 -5.69 4.02 Intr - 158976 158876 101 1 2 69 94 113 0.558 8.91 4.01 Init - 159634 159355 280 1 1 56 59 135 0.643 4.62 4.00 Prom - 163457 163418 40 -7.55 5.00 Prom + 164251 164290 40 -5.65 5.01 Init + 164437 164532 96 0 0 45 75 65 0.774 1.26 5.02 Intr + 165728 166001 274 1 1 84 71 91 0.599 3.19 5.03 Intr + 166768 166893 126 2 0 127 111 76 0.953 13.53 5.04 Term + 167450 167613 164 0 2 45 54 117 0.291 1.12 5.05 PlyA + 168091 168096 6 1.05 6.09 PlyA - 168161 168156 6 -1.75 6.08 Term - 169965 169366 600 0 0 119 35 568 0.810 48.14 6.07 Intr - 173181 173101 81 0 0 96 70 146 0.990 12.52 6.06 Intr - 173931 173875 57 0 0 84 111 62 0.985 6.26 6.05 Intr - 175298 175195 104 0 2 71 98 101 0.906 8.37 6.04 Intr - 180257 180171 87 0 0 34 82 72 0.118 0.22 6.03 Intr - 190826 190708 119 1 2 85 71 78 0.067 4.99 6.02 Intr - 191592 191456 137 0 2 -33 52 150 0.444 -2.25 6.01 Init - 192220 192020 201 1 0 71 59 194 0.969 13.72 6.00 Prom - 192574 192535 40 -7.85 7.11 PlyA - 193705 193700 6 1.05 7.10 Term - 195623 195501 123 2 0 77 48 121 0.975 4.40 7.09 Intr - 197156 196690 467 0 2 46 75 237 0.414 10.12 7.08 Intr - 197687 197535 153 0 0 24 91 129 0.180 5.92 7.07 Intr - 207931 207760 172 1 1 48 71 121 0.009 4.99 7.06 Intr - 212079 211843 237 0 0 36 98 95 0.031 2.19 7.05 Intr - 220197 220079 119 1 2 44 89 92 0.088 4.16 7.04 Intr - 231142 230984 159 2 0 51 26 195 0.289 8.64 7.03 Intr - 232654 232492 163 1 1 12 38 104 0.003 -3.57 7.02 Intr - 237032 236976 57 2 0 88 77 34 0.007 0.46 7.01 Init - 238626 238585 42 0 0 97 94 30 0.160 5.00 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 76544 76275 270 2 0 71 41 206 0.916 9.23 S.002 Term - 91169 91079 91 1 1 73 41 137 0.898 3.71 S.003 Init - 93461 93379 83 2 2 68 61 68 0.858 2.59 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:167265470_167515807|GENSCAN_predicted_peptide_1|69_aa XTQNKPSVDLPFWGLEDGGPLVTAPVGTASVATVCGGSNPTPLYTALVEVLHEASAPAAE SCLEFPYIL >gi568815597f:167265470_167515807|GENSCAN_predicted_CDS_1|210_bp ngtacacagaacaagccgtcagtagatctaccattctggggtctggaggacggtggccct cttgtcacagctccagtaggcactgcctctgtggcgactgtgtgtgggggctccaacccc actcccctctacactgccctcgtagaggttttgcatgaggcctctgctcctgcagcagag tcctgcctggagtttccatacatcctctga >gi568815597f:167265470_167515807|GENSCAN_predicted_peptide_2|164_aa MDTAGGYYPKRVIAETETKIPHFLTYDKSKGGFITVSYFSQESGSASADYTQTTQIKSTI IIEITESPRVFIHFNGLIAANHLQLLQALQFLALRFSSIICDSFLICPQEHLKRFSTLDS RMNNPSETSKPSMESGDGNTALVPMPVPVLQPPSLGVGNVCREK >gi568815597f:167265470_167515807|GENSCAN_predicted_CDS_2|495_bp atggatacagctggaggctattaccctaagcgagttattgcagaaacagaaaccaaaata ccacattttctcacttatgataaaagcaaaggtggctttatcacagtgagctacttctcg caggagtcgggatctgcatctgcagactatacacagacaacacagattaaaagcacgatc atcattgaaatcacagagtctccaagagtttttatccattttaatgggttaatagctgct aatcatttgcagctccttcaagcactccagttcctggcattaaggttttcttccatcatc tgtgacagcttcttgatctgtccccaggaacaccttaagcggttttccacactggactca agaatgaacaatccgtcagaaaccagtaaaccatctatggagagtggagatggcaacaca gcactggttcctatgccagtgccggtgcttcagccacctagcttaggtgtgggaaatgtt tgcagagagaaatag >gi568815597f:167265470_167515807|GENSCAN_predicted_peptide_3|803_aa MFGFKNVRITEEAASANEEAADLKFSDAMEKISEGRLWDIDEVGSRHPQQTNTETENETP HFLTHKWELNSGNTWTQGGEHHIPGVRALEYDRSSSSVTSGKYPSFTASNVLPVEGTQTN GLDFQKQPVPVGGAISTAQAQAFLGHLHQVQLAGTSLQAAAQSLNVQSKSNEESGDSQQP SQPSQQPSVQAAIPQTQLMLAGGQITGLTLTPAQQQLLLQQAQAQAQLLAAAVQQHSASQ QHSAAGATISASAATPMTQIPLSQPIQIAQDLQQLQQLQQQNLNLQQFVLVHPTTNLQPA QFIISQTPQGQQGLLQAQNLLTQLPQQSQANLLQSQPSITLTSQPATPTRTIAATPIQTL PQSQSTPKRIDTPSLEEPSDLEELEQFAKTFKQRRIKLGFTQGDVGLAMGKLYGNDFSQT TISRFEALNLSFKNMCKLKPLLEKWLNDAENLSSDSSLSSPSALNSPGIEGLSRRRKKRT SIETNIRVALEKSFLENQKPTSEEITMIADQLNMEKEVIRVWFCNRRQKEKRINPPSSGG TSSSPIKAIFPSPTSLVATTPSLVTSSAATTLTVSPVLPLTSAAVTNLSVTASTSEASSA SETSTTQTTSTPLSSPLGTSQVMVTASGLQTAAAAALQGAAQLPANASLAAMAAAAGLNP SLMAPSQFAAGGALLSLNPGTLSGALSPALMSNSTLATIQALASGGSLPITSLDATGNLV FANAGGAPNIVTAPLFLNPQNLSLLTSNPVSLVSAAAASAGNSAPVASLHATSTSAESIQ NSLFTVASASGAASTTTTASKAQ >gi568815597f:167265470_167515807|GENSCAN_predicted_CDS_3|2412_bp atgtttggctttaaaaatgtcaggataacggaagaagcagcttctgccaacgaagaggca gcagacctcaagttctcagatgccatggagaaaatcagtgaaggccgtctgtgggacatt gatgaagttggaagccgtcatcctcagcaaactaacacagaaacagaaaacgaaacacca cattttctcactcataagtgggagttgaacagtgggaacacatggacacagggaggggaa catcacataccaggggtgagggccttggagtatgatagaagtagtagtagtgttaccagt ggcaaatatccaagtttcacagcatcaaatgtgttaccggtagaaggcacacaaaccaat ggtctggactttcagaagcagcctgtgcctgtaggaggagcaatctcaacagcccaggcg caggctttccttggacatctccatcaggtccaactcgctggaacaagtttacaggctgct gctcagtctttaaatgtacagtctaaatctaatgaagaatcgggggattcgcagcagcca agccagccttcccagcagccttcagtgcaggcagccattccccagacccagcttatgcta gctggaggacagataactgggcttactttgacgcctgcccagcaacagttactactccag caggcacaggcacaggcacagctgctggctgctgcagtgcagcagcactccgccagccag cagcacagtgctgctggagccaccatctccgcctctgctgccacgcccatgacgcagatc cccctgtctcagcccatacagatcgcacaggatcttcaacaactgcaacagcttcaacag cagaatctcaacctgcaacagtttgtgttggtgcatccaaccaccaatttgcagccagcg cagtttatcatctcacagacgccccagggccagcagggtctcctgcaagcgcaaaatctt ctaacgcaactacctcagcaaagccaagccaacctcctacagtcgcagccaagcatcacc ctcacctcccagccagcaaccccaacacgcacaatagcagcaaccccaattcagacactt ccacagagccagtcaacaccaaagcgaattgatactcccagcttggaggagcccagtgac cttgaggagcttgagcagtttgccaagaccttcaaacaaagacgaatcaaacttggattc actcagggtgatgttgggctcgctatggggaaactatatggaaatgacttcagccaaact accatctctcgatttgaagccttgaacctcagctttaagaacatgtgcaagttgaagcca cttttagagaagtggctaaatgatgcagagaacctctcatctgattcgtccctctccagc ccaagtgccctgaattctccaggaattgagggcttgagccgtaggaggaagaaacgcacc agcatagagaccaacatccgtgtggccttagagaagagtttcttggagaatcaaaagcct acctcggaagagatcactatgattgctgatcagctcaatatggaaaaagaggtgattcgt gtttggttctgtaaccgccgccagaaagaaaaaagaatcaacccaccaagcagtggtggg accagcagctcacctattaaagcaattttccccagcccaacttcactggtggcgaccaca ccaagccttgtgactagcagtgcagcaactaccctcacagtcagccctgtcctccctctg accagtgctgctgtgacgaatctttcagttacagcctccacctccgaggcatccagtgcc agtgagaccagcacaacacagaccacctccactcctttgtcctcccctcttgggaccagc caggtgatggtgacagcatcaggtttgcaaacagcagcagctgctgcccttcaaggagct gcacagttgccagcaaatgccagtcttgctgccatggcagctgctgcaggactaaaccca agcctgatggcaccctcacagtttgcggctggaggtgccttactcagtctgaatccaggg accctgagcggtgctctcagcccagctctaatgagcaacagtacactggcaactattcaa gctcttgcttctggtggctctcttccaataacatcacttgatgcaactgggaacctggta tttgccaatgcgggaggagcccccaacatcgtgactgcccctctgttcctgaaccctcag aacctctctctgctcaccagcaaccctgttagcttggtctctgccgccgcagcatctgca gggaactctgcacctgtagccagccttcacgccacctccacctctgctgagtccatccag aactctctcttcacagtggcctctgccagcggggctgcgtccaccaccaccaccgcctcc aaggcacagtga >gi568815597f:167265470_167515807|GENSCAN_predicted_peptide_4|153_aa MQLLESPLDIRLSNNNNNNNKLKYLDIFGHVKGKKVKCVCSSQLKGAFGVFPSTQSPEKK QNPEASEEKGLASTPPRKRQVQKLTCGQLSETQDEGMEVQTYAGGERFLRSETSCGLSNA HVPMHAELEREKKCSLGTTRDLSKHPQPPNGNL >gi568815597f:167265470_167515807|GENSCAN_predicted_CDS_4|462_bp atgcagcttctagagagcccattggatataagattgtcaaataataataataataataat aaacttaaatatttggatatttttggtcatgtcaaaggaaaaaaagtgaaatgtgtatgc agcagtcagcttaaaggagcctttggtgtcttcccttctacccaaagtcctgaaaagaaa cagaaccctgaggcctcagaggagaaggggttggcctctaccccaccaaggaaaaggcaa gtacagaagttgacgtgtggccagctgtcagaaacacaggatgaggggatggaagtacag acttatgcgggaggagagcgtttcctgcggagtgagacctcctgcggactgagcaatgca cacgtgcccatgcacgcagagctcgaaagggagaaaaagtgcagtcttgggaccaccaga gatttaagcaagcacccacagccaccaaatgggaacctttaa >gi568815597f:167265470_167515807|GENSCAN_predicted_peptide_5|219_aa MPSDDVRGSCWNPKFSGALTVMVLDFLFENVKGGTTWPSTVTLKGGEGDNRSQIYSRSEA TQTQSDTDIRACASICAFSGDFTKQTQQLSCERQCARLPGRTRNRQETGLPAPPANQRAQ GQGRPVILVFSWSRTGPQGWPQTCWNACPPGPGARKEKDQQQPHTAVGKSWCHQQAKCGG LDLAPCQVVPLPLKGRRQLLPGKGHRAPCPSGAGCLGVK >gi568815597f:167265470_167515807|GENSCAN_predicted_CDS_5|660_bp atgcctagtgatgatgtgcgtggctcctgttggaatcccaaattttcaggggctttaaca gtcatggtcttggatttcctttttgaaaatgtcaagggaggcacaacatggcccagtaca gttaccttgaaaggaggtgaaggtgacaacagaagccaaatttacagcaggagtgaagcc actcagacacagagtgatacagacattagggcatgtgctagcatctgcgctttctctggg gactttacaaaacagactcaacaactcagctgtgagaggcagtgcgcccgcctcccaggg agaacgaggaaccgccaggagacaggtctacctgcaccaccggcaaaccagagggcccaa ggccagggccgcccggtgattctagtgttttcttggtcacgaacaggcccacaaggctgg ccccaaacctgctggaatgcctgtccccctggtccaggggccaggaaggagaaggatcaa cagcagccacacacagctgttggcaagagctggtgccaccagcaggcaaaatgtgggggc cttgatcttgccccttgtcaggtggtgccccttccactgaagggacgccggcagctccta cctggtaaaggccatcgtgccccttgcccctccggcgctggttgtttgggagtgaaatga >gi568815597f:167265470_167515807|GENSCAN_predicted_peptide_6|461_aa MKAKKGLETINGEEGKGQYLDGELWAIFRKTQERLLMNRWTDKRVLPPELTDLLAHLTAG LPFQALQASAPTKPVLRKRLEVEKAGLERDKEIPGLSSLAYSKRSRSSGSRVSWVGASTA PGTGSAGMYSRSSTQECPTLEGKIKFSALHALEVHSLAQRQTDKQRNSSKSHPEKPDSVR IEAQSFGLLDPKLCYLLDGILFIYGVILTALFLRVKFSRSADAPAYQQGQNQLYNELNLG RREEYDVLDKRRGRDPEMGGKPQRRKNPQEGLYNVSRGDLTFDLGKLEEGLEEGSRGRAG GRAGGGLQRKGWRKGWRRAPEEGLEEGLEEGSGGRAGGRAGGGLRRKGWRKGWRRAPEEG LEEGLEEGSGGRAGGRAGGGLRRKGWRKGWRRAPEEGLEEGLEEGSGGRAGGRAGGGLRR KGWRRAPEEGLEEGLEEGSRGRRLLLTLWSLSVQTFPSWIE >gi568815597f:167265470_167515807|GENSCAN_predicted_CDS_6|1386_bp atgaaagctaagaaaggcttggagaccatcaatggagaggaagggaaaggacagtatctg gatggagagctctgggcaatatttaggaagactcaagagcgcctgctcatgaacagatgg acagacaagcgtgtgctccctcctgaactcacagacttactggctcacctcacagcaggg ctccccttccaggctttgcaggcatcagcacccaccaagccggtcctcagaaaacgactg gaggtagagaaagcagggctggaaagagacaaagaaattccaggactgtcctcactagca tattctaaacgaagcagatcttcaggcagccgtgttagctgggtgggagccagcacagcc ccaggcacaggcagtgccggcatgtattctcggagctccacacaggaatgtcccactcta gaagggaagataaaatttagcgctctccatgcactggaagttcatagtctggctcagaga cagacagacaaacaacgcaattcaagtaaatctcacccagagaaaccagattctgtaagg atagaggcacagagctttggcctgctggatcccaaactctgctacctgctggatggaatc ctcttcatctatggtgtcattctcactgccttgttcctgagagtgaagttcagcaggagc gcagacgcccccgcgtaccagcagggccagaaccagctctataacgagctcaatctagga cgaagagaggagtacgatgttttggacaagagacgtggccgggaccctgagatgggggga aagccgcagagaaggaagaaccctcaggaaggcctgtacaatgtgagtagaggagacctc acatttgaccttggaaagttggaggaagggctggaggagggctccagaggaagggctgga ggaagggctggaggagggctccagaggaagggctggaggaagggctggaggagggctccg gaggaagggctggaggaagggctggaggagggctccggaggaagggctggaggaagggct ggaggagggctccggaggaagggctggaggaagggctggaggagggctccggaggaaggg ctggaggaagggctggaggagggctccggaggaagggctggaggtagggctggaggaggg ctccggaggaagggctggaggaagggctggaggagggctccggaggaagggctggaggaa gggctggaggagggctccggaggaagggctggaggaagggctggaggagggctccggagg aagggctggaggagggctccggaggaagggctggaggaagggctggaggagggctccaga ggaaggcggttgctcctcactctgtggtctttgtctgtccagaccttcccttcttggatc gagtaa >gi568815597f:167265470_167515807|GENSCAN_predicted_peptide_7|563_aa MNRIAYTQHPGSPEGQLLAELNVRHPFKDGTSEDWGRHNRTGLAEDATSQGPQDQAMAPS PIYSKICSRNQGEKSRPSVNYAARTVRARDRLGSFFLLSARSVESDSPRDGDLQGQRDAK PPCFRIRRQAAAFKQIPSQAVSHAGIPSPREPSTPASMLTKGGVRQLPNLRQLSVFQERQ SKTLSQGKNKKHFTASVPESKPTGGISEEGSSGHWNYTNGTFKSGPSSISGSVWKKRKAA LFVFLLALTTGIWAQGRLQVGRKQPYFRNNMQVAAALRWALEERKAQREGDGKQGAGETT LAPWAVEGSSGQLAISAESNPCSVATQKSHKQNPGRVPGMARRLPILQRELDQAKEAVPK FLGPGMPGIGPQKEQFSSFSGLDQGRPPVQLSPLGSRVCRYTCLHTQAPTQAHAETPRSP TWVVTSVSRRAIGGTALPEFLPIAAACSLNHTGLASSLDSLCGPQRPLLWSQGPSDILGG ASGTLAGNPRATLNPNLVHVLEYQRRILNPEWDEEPPKLEESQHVPSVAAAGTAGFVGPL IHAKYVIGECSQRVGWAHQGLPN >gi568815597f:167265470_167515807|GENSCAN_predicted_CDS_7|1692_bp atgaaccgcattgcctatacacagcaccccgggagcccagagggtcagcttctagcagaa ctcaatgtaagacacccttttaaagatggtacctcagaggactgggggagacacaacaga acaggactggcagaagatgccacatcccagggtccccaagatcaagctatggcaccatca cccatctacagcaagatctgtagcaggaaccagggagaaaagtctagaccctcagtaaac tatgcggccaggacagtccgagccagagatcgtctaggttccttcttcctactctctgcc cgctctgtagagtcagactctccacgggatggagatcttcaaggccagagagatgcaaaa cctccttgcttcaggatccggagacaagctgctgcttttaagcagatcccctctcaggca gtaagccatgccgggataccctccccacgggagcctagcacgccagccagcatgctgaca aaaggaggagtaaggcagctccctaatctcaggcagctctcagtgtttcaggaaaggcag agtaagaccctgtctcaaggaaaaaacaaaaaacacttcacagccagtgtgccagagtcc aagcccacaggagggatcagtgaggaaggaagctctgggcactggaactacacaaatggc actttcaagtcaggaccatcttccatctctgggtctgtttggaaaaaaagaaaagcagct ttgtttgtctttcttctggcactgaccactggaatttgggctcaaggcaggctccaggta ggcaggaagcaaccttattttcgaaacaatatgcaggttgcagctgccctcagatgggcc ttggaggagagaaaggcccagagagaaggagatggcaagcagggagcaggagaaaccaca cttgccccctgggcggtggagggcagctcgggacagctggctatctcagccgagagcaac ccatgctcagtggcaacgcagaaaagccacaaacagaacccaggaagggttcctgggatg gcccggcgactgcccatcctccagagagagctggatcaagcaaaagaagcagtccccaaa ttcttagggccagggatgccagggatcggcccccaaaaggaacagttctcctccttttct ggcttagaccagggcaggccccctgtgcagctctctcccttgggttctcgtgtatgtaga tacacttgtttacacacacaggctcctacacaggcacatgcagagacgcctaggtcccca acctgggttgtgacttctgtgtcgaggagagccataggaggcacagctttgccagagttc ctgcccattgctgcggcctgttccctaaaccacactggtctggcttcatcgctggactca ctgtgtggcccacagaggcctcttctatggtcacagggccccagtgacattttgggaggg gcttcagggacgctggctggtaacccaagagcaaccctaaatcccaacttggtccatgtc ctagaatatcaaaggagaatcttaaaccctgagtgggatgaggagcctcctaagcttgaa gagagccagcatgtccccagcgtggcagctgcaggcacggcaggctttgtggggcctttg atccacgcaaagtacgtcataggcgaatgctcacagcgcgtgggctgggcacaccaaggg ttgccaaactga