GENSCAN 1.0 Date run: 6-Nov-116 Time: 08:33:03 Sequence gi568815583f:80020345_80237625 : 217281 bp : 42.41% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 4946 5003 58 1 1 125 101 26 0.926 5.14 1.02 Term + 6247 6449 203 2 2 101 42 139 0.996 7.17 1.03 PlyA + 7769 7774 6 1.05 2.05 PlyA - 8637 8632 6 1.05 2.04 Term - 11523 11403 121 2 1 95 43 83 0.689 1.47 2.03 Intr - 12656 12602 55 0 1 77 115 21 0.599 0.82 2.02 Intr - 13252 13101 152 0 2 80 33 106 0.624 3.19 2.01 Init - 14058 13943 116 0 2 84 47 102 0.689 5.43 2.00 Prom - 19287 19248 40 -0.85 3.03 PlyA - 19892 19887 6 1.05 3.02 Term - 39083 38516 568 1 1 65 32 243 0.539 9.30 3.01 Init - 55979 55909 71 2 2 60 85 39 0.235 1.38 3.00 Prom - 78416 78377 40 -5.05 4.00 Prom + 84582 84621 40 -4.15 4.01 Init + 100001 100154 154 1 1 92 119 -14 0.839 2.29 4.02 Intr + 101368 101476 109 2 1 28 110 86 0.533 3.22 4.03 Intr + 102356 102456 101 2 2 73 9 138 0.174 3.23 4.04 Intr + 110836 110949 114 2 0 61 95 41 0.181 1.60 4.05 Intr + 113434 113615 182 2 2 39 66 123 0.294 3.87 4.06 Term + 117136 117255 120 0 0 98 33 155 0.334 8.49 4.07 PlyA + 118018 118023 6 1.05 5.00 Prom + 121907 121946 40 -7.05 5.01 Init + 125674 125688 15 0 0 64 81 4 0.099 -2.18 5.02 Intr + 131193 131314 122 2 2 60 82 120 0.331 6.97 5.03 Intr + 132682 132791 110 1 2 65 60 107 0.585 4.61 5.04 Intr + 136477 136682 206 2 2 104 37 65 0.598 1.00 5.05 Intr + 137716 137826 111 0 0 95 91 156 0.885 16.26 5.06 Intr + 139412 139533 122 1 2 78 69 83 0.612 3.77 5.07 Intr + 140066 140217 152 2 2 94 44 38 0.148 -1.11 5.08 Intr + 141902 141992 91 0 1 100 56 97 0.298 5.83 5.09 Intr + 147708 147805 98 0 2 117 74 153 0.963 15.53 5.10 Intr + 148080 148271 192 1 0 86 93 66 0.799 5.44 5.11 Intr + 151818 151904 87 1 0 60 36 112 0.025 2.22 5.12 Intr + 152670 152800 131 1 2 89 115 89 0.037 11.19 5.13 Intr + 152882 153103 222 1 0 -21 81 157 0.024 1.50 5.14 Intr + 154672 154747 76 0 1 109 95 78 0.072 8.77 5.15 Intr + 159297 159481 185 1 2 76 82 65 0.604 3.29 5.16 Intr + 159780 159881 102 2 0 66 117 72 0.675 7.35 5.17 Intr + 160698 160815 118 2 1 38 94 90 0.770 3.72 5.18 Intr + 160974 160997 24 1 0 131 65 51 0.729 4.18 5.19 Term + 165786 165865 80 1 2 123 50 90 0.799 5.65 5.20 PlyA + 165969 165974 6 1.05 6.14 PlyA - 166250 166245 6 1.05 6.13 Term - 168455 168318 138 2 0 92 42 83 0.579 1.08 6.12 Intr - 169147 168979 169 0 1 54 103 79 0.304 5.03 6.11 Intr - 173566 173241 326 1 2 56 98 119 0.164 3.35 6.10 Intr - 173951 173816 136 1 1 11 107 87 0.419 2.55 6.09 Intr - 177592 177448 145 2 1 43 62 72 0.149 -1.48 6.08 Intr - 177843 177674 170 2 2 55 45 116 0.106 2.67 6.07 Intr - 181346 181151 196 0 1 58 39 143 0.116 3.95 6.06 Intr - 181670 181363 308 1 2 97 22 250 0.351 14.47 6.05 Intr - 181797 181743 55 1 1 106 73 101 0.368 7.52 6.04 Intr - 188434 188277 158 0 2 55 38 122 0.079 2.63 6.03 Intr - 189330 189166 165 2 0 89 43 175 0.176 11.25 6.02 Intr - 191004 190818 187 1 1 78 48 110 0.412 3.83 6.01 Init - 216269 216221 49 2 1 94 58 50 0.194 1.77 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 102356 102473 118 2 1 73 39 112 0.810 1.83 S.002 Term - 152112 151781 332 1 2 42 39 223 0.890 6.63 S.003 Intr - 154879 154635 245 0 2 116 80 168 0.823 15.02 S.004 Init - 155211 155087 125 0 2 58 53 102 0.889 3.18 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583f:80020345_80237625|GENSCAN_predicted_peptide_1|86_aa MGPSSCRKTSSGVPLNLRYGILGRVTTSPAGSWYISQQGWASISERTRPLCWNQSSGTIR ILAEGNKSASCSESASLKASELPGRI >gi568815583f:80020345_80237625|GENSCAN_predicted_CDS_1|261_bp atgggaccgtctagttgcaggaaaacaagctcaggggtcccactgaatctacgttatggg atattaggaagagtaactacctccccagctggaagctggtatatctcgcagcagggctgg gcttctatttcagaaagaacacggccactttgctggaaccagagcagtggcacaatccga atcctggcggagggcaacaagtcagcttcatgctcagaatctgcttctctgaaggcctct gaattaccaggaagaatctaa >gi568815583f:80020345_80237625|GENSCAN_predicted_peptide_2|147_aa MEWESEKRLGFRQHAFSGLVPGVLVSRISVATEENLGGGPAVLACVQTCKFSKAHRLQCD QLSQFAQNWFGFSIVPGNPSSAGLRSLGLKTRSQKGRMNGETDHFEKRQWVGCWDSIGKQ VSVELIMEEEGDNNQSIISSVISAIMA >gi568815583f:80020345_80237625|GENSCAN_predicted_CDS_2|444_bp atggagtgggaaagtgagaaacggctgggtttcaggcagcatgcattttctggcttggtc cctggggtcttggtgtcccgtatttcagttgcaacagaagagaatttaggaggcggacca gctgtcctggcgtgtgtgcagacctgcaagttcagcaaagcccacagacttcagtgtgac caactgtcccagtttgcccagaactggtttggttttagcattgtcccaggaaacccctct agtgcaggactgaggtccttggggctgaaaactaggagccaaaagggcagaatgaatggt gaaactgatcattttgaaaagcggcagtgggttgggtgctgggactctattggtaaacaa gtttctgtggagctcattatggaggaggagggagacaataaccagtcaattatatccagt gtgataagtgctataatggcctaa >gi568815583f:80020345_80237625|GENSCAN_predicted_peptide_3|212_aa MKKNQNDNQILAWVAVSPICCDFCVPCSTALSGALPAAVSLPPIATGRRLRGAAETLRAW LCRFRYLCHLAGNGVSLSWTEAPKRVALHLPLVQEGILSFCLPMQAGASVCKRKVRAQGA YWKAVLCPVLNWDLEIEARREACMHTCAIHTCTGTSTPSSSLFQKHGVSSHLRGARWAAA LELGVIGRMAPPEVFQPSFGLTSITSRKYVTL >gi568815583f:80020345_80237625|GENSCAN_predicted_CDS_3|639_bp atgaagaagaatcagaatgacaatcagattctggcttgggtggcagtatcacccatttgc tgtgacttttgtgtcccttgctccactgccctctctggggcgcttcccgccgcggtttct ctacctccaatagctacgggaaggcgattaagaggtgccgcggagactttaagggcttgg ttatgtaggttcaggtacctttgccacctcgcagggaatggcgtgtccctgtcttggact gaggcaccaaaaagagttgcactccacctgccacttgtgcaggaaggaattctgagcttc tgtcttcccatgcaggccggcgcctcagtctgcaagcgcaaagtccgagctcagggcgca tactggaaagcagttctctgccctgtcctgaactgggacttggaaatagaagcaagacga gaagcgtgcatgcacacgtgtgcaatacatacatgcacaggcacaagcacccccagttcc agcctgttccaaaagcacggggtcagttcccacctgcgtggtgcacgctgggctgctgcg ctggaactaggggtgataggcagaatggctccccctgaagtttttcagccttcgttcggg ctcacgagcatcacgagcaggaagtacgtgacactttag >gi568815583f:80020345_80237625|GENSCAN_predicted_peptide_4|259_aa MAQETNHSQVPMLCSTGCGFYGNPRTNGMCSVCYKEHLQRQNSSNGRISPPATSVSSLSE SLPVQCTDGSVPEAQSALDSTSSSMQPSPVSNQSLLSESVASSQLDSTSVDKAVPETEDV QASVSDTAQQPSEEQSKSLEKPKQKKNRCFMCRKKVGLTVGFMIRTAITYKAANPSLEGK KINTSCQSFGCTTGTLKQYEPFFVIGYINALSLKSESILLGLNAGVEMFTVVYTVTQMYT IALTITKPMLLRKSEKKIQ >gi568815583f:80020345_80237625|GENSCAN_predicted_CDS_4|780_bp atggctcaagaaactaatcacagccaagtgcctatgctttgttccactggctgtggattt tatggaaaccctcgtacaaatggcatgtgttcagtatgctataaagaacatcttcaaaga cagaatagtagtaatggtagaataagcccacctgcaacctctgtcagtagtctgtctgaa tctttaccagttcaatgcacagatggcagtgtgccagaagcccagtcagcattagactct acatcttcatctatgcagcccagccctgtatcaaatcagtcacttttatcagaatctgta gcatcttctcaattggacagtacatctgtggacaaagcagtacctgaaacagaagatgtg caggcttcagtatcagacacagcacagcagccatctgaagagcaaagcaagtctcttgaa aaaccgaaacaaaaaaagaatcgctgtttcatgtgcaggaagaaagtgggacttactgtt gggtttatgatcagaactgccattacctataaagctgctaacccgagtcttgaagggaaa aagataaacaccagctgccagtcttttggttgtacaacaggaacgcttaaacaatatgaa ccctttttcgtgatcggttacattaatgctttgtccctgaaatcagaaagtatcttgcta ggtttgaatgccggtgtggaaatgtttactgtggtgtacaccgttactcagatgtacaca attgctcttacaattacaaagccgatgctgctgagaaaatcagaaaagaaaatccagtag >gi568815583f:80020345_80237625|GENSCAN_predicted_peptide_5|747_aa MQKTKDLQGTLAQTIVYFSRFQPASWLFKSVPESFTPTITQRHTLRPAAVPGALQHVLHP GGRGFRLPHPQPALRRLLDQRRRHSLKHKFEASRLCLHLRQQAAALVTPFWKLLTGGDSF LPSAGVWGPGSEMWSKGLPCTFPEWKATWIQPRPRIGVAIGDQILDLSIIKHLFTGPVLS KHQDVFNQPTLNSFMGLGQAAWKEARVFLQNLLSVSQARLRDDTELRKCAFISQASATMH LPATIGECSLFTKIRTEQLRGPRGLARCFGSASVWRVPAGDYTDFYSSRQHATNVGIMFR DKENALMPNWLHLPVGYHGRASSVVVSGTPIRRPMGQMKPDDSTSAAALCPHLAIRRTVD PYYLQILYLRIHNYLLIFICKSRINIHRAFVGMCGHAQGSETFELPGPGNRLGEPIPISK AHEHIFGMVLMNDWSARDIQKWEYVPLGPFLGKSFGTTVSPWVVPMDALMPFAVPNPKQA CRPSGGKLCAHGMPTAECLGVPGHDYFQAARASFSLVVSLALAQFLEKTQNFVTLRILQD PGDLASWQEESLQDPRPLPYLCHDEPYTFDINLSVNLKAPLPSVHPKARHGRMGARALLT RSLCCGVDLDSPNTGLEGVPRAEWSVVHKGMLQAEGLSSKYMYWTMLQQLTHHSVNGCNL RPGDLLASGTISGPEPENFGSMLELSWKGTKPIDLGNGQTRKFLLDGDEVIITAMATRDF KWYCQGDGYRIGFGQCAGKVLPALLPS >gi568815583f:80020345_80237625|GENSCAN_predicted_CDS_5|2244_bp atgcagaagactaaggatcttcagggaacccttgcccagactatcgtctacttttctaga tttcagccggcatcctggctgtttaaaagtgtacctgaatcattcacaccaacaataaca caacgacacactctccggcccgcagccgtgccgggtgctcttcagcatgtccttcatccc ggtggccgaggattccgacttccccatccacaacctgccctacggcgtcttctcgaccag aggcgacgtcatagtttgaagcacaagtttgaagcttctaggctttgtttgcatttgagg cagcaggcagctgcacttgtgaccccattttggaagttgcttactggtggtgactccttc ttgccttcagctggggtgtggggaccagggtcagagatgtggtccaagggtctgccctgc acattccctgaatggaaagccacttggattcagccaagaccgaggataggtgtggccatt ggcgaccagatcctggacctcagcatcatcaagcacctctttactggtcctgtcctctcc aaacaccaggatgtcttcaatcagcctacactcaacagcttcatgggcctgggtcaggct gcctggaaggaggcgagagtgttcttgcagaacttgctgtctgtgagccaagccaggctc agagatgacaccgaacttcggaagtgtgcattcatctcccaggcttctgccacgatgcac cttccagccaccataggtgagtgcagtctcttcaccaagataagaacggagcagcttcgt gggccaagagggctggccaggtgctttggttctgcatctgtgtggagggtccctgctgga gactacacagacttctattcctctcggcagcatgctaccaacgtcggaatcatgttcagg gacaaggagaatgcgttgatgccaaattggctgcacttaccagtgggctaccatggccgt gcctcctctgtcgtggtgtctggcaccccaatccgaaggcccatgggacagatgaaacct gatgactccacatcggcagccgctctgtgtccccacctggccatcagaaggacagttgat ccttattacttgcagattctgtatttgagaattcacaactacttgctaatatttatttgt aagtccagaattaatattcacagagcttttgtgggcatgtgtggacatgcgcagggcagt gaaacatttgagctgcctggccctggaaacagattgggagagccgatccccatttccaag gcccatgagcacatttttggaatggtccttatgaacgactggagtgcacgagacattcag aagtgggagtatgtccctctcgggccattccttgggaagagttttgggaccactgtctct ccgtgggtggtgcccatggatgctctcatgccctttgctgtgcccaacccgaagcaggca tgcagaccatcaggagggaagctctgtgcccacggcatgcccacagcagagtgcctggga gttcctggccacgattacttccaggcagccagggcatcattctcccttgtggtttccctg gccctggcacagttcttggagaaaacacagaattttgtaaccttaagaatcttgcaagac ccgggggatctggccagttggcaggaggagtccctgcaggaccccaggcccctgccgtat ctgtgccatgacgagccctacacatttgacatcaacctctctgttaacctgaaagctcca ttgcccagtgtccatcccaaggcccggcatgggagaatgggagcgagggctttgctcacg agaagcctctgctgtggtgtggacttagattctcccaatacgggactggagggtgttccc agggcagagtggagtgttgtacacaagggcatgttgcaggctgagggactgagcagtaag tacatgtactggacgatgctgcagcagctcactcaccactctgtcaacggctgcaacctg cggccgggggacctcctggcttctgggaccatcagcgggccggagccagaaaacttcggc tccatgttggaactgtcgtggaagggaacgaagcccatagacctggggaatggtcagacc aggaagtttctgctggacggggatgaagtcatcataacagctatggctacccgggacttc aagtggtactgccagggggatggttaccgcatcggctttggccagtgtgctggaaaagtg ctgcctgctctcctgccatcatga >gi568815583f:80020345_80237625|GENSCAN_predicted_peptide_6|733_aa MAFHHVAQAGLDLLTSEPADDCTWSTRILLTLSEFIMSLQRTVYPHSEKWRALSVPSSSY FQELVGTSQELALTFWHLLSMFGFFIVSYGFLTAFGRTLFHLDLLQPNLTPSRFDKYTGS AQRPLSLRVTHSPSLFIYEIEGDGLDPCFQSMVQGILEVLWMSKVESAYHTNDGDTAGEG VRNGSIGWSEQQALGAGVADKREQRGGKWYQSNPRRRRSSPQSLPRDGGAHARARLCRRR QRADLGLLRLPLPLPCRDDHPLCQGHHGPLQRHPHIHLGGAAPGRLRHSGCGDWTWRSVG GCAPGCIPCHPTEARNESGHPGKQKVQLSPLSPPSSTSVECPGAVGVATMDQPGEKSPGP GPRRLGFQCGESLSGLWDNEDTSVVGHLGKYNLEGANSPRDKWGSRNMQNQEFRVMVTRR NVASKDAQTVAEMSTCSYRVERSVSPCLIKFPELQRLNQSKRGWPCFQTPSEENYKVCLV TGEGELLLIICIDCDAMETISNMTVLKLQYALESPGRLILTQLDPPGFCISQTGDLNSDV IRTTCPASLKSLKGAQFNCPEDAELLLVYYFVTKRQFQGLPWGHFYNNSPHSMGPIVTLN TNDFIHQDLSGKFKRFNEEGLILGQFSEGQVSPLCNRDRGSQPLEPLVPALAPGSSVPLS LAPLPPYNKTSSGPHLARNTSGLTPPGQTLGQVHWVFPTSPSPLSALNAQQGDKRVISTD DTSILWCQWEQVD >gi568815583f:80020345_80237625|GENSCAN_predicted_CDS_6|2202_bp atggcgttccaccatgttgcccaggctggtctcgatctcctgacctcagaaccagctgat gattgcacatggtcaactcggattcttcttaccctgagtgaatttataatgtcgctgcag cgtactgtttatccccacagtgagaagtggcgtgctctctcagtcccttcatcttcctat tttcaggagctggtgggcaccagccaggaactggctctcaccttctggcatttactgtct atgtttggattcttcatcgtgtcctatggctttctcacagcatttggcaggactcttttc cacttggatctgctacaacccaaccttacaccttcacgctttgacaagtacactgggtca gcccagcggccactttctttgagggtcactcattcgcctagtctcttcatctatgaaata gagggagatgggttagatccatgttttcaaagcatggtccaaggaatcctagaggttttg tggatgtccaaagtggagagtgcataccacactaatgatggtgatactgctggagagggt gtgaggaatggttctatagggtggtctgagcagcaagctctcggtgcaggggtagcagat aagagggagcagaggggtggaaaatggtaccagtcaaacccgaggcggaggaggagcagc ccgcagtcgctgccgcgagatggaggagcccacgcccgagcccgtctatgtcgacgtaga caaagggctgaccttggcctgcttcgtcttcctctgcctcttccttgtcgtgatgatcat ccgctgtgccaaggtcatcatggacccttacagcgccatccccacatccacctgggagga gcagcacctggacgactgaggcacagcggctgtggggactggacctggcgctctgtgggt ggctgtgccccaggatgcatcccctgccatcccacagaggcaaggaatgaatctggccac ccagggaaacagaaggtgcagcttagccctctcagccctccctcgagcaccagcgtggaa tgcccgggtgccgtgggtgtcgcaaccatggaccagccaggggaaaagagccctggacca ggacccaggaggctggggttccagtgtggggagtccctcagtggtctttgggacaatgaa gacacctcggtggtgggtcacttagggaaatacaatctggaaggggccaacagtcccaga gacaagtggggtagcagaaatatgcagaaccaagagttcagagtgatggtcactagaaga aatgtggcctctaaggatgctcagacagttgctgaaatgtctacatgttcttatcgagtt gaaagatcagtgtctccctgtctcattaaatttccagaacttcagaggctaaatcaatcc aaaaggggctggccctgctttcaaacaccaagtgaagagaactacaaagtttgtcttgta acaggtgaaggtgaactccttctgattatctgtattgattgtgatgcaatggaaactatt tctaatatgacagttctcaaacttcagtatgcattagaatcaccgggacggctcattcta acacagctggacccacctggcttttgcataagccagacaggtgacttaaatagtgatgtg attagaaccacctgtcctgcatctctcaagtctttgaagggggcacaattcaattgccct gaagatgcagaattgcttttggtttactattttgtcacaaagaggcagttccaagggctt ccatggggccacttctacaataatagccctcactccatgggtccaattgtcacccttaat accaatgactttattcatcaggatttaagcggcaaattcaaaaggttcaatgaagagggt ttaattttgggacaattttcagagggacaggtaagtcccctctgcaatagggacaggggc tcccagcccctggaacccctagtgcctgccctggccccgggcagtagcgtgcccctgagc ctagctcccttgccaccctacaataaaaccagcagtgggccccatcttgccagaaacacc tctggactgactcctcctggtcagactctggggcaggtccattgggttttcccaacatca ccaagccctttaagtgcattaaatgcccaacaaggggataagcgtgtgatctctactgat gacaccagcattctctggtgccagtgggaacaagtggattaa