GENSCAN 1.0 Date run: 8-Nov-116 Time: 00:54:57 Sequence gi568815593r:175859972_176068208 : 208237 bp : 45.70% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 18769 18799 31 0 1 97 78 63 0.964 6.30 1.02 Intr + 18937 19112 176 2 2 117 82 381 0.997 40.06 1.03 Intr + 19877 20070 194 1 2 82 15 399 0.215 30.29 1.04 Intr + 20647 20758 112 1 1 38 91 54 0.150 1.08 1.05 Intr + 20798 20918 121 1 1 71 38 33 0.416 -3.33 1.06 Intr + 23208 23423 216 1 0 49 89 95 0.547 4.18 1.07 Intr + 24274 24411 138 2 0 103 64 13 0.558 0.84 1.08 Intr + 40564 40677 114 2 0 116 68 23 0.021 3.52 1.09 Intr + 45471 45605 135 1 0 -2 76 163 0.088 6.64 1.10 Intr + 51682 51720 39 2 0 96 100 8 0.518 1.00 1.11 Intr + 63129 63168 40 1 1 118 77 68 0.111 5.98 1.12 Intr + 90197 90284 88 2 1 81 80 -5 0.003 -2.03 1.13 Intr + 94952 95026 75 1 0 115 68 38 0.919 4.21 1.14 Intr + 95663 95737 75 1 0 50 80 73 0.624 2.41 1.15 Term + 96443 96583 141 1 0 114 54 78 0.979 4.93 1.16 PlyA + 97990 97995 6 1.05 2.07 PlyA - 98664 98659 6 1.05 2.06 Term - 100161 99998 164 1 2 76 54 146 0.966 8.10 2.05 Intr - 101180 101080 101 1 2 74 93 69 0.984 5.75 2.04 Intr - 101463 101302 162 2 0 99 111 160 0.953 18.49 2.03 Intr - 105184 104980 205 2 1 49 94 226 0.522 17.56 2.02 Intr - 107296 107140 157 1 1 88 57 66 0.522 3.08 2.01 Init - 108318 107971 348 0 0 45 85 533 0.560 43.89 2.00 Prom - 116444 116405 40 -6.26 3.00 Prom + 118951 118990 40 -3.36 3.01 Init + 122377 122488 112 0 1 73 40 71 0.330 1.17 3.02 Intr + 133248 133359 112 1 1 81 100 41 0.233 4.04 3.03 Intr + 146221 146416 196 1 1 4 7 260 0.066 8.92 3.04 Intr + 146774 146905 132 1 0 72 -18 297 0.940 18.34 3.05 Intr + 147287 147376 90 1 0 60 41 121 0.035 4.79 3.06 Intr + 163919 163945 27 1 0 113 96 17 0.057 3.31 3.07 Intr + 171641 171724 84 1 0 42 91 78 0.254 3.52 3.08 Intr + 174000 174031 32 2 2 100 75 7 0.387 -2.47 3.09 Intr + 174475 174698 224 1 2 88 62 111 0.260 6.27 3.10 Intr + 180716 180876 161 0 2 72 105 323 0.191 32.11 3.11 Intr + 184764 184901 138 2 0 36 49 107 0.003 2.26 3.12 Intr + 185178 185307 130 2 1 27 60 99 0.071 1.27 3.13 Term + 185511 185725 215 1 2 40 45 134 0.020 1.69 3.14 PlyA + 189981 189986 6 1.05 4.00 Prom + 194223 194262 40 -4.96 4.01 Init + 195168 195939 772 2 1 64 28 253 0.457 11.76 4.02 Term + 196024 196970 947 2 2 30 47 315 0.305 13.85 4.03 PlyA + 197903 197908 6 1.05 5.05 PlyA - 197933 197928 6 1.05 5.04 Term - 201099 200824 276 0 0 49 55 124 0.446 0.66 5.03 Intr - 201320 201247 74 0 2 37 61 72 0.458 -1.47 5.02 Intr - 201889 201724 166 1 1 105 89 145 0.851 15.83 5.01 Init - 203828 203787 42 2 0 86 87 8 0.818 0.92 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 11976 11711 266 1 2 41 52 214 0.805 8.77 S.002 Term - 39016 38798 219 1 0 51 51 124 0.907 2.04 S.003 Term + 147071 147262 192 1 0 9 46 205 0.866 5.82 S.004 Intr - 185745 185611 135 2 0 42 121 137 0.893 12.08 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593r:175859972_176068208|GENSCAN_predicted_peptide_1|564_aa MDFVMKQALGGATKDMGKMLGGEEEKDPDAQKKEEERQEALRQQEEERKAKHARMEAERE KVRQQIRDKYGLKKKEEKEAEEKAALEQPCEGSLTRPKKAIPAGCGDEEEEEEESILDTV LKYLPGPLQDMFKNPGAPPSPAPTWRDMAPAKLVPSNGSFVDFSSFVEEPQVLGPHCQAG PSLLSQGAARPPENTWSHRVAMVYAMGNTSIGKGPGSHAWEELPERSKKAAVDRPGLGTS DRSGSPNMDVLPSKSRSSQKVSLTAKLCRVLLQMESGSRDTIPESILQPQGLSVPHGPPP HPPHTHTISDTPSPEPLWVPLPENAVKSHGSWGPFDAGPTLIYPRNTVSGPCSVSPGSHT DTCHMRSQLCHDHPKRSGSSDILQNITLIQYTDVIMLIRQIDKQYLACWKPKGKVRPMDT VPAYEGHLEKFEDESDIKWPNSKATSSEAGPWPAPQSKSAFAHIRTSMVSKLKLLCPLAS AHQQCDIFPTKNQIYGAPVVPDSGQVVPGDSVTDAAGQRHLLLPLTIFTSRTLTATPWLG SKLQQRPPTCPLSCTHLCSTQGLH >gi568815593r:175859972_176068208|GENSCAN_predicted_CDS_1|1695_bp atggacttcgtcatgaagcaggcccttggaggggccacaaaggacatggggaagatgctg gggggagaggaggagaaggaccccgacgcgcagaaaaaggaggaggagcggcaggaggcg ctgcggcagcaggaggaggagcgtaaggccaagcacgcgcgcatggaggcggagcgggag aaggtccggcagcagatccgagataagtatgggctgaagaagaaggaggagaaggaagca gaggagaaagcagccctggagcagccctgcgaggggagcctgacccggcccaagaaggcc atccctgcgggctgcggggacgaggaggaggaggaagaggagagcatcctggacacggtg ctcaaatacctgcccgggccgctgcaggacatgttcaagaacccaggggcacccccttcc ccagcccccacctggagagacatggcccctgccaagctggtcccttcaaatggatccttt gtggactttagctcatttgtggaggaaccccaggtcctgggcccccactgccaggctggg cccagcttgctcagtcaaggggctgccaggcccccagaaaacacttggagccatcgggta gcgatggtctatgccatggggaacacctccattgggaagggcccaggatcccatgcctgg gaggagctgccagagagaagcaaaaaggcggctgtggatcgccctgggctgggcaccagt gacaggtcaggatctccaaacatggacgtcctcccctccaaatccagaagctcccagaag gtgtccttaactgcaaagctgtgcagggtactcctccagatggaatcaggaagtcgagac accatcccagaatcaattcttcagccccaggggctgagtgttccccatggaccacccccc caccccccgcacacacacaccatctctgacaccccttcccccgagcccctctgggtgccg cttcctgagaatgctgtcaagtcgcacggaagctggggcccctttgatgctggacccact ctgatataccccagaaacacggtcagtgggccatgcagtgtgtctccaggcagccacaca gacacgtgtcacatgcgatcacagctctgtcatgaccatccaaaaagatctggatcatcg gacatcctgcagaacatcacactgatccagtacactgatgtcatcatgctgatcaggcag atagacaagcagtacctagcatgctggaagccaaagggaaaagtgagacccatggatact gtcccagcgtacgaaggccatttggagaaatttgaagatgaatcagatattaaatggcct aattccaaagccacctcctctgaagctggcccatggcctgcaccccagtccaaaagtgct ttcgcccacatcaggacctccatggtctccaagctaaagcttctctgtcccctggcttct gcccatcagcagtgtgacatctttcccacaaagaaccagatctatggtgccccagtggtc cctgacagcgggcaagttgtacccggtgactctgtcactgacgctgctggtcagcgacat ctcctactgcccctgaccatcttcacgtccaggacactgacggccacaccttggcttggc tccaagctgcagcaacgcccccccacctgtcccctgtcctgcactcacctctgctctacc caaggcctgcactga >gi568815593r:175859972_176068208|GENSCAN_predicted_peptide_2|378_aa MRAAAAPGLTAPWRLLQCCELEAGELGMAVPAAAMGPSALGQSGPGSMAPWCSVSSGPSR YVLGMQELFRGHSKTREFLAHSAKVHSVAWSCDGRRLASGSFDKTASVFLLEKDRLVKEN NYRGHGDSVDQLCWHPSNPDLFVTASGDKTIRIWDVRTTKCIATVNTKGENINICWSPDG QTIAVGNKDDVVTFIDAKTHRSKAEEQFKFEVNEISWNNDNNMFFLTNGNGCINILSYPE LKPVQSINAHPSNCICIKFDPMGKYFATGSADALVSLWDVDELVCVRCFSRLDWPVRTLS FSHDGKMLASASEDHFIDIAEVETGDKLWEVQCESPTFTVAWHPKRPLLAFACDDKDGKY DSSREAGTVKLFGLPNDS >gi568815593r:175859972_176068208|GENSCAN_predicted_CDS_2|1137_bp atgcgtgcggcagcggcgccaggactgactgcgccgtggaggctgctgcagtgttgtgag ttggaagctggggagctcggcatggcggtccccgctgcagccatggggccctcggcgttg ggccagagcggccccggctcgatggccccgtggtgctcagtgagcagcggcccgtcgcgc tacgtgcttgggatgcaggagctgttccggggccacagcaagacgcgcgagttcctggcg cacagcgccaaggtgcactcggtggcctggagttgcgacgggcgtcgcctagcctcgggg tccttcgacaagacggccagcgtcttcttgctggagaaggaccggttggtcaaagaaaac aattatcggggacatggggatagtgtggaccagctttgttggcatccaagtaatcctgac ctatttgttacggcgtccggagataaaaccattcgcatctgggatgtgaggactacaaaa tgcattgccactgtgaacactaaaggggagaacattaatatctgctggagtcctgatggg cagaccattgctgtaggcaacaaggatgatgtggtgacctttattgatgccaagacacac cgttccaaagcagaagagcagttcaagttcgaggtcaacgaaatctcctggaacaatgac aataatatgttcttcctgacaaatggcaatggttgtatcaacatcctcagctacccagaa ctgaagcctgtgcagtccatcaacgcccatccttccaactgcatctgtatcaagtttgac cccatggggaagtactttgccacaggaagtgcagatgctttggtcagcctctgggatgtg gatgagttagtgtgtgttcggtgcttttccaggctggattggcctgtaagaaccctcagt ttcagccatgatgggaaaatgctggcgtcagcatcggaagatcattttattgacattgct gaagtggagacaggggacaaactatgggaggtacagtgtgagtctccgaccttcacagtg gcgtggcaccccaaaaggcctctgctggcatttgcctgtgatgacaaagacggcaaatat gacagcagccgggaagccggaactgtgaagctgtttgggcttcctaatgattcttga >gi568815593r:175859972_176068208|GENSCAN_predicted_peptide_3|550_aa MRYHLTPIRIAIIKKEKQKKKANVGKDVEKLEPLCPVASIITITSGQRDCSRLLIGLPAS ILAPFNPSLQTQQPHITSDSEKQQALFWLFLCMHLVTEAGNTPIILGIGSNPRLHTPTYF FTHLSFVNICFITNLIPKLLCLTQMYFLISFANVDTFLLAIMALDHYVAICSALRYCSII TPGSDTIATIMYTVVTSMLNPFIYSLMNKEVQEALRVMEALLQASKKRSPLGTEAIGFSF TPFSKQTEVLQEKLGGPEAAQRLAAVTTITADANTTSASVTFHAPTHPPGLLPPGLHGHP PTIPVELQSPSLSLPQPQEVSHRATPSTGSNLQLTTAVVMDMFTHVDIFKDLLDAGFKRK ADLYIIVDGSKVKYVLHMCEQTRMHLGHLKELESLWDLLSTRGLVSNVHLESSVHLGPDV FLGPLCPLGPYAHLEGHRRCTEYTQTGKEEGMETWDLRIQYKKSENSNQERTQRKYEKDG LEIAKAAPKTSYSLFRKSQLAKYSKTKEGQELTLEKVLAALGATGWLVLGLLCQQAAFSQ LPFFASGGGG >gi568815593r:175859972_176068208|GENSCAN_predicted_CDS_3|1653_bp atgaggtaccaccttacacccattaggatagctattatcaaaaaagaaaaacaaaagaaa aaagcaaatgttggcaaggatgtggagaaattggagcccttgtgccccgttgcctccatc atcaccatcacctctggccagcgtgactgcagtagactccttattggtctccctgcttcc attcttgcccccttcaatccttctctccagacacagcagccccacatcaccagtgactca gagaagcagcaggccctcttctggctcttcctgtgtatgcacttagtcactgaggctgga aacacacccatcatcctgggcatcggctccaaccctcgcctgcacacccccacgtacttc ttcacccatctctcctttgtcaacatctgcttcatcaccaacctgatccccaagctcctg tgcctgactcagatgtacttcctcatctcctttgccaacgtggacacctttctgctggcc atcatggcactggaccactatgtggccatctgcagcgccctgcggtactgctccatcatc acccccggctctgacaccatagcaaccatcatgtacactgtggtgacctctatgctaaac cccttcatctacagtctgatgaacaaggaggtccaggaggccctcagagtgatggaggct ctgctccaggccagtaagaagcgttctcctttgggcactgaggccatcggcttcagcttc acacccttttccaaacagacagaagtccttcaggagaaactaggtggcccagaggctgcc cagagacttgctgctgttaccactattaccgccgatgccaacacaaccagtgcttctgtc accttccatgcacccacccaccctccagggctccttccacctggcctccacgggcaccct cctaccattcctgtcgagctgcagtctccatcgctgtcactgccacaaccacaggaagtg agccacagagccacgccatctacaggctccaacctccagctcaccacagctgtggtcatg gacatgttcacccatgtggacatcttcaaagacctgctggatgccggcttcaagaggaaa gcagatttgtacatcatcgtggacgggagtaaagtcaagtacgttctgcatatgtgtgag cagacccgcatgcacctggggcacctcaaggaactggagtccctctgggacctgttgtcc acccggggcctggtgtccaacgtccaccttgagtccagtgtgcacttggggcccgacgtt ttcctggggcctttgtgtccgctggggccttatgcccacctggaggggcacaggcgttgc acagaatacacacagactggaaaggaagaaggaatggaaacgtgggacctacgtattcag tataagaaaagtgaaaatagcaaccaggaaagaacacaaagaaagtacgagaaggatggg ctagaaattgctaaggccgccccgaaaacatcctactcacttttcagaaaatcacaactt gcaaagtacagcaaaaccaaggaaggccaagagcttaccttggagaaagttctcgcagcc ctgggcgccaccggctggctggttctggggctgctttgccagcaggcagccttttctcag ctgcctttcttcgcatcaggtggcggcggctga >gi568815593r:175859972_176068208|GENSCAN_predicted_peptide_4|572_aa MDKVLDTYNLTRLNQEEVEYLNRPTGSEIEAIINSLPTKKSPGPDGFTAEFYQRYKEELV PFLLKLFQSIEKERILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAEILNKILAN RIQQHIKKLIQHDQVGFIPGMQGWFNIRKSINVIQHINRTKDKNHMIISIDAEKAFGKIQ QPFMLKTLNKLVIDGTYLKIIRAMYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLFN IVLEVLARAIRQEKEIKAQNLLKLISNSSKVSGFKINVQKSQAFLYTNNRQTESQIMSEL PFTVASKGIKYLGIQLPRDVKDLLKENYKPLLNEIKEDTNKWKNIPCSWVERINIMKMAI LPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIAKSILSQKNKAGGITLPDFKL YYKATVTKTAWYWYQNRDIDQWNRTEPSEIMPHIYNYLIFDKPDKNKKWGKDSLYNKWCW ENWLGICRKLKLDPFLTPFTKINSRCIKDLNVRPKTIKTLEENLGNTIQDIGMGKDFLSK TPKAMATKAKIDKWDLIELKSSFCTAKKLPSE >gi568815593r:175859972_176068208|GENSCAN_predicted_CDS_4|1719_bp atggataaagtccttgatacatacaacctcacaagactaaaccaggaggaagttgaatat ctgaacagaccaacaggctctgaaattgaggcaataattaatagcttaccaaccaaaaaa agtccaggaccagacggattcacagctgaattctaccagaggtacaaggaggagctggta ccattccttctgaaactattccaatcaatagaaaaagagagaatcctccctaattcattt tatgaggccagcatcatcctgataccaaagcctggcagagacacaacaaaaaaagagaat tttagacccatatccctgatgaacatcgatgcagaaatcctcaataaaatactggcaaac cgaatccagcagcacatcaaaaagcttatccaacacgatcaagtgggcttcatccctggg atgcaaggctggttcaacatacgcaaatcaataaatgtaatccagcatataaacagaacc aaagacaaaaaccacatgattatctcaatagatgcagaaaaggcctttggcaaaattcaa cagcccttcatgctaaaaactctcaataaattagttattgatgggacgtatctcaaaata ataagagctatgtatgacaaacccacagccaatatcatactgaatgggcaaaaactggaa gcattccctttgaaaactggcacaagacagggatgccctctctcaccactcctcttcaac atagtgttggaagttctggccagggcaatcaggcaggagaaggaaataaaggcccaaaat ctccttaagctgataagtaactccagcaaagtctcaggattcaaaatcaatgtacaaaaa tcacaagcattcttatacaccaataacagacaaacagagagccaaatcatgagtgaactc ccattcacagttgcttcaaagggaataaaatacctaggaatccaacttccaagggatgtg aaggatctcctcaaggagaactacaaaccactgctcaatgaaataaaagaggatacgaac aaatggaagaacattccatgctcatgggtagaaagaatcaatatcatgaaaatggccata ctgcccaaggtaatttatagattcaatgccatccccatcaagctaccaatgactttcttc actgaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcccgcattgcc aagtcaatcctaagccaaaagaacaaagctggaggcatcacgcttcctgacttcaaacta tactacaaggctacagtaaccaaaacggcatggtactggtaccaaaacagagatatagac caatggaacagaacagagccctcagaaataatgccacatatctacaactatctgatcttt gacaaacctgacaaaaacaagaaatggggaaaggattccctatataataaatggtgctgg gaaaactggctaggcatatgtagaaagctgaaactggatcccttccttacaccttttaca aaaattaattcaagatgcattaaagacttaaatgttagacctaaaaccataaaaacccta gaagaaaacctaggcaataccattcaggacataggcatgggcaaggacttcctgtctaaa acaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaattgaactaaag agctccttctgcacagcaaaaaaactaccatcagagtga >gi568815593r:175859972_176068208|GENSCAN_predicted_peptide_5|185_aa MRNEKFTIRSLTWIVSAPALQAALHFRGFALDSALWIFRFGTPNPCRRGLGSPGPPAPAA PKAEELLQGAATLAASDLSDLVSLAKPPGSRKPQRDLVLAWAPRGHRSGFPDEPGFGADA LALGAAWLVSGSGRRLESHSLLPPPAPTAARARTRSLQLQLRHWRGMAELPLDGVRVTEC TAYLV >gi568815593r:175859972_176068208|GENSCAN_predicted_CDS_5|558_bp atgagaaatgagaaattcactatcagaagtcttacctggattgtctctgctcctgcgctc caggcagcgctccacttccgcggcttcgcccttgacagcgccctgtggatcttccgattc ggtaccccgaacccctgtagacgtggcctagggagccccggaccgccggcccctgcggct cccaaagccgaagaacttcttcaaggtgcagcgacactggcggcctccgacctctcagac ctagtgagcctcgcaaagccgcccggctcccggaagccgcagagagatctggtgctggca tgggcaccccgcggccaccggagtggcttcccggatgagcctggcttcggcgctgacgct ctggccctgggggctgcctggctggtgtcaggtagcggaagacgcctggagagtcactcg ctccttcccccacccgcccccaccgctgctcgtgccaggacgcgcagtttgcagttgcag ctcaggcactggcgcgggatggcggagcttcccttggatggcgtcagggtcaccgagtgc acagcctacctggtctga