GENSCAN 1.0	Date run: 5-Nov-116	Time: 14:40:38

Sequence gi568815597f:201911137_202115320 : 204184 bp : 48.13% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.05 Intr -   1861   1807   55  2  1   77   91    79 0.013   5.14
 1.04 Intr -   5828   5760   69  0  0   84   83    37 0.020   1.95
 1.03 Intr -  30973  30912   62  0  2   96   75    79 0.366   5.78
 1.02 Intr -  33088  33012   77  1  2  107   67   -11 0.126  -2.99
 1.01 Init -  35204  34944  261  2  0   84   76   492 0.625  44.66
 1.00 Prom -  40809  40770   40                              -7.36

 2.00 Prom +  41994  42033   40                              -4.86
 2.01 Init +  44391  44416   26  2  2   99   85    37 0.928   3.51
 2.02 Intr +  46145  46244  100  2  1   68  103    82 0.946   7.81
 2.03 Intr +  46375  46438   64  0  1  105   99    -6 0.901   0.49
 2.04 Intr +  54297  54407  111  1  0   79  101    15 0.120   2.25
 2.05 Intr +  58333  58414   82  2  1  118    2    30 0.043  -3.90
 2.06 Term +  67508  67715  208  2  1   60   28   265 0.226  14.51
 2.07 PlyA +  67850  67855    6                               1.05

 3.00 Prom +  70698  70737   40                              -9.16
 3.01 Init +  71531  71977  447  1  0  117   84   822 0.993  78.67
 3.02 Intr +  77768  77908  141  1  0   93   94   132 0.999  14.85
 3.03 Intr +  78247  78395  149  0  2   89   91   130 0.999  12.43
 3.04 Intr +  85011  85127  117  0  0  102   70   121 0.989  11.28
 3.05 Intr +  86183  86418  236  2  2   97   31   240 0.857  16.53
 3.06 Intr +  88766  88879  114  0  0   26   42   129 0.713   2.52
 3.07 Intr +  90240  90352  113  1  2   82   75   103 0.869   8.50
 3.08 Intr +  90523  90631  109  0  1   95  115    81 0.999  11.36
 3.09 Intr +  92101  92325  225  2  0  105   99   264 0.999  27.16
 3.10 Intr +  93218  93360  143  0  2  115   94   153 0.998  18.67
 3.11 Intr +  94422  94575  154  2  1   67  -21   137 0.210   0.35
 3.12 Intr +  99007  99060   54  2  0  102   76    45 0.885   3.65
 3.13 Intr +  99993 100163  171  1  0  133   94   162 0.914  21.21
 3.14 Intr + 100821 101042  222  1  0  100   70   247 0.999  22.20
 3.15 Intr + 101208 101300   93  1  0   87   77    96 0.997   8.34
 3.16 Intr + 101504 101623  120  0  0  119   94   149 0.999  19.07
 3.17 Intr + 101811 101900   90  1  0   78  101   127 0.986  12.97
 3.18 Intr + 102046 102162  117  2  0  136  101   170 0.789  23.54
 3.19 Intr + 102693 102888  196  1  1    4   83   379 0.697  27.47
 3.20 Term + 104073 104187  115  0  1  109   55   205 0.995  17.34
 3.21 PlyA + 104842 104847    6                               1.05

 4.00 Prom + 119766 119805   40                              -2.76
 4.01 Init + 130089 130167   79  2  1   80   32    90 0.121   3.82
 4.02 Intr + 136284 136417  134  1  2   94   23    30 0.019  -2.54
 4.03 Intr + 150078 150160   83  2  2   -2   95   134 0.408   3.54
 4.04 Term + 156470 156722  253  2  1   73   49   149 0.328   4.61
 4.05 PlyA + 157321 157326    6                               1.05

 5.00 Prom + 165222 165261   40                              -6.96
 5.01 Init + 165745 165931  187  0  1   71  113   101 0.627  10.12
 5.02 Term + 169883 169959   77  0  2  103   44    16 0.316  -3.30
 5.03 PlyA + 170924 170929    6                               1.05

 6.05 PlyA - 171760 171755    6                               1.05
 6.04 Term - 175235 175130  106  1  1  114   42    69 0.545   2.78
 6.03 Intr - 177295 177213   83  1  2   86   21    69 0.137  -1.56
 6.02 Intr - 185696 185554  143  0  2   69   92    46 0.076   3.17
 6.01 Intr - 197905 197834   72  2  0   96   71    21 0.141   0.58

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597f:201911137_202115320|GENSCAN_predicted_peptide_1|175_aa
MSRVAKYRRQVSEDPDIDSLLETLSPEEMEELEKELDVVDPDGSVPVGLRQRNQTEKQST
GVYNREAMLNFCEKETKKLMQREMSMDVHTWFCGQSPSQEIEVVCVFVCTSLMARPVSQC
SSALIIRELSFSPGSQTAALFLAAIAPLRPLEIILGAPTAFLTALDILKFAGLSS

>gi568815597f:201911137_202115320|GENSCAN_predicted_CDS_1|525_bp
atgtctagagtagccaaatatcgccggcaggtgagtgaagaccccgacatcgacagcctg
ctggagaccctgtctcccgaggagatggaggagctggagaaggagctggacgtggtggac
ccagacgggagtgttcccgtggggctgcggcagagaaaccagacggagaaacagtccacg
ggtgtgtacaaccgggaggccatgctcaacttctgtgaaaaggagaccaagaaacttatg
cagagggagatgtccatggatgtccacacttggttctgcggacagagcccgagtcaagaa
atagaagtggtgtgtgtgtttgtgtgtacatccttgatggccagacctgtcagccagtgt
agctccgccctaatcatccgggaactctccttcagcccaggttcacaaaccgctgccctt
ttcttggctgccattgcgccactccgacccctggagatcatcttgggagctcccaccgcc
ttcctcacagccctggacatcctcaagtttgcaggccttagcagn

>gi568815597f:201911137_202115320|GENSCAN_predicted_peptide_2|196_aa
MEEYAREPCPWRIVDDCGGAFTMGTIGGGIFQAIKGFRNSPVGVNHRLRGSLTAIKTRAP
QLGDGPVAMVGSAAMGGILLALIEGAGILLTRFASAQFPNGPQFAEDPSQLPSTQLPSSP
FGDYRQYQLQTGMRGAFGKPQGTVARVHIGQVMSIHTKLQNKEHVIEALRRAKFKFSGRQ
KIHISKKWGFTKFNAN

>gi568815597f:201911137_202115320|GENSCAN_predicted_CDS_2|591_bp
atggaggagtacgcgcgagagccttgcccatggcgaattgtggatgactgtggtggggcc
tttacgatgggtaccattggtggtggtatctttcaagcaatcaaaggttttcgcaattct
ccagtgggagtaaaccacagactacgagggagtttgacagctattaaaaccagggctcca
cagttaggagatggaccagtggccatggttgggtcagccgcaatgggtggcattctccta
gctttaattgaaggagctggtatcttgttgacaagatttgcctctgcacagtttcccaat
ggtcctcagtttgcagaagacccctcccagttgccttcaactcagttaccttcctcacct
tttggagactatcgacaatatcagctccaaactggcatgcgaggtgcctttggaaagccc
cagggcactgtggccagggttcacattggccaagttatgtccatccacaccaagctgcag
aacaaggagcatgtgattgaggccctgcgcagggccaagttcaagttttctggccgccag
aagatccacatctcaaagaaatggggcttcaccaagttcaatgccaattaa

>gi568815597f:201911137_202115320|GENSCAN_predicted_peptide_3|1041_aa
MASGEHSPGSGAARRPLHSAQAVDVASASNFRAFELLHLHLDLRAEFGPPGPGAGSRGLS
GTAVLDLRCLEPEGAAELRLDSHPCLEVTAAALRRERPGSEEPPAEPVSFYTQPFSHYGQ
ALCVSFPQPCRAAERLQVLLTYRVGEGPGVCWLAPEQTAGKKKPFVYTQGQAVLNRAFFP
CFDTPAVKYKYSALIEVPDGFTAVMSASTWEKRGPNKFFFQMCQPIPSYLIALAIGDLVS
AEVGPRSRVWAEPCLIDAAKEEYNGVIEEFLATGEKLFGPYVWGRYDLLFMPPSFPFGGM
ENPCLTFVTPCLLAGDRSLADVIIHEISHSWFGNLVTNANWGEFWLNEGFTMYAQRRIST
ILFGAAYTCLEAATGRALLRQHMDITGEENPLNKLRVKIEPGVDPDDTYNETPYEKGFCF
VSYLAHLVGDQDQFDSFLKAYVHEFKFRSILADDFLDFYLEYFPELKKKRVDIIPGFEFD
RWLNTPGWPPYLPDLSPGDSLMKPAEELAQLWAAEELDMKAIEAVAISPWKTYQLVYFLD
KILQKSPLPPGNVKKLGDTYPSISNARNAELRLRWGQIVLKNDHQEDFWKVKEFLHNQGK
QKYTLPLYHAMMGGSEVAQTLAKETFASTASQLHSNVVNYVQQIVAPKGSLFVAESLEFR
GQQLSDSGSLMAATCEISNIFSNYFSAMYSSEDSTLASVPPAATFGADDLVLTLSNPQMS
LEGTEKASWLGEQPQFWSKTQVLDWISYQVEKNKYDASAIDFSRCDMDGATLCNCALEEL
RLVFGPLGDQLHAQLRDLTSSSSDELSWIIELLEKDGMAFQEALDPGPFDQGSPFAQELL
DDGQQASPYHPGSCGAGAPSPGSSDVSTAGTGASRSSHSSDSGGSDVDLDPTDGKLFPSD
GFRDCKKGDPKHGKRKRGRPRKLSKEYWDCLEGKKSKHAPRGTHLWEFIRDILIHPELNE
GLMKWENRHEGVFKFLRSEAVAQLWGQKKKNSNMTYEKLSRAMRYYYKREILERVDGRRL
VYKFGKNSSGWKEEEVLQSRN

>gi568815597f:201911137_202115320|GENSCAN_predicted_CDS_3|3126_bp
atggcgagcggcgagcattcccccggcagcggcgcggcccggcggccgctgcactccgcg
caggctgtggacgtggcctcggcctccaacttccgggcctttgagctgctgcacttgcac
ctggacctgcgggctgagttcgggcctccagggcccggcgcagggagccgggggctgagc
ggcaccgcggtcctggacctgcgctgcctggagcccgagggcgccgccgagctgcggctg
gactcgcacccgtgcctggaggtgacggcggcggcgctgcggcgggagcggcccggctcg
gaggagccgcctgcggagcccgtgagcttctacacgcagcccttctcgcactatggccag
gccctgtgcgtgtccttcccgcagccctgccgcgccgccgagcgcctccaggtgctgctc
acctaccgcgtcggggagggacccggggtttgctggttggctcccgagcagacagcagga
aagaagaagcccttcgtgtacacccagggccaggctgtcctaaaccgggccttcttccct
tgcttcgacacgcctgctgttaaatacaagtattcagctcttattgaggtcccagatggc
ttcacagctgtgatgagtgctagcacctgggagaagagaggtccaaataagttcttcttc
cagatgtgtcagcccatcccctcctatctgatagctttggccatcggagatctggtttcg
gctgaagttggacccaggagccgggtgtgggctgagccctgcctgattgatgctgccaag
gaggagtacaacggggtgatagaagaatttttggcaacaggagagaagctttttggacct
tatgtttggggaaggtatgacttgctcttcatgccaccgtcctttccatttggaggaatg
gagaacccttgtctgacctttgtcaccccctgcctgctagctggggaccgctccttggca
gatgtcatcatccatgagatctcccacagttggtttgggaacctggtcaccaacgccaac
tggggtgaattctggctcaatgaaggtttcaccatgtacgcccagaggaggatctccacc
atcctctttggcgctgcgtacacctgcttggaggctgcaacggggcgggctctgctgcgt
cagcacatggacatcactggagaggaaaacccactcaacaagctccgcgtgaagattgaa
ccaggcgttgacccggacgacacctataatgagaccccctacgagaaaggtttctgcttt
gtttcatacctggcccacttggtgggtgatcaggatcagtttgacagttttctcaaggcc
tatgtgcatgaattcaaattccgaagcatcttagccgatgactttctggacttctacttg
gaatatttccctgagcttaagaaaaagagagtggatatcattccaggttttgagtttgat
cgatggctgaatacccccggctggcccccgtacctccctgatctctcccctggggactca
ctcatgaagcctgctgaagagctagcccaactgtgggcagccgaggagctggacatgaag
gccattgaagccgtggccatctctccctggaagacctaccagctggtctacttcctggat
aagatcctccagaaatcccctctccctcctgggaatgtgaaaaaacttggagacacatac
ccaagtatctcaaatgcccggaatgcagagctccggctgcgatggggccaaatcgtcctt
aagaacgaccaccaggaagatttctggaaagtgaaggagttcctgcataaccaggggaag
cagaagtatacacttccgctgtaccacgcaatgatgggtggcagtgaggtggcccagacc
ctcgccaaggagacttttgcatccaccgcctcccagctccacagcaatgttgtcaactat
gtccagcagatcgtggcacccaagggcagcctctttgttgctgaatctctggaatttagg
ggccagcagctttctgactcaggtagcctcatggctgcaacctgtgagattagcaacatt
tttagcaactacttcagtgcgatgtacagctcggaggactccaccctggcctctgttccc
cctgctgccacctttggggccgatgacttggtactgaccctgagcaacccccagatgtca
ttggagggtacagagaaggccagctggttgggggaacagccccagttctggtcgaagacg
caggttctggactggatcagctaccaagtggagaagaacaagtacgacgcaagcgccatt
gacttctcacgatgtgacatggatggcgccaccctctgcaattgtgcccttgaggagctg
cgtctggtctttgggcctctgggggaccaactccatgcccagctgcgagacctcacttcc
agctcttctgatgagctcagttggatcattgagctgctggagaaggatggcatggccttc
caggaggccctagacccagggccctttgaccagggcagcccctttgcccaggagctgctg
gacgacggtcagcaagccagcccctaccaccccggcagctgtggcgcaggagccccctcc
cctggcagctctgacgtctccaccgcagggactggtgcttctcggagctcccactcctca
gactccggtggaagtgacgtggacctggatcccactgatggcaagctcttccccagcgat
ggttttcgtgactgcaagaagggggatcccaagcacgggaagcggaaacgaggccggccc
cgaaagctgagcaaagagtactgggactgtctcgagggcaagaagagcaagcacgcgccc
agaggcacccacctgtgggagttcatccgggacatcctcatccacccggagctcaacgag
ggcctcatgaagtgggagaatcggcatgaaggcgtcttcaagttcctgcgctccgaggct
gtggcccaactatggggccaaaagaaaaagaacagcaacatgacctacgagaagctgagc
cgggccatgaggtactactacaaacgggagatcctggaacgggtggatggccggcgactc
gtctacaagtttggcaaaaactcaagcggctggaaggaggaagaggttctccagagtcgg
aactga

>gi568815597f:201911137_202115320|GENSCAN_predicted_peptide_4|182_aa
MEINQVNMEINQVFKNGCKTDKIPAGAPASLDSHQMILLSSSREGHDLKQEILQFPTLLN
LYHLPSRGVEEPKQSEQQNQVVLDDNAEYDTDIHKSTLIEPIQEWPKKAGNKMGFGSWMP
HCLAGNEDLIPGGKWGTEGYWHKVAALKLSLAVNPEGIPERENALEGVRSQESDRCGGTG
IN

>gi568815597f:201911137_202115320|GENSCAN_predicted_CDS_4|549_bp
atggaaatcaaccaagtcaacatggaaatcaaccaagtgtttaaaaacggctgcaagaca
gacaagataccagctggggctcctgcatctcttgactctcatcaaatgatcttgctgtct
tcttcacgggaaggccatgacctcaaacaggaaatccttcagtttcccaccctgctgaat
ctgtaccaccttccctccagaggagtagaagagcctaaacaatctgaacaacaaaaccaa
gtagtgttggatgacaacgcagaatatgacacagacatccacaagtccacactgatagaa
cccattcaggaatggcccaagaaagcaggtaataagatgggttttgggagctggatgcct
cactgcctagctggcaatgaggacctcatccctggtgggaagtggggtacggagggctat
tggcacaaggtggcagcattaaaactttcactggcggtaaatccagaaggcataccagag
agagagaatgcactggagggtgtgcggtcccaggagtctgaccggtgtggaggaactggt
atcaattag

>gi568815597f:201911137_202115320|GENSCAN_predicted_peptide_5|87_aa
MPKPSWRGESQRKGPGAGVSMIWGLNRSWREEDGGQRPDEDSQTGLDKKSGDDDMQKTNP
QEGKSQKPEFLFPKAIETRIPLPESKT

>gi568815597f:201911137_202115320|GENSCAN_predicted_CDS_5|264_bp
atgccaaaaccttcctggagaggagagagccagcgcaaaggtcctggggcaggagtgagc
atgatctgggggttaaatagaagctggagagaagaggatggtgggcagaggcctgatgag
gacagtcagacaggcctagataagaagtctggagatgatgatatgcagaaaaccaaccca
caggaaggcaagtcacagaaaccagaattcctcttccccaaggccatagaaactagaatc
cctctcccagaaagcaaaacctag

>gi568815597f:201911137_202115320|GENSCAN_predicted_peptide_6|134_aa
XDEGKCLDHCWPNSSQKAATVHWKGRAAVAAAAAADDDDDGDNNIYWALTIYQHSSQMFP
CVNSLNLQQPFIPAAKQNPGQITSHGQQKLDKVLGQPSPWCSPNKIPNLAFILLNRCVSG
ISQGQANSLHYGCL

>gi568815597f:201911137_202115320|GENSCAN_predicted_CDS_6|405_bp
naggatgagggaaaatgtctggaccactgttggcctaacagttcacagaaagctgccact
gtgcactggaaaggaagggctgctgttgctgctgctgctgctgctgatgacgatgacgat
ggtgacaataacatttattgggcacttactatataccagcactcttcccaaatgtttcca
tgtgttaactcacttaatcttcaacagcctttcatacctgcagcaaagcaaaatcctggc
cagataacttcccatgggcagcagaagctggacaaagtccttgggcagccatcaccatgg
tgcagccccaataagatcccaaacctggccttcatcctcctaaacagatgtgtttctgga
atttctcagggtcaagccaactccctgcactatggctgcctctga