GENSCAN 1.0 Date run: 7-Nov-116 Time: 03:33:46 Sequence gi568815581r:20903034_21142835 : 239802 bp : 45.40% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 16084 16601 518 0 2 58 57 197 0.013 7.96 1.02 Term + 35039 35225 187 2 1 77 46 121 0.032 3.76 1.03 PlyA + 36468 36473 6 1.05 2.30 PlyA - 37730 37725 6 1.05 2.29 Term - 48759 48646 114 0 0 54 40 174 0.978 7.77 2.28 Intr - 73145 73004 142 1 1 68 39 128 0.185 6.26 2.27 Intr - 76435 76239 197 1 2 91 46 114 0.155 5.71 2.26 Intr - 86600 86417 184 1 1 88 94 13 0.002 1.69 2.25 Intr - 88826 88755 72 1 0 133 16 73 0.003 3.12 2.24 Intr - 94262 94232 31 0 1 111 67 36 0.063 0.99 2.23 Intr - 99411 99259 153 1 0 89 61 41 0.049 1.54 2.22 Intr - 99867 99626 242 2 2 65 50 84 0.064 -0.51 2.21 Intr - 101318 101169 150 1 0 89 69 305 0.985 27.98 2.20 Intr - 101957 101895 63 1 0 110 108 87 0.980 10.73 2.19 Intr - 103954 103863 92 1 2 115 91 100 0.989 11.79 2.18 Intr - 104963 104837 127 1 1 60 106 51 0.983 4.78 2.17 Intr - 108276 108118 159 2 0 101 93 159 0.999 16.80 2.16 Intr - 109902 109797 106 1 1 66 100 104 0.999 8.67 2.15 Intr - 111953 111795 159 0 0 65 70 66 0.797 2.46 2.14 Intr - 112866 112719 148 0 1 53 86 282 0.882 24.31 2.13 Intr - 114518 114267 252 2 0 74 51 97 0.735 2.23 2.12 Intr - 115078 114909 170 2 2 89 83 166 0.945 15.87 2.11 Intr - 115886 115781 106 2 1 103 72 59 0.734 5.59 2.10 Intr - 117829 117729 101 2 2 96 27 58 0.734 0.33 2.09 Intr - 118193 118080 114 0 0 116 42 168 0.988 15.42 2.08 Intr - 125671 125509 163 1 1 93 68 148 0.684 12.85 2.07 Intr - 139786 139632 155 2 2 53 91 393 0.168 35.89 2.06 Intr - 175840 175639 202 1 1 6 51 163 0.155 3.26 2.05 Intr - 176652 176507 146 1 2 4 68 152 0.062 4.80 2.04 Intr - 177081 176928 154 0 1 85 89 52 0.071 4.65 2.03 Intr - 195960 195838 123 0 0 76 99 47 0.877 5.38 2.02 Intr - 196782 196746 37 2 1 108 82 11 0.746 0.76 2.01 Init - 201854 201787 68 2 2 67 58 60 0.619 1.44 2.00 Prom - 204586 204547 40 -6.86 3.00 Prom + 207544 207583 40 -2.86 3.01 Init + 211051 211155 105 0 0 67 40 92 0.271 2.52 3.02 Intr + 216788 216887 100 1 1 67 76 54 0.825 1.78 3.03 Intr + 220330 220411 82 2 1 82 89 38 0.551 2.00 3.04 Term + 223717 223831 115 1 1 7 55 125 0.351 -0.86 3.05 PlyA + 226123 226128 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:20903034_21142835|GENSCAN_predicted_peptide_1|234_aa MIVYLENPIVSAQNLLKLISNFSKVSGYKIDVQKSQAFLYTNNRQTESKIMNELPFKIAS KRIKYLGIQLTRNVKDLFKENYKPLLNKIKEDTNKWKNIPCSWIGRINIMKMAILPKVIY RFNAIPIKLPMTFFTELEKTTLKFVRNQKRARIAKIELFFIHPAFAIVASNTTCALGSHR KRSHPDSPGLRCWKMEKPLPQEKSSMNTMKLGLHLGHPSPRLVYNAGALHTLPG >gi568815581r:20903034_21142835|GENSCAN_predicted_CDS_1|705_bp atgattgtatatttagaaaaccccatcgtctcagcccaaaatctccttaagctgataagc aacttcagcaaagtctcaggatacaaaatcgatgtgcaaaaatcacaagcattcttatac accaataacagacaaacagagagcaaaatcatgaatgaactcccattcaaaattgcttca aagagaataaaatacctaggaattcaacttacaaggaatgtgaaggacctcttcaaggag aactacaaaccactgctcaacaaaataaaagaggacacaaacaaatggaagaacattcca tgctcatggataggaaggatcaatatcatgaaaatggccatactgcccaaggtaatttat agattcaatgccatccccatcaaactaccaatgactttcttcacagaattggagaaaact actttaaagttcgtaaggaaccaaaaaagagcccgcattgccaagatagagctcttcttt atacatcctgcttttgccatagttgcttctaataccacgtgcgccttgggctcccacagg aagcggtcacatccagactcaccaggcctgagatgttggaagatggagaagcctttgcca caggaaaaatcctccatgaacacaatgaaacttggcctgcacctgggacatccttctcca cggctagtctacaatgcaggtgctctgcacacgctccctggctga >gi568815581r:20903034_21142835|GENSCAN_predicted_peptide_2|1309_aa MPNGKTGSRSYPRIYVALSSVQRFLALCGIGTVLLHLHHQFLDFMKGYALHLASFTQHHV CDIRVMIIWVCTCIKLHCWVEVSTTELALVSGAQMWGSNLGRSAAGATVGECGPTLEDTR VLLSNPCVWALVRSTLEPFHTDDKEEESEEEGEYNEVTEEVTEQVCLPAKAAKEGEGPKE PYADFIVRLQESLKKVIADSAAQDIVLRLLAFNNANPECQAALRPIRGKAHLVDYIKACD SIGEPEGEAMDAELAVAPPGCSHLGSFKVDNWKQNLRAIYQCFVWSGTAEARKRKRSLLL FSEFQAKSCICHVCGVHLNRLHSCLYCVFFGCFTKKHIHEHAKAKRHNLAIDLMYGGIYC FLCQDYIYDKDMEIIAKEEQRKAWKMQGPPWACSWAKGTTPALLKLSQGAPADVGLNECG EGPAVGLGRRVMRSNSPALTAFSVCVGKGTQVDFGQGLRGLINLGNTCFMNCIVQALTHT PLLRDFFLSDRHRCEMQSPSSCLVCEMSSLFQEVSLSMPHQSLCPHELRAARWLFLWAAR VAFLSTSVPGIRAWQRSEQGVHGCRLSPHRLRDLDMLLRAEAAQLPQLISRLVPVLQFYS GHRSPHIPYKLLHLVWTHARHLAGYEQQDAHEFLIAALDVLHRHCKGLPKARAGVSDSAD LWGAGASSLLISSQLMLMPLVQMPYSENPCAQVTGHLAACDDNGKKANNPNHCNCIIDQI FTGGLQSDVTCQVCHGVSTTIDPFWDISLDLPGSSTPFWPLSPGSEGNVVNGESHVSGTT TLTDCLRRFTRPEHLGSSAKIKCSGCHSYQESTKQLTMKKLPIVACFHLKRFEHSAKLRR KITTYVSFPLELDMTPFMASSKESRMNGQYQQPTDSLNNDNKYSLFAVVNHQGTLESGHY TSFIRQHKDQWFKCDDAIITKASIKDVLDSEGCGPSAPGTHRVGMDHTDGEAPGAALKMD EMRGVLWVGGAAYTRHQNISCVMTWGCNGGLTAQSDRCLAFPSTRCGKAPTCCNSIKLHV EASRGFLETGIFLSLSRSFFHKGEFHKCNTSKVNELLSLYRNLGLFSSPDSELRSLDLGF NPGKQNRSVLLLPCEGTWETQMLPAPTPKPGQGAPRVSLESSQQNPRPVSSPDQHLKPFS VQLGSLPLLSPRSPMCEQQPLPKEYGNQVFKDTCLPTLKLAGNQKSLLWENRPKGKDPEA LTAGDSLQKMSKSLPDPTGAKLLHEKERKEEKEEKEKDEEEETQGWKKEKRGGQMRRGKR MMEEEEKMLLVSRASAGEEHKKIKGVLAKDQEQEDLTSEEAMGPRHKGQ >gi568815581r:20903034_21142835|GENSCAN_predicted_CDS_2|3930_bp atgccaaacgggaagacaggttctcgatcctatccaaggatctacgtggcactgtcttca gttcaaaggtttctggctctgtgtggaataggcactgtcctcctgcacctccatcaccag ttcctggacttcatgaaggggtacgctttgcatctggcttctttcacccaacatcatgtc tgtgatattcgagtaatgattatatgggtgtgcacatgtataaaactccactgctgggtt gaggtctccacgaccgagctggccttggtaagtggcgcccaaatgtggggctcgaacctg ggtcgaagtgctgccggagcaacggttggagaatgtggacctacactggaggacacccga gtactcttaagcaatccctgtgtgtgggctctggttcgttccaccttggagccttttcac acagatgacaaggaggaggaatcagaggaagaaggagagtataacgaagtaacagaagag gtgacagagcaggtttgcttgccagctaaagcggcaaaggagggagagggaccaaaagaa ccatatgcagattttatagttcggttacaggagtctcttaaaaaggtgattgcagattcg gctgctcaggatatagtgttgcggttactagctttcaacaatgccaatcctgagtgccaa gccgctctgcgacctattagagggaaagcacatttagttgattatattaaggcctgtgac agtattggagagcccgagggcgaggccatggacgccgagctggcggtagcgccgccgggc tgctcgcacctgggcagcttcaaggtggacaactggaagcagaacctgcgggccatctac cagtgcttcgtgtggagcggcacggctgaggcccgcaagcgcaagcgttcactcctcctc ttctctgaatttcaggccaagtcctgtatctgccatgtctgtggcgtccacctcaacagg ctgcattcctgcctctactgtgtcttcttcggctgtttcacaaagaagcatattcacgag catgcgaaggcgaagcggcacaacctggccattgatctgatgtacggaggcatctactgt tttctgtgccaggactacatctatgacaaagacatggaaataatcgccaaggaggagcag cgaaaagcttggaaaatgcaaggtcccccatgggcctgcagctgggccaagggaacaacc ccagcattacttaagctctcccagggcgcacctgcagatgtggggctgaacgagtgtggc gagggacctgcagtgggcttggggcgcagagtgatgcgtagcaacagccctgccctcaca gccttcagcgtctgtgtgggtaaagggactcaagttgattttggccagggtctgcgtggg ctgatcaaccttgggaacacatgcttcatgaactgcatcgtgcaggccctgacccacacg ccacttctgcgggacttcttcctgtctgacaggcaccgctgtgagatgcagagccccagc tcctgtctggtctgtgagatgtcctcactgtttcaggaggtctctctaagcatgccccat cagagcctctgtcctcatgagctacgcgctgcacgttggctgttcctgtgggctgcacgt gtggcgttcctgtccacttctgtccccggcatcagagcatggcagcgttcagagcaaggc gttcacggctgccgcctcagccctcaccggctccgtgacctagacatgcttctcagagct gaagcagctcagttacctcaactaataagcaggctggtgcctgttcttcagttttactct ggacaccggtcccctcacatcccgtataagttgctgcacctggtgtggacccacgcgagg cacctagcaggctacgagcagcaggacgcccacgagttcctcatcgcggccctggacgtg ctccaccgacactgcaaaggtcttcctaaagcccgtgctggcgtttctgattcagcagat ctgtggggggcaggggcttcttcactcttaatcagttcccagctgatgttgatgccgctg gttcagatgccatactctgagaacccatgcgctcaggtcacagggcacctggctgcttgt gatgacaatgggaagaaggccaacaaccccaaccactgcaactgcatcatagaccagatc ttcacaggcgggttgcagtcagacgtcacctgccaagtctgccatggagtctccaccacc atcgaccccttctgggacatcagcttggatctccccggctcttccaccccattctggccc ctgagcccagggagcgagggcaacgtggtaaacggggaaagccacgtgtcgggaaccacc acgctcacggactgcctgcgacgattcaccagaccagagcacttgggcagcagcgccaag atcaagtgcagcggttgccatagctaccaggagtccacaaagcagctcactatgaagaaa ctgcccatcgtagcctgttttcatctcaaacgatttgaacactcagccaagctgcggcgg aagatcaccacgtatgtgtccttccccctggagctggacatgacccctttcatggcctcc agcaaagagagcaggatgaatggacagtaccagcagcccacggacagtctcaacaatgac aacaagtattccctgtttgctgttgttaaccatcaagggaccttggagagtggccactac accagctttatccggcagcacaaagaccagtggttcaagtgtgacgatgccatcatcacc aaggccagcatcaaggacgtcctggacagcgaaggatgtggcccctctgcacctgggacc catcgggtcgggatggaccacacggacggggaggctcctggagctgctttgaagatggat gagatgaggggtgtgctctgggtgggaggagcagcgtacacccgtcaccagaacatctct tgtgtcatgacatgggggtgcaacgggggcctcacagcacagagtgaccgctgcctggcg ttccccagcactcggtgtggaaaggcccctacctgctgtaacagcatcaagctgcacgtg gaagcatctcgcggttttctagaaacaggcattttcttatccctctcccgctcctttttc cacaaaggtgaatttcataaatgtaatactagtaaagtgaatgaattactgagtttatac agaaatttagggctgttcagcagccctgactcagaactcaggtccctggaccttggcttc aaccccgggaagcaaaaccggagcgtcctgcttctcccttgtgagggcacatgggagaca cagatgctccctgcacctacccccaagcctggacagggagctcctcgcgtcagcctggag tcgtcacaacagaaccccaggccagtctcctctcctgatcaacatctcaagccattttca gtccagcttgggagcctgcccctgctctccccacgaagcccaatgtgtgaacagcagccc ctacctaaggagtacggcaaccaggtttttaaagacacctgccttcctaccctgaaactg gcaggaaatcagaagtctctgctctgggaaaacagacccaaaggaaaagacccagaggca ctgacagcgggagattctctccagaaaatgtccaagtccctgcctgatcccactggagca aagcttctacacgagaaggagaggaaggaggaaaaagaggagaaggagaaggatgaagag gaggagacacaagggtggaagaaggaaaaaagaggaggacagatgaggagggggaagagg atgatggaggaggaggaaaagatgcttttggtttctagggccagtgctggagaggagcac aagaagatcaagggcgttctggcaaaggatcaggaacaagaagatctgacatcagaggag gccatgggccccaggcacaaaggacagtag >gi568815581r:20903034_21142835|GENSCAN_predicted_peptide_3|133_aa MRDMWSTCEPNRQPKPPDLQISEQENVGRKPILTMEGGAVSVQLIRYLRKRNLVETFPQP PSSIIQRTETTLDAVAEIIGCDTHSILSTIYTFLQDPRNTNHDLRNFPPEDEGTVVMRVD PIRSRVHRVLCTP >gi568815581r:20903034_21142835|GENSCAN_predicted_CDS_3|402_bp atgagagacatgtggagcacatgtgaacctaatcggcagcccaaaccacctgacctacag atctctgagcaagaaaatgttggtcgcaaaccaattttaacaatggaaggtggcgctgtt tccgtgcagctcatcaggtacctcagaaaaagaaatttggtggagacgtttccgcagcct ccaagcagcatcattcaaaggacagaaacaacccttgatgctgttgcagaaatcataggt tgtgacacccacagcattctctcaaccatctacacattcctccaagatccacggaacacc aaccacgaccttcggaatttccctccggaagatgaaggcactgtagttatgcgggtggac cctatcagatctcgggtccatcgggtcctgtgtacaccttga