GENSCAN 1.0 Date run: 7-Nov-116 Time: 22:24:49 Sequence gi568815583r:30969836_31170229 : 200394 bp : 43.57% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.05 Intr - 4621 4479 143 2 2 93 85 123 0.775 12.57 1.04 Intr - 5168 5096 73 2 1 59 98 15 0.247 -1.42 1.03 Intr - 7120 6984 137 2 2 58 89 43 0.076 1.69 1.02 Intr - 21767 21612 156 0 0 47 97 59 0.456 2.68 1.01 Init - 21875 21797 79 2 1 71 31 227 0.749 14.52 1.00 Prom - 22502 22463 40 -5.56 2.27 PlyA - 23166 23161 6 1.05 2.26 Term - 33235 31987 1249 0 1 80 34 624 0.163 47.26 2.25 Intr - 56436 56304 133 1 1 78 102 307 0.969 30.80 2.24 Intr - 57282 57080 203 2 2 61 27 195 0.991 9.63 2.23 Intr - 58641 58497 145 1 1 49 70 115 0.664 5.14 2.22 Intr - 61322 61148 175 2 1 19 66 299 0.735 20.31 2.21 Intr - 63105 62854 252 0 0 73 49 404 0.891 32.63 2.20 Intr - 65839 65711 129 1 0 90 24 235 0.792 18.19 2.19 Intr - 68007 67876 132 0 0 115 51 145 0.999 14.34 2.18 Intr - 68331 68209 123 0 0 71 73 122 0.989 9.78 2.17 Intr - 70511 70283 229 1 1 112 100 352 0.995 36.67 2.16 Intr - 72408 72116 293 0 2 98 32 524 0.985 43.73 2.15 Intr - 77416 77276 141 1 0 55 37 231 0.987 15.25 2.14 Intr - 79674 79540 135 0 0 111 79 70 0.990 9.16 2.13 Intr - 80747 80574 174 2 0 31 71 242 0.737 16.94 2.12 Intr - 90809 90709 101 0 2 100 24 106 0.929 5.23 2.11 Intr - 91679 91607 73 2 1 93 105 79 0.977 9.08 2.10 Intr - 92867 92744 124 1 1 72 99 11 0.987 1.19 2.09 Intr - 93457 93283 175 2 1 65 57 236 0.975 17.20 2.08 Intr - 96412 96241 172 1 1 63 108 286 0.997 27.62 2.07 Intr - 97352 97228 125 0 2 67 95 85 0.983 7.40 2.06 Intr - 98257 98044 214 1 1 13 110 242 0.980 17.19 2.05 Intr - 100391 100196 196 1 1 86 81 126 0.978 11.12 2.04 Intr - 107149 107070 80 1 2 113 99 14 0.599 3.35 2.03 Intr - 134204 134070 135 2 0 62 105 100 0.091 9.86 2.02 Intr - 157465 157346 120 1 0 27 77 68 0.008 0.29 2.01 Init - 166035 165964 72 0 0 96 82 66 0.832 7.87 2.00 Prom - 166989 166950 40 -6.86 3.00 Prom + 174776 174815 40 -1.06 3.01 Init + 175068 175116 49 2 1 91 89 -1 0.575 -0.13 3.02 Intr + 182048 182152 105 0 0 66 82 119 0.462 9.29 3.03 Intr + 188059 188190 132 2 0 94 14 72 0.058 1.02 3.04 Intr + 191049 191140 92 1 2 78 50 101 0.162 5.01 3.05 Intr + 193699 193853 155 0 2 106 95 -3 0.137 1.07 3.06 Term + 197846 197924 79 2 1 88 48 41 0.054 -2.66 3.07 PlyA + 198617 198622 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583r:30969836_31170229|GENSCAN_predicted_peptide_1|196_aa MGPGARVGLAVPGPSPLGRALRRIPAQRAWAAALRALRPEASALPLARDGNGAKRRRHHV LPQAAQTHLQVLPPATAPGEIVVNEVNFVRKCIATDTSQYDLWGKLICSNFKISFITDDP MPLQKFHYRNLLLGEHDVPLTCIEQIVTVNDHKRKQKVLGPNQKLKFNPTELIIYCKDFR IVRFRFDESGPESAKK >gi568815583r:30969836_31170229|GENSCAN_predicted_CDS_1|588_bp atgggccccggcgcccgcgtcggcctggctgtgcccggcccctccccgctcgggcgggcg ctgcgccgtatccccgcccaaagggcctgggcggccgcactgagagctttacgcccggag gcgtcggcgctgccactggcccgcgacgggaacggggcgaaaaggcggcggcaccatgtt ctccctcaagccgcccaaacccaccttcaggtcctacctcctgccaccgccccaggagaa attgtcgtaaatgaagtcaattttgtgagaaaatgcattgcaacagacacaagccagtac gatttgtggggaaagctgatatgcagtaacttcaaaatctcctttattacagatgaccca atgccattacagaaattccattacagaaaccttcttcttggtgaacacgatgtcccttta acatgtattgagcaaattgtcacagtaaacgaccacaagaggaagcagaaagtcctaggc cccaaccagaaactgaaatttaatccaacagagttaattatttattgtaaagatttcaga attgtcagatttcgctttgatgaatcaggtcccgaaagtgctaaaaag >gi568815583r:30969836_31170229|GENSCAN_predicted_peptide_2|1699_aa MEPASASCEGFRLPPLMVEEEEEPMAATHKRPQVAENEKHTQTDWASLEMPRDKLCNGLF STVKSSHTVSMHQVIQVTSWGRMPLESGQSGENPFSPQLQLNTAQQKYKGQKSWIEKTFC KRECIFVIPSMKDSNRCCCGQFTNQHIPPLPSATPSKNEEESKQVETQPEKWSVAKHTQS YPTDSYGVLEFQGGGYSNKAMYIRVSYDTKPDSLLHLMVKDWQLELPKLLISVHGGLQNF EMQPKLKQVFGKGLIKAAMTTGAWIFTGGVSTGVISHVGDALKDHSSKSRGRVCAIGIAP WGIVENKEDLVGKDVTRVYQTMSNPLSKLSVLNNSHTHFILADNGTLGKYGAEVKLRRLL EKHISLQKINTRLGQGVPLVGLVVEGGPNVVSIVLEYLQEEPPIPVVICDGSGRASDILS FAHKYCEEGGIINESLREQLLVTIQKTFNYNKAQSHQLFAIIMECMKKKELVTVFRMGSE GQQDIEMAILTALLKGTNVSAPDQLSLALAWNRVDIARSQIFVFGPHWPPLGSLAPPTDS KATEKEKKPPMATTKGGRGKGKGKKKGKVKEEVEEETDPRKIELLNWVNALEQAMLDALV LDRVDFVKLLIENGVNMQHFLTIPRLEELYNTSNLPPDYHISLIDIGLVLEYLMGGAYRC NYTRKNFRTLYNNLFGPKRDDEPPAKGKKKKKKKKEEEIDIDVDDPAVSRFQYPFHELMV WAVLMKRQKMAVFLWQRGEESMAKALVACKLYKAMAHESSESDLVDDISQDLDNNSKDFG QLALELLDQSYKHDEQIAMKLLTYELKNWSNSTCLKLAVAAKHRDFIAHTCSQMLLTDMW MGRLRMRKNPGLKVIMGILLPPTILFLEFRTYDDFSYQTSKENEDGKEKEEENTDANADA GSRKGDEENEHKKQRSIPIGTKICEFYNAPIVKFWFYTISYLGYLLLFNYVILVRMDGWP SLQEWIVISYIVSLALEKIREILMSEPGKLSQKIKVWLQEYWNITDLVAISTFMIGAILR LQNQPYMGYGRVIYCVDIIFWYIRVLDIFGVNKYLGPYVMMIGKMMIDMLYFVVIMLVVL MSFGVARQAILHPEEKPSWKLARNIFYMPYWMIYGEVFADQIDPPCGENLYDEEGKRLPP CIPGAWLTPALMACYLLVANILLVNLLIAVFNNTFFEVKSISNQVWKFQRYQLIMTFHDR PVLPPPMIILSHIYIIIMRLSGRCRKKREGDQEERDRGLKLFLSDEELKRLHEFEEQCVQ EHFREKEDEQQSSSDERIRVTSERVENMSMRLEEINERETFMKTSLQTVDLRLAQLEELS NRMVNALENLAGIDRSDLIQARSRASSECEATYLLRQSSINSADGYSLYRYHFNGEELLF EDTSLSTSPGTGVRKKTCSFRIKEEKDVKTHLVPECQNSLHLSLGTSTSATPDGSHLAVD DLKNAEESKLGPDIGISKEDDERQTDSKKEETISPSLNKTDVIHGQDKSDVQNTQLTVET TNIEGTISYPLEETKITRYFPDETINACKTMKSRSFVYSRGRKLVGGVNQDVEYSSITDQ QLTTEWQCQVQKITRSHSTDIPYIVSEAAVQAEHKEQFADMQDEHHVAEAIPRIPRLSLT ITDRNGMENLLSVKPDQTLGFPSLRSKSLHGHPRNVKSIQGKLDRSGHASSVSSLVIVSG MTAEEKKVKKEKASTETEC >gi568815583r:30969836_31170229|GENSCAN_predicted_CDS_2|5100_bp atggagccagcgtctgcttcctgtgagggcttcaggctgcctccactcatggtggaagag gaagaggagccgatggcagctacccacaagaggccccaagtagcagagaatgagaaacat acacaaactgactgggcatccttggagatgccacgagataagctttgcaatggcctcttc tccacagtcaagtcctcacacactgtgtccatgcaccaggtcatccaagtgacttcatgg gggaggatgcctctggagagtggacagtctggtgagaaccccttttccccacagctgcag ctcaacaccgcccaacagaaatacaagggtcagaaatcttggatagagaaaaccttttgc aaacgggaatgtatctttgtaattcctagcatgaaagactctaacaggtgttgctgtggc cagttcaccaaccagcatatcccccctctgccaagtgcaacacccagcaaaaatgaagag gaaagcaaacaggtggagactcagcctgagaaatggtctgttgccaagcacacccagagc tacccaacagattcctatggagttcttgaattccagggtggcggatattccaataaagcc atgtatatccgtgtatcctatgacaccaagccagactcactgctccatctcatggtgaaa gattggcagctggaactccccaagctcttaatatctgtgcatggaggcctccagaacttt gagatgcagcccaagctgaaacaagtctttgggaaaggcctgatcaaggctgctatgacc accggggcctggatcttcaccgggggtgtcagcacaggtgttatcagccacgtaggggat gccttgaaagaccactcctccaagtccagaggccgggtttgtgctataggaattgctcca tggggcatcgtggagaataaggaagacctggttggaaaggatgtaacaagagtgtaccag accatgtccaaccctctaagtaagctctctgtgctcaacaactcccacacccacttcatc ctggctgacaatggcaccctgggcaagtatggcgccgaggtgaagctgcgaaggctgctg gaaaagcacatctccctgcagaagatcaacacaagactggggcagggcgtgcccctcgtg ggtctcgtggtggaggggggccctaacgtggtgtccatcgtcttggaatacctgcaagaa gagcctcccatccctgtggtgatttgtgatggcagcggacgtgcctcggacatcctgtcc tttgcgcacaagtactgtgaagaaggcggaataataaatgagtccctcagggagcagctt ctagttaccattcagaaaacatttaattataataaggcacaatcacatcagctgtttgca attataatggagtgcatgaagaagaaagaactcgtcactgtgttcagaatgggttctgag ggccagcaggacatcgagatggcaattttaactgccctgctgaaaggaacaaacgtatct gctccagatcagctgagcttggcactggcttggaaccgcgtggacatagcacgaagccag atctttgtctttgggccccactggccgcccctgggaagcctggcacccccgacggacagc aaagccacggagaaggagaagaagccacccatggccaccaccaagggaggaagaggaaaa gggaaaggcaagaagaaagggaaagtgaaagaggaagtggaggaagaaactgacccccgg aagatagagctgctgaactgggtgaatgctttggagcaagcgatgctagatgctttagtc ttagatcgtgtcgactttgtgaagctcctgattgaaaacggagtgaacatgcaacacttt ctgaccattccgaggctggaggagctttataacacaagcaaccttccgcctgattaccac atcagcctcatagacatcgggctcgtgctggagtacctcatgggaggagcctaccgctgc aactacactcggaaaaactttcggaccctttacaacaacttgtttggaccaaagagggat gatgagcctccagctaaagggaagaaaaagaaaaagaagaaaaaggaggaagagatcgac attgatgtggacgaccctgccgtgagtcggttccagtatcccttccacgagctgatggtg tgggcagtgctgatgaaacgccagaaaatggcagtgttcctctggcagcgaggggaagag agcatggccaaggccctggtggcctgcaagctctacaaggccatggcccacgagtcctcc gagagtgatctggtggatgacatctcccaggacttggataacaattccaaagacttcggc cagcttgctttggagttattagaccagtcctataagcatgacgagcagatcgctatgaaa ctcctgacctacgagctgaaaaactggagcaactcgacctgcctcaaactggccgtggca gccaaacaccgggacttcattgctcacacctgcagccagatgctgctgaccgatatgtgg atgggaagactgcggatgcggaagaaccccggcctgaaggttatcatggggattcttcta ccccccaccatcttgtttttggaatttcgcacatatgatgatttctcgtatcaaacatcc aaggaaaatgaggatggcaaagaaaaagaagaggaaaatacggatgcaaatgcagatgct ggctcaagaaagggggatgaggagaacgagcacaaaaaacagagaagtattcccatcgga acaaagatctgtgaattctataacgcgcccattgtcaagttctggttttacacaatatca tacttgggctacctgctgctgtttaactacgtcatcctggtgcggatggatggctggccg tccctccaggagtggatcgtcatctcctacatcgtgagcctggcgttagagaagatacga gagatcctcatgtcagaaccaggcaaactcagccagaaaatcaaagtttggcttcaggag tactggaacatcacagatctcgtggccatttccacattcatgattggagcaattcttcgc ctacagaaccagccctacatgggctatggccgggtgatctactgtgtggatatcatcttc tggtacatccgtgtcctggacatctttggtgtcaacaagtatctggggccatacgtgatg atgattggaaagatgatgatcgacatgctgtactttgtggtcatcatgctggtcgtgctc atgagtttcggagtagcccgtcaagccattctgcatccagaggagaagccctcttggaaa ctggcccgaaacatcttctacatgccctactggatgatctatggagaggtgtttgcagac cagatagaccctccttgtggtgagaacctatatgatgaggagggcaagcggcttcctccc tgtatccccggcgcctggctcactccagcactcatggcgtgctatctactggtcgccaac atcctgctggtgaacctgctgattgctgtgttcaacaataccttctttgaagtaaaatca atatccaaccaggtgtggaagttccagcgatatcagctgattatgacatttcatgacagg ccagtcctgcccccaccgatgatcattttaagccacatctacatcatcattatgcgtctc agcggccgctgcaggaaaaagagagaaggggaccaagaggaacgggatcgtggattgaag ctcttccttagcgacgaggagctaaagaggctgcatgagttcgaggagcagtgcgtgcag gagcacttccgggagaaggaggatgagcagcagtcgtccagcgacgagcgcatccgggtc acttctgaaagagttgaaaatatgtcaatgaggttggaagaaatcaatgaaagagaaact tttatgaaaacttccctgcagactgttgaccttcgacttgctcagctagaagaattatct aacagaatggtgaatgctcttgaaaatcttgcgggaatcgacaggtctgacctgatccag gcacggtcccgggcttcttctgaatgtgaggcaacgtatcttctccggcaaagcagcatc aatagcgctgatggctacagcttgtatcgatatcattttaacggagaagagttattattt gaggatacatctctctccacgtcaccagggacaggagtcaggaaaaaaacctgttccttc cgtataaaggaagagaaggacgtgaaaacgcacctagtcccagaatgtcagaacagtctt cacctttcactgggcacaagcacatcagcaaccccagatggcagtcaccttgcagtagat gacttaaagaacgctgaagagtcaaaattaggtccagatattgggatttcaaaggaagat gatgaaagacagacagactctaaaaaagaagaaactatttccccaagtttaaataaaaca gatgtgatacatggacaggacaaatcagatgttcaaaacactcagctaacagtggaaacg acaaatatagaaggcactatttcctatcccctggaagaaaccaaaattacacgctatttc cccgatgaaacgatcaatgcttgtaaaacaatgaagtccagaagcttcgtctattcccgg ggaagaaagctggtcggtggggttaaccaggatgtagagtacagttcaatcacggaccag caattgacgacggaatggcaatgccaagttcaaaagatcacgcgctctcatagcacagat attccttacattgtgtcggaagctgcagtgcaagctgagcataaagagcagtttgcagat atgcaagatgaacaccatgtcgctgaagcaattcctcgaatccctcgcttgtccctaacc attactgacagaaatgggatggaaaacttactgtctgtgaagccagatcaaactttggga ttcccatctctcaggtcaaaaagtttacatggacatcctaggaatgtgaaatccattcag ggaaagttagacagatctggacatgccagtagtgtaagcagcttagtaattgtgtctgga atgacagcagaagaaaaaaaggttaagaaagagaaagcttccacagaaactgaatgctag >gi568815583r:30969836_31170229|GENSCAN_predicted_peptide_3|203_aa MGFLHVGQADLELPTSVCRNQASALFAIPLKVEDTEEQGIQENEVLDLDLPGQLVCFNKH GAEGCCFRLGNREPTLSCCVNEGPSVQRDRMKTAHCHWQSSLTFCDPDVELLSEPRLKEL ISLSEPVSCLSDGYPISKTGSLHALGLHVLATSVLWSPRQLSTPMQKRVTLKNADLVSLP GVTGAFSPTHLEAFSLPSVSLNY >gi568815583r:30969836_31170229|GENSCAN_predicted_CDS_3|612_bp atggggtttcttcatgttggtcaggctgatctcgagctcccgacctcagtgtgcaggaac caggcaagtgccctttttgccatcccgctaaaagtagaggatacagaagaacaaggaatc caagagaacgaggtactggacttggacctccctgggcagctggtgtgtttcaacaaacac ggtgcagaggggtgctgctttcgattaggaaaccgggaaccgacactgagctgctgcgtc aatgaagggcctagtgtgcaacgagatagaatgaaaacagcccactgccactggcaaagc tcactcaccttctgcgaccctgatgtggagctcttgagcgagccccgcttgaaggagctc atctctctcagtgagcctgtttcctgtctcagtgatggctaccccatcagcaagacagga agccttcatgctctgggactccatgtcctagccacctctgtactgtggtctcccagacag ctgtctacacctatgcaaaaaagggtaactttaaaaaatgcagatctggtcagtctccct ggggtcactggggccttttcaccaactcacctggaagctttctccctgccctctgtttcc ctgaattactga