GENSCAN 1.0 Date run: 4-Nov-116 Time: 23:30:35

Sequence gi568815593r:147724664_147931577 : 206914 bp : 38.36% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Sngl +  14727  15080  354  2  0    8   54   228 0.386   7.20
 1.02 PlyA +  15309  15314    6                               1.05

 2.03 PlyA -  15349  15344    6                               1.05
 2.02 Term -  27479  27426   54  2  0   89   48    44 0.794  -2.92
 2.01 Init -  28024  27899  126  1  0  102  100    69 0.982   9.72
 2.00 Prom -  28302  28263   40                              -0.65

 3.03 PlyA -  28927  28922    6                               1.05
 3.02 Term -  58347  57890  458  1  2   82   41   346 0.805  23.60
 3.01 Init -  66210  66102  109  0  1   64   55    52 0.306  -0.07
 3.00 Prom -  70070  70031   40                              -0.05

 4.00 Prom +  72010  72049   40                              -1.05
 4.01 Sngl +  73704  73865  162  2  0   40   44   168 0.526   1.95
 4.02 PlyA +  75014  75019    6                               1.05

 5.04 PlyA -  79548  79543    6                               1.05
 5.03 Term -  83083  82959  125  2  2   59   43    95 0.799  -0.33
 5.02 Intr -  84246  84124  123  1  0   36   97    77 0.604   3.04
 5.01 Init -  85367  85304   64  2  1   77   32   106 0.355   5.26
 5.00 Prom -  91324  91285   40                              -7.25

 6.04 PlyA -  91818  91813    6                               1.05
 6.03 Term -  98694  98485  210  0  0  -86   38   350 0.908   8.91
 6.02 Intr - 104967 104936   32  1  2   83   88    33 0.819  -0.27
 6.01 Init - 106914 106860   55  0  1   87  119    52 0.905   7.80
 6.00 Prom - 116047 116008   40                              -6.05

 7.04 PlyA - 118884 118879    6                               1.05
 7.03 Term - 127649 127263  387  2  0   30   46   225 0.372   6.45
 7.02 Intr - 128008 127797  212  2  2   79   54   154 0.053   8.81
 7.01 Init - 142009 141943   67  1  1   68   56    78 0.155   3.89
 7.00 Prom - 148123 148084   40                              -2.55

 8.00 Prom + 148212 148251   40                              -9.05
 8.01 Init + 149048 149225  178  1  1   50   45   145 0.182   5.87
 8.02 Term + 152484 152596  113  1  2   53   34   133 0.518   2.14
 8.03 PlyA + 152893 152898    6                               1.05

 9.00 Prom + 154011 154050   40                              -6.15
 9.01 Init + 156522 156588   67  2  1   79  115    63 0.824   9.39
 9.02 Intr + 156783 156985  203  1  2  132   51   175 0.904  16.28
 9.03 Intr + 158540 158622   83  1  2   64   92    41 0.408  -0.38
 9.04 Intr + 160916 161026  111  2  0   54   81    60 0.111   0.48
 9.05 Term + 166478 166652  175  2  1   38   48   134 0.121   0.65
 9.06 PlyA + 168338 168343    6                               1.05

10.04 PlyA - 169686 169681    6                               1.05
10.03 Term - 172378 172330   49  0  1   83   44    64 0.301  -2.70
10.02 Intr - 177110 176966  145  0  1   95   87   118 0.740  10.82
10.01 Init - 181838 181769   70  2  1   80  111   114 0.999  12.18
10.00 Prom - 181910 181871   40                              -5.55

11.00 Prom + 191959 191998   40                              -3.85
11.01 Init + 192110 192147   38  1  2   72   83    29 0.197   0.33
11.02 Intr + 197479 197563   85  1  1   68  -12   111 0.502  -2.00
11.03 Intr + 197678 197898  221  1  2   57   57   247 0.712  14.68
11.04 Term + 198088 198277  190  1  1   35   47   245 0.411  11.04
11.05 PlyA + 198844 198849    6                               1.05

12.00 Prom + 200172 200211   40                              -8.45
12.01 Init + 200472 200553   82  2  1   67   81    85 0.498   6.88
12.02 Intr + 204639 204776  138  1  0   46   76   110 0.175   5.11

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init - 128003 127797  207  2  0   83   54   146 0.861   9.57

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593r:147724664_147931577|GENSCAN_predicted_peptide_1|117_aa
MPRSTTWKRQRDFIHKNLSGAFSQTTSGGARRGWPNSDVDWCMRRTEAHRRASEEPTKNV
HKVKESALNLFFSRQQKLHRRELEALLKQHHGSSNTFSVTLCPFPFQCSNFSEGDRG

>gi568815593r:147724664_147931577|GENSCAN_predicted_CDS_1|354_bp
atgcccagaagtaccacctggaagaggcaacgagactttattcataagaatctaagtgga
gcattctcccagacaaccagcggcggagcaaggaggggctggccaaacagtgacgttgac
tggtgcatgagacgaacagaggctcacagaagagcctctgaagagcccacaaaaaatgtc
cataaggtaaaagagtcagctttgaacttatttttcagtaggcaacagaagcttcacaga
agagagcttgaagctctcttgaagcagcatcatggttcaagcaataccttctcggtcaca
ctgtgccccttccctttccagtgctccaacttcagtgagggagatagagggtga

>gi568815593r:147724664_147931577|GENSCAN_predicted_peptide_2|59_aa
MGQAIITTITSTKDNCTASCCHATNFSSKGPVASILGFVGLKMSSLREAFLIPMQTHSK

>gi568815593r:147724664_147931577|GENSCAN_predicted_CDS_2|180_bp
atgggccaagccatcatcaccaccatcacttccaccaaggataactgcacagcctcctgc
tgccatgcaaccaacttttcctccaaaggcccagtagcaagtattttaggttttgtgggc
cttaagatgtcatccctgagagaagccttcctgatcccaatgcagactcattccaaatga

>gi568815593r:147724664_147931577|GENSCAN_predicted_peptide_3|188_aa
MNIAETRTPISNFGHIVFQQMGLKPNLASPKKVRRKAPLAPFLKRPGVQGRRHQEHPRGQ
QHKFTPLQTLQLASPISPPPELGRLSLVIAMATSSCRASSLYRSRGGALGEGERERRSQR
RCLLQVPGCFRAHPILREGGGPTGDAAAAAAAAAAASIADTLVGLEGWFLFAPSPTRDRG
KKKTGSHC

>gi568815593r:147724664_147931577|GENSCAN_predicted_CDS_3|567_bp
atgaacattgctgaaacaagaactccaatatctaactttggccatattgtgtttcagcag
atgggtttaaaacccaatctggcaagtccaaagaaagttagaagaaaggccccccttgcc
ccatttttaaaaagacctggagttcagggtaggagacatcaagagcatcctagagggcag
cagcacaaattcactcccctccagaccctgcagcttgcttcgcccatctcccctccccct
gagcttggtcgcttatcattggtcattgccatggcaaccagcagctgccgggccagcagc
ctctaccgcagcaggggtggggcactgggggagggggagagagagaggaggagccagagg
aggtgtctgctgcaagttcctggctgcttccgagctcaccccatcctccgagagggagga
ggcccaactggtgatgctgctgctgctgctgctgccgccgccgccgcctctattgctgat
actctagtggggctggaagggtggttcctattcgcaccatcgccaaccagagacagaggg
aaaaaaaaaaccggcagccactgctga

>gi568815593r:147724664_147931577|GENSCAN_predicted_peptide_4|53_aa
MEEITAVMVEIARALELEVDSEDVPVFLLSHDTTFTDEDEELHLIDDQRKVVS

>gi568815593r:147724664_147931577|GENSCAN_predicted_CDS_4|162_bp
atggaggaaataactgcagttatggtggaaatagcaagagcactggaattagaagtagac
tctgaagatgtgcctgtattcctgctatctcatgatacaactttcacagatgaggacgag
gagttgcatcttatcgatgaccaaagaaaagtggtttcttga

>gi568815593r:147724664_147931577|GENSCAN_predicted_peptide_5|103_aa
MHEWVAFEQKFKQEEASPAEVRSGEMDLHLQGNVESMEDFGTFQSMFLNSYWAKMEKLMM
QEDVSQSLSPSTLAVTQSDEVKNVHGSRDESYALAQQHGFHQG

>gi568815593r:147724664_147931577|GENSCAN_predicted_CDS_5|312_bp
atgcatgagtgggtagcatttgagcagaagtttaaacaggaagaagccagccctgcagag
gttcggagtggagaaatggatctgcatctccagggaaatgtggagtcaatggaggacttt
ggaactttccaaagcatgtttttgaactcatattgggcaaagatggagaaattgatgatg
caagaagatgtcagccagtctctttccccaagcactctagcagtcactcaatctgatgag
gttaaaaatgttcatggcagtagggatgaaagctatgcattggctcaacaacatggattt
caccagggctga

>gi568815593r:147724664_147931577|GENSCAN_predicted_peptide_6|98_aa
MKVTGIFLLSALALLSLSGNTGADSLGREKNKEEEEEEEEVEEEEEEVEEEKKEKEKEEE
RRKRRRREERGGGGGGKEKEGQGEEEEGEVEEEDEKAA

>gi568815593r:147724664_147931577|GENSCAN_predicted_CDS_6|297_bp
atgaaggtaacaggcatctttcttctcagtgccttggccctgttgagtctatctggtaac
actggagctgactccctgggaagagagaagaataaggaggaggaggaggaggaagaggag
gtagaggaggaggaagaggaggtagaggaggagaagaaggagaaggaaaaggaggaggaa
agaagaaagaggaggaggagagaagagagaggaggaggaggagggggaaaggagaaggag
gggcagggggaggaagaggagggggaggtggaggaagaagatgaaaaagcagcttga

>gi568815593r:147724664_147931577|GENSCAN_predicted_peptide_7|221_aa
MGAVCEALRQYSLGIAELDGCQGSMENVSALSLLTVESPTSMFDYCDDSLERVKSALDIF
SMIIYTVTFFLGLAGNGLVIWVVGFHMSCTVNTCLPSDPHLHGPLTCDPVANLVLEQLHT
SKGNSGALEDLAFGNLFLCSLLDLQGNSWWKVSPSLYNQYDLQNETQGSHQLWKEIIIPW
HQTLVTTAHFFFGFFLPLAIITGYYILVALKLRERQLVKFS

>gi568815593r:147724664_147931577|GENSCAN_predicted_CDS_7|666_bp
atgggggctgtctgtgaagctttgcggcagtacagcctaggtattgctgaacttgatggg
tgtcagggtagcatggagaatgtgtctgcattgtcactgttgactgtggagagtcccacg
tccatgtttgactattgtgatgactctttggagagggtcaagtctgctcttgacatcttt
tccatgatcatctacacagtgactttcttcctaggcttggctggcaatggccttgtcatt
tgggtagttggattccacatgtcctgcacagtcaacacgtgtcttccttctgaccctcat
ctccatggaccactgacttgtgatcctgtggccaatctagtcctggaacaattgcacacc
agcaaaggcaactctggggcccttgaggacctggcttttggcaatttgtttctctgttcc
ctacttgatcttcaaggaaactcgtggtggaaagtgtcaccctctttgtacaaccagtat
gatctgcagaatgaaactcaaggaagtcaccaactttggaaagagattatcattccatgg
caccaaacgctggtcacaacagcccactttttctttggcttctttctccctctggctatc
atcactggctactacatccttgtagccttgaagttaagagaaaggcagctggttaagttt
agctga

>gi568815593r:147724664_147931577|GENSCAN_predicted_peptide_8|96_aa
MIKHQGKAAFPVGDRCRSFGFTDKTCLFCLYQKMKGIEIKIRERLKCGAKIERRKRLRDI
HLYLIIGPREQAAGPCSSGNGINLTTNLGSLRGLVY

>gi568815593r:147724664_147931577|GENSCAN_predicted_CDS_8|291_bp
atgattaaacaccaagggaaggctgccttcccagtcggtgaccggtgccggagttttggg
ttcacggataaaacatgtctcttttgtctctaccagaaaatgaaaggaattgaaattaag
ataagggagagactgaagtgtggcgccaagattgaaaggagaaagaggttgagggatatt
cacctgtatctcatcattggaccacgagagcaagcagctggaccctgttcctccggcaac
ggcatcaatcttaccaccaaccttggctctttgaggggcttggtatattaa

>gi568815593r:147724664_147931577|GENSCAN_predicted_peptide_9|212_aa
MAIAGFSEEVIFEERPNNVRKKATAFLINKVPLPVDKLAPLPLDNILPFMDPLKLLLKTL
GISVEHLVEGLRKCVNELGPEASEAVKKLLGEVSGNDCTTSELQVLLDRPALVPAFAGGV
SNRGIKKESGDKLCLGRQAETDDKHYHKSDNTKCWHDTNQPPEPQFLHLSYAENNTSLIH
FNRIAVMRLQGDDTGKESQKKNSMQMKEILDL

>gi568815593r:147724664_147931577|GENSCAN_predicted_CDS_9|639_bp
atggccattgcaggcttctccgaggaggtaatatttgaagagagaccaaataatgtgaga
aagaaagctactgccttcctcatcaacaaagtgccccttcctgttgacaagttggcacct
ttacctctggacaacattcttccctttatggatccattaaagcttcttctgaaaactctg
ggcatttctgttgagcaccttgtggaggggctaaggaagtgtgtaaatgagctgggacca
gaggcttctgaagctgtgaagaaactgctgggagaagtctctggaaatgactgtaccacg
tctgagctccaggtcctgctagataggcctgccttggtcccagcttttgctgggggtgta
tcaaacagagggattaagaaggaatctggggacaaattatgtttgggaaggcaggcagaa
actgatgataaacattatcataaaagtgacaacaccaaatgctggcatgataccaatcag
cctcctgagcctcagtttcttcacctgtcatatgcagaaaacaatacctccctaattcac
ttcaatcgcattgcagttatgagattacaaggagatgatactgggaaagagtcccaaaag
aaaaattctatgcagatgaaagagattcttgatctttaa

>gi568815593r:147724664_147931577|GENSCAN_predicted_peptide_10|87_aa
MAVSVLRLTVVLGLLVLFLTCYADDKPDKPDDKPDDSGKDPKPDFPKFLSLLGTEIIENA
VEFILRSMSRSTGFMEFDDNEGKHSSK

>gi568815593r:147724664_147931577|GENSCAN_predicted_CDS_10|264_bp
atggctgtctcagtacttcgcctgacagttgtcctgggactgcttgtcttattcctgacc
tgctatgcagacgacaaaccagacaagccagacgacaagccagacgactcgggcaaagac
ccaaagccagacttccccaaattcctaagcctcctgggcacagagatcattgagaatgca
gtcgagttcatcctccgctccatgtccaggagcacaggatttatggaatttgatgataat
gaaggaaaacattcatcaaagtga

>gi568815593r:147724664_147931577|GENSCAN_predicted_peptide_11|177_aa
MEEKEVRGWKRESVENPTRFLCEITKAAGSLNTCPENRRAGVLAFKGDDGFSVWESNAIA
TYVSNEELWGSAPEAAAQAVQWVNFADDSQYQGVPTLGKMHHDKQATQDAGEEVSPSSRL
SWVEMKLCENMAHFDAKIFAESQPKKDTPRKEKGSREEKQKPQAERKEEKKVATPAP

>gi568815593r:147724664_147931577|GENSCAN_predicted_CDS_11|534_bp
atggaagaaaaagaagtaagaggatggaaaagggagagcgtggagaaccctactcgcttt
ctctgcgaaatcaccaaggcagctgggagcctgaacacatgtcctgaaaaccggagggcg
ggtgttctagcatttaagggtgatgatggattctctgtgtgggagagcaatgctattgcc
acctatgtgagcaatgaggagctgtggggaagtgctccagaggcagcagcccaggctgtg
cagtgggtgaactttgctgatgatagccagtaccagggtgttcccaccttgggcaaaatg
caccatgacaaacaggccacccaggatgcaggggaagaggtgagccccagttccaggctg
tcttgggtggaaatgaaactgtgtgagaacatggcccactttgatgctaaaatatttgca
gagagccagcctaaaaaggacactccacggaaagagaaaggttcacgggaagagaagcag
aagccccaggctgagcggaaggaggagaaaaaggtggccacccctgctccttaa

>gi568815593r:147724664_147931577|GENSCAN_predicted_peptide_12|74_aa
MTPASASSEGLRKLPLMREGAGEQVSHGDRQGNYSDFTMKREHRRKIRGMMVNFGWALIE
IEGHLQRLGSFPTX

>gi568815593r:147724664_147931577|GENSCAN_predicted_CDS_12|222_bp
atgacaccagcatctgcttccagtgagggccttaggaagcttccactcatgcgggaaggt
gcaggggagcaagtatcacatggtgatcggcagggaaattatagtgactttaccatgaaa
agggagcacagaaggaagattagaggaatgatggtaaattttggttgggcacttatcgaa
attgaggggcatcttcagcgtttaggtagttttcccacagnn