GENSCAN 1.0	Date run: 3-Nov-116	Time: 00:36:06

Sequence gi568815578r:32096455_32302178 : 205724 bp : 47.16% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   2550   2681  132  1  0  108   91   254 0.884  28.32
 1.02 Term +   4863   5065  203  1  2   84   44   391 0.999  31.85
 1.03 PlyA +   5368   5373    6                               1.05

 2.00 Prom +   7165   7204   40                              -6.36
 2.01 Init +   8379   8433   55  2  1   35   48    86 0.091   0.75
 2.02 Intr +  12731  12751   21  0  0  105   93    27 0.604   2.42
 2.03 Intr +  13190  13301  112  0  1   79  101    61 0.535   5.94
 2.04 Intr +  13943  13994   52  2  1  105   85   -46 0.067  -4.29
 2.05 Intr +  36559  36672  114  0  0   69  115    50 0.327   6.44
 2.06 Intr +  39620  39719  100  1  1   28  106    32 0.344  -1.32
 2.07 Intr +  45043  45211  169  2  1   90   95   288 0.969  28.70
 2.08 Intr +  45312  45441  130  0  1   30   75   270 0.999  20.60
 2.09 Intr +  46528  46651  124  0  1  100   96   176 0.972  19.76
 2.10 Intr +  48637  48755  119  2  2   60   78    99 0.651   6.28
 2.11 Intr +  48858  48969  112  2  1   77   94    79 0.990   7.35
 2.12 Intr +  50331  50401   71  1  2   92   12   156 0.529   7.30
 2.13 Intr +  53180  53312  133  1  1   83   65   184 0.429  15.82
 2.14 Intr +  54168  54249   82  1  1  133   61    59 0.998   6.40
 2.15 Intr +  54346  54491  146  1  2   80  102   100 0.930  10.53
 2.16 Intr +  55089  55141   53  1  2   15   80    48 0.083  -4.67
 2.17 Intr +  58649  58732   84  1  0   87  115    -2 0.123   2.42
 2.18 Intr +  61340  61515  176  1  2   71   84   288 0.128  25.44
 2.19 Intr +  61928  62060  133  2  1   37   65   167 0.790   9.95
 2.20 Intr +  63538  63657  120  0  0   74   78   122 0.998  10.49
 2.21 Intr +  64822  64911   90  0  0  155   79    57 0.998  11.69
 2.22 Term +  68841  68990  150  2  0   81   38   274 0.999  19.61
 2.23 PlyA +  69067  69072    6                               1.05

 3.03 PlyA -  69275  69270    6                               1.05
 3.02 Term -  81138  80978  161  1  2   83   49    85 0.718   2.20
 3.01 Init -  86279  86069  211  2  1   64   54   116 0.462   5.05
 3.00 Prom -  89647  89608   40                              -5.76

 4.02 PlyA -  90054  90049    6                               1.05
 4.01 Sngl -  93815  92748 1068  2  0   83   42  1046 0.998  94.65
 4.00 Prom -  96293  96254   40                              -7.96

 5.04 PlyA -  99187  99182    6                               1.05
 5.03 Term - 101228  99998 1231  1  1  109   41  1031 0.818  91.58
 5.02 Intr - 107462 107236  227  2  2   67   67   151 0.812   7.68
 5.01 Init - 107679 107614   66  0  0   76   91    11 0.823   1.27
 5.00 Prom - 109158 109119   40                              -6.56

 6.00 Prom + 109651 109690   40                              -8.06
 6.01 Init + 111488 111611  124  1  1  110   82   242 0.999  24.13
 6.02 Intr + 113617 113738  122  2  2   98   92    95 0.999  11.11
 6.03 Intr + 118815 118997  183  2  0  102   98   229 0.981  25.28
 6.04 Intr + 120155 120267  113  1  2  118   65    -6 0.075  -0.72
 6.05 Intr + 131809 132001  193  1  1  108   28   150 0.870  10.49
 6.06 Intr + 134365 134607  243  0  0   95   95   289 0.960  27.99
 6.07 Term + 138019 138207  189  0  0  130   37   242 0.999  20.75
 6.08 PlyA + 140862 140867    6                               1.05

 7.00 Prom + 142150 142189   40                              -5.26
 7.01 Init + 151663 151764  102  0  0   21   70   151 0.086   4.88
 7.02 Intr + 160031 160052   22  1  1  119   67     4 0.010  -1.48
 7.03 Intr + 169064 169128   65  0  2  109   67    44 0.123   2.84
 7.04 Intr + 175941 176029   89  2  2   52  100    51 0.154   1.47
 7.05 Intr + 181221 181311   91  0  1   41  109    94 0.422   6.80

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 126682 126776   95  0  2   81   84    32 0.810   1.96

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815578r:32096455_32302178|GENSCAN_predicted_peptide_1|111_aa
XAKFPIKWTAPEAINFGSFTIKSDVWSFGILLMEIVTYGRIPYPGMSNPEVIRALERGYR
MPRPENCPEELYNIMMRCWKNRPEERPTFEYIQSVLDDFYTATESQYQQQP

>gi568815578r:32096455_32302178|GENSCAN_predicted_CDS_1|336_bp
ngggccaagttccccatcaagtggacagctcctgaagccatcaactttggctccttcacc
atcaagtcagacgtctggtcctttggtatcctgctgatggagatcgtcacctacggccgg
atcccttacccagggatgtcaaaccctgaagtgatccgagctctggagcgtggataccgg
atgcctcgcccagagaactgcccagaggagctctacaacatcatgatgcgctgctggaaa
aaccgtccggaggagcggccgaccttcgaatacatccagagtgtgctggatgacttctac
acggccacagagagccagtaccaacagcagccatga

>gi568815578r:32096455_32302178|GENSCAN_predicted_peptide_2|781_aa
MTPDAEEDVEQQELSFIADGNEEEQGAVEARRPPCGVKLRNQGAPLPLTSLRRHVDPRWR
RRWDLLPCWFSNFTVHKNLEDWLPWSLLLFSLMCETSAFYVPGVAPINFHQNDPVEIKAV
KLTSSRTQLPYEYYSLPFCQPSKITYKAENLGEVLRGDRIVNTPFQVLMNSEKKCEVLCS
QSNKPVTLTVEQSRLVAERITEDYYVHLIADNLPVATRLELYSNRDSDDKKKEKDVQFEH
GYRLGFTDVNKIYLHNHLSFILYYHREDMEEDQEHTYRVVRFEVIPQSIRLEDLKADEKS
SCTLPEGTNSSPQEIDPTKENQLYFTYSVHWEESDIKWASRWDTYLTMSDVQIHWFSIIN
SVVVVFFLSGILSMIIIRTLRKDIANYNKEDDIEDTMEESGWKLVHGDVFRPPQYPMILS
SLLGSGIQLFCMILIVIFVAMLGMLSPSSRGALMTTACFLFMFMGVFGGFSAGRLYRTLK
GHRWKKGAFCVSVLNLLLFGGKQRSFFLGSSGEAQGELRLYQFWLRRFWHKTATLYPGVV
FGICFVLNCFIWGKHSSGAVPFPTMVALLCMWFGISLPLVYLGYYFGFRKQPYDNPVRTN
QIPRQIPEQRWYMNRFVGLGRSLGLPGGLVSNNVNLSFCGSILMAGILPFGAMFIELFFI
FSAIWENQFYYLFGFLFLVFIILVVSCSQISIVMVYFQLCAEDYRWWWRNFLVSGGSAFY
VLVYAIFYFVNKLDIVEFIPSLLYFGYTALMVLSFWLLTGTIGFYAAYMFVRKIYAAVKI
D

>gi568815578r:32096455_32302178|GENSCAN_predicted_CDS_2|2346_bp
atgacaccagatgctgaggaggatgtggagcaacaggaactctcattcattgctgatggg
aatgaggaagagcaaggagcggttgaagccagacgaccaccttgtggagttaaactccgt
aaccagggagcaccacttccgctgacgtcattacggcgacacgtggatccaagatggcga
cggcgatgggaccttttaccctgttggttctcaaactttactgtgcataagaatctcgag
gattggttgccgtggtctttactgcttttctccctgatgtgtgaaacaagcgccttctat
gtgcctggggtcgcgcctatcaacttccaccagaacgatcccgtagaaatcaaggctgtg
aagctcaccagctctcgaacccagctaccttatgaatactattcactgcccttctgccag
cccagcaagataacctacaaggcagagaatctgggagaggtgctgagaggggaccggatt
gtcaacacccctttccaggttctcatgaacagcgagaagaagtgtgaagttctgtgcagc
cagtccaacaagccagtgaccctgacagtggagcagagccgactcgtggccgagcggatc
acagaagactactacgtccacctcattgctgacaacctgcctgtggccacccggctggag
ctctactccaaccgagacagcgatgacaagaagaaggaaaaagatgtgcagtttgaacac
ggctaccggctcggcttcacagatgtcaacaagatctacctgcacaaccacctctcattc
atcctttactatcatcgggaggacatggaagaggaccaggagcacacgtaccgtgtcgtc
cgcttcgaggtgattccccagagcatcaggctggaggacctcaaagcagatgagaagagt
tcgtgcactctgcctgagggtaccaactcctcgccccaagaaattgaccccaccaaggag
aatcagctgtacttcacctactctgtccactgggaggaaagtgatatcaaatgggcctct
cgctgggacacttacctgaccatgagtgacgtccagatccactggttttctatcattaac
tccgttgttgtggtcttcttcctgtcaggtatcctgagcatgattatcattcggaccctc
cggaaggacattgccaactacaacaaggaggatgacattgaagacaccatggaggagtct
gggtggaagttggtgcacggcgacgtcttcaggcccccccagtaccccatgatcctcagc
tccctgctgggctcaggcattcagctgttctgtatgatcctcatcgtcatctttgtagcc
atgcttgggatgctgtcgccctccagccggggagctctcatgaccacagcctgcttcctc
ttcatgttcatgggggtgtttggcggattttctgctggccgtctgtaccgcactttaaaa
ggccatcggtggaagaaaggagccttctgtgtgagtgtcctaaaccttctcttgtttggt
gggaagcagagaagcttcttcctgggctcctcaggagaagctcaaggagagctcaggctg
tatcagttctggctgcgacgcttctggcacaaaacggcaactctgtaccctggtgtggtt
tttggcatctgcttcgtattgaattgcttcatttggggaaagcactcatcaggagcggtg
ccctttcccaccatggtggctctgctgtgcatgtggttcgggatctccctgcccctcgtc
tacttgggctactacttcggcttccgaaagcagccatatgacaaccctgtgcgcaccaac
cagattccccggcagatccccgagcagcggtggtacatgaaccgatttgtgggcttggga
aggagcttggggcttcctggtggcctggtctctaacaatgtcaacctctcgttctgtggc
agcatcctcatggctgggatcttgcccttcggcgccatgttcatcgagctcttcttcatc
ttcagtgctatctgggagaatcagttctattacctctttggcttcctgttccttgttttc
atcatcctggtggtatcctgttcacaaatcagcatcgtcatggtgtacttccagctgtgt
gcagaggattaccgctggtggtggagaaatttcctagtctccgggggctctgcattctac
gtcctggtttatgccatcttttatttcgttaacaagctggacatcgtggagttcatcccc
tctctcctctactttggctacacggccctcatggtcttgtccttctggctgctaacgggt
accatcggcttctatgcagcctacatgtttgttcgcaagatctatgctgctgtgaagata
gactga

>gi568815578r:32096455_32302178|GENSCAN_predicted_peptide_3|123_aa
MYEGIRCLLKALLGFVSLAIGTLYCPRQYRPFPGSLGIEAINVPEPIPDSYYRDMATWPT
HAPSVEEGGQGRRIFLSAEQNEKSPMSTSFYTDTATIRFLNLFPTFPAFLFHKAAIVILA
RSQ

>gi568815578r:32096455_32302178|GENSCAN_predicted_CDS_3|372_bp
atgtatgaaggcatcagatgcctgctgaaggcccttctggggtttgtatctctagcgata
ggaactctctactgcccaaggcagtaccgcccttttccaggcagtcttggaattgaggca
ataaatgtcccggagcccattcccgactcctactacagagacatggccacgtggcctaca
catgctccaagcgtggaggaaggaggccagggcagaagaatttttcttagtgcagaacaa
aatgaaaagtctcccatgtctacttctttctacacagacacggcaaccatccgatttctc
aatcttttccccacctttcccgcctttctattccacaaagccgccattgtcatcctggcc
cgttctcaatga

>gi568815578r:32096455_32302178|GENSCAN_predicted_peptide_4|355_aa
MADKRAGTPEAAARPPPGLAREGDARTVPAARAREAGGRGSLHPAAGPGTAFPSPGRGEA
ASTATTPSLENGRVRDEAPETCGAEGLGTRAGASEKAEDANKEEGAIFKKEPAEEVEKQQ
EGEEKQEVAAEAQEGPRLLNLGALIVDPLEAIQWEAEAVSAQADRAYLPLERRFGRMHRL
YLARRSFIIQNIPGFWVTAFLNHPQLSAMISPRDEDMLCYLMNLEVRELRHSRTGCKFKF
RFWSNPYFQNKVIVKEYECRASGRVVSIATRIRWHWGQEPPALVHRNRDTVRSFFSWFSQ
HSLPEADRVAQIIKDDLWPNPLQYYLLGDRPCRARGGLARWPTETPSRPYGFQSG

>gi568815578r:32096455_32302178|GENSCAN_predicted_CDS_4|1068_bp
atggcggacaagagggcggggaccccagaagccgcggcgcgcccgccgcccggccttgcc
cgggagggggacgcgcgcacggtccccgcggcccgggcccgagaagctggggggcgcggg
tccctccaccccgcagcgggccccgggaccgccttcccttcccctgggcgcggggaagcg
gcctccacggcgactactccgagcctggaaaatggccgggtccgggacgaagccccagaa
acctgtggtgcagaggggctagggactcgggcaggagccagcgagaaggccgaggacgcg
aacaaggaggagggcgccatcttcaagaaggagccagcggaggaggtggagaagcagcag
gagggggaggagaagcaggaggtggcagcggaggcccaggagggcccgcggctcctgaac
cttggtgccctaattgtggacccactggaggccatccagtgggaggcggaggccgtgagc
gcccaggccgacagggcctacctcccgctcgagcgcaggtttgggcggatgcacaggttg
tacctcgcccgtaggagcttcatcatccagaatattccgggcttctgggtcaccgccttc
ctgaaccacccgcagctctcagccatgatcagccctcgagatgaagacatgctctgctac
ctgatgaatttggaggtgagggagctcaggcactctaggacaggttgcaaattcaagttc
cgcttttggagcaacccctacttccagaacaaggtgatagtgaaggagtatgaatgcaga
gcctcaggccgagtggtgtctattgcgactcgcatccgatggcactggggccaggaaccc
ccggccctcgtacacaggaaccgggacactgtccgaagcttcttcagctggttttcacag
cacagcctcccagaggccgacagggttgcccagattattaaagatgacctgtggcccaac
cccctgcagtactacctgctgggggataggccctgcagagccaggggaggcctcgcaagg
tggcccacggagaccccttctaggccctacgggttccagtctggctaa

>gi568815578r:32096455_32302178|GENSCAN_predicted_peptide_5|507_aa
MHTRAHSLLNSKVHSNCWYYKQHVIHAKCSHLNADLSRQIYTRDGDVFIILHRSVSSTRG
MVPEQDVAHVNDEQSSLGHEPADPIILFTASLLDEWSLHMATHSAQKPHQCMYCDKMFHR
KDHLRNHLQTHDPNKEALHCSECGKNYNTKLGYRRHLAMHAASSGDLSCKVCLQTFESTQ
ALLEHLKAHSRRVAGGAKEKKHPCDHCDRRFYTRKDVRRHLVVHTGRKDFLCQYCAQRFG
RKDHLTRHVKKSHSQELLKIKTEPVDMLGLLSCSSTVSVKEELSPVLCMASRDVMGTKAF
PGMLPMGMYGAHIPTMPSTGVPHSLVHNTLPMGMSYPLESSPISSPAQLPPKYQLGSTSY
LPDKLPKVEVDSFLAELPGSLSLSSAEPQPASPQPAAAAALLDEALLAKSPANLSEALCA
ANVDFSHLLGFLPLNLPPCNPPGATGGLVMGYSQAEAQPLLTTLQAQPQDSPGAGGPLNF
GPLHSLPPVFTSGLSSTTLPRFHQAFQ

>gi568815578r:32096455_32302178|GENSCAN_predicted_CDS_5|1524_bp
atgcacactcgagcacactctctgctcaattccaaggtgcatagtaattgctggtattac
aaacagcatgtgattcatgcaaaatgcagccaccttaatgcagatctttctagacaaatc
tacactcgtgatggggacgtgtttatcatattacatcgcagtgtttcttcaacaagagga
atggttcctgaacaggacgtagctcatgttaatgatgagcaaagctctctggggcatgaa
cccgctgatcccatcatcttgttcacggccagtctgttggatgaatggtcactgcacatg
gccacccactcagcccagaaaccccaccagtgtatgtactgtgataagatgtttcaccgc
aaggaccatctgcggaaccatctgcagacccatgatcctaacaaagaggccctccactgc
tctgagtgcggtaagaattacaatacgaagctgggctaccggcgccacctggccatgcat
gctgccagcagcggtgacctcagctgcaaggtgtgcctgcagacctttgagagtacccag
gccctgctagagcacctgaaggcccactcacgccgggtagcaggcggtgccaaggagaag
aagcacccctgtgaccactgcgaccggcggttctatactcgtaaggatgtacggcggcac
ctagtggtgcacacaggccgtaaggacttcctgtgccagtactgtgcccagcggtttggc
cgtaaggaccacctgacgcgtcatgtcaagaagagccactcgcaggagctgctcaagatc
aagacagagcccgtggacatgttaggcctactcagctgcagctccacagtcagtgtgaag
gaagagctgagccctgtgctgtgcatggcctctcgggacgtaatggggaccaaggccttc
cctggcatgttgcccatgggcatgtatggtgcccacatccctaccatgcccagcacgggc
gtgccacactccctggtgcacaacacgctgcccatgggtatgagctaccctctggaatcc
tcacctatctcttccccagctcagctccctccaaaataccagcttggatctacctcatac
ttgcccgacaaattgcccaaagtggaggtggatagttttctggcggagcttcctggaagc
ctgtctctctcatccgctgaaccccagcccgcctcacctcagccggcggcagctgcggcc
ctcctagatgaagcactgcttgccaagagccccgccaacctctctgaggccctctgcgct
gctaatgtggacttctcccacctactgggctttcttccactcaacctgcccccgtgtaac
ccacctggggccacaggaggcctggtcatgggctactcccaggctgaggcacagcccctg
cttaccactttgcaagctcagcctcaagattccccaggagctgggggaccactgaacttt
gggcctctgcactccttgcctcctgtcttcacgtctggcctgagtagcaccaccctgcct
cgtttccatcaagcattccagtag

>gi568815578r:32096455_32302178|GENSCAN_predicted_peptide_6|388_aa
MGAAAWARPLSVSFLLLLLPLPGMPAGSWDPAGYLLYCPCMGRFGNQADHFLGSLAFAKL
LNRTLAVPPWIEYQHHKPPFTNLHVSYQKYFKLEPLQAYHRVISLEDFMEKLAPTHWPPE
KRVAYCFEVAAQRSPDKKTCPMKEGNPFGPFWDQFHVSFNKSELFTGISFSASYREQWSQ
RFSPKEHPVLALPGAPAQFPVLEEHRPLQKYMVWSDEMVKTGEAQIHAHLVRPYVGIHLR
IGSDWKNACAMLKDGTAGSHFMASPQCVGYSRSTAAPLTMTMCLPDLKEIQRAVKLWVRS
LDAQSVYVATDSESYVPELQQLFKGKVKVVSLKPEVAQVDLYILGQADHFIGNCVSSFTA
FVKRERDLQGRPSSFFGMDRPPKLRDEF

>gi568815578r:32096455_32302178|GENSCAN_predicted_CDS_6|1167_bp
atgggcgccgccgcgtgggcacggccgctgagcgtgtctttcctgctgctgcttctgccg
ctcccggggatgcctgcgggctcctgggacccggccggttacctgctctactgcccctgc
atggggcgctttgggaaccaggccgatcacttcttgggctctctggcatttgcaaagctg
ctaaaccgtaccttggctgtccctccttggattgagtaccagcatcacaagcctcctttc
accaacctccatgtgtcctaccagaagtacttcaagctggagcccctccaggcttaccat
cgggtcatcagcttggaggatttcatggagaagctggcacccacccactggccccctgag
aagcgggtggcatactgctttgaggtggcagcccagcgaagcccagataagaagacgtgc
cccatgaaggaaggaaacccctttggcccattctgggatcagtttcatgtgagtttcaac
aagtcggagctttttacaggcatttccttcagtgcttcctacagagaacaatggagccag
agattttctccaaaggaacatccggtgcttgccctgccaggagccccagcccagttcccc
gtcctagaggaacacaggccactacagaagtacatggtatggtcagacgaaatggtgaag
acgggagaggcccagattcatgcccaccttgtccggccctatgtgggcattcatctgcgc
attggctctgactggaagaacgcctgtgccatgctgaaggacgggactgcaggctcgcac
ttcatggcctctccgcagtgtgtgggctacagccgcagcacagcggcccccctcacgatg
actatgtgcctgcctgacctgaaggagatccagagggctgtgaagctctgggtgaggtcg
ctggatgcccagtcggtctacgttgctactgattccgagagttatgtgcctgagctccaa
cagctcttcaaagggaaggtgaaggtggtgagcctgaagcctgaggtggcccaggtcgac
ctgtacatcctcggccaagccgaccactttattggcaactgtgtctcctccttcactgcc
tttgtgaagcgggagcgggacctccaggggaggccgtcttctttcttcggcatggacagg
ccccctaagctgcgggacgagttctga

>gi568815578r:32096455_32302178|GENSCAN_predicted_peptide_7|123_aa
MAGKLGLTIGRRPLFLSMGLLEGLCNIVAVYPQRYLEPNPLAPSNYHSIFYLHEFDYSKY
FIQVGDGHVIKDQPIKAWVIEVETVGNPYGEGSGEWLSQGFAAPAAAAAAAAAAAARFRL
GPQ

>gi568815578r:32096455_32302178|GENSCAN_predicted_CDS_7|369_bp
atggctggcaagttggggctgactattggccggaggcctctgttcctctccatggggttg
ctggaaggtctttgcaacatcgtggctgtctacccccagagatacctggagccaaaccct
ttagcacctagcaactaccactctattttctatctccacgaatttgactactctaagtac
ttcatacaggtaggagatgggcatgtgatcaaggatcaaccaattaaggcttgggttata
gaggtcgagactgtggggaatccttatggagagggaagcggggaatggctgagccagggg
ttcgccgcccccgccgccgccgccgccgccgccgccgccgccgccgcccgctttcggctc
gggcctcag