GENSCAN 1.0 Date run: 6-Nov-116 Time: 21:26:36 Sequence gi568815584r:36417281_36619357 : 202077 bp : 41.35% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 4747 4786 40 -1.95 1.01 Init + 13719 13791 73 2 1 80 60 73 0.384 4.98 1.02 Intr + 31486 31557 72 2 0 101 91 28 0.002 2.86 1.03 Intr + 43129 43212 84 2 0 66 69 70 0.045 1.77 1.04 Intr + 44928 45038 111 1 0 78 98 21 0.126 1.53 1.05 Intr + 59627 59787 161 0 2 16 94 186 0.656 10.79 1.06 Intr + 60311 60547 237 1 0 73 72 130 0.660 6.79 1.07 Intr + 82820 82930 111 1 0 76 58 46 0.087 0.06 1.08 Intr + 85457 85488 32 1 2 90 61 56 0.320 -0.89 1.09 Intr + 86736 86913 178 0 1 62 14 120 0.368 1.00 1.10 Term + 87012 87320 309 2 0 53 36 242 0.837 9.58 1.11 PlyA + 87975 87980 6 1.05 2.08 PlyA - 89578 89573 6 1.05 2.07 Term - 97201 96747 455 2 2 16 54 286 0.924 12.33 2.06 Intr - 98532 98461 72 1 0 61 44 111 0.004 2.46 2.05 Intr - 100740 100012 729 1 0 102 16 681 0.004 52.77 2.04 Intr - 102090 101705 386 2 2 126 100 433 0.963 41.87 2.03 Intr - 102332 102188 145 0 1 45 87 22 0.426 -3.78 2.02 Intr - 103398 103277 122 2 2 28 72 124 0.485 4.02 2.01 Init - 103980 103607 374 0 2 44 -42 306 0.996 9.68 2.00 Prom - 104227 104188 40 -9.95 3.02 PlyA - 104254 104249 6 1.05 3.01 Sngl - 108270 107719 552 0 0 69 40 338 0.828 22.96 3.00 Prom - 117654 117615 40 -3.35 4.00 Prom + 121752 121791 40 -7.15 4.01 Init + 127159 127228 70 0 1 85 22 89 0.196 3.26 4.02 Term + 132065 132273 209 0 2 92 48 118 0.365 4.72 4.03 PlyA + 132377 132382 6 1.05 5.09 PlyA - 132892 132887 6 1.05 5.08 Term - 140248 140141 108 1 0 54 48 96 0.302 -0.27 5.07 Intr - 143808 143665 144 0 0 52 86 104 0.267 6.16 5.06 Intr - 159487 159408 80 2 2 63 119 37 0.068 2.65 5.05 Intr - 160584 160487 98 2 2 2 21 169 0.713 0.33 5.04 Intr - 162491 162391 101 2 2 45 80 59 0.723 -1.21 5.03 Intr - 162978 162889 90 0 0 61 80 163 0.730 12.07 5.02 Intr - 164184 163778 407 1 2 120 59 450 0.902 38.44 5.01 Init - 164590 164290 301 1 1 61 -1 263 0.470 12.48 5.00 Prom - 164689 164650 40 -6.35 6.03 PlyA - 165656 165651 6 1.05 6.02 Term - 167066 166891 176 0 2 7 47 237 0.837 8.44 6.01 Init - 168200 168152 49 2 1 28 115 86 0.786 6.46 6.00 Prom - 168877 168838 40 -5.35 7.06 PlyA - 169341 169336 6 1.05 7.05 Term - 174705 174550 156 0 0 102 48 98 0.600 4.15 7.04 Intr - 177800 177651 150 2 0 79 16 114 0.475 2.74 7.03 Intr - 180375 180283 93 0 0 133 45 49 0.575 4.34 7.02 Intr - 195612 195549 64 2 1 94 84 46 0.226 2.60 7.01 Intr - 200552 200506 47 2 2 119 60 48 0.150 1.39 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 159487 159343 145 2 1 63 44 130 0.841 4.52 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584r:36417281_36619357|GENSCAN_predicted_peptide_1|455_aa MVSTGSVDVVQGSERKRSLVEKRLEFLNTIWTVYIPEAHQSQSNKPWAAFLLKYAPKTCS DIFPEIPPQRSDQKAKRYVKELLDDQNDLKYLEKSYQLHAMAHTCNPSTSGGQGRESAQE QSQRPYFNEYSHDNTLASDELLLCDSVVGCSRAVWTVPGLGTQTATQIKGRIRRGPEARL GDRRCYGKQESRGKWGRKLESQASGGHSTFLRAVIGYYALEERRKYVTETFRGNEQIAHL FFPYVQALPMMQMSSKGACSAEARDALDTKTHFPLAMIPSYLLLNGNIGELPEGPAGGCA QNPGLWASGARRGDEEVKPRALRRFDGRRASAGSEEDPAKKSARPHPWSRPCASPYRGAG GQRLRRACSAAGRKTPDRRPLPEPTRGPVSQRAVLGKLRAAAGHRIPIGDLGDGLSLGAP PAGRHLHTVLPLRREFSPCEDALGIGEQQGKKRVT >gi568815584r:36417281_36619357|GENSCAN_predicted_CDS_1|1368_bp atggtgtccacaggcagtgtggatgttgtccagggaagtgagcgaaagaggtccttagtg gaaaaaagattagaattcttgaacactatttggacagtctatataccggaagcccatcag tcacagagcaacaagccatgggcagccttccttctgaaatatgcaccaaaaacttgttct gatatctttccagaaatcccaccccagagaagtgaccagaaagccaaaagatatgttaaa gagctattagatgaccagaatgacctgaaatatcttgaaaagagttatcagctgcatgcg atggctcacacctgtaatcccagcacttcaggaggccaaggaagagaatctgcccaagaa caaagtcagcgtccctactttaacgaatattctcatgacaacaccttggcctcggatgaa ctcctgctttgtgacagcgtcgtgggatgcagcagggctgtatggactgttcctggactt ggcacgcaaacagcgacacagattaagggcaggattcgtcggggacctgaggcaaggcta ggggacagaaggtgttatggcaaacaggagtcaagagggaaatgggggaggaagttggag agccaagcaagtggtggtcacagtacctttctgagagcagtcattggttattatgctctg gaggagaggaggaaatacgtaacagagaccttcagagggaacgagcagatcgcacatttg ttctttccatatgtccaggcactgccaatgatgcaaatgtcttctaaaggagcctgctca gctgaagccagggatgcccttgatacaaaaacacactttccacttgcaatgataccctca tatttactgctaaatgggaatattggagagctgccagagggcccggcgggaggctgcgcc cagaatcctgggctttgggcctctggggcccggcgaggagatgaagaagtcaagcctcga gctctccggaggttcgatggccgccgggccagtgcgggctcagaggaagaccctgcaaaa aagagcgctcgcccccacccctggagccgaccctgcgcatcgccgtatcgcggggctggc ggccagcgccttagaagagcctgctccgccgcaggaagaaagacgcccgacaggcgccca ctgcccgaacccacgcgggggccagtcagccagcgggccgtgttgggaaagctccgcgcg gcggcgggacataggatccccatcggggacctgggcgacggcctgagcttgggcgcccct ccagctgggcgtcatctccacacagttcttcccttgcgccgcgaattcagcccctgtgaa gacgctttgggcattggcgagcaacaggggaaaaaacgagtcacttaa >gi568815584r:36417281_36619357|GENSCAN_predicted_peptide_2|760_aa MIKTFKIQGQKPESVSKSAPFSAPLGSRLGCRPRPGRSNAGDTPDSAGERSPQLPLARRP RTLKARGPHLKPGNAQTGNPLPPKQFLARRMRGAQLKQNDPHLLISSVATKRLAIYATTL NKDICIRGENSVALGSGAGAVMSSESQLQPQIPACLGDGRGSGTPALTPRLWWLPKTWRR AKTNARQPPSLHSSQLRRTRSTPLRVHPTRSALSRRRIMSMSPKHTTPFSVSDILSPLEE SYKKVGMEGGGLGAPLAAYRQGQAAPPTAAMQQHAVGHHGAVTAAYHMTAAGVPQLSHSA VGGYCNGNLGNMSELPPYQDTMRNSASGPGWYGANPDPRFPAISRFMGPASGMNMSGMGG LGSLGDVSKNMAPLPSAPRRKRRVLFSQAQVYELERRFKQQKYLSAPEREHLASMIHLTP TQVKIWFQNHRYKMKRQAKDKAAQQQLQQDSGGGGGGGGTGCPQQQQAQQQSPRRVAVPV LVKDGKPCQAGAPAPGAASLQGHAQQQAQHQAQAAQAAAAAISVGSGGAGLGAHPGHQPG SAGQSPDLAHHAASPAALQGQVSSLSHLNSSGSDYGTMSCSTLLYGTPLGDQQQTPSAPA AFGSFGLQRGELNRVCGPRQTGPQPLSCHQLRWLHPLPLPAGGIVAAAGRDLERNLKRFI REIGARWEQCKRVPQPGDTLRPYSTVGFWKADSPPFWESPASTRAQFLASPLGEISILRM PLALRCLEASEVWEQVRFTPRSPRSFASAKVPYPSLGPNG >gi568815584r:36417281_36619357|GENSCAN_predicted_CDS_2|2283_bp atgataaaaacgtttaaaatccaaggacaaaaaccggagtcggtctcaaaatccgcgccg ttcagtgctccactcggtagtcgactcggctgcaggccccggcctggccggagcaacgcg ggggacacccccgattccgctggggagcgcagcccacagctccccctcgccaggcgccca aggaccctcaaggcgcggggcccacacttgaagcctgggaacgcgcagacaggaaaccct cttcctcctaagcagtttcttgctcgtcggatgagaggcgcccaattgaagcagaatgat cctcatctactaatatccagcgtggccacaaagcgactggccatttacgccaccacttta aacaaagatatttgcatcagaggggaaaacagcgtggctctgggctcgggtgctggggct gtgatgtcctcggaaagtcagctccagccccagatccccgcgtgtctgggagatgggcga gggtcagggacaccagcgcttacgccccgcctctggtggctgcctaaaacctggcgccgg gctaaaacaaacgcgaggcagcccccgagcctccactcaagccaattaaggaggactcgg tccactccgttacgtgtacatccaacaagatcggcgttaagccgccgccgaatcatgtcg atgagtccaaagcacacgactccgttctcagtgtctgacatcttgagtcccctggaggaa agctacaagaaagtgggcatggagggcggcggcctcggggctccgctggcggcgtacagg cagggccaggcggcaccgccaacagcggccatgcagcagcacgccgtggggcaccacggc gccgtcaccgccgcctaccacatgacggcggcgggggtgccccagctctcgcactccgcc gtggggggctactgcaacggcaacctgggcaacatgagcgagctgccgccgtaccaggac accatgaggaacagcgcctctggccccggatggtacggcgccaacccagacccgcgcttc cccgccatctcccgcttcatgggcccggcgagcggcatgaacatgagcggcatgggcggc ctgggctcgctgggggacgtgagcaagaacatggccccgctgccaagcgcgccgcgcagg aagcgccgggtgctcttctcgcaggcgcaggtgtacgagctggagcgacgcttcaagcaa cagaagtacctgtcggcgccggagcgcgagcacctggccagcatgatccacctgacgccc acgcaggtcaagatctggttccagaaccaccgctacaaaatgaagcgccaggccaaggac aaggcggcgcagcagcaactgcagcaggacagcggcggcggcgggggcggcgggggcacc gggtgcccgcagcagcaacaggctcagcagcagtcgccgcgacgcgtggcggtgccggtc ctggtgaaagacggcaaaccgtgccaggcgggtgcccccgcgccgggcgccgccagccta caaggccacgcgcagcagcaggcgcagcaccaggcgcaggccgcgcaggcggcggcagcg gccatctccgtgggcagcggtggcgccggccttggcgcacacccgggccaccagccaggc agcgcaggccagtctccggacctggcgcaccacgccgccagccccgcggcgctgcagggc caggtatccagcctgtcccacctgaactcctcgggctcggactacggcaccatgtcctgc tccaccttgctatacgggacccctctaggggaccagcagcagacgccttcagcccctgct gcttttggctcctttggcctccaacggggtgaactcaatcgggtttgcggaccgagacaa acaggacctcagcctctttcttgccatcaattacgctggctccatccgctcccgctcccg gctgggggaattgtcgcagctgcgggaagagacctagaacgcaatctgaagcgtttcatt agggaaataggtgcgcgctgggagcagtgcaagagggttccgcaacctggggacaccctc cgcccctacagcaccgtcggcttctggaaagccgacagtcctcctttctgggaatctcct gcatcaactcgtgcccaatttctggcctcccctctcggagaaatcagcattctgaggatg cccctggcgctgaggtgcttggaagcttccgaggtttgggagcaagtacggttcaccccg agatcaccgcgaagctttgcctccgccaaagtaccctacccatccctaggccccaacggc tga >gi568815584r:36417281_36619357|GENSCAN_predicted_peptide_3|183_aa MAWGRYCGHRKLPGTSVGDVGETGCGGRLERRKGAEEGTPALLASPRETRTPKAHGLPLL GLRFVGEQRAERAIGFDYLKLNEAAQVLGRKQPRRHKDASGICPLIIRSSCETYLSAART LERPRGCPCPFRRGRLADSRRSTAARASQGRLTASGPCSYLPSPSWFPRNSKGAAELPLF SAL >gi568815584r:36417281_36619357|GENSCAN_predicted_CDS_3|552_bp atggcctggggtcgatactgcggccaccggaaactgcctgggacaagtgttggggatgtg ggggagacgggctgtggtgggaggttggagaggaggaaaggagcggaggagggaacccca gcgctgctggcaagcccgcgggaaactcgaacacccaaggcccacgggctcccactgttg ggactgagatttgttggggaacagagggcagaaagagctataggtttcgactacctaaag ctgaatgaggctgcacaggtcctggggcgaaaacagccgcgccgccacaaggatgcttcg ggaatctgtcctttaattatcaggagcagctgcgaaacttaccttagcgcagcacgaacc ctagagcgcccccgagggtgtccctgccccttccgacgggggcgtctggcggattcgcgg cgtagcacggcagcgcgggcctcacagggtcggctgaccgcttcaggcccgtgctcctac ctcccgtctccgtcttggtttcctcgcaattcaaagggagctgctgagctcccgctgttc tctgctttgtaa >gi568815584r:36417281_36619357|GENSCAN_predicted_peptide_4|92_aa MGASLDQEHSGHPAGSGGIEVSGGTQCVLSMKAAIFQDRPFTWFGAGSPWEGEIGFPAPT RRPAQLVDGQLMLSRGVLPTPTPHLWGSALAQ >gi568815584r:36417281_36619357|GENSCAN_predicted_CDS_4|279_bp atgggtgcctcgctggatcaggagcacagtggacaccctgccggatccggagggatagaa gtcagcggcggtacccagtgtgtcctcagcatgaaagctgcaatttttcaggatcgtcca ttcacttggtttggggctgggagcccatgggaaggggagattggctttcctgcccccact cggagaccagcgcagctggtagatggtcagctaatgctgagtcgtggggtgctccccacc ccaactccacatctgtggggatctgcacttgcccagtga >gi568815584r:36417281_36619357|GENSCAN_predicted_peptide_5|442_aa MASVGEFTPTPFPLRAPQTSAHCSAVDLEEAGAGDPACDKLVGQRNYRCGHVKRAPGWVF SGHDLRPADDRCGMNEVTAFGEVDVQMWTQEMASSRKSRATSDESSLETSPPDSSQRPSA RPASPGSDAEKRKKRRVLFSKAQTLELERRFRQQRYLSAPEREQLASLLRLTPTQVKIWF QNHRYKLKRARAPGAAESPDLAASAELHAAPGLLRRVVVPVLVRDGQPCGGGGGGESCWR FEYRKVNPVDHDRAEGVSPEMSRRGQGLLCKQSLSHRSDDLAPRHPLLGDAPEESSWGIR AQQEEPKEDAGQQCQGCECMCRGYQGPRKGQGDTTTPTTQVGKLKAKKVRRDLSRVIRLF SGGGSLQYHTGTHRFRNVGQRRGRKRKGKARAEALIVVVMGKDGRERKVGSIKQSNNQLF EKLRESQLCFAGGKIEGALILE >gi568815584r:36417281_36619357|GENSCAN_predicted_CDS_5|1329_bp atggccagtgttggcgagtttacccccaccccgttcccactcagggctccgcaaacctcg gcacactgcagtgcagtcgacctggaggaggccggcgctggtgacccagcttgtgataaa ctggtcggccagcgcaactaccggtgcggacatgttaagagggccccggggtgggttttt tcggggcacgatctcaggcccgctgatgaccgctgtggtatgaatgaggtcaccgcattc ggagaggtggatgtccagatgtggactcaggaaatggcctcatcccgaaagtctcgggct acctcggacgagagcagcctggagaccagcccgccagactcgtcgcagcggccgtccgct aggcccgcgtctccgggctcggacgccgagaaaaggaagaagcggcgggtgctattctcc aaggcgcagacgctggagttggagcggcgcttccggcagcagcggtacctgtctgcgccc gagcgcgagcagctggcgagcctgcttcgcctcacgcccacgcaggtcaagatctggttc cagaatcatcgctacaagctgaagcgcgctcgcgctccaggggcggcggagtcgcctgac ctggcagcatccgccgagctgcacgccgcgcccggcctgctgcgtcgcgtggtggtgccg gtgcttgttcgcgacgggcagccgtgcggcggcggcggcggtggcgagagctgctggcgt ttcgaataccgaaaagtcaaccctgtggaccacgacagggcagaaggagtttctccggag atgagccggcgaggccagggcctcttgtgcaagcagtcactttctcacagaagcgacgac ctcgctcccaggcacccgctgctgggagatgcgccagaggagagcagttggggcatcaga gcacaacaagaagagcctaaagaagatgcaggccagcaatgccaaggctgtgagtgtatg tgccgaggctatcaaggccctcgtaaaggccaaggagatactactactcccactacacaa gtgggaaaattgaaggccaaaaaggttagaagagacttgtccagggtcatacggctgttc tcaggaggagggagcttgcagtaccacacagggacacatagatttcgaaatgtgggtcag aggagaggcagaaagagaaaagggaaagcacgggcagaagccttgattgtggttgtcatg ggaaaggatgggagagaaaggaaagttggaagcataaaacagtccaataaccagttgttt gaaaagctaagggaaagccagttgtgttttgctggtggcaaaattgaaggtgcactgatt ctggaataa >gi568815584r:36417281_36619357|GENSCAN_predicted_peptide_6|74_aa MAEALMAEEEGIQDEADITNKSIARAAERRPAESIVLPIPLNPIENHDNTGQAPIYPGAL KRIVSRRQSCIIPD >gi568815584r:36417281_36619357|GENSCAN_predicted_CDS_6|225_bp atggcagaggcacttatggcagaggaggagggaatccaagatgaagcagacataactaat aaatctattgccagggccgcggaaagacggccagcagagtcgatcgtgctcccaatccca ttgaaccccattgaaaaccatgacaacaccggacaagctccgatttatcccggcgcgtta aagcgcattgtgagcagacgtcaatcatgcattattccggactga >gi568815584r:36417281_36619357|GENSCAN_predicted_peptide_7|169_aa PRWSHEQTVAAEWGKRTKALTGSQGHILELPSLQVLELQLSNRETATAQLPTSIAAPVPS FGFISGNLVSKFLTQSCGQCFLGTRSHSVEVQKHGKISRAEAAQKTFLDRSGSTRAQEVS AFVSNASDSVLMWCFTRANERELLLQDALAELRGFKCYFRNSHARKNES >gi568815584r:36417281_36619357|GENSCAN_predicted_CDS_7|510_bp cccaggtggagccatgaacaaacagtggctgctgagtgggggaagaggacaaaagcactc accggaagccagggccatatccttgaacttccaagcctgcaagtccttgagcttcaacta agcaacagggagacagccacagcccagctgcccacctccatagcagcccccgtcccttca tttgggttcatctctggcaacttggtttccaagttcttaacccagtcatgtgggcaatgc ttcctaggaaccagaagtcactcagtagaggtgcagaaacatggcaaaatcagcagagct gaagcggcacagaaaacattcctggacaggtcggggagtacgagagcccaggaagtaagt gcatttgtttccaatgcgtctgattctgttcttatgtggtgtttcacacgtgctaacgag agagaacttttactacaagatgctctggctgaattgagaggattcaaatgctacttcagg aactcacatgctagaaagaatgaatcctga