GENSCAN 1.0 Date run: 3-Nov-116 Time: 15:59:46 Sequence gi568815587r:27267870_27572302 : 304433 bp : 39.08% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 PlyA - 358 353 6 1.05 1.01 Sngl - 2375 1542 834 2 0 88 43 241 0.990 15.15 1.00 Prom - 5226 5187 40 -5.55 2.03 PlyA - 6590 6585 6 1.05 2.02 Term - 11437 11230 208 0 1 74 35 103 0.047 -0.57 2.01 Init - 29658 29594 65 0 2 95 78 60 0.338 6.37 2.00 Prom - 47763 47724 40 -3.95 3.00 Prom + 47870 47909 40 -4.95 3.01 Sngl + 54307 54609 303 0 0 88 54 177 0.677 9.88 3.02 PlyA + 54696 54701 6 1.05 4.00 Prom + 56038 56077 40 -6.15 4.01 Sngl + 57294 58238 945 2 0 33 42 478 0.547 33.99 4.02 PlyA + 58481 58486 6 1.05 5.05 PlyA - 59283 59278 6 1.05 5.04 Term - 64376 64301 76 1 1 77 48 97 0.356 0.93 5.03 Intr - 68526 68394 133 1 1 -20 95 111 0.307 -0.22 5.02 Intr - 72968 72827 142 2 1 49 77 184 0.999 12.41 5.01 Init - 73645 73523 123 1 0 67 63 164 0.805 12.02 5.00 Prom - 74435 74396 40 -4.45 6.05 PlyA - 74795 74790 6 1.05 6.04 Term - 79054 78956 99 1 0 140 37 52 0.909 2.45 6.03 Intr - 82570 82463 108 1 0 53 80 114 0.967 6.66 6.02 Intr - 89672 89534 139 1 1 22 83 193 0.980 11.75 6.01 Init - 95217 94967 251 0 2 54 97 439 0.787 36.38 6.00 Prom - 97166 97127 40 -6.95 7.07 PlyA - 98430 98425 6 1.05 7.06 Term - 101042 99998 1045 1 1 76 43 293 0.224 14.21 7.05 Intr - 105807 105682 126 2 0 41 89 130 0.613 7.27 7.04 Intr - 113097 113026 72 2 0 96 92 34 0.633 2.20 7.03 Intr - 117599 117384 216 1 0 68 84 225 0.323 16.80 7.02 Intr - 124649 124578 72 1 0 84 84 44 0.283 1.20 7.01 Init - 138314 138208 107 2 2 91 4 123 0.072 3.94 7.00 Prom - 142760 142721 40 -4.55 8.00 Prom + 142841 142880 40 -6.25 8.01 Init + 152571 152655 85 2 1 47 87 88 0.435 5.63 8.02 Intr + 159422 159514 93 0 0 41 75 85 0.270 1.52 8.03 Intr + 172923 173158 236 1 2 41 91 182 0.729 10.28 8.04 Intr + 175401 175488 88 2 1 99 50 37 0.217 -0.38 8.05 Term + 191377 191600 224 2 2 90 49 66 0.030 -1.00 8.06 PlyA + 193525 193530 6 1.05 9.04 PlyA - 193769 193764 6 -0.45 9.03 Term - 194997 194930 68 1 2 83 41 105 0.359 2.42 9.02 Intr - 196208 196123 86 1 2 56 47 66 0.224 -2.16 9.01 Init - 204433 204249 185 1 2 91 68 415 0.981 36.44 9.00 Prom - 210384 210345 40 -5.05 10.08 PlyA - 210399 210394 6 1.05 10.07 Term - 230935 230780 156 1 0 76 37 201 0.997 10.75 10.06 Intr - 231699 231490 210 0 0 74 79 162 0.999 12.09 10.05 Intr - 233697 233626 72 0 0 66 94 77 0.957 4.78 10.04 Intr - 234051 233933 119 1 2 57 105 90 0.845 6.86 10.03 Intr - 238586 238359 228 0 0 68 20 121 0.271 0.22 10.02 Intr - 238896 238847 50 2 2 78 105 106 0.332 8.71 10.01 Init - 254507 254416 92 2 2 66 13 148 0.234 5.11 10.00 Prom - 255976 255937 40 -4.65 11.04 PlyA - 259176 259171 6 1.05 11.03 Term - 278325 278218 108 0 0 57 44 106 0.388 0.63 11.02 Intr - 281027 280867 161 0 2 56 78 71 0.161 1.69 11.01 Init - 284889 284670 220 0 1 71 103 126 0.243 11.24 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 205029 205230 202 2 1 43 70 117 0.849 4.60 S.002 Intr + 205299 205327 29 1 2 140 119 43 0.978 9.52 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587r:27267870_27572302|GENSCAN_predicted_peptide_1|277_aa MIKSASCRDARLAQHMQINQCNPSHKQKQRQKPHDYLNRCRKASNKIQHRFMLKTLNKLG IDGMYLKIIRAIYDKPTANIILNGQNLEAFPLKTSTRQGCLLSPLLFNIVLEVLARAIRQ EKEIKRIQIGREEVKLSLFADDMIVYLENPISAQNLLKLISNFSKVSGYKISVQKSQAFL YTNNRQTESQIMSEIPFRIGTKRIKYLGIQLTRDVKGFFKENYKPLLKEIREDTNKWKNI PCSWIGRIYIVKMTILPKVIYIFNAIPIKLQMTFLQN >gi568815587r:27267870_27572302|GENSCAN_predicted_CDS_1|834_bp atgatcaagtcggcttcatgccgggatgcaaggctggctcaacatatgcaaatcaatcaa tgtaatccatcacataaacagaaacaacgacaaaaaccacatgattatctcaatagatgc agaaaggcctccaataaaattcaacaccgcttcatgctaaaaactctcaataaactaggt attgatggaatgtatctcaaaataataagagctatttatgacaaacccacagccaatatc atattgaatgggcaaaacctggaagcattccctctaaaaaccagcacaagacaaggatgc cttctctcaccactcctattcaacatagtattggaagttctggccagggcaatcaggcaa gagaaagaaataaagcgtattcaaataggaagagaggaagtcaaattgtctctgtttgca gatgacatgattgtatatttagaaaatcccatctcagcccaaaatctccttaagctgata agcaacttcagcaaagtctcaggatacaaaatcagtgtgcaaaaatcacaagcattccta tacaccaataacaggcaaacagagagccaaatcatgagtgaaatccctttcagaattggt acaaagagaataaaatacctaggaatacaacttacaagggatgtgaagggcttcttcaag gagaactacaaaccactgctcaaggaaataagagaagacacaaacaaatggaaaaacatt ccatgctcatggataggaagaatctatatcgtgaaaatgactatactgcccaaagtaatt tatatattcaatgctatccccatcaagctacaaatgactttcttacagaattag >gi568815587r:27267870_27572302|GENSCAN_predicted_peptide_2|90_aa MEDQWTPFNVVHKFPDIVWKHLCGPQVSIDITWELIRNAGPTPVPLNQTLHSNNIHRCLA HPLKLEMHHSTWSHNQDDTESGEARNPDEV >gi568815587r:27267870_27572302|GENSCAN_predicted_CDS_2|273_bp atggaggatcagtggacgcccttcaatgtggtccacaaatttcctgatattgtatggaaa catttgtgtggtccacaggtcagcatagatatcacctgggagcttatcagaaatgcaggt cccactccagtccctttgaatcagactctgcattccaacaacatccacaggtgtttggca cacccattaaagcttgagatgcaccactccacgtggagccataaccaagatgatacagaa agtggtgaagctagaaacccagatgaggtctaa >gi568815587r:27267870_27572302|GENSCAN_predicted_peptide_3|100_aa MGRNQSRKAKNSKNQSTSSPLKDRSSSPETEQSWMENDFDELTEVGFRRTVITNFCELKE NVQTHCKEGKNCEKRLDEWLTRINSVEKTLNDLMELKTMA >gi568815587r:27267870_27572302|GENSCAN_predicted_CDS_3|303_bp atggggagaaaccagagcagaaaagctaaaaattctaaaaaccagagcacctcttctcct ctaaaggatcgcagctcctcgccagaaacggaacaaagctggatggagaatgactttgat gagttgacagaagtaggcttcagaaggacagtaataacaaacttctgtgagctaaaggag aatgttcaaacccattgcaaagaaggtaaaaactgtgaaaaaaggttagatgaatggcta actagaataaacagtgtagagaagaccttaaatgacctgatggagctgaaaactatggca tga >gi568815587r:27267870_27572302|GENSCAN_predicted_peptide_4|314_aa MSLMNIDIKILSKILASRIQQHIKNLIHHDQVGFISGMQGLLNICKSINVIHHINRTNDK NHMIISIDAEKAFDKIQQPFMLKTLNKLDIDGMYLKIIRAIYDKPTANILNRQKREAFPS KTGTRQGRPLSPLLFNIVLEVLARAISQEKEIKGIQLGNEEVKLSLFADDMIVYLENPVI SAQNLRLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSKLPFTIASKRIKYLGIKLT RDVKGLFKENYKPLLNEIKEDTNKRKNIPCSCIGRINIVKKAILPKVIYRFNAISIKLQM TFFTELEKTTLKFT >gi568815587r:27267870_27572302|GENSCAN_predicted_CDS_4|945_bp atgtccctgatgaacatcgatataaaaatcctcagtaaaatactggcaagccgaatccag cagcacatcaaaaaccttatccaccacgatcaagttggcttcatctctgggatgcaaggc ttgctcaacatttgcaaatcaataaacgtaatccatcacataaacagaaccaatgacaaa aatcacatgattatctcaatagatgcagaaaaggcctttgacaaaattcaacagcccttc atgctaaaaactctcaataaactagatatcgatggaatgtatctcaaaataataagagct atttatgacaaacccacagccaatatactgaacaggcaaaaacgggaagcattcccttcg aaaactggcacaagacaaggacgccctctctcaccactcctattcaacatagtgttggaa gttctggccagggcaatcagtcaagagaaagaaataaagggtattcaattaggaaatgag gaagtcaaattgtccctgtttgcagatgacatgattgtatatttagaaaaccctgtcatc tcagcccaaaatctcaggctgataagcaacttcagcaaagtctcaggatacaaaatcaac gtgcaaaaatcacaagcattcttatacaccaataacagacaaacagagagccaaatcatg agtaaactcccattcacaattgcttcaaagagaataaaatacctaggaataaaacttaca agggatgtaaagggccttttcaaggagaactacaaaccactgctcaatgaaataaaagag gacacaaacaaacggaagaacattccatgctcatgcataggaagaatcaatatcgtgaaa aaggccatactgcccaaggtaatttatagattcaatgccatctccatcaagctacaaatg actttcttcacagaattggaaaaaactactttaaagttcacatag >gi568815587r:27267870_27572302|GENSCAN_predicted_peptide_5|157_aa MEEKAAKELEKEYLQEKAKEKYQEWLKKKNAEECERKKKEKEKEKQQQAEIQEKKEIAEK KFQEWLENAKHKPRPAAKSYGYANGKLTDYYGCSEEVGLLGAKNGSNDTHWDDFAVSGRE VMIAWIKMAAVTWLFIFSEKDLLIFRRDISLAASIPE >gi568815587r:27267870_27572302|GENSCAN_predicted_CDS_5|474_bp atggaggaaaaagcagcaaaggaactggagaaagaatacttgcaagaaaaagcaaaagaa aaatatcaagaatggttaaagaaaaaaaatgctgaagaatgtgagaggaagaagaaagaa aaggaaaaagaaaaacaacagcaagctgaaatacaggagaaaaaggaaatagcagaaaaa aagtttcaagaatggttggaaaatgcgaaacataaacctcgtccagctgcaaagagctat ggttatgccaatggaaaacttacagattactatggctgctctgaggaagttggcctgttg ggggcaaagaatggcagcaatgacacccattgggatgactttgcagtatctggcagagaa gtcatgatagcttggattaaaatggcagcagtgacatggctgttcattttctcagagaag gatcttctaatcttccggagagatataagcctagctgccagcattccggagtag >gi568815587r:27267870_27572302|GENSCAN_predicted_peptide_6|198_aa MTGARGQGLEVVRSPSPPLPLSCSNSTRSLLSPLGHQSFQFDEDDGDGEDEEDVDDEEDV DEDAHDSEAKVASLRGMELQGCASTQVESENNQEEQKQVRLPESRLTPWEVWFIGKEKEE RDRLQLKALEELNQQLEKRKEMEEREKRKIIAEEKHKEWVQKKNEQGFSLFTYIPGVSSS SCKDTSPIELGSYPYDLI >gi568815587r:27267870_27572302|GENSCAN_predicted_CDS_6|597_bp atgacgggcgcacgtgggcaggggctggaggtggtgcgctcgccgtcgccgccgctgccg ctgagctgcagcaattccaccaggtcgctgttgtctccccttggccaccagagcttccag tttgacgaggacgacggtgacggggaggatgaggaagacgtggatgatgaggaagacgtg gatgaagatgcccatgattcagaggccaaagtggcgagcctgagaggaatggagttacag gggtgcgccagcactcaggttgaatcagaaaataaccaagaagaacagaaacaggtgcgc ttaccagaaagccgcctgacaccatgggaggtgtggtttattggcaaagaaaaagaagaa cgtgaccggctgcaactgaaagctctagaggaattaaatcaacaactagaaaaaagaaaa gaaatggaagaacgtgaaaaaagaaagataattgctgaagaaaagcacaaggaatgggtt cagaaaaagaatgagcaaggcttttctctgttcacatatatccctggtgtctcttcctct tcttgtaaggacaccagtcctattgaattagggtcctacccttatgacctcatttaa >gi568815587r:27267870_27572302|GENSCAN_predicted_peptide_7|545_aa MAFLESLEGIGGVIQEDMYKDLQRSMSKDGDTDWDRQLAGNDLSFIHPKALSGLKELKVL RLDANHITSVPEDSFEGLVQLRHLWLDDNSLTEVPVHPLSNLPTLQALTLALNKISSIPD FAFTNLSSLVVLGFHSNSISVIPDGAFDGNPLLRTIDVSFNELTSFPTEGLNGLNQLKLV GNFKLKEALAAKDFVNLRFAEFGIWWETGSGCKVAGFLAVFSSESAIFLLMLATVERSLS AKDIMKNGKSNHLKQFRVAALLAFLGATVAGCFPLFHRGEYSASPLCLPFPTGETPSLGF TVTLVLLNSLAFLLMAVIYTKLYCNLEKEDLSENSQSSMIKHVAWLIFTNCIFFCPVAFF SFAPLITAISISPEIMKSVTLIFFPLPACLNPVLYVFFNPKFKEDWKLLKRRVTKKSGSV SVSISSQGGCLEQDFYYDCGMYSHLQGNLTVCDCCESFLLTKPVSCKHLIKSHSCPALAV ASCQRPEGYWSDCGTQSAHSDYADEEDSFVSDSSDQVQACGRACFYQSRGFPLVRYAYNL PRVKD >gi568815587r:27267870_27572302|GENSCAN_predicted_CDS_7|1638_bp atggcatttctggagagcctagaggggatagggggagttattcaagaggacatgtacaaa gacctgcagaggagcatgagcaaagatggggacacagactgggatagacaattggcgggc aacgacctttcttttatccacccaaaggccttgtctgggttgaaagaactcaaagttctg cgtttagatgccaaccatattacctcagtccccgaggacagttttgaaggacttgttcag ttacggcatctgtggctggatgacaacagcttgacggaggtgcctgtgcaccccctcagc aatctgcccaccctacaggcgctgaccctggctctcaacaagatctcaagcatccctgac tttgcatttaccaacctttcaagcctggtagttctaggatttcatagtaattctatttct gttatccctgatggagcatttgatggtaatccactcttaagaactatagatgtaagtttc aatgaattaacttcctttcctacggaaggcctgaatgggctaaatcaactgaaacttgtg ggcaacttcaagctgaaagaagccttagcagcaaaagactttgttaacctcagattcgct gaatttggcatttggtgggaaactggcagtggctgcaaagtagctgggtttcttgcagtt ttctcctcagaaagtgccatatttttattaatgctagcaactgtcgaaagaagcttatct gcaaaagatataatgaaaaatgggaagagcaatcatctcaaacagttccgggttgctgcc cttttggctttcctaggtgctacagtagcaggctgttttccccttttccatagaggggaa tattctgcatcacccctttgtttgccatttcctacaggtgaaacgccatcattaggattc actgtaacgttagtgctattaaactcactagcatttttattaatggccgttatctacact aaactatactgcaacttggaaaaagaggacctctcagaaaactcacaatctagcatgatt aagcatgtcgcttggctaatcttcaccaattgcatctttttctgccctgtggcgtttttt tcatttgcaccattgatcactgcaatctctatcagccccgaaataatgaagtctgttact ctgatattttttccattgcctgcttgcctgaatccagtcctgtatgttttcttcaaccca aagtttaaagaagactggaagttactgaagcgacgtgttaccaagaaaagtggatcagtt tcagtttccatcagtagccaaggtggttgtctggaacaggatttctactacgactgtggc atgtactcacatttgcagggcaacctgactgtttgcgactgctgcgaatcgtttctttta acaaagccagtatcatgcaaacacttgataaaatcacacagctgtcctgcattggcagtg gcttcttgccaaagacctgagggctactggtccgactgtggcacacagtcggcccactct gattatgcagatgaagaagattcctttgtctcagacagttctgaccaggtgcaggcctgt ggacgagcctgcttctaccagagtagaggattccctttggtgcgctatgcttacaatcta ccaagagttaaagactga >gi568815587r:27267870_27572302|GENSCAN_predicted_peptide_8|241_aa MTCDSASEDDRIRPGAQKRLPTPKLLCTGEKEQLYASRENSQVKEYREGPPKRKNSWHEE LAAPISPGNLPEIQVTGPPPRSMESKILEVGLSNVFYTALLVIRIHIIVAQRGTTARLSQ STFMLVKVFGLHSNFTRQVHLLLHTQVIPLHDGKFLPKLIPTTEHQRDKSSTSVCQNQST CLKNNETRNSGSSNLRVGKDPDEMDVITSSANVINLLILGCHSGGFFLLLEFWNATSARM V >gi568815587r:27267870_27572302|GENSCAN_predicted_CDS_8|726_bp atgacgtgtgactcagcaagtgaagatgatcgaatcagacctggagctcaaaaaaggctt ccaacaccaaagctgttgtgcacaggggaaaaagagcagctgtatgcaagtagggaaaac agccaagtaaaggagtacagggaagggcctccgaaacgaaagaacagctggcatgaagaa ctagcagcaccaatatcacctggaaatttgccagaaattcaagtaactgggcccccaccc aggtcgatggaatcaaaaatcctagaggtgggtctcagcaatgtgttttacacagccctc ctggtgattcgaatacacattatagtggcacaacggggaaccactgcccggctctcacaa agcacttttatgctcgtgaaagtatttggacttcacagcaactttacaaggcaggtacac ctgttattacacactcaagtaataccactgcacgatgggaaattcctccccaagctcatt cctaccactgaacaccagagagataaatcatcaacttctgtctgtcaaaaccagtctaca tgtctgaaaaacaatgaaactagaaactcaggctcgagtaacttgagggttgggaaggac cctgatgaaatggatgtcattacttcctctgccaacgtcataaatcttcttatccttgga tgtcacagcggggggtttttcctccttctggaattctggaatgccacctcagccaggatg gtgtga >gi568815587r:27267870_27572302|GENSCAN_predicted_peptide_9|112_aa MPGPLGLLCFLALGLLGSAGPSGAAPPLCAAPCSCDGDRRVDCSGKGLTAVPEGLSAFTQ ALEEKADMLTVDLQISVRGNCSCALQPSVLESGTGNEDIRVQEGIVELGQVT >gi568815587r:27267870_27572302|GENSCAN_predicted_CDS_9|339_bp atgccgggcccgctagggctgctctgcttcctcgccctggggctgctcggctcggccggg cccagcggcgcggcgccgcctctctgcgcggcgccctgcagctgcgacggcgaccgtcgg gtggactgctccgggaaggggctgacggccgtgcccgaggggctcagcgccttcacccaa gcgctggaagagaaagcagatatgttaactgtggaccttcagatttcagttcgtggaaac tgttcatgtgctcttcagccttctgttttagagtctggcacaggaaatgaagacatccga gttcaggagggcattgtagagcttgggcaggtcacttaa >gi568815587r:27267870_27572302|GENSCAN_predicted_peptide_10|308_aa MDSSSEDRMDEGTQAKMQAGLEEVMGTTCGRLREKMAALGEPVRLERDTRRTRAPLVSTA EAAFGDPYTRKANVTHPVRTDYTHCLYSTRTDTRYNSTEPPFPAVCTQTPSLLSPQPRLY SGWDICRAIELLEKLQRSGEVPPQKLQALQRVLQSEFCNAVREVYEHVYETVDISSSPEV RANATAKATVAAFAASEGHSHPRVVELPKTEEGLGFNIMGGKEQNSPIYISRIIPGGIAD RHGGLKRGDQLLSVNGVSVEGEHHEKAVELLKAAQGKVKLVVRYTPKVLEEMESRFEKMR SAKRRQQT >gi568815587r:27267870_27572302|GENSCAN_predicted_CDS_10|927_bp atggacagttcctcagaggacagaatggatgaggggacacaggcgaagatgcaagcaggt ttggaagaagtgatgggaaccacatgtggaaggttaagggagaagatggcggcgctaggg gaacccgtgcggctggagagagacacacgccgcacacgcgctcccttggtgagcacagca gaagcagcattcggagacccgtacacccgaaaagcaaacgtcacacacccggttcgcaca gactacactcattgcctgtacagcacgagaacggacacacggtacaattcaacagaacct cctttccccgctgtttgcacacaaactccttcattactttctccacaacctcgcttgtat tcgggttgggatatttgtagagcaattgaattattggaaaaactacaaaggagtggagaa gtaccaccacagaaacttcaggctttgcaaagagtccttcaaagtgaattctgcaatgct gtgagagaggtatatgaacatgtctatgagactgtggacatcagtagcagtcctgaagtg agagcgaacgctactgcaaaggctactgttgctgcatttgctgccagtgaaggacattct catcctcgagttgttgagctaccaaaaacagaagagggccttggattcaatattatggga ggcaaagaacaaaactctccaatctatatatcccgaataattccaggtggaattgctgat agacatgggggcctcaaacgtggagatcaactcctctctgttaatggagtgagtgttgaa ggagaacatcatgaaaaagctgtagaactgctgaaagccgcacaaggaaaggttaaatta gtggtacgatacacacccaaagtcttagaagaaatggagtcgcgctttgaaaaaatgaga tcagcaaaacgcaggcaacagacctaa >gi568815587r:27267870_27572302|GENSCAN_predicted_peptide_11|162_aa MKRAKEKQFLELLPPQKIVNKKQYCIPGGTAEISATVKDLKDAGVAIPITFPFNTPIWSM QKTDGAWRITVDYVCKETIMDLLGLQLVLVLWRILTNTITKAHLQPSIFPFPKFNQSPLL EMTVMWEKPDQQFAHAALCFQVPFATATAISSPHTDTVSCVH >gi568815587r:27267870_27572302|GENSCAN_predicted_CDS_11|489_bp atgaagagggccaaggagaagcagttcctagaattgcttccacctcaaaaaatagtaaat aaaaagcaatactgcattcctggagggactgcagagattagtgctaccgttaaggacttg aaagatgcaggggtggcgattcccatcacattcccattcaacactcctatttggtctatg cagaagacagatggagcttggagaataacagtggattatgtttgcaaagaaacaatcatg gatcttctgggcctccaattagttctggttctctggagaatcctgactaatacaatcacc aaggctcatcttcagccatcgatttttccctttccgaagttcaaccagtctccactatta gagatgacagtgatgtgggagaaacctgatcagcagtttgctcatgctgctctctgtttc caggtcccttttgcaacagcaactgccatctcctctcctcacacagacacagtcagctgt gtacattga