GENSCAN 1.0 Date run: 4-Nov-116 Time: 17:04:39

Sequence gi568815592f:113757746_113960576 : 202831 bp : 38.35% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  41002  41097   96  0  0   78   78    83 0.172   6.66
 1.02 Term +  66115  66255  141  0  0   52   44   144 0.036   3.35
 1.03 PlyA +  66384  66389    6                               1.05

 2.00 Prom +  70222  70261   40                              -4.45
 2.01 Init +  70868  70886   19  1  1   70  101    12 0.129   0.95
 2.02 Intr +  74741  74898  158  0  2   24   66   168 0.331   7.01
 2.03 Intr +  81504  81825  322  2  1   68   60   300 0.105  19.71
 2.04 Intr +  97495  97626  132  2  0   80   52    89 0.174   4.20
 2.05 Term +  97950  98140  191  1  2    8   43   189 0.208   3.03
 2.06 PlyA +  98165  98170    6                               1.05

 3.00 Prom +  99816  99855   40                              -8.95
 3.01 Init + 100001 100102  102  1  0   99   99    82 0.070  10.69
 3.02 Term + 101938 102834  897  0  0  121   42   869 0.999  76.98
 3.03 PlyA + 103594 103599    6                               1.05

 4.00 Prom + 106697 106736   40                              -5.15
 4.01 Init + 120690 120780   91  2  1   77   57    80 0.522   4.50
 4.02 Intr + 127353 127425   73  1  1   61   37   117 0.366   1.45
 4.03 Intr + 127578 127736  159  0  0   77   59    63 0.249   0.48
 4.04 Intr + 138721 138815   95  1  2   67   87    71 0.355   3.59
 4.05 Intr + 141757 141861  105  2  0   62   99    58 0.543   3.57
 4.06 Intr + 146534 146999  466  0  1   83  105   175 0.555   9.96
 4.07 Intr + 148428 148761  334  0  1    6   75   389 0.692  24.05
 4.08 Intr + 149730 149893  164  2  2   88   37     1 0.371  -7.15
 4.09 Term + 149984 150173  190  2  1   73   53   146 0.672   5.54
 4.10 PlyA + 150513 150518    6                               1.05

 5.14 PlyA - 151771 151766    6                               1.05
 5.13 Term - 161861 161489  373  1  1  108   32   186 0.327   8.18
 5.12 Intr - 184020 183963   58  1  1   83   61    82 0.397   2.02
 5.11 Intr - 185761 185606  156  2  0   28   71   275 0.964  18.86
 5.10 Intr - 186665 186535  131  1  2   53   57   215 0.977  14.32
 5.09 Intr - 187725 187617  109  1  1   55   69    50 0.988  -1.78
 5.08 Intr - 188403 188263  141  1  0   75  115   134 0.995  14.20
 5.07 Intr - 191342 191234  109  2  1   46  121    24 0.861   0.44
 5.06 Intr - 191515 191423   93  1  0   45   95   114 0.977   7.04
 5.05 Intr - 195715 195532  184  0  1   78   90   110 0.840   9.07
 5.04 Intr - 198406 198268  139  2  1   73   53   102 0.981   3.80
 5.03 Intr - 198948 198874   75  1  0   65  111    83 0.991   6.87
 5.02 Intr - 201021 200904  118  0  1   94   57   115 0.810   8.12
 5.01 Intr - 202273 202161  113  2  2  103   73    49 0.904   4.08

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592f:113757746_113960576|GENSCAN_predicted_peptide_1|78_aa
MGLPHPDPYSKNVYVGSLTRHVTVYEDRAFKELSLAEWIMLAIQMASAGLNVTGEETAIE
KHPLNDLGTQKAITLRIK

>gi568815592f:113757746_113960576|GENSCAN_predicted_CDS_1|237_bp
atggggctcccccatcccgatccctactccaaaaatgtttatgttggatccttaacccgt
catgtgactgtatatgaagatagggcctttaaggagctctcacttgctgagtggatcatg
ttagctatacaaatggcttctgcaggactaaatgtgactggcgaggaaacagccattgaa
aagcatcccctgaatgacttgggaacacaaaaagccatcacactcaggataaaatag

>gi568815592f:113757746_113960576|GENSCAN_predicted_peptide_2|273_aa
MDTKKQWHKPQQSLRYPLEILMVNSYLNETGFQNAVDGKPYKIIDVITEEGGYLFQVSLP
SWQLLLTGAIPTKEMKNSLESINYRLQLMKSGKYMLGYKQTLKMIRQGTAKLVILAYNCP
ALRKSEVEYYAMLAKIGVHHYSGNNIELGTACGKYYRVCTLALTDPAPAPPLPPPRPSAP
NLPRPLPGRPADTRQIGVPERKGGKATRQPDAFLRKRLYISSGEYDHGRIMHLKELDSEK
TKLHVTMFSLVIFKATLEQRVTGLNQPSFSLRV

>gi568815592f:113757746_113960576|GENSCAN_predicted_CDS_2|822_bp
atggacacaaagaaacaatggcataaaccacagcagtcattacgatatccactggagatc
ctgatggtgaactcctacctaaatgaaacaggcttccagaatgctgtggatggtaaaccc
tataaaattattgatgttattactgaagaaggtggatacctgtttcaagtctctctacca
tcttggcagctgctcttgactggggccatcccaaccaaggagatgaaaaattcactggag
tcaattaattatcggctccaactcatgaaaagtggaaagtacatgctggggtacaagcag
actttaaagatgatcagacaaggcacagcaaaattggtcatccttgcttacaactgccca
gctttgcggaaatcagaagtagagtactatgcaatgttggccaaaattggtgtccatcac
tacagtggcaataatattgaactgggaacagcatgcggaaaatactacagagtatgcaca
ctggcgctcactgatccagcgccggctccgcctcttccccctccccggcccagtgctccg
aatcttccccggccgttgccagggagaccagccgataccaggcaaataggcgtaccagaa
agaaaaggggggaaagctacacggcagcctgatgcatttctacggaaacgtctttacatc
agctctggggaatatgaccatggtcgaataatgcatctgaaggaactggacagtgagaag
acgaagcttcacgtaacaatgttcagcttggtgatttttaaagcaacgcttgaacaaaga
gttactggcttgaaccaaccgtctttttcactcagagtgtga

>gi568815592f:113757746_113960576|GENSCAN_predicted_peptide_3|332_aa
MGAQFSKTAAKGEAAAERPGEAAVASSPSKANGQENGHVKVNGDASPAAAESGAKEELQA
NGSAPAADKEEPAAAGSGAASPSAAEKGEPAAAAAPEAGASPVEKEAPAEGEAAEPGSPT
AAEGEAASAASSTSSPKAEDGATPSPSNETPKKKKKRFSFKKSFKLSGFSFKKNKKEAGE
GGEAEAPAAEGGKDEAAGGAAAAAAEAGAASGEQAAAPGEEAAAGEEGAAGGDPQEAKPQ
EAAVAPEKPPASDETKAAEEPSKVEEKKAEEAGASAAACEAPSAAGPGAPPEQEAAPAEE
PAAAAASSACAAPSQEAQPECSPEAPPAEAAE

>gi568815592f:113757746_113960576|GENSCAN_predicted_CDS_3|999_bp
atgggtgcccagttctccaagaccgcagcgaagggagaagccgccgcggagaggcctggg
gaggcggctgtggcctcgtcgccttccaaagcgaacggacaggagaatggccacgtgaag
gtaaacggcgacgcttcgcccgcggccgccgagtcgggcgccaaggaggagctgcaggcc
aacggcagcgccccggccgccgacaaggaggagcccgcggccgccgggagcggggcggcg
tcgccctccgcggccgagaaaggtgagccggccgccgccgctgcccccgaggccggggcc
agcccggtagagaaggaggcccccgcggaaggcgaggctgccgagcccggctcgcccacg
gccgcggagggagaggccgcgtcggccgcctcctcgacttcttcgcccaaggccgaggac
ggggccacgccctcgcccagcaacgagaccccgaaaaaaaaaaagaagcgcttttccttc
aagaagtctttcaagctgagcggcttctccttcaagaagaacaagaaggaggctggagaa
ggcggtgaggctgaggcgcccgctgccgaaggcggcaaggacgaggccgccgggggcgca
gctgcggccgccgccgaggcgggcgcggcctccggggagcaggcagcggcgccgggcgag
gaggcggcagcgggcgaggagggggcggcgggtggcgacccgcaggaggccaagccccag
gaggccgctgtcgcgccagagaagccgcccgccagcgacgagaccaaggccgccgaggag
cccagcaaggtggaggagaaaaaggccgaggaggccggggccagcgccgccgcctgcgag
gccccctccgccgccgggcccggcgcgcccccggagcaggaggcagcccccgcggaggag
cccgcggccgccgcagcctcgtcagcctgcgcagccccctcacaggaggcccagcccgag
tgcagtccagaagcccccccagcggaggcggcagagtaa

>gi568815592f:113757746_113960576|GENSCAN_predicted_peptide_4|558_aa
MQSCMETQKKQETMETQQHPYRLRDGNGRGACTEKRATKKKEGREEEEEESERRKYRGLF
SPDVITASKEASDQEQVTLPSQGTAKWTLHTTESIYFEELDLFSSFACVYTVVCQEEALS
KDSGLVWPCLSHEDTTTTPVSSTLVDVPQCPHGALLAGLLLCLPNATGRTQTAIERQKAW
EWVRQEDLCTSISIAFSTTGFLPSSSAKPAIVATWGASDSPKSPRPEDMLSSLVSERGTQ
RLLGKRTRVLKPRVFSPTASALAVACQTSPLIFAFLPASAFVYPRRQDPTVKSERKTSGS
RFSSRCCVSQCHLLSSQRQLDVVTDFLGMRNCDSDVVEMRWICGRGCIHRKMQPQDQGQK
ASHGGPKDSRPHGGPRDSRSHVEDPRTAGLTWRTQGQQVSHGGPRDSRPHVEGPGTAGLT
WRGYGQQASCGGPQPLSRRRQSRMFFPLKMLACEVAMLQRWPPEEPPFLAPMLLCSLLPH
WVWAALGGSDAVGFPRESQPPYKTSNLRPPCTEEAQASHMEGTCGESKIFGQTVEILAIP
AQGSRHVSEETVLDVELG

>gi568815592f:113757746_113960576|GENSCAN_predicted_CDS_4|1677_bp
atgcagtcctgcatggaaacccagaagaaacaggagacaatggaaactcagcagcaccca
tacagactcagagatggtaatgggagaggagcttgcactgagaagagagctaccaaaaag
aaggaaggaagagaggaagaagaggaggaatctgagagaagaaagtacaggggcctgttt
tctccagatgtaataactgcttcaaaggaagcttcagaccaggagcaagtaactttgcca
agtcaaggaactgcaaaatggacactccacactacagaaagcatttattttgaagaatta
gatctattttcttcatttgcctgtgtttatactgtagtctgccaagaagaagctctcagc
aaagatagtggcctggtttggccctgcctgtcacatgaagacacaacaactactccagtt
tcttctactttggtggatgtgcctcagtgcccgcatggtgcactccttgcaggtctccta
ctgtgcctccccaacgccactggaagaacccaaactgccatcgaaaggcagaaagcctgg
gaatgggttcggcaggaagatctctgtacctcaatctccattgctttcagcaccactggc
tttctccccagctcaagtgcaaaacctgccatcgttgctacctggggtgcctcagactca
cccaagtcccccaggccagaggacatgctctcaagccttgtttcagaaagagggacccaa
aggctgttagggaagagaaccagggtcctcaaacccagggttttctcacccactgcctct
gctctggctgtggcctgccagacctctcctctgattttcgctttccttcctgccagtgcc
tttgtttatccgaggcgccaagatcccactgtgaagtctgaaaggaagaccagtggttct
cgcttttcctctagatgctgtgtctcccaatgtcatcttctttcctctcaaagacaattg
gacgttgttacagattttcttggaatgagaaactgtgactcagatgtggtagagatgaga
tggatatgtggccgtggttgcatccaccggaagatgcagccacaggaccagggacagaag
gcctcacatggagggcccaaggacagcaggcctcacggaggacccagggacagcaggtct
cacgtggaggacccaaggacagcaggcctcacgtggaggacccagggacagcaggtctca
catggaggacccagggacagcaggcctcatgtggagggcccagggacagcaggcctcacg
tggaggggctacggacagcaggcctcatgtggagggccccagcccctgtctagaaggagg
cagagcaggatgtttttcccattgaaaatgttagcatgcgaggtagccatgcttcagaga
tggcctcctgaggaaccaccctttctagcacccatgctcctgtgcagtctcctcccacac
tgggtctgggctgcacttggaggaagtgatgctgtgggatttccaagggaaagtcagcca
ccatataagacatctaacctgagaccaccatgcactgaggaagcccaagctagccacatg
gagggaacatgtggagaaagcaagatatttggccaaactgtagagattttggccatccca
gcccaggggtccagacatgtaagtgaagaaaccgtgttggatgttgagcttggttga

>gi568815592f:113757746_113960576|GENSCAN_predicted_peptide_5|599_aa
XDIGNYYYGQGHPMKPHRIRMTHNLLLNYGLYRKMEIYRPHKATAEEMTKYHSDEYIKFL
RSIRPDNMSEYSKQMQRFNVGEDCPVFDGLFEFCQLSTGGSVAGAVKLNRQQTDMAVNWA
GGLHHAKKSEASGFCYVNDIVLAILELLKTFKGKTKTYGFVFRYHQRVLYIDIDIHHGDG
VEEAFYTTDRVMTVSFHKYGEYFPGTGDLRDIGAGKGKYYAVNFPMRDGIDDESYGQIFK
PIISKVMEMYQPSAVVLQCGADSLSGDRLGCFNLTVKGHAKCVEVVKTFNLPLLMLGGGG
YTIRNVARCWTYETAVALDCEIPNELPYNDYFEYFGPDFKLHISPSNMTNQNTPEYMEKI
KQRLFENLRMLPHAPGVQMQAIPEDAVHEDSGDEDGEDPDKRISIRASDKRIACDEEFSD
SEDEGEGGRRNVADHKKGAKKARIEEDKKETEDKKTDVKEEDKSKDNSGEKTDTKGGVTG
TSCAYTSAFTLLESGVTFNPQITNTAMQAARELEADDKRHIAILLPDLSGQQKIPSWEHS
KITNKSGGRVQFPLQLKKLQSDEQTRHFLLKPCPHSSVSTFAFPSRESNRVWCKRCMSQ

>gi568815592f:113757746_113960576|GENSCAN_predicted_CDS_5|1800_bp
ngtgatattggaaattattattatggacagggtcatcccatgaagcctcatagaatccgc
atgacccataacttgctgttaaattatggcttatacagaaaaatggaaatatataggccc
cataaagccactgccgaagaaatgacaaaatatcacagtgatgagtatatcaaatttcta
cggtcaataagaccagataacatgtctgagtatagtaagcagatgcagagatttaatgtt
ggagaagattgtccagtgtttgatggactctttgagttttgtcagctctcaactggcggt
tcagttgctggagctgtgaagttaaaccgacaacagactgatatggctgttaattgggct
ggaggattacatcatgctaagaaatcagaagcatcaggattctgttacgttaatgatatt
gtgcttgccatccttgaattactaaaaacctttaaaggaaaaaccaaaacttatggattt
gttttcaggtatcatcagagagtcttatatattgatatagatattcatcatggtgatggt
gttgaagaagctttttatacaacagatcgtgtaatgacggtatcattccataaatatggg
gaatactttcctggcacaggagacttgagggatattggtgctggaaaaggcaaatactat
gctgtcaattttccaatgagagatggtatagatgatgagtcatatgggcagatatttaag
cctattatctcaaaggtgatggagatgtatcaacctagtgctgtggtattacagtgtggt
gcagactcattatctggtgatagactgggttgtttcaatctaacagtcaaaggtcatgct
aaatgtgtagaagttgtaaaaacttttaacttaccattactgatgcttggaggaggtggc
tacacaatccgtaatgttgctcgatgttggacatatgagactgcagttgcccttgattgt
gagattcccaatgagttgccatataatgattactttgagtattttggaccagacttcaaa
ctgcatattagtccttcaaacatgacaaaccagaacactccagaatatatggaaaagata
aaacagcgtttgtttgaaaatttgcgcatgttacctcatgcacctggtgtccagatgcaa
gctattccagaagatgctgttcatgaagacagtggagatgaagatggagaagatccagac
aagagaatttctattcgagcatcagacaagcggatagcttgtgatgaagaattctcagat
tctgaggatgaaggagaaggaggtcgaagaaatgtggctgatcataagaaaggagcaaag
aaagctagaattgaagaagataagaaagaaacagaggacaaaaaaacagacgttaaggaa
gaagataaatccaaggacaacagtggtgaaaaaacagataccaaagggggtgttactggt
actagctgtgcatacacctctgccttcaccctccttgagagtggtgtcaccttcaatcca
cagataacaaacacagcaatgcaggcagccagggaactggaggcagatgacaaaagacac
attgctattctcttacctgatctatcaggtcagcagaaaataccttcttgggagcacagc
aagataaccaacaagtcgggagggagggtgcagtttccgttgcagctgaagaaacttcag
agtgatgaacaaactcgacatttcctactgaagccatgtccacactcatctgtatccaca
tttgcttttccaagtcgtgaatcgaacagagtttggtgtaagcgttgcatgtcccagtaa