GENSCAN 1.0    Date run:  3-Nov-116    Time: 23:27:53

Sequence gi568815594r:17387134_17611993 : 224860 bp : 43.59% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   9449   9509   61  1  1   78   19   230 0.120  16.51
 1.02 Term +  16905  17050  146  1  2   89   51    46 0.080  -0.93
 1.03 PlyA +  17247  17252    6                               1.05

 2.00 Prom +  23337  23376   40                              -0.86
 2.01 Init +  27268  27427  160  0  1   83   -3   180 0.690   6.40
 2.02 Term +  27438  27655  218  1  2   31   55   152 0.790   3.51
 2.03 PlyA +  28126  28131    6                               1.05

 3.00 Prom +  28432  28471   40                              -2.36
 3.01 Init +  30140  30202   63  1  0   90   90    16 0.562   3.15
 3.02 Intr +  32906  33083  178  1  1   81   83    43 0.626   2.59
 3.03 Term +  39618  39664   47  1  2  111   43    58 0.716   0.97
 3.04 PlyA +  39722  39727    6                               1.05

 4.02 PlyA -  39789  39784    6                               1.05
 4.01 Sngl -  40775  40563  213  2  0   82   41   153 0.885   5.18
 4.00 Prom -  44005  43966   40                              -4.06

 5.00 Prom +  44154  44193   40                              -4.66
 5.01 Init +  45154  45259  106  0  1   82   87    49 0.195   4.58
 5.02 Term +  56845  57065  221  2  2   71   44   153 0.652   6.40
 5.03 PlyA +  58162  58167    6                               1.05

 6.04 PlyA -  58929  58924    6                               1.05
 6.03 Term -  60045  59885  161  1  2   54   38   129 0.255   2.60
 6.02 Intr -  69414  69328   87  1  0  131   66     3 0.008   2.34
 6.01 Init -  77582  77573   10  2  1   98  116     8 0.435   5.20
 6.00 Prom -  94970  94931   40                              -3.26

 7.07 PlyA -  95923  95918    6                               1.05
 7.06 Term - 103612 103525   88  0  1  118   48    42 0.397   0.33
 7.05 Intr - 105207 105099  109  1  1  102  105   175 0.973  19.94
 7.04 Intr - 114726 114586  141  1  0   67  107    91 0.977   9.22
 7.03 Intr - 117342 117246   97  0  1   55   65    74 0.884   1.38
 7.02 Intr - 122247 122138  110  1  2   38   68   126 0.470   5.60
 7.01 Init - 124921 124519  403  1  1   89   64   310 0.637  23.49
 7.00 Prom - 126720 126681   40                              -5.66

 8.00 Prom + 127806 127845   40                              -6.56
 8.01 Init + 128134 128386  253  0  1   64   72   238 0.960  16.21
 8.02 Intr + 135526 135681  156  2  0   57   87   109 0.973   7.68
 8.03 Intr + 135731 135910  180  0  0   77  110   267 0.999  27.64
 8.04 Term + 139684 139949  266  2  2   15   38   432 0.967  26.57
 8.05 PlyA + 142922 142927    6                               1.05

 9.00 Prom + 147174 147213   40                              -3.76
 9.01 Init + 172613 172669   57  1  0   79   79    -4 0.204  -0.89
 9.02 Intr + 176635 176708   74  0  2   95   42    64 0.762   0.70
 9.03 Intr + 177402 177566  165  0  0  110   20   136 0.712   8.08
 9.04 Intr + 178160 178354  195  2  0   95   13   153 0.926   7.03
 9.05 Term + 178938 179154  217  0  1   61   28   162 0.536   4.12
 9.06 PlyA + 184239 184244    6                               1.05

10.00 Prom + 186432 186471   40                              -6.06
10.01 Init + 190333 190434  102  0  0   62  100   117 0.792   8.54
10.02 Intr + 192691 192806  116  0  2   87  103     5 0.812   1.15
10.03 Intr + 194627 194681   55  2  1  101   80    26 0.883   2.08
10.04 Intr + 195155 195260  106  1  1   32   30   125 0.981   0.89
10.05 Intr + 196350 196509  160  1  1   85   59   172 0.992  13.05
10.06 Intr + 197839 198003  165  1  0   75   91    91 0.993   7.18
10.07 Intr + 201686 201844  159  2  0   98   68   188 0.999  16.90
10.08 Intr + 208277 208401  125  2  2   77   97   101 0.996  10.13
10.09 Intr + 209913 210011   99  1  0   57   45    91 0.604   1.78
10.10 Intr + 210656 210744   89  0  2  105   89    32 0.777   4.69
10.11 Intr + 211323 211425  103  2  1  105   71    51 0.886   4.85
10.12 Intr + 217455 217534   80  1  2  106   75    15 0.864   1.27
10.13 Intr + 219696 219805  110  2  2   90   91    50 0.860   4.58
10.14 Term + 220267 220456  190  1  1  115   32    63 0.862   0.32
10.15 PlyA + 220521 220526    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  69699  69851  153  2  0   61   37   187 0.965   8.82

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815594r:17387134_17611993|GENSCAN_predicted_peptide_1|68_aa
MVIDDDNDDEGGDDEDEDEDGELRPLAIVTFHTTLGLPPLFNEKIGFVLTLGKHLEYQKP
VCRKAAAI

>gi568815594r:17387134_17611993|GENSCAN_predicted_CDS_1|207_bp
atggtcatcgacgatgacaatgatgatgaaggtggtgatgatgaagatgaagatgaggat
ggtgaacttaggcctctggccattgtgacttttcatacaacccttggtttgcctccatta
ttcaatgagaaaataggttttgttttaacactgggcaagcatctggagtaccagaaacct
gtctgcagaaaagcagcagcaatatga

>gi568815594r:17387134_17611993|GENSCAN_predicted_peptide_2|125_aa
MAGCRSRALARGEAAEALRESECGAGGQAVLGDPAHPLQLLARVLSPSLPGASGRSECGA
RGARAHLELALARKRHAQLGFLPVPLPPHLPASIGSRLRSRPAQRGAPTVQQRAEGFLKC
SQSGR

>gi568815594r:17387134_17611993|GENSCAN_predicted_CDS_2|378_bp
atggcaggctgcaggtcccgagccctggcccgcggggaggcggctgaggccctgcgagaa
tccgagtgcggcgcgggcgggcaggcagtgctgggggacccggcgcaccctctgcagctg
ctggcccgggtgctaagcccctcactgcccggggccagcggccgctccgagtgcggagcc
cgcggagcccgagcccacctggaactcgcgctggcccgcaagcgccacgcgcagcttggg
ttcttgcctgtgcctctccctccacaccttcctgcaagcatagggagccggctccggtct
cggccagcccagcgaggggctcccacagtgcagcagcgggctgaagggttcctcaagtgc
agccagagtggacgctga

>gi568815594r:17387134_17611993|GENSCAN_predicted_peptide_3|95_aa
MGLQLFLMKFHVSLPEKASVRRMPSTPDLGKQATDLESSLLYFRIISATNLFKFSLALDL
SNKNANPSLTGLLSRLNNDREFLIPHYQLADIQWN

>gi568815594r:17387134_17611993|GENSCAN_predicted_CDS_3|288_bp
atggggcttcagcttttcctaatgaagtttcacgtgtcccttcctgaaaaagccagtgtc
aggaggatgccctcgacaccagatttaggaaagcaagcaacagatttggagtcaagtctc
ctgtatttcagaattatctcagctactaacctcttcaagttttcattggctctagatctg
tcgaataagaatgctaatccttcactcacaggattgttgtcaaggttaaataatgacaga
gagttcctcatccctcactatcagcttgcagatatccagtggaactga

>gi568815594r:17387134_17611993|GENSCAN_predicted_peptide_4|70_aa
MHGAILEDLVFPSEIVHKRLCMKLDGSRLMKVHSDKAQQNNVEHRVETFSGVYKKLTGKD
VNFEFQGFQL

>gi568815594r:17387134_17611993|GENSCAN_predicted_CDS_4|213_bp
atgcatggtgccatccttgaggacttggttttcccaagtgaaattgtgcacaagagactc
tgcatgaaactggatggcagccggctcatgaaggttcattcagacaaagcacagcagaac
aacgtggaacacagggttgagactttttctggtgtctataagaagctcactggcaaggat
gttaattttgaattccaagggtttcagttgtaa

>gi568815594r:17387134_17611993|GENSCAN_predicted_peptide_5|108_aa
MAILSKKNKSGGITLPNFKLYYNAIVIKIAWYWYKSTLCKLSVELPFWGLEDDGLLLTAP
LGSAPVRTLCGDPNPTFLFHTALREVLHEGSAPAANFCLDIQAFPYIL

>gi568815594r:17387134_17611993|GENSCAN_predicted_CDS_5|327_bp
atggcaatactaagcaaaaagaacaaatctggaggcatcacattaccaaacttcaaatta
tactacaatgctatcgttatcaaaatagcatggtactggtataaaagcacactgtgcaag
ctgtcagtggaactaccattctggggtctggaggatgatggccttcttctcacagctcca
ctaggcagtgccccagtgcggactctgtgtggggaccccaaccccacatttctcttccac
actgctctaagagaggttctccatgagggctctgctcctgcagcaaatttctgcctggac
atccaggcatttccatacatcctctga

>gi568815594r:17387134_17611993|GENSCAN_predicted_peptide_6|85_aa
MAEDTLLLNNKLVSSPPLQMSYSSLRVPALNLVSKFESTNNDGSLHQVTVLGCGATKMKK
ECSCPRGAHSLVGGNNADITGYGTS

>gi568815594r:17387134_17611993|GENSCAN_predicted_CDS_6|258_bp
atggcggaagacactttacttctgaacaacaaattggtgtctagtccccctttacaaatg
agttatagttctttgagagtaccagctttaaatctagtgagcaaatttgagtcaacaaat
aacgatgggagcttacatcaagtgactgtgcttggctgtggggctaccaagatgaagaag
gaatgctcctgccctcgaggagctcacagtctagtgggagggaacaatgcggacataact
ggctacggaacaagttag

>gi568815594r:17387134_17611993|GENSCAN_predicted_peptide_7|315_aa
MAAAAAAGEARRVLVYGGRGALGSRCVQAFRARNWVMLRAGAAGSPRGWPGAFVWNPAGG
SLEEGGGPCAPPDSCCLHVRAQVYASIPVSECVRTPVPGRWAFAVQGGAPWASARPKKSG
VTRERIPKGRNGSRAVFSFQWVASVDVVENEEASASIIVKMTDSFTEQADQVTAEVGKLL
GEEKVDAILCVAGGWAGGNAKSKSLFKNCDLMWKQSIWTSTISSHLATKHLKEGGLLTLA
GAKAALDGTPGMIGYGMAKGAVHQLCQSLAGKNSGMPPGAAAIAVLPVTLDTPMNRKSMP
EADFSSWTPLEFLVE

>gi568815594r:17387134_17611993|GENSCAN_predicted_CDS_7|948_bp
atggcggcggcggcggctgcaggcgaggcgcgccgggtgctggtgtacggcggcaggggc
gctctgggttctcgatgcgtgcaggcttttcgggcccgcaactgggtaatgctgcgggct
ggggctgctgggtctccgcgggggtggccgggggctttcgtgtggaacccggcggggggc
agtctagaggaaggtgggggcccctgtgcaccccctgactcgtgttgcttgcacgtgcgc
gcacaggtctacgcgtctattcctgtttccgagtgcgtgcgcacgccagtcccagggcgc
tgggcctttgctgtgcaaggaggtgctccgtgggcctccgccagacccaaaaagtccggc
gtgactcgggagagaattccaaaggggaggaacgggagcagggctgttttctccttccag
tgggttgccagcgttgatgtggtggagaatgaagaggccagcgctagcatcattgttaaa
atgacagactcgttcactgagcaggctgaccaggtgactgctgaggttggaaagctcttg
ggtgaagagaaggtggatgcaattctttgcgttgctggaggatgggccgggggcaatgcc
aaatccaagtctctctttaagaactgtgacctgatgtggaagcagagcatatggacatcg
accatctccagccatctggctaccaagcatctcaaggaaggaggcctcctgaccttggct
ggcgcaaaggctgccctggatgggactcctggtatgatcgggtacggcatggccaagggt
gctgttcaccagctctgccagagcctggctgggaagaacagcggcatgccgcccggggca
gccgccatcgctgtgctcccggttaccctggataccccgatgaacaggaaatcaatgcct
gaggctgacttcagctcctggacacccttagaattcctagttgagtga

>gi568815594r:17387134_17611993|GENSCAN_predicted_peptide_8|284_aa
MPGWFKKAWYGLASLLSFSSFILIIVALVVPHWLSGKILCQTGVDLVNATDRELVKFIGD
IYYGLFRGCKVRQCGLGGRQSQFTRTEACWSTDKHESGGHEFISESLPLFEIPTLAVLPS
PLSVEAFVVSKLGLHEIFPHLVKELNAGLHVMILLLLFLALALALVSMGFAILNMIQVPY
RAVSGPGGICLWNVLAGGVVALAIASFVAAVKFHDLTERIANFQEKLFQFVVVEEQYEES
FWICVASASAHAANLVVVAISQIPLPEIKTKIEEATVTAEDILY

>gi568815594r:17387134_17611993|GENSCAN_predicted_CDS_8|855_bp
atgcctggatggttcaaaaaggcgtggtatgggctggcgtctttactcagcttctcctcc
ttcatcctgatcatcgttgccctggtagtgccccactggctgagtgggaaaatcctttgt
cagactggagtggatctggtcaacgccacagacagagagctggtcaagttcattggggac
atttactacgggctcttccgagggtgtaaagtgcggcagtgtgggcttgggggccgccaa
tcccaattcacgagaacagaggcttgttggtcaactgacaagcatgaatctgggggccat
gagttcatctcagaaagtctcccgctgtttgaaatccccacgcttgctgtcttgccctca
cccctgtctgtcgaagccttcgtggtaagcaagcttggacttcatgaaatcttcccacac
ctggtgaaggagctcaacgcaggccttcatgtgatgattctgctgctcctcttcctggcc
ttggccctggctctggtcagcatgggctttgccattcttaacatgatccaggtcccgtac
cgggcagtcagcggtcctgggggcatctgcctatggaatgtcctggcaggcggcgtcgtg
gcgttagccatcgccagcttcgtggctgcggtgaaatttcacgacctgacggaacgaatc
gccaactttcaggagaagctcttccagtttgtggtggtggaagaacagtatgaagagtcg
ttttggatctgcgtggccagcgcttcggcccatgctgcaaacttggtcgtggtggcgatc
agtcaaattcccctccctgagattaagaccaaaatcgaagaggccacggtcacagctgag
gatatcttgtattaa

>gi568815594r:17387134_17611993|GENSCAN_predicted_peptide_9|235_aa
MMERPETLVSDPFLIAKCQEHGAIEQWNIPLTPSLMKGHMACECTWIENISAKIIPAVAA
VKDRKRCQVFHLSIVSLFLQINSSHVAYCKCDRGPSGHHTWIENVSAKIIPAVAAVKDWK
RCQVFCLSIVSLSLQINSSHVAYCKCDRGPSGHQWEPHHQTSDGIGTQVLDKDDTHLVID
CVEIANATACAIKDQRDALNSLAKVVMDNESSQLSAGKMGWHLHDCQHLLLGINQ

>gi568815594r:17387134_17611993|GENSCAN_predicted_CDS_9|708_bp
atgatggaaaggcctgaaacattggtttctgaccccttccttatagcaaaatgccaggaa
catggagccattgaacagtggaacatcccactcacaccgtcactgatgaaaggacacatg
gcctgcgaatgtacatggattgagaacatatcagcgaagatcatcccagcagtagcagca
gtgaaagaccggaagcgttgccaggtcttccacttgagtattgtgagcctattcctccag
attaatagttcacatgtggcctattgcaaatgtgacaggggtcccagtggccatcataca
tggattgagaacgtatcagcgaagatcatcccagcagtagcagcagtgaaagactggaag
cgttgccaggtcttctgcttgagtattgtgagcctatccctccagattaatagttcacat
gtggcctattgcaaatgtgacaggggtcccagtggccatcagtgggagccacaccaccag
acatcagacggtataggcacacaggtcctggataaagatgatacacatcttgtcattgac
tgtgtggagatagcgaatgctactgcctgtgctattaaggaccagcgggatgcattaaac
tcactagccaaggtggtgatggacaacgaaagttctcaattatctgctggcaaaatggga
tggcatttgcatgactgccaacacctcttgctgggtataaatcaataa

>gi568815594r:17387134_17611993|GENSCAN_predicted_peptide_10|552_aa
MFLLPLPAAGRVVVRRLAVRRFGSRSLSTADMTKGLVLGIYSKEKEDDVPQFTSAGENFD
KLLAGKLRETLNISGPPLKAGKTRTFYGLHQDFPSVVLVGLGKKAAGIDEQENWHEGKEN
IRAAVAAGCRQIQDLELSSVEVDPCGDAQAAAEGAVLGLYEYDDLKQKKKMAVSAKLYGS
GDQEAWQKGVLFASGQNLARQLMETPANEMTPTRFAEIIEKNLKSASSKTEVHIRPKSWI
EEQAMGSFLSVAKGSDEPPVFLEIHYKGSPNANEPPLVFVGKGITFDSGGISIKASANMD
LMRADMGGAATICSAIVSAAKLNLPINIIGLAPLCENMPSGKANKPGDVVRAKNGKTIQV
CKCVEVEHTVLHHGPCTEGDVAAGPLGGTCMQVDNTDAEGRLILADALCYAHTFNPKVIL
NAATLTGAMDVALGSGATGVFTNSSWLWNKLFEASIETGDRVWRMPLFEHYTRQVVDCQL
ADVNNIGKYRSAGACTAAAFLKEFVTHPKWAHLDIAGVMTNKDEVPYLRKGMTGRPTRTL
IEFLLRFSQDNA

>gi568815594r:17387134_17611993|GENSCAN_predicted_CDS_10|1659_bp
atgttcttgctgcctcttccggctgcggggcgagtagtcgtccgacgtctggccgtgaga
cgtttcgggagccggagtctctccaccgcagacatgacgaagggccttgttttaggaatc
tattccaaagaaaaagaagatgatgtgccacagttcacaagtgcaggagagaattttgat
aaattgttagctggaaagctgagagagactttgaacatatctggaccacctctgaaggca
gggaagactcgaaccttttatggtctgcatcaggacttccccagcgtggtgctagttggc
ctcggcaaaaaggcagctggaatcgacgaacaggaaaactggcatgaaggcaaagaaaac
atcagagctgctgttgcagcggggtgcaggcagattcaagacctggagctctcgtctgtg
gaggtggatccctgtggagacgctcaggctgctgcggagggagcggtgcttggtctctat
gaatacgatgacctaaagcaaaaaaagaagatggctgtgtcggcaaagctctatggaagt
ggggatcaggaggcctggcagaaaggagtcctgtttgcttctgggcagaacttggcacgc
caattgatggagacgccagccaatgagatgacgccaaccagatttgctgaaattattgag
aagaatctcaaaagtgctagtagtaaaaccgaggtccatatcagacccaagtcttggatt
gaggaacaggcaatgggatcattcctcagtgtggccaaaggatctgacgagcccccagtc
ttcttggaaattcactacaaaggcagccccaatgcaaacgaaccacccctggtgtttgtt
gggaaaggaattacctttgacagtggtggtatctccatcaaggcttctgcaaatatggac
ctcatgagggctgacatgggaggagctgcaactatatgctcagccatcgtgtctgctgca
aagcttaatttgcccattaatattataggtctggcccctctttgtgaaaatatgcccagc
ggcaaggccaacaagccgggggatgttgttagagccaaaaacgggaagaccatccaggtt
tgtaaatgtgttgaggtagaacatactgtcctacatcatgggccctgcactgaaggggat
gtggcagcaggtcctctgggtggcacgtgcatgcaggttgataacactgatgctgagggg
aggctcatactggctgatgcgctctgttacgcacacacgtttaacccgaaggtcatcctc
aatgccgccaccttaacaggtgccatggatgtagctttgggatcaggtgccactggggtc
tttaccaattcatcctggctctggaacaaactcttcgaggccagcattgaaacaggggac
cgtgtctggaggatgcctctcttcgaacattatacaagacaggttgtagattgccagctt
gctgatgttaacaacattggaaaatacagatctgcaggagcatgtacagctgcagcattc
ctgaaagaattcgtaactcatcctaagtgggcacatttagacatagcaggcgtgatgacc
aacaaagatgaagttccctatctacggaaaggcatgactgggaggcccacaaggactctc
attgagttcttacttcgtttcagtcaagacaatgcttag