GENSCAN 1.0	Date run: 6-Nov-116	Time: 15:34:30

Sequence gi568815588f:17129423_17337268 : 207846 bp : 39.25% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   5888   5946   59  1  2   67   87    53 0.263   3.93
 1.02 Intr +   8548   8673  126  1  0   44   92    82 0.219   3.07
 1.03 Intr +  15043  15101   59  1  2   59   89    52 0.068   0.01
 1.04 Term +  18154  18290  137  2  2  111   42    40 0.090  -1.10
 1.05 PlyA +  18772  18777    6                               1.05

 2.05 PlyA -  19475  19470    6                              -0.45
 2.04 Term -  19718  19618  101  0  2  122   28   107 0.970   5.41
 2.03 Intr -  24214  24085  130  1  1   97   88    93 0.999   9.55
 2.02 Intr -  25312  25255   58  0  1   76   79    43 0.995   0.17
 2.01 Init -  28356  28019  338  0  2   54   73   278 0.564  19.70
 2.00 Prom -  30070  30031   40                              -3.05

 3.00 Prom +  32642  32681   40                              -2.75
 3.01 Init +  34672  35526  855  0  0   43   86   338 0.094  23.77
 3.02 Intr +  67365  67499  135  2  0   47   83    89 0.011   4.14
 3.03 Intr +  85568  85671  104  1  2   41  119    51 0.015   1.55
 3.04 Intr +  86107  86270  164  1  2   17   98   108 0.572   3.40
 3.05 Intr +  90840  90930   91  1  1   52   94    65 0.245   1.63
 3.06 Intr +  97565  97616   52  2  1   91   81    72 0.534   4.79
 3.07 Intr +  98947  98981   35  0  2   96   87    40 0.531   0.80
 3.08 Intr +  99877 100563  687  1  0   39   72   833 0.346  66.53
 3.09 Intr + 101228 101288   61  2  1   40   72    34 0.691  -5.18
 3.10 Intr + 104165 104260   96  1  0   83   88   142 0.999  12.99
 3.11 Intr + 104348 104509  162  1  0  123   97   277 0.998  31.35
 3.12 Intr + 105271 105396  126  0  0   72   84   260 0.980  23.96
 3.13 Intr + 105747 105967  221  2  2  104   91   373 0.732  35.38
 3.14 Intr + 124450 124563  114  1  0   47   97    49 0.030   0.34
 3.15 Intr + 135664 135776  113  1  2   86   64    55 0.121   2.00
 3.16 Intr + 138853 138988  136  2  1   -4   71   158 0.193   3.61
 3.17 Intr + 139828 139886   59  1  2   78   78    46 0.060   0.21
 3.18 Term + 146221 146306   86  2  2   65   33   154 0.099   4.44
 3.19 PlyA + 146877 146882    6                               1.05

 4.00 Prom + 151403 151442   40                              -5.75
 4.01 Sngl + 152516 153028  513  1  0   49   42   430 0.958  30.29
 4.02 PlyA + 155128 155133    6                               1.05

 5.00 Prom + 182743 182782   40                              -3.65
 5.01 Init + 182867 183001  135  1  0   69   59    66 0.661   1.99
 5.02 Term + 184756 184911  156  0  0   23   42   148 0.821   0.65
 5.03 PlyA + 185108 185113    6                               1.05

 6.05 PlyA - 186021 186016    6                               1.05
 6.04 Term - 191924 191456  469  1  1   64   41   313 0.957  17.66
 6.03 Intr - 193735 193643   93  0  0   72  100    72 0.961   4.96
 6.02 Intr - 197704 197592  113  1  2   42   82    82 0.610   1.26
 6.01 Intr - 202130 201986  145  1  1  118   89    55 0.661   7.96

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815588f:17129423_17337268|GENSCAN_predicted_peptide_1|126_aa
MVMFAFYKQPCGAMSETQSWGMFFYLECNNPYVIFLLGQLNLSGGLDQCGSHKNEERVEM
QRKNEILEALGVKCGVLEGDFRWLISLRIMYSRFIHVTAYVRISFLRKAGTIPLHARNTI
PASIHG

>gi568815588f:17129423_17337268|GENSCAN_predicted_CDS_1|381_bp
atggtcatgtttgcattttacaagcaaccctgtggagccatgtctgagacccagagttgg
gggatgttcttttatcttgagtgcaataatccttatgtgatatttctgctaggccagctg
aatttatctggaggactggatcaatgtgggagtcataaaaatgaggagagggttgagatg
cagaggaaaaatgagattttggaagctttaggagtcaagtgtggtgtacttgagggagac
ttcaggtggcttatttcacttcgcataatgtactcaaggttcatccatgtcaccgcatat
gtcagaatttccttcctgcggaaggctggtactattccactgcatgcacggaacacaatt
cctgcatccattcatgggtga

>gi568815588f:17129423_17337268|GENSCAN_predicted_peptide_2|208_aa
MEFPKIESVHPQKYAMDVENKIQEKNVEPNISFDGSIQCSGKDAILFKLETAEEIHRKNQ
QDSDLSVKMLKDFLEDDTDVNQYLLPPKSLLRYALLLDIVQPTCRRSVCFTKGYGSYIEG
TGSVLQTAEDVQVENIYKSLTNLSQEEQITKLLILKLRYFTPKEIANLLGFPPEFGFPEK
ITVKQRYRLLGNSLNVHVVAKLIKILYE

>gi568815588f:17129423_17337268|GENSCAN_predicted_CDS_2|627_bp
atggagttccccaaaattgaatctgtacatccacaaaaatatgcaatggatgtagaaaat
aaaattcaagaaaagaacgttgaaccaaatattagctttgatggcagcatacagtgttct
ggaaaagatgccattctttttaagcttgaaactgcagaagaaattcacaggaaaaatcaa
caagatagtgatctctctgtgaaaatgctaaaagattttcttgaagatgacactgacgtg
aaccagtatcttttaccaccaaagtcattgctgcgatatgctcttctgttagacattgtt
cagcccacttgtagaaggtccgtgtgctttaccaaaggatatggaagctacatagaaggg
acagggtctgtgttacagactgcagaggatgtgcaggttgagaatatctacaaatccctt
accaatttgtcacaagaagaacagataacaaagctgttaatacttaaactgcgatatttc
actcctaaagaaatagcaaatctccttggatttcctccagagttcggatttcctgagaag
ataacagtgaaacagcgttatcgcctacttggaaatagtctcaacgtgcatgtagtagct
aaactaatcaaaatcttatatgaataa

>gi568815588f:17129423_17337268|GENSCAN_predicted_peptide_3|1098_aa
MNIDAKIHNKILANRIQQHIKKLIHHDQVGFIPGMQGWFNTSKSINVIQHINRTNDKNHM
IISINAEKTFDKTQQPFMLKTLNKLGIDGTYLKIIRAIYDKPTANIIPNGQKLEAFPLKT
GTRQGCPLSSLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLEKPIISA
QNLLKLTSNFSKVSGYKINVQKSQAFLYTNKRQTESQIMSELPFTIASKRIKYLGIQLTR
DVKDLFKKIYQSLLNDIKEDTNKWKNIPCSGIGRINIVKMATLPKCLPTTCIHMIRPQFR
VSSVRRDFNSEVAMEELKQTRKPIFLAREVRTLTSFPIFQPYGFLILRGHRQEQFTSTVV
LRITWTKGILVIQVNPWLTTLEAGISKAPDKPWNPSQVAPALPQKPAQVLGLWLAFSVNR
TWMQLETIILSKLMQEQKSKYCMLSLISGSESVSKRRIALGSALHVEGPTAHSKAMAQLS
PRQRRSRAPTTHTHRALVRLFSGSQSAPPPPPRPSPPSAAMSTRSVSSSSYRRMFGGPGT
ASRPSSSRSYVTTSTRTYSLGSALRPSTSRSLYASSPGGVYATRSSAVRLRSSVPGVRLL
QDSVDFSLADAINTEFKNTRTNEKVELQELNDRFANYIDKVRFLEQQNKILLAELEQLKG
QGKSRLGDLYEEEMRELRRQVDQLTNDKARVEVERDNLAEDIMRLREKLQEEMLQREEAE
NTLQSFRQDVDNASLARLDLERKVESLQEEIAFLKKLHEEEIQELQAQIQEQHVQIDVDV
SKPDLTAALRDVRQQYESVAAKNLQEAEEWYKSKFADLSEAANRNNDALRQAKQESTEYR
RQVQSLTCEVDALKGTNESLERQMREMEENFAVEAANYQDTIGRLQDEIQNMKEEMARHL
REYQDLLNVKMALDIEIATYRKLLEGEESRGAIVLLQISGYYNQLRACVQKSPKRSRDST
YPSEQLLSRQTKFYDSDLRVNSSPSMDSMDLMYLILWFLEIHFIRSQKYNTAFGSVIHPL
WVYLEPFDPAELWPSLLQNEEECPLGPNGTREAQQLGRTQFPPYFDTPEITVSLQFGTCK
RLADLEDVSKASTQAHRK

>gi568815588f:17129423_17337268|GENSCAN_predicted_CDS_3|3297_bp
atgaacatcgatgcaaaaatccacaataaaatactggcaaaccgaatccagcagcacatc
aaaaagcttatccaccatgatcaagtgggcttcatccctgggatgcaaggctggttcaac
acaagcaaatcaataaatgtaatccagcatataaacagaaccaacgacaaaaaccacatg
attatctcaataaatgcagaaaagacctttgacaaaactcaacaacccttcatgctaaaa
actctcaataaattaggtattgatgggacatatctcaaaataataagagctatctatgac
aaacccacagccaatatcataccgaatgggcaaaaactggaagcattccctttgaaaact
ggcacaagacagggatgccctctctcatcacttctattcaacatagtgttggaagttctg
gccagggcaatcaggcaggagaaagaaataaagggcattcagttaggaaaagaggaagtc
aaattgtccctgtttgcagatgacatgattgtatatctagaaaaacccatcatctcggcc
caaaatctccttaagctgacaagcaacttcagcaaagtctcaggatacaaaatcaatgtg
caaaaatcacaagcattcttatacaccaataaaagacaaacagagagccaaatcatgagt
gaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaacttacaagg
gatgtgaaggacctcttcaagaagatctaccaatcactgctcaatgacataaaagaggat
acaaacaaatggaagaacattccatgctcagggataggaagaatcaatattgtgaaaatg
gccacattgcccaagtgtctaccaaccacctgcatccacatgataagacctcagtttaga
gttagcagtgtgaggagagacttcaattcagaggtagcaatggaagaacttaaacaaacg
aggaaacccatctttctggcaagggaggtgcgaacactcaccagtttccccattttccaa
ccctatggttttcttattctacgtggccacagacaggaacaattcactagcacagtggtt
ctcagaatcacctggacaaaaggtatactagtgattcaggtaaatccttggctgaccaca
ctggaagctggcatctccaaggctcccgacaaaccctggaacccaagccaagttgctcct
gcactaccccagaagccagctcaggtccttggcttgtggcttgctttttctgtcaacaga
acatggatgcagctggagaccatcatccttagcaaactaatgcaggaacagaaaagcaaa
tactgcatgttgtcacttataagtgggagtgagtcagtcagcaagcgtcgcattgccctg
ggatcggcactgcacgtagaaggcccgaccgcacacagcaaggcgatggcccagctgtcc
ccgcgccagagacgcagccgcgctcccaccacccacacccaccgcgccctcgttcgcctc
ttctccgggagccagtccgcgccaccgccgccgcccaggccatcgccaccctccgcagcc
atgtccaccaggtccgtgtcctcgtcctcctaccgcaggatgttcggcggcccgggcacc
gcgagccggccgagctccagccggagctacgtgactacgtccacccgcacctacagcctg
ggcagcgcgctgcgccccagcaccagccgcagcctctacgcctcgtccccgggcggcgtg
tatgccacgcgctcctctgccgtgcgcctgcggagcagcgtgcccggggtgcggctcctg
caggactcggtggacttctcgctggccgacgccatcaacaccgagttcaagaacacccgc
accaacgagaaggtggagctgcaggagctgaatgaccgcttcgccaactacatcgacaag
gtgcgcttcctggagcagcagaataagatcctgctggccgagctcgagcagctcaagggc
caaggcaagtcgcgcctgggggacctctacgaggaggagatgcgggagctgcgccggcag
gtggaccagctaaccaacgacaaagcccgcgtcgaggtggagcgcgacaacctggccgag
gacatcatgcgcctccgggagaaattgcaggaggagatgcttcagagagaggaagccgaa
aacaccctgcaatctttcagacaggatgttgacaatgcgtctctggcacgtcttgacctt
gaacgcaaagtggaatctttgcaagaagagattgcctttttgaagaaactccacgaagag
gaaatccaggagctgcaggctcagattcaggaacagcatgtccaaatcgatgtggatgtt
tccaagcctgacctcacggctgccctgcgtgacgtacgtcagcaatatgaaagtgtggct
gccaagaacctgcaggaggcagaagaatggtacaaatccaagtttgctgacctctctgag
gctgccaaccggaacaatgacgccctgcgccaggcaaagcaggagtccactgagtaccgg
agacaggtgcagtccctcacctgtgaagtggatgcccttaaaggaaccaatgagtccctg
gaacgccagatgcgtgaaatggaagagaactttgccgttgaagctgctaactaccaagac
actattggccgcctgcaggatgagattcagaatatgaaggaggaaatggctcgtcacctt
cgtgaataccaagacctgctcaatgttaagatggcccttgacattgagattgccacctac
aggaagctgctggaaggcgaggagagcagaggggccatagtgttgcttcagatctctggg
tactataatcagttgagagcctgtgttcaaaagtcccccaaaaggtctagagattccact
taccccagtgaacagttgctcagtaggcagactaaattctatgactctgatctgagggtt
aacagtagtccatctatggattctatggacttgatgtatcttatactgtggttcctggaa
attcattttatcaggagccagaaatacaacacagcttttggctctgtcattcatccactg
tgggtctacctggagccatttgatcctgctgagctgtggccttcccttttgcaaaatgaa
gaagaatgcccccttgggccaaatgggacaagagaagcccagcagcttggtaggacccag
ttcccaccttattttgatactccagaaataactgtttctctgcagtttggcacctgcaag
cggttagcagacctggaggatgtcagcaaagcctctacccaggcccacaggaagtag

>gi568815588f:17129423_17337268|GENSCAN_predicted_peptide_4|170_aa
MWNNFGNGVTGRDWNNLEGSEEERKMWESLELPKDLLNGFDQNADSDVNNEVQAEVVSDE
DEELVGNWNKGDPRYALAKRLAAFCPCTRDLWNFELDRDDLGYLTEEILKQQSIQEVTEH
KSLENLQPDNTREKKNLFSGEKFKPAAEICISNEGPNVNHQANGGDVSRA

>gi568815588f:17129423_17337268|GENSCAN_predicted_CDS_4|513_bp
atgtggaataactttggaaatggggtaacaggcagagattggaacaatttggagggctca
gaagaagaaaggaagatgtgggaaagtttggaacttcctaaagacttgttaaatggtttt
gaccaaaatgctgatagtgatgtgaacaatgaagtccaggctgaggtggtctcagatgaa
gatgaggaacttgttgggaactggaataaaggtgaccctcgctatgctttagcaaagaga
ctggcagcattttgcccctgcactagagatctgtggaactttgaacttgacagagatgac
ttagggtatctgacagaagaaattcttaaacagcaaagcattcaagaggtgacagaacat
aaaagtttggaaaatttgcagcctgacaatacaagagaaaagaaaaatctattttctggg
gagaaattcaagccagctgcagaaatttgcataagtaacgaggggccaaatgttaatcac
caagccaatgggggagatgtctccagggcataa

>gi568815588f:17129423_17337268|GENSCAN_predicted_peptide_5|96_aa
MESYAAIKRNAIMSFAGTWMKLEAIALSKLTQEQKAKHHMFSLIGIKEYYEQLYYDKFYN
SDEEDKFPEEHKLPHLTQEEINRLNDPASIKEIVQY

>gi568815588f:17129423_17337268|GENSCAN_predicted_CDS_5|291_bp
atggaatcctatgcagccataaaaaggaatgcgatcatgtcctttgcagggacatggatg
aagctggaggccattgccctcagcaaactaacacaggaacagaaagccaaacaccacatg
ttctcacttataggtataaaggaatattatgaacaactttattacgataaattctacaac
tctgatgaagaagataaatttcctgaagaacacaaactaccacatctcactcaagaagaa
ataaaccgtctgaatgatcctgcatctattaaagaaattgttcaatattaa

>gi568815588f:17129423_17337268|GENSCAN_predicted_peptide_6|273_aa
XAKLASCCDAVQNFVVSQNNTPVGTNMSYEVESKKEIPIKKNIFHMFPVSQPFVDYPYNQ
CAVVGNGGILNKSLCGTEIDKSDFVFRCNLPPTTGDVSKDVGSKTNLVTINPSIITLKYG
NLKEKKALFLEDIATYGDAFFLLPAFSFRANTGTSFKVYYTLEESKARQKVLFFHPKYLK
DLALFWRTKGVTAYRLSTGLMITSVAVELCKNVKLYGFWPFSKTVEDIPVSHHYYDNKLP
KHGFHQMPKEYSQILQLHMKGILKLQFSKCEVA

>gi568815588f:17129423_17337268|GENSCAN_predicted_CDS_6|822_bp
nnagccaaacttgcttcctgctgtgatgctgttcaaaactttgttgtttctcagaataac
actccagttgggactaatatgagttacgaggtggaaagcaaaaaagaaatcccaattaag
aagaacatttttcatatgtttccagtgtcccagccttttgtggactacccttataatcag
tgtgcagtggtcggaaatgggggaattctgaataagtctctctgtggaactgaaatagat
aaatccgacttcgtttttaggtgtaacctacccccaaccacaggagatgttagtaaagat
gttggcagtaaaacaaatcttgtgactataaatccaagcatcataactctgaaatatggg
aacttaaaggaaaaaaaagccctatttctggaggacattgcaacctatggagatgcattt
tttcttctgccagcattttccttcagggccaacacgggtacctctttcaaagtatactac
acgctcgaagagtctaaagcaagacaaaaggttctatttttccatcccaagtacctgaaa
gatctggcccttttctggagaactaaaggtgtgactgcataccgcttgtccaccggcttg
atgatcacaagtgttgcagtggaactgtgtaaaaatgtgaagctgtatggattctggccc
ttctctaaaactgtagaagacatacctgtcagccatcactattatgacaacaagctacct
aaacatggtttccatcagatgcccaaagaatacagccagatcctccaacttcacatgaaa
ggaatcctcaaactgcaatttagcaaatgtgaagtcgcctaa