GENSCAN 1.0	Date run: 8-Nov-116	Time: 01:52:08

Sequence gi568815583r_30262296 : 200047 bp : 42.56% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init + 227008 227074   67  2  1   79   80    81 0.184   5.69
 1.02 Intr + 245290 245659  370  1  1  -99   50   412 0.146  11.94
 1.03 Intr + 246757 246928  172  0  1  -17   86   176 0.377   5.92
 1.04 Intr + 250352 250493  142  0  1  100   55    80 0.883   4.91
 1.05 Intr + 253787 253889  103  2  1  105   20    74 0.748   0.61
 1.06 Intr + 254176 254383  208  0  1   71   25   294 0.806  19.56
 1.07 Intr + 254643 255600  958  1  1   50   85  1201 0.786 105.71
 1.08 Term + 255864 256102  239  0  2   31   55   328 0.487  19.15
 1.09 PlyA + 256233 256238    6                               1.05

 2.07 PlyA - 256342 256337    6                               1.05
 2.06 Term - 258467 258436   32  2  2   92   55     5 0.193  -5.36
 2.05 Intr - 258831 258697  135  0  0  105   76    84 0.444   8.52
 2.04 Intr - 259375 259164  212  2  2   64   29   218 0.515  11.13
 2.03 Intr - 260967 260553  415  0  1   12   99   183 0.670   4.04
 2.02 Intr - 261354 261222  133  2  1   76  113   124 0.716  13.00
 2.01 Init - 263983 263915   69  0  0   89   56    64 0.745   4.40
 2.00 Prom - 270234 270195   40                              -7.25

 3.00 Prom + 277486 277525   40                              -9.05
 3.01 Init + 279126 279242  117  1  0   73   22   112 0.296   3.35
 3.02 Intr + 283617 283672   56  1  2   77  110    10 0.427  -1.04
 3.03 Intr + 289790 290343  554  1  2   52    8   324 0.165  12.37
 3.04 Intr + 290373 290585  213  0  0   79   56   132 0.174   6.86
 3.05 Intr + 292263 292565  303  0  0   63   94   129 0.106   6.64
 3.06 Intr + 292747 292827   81  1  0   85   91    49 0.366   3.59
 3.07 Intr + 293679 293950  272  0  2  123   99    -3 0.137   0.64
 3.08 Intr + 293980 294027   48  2  0  116  105    67 0.956   9.16
 3.09 Intr + 294116 294200   85  0  1   86   81    87 0.982   6.27
 3.10 Intr + 294296 294405  110  2  2   80  109   143 0.631  14.68
 3.11 Intr + 295456 295542   87  2  0   38   94    76 0.842   2.45
 3.12 Intr + 295649 295756  108  0  0   94   78    55 0.955   4.66
 3.13 Intr + 295952 296039   88  0  1   76   86   107 0.987   7.92
 3.14 Intr + 296416 296672  257  1  2   80   78   360 0.999  30.44
 3.15 Intr + 296928 296996   69  1  0   83   57    86 0.776   3.46
 3.16 Intr + 297919 298008   90  2  0    0   37   176 0.263   2.97
 3.17 Intr + 298241 298316   76  0  1   85  102   145 0.781  13.77
 3.18 Intr + 298748 298839   92  2  2   92   89    72 0.781   6.49
 3.19 Intr + 299095 299453  359  2  2   47   35   276 0.741  11.33
 3.20 Intr + 299597 299694   98  1  2   90   97    73 0.999   7.13
 3.21 Intr + 299790 299945  156  0  0   68   54   213 0.961  14.96
 3.22 Term + 300031 300206  176  1  2   84   41    91 0.390   0.94
 3.23 PlyA + 302607 302612    6                               1.05

 4.08 PlyA - 303074 303069    6                               1.05
 4.07 Term - 306754 306599  156  0  0  103   42   131 0.857   6.95
 4.06 Intr - 307877 307638  240  1  0   92   81   382 0.986  34.72
 4.05 Intr - 308748 308623  126  2  0   30    4   225 0.586   8.26
 4.04 Intr - 319026 318912  115  1  1  127  110    35 0.388   9.13
 4.03 Intr - 326445 326358   88  0  1   48   63   116 0.035   3.21
 4.02 Intr - 334971 334897   75  0  0  111   70    15 0.029   0.57
 4.01 Init - 336359 336266   94  1  1   63   23   117 0.155   3.33
 4.00 Prom - 336706 336667   40                              -6.85

 5.00 Prom + 336787 336826   40                             -10.65
 5.01 Init + 337922 338026  105  0  0  116   82    38 0.896   6.34
 5.02 Intr + 338726 338805   80  0  2   96   84    60 0.924   3.83
 5.03 Intr + 341938 342100  163  0  1    8   92   135 0.202   4.96
 5.04 Intr + 343548 343667  120  1  0   92   76    74 0.651   6.37
 5.05 Intr + 344560 344682  123  2  0   68   75    75 0.706   4.06
 5.06 Intr + 344801 344881   81  0  0   85   99    70 0.929   6.72
 5.07 Intr + 345735 345778   44  1  2  123    7    77 0.423  -0.78
 5.08 Intr + 345910 346006   97  0  1  110   99    -6 0.442   1.79
 5.09 Intr + 346036 346083   48  2  0  121  105    56 0.956   8.56
 5.10 Intr + 346172 346256   85  0  1   86   81    60 0.992   3.57
 5.11 Intr + 346352 346461  110  2  2   86  109   103 0.794  11.28
 5.12 Intr + 347511 347597   87  1  0   43   94    92 0.957   4.55
 5.13 Intr + 347704 347811  108  2  0   78   78    80 0.973   5.56
 5.14 Intr + 348007 348094   88  2  1   76   86    94 0.995   6.62
 5.15 Intr + 348471 348727  257  0  2   59   90   402 0.994  33.74
 5.16 Intr + 348983 349051   69  0  0   85   57   104 0.864   5.46
 5.17 Intr + 349974 350069   96  1  0   71   37   199 0.830  12.39
 5.18 Intr + 350302 350377   76  2  1   83  102   124 0.848  11.47
 5.19 Intr + 350809 350900   92  1  2   92   89    83 0.859   7.59
 5.20 Intr + 351156 351514  359  1  2   47   35   281 0.855  11.83
 5.21 Intr + 351658 351755   98  0  2  107   97    66 0.999   8.13
 5.22 Intr + 351851 352006  156  2  0   68   89   213 0.997  18.46
 5.23 Term + 352171 352367  197  1  2    8   40   178 0.460   1.59
 5.24 PlyA + 354670 354675    6                               1.05

 6.00 Prom + 355230 355269   40                              -6.15
 6.01 Init + 357444 357545  102  1  0   51   98    66 0.636   4.19
 6.02 Intr + 363059 363293  235  0  1   72  -17   203 0.436   4.54
 6.03 Intr + 364123 364234  112  1  1   22   73   133 0.539   3.72
 6.04 Intr + 364507 364654  148  0  1   27   98   204 0.739  14.62
 6.05 Intr + 368408 368478   71  0  2   38  121    36 0.348  -1.04
 6.06 Intr + 371195 371291   97  1  1   62  105   164 0.998  14.59
 6.07 Intr + 371875 372128  254  2  2   42   97    97 0.769   1.31
 6.08 Intr + 372785 372948  164  1  2   21   91   170 0.786   9.30
 6.09 Intr + 373192 373338  147  1  0   35  107   161 0.965  11.99
 6.10 Intr + 376451 376525   75  2  0   80   74    44 0.515   0.77
 6.11 Term + 393709 393827  119  1  2   54   42   139 0.003   3.62
 6.12 PlyA + 394418 394423    6                               1.05

 7.03 PlyA - 394852 394847    6                               1.05
 7.02 Term - 408174 408062  113  0  2   89   42   123 0.563   5.54
 7.01 Init - 421123 421054   70  0  1   84   76    65 0.451   6.19

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 329967 329727 241 1 1 83 39 193 0.848 8.41 S.002 Term - 391762 391522 241 2 1 82 37 220 0.948 10.81 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583r_30262296|GENSCAN_predicted_peptide_1|752_aa MARSWLTTSSPSRVHAILLPLPHEDISIPESEEQEHEEDGSETEADGQEDLEDLEEEEEV SDMGGDNPEVSERANSSKFDPTKSPVLSDEDSDLDFHINKLEQQSKVRNKGHGKPREKSI ADEKFFQLSEMEAYLENRKRRGTKRWDEDDDLEESEDSKQCKESLKRVTFTLPDDEAIED AGVSHVKKNSDEVKSSFKKRQEKMNEKNYTFRKRVVIKKPWLRLGEVTAQKRPENSLLEE TLHFDHAVRMVHREINKGEGRSESGTVQEHVGWGKVEWEVWGPLDITDEELKEKNAELQE KLRLVESEKSEIQLNVKDLKRKLERAQLLLPQASSCSPGGCGSPIRLGPWSRDHQQLQVE ADRLGKELQSVSAKLQAQVEENELWNLLNQQQEEKMWRQEEKIQEQEEKMCEQELKIREQ EEKMWRQEEKMHEQEEKIREQEDKMWRQEEKIREQEEKIREQEEKMWRQEEKIREQDEKI QEQEEEMWRQEEKIREQEEKRQEKMWRQEKKMREQDEKIREQEEEMWRQEEKIRELEEMM QDQEEKLREVEEKMQEEEEKMQEQEEKIQRQEEKIQEQEEKTWRQEKLLKQEEKIWEQEE KMWRQEEKMWEQEEKMQEQEEKMQRQEEKMREQEVRLWQQEEKMQEQEVRLQELEERLGK LGQKAELLGGAGRESGSQRLPTLTPILQVELKSQEAQSLQQQQDHYRGHLQQYVAAYQQL ASGKEALPSCSSRKLRAKRWLRWPTNSCRRPS >gi568815583r_30262296|GENSCAN_predicted_CDS_1|2259_bp atggcgcgatcttggctcaccacaagctccccctcccgggttcacgccattctcctgcct ctgcctcatgaagatatcagtatcccagagagtgaagaacaggagcatgaagaggatggt tcagagacagaggctgatggccaggaggacctagaagatttagaggaggaggaggaagtg tcagatatgggtggtgacaatcctgaagtgagtgagagagcaaactcaagcaaattcgat ccgacgaaaagcccagttctcagtgatgaggattctgaccttgactttcatatcaacaaa ttggaacagcagagcaaggtgcgaaacaaaggacacgggaaaccaagagaaaagtccata gcagatgagaaattcttccaactctctgaaatggaggcctatttagaaaacagaaaaaga agaggaacgaaaagatgggatgaagatgatgacctggaagaaagtgaagacagtaaacaa tgtaaagaaagcttgaaaagagtgaccttcactttgccagatgatgaggcaattgaagat gcaggtgtttcacatgtaaagaaaaattctgatgaagttaaatcctcttttaaaaaaaga caggaaaagatgaatgaaaaaaattacacctttagaaaaagagttgttataaaaaagcct tggttgcgtctgggggaagtgacagcacagaagagaccagagaatagcctgctggaggag accctgcactttgaccatgctgtccggatggtacatcgagaaattaacaaaggagaggga cgctctgagtctggaactgtacaggaacacgtaggatgggggaaggtggaatgggaggtc 
tgggggcccttagacataaccgatgaggagttgaaggagaaaaatgccgaactacaagaa aaacttcgacttgtagaatctgaaaagtctgagatccagctcaacgtaaaggaccttaaa aggaagctggaaagggcccagctcctgctgccacaggcgagcagctgcagccccgggggt tgtgggagccccatccggctggggccatggtctagggatcatcagcagctgcaggtggag gctgaccgcctgggtaaggagctacagagtgtgtcagcaaagctccaagcccaggtggaa gagaacgagttgtggaacctcctgaaccagcaacaagaggagaagatgtggaggcaggag gagaagatacaggagcaggaagagaagatgtgtgagcaggagctgaagataagggagcag gaggagaagatgtggaggcaggaggagaagatgcatgagcaggaagagaagatacgggag caggaggacaagatgtggaggcaggaggagaagatacgggagcaggaagagaagatacgg gagcaggaggagaagatgtggaggcaggaggagaagatacgggagcaggatgagaagata caggagcaggaggaggagatgtggaggcaggaggagaagatacgggagcaggaggagaag aggcaggagaagatgtggaggcaggagaagaagatgcgcgagcaggatgagaagatacgg gagcaggaggaggagatgtggaggcaggaggagaagatacgggagctggaggagatgatg caagatcaggaggagaagctgcgggaggtggaggagaaaatgcaggaggaggaggaaaag atgcaggagcaggaggagaagatacagaggcaggaggagaagatccaggagcaggaggag aagacgtggaggcaggagaagctgctcaagcaggaagagaagatatgggagcaggaggag aagatgtggaggcaggaggagaagatgtgggaacaggaggagaagatgcaggaacaggag gagaagatgcagaggcaggaggagaagatgcgggagcaggaagtgaggctgtggcagcag gaggagaagatgcaggaacaggaggtgaggctgcaggagctggaggagaggctggggaag ctggggcagaaggcggagctcttggggggagcaggcagagaatctggaagccagcgacta cctaccctgacgcctatcctgcaggtggagctgaagagccaagaggctcagagtctgcag cagcagcaagaccactaccggggtcacctgcagcagtacgtggccgcctatcagcagctg gcctctgggaaggaggcactgcccagctgcagcagcaggaagctcagggcgaagcggtgg ctgagatggcccaccaatagttgcaggagacccagctga >gi568815583r_30262296|GENSCAN_predicted_peptide_2|331_aa MPDYSKRMSRKQYESKPQSPGEKLRNGGSAKWSKVLGTPKSEPIAGGQDEGEALRVSRMP ERQDRPKGKLKKGCGMHPAVAWGYRLASRVLKQLASWVAAPGMRGRTQNLGQASTGIPSG EPGHSAGRAGGSRCTRSMFHKVPNIASAQIGSELECGCSRQGGSCSLRPGVQPSTHLSPP QSCSTGSSVLWALWSHPQREGFGATSCQLQWGTAAQGPSRSVCLRRVLPHGPAEQKLMDD LLNKTRYHNLIRPAASSSQLISIEMELSLAQCISVVACSTGDLTGSLLFARNKNQTARTD GYRGPGVGAGPQAVQRLLSKAGLEPLLWKQL >gi568815583r_30262296|GENSCAN_predicted_CDS_2|996_bp atgccagattatagcaaaaggatgtcgaggaagcaatatgaaagcaagcctcagagtcct 
ggagagaagctgagaaatggggggtcggccaaatggtctaaggttctgggaacccctaag tcagagcccatagctggtggtcaagatgagggagaggccctcagggtcagccgaatgcca gagaggcaggacaggcccaaaggtaaactaaagaagggctgtgggatgcacccagctgta gcctggggctacagactggcttccagggtactcaagcagctggcctcttgggtagcagcc ccgggtatgagaggcaggactcagaatctaggccaagcctccacaggaatcccctctgga gagcccgggcactctgcaggaagggcaggaggcagcaggtgcaccaggagcatgttccac aaggtgcccaatattgcatctgctcagataggcagcgagttggaatgtggatgcagtagg cagggtggcagctgctccctacggccaggagtccagcccagcacccatctgagtccacct cagtcctgctcaactgggtcatccgtgctctgggccctctggtcccacccacagagggag ggctttggagcgaccagctgtcagctgcagtgggggacagctgcacaaggaccaagcagg tctgtgtgtttacgcagggttctgccgcatggccctgccgagcagaagctgatggacgac cttctgaacaaaacccgttaccacaacctgatccgcccagccgccagctcctcacagctc atctccatcgagatggagctctccctggcccagtgcatcagtgtggtagcttgctcaaca ggtgaccttacaggctccctactctttgcgaggaataagaaccagactgcgagaaccgat gggtacagaggcccaggtgtaggggcaggaccacaggcagtgcagcgtctactgagcaag gcggggttagaacctctcctctggaagcagctctga >gi568815583r_30262296|GENSCAN_predicted_peptide_3|1164_aa MGMEGGKKCESSSVAESKWLGLCSQRLSEKEELLADLEEAWLRAAVFWKGALCPRCGRRN STQQIGCSQEKGKNALGHSPSTQPQIPSDDKTPARVYMTPEAHWTGPPNPGALGYPHQSF VSQPHPFSKQPSPCPRQSPQGDFGWVTPGASRSITGPSSPAAPSLISVGSLGSHLQGARP HPRQSSLGDFGLVTPGTPCCRLCPPLLLPQGRPPWVLCAGVSKELGPNPVLPSPIVEQRL GHGGVECCDVTVHLVTAVTARLAFDLTTQSPKRSHPVSGSSGHSTNFQLEGEWRLWDLGA RGSRLPHSLTDVDVSRGNQDKENSHLPSGVTEGAPGWDGGEIRTMREVGTKELWDKRSKI GRKENVAVDGEERKSEGSDTVGDRTSPCALSSLVVSNRFPQGRPYIICYPERSGEPVPRT SSSPGFNVRKNQSTEEHHQIFETTEETSGTSAGRSNVISFPRNMTAGFGGHSDIQAPVSS HPLPAWGRRLTPQIPPHPHRAPDNLVPWVGLSWGIGGILGACLLLCHLCLPLEKKANNER QKAERELEVQIQTLIIQKEELNTDLYHMERSLRYFEEESKDLAVRLQHSLQCKGELERAL SAVIATEKKKANQLSSCSKAHTEWELEQSLQDQALLKAQLTQLKESFQQLQLERDECAEH IEGERARWHQRMSKMLQEICTLKKEKQQDMRRVEELERSLSKLKNQMAEPLPPEPPAVPS EVELQHLRKELERVAGELQSQVKNNQHISLLNRRQEERIREQEERLRKQEERLQEQHEKL RQLAKPQSVFEELNNENKSTLQLEQQVKELQEKLGELKSQEVQSLQQQPDHYLGHLQQYV ATYQQQEHLEAASQQNQQLTAQLNLMALPGEGHGGEHLDSEGEEAPQPMPSVPEDLESRE AMVAFFKSAGASAQEKQAQLQEQVKEQRVCCQRLAHPVASAQKEPEAARGPGAPGPGGES 
VSGETHWALQEVTEKLAHARTHLHLLHDLKMPPEGRSLPRCDCNILAPEQLYGPPEGEGR PEKIHHLLSEPGGRAKDAALGGGHHQAGAQGGDEGEAAGAAADGIAAYSNYNNGHRKFLA AAHNSADEPGPGAPAPQELGAADKHGDLREVTLTSSAQGEAREDPLLDKPTAQPIVQDHQ EHPGLGSNCCVPLFCWAWLPRRRR >gi568815583r_30262296|GENSCAN_predicted_CDS_3|3495_bp atgggaatggaaggaggaaagaaatgtgaaagctcatcggtggcagagtcaaaatggctt ggtctttgtagtcaacgattaagtgagaaggaggaattactggctgacttagaagaagcc tggctgcgcgctgctgtgttctggaaaggcgcattgtgccctcgctgtggcagaagaaac tcaacacaacaaattggctgcagccaagaaaaaggtaaaaacgcactaggtcatagcccc tcaacccagccacagatcccctctgatgacaagacccctgccagagtctatatgactcct gaggcacactggactggtccccccaaccccggtgccttgggctacccccaccaaagtttt gtcagtcagccccaccccttcagcaagcagcccagtccttgccctcgccaatcaccccag ggtgactttgggtgggtgactcctggggcttcccgctccattactgggccctcatctcct gccgccccaagcttgatctccgtgggctctttgggctctcatctccaaggagccaggccc caccctcgccagtcatccttgggtgactttgggctggtgactcctgggactccctgctgc agactgtgccctcccctcctgctgcctcaaggtcgacctccctgggttctttgtgctggc gtctccaaggagctgggtcccaaccctgtgcttccctcccccatcgtggagcagcgactt ggacatggaggagtggaatgttgtgatgtcacagtccacctagtaactgccgttactgca agactggcctttgaccttacgacccagtcccctaagcgttctcaccccgtttctggttcc tctggtcacagcacaaatttccagctggaaggggaatggagactatgggacctaggagca agaggttccaggctgcctcactcccttacagatgttgacgtctcaaggggaaaccaggac aaagagaacagccacttgccatcaggagtcactgaaggggccccaggatgggatggtggg gagataagaaccatgagagaagttggcacaaaggagttatgggacaaaaggtccaagata ggcagaaaagaaaatgttgcagttgatggggaagaaaggaagtcagagggctcagacact gtgggggacagaacatctccatgtgcactctcatctcttgtagtcagcaacaggtttcca cagggaaggccctacatcatctgctaccctgaaagatctggagagcccgtgccaagaacg agcagtagtcctggattcaacgtccgtaaaaatcagtcgactgaagaacaccatcaaatc tttgaaacaacagaagaaacaagtggaacatcagctggaagaagtaacgtgatttcgttt cctcgcaacatgactgctgggtttggggggcactcagacatacaggccccagtctcgtct cacccactcccagcctggggaagaaggctcacccctcagattccaccccatccccacagg gcccctgataacctggtcccatgggtgggcctgtcctggggcattggtggcattctgggg gcatgtctcttgctgtgccatctctgcctccccctggaaaagaaagcaaacaacgagaga cagaaagccgaaagggagctagaggttcaaatccagacattgatcatacagaaagaggaa 
ctaaatacggacctgtaccacatggaacgttctctcagatactttgaagaagagtccaag gacctggctgtccgcctgcaacattcattgcagtgtaaaggagagttagagagggctctg tctgctgtcatcgccacagagaagaagaaggcaaaccagttgtccagctgcagcaaagca catacagagtgggagttagagcagtccctacaggaccaggcactgctgaaagcgcagctg acacagttgaaggagtcatttcaacaactccaattagaaagagatgagtgtgctgaacat atagaaggagagagggcccggtggcatcagaggatgagtaaaatgttgcaggagatttgc acattaaagaaagagaagcagcaagatatgcgtcgggtagaggagctggagaggagcttg tccaaactcaaaaaccagatggctgaacccttgcccccggagcccccagcagtgccctct gaggtggagctgcagcacctgaggaaggaactagagagagtggcaggagagctccagtcc caggtcaaaaacaatcagcacataagtctcctgaaccggcgacaagaagagaggattcgg gaacaggaagagaggcttcggaagcaggaggagaggcttcaggagcagcacgagaagctt cggcagctggccaagccacagagcgtcttcgaggagctgaacaatgagaacaagagcaca ctgcagttggagcagcaagtaaaggagctacaggagaagcttggcgagctgaagagccaa gaggttcagagtctgcagcagcagccagaccattacctgggtcacctgcagcagtacgtg gccacctatcagcagcaggagcacctggaagcggccagccagcagaaccagcagctaacg gcccagctgaacctcatggctctccctggggaaggacacggaggagaacatctggacagt gagggggaggaggcacctcagcccatgccgagtgtcccagaggacctggagagcagggag gccatggtggcatttttcaagtccgctggagctagtgcccaggagaagcaggcacagtta caagagcaggtgaaagagcagagggtgtgctgccagcgcctggctcacccggtggcctcg gcccagaaggagccagaggcagccagaggccctggagccccagggcctgggggcgagtct gtgagtggggagacccactgggccctgcaggaagtcacggagaagctggcccatgccagg actcacctccaccttctccatgacttgaaaatgccacctgagggcaggtcgctgccgaga tgtgactgcaatattttggctccagagcagctttatggaccacctgaaggagaaggcaga cctgagaaaatccatcaccttttatcagaaccagggggccgtgccaaagatgcagcactg ggaggaggacaccatcaggctggagctcagggaggagatgaaggtgaagctgctggagct gcagcagatggtattgcggcttacagcaactacaacaatgggcacagaaaattcctggcc gctgcccacaactctgctgatgagcccggtccaggagccccagccccccaggagcttggg gctgcagacaagcatggtgatcttcgtgaggtgaccctcacctcctctgcccaaggagag gccagggaggatcctctccttgacaagcctactgcacagccgatcgtgcaggaccaccag gagcacccaggcttgggcagcaactgctgtgtgccattattttgttgggcttggctgcca agaagaaggagataa >gi568815583r_30262296|GENSCAN_predicted_peptide_4|297_aa MVQELLGQTLQVGVQQPQQQVPVEQGQHITQGPLAISPKKPWFFSPTLTLEPLPVQATLI 
VSVIGFLRCEQQDLDQTPRCFSSNMRKNSKGAVRVLHAIITQRYESSIMQLQLPHSAALQ KEAQLERQMETTQNLVDSYMAIVNKTVWDLMVGVMPKTIMHVMINNTKEFIFSELLSNLY SRGDQKTLMEESAEQAQWRDEMLRMYHVLKEALGIIGDINTTTISTHMGARGQLLPAGAE RPCRMQLHHPGSSSVSSLSGLVAVVLGSPRREAALDKWVGRHGDQRRGKPKGAEHQS >gi568815583r_30262296|GENSCAN_predicted_CDS_4|894_bp atggtccaggaactgctgggccaaactctccaagtgggagtgcagcagccgcaacagcaa gtaccagtagaacaagggcagcacatcacccagggacctcttgctatttcccctaaaaag ccttggttcttttcacctacactgactcttgagcctctcccagttcaagcaacactcatc gtctccgtgattggctttctgcgctgtgagcaacaggacctagaccaaacccctcgatgt ttcagtagcaatatgaggaagaattcgaaaggagcagtcagggtattgcatgccatcatt acacagagatatgaatcaagtatcatgcaactccaactaccacattctgctgccctccaa aaggaggcacagctggagcggcaaatggaaaccacccagaaccttgtggactcctacatg gccattgtcaacaagaccgtgtgggacctcatggttggtgtcatgcccaagaccatcatg cacgtcatgatcaacaacaccaaggagttcatcttctcggagctgctgtccaacctgtac tcacgtggggaccagaaaacgctgatggaagagtcggcagagcaggcacagtggcgcgac gagatgctgcgcatgtaccacgtgctgaaggaggcactcggcatcatcggcgacatcaac acgaccaccatcagcacgcacatgggggcccgtggacaactcctgcctgcaggtgcagag cgtccttgccggatgcagcttcatcatcctggttcaagcagtgtttcttctctatcaggc ctggtggctgttgttttgggctccccaaggcgagaggcggccctggacaagtgggttgga agacacggtgaccagagaagagggaagcccaaaggggctgagcatcagtcttaa >gi568815583r_30262296|GENSCAN_predicted_peptide_5|912_aa MACPWIRHEVLLSRLGFLVLENTNMDKRPSKLLRWVRVFAFKRPFALRPKEVLEHQQASI SRPLPESIRLLRHTGLVPPTPVPLGYPHQSFVSQPHPFSKQPSPCPRQSPQGDFGWLKEY WQKNSPRVPAGANRNRKTNGSVPEKATSGGCQPPGDSATGFHREGPTSSATLKDLEVRGS GQRCSDPSGQPSNLLLQSPCQERAVVLDSRSVEISQLKNTIKSLKQQKKQVEHQLEEVTA PDNLVPWVGLSWGIGGILGACLLLCHLCLPLEKKANNKKQKAKRVLEVQIQTLNIQKGKL NTDLYHMKRSLRYFEEKSKDLAVCLQHSLQRKGELESVLSNVMATQKKKANQLSSRSKAR TEWKLEQSMREEALLKVQLTQLKESFQQVQLERDECAEHLKGERARWQQRMRKMSQEICT LKKEKQQDMRRVEKLERSLSKLKNQMAEPLPPEPPAVPSEVELQHLRKELERVAGELQAQ VKKNQRISLLNQRQEERIQEQEERLRKQEERIQEQHKSLQQLAKPQSVFEEPNNENKNAL QLEQQVKELQEKLGEVELKSQEAQSLQQQPDHYLGHLQQYVATYQQQEHLEAASQQNQQL TAQLSLMALPGEGHGGEHLDSEGEEAPRPMPSVPEDPESREAMVAFFKSAGASAQEKQAQ LQEQVKEQRVCCQRLAHPVASAQKEPEAARGPGAPGPGGESVSGETHWALQEVTEKLAHA 
RTHLHLLHDLKMPPEGRSLPRCDCNILAPEQLYGPPGGEGRPEKTHHLLSEPGGRAKDAA LGGGHHQAGAQGGDEGEAAGAAADGIAAYSNYNNGHRKFLAAAHNSADEPGPGAPAPQEL GAADKHGADRAGPPGAPRLGQQLLCAIVVLGLAAKKKEINITILKELLKKFLNKKPSYGV NLLHNSFTSFEC >gi568815583r_30262296|GENSCAN_predicted_CDS_5|2739_bp atggcgtgtccatggataagacatgaagtccttctttcaagacttggttttctggtactg gaaaataccaatatggataaaagaccttcaaagctgctacgatgggtaagggtttttgca ttcaaaaggccctttgccttaagacctaaagaggttttggaacatcagcaagcatccatc tcgagacccctgccagagtctatacgactcctgaggcacactggactggtcccccctacc ccggtgcctctgggctacccccatcaaagttttgtcagtcagccccaccccttcagcaag cagcccagtccttgccctcgccaatcaccccagggtgactttgggtggttaaaagaatat tggcagaaaaacagccctagagttccagcaggagcgaacaggaacaggaaaacaaatggc agtgtccctgagaaagccacttctggtggttgccagccacctggggattcagcaacaggt ttccacagggaaggccctacatcatctgctaccctgaaagatctggaggtaagaggctct gggcagaggtgcagtgacccttcgggtcaaccctccaacctcctcctccagagcccgtgc caagaacgagcagtagtcctggattcaaggtccgtagaaatcagtcaactgaagaacacc atcaaatctctgaaacaacagaagaaacaagtggaacatcagctggaagaagtaacggcc cctgataacctggtcccatgggtgggcctgtcctggggcattggtggcattctgggggca tgtctcttgctgtgccatctctgcctccccctggaaaagaaagcaaacaacaagaaacag aaagccaaaagggtgctagaggttcaaatccagacattgaacatacagaaagggaaacta aatacggacctgtaccacatgaaacgttctctcagatactttgaagaaaagtccaaggat ctggctgtctgcctgcaacattcattgcagcgtaaaggagagttagagagtgttctctct aatgtcatggccacacagaagaagaaggcaaaccagttgtccagccgcagcaaagcacgt acggagtggaagttagagcagtccatgcgggaggaggcactactgaaagtgcagctgaca cagttgaaggagtcatttcaacaagtccaattagaaagagatgagtgtgctgaacatcta aaaggagagagggcccggtggcagcagaggatgagaaaaatgtcgcaggagatttgcaca ttaaagaaagagaagcagcaagatatgcgtcgggtagagaagctggagaggagcttgtcc aaactcaaaaaccagatggctgaacccttgcccccggagcccccagcagtgccctctgag gtggagctgcagcacctgaggaaggaactagagagagtggcaggagagctccaggcccag gtcaaaaagaatcagcgcataagtctcctgaaccagcgacaagaagagaggattcaggag caggaagagaggcttcggaagcaggaggagaggattcaggagcagcacaagagccttcag cagctggccaagccacagagcgtcttcgaggagccgaacaatgagaacaagaacgcactg cagttggagcagcaagtaaaggagctacaggagaagcttggcgaggtggagctgaagagc 
caagaggctcagagtctgcagcagcagccagaccattacctgggtcacctgcagcagtac gtggccacctatcagcagcaggagcacctggaagctgccagccagcagaaccagcagcta acggcccagctgagcctcatggctctccctggggaaggacacggaggagaacatctggac agtgagggggaggaggcacctcggcccatgccgagtgtcccagaggacccggagagcagg gaggccatggtggcatttttcaagtccgctggagctagtgcccaggagaagcaggcacag ttacaagagcaggtgaaagagcagagggtgtgctgccagcgcctggctcacccggtggcc tcggcccagaaggagccagaggcagccagaggccctggagccccagggcctgggggcgag tctgtgagtggggagacccactgggccctgcaggaagtcacggagaagctggcccatgcc aggactcacctccaccttctccatgacttgaaaatgccacctgagggcaggtcgctgccg agatgtgactgcaatattttggctccagagcagctttatggaccacctggaggagaaggc agacctgagaaaacccatcaccttttatcagaaccagggggccgtgccaaagatgcggca ctgggaggaggacaccatcaggctggagctcagggaggagatgaaggtgaagctgctgga gctgcagcagatggtattgcggcttacagcaactacaacaatgggcacagaaaattcctg gccgctgcccacaactctgctgatgagcccggtccaggagccccagccccccaggagctt ggggctgcagacaagcatggtgccgatcgtgcaggaccaccaggagcacccaggcttggg cagcaactgctgtgtgccattgttgtgctgggcttggctgccaagaagaaggagataaac atcaccatcctcaaagagctgctcaagaaatttttaaataagaaaccaagttatggggtt aatctcctacacaattcatttacttcctttgaatgttag >gi568815583r_30262296|GENSCAN_predicted_peptide_6|507_aa MYSKVLGLHIHSPLTDSPRATSSPASSTRKYPTQPALGCFLYPKKQNILMGSLMFQTAKA AQGVCSRASQQDNRLDRKLGSSRSPNSKSNTKLLQLPPPVDFAFLPHSPLTTERTRRTVR FGGRGIVGSSRKWRESGNRRETERIRKKSIVIDVSGMWDQRLVKLALLQHLRAFYGIKVK GVRGQCDRRRHETAATEIGGKIFGVPFNALPHSAVPEYGHIPSFLVDACTSLEEHIHTEG LFRKSGSVIRLKALKNKVDHGEGCLSSAPPCDIAGLLKQFFRELPEPILPADLHEALLKA QQLGTEEKNKAILLLSCLLADHTVHVLRYFFNFLRNVSLRSSENKMDSSNLAVIFAPNLL QTSEGHEKMSSNAEKKVRLQAAVVQTLIDYASDIGRVPDFILEKIPAMLGIDGLCATPSL EGFEEGEYETPGEYKRKRRQRVGDFVSGALNKFKPNRTPSITPQQERIDARVRSKHCLGV TVAIGIQPCLHGSSGLVGETDNKPVNE >gi568815583r_30262296|GENSCAN_predicted_CDS_6|1524_bp atgtacagtaaggtcctaggccttcacattcactcaccactcactgactcacccagagca acttccagtcctgcaagctccactcgtaagtaccctacgcagcccgccttgggatgtttc ttatatcccaagaaacagaatattttgatgggatcgctgatgtttcagactgcaaaagca gctcagggcgtttgcagtcgtgcaagtcaacaagataaccgtctggaccggaagctgggc 
tcctcccggtctcctaactccaaatccaacaccaagcttctgcagctgccacctcccgta gacttcgcatttcttccgcactctcctctcacgacggagcgaaccagacggacagtaagg tttggaggaagggggatcgttggaagtagcaggaagtggagagaatctggcaataggcga gaaaccgaaagaatcagaaagaagtctatagttatcgacgtatccggaatgtgggatcag aggctggtgaagttggccctgttgcagcatctgcgggccttctatggtattaaggtgaag ggtgtccgtgggcagtgcgatcgcaggagacatgaaacagcagccacggaaatagggggt aaaatatttggagtaccttttaatgcactgccccattctgctgtaccagaatatggacac attccaagctttcttgtcgatgcttgcacatctttagaagaacatattcataccgaaggg ctttttcggaaatcaggatctgtgattcgcctaaaagcactaaagaataaagtggatcat ggtgaaggttgcctatcttctgcacctccttgtgatattgcgggacttcttaagcagttt tttagggaactgccagagcccattctcccagctgatttgcatgaagcacttttgaaagct caacagttaggcacagaggaaaagaataaagctatactgttgctctcctgtcttctggct gaccacacagttcatgtattaagatacttctttaactttctcaggaatgtttctcttaga tccagtgagaataagatggatagcagcaatcttgcagtaatatttgcaccaaatcttctt cagacaagtgaaggacatgaaaagatgtcttctaacgcagaaaagaaggtacgattacag gctgcagtagtacagactcttatcgattatgcatcagatattgggcgtgtaccagacttt atcctggaaaagataccagccatgttgggtattgatggtctctgtgctactccatcactg gaaggctttgaagaaggtgaatatgaaactcctggtgaatataagagaaagagaagacaa cgtgtaggagattttgttagtggagcactaaataaatttaaacctaacagaacaccttct attacacctcaacaagaaagaattgatgcccgagtgcgctccaagcactgtctaggtgtc acagtggcgatcgggatacagccctgccttcatggatcttctgggctggtcggggagaca gacaataaaccagtcaacgaatga >gi568815583r_30262296|GENSCAN_predicted_peptide_7|60_aa MALKFPSLPQLPSRENQAVQVHTAQATAGISAQLFNLTAAAFCWLPQGVVQHRHNVKDSQ >gi568815583r_30262296|GENSCAN_predicted_CDS_7|183_bp atggccctgaagtttccaagcctgccacagctcccatctagagaaaaccaagctgtccag gtccacacagctcaagcaactgctggaatttctgctcaactctttaacctcacagctgct gctttctgctggcttccacagggtgtcgttcagcacaggcacaacgtcaaagatagccaa tga GENSCAN 1.0 Date run: 8-Nov-116 Time: 01:52:11 Sequence gi568815583r_30262296 : 200516 bp : 41.76% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. 
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init + 1809673 1809845  173  2  2   44   63   143 0.304   6.36
 1.02 Intr + 1831227 1831332  106  2  1   75   29    93 0.002   1.40
 1.03 Intr + 1835681 1835902  222  0  0   81   28   116 0.001   2.30
 1.04 Intr + 1837735 1837879  145  2  1   70  -29   129 0.013  -1.67
 1.05 Term + 1843029 1843249  221  0  2    7   38   435 0.548  26.62
 1.06 PlyA + 1843930 1843935    6                              -3.44

 2.09 PlyA - 1844064 1844059    6                               1.05
 2.08 Term - 1844335 1844170  166  2  1   86   42   152 0.384   6.81
 2.07 Intr - 1846750 1846597  154  1  1   81   99    66 0.659   5.21
 2.06 Intr - 1847526 1847451   76  2  1   78   44    81 0.805   0.87
 2.05 Intr - 1847860 1847732  129  0  0   87   66    76 0.714   5.27
 2.04 Intr - 1850150 1849922  229  0  1   76   79   149 0.879   9.75
 2.03 Intr - 1852743 1852537  207  1  0   76   19   188 0.465   7.97
 2.02 Intr - 1857816 1857592  225  1  0   27   80   266 0.947  15.88
 2.01 Init - 1863499 1863444   56  0  2   78   88    57 0.875   5.67
 2.00 Prom - 1867172 1867133   40                              -5.15

 3.03 PlyA - 1867358 1867353    6                               1.05
 3.02 Term - 1871659 1871374  286  2  1   74   52   177 0.080   6.59
 3.01 Init - 1882662 1882526  137  2  2   71   72   132 0.736   9.56
 3.00 Prom - 1883212 1883173   40                             -11.24

 4.00 Prom + 1883897 1883936   40                              -5.45
 4.01 Init + 1886961 1887031   71  1  2   51   81    37 0.031  -0.13
 4.02 Intr + 1891612 1891691   80  0  2  108  115    69 0.172   9.88
 4.03 Intr + 1895313 1895480  168  0  0   69   94   192 0.280  16.90
 4.04 Intr + 1896117 1896638  522  0  0    9   -1   508 0.078  25.99
 4.05 Intr + 1897274 1897360   87  2  0   70   82    89 0.283   5.52
 4.06 Intr + 1900931 1901040  110  2  2  122   79   182 0.484  19.78
 4.07 Term + 1905645 1906163  519  1  0  125   41   663 0.808  58.61
 4.08 PlyA + 1906574 1906579    6                               1.05

 5.03 PlyA - 1906678 1906673    6                               1.05
 5.02 Term - 1911066 1910891  176  0  2   75   49    85 0.356   0.24
 5.01 Init - 1916268 1915998  271  2  1   38   44   345 0.550  22.28
 5.00 Prom - 1916860 1916821   40                             -10.35

 6.00 Prom + 1917179 1917218   40                              -5.95
 6.01 Sngl + 1922167 1923033  867  2  0   88   49   736 0.985  65.34
 6.02 PlyA + 1923040 1923045    6                               1.05

 7.00 Prom + 1923575 1923614   40                              -6.15
 7.01 Sngl + 1923668 1926463 2796  0  0   44   47   959 0.607  79.96
 7.02 PlyA + 1926498 1926503    6                              -0.45

 8.05 PlyA - 1926827 1926822    6                               1.05
 8.04 Term - 1927886 1927606  281  2  2   99   42   149 0.635   6.02
 8.03 Intr - 1928275 1928074  202  0  1  117   55    36 0.572   1.14
 8.02 Intr - 1932092 1931955  138  1  0   71   71    37 0.391   0.04
 8.01 Init - 1935007 1934747  261  0  0   45  105   194 0.371  13.91
 8.00 Prom - 1938978 1938939   40                              -5.35

 9.00 Prom + 1940447 1940486   40                              -3.65
 9.01 Init + 1941919 1942072  154  2  1   79   76   120 0.490  10.09
 9.02 Intr + 1944344 1944443  100  2  1   44   94    42 0.141  -1.35
 9.03 Term + 1950288 1950537  250  2  1   55   48   169 0.424   3.79
 9.04 PlyA + 1951500 1951505    6                               1.05

10.03 PlyA - 1951887 1951882    6                               1.05
10.02 Term - 1954207 1954151   57  0  0   99   38    61 0.651  -1.09
10.01 Init - 1954745 1954590  156  1  0   25   67   150 0.770   6.56
10.00 Prom - 1967059 1967020   40                              -5.25

11.00 Prom + 1968148 1968187   40                              -4.35
11.01 Init + 1968682 1969029  348  2  0   49   45   197 0.052   8.73
11.02 Term + 1978539 1978799  261  1  0   42   54   172 0.127   3.74
11.03 PlyA + 1979160 1979165    6                               1.05

12.07 PlyA - 1980400 1980395    6                               1.05
12.06 Term - 1980759 1980637  123  2  0   80   54    86 0.377   1.80
12.05 Intr - 1981613 1981307  307  0  1   44   71   197 0.052   9.13
12.04 Intr - 1983746 1983678   69  0  0   31   80    96 0.063   0.48
12.03 Intr - 1986267 1986076  192  1  0   51   88    63 0.063   0.29
12.02 Intr - 1991105 1990916  190  2  1   56   57   131 0.036   4.52
12.01 Init - 2004081 2003925  157  2  1   56   47   142 0.200   7.02

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583r_30262296|GENSCAN_predicted_peptide_1|288_aa MGRGWKSLEDLEDRKIRESLELLRDWLNGCDQNADGNMDSENQADEVSDRNEEFIKNEQL NTLVCLAPFNLALGKLDLDFTGETKISGVARIQVNSGQWEWSPCIPALNVLLPGARGPCG NYNSIWGLSTLRGVATIPRSPSSPAACVAKYKSRGKEVECRAEVLPQRCPTCEQLGWGTD VPVNCLHAERIFILMRGSIRFSFASLLTDMPQQMPQEGGGGEEEGKEEEKEEEEKRRKNG GGVGGGGGGEGKEAEEKEEEEEEEEERRKGGGEEERRRRRRKRGGGGN >gi568815583r_30262296|GENSCAN_predicted_CDS_1|867_bp atgggcagaggttggaagagtttggaggacttagaagacaggaagattagggaaagtttg gaacttcttagagactggttaaatggttgtgaccagaatgctgatggcaatatggacagt gaaaaccaggctgatgaagtctcagatagaaatgaggaatttattaaaaacgagcagctg aacaccttggtctgccttgctcccttcaatctggcattgggcaaattggatctggatttc acaggggaaacaaaaatcagtggtgttgcccggatccaggtaaattctgggcagtgggag tggagcccctgcatacctgccctgaatgtccttctgccgggggccagagggccatgtggg aactataattccatctggggactttccacactcagaggtgtggccaccatccccaggtct cccagttcacctgcagcctgtgtggccaagtacaaatcccgaggcaaggaagttgagtgc agagctgaagtcctcccccagaggtgcccaacatgtgagcaactgggctggggcactgat gtccctgtcaactgcttacatgctgaacgcattttcattttaatgagaggcagcattcgt ttttcatttgcatccctgctcactgacatgccccagcagatgccacaagaaggaggcgga ggagaagaagaaggaaaagaggaggagaaggaggaggaggaaaaaaggaggaaaaatgga ggaggagttggaggaggtggaggaggagaaggaaaagaggcggaggagaaggaggaagag gaggaggaagaggaggagaggaggaaaggaggaggagaggaggaaaggaggaggaggagg aggaaaagaggaggaggaggaaattaa >gi568815583r_30262296|GENSCAN_predicted_peptide_2|413_aa MYDRFLSCTPHLLYQTLRRLHKVQKEGIKPQFPGYHLLALGECPVKGAWLQGAEDQELRA SRLNAVHGTFSKHRMRQTALTVSQRQSTHSGADRALLATKLGDTEPCGKGPEGVGAQSPA EWLSFLPGPASPPAQPRPAPLWKCTPQHIGPVEPQPMAVPNATRKTIRNQVYTLQGKFLH AARDDSPHAHLQPSLPQNSPMHNREHRRGCGSKLWAPAVHPIGQHFLLLVITAAIQRTKS CISPWDASLCRSLIITEPVTFTIITTQVRAPRFHELPWKVGMITAFMLVQCPVYVQTHHS RLTIVRGDPRTLSWGLDRNPFQVTVVAEKVGGAAKSIRIRQTWVCTKGLAETSGVSLFRL NQIMSDNPLKMIKTNLQRKLAAEEPAAEEPAASSSEHEEVNREMKPLSPPSRF >gi568815583r_30262296|GENSCAN_predicted_CDS_2|1242_bp 
atgtatgaccgattcctgagctgtactccacacctgctgtatcagaccctgaggagactt cataaagtccagaaagaaggcataaagccacaatttcctggctatcatctgctggcactg ggcgagtgcccagtaaagggggcttggctccagggtgctgaagaccaggagctgcgggca agtaggctgaacgcagtgcatgggaccttcagtaagcaccggatgaggcagacagcatta acagtcagccagcgacagtccactcactctggtgcagacagagcactgctggccaccaag ctgggggacaccgagccttgtggaaaaggcccagaaggggtgggtgcacagtccccagct gaatggctgagcttcctgccagggccggcgtccccaccagctcagcccagacctgccccg ctgtggaagtgtacccctcagcacatcggcccagtggagccccagcccatggctgtgccc aatgccacgagaaagactattagaaatcaggtatataccctgcaggggaagtttctgcat gcagctagagatgacagtcctcacgctcacctgcagccatctctgccacagaactctcca atgcacaacagggaacatagacgaggatgtgggtccaaactgtgggctccagcagtgcat cccataggccagcacttcctgctgttggtcataactgctgccattcagagaacaaagtct tgcatctcaccctgggatgcttctctgtgtcggtcactaattatcacagagccagtcacg ttcaccattattaccacacaggtcagggccccaagattccacgaactgccctggaaagtg ggcatgatcacagcctttatgctagtccagtgtcctgtgtatgtccagactcaccatagc agactcaccatagttcggggggatccaagaaccctctcttggggtctggatcggaaccct tttcaggtaaccgtagtggcagaaaaggtagggggagctgccaaaagcatccgcatcaga cagacctgggtttgcaccaagggtctggcagagactagcggcgtgagcttgttcagatta aatcagattatgtctgacaaccctctcaaaatgataaaaactaatctgcagagaaaactg gctgcagaggaaccggctgcagaggaaccagctgcttcctcctcggaacatgaagaggtg aacagagagatgaagcctctttctcctccctcacgtttctga >gi568815583r_30262296|GENSCAN_predicted_peptide_3|140_aa MEQNWMENDFDELTEVGFRRSAIRNFSKLKEHVRTHCKEAKNLEKRACSWHMNLADWTYW NESPTCVATVSMEILWTWPEGDFSPGALHSILPGAQSLTDQTTLHATTIGAGISWVLLDI SNQHQEAPLPSSVVNSHGRL >gi568815583r_30262296|GENSCAN_predicted_CDS_3|423_bp atggaacaaaactggatggagaatgactttgacgagttgacagaagtaggtttcagaagg tcggcaataagaaacttctccaagctaaaggagcatgttcgaacccattgcaaggaagct aagaaccttgaaaaaagagcatgttcatggcacatgaaccttgctgactggacatactgg aatgaatccccaacctgtgttgctaccgtgtctatggagatcctgtggacctggccggag ggggacttctctccaggtgccctgcactccattctgccaggtgcccagagcctcacagat cagaccactttgcatgccactacaattggtgctggaatttcctgggttctgctggacata tcaaaccagcaccaggaggcaccccttccctcatctgttgtgaattcccatggaagactc tga 
>gi568815583r_30262296|GENSCAN_predicted_peptide_4|518_aa MNPCSPTPVSTVMGVCQHRAAQHCADERFDATFHTNVLVNSSGHCQYLPPGIFKSSCYID VRWFPFDVQHCKLKFGSWSYGGWSLDLQMQEADISGYIPNGEWDLVGIPGKRSERFYECC KEPYPDVTFTVTMRRRTLYYGLNLLIPCVLISALALLVFLLPADSGEKISLGKRPSVWRE SETGDLLLRSALEGSQQTAQDSIRVPGDSLAHPMDLRAHGGSRTPEVPDSGSVLDGCVIL RILEDPQRMGDAQGGGQLHSALRGLCFLPSCHPTCSSMATGITVLLSLTVFMLLVAEIMP ATSDSVPLIAQYFASTMIIVGLSVVVTVIVLQYHHHDPDGGKMPKWTRVILLNWCAWFLR MKRPGEDKVRPACQHKQRRCSLASVEMSAVAPPPASNGNLLYIGFRGLDGVHCVPTPDSG VVCGRMACSPTHDEHLLHGGQPPEGDPDLAKILEEVRYIANRFRCQDESEAVCSEWKFAA CVVDRLCLMAFSVFTIICTIGILMSAPNFVEAVSKDFA >gi568815583r_30262296|GENSCAN_predicted_CDS_4|1557_bp atgaatccctgcagccctacccctgtttccacggttatgggtgtttgccagcacagggct gcccagcactgtgctgatgagcgctttgacgccacattccacactaacgtgttggtgaat tcttctgggcattgccagtacctgcctccaggcatattcaagagttcctgctacatcgat gtacgctggtttccctttgatgtgcagcactgcaaactgaagtttgggtcctggtcttac ggaggctggtccttggatctgcagatgcaggaggcagatatcagtggctatatccccaat ggagaatgggacctagtgggaatccccggcaagaggagtgaaaggttctatgagtgctgc aaagagccctaccccgatgtcaccttcacagtgaccatgcgccgcaggacgctctactat ggcctcaacctgctgatcccctgtgtgctcatctccgccctcgccctgctggtgttcctg cttcctgcagattccggggagaagatttccctgggtaagcgccccagtgtctggcgggag tctgagactggagaccttctgctgagatcagctctggagggctcacagcagacagcgcag gactccatcagggttcctggggattccctggctcatcccatggacctccgagcccacggt ggctccaggacaccagaggtccctgattcgggctccgtgctggacggctgtgtaatcctg agaatactggaggaccctcagaggatgggggatgcacagggagggggccagctccattct gccttgagaggcctgtgctttcttccctcctgccaccccacctgttcctcaatggcgact gggataacagtcttactctctcttaccgtcttcatgctgctcgtggctgagatcatgccc gcaacatccgattcggtaccattgatagcccagtacttcgccagcaccatgatcatcgtg ggcctctcggtggtggtgacagtgatcgtgctgcagtaccaccaccacgaccccgacggg ggcaagatgcccaagtggaccagagtcatccttctgaactggtgcgcgtggttcctgcga atgaagaggcccggggaggacaaggtgcgcccggcctgccagcacaagcagcggcgctgc agcctggccagtgtggagatgagcgccgtggcgccgccgcccgccagcaacgggaacctg ctgtacatcggcttccgcggcctggacggcgtgcactgtgtcccgacccccgactctggg gtagtgtgtggccgcatggcctgctcccccacgcacgatgagcacctcctgcacggcggg 
caaccccccgagggggacccggacttggccaagatcctggaggaggtccgctacattgcc aaccgcttccgctgccaggacgaaagcgaggcggtctgcagcgagtggaagttcgccgcc tgtgtggtggaccgcctgtgcctcatggccttctcggtcttcaccatcatctgcaccatc ggcatcctgatgtcggctcccaacttcgtggaggccgtgtccaaagactttgcgtaa >gi568815583r_30262296|GENSCAN_predicted_peptide_5|148_aa MWSVTINALRVEEGRIRDIAAQKSNNSKQQQQHRMMEPPATLFKALQPGGHREEGGVGLQ RTDSESVRPTPRVPPTQIADPSPHPTLPVSSSQGPEEGLNRVLGPALAGEAPATTVASAV LAHCCHLGGLGQRRDPQDPADRSSDCKV >gi568815583r_30262296|GENSCAN_predicted_CDS_5|447_bp atgtggtcagtgactatcaacgcgttgcgagtggaggaaggcaggatcagagatatcgct gcccaaaaaagcaacaatagcaaacagcagcagcaacacagaatgatggagcctcctgcc acactcttcaaagcgctgcagccaggaggacacagagaagaaggtggtgtgggcctgcaa aggacagactctgaatcagtgcgacccaccccgcgtgtccctccaacccaaatagcggat ccctctccccaccctacgctgccagtctcgagctcccagggcccagaggaagggctgaac cgtgtgttgggcccagcactggctggagaagcacctgcaacaactgtggcttctgctgtg ctggcccactgctgtcacctgggagggctgggtcaacgcagggacccccaagaccctgca gacaggtcctcggactgcaaagtctaa >gi568815583r_30262296|GENSCAN_predicted_peptide_6|288_aa MGKKQSRKTGNSKKQSTSPPPKDRSSSPAMEQSWTENDFDELREESFRRSNYELQEEIQT EGKEVKNFEKNLDECITRITNTDKCLKEQMELKAKTRELREECRSLRSRCNQLEERVSAM EDEMNEMKREGKFREKRIKRNEQSLQEIWDYVKRPNLRLIGVPESDGENGTKLENTLQDI IQENFPNLARQANVQIQEIQRTPQRYSSRRAPPRHIIVRFTKVEMKEKMLRAAREKGRVT HKAKPIRLTADLSAETASQKRVRANIQHSQRKEFSTQNFISSQTKLHK >gi568815583r_30262296|GENSCAN_predicted_CDS_6|867_bp atggggaaaaaacagagcagaaaaacgggaaactctaaaaagcagagcacctctcctcct ccaaaggatcgcagttcctcaccagcaatggaacaaagctggacagagaatgactttgac gagttgagagaagaaagcttcagacgatcaaactacgagctacaggaggaaattcaaacc gaaggcaaagaagttaaaaactttgaaaaaaatttagacgaatgtataactagaataacc aatacagacaagtgcttaaaggagcagatggagctgaaagccaagactcgagaattacgt gaagaatgcagaagcctcaggagccgatgcaatcaactggaagaaagggtatcagcgatg gaagatgaaatgaatgaaatgaagcgagaagggaagtttagagaaaaaagaataaaaaga aatgaacaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacgtctgatt ggtgtacctgaaagtgacggggagaatggaaccaagttggaaaacactctgcaggatatt atccaggagaacttccccaatctagcaaggcaggccaacgttcagattcaggaaatacag 
agaacaccacaaagatactcctcgagaagagcacctccaagacacataattgtcagattc accaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaaggtcgggttacc cacaaagcgaagcccatcagactaacagcggatctctcggcagaaactgcaagccagaag agagtgcgggccaatattcaacattctcaaagaaaagaattttcaacccagaatttcata tccagccaaactaagcttcataagtga >gi568815583r_30262296|GENSCAN_predicted_peptide_7|931_aa MGDFNTPLSTLDRSTRQKVNKDTQELNSALHQADLIDIYKTLHPKSTEYTFFSAPHHTYS KIDHIFGSKALFSKCKRTEIITNCLSDHSAIKLELRIKNLTQNRSTTWKLNNLLLNDYWV HNEMKAEIKMFFETNENKDTTYQNLWDAFKAVCTGKFIALNAHKRKQERSKMDTLTSQLK ELDKQEQTHSKASRRQEITKIRAELKEIETQKTLQNINESRSWFFERINKIDRPLARLIK KKRENQIDAIKNDKGDITTDPTEIQTTIREYYKHLYAIKLENLEEMDKFLDTYTLPRLNQ EEVESLNRPITGAEIVAIINSLPTKKSPGPDGFTAEFYQRYKDELIPFLLKLFQSIEKEG ILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKILANGIQQHIKKLIHHDQ VGFIPGMQGWFNIHKSINVIQHINRTKDKNHKIISIDAEKAFDKIQQPFMLKTLNKLGID GTYLKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAVRQEK EIKGIQLGKEEVKLSLFADDMIVYLDNPIVSAKNLLKLISNFNKVSGYKINVQKSQAFLY TNNRQTESQIMSELPFAIASNRIKYLGIHLTRDVKDLFKENYKPLLNEMKEDTNKWKNIP CSWVGRINIVKMATLPKVIYRFNAIPIKLPMTFFRELEKTTLKFIWNQKRACIAKSILSQ KNKAGGITLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEITPHIYNYLIFDKPEKN KQWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSRCIKDLNVRPKTIKTLEENLGI TIQDIGTGKDFMSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTIWEKIFATY SSDKGLISRIYNELQQIYKKKTTPSKSGQRT >gi568815583r_30262296|GENSCAN_predicted_CDS_7|2796_bp atgggagactttaacaccccactgtcaacattagacagatcaacgagacagaaagtcaac aaggatacccaggaattgaactcagctctccaccaagcagacctaatagacatctacaaa actctccaccccaaatcaacagaatatacatttttttcagcaccacaccacacctattcc aaaattgaccacatatttggaagtaaagctctcttcagcaaatgtaaaagaacagaaatt ataacaaactgtctctcagaccacagtgcaatcaaactagaactcaggattaagaatctc actcaaaaccgctcaactacatggaaactgaacaacctgctcctgaatgactactgggta cataacgaaatgaaggcagaaataaagatgttctttgaaaccaacgagaacaaagacaca acataccagaatctctgggatgcattcaaagcagtatgtacagggaaatttatagcacta aatgcccacaagagaaagcaggaaagatccaaaatggacaccctaacatcacaattaaaa gaactagataagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaaa 
atcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaacattaatgaatcc aggagctggttttttgaaagaatcaacaaaattgatagaccgctagcaagactaataaag aaaaaaagagagaatcaaatagacgcaataaaaaatgataaaggggatatcaccaccgat cccacagaaatacaaactaccatcagagaatactacaagcacctctacgcaattaaacta gaaaatctagaagaaatggataaattcctcgacacatacactctcccaagactaaaccag gaagaagttgaatctctgaatagaccaataacaggagctgaaattgtggcaataatcaat agcttaccaaccaaaaagagtccaggaccagatggattcacagccgaattctaccagagg tacaaggatgaactgataccattccttctgaaactattccaatcaatagaaaaagaggga atcctccctaactcattttatgaggccagcatcatcctgataccaaagccgggcagagac acaaccaaaaaagagaattttagaccaatatccctgatgaacatcgatgcaaaaatcctc aataaaatactggcaaacggaatccagcagcacatcaaaaagcttatccaccatgatcaa gtgggcttcatccctgggatgcaaggctggttcaatatacacaaatcaataaatgtaatc cagcatataaacagaaccaaagacaaaaaccacaagattatctcaatagatgcagaaaag gcctttgacaaaattcaacaacccttcatgctaaaaactctcaataaattaggtattgat gggacatatctcaaaataataagagctatctatgacaaacccacagccaatatcatactg aatgggcaaaaactggaagcattccctttgaaaactggcacaagacagggatgccctctc tcaccactcctattcaacatagtgttggaagttctggccagggcagttaggcaggagaag gaaataaagggtattcaattaggaaaagaggaagtcaaattgtccctgtttgcagacgac atgattgtatatctagataaccccattgtctcagccaaaaatctccttaagctgataagc aacttcaacaaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattcttatac accaataacagacaaacagagagccaaatcatgagtgaactcccattcgcaattgcttca aacagaataaaatacctaggaatccaccttacaagggacgtgaaggacctcttcaaggag aactacaaaccactgctcaatgaaatgaaagaggatacaaacaaatggaagaacattcca tgctcctgggtaggaagaatcaatatcgtgaaaatggccacactgcccaaggtaatttat agattcaatgccatccccatcaagctaccaatgactttcttcagagaattggaaaaaact actttaaagttcatatggaaccaaaaaagagcctgcatcgccaagtcaatcctaagccaa aagaacaaagctggaggcatcacgctacctgacttcaaactatactacaaggctacagta accaaaacagcatggtactggtaccaaaacagagatatagatcaatggaacagaacagag ccctcagaaataacgccgcatatctacaactatctgatctttgacaaacctgagaaaaac aagcaatggggaaaggattccctatttaataaatggtgctgggaaaactggctagctata tgtagaaagctgaaactagatcccttccttacaccttatacaaaaatcaattcaagatgc attaaagacttaaacgttagacctaaaaccataaaaaccctagaagaaaacctaggcatt 
accattcaggacataggcacgggcaaggacttcatgtctaaaacaccaaaagcaatggca acaaaagccaaaatagacaaatgggatctaattaaactaaagagcttctgcacagcaaaa gaaactaccatcagagtgaacaggcaacccacaatatgggagaaaattttcgcaacctac tcatctgacaaagggctaatatccagaatctacaatgaactccaacaaatttacaagaaa aaaacaaccccatcaaaaagtgggcaaaggacatga >gi568815583r_30262296|GENSCAN_predicted_peptide_8|293_aa MINMPLKQVHRTSEPGKAAWEMHRVMGNLESVQIQTRQVPAGQISQSQPDVWAPGVDMLS LSPDLVSVHHVSVFQAAVNLKAALTSLAQGALKLAPCSPELVSVSPEWASWNPLSLGGSQ GGSSSPVLPEPAQGHRNPLSLLVPIRWAGPSLLLKAQPETRLRGNQNSERCSEEGGLVLG PQGEKEGTATGCFMAPRPTEEISGWWAPAGDPSLFFRQPEPWAPLGDTQGRAWGNSVCPS ACIRRDPESHKDPRETKQERTTQQRLWKLDASRTTACKMGQDPCAKPKQVTAF >gi568815583r_30262296|GENSCAN_predicted_CDS_8|882_bp atgataaatatgccactcaagcaggttcatcgtacatcagagccagggaaggcagcttgg gaaatgcacagggtaatgggcaacttggaaagtgtgcaaatccagacaaggcaggtgcct gcaggacaaatcagccaaagccaaccagatgtctgggcacctggtgtggacatgctttcc ctcagcccagatttagtttcagtgcatcacgtctctgtgttccaagcagcagttaacctc aaggccgcactaacatctctggcccagggtgcactcaagctggcaccgtgttccccagag ctggtctcagtatctccagagtgggcgtcctggaatcctctcagcctgggaggttctcag ggtggctcgagcagccctgtcctcccagagcctgctcagggacacagaaaccccctctca ctgctggtcccaattagatgggcaggcccctccctgctgctcaaagcacaaccagaaacc aggctcagaggcaaccagaactccgaaaggtgttcagaggaaggcgggctggttttggga ccccagggggagaaggaaggcacagcaacagggtgctttatggccccaagaccaacagaa gaaatatctggttggtgggcacctgcaggggatccaagcctcttcttcagacagcctgag ccttgggccccactgggagacactcagggcagagcctggggaaactccgtctgcccctca gcctgcatcaggagggatcctgagagccacaaggacccccgagaaaccaagcaggagaga acaacacagcagcgactctggaaattagatgccagtagaaccacagcctgcaagatgggc caggacccatgtgccaaacctaaacaggtcactgccttctaa >gi568815583r_30262296|GENSCAN_predicted_peptide_9|167_aa MVSAFASGEGFRLLPPPVEGEGEQVYVVVRERAREGERESEREGEREGEREELTKTTDFI QPIPLSQSIGVLNVDRFPTQPQSSRWFANVWHGKMQAYFMSSCQKLGSILRGDYPTGTKE GISAIFIEPLIGPLIENAEQDLTLTRPVLNQPGRVLVPLRDSVLHPI >gi568815583r_30262296|GENSCAN_predicted_CDS_9|504_bp atggtgtcagcatttgcttctggtgagggcttcaggctgcttccaccaccagtggaaggt gaaggggagcaggtttatgttgtggtgagagagagagcgagagagggtgagagagagagc 
gagagagagggtgagagagagggcgagagagaggagctcacaaagaccacagattttatc cagccaataccactcagtcaaagcattggagttttaaacgtagacaggttcccaacacag ccacagtccagcagatggtttgcaaatgtttggcatgggaaaatgcaggcatatttcatg agtagttgtcagaaattaggatcaattttacgaggtgactatccaacagggaccaaggaa ggaatctctgccattttcatagagccactgattggcccactcattgaaaatgctgagcaa gacctcaccctcactcgcccggtcctgaaccagcctggcagagttcttgtcccactcaga gatagcgttctacatcctatctga >gi568815583r_30262296|GENSCAN_predicted_peptide_10|70_aa MPLGAVPSKCTDSTLFRESSASFSIPAVEFANLNTKALEGCCHLSQHTNDGDREEDISGE TVERAEENIN >gi568815583r_30262296|GENSCAN_predicted_CDS_10|213_bp atgcctctgggggctgttccatcgaaatgtactgattcaacacttttccgagaaagttct gcctcattcagcattcctgctgtggagtttgccaatctgaataccaaggccctggaaggc tgctgtcacctttctcagcacacaaatgatggggatagggaggaagacatatctggagag actgtggaaagagcagaagaaaacataaactga >gi568815583r_30262296|GENSCAN_predicted_peptide_11|202_aa MWKRLWNWVKSRGWNSLKGSEEERKTWEGLELPRDLLNGFDQNADNDMNNEVQVEVVSHG DEELVGNWSKGHCCYALAKRLVVFCPFPRDLWNFELERDDLGYLVEEISKQQSIQESTVW KVGKKEQLYVKKPDKDSLSQVTKVHISSGGPCWHLPNQRKNSHQFHSRDILPSMTVTPQI YQCHQKQENSENLSPPREPKEP >gi568815583r_30262296|GENSCAN_predicted_CDS_11|609_bp atgtggaagcggctttggaactgggtaaagagcagaggttggaacagtttgaagggctca gaagaagaaaggaagacgtgggaaggtttggaacttcctagagacttgttgaatggtttt gaccaaaatgctgataatgatatgaacaacgaagtccaggtggaggtggtctcacatgga gatgaagaacttgttgggaattggagtaaaggtcactgttgctatgctttagcaaagaga ctggtggtattttgccccttccctagagacctgtggaactttgaacttgagagagatgat ttagggtatctggtggaagaaatttctaagcagcaaagcattcaagagagtacggtgtgg aaagtgggaaaaaaagagcaactttacgtgaagaagcctgacaaagactccctgagccag gtgaccaaggtccacatcagcagtggtgggccatgctggcacctgcctaatcagagaaag aattcacatcaattccattcgagggacatcttaccaagtatgaccgttactcctcaaatt tatcagtgtcatcaaaaacaagaaaactcagagaatctgtcaccaccaagagagcctaag gagccatga >gi568815583r_30262296|GENSCAN_predicted_peptide_12|345_aa MSLETGGDHQVVLKPGEESNIGLKAKAYLWVQIGQRILCWTVRNSAVEHQAGKCMFLLIK KVNCKTASGRLSGVIPEEGTVILGSDSSMRVTAPEDLPVGQVVEVEDSDTDAPDPGPSGS ELVALSVPYAHMNEFCLNPCSPLWGTKYQPQLAGQMLRDSLSAKLFTDMVSFYLPHKPKS 
WKSILEVPKHISVAEDLESTANGTSSANGATDCPFSKMSHIEKNGLGPWRTTAIKQSIGS IHQIPDGGKKGKTEAHRGRYHERQFSRTQRGILTNSASFCTKILCSRQHFLHLKRELSRK SQDIQECTKRRESPAGGPEVEEAGMRRSRGGWKEPGEKWGQRECK >gi568815583r_30262296|GENSCAN_predicted_CDS_12|1038_bp atgtccctagaaactggaggagaccaccaagttgttctaaagccaggagaagaatctaac attggcctgaaagctaaagcctacctgtgggtacaaattggacaaaggatactttgctgg acagtcagaaattcagctgtggagcaccaggctggcaaatgtatgtttctacttataaaa aaggttaactgtaaaacagcctcaggcaggttgtcaggagttattccagaagaaggcact gttatcttaggaagtgacagctccatgcgtgttactgcccctgaagaccttccagtggga caagttgtggaggtggaagacagtgacactgatgctcctgaccctggtccctctggctct gagcttgtggctctatctgttccctatgcccacatgaatgagttctgcctcaatccctgc tccccactgtggggaaccaagtaccaacctcaacttgctgggcagatgctacgtgacagt ctaagtgctaagctctttacagacatggtctcattttatcttcctcataagccaaagagt tggaaatccatcctggaagttcccaagcacatcagtgttgcagaggaccttgaaagcaca gcaaatggtactagctcagccaatggggctactgactgtcccttctccaagatgagtcat attgagaagaacggcctgggcccatggagaacaactgccatcaaacaatcaattggcagc atccatcaaatacctgatggagggaaaaaaggaaagacagaggcacatcggggcagatat cacgaaagacagttttcaaggacacagagagggattctcaccaacagtgcttcattctgc acaaagattctatgcagcaggcagcattttctgcacctgaaacgagaactgtcacggaag agtcaggatattcaagaatgcaccaagagaagagaatctccagcaggaggcccagaggtg gaagaggcagggatgcgaagaagcagaggaggctggaaggaaccgggcgagaagtgggga cagcgggagtgtaagtga