GENSCAN 1.0    Date run: 4-Nov-116    Time: 07:21:08

Sequence gi568815581r:3713047_3922008 : 208962 bp : 51.42% C+G : Isochore 3 (51 - 57 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

1.05  PlyA -   1059   1054    6                                 1.05
1.04  Term -   1896   1801   96  0  0  114   48    27 0.896  -0.43
1.03  Intr -   3752   3642  111  2  0   81   86   179 0.873  18.08
1.02  Intr -   7356   7261   96  0  0   87   86    73 0.964   7.71
1.01  Init -   8084   8082    3  2  0   93   53     0 0.736  -3.07
1.00  Prom -   8552   8513   40                                -3.41

2.00  Prom +   8965   9004   40                                -7.10
2.01  Sngl +  10890  13286 2397  2  0  107   43  1508 0.999 139.44
2.02  PlyA +  13627  13632    6                                 1.05

3.44  PlyA -  13697  13692    6                                 1.05
3.43  Term -  14165  14151   15  2  0   84   34    -6 0.210  -7.88
3.42  Intr -  14980  14873  108  1  0  107   68   104 0.585  11.28
3.41  Intr -  15122  15059   64  1  1   88  103    25 0.960   3.21
3.40  Intr -  18137  18058   80  2  2   77  108    69 0.975   6.64
3.39  Intr -  19720  19584  137  2  2   83   42   100 0.935   5.70
3.38  Intr -  21903  21743  161  2  2   81   32   173 0.960  11.14
3.37  Intr -  26832  26759   74  0  2   87   70    67 0.977   3.50
3.36  Intr -  30571  30443  129  1  0   73   62   243 0.952  21.60
3.35  Intr -  32881  32718  164  2  2   51  107   113 0.989   9.71
3.34  Intr -  35006  34876  131  1  2   75   98   195 0.994  20.04
3.33  Intr -  37436  37306  131  2  2  119   70   205 0.997  21.80
3.32  Intr -  38828  38604  225  2  0  109   60   244 0.845  22.41
3.31  Intr -  40384  40244  141  1  0   41   75   197 0.999  14.76
3.30  Intr -  40975  40737  239  2  2  101   82   184 0.782  16.96
3.29  Intr -  42215  42071  145  2  1   96   68   247 0.971  23.87
3.28  Intr -  42851  42784   68  0  2   85   94    98 0.999   9.22
3.27  Intr -  44088  43938  151  0  1   65  100   271 0.999  26.25
3.26  Intr -  44813  44660  154  1  1   70   78   199 0.990  17.69
3.25  Intr -  46507  46356  152  1  2  100   70   141 0.993  12.97
3.24  Intr -  47241  47126  116  1  2   78   64   144 0.997  11.67
3.23  Intr -  48131  47967  165  0  0   82   85   363 0.944  35.85
3.22  Intr -  48474  48357  118  0  1  125   87    34 0.966   7.54
3.21  Intr -  48936  48869   68  1  2   82   97    31 0.492   2.52
3.20  Intr -  50914  50823   92  0  2   89   97    47 0.791   5.84
3.19  Intr -  64614  64494  121  1  1  124  105   105 0.755  15.86
3.18  Intr -  81610  81520   91  1  1   62   87     2 0.012  -2.43
3.17  Intr -  85018  84884  135  1  0   53   17   140 0.159   4.67
3.16  Intr -  92528  92493   36  2  0   80   89    32 0.025   1.44
3.15  Intr -  95434  95333  102  1  0   48   86    79 0.035   4.57
3.14  Intr -  96240  96135  106  2  1   81   50    29 0.029  -0.88
3.13  Intr - 100477 100234  244  2  1   77   91    70 0.262   3.29
3.12  Intr - 101437 101276  162  2  0  118   77    39 0.924   6.26
3.11  Intr - 103224 103070  155  2  2   71   84   121 0.914  10.23
3.10  Intr - 105526 105217  310  2  1   74   66   434 0.905  35.72
3.09  Intr - 108306 108203  104  2  2   69   28   112 0.971   3.62
3.08  Intr - 109006 108907  100  2  1   43  101    58 0.973   2.27
3.07  Intr - 111986 111896   91  2  1   54   94     6 0.544  -2.13
3.06  Intr - 112797 112721   77  1  2   58   88   123 0.990   9.03
3.05  Intr - 113169 113041  129  1  0   76   87    53 0.930   5.17
3.04  Intr - 116322 116197  126  1  0   90   95    48 0.877   6.86
3.03  Intr - 127159 127054  106  1  1  111   87   144 0.999  16.89
3.02  Intr - 130105 130040   66  1  0   80  116    56 0.993   7.19
3.01  Init - 133177 132995  183  1  0  108   80   375 0.994  35.79
3.00  Prom - 136463 136424   40                                -6.90

4.23  PlyA - 137233 137228    6                                 1.05
4.22  Term - 138994 138818  177  1  0   32   38   189 0.963   6.30
4.21  Intr - 144232 144128  105  1  0  124   90    28 0.821   7.51
4.20  Intr - 148101 147857  245  1  2   15   64   235 0.669  11.55
4.19  Intr - 148504 148152  353  0  2   57   52   157 0.695   4.52
4.18  Intr - 153222 152862  361  1  1   84   78   150 0.304   8.44
4.17  Intr - 155324 155159  166  2  1   61   21    91 0.485  -0.25
4.16  Intr - 156569 156441  129  2  0   94   83    95 0.984  11.00
4.15  Intr - 156842 156755   88  1  1  121   58   159 0.978  16.67
4.14  Intr - 159581 159508   74  2  2   93   64   131 0.999   9.90
4.13  Intr - 160416 160363   54  0  0  104  113    71 0.985  10.86
4.12  Intr - 163376 163177  200  0  2  106   57   350 0.999  33.29
4.11  Intr - 164516 164442   75  0  0  121   44    35 0.810   2.38
4.10  Intr - 167388 167300   89  2  2  115   74   209 0.849  22.31
4.09  Intr - 168668 168581   88  0  1   50   84    39 0.492  -0.77
4.08  Intr - 169324 169211  114  2  0   28   76   106 0.930   4.32
4.07  Intr - 169518 169482   37  0  1   92   92    47 0.929   3.82
4.06  Intr - 170129 169996  134  0  2   77   75   201 0.511  18.57
4.05  Intr - 170434 170383   52  1  1  105   60    43 0.998   2.17
4.04  Intr - 170918 170838   81  2  0   43   73   109 0.802   5.23
4.03  Intr - 171381 171334   48  0  0  122   97   110 0.995  14.66
4.02  Intr - 172684 172282  403  0  1  105   73   237 0.957  18.90
4.01  Init - 175679 175492  188  2  2   81   77   120 0.962   6.71
4.00  Prom - 178306 178267   40                                -3.91

5.17  PlyA - 183566 183561    6                                 1.05
5.16  Term - 184426 184397   30  1  0   95   41    32 0.634  -2.66
5.15  Intr - 185064 184963  102  0  0   83   95   127 0.918  13.77
5.14  Intr - 185503 185438   66  1  0   72  103   139 0.998  13.39
5.13  Intr - 185978 185888   91  1  1   88  113   201 0.993  23.10
5.12  Intr - 186715 186588  128  1  2   85   39   203 0.999  15.08
5.11  Intr - 190297 190156  142  0  1   90   86   165 0.992  17.36
5.10  Intr - 190585 190505   81  0  0   46   96   106 0.987   6.45
5.09  Intr - 190978 190882   97  2  1   49   34   253 0.957  15.57
5.08  Intr - 191353 191284   70  1  1   63  116    57 0.994   5.25
5.07  Intr - 191883 191812   72  0  0  116   78   143 0.999  16.20
5.06  Intr - 192321 192174  148  2  1   85   99   201 0.999  21.65
5.05  Intr - 198381 198330   52  1  1  113   44    12 0.248  -2.25
5.04  Intr - 199848 199694  155  2  2   86   59    60 0.768   3.13
5.03  Intr - 203290 203043  248  1  2   89  100   260 0.024  24.09
5.02  Intr - 205498 205372  127  0  1   56    9   232 0.014  13.29
5.01  Init - 208522 208509   14  1  2   97   92     3 0.040   1.03

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  96144  96285  142  2  1   43   44   172 0.883   8.56
S.002 Init - 203179 203043  137  1  2  104  100   297 0.969  32.08

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815581r:3713047_3922008|GENSCAN_predicted_peptide_1|101_aa
MLLKDVTELQILGEISFNKSLYEGLNAENHRTKITVVFLKDEKYHSLPIIIKGSVGGLLV
LIVILVILFKCGFFKRKYQQLNLESIRKAQLKSENLLEEEN

>gi568815581r:3713047_3922008|GENSCAN_predicted_CDS_1|306_bp
atgttactaaaagatgtaactgaactgcagatccttggtgaaatatctttcaacaaatct
ctatatgagggactgaatgcagagaaccacagaactaagatcactgtcgtcttcctgaaa
gatgagaagtaccattctttgcctatcatcattaaaggcagcgttggtggacttctggtg
ttgatcgtgattctggtcatcctgttcaagtgtggcttttttaaaagaaaatatcaacaa
ctgaacttggagagcatcaggaaggcccagctgaaatcagagaatctgctcgaagaagag
aattag

>gi568815581r:3713047_3922008|GENSCAN_predicted_peptide_2|798_aa
MAASLPGPGSRLFRTYGAADGRRQRRPGREAAQWFPPQDRRRFFNSSGSSDASIGDPSQS
DDPDDPDDPDFPGSPVRRRRRRPGGRVPKDRPSLTVTPKRWKLRARPSLTVTPRRLGLRA
RPPQKCSTPCGPLRLPPFPSRDSGRLSPDLSVCGQPRDGDELGISASLFSSLASPCPGSP
TPRDSVISIGTSACLVAASAVPSGLHLPEVSLDRASLPCSQEEATGGAKDTRMVHQTRAS
LRSVLFGLMNSGTPEDSEFRADGKNMRESCCKRKLVVGNGPEGPGLSSTGKRRATGQDSC
QERGLQEAVRREHQEASVPKGRIVPRGIDRLERTRSSRKSKHQEATETSLLHSHRFKKGQ
KLGKDSFPTQDLTPLQNVCFWTKTRASFSFHKKKIVTDVSEVCSIYTTATSLSGSLLSEC
SNRPVMNRTSGAPSSWHSSSMYLLSPLNTLSISNKKASDAEKVYGECSQKGPVPFSHCLP
TEKLQRCEKIGEGVFGEVFQTIADHTPVAIKIIAIEGPDLVNGSHQKTFEEILPEIIISK
ELSLLSGEVCNRTEGFIGLNSVHCVQGSYPPLLLKAWDHYNSTKGSANDRPDFFKDDQLF
IVLEFEFGGIDLEQMRTKLSSLATAKSILHQLTASLAVAEASLRFEHRDLHWGNVLLKKT
SLKKLHYTLNGKSSTIPSCGLQVSIIDYTLSRLERDGIVVFCDVSMDEDLFTGDGDYQFD
IYRLMKKENNNRWGEYHPYSNVLWLHYLTDKMLKQMTFKTKCNTPAMKQIKRKIQEFHRT
MLNFSSATDLLCQHSLFK

>gi568815581r:3713047_3922008|GENSCAN_predicted_CDS_2|2397_bp
atggcggcttcgctcccgggacctgggagccggcttttccgcacatatggggctgcggac
ggcaggagacagcggcggccgggccgggaagccgcgcagtggttcccgccgcaggaccgg
aggcgtttcttcaacagcagcggcagcagcgacgccagcatcggcgacccctcgcagtcc
gacgatcctgacgatcccgacgaccccgacttccccggcagcccggtgaggcggcggcgg
aggcgtcccggcggccgagtgcccaaggaccggcccagcctgaccgtgaccccaaagcgc tggaagctgcgagctcgcccaagcctaaccgtgaccccaagacgcctggggctgcgagct cggcccccgcagaagtgcagcacaccctgcggcccgctccgacttccgcccttccccagc cgcgactccggccgcctcagcccggacctcagcgtgtgcggccagcccagggacggcgac gagctgggcatcagtgcctccctgttcagctctctggcctcgccctgccccgggtcccca acgccaagggacagtgtcatctcgatcggcacctccgcctgtctggttgcagcctcagcc gtcccgagcggcctccacctcccagaagtctccctggaccgagcatctctcccctgctcc caggaggaagcgacaggaggagccaaggacaccaggatggtccaccaaacccgcgccagc ctcaggtcagttctctttggccttatgaactcaggaacccctgaggattctgagtttcgg gcagatgggaagaatatgagagagtcctgctgtaaaaggaaactggtggtgggaaatgga ccagagggtccaggtctgtcaagcacaggcaagaggagggccacaggccaggactcttgt caagagagagggcttcaagaggccgtccggagagagcatcaggaggccagtgttcccaag ggccgcattgtgccaaggggaatagacaggctggagagaactagatcaagccggaagagc aaacatcaggaggcaacggaaacctctctcctccattcccaccgctttaaaaagggccaa aagctgggaaaagattcgttccccacccaggacctgactcctttacagaatgtctgcttt tggaccaaaaccagggcttccttcagtttccacaagaagaaaattgtgactgatgtgtca gaggtctgcagcatctataccactgccacttctctctctggatccctcctatcagaatgt tcaaaccggcctgtcatgaacagaacaagtggtgctccgtcctcttggcactcctcctct atgtatttgctaagccccttaaacactctaagtatttcaaacaaaaaggcatctgatgct gaaaaggtttatggggaatgcagtcagaagggtcctgtcccctttagccattgccttccc acagaaaaactgcaacgctgtgagaagattggggaaggggtgtttggcgaagtgtttcaa acaattgctgatcacacacccgtagccataaaaatcattgctattgaaggaccagattta gtcaatggatcccatcagaaaacctttgaggaaatcctgccagagatcatcatctccaaa gagttgagcctcttatccggtgaagtgtgcaaccgcacagaaggctttatcgggctgaac tcagtgcactgtgtccagggatcttaccctcccttgctcctcaaagcctgggatcactat aattcaaccaaaggctctgcaaatgaccggcctgatttttttaaagacgaccagctcttc attgtgctggaatttgagtttggagggattgacttagagcaaatgcgaaccaagttgtct tccttggctactgcaaagagcattctacaccagctcacagcctccctcgcagtggcagag gcatcactgcgctttgagcaccgagacttacactgggggaacgtgctcttaaagaaaacc agcctcaaaaaactccactacaccctcaatgggaagagcagcactatccccagctgtggg ttgcaagtgagcatcattgactacaccctgtcgcgcttggaacgggatgggattgtggtt ttctgtgacgtttccatggatgaggacctgtttaccggtgacggtgactaccagtttgac 
atctacaggctcatgaagaaggagaataacaaccgctggggtgaatatcacccttatagt aatgtgctctggttacattacctgacagacaagatgctgaaacaaatgaccttcaagact aaatgtaacactcctgccatgaagcaaattaagagaaaaatccaggagttccacaggaca atgctgaacttcagctctgccactgacttgctctgccagcacagtctgtttaagtaa >gi568815581r:3713047_3922008|GENSCAN_predicted_peptide_3|1823_aa MAAVRGLRVSVKAEAPAGPALGLPSPEAESGVDRGEPEPMEVEEGELEIVPVRRSLKELI PDTSRRYENKAGSFITGIDVTSKEAIEKKEQRAKRFHFRSEVNLAQRNVALDRDMMKKAI PKVRLETIYICGVDEMSTQDVFSYFKEYPPAHIEWLDDTSCNVVWLDEMTATRALINMSS LPAQDKIRSRDASEDKSAEKRKKDKQEDSSDDDEAEEGEVEDENSSDVEVEEESLLRNDL RPANKLAKGNRLFMRFATKDDKKELGAARRSQYYMKYGNPNYGGMKGILSNSWKRRYHSR RIQRDVIKKRALIGDDVGLTSYKHRHSGLVNVPEEPIEEEEEEEEEEEEEEEEDQDMDAD DRVVVEYHEELPALKQPRERSASRRSSASSSDSDEMDYDLELKMISTPSPKKSMKMTMYA DEVESQLKNIRNSMRADSVSSSNIKNRIGNKLPPEKFADVRHLLDEKRQHSRPRPPVSST KSDIRQRLGKRPHSPEKAFSSNPVVRREPSSDVHSRLGVPRQDSKGLYADTREKKSGRDL TCPPTVRIPCWVQTLPFAGRGCGPGNEILPAQGCTVVAGSTDCWRFLHSRCWQCCCPLSA TLVLSADCESEMPSHPNRNHQVVFPSSCIIFHSTSSVRGFQFLHISTNTHYFLEAVPNFQ SQGIGYFYVSHGTATEESQLSPGFSSKSPTDLRPEDANMVWYKWRNAGIQSREAHGGGGE ERGDLSAGLGELGVPAEGSGEPQKFWIRGGAAGTVRPSSLPRGLDLEPLRVYVGCLALLA AFNVDVARPWLTPKGGAPFVLSSLLHQDPSTNQTWLLVTSPRTKRTPGPLHRCSLVQDEI LCHPVEHVPIPKGRHRGVTVVRSHHGVLICIQVLVRRPHSLSSELTGTCSLLGPDLRPQA QANFFDLENLLDPDARVDTGDCYSNKEGGGEDDVNTARQRRALEKEEEEDKEEEEDEEEE EAGTEIAIILDGSGSIDPPDFQRAKDFISNMMRNFYEKCFECNFALVQYGGVIQTEFDLR DSQDVMASLARVQNITQVGSVTKTASAMQHVLDSIFTSSHGSRRKASKVMVVLTDGGIFE DPLNLTTVINSPKMQGVERFAIGVGEEFKSARTARELNLIASDPDETHAFKVTNYMALDG LLSKLRYNIISMEGTVGDALHYQLAQIGFSAQILDERQVLLGAVGAFDWSGGALLYDTRS RRGRFLNQTAAAAADAEAAQYSYLGHFEGLTGKVPGPRERPGGSTRCVHSHTTLAPGYAV AVLHKTCSLSYIAGAPRYKHHGAVFELQKEGREASFLPVLEGEQMGSYFGSELCPVDIDM DGSTDFLLVAAPFYHVHGEEGRVYVYRLSEQDGSFSLARILSGHPGFTNARFGFAMAAMG DLSQDKLTDVAIGAPLEGFGADDGASFGSVYIYNGHWDGLSASPSQRIRASTVAPGLQYF GMSMAGGFDISGDGLADITVGTLGQAVVFRSRPVVRLKVSMAFTPSALPIGFNGVVNVRL CFEISSVTTASESGLREALLNFTLDVDVGKQRRRLQCSDVRSCLGCLREWSSGSQLCEDL LLMPTEGELCEEDCFSNASVKVSYQLQTPEGQTDHPQPILDRYTEPFAIFQLPYEKACKN 
KLFCVAELQLATTVSQQELVVGLTKELTLNINLTNSGEDSYMTSMALNYPRNLQLKRMQK VTVTVAVETGRPYEFLRVGIHISPLLFSAAPSQKHPFDLVFKFLGDLLLTLPVRLAHVSV VWQLEENAFPNRTADITVTVTKPSIMYVNTGQGLSHHKEFLFHVHGENLFGAEYQLQICV PTKLRGLQVVAVKKLTRTQLSHL >gi568815581r:3713047_3922008|GENSCAN_predicted_CDS_3|5472_bp atggcggccgtacggggcctgcgggtgtcggtgaaggcggaggccccggcggggccggcc ctggggctcccgtcccctgaggcggagtccggtgttgaccgtggcgagccggagcccatg gaggtggaggagggcgagctggaaatcgtgcctgtgcggcgctcgctcaaggaactgatc ccggacacgagcagaagatatgaaaacaaggctggcagcttcatcactggaattgatgtc acctccaaggaagcaattgaaaagaaagagcagcgagccaagcgcttccattttcgatcg gaagtaaatcttgcccaaagaaatgtagccttggaccgagacatgatgaagaaagcaatc cccaaggtgagactggagacaatctatatttgcggagtagatgagatgagcacccaagat gtcttttcctattttaaagaatatcctccagctcacatcgaatggttggatgatacctcc tgtaatgtagtttggctggatgaaatgacagccacacgagcacttatcaatatgagctcc ctgcctgcacaggataagatcagaagcagggatgccagtgaggacaagtcagctgagaaa aggaaaaaagacaagcaggaagacagttcagatgatgatgaagctgaagaaggagaggtt gaagatgagaactcaagtgatgtagaggtagaagaggagtctttgttaagaaacgatctt cgtccagctaacaaacttgctaaaggaaataggttattcatgagatttgctacaaaagat gacaaaaaggaacttggagcagccagaagaagtcagtattacatgaaatatgggaatcca aattatggaggcatgaaaggaattcttagcaattcatggaagcgaagatatcattcccgt cgtattcagcgggacgtgatcaagaagagagccctgattggggatgacgttggcttgacg tcgtataaacatcgacattctgggctagtgaatgttcccgaggaacccattgaagaggag gaagaggaggaggaggaggaagaggaagaggaagaagaagaccaggacatggatgcagat gacagagtggtggtagagtaccacgaggagctcccggctctcaagcagccccgggagcgg agcgcgtctagacgatccagtgccagcagctcagactcagatgaaatggactatgatcta gaactgaaaatgatttccacgccttcaccaaagaaaagcatgaaaatgactatgtatgct gacgaagtggaatctcagttgaaaaatattaggaactccatgagggcagatagtgtatct tcaagcaatatcaaaaaccgaattggtaacaaattaccacctgagaaatttgcagatgtc cgacatctattagatgagaaacgtcagcactcccgtccacggccaccagtcagcagtact aaatcagatatacgccagcggttaggaaaaagaccacattctccggaaaaggcttttagt agtaaccccgtcgttcggagagagccctcttctgatgtgcatagtaggctaggtgttccc aggcaggatagtaaaggcctctacgccgatactcgggagaagaaatcagggagggacctg acctgtccccccacggtgagaatcccgtgctgggtccagacccttccgtttgcgggcagg 
ggctgcggcccgggaaatgagattttgcctgcccaaggctgcactgtggtggcaggctcc acagactgctggcgtttcctgcacagcaggtgttggcagtgctgctgccccctcagtgct accctggttcttagcgctgactgcgaaagtgaaatgccatctcatccaaacaggaaccac caggtggtttttccaagcagctgtatcattttccattccaccagcagtgtgcgagggttc cagtttctccacatctccaccaacactcattattttctggaggcagtccccaacttccag agccagggtataggctacttctatgtgtcccatggtactgccacagaagagtctcagctc tctccaggattcagttctaagtcccccacagatctgaggccggaggatgccaacatggtg tggtataagtggaggaatgcgggcatccagtcccgagaggcccacggtggaggtggtgag gaacgtggggacctgagtgccggcctgggggagcttggagttcctgctgagggcagtggg gagccacagaagttctggattcgtgggggagcagcgggaacagtacggccttcttccctg cctaggggtcttgatctggaacctttgagagtctatgtaggctgcctggccctgctggcc gctttcaatgtggatgtggcccggccctggctcacgcccaagggaggtgcccctttcgtg ctcagctcccttctgcaccaagaccccagcaccaaccagacctggctcctggtcaccagc cccagaaccaagaggacaccagggcccctccatcgatgttcccttgtccaggatgaaatc ctttgccatcctgtagagcatgtccccatccccaaggggaggcaccggggagtgaccgtt gtccggagccaccacggtgttttgatatgcattcaagtgctggtccggcggcctcacagc ctcagctcagaactcacaggcacctgtagcctcctgggccctgacctccgtccccaggct caggccaacttcttcgaccttgaaaatctcctggatccagatgcacgtgtggacactgga gactgctacagcaacaaagaaggcggtggagaagacgatgtgaacacagccaggcagcgc cgggctctggagaaggaggaggaggaagacaaggaggaggaggaagacgaggaggaggag gaagctggcaccgagattgccatcatcctggatggctcaggaagcattgatcccccagac tttcagagagccaaagacttcatctccaacatgatgaggaacttctatgaaaagtgtttt gagtgcaactttgccttggtgcagtatggaggagtgatccagactgagtttgaccttcgg gacagccaggatgtgatggcctccctcgccagagtccagaacatcactcaagtggggagt gtcaccaagactgcctcagccatgcaacacgtcttagacagcatcttcacctcaagccac ggctccaggagaaaggcatccaaggtcatggtggtgctcaccgatggtggcatattcgag gaccccctcaaccttacgacagtcatcaactcccccaaaatgcagggtgttgagcgcttt gccattggggtgggagaagaatttaagagtgctaggactgcgagggaactgaacctgatc gcctcagacccggatgagacccatgctttcaaggtgaccaactacatggcgctggatggg ctgctgagcaaactgcggtacaacatcatcagcatggaaggcacggttggagacgccctt cactaccagctggcacagattggcttcagtgctcagatcctggatgagcggcaggtgctg ctcggcgccgtcggggcctttgactggtccggaggggcgttgctctacgacacacgcagc 
cgccggggccgcttcctgaaccagacagcggcggcggcggcagacgcggaggctgcgcag tacagctacctgggccactttgaaggcctgactgggaaggtgccggggccaagagagagg ccaggtgggagcacacggtgtgttcacagccacaccacccttgccccaggttacgctgtg gccgtgctgcacaagacctgcagcctctcctacatcgcgggggctccacggtacaaacat catggggccgtgtttgagctccagaaggagggcagagaggccagcttcctgccagtgctg gagggagagcagatggggtcctattttggctctgagctgtgccctgtggacattgacatg gatggaagcacggacttcttgctggtggctgctccattttaccacgttcatggagaagaa ggcagagtctacgtgtaccgtctcagcgagcaggatggttctttctccttggcacgcata ctgagtgggcaccccgggttcaccaatgcccgctttggctttgccatggcggctatgggg gatctcagtcaggataagctcacagatgtggccatcggggcccccctggaaggttttggg gcagatgatggtgccagcttcggcagtgtgtatatctacaatggacactgggacggcctc tccgccagcccctcgcagcggatcagagcctccacggtggccccaggactccagtacttc ggcatgtccatggctggtggctttgatattagtggcgacggccttgccgacatcaccgtg ggcactctgggccaggcggttgtgttccgctcccggcctgtggttcgcctgaaggtctcc atggccttcacccccagcgcactgcccatcggcttcaacggcgtcgtgaatgtccgttta tgttttgaaatcagctctgtaaccacagcctctgagtcaggcctccgcgaggcacttctc aacttcacgctggatgtggatgtggggaagcagaggagacggctgcagtgttcagacgta agaagctgtctgggctgcctgagggagtggagcagcggatcccagctttgtgaggacctc ctgctcatgcccacagagggagagctctgtgaggaggactgcttctccaatgccagtgtc aaagtcagctaccagctccagacccctgagggacagacggaccatccccagcccatcctg gaccgctacactgagccctttgccatcttccagctgccctatgagaaggcctgcaagaat aagctgttttgtgtcgcagaattacagttggccaccaccgtctctcagcaggagttggtg gtgggtctcacaaaggagctgaccctgaacattaacctaactaactccggggaagattcc tacatgacaagcatggccttgaattaccccagaaacctgcagttgaagaggatgcaaaag gtgactgtgacggtggctgtggaaacaggaagaccttacgaattcctccgagttggaatt catatttctccactgctgttctctgcagctccctcacagaagcaccccttcgaccttgta tttaaattcctcggggatctactgctgaccctgcctgtccggcttgctcatgtttcagtc gtttggcagctagaggagaatgcctttccaaacaggacagcagacatcactgtgactgtc accaaaccatccataatgtacgtgaacacaggccaggggctttctcaccacaaagaattc ctcttccatgtacatggggagaacctctttggagcagaataccagttgcaaatttgcgtc ccaaccaaattacgaggtctccaggttgtagcagtgaagaagctgacgaggactcagttg tctcatctgtag >gi568815581r:3713047_3922008|GENSCAN_predicted_peptide_4|1086_aa 
MGSLTSALAAAFRPPAHPSGPQRVPSNHNSSRTTTTHFYGLLSGRHCSERLTGVPSSASH HVRFPTRLRRRTPLTEAMEGGPAVCCQDPRAELVERVAAIDVTHLEEADGGPEPTRNGVD PPPRARAASVIPGSTSRLLPARPSLSARKLSLQERPAGSYLEAQAGPYATGPASHISPRA WRRPTIESHHVAISDAEDCVQLNQYKLQSEIGKVGLTDAYLQGAYGVVRLAYNESEDRHY AMKVLSKKKLLKQYGFPRRPPPRGSQAAQGGPAKQLLPLERVYQEIAILKKLDHVNVVKL IEVLDDPAEDNLYLALQNQAQNIQLDSTNIAKPHSLLPSEQQDSGSTWAARSGRDLGIGC FASQLHLTFLSFLSVFDLLRKGPVMEVPCDKPFSEEQARLYLRDVILGLEYSSWGPLVQM KVLAKEGAHRSMVAGWVHCQKIVHRDIKPSNLLLGDDGHVKIADFGVSNQFEGNDAQLSS TAGTPAFMAPEAISDSGQSFSGKALDVWATGVTLYCFVYGKCPFIDDFILALHRKIKNEP VVFPEEPEISEELKDLILKMLDKNPETRIGVPDIKLHPWVTKNGEEPLPSEEEHCSVVEV TEEEVKNSVRLIPSWTTVRPRINQAVPASPPDPVYQLDGACISPVPTYQLDGACFSPRAH VSVRRRLLLPQSPPWPCGPFMGWSRPPPHVPVASGSSPFQEYAPSCQRPVFLPLPFHAVL IEMGPALRPPWQLLSGNRSRGCVSGSLGPVLLTLPCPPQILVKSMLRKRSFGNPFEPQAR REERSMSAPGNLLVTAQDYLAAKWMELTFLDCVSHSALSGKWTERNQALSRGLKRKHHIV PAHLTLSLDDAPRAEAGEDTRADFDRGPWTRQACGPAHAEITGTQPLLLHRPTHLGPRAA LTLLQRRTNCPGGLCGNCLTLRTSAVSLLRAGVLERVQTQLFRSSLHLLSGISVRLADVA FADRMLIEVDPHRVTQDSDTALETWMDKGFWPQAQPFRRDTASPQSLPLLVKLDPALQGL FSSWKNPEGGSERRNGPALLHSKEDVQEAAGHAHLELEGEARTGNGDAGALSVWTAVELE AMRVMS >gi568815581r:3713047_3922008|GENSCAN_predicted_CDS_4|3261_bp atggggagcctgacgtcagccctcgcggccgccttccgcccgcccgcgcatccatctggg cctcagcgtgtcccgagcaatcacaacagcagccgcacaacaacaactcacttttacggc ctccttagtggcaggcactgttctgagcgccttacgggcgttccctcctcagcatctcac cacgtgcggttcccaacaaggctacgcagaagaacccccttgactgaagcaatggagggg ggtccagctgtctgctgccaggatcctcgggcagagctggtagaacgggtggcagccatc gatgtgactcacttggaggaggcagatggtggcccagagcctactagaaacggtgtggac cccccaccacgggccagagctgcctctgtgatccctggcagtacttcaagactgctccca gcccggcctagcctctcagccaggaagctttccctacaggagcggccagcaggaagctat ctggaggcgcaggctgggccttatgccacggggcctgccagccacatctccccccgggcc tggcggaggcccaccatcgagtcccaccacgtggccatctcagatgcagaggactgcgtg cagctgaaccagtacaagctgcagagtgagattggcaaggtggggctgactgatgcctat ctgcagggtgcctacggtgtggtgaggctggcctacaacgaaagtgaagacagacactat gcaatgaaagtcctttccaaaaagaagttactgaagcagtatggctttccacgtcgccct 
cccccgagagggtcccaggctgcccagggaggaccagccaagcagctgctgcccctggag cgggtgtaccaggagattgccatcctgaagaagctggaccacgtgaatgtggtcaaactg atcgaggtcctggatgacccagctgaggacaacctctatttggccctgcagaaccaggcc cagaatatccagttagattcaacaaatatcgccaagccccactccctgcttccctctgag cagcaagacagtggatccacgtgggctgcgcgctcagggagggaccttggcatcggctgc tttgccagccagctacaccttaccttcttgtcttttctttcagtgtttgacctcctgaga aaggggcccgtcatggaagtgccctgtgacaagcccttctcggaggagcaagctcgcctc tacctgcgggacgtcatcctgggcctcgagtactcatcctggggtccgttggtccagatg aaggtacttgccaaggagggagcccacaggtcgatggtcgcgggatgggtgcactgccag aagatcgtccacagggacatcaagccatccaacctgctcctgggggatgatgggcacgtg aagatcgccgactttggcgtcagcaaccagtttgaggggaacgacgctcagctgtccagc acggcgggaaccccagcattcatggcccccgaggccatttctgattccggccagagcttc agtgggaaggccttggatgtatgggccactggcgtcacgttgtactgctttgtctatggg aagtgcccattcatcgacgatttcatcctggccctccacaggaagatcaagaatgagccc gtggtgtttcctgaggagccagaaatcagcgaggagctcaaggacctgatcctgaagatg ttagacaagaatcccgagacgagaattggggtgccagacatcaagttgcacccttgggtg accaagaacggggaggagccccttccttcggaggaggagcactgcagcgtggtggaggtg acagaggaggaggttaagaactcagtcaggctcatccccagctggaccacggtgcgcccg cgtatcaatcaggcagtgcctgcgtctcccccagatcctgtgtatcagttagacggtgcc tgcatctcccccgtgcccacgtatcagttagacggcgcctgcttctcccccagagcccac gtatcagttagacggcgcctgcttctcccccagagcccaccctggccttgcggacccttc atgggctggtcccggccccctcctcatgtaccagtggcatccggctcctcaccattccag gaatatgcccccagctgccagcgccccgtgttcttgcctctgccatttcatgctgtgctg attgagatgggacccgcactgcggcccccttggcagctgctctcggggaatcggagcaga ggctgcgtgtctgggagcctgggacctgtgctcctcacgctgccttgtcctcctcagatc ctggtgaagtccatgctgaggaagcgttcctttgggaacccgtttgagccccaagcacgg agggaagagcgatccatgtctgctccaggaaacctactggtcacagcccaagattatttg gcagccaagtggatggaactaactttcctggactgtgtttcgcattcggcgttatctgga aagtggactgaacggaatcaagctctgagcagaggcctgaagcggaagcaccacatcgtc cctgcccatctcactctctcccttgatgatgcccctagagctgaggctggagaagacacc agggctgactttgaccgagggccatggacgcgacaggcctgtggccctgcgcatgctgaa ataactggaacccagcctctcctcctacaccggcctacccatctgggcccaagagctgca 
ctcacactcctacaacgaaggacaaactgtccaggtggcctctgcggcaattgcctcacc ctgaggacatcagcagtcagcctgctcagagcgggggtgctggagcgcgtgcagacacag ctcttccggagcagccttcaccttctctctgggatcagtgtccggctggccgacgtggca tttgctgaccgaatgctcatagaggttgacccccacagggtcacgcaggactcggacact gccctggaaacatggatggacaagggcttttggccacaggcccagccattccggcgggac actgcctcccctcaatccctgcccctgcttgtcaagcttgaccccgccctccaaggcctg ttcagcagctggaaaaatccagagggagggagcgaacgcaggaatggacccgccttgcta cattcaaaggaagatgtccaggaggcagctggacatgcgcatctggagctcgagggagag gcgagaactggaaatggagacgcgggggccctcagcgtgtggacggccgtggagttggaa gccatgcgtgtgatgagctag >gi568815581r:3713047_3922008|GENSCAN_predicted_peptide_5|540_aa MGRPSPSICPSKSVSPSVNLSVSKCGSLEEEEEEEEEEEEEEEEDMVGPGSPQKLYHRRG WWHPAHPESRGRAGGSVLSPSRPTMARRFQEELAAFLFEYDTPRMVLVRNKKVGVIFRLI QLVVLVYVIGPLEITCSEISKWRSECTNQGEGWIEIIGGRNTSRDRFCSIFSMPGAVPGN RARNTWHSQSFRRLKRSGKWVFLYEKGYQTSSGLISSVSVKLKGLAVTQLPGLGPQVWDV ADYVFPAQGDNSFVVMTNFIVTPKQTQGYCAEHPEGGICKEDSGCTPGKAKRKAQGIRTG KCVAFNDTVKTCEIFGWCPVEVDDDIPRPALLREAENFTLFIKNSISFPRFKVNRRNLVE EVNAAHMKTCLFHKTLHPLCPVFQLGYVVQESGQNFSTLAEKGGVVGITIDWHCDLDWHV RHCRPIYEFHGLYEEKNLSPGFNFRFARHFVENGTNYRHLFKVFGIRFDILVDGKAGKFD IIPTMTTIGSGIGIFGVATVLCDLLLLHILPKRHYYKQKKFKYAEDMGPGAICALRCGSQ >gi568815581r:3713047_3922008|GENSCAN_predicted_CDS_5|1623_bp atggggcggccaagtccatcaatctgtccaagcaagtctgtcagtccatccgtcaatctc tctgtcagcaagtgtggctccttggaggaggaggaggaggaggaggaggaggaggaggag gaggaggaagaagacatggtggggccaggaagcccccagaagctctaccatcgacgtggg tggtggcacccggctcaccctgagagcagaggccgtgcagggggctcagttctgagcccc agccggcccaccatggcacggcggttccaggaggagctggccgccttcctcttcgagtat gacaccccccgcatggtgctggtgcgtaataagaaggtgggcgttatcttccgactgatc cagctggtggtcctggtctacgtcatcggacctcttgaaatcacatgttctgagatctcc aagtggagatcagagtgcaccaatcagggagaaggatggatagaaattattggggggaga aacacatccagggaccgtttctgcagcatcttctccatgcctggtgctgtgccggggaac agagcacgtaacacctggcacagccagagcttccgacgtctcaagagaagtgggaagtgg gtgtttctctatgagaagggctaccagacctcgagcggcctcatcagcagtgtctctgtg aaactcaagggcctggccgtgacccagctccctggcctcggcccccaggtctgggatgtg 
gctgactacgtcttcccagcccagggggacaactccttcgtggtcatgaccaatttcatc gtgaccccgaagcagactcaaggctactgcgcagagcacccagaagggggcatatgcaag gaagacagtggctgtacccctgggaaggccaagaggaaggcccaaggcatccgcacgggc aagtgtgtggccttcaacgacactgtgaagacgtgtgagatctttggctggtgccccgtg gaggtggatgacgacatcccgcgccctgcccttctccgagaggccgagaacttcactctt ttcatcaagaacagcatcagctttccacgcttcaaggtcaacaggcgcaacctggtggag gaggtgaatgctgcccacatgaagacctgcctctttcacaagaccctgcaccccctgtgc ccagtcttccagcttggctacgtggtgcaagagtcaggccagaacttcagcaccctggct gagaagggtggagtggttggcatcaccatcgactggcactgtgacctggactggcacgta cggcactgcagacccatctatgagttccatgggctgtacgaagagaaaaatctctcccca ggcttcaacttcaggtttgccaggcactttgtggagaacgggaccaactaccgtcacctc ttcaaggtgtttgggattcgctttgacatcctggtggacggcaaggccgggaagtttgac atcatccctacaatgaccaccatcggctctggaattggcatctttggggtggccacagtt ctctgtgacctgctgctgcttcacatcctgcctaagaggcactactacaagcagaagaag ttcaaatacgctgaggacatggggccaggggcgatctgtgctctccgatgtggcagtcag taa