GENSCAN 1.0 Date run: 3-Nov-116 Time: 10:46:39

Sequence gi568815575r:53839161_54142821 : 303661 bp : 42.93% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.05 PlyA -     89     84    6                               1.05
 1.04 Term -    416    100  317  0  2   47   42   237 0.230   9.12
 1.03 Intr -   1406   1307  100  2  1   88   73    28 0.952   0.06
 1.02 Intr -   5184   5028  157  2  1   61   96    80 0.878   5.19
 1.01 Init -  10070   9652  419  2  2   71   53   136 0.686   4.35
 1.00 Prom -  10124  10085   40                              -5.65

 2.03 PlyA -  10243  10238    6                               1.05
 2.02 Term -  11768  11019  750  2  0  -11   43   268 0.349   4.85
 2.01 Init -  12530  11862  669  2  0   66   41   348 0.552  23.14
 2.00 Prom -  12993  12954   40                              -5.45

 3.06 PlyA -  13162  13157    6                               1.05
 3.05 Term -  13838  13390  449  0  2   12   43   177 0.032  -0.01
 3.04 Intr -  22897  22684  214  1  1   52  109    59 0.032   1.77
 3.03 Intr -  25656  25221  436  2  1   95   59   213 0.143  11.76
 3.02 Intr -  39043  38905  139  2  1    8   38    93 0.042  -5.10
 3.01 Init -  42368  42299   70  2  1   77   98    76 0.842   8.76
 3.00 Prom -  45446  45407   40                              -7.45

 4.00 Prom +  45872  45911   40                              -3.95
 4.01 Sngl +  50449  51180  732  0  0   81   43   387 0.432  29.47
 4.02 PlyA +  51408  51413    6                               1.05

 5.00 Prom +  51577  51616   40                              -6.15
 5.01 Sngl +  52938  53876  939  2  0   43   37   332 0.898  19.85
 5.02 PlyA +  54145  54150    6                               1.05

 6.00 Prom +  54900  54939   40                              -3.95
 6.01 Init +  57250  57387  138  0  0   52   77   106 0.682   6.23
 6.02 Term +  61412  61474   63  1  0   80   44    73 0.270  -0.99
 6.03 PlyA +  64057  64062    6                               1.05

 7.03 PlyA -  64358  64353    6                               1.05
 7.02 Term -  72678  72572  107  1  2   29   43   124 0.093  -0.41
 7.01 Init -  91953  91875   79  0  1   78  110    53 0.772   7.77
 7.00 Prom -  92658  92619   40                              -8.05

 8.26 PlyA -  93193  93188    6                               1.05
 8.25 Term -  99051  98582  470  1  2   92   38   192 0.172   8.75
 8.24 Intr - 100933 100796  138  2  0  -11   47   146 0.510   0.11
 8.23 Intr - 101356 101020  337  1  1  102   82   316 0.998  26.57
 8.22 Intr - 105263 104974  290  0  2   27   96   332 0.423  23.94
 8.21 Intr - 123779 123684   96  0  0  121   84   148 0.807  16.76
 8.20 Intr - 130705 130679   27  2  0   85   87    42 0.083   0.97
 8.19 Intr - 146067 145754  314  2  2  123   64   355 0.032  31.50
 8.18 Intr - 146789 146656  134  2  2   77   94   103 0.997   8.32
 8.17 Intr - 148003 147918   86  2  2   82   75   128 0.955   9.52
 8.16 Intr - 148784 148606  179  1  2  103   36   263 0.996  21.24
 8.15 Intr - 153679 153576  104  1  2   78   94   136 0.995  11.25
 8.14 Intr - 154637 154441  197  0  2   54   98   185 0.491  14.31
 8.13 Intr - 156622 156472  151  1  1   88   66   168 0.498  13.41
 8.12 Intr - 160801 160710   92  2  2  145   23    57 0.965   3.69
 8.11 Intr - 163101 162995  107  2  2   97   56   119 0.985   8.54
 8.10 Intr - 163522 163435   88  2  1  101   89    92 0.999   8.71
 8.09 Intr - 172124 171962  163  2  1  108   16    99 0.887   3.33
 8.08 Intr - 175403 175217  187  1  1   87   97   178 0.874  17.37
 8.07 Intr - 177576 177435  142  1  1   79  110   170 0.734  16.79
 8.06 Intr - 178661 178501  161  1  2  105   91   198 0.996  20.51
 8.05 Intr - 178885 178736  150  0  0   99   92     5 0.385   0.36
 8.04 Intr - 183207 183099  109  1  1   65   94   110 0.997   7.72
 8.03 Intr - 183683 183598   86  1  2  108   83   100 0.998  10.04
 8.02 Intr - 203660 203471  190  0  1   65  110   245 0.671  22.12
 8.01 Init - 203979 203898   82  0  1   53   81   139 0.995  10.88
 8.00 Prom - 216345 216306   40                              -5.35

 9.09 PlyA - 216389 216384    6                               1.05
 9.08 Term - 234127 233873  255  1  0   93   44   274 0.716  18.10
 9.07 Intr - 241129 241072   58  0  1   62  108    22 0.400  -0.43
 9.06 Intr - 242300 242162  139  0  1   98   63    84 0.482   5.50
 9.05 Intr - 246756 246555  202  0  1   52   82    48 0.438  -1.46
 9.04 Intr - 248804 248595  210  2  0   73   81   231 0.988  19.09
 9.03 Intr - 252266 252152  115  1  1  106  113   109 0.954  14.73
 9.02 Intr - 277634 277385  250  0  1   81  119   147 0.639  12.67
 9.01 Init - 283305 280017 3289  0  1   44   60  1053 0.422  88.16
 9.00 Prom - 283563 283524   40                              -5.25

10.07 PlyA - 283645 283640    6                              -4.04
10.06 Term - 284771 283663 1109  0  2   17   43   855 0.027  65.12
10.05 Intr - 293703 293532  172  0  1   52  115   145 0.988  12.19
10.04 Intr - 294886 294613  274  0  1   82  100   145 0.990  11.72
10.03 Intr - 295951 295671  281  1  2   67   80   144 0.917   6.85
10.02 Intr - 296444 296368   77  0  2   82  103    92 0.990   8.42
10.01 Init - 303395 303329   67  2  1   86   73    88 0.947   6.39

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  25656  25176  481  2  1   95   53   226 0.803  13.08
S.002 Init + 141221 141355  135  1  0   88   56   171 0.962  14.09
S.003 Term + 142658 142738   81  1  0  104   42    48 0.943  -1.49
S.004 Term - 146067 145740  328  2  1  123   47   363 0.963  28.90
S.005 Sngl + 265974 266528  555  2  0   43   43   280 0.953  14.87
S.006 Sngl - 284679 283663 1017  0  0   88   43   796 0.963  71.87

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815575r:53839161_54142821|GENSCAN_predicted_peptide_1|330_aa MGKDFMSKTPKAMATKGKIDKWDLIKLKSFCTAKETTIRVNRQPIEWEKIFANYSSDKGL ISRIYNELQQIYKKKTNNPIKKWAKDMNRHFSKEDIYAAKRHMRKCSSSLAIREMQIKTT MRSHLTPVRMVIMKNSGNNRLVDRLPSRHNRPYISGSRNFKALHTVKIDAVVLPKCSEIT GSVGVERMLYSQGHHRYIFSKKSSPMKANSNKWKKCLLLQMYRCQSWAKEQDTVEDEEGG GGGKGGRRGTKPYDTWDTMKQPNIRIFCVPEGEEKMKGLENLCNEITDGNFPSLARVLDI WTHEAQRSPNRYNLKGIHGTSNCQKTKRKF >gi568815575r:53839161_54142821|GENSCAN_predicted_CDS_1|993_bp atgggcaaggacttcatgtctaaaacaccaaaagcaatggcaacaaaaggcaaaattgac aagtgggatctaattaaactcaagagcttctgcacagcaaaagaaactaccatcagagtg aacaggcaacctatagaatgggagaaaatttttgcaaactactcatctgacaaagggcta atatccagaatctacaatgaactccaacaaatttacaagaaaaaaacaaacaaccccatc aaaaagtgggcaaaggatatgaacagacacttctcaaaagaagacatttatgcagccaaa agacacatgagaaaatgctcatcatcactggccatcagagaaatgcaaatcaaaaccaca atgagatcccatctcacaccagttagaatggtgatcatgaaaaactcaggaaacaacaga ctggtggatagactcccaagtcggcacaatagaccatatatttctggctccagaaacttt aaggcccttcacactgtaaagatagatgcagtggtgttgcctaagtgctcagaaatcacg gggtctgtgggagtggagaggatgctgtacagtcagggacatcatagatacatcttcagc aaaaagtcctcccctatgaaagcaaattcaaataaatggaagaagtgcctgttactccag atgtacagatgtcaatcctgggcaaaagagcaagacaccgtagaagatgaagaaggagga ggaggaggcaaaggaggaagaagaggcacaaagccatatgatacatgggacaccatgaag caaccaaatattcgaattttctgtgtcccagaaggtgaagagaaaatgaaagggttagaa aatctatgtaatgaaataacagatggaaacttcccaagcctagcaagagttttagacatc
tggacacatgaggctcaaagatccccaaacagatacaatttaaaaggtatccatggcaca tcaaactgccaaaagacaaagagaaaattctaa >gi568815575r:53839161_54142821|GENSCAN_predicted_peptide_2|472_aa MKAEIKMFFETNKNKDTTYQNLWDTFKAVCRGKFIALNAHKRKQERSKIDTLTSQLKELE KQEQTHSKASRRQEITKIRAEVKDIETQKTLQKISESRSWFFEKINRVDRPLPRLIKKKR EKNQIDAIKNDKGDITTDPTEIQTAIREYYKHLYANKRENLEEMDKFLDTYTLPRLNHEE VESLNRQITGSEIEAVINSLPTKKSPGPDGFTAEFYQRYKEELPGRDTTKKENFRPMSLM NIDAKILSKILANRIPQHIKKLIHHDQVGFIPGMQGWFNIGKSINVIQSINVIQHINRIK DKNHIIISIDAEKAFDKIQQPFMLKTLNKLDIDGMYLKIIKAIYDKPTANIILNGQKLEA FPLKTGTRQGCPLSPLLFNIVLEVLSRAIRQEKEIKGIQLRKEEAKLSLFADDMIVYLEN PIVSAQNLLKLIGNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPLTDC >gi568815575r:53839161_54142821|GENSCAN_predicted_CDS_2|1419_bp atgaaggcagaaataaagatgttctttgaaaccaacaagaacaaagacacaacataccag aatctctgggacacattcaaagcagtgtgtagagggaaatttatagcactaaatgcccac aagagaaagcaggaaagatctaaaattgacaccctaacatcacaattaaaagaactagaa aagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaagatcagagca gaagtgaaggacatagagacacaaaaaacccttcaaaaaatcagtgaatccaggagctgg ttttttgaaaagatcaacagagttgatagaccgctaccaagactaataaagaagaaaaga gagaagaatcaaatagacgcaataaaaaatgacaaaggggatatcaccaccgatcccaca gaaatacaaactgccatcagagaatactataaacacctctacgcaaataaacgagaaaat ctagaagaaatggataaattcctggacacatacaccctcccaagactaaaccacgaagaa gttgaatctctgaatagacaaataacaggctctgaaattgaggcagtaattaatagctta ccaaccaaaaaaagtccaggaccagatggattcacagccgaattctaccagaggtacaag gaggagctgcctggcagagacacaacaaaaaaagagaattttagaccaatgtccctgatg aacatcgatgcaaaaatcctcagtaaaatactggcaaaccgaatcccgcagcacatcaaa aagcttatccaccatgatcaagtgggctttatccctgggatgcaaggctggttcaacata ggcaaatcaataaacgtaatccaatcaataaacgtaatccagcatataaacagaatcaaa gacaaaaaccacataattatctcaatagatgcagaaaaggcctttgacaaaattcaacaa cccttcatgctaaaaactctcaataaattagatattgatgggatgtatctcaaaataata aaagctatctatgacaaacccacagccaatatcatactgaatgggcaaaaactggaagca ttccctttgaaaactggcacaagacagggatgccctctctcaccactcctattcaacata gtgttggaagttctgtccagggcaatcaggcaggagaaggaaataaagggtattcagtta cgaaaagaggaagccaaattgtccctgtttgcagatgacatgattgtatatctagaaaac 
cccatcgtctcagcccaaaatctccttaagctgataggcaacttcagcaaagtctcagga tacaaaatcaatgtgcaaaaatcacaagcattcttatacaccaataacagacaaacagag agccaaatcatgagtgaactcccattgacagactgctag >gi568815575r:53839161_54142821|GENSCAN_predicted_peptide_3|435_aa MACQGDREPEERGSELQGEKEQEVIYFLLVKLHVQSQVYGSASEVMRKGKIEDSSTPKQC VHWAFPEITRGRPDRLDLSQAFYEQIAGLLPVFWTAGLPQAEAAAGQQATPFWDLPFRGR GAQLPYQDTNWHLTCLSVLDVGAPSGRSCSWPTGYTLLGPALQREGCSAPVPGHELAPHL SRCSKSGVSSPAQAQAIDFSSISLGDVLKLWGWEEKTKLCPYHLGYMSSGLPGAVSRVRL QPWQNKLSKLTETCLRFSGFTHPTCRITKLNNYPHTRKYFHKNQKSARQANIQIQEIQRT PQRYSSRRATPRHIIIRFTKVEMKEKILRAAREKGRVTHKGKPIRLTADLLAETLQARRE WGPIFNILKEKNVQPRISYPAKLSFISKGEIKSFTDKQMLRDFVTTRPALKELLKEALNM ERNNQYQPLQKHAKL >gi568815575r:53839161_54142821|GENSCAN_predicted_CDS_3|1308_bp atggcctgccagggtgacagagaaccagaagaaagaggcagtgaattgcaaggggagaag gaacaagaagtcatctatttcctgctggtcaaattacatgttcaatctcaggtctatgga tcagcaagtgaggtaatgagaaaaggaaagattgaagattcatcaacaccaaagcagtgt gttcactgggcttttcctgaaattacaaggggcaggcctgaccggttagatttgtcccaa gccttctatgagcagattgctgggctgttgcctgtgttctggactgcggggctccctcag gcagaagctgcagctggccaacaggctacacccttctgggacctgcctttcagagggagg ggtgctcagctcccgtaccaggacacgaactggcacctcacttgtctcagtgttctggac gtcggggctccctcaggcagaagctgcagctggccaacaggctacacccttctgggacct gccttacagagggaggggtgctcagctcccgtaccaggacacgaactggcacctcacttg tctcggtgttctaagagtggggtctcctcccctgctcaagctcaggccatagatttcagc tccatatccctgggcgatgtgctcaaactctgggggtgggaggagaaaaccaagctgtgc ccctaccaccttgggtacatgtcctcaggacttcctggggctgtgtcacgggtgcgtctt caaccttggcaaaataaactttctaaattaactgagacctgtctcagattttctgggttc acacatcccacctgcaggatcaccaaactcaacaactatccccacacaagaaagtacttt cataagaaccaaaaatcagcaaggcaggccaacattcaaattcaggaaattcagagaacg ccacaaagatactcctcgagaagagcaactccaagacacataattatcagattcaccaaa gttgaaatgaaggaaaaaatattaagggcagccagagagaaaggtcgggttacccacaaa gggaagcccatcagactaacagctgatctcttggcagaaactctacaagccagaagagag tgggggccaatattcaacattcttaaagaaaagaatgttcaacccagaatttcatatcca gccaaactaagcttcataagcaaaggagaaataaaatcctttacagacaagcaaatgctg 
agagattttgtcaccaccaggcctgccctaaaagaactcctgaaggaagcactaaacatg gaaaggaacaaccagtaccagccactgcaaaaacatgccaaattgtaa >gi568815575r:53839161_54142821|GENSCAN_predicted_peptide_4|243_aa MARELRDKCTTLSSRFHQLEERVSVMEDQMNEMKREEKFREKRIKRNEQSLQEIWDYVKR PNIRLIGVPESHGENGTKLENTLQDIIQENFPNLARQANIQIQEIQRTPQRYSSRRATPR HIIIRFTKVEMKEKMLRAAREKGRVTHKGKPIRLTADLSAETLQARREWGPIFNILKEKN VQPRISYPAKLSFISEGEIKSFTDKQMLRDVVTTRPALKELLKEALNMERNNQYQPLQKH AKL >gi568815575r:53839161_54142821|GENSCAN_predicted_CDS_4|732_bp atggcacgagaactacgtgacaaatgcacaaccctcagtagccgattccatcaactggaa gaaagggtatcagtgatggaagatcaaatgaatgaaatgaagcgagaagagaagtttaga gaaaaaagaataaaaagaaatgaacaaagcctccaagaaatatgggactatgtgaaaaga ccaaatatacgtctgattggtgtacctgaaagtcacggggagaatggaaccaagttggaa aacactctgcaggatattatccaggagaacttccccaatctagcaaggcaggccaacatt caaattcaggaaattcagagaacgccacaaagatactcctcgagaagagcaactccaaga cacataattatcagattcactaaagttgaaatgaaggaaaaaatgttaagggcagccaga gagaaaggtcgggttacccacaaagggaagcccatcagactaacagcggatctctcagca gaaactctacaagccagaagagagtgggggccaatattcaacattcttaaagaaaagaat gttcaacccagaatttcatatccagccaaactaagcttcataagcgaaggagaaataaaa tcctttacagacaagcaaatgctgagagatgttgtcaccaccaggcctgccctaaaagaa ctcctgaaggaagcactaaacatggaaaggaacaaccagtaccagccactgcaaaaacat gccaaattgtaa >gi568815575r:53839161_54142821|GENSCAN_predicted_peptide_5|312_aa MNIDAKILNKILANRIQQHIETLIHHDQVGFIPGIEGWFNIRKSINVIQSINVIQHINRT KDKNHMIISIHAEKAFDKIQQCFMLKTLNKLGIDGTCLKIIKAIYDKPTANIILNGQKLE AFPLKTGIRQGCPVSPLLFNIVLEVLARAISQEKKIKGIQLGKEEVKLSLFADDMIVYLE NPIVSAQNILKLIGNFSKVSGYKINMQKSQAFLYTNNRQTESKILSELPFTIASKRIKYL EIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSFVGRINIMEMAILPKVIYRFNAIP IKLPMTFFTELE >gi568815575r:53839161_54142821|GENSCAN_predicted_CDS_5|939_bp atgaacatcgatgcaaaaatcctcaataaaatactggcaaaccgaatccagcagcacatc gaaacgcttatccaccatgatcaagtgggcttcatccctgggattgaaggctggttcaac atacgcaaatcaataaatgtaatccaatcaataaacgtaatccagcatataaacagaacc aaagacaaaaaccacatgattatctcaatacatgcagaaaaggcctttgacaaaattcaa caatgcttcatgctaaaaactctcaataaattaggtattgatgggacgtgtctcaaaata 
ataaaagctatctatgacaaacccacagccaatatcatactgaatgggcaaaaactggaa gcattccctttgaaaactggcataagacagggatgccctgtctcaccactcctattcaac atagtgttggaagttctggccagggcaatcagtcaggagaagaaaataaagggtattcag ttaggaaaagaggaagtcaaattgtccctgtttgcagatgacatgattgtatatctagaa aaccccatcgtctcagcccaaaatatccttaagctgataggcaacttcagcaaagtctca ggatacaaaatcaacatgcaaaaatcacaagcattcttatacaccaataacagacaaaca gagagcaaaatcctgagtgaactcccattcacaattgcttcaaagagaataaaataccta gaaatccaacttacaagggacgtgaaagacctcttcaaggagaactacaaaccactgctc aatgaaataaaagaggatacaaacaaatggaagaacattccatgctcattcgtaggaaga atcaatatcatggaaatggccatactgcccaaggtaatttatagattcaatgccatcccc atcaagctaccaatgactttcttcacagaattggaataa >gi568815575r:53839161_54142821|GENSCAN_predicted_peptide_6|66_aa MLMGTIAAELESLLLIRMCILSGRGNGTYLGALTRNVGVVTLMGSKAAVHPDLPPSPYDS RDPKGA >gi568815575r:53839161_54142821|GENSCAN_predicted_CDS_6|201_bp atgctgatgggaaccattgcagctgaattggaatccctgcttttgataaggatgtgtatc ctgagtggcagaggtaatggcacttacctgggtgcccttaccaggaatgttggtgtggtg accctaatgggcagcaaggctgcagttcacccagatctgccaccaagcccttatgactca agggatccgaaaggtgcttga >gi568815575r:53839161_54142821|GENSCAN_predicted_peptide_7|61_aa MKSSLEVTCGRMLTVKCGRHKDVSVADVEAEKTQIQRRRPYEDGGSDWSYAATSQGTPEP Q >gi568815575r:53839161_54142821|GENSCAN_predicted_CDS_7|186_bp atgaaaagttccctggaggtaacgtgtggaaggatgttgactgttaagtgtggtagacat aaagatgtttcagttgcagacgtagaagcagagaagacacagatacaaaggagaaggcca tatgaagatggaggcagcgattggagttatgcagccacaagccaaggaacaccggaacca cagtga >gi568815575r:53839161_54142821|GENSCAN_predicted_peptide_8|1359_aa MKEASRKGFRYFLETRKEVITDNGAEEAIVQRGRVLPPPAPLDTTNLAGRRTLQGRAKMA SVPVYCLCRLPYDVTRFMIECDMCQDWFHGSCVGVEEEKAADIDLYHCPNCEVLHGPSIM KKRRGSSKGHDTHKGKPVKTGSPTFVRELRSRTFDREEHHVGRILPTSASDLVSCLGQII SSLADEVFGISCQLTLSPLIKTPPLGSDEVILKPTGNQLTVEFLEENSFSVPILVLKKDG LGMTLPSPSFTVRDVEHYVGSDKEIDVIDVTRQADCKMKLGDFVKYYYSGKREKVLNVIS LEFSDTRLSNLVETPKIVRKLSWVENLWPEECVFERPNVQKYCLMSVRDSYTDFHIDFGG TSVWYHVLKGEKIFYLIRPTNANLTLFECWSSSSNQNEMFFGDQVDKCYKCSVKQGQTLF IPTGWIHAVLTPVDCLAFGGNFLHSLNIEMQLKAYEIEKRLSTADLFRFPNFETICWYVG 
KHILDIFRGLRENRRHPASYLVHGGKALNLAFRAWTRKEALPDHEDEIPETVRTVQLIKD LAREIRLVEVRSMGQGRKGIWKGLEAQVAVSMSRLSLPSKNGSKKKGLKPKELFKKAERK GKESSALGPAGQLSYNLMDTYSHQALKTGSFQKAKFNITGACLNDSDDDSPDLDLDGNES PLALLMSNGSTKRVKSLSKSRRTKIAKKVDKARLMAEQVMEDEFDLDSDDELQIDERLGK EKATLIIRPKFPRKLPRAKPCSDPNRVREPGEVEFDIEEDYTTDEDMVEGVEGKLGNGSG AGGILDLLKASRQVGGPDYAALTEAPASPSTQEAIQGMLCMANLQSSSSSPATSSLQAWW TGGQDRSSGSSSSGLGTVSNSPASQRTPGKRPIKRPAYWRTESEEEEENASLDEQDSLGA CFKDAEYMNVLGAFVEIYPSLESDDDDPALKSRPKKKKNSDDAPWSPKVCALSIGELGSY VMASKIWLHKKLEAGRPMRVPLESLEVGNMASLRINVLVTPSHPVSLAARVTPTLPKQDR PVREGTRVASIETGLAAAAAKLAQQELQKAQKKKYIKKKPLLKEVEQPRPQDSNLSLTVP APTVAATPQLVTSSSPLPPPEPKQEALSGSLADHEYTARPNAFGMAQANRSTTPMAPGVF LTQRRPSVGSQSNQAGQGRDTEGQPKVLGFDLMGHGEMLTDLKYRQDQVLVLKDPSGGQK EVEEHSLDIMEGVAGRPTPGSSIGNFSCFFLHLRSWSEEDLRTHIFTPGPLHFQTTGHRS SRMQTGHSRAQVEEKRRPSVKEQTIHPILPFPCPSFVDCLHCPDSAPTSHRLASLAIWTR WRTSPHLHLALPKWSVPLPFPPVPPTPTSPQLAQCFWGI >gi568815575r:53839161_54142821|GENSCAN_predicted_CDS_8|4080_bp atgaaagaagcttcccgaaaaggcttcaggtattttcttgaaacccggaaagaagttatc actgacaacggggccgaagaagctatcgtccagagaggacgcgtgctgccgcctcccgcc cctcttgacacgacgaacctggccggccgcagaacgctccagggccgagcgaagatggcc tcggtgccggtgtattgcctctgccggctgccttacgatgtgacccgcttcatgatcgag tgtgacatgtgccaggactggtttcatggcagttgtgttggtgttgaagaggagaaggct gctgacattgacctctaccactgccccaactgtgaagtcttgcatgggccctccattatg aaaaaacgccgtggatcttcaaaggggcatgatacacacaaggggaaaccagtgaagacc gggagccctacgttcgtcagagagctccggagtaggacttttgacagagaagagcaccat gttggcagaatactccccacctctgccagtgacctggtctcttgccttggacagatcatt tcttccctggcagatgaggtttttggcattagttgccagctcacattaagtcccctcata aaaaccccacctcttggctcagatgaagtgattctgaagcccactggaaatcaactgacc gtggaattcctggaagaaaatagcttcagtgtgcccatcctggtcctgaagaaggatggg ttgggcatgacgctgccctcgccatcattcactgtgagggatgttgaacactatgttggt tctgacaaagagattgatgtgattgatgtgacccgccaggctgactgcaagatgaagctt ggtgattttgtgaaatactattacagcgggaagagggagaaagtcctcaatgtcattagt ttggaattctctgataccagactttctaaccttgtggagacaccgaagattgttcgaaag ctgtcatgggtcgaaaacttgtggccagaggaatgtgtctttgagagacccaatgtacag 
aagtactgcctcatgagtgtgcgagatagctatacagactttcacattgactttggtggc acctctgtctggtaccatgtactcaagggtgaaaagatcttctacctgatccgcccaaca aatgccaatctgactctctttgagtgctggagcagttcctctaatcagaatgagatgttc tttggggaccaggtggacaagtgctacaagtgttccgtgaagcaaggacagacacttttc attcccacagggtggatccatgctgtgctgacgcctgtggactgccttgcctttggaggg aacttcttacacagccttaacatcgagatgcagctcaaagcctatgagattgagaagcgg ctgagcacagcagacctcttcagattccccaactttgagaccatctgttggtatgtggga aagcacatcctggacatctttcgcggtttgcgagagaacaggagacaccctgcctcctac ctggtccatggtggcaaagccttgaacttggcctttagagcctggacaaggaaagaagct ctgccagaccatgaggatgagatcccggagacagtgcgaaccgtacagctcattaaagat ctggccagggagatccgcctggtggaagtaaggagcatggggcagggaagaaaaggcatt tggaaaggccttgaggctcaagtggcagtgtccatgtccaggctgtcactgccctccaaa aatggttcaaagaagaaaggcctgaagcccaaggaactcttcaagaaggcagagcgaaag ggcaaggagagttcagccttggggcctgctggccagttgagctataatctcatggacaca tacagtcatcaggcactgaagacaggctctttccagaaagcaaagttcaacatcactggt gcctgcttgaatgactcagatgacgactcaccagacttggaccttgatggaaatgagagc ccattggccctattgatgtctaacggcagtacgaaaagggtgaagagtttatccaaatct cggcgaaccaagatagcaaagaaggtagacaaggctaggctgatggcagaacaggtgatg gaagacgaatttgacttggattcagatgatgagctgcagattgacgagagattgggaaag gagaaggcgaccctgataataagaccaaaatttccccggaaattgccccgtgcgaagcct tgctctgaccccaaccgagttcgtgaaccaggagaagttgagtttgacattgaggaggac tatacaacagatgaggacatggtggaaggggttgaaggcaagcttgggaatggtagtggc gctggtggcattcttgatctgctcaaggccagcaggcaggtggggggacctgactatgct gccctcaccgaggccccagcttctcccagcactcaggaggccatccagggcatgctgtgc atggccaacctgcagtcctcatcgtcctcaccggctacctctagcctgcaggcctggtgg actgggggacaggatcgaagcagtgggagctccagcagtgggctgggcacagtgtctaac agtcctgcttcccagcgcaccccagggaagcggcccatcaagcggccagcatactggaga accgagagcgaggaggaggaggagaacgccagtctggatgaacaggacagcttgggagcg tgcttcaaggatgcagagtatatgaatgttcttggtgccttcgtggaaatctatccttca ctggagtctgatgatgatgaccctgctttgaaatctcgacccaagaaaaagaagaattca gatgatgctccatggagtcctaaagtgtgtgctttgagcattggtgaattaggttcttat gtgatggcaagcaaaatctggcttcataaaaagctagaagcaggaagaccaatgagagta 
cctttggagagcttggaggtagggaatatggctagccttcggataaatgtcttagtgaca ccttctcatcctgtttcccttgcagcccgcgtgaccccaactctgccgaagcaggaccgt cctgtgcgtgaggggacccgggtagcctctattgagacaggtttggctgcagcagctgca aagctggcccagcaggagctacagaaggcccaaaagaagaaatatatcaagaagaagcct ttgctgaaggaggtagaacagcctcgccctcaagactccaatctcagtctgacagtacca gcccccactgtggctgccacaccacaacttgtcacctcctcctcacccctgcctcctcct gagcctaaacaagaggccctgtcaggaagtctcgctgaccatgagtacaccgctcgtccc aatgcctttggcatggcccaggcaaaccgcagcaccacacctatggcccccggtgtcttc ttgacccagcggcgcccttcagttggctcccagagcaatcaggcaggacaagggagggac actgaaggccaacctaaggtgcttggatttgatcttatgggccatggagagatgttgact gatttgaagtacagacaagatcaggttttagttttgaaagatccttctggtggccagaag gaggtagaggagcacagcctggacattatggagggtgtggcgggacggcccacacctggg tcctccatcgggaacttttcatgcttctttctccacctgaggtcttggtctgaagaagac ctcaggactcacatcttcactcctgggcctttgcacttccagacgacaggtcatcgttca agcagaatgcagacaggccattcacgagcccaagttgaagagaagagacgcccatccgtg aaggagcagaccatccatccgatcctccccttcccctgtccttccttcgtggattgtctc cattgtccagacagtgcccccacctcccaccgccttgcctcactggcaatctggactcga tggagaacatccccccacctccatttggcactacccaagtggagtgtacccttgcccttt ccacctgtaccacccactccaacctcaccccagcttgcccaatgcttctggggaatttaa >gi568815575r:53839161_54142821|GENSCAN_predicted_peptide_9|1505_aa MVKGSIQQEELTILNIYAPNTGAPRFIKQVLSDLQRDLDSHTLIMGDFNTPLSTLDRSTR QKVNKDTQELNSALYQADLIDIYRTLHPKSTEYAFFSAPHHTYSKIDHIVGSKALLSKCK RTEIITNYLSDHSAIKLELRIKNLTQSRSTTWKLNNLLLNDYWVHNEMKAEIKMFFETNE NKDTTYQNLWDAFKAVCRGKFIALNAYKRKQERSKIDTLTSQLKELEKQEQTHSKASRRQ EITKIRAELKEIETQKTLQKINESRSWFFERINKIDRPLARLIKKKREKNQIDTIKNDKG DITTDPTEIQTTIREYYKHLYANKLENLEEMDTFLDTYTLPRLNQEEVESLNRPITGSEI VAIINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSIEKEGILPNSFYEASIILIP KPGRDTTKKENFRPISLMNIDAKILNKILANRIQQHIKKLIHHDQVGFIPGMQGWFNIRK SINVIQHINRAKDKNHMIISIDAEKAFDKIQQPFMLKTLNKLGIDGTYFKIIRAIYDKPT ANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLS LFADDMIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMGELP FTIASKRIKYLGIQLTRDVKDLFKENYKPLLKEIKEDTNNWKNIPCSWVGRINIVKMAIL 
PKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIAKSILSQKNKAGGITLPDFKLY YKATVTKTAWYWYQNRDIDQWNRTEPSEIMPHIYNYLIFDKPGKNKQWGKDSLFNKWCWE NWLAICRKLKLDPFLTPYTKINSRWIKDLNVKPKTIKTLEENLGITIQDIGVGKYFMSKT PKAMATKDKIDKWDLIKLKSFCTAKETTIRVNRQPTKWEKIFATYSSDKGLISRIYNELK QIYKKKTNNPIKKWAKDTNRHFSKEDIYAAKKHMKKCSSSLAIREMQIKTTMRYHLTPVR MAIIKKSGNNRCWRGCGEIGTLLHCWWDCKLVQPLWKSVWRFLRDLELEIPFDPAIPLLG IYPNEYKSCCYKDTCTLPSVILKEWSAYKGKSPQTPELVSALTFREWTCPNLKKLWLGKA VEDKNRRMRAFLACMKSDTPSMLNPANVPTHLLLMCCVLRYMVQWPGGRILHRHELDTFL AQAVSTQLYEPDRLQELKIEKLDARGIQLAALFMSGVDTALFANDACGQPVPWEHCCPWI YFDGKLFQSKLIKAGRERVSLVELCDGQADLATKVEKMRQSILEGVNMNHPPPSALLPSP TFVPPMVPSLYPVSLYSRAMGSMPLPPQGRSRGFAGLHPIPPQGGKLEIAGMVVGQWAGS RSSRGRGSFGMQVVSVGGPGKGHGKEQTGRGSKGHKKGNKQGSSDGVSKSLELHQGRSRS QVNGNSGALIKEEKSDHRLPAPSQCALSRDSNECNNGNRYLPMNNREKNHLQEQKLETVA QRKED >gi568815575r:53839161_54142821|GENSCAN_predicted_CDS_9|4518_bp atggtaaagggatcaattcaacaagaggagctaactatcctaaatatttatgcacccaat acaggagcacccagattcataaagcaagtcctcagtgacctacaaagagacttagactcc cacacattaataatgggagactttaacaccccactgtcaacattagacagatcaacgaga cagaaagtcaacaaggatacccaggaattgaactcagctctgtaccaagcagacctaata gacatctacagaactctccaccccaaatcaacagaatatgcatttttttcagcaccacac cacacctattccaaaattgaccacatagttggaagtaaagctctcctcagcaaatgtaaa agaacagaaattataacaaactatctctcagaccacagtgcaatcaaactagaactcagg attaagaatctcactcaaagccgctcaactacatggaaactgaacaacctgctcctgaat gactactgggtacataacgaaatgaaggcagaaataaagatgttctttgaaaccaacgag aacaaagacaccacataccagaatctctgggacgcattcaaagcagtgtgtagagggaaa tttatagcactaaatgcctacaagagaaagcaggaaagatccaaaattgacaccctaaca tcacaattaaaagaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaa gaaataactaaaatcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaa atcaatgaatccaggagctggttttttgaaaggatcaacaaaattgatagaccgctagca agactaataaagaaaaaaagagagaagaatcaaatagacacaataaaaaatgataaaggg gatatcaccaccgatcccacagaaatacaaactaccatcagagaatactacaaacacctc tacgcaaataaactagaaaatctagaagaaatggatacattcctcgacacatacactctc ccaagactaaaccaggaagaagttgaatctctgaatagaccaataacaggctctgaaatt 
gtggcaataatcaatagtttaccaaccaaaaagagtccaggaccagatggattcacagcc gaattctaccagaggtacaaggaggaactggtaccattccttctgaaactattccaatca atagaaaaagagggaatcctccctaactcattttatgaggccagcatcattctgatacca aagccaggcagagacacaaccaaaaaagagaattttagaccaatatccttgatgaacatt gatgcaaaaatcctcaataaaatactggcaaaccgaatccagcagcacatcaaaaagctt atccaccatgatcaagtgggcttcatccctgggatgcaaggctggttcaatatacgcaaa tcaataaatgtaatccagcatataaacagagccaaagacaaaaaccacatgattatctca atagatgcagaaaaagcctttgacaaaattcaacaacccttcatgctaaaaactctcaat aaattaggtattgatgggacgtatttcaaaataataagagctatctatgacaaacccaca gccaatatcatactgaatgggcaaaaactggaagcattccctttgaaaactggcacaaga cagggatgccctctctcaccgctcctattcaacatagtgttggaagttctggccagggca atcaggcaggagaaggaaataaagggtattcaattaggaaaagaggaagtcaaattgtcc ctgtttgcagacgacatgattgtttatctagaaaaccccatcgtctcagcccaaaatctc cttaagctgataagcaacttcagcaaagtgtcaggatacaaaatcaatgtacaaaaatca caagcattcttatacaccaacaacagacaaacagagagccaaatcatgggtgaactccca ttcacaattgcttcaaagagaataaaatacctaggaatccaacttacaagggatgtgaag gacctcttcaaggagaactacaaaccactgctcaaggaaataaaagaggacacaaacaat tggaagaacattccatgctcatgggtaggaagaatcaatatcgtgaaaatggccatactg cccaaggtaatttacagattcaatgccatccccatcaagctaccaatgactttcttcaca gaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcccgcattgccaag tcaatcctaagccaaaagaacaaagctggaggcatcacactacctgacttcaaactatac tacaaggctacagtaaccaaaacagcatggtactggtaccaaaacagagatatagatcaa tggaacagaacagagccctcagaaataatgccgcatatctacaactatctgatctttgac aaacctgggaaaaacaagcaatggggaaaggattccctatttaataaatggtgctgggaa aactggctagccatatgtagaaagctgaaactggatcccttccttacaccttatacaaaa atcaattcaagatggattaaagatttaaacgttaaacctaaaaccataaaaaccctagaa gaaaacctaggcattaccattcaggacataggcgtgggcaagtacttcatgtccaaaaca ccaaaagcaatggcaacaaaagacaaaattgacaaatgggatctaattaaactaaagagc ttctgcacagcaaaagaaactaccatcagagtgaacaggcaacctacaaaatgggagaaa attttcgcaacctactcatctgacaaagggctaatatccagaatctacaatgaactcaaa caaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggcgaaggacacgaacaga cacttctcaaaagaagacatttatgcagccaaaaaacacatgaagaaatgctcatcatca 
ctggccatcagagaaatgcaaatcaaaaccactatgagatatcatctcacaccagttaga atggcaatcattaaaaagtcaggaaacaacaggtgctggagaggatgtggagaaatagga acacttttacactgttggtgggactgtaaactagttcaaccattgtggaagtcagtgtgg cgattcctcagggatctagaactagaaataccatttgacccagccatcccattactgggt atatacccaaatgagtataaatcatgctgctataaagacacatgcacacttccttcagtg atccttaaagagtggtctgcctataaagggaagtcacctcaaacccctgagctggtgtct gcactgacatttcgggaatggacttgcccaaacctcaagaagctctggctaggcaaagca gttgaagacaagaacagaaggatgcgggccttcctggcctgtatgaagtcggacacgccc agtatgctcaatccagctaatgtccccacccatctgctgctcatgtgttgtgtactccgg tacatggttcagtggcctggtggccgaatcctgcatcgccatgagctagacaccttcctt gcacaggcagtgtctacccagctttatgaaccagatcgactccaagaactcaagattgag aaactggatgcccgagggatccagcttgctgccctcttcatgagtggagtcgacacagct ctgtttgctaatgatgcctgtggccagccagtcccttgggagcattgctgtccatggatt tactttgatggcaagctgttccagagcaagctaattaaagcaggccgagagcgagtatct ctggttgagctctgtgatggccaggctgacctggcaaccaaagtggaaaagatgagacag agcatccttgaaggagtcaacatgaatcatccaccgccttctgctctacttccgtcacct acttttgtgcctcccatggtgccctctctctaccctgtttcactttattcccgagctatg ggctccatgccacttccccctcaagggaggagccggggatttgcaggtctccatccaatc ccaccccaaggaggaaaactggagattgctgggatggttgtgggccagtgggctggcagc agatcctccaggggccgaggatccttcggcatgcaagtggtttctgtcggtgggccagga aaggggcatggaaaagaacagactggtagaggatccaagggacacaaaaaaggaaataag caaggctcttcagatggagtttctaaatccctggagcttcatcaaggtcggtctcgctcc caggtaaatggaaacagtggcgcattgatcaaggaagagaagagtgatcatcgtcttcca gctccatcacaatgtgccttatccagagacagcaatgagtgtaataatggtaaccgctac ctccctatgaataatagggagaagaaccacttacaagagcaaaagctagaaactgtggca caacggaaagaggactga >gi568815575r:53839161_54142821|GENSCAN_predicted_peptide_10|659_aa MADAPPPASLLPCSSTSDCCASSFRNNRLGNPPLPRNQVGTISAGKPMFSHQVPQKVKYP PPFPVGPNSSLLFSSHALGESHAFSEDPMLQNSPFANWAVSYDSSASQFPNYLPSKASPP LGPDSSHSSSSDGDEPNGASSDHITEAFHHQPEWGNPNRDRGSWAQPVDTGVSEASLGDG EPHIPSLLSMSTRNHMDITIPPLPPVAPEVLRVAEHRHRRGLMYPYIYHVLTKGEIKIPV CIEDECNMELPPAALLFRSARQYVYGVLFSLAETQRKMERLAMRRRLPVEEGKLTNRKDI YTENPSVHHHHQRPKVDKTTKMGKKQNRKTGNSKTQSASPPPKERSSSPAMEQSWMENDF 
DELREEGFRRSNYSELREDIQTKGKEVENFEKNLEECITRITNTEKCLKELMELKTKARE LREECRSLRSRCDQVEERVSAMEDEMNEMKQEGKFREKRIKRNEQSLQEIWDYVKRPNLR LIGVPESDVENGTKLENTLQDIIQENFPNLARQANVQIQEIQRTPQRYSSRRATPRHIIV RFTKVEMKEKMLRAAREKGRVTLKGKPIRLTADLSAETLQARREWGPIFNILKEKNFQPR ISYPAKLSFISEGEIKYFIDKQMLRDFVTTRPALKELLKEALNMERNDRYQPLQNHAKM >gi568815575r:53839161_54142821|GENSCAN_predicted_CDS_10|1980_bp atggcggacgcccctcccccagcctcgctgctgccttgcagttcgacctcagactgctgt gccagcagttttcggaataataggcttggaaatcctccccttccacgaaatcaagtgggc accatttctgctggaaagccaatgttttctcatcaagtgccccagaaagtgaaatatcca ccaccattcccagtgggacccaactcatctcttctcttctcctcccatgctttgggggaa tcccatgctttttctgaggatcccatgctgcagaacagcccctttgccaattgggctgtc tcctatgactcttctgcatcccagtttcccaattacctgccttctaaagcctcacctcct ttgggaccagactcttcccactcctcttcctctgatggtgatgagccaaatggagctagc tctgatcatatcacagaagcatttcatcaccagcctgagtggggaaatcccaatcgtgac agagggtcctgggcacagcctgttgatactggagtttcagaagcgagcctaggtgatggt gagccccacatcccatctctgctgtctatgtctacaaggaaccacatggatatcaccatt ccacccttacctccagtagctccagaagtcttgagagttgctgaacacagacaccggagg ggtcttatgtacccatatatctaccatgtcctcactaagggtgaaattaagatccccgta tgtattgaggatgagtgtaacatggagctgcctccagctgctctcttattccggtcagct cgtcaatatgtatatggagttctttttagtctggcagagacacagaggaaaatggaacgc ttggccatgcgacggcggctgcctgtggaagaaggaaaactaacaaacagaaaggacatc tacaccgaaaacccatctgttcatcaccatcatcaaagaccaaaagtagataaaaccaca aagatggggaaaaaacagaacagaaaaactggaaactctaaaacgcagagcgcctctcct cctccaaaggaacgcagttcctcaccagcaatggaacaaagctggatggagaatgatttt gacgagctgagagaagaaggcttcagacgatcaaattactctgagctacgggaggacatt caaaccaaaggcaaagaagttgaaaactttgaaaaaaatttagaagaatgtataactaga ataaccaatacagagaagtgcttaaaggagctgatggagctgaaaaccaaggctcgagaa ctacgtgaagaatgcagaagcctcaggagccgatgcgatcaagtggaagaaagggtatca gcaatggaagatgaaatgaatgaaatgaagcaagaagggaagtttagagaaaaaagaata aaaagaaatgagcaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacgt ctgattggtgtacctgaaagtgatgtggagaatggaaccaagttggaaaacactctgcag gatattatccaggagaacttccccaatctagcaaggcaggccaacgttcagattcaggaa 
atacagagaacgccacaaagatactcctcgagaagagcaactccaagacacataattgtc agattcaccaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaaggtcgg gttaccctcaaaggaaagcccatcagactaacagcggatctctcggcagaaaccctacaa gccagaagagagtgggggccaatattcaacattcttaaagaaaagaattttcaacccaga atctcatatccagccaaactaagcttcataagtgaaggagaaataaaatactttatagac aagcaaatgctgagagattttgtcaccaccaggcctgccctaaaagagctcctgaaggaa gcgctaaacatggaaaggaacgaccggtaccagccgctgcaaaatcatgccaaaatgtaa