GENSCAN 1.0 Date run: 8-Nov-116 Time: 16:10:30 Sequence gi568815596r:74122867_74415023 : 292157 bp : 45.59% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.05 PlyA - 4502 4497 6 1.05 1.04 Term - 12792 12727 66 0 0 79 47 52 0.212 -1.86 1.03 Intr - 19494 19406 89 1 2 70 82 92 0.891 6.49 1.02 Intr - 22437 22323 115 0 1 116 110 64 0.990 11.42 1.01 Init - 25008 24955 54 0 0 85 72 118 0.840 9.19 1.00 Prom - 26562 26523 40 -6.56 2.07 PlyA - 26987 26982 6 1.05 2.06 Term - 33779 33702 78 2 0 113 36 26 0.502 -2.34 2.05 Intr - 36388 36225 164 2 2 86 111 28 0.680 4.59 2.04 Intr - 42485 42352 134 1 2 64 97 7 0.359 -0.51 2.03 Intr - 44241 44148 94 1 1 92 73 47 0.323 2.62 2.02 Intr - 49886 49720 167 1 2 60 98 100 0.358 7.80 2.01 Init - 55808 55795 14 2 2 75 70 13 0.144 -2.45 2.00 Prom - 61032 60993 40 -4.56 3.00 Prom + 68127 68166 40 -3.46 3.01 Init + 75776 75876 101 1 2 35 103 94 0.027 4.19 3.02 Intr + 82839 83023 185 0 2 82 71 160 0.992 13.13 3.03 Intr + 84838 84960 123 2 0 69 109 43 0.967 5.06 3.04 Intr + 85703 85855 153 0 0 87 82 57 0.652 5.04 3.05 Intr + 88333 88425 93 2 0 118 110 65 0.999 11.64 3.06 Intr + 88875 89000 126 1 0 135 94 147 0.995 20.65 3.07 Term + 91213 91376 164 2 2 85 36 85 0.957 1.10 3.08 PlyA + 92393 92398 6 1.05 4.26 PlyA - 93400 93395 6 1.05 4.25 Term - 95878 95761 118 0 1 69 45 115 0.857 3.31 4.24 Intr - 98635 98568 68 1 2 92 92 62 0.922 4.70 4.23 Intr - 102129 101974 156 0 0 93 59 212 0.754 19.01 4.22 Intr - 104264 104091 174 2 0 96 58 317 0.988 29.64 4.21 Intr - 105012 104944 69 0 0 48 110 118 0.998 9.38 4.20 Intr - 108442 108370 73 0 1 108 100 110 0.988 13.61 4.19 Intr - 109781 109603 179 2 2 84 91 283 0.924 26.92 4.18 Intr - 110697 110536 162 0 0 57 116 218 0.996 21.67 4.17 Intr - 112348 112235 114 1 0 117 47 72 0.990 6.64 4.16 Intr - 116647 116469 179 2 2 67 121 221 0.655 22.94 4.15 Intr - 119186 119128 59 1 2 92 119 61 0.629 8.13 4.14 Intr - 124441 124163 279 0 0 109 48 517 0.457 46.39 4.13 Intr - 125620 125487 134 1 2 111 58 186 0.993 17.34 4.12 Intr - 127651 127477 175 0 1 8 89 197 0.496 11.74 4.11 Intr - 129522 129313 210 2 0 135 44 140 0.999 12.33 4.10 Intr - 130262 130108 155 2 2 92 85 302 0.999 29.17 4.09 Intr - 131840 131753 88 1 1 127 95 56 0.996 10.17 4.08 Intr - 133066 132909 158 1 2 77 94 294 0.999 27.71 4.07 Intr - 136777 136722 56 2 2 127 67 -2 0.706 0.30 4.06 Intr - 139366 139271 96 2 0 81 56 87 0.699 4.78 4.05 Intr - 141433 141281 153 2 0 64 89 160 0.736 13.74 4.04 Intr - 142398 142238 161 2 2 116 66 162 0.999 16.43 4.03 Intr - 163036 162907 130 2 1 96 94 118 0.970 12.95 4.02 Intr - 181814 181623 192 0 0 110 80 112 0.276 12.06 4.01 Init - 182439 182424 16 0 1 54 78 9 0.343 -3.43 4.00 Prom - 182771 182732 40 -8.36 5.06 PlyA - 183618 183613 6 1.05 5.05 Term - 184377 183859 519 0 0 -27 46 710 0.512 49.60 5.04 Intr - 184554 184444 111 0 0 47 73 177 0.998 12.68 5.03 Intr - 184853 184654 200 0 2 65 -52 366 0.999 19.37 5.02 Intr - 185143 185031 113 0 2 80 55 139 0.336 9.82 5.01 Init - 191634 191534 101 0 2 96 97 10 0.092 2.64 5.00 Prom - 201149 201110 40 -4.06 6.05 PlyA - 201195 201190 6 1.05 6.04 Term - 207386 206932 455 0 2 46 43 155 0.650 2.12 6.03 Intr - 211311 211161 151 0 1 99 89 63 0.783 7.24 6.02 Intr - 216037 215989 49 0 1 134 95 15 0.827 5.48 6.01 Init - 224442 224372 71 0 2 51 61 39 0.143 -1.98 6.00 Prom - 228379 228340 40 -1.76 7.29 PlyA - 229459 229454 6 1.05 7.28 Term - 238770 238633 138 0 0 60 47 208 0.531 11.86 7.27 Intr - 239275 239186 90 1 0 135 72 63 0.998 9.59 7.26 Intr - 239863 239784 80 2 2 121 82 99 0.997 11.87 7.25 Intr - 240311 240128 184 2 1 123 84 109 0.998 13.36 7.24 Intr - 240561 240428 134 1 2 93 100 64 0.997 8.46 7.23 Intr - 242375 242209 167 1 2 26 89 210 0.995 14.50 7.22 Intr - 242791 242649 143 1 2 47 -2 277 0.995 13.75 7.21 Intr - 243152 243027 126 2 0 76 100 129 0.970 13.78 7.20 Intr - 243509 243378 132 2 0 91 100 217 0.999 24.04 7.19 Intr - 243754 243593 162 1 0 80 80 151 0.900 13.67 7.18 Intr - 244066 243917 150 1 0 43 14 209 0.997 9.36 7.17 Intr - 244241 244179 63 2 0 100 94 30 0.891 3.71 7.16 Intr - 244554 244486 69 0 0 100 67 88 0.994 7.28 7.15 Intr - 244998 244830 169 2 1 101 64 255 0.975 24.35 7.14 Intr - 245265 245105 161 0 2 97 92 274 0.995 27.49 7.13 Intr - 246014 245862 153 2 0 88 86 81 0.593 8.17 7.12 Intr - 246348 246232 117 0 0 136 77 153 0.981 19.66 7.11 Intr - 246625 246434 192 1 0 50 88 347 0.999 30.59 7.10 Intr - 247203 247099 105 0 0 93 91 138 0.998 15.01 7.09 Intr - 247479 247320 160 2 1 101 41 265 0.999 23.09 7.08 Intr - 247678 247600 79 2 1 60 75 119 0.949 6.41 7.07 Intr - 247959 247755 205 0 1 114 86 395 0.962 40.67 7.06 Intr - 248310 248113 198 0 0 81 42 249 0.884 19.15 7.05 Intr - 248798 248671 128 0 2 33 116 24 0.379 0.10 7.04 Intr - 250082 250062 21 0 0 94 100 20 0.262 1.32 7.03 Intr - 254860 254782 79 1 1 40 77 61 0.377 -0.68 7.02 Intr - 255379 255134 246 1 0 119 66 210 0.948 19.56 7.01 Init - 257171 257139 33 2 0 94 114 18 0.937 5.05 7.00 Prom - 273171 273132 40 -3.36 8.06 PlyA - 273238 273233 6 1.05 8.05 Term - 285119 284931 189 2 0 122 46 67 0.974 3.35 8.04 Intr - 289870 289782 89 2 2 104 116 -34 0.613 0.69 8.03 Intr - 290401 290162 240 2 0 78 68 134 0.848 7.92 8.02 Intr - 290769 290564 206 2 2 80 89 37 0.317 1.84 8.01 Intr - 291932 291467 466 0 1 66 60 355 0.119 22.69 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 75824 76250 427 2 1 57 54 284 0.916 16.68 S.002 Init + 262810 262884 75 0 0 86 115 55 0.804 9.09 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:74122867_74415023|GENSCAN_predicted_peptide_1|107_aa MAAWSPAAAAPLLRGIRGLPLHHRMFATQTEGELRVTQILKEKFPRATAIKVTDISGGCG AMYEIKIESEEFKEKRTVQQHQMVNQALKEEIKEMHGLRIFTSVPKR >gi568815596r:74122867_74415023|GENSCAN_predicted_CDS_1|324_bp atggctgcatggagcccggccgcggcagcgcctctcctccgcgggatccgcgggcttcca cttcaccatcggatgtttgccactcagactgagggggagctcagagtgacccaaattctc aaagaaaagtttccacgagctacagctataaaagtcactgacatttcaggaggttgtggg gcgatgtatgaaattaaaattgaatcagaagaatttaaggagaagagaactgtccagcag caccagatggttaatcaggcactaaaagaagaaatcaaagagatgcatggattgcggata tttacctctgtccccaaacgctga >gi568815596r:74122867_74415023|GENSCAN_predicted_peptide_2|216_aa MSFLFSSRSSKTFKPKKNIPEGSHQYELLKHAEATLGSGNLRQAVMLPEGEDLNEWIAVN TVDFFNQINMLYGTITEFCTEASCPVMSAGPRYEYHWADGTNIKKPIKCSAPKYIDYLMT WVQDQLDDETLFPSKIGVPFPKNFMSVAKTILKRLFRVYAHIYHQHFDSVMQLQEEAHLN TSFKHFIFFVQEFNLIDRRELAPLQELIEKLGSKDR >gi568815596r:74122867_74415023|GENSCAN_predicted_CDS_2|651_bp atgagcttcctcttcagcagccgctcttctaaaacattcaaaccaaagaagaatatccct gaaggatctcatcagtatgaactcttaaaacatgcagaagcaactctaggaagtgggaat ctgagacaagctgttatgttgcctgagggagaggatctcaatgaatggattgctgtgaac actgtggatttctttaaccagatcaacatgttatatggaactattacagaattctgcact gaagcaagctgtccagtcatgtctgcaggtccgagatatgaatatcactgggcagatggt actaatattaaaaagccaatcaaatgttctgcaccaaaatacattgactatttgatgact tgggttcaagatcagcttgatgatgaaactctttttccttctaagattggtgtcccattt cccaaaaactttatgtctgtggcaaagactattctaaagcgtctgttcagggtttatgcc catatttatcaccagcactttgattctgtgatgcagctgcaagaggaggcccacctcaac acctcctttaagcactttattttctttgttcaggagtttaatctgattgataggcgtgag ctggcacctcttcaagaattaatagagaaacttggatcaaaagacagataa >gi568815596r:74122867_74415023|GENSCAN_predicted_peptide_3|314_aa MAATSLMSALAARLLQPAHSCSLRLRPFHLAAVRNEAVVISGRKLAQQIKQEVRQEVEEW VASGNKRPHLSVILVGENPASHSYVLNKTRAAAVVGINSETIMKPASISEEELLNLINKL NNDDNVDGLLVQLPLPEHIDERRICNAVSPDKDVDGFHVINVGRMCLDQYSMLPATPWGV WEIIKRTGDATVTISHRYTPKEQLKKHTILADIVISAAGIPNLITADMIKEGAAVIDVGI NRVHDPVTAKPKLVGDVDFEGVRQKAGYITPVPGGVGPMTVAMLMKNTIIAAKKVLRLEE REVLKSKELGVATN >gi568815596r:74122867_74415023|GENSCAN_predicted_CDS_3|945_bp atggctgcgacttctctaatgtctgctttggctgcccggctgctgcagcccgcgcacagc tgctcccttcgccttcgccctttccacctcgcggcagttcgaaatgaagctgttgtcatt tctggaaggaaactggcccagcagatcaagcaggaagtgcggcaggaggtagaagagtgg gtggcctcaggcaacaaacggccacacctgagtgtgatcctggttggcgagaatcctgca agtcactcctatgtcctcaacaaaaccagggcagctgcagttgtgggaatcaacagtgag acaattatgaaaccagcttcaatttcagaggaagaattgttgaatttaatcaataaactg aataatgatgataatgtagatggcctccttgttcagttgcctcttccagagcatattgat gagagaaggatctgcaatgctgtttctccagacaaggatgttgatggctttcatgtaatt aatgtaggacgaatgtgtttggatcagtattccatgttaccggctactccatggggtgtg tgggaaataatcaagcgaactggtgatgccactgttacaatatctcatcgatatactccc aaagagcagttgaagaaacatacaattcttgcagatattgtaatatctgctgcaggtatt ccaaatctgatcacagcagatatgatcaaggaaggagcagcagtcattgatgtgggaata aatagagttcacgatcctgtaactgccaaacccaagttggttggagatgtggattttgaa ggagtcagacaaaaagctgggtatatcactccagttcctggaggtgttggccccatgaca gtggcaatgctaatgaagaataccattattgctgcaaaaaaggtgctgaggcttgaagag cgagaagtgctgaagtctaaagagcttggggtagccactaattaa >gi568815596r:74122867_74415023|GENSCAN_predicted_peptide_4|1117_aa MSYLRECPPIHIGLPVPTYPQRKTDQKGHLSGLQKVHWGLRPDQPQQELTGPGSGASSQD SSMDLISRTRSPAAEQLQDILGEEDEAPNPTLFTEMDTLQHDGDQMEWKESARWIKFEEK VEEGGERWSKPHVSTLSLHSLFELRTCLQTGTVLLDLDSGSLPQIIDDVIEKQIEDGLLR PELRERVSYVLLRRHRHQTKKPIHRSLADIGKSVSTTNRSPARSPGAGPSLHHSTEDLRM RQSANYGRLCHAQSRSMNDISLTPNTDQRKNKFMKKIPKDSEASNVLVGEVDFLDQPFIA FVRLIQSAMLGGVTEVPVPTRFLFILLGPSGRAKSYNEIGRAIATLMVDDLFSDVAYKAR NREDLIAGIDEFLDEVIVLPPGEWDPNIRIEPPKKVPSADKRKSVFSLAELGQMNGSVGG GGGAPGGGNGGGGGGGSGGGAGSGGAGGTSSGDDGEMPAMHEIGEELIWTGRFFGGLCLD IKRKLPWFPSDFYDGFHIQSISAILFIYLGCITNAITFGGLLGDATDNYQGVMESFLGTA MAGSLFCLFSGQPLIILSSTGPILIFEKLLFDFSKGNGLDYMEFRLWIGLHSAVQCLILV ATDASFIIKYITRFTEEGFSTLISFIFIYDAIKKMIGAFKYYPINMDFKPNFITTYKCEC VAPDTGDREYNRVQCFSPIGTRHQRFSALDWSLLSKKECLSYGGRLLGNSCKFIPDLALM SFILFFGTYSMTLTLKKFKFSRYFPTKVRALVADFSIVFSILMFCGIDACFGLETPKLHV PSVIKPTRPDRGWFVAPFGKNPWWVYPASILPALLVTILIFMDQQITAVIVNRKENKLKK AAGYHLDLFWVGILMALCSFMGLPWYVAATVISIAHIDSLKMETETSAPGEQPQFLGVRE QRVTGIIVFILTGISVFLAPILKCIPLPVLYGVFLYMGVASLNGIQFWERCKLFLMPAKH QPDHAFLRHVPLRRIHLFTLVQILCLAVLWILKSTVAAIIFPVMILGLIIVRRLLDFIFS QHDLAWIDNILPEKEKKETDKKRKRKKGAHEDCDEEEKDLPVGVTHSDSSFSDTELDRSY SRNPVFMVPQVKIEMESDYDFTDMDKYRRETDSETTL >gi568815596r:74122867_74415023|GENSCAN_predicted_CDS_4|3354_bp atgtcctatctgagagaatgccctcctatccacattgggcttccagtacccacttaccct caaagaaaaactgaccagaagggacatctttcaggcctgcaaaaagtccactggggcctg cggccagaccagccacagcaggaactgactggcccagggagtggggcaagcagccaggac agcagcatggatcttatcagcaggactcggtccccagctgctgagcagctccaggacatc ctgggggaggaagatgaggctcccaaccccaccctctttacagagatggatactctgcag catgacggagaccagatggagtggaaggagtcagccaggtggataaagtttgaagaaaag gtagaggaaggcggcgaacgctggagcaagccccacgtgtccacactatccctgcacagc ctcttcgagctccgtacctgcctgcagacggggacggtgctgctggatttggacagtggc tccttaccacagatcatagatgatgtcattgagaagcagattgaggatggtctcctgcgg ccagagctccgggagagggtcagttacgtcctcctgaggaggcaccgccaccaaaccaag aagcccatccaccgctccttagctgacattgggaagtcagtctccaccacaaatcgcagt cctgcccggagccctggtgctggcccgagtctacaccactccacggaagacctgcggatg cggcagagtgcaaattacggacgtctgtgtcatgcccagagcagaagcatgaatgacatt tctctcaccccaaacacagaccagcggaaaaacaaattcatgaagaagatccccaaggac tcagaagcgtccaacgtgctcgtgggcgaggtggacttcctagaccagccattcatcgcg ttcgtgcgcctcatccagtcggccatgctgggaggagtgaccgaggtgcctgtccccacc agatttctgtttatactactgggaccttctgggagagcaaaatcctacaatgaaattggc cgtgccattgcaaccctcatggtagatgatctcttcagtgacgtggcctacaaagcccgc aatcgggaagatctgatcgcaggaattgatgaatttctggatgaggtcatcgtccttcct cctggagaatgggacccaaatatccggattgagccccccaagaaggtgccctctgctgac aagaggaaatctgtgttctccctagcagagctgggccagatgaatggctctgtgggagga ggcggcggagctcctggaggaggcaatggaggtggtggtggtggtggcagtggcggcggg gctggcagtggcggggccggcggaacaagcagcggggatgatggagagatgccagccatg catgaaatcggggaggaacttatctggacaggaaggttcttcggtggcctgtgtctggat atcaagaggaagttgccctggttcccaagtgacttctatgatggcttccacattcagtcc atctctgccatcctattcatctacctcggctgtatcaccaacgcgatcacctttggtggg cttctgggggatgccaccgacaattatcagggagtgatggagagcttcctgggcactgcc atggctggctccttgttctgcctcttctcgggacagcctctcatcattctcagcagcacg gggcccatcctcatctttgagaagctcctcttcgacttcagcaaaggcaatggcctggac tacatggagttccgcctctggattggcctacactcagctgtccagtgccttatcctagtg gccacagatgccagctttatcatcaaatatatcacccgcttcaccgaggagggcttctcc acccttatcagcttcatcttcatctacgatgccatcaagaagatgatcggtgccttcaag tactaccctatcaatatggacttcaagccaaacttcatcactacctacaagtgcgagtgt gtcgcccctgacacaggtgaccgtgaatacaaccgtgttcaatgcttcagccccattggc accagacaccaacgcttctctgcgttggactggtccctgctgagcaagaaggagtgtctg agctacggcgggcgcctgcttgggaattcctgcaagtttatcccagacctggcgctcatg tccttcatccttttctttgggacatactccatgaccctgaccctgaagaagttcaaattc agccgctattttcctaccaaggtccgggccctggtggctgacttttccattgttttctcc atcctgatgttctgtggaatcgatgcctgttttggcctagaaactcccaagctgcatgtg cccagtgtcatcaagccaacgcggcctgaccgaggctggttcgtggccccctttgggaag aacccgtggtgggtatacccagcaagcatcctgcccgccctgctggtgaccatcctgatc ttcatggaccagcagatcactgccgtcattgtcaaccggaaggagaacaaactgaagaag gctgccggctaccatctggacctgttctgggtgggcatcctcatggctttgtgctccttt atggggctcccctggtacgtggctgccacggtcatctccatcgcccacatcgacagcctc aagatggagacagagaccagtgcccctggggagcagccccagtttctgggagtcagggaa cagagagtaaccggcatcatcgtcttcatcctgacgggaatctctgtcttcctggctccc atcctaaagtgtatccccctgccggtgctgtacggagtcttcctctacatgggcgtggcc tccctgaatggcatccagttctgggaacgctgcaagctcttcctgatgccagccaagcac cagccggaccatgccttcctgcggcacgtgccgctgcgccggatccacctcttcaccctg gtgcagatcctctgcctggcggtgctctggatcctcaaatccacggtggctgccatcatc ttcccggtcatgatcctgggcctcatcatcgttcgaaggcttctggatttcatcttttcc cagcacgacctggcctggattgacaacatcctcccagagaaggaaaaaaaggagacagac aagaagaggaagagaaaaaaaggggcccacgaggactgtgatgaggaggaaaaagatctt ccagttggagttactcactctgattcttccttcagtgacacagaacttgaccgaagctac tcacggaacccagtgttcatggtgccacaggtgaagatagagatggagtcagactatgac ttcacagacatggataaataccgaagagaaactgacagtgagaccaccctctag >gi568815596r:74122867_74415023|GENSCAN_predicted_peptide_5|347_aa MGLMKRQSDGRLLPSLPQPHLKENTNLAAHLEARMSFTTRSTFSSNYWSLGSVQPPSYGA WLVSSVASVYAVRSLETDNRKLESKIREHLEKKGPQVRDWSHYFKTIEDMSAQIFANTVD NAHIVQIDSAHLAANDFRLETEIEALEELLFMKKNHEEEVKDLQAQITSSGLTMELPQKS REEPDKYCSQQMEESTTVVTMQSAKVGAAEMMLMKLRRTVQSLEIDLDSMRNLKVSLGNS LREVEAHYTLQMEQLNGILLHLESELAQTQAEGQCQAQEYKALLNIKLKLEAEITTYHCL LEDGKDFNLGDALDSSNSMQIIQKNTTHWIVDGKVVSENNDTKVLRH >gi568815596r:74122867_74415023|GENSCAN_predicted_CDS_5|1044_bp atggggcttatgaaaaggcaatcagatgggcggctactgccatcgttaccccagccccac ttgaaggaaaataccaacctcgctgcccacctagaggctcgcatgagcttcaccactcgt tccaccttctcctccaactactggtccctgggctctgtccagccacccagctacggcgcc tggctggtcagcagtgtggccagtgtctatgcagtgaggagcctggagactgataacagg aagctggagagcaaaatccgggagcacctggagaagaagggaccccaggtcagagactgg agccattacttcaagaccattgaggacatgagcgctcagatcttcgcaaatactgtggac aatgcccacattgtgcagatcgacagtgcccatcttgctgccaatgactttagactggag acagagatcgaggctctcgaggagctgctcttcatgaagaagaaccacgaagaggaagta aaagacctacaagcccagattaccagctctgggttgaccatggagctgcctcagaagagc cgagaggagccagacaagtactgctctcagcagatggaggagagcaccacagtggtcacc atgcagtctgccaaggttggagctgctgagatgatgctcatgaaactgagacgtacagtc cagtccttggagattgacctggactccatgagaaatctgaaggtcagcttgggaaacagc ctgagggaggtggaggcccactacaccctgcagatggagcagctcaatgggatcctgctg cacctggaatcagagctggcacagacccaggcagagggacagtgccaggcccaggagtac aaggccctgctgaacatcaagctcaagctggaggctgagatcaccacctaccactgcctg ctggaagatggcaaggacttcaatcttggtgatgccctggacagcagcaactccatgcaa atcatccaaaagaacactacccactggatagtggatggcaaagtggtgtctgagaataat gacaccaaagttctgagacattaa >gi568815596r:74122867_74415023|GENSCAN_predicted_peptide_6|241_aa MSFATEMSVTSKEQKSEHSGLRDEWECKITFQIKSCCYQKSADTITVCSLLNPGHGSETA KADKNLGQPILYHLPKLGTNNLEAKNCTYCDITITIYTSPPSPIPHHLHLYHTPPSRPRY LHLDITTSIYTSRPPSTPHHPHLSLTTYTYATHLLLYLTITIYTSPPLSTPHHHVYLIPP SILHTTIHYLTISISTSHLCLQLTTSIYISSLPCKPHHLHSHITFPSTLHPPSTPHTSIY I >gi568815596r:74122867_74415023|GENSCAN_predicted_CDS_6|726_bp atgagctttgccacagaaatgtctgtgaccagtaaggagcagaaatcagaacacagtggt ttgagggatgagtgggagtgtaagataaccttccagataaagagttgctgctaccaaaag agtgctgacaccatcacagtatgttcccttctcaaccctggacatggttcagaaacagct aaagcagacaagaatctggggcagccaatattgtaccacctgccaaagcttggcacaaac aatctggaagccaaaaattgcacatactgtgacatcaccatcaccatctacacctcacca ccctcacctatacctcatcacctacacctatatcacacacctccatctagacctcgctac ctccatctagatatcaccacctccatctacacctcacgaccaccatctacacctcaccac cctcacctatctctcaccacctatacctacgccacacacctccttctatacctcaccatc accatctacacctcaccacccttgtccacacctcatcaccatgtatacctcatacctcca tctatacttcacaccaccatacactacctcaccatctccatctccacctcacacctctgt ctacaacttaccacctccatctacatctcatcacttccatgtaaacctcaccacctccat tcacatatcacatttccatctacacttcaccccccatctacacctcacacttccatctac atctga >gi568815596r:74122867_74415023|GENSCAN_predicted_peptide_7|1227_aa MAQSKRHVYSRTPSGSRMSAEASARPLRVGSRVEVIGKGHRGTVAYVGATLFATGKWVGV ILDEAKGKNDGTVQGRKYFTCDEGHGIFVRQSQIQVFEDGADTTSPETPDSSASKVLKRD HNSATQASAGELSSSEPSTPAQTPLAAPIIPTPVLTSPGAVPPLPSPSKEEEGLRAQVRD LEEKLETLRLKRAEDKAKLKELEKHKIQLEQVQEWKSKMQEQQADLQRRLKEARKEAKEA LEAKERYMEEMADTADAIEMATLDKEMAEERAESLQQEVEALKERVDELTTDLEILKAEI EEKGSDGAASSYQLKQLEEQNARLKDALVRMRDLSSSEKQEHVKLQKLMEKKNQELEVVR QQRERLQEELSQAESTIDELKEQVDAALGAEEMVEMLTDRNLNLEEKVRELRETVGDLEA MNEMNDELQENARETELELREQLDMAGARVREAQKRVEAAQETVADYQQTIKKYRQLTAH LQDVNRELTNQQEASVERQQQPPPETFDFKIKFAETKAHAKAIEMELRQMEVAQANRHMS LLTAFMPDSFLRPGGDHDCVLVLLLMPRLICKAELIRKQAQEKFELSENCSERPGLRGAA GEQLSFAAGLVYSLSLLQATLHRYEHALSQCSVDVYKKVGSLYPEMSAHERSLDFLIELL HKDQLDETVNVEPLTKAIKYYQHLYSIHLAEQPEDCTMQLADHIKFTQSALDCMSVEVGR LRAFLQGGQEATDIALLLRDLETSCSDIRQFCKKIRRRMPGTDAPGIPAALAFGPQVSDT LLDCRKHLTWVVAVLQEVAAAAAQLIAPLAENEGLLVAALEELAFKASEQIYGTPSSSPY ECLRQSCNILISTMNKLATAMQEGEYDAERPPSKPPPVELRAAALRAEITDAEGLGLKLE DRETVIKELKKSLKIKGEELSEANVRLSLLEKKLDSAAKDADERIEKVQTRLEETQALLR KKEKEFEETMDALQADIDQLEAEKAELKQRLNSQSKRTIEGLRGPPPSGIATLVSGIAGG AIPGQAPGSVPGPGLVKDSPLLLQQISAMRLHISQLQHENSILKGAQMKASLASLPPLHV AKLSHEGPGSELPAGALYRKTSQLLETLNQLSTHTHVVDITRTSPAAKSPSAQLMEQVAQ LKSLSDTVEKLKDEVLKETVSQRPGATVPTDFATFPSSAFLRAKEEQQDDTVYMGKVTFS CAAGFGQRHRLVLTQEQLHQLHSRLIS >gi568815596r:74122867_74415023|GENSCAN_predicted_CDS_7|3684_bp atggcacagagcaagaggcacgtgtacagccggacgcccagcggcagcaggatgagtgcg gaggcaagcgcccggcctctgcgggtgggctcccgtgtagaggtgattggaaaaggccac cgaggcactgtggcctatgttggagccacactgtttgccactggcaaatgggtaggcgtg attctggatgaagcaaagggcaaaaatgatggaactgttcaaggcaggaagtacttcact tgtgatgaagggcatggcatctttgtgcgccagtcccagatccaggtatttgaagatgga gcagatactacttccccagagacacctgattcttctgcttcaaaagtcctcaaaagagac cacaactcggcgacccaagcgtcagcaggtgagctgagcagcagtgagcccagcaccccg gctcagactccgctggcagcacccatcatccccacgccggtcctcacctctcctggagca gtccccccgcttccttccccatccaaggaggaggagggactaagggctcaggtgcgggac ctggaggagaaactagagaccctgagactgaaacgggcagaagacaaagcaaagctaaaa gagctggagaaacacaaaatccagctggagcaggtgcaggaatggaagagcaaaatgcag gagcagcaggccgacctgcagcggcgcctcaaggaggcgagaaaggaagccaaggaggcg ctggaggcaaaggaacgctatatggaggagatggctgatactgctgatgccattgagatg gccactttggacaaggagatggctgaagagcgggctgagtccctgcagcaggaggtggag gcactgaaggagcgggtggacgagctcactactgacttagagatcctcaaggctgagatt gaagagaagggctcagatggcgctgcatccagttatcagctcaagcagcttgaggagcag aatgcccgcctgaaggatgccctggtgaggatgcgggatctttcttcctcagagaagcag gagcatgtgaagctccagaagctcatggaaaagaagaaccaagagctggaagttgtgagg caacagcgggagcgtctgcaggaggagctaagccaggcagagagcaccattgatgagctc aaggagcaggtggatgctgctctgggtgctgaggagatggtggagatgctgacagatcgg aacctgaatctggaagagaaagtgcgcgagttgagggagactgtgggagacttggaagcg atgaatgagatgaacgatgagctgcaggagaatgcacgtgagacagaactggagctgcgg gagcagctggacatggcaggcgcgcgggttcgtgaggcccagaagcgtgtggaggcagcc caggagacggttgcagactaccagcagaccatcaagaagtaccgccagctgaccgcccat ctacaggatgtgaatcgggaactgacaaaccagcaggaagcatctgtggagaggcaacag cagccacctccagagacctttgacttcaaaatcaagtttgctgagactaaggcccatgcc aaggcaattgagatggaattgaggcagatggaggtggcccaggccaatcgacacatgtcc ctgctgacagccttcatgcctgacagcttccttcggccaggtggggaccatgactgcgtt ctggtgctgttgctcatgcctcgtctcatttgcaaggcagagctgatccggaagcaggcc caggagaagtttgaactaagtgagaactgttcagagcggcctgggctgcgaggagctgct ggggagcaactcagctttgctgctggactggtgtactcgctgagcctgctgcaggccacg ctacaccgctatgagcatgccctctctcagtgcagtgtggatgtgtataagaaagtgggc agcctgtaccctgagatgagtgcccatgagcgctccttggatttcctcattgaactgctg cacaaggatcagctggatgagactgtcaatgtggagcctctcaccaaggccatcaagtac tatcagcatctgtacagcatccaccttgccgaacagcctgaggactgtactatgcagctg gctgaccacattaagttcacgcagagtgctctggactgcatgagtgtggaggtaggacgg ctgcgtgccttcttgcagggtgggcaggaggctacagatattgccctcctgctccgggat ctggaaacttcatgcagtgacatccgccagttctgcaagaagatccgaaggcgaatgcca gggacagatgctcctgggatcccagctgcactggcctttggaccacaggtatctgacacg ctcctagactgcaggaaacacttgacgtgggtcgtggctgtgctgcaggaggtggcagct gctgctgcccagctcattgccccactggcagagaatgaggggctacttgtggctgctctg gaggaactggctttcaaagcaagcgagcagatctatgggaccccctccagcagcccctat gagtgtctgcgccagtcatgcaacatcctcatcagtaccatgaacaagctggccacagcc atgcaggagggggagtatgatgcagagcggccccccagcaagcctccaccggttgaactg cgggctgctgcccttcgtgcagagatcacagatgctgaaggcctgggtttgaagctcgaa gatcgagagacagttattaaggagttgaagaagtcactcaagattaagggagaggagcta agtgaggccaatgtgcggctgagcctcctggagaagaagttggacagtgctgccaaggat gcagatgagcgcatcgagaaagtccagactcggctggaggagacccaggcactgctgcga aagaaggagaaagagtttgaggagacaatggatgcactccaggctgacatcgaccagctg gaggcagagaaggcagaactaaagcagcgtctgaacagccagtccaaacgcacgattgag ggactccggggccctcctccttcaggcattgctactctggtctctggcattgctggtgga gccatccctgggcaggctccagggtctgtgccaggcccagggctggtgaaggactcacca ctgctgcttcagcagatctctgccatgaggctgcacatctcccagctccagcatgagaac agcatcctcaagggagcccagatgaaggcatccttggcatccctgccccctctgcatgtt gcaaagctatcccatgagggccctggcagtgagttaccagctggagcgctgtatcgtaag accagccagctgctggagacattgaatcaattgagcacacacacgcacgtagtagacatc actcgcaccagccctgctgccaagagcccgtcggcccaacttatggagcaagtggctcag cttaagtccctgagtgacaccgtcgagaagctcaaggatgaggtcctcaaggagacagta tctcagcgccctggagccacagtacccactgactttgccaccttcccttcatcagccttc ctcagggccaaggaggagcagcaggatgacacagtctacatgggcaaagtgaccttctca tgtgcggctggttttggacagcgacaccggctggtgctgacccaggagcagctgcaccag cttcacagtcgcctcatctcctaa >gi568815596r:74122867_74415023|GENSCAN_predicted_peptide_8|396_aa XLSSPSLESKLPLPNSRIRFLTTHPVLPDVARSRSPKLWPSVRWPSGWEGKAELLGELWA GRTRVPPQGLELADREGQDPGRWPRTTPPVLEATSQVMWKPVLLPEALKLAPGVSMWNRS TQVLLSSGVPEQEDKEGSTFPPVEQHPIQTGAPKPSISPAGPGSFCYVAVGCTQHPGLGR WLCLPYSGLLQLHVQLWQKSHPWDLQCCSTDLTGKIAIVTGANSGIGKVVSQDLARCGAQ VILTCQSRECGQQALAEIQAASNSNRLLLGEVDLSSMTSIRSFARRLLQENPEIHLLVNN AGVSGFRRHLPQGAWISPLSLTMLGPFCSQIYSKDLKQGVLPVLYLSLAEEPGGISGKYF SSSCVITLPVKASRDPHVAQSLWNASVRLTSLVKMD >gi568815596r:74122867_74415023|GENSCAN_predicted_CDS_8|1191_bp ngcctatcgtcgccatccctcgagtcaaagctgccactcccaaactccaggatccgcttc ctcaccacacacccggtgctccctgatgtggcccgcagccgcagccccaagctgtggccc agtgtcaggtggcccagcggttgggaggggaaggccgagctgctgggcgagctgtgggct ggccggacccgcgtgcctccacagggtctggagctggcagacagggagggccaggatcct ggcagatggcctcgaaccacacccccggtccttgaagccacttcccaggtgatgtggaag cccgtgttgctgccagaagccctgaagctggcccctggtgtgagcatgtggaaccggagc acccaggtgttgctcagctctggtgtgcctgaacaagaggacaaagaaggtagcaccttt cctcccgttgagcaacatcccatccagacaggtgccccaaagcccagcatttccccagca ggcccaggaagtttctgctatgttgctgtgggctgcactcagcatcctggtctggggcgc tggctctgtcttccttattctggtcttcttcaactacatgtgcagctctggcagaagtct catccctgggacctccagtgctgctccacagatctgactgggaaaatagccatagtgact ggggccaacagtggcatcgggaaggttgtatcccaggacctagctcggtgtggggcccaa gtgatccttacttgtcagagcagggaatgtggacagcaagccctggctgagatccaagca gcctcaaacagcaaccgcctcctgcttggcgaggtggaccttagctccatgacctctatt cggagctttgcccggaggcttctacaggagaatcctgagatacatctgctggtaaacaat gctggagtcagtggattccgaagacacttaccccagggggcctggatctcacctttgtca ctaactatgttgggccctttctgctcacaaatctactccaaggatctcaaacaaggtgta ctcccagtcctctacttgagcttggcagaggagccgggtggtatttctggaaaatatttc agcagttcctgtgtgataactcttcccgttaaagcctctcgggatcctcatgttgcccag agcctctggaatgcctcagtccgactgacaagcctagtcaagatggactga