GENSCAN 1.0 Date run: 3-Nov-116 Time: 08:50:02

Sequence gi568815588r:71651668_71873439 : 221772 bp : 52.85% C+G : Isochore 3 (51 - 57 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +    371    452   82  1  1   72   64    39 0.427   0.99
 1.02 Intr +   1575   1595   21  1  0  101   98     8 0.600   1.10
 1.03 Intr +   4373   4486  114  0  0  106   49    52 0.902   4.02
 1.04 Intr +   4735   4815   81  2  0  118   77    41 0.934   6.11
 1.05 Term +   7471   7613  143  2  2   58   49    99 0.552   1.40
 1.06 PlyA +   9885   9890    6                                1.05

 2.00 Prom +  14781  14820   40                               -3.91
 2.01 Init +  18946  19023   78  0  0   60   51    99 0.833   4.41
 2.02 Intr +  23445  23509   65  2  2  139   94   102 0.997  13.91
 2.03 Intr +  25789  26026  238  1  1   59   90   480 0.973  43.45
 2.04 Intr +  27720  27825  106  2  1  143   92   225 0.988  28.69
 2.05 Intr +  30778  30905  128  2  2   65   78   246 0.346  22.20
 2.06 Intr +  35980  36052   73  0  1  122   92   153 0.976  18.57
 2.07 Intr +  38801  38917  117  0  0   77   99   266 0.998  27.54
 2.08 Intr +  42480  42592  113  1  2  112   84   180 0.998  20.60
 2.09 Intr +  43571  43717  147  1  0   56   57    54 0.392   0.04
 2.10 Intr +  43751  43858  108  1  0   72   90   222 0.961  21.78
 2.11 Intr +  45639  45725   87  2  0   93   61    45 0.353   2.96
 2.12 Intr +  50355  50544  190  2  1   85   12   370 0.998  28.78
 2.13 Intr +  50882  51027  146  0  2  116   89   161 0.992  19.51
 2.14 Intr +  53244  53463  220  2  1  147   77   591 0.986  62.30
 2.15 Intr +  55230  55382  153  1  0   93   53   350 0.999  32.56
 2.16 Intr +  57431  57544  114  0  0  108   76   259 0.999  27.62
 2.17 Intr +  60998  61150  153  0  0  109   37   224 0.494  19.96
 2.18 Term +  62101  62195   95  2  2  125   41   -11 0.255  -3.81
 2.19 PlyA +  62272  62277    6                               -3.64

 3.02 PlyA -  62962  62957    6                               -1.75
 3.01 Sngl -  64670  64269  402  2  0   92   39   394 0.135  30.01
 3.00 Prom -  65171  65132   40                               -4.61

 4.00 Prom +  65203  65242   40                               -4.01
 4.01 Init +  68364  68450   87  2  0   95   30   112 0.790   6.70
 4.02 Intr +  72378  72438   61  2  1   66  115   122 0.968  11.50
 4.03 Intr +  73705  73853  149  2  2   74   72   276 0.968  25.06
 4.04 Intr +  76073  76228  156  1  0   83   41    55 0.550   1.02
 4.05 Intr +  77446  77541   96  0  0   76  107    -5 0.587   0.91
 4.06 Intr +  78802  78937  136  0  1  106   81   310 0.999  32.75
 4.07 Intr +  80320  80708  389  2  2   83   89   914 0.969  86.07
 4.08 Intr +  82190  82495  306  1  0    1   29   178 0.498   0.39
 4.09 Intr +  82573  82689  117  0  0  128   76   207 0.484  24.67
 4.10 Intr +  86831  86980  150  1  0   55   80   304 0.999  27.17
 4.11 Intr +  87977  88105  129  1  0  166   75   199 0.999  27.80
 4.12 Intr +  89155  89283  129  0  0  127   78   266 0.993  30.90
 4.13 Term +  90027  90317  291  2  0  109   55   476 0.460  42.09
 4.14 PlyA +  91501  91506    6                                1.05

 5.08 PlyA -  92003  91998    6                                1.05
 5.07 Term -  99623  99586   38  0  2  136   41    19 0.782  -0.21
 5.06 Intr - 100194 100001  194  2  2   69  106   104 0.985   9.96
 5.05 Intr - 101335 101308   28  2  1  146  110    50 0.972  10.66
 5.04 Intr - 103799 103692  108  0  0  103   78    86 0.811   9.86
 5.03 Intr - 109257 109201   57  1  0  119   78    30 0.851   4.45
 5.02 Intr - 110359 109931  429  2  0   59   94   643 0.865  56.26
 5.01 Init - 110928 110862   67  0  1   71   94   -29 0.552  -2.78
 5.00 Prom - 111661 111622   40                               -3.61

 6.00 Prom + 117466 117505   40                               -0.71
 6.01 Init + 118051 118104   54  0  0   67  111    30 0.891   4.46
 6.02 Intr + 121695 122000  306  2  0   70   31   227 0.393  12.39
 6.03 Intr + 126013 126234  222  0  0  101   49   610 0.998  57.25
 6.04 Intr + 126522 126641  120  2  0   98   76   220 0.994  22.99
 6.05 Intr + 127600 127780  181  0  1   62   92   296 0.999  27.36
 6.06 Intr + 132620 132753  134  0  2   49   79   244 0.998  20.47
 6.07 Intr + 133224 133433  210  2  0   74   66   432 0.999  39.23
 6.08 Intr + 133964 134071  108  1  0   36   76   188 0.997  13.38
 6.09 Intr + 137273 137375  103  1  1  118   82    78 0.996  10.45
 6.10 Intr + 138621 138746  126  1  0   69   93   278 0.998  27.56
 6.11 Intr + 139465 139668  204  2  0    3  107   334 0.874  26.50
 6.12 Intr + 141515 141973  459  0  0   98   87   926 0.996  87.24
 6.13 Intr + 145437 145553  117  1  0  113   91   123 0.956  16.04
 6.14 Intr + 146687 146911  225  0  0  112   96   489 0.998  50.58
 6.15 Intr + 147444 147613  170  1  2   16   94   358 0.998  29.38
 6.16 Intr + 147825 147962  138  2  0   85  101   193 0.992  21.47
 6.17 Intr + 148969 149088  120  0  0   79   94   174 0.999  18.29
 6.18 Intr + 151231 151408  178  0  1   86   71   237 0.989  21.81
 6.19 Intr + 151542 151753  212  1  2   33   63   383 0.994  29.46
 6.20 Intr + 154139 154330  192  1  0  101   66   563 0.987  55.61
 6.21 Intr + 154501 154614  114  0  0  110   55   216 0.785  21.65
 6.22 Intr + 155610 155739  130  2  1  126   75   251 0.996  28.37
 6.23 Intr + 155849 156100  252  0  0   91   61   447 0.973  40.24
 6.24 Intr + 156179 156340  162  0  0  120   45   366 0.997  35.96
 6.25 Intr + 158153 158409  257  0  2   70   51   768 0.983  69.00
 6.26 Intr + 158805 158902   98  2  2   65  117   216 0.999  21.51
 6.27 Intr + 159648 159768  121  0  1   42   80   288 0.999  24.40
 6.28 Intr + 159844 159923   80  0  2  113   56   155 0.963  13.64
 6.29 Intr + 160046 160086   41  2  2  133  119    56 0.998  11.55
 6.30 Intr + 160288 160348   61  2  1  129   92   162 0.999  18.98
 6.31 Intr + 160813 160942  130  1  1  108   58   195 0.999  19.70
 6.32 Intr + 161101 161223  123  0  0  105  105   238 0.999  28.49
 6.33 Intr + 161577 161681  105  2  0  100   53   114 0.947  10.01
 6.34 Term + 163285 163611  327  0  0   67   42   636 0.998  51.96
 6.35 PlyA + 164262 164267    6                               -1.75

 7.17 PlyA - 164651 164646    6                                1.05
 7.16 Term - 165554 165525   30  2  0  102   38    35 0.361  -1.96
 7.15 Intr - 166271 166176   96  2  0   88   70    25 0.338   1.41
 7.14 Intr - 167057 166950  108  2  0   75   82    50 0.930   4.08
 7.13 Intr - 167444 167364   81  2  0   93   85   202 0.997  20.73
 7.12 Intr - 167955 167798  158  1  2   57  101   226 0.959  21.04
 7.11 Intr - 168233 168047  187  2  1  109  105   260 0.995  29.58
 7.10 Intr - 168668 168573   96  2  0   81   69   138 0.996  11.91
 7.09 Intr - 170340 170209  132  0  0  101   77   190 0.998  20.55
 7.08 Intr - 174226 174170   57  1  0  107   78    79 0.991   8.37
 7.07 Intr - 176490 176347  144  0  0   75   47   230 0.997  18.59
 7.06 Intr - 177410 177210  201  2  0   62  116   204 0.983  20.60
 7.05 Intr - 179584 179459  126  1  0   88  115   145 0.999  18.58
 7.04 Intr - 180253 180179   75  1  0   65   83    60 0.889   3.41
 7.03 Intr - 182838 182705  134  1  2   85   89   123 0.058  12.97
 7.02 Intr - 199411 199199  213  2  0   91   92    50 0.883   4.91
 7.01 Init - 199554 199515   40  0  1   61  102   136 0.985  10.60
 7.00 Prom - 206321 206282   40                               -1.11

 8.03 PlyA - 206902 206897    6                                1.05
 8.02 Term - 209095 208860  236  2  2   82   45    80 0.292  -0.09
 8.01 Init - 213919 213901   19  1  1  113   93    13 0.359   4.00

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  34648  34722   75  0  0   83   76    54 0.845   2.77
S.002 Init - 182872 182705  168  1  0   67   89   191 0.907  14.63

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815588r:71651668_71873439|GENSCAN_predicted_peptide_1|146_aa MDFRQDRLLPQSEEAKKGEALKSPDWAALTPQGKDLSFLLSPRRRWAGGLLNALHRSGIP GDADRSPRTKAKEQNSALSGTKGALQMQIQRLFKAWNDSDNHSENTRAKTMDPLGQGTMG SCTSGQRQNEFVNAADQMPVQSPPVP >gi568815588r:71651668_71873439|GENSCAN_predicted_CDS_1|441_bp atggacttcagacaagacaggcttcttccccagtcggaggaagccaagaaaggagaggcc ctgaaaagtcctgactgggcagcattgaccccacaaggaaaagaccttagtttcctgctc agtccaaggagaagatgggctggaggccttctgaatgcccttcatagatcaggcatccct ggggatgcagaccgcagccccaggaccaaggccaaggagcagaactctgcactctctggc acaaagggtgccctccagatgcaaatccagcgactatttaaggcctggaatgacagtgac aatcactcagagaacaccagagcgaagacaatggaccccctgggccaggggacaatgggc agctgtacaagtggccagcgccagaatgagtttgtaaatgctgcagaccaaatgcctgtt cagagtccaccagtgccatga >gi568815588r:71651668_71873439|GENSCAN_predicted_peptide_2|776_aa MREQSGVYQGRGRYENTKPNQTVDTRATDNDAGTFGEVSYFFSDDPDRFSLDKDTGLIML IARLDYELIQRFTLTIIARDGGGEETTGRVRINVLDVNDNVPTFQKDAYVGALRENEPSV TQLVRLRATDEDSPPNNQITYSIVSASAFGSYFDISLYEGYGVISVSRPLDYEQISNGLI YLTVMAMDAGNPPLNSTVPVTIEVFDENDNPPTFSKPAYFVSVVENIMAGATVLFLNATD LDRSREYGQESIIYSLEGSTQFRINARSGEITTTSLLDRETKSEYILIVRAVDGGVGHNQ KTGIATRPGSFLPSKAQALPKAPKVGGAGRINAKIARGHLSTFPLAGPALAHLSWVNITL LDINDNHPTWKDAPYYINLVEMTPPDSDVTTEIHELSGNAYCMPGPTLVRLRVQEATPGQ VVAVDPDLGENGTLVYSIQPPNKFYSLNSTTGKIRTTHAMLDRENPDPHEAELMRKIVVS VTDCGRPPLKATSSATVFVNLLDLNDNDPTFQNLPFVAEVLEGIPAGVSIYQVVAIDLDE GLNGLVSYRMPVGMPRMDFLINSSSGVVVTTTELDRERIAEYQLRVVASDAGTPTKSSTS TLTIHVLDVNDETPTFFPAVYNVSVSEDVPREFRVVWLNCTDNDVGLNAELSYFITGGNV DGKFSVGYRDAVVRTVVGLDRETTAAYMLILEAIDNGPVGKRHTGTATVFVTVLDVNDNR PIFLQSSYEASVPEDIPEGHSILQAGLRAASTPTPTSRKTVPSGQQGLLVRWMATG
>gi568815588r:71651668_71873439|GENSCAN_predicted_CDS_2|2331_bp atgagagaacagtctggggtgtaccagggaagaggtagatacgagaacactaagcccaac cagactgtggacactagggcaactgacaatgatgcaggcacctttggggaagtcagctac ttcttcagtgatgaccctgacaggttctcgctggacaaggacacgggactcatcatgctg attgccaggctggactatgagctcatccagcgcttcaccctgacgatcattgcccgggac gggggcggcgaggagaccacaggccgggtcaggatcaatgtgttggatgtcaacgacaac gtgcccaccttccagaaggatgcctacgtgggtgctctgcgggagaacgagccttctgtc acacagctggtgcggctccgggcaacagatgaagactcccctcccaacaaccagatcacc tacagcattgtcagtgcatctgcctttggcagctacttcgacatcagcctgtacgagggc tatggagtgatcagcgtcagtcgccccctggattatgaacagatatccaatgggctgatt tatctgacggtcatggccatggatgctggcaacccccctctcaacagcaccgtccctgtc accatcgaggtgtttgatgagaatgacaaccctcccaccttcagcaagcccgcctacttc gtctccgtggtggagaacatcatggcaggagccacggtgctgttcctgaatgccacagac ctggaccgctcccgggagtacggccaggagtccatcatctactccttggaaggctccacc cagtttcggatcaatgcccgctcaggggaaatcaccaccacgtctctgcttgaccgagag accaagtctgaatacatcctcatcgttcgcgcagtggacgggggtgtgggccacaaccag aaaactggcatcgccacccggcctggctccttcctgccctcgaaagcccaggctctgccc aaggctcccaaggttggtggagctggcagaattaatgcgaagattgcccgtgggcatctg agcacttttcccctggcaggccctgccctggcccatctcagctgggtaaacatcaccctc ctggacatcaatgacaaccaccccacgtggaaggacgcaccctactacatcaacctggtg gagatgacccctccagactctgatgtgaccacggaaatccacgagctatcagggaatgcc tactgcatgccaggccccacgcttgtcaggctgagggtacaggaagcaacaccaggccag gtggtggctgttgacccagacctgggggagaatggcaccctggtgtacagcatccagcca cccaacaagttctacagcctcaacagcaccacgggcaagatccgcaccacccacgccatg ctggaccgggagaaccccgacccccatgaggccgagctgatgcgcaaaatcgtcgtctct gttactgactgtggcaggccccctctgaaagccaccagcagtgccacagtgtttgtgaac ctcttggatctcaatgacaatgaccccacctttcagaacctgccttttgtggccgaggtg cttgaaggcatcccggcgggggtctccatctaccaagtggtggccatcgacctcgatgag ggcctgaacggcctggtgtcctaccgcatgccggtgggcatgccccgcatggacttcctc atcaacagcagcagcggcgtggtggtcaccaccaccgagctggaccgcgagcgcatcgcg gagtaccagctgcgggtggtggccagtgatgcaggcacgcccaccaagagctccaccagc acgctcaccatccatgtgctggatgtgaacgacgagacgcccaccttcttcccggccgtg 
tacaatgtgtctgtgtccgaggacgtgccacgcgagttccgggtggtctggctgaactgc acggacaacgacgtgggcctcaatgcagagctcagctacttcatcacaggtggcaacgtg gatgggaagttcagcgtgggttaccgcgatgccgttgtgagaaccgtggtgggcctggac cgggagaccacagccgcctacatgctcatcctggaggccatcgacaacggccctgtaggg aagcgacacacgggcacagccaccgtgttcgtcactgtcctggatgtgaatgacaaccgg cccatctttctgcagagcagctatgaggccagcgtccctgaggacatccctgaaggccac agcatcttgcaggcaggcctcagggcagcctcaacccccaccccaacctccaggaagaca gtgcctagtggccagcagggcctgctggtgaggtggatggctacaggctga >gi568815588r:71651668_71873439|GENSCAN_predicted_peptide_3|133_aa MSTEGPSLASSPAISPLAFLSAPVTPGTLAEATDPLPMLIALACIFLLLATCLLFMTLCK PAALDPSRRRAHECMPHHPGSPSEPQLRLWKRLGSLRLSLHSFRHGRPTVPRQPLPGPED NRSHCDYMESTKM >gi568815588r:71651668_71873439|GENSCAN_predicted_CDS_3|402_bp atgagcacagagggccccagcctcgccagctccccagccatcagccccctcgcctttctc tcagctcccgtcactcccgggacccttgcagaggcaactgaccccctccccatgctcatc gccctggcctgcatcttcctcctgctggccacctgtctgctgttcatgacgctctgcaag ccggccgcgctggacccgagccgccgcagggctcacgagtgcatgccccaccaccctggg agccccagtgagccccagctccggctctggaagcgcctgggctccttgcgcctctccctg cacagcttccgccatggccggcccaccgtccctcgacagcccctgccgggccccgaggac aaccgcagccactgtgactacatggaatctaccaagatgtaa >gi568815588r:71651668_71873439|GENSCAN_predicted_peptide_4|731_aa MALTFPCRKFEWYGRRQPEVRYSVPASHQLKATDADEGEFGRVWYRILHGNHGNNFRIHV SNGLLMRGPRPLDRERNSSHVLIVEAYNHDLGPMRSSVRMRKLRQSTALAQHWTGTALDR KQVWFSRKPSCSHLLSKNLTDLCFLYICASQAKATMPGLHKHLLNCITFMMYSAHIHKEN LVLVIVYVEDINDEAPVFTQQQYSRLGLRETAGIGTSVIVVQATDRDSGDGGLVNYRILS GAEGKFEIDESTGLIITVNYLDYETKTSYMMNVSATDQAPPFNQGFCSVYITLLNELDEA VQFSNASYEAAILENLALGTEIVRVQAYSIDNLNQITYRFNAYTSTQAKALFKIDAITRI LGTQMDTKMNKTLLSPQRVLRLEVEMELIQDANQSATRRCAENYNRGVVEPLRAQQSYLA GEAGRLHGRGGFPVECEREEGIQQTECPGEVMPDRGSDMEGVITVQGLVDREKGDFYTLT VVADDGGPKVDSTVVSGTRVYITVLDENDNSPRFDFTSDSAVSIPEDCPVGQRVATVKAW DPDAGSNGQVVFSLASGNIAGAFEIVTTNDSIGEVFVARPLDREELDHYILQVVASDRGT PPRKKDHILQVTILDINDNPPVIESPFGYNVSVNENVGGGTAVVQVRATDRDIGINSVLS YYITEGNKDMAFRMDRISGEIATRPAPPDRERQSFYHLVATVEDEGTPTLSVSDGGGHRE ERVGQGSSASE 
>gi568815588r:71651668_71873439|GENSCAN_predicted_CDS_4|2196_bp atggcgctcaccttcccctgtcggaagtttgaatggtatggtcggcggcagccagaggtt cgctattcagtgcctgccagccaccagctgaaagccacggacgcagatgagggcgagttt gggcgtgtgtggtaccgcatcctccatggtaaccatggcaacaacttccggatccatgtc agcaatgggctcctgatgcgagggccccggcccctggaccgggagcggaactcatcccac gtgctgatagtggaggcctacaaccacgacctgggccccatgcggagctccgtcaggatg aggaaactgaggcaaagcactgcactggcacagcactggacaggcacagcactggacaga aagcaggtctggttctccaggaagcccagttgctcccacttgctcagcaagaacctgaca gacctctgcttcctgtacatctgtgcctctcaggcaaaagccaccatgcctggcctccat aaacaccttttaaactgtattacgttcatgatgtacagtgcacacattcataaagagaac ttagtgctggtgattgtgtacgtggaggacatcaacgatgaggcccccgtgttcacacag cagcagtacagccgtctggggcttcgagagaccgcaggcattggaacgtcagtcatcgtg gtccaagccacagaccgagactctggggatggtggcctggtgaactaccgcatcctgtcg ggcgcagaggggaagtttgagattgacgagagcacagggcttatcatcaccgtgaattac ctggactacgagaccaagaccagctacatgatgaatgtgtcggccactgaccaggccccg cccttcaaccagggcttctgcagcgtctacatcactctgctcaacgagctggacgaggcc gtgcagttctccaatgcctcatacgaggctgccatcctggagaatctggcactgggtact gagattgtgcgggtccaggcctactccatcgacaacctcaaccaaatcacgtaccgcttc aacgcctacaccagcacccaggccaaagccctcttcaagatagacgccatcacgaggata ctgggaacacaaatggacacaaagatgaacaagacccttctcagtcctcaaagagttctc aggctagaagtggaaatggaactaatccaggatgcaaatcagagcgccacaagaagatgt gcagagaactacaacagaggggtagtggagcccctcagggcccagcagtcatacctggct ggggaagcaggaaggcttcatggaagaggtggcttcccagtggaatgtgagagagaagaa ggaattcaacagacagagtgtcctggggaagttatgccggacagaggaagtgacatggag ggtgtgatcacagtccagggcctggtggaccgtgagaagggcgacttctataccttgaca gtggtggcagatgacggcggccccaaggtggactccaccgtggtgagtgggaccagggtc tacatcactgtgctggacgagaatgacaacagcccccggtttgacttcacctccgactcg gcggtcagcatacccgaggactgccctgtgggccagcgagtggctactgtcaaggcctgg gaccctgatgctggcagcaatgggcaggtggtcttctccctggcctctggcaacatcgcg ggggcctttgagatcgtcaccaccaatgactccattggcgaagtgtttgtggccaggccc ctggacagagaagagctggatcactacatcctccaggttgtggcttctgaccgaggcacc cctccacggaagaaggaccacatcctgcaggtgaccatcctggacatcaatgacaaccct 
ccagtcatcgagagcccctttggatacaatgtcagtgtgaatgagaacgtgggtggaggt actgctgtggtccaggtgagagccactgaccgtgacatcgggatcaacagtgttctgtcc tactacatcaccgagggcaacaaggacatggccttccgcatggaccgcatcagcggtgag atcgccacacggcctgccccgcctgaccgcgagcgccagagcttctaccacctggtggcc actgtggaggacgagggcaccccaaccctgtcggtgagcgatgggggtggccacagggag gagcgggtgggccagggcagctccgcctccgagtga >gi568815588r:71651668_71873439|GENSCAN_predicted_peptide_5|306_aa MGFCCRECRPRLLKGRPCIQHAGPVAAFKVATPYSLYVCPEGQNVTLTCRLLGPVDKGHD VTFYKTWYRSSRGEVQTCSERRPIRNLTFQDLHLHHGGHQAANTSHDLAQRHGLESASDH HGNFSITMRNLTLLDSGLYCCLVVEIRHHHSEHRVHGAMELQVQTGKDAPSNCVVYPSSS QDSENITAAALATGACIVGILCLPLILLLVYKQRQAASNRRAQELVRMDSNIQGIENPGF EASPPAQGIPEAKVRHPLSYVAQRQPSESGRHLLSEPSTPLSPPGPGDVFFPSLDPVPDS PNFEVI >gi568815588r:71651668_71873439|GENSCAN_predicted_CDS_5|921_bp atggggttctgctgtcgggaatgcaggcctcgtctgctcaaggggaggccatgcattcag catgcaggtccggtggcagccttcaaggtcgccacgccgtattccctgtatgtctgtccc gaggggcagaacgtcaccctcacctgcaggctcttgggccctgtggacaaagggcacgat gtgaccttctacaagacgtggtaccgcagctcgaggggcgaggtgcagacctgctcagag cgccggcccatccgcaacctcacgttccaggaccttcacctgcaccatggaggccaccag gctgccaacaccagccacgacctggctcagcgccacgggctggagtcggcctccgaccac catggcaacttctccatcaccatgcgcaacctgaccctgctggatagcggcctctactgc tgcctggtggtggagatcaggcaccaccactcggagcacagggtccatggtgccatggag ctgcaggtgcagacaggcaaagatgcaccatccaactgtgtggtgtacccatcctcctcc caggatagtgaaaacatcacggctgcagccctggctacgggtgcctgcatcgtaggaatc ctctgcctccccctcatcctgctcctggtctacaagcaaaggcaggcagcctccaaccgc cgtgcccaggagctggtgcggatggacagcaacattcaagggattgaaaaccccggcttt gaagcctcaccacctgcccaggggatacccgaggccaaagtcaggcaccccctgtcctat gtggcccagcggcagccttctgagtctgggcggcatctgctttcggagcccagcaccccc ctgtctcctccaggccccggagacgtcttcttcccatccctggaccctgtccctgactct ccaaactttgaggtcatctag >gi568815588r:71651668_71873439|GENSCAN_predicted_peptide_6|1859_aa MGNKILRRKAKGMGLLGLGRSQEESEEQGSPAPAAGLQGRGDAHVAVGRAEELLVPGSGR DAAGAGKPPATECERVSAAGGPSAVLQAPAPFAQRPRGAVWGNCLSEGRVGGASRRPAPW ATTHVYVTIVDENDNAPMFQQPHYEVLLDEGPDTLNTSLITIQALDLDEGPNGTVTYAIV 
AGNIVNTFRIDRHMGVITAAKELDYEISHGRYTLIVTATDQCPILSHRLTSTTTVLVNVN DINDNVPTFPRDYEGPFEVTEGQPGPRVWTFLAHDRDSGPNGQVEYSIMDGDPLGEFVIS PVEGVLRVRKDVELDRETIAFYNLTICARDRGMPPLSSTMLVGIRVLDINDNDPVLLNLP MNITISENSPVSSFVAHVLASDADSGCNARLTFNITAGNRERAFFINATTGIVTVNRPLD RERIPEYKLTISVKDNPENPRIARRDYDLLLIFLSDENDNHPLFTKSTYQAEVMENSPAG TPLTVLNGPILALDADQDIYAVVTYQLLGAQSGLFDINSSTGVVTVRSGVIIDREAFSPP ILELLLLAEDIGLLNSTAHLLITILDDNDNRPTFSPATLTVHLLENCPPGFSVLQVTATD EDSGLNGELVYRIEAGAQDRFLIHLVTGVIRVGNATIDREEQESYRLTVVATDRGTVPLS GTAIVTILIDDINDSRPEFLNPIQTVSVLESAEPGTVIANITAIDHDLNPKLEYHIVGIV AKDDTDRLVPNQEDAFAVNINTGSVMVKSPMNRELVATYEVTLSVIDNASDLPERSVSVP NAKLTVNVLDVNDNTPQFKPFGITYYMERILEGATPGTTLIAVAAVDPDKGLNGLVTYTL LDLVPPGYVQLEDSSAGKVIANRTVDYEEVHWLNFTVRASDNGSPPRAAEIPVYLEIVDI NDNNPIFDQPSYQEAVFEDVPVGTIILTVTATDADSGNFALIEYSLGDGESKFAINPTTG DIYVLSSLDREKKDHYILTALAKDNPGDVASNRRENSVQVVIQVLDVNDCRPQFSKPQFS TSVYENEPAGTSVITMMATDQDEGPNGELTYSLEGPGVEAFHVDMDSGLVTTQRPLQSYE KFSLTVVATDGGEPPLWGTTMLLVEVIDVNDNRPVFVRPPNGTILHIREEIPLRSNVYEV YATDKDEGLNGAVRYSFLKTAGNRDWEFFIIDPISGLIQTAQRLDRESQAVYSLILVASD LGQPVPYETMQPLQVALEDIDDNEPLFVRPPKGSPQYQLLTVPEHSPRGTLVGNVTGAVD ADEGPNAIVYYFIAAGNEEKNFHLQPDGCLLVLRDLDREREAIFSFIVKASSNRSWTPPR GPSPTLDLVADLTLQEVRVVLEDINDQPPRFTKAEYTAGVATDAKVGSELIQVLALDADI GNNSLVFYSILAIHYFRALANDSEDVGQVFTMGSMDGILRTFDLFMAYSPGYFVVDIVAR DLAGHNDTAIIGIYILRDDQRVKIVINEIPDRVRGFEEEFIHLLSNITGAIVNTDNVQFH VDKKGRVNFAQTELLIHVVNRDTNRILDVDRVIQMIDENKEQLRNLFRNYNVLDVQPAIS VRLPDDMSALQMAIIVLAILLFLAAMLFVLMNWYYRTVHKRKLKAIVAGSAGNRGFIDIM DMPNTNKYSFDGANPVWLDPFCRNLELAAQAEHEDDLPENLSEIADLWNSPTRTHGTFGR EPAAVKPDDDRYLRAAIQEYDNIAKLGQIIREGPIKGSLLKVVLEDYLRLKKLFAQRMVQ KASSCHSSISELIQTELDEEPGDHSPGQGSLRFRHKPPVELKGPDGIHVVHGSTGTLLAT DLNSLPEEDQKGLGRSLETLTAAEATAFERNARTESAKSTPLHKLRDVIMETPLEITEL >gi568815588r:71651668_71873439|GENSCAN_predicted_CDS_6|5580_bp atgggaaataagatcctacgcagaaaggctaaaggaatgggcttgcttggcttgggacgc agccaggaagagagcgaagagcagggatccccagcgccagctgccggcctccagggccgt ggggacgcccatgtcgccgtcggacgcgcagaggaacttctggtgccggggagcgggcgg 
gacgcggccggcgcggggaagcctcccgcgactgagtgcgagcgagtgagcgctgcgggc ggccccagcgccgtgctccaggcacccgcccccttcgcgcagcgcccccggggggccgtg tgggggaactgcctctccgagggccgcgtgggaggggcttcccggaggccggccccgtgg gccaccacgcacgtgtacgtgaccattgtggatgagaatgataacgcgcccatgttccag cagccccactatgaggtgctgctggatgagggcccagacacgctcaacaccagcctcatc accatccaggcactggacctggatgagggtcccaacggcacagtcacctatgccatcgtc gcaggcaacatcgtcaacaccttccgcatcgacagacacatgggtgtcatcactgctgcc aaagagctggactacgagatcagccacggccgctacaccctgatcgtcactgccacagac cagtgccccatcttatcccaccgcctcacctctaccaccacggtgcttgtgaatgtgaat gacatcaacgacaatgtgcctaccttcccccgggactatgagggaccatttgaagtcact gagggccagccggggcccagagtgtggaccttcctggcccatgaccgagactcaggaccc aacgggcaggtggagtacagcatcatggatggagaccctctgggggagtttgtgatctct cctgtggagggggtgctaagggtccggaaggacgtggagctggaccgggagaccatcgcc ttctacaacctgaccatctgtgcccgtgaccgggggatgcccccactcagctccacaatg ctggtggggatccgggtgctggacatcaacgacaacgaccctgtgctgctgaacctgccc atgaacatcaccatcagcgagaacagccctgtctccagctttgtcgcccatgtcctggcc agtgacgctgacagtggctgcaatgcacgcctcaccttcaacatcactgcgggcaaccgc gagcgggccttcttcatcaatgccacgacagggatcgtcactgtgaaccggcccctggac cgcgagcggatcccagagtacaagctgaccatttctgtgaaggacaacccggagaatcca cgcatagccaggagggattatgacttgcttctgatcttcctttctgatgagaatgacaac caccccctcttcactaaaagcacctaccaggcagaggtgatggaaaactctcccgctggc acccctctcacggtgctcaatgggcccatcctggccctggatgcagaccaagacatctac gccgtggtgacctaccagctgctgggtgcccagagtggcctctttgacatcaacagcagc accggtgtggtgaccgtgaggtcaggtgtcatcattgaccgggaggcattctcgccaccc atcctggagctgctgctgctggctgaggacatcgggctgctcaacagcacggcccacctg ctcatcaccatcctggatgacaatgacaaccggcccacctttagccctgccaccctcact gtccatctgctagagaactgcccgcctggattctcagtccttcaagtcacagccacagat gaggacagtggcctcaatggggagctggtctaccgaatagaagctggggctcaggaccgc ttcctcattcatctggtcaccggggtcatccgtgttggtaatgccaccatcgacagagag gagcaggagtcctacaggctaacggtggtggccaccgaccggggcaccgttcctctctcg ggcacagccattgtcaccattctgatcgatgacatcaatgactcccgccccgagttcctc aaccccatccagacagtgagcgtgctggagtcggctgagccaggcactgtcattgccaat 
atcacggccattgaccacgacctcaacccaaagctagagtaccacattgtcggcattgtg gccaaggacgacactgatcgcctggtgcccaaccaggaggacgcctttgctgtgaatatc aacacaggatctgtaatggtgaagtcccccatgaatcgggagctggttgccacctatgag gtcactctctcagtgattgacaatgccagcgacctaccagagcgctctgtcagtgtgcca aatgccaagctgactgtcaacgtcctggacgtcaatgacaatacgccccagttcaagccc tttgggatcacctactacatggagcggatcctggagggggccacccctgggaccacactc attgctgtggcagccgtggaccctgacaagggccttaatgggctggtcacctacaccctg ctggacctggtgcccccagggtatgtccagctggaggactcctcggcagggaaggtcatt gccaaccggacagtggactacgaggaggtgcactggctcaactttaccgtgagggcctca gacaacgggtccccgccccgggcagctgagatccctgtctacctggaaatcgtggacatc aatgacaacaaccccatctttgaccagccctcctaccaggaggctgtctttgaggatgtg cctgtgggcacaatcatcctgacagtcactgccactgatgctgactcaggcaactttgca ctcattgagtacagccttggagatggagagagcaagtttgccatcaaccccaccacgggt gacatctatgtgctgtcttctctggaccgggagaagaaggaccactatatcctgactgcc ttggccaaagacaaccctggggatgtagccagcaaccgtcgcgaaaattcagtgcaggtg gtgatccaagtgctggatgtcaatgactgccggccacagttctccaagccccagttcagc acaagcgtgtatgagaatgagccggcgggcacctcggtcatcaccatgatggccactgac caggatgaaggtcccaatggagagttgacctactcacttgagggccctggcgtggaggcc ttccatgtggacatggactcgggcttggtgaccacacagcggccactgcagtcctacgag aagttcagtctgaccgtggtggccacagatggtggagagcccccactctggggcaccacc atgctcctggtggaggtcatcgacgtcaatgacaaccgccctgtctttgtgcgcccaccc aacggcaccatcctccacatcagagaggagatcccgctgcgctccaacgtgtacgaggtc tacgccacggacaaggatgagggcctcaacggggcggtgcgctacagcttcctgaagact gcgggcaaccgggactgggagttcttcatcatcgacccaatcagcggcctcatccagact gctcagcgcctggaccgcgagtcgcaggcggtgtacagcctcatcttggtggccagcgac ctgggccagccagtgccatacgagactatgcagccgctgcaggtggccctggaggacatc gatgacaacgaaccccttttcgtgaggcctccaaaaggcagcccccagtaccagctgctg acagtgcctgagcactcaccacgcggcaccctcgtgggcaacgtgacaggcgcagtggat gcagatgagggccccaacgcgatcgtgtactacttcatcgcagccggcaacgaagagaag aacttccatctgcagcccgatgggtgtctgctggtgctgcgggacctggaccgggagcga gaagccatcttctccttcatcgtcaaggcctccagcaatcgcagctggacacctccccgt ggaccctccccaaccctcgacctggttgctgacctcacactgcaggaggtgcgcgttgtg 
ctagaggacatcaacgaccagccaccacgcttcaccaaggctgagtacactgcaggggtg gccaccgacgccaaggtgggctcagagttgatccaggtgctggccctggatgcagacatt ggcaacaacagccttgtcttctacagcattctggccatccactacttccgggcccttgcc aacgactctgaagatgtgggccaggtcttcaccatggggagcatggacggcattctgcgc accttcgacctcttcatggcctacagccccggctacttcgtggtggacattgtggcccga gacctggcaggccacaacgacacggccatcatcggcatctacatcctgagggacgaccag cgcgtcaagatcgtcattaacgagatccccgaccgtgtgcgcggcttcgaggaggagttc atccacctgctctccaacatcactggggccattgtcaatactgacaatgtgcagttccat gtggacaagaagggccgggtgaactttgcgcagacagaactgcttatccacgtggtgaac cgcgataccaaccgcatcctggacgtggaccgggtgatccagatgatcgatgagaacaag gagcagctacggaatcttttccggaactacaacgtcctggacgtgcagcctgccatctct gtccggctgccggatgacatgtctgccctgcagatggcgatcatcgtcctggctatcctc ctgttcctggccgccatgctctttgtcctcatgaactggtactacaggactgtacacaag aggaagctcaaggccattgtggctggctcagctgggaatcgtggcttcatcgacatcatg gacatgcctaacaccaacaagtactcctttgatggagccaaccctgtgtggctggatccc ttctgtcggaacctggagctggccgcccaggcggagcatgaggatgacctaccggagaac ctgagtgagatcgccgacctgtggaacagccccacgcgcacccatggaacttttgggcgt gagccagcagctgtcaagcctgatgatgaccgatacctgcgggctgccatccaggagtat gacaacattgccaagctgggccagatcattcgtgaggggccaatcaagggctcgctgctg aaggtggtcctggaggattacctgcggctcaaaaagctctttgcacagcggatggtgcaa aaagcctcctcctgccactcctccatctctgagctgatacagactgagctggacgaggag ccaggagaccacagcccagggcagggtagcctgcgcttccgccacaagccaccagtggag ctcaaggggcccgatgggatccatgtggtgcacggcagcacgggcacgctgctggccacc gacctcaacagcctgcccgaggaagaccagaagggcctgggccgctcgctggagacgctg accgctgccgaggccactgccttcgagcgcaacgcccgcacagaatccgccaaatccaca cccctgcacaaacttcgcgacgtgatcatggagacccccctggagatcacagagctgtga >gi568815588r:71651668_71873439|GENSCAN_predicted_peptide_7|625_aa MYALFLLASLLGAVGKGARALRRRVRWLSLAFSGRLGAAGGGLRKCPREDGPGPGQHLLR GSSGFLRGWGPEAFPQAALPTVGVALAGPVLGLKECTRGSAVWCQNVKTASDCGAVKHCL QTVWNKPTVKSLPCDICKDVVTAAGDMLKDNATEEEILVYLEKTCDWLPKPNMSASCKEI VDSYLPVILDIIKGEMSRPGEVCSALNLCESLQKHLAELNHQKQLESNKIPELDMTEVVA PFMANIPLLLYPQDGPRSKPQPKDNGDVCQDCIQMVTDIQTAVRTNSTFVQALVEHVKEE 
CDRLGPGMADICKNYISQYSEIAIQMMMHMQPKEICALVGFCDEVKEMPMQTLVPAKVAS KNVIPALELVEPIKKHEVPAKSDVYCEVCEFLVKEVTKLIDNNKTEKEILDAFDKMCSKL PKSLSEECQEVVDTYGSSILSILLEEVSPELVCSMLHLCSGTRLPALTVHVTQPKDGGFC EVCKKLVGYLDRNLEKNSTKQEILAALEKGCSFLPDPYQKQCDQFVAEYEPVLIEILVEV MDPSFVCLKIGACPSAHKPLLGTEKCIWGPSYWCQNTETAAQCNLGSSPSPPPAPPHSWF CCCPGEMWACFSRPELMDVDALEVF >gi568815588r:71651668_71873439|GENSCAN_predicted_CDS_7|1878_bp atgtacgccctcttcctcctggccagcctcctgggcgcggttggcaagggggcgcgcgca ctgcgcaggcgcgttcgctggctttctctggctttctctgggcggctgggggctgcgggg ggcgggctgcgaaaatgcccccgggaggacggcccaggccctgggcagcatctccttcgg ggctcgagtgggttcctgcgcggctggggccccgaggccttccctcaggctgcgcttccc actgtgggggtggctctagccggcccggtccttggactgaaagaatgcaccaggggctcg gcagtgtggtgccagaatgtgaagacggcgtccgactgcggggcagtgaagcactgcctg cagaccgtttggaacaagccaacagtgaaatcccttccctgcgacatatgcaaagacgtt gtcaccgcagctggtgatatgctgaaggacaatgccactgaggaggagatccttgtttac ttggagaagacctgtgactggcttccgaaaccgaacatgtctgcttcatgcaaggagata gtggactcctacctccctgtcatcctggacatcattaaaggagaaatgagccgtcctggg gaggtgtgctctgctctcaacctctgcgagtctctccagaagcacctagcagagctgaat caccagaagcagctggagtccaataagatcccagagctggacatgactgaggtggtggcc cccttcatggccaacatccctctcctcctctaccctcaggacggcccccgcagcaagccc cagccaaaggataatggggacgtttgccaggactgcattcagatggtgactgacatccag actgctgtacggaccaactccacctttgtccaggccttggtggaacatgtcaaggaggag tgtgaccgcctgggccctggcatggccgacatatgcaagaactatatcagccagtattct gaaattgctatccagatgatgatgcacatgcaacccaaggagatctgtgcgctggttggg ttctgtgatgaggtgaaagagatgcccatgcagactctggtccccgccaaagtggcctcc aagaatgtcatccctgccctggaactggtggagcccattaagaagcacgaggtcccagca aagtctgatgtttactgtgaggtgtgtgaattcctggtgaaggaggtgaccaagctgatt gacaacaacaagactgagaaagaaatactcgacgcttttgacaaaatgtgctcgaagctg ccgaagtccctgtcggaagagtgccaggaggtggtggacacgtacggcagctccatcctg tccatcctgctggaggaggtcagccctgagctggtgtgcagcatgctgcacctctgctct ggcacgcggctgcctgcactgaccgttcacgtgactcagccaaaggacggtggcttctgc gaagtgtgcaagaagctggtgggttatttggatcgcaacctggagaaaaacagcaccaag caggagatcctggctgctcttgagaaaggctgcagcttcctgccagacccttaccagaag 
cagtgtgatcagtttgtggcagagtacgagcccgtgctgatcgagatcctggtggaggtg atggatccttccttcgtgtgcttgaaaattggagcctgcccctcggcccataagcccttg ttgggaactgagaagtgtatatggggcccaagctactggtgccagaacacagagacagca gcccagtgcaatctaggctccagcccatctccaccgccagccccaccccacagctggttc tgctgctgtcctggggaaatgtgggcctgcttctcaaggcccgagctgatggatgttgat gcactggaggtcttttag >gi568815588r:71651668_71873439|GENSCAN_predicted_peptide_8|84_aa MGMGLPDGWLRPESLLKSVQNVSEHFSGRVFPAFIRFSRGSDLTNLQTATVTQQLPACPT ALLLLTHQLMQGLLGSSCSPQESS >gi568815588r:71651668_71873439|GENSCAN_predicted_CDS_8|255_bp atggggatggggctgccagatgggtggctgaggcctgagagcctcctgaagtcagtgcaa aatgtgtctgagcatttttctggcagagtgttcccagctttcatcagattctctaggggc tctgacctcaccaacttgcagactgctacagtgactcagcagctcccggcctgccccact gccctgctgctgctgactcaccagctcatgcaaggccttcttggcagctcctgctctccc caagagagcagctga