GENSCAN 1.0 Date run: 3-Nov-116 Time: 13:41:50 Sequence gi568815577f:42418455_42679800 : 261346 bp : 49.50% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 79 155 77 0 2 1 109 105 0.652 2.21 1.02 Intr + 4773 4943 171 0 0 37 66 93 0.497 1.06 1.03 Intr + 8243 8366 124 2 1 123 90 75 0.995 11.79 1.04 Intr + 11479 11647 169 0 1 47 67 159 0.991 9.22 1.05 Intr + 13071 13195 125 1 2 52 47 56 0.623 -1.80 1.06 Intr + 13649 13748 100 1 1 82 78 100 0.986 8.08 1.07 Intr + 16378 16500 123 2 0 86 91 42 0.885 4.86 1.08 Intr + 19034 19126 93 0 0 82 86 78 0.897 6.94 1.09 Term + 21727 21872 146 2 2 75 55 125 0.542 5.97 1.10 PlyA + 21898 21903 6 1.05 2.00 Prom + 23463 23502 40 -2.26 2.01 Init + 23638 23656 19 0 1 54 68 31 0.056 -2.07 2.02 Intr + 23998 24142 145 2 1 64 85 147 0.108 11.34 2.03 Intr + 24858 24964 107 0 2 100 98 117 0.957 13.76 2.04 Intr + 26080 26189 110 2 2 91 107 65 0.619 8.70 2.05 Intr + 28855 28974 120 0 0 39 68 79 0.138 1.69 2.06 Intr + 32502 32639 138 2 0 20 81 114 0.171 4.56 2.07 Intr + 34003 34171 169 0 1 76 89 39 0.108 2.32 2.08 Intr + 35425 35592 168 2 0 31 72 90 0.256 1.62 2.09 Term + 37685 37722 38 0 2 92 43 41 0.154 -2.50 2.10 PlyA + 37883 37888 6 1.05 3.00 Prom + 39077 39116 40 -3.86 3.01 Init + 48885 49061 177 2 0 62 32 212 0.306 12.26 3.02 Term + 52971 53111 141 2 0 83 44 89 0.238 1.93 3.03 PlyA + 54049 54054 6 1.05 4.10 PlyA - 54064 54059 6 1.05 4.09 Term - 54416 54364 53 0 2 128 44 58 0.905 2.99 4.08 Intr - 57593 57444 150 0 0 95 72 298 0.866 29.03 4.07 Intr - 62909 62861 49 2 1 -11 81 68 0.339 -5.55 4.06 Intr - 63499 63476 24 1 0 73 111 31 0.778 2.12 4.05 Intr - 64254 64183 72 0 0 78 115 15 0.839 2.80 4.04 Intr - 67350 67215 136 2 1 64 101 95 0.929 8.97 4.03 Intr - 68007 67917 91 1 1 108 58 123 0.766 10.35 4.02 Intr - 74459 74304 156 0 0 32 93 97 0.121 4.58 4.01 Init - 77732 77666 67 2 1 83 85 97 0.747 10.13 4.00 Prom - 84577 84538 40 -6.06 5.06 PlyA - 85615 85610 6 1.05 5.05 Term - 90951 90911 41 1 2 60 47 79 0.119 -1.65 5.04 Intr - 95795 95531 265 2 1 40 80 152 0.577 6.69 5.03 Intr - 96274 95906 369 1 0 31 35 255 0.463 9.80 5.02 Intr - 99834 99741 94 2 1 82 44 69 0.522 1.87 5.01 Init - 100315 100173 143 1 2 95 78 77 0.754 7.01 5.00 Prom - 109559 109520 40 -4.76 6.00 Prom + 110813 110852 40 -2.26 6.01 Init + 113401 113448 48 0 0 55 55 51 0.310 -0.36 6.02 Intr + 115255 115473 219 0 0 32 60 124 0.604 2.50 6.03 Intr + 115672 115791 120 0 0 119 31 44 0.835 2.49 6.04 Intr + 116244 116376 133 2 1 76 111 86 0.928 9.92 6.05 Intr + 117018 117096 79 1 1 23 96 155 0.983 8.41 6.06 Intr + 121058 121193 136 2 1 92 110 213 0.998 24.47 6.07 Intr + 123950 124026 77 1 2 126 100 136 0.999 16.91 6.08 Intr + 124982 125148 167 2 2 96 111 146 0.999 17.30 6.09 Intr + 128649 128686 38 1 2 114 103 39 0.952 5.98 6.10 Intr + 135608 135688 81 1 0 70 119 39 0.546 5.03 6.11 Intr + 140504 140635 132 1 0 62 100 148 0.796 14.24 6.12 Intr + 143624 143714 91 1 1 91 92 59 0.956 6.17 6.13 Intr + 145361 145423 63 0 0 75 97 74 0.954 5.69 6.14 Intr + 146254 146339 86 2 2 120 61 76 0.996 7.64 6.15 Intr + 146756 146851 96 1 0 96 77 1 0.499 0.01 6.16 Intr + 147373 147421 49 0 1 103 109 67 0.999 8.55 6.17 Intr + 148531 148604 74 2 2 97 105 142 0.635 15.93 6.18 Intr + 149906 149980 75 1 0 137 34 141 0.645 13.31 6.19 Intr + 150498 150768 271 2 1 34 76 122 0.390 2.71 6.20 Intr + 154512 154652 141 1 0 61 71 54 0.675 1.32 6.21 Intr + 156364 156461 98 2 2 59 119 52 0.866 5.13 6.22 Intr + 161029 161161 133 0 1 56 48 57 0.488 -1.28 6.23 Intr + 161708 161815 108 0 0 100 66 16 0.552 0.86 6.24 Intr + 162319 162537 219 2 0 21 72 220 0.357 11.97 6.25 Intr + 169059 172112 3054 1 0 66 19 1602 0.622 138.47 6.26 Intr + 174666 174885 220 1 1 71 105 113 0.394 8.66 6.27 Intr + 182307 182397 91 0 1 69 59 60 0.170 1.20 6.28 Intr + 187738 188013 276 0 0 56 -22 193 0.065 2.81 6.29 Term + 191467 191574 108 0 0 81 55 114 0.572 5.91 6.30 PlyA + 192213 192218 6 1.05 7.00 Prom + 195972 196011 40 -3.86 7.01 Init + 198644 198666 23 1 2 84 80 33 0.743 -0.22 7.02 Intr + 198747 198901 155 0 2 74 50 185 0.712 13.02 7.03 Intr + 207728 207913 186 0 0 38 107 57 0.030 2.26 7.04 Intr + 208450 208561 112 2 1 65 23 73 0.065 -2.06 7.05 Intr + 211670 211758 89 2 2 41 86 86 0.058 3.31 7.06 Intr + 214893 214927 35 1 2 92 82 13 0.018 -0.96 7.07 Intr + 220270 220401 132 0 0 89 77 46 0.295 4.44 7.08 Intr + 221172 221547 376 2 1 39 28 157 0.132 -0.61 7.09 Intr + 225248 225349 102 0 0 78 91 26 0.262 2.05 7.10 Term + 226255 226355 101 2 2 72 48 74 0.291 0.09 7.11 PlyA + 228410 228415 6 1.05 8.00 Prom + 230217 230256 40 -5.96 8.01 Init + 235361 235429 69 1 0 67 63 170 0.938 13.45 8.02 Intr + 237915 238082 168 2 0 52 91 35 0.285 0.34 8.03 Intr + 242951 243080 130 1 1 110 57 69 0.831 6.27 8.04 Intr + 254821 254949 129 2 0 92 78 53 0.895 5.37 8.05 Term + 255275 255477 203 0 2 38 46 178 0.693 6.15 8.06 PlyA + 256368 256373 6 1.05 9.03 PlyA - 256493 256488 6 1.05 9.02 Term - 260088 259986 103 2 1 103 54 34 0.686 -0.75 9.01 Intr - 260568 260438 131 0 2 90 31 137 0.767 7.69 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 168683 168764 82 1 1 56 -17 35 0.892 -9.07 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815577f:42418455_42679800|GENSCAN_predicted_peptide_1|375_aa RTGCRGFLPENYTDRASESDTWVKHRCVQTSGSGETQFADIDMSVGPGATSKSQDVGATR DCRDNRRQLPCLRGEKTRAQEARMYTFSLATDLNSRKDGEASSRCSGEFLPQTARSLSSL QALQFMHLPNVARATENCLVLINFSSWKFCIVRASVDFTRDPQDLAGAARGADPENAFKA AFSQTENDQLWENSKGMDEGFSPWCSEAEFIPGSHKRTAPQQATVARKSVLVVRHGERVD QIFGKAWLQQCSTPDGKYYRPDLNFPCSLPRRSRGIKDFENDPPLSSCGIFQSRIAGDAL LDSGIRISSVFASPALRCVQTAKLILEAAGPAFVFSGRPEKQDVAFPALRFGRDEGGKIR NVKGQGLRVSGDPSV >gi568815577f:42418455_42679800|GENSCAN_predicted_CDS_1|1128_bp cggacgggctgccggggcttcctgccggaaaactacacggatcgagccagtgagtctgac acgtgggtgaagcacagatgcgtgcagaccagcggttcaggagaaacacaatttgctgac atagacatgtccgtgggtccaggggccacatcaaaatcacaggatgtgggtgccacaagg gattgtagagataaccgaaggcagctcccctgcttgagaggtgagaagactcgggcccag gaggccaggatgtacaccttcagtctagccacagacctgaactccagaaaggatggtgaa gccagcagcagatgcagcggggaatttcttccacaaacggcaaggagtcttagcagctta caggccttgcagtttatgcacctaccaaatgtagccagggccaccgagaactgtctggtc ctgataaatttcagcagctggaaattctgcatcgtccgggccagcgtggacttcacaagg gatccacaggatctggcaggagctgcccgtggtgctgaccccgagaatgctttcaaggca gcattcagccaaacagagaatgaccagctatgggaaaacagcaaggggatggatgagggc ttctccccttggtgctcagaagcagaattcatacctgggagccataaaaggacggcccct caacaggctaccgttgcaaggaagagcgtgctggtggttcgccacggggagagagtggat cagatcttcgggaaggcatggctgcagcaatgctccactcctgatgggaaatactacagg ccagacctgaatttcccctgcagtctgccaagacggagtcgtgggatcaaagactttgaa aacgatcccccattatcatcgtgtggcattttccagtccagaattgcaggggacgcgcta ctggacagtggtatcagaatcagctctgtgtttgcctccccagccctccgctgtgtgcag acggccaaactcatcctggaagcagcaggtccagccttcgtgttttcaggaaggcctgag aaacaggacgtggcctttccagctctgcgctttgggagggacgagggtggcaagatcaga aatgtgaaggggcagggtctccgagtatctggggacccttcagtgtga >gi568815577f:42418455_42679800|GENSCAN_predicted_peptide_2|337_aa MLEDAAELKLEKKIKIRVEPGIFEWTKWEAGKTTPTLMSLEELKEANFNIDTDYRPAFPL SALMPAESYQEYMDRCTASMVQIVNTCPQDTGVILIVSHGSTLDSCTRPLLGLPPRECGD FAQLVRKKHETHFYIPEYFPPAFAFVTPICGPIVHQPSCGEHRAGGWPTILQGSEVGRKE NGSIECTATNPKKAPGAWEWLRRNLGLDGPQDMPTSCDKQRCLHMLPNVPGGAGSSPAEK YRCIPFTGLHGTLASGLPPGHSGIARMGTGIHPELCLFRDPWPAATLAPGAHLKRAPKLS PLSSPAIQPDVPTPAPPCLIASAARVGFTAQQQLALL >gi568815577f:42418455_42679800|GENSCAN_predicted_CDS_2|1014_bp atgttggaggatgcagccgaactcaaactggagaaaaaaatcaagatacgagtggaacct ggaatctttgaatggacaaaatgggaagctggcaaaaccaccccaaccctcatgagcctg gaagagctgaaagaggcaaatttcaacattgacactgattacaggcccgcgtttcccctg tccgccctcatgccggccgagagctaccaggagtacatggacaggtgcacggcgagcatg gtgcaaatcgtcaacacctgtccacaggacacgggtgtcatcctaattgtgagtcacggc tccactctggactcctgcacgcggccactgctcgggctgccgccccgggaatgtggggat tttgcccaactcgtgagaaagaagcacgagacgcacttttatatcccggaatatttccct ccggctttcgcctttgtaactcccatctgtggacccatcgtccaccagcccagctgcggg gagcacagggcaggtggctggccaacaattctgcaaggatcagaagtcggaagaaaggaa aatggcagcatcgaatgcacagccacaaatcccaagaaggccccaggagcctgggagtgg ctgcggaggaatttagggctggatggcccacaggacatgcccaccagttgtgataaacaa agatgtctccacatgctgccaaatgtccctgggggagcaggatcatccccagctgagaag taccgatgtattcccttcacaggtctgcatggaaccctggcctcagggctcccaccaggg cactcaggcatagcgcgaatgggcacagggattcatcccgagctgtgtctcttccgggat ccttggccagcagcgactctggcacctggagcacacctgaagagagcccccaagctctcg cctctctccagccctgctatccaaccagatgttcccactcctgctcctccctgcctcatc gcctcagccgcccgggtgggtttcacagcccagcagcagctggccctcttgtag >gi568815577f:42418455_42679800|GENSCAN_predicted_peptide_3|105_aa MKAVVVAVVAMTNDQRKTENRRLSGLQSQQTKLMAAAADKKQVSTQDRKGHKEEPQILWT WGLYTAGKGRTRSSKTKATLQDSKNAMCTVASNLCGNVKVDRFLH >gi568815577f:42418455_42679800|GENSCAN_predicted_CDS_3|318_bp atgaaggctgtggtggtggcggtcgtagccatgacaaatgaccaaaggaaaacagaaaac cgcaggctttctggtctgcagagccagcagactaagctcatggcagccgcagctgataaa aagcaagtgagcacccaggacagaaaaggccacaaggaggagcctcaaatcctgtggacc tggggcttatatactgcagggaaagggcgtacacgctccagcaagacaaaggcaactctc caggacagcaagaatgccatgtgcactgtggcctctaatttgtgcggcaacgtcaaggtt gacaggttcttacactag >gi568815577f:42418455_42679800|GENSCAN_predicted_peptide_4|265_aa MSDLGSEELEEEGENDIGVRAPANATFSNDMNDIINVFQGIYKFKNGARYIGEYVRNKKH GQGTFIYPDGSRYEGEWANDLRHGHGVYYYINNDTYTGEWFAHQRHGQGTYLYAETGSKY VGTWVNGQQEGTAELIHLNHRYQGKFLNKNPVGPGKYVFDVGCEQHGEYRLTDMHAPLNV PLFCRLYKKHGTRNCLASGAGEPGEEAQALLEGFEGEMDMRPGDEDADVLREESREYDQE EFRYDMDEGNINSEEEETRQSDLQD >gi568815577f:42418455_42679800|GENSCAN_predicted_CDS_4|798_bp atgtcggacctgggctcggaggagttggaggaggagggagagaatgatattggggtgaga gctcctgcaaatgctacattcagcaatgatatgaatgatataatcaatgtgtttcagggg atctacaaatttaaaaatggtgctcgatatatcggagaatatgttagaaataaaaagcac ggtcaaggcacttttatatatccagatggatccagatatgaaggagagtgggcaaatgac ctgcggcacggccatggcgtatactactacatcaataatgacacctacactggagagtgg tttgctcatcaaaggcatgggcaaggcacctatttatacgcggagacgggcagtaagtat gttggcacctgggtgaacggacagcaggagggcacggccgagctcattcacctgaaccac aggtaccagggcaagttcttgaacaaaaatcctgttggccctggaaagtatgtatttgat gttgggtgtgaacaacatggtgaatatcgtttaacagatatgcatgctcctctgaacgtg ccgttgttctgcaggctgtacaagaagcatggcacccgcaactgcctggcctctggtgca ggagaacccggggaggaggcccaggctctgctggagggcttcgagggtgagatggacatg aggcctggagatgaagatgcagacgtcctccgggaagagagccgggagtatgaccaggag gagttccgctatgacatggatgagggaaacattaattctgaagaagaagaaactagacag tcagacctccaggactaa >gi568815577f:42418455_42679800|GENSCAN_predicted_peptide_5|303_aa MVLRETTGDHAGAQTLSDNASTKDKGHLSTVLGHGFHHQKAIQRLWVSGSALYKEEERHS ESSVVTFTPSALLAEVTTHTERTFSVTRITAWAVRAGRLVASGEGTFRRVKPFIINPEGR CHSHWPGGPRAPGTSCSADKQRKGPGCRRSEPAFCAGSSLGGWSPEEVNSAGTPTSPSRQ TMRRVPSCRLSGPSAVTAARAGRCPGPGTPGIVAEGHSSRCRISKTSLRENSLRPSPALG KHPNQPPPPPEKKEKEGEEPSSECAGPRARLPEGAAGAERLRRLSAGCPAEFSSLIVKNG QRP >gi568815577f:42418455_42679800|GENSCAN_predicted_CDS_5|912_bp atggttcttagagaaaccactggagaccacgcgggagcccaaacactatctgacaatgcc agtaccaaggacaaaggacacctcagcactgttctgggtcacggcttccaccaccaaaaa gcaattcaacggttatgggtgagtggctctgctctctacaaggaggaggagaggcattca gagtccagtgtcgtgacattcacaccttcagcgttactggcagaagtaaccacacacact gaacgtaccttctcggtcaccaggattacggcgtgggccgtaagggctggaaggctggtg gccagcggcgaggggactttccggcgcgtcaagccctttattataaacccagaaggtcgc tgtcactcgcattggccaggaggcccacgggctccgggaaccagctgctcagccgacaaa caacgcaaaggcccagggtgccggcgctccgagccagcgttctgtgccggctcctccctg ggcggctggtccccggaggaggtgaattccgcgggcacgcccacgtccccatcgcggcag acaatgcgcagggtcccctcctgccggctctctggaccctcagctgtcaccgcggcccgg gccgggcgttgccccggtccgggaacgcccggcattgtcgccgagggccactcttctaga tgtcgtatttcaaagacttccctccgcgaaaatagcttgagaccgtccccggctctcgga aaacatccaaaccagcccccacctccccccgaaaaaaaagaaaaggaaggagaggagccg agctccgagtgcgcagggccccgcgcccggctccctgagggcgccgcaggtgctgagcgg ctgcggcgcctgagcgccggctgccccgcagaattctcctccttgatcgtgaagaacggg cagcggccatga >gi568815577f:42418455_42679800|GENSCAN_predicted_peptide_6|2160_aa MALAAAFQQVCGVRVKLCAKSLVFSVTGPSDFCNWAAALSAEGETEAGAASNMLTQPVTK LQGDGFPLLPPPRQVSPTLALLAGSVDSLVTVTLTSSRMDSLPMLGLPMSGMLQSVLFHV CLLLLHVMFGELHKYCTAWDEADVRFSSQNRKSGSAAPHQLPDNETDCGWAPFDKNNYQQ LLGALDYSFLCAYAVGMYLSGIIGERLPIRYYLTFGMLASGAFTALFGLGYFYNIHSFGF YVVTQVINGLVQTTGWPSVVTCLGNWFGKGRRGLIMGVWNSHTSVGNILGSLIAGYWVST CWGLSFVVPGAIVAAMGIVCFLFLIEHPNDVRCSSTLVTHSKGYENGTNRLRLQKQILKS EKNKPLDPEMQCLLLSDGKGSIHPNHVVILPGDGGSGTAAISFTGALKIPGVIEFSLCLL FAKLVSYTFLFWLPLYITNVDHLDAKKAGELSTLFDVGGIFGGILAGVISDRLEKRASTC GLMLLLAAPTFPHLENGTSVSPPCSGLETMHMVTVFIIVTLELYIFSTVSKMGLEATIAM LLLSGALVSGPYTLITTAVSADLGTHKSLKGNAHALSTVTAIIDGTGSPDGTAVAFPEPL YGEHAQWLHLPGENQELPHLPGEDQEGPSYMGDVMGGCLVWLSQLIRRTWGANLCVTCCV GVSSKAECLVDEGGSVDPGSGTLRNLQEETPSVPRGAWVQLPTSLPWHGSLVEAGVFPQI PCRLLGAALGPLLAGLLSPSGWSNVFYMLMFADACALLGSSWFFVGLHQTALPSSDSPAH RSPGLRLSMQAPLLLQGGHPREEGCGKGARVYPASSTPAGEESLLGREGSWLPIGFAHEN LYVPHGKTGHQSPPDSAGQQRSREPAPHNIKKADNQVGNQDGAQTHHIAPEAFPAPMMFR TDLKTNCREAKGVSPYLHRASIPVYIHAGVSPYLHRASRPVYIHAGVSPYLHRASRPVYI HAGVSPYLHRASRPVYIHAGVSPYLHRASRPVYIHAGVSPYLHRASRPVYIHAGVSPYLH RASRPVYIHAGVSPYLHRASRPVYIHAGVSPYLHRASRPVYIHAGVSPYLHRACRPVYIH AGVSPYLHRACRPVYIHAGVSPYLHRASRPVYIHAGVSPYLHRASIPVYIHAGVSPYLHR ASRPVYIHAGVSPYLHRACRPVYIHAGVSPYLHRASRPVYIHAGVSPYLHRASIPVYIHA GVSPYLHRASRPVYIHAGVSPYLHRASIPVYIHAGVSPYLHRASIPVYIHAGVSPYLHRA SRPVYIHAGVSPYLHRASRPVYIHAGVSPYLHRASRPVYIHAGVSPYLHRASRPVYIHAG VSPYLHRASRPVYIHAGVSPYLHRASRPVYIHAGVSPYLHRASIPVYIHAGVSPYLHRAC RPVYIHAGVAPYLHRACRPVYIHAGVSPYLHRASIPVYIHAGVSPYLHRASIPVYIHAGV SPYLHRASRPVYIHAGVSPYLHRASIPVYIHAGVSPYLHRASRPVYIHAGVSPYLHRASI PVYIHAGVSPYLHRACRPVYIHAGVSPYLHRASIPVYIHAGVSPYLHRASIPVYIHAGVS PYLHRASRPVYIHAGVSPYLHRASIPVYIHAGVSPYLHRASRPVYIHAGVSPYLHRASIP VYIHAGVSPYLHRASRPVYIHAGVSPYLHRASIPVYIHAGVSPYLHRASIPVYIHAGVSP YLHRASIPVYIHAGVSPYLHRASRPVYIHAGVSPYLHRASIPVYIHAGVSPYLHRASRPV YIHAGVSPYLHRASIPVYIHAGVSPYLHRASRPVYIHAGVSPYLHRASIPVYIHAGVSPY LHRASIPVYIHAGVSPYLHRASIPVYIHAGVSPYLHRASRPVYIHAGVSPYLHRACRPVY IHACVSPYLHWACHPVYIYACVFLLPFQSRGLTSFGLVILSPCLLFRYFIDDFLIAPNGS PRALRHSHVCRPCTSIAGSPARGTVSAEAPFGTWEGTSHQYSRTSAIATDRRVNAGWAVP WPRPPPADPGSLQPAGLVASGDRAAAAVDSPWCSLLKSTVFLQESPISAEASLITGFKVE PFKDQDQESMQVLGPKPRLSYSNVQTPKYPRINDRFHCTTVLSHLQHYGFHPPSTTSHHT HQGQVLSQCQAVSSALQTASEPSNTLIGHSSHVTAPNHKAAQLIGHSSHVTTPNHKAALV >gi568815577f:42418455_42679800|GENSCAN_predicted_CDS_6|6483_bp atggctttggcagcagccttccaacaggtctgcggggtgcgggtgaagctctgtgcaaaa agcctggtgttcagcgtcacgggtcccagtgacttctgtaactgggcagcagccttgtct gcagagggggagaccgaggcaggggcagcaagtaacatgctcacacagccagtcaccaaa ctccagggcgatggctttcccctcctcccacctcccagacaagtgtctccaaccctggcc ctgcttgctggttctgtggacagtttggtaaccgtgactctcacctctagccgcatggat tcgttgcccatgcttggacttcctatgagtggaatgctgcagtctgtactctttcatgtc tgcctccttctgcttcatgttatgtttggtgagctccacaagtactgcactgcttgggat gaagctgacgtcaggttcagcagccagaacaggaagtctgggtccgctgccccccaccag ctccctgacaatgagaccgactgtggctgggcaccgtttgataagaacaactatcagcag ctgcttggggccctggactactccttcctgtgcgcctatgccgtggggatgtacctcagt ggcatcattggggagcgcctgccgattaggtattacctaactttcgggatgctcgccagc ggagccttcaccgccctgttcggcttagggtatttctacaacatccacagtttcggattc tacgtggtaactcaggtcatcaacgggctggtgcagaccaccggctggcccagcgtcgtc acctgcctcggcaactggtttggaaaaggaaggagaggtttgattatgggggtctggaac tcccacacctccgtgggcaacatcttggggtcattgatcgctggctactgggtgtccaca tgctggggcctgtccttcgtcgtgcctggagccatcgtggcagccatggggatagtgtgc tttctcttcctcattgaacatccgaacgacgtcaggtgctcctccaccctggtgacgcac tcaaaaggctatgagaatggtacaaacagattgagactccagaagcaaatcttgaagagc gaaaagaacaagcctctggacccagagatgcagtgcctgctgctctcagatgggaagggc tccatccacccgaaccacgtcgtcattctccccggggacggtgggagtggcacggccgcc atcagcttcacaggggccttgaaaattccaggcgtgatagagttctcactgtgtctgctg tttgccaagctggtcagctatactttcctcttctggctgcccctgtacatcacgaatgtg gatcaccttgatgccaaaaaggcgggggagctctccaccctgtttgacgtgggcggaatc tttggtgggatcctggcaggtgtgatctcagaccgactggagaaaagggcctccacctgc ggcctgatgctgctgctcgcggcccccacgttccctcatctggaaaatgggacgtcagta tcaccaccttgcagtgggttagaaacaatgcacatggttactgtttttatcatcgtaaca ttagagctctacatcttctccaccgtcagcaagatggggcttgaggccaccatcgccatg ctgctgctcagcggagccctggtcagtgggccctacacactcatcaccaccgccgtctcc gccgacctggggactcataaaagtctgaaaggcaacgcgcacgccctctccaccgtgacg gccatcattgacgggacgggctctcctgacggcacggctgtggccttccctgagcccttg tatggggagcacgcacagtggctgcacctgccaggcgaaaatcaggagttgccacacctg ccaggtgaagatcaggagggtccttcctacatgggggacgttatggggggctgcctggtg tggctgtcacagctaataaggaggacctggggtgccaacctgtgtgtcacttgctgcgtg ggggtctccagtaaagccgagtgtttggtagatgaaggtggctcggtagacccaggcagt gggaccctgcggaacctgcaggaggagactccttcagtgcccaggggagcctgggtccag ctgccaacatccttgccatggcatggaagcttggtggaagctggtgtctttccccagatc ccctgtcgcctgttaggagcagccctgggccccctgctggctgggctcctctccccgtcc ggctggagcaatgtgttttacatgctgatgtttgcagatgcctgtgccttactgggcagc tcctggttcttcgtgggccttcatcagacagccctgccctcatcagacagccctgcccac aggagcccaggcctccggctgagcatgcaagccccgctgctcttgcagggaggccacccc agggaagaaggctgcgggaaaggtgcaagggtttatccagcatccagcacccctgcagga gaggaatccctgctgggccgggagggcagctggctccccataggatttgcacatgagaac ctgtatgtgccacatggaaaaacaggacaccagagcccaccagacagtgccggccagcag agaagcagagagccagcgccacacaacatcaagaaggccgacaaccaggttggaaaccaa gacggagctcagacccaccacatcgccccagaggcttttccagcacccatgatgtttcgg actgacctaaaaactaattgtcgagaagccaagggcgtttccccttacctgcaccgagcc tccattcccgtttatatccacgcaggcgtttccccttacctgcaccgagcctcccgcccc gtttacatccacgcaggcgtttccccttacctgcaccgagcctcccgccccgtttacatc cacgcaggcgtttccccttacctgcaccgagcctcccgccccgtttacatccacgcaggc gtttccccttacctgcaccgagcctcccgccccgtttacatccacgcaggcgtttcccct tacctgcaccgagcctcccgccccgtttacatccacgcaggcgtttccccttacctacac cgagcctcccgccccgtttacatccacgcaggcgtttccccttacctgcaccgagcctcc cgccccgtttacatccacgcaggcgtttccccttacctgcaccgagcctcccgccccgtt tacatccacgcaggcgtttccccttacctgcaccgagcctgccgccccgtttacatccac gcaggcgtttccccttacctgcaccgagcctgccgccccgtttacatccacgcaggcgtt tccccttacctgcaccgagcctcccgccccgtttatatccacgcaggcgtttccccttac ctgcaccgagcctccattcccgtttacatccacgcaggcgtttccccttacctgcaccga gcctcccgccccgtttacatccacgcaggcgtttccccttacctgcaccgagcctgccgc cccgtttacatccacgcaggcgtttccccttacctgcaccgagcctcccgccccgtttat atccacgcaggcgtttccccttacctgcaccgagcctccattcccgtttacatccacgca ggcgtttccccttacctgcaccgagcctcccgccccgtttacatccacgcaggcgtttcc ccttacctgcaccgagcctccattcccgtttacatccacgcaggcgtttccccttacctg caccgagcctccattcccgtttatatccatgcaggcgtttccccttacctgcaccgagcc tcccgccccgtttacatccacgcaggcgtttccccttacctgcaccgagcctcccgcccc gtttacatccacgcaggcgtttccccttacctgcaccgagcctcccgcccggtttacatc cacgcaggcgtttccccttacctgcaccgagcctcccgccccgtttacatccacgcaggc gtttccccttacctgcaccgagcctcccgccccgtttacatccacgcaggcgtttcccct tacctgcaccgagcctcccgccccgtttacatccacgcaggcgtttccccttacctgcac cgagcctccattcccgtttatatccacgcaggcgtttccccttacctgcaccgagcctgc cgccccgtttacatccacgcaggcgttgccccttacctgcaccgagcctgccgccccgtt tacatccacgcaggcgtttccccttacctgcaccgagcctccattcccgtttatatccac gcaggcgtttccccttacctgcaccgagcctccattcccgtttatatccacgcaggcgtt tccccttacctgcaccgagcctcccgccccgtttacatccacgcaggcgtttccccttac ctgcaccgagcctccattcccgtttatatccacgcaggcgtttccccttacctgcaccga gcctcccgccccgtttacatccacgcaggcgtttccccttacctgcaccgagcctccatt cccgtttatatccacgcaggcgtttccccttacctgcaccgggcctgccgccccgtttac atccacgcaggcgtttccccttacctgcaccgagcctccattcccgtttatatccacgca ggcgtttccccttacctgcaccgagcctccattcccgtttatatccacgcaggcgtttcc ccttacctgcaccgagcctcccgccccgtttacatccacgcaggcgtttccccttacctg caccgagcctccattcccgtttatatccacgcaggcgtttccccttacctgcaccgagcc tcccgccccgtttacatccacgcaggcgtttccccttacctgcaccgagcctccattccc gtttatatccacgcaggcgtttccccttacctgcaccgagcctcccgccccgtttacatc cacgcaggcgtttccccttacctgcaccgagcctccattcccgtttatatccacgcaggc gtttccccttacctgcaccgagcctccattcccgtttatatccacgcaggcgtttcccct tacctgcaccgagcctccattcccgtttatatccacgcaggcgtttccccttacctgcac cgagcctcccgccccgtttacatccacgcaggcgtttccccttacctgcaccgagcctcc attcccgtttatatccacgcaggcgtttccccttacctgcaccgagcctcccgccccgtt tacatccacgcaggcgtttccccttacctgcaccgagcctccattcccgtttatatccac gcaggcgtttccccttacctgcaccgagcctcccgccccgtttacatccacgcaggcgtt tccccttacctgcaccgagcctccattcccgtttatatccacgcaggcgtttccccttac ctgcaccgagcctccattcccgtttatatccacgcaggcgtttccccttacctgcaccga gcctccattcccgtttatatccacgcaggcgtttccccttacctgcaccgagcctcccgc cccgtttatatccacgcaggcgtttccccttacctgcaccgggcctgccgccccgtttac atccacgcatgcgtttccccttacctgcactgggcctgccaccctgtttatatctatgcc tgcgttttcctcttgccttttcaatccagaggcctcactagctttggcctagtcatcctt tctccttgtcttctcttcaggtattttattgatgacttcctgatagcaccaaatggctcc ccgagggctctgaggcattcacacgtgtgccggccctgcacttcgattgccgggagccct gcacgcggaactgtgtctgcagaagctccttttggcacctgggagggcacctctcatcag tactccaggacctcggccatcgccacggaccgcagggtgaacgctgggtgggctgtacca tggcctcggccaccgcctgcagatccaggctccctgcagcctgctgggctggtggcctca ggggaccgggcagctgccgctgtggattctccttggtgcagcttgctgaagtccactgtc ttcctccaggagtctcctatttctgcagaggccagcctgattactggctttaaggtagag ccctttaaggaccaggaccaagaaagcatgcaggttttaggacctaaaccacgactttct tattcaaacgtgcaaacacccaagtacccccgtatcaatgatcgtttccattgcacgact gtcctgagccacctccaacactatggctttcacccgccatcaaccaccagccatcacaca catcaaggtcaagtcctctcacagtgccaagcagtctctagtgccctacaaacagccagc gagcccagtaacacactcattggccacagcagccatgtgaccgcacccaaccacaaagct gcacagctcattggccacagcagccatgtgaccacacccaaccacaaagctgcacttgta tga >gi568815577f:42418455_42679800|GENSCAN_predicted_peptide_7|436_aa MGAAAACRQVHKRSPVKAQRSQEASGLHPSEPPIHVEMPAKALQHTAEEGAMGAMLGPEV TFSASVKQRVCACSSQAWAPPHVVEGTLLHIEEPGLDTNRLSSHIPQKHLAQGSALPILQ RVTFWCYHHNPIFKTQRQAKLTHTVDRFMDGPRCMGSPGAGECHCVHSNLGASQWEALAP PANAFAIARPAIASFPFPVWVARSCAPIPRLAGSPNFHAHERWNHTIQHRMAGGLVSFFT AGGEGHRIQPKELPMANAGTLGATFSKVVWNCSPQYESIQNKQKWWKRQIRGGEETNVPC QIPNDVGRRPLEEVLWTPHPKRGLCTVTPFQSTRPGDREREGDLTVGKPDSQHLSQVAKV NVNRKDSFGKSNRRSGLAGALCSPCKKGSETPSDLSKHLVFQTGGHFPPRNFLRLEEMRF LQKHQLAGELKVPVGG >gi568815577f:42418455_42679800|GENSCAN_predicted_CDS_7|1311_bp atgggggctgctgctgcctgcagacaagtgcacaagcgcagcccggtgaaggcacagcgg tcccaggaggcatctgggctgcaccccagcgagccgcccatacacgtggagatgccggcc aaggccctgcagcacacggcagaggaaggcgcgatgggagccatgctgggcccggaagtg acattctcagcatctgtgaagcaacgagtgtgcgcgtgctcttcccaggcgtgggctccg ccgcacgtggtggaaggaactctgctccacattgaggagcctggtttggacaccaacagg ctaagctcccacataccccaaaaacacctggcccagggatcagcccttccaattctacaa cgagtaactttctggtgttatcaccacaaccccatcttcaaaacccaacgtcaggccaag ctgacccacactgtggatcgattcatggatggacccagatgcatgggaagtcctggtgct ggcgagtgccactgtgttcacagcaacttgggtgcttctcagtgggaagcactggcccca ccggctaacgcttttgccattgcccgtccggccattgcaagttttccttttcctgtgtgg gtggctaggtcctgtgcgcccatcccccggctggctggctctccaaacttccatgcacat gaacgttggaaccacacaattcagcaccgcatggcaggtggcttggtcagcttcttcact gctgggggggagggccacaggatccaaccgaaggagctcccaatggccaatgctggaaca ctgggagcaacatttagtaaagttgtatggaattgtagcccacagtatgagtccatacag aataaacaaaaatggtggaagagacaaatacgtgggggagaagaaacaaatgtcccatgc cagattccaaatgacgtgggcagacgccccctcgaggaggtgctgtggaccccccaccct aagcgtgggctgtgcacagtgactcccttccagagcacacggcctggagatagggagaga gagggtgaccttacagtggggaagcctgactcacagcacctcagccaggtggccaaggtc aacgtcaacaggaaggactcctttggaaaaagcaatagaagaagcgggctggcaggagcc ctgtgctctccctgcaagaaagggtcagagaccccctctgacctgagcaaacacctggtg tttcagacaggaggtcatttcccaccaaggaactttctgcggctggaggaaatgagattc ttacagaaacaccagctggctggcgagctgaaagtgcctgtggggggctaa >gi568815577f:42418455_42679800|GENSCAN_predicted_peptide_8|232_aa MGSGSSSYRPKAIYLDIDGRIQKFPPETAETGRKEGTLETGGSAEFCAPAKLQVDNLKLS EGGPGQQVPVSPGLALPVQPLTPTVHSAFYLHETAECGDLTEVESHRIALLCLASFTRIM PSEQCTTSTTDQGDPRGKRQPNGTDPAKPRPAQVEQSQLGAFGSRVCHGGPPNPEPPSSL CEEGPACTALQTWASLDAWATLPCDSLACRTSAKHSGLTAVAMNGLTDTEWT >gi568815577f:42418455_42679800|GENSCAN_predicted_CDS_8|699_bp atgggatccggctcctccagctaccggcccaaggccatctacctggacatcgatggacgc attcagaagtttccccctgagaccgcagagacggggaggaaggagggaacattagaaact gggggttctgccgagttctgtgctccggcaaagctccaggttgacaatctgaagttgtct gagggaggccccgggcagcaagtgccggtgtctcctggcttggccttacctgtgcagccc ctgacccccactgtacattccgctttctatctccatgaaactgctgagtgcggggacctc acagaagtggaatcacacaggatcgccctgctgtgcctggcctccttcactcggatcatg ccctcagagcagtgtacaacatcaacaactgaccagggcgatcctagaggtaagagacag ccgaatggcacggacccagccaagcccaggcctgcccaggtggaacagagccagctagga gcattcggtagcagagtttgtcacggtggccccccaaatccagaaccgccctcctcgctg tgtgaggagggtcctgcctgcaccgctttgcagacatgggccagcctggatgcctgggcg acgctgccctgcgactcgctggcctgcaggacatctgcaaaacattcaggtctcacggca gtggccatgaacggcctcacagacactgagtggacctga >gi568815577f:42418455_42679800|GENSCAN_predicted_peptide_9|77_aa VQHEAQGTCMFQFGGDSSSEILLVEDPSDQRESTGLQPRHAEGKWLSKEPWEDGYPELGV LGPSLASYWGRQLSEGH >gi568815577f:42418455_42679800|GENSCAN_predicted_CDS_9|234_bp gtgcagcatgaagcccaaggcacctgcatgttccagtttggaggggactcgtccagtgag atccttctagtggaggatccttctgatcagcgggagtccactgggcttcagcccagacac gccgaggggaagtggttgtccaaggagccatgggaagatggttacccagaactgggtgtc cttggtcccagcctggcctcctactggggcaggcagctctcagaagggcattga