GENSCAN 1.0 Date run: 2-Nov-116 Time: 23:38:33 Sequence gi568815581f:43075800_43310071 : 234272 bp : 48.50% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.05 Intr - 815 689 127 0 1 7 116 109 0.070 5.34 1.04 Intr - 6773 6605 169 2 1 14 93 116 0.009 4.22 1.03 Intr - 15233 15145 89 0 2 83 98 -10 0.415 -0.81 1.02 Intr - 15834 15636 199 0 1 90 40 113 0.704 5.62 1.01 Init - 18843 15913 2931 0 0 62 90 915 0.532 80.56 1.00 Prom - 22539 22500 40 -2.86 2.00 Prom + 44047 44086 40 -3.66 2.01 Init + 50867 51101 235 1 1 39 80 121 0.272 2.82 2.02 Term + 52536 52738 203 1 2 -35 45 249 0.315 5.85 2.03 PlyA + 55441 55446 6 1.05 3.04 PlyA - 55742 55737 6 1.05 3.03 Term - 59473 59357 117 1 0 91 48 107 0.767 5.54 3.02 Intr - 69241 69141 101 2 2 93 98 64 0.299 7.73 3.01 Init - 71206 71179 28 1 1 76 53 25 0.066 -2.38 3.00 Prom - 74621 74582 40 -3.46 4.00 Prom + 76619 76658 40 -0.06 4.01 Init + 99822 99824 3 2 0 58 115 0 0.747 -0.30 4.02 Intr + 99992 100102 111 1 0 78 101 40 0.753 4.88 4.03 Intr + 102137 102199 63 1 0 105 83 50 0.934 5.11 4.04 Intr + 103595 103613 19 1 1 134 113 21 0.886 5.38 4.05 Intr + 104996 105018 23 0 2 64 90 22 0.323 -2.74 4.06 Intr + 110451 110645 195 2 0 29 80 157 0.269 8.61 4.07 Intr + 113243 113320 78 1 0 63 113 51 0.960 4.85 4.08 Intr + 113789 114003 215 1 2 56 87 135 0.980 7.71 4.09 Intr + 114810 114977 168 0 0 96 98 125 0.999 13.36 4.10 Intr + 115573 115782 210 1 0 104 69 104 0.861 8.13 4.11 Intr + 117295 117454 160 1 1 42 86 59 0.675 1.09 4.12 Intr + 117549 117839 291 2 0 96 83 119 0.993 9.53 4.13 Intr + 118551 118700 150 2 0 82 44 125 0.996 7.86 4.14 Intr + 119165 119240 76 1 1 91 86 72 0.996 6.39 4.15 Intr + 120682 120792 111 2 0 97 95 1 0.803 2.05 4.16 Intr + 121143 121307 165 1 0 54 70 136 0.882 8.33 4.17 Intr + 124368 124809 442 1 1 78 81 408 0.735 31.61 4.18 Intr + 127882 127987 106 1 1 71 82 80 0.871 5.92 4.19 Term + 134102 134275 174 1 0 80 55 226 0.954 16.26 4.20 PlyA + 135649 135654 6 1.05 5.00 Prom + 136600 136639 40 -10.15 5.01 Init + 137243 137453 211 1 1 54 113 186 0.952 16.65 5.02 Intr + 138029 138092 64 0 1 121 87 76 0.999 8.58 5.03 Intr + 139989 140142 154 0 1 100 76 261 0.991 26.17 5.04 Intr + 140650 140790 141 0 0 121 62 97 0.684 10.95 5.05 Intr + 140898 140941 44 2 2 83 73 36 0.429 -1.36 5.06 Intr + 141460 141513 54 1 0 105 100 5 0.533 1.49 5.07 Intr + 145336 145437 102 1 0 137 72 49 0.420 7.49 5.08 Intr + 148470 148835 366 0 0 -51 89 294 0.292 9.86 5.09 Intr + 152616 152740 125 0 2 86 54 34 0.667 0.03 5.10 Intr + 157718 157866 149 0 2 28 75 127 0.712 5.35 5.11 Intr + 157968 158234 267 2 0 103 81 27 0.546 1.23 5.12 Intr + 160343 160489 147 1 0 69 14 107 0.029 1.83 5.13 Intr + 161858 162017 160 1 1 76 55 120 0.049 7.06 5.14 Intr + 163852 164000 149 2 2 28 75 96 0.044 2.25 5.15 Intr + 164102 164368 267 1 0 103 81 27 0.049 1.23 5.16 Intr + 165361 165627 267 0 0 103 81 27 0.056 1.23 5.17 Intr + 167742 167888 147 2 0 69 14 102 0.024 1.33 5.18 Intr + 168825 168975 151 2 1 16 26 96 0.207 -4.06 5.19 Intr + 169605 169753 149 1 2 28 75 127 0.540 5.35 5.20 Intr + 169855 170121 267 0 0 103 81 27 0.481 1.23 5.21 Intr + 172225 172371 147 0 0 69 14 102 0.149 1.33 5.22 Intr + 173742 173900 159 2 0 76 21 92 0.104 1.48 5.23 Intr + 176010 176276 267 2 0 103 81 27 0.275 1.23 5.24 Intr + 177671 177817 147 1 0 69 14 102 0.172 1.33 5.25 Intr + 182271 182417 147 2 0 69 14 107 0.081 1.83 5.26 Intr + 185389 185470 82 0 1 76 35 77 0.144 0.41 5.27 Intr + 185515 185658 144 2 0 89 59 65 0.060 3.95 5.28 Intr + 190564 190712 149 2 2 28 75 87 0.230 1.35 5.29 Intr + 190814 191080 267 1 0 103 81 27 0.554 1.23 5.30 Intr + 193194 193340 147 2 0 69 14 107 0.060 1.83 5.31 Intr + 195550 195816 267 0 0 103 81 27 0.419 1.23 5.32 Intr + 197282 197549 268 1 1 103 81 7 0.649 -1.39 5.33 Intr + 197601 197999 399 1 0 109 -11 172 0.345 3.88 5.34 Intr + 201909 202057 149 1 2 28 75 96 0.539 2.25 5.35 Intr + 202159 202425 267 0 0 103 81 27 0.384 1.23 5.36 Intr + 206053 206134 82 0 1 76 35 77 0.332 0.41 5.37 Intr + 208070 208218 149 0 2 28 75 96 0.645 2.25 5.38 Intr + 208320 208586 267 2 0 103 81 27 0.516 1.23 5.39 Intr + 210696 210842 147 2 0 69 14 107 0.324 1.83 5.40 Intr + 212217 212298 82 2 1 76 35 77 0.328 0.41 5.41 Intr + 214921 215243 323 2 2 75 47 146 0.273 4.78 5.42 Term + 216839 217051 213 1 0 69 42 138 0.763 4.53 5.43 PlyA + 217056 217061 6 1.05 6.05 PlyA - 218162 218157 6 1.05 6.04 Term - 218512 218352 161 2 2 26 43 251 0.293 12.50 6.03 Intr - 219887 219800 88 2 1 61 30 95 0.130 0.54 6.02 Intr - 224815 224609 207 1 0 78 11 115 0.073 2.07 6.01 Init - 227457 227353 105 0 0 69 57 145 0.886 9.73 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 162013 161854 160 0 1 58 43 253 0.946 15.21 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581f:43075800_43310071|GENSCAN_predicted_peptide_1|1172_aa MNVEKAEFCNKSKQPGLARSQHNRWAGSKETCNDRRTPSTEKKVDLNADPLCERKEWNKQ KLPCSENPRDTEDVPWITLNSSIQKVNEWFSRSDELLGSDDSHDGESESNAKVADVLDVL NEVDEYSGSSEKIDLLASDPHEALICKSERVHSKSVESNIEDKIFGKTYRKKASLPNLSH VTENLIIGAFVTEPQIIQERPLTNKLKRKRRPTSGLHPEDFIKKADLAVQKTPEMINQGT NQTEQNGQVMNITNSGHENKTKGDSIQNEKNPNPIESLEKESAFKTKAEPISSSISNMEL ELNIHNSKAPKKNRLRRKSSTRHIHALELVVSRNLSPPNCTELQIDSCSSSEEIKKKKYN QMPVRHSRNLQLMEGKEPATGAKKSNKPNEQTSKRHDSDTFPELKLTNAPGSFTKCSNTS ELKEFVNPSLPREEKEEKLETVKVSNNAEDPKDLMLSGERVLQTERSVESSSISLVPGTD YGTQESISLLEVSTLGKAKTEPNKCVSQCAAFENPKGLIHGCSKDNRNDTEGFKYPLGHE VNHSRETSIEMEESELDAQYLQNTFKVSKRQSFAPFSNPGNAEEECATFSAHSGSLKKQS PKVTFECEQKEENQGKNESNIKPVQTVNITAGFPVVGQKDKPVDNAKCSIKGGSRFCLSS QFRGNETGLITPNKHGLLQNPYRIPPLFPIKSFVKTKCKKNLLEENFEEHSMSPEREMGN ENIPSTVSTISRNNIRENVFKEASSSNINEVGSSTNEVGSSINEIGSSDENIQAELGRNR GPKLNAMLRLGVLQPEVYKQSLPGSNCKHPEIKKQEYEEVVQTVNTDFSPYLISDNLEQP MGSSHASQVCSETPDDLLDDGEIKEDTSFAENDIKESSAVFSKSVQKGELSRSPSPFTHT HLAQGYRRGAKKLESSEENLSSEDEELPCFQHLLFGKVNNIPSQSTRHSTVATECLSKNT EENLLSLKNSLNDCSNQCSELEDLTANTNTQDPFLIGSSKQMRHQSESQGVGLSDKELVS DDEERGTGLEENNQEEQSMDSNLGEAASGCESETSVSEDCSGLSSQSDILTTQRDTMQHN LIKLQQEMAELEAVLEQHGSQPSNSYPSIISDSSALEDLRNPEQSTSEKAVLTSQKSSEY PISQNPEGLSADKFEVSADSSTSKNKEPGVES >gi568815581f:43075800_43310071|GENSCAN_predicted_CDS_1|3516_bp atgaatgtagaaaaggctgaattctgtaataaaagcaaacagcctggcttagcaaggagc caacataacagatgggctggaagtaaggaaacatgtaatgataggcggactcccagcaca gaaaaaaaggtagatctgaatgctgatcccctgtgtgagagaaaagaatggaataagcag aaactgccatgctcagagaatcctagagatactgaagatgttccttggataacactaaat agcagcattcagaaagttaatgagtggttttccagaagtgatgaactgttaggttctgat gactcacatgatggggagtctgaatcaaatgccaaagtagctgatgtattggacgttcta aatgaggtagatgaatattctggttcttcagagaaaatagacttactggccagtgatcct catgaggctttaatatgtaaaagtgaaagagttcactccaaatcagtagagagtaatatt gaagacaaaatatttgggaaaacctatcggaagaaggcaagcctccccaacttaagccat gtaactgaaaatctaattataggagcatttgttactgagccacagataatacaagagcgt cccctcacaaataaattaaagcgtaaaaggagacctacatcaggccttcatcctgaggat tttatcaagaaagcagatttggcagttcaaaagactcctgaaatgataaatcagggaact aaccaaacggagcagaatggtcaagtgatgaatattactaatagtggtcatgagaataaa acaaaaggtgattctattcagaatgagaaaaatcctaacccaatagaatcactcgaaaaa gaatctgctttcaaaacgaaagctgaacctataagcagcagtataagcaatatggaactc gaattaaatatccacaattcaaaagcacctaaaaagaataggctgaggaggaagtcttct accaggcatattcatgcgcttgaactagtagtcagtagaaatctaagcccacctaattgt actgaattgcaaattgatagttgttctagcagtgaagagataaagaaaaaaaagtacaac caaatgccagtcaggcacagcagaaacctacaactcatggaaggtaaagaacctgcaact ggagccaagaagagtaacaagccaaatgaacagacaagtaaaagacatgacagcgatact ttcccagagctgaagttaacaaatgcacctggttcttttactaagtgttcaaataccagt gaacttaaagaatttgtcaatcctagccttccaagagaagaaaaagaagagaaactagaa acagttaaagtgtctaataatgctgaagaccccaaagatctcatgttaagtggagaaagg gttttgcaaactgaaagatctgtagagagtagcagtatttcattggtacctggtactgat tatggcactcaggaaagtatctcgttactggaagttagcactctagggaaggcaaaaaca gaaccaaataaatgtgtgagtcagtgtgcagcatttgaaaaccccaagggactaattcat ggttgttccaaagataatagaaatgacacagaaggctttaagtatccattgggacatgaa gttaaccacagtcgggaaacaagcatagaaatggaagaaagtgaacttgatgctcagtat ttgcagaatacattcaaggtttcaaagcgccagtcatttgctccgttttcaaatccagga aatgcagaagaggaatgtgcaacattctctgcccactctgggtccttaaagaaacaaagt ccaaaagtcacttttgaatgtgaacaaaaggaagaaaatcaaggaaagaatgagtctaat atcaagcctgtacagacagttaatatcactgcaggctttcctgtggttggtcagaaagat aagccagttgataatgccaaatgtagtatcaaaggaggctctaggttttgtctatcatct cagttcagaggcaacgaaactggactcattactccaaataaacatggacttttacaaaac ccatatcgtataccaccactttttcccatcaagtcatttgttaaaactaaatgtaagaaa aatctgctagaggaaaactttgaggaacattcaatgtcacctgaaagagaaatgggaaat gagaacattccaagtacagtgagcacaattagccgtaataacattagagaaaatgttttt aaagaagccagctcaagcaatattaatgaagtaggttccagtactaatgaagtgggctcc agtattaatgaaataggttccagtgatgaaaacattcaagcagaactaggtagaaacaga gggccaaaattgaatgctatgcttagattaggggttttgcaacctgaggtctataaacaa agtcttcctggaagtaattgtaagcatcctgaaataaaaaagcaagaatatgaagaagta gttcagactgttaatacagatttctctccatatctgatttcagataacttagaacagcct atgggaagtagtcatgcatctcaggtttgttctgagacacctgatgacctgttagatgat ggtgaaataaaggaagatactagttttgctgaaaatgacattaaggaaagttctgctgtt tttagcaaaagcgtccagaaaggagagcttagcaggagtcctagccctttcacccataca catttggctcagggttaccgaagaggggccaagaaattagagtcctcagaagagaactta tctagtgaggatgaagagcttccctgcttccaacacttgttatttggtaaagtaaacaat ataccttctcagtctactaggcatagcaccgttgctaccgagtgtctgtctaagaacaca gaggagaatttattatcattgaagaatagcttaaatgactgcagtaaccagtgcagtgaa ttggaagacttgactgcaaatacaaacacccaggatcctttcttgattggttcttccaaa caaatgaggcatcagtctgaaagccagggagttggtctgagtgacaaggaattggtttca gatgatgaagaaagaggaacgggcttggaagaaaataatcaagaagagcaaagcatggat tcaaacttaggtgaagcagcatctgggtgtgagagtgaaacaagcgtctctgaagactgc tcagggctatcctctcagagtgacattttaaccactcagagggataccatgcaacataac ctgataaagctccagcaggaaatggctgaactagaagctgtgttagaacagcatgggagc cagccttctaacagctacccttccatcataagtgactcttctgcccttgaggacctgcga aatccagaacaaagcacatcagaaaaagcagtattaacttcacagaaaagtagtgaatac cctataagccagaatccagaaggcctttctgctgacaagtttgaggtgtctgcagatagt tctaccagtaaaaataaagaaccaggagtggaaagn >gi568815581f:43075800_43310071|GENSCAN_predicted_peptide_2|145_aa MRGDNVLAALARSRRLLGLGVHSGRAGGALQPATALWGPLSGLAEARAGSLCLRGSVEGE AGVGTGAARSARQPARVPASLLKPARPRIHQKEVTLNTSEHQKEQTLDTSSLRTVPLTVR VRNFILEVSETKNPPISDTRTSGFY >gi568815581f:43075800_43310071|GENSCAN_predicted_CDS_2|438_bp atgcgaggtgacaacgtgctagcagccctcgctcgctctcggcgcctcctcggccttggc gtccattctggccgtgctggaggagcccttcagcccgccactgcgctgtgggggcccctc tctgggctggccgaagccagagccggctccctctgcttgcggggaagtgtggagggagag gcgggtgtgggaactggggctgcgcgcagcgctcgccagccagcgcgagttccagcttca ctcctgaagccagcgagaccacgaatccaccagaaggaggtaactctgaacacgtccgaa catcagaaggaacaaactctggacacatcatctttaagaactgtaccactcaccgtgagg gtccgcaacttcattcttgaagtcagtgagaccaagaacccaccaatttcggacacaaga acatccggcttctactga >gi568815581f:43075800_43310071|GENSCAN_predicted_peptide_3|81_aa MRLQDKESSRGLADNCADLLLGSSMAAPSVELTFFLGILAAGKACGSARGLRSFWTEAEA TAAPEKAFWLKVEVHGVRRTA >gi568815581f:43075800_43310071|GENSCAN_predicted_CDS_3|246_bp atgaggctgcaggacaaggagtcttccagaggtttagctgacaattgcgctgatctcctc ttgggctcttccatggcagccccttcggtggagctgacctttttcttgggcatcctggca gcagggaaggcctgcggatcggccagggggctccgatccttttggaccgaggctgaagca acggctgcaccagagaaggccttctggctgaaggtggaagtgcacggggtccgcagaacc gcctaa >gi568815581f:43075800_43310071|GENSCAN_predicted_peptide_4|919_aa MPHSMEPQVTLNVTFKNEIQSFLVSDPENTTWADIEAMVKVSFDLNTIQIKYLDEENEEV SINSQGEYEEALKMAVKQGNQLQMQVHEGHHVVDEAPPPVVGAKRLAARAGKKPLAHYSS LVRVLGSDMKTPEDPAVQSFPLVPCDTDQPQDKPPDWFTSYLETFREQVVNETVEKLEQK LHEKLVLQNPSLGSCPSEVSMPTSEETLFLPENQFSWHIACNNCQRRIVGVRYQCSLCPS YNICEDCEAGPYGHDTNHVLLKLRRPVVGSSEPFCHSKYSTPRLPAALEQVRLQKQVDKN FLKAEKQRLRAEKKQRKAEVKELKKQLKLHRKIHLWNSIHGLQSPKSPLGRPESLLQSNT LMLPLQPCTSVMPMLSAAFVDENLPDGTHLQPGTKFIKHWRMKNTGNVKWSADTKLKFMW GNLTLASTEKKDVLVPCLKAGHVGVVSVEFIAPALEGTYTSHWRLSHKGQQFGPRVWCSI IVDPFPSEESPDNIEKGMISSSKTDDLTCQQEETFLLAKEERQLGEVTEQTEGTAACIPQ KAKNVASERELYIPSVDLLTAQDLLSFELLDINIVQELERVPHNTPVDVTPCMSPLPHDS PLIEKPGLGQIEEENEGAGFKALPDSMVSVKRKAENIASVEEAEEDLSGTQFVCETVIRS LTLDAAPDHNPPCRQKSLQMTFALPEGPLGNEKEEIIHIAEEEAVMEEEEDEEDEEEEDE LKDEVQSQSSASSEDYIIILPECFDTSRPLGDSMYSSALSQPGLERGAEGKPGVEAGQEP AEAGERLPGGENQPQEHSISDILTTSQTLETVPLIPEVVELPPSLPRHHHGSSIAGGLVK GALSVAASAYKALFAGPPVTAQPIISEDQTAALMAHLFEMGFCDRQLNLRLLKKHNYNIL QVVTELLQLNNNDWYSQRY >gi568815581f:43075800_43310071|GENSCAN_predicted_CDS_4|2760_bp atgcctcacagcatggaaccacaggttactctaaatgtgacttttaaaaatgaaattcaa agctttctggtttctgatccagaaaatacaacttgggctgatatcgaagctatggtaaaa gtttcatttgatctgaatactattcaaataaaatacctggatgaggaaaatgaagaggta tccatcaacagtcaaggagaatatgaagaagcgcttaagatggcagttaaacagggaaac caactgcagatgcaagtccacgaagggcaccatgtcgttgatgaagccccacccccagtt gtaggagcaaaacgactagctgccagggcagggaagaagccacttgcacattactcttca ctggtgagagtcttgggatcagacatgaagaccccagaggatcctgcagtgcagtcgttt ccacttgttccatgtgacacagaccagcctcaagacaagcccccagactggttcacaagc tacctggagacgttcagagaacaagtggttaacgaaacggttgagaagcttgaacagaaa ttacatgaaaagcttgtcctccagaacccatccttgggttcttgtccctcagaagtctca atgcctacttcagaagaaacattgtttttgccagaaaaccagttcagctggcatattgct tgcaacaactgccaaagaaggattgttggtgtccgctaccagtgtagcctatgcccatcc tacaatatctgtgaagattgtgaagcagggccatatggccatgacactaaccacgtcctg ctgaagttgcggagacctgttgtgggctcctctgaaccgttctgtcactcaaagtactct actcctcgtcttcctgctgctctggaacaagtcaggctccagaaacaggttgataagaac tttcttaaagcagaaaagcaaaggttgcgagctgagaagaaacaacgtaaagcagaggtc aaggaacttaaaaagcagcttaaactccataggaaaattcacctgtggaattcaatccat ggactccagagccccaagtctcctttaggccgacctgagagcttgctccagtctaatacc ctgatgctccctttgcagccctgtacctccgttatgccaatgctcagtgcagcatttgtg gatgagaatttgcctgatgggactcaccttcagccaggaaccaagtttatcaaacactgg aggatgaaaaatacaggaaatgtaaagtggagtgcagacacaaagctcaagttcatgtgg ggaaacctgactttggcttccacagaaaagaaggatgttttggttccctgcctcaaggcc ggccatgtgggagttgtatctgtggagttcattgccccagccttggagggaacgtatact tcccattggcgtctttctcacaaaggccagcaatttgggcctcgggtctggtgcagtatc atagtagatcctttcccctccgaagagagccctgataacattgaaaagggcatgatcagc tcaagcaaaactgatgatctcacctgccagcaagaggaaacttttcttctggctaaagaa gaaagacagcttggtgaagtgactgagcagacagaagggacagcagcctgcatcccacag aaggcaaaaaatgttgccagtgagagggagctctacatcccatctgtggatcttctgact gcccaggacctgctgtcctttgagctgttggatataaacattgttcaagagttggagaga gtgccccacaacacccctgtggatgtgactccctgcatgtctcctctgccacatgacagt cctttaatagagaagccaggcttggggcagatagaggaagagaatgaaggggcaggattt aaagcacttcctgattctatggtgtcagtaaagaggaaggctgagaacattgcttctgtg gaggaagcagaagaagacctgagtgggacccagtttgtgtgtgagacagtaatccgatcc cttaccttggatgctgccccagaccacaaccctccttgcagacagaagtccttgcagatg acatttgccttgcctgaaggaccacttggaaatgagaaggaggagattatccatatcgct gaggaagaagctgtcatggaggaggaggaggatgaggaggatgaggaggaggaggatgag ctcaaagatgaagttcaaagtcagtcctctgcttcctcagaggattacatcatcatcctg cctgagtgctttgataccagccgccccctgggggattctatgtacagctctgcgctctca cagccaggcctggagcgaggtgctgaaggcaagcctggggttgaggctgggcaggaacca gctgaggctggggaaagactccctggaggggagaaccagccacaggagcacagcataagt gacatcctcacgacctcacagactctggaaacagtgcccctaatcccagaggtagtggag cttccaccgtcactgcccaggcaccatcatgggagcagcattgctggaggactggtgaag ggggctttgtctgttgctgcctctgcatacaaggccctgtttgctgggccaccagtcact gcacagccaataatttctgaagatcagacagcagccctgatggcccatctctttgaaatg ggattctgtgacaggcagctgaacctacggctgctgaagaaacacaattacaatatcctg caggttgtgacagaacttcttcagttaaacaacaacgactggtacagccaacgctattga >gi568815581f:43075800_43310071|GENSCAN_predicted_peptide_5|2549_aa MGKTFSQLGSWREDENKSILSSKPAIGSKAVNYSSTGSSKSFCSCVPCEGTADASFVTCP TCQGSGKIPQELEKQLVALIPYGDQRLKPKHTKLFVFLAVLICLVTSSFIVFFLFPRSVI VQPAGLNSSTVAFDEADIYLNITNILNISNGNYYPIMVTQLTLEVLHLSLVVGQVSNNLL LHIGPLASEQMFYAVATKIRDENTYKICTWLEIKVHHVLLHIQLYLSKHGLGGMGDGSSD SHKLKPPTDLSLLNEGRCAEYNGLEHLVVLAEVVAVVVVAVALATPVPVAMLMAVVVAVM VAALMAAAVMAVVAPVMSTISPLVVAAHLTSYVAGVFPPIVVLPPGDSSVPGKKPPGKWH HPLPTQSPICQRNFREQGKEDPEEPSLSHAPVLSAAAPAAAGPSAISFGPKPSAGAGPPS GRHPLDCQAEILRTSPKEEVVAFPKRSSHGGNDSVPHKPTVYRPHIRNLKETRCIPGGGA PFLEVLQYQVDAWSGRSKLLFHLPAPKIHLIYCPRIEDVSDIKLIRTDTTLDLSQKAEKR CARLRARRHRPTLIHIQVAFLEDSSSRTSRSTTVFDLSDKCIEETLTFAVLPLHLGYHKG KGPLSSGHYRIEPQKTSESNKRPLLQQLLITGLPPGYSSCVSRPNPRSSSAITSADAGGF LGRHPLDCQAEILRTSPKEEVVAFPKRSSHGGNESVPHKPTVYRPHIRNLKETRCIPGGG APFLEVLQYQVDAWSGRSKLLFHLPAPKIHLIYCPRIEDVSDIKLIRTDTTLDLSQKAEK RCARLRARRHRPTLIHIQVAETRCIPGGGAPFLEVLQYQVDAWSGRSKLLFHLPAPKIHL IYCPRIEDVSDIKLIRTDTTLDLSQKAEKRCARLRARRHRPTLIHIQVAFLEDSSSRTSR STTVFDLSDKCIEETLTFAVLPLHLGYHKGKGPLSSAHCQLLSGLKRKKFLLGRHVDTYV YCSTIYNSKDLEPTRMPMNDRLDKESVARRHPLDCQAEILRTSPKEEVVAFPKRSSHGGN DSVPHKPTVYRPHIRNLKETRCIPGGGAPFLEVLQYQVDAWSGRSKLLFHLPAPKIHLIY CPRIEDVSDIKLIRTDTTLDLSQKAEKRCARLRARRHRPTLIHIQVAFLEDSSSRTSRST TVFDLSDKCIEETLTFAVLPLHLGYHKGKGPLSSAHYRIEPQKTSESNKRPLLQQLLITG CPPVLILCLSSKSPLLLSYHLCRCWWFPWETRCIPGGGAPFLEVLQYQVDAWSGRSKLLF HLPAPKIHLIYCPRIEDVSDIKLIRTDTTLDLSQKAEKRCARLRARRHRPTLIHIQVAFL EDSSSRTSRSTTVFDLSDKCIEETLTFAVLPLHLGYHKGKGPLSSAHFLEDSSSRTSRST TVFDLSDKCIEETLTFAVLPLHLGYHKGKGPLSSGHYRIEPQKTSESNKRPLLQQLLITG LPPAITSADAGGFPGYREPKLSFPRGLKPPRGQAPCGRRSVNVCGAKKRRKGRHPLDCQA EILRTSPKEEVMAFPKRSSHGGNESVPHKPTVYRPHIRNLKETRCIPGGGAPFLEVLQYQ VDAWSGRSKLLFHLPAPKIHLIYCPRIEDVSDIKLIRTDTTLDLSQKAEKRCARLRARRH RPTLIHIQVAFLEDSSSRTSRSTTVFDLSDKCIEETLTFAVLPLHLGYHKGKGPLSSGHE TRCIPGGGAPFLEVLQYQVDAWSGRSKLLFHLPAPKIHLIYCPRIEDVSDIKLIRTDTTL DLSQKAEKRCARLRARRHRPTLIHIQVAETRCIPGGGAPFLEVLQYQVDAWSGRSKLLFH LPAPKIHLIYCPRIEDVSDIKLIRTDTTLDLSQKAEKAMRSPSRPPSPSHSHPHSSRALL TPLHSRADFLFSALKREIYTRASSGVPGLSFRICMPRPFTEGVASAVDSAPGAASAWGSR GSAGGDFLVRIEPAKRQNRKRPGAKATGVCGRDLSAESGPRFPGERRGPRAGARAALGLY FAARGRGDRPRRHPLDCQAEILRTSPKEEVVAFPKRSSHGGNESVPHKPTVYRPHIRNLK ETRCIPGGGAPFLEVLQYQVDAWSGRSKLLFHLPAPKIHLIYCPRIEDVSDIKLIRTDTT LDLSQKAEKRCARLRARRHRPTLIHIQVAYRIEPQKTSESNKRPLLQQLLITGLPPGRHP LDCQAEILRTSPKEEVVAFPKRSSHGGNESVPHKPTVYRPHIRNLKETRCIPGGGAPFLE VLQYQVDAWSGRSKLLFHLPAPKIHLIYCPRIEDVSDIKLIRTDTTLDLSQKAEKRCARL RARRHRPTLIHIQVAFLEDSSSRTSRSTTVFDLSDKCIEETLTFAVLPLHLGYHKGKGPL SSGHYRIEPQKTSESNKRPLLQQLLITGLPPEGVASAVDSAPGAASAWGSRGSAGGDFLV RIEPAKRQNRKRPGAKATGVCGRDLSAESGPRFPGERRGPGQERGLRWAFTSPPAGGETG PVPEGTRGRGAHAQSGKPKFLEDSSSRTSRSTTVFDLSDKCIEETLTFAVLPLHLGYHKG KGPLSSGHVTRVTYRSLEMTGTPYPAPLP >gi568815581f:43075800_43310071|GENSCAN_predicted_CDS_5|7650_bp atgggtaagacgttttcccagctgggctcttggcgggaggatgagaacaagtcaatcctg tcctccaaaccagccattggcagcaaggctgtcaactactccagcaccggtagcagcaag tctttttgttcctgtgtgccttgtgaaggaactgctgatgccagcttcgtgacttgtccc acctgccagggcagtggcaagattccccaagagctggagaagcagttggtggctctcatt ccctatggggaccagaggctgaagcccaagcacacgaagctctttgtgttcctggccgtg ctcatctgcctggtgacctcctccttcatcgtctttttcctgtttccccggtccgtcatt gtgcagcctgcaggcctcaactcctccacagtggcctttgatgaggctgatatctacctc aacataacgaatatcttaaacatctccaatggcaactactaccccattatggtgacacag ctgaccctcgaggttctgcacctgtccctcgtggtggggcaggtttccaacaaccttctc ctacacattggccctttggccagtgaacagatgttttacgcagtagctaccaagatacgg gatgaaaacacatacaaaatctgtacctggctggaaatcaaagtccaccatgtgcttttg cacatccagctatatctcagtaagcatgggcttgggggaatgggtgatggctccagtgac agtcataaactgaagccccccacagacttatcactgctgaatgagggcaggtgtgctgag tacaacgggcttgagcacttggttgtgttggctgaggtggtggctgtggttgtggtggct gtggccctggccacacctgtgcctgtggcaatgttgatggctgtggtggtggctgtgatg gtggctgcactgatggctgctgctgtgatggctgtggttgctcctgtgatgtccacaatt tcgccactggtggttgcggcacatttaacttcttatgtggctggagtttttcctccgatt gttgttcttcctcctggtgattcttcagttcctggcaagaagcccccgggcaagtggcat catcctctccccacccaatcacccatctgccagagaaacttcagggaacaaggcaaagag gaccctgaagaacccagcctatcccatgcacctgttctttctgctgcagctcctgctgct gctgggccatcagccatctccttcggtccaaagccatctgccggcgccgggcctcccagt ggtcgccacccactcgactgccaagctgagatcctaaggacctccccaaaggaggaggtc gtggccttcccaaagcgcagtagccacggtggaaacgacagcgtgccgcataagcctacc gtctaccgcccgcacatcaggaacctcaaggaaacacgttgtatccccggagggggtgca ccgttcctggaggtactgcaataccaggtcgatgcgtggagtggacggagcaagctccta ttccatctccctgctccaaaaatccatttaatatattgtcctcggatagaggacgtatca gatattaaactgataagaacagatactacacttgatcttagccaaaaggccgagaagcga tgcgctcgccttcgcgcccgccgtcaccgtcccactctcatccacattcaagtcgcgttc ttagaagacagcagtagcagaactagtaggagtaccacagtcttcgatctttctgataag tgcatagaagaaacgctgacgtttgctgtcctccctctccacctcggctaccacaaaggg aaaggccccctgtccagtggacactaccgcatagagccgcagaagactagcgagtcgaac aaacggcccctcttgcagcagttgttgatcacggggctgccccccgggtactcatcctgt gtctctcgtccaaatccccgctcctcctcagctatcacctctgccgatgctggtggtttc ctgggtcgccacccactcgactgccaagctgagatcctaaggacctccccaaaggaggag gtcgtggccttcccaaagcgcagtagccacggtggaaacgaaagcgtgccgcataagcct accgtctaccgcccgcacatcaggaacctcaaggaaacacgttgtatccccggagggggt gcaccgttcctggaggtactgcaataccaggtcgatgcgtggagtggacggagcaagctc ctattccatctccctgctccaaaaatccatttaatatattgtcctcggatagaggacgta tcagatattaaactgataagaacagatactacacttgatcttagccaaaaggccgagaag cgatgcgctcgccttcgcgcccgccgtcaccgtcccactctcatccacattcaagtcgcg gaaacacgttgtatccccggagggggtgcaccgttcctggaggtactgcaataccaggtc gatgcgtggagtggacggagcaagctcctattccatctccctgctccaaaaatccattta atatattgtcctcggatagaggacgtatcagatattaaactgataagaacagatactaca cttgatcttagccaaaaggccgagaagcgatgcgctcgccttcgcgcccgccgtcaccgt cccactctcatccacattcaagtcgcgttcttagaagacagcagtagcagaactagtagg agtaccacagtcttcgatctttctgataagtgcatagaagaaacgctgacgtttgctgtc ctccctctccacctcggctaccacaaagggaaaggccccctgtccagtgcacactgccag ctactcagcgggctgaagaggaagaagttcctactaggaagacacgtggacacgtatgtt tactgcagcactatttacaatagcaaagacttggaaccaactcgaatgcccatgaatgat aggctggataaagaaagtgtggcacgtcgccacccactcgactgccaagctgagatccta aggacctccccaaaggaggaggtcgtggccttcccaaagcgcagtagccacggtggaaac gacagcgtgccgcataagcctaccgtctaccgcccgcacatcaggaacctcaaggaaaca cgttgtatccccggagggggtgcaccgttcctggaggtactgcaataccaggtcgatgcg tggagtggacggagcaagctcctattccatctccctgctccaaaaatccatttaatatat tgtcctcggatagaggacgtatcagatattaaactgataagaacagatactacacttgat cttagccaaaaggccgagaagcgatgcgctcgccttcgcgcccgccgtcaccgtcccact ctcatccacattcaagtcgcgttcttagaagacagcagtagcagaactagtaggagtacc acagtcttcgatctttctgataagtgcatagaagaaacgctgacgtttgctgtcctccct ctccacctcggctaccacaaagggaaaggccccctgtccagtgcacactaccgcatagag ccgcagaagactagcgagtcgaacaaacggcccctcttgcagcagttgttgatcacgggc tgccccccggtactcatcctgtgtctctcgtccaaatccccgctcctcctcagctatcac ctctgccgatgctggtggtttccctgggaaacacgttgtatccccggagggggtgcaccg ttcctggaggtactgcaataccaggtcgatgcgtggagtggacggagcaagctcctattc catctccctgctccaaaaatccatttaatatattgtcctcggatagaggacgtatcagat attaaactgataagaacagatactacacttgatcttagccaaaaggccgagaagcgatgc gctcgccttcgcgcccgccgtcaccgtcccactctcatccacattcaagtcgcgttctta gaagacagcagtagcagaactagtaggagtaccacagtcttcgatctttctgataagtgc atagaagaaacgctgacgtttgctgtcctccctctccacctcggctaccacaaagggaaa ggccccctgtccagtgcacacttcttagaagacagcagtagcagaactagtaggagtacc acagtcttcgatctttctgataagtgcatagaagaaacgctgacgtttgctgtcctccct ctccacctcggctaccacaaagggaaaggccccctgtccagtggacactaccgcatagag ccgcagaagactagcgagtcgaacaaacggcccctcttgcagcagttgttgatcacgggg ctgccccccgctatcacctctgccgatgctggtggtttccctgggtaccgggaaccaaaa ctctcattccccaggggcctaaagcccccccgtggccaggccccctgtggaaggcgctca gtaaacgtttgtggagcgaagaaacgacgcaaaggtcgccacccactcgactgccaagct gagatcctaaggacctccccaaaggaggaggtcatggccttcccaaagcgcagtagccac ggtggaaacgaaagcgtgccgcataagcctaccgtctaccgcccacacatcaggaacctc aaggaaacacgttgtatccccggagggggtgcaccgttcctggaggtactgcaataccag gtcgatgcgtggagtggacggagcaagctcctattccatctccctgctccaaaaatccat ttaatatattgtcctcggatagaggacgtatcagatattaaactgataagaacagatact acacttgatcttagccaaaaggccgagaagcgatgcgctcgccttcgcgcccgccgtcac cgtcccactctcatccacattcaagtcgcgttcttagaagacagcagtagcagaactagt aggagtaccacagtcttcgatctttctgataagtgcatagaagaaacgctgacgtttgct gtcctccctctccacctcggctaccacaaagggaaaggccccctgtccagtggacacgaa acacgttgtatccccggagggggtgcaccgttcctggaggtactgcaataccaggtcgat gcgtggagtggacggagcaagctcctattccatctccctgctccaaaaatccatttaata tattgtcctcggatagaggacgtatcagatattaaactgataagaacagatactacactt gatcttagccaaaaggccgagaagcgatgcgctcgccttcgcgcccgccgtcaccgtccc actctcatccacattcaagtcgcggaaacacgttgtatccccggagggggtgcaccgttc ctggaggtactgcaataccaggtcgatgcgtggagtggacggagcaagctcctattccat ctccctgctccaaaaatccatttaatatattgtcctcggatagaggacgtatcagatatt aaactgataagaacagatactacacttgatcttagccaaaaggccgagaaagcgatgcgc tcgccttcgcgcccgccgtcaccgtcccactctcatccacattcaagtcgcgcgctattg acgcccttacactctcgggctgatttcttattctccgccttgaaaagggaaatctacacc cgtgcttcttccggcgttcccgggctttcatttcgaatttgcatgccccgccctttcaca gagggcgtggcctccgccgttgactccgcccccggggccgcctctgcctgggggagccgg ggctccgctgggggcgacttccttgttcgtatcgagccagcgaaaagacagaaccggaag agaccgggggcgaaggcgacaggggtctgtggaagagacctgtcggcggagagcggtcca cgttttcctggagaaagacgaggccccagggcaggagcgcgggctgcgctgggcctttac ttcgccgcccgcgggcggggagaccggccccgtcgccacccactcgactgccaagctgag atcctaaggacctccccaaaggaggaggtcgtggccttcccaaagcgcagtagccacggt ggaaacgaaagcgtgccgcataagcctaccgtctaccgcccgcacatcaggaacctcaag gaaacacgttgtatccccggagggggtgcaccgttcctggaggtactgcaataccaggtc gatgcgtggagtggacggagcaagctcctattccatctccctgctccaaaaatccattta atatattgtcctcggatagaggacgtatcagatattaaactgataagaacagatactaca cttgatcttagccaaaaggccgagaagcgatgcgctcgccttcgcgcccgccgtcaccgt cccactctcatccacattcaagtcgcgtaccgcatagagccgcagaagactagcgagtcg aacaaacggcccctcttgcagcagttgttgatcacggggctgccccccggtcgccaccca ctcgactgccaagctgagatcctaaggacctccccaaaggaggaggtcgtggccttccca aagcgcagtagccacggtggaaacgaaagcgtgccgcataagcctaccgtctaccgcccg cacatcaggaacctcaaggaaacacgttgtatccccggagggggtgcaccgttcctggag gtactgcaataccaggtcgatgcgtggagtggacggagcaagctcctattccatctccct gctccaaaaatccatttaatatattgtcctcggatagaggacgtatcagatattaaactg ataagaacagatactacacttgatcttagccaaaaggccgagaagcgatgcgctcgcctt cgcgcccgccgtcaccgtcccactctcatccacattcaagtcgcgttcttagaagacagc agtagcagaactagtaggagtaccacagtcttcgatctttctgataagtgcatagaagaa acgctgacgtttgctgtcctccctctccacctcggctaccacaaagggaaaggccccctg tccagtggacactaccgcatagagccgcagaagactagcgagtcgaacaaacggcccctc ttgcagcagttgttgatcacggggctgccccccgagggcgtggcctccgccgttgactcc gcccccggggccgcctctgcctgggggagccggggctccgctgggggcgacttccttgtt cgtatcgagccagcgaaaagacagaaccggaagagaccgggggcgaaggcgacaggggtc tgtggaagagacctgtcggcggagagcggtccacgttttcctggagaaagacgaggccca gggcaggagcgcgggctgcgctgggcctttacttcgccgcccgcgggcggggagaccggc cccgtacccgaggggacgaggggacgaggggcccatgcccagtcagggaagcctaagttc ttagaagacagcagtagcagaactagtaggagtaccacagtcttcgatctttctgataag tgcatagaagaaacgctgacgtttgctgtcctccctctccacctcggctaccacaaaggg aaaggccccctgtccagtggacacgtgactcgcgtgacctatcgatcattggagatgact ggcactccttaccctgcccccttgccttga >gi568815581f:43075800_43310071|GENSCAN_predicted_peptide_6|186_aa MGSAYHWEARRRQMALDRRRWLMAQQQQELQQKEQTPVAFAPGLFRFCLFAGSIRTRKSP PAEPRLPQAEAAPGAESTAEATPSVKGRGMQIRNESPGTPEEAREREHAVFGFRFLCQFA ENEAFQLHPRSRRGKPPASAEVIAEEERGFGRETQDEYPGGSPVINNCCKRGRLFDSLVF CGSMRY >gi568815581f:43075800_43310071|GENSCAN_predicted_CDS_6|561_bp atgggtagtgcctaccactgggaggcccggcgccggcagatggctttggaccgaaggaga tggctgatggcccagcagcagcaggagctgcagcagaaagaacagacccctgtcgccttc gcccccggtctcttccggttctgtcttttcgctggctcgatacgaacaaggaagtcgccc ccagcggagccccggctcccccaggcagaggcggccccgggggcggagtcaacggcggag gccacgccctctgtgaaagggcggggcatgcaaattcgaaatgaaagcccgggaacgccg gaagaagcacgggagcgagaacatgcggtgtttggttttcgcttcctgtgtcagtttgct gagaatgaggccttccagcttcatccacgttcccgcagagggaaaccaccagcatcggca gaggtgatagctgaggaggagcggggatttggacgagagacacaggatgagtacccgggg ggcagccccgtgatcaacaactgctgcaagaggggccgtttgttcgactcgctagtcttc tgcggctctatgcggtactaa