GENSCAN 1.0 Date run: 4-Nov-116 Time: 22:43:57

Sequence gi568815597f:147078118_147279583 : 201466 bp : 40.62% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +    184    489  306  1  0   72    9   184 0.690   3.54
 1.02 Intr +    613    807  195  1  0   82   43   212 0.556  13.71
 1.03 Intr +    828   1082  255  0  0  -43   72   247 0.414   5.64
 1.04 Term +   1895   2399  505  2  1   34   38   283 0.672  10.83
 1.05 PlyA +   3619   3624    6                               1.05

 2.15 PlyA -   3943   3938    6                               1.05
 2.14 Term -   4865   4221  645  2  0  -62   29   601 0.592  32.33
 2.13 Intr -   5587   5324  264  1  0   17    5   246 0.117   5.99
 2.12 Intr -   6406   6332   75  1  0   46   78    82 0.218   1.79
 2.11 Intr -  20454  20329  126  0  0   54   82   118 0.167   7.76
 2.10 Intr -  26641  26503  139  0  1   79  -11   139 0.079   2.65
 2.09 Intr -  28058  27921  138  1  0   95   77    66 0.395   4.86
 2.08 Intr -  31044  30978   67  1  1   42   50    70 0.324  -4.36
 2.07 Intr -  31917  31745  173  2  2   91   76   191 0.881  16.86
 2.06 Intr -  32803  32679  125  1  2   55   93    72 0.759   2.86
 2.05 Intr -  35514  35325  190  2  1   77   79    78 0.842   4.47
 2.04 Intr -  36154  36103   52  2  1   64  109    80 0.961   4.75
 2.03 Intr -  36784  36664  121  1  1   41   84    62 0.161   0.25
 2.02 Intr -  37260  37150  111  0  0   56   74    76 0.136   2.66
 2.01 Init -  46569  46528   42  0  0   53  110    79 0.411   7.07
 2.00 Prom -  47194  47155   40                              -3.65

 3.02 PlyA -  47205  47200    6                              -3.94
 3.01 Sngl -  48308  47343  966  2  0   75   49   243 0.673  15.54
 3.00 Prom -  48550  48511   40                             -10.15

 4.03 PlyA -  48698  48693    6                               1.05
 4.02 Term -  49533  49031  503  1  2   50   32   298 0.264  14.06
 4.01 Init -  73918  73777  142  1  1   51   37   108 0.116   2.34
 4.00 Prom -  76497  76458   40                              -6.45

 5.11 PlyA -  77012  77007    6                               1.05
 5.10 Term -  81525  81448   78  0  0  109   49    96 0.921   4.58
 5.09 Intr -  82498  82427   72  1  0   85   70    38 0.539   0.38
 5.08 Intr -  83663  83595   69  2  0   83  115    14 0.800   2.06
 5.07 Intr -  84456  84323  134  1  2   67   65    65 0.821   1.54
 5.06 Intr -  88501  88381  121  1  1   61   76    58 0.862   1.05
 5.05 Intr -  88822  88729   94  0  1   84   94    90 0.977   8.25
 5.04 Intr -  89816  89650  167  2  2   62   46   188 0.721   9.84
 5.03 Intr -  93208  93059  150  1  0  112   47   107 0.892   8.44
 5.02 Intr -  94073  93872  202  1  1   29  121   332 0.432  28.97
 5.01 Init -  95145  95075   71  0  2   73    9   100 0.325  -0.83
 5.00 Prom -  95947  95908   40                              -3.85

 6.00 Prom +  99081  99120   40                              -7.55
 6.01 Sngl +  99997 101382 1386  0  0   91   35  1800 0.995 168.82
 6.02 PlyA + 101810 101815    6                               1.05

 7.04 PlyA - 102500 102495    6                               1.05
 7.03 Term - 102959 102727  233  0  2   90   46   128 0.166   4.45
 7.02 Intr - 109128 108866  263  2  2   99  -41   239 0.098   8.31
 7.01 Init - 112100 112060   41  2  2   70  116    52 0.582   5.91
 7.00 Prom - 113051 113012   40                              -3.65

 8.03 PlyA - 113641 113636    6                               1.05
 8.02 Term - 114428 113942  487  1  1   68   47   286 0.983  15.69
 8.01 Init - 116005 114987 1019  1  2   49   72   481 0.730  35.75
 8.00 Prom - 116098 116059   40                              -6.15

 9.10 PlyA - 116267 116262    6                               1.05
 9.09 Term - 117069 116495  575  1  2 -138   43   375 0.280   4.03
 9.08 Intr - 123387 123035  353  2  2   66  115   350 0.916  29.54
 9.07 Intr - 125756 125513  244  0  1   19   34   262 0.502   9.53
 9.06 Intr - 126822 126131  692  2  2   41   59   537 0.029  36.34
 9.05 Intr - 130934 130735  200  2  2   46   91   117 0.189   5.03
 9.04 Intr - 134418 134276  143  1  2  128   36   127 0.463  10.65
 9.03 Intr - 135353 135191  163  2  1   98   93   166 0.994  16.73
 9.02 Intr - 137825 137637  189  2  0   89   70   155 0.838  12.66
 9.01 Init - 146912 146778  135  2  0   89   88   100 0.951  10.37
 9.00 Prom - 151023 150984   40                              -5.65

10.04 PlyA - 151139 151134    6                               1.05
10.03 Term - 152043 151158  886  2  1  -16   50   338 0.084  10.57
10.02 Intr - 153345 152487  859  1  1   45   72   380 0.092  21.58
10.01 Init - 155798 155678  121  2  1   68   74    82 0.903   5.20
10.00 Prom - 157542 157503   40                              -6.65

11.00 Prom + 158726 158765   40                             -10.05
11.01 Init + 159284 159493  210  1  0   86   58   205 0.851  16.14
11.02 Intr + 163302 163412  111  2  0  105   67    50 0.821   4.26
11.03 Intr + 164341 164491  151  0  1   26   52   104 0.011  -0.59
11.04 Intr + 174506 174618  113  0  2   91   89    71 0.790   6.68
11.05 Intr + 176753 176859  107  1  2  109   77    34 0.837   2.49
11.06 Intr + 181720 181801   82  1  1   72   82    53 0.647   1.82
11.07 Intr + 186976 187045   70  0  1  101   75   -16 0.204  -3.76
11.08 Intr + 187815 187970  156  1  0   72   57   116 0.865   5.96
11.09 Intr + 189309 189401   93  1  0  100   81    80 0.976   7.52
11.10 Intr + 190665 190761   97  1  1   89   69   101 0.731   6.45
11.11 Intr + 192815 192888   74  2  2   83   46    19 0.501  -4.67
11.12 Intr + 194006 194164  159  0  0   56   96    88 0.536   5.44
11.13 Intr + 197237 197351  115  0  1  109  108   136 0.964  16.29
11.14 Intr + 197987 198140  154  2  1   48   80   185 0.988  12.85
11.15 Term + 199785 199985  201  2  0   19   34   234 0.801   7.51
11.16 PlyA + 200636 200641    6                              -3.24

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  42167  42011  157  1  1   85   54   128 0.803   5.52
S.002 Intr -  43058  42956  103  0  1   70   94    83 0.893   5.41
S.003 Term - 109128 108783  346  2  1   99   33   237 0.893  12.28
S.004 Init - 126815 126131  685  2  1   97   59   517 0.912  45.00
S.005 Term + 165326 165496  171  1  0   51   46   174 0.910   6.34

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597f:147078118_147279583|GENSCAN_predicted_peptide_1|420_aa
XARLWPGDPLLSLHALPLHKGASLGRWSGSARTLQLAPGLVSLVSLAECWVRRLRAPLLE
KGGRERPCETTPGATSDLHKSQVFPAHRETVTYRETVASVARRNLPLRRCKRAFSKGQTQ
NSLRSEVQTLAKEGNSWWATSHDTALLFISVSVIRTSGFRERLSVSLRCQPSWNCTQEAH
RLCRLCRPGGADRAELRITKREAKASRGVCACHKLSADRGRSVEPPHLRSRVTCFLGLQK
KRLEAMPARLEVGSRTDRCELSVFYTLNLAIGKKALTQSLGLLPTCTRLPDSRLQARLLA
RWRAASTECGTAAAAAPARWRADTSRRKDTRPAWWESMGDLAFPPDEKVSLQSLDTRWME
APLLGTRRLLLRPGFAQWLLGPARPLPSRGSKSCFTHLSRLPLSPQLSLGQQGPQPVGKT

>gi568815597f:147078118_147279583|GENSCAN_predicted_CDS_1|1263_bp
nntgcgcggctctggcccggtgacccgctactgtctctccatgcgctgccgcttcacaaa
ggcgcatctctaggtcggtggtcaggttctgcccggacgctgcaactggctccgggactt
gtgtccttggtgagcctagcggagtgctgggtgcgtcgcctgcgtgctcctcttttggag
aaaggagggagggaacggccttgtgagacgactccaggagcgaccagcgacctccacaag
tcccaagtcttcccagcgcacagggaaactgtcacttacagggaaactgtcgcttcagtg
gcaagaaggaatcttcctttacgccgctgtaagcgggccttctccaagggccagacgcag
aacagtctccgcagcgaggtgcaaaccctggcgaaggagggcaactcctggtgggccacc tctcatgacacagcgctcttatttatctccgtgtctgtcattcgcacaagcggctttaga gagcgactgagcgtctcgctcagatgccagccctcctggaactgcacccaagaagcccac cgtctttgccgcctctgccgtcccggaggcgccgatcgggctgagctgcgaataactaag agagaggccaaggcaagtcgtggcgtttgtgcgtgccacaaattatcagctgacagggga cggtcagtggagcctcctcacctccgttcgcgggtaacgtgcttcttaggccttcagaag aagcgactggaggcgatgcccgcgaggttggaagtgggctctcgaacagacagatgtgag ctctctgtcttttacacgctgaatttggctattggcaaaaaagccctgacccagagcttg ggtctccttccgacctgcacacgactccccgactcccgcctccaagcgcggctcttggct cgctggcgggcagcgtccacagagtgtggaaccgccgcagccgcagctcccgcccgctgg cgggcagacactagcaggagaaaggacacaaggcctgcgtggtgggaaagcatgggagac ctcgctttcccaccggacgagaaggtctccctgcagtctttggacaccagatggatggag gcaccccttctaggaacaaggcggctgctcctgaggcctggcttcgcacagtggctcctg ggtcccgcgcgccctctcccttcccgcggtagcaagagctgctttacacatctcagccgg cttcctctttctccccagctgtccttgggacagcaaggcccccagcccgtaggaaagacc tag >gi568815597f:147078118_147279583|GENSCAN_predicted_peptide_2|755_aa MTAERMVPLNGDPQSYRSTFHSLEEQQVGLALDIGSEYSIVKVIKLQLASQASVTLVYLS VEDVTQVSLNLSPFSVSSKLACFSSSVHHLAGPQCDQVKKDDREATCPSFSYFFKLSREL LEVEEPEVLQDSLDRYYSTPTSYLELPDSCQPYRSTFTHWRNSMLAWLLMGTDWTVDCCM NRGIDLMCLSLSDFVADIEKNQEEEEDQDPQCPRLSPDLPEVEEQDVPQDSLDEVYLTPS LPHDLSDCQQPYNSTLYSLEDQLTCFALDVACAGLTDEESTVVLIPMISSKPLLLLSHLM RNAHRCGGAGHPFKSDTLCSQEDEEQIHGRHFKENLEKPCSHPGGLSSRALEWRLEPPPV RGGGFTGTAGAKNPGAQLSVTATGSRGHQRAPWSSGNASIWEDAHCHVDRGLAVASVSRK NTHYLVDWLGGYEIESNALVVVLAAGEIIRSAESTHTSINLIVKSNKKTTPGVIAVAVCE PVARLYGTGTLKIATMAENDDNEKMAALEAKICHQIEYYFGDFNLPGDKFLREQIKLDEG WKEEERKQNKVEAKLRAKQEQEANQKLEEDAEMKSLEEKIGCLLKFSGDLDDQTCREDLH ILFSSHGEIKWIDFIRGAKEGIILFKEKDKEALGKAKDVNNGNLQLQSKEVTWEVLEGEV EREALKKITEDQQESLNIWKSKGHRFKGKGKGNKAAHPRSGKEKVQFQGKKMKFANDDEH DENGATGPMKRSGEETDKEEPVSKQQKRENGAGDE >gi568815597f:147078118_147279583|GENSCAN_predicted_CDS_2|2268_bp atgactgctgaacgaatggtccccctcaatggtgaccctcagtcttacaggagcaccttt cactcattagaggaacagcaagttggcttggctcttgacataggtagtgagtactccatt 
gtgaaggtgataaagctccagttagcatcccaggccagtgtcacccttgtctacctctca gtggaagatgtgacccaggtttcactgaatttatctccattttctgtgtcttctaagttg gcttgttttagctcatctgtccatcatcttgctggacctcagtgtgatcaagtgaaaaag gacgatcgagaggcaacatgtcccagtttttcctacttcttcaagctcagcagggagctg ttggaagtggaagagcctgaagtcttgcaggactcactggatagatattattcgactcct accagttatcttgaactgcctgactcatgccagccctacagaagtacttttactcattgg aggaacagcatgttggcttggctcttgatggggacagattggacagtggattgttgcatg aataggggaatagacttaatgtgtctgtccctgtctgactttgttgcagacattgaaaaa aatcaagaagaagaagaagaccaagacccacaatgccccaggctgagcccggatctgcca gaggtggaggaacaggacgtcccacaggactccctggatgaagtttacttgactccttca ctcccccatgacctgtctgactgccagcaaccttacaacagcacgttgtactcattggag gatcagctcacctgctttgctctcgatgtagcctgtgctggcctaactgatgaagaaagc acagtggttctcatccccatgatctccagcaaacccctccttctcttgtcccacctgatg agaaacgcccacaggtgtggaggggcaggccaccccttcaagtctgatacactgtgctct caggaagatgaagaacagatacacggaaggcattttaaggaaaacttagagaaaccttgc tcccacccaggtggcctgtcctcacgggccttggagtggagacttgagccaccacctgtc agaggtggaggcttcacaggcacagctggagccaagaaccctggtgcccagttgtctgtg actgcaactggatcaaggggccaccagagggctccttggtctagtggtaatgccagcatc tgggaagacgcccattgccatgtggaccgtggtctagcggtagcctcagtgtcaaggaaa aacacccactacttagtggactggttaggaggctatgaaatagaaagtaacgcgttggtt gttgtccttgccgccggagaaattataaggtcagccgagtcaactcacacttcgatcaac ttaattgtaaaaagtaataagaaaaccactccaggagtcattgctgttgctgtttgtgag cctgtggcacggctctatgggactggaactttaaagatagccacaatggctgaaaatgat gataatgaaaagatggctgccctggaggccaaaatctgtcatcaaattgagtattatttt ggagacttcaatttgccaggagacaagtttttaagggaacagataaaactggacgaaggc tggaaagaagaagaaagaaaacaaaataaagtggaagctaaattaagagctaaacaagag caagaagcaaatcaaaagttagaagaagatgctgaaatgaaatctctagaagagaagatt ggatgcttgctgaagttttcaggtgatttagatgatcagacctgtagagaagatttacac atccttttctcaagtcatggtgaaataaaatggatagacttcatcagaggagcaaaagag gggataattctatttaaagaaaaagacaaggaagcgttgggtaaagccaaagatgtaaat aatggtaatctacaattacagagcaaagaagtgacttgggaagtactagaaggagaggtg gaaagagaagcactgaaaaaaataacagaagaccaacaagaatccctaaacatatggaag 
tcaaaaggacacagatttaaaggaaaaggaaagggtaataaagctgcccatcctaggtct ggtaaagaaaaagtacagtttcagggcaagaaaatgaaatttgctaatgatgatgaacat gatgaaaatggtgcaactggacctatgaaaagatcaggagaagaaacagacaaagaagaa cctgtgtccaaacaacagaaaagagaaaatggtgctggagacgagtag >gi568815597f:147078118_147279583|GENSCAN_predicted_peptide_3|321_aa MALLPKVIYGFNAIPIKLPMTFFTELEKTTLKFIWNQKRAHIAKSILSQKNKAGGITPPD FKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEIMPHIYNYLIFDKPEKNKQWGKDSLFNK WCWENWLAICRKLKLDPFLTPYTKINSRWIKDLHVRPKTIKTLEGNLGITIQDIGMGKDF MSKTPKATATKAKIDKRDLIKLKSFCTAKETTIRVNRKPTKWEKIFATYSSDKGLISRIY NELKQIYKKKTNNPIKKWAKDMNRHFSKEDIYAAKRHMKKCSPSLAIREMQIKTAMRYHL TPFRIAIIKKETTGAGEDVEK >gi568815597f:147078118_147279583|GENSCAN_predicted_CDS_3|966_bp atggccttactgcccaaggtaatttatggattcaatgccatccccatcaagctaccaatg actttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcc cacatcgccaagtcaatcctaagccaaaagaacaaagctggaggcatcacgccacctgac ttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacaga gatatagatcaatggaacagaacagagccctcagaaataatgccacatatctacaactat ctgatctttgacaaacctgagaaaaacaagcaatggggaaaggattccctatttaataaa tggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttaca ccttatacaaaaattaattcaagatggattaaagacttacatgttagacctaaaaccata aaaaccctagaaggaaacctaggaattaccattcaggacataggcatgggcaaggatttc atgtctaaaacaccaaaagcaacggcaacaaaagccaaaattgacaaacgggatctaatt aaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacagaaaacctaca aaatgggagaaaatttttgcaacctactcatctgacaaagggctaatatccagaatctac aatgaactcaaacaaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggcaaag gacatgaacagacacttctcaaaagaagacatttatgcagccaaaagacacatgaaaaaa tgctcaccatcactggccatcagagaaatgcaaatcaaaactgcaatgagatatcatctc acaccatttagaattgcaatcattaaaaaggaaacaacaggtgctggagaggatgtggaa aaatag >gi568815597f:147078118_147279583|GENSCAN_predicted_peptide_4|214_aa MFSTKQEKVYHIRHRKKTEALKKIEKSNHYEKKKNPSIEIDSELLHEEIQTTIREYYKHL YANKLENREEMDKFLDTYNLPRLNQEEVESLNRPITVSETVAIINSLPTKKSPGPDGFTA KFYQRYKEELVPFLLKLFQSIEKEGILPNSFYEASIILIPKPGRDTTKKENFRPISLMNI DAKILNKILAKRIQQHIKKLIHHNQVGFIPGMQG 
>gi568815597f:147078118_147279583|GENSCAN_predicted_CDS_4|645_bp atgttcagcaccaaacaagaaaaagtttatcatatccgacatcggaaaaaaactgaggca ctcaaaaaaatagaaaaatcaaaccattatgagaaaaaaaaaaatccatcaattgaaatt gactcagaactgttacatgaggaaatacaaactaccatcagagaatactacaaacacctc tacgcaaataaactagaaaatcgagaagaaatggataaattcctcgacacatacaacctc ccaagactaaaccaggaagaagttgaatctctgaatagaccaataacagtctctgaaact gtggcaataatcaatagcttaccaaccaaaaagagtccaggaccagatggattcacagcc aaattctaccagaggtacaaggaggagctggtaccattccttctgaaactattccaatca atagaaaaagagggaatcctccctaactcattttatgaggccagcatcatcctgatacca aagccgggcagagacacaaccaaaaaagagaattttagaccaatatccttgatgaacatt gatgcaaaaatcctcaataaaatactggcaaaacgaatccagcagcacatcaaaaagctt atccaccataatcaagtgggcttcatccctgggatgcaaggctag >gi568815597f:147078118_147279583|GENSCAN_predicted_peptide_5|385_aa MARSRLTATSASRVQAILLPHPPNPVSFPAVGGPRRAAAMGNTTSDRVSGERHGAKAARS EGAGGHAPGKEHKIMVGSTDDPSVFSLPDSKVEEPMIPRLVYSSVVLQNVAWLKKRQPTA ALKTLLSIPRTFRKRTTCFHSLPGDKEFVSWQQDLEDSVKPTQQARPTVIRWSEGGKEVF ISGSFNNWSTKIPLIKSHNDFVAILDLPEGEHQYKFFVDGQWVHDPSEPVVTSQLGTINN LIHVKKSDFEVFDALKLDSMESSETSCRDLSSSPPGPYGQEMYAFRSEERFKSPPILPPH LLQVILNKDTNISCDPALLPEPNHVMLNHLYALSIKALCQHDSGVLTREAFMVDNTVRGK DSVMVLSATHRYKKKYVTTLLYKPI >gi568815597f:147078118_147279583|GENSCAN_predicted_CDS_5|1158_bp atggcgcgatctcggctcaccgcaacctccgcctcccgggttcaagcgattcttctgcct catcctcccaacccggtgtcgttccccgcggtaggtggtccccgacgagctgcagccatg ggaaacaccaccagcgaccgggtgtccggggagcgccacggcgccaaggctgcacgctcc gagggcgcaggcggccatgccccggggaaggagcacaagatcatggtggggagtacggac gaccccagcgtgttcagcctccctgactccaaggtggaagagccaatgattccacgactt gtttactcctcagtagtacttcaaaatgttgcgtggctaaaaaaacgacaacccacagct gccctgaagaccttgctctcaattccaaggactttcaggaagagaacaacttgttttcac tctctccctggggacaaagagtttgtatcatggcagcaggatttggaggactccgtaaag cccacacagcaggcccggcccactgttatccgctggtctgaaggaggcaaggaggtcttc atctctgggtccttcaacaattggagcaccaagattccactgattaagagccataatgac tttgttgccatcctggacctccctgagggagagcaccaatacaagttctttgtggatgga cagtgggttcatgatccatcagagcctgtggttaccagtcagcttggcacaattaacaat 
ttgatccatgtcaagaaatctgattttgaggtgttcgatgctttaaagttagattctatg gaaagttctgagacatcttgtagagacctttccagctcacccccagggccttatggtcaa gaaatgtatgcgtttcgatctgaggaaagattcaaatccccacccatccttcctcctcat ctacttcaagttattcttaacaaagacactaatatttcttgtgacccagccttactccct gagcccaaccatgttatgctgaaccatctctatgcattgtccattaaggctttgtgtcag catgactctggagtgttaacccgagaagcattcatggtggataatacagtacgggggaag gacagtgtgatggtccttagcgcaacccatcgctacaagaagaagtatgttactactctg ctatacaagcccatttga >gi568815597f:147078118_147279583|GENSCAN_predicted_peptide_6|461_aa MCLRRPALFPGVALLLAAARLAAASDVLGLRDDNLESRISDTGSAGLMLVEFFAPWCGHC KRLAPEYEAAATRLKGIVPLAKADCTANTNTCNKYGVSGYPTLNMFRDGEEAGAYDGPRT ADGIVSHLKKQAGPASVPLRTEEEFKKFISDKDASIVGFFDDSFSEAHSEFLKAASNLRD NYRFAHTNVESLVNEYDDNGDGIILFRPSHLTNKLEDKTVAYTVQKMTSGKIKKFIQENI FGICPHMTEDNKDLIQGKDLLIAYYDVDYEKNAKGSNYWRNRVMMVAKKFLDAGHKLNFA VASRKTFSHELSDFGLESTAGEIPVVAIRTAKGEKFVMQEDFSRDGNALERFLQDYFDGN LKRYLKSEPIPESNDGPVKVVVAENFDEIVNNENKDVLIEFYAPWCGHCKNLEPKYKELG EKLSKDLNIVIAKMDATANDVPSPYEVRVFLPYTSLQPTRS >gi568815597f:147078118_147279583|GENSCAN_predicted_CDS_6|1386_bp atgtgcctccgccgcccagcgctgttcccgggcgtggcgctgcttctcgccgcggcccgc ctcgcggctgcctccgacgtgctaggactcagggacgacaacttggagagtcgcatctcc gacacgggctctgcgggcctcatgctcgtcgagttcttcgccccctggtgtggacactgc aagagacttgctcctgagtatgaagctgcagctaccagattaaaaggaatagtcccatta gcaaaggctgattgcactgccaacactaacacctgtaataaatatggagtcagtggatat ccaaccctgaatatgtttagagatggtgaagaagcaggtgcttatgatggacctaggact gctgatggaattgtcagccacctgaagaagcaggcaggcccagcttcagtgcctctcagg actgaggaagaatttaagaaattcattagtgataaagatgcctctatagtaggttttttc gatgattcattcagtgaagctcactccgagttcctaaaagcagccagcaacttgagggat aactaccgatttgcacatacgaatgttgagtctctggtgaacgagtatgatgataacgga gatggtatcatcttatttcgtccttcacatctcactaacaagttggaggacaagactgtg gcatatacagtgcaaaaaatgaccagtggcaaaattaaaaagtttatccaggaaaacatt tttggtatctgccctcacatgacagaagacaataaagatttgatacagggcaaggactta cttattgcttactatgatgtggactatgaaaagaatgctaaaggttccaactactggaga aacagggtaatgatggtggcaaagaaattcctggatgctgggcacaaactcaactttgct 
gtagctagccgcaaaacctttagccatgaactttctgattttggcttggagagcactgct ggagagattcctgttgttgctatcagaactgctaaaggagagaagtttgtcatgcaggag gatttctcgcgtgatgggaatgctctggagaggttcctgcaggattactttgatggcaat ctgaagagatacctgaagtctgaacctatcccagagagcaatgatgggcctgtgaaggta gtggtagcagagaattttgatgaaatagtgaataatgaaaataaagatgtgctgattgaa ttttatgccccttggtgtggtcactgtaagaacctggagcccaagtataaagaacttggc gagaagctcagcaaagacctgaatatcgtcatagccaagatggatgccacagccaatgat gtgccttctccatatgaagtcagagttttcctaccatatacttctctccagccaacaaga agctaa >gi568815597f:147078118_147279583|GENSCAN_predicted_peptide_7|178_aa MAEISKAQEEIDKRYVESQRHTIQGDYIDTMEELADLVGVRPNLLSLAFTDPKLALHLLL GPCTPIHYRVQGPGKWDGARKAILTTDDRIRKPLMTRVVERSMAKHYYGSKSQQTPFCTM PFTGDLEYSSLDPCLGRVSQNVALEAETSALYLTQVLRPHSRLTESETLGTQTSNLHI >gi568815597f:147078118_147279583|GENSCAN_predicted_CDS_7|537_bp atggcagaaatatctaaagctcaagaggaaattgacaaaaggtatgtggagagccaacgc cataccattcagggagactacatagataccatggaagagcttgctgatttggtgggggtc aggcccaatctgctgtctctggccttcactgaccccaagctggcattacacttattactg ggaccctgcactccaatccactatcgtgtacagggccctggaaagtgggatggggctcga aaagctatcctcaccacagatgatcgcatcaggaagcctctgatgacaagagtagttgaa aggagtatggcgaagcattattacggttctaagagtcagcaaactcctttctgcacaatg ccgtttactggggatcttgaatactcgtctctagacccttgtctaggccgcgtttctcaa aatgtggcccttgaggcagaaacatcagccttgtacttgacacaagtcctcaggccccac tccagacttacagaatcagaaactctgggtacacagaccagcaatctgcatatttga >gi568815597f:147078118_147279583|GENSCAN_predicted_peptide_8|501_aa MGDFNTPLSTLDRSTRQKVNKDTQELNSALHQADLIDIYRTLHPKSTEYTFFSAPHHTYC KIDHIVGSKALLSKCKRTEIITNCLSDHSAMKLELRTKKLTQNHSTTWKLNNLLLNDYWV HNETKAEIKMFFETNGNKDTTYQNLWNTFKAVCRGKFIALNAHKRKQERSKIDTLTSQLK ELEKQEQTHSKARRRQEITKIRAELKEIKSQKTLQKINESRSWFFERINKIDKLLARLIK KKREESNRCSKNDKGDITTDPTEIQTTIREYYKHLYTNKLENLEETDKFLNTYTLPRLNQ EEAESLNRPITGSEIVAIINSLPTKKSPGPDGFTAEFYQRVLDILARAIRQEKEIKGIRL GKEEVKLSLFADDMIVYLENPIVSAQNLLKLMSNFSKVSGYKINVQKSQAVLYTNNRQTE SQIMSELPFTIASKRIKYLGIQLTRDVKDLFKEKYKPLLNEIKEDTKKWKNIPCSWVGRI NIVKMAILPKVIYRLNAIRIK 
>gi568815597f:147078118_147279583|GENSCAN_predicted_CDS_8|1506_bp atgggagactttaacaccccactgtcaacattagacagatcaacgagacagaaggttaac aaggatacccaggaattgaactcagctctgcaccaagcagacctaatagacatctacaga actctccaccccaaatcaacggaatatacatttttttcagcaccacaccacacctattgc aaaattgatcacatagttggaagtaaagcactcctcagcaaatgtaaaagaacagaaatt ataacaaactgtctctcagaccacagtgcaatgaaactagaactcaggactaaaaaactc actcaaaaccactcaactacatggaaactgaacaacctgctcctgaatgactactgggta cataacgaaacgaaagcagaaataaagatgttctttgaaaccaatgggaacaaagacaca acataccagaatctctggaacacatttaaagcagtgtgtagagggaaatttatagcacta aatgcccacaagagaaagcaggaaagatccaaaattgacaccctaacatcacaattaaaa gaactagaaaagcaagagcaaacacattcaaaagctagaagaaggcaagaaataactaaa atcagagcagaactgaaggaaattaagtcacaaaaaacccttcaaaaaattaatgaatcc aggagctggttttttgaaaggatcaacaaaattgataaactgctagcaagactaataaag aagaaaagagaagaatcaaatagatgcagtaaaaatgataaaggggatatcaccaccgat cccacagaaatacaaactaccatcagagaatactataaacacctctacacaaataaacta gaaaatctagaagaaacggataaattcctcaacacatacaccctcccaagactaaaccag gaagaagctgaatctctgaatagaccaataacaggctctgaaattgtggcaataatcaat agcttaccaaccaaaaagagtccaggaccagatggattcacagccgaattctaccagaga gtgttggatattctggccagggcaattaggcaggagaaggaaataaagggtattcgatta ggaaaagaggaagtcaaattgtccctgtttgcagacgacatgattgtatatctagaaaac cccattgtctcagcccaaaatctccttaagctgatgagcaacttcagcaaagtctcagga tacaaaatcaatgtacaaaaatcacaagcagtcttatacaccaacaacagacaaacagag agccaaatcatgagtgaactcccattcacaattgcttcaaagagaataaaatacctagga atccaacttacaagggatgtgaaggacctcttcaaggagaaatacaaaccactgctcaat gaaataaaagaggatacaaagaaatggaagaacattccatgctcatgggtaggaagaatc aatatcgtgaaaatggccatactgcccaaggtaatttatagattaaatgccatccgcatc aagtga >gi568815597f:147078118_147279583|GENSCAN_predicted_peptide_9|897_aa MTKKRIAVIGGGVSGLSSIKCCVEEGLEPVCFERTDDIGGLWRFQENPEEGRASIYKSVI INTSKEMMCFSDYPIPDHYPNFMHNAQVLEYFRMYAKEFDLLKYIRFKTTVCSVKKQPDF ATSGQWEVVTESEGKKEMNVFDGVMVCTGHHTNAHLPLESFPGIEKFKGQYFHSRDYKNP EGFTGKRVIIIGIGNSGGDLAVEISQTAKQVFLSTRRGAWILNRVGDYGYPADVLFSSRL THFIWKICGQSLANKYLEKKINQRFDHEMFGLKPKHRLAMALQVPKAPGFAQMLKEGAKH 
FSELEEAVYRNIQACKELAQTTRTAYGRNGMKKMVINHLEKLFVTNDAATILRELEVQHP AAKMTVMASHMQEQEVGDGTNIVLVFAGALLELAEELLRIGLSVSEVIEGYEIACRKAHE ILPNLVRCSAKNLRDVDEVSSLLRTSVMCKQYGNEVFLAKLIVQACVSIFPDSGHFKVDN IRVCKILGCGITSSSVLHGMVFKKETEEVGDTQVVVFKHEKEDGIISTIVLRGSTDNLMD DIERAVDDGVNTFKVLTRDKRLVPRGGATEIELAKQITSYGETCPGLEQALSQHPTLNDD LPNRIISGLVKVKGNVKEFTETAAIFEDGSREDDIDAVIFATGYSFDFPFLEDSVKVVKN KISLYKKVFPPNLERPTLAIIGLIQPLGAIMPISELQGRWATQVFKGIWDYVKRPNLRRI GEPESDGENGTKLENTLQDIIQENFPNLARQANIQIQEIQRTPQRYSSRRATPRHIIVRF TKVEMKEKMLRAAREKGRVTHKGKPIRLTADLSAVTLQARREWGPIFNILKEKNFQPRIS YPAKLSFISEGDIKYFTDKQMLRDFVTTRPALKELLKEALNMERNNRYQPLQKHAKM >gi568815597f:147078118_147279583|GENSCAN_predicted_CDS_9|2694_bp atgactaagaaaagaattgctgtgattgggggaggagtgagcgggctctcttccatcaag tgctgcgtagaagaaggcttggaacctgtctgctttgaaaggactgatgacatcggaggg ctctggaggttccaggaaaatcctgaagaaggaagggccagtatttacaaatcagtgatc atcaatacttctaaagagatgatgtgcttcagtgactatccaatcccagatcattatccc aacttcatgcataatgcccaggtcctggagtatttcaggatgtatgccaaagaatttgac cttctaaagtatattcgatttaagaccactgtgtgcagtgtgaagaagcagcctgatttt gccacttcaggccaatgggaagtggtcactgaatctgaagggaaaaaggagatgaatgtc tttgatggagtcatggtttgcactggccatcacaccaatgctcatctacctctggaaagc ttccctggaattgagaagttcaaagggcagtacttccacagtcgagactataagaaccca gagggattcactggaaagagagtcattataattggcattgggaattctggaggggatctg gctgtagagattagccaaacagccaagcaggttttcctcagcaccaggagaggggcttgg atcctgaatcgtgtaggggactacggatatcctgctgatgtgttgttctcttctcgactt acacattttatatggaagatctgtggccaatcattagcaaacaaatatttggaaaaaaag ataaaccaaaggtttgaccatgaaatgtttggcctgaagcctaaacacaggctggccatg gcgcttcaagttcccaaggctccgggcttcgcccagatgctcaaggagggagcgaaacac ttttcagaattagaagaggctgtgtatagaaacatacaagcttgcaaggagcttgcccaa accactcgtacagcatatggacgaaatggaatgaaaaaaatggttatcaaccacttggag aagttgtttgtgacaaatgatgcagcgactattttaagagaactagaagtacagcatcct gctgcaaaaatgactgtaatggcttctcatatgcaagagcaagaagttggagatggcaca aacattgttctggtatttgccggagctctcctggaattagctgaagaacttctgaggatt ggcctgtcagtttcagaggtcatagaaggttatgaaatagcttgcagaaaagctcatgag 
attcttcctaatttggtacgttgttctgcaaaaaaccttcgagatgttgatgaagtctca tctctacttcgtacctctgtaatgtgtaaacaatatggtaatgaagtatttctggccaag cttattgttcaggcatgcgtatctatttttcctgattctggccatttcaaagttgataac atcagagtttgtaaaattctgggctgtggtatcacttcctcttcagtattgcatggcatg gtttttaagaaggaaacagaagaagttggagatactcaggtggtggtttttaagcatgaa aaggaagatggcatcatttctaccatagtacttcggggctctacagacaatctgatggat gacatagaaagggcagtagatgatggtgttaatactttcaaagttcttacaagggataaa cgtcttgtacccagaggtggagcaacagaaattgaattagccaaacagatcacatcatat ggagagacatgtcctggacttgaacaagctctgagtcagcatccaaccttaaatgatgac ctgccaaatcgtatcatttctggcttggtgaaagtgaaaggaaatgtgaaggaattcacg gagacagctgccatatttgaggatggctccagggaggatgacattgatgctgttatcttt gccacaggctatagctttgactttccatttctggaagattccgtcaaagtggtcaaaaac aagatatccctgtataaaaaggtcttccctcctaacctggaaaggccaactcttgcaatc ataggcttgattcagcccttaggagccattatgcccatttcagagctccaaggacgctgg gccactcaggtatttaaaggaatatgggactatgtgaaaagaccaaatctacgtcggatt ggtgaacctgaaagtgacggggagaatggaaccaagttggaaaacactctgcaggatatt atccaggagaacttccccaatctagcaaggcaggccaacattcagattcaggaaatacag agaacgccacaaagatactcctcaagaagagcaactccaagacacataattgtcagattc accaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaaggtcgggttacc cacaaagggaagcccatcagactaacagcagatctctcggcagtaactctacaagccaga agagagtgggggccaatattcaacattcttaaagaaaagaattttcaacccagaatttca tatccagccaaactaagcttcataagtgaaggagacataaaatactttacagacaagcaa atgctgagagattttgtgaccaccaggcctgctctaaaagagctcctgaaggaagcacta aacatggaaaggaacaaccggtaccagccactgcaaaaacatgccaaaatgtaa >gi568815597f:147078118_147279583|GENSCAN_predicted_peptide_10|621_aa MTLNEHAAFKLLFNKAHLAPPLTHLTLSGHSTCFREHRVGAPHHTYSKIDHIVGSKALLS KCKRTEIITNCLSDHSAIKLELRIKELTQNRSTTWKLNNLFLSDYWVHNEMKVEIKRFFE TNENKDTVYQNLWDTLKAVCRGKFIALNAHKRKQERSKIDTLTSQLKELEKQERTHSKAS RRQEITKIRAELKEIETQKTLQKINESRSFFFEKINKIDRLLARLIKKKREESNRCNKND KGDITTNPTEIQTTIREYYKHLYTNKLENLEEMDKFLNTYTLPRLNREEVESLNRPITGS EIEAIINSLPTKKSPGPDGFTAEFYQRAIYDKTTANIALNGQKLEAFPLKTGTRQGCPLS PLLFNILLEVLARAMRQEKEIKGIQLGKEEVKLSLLAEDMIVYLENPIVSAQNLLKLMSN 
FSKVSGYKINAQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKYLLKEN YKPLLNEIKEDTKKWKNIPCSWVGRINIMKMAILPKVIYRLNAIPIKQPMTFFTELEKTT LKFIWNQKRARIAKSILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEP SEIMPHIYNHLIFDKPHKNKK >gi568815597f:147078118_147279583|GENSCAN_predicted_CDS_10|1866_bp atgactcttaacgagcatgctgccttcaagcttctgtttaacaaagcacatcttgcaccg cccttaacccatttaaccctgagtggacacagcacatgtttcagagagcacagggttggg gcaccacaccacacctattccaaaattgatcacatagttggaagcaaagcactcctcagc aaatgtaaaagaacagaaattataacaaactgtctctcagaccacagtgcaatcaaacta gaactcaggattaaggaactcactcaaaaccgctccactacatggaaactgaacaacctg ttcctgagtgactactgggtacataacgaaatgaaggtagagataaagaggttctttgaa accaacgagaacaaagacacagtgtaccagaatctctgggacacactgaaagcagtgtgt agagggaaatttatagcactaaatgcccacaagagaaagcaggaaagatctaaaattgac accctaacgtcacaattaaaagaactagagaagcaagagcgaacacattcaaaagctagc agaaggcaagaaataaccaaaatcagagcagaactgaaggaaatagagacacaaaaaacc cttcaaaaaatcaatgaatccaggagctttttttttgaaaagataaacaaaattgataga ctgctagcaagactaataaagaagaaaagagaagaatcaaatagatgcaataaaaatgat aaaggggatatcaccaccaatcccacagaaatacaaactaccatcagagaatactataaa cacctctacacaaataaactagaaaatctagaagaaatggataaattcctcaacacatac accctcccaagactaaaccgggaagaagttgaatctctgaatagaccgataacaggctct gaaattgaggcaataattaatagcttaccaaccaaaaaaagtccaggaccagatggattc acagccgaattctaccagagagctatttatgacaaaaccacagccaatatcgcactgaat gggcaaaaactggaagcattccctttgaaaactggcacaagacagggatgccctctctca ccactcctattcaacatattgttggaagttctggccagggcaatgaggcaggagaaggaa ataaagggtattcaattaggaaaagaggaagtcaaattgtccctgttggcagaagacatg attgtatatctagaaaaccccattgtctcagcccaaaatctccttaaactgatgagcaac ttcagcaaagtctcaggatacaaaatcaatgcacaaaaatcacaagcattcttatacacc aataacagacaaacagagagccaaatcatgagtgaactcccattcacaattgcttcaaag agaataaaatacctaggaatccaacttacaagggatgtgaagtacctcctcaaggagaac tacaaaccactgctcaatgaaataaaagaggatacaaagaaatggaagaacattccatgc tcatgggtaggaagaatcaatatcatgaaaatggccatactgcccaaggtaatttataga ttaaatgccatccccatcaagcaaccaatgactttcttcacagaactggaaaaaactact ttaaagttcatatggaaccaaaaaagagcccgcatcgccaagtcaatcctaagccaaaag 
aacaaagccggaggcatcacgctacctgacttcaaactatactacaaggctacagtaacc aaaacagcatggtactggtaccaaaacagagatatagatcaatggaacagaacagagccc tcagaaataatgccgcatatctacaaccatctgatctttgacaaacctcacaaaaacaag aaatag >gi568815597f:147078118_147279583|GENSCAN_predicted_peptide_11|630_aa MTQAAAAVPERVGLLPAFSPKEHRDAQVCSCSWAATAVPGEQEAPALPTQKEAGPPPVPG SPSSMELAALNKPPLTVIFLYLPKSYKTTPPLSPFADSFRTQPTCTQTPSASEARLVRTG QPFHAPHCKNCLSGRVTAPAPLRISWQDAPSGGASLVGIHLRSYQLEGVNWLAQRFHCQN GCILGDEMGLGKTCQTIALFIYLAGRLNDEGPFLILCPLSVLSNWKEEMQRFPWSVLVVD EAHRLKNQSSLLHKTLSEDYPSIPVILSWDDFSKQVGGMPTTSELHKLLQPFLLRRVKAE VATELPKKTEVVIYHGMSALQKKYYKAILMKDLDAFENETAKKVKLQNILSQLRKCVDHP YLFDGVEPEPFEVGDHLTEASGKLHLLDKLLAFLYSGGHRVLLFSQMTQMLDILQDYMDY RDYLKRVKISVFLPLCKGYSYERVDGSVRGEERHLAIKNFGQQPIFVFLLSTRAGGVGMN LTAADTVIFVDSDFNPQNDLQAAARAHRIGQNKSVKVIRLIGRDTVEEIVYRKAASKLQL TNMIIEGGHFTLGAQKPAADADLQWRERQGSNEGYLVMWAAGHITVKDVDIPPPHGATNE WEHGLLAESDSVGLLAQLMVIGKLLSDSNV >gi568815597f:147078118_147279583|GENSCAN_predicted_CDS_11|1893_bp atgactcaggcagctgcagctgtgcctgagagggtagggctcctgcctgcttttagcccc aaagagcacagggatgcccaggtctgcagctgcagctgggcagccacagctgtacctggg gagcaggaagctcctgccctgccaactcagaaggaggcagggcctccacctgttcctggt tcccccagctccatggagctggcagccctgaacaaaccccctttgactgtaattttcctt tacctacccaaatcttataaaacgaccccacccctatctcccttcgctgactcttttcgg actcagcccacctgcacccagacgcctagcgcttcagaagcacgactggtgagaactggg cagcccttccacgctccgcactgcaagaactgcctgtccggaagggtgactgcaccagct ccgctccggatatcttggcaggacgcgccctctggcggcgcctcgctcgtagggattcac ctacgctcttaccagctggagggagtaaactggctcgcccagcgcttccattgtcagaat ggctgtatcctgggagatgagatgggcctggggaagacctgccagactattgctctcttc atttatttggcaggaagattaaatgatgaagggccatttctgattctttgtcccttgtct gttttgagcaactggaaagaagaaatgcagagattcccttggagtgttcttgttgtggat gaagctcacaggttgaaaaaccaaagctccctgctgcataagaccttgtcagaggattat ccatctatccctgtgatattatcctgggatgatttttcaaaacaggtaggaggtatgcca actacaagtgaactgcacaaactcttgcagccatttctgctgaggcgagtgaaagctgag gtagctacagagcttcccaagaagacagaagtagtgatataccatggcatgtcagcattg 
cagaagaaatactacaaggccattttgatgaaagacctagatgcatttgaaaatgagacg gcaaagaaagttaaactacagaacattttgtcccagcttcgaaagtgtgtggatcaccca tatttgtttgatggtgtggagccggagccttttgaagttggagaccacctgactgaggct agtgggaagcttcacctgctggataagctactagcattcctgtattctgggggccatcgg gttttacttttctcccaaatgacccagatgttggatattctccaagactatatggattac agagactacttgaaaagggttaagatttctgtttttcttcctctctgtaaaggctacagc tatgagcgtgtggatggttctgtgagaggagaagagagacacttggccattaagaacttt ggacagcagcccattttcgtttttctcctgagtactagggcaggtggagttggcatgaac ttaacagcagcagatactgtgatttttgttgacagtgactttaatcctcagaatgacttg caagcagctgccagggctcatcgcattggccaaaacaagtctgttaaagttattcggctg attggtcgagacactgtggaagaaatagtctataggaaagcagcctccaaactgcagctc accaacatgatcatagaaggaggccattttactctgggagcccagaaacccgctgccgat gctgacctccagtggagggagcgccaggggtccaatgaggggtatcttgtgatgtgggct gctggacacatcactgtgaaagatgtagacatacctccaccccatggggccacaaacgag tgggaacatggactgctggctgagagcgactctgttggacttcttgcccagctcatggtt atcggaaagttactttctgactctaatgtctga