GENSCAN 1.0 Date run: 6-Nov-116 Time: 13:18:58 Sequence gi568815579f:47240800_47441810 : 201011 bp : 52.44% C+G : Isochore 3 (51 - 57 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 3267 3406 140 2 2 114 2 72 0.010 0.97 1.02 Intr + 15724 15810 87 1 0 80 60 80 0.002 3.98 1.03 Intr + 16981 17110 130 1 1 -89 90 139 0.001 -2.40 1.04 Intr + 17760 17864 105 2 0 57 49 231 0.959 17.01 1.05 Intr + 19522 19623 102 0 0 91 97 157 0.999 17.77 1.06 Intr + 19789 20040 252 0 0 113 97 61 0.993 7.66 1.07 Intr + 23804 23887 84 1 0 99 60 195 0.999 18.31 1.08 Intr + 23974 24147 174 0 0 115 94 336 0.993 37.55 1.09 Intr + 25812 25993 182 2 2 47 76 329 0.995 26.78 1.10 Intr + 29608 29654 47 1 2 117 101 22 0.591 4.94 1.11 Intr + 29754 29889 136 1 1 88 75 81 0.998 6.93 1.12 Intr + 30283 30388 106 1 1 115 94 58 0.996 9.82 1.13 Intr + 30475 30850 376 0 1 74 76 399 0.997 32.45 1.14 Intr + 31270 31362 93 2 0 67 75 50 0.800 2.03 1.15 Term + 32558 32661 104 0 2 129 47 115 0.921 10.24 1.16 PlyA + 32988 32993 6 -6.37 2.00 Prom + 33007 33046 40 -6.30 2.01 Sngl + 34121 34549 429 1 0 74 35 246 0.396 13.26 2.02 PlyA + 34874 34879 6 1.05 3.03 PlyA - 36089 36084 6 1.05 3.02 Term - 39697 39565 133 0 1 112 41 82 0.658 3.77 3.01 Init - 40697 40684 14 2 2 86 55 -2 0.480 -4.07 3.00 Prom - 41542 41503 40 0.99 4.00 Prom + 43143 43182 40 -0.61 4.01 Init + 48965 49018 54 1 0 51 63 51 0.306 0.13 4.02 Term + 51179 51193 15 1 0 140 35 13 0.382 -0.28 4.03 PlyA + 54629 54634 6 1.05 5.02 PlyA - 56311 56306 6 1.05 5.01 Sngl - 59589 59260 330 0 0 48 47 140 0.830 2.24 5.00 Prom - 67057 67018 40 0.69 6.00 Prom + 68299 68338 40 -4.11 6.01 Init + 69097 69099 3 0 0 80 101 0 0.408 0.43 6.02 Intr + 69271 69399 129 0 0 105 66 17 0.426 2.60 6.03 Term + 78982 80031 1050 0 0 118 48 1560 0.990 147.65 6.04 PlyA + 81248 81253 6 1.05 7.00 Prom + 97480 97519 40 -3.61 7.01 Sngl + 100001 101014 1014 1 0 56 44 1062 0.999 96.10 7.02 PlyA + 101561 101566 6 1.05 8.00 Prom + 103378 103417 40 -6.01 8.01 Init + 112232 112936 705 1 0 69 109 842 0.928 79.44 8.02 Intr + 114240 114551 312 2 0 133 113 427 0.796 46.63 8.03 Intr + 117067 117321 255 0 0 77 64 407 0.946 35.37 8.04 Intr + 119169 119271 103 2 1 129 107 150 0.999 21.25 8.05 Intr + 121677 121894 218 1 2 86 94 293 0.999 28.45 8.06 Intr + 126182 126356 175 1 1 76 97 238 0.652 23.53 8.07 Intr + 131931 132124 194 1 2 89 94 336 0.846 34.03 8.08 Intr + 132800 132901 102 1 0 111 100 154 0.999 19.77 8.09 Intr + 134667 134948 282 2 0 110 86 392 0.723 39.36 8.10 Intr + 135125 135298 174 1 0 140 75 342 0.999 38.75 8.11 Intr + 135644 135761 118 1 1 118 77 255 0.996 27.94 8.12 Intr + 136301 136407 107 0 2 87 80 223 0.994 21.83 8.13 Intr + 138911 139186 276 1 0 119 75 447 0.828 44.75 8.14 Intr + 140017 140193 177 0 0 116 115 133 0.964 19.43 8.15 Intr + 140387 140525 139 1 1 39 115 194 0.989 17.74 8.16 Intr + 141181 141308 128 2 2 112 4 142 0.113 9.10 8.17 Intr + 145153 145213 61 0 1 117 77 72 0.137 7.80 8.18 Intr + 154076 154350 275 0 2 17 38 166 0.018 2.30 8.19 Intr + 154710 154796 87 2 0 43 115 64 0.944 5.26 8.20 Term + 155535 155729 195 2 0 46 33 118 0.641 -0.17 8.21 PlyA + 157895 157900 6 1.05 9.15 PlyA - 157910 157905 6 1.05 9.14 Term - 158435 158413 23 0 2 125 32 5 0.140 -2.64 9.13 Intr - 166172 166089 84 0 0 98 102 92 0.984 11.89 9.12 Intr - 166338 166280 59 2 2 80 92 88 0.999 7.42 9.11 Intr - 166629 166553 77 0 2 34 105 154 0.900 10.51 9.10 Intr - 168448 168300 149 2 2 63 89 176 0.855 15.66 9.09 Intr - 168748 168637 112 1 1 106 100 19 0.985 5.36 9.08 Intr - 173781 173596 186 0 0 14 45 152 0.620 3.90 9.07 Intr - 174067 173918 150 1 0 102 80 309 0.977 32.37 9.06 Intr - 174302 174252 51 2 0 106 105 105 0.999 13.59 9.05 Intr - 175903 175853 51 1 0 123 85 46 0.994 7.39 9.04 Intr - 176164 176005 160 0 1 137 80 172 0.922 21.80 9.03 Intr - 176551 176379 173 1 2 88 92 270 0.999 26.66 9.02 Intr - 178800 178633 168 0 0 78 82 17 0.512 0.76 9.01 Init - 182291 182289 3 2 0 76 81 0 0.302 -1.97 9.00 Prom - 182500 182461 40 -1.51 10.07 PlyA - 183503 183498 6 1.05 10.06 Term - 189666 189290 377 1 2 110 49 917 0.999 85.26 10.05 Intr - 191646 191368 279 1 0 55 52 550 0.410 46.09 10.04 Intr - 196762 196663 100 1 1 113 111 58 0.892 10.78 10.03 Intr - 197174 197050 125 0 2 67 55 179 0.689 13.31 10.02 Intr - 200387 200370 18 0 0 116 113 -13 0.649 1.06 10.01 Intr - 200641 200538 104 0 2 83 94 120 0.982 12.42 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815579f:47240800_47441810|GENSCAN_predicted_peptide_1|705_aa PGSEFTFHPSKTASGRFATSSSPLPSGPGFWQQAPRRADFRKDFPPWQEVTRLARELRSQ RPTEPESRTPRGLRDRAWAANKSGVRTFFERGIGNVSEPYLVELEYSPGYERHQDEHREA ATLDLKSKEEKDAELDKRIEALRRKNEALIRRYQEIEEDRKKAELEGVAVTAPRKGRSVE KENVAVESEKNLGPSRRSPGTPRPPGASKGGRTPPQQGGRAGMGRASRSWEGSPGEQPRG GGAGGRGRRGRGRGSPHLSGAGDTSISDRKSKEWEERRRQNIEKMNEEMEKIAEYERNQR EGVLEPNPVRNFLDDPRRRSGPLEESERDRREESRRHGRNWGGPDFERVRCGLEHERQGR RAGLGSAGDMTLSMTGRERSEYLRWKQEREKIDQERLQRHRKPTGQWRREWDAEKTDGMF KDGPVPAHEPSHRYDDQAWARPPKPPTFGEFLSQHKAEASSRRRRKSSRPQAKAAPRAYS DHDDRWETKEGAASPAPETPQPTSPETSPKETPMQPPEIPAPAHRPPEDEGEENEGEEDE EWEDISEDEEEEEIEVEEGDEEEPAQDHQAPEAAPTGIPCSEQAHGVPFSPEEPLLEPQA PGTPSSPFSPPSGHQPVSDWGEEVELNSPRTTHLAGALSPGGGQSAPAFPESGPSLRGTQ EAEEEGSEATPEAGPEGQETAEITDFQRVRFCKVVAAPPLPGAAR >gi568815579f:47240800_47441810|GENSCAN_predicted_CDS_1|2118_bp ccaggctcagaattcacttttcatccttctaagacagccagcggacggttcgctacttcg tcctcgcccctcccctccggcccaggcttctggcagcaggccccacgcagggcagacttc aggaaggacttcccgccctggcaggaagtgacgcgcctggcccgggagctgcggtcgcag aggccgacggagccggagtcgcggacgccgcggggtctccgggacagagcttgggcagcc aataagagtggggttcggactttctttgagcgtgggattgggaatgtttctgaaccttac ctggtggagctagaatattcgcctggatatgagaggcaccaggatgagcacagggaggca gccacactcgatttgaaatcaaaggaggagaaggatgctgagttggacaagaggatcgag gctcttcggcggaagaatgaggccctcatccggcgctaccaggagattgaggaagaccgt aagaaagctgaacttgagggagtcgcagtcacagctccccgaaagggccgctcagtggag aaggagaacgtggcagtggagtcggagaagaacctgggtccttcccggaggtctcctggg acccctcggcccccaggggccagcaaggggggccggactcctccacagcagggaggccgg gccggcatgggccgagcatcgcgcagctgggagggcagccccggggagcagcctcgagga ggaggagctgggggccgtggccggaggggccggggccgaggttcacctcacctctctgga gctggagacacctcaatctctgaccgtaaatccaaggagtgggaggagcggcgcaggcag aacattgagaagatgaatgaggagatggagaagatcgccgagtatgagcgcaaccagcgg gaaggggttcttgaacccaacccagtgcggaacttcctggacgacccccggcgacgcagc gggcccctggaggagtctgagcgggaccgccgggaggagagccgccggcacggccgcaac tgggggggccccgacttcgagcgggtgcgctgtggccttgagcacgagcggcagggccgc cgagctggcctgggcagtgctggagacatgacgttgtccatgacgggccgggagcggtcg gagtacctgcgctggaagcaggagagggagaagatcgaccaggagcggctgcagaggcac cgcaagcccactggccagtggaggcgcgagtgggatgccgagaagaccgatgggatgttc aaggatggcccagtccctgcccatgaaccatcccaccgctatgatgaccaggcctgggcc cggcccccgaagccccctacttttggggagttcctgtcccagcacaaagctgaggccagc agccgcagaaggagaaagagcagtcggccccaggccaaggcagcgcccagggcctacagt gaccatgatgaccgctgggagacaaaagaaggggcagcatccccagcccctgagactcca cagcctacttcccccgagacttcccccaaggagacacccatgcagccacccgagatccca gctcctgcccaccggcctcctgaagacgagggggaagagaatgagggggaagaggatgaa gaatgggaggacataagtgaggatgaggaagaggaggagatcgaggtggaagaaggtgat gaggaggaaccagcccaagaccaccaagccccagaggctgcccccaccgggatcccctgc agtgagcaggcccacggagtccccttcagtccggaggagcccctgctggagccccaggcc cctggcacgccttccagccctttctcaccacccagcggccaccagcctgtgtccgattgg ggtgaagaggtggagctgaattctccccggaccactcacctggctggcgccctctccccg ggaggtggccagtcagcccctgccttcccggagagtgggcccagcctccgaggaacccag gaagctgaagaggaagggtctgaggcaactccagaggcaggccccgaaggccaggagacg gcggagatcaccgacttccagagggtgcgtttctgcaaggtggtggcggcccctccgctg ccgggggccgctcgctga >gi568815579f:47240800_47441810|GENSCAN_predicted_peptide_2|142_aa MRGTSCVGGGAESPGGAGLSEGPRGRWLRLAPVCAYFLCVSLAAVLLAVYYGLIWVPTRS PAAPAGPQPSAPSPPCAARPGVPPVPAPAAASLSCLLGVPGGPRPQLQLPLSRRRRYSDP DRRPSRQTPRETPEAAEGRRPG >gi568815579f:47240800_47441810|GENSCAN_predicted_CDS_2|429_bp atgcgggggaccagctgcgtgggcggcggcgccgagagccccggaggcgcggggctgagc gagggcccgcgggggcgctggctgcgcttggctccggtatgcgcctacttcctctgcgtc tcgctagctgccgtgctgctcgccgtgtactacggtctcatctgggtacccacgcggtct cccgcggcacccgccggcccacagcccagcgcgccgtcccctccgtgtgctgcccgcccg ggcgtgccgcctgtcccggcgcccgccgctgcctccctctcctgcctcctgggagtcccc ggcgggccgcgaccccagctccagctgccgctgagccgccgccgccgctacagcgaccct gaccgccgtccgagccgccagacacccagagagacgccagaggccgcggaggggcgaaga cccgggtaa >gi568815579f:47240800_47441810|GENSCAN_predicted_peptide_3|48_aa MVFNGQHSSTATMVSHLNQGGGVLIGLLASTIAPCSPFSTSQSEGLEI >gi568815579f:47240800_47441810|GENSCAN_predicted_CDS_3|147_bp atggtgtttaatggccaacactctagcacagccaccatggtctctcacctgaaccagggc ggtggtgtcctcattgggctcctggcttccaccatcgccccctgcagtccattctccaca tcgcagtcagaaggtcttgagatctga >gi568815579f:47240800_47441810|GENSCAN_predicted_peptide_4|22_aa MRVVISSHRVIAMESGAKVLLE >gi568815579f:47240800_47441810|GENSCAN_predicted_CDS_4|69_bp atgagggtggtaatttccagtcaccgggtcattgccatggaaagtggtgctaaggtgctt ctggaatag >gi568815579f:47240800_47441810|GENSCAN_predicted_peptide_5|109_aa MPTPIRKDGLSRRLQLPHPKEKQLCPATSLPAATGSPEPAKLGWLTTTYHPPLPWHEGSS AVATVAFEAGCGGLISGNACAHAVIRPWPDQLTRNSSTTGGLSKAQRPQ >gi568815579f:47240800_47441810|GENSCAN_predicted_CDS_5|330_bp atgccaacccccatcaggaaggatggactgagcaggcggctgcagctaccacatcccaag gagaagcagctgtgcccagccaccagcttgccagcggccacaggaagcccagaaccagcc aaactgggctggttgaccaccacttaccaccctccactcccttggcacgagggttcctca gctgtggccacagtggcttttgaggcaggatgtggtgggctcatcagcggcaatgcttgt gcccatgcagttatccgaccctggcctgaccaacttaccagaaactctagcaccacgggg ggtctgagtaaggcccagaggccgcaataa >gi568815579f:47240800_47441810|GENSCAN_predicted_peptide_6|393_aa MALTLQGWDVAMGIRLSRLWEQQQPRGAGAREDSPGEGTLELGLDSFNYTTPDYGHYDDK DTLDLNTPVDKTSNTLRVPDILALVIFAVVFLVGVLGNALVVWVTAFEAKRTINAIWFLN LAVADFLSCLALPILFTSIVQHHHWPFGGAACSILPSLILLNMYASILLLATISADRFLL VFKPIWCQNFRGAGLAWIACAVAWGLALLLTIPSFLYRVVREEYFPPKVLCGVDYSHDKR RERAVAIVRLVLGFLWPLLTLTICYTFILLRTWSRRATRSTKTLKVVVAVVASFFIFWLP YQVTGIMMSFLEPSSPTFLLLKKLDSLCVSFAYINCCINPIIYVVAGQGFQGRLRKSLPS LLRNVLTEESVVRESKSFTRSTVDTMAQKTQAV >gi568815579f:47240800_47441810|GENSCAN_predicted_CDS_6|1182_bp atggctctcacactccagggctgggatgtggccatgggaataagattgtcaagattgtgg gagcaacagcaacctcgtggggctggggcccgagaagattccccaggggaggggaccctt gagttgggtctagactccttcaattataccacccctgattatgggcactatgatgacaag gataccctggacctcaacacccctgtggataaaacttctaacacgctgcgtgttccagac atcctggccttggtcatctttgcagtcgtcttcctggtgggagtgctgggcaatgccctg gtggtctgggtgacggcattcgaggccaagcggaccatcaatgccatctggttcctcaac ttggcggtagccgacttcctctcctgcctggcgctgcccatcttgttcacgtccattgta cagcatcaccactggccctttggcggggccgcctgcagcatcctgccctccctcatcctg ctcaacatgtacgccagcatcctgctcctggccaccatcagcgccgaccgctttctgctg gtgtttaaacccatctggtgccagaacttccgaggggctggcttggcctggatcgcctgt gccgtggcttggggtttagccctgctgctgaccataccctccttcctgtaccgggtggtc cgggaggagtactttccaccaaaggtgttgtgtggcgtggactacagccacgacaaacgg cgggagcgagccgtggccatcgtccggctggtcctgggcttcctgtggcctctactcacg ctcacgatttgttacactttcatcctgctccggacgtggagccgcagggccacgcggtcc accaagacactcaaggtggtggtggcagtggtggccagtttctttatcttctggttgccc taccaggtgacggggataatgatgtccttcctggagccatcgtcacccaccttcctgctg ctgaagaagctggactccctgtgtgtctcctttgcctacatcaactgctgcatcaacccc atcatctacgtggtggccggccagggcttccagggccgactgcggaaatccctccccagc ctcctccggaacgtgttgactgaagagtccgtggttagggagagcaagtcattcacgcgc tccacagtggacactatggcccagaagacccaggcagtgtag >gi568815579f:47240800_47441810|GENSCAN_predicted_peptide_7|337_aa MGNDSVSYEYGDYSDLSDRPVDCLDGACLAIDPLRVAPLPLYAAIFLVGVPGNAMVAWVA GKVARRRVGATWLLHLAVADLLCCLSLPILAVPIARGGHWPYGAVGCRALPSIILLTMYA SVLLLAALSADLCFLALGPAWWSTVQRACGVQVACGAAWTLALLLTVPSAIYRRLHQEHF PARLQCVVDYGGSSSTENAVTAIRFLFGFLGPLVAVASCHSALLCWAARRCRPLGTAIVV GFFVCWAPYHLLGLVLTVAAPNSALLARALRAEPLIVGLALAHSCLNPMLFLYFGRAQLR RSLPAACHWALRESQGQDESVDSKKSTSHDLVSEMEV >gi568815579f:47240800_47441810|GENSCAN_predicted_CDS_7|1014_bp atggggaacgattctgtcagctacgagtatggggattacagcgacctctcggaccgccct gtggactgcctggatggcgcctgcctggccatcgacccgctgcgcgtggccccgctccca ctgtatgccgccatcttcctggtgggggtgccgggcaatgccatggtggcctgggtggct gggaaggtggcccgccggagggtgggtgccacctggttgctccacctggccgtggcggat ttgctgtgctgtttgtctctgcccatcctggcagtgcccattgcccgtggaggccactgg ccgtatggtgcagtgggctgtcgggcgctgccctccatcatcctgctgaccatgtatgcc agcgtcctgctcctggcagctctcagtgccgacctctgcttcctggctctcgggcctgcc tggtggtctacggttcagcgggcgtgcggggtgcaggtggcctgtggggcagcctggaca ctggccttgctgctcaccgtgccctccgccatctaccgccggctgcaccaggagcacttc ccagcccggctgcagtgtgtggtggactacggcggctcctccagcaccgagaatgcggtg actgccatccggtttctttttggcttcctggggcccctggtggccgtggccagctgccac agtgccctcctgtgctgggcagcccgacgctgccggccgctgggcacagccattgtggtg gggttttttgtctgctgggcaccctaccacctgctggggctggtgctcactgtggcggcc ccgaactccgcactcctggccagggccctgcgggctgaacccctcatcgtgggccttgcc ctcgctcacagctgcctcaatcccatgctcttcctgtattttgggagggctcaactccgc cggtcactgccagctgcctgtcactgggccctgagggagtcccagggccaggacgaaagt gtggacagcaagaaatccaccagccatgacctggtctcggagatggaggtgtag >gi568815579f:47240800_47441810|GENSCAN_predicted_peptide_8|1360_aa MPPPRTREGRDRRDHHRAPSEEEALEKWDWNCPETRRLLEDAFFREEDYIRQGSEECQKF WTFFERLQRFQNLKTSRKEEKDPGQPKHSIPALADLPRTYDPRYRINLSVLGPATRGSQG LGRHLPAERVAEFRRALLHYLDFGQKQAFGRLAKLQRERAALPIAQYGNRILQTLKEHQV VVVAGDTGCGKSTQVPQYLLAAGFSHVACTQPRRIACISLAKRVGFESLSQYGSQVGYQI RFESTRSAATKIVFLTVGLLLRQIQREPSLPQYEVLIVDEVHERHLHNDFLLGVLQRLLP TRPDLKVILMSATINISLFSSYFSNAPVVQVPGRLFPITVVYQPQEAEPTTSKSEKLDPR PFLRVLESIDHKYPPEERGDLLVFLSGMAEISAVLEAAQTYASHTQRWVVLPLHSALSVA DQDKVFDVAPPGVRKCILSTNIAETSVTIDGIRFVVDSGKVKEMSYDPQAKLQRLQEFWI SQASAEQRKGRAGRTGPGVCFRLYAESDYDAFAPYPVPEIRRVALDSLVLQMKSMSVGDP RTFPFIEPPPPASLETAILYLRDQGALDSSEALTPIGSLLAQLPVDVVIGKMLILGSMFS LVEPVLTIAAALSVQSPFTRSAQSSPECAAARRPLESDQGDPFTLFNVFNAWVQVKSERS RNSRKWCRRRGIEEHRLYEMANLRRQFKELLEDHGLLAGAQAAQVGDSYSRLQQRRERRA LHQLKRQHEEGAGRRRKVLRLQEEQDGGSSDEDRAGPAPPGASDGVDIQVGAMGCGVWGF TKDVKFKLRHDLAQLQAAASSAQDLSREQLALLKLVLGRGLYPQLAVPDAFNSSRKDSDQ IFHTQAKQGAVLHPTCVFAGSPEVLHAQELEASNCDGSRDDKDKMSSKHQLLSFVSLLET NKPYLVNCVRIPALQSLLLFSRSLDTNGDCSRLVADGWLELQLADSESAIRLLAASLRLR ARWESALDRQLAHQAQQQLEEEEEDTPVSPKEVATLSKELLQFTASKIPYSLRRLTGLEV QNMYVGPQTIPATPHLPGLFGSSTLSPHPTKGGYAVTDFLTYNCLTNDTDLYSDCLRTFW TCPHCGLHAPLTPLERIAHENTCPQAPQDGPPGAEEAALETLQKTSVLQRPYHCEACGKD FLFTPTEVLRHRKQHNKGPIRGDVLGVSVSFWEEEVEGLKKAAYKAVNYDKLKETTQGKE ENPAQFVAHLAATLRRYTALDPEGPEGRLILNMHFITQSTPDIRKKLQKLESGPQTPQQE LSNLAFKRLKTDAARSPRKPPRPSQTPSFMQLSQWKPLFAFTWTDPDTHQAQQTTWAVLP QGFTDSPHYFSQAQISSSSVTYLSIIIIKTQVLSLLIVSN >gi568815579f:47240800_47441810|GENSCAN_predicted_CDS_8|4083_bp atgcctcctcctagaacaagggagggcagggatcgccgagaccaccaccgggctcccagc gaggaagaggccttggagaaatgggactggaattgtccagagacgcgtcgcctcttggaa gatgccttcttccgtgaagaggattacatccgtcagggttctgaggaatgtcagaagttt tggaccttctttgaacgcctgcagagattccagaatctcaagacctccaggaaggaggag aaagaccctggacagcccaagcacagcatcccagcgctggccgacctacctcgcacttac gacccacgttaccgcatcaacctctctgttcttggccctgccacgcggggctctcaggga ctgggcaggcacttgcccgcggagagagtggctgagttccgccgagccctgttgcactac ctggactttggccagaagcaggcatttgggcgtctggccaagctgcagcgtgagcgggca gccctccccatcgcccagtatgggaaccgcatcctgcagacgctgaaggagcaccaggtg gtggtagtggccggtgacaccggctgtggcaagtccactcaggtgccccagtacctgctg gctgctggcttcagtcatgtggcgtgcacccagccccggcggatcgcctgcatctcactg gccaagcgtgtgggctttgagagcctcagtcagtatggctcacaggtcggctaccagatc cgctttgagagcacacgttcggcggccaccaagattgtattcctgacagtggggctgctc ctgcgacaaatccagcgggaacccagcctgccccagtatgaggtcctgattgtggatgaa gtccatgagcggcatctccacaacgatttcctcctgggcgtcctccagcgcctgttgccc acgcggcctgacctcaaggtcatcctcatgtcggccaccatcaacatctcgctcttctcc agctatttcagcaatgcccctgtggtacaggtgcctgggaggctgttccccatcacggtt gtgtaccagccgcaggaggcggagccgaccacgtccaagtcagagaagctggacccgcgg cctttcctgagggtgctggagtccattgaccacaagtacccgcctgaggagcggggtgac ctcctcgtcttcctcagcggcatggcggagatcagcgccgtgctggaggctgcccagacc tatgccagccacacccagcgctgggtggtactgccactgcacagcgccctgtctgtggcc gaccaggacaaggtatttgatgtggcaccccctggagtccggaaatgcatcctctccacc aacattgctgagacctcagtcaccattgacgggatccgcttcgtagtagattccggaaag gtgaaggagatgagctacgatccgcaggccaagctgcaacggctgcaggagttctggatt agtcaggccagcgcagagcagcggaagggccgggcgggccgcacgggccccggagtctgc ttccgcctctatgccgaatcggactatgatgccttcgccccctaccccgtcccagaaatt cggagggtggccctggactcgttggtgctgcagatgaagagcatgagtgtgggggacccc cgaaccttccccttcatcgagcccccaccaccagccagcctggaaaccgccatcctctac ctccgggaccagggggccctggacagctcagaggccctcacacccattgggtccctgcta gcccagctgcctgtggacgttgtgattgggaagatgctgatcctgggctccatgttcagc ctggtggagcctgtgctcaccatcgcagccgcacttagcgtccagtcgcccttcacccgc agcgcccagagcagcccagagtgcgcggcagcacggcggccgctggagagcgaccagggt gaccccttcacgctcttcaacgtcttcaacgcctgggtgcaggtgaaatctgaacggagc agaaactctcgcaagtggtgccgccgccggggcatagaggagcatcgactgtacgaaatg gccaaccttcggcgccagttcaaggagctgttggaggaccacgggctgctggctggggcc caggccgcgcaggtaggggacagctacagtcggttgcagcagcgccgggagcgccgggcc ctgcaccagctgaaacgccagcacgaggagggcgcggggcgcaggcgcaaggtgctgcgg ctgcaggaggagcaggacggcggctccagtgacgaggacagggctggcccagccccccca ggggccagtgatggcgtggacatccaggtgggcgccatgggctgtggggtgtgggggttt accaaggatgtgaagttcaagcttcggcatgacctggcgcagctgcaggccgctgccagc tcagcccaggacctgagccgcgagcagctggctctgctgaagctggtgctgggccggggc ctgtacccacagctggccgtccccgacgccttcaacagcagccgaaaggactcagaccag attttccacacgcaggccaagcagggcgccgtgctgcaccccacctgcgtcttcgctggc agccccgaggtgctgcacgcacaggagctggaggccagcaactgcgacggaagccgagac gacaaggacaagatgagcagcaaacaccagctcctcagcttcgtgtccctgctggagacc aacaagccgtacctggtgaactgcgtccgcatccctgccctccagtccctcctgcttttt agccggtctttggacaccaatggtgactgctcccgcctggtggccgatggctggctggag ctgcagctagcagacagtgaaagtgccatccgactcctggcggcttccctgcggctccgt gcccgctgggaaagtgccctggaccggcagctggcgcaccaggcccagcagcagctggag gaggaggaggaggatacgccagtcagccccaaggaggtggccaccctgagcaaggaactc ctgcaattcacggcatccaagattccttacagcctccggcggctcacagggctagaagtc cagaacatgtatgtgggaccccagaccatcccagccaccccccatcttcctggcctcttt ggcagctccaccctgtccccccaccccacaaaggggggctacgcagtcactgacttcctc acctacaactgcctcacgaatgacacagacctgtacagcgactgtctccgaaccttctgg acctgcccccactgtggcctgcatgcgcccctcacgcccctggagcgcatcgcccatgag aacacctgcccccaggccccacaggatgggcccccaggggctgaggaagctgccctcgaa accctccagaagacatctgtcctgcagaggccctaccactgcgaggcctgcgggaaggac ttcctctttacacccacagaggtgctgcgccaccggaagcagcacaacaaaggccccatc agaggggacgtgctgggtgtcagcgtcagcttctgggaggaggaagttgaagggcttaaa aaggcagcttacaaagctgttaattatgacaaacttaaagaaactacccaaggtaaagag gaaaacccagcccagttcgtggcccacttagcagcaacacttagacgctataccgcccta gacccagaagggccagaaggccgccttattcttaatatgcattttatcactcagtccact cctgacattaggaaaaaacttcaaaaattagaatctggccctcaaaccccacaacaggaa ttaagcaacctcgccttcaagcggctgaagactgatgctgcccgatcgcctcggaagccc cctagaccatcacagacgccaagcttcatgcaactctcacagtggaagcctctcttcgct ttcacttggactgaccctgacacccatcaggctcagcaaactacctgggctgtactgccg caaggcttcacagacagcccccattacttcagtcaagcccaaatttcatcctcatctgtt acctatctcagcataattatcataaaaacacaggtgctctccctgctgatcgtgtccaat taa >gi568815579f:47240800_47441810|GENSCAN_predicted_peptide_9|481_aa MLCRLTLAHPLSRSLVVLFNFHKTRERPGTGSAAASTPEPPPDPPSPRAAPRPLRSPYDE LPHYPGIVDGPAALASFPETVPAVPGPYGPHRPPQPLPPGLDSDGLKREKDEIYGHPLFP LLALVFEKCELATCSPRDGAGAGLGTPPGGDVCSSDSFNEDIAAFAKQVRSERPLFSSNP ELDNLMIQAIQVLRFHLLELEKVHDLCDNFCHRYITCLKGKMPIDLVIEDRDGGCREDFE DYPASCPSLPDQPHTHYTTATQTRMRLGNSAQTHAHQRLQMPPTDSATPTQVHIQPHQLS NSHFCSQPDSSVHRNNMWIRDHEDSGSVHLGTPGPSSGGLASQSGDNSSDQGDGLDTSVA SPSSGGEDEDLDQERRRNKKRGIFPKVATNIMRAWLFQHLSHPYPSEEQKKQLAQDTGLT ILQVNNWFINARRRIVQPMIDQSNRTGQGAAFSPEGQPIGGYTETQPHVAVRPPESGNAS H >gi568815579f:47240800_47441810|GENSCAN_predicted_CDS_9|1446_bp atgctctgcaggctcactctggctcatcccctatcccgctccttggtggttctgtttaat ttccacaagacaagggagagaccgggaacggggagcgcggctgccagcacccctgagccg ccgccggaccctccgtcgccccgggccgccccccgccccctgcggtccccgtatgatgag ctgccgcactacccaggcatcgtggatggccccgcagccctggctagcttcccagagaca gtgcccgcagtaccagggccctatggcccgcaccggcctccccagcccctgcccccaggc ttggacagcgacggcctgaagagggagaaggatgagatctatggacacccgctcttcccc ctcttggccctggtctttgagaaatgtgaactggctacatgctctccccgtgacggggcc ggagctgggctggggacaccccctggaggtgacgtctgctcctctgattccttcaacgag gacatcgctgcctttgccaagcaggttcgctctgagaggcccctcttctcctccaaccca gaactggacaatctgatgatccaggccatccaggtgctgcggttccacctgctggagctg gagaaggtccacgacctgtgcgacaacttctgtcaccgctacatcacctgcctcaaggga aagatgcccatcgacctggtcatcgaggatcgggacggcggctgcagggaggacttcgag gactacccagcctcctgccccagcctcccagaccagccacacacccactacacaacagct acgcagactcggatgcggctgggcaacagtgcacagacacacgcacaccagcggctgcag atgccacccacagattcagctacacccacacaggtccacatacagccacaccaactcagc aacagccatttctgctcacagccagacagtagcgtccaccggaataatatgtggattcga gaccatgaggatagtgggtctgtacatttggggaccccaggtccatccagtgggggcctg gcctcccagagtggggacaactccagtgaccaaggagacgggctggacaccagcgtggcc tctcccagttctggtggagaagatgaggacttggaccaggagcgacggcgaaacaagaag agggggatcttccccaaggtggccaccaacatcatgcgagcctggttgttccagcacctc tcgcacccgtacccctcggaggagcagaagaaacagctggcgcaggacacggggctcacc atcctgcaagtcaacaactggttcattaacgcccggagacgcatcgtgcaacctatgatc gatcaatccaaccgcacagggcagggtgcagccttcagcccagagggccagcccatcggg ggctataccgagacgcagccacacgtggccgtccggcctccggaatctggaaatgcctct cattaa >gi568815579f:47240800_47441810|GENSCAN_predicted_peptide_10|334_aa XKTLQVKIVDDEEYEKKDNFFIELGQPQWLKRGISALLLNQGDGDRKLTAEEEEARRIAE MGKPVLGENCRLEVIIEESYDFKNTVDKLIKKTNLALVIGTHSWREQFLEAITVSAGDEE EEEDGSREERLPSCFDYVMHFLTVFWKVLFACVPPTEYCHGWACFGVSILVIGLLTALIG DLASHFGCTVGLKDSVNAVVFVALGTSIPDTFASKVAALQDQCADASIGNVTGSNAVNVF LGLGVAWSVAAVYWAVQGRPFEVRTGTLAFSVTLFTVFAFVGIAVLLYRRRPHIGGELGG PRGPKLATTALFLGLWLLYILFASLEAYCHIRGF >gi568815579f:47240800_47441810|GENSCAN_predicted_CDS_10|1005_bp nngaaaactcttcaggtgaagatagttgatgacgaggaatatgagaaaaaggataatttc ttcattgagctgggccagccccagtggcttaagcgagggatttcagctctgctactcaat caaggggatggggacaggaagctaacagccgaggaggaggaggctcggaggatagcagag atgggcaagccagttcttggggagaactgccggctggaggtcatcatcgaggagtcatat gattttaagaacacggtggataaactcatcaagaaaacgaacttggccttggtaattggg acccattcatggagggagcagtttttagaggcaattacggtgagcgcaggggacgaggag gaggaggaggacgggtcccgggaggagcggctgccgtcgtgctttgactacgtgatgcac ttcctgacggtgttctggaaggtgctcttcgcctgtgtgccccccaccgagtactgccac ggctgggcctgctttggtgtctccatcctggtcatcggcctgctcaccgccctcattggg gacctcgcctcccacttcggctgcaccgttggcctcaaggactctgtcaatgctgttgtc ttcgttgccctgggcacctccatccctgacacgttcgccagcaaggtggcggcgctgcag gaccagtgcgccgacgcgtccatcggcaacgtgaccggctccaacgcggtgaacgtgttc cttggcctgggcgtcgcctggtctgtggccgccgtgtactgggcggtgcagggccgcccc ttcgaggtgcgcactggcacgctggccttctccgtcacgctcttcaccgtcttcgccttc gtgggcattgccgtgctgctgtaccggcgccggccgcacatcggcggcgagctgggcggc ccgcgcggacccaagctcgccaccaccgcgctcttcctgggcctctggctcctgtacatc ctcttcgccagcctggaggcgtactgccacatccggggcttctag