GENSCAN 1.0 Date run: 3-Nov-116 Time: 00:57:03 Sequence gi568815583f:41317236_41580684 : 263449 bp : 48.47% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 14921 15295 375 2 0 131 66 194 0.058 15.63 1.02 Intr + 24456 24507 52 0 1 41 74 26 0.024 -4.59 1.03 Intr + 25151 25219 69 1 0 65 109 53 0.743 4.48 1.04 Intr + 31863 32006 144 2 0 62 68 103 0.902 6.18 1.05 Intr + 33798 33894 97 2 1 19 97 77 0.319 1.28 1.06 Intr + 40914 41023 110 1 2 34 95 75 0.591 2.80 1.07 Intr + 48167 48354 188 1 2 60 103 21 0.557 -0.71 1.08 Intr + 54292 54449 158 1 2 86 58 106 0.909 7.05 1.09 Intr + 58477 58593 117 2 0 61 95 25 0.467 0.94 1.10 Term + 64910 65031 122 0 2 49 55 55 0.171 -3.06 1.11 PlyA + 66794 66799 6 1.05 2.06 PlyA - 67239 67234 6 1.05 2.05 Term - 70358 70209 150 2 0 45 43 93 0.601 -1.59 2.04 Intr - 71287 71213 75 1 0 72 87 23 0.386 0.31 2.03 Intr - 77809 77624 186 1 0 50 87 137 0.833 9.69 2.02 Intr - 79780 79252 529 0 1 47 73 236 0.686 10.94 2.01 Init - 86767 86694 74 1 2 88 116 45 0.917 7.84 2.00 Prom - 95541 95502 40 -3.46 3.00 Prom + 98196 98235 40 -6.76 3.01 Init + 99881 100078 198 1 0 54 94 392 0.996 33.30 3.02 Intr + 121086 121196 111 2 0 76 90 92 0.913 8.78 3.03 Intr + 135666 135813 148 2 1 101 105 100 0.836 12.81 3.04 Intr + 140437 140641 205 2 1 80 116 208 0.984 21.06 3.05 Intr + 147536 147650 115 2 1 40 91 116 0.914 7.55 3.06 Intr + 148906 149017 112 0 1 108 115 173 0.999 21.95 3.07 Intr + 153022 153157 136 2 1 98 57 261 0.981 23.63 3.08 Intr + 153937 154114 178 1 1 89 11 106 0.752 2.92 3.09 Intr + 157385 157467 83 1 2 94 70 30 0.567 0.24 3.10 Intr + 158290 158377 88 1 1 34 88 65 0.694 1.07 3.11 Intr + 158477 158584 108 1 0 75 94 58 0.915 5.58 3.12 Intr + 159211 159288 78 0 0 85 116 66 0.992 8.85 3.13 Intr + 159930 160051 122 2 2 25 89 188 0.995 11.89 3.14 Intr + 160223 160280 58 2 1 117 83 117 0.999 12.99 3.15 Intr + 161313 161390 78 2 0 76 82 58 0.930 3.75 3.16 Intr + 161868 161972 105 2 0 61 26 93 0.606 0.81 3.17 Intr + 162979 163090 112 0 1 76 91 113 0.942 10.35 3.18 Term + 163346 163452 107 0 2 121 53 70 0.921 5.37 3.19 PlyA + 165035 165040 6 1.05 4.00 Prom + 165076 165115 40 -8.76 4.01 Init + 165210 165217 8 2 2 85 89 4 0.759 0.41 4.02 Intr + 166429 166499 71 1 2 90 58 89 0.832 4.93 4.03 Term + 168943 169160 218 2 2 89 43 133 0.614 6.21 4.04 PlyA + 171918 171923 6 -0.45 5.00 Prom + 172285 172324 40 -7.56 5.01 Init + 176720 177181 462 1 0 92 96 766 0.964 71.50 5.02 Intr + 184228 184616 389 0 2 78 85 655 0.610 57.79 5.03 Intr + 184762 184966 205 1 1 125 100 489 0.999 53.00 5.04 Intr + 185167 185268 102 0 0 86 85 153 0.999 15.17 5.05 Intr + 185553 185624 72 2 0 96 99 121 0.985 13.60 5.06 Term + 185698 185931 234 0 0 4 54 380 0.754 22.52 5.07 PlyA + 186297 186302 6 1.05 6.44 PlyA - 186426 186421 6 1.05 6.43 Term - 187009 186761 249 1 0 108 52 32 0.708 -2.70 6.42 Intr - 187197 187107 91 2 1 87 69 139 0.993 11.90 6.41 Intr - 187405 187271 135 0 0 80 56 136 0.998 9.28 6.40 Intr - 187639 187538 102 0 0 113 19 118 0.985 6.69 6.39 Intr - 187829 187737 93 1 0 73 66 89 0.940 4.28 6.38 Intr - 188070 187973 98 0 2 86 66 166 0.998 13.01 6.37 Intr - 188295 188166 130 2 1 52 85 0 0.405 -3.20 6.36 Intr - 188542 188478 65 1 2 107 55 72 0.362 3.32 6.35 Intr - 188770 188680 91 0 1 112 100 44 0.957 8.00 6.34 Intr - 190055 189860 196 0 1 68 111 83 0.956 7.07 6.33 Intr - 190422 190279 144 1 0 92 55 49 0.695 2.25 6.32 Intr - 190986 190834 153 1 0 51 71 119 0.489 6.54 6.31 Intr - 191894 191796 99 0 0 81 72 92 0.917 6.98 6.30 Intr - 194111 193929 183 0 0 126 61 112 0.997 12.06 6.29 Intr - 194382 194187 196 0 1 33 94 185 0.551 12.59 6.28 Intr - 194728 194582 147 1 0 115 87 52 0.911 8.23 6.27 Intr - 195030 194880 151 2 1 91 38 155 0.696 10.96 6.26 Intr - 195643 195472 172 2 1 96 84 127 0.994 12.10 6.25 Intr - 195885 195742 144 1 0 63 88 72 0.938 4.95 6.24 Intr - 196525 196432 94 1 1 -25 114 111 0.178 1.94 6.23 Intr - 198398 198315 84 2 0 18 90 77 0.076 0.92 6.22 Intr - 200456 200322 135 2 0 118 21 89 0.173 5.96 6.21 Intr - 200622 200563 60 0 0 142 75 64 0.998 9.43 6.20 Intr - 200947 200771 177 1 0 90 89 177 0.992 18.12 6.19 Intr - 203912 203156 757 1 1 82 80 574 0.980 47.47 6.18 Intr - 204645 204503 143 0 2 105 105 156 0.968 18.15 6.17 Intr - 205015 204863 153 1 0 52 78 93 0.843 4.97 6.16 Intr - 206737 206536 202 0 1 47 89 19 0.497 -2.81 6.15 Intr - 207019 206861 159 0 0 89 68 117 0.817 8.90 6.14 Intr - 207913 207756 158 1 2 84 94 164 0.999 15.41 6.13 Intr - 209833 209663 171 1 0 139 93 -15 0.948 4.24 6.12 Intr - 210066 209932 135 0 0 63 76 221 0.998 19.16 6.11 Intr - 210370 210188 183 1 0 102 98 239 0.999 26.28 6.10 Intr - 210792 210625 168 0 0 77 66 156 0.974 12.44 6.09 Intr - 211101 211000 102 0 0 93 96 26 0.939 4.27 6.08 Intr - 212333 212235 99 2 0 87 78 85 0.994 7.71 6.07 Intr - 212744 212629 116 0 2 85 78 158 0.997 14.67 6.06 Intr - 213967 213788 180 2 0 114 41 77 0.700 5.44 6.05 Intr - 217700 217479 222 0 0 76 52 197 0.903 13.00 6.04 Intr - 218397 218277 121 0 1 117 84 119 0.999 14.47 6.03 Intr - 218983 218894 90 1 0 83 96 50 0.944 5.49 6.02 Intr - 219414 219266 149 1 2 103 30 -9 0.860 -5.25 6.01 Init - 219890 219710 181 2 1 76 79 170 0.871 14.25 6.00 Prom - 221442 221403 40 -3.76 7.00 Prom + 236249 236288 40 -6.46 7.01 Init + 242041 242146 106 0 1 83 91 281 0.827 26.28 7.02 Intr + 243892 244075 184 2 1 112 61 316 0.841 30.15 7.03 Intr + 244304 244404 101 2 2 66 100 83 0.587 7.05 7.04 Intr + 245313 245483 171 1 0 132 99 31 0.983 8.51 7.05 Intr + 247791 247906 116 1 2 72 96 126 0.894 11.97 7.06 Intr + 250125 250302 178 2 1 101 82 165 0.998 16.69 7.07 Intr + 250982 251127 146 0 2 69 110 121 0.984 12.40 7.08 Intr + 251643 251787 145 2 1 70 94 78 0.936 6.46 7.09 Intr + 252792 252921 130 1 1 113 82 177 0.998 19.35 7.10 Intr + 253005 253105 101 0 2 89 84 197 0.998 19.15 7.11 Intr + 253369 253464 96 2 0 76 95 170 0.998 16.48 7.12 Intr + 253803 253883 81 1 0 52 113 122 0.996 10.71 7.13 Intr + 254360 254452 93 0 0 87 85 99 0.999 9.44 7.14 Intr + 255208 255329 122 2 2 113 59 97 0.996 9.51 7.15 Intr + 255767 255900 134 1 2 92 39 217 0.888 16.64 7.16 Intr + 256073 256232 160 2 1 97 77 214 0.992 21.19 7.17 Intr + 256444 256580 137 0 2 65 67 226 0.970 17.57 7.18 Term + 260651 261041 391 2 1 133 41 161 0.880 10.36 7.19 PlyA + 262078 262083 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 200456 200307 150 2 0 118 45 105 0.821 7.11 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583f:41317236_41580684|GENSCAN_predicted_peptide_1|477_aa XWFSATRWRKREHPLGLASAFPERFTALTGEDHGPEGPRQVPGEVHRVGEHCVTLCALEH SAPLRLQPRRQLRAGGWLLSPQPCGAERRGPLHHLRIPLHGGRKRSLVNRPLSATTKVPP GRRCTTPQHFWLIPIKCILHIVQATKLLKALKGYIKHEARKGNENQDESQTSASSCDETE IQISNQEEAERQPLGHVTKTRRRCKTVRVDPDSQNHEKQESQDLRATAKVPSPPDEHQEA ENAVSSDFKKLHEAHFKEMESIDQYIERKKKHFEEHNSMNELKQQPINKGGVRTPVPPRG RLSVASTPISQRRSQGRSCGPASQSTLGLKGSLKRSAISAAKTGVRFSAATKDNEHKRSL TKTPARKSAHVTVSGGTPKGEAVLGTHKLKTITGNSAAVITPFKLTTEATQTPVSNKKPV FDLKASLSRPLNYEPHKGSEERLFPAAITSRSGERLCPAAHRLRCGERLCPAAPSGM >gi568815583f:41317236_41580684|GENSCAN_predicted_CDS_1|1434_bp nngtggttctccgccacccggtggagaaagcgggaacaccctctcgggctagcctctgcc tttcccgaacgcttcactgcactcactggagaagaccacggccccgagggaccgcgacag gtcccaggcgaggtgcaccgagtcggcgagcactgcgtgacactgtgcgcactggaacac agcgcacctctcaggctgcagccaagacggcagctgcgggccggcggctggctcctcagc ccccagccctgcggggccgagcggcgaggaccccttcaccacctgcgtatcccactccat ggaggtcgtaaaagaagcttggtcaatcgccctctcagtgccaccacaaaagtccccccg gggcggcgttgcacaacgccacagcatttctggctcatccccatcaaatgcatcttgcac attgtgcaggcaaccaagttgttaaaagccttgaaaggctacattaaacatgaggcaaga aaaggaaatgagaatcaggatgaaagtcaaacttctgcatcctcttgtgatgagactgag atacagatcagcaaccaggaagaagctgagagacagccacttggccatgtcaccaaaaca aggagaaggtgcaagactgtccgtgtggaccctgactcacagaatcatgaaaagcaggaa agccaggatctcagagctactgcaaaagttccttctccaccagacgagcaccaagaagct gagaatgctgtttcctcagactttaagaagcttcatgaagctcattttaaggaaatggag tccattgatcaatatattgagagaaaaaagaaacattttgaagaacacaattccatgaat gaactgaagcagcagcccatcaataagggaggggtcaggactccagtacctccaagagga agactctctgtggcttctactcccatcagccaacgacgctcgcaaggccggtcttgtggc cctgcaagtcagagtaccttgggtctgaaggggtcactcaagcgctctgctatctctgca gctaaaacgggtgtcaggttttcagctgctactaaagataatgagcataagcgttcactg accaagactccagccagaaagtctgcacatgtgaccgtgtctgggggcaccccaaaaggc gaggctgtgcttgggacacacaaattaaagaccatcacggggaattctgctgctgttatt accccattcaagttgacaactgaggcaacgcagactccagtctccaataagaaaccagtg tttgatcttaaagcaagtttgtctcgtcccctcaactatgaaccacacaaaggaagtgag gagcgcctcttcccagccgccatcacatctaggagtggggagcgtctctgcccggccgcc catcgtctgagatgtggggagcgcctctgccccgccgccccatctgggatgtga >gi568815583f:41317236_41580684|GENSCAN_predicted_peptide_2|337_aa MGHGETARIKGCGKRIEESVMGAERKFSKPTSALYPFLGIRFAEYSSSLQKPVASPGKAS SQRKTEGDLQGDHQKEVALDITSSEEKPDVSFDKAIRDEAIYHFRLLKDEIVDHWRGPEG HPLHEVLLEQAKVVWQFRGKEDLDKWTVTSDKTIGGRSEVFLKMGKNNQSALLYGTLSSE APQDGESTRSGYCAMISRIPRGAFERKMSYDWSQFNTLYLRVRGDGRPWMVNIKEDTDFF QRTNQMYSYFMFTRGGPYWQEVKIPFSKFFFSNRGRIRDVQHELPLDKISSIGFTLADKV DGPFFLEIDFIGVFTDPAHTEEFAYENSPELNPRLFK >gi568815583f:41317236_41580684|GENSCAN_predicted_CDS_2|1014_bp atgggccatggagagacagcccgcattaaaggttgtgggaagagaatagaagagtcagtg atgggggcagaaagaaaattctctaagccaacttctgccttgtatccatttttgggtatt cgctttgcagagtattccagtagtcttcagaaaccagtggcttctcctggcaaagcctcc tcacagaggaagactgaaggggatttgcaaggagatcaccagaaagaagttgctttggat ataacttcttctgaggagaagcctgatgttagtttcgataaagcaattagagatgaagca atataccattttaggcttttgaaggatgaaattgtggatcattggagaggaccggaaggc caccctctgcatgaggtcttgctggaacaagccaaggttgtctggcaattccgggggaaa gaagatttggataagtggacagtgacttctgataagacgattggaggcagaagtgaagtg tttttgaaaatgggcaagaataaccaaagtgcactgctatatggaactctgagctctgag gcgcctcaggacggggagtctacccgaagtgggtactgtgcaatgatatccaggattcca aggggtgcttttgagaggaagatgtcttacgattggtcccagttcaatactctgtatctc cgtgtacgtggggatggtcggccttggatggtgaatatcaaggaggacacagatttcttc cagaggacgaatcagatgtatagttacttcatgttcacccgcgggggaccctactggcag gaggtcaagattcctttttccaaatttttcttctctaatcgaggaagaatccgggatgtt cagcatgagcttccgcttgataagatctcttctataggattcaccttggctgataaagtg gatggtccattcttcctggagatagattttattggcgtgtttactgatccagctcataca gaagaatttgcctatgaaaattctccagagcttaacccaaggctttttaaataa >gi568815583f:41317236_41580684|GENSCAN_predicted_peptide_3|713_aa MRGRLCVGRAAAAAAAVAVPLAGGQEGSPGGGRRGSRGTTMVKKRKGRVVIDSDTEDSGS DENLDQELLSLAKRKRSDSEEKEPPVSQPAASSDSETSDSDDEWTFGSNKNKKKGKARKI EKKGTMKKQANKTASSGSSDKDSSAESSAPEEGEVSDSDSNSSSSSSDSDSSSEDEEFHD GYGEDLMGDEEDRARLEQMTEKEREQELFNRIEKREVLKRRFEIKKKLKTAKKKEKKEKK KKQEEEQEKKKLTQIQESQVTSHNKERRSKRDEKLDKKSQAMEELKAEREKRKNRTAELL AKKQPLKTSEVYSDDEEEEEDDKSSEKSDRSSRTSSSDEEEEKEEIPPKSQPVSLPEELN RVRLSRHKLERWCHMPFFAKTVTGCFVRIGIGNHNSKPVYRVAEITGVVETAKVYQLGGT RTNKGLQLRHGNDQRVFRLEFVSNQEFTESEFMKWKEAMFSAGMQLPTLDEINKKELSIK EALNYKFNDQDIEEIVKEKERFRKAPPNYAMKKTQLLKEKAMAEDLGDQDKAKQIQDQLN ELEERAEALDRQRTKNISAISYINQRNREWNIVESEKALVAESHNMKNQQMDPFTRRQCK PTIVSNSRDPAVQAAILAQLNAKYGSGVLPDAPKEMSKASVGQGKDKDLNSKSASDLSED LFKVHDFDVKIDLQVPSSESKALAITSKAPPAKDGAPRRSLNLEDYKKRRGLI >gi568815583f:41317236_41580684|GENSCAN_predicted_CDS_3|2142_bp atgcgcggtcgcctttgtgtgggtcgagcagcggcggcggcggcggcagtggcggtccca ctggcaggcgggcaagaggggagtccgggcggcggccggcgtgggagccgggggaccacc atggtaaagaagcggaaaggccgcgtcgtgatcgactcggacacagaggacagcggcagc gacgagaacctggatcaggagctcttgtccctggcaaagcgaaagcgcagtgactctgag gagaaggagccgcctgtgagtcagcctgcagcctcgtcagactcggagacgtctgacagt gacgatgagtggacatttgggagcaataaaaataagaagaaaggaaaagccagaaaaata gagaagaaaggaaccatgaagaaacaggccaacaaaactgcctcctcaggcagttcagac aaagacagttcagctgagagctcagcccctgaggaaggtgaagtgtcagactctgacagc aacagctcctcttccagttcagattcagactcttcctcagaagatgaagagttccatgat ggctatggagaagacctcatgggagatgaggaagacagggcccgtctggaacagatgaca gagaaagagagagagcaagaactgttcaatcgcatagagaagagggaggtgttgaaaaga agatttgaaatcaagaaaaaactaaaaacagccaaaaagaaagaaaagaaagaaaagaag aaaaagcaagaagaggagcaagaaaagaaaaaactgacacagattcaagaatctcaggta acatcccacaacaaggaacggcgttccaagcgggatgagaaactagacaagaaatctcaa gccatggaggagctaaaagcagagcgagaaaaacgaaagaacagaacagctgagctcctt gccaaaaaacagccattaaaaaccagtgaggtctactctgatgatgaagaggaggaagag gatgacaaatccagtgaaaagtcagaccgctcatcacgaacatcatcgtctgatgaagaa gaggagaaagaagagatccctcccaaatcccaaccagtttccttacctgaagaattgaat cgggttcgattatcacggcataagctagaacgctggtgtcacatgcccttctttgctaaa actgtcacaggatgttttgtgcggattggcatcggaaaccacaacagcaaaccagtttac cgggtcgctgagattacgggtgttgtggaaactgccaaagtttaccaactaggtggcacc agaacaaacaaagggctgcaactacggcatggcaatgaccaacgcgtgttccgtttagag tttgtctcaaaccaagaattcaccgaaagtgagtttatgaagtggaaagaagcgatgttc tctgctggcatgcagttgcccactctagatgaaatcaataaaaaggaattatctattaaa gaagctcttaattataaattcaatgatcaggacattgaagagattgtaaaagagaaagaa aggttcagaaaagctccacccaactacgctatgaagaagactcagctactgaaggaaaag gccatggctgaggacctgggggatcaggacaaggccaaacaaatccaagatcaactgaat gagctggaggaacgggcagaggccctggaccgccagcggaccaagaacatatccgctatc agttacatcaaccagcggaaccgggagtggaacattgtagagtctgagaaggcccttgtg gctgaaagtcacaacatgaaaaaccaacagatggatccctttactcggcggcagtgcaag cctaccatcgtttctaattccagagacccagctgttcaagctgccatcttggcccagctg aatgcaaaatacggttctggagtgttaccagatgctccaaaggaaatgagcaaggcaagt gtgggtcaaggcaaagataaagatttgaattctaagtcagccagtgacctctcagaagat ctgttcaaagtacacgattttgatgtgaagattgacttacaagttcccagctcagagtca aaggctttagccatcacctccaaggctccgccagccaaggatggggctccaaggagatct ctgaacttggaagactacaaaaaacgacgagggcttatttga >gi568815583f:41317236_41580684|GENSCAN_predicted_peptide_4|98_aa MDFPVPTKVKVAIVNSLAAWFPGSLSAMDSPASLSACDAAQPFTWQARKPQVDSISFAGR ALRRSPLGVSTTPRTGLGATLVRANGPRIPGPVRLLRR >gi568815583f:41317236_41580684|GENSCAN_predicted_CDS_4|297_bp atggacttccctgtgcccacaaaagtgaaagtggccatcgtcaactctttagcagcctgg ttccccggaagcctctctgccatggatagccctgcttcgctaagcgcgtgcgatgcagca cagcccttcacctggcaagcccggaagcctcaggttgactccatcagttttgccgggaga gcccttcggcgctccccgcttggtgtctccaccaccccccgcaccggcctgggcgccacc cttgtccgcgccaacggtccccgcatccctggccccgtgcgcctcctgcgccgttag >gi568815583f:41317236_41580684|GENSCAN_predicted_peptide_5|487_aa MARPGGARPCSPGLERAPRRSVGELRLLFEARCAAVAAAAAAGEPRARGAKRRGGQVPNG LPRAPPAPVIPQLTVTAEEPDVPPTSPGPPERERDCLPAAGSSHLQQPRRLSTSSVSSTG SSSLLEDSEDDLLSDSESRSRGNVQLEAGEDVGQKNHWQKIRTMVNLPVISPFKKRYAWV QLAGHTGEQWGGWAGARDGKGLGGADGCRSSGSFKAAGTSGLILKRCSEPERYCLARLMA DALRGCVPAFHGVVERDGESYLQLQDLLDGFDGPCVLDCKMGVRTYLEEELTKARERPKL RKDMYKKMLAVDPEAPTEEEHAQRAVTKPRYMQWREGISSSTTLGFRIEGIKKADGSCST DFKTTRSREQVLRVFEEFVQGDEEVLRRYLNRLQQIRDTLEVSEFFRRHEGRGLTVRGSQ VIGSSLLFVHDHCHRAGVWLIDFGKTTPLPDGQILDHRRPWEEGNREDGYLLGLDNLIGI LASLAER >gi568815583f:41317236_41580684|GENSCAN_predicted_CDS_5|1464_bp atggcgcggccggggggcgcgaggccctgcagcccggggctggagcgggccccgcgccgg agtgtcggggagctgcgcctgctcttcgaggcgcgctgtgcggcggtcgctgcggccgcc gccgcgggggagccccgggcccgcggggccaagcggcgtgggggacaggtccccaacggg cttccgcgggctcccccggccccggtgatccctcagctgaccgtgacagccgaggagccc gacgtgcccccgaccagccctgggccgccggagcgggagagggactgcctcccggcagcg ggctcttcgcacctgcagcagccgcgccgcctttccacctcgtcggtctcctccactggc tcctcgtcgctgctcgaggactcggaggacgacctgctgagcgacagtgagagccggagc cgcggcaacgtgcagctggaagcgggcgaggacgtgggtcagaaaaaccactggcagaag atccggaccatggtcaatctgccggtcataagccctttcaagaagcgctacgcctgggtg cagctggcagggcacactggtgagcagtggggcgggtgggcgggtgcccgcgacgggaag gggctgggcggcgctgacggatgccggtcctcagggagttttaaggcggcgggcaccagc gggctgatcctgaagcgctgctcggagccggagcgctactgcctggcgcggctgatggct gacgcgctgcgcggctgcgtgcctgccttccacggcgtggtggagcgcgacggcgaaagc tacctgcagctgcaggacctgctcgatggcttcgacggaccttgtgtgctcgactgcaaa atgggcgtcaggacttacctagaggaggagctgaccaaggcccgtgagcggcccaagctg cggaaggacatgtacaagaaaatgctggcggtggatcctgaagctcccacggaggaggag cacgcgcagcgcgccgtcaccaagccgcgctacatgcagtggcgggaaggcatcagctcc agcaccaccctcggcttccgcatcgagggcatcaagaaagcggacggctcctgcagcacc gacttcaagactacgcgaagccgagagcaggtgcttcgcgtctttgaagagtttgtgcaa ggagatgaggaagtgctgaggcggtatctgaaccgcctgcagcagatccgggacaccctg gaggtatccgagttcttcaggaggcacgagggccgcggcctgacggtgcggggctcgcag gtgatcggcagctcgctcctctttgtgcacgatcactgccatcgcgccggcgtgtggctc atcgacttcggcaagaccacgcccctccccgatggccagatcctggaccaccggcggccc tgggaggagggcaaccgcgaggacggctatttgctggggctggacaatctcattggcatc ctggccagcctggctgagagatga >gi568815583f:41317236_41580684|GENSCAN_predicted_peptide_6|2225_aa MLSRPKPGESEVDLLHFQSQFLAAGAAPAVQLVKKGNRGGGDANSDRPPLQDHRDVVMLD NLPDLPPALVPSPPKRARPSPGHCLPEDEDPEERLRRHDQHITAVLTKIIERDTSSVAVN LPVPSGVAFPAVFLRSRDTQGKSATSGKRSIFAQEIAARRIAEAKGPSVGEVVPNVGPPE GAVTCETPTPRNQGCQLPGSSHSFQGPNLVTGKGLRDQEAEQEAQTIHEENIARLQAMAP EEILQEQQRLLAQLDPSLVAFLRSHSHTQEQTGETASEEQRPGGPSANVTKEEPLMSAFA SEPRKRDKLEPEAPALALPVTPQKEWLHMDTVELEKLHWTQDLPPVRRQQTQERMQARFS LQGELLAPDVDLPTHLGLHHHGEEAERAGYSLQELFHLTRSQVSQQRALALHVLAQVISR AQAGEFGDRLAGSVLSLLLDAGFLFLLRFSLDDRVDGVIATAIRALRALLVAPGDEELLD STFSWYHGALTFPLMPSQEDKEDEDEDEECPAGKAKRKSPEEESRPPPDLARHDVIKGLL ATSLLPRLRYVLEVTYPGPAVVLDILAVLIRLARHSLESATRVLECPRLIETIVREFLPT SWSPVGAGPTPSLYKVPCATAMKLLRVLASAGRNIAARLLSSFDLRSRLCRIIAEAPQEL ALPPEEAEMLSTEALRLWAVAASYGQGGYLYRELYPVLMRALQVVPRELSTHPPQPLSMQ RIASLLTLLTQLTLAAGSTPAETISDSAEASLSATPSLVTWTQVSGLQPLVEPCLRQTLK LLSRPEMWRAVGPVPVACLLFLGAYYQAWSQQLAAILAAPGLQNYFLQCVAPGAAPHLTP FSAWALRHEYHLQYLALALAQKAAALQPLPATHAALYHGMALALLSRLLPGSEYLTHELL LSCVFRLEFLPERTSGGPEAADFSDQLSLGSSRVPRCGQGTLLAQACQDLPSIRNCYLTH CSPARASLLASQALHRGELQRVPTLLLPMPTEPLLPTDWPFLPLIRLYHRASDTPSGLSP TDTMGTAMRVLQWVLVLESWRPQALWAVPPAARLARLMCVFLVDSELFRESPVQHLVAAL LAQLCQPQVLPNLNLDCRLPGLTSFPDLYANFLDHFEAVSFGDHLFGALVLLPLQRRFSV TLRLALFGEHVGALRALSLPLTQLPVSLECYTVPPEDNLALLQLYFRTLVTGALRPRWCP VLYAVAVAHVNSFIFSQDPQSSDEVKAARRSMLQKTWLLADEGLRQHLLHYKLPNSTLPE GFELYSQLPPLRQHYLQRLTSTVLQNGITGSYHQKLLIQYIWGETQESAFLTSSQGCGRD RKGFCCRVDPTGMGCWGQLLVWFGAAGAILCSSPGSQETFLRSSPLPLASPSPRDPKVSA PPSILEPASPLNSPGTEGSWLFSTCGASGRHGPTQTQCDGAYAGTSVVVTVGAAGQLRGV QLWRVPGPGQYLISAYGAAGGKGAKNHLSRAHGVFVSAIFSLGLGESLYILVGQQGEDAC PGGSPESQLVCLGESRAVEEHAAMDGSEGVPGSRRWAGGGGGGGGATYVFRLEGASWNTP LAPQVRAGELEPLLVAAGGGGRAYLRPRDRGRTQASPEKLENRSEAPGSGGRGGAAGGGG GWTSRAPSPQAGRSLQEGAEGGQGCSEAWATLGWAAAGGFGGGGGACTAGGGGGGYRGGD ASETDNLWADGEDGVSFIHPSSELFLQPLAVTENHGEVEIRRHLNCSHCPLRDCQWQAEL QLAECLCPEGMELAVDNVTCMDLHKPPGPLVLMVAVVATSTLSLLMVCGVLILGTKRLAG TVDSRLLLSMKQKKWQGLQEMRLPSPELELSKLRTSAIRTAPNPYYCQVGLGPAQSWPLP PGVTEVSPANVTLLRALGHGAFGEVYEGLVIGLPGDSSPLQVAIKTLPELCSPQDELDFL MEALIISKFRHQNIVRCVGLSLRATPRLILLELMSGGDMKSFLRHSRPHLGQPSPLVMRD LLQLAQDIAQGCHYLEENHFIHRDIAARNCLLSCAGPSRVAKIGDFGMARDIYRASYYRR GDRALLPVKWMPPEAFLEGIFTSKTDSWSFGVLLWEIFSLGYMPYPGRTNQEVLDFVVGG GRMDPPRGCPGPVYRIMTQCWQHEPELRPSFASILERLQYCTQDPDVLNSLLPMELGPTP EEEGTSGLGNRSLECLRPPQPQELSPEKLKSWGGSPLGPWLSSGLKPLKSRGLQPQNLWN PTYRS >gi568815583f:41317236_41580684|GENSCAN_predicted_CDS_6|6678_bp atgctgtcgagaccgaagccaggggagtccgaggtggacctgctgcacttccagagtcag tttctcgcagctggtgcagccccagcagtgcagttggtgaagaaaggaaataggggcggt ggtgatgccaactcagaccggcctccgctccaggaccatcgggatgtggtgatgttggac aatctcccagatttgcccccagctttggtcccttctcctccaaagagagccaggcccagc cctggccactgcctgcctgaggatgaggacccagaagagaggctgaggaggcatgatcag cacatcactgctgtcttgactaagattattgaacgagatacaagttcagtggccgtgaat ctgcctgtgcccagtggtgttgctttccctgctgtgttccttcgctcgcgggacacacag gggaaatcagcaacatctggtaagagaagcatctttgcccaggaaattgcggcaaggagg atagctgaagccaagggcccatcagttggggaagttgtgcccaacgtgggcccaccagag ggtgccgtgacctgtgagacacccactcctaggaaccagggctgccagcttcctgggagc agccacagctttcagggacccaatctggtcacagggaaggggctcagggatcaagaagct gagcaggaagcccagactatccatgaagagaacatagcaagactgcaggccatggctcct gaggagatcctgcaggaacagcagcggttgctggcccagcttgaccccagcttggttgct ttcttgagatctcacagccacacgcaagagcaaacaggagagacagcctctgaggagcag aggccaggaggaccctctgctaatgtcaccaaggaggaacccctcatgtcagcttttgcc agtgagcccaggaagagagacaagctggagccagaagccccagctctggcattgcccgtg acccctcagaaagaatggctgcacatggacactgtcgagctggagaagctccactggacc caggacttgccccctgtccggcggcagcagacacaggagaggatgcaggctcgattcagt cttcagggagaactactggcccctgacgtggacctgcccacccacctgggtctgcaccac catggagaggaggcagagagagcggggtattccctacaggagctgttccacctgacccgc agccaggtttcccagcagagagcactggcactgcatgtgttagcccaggtcatcagcagg gcccaggctggtgagtttggggaccggctagcaggcagtgtcttaagcctccttttggat gctggtttcctcttcctactgcgcttctccttggatgacagagtggatggggtcattgca accgccatccgtgctcttcgggctctgctggtggctcctggagatgaggagctcctcgac agcaccttctcttggtaccatggagctttgacgttccctctgatgcccagccaggaggac aaggaggatgaggacgaggatgaagaatgcccagcaggaaaagcaaaaaggaaaagccct gaagaagaaagccggcctccacctgacctggcccgacatgatgtcatcaaggggctcctg gctaccagcctgctgcctcggctgcgctacgtgctggaggtgacatacccaggacctgcg gtggtccttgacatcctggctgtgctcatccgcctggcccggcattccctggaatcagcc acaagggtcctggagtgccctcggctgatagagactatagttcgagagttcttgcccacc agttggtctcctgtgggggcagggcctacccctagtctatacaaagtaccctgtgctact gccatgaaactacttcgtgtcctggcctcagctgggaggaatattgctgcccggctgttg agcagctttgatctccggagccgcctgtgccgcatcatagctgaggctccccaagaactg gccttgcccccagaggaagctgagatgctgagcaccgaggccctccgtctgtgggctgtg gctgcctcctatggccagggcggttacctttacagggagctctacccagtgctgatgcgg gccttgcaggtggtgccgcgggagctcagcacccacccacctcaacccctgtccatgcag cggatagcctcactgctcactctcctcacccagctaaccctggcagccggcagtacccct gctgaaaccatcagtgattctgctgaggccagcctctcggccaccccttccttagtcact tggacacaggtgtctgggctccagcctcttgttgagccgtgtctaaggcagaccttgaag ttgctgtccagacctgagatgtggagagccgtgggcccagtgcccgttgcctgcctgttg ttcctgggagcctactaccaggcctggagccagcaactggctgccatattggctgccccg ggactccagaattacttcctccagtgtgtggctcctggggctgccccacacctcacacct ttctctgcatgggccctgcgccatgagtaccacctgcagtacctggcactcgctctggcc cagaaagcggcagcgctgcagccactgccagccacccatgctgccctctatcatggtatg gccttggccctgctgagccggctgctgcccggaagtgagtacctcacccatgagctgctg ctgagctgtgtattccggctggagttcctcccggaaagaacatcagggggtccagaggca gccgacttctctgaccagctgtcgttaggaagcagcagggtccctcggtgtgggcaaggg actctgctggctcaggcctgccaggacctccccagcatccgcaactgctacctgactcat tgctcgccagcccgagccagtctgctggcctcccaggctctgcaccgaggggagctacag cgagtcccaaccctgctactgcccatgcctacggagccgctgctgcccaccgactggccc ttcctgccactgattcgcctctaccaccgggcttcagacaccccctcgggactctctccc acagacaccatgggcacagccatgcgggtcctgcagtgggtgctagttttggagagctgg cgcccccaggctctctgggctgtgccccctgctgcccgcctggcacggctcatgtgtgtg ttcctggtggacagtgagctgttccgggagtccccagtacagcatctggtggcagccctc ctcgcccagctctgtcagcctcaagtcttgccaaacctcaacctggactgccgactccct ggcctgacgtctttccctgacctctatgccaacttcctggatcattttgaggctgtctct tttggggaccacctctttggggccctggtcctcctgcccctgcagcgtcggttcagtgtc accttgcgccttgccctctttggggaacacgtgggagccttgcgagctctgagcctgcct ctgacccagttgcctgtgtccctggagtgttacacagtgcctcctgaagacaacctggcc ctccttcagctctacttccggaccctggttactggtgcgctccgcccacgttggtgcccc gtgctctatgctgtggctgtggctcatgtcaatagcttcatcttctctcaggacccacag agctcagatgaggtcaaagctgcccgcaggagtatgctgcagaaaacatggctgctggca gatgagggtctccggcagcacctcctgcactataagcttcccaattccacgctcccagag ggctttgagctctattctcagttgccccctctgcgtcagcactacctccagagactgact tcaacagtgctccaaaatgggatcactgggtcctaccaccagaagctgctgattcagtac atctggggtgaaacccaagaatctgcatttctaacaagttcccaggggtgtggccgcgac cgcaagggcttttgttgccgggtggacccaacagggatgggctgctggggacagctgctg gtgtggttcggagccgcgggcgccattctctgctctagcccggggtcccaggagactttt ctgcggtcctcgcccctgccgctggcaagtcccagcccccgggacccgaaagtcagcgcc ccgcctagtatcttggagccagcctccccgctgaattctccgggcaccgaggggtcttgg ctgttttctacctgcggggccagcggccggcatgggcccacacagacacaatgtgacggg gcgtacgcggggaccagcgtggtggtgaccgtgggggccgccgggcagctgagaggcgtg cagctgtggcgcgtgccgggccctggccagtatctgatctcagcctacggagccgcgggc ggcaaaggcgccaagaaccacctgtcgcgggcgcatggcgtcttcgtctcagcaatcttc tccctcggtctcggggagtcgctgtacatcctggtggggcagcagggagaggacgcctgt cccggaggtagcccggagagccagctcgtctgcctcggggagtctcgagccgttgaagag cacgcggcgatggatgggagcgaaggggtcccggggtcgcggcgctgggcgggaggtggc gggggtggcgggggcgccacctacgttttccggctggagggcgcttcctggaacacgccg ctggccccacaggtgcgcgctggcgagctggaaccgttgctggtggcggccggaggcggc ggtcgggcctacctgaggccgcgggaccgaggccggactcaggcctcccccgagaaactg gagaaccgctcggaggcgcccgggagcggcgggagaggcggggcggcaggtggtgggggc ggctggacgtcgcgggctccctctccgcaggccggccgctcactgcaggagggggcggag ggcggccagggctgctccgaggcttgggcgacccttggctgggccgcggccggcggcttc gggggcggcggcggggcctgcactgcgggcggaggcggcggcggctacagggggggcgac gcttcagagactgacaacctctgggctgatggggaagatggagtatccttcatacacccc agcagcgagctcttcctgcagcctctggcagtcaccgagaaccacggagaggtagagatc cgaaggcacctcaactgcagtcactgccctttgagagactgccaatggcaggcagagctc cagctggctgaatgcctgtgcccagaaggcatggagctagctgtggataacgtcacctgc atggacctgcacaagcccccaggccctctggttctgatggtggctgtggtggcaacctca acactgagcctccttatggtgtgtggggtcctgattctgggtacgaagcgtctagcaggc acagttgattcaaggctgctcctctccatgaagcagaagaagtggcagggcctgcaggag atgaggctgccgagccctgagcttgagctgagcaagcttcgaacctctgccatcaggaca gcccccaatccctattattgccaggtggggcttggcccggcccagtcctggcctctgcca ccaggtgtcaccgaggtttccccagccaatgttactctgctcagagccctgggccatggt gcctttggggaggtgtatgagggactggtaattggccttcctggggactccagtcccctg caggtagctatcaagaccctgccagaactctgctcgcctcaggatgagctggatttcctc atggaggccctcatcatcagcaagtttcgccatcagaacattgtgcggtgtgtggggctc agcctcagggccacccctcgcctcattctgctggaactgatgtctggaggggacatgaag agtttcctgaggcacagtcggccacacctgggccagccatcacctctggtcatgcgggac ctgctgcaactggcccaggacatagcccagggctgccactacctggaggaaaatcacttc atccacagggatattgccgcccggaactgcctgctgagctgcgctggacccagccgagtg gccaagattggggactttgggatggcacgagatatctaccgggccagttattaccgcagg ggggaccgggccttgctcccagtcaagtggatgcccccagaggccttcctggagggcatc ttcacatccaagacagattcctggtcttttggggtgctgctctgggagatcttctcactg ggctacatgccctatcctgggcgcaccaaccaggaggtgctggacttcgtcgttggagga ggccggatggaccctcctaggggctgcccagggcctgtgtaccgcatcatgacccagtgt tggcagcacgagcctgagctccgccctagctttgccagcatcttggagcgtctgcagtac tgcactcaggacccggatgtgctgaattcactcctgccaatggagctggggcccacccca gaggaggaagggacttctgggctggggaacagatctttggagtgcctaagacccccacag ccccaggaactgagtccagagaagttgaaaagctggggaggtagccctcttggcccctgg ctgtcctctggcctcaagcccctcaaatccaggggcctccaacctcagaacctttggaat cccacttatcgctcctga >gi568815583f:41317236_41580684|GENSCAN_predicted_peptide_7|863_aa MGRPGLPPLPLPPPPRLGLLLAALASLLLPESAAAGLKLMGAPVKLTVSQGQPVKLNCSV EGMEEPDIQWVKDGAVVQNLDQLYIPVSEQHWIGFLSLKSVERSDAGRYWCQVEDGGETE ISQPVWLTVEGVPFFTVEPKDLAVPPNAPFQLSCEAVGPPEPVTIVWWRGTTKIGGPAPS PSVLNVTALPAAPFNITVTKLSSSNASVAWMPGADGRALLQSCTVQVTQAPGGWEVLAVV VPVPPFTCLLRDLVPATNYSLRVRCANALGPSPYADWVPFQTKGLAPASAPQNLHAIRTD SGLILEWEEVIPEAPLEGPLGPYKLSWVQDNGTQDELTVEGTRANLTGWDPQKDLIVRVC VSNAVGCGPWSQPLVVSSHDRAGQQGPPHSRTSWVPVVLGVLTALVTAAALALILLRKRR KETRFGQAFDSVMARGEPAVHFRAARSFNRERPERIEATLDSLGISDELKEKLEDVLIPE QQFTLGRMLGKGEFGSVREAQLKQEDGSFVKVAVKMLKADIIASSDIEEFLREAACMKEF DHPHVAKLVGVSLRSRAKGRLPIPMVILPFMKHGDLHAFLLASRIGENPFNLPLQTLIRF MVDIACGMEYLSSRNFIHRDLAARNCMYEFWRTRGLAEDMTVCVADFGLSRKIYSGDYYR QGCASKLPVKWLALESLADNLYTVQSDVWAFGVTMWEIMTRGQTPYAGIENAEIYNYLIG GNRLKQPPECMEDVYDLMYQCWSADPKQRPSFTCLRMELENILGQLSVLSASQDPLYINI ERAEEPTAGGSLELPGRDQPYSGAGDGSGMGAVGGTPSDCRYILTPGGLAEQPGQAEHQP ESPLNETQRLLLLQQGLLPHSSC >gi568815583f:41317236_41580684|GENSCAN_predicted_CDS_7|2592_bp atggggcggccggggctcccgccgctgccgctgccgccgccaccgcggctcgggctgctg ctggcggctctggcttctctgctgctcccggagtccgccgccgcaggtctgaagctcatg ggagccccggtgaagctgacagtgtctcaggggcagccggtgaagctcaactgcagtgtg gaggggatggaggagcctgacatccagtgggtgaaggatggggctgtggtccagaacttg gaccagttgtacatcccagtcagcgagcagcactggatcggcttcctcagcctgaagtca gtggagcgctctgacgccggccggtactggtgccaggtggaggatgggggtgaaaccgag atctcccagccagtgtggctcacggtagaaggtgtgccatttttcacagtggagccaaaa gatctggcagtgccacccaatgcccctttccaactgtcttgtgaggctgtgggtccccct gaacctgttaccattgtctggtggagaggaactacgaagatcgggggacccgctccctct ccatctgttttaaatgtaacagcactgcctgcagcccccttcaacatcaccgtgacaaag ctttccagcagcaacgctagtgtggcctggatgccaggtgctgatggccgagctctgcta cagtcctgtacagttcaggtgacacaggccccaggaggctgggaagtcctggctgttgtg gtccctgtgcccccctttacctgcctgctccgggacctggtgcctgccaccaactacagc ctcagggtgcgctgtgccaatgccttggggccctctccctatgctgactgggtgcccttt cagaccaagggtctagccccagccagcgctccccaaaacctccatgccatccgcacagat tcaggcctcatcttggagtgggaagaagtgatccccgaggcccctttggaaggccccctg ggaccctacaaactgtcctgggttcaagacaatggaacccaggatgagctgacagtggag gggaccagggccaatttgacaggctgggatccccaaaaggacctgatcgtacgtgtgtgc gtctccaatgcagttggctgtggaccctggagtcagccactggtggtctcttctcatgac cgtgcaggccagcagggccctcctcacagccgcacatcctgggtacctgtggtccttggt gtgctaacggccctggtgacggctgctgccctggccctcatcctgcttcgaaagagacgg aaagagacgcggtttgggcaagcctttgacagtgtcatggcccggggagagccagccgtt cacttccgggcagcccggtccttcaatcgagaaaggcccgagcgcatcgaggccacattg gacagcttgggcatcagcgatgaactaaaggaaaaactggaggatgtgctcatcccagag cagcagttcaccctgggccggatgttgggcaaaggagagtttggttcagtgcgggaggcc cagctgaagcaagaggatggctcctttgtgaaagtggctgtgaagatgctgaaagctgac atcattgcctcaagcgacattgaagagttcctcagggaagcagcttgcatgaaggagttt gaccatccacacgtggccaaacttgttggggtaagcctccggagcagggctaaaggccgt ctccccatccccatggtcatcttgcccttcatgaagcatggggacctgcatgccttcctg ctcgcctcccggattggggagaacccctttaacctacccctccagaccctgatccggttc atggtggacattgcctgcggcatggagtacctgagctctcggaacttcatccaccgagac ctggctgctcggaattgcatgtacgaattctggaggactcgagggctggcagaggacatg acagtgtgtgtggctgacttcggactctcccggaagatctacagtggggactactatcgt caaggctgtgcctccaaactgcctgtcaagtggctggccctggagagcctggccgacaac ctgtatactgtgcagagtgacgtgtgggcgttcggggtgaccatgtgggagatcatgaca cgtgggcagacgccatatgctggcatcgaaaacgctgagatttacaactacctcattggc gggaaccgcctgaaacagcctccggagtgtatggaggacgtgtatgatctcatgtaccag tgctggagtgctgaccccaagcagcgcccgagctttacttgtctgcgaatggaactggag aacatcttgggccagctgtctgtgctatctgccagccaggaccccttatacatcaacatc gagagagctgaggagcccactgcgggaggcagcctggagctacctggcagggatcagccc tacagtggggctggggatggcagtggcatgggggcagtgggtggcactcccagtgactgt cggtacatactcacccccggagggctggctgagcagccagggcaggcagagcaccagcca gagagtcccctcaatgagacacagaggcttttgctgctgcagcaagggctactgccacac agtagctgttag