GENSCAN 1.0 Date run: 5-Nov-116 Time: 11:51:07 Sequence gi568815597r:202048609_202260807 : 212199 bp : 51.57% C+G : Isochore 3 (51 - 57 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 12606 12688 83 2 2 -2 95 134 0.291 4.03 1.02 Term + 18998 19250 253 2 1 73 49 149 0.226 4.94 1.03 PlyA + 19849 19854 6 1.05 2.00 Prom + 27750 27789 40 -4.21 2.01 Init + 28273 28459 187 0 1 71 113 101 0.630 10.10 2.02 Term + 32411 32487 77 0 2 103 44 16 0.344 -3.00 2.03 PlyA + 33452 33457 6 1.05 3.05 PlyA - 34288 34283 6 1.05 3.04 Term - 37763 37658 106 1 1 114 42 69 0.650 3.08 3.03 Intr - 39823 39741 83 1 2 86 21 69 0.103 -1.07 3.02 Intr - 48224 48082 143 0 2 69 92 46 0.079 3.68 3.01 Init - 50409 50400 10 0 1 94 51 15 0.060 -1.56 3.00 Prom - 53840 53801 40 -0.51 4.00 Prom + 58039 58078 40 -3.51 4.01 Init + 74356 74985 630 0 0 89 105 743 0.988 69.53 4.02 Term + 79133 79948 816 1 0 103 54 1125 0.998 104.20 4.03 PlyA + 80880 80885 6 1.05 5.10 PlyA - 82833 82828 6 1.05 5.09 Term - 83964 83758 207 0 0 105 48 114 0.834 6.76 5.08 Intr - 85432 85326 107 2 2 28 81 22 0.132 -4.07 5.07 Intr - 86612 86542 71 1 2 86 70 88 0.578 6.12 5.06 Intr - 86918 86851 68 2 2 136 82 85 0.996 10.90 5.05 Intr - 87192 87099 94 2 1 113 65 148 0.978 15.47 5.04 Intr - 89430 89357 74 0 2 79 86 101 0.976 7.70 5.03 Intr - 89840 89760 81 2 0 47 100 144 0.890 11.83 5.02 Intr - 96357 95842 516 0 0 19 68 481 0.926 32.95 5.01 Init - 98884 98873 12 1 0 67 116 18 0.543 1.79 5.00 Prom - 99342 99303 40 -8.19 6.10 PlyA - 99383 99378 6 -4.04 6.09 Term - 100091 99998 94 1 1 120 46 105 0.991 7.10 6.08 Intr - 101816 101703 114 1 0 95 94 172 0.998 18.57 6.07 Intr - 104160 103934 227 0 2 98 96 185 0.993 17.61 6.06 Intr - 105227 105117 111 2 0 119 81 190 0.999 22.48 6.05 Intr - 105715 105578 138 1 0 102 75 196 0.998 20.87 6.04 Intr - 107001 106838 164 1 2 120 51 124 0.514 12.11 6.03 Intr - 109215 109131 85 0 1 77 71 135 0.422 10.59 6.02 Intr - 109693 109510 184 0 1 35 91 179 0.979 13.21 6.01 Init - 110731 110673 59 1 2 105 78 52 0.554 6.63 6.00 Prom - 116418 116379 40 -7.59 7.00 Prom + 118047 118086 40 -6.60 7.01 Init + 119443 119499 57 0 0 67 94 104 0.705 8.19 7.02 Intr + 120014 120062 49 1 1 131 96 -34 0.798 0.44 7.03 Intr + 120237 120508 272 1 2 77 3 198 0.159 8.00 7.04 Intr + 121358 121436 79 1 1 98 76 93 0.326 8.72 7.05 Intr + 122577 122831 255 1 0 110 113 132 0.990 15.85 7.06 Intr + 122933 123178 246 0 0 99 87 236 0.995 22.56 7.07 Intr + 123336 123611 276 1 0 114 70 117 0.457 10.53 7.08 Intr + 128441 128725 285 0 0 122 83 54 0.767 6.16 7.09 Intr + 128912 129175 264 0 0 102 64 50 0.480 2.02 7.10 Intr + 129503 129784 282 0 0 69 61 186 0.532 11.93 7.11 Intr + 130565 130762 198 0 0 102 -13 188 0.612 9.94 7.12 Intr + 133050 133176 127 1 1 126 105 69 0.908 12.64 7.13 Intr + 133281 133328 48 0 0 129 101 36 0.942 7.28 7.14 Intr + 134306 134405 100 2 1 81 11 35 0.591 -4.29 7.15 Intr + 134776 134866 91 0 1 93 76 109 0.992 10.27 7.16 Intr + 134975 135051 77 0 2 97 72 52 0.569 4.23 7.17 Intr + 136913 137058 146 1 2 91 27 158 0.998 9.59 7.18 Intr + 137948 138083 136 2 1 61 81 245 0.880 22.18 7.19 Intr + 138464 138531 68 1 2 83 80 103 0.757 7.20 7.20 Intr + 138661 138748 88 1 1 75 71 114 0.720 8.87 7.21 Intr + 138855 138933 79 2 1 84 76 109 0.987 8.92 7.22 Intr + 139291 139373 83 2 2 80 100 63 0.982 6.55 7.23 Intr + 140238 140384 147 2 0 73 85 215 0.941 20.64 7.24 Intr + 140623 140710 88 0 1 78 68 87 0.978 5.74 7.25 Intr + 144317 144335 19 0 1 134 76 4 0.019 -0.05 7.26 Intr + 145262 145333 72 2 0 60 40 112 0.020 2.62 7.27 Intr + 145357 145593 237 1 0 55 76 208 0.025 13.56 7.28 Intr + 146815 146860 46 1 1 106 110 -3 0.376 2.60 7.29 Term + 152137 152196 60 0 0 103 53 53 0.218 1.40 7.30 PlyA + 162640 162645 6 1.05 8.13 PlyA - 163852 163847 6 1.05 8.12 Term - 168169 167970 200 2 2 59 42 130 0.364 3.38 8.11 Intr - 174614 174563 52 2 1 125 42 18 0.135 -0.13 8.10 Intr - 176884 176750 135 1 0 36 50 113 0.330 3.57 8.09 Intr - 180366 180309 58 2 1 68 74 81 0.120 4.08 8.08 Intr - 184870 184713 158 1 2 43 89 21 0.016 -2.98 8.07 Intr - 186511 186294 218 2 2 24 79 124 0.073 3.85 8.06 Intr - 190611 190481 131 2 2 63 53 111 0.094 5.94 8.05 Intr - 191375 191317 59 2 2 88 48 23 0.447 -3.53 8.04 Intr - 195227 195101 127 1 1 109 68 42 0.372 5.49 8.03 Intr - 196005 195862 144 2 0 22 59 144 0.911 4.81 8.02 Intr - 196164 196065 100 1 1 90 64 32 0.525 0.67 8.01 Init - 204397 204334 64 1 1 80 69 61 0.601 4.66 8.00 Prom - 210449 210410 40 -0.61 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 145382 145593 212 1 2 97 76 289 0.955 24.84 S.002 Init + 206747 206807 61 1 1 99 81 39 0.929 5.66 S.003 Term + 206882 206940 59 0 2 124 36 38 0.899 0.34 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:202048609_202260807|GENSCAN_predicted_peptide_1|111_aa PKQSEQQNQVVLDDNAEYDTDIHKSTLIEPIQEWPKKAGNKMGFGSWMPHCLAGNEDLIP GGKWGTEGYWHKVAALKLSLAVNPEGIPERENALEGVRSQESDRCGGTGIN >gi568815597r:202048609_202260807|GENSCAN_predicted_CDS_1|336_bp cctaaacaatctgaacaacaaaaccaagtagtgttggatgacaacgcagaatatgacaca gacatccacaagtccacactgatagaacccattcaggaatggcccaagaaagcaggtaat aagatgggttttgggagctggatgcctcactgcctagctggcaatgaggacctcatccct ggtgggaagtggggtacggagggctattggcacaaggtggcagcattaaaactttcactg gcggtaaatccagaaggcataccagagagagagaatgcactggagggtgtgcggtcccag gagtctgaccggtgtggaggaactggtatcaattag >gi568815597r:202048609_202260807|GENSCAN_predicted_peptide_2|87_aa MPKPSWRGESQRKGPGAGVSMIWGLNRSWREEDGGQRPDEDSQTGLDKKSGDDDMQKTNP QEGKSQKPEFLFPKAIETRIPLPESKT >gi568815597r:202048609_202260807|GENSCAN_predicted_CDS_2|264_bp atgccaaaaccttcctggagaggagagagccagcgcaaaggtcctggggcaggagtgagc atgatctgggggttaaatagaagctggagagaagaggatggtgggcagaggcctgatgag gacagtcagacaggcctagataagaagtctggagatgatgatatgcagaaaaccaaccca caggaaggcaagtcacagaaaccagaattcctcttccccaaggccatagaaactagaatc cctctcccagaaagcaaaacctag >gi568815597r:202048609_202260807|GENSCAN_predicted_peptide_3|113_aa MAAGRAAVAAAAAADDDDDGDNNIYWALTIYQHSSQMFPCVNSLNLQQPFIPAAKQNPGQ ITSHGQQKLDKVLGQPSPWCSPNKIPNLAFILLNRCVSGISQGQANSLHYGCL >gi568815597r:202048609_202260807|GENSCAN_predicted_CDS_3|342_bp atggcggcaggaagggctgctgttgctgctgctgctgctgctgatgacgatgacgatggt gacaataacatttattgggcacttactatataccagcactcttcccaaatgtttccatgt gttaactcacttaatcttcaacagcctttcatacctgcagcaaagcaaaatcctggccag ataacttcccatgggcagcagaagctggacaaagtccttgggcagccatcaccatggtgc agccccaataagatcccaaacctggccttcatcctcctaaacagatgtgtttctggaatt tctcagggtcaagccaactccctgcactatggctgcctctga >gi568815597r:202048609_202260807|GENSCAN_predicted_peptide_4|481_aa MRWLWPLAVSLAVILAVGLSRVSGGAPLHLGRHRAETQEQQSRSKRGTEDEEAKGVQQYV PEEWAEYPRPIHPAGLQPTKPLVATSPNPGKDGGTPDSGQELRGNLTGAPGQRLQIQNPL YPVTESSYSAYAIMLLALVVFAVGIVGNLSVMCIVWHSYYLKSAWNSILASLALWDFLVL FFCLPIVIFNEITKQRLLGDVSCRAVPFMEVSSLGVTTFSLCALGIDRFHVATSTLPKVR PIERCQSILAKLAVIWVGSMTLAVPELLLWQLAQEPAPTMGTLDSCIMKPSASLPESLYS LVMTYQNARMWWYFGCYFCLPILFTVTCQLVTWRVRGPPGRKSECRASKHEQCESQLNST VVGLTVVYAFCTLPENVCNIVVAYLSTELTRQTLDLLGLINQFSTFFKGAITPVLLLCIC RPLGQAFLDCCCCCCCEECGGASEASAANGSDNKLKTEVSSSIYFHKPRESPPLLPLGTP C >gi568815597r:202048609_202260807|GENSCAN_predicted_CDS_4|1446_bp atgcggtggctgtggcccctggctgtctctcttgctgtgattttggctgtggggctaagc agggtctctgggggtgcccccctgcacctgggcaggcacagagccgagacccaggagcag cagagccgatccaagaggggcaccgaggatgaggaggccaagggcgtgcagcagtatgtg cctgaggagtgggcggagtacccccggcccattcaccctgctggcctgcagccaaccaag cccttggtggccaccagccctaaccccggcaaggatgggggcaccccagacagtgggcag gaactgaggggcaatctgacaggagcaccagggcagaggctacagatccagaaccccctg tatccggtgaccgagagctcctacagtgcctatgccatcatgcttctggcgctggtggtg tttgcggtgggcattgtgggcaacctgtcggtcatgtgcatcgtgtggcacagctactac ctgaagagtgcctggaactccatccttgccagcctggccctctgggattttctggtcctc tttttctgcctccctattgtcatcttcaacgagatcaccaagcagaggctactgggtgac gtttcttgtcgtgccgtgcccttcatggaggtctcctctctgggagtcacgactttcagc ctctgtgccctgggcattgaccgcttccacgtggccaccagcaccctgcccaaggtgagg cccatcgagcggtgccaatccatcctggccaagttggctgtcatctgggtgggctccatg acgctggctgtgcctgagctcctgctgtggcagctggcacaggagcctgcccccaccatg ggcaccctggactcatgcatcatgaaaccctcagccagcctgcccgagtccctgtattca ctggtgatgacctaccagaacgcccgcatgtggtggtactttggctgctacttctgcctg cccatcctcttcacagtcacctgccagctggtgacatggcgggtgcgaggccctccaggg aggaagtcagagtgcagggccagcaagcacgagcagtgtgagagccagctcaacagcacc gtggtgggcctgaccgtggtctacgccttctgcaccctcccagagaacgtctgcaacatc gtggtggcctacctctccaccgagctgacccgccagaccctggacctcctgggcctcatc aaccagttctccaccttcttcaagggcgccatcaccccagtgctgctcctttgcatctgc aggccgctgggccaggccttcctggactgctgctgctgctgctgctgtgaggagtgcggc ggggcttcggaggcctctgctgccaatgggtcggacaacaagctcaagaccgaggtgtcc tcttccatctacttccacaagcccagggagtcacccccactcctgcccctgggcacacct tgctga >gi568815597r:202048609_202260807|GENSCAN_predicted_peptide_5|409_aa MGMKRLSTPAAGQPPPLKPADSLLTPPADSLLTPPAGAAPGPPAPAREERALGCEQDTAR PRGRAAAAAEEEGAEATPIGEPIPGQRVVGSRARASARTEPRDPGRTGEGPLRAAARGQR GAAGTWHRAGPAAATMIALFNKLLDWFKALFWKEEMELTLVGLQYSGKTTFVNVIASGQF NEDMIPTVGFNMRKITKGNVTIKLWDIGGQPRFRSMWERYCRGVSAIVYMVDAADQEKIE ASKNELHNLLDKPQLQGIPVLVLGNKRDLPGALDEKELIEKMNLSAIQDREICCYSISCK EKDNIGAFQDLGGGSLERRRETQVDLGQVQIRRGAGWHLRQEKGAKLSLSPGSGDLDSPS WALQPLTFIGAQGFLLEAVIRPVRAVPCLERQFPTARELEERMWMEFGI >gi568815597r:202048609_202260807|GENSCAN_predicted_CDS_5|1230_bp atggggatgaagcgcctctcgacgccagctgccggtcaacccccgccgctgaagcctgcg gattccctgctcaccccacctgcggattccctgctcaccccacctgcgggtgcggcccct ggccccccggctccagcgagggaggagcgcgcgctcgggtgcgagcaggacacggcccgg ccgcgagggagggcggcggcggcggcggaggaggagggcgcggaagccacacccatcggc gagccgattccggggcagcgagtcgtcggcagccgcgctcgagcctccgcccgcaccgag ccgcgggacccgggccgtaccggggaggggccgctccgggccgcagcgcgagggcagcga ggggcggcggggacctggcaccgggcggggccggcggcagcgaccatgatcgctttgttc aacaagctgctggactggttcaaggccctattctggaaggaggagatggagctcacgctg gtcgggcttcagtactcgggcaagaccaccttcgtcaacgtgatcgcgtcaggacagttc aacgaggacatgatccccaccgtgggtttcaacatgcgcaaaatcaccaaagggaatgtg actatcaagctctgggacattgggggacagccgcgtttccgcagcatgtgggagcgctac tgccgaggagtgagcgccatcgtgtacatggtggatgctgctgaccaggagaagattgag gcctctaagaacgagctccacaacctactggacaaacctcagctgcagggcatcccggtc ttagtcctgggtaacaagcgagaccttccgggagcattggatgagaaggagctgattgag aaaatgaatctgtctgccatccaggaccgagagatctgctgctactccatctcttgcaaa gaaaaggacaacattggggcctttcaggatctgggagggggcagtctggagagaaggagg gagacgcaggtggacttggggcaagttcagatcagaagaggtgcaggctggcacctgcgg caggaaaaaggagccaaactgtcccttagtcctgggagtggggaccttgactccccgtcc tgggccctccagcccctcaccttcattggtgcccagggcttcctcctggaggcagtcatc agacctgtcagagcagttccctgcctggagaggcagttccccactgccagagagttggag gaaagaatgtggatggaatttggcatctga >gi568815597r:202048609_202260807|GENSCAN_predicted_peptide_6|391_aa MTQPPPEKTPAKKHVRLQERRGSNVALMLDVRSLGAVEPICSVNTPREVTLHFLRTAGHP LTRWALQRQPPSPKQLEEEFLKIPSNFVSPEDLDIPGHASKDRYKTILPNPQSRVCLGRA QSQEDGDYINANYIRVSGLAAVVGGEWGNSQTSENIFQPWLVDQGYDGKEKVYIATQGPM PNTVSDFWEMVWQEEVSLIVMLTQLREGKEKCVHYWPTEEETYGPFQIRIQDMKECPEYT VRQLTIQAPEKKVDSALHPDLGSHALSPIQYQEERRSVKHILFSAWPDHQTPESAGPLLR LVAEVEESPETAAHPGPIVVHCSAGIGRTGCFIATRIGCQQLKARGEVDILGIVCQLRLD RGGMIQTAEQYQFLHHTLALYAGQLPEEPSP >gi568815597r:202048609_202260807|GENSCAN_predicted_CDS_6|1176_bp atgacccagcctccgcctgaaaaaacgccagccaagaagcatgtgcgactgcaggagagg cggggctccaatgtggctctgatgctggacgttcggtccctgggggccgtagaacccatc tgctctgtgaacacaccccgggaggtcaccctacactttctgcgcactgctggacacccc cttacccgctgggcccttcagcgccagccacccagccccaagcaactggaagaagaattc ttgaagatcccttcaaactttgtcagccccgaagacctggacatccctggccacgcctcc aaggaccgatacaagaccatcttgccaaatccccagagccgtgtctgtctaggccgggca cagagccaggaggacggagattacatcaatgccaactacatccgagtgagtggcctggct gcagtggtgggcggggaatgggggaattctcagacgtctgagaatatcttccagccatgg ctggtggatcagggctatgacgggaaggagaaggtctacattgccacccagggccccatg cccaacactgtgtcggacttctgggagatggtgtggcaagaggaagtgtccctcattgtc atgctcactcagctccgagagggcaaggagaaatgtgtccactactggcccacagaagag gaaacctatggacccttccagatccgcatccaggacatgaaagagtgcccagaatacact gtgcggcagctcaccatccaggcaccagagaagaaggtagacagtgccctccaccctgac cttggttctcatgccttgtctccaatccagtaccaggaagagcgccggtcagtaaagcac atcctcttttcggcctggccagaccatcagacaccagaatcagctgggcccctgctgcgc ctagtggcagaggtggaggagagcccggagacagccgcccaccccgggcctatcgtagtc cactgcagtgcagggattggccggacgggctgcttcatcgccacgcgaattggctgtcaa cagctgaaagcccgaggagaagtggacattctgggtattgtgtgccaactgcggctagac agaggggggatgatccagacggcagagcagtaccagttcctgcaccacactttggccctg tatgcaggccagctgcctgaggaacccagcccctga >gi568815597r:202048609_202260807|GENSCAN_predicted_peptide_7|1324_aa MRLPILFAALLWFRGFLAEEEACLSLEGSPGRESAGPPLNVNITSQGRPTSLFLSWAAPG PGRFTHALRLTCLSPLSSPEGQQLQAHTNASSFKFQDLVSGGRYQLEVTALRPCGQNVTI TLTARTGPPYELTLSAAARPHRAVGPNATEWTYTSAPPRPGLTPLPAKLWASWKVGPGVQ DDFLLKLSGPVEKNITLGPEAHNVTFPGPLPTGHYALELKVLAGPYDAWAQASAWLDDSA AKSRQGSGAKRQLDGLEASKEPGRRALLYTEGNPGLLGNISVPPGATHITFYGPVPGARY CVDIASSLGIITYSLMGHKSPLAPQSLEVISRGGPSDLAIVWAPAPGQREGYRVAWHQEG SQRSPGSLVDLGPDNSSLTLRSLVPGSSYAMSVWAWAENLGSSIQKIHPCTCPLAPPLVN VTSEGPTQLWASWVHAPRGRDSYPVTLYRAGTSAVGAKVASTSFSSLTPGTKYKVEVVTQ AGPHHIAAANTSGWTHEAWGRQRCRRALHTPSELVSMHASTAVVNLAWASSPLGQGMCYT QLSEAGHLSWEHPLVPGQAHLILRGLTPGCNLSLSVLCQAGPLQASTQRVVLLVEPGPVE DVQCQPEATFLALNWTVPARDVGTCLVVAEQLVAGGNAHLVFQADTSKNAVLLPNLVPVT SYHLSLAVLGRNGLWSRVVTLACSTSAEVPQPSWEAINHMWHDHYYRGHDSYLAILLPNP FYPDPWAVPRSWTVPVGTEDCGHTKEICNGQLKLEPWAGVSLASVPLPVMEGLVVGCVLT ICAVLGLLCWRRVKGQRAGKNPFSQELTAYNLRPGWALEELAGNARCSTPRVEFRICVSN RPPGDQELKEVGKEQPRLEAEYAANTTKNHYPHVLPYDHSRVRLTQLEGEPHSDYINANF IPVLCEHYWLTDSTPVTHDHITIHLLAEEADDEWTKREFQLQHMRAPRMRGVGMGQTGTF VALLRLLQQLEEEQMVDVFHAVFAFWMHGPLMIQTLSQYVFLHSCLLNKILEGPFNISES WPISVMNFAQACAKRAANANAGFLKEYELLLQAIKDEAGSYAPLPGYEQDSPISWPEELW ELVWQHGAHVLVSLCPLDAMEKVSCSKGATQLGTFLAMEQLLQQAGSECTVDVFNVALQQ SQACDLMTPTLKQYIYLYNCLNSALADGLPLSRHWSLCRRGLWGRPRAPGAVPTDGAARR DREEAAAAIAPCPSSPTAEMPSPPGLRALWLCAALCASRRAGGAPQPGPGPTACPAPCHC QEDGIMLSADCSELGLSAVPGDLDPLTAYLLGCPPPLQKAQAVGQTWENRSQEEKNTREA KLGQ >gi568815597r:202048609_202260807|GENSCAN_predicted_CDS_7|3975_bp atgaggctcccaatcctgttcgctgccctgctctggttccggggttttctggcagaggag gaagcatgcctctccctggaagggagtccaggcagggagagtgcaggcccacccctgaac gtgaacatcaccagccaggggagacctactagcctctttctgagctgggcagccccgggg ccaggcaggttcacccatgccctccgcctcacatgtctgagccccctcagctctcctgaa gggcagcagctccaggcccacaccaatgcatccagctttaagttccaagatctggtgtca gggggtcgctaccagctggaagtgactgccctgcgaccctgtgggcagaatgtcaccatc accctcactgctcgcactggccccccctatgagctgacgctcagtgctgctgccaggccc catcgggcagtggggcccaatgccacagagtggacctatacctctgccccccctcgacct ggtttgactcccctgcccgccaagctctgggcaagctggaaggtagggccaggtgtgcag gatgacttcctgctgaagttaagtgggccagtggagaagaatatcactctaggccctgag gcccacaatgtcacattcccagggcccctgcccactgggcactatgctctggagctgaag gtcctagcggggccatatgacgcctgggcccaggccagtgcctggctggacgattccgca gccaagtccagacaaggcagtggtgccaagcggcagctggatgggctggaggcctccaag gagcccgggagacgggccctgctctacacagagggaaacccgggcctccttggaaacatc tctgtgccacctggtgccacccacatcaccttctatgggccagtgcctggggcccgctac tgtgtggacattgcctcatctctgggaatcatcacttacagcctcatgggccacaaaagt cccctggcaccacagtccctggaggttatcagcaggggtggcccctctgacctggccatt gtctgggccccagcaccaggacagcgggaaggctacagggtcgcttggcaccaggagggc agccagaggtcaccgggcagtcttgttgacttgggcccggacaattccagcctgactctg aggagtctggtgcccggctcctcctatgccatgtcagtgtgggcctgggcagagaacctt ggctctagcatccagaagatccacccctgtacttgcccgcttgcccctcctctggtaaat gtgacgagtgaaggtcccacccagctctgggcatcctgggtccatgcccccaggggccga gacagctacccggtgaccctgtaccgggcaggcaccagcgccgtcggagccaaggtggcc agcacaagcttttcaagtctgactccaggcacgaagtacaaggtggaggttgtcacgcag gctgggccccaccacattgcagcagccaacacctctggctggacccatgaggcatgggga aggcagcgatgcaggagagccctgcacacacccagtgagttggtgtccatgcatgcgagc accgctgtggtcaacctggcctgggccagcagccccttggggcaggggatgtgctacacc caactctcagaggcggggcacctctcctgggagcaccctctggtgccaggccaagcccac ctcatcctgaggggcctcacacctggatgcaacctctccctgtcagtgctgtgccaggca gggccgctgcaggcgtccactcagcgcgtggtactgcttgttgagcctggccctgtggaa gatgtgcagtgccagcctgaggccaccttcctggccctgaactggacagtgcccgccagg gatgtgggcacctgtctggtggtggcagagcagctggtggcaggagggaatgctcacctt gtgttccaggccgacacctccaaaaatgcagtcctgttgcccaacctggtgcctgtcact tcctatcacctcagcctcgctgtgctgggcaggaacggtctgtggagtcgggtggtcact ctggcatgttccacatctgccgaggtgcctcagccttcctgggaagccatcaaccacatg tggcatgaccactactacagaggacatgactcctacctggccatcctgctccccaacccc ttctacccggatccctgggctgtgccgagatcctggacagtgcctgtgggtacagaggac tgtggccacaccaaagagatatgcaacgggcagctcaagctagagccctgggccggtgtc tccctggcatcagtgcccctgccggtaatggagggcctcgtggtgggctgtgtcctcacc atctgtgctgtgctgggcctgctgtgctggaggcgggtgaaggggcagagggcagggaag aatccattttcccaagagctgacagcttacaacctgcgcccaggctgggcgttggaggag cttgctggaaatgcacgctgctccacacccagggtggaattcaggatctgcgtttccaac aggcccccaggtgatcaggagctgaaggaggtgggcaaggagcagcccagactggaggct gagtacgctgccaacaccaccaagaaccattacccacatgtgcttccctacgaccactcc agggtcaggctgacccagctggagggagagcctcattctgactacatcaatgccaacttc atcccagtgctgtgtgagcattactggctgaccgactctaccccggtcacccatgatcac atcaccatccacctcctagccgaggaggctgacgatgagtggaccaagcgggaattccag ctgcagcacatgcgtgccccaaggatgaggggtgtgggcatgggccagacgggcaccttc gtggccctgttgaggctgctgcagcagctggaggaggagcagatggtagatgtgttccat gctgtgtttgcattctggatgcacgggcccctcatgatccagaccctgagccagtacgtc ttcctgcacagctgcctactgaacaagattctggaagggcccttcaacatctctgagtct tggcccatctctgtgatgaacttcgcacaggcgtgtgccaagagggcagccaatgccaac gctggcttcttgaaggagtacgagctcttgctgcaggccatcaaggacgaggctggctct tacgcacccctgcctggctatgagcaggacagccccatctcctggccagaggagctctgg gagctggtgtggcagcacggggctcatgtgcttgtctctctgtgcccactcgatgccatg gagaaggtcagttgcagcaagggtgcgacccagctgggcaccttcctggccatggagcag ctgctgcagcaggcagggtctgagtgcaccgtggatgtctttaacgtggccctgcagcag tctcaggcctgtgaccttatgaccccaacgctgaagcagtatatctacctctacaattgt ctgaacagcgcactggcagacgggctgcccctgagtcggcactggtcactgtgcaggaga gggctctggggccggcccagagcccctggggcggtccccaccgacggtgcagcccgccgg gaccgggaggaagcagctgcggccatcgcgccgtgccccagtagcccgaccgccgagatg cccagcccgccggggctccgggcgctatggctttgcgccgcgctgtgcgcttcccggagg gccggcggcgccccccagcccggcccggggcccaccgcctgcccggccccctgccactgc caggaggacggcatcatgctgtctgccgactgctctgagctcgggctgtccgccgttccg ggggacctggaccccctgacggcttacctgttgggatgtccccctccactccaaaaggca caggctgtgggccagacatgggagaacaggtcccaggaggagaaaaacacccgagaggcc aaacttgggcagtga >gi568815597r:202048609_202260807|GENSCAN_predicted_peptide_8|481_aa MTRECQNVFLLGALKPSIKECVGEMEAPEDIQLPEPVNGTLFRKGAFANVLRILGDHRET QRERGESYVKAEGRDWSDAATRNASSCRKLEEAKKHPPQHPQSAQETQENDSLRKISKFM HNWLLLLESPRAPTWALLKICFREALPHPVSITTTNDHCRSSNKRPSRQAQGYLRTPARQ QKQEKSRPEDTYPGWKTPTPEDGERGHPGIRGDLESFEDSTRKRRESPEAQSPEQEKELG KDIGGKPSRPLDPEMRRRAHREFMEELGKQEGRKLLTALGTRPALILMQKPEGAWSKDTR ETMAEKTGGRRLLMPLHRPHPSRGPGGYIWSLHKCSGAMNNLLNCTKETCKSGLLQEAQV VEEARLKLCEVVHAEVPGGEKMEKGAVNSPLGPSGTVQNTSGLAGLEMGSAEAVGGREVK GTQASQAQPEPQCDLRTAQEYDRAEPLERNENMGDCFWAPSALYYLSERQRKTYTVQRSG H >gi568815597r:202048609_202260807|GENSCAN_predicted_CDS_8|1446_bp atgactcgagagtgtcagaatgtcttcctgctgggtgcgttgaagccctcaattaaggag tgtgtgggtgaaatggaggccccagaagatatacaacttccagaacctgtaaatgggaca ttatttagaaaaggggcctttgcaaatgtattaaggatcttgggagaccacagagagaca cagagagaaagaggagaaagctacgtgaaggcagaaggcagagactggagtgatgcagcc acaaggaacgccagcagctgccggaagctggaagaggcaaagaagcacccaccccagcac cctcagagtgctcaggagacccaagaaaatgactccttgagaaaaatctccaaattcatg cacaactggctccttctcttggaatcacccagggctcccacttgggccctcttgaagatt tgcttccgggaggctctcccacatccagtcagcattactactactaatgatcattgtaga agtagcaacaaaaggcccagtcgccaggcgcagggctacttgcgtacgccagcaagacag cagaagcaggaaaagagccggccggaagacacctaccctggttggaagacacctacccct gaagatggagaaagaggccatccgggcatcagaggggacttagaaagttttgaggatagc actcgaaagcgcagggagtcccctgaggcacagagccctgagcaagagaaggaactgggt aaagacataggaggaaaaccctccaggcctttagatccagaaatgagaaggagagcacac agggagttcatggaagagctggggaagcaagaagggaggaagttgctgactgccctgggc acaaggcctgccttgattctcatgcagaagcctgagggtgcttggagcaaggatacaaga gaaacgatggctgagaaaacagggggtcgcaggctgctcatgcctcttcacaggccccac ccctccaggggcccaggaggctacatctggtcactgcacaaatgctctggtgcaatgaac aacctgctcaactgtacaaaggagacctgcaagtcagggctcctccaagaagcgcaggtg gtggaagaggccaggctgaagctctgtgaggttgttcatgctgaggtccctggtggagag aagatggaaaaaggagctgtgaactccccacttgggccatctggcactgtccagaacacc tctgggcttgccggtctagaaatgggatccgcagaagctgtaggtggcagagaagttaaa gggacccaggcctcccaggcccagccggagccacagtgtgacctgagaacagcccaggag tatgacagagcagagcctctggagaggaatgaaaatatgggggactgcttctgggccccc tctgctctctactacctctctgagagacaacgcaagacctacactgtccagcgcagtggc cactag