GENSCAN 1.0    Date run: 24-Oct-119    Time: 21:37:09

Sequence gi568815597f:202093990_202419204 : 325215 bp : 48.07% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  28975  29604  630  0  0   89  105   743 0.994  69.45
 1.02 Term +  33752  34567  816  1  0  103   54  1125 0.999 103.74
 1.03 PlyA +  35499  35504    6                               1.05

 2.19 PlyA -  37452  37447    6                               1.05
 2.18 Term -  38583  38377  207  0  0  105   48   114 0.808   6.44
 2.17 Intr -  40051  39945  107  2  2   28   81    22 0.226  -4.57
 2.16 Intr -  41231  41161   71  1  2   86   70    88 0.687   5.63
 2.15 Intr -  41537  41470   68  2  2  136   82    85 0.997  10.40
 2.14 Intr -  41811  41718   94  2  1  113   65   148 0.994  14.97
 2.13 Intr -  44049  43976   74  0  2   79   86   101 0.994   7.20
 2.12 Intr -  44459  44379   81  2  0   47  100   144 0.917  11.33
 2.11 Intr -  50976  50461  516  0  0   19   68   481 0.948  32.36
 2.10 Intr -  53576  53492   85  1  1  -17  116     7 0.005  -7.18
 2.09 Intr -  54710  54654   57  1  0  120   16    69 0.013   0.90
 2.08 Intr -  56435  56322  114  1  0   95   94   172 0.997  18.06
 2.07 Intr -  58779  58553  227  0  2   98   96   185 0.993  17.08
 2.06 Intr -  59846  59736  111  2  0  119   81   190 0.999  21.98
 2.05 Intr -  60334  60197  138  1  0  102   75   196 0.998  20.36
 2.04 Intr -  61620  61457  164  1  2  120   51   124 0.673  11.59
 2.03 Intr -  63834  63750   85  0  1   77   71   135 0.477  10.09
 2.02 Intr -  64312  64129  184  0  1   35   91   179 0.988  12.69
 2.01 Init -  65350  65292   59  1  2  105   78    52 0.568   6.68
 2.00 Prom -  71037  70998   40                             -10.35

 3.00 Prom +  72666  72705   40                              -9.36
 3.01 Init +  74062  74118   57  0  0   67   94   104 0.737   8.24
 3.02 Intr +  74633  74681   49  1  1  131   96   -34 0.772  -0.05
 3.03 Intr +  74856  75127  272  1  2   77    3   198 0.186   7.46
 3.04 Intr +  75977  76055   79  1  1   98   76    93 0.338   8.22
 3.05 Intr +  77196  77450  255  1  0  110  113   132 0.990  15.32
 3.06 Intr +  77552  77797  246  0  0   99   87   236 0.995  22.03
 3.07 Intr +  77955  78230  276  1  0  114   70   117 0.458   9.99
 3.08 Intr +  83060  83344  285  0  0  122   83    54 0.719   5.61
 3.09 Intr +  83531  83794  264  0  0  102   64    50 0.416   1.48
 3.10 Intr +  84122  84403  282  0  0   69   61   186 0.516  11.39
 3.11 Intr +  85184  85381  198  0  0  102  -13   188 0.650   9.42
 3.12 Intr +  87669  87795  127  1  1  126  105    69 0.923  12.14
 3.13 Intr +  87900  87947   48  0  0  129  101    36 0.921   6.80
 3.14 Intr +  88925  89024  100  2  1   81   11    35 0.582  -4.79
 3.15 Intr +  89395  89485   91  0  1   93   76   109 0.989   9.77
 3.16 Intr +  89594  89670   77  0  2   97   72    52 0.605   3.73
 3.17 Intr +  91532  91677  146  1  2   91   27   158 0.998   9.08
 3.18 Intr +  92567  92702  136  2  1   61   81   245 0.882  21.67
 3.19 Intr +  93083  93150   68  1  2   83   80   103 0.757   6.70
 3.20 Intr +  93280  93367   88  1  1   75   71   114 0.720   8.37
 3.21 Intr +  93474  93552   79  2  1   84   76   109 0.987   8.42
 3.22 Intr +  93910  93992   83  2  2   80  100    63 0.983   6.06
 3.23 Intr +  94857  95003  147  2  0   73   85   215 0.946  20.13
 3.24 Intr +  95242  95329   88  0  1   78   68    87 0.974   5.24
 3.25 Intr +  98936  98954   19  0  1  134   76     4 0.053  -0.53
 3.26 Intr +  99881  99952   72  2  0   60   40   112 0.055   2.12
 3.27 Intr +  99976 100212  237  1  0   55   76   208 0.068  13.03
 3.28 Intr + 101434 101479   46  1  1  106  110    -3 0.523   2.11
 3.29 Term + 106756 106815   60  0  0  103   53    53 0.308   1.10
 3.30 PlyA + 117259 117264    6                               1.05

 4.13 PlyA - 118471 118466    6                               1.05
 4.12 Term - 122788 122589  200  2  2   59   42   130 0.271   3.06
 4.11 Intr - 129233 129182   52  2  1  125   42    18 0.134  -0.62
 4.10 Intr - 131503 131369  135  1  0   36   50   113 0.229   3.06
 4.09 Intr - 134985 134928   58  2  1   68   74    81 0.097   3.59
 4.08 Intr - 139489 139332  158  1  2   43   89    21 0.037  -3.49
 4.07 Intr - 141130 140913  218  2  2   24   79   124 0.101   3.32
 4.06 Intr - 145230 145100  131  2  2   63   53   111 0.124   5.44
 4.05 Intr - 145994 145936   59  2  2   88   48    23 0.372  -4.02
 4.04 Intr - 149846 149720  127  1  1  109   68    42 0.351   4.98
 4.03 Intr - 150624 150481  144  2  0   22   59   144 0.912   4.30
 4.02 Intr - 150783 150684  100  1  1   90   64    32 0.648   0.17
 4.01 Init - 159016 158953   64  1  1   80   69    61 0.603   4.71
 4.00 Prom - 165068 165029   40                              -3.36

 5.00 Prom + 166799 166838   40                              -4.96
 5.01 Init + 171282 171362   81  2  0   96   37    29 0.071  -0.43
 5.02 Intr + 176927 177039  113  1  2    0  110   130 0.511   5.58
 5.03 Intr + 182317 182532  216  1  0  118   68   362 0.985  34.82
 5.04 Intr + 186792 186863   72  0  0  111   74    97 0.987   9.12
 5.05 Intr + 188543 188672  130  2  1   58   75    37 0.715   0.10
 5.06 Intr + 189544 189670  127  0  1   81   80    56 0.577   4.35
 5.07 Intr + 194843 194942  100  0  1   18   94    30 0.041  -4.23
 5.08 Intr + 203519 203587   69  2  0  128   67   112 0.716  11.40
 5.09 Intr + 206860 206931   72  1  0   99   76   111 0.981   9.52
 5.10 Intr + 207175 207246   72  1  0   57   93    54 0.713   1.32
 5.11 Intr + 209290 209358   69  1  0   93   76    99 0.955   7.50
 5.12 Intr + 210570 210641   72  0  0  142   76    74 0.998  10.12
 5.13 Intr + 211695 211760   66  0  0  120   82    32 0.917   3.82
 5.14 Intr + 212879 212950   72  2  0   88   76   107 0.994   8.02
 5.15 Intr + 213341 213412   72  2  0  113   87    75 0.997   8.42
 5.16 Intr + 215062 215187  126  1  0   90   78    75 0.988   6.49
 5.17 Intr + 216208 216368  161  1  2   83   74   162 0.997  13.93
 5.18 Intr + 220813 220893   81  2  0  126   94    92 0.999  13.21
 5.19 Term + 223963 225218 1256  2  2   99   36  1260 0.902 113.65
 5.20 PlyA + 225749 225754    6                               1.05

 6.00 Prom + 230387 230426   40                              -5.96
 6.01 Sngl + 232851 233189  339  2  0  104   44   148 0.975   8.03
 6.02 PlyA + 237732 237737    6                               1.05

 7.05 PlyA - 238148 238143    6                               1.05
 7.04 Term - 239346 239179  168  0  0   56   55   143 0.349   5.68
 7.03 Intr - 239566 239461  106  0  1  107   87    17 0.999   3.72
 7.02 Intr - 241069 241000   70  2  1  116   88    58 0.994   6.84
 7.01 Init - 241765 241657  109  1  1   86   49    47 0.875   1.02
 7.00 Prom - 250094 250055   40                              -7.76

 8.00 Prom + 252025 252064   40                              -5.06
 8.01 Init + 253663 253665    3  0  0  108   81     0 0.847   1.30
 8.02 Intr + 254602 255153  552  0  0  -28   93   599 0.345  41.85
 8.03 Intr + 310431 310481   51  2  0   95   99     6 0.025   1.50
 8.04 Term + 322798 322932  135  0  0   76   48   140 0.784   6.82
 8.05 PlyA + 323562 323567    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 54710 54617 94 1 1 120 46 105 0.975 6.80 S.002 Init + 100001 100212 212 1 2 97 76 289 0.916 24.86 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:202093990_202419204|GENSCAN_predicted_peptide_1|481_aa MRWLWPLAVSLAVILAVGLSRVSGGAPLHLGRHRAETQEQQSRSKRGTEDEEAKGVQQYV PEEWAEYPRPIHPAGLQPTKPLVATSPNPGKDGGTPDSGQELRGNLTGAPGQRLQIQNPL YPVTESSYSAYAIMLLALVVFAVGIVGNLSVMCIVWHSYYLKSAWNSILASLALWDFLVL FFCLPIVIFNEITKQRLLGDVSCRAVPFMEVSSLGVTTFSLCALGIDRFHVATSTLPKVR PIERCQSILAKLAVIWVGSMTLAVPELLLWQLAQEPAPTMGTLDSCIMKPSASLPESLYS LVMTYQNARMWWYFGCYFCLPILFTVTCQLVTWRVRGPPGRKSECRASKHEQCESQLNST VVGLTVVYAFCTLPENVCNIVVAYLSTELTRQTLDLLGLINQFSTFFKGAITPVLLLCIC RPLGQAFLDCCCCCCCEECGGASEASAANGSDNKLKTEVSSSIYFHKPRESPPLLPLGTP C >gi568815597f:202093990_202419204|GENSCAN_predicted_CDS_1|1446_bp atgcggtggctgtggcccctggctgtctctcttgctgtgattttggctgtggggctaagc agggtctctgggggtgcccccctgcacctgggcaggcacagagccgagacccaggagcag cagagccgatccaagaggggcaccgaggatgaggaggccaagggcgtgcagcagtatgtg cctgaggagtgggcggagtacccccggcccattcaccctgctggcctgcagccaaccaag cccttggtggccaccagccctaaccccggcaaggatgggggcaccccagacagtgggcag gaactgaggggcaatctgacaggagcaccagggcagaggctacagatccagaaccccctg tatccggtgaccgagagctcctacagtgcctatgccatcatgcttctggcgctggtggtg tttgcggtgggcattgtgggcaacctgtcggtcatgtgcatcgtgtggcacagctactac ctgaagagtgcctggaactccatccttgccagcctggccctctgggattttctggtcctc tttttctgcctccctattgtcatcttcaacgagatcaccaagcagaggctactgggtgac gtttcttgtcgtgccgtgcccttcatggaggtctcctctctgggagtcacgactttcagc ctctgtgccctgggcattgaccgcttccacgtggccaccagcaccctgcccaaggtgagg cccatcgagcggtgccaatccatcctggccaagttggctgtcatctgggtgggctccatg acgctggctgtgcctgagctcctgctgtggcagctggcacaggagcctgcccccaccatg ggcaccctggactcatgcatcatgaaaccctcagccagcctgcccgagtccctgtattca ctggtgatgacctaccagaacgcccgcatgtggtggtactttggctgctacttctgcctg cccatcctcttcacagtcacctgccagctggtgacatggcgggtgcgaggccctccaggg aggaagtcagagtgcagggccagcaagcacgagcagtgtgagagccagctcaacagcacc 
gtggtgggcctgaccgtggtctacgccttctgcaccctcccagagaacgtctgcaacatc gtggtggcctacctctccaccgagctgacccgccagaccctggacctcctgggcctcatc aaccagttctccaccttcttcaagggcgccatcaccccagtgctgctcctttgcatctgc aggccgctgggccaggccttcctggactgctgctgctgctgctgctgtgaggagtgcggc ggggcttcggaggcctctgctgccaatgggtcggacaacaagctcaagaccgaggtgtcc tcttccatctacttccacaagcccagggagtcacccccactcctgcccctgggcacacct tgctga >gi568815597f:202093990_202419204|GENSCAN_predicted_peptide_2|813_aa MTQPPPEKTPAKKHVRLQERRGSNVALMLDVRSLGAVEPICSVNTPREVTLHFLRTAGHP LTRWALQRQPPSPKQLEEEFLKIPSNFVSPEDLDIPGHASKDRYKTILPNPQSRVCLGRA QSQEDGDYINANYIRVSGLAAVVGGEWGNSQTSENIFQPWLVDQGYDGKEKVYIATQGPM PNTVSDFWEMVWQEEVSLIVMLTQLREGKEKCVHYWPTEEETYGPFQIRIQDMKECPEYT VRQLTIQAPEKKVDSALHPDLGSHALSPIQYQEERRSVKHILFSAWPDHQTPESAGPLLR LVAEVEESPETAAHPGPIVVHCSAGIGRTGCFIATRIGCQQLKARGEVDILGIVCQLRLD RGGMIQTAEQYQFLHHTLALSTSGLNATPQDSSSLNLRLAAHSEMGMKRLSTPAAGQPPP LKPADSLLTPPADSLLTPPAGAAPGPPAPAREERALGCEQDTARPRGRAAAAAEEEGAEA TPIGEPIPGQRVVGSRARASARTEPRDPGRTGEGPLRAAARGQRGAAGTWHRAGPAAATM IALFNKLLDWFKALFWKEEMELTLVGLQYSGKTTFVNVIASGQFNEDMIPTVGFNMRKIT KGNVTIKLWDIGGQPRFRSMWERYCRGVSAIVYMVDAADQEKIEASKNELHNLLDKPQLQ GIPVLVLGNKRDLPGALDEKELIEKMNLSAIQDREICCYSISCKEKDNIGAFQDLGGGSL ERRRETQVDLGQVQIRRGAGWHLRQEKGAKLSLSPGSGDLDSPSWALQPLTFIGAQGFLL EAVIRPVRAVPCLERQFPTARELEERMWMEFGI >gi568815597f:202093990_202419204|GENSCAN_predicted_CDS_2|2442_bp atgacccagcctccgcctgaaaaaacgccagccaagaagcatgtgcgactgcaggagagg cggggctccaatgtggctctgatgctggacgttcggtccctgggggccgtagaacccatc tgctctgtgaacacaccccgggaggtcaccctacactttctgcgcactgctggacacccc cttacccgctgggcccttcagcgccagccacccagccccaagcaactggaagaagaattc ttgaagatcccttcaaactttgtcagccccgaagacctggacatccctggccacgcctcc aaggaccgatacaagaccatcttgccaaatccccagagccgtgtctgtctaggccgggca cagagccaggaggacggagattacatcaatgccaactacatccgagtgagtggcctggct gcagtggtgggcggggaatgggggaattctcagacgtctgagaatatcttccagccatgg ctggtggatcagggctatgacgggaaggagaaggtctacattgccacccagggccccatg cccaacactgtgtcggacttctgggagatggtgtggcaagaggaagtgtccctcattgtc 
atgctcactcagctccgagagggcaaggagaaatgtgtccactactggcccacagaagag gaaacctatggacccttccagatccgcatccaggacatgaaagagtgcccagaatacact gtgcggcagctcaccatccaggcaccagagaagaaggtagacagtgccctccaccctgac cttggttctcatgccttgtctccaatccagtaccaggaagagcgccggtcagtaaagcac atcctcttttcggcctggccagaccatcagacaccagaatcagctgggcccctgctgcgc ctagtggcagaggtggaggagagcccggagacagccgcccaccccgggcctatcgtagtc cactgcagtgcagggattggccggacgggctgcttcatcgccacgcgaattggctgtcaa cagctgaaagcccgaggagaagtggacattctgggtattgtgtgccaactgcggctagac agaggggggatgatccagacggcagagcagtaccagttcctgcaccacactttggccctg tctacctcaggactgaacgccacacctcaggattcctcctccttgaatctgagactggct gcccattctgagatggggatgaagcgcctctcgacgccagctgccggtcaacccccgccg ctgaagcctgcggattccctgctcaccccacctgcggattccctgctcaccccacctgcg ggtgcggcccctggccccccggctccagcgagggaggagcgcgcgctcgggtgcgagcag gacacggcccggccgcgagggagggcggcggcggcggcggaggaggagggcgcggaagcc acacccatcggcgagccgattccggggcagcgagtcgtcggcagccgcgctcgagcctcc gcccgcaccgagccgcgggacccgggccgtaccggggaggggccgctccgggccgcagcg cgagggcagcgaggggcggcggggacctggcaccgggcggggccggcggcagcgaccatg atcgctttgttcaacaagctgctggactggttcaaggccctattctggaaggaggagatg gagctcacgctggtcgggcttcagtactcgggcaagaccaccttcgtcaacgtgatcgcg tcaggacagttcaacgaggacatgatccccaccgtgggtttcaacatgcgcaaaatcacc aaagggaatgtgactatcaagctctgggacattgggggacagccgcgtttccgcagcatg tgggagcgctactgccgaggagtgagcgccatcgtgtacatggtggatgctgctgaccag gagaagattgaggcctctaagaacgagctccacaacctactggacaaacctcagctgcag ggcatcccggtcttagtcctgggtaacaagcgagaccttccgggagcattggatgagaag gagctgattgagaaaatgaatctgtctgccatccaggaccgagagatctgctgctactcc atctcttgcaaagaaaaggacaacattggggcctttcaggatctgggagggggcagtctg gagagaaggagggagacgcaggtggacttggggcaagttcagatcagaagaggtgcaggc tggcacctgcggcaggaaaaaggagccaaactgtcccttagtcctgggagtggggacctt gactccccgtcctgggccctccagcccctcaccttcattggtgcccagggcttcctcctg gaggcagtcatcagacctgtcagagcagttccctgcctggagaggcagttccccactgcc agagagttggaggaaagaatgtggatggaatttggcatctga >gi568815597f:202093990_202419204|GENSCAN_predicted_peptide_3|1324_aa 
MRLPILFAALLWFRGFLAEEEACLSLEGSPGRESAGPPLNVNITSQGRPTSLFLSWAAPG PGRFTHALRLTCLSPLSSPEGQQLQAHTNASSFKFQDLVSGGRYQLEVTALRPCGQNVTI TLTARTGPPYELTLSAAARPHRAVGPNATEWTYTSAPPRPGLTPLPAKLWASWKVGPGVQ DDFLLKLSGPVEKNITLGPEAHNVTFPGPLPTGHYALELKVLAGPYDAWAQASAWLDDSA AKSRQGSGAKRQLDGLEASKEPGRRALLYTEGNPGLLGNISVPPGATHITFYGPVPGARY CVDIASSLGIITYSLMGHKSPLAPQSLEVISRGGPSDLAIVWAPAPGQREGYRVAWHQEG SQRSPGSLVDLGPDNSSLTLRSLVPGSSYAMSVWAWAENLGSSIQKIHPCTCPLAPPLVN VTSEGPTQLWASWVHAPRGRDSYPVTLYRAGTSAVGAKVASTSFSSLTPGTKYKVEVVTQ AGPHHIAAANTSGWTHEAWGRQRCRRALHTPSELVSMHASTAVVNLAWASSPLGQGMCYT QLSEAGHLSWEHPLVPGQAHLILRGLTPGCNLSLSVLCQAGPLQASTQRVVLLVEPGPVE DVQCQPEATFLALNWTVPARDVGTCLVVAEQLVAGGNAHLVFQADTSKNAVLLPNLVPVT SYHLSLAVLGRNGLWSRVVTLACSTSAEVPQPSWEAINHMWHDHYYRGHDSYLAILLPNP FYPDPWAVPRSWTVPVGTEDCGHTKEICNGQLKLEPWAGVSLASVPLPVMEGLVVGCVLT ICAVLGLLCWRRVKGQRAGKNPFSQELTAYNLRPGWALEELAGNARCSTPRVEFRICVSN RPPGDQELKEVGKEQPRLEAEYAANTTKNHYPHVLPYDHSRVRLTQLEGEPHSDYINANF IPVLCEHYWLTDSTPVTHDHITIHLLAEEADDEWTKREFQLQHMRAPRMRGVGMGQTGTF VALLRLLQQLEEEQMVDVFHAVFAFWMHGPLMIQTLSQYVFLHSCLLNKILEGPFNISES WPISVMNFAQACAKRAANANAGFLKEYELLLQAIKDEAGSYAPLPGYEQDSPISWPEELW ELVWQHGAHVLVSLCPLDAMEKVSCSKGATQLGTFLAMEQLLQQAGSECTVDVFNVALQQ SQACDLMTPTLKQYIYLYNCLNSALADGLPLSRHWSLCRRGLWGRPRAPGAVPTDGAARR DREEAAAAIAPCPSSPTAEMPSPPGLRALWLCAALCASRRAGGAPQPGPGPTACPAPCHC QEDGIMLSADCSELGLSAVPGDLDPLTAYLLGCPPPLQKAQAVGQTWENRSQEEKNTREA KLGQ >gi568815597f:202093990_202419204|GENSCAN_predicted_CDS_3|3975_bp atgaggctcccaatcctgttcgctgccctgctctggttccggggttttctggcagaggag gaagcatgcctctccctggaagggagtccaggcagggagagtgcaggcccacccctgaac gtgaacatcaccagccaggggagacctactagcctctttctgagctgggcagccccgggg ccaggcaggttcacccatgccctccgcctcacatgtctgagccccctcagctctcctgaa gggcagcagctccaggcccacaccaatgcatccagctttaagttccaagatctggtgtca gggggtcgctaccagctggaagtgactgccctgcgaccctgtgggcagaatgtcaccatc accctcactgctcgcactggccccccctatgagctgacgctcagtgctgctgccaggccc catcgggcagtggggcccaatgccacagagtggacctatacctctgccccccctcgacct ggtttgactcccctgcccgccaagctctgggcaagctggaaggtagggccaggtgtgcag 
gatgacttcctgctgaagttaagtgggccagtggagaagaatatcactctaggccctgag gcccacaatgtcacattcccagggcccctgcccactgggcactatgctctggagctgaag gtcctagcggggccatatgacgcctgggcccaggccagtgcctggctggacgattccgca gccaagtccagacaaggcagtggtgccaagcggcagctggatgggctggaggcctccaag gagcccgggagacgggccctgctctacacagagggaaacccgggcctccttggaaacatc tctgtgccacctggtgccacccacatcaccttctatgggccagtgcctggggcccgctac tgtgtggacattgcctcatctctgggaatcatcacttacagcctcatgggccacaaaagt cccctggcaccacagtccctggaggttatcagcaggggtggcccctctgacctggccatt gtctgggccccagcaccaggacagcgggaaggctacagggtcgcttggcaccaggagggc agccagaggtcaccgggcagtcttgttgacttgggcccggacaattccagcctgactctg aggagtctggtgcccggctcctcctatgccatgtcagtgtgggcctgggcagagaacctt ggctctagcatccagaagatccacccctgtacttgcccgcttgcccctcctctggtaaat gtgacgagtgaaggtcccacccagctctgggcatcctgggtccatgcccccaggggccga gacagctacccggtgaccctgtaccgggcaggcaccagcgccgtcggagccaaggtggcc agcacaagcttttcaagtctgactccaggcacgaagtacaaggtggaggttgtcacgcag gctgggccccaccacattgcagcagccaacacctctggctggacccatgaggcatgggga aggcagcgatgcaggagagccctgcacacacccagtgagttggtgtccatgcatgcgagc accgctgtggtcaacctggcctgggccagcagccccttggggcaggggatgtgctacacc caactctcagaggcggggcacctctcctgggagcaccctctggtgccaggccaagcccac ctcatcctgaggggcctcacacctggatgcaacctctccctgtcagtgctgtgccaggca gggccgctgcaggcgtccactcagcgcgtggtactgcttgttgagcctggccctgtggaa gatgtgcagtgccagcctgaggccaccttcctggccctgaactggacagtgcccgccagg gatgtgggcacctgtctggtggtggcagagcagctggtggcaggagggaatgctcacctt gtgttccaggccgacacctccaaaaatgcagtcctgttgcccaacctggtgcctgtcact tcctatcacctcagcctcgctgtgctgggcaggaacggtctgtggagtcgggtggtcact ctggcatgttccacatctgccgaggtgcctcagccttcctgggaagccatcaaccacatg tggcatgaccactactacagaggacatgactcctacctggccatcctgctccccaacccc ttctacccggatccctgggctgtgccgagatcctggacagtgcctgtgggtacagaggac tgtggccacaccaaagagatatgcaacgggcagctcaagctagagccctgggccggtgtc tccctggcatcagtgcccctgccggtaatggagggcctcgtggtgggctgtgtcctcacc atctgtgctgtgctgggcctgctgtgctggaggcgggtgaaggggcagagggcagggaag aatccattttcccaagagctgacagcttacaacctgcgcccaggctgggcgttggaggag 
cttgctggaaatgcacgctgctccacacccagggtggaattcaggatctgcgtttccaac aggcccccaggtgatcaggagctgaaggaggtgggcaaggagcagcccagactggaggct gagtacgctgccaacaccaccaagaaccattacccacatgtgcttccctacgaccactcc agggtcaggctgacccagctggagggagagcctcattctgactacatcaatgccaacttc atcccagtgctgtgtgagcattactggctgaccgactctaccccggtcacccatgatcac atcaccatccacctcctagccgaggaggctgacgatgagtggaccaagcgggaattccag ctgcagcacatgcgtgccccaaggatgaggggtgtgggcatgggccagacgggcaccttc gtggccctgttgaggctgctgcagcagctggaggaggagcagatggtagatgtgttccat gctgtgtttgcattctggatgcacgggcccctcatgatccagaccctgagccagtacgtc ttcctgcacagctgcctactgaacaagattctggaagggcccttcaacatctctgagtct tggcccatctctgtgatgaacttcgcacaggcgtgtgccaagagggcagccaatgccaac gctggcttcttgaaggagtacgagctcttgctgcaggccatcaaggacgaggctggctct tacgcacccctgcctggctatgagcaggacagccccatctcctggccagaggagctctgg gagctggtgtggcagcacggggctcatgtgcttgtctctctgtgcccactcgatgccatg gagaaggtcagttgcagcaagggtgcgacccagctgggcaccttcctggccatggagcag ctgctgcagcaggcagggtctgagtgcaccgtggatgtctttaacgtggccctgcagcag tctcaggcctgtgaccttatgaccccaacgctgaagcagtatatctacctctacaattgt ctgaacagcgcactggcagacgggctgcccctgagtcggcactggtcactgtgcaggaga gggctctggggccggcccagagcccctggggcggtccccaccgacggtgcagcccgccgg gaccgggaggaagcagctgcggccatcgcgccgtgccccagtagcccgaccgccgagatg cccagcccgccggggctccgggcgctatggctttgcgccgcgctgtgcgcttcccggagg gccggcggcgccccccagcccggcccggggcccaccgcctgcccggccccctgccactgc caggaggacggcatcatgctgtctgccgactgctctgagctcgggctgtccgccgttccg ggggacctggaccccctgacggcttacctgttgggatgtccccctccactccaaaaggca caggctgtgggccagacatgggagaacaggtcccaggaggagaaaaacacccgagaggcc aaacttgggcagtga >gi568815597f:202093990_202419204|GENSCAN_predicted_peptide_4|481_aa MTRECQNVFLLGALKPSIKECVGEMEAPEDIQLPEPVNGTLFRKGAFANVLRILGDHRET QRERGESYVKAEGRDWSDAATRNASSCRKLEEAKKHPPQHPQSAQETQENDSLRKISKFM HNWLLLLESPRAPTWALLKICFREALPHPVSITTTNDHCRSSNKRPSRQAQGYLRTPARQ QKQEKSRPEDTYPGWKTPTPEDGERGHPGIRGDLESFEDSTRKRRESPEAQSPEQEKELG KDIGGKPSRPLDPEMRRRAHREFMEELGKQEGRKLLTALGTRPALILMQKPEGAWSKDTR ETMAEKTGGRRLLMPLHRPHPSRGPGGYIWSLHKCSGAMNNLLNCTKETCKSGLLQEAQV 
VEEARLKLCEVVHAEVPGGEKMEKGAVNSPLGPSGTVQNTSGLAGLEMGSAEAVGGREVK GTQASQAQPEPQCDLRTAQEYDRAEPLERNENMGDCFWAPSALYYLSERQRKTYTVQRSG H >gi568815597f:202093990_202419204|GENSCAN_predicted_CDS_4|1446_bp atgactcgagagtgtcagaatgtcttcctgctgggtgcgttgaagccctcaattaaggag tgtgtgggtgaaatggaggccccagaagatatacaacttccagaacctgtaaatgggaca ttatttagaaaaggggcctttgcaaatgtattaaggatcttgggagaccacagagagaca cagagagaaagaggagaaagctacgtgaaggcagaaggcagagactggagtgatgcagcc acaaggaacgccagcagctgccggaagctggaagaggcaaagaagcacccaccccagcac cctcagagtgctcaggagacccaagaaaatgactccttgagaaaaatctccaaattcatg cacaactggctccttctcttggaatcacccagggctcccacttgggccctcttgaagatt tgcttccgggaggctctcccacatccagtcagcattactactactaatgatcattgtaga agtagcaacaaaaggcccagtcgccaggcgcagggctacttgcgtacgccagcaagacag cagaagcaggaaaagagccggccggaagacacctaccctggttggaagacacctacccct gaagatggagaaagaggccatccgggcatcagaggggacttagaaagttttgaggatagc actcgaaagcgcagggagtcccctgaggcacagagccctgagcaagagaaggaactgggt aaagacataggaggaaaaccctccaggcctttagatccagaaatgagaaggagagcacac agggagttcatggaagagctggggaagcaagaagggaggaagttgctgactgccctgggc acaaggcctgccttgattctcatgcagaagcctgagggtgcttggagcaaggatacaaga gaaacgatggctgagaaaacagggggtcgcaggctgctcatgcctcttcacaggccccac ccctccaggggcccaggaggctacatctggtcactgcacaaatgctctggtgcaatgaac aacctgctcaactgtacaaaggagacctgcaagtcagggctcctccaagaagcgcaggtg gtggaagaggccaggctgaagctctgtgaggttgttcatgctgaggtccctggtggagag aagatggaaaaaggagctgtgaactccccacttgggccatctggcactgtccagaacacc tctgggcttgccggtctagaaatgggatccgcagaagctgtaggtggcagagaagttaaa gggacccaggcctcccaggcccagccggagccacagtgtgacctgagaacagcccaggag tatgacagagcagagcctctggagaggaatgaaaatatgggggactgcttctgggccccc tctgctctctactacctctctgagagacaacgcaagacctacactgtccagcgcagtggc cactag >gi568815597f:202093990_202419204|GENSCAN_predicted_peptide_5|1008_aa MGLSHQEYSGSDSPELSLLGSSGLVCSLGEYEKQFGPRQVKLFPQSLSKPELACEVPANL PHYCRRLDANLISLVPERSFEGLSSLRHLWLDDNALTEIPVRALNNLPALQAMTLALNRI SHIPDYAFQNLTSLVVLHLHNNRIQHLGTHSFEGLHNLETLHLPSGDIRAFRAVFGVAMD EQGRRESNAMNPGVWALLALLGNQQREDQTPAAQAAVWLELWFGSSADSSKWGKRPPAIP 
PGEEGPGPSLWERLSGHILLWSGTPQSLQVACINGPQTSRDLNYNKLQEFPVAIRTLGRL QELGFHNNNIKAIPEKAFMGNPLLQTIHFYDNPIQFVGRSAFQYLPKLHTLSLNGAMDIQ EFPDLKGTTSLEILTLTRAGIRLLPSGMCQQLPRLRVLELSHNQIEELPSLHRCQKLEEI GLQHNRIWEIGADTFSQLSSLQALDLSWNAIRSIHPEAFSTLHSLVKLDLTDNQLTTLPL AGLGGLMHLKLKGNLALSQAFSKDSFPKLRILEVPYAYQCCPYGMCASFFKASGQWEAED LHLDDEESSKRPLGLLARQAENHYDQDLDELQLEMEDSKPHPSVQCSPTPGPFKPCEYLF ESWGIRLAVWAIVLLSVLCNGLVLLTVFAGGPVPLPPVKFVVGAIAGANTLTGISCGLLA SVDALTFGQFSEYGARWETGLGCRATGFLAVLGSEASVLLLTLAAVQCSVSVSCVRAYGK SPSLGSVRAGVLGCLALAGLAAALPLASVGEYGASPLCLPYAPPEGQPAALGFTVALVMM NSFCFLVVAGAYIKLYCDLPRGDFEAVWDCAMVRHVAWLIFADGLLYCPVAFLSFASMLG LFPVTPEAVKSVLLVVLPLPACLNPLLYLLFNPHFRDDLRRLRPRAGDSGPLAYAAAGEL EKSSCDSTQALVAFSDVDLILEASEAGRPPGLETYGFPSVTLISCQQPGAPRLEGSHCVE PEGNHFGNPQPSMDGELLLRAEGSTPAGGGLSGGGGFQPSGLAFASHV >gi568815597f:202093990_202419204|GENSCAN_predicted_CDS_5|3027_bp atgggactgtcccaccaggagtactcaggctccgactccccagagttgtcacttttgggc tcttcagggcttgtgtgctctctgggggagtatgagaagcagtttggccctcgacaggtt aagctgtttccccagagcctaagcaagcccgaactggcctgtgaggtccctgctaatctg ccccattactgcaggcgcctagatgccaacctcatctccctggtcccggagaggagcttt gaggggctgtcctccctccgccacctctggctggacgacaatgcactcacggagatccct gtcagggccctcaacaacctccctgccctgcaggccatgaccctggccctcaaccgcatc agccacatccccgactacgcgttccagaatctcaccagccttgtggtgctgcatttgcat aacaaccgcatccagcatctggggacccacagcttcgaggggctgcacaatctggagaca ctccaccttccctcaggagatataagagcttttagggcagtgtttggagttgccatggat gagcagggcaggagggagtccaatgccatgaaccctggagtgtgggcgctgttagctttg ctgggaaatcagcagagggaagatcaaacccctgcggcccaagctgcggtttggctggag ctgtggtttggctcctctgcagactcctctaagtggggaaagcggcccccagccatacca cctggggaggaggggccaggtccaagtctctgggaaagattatcgggccacattctgctg tggtcaggaacacctcagagccttcaggtcgcctgcattaacgggccccaaacatccaga gacctgaattataacaagctgcaggagttccctgtggccatccggaccctgggcagactg caggaactggggttccataacaacaacatcaaggccatcccagaaaaggccttcatgggg aaccctctgctacagacgatacacttttatgataacccaatccagtttgtgggaagatcg gcattccagtacctgcctaaactccacacactatctctgaatggtgccatggacatccag 
gagtttccagatctcaaaggcaccaccagcctggagatcctgaccctgacccgcgcaggc atccggctgctcccatcggggatgtgccaacagctgcccaggctccgagtcctggaactg tctcacaatcaaattgaggagctgcccagcctgcacaggtgtcagaaattggaggaaatc ggcctccaacacaaccgcatctgggaaattggagctgacaccttcagccagctgagctcc ctgcaagccctggatcttagctggaacgccatccggtccatccaccccgaggccttctcc accctgcactccctggtcaagctggacctgacagacaaccagctgaccacactgcccctg gctggacttgggggcttgatgcatctgaagctcaaagggaaccttgctctctcccaggcc ttctccaaggacagtttcccaaaactgaggatcctggaggtgccttatgcctaccagtgc tgtccctatgggatgtgtgccagcttcttcaaggcctctgggcagtgggaggctgaagac cttcaccttgatgatgaggagtcttcaaaaaggcccctgggcctccttgccagacaagca gagaaccactatgaccaggacctggatgagctccagctggagatggaggactcaaagcca caccccagtgtccagtgtagccctactccaggccccttcaagccctgtgagtacctcttt gaaagctggggcatccgcctggccgtgtgggccatcgtgttgctctccgtgctctgcaat ggactggtgctgctgaccgtgttcgctggcgggcctgtccccctgcccccggtcaagttt gtggtaggtgcgattgcaggcgccaacaccttgactggcatttcctgtggccttctagcc tcagtcgatgccctgacctttggtcagttctctgagtacggagcccgctgggagacgggg ctaggctgccgggccactggcttcctggcagtacttgggtcggaggcatcggtgctgctg ctcactctggccgcagtgcagtgcagcgtctccgtctcctgtgtccgggcctatgggaag tccccctccctgggcagcgttcgagcaggggtcctaggctgcctggcactggcagggctg gccgccgcgctgcccctggcctcagtgggagaatacggggcctccccactctgcctgccc tacgcgccacctgagggtcagccagcagccctgggcttcaccgtggccctggtgatgatg aactccttctgtttcctggtcgtggccggtgcctacatcaaactgtactgtgacctgccg cggggcgactttgaggccgtgtgggactgcgccatggtgaggcacgtggcctggctcatc ttcgcagacgggctcctctactgtcccgtggccttcctcagctttgcctccatgctgggc ctcttccctgtcacgcccgaggccgtcaagtctgtcctgctggtggtgctgcccctgcct gcctgcctcaacccactgctgtacctgctcttcaacccccacttccgggatgaccttcgg cggcttcggccccgcgcaggggactcagggcccctagcctatgctgcggccggggagctg gagaagagctcctgtgattctacccaggccctggtagccttctctgatgtggatctcatt ctggaagcttctgaagctgggcggccccctgggctggagacctatggcttcccctcagtg accctcatctcctgtcagcagccaggggcccccaggctggagggcagccattgtgtagag ccagaggggaaccactttgggaacccccaaccctccatggatggagaactgctgctgagg gcagagggatctacgccagcaggtggaggcttgtcagggggtggcggctttcagccctct ggcttggcctttgcttcacacgtgtaa 
>gi568815597f:202093990_202419204|GENSCAN_predicted_peptide_6|112_aa MHGLNRNFNEQKYVIRQKYCQDELTGRDCGPGWKILSRLAGNLRQENAEEVKIMKREDEK EQTNLIYFITKPEIHRLSPEPLDYTGSFGIAELSSVSFGRNLREHSSHACIL >gi568815597f:202093990_202419204|GENSCAN_predicted_CDS_6|339_bp atgcatggcctgaataggaatttcaatgagcaaaagtatgtgataagacagaagtactgt caggatgagttaacaggtagagactgtgggcctgggtggaagatccttagcaggttagca gggaacctacggcaagagaatgctgaagaagtaaagataatgaaaagggaagatgagaag gaacagaccaacctaatttacttcatcactaaaccggagatccacaggctctcccctgag cccctggattacacaggaagtttcggaattgcagagctgtcgagtgtcagctttgggaga aatctcagagagcattcaagccatgcctgcattttatag >gi568815597f:202093990_202419204|GENSCAN_predicted_peptide_7|150_aa MQRASRLKRELHMLATEPPPGITCWQDKDQMDDLRAQILGGANTPYEKGVFKLEVIIPER YPFEPPQIRFLTPIYHPNIDSAGRICLDVLKLPPKGAWRPSLNIATVLTSIQLLMSEPNP DDPLMADIVNIPIFLCNTYPTLSIPLTLPV >gi568815597f:202093990_202419204|GENSCAN_predicted_CDS_7|453_bp atgcagagagcttcacgtctgaagagagagctgcacatgttagccacagagccaccccca ggcatcacatgttggcaagataaagaccaaatggatgacctgcgagctcaaatattaggt ggagccaacacaccttatgagaaaggtgtttttaagctagaagttatcattcctgagagg tacccatttgaacctcctcagatccgatttctcactccaatttatcatccaaacattgat tctgctggaaggatttgtctggatgttctcaaattgccaccaaaaggtgcttggagacca tccctcaacatcgcaactgtgttgacctctattcagctgctcatgtcagaacccaaccct gatgacccgctcatggctgacatagtaaatatccccattttcctctgcaacacatatcct accttgtctataccgctaactctccctgtgtga >gi568815597f:202093990_202419204|GENSCAN_predicted_peptide_8|246_aa MFPYGHQVHLGQASYYESRDDPRPGSACVLKPRERAGGTTGGRSKDGGARVSALCSGLKR SERGGSGNSSPNSNLVLVVLAAAAEAGAMAELEHLGGKRAESARMRRAEQLRRWRGSLTE QEPAERRGAGRQPLTRRGSPRVRFEDGAVFLAACSSGDTDEVRKLLARGADINTVNVDGL TALHQILFIAAVSVFSFFFITQACIDENLDMVKFLVENRANVNQQDNEGWTPLHAAASCG YLNIAE >gi568815597f:202093990_202419204|GENSCAN_predicted_CDS_8|741_bp atgttcccttacggccaccaagtgcacttagggcaggcctcttactacgagtcccgtgat gacccgaggccgggcagcgcctgcgtattgaagccgagggagcgtgcgggcggtactact ggcgggaggagtaaagatggcggcgcgagggtctccgccctctgctccgggctgaagcgc tctgagagaggcggcagcggcaactcgagccccaacagtaatttagtgttggtagttttg 
gcagcagctgccgaggccggagcaatggcggaactggagcacctaggagggaagcgggca gagtcggcgcgaatgcggcgggcagagcagcttcggcgctggcggggctcgctgacagag caggagcctgcggagcgacgaggcgcggggcggcagccgctgaccaggcgcgggagcccc agggtccgcttcgaggacggtgctgtctttctggccgcctgctctagcggggacaccgac gaggtgagaaagcttctggcaagaggtgctgatatcaacacggtcaacgtggacggcttg acagccctgcaccagattctcttcatcgctgctgtttcagtgttctctttcttctttata actcaggcatgtattgatgaaaatttggacatggtgaagtttctggtggagaacagagcc aatgtaaaccagcaagacaacgagggctggacaccccttcatgcagcagcttcctgtggc tatctcaacatagcagagtga