GENSCAN 1.0	Date run: 2-Nov-116	Time: 17:54:30

Sequence gi568815579f:48370352_48579207 : 208856 bp : 53.45% C+G : Isochore 3 (51 - 57 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   3166   3403  238  0  1   81   93   456 0.999  43.12
 1.02 Intr +   5262   5401  140  1  2   80   68   154 0.839  13.29
 1.03 Term +   5734   5967  234  0  0  111   36   193 0.994  13.05
 1.04 PlyA +   5985   5990    6                                1.05

 2.06 PlyA -   7785   7780    6                                1.05
 2.05 Term -  12976  12942   35  2  2  137   41    22 0.956   0.43
 2.04 Intr -  14131  13879  253  1  1  125   65   505 0.986  49.44
 2.03 Intr -  19360  19202  159  1  0  116   80   166 0.638  19.30
 2.02 Intr -  20173  20073  101  2  2  106   75   154 0.999  16.23
 2.01 Init -  21007  20917   91  1  1   85   97   132 0.653  12.44
 2.00 Prom -  22514  22475   40                              -13.92

 3.00 Prom +  22667  22706   40                               -2.51
 3.01 Init +  28042  28506  465  0  0   60   91   936 0.446  84.78
 3.02 Intr +  34383  34782  400  2  1  114   93   572 0.913  54.95
 3.03 Intr +  34837  35002  166  2  1   81   95   172 0.548  16.73
 3.04 Intr +  43640  43754  115  2  1  106   89   187 0.985  21.55
 3.05 Intr +  44022  44233  212  2  2   65   61   466 0.999  39.84
 3.06 Intr +  44513  44681  169  2  1   84   78   394 0.999  38.46
 3.07 Intr +  45651  45804  154  2  1  106   14   359 0.794  30.46
 3.08 Intr +  48883  49008  126  2  0  107   75   227 0.997  24.46
 3.09 Intr +  49234  49463  230  2  2   82   93   458 0.998  43.82
 3.10 Intr +  51434  51594  161  1  2   66   44   340 0.501  26.70
 3.11 Intr +  53141  53341  201  2  0  -11   78   134 0.392   1.42
 3.12 Intr +  71418  71605  188  0  2  130   50   523 0.519  52.55
 3.13 Intr +  71799  72031  233  1  2   75   51   473 0.967  40.32
 3.14 Term +  72249  73586 1338  2  0   79   48  1071 0.972  94.31
 3.15 PlyA +  74554  74559    6                                1.05

 4.00 Prom +  75194  75233   40                               -9.07
 4.01 Init +  75655  75841  187  0  1   90   58   277 0.973  24.10
 4.02 Intr +  76034  76151  118  0  1  116  110   -10 0.999   4.03
 4.03 Intr +  76330  76492  163  1  1   47  114   309 0.999  29.89
 4.04 Intr +  78629  78739  111  1  0   61   55    56 0.615   0.68
 4.05 Intr +  79962  80175  214  2  1  116  117   271 0.999  31.61
 4.06 Intr +  80315  80457  143  0  2   81   89   199 0.999  19.88
 4.07 Intr +  80683  80880  198  0  0   97   63   324 0.999  30.87
 4.08 Term +  82357  82674  318  0  0   98   54   539 0.999  46.73
 4.09 PlyA +  83526  83531    6                                1.05

 5.00 Prom +  86298  86337   40                               -4.91
 5.01 Init +  91374  92087  714  2  0   36   85  1390 0.968 126.48
 5.02 Term +  93830  94426  597  1  0   81   44   751 0.999  64.94
 5.03 PlyA +  94972  94977    6                               -0.45

 6.00 Prom +  98140  98179   40                               -6.20
 6.01 Init +  99157  99175   19  0  1  120   78    40 0.917   6.35
 6.02 Intr + 100002 100149  148  1  1   71   94   334 0.986  32.10
 6.03 Intr + 100252 100318   67  1  1  109   44    98 0.906   6.90
 6.04 Intr + 101974 102092  119  0  2   74   80   247 0.999  22.27
 6.05 Intr + 102947 103027   81  2  0  101  109   123 0.999  14.95
 6.06 Intr + 103554 103666  113  0  2   70   80   115 0.999   9.43
 6.07 Intr + 103831 103979  149  2  2   84   56   357 0.999  32.56
 6.08 Intr + 104451 104598  148  2  1   28   79   275 0.900  20.92
 6.09 Intr + 107718 107794   77  1  2  121  101   143 0.978  18.63
 6.10 Intr + 107924 107995   72  1  0   93  111   156 0.999  18.50
 6.11 Intr + 108087 108241  155  2  2  118   55   221 0.952  21.18
 6.12 Term + 108772 108859   88  1  1  120   46   156 0.958  12.03
 6.13 PlyA + 111930 111935    6                               -0.45

 7.16 PlyA - 114535 114530    6                               -0.45
 7.15 Term - 115438 115422   17  2  2  114   37    15 0.131  -2.32
 7.14 Intr - 120894 120757  138  1  0  111   81    38 0.606   6.34
 7.13 Intr - 121188 121053  136  0  1  103   73   192 0.999  19.85
 7.12 Intr - 123758 123343  416  0  2   89   80   561 0.999  49.80
 7.11 Intr - 129566 127042 2525  1  2  136  100  1782 0.978 172.60
 7.10 Intr - 130794 130645  150  2  0  147   81   260 0.999  30.99
 7.09 Intr - 131053 130932  122  1  2  117   81   189 0.999  20.90
 7.08 Intr - 131211 131127   85  2  1  146   97   217 0.999  28.72
 7.07 Intr - 132230 132082  149  2  2  109   80   252 0.999  26.04
 7.06 Intr - 132645 132558   88  2  1   96   81    88 0.977   9.37
 7.05 Intr - 138618 138500  119  0  2   64   84   191 0.492  16.07
 7.04 Intr - 139162 139086   77  2  2  111  113    69 0.999  11.43
 7.03 Intr - 139822 139672  151  1  1  118   84   159 0.999  18.75
 7.02 Intr - 140241 140108  134  1  2   50  100   145 0.996  12.77
 7.01 Init - 141225 141150   76  0  1   82  117   167 0.888  18.21
 7.00 Prom - 142524 142485   40                               -4.81

 8.04 PlyA - 142865 142860    6                                1.05
 8.03 Term - 144183 144020  164  1  2   97   41    41 0.198  -1.29
 8.02 Intr - 147837 147712  126  1  0   87   49    93 0.037   6.46
 8.01 Init - 170191 170152   40  1  1   95   75    49 0.483   4.59
 8.00 Prom - 170686 170647   40                               -2.61

 9.00 Prom + 176586 176625   40                               -5.21
 9.01 Init + 181902 181972   71  2  2   85   94   127 0.913  13.47
 9.02 Intr + 192953 193048   96  2  0   61   22   100 0.019   0.32
 9.03 Intr + 205590 205732  143  0  2   82   82   246 0.285  23.91

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815579f:48370352_48579207|GENSCAN_predicted_peptide_1|203_aa VFSLIVFSSLLTDGYQNKMESPQLHCILNSNSVACSFAVGAGFLAFLSCLAFLVLDTQET RIAGTRFKTAFQLLDFILAVLWAVVWFMGFCFLANQWQHSPPKEFLLGSSSAQAAIAFTF FSILVWIFQAYLAFQDLRNDAPVPYKRFLDEGGMVLTTLPLPSANSPVNMPTTGPNSLSY ASSALSPCLTAPKSPRLAMMPDN >gi568815579f:48370352_48579207|GENSCAN_predicted_CDS_1|612_bp gtcttctccctgatcgtcttctcctccctgctgaccgacggctaccagaacaagatggag tctccgcagctccactgcattctcaacagcaacagcgtggcctgcagctttgccgtggga gccggcttcctggccttcctcagctgcctggccttcctcgtcctggacacacaggagacc cgcattgccggcacccgcttcaagacagccttccagctcctggacttcatcctggctgtt ctctgggcagttgtctggttcatgggtttctgcttcctggccaaccaatggcagcattcg ccgcccaaagagttcctcctggggagcagcagtgcccaggcagccatcgccttcaccttc ttctccatccttgtctggatattccaggcctacctggcattccaggacctccgaaatgat gctccagtcccttacaagcgcttcctggatgagggtggcatggtgctgaccaccctcccc ttgccctctgccaacagccctgtgaacatgcccaccactggccccaacagcctgagttat gctagctctgccctgtccccctgtctgaccgctccaaagtccccccggcttgctatgatg cctgacaactaa >gi568815579f:48370352_48579207|GENSCAN_predicted_peptide_2|212_aa MNLFRFLGDLSHLLAIILLLLKIWKSRSCAGISGKSQVLFAVVFTARYLDLFTNYISLYN TCMKVVYIACSFTTVWLIYSKFKATYDGNHDTFRVEFLVVPTAILAFLVNHDFTPLEILW TFSIYLESVAILPQLFMVSKTGEAETITSHYLFALGVYRTLYLFNWIWRYHFEGFFDLIA IVAGLVQTVLYCDFFYLYITKVLKGKKLSLPA >gi568815579f:48370352_48579207|GENSCAN_predicted_CDS_2|639_bp
atgaatctcttccgattcctgggagacctctcccacctcctcgccatcatcttgctactg ctcaaaatctggaagtcccgctcgtgcgccggaatttcagggaagagccaggtcctgttt gctgtggtgttcactgcccgatatctggacctcttcaccaactacatctcactctacaac acgtgtatgaaggtggtctacatagcctgctccttcaccacggtctggttgatttatagc aagttcaaagctacttacgatgggaaccatgacacgttcagagtggagttcctggtcgtt cccacagccattctggcgttcctggtcaatcatgacttcacccctctggagatcctctgg accttctccatctacctggagtcagtggccatcttgccgcagctgttcatggtgagcaag accggcgaggcggagaccatcaccagccactacttgtttgcgctaggcgtttaccgcacg ctctatctcttcaactggatctggcgctaccatttcgagggcttcttcgacctcatcgcc attgtggcaggcctggtccagacagtcctctactgcgatttcttctacctctatatcacc aaagtcctaaaggggaagaagttgagtttgccggcatag >gi568815579f:48370352_48579207|GENSCAN_predicted_peptide_3|1385_aa MRGAGGPRGPRGPAKMLLLLALACASPFPEEAPGPGGAGGPGGGLGGARPLNVALVFSGP AYAAEAARLGPAVAAAVRSPGLDVRPVALVLNGSDPRSLVLQLCDLLSGLRVHGVVFEDD SRAPAVAPILDFLSAQTSLPIVAVHGGAALVLTPKEKGSTFLQLGSSTEQQLQVIFEVLE EYDWTSFVAVTTRAPGHRAFLSYIEVLTDGSLVGWEHRGALTLDPGAGEAVLSAQLRSVS AQIRLLFCAREEAEPVFRAAEEAGLTGSGYVWFMVGPQLAGGGGSGAPVRSAGWRDDLAR RVAAGVAVVARGAQALLRDYGFLPELGHDCRAQNRTHRGESLHRYFMNITWDNRDYSFNE DGFLVNPSLVVISLTRDRTWEVVGSWEQQTLRLKYPLWSRYGRFLQPVDDTQHLTVATLE ERPFVIVEPADPISGTCIRDSVPCRSQLNRTHSPPPDAPRPEKRCCKGFCIDILKRLAHT IGFSYDLYLVTNGKHGKKIDGVWNGMIGEVFYQRADMAIGSLTINEERSEIVDFSVPFVE TGISVMVARSNGTVSPSAFLEPYSPAVWVMMFVMCLTVVAVTVFIFEYLSPVGYNRSLAT GKRPGGSTFTIGKSIWLLWALVFNNSVPVENPRGTTSKIMVLVWAFFAVIFLASYTANLA AFMIQEEYVDTVSGLSDRKFQRPQEQYPPLKFGTVPNGSTEKNIRSNYPDMHSYMVRYNQ PRVEEALTQLKAGVFRVGFSEEVTFVQSPAGGERASQVDFRKQSLQTEGTECAKGLRKVQ CWFTGGTARRPVWLKGKELRKLDAFIYDAAVLNYMARKDEGCKLVTIGSGKVFATTGYGI ALHKGSRWKRPIDLALLQFLGDDEIEMLERLWLSGICHNDKIEVMSSKLDIDNMAGVFYM LLVAMGLSLLVFAWEHLVYWRLRHCLGPTHRMDFLLAFSRGMYSCCSAEAAPPPAKPPPP PQPLPSPAYPAPRPAPGPAPFVPRERASVDRWRRTKGAGPPGGAGLADGFHRYYGPIEPQ GLGLGLGEARAAPRGAAGRPLSPPAAQPPQKPPPSYFAIVRDKEPAEPPAGAFPGFPSPP APPAAAATAVGPPLCRLAFEDESPPAPARWPRSDPESQPLLGPGAGGAGGTGGAGGGAPA APPPCRAAPPPCPYLDLEPSPSDSEDSESLGGASLGGLEPWWFADFPYPYAERLGPPPGR YWSVDKLGGWRAGSWDYLPPRSGPAAWHCRHCASLELLPPPRHLSCSHDGLDGGWWAPPP 
PPWAAGPLPRRRARCGCPRSHPHRPRASHRTPAAAAPHHHRHRRAAGGWDLPPPAPTSRS LEDLSSCPRAAPARRLTGPSRHARRCPHAAHWGPPLPTASHRRHRGGDLGTRRGSAHFSS LESEV >gi568815579f:48370352_48579207|GENSCAN_predicted_CDS_3|4158_bp atgcgcggcgccggtggcccccgcggccctcggggccccgctaagatgctgctgctgctg gcgctggcctgcgccagcccgttcccggaggaggcgccggggccgggcggggccggtggg cccggcggcggcctcggcggggcgcggccgctcaacgtggcgctcgtgttctcggggccc gcgtacgcggccgaggcggcacgcctgggcccggccgtggcggcggcggtgcgcagcccg ggcctagacgtgcggcccgtggcgctggtgctcaacggctcggacccgcgcagcctcgtg ctgcagctctgcgacctgctgtcggggttgcgcgtgcacggcgtggtcttcgaagacgac tcgcgcgcgcccgccgtcgcgcccatcctcgacttcctgtcggcgcagacctcgctgccc atcgtggccgtgcacggcggcgccgcgctcgtgctcacgcccaaggagaagggctccacc ttcctgcagctgggctcttccaccgagcaacagcttcaggtcatctttgaggtgctggag gagtatgactggacgtcctttgtagccgtgaccactcgtgcccctggccaccgggccttc ctgtcctacattgaggtgctgactgacggtagtctggtgggctgggagcaccgcggagcg ctgacgctggaccctggggcgggcgaggccgtgctcagtgcccagctccgcagtgtcagc gcgcagatccgcctgctcttctgcgcccgagaggaggccgagcccgtgttccgcgcagct gaggaggctggcctcactggatctggctacgtctggttcatggtggggccccagctggct ggaggcgggggctctggggcccctgtgcgctcggctggctggcgggatgacctggctcgg cgagtggcagctggcgtggccgtagtggccagaggtgcccaggccctgctgcgtgattat ggtttccttcctgagctcggccacgactgtcgcgcccagaaccgcacccaccgcggcgag agtctgcataggtacttcatgaacatcacgtgggataaccgggattactccttcaatgag gacggcttcctagtgaacccctccctggtggtcatctccctcaccagagacaggacgtgg gaggtggtgggcagctgggagcagcagacgctccgcctcaagtacccgctgtggtcccgc tatggtcgcttcctgcagccagtggacgacacgcagcacctcacggtggccacgctggag gaaaggccgtttgtcatcgtggagcctgcagaccctatcagcggcacctgcatccgagac tccgtcccctgccggagccagctcaaccgaacccacagccctccaccggatgccccccgc ccggaaaagcgctgctgcaagggtttctgcatcgacattctgaagcggctggcgcatacc atcggcttcagctacgacctctacctggtcaccaatggcaagcacggaaagaagatcgat ggcgtctggaacggcatgatcggggaggtgttctaccagcgcgcagacatggccatcggc tccctcaccatcaacgaggagcgctccgagatcgtggacttctccgtccccttcgtggag accggcatcagcgtcatggtggcgcgcagcaatggcacggtgtccccctcggccttcctc gagccctacagccccgccgtgtgggtgatgatgttcgtcatgtgcctcactgtggtcgcc 
gtcactgttttcatcttcgagtacctcagtcctgttggttacaaccgcagcctggccacg ggcaagcgccctggcggttcaaccttcaccattgggaaatccatctggctgctctgggcc ctggtgttcaataattcggtgcccgtggagaacccccggggaaccaccagcaaaatcatg gtgctggtgtgggccttcttcgccgtcatcttcctcgccagctacacagccaacctggcc gccttcatgatccaggaggagtacgtggatactgtgtctgggctcagtgaccgcaagttc cagaggccccaggagcagtacccgcccctgaagtttgggaccgtgcccaacggctccacg gagaagaacatccgcagcaactatcccgacatgcacagctacatggtgcgctacaaccag ccccgcgtagaggaagcgctcactcagctcaaggcaggggtgttcagggtgggcttctcg gaggaggtgacatttgtgcagagccctgcaggaggtgaaagagctagccaggtggacttc cggaagcagagccttcagacggagggaacagagtgtgcaaagggcctgaggaaggtgcag tgttggtttactggaggaacagcaagaaggccagtgtggttgaagggcaaggagctgagg aagctggacgccttcatctacgatgctgcagtgctcaattacatggcccgcaaggacgag ggctgcaagcttgtcaccatcggctccggcaaggtcttcgccacgacaggctatggcatc gccctgcacaagggctcccgctggaagcggcccatcgacctggcgttgctgcagttcctg ggggatgatgagatcgagatgctggagcggctgtggctctctgggatctgccacaatgac aaaatcgaggtgatgagcagcaagctggacatcgacaacatggcgggcgtcttctacatg ctcctggtggccatgggcctgtccctgctggtcttcgcctgggagcacctggtgtactgg cgcctgcggcactgcctggggcccacccaccgcatggacttcctgctggccttctccagg ggcatgtacagctgctgcagcgctgaggccgccccaccgcccgccaagcccccgccgccg ccacagcccctgcccagccccgcgtaccccgcgccgcggccggctcccgggcccgcacct ttcgtgccccgcgagcgcgcctcagtggaccgctggcgccggaccaagggcgcggggccg ccggggggcgcgggcctggccgacggcttccaccgctactacggccccatcgagccgcag ggcctaggcctcggcctgggcgaagcgcgcgcggcaccgcggggcgcagccgggcgcccg ctgtccccgccggccgctcagcccccgcagaagccgccgccctcctatttcgccatcgta cgcgacaaggagccagccgagccccccgccggcgccttccccggcttcccgtcgccgccc gcgccccccgccgccgcggccaccgccgtcgggccgccactctgccgcttggccttcgag gacgagagcccgccggcgcccgcgcggtggccgcgctcggaccccgagagccaacccctg ctggggccaggcgcgggcggcgcggggggcacggggggcgcaggcggaggagccccggcc gctccgcccccgtgccgcgccgcgccgcccccgtgcccttacctcgatctcgagccgtcg ccgtcggactcggaggactcggagagcctgggcggcgcgtcgctgggcggcctggagccc tggtggttcgccgacttcccttacccgtatgccgagcgcctcgggccgccgcccggccgc tactggtcggtcgacaagctcgggggctggcgcgccgggagctgggactacctgcccccg 
cgcagcggtccggccgcctggcactgtcggcactgcgccagcctggagctgctgccgccg ccgcgccatctcagctgctcgcacgatggcctggacggcggctggtgggcgccaccgcct ccaccctgggccgccgggcccctgccccgacgccgggcccgctgcgggtgcccgcggtcg cacccgcaccgcccgcgggcctcgcaccgcacgcccgccgccgccgcgccccaccaccac aggcaccggcgcgccgctgggggctgggacctcccgccgcccgcgcccacctcgcgctcg ctcgaggacctcagctcgtgccctcgcgccgcccctgcgcgcaggcttaccgggccctcc cgccacgctcgcaggtgtccgcacgccgcgcactgggggccgccgctgcccacagcttcc caccggagacaccggggcggggacctgggcacccgcaggggctcggcgcacttctctagc ctcgagtccgaggtatga >gi568815579f:48370352_48579207|GENSCAN_predicted_peptide_4|483_aa MAARKGRRRTCETGEPMEAESGDTSSEGPAQVYLPGRGPPLREGEELVMDEEAYVLYHRA QTGAPCLSFDIVRDHLGDNRTELPLTLYLCAGTQAESAQSNRLMMLRMHNLHGTKPPPSE GSDEEEEEEDEEDEEERKPQLELAMVPHYGGINRVRRGKGRRKESCGETYGIQGKDDISS SVTAIDIMRRGLEVSWLGEEPVAGVWSEKGQVEVFALRRLLQVVEEPQALAAFLRDEQAQ MKPIFSFAGHMGEGFALDWSPRVTGRLLTGDCQKNIHLWTPTDGGSWHVDQRPFVGHTRS VEDLQWSPTENTVFASCSADASIRIWDIRAAPSKACMLTTATAHDGDVNVISWSRREPFL LSGGDDGALKIWDLRQFKSGSPVATFKQHVAPVTSVEWHPQDSGVFAASGADHQITQWDL AVERDPEAGDVEADPGLADLPQQLLFVHQGETELKELHWHPQCPGLLVSTALSGFTIFRT ISV >gi568815579f:48370352_48579207|GENSCAN_predicted_CDS_4|1452_bp atggcggcgcgcaagggtcggcggcgcacgtgtgaaaccggggaacccatggaagccgag tccggcgacacaagttccgagggcccggcccaggtctacctgcccggccgggggccgccg ctacgcgaaggggaggagctggtcatggacgaggaggcctatgtgctctaccaccgagcg cagactggcgccccctgtctcagctttgacatagtccgggatcacctgggagacaaccgg acagagcttcctcttacactttacttgtgtgctgggacccaggctgagagcgcccagagc aacagactgatgatgcttcggatgcacaatctgcatgggacaaagcccccaccctcagag ggcagtgatgaagaagaagaggaggaagatgaagaggatgaagaagagcggaaacctcag ctggagctggccatggtgccccactatggtggcatcaaccgagttcggagaggtaagggt cggaggaaagaatcctgtggggagacttacggtattcaggggaaggatgacatcagtagt agtgtgacagcaatagacatcatgagaagggggttggaagtgtcatggctgggtgaagag cctgtggctggggtgtggtcagagaagggccaggtggaggtgtttgcgctgcggcggctt ctgcaggtggtggaggagccccaggccctggcagccttcctccgggatgagcaggcccaa atgaagcccatcttctccttcgctggacacatgggcgagggctttgcccttgactggtcc ccccgggtgaccggtcgcctgctgaccggtgactgtcaaaagaacatccacctctggaca 
cctacggacggcggctcctggcacgtggaccagcggccattcgtgggccacacacgctct gtggaggacctgcagtggtcaccgactgagaacacggtgtttgcctcctgctcagctgac gcctccatccgcatctgggacatccgggcagcccccagcaaggcctgcatgctcaccaca gccaccgcccatgatggggacgtcaatgtcatcagctggagccgccgggagcccttcctg ctcagtggcggggatgatggggccctcaagatctgggaccttcggcagttcaagtctggt tccccagtggccaccttcaagcagcacgtggcccccgtgacctccgtcgagtggcacccc caggacagcggggtctttgcagcctcgggtgcagaccaccagatcacacagtgggacctg gcagtggagcgggaccctgaggcgggcgacgtggaggccgaccccggactggccgacctc ccgcagcagctgctgttcgtgcaccagggcgagaccgagctgaaggagctgcactggcac ccgcagtgcccagggctcctggtcagcacggcgctgtcaggcttcaccatcttccgcacc atcagcgtctga >gi568815579f:48370352_48579207|GENSCAN_predicted_peptide_5|436_aa MGLARALRRLSGALDSGDSRAGDEEEAGPGLCRNGWAPAPVQSPVGRRRGRFVKKDGHCN VRFVNLGGQGARYLSDLFTTCVDVRWRWMCLLFSCSFLASWLLFGLAFWLIASLHGDLAA PPPPAPCFSHVASFLAAFLFALETQTSIGYGVRSVTEECPAAVAAVVLQCIAGCVLDAFV VGAVMAKMAKPKKRNETLVFSENAVVALRDHRLCLMWRVGNLRRSHLVEAHVRAQLLQPR VTPEGEYIPLDHQDVDVGFDGGTDRIFLVSPITIVHEIDSASPLYELGRAELARADFELV VILEGMVEATAMTTQCRSSYLPGELLWGHRFEPVLFQRGSQYEVDYRHFHRTYEVPGTPV CSAKELDERAEQASHSLKSSFPGSLTAFCYENELALSCCQEEDEDDETEEGNGVETEDGA ASPRVLTPTLALTLPP >gi568815579f:48370352_48579207|GENSCAN_predicted_CDS_5|1311_bp atgggcctggccagggccctacgccgcctcagcggcgccctggattcgggagacagccgg gcgggcgatgaagaggaggccgggcccgggttgtgccgcaacgggtgggcgccggcaccg gtgcagtcacccgtgggccggcgccgcggtcgcttcgtcaagaaagacgggcactgcaac gtgcgtttcgtaaacctgggtggccagggcgcgcgctacctgagcgacctgttcaccaca tgcgtggacgtgcgctggcgctggatgtgcctgctcttctcctgctccttcctcgcctcc tggctgctcttcggcctggccttctggctcattgcctcgctgcacggcgacctggccgcc ccgccaccgcccgcgccctgcttctcacacgtggccagcttcctggccgccttcctcttc gcgctggagacgcagacgtccatcggctacggcgtgcgcagcgtcaccgaggagtgcccg gccgctgtggccgccgtggtgctgcagtgcattgccggctgcgtgctcgacgccttcgtc gtgggtgctgtcatggccaagatggccaaacccaagaagcgcaacgagacgctggtcttc agcgagaacgccgtcgtggcgctgcgcgaccaccgcctctgcctcatgtggcgcgtcggc aacctgcgccgcagccacctggtcgaggcccacgtgcgtgcccagctgctgcagccccgt gtgaccccagagggtgagtacatcccgctggaccaccaggatgtggatgtgggctttgat 
ggaggcaccgatcgtatcttcctcgtgtcccccatcaccatcgtccatgagatcgactct gccagtcctctgtatgagctaggacgtgccgagctggccagggctgactttgagctggtg gtcattctcgaggggatggttgaggccacagccatgaccacacagtgtcgctcgtcctac ctccctggtgaactgctctggggccatcgttttgagccagttctcttccagcgtggctcc cagtatgaggtcgactatcgccacttccatcgcacttatgaggtcccagggacaccggtc tgcagtgctaaggagctggatgaacgggcagagcaggcttcccacagcctcaagtctagt ttccccggctctctgactgcattttgttatgagaatgaacttgctctgagctgctgccag gaggaagatgaggacgatgagactgaggaagggaatggggtggaaacagaagatggggct gctagcccccgagttctcacaccaaccctggcgctgaccctgcctccatga >gi568815579f:48370352_48579207|GENSCAN_predicted_peptide_6|411_aa MEDGVYEPPDLTPEERMELENIRRRKQELLVEIQRLREELSEAMSEVEGLEANEGSKTLQ RNRKMAMGRKKFNMDPKKGIQFLVENELLQNTPEEIARFLYKGEGLNKTAIGDYLGEREE LNLAVLHAFVDLHEFTDLNLVQALRQFLWSFRLPGEAQKIDRMMEAFAQRYCLCNPGVFQ STDTCYVLSFAVIMLNTSLHNPNVRDKPGLERFVAMNRGINEGGDLPEELLRGARMANAA FTPQNLYDSIRNEPFKIPEDDGNDLTHTFFNPDREGWLLKLGGRVKTWKRRWFILTDNCL YYFEYTTDKEPRGIIPLENLSIREVDDPRKPNCFELYIPNNKGQLIKACKTEADGRVVEG NHMVYRISAPTQEEKDEWIKSIQAAVSVDPFYEMLAARKKRISVKKKQEQP >gi568815579f:48370352_48579207|GENSCAN_predicted_CDS_6|1236_bp atggaggacggcgtctatgaacccccagacctgactccggaggagcggatggagctggag aacatccggcggcggaagcaggagctgctggtggagattcagcgcctgcgggaggagctc agtgaagccatgagcgaggtggaggggctggaggccaatgagggcagtaagaccttgcaa cggaaccggaagatggcaatgggcaggaagaagttcaacatggaccccaagaaggggatc cagttcttggtggagaatgaactgctgcagaacacacccgaggagatcgcccgcttcctg tacaagggcgaggggctgaacaagacagccatcggggactacctgggggagagggaagaa ctgaacctggcagtgctccatgcttttgtggatctgcatgagttcaccgacctcaatctg gtgcaggccctcaggcagtttctatggagctttcgcctacccggagaggcccagaaaatt gaccggatgatggaggccttcgcccagcgatactgcctgtgcaaccctggggttttccag tccacagacacgtgctatgtgctgtccttcgccgtcatcatgctcaacaccagtctccac aatcccaatgtccgggacaagccgggcctggagcgctttgtggccatgaaccggggcatc aacgagggcggggacctgcctgaggagctgctcaggggggctcgaatggctaatgcagcc tttactcctcagaacctgtacgacagcatccgaaatgagcccttcaagattcctgaggat gacgggaatgacctgacccacaccttcttcaacccggaccgggagggctggctcctgaag 
ctggggggccgggtgaagacgtggaagcggcgctggtttatcctcacagacaactgcctc tactactttgagtacaccacggacaaggagccccgaggaatcatccccctggagaatctg agcatccgagaggtggacgacccccggaaaccgaactgctttgaactttacatccccaac aacaaggggcagctcatcaaagcctgcaaaactgaggcggacggccgagtggtggaggga aaccacatggtgtaccggatctcggcccccacgcaggaggagaaggacgagtggatcaag tccatccaggcggctgtgagtgtggaccccttctatgagatgctggcagcgagaaagaag cggatttcagtcaagaagaagcaggagcagccctga >gi568815579f:48370352_48579207|GENSCAN_predicted_peptide_7|1460_aa MPAPGALILLAAVSASGCLASPAHPDGFALGRAPLAPPYAVVLISCSGLLAFIFLLLTCL CCKRGDVGFKEFENPEGEDCSGEYTPPAEETSSSQSLPDVYILPLAEVSLPMPAPQPSHS DMTTPLGLSRQHLSYLQEIGSGWFGKVILGEIFSDYTPAQVVVKELRASAGPLEQRKFIS EAQPYRSLQHPNVLQCLGLCVETLPFLLIMEFCQLGDLKRYLRAQRPPEGLSPELPPRDL RTLQRMGLEIARGLAHLHSHNYVHSDLALRNCLLTSDLTVRIGDYGLAHSNYKEDYYLTP ERLWIPLRWAAPELLGELHGTFMVVDQSRESNIWSLGVTLWELFEFGAQPYRHLSDEEVL AFVVRQQHVKLARPRLKLPYADYWYDILQSCWRPPAQRPSASDLQLQLTYLLSERPPRPP PPPPPPRDGPFPWPWPPAHSAPRPGTLSSPFPLLDGFPGADPDDVLTVTESSRGLNLECL WEKARRGAGRGGGAPAWQPASAPPAPHANPSNPFYEALSTPSVLPVISARSPSVSSEYYI RLEEHGSPPEPLFPNDWDPLDPGVPAPQAPQAPSEVPQLVSETWASPLFPAPRPFPAQSS ASGSFLLSGWDPEGRGAGETLAGDPAEVLGERGTAPWVEEEEEEEEGSSPGEDSSSLGGG PSRRGPLPCPLCSREGACSCLPLERGDAVAGWGGHPALGCPHPPEDDSSLRAERGSLADL PMAPPASAPPEFLDPLMGAAAPQYPGRGPPPAPPPPPPPPRAPADPAASPDPPSAVASPG SGLSSPGPKPGDSGYETETPFSPEGAFPGGGAAEEEGVPRPRAPPEPPDPGAPRPPPDPG PLPLPGPREKPTFVVQVSTEQLLMSLREDVTRNLLGEKGATARETGPRKAGRGPGNREKV PGLNRDPTVLGNGKQAPSLSLPVNGVTVLENGDQRAPGIEEKAAENGALGSPEREEKVLE NGELTPPRREEKALENGELRSPEAGEKVLVNGGLTPPKSEDKVSENGGLRFPRNTERPPE TGPWRAPGPWEKTPESWGPAPTIGEPAPETSLERAPAPSAVVSSRNGGETAPGPLGPAPK NGTLEPGTERRAPETGGAPRAPGAGRLDLGSGGRAPVGTGTAPGGGPGSGVDAKAGWVDN TRPQPPPPPLPPPPEAQPRRLEPAPPRARPEVAPEGEPGAPDSRAGGDTALSGDGDPPKP ERKGPEMPRLFLDLGPPQGNSEQIKARLSRLSLALPPLTLTPFPGPGPRRPPWEGADAGA AGGEAGGAGAPGPAEEDGEDEDEDEEEDEEAAAPGAAAGPRGPGRARAAPVPVVVSSADA DAARPLRGLLKSPRGADEPEDSELERKRKMVSFHGDVTVYLFDQETPTNELSVQAPPEGD TDPSTPPAPPTPPHPATPGDGFPSNDSGFGGSFEWAEDFPLLPPPGPPLCFSRFSVSPAL ETPGPPARAPDARPAGPVEN 
>gi568815579f:48370352_48579207|GENSCAN_predicted_CDS_7|4383_bp atgcctgcccccggcgccctcatcctccttgcggccgtctccgcctccggctgcctggcg tccccggcccaccccgatggattcgccctgggccgggctcctctggctcctccctacgct gtggtcctcatttcctgctccggcctgctggccttcatcttcctcctcctcacctgtctg tgctgcaaacggggcgatgtcggcttcaaggaatttgagaaccctgaaggggaggactgc tccggggagtacactccccctgcggaggagacctcctcctcacagtcgctgcctgatgtc tacattctcccgctggctgaggtctccctgccaatgcctgccccgcagccttcacactca gacatgaccacccccctgggccttagccggcagcacctgagctacctgcaggagattggg agtggctggtttgggaaggtgatcctgggagagattttctccgactacacccccgcccag gtggtggtgaaggagctccgagccagcgcggggcccctggagcaacgcaagttcatctcg gaagcacagccgtacaggagcctgcagcaccccaatgtcctccagtgcctgggtctgtgc gtggagacgctgccgtttctgctgattatggagttctgtcaactgggggacctgaagcgt tacctccgagcccagcggccccccgagggcctgtcccctgagctaccccctcgagacctg cggacgctgcagaggatgggcctggagatcgcccgcgggctggcgcacctgcattcccac aactacgtgcacagcgacctggccctgcgcaactgcctgctgacctctgacctgaccgtg cgcatcggagactacgggctggcccacagcaactacaaggaggactactacctgacccca gagcgcctgtggatcccactgcgctgggcggcgcccgagctcctcggggagctccacggg accttcatggtggtggaccagagccgcgagagcaacatctggtccctgggggtgaccctg tgggagctgtttgagtttggggcccagccctaccgccacctgtcagacgaggaggtcctc gccttcgtggtccgccagcagcatgtgaagctggcccggccgaggctcaagctgccttac gcggactactggtatgacattcttcagtcctgctggcggccacctgcccagcgcccttca gcctctgatctccaattgcagctcacctacttgctctccgagcggcctccccggccccca ccgccgccacccccaccccgagacggtcccttcccctggccctggccccctgcacacagt gcgccccgcccggggaccctctcctcaccgttccccctactggatggcttccctggagcc gaccccgacgatgtgctcacggtcaccgagagtagccgcggcctcaacctcgagtgcctg tgggagaaggcccggcgtggggccggccggggtgggggggcacctgcctggcagccggcg tcggcccccccggccccccacgccaacccctccaaccctttctacgaggcgctgtccacg cccagcgtgctgcctgtcatcagcgcccgcagcccctccgtgagcagcgagtactacatc cgcttggaggagcacggctcccctcctgagcccctcttccccaacgactgggaccccctg gacccaggagtgcccgcccctcaggccccccaggccccctccgaggtcccccagctggtg tccgagacctgggcctcccccctcttccctgcgccccggcccttcccagcccagtcctca gcgtcaggcagcttcctgctgagcggctgggaccccgagggccggggcgccggggagacc 
ctggcgggagaccctgccgaggtcttgggggagcgggggaccgccccgtgggtggaagaa gaagaggaggaggaggagggcagctccccaggggaagacagcagcagccttggaggtggc ccaagccgccggggtcccctaccctgtcccctgtgcagccgcgagggggcctgctcctgc ctgccactggagcggggggacgccgtagcaggctggggaggccaccctgctcttggctgc ccccacccccccgaggacgactcctcgctgcgggcagagcggggctccctggccgacttg cccatggccccccccgcctcggccccccccgagtttctggaccccctcatgggggcggcg gcgccccagtaccccgggcgggggccacctcccgctcccccccccccgccgccacctcct cgggcccccgcggacccggccgcgtcccccgaccccccttcggccgtggccagtcccggt tcaggcctctcgtcgccgggccccaagccgggggacagcggctacgagaccgagacccct ttttccccagagggagccttcccaggtgggggggcggccgaggaggaaggggtccctcgg ccgcgggctccccccgagccacccgacccaggagcgccccggccacctccagacccgggt ccgctcccactcccggggccccgggagaagccgaccttcgtggttcaagtgagcacggaa cagctgctgatgtccctgcgggaggatgtgacaaggaacctcctgggggagaagggggcg acagcccgggagacaggacccaggaaggcggggagaggccccgggaacagagagaaagtc ccgggcctgaacagggacccgacagtcctgggcaacgggaaacaagccccaagcctgagc ctcccagtgaacggggtgacagtgctggagaacggggaccagagagccccaggcatcgag gagaaggcggcggagaatggggccctggggtcccccgagagagaagagaaagtgctggag aatggggagctgacacccccaaggagggaggagaaagcgctggagaatggggagctgagg tccccagaggccggggagaaggtgctggtgaatgggggcctgacacccccaaagagcgag gacaaggtgtcagagaatgggggcctgagattccccaggaacacggagaggccaccagag actgggccttggagagccccagggccctgggagaagacgcccgagagttggggtccagcc cccacgatcggggagccagccccagagacctctctggagagagcccctgcacccagcgca gtggtctcctcccggaacggcggggagacagcccctggcccccttggcccagcccccaag aacgggacgctggaacccgggaccgagaggagagcccccgagactgggggggcgccgaga gccccaggggctgggaggctggacctcgggagtgggggccgagccccagtgggcacgggg acggcccccggcggcggccccggaagcggcgtggacgcaaaggccggatgggtagacaac acgaggccgcagccaccgccgccaccgctgccaccgccaccggaggcacagccgaggagg ctggagccagcgcccccgagagccaggccggaggtggcccccgagggagagcccggggcc ccagacagcagggccggcggagacacggcactcagcggagacggggacccccccaagccc gagaggaagggccccgagatgccacgactattcttggacttgggaccccctcaggggaac agcgagcagatcaaagccaggctctcccggctctcgctggcgctgccgccgctcacgctc acgccattcccggggccgggcccgcggcggcccccgtgggagggcgcggacgccggggcg 
gctggcggggaggccggcggggcgggagcgccggggccggcggaggaggacggggaggac gaggacgaggacgaggaggaggacgaggaggcggcggcgccgggcgcggcggcggggccg cggggccccgggagggcgcgagcagccccggtgcccgtcgtggtgagcagcgccgacgcg gacgcggcccgcccgctgcgggggctgctcaagtctccgcgcggggccgacgagccagag gacagcgagctggagaggaagcgcaagatggtctccttccacggggacgtgaccgtctac ctcttcgaccaggagacgccaaccaacgagctgagcgtccaggccccccccgagggggac acggacccgtcaacgcctccagcgcccccgacacctccccaccccgccacccccggagat gggtttcccagcaacgacagcggctttggaggcagtttcgagtgggcggaggatttcccc ctcctcccccctccaggccccccgctgtgcttctcccgcttctccgtctcgcctgcgctg gagaccccggggccacccgcccgggcccccgacgcccggcccgcaggccccgtggagaat tga >gi568815579f:48370352_48579207|GENSCAN_predicted_peptide_8|109_aa MEKNKANRETVGTASFLQSQSLSFFCVPGTGDTAGNKSKDPCSPGLYARICGPKSESDFV SETTEDLDITPANGWALTVWKALGNPLLLPRQDPLHPHPDSVVPGSAKS >gi568815579f:48370352_48579207|GENSCAN_predicted_CDS_8|330_bp atggagaagaataaggccaatcgagagacagtaggcacagcttcattcctgcagtcacag tcactgagcttcttctgtgtacccggcactggggacacagcaggaaacaaaagcaaagat ccttgttctccagggctctatgcacgaatatgcggccccaagtcagagagtgactttgtt tctgagaccaccgaggatcttgatatcacccctgctaatggctgggcgctcacagtttgg aaagctctagggaaccctctgttactccccagacaagacccactccatccccaccctgac tcagtggtgcccggcagtgctaagtcctga >gi568815579f:48370352_48579207|GENSCAN_predicted_peptide_9|104_aa MDGPAEPQIPGLWDTYEDDISEISYLPKRNENAGSHKDRNNCGSFGSNSLTWKPPSQKLP GEYFRYKGVPFPVGLYSLESISLAENTQDVRDDDIFIITYPKSX >gi568815579f:48370352_48579207|GENSCAN_predicted_CDS_9|312_bp atggacgggcccgccgagccccagatcccgggcttgtgggacacctatgaagatgacatc tcggaaatcagttatttgcccaagagaaatgagaacgccggttcccacaaagacaggaac aactgtggcagctttgggagtaatagcctgacctggaaaccgccgagccagaagttgcca ggtgaatacttccggtacaagggcgtccccttccccgtcggcctgtactcgctcgagagc atcagcttggcggagaacacccaagatgtgcgggacgacgacatctttatcatcacctac cccaagtcagnn