GENSCAN 1.0 Date run: 7-Nov-116 Time: 15:53:09 Sequence gi568815581r:44094266_44298070 : 203805 bp : 53.08% C+G : Isochore 3 (51 - 57 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.10 PlyA - 1381 1376 6 1.05 1.09 Term - 4032 3960 73 2 1 123 54 34 0.423 1.17 1.08 Intr - 11879 11822 58 0 1 104 52 27 0.133 -0.87 1.07 Intr - 16751 16464 288 0 0 86 111 226 0.957 22.46 1.06 Intr - 20973 20720 254 2 2 120 81 69 0.570 7.01 1.05 Intr - 23286 23229 58 1 1 89 94 88 0.366 7.93 1.04 Intr - 23523 23502 22 0 1 107 13 18 0.108 -6.20 1.03 Intr - 29416 29239 178 0 1 37 88 292 0.269 24.54 1.02 Intr - 40091 40069 23 2 2 113 88 17 0.342 1.02 1.01 Init - 40695 40615 81 0 0 93 -3 100 0.478 2.32 1.00 Prom - 41477 41438 40 -3.81 2.00 Prom + 42505 42544 40 -5.11 2.01 Init + 47114 47154 41 1 2 80 106 3 0.584 1.22 2.02 Intr + 47579 47948 370 2 1 59 -28 185 0.262 -0.33 2.03 Intr + 50938 50988 51 0 0 108 75 84 0.929 8.69 2.04 Intr + 53593 54762 1170 0 0 97 109 408 0.962 33.34 2.05 Intr + 56696 56779 84 1 0 94 78 40 0.694 4.11 2.06 Intr + 57260 57349 90 1 0 83 64 23 0.496 0.09 2.07 Intr + 58372 58512 141 0 0 63 116 131 0.974 14.46 2.08 Intr + 60291 60399 109 2 1 55 90 134 0.190 10.66 2.09 Intr + 60588 60673 86 1 2 74 80 100 0.991 7.84 2.10 Intr + 61021 61146 126 0 0 114 116 129 0.973 19.68 2.11 Intr + 63568 63676 109 0 1 54 103 108 0.953 9.26 2.12 Term + 67606 67667 62 2 2 86 53 53 0.781 -0.14 2.13 PlyA + 68193 68198 6 1.05 3.00 Prom + 72676 72715 40 -3.71 3.01 Init + 76558 76825 268 0 1 55 49 395 0.831 27.44 3.02 Intr + 77781 78048 268 1 1 89 42 368 0.681 29.43 3.03 Intr + 82473 82965 493 0 1 109 69 659 0.974 59.69 3.04 Intr + 83344 83457 114 0 0 72 95 67 0.991 6.95 3.05 Intr + 83940 84102 163 2 1 81 -15 154 0.010 4.46 3.06 Intr + 85605 85889 285 1 0 29 48 184 0.012 6.36 3.07 Intr + 93642 93744 103 1 1 88 88 55 0.525 5.23 3.08 Intr + 94757 95323 567 2 0 132 105 382 0.930 36.80 3.09 Term + 96236 96599 364 2 1 97 49 311 0.982 22.50 3.10 PlyA + 97440 97445 6 1.05 4.14 PlyA - 97563 97558 6 1.05 4.13 Term - 100146 99998 149 1 2 142 38 177 0.999 16.48 4.12 Intr - 100409 100252 158 1 2 87 84 153 0.950 14.97 4.11 Intr - 100574 100503 72 1 0 82 88 42 0.772 2.62 4.10 Intr - 100875 100832 44 0 2 80 105 38 0.575 2.23 4.09 Intr - 101222 101154 69 2 0 92 89 50 0.967 5.37 4.08 Intr - 101563 101535 29 2 2 123 113 -16 0.927 2.72 4.07 Intr - 101835 101769 67 0 1 85 115 -3 0.674 1.07 4.06 Intr - 102153 102131 23 1 2 103 100 16 0.918 2.05 4.05 Intr - 102761 102664 98 1 2 130 44 235 0.999 23.45 4.04 Intr - 103134 102963 172 1 1 117 93 185 0.999 21.42 4.03 Intr - 103486 103333 154 1 1 55 85 198 0.842 16.36 4.02 Intr - 103865 103755 111 2 0 71 78 67 0.846 5.08 4.01 Init - 104159 104121 39 2 0 92 84 -12 0.538 -1.07 4.00 Prom - 110715 110676 40 -2.71 5.50 PlyA - 110792 110787 6 1.05 5.49 Term - 113102 112977 126 2 0 112 54 319 0.999 29.49 5.48 Intr - 113332 113189 144 1 0 68 59 432 0.996 39.29 5.47 Intr - 113505 113434 72 0 0 97 99 76 0.925 9.70 5.46 Intr - 113646 113599 48 0 0 118 82 40 0.906 5.76 5.45 Intr - 115276 115087 190 0 1 79 100 413 0.996 41.71 5.44 Intr - 115468 115380 89 1 2 95 58 102 0.999 7.17 5.43 Intr - 115969 115859 111 1 0 95 75 142 0.998 14.68 5.42 Intr - 116208 116053 156 0 0 62 56 328 0.745 27.72 5.41 Intr - 116682 116527 156 0 0 66 77 403 0.997 37.72 5.40 Intr - 116887 116774 114 1 0 96 86 196 0.993 21.35 5.39 Intr - 117066 117025 42 0 0 73 105 57 0.857 4.82 5.38 Intr - 117482 117341 142 1 1 67 99 235 0.994 23.36 5.37 Intr - 117741 117608 134 0 2 76 49 233 0.998 18.15 5.36 Intr - 118189 118079 111 1 0 91 50 155 0.869 13.08 5.35 Intr - 118674 118554 121 2 1 99 76 195 0.893 20.40 5.34 Intr - 119017 118953 65 1 2 142 110 58 0.999 11.41 5.33 Intr - 121544 121389 156 2 0 109 81 226 0.957 24.72 5.32 Intr - 121724 121641 84 2 0 123 113 82 0.999 14.71 5.31 Intr - 122439 122264 176 1 2 81 78 123 0.996 10.78 5.30 Intr - 123996 123907 90 1 0 18 97 70 0.333 1.36 5.29 Intr - 125429 125180 250 2 1 29 58 246 0.455 13.25 5.28 Intr - 125880 125718 163 2 1 67 -3 109 0.383 0.19 5.27 Intr - 126508 126475 34 2 1 105 123 12 0.102 4.17 5.26 Intr - 135088 134963 126 2 0 91 47 183 0.983 15.66 5.25 Intr - 135319 135193 127 1 1 1 36 161 0.702 2.96 5.24 Intr - 155494 155437 58 0 1 107 87 13 0.035 2.48 5.23 Intr - 156273 156197 77 0 2 124 28 162 0.048 12.61 5.22 Intr - 157067 156894 174 2 0 77 56 253 0.989 21.65 5.21 Intr - 157323 157154 170 1 2 85 59 324 0.907 29.38 5.20 Intr - 159106 158853 254 0 2 100 92 365 0.987 35.71 5.19 Intr - 160397 160231 167 2 2 101 57 143 0.987 11.77 5.18 Intr - 161031 160942 90 0 0 102 60 170 0.999 16.29 5.17 Intr - 161581 161408 174 1 0 35 82 205 0.931 15.25 5.16 Intr - 163279 163085 195 1 0 80 81 411 0.999 39.73 5.15 Intr - 163542 163394 149 1 2 87 -7 204 0.980 11.26 5.14 Intr - 163915 163721 195 2 0 112 52 368 0.999 35.51 5.13 Intr - 164358 164148 211 0 1 122 50 249 0.999 23.51 5.12 Intr - 165079 164898 182 2 2 118 86 265 0.999 29.40 5.11 Intr - 165316 165232 85 1 1 114 86 52 0.999 7.49 5.10 Intr - 165667 165544 124 0 1 100 109 43 0.999 8.69 5.09 Intr - 166274 166139 136 0 1 113 49 139 0.930 12.63 5.08 Intr - 166550 166370 181 2 1 27 84 184 0.979 11.86 5.07 Intr - 167371 167310 62 2 2 61 100 50 0.984 2.44 5.06 Intr - 168461 168371 91 2 1 85 99 181 0.985 18.97 5.05 Intr - 168669 168587 83 1 2 91 94 58 0.471 6.55 5.04 Intr - 175058 174941 118 2 1 150 -3 18 0.013 -0.56 5.03 Intr - 178935 178882 54 0 0 127 96 -7 0.893 3.66 5.02 Intr - 179979 179807 173 1 2 53 113 27 0.593 1.88 5.01 Init - 187692 187623 70 0 1 83 57 52 0.295 2.76 5.00 Prom - 188848 188809 40 -3.71 6.02 PlyA - 189452 189447 6 1.05 6.01 Term - 201965 201831 135 2 0 92 36 97 0.785 3.23 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 83940 84125 186 2 0 81 46 175 0.946 10.41 S.002 Term - 156273 156193 81 0 0 124 55 169 0.942 15.19 S.003 Term - 175058 174831 228 2 0 150 48 66 0.917 5.76 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:44094266_44298070|GENSCAN_predicted_peptide_1|344_aa MDGPEILLLEDDEMCLPYAINAGLPARDGKLPEGSSGSGHVTGVTSPNVVVGGGGERSRR SRRKDGGAVEEVLPPLPPPLLPPPPAKPELEPQRVQTDELQAPNASFLQAQSRHELSQRV GVKDSRLCKGQSFWFPSSPLSVGAFQFGKSSLSITNRLSLWKTEENMRWAIPGPHSEHVS KRDCFGEGQQVETVSDFLLGPTAQEGVPRPDEALPYSVGPVGSGLREPVLCGLPDHLPFG LLPASGGPLPGMLLVPKAQGLVEMLQTIYETESCFSADGMSGREPSLEILPRTSLHSIPV TAARQEIRLTVSMHTAVTAVRCLVLMATAATCSAHARLAEGTDP >gi568815581r:44094266_44298070|GENSCAN_predicted_CDS_1|1035_bp atggatgggccagaaatcctgctgttggaggatgatgaaatgtgccttccatacgccatc aatgcaggacttcctgccagggatggaaagctccccgagggcagctccgggtcgggtcac gtgacgggagtgacgtctccgaatgttgttgttggtggcggcggcgagcggagccggagg agccgccgcaaagatggaggagccgtcgaggaggtgctgccgccgctgccgccgccgctg ctgccgccgccgcccgcgaagccggagctcgagccgcagcgggtacaaactgatgagctc caagcccccaacgccagcttcctgcaggcccagagccggcatgaactctcccaacgagtc ggagtaaaggacagtaggctttgtaaggggcagagcttctggttcccctcaagtcctctc agcgtaggagcttttcagtttggcaagtcctcactgagtatcaccaacaggctcagcctg tggaagacagaagaaaatatgagatgggccatccctggtcctcacagtgaacatgttagc aagagagactgttttggtgaggggcagcaggtggagactgtgagtgatttccttctgggg cccactgcccaggaaggagtgcccaggccggatgaggcacttccctacagtgtgggtccc gtcggcagtggcctgagggaacctgtgctgtgcgggctgcccgaccaccttcccttcggt ctgctccccgcctctggcggccccctccctggcatgctgctggtgcccaaggctcagggg ctcgtggagatgctgcagaccatctatgagacagaatcctgtttctcagcagatgggatg tcaggtcgggaaccatccttggaaatcctgccgcggacttctctgcacagcatccctgtg acagctgccagacaggaaatcagattaactgtgtccatgcacacagctgtaacagctgtc aggtgcttggtgctgatggcaactgctgccacctgctctgctcatgccaggctggcagag ggcacagatccttga >gi568815581r:44094266_44298070|GENSCAN_predicted_peptide_2|812_aa MTVPLHSGLGKGVKPERKTTVRRPFVSAGKKYACPFKKAETPQWRRLRMRPKAPAASLLA TFPISQRLIPLTPARKHCPSESETSTWVVSKAPATPRTRGAGPTSPPRPTRRRYGNAAPS LGAGGPQGPLASGTAHGACSLQKLFAVEEEFEDEDFLSAVEDAENRFTGSLPVNAGRLRP VSSRPQETVQAQSSRLLLLHPTAPSEALGLPDLDLCLPASSTPSADSRPSCIGAAPLRPV STSSSWIGNQRRVTVTEVLRETARPQSSALHPLLTFESQQQQVGGFEGPEQDEFDKVLAS MELEEPGMELECGVSSEAIPILPAQQREGSVLAKKARVVDLSGSCQKGPVPAIHKAGIMS AQDESLDPVIQCRTPRPPLRPGAVGHLPVPTALTVPTQQLHWEVCPQRSPVQALQPLQAA RGTIQSSPQNRFPCQPFQSPSSWLSGKAHLPRPRTPNSSCSTPSRTSSGLFPRIPLQPQA PVSSIGSPVGTPKGPQGALQTPIVTNHLVQLVTAASRTPQQPTHPSTRAKTRRFPGPAGI LPHQQSGRSLEDIMVSAPQTPTHGALAKFQTEYFFSAVAPQICQSISDPKQQILSGFKAC INIVASSQASVEEDFGRGPWLTMKSTLGLDERDPSCFLCTYSIVMVLRKAALKQLPRNKV PNMAVMIKSLTRSTMDASVVFKDPTGEMQGTVHRLLLETCQNELKPGSVLLLKQIGVFSP SLRNHYLNVTPNNLVHIYSPDSGDGSFLKPSQPFPKDSGSFQHDVAAKPEEGFRTAQNLE AEASPEEELPEADDLDGLLSELPEDFFCGTSS >gi568815581r:44094266_44298070|GENSCAN_predicted_CDS_2|2439_bp atgactgtgccgctgcactccgggctgggtaagggagtaaaaccagagcggaagactaca gtgagaaggccctttgtgtcggcgggaaagaaatatgcctgcccctttaagaaggcggag acgccccagtggcggcgtcttcgaatgcggcctaaggcgcctgccgccagtctcctggcg actttccctatatcgcagagactcatccctctgaccccagcccggaagcactgtccctcg gagtccgagacttccacctgggtcgtgtccaaggccccggcgactccccggactcggggt gccgggccaacctccccgccgaggcccacccgccgtcgctatggtaatgccgcgcccagc ttgggggctggcgggccgcagggccccctggcctccggaactgctcatggggcgtgcagt ttgcagaagctgtttgctgtggaagaggagtttgaagatgaggatttcttgtctgctgtg gaggatgcagagaaccggtttactggctcactgcctgtgaatgctgggcgcctgagacct gtctcttctaggccacaggagactgtgcaggcacagtcctccaggctgctgctgttacac cccactgctccctcagaggctttgggcctgccagacttggacctctgcctccctgcctcc agcacgcccagtgctgacagccgtccatcatgcataggagcagctcccctaaggcctgtc tctacttccagcagctggattggcaatcagagaagagtgacagtgacagaagtgctcaga gagacagcaagacctcagtcctcagccttacaccccctactcacctttgagagccaacag cagcaagttggtggctttgaggggcctgaacaagacgaatttgataaagtcctggcaagc atggagttggaggagcctggcatggagctggaatgtggagtcagcagtgaggccatacca atcctgcctgcccagcagcgggagggttcagtattggctaaaaaagcccgggtagttgat ctgagtggatcttgccagaaggggcctgtgcctgccatccacaaagcgggtatcatgtcc gcccaggatgagtctctagatcctgtcatccaatgtaggactccacgaccccccttgaga cctggtgctgtgggtcaccttcctgttccaactgccttaacagttcccactcagcaactc cactgggaagtctgtccgcaacgctcccctgttcaagcacttcagcctctccaagctgct agagggaccattcagagcagccctcaaaatcgtttcccttgtcagccattccagtctcca agttcctggttaagtggcaaagctcatttacccagacctcgaactcccaactcaagctgt tctactccctcaaggactagctctggattatttcctcggatacccttacaaccgcaagct ccagtgtcttccattgggtctcctgttggtaccccaaaaggtccccagggagctctgcag acacccatagtcaccaaccacctggtgcagctagtcactgctgccagccggacaccccag cagcccacccatccctccacccgagccaaaactcgccgtttccctggcccagctgggatc ctgcctcaccagcagagtgggagaagtctggaggacatcatggtttccgcgccccaaact ccaacccatggtgctctggctaaattccagacagagtattttttctctgctgtagctccg cagatctgccagtccatttcagatccaaagcagcagatactctctggattcaaagcatgt attaatattgttgctagttcccaggcatctgtggaggaggattttgggcgagggccctgg ctgaccatgaaatccacgctaggcctggatgagagagaccctagctgcttcctctgtacc tacagcattgtcatggtgctgcgcaaggcagccctgaagcagcttcctaggaacaaggtc cccaacatggcggtgatgatcaagtccctgactcggagcacaatggacgccagtgtggtt ttcaaggaccccacgggagagatgcaggggacggtgcacaggttgctgctggagacgtgc cagaatgagctgaagcctggctcagtgctgctgctgaagcagattggagtgttttctcct tcacttcgaaatcactacctcaacgtgacacccaacaacctggtccatatttacagcccg gattctggggatgggagcttcctcaagccatctcagcccttccccaaggattcagggagc ttccagcatgatgtggctgcaaagcccgaggaaggcttcagaacagcacagaacctagag gcagaggcgtcccctgaggaagaactcccagaagcagatgacctggatggactcctgagt gagcttcctgaagacttcttctgtgggaccagtagttga >gi568815581r:44094266_44298070|GENSCAN_predicted_peptide_3|874_aa MLRSLRLQQEWLEWEDRRRAAAQQCRSRRCPSSPRARLTRPHRSCRDPAVHQALFSGNLQ QVQALFQDEEAANMIVETVSNQLAWSAEQGFWVLTPKTKQTAPLAIATARGYTDCARHLI RQGAELDARVGGRAALHEACARAQFDCVRLLLTFGAKANVLTEEGTTPLHLCTIPESLQC AKLLLEAGATVNLAAGESQETPLHVAAARGLEQHVALYLEHGADVGLRTSQGETALNTAC AGAEGPGSCRRHQAAARRLLEAGADARAAGRKRHTPLHNACANGCGGLAELLLRYGARAE VPNGAGHTPMDCALQAVQDSPNWEPEVLFAALLDYGAQPVRPEMLKHCANFPRALEVLLN AYPCVPSCETWVEAVLPELWKEHEAFYSSALCMVNQPRQLQHLARLAVRARLGSRCRQGA TRLPLPPLLRDYLLLHMKRSRCRDRPQPPPPDRREDGVQRAAELSQSLPPRRRAPPGRQR LEERTGPAGPEGKEQPPALASQSAEIAASARLPPRLGSEERLCLAAHRLGYNSLSNYWGI ILDTSVPREIHTILVYQESNRKMDSVDPASSQAMELSDVTLIEGVGNEVMVVAGVVVLIL ALVLAWLSTYVADSGSNQLLGAIVSAGDTSVLHLGHVDHLVAGQGNPEPTELPHPSEGND EKAEEAGEGRGDSTGEAGAGGGVEPSLEHLLDIQGLPKRQAGAGSSSPEAPLRSEDSTCL PPSPGLITVRLKFLNDTEELAVARPEDTVGALKSKYFPGQESQMKLIYQGRLLQDPARTL RSLNITDNCVIHCHRSPPGSAVPGPSASLAPSATEPPSLGVNVGSLMVPVFVVLLGVVWY FRINYRQFFTAPATVSLVGVTVFFSFLVFGMYGR >gi568815581r:44094266_44298070|GENSCAN_predicted_CDS_3|2625_bp atgctgcgctctctccgcctgcagcaggagtggctggaatgggaggaccggcggcgggcg gctgcccagcagtgccggagccgcaggtgcccgtcaagtccccgggcccgactcactagg cctcaccgttcctgccgagacccagctgtccaccaagccctcttctccggcaacctgcag caggtccaagccctgttccaagatgaagaggccgccaacatgattgtggagactgtgagc aaccagctggcctggtcggctgaacaggggttctgggtgctgacccccaagaccaagcag acggcacccctcgccatcgctacagcccgaggctacacagactgtgctcgacacctgatc cggcagggagctgagctggatgcccgtgtcgggggtcgcgctgccttgcatgaggcctgt gcccgagcccagtttgactgtgtgcggctgctgctgaccttcggagccaaggctaatgtg ctgactgaggagggcacgactcctttgcacctctgcacgatccccgagtccttgcagtgc gccaagttgctgctggaagcaggagcgacggtgaacctggcagcaggcgagagccaggag acgcccctgcacgtggcggcggcgcgcggcctggagcaacatgtggctctgtacctggag catggcgccgacgtgggcctgcgcaccagccagggcgagactgcgctgaacacggcgtgc gctggggccgagggcccaggtagctgcaggcgacaccaggctgcggcgcgccggctcctg gaggctggagctgatgcccgggcggccgggcgcaagcgccacacgccgctgcacaacgct tgtgccaacggctgcgggggcctggccgagctgctgctgcgttacggggcccgcgctgag gtccccaatggggcgggccacacgcccatggactgtgcgctgcaggccgtccaggactcc cccaactgggagcctgaagtccttttcgccgcactgctggactacggggcgcagccagtg cgccctgagatgctgaaacactgcgccaacttccctcgggccctggaagtcctgcttaat gcctatccttgtgtcccatcctgtgagacctgggtggaggcggtgctcccagagctgtgg aaggagcacgaagccttctacagctcggccctgtgcatggtgaaccagccaaggcagctg cagcacctggcccgactagctgtgcgcgctcggttgggaagccgctgccggcagggtgcc acccggctgccactgcccccgctcctcagggactacctgctgctgcatatgaagcggagc cgctgccgcgaccgaccgcagccgccgccgcccgaccgccgggaggatggagttcagcgg gcagcggagctgtctcagtctttgccgccgcgccggcgagcgccgcccgggaggcagcgg ctggaggagcggacgggccccgcggggcccgagggcaaggagcagccgcctgccttggcc tcccaaagtgccgagattgcagcctctgcccggctgccaccccgtctgggaagtgaggag cgtctctgcctggccgcccatcgtctgggatataattctttgagcaactactggggcatt atattggacacttcagtaccaagagaaatacatacgatccttgtctaccaggagtctaat agaaagatggacagcgtggaccctgccagcagccaggccatggagctctctgatgtcacc ctcattgagggtgtgggtaatgaggtgatggtggtggcaggtgtggtggtgctgattcta gccttggtcctagcttggctctctacctacgtagcagacagcggtagcaaccagctcctg ggcgctattgtgtcagcaggcgacacatccgtcctccacctggggcatgtggaccacctg gtggcaggccaaggcaaccccgagccaactgaactcccccatccatcagagggtaatgat gagaaggctgaagaggcgggtgaaggtcggggagactccactggggaggctggagctggg ggtggtgttgagcccagccttgagcatctccttgacatccaaggcctgcccaaaagacaa gcaggtgcaggcagcagcagtccagaggcccccctgagatctgaggatagcacctgcctc cctcccagccctggcctcatcactgtgcggctcaaattcctcaatgataccgaggagctg gctgtggctaggccagaggataccgtgggtgccctgaagagcaaatacttccctggacaa gaaagccagatgaaactgatctaccagggccgcctgctacaagacccagcccgcacactg cgttctctgaacattaccgacaactgtgtgattcactgccaccgctcacccccagggtca gctgttccaggcccctcagcctccttggccccctcggccactgagccacccagccttggt gtcaatgtgggcagcctcatggtgcctgtctttgtggtgctgttgggtgtggtctggtac ttccgaatcaattaccgccaattcttcacagcacctgccactgtctccctggtgggagtc accgtcttcttcagcttcctagtatttgggatgtatggacgataa >gi568815581r:44094266_44298070|GENSCAN_predicted_peptide_4|394_aa MGIKEGFEFWGPRPCCRPLCYEQSERRLHKSLQMKMEEMSLSGLDNSKLEMFSPGAQAIA QEIYADLVEDSCLGFCFEVHRAVKCGYFFLDDTDPDSMKDFEIVDQPGLDIFGQVFNQWK SKECVCPNCSRSIAASRFAPHLEKCLGMGRNSSRIANRRIANSNNMNKSESDQEDNDDIN DNDWSYGSEKKAKKRKSDKLWYLPFQNPNSPRRSKSLKHKNGELSNSDPFKYNNSTGISY ETLGPEELRSLLTTQCGVISEHTKKMCTRSLRCPQHTDEQRRTVRIYFLGPSAVLPEVES SLDNDSFDMTDSQALISRLQWDGSSDLSPSDSGSSKTSENQGWGLGTNSSESRKTKKKKS HLSLVGTASGLGSNKKKKPKPPAPPTPSIYDDIN >gi568815581r:44094266_44298070|GENSCAN_predicted_CDS_4|1185_bp atgggtatcaaagaggggtttgagttttggggacctaggccgtgttgccgcccgctgtgc tatgagcagtcagagcgccgtctccacaagagtttacaaatgaaaatggaggaaatgtct ttgtctggcctggataacagcaaactagagatgttctcccctggggcccaggccatcgct caggagatatacgcggacctggtcgaggattcttgtttgggattctgctttgaggtacac cgggctgtcaagtgtggctacttcttcttggacgacacggaccctgatagcatgaaggat tttgagatcgtggaccagccgggcttggacatctttggacaggttttcaaccagtggaag agcaaggagtgtgtttgccccaattgcagtcgcagcattgccgcctcccgctttgctccc catctggagaagtgcctgggaatgggtcggaacagcagccgaatcgccaaccgccggatt gccaatagcaacaatatgaataagtctgagagtgaccaagaagataatgatgacatcaat gacaacgactggtcctatggctcggagaagaaagccaagaagagaaagtcagacaagcta tggtatctcccattccagaaccccaattcccctcgaagatccaagtcattaaaacacaaa aatggggaacttagcaattcggatccttttaagtataacaattcaactgggatcagctat gagaccctggggccggaggagcttcgcagcctgctaaccacgcaatgtggggtgatttct gaacacaccaagaagatgtgcacaaggtccctgcgctgcccacagcacacagatgagcag aggcgaaccgtacggatttattttctcgggccctcggctgtccttccagaggtcgagagc tccctggataatgacagctttgacatgactgacagccaggccctgatcagccggcttcag tgggacggctcctctgacctctcaccctctgattcaggctcctccaagacgagtgaaaat cagggatggggtctaggtaccaacagctctgagtcacggaaaaccaagaaaaagaaatcc catctgagcctggtagggactgcctccggcctaggttccaacaagaagaagaagccaaag ccaccggcacccccgacgcccagcatctatgatgacatcaactga >gi568815581r:44094266_44298070|GENSCAN_predicted_peptide_5|2099_aa MPGHKDEELDFRDLWELIQGRTTRVGRLAGEKEFTELGMQTMNLEGQSFGPASLERKGIS GQGNSAWKDPEAKRTENIQRKLSGDPAELPACISGRMLRDSAPGRLVVQIVNGLAVAGFV HLCVPVFEFLLLESELQAGTRGAGYAGGSDHRRQLDTQDHAMEELQDDYEDMMEENLEQE EYEDPDIPESQMEEPAAHDTEATATDYHTTSHPGTHKVYVELQELVMDEKNQELRWMEAA RWVQLEENLGENGAWGRPHLSHLTFWSLLELRRVFTKGTVLLDLQETSLAGVANQLLDRF IFEDQIRPQDREELLRALLLKHSHAGELEALGGVKPAVLTRSGDPSQPLLPQHSSLETQL FCEQGDGGTEGHSPSGILEKIPPDSEATLVLVGRADFLEQPVLGFVRLQEAAELEAVELP VPIRFLFVLLGPEAPHIDYTQLGRAAATLMSERVFRIDAYMAQSRGELLHSLEGFLDCSL VLPPTDAPSEQALLSLVPVQRELLRRRYQSSPAKPDSSFYKGLDLNGGPDDPLQQTGQLF GGLVRDIRRRYPYYLSDITDAFSPQVLAAVIFIYFAALSPAITFGGLLGEKTRNQMGVSE LLISTAVQGILFALLGAQPLLVVGFSGPLLVFEEAFFSFCETNGLEYIVGRVWIGFWLIL LVVLVVAFEGSFLVRFISRYTQEIFSFLISLIFIYETFSKLIKIFQDHPLQKTYNYNVLM VPKPQGPLPNTALLSLVLMAGTFFFAMMLRKFKNSSYFPGKLRRVIGDFGVPISILIMVL VDFFIQDTYTQKLSVPDGFKVSNSSARGWVIHPLGLRSEFPIWMMFASALPALLVFILIF LESQITTLIVSKPERKMVKGSGFHLDLLLVVGMGGVAALFGMPWLSATTVRSVTHANALT VMGKASTPGAAAQIQEVKEQRISGLLVAVLVGLSILMEPILSRIPLAVLFGIFLYMGVTS LSGIQLFDRILLLFKPPKYHPDVPYVKRVKTWRMHLFTGIQIICLAVLWVVKSTPASLAL PFVLILTVPLRRVLLPLIFRNVELQCLDADDAKATFDEEEGRDEYDEVAMPVPYLETLFS GPRERRPGEDKPAGGDPEVRKQMPPPPPCPAGRELFNDPYINVQNLDKARQAEVVSMAEQ LQGEPWFHGKLSQREAEALLQLHGDFLVRESMTKASAPGLRSHPHARLRAPPPSGRLRVS LPRPPPPSPAAAAAAAAAAAAASTTTTTPLPSFLPSASAARAEGAVLCAFRREEQQAAAT AAAAAAAATAAAAAAPAPPPRPEEEPLPPREGAAAVPGRAGEGAATQSQGGSRWSGKPGG RAAPARPRRWLDSWRMNGEADCPTDLEMAAPKGQDRWSQEDMLTLLECMKNNLPSNDSSK FKTTESHMDWEKVAFKDFSGDMCKLKWVEISNEVRKFRTLTELILDAQEHVKNPYKGKKL KKHPDFPKKPLTPYFRFFMEKRAKYAKLHPEMSNLDLTKILSKKYKELPEKKKMKYIQDF QREKQEFERNLARFREDHPDLIQNAKKSDIPEKPKTPQQLWYTHEKKVYLKVRPDATTKE VKDSLGKQWSQLSDKKRLKWIHKALEQRKEYEEIMRDYIQKHPELNISEEGITKSTLTKA ERQLKDKFDGRPTKPPPNSYSLYCAELMANMKDVPSTERMVLCSQQWKLLSQKEKDAYHK KCDQKKKDYEVELLRFLESLPEEEQQRVLGEEKMLNINKKQATSPASKKPAQEGGKGGSE KPKRPVSAMFIFSEEKRRQLQEERPELSESELTRLLARMWNDLSEKKKAKYKAREAALKA QSERKPGGEREERGKLPESPKRAEEIWQQSVIGDYLARFKNDRVKALKAMEMTWNNMEKK EKLMWIKKAAEDQKRYERELSEMRAPPAATNSSKKMKFQGEPKKPPMNGYQKFSQELLSN GELNHLPLKERMVEIGSRWQRISQSQKEHYKKLAEEQQKQYKVHLDLWVKSLSPQDRAAY KEYISNKRKSMTKLRGPNPKSSRTTLQSKSESEEDDEEDEDDEDEDEEEEDDENGDSSED GGDSSESSSEDESEDGDENEEDDEDEDDDEDDDEDEDNESEGSSSSSSSSGDSSDSDSN >gi568815581r:44094266_44298070|GENSCAN_predicted_CDS_5|6300_bp atgcctggccataaggatgaagaacttgactttagagacctctgggaacttattcaaggt cgcacaactagggttggaaggttggcgggagagaaggaattcacagagctgggaatgcaa acaatgaaccttgaaggtcaaagctttggtccagcaagtttggaaaggaaaggcatctca ggacaagggaacagtgcatggaaagatccagaagccaagagaacagagaacattcagaga aagctctcaggggaccctgcagaactcccagcgtgtatctctggaaggatgctcagggac tcagcccctggcaggcttgtggtgcagatagtgaatggtcttgcagtggcgggatttgtc cacctgtgtgtccctgtgtttgagtttcttctgcttgagtctgaacttcaggctgggacc cgcggtgcgggttatgctgggggctcagatcaccgtagacaactggacactcaggaccac gccatggaggagctgcaggatgattatgaagacatgatggaggagaatctggagcaggag gaatatgaagacccagacatccccgagtcccagatggaggagccggcagctcacgacacc gaggcaacagccacagactaccacaccacatcacacccgggtacccacaaggtctatgtg gagctgcaggagctggtgatggacgaaaagaaccaggagctgagatggatggaggcggcg cgctgggtgcaactggaggagaacctgggggagaatggggcctggggccgcccgcacctc tctcacctcaccttctggagcctcctagagctgcgtagagtcttcaccaagggtactgtc ctcctagacctgcaagagacctccctggctggagtggccaaccaactgctagacaggttt atctttgaagaccagatccggcctcaggaccgagaggagctgctccgggccctgctgctt aaacacagccacgctggagagctggaggccctggggggtgtgaagcctgcagtcctgaca cgctctggggatccttcacagcctctgctcccccaacactcctcactggagacacagctc ttctgtgagcagggagatgggggcacagaagggcactcaccatctggaattctggaaaag attcccccggattcagaggccacgttggtgctagtgggccgcgccgacttcctggagcag ccggtgctgggcttcgtgaggctgcaggaggcagcggagctggaggcggtggagctgccg gtgcctatacgcttcctctttgtgttgctgggacctgaggccccccacatcgattacacc cagcttggccgggctgctgccaccctcatgtcagagagggtgttccgcatagatgcctac atggctcagagccgaggggagctgctgcactccctagagggcttcctggactgcagccta gtgctgcctcccaccgatgccccctccgagcaggcactgctcagtctggtgcctgtgcag agggagctacttcgaaggcgctatcagtccagccctgccaagccagactccagcttctac aagggcctagacttaaatgggggcccagatgaccctctgcagcagacaggccagctcttc gggggcctggtgcgtgatatccggcgccgctacccctattacctgagtgacatcacagat gcattcagcccccaggtcctggctgccgtcatcttcatctactttgctgcactgtcaccc gccatcaccttcggcggcctcctgggagaaaagacccggaaccagatgggagtgtcggag ctgctgatctccactgcagtgcagggcattctcttcgccctgctgggggctcagcccctg cttgtggtcggcttctcaggacccctgctggtgtttgaggaagccttcttctcgttctgc gagaccaacggtctagagtacatcgtgggccgcgtgtggatcggcttctggctcatcctg ctggtggtgttggtggtggccttcgagggtagcttcctggtccgcttcatctcccgctat acccaggagatcttctccttcctcatttccctcatcttcatctatgagactttctccaag ctgatcaagatcttccaggaccacccactacagaagacttataactacaacgtgttgatg gtgcccaaacctcagggccccctgcccaacacagccctcctctcccttgtgctcatggcc ggtaccttcttctttgccatgatgctgcgcaagttcaagaacagctcctatttccctggc aagctgcgtcgggtcatcggggacttcggggtccccatctccatcctgatcatggtcctg gtggatttcttcattcaggatacctacacccagaaactctcggtgcctgatggcttcaag gtgtccaactcctcagcccggggctgggtcatccacccactgggcttgcgttccgagttt cccatctggatgatgtttgcctccgccctgcctgctctgctggtcttcatcctcatattc ctggagtctcagatcaccacgctgattgtcagcaaacctgagcgcaagatggtcaagggc tccggcttccacctggacctgctgctggtagtaggcatgggtggggtggccgccctcttt gggatgccctggctcagtgccaccaccgtgcgttccgtcacccatgccaacgccctcact gtcatgggcaaagccagcaccccaggggctgcagcccagatccaggaggtcaaagagcag cggatcagtggactcctggtcgctgtgcttgtgggcctgtccatcctcatggagcccatc ctgtcccgcatccccctggctgtactgtttggcatcttcctctacatgggggtcacgtcg ctcagcggcatccagctctttgaccgcatcttgcttctgttcaagccacccaagtatcac ccagatgtgccctacgtcaagcgggtgaagacctggcgcatgcacttattcacgggcatc cagatcatctgcctggcagtgctgtgggtggtgaagtccacgccggcctccctggccctg cccttcgtcctcatcctcactgtgccgctgcggcgcgtcctgctgccgctcatcttcagg aacgtggagcttcagtgtctggatgctgatgatgccaaggcaacctttgatgaggaggaa ggtcgggatgaatacgacgaagtggccatgcctgtcccctaccttgagaccctcttctct gggcccagagagaggcgtcctggtgaggacaagcctgctgggggagatccagaagtccgc aaacagatgccacctccaccaccctgtccagcaggcagagagctcttcaatgatccctat atcaacgtccagaacctagacaaggcccggcaagcagaagtggtttccatggctgagcag ctccaaggggagccctggttccacgggaagctgagccagcgagaggctgaggcattgctg cagctccatggcgacttcctggtgcgggagagcatgaccaaggccagtgcccccggcctg cgctcccatccacacgctcggctccgagccccgccgccatccgggcggctgcgtgtctcc ctgccccggcccccccccccttccccggcggcggcagcagcagcagcagcagccgccgcc gccgccagcaccaccaccaccacccccctcccctccttccttccctccgcctcggccgcc cgggcggagggcgctgtgctttgtgcttttcgccgcgaggagcagcaggcagcagccaca gccgccgccgccgccgccgccgccacagcagcagcagccgccgccccagcgccgccgcct cgcccggaggaggagccgctgccgccgcgggagggagctgcggctgtgcccggccgagcg ggggagggcgccgccactcagagccagggagggagccgctggagcgggaagcccggaggc cgcgctgcgccggcacgaccgaggaggtggctggacagctggaggatgaacggagaagcc gactgccccacagacctggaaatggccgcccccaaaggccaagaccgttggtcccaggaa gacatgctgactttgctggaatgcatgaagaacaaccttccatccaatgacagctccaag ttcaaaaccaccgaatcacacatggactgggaaaaagtagcatttaaagacttttctgga gacatgtgcaagctcaaatgggtggagatttctaatgaggtgaggaagttccgtacattg acagaattgatcctcgatgctcaggaacatgttaaaaatccttacaaaggcaaaaaactc aagaaacacccagacttcccaaagaagcccctgaccccttatttccgcttcttcatggag aagcgggccaagtatgcgaaactccaccctgagatgagcaacctggacctaaccaagatt ctgtccaagaaatacaaggagcttccggagaagaagaagatgaaatatattcaggacttc cagagagagaaacaggagttcgagcgaaacctggcccgattcagggaggatcaccccgac ctaatccagaatgccaagaaatcggacatcccagagaagcccaaaaccccccagcagctg tggtacacccacgagaagaaggtgtatctcaaagtgcggccagatgccactacgaaggag gtgaaggactccctggggaagcagtggtctcagctctcggacaaaaagaggctgaaatgg attcataaggccctggagcagcggaaggagtacgaggagatcatgagagactatatccag aagcacccagagctgaacatcagtgaggagggtatcaccaagtccaccctcaccaaggcc gaacgccagctcaaggacaagtttgacgggcgacccaccaagccacctccgaacagctac tcgctgtactgcgcagagctcatggccaacatgaaggacgtgcccagcacagagcgcatg gtgctgtgcagccagcagtggaagctgctgtcccagaaggagaaggacgcctatcacaag aagtgtgatcagaaaaagaaagattacgaggtggagctgctccgtttcctcgagagcctg cctgaggaggagcagcagcgggtcttgggggaagagaagatgctgaacatcaacaagaag caggccaccagccccgcctccaagaagccagcccaggaagggggcaagggcggctccgag aagcccaagcggcccgtgtcggccatgttcatcttctcggaggagaaacggcggcagctg caggaggagcggcctgagctctccgagagcgagctgacccgcctgctggcccgaatgtgg aacgacctgtctgagaagaagaaggccaagtacaaggcccgagaggcggcgctcaaggct cagtcggagaggaagcccggcggggagcgcgaggaacggggcaagctgcccgagtccccc aaaagagctgaggagatctggcaacagagcgttatcggcgactacctggcccgcttcaag aatgaccgggtgaaggccttgaaagccatggaaatgacctggaataacatggaaaagaag gagaaactgatgtggattaagaaggcagccgaagaccaaaagcgatatgagagagagctg agtgagatgcgggcacctccagctgctacaaattcttccaagaagatgaaattccaggga gaacccaagaagcctcccatgaacggttaccagaagttctcccaggagctgctgtccaat ggggagctgaaccacctgccgctgaaggagcgcatggtggagatcggcagtcgctggcag cgcatctcccagagccagaaggagcactacaaaaagctggccgaggagcagcaaaagcag tacaaggtgcacctggacctctgggttaagagcctgtctccccaggaccgtgcagcatat aaagagtacatctccaataaacgtaagagcatgaccaagctgcgaggcccaaaccccaaa tccagccggactactctgcagtccaagtcggagtccgaggaggatgatgaagaggatgag gatgacgaggacgaggatgaagaagaggaagatgatgagaatggggactcctctgaagat ggcggcgactcctctgagtccagcagcgaggacgagagcgaggatggggatgagaatgaa gaggatgacgaggacgaagacgacgacgaggatgacgatgaggatgaagataatgagtcc gagggcagcagctccagctcctcctcctcaggggactcctcagactctgactccaactga >gi568815581r:44094266_44298070|GENSCAN_predicted_peptide_6|44_aa VLTAAYQSNCILGKGKYSDLSRLLDTELELTLILRNQNAIMGMY >gi568815581r:44094266_44298070|GENSCAN_predicted_CDS_6|135_bp gtacttacagctgcttatcagagcaactgcatcctggggaaagggaaatactcagatctt tcaaggcttttggatacagagttggaactgacactaatattaaggaaccaaaatgccatc atgggcatgtattag