GENSCAN 1.0  Date run: 5-Nov-116  Time: 18:38:02

Sequence gi568815583f:58332033_58668824 : 336792 bp : 42.29% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   2030   2258  229  1  1   -2   44   231 0.795   8.28
 1.02 Intr +   2868   2997  130  1  1   24   76    83 0.523  -0.47
 1.03 Intr +  23939  24101  163  2  1   97   36   151 0.227   9.86
 1.04 Intr +  24878  24902   25  1  1   89   59    20 0.119  -4.12
 1.05 Intr +  25423  25577  155  2  2   72   71    94 0.117   4.97
 1.06 Intr +  31409  31641  233  1  2   48   85    92 0.037   0.55
 1.07 Term +  31998  32211  214  0  1   39   54   140 0.727   1.42
 1.08 PlyA +  32296  32301    6                               1.05

 2.00 Prom +  36550  36589   40                              -3.65
 2.01 Init +  38776  38842   67  0  1   50  100    50 0.212   3.69
 2.02 Intr +  62677  62860  184  2  1   72   43   153 0.407   7.12
 2.03 Intr +  65155  65374  220  1  1   59   75   154 0.223   8.58
 2.04 Term +  66844  66981  138  0  0   92   54    91 0.836   3.08
 2.05 PlyA +  67280  67285    6                              -0.45

 3.06 PlyA -  68223  68218    6                               1.05
 3.05 Term -  69057  68904  154  2  1   37   54   161 0.392   4.01
 3.04 Intr -  71428  71316  113  1  2   29   37    94 0.179  -3.34
 3.03 Intr -  73528  73324  205  0  1   24   33   183 0.242   4.68
 3.02 Intr -  75655  75376  280  2  1  103   49    82 0.538   1.41
 3.01 Init -  76111  75856  256  1  1   64   85   164 0.553  11.14
 3.00 Prom -  85700  85661   40                              -5.95

 4.08 PlyA -  85850  85845    6                               1.05
 4.07 Term -  92201  91977  225  2  0   65   49   125 0.052   2.10
 4.06 Intr -  99497  99292  206  0  2   24   91   132 0.317   5.10
 4.05 Intr - 102446 102325  122  1  2   71   46   109 0.229   4.22
 4.04 Intr - 106879 106660  220  2  1   89   51    80 0.462   0.84
 4.03 Intr - 108499 108314  186  2  0   78   86   103 0.834   7.84
 4.02 Intr - 114666 114410  257  2  2   60   55   198 0.335   9.86
 4.01 Init - 115512 115181  332  0  2   81   44   140 0.760   5.92
 4.00 Prom - 123104 123065   40                              -4.05

 5.00 Prom + 123796 123835   40                              -6.65
 5.01 Init + 125381 125527  147  1  0  105   66   121 0.577  11.74
 5.02 Intr + 127693 127781   89  0  2   75   55    52 0.076  -1.55
 5.03 Intr + 128694 129032  339  0  0   76   78   175 0.064   8.96
 5.04 Intr + 129924 130046  123  0  0   95   61    44 0.546   1.18
 5.05 Intr + 130970 131186  217  2  1   31   59   130 0.223   1.98
 5.06 Term + 133114 133260  147  0  0  102   45    86 0.587   2.62
 5.07 PlyA + 133421 133426    6                              -0.45

 6.04 PlyA - 134082 134077    6                               1.05
 6.03 Term - 135352 135199  154  0  1  -46   43   238 0.441   2.31
 6.02 Intr - 135918 135748  171  2  0  -25   70   173 0.476   2.34
 6.01 Init - 141082 140979  104  1  2   66   49   172 0.722   8.86
 6.00 Prom - 145749 145710   40                              -4.15

 7.06 PlyA - 147295 147290    6                               1.05
 7.05 Term - 157962 157925   38  1  2  100   45    29 0.378  -3.68
 7.04 Intr - 158910 158758  153  1  0  102   69    83 0.834   6.92
 7.03 Intr - 160833 160737   97  0  1   72   84    42 0.322   0.86
 7.02 Intr - 166797 166664  134  1  2   77   94    85 0.876   7.44
 7.01 Init - 168986 168899   88  2  1   79   70    63 0.755   4.45
 7.00 Prom - 169218 169179   40                              -5.45

 8.00 Prom + 170916 170955   40                              -7.85
 8.01 Init + 171453 171543   91  2  1   65   76    46 0.328   1.01
 8.02 Intr + 173819 174048  230  0  2   45   54   121 0.305   1.07
 8.03 Intr + 174249 174465  217  2  1   77   32   135 0.270   3.95
 8.04 Intr + 186510 186744  235  1  1  100   21   127 0.007   3.02
 8.05 Intr + 190529 190858  330  2  0   31   48   204 0.036   4.52
 8.06 Intr + 191161 191281  121  1  1   69   41    78 0.124   0.78
 8.07 Intr + 195254 195376  123  1  0   12   65   132 0.043   3.16
 8.08 Intr + 201083 201155   73  1  1   62   91    94 0.512   5.16
 8.09 Intr + 206301 206485  185  1  2   79   66   139 0.973   9.39
 8.10 Intr + 209232 209359  128  2  2   55   76   114 0.418   5.46
 8.11 Intr + 209592 209700  109  0  1  -15   84    60 0.421  -5.33
 8.12 Intr + 209753 209935  183  1  0  110   71   257 0.999  25.26
 8.13 Intr + 210502 210619  118  0  1  117   91    55 0.970   7.82
 8.14 Intr + 213710 214244  535  0  1  104   33   402 0.022  27.35
 8.15 Intr + 214940 215045  106  2  1   75   57    44 0.377  -0.70
 8.16 Intr + 215183 215399  217  1  1   60   17   156 0.056   2.85
 8.17 Intr + 216298 216540  243  2  0   86  113   177 0.026  16.55
 8.18 Intr + 217399 217582  184  2  1   72   72   105 0.026   5.22
 8.19 Intr + 217794 217818   25  0  1   63   94    45 0.026  -0.29
 8.20 Intr + 220538 220848  311  1  2   89   48   180 0.030   8.29
 8.21 Intr + 222809 222933  125  2  2   36   93    67 0.263   1.31
 8.22 Term + 223885 223982   98  2  2   92   53   122 0.575   6.25
 8.23 PlyA + 224477 224482    6                               1.05

 9.00 Prom + 227968 228007   40                              -6.05
 9.01 Init + 230139 230347  209  2  2   76   81   118 0.934   8.25
 9.02 Intr + 231473 231691  219  2  0   60   57   210 0.974  11.60
 9.03 Intr + 231909 232034  126  0  0    6   48   154 0.494   2.07
 9.04 Intr + 234473 234550   78  2  0   73   50    93 0.198   1.75
 9.05 Term + 236684 236795  112  2  1   98   50    69 0.311   1.15
 9.06 PlyA + 236821 236826    6                               1.05

10.03 PlyA - 237135 237130    6                               1.05
10.02 Term - 240042 239980   63  0  0  101   55    95 0.135   4.41
10.01 Init - 243215 243132   84  2  0   95   46    68 0.171   4.17
10.00 Prom - 261739 261700   40                              -3.95

11.14 PlyA - 262600 262595    6                               1.05
11.13 Term - 265609 265515   95  2  2   88   36    75 0.512  -0.69
11.12 Intr - 267692 267566  127  2  1   60  110    39 0.547   2.63
11.11 Intr - 278485 278265  221  2  2   88  103   100 0.879   8.60
11.10 Intr - 279075 278967  109  0  1   93   78    54 0.947   3.84
11.09 Intr - 279959 279776  184  1  1   73   86    68 0.968   3.97
11.08 Intr - 289589 289439  151  0  1   94   82   141 0.968  12.40
11.07 Intr - 295851 295668  184  0  1   77   76    66 0.962   2.74
11.06 Intr - 301327 301164  164  2  2   52  100   104 0.990   6.77
11.05 Intr - 305825 305718  108  0  0   77   19   101 0.720   1.44
11.04 Intr - 308928 308745  184  0  1   62   68   264 0.997  20.24
11.03 Intr - 311946 311854   93  0  0   81   61   124 0.990   8.24
11.02 Intr - 314172 314023  150  0  0   35   68   149 0.948   7.04
11.01 Intr - 333165 333065  101  1  2  107  110    78 0.240  10.81

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr - 144902 144758  145  0  1   20   42   121 0.809  -0.88
S.002 Term + 186510 186748  239  1  2  100   45   158 0.895   8.05

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815583f:58332033_58668824|GENSCAN_predicted_peptide_1|382_aa
MYYEGKQSKERAKGDGAAIGEGRPRRRHRLSRDWSGVKDSSLEMSAGSAVRGNNCKKALN
VVRTGSTRMSFRRLFIGCLQQAAFAFRDSCGPGQDSLRNVLQSEELCPVLPSLSPFTGGR
FQATLPSTVSQQRQRAHLQETAQDSPLSKRERVVKQNVPENNVEAGILIRDSYGAAPPLA
DRDWRLSLAVSTSMKALEFCLTQGQHGMAVHPPGAPRGEWYKPGLKHSSQNLRQSTCGKG
RGCGRSFSRLKHPCLMALKRAADLPAQYSSSAKGQAASSSGSLTPVSPDGETAPSRGQQT
PHRGELWLASGREQNWTENEFDELTEVGFRRWVITNSSKLKECVLTQCKEAENLEKRLEE
LLTRITTLEKNINDLMELKNTA

>gi568815583f:58332033_58668824|GENSCAN_predicted_CDS_1|1149_bp
atgtactatgaaggaaaacaaagcaaggagagggcaaaaggtgatggagctgctattggc
gaaggcaggcctaggaggagacatcgattgagcagagactggagtggagtgaaggattcc
tccttggaaatgtctgcaggaagtgctgtaagaggaaataactgcaagaaagccctgaat
gttgtgcgaacgggatcaactagaatgagtttcagacgacttttcataggctgcctgcaa
caagcagcctttgcattcagggactcctgtggacctggccaggattctctgaggaatgtg
ctgcagtccgaggagctctgcccagttcttccttccctctctcctttcacaggtggcagg
ttccaggccaccctgcccagcacagtcagccagcaaagacagcgggcccacctgcaagag
acagcgcaagactcacccctcagcaaacgggagcgtgttgtaaaacagaatgtcccagag
aacaatgtggaggctggcatcctgattagagactcctacggagctgcaccccctctggca
gatagagactggagattgtctctggcagtaagcacaagcatgaaggctcttgagttctgt
ttaacccagggccagcatgggatggcagtgcaccctcctggggccccaagaggagaatgg
tacaagccaggtcttaagcacagctcccaaaacttgcgtcagagcacctgcgggaagggg
cgcggctgtgggcgcagcttcagcagacttaaacatccctgcctgatggctctgaagaga
gcagcggatctcccagcacagtattcaagctctgctaagggacaggctgcctcctcaagt
gggtccctgacccctgtgtctcctgatggggagacagctcccagcaggggtcaacagaca
cctcatagaggagagctctggctggcatctggcagggaacaaaactggacagagaatgag
tttgacgaattgacagaagtaggcttcagaaggtgggtaataacaaactcctccaagcta
aaggagtgtgttctaacccaatgcaaggaagctgagaaccttgaaaaaaggttagaggaa
ttgctaactagaataaccactttagagaagaacataaatgacctgatggagctgaaaaac
acagcatga

>gi568815583f:58332033_58668824|GENSCAN_predicted_peptide_2|202_aa
MSLRSQVQSYSSDLEWFNLLLEGGVEVSGVLKCASEGYRVESKGLGLSGHCESPSDTVDC QRFYKAIEVRDGHHINKEATKPGSFQNSVSQVDLVAIYGLCYLPELNIVSKVVTARQPPS SQPKPATTATGSTLAFPYTADGERTNTEQKGQTGAPQTSSISVTEKALVKCKFLGRNLDL TESDTLFSEPSSLGFNRPSGGP >gi568815583f:58332033_58668824|GENSCAN_predicted_CDS_2|609_bp atgagtctgcggtcacaggttcagtcatattcctcagatctagaatggtttaatttactc ctagaaggtggtgtggaagtgtctggagttctcaaatgtgctagtgagggatacagagtt gagagcaaggggctgggcttgagtggccattgtgagtcgccaagtgacacagttgattgc cagaggttctataaggccattgaggtcagagatggacatcacatcaacaaggaagccacc aagcccggaagcttccaaaatagtgttagccaggttgatctcgttgctatttatggactg tgttaccttccagagctcaatattgtttccaaagttgtcacagcaaggcagcccccatcc tctcagcccaaacccgcaaccacagccacgggctctacccttgcttttccctacactgct gatggggagaggacaaatactgaacagaaaggtcaaactggagcacctcagaccagcagc atcagcgttactgagaaggcacttgttaaatgcaaattcttgggccgcaacctagaccta actgaatcagacactctgttctcagagcccagcagtctgggttttaacaggccctctggg ggaccctga >gi568815583f:58332033_58668824|GENSCAN_predicted_peptide_3|335_aa MSTMENYVLANLPAKHKIDQGIRRPPAAEEMIARWESWAKDQPIFWYRKCNHGNSSEGSS SSISNDTLKRYRQREKPVVHLGKGLVTVLTGLSISKAQRGRVAFEATLLSASFISNRDTS ERDEGLCTNNNAHCILLPSAVMWWLLILHRRSLFQTLIDGSSEWRRQQEQMSKAPGKASG RSGEVREIQSMKRIQYIIADFEDGGCQVPGLRVAFGIETSPWLTASKAMETSVLQLQGAE FCSQPDWVSDMKLSLVFVLLCSMPPWMCDTTPLEAGTLASEKDSLIQYLGSHRLEERREH VTADALTDNVDCLKMLKLSRCKIQELRAQQTGVIE >gi568815583f:58332033_58668824|GENSCAN_predicted_CDS_3|1008_bp atgagtaccatggaaaactatgtgctcgccaatcttccagccaaacacaaaatagatcag ggaataaggaggcctcctgctgcagaggagatgatagccaggtgggagagctgggccaag gatcagccaattttctggtacagaaagtgtaaccatggcaacagtagtgaaggtagctct agttccatcagcaatgacacccttaagaggtaccggcagagagagaagcctgttgtacat ctgggcaagggtctcgtaacagtgctcacaggactttcaataagcaaagcccagaggggg agagttgcttttgaagccacactcttgtctgcctcattcatttcaaacagagacacttct gaaagagacgaaggcctgtgtacaaacaacaatgcccactgcattctgttgccttcagct gtgatgtggtggcttttgattctacacagaaggtccttgtttcagacactgattgatggg tcttctgaatggaggaggcagcaagaacaaatgagcaaggcccctggtaaggccagtggc agaagtggggaagtcagagagattcaaagtatgaaaaggattcagtacatcattgctgac 
tttgaagatggagggtgccaggtaccaggactgagagtggcctttggcattgagactagt ccctggttgacagccagcaaggccatggagacctcagtcctacaattgcaaggagctgaa ttttgctcacaacctgattgggtgtctgacatgaagctctccctggtttttgtcttgctt tgctccatgccaccctggatgtgtgacacgacaccactagaagcgggcaccttggcatct gagaaagactcattgattcagtatcttggaagccacagattagaagagaggagagagcat gtaacagcagacgctttgacagataatgtggactgcctgaagatgctgaaactaagcaga tgtaagatccaagagctaagagcacagcaaactggggtcattgagtga >gi568815583f:58332033_58668824|GENSCAN_predicted_peptide_4|515_aa MICMRLFSLLLVRKWFSGSTETHKTWNHFCKTYNGNTANFSYTCVFSREDLWSGGQLSLS VDGPVTDVEHRLVESPSDFRAPVATCRWGSRAGPELGHLGLWDAFHELRQGRERGVQADS WVQGASAMGERSARGHGTVGSTDECENWQQVTKMLTTSDTGETWQKLSKQGNRRILSGSQ SEKRYQQLSWDNWENWRIYFPTDHGAWYTADSLINAVTAAVDLVMFQYKAVCFLTTLVSL GTRKLEHLSLESFWQSRKAKPHWGKNEMRNASGNRPLSGKEREVSQQPHLPVTQKLHSPE PETLSWNSEAAEMSLRYFVTMFRNCTFCGAAGYATAGKALNCQLTQPYGEDEIKCPWSAW HMLGVQHISDSIMKLVMLVVKVWCRKPFTPVSKGADEEAKMPLPSDPLRGDPRVNTLLPP LCSLMFQLCRGLKSKATQQNQCCQPWGVQGHSPHPGYQGRLGTGQIATAQASLQQDATSS SHKGLLSSAHRIHGCRKPGINSPLFSSFPMMCFKF >gi568815583f:58332033_58668824|GENSCAN_predicted_CDS_4|1548_bp atgatatgcatgcgcctcttctcactcctgctggtccggaagtggttttctggttccaca gaaactcacaagacttggaatcacttctgtaagacctataatgggaacacagccaacttc tcatatacgtgtgtgttttccagagaagacttgtggtctggagggcagctctccctctca gtcgatggtcctgtcacagatgtcgagcacagactagtcgagagtccaagcgatttcaga gccccagtggccacttgtagatggggaagtagggcagggcctgagctaggtcacctgggt ctctgggatgctttccatgagctccggcagggcagggagagaggtgtgcaagcagatagc tgggtgcaaggagccagcgccatgggagagaggtccgcaagaggccatgggacggtgggc agcacagatgagtgtgagaattggcagcaagtaaccaagatgctcacaacctcagacact ggggagacatggcagaagctcagcaaacagggcaacaggaggatcttgagtggatcccag tctgaaaaacgctatcaacaactatcttgggacaactgggaaaattggaggatatacttt cctactgaccatggggcctggtacacagcggacagtttaataaatgctgtcacagctgct gttgatcttgttatgtttcagtataaagcagtttgttttcttaccactttagtttctctt gggacaaggaagctggaacatttgtcactggaaagtttttggcagtcacggaaagccaag ccccactggggaaaaaatgaaatgaggaatgcttctggaaatagacccctaagtgggaag gaaagggaagttagccaacagcctcatcttccagtcacccagaaactccattctcctgag 
ccagaaaccctctcctggaactccgaagcagctgagatgtcactgcgctactttgtgacc atgtttaggaattgcaccttctgtggggcagccggctatgcgaccgccggcaaggcactt aactgccagcttactcaaccatatggtgaggatgagataaaatgtccgtggagtgcctgg cacatgctgggtgtgcagcacattagtgactccattatgaaattggtgatgcttgtggtc aaagtgtggtgcagaaaacccttcacccccgtgtcaaaaggagctgacgaagaagcaaag atgcccttgccaagtgacccattgagaggtgatcctagagtaaacacacttcttccccct ctgtgttcattaatgttccagctgtgtagaggactgaaaagtaaggccactcagcaaaat cagtgctgccaaccatggggtgttcaggggcacagtcctcaccctgggtatcagggacgg ctgggcactggtcagattgcaacggcccaagcctccctgcagcaggatgccacgtcgtca tctcacaaaggcttgctgagttcagcacacaggatccatggctgccggaaacctggaatt aattcccctttattctcttcattcccaatgatgtgtttcaagttctga >gi568815583f:58332033_58668824|GENSCAN_predicted_peptide_5|353_aa MTYSKHSSNLPVDSKKAYKAPRLRYATWLLQGRAAAAVLVSLCPTSPQHLSQEGIHTRPQ GRLHGNRCPLPYPLGEGQRESLEKLWSWEKRAAVREQPARMGSQETQSSGTDFLGRWRSP MASLALSFLICKRTMGQIGASLRESTSGFDQGTDPNMACKRGLFGVGPQDGYRGRKKATK ANLKMCRNPEVSSNLLMEVLECVFSLILSPTHHTACTGSSKLHAKCLYPSVSNPCKRLQE VDSNLFSPSVDCPWSSTGGVWLLEWQPRASSPVKGDIVLLPHTCTLGWCLPSGLTALFSS SMDMEEPGEKEDSVSSAPYICPCHPPSSNQNNSFPPLDCSPNLCTLEQDVFID >gi568815583f:58332033_58668824|GENSCAN_predicted_CDS_5|1062_bp atgacgtattctaagcactcctcaaatttacctgtggacagcaagaaggcctacaaggca ccccgtttacgatatgccacttggctccttcagggcagagctgcagcagcagtccttgtt tctctgtgccccacatcccctcagcatctgtcacaggaaggaatccacacccgtccccaa ggacgcctgcacgggaacagatgcccgctcccttaccccttgggggaaggacagagggag agtttagagaagctctggagctgggaaaagagggcagctgttagagaacagccagcgaga atgggaagtcaggagacccagagctcaggaacagacttccttggcaggtggcgtagtccc atggcctctttagctctcagttttctcatctgcaagaggacaatgggccagattggtgct tctctgagagaaagtaccagtggttttgaccaagggactgacccgaacatggcatgtaag aggggtttgtttggggttggtccacaggatggttacagaggaaggaagaaagccaccaag gccaatttgaagatgtgcagaaacccagaagtgagttcaaacctgctcatggaagtcctg gaatgtgttttctccctcatcctcagtcccacccatcacacagcctgcactggatcatcc aagttacatgccaaatgtctgtacccctctgtgtccaatccttgtaaaagactccaagaa gtcgattccaatctcttctctcccagcgtggactgcccctggagcagcactgggggtgtg 
tggctgttagagtggcagccaagggccagctccccagtgaaaggtgatattgtgcttcta ccgcacacctgcacgttgggatggtgtttgccttccggcctgactgctctgttctcatct tccatggacatggaggaaccaggggaaaaagaagattctgtgtcaagtgcaccctacatc tgtccatgtcaccccccttcttccaatcagaacaactcatttcccccacttgactgctcc cctaatctttgcaccttggagcaggatgtgtttattgattga >gi568815583f:58332033_58668824|GENSCAN_predicted_peptide_6|142_aa MRSQLLGLQRLLLLAAVLPLPSCPGVQLPSASARGAVLVLEAVPADPGVQALNRCAVPDP QWENPVNTKSCQTTFSCGFSKRRTGFDVRDTWHCELKSGSEELTDTRSPQAKVLLHKQEA PERHARFGRCEQEPTVRDEWVR >gi568815583f:58332033_58668824|GENSCAN_predicted_CDS_6|429_bp atgaggtcccagctgctgggactgcagaggctgttgcttctggctgccgtcctccccctc cccagctgtccaggcgtccagctcccctctgccagtgctagaggagcggtccttgtgtta gaagcagttccagctgaccccggagtccaagctctcaaccgctgtgcagtccctgaccct cagtgggaaaaccctgtgaacaccaagagttgccaaacaacttttagctgtggtttttct aagcggcggacaggatttgatgtaagggatacctggcactgtgagctcaagagcggttct gaggagctcacagacacaaggagcccgcaagcaaaggttctgctccacaagcaggaagcc cctgagagacatgcacgttttggccgctgtgaacaagagcctacggtcagggatgagtgg gtcaggtga >gi568815583f:58332033_58668824|GENSCAN_predicted_peptide_7|169_aa MGEIWIGCENGKVVSLMGAMHPKSINEWPVQKLKWVWLGRPEVTAELGFEPGFCDSQSNA SSTVSDGVGCRSLQLSGGKGSVNDVLLRHLFMDFHPVPGIFGFKEGGGELWPGFHLGVIS ILHTSQALGSQRPEIFASAALLPNNWLAVVHQSRKRPVLAGHSETAFVK >gi568815583f:58332033_58668824|GENSCAN_predicted_CDS_7|510_bp atgggagaaatatggattggctgcgagaatggtaaggttgtgtcactaatgggagcaatg caccccaaaagtatcaatgaatggcctgtacagaagctgaagtgggtgtggctgggaagg cctgaagttactgcagagctaggatttgaacctgggttttgtgactcccagtctaatgct tcttccacagtatcagatggtgtgggctgcaggtctctgcagctgtcaggaggaaaaggc tctgtgaatgatgtcttactcaggcacctgtttatggatttccatcctgtcccagggata tttgggtttaaggaggggggaggtgagctgtggccaggcttccacctgggtgtaattagt atcctccacacttctcaggctctgggctctcagagacctgaaatttttgcaagtgctgcc ctcctccctaataactggctggcagtggttcatcagagccggaagcgaccagtattagca ggtcacagtgaaactgcctttgtgaaatga >gi568815583f:58332033_58668824|GENSCAN_predicted_peptide_8|1328_aa MRTWSAWGSTSHLILAHISTLLDTFKQAAPWDCTDVAAGCWSALGSQYHTASTRELPPHK 
RRLLPPCARAPGLWIEEFHCLGLILTHQLGGTSIEWMAAAPPHSVSVISEATAMQENRNR SQPHPVFFGMGTLNGLSSPATTERAPSHRRHMTTVPRQSRSGERPLTRAKQVLMRRHSKG SVSSAKPNPLSKETFLEGNTELVNVMRGRTEQLRCRLCTYHNVDESHLMLGEKSQAQKSI YCVNLLTRSSRAGKTNLRGTEQARLHSPRCCQLQALLTPTSGPCVPASKSTRSCSFLPLL PSQNATSFCQDLRHCYGPAPSLLMSGLCPLAEPIPLPPTSGTAKDFSVSSRVQFLPKSSA HLSCQCELILCTLASGGSKHFAICRVSSARDEAADVGSSLWGPPFLSQDRNRCAADPRVR QVSKKMRVTKRLRKAEGHRVHAAAMTSTSYHPQILITGVTTSSEDMRKAPVSEEPFGRRA QAVETNKTLHEMKTRFLLFGETNQGCQIRINHPDTLQECGFNSSLPLVMIIHGWSAHSAH PAKSSITQSAGQSQCYGFRGEVGQLHAEGGVQMALRDSPHNKEFSGPKCQLCMAEKRHSK PFLQHYPGAGEGRRVDGVLENWIWQMVAALKSQPAQPVNVGLVDWITLAHDHYTIAVRNT RLVGKEVAALLRWLEESVQLSRSHVHLIGYSLGAHVSGFAGSSIGGTHKIGRITGLDAAG PLFEGSAPSNRLSPDDANFVDAIHTFTREHMGLSVGIKQPIGHYDFYPNGGSFQPGCHFL ELYRHIAQHGFNGENEVMGREHRPTFQWGLWNSAESTNIRDSGKSSARIGAQGVWSPSPP REKECWSGQGCLSPSRPLLATGQRHALRMNRGHQQEPPLLTPAPCAQANPLPGHQQFPAD HPALIPNFTPPNKAQYRPSGWCMGFIFLGSWGSSPSHSCTGAHWITADSPSLPPGWMSLG ELVDVLEAQFSRRKQRPQGAAVKMQAVEERRARSIHSLLVAITQTIKCSHERSVHLFIDS LLHAGTQSMAYPCGDMNSFSQGLCLSCKKGRCNTLGYHVRQEPRSKSKRLFLVTRAQSPF KGSILHKTFDAAISSLWFSQQPGEDGRAGYYPILKRSKAKLAWGAHSCAAVRRPGTGPPD SQSAGGDGSTQAPWVGGEGHLLLLLAAAMRAPFLAGRRDDEPSCDSGSRSVFGFSQLLLW KVSTDFRESSRRSRGCRRQHRLGVGIRSVKNLGLAANHLCDLDLMTKPMWAPRKRFLQKR AVMQDSPCRHGVSQECSCKLKGASAISAGSVPSPFPVPLPPVVTPNQKPIDKGARQMQPK GSHPGAQS >gi568815583f:58332033_58668824|GENSCAN_predicted_CDS_8|3987_bp atgaggacttggtctgcatggggcagcacctcgcatctgatactggcgcatatcagcacc ctactggacacattcaaacaggcagcaccctgggactgcactgacgtggctgcaggttgc tggtcagccctagggagccagtaccatacagccagtacccgcgagctccctcctcacaag aggcggctgttgcctccttgtgccagagctcctgggctttggatagaggaattccactgc ttgggtctcatccttactcaccaacttggaggaacttccatagagtggatggcagccgcc cctccccactctgtctctgtgatctctgaagccacagctatgcaggagaacagaaataga agccagccacacccagtcttctttggcatgggcacattgaacgggctcagttctcctgcc acgactgaaagggcaccctcccacagaaggcacatgaccactgttccaagacagtccaga tcaggagagaggcctttgaccagggcaaagcaggtgctgatgagaaggcacagcaaaggc tcagtcagctctgcaaaacccaatccactttccaaggaaacatttctggaaggcaacacg 
gaattagttaatgtgatgagaggcagaacagagcagttaagatgcaggctctgcacatac cacaacgtggatgaatctcacttaatgttgggtgagaagagccaggcccaaaagagcatc tattgtgtgaatctactaactcgaagttcaagagcaggcaaaactaatctccgcggaact gaacaggcccgactccacagcccccggtgctgccagctgcaagctctgctgactccaacc tcaggcccctgcgtgcctgcttctaaatccacacgttcctgctccttcctgcccctgctg ccctctcagaatgccacttctttctgccaagacctcaggcattgctatggcccagcccca tctttattgatgtcggggctctgtccccttgcagagcccatcccattgccacccacatct gggacagcaaaagacttctcagtttcctccagagttcagtttcttccaaaaagctcggca catcttagctgccagtgtgaactaattctctgcaccttggcttccggtggctccaagcac tttgccatctgccgagtcagttctgccagagatgaagcagctgatgtggggagttctctg tgggggccacctttcctctctcaggataggaaccgctgtgctgcagaccccagggtcagg caagtctcaaagaagatgagagtcaccaagcgcctcaggaaagctgaagggcacagggtg catgctgcagccatgacctcaacctcttatcatccccagattcttatcactggtgtgact acctcttctgaagacatgagaaaagctccagtttcagaagagccatttggaagaagagct caagctgttgaaacaaacaaaacgctgcatgagatgaagaccagattcctgctctttgga gaaaccaatcagggctgtcagattcgaatcaatcatccggacacgttacaggagtgcggc ttcaactcctccctgcctctggtgatgataatccacgggtggtcggctcacagtgcccat cccgctaagtccagcataacgcagtctgcagggcagagccagtgctatgggtttaggggt gaggtggggcagctgcacgcagaaggaggcgtacaaatggccctaagggacagcccccac aacaaggaattctctggcccaaaatgtcaactgtgcatggctgagaaacgtcatagcaag cccttcctccagcattatccaggagctggagaaggaagaagggtggacggcgtgctagaa aactggatctggcagatggtggccgcgctgaagtctcagccggcccagccagtgaacgtg gggctggtggactggatcaccctggcccacgaccactacaccatcgccgtccgcaacacc cgccttgtgggcaaggaggtcgcggctcttctccggtggctggaggaatctgtgcaactc tctcgaagccatgttcacctaattgggtacagcctgggtgcacacgtgtcaggatttgcc ggcagttccatcggtggaacgcacaagattgggagaatcacagggctggatgccgcggga cctttgtttgagggaagtgcccccagcaatcgtctttctccagatgatgccaattttgtg gatgccattcatacctttacccgggagcacatgggcctgagcgtgggcatcaaacagccc ataggacactatgacttctatcccaacgggggctccttccagcctggctgccacttccta gagctctacagacatattgcccagcacggcttcaatggtgagaatgaagtcatgggccgg gagcaccggcctacatttcaatggggcctctggaattcagcggaatctaccaacatacgg gactcagggaagagttcagcaaggataggggcccagggtgtatggtcaccaagcccaccc 
agagagaaggaatgctggagtgggcaagggtgcctgtcccccagcaggccactcctggca acaggccaacgccacgccctgaggatgaacagagggcaccaacaggaacctccacttctc actccagccccatgtgcccaggcaaaccctctcccgggccaccagcagttcccagctgac catccagccctcatcccaaacttcaccccacccaacaaagctcagtacagaccatcagga tggtgtatgggctttattttcctgggttcctggggtagttctcccagccacagctgcaca ggggctcactggataactgcagattcaccttcactacctcctgggtggatgagcttgggt gagttagtggatgtcttggaagctcagttttctcgtcgaaaacagcggcctcaaggagct gcagtgaagatgcaggcagtggaggagcggagggcacgcagcatacacagcctcctagta gccatcacccagaccataaaatgctcccacgagcgatcggtgcaccttttcatcgactcc ttgctgcacgccggcacgcagagcatggcctacccgtgtggtgacatgaacagcttcagc cagggcctgtgcctgagctgcaagaagggccgctgcaacacgctgggctaccacgtccgc caggagccgcggagcaagagcaagaggctcttcctcgtaacgcgagcccagtcccccttc aaagggagcattttacacaaaacattcgatgcagccatcagctccttgtggttctcccaa cagccaggcgaggatggcagggctggttattatcccattctaaagaggagcaaagcaaaa ctcgcgtggggtgcccacagctgtgcagctgtaaggagacctggaacagggccacctgac tctcagtcggctggaggagatggcagcacgcaggctccctgggtcgggggtgaggggcac ttgcttctgctgctcgccgcagccatgagagcacccttccttgctggtcggcgagatgac gagccaagctgtgactcgggcagtcggtctgtttttggcttctctcagctacttctttgg aaggtttccactgacttcagggagagttccagaagaagcagggggtgtcgtagacagcac aggcttggggttgggataaggagtgtcaagaatcttggccttgctgccaaccacttatgt gaccttgacctaatgacaaaacctatgtgggcaccaaggaagagatttctccaaaaaagg gcagtcatgcaagattctccgtgcagacatggagtttctcaggagtgctcctgcaaactc aagggtgcctcagccatcagcgcaggttccgtccccagcccgttccctgtgcctctccct ccagtggtcacacccaaccagaaaccaatcgacaagggagcccggcagatgcagcctaag ggatcccatcctggggctcagagctga >gi568815583f:58332033_58668824|GENSCAN_predicted_peptide_9|247_aa MAILVRQDIHPSPNPHVPPQPLQLPPTAQCYALWAEQTGKPEMAPLGPAVQRQVVRGGAG VPKRSQEPFQGKGIASNKTYSFLITLDVDIGELIMIKFKWENSAVWANVWDTVQTIIPWS TGPRHSGLVLKTIRVKAGETQQRVSPLPVANAACLWSLRTITRTPGFSSFGAGESPGDVS AASPRPKSQNRDGNIKLYTRNIAAKCKWEKKMTFCSENTDDLLLRPTQEKIFVKCEIKSK TSKRKIR >gi568815583f:58332033_58668824|GENSCAN_predicted_CDS_9|744_bp atggccatcctggtcagacaggacatccacccaagcccgaaccctcacgtccctcctcag cctctccagctcccaccaactgctcaatgttatgcactgtgggctgagcagacagggaag 
ccagagatggccccactagggcctgcagtgcagaggcaggtggtgaggggcggagcaggt gttcccaagaggtcccaagagcctttccagggcaaaggaattgctagtaataaaacgtat tcctttcttatcacgctggatgtggatatcggcgagctgatcatgatcaagttcaagtgg gaaaacagtgcagtgtgggccaatgtctgggacacggtccagaccatcatcccatggagc acagggccgcgccactcaggcctcgttctgaagacgatcagagtcaaagcaggagaaacc cagcaaagggtctccccactgcccgtggctaacgctgcctgcctctggagcctgaggacc atcacacgaaccccaggcttctcaagcttcggtgcaggtgaatcaccaggtgatgtcagt gctgccagtccaagaccaaaatcccagaatcgggatgggaacattaaactctacactcgt aatatcgctgctaagtgcaagtgggaaaagaaaatgacattttgttcagaaaacacagat gacctactacttcgcccaacccaggaaaaaatcttcgtgaaatgtgaaataaagtctaaa acatcaaagcgaaagatcagatga >gi568815583f:58332033_58668824|GENSCAN_predicted_peptide_10|48_aa MASPESLLGMPAVRMCPRPTVTEMPVNKTCIAIQAFAVFAADMTQGEH >gi568815583f:58332033_58668824|GENSCAN_predicted_CDS_10|147_bp atggcatcgcctgagagcttgttaggaatgcctgctgtcaggatgtgccccagacctacg gtgacagaaatgccagtgaacaagacgtgtattgcaatccaagcttttgcagtatttgcg gccgacatgactcaaggtgaacattga >gi568815583f:58332033_58668824|GENSCAN_predicted_peptide_11|623_aa XYPHKYGPQGGCADHSVFERMRKYQMTGVEEVTQIPQEEHAANGPELLRKKRTTSAEKNT CQLYIQTDHLFFKYYGTREAVIAQISSHVKAIDTIYQTTDFSGIRNISFMVKRIRINTTA DEKDPTNPFRFPNIGVEKFLELNSEQNHDDYCLAYVFTDRDFDDGVLGLAWVGAPSDEET CVQKLCDQPNTTQLEVEQDLESGPFDSRFSSFGSSGGICEKSKLYSDGKKKSLNTGIITV QNYGSHVPPKVSHITFAHEVGHNFGSPHDSGTECTPGESKNLGQKENGNYIMYARATSGD KLNNNKFSLCSIRNISQVLEKKRNNCFVESGQPICGNGMVEQGEECDCGYSDQCKDECCF DANQPEGRKCKLKPGKQCSPSQGPCCTAQCAFKSKSEKCRDDSDCAREGICNGFTALCPA SDPKPNFTDCNRHTQVCINGQCAGSICEKYGLEECTCASSDGKDDKELCHVCCMKKMDPS TCASTGSVQWSRHFSGRTITLQPGSPCNDFRGYCDVFMRCRLVDADGPLARLKKAIFSPE LYENIAEWIVAHWWAVLLMGIALIMLMAGFIKICSVHTPSSNPKLPPPKPLPGTLKRRRP PQPIQQPQRQRPRESYQMGHMRR >gi568815583f:58332033_58668824|GENSCAN_predicted_CDS_11|1872_bp nactatccccataaatacggtcctcaggggggctgtgcagatcattcagtatttgaaaga atgaggaaataccagatgactggtgtagaggaagtaacacagatacctcaagaagaacat gctgctaatggtccagaacttctgaggaaaaaacgtacaacttcagctgaaaaaaatact tgtcagctttatattcagactgatcatttgttctttaaatattacggaacacgagaagct 
gtgattgcccagatatccagtcatgttaaagcgattgatacaatttaccagaccacagac ttctccggaatccgtaacatcagtttcatggtgaaacgcataagaatcaatacaactgct gatgagaaggaccctacaaatcctttccgtttcccaaatattggtgtggagaagtttctg gaattgaattctgagcagaatcatgatgactactgtttggcctatgtcttcacagaccga gattttgatgatggcgtacttggtctggcttgggttggagcaccttcagatgaggaaact tgtgttcagaaactttgtgaccaacctaataccacacagctagaggtggaacaggatctg gaatctggaccttttgactccagattcagttcatttggaagctctggaggaatatgtgaa aaaagtaaactctattcagatggtaagaagaagtccttaaacactggaattattactgtt cagaactatgggtctcatgtacctcccaaagtctctcacattacttttgctcacgaagtt ggacataactttggatccccacatgattctggaacagagtgcacaccaggagaatctaag aatttgggtcaaaaagaaaatggcaattacatcatgtatgcaagagcaacatctggggac aaacttaacaacaataaattctcactctgtagtattagaaatataagccaagttcttgag aagaagagaaacaactgttttgttgaatctggccaacctatttgtggaaatggaatggta gaacaaggtgaagaatgtgattgtggctatagtgaccagtgtaaagatgaatgctgcttc gatgcaaatcaaccagagggaagaaaatgcaaactgaaacctgggaaacagtgcagtcca agtcaaggtccttgttgtacagcacagtgtgcattcaagtcaaagtctgagaagtgtcgg gatgattcagactgtgcaagggaaggaatatgtaatggcttcacagctctctgcccagca tctgaccctaaaccaaacttcacagactgtaataggcatacacaagtgtgcattaatggg caatgtgcaggttctatctgtgagaaatatggcttagaggagtgtacgtgtgccagttct gatggcaaagatgataaagaattatgccatgtatgctgtatgaagaaaatggacccatca acttgtgccagtacagggtctgtgcagtggagtaggcacttcagtggtcgaaccatcacc ctgcaacctggatccccttgcaacgattttagaggttactgtgatgttttcatgcggtgc agattagtagatgctgatggtcctctagctaggcttaaaaaagcaatttttagtccagag ctctatgaaaacattgctgaatggattgtggctcattggtgggcagtattacttatggga attgctctgatcatgctaatggctggatttattaagatatgcagtgttcatactccaagt agtaatccaaagttgcctcctcctaaaccacttccaggcactttaaagaggaggagacct ccacagcccattcagcaaccccagcgtcagcggccccgagagagttatcaaatgggacac atgagacgctaa