GENSCAN 1.0    Date run: 4-Nov-116    Time: 21:28:13

Sequence gi568815579f:45368587_45573009 : 204423 bp : 51.63% C+G : Isochore 3 (51 - 57 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.16 Intr -    157     44  114  1  0    6   83   180 0.994  10.45
 1.15 Intr -    406    344   63  1  0   64  121    85 0.992   8.81
 1.14 Intr -    561    484   78  0  0   51   70    93 0.936   4.04
 1.13 Intr -   1646   1547  100  1  1   38   98   316 0.993  28.21
 1.12 Intr -   2148   2075   74  0  2   20   71    34 0.038  -6.40
 1.11 Intr -  14140  13941  200  2  2   94   72   420 0.953  40.59
 1.10 Intr -  17142  16976  167  2  2   41   80   363 0.903  30.92
 1.09 Intr -  17371  17238  134  1  2   64   93   285 0.999  26.55
 1.08 Intr -  17594  17442  153  2  0  100   64   248 0.520  24.38
 1.07 Intr -  23754  23294  461  1  2   94  115   435 0.952  40.29
 1.06 Intr -  27300  26850  451  0  1  126  113   298 0.932  29.55
 1.05 Intr -  27673  27582   92  2  2   88  100    78 0.902   9.21
 1.04 Intr -  27850  27752   99  2  0   77   92   214 0.900  21.28
 1.03 Intr -  28472  27959  514  2  1   83   61   529 0.988  42.83
 1.02 Intr -  29561  29419  143  0  2   63   98   176 0.955  16.68
 1.01 Init -  29759  29678   82  2  1  102   58    99 0.752   8.65
 1.00 Prom -  30230  30191   40                              -4.01

 2.00 Prom +  33542  33581   40                              -4.11
 2.01 Init +  33976  34036   61  0  1   49   39    79 0.570  -1.51
 2.02 Intr +  34139  34180   42  0  0   94   78    18 0.340   0.10
 2.03 Intr +  34251  34371  121  1  1   95    9    89 0.491   1.76
 2.04 Intr +  37754  38394  641  2  2   72   19   208 0.624   4.63
 2.05 Intr +  38508  38649  142  1  1  101  101   113 0.977  13.82
 2.06 Term +  39547  40915 1369  1  1  110   34  1168 0.857 105.04
 2.07 PlyA +  41155  41160    6                              -0.45

 3.12 PlyA -  41227  41222    6                               1.05
 3.11 Term -  42898  42761  138  1  0    6   48   144 0.770   0.47
 3.10 Intr -  45159  45091   69  0  0   99   69    84 0.951   7.47
 3.09 Intr -  45448  45377   72  1  0  117   55    73 0.985   7.00
 3.08 Intr -  46374  46275  100  2  1   65   76   202 0.999  17.31
 3.07 Intr -  48338  48235  104  2  2   84   78    96 0.546   7.67
 3.06 Intr -  50611  50512  100  0  1   84   91   184 0.987  18.91
 3.05 Intr -  51841  51738  104  1  2  109   99   127 0.999  15.37
 3.04 Intr -  52807  52592  216  1  0  117   94   189 0.957  21.63
 3.03 Intr -  54795  54684  112  2  1  104   46   103 0.943   8.58
 3.02 Intr -  55885  55751  135  0  0  104   77    28 0.382   3.49
 3.01 Init -  57286  57204   83  1  2   46   76    10 0.203  -4.01
 3.00 Prom -  60936  60897   40                              -1.71

 4.00 Prom +  61630  61669   40                              -3.21
 4.01 Init +  73881  73942   62  2  2   72   70    63 0.102   3.57
 4.02 Intr +  79149  79191   43  0  1  108   89    19 0.050   2.73
 4.03 Intr +  87189  87501  313  2  1   88   44   100 0.012   1.91
 4.04 Intr +  99792 100126  335  1  2   25  111   217 0.215  13.54
 4.05 Intr + 102043 102363  321  0  0   88  113   165 0.985  15.61
 4.06 Intr + 102608 102715  108  1  0   54   77   107 0.963   7.18
 4.07 Term + 103965 104426  462  2  0   93   43   471 0.942  38.54
 4.08 PlyA + 106571 106576    6                               1.05

 5.12 PlyA - 106616 106611    6                               1.05
 5.11 Term - 117203 117122   82  1  1  136   54    65 0.937   5.36
 5.10 Intr - 117527 117469   59  2  2  114   60    48 0.935   2.77
 5.09 Intr - 119931 119780  152  1  2  103   37   137 0.709  10.49
 5.08 Intr - 120120 120051   70  0  1   79   80    73 0.664   4.85
 5.07 Intr - 120400 120262  139  0  1  130   32   209 0.630  20.47
 5.06 Intr - 120967 120760  208  2  1   80   44   223 0.431  15.76
 5.05 Intr - 125834 125580  255  0  0   56   87    89 0.429   3.55
 5.04 Intr - 126419 125940  480  0  0   57  100   489 0.540  40.59
 5.03 Intr - 126553 126509   45  2  0   83  115   -10 0.489   0.07
 5.02 Intr - 127208 127154   55  2  1  128    6    47 0.538  -0.46
 5.01 Init - 128316 128206  111  0  0   82   73    97 0.619   5.79
 5.00 Prom - 129131 129092   40                              -7.50

 6.00 Prom + 129140 129179   40                              -6.70
 6.01 Init + 129887 130825  939  1  0   76   29   972 0.939  83.94
 6.02 Intr + 131363 131480  118  1  1   86   93    64 0.992   7.24
 6.03 Intr + 131870 131975  106  0  1   78   89    28 0.986   1.67
 6.04 Intr + 132064 132124   61  1  1   93   94    62 0.818   6.53
 6.05 Intr + 137078 137151   74  1  2   68   97    11 0.018  -1.30
 6.06 Intr + 149077 149248  172  1  1  123   61   266 0.993  27.86
 6.07 Intr + 149343 149508  166  2  1  113   77   296 0.968  31.05
 6.08 Intr + 152736 152820   85  1  1   93   81    30 0.951   2.18
 6.09 Intr + 153582 153631   50  0  2  129   64    60 0.909   6.51
 6.10 Intr + 153817 153995  179  2  2  120  100    27 0.431   7.26
 6.11 Intr + 154132 154232  101  0  2   79   83    96 0.857   7.61
 6.12 Intr + 155511 155556   46  0  1  104  102    49 0.407   6.80
 6.13 Intr + 155892 156074  183  2  0    7   93    94 0.626   2.30
 6.14 Term + 156292 156489  198  0  0   83   50   124 0.426   5.82
 6.15 PlyA + 157287 157292    6                               1.05

 7.05 PlyA - 158242 158237    6                               1.05
 7.04 Term - 160870 160470  401  2  2  112   49   665 0.609  60.65
 7.03 Intr - 184927 184753  175  1  1  -34   72   158 0.309   2.03
 7.02 Intr - 185325 184994  332  1  2  128   69   748 0.654  72.80
 7.01 Init - 188394 188331   64  0  1   74   43    79 0.571   1.27
 7.00 Prom - 194237 194198   40                              -2.51

 8.03 PlyA - 194336 194331    6                               1.05
 8.02 Term - 201696 201416  281  1  2   90   29   150 0.926   5.25
 8.01 Init - 203487 203457   31  0  1  108   42    49 0.822   2.14

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815579f:45368587_45573009|GENSCAN_predicted_peptide_1|975_aa
MQAPAPAGTMDSEAFQSARDFLDMNFQSLAMKHMDLKQMELDTAAAKVDELTKQLESLWS
DSPAPPGPQAGPPSRPPRYSSSSIPEPFGSRGSPRKAATDGADTPFGRSESAPTLHPYSP
LSPKGRPSSPRTPLYLQPDAYGSLDRATSPRPRAFDGAGSSLGRAPSPRPGPGPLRQQGP
PTPFDFLGRAGSPRGSPLAEGPQAFFPERGPSPRPPATAYDAPASAFGSSLLGSGGSAFA
PPLRAQDDLTLRRRPPKAWNESDLDVAYEKKPSQTASYERLDVFARPASPSLQLLPWRES
SLDGLGGTGKDNLTSATLPRNYKVSPLASDRRSDAGSYRRSLGSAGPSGTLPRSWQPVSR
IPMPPSSPQPRGAPRQRPIPLSMIFKLQNAFWEHGASRAMLPGSPLFTRAPPPKLQPQPQ
PQPQPQSQPQPQLPPQPQTQPQTPTPAPQHPQQTWPPVNEGPPKPPTELEPEPEIEGLLT
PVLEAGDVDEGPVARPLSPTRLQPALPPEAQSVPELEEVARVLAEIPRPLKRRGSMEQAP
AVALPPTHKKQYQQIISRLFHRHGGPGPGGPEPELSPITEGSEARAGPPAPAPPAPIPPP
APSQSSPPEQPQSMEMRSVLRKAGSPRKARRARLNPLVLLLDAALTGELEVVQQAVKEVS
AAEREMNDPSQPNEEGITALHNAICGANYSIVDFLITAGANVNSPDSHGWTPLHCAASCN
DTVICMALVQHGAAIFATTLSDGATAFEKCDPYREGYADCATYLADVEQSMGLMNSGAVY
ALWDYSAEFGDELSFREGESVTVLRRDGPEETDWWWAALHGQEGYVPRNYFGSQPPSLFR
AHAQYPDWLCPSGLTGRLNVDGLLVYFPYDYIYPEQFSYMRELKRTLDAKGHGVLEMPSG
TGKTVSLLALIMAYQRAYPLEVTKLIYCSRTVPEIEKVIEELRKLLNFYEKQEGEKLPFL
GLALSSRKNLCIHPE
>gi568815579f:45368587_45573009|GENSCAN_predicted_CDS_1|2925_bp
atgcaggcgcccgctccggccggcaccatggacagcgaggcattccagagcgcgcgggac
tttctggacatgaacttccagtcgctggccatgaaacacatggatctgaagcagatggag
ctggacacggcggcggccaaggtggatgaactgaccaagcagctggagtcgctgtggtca
gactctcccgcgcctcctggcccgcaggccggacccccttctaggccgccccggtacagc
tccagctcgatccctgagcccttcggcagccgagggtccccccggaaggcggccaccgac
ggcgcagacaccccgttcggacgatcagagagtgccccaaccctacacccctacagcccg
ctgtcccccaagggacggccgtcgtcgccgcgcaccccgctctacctgcagccggacgcc
tacggcagcctggaccgcgcgacctcgccccggccccgcgccttcgatggcgcaggcagc
tccctcggccgtgcgccctccccgcggcccgggccaggcccgctccgccagcagggtccc
cccacgcctttcgacttcctgggccgcgcaggctccccccgcggcagccccctggcggag
gggccccaggccttcttccccgagcgtgggccgtcaccgcgcccccctgccacagcctac
gacgcgccagcgtccgccttcgggagctccctgctaggctccggcggcagcgcattcgcc
ccgcctctgcgcgcgcaagacgacctgacgctgcgccggcggcctccgaaagcctggaac
gagtctgacctggacgtggcgtacgagaagaagccttcgcagacagcgagctatgaacgc
ctggacgtcttcgcaaggcctgcctcgccgagcctgcagctgttgccttggagggagagc
agcctggatggactggggggcaccggcaaggacaacctcactagcgccaccctgccgcgc
aattacaaggtctctcctctggccagcgaccggcgttcagacgcgggcagctaccggcgc
tcgctgggctccgcggggccgtcgggcactttgcctcgcagctggcagcccgtcagccgc
atccccatgcccccctccagcccccagccccgcggggccccgcgccagcgtcccatcccc
ctcagcatgatcttcaagctgcagaacgccttctgggagcacggggccagccgcgccatg
ctccctgggtcccccctcttcacccgagcacccccgcctaagctgcagccccaaccacaa
ccacagccccagccacaatcacaaccacagccccagctgcccccacagccccagacccaa
ccccaaacccctaccccagccccccagcatccccaacagacatggccccctgtgaacgaa
ggaccccccaaaccccccaccgagctggagcctgagccggagatagaggggctgctgaca
ccagtgctggaggctggcgatgtggatgaaggccctgtagcaaggcctctcagccccacg
aggctgcagccagcactgccaccggaggcacagtcggtgcccgagctggaggaggtggca
cgggtgttggcggaaattccccggcccctcaaacgcaggggctccatggagcaggcccct
gctgtggccctgccccctacccacaagaaacagtaccagcagatcatcagccgcctcttc
catcgtcatggggggccagggcccggggggccggagccagagctgtcccccatcactgag
ggatctgaggccagggcagggccccctgctcctgccccaccagctcccattccacccccg
gccccgtcccagagcagcccaccagagcagccgcagagcatggagatgcgctctgtgctg
cggaaggcgggctccccgcgcaaggcccgccgcgcgcgcctcaaccctctggtgctcctc
ctggacgcggcgctgaccggggagctggaggtggtgcagcaggcggtgaaggaggtgagc
gctgctgagcgggagatgaacgacccgagccagcccaacgaggagggcatcactgccttg
cacaacgccatctgcggcgccaactactctatcgtggatttcctcatcaccgcgggtgcc
aatgtcaactcccccgacagccacggctggacacccttgcactgcgcggcgtcgtgcaac
gacacagtcatctgcatggcgctggtgcagcacggcgctgcaatcttcgccaccacgctc
agcgacggcgccaccgccttcgagaagtgcgacccttaccgcgagggttatgctgactgc
gccacctacctggcagacgtcgagcagagtatggggctgatgaacagcggggcagtgtac
gctctctgggactacagcgccgagttcggggacgagctgtccttccgcgagggcgagtcg
gtcaccgtgctgcggagggacgggccggaggagaccgactggtggtgggccgcgctgcac
ggccaggagggctacgtgccgcggaactacttcgggagtcagccgccctcgcttttccgt
gcgcacgcgcagtatcccgattggctctgccctagcggattgacgggcaggctcaacgtg
gacgggctcctggtctacttcccgtacgactacatctaccccgagcagttctcctacatg
cgggagctcaaacgcacgctggacgccaagggtcatggagtcctggagatgccctcaggc
accgggaagacagtatccctgttggccctgatcatggcataccagagagcatatccgctg
gaggtgaccaaactcatctactgctcaagaactgtgccagagattgagaaggtgattgaa
gagcttcgaaagttgctcaacttctatgagaagcaggagggcgagaagctgccgtttctg
ggactggctctgagctcccgcaaaaacttgtgtattcaccctgag
>gi568815579f:45368587_45573009|GENSCAN_predicted_peptide_2|791_aa
MRSGPGQVLYIPGQGSWAGRALAPSSPAGSGPAAARFFRAKDPVDFLEESSSVGTGSESQ
TELGTPRLDSYSPVGLRKSSVAAPLRRRCNKPYWTDDGGDPSGPDGPLTPAIQRISGFRE
YIFRKKTSSSVFCSAHVEIFPSFSCRSGVEQCLPPRSRTSWALPDDALPLWIHVVCNLVR
AARATGLPEVWVPGWRSPRPAVRVRVDGVRRVRWWKEKGASERVRAEKEAYLQAGLAKSV
HSQWANGNSNGERVILWGATRGEQGAPRGWIGEIEVAPGEQVGRKEKPDAARFSCPPNFT
AKPPASESPRFSLEALTGPDTELWLIQAPADFAPECFNGRHVPLSGSQIVKGKLAGKRHR
YRVLSSCPQAGEATLLAPSTEAGGGLTCASAPQGTLRILEGPQQSLSGSPLQPIPASPPP
QIPPGLRPRFCAFGGNPPVTGPRSALAPNLLTSGKKKKEMQVTEAPVTQEAVNGHGALEV
DMALGSPEMDVRKKKKKKNQQLKEPEAAGPVGTEPTVETLEPLGVLFPSTTKKRKKPKGK
ETFEPEDKTVKQEQINTEPLEDTVLSPTKKRKRQKGTEGMEPEEGVTVESQPQVKVEPLE
EAIPLPPTKKRKKEKGQMAMMEPGTEAMEPVEPEMKPLESPGGTMAPQQPEGAKPQAQAA
LAAPKKKTKKEKQQDATVEPETEVVGPELPDDLEPQAAPTSTKKKKKKKERGHTVTEPIQ
PLEPELPGEGQPEARATPGSTKKRKKQSQESRMPETVPQEEMPGPPLNSESGEEAPTGRD
KKRKQQQQQPV
>gi568815579f:45368587_45573009|GENSCAN_predicted_CDS_2|2376_bp
atgcgcagcgggcccgggcaggtgctgtacatcccggggcaagggagctgggccgggcgg
gctctagctccgagctctcccgcgggctctgggccagccgcagccaggttctttagggct
aaggatcctgtggacttcctggaggagtcatcttcagtaggaaccgggtcagagagccag
actgagctgggaacacccaggctggactcctacagccctgtcgggcttaggaagagcagt
gtggctgcccctttaaggaggcgttgcaacaaaccatattggacagacgatgggggcgac
ccatcgggacccgacgggcctctgactccagcaatacagcgaatcagcggctttcgggaa
tacatttttcggaaaaagacttcttcctcggttttctgctctgcacacgttgaaattttc
cccagtttttcctgcagatcgggagtcgagcaatgcctacccccgcgctcccgcaccagt
tgggcgctcccggatgatgccctacccctttggatccacgtggtctgcaacctggtgcga
gcagcccgggctacagggttgcctgaggtgtgggtcccaggatggaggagccccaggccg
gcggtgagggtgcgggttgacggggtgcggagggtgcgttggtggaaggagaaaggggcg
tccgagagggttcgggcggaaaaggaggcgtacctgcaagcaggacttgcgaagagcgtg
cattcccagtgggcgaacgggaattcgaacggagagagggttatcttgtggggggctacc
cgtggagagcaaggcgcccccaggggttggatcggtgaaattgaggtcgcccctggggaa
caggtgggcagaaaggagaaaccagatgctgctcggttctcttgtccccccaactttacc
gcgaagcccccagcctcagagtcccctcgtttctccttggaggcgctgacgggtccagat
acggagctgtggcttattcaggcccctgcagactttgccccagaatgcttcaatgggcgg
catgtgcctctctctggctcccagatcgtcaagggcaaattggcaggcaagcggcaccgc
tatcgagtcctcagcagctgtccccaagctggagaagcgaccctgctggccccctcaacg
gaggcaggaggtggactcacctgtgcctcagccccccagggcaccctaaggatccttgag
ggtccccagcaatccctgtcagggagccctctgcagcccatcccagcaagtcccccacca
cagatccctcctggcctgaggcctcggttctgtgcctttgggggcaacccaccagtcaca
gggcctaggtcagccttggcccccaacctgctcacctcagggaagaagaaaaaggagatg
caggtgacagaggccccagtcactcaggaggcagtgaatgggcacggggccctggaggtg
gacatggctttggggtcgccagaaatggatgtgcggaagaagaagaagaaaaaaaatcag
cagctgaaagaaccagaggcagcagggcctgtggggacagagcccacagtggagacactg
gagcctctgggagtgctgttcccgtccaccaccaagaagaggaagaagcccaaagggaaa
gaaaccttcgagccagaagacaagacagtgaagcaggaacagattaacactgagcctcta
gaagacacagtcctgtccccgaccaaaaagagaaagaggcaaaaggggacggaagggatg
gagccagaggagggggtgacagttgagtctcagccacaggtgaaggtggagccactggag
gaagccatccctctgccccctacgaagaagaggaaaaaagaaaagggacagatggcaatg
atggagccagggacggaggcgatggagccagtggagccggagatgaagcctctggagtcc
ccaggggggaccatggcgcctcaacagccagaaggagcgaagcctcaggcccaggcagct
ctggcagctcccaaaaagaagacgaagaaagaaaaacagcaagatgccacagtggagcca
gagacagaggtggtggggcctgagctgccggatgaccttgagcctcaggcagctcccaca
tccaccaagaagaagaagaagaagaaagagagaggtcacacagtgactgagccaattcag
ccactagagcctgaactgccaggggagggacagcctgaagccagggcaactccgggatcc
accaagaagaggaagaagcagagtcaggaaagccggatgccagagacagtgccccaagag
gagatgccagggccgccactgaattcagagtctggggaggaggctcccacaggccgggac
aagaagcggaagcagcagcagcagcagcctgtgtag
>gi568815579f:45368587_45573009|GENSCAN_predicted_peptide_3|410_aa
MYMQARINKKINEHNRNVRNRRVCMGIWKPFRTPGLSQALPLTPIPRLSRDLPSQTCPFW
VSTGNVSVSLTEPLQMDPGKDKEGVPQPSGPPARKKFVIPLDEDEVPPGVAKPLFRSTQS
LPTVDTSAQAAPQTYAEYAISQPLEGAGATCPTGSEPLAGETPNQALKPGAKSNSIIVSP
RQRGNPVLKFVRNVPWEFGDVIPDYVLGQSTCALFLSLRYHNLHPDYIHGRLQSLGKNFA
LRVLLVQVDVVSNSDFLLQKDPQQALKELAKMCILADCTLILAWSPEEAGRYLETYKAYE
QKPADLLMEKLEQDFVSRVTECLTTVKSVNKTDSQTLLTTFGSLEQLIAASREDLALCPG
LGPQKAITNGGQAITNGGEDVKKREPSYTFDGNVNQYNHYGKQFGGSSKN
>gi568815579f:45368587_45573009|GENSCAN_predicted_CDS_3|1233_bp
atgtacatgcaggcaagaataaataaaaagatcaatgaacataacagaaatgtcagaaat
agacgtgtgtgtatgggaatttggaagcccttccggactccggggctgagtcaggcgctc
cccctgacccccatcccacggctctcccgggatctcccatcccagacctgcccattctgg
gtcagcactgggaacgtgtctgtgtcccttactgaaccgctccagatggaccctgggaag
gacaaagagggggtgccccagccctcagggccgccagcaaggaagaaatttgtgataccc
ctcgacgaggatgaggtccctcctggagtggccaagcccttattccgatctacacagagc
cttcccactgtggacacctcggcccaggcggcccctcagacctacgccgaatatgccatc
tcacagcctctggaaggggctggggccacgtgccccacagggtcagagcccctggcagga
gagacgcccaaccaggccctgaaacccggggcaaaatccaacagcatcattgtgagccct
cggcagaggggcaatcccgtactgaagttcgtgcgcaatgtgccctgggaatttggcgac
gtaattcccgactatgtgctgggccagagcacctgtgccctgttcctcagcctccgctac
cacaacctgcacccagactacatccatgggcggctgcagagcctggggaagaacttcgcc
ttgcgggtcctgcttgtccaggtggatgtggtttctaattctgattttctcctccagaaa
gatccccagcaggccctcaaggagctggctaagatgtgtatcctggccgactgcacattg
atcctcgcctggagccccgaggaagctgggcggtacctggagacctacaaggcctatgag
cagaaaccagcggacctcctgatggagaagctagagcaggacttcgtctcccgggtgact
gaatgtctgaccaccgtgaagtcagtcaacaaaacggacagtcagaccctcctgaccaca
tttggatctctggaacagctcatcgccgcatcaagagaagatctggccttatgcccaggc
ctgggccctcagaaagcaataacaaatggtggacaggcaataacaaatggtggcgaggat
gtgaagaaaagggaaccctcatatacttttgatgggaatgtaaatcagtacaaccactat
ggaaaacagtttggaggctcctctaaaaactga
>gi568815579f:45368587_45573009|GENSCAN_predicted_peptide_4|547_aa
MSLAYAKPITEMKWLESFSTQTSPIMYHENLASGQLCAEEGVEGAREGSVRQPAGCLPWT
AAQSVCRRERVVQEPESPWEKPQPHTWPLTSPGQGTPGSLDKLTESQAEFSHPGHVACCD
PPQHPRVAVWLRGLGRAHRGRLHRNFAIVGTGRCSFPELPRTAYFEDSLSSPGTPTAHPG
LAPYFPNPAIALASRRPQRGHRGPPVPREMFQAFPGDYDSGSRCSSSPSAESQYLSSVDS
FGSPPTAAASQECAGLGEMPGSFVPTVTAITTSQDLQWLVQPTLISSMAQSQGQPLASQP
PVVDPYDMPGTSYSTPGMSGYSSGGASGSGGPSTSGTTSGPGPARPARARPRRPREETLT
PEEEEKRRVRRERNKLAAAKCRNRRRELTDRLQAETDQLEEEKAELESEIAELQKEKERL
EFVLVAHKPGCKIPYEEGPGPGPLAEVRDLPGSAPAKEDGFSWLLPPPPPPPLPFQTSQD
APPNLTASLFTHSEVQVLGDPFPVVNPSYTSSFVLTCPEVSAFAGAQRTSGSDQPSDPLN
SPSLLAL
>gi568815579f:45368587_45573009|GENSCAN_predicted_CDS_4|1644_bp
atgtccctggcctatgcaaagcccatcacagaaatgaagtggctggaatcattcagcact
cagacctcacctattatgtaccatgagaacctggcctctggacagctctgtgctgaagag
ggcgtggagggggccagggaagggagtgtcaggcagccagccggctgcctgccctggaca
gcagcccagagtgtctgcaggagggagagggtagttcaggagcctgagtcaccctgggag
aaaccccagccacatacctggccgctgacatcacccggccagggcacccccggcagccta
gacaagctgactgaatcacaggcggaattcagccaccccgggcacgtggcctgctgtgac
cccccgcaacacccccgagtggccgtctggctgcgggggttgggccgggcacacagggga
agactgcacagaaactttgccattgttggaacgggacgttgctccttccccgagcttccc
cggacagcgtactttgaggactcgctcagctcaccggggactcccacggctcaccccgga
cttgcaccttacttccccaacccggccatagccttggcttcccggcgacctcagcgtggt
cacaggggcccccctgtgcccagggaaatgtttcaggctttccccggagactacgactcc
ggctcccggtgcagctcctcaccctctgccgagtctcaatatctgtcttcggtggactcc
ttcggcagtccacccaccgccgccgcctcccaggagtgcgccggtctcggggaaatgccc
ggttccttcgtgcccacggtcaccgcgatcacaaccagccaggacctccagtggcttgtg
caacccaccctcatctcttccatggcccagtcccaggggcagccactggcctcccagccc
ccggtcgtcgacccctacgacatgccgggaaccagctactccacaccaggcatgagtggc
tacagcagtggcggagcgagtggcagtggtgggccttccaccagcggaactaccagtggg
cctgggcctgcccgcccagcccgagcccggcctaggagaccccgagaggagacgctcacc
ccagaggaagaggagaagcgaagggtgcgccgggaacgaaataaactagcagcagctaaa
tgcaggaaccggcggagggagctgaccgaccgactccaggcggagacagatcagttggag
gaagaaaaagcagagctggagtcggagatcgccgagctccaaaaggagaaggaacgtctg
gagtttgtgctggtggcccacaaaccgggctgcaagatcccctacgaagaggggcccggg
ccgggcccgctggcggaggtgagagatttgccgggctcagcaccggctaaggaagatggc
ttcagctggctgctgccgcccccgccaccaccgcccctgcccttccagaccagccaagac
gcaccccccaacctgacggcttctctctttacacacagtgaagttcaagtcctcggcgac
cccttccccgttgttaacccttcgtacacttcttcgtttgtcctcacctgcccggaggtc
tccgcgttcgccggcgcccaacgcaccagcggcagtgaccagccttccgatcccctgaac
tcgccctccctcctcgctctgtga
>gi568815579f:45368587_45573009|GENSCAN_predicted_peptide_5|551_aa
MSARDPGPRAAQPEWAGVPMQARGGPWGRSCRSSPTASLFSSHIFAPLSSSDAKHEEAPS
TASSTPDSTEGGNDDSDFRELHTAREFSEEDEEETTSQDWGTPRELTFSYIAFDGVVGSG
GRRDSTARRPRPQGRSVSEPRDQHPQPSLGDSLESIPSLSQSPEPGRRGDPDTAPPSERP
LEDLRLRLDHLGWVARGTGSGEDSSTSSSTPLEDEEPQEPNRLETGEAGEELDLRLRLAQ
PSSPEVLTPQLSPGSGTPQAGTPSPSRSRDSNSGPEEPLLEEEEKQWGPLEREPVRGQCL
DSTDQLEFTVEPRLLVADLLYWKDTRTSGVVFTGLMVSLLCLLHFSIVSVAAHLALLLLC
GTISLRVYRKVLQAVHRGDGANPFQAYLDVDLTLTREQTERLSHQITSRVVSAATQLRHF
FLVEDLVDSLKLALLFYILTFVGAIFNGLTLLILGVIGLFTIPLLYRQHQVSVTPSVVST
PAKHGGLTISFQNRGPIDRCPGQTQAQIDQYVGLVTNQLSHIKAKIRAKIPGTGALASAA
AAVSGSKAKAE
>gi568815579f:45368587_45573009|GENSCAN_predicted_CDS_5|1656_bp
atgagcgcccgcgaccccgggcccagggcggcacagccggagtgggcgggggtcccgatg
caggcccgaggggggccatggggcaggtcctgccggtcttcgcccactgcatcactcttc
tccagtcatatcttcgcacctctcagttcctcagatgccaagcatgaagaagctccgtct
acagcctcctcaactcctgattccacagaaggagggaacgacgactctgattttcgagag
ctgcacacagcccgggaattctcagaggaggacgaggaggagaccacgtcgcaggactgg
ggcaccccccgggagctgaccttctcctacatcgcctttgatggtgtagtgggctccggg
ggccgcagggattcaactgcccgccgcccccgcccccagggccgctcagtctcggaacca
cgagaccagcaccctcagcccagcctgggcgacagcttggagagcatccccagcctgagc
caatccccggagcctggacgacggggtgatcctgacaccgcgcctccatccgagcgccct
ctggaagacctgaggcttcggttggaccatctgggctgggtggcccggggaacgggatcc
ggggaggactcttccaccagcagctccaccccgctggaagacgaagaaccccaagaaccc
aacagattggagacaggagaagctggggaagaactggacctacgactccgacttgctcag
ccctcatcgcccgaggtcttgactccccagctcagtccgggctctgggacaccccaggcc
ggtactccgtccccatcccgatcgcgagattcgaactctgggcccgaagagccattgctg
gaagaggaagaaaagcagtgggggccactggagcgagagccagtaaggggacagtgcctc
gatagcacggaccaattagaattcacggtggagccacgccttctagtggcggacctgctg
tactggaaggacacgaggacgtcaggagtggtcttcacaggcctgatggtctccctcctc
tgcctcctgcactttagcatcgtgtccgtggccgcgcacttggctctgttgctgctctgc
ggcaccatctctctcagggtttaccgcaaagtgctgcaggccgtgcaccggggggatgga
gccaaccctttccaggcctacctggatgtggacctcaccctgactcgggagcagacggaa
cgtttgtcccaccagatcacctcccgcgtggtctcggcggccacgcagctgcggcacttc
ttcctggtagaagacctcgtggattccctcaagctggccctcctcttctacatcttgacc
ttcgtgggtgccatcttcaatggtttgactcttctcattctgggagtgattggtctattc
accatccccctgctgtaccggcagcaccaggtgagtgtgacaccctcagttgtcagcacc
ccagctaaacatgggggtctgaccatctccttccagaaccggggtcccattgacagatgt
cccggccagacacaggctcagatcgaccaatatgtggggttggtgaccaatcagttgagc
cacatcaaagctaagatccgagctaaaatcccagggaccggagccctggcctctgcagca
gccgcagtctccggatccaaagccaaagccgaatga
>gi568815579f:45368587_45573009|GENSCAN_predicted_peptide_6|825_aa
MAVLARQLQRLLWTACKKKEREKEGREEEEEEEAGRRAPEGPRSLLTAPRRAQRPHGGAE
ASGGLRFGASAAQGWRARMEDAHCTWLSLPGLPPGWALFAVLDGHGGARAARFGARHLPG
HVLQELGPEPSEPEGVREALRRAFLSADERLRSLWPRVETGGCTAVVLLVSPRFLYLAHC
GDSRAVLSRAGAVAFSTEDHRPLRPRERERIHAAGGTIRRRRVEGSLAVSRALGDFTYKE
APGRPPELQLVSAEPEVAALARQAEDEFMLLASDGVWDTVSGAALAGLVASRLRLGLAPE
LLCAQLLDTCLCKGSLDNMTCILVCFPGAPRPSEEAIRRELALDAALGCRIAELCASAQK
PPSLNTVFRTLASEDIPDLPPGGGLDCKATVIAEVYSQICQVSEECGEMGKLRPREENTV
TQGHTELVTHSTSETVICSSRATVMLYDDGNKRWLPAGTGPQAFSRVQIYHNPTANSFRV
VGRKMQPDQQVVINCAIVRGVKYNQATPNFHQWRDARQVWGLNFGSKEDAAQFAAGMASA
LEALEGGGPPPPPALPTWSVPNGPSPEEVEQQKRQQPGPSEHIERRVSNAGPPPPPGLPP
SGVPAAAHGAGGGPPPAPPLPAAQGPGGGGAGAPGLAAAIAGAKLRKVSKQEEASGGPTA
PKAESGRSGGGGLMEEMNAMLARRICAETLGEEQHNLAKAELNVTPGIGGGEAGPLSNLI
AVPNHTNSPRMKSSSSVTTSETQPCTPSSSDYSDLQRVKQMSALNSFRTLTQLSITNWSW
QEAGMRLPTLVLDVLQYFSGSKDPEFSPRILQSLCDLDSDVMVSM
>gi568815579f:45368587_45573009|GENSCAN_predicted_CDS_6|2478_bp
atggcggtcctggcccgccagctgcagcgtctcctctggaccgcttgcaagaaaaaggag
agggagaaggaggggagggaggaagaggaggaggaggaggcggggcgcagggcccccgaa
gggcctcggtctctgttgacagcgccgcgccgcgcccagcggccgcacgggggtgccgag
gcgtctgggggcctgcgcttcggggcgagcgcagcgcaaggctggcgcgcgcgcatggag
gatgctcactgcacttggctttcgttacctggtctgcccccgggctgggccttgtttgcc
gtcctcgacggccacggtggggctcgagctgcccgcttcggtgcacgccatttgccaggc
catgtgctccaggagctgggcccggagcctagcgagcccgagggcgtgcgcgaggcgctg
cgccgagccttcttgagcgccgacgagcgcctgcgctccctctggccccgcgtggaaacg
ggcggctgcacggccgtggtgttgctggtctccccgcggtttctgtacctggcgcactgc
ggtgactcccgcgcggtgctgagccgcgctggcgccgtggccttcagcacagaggaccac
cggccccttcgaccccgggaacgcgagcgcatccacgccgctggcggcaccatccgccgc
cgccgcgtcgagggctctctggccgtgtcgcgagcgttgggcgactttacctacaaggag
gctccggggaggccccccgagctacagctcgtttctgcggagccagaggtggccgcactg
gcacgccaggctgaggacgagttcatgctcctggcctctgatggcgtctgggacactgtg
tctggtgctgccctggcgggactggtggcttcacgcctccgcttgggcctggccccagag
cttctctgcgcgcagctgttggacacgtgtctgtgcaagggcagcctggacaacatgacc
tgcatcctggtctgcttccctggggcccctaggccttctgaggaggcgatcaggagggag
ctagcactggacgcagccctgggctgcagaatcgctgaactgtgtgcctctgctcagaag
ccccccagcctgaacacagttttcaggactctggcctcagaggacatcccagatttacct
cctgggggagggctggactgcaaggccactgtcattgctgaagtttattctcagatctgc
caggtctcagaagagtgcggagagatgggaaagctgagacccagagaggagaacacagtt
acccaaggtcacactgaactggtaactcattcaactagcgagacggtcatctgttccagc
cgggccactgtgatgctttatgatgatggcaacaagcgatggctccctgctggcacgggt
ccccaggccttcagccgcgtccagatctaccacaaccccacggccaattcctttcgcgtc
gtgggccggaagatgcagcccgaccagcaggtggtcatcaactgtgccatcgtccggggt
gtcaagtataaccaggccacccccaacttccatcagtggcgcgacgctcgccaggtctgg
ggcctcaacttcggcagcaaggaggatgcggcccagtttgccgccggcatggccagtgcc
ctagaggcgttggaaggaggtgggccccctccacccccagcacttcccacctggtcggtc
ccgaacggcccctccccggaggaggtggagcagcagaaaaggcagcagcccggcccgtcg
gagcacatagagcgccgggtctccaatgcaggtccccccccacccccaggtttgccccct
tcgggggtcccagctgcagcgcacggagcagggggaggaccaccccctgcaccccctctc
ccggcagcacagggccctggtggtgggggagctggggccccaggcctggccgcagctatt
gctggagccaaactcaggaaagtcagcaagcaggaggaggcctcaggggggcccacagcc
cccaaagctgagagtggtcgaagcggaggtgggggactcatggaagagatgaacgccatg
ctggcccggagaatctgtgcggagaccctgggagaagaacagcacaaccttgccaaggca
gagctcaatgtgacccctggaataggaggcggggaagcaggtcctctctctaatctcatt
gctgtcccaaaccacaccaactcccccaggatgaagtcgtcttcttcggtgaccacttcc
gagacccaaccctgcacgcccagctccagtgattactcggacctacagagggtgaaacag
atgagtgccttgaattctttccgcacattgacccagctgtccatcaccaattggagttgg
caggaggctggaatgcgcttgccaaccttggtactggatgttctccagtacttttccggc
tccaaggatccagaattctcccctagaatcctccagtcactctgcgaccttgacagcgat
gtcatggtgtcgatgtag
>gi568815579f:45368587_45573009|GENSCAN_predicted_peptide_7|323_aa
MMLSFLCYRGALGQAVIAQAAVYHWVEMRTKMRIMGFRGTVIKPLNEEAAAELGAELLGE
ATIFIVGGGCLVLEYWRHQAQQRHKEEEQRAAWNALRDEVGHLALALEALQAQVQAAPPQ
GALEELRTELQEELAGWNLNLDMAYVPNVAFFPHHPCLRWPSGNHQDLDATWHLVTPADK
SSRYLPLGPAVYHWLEMRTKMRIMGFNAAAIKPLNEGAAAELGAELLGEGIIFITACSCL
MLEYWRHQLQQRRKEKERRVAREALRGEVGHLGLALEELQAQVQATSTQLALEELRAQLQ
EVRAHLCLRDPPPAPPVAPASEK
>gi568815579f:45368587_45573009|GENSCAN_predicted_CDS_7|972_bp
atgatgctgtccttcctgtgttacagaggggccttggggcaggctgtcattgctcaggca
gcagtgtatcactgggtggagatgcggaccaagatgcgcatcatgggcttccggggcacg
gtcatcaagccgctgaacgaggaggcggcagctgagctgggcgcagagctgctgggcgaa
gccaccatcttcatcgtgggcggcggctgcctagtgctggagtactggcgccaccaggcg
cagcagcgccacaaggaggaggagcagcgtgctgcctggaacgcgctgcgggacgaggtg
ggccacctggcgctggcgctggaagcgctgcaggcgcaggtgcaggcggcgccgccacag
ggcgccctggaggaactgcgcacagagctgcaagaggagcttgctggatggaacctgaat
ttggacatggcctatgtacctaacgtggccttcttcccgcaccacccttgcctgcgctgg
cccagtggaaaccaccaggatcttgatgcaacttggcatttggttacccctgctgataag
agcagccgttacctgccactgggaccagcagtgtaccactggctggagatgcggaccaaa
atgcgcatcatgggtttcaatgccgctgccatcaagccgctgaacgagggtgcagccgcc
gagctgggcgcggagctgctgggcgagggcatcatcttcatcaccgcctgcagctgcctg
atgctggagtattggcgccaccagttgcagcagcgccgcaaggaaaaggagcgacgtgtt
gccagggaggcgctgcggggcgaggtgggccacttggggctggcgctcgaggagttgcag
gcgcaggtgcaggcgacgtcgacgcagctcgccctggaggagctgcgcgctcagctgcag
gaggtgcgagcccacctctgcctccgagacccgccgcctgcacccccagttgcgccggcg
tccgagaaatag
>gi568815579f:45368587_45573009|GENSCAN_predicted_peptide_8|103_aa
MSESQQKNMAVPPFIMQRHLLEFAHVLMAKSSGRFLNRDLFQKPLCRGLADLQPECIPGN
LLALAASNMPSGQERAPVSAWGWRLPEKAPEAISVIDPGVSLY
>gi568815579f:45368587_45573009|GENSCAN_predicted_CDS_8|312_bp
atgagcgagagccagcagaaaaacatggcagtcccacccttcattatgcagaggcatttg
ctggaatttgctcatgtactcatggctaaatcttctggccgatttctgaacagggatctt
tttcagaagcccctgtgtagagggctggctgatttgcaacctgagtgcatcccagggaat
ctacttgcgctggctgcaagcaatatgccaagtggacaggagagggcgcccgtgtcagcc
tggggatggcgcctgcctgagaaagcaccagaggccatctcggtcattgaccccggtgta
tctctgtattag