GENSCAN 1.0 Date run: 6-Nov-116 Time: 15:06:48 Sequence gi568815587f:71900348_72101858 : 201511 bp : 45.43% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 2899 2987 89 2 2 69 -3 157 0.430 4.39 1.02 Intr + 8422 8955 534 0 0 76 -22 297 0.536 10.62 1.03 Intr + 9835 10057 223 0 1 43 38 151 0.457 3.30 1.04 Intr + 11082 11258 177 1 0 25 76 94 0.280 1.79 1.05 Intr + 11776 11925 150 2 0 32 83 98 0.785 3.83 1.06 Intr + 12454 12596 143 2 2 112 -5 95 0.722 2.67 1.07 Term + 12723 12806 84 2 0 120 49 70 0.794 3.95 1.08 PlyA + 13804 13809 6 1.05 2.06 PlyA - 15204 15199 6 1.05 2.05 Term - 15677 15666 12 2 0 120 48 2 0.653 -2.40 2.04 Intr - 15881 15727 155 0 2 37 95 55 0.300 0.89 2.03 Intr - 17429 17274 156 0 0 115 58 85 0.240 8.18 2.02 Intr - 18679 18538 142 1 1 108 92 82 0.997 10.53 2.01 Init - 23006 22962 45 2 0 111 76 10 0.897 2.86 2.00 Prom - 28260 28221 40 -4.56 3.00 Prom + 28439 28478 40 -11.63 3.01 Init + 28715 28777 63 1 0 96 99 78 0.897 10.86 3.02 Intr + 43070 43107 38 1 2 64 90 28 0.065 -2.34 3.03 Intr + 56831 56917 87 2 0 67 107 26 0.380 1.49 3.04 Intr + 60403 60544 142 1 1 107 75 185 0.860 19.46 3.05 Intr + 82414 82568 155 0 2 86 100 59 0.717 5.77 3.06 Intr + 86657 86764 108 2 0 128 89 45 0.994 8.00 3.07 Intr + 90250 90370 121 1 1 91 98 141 0.999 15.90 3.08 Intr + 94372 94558 187 0 1 89 30 70 0.611 0.56 3.09 Intr + 95093 95328 236 0 2 46 44 176 0.265 6.41 3.10 Intr + 98485 98672 188 0 2 75 91 57 0.890 3.19 3.11 Intr + 99580 99665 86 1 2 64 96 44 0.966 2.26 3.12 Intr + 100854 100977 124 1 1 77 94 93 0.993 8.44 3.13 Intr + 101058 101205 148 0 1 121 68 175 0.966 19.04 3.14 Term + 101437 101514 78 0 0 118 47 -10 0.832 -4.34 3.15 PlyA + 102160 102165 6 -3.94 4.27 PlyA - 102541 102536 6 1.05 4.26 Term - 103191 103180 12 0 0 101 43 1 0.553 -4.90 4.25 Intr - 103752 103540 213 0 0 65 75 268 0.978 22.11 4.24 Intr - 103994 103878 117 2 0 102 94 49 0.994 7.56 4.23 Intr - 104469 104293 177 0 0 83 80 240 0.999 22.82 4.22 Intr - 105022 104886 137 2 2 119 72 247 0.998 26.49 4.21 Intr - 105916 105688 229 1 1 66 94 148 0.932 10.64 4.20 Intr - 107088 106842 247 2 1 51 77 245 0.980 17.16 4.19 Intr - 108498 108341 158 0 2 125 71 159 0.996 16.71 4.18 Intr - 108838 108620 219 1 0 127 110 375 0.871 42.10 4.17 Intr - 109040 108921 120 2 0 86 110 112 0.991 13.89 4.16 Intr - 110507 110439 69 2 0 145 81 108 0.986 15.18 4.15 Intr - 112095 112054 42 0 0 99 95 -1 0.509 0.14 4.14 Intr - 115913 112548 3366 2 0 54 110 4307 0.927 416.89 4.13 Intr - 116183 116061 123 2 0 80 91 85 0.986 8.78 4.12 Intr - 116310 116230 81 0 0 115 11 82 0.890 3.03 4.11 Intr - 117480 117340 141 0 0 60 86 245 0.995 22.05 4.10 Intr - 117953 117836 118 1 1 121 98 156 0.924 20.37 4.09 Intr - 118166 118049 118 0 1 102 77 237 0.999 23.52 4.08 Intr - 118633 118476 158 0 2 94 73 258 0.995 24.55 4.07 Intr - 119270 119147 124 0 1 97 55 78 0.979 5.04 4.06 Intr - 120944 120857 88 2 1 74 115 122 0.987 13.04 4.05 Intr - 122072 121992 81 2 0 122 115 11 0.992 7.03 4.04 Intr - 122998 122718 281 2 2 90 77 96 0.395 5.90 4.03 Intr - 124006 123927 80 0 2 83 63 48 0.662 0.99 4.02 Intr - 128943 128858 86 0 2 89 82 130 0.978 11.12 4.01 Init - 135596 135555 42 2 0 80 106 55 0.987 4.92 4.00 Prom - 153651 153612 40 -3.66 5.00 Prom + 168346 168385 40 -4.26 5.01 Init + 188737 188818 82 0 1 54 82 71 0.894 4.23 5.02 Intr + 193149 193354 206 1 2 89 76 183 0.904 16.12 5.03 Intr + 194601 194749 149 2 2 105 97 124 0.948 14.03 5.04 Term + 195032 195173 142 2 1 71 54 222 0.997 14.50 5.05 PlyA + 195229 195234 6 1.05 6.05 PlyA - 195335 195330 6 -0.45 6.04 Term - 197567 197475 93 2 0 127 49 119 0.999 9.73 6.03 Intr - 198068 197942 127 1 1 89 94 55 0.998 6.88 6.02 Intr - 198511 198434 78 0 0 108 99 83 0.997 10.07 6.01 Intr - 198909 198764 146 0 2 98 111 187 0.999 21.08 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587f:71900348_72101858|GENSCAN_predicted_peptide_1|466_aa XDPELQPVVAGLFLSMCLVTVLENLLIILARRKAESPQAATKWLEEHAPADYQNAQEYGR AQLPGTDPQLDPHERGDMQRLKRDREALLEGLMRGAQKATNVNKLSEVIQGKEESPAQFY QRLCEVYRMYTPCDPDSPENQHMIHMALVRQSPEDMRRKLQKQAGLAGMNPSQLLEIASQ VFVNRDAVSRKENGKENGGQARRYADLFPRTKDYQPVQDLRLLHQAKLTLHPTVNNPSTL LGLLPAEDSWFTCLDLKDVFFPIRLAPERQKLFAFQWEDPESGDWELYVDGSSFFNPQGE RGAGYAVITLDTVVEATSLPQATSGQKAELIAFIGALELSEALAKTVRQRCVTCRQHDAR QGPAVPPGIRAYGAAPFEGLQVDFTEMPKCGDIRKKCHWGCEQPCDIESSIILSPLHIGN NITSGVYSPLHIGNNITSGGIQNNIIGGVYTLCDIESHIILFRSGY >gi568815587f:71900348_72101858|GENSCAN_predicted_CDS_1|1401_bp naggatccagaactgcagccggtcgtcgctgggctgttcctgtccatgtgcctggtcacg gtgctggagaacctgctcatcatcctggcccgaagaaaggcggagagtcctcaagcagca actaagtggctagaggaacatgcaccagctgattatcaaaacgcccaagagtatggaagg gcccagttgccaggaaccgaccctcagttggacccacatgaaagaggggatatgcaaagg ctaaagcgagacagagaagctctcctggaaggattaatgaggggagctcagaaggccaca aacgttaacaagctctctgaggtcattcagggtaaagaagaaagtccagcacaattctac cagagactgtgtgaggtctatcgtatgtatactccctgtgatcccgatagccctgaaaat cagcacatgattcacatggctttagtccgtcaaagcccagaagacatgagaagaaaactg cagaaacaggctgggcttgcagggatgaatccatcccaattactagaaatagctagccag gtgtttgtaaacagggatgcagtaagccgtaaggaaaacggcaaagagaatggaggtcag gcccggcgatatgccgacctgtttccacggaccaaggactaccagccggttcaggatttg cgcttgcttcatcaagctaaactgactttacatccaacagtaaataacccgtccacattg ttggggttgctgccagctgaggacagctggttcacctgcttggacctgaaagacgttttc tttcctatcagattagcccctgagaggcagaagctgtttgcctttcagtgggaagatccg gagtcaggagactgggaactatatgtggatgggagcagcttcttcaacccccaaggagag agaggtgcagggtatgcagtgataaccctggacactgttgttgaagccacatcgttgccc caggccacttcaggccagaaagctgaactcattgctttcattggggccttagaactcagt gaggcccttgccaaaacggtgaggcagcggtgtgttacctgccgccagcatgatgcgagg caaggtccagccgttccgcccggcatacgagcttatggagcagccccctttgaaggtctc caagtggacttcacagagatgccaaagtgtggagatattaggaaaaaatgtcactgggga tgtgaacagccctgcgatattgagagtagtatcatcctctcccccttgcatattgggaac aacatcacaagtggggtgtactcccccttgcatattgggaacaacatcacaagtgggggt attcagaacaatattataggaggggtgtacaccctctgcgatattgagagtcatatcatc ctctttcgctctggatattag >gi568815587f:71900348_72101858|GENSCAN_predicted_peptide_2|169_aa MAPVKISHVVSFSSQDPKYPVENLLNPDSPRRPWLGCPQDKSGQLKVELQLERAVPTGYI DVGNCGCAFLQIDVGHSSWPLDRPFITLLPATTLMSLTDSKQGKNRSGVRMFKDGELWDR LRLTCSRPFTRHQSFGLAFLRVCSSLDSLDDSVVGPSALLSSVLNKVSV >gi568815587f:71900348_72101858|GENSCAN_predicted_CDS_2|510_bp atggctcctgtgaagatcagccatgtggtatcattttcttctcaggatcccaagtatcct gtagagaacttgctaaacccagatagtccaaggagaccttggctcggctgccctcaggac aagagtgggcaattgaaagtagaactacagctggagagggcagtgcccactggctacatt gatgtgggtaactgtggctgtgcgttcctgcaaattgatgtgggccattcttcctggccc ctggacagacctttcataaccctgctccctgcaaccacgctaatgtctctaactgattca aagcaggggaagaaccgctccggggtccgcatgtttaaagatggagagttatgggatcga cttcgcctgacctgctcccgacccttcacgcgtcatcagtcctttggcctggcctttcta cgggtgtgttcttctctggactccttagatgactctgtggtgggtccctcagcccttctg agctctgtgctgaacaaggtttctgtttaa >gi568815587f:71900348_72101858|GENSCAN_predicted_peptide_3|586_aa MAAVVEVEVGGGAAGERELDELCHAAVICDKSIRDSFKTVTDFSGGVFLQVDMSDLSPEE QWRVEHARMHAKHRGHEAMHAEMVLILIATLVVAQLLLVQWKQRHPRSYNMVTLFQMWVV PLYFTVKLHWWRFLVIWILFSAVTAFVTFRATRKPLVQTTPRLVYKWFLLIYKISYATGI VGYMAVMFTLFGLNLLFKIKPEDAMDFGISLLFYGLYYGVLERDFAEMCADYMASTIGFY SESGMPTKHLSDSVCAVCGQQIFVDVSEEGIIENTYRLSCNHVYPASSSWATSLLQSAHC TVLSFHEFCIRGWCIVGKKQTCPYCKEKVDLKRMFSNPYPLLGSLLGVGCGKKVLASVTC IWVLLGPSLLPPTRTCQTMGWLSSLCASSWVGRESSTDKEEVLSVYFLWWVHTQLDTANL SLGAPRGKLLERKALQDLLGAREEDVVTDKEPGSPAPDACIMTMRHNWTPDGTLSLSCVA CSRFPNFSILYWLGNGSFIEHLPGRLWEGSTSRERGSTGTQLCKALVLEQLTPALHSTNF SCVLVDPEQVVQRHVVLAQLWAGLRATLPPTQEALPSSHSSPQQQG >gi568815587f:71900348_72101858|GENSCAN_predicted_CDS_3|1761_bp atggcggcagtggtggaggtggaggttggaggtggtgctgctggggaacgggagctggat gagctctgccatgcagctgttatctgtgacaagtcaataagggacagttttaagacagta acggatttttctggtggtgtctttctacaggttgatatgtcagatctctctccagaagag caatggagggtcgagcacgcacgcatgcatgccaagcaccgtggccatgaagctatgcat gctgaaatggtcctcatcctcatcgcaaccttggtggtggcccagctgctcctggtgcag tggaagcagaggcacccacgctcctacaatatggtgaccctctttcagatgtgggttgtt cccctctatttcacagtgaagctgcactggtggaggttcctagtgatctggatcttgttc tctgctgtcacagcctttgttaccttccgagccacccgaaaacctctagtacagacaacc ccaaggttggtttataagtggttcctgctaatctataaaatcagctatgccactggcatt gttggctacatggctgtcatgtttaccctctttggtcttaacttattattcaagatcaaa ccagaagatgccatggactttggcatctcccttctcttctatggcctctactatggagtt ctggaacgggactttgcagaaatgtgtgcagactacatggcatctaccatagggttctac agcgagtcgggcatgcctaccaaacatctttcagacagtgtgtgtgctgtgtgtgggcag cagatctttgtggacgtcagtgaagaggggatcattgagaacacgtataggctgtcctgc aatcatgtgtatcctgcctcgagctcctgggccacatctctcctgcaatctgcacactgt acggtgctcagcttccacgagttctgcatccgtggctggtgcatcgtgggaaagaagcaa acgtgtccctactgcaaagagaaggtagacctcaagaggatgttcagcaatccgtatcct ttattggggtcgttgttgggagtgggctgtgggaagaaagtactggccagtgtgacctgc atttgggtcctcctggggccctcacttctgcccccaaccagaacatgtcaaaccatgggt tggctctcgagcttgtgtgccagttcctgggttggccgtgagagttctacagacaaggag gaagtgctctcggtgtatttcctgtggtgggttcacacgcagctagacacagctaacttg agtcttggagctcctagagggaagcttctggaaaggaaggctcttcaggacctcttagga gccagagaagaggacgttgtcacagataaagagccaggctcaccagctcctgacgcatgc atcatgaccatgagacacaactggacaccagatggaacgctgagcttatcctgtgtggcc tgcagccgcttccccaacttcagcatcctctactggctgggcaatggttccttcattgag cacctcccaggccgactgtgggaggggagcaccagccgggaacgtgggagcacaggtacg cagctgtgcaaggccttggtgctggagcagctgacccctgccctgcacagcaccaacttc tcctgtgtgctcgtggaccctgaacaggttgtccagcgtcacgtcgtcctggcccagctc tgggctgggctgagggcaaccttgccccccacccaagaagccctgccctccagccacagc agtccacagcagcagggttaa >gi568815587f:71900348_72101858|GENSCAN_predicted_peptide_4|2208_aa MTLHATRGAALLSWVNSLHVADPVEAVLQLQDCSIFIKIIDRIHGTEEGQQILKQPVSER LDFVCSFLQTLGLTQDRTDFRWKIVRQGKEAIGVRRPTSHSPTRDSHSAPQISVFWDLRY GFQEVHPVFYVVVIAENRKHPSSPECLVSAQKVLEGSELELAKMTMLLLYHSTMSSKSPR DWEQFEYKIQAELAVILKFVLDHEDGLNLNEDLENFLQKAPVPSTCSSTFPEELSPPSHQ AKREIRFLELQKVASSSSGNNFLSGSPASPMGDILQTPQFQMRRLKKQLADERSNRDELE LELAENRKLLTEKDAQIAMMQQRIDRLALLNEKQAASPLEPKELEELRDKNESLTMRLHE TLKQCQDLKTEKSQMDRKINQLSEENGDLSFKLREFASHLQQLQDALNELTEEHSKATQE WLEKQAQLEKELSAALQDKARAKGDLGNKMMGPMFADVYEGLCGNLKCLEEKNEILQGKL SQLEEHLSQLQDNPPQEKGEVLGDVLQLETLKQEAATLAANNTQLQARVEMLETERGQQE AKLLAERGHFEEEKQQLSSLITDLQSSISNLSQAKEELEQASQAHGARLTAQVASLTSEL TTLNATIQQQDQELAGLKQQAKEKQAQLAQTLQQQEQASQGLRHQVEQLSSSLKQKEQQL KEVAEKQEATRQDHAQQLATAAEEREASLRERDAALKQLEALEKEKAAKLEILQQQLQVA NEARDSAQTSVTQAQREKAELSRKVEELQACVETARQEQHEAQAQVAELELQLRSEQQKA TEKERVAQEKDQLQEQLQALKESLKVTKGSLEEEKRRAADALEEQQRCISELKAETRSLV EQHKRERKELEEERAGRKGLEARLQQLGEAHQAETEVLRRELAEAMAAQHTAESECEQLV KEVAAWRERYEDSQQEEAQYGAMFQEQLMTLKEECEKARQELQEAKEKVAGIESHSELQI SRQQNELAELHANLARALQQVQEKEVRAQKLADDLSTLQEKMAATSKEVARLETLVRKAG EQQETASRELVKEPARAGDRQPEWLEEQQGRQFCSTQAALQAMEREAEQMGNELERLRAA LMESQGQQQEERGQQEREVARLTQERGRAQADLALEKAARAELEMRLQNALNEQRVEFAT LQEALAHALTEKEGKDQELAKLRGLEAAQIKELEELRQTVKQLKEQLAKKEKEHASGSGA QSEAAGRTEPTGPKLEALRAEVSKLEQQCQKQQEQADSLERSLEAERASRAERDSALETL QGQLEEKAQELGHSQSALASAQRELAAFRTKVQDHSKAEDEWKAQVARGRQEAERKNSLI SSLEEEVSILNRQVLEKEGESKELKRLVMAESEKSQKLEERLRLLQAETASNSARAAERS SALREEVQSLREEAEKQRVASENLRQELTSQAERAEELGQELKAWQEKFFQKEQALSTLQ LEHTSTQALVSELLPAKHLCQQLQAEQAAAEKRHREELEQSKQAAGGLRAELLRAQRELG ELIPLRQKVAEQERTAQQLRAEKASYAEQLSMLKKAHGLLAEENRGLGERANLGRQFLEV ELDQAREKYVQELAAVRADAETRLAEVQREAQSTARELEVMTAKYEGAKVKVLEERQRFQ EERQKLTAQVEQLEVFQREQTKQVEELSKKLADSDQASKVQQQKLKAVQAQGGESQQEAQ RLQAQLNELQAQLSQKEQAAEHYKLQMEKAKTHYDAKKQQNQELQEQLRSLEQLQKENKE LRAEAERLGHELQQAGLKTKEAEQTCRHLTAQVRSLEAQVAHADQQLRDLGKFQVATDAL KSREPQAKPQLDLSIDSLDLSCEEGTPLSITSKLPRTQPDGTSVPGEPASPISQRLPPKV ESLESLYFTPIPARSQAPLESSLDSLGDVFLDSGRKTRSARRRTTQIINITMTKKLDVEE PDSANSSFYSTRSAPASQASLRATSSTQSLARLGSPDYGNSALLSLPGYRPTTRSSARRS QAGVSSGAPPGRNSFYMGTCQDEPEQLDDWNRIAELQQRNRVCPPHLKTCYPLESRPSLS LGTITDEEMKTGDPQETLRRASMQPIQIAEGTGITTRQQRKRVSLEPHQGPGTPESKKAT SCFPRPMTPRDRHEGRKQSTTEAQKKAAPASTKQADRRQSMAFSILNTPKKLGNSLLRRG ASKKALSKASPNTRSGTRRSPRIATTTASAATAAAIGATPRAKGKAKH >gi568815587f:71900348_72101858|GENSCAN_predicted_CDS_4|6627_bp atgacactccacgccacccggggggctgcactcctctcttgggtgaacagtctacacgtg gctgaccctgtggaggctgtgctgcagctccaggactgcagcatcttcatcaagatcatt gacagaatccatggcactgaagagggacagcaaatcttgaagcagccggtgtcagagaga ctggactttgtgtgcagttttctgcagacccttggattgacccaggatagaactgacttc agatggaaaattgtaaggcaggggaaagaggccataggagtaaggaggcctacatctcat agccccaccagagatagccattctgctccccagatttcagtgttctgggatctcaggtat ggtttccaggaagttcaccctgttttctacgttgttgtcattgcagaaaatcgaaaacat ccctcttccccagaatgcctggtatctgcacagaaggtgctagagggatcagagctggaa ctggcgaagatgaccatgctgctcttataccactctaccatgagctccaaaagtcccagg gactgggaacagtttgaatataaaattcaggctgagttggctgtcattcttaaatttgtg ctggaccatgaggacgggctaaaccttaatgaggacctagagaacttcctacagaaagct cctgtgccttctacctgttctagcacattccctgaagagctctccccacctagccaccag gccaagagggagattcgcttcctagagctacagaaggttgcctcctcttccagtgggaac aactttctctcaggttctccagcttctcccatgggtgatatcctgcagaccccacagttc cagatgagacggctgaagaagcagcttgctgatgagagaagtaatagggatgagctggag ctggagctagctgagaaccgcaagctcctcaccgagaaggatgcacagatagccatgatg cagcagcgcattgaccgcctagccctgctgaatgagaagcaggcggccagcccactggag cccaaggagcttgaggagctgcgtgacaagaatgagagccttaccatgcggctgcatgaa accctgaagcagtgccaggacctgaagacagagaagagccagatggatcgcaaaatcaac cagctttcggaggagaatggagacctttcctttaagctgcgggagtttgccagtcatctg cagcagctacaggatgccctcaatgagctgacggaggagcacagcaaggccactcaggag tggctagagaagcaggcccagctggagaaggagctcagcgcagccctgcaggacaaggcc agggcaaaaggagatcttggtaacaagatgatgggccccatgtttgctgatgtttatgag ggcttgtgtggtaatctgaaatgccttgaagagaagaacgaaatccttcagggaaaactt tcacagctggaagaacacttgtcccagctgcaggataacccaccccaggagaagggcgag gtgctgggtgatgtcttgcagctggaaaccttgaagcaagaggcagccactcttgctgca aacaacacacagctccaagccagggtagagatgctggagactgagcgaggccagcaggaa gccaagctgcttgctgagcggggccacttcgaagaagaaaagcagcagctgtctagcctg atcactgacctgcagagctccatctccaacctcagccaggccaaggaagagctggagcag gcctcccaggctcatggggcccggttgactgcccaggtggcctctctgacctctgagctc accacactcaatgccaccatccagcaacaggatcaagaactggctggcctgaagcagcag gccaaagagaagcaggcccagctagcacagaccctccaacagcaagaacaggcctcccag ggcctccgccaccaggtggagcagctaagcagtagcctgaagcagaaggagcagcagttg aaggaggtagcggagaagcaggaggcaactaggcaggaccatgcccagcaactggccact gctgcagaggagcgagaggcctccttaagggagcgggatgcggctctcaagcagctggag gcactggagaaggagaaggctgccaagctggagattctgcagcagcaacttcaggtggct aatgaagcccgggacagtgcccagacctcagtgacacaggcccagcgggagaaggcagag ctgagccggaaggtggaggaactccaggcctgtgttgagacagcccgccaggaacagcat gaggcccaggcccaggttgcagagctagagttgcagctgcggtctgagcagcaaaaagca actgagaaagaaagggtggcccaggagaaggaccagctccaggagcagctccaggccctc aaagagtccttgaaggtcaccaagggcagccttgaagaggagaagcgcagggctgcagat gccctggaagagcagcagcgttgtatctctgagctgaaggcagagacccgaagcctggtg gagcagcataagcgggaacgaaaggagctggaagaagagagggctgggcgcaaggggctg gaggctcgattacagcagcttggggaggcccatcaggctgagactgaagtcctgcggcgg gagctggcagaggccatggctgcccagcacacagctgagagtgagtgtgagcagctcgtc aaagaagtagctgcctggcgtgagcggtatgaggatagccagcaagaggaggcacagtat ggcgccatgttccaggaacagctgatgactttgaaggaggaatgtgagaaggcccgccag gagctgcaggaggcaaaggagaaggtggcaggcatagaatcccacagcgagctccagata agccggcagcagaacgaactagctgagctccatgccaacctggccagagcactccagcag gtccaagagaaggaagtcagggcccagaagcttgcagatgacctctccactctgcaggaa aagatggctgccaccagcaaagaggtggcccgcttggagaccttggtgcgcaaggcaggt gagcagcaggaaacagcctcccgggagttagtcaaggagcctgcgagggcaggagacaga cagcccgagtggctggaagagcaacagggacgccagttctgcagcacacaggcagcgctg caggctatggagcgggaggcagagcagatgggcaatgagctggaacggctgcgggccgcg ctgatggagagccaggggcagcagcaggaggagcgtgggcagcaggaaagggaggtggcg cggctgacccaggagcggggccgtgcccaggctgaccttgccctggagaaggcggccaga gcagagcttgagatgcggctgcagaacgccctcaacgagcagcgtgtggagttcgctacc ctgcaagaggcactggctcatgccctgacggaaaaggaaggcaaggaccaggagttggcc aagcttcgtggtctggaggcagcccagataaaagagctggaggaacttcggcaaaccgtg aagcaactgaaggaacagctggctaagaaagaaaaggagcacgcatctggctcaggagcc caatctgaggctgctggcaggacagagccaacaggccccaagctggaggcactgcgggca gaggtgagcaagctggaacagcaatgccagaagcagcaggagcaggctgacagcctggaa cgcagcctcgaggctgagcgggcctcccgggctgagcgggacagtgctctggagactctg cagggccagttagaggagaaggcccaggagctagggcacagtcagagtgccttagcctcg gcccaacgggagttggctgccttccgcaccaaggtacaagaccacagcaaggctgaagat gagtggaaggcccaggtggcccggggccggcaagaggctgagaggaaaaatagcctcatc agcagcttggaggaggaggtgtccatcctgaatcgccaggtcctggagaaggagggggag agcaaggagttgaagcggctggtgatggccgagtcagagaagagccagaagctggaggag aggctgcgcctgctgcaggcagagacagccagcaacagtgccagagctgcagaacgcagc tctgctctgcgggaggaggtgcagagcctccgggaggaggctgagaaacagcgggtggct tcagagaacctgcggcaggagctgacctcacaggctgagcgtgcggaggagctgggccaa gaattgaaggcgtggcaggagaagttcttccagaaagagcaggccctctccaccctgcag ctcgagcacaccagcacacaggccctggtgagtgagctgctgccagctaagcacctctgc cagcagctgcaggccgagcaggccgctgccgagaaacgccaccgtgaggagctggagcag agcaagcaggccgctgggggactgcgggcagagctgctgcgggcccagcgggagcttggg gagctgattcctctgcggcagaaggtggcagagcaggagcgaacagctcagcagctgcgg gcagagaaggccagctatgcagagcagctgagcatgctgaagaaggcgcatggcctgctg gcagaggagaaccgggggctgggtgagcgggccaaccttggccggcagtttctggaagtg gagttggaccaggcccgggagaagtatgtccaagagttggcagccgtacgtgctgatgct gagacccgtctggctgaggtgcagcgagaagcacagagcactgcccgggagctggaggtg atgactgccaagtatgagggtgccaaggtcaaggtcctggaggagaggcagcggttccag gaagagaggcagaaactcactgcccaggtggagcagctagaggtatttcagagagagcaa actaagcaggtggaagaactgagtaagaaactggctgactctgaccaagccagcaaggtg cagcagcagaagctgaaggctgtccaggctcagggaggcgagagccagcaggaggcccag cgcctccaggcccagctgaatgaactgcaagcccagttgagccagaaggagcaggcagct gagcactataagctgcagatggagaaagccaaaacacattatgatgccaagaagcagcag aaccaagagctgcaggagcagctgcggagcctggagcagctgcagaaggaaaacaaagag ctgcgagctgaagctgaacggctgggccatgagctacagcaggctgggctgaagaccaag gaggctgaacagacctgccgccaccttactgcccaggtgcgcagcctggaggcacaggtt gcccatgcagaccagcagcttcgagacctgggcaaattccaggtggcaactgatgcttta aagagccgtgagccccaggctaagccccagctggacttgagtattgacagcctggatctg agctgcgaggaggggaccccactcagtatcaccagcaagctgcctcgtacccagccagac ggcaccagcgtccctggagaaccagcctcacctatctcccagcgcctgccccccaaggta gaatccctggagagtctctacttcactcccatccctgctcggagtcaggcccccctggag agcagcctggactccctgggagacgtcttcctggactcgggtcgtaagacccgctccgct cgtcggcgcaccacgcagatcatcaacatcaccatgaccaagaagctagatgtggaagag ccagacagcgccaactcatcgttctacagcacgcggtctgctcctgcttcccaggctagc ctgcgagccacctcctctactcagtctctagctcgcctgggttctcccgattatggcaac tcagccctgctcagcttgcctggctaccgccccaccactcgcagttctgctcgtcgttcc caggccggggtgtccagtggggcccctccaggaaggaacagcttctacatgggcacttgc caggatgagcctgagcagctggatgactggaaccgcattgcagagctgcagcagcgcaat cgagtgtgccccccacatctgaagacctgctatcccctggagtccaggccttccctgagc ctgggcaccatcacagatgaggagatgaaaactggagacccccaagagaccctgcgccga gccagcatgcagccaatccagatagccgagggcactggcatcaccacccggcagcagcgc aaacgggtctccctagagccccaccagggccctggaactcctgagtctaagaaggccacc agctgtttcccacgccccatgactccccgagaccgacatgaagggcgcaaacagagcact actgaggcccagaagaaagcagctccagcttctactaaacaggctgaccggcgccagtcg atggccttcagcatcctcaacacacccaagaagctagggaacagccttctgcggcgggga gcctcaaagaaggccctgtccaaggcttcccccaacactcgcagtggaacccgccgttct ccgcgcattgccaccaccacagccagcgccgccactgctgccgccattggtgccacccct cgagccaagggcaaggcaaagcactaa >gi568815587f:71900348_72101858|GENSCAN_predicted_peptide_5|192_aa MNKRDYMNTSVQEPPLDYSFRSIHVIQDLVNEEPRTGLRPLKRSKSGKSLTQSLWLNNNV LNDLRDFNQVASQLLEHPENLAWIDLSFNDLTSIDPVLTTFFNLSVLYLHGNSIQRLGEV NKLAVLPRLRSLTLHGNPMEEEKGYRQYVLCTLSRITTFDFSGVTKADRTTAEVWKRMNI KPKKAWTKQNTL >gi568815587f:71900348_72101858|GENSCAN_predicted_CDS_5|579_bp atgaacaaacgggactatatgaacacttcggtacaggagccccctcttgactactccttc agaagcatccacgtcattcaagatctggtaaatgaggagccaaggacaggactacgacca ctgaagcgttcaaagtcggggaaatcactgacccagtccctgtggctgaataacaatgtt ctcaatgatctgagagacttcaaccaggtggcttcacagctgttggagcacccagagaac ctggcctggatcgacctgtcctttaatgacctgacttccattgaccctgtcctaacaact ttcttcaacctgagtgtcctctatcttcacggcaacagcatccagcgcctgggggaggtg aataagctggctgtccttcctcggctccgtagcctgacactccatgggaaccccatggag gaagagaaagggtataggcaatatgtgctgtgcaccctgtcccgtatcaccacgttcgac ttcagtggggtcaccaaagcagaccgcaccacagctgaagtctggaaacgcatgaacatc aagcccaagaaggcctggaccaagcagaatacactttga >gi568815587f:71900348_72101858|GENSCAN_predicted_peptide_6|147_aa DREERKLLLDPSSPPTKALNGAEPNYHSLPSARTDEQALLSSILAKTASNIIDVSAADSQ GMEQHEYMDRARQYSTRLAVLSSSLTHWKKLPPLPSLTSQPHQVLASEPIPFSDLQQVSR IAAYAYSALSQIRVDAKEELVVQFGIP >gi568815587f:71900348_72101858|GENSCAN_predicted_CDS_6|444_bp gaccgagaggagcggaagctgctgctggaccctagcagcccccctaccaaagctctcaat ggagccgagcccaactaccacagcctgccttccgctcgcactgatgagcaggccctgctc tcttccatccttgccaagacagccagcaacatcattgatgtgtctgctgcagactcacag ggcatggagcagcatgagtacatggaccgtgccaggcagtacagcacccgcttggctgtg ctgagcagcagcctgacccattggaagaagctgccaccgctgccgtctcttaccagccag ccccaccaagtgctggccagtgagcccatcccgttctctgatttgcagcaggtctccagg atagctgcttatgcctacagtgcactttctcagatccgtgtggacgcaaaagaggagctg gttgtacagtttgggatcccatga