GENSCAN 1.0 Date run: 5-Nov-116 Time: 07:31:37 Sequence gi568815583r:41417545_41637125 : 219581 bp : 48.55% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 20777 20887 111 1 0 76 90 92 0.891 8.78 1.02 Intr + 35357 35504 148 1 1 101 105 100 0.827 12.81 1.03 Intr + 40128 40332 205 1 1 80 116 208 0.984 21.06 1.04 Intr + 47227 47341 115 1 1 40 91 116 0.914 7.55 1.05 Intr + 48597 48708 112 2 1 108 115 173 0.999 21.95 1.06 Intr + 52713 52848 136 1 1 98 57 261 0.981 23.63 1.07 Intr + 53628 53805 178 0 1 89 11 106 0.752 2.92 1.08 Intr + 57076 57158 83 0 2 94 70 30 0.567 0.24 1.09 Intr + 57981 58068 88 0 1 34 88 65 0.694 1.07 1.10 Intr + 58168 58275 108 0 0 75 94 58 0.915 5.58 1.11 Intr + 58902 58979 78 2 0 85 116 66 0.992 8.85 1.12 Intr + 59621 59742 122 1 2 25 89 188 0.995 11.89 1.13 Intr + 59914 59971 58 1 1 117 83 117 0.999 12.99 1.14 Intr + 61004 61081 78 1 0 76 82 58 0.930 3.75 1.15 Intr + 61559 61663 105 1 0 61 26 93 0.606 0.81 1.16 Intr + 62670 62781 112 2 1 76 91 113 0.942 10.35 1.17 Term + 63037 63143 107 2 2 121 53 70 0.921 5.37 1.18 PlyA + 64726 64731 6 1.05 2.00 Prom + 64767 64806 40 -8.76 2.01 Init + 64901 64908 8 1 2 85 89 4 0.759 0.41 2.02 Intr + 66120 66190 71 0 2 90 58 89 0.832 4.93 2.03 Term + 68634 68851 218 1 2 89 43 133 0.614 6.21 2.04 PlyA + 71609 71614 6 -0.45 3.00 Prom + 71976 72015 40 -7.56 3.01 Init + 76411 76872 462 0 0 92 96 766 0.964 71.50 3.02 Intr + 83919 84307 389 2 2 78 85 655 0.610 57.79 3.03 Intr + 84453 84657 205 0 1 125 100 489 0.999 53.00 3.04 Intr + 84858 84959 102 2 0 86 85 153 0.999 15.17 3.05 Intr + 85244 85315 72 1 0 96 99 121 0.985 13.60 3.06 Term + 85389 85622 234 2 0 4 54 380 0.754 22.52 3.07 PlyA + 85988 85993 6 1.05 4.44 PlyA - 86117 86112 6 1.05 4.43 Term - 86700 86452 249 0 0 108 52 32 0.708 -2.70 4.42 Intr - 86888 86798 91 1 1 87 69 139 0.993 11.90 4.41 Intr - 87096 86962 135 2 0 80 56 136 0.998 9.28 4.40 Intr - 87330 87229 102 2 0 113 19 118 0.985 6.69 4.39 Intr - 87520 87428 93 0 0 73 66 89 0.940 4.28 4.38 Intr - 87761 87664 98 2 2 86 66 166 0.998 13.01 4.37 Intr - 87986 87857 130 1 1 52 85 0 0.405 -3.20 4.36 Intr - 88233 88169 65 0 2 107 55 72 0.362 3.32 4.35 Intr - 88461 88371 91 2 1 112 100 44 0.957 8.00 4.34 Intr - 89746 89551 196 2 1 68 111 83 0.956 7.07 4.33 Intr - 90113 89970 144 0 0 92 55 49 0.695 2.25 4.32 Intr - 90677 90525 153 0 0 51 71 119 0.489 6.54 4.31 Intr - 91585 91487 99 2 0 81 72 92 0.917 6.98 4.30 Intr - 93802 93620 183 2 0 126 61 112 0.997 12.06 4.29 Intr - 94073 93878 196 2 1 33 94 185 0.551 12.59 4.28 Intr - 94419 94273 147 0 0 115 87 52 0.911 8.23 4.27 Intr - 94721 94571 151 1 1 91 38 155 0.696 10.96 4.26 Intr - 95334 95163 172 1 1 96 84 127 0.994 12.10 4.25 Intr - 95576 95433 144 0 0 63 88 72 0.938 4.95 4.24 Intr - 96216 96123 94 0 1 -25 114 111 0.178 1.94 4.23 Intr - 98089 98006 84 1 0 18 90 77 0.076 0.92 4.22 Intr - 100147 100013 135 1 0 118 21 89 0.173 5.96 4.21 Intr - 100313 100254 60 2 0 142 75 64 0.998 9.43 4.20 Intr - 100638 100462 177 0 0 90 89 177 0.992 18.12 4.19 Intr - 103603 102847 757 0 1 82 80 574 0.980 47.47 4.18 Intr - 104336 104194 143 2 2 105 105 156 0.968 18.15 4.17 Intr - 104706 104554 153 0 0 52 78 93 0.843 4.97 4.16 Intr - 106428 106227 202 2 1 47 89 19 0.497 -2.81 4.15 Intr - 106710 106552 159 2 0 89 68 117 0.817 8.90 4.14 Intr - 107604 107447 158 0 2 84 94 164 0.999 15.41 4.13 Intr - 109524 109354 171 0 0 139 93 -15 0.948 4.24 4.12 Intr - 109757 109623 135 2 0 63 76 221 0.998 19.16 4.11 Intr - 110061 109879 183 0 0 102 98 239 0.999 26.28 4.10 Intr - 110483 110316 168 2 0 77 66 156 0.974 12.44 4.09 Intr - 110792 110691 102 2 0 93 96 26 0.939 4.27 4.08 Intr - 112024 111926 99 1 0 87 78 85 0.994 7.71 4.07 Intr - 112435 112320 116 2 2 85 78 158 0.997 14.67 4.06 Intr - 113658 113479 180 1 0 114 41 77 0.700 5.44 4.05 Intr - 117391 117170 222 2 0 76 52 197 0.903 13.00 4.04 Intr - 118088 117968 121 2 1 117 84 119 0.999 14.47 4.03 Intr - 118674 118585 90 0 0 83 96 50 0.944 5.49 4.02 Intr - 119105 118957 149 0 2 103 30 -9 0.860 -5.25 4.01 Init - 119581 119401 181 1 1 76 79 170 0.871 14.25 4.00 Prom - 121133 121094 40 -3.76 5.00 Prom + 135940 135979 40 -6.46 5.01 Init + 141732 141837 106 2 1 83 91 281 0.827 26.28 5.02 Intr + 143583 143766 184 1 1 112 61 316 0.841 30.15 5.03 Intr + 143995 144095 101 1 2 66 100 83 0.587 7.05 5.04 Intr + 145004 145174 171 0 0 132 99 31 0.983 8.51 5.05 Intr + 147482 147597 116 0 2 72 96 126 0.894 11.97 5.06 Intr + 149816 149993 178 1 1 101 82 165 0.998 16.69 5.07 Intr + 150673 150818 146 2 2 69 110 121 0.984 12.40 5.08 Intr + 151334 151478 145 1 1 70 94 78 0.936 6.46 5.09 Intr + 152483 152612 130 0 1 113 82 177 0.998 19.35 5.10 Intr + 152696 152796 101 2 2 89 84 197 0.998 19.15 5.11 Intr + 153060 153155 96 1 0 76 95 170 0.998 16.48 5.12 Intr + 153494 153574 81 0 0 52 113 122 0.996 10.71 5.13 Intr + 154051 154143 93 2 0 87 85 99 0.999 9.44 5.14 Intr + 154899 155020 122 1 2 113 59 97 0.996 9.51 5.15 Intr + 155458 155591 134 0 2 92 39 217 0.888 16.64 5.16 Intr + 155764 155923 160 1 1 97 77 214 0.992 21.19 5.17 Intr + 156135 156271 137 2 2 65 67 226 0.972 17.57 5.18 Intr + 160342 160576 235 1 1 133 23 160 0.644 11.69 5.19 Intr + 167916 168162 247 2 1 52 43 145 0.402 3.43 5.20 Intr + 170639 170730 92 0 2 49 88 79 0.595 3.71 5.21 Intr + 171139 171316 178 0 1 27 115 87 0.605 4.79 5.22 Intr + 171474 171703 230 1 2 57 58 86 0.213 0.09 5.23 Term + 171745 171963 219 0 0 -17 55 154 0.412 -1.36 5.24 PlyA + 172095 172100 6 1.05 6.03 PlyA - 174646 174641 6 1.05 6.02 Term - 181864 181680 185 2 2 105 43 70 0.746 1.91 6.01 Init - 184026 183966 61 0 1 92 66 35 0.727 3.11 6.00 Prom - 204978 204939 40 -2.26 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 100147 99998 150 1 0 118 45 105 0.821 7.11 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583r:41417545_41637125|GENSCAN_predicted_peptide_1|647_aa ELLSLAKRKRSDSEEKEPPVSQPAASSDSETSDSDDEWTFGSNKNKKKGKARKIEKKGTM KKQANKTASSGSSDKDSSAESSAPEEGEVSDSDSNSSSSSSDSDSSSEDEEFHDGYGEDL MGDEEDRARLEQMTEKEREQELFNRIEKREVLKRRFEIKKKLKTAKKKEKKEKKKKQEEE QEKKKLTQIQESQVTSHNKERRSKRDEKLDKKSQAMEELKAEREKRKNRTAELLAKKQPL KTSEVYSDDEEEEEDDKSSEKSDRSSRTSSSDEEEEKEEIPPKSQPVSLPEELNRVRLSR HKLERWCHMPFFAKTVTGCFVRIGIGNHNSKPVYRVAEITGVVETAKVYQLGGTRTNKGL QLRHGNDQRVFRLEFVSNQEFTESEFMKWKEAMFSAGMQLPTLDEINKKELSIKEALNYK FNDQDIEEIVKEKERFRKAPPNYAMKKTQLLKEKAMAEDLGDQDKAKQIQDQLNELEERA EALDRQRTKNISAISYINQRNREWNIVESEKALVAESHNMKNQQMDPFTRRQCKPTIVSN SRDPAVQAAILAQLNAKYGSGVLPDAPKEMSKASVGQGKDKDLNSKSASDLSEDLFKVHD FDVKIDLQVPSSESKALAITSKAPPAKDGAPRRSLNLEDYKKRRGLI >gi568815583r:41417545_41637125|GENSCAN_predicted_CDS_1|1944_bp gagctcttgtccctggcaaagcgaaagcgcagtgactctgaggagaaggagccgcctgtg agtcagcctgcagcctcgtcagactcggagacgtctgacagtgacgatgagtggacattt gggagcaataaaaataagaagaaaggaaaagccagaaaaatagagaagaaaggaaccatg aagaaacaggccaacaaaactgcctcctcaggcagttcagacaaagacagttcagctgag agctcagcccctgaggaaggtgaagtgtcagactctgacagcaacagctcctcttccagt tcagattcagactcttcctcagaagatgaagagttccatgatggctatggagaagacctc atgggagatgaggaagacagggcccgtctggaacagatgacagagaaagagagagagcaa gaactgttcaatcgcatagagaagagggaggtgttgaaaagaagatttgaaatcaagaaa aaactaaaaacagccaaaaagaaagaaaagaaagaaaagaagaaaaagcaagaagaggag caagaaaagaaaaaactgacacagattcaagaatctcaggtaacatcccacaacaaggaa cggcgttccaagcgggatgagaaactagacaagaaatctcaagccatggaggagctaaaa gcagagcgagaaaaacgaaagaacagaacagctgagctccttgccaaaaaacagccatta aaaaccagtgaggtctactctgatgatgaagaggaggaagaggatgacaaatccagtgaa aagtcagaccgctcatcacgaacatcatcgtctgatgaagaagaggagaaagaagagatc cctcccaaatcccaaccagtttccttacctgaagaattgaatcgggttcgattatcacgg cataagctagaacgctggtgtcacatgcccttctttgctaaaactgtcacaggatgtttt gtgcggattggcatcggaaaccacaacagcaaaccagtttaccgggtcgctgagattacg ggtgttgtggaaactgccaaagtttaccaactaggtggcaccagaacaaacaaagggctg caactacggcatggcaatgaccaacgcgtgttccgtttagagtttgtctcaaaccaagaa ttcaccgaaagtgagtttatgaagtggaaagaagcgatgttctctgctggcatgcagttg cccactctagatgaaatcaataaaaaggaattatctattaaagaagctcttaattataaa ttcaatgatcaggacattgaagagattgtaaaagagaaagaaaggttcagaaaagctcca cccaactacgctatgaagaagactcagctactgaaggaaaaggccatggctgaggacctg ggggatcaggacaaggccaaacaaatccaagatcaactgaatgagctggaggaacgggca gaggccctggaccgccagcggaccaagaacatatccgctatcagttacatcaaccagcgg aaccgggagtggaacattgtagagtctgagaaggcccttgtggctgaaagtcacaacatg aaaaaccaacagatggatccctttactcggcggcagtgcaagcctaccatcgtttctaat tccagagacccagctgttcaagctgccatcttggcccagctgaatgcaaaatacggttct ggagtgttaccagatgctccaaaggaaatgagcaaggcaagtgtgggtcaaggcaaagat aaagatttgaattctaagtcagccagtgacctctcagaagatctgttcaaagtacacgat tttgatgtgaagattgacttacaagttcccagctcagagtcaaaggctttagccatcacc tccaaggctccgccagccaaggatggggctccaaggagatctctgaacttggaagactac aaaaaacgacgagggcttatttga >gi568815583r:41417545_41637125|GENSCAN_predicted_peptide_2|98_aa MDFPVPTKVKVAIVNSLAAWFPGSLSAMDSPASLSACDAAQPFTWQARKPQVDSISFAGR ALRRSPLGVSTTPRTGLGATLVRANGPRIPGPVRLLRR >gi568815583r:41417545_41637125|GENSCAN_predicted_CDS_2|297_bp atggacttccctgtgcccacaaaagtgaaagtggccatcgtcaactctttagcagcctgg ttccccggaagcctctctgccatggatagccctgcttcgctaagcgcgtgcgatgcagca cagcccttcacctggcaagcccggaagcctcaggttgactccatcagttttgccgggaga gcccttcggcgctccccgcttggtgtctccaccaccccccgcaccggcctgggcgccacc cttgtccgcgccaacggtccccgcatccctggccccgtgcgcctcctgcgccgttag >gi568815583r:41417545_41637125|GENSCAN_predicted_peptide_3|487_aa MARPGGARPCSPGLERAPRRSVGELRLLFEARCAAVAAAAAAGEPRARGAKRRGGQVPNG LPRAPPAPVIPQLTVTAEEPDVPPTSPGPPERERDCLPAAGSSHLQQPRRLSTSSVSSTG SSSLLEDSEDDLLSDSESRSRGNVQLEAGEDVGQKNHWQKIRTMVNLPVISPFKKRYAWV QLAGHTGEQWGGWAGARDGKGLGGADGCRSSGSFKAAGTSGLILKRCSEPERYCLARLMA DALRGCVPAFHGVVERDGESYLQLQDLLDGFDGPCVLDCKMGVRTYLEEELTKARERPKL RKDMYKKMLAVDPEAPTEEEHAQRAVTKPRYMQWREGISSSTTLGFRIEGIKKADGSCST DFKTTRSREQVLRVFEEFVQGDEEVLRRYLNRLQQIRDTLEVSEFFRRHEGRGLTVRGSQ VIGSSLLFVHDHCHRAGVWLIDFGKTTPLPDGQILDHRRPWEEGNREDGYLLGLDNLIGI LASLAER >gi568815583r:41417545_41637125|GENSCAN_predicted_CDS_3|1464_bp atggcgcggccggggggcgcgaggccctgcagcccggggctggagcgggccccgcgccgg agtgtcggggagctgcgcctgctcttcgaggcgcgctgtgcggcggtcgctgcggccgcc gccgcgggggagccccgggcccgcggggccaagcggcgtgggggacaggtccccaacggg cttccgcgggctcccccggccccggtgatccctcagctgaccgtgacagccgaggagccc gacgtgcccccgaccagccctgggccgccggagcgggagagggactgcctcccggcagcg ggctcttcgcacctgcagcagccgcgccgcctttccacctcgtcggtctcctccactggc tcctcgtcgctgctcgaggactcggaggacgacctgctgagcgacagtgagagccggagc cgcggcaacgtgcagctggaagcgggcgaggacgtgggtcagaaaaaccactggcagaag atccggaccatggtcaatctgccggtcataagccctttcaagaagcgctacgcctgggtg cagctggcagggcacactggtgagcagtggggcgggtgggcgggtgcccgcgacgggaag gggctgggcggcgctgacggatgccggtcctcagggagttttaaggcggcgggcaccagc gggctgatcctgaagcgctgctcggagccggagcgctactgcctggcgcggctgatggct gacgcgctgcgcggctgcgtgcctgccttccacggcgtggtggagcgcgacggcgaaagc tacctgcagctgcaggacctgctcgatggcttcgacggaccttgtgtgctcgactgcaaa atgggcgtcaggacttacctagaggaggagctgaccaaggcccgtgagcggcccaagctg cggaaggacatgtacaagaaaatgctggcggtggatcctgaagctcccacggaggaggag cacgcgcagcgcgccgtcaccaagccgcgctacatgcagtggcgggaaggcatcagctcc agcaccaccctcggcttccgcatcgagggcatcaagaaagcggacggctcctgcagcacc gacttcaagactacgcgaagccgagagcaggtgcttcgcgtctttgaagagtttgtgcaa ggagatgaggaagtgctgaggcggtatctgaaccgcctgcagcagatccgggacaccctg gaggtatccgagttcttcaggaggcacgagggccgcggcctgacggtgcggggctcgcag gtgatcggcagctcgctcctctttgtgcacgatcactgccatcgcgccggcgtgtggctc atcgacttcggcaagaccacgcccctccccgatggccagatcctggaccaccggcggccc tgggaggagggcaaccgcgaggacggctatttgctggggctggacaatctcattggcatc ctggccagcctggctgagagatga >gi568815583r:41417545_41637125|GENSCAN_predicted_peptide_4|2225_aa MLSRPKPGESEVDLLHFQSQFLAAGAAPAVQLVKKGNRGGGDANSDRPPLQDHRDVVMLD NLPDLPPALVPSPPKRARPSPGHCLPEDEDPEERLRRHDQHITAVLTKIIERDTSSVAVN LPVPSGVAFPAVFLRSRDTQGKSATSGKRSIFAQEIAARRIAEAKGPSVGEVVPNVGPPE GAVTCETPTPRNQGCQLPGSSHSFQGPNLVTGKGLRDQEAEQEAQTIHEENIARLQAMAP EEILQEQQRLLAQLDPSLVAFLRSHSHTQEQTGETASEEQRPGGPSANVTKEEPLMSAFA SEPRKRDKLEPEAPALALPVTPQKEWLHMDTVELEKLHWTQDLPPVRRQQTQERMQARFS LQGELLAPDVDLPTHLGLHHHGEEAERAGYSLQELFHLTRSQVSQQRALALHVLAQVISR AQAGEFGDRLAGSVLSLLLDAGFLFLLRFSLDDRVDGVIATAIRALRALLVAPGDEELLD STFSWYHGALTFPLMPSQEDKEDEDEDEECPAGKAKRKSPEEESRPPPDLARHDVIKGLL ATSLLPRLRYVLEVTYPGPAVVLDILAVLIRLARHSLESATRVLECPRLIETIVREFLPT SWSPVGAGPTPSLYKVPCATAMKLLRVLASAGRNIAARLLSSFDLRSRLCRIIAEAPQEL ALPPEEAEMLSTEALRLWAVAASYGQGGYLYRELYPVLMRALQVVPRELSTHPPQPLSMQ RIASLLTLLTQLTLAAGSTPAETISDSAEASLSATPSLVTWTQVSGLQPLVEPCLRQTLK LLSRPEMWRAVGPVPVACLLFLGAYYQAWSQQLAAILAAPGLQNYFLQCVAPGAAPHLTP FSAWALRHEYHLQYLALALAQKAAALQPLPATHAALYHGMALALLSRLLPGSEYLTHELL LSCVFRLEFLPERTSGGPEAADFSDQLSLGSSRVPRCGQGTLLAQACQDLPSIRNCYLTH CSPARASLLASQALHRGELQRVPTLLLPMPTEPLLPTDWPFLPLIRLYHRASDTPSGLSP TDTMGTAMRVLQWVLVLESWRPQALWAVPPAARLARLMCVFLVDSELFRESPVQHLVAAL LAQLCQPQVLPNLNLDCRLPGLTSFPDLYANFLDHFEAVSFGDHLFGALVLLPLQRRFSV TLRLALFGEHVGALRALSLPLTQLPVSLECYTVPPEDNLALLQLYFRTLVTGALRPRWCP VLYAVAVAHVNSFIFSQDPQSSDEVKAARRSMLQKTWLLADEGLRQHLLHYKLPNSTLPE GFELYSQLPPLRQHYLQRLTSTVLQNGITGSYHQKLLIQYIWGETQESAFLTSSQGCGRD RKGFCCRVDPTGMGCWGQLLVWFGAAGAILCSSPGSQETFLRSSPLPLASPSPRDPKVSA PPSILEPASPLNSPGTEGSWLFSTCGASGRHGPTQTQCDGAYAGTSVVVTVGAAGQLRGV QLWRVPGPGQYLISAYGAAGGKGAKNHLSRAHGVFVSAIFSLGLGESLYILVGQQGEDAC PGGSPESQLVCLGESRAVEEHAAMDGSEGVPGSRRWAGGGGGGGGATYVFRLEGASWNTP LAPQVRAGELEPLLVAAGGGGRAYLRPRDRGRTQASPEKLENRSEAPGSGGRGGAAGGGG GWTSRAPSPQAGRSLQEGAEGGQGCSEAWATLGWAAAGGFGGGGGACTAGGGGGGYRGGD ASETDNLWADGEDGVSFIHPSSELFLQPLAVTENHGEVEIRRHLNCSHCPLRDCQWQAEL QLAECLCPEGMELAVDNVTCMDLHKPPGPLVLMVAVVATSTLSLLMVCGVLILGTKRLAG TVDSRLLLSMKQKKWQGLQEMRLPSPELELSKLRTSAIRTAPNPYYCQVGLGPAQSWPLP PGVTEVSPANVTLLRALGHGAFGEVYEGLVIGLPGDSSPLQVAIKTLPELCSPQDELDFL MEALIISKFRHQNIVRCVGLSLRATPRLILLELMSGGDMKSFLRHSRPHLGQPSPLVMRD LLQLAQDIAQGCHYLEENHFIHRDIAARNCLLSCAGPSRVAKIGDFGMARDIYRASYYRR GDRALLPVKWMPPEAFLEGIFTSKTDSWSFGVLLWEIFSLGYMPYPGRTNQEVLDFVVGG GRMDPPRGCPGPVYRIMTQCWQHEPELRPSFASILERLQYCTQDPDVLNSLLPMELGPTP EEEGTSGLGNRSLECLRPPQPQELSPEKLKSWGGSPLGPWLSSGLKPLKSRGLQPQNLWN PTYRS >gi568815583r:41417545_41637125|GENSCAN_predicted_CDS_4|6678_bp atgctgtcgagaccgaagccaggggagtccgaggtggacctgctgcacttccagagtcag tttctcgcagctggtgcagccccagcagtgcagttggtgaagaaaggaaataggggcggt ggtgatgccaactcagaccggcctccgctccaggaccatcgggatgtggtgatgttggac aatctcccagatttgcccccagctttggtcccttctcctccaaagagagccaggcccagc cctggccactgcctgcctgaggatgaggacccagaagagaggctgaggaggcatgatcag cacatcactgctgtcttgactaagattattgaacgagatacaagttcagtggccgtgaat ctgcctgtgcccagtggtgttgctttccctgctgtgttccttcgctcgcgggacacacag gggaaatcagcaacatctggtaagagaagcatctttgcccaggaaattgcggcaaggagg atagctgaagccaagggcccatcagttggggaagttgtgcccaacgtgggcccaccagag ggtgccgtgacctgtgagacacccactcctaggaaccagggctgccagcttcctgggagc agccacagctttcagggacccaatctggtcacagggaaggggctcagggatcaagaagct gagcaggaagcccagactatccatgaagagaacatagcaagactgcaggccatggctcct gaggagatcctgcaggaacagcagcggttgctggcccagcttgaccccagcttggttgct ttcttgagatctcacagccacacgcaagagcaaacaggagagacagcctctgaggagcag aggccaggaggaccctctgctaatgtcaccaaggaggaacccctcatgtcagcttttgcc agtgagcccaggaagagagacaagctggagccagaagccccagctctggcattgcccgtg acccctcagaaagaatggctgcacatggacactgtcgagctggagaagctccactggacc caggacttgccccctgtccggcggcagcagacacaggagaggatgcaggctcgattcagt cttcagggagaactactggcccctgacgtggacctgcccacccacctgggtctgcaccac catggagaggaggcagagagagcggggtattccctacaggagctgttccacctgacccgc agccaggtttcccagcagagagcactggcactgcatgtgttagcccaggtcatcagcagg gcccaggctggtgagtttggggaccggctagcaggcagtgtcttaagcctccttttggat gctggtttcctcttcctactgcgcttctccttggatgacagagtggatggggtcattgca accgccatccgtgctcttcgggctctgctggtggctcctggagatgaggagctcctcgac agcaccttctcttggtaccatggagctttgacgttccctctgatgcccagccaggaggac aaggaggatgaggacgaggatgaagaatgcccagcaggaaaagcaaaaaggaaaagccct gaagaagaaagccggcctccacctgacctggcccgacatgatgtcatcaaggggctcctg gctaccagcctgctgcctcggctgcgctacgtgctggaggtgacatacccaggacctgcg gtggtccttgacatcctggctgtgctcatccgcctggcccggcattccctggaatcagcc acaagggtcctggagtgccctcggctgatagagactatagttcgagagttcttgcccacc agttggtctcctgtgggggcagggcctacccctagtctatacaaagtaccctgtgctact gccatgaaactacttcgtgtcctggcctcagctgggaggaatattgctgcccggctgttg agcagctttgatctccggagccgcctgtgccgcatcatagctgaggctccccaagaactg gccttgcccccagaggaagctgagatgctgagcaccgaggccctccgtctgtgggctgtg gctgcctcctatggccagggcggttacctttacagggagctctacccagtgctgatgcgg gccttgcaggtggtgccgcgggagctcagcacccacccacctcaacccctgtccatgcag cggatagcctcactgctcactctcctcacccagctaaccctggcagccggcagtacccct gctgaaaccatcagtgattctgctgaggccagcctctcggccaccccttccttagtcact tggacacaggtgtctgggctccagcctcttgttgagccgtgtctaaggcagaccttgaag ttgctgtccagacctgagatgtggagagccgtgggcccagtgcccgttgcctgcctgttg ttcctgggagcctactaccaggcctggagccagcaactggctgccatattggctgccccg ggactccagaattacttcctccagtgtgtggctcctggggctgccccacacctcacacct ttctctgcatgggccctgcgccatgagtaccacctgcagtacctggcactcgctctggcc cagaaagcggcagcgctgcagccactgccagccacccatgctgccctctatcatggtatg gccttggccctgctgagccggctgctgcccggaagtgagtacctcacccatgagctgctg ctgagctgtgtattccggctggagttcctcccggaaagaacatcagggggtccagaggca gccgacttctctgaccagctgtcgttaggaagcagcagggtccctcggtgtgggcaaggg actctgctggctcaggcctgccaggacctccccagcatccgcaactgctacctgactcat tgctcgccagcccgagccagtctgctggcctcccaggctctgcaccgaggggagctacag cgagtcccaaccctgctactgcccatgcctacggagccgctgctgcccaccgactggccc ttcctgccactgattcgcctctaccaccgggcttcagacaccccctcgggactctctccc acagacaccatgggcacagccatgcgggtcctgcagtgggtgctagttttggagagctgg cgcccccaggctctctgggctgtgccccctgctgcccgcctggcacggctcatgtgtgtg ttcctggtggacagtgagctgttccgggagtccccagtacagcatctggtggcagccctc ctcgcccagctctgtcagcctcaagtcttgccaaacctcaacctggactgccgactccct ggcctgacgtctttccctgacctctatgccaacttcctggatcattttgaggctgtctct tttggggaccacctctttggggccctggtcctcctgcccctgcagcgtcggttcagtgtc accttgcgccttgccctctttggggaacacgtgggagccttgcgagctctgagcctgcct ctgacccagttgcctgtgtccctggagtgttacacagtgcctcctgaagacaacctggcc ctccttcagctctacttccggaccctggttactggtgcgctccgcccacgttggtgcccc gtgctctatgctgtggctgtggctcatgtcaatagcttcatcttctctcaggacccacag agctcagatgaggtcaaagctgcccgcaggagtatgctgcagaaaacatggctgctggca gatgagggtctccggcagcacctcctgcactataagcttcccaattccacgctcccagag ggctttgagctctattctcagttgccccctctgcgtcagcactacctccagagactgact tcaacagtgctccaaaatgggatcactgggtcctaccaccagaagctgctgattcagtac atctggggtgaaacccaagaatctgcatttctaacaagttcccaggggtgtggccgcgac cgcaagggcttttgttgccgggtggacccaacagggatgggctgctggggacagctgctg gtgtggttcggagccgcgggcgccattctctgctctagcccggggtcccaggagactttt ctgcggtcctcgcccctgccgctggcaagtcccagcccccgggacccgaaagtcagcgcc ccgcctagtatcttggagccagcctccccgctgaattctccgggcaccgaggggtcttgg ctgttttctacctgcggggccagcggccggcatgggcccacacagacacaatgtgacggg gcgtacgcggggaccagcgtggtggtgaccgtgggggccgccgggcagctgagaggcgtg cagctgtggcgcgtgccgggccctggccagtatctgatctcagcctacggagccgcgggc ggcaaaggcgccaagaaccacctgtcgcgggcgcatggcgtcttcgtctcagcaatcttc tccctcggtctcggggagtcgctgtacatcctggtggggcagcagggagaggacgcctgt cccggaggtagcccggagagccagctcgtctgcctcggggagtctcgagccgttgaagag cacgcggcgatggatgggagcgaaggggtcccggggtcgcggcgctgggcgggaggtggc gggggtggcgggggcgccacctacgttttccggctggagggcgcttcctggaacacgccg ctggccccacaggtgcgcgctggcgagctggaaccgttgctggtggcggccggaggcggc ggtcgggcctacctgaggccgcgggaccgaggccggactcaggcctcccccgagaaactg gagaaccgctcggaggcgcccgggagcggcgggagaggcggggcggcaggtggtgggggc ggctggacgtcgcgggctccctctccgcaggccggccgctcactgcaggagggggcggag ggcggccagggctgctccgaggcttgggcgacccttggctgggccgcggccggcggcttc gggggcggcggcggggcctgcactgcgggcggaggcggcggcggctacagggggggcgac gcttcagagactgacaacctctgggctgatggggaagatggagtatccttcatacacccc agcagcgagctcttcctgcagcctctggcagtcaccgagaaccacggagaggtagagatc cgaaggcacctcaactgcagtcactgccctttgagagactgccaatggcaggcagagctc cagctggctgaatgcctgtgcccagaaggcatggagctagctgtggataacgtcacctgc atggacctgcacaagcccccaggccctctggttctgatggtggctgtggtggcaacctca acactgagcctccttatggtgtgtggggtcctgattctgggtacgaagcgtctagcaggc acagttgattcaaggctgctcctctccatgaagcagaagaagtggcagggcctgcaggag atgaggctgccgagccctgagcttgagctgagcaagcttcgaacctctgccatcaggaca gcccccaatccctattattgccaggtggggcttggcccggcccagtcctggcctctgcca ccaggtgtcaccgaggtttccccagccaatgttactctgctcagagccctgggccatggt gcctttggggaggtgtatgagggactggtaattggccttcctggggactccagtcccctg caggtagctatcaagaccctgccagaactctgctcgcctcaggatgagctggatttcctc atggaggccctcatcatcagcaagtttcgccatcagaacattgtgcggtgtgtggggctc agcctcagggccacccctcgcctcattctgctggaactgatgtctggaggggacatgaag agtttcctgaggcacagtcggccacacctgggccagccatcacctctggtcatgcgggac ctgctgcaactggcccaggacatagcccagggctgccactacctggaggaaaatcacttc atccacagggatattgccgcccggaactgcctgctgagctgcgctggacccagccgagtg gccaagattggggactttgggatggcacgagatatctaccgggccagttattaccgcagg ggggaccgggccttgctcccagtcaagtggatgcccccagaggccttcctggagggcatc ttcacatccaagacagattcctggtcttttggggtgctgctctgggagatcttctcactg ggctacatgccctatcctgggcgcaccaaccaggaggtgctggacttcgtcgttggagga ggccggatggaccctcctaggggctgcccagggcctgtgtaccgcatcatgacccagtgt tggcagcacgagcctgagctccgccctagctttgccagcatcttggagcgtctgcagtac tgcactcaggacccggatgtgctgaattcactcctgccaatggagctggggcccacccca gaggaggaagggacttctgggctggggaacagatctttggagtgcctaagacccccacag ccccaggaactgagtccagagaagttgaaaagctggggaggtagccctcttggcccctgg ctgtcctctggcctcaagcccctcaaatccaggggcctccaacctcagaacctttggaat cccacttatcgctcctga >gi568815583r:41417545_41637125|GENSCAN_predicted_peptide_5|1133_aa MGRPGLPPLPLPPPPRLGLLLAALASLLLPESAAAGLKLMGAPVKLTVSQGQPVKLNCSV EGMEEPDIQWVKDGAVVQNLDQLYIPVSEQHWIGFLSLKSVERSDAGRYWCQVEDGGETE ISQPVWLTVEGVPFFTVEPKDLAVPPNAPFQLSCEAVGPPEPVTIVWWRGTTKIGGPAPS PSVLNVTALPAAPFNITVTKLSSSNASVAWMPGADGRALLQSCTVQVTQAPGGWEVLAVV VPVPPFTCLLRDLVPATNYSLRVRCANALGPSPYADWVPFQTKGLAPASAPQNLHAIRTD SGLILEWEEVIPEAPLEGPLGPYKLSWVQDNGTQDELTVEGTRANLTGWDPQKDLIVRVC VSNAVGCGPWSQPLVVSSHDRAGQQGPPHSRTSWVPVVLGVLTALVTAAALALILLRKRR KETRFGQAFDSVMARGEPAVHFRAARSFNRERPERIEATLDSLGISDELKEKLEDVLIPE QQFTLGRMLGKGEFGSVREAQLKQEDGSFVKVAVKMLKADIIASSDIEEFLREAACMKEF DHPHVAKLVGVSLRSRAKGRLPIPMVILPFMKHGDLHAFLLASRIGENPFNLPLQTLIRF MVDIACGMEYLSSRNFIHRDLAARNCMYEFWRTRGLAEDMTVCVADFGLSRKIYSGDYYR QGCASKLPVKWLALESLADNLYTVQSDVWAFGVTMWEIMTRGQTPYAGIENAEIYNYLIG GNRLKQPPECMEDVYDLMYQCWSADPKQRPSFTCLRMELENILGQLSVLSASQDPLYINI ERAEEPTAGGSLELPGRDQPYSGAGDGSGMGAKGPSCGQSRRPAGRGEERGGRRRWRPVG AGPLQGRGQTRTEREPGSSGSTGSILPRVPENTPSSRNARASFVKAVFEFASSGEQVLEK VVILEQICLLQYVDDILISGEDIEKCGSFRSADSRTQRPPAARSLPIKGVGPSHLWMASM HPVHRGCGNTGRGKQEVNLWRKIDGNPNLWREHTCLDLIDYHTKVRPDLGETPFRTGRHL FIDSSSGVIEGKRHNGYSVIDGEILIEIESGKLPTIGLLKHNQEGTIYTDSKYACVVAHM FGKIWTERGLISSKGQDLVHKELITQVLNNLQLPEEIAIVRVPGHQKNLSFES >gi568815583r:41417545_41637125|GENSCAN_predicted_CDS_5|3402_bp atggggcggccggggctcccgccgctgccgctgccgccgccaccgcggctcgggctgctg ctggcggctctggcttctctgctgctcccggagtccgccgccgcaggtctgaagctcatg ggagccccggtgaagctgacagtgtctcaggggcagccggtgaagctcaactgcagtgtg gaggggatggaggagcctgacatccagtgggtgaaggatggggctgtggtccagaacttg gaccagttgtacatcccagtcagcgagcagcactggatcggcttcctcagcctgaagtca gtggagcgctctgacgccggccggtactggtgccaggtggaggatgggggtgaaaccgag atctcccagccagtgtggctcacggtagaaggtgtgccatttttcacagtggagccaaaa gatctggcagtgccacccaatgcccctttccaactgtcttgtgaggctgtgggtccccct gaacctgttaccattgtctggtggagaggaactacgaagatcgggggacccgctccctct ccatctgttttaaatgtaacagcactgcctgcagcccccttcaacatcaccgtgacaaag ctttccagcagcaacgctagtgtggcctggatgccaggtgctgatggccgagctctgcta cagtcctgtacagttcaggtgacacaggccccaggaggctgggaagtcctggctgttgtg gtccctgtgcccccctttacctgcctgctccgggacctggtgcctgccaccaactacagc ctcagggtgcgctgtgccaatgccttggggccctctccctatgctgactgggtgcccttt cagaccaagggtctagccccagccagcgctccccaaaacctccatgccatccgcacagat tcaggcctcatcttggagtgggaagaagtgatccccgaggcccctttggaaggccccctg ggaccctacaaactgtcctgggttcaagacaatggaacccaggatgagctgacagtggag gggaccagggccaatttgacaggctgggatccccaaaaggacctgatcgtacgtgtgtgc gtctccaatgcagttggctgtggaccctggagtcagccactggtggtctcttctcatgac cgtgcaggccagcagggccctcctcacagccgcacatcctgggtacctgtggtccttggt gtgctaacggccctggtgacggctgctgccctggccctcatcctgcttcgaaagagacgg aaagagacgcggtttgggcaagcctttgacagtgtcatggcccggggagagccagccgtt cacttccgggcagcccggtccttcaatcgagaaaggcccgagcgcatcgaggccacattg gacagcttgggcatcagcgatgaactaaaggaaaaactggaggatgtgctcatcccagag cagcagttcaccctgggccggatgttgggcaaaggagagtttggttcagtgcgggaggcc cagctgaagcaagaggatggctcctttgtgaaagtggctgtgaagatgctgaaagctgac atcattgcctcaagcgacattgaagagttcctcagggaagcagcttgcatgaaggagttt gaccatccacacgtggccaaacttgttggggtaagcctccggagcagggctaaaggccgt ctccccatccccatggtcatcttgcccttcatgaagcatggggacctgcatgccttcctg ctcgcctcccggattggggagaacccctttaacctacccctccagaccctgatccggttc atggtggacattgcctgcggcatggagtacctgagctctcggaacttcatccaccgagac ctggctgctcggaattgcatgtacgaattctggaggactcgagggctggcagaggacatg acagtgtgtgtggctgacttcggactctcccggaagatctacagtggggactactatcgt caaggctgtgcctccaaactgcctgtcaagtggctggccctggagagcctggccgacaac ctgtatactgtgcagagtgacgtgtgggcgttcggggtgaccatgtgggagatcatgaca cgtgggcagacgccatatgctggcatcgaaaacgctgagatttacaactacctcattggc gggaaccgcctgaaacagcctccggagtgtatggaggacgtgtatgatctcatgtaccag tgctggagtgctgaccccaagcagcgcccgagctttacttgtctgcgaatggaactggag aacatcttgggccagctgtctgtgctatctgccagccaggaccccttatacatcaacatc gagagagctgaggagcccactgcgggaggcagcctggagctacctggcagggatcagccc tacagtggggctggggatggcagtggcatgggggcaaagggccccagctgcggacagagc cggcgccctgcaggccgcggggaggagcgcggcgggcgcaggcggtggcggcccgtgggc gcggggcccctgcagggacggggacagacgcgcacggagcgggagcccggcagctccggg tctacgggatccatcctcccaagggtgcccgagaacactccatcgtcgcggaatgcccgc gcttctttcgttaaagctgtctttgagtttgcctcctctggtgaacaggtactagaaaaa gttgtcatactagaacaaatatgccttcttcagtacgtagacgacattcttatatctggt gaagatatagagaagtgcggtagttttaggagtgctgactcaagaacacagaggccgccg gcagcccgtagccttcctatcaaaggtgttggacccagtcacttgtggatggcctcaatg catccagtccatcggggctgcggcaatactggtcgaggaaagcaggaagttaacctttgg aggaaaattgacgggaatccaaatctatggagggaacacacatgtttagatttaattgat taccatacaaaggttcgaccagacctaggagaaacccccttccggactggacggcactta ttcatagacagttcctccggggtgattgagggaaaaagacacaatgggtattcagtgatt gatggagaaattctcatagaaatagaatctggaaaattgccaacaattggtctgctcaaa cataaccaggaaggaaccatctatacagattccaagtatgcctgtgtagtggcccatatg tttgggaaaatttggactgaaagaggtctcattagtagtaaaggtcaagaccttgttcac aaggagctgatcacccaagtattgaataatcttcagttgccagaagaaatagctattgtc cgtgttcccggacaccagaaaaacctttcttttgaaagttga >gi568815583r:41417545_41637125|GENSCAN_predicted_peptide_6|81_aa MVLTSLAGGQAAPRNEDAEKEEPNSSILGTLNPSGTGFGWLTWGVEDIIENHIDMLGPVN LDWSGLSLLSYWQQLRESRWP >gi568815583r:41417545_41637125|GENSCAN_predicted_CDS_6|246_bp atggtccttaccagtcttgctggaggtcaagctgccccaaggaatgaagatgctgaaaaa gaagaaccaaacagttccattttagggactctgaaccccagtggaacaggatttggttgg ctgacctggggagtggaagacatcatagaaaaccacattgatatgttggggccggttaac ctggactggtctggactttccctgctgtcctattggcagcaactcagagagtccagatgg ccctag