GENSCAN 1.0 Date run: 8-Nov-116 Time: 02:19:00 Sequence gi568815583r:41403999_41613709 : 209711 bp : 49.14% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 13118 13315 198 1 0 54 94 392 0.996 33.30 1.02 Intr + 34323 34433 111 2 0 76 90 92 0.913 8.78 1.03 Intr + 48903 49050 148 2 1 101 105 100 0.836 12.81 1.04 Intr + 53674 53878 205 2 1 80 116 208 0.984 21.06 1.05 Intr + 60773 60887 115 2 1 40 91 116 0.914 7.55 1.06 Intr + 62143 62254 112 0 1 108 115 173 0.999 21.95 1.07 Intr + 66259 66394 136 2 1 98 57 261 0.981 23.63 1.08 Intr + 67174 67351 178 1 1 89 11 106 0.752 2.92 1.09 Intr + 70622 70704 83 1 2 94 70 30 0.567 0.24 1.10 Intr + 71527 71614 88 1 1 34 88 65 0.694 1.07 1.11 Intr + 71714 71821 108 1 0 75 94 58 0.915 5.58 1.12 Intr + 72448 72525 78 0 0 85 116 66 0.992 8.85 1.13 Intr + 73167 73288 122 2 2 25 89 188 0.995 11.89 1.14 Intr + 73460 73517 58 2 1 117 83 117 0.999 12.99 1.15 Intr + 74550 74627 78 2 0 76 82 58 0.930 3.75 1.16 Intr + 75105 75209 105 2 0 61 26 93 0.606 0.81 1.17 Intr + 76216 76327 112 0 1 76 91 113 0.942 10.35 1.18 Term + 76583 76689 107 0 2 121 53 70 0.921 5.37 1.19 PlyA + 78272 78277 6 1.05 2.00 Prom + 78313 78352 40 -8.76 2.01 Init + 78447 78454 8 2 2 85 89 4 0.759 0.41 2.02 Intr + 79666 79736 71 1 2 90 58 89 0.832 4.93 2.03 Term + 82180 82397 218 2 2 89 43 133 0.614 6.21 2.04 PlyA + 85155 85160 6 -0.45 3.00 Prom + 85522 85561 40 -7.56 3.01 Init + 89957 90418 462 1 0 92 96 766 0.964 71.50 3.02 Intr + 97465 97853 389 0 2 78 85 655 0.610 57.79 3.03 Intr + 97999 98203 205 1 1 125 100 489 0.999 53.00 3.04 Intr + 98404 98505 102 0 0 86 85 153 0.999 15.17 3.05 Intr + 98790 98861 72 2 0 96 99 121 0.985 13.60 3.06 Term + 98935 99168 234 0 0 4 54 380 0.754 22.52 3.07 PlyA + 99534 99539 6 1.05 4.44 PlyA - 99663 99658 6 1.05 4.43 Term - 100246 99998 249 1 0 108 52 32 0.708 -2.70 4.42 Intr - 100434 100344 91 2 1 87 69 139 0.993 11.90 4.41 Intr - 100642 100508 135 0 0 80 56 136 0.998 9.28 4.40 Intr - 100876 100775 102 0 0 113 19 118 0.985 6.69 4.39 Intr - 101066 100974 93 1 0 73 66 89 0.940 4.28 4.38 Intr - 101307 101210 98 0 2 86 66 166 0.998 13.01 4.37 Intr - 101532 101403 130 2 1 52 85 0 0.405 -3.20 4.36 Intr - 101779 101715 65 1 2 107 55 72 0.362 3.32 4.35 Intr - 102007 101917 91 0 1 112 100 44 0.957 8.00 4.34 Intr - 103292 103097 196 0 1 68 111 83 0.956 7.07 4.33 Intr - 103659 103516 144 1 0 92 55 49 0.695 2.25 4.32 Intr - 104223 104071 153 1 0 51 71 119 0.489 6.54 4.31 Intr - 105131 105033 99 0 0 81 72 92 0.917 6.98 4.30 Intr - 107348 107166 183 0 0 126 61 112 0.997 12.06 4.29 Intr - 107619 107424 196 0 1 33 94 185 0.551 12.59 4.28 Intr - 107965 107819 147 1 0 115 87 52 0.911 8.23 4.27 Intr - 108267 108117 151 2 1 91 38 155 0.696 10.96 4.26 Intr - 108880 108709 172 2 1 96 84 127 0.994 12.10 4.25 Intr - 109122 108979 144 1 0 63 88 72 0.938 4.95 4.24 Intr - 109762 109669 94 1 1 -25 114 111 0.178 1.94 4.23 Intr - 111635 111552 84 2 0 18 90 77 0.076 0.92 4.22 Intr - 113693 113559 135 2 0 118 21 89 0.173 5.96 4.21 Intr - 113859 113800 60 0 0 142 75 64 0.998 9.43 4.20 Intr - 114184 114008 177 1 0 90 89 177 0.992 18.12 4.19 Intr - 117149 116393 757 1 1 82 80 574 0.980 47.47 4.18 Intr - 117882 117740 143 0 2 105 105 156 0.968 18.15 4.17 Intr - 118252 118100 153 1 0 52 78 93 0.843 4.97 4.16 Intr - 119974 119773 202 0 1 47 89 19 0.497 -2.81 4.15 Intr - 120256 120098 159 0 0 89 68 117 0.817 8.90 4.14 Intr - 121150 120993 158 1 2 84 94 164 0.999 15.41 4.13 Intr - 123070 122900 171 1 0 139 93 -15 0.948 4.24 4.12 Intr - 123303 123169 135 0 0 63 76 221 0.998 19.16 4.11 Intr - 123607 123425 183 1 0 102 98 239 0.999 26.28 4.10 Intr - 124029 123862 168 0 0 77 66 156 0.974 12.44 4.09 Intr - 124338 124237 102 0 0 93 96 26 0.939 4.27 4.08 Intr - 125570 125472 99 2 0 87 78 85 0.994 7.71 4.07 Intr - 125981 125866 116 0 2 85 78 158 0.997 14.67 4.06 Intr - 127204 127025 180 2 0 114 41 77 0.700 5.44 4.05 Intr - 130937 130716 222 0 0 76 52 197 0.903 13.00 4.04 Intr - 131634 131514 121 0 1 117 84 119 0.999 14.47 4.03 Intr - 132220 132131 90 1 0 83 96 50 0.944 5.49 4.02 Intr - 132651 132503 149 1 2 103 30 -9 0.860 -5.25 4.01 Init - 133127 132947 181 2 1 76 79 170 0.871 14.25 4.00 Prom - 134679 134640 40 -3.76 5.00 Prom + 149486 149525 40 -6.46 5.01 Init + 155278 155383 106 0 1 83 91 281 0.827 26.28 5.02 Intr + 157129 157312 184 2 1 112 61 316 0.841 30.15 5.03 Intr + 157541 157641 101 2 2 66 100 83 0.587 7.05 5.04 Intr + 158550 158720 171 1 0 132 99 31 0.983 8.51 5.05 Intr + 161028 161143 116 1 2 72 96 126 0.894 11.97 5.06 Intr + 163362 163539 178 2 1 101 82 165 0.998 16.69 5.07 Intr + 164219 164364 146 0 2 69 110 121 0.984 12.40 5.08 Intr + 164880 165024 145 2 1 70 94 78 0.936 6.46 5.09 Intr + 166029 166158 130 1 1 113 82 177 0.998 19.35 5.10 Intr + 166242 166342 101 0 2 89 84 197 0.998 19.15 5.11 Intr + 166606 166701 96 2 0 76 95 170 0.998 16.48 5.12 Intr + 167040 167120 81 1 0 52 113 122 0.996 10.71 5.13 Intr + 167597 167689 93 0 0 87 85 99 0.999 9.44 5.14 Intr + 168445 168566 122 2 2 113 59 97 0.996 9.51 5.15 Intr + 169004 169137 134 1 2 92 39 217 0.888 16.64 5.16 Intr + 169310 169469 160 2 1 97 77 214 0.992 21.19 5.17 Intr + 169681 169817 137 0 2 65 67 226 0.972 17.57 5.18 Intr + 173888 174122 235 2 1 133 23 160 0.644 11.69 5.19 Intr + 181462 181708 247 0 1 52 43 145 0.402 3.43 5.20 Intr + 184185 184276 92 1 2 49 88 79 0.594 3.71 5.21 Intr + 184685 184862 178 1 1 27 115 87 0.605 4.79 5.22 Intr + 185020 185249 230 2 2 57 58 86 0.213 0.09 5.23 Term + 185291 185509 219 1 0 -17 55 154 0.415 -1.36 5.24 PlyA + 185641 185646 6 1.05 6.03 PlyA - 188192 188187 6 1.05 6.02 Term - 195410 195226 185 0 2 105 43 70 0.772 1.91 6.01 Init - 197572 197512 61 1 1 92 66 35 0.761 3.11 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 113693 113544 150 2 0 118 45 105 0.821 7.11 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583r:41403999_41613709|GENSCAN_predicted_peptide_1|713_aa MRGRLCVGRAAAAAAAVAVPLAGGQEGSPGGGRRGSRGTTMVKKRKGRVVIDSDTEDSGS DENLDQELLSLAKRKRSDSEEKEPPVSQPAASSDSETSDSDDEWTFGSNKNKKKGKARKI EKKGTMKKQANKTASSGSSDKDSSAESSAPEEGEVSDSDSNSSSSSSDSDSSSEDEEFHD GYGEDLMGDEEDRARLEQMTEKEREQELFNRIEKREVLKRRFEIKKKLKTAKKKEKKEKK KKQEEEQEKKKLTQIQESQVTSHNKERRSKRDEKLDKKSQAMEELKAEREKRKNRTAELL AKKQPLKTSEVYSDDEEEEEDDKSSEKSDRSSRTSSSDEEEEKEEIPPKSQPVSLPEELN RVRLSRHKLERWCHMPFFAKTVTGCFVRIGIGNHNSKPVYRVAEITGVVETAKVYQLGGT RTNKGLQLRHGNDQRVFRLEFVSNQEFTESEFMKWKEAMFSAGMQLPTLDEINKKELSIK EALNYKFNDQDIEEIVKEKERFRKAPPNYAMKKTQLLKEKAMAEDLGDQDKAKQIQDQLN ELEERAEALDRQRTKNISAISYINQRNREWNIVESEKALVAESHNMKNQQMDPFTRRQCK PTIVSNSRDPAVQAAILAQLNAKYGSGVLPDAPKEMSKASVGQGKDKDLNSKSASDLSED LFKVHDFDVKIDLQVPSSESKALAITSKAPPAKDGAPRRSLNLEDYKKRRGLI >gi568815583r:41403999_41613709|GENSCAN_predicted_CDS_1|2142_bp atgcgcggtcgcctttgtgtgggtcgagcagcggcggcggcggcggcagtggcggtccca ctggcaggcgggcaagaggggagtccgggcggcggccggcgtgggagccgggggaccacc atggtaaagaagcggaaaggccgcgtcgtgatcgactcggacacagaggacagcggcagc gacgagaacctggatcaggagctcttgtccctggcaaagcgaaagcgcagtgactctgag gagaaggagccgcctgtgagtcagcctgcagcctcgtcagactcggagacgtctgacagt gacgatgagtggacatttgggagcaataaaaataagaagaaaggaaaagccagaaaaata gagaagaaaggaaccatgaagaaacaggccaacaaaactgcctcctcaggcagttcagac aaagacagttcagctgagagctcagcccctgaggaaggtgaagtgtcagactctgacagc aacagctcctcttccagttcagattcagactcttcctcagaagatgaagagttccatgat ggctatggagaagacctcatgggagatgaggaagacagggcccgtctggaacagatgaca gagaaagagagagagcaagaactgttcaatcgcatagagaagagggaggtgttgaaaaga agatttgaaatcaagaaaaaactaaaaacagccaaaaagaaagaaaagaaagaaaagaag aaaaagcaagaagaggagcaagaaaagaaaaaactgacacagattcaagaatctcaggta acatcccacaacaaggaacggcgttccaagcgggatgagaaactagacaagaaatctcaa gccatggaggagctaaaagcagagcgagaaaaacgaaagaacagaacagctgagctcctt gccaaaaaacagccattaaaaaccagtgaggtctactctgatgatgaagaggaggaagag gatgacaaatccagtgaaaagtcagaccgctcatcacgaacatcatcgtctgatgaagaa gaggagaaagaagagatccctcccaaatcccaaccagtttccttacctgaagaattgaat cgggttcgattatcacggcataagctagaacgctggtgtcacatgcccttctttgctaaa actgtcacaggatgttttgtgcggattggcatcggaaaccacaacagcaaaccagtttac cgggtcgctgagattacgggtgttgtggaaactgccaaagtttaccaactaggtggcacc agaacaaacaaagggctgcaactacggcatggcaatgaccaacgcgtgttccgtttagag tttgtctcaaaccaagaattcaccgaaagtgagtttatgaagtggaaagaagcgatgttc tctgctggcatgcagttgcccactctagatgaaatcaataaaaaggaattatctattaaa gaagctcttaattataaattcaatgatcaggacattgaagagattgtaaaagagaaagaa aggttcagaaaagctccacccaactacgctatgaagaagactcagctactgaaggaaaag gccatggctgaggacctgggggatcaggacaaggccaaacaaatccaagatcaactgaat gagctggaggaacgggcagaggccctggaccgccagcggaccaagaacatatccgctatc agttacatcaaccagcggaaccgggagtggaacattgtagagtctgagaaggcccttgtg gctgaaagtcacaacatgaaaaaccaacagatggatccctttactcggcggcagtgcaag cctaccatcgtttctaattccagagacccagctgttcaagctgccatcttggcccagctg aatgcaaaatacggttctggagtgttaccagatgctccaaaggaaatgagcaaggcaagt gtgggtcaaggcaaagataaagatttgaattctaagtcagccagtgacctctcagaagat ctgttcaaagtacacgattttgatgtgaagattgacttacaagttcccagctcagagtca aaggctttagccatcacctccaaggctccgccagccaaggatggggctccaaggagatct ctgaacttggaagactacaaaaaacgacgagggcttatttga >gi568815583r:41403999_41613709|GENSCAN_predicted_peptide_2|98_aa MDFPVPTKVKVAIVNSLAAWFPGSLSAMDSPASLSACDAAQPFTWQARKPQVDSISFAGR ALRRSPLGVSTTPRTGLGATLVRANGPRIPGPVRLLRR >gi568815583r:41403999_41613709|GENSCAN_predicted_CDS_2|297_bp atggacttccctgtgcccacaaaagtgaaagtggccatcgtcaactctttagcagcctgg ttccccggaagcctctctgccatggatagccctgcttcgctaagcgcgtgcgatgcagca cagcccttcacctggcaagcccggaagcctcaggttgactccatcagttttgccgggaga gcccttcggcgctccccgcttggtgtctccaccaccccccgcaccggcctgggcgccacc cttgtccgcgccaacggtccccgcatccctggccccgtgcgcctcctgcgccgttag >gi568815583r:41403999_41613709|GENSCAN_predicted_peptide_3|487_aa MARPGGARPCSPGLERAPRRSVGELRLLFEARCAAVAAAAAAGEPRARGAKRRGGQVPNG LPRAPPAPVIPQLTVTAEEPDVPPTSPGPPERERDCLPAAGSSHLQQPRRLSTSSVSSTG SSSLLEDSEDDLLSDSESRSRGNVQLEAGEDVGQKNHWQKIRTMVNLPVISPFKKRYAWV QLAGHTGEQWGGWAGARDGKGLGGADGCRSSGSFKAAGTSGLILKRCSEPERYCLARLMA DALRGCVPAFHGVVERDGESYLQLQDLLDGFDGPCVLDCKMGVRTYLEEELTKARERPKL RKDMYKKMLAVDPEAPTEEEHAQRAVTKPRYMQWREGISSSTTLGFRIEGIKKADGSCST DFKTTRSREQVLRVFEEFVQGDEEVLRRYLNRLQQIRDTLEVSEFFRRHEGRGLTVRGSQ VIGSSLLFVHDHCHRAGVWLIDFGKTTPLPDGQILDHRRPWEEGNREDGYLLGLDNLIGI LASLAER >gi568815583r:41403999_41613709|GENSCAN_predicted_CDS_3|1464_bp atggcgcggccggggggcgcgaggccctgcagcccggggctggagcgggccccgcgccgg agtgtcggggagctgcgcctgctcttcgaggcgcgctgtgcggcggtcgctgcggccgcc gccgcgggggagccccgggcccgcggggccaagcggcgtgggggacaggtccccaacggg cttccgcgggctcccccggccccggtgatccctcagctgaccgtgacagccgaggagccc gacgtgcccccgaccagccctgggccgccggagcgggagagggactgcctcccggcagcg ggctcttcgcacctgcagcagccgcgccgcctttccacctcgtcggtctcctccactggc tcctcgtcgctgctcgaggactcggaggacgacctgctgagcgacagtgagagccggagc cgcggcaacgtgcagctggaagcgggcgaggacgtgggtcagaaaaaccactggcagaag atccggaccatggtcaatctgccggtcataagccctttcaagaagcgctacgcctgggtg cagctggcagggcacactggtgagcagtggggcgggtgggcgggtgcccgcgacgggaag gggctgggcggcgctgacggatgccggtcctcagggagttttaaggcggcgggcaccagc gggctgatcctgaagcgctgctcggagccggagcgctactgcctggcgcggctgatggct gacgcgctgcgcggctgcgtgcctgccttccacggcgtggtggagcgcgacggcgaaagc tacctgcagctgcaggacctgctcgatggcttcgacggaccttgtgtgctcgactgcaaa atgggcgtcaggacttacctagaggaggagctgaccaaggcccgtgagcggcccaagctg cggaaggacatgtacaagaaaatgctggcggtggatcctgaagctcccacggaggaggag cacgcgcagcgcgccgtcaccaagccgcgctacatgcagtggcgggaaggcatcagctcc agcaccaccctcggcttccgcatcgagggcatcaagaaagcggacggctcctgcagcacc gacttcaagactacgcgaagccgagagcaggtgcttcgcgtctttgaagagtttgtgcaa ggagatgaggaagtgctgaggcggtatctgaaccgcctgcagcagatccgggacaccctg gaggtatccgagttcttcaggaggcacgagggccgcggcctgacggtgcggggctcgcag gtgatcggcagctcgctcctctttgtgcacgatcactgccatcgcgccggcgtgtggctc atcgacttcggcaagaccacgcccctccccgatggccagatcctggaccaccggcggccc tgggaggagggcaaccgcgaggacggctatttgctggggctggacaatctcattggcatc ctggccagcctggctgagagatga >gi568815583r:41403999_41613709|GENSCAN_predicted_peptide_4|2225_aa MLSRPKPGESEVDLLHFQSQFLAAGAAPAVQLVKKGNRGGGDANSDRPPLQDHRDVVMLD NLPDLPPALVPSPPKRARPSPGHCLPEDEDPEERLRRHDQHITAVLTKIIERDTSSVAVN LPVPSGVAFPAVFLRSRDTQGKSATSGKRSIFAQEIAARRIAEAKGPSVGEVVPNVGPPE GAVTCETPTPRNQGCQLPGSSHSFQGPNLVTGKGLRDQEAEQEAQTIHEENIARLQAMAP EEILQEQQRLLAQLDPSLVAFLRSHSHTQEQTGETASEEQRPGGPSANVTKEEPLMSAFA SEPRKRDKLEPEAPALALPVTPQKEWLHMDTVELEKLHWTQDLPPVRRQQTQERMQARFS LQGELLAPDVDLPTHLGLHHHGEEAERAGYSLQELFHLTRSQVSQQRALALHVLAQVISR AQAGEFGDRLAGSVLSLLLDAGFLFLLRFSLDDRVDGVIATAIRALRALLVAPGDEELLD STFSWYHGALTFPLMPSQEDKEDEDEDEECPAGKAKRKSPEEESRPPPDLARHDVIKGLL ATSLLPRLRYVLEVTYPGPAVVLDILAVLIRLARHSLESATRVLECPRLIETIVREFLPT SWSPVGAGPTPSLYKVPCATAMKLLRVLASAGRNIAARLLSSFDLRSRLCRIIAEAPQEL ALPPEEAEMLSTEALRLWAVAASYGQGGYLYRELYPVLMRALQVVPRELSTHPPQPLSMQ RIASLLTLLTQLTLAAGSTPAETISDSAEASLSATPSLVTWTQVSGLQPLVEPCLRQTLK LLSRPEMWRAVGPVPVACLLFLGAYYQAWSQQLAAILAAPGLQNYFLQCVAPGAAPHLTP FSAWALRHEYHLQYLALALAQKAAALQPLPATHAALYHGMALALLSRLLPGSEYLTHELL LSCVFRLEFLPERTSGGPEAADFSDQLSLGSSRVPRCGQGTLLAQACQDLPSIRNCYLTH CSPARASLLASQALHRGELQRVPTLLLPMPTEPLLPTDWPFLPLIRLYHRASDTPSGLSP TDTMGTAMRVLQWVLVLESWRPQALWAVPPAARLARLMCVFLVDSELFRESPVQHLVAAL LAQLCQPQVLPNLNLDCRLPGLTSFPDLYANFLDHFEAVSFGDHLFGALVLLPLQRRFSV TLRLALFGEHVGALRALSLPLTQLPVSLECYTVPPEDNLALLQLYFRTLVTGALRPRWCP VLYAVAVAHVNSFIFSQDPQSSDEVKAARRSMLQKTWLLADEGLRQHLLHYKLPNSTLPE GFELYSQLPPLRQHYLQRLTSTVLQNGITGSYHQKLLIQYIWGETQESAFLTSSQGCGRD RKGFCCRVDPTGMGCWGQLLVWFGAAGAILCSSPGSQETFLRSSPLPLASPSPRDPKVSA PPSILEPASPLNSPGTEGSWLFSTCGASGRHGPTQTQCDGAYAGTSVVVTVGAAGQLRGV QLWRVPGPGQYLISAYGAAGGKGAKNHLSRAHGVFVSAIFSLGLGESLYILVGQQGEDAC PGGSPESQLVCLGESRAVEEHAAMDGSEGVPGSRRWAGGGGGGGGATYVFRLEGASWNTP LAPQVRAGELEPLLVAAGGGGRAYLRPRDRGRTQASPEKLENRSEAPGSGGRGGAAGGGG GWTSRAPSPQAGRSLQEGAEGGQGCSEAWATLGWAAAGGFGGGGGACTAGGGGGGYRGGD ASETDNLWADGEDGVSFIHPSSELFLQPLAVTENHGEVEIRRHLNCSHCPLRDCQWQAEL QLAECLCPEGMELAVDNVTCMDLHKPPGPLVLMVAVVATSTLSLLMVCGVLILGTKRLAG TVDSRLLLSMKQKKWQGLQEMRLPSPELELSKLRTSAIRTAPNPYYCQVGLGPAQSWPLP PGVTEVSPANVTLLRALGHGAFGEVYEGLVIGLPGDSSPLQVAIKTLPELCSPQDELDFL MEALIISKFRHQNIVRCVGLSLRATPRLILLELMSGGDMKSFLRHSRPHLGQPSPLVMRD LLQLAQDIAQGCHYLEENHFIHRDIAARNCLLSCAGPSRVAKIGDFGMARDIYRASYYRR GDRALLPVKWMPPEAFLEGIFTSKTDSWSFGVLLWEIFSLGYMPYPGRTNQEVLDFVVGG GRMDPPRGCPGPVYRIMTQCWQHEPELRPSFASILERLQYCTQDPDVLNSLLPMELGPTP EEEGTSGLGNRSLECLRPPQPQELSPEKLKSWGGSPLGPWLSSGLKPLKSRGLQPQNLWN PTYRS >gi568815583r:41403999_41613709|GENSCAN_predicted_CDS_4|6678_bp atgctgtcgagaccgaagccaggggagtccgaggtggacctgctgcacttccagagtcag tttctcgcagctggtgcagccccagcagtgcagttggtgaagaaaggaaataggggcggt ggtgatgccaactcagaccggcctccgctccaggaccatcgggatgtggtgatgttggac aatctcccagatttgcccccagctttggtcccttctcctccaaagagagccaggcccagc cctggccactgcctgcctgaggatgaggacccagaagagaggctgaggaggcatgatcag cacatcactgctgtcttgactaagattattgaacgagatacaagttcagtggccgtgaat ctgcctgtgcccagtggtgttgctttccctgctgtgttccttcgctcgcgggacacacag gggaaatcagcaacatctggtaagagaagcatctttgcccaggaaattgcggcaaggagg atagctgaagccaagggcccatcagttggggaagttgtgcccaacgtgggcccaccagag ggtgccgtgacctgtgagacacccactcctaggaaccagggctgccagcttcctgggagc agccacagctttcagggacccaatctggtcacagggaaggggctcagggatcaagaagct gagcaggaagcccagactatccatgaagagaacatagcaagactgcaggccatggctcct gaggagatcctgcaggaacagcagcggttgctggcccagcttgaccccagcttggttgct ttcttgagatctcacagccacacgcaagagcaaacaggagagacagcctctgaggagcag aggccaggaggaccctctgctaatgtcaccaaggaggaacccctcatgtcagcttttgcc agtgagcccaggaagagagacaagctggagccagaagccccagctctggcattgcccgtg acccctcagaaagaatggctgcacatggacactgtcgagctggagaagctccactggacc caggacttgccccctgtccggcggcagcagacacaggagaggatgcaggctcgattcagt cttcagggagaactactggcccctgacgtggacctgcccacccacctgggtctgcaccac catggagaggaggcagagagagcggggtattccctacaggagctgttccacctgacccgc agccaggtttcccagcagagagcactggcactgcatgtgttagcccaggtcatcagcagg gcccaggctggtgagtttggggaccggctagcaggcagtgtcttaagcctccttttggat gctggtttcctcttcctactgcgcttctccttggatgacagagtggatggggtcattgca accgccatccgtgctcttcgggctctgctggtggctcctggagatgaggagctcctcgac agcaccttctcttggtaccatggagctttgacgttccctctgatgcccagccaggaggac aaggaggatgaggacgaggatgaagaatgcccagcaggaaaagcaaaaaggaaaagccct gaagaagaaagccggcctccacctgacctggcccgacatgatgtcatcaaggggctcctg gctaccagcctgctgcctcggctgcgctacgtgctggaggtgacatacccaggacctgcg gtggtccttgacatcctggctgtgctcatccgcctggcccggcattccctggaatcagcc acaagggtcctggagtgccctcggctgatagagactatagttcgagagttcttgcccacc agttggtctcctgtgggggcagggcctacccctagtctatacaaagtaccctgtgctact gccatgaaactacttcgtgtcctggcctcagctgggaggaatattgctgcccggctgttg agcagctttgatctccggagccgcctgtgccgcatcatagctgaggctccccaagaactg gccttgcccccagaggaagctgagatgctgagcaccgaggccctccgtctgtgggctgtg gctgcctcctatggccagggcggttacctttacagggagctctacccagtgctgatgcgg gccttgcaggtggtgccgcgggagctcagcacccacccacctcaacccctgtccatgcag cggatagcctcactgctcactctcctcacccagctaaccctggcagccggcagtacccct gctgaaaccatcagtgattctgctgaggccagcctctcggccaccccttccttagtcact tggacacaggtgtctgggctccagcctcttgttgagccgtgtctaaggcagaccttgaag ttgctgtccagacctgagatgtggagagccgtgggcccagtgcccgttgcctgcctgttg ttcctgggagcctactaccaggcctggagccagcaactggctgccatattggctgccccg ggactccagaattacttcctccagtgtgtggctcctggggctgccccacacctcacacct ttctctgcatgggccctgcgccatgagtaccacctgcagtacctggcactcgctctggcc cagaaagcggcagcgctgcagccactgccagccacccatgctgccctctatcatggtatg gccttggccctgctgagccggctgctgcccggaagtgagtacctcacccatgagctgctg ctgagctgtgtattccggctggagttcctcccggaaagaacatcagggggtccagaggca gccgacttctctgaccagctgtcgttaggaagcagcagggtccctcggtgtgggcaaggg actctgctggctcaggcctgccaggacctccccagcatccgcaactgctacctgactcat tgctcgccagcccgagccagtctgctggcctcccaggctctgcaccgaggggagctacag cgagtcccaaccctgctactgcccatgcctacggagccgctgctgcccaccgactggccc ttcctgccactgattcgcctctaccaccgggcttcagacaccccctcgggactctctccc acagacaccatgggcacagccatgcgggtcctgcagtgggtgctagttttggagagctgg cgcccccaggctctctgggctgtgccccctgctgcccgcctggcacggctcatgtgtgtg ttcctggtggacagtgagctgttccgggagtccccagtacagcatctggtggcagccctc ctcgcccagctctgtcagcctcaagtcttgccaaacctcaacctggactgccgactccct ggcctgacgtctttccctgacctctatgccaacttcctggatcattttgaggctgtctct tttggggaccacctctttggggccctggtcctcctgcccctgcagcgtcggttcagtgtc accttgcgccttgccctctttggggaacacgtgggagccttgcgagctctgagcctgcct ctgacccagttgcctgtgtccctggagtgttacacagtgcctcctgaagacaacctggcc ctccttcagctctacttccggaccctggttactggtgcgctccgcccacgttggtgcccc gtgctctatgctgtggctgtggctcatgtcaatagcttcatcttctctcaggacccacag agctcagatgaggtcaaagctgcccgcaggagtatgctgcagaaaacatggctgctggca gatgagggtctccggcagcacctcctgcactataagcttcccaattccacgctcccagag ggctttgagctctattctcagttgccccctctgcgtcagcactacctccagagactgact tcaacagtgctccaaaatgggatcactgggtcctaccaccagaagctgctgattcagtac atctggggtgaaacccaagaatctgcatttctaacaagttcccaggggtgtggccgcgac cgcaagggcttttgttgccgggtggacccaacagggatgggctgctggggacagctgctg gtgtggttcggagccgcgggcgccattctctgctctagcccggggtcccaggagactttt ctgcggtcctcgcccctgccgctggcaagtcccagcccccgggacccgaaagtcagcgcc ccgcctagtatcttggagccagcctccccgctgaattctccgggcaccgaggggtcttgg ctgttttctacctgcggggccagcggccggcatgggcccacacagacacaatgtgacggg gcgtacgcggggaccagcgtggtggtgaccgtgggggccgccgggcagctgagaggcgtg cagctgtggcgcgtgccgggccctggccagtatctgatctcagcctacggagccgcgggc ggcaaaggcgccaagaaccacctgtcgcgggcgcatggcgtcttcgtctcagcaatcttc tccctcggtctcggggagtcgctgtacatcctggtggggcagcagggagaggacgcctgt cccggaggtagcccggagagccagctcgtctgcctcggggagtctcgagccgttgaagag cacgcggcgatggatgggagcgaaggggtcccggggtcgcggcgctgggcgggaggtggc gggggtggcgggggcgccacctacgttttccggctggagggcgcttcctggaacacgccg ctggccccacaggtgcgcgctggcgagctggaaccgttgctggtggcggccggaggcggc ggtcgggcctacctgaggccgcgggaccgaggccggactcaggcctcccccgagaaactg gagaaccgctcggaggcgcccgggagcggcgggagaggcggggcggcaggtggtgggggc ggctggacgtcgcgggctccctctccgcaggccggccgctcactgcaggagggggcggag ggcggccagggctgctccgaggcttgggcgacccttggctgggccgcggccggcggcttc gggggcggcggcggggcctgcactgcgggcggaggcggcggcggctacagggggggcgac gcttcagagactgacaacctctgggctgatggggaagatggagtatccttcatacacccc agcagcgagctcttcctgcagcctctggcagtcaccgagaaccacggagaggtagagatc cgaaggcacctcaactgcagtcactgccctttgagagactgccaatggcaggcagagctc cagctggctgaatgcctgtgcccagaaggcatggagctagctgtggataacgtcacctgc atggacctgcacaagcccccaggccctctggttctgatggtggctgtggtggcaacctca acactgagcctccttatggtgtgtggggtcctgattctgggtacgaagcgtctagcaggc acagttgattcaaggctgctcctctccatgaagcagaagaagtggcagggcctgcaggag atgaggctgccgagccctgagcttgagctgagcaagcttcgaacctctgccatcaggaca gcccccaatccctattattgccaggtggggcttggcccggcccagtcctggcctctgcca ccaggtgtcaccgaggtttccccagccaatgttactctgctcagagccctgggccatggt gcctttggggaggtgtatgagggactggtaattggccttcctggggactccagtcccctg caggtagctatcaagaccctgccagaactctgctcgcctcaggatgagctggatttcctc atggaggccctcatcatcagcaagtttcgccatcagaacattgtgcggtgtgtggggctc agcctcagggccacccctcgcctcattctgctggaactgatgtctggaggggacatgaag agtttcctgaggcacagtcggccacacctgggccagccatcacctctggtcatgcgggac ctgctgcaactggcccaggacatagcccagggctgccactacctggaggaaaatcacttc atccacagggatattgccgcccggaactgcctgctgagctgcgctggacccagccgagtg gccaagattggggactttgggatggcacgagatatctaccgggccagttattaccgcagg ggggaccgggccttgctcccagtcaagtggatgcccccagaggccttcctggagggcatc ttcacatccaagacagattcctggtcttttggggtgctgctctgggagatcttctcactg ggctacatgccctatcctgggcgcaccaaccaggaggtgctggacttcgtcgttggagga ggccggatggaccctcctaggggctgcccagggcctgtgtaccgcatcatgacccagtgt tggcagcacgagcctgagctccgccctagctttgccagcatcttggagcgtctgcagtac tgcactcaggacccggatgtgctgaattcactcctgccaatggagctggggcccacccca gaggaggaagggacttctgggctggggaacagatctttggagtgcctaagacccccacag ccccaggaactgagtccagagaagttgaaaagctggggaggtagccctcttggcccctgg ctgtcctctggcctcaagcccctcaaatccaggggcctccaacctcagaacctttggaat cccacttatcgctcctga >gi568815583r:41403999_41613709|GENSCAN_predicted_peptide_5|1133_aa MGRPGLPPLPLPPPPRLGLLLAALASLLLPESAAAGLKLMGAPVKLTVSQGQPVKLNCSV EGMEEPDIQWVKDGAVVQNLDQLYIPVSEQHWIGFLSLKSVERSDAGRYWCQVEDGGETE ISQPVWLTVEGVPFFTVEPKDLAVPPNAPFQLSCEAVGPPEPVTIVWWRGTTKIGGPAPS PSVLNVTALPAAPFNITVTKLSSSNASVAWMPGADGRALLQSCTVQVTQAPGGWEVLAVV VPVPPFTCLLRDLVPATNYSLRVRCANALGPSPYADWVPFQTKGLAPASAPQNLHAIRTD SGLILEWEEVIPEAPLEGPLGPYKLSWVQDNGTQDELTVEGTRANLTGWDPQKDLIVRVC VSNAVGCGPWSQPLVVSSHDRAGQQGPPHSRTSWVPVVLGVLTALVTAAALALILLRKRR KETRFGQAFDSVMARGEPAVHFRAARSFNRERPERIEATLDSLGISDELKEKLEDVLIPE QQFTLGRMLGKGEFGSVREAQLKQEDGSFVKVAVKMLKADIIASSDIEEFLREAACMKEF DHPHVAKLVGVSLRSRAKGRLPIPMVILPFMKHGDLHAFLLASRIGENPFNLPLQTLIRF MVDIACGMEYLSSRNFIHRDLAARNCMYEFWRTRGLAEDMTVCVADFGLSRKIYSGDYYR QGCASKLPVKWLALESLADNLYTVQSDVWAFGVTMWEIMTRGQTPYAGIENAEIYNYLIG GNRLKQPPECMEDVYDLMYQCWSADPKQRPSFTCLRMELENILGQLSVLSASQDPLYINI ERAEEPTAGGSLELPGRDQPYSGAGDGSGMGAKGPSCGQSRRPAGRGEERGGRRRWRPVG AGPLQGRGQTRTEREPGSSGSTGSILPRVPENTPSSRNARASFVKAVFEFASSGEQVLEK VVILEQICLLQYVDDILISGEDIEKCGSFRSADSRTQRPPAARSLPIKGVGPSHLWMASM HPVHRGCGNTGRGKQEVNLWRKIDGNPNLWREHTCLDLIDYHTKVRPDLGETPFRTGRHL FIDSSSGVIEGKRHNGYSVIDGEILIEIESGKLPTIGLLKHNQEGTIYTDSKYACVVAHM FGKIWTERGLISSKGQDLVHKELITQVLNNLQLPEEIAIVRVPGHQKNLSFES >gi568815583r:41403999_41613709|GENSCAN_predicted_CDS_5|3402_bp atggggcggccggggctcccgccgctgccgctgccgccgccaccgcggctcgggctgctg ctggcggctctggcttctctgctgctcccggagtccgccgccgcaggtctgaagctcatg ggagccccggtgaagctgacagtgtctcaggggcagccggtgaagctcaactgcagtgtg gaggggatggaggagcctgacatccagtgggtgaaggatggggctgtggtccagaacttg gaccagttgtacatcccagtcagcgagcagcactggatcggcttcctcagcctgaagtca gtggagcgctctgacgccggccggtactggtgccaggtggaggatgggggtgaaaccgag atctcccagccagtgtggctcacggtagaaggtgtgccatttttcacagtggagccaaaa gatctggcagtgccacccaatgcccctttccaactgtcttgtgaggctgtgggtccccct gaacctgttaccattgtctggtggagaggaactacgaagatcgggggacccgctccctct ccatctgttttaaatgtaacagcactgcctgcagcccccttcaacatcaccgtgacaaag ctttccagcagcaacgctagtgtggcctggatgccaggtgctgatggccgagctctgcta cagtcctgtacagttcaggtgacacaggccccaggaggctgggaagtcctggctgttgtg gtccctgtgcccccctttacctgcctgctccgggacctggtgcctgccaccaactacagc ctcagggtgcgctgtgccaatgccttggggccctctccctatgctgactgggtgcccttt cagaccaagggtctagccccagccagcgctccccaaaacctccatgccatccgcacagat tcaggcctcatcttggagtgggaagaagtgatccccgaggcccctttggaaggccccctg ggaccctacaaactgtcctgggttcaagacaatggaacccaggatgagctgacagtggag gggaccagggccaatttgacaggctgggatccccaaaaggacctgatcgtacgtgtgtgc gtctccaatgcagttggctgtggaccctggagtcagccactggtggtctcttctcatgac cgtgcaggccagcagggccctcctcacagccgcacatcctgggtacctgtggtccttggt gtgctaacggccctggtgacggctgctgccctggccctcatcctgcttcgaaagagacgg aaagagacgcggtttgggcaagcctttgacagtgtcatggcccggggagagccagccgtt cacttccgggcagcccggtccttcaatcgagaaaggcccgagcgcatcgaggccacattg gacagcttgggcatcagcgatgaactaaaggaaaaactggaggatgtgctcatcccagag cagcagttcaccctgggccggatgttgggcaaaggagagtttggttcagtgcgggaggcc cagctgaagcaagaggatggctcctttgtgaaagtggctgtgaagatgctgaaagctgac atcattgcctcaagcgacattgaagagttcctcagggaagcagcttgcatgaaggagttt gaccatccacacgtggccaaacttgttggggtaagcctccggagcagggctaaaggccgt ctccccatccccatggtcatcttgcccttcatgaagcatggggacctgcatgccttcctg ctcgcctcccggattggggagaacccctttaacctacccctccagaccctgatccggttc atggtggacattgcctgcggcatggagtacctgagctctcggaacttcatccaccgagac ctggctgctcggaattgcatgtacgaattctggaggactcgagggctggcagaggacatg acagtgtgtgtggctgacttcggactctcccggaagatctacagtggggactactatcgt caaggctgtgcctccaaactgcctgtcaagtggctggccctggagagcctggccgacaac ctgtatactgtgcagagtgacgtgtgggcgttcggggtgaccatgtgggagatcatgaca cgtgggcagacgccatatgctggcatcgaaaacgctgagatttacaactacctcattggc gggaaccgcctgaaacagcctccggagtgtatggaggacgtgtatgatctcatgtaccag tgctggagtgctgaccccaagcagcgcccgagctttacttgtctgcgaatggaactggag aacatcttgggccagctgtctgtgctatctgccagccaggaccccttatacatcaacatc gagagagctgaggagcccactgcgggaggcagcctggagctacctggcagggatcagccc tacagtggggctggggatggcagtggcatgggggcaaagggccccagctgcggacagagc cggcgccctgcaggccgcggggaggagcgcggcgggcgcaggcggtggcggcccgtgggc gcggggcccctgcagggacggggacagacgcgcacggagcgggagcccggcagctccggg tctacgggatccatcctcccaagggtgcccgagaacactccatcgtcgcggaatgcccgc gcttctttcgttaaagctgtctttgagtttgcctcctctggtgaacaggtactagaaaaa gttgtcatactagaacaaatatgccttcttcagtacgtagacgacattcttatatctggt gaagatatagagaagtgcggtagttttaggagtgctgactcaagaacacagaggccgccg gcagcccgtagccttcctatcaaaggtgttggacccagtcacttgtggatggcctcaatg catccagtccatcggggctgcggcaatactggtcgaggaaagcaggaagttaacctttgg aggaaaattgacgggaatccaaatctatggagggaacacacatgtttagatttaattgat taccatacaaaggttcgaccagacctaggagaaacccccttccggactggacggcactta ttcatagacagttcctccggggtgattgagggaaaaagacacaatgggtattcagtgatt gatggagaaattctcatagaaatagaatctggaaaattgccaacaattggtctgctcaaa cataaccaggaaggaaccatctatacagattccaagtatgcctgtgtagtggcccatatg tttgggaaaatttggactgaaagaggtctcattagtagtaaaggtcaagaccttgttcac aaggagctgatcacccaagtattgaataatcttcagttgccagaagaaatagctattgtc cgtgttcccggacaccagaaaaacctttcttttgaaagttga >gi568815583r:41403999_41613709|GENSCAN_predicted_peptide_6|81_aa MVLTSLAGGQAAPRNEDAEKEEPNSSILGTLNPSGTGFGWLTWGVEDIIENHIDMLGPVN LDWSGLSLLSYWQQLRESRWP >gi568815583r:41403999_41613709|GENSCAN_predicted_CDS_6|246_bp atggtccttaccagtcttgctggaggtcaagctgccccaaggaatgaagatgctgaaaaa gaagaaccaaacagttccattttagggactctgaaccccagtggaacaggatttggttgg ctgacctggggagtggaagacatcatagaaaaccacattgatatgttggggccggttaac ctggactggtctggactttccctgctgtcctattggcagcaactcagagagtccagatgg ccctag