GENSCAN 1.0 Date run: 5-Nov-116 Time: 05:08:44 Sequence gi568815586r:119997248_120216650 : 219403 bp : 47.75% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1274 1489 216 1 0 59 96 275 0.989 24.10 1.02 Intr + 59770 59940 171 0 0 81 89 125 0.346 12.04 1.03 Intr + 64463 64579 117 1 0 91 83 155 0.991 15.96 1.04 Intr + 65057 65152 96 1 0 -9 86 138 0.943 4.11 1.05 Intr + 67486 67632 147 0 0 107 84 206 0.996 22.53 1.06 Intr + 69968 70069 102 1 0 88 60 42 0.679 1.77 1.07 Intr + 70767 70823 57 2 0 82 100 20 0.751 1.68 1.08 Intr + 74375 74554 180 1 0 73 53 187 0.984 13.76 1.09 Intr + 75264 75482 219 2 0 23 107 325 0.991 26.40 1.10 Intr + 77196 77339 144 2 0 73 99 114 0.998 11.48 1.11 Intr + 80470 80626 157 0 1 95 55 51 0.453 2.08 1.12 Intr + 82997 83058 62 0 2 -1 87 86 0.509 -1.95 1.13 Intr + 83640 83770 131 2 2 72 87 195 0.680 17.29 1.14 Intr + 92704 92824 121 1 1 95 97 162 0.679 18.30 1.15 Term + 95753 95914 162 1 0 112 38 165 0.989 11.84 1.16 PlyA + 97227 97232 6 1.05 2.07 PlyA - 97873 97868 6 1.05 2.06 Term - 100126 99998 129 1 0 129 39 215 0.997 18.88 2.05 Intr - 101688 101564 125 1 2 48 78 234 0.999 18.70 2.04 Intr - 101907 101783 125 2 2 81 66 229 0.999 20.23 2.03 Intr - 106702 106579 124 2 1 58 94 280 0.993 25.24 2.02 Intr - 111220 111170 51 2 0 145 109 4 0.981 7.08 2.01 Init - 119403 119352 52 0 1 112 77 161 0.215 18.72 2.00 Prom - 129465 129426 40 -7.56 3.58 PlyA - 130045 130040 6 1.05 3.57 Term - 130727 130602 126 2 0 111 54 259 0.992 23.08 3.56 Intr - 132247 132029 219 1 0 84 75 312 0.965 28.00 3.55 Intr - 133506 133399 108 0 0 103 113 45 0.998 8.98 3.54 Intr - 134086 133938 149 2 2 66 86 154 0.961 12.95 3.53 Intr - 134775 134679 97 0 1 130 106 37 0.999 9.28 3.52 Intr - 137158 137044 115 0 1 69 80 187 0.998 16.45 3.51 Intr - 137479 137286 194 1 2 102 96 298 0.994 30.29 3.50 Intr - 139485 139255 231 0 0 80 113 174 0.990 17.07 3.49 Intr - 140072 139959 114 2 0 76 121 144 0.987 17.14 3.48 Intr - 140567 140298 270 2 0 91 100 396 0.999 38.84 3.47 Intr - 140797 140654 144 1 0 63 95 110 0.998 9.68 3.46 Intr - 141168 141076 93 0 0 86 94 129 0.999 13.46 3.45 Intr - 141609 141448 162 0 0 49 105 140 0.998 11.97 3.44 Intr - 143776 143612 165 1 0 80 78 98 0.995 8.16 3.43 Intr - 145475 145260 216 2 0 97 113 172 0.999 19.30 3.42 Intr - 145694 145544 151 1 1 108 54 115 0.862 10.26 3.41 Intr - 147201 147059 143 0 2 89 94 232 0.999 23.05 3.40 Intr - 147588 147392 197 1 2 98 56 257 0.789 22.63 3.39 Intr - 147814 147676 139 1 1 94 86 141 0.981 14.54 3.38 Intr - 148083 148015 69 0 0 115 105 27 0.989 6.48 3.37 Intr - 150025 149805 221 2 2 31 69 329 0.470 23.32 3.36 Intr - 151099 150920 180 2 0 85 57 208 0.820 17.24 3.35 Intr - 152473 152359 115 1 1 39 99 98 0.996 6.02 3.34 Intr - 152796 152675 122 1 2 119 107 101 0.990 15.31 3.33 Intr - 154267 153898 370 1 1 84 84 456 0.640 39.48 3.32 Intr - 156160 155966 195 1 0 58 82 355 0.984 31.51 3.31 Intr - 156662 156497 166 1 1 136 116 214 0.846 28.96 3.30 Intr - 157093 156941 153 0 0 87 72 28 0.544 0.29 3.29 Intr - 157841 157723 119 2 2 54 75 71 0.889 1.66 3.28 Intr - 158183 157961 223 1 1 80 57 206 0.769 14.83 3.27 Intr - 158472 158345 128 0 2 88 78 218 0.999 20.28 3.26 Intr - 159357 159214 144 0 0 95 65 192 0.997 18.08 3.25 Intr - 159745 159665 81 1 0 107 64 39 0.891 3.23 3.24 Intr - 160783 160602 182 2 2 125 36 117 0.806 9.79 3.23 Intr - 161368 161213 156 2 0 88 110 210 0.998 23.18 3.22 Intr - 162776 162578 199 2 1 101 40 171 0.991 12.52 3.21 Intr - 163008 162895 114 0 0 49 105 194 0.999 17.84 3.20 Intr - 164336 164243 94 1 1 72 105 186 0.898 18.67 3.19 Intr - 164811 164633 179 0 2 98 78 254 0.997 24.12 3.18 Intr - 166012 165806 207 1 0 89 33 217 0.841 15.57 3.17 Intr - 166315 166205 111 1 0 74 49 57 0.637 0.98 3.16 Intr - 167248 167089 160 0 1 123 100 184 0.995 23.09 3.15 Intr - 167474 167399 76 0 1 71 51 -25 0.427 -9.33 3.14 Intr - 171053 170961 93 0 0 74 115 108 0.997 12.04 3.13 Intr - 173074 172922 153 2 0 122 64 88 0.996 9.84 3.12 Intr - 176579 176406 174 0 0 91 78 41 0.890 3.31 3.11 Intr - 176922 176824 99 1 0 42 86 89 0.570 4.18 3.10 Intr - 177965 177915 51 0 0 110 80 60 0.986 6.28 3.09 Intr - 178627 178499 129 2 0 74 83 145 0.999 13.27 3.08 Intr - 178970 178896 75 0 0 96 105 131 0.999 15.09 3.07 Intr - 180308 180200 109 2 1 96 56 60 0.997 3.46 3.06 Intr - 180505 180437 69 1 0 78 85 106 0.994 8.68 3.05 Intr - 181512 181393 120 0 0 103 49 161 0.995 14.39 3.04 Intr - 181703 181584 120 2 0 113 56 111 0.723 11.09 3.03 Intr - 186430 186322 109 0 1 89 82 95 0.997 9.29 3.02 Intr - 186996 186865 132 2 0 116 110 74 0.967 12.16 3.01 Init - 188954 188947 8 2 2 114 91 0 0.935 3.40 3.00 Prom - 194472 194433 40 -1.46 4.10 PlyA - 196376 196371 6 1.05 4.09 Term - 197165 197023 143 0 2 36 48 95 0.249 -1.61 4.08 Intr - 197582 197433 150 0 0 39 94 197 0.634 15.53 4.07 Intr - 199687 199540 148 1 1 141 -52 209 0.914 12.01 4.06 Intr - 200215 200075 141 1 0 84 93 181 0.839 18.75 4.05 Intr - 201492 201307 186 0 0 36 61 312 0.975 23.19 4.04 Intr - 201753 201607 147 0 0 83 105 101 0.999 11.73 4.03 Intr - 201958 201871 88 0 1 90 116 127 0.999 15.67 4.02 Intr - 202238 202063 176 2 2 87 26 302 0.982 22.64 4.01 Init - 203536 203483 54 1 0 89 97 24 0.976 4.70 4.00 Prom - 203898 203859 40 -4.36 5.06 PlyA - 206341 206336 6 1.05 5.05 Term - 215333 215067 267 2 0 103 43 564 0.957 48.89 5.04 Intr - 216743 216595 149 0 2 69 90 300 0.999 28.25 5.03 Intr - 216970 216889 82 1 1 105 102 75 0.998 9.81 5.02 Intr - 217751 217578 174 2 0 112 76 334 0.999 34.74 5.01 Intr - 218026 217856 171 1 0 111 64 188 0.994 18.84 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 47740 47330 411 1 0 77 48 172 0.808 7.34 S.002 Init - 114709 114665 45 1 0 77 81 76 0.838 4.49 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:119997248_120216650|GENSCAN_predicted_peptide_1|693_aa HLEQEKHELRRRFENREGEWEGRVSELESDVKQLQDELERQQIHLREADREKSRAVQELS EQNQRLLDQLSRVGMVTAMDALEEESFRLSSSTSDAEFDAVVVYLEDIIMDDRFPIITEK LHGQVLLGLASEVERQLSMQVHALREDFREKNSSTNQHIIRLESLQAEERPECEGQDEGK KKVSAGEQYCENLVVEGEERIKMLSDRKRELEHRLSATLEENDLLQGTVEELQDRVLILE RQGHDKDLQLGILQMSAGPSPSDWVDSISRAQALLTVGQPRKQGYSRSLVPLANHFEVLT VKLHQSQLELQEVRLSCRQLQVKVEELTEERSLQSSAATSTSLLSEIEQSMEAEELEQER EQLRLQLWEAYCQVRYLCSHLRGNDSADSAVSTDSSMDESSETSSAKDVPAGSLRTALNE LKRLIQSIVDGMEPTGSRRLDDDSLEEQIRQTSEDSRALRELMEGERGKLRQSLEELQRL HSQLLPYFKTCILSHPGYGHHFPIYLPSPSPPTLTRFGLIFLKRDLKVGRPLQVLAGAQG PEKVVVPFAEAWATEKVTLLSVEMTALKEERDRLRVTSEDKEPKEQLQKAIRDRDEAIAK KNAVELELAKCRMDMMSLNSQLLDAIQQKLNLSQQLEAWQDDMHRVIDRQLMDTHLKERS QPAAALCRGHSAGRGDEPSIAEGKRLFSFFRKI >gi568815586r:119997248_120216650|GENSCAN_predicted_CDS_1|2082_bp cacttagagcaagagaaacatgaattgagaagacgatttgagaaccgagaaggggagtgg gaaggccgagtgtcagagctggagagtgatgtgaagcagctacaggatgagttggagagg cagcagattcatctgcgggaagcagatcgagaaaaatcacgggctgtccaggaactgtcg gaacagaaccaaaggctattggatcagctcagcagggtgggaatggtaacagcaatggat gccttagaggaagagagcttcaggctgtcctcctccacctctgatgcagaatttgatgct gtggttgtatatttagaggacattatcatggatgacaggttcccaattattacagagaaa ttacatggacaagtactactaggacttgcatcagaagttgagagacaactctccatgcag gtccacgccctcagagaagactttcgggagaaaaactcatcaaccaaccagcacattatc cggctggagagccttcaggccgaggagaggccagaatgtgagggccaggatgaggggaaa aagaaagtatcggcaggggaacaatactgtgagaacctggttgtggaaggtgaagagagg atcaagatgctgtcagatcggaaacgggagctggagcatcgtctcagcgctactttagag gaaaatgacctgctccaagggaccgtggaggagctacaggaccgggtgctaatcctggag aggcagggccatgacaaggacctacagctgggcatcttacagatgtctgcagggcctagt cccagcgactgggtggattccatcagcagagcacaggccctgctcactgtggggcaaccc aggaagcagggctattccagatcactggtgccattagccaatcatttcgaagtactgacg gtgaagctgcaccaaagccagctggagcttcaggaggtgcgtctctcctgccgacagctg caggtgaaggtggaagaactcactgaggagaggagtctgcagagctctgccgccaccagc acatccctcctgtcagagatcgagcagagcatggaggctgaggagctggagcaggagcga gagcagctgagactgcagctctgggaagcctactgccaggttcgctatctgtgctcacac cttcgaggcaatgacagtgctgactcagccgtctccacggactcctccatggacgagtct tcagaaacctcgtccgccaaggatgtgccagccggcagcttgcgcactgccctcaatgag ctcaagagactgatacagagcattgtggatggcatggagcccacgggctcccggagactt gatgatgactccttagaagaacagataaggcagaccagtgaggactcgagagccctaagg gagctcatggagggagagaggggtaaactgaggcaaagcctagaagagctgcagcgactc cacagtcagctcctgccctatttcaagacctgcatcctttcacaccctggctatggtcac cacttcccaatttatctgccttcccccagcccgcccaccctcacacgctttggattgatc ttcctgaagcgtgacttgaaagtgggacgccccttacaagttctggctggggcacaagga cctgagaaggtggtggtgccctttgctgaagcatgggccacagaaaaggtgacactgctg agtgtggagatgactgccctaaaagaggagagagaccgactcagagtcacttctgaggac aaggagccaaaggagcagcttcagaaggccatcagggaccgcgacgaggccattgcaaag aagaatgctgtggagctggaacttgccaagtgcaggatggatatgatgtctctgaacagc cagttgctggatgccattcagcagaaactgaacctctcgcagcagctggaagcttggcag gatgacatgcacagggtcattgaccggcagctgatggacacgcacctgaaagaacggagc cagccggctgctgccctctgcaggggccacagcgctgggcggggggatgagcccagcatc gctgaaggcaaacgactcttctcattcttcaggaaaatttaa >gi568815586r:119997248_120216650|GENSCAN_predicted_peptide_2|201_aa MARDYDHLFKLLIIGDSGVGKSSLLLRFADNTFSGSYITTIGVDFKIRTVEINGEKVKLQ IWDTAGQERFRTITSTYYRGTHGVIVVYDVTSAESFVNVKRWLHEINQNCDDVCRILVGN KNDDPERKVVETEDAYKFAGQMGIQLFETSAKENVNVEEMFNCITELVLRAKKDNLAKQQ QQQQNDVVKLTKNSKRKKRCC >gi568815586r:119997248_120216650|GENSCAN_predicted_CDS_2|606_bp atggcccgggactacgaccacctcttcaagctgctcatcatcggcgacagcggtgtgggc aagagcagtttactgttgcgttttgcagacaacactttctcaggcagctacatcaccacg atcggagtggatttcaagatccggaccgtggagatcaacggggagaaggtgaagctgcag atctgggacacagcggggcaggagcgcttccgcaccatcacctccacgtattatcggggg acccacggggtcattgtggtttacgacgtcaccagtgccgagtcctttgtcaacgtcaag cggtggcttcacgaaatcaaccagaactgtgatgatgtgtgccgaatattagtgggtaat aagaatgacgaccctgagcggaaggtggtggagacggaagatgcctacaaattcgccggg cagatgggcatccagttgttcgagaccagcgccaaggagaatgtcaacgtggaagagatg ttcaactgcatcacggagctggtcctccgagcaaagaaagacaacctggcaaaacagcag cagcaacaacagaacgatgtggtgaagctcacgaagaacagtaaacgaaagaaacgctgc tgctaa >gi568815586r:119997248_120216650|GENSCAN_predicted_peptide_3|2745_aa MPRDAASRRALQAAIQQLAEAQPEATAKNLLHSLQSSGIGSKAGVPSKSSGSAALLALTW TCLLVRIVFPSRAKRQGDIWNKLVEVQCLLLLEVLGGSHKHAVDGAVKKLTKLWKEVTGL QGLNPGLVEQYLSAILSLEPNQNYAGMLGLLVQFCTSHKEMDVSALLDFYMKNILMSKVK PPKYLLDSCAPLLRYLSHSEFKDLILPTIQKSLLRSPENVIETISSLLASVTLDLSQYAM DIVKGLAGHLKSNSPRLMDEAVLALRNLARQCSDSSAMESLTKHLFAILGGSEGKLTVVA QKMSVLSGIGSVSHHVVSGPSSQVLNGIVAELFIPFLQQEVHEGTLVHAVSVLALWCNRF TMEVPKKLTEWFKKAFSLKTSTSAVRHAYLQCMLASYRGDTLLQALDLLPLLIQTVEKAA SQSTQVPTITEGVAAALLLLKLSVADSQAEAKLSSFWQLIVDEKKQVFTSEKFLVMASED ALCTVLHLTERLFLDHPHRLTGNKVQQYHRALVAVLLSRTWHVRRQAQQTVRKLLSSLGG FKLAHGLLEELKTVLSSHKPVEECGPVTDLPPQAGSPGPVPGAFFTPAGFLDAKPQVLPL EALVTDAGEVTEAGKAYVPPRVLQEALCVISGVPGLKGDVTDTEQLAQEMLIISHHPSLG PWLRWSSMNAMGSLSVLSPDRVLPQLISTITASVQNPALRLVTREEFAIMQTPAGELYDK SIIQSAQQDSIKKANMKRENKAYSFKEQIIELELKEEIKKKKGIKEEVQLTSKQKEMLQA QLDREAQVRRRLQELDGELEAALGLLDIILAKNPSGLTQYIPVLVDSFLPLLKSPLAAPR IKNPFLSLAACVMPSRLKALGTLVSHVTLRLLKPECVLDKSWCQEELSVAVKRAVMLLHT HTITSRVGKGEPGAAPLSAPAFSLVFPFLKMVLTEMPHHSEEEEEWMAQILQILTVQAQL RASPNTPPGRVDENGPELLPRVAMLRLLTWVIGTGSPRLQVLASDTLTTLCASSSGDDGC AFAEQEEVDVLLCALQSPCASVRETVLRGLMELHMVLPAPDTDEKNGLNLLRRLWVVKFD KEEEIRKLAERLWSMMGLDLQPDLCSLLIDDVIYHEAAVRQAGAEALSQAVARYQRQAAE VMGRLMEIYQEKLYVSDLCSPDAVWVIDLPVAKHLPTCLLQRPPPVLDALGRVISESPPD QWEARPSARDRTQILCSMTGKSNNTLPLENVFTSTGLYIWFFYLCVLLQPALQVEGCGLA LALNKLSQYLDSSQVKPLFQFFVPDALNDRHPDVRKCMLDAALATLNTHGKENVNSLLPV FEEFLKNAPNDASYDAVRQSVVVLMGSLAKHLDKSDPKVKPIVAKLIAALSTPSQQAVAE MGLGTSSGGCCVEWSDPPPTMKRLQRSHSIDRWALPQVQESVASCLPPLVPAIKEDAGGM IQRLMQQLLESDKYAERKGAAYGLAGLVKGLGILSLKQQEMMAALTDAIQDKKNFRRREG ALFAFEMLCTMLGKLFEPYVVHVLPHLLLCFGDGNQYVREAADDCAKAVMSNLSAHGVKL VLPSLLAALEEESWRTKAGSVELLGAMAYCAPKQLSSCLPNIVPKLTEVLTDSHVKVQKA GQQALRQIGSVIRNPEILAIAPVLLDALTDPSRKTQKCLQTLLDTKFVHFIDAPSLALIM PIVQRAFQDRSTDTRKMAAQIIGNMYSLTDQKDLAPYLPSVTPGLKASLLDPVPEVRTVS AKALGAMVKGMGESCFEDLLPWLMETLTYEQSSVDRSGAAQGLAEVMAGLGVEKLEKLMP EIVATASKVDIAPHVRDGYIMMFNYLPITFGDKFTPYVGPIIPCILKALADENEFVRDTA LRAGQRVISMYAETAIALLLPQLEQGLFDDLWRIRFSSVQLLGDLLFHISGVTGKMTTET ASEDDNFGTAQSNKVPAVAFLPLLMAIITALGVERRNRVLAGLYMGRSDTQLVVRQASLH VWKIVVSNTPRTLREILPTLFGLLLGFLASTCADKRTIAARTLGDLVRKLGEKILPEIIP ILEEGLRSQKSDERQGVCIGLSEIMKSTSRDAVLYFSESLVPTARKALCDPLEEVREAAA KTFEQLHSTIGHQALEDILPFLLKQLDDEEVSEFALDGLKQVMAIKSRVVLPYLVPKLTT PPVNTRVLAFLSSVAGDALTRHLGVILPAVMLALKEKLGTPDEQLEMANCQAVILSVEDD TGHRIIIEDLLEATRSPEVGMRQAAAIILNIYCSRSKADYTSHLRSLVSGLIRLFNDSSP VVLEESWDALNAITKKLDAGNQLALIEELHKEIRLIGNESKGEHVPGFCLPKKGVTSILP VLREGVLTGSPEQKEEAAKALGLVIRLTSADALRPSVVSITGPLIRILGDRFSWNVKAAL LETLSLLLAKVGIALKPFLPQLQTTFTKALQDSNRGVRLKAADALGKLISIHIKVDPLFT ELLNGIRAMEDPGVRDTMLQALRFVIQGAGAKVDAVIRKNIVSLLLSMLGHDEDNTRISS AGCLGELCAFLTEEELSAVLQQCLLADVSGIDWMVRHGRSLALSVAVNVAPGRLCAGRYS SDVQEMILSSATADRIPIAVSGVRGMGFLMRHHIETGGGQLPAKLSSLFVKCLQNPSSDI RLVAEKMIWWANKDPLPPLDPQAIKPILKALLDNTKDKNTVVRAYSDQAIVNLLKMRQGE EVFQSLSKILDVASLEVLNEVNRRSLKKLASQADSTEQVDDTILT >gi568815586r:119997248_120216650|GENSCAN_predicted_CDS_3|8238_bp atgcccagagatgcagcctcccgcagggccttgcaggcagccatccagcagttggctgag gcccagccagaagccactgctaagaaccttctacactctctgcagtcttctggtataggc tccaaagcaggtgttcccagtaagagcagtggctctgccgccttgctggccttgacctgg acctgcctcctggtgcgcattgtctttccatcgagagccaagcgacaaggagacatctgg aacaaactggtggaagtgcagtgcctgctcttgctggaggtgctgggtggctcccacaag cacgccgtggatggtgctgtgaagaaactcacgaagctgtggaaagaggtgacaggactg caggggctgaaccccgggctggtggaacagtacttgtcagccattctcagcctagagccc aaccagaactatgctggcatgctggggctgctggtgcagttctgcacgagtcacaaggag atggacgtgagcgccctactggacttttacatgaagaacatcctgatgagcaaagtcaag cctccgaagtacctgttggatagctgtgcccctctgctccgatacctgtcccactcagaa tttaaggatctgatactgcccaccatacagaagtccttactgaggagtccagagaatgtt attgaaactatttctagtctgctggcatcagtgacgcttgacctcagccagtatgccatg gacatcgtgaaaggactggctggtcacctgaaatccaacagtccccgcctgatggatgaa gctgtgctggcactgcggaacctggcacgccagtgcagtgactcttcggccatggaatcc ctgaccaagcacctatttgctatcctcggaggctcggaaggaaaactaactgttgtagcc cagaagatgagcgtcctctcagggattgggagcgtcagtcatcacgtggtgtctggacct tccagtcaggtcctgaatgggatcgtggctgagctgttcatcccgttccttcagcaggaa gttcatgaagggaccttggtacacgctgtctcagtcctggctctctggtgtaaccgattc actatggaagtgcccaagaagctcactgaatggttcaaaaaagctttcagccttaaaacc tccacatctgcggtgaggcatgcctacctgcagtgcatgttggcctcttaccggggtgac acgctgttgcaggccctggacttactgcccttgctcatccagacagtggagaaggcagcc tcccaaagcactcaggttcccaccatcactgaaggggttgccgcagccttgttgctctta aagttgtcagtggctgactcacaggctgaggccaaactgagcagtttctggcagttgatt gtggatgagaaaaagcaggttttcacttctgagaaattcctggtcatggcttcagaggat gccctgtgtactgtgttgcatctgacagagagacttttccttgaccacccgcatagactc actggcaacaaagttcagcagtaccaccgggctctggtggcggtgctcctgagccgcacc tggcacgtccgcaggcaggctcagcagacagttcggaagctgctgtcctctcttgggggc tttaagctggcgcacggactcttggaggagctgaagactgtcctcagttctcacaagcct gttgaggaatgtgggccagtgacagacttgccaccacaggctgggtctcctggaccagtt cctggtgccttctttacccctgcaggctttttagatgcaaagccacaggtgctgccctta gaggctttggtgactgatgctggagaggtgactgaggcaggcaaggcctacgtgcctcca cgggtcctgcaggaggctctgtgtgtcatctccggtgtgccagggctcaagggtgatgtc accgacactgaacaactggcccaggaaatgctgatcatctcccaccacccatccttaggt ccgtggctgcggtggtcctccatgaatgccatgggctccctttccgtcctgtcgccggac cgggtcctcccacagctcatcagcaccatcactgcctccgtgcagaaccctgcactgcgc ctggtgacgcgggaggagtttgccattatgcagacccctgctggggagctgtatgacaaa tccatcattcagagtgcccagcaggacagcataaaaaaggccaacatgaagcgagagaac aaagcttattccttcaaagagcagatcatcgagctggagctgaaggaggagataaagaag aagaaaggcatcaaagaggaggtgcagctgaccagcaagcagaaggagatgctgcaggcc cagctagacagggaggcgcaggtccggaggcggctgcaggagctggatggggagctggag gcggcgcttggactgctggacatcatcctggccaagaacccgtccggcctgacccagtac atccctgttttggtcgactcttttctgcccttgctgaagtctcccctggctgctcccagg atcaagaaccccttcttgtccttggctgcctgtgtcatgccctctaggctcaaggctttg ggcactttggtgagccacgtgaccctgcgcctgctgaagccagagtgtgtcctggataag tcctggtgccaggaagagctgtcggtggctgtgaagagggcggtgatgctgctgcacacc cacaccatcaccagcagggtgggcaagggggagccaggtgctgcgcccttgtccgcgcca gccttctccttagtcttcccgtttctgaagatggtgctgacggagatgccccaccacagt gaggaggaggaggagtggatggcccagattcttcagatcctcactgtccaagcccagctg agggcctcccccaacaccccacccgggcgggtggacgagaatggcccggagttgctgcct cgcgtggccatgctgcgtcttctgacttgggtgatcgggacgggctcgcctcgcttacag gttctggcttcagacaccctgaccaccctgtgtgccagcagcagtggtgatgatggctgt gcctttgcagagcaggaggaggtggacgtgctgctctgtgccttgcagtccccgtgtgcc agcgtgcgggaaaccgtgctccgggggctgatggaactccacatggtattgccagcacct gatactgatgagaagaatggcctgaaccttctgcggagactctgggtggtcaagtttgac aaggaggaggagatccggaagctggctgagaggctctggtcaatgatgggcctagacctg cagccagacctctgctccttgctgattgacgacgtgatctatcatgaggcggctgtaagg caggcaggggccgaagccctctcccaagcagtggcacgttaccagcggcaggcggcggag gttatgggcaggctcatggagatttaccaggaaaagctctacgtgagtgacctgtgcagc cctgatgcagtctgggtcattgatctgcccgttgcaaagcatttaccaacttgtctgttg cagcggccgcccccagtgctggatgctttgggacgagttatttcagaatctcctccagat cagtgggaagccagaccatcagccagggataggacacagattctttgctccatgactgga aagagtaacaacactttgcccttggagaatgtgttcacttccacaggcctctacatctgg ttcttttacctctgtgttctcttacaacctgctttgcaggtagaggggtgtggcttggcg ttggccctcaacaagctctcccagtatttggacagctctcaggtgaagccactctttcag ttttttgtccctgatgccctcaatgaccgacacccagatgtccggaagtgcatgttggat gcagccctcgcaacgctcaacactcatgggaaggagaacgtcaactcgctgttgccagta ttcgaggagttcctgaagaacgcgcccaatgatgccagctatgatgctgtgcgacagagt gtggtggtcctgatgggctctctggccaagcacctggacaagagtgaccccaaagtgaag cccattgttgccaagctcatcgctgccctctccaccccctcccagcaggcagtggctgag atggggcttgggacatctagtggtgggtgctgtgtagaatggtcagaccccccaccaacc atgaaacggctgcagcggagccacagcattgacaggtgggcccttccccaggtccaggag tccgtagccagctgcttgccaccccttgtgccagccatcaaggaggatgctggagggatg atccagaggcttatgcagcagctgctggagtcagacaagtacgcagagcgcaaaggggcc gcctatggcctggcgggcctggtgaagggcctgggcatcctctcgctgaagcaacaggag atgatggcggcactgactgatgccatccaagataagaagaacttccgccggcgagaggga gccctctttgccttcgagatgctctgcaccatgctggggaaactttttgagccgtatgtg gttcacgtgctgccccatctgctcctgtgctttggggatggaaaccagtatgtgcgtgag gctgcagatgactgtgccaaggctgtgatgagcaacttgagtgctcacggggtgaagctg gtgctcccctccttactggctgccctggaggaggaatcgtggcggaccaaagctgggtca gtggagcttcttggggcaatggcgtactgtgctcctaagcagctgtcatcctgtctaccc aacattgtgcccaagcttacggaggtgctgaccgactcccatgtcaaagtccagaaggct ggacagcaggcgctcaggcagatcggctccgttatcaggaacccggagatcctggccatt gctccagtcctcctggatgccctgacggatccctccaggaagacccagaagtgcttgcag accctgctggacaccaagtttgtccacttcattgatgccccatccctggccctcatcatg cccattgtccagagagccttccaggaccgttccacggacacgcggaagatggcagcccag attattggcaacatgtactccctgacagaccagaaggacttggctccgtacctgcccagc gtgacgcctggcctgaaagcatcgcttttggaccctgtgcctgaggtgcggaccgtatct gcaaaggcccttggggccatggtgaagggcatgggggagtcgtgctttgaggacttgctg ccgtggctgatggagacactgacctatgagcagagctctgtggatcgctcaggcgctgca caggggttggctgaggtcatggccggtttgggggtggagaagttggagaagttgatgcca gaaatcgtggctacagccagcaaagtggacattgcaccccatgtccgagatggctacatt atgatgtttaactacctgcccatcacctttggagacaagtttactccttatgtggggccc atcatcccctgtatcctcaaagctcttgctgatgagaatgagtttgtgcgtgacaccgcc ctgcgcgcgggccagcgggttatctccatgtacgctgagacagccatcgccctgctgctg ccccagctagagcaaggcctctttgatgacctttggagaatcaggttcagctctgttcag ctccttggggatctcctgtttcacatctcaggagtcactgggaagatgaccacagaaact gcctctgaggatgataactttggaactgcccagtccaacaaggtgcctgcagtggctttc ctgcccctgctgatggcgatcatcactgccctgggggtagagcggcggaaccgggtgttg gcagggctgtacatgggccgctcagacacccagctggtggtgcggcaggcgtccctgcat gtctggaagattgttgtctccaataccccccgcaccttgcgtgagatcctacccactctc tttgggctcctgctgggtttcctggccagcacgtgtgcagataagagaacgattgcagcg agaacattgggagatcttgtgcggaagttaggggagaaaatcctccccgagatcatcccc atccttgaggaaggcctgaggtctcagaagagcgatgagaggcagggtgtgtgcattggc ctaagtgagatcatgaagtccaccagccgggatgccgtgctgtatttctctgaatccctc gtgcccacggcaaggaaggctttgtgtgacccactggaggaggtcagagaggcggcagcc aagactttcgagcagctgcattccaccatcggccaccaggctctggaggacattctccca tttttactaaagcagctggatgacgaggaggtgtcagagtttgccttggatggtctgaag caagtcatggctattaagagtcgtgtggtgctgccctaccttgtgcccaagctgacaacg ccacctgtcaacacccgggtgctggctttcctttcgtcagtggctggtgatgccctcacc cgtcatcttggcgtgatcctcccagcggtcatgctggccctgaaggaaaagcttgggacc ccagatgagcagctggagatggccaattgtcaggctgtgatcctctccgtagaggatgac acagggcaccggatcatcatcgaggatctgctggaggccacccgcagccctgaggtgggc atgaggcaagctgctgccatcatcctcaacatctactgttcccgctcaaaggctgactac accagccacctgcggagcctggtctcgggcctgatccgcctcttcaatgactccagccct gtggttctggaggagagctgggatgccctaaatgccatcactaagaagctggatgctggc aaccagttggcactcattgaagagctgcacaaggaaatccggctcatagggaacgagagc aaaggcgagcatgtgccaggattctgcctcccgaagaagggagtgacctccatccttcca gtgttgcgggaaggagtcctgactggcagccctgagcagaaggaggaggcagccaaagcc ttaggcttggtaatccgcctgacctcggctgacgccctgaggccctccgtggtcagcatc actggccctctgatccgcatcctgggggacaggttcagctggaatgtgaaggcggctctg ctcgagacactcagcctcttgttggctaaggttgggattgccctgaagcccttcctgccc cagctgcagaccactttcaccaaagccctgcaggactccaaccggggggtgcgcctgaag gccgcagatgctctggggaagctcatttccatccacattaaggtggaccccctcttcaca gagctgctcaatggcatccgcgccatggaggacccaggtgtcagggacaccatgctgcag gccctgaggtttgtgattcagggagcaggggccaaagtggatgccgtcatccggaaaaac atcgtctcactcctgctgagcatgctgggacacgatgaggacaacactcgcatctcctca gccgggtgcctaggggaactgtgtgcctttttgactgaagaggagcttagtgccgttcta cagcagtgcttgctggcggacgtgtccggcattgactggatggttcggcacgggcggagc ctggcactttccgtggctgtgaatgtggctcctggcagactttgtgccggcagatatagc agtgatgttcaggaaatgatcctgagcagtgccacggcggacaggatccccattgcggtg agcggggtccggggcatgggctttctcatgagacaccacatcgagacaggcggagggcag ttgccggccaaactttccagcctgttcgttaagtgtctgcagaacccatccagcgacatc aggctggtggctgagaagatgatctggtgggcaaataaggacccactgcctcccctggac ccccaggccatcaagcccatcctgaaggctcttcttgacaacaccaaggataagaacacc gtggtcagggcctacagcgaccaggcaattgtcaacctcctcaagatgcggcagggtgaa gaggtgtttcagtccctctccaagatcctggatgtggccagtttggaggtgctgaacgag gttaaccgaaggtccctgaagaagctggccagccaggccgactccacggagcaggtggac gacaccatcctgacatga >gi568815586r:119997248_120216650|GENSCAN_predicted_peptide_4|410_aa MPREDRATWKSNYFLKIIQLLDDYPKCFIVGADNVGSKQMQQIRMSLRGKAVVLMGKNTM MRKAIRGHLENNPALEKLLPHIRGNVGFVFTKEDLTEIRDMLLANKVPAAARAGAIAPCE VTVPAQNTGLGPEKTSFFQALGITTKISRGTIEILSDVQLIKTGDKVGASEATLLNMLNI SPFSFGLVIQQVFDNGSIYNPEVLDITEETLHSRFLEGVRNVASVCLQIGYPTVASVPHS IINGYKRVLALSVETDYTFPLAEKVKAFLADPSAFVAAAPVAAATTAAPAAAAAPAKVEA KEESEESDEDMGFAAAVLPGGTKPGALPPLAPGLALPRGPRCDPEAFRKRFRSQPRQDGG GHAELQKFTLPSLYFFSPKPSGCEPRANAKSSLNFVFSKYSLSTYYEQGL >gi568815586r:119997248_120216650|GENSCAN_predicted_CDS_4|1233_bp atgcccagggaagacagggcgacctggaagtccaactacttccttaagatcatccaacta ttggatgattatccgaaatgtttcattgtgggagcagacaatgtgggctccaagcagatg cagcagatccgcatgtcccttcgcgggaaggctgtggtgctgatgggcaagaacaccatg atgcgcaaggccatccgagggcacctggaaaacaacccagctctggagaaactgctgcct catatccgggggaatgtgggctttgtgttcaccaaggaggacctcactgagatcagggac atgttgctggccaataaggtgccagctgctgcccgtgctggtgccattgccccatgtgaa gtcactgtgccagcccagaacactggtctcgggcccgagaagacctcctttttccaggct ttaggtatcaccactaaaatctccaggggcaccattgaaatcctgagtgatgtgcagctg atcaagactggagacaaagtgggagccagcgaagccacgctgctgaacatgctcaacatc tcccccttctcctttgggctggtcatccagcaggtgttcgacaatggcagcatctacaac cctgaagtgcttgatatcacagaggaaactctgcattctcgcttcctggagggtgtccgc aatgttgccagtgtctgtctgcagattggctacccaactgttgcatcagtaccccattct atcatcaacgggtacaaacgagtcctggccttgtctgtggagacggattacaccttccca cttgctgaaaaggtcaaggccttcttggctgatccatctgcctttgtggctgctgcccct gtggctgctgccaccacagctgctcctgctgctgctgcagccccagctaaggttgaagcc aaggaagagtcggaggagtcggacgaggatatgggatttgcggccgccgtcctgcccggg ggcaccaagcccggggccctgcccccgctggctcccggcctcgccctgccgcgtgggcca cgctgtgacccggaagcgttccggaagcggttccggagtcagccccggcaggatggcggc ggacacgcagagttgcagaagttcactttaccgagcctctacttcttttctccaaagccc tccggctgtgagccccgggccaacgccaagtcttccttaaactttgtattcagcaaatat tctttgagcacctactatgagcagggactttga >gi568815586r:119997248_120216650|GENSCAN_predicted_peptide_5|280_aa FMAQGKTGSSSPPGGPPKPGSQLDSMLGSLQSDLNKLGVATVAKGVCGACKKPIAGQVVT AMGKTWHPEHFVCTHCQEEIGSRNFFERDGQPYCEKDYHNLFSPRCYYCNGPILDKVVTA LDRTWHPEHFFCAQCGAFFGPEGFHEKDGKAYCRKDYFDMFAPKCGGCARAILENYISAL NTLWHPECFVCRECFTPFVNGSFFEHDGQPYCEVHYHERRGSLCSGCQKPITGRCITAMA KKFHPEHFVCAFCLKQLNKGTFKEQNDKPYCQNCFLKLFC >gi568815586r:119997248_120216650|GENSCAN_predicted_CDS_5|843_bp ttcatggcccaggggaagacagggagcagctcaccccctggggggcccccgaagcccggg agccagctggacagcatgctggggagcctgcagtctgacctgaacaagctgggggtcgcc acagtcgccaaaggagtctgcggggcctgcaagaagcccatcgccgggcaggttgtgacc gccatggggaagacgtggcaccccgagcacttcgtctgcacccactgccaggaggagatc ggatcccggaacttcttcgagcgggatggacagccctactgtgaaaaggactaccacaac ctcttctccccgcgctgctactactgcaacggccccatcctggataaagtggtgacagcc cttgaccggacgtggcaccctgaacacttcttctgtgcacagtgtggagccttctttggt cccgaagggttccacgagaaggacggcaaggcctactgtcgcaaggactacttcgacatg ttcgcacccaagtgtggcggctgcgcccgggccatcctggagaactatatctcagccctc aacacgctgtggcatcctgagtgctttgtgtgccgggaatgcttcacgccattcgtgaac ggcagcttcttcgagcacgacgggcagccctactgtgaggtgcactaccacgagcggcgc ggctcgctgtgttctggctgccagaagcccatcaccggccgctgcatcaccgccatggcc aagaagttccaccccgagcacttcgtctgtgccttctgcctcaagcagctcaacaagggc accttcaaggagcagaacgacaagccttactgtcagaactgcttcctcaagctcttctgc tag