GENSCAN 1.0 Date run: 3-Nov-116 Time: 03:14:49 Sequence gi568815588r:96901115_97285674 : 384560 bp : 45.96% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 334 329 6 1.05 1.02 Term - 27011 26905 107 0 2 68 49 131 0.767 5.77 1.01 Init - 27152 27104 49 2 1 94 65 4 0.738 -0.19 1.00 Prom - 35719 35680 40 -3.26 2.00 Prom + 39069 39108 40 -2.76 2.01 Init + 47944 48181 238 0 1 77 86 154 0.753 12.27 2.02 Intr + 50989 51082 94 2 1 101 90 -19 0.594 -1.38 2.03 Intr + 53839 54720 882 1 0 57 36 213 0.004 3.44 2.04 Intr + 79679 84016 4338 2 0 108 -6 2388 0.038 218.83 2.05 Intr + 89648 89730 83 2 2 107 88 -14 0.057 -0.22 2.06 Term + 96754 96932 179 2 2 79 41 210 0.649 13.25 2.07 PlyA + 97146 97151 6 -3.44 3.41 PlyA - 99744 99739 6 1.05 3.40 Term - 100236 99998 239 1 2 87 43 248 0.995 16.53 3.39 Intr - 101255 101044 212 1 2 125 77 147 0.955 15.86 3.38 Intr - 101878 101590 289 2 1 110 63 511 0.999 47.10 3.37 Intr - 103108 102954 155 0 2 63 78 265 0.987 22.72 3.36 Intr - 103709 103579 131 2 2 -13 91 243 0.987 13.99 3.35 Intr - 105606 105369 238 2 1 113 96 224 0.999 23.32 3.34 Intr - 110016 109879 138 2 0 89 99 141 0.999 14.88 3.33 Intr - 112720 112627 94 2 1 138 109 81 0.999 14.22 3.32 Intr - 113044 112905 140 0 2 107 110 188 0.984 23.01 3.31 Intr - 117569 117472 98 2 2 100 94 92 0.973 9.81 3.30 Intr - 117993 117869 125 1 2 93 113 127 0.811 16.00 3.29 Intr - 118634 118515 120 0 0 52 54 69 0.428 0.37 3.28 Intr - 120299 120136 164 1 2 101 105 183 0.972 20.92 3.27 Intr - 129714 129643 72 2 0 101 76 83 0.981 6.92 3.26 Intr - 130563 130492 72 2 0 124 87 69 0.960 8.92 3.25 Intr - 133428 133357 72 2 0 110 65 79 0.931 6.32 3.24 Intr - 136652 136584 69 1 0 107 86 137 0.998 13.70 3.23 Intr - 139006 138874 133 2 1 109 74 154 0.961 15.80 3.22 Intr - 141079 140987 93 2 0 113 30 39 0.572 0.54 3.21 Intr - 141953 141787 167 1 2 84 88 192 0.835 18.40 3.20 Intr - 142399 142256 144 0 0 116 74 378 0.999 38.60 3.19 Intr - 145683 145540 144 2 0 93 76 242 0.983 22.90 3.18 Intr - 145951 145877 75 0 0 79 82 39 0.671 1.03 3.17 Intr - 146720 146576 145 0 1 106 46 273 0.999 24.24 3.16 Intr - 146882 146859 24 0 0 80 91 26 0.500 0.10 3.15 Intr - 148004 147841 164 1 2 69 82 276 0.952 24.72 3.14 Intr - 155350 155207 144 0 0 115 65 301 0.931 29.90 3.13 Intr - 156167 156096 72 1 0 129 58 41 0.928 3.72 3.12 Intr - 158417 158346 72 1 0 95 68 80 0.972 5.22 3.11 Intr - 159044 158973 72 1 0 140 76 125 0.999 15.02 3.10 Intr - 159673 159526 148 2 1 105 76 167 0.999 16.49 3.09 Intr - 162504 162341 164 2 2 81 94 128 0.901 12.32 3.08 Intr - 163125 163054 72 2 0 104 70 122 0.999 10.52 3.07 Intr - 163762 163691 72 0 0 101 87 107 0.995 10.42 3.06 Intr - 164972 164901 72 1 0 98 59 94 0.947 6.02 3.05 Intr - 166852 166623 230 1 2 67 82 72 0.185 1.17 3.04 Intr - 187796 187638 159 2 0 98 42 77 0.628 4.28 3.03 Intr - 189772 189633 140 2 2 78 46 63 0.351 1.28 3.02 Intr - 204018 203852 167 2 2 84 74 35 0.401 1.30 3.01 Init - 204876 204755 122 0 2 103 55 64 0.614 4.27 3.00 Prom - 210529 210490 40 -5.26 4.00 Prom + 212785 212824 40 -7.36 4.01 Init + 216084 216164 81 2 0 72 99 78 0.447 8.27 4.02 Term + 219560 219643 84 1 0 121 37 70 0.654 2.85 4.03 PlyA + 219766 219771 6 1.05 5.13 PlyA - 227851 227846 6 1.05 5.12 Term - 229902 229803 100 2 1 72 54 104 0.223 3.00 5.11 Intr - 234791 234751 41 2 2 130 98 -28 0.151 -0.58 5.10 Intr - 237288 237193 96 0 0 42 105 74 0.214 4.71 5.09 Intr - 239965 239860 106 0 1 39 68 156 0.325 8.92 5.08 Intr - 246898 246837 62 1 2 94 98 3 0.004 -0.57 5.07 Intr - 252372 252265 108 0 0 124 82 16 0.520 5.08 5.06 Intr - 253417 253246 172 0 1 129 -6 22 0.034 -3.15 5.05 Intr - 256775 256704 72 1 0 103 74 70 0.115 5.62 5.04 Intr - 262337 262266 72 1 0 114 68 153 0.184 14.42 5.03 Intr - 263776 263705 72 0 0 136 68 118 0.997 13.12 5.02 Intr - 264294 264178 117 2 0 78 69 85 0.848 5.18 5.01 Init - 264707 264703 5 2 2 97 64 0 0.267 -2.03 5.00 Prom - 267038 266999 40 -4.26 6.00 Prom + 274136 274175 40 -3.86 6.01 Sngl + 278886 279203 318 2 0 66 36 231 0.369 10.16 6.02 PlyA + 280569 280574 6 -0.45 7.03 PlyA - 283586 283581 6 1.05 7.02 Term - 284095 283948 148 0 1 92 52 58 0.620 -0.03 7.01 Init - 284560 284364 197 1 2 124 76 273 0.984 26.10 7.00 Prom - 285623 285584 40 -5.86 8.00 Prom + 285661 285700 40 -12.87 8.01 Init + 285898 286034 137 0 2 35 58 157 0.533 7.01 8.02 Intr + 298146 298331 186 0 0 30 115 74 0.110 3.10 8.03 Intr + 303711 303848 138 0 0 79 56 48 0.014 0.28 8.04 Intr + 310410 310545 136 0 1 79 74 53 0.123 3.57 8.05 Term + 312300 312416 117 2 0 113 36 42 0.504 0.04 8.06 PlyA + 315822 315827 6 1.05 9.11 PlyA - 318073 318068 6 1.05 9.10 Term - 319490 319285 206 0 2 80 38 123 0.618 4.03 9.09 Intr - 328111 328033 79 1 1 63 74 25 0.409 -2.28 9.08 Intr - 328760 328650 111 2 0 55 46 117 0.809 4.78 9.07 Intr - 334201 334103 99 1 0 68 100 45 0.760 4.01 9.06 Intr - 343045 342854 192 1 0 79 51 96 0.452 4.69 9.05 Intr - 355290 355204 87 0 0 79 100 8 0.594 1.27 9.04 Intr - 358514 358288 227 0 2 70 82 157 0.942 11.00 9.03 Intr - 362515 362306 210 2 0 82 105 141 0.999 14.08 9.02 Intr - 365011 364746 266 0 2 68 53 138 0.668 5.46 9.01 Init - 375424 375336 89 1 2 52 22 109 0.154 -1.19 9.00 Prom - 381204 381165 40 0.94 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 79679 84020 4342 2 1 108 38 2391 0.952 221.56 S.002 Term - 262337 262262 76 1 1 114 47 176 0.815 13.41 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588r:96901115_97285674|GENSCAN_predicted_peptide_1|51_aa MGFERTGSNFERSSTTEIATATPTFSNHQLDQLAAVNIKTRPFTSKKIPIY >gi568815588r:96901115_97285674|GENSCAN_predicted_CDS_1|156_bp atggggtttgagagaactggctccaattttgaaagaagttctactacggaaattgccaca gccaccccaactttcagcaaccaccaacttgatcagttggcagcagtcaacatcaagaca agacctttcaccagcaaaaagattccaatttactga >gi568815588r:96901115_97285674|GENSCAN_predicted_peptide_2|1937_aa MQRMIQQFAAEYTSKNSSTQDPSQPNSTKNQSLPKASPVTTSPTAATTQNPVLSKLLMAD QDSPLDLTVRKSQSEPSEQDGVLDLSTKKSPCAGSTSLSHSPGCSSTQGNGRPGRPSQYR PDGLRSGDGVPPRSLQDGTREGFGHSTSLKVPLARSLQISEELLSRNQLSTAASLGPSGL QNHGQHLILSREASWAKPHYEFNLSRMKFRGNGALSNISDLPFLAENSAFPKMALQAKQD GKKDVSHSSPVDLKIPQVRGMDLSWESRTGDQYSYSSLVMGSQTESALSKKLRAILPKQS RKSMLDAGPDSWGSDAEQSTSGQPYPTSDQEGDPGSKQPRKKRGRYRQYNSEILEEAISV VMSGKMSVSKAQSIYGIPHSTLEYKVKERLGTLKNPPKKKMKLMSENSTEAKAVDSNNQS KSPLEKFMVKLCTHHQKQFIRVLNDLYTESQPGTEDLQPSDSGAMDVSTCNAGCAQLSTK HKEKDALCLDMKSSASVDLFVDSSDSHSPLHLTEQTPKKPPPEINPVDGRENALTVVQKD SSELPTTKSNSINSSSVDSFTPGYLTASNCSSVNFHHIPKILEGQTTGQEQDTNVNICED GKDHMQSSALVESLITVKMAAENSEEGNTCIIPQRNLFKALSEEAWNSGFMGNSSRTADK ENTLQCPKTPLRQDLEANEQDARPKQENHLHSLGRNKVGYHLHPSDKGQFDHSKDGWLGP GPMPAVHKAANGHSRTKMISTSIKTARKSKRASGLRINDYDNQCDVVYISQPITECHFEN QKSILSSRKTARKSTRGYFFNGDCCELPTVRTLARNLHSQEKASCSALASEAVFTPKQTL TIPAPRHTVDVQLPREDNPEEPSKEITSHEEGGGDVSPRKEPQEPEVCPTKIKPNLSSSP RSEETTASSLVWPLPAHLPEEDLPEGGSTVSAPTASGMSSPEHNQPPVALLDTEEMSVPQ DCHLLPSTESFSGGVSEDVISRPHSPPEIVSREESPQCSENQSSPMGLEPPMSLGKAEDN QSISAEVESGDTQELNVDPLLKESSTFTDENPSETEESEAAGGIGKLEGEDGDVKCLSEK DTYDTSIDSLEENLDKKKKGKKFPEASDRCLRSQLSDSSSADRCLRNQSSDSSSACLEIK VPKNPSAKRSKKEGHPGGTTPKGLLPDSFHTETLEDTEKPSVNERPSEKDAEQEGEGGGI ITRQTLKNMLDKEVKELRGEIFPSRDPITTAGQPLPGERLEIYVQSKMDEKNAHIPSESI ACKRDPEQAKEEPGHIPTQHVEEAVNEVDNENTQQKDDESDAPCSSLGLSSSGSGDAARA PKSVPRPKRLTSSTYNLRHAHSLGSLDASKVTSEKEAAQVNPIMPKENGASESGDPLDED DVDTVVDEQPKFMEWCAEEENQELIANFNAQYMKVQKGWIQLEKEGQPTPRARNKSDKLK EIWKSKKRSRKCRSSLESQKCSPVQMLFMTNFKLSNVCKWFLETTETRSLVIVKKLNTRL PGDVPPVKHPLQKYAPSSLYPSSLQAERLKKHLKKFPGATPAKNNWKMQKLWAKFRENPD QVEPEDGSDVSPGPNSEDSIEEVKEDRNSHPPANLPTPASTRILRKYSNIRGKLRAQQRL IKNEKMECPDALAVESKPSRKSVCINPLMSPKLALQVDADGFPVKPKSTEGMKGRKGKQV SEILPKAEVQSKRKRTEGSSPPDSKNKGPTVKASKEKHADGATKTPAAKRPAARDRSSQP PKKTSLKENKVKIPKKSAGKSCPPSRKEKENTNKRPSQSIASETLTKPAKQKGAGESSSR PQKATNRKQSSGKTRARPSTKTPESSAAQRKRKLKAKLDCSHSKRRRLDAKASVLDAQCS GDVLCSILQRGSCVVMLPGGMDFRKSREFQGQTSYVCSPLEPSLDDGPYAISPEALGENA SSTRPALGGEKETNKAE >gi568815588r:96901115_97285674|GENSCAN_predicted_CDS_2|5814_bp atgcagcgaatgatccaacaatttgctgctgaatatacctcaaaaaatagctctactcag gaccccagccagcccaatagcacaaagaaccaaagcctgccgaaagcatctccagtcacc acctctcccacggctgcaactactcagaaccctgtgctcagcaaacttctcatggctgac caagactcacctctggaccttactgtcagaaagtctcagtcagaacctagcgaacaagac ggtgtacttgatctgtccactaagaaaagtccatgtgctggcagcacttccctgagccac tctccaggctgctccagtactcaagggaacgggcgacctgggagacccagccagtaccgc ccagacggacttcggagtggtgatggggtacctccaagaagcttacaggatggaaccagg gaaggttttggacactccacatcactcaaagttccactggctcgatccctgcagattagt gaagaactactgagcagaaaccaattgtccacagctgccagccttgggccatctggatta cagaatcatggacaacacttaatattatccagggaagcctcttgggcaaaaccacattac gagttcaacctcagccgtatgaagttcaggggaaatggtgcactcagcaacatcagtgac cttccttttcttgcagaaaactctgcctttccaaaaatggcacttcaagcaaaacaagat ggaaaaaaggatgtgagccattcatctcctgtagatttaaagataccacaagttcgagga atggatctttcttgggagtctcgcactggtgatcagtacagctatagctctttggtaatg ggttcacaaacggagagcgcgcttagtaaaaaattaagggctattcttccaaaacaaagt agaaaaagcatgttagatgctggacccgattcttggggctcagatgctgagcagtctacc tctggacagccatatcccacatcggatcaagaaggagaccctggctccaagcagcctcgg aagaaaagagggcgttacagacagtacaacagtgagatactggaggaagcaatctcagtg gttatgagtggaaaaatgagtgtttccaaagctcagagtatttatgggattccccacagt acactggagtacaaagtaaaggagaggctgggcactttgaaaaaccctccaaagaaaaag atgaaattaatgagtgagaactcaacagaggcaaaagcagtagattctaacaatcagtcg aagtccccactggagaaatttatggtcaaactgtgtactcatcatcaaaagcaattcatt cgtgttctgaacgacctgtacactgaatctcaaccaggcactgaggacctgcagccttct gattcgggagcaatggatgtatccacttgcaatgctggctgtgcccagctcagcaccaaa cataaggaaaaagatgctctgtgtctcgatatgaagtcttctgcttctgtagatttgttc gtagactcgtcagactctcacagccctctacacttgacggaacagaccccgaagaagcct cctcctgagataaaccctgtagatggaagagagaatgccttgactgttgtccagaaagat tcctctgaacttccaaccactaaatcgaattctattaatagcagttcagtggatagtttc actccgggatacctcactgcatctaattgttcctcagtgaacttccaccacatccctaaa atcttggaggggcagaccactggacaagagcaagacacaaatgtgaacatatgtgaggat ggtaaagaccatatgcagagttcagctttagtagaaagtctaattacagtaaaaatggca gctgagaatagtgaggaaggcaatacctgtattattcctcaaagaaatttgttcaaagct ttatcagaagaggcttggaactcagggtttatggggaactcatctagaactgctgacaaa gagaatactttacagtgtccaaaaacacctttgcgccaggatttagaggcaaatgaacaa gatgcaaggccaaagcaagagaaccatcttcactctctgggaagaaataaggtgggttac catttacatcccagtgataagggccagtttgatcattccaaagatggttggttaggcccc ggccctatgccagctgtacacaaagcggcaaatggacactcaagaaccaagatgatatca acctccatcaagacagctcggaaaagtaaaagggcatcagggctgaggataaatgattat gataaccagtgtgatgttgtttatatcagtcaaccaataacagaatgccactttgagaat caaaaatcaatattatcttctcggaaaacagccagaaagagtactcgaggatactttttc aatggtgactgttgtgagctgccaactgttcgtacactggccagaaatttacactcccag gaaaaagcaagctgctcagcattggcatcagaggcagttttcactcctaagcagaccctt acaattccagcccctagacatacagtagatgtgcagcttcccagagaagacaaccctgaa gaacctagcaaggaaatcacctctcacgaggaaggaggtggagacgtttcacctcgaaaa gaacctcaagagcctgaggtttgccccacaaagattaagccgaacctgagcagctcccct aggtcagaggaaacgacagcctccagcctggtgtggcctctccctgctcaccttcctgaa gaggacctgccagaaggtggctccacagtctcagctcccacagcaagtgggatgtcttct cctgaacacaaccaaccaccagttgcactgttggatacggaggagatgagtgtaccccag gactgtcacctccttccctccactgaaagcttttccgggggagtcagtgaagatgtcatt tctaggcctcattctcctcctgaaatagtcagtagagaagaaagtcctcagtgctcagaa aatcagagttccccaatgggcttggagccccccatgagtctgggaaaggctgaggacaac caaagcatcagtgctgaggttgagtctggagacacccaggagctaaatgtcgacccactc ttgaaggaaagcagcacttttactgatgaaaaccccagtgaaactgaggaaagtgaggca gcaggtggtataggaaaattagagggagaggacggtgatgtaaaatgcctgtcagaaaaa gacacgtatgatacaagcattgactcactcgaagagaatttggacaagaagaaaaaaggt aaaaaattccctgaggcctctgataggtgcctaagaagtcaactttcggattcttcctct gctgacagatgcctaagaaatcagagttcagattcttcctcagcttgtcttgaaatcaaa gttcctaaaaatcctagtgcaaaacgttcaaaaaaagaagggcaccctggtgggacaaca cctaagggccttctacctgacagtttccacacggaaactctggaggacacagaaaagcca agtgtcaatgaacgcccctctgagaaagatgctgagcaggagggcgaaggcggggggatc atcaccaggcagactttgaaaaacatgctggacaaagaagtcaaggagttacgaggagag attttccccagcagggaccccataaccacagctggacagccactgcctggagagagattg gaaatctatgttcagtctaaaatggatgagaagaatgctcatatcccctcagaaagtatt gcttgtaagagggacccagaacaggcaaaagaagagccagggcatattcccacacagcat gtggaggaggctgtgaatgaggtagacaacgaaaacacccagcagaaagatgatgagagt gatgccccatgcagctctcttgggttgtcgagtagtggaagtggtgatgctgctagggca ccaaaatcggtgccaaggcctaaaagattgacctcttcaacctacaacctaagacacgct cattctctgggctccttggatgcttcaaaagtgacttcagaaaaggaagctgcacaagta aaccccataatgccaaaggaaaatggagcttcagagagtggagaccccctagatgaggac gatgttgacaccgtggtagatgaacagccaaagtttatggaatggtgtgctgaggaggag aaccaagagctcatcgccaacttcaatgcccagtacatgaaagttcagaagggctggatc cagttggagaaagaaggacagccaacaccaagagcaaggaacaaatcagataaactgaaa gagatttggaaaagcaagaaaaggtcacggaaatgtaggagttcattggagagtcagaag tgttctcctgttcagatgctctttatgacaaactttaaattatctaatgtttgtaaatgg ttcttagagacaactgaaacccggtctctagtcattgtgaagaagctcaatactcgcctt ccaggagacgttccccctgtcaagcatcctcttcagaaatacgctccttccagcctatat cccagttcactacaggctgagcgcttgaaaaagcacttgaagaaatttcctggagctacc cctgctaagaataattggaaaatgcagaagctctgggccaaatttcgagagaatcctgat caagtggagccagaagatggcagtgatgtcagccccggccctaattctgaagacagcata gaggaagtcaaggaagatagaaacagtcatcctccagcaaacctgcccactccagccagt acccggattcttagaaaatattccaatattcgaggaaagctcagagcccagcaacgttta atcaagaatgagaaaatggaatgcccagatgctctggctgtggaaagtaagccaagtcgt aagagcgtatgcatcaaccctctgatgtcccccaagcttgccctgcaagtggatgcagat gggtttcctgttaagcccaagagtactgaaggaatgaagggaaggaaggggaagcaggtg tctgaaatcttgcctaaagcagaagttcagagtaaacgcaagagaacagaaggcagcagc cctccagatagtaagaacaaggggcctacggtgaaagccagcaaagaaaagcatgctgat ggagccaccaaaacccctgctgccaagaggccagctgcaagggacagaagcagccaaccc cccaaaaagacgtctttgaaagagaataaagtgaagatccctaaaaagtccgctgggaag agctgccctccctccaggaaagaaaaagagaatacaaacaaaaggccttcccagtctatt gcctcggaaacactgacgaaacctgcaaaacagaagggggccggtgaatcctcttcaagg cctcagaaagccacgaataggaagcagagtagtggaaagactcgggccagaccctcaacg aaaaccccagagagcagtgcagctcagagaaagcgaaagctgaaggcaaagctggactgt tcgcacagcaaacggaggcggctggatgcaaaagcctcagtgcttgatgctcagtgctca ggagatgtcctttgctccatccttcagaggggcagttgtgtagttatgctgccgggagga atggacttcaggaaaagcagggagttccagggacagacatcctacgtgtgtagccccttg gagccatccctggatgatggcccatatgcgatcagcccagaagccctaggagagaatgcc agcagcacccggcccgcccttggtggagaaaaagagaccaacaaggcagagtga >gi568815588r:96901115_97285674|GENSCAN_predicted_peptide_3|1740_aa MGTICGTFLPRGSVPYPDLLHAFPGIHLMVPLAEASGPDSGTLEEASSAPSASWLSIGQV SVTWRGPNRSRAFTGVIQLNLKVAYSAPGLSPADKLALAEGHFHGWSVGLGGSSQGLPGP STRWTTWGLQFHLFCESAKVQSLILPMTAHEDEGRVLGGDADTKSPKGVADTVLHSFMDV GPCDPVTHSPGSWLLKGNPDRAAAALTVQGAVCGGAGGCGLCSRTLWGEGLKGRAPSKPT GLAQEEEVGLWNCLALGPMSGRSAAGCSPSGPRDLSENAIQAIPRKAFRGATDLKNLQLD KNQISCIEEGAFRALRGLEVLTLNNNNITTIPVSSFNHMPKLRTFRLHSNHLFCDCHLAW LSQWLRQRPTIGLFTQCSGPASLRGLNVAEVQKSEFSCSGQGEAGRVPTCTLSSGSCPAM CTCSNGIVDCRGKGLTAIPANLPETMTEIRLELNGIKSIPPGAFSPYRKLRRIDLSNNQI AEIAPDAFQGLRSLNSLVLYGNKITDLPRGVFGGLYTLQLLLLNANKINCIRPDAFQDLQ NLSLLSLYDNKIQSLAKGTFTSLRAIQTLHLAQNPFICDCNLKWLADFLRTNPIETSGAR CASPRRLANKRIGQIKSKKFRCSAKEQYFIPGTEDYQLNSECNSDVVCPHKCRCEANVVE CSSLKLTKIPERIPQSTAELRLNNNEISILEATGMFKKLTHLKKINLSNNKVSEIEDGAF EGAASVSELHLTANQLESIRSGMFRGLDGLRTLMLRNNRISCIHNDSFTGLRNVRLLSLY DNQITTVSPGAFDTLQSLSTLNLLANPFNCNCQLAWLGGWLRKRKIVTGNPRCQNPDFLR QIPLQDVAFPDFRCEEAYSQWQLHPQLAWPGHVAVESFDFHPKKHHQGQEEGGCLPRPQC PQECACLDTVVRCSNKHLRALPKGIPKNVTELYLDGNQFTLVPGQLSTFKYLQLVDLSNN KISSLSNSSFTNMSQLTTLILSYNALQCIPPLAFQGLRSLRLLSLHGNDISTLQEGIFAD VTSLSHLAIGANPLYCDCHLRWLSSWVKTGYKEPGIARCAGPQDMEGKLLLTTPAKKFEC QGGLAVGSSGYFYNQCILEMLEGPAVAGRWGKMVLPGSGMEGPPTLAVQAKCDLCLSSPC QNQGTCHNDPLEVYRCACPSGYKGRDCEVSLDSCSSGPCENGGTCHAQEGEDAPFTCSCP TGFEGPTCGVNTDDCVDHACANGGVCVDGVGNYTCQCPLQYEGKACEQLVDLCSPDLNPC QHEAQCVGTPDGPRCECMPGYAGDNCSENQDDCRDHRCQNGAQCMDEVNSYSCLCAEGYS GQLCEIPPHLPAPKSPCEGTECQNGANCVDQGNRPVCQCLPGFGGPECEKLLSVNFVDRD TYLQFTDLQNWPRANITLQVSTAEDNGILLYNGDNDHIAVELYQGHVRVSYDPGSYPSSA IYSAETINDGQFHTVELVAFDQMVNLSIDGGSPMTMDNFGKHYTLNSEAPLYVGGMPVDV NSAAFRLWQILNGTGFHGCIRNLYINNELQDFTKTQMKPGVVPGCEPCRKLYCLHGICQP NATPGPMCHCEAGWVGLHCDQPADGPCHGHKCVHGQCVPLDALSYSCQCQDGYSGALCNQ AGALAEPCRGLQCLHGHCQASGTKGAHCVCDPGFSGELCEQESECRGDPVRDFHQVQRGY AICQTTRPLSWVECRGSCPGQGCCQGLRLKRRKFTFECSDGTSFAEEVEKPTKCGCALCA >gi568815588r:96901115_97285674|GENSCAN_predicted_CDS_3|5223_bp atgggtaccatctgcgggaccttcttgcctagaggttctgtcccatatcctgacctcctg catgcctttcctggcatccacctgatggtgcccttggcggaggcctccggaccagattcc gggacactggaagaagccagctctgccccctcagcatcttggctgagtatcggccaggtc tctgtcacatggagggggccaaataggagtcgtgcatttactggtgttattcagctgaat ttaaaagtggcctactcagcaccggggctaagccccgcagataagcttgcactagcagag ggccactttcatggatggagcgtgggcttaggaggcagcagccagggcctgccagggccc agcacccgctggacgacttggggcttgcaatttcacctcttctgtgaaagcgccaaggtg cagagcctgatcctccccatgacagctcatgaggatgaggggagggttttgggtggggat gctgacaccaagtcaccaaaaggtgtggcagacacagtgcttcattcctttatggacgtg ggcccctgcgatcctgtcacccacagccctggttcctggcttctgaagggaaacccagac agggcagcggcggccttgacagttcaaggggcggtgtgcggcggggcagggggctgtggg ctgtgttctcggaccctgtggggtgaggggctgaaaggaagggcaccgtcaaagcccacg ggcctggcccaggaggaggaggtggggctgtggaactgcctggcactggggcctatgtca ggacggtctgccgctggttgttcaccttcaggaccccgggacttgagtgagaacgccatc caggccatccccaggaaagcttttcggggagctacggaccttaaaaatttacagctggac aagaaccagatcagctgcattgaggaaggggccttccgtgctctgcgggggctggaggtg ctgaccctgaacaacaacaatatcaccaccatccccgtgtccagcttcaaccatatgccc aagctacggaccttccgcctgcactccaaccacctgttttgcgactgccacctggcctgg ctctcgcagtggctgaggcagcggccaaccatcgggctcttcacccagtgctcgggccca gccagcctgcgtggcctcaatgtggcagaggtccagaagagtgagttcagctgctcaggc cagggagaagcggggcgcgtgcccacctgcaccctgtcctccggctcctgcccggccatg tgcacctgcagcaatggcatcgtggactgtcgtggaaaaggcctcactgccatcccggcc aacctgcccgagaccatgacggagatacgcctggagctgaacggcatcaagtccatccct cctggagccttctcaccctacagaaagctacggaggatagacctgagcaacaatcagatc gctgagattgcacccgacgccttccagggcctccgctccctgaactcgctggtcctctat ggaaacaagatcacagacctcccccgtggtgtgtttggaggcctatacaccctacagctc ctgctcctgaatgccaacaagatcaactgcatccggcccgatgccttccaggacctgcag aacctctcactgctctccctgtatgacaacaagatccagagcctcgccaagggcactttc acctccctgcgggccatccagactctgcacctggcgcagaaccctttcatttgcgactgt aacctcaagtggctggcagacttcctgcgcaccaatcccatcgagacgagtggtgcccgc tgtgccagtccccggcgcctcgccaacaagcgcatcgggcagatcaagagcaagaagttc cggtgctcagccaaagagcagtacttcattccaggcacggaggattaccagctgaacagc gagtgcaacagcgacgtggtctgtccccacaagtgccgctgtgaggccaacgtggtggag tgctccagcctgaagctcaccaagatccctgagcgcatcccccagtccacggcagaactg cgattgaataacaatgagatttccatcctggaggccactgggatgtttaaaaaacttaca catctgaagaaaatcaatctgagcaacaacaaggtgtcagaaattgaagatggggccttc gagggcgcagcctctgtgagcgagctgcacctaactgccaaccagctggagtccatccgg agcggcatgttccggggtctggatggcttgaggaccctaatgctgcggaacaaccgcatc agctgcatccacaacgacagcttcacgggcctgcgcaacgtccggctcctctcgctctac gacaaccagatcaccaccgtatccccaggagccttcgacaccctccagtccctctccaca ctgaatctcctggccaaccctttcaactgcaactgccagctggcctggctaggaggctgg ctacggaagcgcaagatcgtgacggggaacccgcgatgccagaaccctgactttttgcgg cagattcccctgcaggacgtggccttccctgacttcaggtgtgaggaagcctactcccag tggcagctccaccctcagctggcctggccaggccacgtggcagtggagagctttgacttt caccccaagaaacaccatcagggccaggaggaggggggctgcctgccccgcccacagtgc ccacaggagtgcgcctgcctggacaccgtggtccgatgcagcaacaagcacctgcgggcc ctgcccaagggcattcccaagaatgtcacagaactctatttggacgggaaccagttcacg ctggttccgggacagctgtctaccttcaagtacctgcagctcgtggacctgagcaacaac aagatcagttccttaagcaattcctccttcaccaacatgagccagctgaccactctgatc ctcagctacaatgccctgcagtgcatcccgcctttggccttccagggactccgctccctg cgcctgctgtctctccacggcaatgacatctccaccctccaagagggcatctttgcagac gtgacctccctgtctcacctggccattggtgccaaccccctatactgtgactgccacctc cgctggctgtccagctgggtgaagactggctacaaggaaccgggcattgctcgttgtgct gggccccaggacatggagggcaagctgctcctcaccacgcctgccaagaagtttgaatgc caaggcggcctggctgtcggctcctctggttacttctacaaccagtgcatcctggagatg ctggaggggcctgctgtagcagggaggtgggggaaaatggtcctccctggctcaggcatg gagggtcctccaacgctggctgtccaggccaagtgtgatctctgcttgtccagtccgtgc cagaaccagggcacctgccacaacgacccccttgaggtgtacaggtgcgcctgccccagc ggctataagggtcgagactgtgaggtgtccctggacagctgttccagtggcccctgtgaa aatgggggcacctgccatgcacaggagggcgaggatgccccgttcacgtgctcctgtccc accggctttgaaggaccaacctgtggggtgaacacagatgactgtgtggatcatgcctgt gccaatgggggcgtctgtgtggatggtgtgggcaactacacctgccagtgccccctgcag tatgagggaaaggcctgtgagcagctggtggacttgtgctctccggatctgaacccatgt caacacgaggcccagtgtgtgggcaccccggatgggcccaggtgtgagtgcatgccaggt tatgcaggtgacaactgcagtgagaaccaggatgactgcagggaccaccgctgccagaat ggggcccagtgtatggatgaagtcaacagctactcctgcctctgtgctgagggctacagt ggacagctctgtgagatccctccccatctgcctgcccccaagagcccctgtgaggggact gagtgccagaatggggccaactgtgtggaccagggcaacaggcctgtgtgccagtgcctc ccaggcttcggtggccctgagtgtgagaagttgctcagtgtcaactttgtggatcgggac acttacctgcagttcactgacctgcaaaactggccacgggccaacatcacgttgcaggtc tccacggcagaggacaatgggatccttctgtacaacggggacaacgaccacattgcagtt gagctgtaccagggccatgtgcgtgtcagctacgacccaggcagctaccccagctctgcc atctacagtgctgagacgatcaacgatgggcaattccacaccgttgagctggttgccttt gaccagatggtgaatctctccattgatggcgggagccccatgaccatggacaactttggc aaacattacacgctcaacagcgaggcgccactctatgtgggagggatgcccgtggatgtc aactcagctgccttccgcctgtggcagatcctcaacggcaccggcttccacggttgcatc cgaaacctgtacatcaacaacgagctgcaggacttcaccaagacgcagatgaagccaggc gtggtgccaggctgcgaaccctgccgcaagctctactgcctgcatggcatctgccagccc aatgccaccccagggcccatgtgccactgcgaggctggctgggtgggcctgcactgtgac cagcccgctgacggcccctgccatggccacaagtgtgtccatgggcaatgcgtgcccctc gacgctctttcctacagctgccagtgccaggatgggtactcgggggcactgtgcaaccag gccggggccctggcagagccctgcagaggcctgcagtgcctgcatggccactgccaggcc tcaggcaccaagggggcacactgtgtgtgtgaccccggcttttcgggcgagctgtgtgag caagagtccgagtgccggggggaccctgtccgggactttcaccaggtccagaggggctat gccatctgccagaccacgcgccccctgtcatgggtggagtgccggggctcgtgcccaggc cagggctgctgccagggccttcggctgaagcggaggaagttcacctttgagtgcagcgat gggacctcttttgccgaggaggtggaaaagcccaccaagtgtggctgtgccctctgcgca tag >gi568815588r:96901115_97285674|GENSCAN_predicted_peptide_4|54_aa MSSDIKEPNPERSENKACEVGVQRAKQDVDDAKRPTPPEEFSTVLGTQQVLSKY >gi568815588r:96901115_97285674|GENSCAN_predicted_CDS_4|165_bp atgagttctgacatcaaggagcccaaccctgaaagatctgagaataaagcatgtgaagtg ggtgtccaaagagcaaagcaggatgttgatgatgctaagcggccgacacctccggaagag ttcagcacagtgcttggcacacagcaagtgctcagtaaatattag >gi568815588r:96901115_97285674|GENSCAN_predicted_peptide_5|340_aa MGIRHSQEPVPPDDNTGVPMGTFASAGYAVYLLLPCANRIRELNGNNITRIHKNDFAGLK QLRVLQLMENQIGAVERGAFDDMKELERLRLNRNQLHMLPELLFQNNQALSRLHQVLYSQ QETFLIQEHSRIWVKADFSLAFPLTFLNLICKQGMQTERTPAGLLEDSQAGSVTNSSHLE QIMDFHAQAVMLNQHENGFERDQMCWKMGENRETARRKRLLTSLLGRASDKGAVEDLRVG KKPTQAVFRMVNLLTDTRNWQLSASLADKRHLLIVEVNAGVSCTLRKAQVTRRPGTVCQL LGIFLWSSFMTKIENVHVAEVLPTQGTVDSLGVQITEHVP >gi568815588r:96901115_97285674|GENSCAN_predicted_CDS_5|1023_bp atgggaatacggcactcgcaggagccagtgccaccagatgataacactggtgtccccatg gggacctttgcaagcgcaggctatgccgtttacttgcttctaccctgtgctaaccgtata agggaactcaatggcaacaacatcactcggatccataagaatgactttgcggggctcaag cagctgcgggtgctgcagctgatggagaaccagattggagcagtggaacgtggtgctttt gatgacatgaaggagctggagcggctgcgactgaaccgaaaccagctgcacatgttaccg gaactgctgttccagaacaaccaggctttgtcaagactgcatcaggtcctttactctcag caggaaacatttctcatccaggagcatagcaggatttgggtgaaggcagacttttcttta gctttccccttgacatttcttaacctcatctgtaagcaaggcatgcaaaccgagaggact cctgcagggctcttggaggactcccaggcaggatctgttactaattcttcccatctggag caaattatggactttcatgcacaggctgttatgctaaatcagcatgaaaatgggtttgag agagaccaaatgtgctggaaaatgggagaaaatagagaaacagcaagaagaaagagactg ctcacgagtcttctgggaagagcctccgacaagggagccgtggaggacttgcgagtaggg aagaagccgacccaagcggtgtttcggatggttaatctgctcacagacacacgcaactgg cagctgagtgcatccctcgctgataagcgccatctgttgatagtggaggtgaatgcaggt gtcagctgtacattgcggaaggcacaggtgacccggaggcccggcacagtgtgtcagctc ctgggcatttttctatggagcagtttcatgaccaagattgagaatgtccatgtggccgag gtcctgcccactcagggaaccgtggactctttaggggtccagatcacagagcatgttccc tga >gi568815588r:96901115_97285674|GENSCAN_predicted_peptide_6|105_aa MQEPPSPKRARPPLLAASGSLGHRQPAPPKKGHRGRSAPPLVLQGSQRLPDGEEEGHPGI QRDAQISFTSLQFSVTLITIIILPFIEHSLSARPCAKRSCEHYRI >gi568815588r:96901115_97285674|GENSCAN_predicted_CDS_6|318_bp atgcaggagccgcccagccccaaaagagcacggccgcccctgctggcagcctcgggcagc ctgggccaccgacagcccgccccgcccaagaaggggcaccgaggaagatcagcccctccc cttgtgctccagggttcccagcggctgcctgatggcgaagaggagggccaccctgggatc cagcgtgacgcccaaatttccttcacatccctgcaattctctgtcaccttaataacaata ataatactaccatttattgagcactcgctgtctgccaggccctgtgcaaagcgctcctgc gagcattaccgcatttaa >gi568815588r:96901115_97285674|GENSCAN_predicted_peptide_7|114_aa MALTPGWGSSAGPVRPELWLLLWAAAWRLGASACPALCTCTGTTVDCHGTGLQAIPKNIP RNTERLPSSLRQRFIPPGCLRTGVRKPSLHPGGTLPTLAKHNVSFWAHGATAAF >gi568815588r:96901115_97285674|GENSCAN_predicted_CDS_7|345_bp atggcgctgactcccgggtgggggtcctcggcggggccggtccggccggagctctggctg ctgctgtgggcagccgcgtggcgcctgggtgcctcggcgtgccccgccctctgcacctgc accggaaccacggtggactgccacggcacggggctgcaggccattcccaagaatatacct cggaacaccgagcgcctgccctcctcactgcgccagcgctttattcctccagggtgcctg cgaactggtgtcagaaagccttccttgcacccaggagggacacttcccacactcgccaag cacaatgtctccttttgggcacatggggctacggccgcgttctga >gi568815588r:96901115_97285674|GENSCAN_predicted_peptide_8|237_aa MCGRGCERDIQVGREFSSTPEKTKDLLNAKDSCGSLGQCPAVGALTHTLRKPDSHYGLVA ASPKSGFSSQEEVASEEHIWMRESDSGRGTPQEGSPLTEGWQINAAHSHFPQGEELSDLA ALQPITSFLGDRQQVWHMRWDSSGTAGPREAQCWAQTAHPLPPLHQWFCVMLDQQYQHQL GTNEKCKFLGLRPASETPGPRLLTALPSWTIQHLPSINPPPKTLTSLTTPCETSPAK >gi568815588r:96901115_97285674|GENSCAN_predicted_CDS_8|714_bp atgtgtggacgtgggtgtgaacgggacatccaggtggggagagaattctcctcaacccca gaaaagacgaaagatctcctcaacgccaaggacagctgtggctcgctggggcagtgtcca gcagtcggggccttgacccacactctgaggaagcctgactctcactatggactggtggca gcttccccaaagtctgggttcagttcccaggaggaagtggcttctgaagagcacatttgg atgcgggagagtgacagcgggaggggcaccccacaggaaggaagccccttgacagagggc tggcagattaacgcagcacacagccacttcccacagggagaggagctgagtgacttggct gccctgcagccaattacaagcttcctaggggacagacagcaagtgtggcacatgcgttgg gacagcagcggtacagcaggtcccagggaagcgcaatgctgggcacaaaccgcacaccca ctgccacctctgcatcagtggttctgtgtcatgctggaccaacagtatcagcatcagctg ggaaccaatgagaaatgcaagttcctgggcctcagacctgcatcagagactccagggccc cgactcttgacagccctaccttcctggaccatccagcacctcccttccatcaacccccca cccaagactctgacatccctgacgactccctgtgaaacatcaccagctaaataa >gi568815588r:96901115_97285674|GENSCAN_predicted_peptide_9|521_aa MGRLVGRGADPPTSLPDGAAGRAEGLLTSHDAICSFVICNDSSLRGQPIIFNPDFFVEKL RHEKPEIFTELVVSNITRLIDLPGTELAQLMGEVDLKLPGGAGPASGFFRSLMSLKRKDL RVEGLFRVPGNSVRQQILRDALNNGTDIDLESGEFHSNDVATLLKMFLGELPEPLLTHKH FNAHLKIADLMQFDDKGNKTNIPDKDRQIEALQLLFLILPPPNRNLLKLLLDLLYQTAKK QDKNKMSAYNLALMFAPHVLWPKNVTANDLQENITKLNSGMAFMIKHSQKLFKDDLDLIA SCHTKSFQLAKSQKRNRVDSCPHQEETQHHTEEALRELFQHVHDMPESAKKKQLIRQFNK QSLTQTPGREPSTSQVQKRARSRSFSGLIKRKVLGNQMMSEKKKKNPTPESVAIGELKGT SKENRNLLFSGSPAVTMTPTRLKWSEGKKEGKKAPKGQLVLLTVKKGTRSFKMKHGETLK VTCGKLGDAKKREWRGQEGHFEKQEEDQNAEYGTEHIIINH >gi568815588r:96901115_97285674|GENSCAN_predicted_CDS_9|1566_bp atggggcggctggtcgggcggggggccgacccccccacctccctcccggacggggcggct ggccgggcagaggggctcctcacttcccatgatgccatctgcagttttgtgatctgcaat gattcttcccttcgaggtcagcccattatctttaatcctgacttttttgtggagaaactc cgacatgagaaacctgagattttcactgagttggtggtcagcaatatcacaaggctcatc gatttacctggaactgagttggctcagctgatgggggaagtggaccttaagttgcctggc ggggctggcccagcatcaggattcttccggtctctcatgtctctcaagcgaaaggacttg cgagtagagggtttgtttagagtaccgggtaatagtgtccgacagcagattttaagggat gctctcaataatggaactgacattgacttggaatcaggggaatttcactcaaatgatgtt gccactttgctgaagatgtttctaggagagttgccggagcctctgctgacacataaacac ttcaatgcacacctcaaaatcgctgatttgatgcagtttgatgataaaggaaacaagacc aatataccagacaaggaccggcaaattgaggctctccagttgctcttcctcattctccct cctcctaatcgtaatttgctgaagttattgcttgatctcctataccagacagcaaagaaa caagacaagaacaagatgtcagcctataaccttgcccttatgtttgcaccccatgtcctg tggccaaaaaatgtcactgcaaatgaccttcaggagaatatcacaaagttaaacagtggg atggcttttatgattaaacactcccagaaactttttaaggatgaccttgacctcatagct tcatgtcatactaagtcctttcagctggcaaagtctcagaaacggaaccgggtagattcc tgccctcaccaggaggagacccagcaccatacggaagaggcactgagagagctgtttcaa cacgttcatgatatgccagagtcagcaaagaagaaacaacttattagacagtttaataag caatcattgacccagacaccagggcgagaaccttctacttcccaggtacaaaagagggct cgttcgcgctccttcagtgggcttattaagcggaaggtcctgggaaatcagatgatgtca gaaaagaaaaagaagaaccctactccagaatctgtggccattggtgaattgaagggaacc agcaaagaaaataggaacttattattttctggctctccagctgtcacgatgacaccaaca agattgaagtggtctgaagggaagaaagaggggaaaaaagcccccaagggccagcttgta ttactcactgtgaagaaaggaaccaggtccttcaaaatgaagcatggagagactcttaaa gttacatgtgggaaactgggtgatgccaagaaaagggagtggcgggggcaagagggacat tttgaaaaacaggaagaggaccaaaatgctgagtacggtactgagcacataatcattaat cactga