GENSCAN 1.0 Date run: 4-Nov-116 Time: 10:43:37

Sequence gi568815579r:54007053_54214701 : 207649 bp : 51.66% C+G : Isochore 3 (51 - 57 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +   4899   5137  239  1  2  132   43   181 0.984  14.66
 1.02 PlyA +   9171   9176    6                               1.05

 2.03 PlyA -  13822  13817    6                               1.05
 2.02 Term -  16885  16772  114  1  0   93   41    27 0.251  -2.63
 2.01 Init -  22480  22340  141  1  0   80   70    85 0.927   6.00
 2.00 Prom -  22846  22807   40                               1.69

 3.00 Prom +  23339  23378   40                              -0.61
 3.01 Sngl +  24331  24552  222  0  0   39   42   187 0.943   4.74
 3.02 PlyA +  25018  25023    6                               1.05

 4.11 PlyA -  27923  27918    6                               1.05
 4.10 Term -  34028  33909  120  2  0   83   43   116 0.997   5.38
 4.09 Intr -  34764  34727   38  1  2   95  109    -4 0.855   0.87
 4.08 Intr -  34901  34864   38  1  2  111  107    19 0.821   4.49
 4.07 Intr -  35144  35117   28  0  1   80   61    41 0.345  -1.74
 4.06 Intr -  35317  35225   93  2  0  126   53   100 0.520  10.73
 4.05 Intr -  44396  44358   39  0  0  107  110    23 0.932   5.08
 4.04 Intr -  48747  48454  294  1  0  130   94    96 0.838  11.93
 4.03 Intr -  51538  51254  285  2  0   53   94   282 0.821  23.16
 4.02 Intr -  51680  51645   36  0  0   74  101    82 0.790   6.82
 4.01 Init -  56725  56692   34  1  1   72  109    54 0.718   3.95
 4.00 Prom -  58305  58266   40                              -3.91

 5.05 PlyA -  62272  62267    6                               1.05
 5.04 Term -  63108  62951  158  1  2   97   41   236 0.729  18.21
 5.03 Intr -  67164  66868  297  1  0  104  110   139 0.990  15.09
 5.02 Intr -  68062  67772  291  2  0   86   92   127 0.962  10.55
 5.01 Init -  74288  74255   34  2  1   92   89    52 0.485   3.69
 5.00 Prom -  83910  83871   40                              -1.81

 6.05 PlyA -  86360  86355    6                               1.05
 6.04 Term -  88305  88169  137  1  2  114   52   141 0.897  11.59
 6.03 Intr -  89101  88820  282  2  0   68  100   562 0.999  53.33
 6.02 Intr -  90103  89810  294  2  0   26   94   194 0.847  11.33
 6.01 Init -  93740  93704   37  2  1  115   65   137 0.997  12.25
 6.00 Prom -  93955  93916   40                             -11.03

 7.00 Prom +  94111  94150   40                              -6.30
 7.01 Init +  95278  95296   19  0  1   80   37    20 0.569  -3.77
 7.02 Intr +  95729  95836  108  0  0   70   91    78 0.557   7.06
 7.03 Intr +  96062  96136   75  0  0   75   82   146 0.971  12.68
 7.04 Intr +  98882  98959   78  0  0  131  109   153 0.971  21.72
 7.05 Term +  99759  99850   92  1  2   91   53    75 0.998   2.48
 7.06 PlyA +  99890  99895    6                               1.05

 8.08 PlyA -  99988  99983    6                               1.05
 8.07 Term - 100117  99998  120  1  0  126   48   105 0.999   9.08
 8.06 Intr - 101192 100974  219  2  0   87   94   181 0.996  17.63
 8.05 Intr - 101390 101274  117  2  0   73  105   126 0.937  13.97
 8.04 Intr - 102262 102172   91  0  1   56   94    16 0.621  -0.50
 8.03 Intr - 103069 102999   71  1  2   82   76    66 0.688   3.27
 8.02 Intr - 104022 103796  227  1  2   70   61    87 0.474   2.43
 8.01 Init - 107644 107386  259  1  1   89   95   312 0.438  29.29
 8.00 Prom - 110619 110580   40                              -9.36

 9.00 Prom + 111113 111152   40                              -6.90
 9.01 Init + 111227 111403  177  1  0   51   95   246 0.674  20.83
 9.02 Intr + 111521 111581   61  1  1  104   44    96 0.997   5.60
 9.03 Intr + 114808 114891   84  2  0   92   61   210 0.993  18.99
 9.04 Intr + 115445 115557  113  0  2   85   81   174 0.642  17.00
 9.05 Intr + 116402 116508  107  1  2  102   63   246 0.670  23.01
 9.06 Intr + 116697 116866  170  0  2   57   92   499 0.999  47.31
 9.07 Intr + 117447 117604  158  1  2  143  105   314 0.998  38.84
 9.08 Intr + 119476 119565   90  0  0   90  100   177 0.946  19.79
 9.09 Intr + 121021 121148  128  0  2   87   94   270 0.999  27.48
 9.10 Intr + 121253 121325   73  2  1   84   55   185 0.934  14.70
 9.11 Intr + 121490 121753  264  1  0  101   63   158 0.975  12.85
 9.12 Intr + 122005 122133  129  0  0   90   80   183 0.972  19.10
 9.13 Intr + 122220 122318   99  2  0  104   82   177 0.999  19.51
 9.14 Term + 124255 124380  126  0  0   91   42   182 0.966  12.49
 9.15 PlyA + 124640 124645    6                              -1.75

10.00 Prom + 126288 126327   40                               1.69
10.01 Init + 131052 131248  197  2  2  103   70   200 0.753  16.08
10.02 Intr + 132495 132577   83  0  2   89   75    -5 0.160  -1.92
10.03 Intr + 136067 136134   68  0  2   91   87   149 0.983  14.22
10.04 Intr + 136390 136464   75  0  0  107   61   105 0.999   9.91
10.05 Intr + 136608 136697   90  2  0  112   70   159 0.881  17.19
10.06 Intr + 136954 137082  129  0  0   63  113   133 0.983  14.70
10.07 Intr + 137185 137280   96  0  0  114   76   234 0.999  25.51
10.08 Intr + 138546 138819  274  2  1   37   30   698 0.512  56.55
10.09 Intr + 138858 138991  134  1  2  116   89   115 0.998  15.27
10.10 Intr + 139549 139605   57  0  0   84   94    61 0.872   5.87
10.11 Intr + 141096 141483  388  2  1  111   81   246 0.999  21.13
10.12 Intr + 141568 141691  124  2  1  105   80    76 0.999   8.65
10.13 Intr + 142508 142706  199  2  1  133   86   113 0.933  15.48
10.14 Intr + 145174 145273  100  0  1   92   78   156 0.998  15.18
10.15 Intr + 145376 145574  199  0  1   91   85   194 0.995  18.33
10.16 Intr + 145815 146025  211  0  1   99   60   246 0.976  22.34
10.17 Intr + 146663 146788  126  1  0  149   94   206 0.999  28.68
10.18 Term + 148257 148355   99  2  0  117   47   323 0.603  29.53
10.19 PlyA + 148611 148616    6                               1.05

11.25 PlyA - 148625 148620    6                              -3.44
11.24 Term - 148888 148669  220  0  1  143   41   260 0.999  23.64
11.23 Intr - 149973 149711  263  0  2   88   46   256 0.989  18.22
11.22 Intr - 151409 151230  180  2  0   80   80   178 0.999  16.78
11.21 Intr - 152871 152512  360  0  0    3   99   351 0.740  23.68
11.20 Intr - 153493 153415   79  0  1   88  101   100 0.995  11.35
11.19 Intr - 153981 153826  156  2  0   68   94    77 0.963   5.94
11.18 Intr - 154208 154078  131  2  2   87   95    43 0.980   4.90
11.17 Intr - 155233 155050  184  0  1  103  116   201 0.999  24.71
11.16 Intr - 155718 155621   98  0  2  105   50   100 0.999   7.21
11.15 Intr - 156107 155981  127  1  1  127   97    82 0.975  14.19
11.14 Intr - 156835 156672  164  1  2   55   20   155 0.950   4.69
11.13 Intr - 157549 157382  168  1  0   71   80   229 0.928  21.06
11.12 Intr - 158514 158367  148  2  1   88   86   249 0.787  25.45
11.11 Intr - 161240 161119  122  2  2  116   69   119 0.999  12.60
11.10 Intr - 161628 161396  233  1  2  110   92   283 0.999  28.92
11.09 Intr - 162608 162460  149  1  2  123   33   155 0.999  13.89
11.08 Intr - 165031 164818  214  2  1  104   94   188 0.999  19.30
11.07 Intr - 167379 167069  311  2  2   67   47   454 0.280  35.70
11.06 Intr - 171889 171713  177  0  0   77  101   363 0.836  36.05
11.05 Intr - 174081 173721  361  1  1  131   71   659 0.999  63.34
11.04 Intr - 176628 176469  160  0  1   76  109   194 0.759  20.37
11.03 Intr - 180235 180109  127  0  1   57   77   165 0.996  13.49
11.02 Intr - 181294 181165  130  2  1  119   85    40 0.999   7.06
11.01 Init - 181456 181381   76  1  1   73  117    74 0.302  10.08
11.00 Prom - 181832 181793   40                              -3.31

12.00 Prom + 183307 183346   40                              -9.07
12.01 Init + 183550 183821  272  0  2   58   37   200 0.941   7.85
12.02 Intr + 184309 184555  247  1  1  109   95   270 0.843  27.80
12.03 Intr + 184669 184912  244  0  1   -4   93   231 0.541  12.00
12.04 Intr + 185064 185321  258  1  0   96  105   171 0.708  17.57
12.05 Term + 186123 186310  188  1  2  131   43   115 0.906   9.27
12.06 PlyA + 186592 186597    6                               1.05

13.00 Prom + 192770 192809   40                               1.49
13.01 Init + 194133 194229   97  2  1   99  109    71 0.995  10.73
13.02 Intr + 194435 194557  123  0  0   94  100   196 0.951  22.46
13.03 Intr + 199224 199410  187  1  1   90   81   283 0.138  27.07
13.04 Term + 200346 200523  178  0  1  105   41   279 0.845  22.38
13.05 PlyA + 200571 200576    6                               1.05

14.04 PlyA - 200786 200781    6                              -5.41
14.03 Term - 201090 201023   68  1  2   31   45    74 0.274  -4.21
14.02 Intr - 201686 201650   37  2  1  110   99    52 0.281   6.82
14.01 Intr - 202536 202452   85  2  1   89   97    42 0.466   5.52

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  28984  28919   66  1  0   75   64    78 0.868   5.12

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815579r:54007053_54214701|GENSCAN_predicted_peptide_1|79_aa XLLLLVSLEVFRHSVRALLQRVSPEPPPAPRLTYEYSWSLGCGVGAGLILLLGAGCFLLL TLPSWPWGSLCPKRGHRAT >gi568815579r:54007053_54214701|GENSCAN_predicted_CDS_1|240_bp ngcctgctgctcttggtgagcctggaggtgttccggcattccgtgagggccctgctgcag agagtcagcccggagcctcccccggccccacgcctcacctacgagtactcctggtccctg ggctgcggcgtgggggccggcctgatcctgctgttgggggccggctgctttctgctgctc acactgccttcctggccctgggggtccctctgtcccaagcgggggcaccgggccacctag >gi568815579r:54007053_54214701|GENSCAN_predicted_peptide_2|84_aa MAEGEGEANIFFTRQQEREKSKQRRNLPNAYKTIRSPENSLYHENNKKPNLTLPLRTLIP SFPASSLLLESMKPRREAADTGAP >gi568815579r:54007053_54214701|GENSCAN_predicted_CDS_2|255_bp atggcagaaggggaaggagaagcaaatatcttcttcacaaggcaacaagagagagagaag agcaagcaaaggaggaacttgccaaacgcttataaaaccatcagatctcctgagaactca
ctttatcatgagaacaacaagaaacccaacctcaccctacctctccgcacgctcataccc tcgtttccagcctcaagccttttgctggagtccatgaagccccgcagagaggctgcagac acgggggcgccttag >gi568815579r:54007053_54214701|GENSCAN_predicted_peptide_3|73_aa MPLKSSSLCGAVLVEEAQAVPLVVVIDLAVELVDMKTEGSKNSRKGQQFLAGETVRKAAG YLETVIPNALEEV >gi568815579r:54007053_54214701|GENSCAN_predicted_CDS_3|222_bp atgccgcttaaaagctcaagtttatgcggggcagttttggtggaagaagctcaggcagtc cctctggtggtcgttatagatctggccgtggaactggtggatatgaaaacagaaggttct aaaaacagcagaaaagggcaacagttcttagcaggagagacagtgaggaaagctgcaggt tacttggagacagtcatcccaaatgcattagaggaggtgtaa >gi568815579r:54007053_54214701|GENSCAN_predicted_peptide_4|334_aa MTAEFLSLLCLGLCLGYEDEKKNEKPPKPSLHAWPSSVVEAESNVTLKCQAHSQNVTFVL RKVNDSGYKQEQSSAENEAEFPFTDLKPKDAGRYFCAYKTTASHEWSESSEHLQLVVTGS LPEPLLSVNVDPGMTPGLRTLRCLTPYNGTECIVIALLKMGIPEPLQVRQVRKNQTDFML WNVTSNDSGNYSCVYYLSNSSHLASFPSNKLEIWVTDKHDELEAPSMKTDTRTIFVAIFS CISILLLFLSVFIIYRCSQHSSSSEESTKRTSHSKLPEQEAAEADLSNMERVSLSTADPQ GVTYAELSTSALSEAASDTTQEPPGSHEYAALKV >gi568815579r:54007053_54214701|GENSCAN_predicted_CDS_4|1005_bp atgaccgcagaattcctctccctgctttgcctcgggctgtgtctgggctacgaagatgag aaaaagaatgagaaaccgcccaagccctccctccacgcctggcccagctcggtggttgaa gccgagagcaatgtgaccctgaagtgtcaggctcattcccagaatgtgacatttgtgctg cgcaaggtgaacgactctgggtacaagcaggaacagagctcggcagaaaacgaagctgaa ttccccttcacggacctgaagcctaaggatgctgggaggtacttttgtgcctacaagaca acagcctcccatgagtggtcagaaagcagtgaacacttgcagctggtggtcacaggatca ctcccagaacctttgctctcagtcaatgtagaccctgggatgactccaggtctcaggaca cttcgatgtctcactccatacaatggaaccgaatgtattgtaattgctctgttgaaaatg gggatcccagaaccattacaagtcaggcaagtaagaaaaaaccagactgatttcatgctc tggaacgtgacaagtaatgacagtggaaactacagctgtgtgtattacctgagcaactca tcacacttggcctccttccccagcaacaagctggagatctgggtgacagataaacacgat gaacttgaagctccctcaatgaaaacagacaccagaaccatctttgtcgccatcttcagc tgcatctccatccttctcctcttcctctcagtcttcatcatctacagatgcagccagcac agttcatcatctgaggaatccaccaagagaaccagccattccaaacttccggagcaggag gctgccgaggcagatttatccaatatggaaagggtatctctctcgacggcagacccccaa 
ggagtgacctatgctgagctaagcaccagcgccctgtctgaggcagcttcagacaccacc caggagcccccaggatctcatgaatatgcggcactgaaagtgtag >gi568815579r:54007053_54214701|GENSCAN_predicted_peptide_5|259_aa MIPKLLSLLCFRSLPKPSLSAWPSSVVPANSNVTLRCWTPARGVSFVLRKGGIILESPKP LDSTEGAAEFHLNNLKVRNAGEYTCEYYRKASPHILSQHSDVLLLLVTGHLSKPFLRTYQ RGTVTAGGRVTLQCQKRDQLFVPIMFALLKAGTPSPIQLQSPAGKEIDFSLVDVTAGDAG NYSCMYYQTKSPFWASEPSDQLEILVTVPPGTTSSNYSLGNFVRLGLAAVIVVIMGAFLV EAWYSRNVSPGESEAFKPE >gi568815579r:54007053_54214701|GENSCAN_predicted_CDS_5|780_bp atgatccctaagctgctttccctcctctgtttcaggtcactgcccaagccgtccctcagt gcctggcccagctcggtggtccctgccaacagcaatgtgacgctgcgatgttggactcct gccagaggtgtgagctttgttctcaggaagggaggaattattctggagtccccgaagccc cttgattctacagagggcgcggccgaatttcacctcaataatctaaaagtcagaaatgct ggagagtacacctgtgaatactacagaaaagcatccccccacatcctttcacagcacagt gacgtccttctactgttggtgacaggacatttatctaaacctttcctccgaacctaccaa aggggtacagtgaccgcaggtggaagggtgactctgcagtgccagaagcgagaccaattg tttgtgcctatcatgttcgctctactgaaggcagggacgccatcacccatccagctgcag agtccagcggggaaggagatagacttctctctggtggacgtgacagccggcgatgctggg aactacagctgcatgtactaccagacaaagtctcccttctgggcctcagaacccagtgat cagcttgagatattggtgacagttcccccaggtaccacatcgagcaactactccctgggt aacttcgtacgactgggtctggctgccgtaattgtggttatcatgggagctttcctggtg gaggcctggtacagccggaatgtgtctccaggtgaatcagaggccttcaaaccagagtga >gi568815579r:54007053_54214701|GENSCAN_predicted_peptide_6|249_aa MALVLILQLLTLSSYHPKPWLGAQPATVVTPGVNVTLRCRAPQPAWRFGLFKPGEIAPLL FRDVSSELAEFFLEEVTPAQGGSYRCCYRRPDWGPGVWSQPSDVLELLVTEELPRPSLVA LPGPVVGPGANVSLRCAGRLRNMSFVLYREGVAAPLQYRHSAQPWADFTLLGARAPGTYS CYYHTPSAPYVLSQRSEVLVISWEDSGSSDYTRGNLVRLGLAGLVLISLGALVTFDWRSQ NRAPAGIRP >gi568815579r:54007053_54214701|GENSCAN_predicted_CDS_6|750_bp atggccctggtgctgatcctccagctgctgaccctctcttcataccaccctaagccatgg ctgggagctcagccggctacagttgtgacccctggggtcaacgtgaccttgagatgccgg gcaccccaacccgcttggagatttggacttttcaagcctggagagatcgctccccttctc ttccgggatgtgtcctccgagctggcagaattctttctggaggaggtgactccagcccaa gggggaagttaccgctgctgctaccgaaggccagactgggggccgggtgtctggtcccag 
cccagcgatgtcctggagctgctggtgacagaggagctgccgcggccgtcgctggtggcg ctgcccgggccggtggtgggtcctggcgccaacgtgagcctgcgctgcgcgggccgcctg cggaacatgagcttcgtgctgtaccgcgagggcgtggcggccccgctgcagtaccgccac tccgcgcagccctgggccgacttcacgctgctgggcgcccgcgcccccggcacctacagc tgctactatcacacgccctccgcgccctacgtgctgtcgcagcgcagcgaggtgctggtc atcagctgggaagactctggctcctccgactacacccgggggaacctagtccgcctgggg ctggccgggctggtcctcatctccctgggcgcgctggtcacttttgactggcgcagtcag aaccgcgctcctgctggtatccgcccctga >gi568815579r:54007053_54214701|GENSCAN_predicted_peptide_7|123_aa MAVTEHWRTRSPGSGFSRIGTTTPRVLRVLAAVAAAETKMAARVGAFLKNAWDKEPVLVV SFVVGGLAVILPPLSPYFKYSVMINKATPYNYPVPVRDDGNMPDVPSHPQDPQGPSLEWL KKL >gi568815579r:54007053_54214701|GENSCAN_predicted_CDS_7|372_bp atggcggtaacagagcactggcgcacgcgcagccctgggagcgggttctcgcgcataggg accacaactcccagggtgctccgcgtcctcgccgctgtcgccgccgcggagacaaagatg gctgcgagagtcggcgccttcctcaagaatgcctgggacaaggagccagtgctggtcgtg tccttcgtcgtcgggggcctcgctgtaattctgcccccattgagcccctacttcaagtac tccgtcatgatcaacaaggccacgccctacaactacccagtgcccgtccgtgatgatggg aacatgcccgacgtgcccagccacccccaggaccctcagggccccagcctggagtggctg aagaaactgtga >gi568815579r:54007053_54214701|GENSCAN_predicted_peptide_8|367_aa MAAVGFEEFSAPPGSELALPPLFGGHILESELETEVEFVSGGLGGSGLRERDEEEEAARG RRRRQRELNRRKYQALGRRCREIEQVVWAHGPSLQVVTKVAPPALTSSMSDSLVFTKHFS LCKVIDSANVHRGCTTCQALVKARDVETSCCRFSAHALAGEAVNERVLNRLHQVQRITRR LQQERSSSSLPKAETCGSQGSSVRTCAVQHGSRWPQPLGDKLFRWWLLPLSRFLMRVLDS YGDDYRASQFTIVLEDEGSQGTDAPTPGNAENEPPEKETLSPPRRTPAPPEPGSPAPGEG PSGRKRRRVPRDGRRAGNALTPELAPVQIKVEEDFGFEADEALDSSWVSRGPDKLLPYPT LASPASD >gi568815579r:54007053_54214701|GENSCAN_predicted_CDS_8|1104_bp atggcagccgtgggctttgaggagttctcagcgccgccaggctcagagttggcgttgcct cccctatttggtggccacatcctggagagcgagctggagacggaagtggagtttgtgtca ggtggtctgggcggctcagggctccgggagcgagatgaagaggaagaggcagcccggggt cggcggcggcgccagcgggaattaaatcgcagaaagtaccaggcactaggtcggcgctgc cgggagatcgagcaggtagtctgggcccacgggccttctctgcaggtcgtaactaaagtc gcacctcctgccctaacctccagcatgtctgactctttggtattcaccaagcacttctca 
ctttgcaaagtcattgattctgcaaatgttcatcgaggatgtactacgtgccaggctctg gttaaggcacgggatgtagaaacaagttgctgtcgtttttcagctcatgctctggctgga gaggcggtgaacgagcgggtcctgaacaggctccatcaggtgcagaggataactcggagg ctgcagcaggaacggagttcatcctccctccccaaagctgaaacctgcggctcccagggt tcctctgtccgtacctgtgctgtccagcacggtagccgctggccacagccgctaggagat aagttattccgttggtggcttctccccctgagcaggttcctcatgagagtgctggactcc tacggggatgactaccgggccagccagttcaccattgtgctggaggatgagggcagccag ggcacggatgcccccaccccaggcaatgcggagaatgagcctccagagaaagagacactg tccccgcccagaaggactcctgcacccccagaacccggcagcccagcccccggtgagggg cccagtgggcggaagaggcggcgagtgccacgggatggacgccgagcaggaaatgcgctg actccagagctggccccggtgcagattaaggttgaggaagactttggctttgaagcagat gaggccctggattccagttgggtttctcggggtccagacaaactgctgccctacccgacc ctggccagcccagcctctgactga >gi568815579r:54007053_54214701|GENSCAN_predicted_peptide_9|592_aa MSLADELLADLEEAAEEEEGGSYGEEEEEPAIEDVQEETQLDLSGDSVKTIAKLWDSKMF AEIMMKIEEYISKQAKASEVMGPVEAAPEYRVIVDANNLTVEIENELNIIHKFIRDKYSK RFPELESLVPNALDYIRTVKVSAEKELGNSLDKCKNNENLQQILTNATIMVVSVTASTTQ GQQLSEEELERLEEACDMALELNASKHRIYEYVESRMSFIAPNLSIIIGASTAAKIMGVA GGLTNLSKMPACNIMLLGAQRKTLSGFSSTSVLPHTGYIYHSDIVQSLPPDLRRKAARLV AAKCTLAARVDSFHESTEGKVGYELKDEIERKFDKWQEPPPVKQVKPLPAPLDGQRKKRG GRRYRKMKERLGLTEIRKQANRMSFGEPQSDPRDPWSLCLRCLEPPRLPIAPGSLAGSSL PRGSLVPCCTAAPSLGPASLLCYPSVIPLVLQDRTQRPPHPIKPVLVPDIPRPTRIEEDA YQEDLGFSLGHLGKSGSGRVRQTQVNEATKARISKTLQRTLQKQSVVYGGKSTIRDRSSG TASSVAFTPLQGLEIVNPQAAEKKVAEANQKYFSSMAEFLKVKGEKSGLMST >gi568815579r:54007053_54214701|GENSCAN_predicted_CDS_9|1779_bp atgtctctggcagatgagctcttagctgatctcgaagaggcagcagaagaggaggaagga ggaagctatggggaggaagaagaggagccagcgatcgaggatgtgcaggaggagacacag ctggatctttccggggattcagtcaagaccatcgccaagctatgggatagtaagatgttt gctgagattatgatgaagattgaggagtatatcagcaagcaagccaaagcttcagaagtg atgggaccagtggaggccgcgcctgaataccgcgtcatcgtggatgccaacaacctgacc gtggagatcgaaaacgagctgaacatcatccataagttcatccgggataagtactcaaag agattccctgaactggagtccttggtccccaatgcactggattacatccgcacggtcaag gtgagcgcagagaaggagctgggcaacagcctggacaagtgcaagaacaatgagaacctg 
cagcagatcctcaccaatgccaccatcatggtcgtcagcgtcaccgcctccaccacccag gggcagcagctgtcggaggaggagctggagcggctggaggaggcctgcgacatggcgctg gagctgaacgcctccaagcaccgcatctacgagtatgtggagtcccggatgtccttcatc gcacccaacctgtccatcattatcggggcatccacggccgccaagatcatgggtgtggcc ggcggcctgaccaacctctccaagatgcccgcctgcaacatcatgctgctcggggcccag cgcaagacgctgtcgggcttctcgtctacctcagtgctgccccacaccggctacatctac cacagtgacatcgtgcagtccctgccaccggatctgcggcggaaagcggcccggctggtg gccgccaagtgcacactggcagcccgtgtggacagtttccacgagagcacagaagggaag gtgggctacgaactgaaggatgagatcgagcgcaaattcgacaagtggcaggagccgccg cctgtgaagcaggtgaagccgctgcctgcgcccctggatggacagcggaagaagcgaggc ggccgcaggtaccgcaagatgaaggagcggctggggctgacggagatccggaagcaggcc aaccgtatgagcttcggagagccccaaagcgaccctcgcgacccttggagcctgtgtctc cgctgcttagagcccccgcggcttcccatcgccccgggctccttggccggttcctccctg cccagaggctccttagtgccctgctgcacggccgccccgtccctgggccccgccagtctc ctctgttatcccagcgtcatccccttggtcctgcaggaccgaactcagaggccacctcat cctattaaacctgttctggttcctgacatcccccgacccacacgaatcgaggaggacgcc taccaggaggacctgggattcagcctgggccacctgggcaagtcgggcagtgggcgtgtg cggcagacacaggtaaacgaggccaccaaggccaggatctccaagacgctgcagcggacc ctgcagaagcagagcgtcgtatatggcgggaagtccaccatccgcgaccgctcctcgggc acggcctccagcgtggccttcaccccactccagggcctggagattgtgaacccacaggcg gcagagaagaaggtggctgaggccaaccagaagtatttctccagcatggctgagttcctc aaggtcaagggcgagaagagtggccttatgtccacctga >gi568815579r:54007053_54214701|GENSCAN_predicted_peptide_10|882_aa MASPRLPPAAASAARGSRRGAAPSLSPSRSCASFTFLSASRGSFRDPDPGPRPPPPPRVA SRRAPRDKVLAILGIGMAAFLDLRCPRLCMCPHREIDRCLKKVSEGVEQFEDIWQKLHNA ANANQKEKYEADLKKEIKKLQRLRDQIKTWVASNEIKDKRQLIDNRKLIETQMERFKVVE RETKTKAYSKEGLGLAQKVDPAQKEKEEVGQWLTNTIDTLNMQVDQFESEVESLSVQTRK KKGDKDKQDRIEGLKRHIEKHRYHVRMLETILRMLDNDSILVDAIRKIKDDVEYYVDSSQ DPDFEENEFLYDDLDLEDIREALGLIVAQEVRAQNGLSQALVATSPPSHSHMEDEIFNQS SSTPTSTTSSSPIPPSPANCTTENSEDDKKRGRSTDSEVSQSPAKNGSKPVHSNQHPQSP AVPPTYPSGPPPAASALSTTPGNNGVPAPAAPPSALGPKASPAPSHNSGTPAPYAQAVAP PAPSGPSTTQPRPPSVQPSGGGGGGSGGGGSSSSSNSSAGGGAGKQNGATSYSSVVADSP AEVALSSSGGNNASSQALGPPSGPHNPPPSTSKEPSAAAPTGAGGVAPGSGNNSGGPSLL 
VPLPVNPPSSPTPSFSDAKAAGALLNGPPQFSTAPEIKAPEPLSSLKSMAERAAISSGIE DPVPTLHLTERDIILSSTSAPPASAQPPLQLSEVNIPLSLGVCPLGPVPLTKEQLYQQAM EEAAWHHMPHPSDSERIRQYLPRNPCPTPPYHHQMPPPHSDTVEFYQRLSTETLFFIFYY LEVQQGPRGSLGPPGFAATAAVPPRAGGGTKAQYLAAKALKKQSWRFHTKYMMWFQRHEE PKTITDEFEQGTYIYFDYEKWGQRKKEGFTFEYRYLEDRDLQ >gi568815579r:54007053_54214701|GENSCAN_predicted_CDS_10|2649_bp atggcctcccctcgcctgccccctgccgccgcctctgcagcgcggggctcccggcggggg gcggctccctccctctcgccctcccgttcctgcgcctctttcacgttcctcagcgcctcc cgggggtccttccgcgacccggaccccgggccccgcccgccgccgcctccccgcgtggca tcgcgtcgggccccccgggataaggttcttgccatccttggtattggtatggctgctttt ctggatttgaggtgtccacgcctctgcatgtgtccccaccgtgagattgatcgctgcctc aagaaggtgtccgagggcgtggagcagtttgaagatatttggcagaagctccacaatgca gccaacgcgaaccagaaagaaaagtatgaggctgacctaaagaaggagattaagaagcta caacggctgagggaccaaatcaagacatgggtagcgtccaacgagatcaaggacaagagg cagcttatagacaaccgcaagctcattgagacgcaaatggaacggttcaaagttgtggaa cgagagaccaaaaccaaagcttacagcaaagagggcctgggcctggcccagaaggtagat cctgcccagaaggagaaggaagaggttggccagtggctcacgaataccatcgacacgctc aacatgcaggtggaccagtttgagagtgaagtggagtcactgtcagtgcagacacgcaag aagaagggcgacaaggataagcaggaccggattgagggcttgaagcggcacatcgagaag caccgctaccacgtgcgcatgctagagaccatcctgcgcatgctggacaatgactccatc ctcgttgacgccatccgcaagatcaaggacgacgttgagtactatgttgactcatcccag gaccccgacttcgaggagaacgagtttctctacgatgacctggacctcgaggacattcgt gaggccctggggctgatcgtggcacaggaagtgagggcccagaatgggctgtcacaggcg ctggtcgccacctccccccccagccacagccacatggaggatgagatcttcaaccagtcc agcagcacgcccacctcaaccacctccagctctcccatcccgcccagcccagccaactgt accacggaaaactctgaagatgataagaagaggggacgttccacagacagtgaagtcagc cagtctccagccaaaaacggctccaagcctgtccacagcaaccagcaccctcagtcccca gctgtgccgcccacctacccctccggccccccgcctgctgcctctgccttgagcaccact cctggcaacaatggggtccccgcccccgcagcacccccaagtgccctgggccccaaggcc agtccagctcccagccacaactcgggcacccctgctccctatgcccaggctgtggcccca ccagctcccagtgggcccagcacgacccagccccggccccccagcgtccagcctagcgga ggcggaggcggcggcagcggaggcggagggagcagcagcagtagtaacagcagtgccggt ggaggggctggcaagcagaatggcgccaccagttacagctcagttgtggcagacagcccg 
gcagaggtggctttgagcagcagtgggggcaacaatgccagcagccaggccttgggcccc ccttccggcccccacaacccacctcccagcacctcgaaggaacccagtgcggcagcccca acgggggctgggggcgtggccccaggctcagggaacaactcagggggacccagcctcctg gtgccactgcctgtgaatcctcccagctccccaacgcccagcttcagtgatgccaaggca gccggtgccctgctcaatgggcctccacagttcagcaccgccccagaaatcaaggcccct gagcctctgagctccttgaagtccatggcggaacgggcagccatcagctctggcattgag gaccctgtgccaacgctgcacctgaccgagcgagacatcatcctgagcagtacatcagca cctccggcctcagcccagccgcccctgcagctgtcagaggtgaacataccgctgtcgctg ggtgtctgtccactgggccctgtgcccctcaccaaggagcagctctatcagcaggccatg gaagaggccgcctggcaccacatgcctcacccctctgactctgagcgtattcggcagtac ctcccccggaacccctgtccgacgcccccctaccaccaccagatgccacccccacactcg gacactgtggaattctaccagcgcctgtcgaccgagacactcttcttcatcttctactat ctggaggtacagcagggcccccggggcagcctcgggccccccggcttcgccgccaccgcc gccgtcccccctcgggctggagggggcactaaggcacagtatctggcagccaaggcccta aagaagcagtcatggcgattccacaccaagtacatgatgtggttccagaggcacgaggag cccaagaccatcactgacgagtttgagcagggcacctacatctactttgactacgagaag tggggccagcggaagaaggaaggcttcacctttgagtaccgctacctggaggaccgggac ctccagtga >gi568815579r:54007053_54214701|GENSCAN_predicted_peptide_11|1445_aa MSPEEWTYLVVLLISIPIGFLFKKAGPGLKRWGAAAVGLGLTLFTCGPHTLHSLVTILGT WALIQAQPCSCHALALAWTFSYLLFFRALSLLGLPTPTPFTNAVQLLLTLKLVSLASEVQ DLHLAQRKEMASGFSKGPTLGLLPDVPSLMETLSYSYCYVGIMTGPFFRYRTYLDWLEQP FPGAVPSLRPLLRRAWPAPLFGLLFLLSSHLFPLEAVREDAFYARPLPARLFYMIPVFFA FRMRFYVAWIAAECGCIAAGFGAYPVAAKARAGGGPTLQCPPPSSPEKAASLEYDYETIR NIDCYSTDFCVRVRDGMRYWNMTVQWWLAQYIYKSAPARSYVLRSAWTMLLSAYWHGLHP GYYLSFLTIPLCLAAEGRLESALRGRLSPGGQKAWDWVHWFLKMRAYDYMCMGFVLLSLA DTLRYWASIYFCIHFLALAALGLGLALGPSLSSVLNELPSAATLRYRDPGVLPWGALEEE EEDGGRSRKAFTEVTQTELQDPHPSRELPWPMQARRAHRQRNASRDQVVYGSGTKTDRWA RLLRRSKEKTKEGLRSLQPWAWTLKRIGGQFGAGTESYFSLLRFLLLLNVLASVLMACMT LLPTWLGGAPPGPPGPDISSPCGSYNPHSQGLVTFATQLFNLLSGEGYLEWSPLFYGFYP PRPRLAVTYLCWAFAVGLICLLLILHRSVSGLKQTLLAESEALTSYSHRVFSAWDFGLCG DVHVRLRQRIILYELKVELEETVVRRQAAVRTLGQQARVWLVRVLLNLLVVALLGAAFYG VYWATGCTVELQEMPLVQELPLLKLGVNYLPSIFIAGVNFVLPPVFKLIAPLEGYTRSRQ 
IVFILLRTVFLRLASLVVLLFSLWNQITCGGDSEAEDCKTCGYNYKQLPCWETVLGQEMY KLLLFDLLTVLAVALLIQFPRKLLCGLCPGALGRLAGTQEFQVPDEVLGLIYAQTVVWVG SFFCPLLPLLNTVKFLLLFYLKKLTLFSTCSPAARTFRASAANFFFPLVLLLGLAISSVP LLYSIFLIPPSKLCGPFRGQSSIWAQIPESISSLPETTQNFLFFLGTQAFAVPLLLISSI LMAYTVALANSYGRLISELKRQRETICDKRRLLGDTTSRTPSIRCDKGRLSQSPVATGQG AVPTNSEPIRGRSRDKKKGGILSPIGSAKRRACQSLDSYDAMNILPKKSWHVRNKDNVAR VRRDEAQAREEEKERERRVLLAQQEARTEFLRKKARHQNSLPELEAAEAGAPGSGPVDLF RELLEEGKGVIRGNKEYKEEKRQEKERQEKALGILTYLGQSAAEAQTQPPWYQLPPGRGG PPPGPAPDEKIKSRLDPLREMQKHLGKKRQHGGDEGSRSRKEKEGSEKQRPKEPPSLDQL RAERLRREAAERSRAEALLARVQGRALQEGQPEEDETDDRRRRYNSQFNPQLARRPRQQD PHLTH >gi568815579r:54007053_54214701|GENSCAN_predicted_CDS_11|4338_bp atgtcgcctgaagaatggacgtatctagtggttcttcttatctccatccccatcggcttc ctctttaagaaagccggtcctgggctgaagagatggggagcagccgctgtgggcctgggg ctcaccctgttcacctgtggcccccacactttgcattctctggtcaccatcctcgggacc tgggccctcattcaggcccagccctgctcctgccacgccctggctctggcctggactttc tcctatctcctgttcttccgagccctcagcctcctgggcctgcccactcccacgcccttc accaatgccgtccagctgctgctgacgctgaagctggtgagcctggccagtgaagtccag gacctgcatctggcccagaggaaggaaatggcctcaggcttcagcaaggggcccaccctg gggctgctgcccgacgtgccctccctgatggagacactcagctacagctactgctacgtg ggaatcatgacaggcccgttcttccgctaccgcacctacctggactggctggagcagccc ttccccggggcagtgcccagcctgcggcccctgctgcgccgcgcctggccggccccgctc ttcggcctgctgttcctgctctcctctcacctcttcccgctggaggccgtgcgcgaggac gccttctacgcccgcccgctgcccgcccgcctcttctacatgatccccgtcttcttcgcc ttccgcatgcgcttctacgtggcctggattgccgccgagtgcggctgcattgccgccggc tttggggcctaccccgtggccgccaaagcccgggccggaggcggccccaccctccaatgc ccaccccccagcagtccggagaaggcggcttccttggagtatgactatgagaccatccgc aacatcgactgctacagcacagatttctgcgtgcgggtgcgcgatggcatgcggtactgg aacatgacggtgcagtggtggctggcgcagtatatctacaagagcgcacctgcccgttcc tatgtcctgcggagcgcctggaccatgctgctgagcgcctactggcacggcctccacccg ggctactacctgagcttcctgaccatcccgctgtgcctggctgccgagggccggctggag tcagccctgcgggggcggctgagcccagggggccagaaggcctgggactgggtgcactgg ttcctgaagatgcgcgcctatgactacatgtgcatgggcttcgtgctgctctccttggcc 
gacacccttcggtactgggcctccatctacttctgtatccacttcctggccctggcagcc ctggggctggggctggctttaggcccatcgctgtcttctgtgctgaacgagctgcccagt gctgccacccttcggtaccgagaccctggggtgctgccttggggggcgctggaggaggag gaggaggatggaggaaggagcagaaaggccttcacagaagtcacccagacagagctgcag gaccctcacccttcccgggaactgccctggcccatgcaggccagacgggcacacaggcaa agaaatgccagcagggaccaggtggtctatggctctggaactaagacggaccgatgggcg cggctacttcggaggtccaaggagaaaacaaaggaaggcttgcgaagcctgcagccctgg gcgtggacactgaagaggatcgggggccagtttggcgccggcacggagtcctacttctcc ctgctgcgcttcctgctccttcttaacgtgctggcctctgtgctcatggcctgcatgacg ctgctgcccacctggttgggaggcgctcccccaggccctcccggccccgacatctcctcg ccctgcggctcctataacccccactcccagggcctggtcacctttgccacccagctcttc aacttgctctcgggtgagggttacctggaatggtcccctctcttctatggcttctacccg ccccgcccacgcctggcggtcacctacctgtgctgggcctttgccgttggcctcatctgc ctcctgctcatcctgcatcgctcggtgtctgggctgaagcagacactgctggcggagtcc gaggctctgaccagctacagccaccgggtgttctcggcctgggacttcggtctctgcggg gacgtccacgtgcggctgcgccagcgcatcatcttgtacgaattaaaggtggagctggag gagacagtggtgcggcgccaggctgcggtgcggacgctgggccagcaagccagggtttgg ttggtgcgggtgctgctcaacctgctggtggtcgcgctcctgggggcagccttctatggc gtctactgggctacggggtgcaccgtggagctgcaggagatgccccttgtccaggagttg ccactgctgaagcttggggtgaattaccttccgtccatcttcatcgctggggtcaatttt gtgctgccgcccgtgttcaagctcattgctccactggagggctacactcggagtcgccag atcgtttttatcctgctcaggaccgtgtttcttcgcctcgcctccctggtggtcctgctc ttctctctctggaatcagatcacttgtgggggcgactccgaggctgaggactgcaaaacc tgtggctacaattacaaacaacttccgtgctgggagactgtcctgggccaggaaatgtac aaacttctgctctttgatctgctgactgtcttggcagtcgcgctgctcatccagtttcct agaaagctcctctgtggcctctgtcctggggcgctgggtcgtctggcggggacccaagag ttccaggtgcccgacgaggtgctggggctcatctacgcgcagacggtggtctgggtgggg agttttttctgccctttactgcccctgcttaacacggtcaagttcctgctgcttttctac ctgaagaagcttaccctcttctccacctgctccccggctgcccgcaccttccgggcctcc gcggcgaatttctttttccccttggtccttctcctgggtctggccatctccagcgttccc ctgctttacagcatcttcctgatcccgccttctaagctgtgtggtccattccgggggcag tcgtccatctgggcccagatccctgagtctatttccagcctccctgagaccacccagaat 
ttcctcttcttcctggggacccaggcttttgctgtgccccttctgctgatctccagcatc ctgatggcgtacactgtggctctggctaactcctacggacgcctcatctctgagctcaaa cgtcagagagagacgatctgtgacaagaggcggttgctaggggataccacgagccgaacg cctagcattcgctgtgataaagggcgtctcagccaatcacctgtcgctacaggccagggg gccgtaccaactaattcggaaccaatccgcggtcgaagtagggacaagaaaaaggggggc atcctctcgccaatcggaagtgcaaagaggcgggcgtgccagtccctggacagctacgac gccatgaatatcttgcccaagaagagctggcacgtccggaacaaggacaatgtcgcccgc gtgcggcgtgacgaggcccaggcccgggaggaggagaaggagcgtgagcggagggtgctg ctggctcagcaagaggcccgtacagaattcctacggaagaaagccagacatcagaactca ctgcctgagcttgaagcagcagaggcgggagccccaggttctggccctgtggacctgttt cgggagctgctggaggaagggaaaggagtgatcagaggcaataaagagtacaaggaagaa aagcgacaggagaaagagaggcaagagaaagctctgggcatcctgacatacctgggccag agtgcagcggaggcacagactcaacccccttggtaccagctacccccagggcgagggggc cccccgcccggcccagccccagatgagaagatcaagagccgtctggaccctctgcgggag atgcagaagcatctggggaagaagagacagcacggcggtgatgaaggcagtcgcagcaga aaggaaaaggaggggtctgagaagcagcgacccaaggagcctccatccctggaccagctt cgagctgaacgtctgcggagggaagcagctgagaggtctcgggcagaggccctgctggcc cgggtccaaggccgggcactacaggagggtcagccggaagaagacgagacggatgaccgg cggcggcggtacaactcccaattcaacccccagctggcccggcgcccccgccagcaggac cctcaccttactcactga >gi568815579r:54007053_54214701|GENSCAN_predicted_peptide_12|402_aa MTPERRTYCVREEAGLRIRPSREHPNVKLLAFEKGAGLRIRWSRGRPNAELPVLGRGQGF VLWESESEEVRKPNHSRSERGGAKARASGRGRMLVVEVANGRSLVWGAEAVQALRERLGV GGRTVGALPRGPRQNSRLGLPLLLMPEEARLLAEIGAVTLVSAPRPDSRHHSLALTSFKR QQEESFQEQSALAAEARETRRQELLEKITEGQAAKKQKLEQASGASSSQEAGSSQAAKED ETSDGQASGEQEEAGPSSSQAGPSNGVAPLPRSALLVQLATARPRPVKARPLDWRVQSKD WPHAGRPAHELRYSIYRDLWERGFFLSAAGKFGGDFLVYPGDPLRFHAHYIAQCWAPEDT IPLQDLVAAGRLGTSVRKTLLLCSPQPDGKVVYTSLQWASLQ >gi568815579r:54007053_54214701|GENSCAN_predicted_CDS_12|1209_bp atgacgcccgaacgccgaacctattgcgtccgggaggaggcggggctacggattcggccg agccgagaacacccgaacgtcaaattgctggcgttcgagaagggggcggggctgcggatt cggtggagccgaggacgcccgaacgccgaacttcctgtgctcgggagggggcagggtttt gtactgtgggagtctgagagcgaggaggtccgaaagccgaatcacagtcgttcggaaaga 
ggaggagcgaaggctcgagcgtccggaagagggaggatgctggtggtggaggtggcgaac ggccgctccctggtgtggggagccgaggcggtgcaggccctccgggagcgcctgggtgtg gggggccgcacggtaggcgccctgccccgcgggccccgccagaactcgcgcctgggcctc ccgctgctgctgatgcccgaagaggcgcggctcttggccgagatcggcgccgtgactctg gtcagcgccccgcgtccagactctcggcaccacagcctggccctgacatccttcaagcgc cagcaagaggagagcttccaggagcagagcgccttggcagctgaggcccgggagacccgt cgtcaggagctcctggagaagattacggagggccaggctgctaagaagcagaaactagaa caggcttcaggggccagctcaagccaggaggccggctcgagccaggctgccaaagaggat gagaccagtgatggccaggcttcgggagagcaggaggaagctggcccctcgtcttcccaa gcaggaccctcaaatggggtagcccccttgcccagatctgctctccttgtccagctggcc actgccaggcctcgaccggtcaaggccaggcccctggactggcgtgtccagtctaaagac tggccccacgccggccgccctgcccacgagctgcgctacagtatctacagagacctgtgg gagcgaggcttcttcctcagtgcggctggcaagttcggaggtgacttcctggtctatcct ggtgaccccctccgcttccacgcccattatatcgctcagtgctgggcccctgaggacacc atcccactccaagacctggttgctgctgggcgccttggaaccagcgtcagaaagaccctg ctcctctgttctccgcagcctgatggtaaggtggtctacacctccctgcaatgggccagc ctgcagtga >gi568815579r:54007053_54214701|GENSCAN_predicted_peptide_13|194_aa MPVARSWVCRKTYVTPRRPFEKSRLDQELKLIGEYGLRNKREVWRVKFTLAKIRKAAREL LTLDEKDPRRLFEGNALLRRLVRIGVLDEGKMKLDYILGLKIEDFLERRLQTQVFKLGLA KSIHHARVLIRQRHIRVRKQVVNIPSFIVRLDSQKHIDFSLRSPYGGGRPGRVKRKNAKK GQGGAGAGDDEEED >gi568815579r:54007053_54214701|GENSCAN_predicted_CDS_13|585_bp atgccagtggcccggagctgggtttgtcgcaaaacttatgtgaccccgcggagacccttc gagaaatctcgtctcgaccaagagctgaagctgatcggcgagtatgggctccggaacaaa cgtgaggtctggagggtcaaatttaccctggccaagatccgcaaggccgcccgggaactg ctgacgcttgatgagaaggacccacggcgtctgttcgaaggcaacgccctgctgcggcgg ctggtccgcattggggtgctggatgagggcaagatgaagctggattacatcctgggcctg aagatagaggatttcttagagagacgcctgcagacccaggtcttcaagctgggcttggcc aagtccatccaccacgctcgcgtgctgatccgccagcgccatatcagggtccgcaagcag gtggtgaacatcccgtccttcattgtccgcctggattcccagaagcacatcgacttctct ctgcgctctccctacgggggtggccgcccgggccgcgtgaagaggaagaatgccaagaag ggccagggtggggctggggctggagacgacgaggaggaggattaa >gi568815579r:54007053_54214701|GENSCAN_predicted_peptide_14|63_aa 
XEALGGHGVVSDPDLLGQKHKAEGMELRPAAVELRNCELHPDKTGNLPNINCITSFETSK QIL >gi568815579r:54007053_54214701|GENSCAN_predicted_CDS_14|192_bp nnggaagccttaggtggacacggggtggtcagtgaccccgacctcttgggccagaagcac aaggcagaaggcatggagttgagaccggctgccgtggagctccgcaattgtgagctacac ccagacaaaactgggaacttgcccaacatcaactgcatcacaagctttgaaactagcaag caaattctgtga