GENSCAN 1.0 Date run: 4-Nov-116 Time: 18:50:41 Sequence gi568815583r:77898258_78154169 : 255912 bp : 49.61% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 253 576 324 0 0 56 80 302 0.909 23.53 1.02 Intr + 632 736 105 1 0 59 7 170 0.593 6.41 1.03 Intr + 884 1354 471 1 0 14 78 699 0.641 54.78 1.04 Intr + 3078 3547 470 2 2 60 68 456 0.781 32.70 1.05 Intr + 4884 4956 73 0 1 70 84 29 0.532 0.11 1.06 Intr + 5669 5867 199 1 1 19 90 255 0.866 17.72 1.07 Term + 5969 6039 71 0 2 74 48 85 0.762 1.20 1.08 PlyA + 12750 12755 6 1.05 2.33 PlyA - 13351 13346 6 1.05 2.32 Term - 17054 16927 128 0 2 90 49 88 0.993 3.54 2.31 Intr - 17294 17141 154 2 1 68 101 147 0.669 13.65 2.30 Intr - 17486 17383 104 0 2 87 97 124 0.999 13.09 2.29 Intr - 17981 17757 225 0 0 45 61 200 0.264 10.96 2.28 Intr - 18832 18760 73 1 1 7 87 93 0.967 0.08 2.27 Intr - 19075 18959 117 1 0 20 -19 213 0.903 4.46 2.26 Intr - 20231 20151 81 2 0 86 46 145 0.826 9.93 2.25 Intr - 21058 20508 551 2 2 91 68 627 0.713 53.59 2.24 Intr - 21554 21467 88 2 1 94 86 98 0.993 9.74 2.23 Intr - 21854 21747 108 2 0 131 78 59 0.999 9.68 2.22 Intr - 22100 21963 138 2 0 18 94 157 0.856 9.96 2.21 Intr - 23185 23079 107 2 2 86 96 159 0.994 16.43 2.20 Intr - 23400 23282 119 2 2 90 81 60 0.963 5.61 2.19 Intr - 23815 23772 44 1 2 119 7 65 0.935 -1.46 2.18 Intr - 24753 24673 81 0 0 78 91 83 0.908 7.43 2.17 Intr - 26798 26679 120 2 0 116 76 26 0.967 4.89 2.16 Intr - 28638 28485 154 2 1 61 93 76 0.487 5.47 2.15 Intr - 30409 30313 97 2 1 86 94 42 0.500 3.67 2.14 Intr - 35219 35118 102 0 0 112 31 120 0.843 8.85 2.13 Intr - 35924 35893 32 1 2 112 35 0 0.511 -5.13 2.12 Intr - 37540 37420 121 2 1 119 32 166 0.672 13.65 2.11 Intr - 41324 41133 192 0 0 102 89 1 0.659 1.06 2.10 Intr - 41866 41785 82 1 1 104 61 69 0.725 5.01 2.09 Intr - 42913 42840 74 2 2 90 -16 77 0.324 -3.37 2.08 Intr - 43265 43185 81 0 0 71 92 70 0.903 5.31 2.07 Intr - 43544 43425 120 0 0 -25 102 153 0.891 5.87 2.06 Intr - 43907 43808 100 2 1 86 52 93 0.539 5.18 2.05 Intr - 44489 44373 117 2 0 136 23 48 0.374 3.76 2.04 Intr - 46502 46390 113 0 2 118 21 32 0.024 -0.40 2.03 Intr - 51315 51190 126 1 0 91 94 37 0.949 5.25 2.02 Intr - 53547 53383 165 1 0 63 45 138 0.973 6.93 2.01 Init - 62281 62245 37 1 1 67 61 15 0.069 -3.13 2.00 Prom - 63673 63634 40 -4.26 3.00 Prom + 68895 68934 40 -5.06 3.01 Init + 77019 77225 207 2 0 56 55 165 0.853 6.86 3.02 Intr + 77751 77930 180 2 0 77 76 247 0.870 22.46 3.03 Intr + 79025 79170 146 1 2 65 83 152 0.999 11.48 3.04 Intr + 80582 80751 170 2 2 136 113 229 0.999 29.79 3.05 Intr + 80905 80996 92 2 2 78 23 9 0.559 -6.89 3.06 Intr + 81743 81863 121 1 1 99 47 241 0.604 21.17 3.07 Intr + 83125 83369 245 2 2 43 51 345 0.982 23.52 3.08 Intr + 83663 83792 130 1 1 55 78 119 0.993 7.87 3.09 Intr + 85495 85579 85 2 1 -6 100 65 0.728 -2.82 3.10 Intr + 86020 86123 104 1 2 79 64 144 0.915 10.92 3.11 Intr + 86215 86341 127 2 1 54 99 126 0.994 10.04 3.12 Intr + 86382 86508 127 0 1 87 41 40 0.702 -0.12 3.13 Intr + 87319 87417 99 0 0 109 -25 110 0.597 2.11 3.14 Intr + 87571 88104 534 0 0 44 -10 425 0.091 21.42 3.15 Intr + 89615 89767 153 1 0 93 100 66 0.648 8.57 3.16 Intr + 90164 90379 216 1 0 89 78 30 0.399 0.80 3.17 Intr + 90460 90606 147 0 0 95 99 90 0.978 11.23 3.18 Intr + 91752 91914 163 2 1 110 86 179 0.989 19.45 3.19 Term + 94646 94803 158 0 2 109 47 161 0.998 12.20 3.20 PlyA + 94955 94960 6 -0.45 4.14 PlyA - 94974 94969 6 1.05 4.13 Term - 100098 99903 196 2 1 94 48 382 0.612 31.68 4.12 Intr - 103483 103362 122 1 2 110 95 128 0.984 14.99 4.11 Intr - 105233 105048 186 2 0 141 113 127 0.997 20.39 4.10 Intr - 110857 110740 118 0 1 127 68 28 0.962 5.17 4.09 Intr - 115060 114566 495 0 0 91 94 525 0.975 45.51 4.08 Intr - 118482 118289 194 0 2 27 99 116 0.942 4.89 4.07 Intr - 119700 119590 111 0 0 89 98 26 0.892 4.28 4.06 Intr - 126282 125899 384 0 0 97 69 555 0.975 49.55 4.05 Intr - 127240 127002 239 2 2 73 88 210 0.995 16.83 4.04 Intr - 131913 131750 164 2 2 110 64 149 0.992 14.32 4.03 Intr - 146811 146643 169 1 1 110 89 46 0.411 5.90 4.02 Intr - 155930 155777 154 2 1 126 95 68 0.640 10.95 4.01 Init - 179395 179036 360 1 0 87 81 566 0.938 50.88 4.00 Prom - 188068 188029 40 -4.86 5.00 Prom + 193054 193093 40 -5.26 5.01 Init + 194328 194503 176 2 2 109 80 80 0.937 8.17 5.02 Intr + 195855 195944 90 0 0 67 110 56 0.947 4.81 5.03 Intr + 199672 199837 166 1 1 48 65 243 0.986 17.96 5.04 Intr + 199950 200339 390 2 0 74 78 291 0.454 21.62 5.05 Intr + 202642 203301 660 0 0 116 105 251 0.862 21.60 5.06 Term + 205208 205258 51 1 0 107 55 73 0.815 3.33 5.07 PlyA + 205696 205701 6 -3.64 6.07 PlyA - 206656 206651 6 -0.45 6.06 Term - 207075 207054 22 2 1 151 55 59 0.996 6.68 6.05 Intr - 207677 207482 196 0 1 88 53 448 0.946 39.77 6.04 Intr - 211125 210978 148 0 1 85 46 253 0.930 20.61 6.03 Intr - 213019 212908 112 0 1 77 79 136 0.741 11.98 6.02 Intr - 225482 225448 35 2 2 113 108 61 0.834 7.62 6.01 Init - 232958 232908 51 2 0 121 99 115 0.999 17.06 6.00 Prom - 242971 242932 40 -5.56 7.04 PlyA - 243663 243658 6 1.05 7.03 Term - 251123 251050 74 0 2 87 47 59 0.524 -0.23 7.02 Intr - 251986 251860 127 1 1 91 93 92 0.810 10.25 7.01 Init - 253046 252921 126 2 0 110 50 82 0.950 6.76 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 44913 44845 69 0 0 98 47 66 0.886 2.58 S.002 Init - 156572 156552 21 2 0 72 111 12 0.860 1.62 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583r:77898258_78154169|GENSCAN_predicted_peptide_1|570_aa MTQTFTSSPMSSLMVPEGGESVLSAAYLFVKSLNSASYLYEVMERPRHGRLAWHGTQDKT TMVTSFTNGGLLRGWLVYQHDDSETTEDDIPFVATRQGESSGDVAWEETISRIFHVAQGG WRLLTTDYVAFSDADSGFADAQLVSDGQHQATVLLEVQASEPYLRVANGSSLVVPQGGQG TIDMAVLHLDTSLDIRNGDEVHYHVTAGPRWGQLLWAGQPATAFSQQDLLDGAVLYSHNS SLSPRDTMAFSVEMGPVHTDATLQVTIALEGPLAPLKLVRHKKIYVFQGEAAEIRRDQLE AAQEAVPPADIIFSVKSPPSASYLVMVSRGALADEPPSLDPVQSFSQEAVDTGRVLYLHS RPEAWSDAFSLDVASGLDAPLEGIRVELEVLPTVIPPEVQNFSVPEGGSFTLAPPLLCIA GPYFPTLPGLGLQVLEPPQHGALQKEDGPQAPSPGEWAGPEGSLGPGEEAQECQKALFSR HMWQGATAPIPAEALRSTDGDSGSEDLVYTIEQPSNGRVVLRGAPGTEVRSFTQAQLDSG LVLFSHRGTLDGGIHFGFSDDKHTSSGHFF >gi568815583r:77898258_78154169|GENSCAN_predicted_CDS_1|1713_bp atgacccagacgttcacatcctcaccaatgtcctccctcatggtgcctgagggtggtgag agtgtcctctctgctgcctacctctttgtcaagagtctcaacagtgccagctacctctat gaggtcatggagcggccccgccatgggaggttggcttggcatgggacacaggacaagacc actatggtgacatccttcaccaatggaggcctgttgcgtggctggctggtctaccagcat gatgactccgagaccacggaagatgatatcccatttgttgctacccgccagggcgagagc agtggtgacgtggcctgggaggagaccatcagccggatcttccatgtggcccagggtggg tggcggctgctgactacagactacgtggccttcagcgatgctgactcaggctttgctgac gcccagctggtgtccgatgggcaacaccaggccactgtgctgctggaggtgcaggcctcg gagccctacctccgtgtggccaacggctccagccttgtggtccctcaaggaggccagggc accatcgacatggccgtgctccacctggacaccagcctcgacatccgcaatggagatgag gtccactaccacgtcacagctggccctcgctggggacagctactctgggctggtcagcca gccactgccttctcccagcaggacctgctggatggggccgttctctatagccacaatagc agcctcagcccccgtgacaccatggccttctctgtggaaatggggccagtgcacacggat gccaccctacaagtgaccattgccctagagggcccactggccccgctgaagctggtccgg cacaagaagatctacgtcttccagggagaggcagctgagatcagaagggaccagttggag gcagcccaggaggcagtgccgccagcagacatcatattctcagtgaagagcccgccgagt gccagctacctggtgatggtgtcacgtggcgccttggcagatgagccacccagcctggac cccgtgcagagcttctcccaggaggcagtggacacaggcagggtcctgtacctgcactcc cgccctgaggcctggagcgatgccttctcgctggatgtggcctcaggcctggatgctccc ctcgagggcatccgtgtggagctggaggtgctgcccactgtcatcccaccggaggtgcaa aacttcagtgtccctgagggtggcagcttcaccctggcccctccactgctctgcatcgcc ggcccctacttccccactctcccgggcctcggcctgcaggtgctggagccaccccagcat ggagccctgcagaaggaggatggacctcaagcaccttctcctggagaatgggctgggcct gagggctccctggggccgggggaggaagcccaggaatgccagaaggctctgttttccagg catatgtggcagggggccactgcgcccatccctgcggaggctctgaggagcacggacggc gactctgggtctgaggatctggtctacaccatcgagcagcccagcaacggacgggtagtg ctgcggggggcgccgggcaccgaggtgcgcagcttcacgcaggcgcagctggacagcggg ctcgtgctgttctcacacagaggaaccctggatggaggcatccactttggcttctctgat gacaagcacacttcctccggacacttcttctga >gi568815583r:77898258_78154169|GENSCAN_predicted_peptide_2|1316_aa MSTQNKTIGIRPEDWNAKQVPPAFWLRDILCFQAQTYTCCHKPLRLFRGPMGSHTHAGPA CWAHLHPGDSSLISPWAKYPKQRPGQGVFIRVPKKHMQKGSACPLWPVRGKDSGRRRRRR QHFAPGTSSGLRSAPGLTRAGPALPEARFWFCGDLDCPDWVLAEISTLAKMVECTGCSLS GGGVLGYKKILKLTADAKFGEHPTEFTGPRQPWDLGLVPESGDVKATVAVLSFMLSGTAK HSVDGESLASELQQLGLPKEHAASLCRCYEEKQSPLQKHLRVCSLRTWSLAARVAEGTAE TVDPSAAPKTSGVQALACTVPHDGLGWRHPEEGGTHSGGAWSTKAPDCWGTHTHGPLLSD CWPFPSLDTRALGRGVTIPGLSSQAERLLGTRLLLLGLSSAAGSTLEHLEPKKLPLTFVI PGTHSLIMMKADRNPLRCLKEQGCGFQVSNGCRAPSAVLQALAVAIQLGGHLAGPLFQVD PLSSFGAVSLDIPIYLVFYYRTASVPETYIVKTLFKKLESVALVTPPLAQSHSSLVGDWR CSLMWPQPYLPPDPMMPEETRWNKLAAVKKKLKEYQQRKSPGVPAGAKTKKKKTGSSPET TTSGGCHSPGDSQYQELAVALESSSVTINQLNENIESLKQQKKQVEHQLEEITTCLAFLP KVQFQTINILTLEKADLKTTLYHTKRAARHFEEESKDLAGRLQYALQRIQELERALCAVS TQQQEEDRGHCLSSPDQNFSLFTIQSSSCREAVLQWRLQQTIKEQALLNAHVTQVTESLK QVQLEQDEYAKHIKGERARWQERMWKMSVEARTLKEEKKRDIHRIQELKRSLSELKNQMA EPLSLAPPAVTSVVEQLQDEAKHLRQEVEGLEGKLQSQVENNQALSVLSKEQKQRLQEQE ERLQEQEEMIREQEEMLREQEAQRVRELERLCEQNERLREQQKMLQEQGERLQKQEQRLR KQEERLQKEEERLQKQEKRLWDQEERLWKKEDRLQKQEERLVLFQNHKLDKQLAEPQCSF EDLNNEKKSALQLEQQVKELQEKLDEVKEMQYVATYQQLTSEKEALHRQLLLQTQLVDQL QQQEAWGKAHLEAASQRNQQLETRLSLVALPGEVAFFNSAGASSQEEQARLCGQRKVRRL CCLHLAHLVALAWKKPEAEAPAPGRTGDEFVCGESYRALKEAMVKLKVRESFTVYESQGA VPNTWHQEMEDVIRLAQKEEEMKVNLLELQELVLPLVGNHEGHGKFLTAAQNPADEPTPG PPAPQELGAAGEQDDFYEVSLDNNVEPAPGAAREGSPHDNPTAQQIVQLSPVMQDT >gi568815583r:77898258_78154169|GENSCAN_predicted_CDS_2|3951_bp atgagcactcagaacaaaaccataggaatacggccagaggactggaatgcaaaacaagtg ccccctgccttctggctgcgggacatcctctgcttccaagcccagacatacacctgctgc cacaagcccctgaggctcttcagaggaccaatgggcagccacacccatgctgggccagcc tgctgggcccacctgcaccctggtgattccagcctcatttctccatgggccaaatatcct aaacaaaggcctggacagggggtctttatccgagtgcccaaaaagcacatgcagaaaggc tctgcatgccccctgtggcctgtcagagggaaggattctgggagacggaggaggagacgc cagcatttcgcccccggcacctccagcgggctgcggtccgctcccggcctcactcgagca ggccccgccctcccagaggccaggttctggttctgtggtgatctggactgtcccgactgg gtgctggcagagatcagcacgctggccaagatggttgagtgcacagggtgtagtctgagt ggaggaggggtgttggggtataagaagatcctgaaactcacggctgatgccaagtttggt gagcaccccactgagttcacaggccccaggcaaccctgggacctcggcctggtgcctgag tcaggcgatgtgaaggccacagtggcagtgctgagtttcatgctctccggaactgccaag cacagtgtcgatggcgaatccttggccagtgaactgcagcagctggggctgcccaaagag cacgcggccagcctgtgccgctgttatgaggagaagcaaagccccttgcagaagcacttg cgggtctgcagcctacgcacgtggtccctggcagcacgagtggcagaagggacagcagag actgtggacccctcagctgcacctaagacctctggtgtgcaggccctggcgtgcacagtc ccacacgacggcctgggctggcgccaccctgaagaaggtggcacccactctggcggggct tggtccaccaaggcccctgactgctgggggacccacacccacggcccccttctttctgac tgctggcccttccccagcctggatactcgagccctgggcagaggggtaacgatcccagga ctgtcttcgcaagcagagaggctacttggcaccaggctcttactccttgggctgagctca gcagcagggtctaccttggagcacctggagcccaagaagctgcccctcacctttgtgatc ccagggacccactccctgatcatgatgaaggcagaccgaaacccgctgaggtgcctgaag gagcaggggtgtggattccaggtttccaatgggtgtcgggccccctctgctgtcctgcag gctctggcggtggccatccagcttggtggccatctggctggtccactcttccaggtggac cctctgtcctcatttggtgcagtaagtttggacattccgatctacttggtgttttattat agaactgctagtgtgcctgagacttacattgtgaagacactttttaaaaaacttgagagt gtggccctggttacgcctcctctggctcagtcacacagcagccttgtaggtgactggagg tgttcgctgatgtggccccaaccctacctccctcccgaccccatgatgccagaagaaact cgatggaacaaattggcagcagtcaagaaaaagctaaaagaatatcagcaaaggaagagc cctggtgttccagcaggagcaaagacaaaaaagaaaaaaactggcagtagccctgagaca accacttctggtggttgccactcacctggggatagccagtaccaagaactagcagtagcc ctggagtcaagctccgtgacaatcaatcaactcaatgaaaacatagaatcattgaaacag cagaagaaacaagtggaacatcagctggaagaaataaccacttgcttggcttttctccca aaagttcaattccagacaatcaacatcctcacattggaaaaggcagacttgaagaccacc ctttaccatactaaacgtgctgcccgacacttcgaagaagagtccaaggatctggctggc cgcctgcaatacgccttgcagcgtattcaagaattggagcgggctctctgtgctgtgtct acacagcagcaggaagaggacagggggcactgcctgagctccccagatcaaaacttctca ctcttcaccatccagtcctcgagctgcagagaagcggtcctccagtggcggttacagcag accataaaggagcaggcactgctgaacgcacacgtgacacaggtgacagagtcactaaaa caagtccagctagagcaggacgaatatgctaaacacataaaaggagagagggcccggtgg caggagaggatgtggaaaatgtcggtggaggctcgaacattaaaggaagagaagaagcgt gacatacatcggatacaggagctgaagaggagcttgtccgaactcaaaaaccagatggct gagcccctatccctggcgcccccagcagtgacctctgtggtggaacagctacaagatgag gccaaacacctgaggcaggaggtggaaggtctggagggaaagctccaatcccaggtggaa aacaatcaggccttgagtgtcctgagcaaggaacaaaagcagagactccaggagcaggag gagagactccaggagcaggaggagatgatccgagagcaggaggagatgctccgagagcag gaggcgcagagggtgcgggagctggagagactgtgtgaacaaaacgagaggcttcgggag cagcagaagatgctacaggagcagggtgagaggctgcaaaagcaggagcagaggctacgc aagcaggaggagaggctgcaaaaggaggaggagaggctgcaaaagcaggaaaagaggctg tgggaccaggaggagaggctgtggaagaaggaggacaggctacaaaagcaggaggagagg ctcgtgctcttccagaaccacaagctcgacaagcagctggccgagccacagtgcagcttc gaggatctgaacaatgagaaaaagagcgcactgcagttggagcagcaagtaaaggagctg caggagaagctagacgaggtgaaggagatgcagtacgtggccacctatcagcagctgacc tctgagaaggaggcgctgcacaggcagttactgctgcagacccagctcgtggaccagctg cagcagcaggaagcttggggcaaagcgcacctagaagctgccagccagcggaaccaacag ctagagacccggttgagcctcgtggctctccctggagaagtggcatttttcaactccgct ggagccagttcccaggaggagcaggcccggctatgtgggcagcggaaggtgcgaaggctg tgctgcctgcacctggctcatctggtggccttggcctggaagaagccagaggcagaggcc ccagccccagggaggactggagatgagtttgtgtgtggggagagctaccgggccctgaag gaggccatggtgaagctgaaagtgagagagtccttcaccgtatatgaaagccagggggca gtgccaaacacgtggcaccaggagatggaagatgtcatcaggctggcccagaaggaggag gagatgaaggtgaatctgctggagctgcaagagctggtgttgccccttgtgggcaaccat gaggggcatggcaaattcctcaccgctgcccagaaccctgctgatgagcccactccaggg cccccagccccccaggaactgggggctgccggtgagcaggatgatttttatgaagtgagc ctggacaacaatgtggagcctgcaccaggagcggccagggagggttctccccatgacaac cccactgcacagcagatcgtgcagctgtctcctgtaatgcaggacacctag >gi568815583r:77898258_78154169|GENSCAN_predicted_peptide_3|1067_aa MCQPGLGRPPQGPRLHAHCLVVLGLGSLVFATPQQQEYVVDTDGTCSCLPVSPPSLPGNA SWKKAGVDRAAPPASSHPISSHGWGLCLDDPPAKDIIDFPLVLPGIPYAVSHQCRLQYGA YSAFCEDMDWCLNGECVPVGFRPEAVDGGWSSWSAWSICSWSCGMGVQSAERQCTQPTPK YKGRYCVGERKRFRLCNLQACSAGRPSFRHVQCSHFDAMLYKGQLHTWVPVVNDAPQSAA LQIAISPAGASSMLVLLGSLQPAMDNVGCDFEIDSGAMEDRCGVCHGNGSTCHTVSGTFE EAEGLGYVDVGLIPAGAREIRLQEVAEAANFLALRSEDPEKYFLNDGWTIQWNGGYQVAG TTFTYARRGNWENLTSPGPTKEPVWIQESNPGVHYEHTIHREAGGHSEVPLPKFSWHCGP WTKCTVTCGRGGGAEKAEPPAEGVLQLGFNRQYGSECPRHSQRGGYIGEKGVYENQEDSE EVTVEVCLHPDTGSVQRQSVYCSERQAGPVDEEHCHPLGRPDDRQRKCSEQPCPARCPGS ATGPYIWVPGDPGGEQRELGFHSLPKAEALTPGHPSPQCSVTCGEGTQRRNVLCTNDIGV PCDEAQQPASKPGTMGNAIEEEAPELDLPGPMLVDDFYYDYNCINFHEDLSYGPSEEPDL DLVGTGDRTPPPHSHPAPSTGSPVPATEPPAAKEEGAPGPWSLSPWPSQAGCFPSPPSEQ TPGNPLINFLPEEDAPIGAPDLGLSSLPWPRVSTDGLQTLAAPDSQNDFPVSKDSQSQLP PPWRGRTNECSTTCGLGVVWRPVRCSSGRDEDCTPAGRPQPARHCHLQPCATWHSGNWSK GHTRGPARGCLGPQCSRSCGGGSSVQDVQCVDTRDLWTLQPFHCQPGPAKPPAHRPCGAQ PCLSCYTSSWRECSEACGGGEQQHLVTCPEPGLCEEALRPNTTWPCNTHPCTQWVVGPWG QCSAPCGSGVQRRLVKCVNTQTGLPEEDSDQCGHEAWPESSRPCGTEDCEPIESPRYKRD HLSFRFCETLHLVGCCQLPTVRTHCCRSCSPPSHGARSRGHQRVACH >gi568815583r:77898258_78154169|GENSCAN_predicted_CDS_3|3204_bp atgtgccagccgggcctgggccgccctccccagggcccccgcctccatgctcactgcctc gtggtgttggggctgggcagccttgtcttcgccacaccgcagcagcaggagtatgtggtg gatacagatggcacctgctcatgtctgccagtttccccgccatcgctgccagggaatgcc agctggaagaaggcaggagtagatagggctgcgccccctgcatcctcccatcccatctct agccatggatggggcctgtgcctggacgaccctcctgccaaggacatcatcgacttcccc ttggtgctgcctggcatcccctatgctgtgagccaccagtgccgcctccagtacggggcc tactctgccttctgcgaggacatggattggtgtctcaatggggagtgcgtacccgtgggc ttccggcccgaggccgtggatggtggctggtccagctggagcgcctggtccatctgctca tggagctgtggcatgggcgtacagagcgccgagcggcagtgcacgcagcctacgcccaaa tacaaaggcagatactgtgtgggtgagcggaagcgcttccgcctctgcaacctgcaggcc tgctccgctggccgcccctccttccgccacgtccagtgcagccactttgacgctatgctc tacaagggccagctgcacacatgggtgcccgtggtcaatgacgctccccagtctgcagcc ttgcagatagctatcagcccagccggagcctccagcatgttggtgctacttgggagcctc cagcctgccatggataacgtgggctgtgacttcgagattgactccggtgctatggaggac cgctgtggtgtgtgccacggcaacggctccacctgccacaccgtgagcgggaccttcgag gaggccgagggcctggggtatgtggacgtggggctgatcccagccggcgcacgcgagatc cgcctccaggaggttgccgaagctgccaacttcctggcactgcggagcgaggacccggag aagtacttcctcaatgatggctggaccatccagtggaacgggggctaccaggtggcaggg accaccttcacatacgcacgcaggggcaactgggagaacctcacgtccccgggtcccacc aaggagcctgtctggatccaggaaagcaaccccggggtacactacgagcacaccatccac agggaggcaggtggccacagcgaggtcccgctgcccaagttctcctggcactgtgggccc tggaccaagtgcacagtcacctgcggcagagggggtggagcagagaaggcagagccacca gcagagggggtcctgcagctgggcttcaacaggcagtatggaagtgagtgtccaaggcac agccagaggggaggctacataggagaaaagggggtgtatgagaaccaggaggacagcgag gaggtgacggtggaggtctgccttcatcctgacacaggcagtgtgcagaggcagagtgtg tactgctcagagcggcaggcaggacctgtggacgaggagcactgtcatcccctgggccgg cctgatgaccgccagaggaagtgcagcgagcagccctgccctgccaggtgtcctgggtct gccacaggcccctacatctgggtccctggagaccccgggggggagcagagggagctgggc tttcattctcttcccaaggctgaagccctaaccccaggtcacccctcaccccagtgctca gtgacatgtggggagggcactcagcgccgaaatgtcctctgcaccaatgacatcggtgtc ccctgtgacgaggcccagcagccagctagcaaaccaggcaccatgggcaacgccattgag gaggaggctccagagctggacctgccggggcccatgttggtggacgacttctactatgac tacaattgcatcaacttccacgaggatctgtcctatgggccctctgaggaacccgatcta gacctggtggggacaggggaccggacacccccaccacacagccatcctgcaccctccacg ggtagccccgtgcctgccacagagcctcctgcagccaaggaggagggggcaccgggacct tggtcccttagcccttggcccagccaggccggctgcttcccatccccgccctcagagcag acccctgggaaccctttgatcaatttcctgcctgaggaagatgcccccataggggcccca gatcttgggctctccagcctgccctggcccagggtttctaccgatggcctgcagacactt gctgcccctgatagccaaaatgatttcccagttagcaaggacagccagagccagctgccc cctccatggcggggcaggaccaatgagtgctctaccacctgcggcctgggtgtggtctgg aggccggtgcgctgtagctctggccgggatgaggactgtacccccgctggccggccccag cctgcccgccactgccacctgcagccctgtgccacctggcactcaggcaactggagtaag ggtcacacccgtgggcctgcacgtgggtgtcttggtccccagtgctcgcgcagctgcggt ggaggttcctcagtgcaggacgtgcagtgtgtggatacacgggacctctggacactgcag cccttccattgtcagcccgggcctgccaaaccgcctgcacaccggccctgcggggcccag ccctgcctcagctgttacacatcttcctggagggagtgctccgaggcctgtggcggtggt gagcagcagcatctggtgacttgcccggagccaggcctctgcgaggaggcgctgagaccc aacaccacctggccctgcaacacccacccctgcacgcagtgggtggtggggccctggggc cagtgctcagccccctgtggcagtggcgtccagcggcgcctagtcaagtgtgtcaacacc cagacagggctgcccgaggaagacagtgaccagtgtggccatgaggcctggcctgagagc tcccggccgtgtggcaccgaggattgtgagcccatcgagtctccccgctataagcgggac cacctgtccttcaggttctgcgagacgctgcacctagtgggctgctgccagctgcccact gtccgcacccactgctgccgctcgtgctctccgcccagccacggcgcccgctcccgaggc catcagcgggttgcctgccactga >gi568815583r:77898258_78154169|GENSCAN_predicted_peptide_4|963_aa MPGAGARAEEGGGGGEGAAQGAAAEPGAGPAREPARLCGYLQKLSGKGPLRGYRSRWFVF DARRCYLYYFKSPQDALPLGHLDIADACFSYQGPDEAAEPGTEPPAHFQVHSAGAVTVLK APNRQLMTYWLQELQQKRWEYCNSLDMVKWDSRTSPTPGDFPKGLVARDNTDLIYPHPNA SAEKARNVLAVETVPGELVGEQAANQPAPGHPNSINFYSLKQWGNELKNSMSSFRPGRGH NDSRRTVFYTNEEWELLDPTPKDLEESIVQEEKKKLTPEGNKGVTGSGFPFDFGRNPYKG KRPLKDIIGSYKNRHSSGDPSSEGTSGSGSVSIRKPASEMQLQVQSQQEELEQLKKDLSS QKELVRLLQQTVRSSQYDKYFTSSRLCEGVPKDTLELLHQKDDQILGLTSQLERFSLEKE SLQQEVRTLKSKVGELNEQLGMLMETIQAKDEVIIKLSEGEGNGPPPTVAPSSPSVVPVA RDQLELDRLKDNLQGYKTQNKFLNKEILELSALRRNAERRERDLMAKYSSLEAKLCQIES KYLILLQEMKTPVCSEDQGPTREVIAQLLEDALQVESQEQPEQAFVKPHLVSEYDIYGFR TVPEDDEEEKLVAKVRALDLKTLYLTENQEVSTGVKWENYFASTVNREMMCSPELKNLIR AGIPHEHRSKVWKWCVDRHTRKFKDNTEPGHFQTLLQKALEKQNPASKQIELDLLRTLPN NKHYSCPTSEGIQKLRNVLLAFSWRNPDIGYCQGLNRLVAVALLYLEQEDAFWCLVTIVE VFMPRDYYTKTLLGSQVDQRVFRDLMSEKLPRLHGHFEQYKVDYTLITFNWFLVVFVDSV VSDILFKIWDSFLYEGPKVIFRFALALFKYKEEEILKLQDSMSIFKYLRYFTRTILDARK LISISFGDLNPFPLRQIRNRRAYHLEKVRLELTELEAIREDFLRERDTSPDKGELVSDEE EDT >gi568815583r:77898258_78154169|GENSCAN_predicted_CDS_4|2892_bp atgccgggggccggagcccgggcggaggagggcggcggcggcggcgagggcgcggcgcag ggggcggccgcggagcccggggcgggtccggcgcgggagccagcgcggctgtgtggctat ctgcagaagctgtcgggcaagggccccctgcgtggctaccgcagccgctggttcgtgttc gacgcgcgccgctgctacctttactatttcaagagtccgcaggacgcgctgcccctcggc cacttggacatcgcggacgcctgcttcagctaccagggccccgacgaggcggcggagccg ggcacggagccgcccgcgcacttccaggtgcacagcgcgggagccgtcacggtgctcaag gctcccaatcgtcaactcatgacttactggttacaggagcttcagcagaagagatgggaa tattgtaacagtcttgacatggtcaagtgggacagcaggacctctccaactcccggggat tttcctaagggtcttgtagccagagataacactgatttaatttacccacacccaaatgct tctgcagaaaaagccagaaatgtcctagctgtggagactgtgcctggagagctggtggga gaacaagctgcaaatcagcccgccccagggcatccaaattccattaatttttactctttg aaacagtggggcaatgagctcaagaattcgatgtcttctttccgtcctgggagaggacat aatgatagtcggaggactgtgttttataccaatgaagagtgggaacttttagacccaacc cctaaggacctagaggagtccatagtacaggaagaaaagaagaagctgacccctgaagga aacaaaggagtaactggctcaggattcccctttgattttggacgtaacccctacaaagga aagcgccctttgaaagacataattgggtcgtacaaaaatcgtcacagcagtggtgaccct tcaagtgaaggcacatcaggcagtggcagcgtcagcatcaggaagccggcctccgaaatg caactgcaggtccagagccagcaggaagagctggaacagttaaagaaagacctgtccagt cagaaggagcttgttcgactgctccagcagacagtccggtcatcccagtatgacaagtat ttcacaagcagccggctctgtgagggggtcccaaaggacacgctcgagcttctgcaccaa aaggatgatcagattctgggccttaccagccagctggagaggttcagcttggagaaggag agtcttcagcaggaagtaaggacgctgaagagcaaagtgggcgagctcaacgagcagctg ggaatgctcatggagaccatccaagccaaggacgaggtcatcatcaagctcagcgagggc gagggcaacgggcctcctcccaccgtggcgcccagctccccttcggttgtgcctgttgcc agggaccagctggaactggacaggctgaaagataatctacaggggtacaaaacccaaaac aaatttctaaataaggagattttggaactctcagctctacgaagaaatgcagaaaggaga gagagggatctgatggcaaagtattctagcctggaagccaagctctgccagatagaaagt aaatacctgatattgctccaagaaatgaagacaccagtgtgctcagaagaccaggggccc acccgggaggtcatagcccagttgctggaggatgctctgcaggttgagagccaagagcag ccggagcaagcatttgttaaacctcatcttgtcagtgaatatgatatttatgggttcagg actgtacctgaggatgatgaggaagagaaattggttgccaaggtccgcgcgttggatctg aagactctctacctcacagaaaaccaggaagtctccactggggtcaagtgggaaaactat tttgcaagtacagtgaacagggagatgatgtgctctccagagttaaaaaacctcatccgt gcgggcattccccacgagcaccgttccaaggtgtggaagtggtgtgtggaccgtcacacc aggaagttcaaggacaacactgagcctggccacttccagaccttgctgcagaaggcgctg gagaaacagaacccagcctccaagcagattgagctggacttgctgcgaactctgcccaac aacaaacattactcctgccccacctcagaaggcatacagaagttacgcaatgtcctcctc gccttctcctggcggaatccagatatcggctactgtcaaggcctaaacaggttggtggca gtggccctcctgtacctggaacaagaagatgctttctggtgtctcgttaccatagtggaa gttttcatgcctcgagactattatacaaagactcttttaggatcccaggtggaccagcgg gtgttcagagaccttatgagtgagaagctgcctcggttgcatggccactttgaacagtac aaagtcgactacactctcatcactttcaactggtttctggtggtatttgtggatagtgtc gttagtgacatcctctttaaaatatgggactctttcctttatgaaggaccaaaggttatt ttccgttttgctctggcactttttaagtacaaggaagaggagattttgaaattgcaagat tcgatgtctatatttaagtatctccgctacttcactcgcactatccttgatgctaggaag ctgatcagtatctcctttggggacctgaaccctttccccctacgccagatccggaaccga cgcgcctaccacttggagaaagtccggctggagctgaccgagctggaggccatccgtgag gacttcctgcgtgagcgggacaccagccctgacaagggtgagctggtcagtgacgaggag gaggatacctga >gi568815583r:77898258_78154169|GENSCAN_predicted_peptide_5|510_aa MEDSLKQLSLGRDPEGAGDSQALAELQELALKWFMETQAPFILQNGALPPWFHGFITRKQ TEQLLRDKALGSFLIRLSDRATGYILSYRGSDRCRHFVINQLRNRRYIISGDTQSHSTLA ELVHHYQEAQLEPFKEMLTAACPRGHLVWGHDPYWKKGDCGNVLPEVRDLVGMLQSLTSS LRQAAGDWLGMCDHIPAVSACSQPEDNDLYDAITRGLHQTIVDPENPPATAFLTVVPDKA ASPRSSPKPQVSFLHAQKSLDVSPRNLSQEESMEAPIRVSPLPEKSSSLLEESFGGPSDI IYADLRRMNQARLGLGTEGSGRHGPVPAGSQAYSPGREAQRRLSDGEQNRPDGLGPVLSG VSPDQGPTESPTSWGCSDAMGSLGATWRQEFPKLSQEAQPCSQGSSADIYEFIGTEGLLQ EARDTPDQEGSTYEQIPACWGGPARAPHPGASPTYSPWVHGYKRISGTPELSEPGNTYEQ IPATKSKETGRTHKPDKLRRLFFTYRKHKF >gi568815583r:77898258_78154169|GENSCAN_predicted_CDS_5|1533_bp atggaggacagcctaaagcagctcagcctggggagagatcctgagggggcaggggacagc caggccctggctgagctccaggagcttgccctgaagtggttcatggagacacaggccccc ttcattctgcagaacggtgccctgcctccctggtttcatggattcatcacccgcaagcag acggagcagctactcagggacaaagctcttggttccttccttatccgcctcagtgaccga gccactggctacatcttgtcctacaggggcagtgatcgctgccgacattttgtcatcaac cagcttcgaaaccggcgttacatcatctcaggagacacccagagccacagcaccctggct gagcttgtgcaccattaccaggaggcacagctcgagcccttcaaagagatgctgactgct gcctgcccccgggggcacttggtgtggggtcatgatccctattggaaaaagggagactgt ggcaatgtcttacctgaggtcagagacctggtgggaatgttgcagtctcttaccagcagt ctcaggcaggcagctggagactggctgggcatgtgtgaccacatacctgccgtatccgca tgctcccagccagaggacaatgatctgtatgatgccatcacccggggcctccaccagacc atcgtggacccagaaaacccacctgccacggcattcctcacagtggtccccgacaaggcc gccagcccccgctcttctccaaagccccaggtctccttcctccatgcacagaaaagcctg gatgtgagtccccggaacctctcccaggaggaaagcatggaggctcccatcagagtgtct ccactccctgagaagagttcctccctcctggaagagtcttttggaggccccagtgacatc atctatgcagacctgaggaggatgaaccaggcacggctaggcttgggcacagaggggtcc ggcaggcatgggccagttccagctggcagccaggcctactccccaggcagggaggcccaa aggagactctcagatggagaacagaacaggcctgatggcctggggcctgtcctttctggg gtgagcccagaccagggtcccacagagtctcccacttcctggggatgttctgatgccatg ggatccctgggggctacctggaggcaggagtttccaaagctgagccaagaggctcagccc tgctcccagggcagctctgcagatatctatgagttcatcgggacagaaggcctcctgcaa gaggccagggacacaccagaccaagaaggcagcacctatgagcagatcccagcttgctgg ggtggcccagccagggccccacatcctggggccagtcccacatatagcccatgggtccat ggctacaagaggatctcagggaccccagagctctcagagcctgggaacacctatgaacag atcccagcaaccaagagcaaggagactggacggacacacaagcctgacaagcttcggagg ctcttcttcacgtacaggaagcacaaattctga >gi568815583r:77898258_78154169|GENSCAN_predicted_peptide_6|187_aa MGNKQTIFTEEQLDNYQDCTFFNKKDILKLHSRFYELAPNLVPMDYRKSPIVHVPMSLII QMPELRENPFKERIVAAFSEDGEGNLTFNDFVDMFSVLCESAPRELKANYAFKIYDFNTD NFICKEDLELTLARLTKSELDEEEVVLVCDKVIEEADLDGDGKLGFADFEDMIAKAPDFL STFHIRI >gi568815583r:77898258_78154169|GENSCAN_predicted_CDS_6|564_bp atggggaacaagcagaccatcttcaccgaagagcagctagacaactaccaggactgcacc ttcttcaataagaaggacatcctcaagctgcattcgcgattctatgagctggcccccaac ctcgtcccaatggactacaggaagagccccatcgtccacgtgcccatgagcctcatcatc cagatgccagagctccgggagaatcccttcaaagaaaggatcgtggcggcgttttccgag gatggtgaggggaacctcactttcaacgactttgtggacatgttttccgtgctctgcgag tcggctccccgagagctcaaggcaaactatgccttcaagatctatgacttcaacactgac aacttcatctgcaaggaggacctggagctgacgctggcccggctcactaagtcagagctg gatgaggaggaggtggtgcttgtgtgcgacaaggtcattgaggaggctgacttggacggt gacggcaagctgggctttgctgacttcgaggacatgattgccaaggcccctgacttcctc agcactttccacatccggatctga >gi568815583r:77898258_78154169|GENSCAN_predicted_peptide_7|108_aa MPGHNLKWKLNRGTVLIETGIQLSTSTILGSASEPPSAPIPKAQVSSTEKLRNCIDDLKP FPALASELSRRAKALQIAGFPPMKATAAAAVRKRPPHARSPPLHGHAP >gi568815583r:77898258_78154169|GENSCAN_predicted_CDS_7|327_bp atgcctggccacaacctgaaatggaaactcaacagaggaactgtcctgattgaaaccggc atccagctcagcacctccaccatcctggggtcagcctctgagccaccttctgctccaatc cctaaagcacaggttagttccactgagaagctacggaactgcatagatgatctgaaaccc tttccagctctggcatctgagctgtcaaggagagctaaagcgctccagattgcaggattc ccaccaatgaaagcaacagccgcagcggcagtgcgcaagcgcccaccccacgctcggagc ccgcccctgcacggccacgccccctga