GENSCAN 1.0 Date run: 4-Nov-116 Time: 21:57:53 Sequence gi568815579f:19365346_19605471 : 240126 bp : 52.13% C+G : Isochore 3 (51 - 57 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 16837 16898 62 0 2 75 70 55 0.610 3.07 1.02 Intr + 20225 20500 276 2 0 73 28 131 0.431 2.77 1.03 Intr + 20679 20793 115 0 1 71 109 88 0.828 10.15 1.04 Intr + 38343 38372 30 2 0 105 94 8 0.009 1.91 1.05 Intr + 40653 40674 22 2 1 61 131 28 0.015 2.00 1.06 Intr + 44780 44800 21 0 0 117 109 12 0.011 4.20 1.07 Intr + 55336 55406 71 2 2 117 59 52 0.027 4.59 1.08 Intr + 60043 60088 46 0 1 86 57 25 0.014 -2.33 1.09 Intr + 63757 63898 142 2 1 44 77 188 0.206 13.22 1.10 Intr + 78219 78312 94 0 1 64 91 68 0.023 5.17 1.11 Intr + 86008 86211 204 0 0 22 81 96 0.060 2.22 1.12 Intr + 86861 86953 93 1 0 113 75 -9 0.033 0.96 1.13 Intr + 91781 91884 104 1 2 35 105 49 0.007 0.77 1.14 Intr + 96942 97086 145 0 1 84 59 37 0.216 1.19 1.15 Intr + 99995 100269 275 1 2 81 109 269 0.780 25.17 1.16 Intr + 105230 105342 113 2 2 14 34 64 0.011 -5.87 1.17 Intr + 108859 108948 90 2 0 49 79 88 0.141 4.46 1.18 Intr + 109806 109892 87 1 0 98 58 50 0.525 3.44 1.19 Intr + 123275 123299 25 0 1 108 92 37 0.018 3.67 1.20 Intr + 126961 127093 133 1 1 72 89 187 0.340 18.35 1.21 Intr + 127236 127367 132 2 0 131 100 127 0.999 19.55 1.22 Intr + 128949 129038 90 2 0 129 82 48 0.995 8.99 1.23 Intr + 130409 130540 132 1 0 62 115 82 0.996 9.65 1.24 Intr + 130707 130874 168 2 0 61 94 152 0.995 13.76 1.25 Intr + 133098 133377 280 2 1 77 113 393 0.857 38.29 1.26 Intr + 135773 136071 299 0 2 96 99 554 0.769 54.44 1.27 Intr + 136624 136698 75 0 0 80 71 67 0.950 4.41 1.28 Intr + 136986 137181 196 2 1 131 94 186 0.999 23.01 1.29 Term + 139999 140129 131 2 2 116 39 208 0.832 17.35 1.30 PlyA + 141612 141617 6 1.05 2.02 PlyA - 143425 143420 6 1.05 2.01 Sngl - 150082 149261 822 1 0 106 43 1985 0.992 191.98 2.00 Prom - 150242 150203 40 -3.11 3.00 Prom + 150683 150722 40 -10.12 3.01 Init + 150837 150987 151 2 1 97 60 137 0.373 11.37 3.02 Intr + 160837 160915 79 2 1 95 91 114 0.894 11.50 3.03 Intr + 161936 162007 72 2 0 79 54 98 0.958 4.52 3.04 Intr + 162356 162425 70 2 1 107 98 173 0.999 19.88 3.05 Intr + 162662 162777 116 1 2 116 37 176 0.028 15.05 3.06 Intr + 164019 164172 154 0 1 102 97 48 0.740 7.69 3.07 Intr + 165103 165269 167 0 2 92 34 43 0.692 -1.43 3.08 Intr + 167287 167395 109 1 1 93 77 228 0.995 22.99 3.09 Intr + 169689 169799 111 2 0 70 78 87 0.612 6.98 3.10 Intr + 169992 170105 114 2 0 114 60 220 0.999 22.95 3.11 Intr + 170184 170312 129 2 0 131 54 205 0.991 22.80 3.12 Intr + 170441 170519 79 1 1 80 60 -23 0.616 -6.28 3.13 Term + 171974 172179 206 0 2 84 47 369 0.990 30.26 3.14 PlyA + 172215 172220 6 1.05 4.00 Prom + 172507 172546 40 -9.36 4.01 Init + 173005 173068 64 0 1 107 78 179 0.999 18.06 4.02 Intr + 174334 174432 99 2 0 101 113 107 0.999 15.08 4.03 Intr + 174841 175131 291 2 0 34 66 549 0.401 44.95 4.04 Intr + 177030 177305 276 1 0 103 94 251 0.999 25.23 4.05 Intr + 177519 177627 109 1 1 41 93 93 0.999 4.94 4.06 Intr + 177903 178060 158 0 2 61 97 106 0.986 8.97 4.07 Intr + 178336 180633 2298 2 0 140 30 3917 0.489 379.44 4.08 Intr + 183160 183286 127 2 1 93 36 34 0.248 -1.06 4.09 Intr + 188910 189000 91 0 1 80 77 53 0.595 3.90 4.10 Term + 192208 192333 126 0 0 104 37 94 0.430 4.49 4.11 PlyA + 192406 192411 6 1.05 5.09 PlyA - 193974 193969 6 -0.45 5.08 Term - 197065 197025 41 2 2 100 46 44 0.619 -0.96 5.07 Intr - 197603 197483 121 2 1 -1 66 92 0.236 -1.33 5.06 Intr - 198270 198164 107 1 2 116 109 36 0.788 8.93 5.05 Intr - 199744 199588 157 1 1 114 117 158 0.992 21.40 5.04 Intr - 204329 204104 226 1 1 86 66 206 0.973 16.72 5.03 Intr - 204954 204764 191 0 2 65 71 239 0.707 18.80 5.02 Intr - 205488 205241 248 1 2 71 58 222 0.985 15.21 5.01 Init - 207350 207302 49 2 1 94 58 50 0.931 1.65 5.00 Prom - 207593 207554 40 -6.80 6.00 Prom + 209009 209048 40 -5.11 6.01 Init + 209950 210060 111 0 0 71 103 91 0.888 9.12 6.02 Intr + 213371 213459 89 1 2 39 51 89 0.090 -0.43 6.03 Term + 214240 214390 151 1 1 96 47 80 0.044 2.39 6.04 PlyA + 215412 215417 6 -0.45 7.07 PlyA - 215889 215884 6 1.05 7.06 Term - 216211 216056 156 1 0 66 54 86 0.001 1.25 7.05 Intr - 217590 217515 76 2 1 106 75 5 0.001 1.01 7.04 Intr - 222853 222741 113 1 2 9 111 60 0.010 0.08 7.03 Intr - 228084 227910 175 2 1 70 58 58 0.153 1.46 7.02 Intr - 230088 230058 31 1 1 54 91 33 0.150 -2.13 7.01 Intr - 234020 233947 74 1 2 112 96 92 0.932 11.94 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 40952 40819 134 0 2 86 42 125 0.835 6.26 S.002 Term - 55798 55689 110 2 2 90 48 120 0.826 7.17 S.003 Term + 162662 162781 120 1 0 116 47 206 0.945 18.08 S.004 Init + 163588 163646 59 0 2 80 76 75 0.873 6.27 S.005 Init + 193596 193667 72 2 0 73 75 77 0.884 3.92 S.006 Term + 193711 193839 129 0 0 83 49 82 0.897 2.29 S.007 Init + 214810 214874 65 0 2 69 94 92 0.899 6.55 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815579f:19365346_19605471|GENSCAN_predicted_peptide_1|1216_aa MRQEVQNQVKGEDTSEGPDPWMPVSHSVPPPRTRPGRRRDPENQPARALTGRHALSVMKL ATPSMGCGSWESWLDLIGLWRLHSERRASTRKCARRNLPSNRGSRAVKRAISVGRRGRHL RESPPRDATRPDRTAPPEAVGRRQRGRRATSLIKLTLIKTQGPADPRPASRGTFTGVDIG GSPLHEVLITVWVPSCEVWSHPKRTLCFVVTGHTVSSSVPTWVLDEVMRKRLVSMEHGAF GKWDARFQVASFSKKFQHRGPWICFVEASGREKEKGAACGGHEGGPRWPWTWVLDDSQVG CELEGVSQALIRLLVLQKFPSEFTHCPDVQQSTNQLLWLLWKLTLPCFWHEVEPTRKVAV PGQMLLLHGRHVFLVPFPVFLLPLASLDPTAQYVQHTCSVTDTLAEVGPLQGHGQGVVPA VTEPVIWSKCPVVAEDWPVAGPASLVLAVEEAPAKGAPCTRGLAPPPPAVPVLPEFRMTE EACRTRSQKRALERDPTEDDVESKKIKMERGLLASDLNTDGDMRVTPEPGAGPTQGLLRA TEATAMAMGRGEGLVGDGPVDMRTSHRTSRQNWQGMEDLKNIINSLSLTSVEYLVNAEYI LPIVGKGEEGEVDTGIAPNPDNHVSMGLSWCARDTAPGPPAGCLQECTSLNEMDDGLMWH SCRRNSMEVPGSDMKSERRPPSPDVIVLSDNEQPSSPRVNGLTTVALKETSTEALMKSSP EERERMIKQLKEELRLEEAKLVLLKKLRQSQIQKEATAQKPTGSVGSTVTTPPPLVRGTQ NIPAGKPSLQTSSARMPGSVIPPPLVRGGQQASSKLGPQASSQVVMPPLVRGAQQIHSIR QHSSTGPPPLLLAPRASVPSVQIQGQRIIQQGLIRVANVPNTSLLVNIPQPTPASLKGTT ATSAQANSTPTSVASVVTSAESPASRQAAAKLALRKQLEKTLLEIPPPKPPAPEMNFLPS AANNEFIYLVGLEEVVQNLLETQAGRMSAATVLSREPYMCAQCKTDFTCRWREEKSGAIM CENCMTTNQKKALKVEHTSRLKAAFVKALQQEQEIEQRLLQQGTAPAQAKAEPTAAPHPV LKQVIKPRRKLAFRSGEARDWSNGAVLQASSQLSRGSATTPRGVLHTFSPSPKLQNSASA TALVSRTGRHSERTVSAGKGSATSNWKKTPLSTGGTLAFVSPSLAVHKSSSAVDRQREYL LDMIPPRSIPQSATWK >gi568815579f:19365346_19605471|GENSCAN_predicted_CDS_1|3651_bp atgaggcaagaagttcagaatcaagtgaagggggaagacacaagtgaaggccctgatccc tggatgcctgtcagtcattccgtcccgcctcctcggacccgccccggccggcggcgcgat cccgagaaccagccggcccgcgcactcactggtcgtcatgcgctgtctgtcatgaagctg gccacaccttccatggggtgcgggtcctgggagtcctggctggacctcattggcttgtgg cgccttcactcggagagacgcgcctcaacccggaagtgtgcgaggcgaaatctgcctagc aaccggggaagccgggctgtgaagcgggcaatttcagtcggccgccgcgggcgccacctg agggagtcgcctccgcgggacgccacaagacctgaccggactgcgccgcccgaggccgtc ggccgccgtcagcgagggcgccgagcaacttcgttaattaaattgacattaatcaagacc cagggcccggccgacccgcggcccgcctccagaggcaccttcacaggtgtggatatcggc ggtagtcccttgcatgaagtgctgataacagtgtgggttccatcgtgtgaagtgtggagt catcccaagaggactctgtgctttgtggtcacaggtcacacagtctcctcctccgtgccc acgtgggtgttagatgaggtgatgcgcaagcgccttgtgtccatggagcatggagccttt gggaagtgggatgctcgatttcaagtagcaagtttcagcaaaaagttccagcatcgtggt ccatggatttgcttcgtggaagctagtgggagagagaaggagaaaggagcggcatgtgga ggacacgagggtggtccccgatggccgtggacctgggtgctggatgacagccaggtgggc tgtgaattagaaggggtgtcccaggcactgatcaggctcctcgtgctgcagaagtttcct tctgagttcacgcattgtccagatgtgcagcagtcaacaaatcagttattgtggcttctc tggaagctgacattaccatgtttttggcacgaagtagaaccaacacgcaaggttgcagtg ccaggtcagatgctcttacttcatgggcgccatgtgttccttgtccccttcccagtcttc ttgttgcccttggcttccctggatcccacagcccagtatgtgcagcacacctgctctgtg actgacactcttgcagaagtggggccacttcagggacatggacaaggtgttgtacctgct gtcacagagcctgttatctggagcaagtgtcctgtggtggccgaggattggccagtagca ggccctgcgtcattggtcttggcagtggaggaggctcctgccaaaggagccccctgcact cgggggctggccccaccccctccagctgtgcctgtccttcctgagttcagaatgaccgaa gaagcatgccgaacacggagtcagaaacgagcgcttgaacgggacccaacagaggacgat gtggagagcaagaaaataaaaatggagagaggattgttggcttcagatttaaacactgac ggagacatgagggtgacacctgagccgggagcaggtccaacccaaggattgctgagggca acagaggccacggccatggccatgggcagaggcgaagggctggtgggcgatgggcccgtg gacatgcgcacctcacacagaacatctagacaaaattggcaaggtatggaggacttgaag aacattattaactcacttagcctgacatctgtagaatacctggtgaatgcagaatacatc ctcccaattgtagggaagggagaggagggtgaggtcgacactggaattgcccccaacccc gacaaccacgtgagcatggggctctcctggtgtgcccgagacactgctcccggcccgccg gccggctgtctacaggaatgcacgtccttgaatgagatggacgatggactaatgtggcac agctgtcgacgaaattctatggaagttcctggcagtgacatgaagtccgagaggagaccc ccctcacctgacgtgattgtgctctccgacaacgagcagccctcgagcccgagagtgaat gggctgaccacggtggccttgaaggagactagcaccgaggccctcatgaaaagcagtcct gaagaacgagaaaggatgatcaagcagctgaaggaagaattgaggttagaagaagcaaaa ctcgtgttgttgaaaaagttgcggcagagtcaaatacaaaaggaagccaccgcccagaag cccacaggttctgttgggagcaccgtgaccacccctcccccgcttgttcggggcactcag aacattcctgctggcaagccatcactccagacctcttcagctcggatgcccggcagtgtc atacccccgcccctggtccgaggtgggcagcaggcgtcctcgaagctggggccacaggcg agctcacaggtcgtcatgcccccactcgtcaggggggctcagcaaatccacagcattagg caacattccagcacagggccaccgcccctcctcctggccccccgggcgtcggtgcccagt gtgcagattcagggacagaggatcatccagcagggcctcatccgcgtcgccaatgttccc aacaccagcctgctcgtcaacatcccacagcccaccccagcatcactgaaggggacaaca gccacctccgctcaggccaactccacccccactagtgtggcctctgtggtcacctctgcc gagtctccagcaagccgacaggcggccgccaagctggcgctgcgcaaacagctggagaag acgctactcgagatccccccacccaagcccccagccccagagatgaacttcctgcccagc gccgccaacaacgagttcatctacctggtcggcctggaggaggtggtgcagaacctactg gagacacaagcaggcaggatgtcggccgccactgtgctgtcccgggagccctacatgtgt gcacagtgcaagacggacttcacgtgccgctggcgggaggagaagagcggcgccatcatg tgtgagaactgcatgacaaccaaccagaagaaggcgctcaaggtggagcacaccagccgg ctgaaggccgcctttgtgaaggcgctgcagcaggaacaggagattgagcagcggctcctg cagcagggcacggcccctgcacaggccaaggccgagcccaccgctgccccacaccccgtg ctgaagcaggtcataaaaccccggcgtaagttggcgttccgctcaggagaggcccgcgac tggagtaacggggctgtgctacaggcctccagccagctgtcccggggttcggccacgacg ccccgaggtgtcctgcacacgttcagtccgtcacccaaactgcagaactcagcctcggcc acagccctggtcagcaggaccggcagacattctgagagaaccgtgagcgccggcaagggc agcgccacctccaactggaagaagacgcccctcagcacaggcgggacccttgcgtttgtc agcccaagcctggcggtgcacaagagctcctcggccgtggaccgccagcgagagtacctc ctggacatgatcccaccccgctccatcccccagtcagccacgtggaaatag >gi568815579f:19365346_19605471|GENSCAN_predicted_peptide_2|273_aa MSGDKLLSELGYKLGRTIGEGSYSKVKVATSKKYKGTVAIKVVDRRRAPPDFVNKFLPRE LSILRGVRHPHIVHVFEFIEVCNGKLYIVMEAAATDLLQAVQRNGRIPGVQARDLFAQIA GAVRYLHDHHLVHRDLKCENVLLSPDERRVKLTDFGFGRQAHGYPDLSTTYCGSAAYASP EVLLGIPYDPKKYDVWSMGVVLYVMVTGCMPFDDSDIAGLPRRQKRGVLYPEGLELSERC KALIAELLQFSPSARPSAGQVARNCWLRAGDSG >gi568815579f:19365346_19605471|GENSCAN_predicted_CDS_2|822_bp atgtcgggagacaaacttctgagcgaactcggttataagctgggccgcacaattggagag ggcagctactccaaggtgaaggtggccacatccaagaagtacaagggtaccgtggccatc aaggtggtggaccggcggcgagcgcccccggacttcgtcaacaagttcctgccgcgagag ctgtccatcctgcggggcgtgcgacacccgcacatcgtgcacgtcttcgagttcatcgag gtgtgcaacgggaaactgtacatcgtgatggaagcggccgccaccgacctgctgcaagcc gtgcagcgcaacgggcgcatccccggagttcaggcgcgcgacctctttgcgcagatcgcc ggcgccgtgcgctacctgcacgatcatcacctggtgcaccgcgacctcaagtgcgaaaac gtgctgctgagcccggacgagcgccgcgtcaagctcaccgacttcggcttcggccgccag gcccatggctacccagacctgagcaccacctactgcggctcagccgcctacgcgtcaccc gaggtgctcctgggcatcccctacgaccccaagaagtacgatgtgtggagcatgggcgtc gtgctctacgtcatggtcaccgggtgcatgcccttcgacgactcggacatcgccggcctg ccccggcgccagaaacgcggcgtgctctatcccgaaggcctcgagctgtccgagcgctgc aaggccctgatcgccgagctgctgcagttcagcccgtccgccaggccctccgcgggccag gtagcgcgcaactgctggctgcgcgccggggactccggctag >gi568815579f:19365346_19605471|GENSCAN_predicted_peptide_3|518_aa MAVAVSHFRPGPEVWDTASMAASKVKQDMPPPGGYGPIDYKRNLPRRGLSGYSMLAIGIG TLIYGHWSIMKWNRERRRLQIEDFEARIALLPLLQAETDRRTLQMLRENLEEEAIIMKDV PDWKVGESVFHTTRWVPPLIGELYGLRTTEEALHASHGFMWYTALELQPPLADMGRAELS SNATTSLVQRRKQAWGRQSWLEQIWNAGPVCQRLSPHYSNDQLDRKRMFQVAARGQSFGT EEEECIALPPSTSSLHPPAGAAVRAVTRWSTAEAAALERELLEDYRFGRQQLVELCGHAS AVAVTKAFPLPALSRKQRTVLVVCGPEQNGAVGLVCARHLRVFEYEPTIFYPTRSLDLLH RDLTTQCEKMDIPFLSYLPTEVQLINEAYGLVVDAVLGPGVEPGEVGGPCTRALATLKLL SIPLLGWTHTWLRLRVTPIPLMGKIEAATPGWDAETGSDSEDGLRPDVLVSLAAPKRCAG RFSGRHHFVAGRFVPDDVRRKFALRLPGYTGTDCVAAL >gi568815579f:19365346_19605471|GENSCAN_predicted_CDS_3|1557_bp atggcggtggcagtaagtcacttccgcccgggaccggaagtgtgggatactgcgagtatg gcggcgtcaaaggtgaagcaggacatgcctccgccggggggctatgggcccatcgactac aaacggaacttgccgcgtcgaggactgtcgggctacagcatgctggccatagggattgga accctgatctacgggcactggagcataatgaagtggaaccgtgagcgcaggcgcctacaa atcgaggacttcgaggctcgcatcgcgctgttgccactgttacaggcagaaaccgaccgg aggaccttgcagatgcttcgggagaacctggaggaggaggccatcatcatgaaggacgtg cccgactggaaggtgggggagtctgtgttccacacaacccgctgggtgccccccttgatc ggggagctgtacgggctgcgcaccacagaggaggctctccatgccagccacggcttcatg tggtacacggccttggagctgcagcccccacttgccgacatgggaagagcggagcttagc tcaaatgctaccacctcccttgtccagaggaggaaacaggcctggggaaggcagtcatgg ctagagcagatttggaacgcagggcctgtttgccagaggttgagcccccattattctaat gaccagttagacaggaagagaatgttccaagtggctgctagaggccaaagctttgggaca gaagaagaggaatgcatcgccctcccgccctccacttcctccctgcaccctcctgcgggg gcagcagtcagagctgtcacaagatggagcaccgcggaggcagccgccctggagcgggag ctgctggaggattatcgctttgggcggcagcagctcgtggagctgtgcggtcatgctagt gccgtggctgtgaccaaggcgttcccgttgcccgctctctcccggaagcagaggacggtg ctggtcgtgtgtggcccggagcagaacggggcagtggggctggtctgtgcccggcacctg cgggtgtttgagtatgaacccaccatcttctaccccacacgctcgctggacctgctgcat cgggacctgaccacccagtgcgagaagatggacatccccttcctgagctacctgcccact gaggtgcagctcattaacgaagcctatgggctggtggtggatgccgtactgggccccggc gtggagccgggcgaggtcgggggcccctgcacccgcgcgctggccacgctcaagctgctg tccatccccctccttggatggacccacacttggctgaggctaagggtcacacccatccct ctgatggggaagatcgaggctgccaccccaggctgggacgcagagaccggcagcgattcg gaggacgggctgcggcctgacgtgctggtgtctctcgcggcgcccaagcgctgcgctggc cgcttctccgggcgccaccacttcgtggccggcaggttcgtgcccgatgacgtgcgccgc aagttcgctctgcgcctgccgggatacacgggcaccgactgcgtcgcggcactgtga >gi568815579f:19365346_19605471|GENSCAN_predicted_peptide_4|1212_aa MASLLPLLCLCVVAAHLAGARDATPTEEPMATALGLERRSVYTGQPSPALEDWEARAVPA EASEWTSWFNVDHPGGDGDFESLAAIRFYYGPARVCPRPLALEARTTDWALPSAVGERVH LNPTRGFWCLNREQPRGRRCSNYHVRFRCPLGCSLDTCECPDHILLGSVVTPSGQPLLGA RVSLRDQPGTVATSDAHGTFRVPGVCADSRANIRAQMDGFSAGEAQAQANGSISVVTIIL DKLEKPYLVKHPESRVREAGQNVTFCCKASGTPMPKKYSWFHNGTLLDRRAHGYGAHLEL RGLRPDQAGIYHCKAWNEAGAVRSGTARLTVLAPGQPACDPRPREYLIKLPEDCGQPGSG PAYLDVGLCPDTRCPSLAGSSPRCGDASSRCCSVRRLERREIHCPGYVLPVKVVAECGCQ KCLPPRGLVRGRVVAADSGEPLRFARILLGQEPIGFTAYQGDFTIEVPPSTQRLVVTFVD PSGEFMDAVRVLPFDPRGAGVYHEVKAMRKKAPVILHTSQSNTIPLGELEDEAPLGELVL PSGAFRRADGKPYSGPVEARVTFVDPRDLTSAASAPSDLRFVDSDGELAPLRTYGMFSVD LRAPGSAEQLQVGPVAVRVAASQIHMPGHVEALKLWSLNPETGLWEEESGFRREGSSGPR VRREERVFLVGNVEIRERRLFNLDVPERRRCFVKVRAYANDKFTPSEQVEGVVVTLVNLE PAPGFSANPRAWGRFDSAVTGPNGACLPAFCDADRPDAYTALVTATLGGEELEPAPSLPR PLPATVGVTQPYLDRLGYRRTDHDDPAFKRNGFRINLAKPRPGDPAEANGPVYPWRSLRE CQGAPVTASHFRFARVEADKYEYNVVPFREGTPASWTGDLLAWWPNPQEFRACFLKVKIQ GPQEYMVRSHNAGGSHPRTRGQLYGLRDARSVRDPERPGTSAACVEFKCSGMLFDQRQVD RTLVTIMPQGSCRRVAVNGLLRDYLTRHPPPVPAEDPAAFSMLAPLDPLGHNYGVYTVTD QSPRLAKEIAIGRCFDGSSDGFSREMKADAGTAVTFQCREPPAGRPSLFQRLLESPATAL GDIRREMSEAAQAQARASALKPGIPPPPSWNLGYRPPPGKSIVYPPTKLGLPDHTPSVWE ERGNQCQEEDQAPAPADAPSPLAKVSGHRLAGLGVQVTKPLSLIVPTCYVETPISTHTAS RQFKSLVTPVSL >gi568815579f:19365346_19605471|GENSCAN_predicted_CDS_4|3639_bp atggcgtcgctgctgccactgctctgtctctgtgtcgtcgctgcgcacctggcgggggcc cgagacgccacccccaccgaggagccaatggcgactgcactgggcctggaaagacggtcc gtgtacaccggccagccctcaccagccctggaggactgggaagcgcgtgcggtgcccgca gaggccagcgagtggacgtcctggttcaacgtggaccaccccggaggcgacggcgacttc gagagcctggctgccatccgcttctactacgggccagcgcgcgtgtgcccgcgaccgctg gcgctggaagcgcgcaccacggactgggccctgccgtccgccgtcggcgagcgcgtgcac ttgaaccccacgcgcggcttctggtgcctcaaccgcgagcaaccgcgtggccgccgctgc tccaactaccacgtgcgcttccgctgcccactagggtgcagccttgacacctgtgaatgc ccggaccacatcctcctgggctcggtggtcaccccatctgggcaaccactgctaggagcc agggtctccctgcgagaccagcctggcactgtggccaccagcgatgctcacggaaccttc cgggtgcctggtgtctgtgctgacagccgcgccaacatcagggcccagatggatggcttc tctgcaggggaggcccaggcccaggccaacggatccatctctgtggtcaccatcatcctt gataagttggagaagccgtacctggtgaaacaccctgagtcccgagtgcgagaggctggc cagaatgtgactttctgctgcaaagcctccgggacccccatgcccaagaaatactcctgg ttccacaatgggaccctgctggacaggcgagctcatgggtacggggcccacctggagctg cggggactgcgcccagaccaggctggcatctaccactgcaaggcatggaatgaggcgggt gccgtgcgctcgggcactgcccggctcactgtacttgccccaggccagccagcctgcgac ccccggccccgagagtacctgatcaagctccctgaggactgtggtcagccaggtagtggc cctgcctacctggatgtgggcctctgtcccgacacccgctgccccagcctggcaggctcc agcccccgctgcggggacgccagctcccgctgctgctctgtgcgccgtctggagagaagg gagattcactgccctggctacgtcctcccagtgaaggtggtggcagagtgtggctgccag aagtgtctgccccctcgggggctggtccggggccgtgttgtggctgctgactccggggag ccgctacgcttcgccaggattctgctgggccaggagcccatcggcttcaccgcctaccag ggcgactttaccattgaggtgccgccctccacccagcggctggtggtgacttttgtggac cccagcggtgagttcatggacgctgtccgggtcttgccttttgatcctcgaggtgccggc gtgtaccacgaggtcaaggccatgcggaagaaagccccggtcattttacataccagccag agcaacacgatccccctgggcgagctggaagatgaggcgcccctgggcgagctggtcctg ccttctggcgctttccgcagagccgacggcaaaccctactcggggcctgtggaggcccgg gtgacgttcgtggacccccgagacctcacctcggcggcgtctgcccccagtgacctgcgc ttcgtggacagcgacggcgagctggctccactgcgcacctacggcatgttctccgtggac ctccgtgcgcccggctccgcggagcagctgcaggtggggccggtggccgtgcgggtggcc gccagccagatccacatgccaggccacgtggaggccctcaagctgtggtcgctgaacccc gagaccggcttgtgggaggaggagagcggcttccggcgcgaggggtcctcgggcccccgg gtgcgccgggaggagcgcgtcttcctggtgggcaacgtggagatccgggagcggcgcctg ttcaatctggacgtgcctgagcgccgccgctgcttcgtgaaggtgcgcgcctacgccaac gacaagttcacccccagcgagcaggtggagggcgtggtggtcacgctggtcaatctggag cccgcccccggcttctccgccaacccccgtgcctggggccgctttgacagcgcggtcacc ggccccaatggcgcctgcctccccgccttctgcgacgccgacaggccagacgcctacacc gccctggtcaccgccaccctgggcggcgaggagctggagccggccccttccttgccccgc ccactcccggccaccgtgggcgtcacccagccctacctggacaggctggggtaccgtcgg acggaccacgacgatcccgccttcaagcgtaacggcttccgcatcaacctcgccaagccc aggccaggtgaccccgccgaggccaatgggcctgtgtacccgtggcgcagcctgcgggaa tgccagggggccccggtgactgccagccacttccgcttcgccagggtggaggcggacaag tacgagtacaacgtggtccccttccgagagggcacacctgcctcctggactggcgatctc ctggcctggtggcccaacccgcaggagttccgggcctgcttcctcaaggtgaagatccag ggtccccaggagtatatggtccgctcccacaacgcagggggcagccacccacgcacccgc ggccagctctacggacttcgggatgcccggagtgtgcgagaccccgagcgtccgggcacc tcggcagcctgcgtggagttcaagtgcagcgggatgctgttcgaccagcggcaggtggac aggacgctggtgaccattatgccccagggcagctgccggcgcgtggccgtcaacggactc cttcgggattacctgacccggcaccccccaccggtgcccgcggaggacccagctgccttc tccatgctggcccccctagaccctctgggccacaactatggcgtctacactgtcactgac cagagcccacgcttggccaaggagatcgccattggccgctgctttgatggttcctctgac ggcttctccagagagatgaaggctgatgccggcacagccgtcaccttccagtgccgggag ccaccggccggacgacccagcctcttccagaggctgctggagtccccggcgacagcactt ggtgacatccgcagggagatgagcgaggcggcgcaggcacaggcccgggcctcagccctc aagccaggaatcccaccccctccctcctggaacctgggctaccgtccgcctcctggtaaa tccattgtctaccccccaaccaaattggggctgccagaccacaccccctcagtctgggag gagagagggaaccagtgccaggaggaagaccaggcgcccgcccctgcggacgctcctagc cccttagctaaggtctcaggacaccggctggcggggctcggggtgcaggtcacaaagcct ctgagcctcattgtccccacctgttatgtggagacgcctatcagcacccacacagctagc aggcagttcaagtccctggtgactcctgtgtccctgtaa >gi568815579f:19365346_19605471|GENSCAN_predicted_peptide_5|379_aa MAFLHVGWAGLELPTSVVSIRGIQDEDPPDAQLLRLDNMLLAEGVCRPEKRGRGGAVARA GTATPGGCPNDNSIEHSDYRAKLSQIRQIYHSELEKYEQACREFTTHVTNLLQEQSRMRP VSPKEIERMVGAIHGKFSAIQMQLKQSTCEAVMTLRSRLLDARTTGKSYSELEVGVVLES QHINAYEGGCFCRRKRRNFSKQATEVLNEYFYSHLNNPYPSEEAKEELARKGGLTISQVS NWFGNKRIRYKKNMGKFQEEATIYTGKTAVDTTEVGVPGNHASCLSTPSSGSSGPFPLPS AGDAFLTLRTLASLQPPPGGGCLQSQRTDASCAVCGVPPGPGKHTSHTVLLWQLPPITMQ PPARDTRALVVPDASQSFA >gi568815579f:19365346_19605471|GENSCAN_predicted_CDS_5|1140_bp atggcgtttctccacgttggctgggctggtctcgaactcccgacctcagtggtaagcatc cgtggcattcaagacgaagatccccctgacgcccagctcctgaggctggataacatgctg ctggctgagggcgtgtgcaggcccgagaagagaggaagaggaggagcggtggccagggcc ggcacagcaacaccaggtggctgtccaaatgacaatagcattgagcactctgactacagg gccaagctgtcccagatccgacagatttaccactctgagctagagaaatatgaacaggcc tgtcgtgagttcaccacgcacgtcaccaacctcctccaggagcagagcaggatgaggcct gtctcccctaaggagattgagcgcatggtcggcgccattcacggcaagttcagcgccatc cagatgcagttgaagcagagcacctgtgaggcagtgatgaccctgcgttcgcggctgctc gatgccagaactactggaaaatcatactcagaactagaggtgggggtggtactggagtcc cagcatattaatgcctacgaaggcggctgtttttgcaggcgcaagcggcggaatttcagc aagcaggcgacggaagtgctgaatgagtatttttactcccatctgaacaacccttacccc agcgaagaagccaaagaagagctggccaggaagggcggcctcaccatctcccaggtctct aactggtttggcaacaaaagaatccggtataaaaagaacatggggaagtttcaagaagag gctaccatttacacgggtaaaacggctgtggataccacggaagttggggtcccagggaac cacgccagctgcctgtcaacacctagctccggctcctctggacccttcccgctgcccagc gctggggacgccttcctcaccctgcggactctggcctctctccagcctcctcctggggga ggctgcctgcagtcccagagaacggatgcctcctgtgctgtctgtggggtcccacctggt cctgggaagcacacgtcccacactgtcctgctgtggcagctgccccccatcaccatgcag cccccagccagggacacacgagcccttgttgtacccgatgccagtcagtcctttgcctaa >gi568815579f:19365346_19605471|GENSCAN_predicted_peptide_6|116_aa MSEDSPLGMMLMGCCAYNLGLTDLGDSHSKWTPLDSQKQYGNAVSRMFVSPPNSHVEALT PNVTIWSKWEPSLGPQPHREKANPWIKSDALTDPSGLPLSSHVAVSGHSAGRKPQL >gi568815579f:19365346_19605471|GENSCAN_predicted_CDS_6|351_bp atgagcgaggacagcccacttggaatgatgctcatggggtgctgtgcctacaatctgggc cttacagatctgggtgacagtcacagcaagtggacacccttggacagccagaaacaatat ggcaatgctgtgagccgaatgtttgtgtcacctccaaattcacatgtcgaagccctaact cccaacgtgactatttggagcaagtgggaaccatctctgggaccccagccccatagggag aaggctaacccatggattaaatcagatgccctgactgacccctcgggtctgccactttcc agtcatgtggctgtcagtggtcactccgctggccgcaagccccagttgtga >gi568815579f:19365346_19605471|GENSCAN_predicted_peptide_7|208_aa XKHALNCHRMKPALFSVLCEIKEKTALPDGASAFVWNGQVSAQRKLFMVLILHPVVPTGR LLKGNPMSRGPGVLLDLTHSSHISLWLKSAAGLMECTIQEKNRDGCLKIVYLGSSKTSSM NAKNTASRRGDWVCDGCGSLLRCPTAQTPKGEHAVRQERGMPAPSKWQCHRHPQDSNDAE GKQASYRCVRGNRFTLNSGLFHPSSYGP >gi568815579f:19365346_19605471|GENSCAN_predicted_CDS_7|627_bp nnaaagcatgctctgaattgccatcggatgaagcctgctctgttcagcgtgctctgtgag atcaaggaaaagacagcactccctgatggtgcttcagcattcgtgtggaatggccaagtc tcagcccagaggaaactgttcatggtcctgatcctgcaccctgtggtacccacgggccgc ctcttgaagggcaatcccatgagccggggtcctggcgtgcttctcgatttaacccacagt tctcacatttccctctggctgaagtctgcagcaggccttatggagtgcaccatccaggag aagaatagggatggctgcctgaagattgtctatttggggagctctaagacctcttctatg aacgcaaaaaatacggcttcaagaagaggtgattgggtgtgtgacggatgtggctcactt cttcggtgccccacagctcaaacccctaagggggagcatgcagtcagacaggaacgtgga atgccggctccttccaagtggcagtgccaccgccacccgcaagacagtaatgacgcagag gggaagcaggcatcctacagatgcgtgcgaggaaataggtttactctcaattctggactg ttccaccccagctcttacgggccctga