GENSCAN 1.0 Date run: 4-Nov-116 Time: 19:02:09 Sequence gi568815597r:161443015_161624456 : 181442 bp : 47.49% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 2007 2090 84 2 0 68 72 55 0.250 2.96 1.02 Intr + 3498 3598 101 2 2 57 22 147 0.810 3.91 1.03 Intr + 3838 4070 233 1 2 23 80 142 0.902 4.32 1.04 Intr + 4461 4560 100 1 1 81 68 93 0.637 5.77 1.05 Intr + 4975 5080 106 1 1 83 97 9 0.939 1.52 1.06 Intr + 5384 5553 170 1 2 71 86 100 0.942 6.84 1.07 Intr + 6203 6968 766 2 1 60 44 183 0.387 2.58 1.08 Intr + 10878 10978 101 2 2 57 22 147 0.769 3.91 1.09 Intr + 11218 11450 233 1 2 23 80 142 0.867 4.32 1.10 Intr + 11842 11941 100 2 1 81 68 93 0.796 5.77 1.11 Intr + 12356 12461 106 2 1 83 97 9 0.891 1.52 1.12 Intr + 12765 12934 170 2 2 71 86 100 0.906 6.84 1.13 Intr + 13584 14349 766 0 1 60 44 167 0.282 0.98 1.14 Intr + 18289 18389 101 0 2 57 22 147 0.758 3.91 1.15 Intr + 18629 18861 233 2 2 23 80 142 0.883 4.32 1.16 Intr + 19252 19351 100 2 1 81 68 93 0.637 5.77 1.17 Intr + 19766 19871 106 2 1 83 97 9 0.939 1.52 1.18 Intr + 20175 20344 170 2 2 71 86 100 0.942 6.84 1.19 Intr + 20994 21759 766 0 1 60 44 183 0.383 2.58 1.20 Intr + 25669 25769 101 0 2 57 22 147 0.771 3.91 1.21 Intr + 26009 26241 233 2 2 23 80 142 0.872 4.32 1.22 Intr + 26633 26732 100 0 1 81 68 93 0.712 5.77 1.23 Intr + 27147 27252 106 0 1 83 97 9 0.768 1.52 1.24 Intr + 27556 27725 170 0 2 71 86 84 0.750 5.24 1.25 Term + 28375 29347 973 1 1 60 47 185 0.622 3.13 1.26 PlyA + 29410 29415 6 -0.45 2.08 PlyA - 29768 29763 6 -8.47 2.07 Term - 30028 29808 221 2 2 73 43 185 0.180 9.70 2.06 Intr - 37005 36712 294 1 0 34 22 172 0.193 2.18 2.05 Intr - 37324 37217 108 2 0 78 89 42 0.785 3.56 2.04 Intr - 40417 40234 184 1 1 99 39 110 0.925 6.56 2.03 Intr - 41240 41115 126 2 0 95 56 56 0.774 3.98 2.02 Intr - 42958 42819 140 2 2 41 27 91 0.225 -1.52 2.01 Init - 48038 47966 73 2 1 88 38 63 0.401 2.53 2.00 Prom - 58324 58285 40 -3.56 3.00 Prom + 59974 60013 40 -5.26 3.01 Init + 62454 62538 85 2 1 64 92 39 0.510 2.88 3.02 Intr + 62973 62993 21 1 0 121 105 16 0.934 4.12 3.03 Intr + 63320 63577 258 0 0 69 77 227 0.676 17.13 3.04 Intr + 66806 67060 255 0 0 88 86 186 0.994 15.82 3.05 Intr + 67820 67942 123 0 0 100 72 175 0.999 17.66 3.06 Intr + 68752 68808 57 2 0 109 68 11 0.503 0.06 3.07 Intr + 70881 70918 38 1 2 125 105 23 0.997 5.68 3.08 Term + 74961 75134 174 2 0 49 43 285 0.982 17.86 3.09 PlyA + 75526 75531 6 1.05 4.00 Prom + 77347 77386 40 -3.86 4.01 Sngl + 81645 83576 1932 2 0 85 42 3418 0.999 330.82 4.02 PlyA + 83757 83762 6 1.05 5.08 PlyA - 83778 83773 6 1.05 5.07 Term - 87786 87604 183 0 0 103 49 105 0.668 5.64 5.06 Intr - 95164 95077 88 0 1 97 54 68 0.311 4.27 5.05 Intr - 97544 97439 106 0 1 1 30 120 0.237 -3.23 5.04 Intr - 101944 101687 258 2 0 97 100 115 0.869 11.03 5.03 Intr - 105664 105407 258 2 0 91 109 243 0.994 24.13 5.02 Intr - 106017 105997 21 1 0 110 105 28 0.940 4.22 5.01 Init - 106722 106683 40 0 1 81 97 61 0.694 4.78 5.00 Prom - 107988 107949 40 -6.66 6.00 Prom + 118303 118342 40 -6.16 6.01 Init + 122212 122214 3 0 0 72 115 0 0.178 1.10 6.02 Intr + 138352 138535 184 0 1 90 92 24 0.490 2.36 6.03 Term + 140076 140170 95 1 2 92 42 125 0.862 6.29 6.04 PlyA + 142137 142142 6 1.05 7.00 Prom + 143149 143188 40 -4.76 7.01 Init + 144468 144521 54 2 0 99 90 3 0.829 0.89 7.02 Intr + 145408 145428 21 0 0 113 119 -6 0.800 2.74 7.03 Intr + 146586 146805 220 2 1 -24 65 211 0.897 5.47 7.04 Intr + 148130 148384 255 0 0 98 86 243 0.997 22.52 7.05 Intr + 149115 149228 114 1 0 92 72 120 0.984 11.22 7.06 Intr + 150412 150468 57 2 0 94 109 24 0.764 3.96 7.07 Intr + 157926 158052 127 1 1 77 103 74 0.236 7.54 7.08 Intr + 163255 164284 1030 1 1 18 -86 2139 0.047 180.07 7.09 Term + 164298 165203 906 2 0 -34 42 1240 0.547 99.50 7.10 PlyA + 165397 165402 6 1.05 8.03 PlyA - 165418 165413 6 1.05 8.02 Term - 169391 169209 183 2 0 90 45 105 0.636 3.94 8.01 Intr - 176589 176410 180 0 0 94 28 99 0.366 4.56 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 163277 164284 1008 1 0 85 -86 2112 0.929 187.01 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:161443015_161624456|GENSCAN_predicted_peptide_1|2064_aa MANISIPDRVWSLQVIRLCQDPTNDTGQLTIDHYAMYTRYHTSQESHGYHVRMKKNIVML PGSRSARESWPIPWPAESARVAVVVVAGHLAAMGGESLTWPVGFAADKEWERGEHLTRKG CEADVAKVAPARPKGGMGGAHPPNTARLLPRRTGTQPQAKLRRELRDGDLRKRKPLPTPT LGRQLSGRRGGQRRPPGCGRPSRGREEKGPGACSPSTRRPPPATRHTPPVTCHPPPATVG TTYPDTQQSTLRSRWDPEPEPDPDRNSPGPCASARDPLPIALRTDALPTTSHALFPGPTH SERPRPPSTLFPAYLLRPPSPHSPKQTQPRFFPYSSFSLLPPTGLRPPPTALNRRCAAQR RPGLNSPPGFTLQLLTAEQQRATRSWSRTHVSHQRHAIQHPVLSSDPSDPGRAFHSADTL ARSPIPPRYLCSLPANTCLPAHLQPGRLPARGSGNPAHSRAGESKPGKTAQEESRAEALD PRHPPTNAWPRRDQLCATAHPHAGSRGLGRPSHTQRAFSRLTIDHYAMYTRYHTSQESHG YHVRMKKNIVMLPGSRSARESWPIPWPAESARVAVVVVAGHLAAMGGESLTWPVGFAADK EWERGEHLTRKGCEADVAKVAPARPKGGMGGAHPPNTARLLPRRTGTQPQAKLRRELRDG DLRKRKPLPTPTLGRQLSGRRGGQRRPPGCGRPSRGREEKGPGACSPSTRRPPPATRHTP PVTCHPPPATVGTTYPDTQQSTLRSRWDPEPEPDPDRNSPGPCASARDPLPIALRTDALP TTSHALFPGPTHSERPRPPSTLFPAYLLRPPSPHSPKQTQPRFFPYSSFSLLPPTGLRPP PTALNRRCAAQRRPGLNSPPGFTLQLLTAEQQRATRSWSRTHVSHQRHAIQHPVLSSDPS DPGRAFHSADTLARSPIPPRYLCSLPANTCLPAHLQPGRLPARGSGNPAHSLAGESKPGK TAQEESRAEALDPRHPPTNAWPRRDQLCATAHPHAGSRGLGRPSNTQRAFSRLTIDHYAM YTRYHTSQESHGYHVRMKKNIVMLPGSRSARESWPIPWPAESARVAVVVVAGHLAAMGGE SLTWPVGFAADKEWERGEHLTRKGCEADVAKVAPARPKGGMGGAHPPNTARLLPRRTGTQ PQAKLRRELRDGDLRKRKPLPTPTLGRQLSGRRGGQRRPPGCGRPSRGREEKGPGACSPS TRRPPPATRHTPPVTCHPPPATVGTTYPDTQQSTLRSRWDPEPEPDPDRNSPGPCASARD PLPIALRTDALPTTSHALFPGPTHSERPRPPSTLFPAYLLRPPSPHSPKQTQPRFFPYSS FSLLPPTGLRPPPTALNRRCAAQRRPGLNSPPGFTLQLLTAEQQRATRSWSRTHVSHQRH AIQHPVLSSDPSDPGRAFHSADTLARSPIPPRYLCSLPANTCLPAHLQPGRLPARGSGNP AHSRAGESKPGKTAQEESRAEALDPRHPPTNAWPRRDQLCATAHPHAGSRGLGRPSHTQR AFSRLTIDHYAMYTRYHTSQESHGYHVRMKKNIVMLPGSRSARESWPIPWPAESARVAVV VVAGHLAAMGGESLTWPVGFAADKEWERGEHLTRKGCEADVAKVAPARPKGGMGGAHPPN TARLLPRRTGTQPQAKLRRELRDGDLRKRKPLPTPTLGRQLSGRRGGQRRPPGCGRPSRG REEKGPGACSPSTRRPPPATRHTPPVTCHPPPATVGTTYPDTQQSTLRSRWDPEPEPDPD RNSPGPCASARDPLPIALRTDALPTTSHALFPGPTHSERPRPPSTLFPAYLLRPPSPHSP KQTQPRFFPYSSFSLLPPTGLRPPPTALNRRCAAQRRPGLNSPPGFTLQLLTAEQQRATR SWSRTHVSHQRHAIQHPVLSSDPSDPGRAFHSADTLARSPIPPRYLCSLPANTCLPAHLQ PGRLPARGSGNPAHSLAGESKPGKTAQEESRAEALDPRHPPTNAWPRRDQLCATAHPHAG SRGLGRPSHTQRAFSRVSQLRLCRSAPPLLSRAPASLPAQGAGPQVRAGGVGQSAGARRP RTWSAFLSRTPRLRDPFPIAAPAR >gi568815597r:161443015_161624456|GENSCAN_predicted_CDS_1|6195_bp atggcaaacatctctatccccgatagggtctggtccctgcaggtgatccgactgtgccaa gatcccacgaatgacacgggacagttgaccatcgatcactacgcgatgtacacgcgttac cacacgtcacaggaatcccatggatatcatgtccgcatgaagaaaaacatagtcatgctg ccaggaagccgctccgctcgagagtcctggcccatcccgtggccagccgagagtgcgagg gtggcggtggtggtggtggcgggtcacctggcggccatgggaggagaaagcctgacctgg ccagtgggctttgccgctgacaaagagtgggagcgaggcgagcacctgacaaggaagggc tgtgaagctgatgttgcgaaggtggcgccagcgaggccaaaaggagggatgggtggagca caccctccaaacaccgcgcgcctccttccccggcgcacaggcacgcagccacaggccaag ctgcgacgcgagctccgcgacggggacctccgcaaaagaaagcctcttccgacacccact ttgggccggcagctctctgggcgccgtgggggccagaggaggccgcccggctgcgggcga ccaagccgcgggcgagaagaaaaggggccgggcgcgtgctcgccttccacccgccgcccg ccgcccgccacccgccacacgccacccgtcacctgccacccgccccccgccaccgttggc acgacctaccccgacacccaacaaagcaccctgcgatcccgctgggacccggagccggag ccggaccccgacagaaacagcccaggaccatgcgccagcgcccgcgaccctctaccaatt gcccttcggacagacgccctccccaccacctcacacgccctcttccctggccccacacac agcgagcgaccgcgaccaccttccacgctcttccctgcctatctcctccgcccgccttct cctcactcgcccaaacagacacagcccagattcttcccctattcctccttttccctcctt cctcccaccggcctgcgcccaccgcccaccgccttgaatcgccgctgcgctgcccagagg cgtcctggcctgaacagcccgcccggtttcaccctccaacttctgaccgctgagcagcag cgagcgactcgctcgtggagccgcacacacgtctcccaccagaggcacgccatccagcat cctgtcctttcctccgacccctcggaccccggccgcgcattccattctgccgacacccta gccaggtcgccgatcccacctcgctacctgtgctcccttcccgctaacacctgcctgccg gcccacctgcagcccggacgcctgccggccagaggcagcgggaaccctgcacacagccgg gcaggcgagtccaaacccggaaagacagcccaagaggaatcacgagcggaagccctagat ccccgtcacccgcccacaaacgcctggccccgccgggaccagctctgcgccacagcgcat ccccacgcgggaagccgcggcctgggccgtcccagccacacccagcgcgccttctccagg ttgaccatcgatcactacgcgatgtacacgcgttaccacacgtcacaggaatcccatgga tatcatgtccgcatgaagaaaaacatagtcatgctgccaggaagccgctccgctcgagag tcctggcccatcccgtggccagccgagagtgcgagggtggcggtggtggtggtggcgggt cacctggcggccatgggaggagaaagcctgacctggccagtgggctttgccgctgacaaa gagtgggagcgaggcgagcacctgacaaggaagggctgtgaagctgatgttgcgaaggtg gcgccagcgaggccaaaaggagggatgggtggagcacaccctccaaacaccgcgcgcctc cttccccggcgcacaggcacgcagccacaggccaagctgcgacgcgagctccgcgacggg gacctccgcaaaagaaagcctcttccgacacccactttgggccggcagctctctgggcgc cgtgggggccagaggaggccgcccggctgcgggcgaccaagccgcgggcgagaagaaaag gggccgggcgcgtgctcgccttccacccgccgcccgccgcccgccacccgccacacgcca cccgtcacctgccacccgccccccgccaccgttggcacgacctaccccgacacccaacaa agcaccctgcgatcccgctgggacccggagccggagccggaccccgacagaaacagccca ggaccatgcgccagcgcccgcgaccctctaccaattgcccttcggacagacgccctcccc accacctcacacgccctcttccctggccccacacacagcgagcgaccgcgaccaccttcc acgctcttccctgcctatctcctccgcccgccttctcctcactcgcccaaacagacacag cccagattcttcccctattcctccttttccctccttcctcccaccggcctccgcccaccg cccaccgccttgaatcgccgctgcgctgcccagaggcgtcctggcctgaacagcccgccc ggtttcaccctccaacttctgaccgctgagcagcagcgagcgactcgctcgtggagccgc acacacgtctcccaccagaggcacgccatccaacatcctgtcctttcctccgacccctcg gaccccggccgcgcattccattctgccgacaccctagccaggtcgccgatcccacctcgc tacctgtgctcccttcccgctaacacctgcctgccggcccacctgcagcccggacgcctg ccggccagaggcagcgggaaccctgcacacagcctggcaggcgagtccaaacccggaaag acagcccaagaggaatcacgagcggaagccctagatccccgtcacccgcccacaaacgcc tggccccgccgggaccagctctgcgccacagcgcatccccacgcgggaagccgcggcctg ggccgtcccagcaacacccagcgcgccttctccaggttgaccatcgatcactacgcgatg tacacgcgttaccacacgtcacaggaatcccatggatatcatgtccgcatgaagaaaaac atagtcatgctgccaggaagccgctccgctcgagagtcctggcccatcccgtggccagcc gagagtgcgagggtggcggtggtggtggtggcgggtcacctggcggccatgggaggagaa agcctgacctggccagtgggctttgccgctgacaaagagtgggagcgaggcgagcacctg acaaggaagggctgtgaagctgatgttgcgaaggtggcgccagcgaggccaaaaggaggg atgggtggagcacaccctccaaacaccgcgcgcctccttccccggcgcacaggcacgcag ccacaggccaagctgcgacgcgagctccgcgacggggacctccgcaaaagaaagcctctt ccgacacccactttgggccggcagctctctgggcgccgtgggggccagaggaggccgccc ggctgcgggcgaccaagccgcgggcgagaagaaaaggggccgggcgcgtgctcgccttcc acccgccgcccgccgcccgccacccgccacacgccacccgtcacctgccacccgcccccc gccaccgttggcacgacctaccccgacacccaacaaagcaccctgcgatcccgctgggac ccggagccggagccggaccccgacagaaacagcccaggaccatgcgccagcgcccgcgac cctctaccaattgcccttcggacagacgccctccccaccacctcacacgccctcttccct ggccccacacacagcgagcgaccgcgaccaccttccacgctcttccctgcctatctcctc cgcccgccttctcctcactcgcccaaacagacacagcccagattcttcccctattcctcc ttttccctccttcctcccaccggcctgcgcccaccgcccaccgccttgaatcgccgctgc gctgcccagaggcgtcctggcctgaacagcccgcccggtttcaccctccaacttctgacc gctgagcagcagcgagcgactcgctcgtggagccgcacacacgtctcccaccagaggcac gccatccagcatcctgtcctttcctccgacccctcggaccccggccgcgcattccattct gccgacaccctagccaggtcgccgatcccacctcgctacctgtgctcccttcccgctaac acctgcctgccggcccacctgcagcccggacgcctgccggccagaggcagcgggaaccct gcacacagccgggcaggcgagtccaaacccggaaagacagcccaagaggaatcacgagcg gaagccctagatccccgtcacccgcccacaaacgcctggccccgccgggaccagctctgc gccacagcgcatccccacgcgggaagccgcggcctgggccgtcccagccacacccagcgc gccttctccaggttgaccatcgatcactacgcgatgtacacgcgttaccacacgtcacag gaatcccatggatatcatgtccgcatgaagaaaaacatagtcatgctgccaggaagccgc tccgctcgagagtcctggcccatcccgtggccagccgagagtgcgagggtggcggtggtg gtggtggcgggtcacctggcggccatgggaggagaaagcctgacctggccagtgggcttt gccgctgacaaagagtgggagcgaggcgagcacctgacaaggaagggctgtgaagctgat gttgcgaaggtggcgccagcgaggccaaaaggagggatgggtggagcacaccctccaaac accgcgcgcctccttccccggcgcacaggcacgcagccacaggccaagctgcgacgcgag ctccgcgacggggacctccgcaaaagaaagcctcttccgacacccactttgggccggcag ctctctgggcgccgtgggggccagaggaggccgcccggctgcgggcgaccaagccgcggg cgagaagaaaaggggccgggcgcgtgctctccttccacccgccgcccgccgcccgccacc cgccacacgccacccgtcacctgccacccgccccccgccaccgttggcacgacctacccc gacacccaacaaagcaccctgcgatcccgctgggacccggagccggagccggaccccgac agaaacagcccaggaccatgcgccagcgcccgcgaccctctaccaattgcccttcggaca gacgccctccccaccacctcacacgccctcttccctggccccacacacagcgagcgaccg cgaccaccttccacgctcttccctgcctatctcctccgcccgccttctcctcactcgccc aaacagacacagcccagattcttcccctattcctccttttccctccttcctcccaccggc ctgcgcccaccgcccaccgccttgaatcgccgctgcgctgcccagaggcgtcctggcctg aacagcccgcccggtttcaccctccaacttctgaccgctgagcagcagcgagcgactcgc tcgtggagccgcacacacgtctcccaccagaggcacgccatccaacatcctgtcctttcc tccgacccctcggaccccggccgcgcattccattctgccgacaccctagccaggtcgccg atcccacctcgctacctgtgctcccttcccgctaacacctgcctgccggcccacctgcag cccggacgcctgccggccagaggcagcgggaaccctgcacacagcctggcaggcgagtcc aaacccggaaagacagcccaagaggaatcacgagcggaagccctagatccccgtcacccg cccacaaacgcctggccccgccgggaccagctctgcgccacagcgcatccccacgcggga agccgcggcctgggccgtcccagccacacccagcgcgccttctccagggtcagccagctg cggctctgccgaagcgctcctccgctcctttctcgcgctccagcctccctaccagcccag ggggccggaccccaagtgcgagccggtggcgtgggtcagagcgcaggagcgaggcgacca cggacctggtctgcgtttctgagccgcacgccacggctgcgagacccgttccccatcgcc gcccccgctcgctga >gi568815597r:161443015_161624456|GENSCAN_predicted_peptide_2|381_aa MTPVGPRFIFAFKFKWFMAHTGEPAEKQSALQTAEKFGDEEYVSYSRPKRKRENMEGKEI GESSFPIGREVVRVSQYVDDILLCAPTEEASQEDTEALLNFLADKGYEVSKSKETGKPKD NCEQVVVQTSAAREDLSETPLEDTDCTLFKDGSSFVEQELHQAGYAAGTLNIMEPAQGRE TDRRGRQGKCVQNWWVLGLADFKNEATDPRGVKLQTFLVSVTALKGGGMDPKSEQQPDLL PRVKEQSFHSVEGDPSGLPLLAAHCWLGQPAFIPLSDPTCILLISPFYRELIGPFYRELI HPFYRELIGVKLQTFVISVTGLQGTADAKSEEQQDFLHRAKNQTFQSVEEELSRLPLLAA DCWVGQPTFIPLSDLIRILTH >gi568815597r:161443015_161624456|GENSCAN_predicted_CDS_2|1146_bp atgactccggttggtcctcgtttcatcttcgcattcaaattcaagtggttcatggctcat actggggaaccagctgaaaaacagtcagctctacagacagcagagaagtttggagatgag gaatatgtctcctatagtaggccaaaaaggaaaagagaaaatatggaaggcaaagaaata ggggaatcatcattcccaataggaagagaggtggtcagggtctcacaatatgtagatgac atcctgctctgtgccccaactgaggaagcttctcaggaagacacagaagctcttcttaat ttcctagctgacaaaggatatgaggtttcaaaatccaaggaaactgggaaacctaaagat aactgtgagcaagttgtggtacagacctctgcagccagggaggatctcagtgaaacccct ctagaagatacagattgtactctcttcaaggacggaagttcctttgtggagcaagaactc catcaggcaggatatgcagcaggcactctaaatattatggaacctgcccagggccgagag actgaccgccggggtcgccagggaaagtgtgtccagaattggtgggttcttggtctggct gacttcaagaatgaagccacagaccctcgtggagtgaagctgcagacctttctggtgagt gttacagctcttaaaggtggcggcatggacccaaagagtgagcagcagccagatttattg ccaagagtcaaagaacaaagcttccacagtgtggaaggggacccgagcggattgcccctg ctggctgcccactgctggcttgggcagcctgcttttattcccttatctgaccccacctgc atcctactgatcagtccattttacagagagctcattggtccattttacagagagctgatt catccattttacagagagctgattggagtgaagctgcagacctttgtaatcagtgttaca ggacttcaaggcacagcggatgcaaagagtgaagaacagcaagattttttgcatagagcg aaaaaccaaaccttccaaagtgtagaagaggaactgagtcggttgccgctgctggctgcc gactgctgggttgggcagcctacctttattcccttatctgacctcatccgcattctgacc cactga >gi568815597r:161443015_161624456|GENSCAN_predicted_peptide_3|336_aa MTMETQMSQNVCPRNLWLLQPLTVLLLLASADSQAAAPPKAVLKLEPPWINVLQEDSVTL TCQGARSPESDSIQWFHNGNLIPTHTQPSYRFKANNNDSGEYTCQTGQTSLSDPVHLTVL SEWLVLQTPHLEFQEGETIMLRCHSWKDKPLVKVTFFQNGKSQKFSHLDPTFSIPQANHS HSGDYHCTGNIGYTLFSSKPVTITVQVPSMGSSSPMGIIVAVVIATAVAAIVAAVVALIY CRKKRISALPGDPECREMGETLPEKPANSTDPVKAAQFEPPGRQMIAIRKRQLEETNNDY ETADGGYMTLNPRAPTDDDKNIYLTLPPNDHVNSNN >gi568815597r:161443015_161624456|GENSCAN_predicted_CDS_3|1011_bp atgactatggagacccaaatgtctcagaatgtatgtcccagaaacctgtggctgcttcaa ccattgacagttttgctgctgctggcttctgcagacagtcaagctgcagctcccccaaag gctgtgctgaaacttgagcccccgtggatcaacgtgctccaggaggactctgtgactctg acatgccagggggctcgcagccctgagagcgactccattcagtggttccacaatgggaat ctcattcccacccacacgcagcccagctacaggttcaaggccaacaacaatgacagcggg gagtacacgtgccagactggccagaccagcctcagcgaccctgtgcatctgactgtgctt tccgaatggctggtgctccagacccctcacctggagttccaggagggagaaaccatcatg ctgaggtgccacagctggaaggacaagcctctggtcaaggtcacattcttccagaatgga aaatcccagaaattctcccatttggatcccaccttctccatcccacaagcaaaccacagt cacagtggtgattaccactgcacaggaaacataggctacacgctgttctcatccaagcct gtgaccatcactgtccaagtgcccagcatgggcagctcttcaccaatggggatcattgtg gctgtggtcattgcgactgctgtagcagccattgttgctgctgtagtggccttgatctac tgcaggaaaaagcggatttcagctctcccaggagaccctgagtgcagggaaatgggagag accctccctgagaaaccagccaattccactgatcctgtgaaggctgcccaatttgagcca cctggacgtcaaatgattgccatcagaaagagacaacttgaagaaaccaacaatgactat gaaacagctgacggcggctacatgactctgaaccccagggcacctactgacgatgataaa aacatctacctgactcttcctcccaacgaccatgtcaacagtaataactaa >gi568815597r:161443015_161624456|GENSCAN_predicted_peptide_4|643_aa MQAPRELAVGIDLGTTYSCVGVFQQGRVEILANDQGNRTTPSYVAFTDTERLVGDAAKSQ AALNPHNTVFDAKRLIGRKFADTTVQSDMKHWPFRVVSEGGKPKVRVCYRGEDKTFYPEE ISSMVLSKMKETAEAYLGQPVKHAVITVPAYFNDSQRQATKDAGAIAGLNVLRIINEPTA AAIAYGLDRRGAGERNVLIFDLGGGTFDVSVLSIDAGVFEVKATAGDTHLGGEDFDNRLV NHFMEEFRRKHGKDLSGNKRALRRLRTACERAKRTLSSSTQATLEIDSLFEGVDFYTSIT RARFEELCSDLFRSTLEPVEKALRDAKLDKAQIHDVVLVGGSTRIPKVQKLLQDFFNGKE LNKSINPDEAVAYGAAVQAAVLMGDKCEKVQDLLLLDVAPLSLGLETAGGVMTTLIQRNA TIPTKQTQTFTTYSDNQPGVFIQVYEGERAMTKDNNLLGRFELSGIPPAPRGVPQIEVTF DIDANGILSVTATDRSTGKANKITITNDKGRLSKEEVERMVHEAEQYKAEDEAQRDRVAA KNSLEAHVFHVKGSLQEESLRDKIPEEDRRKMQDKCREVLAWLEHNQLAEKEEYEHQKRE LEQICRPIFSRLYGGPGVPGGSSCGTQARQGDPSTGPIIEEVD >gi568815597r:161443015_161624456|GENSCAN_predicted_CDS_4|1932_bp atgcaggccccacgggagctcgcggtgggcatcgacctgggcaccacctactcgtgcgtg ggcgtgtttcagcagggccgcgtggagatcctggccaacgaccagggcaaccgcaccacg cccagctacgtggccttcaccgacaccgagcggctggtcggggacgcggccaagagccag gcggccctgaacccccacaacaccgtgttcgatgccaagcggctgatcgggcgcaagttc gcggacaccacggtgcagtcggacatgaagcactggcccttccgggtggtgagcgagggc ggcaagcccaaggtgcgcgtatgctaccgcggggaggacaagacgttctaccccgaggag atctcgtccatggtgctgagcaagatgaaggagacggccgaggcgtacctgggccagccc gtgaagcacgcagtgatcaccgtgcccgcctatttcaatgactcgcagcgccaggccacc aaggacgcgggggccatcgcggggctcaacgtgttgcggatcatcaatgagcccacggca gctgccatcgcctatgggctggaccggcggggcgcgggagagcgcaacgtgctcattttt gacctgggtgggggcaccttcgatgtgtcggttctctccattgacgctggtgtctttgag gtgaaagccactgctggagatacccacctgggaggagaggacttcgacaaccggctcgtg aaccacttcatggaagaattccggcggaagcatgggaaggacctgagcgggaacaagcgt gccctgcgcaggctgcgcacagcctgtgagcgcgccaagcgcaccctgtcctccagcacc caggccaccctggagatagactccctgttcgagggcgtggacttctacacgtccatcact cgtgcccgctttgaggaactgtgctcagacctcttccgcagcaccctggagccggtggag aaggccctgcgggatgccaagctggacaaggcccagattcatgacgtcgtcctggtgggg ggctccacacgcatccccaaggtgcagaagttgctgcaggacttcttcaacggcaaggag ctgaacaagagcatcaaccctgatgaggctgtggcctatggggctgctgtgcaggcggcc gtgttgatgggggacaaatgtgagaaagtgcaggatctcctgctgctggatgtggctccc ctgtctctggggctggagacagcaggtggggtgatgaccacgctgatccagaggaacgcc actatccccaccaagcagacccagactttcaccacctactcggacaaccagcctggggtc ttcatccaggtgtatgagggtgagagggccatgaccaaggacaacaacctgctggggcgt tttgaactcagtggcatccctcctgccccacgtggagtcccccagatagaggtgactttt gacattgatgctaatggcatcctgagcgtgacagccactgacaggagcacaggtaaggct aacaagatcaccatcaccaatgacaagggccggctgagcaaggaggaggtggagaggatg gttcatgaagccgagcagtacaaggctgaggatgaggcccagagggacagagtggctgcc aaaaactcgctggaggcccatgtcttccatgtgaaaggttctttgcaagaggaaagcctt agggacaagattcccgaagaggacaggcgcaaaatgcaagacaagtgtcgggaagtcctt gcctggctggagcacaaccagctggcagagaaggaggagtatgagcatcagaagagggag ctggagcaaatctgtcgccccatcttctccaggctctatggggggcctggtgtccctggg ggcagcagttgtggcactcaagcccgccagggggaccccagcaccggccccatcattgag gaggttgattga >gi568815597r:161443015_161624456|GENSCAN_predicted_peptide_5|317_aa MWQLLLPTALLLLVSAGMRTEDLPKAVVFLEPQWYRVLEKDSVTLKCQGAYSPEDNSTQW FHNESLISSQASSYFIDAATVDDSGEYRCQTNLSTLSDPVQLEVHIGWLLLQAPRWVFKE EDPIHLRCHSWKNTALHKVTYLQNGKGRKYFHHNSDFYIPKATLKDSGSYFCRGLFGSKN VSSETVNITITQAAKSYHYNPSRKNQIPQNYPYSTKAEEDIGNHTTRRPVELAIWASVMD CLLPGSSIVTDRIRSGMLPAPQLFETPHWIRRVSDSATVIGAEDPRETGRHRVYLDMARP LYRSRGALEESPGKMEP >gi568815597r:161443015_161624456|GENSCAN_predicted_CDS_5|954_bp atgtggcagctgctcctcccaactgctctgctacttctagtttcagctggcatgcggact gaagatctcccaaaggctgtggtgttcctggagcctcaatggtacagggtgctcgagaag gacagtgtgactctgaagtgccagggagcctactcccctgaggacaattccacacagtgg tttcacaatgagagcctcatctcaagccaggcctcgagctacttcattgacgctgccaca gtcgacgacagtggagagtacaggtgccagacaaacctctccaccctcagtgacccggtg cagctagaagtccatatcggctggctgttgctccaggcccctcggtgggtgttcaaggag gaagaccctattcacctgaggtgtcacagctggaagaacactgctctgcataaggtcaca tatttacagaatggcaaaggcaggaagtattttcatcataattctgacttctacattcca aaagccacactcaaagacagcggctcctacttctgcagggggctttttgggagtaaaaat gtgtcttcagagactgtgaacatcaccatcactcaagcagctaagagttatcactacaac cctagtcggaaaaaccaaatacctcaaaattacccgtacagcactaaggcagaagaggac attgggaaccacacaacgcggagacctgtggagctggcaatatgggcaagtgtcatggac tgtctactgccaggaagctccattgtcaccgacaggatcagaagtggcatgctcccagct ccgcagctgttcgagactccgcactggatccgccgggtgtcggattcagcaacagtgatt ggtgcggaggatccgagggagacgggccgacacagggtctaccttgacatggctcgaccc ctctacagaagcagaggggcgctcgaagagtcgccagggaaaatggagccttaa >gi568815597r:161443015_161624456|GENSCAN_predicted_peptide_6|93_aa MGRIRQALREGCDCCALGASSLQGVMGILSFLPVLATESDWADCKSPQPWGHMLLWTAVL FLGTVLGLGDLSVNMTDSLPDRKYEVNELIVKQ >gi568815597r:161443015_161624456|GENSCAN_predicted_CDS_6|282_bp atgggtagaatccgccaagctttgagagaaggctgtgactgctgtgctctgggcgccagc tcgctccagggagtgatgggaatcctgtcattcttacctgtccttgccactgagagtgac tgggctgactgcaagtccccccagccttggggtcatatgcttctgtggacagctgtgcta ttcctgggcactgttctaggacttggagacttatcagtgaacatgactgacagccttcct gaccggaaatacgaggtcaatgaacttattgtgaagcaataa >gi568815597r:161443015_161624456|GENSCAN_predicted_peptide_7|927_aa MGGERWLTPVIPEFWEAELLLLGHLWINVLQEDSVTLTCRGTHSPESDSIPWFHNGNLIP THTQPSYRFKANNNDSGEYTCQTGQTSLSDPVHLTVLSEWLVLQTPHLEFQEGETIVLRC HSWKDKPLVKVTFFQNGKSKKFSRSDPNFSIPQANHSHSGDYHCTGNIGYTLYSSKPVTI TVQAPSSSPMGIIVAVVTGIAVAAIVAAVVALIYCRKKRISALSGYPECREMGETLPEKP AILDIKNLLFHIHTANTINQTTVMKRCSNMRNAYVTGYMRTIIIRQEASAMQAPRELAVG IDLGTTYSCVGVFQQGRVEILANDQGNRTTPSYVAFTDTERLVGDAAKSQAALNPHNTVF DAKRLIGRKFADTTVQSDMKHWPFQVVSEGGKPKVRVCYRGEDKTFYPEEISSMVLSKMK ETAEAYLGQPVKHAVITVPTYFSNSQRQATKDAGAIAGLKVLPIINEATAAAIAYGLDRR GAGKRNVLIFDLGGGTFDVSVLSIDAGVFEVKATAGDTHLGGEDFDNRLVNHFMEEFRRK HGKDLSGNKRALRRLRTACERAKRTPSSSTQATLEIDSLFEGVDFYKSITRARFEELCSD LFRSTLEPVEKALRDAKLDKAQIHDFGSTRIPKVQKLLQDFFNGKELNKSINPDEAVAYG AAVQAAVLMGDKCEKVQDLLLLDVAPLSLGLETAGGVMTTLIQRNATIPTKQTQTFTTYS DNQPGVFIQVYEVERAMTKDNNLLGRFELIGIPPAPHGVPQIEVTFDIDANGILSVTATD RSTGKANKITNDKGRLSKEEVERMVHEAEQYGAEDEAQRDRVAAKNSLEAHVFHVKGSLQ EESLRDKIPEEDRRKVQDKCQEVLAWLEHNQLAEKEEYEHQKRELEQICRPIFSRLYGGP GVPGGSSCSAQAHQGDPSTGPIIEEVD >gi568815597r:161443015_161624456|GENSCAN_predicted_CDS_7|2784_bp atgggcggggaacggtggctcacacctgtaatcccagaattttgggaggctgagctcctg ttgctgggacacctgtggatcaacgtgctccaagaggactctgtgactctgacatgccgg gggactcacagccctgagagcgactccattccgtggttccacaatgggaatctcattccc acccacacgcagcccagctacaggttcaaggccaacaacaatgacagcggggagtacacg tgccagactggccagaccagcctcagcgaccctgtgcatctgactgtgctttctgagtgg ctggtgctccagacccctcacctggagttccaggagggagaaaccatcgtgctgaggtgc cacagctggaaggacaagcctctggtcaaggtcacattcttccagaatggaaaatccaag aaattttcccgttcggatcccaacttctccatcccacaagcaaaccacagtcacagtggt gattaccactgcacaggaaacataggctacacgctgtactcatccaagcctgtgaccatc actgtccaagctcccagctcttcaccgatggggatcattgtggctgtggtcactgggatt gctgtagcggccattgttgctgctgtagtggccttgatctactgcaggaaaaagcggatt tcagctctctcaggataccctgagtgcagggaaatgggagagaccctccctgagaaacca gccattcttgacatcaagaatcttctgttccacatccacacagccaatacaattaatcaa accactgttatgaaaagatgtagcaacatgagaaatgcttatgttacaggttacatgaga acaatcatcatccgacaagaagcttcagccatgcaggccccacgggagctcgcggtgggc atcgacctgggcaccacctactcgtgcgtgggcgtgtttcagcagggccgcgtggagatc ctggccaacgaccagggcaaccgcaccacgcccagctacgtggccttcaccgacaccgag cggctggtcggggacgcggccaagagccaggcggccctgaacccccacaacaccgtgttc gatgccaagcggctgatcgggcgcaagttcgcggacaccacggtgcagtcggacatgaag cactggcccttccaggtggtgagcgagggcggcaagcccaaggtgcgcgtatgctaccgc ggggaggacaagacgttctaccccgaggagatctcgtccatggtgctgagcaagatgaag gagacggccgaggcgtacctgggccagcccgtgaagcacgcagtgatcaccgtgcccacc tatttcagtaactcgcagcgccaggccaccaaggacgcgggggccatcgcggggctcaag gtgctgccgatcatcaatgaggccacggcagcagccatcgcctatgggctggaccggcgg ggcgcgggaaagcgcaacgtgctcatttttgacctgggtgggggcaccttcgatgtgtcg gttctctccattgacgccggtgtctttgaggtgaaagccactgctggagatacccacctg ggaggagaggacttcgacaaccggctcgtgaaccacttcatggaagaattccggcggaag catgggaaggacctgagcgggaacaagcgtgccctgcgcaggctgcgcacagcctgtgag cgcgccaagcgcaccccgtcctccagcacccaggccaccctggagatagactccctgttc gagggcgtggacttctacaagtccatcactcgtgcccgctttgaggaactgtgctcagac ctcttccgcagcaccctggagccggtggagaaggccctgcgggatgccaagctggacaag gcccagattcatgacttcggctccactcgcatccccaaggtgcagaagttgctgcaggac ttcttcaacggcaaggagctgaacaagagcatcaaccctgatgaggctgtggcctatggg gctgctgtgcaggcggccgtgttgatgggggacaaatgtgagaaagtgcaggatctcctg ctgctggatgtggctcccctgtctctggggctggagacagcaggtggggtgatgaccacg ctgatccagaggaacgccactatccccaccaagcagacccagactttcaccacctactcg gacaaccagcctggggtcttcatccaggtgtatgaggttgagagggccatgaccaaggac aacaacctgctggggcgttttgaactcattggcatccctcctgccccacatggagtcccc cagatagaggtgacgtttgacattgatgctaatggcatcctgagcgtgacagccactgac aggagcacaggtaaggctaacaagatcaccaatgacaagggccggctgagcaaggaggag gtggagaggatggttcatgaagccgagcagtacggggctgaggatgaggcccagagggac agagtggctgccaaaaactcgctggaggcccatgtcttccatgtgaaaggttctttgcaa gaggaaagccttagggacaagattcccgaagaggacaggcgcaaagtgcaagacaagtgt caggaagtccttgcctggctggagcacaaccagctggcagagaaggaggagtatgagcat cagaagagggagctggagcaaatctgtcgccccatcttctccaggctctatggggggcct ggtgtccctgggggcagcagttgtagcgctcaagcccaccagggggaccccagcaccggc cccatcattgaggaggttgattga >gi568815597r:161443015_161624456|GENSCAN_predicted_peptide_8|120_aa TCGAGNMGKCHGLSTSRKLHCHRQDQKWHGKWYKKAHSGTVLKTSPFGGASHAKGIGLEK LPAPQLFETPHWIRRVSDSATVIGAEDPRETGRHRVYLDMARPLYRSRGALEESPGKMEP >gi568815597r:161443015_161624456|GENSCAN_predicted_CDS_8|363_bp acctgtggagctggcaatatgggcaagtgtcatggactgtctacctccaggaagctccat tgtcaccgacaggatcagaagtggcatggtaaatggtacaagaaagcccattcgggcaca gtcctgaagaccagcccttttggaggtgcttctcatgcaaagggaattgggctggaaaaa ctcccagctccgcagctgttcgagactccgcactggatccgccgggtgtcggattcagca acagtgattggtgcggaggatccgagggagacgggccgacacagggtctaccttgacatg gctcgacccctctacagaagcagaggggcgctcgaagagtcgccagggaaaatggagcct taa