GENSCAN 1.0	Date run: 7-Nov-116	Time: 02:28:08

Sequence gi568815588r:47368850_47579889 : 211040 bp : 46.32% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   5818   5928  111  0  0   86   84    44 0.422   4.01
 1.02 Intr +   9330   9464  135  2  0   89   21    85 0.215   2.66
 1.03 Term +  11116  11238  123  0  0   95   40    68 0.384   1.08
 1.04 PlyA +  12187  12192    6                               1.05

 2.12 PlyA -  14047  14042    6                               1.05
 2.11 Term -  15022  14988   35  2  2  125   46    22 0.641  -0.55
 2.10 Intr -  16776  16694   83  2  2  125   74    44 0.467   5.98
 2.09 Intr -  20267  20109  159  1  0  122   50    63 0.181   5.00
 2.08 Intr -  37250  37113  138  1  0  102   99    28 0.430   4.88
 2.07 Intr -  37968  37824  145  1  1   46   49    80 0.257  -0.76
 2.06 Intr -  41438  41347   92  1  2   14  100    81 0.243   1.54
 2.05 Intr -  45447  45238  210  2  0   37   83    97 0.266   2.13
 2.04 Intr -  45793  45585  209  1  2  120   97    32 0.252   5.18
 2.03 Intr -  46233  45861  373  2  1   57   51   154 0.627   3.66
 2.02 Intr -  46785  46657  129  2  0   57   94    40 0.593   1.31
 2.01 Init -  49153  47526 1628  1  2   59   55   498 0.445  35.33
 2.00 Prom -  59226  59187   40                              -2.46

 3.02 PlyA -  60730  60725    6                               1.05
 3.01 Sngl -  64110  63088 1023  0  0   88   43   404 0.999  32.97
 3.00 Prom -  72846  72807   40                              -2.36

 4.05 PlyA -  72971  72966    6                               1.05
 4.04 Term -  84425  84311  115  1  1   86   52    98 0.728   4.04
 4.03 Intr -  86590  86460  131  1  2  -38   68   152 0.531   0.09
 4.02 Intr -  86741  86650   92  0  2   50   89    43 0.779   0.31
 4.01 Init -  87329  87212  118  2  1   59   84    85 0.822   5.73
 4.00 Prom -  91342  91303   40                              -6.16

 5.00 Prom +  91614  91653   40                              -2.66
 5.01 Init +  92943  92987   45  2  0   93   47    48 0.695   1.98
 5.02 Intr +  94217  94328  112  1  1   50   71    82 0.628   2.65
 5.03 Term +  95352  95443   92  1  2   31   49   119 0.601   0.18
 5.04 PlyA +  98225  98230    6                               1.05

 6.16 PlyA -  99168  99163    6                               1.05
 6.15 Term - 100057  99998   60  1  0  111   54   119 0.919   8.60
 6.14 Intr - 102535 102413  123  1  0   48  103   127 0.990  10.98
 6.13 Intr - 104960 104878   83  0  2  112   57    39 0.626   2.56
 6.12 Intr - 105216 105121   96  1  0   10   86   160 0.984   7.98
 6.11 Intr - 105549 105456   94  0  1   50   66   161 0.999   9.64
 6.10 Intr - 106155 106096   60  0  0   81   94    64 0.974   5.23
 6.09 Intr - 106723 106644   80  2  2  112   38   111 0.574   7.77
 6.08 Intr - 107473 107383   91  1  1   46  116   172 0.998  15.37
 6.07 Intr - 108345 108232  114  0  0   82   86   159 0.999  15.74
 6.06 Intr - 109778 109669  110  0  2   65   44   226 0.619  15.90
 6.05 Intr - 111039 110949   91  0  1   90  109   120 0.966  13.87
 6.04 Intr - 115812 115680  133  2  1   58   34   102 0.343   2.45
 6.03 Intr - 118559 118430  130  0  1   64   79   166 0.606  13.05
 6.02 Intr - 120799 120737   63  2  0   97   87    85 0.984   7.99
 6.01 Init - 122825 122753   73  2  1   72   53   114 0.474   5.89
 6.00 Prom - 125671 125632   40                              -6.86

 7.11 PlyA - 127493 127488    6                               1.05
 7.10 Term - 134694 133303 1392  0  0  102   43  1084 0.805  96.51
 7.09 Intr - 135391 135340   52  0  1   80  115    21 0.973   2.91
 7.08 Intr - 141422 141322  101  2  2  137   78    44 0.971   7.21
 7.07 Intr - 149008 148974   35  2  2   70  103    13 0.615  -1.06
 7.06 Intr - 154642 154455  188  0  2   67  116   197 0.225  19.73
 7.05 Intr - 173457 173318  140  0  2   71  110   101 0.914   9.86
 7.04 Intr - 175680 175540  141  0  0   92   98     2 0.802   2.15
 7.03 Intr - 176771 176583  189  2  0   34   91    81 0.747   2.78
 7.02 Intr - 180535 180456   80  2  2   93  119    84 0.844  11.27
 7.01 Init - 180901 180850   52  1  1   26   57    35 0.256  -4.48
 7.00 Prom - 182304 182265   40                              -7.06

 8.00 Prom + 183273 183312   40                              -5.36
 8.01 Init + 184551 184817  267  2  0   36   35   202 0.213   5.58
 8.02 Intr + 195430 195636  207  0  0   38   49   282 0.880  18.57
 8.03 Term + 195681 196988 1308  2  0  -65   52  1797 0.514 152.39
 8.04 PlyA + 198933 198938    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  21831  21809   23  0  2   49   76    58 0.927  -0.13

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815588r:47368850_47579889|GENSCAN_predicted_peptide_1|122_aa
MPWHCLVETAGTSTNHKGPNSYKVHHSHPTSPDVPATSFANCLPVPNWTSMDGPAFSLTL
EVPTVWKGRHALTTCKGGTHGEILLCPDRCGHSGAVPSSPSQTPAALDGDGSMKPLFSLL
RT
>gi568815588r:47368850_47579889|GENSCAN_predicted_CDS_1|369_bp
atgccctggcactgtctagtagaaactgctggtaccagcaccaatcacaagggaccaaac
tcctacaaagtccaccacagccaccccacaagtccagatgtccctgccaccagcttcgcg
aactgcctgcctgtgccaaactggacctccatggatggaccagctttcagcctcaccctg
gaagtgcccacggtgtggaagggaagacatgcactgaccacatgtaaaggtggcacccat
ggggagatcctcctctgtccagaccgctgtgggcacagtggagctgtgccttcctcccct
tcacagacaccagctgccctagacggcgatggatctatgaagcccttgttctccttgctc
cgaacatga
>gi568815588r:47368850_47579889|GENSCAN_predicted_peptide_2|1066_aa
MDGGSVVGEWPVMDGCSVVNGWPVMDGWPVVDGCSVVNGGSVVNGGSVVNGWPVMDGGSV
ANGWPVLNGWPVMDGWPVMDGCSVVNGWPVMDGWPVMDGCSVVNDWPVMAGQPVVDGCSV
VNGWPVMDGCSVVNEWPVMDGWPVVDGCSVVNGGSVVNGWPVMNDWPMMDGGSVVNGWPM
MDGGSVANGWPVLNGWPVMDGWPMMDGCSVVNGWPVMAGWPVMDGCSVVNDWPMMAGQPV
VDGCSVVNGWPVMNGWPVVDGYSVVNGWPVMDGGTVMDGCSVVNGWPVMNGWPVVDGCSV
VNGWPVMNGWPVVDGYSVVNGWPVMDGGTVMDGCSVVNGWPVMNGWPVVDGCSVVNGWPV
MNGWPVVDGCSVVNGWPVMNGWPVMDGCSVVNDWPVMAGRPVVDGCSVVNDWPVMNGWPV
VDGCSVVNGWPMMNGWPVVDGCSVVNGWPVMDGGTVMDGCSVVNGWPVMNGWPVVDGCSV
VNGWPVMDGWPVTDGCSVVNGWPVVNGWPVMDGCSVVNGWPVMDGCSVVDGCSVVNGGSM
VKRLCPRALPRKSQKLPGLMNLASLNTSLAAQGFCEDHKIYRYKDSLPIGSRRRHTAGLS
EGAERDSSGAQAGRRVRKEADSIWEMTADDHQAGSGDRVVSGFRIISAAGAPVQRHTHPE
GSAWNPPDPLDLRGPRGEGGDPRGPGFWLLAHSLQSSSTSASWNRIKQAQGVSHSSAGPL
DSPGWWHPHPGTHTGGALHSDVIRKDISAIVQAGFNKDLLKAQKAKMVRASPERCSSCPC
DHKWLLGVNHSGSSVFTESIGLTLKGDLEQRYHSSGNHLLHPALLTDPRTLQIPRTLQIP
EHLGGHQQGRKSEGLVIREGFQDKVPFEQGCKGQVGSAFQAIIVLVTLLSAEEARSDLLS
KGIRSDLQKLRDPCSWLGPQHCSFQGMASPAGGQAPKGYEHQNCTRRIPHSSQNSMERGI
TMLIAPGRAPGARSRAKHRLHPYFMAGGKDKREMWAGTGDPTAQASSARSLGQPEIRGQA
EGSEQGSHHPVKKDLFCFHFHHDCKFPEASASMLNSLKNSAYEFFS
>gi568815588r:47368850_47579889|GENSCAN_predicted_CDS_2|3201_bp
atggatgggggctctgtggtgggcgagtggcctgtgatggatgggtgctctgtggtgaat
gggtggcctgtgatggatggctggcctgtggtggatgggtgctctgtggtgaatgggggc
tctgtggtgaatgggggctctgtggtgaatgggtggcctgtgatggatgggggctctgtg
gcaaatgggtggcctgtgttgaatggctggcctgtgatggatggctggcctgtgatggat
ggatgttctgtggtgaatgggtggcctgtgatggatggctggcctgtgatggatgggtgc
tctgtggtgaacgattggcctgtgatggctggccagcctgtggtggatgggtgctctgtg
gtgaatgggtggcctgtgatggatgggtgttctgtggtgaacgagtggcctgtgatggat
ggctggcctgtggtggatgggtgctctgtggtgaatgggggctctgtggtgaatgggtgg
cctgtgatgaatgattggcctatgatggatgggggctctgtggtgaatgggtggcctatg
atggatgggggctctgtggcaaatgggtggcctgtgttgaatggctggcctgtgatggat
ggatggcctatgatggatggatgttctgtggtgaatggctggcctgtgatggctggctgg
cctgtgatggatgggtgctctgtggtgaacgattggcctatgatggctggccagcctgtg
gtggatgggtgctctgtggtgaatggatggcctgtgatgaatggctggcctgtggtggat
gggtactcagtggtgaatggctggcctgtgatggatgggggcactgtaatggatgggtgc
tctgtggtgaatggctggcctgtgatgaatggctggcctgtggtggatgggtgctctgtg
gtgaatgggtggcctgtgatgaatggttggcctgtggtggatgggtactcagtggtgaat
ggctggcctgtgatggatgggggcactgtaatggatgggtgctctgtggtgaatggctgg
cctgtgatgaatggctggcctgtggtggatgggtgctctgtggtgaatgggtggcctgtg
atgaatggctggcctgtggtggacgggtgctctgtggtgaatgggtggcctgtgatgaat
ggctggcctgtgatggatgggtgctctgtggtgaacgactggcctgtgatggctggccgg
cctgtggtggatgggtgctctgtggtgaatgactggcctgtgatgaatggctggcctgtg
gtggatgggtgctctgtggtgaatgggtggcctatgatgaacggctggcctgtggtggat
gggtgctctgtggtgaatgggtggcctgtgatggatgggggcactgtaatggatgggtgc
tctgtggtgaatggctggcctgtgatgaacggctggcctgtggtggatgggtgctctgtg
gtgaatgggtggcctgtgatggatggttggccagtgacagatgggtgctctgtggtgaat
gggtggcctgtggtgaatggctggcctgtgatggatgggtgctctgtggtgaatgggtgg
cctgtgatggacgggtgttctgtggtggatggttgctctgtggtgaatgggggctctatg
gtgaaaaggctctgccctagagccctgccaagaaagtcccagaagcttcctggtctcatg
aacctagcctcactgaacacctcgcttgcagcacagggcttctgtgaggaccacaaaatc
tacaggtacaaagacagcctccccatcggcagcaggagaagacatactgcagggctcagt
gagggagcagagagggacagctccggggcccaagcaggtaggagggtcagaaaggaggct
gacagcatctgggaaatgacagcagatgaccaccaggcaggctcgggcgacagggttgtg
agtgggtttaggatcatatcagcagctggggcaccggttcagcggcacacgcatcctgaa
ggtagtgcgtggaatcctccagatcccctggacctgagaggcccacgtggagaaggtgga
gaccctagaggtcctggattctggctcctggcacacagcctccaatcctcatccaccagt
gccagctggaacaggatcaagcaggcccagggtgtatctcattccagtgcagggcctctg
gacagcccaggttggtggcatccccaccctggtactcacacaggtggtgctcttcatagt
gatgtcattagaaaggacatttctgccattgttcaggctggcttcaacaaagacctcctt
aaagcacagaaagcgaagatggtgagagccagcccggagagatgcagttcttgcccttgt
gaccacaaatggctgcttggtgtcaaccattctggctcttctgtatttacagagtccatt
ggcttaactctgaaaggcgacctggagcaacggtaccattcctcgggcaatcatctcctg
catcctgccctcctcacagaccctaggacactgcagatccctaggacactgcagatccct
gaacaccttgggggccaccaacaagggaggaagtcagaggggctggtcatcagggaaggc
ttccaggacaaggtgcccttcgagcagggctgcaaagggcaggtgggctcggcctttcaa
gcaataatcgtgctggtaaccttgttgtctgcagaagaagcaaggagtgacctgctcagc
aaaggcatcaggagtgacctccagaagctccgggacccctgcagttggctgggcccgcag
cactgctcattccaaggcatggccagcccagctggagggcaagcaccaaagggttatgag
caccagaactgcacacgcaggatcccacacagttctcagaactccatggaacgggggatt
actatgctcattgcacctgggagggcacctggagctcggagcagggctaaacatcgtctc
cacccctacttcatggctggtggtaaagataagcgggaaatgtgggctggtacaggggac
cccacagcacaggcctccagtgcacggtcactaggtcagcctgaaatcagaggccaggct
gaaggaagtgagcaaggaagccaccaccctgtgaagaaggacctgttttgcttccacttc
caccatgattgtaagtttcccgaggcctctgcatccatgctgaactcactcaagaattct
gcctatgaatttttctcctag
>gi568815588r:47368850_47579889|GENSCAN_predicted_peptide_3|340_aa
MGRNQHKKAENSKSQNASFPPKDDNSSPAREQNSTENEFDELTEVGFRRWVINPSKLKKH
VLTQCKEAKNLEKRLDELLTRITSVEKNINDLMELKNTARELHEAYTIINSQINGVEERI
SVIEDQLNEIKREDKIREKRIKRNEQSLQEIWDCVKRPNLHLIVEPESDKENGTKLENTL
QDIIQENFPNLARKANIQIQEIQRTPQRYSLRRATPRHIIVRFTEVEMKEKMLRAARQTD
WIIHKGKPIRLTADLSAETLQARREWGPTFNIFKEKNFQPRISYPAKLSFISEREIKSFT
NKQMLRDFVTTRPALQELLKEALNMERNNWYQRLQKHTKL
>gi568815588r:47368850_47579889|GENSCAN_predicted_CDS_3|1023_bp
atggggagaaaccagcacaaaaaggctgaaaattccaaaagccagaacgcctcttttcct
ccaaaggatgacaactcctcgccagcaagggaacaaaactcgacagagaatgagtttgat
gaattgacagaagtaggcttcagaaggtgggtaataaacccctccaagctaaagaagcat
gttctaacccaatgcaaggaagctaagaacctcgaaaaaaggttagatgaattgctaact
agaataaccagtgtagagaagaacataaatgacctgatggagctgaaaaacacagcacga
gaacttcatgaagcatacacaattattaatagccaaatcaatggagtggaagaaaggata
tcagtgattgaagatcaacttaatgaaataaagagagaagacaagattagagaaaaaaga
ataaaaaggaatgaacaaagcctccaagaaatatgggactgtgtgaaaagaccaaattta
catttgattgttgaacctgaaagtgacaaagagaatggaaccaagctggaaaacactctt
caggatattatccaggagaacttccccaatctagcaagaaaagccaacattcaaattcag
gaaatacagagaacaccacaaagatactccttgagaagagcaaccccaagacacataatt
gtcagattcaccgaggttgaaatgaaagaaaaaatgttaagggcagccagacagacagat
tggattatccacaaagggaagcccatcagactaacagcggatctctctgcagaaacccta
caagccagaagagagtgggggccaacattcaacatttttaaagaaaagaattttcaaccc
agaatttcatatccagccaaactaagcttcataagtgaacgagaaataaaatcctttaca
aacaagcaaatgctgagagattttgtcaccaccaggcctgccttacaagagctcctgaag
gaagcactaaacatggaaaggaacaactggtaccagcgactgcaaaaacataccaaattg
taa
>gi568815588r:47368850_47579889|GENSCAN_predicted_peptide_4|151_aa
MWVPSLFWGSDPLLMIPVEVLGSRDIQARMPSSSAHESKESWNKYHGAAPSTEEGPLPWA
SVGNQGPKSARTCQAPTKPNGKLSPGDTVLIQGNQDSHGTSFSTEEERRGKTPRATCFLS
SPRTRLSFITYCMEDSIVLQLTLTLGRGVLL
>gi568815588r:47368850_47579889|GENSCAN_predicted_CDS_4|456_bp
atgtgggtgccctcattgttctggggcagtgaccccctgctcatgatccctgtggaggtt
ctgggctccagggacatccaggccagaatgcccagcagttcagctcatgagtccaaagag
tcatggaacaaataccacggagcagcacctagcacagaggaaggccccctgccctgggca
tctgtgggaaaccagggccccaagtctgcgagaacctgccaggctccaacaaagccaaac
gggaaactctccccgggagacaccgtcctcatccagggaaaccaggattctcatggaacg
tcattttctactgaagaggagcgcagaggaaaaactcccagggccacgtgcttcctctcc
agcccgagaacacggctgtccttcatcacctactgcatggaggacagcatcgtcctgcag
ctcacactcacactaggcaggggagtgctgctttga
>gi568815588r:47368850_47579889|GENSCAN_predicted_peptide_5|82_aa
MESGSNCVGMGRLPQGGDPKHAKSNTLALPPESMRTTRMGVCPDLALPSWTAGDLLKKGQ
GDQTSPGKVEDEDPDRILGVVR
>gi568815588r:47368850_47579889|GENSCAN_predicted_CDS_5|249_bp
atggagagcggcagcaactgcgttgggatggggcggctgccccagggaggtgaccctaaa
catgccaagtccaacacactagctctaccccctgagagcatgagaacaacacggatgggc
gtgtgcccggatctggccttgccctcatggactgcaggagacctgctgaagaaaggccag
ggtgaccagacatcacctgggaaggtggaggatgaggaccctgatcggatattgggagtg
gtcagatag
>gi568815588r:47368850_47579889|GENSCAN_predicted_peptide_6|466_aa
MLGGLGKLAAEGLAHRTEKATEGAIHAVEEVVKEVVGHAKETGEKAIAEAIKKAQESGDK
KMKEITETVTNTVTNAITHAAESLDKLGHDASEWSRGVVVAGQSQAGARVSLGGDGAEAI
TGLTVDQYGMLYKIEQEGVTVKSSSHFNPDPDAETLYKAMKGIGTNEQAIIDVLTKRSNT
QRQQIAKSFKAQFGKARGRLDLTETLKSELSGKFERLIVALMYPPYRYEAKELHDAMKGL
GTKEGVIIEILASRTKNQLREIMKAYEEDYGSSLEEDIQADTSGYLERILVCLLQGSRDD
VSSFVDPGLALQDAQDLYAAGEKIRGTDEMKFITILCTRSATHLLRVFEEYEKIANKSIE
DSIKSETHGSLEEAMLTVGTAPPLESQCVLPKIVLCAAHVVPSAVWGAGTRDGTLIRNIV
SRSEIDLNLIKCHFKKMYGKTLSSMIMEDTSGDYKNALLSLVGSDP
>gi568815588r:47368850_47579889|GENSCAN_predicted_CDS_6|1401_bp
atgctgggaggcctggggaagctggctgccgagggcctggcccaccgcaccgagaaggcc
accgagggagccattcatgccgtggaggaagtggtgaaggaggtggtgggacatgccaag
gagactggagagaaagccattgctgaagccataaagaaagcccaggagtcaggggacaaa
aagatgaaggaaatcaccgagacagtgaccaacacagtcacaaatgccatcacccatgca
gcagaaagtctggacaaacttggacatgatgcctctgaatggtcccgaggggttgtggtg
gccgggcagagccaggcaggagccagagtcagcctggggggtgatggagctgaggccatc
accggtctgacagtggaccagtatggcatgctgtataagattgaacaggagggtgtcaca
gtgaagagcagctcccacttcaacccagaccctgatgcagagaccctctacaaagccatg
aaggggatcgggaccaacgagcaggctatcatcgatgtgctcaccaagagaagcaacacg
cagcggcagcagatcgccaagtccttcaaggctcagttcggcaaggcaaggggaaggctg
gacctcactgagaccttgaagtctgagctcagtggcaagtttgagaggctcattgtggcc
cttatgtacccgccatacagatacgaagccaaggagctgcatgacgccatgaagggctta
ggaaccaaggagggtgtcatcattgagatcctggcctctcggaccaagaaccagctgcgg
gagataatgaaggcgtatgaggaagactatgggtccagcctggaggaggacatccaagca
gacacaagtggctacctggagaggatcctggtgtgcctcctgcagggcagcagggatgat
gtgagcagctttgtggacccaggactggccctccaagacgcacaggatctgtatgcggca
ggcgagaagattcgtgggactgatgagatgaaattcatcaccatcctgtgcacgcgcagt
gccactcacctgctgagagtgtttgaagagtatgagaaaattgccaacaagagcattgag
gacagcatcaagagtgagacccatggctcactggaggaggccatgctcactgtggggact
gctccacctctagagtcccagtgtgtgctgccaaagattgttctctgtgctgcccacgtg
gtgcccagtgctgtgtggggagcagggacgcgtgatgggaccctgataagaaacatcgtt
tcaaggagcgagattgacttaaatcttatcaaatgtcacttcaagaagatgtacggcaag
accctcagcagcatgatcatggaagacaccagcggtgactacaagaacgccctgctgagc
ctggtgggcagcgacccctga
>gi568815588r:47368850_47579889|GENSCAN_predicted_peptide_7|789_aa
MFEDVFSDSGNTGNFDRGKKRRLTIIECGCDINMMIDLAKVADLVLMLIDASFGFEMEMF
EFLNICQAHGFPKILGVLTHLDSFKHNKQLKKTKKRLKHRFWTEVYQVAKLFYLSGMVHG
EYQNQEIHNLGHFITVMKFRPLMANFSPLYPGRQDKVGLTHELVQSLISTYSTIDAKMAS
SRVTLLSNSKPLGSEAIDNQGVSLEFDQQQGSVCPSESEIYEAGAEDRMAGAPMAAAVQP
AEVTVEVGEDLHMHQVRDREMPEVVEIRRSNCTNHVSTERFSQQYSSCSTIFLDDSTAIQ
HYLTMTIISDADRSLSIPDEQLHSFAVSTVHIMKKRNGGGSLNNYSSSIPPTPSTSQEDP
QFSVPPTANTPTPVCKRSMRWSNLFTSEKGSDPDKERKAPENHADTIGSGRAIPIKQGML
LKRSGKWLKTWKKKYVTLCSNGVLTYYSSLGDYMKNIHKKEIDLRTSTIKVPGKWPSLAT
SACAPISSSKSNGLSKDMDTGLGDSICFSPGISSTTSPKLNPPPSPHANKKKHLKKKSTN
NFMIVSATGQTWHFEATTYEERDAWVQAIQSQILASLQSCKSSKSKSQLTSQSEAMALQS
IQNMRGNAHCVDCETQNPKWASLNLGVLMCIECSGIHRSFGTRLSRVRSLELDDWPVELR
KVMSSIGNELANSIWEGSSQGQTKPSIKSTREEKEWWIRSKYEEKLFLAPLPCTELSLGQ
QLLRATTDEDLQTAILLLAHGSREEVNETCGEGDGCTALHLACRKGNVVLEQLLTGWTSW
PEMPTGTQR
>gi568815588r:47368850_47579889|GENSCAN_predicted_CDS_7|2370_bp
atgtttgaggatgtttttagcgatagtgggaatacaggaaattttgatagaggtaaaaag
cgcagactcaccattattgaatgtgggtgtgacattaacatgatgattgatctggctaaa
gtagcagatctggtactgatgcttatagatgccagctttgggtttgaaatggaaatgttt
gagtttctaaacatctgtcaagcacatggctttcctaaaattctgggagttctcacccac
ctcgactccttcaagcataacaagcaactgaagaagacaaagaagcgattaaaacacagg
ttctggacagaagtttaccaggttgccaaactgttctacctttctggaatggtgcatgga
gaatatcaaaaccaagaaatccacaatctgggccattttattacagttatgaagtttagg
cctctcatggcaaacttttcacccttatatcctggcagacaggacaaggtggggctcacc
catgagctggtccagagtctcatctccacctactccaccattgatgccaagatggcttca
agtcgagtgacgctgctttccaattccaaaccacttgggtcagaggctatagataatcaa
ggcgtcagcctcgagtttgaccagcagcaggggtcggtgtgtccctctgaatctgagatc
tatgaggcaggagctgaggacaggatggcaggagcgcccatggctgctgctgtacagcct
gctgaggtgactgttgaagttggtgaggacctccacatgcaccaggttcgtgaccgggag
atgcctgaagttgtagaaataagaagaagcaactgtacaaaccatgtatctactgagcgt
ttcagtcaacaatacagctcgtgttcgacaatattccttgatgacagcacagccatccag
cattatcttacaatgacaataatatcagatgcagatagatctttgagcatacctgatgaa
cagttacactcatttgcggtttccaccgtgcacattatgaagaaaagaaatggaggtggg
agtttaaataactattcctcctccattccaccgactcccagcaccagccaggaggaccct
cagttcagtgttcctcccactgccaacacacccacccccgtttgcaagcggtccatgcgc
tggtccaacctgtttacatctgagaaagggagtgacccagacaaagagaggaaagccccg
gagaatcatgctgacaccatcgggagcggcagagccatccccattaaacagggcatgctc
ttaaagcgaagtggaaaatggctgaagacatggaaaaagaaatacgtcaccctgtgttcc
aatggcgtgctcacctattattcaagcttaggtgattatatgaagaatattcataaaaaa
gagattgaccttcggacatctaccatcaaagtcccaggaaagtggccatccctagccaca
tcagcctgtgcacccatctccagctctaaaagcaatggcctatccaaggacatggacacc
gggctgggtgactccatatgcttcagccccggtatctccagcaccaccagccccaagctc
aacccgcccccctctcctcatgccaataaaaagaaacacctaaagaagaaaagcaccaac
aactttatgattgtgtctgccactggccaaacgtggcactttgaagccacgacgtatgag
gagcgggatgcctgggtccaagccatccagagccagatcctggccagcctgcagtcatgc
aagagcagtaaaagcaagtcccagctgaccagccagagtgaggccatggccctgcagtcg
atccaaaacatgcgtgggaacgcccactgtgtggactgtgagacccagaatcctaagtgg
gccagtttgaacttgggagtcctcatgtgtattgaatgctcaggtatccaccgcagtttt
ggcacccgcctttcccgtgtgcgatctctggagctggatgactggccagttgagctcagg
aaggttatgtcatctattggcaatgagctagccaacagcatctgggaagggagcagccag
gggcagacaaaaccctcaataaagtccacgagggaagagaaggaatggtggatccgttcc
aaatatgaggagaagctctttctggccccactaccctgcactgagctgtccctgggccag
cagctgctgcgggccaccactgatgaggacctgcagacagccatcctgctgctggcacat
ggctcccgtgaggaggtgaacgagacctgtggggagggagacggctgcacggcactccat
ctggcctgccgcaaggggaatgtggtcctggagcagctcctgacggggtggacgtcatgg
cccgagatgcccacgggaacacagcgctga
>gi568815588r:47368850_47579889|GENSCAN_predicted_peptide_8|593_aa
MRTTRDLPSAPEPRTQPPPTPARGTKPVHNCDSLGPLGHRRKRKFTLAPTRRRETKRQKK
RDRRLDDFQGTAGRKTRQRPSAGAGAVCAEGDGRQEAGQPAAGRAWGGLVIDSHSFLEYN
SWHVLSSVNICCSKLVKWRLQKGKVTIVEFIQPAARSQHAAADSFLSILLTKLDGCFHSV
AGCFHSMAIITGGFATFSSCFPDLCKGEPAALLPMSLSQSCLLVPSVGLTLILPHLYLGS
QEDVLNKDLMTQNGISYVLYASNSCPKPDFIYQSHFLRVPINDNYCEKLLPWLDKSIEFV
DKAKLSSCQVIVHRLAGISCCATIAIAYIMKTMGMSSEDAYRFVKDQRPSISPNFNFLGQ
LLEDQSSPKLLAAVQGDAGTPSGMQEPPPSPAAGAPLPWLPPPTSETAATRSAAAREGGP
SAGRKPPAPPTATSTLQQGLRSLRLSSDHLQDTSRLKPSFSLDIKSAYAPSRRPGGPGPA
TPARPRSSLKAGQPVGAMLGLPSPCPDAAPRHAHGPARYPARGLNFGYAAAGPWPAGQPR
SLDATARLPEASSVLQPRGRARAGQGAVCALRPGGRPGTQRLQRPATAGGSKG
>gi568815588r:47368850_47579889|GENSCAN_predicted_CDS_8|1782_bp
atgcgcacgactcgcgatcttcccagtgccccagagccacggacccaaccgccgcctacc
ccagcccgcggcaccaaacctgttcacaactgcgactccttaggtccgcttggacaccgc
cggaaacggaaattcaccctcgcgccgactcgccggagggaaacaaaaaggcagaaaaaa
agggaccgccgcctggatgactttcaggggacagctggaagaaagactcgtcaacggccg
agtgctggcgccggcgccgtctgcgcagaaggtgatggacgccaagaagctggccagcct
gctgcggggcgggcctgggggggcctggtcatcgacagtcactccttcctggagtacaac
agctggcatgtgctcagctccgtcaacatctgctgctccaagctggtgaagtggcggttg
cagaagggcaaggtgaccattgtggagttcatccagccggccgcacgcagccagcacgcg
gccgcagacagcttcctctccatcctgctgaccaagctggatggctgcttccacagcgtg
gccggctgcttccacagcatggccatcatcacggggggcttcgccaccttctcctcctgc
ttccccgacctctgcaagggtgagcctgctgccctgctacccatgagcctctcccagtcc
tgcctgctcgtgcccagcgtgggcctgaccctcatcctgcctcacctctacctgggctcg
caggaagacgtcctgaacaaggatctgatgacgcagaatggaataagctacgtcctctat
gccagcaactcctgccccaagcctgacttcatctaccagagccacttcttgcgggtcccc
atcaacgacaactactgtgaaaagctgctgccctggctggacaagtccatcgagttcgtc
gataaagccaagctgtccagctgccaagtcatcgtccaccgtctggccggcatctcctgc
tgtgccactatcgccatcgcctacatcatgaagaccatgggcatgtcctccgaagacgcc
tacaggtttgtgaaggaccagcgcccgtccatctcgcccaacttcaacttcctgggccag
ctgctggaggaccagagcagcccgaagctgctggccgccgtgcagggcgacgcgggcacc
ccctcaggaatgcaggagcctccccccagccctgcggccggggccccactgccatggctg
ccaccacctacctcagagaccgctgccaccaggagtgcagctgccagggagggcggcccg
agcgcgggcaggaagcccccggcgccccccacggccaccagcacgctgcagcagggcctg
cgcagcctgcgcctctcctcggaccacctgcaggacaccagccgcctcaagccctccttc
tctctggacatcaagtcggcctacgcccccagcaggcggcccggcggcccgggcccagcg
accccggcgaggccccgaagctctctgaaagctggacagccagtcggggccatgctgggc
ctgccctcgccctgcccggacgccgcgcccaggcacgcccacggcccggcgcgctacccc
gcgcgcggcctgaacttcggctacgcggctgccgggccctggccagccggccagccccgg
agcctggacgccaccgctcgactccctgaagcgtcctcggtgcttcagccccgagggcgt
gcaagggccgggcagggtgctgtttgcgcccttcggccgggcgggcgccccggaacccaa
cggctgcagcgacctgccacggcgggaggcagcaagggctga