GENSCAN 1.0 Date run: 16-Jul-119 Time: 15:49:54 Sequence gi568815595r:142349432_142678704 : 329273 bp : 37.97% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.12 Intr - 6065 5970 96 2 0 51 96 67 0.777 3.09 1.11 Intr - 7688 7481 208 1 1 73 71 164 0.928 11.36 1.10 Intr - 10500 10431 70 1 1 73 76 88 0.992 3.42 1.09 Intr - 15748 15616 133 1 1 86 75 178 0.990 15.60 1.08 Intr - 15935 15879 57 2 0 93 89 14 0.489 0.16 1.07 Intr - 21189 21054 136 2 1 68 74 106 0.999 6.85 1.06 Intr - 26513 26367 147 1 0 12 107 139 0.981 6.63 1.05 Intr - 27163 27048 116 1 2 80 106 -7 0.928 -1.37 1.04 Intr - 30749 30651 99 2 0 72 84 58 0.774 3.19 1.03 Intr - 33982 33869 114 1 0 62 24 103 0.634 1.02 1.02 Intr - 35254 35092 163 0 1 68 115 107 0.988 10.46 1.01 Init - 39450 39149 302 0 2 50 100 138 0.803 7.97 1.00 Prom - 40696 40657 40 -3.25 2.14 PlyA - 41558 41553 6 1.05 2.13 Term - 43205 43169 37 1 1 93 37 50 0.138 -3.77 2.12 Intr - 48029 47898 132 1 0 53 79 135 0.607 7.94 2.11 Intr - 51116 51013 104 2 2 68 97 99 0.986 6.85 2.10 Intr - 54341 54243 99 2 0 27 89 109 0.346 4.29 2.09 Intr - 63232 63113 120 1 0 64 78 84 0.604 4.77 2.08 Intr - 64860 64704 157 2 1 86 94 134 0.989 12.89 2.07 Intr - 69178 69073 106 2 1 101 42 106 0.940 5.65 2.06 Intr - 69450 69384 67 0 1 59 110 28 0.953 -0.34 2.05 Intr - 71722 71585 138 1 0 53 34 236 0.772 14.44 2.04 Intr - 72112 72045 68 2 2 63 95 58 0.911 1.71 2.03 Intr - 77410 77313 98 0 2 61 99 55 0.488 2.63 2.02 Intr - 83462 83230 233 2 2 124 66 177 0.539 14.75 2.01 Init - 87881 87870 12 2 0 73 95 13 0.688 0.03 2.00 Prom - 88384 88345 40 -11.14 3.00 Prom + 88708 88747 40 -12.62 3.01 Init + 90347 90665 319 1 1 76 59 164 0.917 9.84 3.02 Term + 90730 91439 710 2 2 18 35 339 0.499 14.28 3.03 PlyA + 95181 95186 6 1.05 4.03 PlyA - 95679 95674 6 1.05 4.02 Term - 98094 97971 124 2 1 92 43 161 0.996 8.88 4.01 Init - 98631 98435 197 0 2 31 80 219 0.996 11.86 4.00 Prom - 99556 99517 40 -8.25 5.12 PlyA - 99639 99634 6 1.05 5.11 Term - 100171 99998 174 1 0 85 44 154 0.403 7.48 5.10 Intr - 103802 103697 106 1 1 52 96 75 0.411 4.00 5.09 Intr - 108324 108173 152 0 2 23 74 158 0.621 5.94 5.08 Intr - 109680 109527 154 2 1 76 95 67 0.975 5.35 5.07 Intr - 109952 109796 157 0 1 106 115 113 0.999 13.95 5.06 Intr - 112659 112509 151 0 1 73 41 157 0.815 8.31 5.05 Intr - 115809 115666 144 0 0 95 53 74 0.924 4.16 5.04 Intr - 117102 116893 210 0 0 83 52 86 0.858 2.69 5.03 Intr - 118637 118503 135 2 0 63 36 114 0.917 3.54 5.02 Intr - 127940 127247 694 1 1 27 86 243 0.142 8.58 5.01 Init - 128653 127995 659 1 2 72 67 318 0.988 22.88 5.00 Prom - 129366 129327 40 -6.15 6.27 PlyA - 129535 129530 6 1.05 6.26 Term - 130209 129761 449 1 2 16 43 225 0.630 5.19 6.25 Intr - 130540 130281 260 0 2 -33 -2 265 0.093 1.58 6.24 Intr - 135851 135709 143 2 2 74 109 97 0.188 8.63 6.23 Intr - 147089 146930 160 1 1 91 90 124 0.999 11.97 6.22 Intr - 147761 147582 180 1 0 74 4 160 0.889 4.26 6.21 Intr - 149343 149166 178 1 1 106 53 52 0.923 1.56 6.20 Intr - 150287 150196 92 1 2 68 98 71 0.986 4.82 6.19 Intr - 154022 153931 92 2 2 69 90 37 0.960 -0.13 6.18 Intr - 155872 155708 165 1 0 4 110 145 0.121 7.54 6.17 Intr - 158678 158500 179 0 2 43 41 102 0.073 -0.28 6.16 Intr - 163039 162829 211 1 1 33 92 103 0.577 2.76 6.15 Intr - 164207 164070 138 2 0 83 78 61 0.944 4.34 6.14 Intr - 173410 173297 114 1 0 -18 110 131 0.833 4.42 6.13 Intr - 174768 174562 207 0 0 84 103 126 0.919 12.05 6.12 Intr - 185774 185649 126 2 0 70 85 169 0.925 14.76 6.11 Intr - 193326 193234 93 0 0 75 115 34 0.644 4.04 6.10 Intr - 200242 200048 195 1 0 62 86 58 0.718 1.69 6.09 Intr - 200871 200701 171 0 0 39 93 149 0.658 9.72 6.08 Intr - 203967 203796 172 2 1 77 110 27 0.765 2.82 6.07 Intr - 204309 204209 101 0 2 72 69 63 0.839 0.79 6.06 Intr - 204584 204394 191 0 2 72 0 154 0.956 3.38 6.05 Intr - 209997 209820 178 0 1 10 113 98 0.312 3.07 6.04 Intr - 213678 212801 878 1 2 71 107 199 0.153 10.22 6.03 Intr - 216830 216690 141 0 0 86 111 42 0.720 5.70 6.02 Intr - 218723 218632 92 1 2 43 91 73 0.480 1.82 6.01 Init - 219696 219419 278 0 2 90 70 76 0.364 2.51 6.00 Prom - 228138 228099 40 -5.85 7.03 PlyA - 228345 228340 6 1.05 7.02 Term - 229051 228907 145 0 1 80 43 143 0.893 5.40 7.01 Init - 229273 229215 59 1 2 87 91 122 0.980 11.24 7.00 Prom - 240553 240514 40 -5.35 8.07 PlyA - 240822 240817 6 1.05 8.06 Term - 247113 246502 612 0 0 59 43 354 0.759 21.49 8.05 Intr - 247866 247786 81 0 0 100 27 105 0.749 4.52 8.04 Intr - 260101 259976 126 1 0 111 48 73 0.208 5.56 8.03 Intr - 288724 288632 93 1 0 73 91 54 0.083 3.44 8.02 Intr - 303122 303013 110 0 2 78 96 15 0.001 0.38 8.01 Init - 311190 311124 67 0 1 75 77 61 0.024 4.99 8.00 Prom - 311601 311562 40 -7.35 9.00 Prom + 312661 312700 40 -4.25 9.01 Init + 314807 314876 70 1 1 76 57 54 0.453 2.36 9.02 Intr + 319959 320122 164 1 2 56 89 227 0.982 18.37 9.03 Intr + 321562 321691 130 0 1 86 93 81 0.954 7.75 9.04 Intr + 326726 326858 133 0 1 71 97 106 0.918 8.58 9.05 Intr + 328601 328682 82 2 1 88 84 69 0.660 5.22 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 120138 119906 233 1 2 91 93 82 0.952 5.57 S.002 Init - 155857 155708 150 1 0 53 110 149 0.878 13.69 S.003 Intr - 166084 165964 121 0 1 45 116 50 0.932 3.08 S.004 Init - 166245 166208 38 0 2 68 121 45 0.861 5.33 S.005 Init - 176447 176358 90 2 0 79 81 70 0.977 5.94 S.006 Init - 234782 234667 116 2 2 92 105 67 0.826 8.53 S.007 Sngl - 269238 269056 183 0 0 95 38 130 0.846 3.49 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:142349432_142678704|GENSCAN_predicted_peptide_1|547_aa MESTHSEDAVNIVEMTAKDLGYYINLVEKQWQTLRALTPSWKELLLWVKCYKIALHSAEK SFMKSQLIQQTLLLSYFQKLRQPSQPSSATALTNQQPSISSYLRRKGIIINETSAVVYAQ LLTGRKYQINQNGEVRLEKQWSKQVVPFVYQTIVKDIRAFDSRFSNIKTLDDLFPLRSMV FMLGTPYYGCTGEVQDSGDVITEGRIRVIFSIPCEPNLDALIQNQHKYSIKYNPGYVLAS RLGVSGYLVSRFTGSIFIGRGSRRNPHGDHKANVGLNLKFNKKNEEVPGYTKKVGSEWMY SSAAEQLLAEYLESAEKVQEIITWLKGHPVSTLSRSSCDLQILDAAIVEKIEEEVEKCKQ RKNNKKVRVTVKPHLLYRPLEQQHGVIPDRDAEFCLFDRVVNVRENFSVPVGLRGTIIGI KGANREADVLFEVLFDEEFPGGLTIRCSPGRGYRLPTSALVNLSHGSRSETGNQKLTAIV KPQPAVHQHSSSSSVSSGHLGALNHSPQSLFVPTQVPTKDDDEFCNIWQSLQGSGKMQYF QPTIQEK >gi568815595r:142349432_142678704|GENSCAN_predicted_CDS_1|1641_bp atggaatctactcatagtgaagatgctgtgaacattgttgaaatgacagcaaaggattta ggatattacataaatttagttgaaaagcagtggcagactttgagagcattgactccaagt tggaaagaacttctcttgtgggtaaaatgctataaaatagcattgcattctgcagagaaa tctttcatgaagagtcagctgatacagcaaactttattattgtcttattttcaaaaattg cgtcagccatcccaaccttcatcggccactgccctgactaatcaacagccatcaatatca agctacctgagaagaaaaggaataataataaatgaaacatctgcagttgtgtatgctcag ttactcacaggtcgtaaatatcaaataaatcaaaatggtgaagttcgtctagagaaacag tggtcaaaacaagttgttccttttgtttatcaaactattgtcaaggacatccgagctttc gactcccgtttctccaatatcaaaacattggatgatttgtttcctctgagaagtatggtc tttatgctgggaactccctattatggctgcactggagaagttcaggattcaggtgatgtg attacagaaggtaggattcgtgtgattttcagcattccatgtgaacccaatcttgatgct ttaatacagaaccagcataaatattctataaagtacaacccaggatatgtgttggccagt cgccttggagtgagtggataccttgtttcaaggtttacaggaagtatttttattggaaga ggatctaggagaaaccctcatggagaccataaagcaaatgtgggtttaaatctcaaattc aacaagaaaaatgaggaggtacctggatatactaagaaagttggaagtgaatggatgtat tcatctgcagcagaacaacttctggcagagtacttagagagtgctgaaaaagttcaagaa attattacttggctaaaaggacatcctgtcagtactttatctcgttcttcttgtgattta caaattctggatgcagctattgttgagaaaattgaggaagaagtcgaaaagtgcaagcaa agaaagaataataagaaggtgcgagtaacagtgaaaccccatttgctatacagaccttta gaacagcaacatggagtcattcctgatcgggatgcagaattttgtctttttgaccgtgtt gtaaatgtgagagaaaacttctcagttccagttggccttcgaggcaccatcataggaata aaaggagctaatagagaagccgatgtactatttgaagtattatttgatgaagaatttcct ggagggttaacaataagatgctcacctggtagaggttatcgactgccaacaagtgccttg gtgaacctttctcatgggagtcgctctgaaactggaaatcagaagttgacagccatcgta aaaccacaaccagctgtacatcaacatagctcaagttcatcagtttcctctgggcatttg ggagccctcaaccattcccctcaatcactttttgttcctactcaagtacctactaaagat gatgatgaattctgcaacatttggcagtccttacagggatctggaaagatgcaatacttt cagccaactatacaagagaag >gi568815595r:142349432_142678704|GENSCAN_predicted_peptide_2|456_aa MDKMIPEFDNLYLDMNGIIHQCSHPNDDDVHFRISDDKIFTDIFHYLEVLFRIIKPRKVF FMAVDGVAPRAKMNQQRGRRFRSAKEAEDKIKKAIEKGETLPTEARFDSNCITPGYINES GHLNLPRFEKYLVKLSDFDREHFSEVFVDLKWFESKVGNKYLNEAAGVAAEEARNYKEKK KLKGQENSLCWTALDKNEGEMITSKDNLEDETEDDDLFETEFRQYKRTYYMTKMGVDVVS EYYPYHYAPFLSDIHNISTLKIHFELGKPFKPFEQLLAVLPAASKNLLPACYQHLMTNED SPIIEYYPPDFKTDLNGKQQEWEAVVLIPFIDEFFLKKSGVQVFQQSSRGENMMLEILVD AESDELTVENVASSVLGKSVFVNWPHLEEARVVAVSDGETKFYLEEPPGTQKLYSGRTAP PSKVVHLGDKEQSNWAKEVQGISEQDMDEIGNHHSQ >gi568815595r:142349432_142678704|GENSCAN_predicted_CDS_2|1371_bp atggataagatgattcctgaatttgacaacttgtacctggatatgaatggaattatacat cagtgctcccatcctaatgatgatgatgttcactttagaatttcagatgataaaatcttt actgatatttttcactacctggaggtgttgtttcgcattattaaacccaggaaagtgttc tttatggctgtagatggtgtggctcctcgagcaaaaatgaaccagcagcgtgggaggcgt tttaggtcagcaaaggaggcagaagacaaaattaaaaaggcaatagagaagggagaaact cttcctacagaggccagatttgattccaactgtatcacaccaggttatattaatgaaagt gggcacctcaacttacctcgatttgagaaataccttgtgaaactatcagattttgatcgg gagcacttcagtgaagtttttgtggacctaaaatggtttgaaagcaaagttggtaacaag tacctcaatgaagcagcaggtgtcgcagcagaagaagccaggaactacaaggaaaagaaa aagttaaagggccaggaaaattctctgtgttggactgctttagacaaaaatgaaggcgaa atgataacttctaaggataatttagaagatgagactgaagatgatgacctatttgaaact gagtttagacaatataaaagaacatattacatgacgaagatgggggttgacgtagtatct gagtattatccttatcattatgcacctttcctgtctgatatacacaacatcagtacactc aaaatccattttgaactaggaaaaccttttaagccatttgaacagcttcttgctgtactt ccagcagccagcaaaaatttacttcctgcatgctaccagcatttgatgaccaatgaagac tcaccaattatagaatattacccacctgattttaaaactgacctaaatgggaaacaacag gaatgggaagctgtggtgttaatcccttttattgatgagttttttttgaagaaaagtggt gttcaagtattccagcaaagcagtcgtggagaaaacatgatgttggaaatcttagtggat gcagaatcagatgaacttaccgtagaaaatgtagcttcatcagtgcttggaaaatctgtc tttgttaattggcctcaccttgaggaagctagagtcgtggctgtatcagatggagaaact aagttttacttggaagaacctccaggaacacagaagctttattcaggaagaactgcccca ccatctaaagtggttcatcttggagataaagaacaatctaactgggcaaaagaagtacaa ggaatttcagaacaggacatggatgaaattggaaatcatcattctcagtaa >gi568815595r:142349432_142678704|GENSCAN_predicted_peptide_3|342_aa MALRQVDFGGSGKGKSWANRMPKRACFQCGPQGHFKKDCPSRNKPPPCPCPLCQGNHWKA HYLRRRRSSETKVSGDEGPAEPDDPAAGLRVPGESASPCHHPHRALVLLSCPGQLSSRSV TIRGVLGQPVTRYFSQPLSCDWGTLLFSHAFLITPESPTPLLRRDILAKVGVIIHLNIGE GTPVCCPLLEEGINPEVWATERQHGRAKNAHPGQVKLKDSTSFPYQRQYPLRPEAQQGLQ KIVKDLKAQGLVKPCNSSCNTPNLGVQKPNRQWRLVQDLRIMNEAVVPLYPAVPNPYTLL SQIPEEAERFTVLDLKDAFFCNPVHPDSQFLFAFEEPSNPTS >gi568815595r:142349432_142678704|GENSCAN_predicted_CDS_3|1029_bp atggcccttaggcaagtggactttggaggctctggaaaagggaaaagctgggcaaatcga atgcctaaaagggcttgcttccagtgcggtccgcaaggacactttaaaaaagattgtcca agtagaaataagccgcccccttgtccatgccccttatgtcaagggaatcactggaaggcc cactacctcaggagacgaaggtcctctgagacgaaggtctcaggagacgaaggtcctgct gaaccagatgatccagcagcaggactgagggtgcccggggaaagcgccagcccatgccat caccctcacagagccctggtcttactctcctgtcctggacaactgtcctccagatctgtc actatccgaggggtcctaggacagccagtcactagatacttctcccagccactaagttgt gactggggaactttactcttttcacatgcttttctaattacgcctgaaagccccactccc ttgttaaggagagacattctagcaaaagtaggggtcattatacacctgaacataggagaa ggaacacccgtttgttgtcccctgcttgaggaaggaattaatcctgaagtctgggcaaca gaaagacaacatggacgagcaaagaatgcccatcctggtcaagttaaactaaaggattcc acctcctttccctaccaaaggcagtacccccttagacccgaggcccaacaaggactccaa aagattgttaaggacctaaaagcccaaggcctagtaaaaccatgcaatagctcctgcaat actccaaatttaggagtacagaaacccaacagacagtggaggttagtgcaagatctcagg attatgaatgaggctgttgttcctctatacccagctgtacctaacccttatactctgctt tcccaaataccagaggaagcagagcggtttacagtcctggaccttaaggatgcctttttc tgcaaccctgtacatcctgactctcaattcttgtttgcctttgaagagccttcgaaccca acgtcttaa >gi568815595r:142349432_142678704|GENSCAN_predicted_peptide_4|106_aa MVGRAVCCHPRSDDRRLRRSPGGAFGFGLALTGISVDDRNGSPQVLQMDLRAVSLSQRSG ERASGGGCEAGVGVLVGSPIPPRAADPLTDDNSRFRTQSPCLLSLI >gi568815595r:142349432_142678704|GENSCAN_predicted_CDS_4|321_bp atggtcggccgggcggtgtgttgtcatccgcggagcgacgaccggaggctgcggcggagc cccggcggggcgtttggtttcggtttggccctgactgggattagtgttgacgatcgaaat gggagtccccaagttttacagatggatctcagagcggtatccctgtctcagcgaagtggt gaaagagcatcaggtggaggctgcgaagctggagtgggtgtcctcgtaggttcccccatc ccacccagagcggcagacccacttacagacgataacagccgcttccgcacgcagtcacct tgtttactttccctgatctag >gi568815595r:142349432_142678704|GENSCAN_predicted_peptide_5|911_aa METQKTLHKINESRSWFLEKINKIDRLLARLIKKKREKNQIDAIKNVKGDITDPTEIQTT IREYYKHLYANKLENLEEMDKFLDTYTLPRLNQEEVESLNGPITGSEIEAIINSLPTKKS PGPDGFTAEFYQRYKEELVPFILKLFQSIEKEGVLPNSFYEASIILIPKPGRDTTKKENF KPISLMNIDAKILNKILANRIQQHIKKLIHHDEVGFIPGISKDKNHMIMSIDAEKAFDKI QQPFMLKTLNKLGIDGTYLKIIRAIYDRLTANILNGQKLEAFLLKTGIRQGCPLSPLLFN IVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIISAPNFFKLISNFTKVS GYKINVQKSQAYLYTNNRQTETQIMSELLFTIASKRIKYPGIQLTRDVKDLFKENYKPLL KEIKEDTNKWKNIPCSWIGRINIVKMAILPKSSYPMRVNRCKEILNKAIHMKKSLEKFVG DATRLTDKLLELCNKPVDGSSSTLSMSTHFKMLKKLVEEATFSEILIPLQSVMIPTLPSI LGTHANHASHEPFPGHWAYIAGFDDMVEILASLQKPKKISLKGSDGKFYIMMCKPKDDLR KDCRLMEFNSLINKCLRKDAESRRRELHIRTYAVIPLNDECGIIEWVNNTAGLRPILTKL YKEKGVYMTGKELRQCMLPKSAALSEKLKVFREFLLPRHPPIFHEWFLRTFPDPTSWYSS RSAYCRSTAVMSMVGYILGLGDRHGENILFDSLTGECVHVDFNCLFNKGETFEVPEIVPF RLTHNMVNGMGPMGTEGLFRRACEVTMRLMRDQREPLMSVLKTFLHDPLVEWSKPVKGHS KAPLNETGEVVNEKAKTHVLDIEQRLQGVIKTRNRVTGLPLSIEGHVHYLIQEATDENLL CQMYLGWTPYM >gi568815595r:142349432_142678704|GENSCAN_predicted_CDS_5|2736_bp atggagacacaaaaaacccttcacaaaatcaatgaatccaggagctggtttttggaaaag atcaacaaaattgatagactgctagctagactaataaagaagaaaagagagaagaatcaa atagatgcaataaaaaatgttaaaggggatatcaccgatcccacagaaatacaaactacc atcagagaatactataaacacctctatgcaaataagctagaaaatctagaagaaatggat aaattcctggacacatacaccctcccaagactaaaccaggaagaagttgaatctcttaat ggaccaataacaggctctgaaattgaggcaataattaatagcttaccaaccaaaaaaagt ccaggaccagatggattcacagccgaattctaccagaggtacaaggaggaactggtacca ttcattctgaaactatttcaatcaatagagaaagagggagtcctccctaactcattttat gaggccagcatcatcctgataccaaagcctggcagagacacaacaaaaaaagagaatttt aaaccaatatccctgatgaacattgatgcaaaaatcctcaataaaatactggcaaaccga atccagcagcacatcaaaaagcttatccaccatgatgaagtgggcttcatccctgggata agcaaagacaaaaaccacatgattatgtcaatagatgcagaaaaggcctttgacaaaatt caacagcccttcatgctaaaaactctcaataaattaggtattgatgggacatatctaaaa ataataagagctatttatgacagactcacagccaatatactgaatgggcaaaaactggaa gcattccttttgaaaactggcataagacagggatgccctctctcaccactcctattcaac atagtgttggaagttctggccagggcaatcaggcaggagaaagaaataaagggcattcaa ctaggaaaagaggaagtcaaattgtccctgtttgcagacgacatgattgtatatctagaa aaccccatcatctcagccccaaatttctttaagctgataagcaacttcaccaaagtctca ggatacaaaatcaatgtgcaaaaatcacaagcatacttatacaccaataacagacaaaca gagacccaaatcatgagtgaactcctattcacaattgcttcaaagagaataaaataccca ggaatccaacttacaagggatgtgaaggacctcttcaaggagaactacaaaccattgctc aaggaaataaaagaggacacaaacaaatggaagaacattccatgctcatggataggaaga atcaatatcgtgaaaatggccatactgcccaagtcatcttatcccatgcgtgtgaacaga tgcaaggaaatcctcaataaagctattcatatgaaaaaatccttagagaagtttgttgga gatgcaactcgcctaacagataagcttctagaattgtgcaataaaccggttgatggaagt agttccacattaagcatgagcactcattttaaaatgcttaaaaagctggtagaagaagca acatttagtgaaatcctcattcctctacaatcagtcatgatacctacacttccatcaatt ctgggtacccatgctaaccatgctagccatgaaccatttcctggacattgggcctatatt gcagggtttgatgatatggtggaaattcttgcttctcttcagaaaccaaagaagatttct ttaaaaggctcagatggaaagttctacatcatgatgtgtaagccaaaagatgacctgaga aaggattgtagactaatggaattcaattccttgattaataagtgcttaagaaaagatgca gagtctcgtagaagagaacttcatattcgaacatatgcagttattccactaaatgatgaa tgtgggattattgaatgggtgaacaacactgctggtttgagacctattctgaccaaacta tataaagaaaagggagtgtatatgacaggaaaagaacttcgccagtgtatgctaccaaag tcagcagctttatctgaaaaactcaaagtattccgagaatttctcctgcccaggcatcct cctatttttcatgagtggtttctgagaacattccctgatcctacatcatggtacagtagt agatcagcttactgccgttccactgcagtaatgtcaatggttggttatattctggggctt ggagaccgtcatggtgaaaatattctctttgattctttgactggtgaatgcgtacatgta gatttcaattgtcttttcaataagggagaaacctttgaagttccagaaattgtgccattt cgcctgactcataatatggttaatggaatgggtcctatgggaacagagggtctttttcga agagcatgtgaagttacaatgaggctgatgcgtgatcagcgagagcctttaatgagtgtc ttaaagacttttctacatgatcctcttgtggaatggagtaaaccagtgaaagggcattcc aaagcgccactgaatgaaactggagaagttgtcaatgaaaaggccaagacccatgttctt gacattgagcagcgactacaaggtgtaatcaagactcgaaatagagtgacaggactgccg ttatctattgaaggacatgtgcattaccttatacaggaagctactgatgaaaacttacta tgccagatgtatcttggttggactccatatatgtga >gi568815595r:142349432_142678704|GENSCAN_predicted_peptide_6|1727_aa MGGPGKSTLSSRCGPCNWQPGFQASSCSWLEGGSSLGTHPFPPRSLLPAAINLPPTTPRL FVPRGACRPILSCPQPTAWPAQSPEGDEAAGGCATPEEYNTVVQKPRQILCQFIDRILTD VNVVAVELVKKTDSQPTSVMLLDFIQHIMKSSPLMFVNVSGSHEAKGSCIEFSNWIITRL LRIAATPSCHLLHKKICEVICSLLFLFKSKSPAIFGVLTKELLQLFEDLVYLHRRNVMGH AVEWPVVMSRFLSQLDEHMGYLQSAPLQLMSMQNLEFIEVTLLMVLTRIIAIVFFRRQEL LLWQIGCVLLEYGSPKIKSLAISFLTELFQLGGLPAQPASTFFSSFLELLKHLVEMDTDQ LKLYEEPLSKLIKTLFPFEAEAYRNIEPVYLNMLLEKLCVMFEDGVLMRLKSDLLKAALC HLLQYFLKFVPAGYESALQVRKVYVRNICKALLDVLGIEVDAEDCQHKSKKKPSVVITWM SLDFYTKVLKSCRSLLESVQKLDLEATIDKVVKIYDALIYMQAFIDNLHHLCKHLDFRED ETDVKAVLGTLLNLMEDPDKDVRVAFSGNIKHILESLDSEDGFIKELFVLRMKEAYTHAQ ISRNNELKDTLILTTGDIGRAAKGDLVPFALLHLLHCLLSKSASVSGAAYTEIRALVAAK SVKLQSFFSQYKKPICQFLVESLHSSQMTALPNTPCQNADVRKQDVAHQREMALNTLSEI ANVFDFPDLNRFLTRTLQVLLPDLAAKASPAASALIRTLGKQLNVNRREILINNFKYIFS HLVCSCSKDELERALHYLKADYLQPKLLGILAFFNMQLLSSSVGIEDKKMETSESTDLQT TLQLSMKAIQHENVDVRIHALTSLKETLYKNQEKLIKYATDSETVEPIISQLVTVLLKGC QDANSQARLLCGECLGELGAIDPGRLDFSTTETQGKDFTFVTGVEDSSFAYGLLMELTRA YLAYADNSRAQDSAAYAIQVRHDLASKIFTCCSIMMKHDFKVTIYLLPHILVYVLLGCNQ EDQQEVYAEIMAVLKHDDQHTINTQDIASDLCQLSTQTVFSMLDHLTQWARHKFQALKAE KCPHSKSNRNKVDSMVSTVDYEDYQSVTRFLDLIPQDTLAVASFRSKAYTRAVMHFESFI TEKKQNIQEHLGFLQKLYAAMHEPDGVAGVSAIRKAEPSLKEQILEHESLGLLRDATACY DRAIQLEPDQIIHYHGVVKSMLGLGQLSTVITQVNGVHANRSEWTDELNTYRVEAAWKLS QWDLVENYLAADGKSTTWSVRLGQLLLSAKKRDITAFYDSLKLVRAEQIVPLSAASFERG SYQRGYEYIVRLHMLCELEHSIKPLFQHSPGDSSQEDSLNWVARLEMTQNSYRAKEPILA LRRALLSLNKRPDYNEMVGECWLQSARVARKAGHHQTAYNALLNAGESRLAELYVERAKW LWSKDVTACLPEWEDGHFYLAKYYDKLMPMVTDNKMEKQGDLIRYIVLHFGRITNAEKSL KDLMELKSTAQELRDECTSFSSRFDQLEERVSVIEDQMNEMKREEKFREKRIKRNEQSLQ EIWDYVKRPNLCLIGVPETRQANIQIQEIQRTPQRYSSRRATPRHIIVRFTKVEMKERTL RAAREKGRVTHKGNPIRLTADLSAEILQARREWGPIFKILKEKNFQPRISYPAKLSFKSE GEIKSFTDKQMLRDFVTTRPVLKELLKEALNMERNNRYQPLQKHAKL >gi568815595r:142349432_142678704|GENSCAN_predicted_CDS_6|5184_bp atgggtgggcctggaaaaagcaccctaagttctcgctgtggtccgtgtaactggcagcca ggcttccaggcttcaagctgttcctggcttgaaggtgggtcttcactggggacccacccc tttcctcccaggagcctacttcctgctgccatcaacctgccacccactacaccgaggctg ttcgtgccgaggggcgcctgcaggcccatattgagctgccctcagcctaccgcttggcct gcccaaagtccagagggagatgaggcagcagggggctgtgccacaccagaggaatataat acagttgtacagaagccaagacaaattctgtgtcaattcattgaccggatacttacagat gtaaatgttgttgctgtagaacttgtaaagaaaactgactctcagccaacctccgtgatg ttgcttgatttcatccagcatatcatgaaatcctccccacttatgtttgtaaatgtgagt ggaagccatgaggccaaaggcagttgtattgaattcagtaattggatcataacgagactt ctgcggattgcagcaactccctcctgtcatttgttacacaagaaaatctgtgaagtcatc tgttcattattatttctttttaaaagcaagagtcctgctatttttggggtactcacaaaa gaattattacaactttttgaagacttggtttacctccatagaagaaatgtgatgggtcat gctgtggaatggccagtggtcatgagccgatttttaagtcaattagatgaacacatggga tatttacaatcagctcctttgcagttgatgagtatgcaaaatttagaatttattgaagtc actttattaatggttcttactcgtattattgcaattgtgttttttagaaggcaagaactc ttactttggcagataggttgtgttctgctagagtatggtagtccaaaaattaaatcccta gcaattagctttttaacagaactttttcagcttggaggactaccagcacaaccagctagc acttttttcagctcatttttggaattattaaaacaccttgtagaaatggatactgaccaa ttgaaactctatgaagagccattatcaaagctgataaagacactatttccctttgaagca gaagcttatagaaatattgaacctgtctatttaaatatgctgctggaaaaactctgtgtc atgtttgaagacggtgtgctcatgcggcttaagtctgatttgctaaaagcagctttgtgc catttactgcagtatttccttaaatttgtgccagctgggtatgaatctgctttacaagtc aggaaggtctatgtgagaaatatttgtaaagctcttttggatgtgcttggaattgaggta gatgcagaggactgtcaacataaatccaagaagaaaccttctgtagtgataacttggatg tcattggatttttacacaaaagtgcttaagagctgtagaagtttgttagaatctgttcag aaactggacctggaggcaaccattgataaggtggtgaaaatttatgatgctttgatttat atgcaagctttcatagataatctacatcatctttgtaagcatcttgattttagagaagat gaaacagatgtaaaagcagttcttggaactttattaaatttaatggaagatccagacaaa gatgttagagtggcttttagtggaaatatcaagcacatattggaatccttggactctgaa gatggatttataaaggagctttttgtcttaagaatgaaggaagcatatacacatgcccaa atatcaagaaataatgagctgaaggataccttgattcttacaacaggggatattggaagg gccgcaaaaggagatttggtaccatttgcactcttacacttattgcattgtttgttatcc aagtcagcatctgtctctggagcagcatacacagaaattagagctctggttgcagctaaa agtgttaaactgcaaagttttttcagccagtataagaaacccatctgtcagtttttggta gaatcccttcactctagtcagatgacagcacttccgaatactccatgccagaatgctgac gtgcgaaaacaagatgtggctcaccagagagaaatggctttaaatacgttgtctgaaatt gccaacgttttcgactttcctgatcttaatcgttttcttactaggacattacaagttcta ctacctgatcttgctgccaaagcaagccctgcagcttctgctctcattcgaactttagga aaacaattaaatgtcaatcgtagagagattttaataaacaacttcaaatatattttttct catttggtctgttcttgttccaaagatgaattagaacgtgcccttcattatctgaaggct gattatttacaacccaaattgttgggcattttggctttttttaacatgcagttactgagc tctagtgttggcattgaagataagaaaatggagacctctgagagcactgatcttcagaca actcttcagctctctatgaaggccattcaacatgaaaatgtcgatgttcgtattcatgct cttacaagcttgaaggaaaccttgtataaaaatcaggaaaaactgataaagtatgcaaca gacagtgaaacagtagaacctattatctcacagttggtgacagtgcttttgaaaggttgc caagatgcaaactctcaagctcggttgctctgtggggaatgtttaggggaattgggggcg atagatccaggtcgattagatttctcaacaactgaaactcaaggaaaagattttacattt gtgactggagtagaagattcaagctttgcctatggattattgatggagctaacaagagct taccttgcgtatgctgataatagccgagctcaagattcagctgcctatgccattcaggtt cgacatgatcttgccagtaaaattttcacctgctgtagcattatgatgaagcatgatttc aaagtgaccatctatcttcttccacatattctggtgtatgtcttactgggttgtaatcaa gaagatcagcaggaggtttatgcagaaattatggcagttctaaagcatgacgatcagcat accataaatacccaagacattgcatctgatctgtgtcaactcagtacacagactgtgttc tccatgcttgaccatctcacacagtgggcaaggcacaaatttcaggcactgaaagctgag aaatgtccacacagcaaatcaaacagaaataaggtagactcaatggtatctactgtggat tatgaagactatcagagtgtaacccgttttctagacctcataccccaggatactctggca gtagcttcctttcgctccaaagcatacacacgagctgtaatgcactttgaatcatttatt acagaaaagaagcaaaatattcaggaacatcttggatttttacagaaattgtatgctgct atgcatgaacctgatggagtggccggagtcagtgcaattagaaaggcagaaccatctcta aaagaacagatccttgaacatgaaagccttggcttgctgagggatgccactgcttgttat gacagggctattcagctagaaccagaccagatcattcattatcatggtgtagtaaagtcc atgttaggtcttggtcagctgtctactgttatcactcaggtgaatggagtgcatgctaac aggtccgagtggacagatgaattaaacacgtacagagtggaagcagcttggaaattgtca cagtgggatttggtggaaaactatttggcagcagatggaaaatctacaacatggagtgtc agactgggacagctattattatcagccaaaaaaagagatatcacagctttttatgactca ctgaaactagtgagagcagaacaaattgtacctctttcagctgcaagctttgaaagaggc tcctaccaacgaggatatgaatatattgtgagattgcacatgttatgtgagttggagcat agcatcaaaccacttttccagcattctccaggtgacagttctcaagaagattctctaaac tgggtagctcgactagaaatgacccagaattcctacagagccaaggagcctatcctggct ctccggagggctttactaagcctcaacaaaagaccagattacaatgaaatggttggagaa tgctggctgcagagtgccagggtagctagaaaggctggtcaccaccagacagcctacaat gctctccttaatgcaggggaatcacgactcgctgaactgtacgtggaaagggcaaagtgg ctctggtccaaggatgtgaccgcgtgcctgccagaatgggaggatgggcatttttacctt gccaagtactatgacaaattgatgcccatggtcacagacaacaaaatggaaaagcaaggt gatctcatccggtatatagttcttcattttggcagaataaccaatgcagagaaatcctta aaggacctgatggagctgaaatccacggcacaggaactacgtgacgaatgcacaagcttc agtagccgattcgatcaactggaagaaagggtatcagtgattgaagatcaaatgaatgaa atgaagcgagaagagaagtttagagaaaaaagaataaaaagaaacgaacaaagcctccaa gaaatatgggactatgtgaaaagaccaaatctatgtctgattggtgtacctgaaacaagg caggccaacattcaaattcaggaaatacagagaacgccacaaagatactcctcgagaaga gcaactccaagacacataattgtcagattcaccaaagttgaaatgaaggaacgaacgtta agggcagccagagagaaaggtcgggttacccacaaagggaaccccatcagactaacagct gatctctcggcagaaattctacaagccagaagagagtgggggccaatattcaaaattctt aaggaaaagaattttcaacccagaatttcatatccagccaaactaagcttcaaaagtgaa ggagaaataaaatcctttacagacaagcaaatgctgagagattttgtcaccaccaggcct gtcctaaaagagctcctgaaggaagcactaaacatggaaaggaacaatcggtaccagcca ctgcagaaacatgccaaattgtaa >gi568815595r:142349432_142678704|GENSCAN_predicted_peptide_7|67_aa MGEHGLELASMIPALRELGRVPVDSTWGSRHKPAGSRSVGERGVCRPWFTTHPDREFASC SWRKSPC >gi568815595r:142349432_142678704|GENSCAN_predicted_CDS_7|204_bp atgggggaacatggcctggagctggcttccatgatccccgccctgcgggagctgggcaga gtgcccgttgatagcacatggggcagtcgccacaaacctgcgggaagccggtccgtgggg gagagaggcgtctgccggccctggtttaccactcaccccgatcgagagtttgcttcctgc agctggaggaagagtccctgctga >gi568815595r:142349432_142678704|GENSCAN_predicted_peptide_8|362_aa MVWKLHTVSRERYDTLTMLGRGDRCANGFAMPPRTQALLPHCSLSKGHKNLCLFVGRAKE IACLKRLNYRIKKFYSLGILESQVKRADRQALAIKQGDELSSSQQAVSQSDICHSRPKHF RAVRDHPVPLPLVLQEIYAAQLKPHTSVSNTVAGGKDITPSFWALSPPPRTCRHRCPAFE RAAPTGDPSTCRSTLRLPLRARVAAALAPLRMLGRHPGPQTLNPLAAFCLRLPGEGKGSP EPGFRLRVKEELQALGAWMGRGGRALRQRYLYRPQLPKSKDDHLHPGNEFSTAVLKLERG SESPGELFKSQIPNPETEFPIQLGLERSATVPISNMFSAKTGATSLGTTLWDLLVKRKGI RY >gi568815595r:142349432_142678704|GENSCAN_predicted_CDS_8|1089_bp atggtgtggaagttacacacagtcagcagagagcggtatgatactctcaccatgctgggc agaggtgatagatgcgctaatggatttgccatgcctccgaggacacaggcactgctccct cactgcagcttgtccaaggggcacaaaaatctttgcctctttgttggtagagccaaggag attgcatgcttaaaacgacttaattatagaataaaaaagttttattctctaggaattttg gaatctcaggttaagcgagcagatagacaggcccttgctattaagcagggtgatgagctt agttctagccaacaggctgtgagtcaaagtgatatttgtcactctaggccaaagcatttc agagctgtacgtgatcatccagttccccttcctctggttcttcaagaaatttatgcagca cagctgaagcctcatacaagtgtatccaacaccgtggctggagggaaagatatcacgcct tccttttgggccctatccccacccccgcgtacctgccgccaccgctgtcctgccttcgag agggcagctcccactggagatccaagtacctgcaggtccactctgcggctacctctccgc gcccgcgtagccgctgcgctcgcgccactgcgcatgctcggccggcacccgggcccgcaa actctgaatccacttgccgcgttttgcctgcggctcccaggggagggaaagggaagccca gaacctgggtttaggctaagggtgaaagaggagttgcaggcgctgggagcgtggatgggg agaggtggacgggcgctgcggcaacgttatctctaccgcccacagttacccaagtctaaa gatgaccacctacaccctgggaatgagtttagcacagcagttctcaaacttgagcgtgga tcagaatcacctggagagctttttaaatcacagatcccaaacccggaaactgagtttccg attcagttaggtctggagcgtagcgcgacagttcccatttctaacatgttctcagcgaaa actggtgctactagtctgggtaccacactttgggacctgctggtcaaaaggaaaggtata cgttattaa >gi568815595r:142349432_142678704|GENSCAN_predicted_peptide_9|193_aa MENSTTTISREELEELQEAFNKIDIDNSGYVSDYELQDLFKEASLPLPGYKVREIVEKIL SVADSNKDGKISFEEFVSLMQELKSKDISKTFRKIINKREGITAIGGTSTISSEGTQHSY SEEEKVAFVNWINKALENDPDCKHLIPMNPNDDSLFKSLADGILLCKMINLSEPDTIDER AINKKKLTPFTIS >gi568815595r:142349432_142678704|GENSCAN_predicted_CDS_9|579_bp atggaaaacagtactactaccatttctcgggaggagcttgaagaactacaagaggcattt aataaaatagatattgacaatagtgggtatgtcagtgactatgaacttcaagacctgttt aaggaagcaagccttcctctgcctggctacaaggtgcgcgagattgtggagaaaattcta tcagttgctgacagcaacaaagatggcaaaatcagttttgaagagtttgtgtcactaatg caagaattaaaaagcaaagatatcagcaaaacattccgaaaaataattaacaagagggaa gggattactgctattggaggaacttcaactatttccagtgagggcacacagcattcttat tcagaggaagaaaaagtggcttttgttaactggataaacaaagccctggagaatgaccct gactgtaagcatcttatacccatgaatcccaatgatgatagtcttttcaagtcacttgca gatggcatccttctttgcaaaatgatcaacttatctgaaccagatacaattgatgaaaga gccatcaataagaaaaagctcacgccattcactatttct