GENSCAN 1.0	Date run: 4-Nov-116	Time: 01:30:37

Sequence gi568815576r:17741779_18006812 : 265034 bp : 50.72% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.05 Intr -   1048    903  146  2  2   27   75    75 0.298   0.10
 1.04 Intr -   2235   2025  211  0  1  122   96   285 0.999  31.19
 1.03 Intr -   2751   2654   98  1  2   20   86    86 0.713   1.33
 1.02 Intr -   3057   2805  253  0  1  117   31    82 0.411   2.41
 1.01 Init -   4734   4717   18  0  0   73   52    31 0.216  -2.47
 1.00 Prom -   7494   7455   40                              -6.26

 2.08 PlyA -   7771   7766    6                               1.05
 2.07 Term -  12604  12156  449  2  2   41   50   254 0.391  12.28
 2.06 Intr -  32804  32603  202  2  1   90   89     3 0.814  -0.54
 2.05 Intr -  33414  33358   57  0  0   90   97    65 0.940   6.68
 2.04 Intr -  33966  33868   99  0  0  110   80    52 0.513   6.91
 2.03 Intr -  38190  38069  122  1  2   12   89   104 0.052   3.11
 2.02 Intr -  42382  42359   24  2  0   82   99    31 0.369   1.60
 2.01 Init -  45246  45084  163  0  1  104   75    24 0.239   2.62
 2.00 Prom -  46250  46211   40                              -8.06

 3.08 PlyA -  46857  46852    6                              -0.45
 3.07 Term -  49138  48954  185  2  2  120   49   331 0.668  30.11
 3.06 Intr -  49293  49220   74  2  2   90  113   120 0.922  13.75
 3.05 Intr -  49570  49424  147  0  0   29   70   242 0.800  15.85
 3.04 Intr -  51369  51249  121  1  1  127   64   118 0.788  12.85
 3.03 Intr -  54716  54572  145  2  1   30    1   107 0.126  -3.94
 3.02 Intr -  55477  55336  142  0  1   70  102    41 0.793   4.06
 3.01 Init -  56133  56006  128  0  2   87   90   117 0.983   9.98
 3.00 Prom -  60922  60883   40                              -6.26

 4.18 PlyA -  61795  61790    6                               1.05
 4.17 Term -  66627  66350  278  1  2   96   33   119 0.163   2.82
 4.16 Intr -  67159  67066   94  1  1   63   84   184 0.998  15.04
 4.15 Intr -  69035  68925  111  2  0   72   98   204 0.998  20.38
 4.14 Intr -  69830  69672  159  2  0   40   25   116 0.602   0.68
 4.13 Intr -  75006  74912   95  1  2  124   90   174 0.999  20.88
 4.12 Intr -  77351  75533 1819  2  1  104   96  1775 0.695 167.23
 4.11 Intr -  79731  79649   83  1  2  129   94    25 0.998   6.56
 4.10 Intr -  80392  80252  141  2  0   80   83   151 0.998  14.12
 4.09 Intr -  81282  81169  114  1  0  100   54   151 0.712  13.32
 4.08 Intr -  82968  82913   56  2  2   80   30    85 0.414   0.52
 4.07 Intr -  84012  83802  211  1  1   64  -14   145 0.666  -0.13
 4.06 Intr -  86003  85866  138  0  0   -1   82   134 0.695   4.34
 4.05 Intr -  86621  86480  142  2  1   39   40    77 0.248  -2.07
 4.04 Intr -  88252  87944  309  1  0   69   51   135 0.384   4.41
 4.03 Intr -  88376  88327   50  0  2   55   82    32 0.427  -2.30
 4.02 Intr -  90329  90076  254  1  2   82   77   443 0.783  39.68
 4.01 Init -  92218  92154   65  1  2   50   96    23 0.332  -0.08
 4.00 Prom -  94590  94551   40                              -1.96

 5.30 PlyA -  95478  95473    6                               1.05
 5.29 Term -  99458  99296  163  1  1   92   41   133 0.993   6.41
 5.28 Intr - 100239 100044  196  1  1  114   76   346 0.994  34.47
 5.27 Intr - 101996 101895  102  0  0  104   38    44 0.111   1.15
 5.26 Intr - 104448 104409   40  0  1  123   68    30 0.158   2.30
 5.25 Intr - 106960 106866   95  2  2  109   72    -8 0.161  -0.62
 5.24 Intr - 112696 112614   83  0  2   50   91    57 0.134   1.48
 5.23 Intr - 119149 119008  142  2  1   20  105    88 0.009   3.11
 5.22 Intr - 123208 123121   88  1  1   60   68    89 0.093   3.64
 5.21 Intr - 124234 124146   89  2  2  119   90    46 0.979   7.59
 5.20 Intr - 130245 130059  187  0  1  106   81   405 0.986  40.86
 5.19 Intr - 130590 130345  246  0  0   77   36   143 0.882   5.66
 5.18 Intr - 131076 130969  108  0  0   46   99    46 0.763   1.98
 5.17 Intr - 133776 133669  108  0  0   79   69    27 0.236   0.38
 5.16 Intr - 137624 137568   57  2  0   82   60    57 0.402   1.38
 5.15 Intr - 142580 142497   84  2  0   86  103     3 0.690   1.62
 5.14 Intr - 144273 144100  174  0  0   -5   97   174 0.928   9.14
 5.13 Intr - 145657 145545  113  2  2   64   95   102 0.994   8.60
 5.12 Intr - 147452 147256  197  1  2   88   91   241 0.999  23.46
 5.11 Intr - 149854 149707  148  2  1   87   95    32 0.960   3.09
 5.10 Intr - 152126 152030   97  2  1   49  101    65 0.966   3.48
 5.09 Intr - 153632 153506  127  1  1   78  110   127 0.946  14.58
 5.08 Intr - 154583 154468  116  2  2   84   80     8 0.932  -1.15
 5.07 Intr - 155203 154946  258  1  0   27  105   386 0.161  31.86
 5.06 Intr - 157770 157670  101  1  2   84   58    74 0.079   3.83
 5.05 Intr - 159219 159064  156  1  0   97   77   262 0.990  25.98
 5.04 Intr - 160201 160100  102  2  0   93   67   116 0.933  10.15
 5.03 Intr - 160969 160853  117  2  0  102  111    95 0.999  13.64
 5.02 Intr - 163061 162854  208  2  1   43   89   281 0.999  22.35
 5.01 Init - 165034 164771  264  1  0  104  121   332 0.904  35.11
 5.00 Prom - 165510 165471   40                             -10.94

 6.00 Prom + 165739 165778   40                              -7.96
 6.01 Init + 165944 166074  131  1  2   60   87    56 0.270   2.32
 6.02 Intr + 168677 168825  149  2  2   51  116   103 0.424   9.28
 6.03 Intr + 175544 175867  324  0  0   90   71    91 0.274   3.35
 6.04 Intr + 182139 182311  173  1  2  -20   76   139 0.053   1.56
 6.05 Intr + 183210 183404  195  2  0   40   94    59 0.233   1.31
 6.06 Intr + 189131 189303  173  1  2   64   81   121 0.632   7.74
 6.07 Term + 191528 192080  553  2  1   20   37   280 0.788   9.89
 6.08 PlyA + 192217 192222    6                              -0.45

 7.00 Prom + 192423 192462   40                             -10.25
 7.01 Sngl + 192621 193433  813  2  0   63   43   268 0.813  15.58
 7.02 PlyA + 193566 193571    6                               1.05

 8.00 Prom + 194326 194365   40                              -2.46
 8.01 Init + 200949 200977   29  2  2   69   99    58 0.449   4.37
 8.02 Intr + 205399 205551  153  1  0  109   78    78 0.883   8.09
 8.03 Term + 206693 206813  121  2  1  120   47    11 0.467  -1.85
 8.04 PlyA + 207435 207440    6                               1.05

 9.10 PlyA - 208735 208730    6                               1.05
 9.09 Term - 218567 218489   79  1  1   81   46    46 0.207  -3.06
 9.08 Intr - 219343 219182  162  0  0   86   37   111 0.251   4.89
 9.07 Intr - 222727 222538  190  2  1   91  116    16 0.295   3.34
 9.06 Intr - 231753 231594  160  0  1   77  100    12 0.523   0.86
 9.05 Intr - 238435 238345   91  0  1   69   80    67 0.397   4.00
 9.04 Intr - 245929 245837   93  0  0   45   67    90 0.514   1.68
 9.03 Intr - 248239 248166   74  1  2   67   52    69 0.039  -0.60
 9.02 Intr - 258368 258152  217  1  1   61   71   124 0.559   6.51
 9.01 Init - 260192 259615  578  2  2   46   94   265 0.491  15.51

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term - 123208 122876  333  1  0   60   45   155 0.843   3.01

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815576r:17741779_18006812|GENSCAN_predicted_peptide_1|242_aa
MKHILFVFLWSGHTQLQGSPLPVRIQPPNTVSTPRTVPAISGGTDAGQAAAASAAQGAGE
LPAKSPAPLGRAGASATTPCGIRIKDTPTQIVTENLPSAQYPPEPVKEMSGKRLRLLEAQ
PGKVNNGSSLRDECITNLLVFGFLQSCSDNSFRRELDALGHELPVLAPQWEGYDELQTDG
NRSSHSRLGRIEADPALRRWLSVVCTLELHEKPFPNAGLGPYSSCLHQAVLGRETAQRIL
TQ

>gi568815576r:17741779_18006812|GENSCAN_predicted_CDS_1|726_bp
atgaagcacatcctgtttgtctttctctggtctgggcacacccagctgcagggctcacct
ttgcctgtaagaatacagcccccaaacacagtcagtaccccaagaacagtccctgccatc
tctggcggcacagatgctggccaagctgcagctgccagtgctgcccagggagctggagag
ctgccggccaagagcccagcccctctgggtagagcaggagccagtgccaccactccctgt
gggattcggattaaggacacacccacccaaatagttactgaaaatcttccatctgcccaa
taccctcctgagcccgtgaaggagatgagcggaaagaggctccgcctgttggaagcacag
ccaggaaaggtcaacaacggttccagcctcagggatgagtgcatcacaaacctactggtg
tttggcttcctccaaagctgttctgacaacagcttccgcagagagctggacgcactgggc
cacgagctgccagtgctggctccccagtgggagggctacgatgagctgcagactgatggc
aaccgcagcagccactcccgcttgggaagaatagaggcagatccagccctgcgacgctgg
ctttcagtagtgtgcacattggaattacacgagaaaccttttccaaatgcaggccttggg
ccctactccagctgcctgcatcaggctgttttagggcgggagactgcccagaggattctg
acgcag

>gi568815576r:17741779_18006812|GENSCAN_predicted_peptide_2|371_aa
MQGGKDRSSSVHVSHTAYRQLEATQWKLAPSPSSCVPSYSRTKKGLAGTNVGVSAAKLPL
LAASLQKESAIRKVVDFFLQVPPLLLWSRGNAAVSFHRERALQDLPFVSPPECPARVKSM
QETSPCPRGGYIPRGEVSHSREVKTPTKGSAAALPPRAGAGSPGPRKPPGAGRGGPGRAR
AWGAEGGSARGAIRRKRVVDRVRAPGRRCLGPDAPAPPRLEGGQKRLPGEAVVRIGGEAC
GGAWRCLLGSLLSEMVFVTPVWQSPGDWGNVTPIRQSPGDWGNVTPARQSPGDWGNVTPI
RQSPGDWGNVTPIWQSPGEWGNVTPVLQSPGDWGNVTPVRQSPGDWGNVTPVRQSPGDWG
NVTPVLQSPGD

>gi568815576r:17741779_18006812|GENSCAN_predicted_CDS_2|1116_bp
atgcagggagggaaagaccgcagcagctctgtgcacgtgagccacaccgcctacaggcag
ctagaagccactcagtggaagctggctcccagtccttccagctgcgttcccagctattcc
agaacaaagaaaggcctggctgggacaaatgtgggcgtgtcagctgctaagttgccgctg
ctggctgcgagcctgcagaaagagtccgccatcaggaaggtggtggacttctttctccag gtgccgccactactgctctggagccgtgggaatgctgcagtctccttccacagagaacgt gctctccaggatcttccatttgttagccccccagagtgcccagcccgagtgaagagcatg caggaaacaagcccctgtcctcgtggaggctacattccgagaggggaggtatcacactca agagaagtgaagacgcccacaaagggctctgcagcagcactgccgccccgcgccggcgcg ggctccccgggtccccgcaagccgccgggggcgggccggggcgggccggggcgggcccgg gcgtggggggccgaaggaggaagcgcgcggggcgccataaggaggaagcgggtagtcgac cgtgtccgcgcgcctgggagacgctgcctcggcccggacgcgcccgcgcccccgcggctg gagggtgggcagaagcgccttcctggcgaggccgtggtgaggattggaggggaggcctgc ggaggggcctggcggtgcctgctggggtccttgttgtcagagatggtatttgtcaccccc gtctggcaaagtcctggagactgggggaatgtcacccccatccggcagagtcctggagac tgggggaatgtcacccccgcccggcagagtcctggagactgggggaatgtcacccccatc cggcagagtcctggagactgggggaatgtcacccccatctggcagagtcctggagaatgg gggaatgtcacccctgtcctgcagagtcctggagactgggggaatgtcacccccgtccgg cagagtcctggagactgggggaatgtcacccctgtccggcagagtcctggagactggggg aatgtcaccccagtcctgcagagtcctggagactag >gi568815576r:17741779_18006812|GENSCAN_predicted_peptide_3|313_aa MRQVPGASCPAGLEDGPPTCLPAVAVPTSASSAGCHVVSVGGRAGITGVSPRTSELSDRI LTNPCEQGTQASAPQSQGYGSWISSTRRTKNPHLNLPILAATPGRCEDHVGGACEGALKP ESAIRAHLMRCSGQASLPALTRLSLTHHSPCLAAVGELSIASERCPHVSGGIFCIFVGRV GAALPAPDTSVLSAGMGKKDDPKLMQEWFKLVQEKNAMVRYESELMIFARELELEDRQSR LQQELRERMAVEDHLKTEEELSEEKQILNEMLEVVEQRDSLVALLEEQRLREREEDKDLE AAMLSKGFSLNWS >gi568815576r:17741779_18006812|GENSCAN_predicted_CDS_3|942_bp atgaggcaggtccccggggccagttgtccagcaggcctggaggacggcccgccgacgtgc ctgcccgctgtggctgtgcccacctctgcgtcctctgctggctgccacgtggttagcgtt ggtggccgtgctgggatcacaggtgtgagcccccgtacctccgagctttctgaccgcatc ctcactaatccttgtgagcaaggcactcaagcctctgctccccagtctcagggctatggc tcttggatttcgagtaccagaaggactaagaaccctcatttgaatttgcccatcctggca gccacacctggccggtgtgaggatcacgtgggtggtgcatgcgagggcgccctgaagccg gagagtgccatccgtgcccatctcatgcgctgctctggacaggcctcactccctgcgctc acccgactctcgctcacacaccacagcccctgcctggcggccgtgggcgagctctccata gctagtgagcgctgtccgcacgtctccggtggcattttctgtatcttcgtggggagggtg 
ggcgcggcactccctgcacccgacacaagcgttctctcggcaggcatgggcaagaaggac gaccccaagctgatgcaggagtggttcaagctagtgcaggagaagaacgccatggtgcgc tacgagtcggagctgatgatctttgcccgggagctggagctggaagaccggcagagtcga ctgcagcaggagctccgggaacgcatggcagtggaagatcaccttaagactgaggaggag ctgtcagaagagaagcagattctcaatgagatgctggaggtggtggagcagagagactca ctggtggcgctgctggaggagcagcggctccgggagagagaggaggacaaggacctggag gctgccatgctgtccaagggcttcagccttaactggtcctga >gi568815576r:17741779_18006812|GENSCAN_predicted_peptide_4|1372_aa MVMVRYRERVQMSVSQGKKYIGSSESEMEEEGEEEEEEPRLPPSDLGGVPWKEAVRIHAL LKGKSEEELEASKSFGPGNEEEEEEEEEYEEEEEEDYDEEEEESSEEGEYCPWDTELQGQ WRQLPGSHPKFASKSMSHRQRSVFYGTVLTGSAHISLGYSPWSQPAQTVRRKVTCESEEV LEEVGEGSRSGLMECGPIYAAYVLLQAHLQPRLGSFSDAGNLSPVQQQLQLQKMQTEVPA KSISVLCELAGWRQNDTISSHLSRHLAISWASGAGNQRLQQVMHAADPLEIQADVHWTHI REREEEERMAPASESSASGEGLTFLASALGVSVQPPPPMKTPKIPPRVVRHCWRKGSVIA NSTTVLAVTSGMRTWELVGLQQPVPVSSRHRCRCCRIAFGTASNVGHSAPLDENDLEEDV DSEPAEIEGEAAEDGDPGDTGAELDDDQHWSDSPSDADRELRLPCPAEGEAELELRVSED EEKLPASPKHQERGPSQATSPIRSPQESALLFIPVHSPSTEGPQLPPVPAATQEKSPEER LFPEPLLPKEKPKADAPSDLKAVHSPIRSQPVTLPEARTPVSPGSPQPQPPVAASTPPPS PLPICSQPQPSTEATVPSPTQSPIRFQPAPAKTSTPLAPLPVQSQSDTKDRLGSPLAVDE ALRRSDLVEEFWMKSAEIRRSLGLTPVDRSKGPEPSFPTPAFRPVSLKSYSVEKSPQDEG LHLLKPLSIPKRLGLPKPEGEPLSLPTPRSPSDRELRSAQEERRELSSSSGLGLHGSSSN MKTLGSQSFNTSDSAMLTPPSSPPPPPPPGEEPATLRRKLREAEPNASVVPPPLPATWMR PPREPAQPPREEVRKSFVESVEEIPFADDVEDTYDDKTEDSSLQEKFFTPPSCWPRPEKP RHPPLAKENGRLPALEGTLQPQKRGLPLVSAEAKELAEERMRAREKSVKSQALRDAMARQ LSRMQQMELASGAPRPRKASSAPSQGKERRPDSPTRPTLRGSEEPTLKHEATSEEVLSPP SDSGGPDGSFTSSEGSSGKSKKRSSLFSPRRNKKEKKSKGEGRPPEKPSSNLLEEAAAKP KSLWKSVFSGYKKDKKKKADDKSCPSTPSSGATVDSGKHRVLPVVRAELQLRRQLSFSED SDLSSDDVLEKSSQKSRREAGGCGCPVRGLALDWAIASSVLDGARRNSPGQATTSLLPSG QRLPLQTPESVQPRTYTEEELNAKLTRRVQKAARRQAKQEELKRLHRAQIIQRQLQQVEE RQRRLEERGVAVEKALRGEAEPSTPLVGEHVATLGTHRFPEGTQALCLLKAGHLGQGWRG SGYVTSRRGADSGEWLAGPCARDCCSWQGSPGECGALIPAIPSREPARLRSA >gi568815576r:17741779_18006812|GENSCAN_predicted_CDS_4|4119_bp atggttatggttcgttacagggaaagggtacagatgtcagtgagccaagggaagaagtac 
ataggtagttccgagtctgagatggaggaggagggagaggaggaggaggaggagcctcgc ctgcccccatctgacctgggtggtgttccgtggaaggaggccgtgcggatccatgccctt ctgaaagggaaaagtgaagaggagctagaggcctcaaagagctttgggcctgggaatgaa gaggaggaggaggaggaggaagaatatgaagaggaggaggaggaggactatgacgaggag gaggaagagtccagtgaagaaggggagtactgcccctgggacacagaactgcaaggccag tggcgccagctccctggcagccatccaaaatttgcctccaagagcatgtctcacagacaa aggtctgtgttttatggaactgtgctgacgggctctgcacacatctcactgggctactct ccctggtcccagcctgcccagacagtcaggcgcaaagtaacatgtgagtcagaagaggtt ctggaggaagttggagaaggctccagaagtggcctcatggaatgcggccccatctatgcc gcatatgttttactgcaagctcatctccagccaagacttggctccttctctgatgcagga aacctatcgccagtgcagcagcagctccagctgcagaagatgcaaaccgaagtgccagca aagtctatctctgtcctgtgtgagctggccggctggcggcagaatgacacaatctcttcc cacctctctcgtcacctggccatcagctgggcgagtggtgccgggaaccagaggctccag caggtcatgcacgcggcggatcctctggagatccaggctgacgtgcactggactcatatc cgtgagagagaggaggaagagaggatggcgccggcctctgagtcctctgcttccggagaa ggtttaacattcttggcctcggcccttggagtgtccgtgcagcctccaccacctatgaag acaccaaaaatccctccacgagttgtcagacactgctggaggaagggcagtgttatcgcc aacagcaccactgtcctagcggtcacttctgggatgcggacatgggagctcgtggggctt cagcagcctgtccctgtgtcctcccggcacaggtgccgatgctgtaggatagcatttggc acggccagcaacgtgggccactctgccccattggatgagaatgacctagaggaagatgtg gactcagaaccagccgagatagaaggggaggcagcagaggatggggacccaggggacact ggtgctgagctggatgatgatcagcactggtctgacagcccgtcggatgctgacagagag ctgcgtttgccgtgcccagctgagggggaagcagagctggagctgagggtgtcggaagat gaggagaagctgcccgcctcaccgaagcaccaagagagaggtccctcccaagccaccagc cccatccggtctccccaggaatcagctcttctgttcattccagtccacagcccctcaaca gaggggccccaactcccacctgtccctgccgccacccaggagaaatcacctgaggagcgc cttttccctgagcctttgctccccaaagagaagcccaaagctgatgccccctcggatctg aaagctgtgcactctcccatccgatcacagccagtgaccctgccagaagctaggactcct gtctcaccagggagcccgcagccccagccacccgtggcggcctccacgcccccacccagc ccactccccatctgctcccagccccagccttccaccgaggccactgtcccatcccctacc cagtcccccatacgcttccagcctgccccggccaaaacatccaccccactggcccctctc cctgtccaaagccaaagtgacaccaaggacagactgggcagcccccttgctgtggatgag 
gccctcagacggagcgacctggtggaggagttctggatgaagagtgcggagatccgccgc agcctcgggctcacacctgtggaccgcagcaaggggcccgagcccagcttccccacgcct gccttcaggccagtgtccctcaaatcctattccgttgaaaagtccccccaggatgaggga ctccaccttctcaagcctctgtccatccccaaaaggctgggcctgccaaagccggaaggc gagccgttgtccctgccaaccccccggtccccgtccgacagagagctacgcagcgcccag gaggagcgcagggagctgtccagcagctctggcctgggcctgcacgggagctcctccaac atgaagacactgggcagccagagcttcaacacctcggactccgccatgctcacgcccccc tccagcccgcccccaccgccacccccgggcgaggagcccgccaccttgcggaggaagctc agggaggccgagcccaatgcctcggtggtcccgccgcccttgcccgccacctggatgcgg cccccccgggagcctgctcagccccccagagaggaggtgcggaagtcgtttgtggagagc gtggaggagattccctttgctgatgatgtggaggacacctatgacgacaagactgaggac tcaagcctgcaggagaaattcttcacgcccccgtcctgctggccgcgccccgagaagcct cgccacccgcccctggccaaggagaacgggaggctgcctgctctggaggggacgctgcag ccacagaagagggggctgcccttggtgtccgcggaagccaaggagttggccgaggagcgc atgcgagccagggagaagtccgtgaagagccaggcgctgcgggacgccatggccaggcag ctgagcaggatgcagcagatggagctggcctcaggcgcccccaggccccgcaaggcgtcc tcagcaccctcccagggcaaggagcgccggcctgactcccccacacgccccactctcagg ggctccgaggagcccaccctgaagcatgaagccaccagcgaggaggtcctctccccgccg tcggactcagggggcccagatggctctttcacttcatccgagggctccagtgggaagagc aagaagaggtcgtcactcttctccccccgcagaaacaagaaggagaagaagtccaaaggc gagggccggcccccggagaagcccagctccaacctcctagaagaagccgccgccaaaccc aagtccctgtggaagtccgtcttctccgggtacaagaaggacaagaagaagaaggccgac gacaagtcctgccccagcaccccctccagcggggccacggtggactctggaaagcacagg gtgcttcccgtcgtaagggcagagctgcagctccggcgccagctgagcttctccgaggac tcagacctctccagcgacgatgtccttgagaagtcctcacagaagtcccggcgagaggca gggggctgtggctgtccagtccgtggacttgcactggattgggccattgcctcttctgtt ctcgatggcgctaggcgcaacagccctggccaggccacaacctctcttcttcctagcggc cagcggctgcctctgcagaccccagagtcagtccagccaagaacctacacggaggaggaa ctgaatgccaagctgacccggcgtgtgcaaaaggcagctcggagacaggccaagcaggag gagcttaagcggctgcatcgagcccagatcatccagcggcagctgcagcaggtggaggag aggcagcggcggctggaggaaaggggcgtggctgtggagaaggcgctccggggcgaagca gagccctcaacaccgctggtcggggagcatgtggccaccctcggcacacacagatttcca 
gagggcacccaggccctgtgtctgctgaaggctggtcacttagggcagggctggagaggc tcagggtatgtgaccagccgacgtggagcagactccggggagtggctggcagggccctgt gctcgtgactgctgctcctggcagggaagccctggagaatgtggtgcgctcattcctgcc attccatccagggaacctgcgcggctgcgctcagcatag >gi568815576r:17741779_18006812|GENSCAN_predicted_peptide_5|1321_aa MEERKHETMNPAHVLFDRFVQATTCKGTLKAFQELCDHLELKPKDYRSFYHKLKSKLNYW KAKALWAKLDKRGSHKDYKKGKACTNTKCLIIGAGPCGLRTAIDLSLLGAKVVVIEKRDA FSRNNVLHLWPFTIHDLRGLGAKKFYGKFCAGAIDHISIRQLQLILLKVALILGIEIHVN VEFQGLIQPPEDQENERIGWRALVHPKTHPVSEYEFEVIIGGDGRRNTLEGFRRKEFRGK LAIAITANFINRNTTAEAKVEEISGVAFIFNQKFFQELREATGIDLENIVYYKDDTHYFV MTAKKQSLLDKGVILHDYADTELLLSRENVDQEALLSYAREAADFSTQQQLPSLDFAINH YGQPDVAMFDFTCMYASENAALVREQNGHQLLVALVGDSLLEPFWPMGTGIARGFLAAMD SAWMVRSWSLGTSPLEVLAERESIYRLLPQTTPENVSKNFSQYSIDPVTRYPNINVNFLR PSQVRHLYDTGETKDIHLEMESLVNSRTTPKLTRNESVARSSKLLGWCQRQTDGYAGVNV TDLTMSWKSGLALCAIIHRYRPDLIDFDSLDEQNVEKNNQLAFDIAEKELGISPIMTGKE MASVGEPDKLSMVMYLTQFYEMFKDSLPSSDTLDLNAEEKAVLIASTRSPISFLSKLGQT ISRKRSPKEEAPRGHRGERPTLVSTLTDRRMDVAVGNQNKVKYMATQLLAKFEENAPAQS IGIRRQLTQERGASQPSCCLPGQVRPAPTPRWKQTYRDLDADNRGKQSPHHERPEPEPPR RFFVDQWELSLSLRSSARPASPSSDSLRQKYIKMYTGGVSSLAEQIANQLQRKEQPKALL DKKELTLSCRYFPASPGCIIALLKSGVKHCDAGSLVSSRLAQAFAVVPSVLCQCVEAVTV AVPSLCGRAKSTVDRSQLEVCLKADPLGSMKKEFPQNLGGSDTCYFCQKRVYVMERLSAE GKFFHRSCFKCEYCATTLRLSAYAYDIEDGKFYCKPHYCYRLSGYAQRKRPAVAPLSGKE AKGPLQDGATTDANGRANAVASSTERTPVIIIIVKLFLKLLVPGSENHGSLTLGSRTPGQ GFGDAQQRTGSTPVRRHCTLAATVKGRELTTSDLVESWLVGLFGVTSPKNRLEPFSRRYQ LMCSLTRGVVKSPRQGELELCVDLAQWPEPVDAAFQEYAHSGGEILLDTRLAASLMFPGH SPGSGVNGLEEPSIAKRLRGTPERIELENYRLSLRQAEALQEVPEETQAEHNLSSVLDTG AEEDVASRASSISEGFKVFSRLQMIKRAKNQKGVRCPVGLQDRNTGCKPEGQRPQVALES I >gi568815576r:17741779_18006812|GENSCAN_predicted_CDS_5|3966_bp atggaggagaggaagcatgagaccatgaacccagctcatgtcctctttgaccggtttgtc caggccaccacctgcaagggaaccctcaaggctttccaggagctctgtgaccacctggaa ctaaagccaaaggactaccgctccttctatcacaagctcaagtccaagcttaactactgg aaagccaaagccctctgggcaaaattggacaaacggggcagtcacaaagactacaaaaag 
ggaaaagcgtgcactaacaccaagtgtctcatcattggggctggcccctgtggtctccgt acagccatcgacttatccttactgggggccaaggtggttgttattgagaaacgagatgcc ttctcccgcaacaacgtcttgcatctctggccattcaccatacatgatctacgaggtctg ggtgccaagaagttctatggcaagttctgtgctggagccatcgaccatatcagtatccgt cagctccaactaatacttttgaaagtagccttgatcctaggcattgaaatccacgtcaat gtggaattccaaggacttatacagcctcctgaggaccaagagaatgaacggataggctgg cgggcactggtgcaccccaagactcatcctgtgtcagagtatgaatttgaagtgatcatc ggtggggatggtcggaggaacaccttggaagggtttcgtcggaaagaattccgtggcaaa ctggccatcgccatcacggcaaattttatcaaccgaaatacaacagcagaagctaaagtg gaagagatcagtggtgtggcttttatattcaaccaaaaatttttccaggaactgagggaa gccacaggtattgacttggagaacatcgtttactacaaagatgacacacactatttcgtt atgacagccaaaaagcagagtttgctggacaaaggagtgatactacatgactacgccgac acagagctcctgctttcccgagaaaacgtggaccaggaggctctgctcagctatgccagg gaggcggcagacttctctacccagcagcagctgccgtctctggattttgccatcaatcac tatgggcagcccgatgtggccatgtttgacttcacttgtatgtatgcctccgagaacgcc gccttggtgcgggagcagaacggacaccagttactagtggctctggtcggggacagcctc ctagagcctttctggccaatgggaacaggaatagcccggggctttctagctgctatggac tctgcctggatggtccgaagttggtctctaggaacgagccctttggaagtgctggcagag agggaaagtatttacaggttgctgcctcagaccacccctgagaatgtgagtaagaacttc agccagtacagtatcgaccctgtcactcggtatcccaatatcaacgtcaacttcctccgg ccaagccaggtgcgccatttatatgatactggcgaaacaaaagatattcacctggaaatg gagagcctggtgaattcccgaaccacccccaaattgactcgcaatgagtctgtagctcgt tcaagcaaactgctgggttggtgccagaggcagacagatggctatgcaggggtaaacgtg acagatctcaccatgtcctggaaaagtggcttggccctttgtgcaattatccatagatac cgccctgacctgatagattttgattctttggatgagcaaaatgtggagaagaataaccaa ctggcctttgacattgctgagaaggaattgggcatttctcccatcatgacaggcaaagaa atggcctccgtgggggagcctgataagctgtccatggtgatgtacctgactcagttctac gagatgtttaaggactccctcccctctagcgacaccttggacctaaatgccgaggagaaa gcagtcctgatagccagcaccagatcccctatctccttcctaagcaaacttggccagacc atctctcggaagcgttctcccaaggaggaagctcctcggggccacagaggagaaagaccg accctggtgagcactctgacagacaggaggatggacgttgccgttgggaaccagaacaaa gtgaagtacatggcgacccagctgctggccaaatttgaagagaatgcgcccgcacagtcc 
atcggcatacggagacagctgacacaagagcgtggggccagccagccgtcctgctgcctg cctgggcaggttcgccctgcccccaccccccggtggaaacagacatacagggatcttgat gctgacaaccgtggcaagcagagcccccaccatgagaggccagagcctgaacctcctcgt cgattttttgtcgaccagtgggagctttctcttagtctccgctcctctgcccgccccgcc tctccctcctccgactccctccgacagaagtatataaagatgtacacgggcggagtgagc tcattggctgagcagatagccaatcagcttcaaaggaaagaacaacccaaagcccttcta gacaagaaggaactgactctttcttgtcgatatttcccggcgtcgccaggctgtatcata gcattactcaagtctggtgtgaagcactgtgatgctggctccctcgtcagcagcaggctg gcccaggctttcgcggttgtgccaagtgttctctgtcagtgcgtggaggcagtgactgtg gcggtgccgtcactctgtggcagagccaagagcacagttgaccgtagccagttagaagtc tgtctcaaagcagaccccttaggctccatgaagaaggagttcccgcagaacctgggaggc agcgacacatgctacttctgccagaagcgggtctacgtgatggagaggctgagtgccgag ggcaagttcttccaccggagctgcttcaagtgcgagtactgcgccaccaccctgcgcctc tcggcctacgcctacgacatcgaggatggtaaattctactgtaagccacactactgctat cgactctctggctacgcacaaaggaagagaccggcagtggctcccctgtctggaaaggag gccaaaggacccctgcaggatggcgccaccacagatgcaaacggacgggccaacgccgtg gccagctccactgagagaaccccagtaattataataattgtgaagctgtttctaaagctg cttgttcctggttctgagaaccacgggagcctcacactggggagccgtactccaggtcag ggctttggggatgcacagcagcgtacgggcagcacgcctgtaaggaggcattgcaccttg gcagccacggtgaagggtcgtgagcttaccacgtcagaccttgtggagagttggttggtc ggcctgtttggggtgacttctcctaaaaataggctagagcccttctcccgcagataccag ttgatgtgcagcttgaccagaggagtggtgaagtccccgaggcagggtgagctggagctc tgtgtggacctggctcaatggcctgagcctgtggatgcagctttccaggagtatgcacac agtggtggggaaatattactggacaccagacttgctgcttcactcatgtttcctgggcac tccccaggttcaggcgtgaacggcctggaggagcccagcatcgccaagcgactgaggggc accccagagcggatcgagctggagaactaccgcctgtccctgaggcaggctgaggcactg caggaggtaccggaggagactcaggccgagcacaacctgagcagcgtgctggacacgggc gccgaggaggacgtcgccagcagggcttccagtatcagcgaaggcttcaaagtgttcagc cgattacagatgatcaagagagccaagaatcaaaagggggtccggtgtcctgtggggctt caggacagaaacacaggctgcaaacccgaggggcagaggcctcaagtggccttggagagc atttga >gi568815576r:17741779_18006812|GENSCAN_predicted_peptide_6|565_aa MRNSRERSEENQPAASQKSKEQKVRECQSWQGLKELKKSCYCVRCQPQLPESCRANQQAS 
FNSPAEEISTCRALVTMDEQRQDGSWNNATGTQVFCILVNNTASYPDTQPKTLPPSAFLS HPTVRSSENSVFYLQIQPNPTTSHHVFSIILIETPPALTCSLHEPPLLLLLQTHPPEYST KQPEHPFKTPIRHVDCHIPAPVQHGWGGLKELMIIVEGEANTSFFTWQQQGEVQHEEGEK PLIKASDLKRTHYHKKSMENNEIDQIATGGKSPGIRISHKLDIGLWILESYPLARNQNGL GSFLVNIPTTLGAEHLREQGEGSQAPGLPPVTGPGTQLDSSFPEAAPPNAPSSGGSTSAT DRVPGIPVSSGLSNHTSASGDQIKKLTQNHTTTWKLNNLLLNDYWLHNKMKAEIKMFFET NENKDTRYQNLWDTFKAECRGKFIALNAHKRKQERSKIDTLTSQLKELEKQEQTHSKASR RQEITKIRAELKEITDTKKPFKKINESKNWFFETINKIDRALARLIKKKREKNQTDTIKN DKGNITTDPTEIQTIIREYYKHFYA >gi568815576r:17741779_18006812|GENSCAN_predicted_CDS_6|1698_bp atgaggaacagtagggagaggtctgaagagaaccaaccagcagcatcacagaagtctaag gagcagaaagttcgagagtgtcaaagttggcagggattaaaagaactgaagaagtcctgc tattgcgtaagatgtcagccacagctcccagagagctgcagagccaaccagcaggcaagt tttaattccccggccgaagagatttcgacctgcagggcactggtgacaatggatgaacag aggcaggatggctcctggaacaatgcgaccggcacccaagtgttctgcatcttggtaaac aacaccgccagctacccggacactcaacccaaaaccttgcccccgtctgcatttctgtca catcccacggtccgttcttcagaaaattctgtgttctaccttcaaatacagccaaatcca accacttctcaccatgtcttctctatcatcctcattgagacaccaccagctctcacttgc tcattgcatgagcctcccctacttctgctcctccaaacccaccctccagaatattccaca aagcagccagagcatccatttaaaacacccatcagacacgttgactgccacatcccagcc ccagttcagcacggctggggaggactcaaggaacttatgatcatagtggaaggggaagca aacacatccttcttcacatggcagcaacaaggagaagtgcagcacgaagagggagagaag ccccttataaaagcatcagatctcaagagaactcactatcacaagaaaagcatggagaac aatgaaattgaccaaatagcaacggggggaaagagtcctgggatccgaataagccacaaa ctggatattgggctctggatcctggagagctaccctctggccagaaaccaaaatgggctt ggctcattcctggtaaatattccaaccacactaggtgccgagcacctgagagagcaaggg gaagggagccaggccccaggcctgccaccggtcactgggccggggacccaactggacagc agcttcccagaggcagcgcctcccaacgccccctcctccggtggctccacctccgcaaca gaccgggttccgggcatccctgtgagctcgggcctgtccaaccacaccagtgcttcaggc gaccagattaagaaactcactcaaaaccacacaactacatggaaactgaacaacctgctc ttgaatgactactggctacataacaaaatgaaagcagaaataaagatgttctttgaaacc aatgagaacaaagacacaaggtaccagaatctctgggacacatttaaagcagagtgtaga gggaaatttatagcactaaatgcccacaagagaaagcaggaaagatctaaaatcgacacc 
ctaacatcacaattaaaagaactagagaagcaagagcaaacacattcaaaagctagcaga aggcaagaaataactaagatcagggcagaactgaaagagattacagacacaaagaaaccc ttcaaaaaaatcaatgaatccaagaactggttttttgaaacgatcaacaaaattgataga gcactagcaagattaataaagaagaaaagagagaagaatcaaacagacacaataaaaaat gataaagggaatatcaccaccgatcccacggaaatacaaactatcatcagagaatactat aaacacttctatgcataa >gi568815576r:17741779_18006812|GENSCAN_predicted_peptide_7|270_aa MQQPFMLKTLNKLGIDGTYLKIIRVIYDKPTANIILNGQKLEAFPLKTSTRQGCPLSPLL FNTVLEVLDRAIRQEKEIKGIQLGKEEVKLSPFADDMIVYLENPTVSAPNLLKLISNFSK VSGYKINVQKSQAFLYTNNRQTESQIMSEFPFTIATKRLKYLGIQLTRDVKDLFKENYKP LLNEIKEDTNKWKNIPCSWIGRINIMKINAIPIKLRMTFFTELEKTTLKFIWNQKRARIA KTILSKKNKAGGITLSDFKLYYKATVTKTT >gi568815576r:17741779_18006812|GENSCAN_predicted_CDS_7|813_bp atgcaacagcccttcatgctaaaaactctcaataaactaggtattgatggaacttatctg aaaataataagagttatttatgacaaacccacagccaatatcatactgaatgggcaaaaa ctggaagcattccctttgaaaaccagcacaagacaaggatgccctctctcaccactccta ttcaacacagtgttggaagttctggacagggcaatcaggcaagagaaagaaataaagggt attcaattaggaaaagaggaagtcaaattgtccccgtttgcagatgacatgattgtatat ttagaaaaccccactgtttcagccccaaatctccttaagctgataagcaacttcagcaaa gtctcaggatacaaaatcaatgtgcaaaaatcacaagcattcttatacaccaataataga caaaccgagagccaaatcatgagtgaattcccattcacaattgctacaaagagattaaaa tacctaggaatccaacttacaagggatgtgaaggacctcttcaaggagaactacaaacca ctgctcaatgaaataaaagaggacacaaacaaatggaagaatattccatgctcatggata ggaagaatcaatatcatgaaaattaatgccatccccatcaagctacgaatgactttcttc acagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcccgcattgcc aagacaatcctaagcaaaaaaaacaaagctggagggatcacgctatctgacttcaaacta tactacaaggctacagtaaccaaaacaacatga >gi568815576r:17741779_18006812|GENSCAN_predicted_peptide_8|100_aa MPVTLLGSTRPAYSRCTRSSCSLGPASKTVPAKTASTHPEDSGPPSMAKLPMRQCFPFMK SPHIPAAEHLTPGIHKTQTLLLRILGSSWKSRNNTGEDRI >gi568815576r:17741779_18006812|GENSCAN_predicted_CDS_8|303_bp atgcccgtgacgctgctgggcagcaccaggccagcctacagccgctgcacgcgttcatct tgcagcctgggccctgccagcaagactgtgcccgccaagacagcctctacacacccagag gacagtgggccccccagcatggccaaactgccaatgagacaatgttttcccttcatgaag 
agtcctcacataccagctgcagagcatttgacaccaggcatccacaagacacagaccctg ctcctcagaatcttgggctcatcgtggaaatcaaggaacaatacaggggaagacagaata tga >gi568815576r:17741779_18006812|GENSCAN_predicted_peptide_9|547_aa MPRLPKVLGLQALATAPGQQVPFKYILSSLRGQALICPPGFSIGHFMWLLAPSKIPNQSV LSRLPLGFEECTIWILQGDGEDEGRGDSLFLCLLHISREILARLSVQVAMESLRTGAACQ FSRCTEISRPRGWGPVVRPEGHNPRPSMCPPIPRLRRFSGCYLPRRTLGLRNRAALARHS DAGAEGHRSMRQRQSLYSRDEDELGFKLRKTSSYSRCRLRESPAFWLPGRTQASSLWLPS PVSTSLQPSQRMPFIPYLIARCTSQGRGGFMKVYYDLVSAHDKPASGEPGTVIEKVPSDS AVSVFEASISPIALARSTCHSQTAREGYIPCFLLFLKSSYYPLSERLAGGQWGWKGSVVA ISFTPCAASTIMTYVWLRPGPHDDQAGVPGPVGKDHMLSYIPYRRLVSDTCGRVLCAQIV TGPKSRSSHKSQGCSLGPVALAAVSSASSYCVHGWETGLCIFGLNHIRHFQIGFPMPTLV SLGVLAAGNQEPLVTHGSDVILPCSDSCSGSPRTPVNAKAARTRLQDTCTRMYTEAYIHT GEEDMNR >gi568815576r:17741779_18006812|GENSCAN_predicted_CDS_9|1644_bp atgcctcggctccccaaagtgttgggattacaggcgttagccaccgcgcccggccaacaa gtcccctttaaatacatcctctcttctctccgggggcaggctttgatatgcccaccgggt ttctccattggccacttcatgtggcttctcgccccttccaagattcccaaccaatcagtt ctttcccgtctcccgttgggctttgaggaatgtaccatatggatcctgcagggggacgga gaggacgaagggcgtggcgacagcttatttctttgccttttgcacatatcccgtgagatt ctggcaagactgagcgtgcaagtagcaatggagtccctgaggacgggggccgcttgccag ttttcgaggtgtactgagatatctcgcccccgaggctggggtcccgtggtccggcccgag ggccacaacccccgcccttccatgtgcccgcccatcccccggctgcgccgcttctccggc tgctacctaccgcgccggacgctcgggctgcggaacagggcggcactggcccgccacagc gacgccggcgccgagggacaccgcagtatgaggcagagacagtcgttgtactccagggat gaggatgaactgggttttaagctgagaaagacatcttcctacagtcgctgccgcctgcgg gaaagccccgccttctggttacctggccgcacacaggccagttcattgtggttaccttct cccgtgagcacatctctgcagccctctcagaggatgccgttcattccatatttgattgcc agatgtacttctcagggcaggggcggcttcatgaaagtgtactacgaccttgtctctgct catgacaaacctgcctcaggagagccagggacagtgatcgagaaagttcccagtgacagt gctgtttctgtctttgaagctagcatctctccaattgctcttgctcggagtacctgccat agccagactgctagagagggctacatcccatgcttcctcctgttcctcaagagctcatat tacccactttcagaaagacttgctgggggccagtggggctggaaggggtcggttgtggcc atcagcttcacaccctgtgctgcttccactatcatgacctatgtgtggctccgtcctggg 
cctcatgatgatcaagccggtgtcccaggccctgttgggaaggatcatatgctcagttac ataccctacagaaggttggtcagtgatacatgtggacgtgtgctgtgtgcgcagatagtg acaggacccaaaagcagatcatcacataagagtcagggctgttcgctggggcctgtggcg ttggcagcagtgtcctcagccagcagctactgtgtacatggctgggagacggggctttgc atatttggcctcaatcatataaggcacttccagattggcttccctatgcccaccctggtg tccctaggtgtgctggcagctggcaaccaggagcccctggtaacacacggtagcgatgtc attcttccctgctcagactcctgcagtggttcccctcgcaccccagtcaatgccaaagct gccaggaccaggcttcaggacacatgcacacgcatgtacacagaggcctacatacacaca ggggaagaggacatgaatcgctga