GENSCAN 1.0 Date run: 8-Nov-116 Time: 12:20:09 Sequence gi568815592r:44154709_44356881 : 202173 bp : 50.19% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 84 171 88 1 1 55 72 137 0.443 7.83 1.02 Intr + 7082 7125 44 2 2 101 42 46 0.011 -0.82 1.03 Intr + 12050 12121 72 0 0 129 57 11 0.609 1.48 1.04 Intr + 14573 14823 251 0 2 80 86 281 0.673 24.26 1.05 Intr + 15198 15267 70 2 1 119 109 36 0.999 7.55 1.06 Intr + 17594 17712 119 0 2 73 72 164 0.677 13.48 1.07 Intr + 18232 18365 134 0 2 62 89 209 0.997 17.84 1.08 Intr + 18510 18678 169 0 1 93 74 101 0.917 9.15 1.09 Intr + 21360 21443 84 2 0 142 76 77 0.812 11.92 1.10 Intr + 21545 21630 86 1 2 109 96 101 0.999 11.62 1.11 Intr + 21873 21947 75 0 0 86 65 111 0.921 7.23 1.12 Intr + 22130 22290 161 2 2 94 68 192 0.999 17.43 1.13 Intr + 22534 22716 183 2 0 107 -14 281 0.602 19.56 1.14 Intr + 24911 25062 152 0 2 130 58 56 0.598 6.68 1.15 Intr + 25250 25455 206 1 2 7 91 271 0.972 17.30 1.16 Intr + 26471 26612 142 2 1 75 58 111 0.423 7.16 1.17 Intr + 28233 28311 79 2 1 105 37 84 0.919 4.12 1.18 Intr + 28411 28527 117 2 0 116 70 169 0.994 18.34 1.19 Intr + 31310 31350 41 0 2 101 80 48 0.817 3.24 1.20 Intr + 37647 37808 162 2 0 87 62 96 0.289 7.07 1.21 Intr + 60350 60378 29 1 2 122 119 2 0.101 3.61 1.22 Term + 60398 60449 52 2 1 96 45 46 0.111 -2.00 1.23 PlyA + 60705 60710 6 1.05 2.00 Prom + 61774 61813 40 -7.76 2.01 Init + 62764 62999 236 0 2 80 50 364 0.607 27.41 2.02 Intr + 66878 66977 100 2 1 92 75 25 0.544 1.71 2.03 Intr + 72555 72634 80 2 2 84 110 39 0.798 4.05 2.04 Intr + 74682 74763 82 0 1 109 94 28 0.730 5.14 2.05 Intr + 74881 75083 203 0 2 54 105 225 0.977 18.88 2.06 Intr + 75199 75338 140 1 2 102 61 279 0.999 26.71 2.07 Intr + 75639 75773 135 1 0 92 100 142 0.911 16.34 2.08 Intr + 75860 75957 98 0 2 58 105 84 0.965 6.83 2.09 Intr + 76103 76181 79 1 1 93 63 166 0.997 13.72 2.10 Intr + 76656 76753 98 1 2 85 63 80 0.993 4.93 2.11 Intr + 77290 77398 109 0 1 67 60 123 0.847 7.26 2.12 Intr + 77635 77720 86 2 2 131 106 64 0.999 12.04 2.13 Intr + 78099 78298 200 2 2 70 89 385 0.984 34.95 2.14 Term + 78709 78820 112 1 1 82 48 95 0.974 2.93 2.15 PlyA + 79412 79417 6 1.05 3.00 Prom + 81252 81291 40 -7.76 3.01 Init + 81388 81486 99 0 0 86 67 94 0.966 5.37 3.02 Intr + 82636 82826 191 0 2 62 55 139 0.290 6.38 3.03 Intr + 84075 84203 129 0 0 77 38 73 0.062 1.01 3.04 Intr + 89560 89678 119 1 2 42 96 74 0.042 3.71 3.05 Intr + 91690 91910 221 2 2 43 53 156 0.208 5.62 3.06 Intr + 93922 94068 147 0 0 19 45 147 0.525 3.93 3.07 Intr + 94669 94875 207 0 0 57 72 260 0.997 20.57 3.08 Intr + 94967 95126 160 1 1 70 115 190 0.998 19.46 3.09 Intr + 95313 95446 134 1 2 85 73 95 0.996 8.06 3.10 Intr + 95583 95891 309 2 0 59 70 535 0.999 45.31 3.11 Intr + 96340 96505 166 0 1 85 70 160 0.999 13.43 3.12 Intr + 96710 96900 191 0 2 51 93 216 0.999 17.70 3.13 Intr + 97029 97176 148 2 1 48 78 174 0.999 12.21 3.14 Intr + 97291 97559 269 2 2 74 121 449 0.999 44.05 3.15 Intr + 98337 98670 334 2 1 91 97 493 0.997 45.64 3.16 Term + 98781 98890 110 1 2 67 39 216 0.999 13.27 3.17 PlyA + 99150 99155 6 1.05 4.05 PlyA - 99412 99407 6 -1.75 4.04 Term - 100936 99998 939 1 0 111 55 813 0.901 72.41 4.03 Intr - 101788 101634 155 2 2 89 94 148 0.990 15.29 4.02 Intr - 102170 101977 194 1 2 109 82 152 0.944 15.84 4.01 Init - 103026 102692 335 0 2 82 75 157 0.908 8.57 4.00 Prom - 103221 103182 40 -11.82 5.07 PlyA - 103479 103474 6 1.05 5.06 Term - 104576 104511 66 2 0 93 48 94 0.985 3.84 5.05 Intr - 105574 105335 240 1 0 104 100 286 0.991 29.15 5.04 Intr - 105831 105743 89 1 2 102 94 89 0.995 10.59 5.03 Intr - 107140 106918 223 1 1 89 70 116 0.992 7.60 5.02 Intr - 107954 107852 103 1 1 131 77 38 0.990 7.18 5.01 Init - 110638 110274 365 1 2 80 94 380 0.999 34.32 5.00 Prom - 111559 111520 40 -6.06 6.00 Prom + 111627 111666 40 -5.66 6.01 Init + 114025 114068 44 0 2 66 72 -3 0.496 -4.18 6.02 Intr + 115962 116169 208 0 1 122 78 237 0.853 25.18 6.03 Intr + 118373 118798 426 1 0 26 94 714 0.696 59.79 6.04 Term + 120695 121819 1125 1 0 104 43 2000 0.871 189.11 6.05 PlyA + 124712 124717 6 1.05 7.26 PlyA - 125363 125358 6 -3.64 7.25 Term - 125682 125473 210 0 0 110 38 278 0.987 22.29 7.24 Intr - 127851 127402 450 0 0 91 93 604 0.933 54.70 7.23 Intr - 131824 131256 569 2 2 84 79 803 0.946 71.50 7.22 Intr - 133125 132841 285 1 0 33 70 156 0.051 5.61 7.21 Intr - 143247 143118 130 0 1 100 65 60 0.321 5.17 7.20 Intr - 146558 146448 111 2 0 102 94 89 0.967 11.48 7.19 Intr - 146756 146673 84 2 0 139 70 90 0.999 12.32 7.18 Intr - 147462 147352 111 0 0 55 94 173 0.995 15.18 7.17 Intr - 147805 147683 123 1 0 144 53 110 0.999 13.88 7.16 Intr - 148202 148094 109 1 1 139 76 215 0.933 25.69 7.15 Intr - 148467 148358 110 0 2 93 86 32 0.999 2.58 7.14 Intr - 148715 148578 138 2 0 92 105 171 0.999 19.86 7.13 Intr - 149613 149473 141 0 0 -37 99 150 0.911 4.15 7.12 Intr - 149825 149712 114 2 0 85 94 103 0.993 11.24 7.11 Intr - 150109 149937 173 2 2 96 105 184 0.990 20.56 7.10 Intr - 150490 150346 145 1 1 58 81 110 0.765 7.16 7.09 Intr - 151078 150945 134 2 2 91 87 89 0.999 9.46 7.08 Intr - 151683 151572 112 0 1 69 107 139 0.999 13.85 7.07 Intr - 152323 152215 109 0 1 53 95 86 0.978 6.09 7.06 Intr - 152686 152541 146 1 2 96 86 126 0.996 12.28 7.05 Intr - 155735 155591 145 1 1 72 109 179 0.999 18.68 7.04 Intr - 156453 156286 168 2 0 128 90 94 0.999 12.66 7.03 Intr - 156827 156682 146 2 2 111 110 144 0.999 17.98 7.02 Intr - 157555 157364 192 1 0 91 113 123 0.979 14.79 7.01 Init - 158615 158373 243 2 0 83 94 288 0.998 24.63 7.00 Prom - 164380 164341 40 -5.06 8.00 Prom + 171935 171974 40 -3.76 8.01 Init + 189075 189150 76 2 1 77 92 57 0.376 6.25 8.02 Intr + 195474 195647 174 1 0 46 66 83 0.071 1.81 8.03 Intr + 198018 198165 148 1 1 125 89 1 0.080 3.19 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 2371 2456 86 2 2 124 55 48 0.902 2.92 S.002 Intr - 132571 132482 90 2 0 110 75 15 0.822 2.37 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592r:44154709_44356881|GENSCAN_predicted_peptide_1|838_aa XEELLMPPDALTDTDFQSCEDSLIENEIHQPPGELNTLSPRSSQGPSLPESAESLDGSQE DKPRGSCAEPTFTDTGMVAHINNSRLKAKGVGQHDNAQNFGNQSFEELRAACLRKGELFE DPLFPAEPSSLGFKDLGPNSKNVQNISWQRPKDIINNPLFIMDGISPTDICQGILGDCWL LAAIGSLTTCPKLLYRVVPRGQSFKKNYAGIFHFQIWQFGQWVNVVVDDRLPTKNDKLVF VHSTERSEFWSALLEKAYAKLSGSYEALSGGSTMEGLEDFTGGVAQSFQLQRPPQNLLRL LRKAVERSSLMGCSIEVTSDSELESMTDKMLVRGHAYSVTGLQDVHYRGKMETLIRVRNP WGRIEWNGAWSDSAREWEEVASDIQMQLLHKTEDGEFWMSYQDFLNNFTLLEICNLTPDT LSGDYKSYWHTTFYEGSWRRGSSAGGCRNHPGTFWTNPQFKISLPEGDDPEDDAEGNVVV CTCLVALMQKNWRHARQQGAQLQTIGFVLYAVGPKRGTEDKDVGLVPGHIPTPPLAASGL AKGGIWREALGVTGLGFQPSSGLNIQDVHLKKEFFTKYQDHGFSEIFTNSREVSSQLRLP PGEYIIIPSTFEPHRDADFLLRVFTEKHSESWNPRCFPIPWDAFFTPSLLSLTTFQKDGS GKLGLLEFKILWKKLKKWMDIFRECDQDHSGTLNSYEMRLVIEKAGIKLNNKVMQVLVAR YADDDLIIDFDSFISCFLRLKTMFRTLEPKHMRFSASQRRQTLGFTGLQNPVGNLDHHLI HQKGPGFPVSVSDTARKDVQDSPEHCGWLTPQVEVGQAFLTWVLHADGDDDASGGWWG >gi568815592r:44154709_44356881|GENSCAN_predicted_CDS_1|2517_bp natgaggagttgctgatgccacccgacgccctcacggacacagacttccagtcttgcgag gacagcctcatagagaatgagattcaccagcccccaggagagctcaacaccctcagtcct cggagcagccaggggccgagtcttccggagtcagcagagagcctggatggatcacaggag gataagcctcggggctcatgtgcggagcccacttttactgatacgggaatggtggctcac ataaacaacagccggctcaaggccaagggcgtgggccagcacgacaacgcccagaacttt ggtaaccagagctttgaggagctgcgagcagcctgtctaagaaagggggagctcttcgag gaccccttattccctgctgaacccagctcactgggcttcaaggacctgggccccaactcc aaaaatgtgcagaacatctcctggcagcggcccaaggatatcataaacaaccctctattc atcatggatgggatttctccaacagacatctgccaggggatcctcggggactgctggctg ctggctgccatcggctcccttaccacctgccccaaactgctataccgcgtggtgcccaga ggacagagcttcaagaaaaactatgctggcatcttccattttcagatttggcagtttgga cagtgggtgaacgtggtggtagatgaccggctgcccacaaagaatgacaagctggtgttt gtgcactcaaccgaacgcagtgagttctggagtgccctgctggagaaggcgtatgccaag ctgagtgggtcctatgaagcattgtcagggggcagtaccatggagggccttgaggacttc acaggaggcgtggcccagagcttccaactccagaggccccctcagaacctgctcaggctc cttaggaaggccgtggagcgatcctccctcatgggttgctccattgaagtcaccagtgat agtgaactggaatccatgactgacaagatgctggtgagagggcacgcttactctgtgact ggccttcaggatgtccactacagaggcaaaatggaaacactgattcgggtccggaatccc tggggccggattgagtggaatggagcttggagtgacagtgccagggagtgggaagaggtg gcctcagacatccagatgcagctgctgcacaagacggaggacggggagttctggatgtcc taccaagatttcctgaacaacttcacgctcctggagatctgcaacctcacgcctgataca ctctctggggactacaagagctactggcacaccaccttctacgagggcagctggcgcaga ggcagctccgcagggggctgcaggaaccaccctggcacgttctggaccaacccccagttt aagatctctcttcctgagggggatgacccagaggatgacgcagagggcaatgttgtggtc tgcacctgcctggtggccctaatgcagaagaactggcggcatgcacggcagcagggagcc cagctgcagaccattggctttgtcctctacgcggtgggtcccaaaagaggtacagaagat aaagatgtggggcttgttcctggacacatacccactccacccctggctgcaagtggattg gcaaagggtggaatttggcgagaggctctgggggtcactgggttaggattccagccctct tcagggctgaacattcaggatgtccacttgaagaaggaattcttcacgaagtatcaggac cacggcttctcagagatcttcaccaactcacgggaggtgagcagccaactccggctgcct ccgggggaatatatcattattccctccacctttgagccacacagagatgctgacttcctg cttcgggtcttcaccgagaagcacagcgagtcatggaacccccgctgcttccctatcccc tgggatgccttcttcactccttctctgctgtccttgaccaccttccagaaagatggctct ggcaagctggggcttctagagttcaagatcctgtggaaaaaactcaagaaatggatggac atcttcagagagtgtgaccaggaccattcaggcaccttgaactcctatgagatgcgcctg gttattgagaaagcaggcatcaagctgaacaacaaggtaatgcaggtcctggtggccagg tatgcagatgatgacctgatcatagactttgacagcttcatcagctgtttcctgaggcta aagaccatgttcaggacactggagcccaagcacatgcgcttctcagcctctcagagacgt caaacgctaggtttcacagggcttcagaatcctgttgggaacctggatcatcacctcatc caccaaaagggtcctggtttccctgtttctgtaagtgacacagccaggaaggacgtccag gactcgcctgagcattgtggctggctgacgcctcaggtagaagtgggacaggcctttttg acctgggtcttacatgctgatggagatgatgatgctagtggcggctggtggggatga >gi568815592r:44154709_44356881|GENSCAN_predicted_peptide_2|585_aa MPTPLLPLLLRLLLSCLLLPAARLARQYLLPLLRRLARRLGSQDMREALLGCLLFILSQR HSPDAGEASRVDRLERRERDLRELREGEKPEDQAETEESWQGLARKTPGKACAPEGGSCQ PGKTENTITMTTSHQPQDRYKAVWLIFFMLGLGTLLPWNFFMTATQYFTNRLDMSQNVSL VTAELSKDAQASAAPAAPLPERNSLSAIFNNVMTLCAMLPLLLFTYLNSFLHQRIPQSVR ILGSLVAILLVFLITAILVKVQLDALPFFVITMIKIVLINSFGAILQGSLFGLAGLLPAS YTAPIMSGQGLAGFFASVAMICAIASGSELSESAFGYFITACAVIILTIICYLGLPRLEF YRYYQQLKLEGPGEQETKLDLISKGEEPRAGKEESGVSVSNSQPTNESHSIKAILKNISV LAFSVCFIFTITIGMFPAVTVEVKSSIAGSSTWERYFIPVSCFLTFNIFDWLGRSLTAVF MWPGKDSRWLPSLVLARLVFVPLLLLCNIKPRRYLTVVFEHDAWFIFFMAAFAFSNGYLA SLCMCFGPKKVKPAEAETAGAIMAFFLCLGLALGAVFSFLFRAIV >gi568815592r:44154709_44356881|GENSCAN_predicted_CDS_2|1758_bp atgcccacgccactgctcccgctgctgcttcgattgctgctgtcctgcctgctgctgcct gctgcccgcctggcccgccaatacctcctgcccctgctgcgccgattggcccgccgcctg ggctcccaggacatgcgagaggctttgctgggctgtctgctgttcattctcagccagcga cactcgccagacgctggggaggcctcaagagtggaccgcctggagaggagggagagggac ctgagggagctcagggagggagagaagccagaagaccaggcagagactgaagagagctgg caaggcctggctagaaaaactccaggcaaggcctgtgcccctgagggagggagctgtcag ccagggaaaaccgagaacaccatcaccatgacaaccagtcaccagcctcaggacagatac aaagctgtctggcttatcttcttcatgctgggtctgggaacgctgctcccgtggaatttt ttcatgacggccactcagtatttcacaaaccgcctggacatgtcccagaatgtgtccttg gtcactgctgaactgagcaaggacgcccaggcgtcagccgcccctgcagcacccttgcct gagcggaactctctcagtgccatcttcaacaatgtcatgaccctatgtgccatgctgccc ctgctgttattcacctacctcaactccttcctgcatcagaggatcccccagtccgtacgg atcctgggcagcctggtggccatcctgctggtgtttctgatcactgccatcctggtgaag gtgcagctggatgctctgcccttctttgtcatcaccatgatcaagatcgtgctcattaat tcatttggtgccatcctgcagggcagcctgtttggtctggctggccttctgcctgccagc tacacggcccccatcatgagtggccagggcctagcaggcttctttgcctccgtggccatg atctgcgctattgccagtggctcggagctatcagaaagtgccttcggctactttatcaca gcctgtgctgttatcattttgaccatcatctgttacctgggcctgccccgcctggaattc taccgctactaccagcagctcaagcttgaaggacccggggagcaggagaccaagttggac ctcattagcaaaggagaggagccaagagcaggcaaagaggaatctggagtttcagtctcc aactctcagcccaccaatgaaagccactctatcaaagccatcctgaaaaatatctcagtc ctggctttctctgtctgcttcatcttcactatcaccattgggatgtttccagccgtgact gttgaggtcaagtccagcatcgcaggcagcagcacctgggaacgttacttcattcctgtg tcctgtttcttgactttcaatatctttgactggttgggccggagcctcacagctgtattc atgtggcctgggaaggacagccgctggctgccaagcctggtgctggcccggctggtgttt gtgccactgctgctgctgtgcaacattaagccccgccgctacctgactgtggtcttcgag cacgatgcctggttcatcttcttcatggctgcctttgccttctccaacggctacctcgcc agcctctgcatgtgcttcgggcccaagaaagtgaagccagctgaggcagagaccgcagga gccatcatggccttcttcctgtgtctgggtctggcactgggggctgttttctccttcctg ttccgggcaattgtgtga >gi568815592r:44154709_44356881|GENSCAN_predicted_peptide_3|977_aa MAPDPKWALAGPCPLGSRGLLWILHFPASTLLPVGTEGGNGQRPTPSGPNLINGLMSGLD PSSPSRYDLASCVDALGERTCPLGALCSLGSPGARWNSTITVLPFHHLRGVEVRPVNGPE IQPIETCMLKRNDFILGQASLQWVKGSQSCVCISASPGSPHSGGLVSVNEWKTMELVDTA GEAEPRGEGAGPPRQAWKLLEMPQCRRRAFRRMSAKSPRQPRPAPSPYAELPLSANPPPF LYSCESRDLGLPKMPEEVHHGEEEVETFAFQAEIAQLMSLIINTFYSNKEIFLRELISNA SDALDKIRYESLTDPSKLDSGKELKIDIIPNPQERTLTLVDTGIGMTKADLINNLGTIAK SGTKAFMEALQAGADISMIGQFGVGFYSAYLVAEKVVVITKHNDDEQYAWESSAGGSFTV RADHGEPIGRGTKVILHLKEDQTEYLEERRVKEVVKKHSQFIGYPITLYLEKEREKEISD DEAEEEKGEKEEEDKDDEEKPKIEDVGSDEEDDSGKDKKKKTKKIKEKYIDQEELNKTKP IWTRNPDDITQEEYGEFYKSLTNDWEDHLAVKHFSVEGQLEFRALLFIPRRAPFDLFENK KKKNNIKLYVRRVFIMDSCDELIPEYLNFIRGVVDSEDLPLNISREMLQQSKILKVIRKN IVKKCLELFSELAEDKENYKKFYEAFSKNLKLGIHEDSTNRRRLSELLRYHTSQSGDEMT SLSEYVSRMKETQKSIYYITGESKEQVANSAFVERVRKRGFEVVYMTEPIDEYCVQQLKE FDGKSLVSVTKEGLELPEDEEEKKKMEESKAKFENLCKLMKEILDKKVEKVTISNRLVSS PCCIVTSTYGWTANMERIMKAQALRDNSTMGYMMAKKHLEINPDHPIVETLRQKAEADKN DKAVKDLVVLLFETALLSSGFSLEDPQTHSNRIYRMIKLGLGIDEDEVAAEEPNAAVPDE IPPLEGDEDASRMEEVD >gi568815592r:44154709_44356881|GENSCAN_predicted_CDS_3|2934_bp atggctcccgaccccaagtgggcgctcgcaggcccctgccctttggggagcaggggcctc ctctggatcctccactttcctgctagcaccttgctgcctgttggcactgagggtggaaat gggcagcgcccaactccctctggccccaacctcatcaatggccttatgtccgggctggac ccctccagcccatctcggtatgacctggcttcctgtgtagatgctctcggggagaggacg tgcccactcggagccctctgcagccttggcagtcctggtgctcgctggaacagcactatc acagtcctaccgttccatcatctgcgtggggtggaagtgaggccagtgaatggcccagaa atccagcccattgaaacctgcatgctgaaaaggaatgacttcatcctgggacaggccagc ctccagtgggttaagggctcccagtcctgtgtatgcatctctgcatccccaggatctcca cacagtggaggcttagtgagtgtcaatgagtggaaaacgatggagcttgtggacacagcc ggagaggcggagcctcgcggggagggggcgggaccgccgagacaggcctggaaactgctg gaaatgccgcagtgccgccgccgcgccttccgccgcatgtcggcaaagagtccccgccag ccccggccggcgccctccccctacgctgagctgcccctcagcgcgaaccctccgcccttc ctctactcctgcgagagtcgggatctggggctacccaagatgcctgaggaagtgcaccat ggagaggaggaggtggagacttttgcctttcaggcagaaattgcccaactcatgtccctc atcatcaataccttctattccaacaaggagattttccttcgggagttgatctctaatgct tctgatgccttggacaagattcgctatgagagcctgacagacccttcgaagttggacagt ggtaaagagctgaaaattgacatcatccccaaccctcaggaacgtaccctgactttggta gacacaggcattggcatgaccaaagctgatctcataaataatttgggaaccattgccaag tctggtactaaagcattcatggaggctcttcaggctggtgcagacatctccatgattggg cagtttggtgttggcttttattctgcctacttggtggcagagaaagtggttgtgatcaca aagcacaacgatgatgaacagtatgcttgggagtcttctgctggaggttccttcactgtg cgtgctgaccatggtgagcccattggcaggggtaccaaagtgatcctccatcttaaagaa gatcagacagagtacctagaagagaggcgggtcaaagaagtagtgaagaagcattctcag ttcataggctatcccatcaccctttatttggagaaggaacgagagaaggaaattagtgat gatgaggcagaggaagagaaaggtgagaaagaagaggaagataaagatgatgaagaaaaa cccaagatcgaagatgtgggttcagatgaggaggatgacagcggtaaggataagaagaag aaaactaagaagatcaaagagaaatacattgatcaggaagaactaaacaagaccaagcct atttggaccagaaaccctgatgacatcacccaagaggagtatggagaattctacaagagc ctcactaatgactgggaagaccacttggcagtcaagcacttttctgtagaaggtcagttg gaattcagggcattgctatttattcctcgtcgggctccctttgacctttttgagaacaag aagaaaaagaacaacatcaaactctatgtccgccgtgtgttcatcatggacagctgtgat gagttgataccagagtatctcaattttatccgtggtgtggttgactctgaggatctgccc ctgaacatctcccgagaaatgctccagcagagcaaaatcttgaaagtcattcgcaaaaac attgttaagaagtgccttgagctcttctctgagctggcagaagacaaggagaattacaag aaattctatgaggcattctctaaaaatctcaagcttggaatccacgaagactccactaac cgccgccgcctgtctgagctgctgcgctatcatacctcccagtctggagatgagatgaca tctctgtcagagtatgtttctcgcatgaaggagacacagaagtccatctattacatcact ggtgagagcaaagagcaggtggccaactcagcttttgtggagcgagtgcggaaacggggc ttcgaggtggtatatatgaccgagcccattgacgagtactgtgtgcagcagctcaaggaa tttgatgggaagagcctggtctcagttaccaaggagggtctggagctgcctgaggatgag gaggagaagaagaagatggaagagagcaaggcaaagtttgagaacctctgcaagctcatg aaagaaatcttagataagaaggttgagaaggtgacaatctccaatagacttgtgtcttca ccttgctgcattgtgaccagcacctacggctggacagccaatatggagcggatcatgaaa gcccaggcacttcgggacaactccaccatgggctatatgatggccaaaaagcacctggag atcaaccctgaccaccccattgtggagacgctgcggcagaaggctgaggccgacaagaat gataaggcagttaaggacctggtggtgctgctgtttgaaaccgccctgctatcttctggc ttttcccttgaggatccccagacccactccaaccgcatctatcgcatgatcaagctaggt ctaggtattgatgaagatgaagtggcagcagaggaacccaatgctgcagttcctgatgag atcccccctctcgagggcgatgaggatgcgtctcgcatggaagaagtcgattag >gi568815592r:44154709_44356881|GENSCAN_predicted_peptide_4|540_aa MTKHRGHAGRAPLQGWGSPRPGAAQVPGTQAASASGPRGGAVVRRRPGALRGRGRGGGGR GEGGGKSAALPLAAGSLAAPGGGGGSAGGARPGDSHSPVPPPPHAAWTMDARWWAVVVLA AFPSLGAGGETPEAPPESWTQLWFFRFVVNAAGYASFMVPGYLLVQYFRRKNYLETGRGL CFPLVKACVFGNEPKASDEVPLAPRTEAAETTPMWQALKLLFCATGLQVSYLTWGVLQER VMTRSYGATATSPGERFTDSQFLVLMNRVLALIVAGLSCVLCKQPRHGAPMYRYSFASLS NVLSSWCQYEALKFVSFPTQVLAKASKVIPVMLMGKLVSRRSYEHWEYLTATLISIGVSM FLLSSGPEPRSSPATTLSGLILLAGYIAFDSFTSNWQDALFAYKMSSVQMMFGVNFFSCL FTVGSLLEQGALLEGTRFMGRHSEFAAHALLLSICSACGQLFIFYTIGQFGAAVFTIIMT LRQAFAILLSCLLYGHTVTVVGGLGVAVVFAALLLRVYARGRLKQRGKKAVPVESPVQKV >gi568815592r:44154709_44356881|GENSCAN_predicted_CDS_4|1623_bp atgacgaagcacaggggacacgccgggcgggcgcctcttcagggctggggctccccgcgc ccaggggcagcccaggtccccggaacccaagccgcgtctgcctccgggccgcgcgggggc gctgtggtccggcggcggcccggggcgctgcgtggtcgcggcaggggcggagggggccgc ggggagggaggcgggaagagcgcggcacttccgctggccgctggctcgctggccgctcct ggaggcggcggcgggagcgcagggggcgcgcggcccggggactcgcattccccggttccc cctccaccccacgcggcctggaccatggacgccagatggtgggcagtggtggtgctggct gcgttcccctccctaggggcaggtggggagactcccgaagcccctccggagtcatggacc cagctatggttcttccgatttgtggtgaatgctgctggctatgccagctttatggtacct ggctacctcctggtgcagtacttcaggcggaagaactacctggagaccggtaggggcctc tgctttcccctggtgaaagcttgtgtgtttggcaatgagcccaaggcctctgatgaggtt cccctggcgccccgaacagaggcggcagagaccaccccgatgtggcaggccctgaagctg ctcttctgtgccacagggctccaggtgtcttatctgacttggggtgtgctgcaggaaaga gtgatgacccgcagctatggggccacagccacatcaccgggtgagcgctttacggactcg cagttcctggtgctaatgaaccgagtgctggcactgattgtggctggcctctcctgtgtt ctctgcaagcagccccggcatggggcacccatgtaccggtactcctttgccagcctgtcc aatgtgcttagcagctggtgccaatacgaagctcttaagttcgtcagcttccccacccag gtgctggccaaggcctctaaggtgatccctgtcatgctgatgggaaagcttgtgtctcgg cgcagctacgaacactgggagtacctgacagccaccctcatctccattggggtcagcatg tttctgctatccagcggaccagagccccgcagctccccagccaccacactctcaggcctc atcttactggcaggttatattgcttttgacagcttcacctcaaactggcaggatgccctg tttgcctataagatgtcatcggtgcagatgatgtttggggtcaatttcttctcctgcctc ttcacagtgggctcactgctagaacagggggccctactggagggaacccgcttcatgggg cgacacagtgagtttgctgcccatgccctgctactctccatctgctccgcatgtggccag ctcttcatcttttacaccattgggcagtttggggctgccgtcttcaccatcatcatgacc ctccgccaggcctttgccatccttctttcctgccttctctatggccacactgtcactgtg gtgggagggctgggggtggctgtggtctttgctgccctcctgctcagagtctacgcgcgg ggccgtctaaagcaacggggaaagaaggctgtgcctgttgagtctcctgtgcagaaggtt tga >gi568815592r:44154709_44356881|GENSCAN_predicted_peptide_5|361_aa MSEARKGPDEAEESQYDSGIESLRSLRSLPESTSAPASGPSDGSPQPCTHPPGPVKEPQE KEDADGERADSTYGSSSLTYTLSLLGGPEAEDPAPRLPLPHVGALSPQQLEALTYISEDG DTLVHLAVIHEAPAVLLCCLALLPQEVLDIQNNLYQTALHLAVHLDQPGAVRALVLKGAS RALQDRHGDTALHVACQRQHLACARCLLEGRPEPGRGTSHSLDLQLQNWQGLACLHIATL QKNQPLMELLLRNGADIDVQEGTSGKTALHLAVETQERGLVQFLLQAGAQVDARMLNGCT PLHLAAGRGLMGISSTLCKAGADSLLRNVEDETPQDLTEESLVLLPFDDLKISGKLLLCT D >gi568815592r:44154709_44356881|GENSCAN_predicted_CDS_5|1086_bp atgtcggaggcgcggaaggggccggacgaggcggaggagagccagtacgactctggcatt gagtctctgcgctctctgcgctccctacccgagtccacctcggctccagcctccgggccc tcggacggcagcccccagccctgcacccatcctccgggacccgtcaaggaaccacaggag aaggaagacgcggatggggagcgggctgattccacctatggctcctcctcgctcacctac accctgtccttgctggggggccccgaggctgaggacccggccccacgcctgccactcccc cacgtgggggcgctgagccctcagcagctggaagcactcacttacatctccgaggacgga gacacgctggtccacctggcagtgattcatgaggccccagcggtgctgctctgttgcctg gctttgctgccccaggaggtcctggacattcaaaataacctttaccagacagcactccat ctggctgtacatctggaccaaccgggcgcagttcgggcactggtgctgaagggggccagc cgggcactacaggaccggcatggtgacacagcccttcatgtggcctgccagcgccagcac ttggcctgtgcccgctgcctgctggaagggcggccagagccaggcagaggaacatctcac tctctggacctccagctgcaaaactggcaaggtctggcttgtctccacattgccaccctt cagaagaaccaaccactcatggaattgctgcttcggaatggagctgacattgatgtgcag gagggcaccagtggtaagacagcgctgcacctggctgtggaaacccaagagcggggcctg gtacagttcctgctccaggctggtgcccaggtagatgcccgcatgctgaacgggtgcaca cccctgcacctggcagctggccggggtctcatgggcatctcatccactctgtgcaaggcg ggtgctgactccctgctgcggaatgtggaggatgagacgccccaggacctgactgaggaa tcccttgtccttttgccctttgatgacctgaagatctcagggaaactgctgctgtgtacc gactga >gi568815592r:44154709_44356881|GENSCAN_predicted_peptide_6|600_aa MGSTKGSSVGLERPRARAGCPRPLAAPREVLSSTRPLYAMSPPGSAAGESAAGGGGGGGG PGVSEELTAAAAAAAADEGPAREEPSFTKSLCRESHWKCLLLSLLMYGCLGAVAWCHVTT VTRLTFSSAYQGNSLMYHDSPCSNGYVYIPLAFLLMLYAVYLVECWHCQARHELQHRVDV SSVRERVGRMQQATPCIWWKAISYHYVRRTRQVTRYRNGDAYTTTQVYHERVNTHVAEAE FDYARCGVRDVSKTLVGLEGAPATRLRFTKCFSFASVEAENAYLCQRARFFAENEGLDDY MEAREGMHLKNVDFREFMVAFPDPARPPWYACSSAFWAAALLTLSWPLRVLAEYRTAYAH YHVEKLFGLEGPGSASSAGGGLSPSDELLPPLTHRLPRVNTVDSTELEWHIRSNQQLVPS YSEAVLMDLAGLGTRCGGAGGGYAPSCRYGGVGGPGAAGVAPYRRSCEHCQRAVSSSSIF SRSALSICASPRAGPGPGGGAGCGGSRFSLGRLYGSRRSCLWRSRSGSVNEASCPTEQTR LSSQASMGDDEDDDEEEAGPPPPYHDALYFPVLIVHRQEGCLGHSHRPLHRHGSCVETSL >gi568815592r:44154709_44356881|GENSCAN_predicted_CDS_6|1803_bp atggggagcacgaaaggatcctctgtgggtttggaaaggccaagggcccgcgcaggatgc ccccggcccctggcagccccccgggaggtcctgagctcgacgcgccccctctacgccatg tccccccctggctcggccgcgggagagagcgccgccggcggcggcggcggcggtggcggc cccggggtctcggaggagctcacggcggcggcggcagcggcggcggcggacgagggcccc gcccgagaggagccctctttcaccaagtccctctgccgtgagtcccactggaagtgcctc ctgctctcgctgctcatgtacggctgcctgggggcagtggcctggtgccacgtcaccaca gtgacgcgcctcaccttcagcagcgcctaccagggcaacagcctcatgtaccatgacagc ccctgctccaacggctatgtctacatccccctggccttcctgctcatgttgtacgccgtc tacctggtggagtgttggcactgccaagcccgccatgagctgcagcaccgtgttgatgtg agcagtgtgcgggaacgtgtgggccgcatgcagcaagccacgccctgcatctggtggaag gccatcagctaccactatgtccgccgcacccgccaggtcaccagataccgcaatggagac gcctataccaccacccaggtctaccacgaacgcgtcaacacgcacgtggcggaggctgag ttcgactacgcgcgctgcggcgttcgcgacgtgtccaagacgctggtggggctggagggc gcgccggccacgcggctgcgcttcaccaagtgcttcagtttcgccagcgtggaggccgag aacgcgtacctgtgccagcgcgcgcgcttcttcgcagagaacgagggcctagacgactac atggaggcacgcgagggcatgcacctcaagaacgtggacttccgtgagttcatggtggcc ttcccggacccggcccggccgccctggtacgcctgctcgtcggccttctgggccgcggcg ctgctcacgctgtcgtggccgctgcgagtgctggccgagtaccgcacggcctacgcgcac taccacgtggagaagctatttggcctggagggcccgggctcggccagcagcgcaggcggt ggcctcagccccagcgatgagctgctgcccccgctcacccaccgcctgccgcgggtcaac acagtagacagcacggagctcgagtggcacatccgctccaaccagcagctggtgcccagc tactctgaggcggtgctcatggacctggcggggctcgggacgcgctgcggcggggcaggc ggcggctacgcgccctcgtgccgctacggtggggtaggcggcccgggcgcggcgggcgtg gctccctaccggcgcagctgcgagcactgccagcgcgccgtcagcagctcgtctatcttc tcgcgcagcgccctaagcatctgcgccagcccgcgggccggcccggggcccggtgggggc gcgggctgcgggggcagccgcttctcgctgggccgtctctacggctcccggcgcagctgc ctgtggcgcagccgcagcgggagcgtcaacgaggccagctgccccacggagcagacgcgg ctgtccagccaggccagcatgggggacgacgaggacgacgacgaggaggaggccgggccg ccgccgccctaccacgacgccctctactttccggtcctcatcgtccaccggcaggagggg tgtctgggccacagccaccggccgctgcaccgccacggctcctgcgtagagacctcactg tga >gi568815592r:44154709_44356881|GENSCAN_predicted_peptide_7|1465_aa MAASVAAAARRLRRAIRRSPAWRGLSHRPLSSEPPAAKASAVRAAFLNFFRDRHGHRLVP SASVRPRGDPSLLFVNAGMNQFKPIFLGTVDPRSEMAGFRRVANSQKCVRAGGHHNDLED VGRDLSHHTFFEMLGNWAFGGEYFKEEACNMAWELLTQVYGIPEERLWISYFDGDPKAGL DPDLETRDIWLSLGVPASRVLSFGPQENFWEMGDTGPCGPCTEIHYDLAGGVGAPQLVEL WNLVFMQHNREADGSLQPLPQRHVDTGMGLERLVAVLQGKHSTYDTDLFSPLLNAIQQGC RAPPYLGRVGVADEGRTDTAYRVVADHIRTLSVCISDGIFPGMSGPPLVLRRILRRAVRF SMEILKAPPGFLGSLVPVVVETLIANLVSEDEAAFLASLERGRRIIDRTLRTLGPSDMFP AEVAWSLSLCGDLGLPLDMVELMLEEKGVQLDSAGLERLAQEEAQHRARQAEPVQKQGLW LDVHALGELQRQGVPPTDDSPKYNYSLRPSGSYEFGTCEAQVLQLYTEDGTAVASVGKGQ RCGLLLDRTNFYAEQGGQASDRGYLVRAGQEDVLFPVARAQVCGGFILHEAVAPECLRLG DQVQLHVDEAWRLGCMAKHTATHLLNWALRQTLGPGTEQQGSHLNPEQLRLDVTTQTPLT PEQLRAVENTVQEAVGQDEAVYMEEVPLALTAQVPGLRSLDEVYPDPVRVVSVGVPVAHA LDPASQAALQTSVELCCGTHLLRTGAVGDLVIIGDRQLSKGTTRLLAVTGEQAQQARELG QSLAQEVKAATERLSLGSRDVAEALRLSKDIGRLIEAVETAVMPQWQRRELLATVKMLQR RANTAIRKLQMGQAAKKTQELLERHSKGPLIVDTVSAESLSVLVKVVRQLCEQAPSTSVL LLSPQPMGKVLCACQVAQKAHPFLDTSHGSTGHSARPQSHYPQSGRFHGTVPRPGLATLR ATSSMQDTVTTSALLDPSHSSVSTQDNSSTGGHTSSTSPQLSKPSITPVPAKSRNPHPRA NIRRMRRIIAEDPEWSLAIVPLLTELCIQHIIRNFQKNPILKQMLPEHQQKVLNHLSPDL PLAVTANLIDSENYWLRCCMHRWPVCHVAHHGGSWKRMFFERHLENLLKHFIPGTTDPAV ILDLLPLCRNYVRRVHVDQFLPPVQLPAQLRPGDQSDSGSEGEMEEPTVDHYQLGDLVAG LSHLEELDLVYDVKDCGMNFEWNLFLFTYRDCLSLAAAIKACHTLKIFKLTRSKVDDDKA RIIIRSLLDHPVLEELDLSQNLIGDRGARGAAKLLSHSRLRVLNLANNQVRAPGAQSLAH ALAHNTNLISLNLRLNCIEDEGGQALAHALQTNKCLTTLHLGGNELSEPTATLLSQVLAI NTTLTSINLSCNHIGLDGGKQLLEGMSDNKTLLEFDLRLSDVAQESEYLIGQALYANREA ARQRALNPSHFMSTITANGPENSVG >gi568815592r:44154709_44356881|GENSCAN_predicted_CDS_7|4398_bp atggcagcgtcagtggcagctgcagcccggaggctgcggcgggccattcgaaggtcgccc gcatggcggggcctcagccatcggccgctctcatcggagccccctgcagccaaggcctcg gccgtgagggccgcctttctgaacttctttcgggaccgccatggccaccggctggtgccc tccgcttccgtgcggccccgcggcgaccccagtttgctttttgtcaatgcgggcatgaac cagttcaagccaatctttctgggcaccgtggatccacgaagcgagatggcaggcttccga cgtgtggccaacagccagaaatgtgtgagagctggaggacaccataacgacctggaagat gtgggtcgagacctttcccatcataccttctttgaaatgcttggcaattgggcctttggg ggtgaatattttaaggaggaggcttgtaacatggcctgggaactgctgactcaggtctat gggatccctgaggaaaggctctggatctcctactttgatggtgaccccaaggcagggctg gacccagacctggagaccagggacatctggctgagcttaggggtgcctgctagccgtgtg ctttcctttggaccacaagagaacttctgggagatgggggatactggcccttgtgggccc tgtactgagatccactacgaccttgctggtggggtgggagccccccagctggtagagctt tggaacctggtcttcatgcaacacaacagagaggcagatggaagcctgcagcccctgccc cagcggcatgtggacacaggaatgggcctggaaaggctggtggctgtgctgcaaggcaaa cactccacctatgacactgacctcttttccccgctgctcaacgccatacagcagggctgc agggcacccccttacttgggccgagtaggggtggcagacgaggggcgcacagacacagcg taccgcgtggtggctgaccacatccgcacactcagtgtctgcatctctgatggcatcttc cctgggatgtcaggtcccccgctggttcttcgtcggatcctgcgtcgagctgtgcgtttc tccatggagatcttaaaggcaccacctggcttcctaggcagcctggtacctgtagtggtg gagacactgatcgccaacctggtgtcagaggacgaggcagccttcctggcctccctggag cggggtaggcggatcattgatcggactctgaggaccctggggccttcagatatgttccct gctgaagtggcctggtccttgtcactgtgtggagacctgggactccccttggacatggta gagctgatgctggaggagaaaggggtccagctagactccgctggactggagcggttggcc caagaggaggcccagcaccgggcacggcaggctgagccagttcagaagcagggattgtgg cttgatgtccatgcgcttggggagctgcagcgccaaggagtgcccccaactgacgacagc cccaagtacaactactccctgcgacccagcggaagttatgagttcggcacctgtgaggcc caggtgttgcaactgtatacagaggacgggacagcagtggcctccgtggggaaaggccag cgctgtggcctcctcttggacaggaccaacttctacgcagaacaggggggccaggcttca gaccgtggctacctggtgcgggcagggcaagaggacgtgctgttcccagtagcccgggcc caggtctgtggaggtttcatcctgcatgaggcagtagcccctgagtgcctgcggttaggg gaccaggtgcagctgcatgtggatgaggcctggcgtctaggctgcatggcgaagcatacg gccacccacctgctgaactgggcactgaggcagaccctgggccctggcacagagcagcag ggctcccatctcaatcctgagcagctgcgcttggatgtgaccacccagaccccattgacc ccagagcagctccgggcagtggagaacactgtgcaggaggccgtggggcaggatgaggct gtgtacatggaggaggtgcccctggcgctcactgcccaggtccctggcctgcgctctctg gatgaggtttacccagaccctgtgcgggtggtatcagtgggggtgcccgtggcccatgca ttggacccagcctcccaagccgcactgcagacctctgtggagctatgctgtgggacgcac ctgttacgtactggggctgtaggggacctggttatcatcggggaccgccagctttccaag ggcactacccgcctgctggccgtcactggggagcaggcccagcaggcccgagagctaggc cagagcctggcccaggaagtgaaagcggccactgagcggctgagtctggggagccgggat gtggcggaggcactgaggctgtccaaggacataggacgactcattgaagctgtggaaact gctgtgatgccccagtggcagcggcgggagctgctggccacagtgaagatgctgcagcgg cgtgccaacactgccatccgtaagctgcaaatgggacaggctgcaaagaaaactcaggag ctgctggagcggcactcgaaggggcctctgattgtggacacagtctctgctgagtctctc tcagtgctggtgaaggtggtacggcagctgtgtgagcaggcccccagcacgtctgtgctc ctactcagcccccagcccatggggaaggtgctgtgtgcctgtcaggtggcccagaaagcc cacccgttcctagacacgtcccacggaagcacagggcatagcgcaaggccacagtcccac tacccgcaaagcggccgcttccacggaaccgtcccgaggccgggcctggccaccctgcgc gcgacctccagcatgcaggataccgtaacgacatcagcattgttggaccccagccactcc tcagtctccacccaggacaattcctccactggaggacacacttcaagcacaagcccacag ctctcaaagccttcaatcacaccagtccctgcaaagtccaggaacccacatcccagggcc aatatccgtcggatgcgccggatcattgctgaggatcctgagtggtcactggccatcgtg cccctcctcacagagctctgcattcagcacattatcaggaacttccagaaaaaccctatc ctgaagcagatgctcccggaacaccagcagaaggtcctgaaccacctgtcccctgaccta ccactggctgtgaccgccaacctgatagacagtgagaactactggctccgctgctgcatg catcgctggcccgtgtgccacgtggcccaccatggcggcagctggaaacgcatgttcttc gagcggcacctggagaacctgctaaagcactttatcccaggcaccacagaccctgcggtg atcctcgacctgctgccgctctgccggaattacgtgcgcagggtccacgtcgatcagttc cttccgccggtgcagctcccggcccagctccggccgggcgaccagtccgactcaggcagc gagggagagatggaggagcccaccgttgaccactaccaactgggcgatctggtagctggc ctgagccacctggaggagctggacctggtgtacgatgtcaaggactgcggcatgaatttc gagtggaatctcttcctcttcacctaccgtgactgcctctccttggcagccgccatcaag gcatgccacaccctcaagatcttcaagctgacccgaagcaaggtggatgatgacaaggca cgcatcataattcgaagccttctggaccacccagtcctcgaggagctggacttgtcacaa aacctcattggagaccgtggtgcacgaggtgctgccaagctgctgagccacagccgcctg cgtgtgctcaacctggctaacaaccaggtgcgtgcacccggtgcccagtccctggctcac gctctggcacacaacaccaacctcatttccctcaacctacgtctcaactgcatcgaggat gagggtggccaggctcttgcccatgccttgcagaccaacaagtgcctcaccacgctgcac ctcggtggcaatgagctgtctgagcccaccgccacactcctgtcacaggtgctcgccatc aacaccacactcaccagcatcaacctgtcctgcaaccacatcgggctggacggtgggaag cagctcctggaaggcatgtcagacaacaagaccctcctggaatttgacttgcgcctgtca gatgtggcccaggaaagcgagtacctcattggccaggccctctacgcaaaccgagaagca gcccgccagcgggccctgaatcccagccacttcatgtcaaccataactgccaatggccct gagaactctgtgggataa >gi568815592r:44154709_44356881|GENSCAN_predicted_peptide_8|133_aa MALNSHSLPEKLRVRLRSSSHLPVMDCRSESEPWGLHIEQVPQGDSRDWANSGSSVQLGI WKWNWIKYSAVMELDFKKEITAKRANCSDFLESKGCFANTTPSGKSVSSSSSVETGPSVS EPPGLPRVSAYVD >gi568815592r:44154709_44356881|GENSCAN_predicted_CDS_8|399_bp atggcgctgaacagccacagtctcccagaaaagctcagagtgcgcctgcgctcaagttct catttacctgttatggactgtaggtctgagtcagagccctggggcctgcatattgaacaa gtaccccaaggggattctcgtgactgggcaaattcaggaagttctgtgcagttgggaatc tggaagtggaactggataaaatacagtgcagtgatggaactagacttcaagaaggagatt actgccaaacgtgctaattgcagtgattttctggaatctaagggatgttttgccaacaca acaccctctggcaaaagtgtcagttcctcatcttctgtggaaacaggcccaagtgtcagt gagcctcctggcctccccagagtgtctgcttacgtagan