GENSCAN 1.0 Date run: 4-Nov-116 Time: 01:27:13 Sequence gi568815592r:44200550_44413323 : 212774 bp : 47.85% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 16923 17158 236 2 2 80 50 364 0.607 27.41 1.02 Intr + 21037 21136 100 1 1 92 75 25 0.544 1.71 1.03 Intr + 26714 26793 80 1 2 84 110 39 0.798 4.05 1.04 Intr + 28841 28922 82 2 1 109 94 28 0.730 5.14 1.05 Intr + 29040 29242 203 2 2 54 105 225 0.977 18.88 1.06 Intr + 29358 29497 140 0 2 102 61 279 0.999 26.71 1.07 Intr + 29798 29932 135 0 0 92 100 142 0.911 16.34 1.08 Intr + 30019 30116 98 2 2 58 105 84 0.965 6.83 1.09 Intr + 30262 30340 79 0 1 93 63 166 0.997 13.72 1.10 Intr + 30815 30912 98 0 2 85 63 80 0.993 4.93 1.11 Intr + 31449 31557 109 2 1 67 60 123 0.847 7.26 1.12 Intr + 31794 31879 86 1 2 131 106 64 0.999 12.04 1.13 Intr + 32258 32457 200 1 2 70 89 385 0.984 34.95 1.14 Term + 32868 32979 112 0 1 82 48 95 0.974 2.93 1.15 PlyA + 33571 33576 6 1.05 2.00 Prom + 35411 35450 40 -7.76 2.01 Init + 35547 35645 99 2 0 86 67 94 0.966 5.37 2.02 Intr + 36795 36985 191 2 2 62 55 139 0.290 6.38 2.03 Intr + 38234 38362 129 2 0 77 38 73 0.062 1.01 2.04 Intr + 43719 43837 119 0 2 42 96 74 0.042 3.71 2.05 Intr + 45849 46069 221 1 2 43 53 156 0.208 5.62 2.06 Intr + 48081 48227 147 2 0 19 45 147 0.525 3.93 2.07 Intr + 48828 49034 207 2 0 57 72 260 0.997 20.57 2.08 Intr + 49126 49285 160 0 1 70 115 190 0.998 19.46 2.09 Intr + 49472 49605 134 0 2 85 73 95 0.996 8.06 2.10 Intr + 49742 50050 309 1 0 59 70 535 0.999 45.31 2.11 Intr + 50499 50664 166 2 1 85 70 160 0.999 13.43 2.12 Intr + 50869 51059 191 2 2 51 93 216 0.999 17.70 2.13 Intr + 51188 51335 148 1 1 48 78 174 0.999 12.21 2.14 Intr + 51450 51718 269 1 2 74 121 449 0.999 44.05 2.15 Intr + 52496 52829 334 1 1 91 97 493 0.997 45.64 2.16 Term + 52940 53049 110 0 2 67 39 216 0.999 13.27 2.17 PlyA + 53309 53314 6 1.05 3.05 PlyA - 53571 53566 6 -1.75 3.04 Term - 55095 54157 939 0 0 111 55 813 0.901 72.41 3.03 Intr - 55947 55793 155 1 2 89 94 148 0.990 15.29 3.02 Intr - 56329 56136 194 0 2 109 82 152 0.944 15.84 3.01 Init - 57185 56851 335 2 2 82 75 157 0.908 8.57 3.00 Prom - 57380 57341 40 -11.82 4.07 PlyA - 57638 57633 6 1.05 4.06 Term - 58735 58670 66 1 0 93 48 94 0.985 3.84 4.05 Intr - 59733 59494 240 0 0 104 100 286 0.991 29.15 4.04 Intr - 59990 59902 89 0 2 102 94 89 0.995 10.59 4.03 Intr - 61299 61077 223 0 1 89 70 116 0.992 7.60 4.02 Intr - 62113 62011 103 0 1 131 77 38 0.990 7.18 4.01 Init - 64797 64433 365 0 2 80 94 380 0.999 34.32 4.00 Prom - 65718 65679 40 -6.06 5.00 Prom + 65786 65825 40 -5.66 5.01 Init + 68184 68227 44 2 2 66 72 -3 0.496 -4.18 5.02 Intr + 70121 70328 208 2 1 122 78 237 0.853 25.18 5.03 Intr + 72532 72957 426 0 0 26 94 714 0.696 59.79 5.04 Term + 74854 75978 1125 0 0 104 43 2000 0.871 189.11 5.05 PlyA + 78871 78876 6 1.05 6.26 PlyA - 79522 79517 6 -3.64 6.25 Term - 79841 79632 210 2 0 110 38 278 0.987 22.29 6.24 Intr - 82010 81561 450 2 0 91 93 604 0.933 54.70 6.23 Intr - 85983 85415 569 1 2 84 79 803 0.946 71.50 6.22 Intr - 87284 87000 285 0 0 33 70 156 0.051 5.61 6.21 Intr - 97406 97277 130 2 1 100 65 60 0.321 5.17 6.20 Intr - 100717 100607 111 1 0 102 94 89 0.967 11.48 6.19 Intr - 100915 100832 84 1 0 139 70 90 0.999 12.32 6.18 Intr - 101621 101511 111 2 0 55 94 173 0.995 15.18 6.17 Intr - 101964 101842 123 0 0 144 53 110 0.999 13.88 6.16 Intr - 102361 102253 109 0 1 139 76 215 0.933 25.69 6.15 Intr - 102626 102517 110 2 2 93 86 32 0.999 2.58 6.14 Intr - 102874 102737 138 1 0 92 105 171 0.999 19.86 6.13 Intr - 103772 103632 141 2 0 -37 99 150 0.911 4.15 6.12 Intr - 103984 103871 114 1 0 85 94 103 0.993 11.24 6.11 Intr - 104268 104096 173 1 2 96 105 184 0.990 20.56 6.10 Intr - 104649 104505 145 0 1 58 81 110 0.765 7.16 6.09 Intr - 105237 105104 134 1 2 91 87 89 0.999 9.46 6.08 Intr - 105842 105731 112 2 1 69 107 139 0.999 13.85 6.07 Intr - 106482 106374 109 2 1 53 95 86 0.978 6.09 6.06 Intr - 106845 106700 146 0 2 96 86 126 0.996 12.28 6.05 Intr - 109894 109750 145 0 1 72 109 179 0.999 18.68 6.04 Intr - 110612 110445 168 1 0 128 90 94 0.999 12.66 6.03 Intr - 110986 110841 146 1 2 111 110 144 0.999 17.98 6.02 Intr - 111714 111523 192 0 0 91 113 123 0.979 14.79 6.01 Init - 112774 112532 243 1 0 83 94 288 0.998 24.63 6.00 Prom - 118539 118500 40 -5.06 7.00 Prom + 126094 126133 40 -3.76 7.01 Init + 143234 143309 76 1 1 77 92 57 0.358 6.25 7.02 Intr + 149633 149806 174 0 0 46 66 83 0.117 1.81 7.03 Intr + 152177 152324 148 0 1 125 89 1 0.580 3.19 7.04 Intr + 159897 160021 125 0 2 42 91 68 0.430 2.73 7.05 Intr + 167830 167950 121 2 1 99 101 -4 0.224 1.55 7.06 Intr + 169502 169564 63 2 0 113 89 24 0.820 2.83 7.07 Intr + 175809 175924 116 0 2 101 68 78 0.734 7.19 7.08 Intr + 187237 187319 83 2 2 28 107 25 0.002 -2.24 7.09 Intr + 189719 189822 104 1 2 67 80 29 0.775 -1.03 7.10 Intr + 192118 192279 162 1 0 61 84 75 0.889 3.49 7.11 Intr + 192897 193024 128 0 2 88 110 181 0.997 20.62 7.12 Intr + 195792 195891 100 1 1 88 95 36 0.993 3.47 7.13 Intr + 203260 203478 219 1 0 76 97 130 0.899 10.12 7.14 Intr + 205774 205918 145 1 1 57 89 19 0.958 -0.82 7.15 Intr + 207895 208083 189 0 0 82 97 165 0.847 16.58 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 86730 86641 90 1 0 110 75 15 0.822 2.37 S.002 Init + 187275 187319 45 2 0 96 107 36 0.849 6.99 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592r:44200550_44413323|GENSCAN_predicted_peptide_1|585_aa MPTPLLPLLLRLLLSCLLLPAARLARQYLLPLLRRLARRLGSQDMREALLGCLLFILSQR HSPDAGEASRVDRLERRERDLRELREGEKPEDQAETEESWQGLARKTPGKACAPEGGSCQ PGKTENTITMTTSHQPQDRYKAVWLIFFMLGLGTLLPWNFFMTATQYFTNRLDMSQNVSL VTAELSKDAQASAAPAAPLPERNSLSAIFNNVMTLCAMLPLLLFTYLNSFLHQRIPQSVR ILGSLVAILLVFLITAILVKVQLDALPFFVITMIKIVLINSFGAILQGSLFGLAGLLPAS YTAPIMSGQGLAGFFASVAMICAIASGSELSESAFGYFITACAVIILTIICYLGLPRLEF YRYYQQLKLEGPGEQETKLDLISKGEEPRAGKEESGVSVSNSQPTNESHSIKAILKNISV LAFSVCFIFTITIGMFPAVTVEVKSSIAGSSTWERYFIPVSCFLTFNIFDWLGRSLTAVF MWPGKDSRWLPSLVLARLVFVPLLLLCNIKPRRYLTVVFEHDAWFIFFMAAFAFSNGYLA SLCMCFGPKKVKPAEAETAGAIMAFFLCLGLALGAVFSFLFRAIV >gi568815592r:44200550_44413323|GENSCAN_predicted_CDS_1|1758_bp atgcccacgccactgctcccgctgctgcttcgattgctgctgtcctgcctgctgctgcct gctgcccgcctggcccgccaatacctcctgcccctgctgcgccgattggcccgccgcctg ggctcccaggacatgcgagaggctttgctgggctgtctgctgttcattctcagccagcga cactcgccagacgctggggaggcctcaagagtggaccgcctggagaggagggagagggac ctgagggagctcagggagggagagaagccagaagaccaggcagagactgaagagagctgg caaggcctggctagaaaaactccaggcaaggcctgtgcccctgagggagggagctgtcag ccagggaaaaccgagaacaccatcaccatgacaaccagtcaccagcctcaggacagatac aaagctgtctggcttatcttcttcatgctgggtctgggaacgctgctcccgtggaatttt ttcatgacggccactcagtatttcacaaaccgcctggacatgtcccagaatgtgtccttg gtcactgctgaactgagcaaggacgcccaggcgtcagccgcccctgcagcacccttgcct gagcggaactctctcagtgccatcttcaacaatgtcatgaccctatgtgccatgctgccc ctgctgttattcacctacctcaactccttcctgcatcagaggatcccccagtccgtacgg atcctgggcagcctggtggccatcctgctggtgtttctgatcactgccatcctggtgaag gtgcagctggatgctctgcccttctttgtcatcaccatgatcaagatcgtgctcattaat tcatttggtgccatcctgcagggcagcctgtttggtctggctggccttctgcctgccagc tacacggcccccatcatgagtggccagggcctagcaggcttctttgcctccgtggccatg atctgcgctattgccagtggctcggagctatcagaaagtgccttcggctactttatcaca gcctgtgctgttatcattttgaccatcatctgttacctgggcctgccccgcctggaattc taccgctactaccagcagctcaagcttgaaggacccggggagcaggagaccaagttggac ctcattagcaaaggagaggagccaagagcaggcaaagaggaatctggagtttcagtctcc aactctcagcccaccaatgaaagccactctatcaaagccatcctgaaaaatatctcagtc ctggctttctctgtctgcttcatcttcactatcaccattgggatgtttccagccgtgact gttgaggtcaagtccagcatcgcaggcagcagcacctgggaacgttacttcattcctgtg tcctgtttcttgactttcaatatctttgactggttgggccggagcctcacagctgtattc atgtggcctgggaaggacagccgctggctgccaagcctggtgctggcccggctggtgttt gtgccactgctgctgctgtgcaacattaagccccgccgctacctgactgtggtcttcgag cacgatgcctggttcatcttcttcatggctgcctttgccttctccaacggctacctcgcc agcctctgcatgtgcttcgggcccaagaaagtgaagccagctgaggcagagaccgcagga gccatcatggccttcttcctgtgtctgggtctggcactgggggctgttttctccttcctg ttccgggcaattgtgtga >gi568815592r:44200550_44413323|GENSCAN_predicted_peptide_2|977_aa MAPDPKWALAGPCPLGSRGLLWILHFPASTLLPVGTEGGNGQRPTPSGPNLINGLMSGLD PSSPSRYDLASCVDALGERTCPLGALCSLGSPGARWNSTITVLPFHHLRGVEVRPVNGPE IQPIETCMLKRNDFILGQASLQWVKGSQSCVCISASPGSPHSGGLVSVNEWKTMELVDTA GEAEPRGEGAGPPRQAWKLLEMPQCRRRAFRRMSAKSPRQPRPAPSPYAELPLSANPPPF LYSCESRDLGLPKMPEEVHHGEEEVETFAFQAEIAQLMSLIINTFYSNKEIFLRELISNA SDALDKIRYESLTDPSKLDSGKELKIDIIPNPQERTLTLVDTGIGMTKADLINNLGTIAK SGTKAFMEALQAGADISMIGQFGVGFYSAYLVAEKVVVITKHNDDEQYAWESSAGGSFTV RADHGEPIGRGTKVILHLKEDQTEYLEERRVKEVVKKHSQFIGYPITLYLEKEREKEISD DEAEEEKGEKEEEDKDDEEKPKIEDVGSDEEDDSGKDKKKKTKKIKEKYIDQEELNKTKP IWTRNPDDITQEEYGEFYKSLTNDWEDHLAVKHFSVEGQLEFRALLFIPRRAPFDLFENK KKKNNIKLYVRRVFIMDSCDELIPEYLNFIRGVVDSEDLPLNISREMLQQSKILKVIRKN IVKKCLELFSELAEDKENYKKFYEAFSKNLKLGIHEDSTNRRRLSELLRYHTSQSGDEMT SLSEYVSRMKETQKSIYYITGESKEQVANSAFVERVRKRGFEVVYMTEPIDEYCVQQLKE FDGKSLVSVTKEGLELPEDEEEKKKMEESKAKFENLCKLMKEILDKKVEKVTISNRLVSS PCCIVTSTYGWTANMERIMKAQALRDNSTMGYMMAKKHLEINPDHPIVETLRQKAEADKN DKAVKDLVVLLFETALLSSGFSLEDPQTHSNRIYRMIKLGLGIDEDEVAAEEPNAAVPDE IPPLEGDEDASRMEEVD >gi568815592r:44200550_44413323|GENSCAN_predicted_CDS_2|2934_bp atggctcccgaccccaagtgggcgctcgcaggcccctgccctttggggagcaggggcctc ctctggatcctccactttcctgctagcaccttgctgcctgttggcactgagggtggaaat gggcagcgcccaactccctctggccccaacctcatcaatggccttatgtccgggctggac ccctccagcccatctcggtatgacctggcttcctgtgtagatgctctcggggagaggacg tgcccactcggagccctctgcagccttggcagtcctggtgctcgctggaacagcactatc acagtcctaccgttccatcatctgcgtggggtggaagtgaggccagtgaatggcccagaa atccagcccattgaaacctgcatgctgaaaaggaatgacttcatcctgggacaggccagc ctccagtgggttaagggctcccagtcctgtgtatgcatctctgcatccccaggatctcca cacagtggaggcttagtgagtgtcaatgagtggaaaacgatggagcttgtggacacagcc ggagaggcggagcctcgcggggagggggcgggaccgccgagacaggcctggaaactgctg gaaatgccgcagtgccgccgccgcgccttccgccgcatgtcggcaaagagtccccgccag ccccggccggcgccctccccctacgctgagctgcccctcagcgcgaaccctccgcccttc ctctactcctgcgagagtcgggatctggggctacccaagatgcctgaggaagtgcaccat ggagaggaggaggtggagacttttgcctttcaggcagaaattgcccaactcatgtccctc atcatcaataccttctattccaacaaggagattttccttcgggagttgatctctaatgct tctgatgccttggacaagattcgctatgagagcctgacagacccttcgaagttggacagt ggtaaagagctgaaaattgacatcatccccaaccctcaggaacgtaccctgactttggta gacacaggcattggcatgaccaaagctgatctcataaataatttgggaaccattgccaag tctggtactaaagcattcatggaggctcttcaggctggtgcagacatctccatgattggg cagtttggtgttggcttttattctgcctacttggtggcagagaaagtggttgtgatcaca aagcacaacgatgatgaacagtatgcttgggagtcttctgctggaggttccttcactgtg cgtgctgaccatggtgagcccattggcaggggtaccaaagtgatcctccatcttaaagaa gatcagacagagtacctagaagagaggcgggtcaaagaagtagtgaagaagcattctcag ttcataggctatcccatcaccctttatttggagaaggaacgagagaaggaaattagtgat gatgaggcagaggaagagaaaggtgagaaagaagaggaagataaagatgatgaagaaaaa cccaagatcgaagatgtgggttcagatgaggaggatgacagcggtaaggataagaagaag aaaactaagaagatcaaagagaaatacattgatcaggaagaactaaacaagaccaagcct atttggaccagaaaccctgatgacatcacccaagaggagtatggagaattctacaagagc ctcactaatgactgggaagaccacttggcagtcaagcacttttctgtagaaggtcagttg gaattcagggcattgctatttattcctcgtcgggctccctttgacctttttgagaacaag aagaaaaagaacaacatcaaactctatgtccgccgtgtgttcatcatggacagctgtgat gagttgataccagagtatctcaattttatccgtggtgtggttgactctgaggatctgccc ctgaacatctcccgagaaatgctccagcagagcaaaatcttgaaagtcattcgcaaaaac attgttaagaagtgccttgagctcttctctgagctggcagaagacaaggagaattacaag aaattctatgaggcattctctaaaaatctcaagcttggaatccacgaagactccactaac cgccgccgcctgtctgagctgctgcgctatcatacctcccagtctggagatgagatgaca tctctgtcagagtatgtttctcgcatgaaggagacacagaagtccatctattacatcact ggtgagagcaaagagcaggtggccaactcagcttttgtggagcgagtgcggaaacggggc ttcgaggtggtatatatgaccgagcccattgacgagtactgtgtgcagcagctcaaggaa tttgatgggaagagcctggtctcagttaccaaggagggtctggagctgcctgaggatgag gaggagaagaagaagatggaagagagcaaggcaaagtttgagaacctctgcaagctcatg aaagaaatcttagataagaaggttgagaaggtgacaatctccaatagacttgtgtcttca ccttgctgcattgtgaccagcacctacggctggacagccaatatggagcggatcatgaaa gcccaggcacttcgggacaactccaccatgggctatatgatggccaaaaagcacctggag atcaaccctgaccaccccattgtggagacgctgcggcagaaggctgaggccgacaagaat gataaggcagttaaggacctggtggtgctgctgtttgaaaccgccctgctatcttctggc ttttcccttgaggatccccagacccactccaaccgcatctatcgcatgatcaagctaggt ctaggtattgatgaagatgaagtggcagcagaggaacccaatgctgcagttcctgatgag atcccccctctcgagggcgatgaggatgcgtctcgcatggaagaagtcgattag >gi568815592r:44200550_44413323|GENSCAN_predicted_peptide_3|540_aa MTKHRGHAGRAPLQGWGSPRPGAAQVPGTQAASASGPRGGAVVRRRPGALRGRGRGGGGR GEGGGKSAALPLAAGSLAAPGGGGGSAGGARPGDSHSPVPPPPHAAWTMDARWWAVVVLA AFPSLGAGGETPEAPPESWTQLWFFRFVVNAAGYASFMVPGYLLVQYFRRKNYLETGRGL CFPLVKACVFGNEPKASDEVPLAPRTEAAETTPMWQALKLLFCATGLQVSYLTWGVLQER VMTRSYGATATSPGERFTDSQFLVLMNRVLALIVAGLSCVLCKQPRHGAPMYRYSFASLS NVLSSWCQYEALKFVSFPTQVLAKASKVIPVMLMGKLVSRRSYEHWEYLTATLISIGVSM FLLSSGPEPRSSPATTLSGLILLAGYIAFDSFTSNWQDALFAYKMSSVQMMFGVNFFSCL FTVGSLLEQGALLEGTRFMGRHSEFAAHALLLSICSACGQLFIFYTIGQFGAAVFTIIMT LRQAFAILLSCLLYGHTVTVVGGLGVAVVFAALLLRVYARGRLKQRGKKAVPVESPVQKV >gi568815592r:44200550_44413323|GENSCAN_predicted_CDS_3|1623_bp atgacgaagcacaggggacacgccgggcgggcgcctcttcagggctggggctccccgcgc ccaggggcagcccaggtccccggaacccaagccgcgtctgcctccgggccgcgcgggggc gctgtggtccggcggcggcccggggcgctgcgtggtcgcggcaggggcggagggggccgc ggggagggaggcgggaagagcgcggcacttccgctggccgctggctcgctggccgctcct ggaggcggcggcgggagcgcagggggcgcgcggcccggggactcgcattccccggttccc cctccaccccacgcggcctggaccatggacgccagatggtgggcagtggtggtgctggct gcgttcccctccctaggggcaggtggggagactcccgaagcccctccggagtcatggacc cagctatggttcttccgatttgtggtgaatgctgctggctatgccagctttatggtacct ggctacctcctggtgcagtacttcaggcggaagaactacctggagaccggtaggggcctc tgctttcccctggtgaaagcttgtgtgtttggcaatgagcccaaggcctctgatgaggtt cccctggcgccccgaacagaggcggcagagaccaccccgatgtggcaggccctgaagctg ctcttctgtgccacagggctccaggtgtcttatctgacttggggtgtgctgcaggaaaga gtgatgacccgcagctatggggccacagccacatcaccgggtgagcgctttacggactcg cagttcctggtgctaatgaaccgagtgctggcactgattgtggctggcctctcctgtgtt ctctgcaagcagccccggcatggggcacccatgtaccggtactcctttgccagcctgtcc aatgtgcttagcagctggtgccaatacgaagctcttaagttcgtcagcttccccacccag gtgctggccaaggcctctaaggtgatccctgtcatgctgatgggaaagcttgtgtctcgg cgcagctacgaacactgggagtacctgacagccaccctcatctccattggggtcagcatg tttctgctatccagcggaccagagccccgcagctccccagccaccacactctcaggcctc atcttactggcaggttatattgcttttgacagcttcacctcaaactggcaggatgccctg tttgcctataagatgtcatcggtgcagatgatgtttggggtcaatttcttctcctgcctc ttcacagtgggctcactgctagaacagggggccctactggagggaacccgcttcatgggg cgacacagtgagtttgctgcccatgccctgctactctccatctgctccgcatgtggccag ctcttcatcttttacaccattgggcagtttggggctgccgtcttcaccatcatcatgacc ctccgccaggcctttgccatccttctttcctgccttctctatggccacactgtcactgtg gtgggagggctgggggtggctgtggtctttgctgccctcctgctcagagtctacgcgcgg ggccgtctaaagcaacggggaaagaaggctgtgcctgttgagtctcctgtgcagaaggtt tga >gi568815592r:44200550_44413323|GENSCAN_predicted_peptide_4|361_aa MSEARKGPDEAEESQYDSGIESLRSLRSLPESTSAPASGPSDGSPQPCTHPPGPVKEPQE KEDADGERADSTYGSSSLTYTLSLLGGPEAEDPAPRLPLPHVGALSPQQLEALTYISEDG DTLVHLAVIHEAPAVLLCCLALLPQEVLDIQNNLYQTALHLAVHLDQPGAVRALVLKGAS RALQDRHGDTALHVACQRQHLACARCLLEGRPEPGRGTSHSLDLQLQNWQGLACLHIATL QKNQPLMELLLRNGADIDVQEGTSGKTALHLAVETQERGLVQFLLQAGAQVDARMLNGCT PLHLAAGRGLMGISSTLCKAGADSLLRNVEDETPQDLTEESLVLLPFDDLKISGKLLLCT D >gi568815592r:44200550_44413323|GENSCAN_predicted_CDS_4|1086_bp atgtcggaggcgcggaaggggccggacgaggcggaggagagccagtacgactctggcatt gagtctctgcgctctctgcgctccctacccgagtccacctcggctccagcctccgggccc tcggacggcagcccccagccctgcacccatcctccgggacccgtcaaggaaccacaggag aaggaagacgcggatggggagcgggctgattccacctatggctcctcctcgctcacctac accctgtccttgctggggggccccgaggctgaggacccggccccacgcctgccactcccc cacgtgggggcgctgagccctcagcagctggaagcactcacttacatctccgaggacgga gacacgctggtccacctggcagtgattcatgaggccccagcggtgctgctctgttgcctg gctttgctgccccaggaggtcctggacattcaaaataacctttaccagacagcactccat ctggctgtacatctggaccaaccgggcgcagttcgggcactggtgctgaagggggccagc cgggcactacaggaccggcatggtgacacagcccttcatgtggcctgccagcgccagcac ttggcctgtgcccgctgcctgctggaagggcggccagagccaggcagaggaacatctcac tctctggacctccagctgcaaaactggcaaggtctggcttgtctccacattgccaccctt cagaagaaccaaccactcatggaattgctgcttcggaatggagctgacattgatgtgcag gagggcaccagtggtaagacagcgctgcacctggctgtggaaacccaagagcggggcctg gtacagttcctgctccaggctggtgcccaggtagatgcccgcatgctgaacgggtgcaca cccctgcacctggcagctggccggggtctcatgggcatctcatccactctgtgcaaggcg ggtgctgactccctgctgcggaatgtggaggatgagacgccccaggacctgactgaggaa tcccttgtccttttgccctttgatgacctgaagatctcagggaaactgctgctgtgtacc gactga >gi568815592r:44200550_44413323|GENSCAN_predicted_peptide_5|600_aa MGSTKGSSVGLERPRARAGCPRPLAAPREVLSSTRPLYAMSPPGSAAGESAAGGGGGGGG PGVSEELTAAAAAAAADEGPAREEPSFTKSLCRESHWKCLLLSLLMYGCLGAVAWCHVTT VTRLTFSSAYQGNSLMYHDSPCSNGYVYIPLAFLLMLYAVYLVECWHCQARHELQHRVDV SSVRERVGRMQQATPCIWWKAISYHYVRRTRQVTRYRNGDAYTTTQVYHERVNTHVAEAE FDYARCGVRDVSKTLVGLEGAPATRLRFTKCFSFASVEAENAYLCQRARFFAENEGLDDY MEAREGMHLKNVDFREFMVAFPDPARPPWYACSSAFWAAALLTLSWPLRVLAEYRTAYAH YHVEKLFGLEGPGSASSAGGGLSPSDELLPPLTHRLPRVNTVDSTELEWHIRSNQQLVPS YSEAVLMDLAGLGTRCGGAGGGYAPSCRYGGVGGPGAAGVAPYRRSCEHCQRAVSSSSIF SRSALSICASPRAGPGPGGGAGCGGSRFSLGRLYGSRRSCLWRSRSGSVNEASCPTEQTR LSSQASMGDDEDDDEEEAGPPPPYHDALYFPVLIVHRQEGCLGHSHRPLHRHGSCVETSL >gi568815592r:44200550_44413323|GENSCAN_predicted_CDS_5|1803_bp atggggagcacgaaaggatcctctgtgggtttggaaaggccaagggcccgcgcaggatgc ccccggcccctggcagccccccgggaggtcctgagctcgacgcgccccctctacgccatg tccccccctggctcggccgcgggagagagcgccgccggcggcggcggcggcggtggcggc cccggggtctcggaggagctcacggcggcggcggcagcggcggcggcggacgagggcccc gcccgagaggagccctctttcaccaagtccctctgccgtgagtcccactggaagtgcctc ctgctctcgctgctcatgtacggctgcctgggggcagtggcctggtgccacgtcaccaca gtgacgcgcctcaccttcagcagcgcctaccagggcaacagcctcatgtaccatgacagc ccctgctccaacggctatgtctacatccccctggccttcctgctcatgttgtacgccgtc tacctggtggagtgttggcactgccaagcccgccatgagctgcagcaccgtgttgatgtg agcagtgtgcgggaacgtgtgggccgcatgcagcaagccacgccctgcatctggtggaag gccatcagctaccactatgtccgccgcacccgccaggtcaccagataccgcaatggagac gcctataccaccacccaggtctaccacgaacgcgtcaacacgcacgtggcggaggctgag ttcgactacgcgcgctgcggcgttcgcgacgtgtccaagacgctggtggggctggagggc gcgccggccacgcggctgcgcttcaccaagtgcttcagtttcgccagcgtggaggccgag aacgcgtacctgtgccagcgcgcgcgcttcttcgcagagaacgagggcctagacgactac atggaggcacgcgagggcatgcacctcaagaacgtggacttccgtgagttcatggtggcc ttcccggacccggcccggccgccctggtacgcctgctcgtcggccttctgggccgcggcg ctgctcacgctgtcgtggccgctgcgagtgctggccgagtaccgcacggcctacgcgcac taccacgtggagaagctatttggcctggagggcccgggctcggccagcagcgcaggcggt ggcctcagccccagcgatgagctgctgcccccgctcacccaccgcctgccgcgggtcaac acagtagacagcacggagctcgagtggcacatccgctccaaccagcagctggtgcccagc tactctgaggcggtgctcatggacctggcggggctcgggacgcgctgcggcggggcaggc ggcggctacgcgccctcgtgccgctacggtggggtaggcggcccgggcgcggcgggcgtg gctccctaccggcgcagctgcgagcactgccagcgcgccgtcagcagctcgtctatcttc tcgcgcagcgccctaagcatctgcgccagcccgcgggccggcccggggcccggtgggggc gcgggctgcgggggcagccgcttctcgctgggccgtctctacggctcccggcgcagctgc ctgtggcgcagccgcagcgggagcgtcaacgaggccagctgccccacggagcagacgcgg ctgtccagccaggccagcatgggggacgacgaggacgacgacgaggaggaggccgggccg ccgccgccctaccacgacgccctctactttccggtcctcatcgtccaccggcaggagggg tgtctgggccacagccaccggccgctgcaccgccacggctcctgcgtagagacctcactg tga >gi568815592r:44200550_44413323|GENSCAN_predicted_peptide_6|1465_aa MAASVAAAARRLRRAIRRSPAWRGLSHRPLSSEPPAAKASAVRAAFLNFFRDRHGHRLVP SASVRPRGDPSLLFVNAGMNQFKPIFLGTVDPRSEMAGFRRVANSQKCVRAGGHHNDLED VGRDLSHHTFFEMLGNWAFGGEYFKEEACNMAWELLTQVYGIPEERLWISYFDGDPKAGL DPDLETRDIWLSLGVPASRVLSFGPQENFWEMGDTGPCGPCTEIHYDLAGGVGAPQLVEL WNLVFMQHNREADGSLQPLPQRHVDTGMGLERLVAVLQGKHSTYDTDLFSPLLNAIQQGC RAPPYLGRVGVADEGRTDTAYRVVADHIRTLSVCISDGIFPGMSGPPLVLRRILRRAVRF SMEILKAPPGFLGSLVPVVVETLIANLVSEDEAAFLASLERGRRIIDRTLRTLGPSDMFP AEVAWSLSLCGDLGLPLDMVELMLEEKGVQLDSAGLERLAQEEAQHRARQAEPVQKQGLW LDVHALGELQRQGVPPTDDSPKYNYSLRPSGSYEFGTCEAQVLQLYTEDGTAVASVGKGQ RCGLLLDRTNFYAEQGGQASDRGYLVRAGQEDVLFPVARAQVCGGFILHEAVAPECLRLG DQVQLHVDEAWRLGCMAKHTATHLLNWALRQTLGPGTEQQGSHLNPEQLRLDVTTQTPLT PEQLRAVENTVQEAVGQDEAVYMEEVPLALTAQVPGLRSLDEVYPDPVRVVSVGVPVAHA LDPASQAALQTSVELCCGTHLLRTGAVGDLVIIGDRQLSKGTTRLLAVTGEQAQQARELG QSLAQEVKAATERLSLGSRDVAEALRLSKDIGRLIEAVETAVMPQWQRRELLATVKMLQR RANTAIRKLQMGQAAKKTQELLERHSKGPLIVDTVSAESLSVLVKVVRQLCEQAPSTSVL LLSPQPMGKVLCACQVAQKAHPFLDTSHGSTGHSARPQSHYPQSGRFHGTVPRPGLATLR ATSSMQDTVTTSALLDPSHSSVSTQDNSSTGGHTSSTSPQLSKPSITPVPAKSRNPHPRA NIRRMRRIIAEDPEWSLAIVPLLTELCIQHIIRNFQKNPILKQMLPEHQQKVLNHLSPDL PLAVTANLIDSENYWLRCCMHRWPVCHVAHHGGSWKRMFFERHLENLLKHFIPGTTDPAV ILDLLPLCRNYVRRVHVDQFLPPVQLPAQLRPGDQSDSGSEGEMEEPTVDHYQLGDLVAG LSHLEELDLVYDVKDCGMNFEWNLFLFTYRDCLSLAAAIKACHTLKIFKLTRSKVDDDKA RIIIRSLLDHPVLEELDLSQNLIGDRGARGAAKLLSHSRLRVLNLANNQVRAPGAQSLAH ALAHNTNLISLNLRLNCIEDEGGQALAHALQTNKCLTTLHLGGNELSEPTATLLSQVLAI NTTLTSINLSCNHIGLDGGKQLLEGMSDNKTLLEFDLRLSDVAQESEYLIGQALYANREA ARQRALNPSHFMSTITANGPENSVG >gi568815592r:44200550_44413323|GENSCAN_predicted_CDS_6|4398_bp atggcagcgtcagtggcagctgcagcccggaggctgcggcgggccattcgaaggtcgccc gcatggcggggcctcagccatcggccgctctcatcggagccccctgcagccaaggcctcg gccgtgagggccgcctttctgaacttctttcgggaccgccatggccaccggctggtgccc tccgcttccgtgcggccccgcggcgaccccagtttgctttttgtcaatgcgggcatgaac cagttcaagccaatctttctgggcaccgtggatccacgaagcgagatggcaggcttccga cgtgtggccaacagccagaaatgtgtgagagctggaggacaccataacgacctggaagat gtgggtcgagacctttcccatcataccttctttgaaatgcttggcaattgggcctttggg ggtgaatattttaaggaggaggcttgtaacatggcctgggaactgctgactcaggtctat gggatccctgaggaaaggctctggatctcctactttgatggtgaccccaaggcagggctg gacccagacctggagaccagggacatctggctgagcttaggggtgcctgctagccgtgtg ctttcctttggaccacaagagaacttctgggagatgggggatactggcccttgtgggccc tgtactgagatccactacgaccttgctggtggggtgggagccccccagctggtagagctt tggaacctggtcttcatgcaacacaacagagaggcagatggaagcctgcagcccctgccc cagcggcatgtggacacaggaatgggcctggaaaggctggtggctgtgctgcaaggcaaa cactccacctatgacactgacctcttttccccgctgctcaacgccatacagcagggctgc agggcacccccttacttgggccgagtaggggtggcagacgaggggcgcacagacacagcg taccgcgtggtggctgaccacatccgcacactcagtgtctgcatctctgatggcatcttc cctgggatgtcaggtcccccgctggttcttcgtcggatcctgcgtcgagctgtgcgtttc tccatggagatcttaaaggcaccacctggcttcctaggcagcctggtacctgtagtggtg gagacactgatcgccaacctggtgtcagaggacgaggcagccttcctggcctccctggag cggggtaggcggatcattgatcggactctgaggaccctggggccttcagatatgttccct gctgaagtggcctggtccttgtcactgtgtggagacctgggactccccttggacatggta gagctgatgctggaggagaaaggggtccagctagactccgctggactggagcggttggcc caagaggaggcccagcaccgggcacggcaggctgagccagttcagaagcagggattgtgg cttgatgtccatgcgcttggggagctgcagcgccaaggagtgcccccaactgacgacagc cccaagtacaactactccctgcgacccagcggaagttatgagttcggcacctgtgaggcc caggtgttgcaactgtatacagaggacgggacagcagtggcctccgtggggaaaggccag cgctgtggcctcctcttggacaggaccaacttctacgcagaacaggggggccaggcttca gaccgtggctacctggtgcgggcagggcaagaggacgtgctgttcccagtagcccgggcc caggtctgtggaggtttcatcctgcatgaggcagtagcccctgagtgcctgcggttaggg gaccaggtgcagctgcatgtggatgaggcctggcgtctaggctgcatggcgaagcatacg gccacccacctgctgaactgggcactgaggcagaccctgggccctggcacagagcagcag ggctcccatctcaatcctgagcagctgcgcttggatgtgaccacccagaccccattgacc ccagagcagctccgggcagtggagaacactgtgcaggaggccgtggggcaggatgaggct gtgtacatggaggaggtgcccctggcgctcactgcccaggtccctggcctgcgctctctg gatgaggtttacccagaccctgtgcgggtggtatcagtgggggtgcccgtggcccatgca ttggacccagcctcccaagccgcactgcagacctctgtggagctatgctgtgggacgcac ctgttacgtactggggctgtaggggacctggttatcatcggggaccgccagctttccaag ggcactacccgcctgctggccgtcactggggagcaggcccagcaggcccgagagctaggc cagagcctggcccaggaagtgaaagcggccactgagcggctgagtctggggagccgggat gtggcggaggcactgaggctgtccaaggacataggacgactcattgaagctgtggaaact gctgtgatgccccagtggcagcggcgggagctgctggccacagtgaagatgctgcagcgg cgtgccaacactgccatccgtaagctgcaaatgggacaggctgcaaagaaaactcaggag ctgctggagcggcactcgaaggggcctctgattgtggacacagtctctgctgagtctctc tcagtgctggtgaaggtggtacggcagctgtgtgagcaggcccccagcacgtctgtgctc ctactcagcccccagcccatggggaaggtgctgtgtgcctgtcaggtggcccagaaagcc cacccgttcctagacacgtcccacggaagcacagggcatagcgcaaggccacagtcccac tacccgcaaagcggccgcttccacggaaccgtcccgaggccgggcctggccaccctgcgc gcgacctccagcatgcaggataccgtaacgacatcagcattgttggaccccagccactcc tcagtctccacccaggacaattcctccactggaggacacacttcaagcacaagcccacag ctctcaaagccttcaatcacaccagtccctgcaaagtccaggaacccacatcccagggcc aatatccgtcggatgcgccggatcattgctgaggatcctgagtggtcactggccatcgtg cccctcctcacagagctctgcattcagcacattatcaggaacttccagaaaaaccctatc ctgaagcagatgctcccggaacaccagcagaaggtcctgaaccacctgtcccctgaccta ccactggctgtgaccgccaacctgatagacagtgagaactactggctccgctgctgcatg catcgctggcccgtgtgccacgtggcccaccatggcggcagctggaaacgcatgttcttc gagcggcacctggagaacctgctaaagcactttatcccaggcaccacagaccctgcggtg atcctcgacctgctgccgctctgccggaattacgtgcgcagggtccacgtcgatcagttc cttccgccggtgcagctcccggcccagctccggccgggcgaccagtccgactcaggcagc gagggagagatggaggagcccaccgttgaccactaccaactgggcgatctggtagctggc ctgagccacctggaggagctggacctggtgtacgatgtcaaggactgcggcatgaatttc gagtggaatctcttcctcttcacctaccgtgactgcctctccttggcagccgccatcaag gcatgccacaccctcaagatcttcaagctgacccgaagcaaggtggatgatgacaaggca cgcatcataattcgaagccttctggaccacccagtcctcgaggagctggacttgtcacaa aacctcattggagaccgtggtgcacgaggtgctgccaagctgctgagccacagccgcctg cgtgtgctcaacctggctaacaaccaggtgcgtgcacccggtgcccagtccctggctcac gctctggcacacaacaccaacctcatttccctcaacctacgtctcaactgcatcgaggat gagggtggccaggctcttgcccatgccttgcagaccaacaagtgcctcaccacgctgcac ctcggtggcaatgagctgtctgagcccaccgccacactcctgtcacaggtgctcgccatc aacaccacactcaccagcatcaacctgtcctgcaaccacatcgggctggacggtgggaag cagctcctggaaggcatgtcagacaacaagaccctcctggaatttgacttgcgcctgtca gatgtggcccaggaaagcgagtacctcattggccaggccctctacgcaaaccgagaagca gcccgccagcgggccctgaatcccagccacttcatgtcaaccataactgccaatggccct gagaactctgtgggataa >gi568815592r:44200550_44413323|GENSCAN_predicted_peptide_7|651_aa MALNSHSLPEKLRVRLRSSSHLPVMDCRSESEPWGLHIEQVPQGDSRDWANSGSSVQLGI WKWNWIKYSAVMELDFKKEITAKRANCSDFLESKGCFANTTPSGKSVSSSSSVETGPSVS EPPGLPRVSAYVDTTADLDRKLSFSHSDHSSEMSLPEVQKDKYPEEFSLLKLQTNIDPRN GIPKLTPGDNPYMYPEQSKGFHKAGSMLPPVNFSIVPYEKKFDTFIPLEPLPQIPNLPFW VKEKANSLKNEIQEVEELDNWQPAVPLMHMLHLSAPLGCLPRHPAAKMPRIMIKGGVWRN TEDEILKAAVMKYGKNQWSRIASLLHRKSAKQCKARWYEWLDPSIKKTEWSREEEEKLLH LAKLMPTQWRTIAPIIGRTAAQCLEHYEFLLDKAAQRDNEEETTDDPRKLKPGEIDPNPE TKPARPDPIDMDEDELEMLSEARARLANTQGKKAKRKAREKQLEEARRLAALQKRRELRA AGIEIQKKRKRKRGVDYNAEIPFEKKPALGFYDTSEENYQALDADFRKLRQQDLDGELRS EKEGRDRKKDKQHLKRKKESDLPSAILQTSGVSEFTKKRSKLVLPAPQISDAELQEVVKV GQASEIARQTAEESGITNSASSTLLSEYNVTNNSVALRTPRTPASQDRILQ >gi568815592r:44200550_44413323|GENSCAN_predicted_CDS_7|1953_bp atggcgctgaacagccacagtctcccagaaaagctcagagtgcgcctgcgctcaagttct catttacctgttatggactgtaggtctgagtcagagccctggggcctgcatattgaacaa gtaccccaaggggattctcgtgactgggcaaattcaggaagttctgtgcagttgggaatc tggaagtggaactggataaaatacagtgcagtgatggaactagacttcaagaaggagatt actgccaaacgtgctaattgcagtgattttctggaatctaagggatgttttgccaacaca acaccctctggcaaaagtgtcagttcctcatcttctgtggaaacaggcccaagtgtcagt gagcctcctggcctccccagagtgtctgcttacgtagacaccactgctgacttggatcgg aaactctccttctcacattctgatcactcctctgaaatgtcgttgcctgaagtccaaaag gataaatatcctgaggaattcagcctgcttaagttgcagacgaatattgatcccaggaat ggaatcccaaagttaactccaggcgacaatccatatatgtacccagaacagagtaaaggc ttccacaaagcaggatcaatgctcccaccagtgaatttttcaatagtgccttatgaaaag aaatttgatacatttattccacttgagcctcttccacaaattcccaacttgcctttctgg gtgaaggagaaggccaacagtttgaaaaatgagatacaagaggttgaggagcttgacaac tggcagccagcagtgcccttaatgcacatgctacacctttctgctcctctcggctgcttg ccgagacaccctgccgccaagatgcctcgaattatgatcaaggggggcgtatggaggaat accgaggatgaaattctgaaagcagcggtaatgaaatatgggaaaaatcagtggtctagg attgcctcattgctgcatagaaaatcagcaaagcagtgcaaagccagatggtatgaatgg ctggatccaagcattaagaagacagaatggtccagagaagaagaggaaaaactcttgcac ttggccaagttgatgccaactcagtggaggaccattgctccaatcattggaagaacagcg gcccagtgcttagaacactatgaatttcttctggataaagctgcccaaagagacaatgaa gaggaaacaacagatgatccacgaaaacttaaacctggagaaatagatccaaatccagaa acaaaaccagcgcggcctgatccaattgatatggatgaggatgaacttgagatgctttct gaagccagagcccgcttggctaatactcagggaaagaaggccaagaggaaagcaagagag aaacaattggaagaagcaagacgtcttgctgccctccaaaaaagaagagaacttcgagca gctggcatagaaattcagaagaaaagaaaaaggaagagaggagttgattataatgccgaa atcccatttgaaaaaaagcctgcccttggtttttatgatacttctgaggaaaactaccaa gctcttgacgcagatttcaggaaattaagacaacaggatcttgatggggagctaagatct gaaaaagaaggaagagatagaaaaaaagacaaacagcatttgaaaaggaaaaaagaatct gatttaccatcagctattcttcaaactagtggtgtttctgaatttactaaaaagagaagc aaactagtacttcctgcccctcagatttcagatgcagaactccaggaagttgtaaaagta ggccaagcgagtgaaattgcacgtcaaactgccgaggaatctggcataacaaattctgct tccagtacacttttgtctgagtacaatgtcaccaacaacagcgttgctcttagaacacca cgaacaccagcttcccaggacagaattctgcag