GENSCAN 1.0 Date run: 3-Nov-116 Time: 19:40:28 Sequence gi568815595r:48304489_48524611 : 220123 bp : 47.81% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 14023 14152 130 2 1 54 58 89 0.003 2.25 1.02 Term + 23743 23953 211 1 1 45 45 149 0.021 3.07 1.03 PlyA + 24128 24133 6 1.05 2.04 PlyA - 25127 25122 6 1.05 2.03 Term - 26612 26475 138 2 0 86 49 110 0.122 4.86 2.02 Intr - 27773 27629 145 1 1 8 15 160 0.033 0.98 2.01 Init - 45193 45138 56 1 2 83 113 33 0.543 6.06 2.00 Prom - 55584 55545 40 -3.16 3.00 Prom + 61317 61356 40 -2.36 3.01 Init + 68280 68369 90 2 0 55 98 114 0.920 9.62 3.02 Intr + 73895 74038 144 1 0 60 115 58 0.753 6.18 3.03 Intr + 74864 75070 207 1 0 97 96 5 0.506 1.47 3.04 Intr + 76214 76424 211 1 1 118 102 31 0.904 5.99 3.05 Intr + 77212 77390 179 2 2 16 68 198 0.956 10.24 3.06 Intr + 91298 91324 27 1 0 121 91 27 0.012 4.61 3.07 Intr + 95027 95090 64 1 1 116 84 7 0.015 1.39 3.08 Term + 97277 97434 158 0 2 31 48 107 0.013 -0.90 3.09 PlyA + 97507 97512 6 1.05 4.36 PlyA - 98482 98477 6 1.05 4.35 Term - 100102 99998 105 1 0 80 49 173 0.898 11.01 4.34 Intr - 101022 100951 72 0 0 105 54 35 0.725 1.40 4.33 Intr - 101310 101236 75 0 0 115 80 141 0.999 15.71 4.32 Intr - 102410 102335 76 1 1 90 82 72 0.998 6.32 4.31 Intr - 102603 102539 65 0 2 95 79 127 0.689 10.02 4.30 Intr - 105021 104841 181 2 1 82 88 223 0.737 21.57 4.29 Intr - 105243 105083 161 0 2 142 75 313 0.988 34.19 4.28 Intr - 105589 105417 173 2 2 31 81 237 0.999 16.96 4.27 Intr - 105892 105808 85 1 1 -3 75 105 0.576 -0.51 4.26 Intr - 106070 105967 104 0 2 99 97 52 0.984 7.09 4.25 Intr - 106548 106380 169 0 1 91 73 145 0.998 12.82 4.24 Intr - 107521 107336 186 1 0 101 77 206 0.502 20.69 4.23 Intr - 107816 107750 67 1 1 92 72 108 0.982 8.51 4.22 Intr - 108132 107954 179 0 2 87 62 261 0.992 22.12 4.21 Intr - 108471 108254 218 1 2 109 75 332 0.999 32.22 4.20 Intr - 108681 108581 101 2 2 5 64 154 0.997 4.45 4.19 Intr - 109330 109182 149 1 2 54 94 139 0.990 10.13 4.18 Intr - 109583 109407 177 2 0 92 73 242 0.716 23.22 4.17 Intr - 110553 110311 243 0 0 113 77 178 0.955 16.89 4.16 Intr - 110859 110688 172 2 1 36 21 113 0.450 -0.65 4.15 Intr - 111271 111095 177 0 0 91 41 150 0.915 9.63 4.14 Intr - 111679 111543 137 1 2 150 42 101 0.999 11.07 4.13 Intr - 111981 111858 124 2 1 45 82 183 0.556 13.99 4.12 Intr - 113549 113423 127 0 1 -18 115 163 0.542 8.14 4.11 Intr - 113875 113753 123 2 0 104 65 71 0.508 6.96 4.10 Intr - 114054 113961 94 0 1 105 89 116 0.916 12.94 4.09 Intr - 114551 114408 144 2 0 94 43 184 0.801 14.98 4.08 Intr - 114878 114756 123 2 0 81 82 105 0.999 9.98 4.07 Intr - 115769 115089 681 2 0 113 81 302 0.643 23.79 4.06 Intr - 116285 116177 109 1 1 100 109 41 0.993 7.69 4.05 Intr - 116468 116360 109 0 1 132 73 101 0.803 12.34 4.04 Intr - 116775 116736 40 0 1 97 72 33 0.359 0.40 4.03 Intr - 117971 117843 129 2 0 62 79 70 0.892 4.39 4.02 Intr - 118459 118277 183 1 0 78 84 154 0.960 13.98 4.01 Init - 120123 119017 1107 0 0 91 89 1341 0.961 126.86 4.00 Prom - 122231 122192 40 -8.76 5.05 PlyA - 123747 123742 6 1.05 5.04 Term - 128678 127920 759 2 0 89 40 590 0.999 47.73 5.03 Intr - 129383 129219 165 2 0 76 96 189 0.998 18.66 5.02 Intr - 130648 130329 320 2 2 135 78 170 0.669 16.38 5.01 Init - 131612 131555 58 2 1 79 94 4 0.593 0.73 5.00 Prom - 134855 134816 40 -7.36 6.00 Prom + 134978 135017 40 -9.75 6.01 Init + 135809 135824 16 1 1 94 115 27 0.976 5.45 6.02 Intr + 135915 135970 56 1 2 104 78 108 0.988 10.10 6.03 Intr + 136053 136140 88 2 1 116 106 107 0.972 14.84 6.04 Intr + 142240 142344 105 2 0 29 72 74 0.011 0.09 6.05 Intr + 142380 142604 225 1 0 -4 81 238 0.014 11.86 6.06 Intr + 145549 145682 134 2 2 56 115 71 0.989 6.96 6.07 Intr + 149812 149930 119 0 2 90 95 53 0.900 5.46 6.08 Intr + 152777 152928 152 2 2 41 110 2 0.366 -2.49 6.09 Intr + 155299 155428 130 2 1 92 115 15 0.628 4.35 6.10 Intr + 159257 159393 137 2 2 96 103 83 0.985 10.81 6.11 Intr + 159553 159644 92 2 2 78 83 67 0.998 4.91 6.12 Intr + 160094 160174 81 1 0 83 105 56 0.988 6.63 6.13 Intr + 160343 160595 253 1 1 100 121 331 0.999 34.61 6.14 Intr + 160999 161051 53 2 2 89 34 86 0.874 1.93 6.15 Intr + 161536 161659 124 0 1 79 69 47 0.887 2.06 6.16 Term + 162142 163112 971 2 2 103 47 882 0.962 77.82 6.17 PlyA + 163134 163139 6 1.05 7.08 PlyA - 163407 163402 6 1.05 7.07 Term - 164698 164619 80 2 2 88 53 103 0.976 4.73 7.06 Intr - 165085 164873 213 2 0 57 105 148 0.941 12.09 7.05 Intr - 165366 165240 127 0 1 14 65 142 0.849 4.75 7.04 Intr - 172467 172332 136 2 1 37 62 73 0.383 0.17 7.03 Intr - 174769 174689 81 0 0 86 110 127 0.417 13.45 7.02 Intr - 196805 196670 136 0 1 72 75 102 0.322 6.93 7.01 Init - 199606 199531 76 1 1 91 77 204 0.990 18.85 7.00 Prom - 200437 200398 40 -6.76 8.03 PlyA - 201755 201750 6 -0.45 8.02 Term - 204584 204498 87 2 0 90 45 83 0.900 1.86 8.01 Init - 207047 206937 111 2 0 97 71 104 0.925 9.81 8.00 Prom - 211272 211233 40 -8.26 9.06 PlyA - 211956 211951 6 1.05 9.05 Term - 212236 212099 138 1 0 80 37 66 0.320 -1.34 9.04 Intr - 213101 213027 75 2 0 115 64 18 0.339 1.81 9.03 Intr - 217562 217498 65 0 2 45 69 90 0.722 1.24 9.02 Intr - 219111 219049 63 1 0 91 91 66 0.854 5.89 9.01 Intr - 219342 219213 130 0 1 107 64 243 0.866 24.07 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 95564 95505 60 2 0 41 105 84 0.897 4.77 S.002 Init + 142358 142604 247 1 1 93 81 264 0.941 21.96 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:48304489_48524611|GENSCAN_predicted_peptide_1|113_aa XSFDSVMLRIPQMMLSDCQVSASVPVIEADTKVVYLTTLVWVLVFTAENVGRARTHNSPI AKAIHMEVARTRMASEQIPFMVTEELWKYATFDVGQTSMRSTSSHRDPSLTRK >gi568815595r:48304489_48524611|GENSCAN_predicted_CDS_1|342_bp ncatcgtttgactctgtcatgctgcgcattccccagatgatgctttctgattgtcaggtg tcagcgtctgtgccagtgattgaggctgacaccaaagttgtttaccttacaactctcgta tgggttctagtatttactgcagaaaatgtgggcagagcaaggacacataactcaccaatt gcaaaggccatccacatggaggtagctagaacaaggatggcgtctgagcagatccccttc atggtgacagaagaactgtggaaatatgcaacttttgatgtaggacaaacatctatgcga agtacaagttctcacagagatccctcactgaccaggaaatga >gi568815595r:48304489_48524611|GENSCAN_predicted_peptide_2|112_aa MHSLITPVIQVDHRHPPATDQKEQAELDKRDKKKATALVMAFRQVDFGGSGKVKGWANRM PNRDLIPNQSCKTTNVLQMEPQMQSLTKIYQGSLDRPASPCCDIDDIKGTPP >gi568815595r:48304489_48524611|GENSCAN_predicted_CDS_2|339_bp atgcacagcctcattactcctgtcatacaagtggaccatcgtcatcctccagccacagat cagaaggagcaggcagaactggacaaacgggataaaaagaaggccaccgctttagtcatg gccttcaggcaagtggactttggaggctctggaaaagtgaaaggctgggcaaatcgaatg cctaatagggatttgattccaaatcaaagctgtaaaactacaaatgttcttcaaatggag ccccagatgcagtccctgactaagatctaccagggatccttggaccggcctgctagccca tgctgtgatattgatgacatcaaaggcacccctccctag >gi568815595r:48304489_48524611|GENSCAN_predicted_peptide_3|359_aa MEIRLPDLALKRIFSFLDLFGLLQVSQVNKMHLAITMDRKKTIKVWNCQDRDALAVLPMP QPCYCMEAYLTKDGPFLMDFTYLPIDMVGMLWFQVGDAAGDIYTFTLPGLRDVSKVTAFQ YGIVLLHCSPDKKWVFACGTYSRTLPQVFLTESLLRPSEGSVPLSTFLPHKLCASACWTP KVKNRITLMSQSSTGKKTEFITFDLTTKKTGGQTVIQAYEIASFQVAAHLKCPIWMGASD GYMIVFTSGPYLLLFSITGFLLQRFEDHQAAINNFWVTSSDGYCPLSPGLVTAVLHHPLL ASQSLQLWNLVEETHASAAIGAKQNTARFPVSPSTTQSHLLALASGIQVLTIWKVSGGF >gi568815595r:48304489_48524611|GENSCAN_predicted_CDS_3|1080_bp atggagatccgattgcctgacttagctttgaagcgaatcttctctttcctggacctgttc ggcttgctgcaggtttcccaggtgaacaagatgcatctcgccatcactatggatcggaaa aaaactatcaaagtgtggaactgtcaggacagggacgctctggctgttctccccatgcca cagccctgttattgcatggaagcctatcttacaaaggatggcccattcctgatggatttc acatatcttcctattgatatggtggggatgctttggtttcaggttggcgatgctgcaggt gacatctacacatttacactgcctgggttaagagatgtttctaaagttactgcatttcaa tatggtattgtacttctacactgctctcctgacaagaaatgggtatttgcatgtgggaca tacagtcgtaccttgccacaggtattcctcacagagtccttactgagaccatcagaaggc agtgttcctctgtctacctttctcccacataaattatgtgccagcgcctgctggacccca aaggtgaaaaacaggataacactgatgtcccaaagtagcactggaaaaaagacagaattt atcacctttgatctaacaaccaagaagactggaggccaaacagtcatccaagcatatgag atcgcaagtttccaggtggcagctcatctgaagtgccctatctggatgggagccagtgat ggatatatgattgtctttaccagtggaccatacttgttactcttcagcatcactggcttc ctgctgcaacgatttgaggaccatcaggcagccatcaacaacttctgggtgacctcatct gacggctattgccctttgagcccaggactggtaacagctgtgcttcaccatcccttattg gcttcccagagcctgcagctctggaacctggtggaggaaacacatgcatcagcagccatt ggtgccaagcagaacacagctaggttccctgtatcccccagcaccacccagagtcacctg ttggcactggccagtgggatccaggtcctgaccatttggaaggtttcaggtggcttctaa >gi568815595r:48304489_48524611|GENSCAN_predicted_peptide_4|2054_aa MPALGPALLQALWAGWVLTLQPLPPTAFTPNGTYLQHLARDPTSGTLYLGATNFLFQLSP GLQLEATVSTGPVLDSRDCLPPVMPDECPQAQPTNNPNQLLLVSPGALVVCGSVHQGVCE QRRLGQLEQLLLRPERPGDTQYVAANDPAVSTVGLVAQGLAGEPLLFVGRGYTSRGVGGG IPPITTRALWPPDPQAAFSYEETAKLAVGRLSEYSHHFVSAFARGASAYFLFLRRDLQAQ SRAFRAYVSRVCLRDQHYYSYVELPLACEGGRYGLIQAAAVATSREVAHGEVLFAAFSSA APPTVGRPPSAAAGASGASALCAFPLDEVDRLANRTRDACYTREGRAEDGTEVAYIEYDV NSDCAQLPVDTLDAYPCGSDHTPSPMASRVPLEATPILEWPGIQLTAVAVTMEDGHTIAF LGDSQGQLHRVYLGPGSDGHPYSTQSIQQGSAVSRDLTFDGTFEHLYVMTQSTTLVRPQC CREEPVDYVSVSVELRFGAVVIAKTSLSFYDCVAVTELRPSAQCQACVSSRWGCNWCVWQ HLCTHKASCDAGPMVASHQSPLVSPDPPARGGPSPSPPTAPKALATPAPDTLPVEPGAPS TATASDISPGASPSLLSPWGPWAGSGSISSPGSTGSPLHEEPSPPSPQNGPGTAVPAPTD FRPSATPEDLLASPLSPSEVAAVPPADPGPEALHPTVPLDLPPATVPATTFPGAMGSVKP ALDWLTREGGELPEADEWTGGDAPAFSTSTLLSGDGDSAELEGPPAPLILPSSLDYQYDT PGLWELEEATLGASSCPCVESVQGSTLMPVHVEREIRLLGRNLHLFQDGPGDNECVMELE GLEVVVEARVECEPPPDTQCHVTCQQHQVHTMSAWLSYEALQPELRVGLFLRRAGRLRVD SAEGLHVVLYDCSVGHGDCSRCQTAMPQYGCVWCEGERPRCVTREACDGGTRVTIRGSNL GQHVQDVLGMVTVAGVPCAVDAQEYEVSSSLPCAYSLVCITGASGEEVAGATAVEVPGRG RGVSEHDFAYQDPKVHSIFPARGPRAGGTRLTLNGSKLLTGRLEDIRVVVGDQPCHLLPE QQSEQLRCETSPRPTPATLPVAVWFGATERRLQRGQFKYTLDPNITSAGPTKSFLSGGRE ICVRGQNLDVVQTPRIRVTVVSRMLQPSQGLGRRRRVVPETACSLGPSCSSQQFEEPCHV NSSQLITCRTPALPGLPEDPWVRVEFILDNLVFDFATLNPTPFSYEADPTLQPLNPEDPT MPFRHKPGSVFSVEGENLDLAMSKEEVVAMIGDGPCVVKTLTRHHLYCEPPVEQPLPRHH ALREAPDSLPEFTVQMGNLRFSLGHVQYDGESPGAFPVAAQVGLGVGTSLLALGVIIIVL MYRRKSKQALRDYKKVQIQLENLESSVRDRCKKEFTDLMTEMTDLTSDLLGSGIPFLDYK VYAERIFFPGHRESPLHRDLGVPESRRPTVEQGLGQLSNLLNSKLFLTKFIHTLESQRTF SARDRAYVASLLTVALHGKLEYFTDILRTLLSDLVAQYVAKNPKLMLRRTETVVEKLLTN WMSICLYTFVRDSVGEPLYMLFRGIKHQVDKGPVDSVTGKAKYTLNDNRLLREDVEYRPL VRVCKCVACSLSATLNALLAVGPGAGEAQGVPVKVLDCDTISQAKEKMLDQLYKGVPLTQ RPDPRTLDVEWRSGVAGHLILSDEDVTSEVQGLWRRLNTLQHYKVPDGATVALVPCLTKH VLRENQDYVPGERTPMLEDVDEGGIRPWHLVKPSDEPEPPRPRRGSLRGGERERAKAIPE IYLTRLLSMKGTLQKFVDDLFQVILSTSRPVPLAVKYFFDLLDEQAQQHGISDQDTIHIW KTNSTFIGHACPTRSLPLRFWINIIKNPQFVFDVQTSDNMDAVLLVIAQTFMDACTLADH KLGRDSPINKLLYARDIPRYKRMVERYYADIRQTVPASDQEMNSVLAELSWNYSGDLGAR VALHELYKYINKYYDQLPSRTARHSAGGCGGQGVYQGLEMIITALEEDGTAQKMQLGYRL QQIAAAVENKVTDL >gi568815595r:48304489_48524611|GENSCAN_predicted_CDS_4|6165_bp atgcctgctctgggcccagctcttctccaggctctctgggccgggtgggtcctcaccctc cagccccttccaccaactgcattcactcccaatggcacgtatctgcagcacctggcaagg gaccccacctcaggcaccctctacctgggggctaccaacttcctgttccagctgagccct gggctgcagctggaggccacagtgtccaccggccctgtgctagacagcagggactgcctg ccacctgtgatgcctgatgagtgcccccaggcccagcctaccaacaacccgaatcagctg ctcctggtgagcccaggggccctggtggtatgcgggagcgtgcaccagggggtctgtgaa cagcggcgcctggggcagctcgagcagctgctgctgcggccagagcggcctggggacaca caatatgtggctgccaatgatcctgcggtcagcacggtggggctggtagcccagggcttg gcaggggagcccctcctgtttgtggggcgaggatacaccagcaggggtgtggggggtggc attccacccatcacaacccgggccctgtggccgcccgacccccaagctgccttctcctat gaggagacagccaagctggcagtgggccgcctctccgagtacagccaccacttcgtgagt gcctttgcacgtggggccagcgcctacttcctgttcctgcggcgggacctgcaggctcag tctagagcttttcgtgcctatgtatctcgagtgtgtctccgggaccagcactactactcc tatgtggagttgcctctggcctgcgaaggtggccgctacgggctgatccaggctgcagct gtggccacgtccagggaggtggcgcatggggaggtgctctttgcagctttctcctcggct gcaccccccactgtgggccggcccccatcggcggctgctggggcatctggagcctctgcc ctctgtgccttccccctggatgaggtggaccggcttgctaatcgcacgcgagatgcctgc tacacccgggagggtcgtgctgaggatgggaccgaggtggcctacatcgagtatgatgtc aattctgactgtgcacagctgccagtggacaccctggatgcttatccctgtggctcagac cacacgcccagccccatggccagccgggtcccgctggaagccacaccaattctggagtgg ccagggattcagctaacagctgtggcagtcaccatggaagatggacacaccatcgctttc ctgggtgatagtcaagggcagctgcacagggtctacttgggcccagggagcgatggccac ccatactccacacagagcatccagcaggggtctgcagtgagcagagacctcacctttgat gggacctttgagcacctgtatgtcatgacccagagcacaaccctagtgaggccccagtgc tgccgagaggagccggtggactacgtatccgtgagcgtggagctcagatttggcgctgtt gtgatcgccaaaacttccctctctttctatgactgtgtggcggtcactgaactccgccca tctgcgcagtgccaggcctgtgtgagcagccgctgggggtgtaactggtgtgtctggcag cacctgtgcacccacaaggcctcgtgtgatgctgggcccatggttgcaagccatcagagc ccgcttgtctccccagaccctcctgcaagaggtggacccagcccctccccacccacagcc cccaaagccctggccacccctgctcctgacacccttcccgtggagcctggggctccctcc acagccacagcttcggacatctcacctggggctagtccttccctgctcagcccctggggg ccatgggcaggttctggctccatatcttcccctggctccacagggtcgcctctccatgag gagccctcccctcccagcccccaaaatggacctggaaccgctgtccctgcccccactgac ttcagaccctcagccacacctgaggacctcttggcctccccgctgtcaccctcagaggta gcagcagtgccccctgcagaccctggccccgaggctcttcatcccacagtgcccctggac ctgccccctgccactgttcctgccaccactttcccaggggccatgggctccgtgaagccc gccctggactggctcacgagagaaggcggcgagctgcccgaggcggacgagtggacgggg ggtgacgcacccgccttctccacttccaccctcctctcaggtgatggagactcagcagag cttgagggccctcccgcccccctcatcctcccgtccagcctcgactaccagtatgacacc cccgggctctgggagctggaagaggcgaccttgggggcaagctcctgcccctgtgtggag agcgttcagggctccacgttgatgccggtccatgtggagcgggaaatccggctgctaggc aggaacctgcaccttttccaggatggcccaggagacaatgagtgtgtgatggagctggag ggcctcgaggtggtggttgaggcccgggtcgagtgtgagccacctccagatacccagtgc catgtcacctgccagcagcaccaggtgcacaccatgagcgcctggctcagctatgaggct ctgcagccggagctccgtgtggggctgtttctgcgtcgggccggccgtctgcgtgtggac agtgctgaggggctgcatgtggtactgtatgactgttccgtgggacatggagactgcagc cgctgccaaactgccatgccccagtatggctgtgtgtggtgtgagggggagcgtccacgt tgtgtgacccgggaggcctgtgacggaggcacccgtgtcaccatcaggggctccaacctg ggccagcatgtgcaggatgtgctgggcatggtcacggtggctggagtgccctgtgctgtg gatgcccaggagtacgaggtctccagcagcctgccctgtgcctacagcctcgtgtgcatc accggggccagtggggaggaggtggccggcgccacagcggtggaggtgccgggaagagga cgtggtgtctcagaacacgactttgcctaccaggatccgaaggtccattccatcttcccg gcccgcggccccagagctgggggcacccgtctcaccctgaatggctccaagctcctgact gggcggctggaggacatccgagtggtggttggagaccagccttgtcacttgctgccggag cagcagtcagaacaactgcggtgtgagaccagcccacgccccacgcctgccacgctccct gtggctgtgtggtttggggccacggagcggaggcttcaacgcggacagttcaagtatacc ttggaccccaacatcacctctgctggccccaccaagagcttcctcagtggaggacgtgag atatgcgtccgtggccagaatctggacgtggtacagacgccaagaatccgggtgaccgtg gtctcgagaatgctgcagcccagccaggggcttggacggaggcgtcgcgtggtcccggag acggcatgttcccttggaccctcctgcagtagccagcaatttgaggagccgtgccatgtc aactcctcccagctcatcacgtgccgcacacctgccctcccaggcctgcctgaggacccc tgggtccgggtggaatttatccttgacaacctggtctttgactttgcaacactgaacccc acacctttctcctatgaggccgaccccaccctgcagccactcaaccctgaggaccccacc atgccattccggcacaagcctgggagtgtgttctccgtggagggggagaacctggacctt gcaatgtccaaggaggaggtggtggctatgataggggatggcccctgtgtggtgaagacg ctgacgcggcaccacctgtactgcgagccccccgtggagcagcccctgccacggcaccat gccctccgagaggcacctgactctttgcctgagttcacggtgcagatggggaacttgcgc ttctccctgggtcacgtgcagtatgacggcgagagccctggggcttttcctgtggcagcc caggtgggcttgggggtgggcacctctcttctggctctgggtgtcatcatcattgtcctc atgtacaggaggaagagcaagcaggccctgagggactataagaaggttcagatccagctg gagaatctggagagcagtgtgcgggaccgctgcaagaaggaattcacagacctcatgact gagatgaccgatctcaccagtgacctcctgggcagcggcatccccttcctcgactacaag gtgtatgcggagaggatcttcttccctgggcaccgcgagtcgcccttgcaccgggacctg ggtgtgcctgagagcagacggcccactgtggagcaagggctggggcagctctctaacctg ctcaacagcaagctcttcctcaccaagttcatccacacgctggagagccagcgcaccttt tcagctcgggaccgtgcctacgtggcatctctgctcaccgtggcactgcatgggaagctt gagtatttcactgacatcctccgcactctgctcagtgacctggttgcccagtatgtggcc aagaaccccaagctgatgctgcgcaggacagagactgtggtggagaagctgctcaccaac tggatgtccatctgtctgtataccttcgtgagggactccgtaggggagcctctgtacatg ctctttcgagggattaagcaccaagtggataaggggccagtggacagtgtgacaggcaag gccaaatacaccttgaacgacaaccgcctgctcagagaggatgtggagtaccgtcccctg gtgagggtgtgcaagtgtgtggcctgcagtctgagtgcgaccttgaatgcactattggct gtggggcctggggcaggagaggcccagggcgtgcccgtgaaggtcctagactgtgacacc atctcccaggcaaaggagaagatgctggaccagctttataaaggagtgcctctcacccag cggccagaccctcgcacccttgatgttgagtggcggtctggggtggccgggcacctcatt ctttctgacgaggatgtcacttctgaggtccagggtctgtggaggcgcctgaacacactg cagcattacaaggtcccagatggagcaactgtggccctcgtcccctgcctcaccaagcat gtgctccgggaaaaccaggattatgtccctggagagcggaccccaatgctggaggatgta gatgaggggggcatccggccctggcacctggtgaagccaagtgatgagccggagccgccc aggcctcggaggggcagccttcggggcggggagcgtgagcgcgccaaggccatccctgag atctacctgacccgcctgctgtccatgaagggcaccctgcagaagttcgtggatgacctg ttccaggtgattctcagcaccagccgccccgtgccgctcgctgtgaagtacttctttgac ctgctggatgagcaggcccagcagcatggcatctccgaccaggacaccatccacatctgg aagaccaacagcacattcatcggccatgcctgccccacccgcagcttgcctctgaggttc tggatcaatataataaaaaacccgcagtttgtgttcgacgtgcaaacatctgataacatg gatgcggtgctccttgtcattgcacagaccttcatggacgcctgcaccctggccgaccac aagctgggccgggactccccgatcaacaaacttctgtatgcacgggacattccccggtac aagcggatggtggaaaggtactatgcagacatcagacagactgtcccagccagcgaccaa gagatgaactctgtcctggctgaactgtcctggaactactccggagacctcggggcgcga gtggccctgcatgaactctacaagtacatcaacaagtactatgaccagctccccagcagg acggcgaggcacagtgctggtggttgtgggggccaaggggtctaccagggcctggagatg atcatcactgccctggaggaggatggcacggcccagaagatgcagctgggctatcggctc cagcagattgcagctgctgtggaaaacaaggtcacagatctatag >gi568815595r:48304489_48524611|GENSCAN_predicted_peptide_5|433_aa MGHLATSLGSSESLALLGSDLRMMGRSPGFAMQHIVGVPHVLVRRGLLGRDLFMTRTLCS PGPSQPGEKRPEEVALGLHHRLPALGRALGHSIQQRATSTAKTWWDRYEEFVGLNEVREA QGKVTEAEKVFMVARGLVREAREDLEVHQAKLKEVRDRLDRVSREDSQYLELATLEHRML QEEKRLRTAYLRAEDSEREKFSLFSAAVRESHEKERTRAERTKNWSLIGSVLGALIGVAG STYVNRVRLQELKALLLEAQKGPVSLQEAIREQASSYSRQQRDLHNLMVDLRGLVHAAGP GQDSGSQAGSPPTRDRDVDVLSAALKEQLSHSRQVHSCLEGLREQLDGLEKTCSQMAGVV QLVKSAAHPGLVEPADGAMPSFLLEQGSMILALSDTEQRLEAQVNRNTIYSTLVTCVTFV ATLPVLYMLFKAS >gi568815595r:48304489_48524611|GENSCAN_predicted_CDS_5|1302_bp atgggtcacctggccacctctttggggtcctcagagtcactggcactcttgggctcagat ctcaggatgatggggcgcagccctgggtttgccatgcagcacatcgtgggtgtgccccac gtactggttcggaggggcctccttggaagggacctcttcatgaccaggactctctgcagc ccaggcccaagccagcccggagagaaaagacctgaggaggtggccctggggctgcaccac cgcctcccagcactgggaagagccctggggcacagcattcagcaacgagcgacctccaca gccaagacttggtgggacagatatgaagagtttgttggactcaacgaggttcgagaggcc cagggaaaggtgacagaggctgagaaagtgttcatggtggctcgagggcttgtccgagag gctcgggaggacttggaagttcaccaggccaagctgaaggaggtgagggaccgcttggac cgtgtctccagggaggacagtcagtacttggaactggctactctcgagcacaggatgctg caggaggagaagaggcttcgcacagcctatctgcgtgcagaagactctgagcgagagaag ttctccctcttctctgcagctgtgcgggaaagtcatgagaaggagcgcacaagggctgag aggaccaagaactggtccctcattggctcagtcctgggggccctgattggtgtggctggc tccacctatgtgaaccgtgtgcgactacaggagctgaaggctttactcctggaggcgcag aaggggcctgtgagtctccaagaggccattcgagaacaggcgtctagctactcccgccag cagagggacctccacaatctcatggtggacctgaggggcctggtacatgctgctgggcca gggcaggactctgggtcacaggcaggtagtcccccgaccagagacagagatgtagatgtc ctttcagctgccttgaaagagcagcttagtcattccaggcaagtccattcatgtctagaa ggcttacgagagcagcttgatggcctagaaaagacttgtagccaaatggctggggtggtt cagcttgtaaagtctgcagcacacccaggcctggtggaaccagcagacggggctatgccc agcttcttgctggagcaggggagcatgatcttggcactgtcagacacggagcagagacta gaagcccaagtcaacaggaacaccatctatagcaccctggtcacctgtgtgacatttgtg gccacactgcctgtgctctacatgctattcaaagccagctaa >gi568815595r:48304489_48524611|GENSCAN_predicted_peptide_6|911_aa MSGREGGKKKPLKQPKKQAKEMDEEDKAFKQKQKEEQKKLEELKAKAAGKGPLGAGHSRR RQAAARGRLVQFSGLAAGKSSSALSDTWGSKRRSEPPAPRPGPPPGTGHPPSKRARGFSA AAAPDPDDPFGAHGDFTADDLEELDTLASQALSQCPAAARDVSSDHKVHRLLDGMSKNPS GKNRETVPIKDNFELEVLQAQYKELKEKLQSLQSELQFKDAEMNELRTKLQTSERANKLA APSVSHVRKNPSVVIKPEACSPQFGKTSFPTKESFSANMSLPHPCQTESGYKPLVGREGS ILINLLLKQPLIPGSSLSLCHLLSSSSESPAGTPLQPPGFGRFQCVFQVLPKCLSPETPL PSVLLAVELLSLLADHDQLAPQLCSHSEGCLLLLLYMYITSRPDRVALETQWLQLEQEVV WLLAKLGVQSPLPPVTGSNCQCNVEVVRALTVMLHRQWLTVRRAGGPPRTDQQRRTVRCL RDTVLLLHGLSQKDKLFMMHCVEVLHQFDQVMPGVSMLIRGLPDVTDCEEAALDDLCAAE TDVEDPELPVTGMMAPVHCATSRPQLWILEGLWGPPGAGEWVWGGTDGGSAAGTYPTMGS QALPPGPMQTLIFFDMEATGLPFSQPKVTELCLLAVHRCALESPPTSQGPPPTVPPPPRV VDKLSLCVAPGKACSPAASEITGLSTAVLAAHGRQCFDDNLANLLLAFLRRQPQPWCLVA HNGDRYDFPLLQAELAMLGLTSALDGAFCVDSITALKALERASSPSEHGPRKSYSLGSIY TRLYGQSPPDSHTAEGDVLALLSICQWRPQALLRWVDAHARPFGTIRPMYGVTASARTKP RPSAVTTTAHLATTRNTSPSLGESRGTKDLPPVKDPGALSREGLLAPLGLLAILTLAVAT LYGLSLATPGE >gi568815595r:48304489_48524611|GENSCAN_predicted_CDS_6|2736_bp atgtccggccgcgaaggtggcaagaagaagccactgaaacagcccaagaagcaggccaag gagatggacgaggaagataaggctttcaagcagaaacaaaaagaggagcagaagaaactc gaggagctaaaagcgaaggccgcggggaaggggcccttgggggcggggcactcgcggcgg aggcaagcggcggcgcgcggacggttggtccagttctccggcctggcggcaggcaagtct agctcggcgctgtcggatacttggggcagcaagaggcggagcgagcccccggcgcctcgc cccggcccgccgccgggcaccgggcaccccccgagcaagcgggcccggggcttctccgca gccgctgccccggaccctgacgacccgttcggcgcgcatggggacttcactgccgacgac ctggaggagcttgacaccctcgcgtcacaggccctgagccaatgtccggccgcggctcgg gacgtgtccagtgatcataaggtccacagattattagatggcatgtcaaaaaatccttca gggaaaaacagagaaactgttccaattaaagataatttcgaattagaggtacttcaggca caatacaaagaacttaaagaaaagctccaatcattgcagtctgaactccagtttaaagat gcagagatgaatgaattaaggacaaagctccagaccagtgaacgagcaaataaactggct gctccctctgtttcccatgtcaggaaaaacccttctgtggttataaagccagaagcatgt tctccacaatttggaaaaacatcttttcctacaaaggagtcttttagtgctaacatgtcc cttccccacccctgccagacggagtcaggatacaagcctctggtgggcagagagggttcc attttgataaacctgctcctgaagcagcctttgatcccagggtcatccctaagcctttgc cacctcctgagtagtagttctgagtctcctgctggcacccccctgcagccaccagggttt ggcaggttccagtgtgtgttccaagtgctgccaaagtgcctcagcccagagacacccctg cctagcgtgctgctggctgttgagctcctctccctgctggcggaccacgaccagctggca cctcagctctgttcccactcagaaggctgcctcctgctgctgctgtacatgtacatcaca tcacggcctgacagagtggccttggagacacaatggctccagctggaacaagaggtggtg tggctcctggctaagcttggtgtgcagagccccttgcccccagtcactggctccaactgc cagtgtaatgtggaggtggtcagagcgctcacggtgatgttgcacagacagtggctgaca gtgcggagggcagggggacccccaaggaccgaccagcagaggcggacagtgcgctgtctg cgggacacggtgctgctgctgcacggcctatcgcagaaggacaagctcttcatgatgcac tgcgtggaggtcctgcatcagtttgaccaggtgatgccgggggtcagcatgctcatccga gggcttcctgatgtgacggactgtgaagaggcagccctggatgacctctgtgccgcggaa accgatgtggaagaccccgagctgcctgtcactggtatgatggccccggtgcattgtgcc accagcaggccacagctgtggatcttggaaggcctctggggtcccccgggagcaggggag tgggtgtgggggggaacggatggtggctcagcagcaggtacgtacccaaccatgggctcg caggccctgcccccggggcccatgcagaccctcatctttttcgacatggaggccactggc ttgcccttctcccagcccaaggtcacggagctgtgcctgctggctgtccacagatgtgcc ctggagagcccccccacctctcaggggccacctcccacagttcctccaccaccgcgtgtg gtagacaagctctccctgtgtgtggctccggggaaggcctgcagccctgcagccagcgag atcacaggtctgagcacagctgtgctggcagcgcatgggcgtcaatgttttgatgacaac ctggccaacctgctcctagccttcctgcggcgccagccacagccctggtgcctggtggca cacaatggtgaccgctacgacttccccctgctccaagcagagctggctatgctgggcctc accagtgctctggatggtgccttctgtgtggatagcatcactgcgctgaaggccctggag cgagcaagcagcccctcagaacacggcccaaggaagagctatagcctaggcagcatctac actcgcctgtatgggcagtcccctccagactcgcacacggctgagggtgatgtcctggcc ctgctcagcatctgtcagtggagaccacaggccctgctgcggtgggtggatgctcacgcc aggcctttcggcaccatcaggcccatgtatggggtcacagcctctgctaggaccaagcca agaccatctgctgtcacaaccactgcacacctggccacaaccaggaacactagtcccagc cttggagagagcaggggtaccaaggatcttcctccagtgaaggaccctggagccctatcc agggaggggctgctggccccactgggtctgctggccatcctgaccttggcagtagccaca ctgtatggactatccctggccacacctggggagtag >gi568815595r:48304489_48524611|GENSCAN_predicted_peptide_7|282_aa MTAPVPAPRILLPLLLLLLLTPPPGARGEVCMASRGLSLFPESCPDFCCGTCDDQYCCSD VLKKFVWSEESVPASVEPVEQLGSALRFRPGYNDPMSGCACGTCSVSPAAAVVCIALGLV CRVLGVPCCLLPQAILPMHREEQLFGRFGATLAVGLTIFVLSVVTIIICFTCSCCCLYKT CRRPRPVVTTTTSTTVVHAPYPQPPSVPPSYPGPSYQGYHTMPPQPGMPAAPYPMQYPPP YPAQPMGPPAYHETLAGGAAAPYPASQPPYNPAYMDAPKAAL >gi568815595r:48304489_48524611|GENSCAN_predicted_CDS_7|849_bp atgactgcgccggtccccgcgccgcggatcctgttgccgttgctgttgctgctgctgcta acgccgcctccgggtgcacgtggtgaggtgtgtatggcttcccgtggactcagcctcttc cccgagtcctgtccagatttctgctgtggtacctgtgatgaccaatactgctgctctgac gtgctgaagaaatttgtgtggagcgaggaaagcgtgcctgccagtgtagagccggtggag cagctgggctcggcgctgaggtttcgccctggctacaacgaccccatgtcagggtgtgca tgtgggacctgttctgttagcccagctgccgccgtggtgtgcatagcactgggcctcgtg tgccgtgtgcttggcgtgccctgctgcctcctgccccaggctattttgcccatgcatcga gaagagcagctctttggcaggttcggagcgaccttggccgttggcctgaccatctttgtg ctgtctgtcgtcactatcatcatctgcttcacctgctcctgctgctgcctttacaagacg tgccgccgaccacgtccggttgtcaccaccaccacatccaccactgtggtgcatgcccct tatcctcagcctccaagtgtgccgcccagctaccctggaccaagctaccagggctaccac accatgccgcctcagccagggatgccagcagcaccctacccaatgcagtacccaccacct tacccagcccagcccatgggcccaccggcctaccacgagaccctggctggaggagcagcc gcgccctaccccgccagccagcctccttacaacccggcctacatggatgccccgaaggcg gccctctga >gi568815595r:48304489_48524611|GENSCAN_predicted_peptide_8|65_aa MEHPIANAKALSVAMPSQKTQKRLVGLEKRDEEEEDQAWTSTAFSSNQADDKSDTPISLG ICYHS >gi568815595r:48304489_48524611|GENSCAN_predicted_CDS_8|198_bp atggagcatcccatagccaatgcaaaggctctgagtgtggctatgcccagtcaaaaaaca cagaagaggctggtggggctggagaagagagacgaggaggaggaagaccaggcctggact tccactgcttttagctccaaccaggcagatgacaagtcggacacccccatttccctgggc atctgctaccattcctga >gi568815595r:48304489_48524611|GENSCAN_predicted_peptide_9|156_aa SYEDLVQRLEPVIMELERQENVLVICHQAVMRCLLAYFLDKAAEQLPYLKCPLHTVLKLT PVAYGCKVESIFLNVAAVNTHRDRPQVCVGILRSASTPMFTLTEEELCCPGCVRAEDGCR WAASGWFGVASSYVSAFSNYAKVSEGEATLQDSGGD >gi568815595r:48304489_48524611|GENSCAN_predicted_CDS_9|471_bp tcctacgaggacctggtccagagactggagcctgtcatcatggagctggagaggcaagag aatgtgctggtcatctgccaccaggctgtgatgcgctgcctgctggcctacttcctcgac aaggcagcagaacagctgccctacctcaagtgtccgctgcacacagtcctgaagctgact cctgtggcatatggttgtaaagtggagtccatattcctgaacgtggctgctgtgaacacg caccgggacaggcctcaggtctgtgttgggatccttcgttcagcttccacccccatgttc actctcactgaagaggaattatgctgccctggatgtgtcagggcagaggatggctgtcgc tgggctgcttctgggtggtttggggtggccagcagctatgtttctgccttctccaactat gccaaggtgtctgagggagaggcaactttgcaggattcaggtggtgactaa