GENSCAN 1.0	Date run: 7-Nov-116	Time: 20:38:21

Sequence gi568815596r:31235961_31514666 : 278706 bp : 42.94% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   6005   6366  362  1  2   82    6   187 0.438   5.71
 1.02 Intr +   8251   8490  240  1  0   91  109   318 0.873  29.94
 1.03 Intr +  13411  13508   98  1  2   89   86   128 0.588  11.43
 1.04 Intr +  13805  13942  138  0  0   24   72    86 0.029   0.11
 1.05 Intr +  15080  15277  198  0  0   33   89    79 0.025   0.90
 1.06 Intr +  17571  17771  201  1  0   44   49   108 0.017   0.84
 1.07 Intr +  24550  24962  413  2  2  101  100   580 0.738  53.48
 1.08 Intr +  25589  25753  165  1  0   63   74   226 0.982  17.94
 1.09 Term +  30217  30744  528  0  0  111   45   645 0.969  55.77
 1.10 PlyA +  33308  33313    6                               1.05

 2.00 Prom +  33686  33725   40                             -10.65
 2.01 Init +  35527  35574   48  0  0   77  119    46 0.731   7.60
 2.02 Intr +  36370  36443   74  0  2   61   77    58 0.397  -0.71
 2.03 Intr +  43188  43318  131  0  2   37  105   141 0.080  10.12
 2.04 Intr +  45354  45535  182  1  2   45   75    78 0.007   0.87
 2.05 Intr +  46427  46551  125  1  2   17   89    70 0.004  -1.54
 2.06 Intr +  47131  47340  210  1  0  120   92    -7 0.016   0.01
 2.07 Intr +  48265  48363   99  1  0   76   61    94 0.022   3.81
 2.08 Term +  55216  55585  370  1  1   13   54   320 0.030  14.23
 2.09 PlyA +  56245  56250    6                               1.05

 3.00 Prom +  59014  59053   40                              -3.35
 3.01 Sngl +  59321  59521  201  1  0   67   49   178 0.850   6.74
 3.02 PlyA +  62623  62628    6                               1.05

 4.00 Prom +  62951  62990   40                              -5.55
 4.01 Init +  63391  63444   54  0  0   59   31   132 0.213   5.93
 4.02 Intr +  67026  67347  322  2  1   75  107   121 0.058   7.21
 4.03 Intr +  76714  76778   65  2  2   49   85    68 0.019   0.12
 4.04 Intr +  82933  83066  134  0  2  110   50   118 0.554   8.72
 4.05 Intr +  92790  92835   46  0  1   53   96    47 0.085  -0.51
 4.06 Intr +  93957  94086  130  2  1   16   65    75 0.162  -2.65
 4.07 Intr +  94764  94877  114  1  0   40   99    59 0.216   1.70
 4.08 Intr +  97130  97272  143  0  2   97   49    82 0.303   4.35
 4.09 Term +  97812  97955  144  2  0  104   42    86 0.704   2.53
 4.10 PlyA +  98548  98553    6                               1.05

 5.39 PlyA -  98911  98906    6                               1.05
 5.38 Term - 100825 100562  264  1  0   70   48    99 0.284  -1.38
 5.37 Intr - 101857 101681  177  1  0  106   64   350 0.903  33.59
 5.36 Intr - 103717 103529  189  1  0  105   76   206 0.999  19.96
 5.35 Intr - 105434 105369   66  2  0   96   94   109 0.963  10.48
 5.34 Intr - 106337 106223  115  1  1   99  116    71 0.996  10.53
 5.33 Intr - 110982 110730  253  1  1   12   29   241 0.213   6.17
 5.32 Intr - 111690 111492  199  0  1   98   35   189 0.667  12.60
 5.31 Intr - 112403 112308   96  2  0   98   99    64 0.992   7.79
 5.30 Intr - 114263 114072  192  2  0   97   99   190 0.972  19.77
 5.29 Intr - 128284 128198   87  1  0  115   65    92 0.519   8.85
 5.28 Intr - 129584 129497   88  1  1   68   13    70 0.786  -3.45
 5.27 Intr - 130149 130016  134  0  2   84   78   185 0.992  15.62
 5.26 Intr - 131034 130910  125  1  2   74   76    83 0.801   5.08
 5.25 Intr - 132097 132001   97  1  1   83   80    77 0.988   5.06
 5.24 Intr - 132700 132581  120  1  0   94   78   131 0.999  12.47
 5.23 Intr - 134518 134395  124  0  1  106  115    98 0.999  13.97
 5.22 Intr - 136437 136268  170  0  2   88   89   258 0.994  23.72
 5.21 Intr - 137996 137913   84  2  0  120   89    73 0.726   9.70
 5.20 Intr - 139594 139420  175  0  1   91   86   153 0.999  14.42
 5.19 Intr - 141277 141093  185  1  2   49   71   123 0.466   4.36
 5.18 Intr - 142493 142438   56  0  2   87   70    -1 0.123  -4.22
 5.17 Intr - 144016 143907  110  0  2  100   78   104 0.921   9.61
 5.16 Intr - 145766 145441  326  2  2   87   36   252 0.924  13.55
 5.15 Intr - 147192 147041  152  1  2   57  101   175 0.993  14.66
 5.14 Intr - 147887 147795   93  0  0  101  107    24 0.672   4.62
 5.13 Intr - 150595 150454  142  1  1  -17   87   233 0.911  11.71
 5.12 Intr - 151937 151851   87  2  0  135   82    -7 0.805   2.65
 5.11 Intr - 152335 152267   69  1  0   80   89    29 0.416   0.66
 5.10 Intr - 160059 159914  146  1  2   83   77    90 0.686   6.48
 5.09 Intr - 161526 161367  160  0  1   28   53   109 0.877   0.04
 5.08 Intr - 161769 161708   62  1  2   95  110    80 0.582   8.43
 5.07 Intr - 162739 162613  127  1  1   80   86   152 0.601  13.53
 5.06 Intr - 165312 165260   53  1  2   73   97    50 0.272   2.11
 5.05 Intr - 168361 168272   90  2  0   80   69    58 0.168   2.15
 5.04 Intr - 169001 168932   70  2  1   86   44    66 0.307  -0.16
 5.03 Intr - 179059 178943  117  1  0  139   64    29 0.136   5.34
 5.02 Intr - 191545 191476   70  0  1   63   68    65 0.044   0.27
 5.01 Init - 193592 193525   68  2  2   84   80    77 0.890   7.10
 5.00 Prom - 195913 195874   40                              -5.05

 6.07 PlyA - 197610 197605    6                               1.05
 6.06 Term - 201730 200640 1091  2  2   67   36   309 0.011  14.86
 6.05 Intr - 211330 211231  100  1  1   77   84    48 0.213   2.06
 6.04 Intr - 214209 214042  168  0  0   86   62    92 0.533   5.62
 6.03 Intr - 231782 231703   80  0  2  106   57    54 0.074   2.45
 6.02 Intr - 231944 231874   71  1  2  112   32    46 0.045  -0.69
 6.01 Init - 236751 236636  116  0  2   93   70    90 0.265   7.43
 6.00 Prom - 239556 239517   40                              -4.75

 7.02 PlyA - 239910 239905    6                               1.05
 7.01 Sngl - 246117 245752  366  0  0   76   41   189 0.510   8.84
 7.00 Prom - 247063 247024   40                              -4.95

 8.00 Prom + 250994 251033   40                              -8.55
 8.01 Sngl + 254257 254553  297  0  0   33   54   279 0.886  14.49
 8.02 PlyA + 255045 255050    6                               1.05

 9.00 Prom + 255284 255323   40                              -4.75
 9.01 Sngl + 258518 259108  591  1  0   66   35   223 0.437  10.74
 9.02 PlyA + 259825 259830    6                               1.05

10.00 Prom + 259833 259872   40                              -3.85
10.01 Sngl + 261377 262393 1017  1  0   73   43   717 0.996  62.47
10.02 PlyA + 262662 262667    6                              -0.45

11.00 Prom + 262790 262829   40                              -6.15
11.01 Sngl + 264232 265989 1758  0  0   42   41   437 0.870  28.82
11.02 PlyA + 267295 267300    6                               1.05

12.02 PlyA - 268252 268247    6                               1.05
12.01 Sngl - 275941 275657  285  1  0   61   45   219 0.965  10.19

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr -  19358  19286   73  1  1   88   75    63 0.851   3.49
S.002 Sngl +  55295  55585  291  1  0   56   54   268 0.918  15.60
S.003 Term - 157348 157190  159  1  0   30   32   174 0.934   2.96
S.004 Term - 190592 190403  190  1  1  133   55    81 0.846   5.24
S.005 Sngl - 201959 200640 1320  2  0   42   36   371 0.906  23.12

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596r:31235961_31514666|GENSCAN_predicted_peptide_1|780_aa
MRGFPRQPETHLCPVWRVAFGQEGSNSQQISSFLQGWLPRLLLGDGAHSNLTLTTMELQS
KFTAQNAAQRHKWDVDEWEEALASLPGHHPSVQTRPAAAVHLCTLQHPSWLQSMHTLWGG
VRHAVLIFAQNACIRTVLLPARYLLEQDFPGMRIGPEPTTDSFIAVMQGDMEGIIPGNAL
VVDPKKPFRKLNAFGNAFLNRFVCAQLPNPVLESISVIDTPGILSGEKQRISRGAPSMAL
ILACFLSSWGACSSAGQPNAYREIQLNGISALGHKRREPDPSEERSQPHGVTGQCSCSCV
LTRPKWCRPLAQVQATSARPGLLSSAPVSCLGVLSAESSSLLMRADKKGHGYQVGTEAPT
GHIRLPVLQSELLPRAPRPQASQEGALGPQAGLGRQRVVLAQGVDPLLGLGRGYDFAAVL
EWFAERVDRIILLFDAHKLDISDEFSEVIKALKNHEDKMRVVLNKADQIETQQLMRVYGA
LMWSLGKIVNTPEVIRVYIGSFWSHPLLIPDNRKLFEAEEQDLFRDIQSLPRNAALRKLN
DLIKRARLAKVHAYIISSLKKEMPSVFGKDNKKKELVNNLAEIYGRIEREHQISPGDFPN
LKRMQDQLQAQDFSKFQPLKSKLLEVVDDMLAHDIAQLMVLVRQEESQRPIQMVKGGAFE
GTLHGPFGHGYGEGAGEGIDDAEWVVARDKPMYDEIFYTLSPVDGKITGANAKKEMVRSK
LPNSVLGKIWKLADIDKDGMLDDDEFALANHLIKVKLEGHELPNELPAHLLPPSKRKVAE

>gi568815596r:31235961_31514666|GENSCAN_predicted_CDS_1|2343_bp
atgaggggcttcccaagacaacctgaaacccatctgtgccctgtgtggcgggtggccttt
ggacaggagggctccaactctcagcagatcagcagcttcctccagggctggctgccgagg
ctcctccttggggacggagcccacagcaacctgactttaactaccatggaactgcagagt
aaattcacagctcagaacgctgctcaaaggcacaaatgggatgttgatgagtgggaggaa
gccctggcaagcctccctggccaccacccctctgtgcagaccaggcctgctgctgcagtc
cacctgtgcacattacaacatccttcatggttacaaagcatgcacacgttgtggggaggg
gttagacatgccgtgttgatctttgcgcagaacgcctgcattaggactgtgcttcttcct
gctaggtacctgctggaacaggacttcccaggcatgaggattgggcctgagcccaccaca
gactccttcattgcggtgatgcagggagacatggaggggatcatccctgggaacgccctg
gtggtggatcccaagaaacccttcaggaaactcaacgcctttggcaacgccttcttgaac
aggttcgtgtgtgcccagctacctaaccctgtgctggagagcatcagcgtcatcgacaca
ccagggatcctctctggggagaagcagaggatcagccggggggcaccttccatggccctg
atcctggcttgcttcttaagttcctggggagcttgcagctctgcagggcagccaaatgca
tacagagaaattcagctcaatggtatcagtgccctgggtcataagaggagggaaccagac
ccctcagaggagaggtcacagcctcatggtgtgacaggccagtgctcctgcagttgtgta
ttgacgaggcccaagtggtgcaggcccttggcccaggtgcaggccactagtgccagacct
gggcttctgtccagtgctcctgtgagctgccttggtgtcctctctgcagaaagcagctct
cttctgatgagggcagataaaaaggggcatggttatcaagtcggcacagaggcccctact
ggccatattagactgcctgttctgcaaagtgagctgcttcccagggctccaagacctcag
gcttcacaggaaggagcactcgggccccaggctggactagggcggcaaagggtcgtcctt
gctcagggtgtggaccctctactgggcctggggcgggggtatgactttgcagctgtcctt
gagtggtttgccgagcgggttgaccgcatcattctgctcttcgatgcccacaaactggac
atctctgatgagttctcagaagtcatcaaagccctcaagaaccacgaggacaagatgcga
gtggtgctgaacaaagctgaccagatcgagacgcagcagctgatgcgggtgtacggggcc
ctcatgtggtccttggggaagatcgtgaacaccccagaggtgatccgggtctacatcggc
tccttctggtcccaccccctcctcatccctgacaaccggaagctctttgaggctgaggaa
caggacctattcagggacatccagagtctgccccgaaatgctgccctgcgcaagctcaac
gacctcatcaaaagggccaggctggccaaggtccacgcctacatcatcagctctctgaag
aaggagatgccctcggtgttcgggaaggacaacaagaagaaggagctggtcaacaacctg
gccgagatctatggccggatcgagcgggagcaccagatctcacctggggacttccccaat
ctgaagaggatgcaggaccagctgcaggcccaggactttagcaagttccagccgctgaag
agcaagctgctggaggtagtggacgacatgctggcccatgacattgcccagctcatggtg
ctagtgcgccaggaggagtcacagcggcccatccagatggtgaagggcggagcgttcgag
ggcaccctgcacggcccctttgggcatggctatggggagggggctggagaaggtatcgat
gatgctgagtgggtggtggccagggacaagcccatgtacgacgagatcttctacaccctg
tcaccggtggatggcaagatcacaggcgctaatgccaagaaggagatggtgcgctccaag
ctgcccaacagtgtgctgggcaagatctggaagctggccgacattgacaaggatggcatg
ctggacgacgacgagtttgcactggccaaccacctcatcaaagtcaagctggaggggcac
gagctgcccaacgagctgcctgcccacctcctgcccccgtccaagaggaaagttgccgag
tga

>gi568815596r:31235961_31514666|GENSCAN_predicted_peptide_2|412_aa
MGLLPDIAYTIMTRDLLPILAQEAGSLVCEVGETMATLVAPLIIGDLLVFGAGRVSTGPA
SGVLRAATYKASSLARLCRSHTRAGHNCSVMTCVIRNLLITYWGIKVAQSQVRIRAFSTQ
CKAKNTSPKSKMGYFQKLSNEFLRKLAIKEPVKTHQNQDGDESDLWLSSLLHSHQYHDSL
QMPWQHHLGQAITLLDLSPDKYFVKPRPQAGLLLWLKTLSQMLQNPLKPTHRNSQNLPLL
QGPPKHLLTVFLTLPNSQELETKRHLEKIKSEDQWMAPGAAAISPVTLSRKRENRVVPLA
TYKQIYKKGHIIDIKEMGTVQKGRPYKCYHGNTGRTYNATQRVVGIIINKQVKAKILAKR
TNVHIEHIKHSESQDSFLKHVKKNGQKNKEAKEKVTWIQLKCQPAPLEKHTL

>gi568815596r:31235961_31514666|GENSCAN_predicted_CDS_2|1239_bp
atggggctgttacctgacatagcctataccatcatgaccagagacctgcttcctatactt
gctcaagaagctggaagtcttgtttgtgaggttggagaaacaatggcaacactggtagca
ccgctaatcataggcgatctgctggtgtttggcgctggacgtgtcagcacaggcccagca
agtggggtcctaagggcagcaacctacaaagcctccagccttgcccggctgtgccggtca
cacaccagagctggtcataactgctctgtgatgacctgtgtgattaggaacttgttgatc
acctactgggggataaaggtggcacaaagtcaggtgaggataagagccttctcaacccaa
tgtaaggccaagaacacttctcccaagtcaaaaatgggatatttccaaaagctgagtaat
gaattcttgcgtaagcttgcaataaaggagccagtcaaaacccaccaaaaccaagatggc
gacgagagtgacctctggttgtcttcactgctacactcccaccagtatcatgacagttta
caaatgccatggcaacatcacttaggtcaggccatcactctgttagacctcagccctgac
aaatactttgtaaagccaaggccacaggctggtcttctgctttggctcaaaacactttcc
caaatgctacaaaatcccctaaagcctacccacagaaattcccagaacttgcctctcctc
caaggtcctcccaagcatcttctgactgtgtttcttactcttcctaacagtcaggagtta
gaaacgaaaaggcatcttgaaaagataaaatcagaggaccagtggatggctcctggcgct
gctgctattagccctgtgactctgagtagaaaaagagaaaacagagttgttcctttggcc
acatacaagcaaatctataagaaaggccatattatagacatcaaggaaatgggtactgtt
cagaaaggaaggccctacaagtgttaccatggcaatactggaagaacctacaatgccacc
cagcgtgttgttggcattattataaataaacaagttaaggccaagattcttgccaagaga
accaatgtgcatattgagcacattaagcactctgagagccaagatagcttcttgaagcat
gtgaagaaaaatggtcagaaaaacaaggaagccaaagagaaagttacctggattcagctg
aagtgccagcctgctccactagagaagcacactttgtga

>gi568815596r:31235961_31514666|GENSCAN_predicted_peptide_3|66_aa
MALNGNRHSWFCTQKLAFSPAKPPILYSYKPQTPGSRSRRADKQMNRRAEEQQRRTEEKE
RQEEFH

>gi568815596r:31235961_31514666|GENSCAN_predicted_CDS_3|201_bp
atggccctaaatgggaacaggcattcctggttttgcacccaaaaacttgccttttcacct
gccaagccccctatcctgtactcatataaaccccagaccccaggctccagaagcagacga
gcagacaagcagatgaacagaagagcagaagagcagcagagaaggacagaagagaaggaa
cgtcaggaggagttccactaa
>gi568815596r:31235961_31514666|GENSCAN_predicted_peptide_4|383_aa
MGDDDDDDDVTKPLLLLQAPQGQQWSSSEFYPPPLWKERKGSQTQWILSRLAGSCQSQPV
SLLCKAHSVCPCSRDRPAIRSFPDPILAALRAKIKSADSDVCFLFFSHLCANSRITGTAL
GCAQTGGLATHGPTSEGSVGRRAEPSLGALDSPPTHRGNHRAQQPSADCTSPEKFLEVQE
WLLYVDSLSHLRSYSPPLTNEETEAQRLFLLDLIINLLEKQLCQTGLIIRHILSCDQPSS
IKDSARTFQIVSYRELRAGPPTQPNVLGKSHSREKLRHANQSFVPKGAASQPPSSGVRGV
SSAALMTTQYPGRASKRILSLPINETVGILVPPSTSVPGDTAGSVDCIPCATPKKTKTSH
CIDWKHFKTTGQGKVSQTDDASE

>gi568815596r:31235961_31514666|GENSCAN_predicted_CDS_4|1152_bp
atgggggatgatgatgatgatgacgatgttaccaaacccctgctcctactacaagctcct
cagggccagcagtggagttcttctgagttctaccccccacccctgtggaaagaaagaaaa
ggaagccaaacccagtggattctgtcccgactggctggttcctgtcagtctcagcctgtc
agcctcctttgtaaagctcacagcgtctgcccttgttccagggacaggcctgccattagg
agctttccagatccaatcctggctgctctgagagcaaaaattaaatcagcagacagtgat
gtctgctttctgttcttttcacacctgtgtgccaacagcaggattactggcacagctcta
ggctgtgcccagactggaggtctggctacccatggccccacatccgaaggctcagtgggt
agaagagctgagccttctctgggggcccttgactcaccaccaacccacaggggaaaccac
agagcacagcagcccagcgccgactgcacctccccggaaaagttcctggaggttcaagaa
tggctcctctatgttgactccctgtctcatctcaggtcttattctcctccacttacaaat
gaggaaactgaggctcagaggctgtttctattggacctgatcattaacctcctggagaaa
cagctatgccaaactggattgatcatcagacacatcctaagctgtgaccagccatcatca
attaaagactctgctagaaccttccagatagtcagctatagggagctgagagccggccct
cctacccagcccaacgtgctgggaaaatcacattcacgtgagaagctgagacatgctaat
caatcatttgtgccaaaaggagcagcctcccagccacctagctctggagtaaggggagtc
tcatcagctgccctgatgacaactcagtaccctggccgtgccagcaagagaatcctcagc
ctgcctataaatgaaacggtgggcatcctggtgcctccatccacctccgttccaggcgat
accgcaggcagtgttgactgcattccttgtgcaacaccaaaaaagaccaagacatctcac
tgtatagactggaagcactttaagacaacagggcaaggcaaggtgtctcagacggacgat
gcctctgaatag

>gi568815596r:31235961_31514666|GENSCAN_predicted_peptide_5|1645_aa
MAKRKKKEKSDEENKHKTTFSSGIPAIPMLDKWEYPVAWTQLAQKWQVGGRSRFHKSSAQ
DPQSPSAKSTTLLISGCSFLCSLTQRGCLVGLSSDIFGDSKDYYNTHRSATIRLQVPPKL
FDLLRAAYILTRTQKTFPVTTVEGIGSTKTRLHPVQERIAKSHGSQCGFCTPGIVMSMYT
LLRNQPEPTMEEIENAFQGNLCRCTGYRPILQGFRTFARGRVLMNSMGLSASDGLETGFL
EEAHRLRVPLSLAVPLGLAAFPWRLAGSAHGTAHTRKRVEGQLEVGTLLTARSDHSYLVT
RKEMEEPLNRYKQDFQFLVWHDGGCCGGDGNNPNCCMNQKKDHSVSLSPSLFKPEEFTPL
DPTQEPIFPPELLRLKDTPRKQLRFEGERVTWIQASTLKELLDLKAQHPDAKLVVGNTEI
GIEMKFKNMLFPMIVCPAWIPELNSVEHGPDGISFGAACPLSIVEKTLVDAVAKLPAQKT
EVFRGVLEQLRWFAGKQVKSVASVGGNIITASPISDLNPVFMASGAKLTLVSRGELPEAE
VQEEGLGGEKSHQIGHFLEKQSSKFSDPAFPVIWIQGSMENVPGLVDGKAGYFLCREFTA
GTATCMPYLVGHQENCPDGPHLLPWLQKDPAEPGGDTALHRDPLQQGDLLSLNLLILVFI
RICGARGEYFSAFKQASRREDDIAKVTSGMRVLFKPGTTEVQELALCYGGMANRTISALK
TTQRQLSKLWKEELLQDVCAGLAEELHLPPDAPGGMVDFRCTLTLSFFFKFYLTVLQKLG
QENLEDKCGKLDPTFASATLLFQKDPPADVQLFQEVPKGQSEEDMVGRPLPHLAADMQAS
GEAVYCDDIPRYENELSLRLVTSTRAHAKIKSIDTSEAKKVPGFVCFISADDVPGSNITG
ICNDETVFAKDKVTCVGHIIGAVVADTPEHTQRAAQGVKITYEELPAIITIEDAIKNNSF
YGPELKIEKGDLKKGFSEADNVVSGEIYIGGQEHFYLETHCTIAVPKGEAGEMELFVSTQ
NTMKTQSFVAKMLGVPANRIVVRVKRMGGGFGGKETRSTVVSTAVALAAYKTGRPVRCML
DRDEDMLITGGRHPFLARYKVGFMKTGTVVALEVDHFSNVGNTQDLSQSIMERALFHMDN
CYKIPNIRGTGRLCKTNLPSNTAFRGFGGPQGMLIAECWMSEVAVTCGMPAEEAGALLHV
YTDGSVLLTHGGTEMGQGLHTKMVQVASRALKIPTSKIYISETSTNTVPNTSPTAASVSA
DLNGQAVYVRSPWDPAEQRASPELWAWRVSLALLGGSLAGALGFCLGYRMQRTDVSGVWG
VFTLGGLSDHLEKAGTLQEEESQWLLGRLGKSELIQVTFAKEREIDSRASVPNPNKTPNL
GYSFETNSGNPFHYFSYGVACSEVEIDCLTGDHKNLRTDIVMDVGSSLNPAIDIGQVEGA
FVQGLGLFTLEELHYSPEGSLHTRGPSTYKIPAFGSIPIEFRVSLLRDCPNKKAIYASKA
VGEPPLFLAASIFFAIKDAIRAARAQHTGNNVKELFRLDSPATPEKIRNACVDKFTTLVK
WVHVVPSGEGTEEPQGQTNPPSADLNTVGEGGIWVPIQVFLGSTKSFSYGVRTTQTVWHS
QAVKMPTSVQQPTHSPEPSHYGSQC

>gi568815596r:31235961_31514666|GENSCAN_predicted_CDS_5|4938_bp
atggcgaagaggaagaaaaaggagaagagtgatgaggagaataagcacaagacgactttc
tcttccggcattccagctattccaatgctggacaagtgggaatatcctgtagcatggacc
caacttgctcagaaatggcaagtgggaggacgaagcaggtttcataagagcagcgcccaa
gacccacagtcgcctagtgccaagtcaacaaccttactgatatctggatgtagtttcctg
tgctctttgacacagcgggggtgtctagtgggtctttccagtgacatatttggggacagc
aaagattattataacacgcacagaagtgccaccattcgtcttcaggtgcccccaaagttg
tttgacctgctgagagctgcatatatcctgactaggacacagaaaaccttcccagtgaca
actgtggaaggaataggaagcaccaagacgaggctgcatcctgtgcaggagagaattgcc
aaaagccacggctcccagtgcgggttctgcacccctggcatcgtcatgagtatgtacaca
ctgctccggaatcagcccgagcccaccatggaggagattgagaatgccttccaaggaaat
ctgtgccgctgcacaggctacagacccatcctccagggcttccggacctttgccagggga
agggttcttatgaatagcatgggactttctgccagtgatggtctggagaccggcttcctg
gaagaggctcaccgcctgagagtgcctttgagcttggcagtgcctttgggcttggcagct
ttcccttggagactggcaggctcagcccatggcacagcacacactagaaagcgtgtggag
ggtcaactagaggtgggcacactgctcacggcgagaagtgaccacagctatctggtgaca
aggaaagaaatggaggaaccattaaacaggtacaagcaagacttccagtttctggtctgg
catgatggtggatgctgtggaggagatgggaataatccaaattgctgcatgaaccagaag
aaagaccactcagtcagcctctcgccatctttattcaaaccagaggagttcacgcccctg
gatccaacccaggagcccatttttcccccagagttgctgaggctgaaagacactcctcgg
aagcagctgcgatttgaaggggagcgtgtgacgtggatacaggcctcaaccctcaaggag
ctgctggacctcaaggctcagcaccctgacgccaagctggtcgtggggaacacggagatt
ggcattgagatgaagttcaagaatatgctgtttcctatgattgtctgcccagcctggatc
cctgagctgaattcggtagaacatggacccgacggtatctcctttggagctgcttgcccc
ctgagcattgtggaaaaaaccctggtggatgctgttgctaagcttcctgcccaaaagaca
gaggtgttcagaggggtcctggagcagctgcgctggtttgctgggaagcaagtcaagtct
gtggcgtccgttggagggaacatcatcactgccagccccatctccgacctcaaccccgtg
ttcatggccagtggggccaagctgacacttgtgtccagaggtgagctgcctgaagcagag
gtccaggaagaaggacttgggggagaaaagtctcaccagataggtcattttctggagaaa
cagagctcaaagttcagtgacccagctttccctgtgatttggatccagggaagcatggag
aatgtacctggtttggtggacgggaaagctggatattttctctgtagagaattcacagca
ggaactgctacctgcatgccttacctagtggggcaccaggagaactgtccagatggacca
caccttcttccctggctacagaaagaccctgctgagcccggaggagatactgctctccat
agagatcccctacagcagggagatctcctctccctaaacctgctaattttggtgttcatc
agaatttgtggtgccagaggggagtatttctcagcattcaagcaggcctcccggagagaa
gatgacattgccaaggtaaccagtggcatgagagttttattcaagccaggaaccacagag
gtacaggagctggccctttgctatggtggaatggccaacagaaccatctcagccctcaag
accactcagaggcagctttccaagctctggaaggaggagctgctgcaggacgtgtgtgca
ggactggcagaggagctgcatctgcctcccgatgcccctggtggcatggtggacttccgg
tgcaccctcaccctcagcttcttcttcaagttctacctgacagtccttcagaagctgggc
caagagaacctggaagacaagtgtggtaaactggaccccactttcgccagtgcaacttta
ctgtttcagaaagaccccccagccgatgtccagctcttccaagaggtgcccaagggtcag
tctgaggaggacatggtgggccggcccctgccccacctggcagcggacatgcaggcctct
ggtgaggccgtgtactgtgacgacattcctcgctacgagaatgagctgtctctccggctg
gtcaccagcacccgggcccacgccaagatcaagtccatagatacatcagaagctaagaag
gttccagggtttgtttgtttcatttccgctgatgatgttcctgggagtaacataactgga
atttgtaatgatgagacagtctttgcgaaggataaggttacttgtgttgggcatatcatt
ggtgctgtggttgctgacaccccggaacacacacagagagctgcccaaggggtgaaaatc
acctatgaagaactaccagccattatcacaattgaggatgctataaagaacaactccttt
tatggacctgagctgaagatcgagaaaggggacctaaagaaggggttttccgaagcagat
aatgttgtgtcaggggagatatacatcggtggccaagagcacttctacctggagactcac
tgcaccattgctgttccaaaaggcgaggcaggggagatggagctctttgtgtctacacag
aacaccatgaagacccagagctttgttgcaaaaatgttgggggttccagcaaaccggatt
gtggttcgagtgaagagaatgggaggaggctttggaggcaaggagacccggagcactgtg
gtgtccacggcagtggccctggctgcatataagaccggccgccctgtgcgatgcatgctg
gaccgtgatgaggacatgctgataactggtggcagacatcccttcctggccagatacaag
gttggcttcatgaagactgggacagttgtggctcttgaggtggaccacttcagcaatgtg
gggaacacccaggatctctctcagagtattatggaacgagctttattccacatggacaac
tgctataaaatccccaacatccggggcactgggcggctgtgcaaaaccaaccttccctcc
aacacggccttccggggctttggggggccccaggggatgctcattgccgagtgctggatg
agtgaagttgcagtgacctgtgggatgcctgcagaggaggcaggagccctacttcatgtg
tacacagatggctctgtgctgctgacccacggggggactgagatgggccaaggccttcat
accaaaatggtccaggtggccagtagagctctgaaaatccccacctctaagatttatatc
agcgagacaagcactaacactgtgcccaacacctctcccacggctgcctctgtcagcgct
gacctcaatggacaggccgtctatgtaaggagcccatgggatcccgcagagcagagggcc
agcccagagctgtgggcctggagagtctccctggcattgcttggaggtagccttgctggg
gccctcgggttttgcctgggctacagaatgcagaggactgatgtgtctggggtgtggggg
gtgtttactttaggcggcttgtcagaccatcttgaaaaggctggaaccctacaagaagaa
gaatcccagtggctcctgggaagactgggtaagtctgagctgatccaagtcacgtttgca
aaggagagggaaattgacagcagggcctccgttccgaatccgaacaaaacacccaatctg
ggctacagctttgagactaactcagggaaccccttccactacttcagctatggggtggct
tgctctgaagtagaaatcgactgcctaacaggagatcataagaacctccgcacagatatt
gtcatggatgttggctccagtctaaaccctgccattgatattggacaggtggaaggggca
tttgtccagggccttggcctcttcaccctagaggagctacactattcccccgaggggagc
ctgcacacccgtggccctagcacctacaagatcccggcatttggcagcatccccattgag
ttcagggtgtccctgctccgcgactgccccaacaagaaggccatctatgcatcgaaggct
gttggagagccgcccctcttcctggctgcttctatcttctttgccatcaaagatgccatc
cgtgcagctcgagctcagcacacaggtaataacgtgaaggaactcttccggctagacagc
cctgccaccccggagaagatccgcaatgcctgcgtggacaagttcaccaccctggtaaag
tgggtacatgtggtcccaagtggggagggcacagaggaaccccaggggcagacaaatcca
ccatccgcagacctcaacacagtgggtgaaggaggcatatgggtccccattcaggtcttc
cttgggtctaccaagagctttagctatggagtgagaactacacagaccgtgtggcactcc
caggcagtaaagatgcccacatctgtgcagcaacctacccacagcccagaaccttcccac
tatggctcccaatgctga

>gi568815596r:31235961_31514666|GENSCAN_predicted_peptide_6|541_aa
MERERSLDSSECGFQGPLNGCADSCGDGTQTSTVSCKKEQKKGCRYGEQRRVTMKGEKKS
WSEQGIREENDWFEGSTPPTLDRKSDPGWNRSLHHAVTAGYGEGVALVIQYCFSYLFDTS
FSDAKLKPDTMSAHLCFDSHERTSLENIFFQLEKNKQILKVKSRSFLPGAHMRASTDSLL
EVLARAIWPEKEIKGIQLGKKEVILSLFADDMIVYLENPIVSAQNLLKLISNFSKVSGYK
INVQKSQAFLYTNNRQTESQIMSELPFTIVSKRIKYLGIQLTRDVKDLFKENYKPLLKEI
KEDTNKWKNIPCSWVGRINIVKMAILPKVMYRFNAIPIKLPMPFFTELEKTTLKFIWNQK
RARIAKSILSQKNKAGGITLPDFKLYCKATVTKTAWYWYQNRDIDQCNRTEPSEMRPHIY
NYLIFDKPDKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPK
TIKTLGENLGITIQDIGMGKDFLSKTPKAMATEAKIDKWDLIKLKSFCTPKETTIRVKKI
S

>gi568815596r:31235961_31514666|GENSCAN_predicted_CDS_6|1626_bp
atggagagagagagaagcctggactctagtgagtgtggattccaggggcctttgaatggc
tgtgcagactcctgtggagatggaactcagaccagcacagtctcctgtaagaaagaacag
aagaaagggtgcagatatggagaacagagaagggttacaatgaaaggagagaagaaatcc
tggagtgagcagggcattagagaagaaaatgactggtttgaaggaagcacaccacctacc
cttgacaggaaatcagacccaggctggaatcgctctctgcatcatgctgtcactgctgga
tatggggaaggggtggcattggtgattcagtactgcttttcctacctcttcgatacctct
ttcagtgatgcaaagttaaaaccagatactatgagtgctcacctgtgttttgattctcat
gaacgtacttctttggagaatatcttcttccaattagaaaaaaacaaacagattcttaag
gttaagtcacgtagcttcttgcctggtgcccacatgagagcttctacggactcattgttg
gaagttctggccagggcaatttggccggagaaggaaataaagggtattcaattaggaaaa
aaggaagtcatattgtccctgtttgcagatgacatgattgtatatctagaaaaccccatt
gtctcagcccaaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaa
atcaatgtacaaaaatcacaagcattcttatacaccaataacagacaaacagagagccaa
atcatgagtgaactcccattcacaattgtttcaaagagaataaaatacctaggaatccaa
cttacaagggacgtgaaggacctcttcaaggagaactacaaaccactgctcaaggaaata
aaagaggatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatc
gtgaaaatggccatactgcccaaggtaatgtatagattcaatgccatccccatcaagcta
ccaatgcctttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaa
agagcccgcatcgccaagtcaatcctaagccaaaagaacaaagctggaggcatcacgcta
cctgacttcaaactatactgcaaggctacagtaaccaaaacagcatggtactggtaccaa
aacagagatatagatcaatgcaacagaacagagccctcagaaatgaggccgcatatctac
aactatctgatctttgacaaacctgacaaaaacaagcaatggggaaaggattccctattt
aataaatggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttc
cttacaccttatacaaaaattaattcaagatggattaaagacttaaacgttagacctaaa
accataaaaaccctaggagaaaacctaggcattaccattcaggacataggcatgggcaag
gacttcctgtctaaaacaccaaaagcaatggcaacagaagccaaaattgacaaatgggat
ctaattaaactaaagagcttctgcacaccaaaagaaactaccatcagagtgaaaaaaatt
tcttaa

>gi568815596r:31235961_31514666|GENSCAN_predicted_peptide_7|121_aa
MDKFLDIYILPRLRQEEIDSLKRPIMSFRIESVINSPLTNKSPGPGGFTAEFYQMYKEEL
LSFLLKLYQKIEEEGLLSNSFYEAVTILVLKFGRDTTKKEKFRPISLMNMNAKTLNKILG
N

>gi568815596r:31235961_31514666|GENSCAN_predicted_CDS_7|366_bp
atggataaattcctggacatatacatcctcccaagactgcgtcaggaagaaattgattcc
ctgaagagaccaataatgagcttcagaattgaatcagtaataaatagcccactaaccaac
aaaagtccaggacctggtggattcacagctgaattctaccagatgtacaaagaagagctg
ttatcattcctactgaaactataccaaaaaatagaggaggagggccttctctccaactca
ttctatgaggcagtcaccatcctggtactaaaatttggcagggacacaacaaaaaaagaa
aaattcaggccaatatccttgatgaacatgaatgcaaaaactctcaacaaaatattggga
aactga

>gi568815596r:31235961_31514666|GENSCAN_predicted_peptide_8|98_aa
MEIEFDKLTEVGFRRSVITNFSEIKEHILTHRKEAKNLDKRLDKWLTRINSVEKTLNYLM
ELKTTAQGLRNTYTSFNSRFDQVEERISVIEDQINEIK

>gi568815596r:31235961_31514666|GENSCAN_predicted_CDS_8|297_bp
atggagattgagtttgacaagttgacagaagtaggtttcagaaggtcagtaataacaaac
ttctccgagataaaggagcatattctaacccatcgcaaggaagctaaaaaccttgacaaa
aggttagacaaatggctaactagaataaacagtgtagagaagaccttaaattacctgatg
gagctgaaaaccacagcacaaggacttcgtaacacatacacaagcttcaatagccgattt
gatcaagtggaagaaaggatatcagtgattgaagatcaaattaatgaaataaagtga

>gi568815596r:31235961_31514666|GENSCAN_predicted_peptide_9|196_aa
MGKGFPYLTNGAGKLASNVERIPYLINGAGKLASNMRKLKLHPFLTPYTKVNSRLVKDLN
VRPKTIKTLEENLGNIIQDIGMGKGFMTKTPKAMATKAKIDKGDLIKLKSFYTAKETIIR
VNRQRTEWEKIVAIYPSDKGLISRIYKELKQIYKKKSNNPIKKWAKDMNRHFSKEDIYAA
NRHMKKCSPSLATREM

>gi568815596r:31235961_31514666|GENSCAN_predicted_CDS_9|591_bp
atggggaaaggattcccctatttaacaaatggtgctgggaaactggctagcaatgtggaa
aggattccctatttaataaatggtgctgggaaactggctagcaatatgagaaagctgaaa
ctgcatcccttccttacaccatatacaaaagttaactcaagattggttaaagacttaaat
gtaagacctaagaccataaaaaccctagaagaaaacctaggcaatatcattcaggacata
ggcatgggcaaaggcttcatgactaaaacaccaaaagcaatggcaacaaaagctaaaata
gacaaaggggatctaattaaactaaagagcttctacacagcaaaagaaactatcatcaga
gtgaacaggcaacgtacagaatgggagaaaattgttgcaatctacccatctgacaaaggg
ctaatatccagaatctacaaagaacttaaacaaatttacaagaaaaaatcaaacaacccc
atcaaaaagtgggcaaaggatatgaacagacacttctcaaaagaagacatttatgcagcc
aacagacacatgaaaaaatgctcaccatcactggccaccagagaaatgtaa

>gi568815596r:31235961_31514666|GENSCAN_predicted_peptide_10|338_aa
MRKKQSRKTGNSKKQSTSPPPKERSSSPAMEQSWTENDFDELREEGFRRSNYSKLQEEIE
TKGQEVENLEKNLDKCITRITNIEKCLKELMELKAKARELHEECRSLRSRCDQLEERVSV
MEDEMNEMKQEGKFREKRIKRNEQSLQEIWDYVKRPNLPPIDVPESDRENGTKLENTLQD
VIQENFPNLARQANIQIQEIQRMPQRYSSRRATPRHIIVRFTKVEMKEKMLRAAREKGRL
THKGKPIRITADLLAETLQARREWGPIFNILKEKNFQPRISYPAKLSFISEGEIKYFTDK
QMLRDFVTTRPALKELLKEALNMERNNRYQPLQNHAKM

>gi568815596r:31235961_31514666|GENSCAN_predicted_CDS_10|1017_bp
atgaggaaaaaacagagcagaaaaactggaaactctaaaaagcagagcacctctcctcct
ccaaaggaacgcagttcctcaccagcaatggaacaaagctggacggagaatgactttgac
gagttgagagaagaaggtttcagacgatcaaactactccaagctacaggaggaaattgaa
accaaaggccaagaagttgaaaacttggaaaaaaatttagacaaatgtataactagaata
accaatatagagaagtgcttaaaggagctgatggagctgaaagccaaggctcgagaacta
catgaagaatgcagaagcctcaggagccgatgcgatcaactggaagaaagggtatcagta
atggaagatgaaatgaatgaaatgaagcaagaagggaagtttagagaaaaaagaataaaa
agaaatgaacaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacctccg
attgatgtacctgaaagtgacagggagaatggaaccaagttggaaaacactctgcaggat
gttatccaggagaacttccccaacctagcaaggcaggccaacattcagattcaggaaata
cagagaatgccacaaagatactcctcgagaagagcaactccaagacacataattgtcaga
ttcaccaaagttgaaatgaaggaaaaaatgttgagggcagccagagagaaaggtcggctt
acccacaaagggaagcccatcagaataacagcagatctcttggcagaaactctacaagcc
agaagagagtggggaccaatattcaacattcttaaagaaaagaattttcaacccagaatt
tcatatccagccaaactaagcttcataagtgaaggagaaataaaatactttacagacaag
caaatgctgagagattttgtcaccaccaggcctgccctaaaagagctcctgaaggaagca
ctaaacatggaaaggaacaaccggtaccagccgctgcaaaatcatgccaaaatgtaa

>gi568815596r:31235961_31514666|GENSCAN_predicted_peptide_11|585_aa
MIISIDAEKAFDKIQQPFMLKTLNKLGIDGTYLKIIRAIYDKPTANIILNGQKLEAFPLK
TGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVS
AQNPLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLT
RDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWVGRINIVKMAILPKVMYRFNAIPIKLPM
PFFTELEKTTLKFIWNQKRARIIKSILSPKNKAGGITLPDFKLYCKATVTKTAWYWYQNR
NIDQCNRTEPSEIMPHIYNYLIFDKPDKNKQWGKDPLFNKWCWENWLAICRKLKLDPFLT
PYTKINSRWIKDLNIRPKTIKTLEESLGITIQDIGMGKDFMSKTPKAMATKAKIDKWDLI
KLKSFCTAKETTIRVDRQPTEWEKIFTTYSSDKGLISRIYNELKQIYKKKSNNPIKKWAK
HMNRCFSKEDIYAAKRHVKKCSSSLAIREMQIKTTMRYHLTPVRMAIIKKSGNNRCWRGC
GEIRTLLHCWWGCELVQPLWKSVRRFLRDLELEIPFDPAIPLLGI

>gi568815596r:31235961_31514666|GENSCAN_predicted_CDS_11|1758_bp
atgattatctcaatagatgcagaaaaggcctttgacaaaattcaacaacccttcatgcta
aaaactctcaataaattaggtattgatgggacgtatctcaaaataataagagctatctat
gacaaacccacagccaatatcatactgaatgggcaaaaactggaagcattccctttgaaa
actggcacaagacagggatgccctctctcaccactcctattcaacatagtgttggaagtt
ctggccagggcaatcaggcaggagaaggaaataaagggtattcaattaggaaaagaggaa
gtcaaattgtccctgtttgcagatgacatgattgtatatctagaaaaccccattgtctca
gcccaaaatccccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaat
gtacaaaaatcacaagcattcttatacaccaataacagacaaacagagagccaaatcatg
agtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaacttaca
agggacgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaagag
gatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcgtgaaa
atggccatactgcccaaggtaatgtatagattcaatgccatccccatcaagctaccaatg
cctttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcc
cgcatcatcaagtcaatcctaagcccaaagaacaaagctggaggcatcacgctacctgac
ttcaaactatactgcaaggctacagtaaccaaaacagcatggtactggtaccaaaacaga
aatatagatcaatgcaacagaacagagccctcagaaataatgccacatatctacaactat
ctgatctttgacaaacctgacaaaaacaagcaatggggaaaggatcccctatttaataaa
tggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttaca
ccttacacaaaaattaattcaagatggattaaagacttaaacattaggcctaaaaccata
aaaaccctagaagaaagcctaggcattaccattcaggacataggcatgggcaaggacttc
atgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaatt
aaactaaagagcttctgcacagcaaaagaaactaccatcagagtggacaggcaacctaca
gaatgggagaaaattttcacaacctactcatctgacaaagggctaatatccagaatctac
aatgaactcaaacaaatttacaagaaaaaatcaaacaaccccatcaaaaagtgggcaaag
catatgaacagatgcttctcaaaagaagacatttatgcagccaaaagacacgtgaaaaaa
tgctcatcatcactggccatcagagaaatgcaaatcaaaaccacaatgagataccatctc
acgccagttagaatggcaatcattaaaaagtcaggaaacaacaggtgctggagaggatgt
ggagaaataagaacacttttacactgttggtggggctgtgaactagttcaaccattgtgg
aagtcagtgcggcgattcctcagggatctagaactagaaataccatttgacccagccatc
ccattactgggtatataa

>gi568815596r:31235961_31514666|GENSCAN_predicted_peptide_12|94_aa
MAANRWKWSSQNKIGPVKSKGHGNGFLGDAQGVLLADFPEGQIMVTSAHYENVLRKLAKA
LAEGHPGKLHQRVLHHDNALAHSSHPTRTILQEF

>gi568815596r:31235961_31514666|GENSCAN_predicted_CDS_12|285_bp
atggcagccaacaggtggaaatggtccagtcaaaacaaaattggaccagtcaagagcaaa
ggtcatggtaatggttttttgggggatgctcaaggtgttttgcttgctgactttccggag
ggtcagataatggtaacatctgctcattatgaaaatgttttgagaaagttagccaaagct
ttggcagaaggacacccaggaaagcttcaccagagagtccttcatcatgacaatgctctt
gctcattcttctcatccaacaaggacaattttgcaagagttttga