GENSCAN 1.0 Date run: 3-Nov-116 Time: 06:33:43 Sequence gi568815579r:40221885_40436408 : 214524 bp : 51.81% C+G : Isochore 3 (51 - 57 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.22 PlyA - 348 343 6 1.05 1.21 Term - 588 423 166 2 1 103 48 114 0.812 6.60 1.20 Intr - 1419 1268 152 0 2 118 49 61 0.810 4.67 1.19 Intr - 1662 1504 159 0 0 89 99 221 0.999 24.00 1.18 Intr - 2759 2604 156 2 0 96 65 163 0.889 15.52 1.17 Intr - 2927 2838 90 2 0 52 63 125 0.805 7.09 1.16 Intr - 4670 4491 180 2 0 83 89 242 0.675 24.38 1.15 Intr - 4803 4752 52 2 1 110 85 -22 0.122 -0.90 1.14 Intr - 10969 10856 114 0 0 125 57 43 0.502 4.97 1.13 Intr - 12067 11992 76 2 1 115 46 104 0.767 7.97 1.12 Intr - 13263 13161 103 0 1 114 68 113 0.601 12.15 1.11 Intr - 13466 13379 88 1 1 48 105 129 0.994 11.07 1.10 Intr - 14220 14006 215 0 2 119 78 438 0.999 43.94 1.09 Intr - 14501 14373 129 2 0 57 24 209 0.823 12.80 1.08 Intr - 16207 16085 123 1 0 146 96 172 0.924 25.09 1.07 Intr - 17089 17021 69 1 0 113 83 132 0.999 15.07 1.06 Intr - 20185 20054 132 1 0 118 100 308 0.995 36.35 1.05 Intr - 20803 20650 154 0 1 78 103 267 0.916 27.79 1.04 Intr - 33385 33274 112 2 1 125 78 195 0.623 22.14 1.03 Intr - 33721 33587 135 2 0 106 56 30 0.861 2.65 1.02 Intr - 35170 35042 129 2 0 135 89 206 0.999 26.57 1.01 Init - 36193 36145 49 1 1 86 58 25 0.648 -1.63 1.00 Prom - 37874 37835 40 -6.30 2.00 Prom + 39344 39383 40 -3.81 2.01 Init + 39465 39467 3 2 0 113 22 0 0.219 -4.17 2.02 Intr + 41863 41982 120 0 0 123 46 21 0.546 2.59 2.03 Intr + 43273 43489 217 0 1 105 41 148 0.959 10.40 2.04 Intr + 44068 44563 496 2 1 97 107 172 0.729 12.64 2.05 Intr + 45310 45409 100 1 1 85 57 -12 0.773 -3.99 2.06 Intr + 46376 46524 149 1 2 92 80 109 0.839 10.04 2.07 Term + 57475 57586 112 1 1 112 54 29 0.046 0.23 2.08 PlyA + 57769 57774 6 -0.45 3.00 Prom + 61359 61398 40 -4.51 3.01 Sngl + 63094 63558 465 0 0 44 45 357 0.999 21.50 3.02 PlyA + 68366 68371 6 1.05 4.10 PlyA - 69307 69302 6 1.05 4.09 Term - 100492 99998 495 1 0 111 41 531 0.991 45.66 4.08 Intr - 102192 102122 71 1 2 119 87 47 0.998 7.09 4.07 Intr - 104602 104450 153 2 0 91 79 243 0.867 24.26 4.06 Intr - 106666 106529 138 2 0 114 81 133 0.973 16.14 4.05 Intr - 112045 111967 79 1 1 109 89 51 0.967 6.92 4.04 Intr - 114340 114226 115 0 1 106 94 171 0.996 20.45 4.03 Intr - 114523 114436 88 2 1 59 66 153 0.999 9.73 4.02 Intr - 120006 119955 52 0 1 147 92 6 0.992 5.87 4.01 Init - 120183 120121 63 0 0 86 89 79 0.857 6.99 4.00 Prom - 125122 125083 40 -2.31 5.00 Prom + 127276 127315 40 -2.41 5.01 Init + 129832 129893 62 0 2 82 90 33 0.062 3.85 5.02 Term + 140286 140304 19 0 1 105 41 31 0.017 -1.83 5.03 PlyA + 141347 141352 6 1.05 6.00 Prom + 143107 143146 40 -2.01 6.01 Init + 144600 144626 27 2 0 86 92 33 0.982 3.12 6.02 Intr + 144726 144800 75 2 0 99 97 114 0.999 13.61 6.03 Intr + 144889 145031 143 0 2 144 91 157 0.997 21.26 6.04 Intr + 145812 145995 184 0 1 48 82 274 0.996 23.11 6.05 Intr + 148024 148144 121 0 1 116 89 208 0.987 24.27 6.06 Intr + 148226 148353 128 0 2 123 62 206 0.999 22.40 6.07 Intr + 149789 149989 201 1 0 82 101 310 0.994 31.70 6.08 Intr + 152597 152736 140 1 2 86 84 174 0.999 16.57 6.09 Intr + 154725 154890 166 0 1 88 103 290 0.994 30.98 6.10 Intr + 155902 156001 100 0 1 104 109 175 0.983 21.38 6.11 Term + 156102 156289 188 1 2 106 54 277 0.999 24.07 6.12 PlyA + 156584 156589 6 -3.24 7.05 PlyA - 157405 157400 6 -4.04 7.04 Term - 157885 157703 183 1 0 133 45 91 0.974 7.16 7.03 Intr - 159284 158439 846 2 0 92 83 1268 0.696 118.94 7.02 Intr - 162255 161899 357 0 0 124 116 699 0.936 72.31 7.01 Init - 168018 167554 465 0 0 97 113 1027 0.986 101.78 7.00 Prom - 169868 169829 40 -3.71 8.10 PlyA - 171903 171898 6 1.05 8.09 Term - 176086 172082 4005 1 0 129 48 3077 0.844 294.43 8.08 Intr - 176932 176736 197 2 2 90 80 406 0.986 39.65 8.07 Intr - 181978 181822 157 1 1 75 86 213 0.939 19.90 8.06 Intr - 186147 186022 126 0 0 119 105 106 0.968 16.78 8.05 Intr - 186372 186274 99 0 0 103 81 -10 0.457 0.61 8.04 Intr - 186499 186456 44 2 2 60 99 34 0.758 0.15 8.03 Intr - 189655 189520 136 1 1 58 82 80 0.067 5.05 8.02 Intr - 201662 200997 666 2 0 115 56 720 0.011 64.06 8.01 Init - 212251 212171 81 1 0 66 32 107 0.448 1.82 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815579r:40221885_40436408|GENSCAN_predicted_peptide_1|860_aa MGFLHVGQAGLQLPTSGEYIKTWRPRYFLLKSDGSFIGYKERPEAPDQTLPPLNNFSVAG LSSEKLLPLGSPPTPQAGLDATSELLHTSRLVPIAALSMEPSLSECQLMKTERPRPNTFV IRCLQWTTVIERTFHVDSPDEREEWMRAIQMVANSLKQRAPGEDPMDYKCGSPSDSSTTE EMEVAVSKARAKVTMNDFDYLKLLGKGTFGKVILVREKATGRYYAMKILRKEVIIAKALK YAFQTHDRLCFVMEYANGGELFFHLSRERVFTEERARFYGAEIVSALEYLHSRDVVYRDI KLENLMLDKDGHIKITDFGLCKEGISDGATMKTFCGTPEYLAPEVLEDNDYGRAVDWWGL GVVMYEMMCGRLPFYNQDHERLFELILMEEIRFPRTLSPEAKSLLAGLLKKDPKQRLGGG PSDAKEVMEHRFFLSINWQDVVQKKLLPPFKPQVTSEVDTRYFDDEFTAQSITITPPDRY DSLGLLELDQRTHFPQFSYSASIREAPRSAALRLNMSARPCQCCGTPVRASDCVCRRDAG TRGCKGRCLASPATPSWRMLSLAASLDAEPSSAAVPDGFPAGPTVSPRRLARPPGLEEAL SALGLQGEREYAGDIFAEVMVCRVLPLRALPRAVTPEMRALVVDWLVQVHEYLGLAGDTL YLAVHLLDSYLSAGRVRLHRLQLLGVACLFVACKMEECVLPEPAFLCLLSADSFSRAELL RAERRILSRLDFRLHHPGPLLCLGLLAALAGSSPQVMLLATYFLELSLLEAEAAGWEPGR RAAAALSLAHRLLDGAGSRLQPELYSPEELGTLEPCMARAALRGPAPGRAAVFLKYARPQ RQGTSLAAACLLRRLQSEPP >gi568815579r:40221885_40436408|GENSCAN_predicted_CDS_1|2583_bp atggggtttctccatgttggtcaggctggtctccaactcccgacctcaggtgaatacatc aagacctggaggccacggtacttcctgctgaagagcgacggctccttcattgggtacaag gagaggcccgaggcccctgatcagactctaccccccttaaacaacttctccgtagcaggg cttagttcagagaagctgcttcctctgggaagccccccaaccccccaggctgggttggat gctacctctgagcttctgcacacctctaggcttgtccccatcgcggccctgtctatggag ccatcactctctgaatgccagctgatgaagaccgagaggccgcgacccaacacctttgtc atacgctgcctgcagtggaccacagtcatcgagaggaccttccacgtggattctccagac gagagggaggagtggatgcgggccatccagatggtcgccaacagcctcaagcagcgggcc ccaggcgaggaccccatggactacaagtgtggctcccccagtgactcctccacgactgag gagatggaagtggcggtcagcaaggcacgggctaaagtgaccatgaatgacttcgactat ctcaaactccttggcaagggaacctttggcaaagtcatcctggtgcgggagaaggccact ggccgctactacgccatgaagatcctgcggaaggaagtcatcattgccaaggcgctgaag tatgccttccagacccacgaccgcctgtgctttgtgatggagtatgccaacgggggtgag ctgttcttccacctgtcccgggagcgtgtcttcacagaggagcgggcccggttttatggt gcagagattgtctcggctcttgagtacttgcactcgcgggacgtggtataccgcgacatc aagctggaaaacctcatgctggacaaagatggccacatcaagatcactgactttggcctc tgcaaagagggcatcagtgacggggccaccatgaaaaccttctgtgggaccccggagtac ctggcgcctgaggtgctggaggacaatgactatggccgggccgtggactggtgggggctg ggtgtggtcatgtacgagatgatgtgcggccgcctgcccttctacaaccaggaccacgag cgcctcttcgagctcatcctcatggaagagatccgcttcccgcgcacgctcagccccgag gccaagtccctgcttgctgggctgcttaagaaggaccccaagcagaggcttggtgggggg cccagcgatgccaaggaggtcatggagcacaggttcttcctcagcatcaactggcaggac gtggtccagaagaagctcctgccacccttcaaacctcaggtcacgtccgaggtcgacaca aggtacttcgatgatgaatttaccgcccagtccatcacaatcacaccccctgaccgctat gacagcctgggcttactggagctggaccagcggacccacttcccccagttctcctactcg gccagcatccgcgaggccccacgttctgcagccttaaggttgaacatgagtgcacgtcca tgtcagtgctgtgggactcctgtgcgtgcctcggactgcgtgtgtcggcgggacgcaggc acacgtggctgcaagggtcggtgcttggcctcgcccgcaacaccctcctggaggatgctg agtctcgctgcctccctcgacgcagagccttcaagcgccgcagtccccgacggcttcccc gcgggccccactgtctccccaagacgcctggcgaggccgccggggctggaggaggcgctg agcgcgctggggctgcagggagaacgcgagtacgccggggacatcttcgccgaagtcatg gtgtgccgcgtgctgcccctgagagccctgccccgcgctgtgaccccggagatgcgcgcc ctggtggtagactggctggtccaggtgcacgagtacctgggtctggctggtgacacactt tatctggcggttcacctgcttgattcctacctgagcgctggccgcgtgcgtctacatcgc ctgcagctgctgggcgtggcttgcctgtttgtggcgtgcaaaatggaagagtgcgtgctt cccgagcccgccttcctctgcctcctgagcgcggactccttctcacgggcggagctgctg cgcgccgagcgtcgcatcctgagccgcctggatttccggctgcaccaccccggcccgctg ctgtgcctcgggctgctggccgcgctggcagggagcagcccccaggtgatgttacttgcc acctacttcctggagctgtctttgctggaggccgaggcggcgggatgggagccgggtcgt cgtgcggctgcggctctgagcctggcgcaccgcttgctcgacggggcgggctccaggctc cagccagaactttacagccccgaggaactgggcaccctcgagccgtgcatggcccgcgct gcgctccgaggtcccgcgccgggtcgcgccgcagtcttcctcaagtatgcgcggccccag cgccaggggaccagccttgccgccgcctgcctgctccgccgcctccagtctgagcctccc tga >gi568815579r:40221885_40436408|GENSCAN_predicted_peptide_2|398_aa MAGTVQDAWLFPMPAAPASCPLFEQAHLPPQDSLAVASGITGTAFQEEPAWEKESGGQKR PLTTLVEPAFFDDRHLIHGGSVVRCHLARDSSGQQDMQEAPWTGHSLQDKRGADDREKTG LRQSGILNKKATYCVCTVWGLEASPGGLTEALIPTPRPSLLSSGTSNQQTEGRRGHCFVS GSQNGGEEGARQRQLGPYLEPGAWPGSTETKAPLPFPAWGYLRTAGRELRWRPEKWQLAL GTPWASAFSRAPGELVGRTRYCLACDSGRLPRSRQKEGRTFVAPWCSQQKSQTPQLRIQG PSGPRLCCPAFQRPRDIASQGPCNAAALKYPLWAQGSAYCLAPSEQPNTWLPLFQREAKA ARLSPPAGPLAAWLAVQKSLPWNHPTVRPGGLWLRAFR >gi568815579r:40221885_40436408|GENSCAN_predicted_CDS_2|1197_bp atggctggtacagtgcaggatgcgtggctcttcccgatgcctgcagcgccagcttcctgc cccctctttgaacaagcccacttaccccctcaggactcattggctgtcgcctctggcata actggcacagctttccaggaggagccagcgtgggagaaagaatctggcgggcaaaagcgg cctcttaccacgcttgtggagccagccttctttgatgacagacacctcattcatggtggc agcgtggtacgctgtcacctagctcgggacagctcagggcagcaggacatgcaggaggca ccgtggacagggcacagtctgcaagacaagagaggagctgatgacagagaaaagacaggt ctcagacaatccgggatcttgaacaagaaagcaacctactgcgtatgcacggtctggggc ctagaagccagtcctgggggcttgaccgaggccctgatccctaccccaagaccttcactt ctgtcctcagggaccagcaaccaacagactgagggacgccgaggccactgcttcgtctca ggttcccagaacggaggggaggagggcgcgaggcagcgccagctgggcccctacctggag cctggggcctggccagggtccacagagaccaaggcgcctttgcctttcccagcctggggt tatttgcggacagcaggtcgggagctacggtggcgcccagagaagtggcagctggctcta gggacaccatgggcatcagctttctctcgtgccccaggggagctggtgggccggacacgc tactgcctggcatgtgacagtggcaggctgcccaggtcccggcagaaagaagggagaacc tttgtggctccctggtgttcccagcaaaaaagccaaactcctcagctgaggatccaaggc ccttcaggacctcgcctctgctgccctgcgttccagcggcccagggacatcgcctctcag ggcccttgtaatgcagcagcactaaagtacccactgtgggcccagggctcggcctactgc ctggcgcccagtgaacagccaaacacatggctgccattattccagagagaagcaaaggca gccaggctgtctcccccagctgggcctctcgctgcttggctggctgtccagaagagcctg ccatggaaccaccccaccgtgcggcccggaggcctgtggttgagggccttccgctga >gi568815579r:40221885_40436408|GENSCAN_predicted_peptide_3|154_aa MARRHRPGPPRLVPPPPLGHPHSAPEGGARGPRSPRRARVVQPIAAPRGARTRPSWGGRS GTRAGALTCHRQPPSLSGGALPPPIATLRWFPFLVFPGSGNGAGSGSGGGDASSEAGPTA STARPPPPPPPSPRLPPPPTLWTEIPPSRATNRK >gi568815579r:40221885_40436408|GENSCAN_predicted_CDS_3|465_bp atggcccgccgccaccgccccggccccccgaggctggtcccaccgcccccgctcggccac ccccactcagctcctgaaggaggggctcgagggccgcggtccccccgccgggcacgggtg gtacagcccatcgcggcaccgcggggggcccggacgcgaccatcgtggggggggcgttcg gggacacgcgctggcgcactcacctgtcaccggcagccgcccagcctctccggcggggct ctacccccgcccatcgccacgctgcgctggttccctttccttgtgtttcccggcagcggc aacggcgccggcagcggcagcggcggcggcgacgcctcctccgaggcaggcccaacggct agcacggcgcggccccccccgccccctccccccagcccccgcttaccgccccctcccacc ctctggacggaaataccgccctctcgcgccacaaacaggaagtga >gi568815579r:40221885_40436408|GENSCAN_predicted_peptide_4|417_aa MVVAALSQADFLPAAPGYSWAAPGSSVRARETMVSVTMATSEWIQFFKEAGIPPGPAVNY AVMFVDNRIQKSMLLDLNKEIMNELGVTVVGDIIAILKHAKVVHRQDMCKAATESVPCSP SPLAGEIRRGTSAASRMITNSLNHDSPPSTPPRRPDTSTSKISVTVSNKMAAKSAKATAA LARREEESLAVPAKRRRVTAEMEGKYVINMPKGTTPRTRKILEQQQAAKGLHRTSVFDRL GAETKADTTTGSKPTGVFSRLGATPETDEDLAWDSDNDSSSSVLQYAGVLKKLGRGPAKA SPQPALTVKAKATSSATTAAAPTLRRLALSSRSGLERKPESLSKVSIIKRLGAAALVPEA QDSQVTSTKSKSSAEVKVTIKRTLVGPRGSSSSEGLGAQMDHAGTVSVFKRLGRRTF >gi568815579r:40221885_40436408|GENSCAN_predicted_CDS_4|1254_bp atggtggtggcagctctgtcccaggctgacttcctgccagctgctccaggctattcctgg gcggctccagggagcagtgtcagggccagggagacgatggtctccgtgactatggccact tccgagtggatccagttctttaaggaagccggcattcctccaggacctgccgtcaattat gccgtgatgtttgtggataataggattcagaagagcatgctgctggatctcaataaggag ataatgaatgagctgggcgtgaccgtggtgggtgacatcatcgccattctcaagcatgcc aaagtggtgcaccgtcaggacatgtgcaaagctgccactgagtcagtaccctgcagccct agcccccttgcaggcgaaattcgccgtggcaccagtgctgcctcccgaatgatcaccaac agcctgaaccatgactctccacccagcacaccccccaggcgcccggacaccagcacctcc aagatctcggtcactgtgtccaacaagatggcagcaaagagtgccaaggccactgcagcc ctggcccgccgggaggaggagagcctggctgttcctgccaagcggcgccgggtcactgct gagatggaggggaagtacgtcatcaacatgcccaaaggcaccacaccccgcacccgcaag atcctggagcagcagcaggctgcaaaaggtctccataggacgtctgtgtttgaccgcctc ggcgccgagaccaaggcagacaccacgacagggagtaaacccacaggagtcttcagccgc ctgggggccaccccagaaacggacgaggatctggcttgggacagcgacaatgacagcagc agctctgtcttgcagtatgccggggtcctgaagaagctaggacggggcccagccaaggcc agtccccagccagcactgactgtcaaagccaaggccacaagctcagcgacaacggctgct gccccgacactgcggcgcctggcgctttcctcacggtctgggcttgagaggaagccggag tccttgtctaaagtcagcatcatcaagagactgggcgcagctgcccttgtgcccgaggcc caggacagccaggtcaccagcaccaagagtaagtcctcagccgaggtcaaggtcaccatt aagaggactctggtggggccccgggggagcagctccagcgagggccttggtgcccagatg gaccacgcgggcactgtgagcgtgttcaaaagactgggccgcaggaccttctag >gi568815579r:40221885_40436408|GENSCAN_predicted_peptide_5|26_aa MGHIVEGLVSHAEGWLLLPVSCIPSA >gi568815579r:40221885_40436408|GENSCAN_predicted_CDS_5|81_bp atggggcacatcgtggaaggccttgtgagccatgcagaaggctggcttttactcccagtg agctgtatccccagtgcctaa >gi568815579r:40221885_40436408|GENSCAN_predicted_peptide_6|490_aa MKPKLMYQELKVPAEEPANELPMNEIEAWKAAEKKARWVLLVLILAVVGFGALMTQLFLW EYGDLHLFGPNQRPAPCYDPCEAVLVESIPEGLDFPNASTGNPSTSQAWLGLLAGAHSSL DIASFYWTLTNNDTHTQEPSAQQGEEVLRQLQTLAPKGVNVRIAVSKPSGPQPQADLQAL LQSGAQVRMVDMQKLTHGVLHTKFWVVDQTHFYLGSANMDWRSLTQVKELGVVMYNCSCL ARDLTKIFEAYWFLGQAGSSIPSTWPRFYDTRYNQETPMEICLNGTPALAYLASAPPPLC PSGRTPDLKALLNVVDNARSFIYVAVMNYLPTLEFSHPHRFWPAIDDGLRRATYERGVKV RLLISCWGHSEPSMRAFLLSLAALRDNHTHSDIQVKLFVVPADEAQARIPYARVNHNKYM VTERATYIGTSNWSGNYFTETAGTSLLVTQNGRGGLRSQLEAIFLRDWDSPYSHDLDTSA DSVGNACRLL >gi568815579r:40221885_40436408|GENSCAN_predicted_CDS_6|1473_bp atgaagcctaaactgatgtaccaggagctgaaggtgcctgcagaggagcccgccaatgag ctgcccatgaatgagattgaggcgtggaaggctgcggaaaagaaagcccgctgggtcctg ctggtcctcattctggcggttgtgggcttcggagccctgatgactcagctgtttctatgg gaatacggcgacttgcatctctttgggcccaaccagcgcccagccccctgctatgaccct tgcgaagcagtgctggtggaaagcattcctgagggcctggacttccccaatgcctccacg gggaacccttccaccagccaggcctggctgggcctgctcgccggtgcgcacagcagcctg gacatcgcctccttctactggaccctcaccaacaatgacacccacacgcaggagccctct gcccagcagggtgaggaggtcctccggcagctgcagaccctggcaccaaagggcgtgaac gtccgcatcgctgtgagcaagcccagcgggccccagccacaggcggacctgcaggctctg ctgcagagcggtgcccaggtccgcatggtggacatgcagaagctgacccatggcgtcctg cataccaagttctgggtggtggaccagacccacttctacctgggcagtgccaacatggac tggcgttcactgacccaggtcaaggagctgggcgtggtcatgtacaactgcagctgcctg gctcgagacctgaccaagatctttgaggcctactggttcctgggccaggcaggcagctcc atcccatcaacttggccccggttctatgacacccgctacaaccaagagacaccaatggag atctgcctcaatggaacccctgctctggcctacctggcgagtgcgcccccacccctgtgt ccaagtggccgcactccagacctgaaggctctactcaacgtggtggacaatgcccggagt ttcatctacgtcgctgtcatgaactacctgcccactctggagttctcccaccctcacagg ttctggcctgccattgacgatgggctgcggcgggccacctacgagcgtggcgtcaaggtg cgcctgctcatcagctgctggggacactcggagccatccatgcgggccttcctgctctct ctggctgccctgcgtgacaaccatacccactctgacatccaggtgaaactctttgtggtc cccgcggatgaggcccaggctcgaatcccatatgcccgtgtcaaccacaacaagtacatg gtgactgaacgcgccacctacatcggaacctccaactggtctggcaactacttcacggag acggcgggcacctcgctgctggtgacgcagaatgggaggggcggcctgcggagccagctg gaggccattttcctgagggactgggactccccttacagccatgaccttgacacctcagct gacagcgtgggcaacgcctgccgcctgctctga >gi568815579r:40221885_40436408|GENSCAN_predicted_peptide_7|616_aa MSTIQSETDCYDIIEVLGKGTFGEVAKGWRRSTGEMVAIKILKNDAYRNRIIKNELKLLH CMRGLDPEEAHVIRFLEFFHDALKFYLVFELLEQNLFEFQKENNFAPLPARHIRTVTLQV LTALARLKELAIIHADLKPENIMLVDQTRCPFRVKVIDFGSASIFSEVRYVKEPYIQSRF YRAPEILLGLPFCEKVDVWSLGCVMAELHLGWPLYPGNNEYDQVRYICETQGLPKPHLLH AACKAHHFFKRNPHPDAANPWQLKSSADYLAETKVRPLERRKYMLKSLDQIETVNGGSVA SRLTFPDREALAEHADLKSMVELIKRMLTWESHERISPSAALRHPFVSMQQLRSAHETTH YYQLSLRSYRLSLQVEGKPPTPVVAAEDGTPYYCLAEEKEAAGMGSVAGSSPFFREEKAP GMQRAIDQLDDLSLQEAGHGLWGETCTNAVSDMMVPLKAAITGHHVPDSGPEPILAFYSS RLAGRHKARKPPAGSKSDSNFSNLIRLSQVSPEDDRPCRGSSWEEGEHLGASAEPLAILQ RDEDGPNIDNMTMEAERPDPELFDPSSCPGEWLSEPDCTLESVRGPRAQGLPPRRSHQHG PPRGATSFLQHVTGHH >gi568815579r:40221885_40436408|GENSCAN_predicted_CDS_7|1851_bp atgtccaccatccagtcggagactgactgctacgacatcatcgaggtcttgggcaagggg accttcggggaggtagccaagggctggcggcggagcacgggcgagatggtggccatcaag atcctcaagaatgacgcctaccgcaaccgcatcatcaagaacgagctgaagctgctgcac tgcatgcgaggcctagaccctgaagaggcccacgtcatccgcttccttgagttcttccat gacgccctcaagttctacctggtctttgagctgctggagcaaaaccttttcgagttccag aaggagaacaacttcgcgcccctccccgcccgccacatccgtacagtcaccctgcaggtg ctcacagccctggcccggctcaaggagctggctatcatccacgctgatctcaagcctgag aacatcatgctggtggaccagacccgctgccccttcagggtcaaggtgattgacttcgga tccgccagcattttcagcgaggtgcgctacgtgaaggagccatacatccagtcgcgcttc taccgggcccctgagatcctgctggggctgcccttctgcgagaaggtggacgtgtggtcc ctgggctgcgtcatggctgagctgcacctgggctggcctctctaccccggcaacaacgag tacgaccaggtgcgctacatctgcgaaacccagggcctgcccaagccacacctgttgcac gccgcctgcaaggcccaccacttcttcaagcgcaacccccaccctgacgctgccaacccc tggcagctcaagtcctcggctgactacctggccgagacgaaggtgcgcccattggagcgc cgcaagtatatgctcaagtcgttggaccagattgagacagtgaatggtggcagtgtggcc agtcggctaaccttccctgaccgggaggcgctggcggagcacgccgacctcaagagcatg gtggagctgatcaagcgcatgctgacctgggagtcacacgaacgcatcagccccagtgct gccctgcgccaccccttcgtgtccatgcagcagctgcgcagtgcccacgagaccacccac tactaccagctctcgctgcgcagctaccgcctctcgctgcaagtggaggggaagcccccc acgcccgtcgtggccgcagaagatgggaccccctactactgtctggctgaggagaaggag gctgcgggtatgggcagtgtggccggcagcagccccttcttccgagaggagaaggcacca ggtatgcaaagagccatcgaccagctggatgacctgagtctgcaggaggctgggcatggg ctgtggggtgagacctgcaccaatgcggtctccgacatgatggtccccctcaaggcagcc atcactggccaccatgtgcccgactcgggccctgagcccatcctggccttctacagcagc cgcctggcaggccgccacaaggcccgcaagccacctgcgggttccaagtccgactccaac ttcagcaacctcattcggctgagccaggtctcgcctgaggatgacaggccctgccggggc agcagctgggaggaaggagagcatctcggggcctctgctgagccactggccatcctgcag cgagatgaggatgggcccaacattgacaacatgaccatggaagctgagaggccagaccct gagctcttcgaccccagcagctgtcctggagaatggctgagtgagccagactgcaccctg gagagcgtcaggggcccacgggctcaggggctcccaccccgccgctcccaccagcatggt ccaccccggggggccaccagcttcctccagcatgtcaccgggcaccactga >gi568815579r:40221885_40436408|GENSCAN_predicted_peptide_8|1836_aa MGPRPALGVAGRPSPSPLPTLRPGPAGMLSKGLKRKREEEEEKEPLAVDSWWLDPGHTAV AQAPPAVASSSLFDLSVLKLHHSLQQSEPDLRHLVLVVNTLRRIQASMAPAAALPPVPSP PAAPSVADNLLASSDAALSASMASLLEDLSHIEGLSQAPQPLADEGPPGRSIGGAAPSLG ALDLLGPATGCLLDDGLEGLFEDIDTSMYDNELWAPASEGLKPGPEDGPGKEEAPELDEA ELDYLMDVLAPGMTLTATGLGQRADDPGDPLYYHLQGSGFSCPQLRTLCCVSVLERRTQE AQGAGGDPQAARTPRKGVSPADSCAAPRAGLLLGGSACTQAAAQRQLLHAELKLVLQQKG ERTQEPGVQVPPSNAMEARSRSAEELRRAELVEIIVETEAQTGVSGINVAGGGKEGIFVR ELREDSPAARSLSLQEGDQLLSARVFFENFKYEDALRLLQCAEPYKVSFCLKRTVPTGDL ALRPGTVSGYEIKGPRAKVAKLNIQSLSPVKKKKMVPGALGVPADLAPVDVEFSFPKFSR LRRGLKAEAVKGPVPAAPARRRLQLPRLRVREVAEEAQAARLAAAAPPPRKAKVEAEVAA GARFTAPQVELVGPRLPGAEVGVPQVSAPKAAPSAEAAGGFALHLPTLGLGAPAPPAVEA PAVGIQVPQVELPALPSLPTLPTLPCLETREGAVSVVVPTLDVAAPTVGVDLALPGAEVE ARGEAPEVALKMPRLSFPRFGARAKEVAEAKVAKVSPEARVKGPRLRMPTFGLSLLEPRP AAPEVVESKLKLPTIKMPSLGIGVSGPEVKVPKGPEVKLPKAPEVKLPKVPEAALPEVRL PEVELPKVSEMKLPKVPEMAVPEVRLPEVELPKVSEMKLPKVPEMAVPEVRLPEVQLLKV SEMKLPKVPEMAVPEVRLPEVQLPKVSEMKLPEVSEVAVPEVRLPEVQLPKVPEMKVPEM KLPKVPEMKLPEMKLPEVQLPKVPEMAVPDVHLPEVQLPKVPEMKLPEMKLPEVKLPKVP EMAVPDVHLPEVQLPKVPEMKLPKMPEMAVPEVRLPEVQLPKVSEMKLPKVPEMAVPDVH LPEVQLPKVCEMKVPDMKLPEIKLPKVPEMAVPDVHLPEVQLPKVSEIRLPEMQVPKVPD VHLPKAPEVKLPRAPEVQLKATKAEQAEGMEFGFKMPKMTMPKLGRAESPSRGKPGEAGA EVSGKLVTLPCLQPEVDGEAHVGVPSLTLPSVELDLPGALGLQGQVPAAKMGKGERVEGP EVAAGVREVGFRVPSVEIVTPQLPAVEIEEGRLEMIETKVKPSSKFSLPKFGLSGPKVAK AEAEGAGRATKLKVSKFAISLPKARVGAEAEAKGAGEAGLLPALDLSIPQLSLDAHLPSG KVEVAGADLKFKGPRFALPKFGVRGRDTEAAELVPGVAELEGKGWGWDGRVKMPKLKMPS FGLARGKEAEVQGDRASPGEKAESTAVQLKIPEVELVTLGAQEEGRAEGAVAVSGMQLSG LKVSTAGQVVTEGHDAGLRMPPLGISLPQVELTGFGEAGTPGQQAQSTVPSAEGTAGYRV QVPQVTLSLPGAQVAGGELLVGEGVFKMPTVTVPQLELDVGLSREAQAGEAATGEGGLRL KLPTLGARARVGGEGAEEQPPGAERTFCLSLPDVELSPSGGNHAEYQVAEGEGEAGHKLK VRLPRFGLVRAKEGAEEGEKAKSPKLRLPRVGFSQSEMVTGEGSPSPEEEEEEEEEGSGE GASGRRGRVRVRLPRVGLAAPSKASRGQEGDAAPKSPVREKSPKFRFPRVSLSPKARSGS GDQEEGGLRVRLPSVGFSETGAPGPARMEGAQAAAV >gi568815579r:40221885_40436408|GENSCAN_predicted_CDS_8|5511_bp atgggcccgcggcccgccctgggcgtggcgggaaggcccagtccctccccgctgcccacc ctccgccctgggccggccgggatgctgagcaagggtctgaagcggaaacgggaggaggag gaggagaaggaacctctggcagtcgactcctggtggctagatcctggccacacagcggtg gcacaggcacccccggccgtggcctctagctccctctttgacctctcagtgctcaagctc caccacagcctgcagcagagtgagccggacctgcggcacctggtgctggtcgtgaacact ctgcggcgcatccaggcgtccatggcacccgcggctgccctgccacctgtgcctagccca cctgcagcccccagtgtggctgacaacttactggcaagctcggacgctgccctttcagcc tccatggccagcctcctggaggacctcagccacattgagggcctgagtcaggctccccaa cccttggcagacgaggggccaccaggccgtagcatcgggggagcagcgcccagcctgggt gccttggacctgctgggcccagccactggctgtctactggacgatgggcttgagggcctg tttgaggatattgacacctctatgtatgacaatgaactttgggcaccagcctctgagggc ctcaaaccaggccctgaggatgggccgggcaaggaggaagctccggagctggacgaggcc gaattggactacctcatggatgtgctggctccgggcatgaccctcacagccacgggcctg ggacagagagctgatgacccaggagaccccctctactaccacctacaaggttcaggcttc tcgtgtccccagctcaggactctgtgctgtgtatcagtcctggagcgccggacccaggag gcccaaggagctggaggtgaccctcaggcagcaagaaccccacggaagggcgtgagccct gcagacagctgtgcggcacctcgggctgggctcctgttaggaggaagtgcctgcacccag gcagcggctcagaggcagctgctccatgcagaactgaagctggttctgcagcagaaaggg gagaggacacaggagcctggggtgcaggtgcctcccagcaacgccatggaggccaggagc cggagtgccgaggagctgaggcgggcggagttggtggaaattatcgtggagacggaggcg cagaccggggtcagcggcatcaacgtagcgggcggcggcaaagagggaatcttcgttcgg gagctgcgcgaggactcacccgccgccaggagcctcagcctgcaggaaggggaccagctg ctgagtgcccgagtgttcttcgagaacttcaagtacgaggacgcactacgcctgctgcaa tgcgccgagccttacaaagtctccttctgcctgaagcgcactgtgcccaccggggacctg gctctgcggcccgggaccgtgtctggctacgagatcaagggcccgcgggccaaggtggcc aagctgaacatccagagtctgtcccctgtgaagaagaagaagatggtgcctggggctctg ggggtccccgctgacctggcccctgttgacgtcgagttctcctttcccaagttctcccgc ctgcgtcggggcctcaaagccgaggctgtcaagggtcctgtcccggctgcccctgcccgc cggcgcctccagctgcctcggctgcgtgtacgagaagtggccgaagaggctcaggcagcc cggctggccgccgccgctcctccccccaggaaagccaaggtggaggctgaggtggctgca ggagctcgtttcacagcccctcaggtggagctggttgggccgcggctgccaggggcggag gtgggtgtcccccaggtctcagcccccaaggctgccccctcagcagaggcagctggtggc tttgccctccacctgccaacccttgggctcggagccccggctccgcctgctgtggaggcc ccagccgtgggaatccaggtcccccaggtggagctgcctgccttgccctcactgcccact ctgcccacacttccctgcctagagacccgggaaggggctgtgtcggtagtggtgcccacc ctggatgtggcagcaccgactgtgggggtggacctggccttgccgggtgcagaggtggag gcccggggagaggcacctgaggtggccctgaagatgccccgccttagttttccccgattt ggggctcgagcaaaggaagttgctgaggccaaggtagccaaggtcagccctgaggccagg gtgaaaggtcccagacttcgaatgcccacctttgggctttccctcttggagccccggccc gctgctcctgaagttgtagagagcaagctgaagctgcccaccatcaagatgccctccctt ggcatcggagtgtcagggcccgaggtcaaggtgcccaagggacctgaagtgaagctcccc aaggctcctgaggtcaagcttccaaaagtgcccgaggcagcccttccagaggttcgactc ccagaggtggagctccccaaggtgtcagagatgaaactcccaaaggtgccagagatggct gtgccggaggtgcggcttccagaggtagagctgcccaaagtgtcagagatgaaactccca aaggtgccagagatggctgtgccggaggtgcggcttccagaggtacagctgctgaaagtg tcggagatgaaactcccaaaggtgccagagatggctgtgccggaggtgcggcttccagag gtacagctgccgaaagtgtcagagatgaaactcccagaggtgtcagaggtggctgtgcca gaggtgcggcttccagaggtgcagctgccgaaagtgccagagatgaaagtccctgagatg aagcttccaaaggtgcctgagatgaaacttcctgagatgaaactccctgaagtgcaactc ccgaaggtgcccgagatggccgtgcccgatgtgcacctcccagaagtgcagcttccaaaa gtcccagagatgaagctccctgagatgaaactccctgaggtgaaactcccgaaggtgccc gagatggctgtgcccgatgtgcacctcccggaagtgcagctcccgaaagtcccagagatg aaactccctaaaatgcctgagatggctgtgccagaggttcgactccccgaggtgcagctg ccaaaagtctcagagatgaaactccccaaggtgcctgaaatggccgtgcccgatgtgcac ctcccagaggtgcagctgcccaaagtctgtgaaatgaaagtccctgacatgaagctccca gagataaaactccccaaggtgcctgagatggctgtgcccgatgtgcacctccccgaggtg cagctgccgaaagtgtcagagattcggctgccggaaatgcaagtgccgaaggttcccgac gtgcatcttccgaaggcaccagaggtgaagctgcccagggctccggaggtgcagctaaag gccaccaaggcagaacaggcagaagggatggaatttggcttcaagatgcccaagatgacc atgcccaagctagggagggcagagtccccatcacgtggcaagccaggcgaggcgggtgct gaggtctcagggaagctggtaacacttccctgtctgcagccagaggtggatggtgaggct catgtgggtgtcccctctctcactctgccttcagtggagctagacctgccaggagcactt ggcctgcaggggcaggtcccagccgctaaaatgggcaagggagagcgggtggagggccct gaggtggcagcaggggtcagggaagtgggcttccgagtgccctctgttgaaattgtcacc ccacagctgcccgccgtggaaattgaggaagggcggctggagatgatagagacaaaagtc aagccctcttccaagttctccttacctaagtttggactctcggggccaaaggtggctaag gcagaggctgagggggctgggcgagctaccaagctgaaggtatccaaatttgccatctca ctccccaaggctcgggtgggggctgaggctgaggccaaaggggctggggaggcaggcctg ctgcctgccctcgatctgtccatcccacagctcagcctggatgcccacctgccctcaggc aaggtagaggtggcaggggccgacctcaagttcaaggggcccaggtttgctctccccaag tttggggtcagaggccgggacactgaggcagcagaactagtgccaggggtggctgagttg gagggcaagggctggggctgggatgggagggtgaagatgcccaagctgaagatgccttcc tttgggctggctcgagggaaggaagcagaagttcaaggtgatcgtgccagcccgggggaa aaggctgagtccaccgctgtgcagcttaagatccccgaggtggagctggtcacgctgggc gcccaggaggaagggagggcagagggggctgtggccgtcagtggaatgcagctgtcaggc ctgaaggtgtccacagccgggcaggtggtcactgagggccatgacgcggggctgaggatg cctccgctgggcatctccctgccacaggtggagctgaccggctttggggaggcaggtacc ccagggcagcaggctcagagtacagtcccttcagcagagggcacagcaggctacagggtt caggtgccccaggtgaccctgtctctgcctggagcccaggttgcaggtggtgagctgctg gtgggtgagggtgtctttaagatgcccaccgtgacagtgccccagcttgagctggacgtg gggctaagccgagaggcacaggcgggcgaggcggccacaggcgagggtgggctgaggctg aagttgcccacactgggggccagagctagggtggggggcgagggtgctgaggagcagccc ccaggggccgagcgtaccttctgcctctcactgcccgacgtggagctctcgccatccggg ggcaaccatgccgagtaccaggtggcagagggggagggagaggccggacacaagctcaag gtacggctgccccggtttggcctggtgcgggccaaggagggggccgaggagggtgagaag gccaagagccccaaactcaggctgccccgagtgggcttcagccaaagtgagatggtcact ggggaagggtcccccagccccgaggaggaggaggaggaggaggaagagggcagtggggaa ggggcctcgggtcgccggggccgggtccgggtccgcttgccacgtgtaggcctggcggcc ccttctaaagcctctcgggggcaggagggcgatgcagcccccaagtcccccgtcagagag aagtcacccaagttccgcttccccagggtgtccctaagccccaaggcccggagtgggagt ggggaccaggaagagggtggattgcgggtgcggctgcccagcgtggggttttcagagaca ggggctccaggcccggccaggatggagggggctcaggctgcggctgtctga