GENSCAN 1.0    Date run: 8-Nov-116    Time: 17:00:00

Sequence gi568815595r:47753162_48098860 : 345699 bp : 44.88% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +   9062   9174  113  0  2  118   48    56 0.193   3.32
 1.02 PlyA +   9454   9459    6                               1.05

 2.04 PlyA -  10019  10014    6                               1.05
 2.03 Term -  19404  19339   66  0  0   67   36    56 0.697  -3.76
 2.02 Intr -  19775  19656  120  2  0   79   95   149 0.998  15.39
 2.01 Init -  28636  28442  195  1  0  106   98   298 0.998  29.53
 2.00 Prom -  46180  46141   40                              -7.46

 3.02 PlyA -  47170  47165    6                               1.05
 3.01 Sngl -  50272  50066  207  1  0   74   42   198 0.914   6.99
 3.00 Prom -  56688  56649   40                              -4.26

 4.00 Prom +  66506  66545   40                              -5.96
 4.01 Init +  71879  72086  208  1  1  103   89   212 0.713  19.85
 4.02 Intr +  74186  74316  131  0  2   70   88    73 0.948   5.91
 4.03 Intr +  75863  75973  111  1  0  111   55   145 0.761  14.08
 4.04 Intr +  87716  88017  302  1  2  105   87   341 0.945  31.23
 4.05 Intr +  88456  88576  121  1  1   59  113   111 0.999  11.20
 4.06 Intr +  89945  90094  150  1  0   83  113   143 0.996  16.66
 4.07 Intr +  92539  92691  153  0  0   87   66   167 0.999  14.67
 4.08 Intr +  93004  93840  837  0  0   86   78  1292 0.999 119.59
 4.09 Intr +  94112  94187   76  1  1  100   94    97 0.700  10.59
 4.10 Intr +  94271  94375  105  0  0   77   94   107 0.999  10.39
 4.11 Intr +  94620  94795  176  1  2   75   33   294 0.995  22.26
 4.12 Intr +  95019  95225  207  2  0   81   62   202 0.951  16.17
 4.13 Intr +  95308  95389   82  0  1   51   88   122 0.997   7.71
 4.14 Intr +  95463  95656  194  1  2  -39   86   279 0.993  14.21
 4.15 Intr +  95759  95918  160  1  1  131   73   241 0.999  26.46
 4.16 Intr +  96031  96188  158  2  2   90   53   323 0.915  28.73
 4.17 Intr +  96290  96393  104  1  2  138   94    98 0.993  14.37
 4.18 Intr +  96469  96608  140  1  2   77   52   264 0.998  21.81
 4.19 Term +  96706  96959  254  2  2   97   52   499 0.998  43.00
 4.20 PlyA +  97007  97012    6                               1.05

 5.13 PlyA -  97558  97553    6                               1.05
 5.12 Term -  99878  99871    8  0  2  113   39     0 0.408  -4.37
 5.11 Intr - 100191 100002  190  0  1   98  108   114 0.851  13.56
 5.10 Intr - 101267 101136  132  2  0  114   52    22 0.799   2.04
 5.09 Intr - 102199 102087  113  2  2   91   69   111 0.848   9.60
 5.08 Intr - 104351 104270   82  2  1  130  105   115 0.992  16.61
 5.07 Intr - 114177 114085   93  0  0   86  101    61 0.984   7.36
 5.06 Intr - 116166 116053  114  0  0   89   87    83 0.994   8.94
 5.05 Intr - 117944 117652  293  0  2  117   63   216 0.969  18.85
 5.04 Intr - 118939 118756  184  1  1  102  105   146 0.973  17.06
 5.03 Intr - 124362 124256  107  1  2  100  100    80 0.968  10.33
 5.02 Intr - 131633 131466  168  0  0   81  111    -8 0.101   0.72
 5.01 Init - 139330 137863 1468  1  1   73   80   911 0.021  80.62
 5.00 Prom - 143808 143769   40                              -0.96

 6.11 PlyA - 143978 143973    6                               1.05
 6.10 Term - 149839 149760   80  2  2   82   45    39 0.297  -3.07
 6.09 Intr - 159086 155877 3210  0  0    8   64  1098 0.006  86.78
 6.08 Intr - 161778 161656  123  1  0   62  111    82 0.293   8.46
 6.07 Intr - 164013 162790 1224  1  0   43   98  1071 0.064  90.90
 6.06 Intr - 165590 165558   33  0  0   85   97    21 0.027   0.89
 6.05 Intr - 168717 168604  114  1  0   82   68    83 0.091   6.12
 6.04 Intr - 175189 175067  123  2  0   87  119   108 0.681  14.36
 6.03 Intr - 224772 224704   69  1  0   75  111    23 0.053   2.45
 6.02 Intr - 234272 234222   51  0  0  144  111   -39 0.067   2.88
 6.01 Init - 245699 245477  223  2  1   70  115   305 0.652  30.12
 6.00 Prom - 250919 250880   40                              -3.96

 7.00 Prom + 252969 253008   40                              -4.06
 7.01 Init + 253197 253262   66  2  0   61   91     8 0.466  -0.53
 7.02 Intr + 253860 254015  156  2  0   75   42    78 0.625   2.11
 7.03 Term + 255090 255341  252  2  0  114   38   161 0.817   9.34
 7.04 PlyA + 257814 257819    6                               1.05

 8.00 Prom + 288943 288982   40                              -0.96
 8.01 Init + 303032 303098   67  1  1   97   59    63 0.665   3.54
 8.02 Intr + 335477 335678  202  0  1  106   75    58 0.958   4.64
 8.03 Intr + 335906 336109  204  2  0   79   39   190 0.939  11.52
 8.04 Term + 338877 338985  109  0  1   70   38   109 0.944   2.08
 8.05 PlyA + 339812 339817    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init - 106223 106170   54  2  0  100   65    40 0.880   4.18
S.002 Sngl - 139330 137801 1530  1  0   73   48   940 0.977  83.89
S.003 Term - 168717 168572  146  1  2   82   50   108 0.895   4.47

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595r:47753162_48098860|GENSCAN_predicted_peptide_1|37_aa
XATGTPQEIIIKPVIKSNSNCKPLLINFTFQTVYSIK

>gi568815595r:47753162_48098860|GENSCAN_predicted_CDS_1|114_bp
natgctacaggtactccccaggaaataattataaaaccagtgatcaagtccaactccaac
tgtaagccccttcttattaacttcacgttccaaacggtttattcaatcaaatga

>gi568815595r:47753162_48098860|GENSCAN_predicted_peptide_2|126_aa
MAAAAGGGGPGTAVGATGSGIAAAAAGLAVYRRKDGGPATKFWESPETVSQLDSVRVWLG
KHYKKYVHADAPTNKTLAGLVVQLLQFQEDAFGKHVTNPAFTKLPFPILSPYRVFSMLSP
DAALMP

>gi568815595r:47753162_48098860|GENSCAN_predicted_CDS_2|381_bp
atggccgcagcggcgggcggcggcgggccggggacagcggtaggcgccacgggctcgggg
attgcggcggcagccgcaggcctagctgtttatcgacggaaggatgggggcccggccacc
aagttttgggagagcccggagacggtgtcccagctggattcggtgcgggtctggctgggc
aagcactacaagaagtatgttcatgcggatgctcctaccaataaaacactggctgggctg
gtggtgcagcttcttcagttccaggaagatgcctttgggaagcatgtcaccaacccggcc
ttcaccaaactccctttccctatcctctcgccgtaccgagtcttctccatgctttctcct
gatgcagctcttatgccataa

>gi568815595r:47753162_48098860|GENSCAN_predicted_peptide_3|68_aa
MATAGRGLAAHGSAGPASRAGLGCPACSSGPPASPPHSPPGWRSDRAPSSRRPPPAAAGS
SRGHERGE

>gi568815595r:47753162_48098860|GENSCAN_predicted_CDS_3|207_bp
atggcaacggcaggccgaggcctggctgcccacggatcggccgggccggcttcccgggca
ggcctaggatgcccggcctgctcctcgggcccgccggcgtctccgccccactctccgcca
ggctggcggtcggaccgggccccctcctcacgacgaccacccccggccgcggcaggctcc
tcaagagggcacgaaaggggagaataa

>gi568815595r:47753162_48098860|GENSCAN_predicted_peptide_4|1222_aa
MAAARRLMALAAGISPRLQPLGPRAAGRQGRSRGFSSSCAHPDHTKEAAEAESGMAPGGP
GEGDGSLVNASRDLLKEFPQPKNLLNSVIGRALGISHAKDKLVYVHTNGPKKKKVTLHIK
WPKSVEVEGYGSKKIDAERQAAAAACQLFKGWGLLGPRNELFDAAKYRVLADRFGSPADS
WWRPEPTMPPTSWRQLNPESIRPGGPGGLSRSLGREEEEDEEEELEEGTIDVTDFLSMTQ
QDSHAPLRDSRGSSFEMTDDDSAIRALTQFPLPKNLLAKVIQIATSSSTAKNLMQFHTVG TKTKLSTLTLLWPCPMTFVAKGRRKAEAENKAAALACKKLKSLGLVDRNNEPLTHAMYNL ASLRELGETQRRPCTIQVPEPILRKIETFLNHYPVESSWIAPELRLQSDDILPLGKDSGP LSDPITGKPYVPLLEAEEVRLSQSLLELWRRRGPVWQEAPQLPVDPHRDTILNAIEQHPV VVISGDTGCGKTTRIPQLLLERYVTEGRGARCNVIITQPRRISAVSVAQRVSHELGPSLR RNVGFQVRLESKPPSRGGALLFCTVGILLRKLQSNPSLEGVSHVIVDEVHERDVNTDFLL ILLKGLQRLNPALRLVLMSATGDNERFSRYFGGCPVIKVPGFMYPVKEHYLEDILAKLGK HQYLHRHRHHESEDECALDLDLVTDLVLHIDARGEPGGILCFLPGWQEIKGVQQRLQEAL GMHESKYLILPVHSNIPMMDQKAIFQQPPVGVRKIVLATNIAETSITINDIVHVVDSGLH KEERYDLKTKVSCLETVWVSRANVIQRRGRAGRCQSGFAYHLFPRSRLEKMVPFQVPEIL RTPLENLVLQAKIHMPEKTAVEFLSKAVDSPNIKAVDEAVILLQEIGVLDQREYLTTLGQ RLAHISTDPRLAKAIVLAAIFRCLHPLLVVVSCLTRDPFSSSLQNRAEVDKVKALLSHDS GSDHLAFVRAVAGWEEVLRWQDRSSRENYLEENLLYAPSLRFIHGLIKQFSENIYEAFLV GKPSDCTLASAQCNEYSEEEELVKGVLMAGLYPNLIQVRQGKVTRQGKFKPNSVTYRTKS GNILLHKSTINREATRLRSRWLTYFMAVKSNGSVFVRDSSQVHPLAVLLLTDGDVHIRDD GRRATISLSDSDLLRLEGDSRTVRLLKELRRALGRMVERSLRSELAALPPSVQEEHGQLL ALLAELLRGPCGSFDVRKTADD >gi568815595r:47753162_48098860|GENSCAN_predicted_CDS_4|3669_bp atggcggccgctaggagactcatggcgctggccgccggcatctctccgcgcctgcagccg ctgggtccccgcgctgctgggcgacagggtcgctcgcgcggcttctcttcaagctgcgcc caccccgaccacaccaaggaagccgccgaggccgagtcagggatggcccccggcgggcct ggggaaggcgacggaagcttggtgaacgcttctagggacctattaaaagagttcccacag cccaaaaatcttctcaacagtgtgattggaagagccctcggcatctcacatgcaaaagac aaactagtctacgtgcacacaaatggaccgaagaaaaagaaagtcacactgcacataaaa tggcccaagagcgtggaggtagaaggctatggcagcaagaagatcgatgctgagcggcag gctgcagctgcagcctgccagctgttcaagggttggggtctgctaggtccccggaatgag ttgtttgacgcagccaaataccgagtgctagctgatcgctttggctcccctgccgacagc tggtggcgtccggaacccaccatgccccctacttcctggcggcagctgaatccagagagt attcgaccagggggacctgggggcctatcccgctctttaggccgggaagaagaggaggac gaggaggaagagctagaagaagggaccatagatgttaccgacttcttgtccatgacccag caggattcccacgctccactcagggactcaagggggagttcctttgagatgacagatgac gacagtgccattagggctctgacccagtttccacttcccaagaaccttctggccaaggtg attcagattgcaacgtcatcctccacagctaagaacctcatgcagttccatactgtgggc 
accaagaccaagctgtctacactcaccctgctctggccctgccccatgacctttgttgcc aaagggcgccgcaaagcagaggctgagaataaggcggcagccttggcctgcaagaaactg aagagcctgggcctggtggacaggaacaacgaaccgcttacacacgccatgtataacctg gcctctttgcgtgagctgggtgagacccagcgccgaccatgcaccatccaggtgcccgag cccatcctccgcaagatagagaccttcctgaaccattaccctgtggagagttcatggatc gccccagaactccggctgcagagtgatgacatcttgcccttgggcaaggactcagggcct ctgagtgaccctatcacaggcaagccctatgtgcccctgttggaagcagaggaggtacgt ctcagccagagtctgctagaactgtggcggcggcgagggccggtctggcaggaggccccc cagctacctgtggacccacatcgggacaccatcctcaacgccattgagcagcacccggtg gtggtcatctctggggacacgggctgtgggaagaccacgcgcatcccccagctgttgctg gagcgctatgtgaccgagggccgaggtgcccgctgcaatgttatcatcacccaacctcgc cgcatctctgctgtgtctgtggcacagcgggtcagccacgaactgggcccctccctgcgc cggaatgtgggcttccaggtgcggttggaaagtaagcccccatcccgaggcggggccctg ctcttctgcactgtgggtatcctgctgcgtaagctgcagagcaaccccagcctggagggc gtgagccacgtcatcgtggatgaggtgcatgagcgggacgtgaacacagactttctgctg atcctgctcaagggcctgcagcggctcaacccggccctgcggctggtgctcatgagtgcc acaggggacaatgagcgcttctcccgatactttggtggctgccccgtcatcaaggtgcct ggcttcatgtacccagtcaaggagcactacctagaggacatcctggccaagttgggcaag caccagtacctgcaccggcaccggcaccatgagtctgaggatgaatgcgcactcgatttg gaccttgtgactgatctggttctgcacatcgatgctcgcggggaaccaggtgggatcctg tgcttcctgcctgggtggcaggagatcaaaggagtgcagcagcgcctccaggaggccctg ggcatgcacgagagcaagtacctcatcctgccagtgcactccaacatccccatgatggat cagaaggccatattccagcagcctccagttggggtgcgcaagattgtcttggccaccaac attgctgagacttccatcacaatcaatgacatcgtgcatgtggtggacagtgggctgcac aaggaagaacgctatgacctgaagaccaaggtgtcctgcctggagacagtgtgggtatca agagccaatgtgatccagcgccggggccgggcgggccgctgccagtccggctttgcctac cacttgttccctcgaagccggctggagaaaatggtccctttccaagtgccagagatcctg cgcacacctcttgagaacctggtgctgcaagcgaaaatccacatgcctgagaagacggcg gtggagttcctgtccaaggctgtggacagtccaaacatcaaggcagtggacgaggctgtg atcttgctccaggagatcggggtgctggaccagcgggagtacctgactaccctggggcag cgcctggctcacatctccaccgacccccggttggccaaggccattgtgttggctgccatc ttccgttgcctgcacccactactggtggtcgtttcctgcctcacccgggaccccttcagc 
agcagcctacagaaccgggcagaggtggacaaggtgaaagcactgttgagccatgacagc ggcagtgaccacctggcctttgtgcgggctgtcgccggctgggaggaggtgctgcgttgg caggaccgcagctcccgggagaattacctggaggaaaacctgctgtacgcacccagcctg cgcttcatccacggactcatcaagcagttctcagagaacatttatgaggccttcctggtg gggaagccctcggactgcaccctggcctccgcccagtgcaacgagtacagtgaggaggag gagctggtgaagggcgtgctgatggccggcctctaccccaacctcatccaggtgaggcag ggcaaggtcacccggcaggggaagttcaagcccaacagcgtcacatataggaccaaatca ggcaacatcctgctgcacaagtcgaccattaacagggaggccacacggttacggagccga tggctgacgtatttcatggcagtcaagtccaatggcagcgtcttcgtccgggactcctct caggtgcacccgctagctgtgctgctgctgaccgacggggacgtgcacatccgtgatgac gggcgccgggccaccatctcactgagcgacagtgacctgctgcggctggagggtgactcg cgtaccgtgcggctgctgaaggagctgcggcgggccctgggccgcatggtggagcggagc ctgcgcagcgagctggctgcacttccccccagcgtacaggaggagcacgggcagctgctt gcgctactggcagagctgctgcgaggaccctgtggcagctttgatgtgcgcaagacagct gacgactga >gi568815595r:47753162_48098860|GENSCAN_predicted_peptide_5|983_aa MSLSDKQTASLTAAYGQLSKGKPAECRMDSPKEISQAGFEWQRTEGKLNEIGLNVSMDGQ PKDGLVKNASFLEQNKLCFFEGKLDKELSIEMQDKDCQEASGHLESRYVISETCHPLEGN SVHQKTSEFHLGLIEGPDKNKTIPVQGKVAGKNGLETKSQSDLDFPGAADIPTRYVKEQE TSVWNPSFHPVAQGSLGSREATPGEMENSITPGCPVIGVVNDNSEQLKCESPLLVSLAHP APIIEHSPTTIPPITMVFTQEHLNASCHIRDHDKELEKLSSTEEAVLNQAPQQKKAVRRA LSECSHLSVPPAVNLADKYPELPAREEPSSGLLPPPSSPMPSPTPGKLGAPAMKRSMTVG EEQTASYKLSPGKLPILSTKEIPPFICEEPVAKKREELAHFSNSSSNSGKKELGTAGLYL HSKLEQIPEGSSKEKGQEDFSETRIDSCSQVCQRGEKQPGQTALAGKKEIEVTATQSTPS FLFEKPPRDARESPRGRCFLLPATPAEIKQEVCLGPRQPESSPEEALYSMFPASLSPYVL TLGDTGIARPEEGRPVVSGTGNDITTPPNKELPPSPEKKTKPIADAKAPEKRASPSKPAS APASRSGSKSTQTVAKTTTAAAVASTGPSSRSPSTLLPKKPTADLSRPKSTSTSSMKKTT TLSGTAPAAGVVPSRVKATPMPSRPSTTPFIDKKPTSAKPSSTTPRLSRLATNTSAPDLK NVRSKVGSTENIKHQPGGGRAKVEKKTEAAATTRKPESNAVTKTAGPIASAQKQPAGKVQ IVSKKVSYSHIQSKCGSKDNIKHVPGGGNVQIQNKKVDISKVSSKCGSKANIKHKPGGGD VKIESQKLNFKEKAQAKVGSLDNVGHLPAGGAVKCQPLPSPSPVPSSPDALTEALISFQE HLGGSLLPQALSSCPPYLTEGGGSEAPLCPGPPAGEEPAISEAAPEAGAPTSASGLNGHP TLSGGGDQREAQTLDSQIQETSI >gi568815595r:47753162_48098860|GENSCAN_predicted_CDS_5|2952_bp 
atgtcgctctcagacaagcagacagcctctctcactgccgcgtacggtcagctcagtaag ggcaagcctgcagagtgccgaatggactccccaaaagaaatcagtcaagccggattcgaa tggcagaggacagagggcaaactgaatgaaattgggctgaatgtcagcatggacgggcaa ccaaaagatgggcttgtgaagaatgccagcttcctggagcagaacaagctctgctttttt gaggggaagctagacaaagagctgagcattgaaatgcaggacaaggactgtcaagaagcc tcaggtcaccttgagagcaggtatgtgatttcagagacctgccatcccttggaggggaac tcggtacaccagaagacctccgagttccatctgggactcatagaggggccagacaaaaac aaaaccattccagttcaggggaaggtggcagggaagaatggactagagaccaagagccag tcagatctggatttccctggggctgctgacatccctaccagatatgttaaggagcaggaa accagtgtttggaaccccagctttcatccagtggctcaaggctctctgggctcaagggaa gcaactccgggagagatggagaatagcatcacccctggctgcccagtgattggggtggta aatgataactctgagcagctgaagtgtgagtccccactcctggtgtctctagcccaccca gcccccattattgagcattcacccaccaccattccgccaatcactatggtgttcacccag gaacatttgaatgcaagctgtcacatcagagaccatgataaggagttggagaaattgagt tctaccgaggaggctgtgctcaaccaagccccccagcagaaaaaggcagtgcgcagggcc ctgtctgaatgttctcacctctcagttcccccagctgtcaaccttgcagataagtaccct gaactccctgcccgagaagagccttcttctggcctgctgcctccccctagtagcccaatg cctagtcctacacctgggaaactgggagctcctgctatgaagcgctccatgactgtgggt gaggaacagacagctagctacaaattgagccctgggaaactgcccatcttgtctactaaa gagatacctcctttcatctgtgaggaaccagtggccaagaagagagaagaattggctcac ttcagcaacagcagcagcaactctgggaagaaggaactcggcactgctggattatatctc catagtaagctggagcagattcctgaaggaagcagcaaggaaaaagggcaggaagatttt agtgaaactagaattgattcatgctcgcaggtttgccagcgaggagagaaacagccagga cagacggctctggcagggaagaaagaaattgaggtcactgcaacccagagcactccatcg ttcctgtttgaaaagcccccacgtgatgcaagagaaagcccccgaggtaggtgtttcctg ctcccagcaaccccagcagagatcaaacaggaggtctgcctgggtcccaggcagccagag agcagtccagaggaagctctctattctatgtttccagcatcactcagcccctatgtgctg acactgggtgacacaggaatagccaggccagaagaaggaaggcctgtggtgagtgggaca ggaaatgacatcaccaccccaccgaacaaggagctcccaccaagcccagagaagaaaaca aagcccattgcagatgcaaaggctcctgagaagcgggcctcaccatccaagccagcttct gccccagcctccagatctgggtccaagagcactcagactgttgcaaaaaccacaacagct gctgctgttgcctcaactggcccaagcagtaggagcccctccacgctcctgcccaagaag 
cccactgctgacttgagtcgcccaaagagcacctccaccagttccatgaagaaaaccacc actctcagtgggacagcccccgctgcaggggtggttcccagccgagtcaaggccacaccc atgccctcccggccctccacaactcctttcatagacaagaagcccacctcggccaaaccc agctccaccaccccccggctcagccgcctggccaccaatacttctgctcctgatctgaag aatgtccgctccaaggttggctccacggaaaacatcaagcatcagcctggaggaggccgg gccaaagtagagaaaaaaacagaggcagctgctacaacccgaaagcctgaatctaatgca gtcactaaaacagccggcccaattgcaagtgcacagaaacaacctgcggggaaagtccag atagtctccaaaaaagtgagctacagccatattcagtccaagtgtggttccaaggacaat attaagcatgtccctggaggtggtaatgttcagattcagaacaagaaagtggacatctct aaggtctcctccaagtgtgggtctaaggctaacatcaagcacaagcctggtggaggagat gtcaagattgaaagtcagaagttgaacttcaaggagaaggcccaggccaaggtgggatcc ctcgataatgtgggccacctacctgcaggaggtgctgtgaagtgccagccactgcccagc ccgtcccctgtcccatcctctcctgatgctctgactgaagccttgatcagcttccaggaa catctgggggggtccttgctgccacaggctctcagttcgtgccccccatacctcactgag ggcggtggcagcgaggctcctctgtgtccgggtccccctgctggggaggagccggccatc tctgaggcagcgcctgaagctggcgcccccacttcagccagtggcctcaatggccacccc accctgtcagggggtggtgaccaaagggaggcccagaccttggacagccagatccaggag acaagcatctaa >gi568815595r:47753162_48098860|GENSCAN_predicted_peptide_6|1749_aa MADLSLADALTEPSPDIEGEIKRDFIATLEAEAFDDVVGETVGKTDYIPLLDVDEKTGNS ESKKKPCSETSQIEDLPLPPHPASLSFHLPLDTPSSKPTLLANGGHGVEGSDTTGSPTEF LEEKMAYQEYPNSQNWPEDTNFCFQPEQVVDPIQTDPFKMYHDDDLADLVFPSSATADTS IFAGQNDPLKDSYEAVAEPPQPTAVPLELAKEIEMASEERPPAQALEIMMGLKTTDMAPS KETEMALAKDMALATKTEVALAKDMESPTKLDVTLAKDMQPSMESDMALVKDMELPTEKE VALVKDVRWPTETDVSSAKNVVLPTETEVAPAKDVTLLKETERASPIKMDLAPSKDMGPP KENKKETERASPIKMDLAPSKDMGPPKENKIVPAKDLVLLSEIEVAQANDIISSTEISSA EKVALSSETEVALARDMTLPPETNVILTKDKALPLEAEVAPVKDMAQLPETEIAPAKDVA PSTVKEVGLLKDMSPLSETEMALGKDVTPPPETEVVLIKNVCLPPEMEVALTEDQVPALK TEAPLAKDGVLTLANNVTPAKDVPPLSETEATPVPIKDMEIAQTQKGISEDSHLESLQDV GQSAAPTFMISPETVTGTGKKCSLPAEEDSVLEKLGERKPCNSQPSELSSETSGWVSGSS SCGGPGNQRKSIHVDSLEPQRDLGREAWDIESTPIMMKKKKKKPKQKRYSQPRAGGPSDD DNADKPKGHPFAADTQKSGVLPSQPTTMGTEYGLVSGENLKRECLVNSSAARLVAENFVS ESLRIPLYPSEEAPKTAISSQSKLRVEEESKSNKSVLQNQDKKLLKQHEYKPQPAPHLKT 
PVDKSQSVGPLNLKGPLAEVSAYNVETPLDIRLKEGCSPFLDQEVMGVVSKPTAAKEIPN LVPTLIASNPLECNLKEGNNESKMTKLQNVKLKEFPEGAEEDKELKKEAFPNERQEISIF TSEQLQGQVLVQVPGVENEPFKRMAGDGKSRKGRGSSGKMRTDSGKVKAKSELPFLLDSQ KDGRAVLIPSEPVSKTEGMTTQDKSEELGLNSSKQPGTKADLTEAVVMGEPKEMTQPKVA GTMQALIPLESGSGMTQTSGVSTETGDVVKDMGVNNQSKEGRCPWKDHEAAPWISEKPKK RGNEGKSKKFKNNYSTQPARMERKEEILNPPFEGKDGDTGSIPHKSKEIGFTFPKMHDSS FSHTPDTPTVEAVDRKGGNFQVNFVELGTLGENKISTVKASTVTEPPAKVTDVSCQEQIQ GAGFVPSVVSEENKTDAANRYTAVADKPSKRSNDGKSKKVKNSSPEKHILENKIDATKIH VPMETTGDQGIEGMAYMDENRNITFTCPRTPSELINKSSPLEVLESAACEKLPTPTPQVV KEGDSFPDTLAKNGQEIAPAQISKSLMVDNYTKDGVPGQERPKGPSAVVPSTSTGGVALP ITTAIETVNIHGDHSLKNKAELADSMKNEAGIDEGHVIGESESVHSGASKHSVEKVTELA KGHLLPGVPVEDQSLPGEARALEGYADRGNFPAHPVNEEKETKEGSVAVQIPDLLEDKAQ KLSFCEDQNAQDRNSKGSDSLNKKVDLTLLSPKSENDKLKEISLACKITELESVSLPTPE IQSDFLHSKVEAPPSEVADTLVIMTASKGVRLPEPKDKILETPQKMTEKSESKTPGEGKK EDKSRMAEPMKGYMRPTKSRGLTPLLPKSTIQEQERHKQLKSAVCLSSSTVYQQLGMSVY GEKLSWCET >gi568815595r:47753162_48098860|GENSCAN_predicted_CDS_6|5250_bp atggctgacctcagtcttgcagatgcattaacagaaccatctccagacattgagggagag ataaagcgggacttcattgccacactagaggcagaggcctttgatgatgttgtgggagaa actgttggaaaaacagactatattcctctcctggatgttgatgagaaaaccgggaactca gagtcaaagaagaaaccgtgctcagaaactagccagattgaagacctcccactaccccca caccctgccagcctttcttttcatcttcctcttgatactccatcttctaaaccaacactc ctagccaatggtggtcatggagtagaagggagcgatactacagggtctccaactgaattc cttgaagagaaaatggcctaccaggaatacccaaatagccagaactggccagaagatacc aacttttgtttccaacctgagcaagtggtcgatcctatccagactgatccctttaagatg taccatgatgatgacctggcagatttggtctttccctccagtgcgacagctgatacttca atatttgcaggacaaaatgatcccttgaaagacagttacgaggctgttgcagaacctcct cagccaacggcagttcccttagagctagccaaggagatagaaatggcatcagaagagagg ccaccagcacaagcattggaaataatgatgggactgaagactactgacatggcaccatct aaagaaacagagatggccctcgccaaggacatggcactagctacaaaaaccgaggtggca ttggctaaagatatggaatcacccaccaaattagatgtgacactggccaaggacatgcag ccatccatggaatcagatatggccctagtcaaggacatggaactacccacagaaaaagaa gtggccctggttaaggatgtcagatggcccacagaaacagatgtatcttcagccaagaat 
gtggtactgcccacagaaacagaggtagccccagccaaggatgtgacactgttgaaagaa acagagagggcatctcctataaaaatggacttagccccttccaaggacatgggaccaccc aaagaaaacaagaaagaaacagagagggcatctcctataaaaatggacttggctccttcc aaggacatgggaccacccaaagaaaacaagatagtcccagccaaggatttggtattactc tcagaaatagaggtggcacaggctaatgacattatatcatccacagaaatatcctctgct gagaaggtggctttgtcctcagaaacagaggtagccctggccagggacatgacactgccc ccggaaaccaacgtgatcttgaccaaggataaagcactacctttagaagcagaggtggcc ccagtcaaggacatggctcaactcccagaaacagaaatagccccggccaaggatgtggct ccgtccacagtaaaagaagtgggcttgttgaaggacatgtctccactatcagaaacagaa atggctctgggcaaggatgtgactccacctccagaaacagaagtagttctcatcaagaac gtatgtctgcctccagaaatggaggtggccctgactgaggatcaggtcccagccctcaaa acagaagcacccctggctaaggatggggttctgaccctggccaacaatgtgactccagcc aaagatgttccaccactctcagaaacagaggcaacaccagttccaattaaagacatggaa attgcacaaacacaaaaaggaataagtgaggattcccatttagaatctctgcaggatgtg gggcagtcagctgcacctactttcatgatttcaccagaaaccgtcacaggaacggggaaa aagtgcagcttgccggccgaggaggattctgtgttagaaaaactaggggaaaggaaacca tgcaacagtcaaccttctgagctttcttcagagacctcaggttgggtctctggttcatcc tcctgtggtgggcctgggaaccaaagaaaaagtattcatgttgactcacttgaaccccag agggatcttggcagggaggcctgggatatagaaagcacaccaataatgatgaagaaaaag aagaagaaaccaaagcaaaagagatattctcaaccacgggctggaggaccttcggatgat gacaatgcagataagcctaaaggtcatccatttgcagctgacacacaaaaatcaggtgtt ctccccagccagcctaccactatgggtacagaatatggacttgtatctggagaaaacttg aaaagggaatgtttagttaactccagtgcagccagactggtagctgagaactttgtctca gagagtctcagaattcctttatatccttctgaagaagcccccaaaactgcaataagttct cagtctaagctgagagtagaggaagagagcaaaagtaacaagtcagtactacaaaaccaa gacaagaaattgctgaagcaacatgaatacaaaccacagcctgcaccccacctgaagact cctgtagataaaagccagtcagtaggccctctcaatctgaaaggacccctagcagaagtt tctgcatacaatgtagaaacccctttggatatcagacttaaagagggttgctctcctttc ttggaccaagaggttatgggtgtagtttcaaaacccacagcagcaaaagaaataccaaat ttggtacccactttgatagcaagtaatccattagaatgtaatctaaaagaagggaataat gaaagtaaaatgactaaactgcagaatgtcaaactgaaggagtttcctgaaggagctgaa gaggataaagaactaaaaaaggaagcttttcccaacgaaagacaagagatcagcatcttt 
acttctgagcagctgcagggccaagtgttggtacaggtccctggggtagagaatgaacca tttaagagaatggcaggtgatggcaaaagcaggaagggaaggggaagttctgggaaaatg agaacagattctgggaaggtaaaagcaaaatctgagctgccatttcttctggacagccag aaggacggaagggctgttctcataccgagtgagccagtctctaaaactgaaggaatgact actcaggataagagtgaggagctggggctgaattcttcaaagcaaccaggcactaaggct gatctcacggaggcagtggtgatgggggagcctaaagagatgactcagcctaaggtggca ggcaccatgcaggcattgattcctttggaaagtggatcaggcatgactcagacttctggt gttagcacagaaacaggagatgtagtcaaagatatgggtgtcaataaccagagcaaggaa ggaaggtgtccatggaaggatcatgaggcagctccctggatttctgaaaagcctaaaaag agaggcaatgaaggcaaaagcaaaaagtttaaaaataattattccacacagcctgctaga atggagaggaaggaagaaatccttaacccaccttttgaagggaaggatggagatactggt agtattccccataaaagcaaggaaataggatttactttccccaaaatgcatgattcttcg ttctcacatacaccagatacacccacagtggaagcagttgacaggaagggtggaaatttt caggttaattttgttgagcttgggactcttggggaaaacaagataagcacagtcaaggct tctactgttactgaaccacctgccaaggtgacagatgtgagctgccaagagcaaatccag ggggcaggatttgttccttcagtagtatctgaggagaataagacagatgcagccaataga tacactgcagtggctgacaaaccaagtaaaaggagtaatgatggaaaaagtaaaaaggtt aaaaatagttctcctgagaagcacattctggagaataagatagatgcaacaaaaatacat gttcccatggaaaccacaggggaccagggaattgaaggaatggcctatatggacgaaaat agaaatattacatttacctgtcccagaacaccatcagagctgataaataaatcatctcct ctagaggttctggaatcagcagcctgtgaaaaactgcccactcctactcctcaagtagta aaggaaggtgattcctttccagataccttggcaaaaaatgggcaagagatagccccagcc cagatttccaaatcattaatggtagataactacaccaaagatggagtcccaggtcaagaa agacccaagggtccctctgctgttgtgccctctacaagcacaggaggagttgctctacct attacaacagccatagaaacagttaacattcatggagatcactctcttaagaataaagct gagcttgctgattccatgaaaaatgaagcagggatcgatgaagggcatgtgataggagaa tctgagtcagtgcacagtggtgcgtctaagcattcagtagagaaagtcacagagctagca aaaggtcacctccttcctggagtgccagtagaagaccagagcctaccaggagaggccaga gccctagaaggatatgcagatagaggtaatttcccagcacatccagtgaatgaagagaaa gagactaaagaagggtctgttgcagttcagattcctgacttactggaagacaaagcacaa aagctcagtttttgtgaggaccaaaatgctcaagatagaaattccaaaggttcagatagt ttgaataagaaggtagatctgactcttttgtctccaaaaagtgaaaatgataaattgaaa 
gaaattagtctggcttgtaaaatcacggaattggaaagcgtttccttgccaacaccagaa atccagtcagatttcttacatagcaaagtcgaagctcctccttcagaggtggcggatacg ttagtaataatgactgcttccaagggtgttcgactcccagaacccaaagataagattttg gagacacctcagaaaatgacagaaaaatctgaatcaaagacaccaggagaagggaaaaag gaagataaaagcagaatggcagaaccaatgaaaggctacatgagacccaccaagtcccga ggacttactccacttttgccaaagtctacaatccaggaacaagagagacataagcaactg aagtccgctgtttgcttgagctcatcaactgtctaccagcagctgggaatgtcagtttat ggtgagaaactctcttggtgtgaaacttga >gi568815595r:47753162_48098860|GENSCAN_predicted_peptide_7|157_aa MTPGDLKHHCGLPVKLGAHGSQGYINSPALCHNLIWRDLDRFSLPQDITLVHYIDDIMLT GSSDQEIANTLDLLPALMASWGVPYDQLTEEKKIRACFTDGSASYADNTRKWRAAALQPL SRTSLKDNNEGKSSQWAELRAVHLVVHFEWKEKWPDV >gi568815595r:47753162_48098860|GENSCAN_predicted_CDS_7|474_bp atgactccaggggacttaaaacatcactgtggtcttccagttaaactaggggctcatgga agtcaggggtatatcaactctccggctttgtgtcataatcttatttggagagatcttgat cgcttttcgcttccgcaagatatcacactggtccattacattgatgacattatgctgact ggatccagtgatcaagaaatagcaaacacactggacttattgcctgcactgatggcctca tggggagttccctatgatcagctgacagaggaaaagaagattagggcctgcttcacagac ggttctgcatcatatgcagacaacacccgaaagtggagagctgcagcactacaacccctt tctaggacatccctgaaggacaacaatgaagggaaatcttcccagtgggcagaacttcga gcagtgcacctggttgtgcactttgaatggaaggagaaatggccagatgtgtga >gi568815595r:47753162_48098860|GENSCAN_predicted_peptide_8|193_aa MPSGREVGGSAPRLASRAVREGGRRPPPFHASGGPRGVPSCRRARLGQPRDSEEAAPPIG PRLRLHLPAAAETELGQKGARRAPGRAVRRAARAAGGWGEVRLGALASRLRPANAEADRR PIDCLRCSMMAVALSQSLGARSEESARVGAHADRATSGGKEHSLQFQGAQDEKHRTTYSG HTAYGIALLHKQQ >gi568815595r:47753162_48098860|GENSCAN_predicted_CDS_8|582_bp atgccgtccgggagggaggtgggggggtcagccccccgcctggccagccgtgccgtccgg gagggaggccggcggccaccgcccttccacgcctcgggcgggccccgtggggtacccagc tgccgcagggcaaggctcgggcagccccgggacagtgaggaggccgcccctccaatcgga ccccgcctgcggctgcacctacccgccgccgccgagacggagctggggcagaagggagcc aggagagcgccgggaagagccgtgaggagagctgcccgagccgcgggcggttggggcgag gtgcgcctgggggctctagcctcccgcctgcggcctgcgaatgccgaagcggaccgcagg ccgatcgactgccttcgctgttctatgatggccgtcgctctctctcagagtctgggcgcc 
cggagcgaggagagtgcgagagtcggcgcacacgcggaccgggccaccagcggtggaaaa gaacattccctccagttccagggtgcccaggatgaaaagcatagaaccacctattctggg catactgcctatgggattgccctgctccacaagcagcagtaa