GENSCAN 1.0 Date run: 8-Nov-116 Time: 11:56:32 Sequence gi568815588r:99614590_99832049 : 217460 bp : 41.16% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 PlyA - 1216 1211 6 1.05 1.01 Sngl - 5746 5375 372 1 0 50 54 475 0.991 34.17 1.00 Prom - 13552 13513 40 -5.45 2.00 Prom + 18804 18843 40 -1.95 2.01 Init + 19529 19740 212 1 2 47 88 160 0.074 10.20 2.02 Term + 22831 23200 370 1 1 64 35 159 0.196 1.33 2.03 PlyA + 24905 24910 6 1.05 3.02 PlyA - 25918 25913 6 1.05 3.01 Sngl - 27912 27328 585 0 0 58 39 263 0.888 14.33 3.00 Prom - 29685 29646 40 -6.15 4.02 PlyA - 29854 29849 6 1.05 4.01 Sngl - 31092 30790 303 0 0 90 48 263 0.934 18.08 4.00 Prom - 37178 37139 40 -7.75 5.00 Prom + 37474 37513 40 -8.15 5.01 Init + 44520 44999 480 2 0 47 86 231 0.013 14.25 5.02 Intr + 45273 45375 103 2 1 35 82 118 0.015 4.73 5.03 Intr + 46910 47039 130 0 1 90 113 -3 0.011 1.23 5.04 Intr + 64672 64877 206 1 2 83 79 190 0.992 15.52 5.05 Intr + 65136 65286 151 1 1 36 78 148 0.726 6.90 5.06 Intr + 71203 71306 104 1 2 118 79 68 0.999 7.80 5.07 Intr + 74105 74161 57 0 0 63 101 49 0.753 1.64 5.08 Intr + 76796 76929 134 0 2 84 80 96 0.991 7.84 5.09 Intr + 81367 81533 167 0 2 75 88 170 0.995 13.54 5.10 Intr + 83945 84269 325 2 1 52 99 276 0.995 19.95 5.11 Intr + 86384 86469 86 1 2 55 108 30 0.954 -0.60 5.12 Intr + 87923 88084 162 2 0 77 103 132 0.995 11.77 5.13 Intr + 89863 90082 220 1 1 74 75 147 0.953 9.38 5.14 Term + 97300 97476 177 0 0 27 54 136 0.035 0.80 5.15 PlyA + 98009 98014 6 1.05 6.11 PlyA - 98645 98640 6 -0.45 6.10 Term - 100129 99998 132 1 0 101 37 122 0.987 5.51 6.09 Intr - 101872 101759 114 1 0 128 75 71 0.991 9.52 6.08 Intr - 103911 103757 155 1 2 50 119 84 0.884 6.57 6.07 Intr - 106479 106398 82 0 1 75 91 63 0.873 3.59 6.06 Intr - 109534 109367 168 1 0 52 116 55 0.906 3.92 6.05 Intr - 112565 112379 187 1 1 84 110 103 0.985 10.87 6.04 Intr - 115046 114964 83 2 2 35 97 82 0.047 1.32 6.03 Intr - 117727 117483 245 2 2 82 24 149 0.086 4.19 6.02 Intr - 118160 117949 212 1 2 15 50 249 0.820 11.43 6.01 Init - 118324 118212 113 1 2 43 93 21 0.788 -2.14 6.00 Prom - 119280 119241 40 -2.05 7.00 Prom + 120819 120858 40 -8.45 7.01 Init + 121677 121728 52 2 1 20 94 69 0.322 2.07 7.02 Intr + 125121 125180 60 1 0 68 106 74 0.852 4.99 7.03 Intr + 128564 128773 210 0 0 127 86 197 0.932 21.36 7.04 Intr + 132668 132801 134 0 2 58 98 151 0.968 12.54 7.05 Intr + 148131 148199 69 2 0 92 62 93 0.324 5.56 7.06 Intr + 154314 154424 111 2 0 52 103 68 0.203 4.36 7.07 Term + 154711 154770 60 0 0 78 42 34 0.146 -5.37 7.08 PlyA + 154903 154908 6 1.05 8.00 Prom + 156625 156664 40 -3.85 8.01 Init + 156865 158375 1511 0 2 52 53 444 0.457 28.97 8.02 Intr + 162406 162445 40 1 1 100 45 5 0.181 -5.29 8.03 Intr + 163441 163509 69 0 0 87 77 28 0.186 0.06 8.04 Intr + 170019 170192 174 2 0 79 103 90 0.813 8.81 8.05 Intr + 177645 177770 126 2 0 119 86 138 0.997 16.66 8.06 Intr + 178922 179096 175 1 1 7 110 118 0.545 4.49 8.07 Intr + 179765 179879 115 0 1 21 91 56 0.540 -2.21 8.08 Intr + 182508 182742 235 0 1 91 53 176 0.938 11.17 8.09 Intr + 184618 184781 164 0 2 78 81 132 0.735 9.35 8.10 Intr + 185797 185974 178 1 1 87 121 36 0.651 5.80 8.11 Intr + 189430 189684 255 0 0 64 103 129 0.963 8.72 8.12 Intr + 190793 190858 66 1 0 112 100 63 0.996 8.08 8.13 Intr + 192795 192932 138 2 0 59 105 126 0.996 11.14 8.14 Intr + 193494 193640 147 2 0 105 96 143 0.996 16.31 8.15 Intr + 196823 197013 191 1 2 0 70 154 0.561 2.26 8.16 Intr + 198429 198646 218 0 2 81 44 229 0.344 15.02 8.17 Intr + 202026 202114 89 1 2 47 105 42 0.396 0.57 8.18 Intr + 202719 202895 177 2 0 105 75 105 0.969 10.09 8.19 Intr + 204201 204368 168 2 0 84 100 42 0.945 4.22 8.20 Intr + 204500 204680 181 1 1 69 91 129 0.909 9.82 8.21 Term + 205358 205437 80 0 2 75 55 55 0.630 -2.15 8.22 PlyA + 206008 206013 6 -0.45 9.03 PlyA - 206398 206393 6 1.05 9.02 Term - 207607 207246 362 2 2 48 45 292 0.735 14.61 9.01 Init - 208362 207630 733 0 1 54 103 379 0.780 30.98 9.00 Prom - 208526 208487 40 -15.74 10.06 PlyA - 208646 208641 6 -0.45 10.05 Term - 210372 208814 1559 1 2 31 37 475 0.921 26.26 10.04 Intr - 211190 210591 600 0 0 69 33 249 0.638 9.05 10.03 Intr - 213075 211504 1572 1 0 34 80 1088 0.714 90.66 10.02 Intr - 213339 213263 77 2 2 37 115 65 0.656 2.34 10.01 Init - 214309 214230 80 1 2 81 -35 89 0.487 -3.52 10.00 Prom - 214898 214859 40 -7.75 11.00 Prom + 215412 215451 40 -10.15 11.01 Init + 215591 215844 254 1 2 20 52 267 0.766 13.36 11.02 Intr + 216133 216262 130 1 1 65 61 99 0.849 4.68 11.03 Intr + 216962 217241 280 1 1 90 95 122 0.750 9.23 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 48505 48451 55 1 1 71 54 56 0.838 2.00 S.002 Term + 90257 90370 114 1 0 94 54 43 0.890 -0.91 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588r:99614590_99832049|GENSCAN_predicted_peptide_1|123_aa MELEGRGAGGVAGGPAAGPGRSPGESALLDGWLQRGVGRGAGGGEAGACRPPVRQDPDSG PDYEALPAGATVTTHMVAGAVAGILEHCVMYPIDCVKVRPAPGFEPDLGLTRPLGLVRSQ DPG >gi568815588r:99614590_99832049|GENSCAN_predicted_CDS_1|372_bp atggagttggaggggcggggtgctggcggtgtggcgggggggccggcggcagggcccggg cggagccccggggagtcggcgctgctggacgggtggctgcagcggggcgtgggccggggg gccggcggcggggaggccggggcctgcaggcccccggtacgacaagatccggactccggc ccggactacgaggcgctgccggctggagccactgtcaccacgcacatggtggcaggcgcc gtggcagggatcctggagcactgcgtgatgtaccccatcgactgcgtcaaggtgagacct gcacccggcttcgaacccgacctggggctgacccgacccctgggtcttgtccggagccag gatcccggctga >gi568815588r:99614590_99832049|GENSCAN_predicted_peptide_2|193_aa MPCVTKEFEQQPSALDLPSGRAYPNGRNQKTKSGNMTKQGSFTPSKNHTSSPAMDPNQEE IPDLPEKEFRSNGSPSQSNQTRERKDIQIGKEEIKLSLFADNMIVYLENPKDSSRKLLEP IKEFSKVSGYKINVHKSVALLYTNSDQVENQIRNSTLFTVAAKTKTNKQKNLGIHLTKEA KDLYKENYKTLLK >gi568815588r:99614590_99832049|GENSCAN_predicted_CDS_2|582_bp atgccctgtgtgacaaaagaatttgaacaacagccttcagccctagaccttccctctggc agagcctacccaaatggaaggaaccagaaaaccaagtctggtaatatgacaaaacaaggc tcttttacaccctcaaaaaatcacactagctcaccagcaatggatccaaaccaagaagaa atccctgatttacctgaaaaagaattcaggagtaatggaagtcccagccagagcaatcag acaagagaaagaaaggacattcaaataggtaaagaggaaatcaaactgtcactgtttgct gacaatatgatcgtttaccttgaaaaccctaaagactcctccagaaagctcctagaaccg ataaaagaattcagcaaagtttctggatacaagattaatgtacacaaatcagtagctctc ctatacaccaacagcgaccaagtggagaatcaaatcaggaactcaactctttttacagtt gctgcaaaaacaaaaacaaacaaacaaaaaaacttaggaatacacctaactaaggaggcg aaagacctctacaaggaaaactacaaaacactgctgaaataa >gi568815588r:99614590_99832049|GENSCAN_predicted_peptide_3|194_aa MIVYLENPIVSSQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQAESQIMSELPYTIAS KRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWIGRINIAKMAILPKVIY RFNAIPIKLPTTFFTELEKTTLKFIWNQKRAHIAKSILSQKNKAGGIMLPDFKLYYKATV TKTAWYWYQNRDID >gi568815588r:99614590_99832049|GENSCAN_predicted_CDS_3|585_bp atgattgtatatttagaaaaccccatcgtctcatcccaaaacctccttaagctgataagc aacttcagcaaagtgtcaggatacaaaatcaatgtgcaaaaatcacaagcattcttatac accaataacagacaagcagagagccaaatcatgagtgaactaccatacacaattgcttca aagagaataaaatacctaggaatccaacttacaagggatgtgaaggacctcttcaaggag aactacaaaccactgctcaatgaaataaaagaggacacaaacaaatggaagaacattcca tgctcatggataggaagaatcaatattgctaaaatggccatactgcccaaggtaatttat agattcaatgccatccccatcaagctaccaacgactttcttcacagaattggaaaaaact actttaaagttcatatggaaccaaaaaagagcccacattgccaagtcaatcctaagccaa aagaacaaagctggaggcatcatgctacctgacttcaaactatactacaaggctacagta accaaaacagcatggtactggtaccaaaacagagatatagactaa >gi568815588r:99614590_99832049|GENSCAN_predicted_peptide_4|100_aa MERNQSRKAENSKNQSTSSPPKESSSSPAMEQSWTENDFDELREEGFGRSVITNFSKLKE NVRTHRKVAKNLEKRLDEWLTRINSIEKTLNDLMELKTTA >gi568815588r:99614590_99832049|GENSCAN_predicted_CDS_4|303_bp atggagagaaaccagagcagaaaagctgaaaactctaaaaatcagagcacctcttctcct ccaaaggaaagcagctcctcgccagcaatggaacaaagctggacggagaatgactttgac gagttgagagaagaaggctttggacgatcggtaataacaaacttctctaagctaaaggag aatgttcgaacccaccgcaaagtagccaaaaaccttgaaaaaagattagatgaatggcta actagaataaacagcatagagaagaccttaaatgacctgatggagctgaaaaccacggca tga >gi568815588r:99614590_99832049|GENSCAN_predicted_peptide_5|833_aa MRFGREHPEPTYRSYSVPGAAVEKATSREFAPQGGKRAQGRPAPLTCELSAGPAGGASVR RLSDWARLKVPGRASEAERSRGLATLADGRAADRAGSPPVRLVESRPLRESSVSHWLPAV SLGTWRRRRPGPPSGWARGRGEQLGTEPRGAEGTGGPLDPAADVGDAWDKEATFSGQKKK KVTGVETTEGNPWLVSPFLRQRVAFLGLFFISCLLLLMLIIDFRHWSASLPRDRQYERYL ARVGELEATDTEDPNLNYGLVVDCGSSGSRIFVYFWPRHNGNPHDLLDIKQMRDRNSQPV VKKIKPGISAMADTPEHASDYLRPLLSFAAAHVPVKKHKETPLYILCTAGMRLLPERKQL AILADLVKDLPLEFDFLFSQSQAEVISGKQEGVYAWIGINFVLGRFDHEDESDAEATQEL AAGRRRTVGILDMGGASLQIAYEVPTSTSVLPAKQEEAAKILLAEFNLGCDVQHTEHVYR VYVTTFLGFGGNFARQRYEDLVLNETLNKNRLLGQKTGLSPDNPFLDPCLPVGLTDVVER NSQVLHVRGRGDWVSCGAMLSPLLARSNTSQASLNGIYQSPIDFNNSEFYGFSEFFYCTE DVLRIGGRYHGPTFAKAAQDYCGMAWSVLTQRFKNGLFSSHADEHRLKYQCFKSAWMYQV LHEGFHFPYDYPNLRTAQLVYDREVQWTLGAILYKTRFLPLRDLRQEGVRQAHGSWFRLS FVYNHYLFFACILVVLLAIFLYLLRLRRIHHRQTRASAPLDLLWLEEVVPMMGVQNCIGS MALASASGEGLTKLTIMAEGEGEPVCHMVRESKRERRKFQALLNNQLLHKQPE >gi568815588r:99614590_99832049|GENSCAN_predicted_CDS_5|2502_bp atgcgcttcggtcgcgaacaccctgaacccacctaccggagctactctgtcccaggagcg gccgtggagaaagcaaccagccgagagttcgcgccccagggagggaagcgggcacagggc cgcccagcgccactcacctgtgagctctccgcgggccctgcaggcggagcctcggtacga cgcctttccgattgggcgcggctcaaagtcccggggcgggcatcagaggccgagcgctct aggggattggccaccctggcggacggacgtgctgctgaccgagctggttcgcccccggtt cggctcgtggagagccggcccctccgtgagtcttctgtcagtcattggctccctgcggtt tccttggggacgtggcgccgccgccggccgggccctccttccggctgggcaaggggccgc ggggagcagctcgggactgaaccgagaggtgccgaaggaaccggcgggccgcttgatccc gctgcagacgtaggagatgcctgggacaaggaggccaccttctcagggcaaaagaaaaag aaggtgacaggcgttgagaccaccgaagggaacccatggctagtgagtccatttctccgt cagcgggtggcattcctgggactcttcttcatatcctgtctccttttacttatgttaatc atagactttcgacattggagtgcttcattaccacgagataggcaatacgaaaggtatttg gctcgagtaggggagcttgaagctactgacactgaagacccaaatctgaattatggactt gttgttgactgtggcagcagtggttcccggatttttgtttatttctggccaagacataat gggaacccccatgacttgctggacatcaaacagatgagagaccgcaacagccaaccagtg gttaaaaaaatcaagccaggaatctctgcaatggcagacactccagaacatgccagtgat taccttcgtcctctgctgagctttgctgctgctcatgtgcctgtgaagaagcacaaggag acccctctttacatcctctgcacagcaggcatgaggcttctccctgagaggaagcagttg gctatcttggctgacctagtgaaagatttaccactggagtttgacttcctcttttcacag tctcaagcagaagtgatctctgggaagcaggaaggggtttatgcatggattggaatcaac tttgttttgggaagattcgaccacgaggatgaatcagatgctgaggctacccaggaattg gcagcaggacggagaaggacagtagggatactggatatgggaggagcctctctccaaatt gcttatgaagttcctacctcaacctctgtccttcctgcaaagcaggaagaagctgccaag atcctgctggctgagttcaacctgggctgtgatgtgcaacacactgaacacgtgtacagg gtttatgtcacaacttttctgggtttcggaggcaactttgcccggcagcgctacgaagac cttgttctgaatgaaactcttaacaaaaacagattgcttggtcagaagacaggtctgagt cccgacaatccatttctggatccctgcctgccagtgggactcacagatgtggtggagagg aacagccaagtcttacatgtccgaggaagaggagactgggtgtcttgtggggcaatgctg agccccctgctggctcgctccaacaccagccaggcctcactcaatggcatatatcaatcg cctattgacttcaacaacagcgagttctacggcttctctgagtttttttattgtacagag gatgtgttgcgcattggtggccgctaccatgggccaacatttgccaaggctgctcaggat tactgtggcatggcttggtcggtactaactcagagattcaagaatggcctcttttcatca catgcagatgagcatcgactcaaatatcagtgttttaaatcggcttggatgtaccaagtc ttacatgaaggattccactttccctatgactacccaaacctgcggacagcccagctggtg tatgaccgagaggttcagtggacgctgggagccattctatataaaacacgattcttacca ctcagggatcttcggcaggaaggtgtccgacaagcccatggtagctggttccgtctctcc tttgtatacaaccactatctcttctttgcctgtatcctggtggtgctactggccatcttc ctataccttctgcggctacgccgaattcaccaccgacaaacacgagcctcagctccattg gacttgctgtggcttgaagaggtggtgcccatgatgggagtacagaactgtataggaagc atggcgctggcatctgcttctggtgagggcctcacgaagcttacaatcatggcagaaggt gaaggggagccagtgtgtcacatggtgagagagagcaagcgagagaggaggaagttccag gctcttttaaacaaccagctcttgcataaacaaccagagtga >gi568815588r:99614590_99832049|GENSCAN_predicted_peptide_6|496_aa MLPWHSGRTRVLEQPQARPLGSCVTLGTKPRPRRGSYSGVLAAGAGGTGILVWRRKKREA EVNQLAQGRRAVMRGPRDSNLGSTAEPFSLPAPRHALRFHTFNASKIEVSTWNLGSGAHQ LRSASTAARARARVPDTYAPAAPAKAESQRRRTSVRSGPQRRRGSRCPTRKCFSFPWRRR ETTEPWVVEEAAERVVGRWLLVCSGTVAGAVILGGVTSLNHDMTLTEFKFIWYMEYSHRM WGRLVGLVYILPAAYFWRKGWLSRGMKGRVLALCGLVCFQGLLGWYMVKSGLEEKSDSHD IPRVSQYRLAAHLGSALVLYCASLWTSLSLLLPPHKLPETHQLLQLRRFAHGTAGLVFLT ALSGAFVAGLDAGLVYNSFPKMGESWIPEDLFTFSPILRNVFENPTMVQFDHRILGITSV TAITVLYFLSRRIPLPRRTKMAAVTLLALAYTQVGLGISTLLMYVPTPLAATHQSGSLAL LTGALWLMNELRRVPK >gi568815588r:99614590_99832049|GENSCAN_predicted_CDS_6|1491_bp atgctaccctggcactcaggaagaactcgggtcttggagcagccacaagctcggccactt ggtagctgcgtgactttggggaccaagccccgtccccgccgtggcagctacagcggagta ctggctgctggtgcgggcggtactggcatcctcgtgtggcgacggaaaaaacgtgaagct gaagttaaccaacttgcacaaggtcgcagggccgttatgaggggaccccgggactcgaac cttggctccacagctgagccattctcgctacctgcccctcgtcacgccctccgtttccac acctttaacgcctcaaagatagaagtttccacttggaatttgggctccggcgcgcaccag ctaagaagcgcgtcaacagctgcgcgcgcccgtgcgcgcgtccccgacacctacgcccca gcagcccccgcgaaagcggagtcgcaacgcaggcgcacttctgttcgctccggtccccag agaaggcggggctcccgctgcccgacccggaagtgcttctcttttccttggcggaggagg gagaccacagagccctgggttgtggaagaggctgctgagcgggtggtgggccgatggctc ctggtctgcagtggaacagtggctggagcagttattcttggtggagtaactagcttgaat catgatatgacactgacagaattcaagttcatctggtacatggagtactcacaccgaatg tggggtcgccttgtaggccttgtgtacatcctgcctgctgcctacttttggagaaagggc tggctcagccgtggcatgaaaggacgtgttcttgccctctgtggcctcgtctgcttccag ggtctgttgggatggtatatggtgaaaagtggactagaagaaaaatcagactcccatgac atccctcgggtcagtcagtaccgccttgctgcccacctgggatcagccctggttctttat tgtgccagcttgtggacctcactgtcactgctactccctccgcacaagttgcctgaaacc caccaactcctacagttgagacgatttgctcatggaacagcaggtctggtgttccttacg gccctctcaggggcttttgtggcagggctagatgctgggcttgtttataactcctttccc aaaatgggagaatcctggatcccggaggacctctttaccttctcccccatcctgaggaat gtttttgagaatcccaccatggtgcagtttgatcaccggattctgggaatcacttcagtc actgccattacagtgctctacttcctctctcggagaattccccttcctagaaggaccaag atggcagcagtgactctgctggctttggcgtatacacaggtgggcttgggcatcagcacg ctgctgatgtatgtcccaactcctctggccgccactcaccagtcaggctccttggctttg ctcactggtgctctttggctgatgaatgaactccgaagagtcccaaaatga >gi568815588r:99614590_99832049|GENSCAN_predicted_peptide_7|231_aa MEVCVDSVESAVNAERGGADRIELCSGLSEGGTTPSMGVLQVVKQSVQIPVFVMIRPRGG DFLYSDREIEVMKADIRLAKLYGADGLVFGALTEDGHIDKELCMSLMAFDMVHDPMAALE TLLTLGFERVLTSGCDSSALEGLPLIKRLIEQVDNVEIELEYTELVSAAEVIACLSWEHN TNRAVQNPTKDFSWSCQERDTLFARVTGHEDHEGSSESKLQPTCSKWTGSY >gi568815588r:99614590_99832049|GENSCAN_predicted_CDS_7|696_bp atggaagtttgtgttgattcagtggaatcagctgtgaatgcagaaagaggaggtgctgat cggattgaattatgttctggtttatcagaggggggaactacacccagcatgggtgtcctt caagtagtgaagcagagtgttcagatcccagtttttgtgatgattcggccacggggaggt gattttttgtattcagatcgtgaaattgaggtgatgaaggctgacattcgtcttgccaag ctttatggtgctgatggtttggtttttggggcattgactgaagatggacacattgacaaa gagctgtgtatgtcccttatggcctttgacatggttcatgatccaatggcagctctggag accctcttaaccttgggatttgaacgcgtgttgaccagtggatgtgacagttcagcatta gaagggctacccctaataaagcgactcattgagcaggtagataatgttgagattgaatta gagtacaccgagctggtgtctgctgcagaagtgattgcttgcttgagctgggaacacaac acaaacagggctgttcagaatcctacgaaagacttttcctggagctgtcaggaaagagac actctttttgctagggttactggtcatgaggaccatgaaggaagctcagagtcaaaattg cagcccacatgtagcaagtggacaggatcctattga >gi568815588r:99614590_99832049|GENSCAN_predicted_peptide_8|1498_aa MIKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQEKEI KGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTN NRQTESQIMSELPFTIASKRIKYLGIQLTRVVKDLFKENYKPLLKEIKEDTNKWKNIPCS WVGRINIVKMAILPKVIYRFNTIPIKLPMTFFTELEKTTLKFIWNQKRARIAKSILSQKN KAGGITLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEIMPHIYNYLIFDKPEKNKQ WGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGITI QDIGVGKDFMSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTTWEKIFATYSS DKGLISRIYNELKQIYKKKTNNPIKKWAKDMNRHFSKEDIYAAKKHMKKCSSSLAIREMQ IKTTMRYHLTPVRMAIIKKSGNNSFDFSRMPSSWNHTQGSPAPGPQTGAGPRPVRAAQQV NSSFLDSPEADLPLCFEQTVLVWIPLGYLWLLAPWQLLHVYKSRTKRSSTTKLYLAKQVF VGFLLILAAIELALVLTEDSGQATVPAVRYTNPSLYLGTWWQYSSEHHVNFSPAPGFADP IQQTMVCTEKLLVPVPILDSLDTLWHFPISDSDPDTLTESHEVPVSNWFTFRFFCVFQNP SSIASFLSSITYSWYDSIILKGYKRPLTLEDVWEVDEEMKTKTLVSKFETHMKRELQKAR RALQRRQEKSSQQNSGARLPGLNKNQSQSQDALVLEDVEKKKKKSGTKKDVPKSWLMKAL FKTFYMVLLKSFLLKLVNDIFTFVSPQLLKLLISFASDRDTYLWIGYLCAILLFTAALIQ SFCLQCYFQLCFKLGVKVRTAIMASVYKKALTLSNLARKEYTVGETVNLMSVDAQKLMDV TNFMHMLWSSVLQIVLSIFFLWRELGPSVLAGVGVMVLVIPINAILSTKSKTIQVKNMKN KDKRLKIMNEILSGIKILKYFAWEPSFRDQVQNLRKKELKNLLAFSQLQCVVIFVFQLTP VLVSVVTFSVYVLVDSNNILDAQKAFTSITLFNILRFPLSMLPMMISSMLQRRSQHLAET ILGADGESGERHVRADSHVGTYIGLKEGFFSTDKAMQFSEASFTWEHDSEATVRDVNLDI MAGQLVAVIGPVGSGKSSLISAMLGEMENVHGHITIKVRGNANAKASDSRMSAASCFCLT GEKALRANVIYTMGPQPRGPQTGTVPWPVGNQATQEKGTTAYVPQQSWIQNGTIKDNILF GTEFNEKRYQQVLEACALLPDLEMLPGGDLAEIGEKGINLSGGQKQRISLARATYQNLDI YLLDDPLSAVDAHVGKHIFNKVLGPNGLLKGKTRLLVTHSMHFLPQVDEIVVLGNGTIVE KGSYSALLAKKGEFAKNLKTFLRHTGPEEEATGGCHQQEFRSRKPGQKAVEAFQKSVR >gi568815588r:99614590_99832049|GENSCAN_predicted_CDS_8|4497_bp atgataaaaataataagagctatctatgacaaacccacagccaatatcatactgaatggg caaaaactggaagcattccctttgaaaactggcacaagacagggatgccctctctcaccg ctcctattcaacatagtgttggaagttctggccagggcaatcaggcaggagaaggaaata aagggtattcaattaggaaaagaggaagtcaaattgtccctgtttgcagacgacatgatt gtttatctagaaaaccccatcgtctcagcccaaaatctccttaagctgataagcaacttc agcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattcttatacaccaac aacagacaaacagagagccaaatcatgagtgaactcccattcacaattgcttcaaagaga ataaaatacctaggaatccaacttacaagggttgtgaaggacctcttcaaggagaactac aaaccactgctcaaggaaataaaagaggacacaaacaaatggaagaacattccatgctca tgggtaggaagaatcaatatcgtgaaaatggccatactgcccaaggtaatttacagattc aataccatccccatcaagctaccaatgactttcttcacagaattggaaaaaactacttta aagttcatatggaaccaaaaaagagcccgcatcgccaagtcaatcctaagccaaaagaac aaagctggaggcatcacactacctgacttcaaactatactacaaggctacagtaaccaaa acagcatggtactggtaccaaaacagagatatagatcaatggaacagaacagagccctca gaaataatgccacatatctacaactatctgatctttgacaaacctgagaaaaacaagcaa tggggaaaggattccctatttaataaatggtgctgggaaaactggctagccatatgtaga aagctgaaactggatcccttccttacaccttatacaaaaatcaattcaagatggattaaa gatttaaacgttagacctaaaaccataaaaaccctagaagaaaacctaggcattaccatt caggacataggcgtgggcaaggacttcatgtccaaaacaccaaaagcaatggcaacaaaa gccaaaattgacaaatgggatctaattaaactcaagagcttctgcacagcaaaagaaact accatcagagtgaacaggcaacctacaacatgggagaaaattttcgcaacctactcatct gacaaagggctaatatccagaatctacaatgaactcaaacaaatttacaagaaaaaaaca aacaaccccatcaaaaagtgggcgaaggacatgaacagacacttctcaaaagaagacatt tatgcagccaaaaaacacatgaagaaatgctcatcatcactggccatcagagaaatgcaa atcaaaaccactatgagatatcatctcacaccagttagaatggcaatcattaaaaagtca ggaaacaacagttttgacttttccagaatgccgtctagttggaatcatacacaggggtcc ccagcacctgggccacaaactggtgctggtccaaggcctgttagggctgcacagcaggtg aattcctcattcctggacagtccggaggcagacctgccactttgttttgagcaaactgtt ctggtgtggattcccttgggctacctatggctcctggccccctggcagcttctccacgtg tataaatccaggaccaagagatcctctaccaccaaactctatcttgctaagcaggtattc gttggttttcttcttattctagcagccatagagctggcccttgtactcacagaagactct ggacaagccacagtccctgctgttcgatataccaatccaagcctctacctaggcacatgg tggcagtattcttcagaacatcatgtgaatttctctccagctcctggttttgctgatcca atacagcagacaatggtgtgtacagaaaaactcctggttcctgtccctattctggattct ctcgatactctgtggcactttccaatttcagactctgatccggacactcttacagagtcc catgaagttcctgtctccaattggtttacatttcgatttttttgtgtctttcagaatcca tcatccatagcttcattcctgagtagcattacctacagctggtatgacagcatcattctg aaaggctacaagcgtcctctgacactcgaggatgtctgggaagttgatgaagagatgaaa accaagacattagtgagcaagtttgaaacgcacatgaagagagagctgcagaaagccagg cgggcactccagagacggcaggagaagagctcccagcagaactctggagccaggctgcct ggcttgaacaagaatcagagtcaaagccaagatgcccttgtcctggaagatgttgaaaag aaaaaaaagaagtctgggaccaaaaaagatgttccaaaatcctggttgatgaaggctctg ttcaaaactttctacatggtgctcctgaaatcattcctactgaagctagtgaatgacatc ttcacgtttgtgagtcctcagctgctgaaattgctgatctcctttgcaagtgaccgtgac acatatttgtggattggatatctctgtgcaatcctcttattcactgcggctctcattcag tctttctgccttcagtgttatttccaactgtgcttcaagctgggtgtaaaagtacggaca gctatcatggcttctgtatataagaaggcattgaccctatccaacttggccaggaaggag tacaccgttggagaaacagtgaacctgatgtctgtggatgcccagaagctcatggatgtg accaacttcatgcacatgctgtggtcaagtgttctacagattgtcttatctatcttcttc ctatggagagagttgggaccctcagtcttagcaggtgttggggtgatggtgcttgtaatc ccaattaatgcgatactgtccaccaagagtaagaccattcaggtcaaaaatatgaagaat aaagacaaacgtttaaagatcatgaatgagattcttagtggaatcaagatcctgaaatat tttgcctgggaaccttcattcagagaccaagtacaaaacctccggaagaaagagctcaag aacctgctggcctttagtcaactacagtgtgtagtaatattcgtcttccagttaactcca gtcctggtatctgtggtcacattttctgtttatgtcctggtggatagcaacaatattttg gatgcacaaaaggccttcacctccattaccctcttcaatatcctgcgctttcccctgagc atgcttcccatgatgatctcctccatgctccagcgtaggagccagcacttagcagaaaca atcctaggagctgatggagaaagcggagagagacacgtgagggcagacagtcacgtgggg acctacattggactaaaagagggctttttctcaacagacaaagccatgcagttttctgag gcctcctttacctgggaacatgattcggaagccacagtccgagatgtgaacctggacatt atggcaggccaacttgtggctgtgataggccctgtcggctctgggaaatcctccttgata tcagccatgctgggagaaatggaaaatgtccacgggcacatcaccatcaaggtgagaggg aatgccaatgcaaaagcctctgactcccgaatgtcagcagcttcgtgcttttgcctcact ggggagaaggcactgagggctaatgtcatctataccatgggtccccaaccccgtgggcct cagactggtactgttccatggcctgttgggaaccaggctacacaggagaagggcaccact gcctatgtcccacagcagtcctggattcagaatggcaccataaaggacaacatccttttt ggaacagagtttaatgaaaagaggtaccagcaagtactggaggcctgtgctctcctccca gacttggaaatgctgcctggaggagatttggctgagattggagagaagggtataaatctt agtgggggtcagaagcagcggatcagcctggccagagctacctaccaaaatttagacatc tatcttctagatgaccccctgtctgcagtggatgctcatgtaggaaaacatatttttaat aaggtcttgggccccaatggcctgttgaaaggcaagactcgactcttggttacacatagc atgcactttcttcctcaagtggatgagattgtagttctggggaatggaacaattgtagag aaaggatcctacagtgctctcctggccaaaaaaggagagtttgctaagaatctgaagaca tttctaagacatacaggccctgaagaggaagccacaggtggttgtcaccagcaggagttc aggtcgagaaagccagggcagaaggctgtagaggcattccagaaatcagtcagatga >gi568815588r:99614590_99832049|GENSCAN_predicted_peptide_9|364_aa MGVPEKIKTDNGPGYCSKAFQKFLSQWKISHTTGMPYNSQGQAIVERTNRTLKTQLVKQK EAGDSKECTTPQMQLNLALYTLNFLNIYRNQTTTSAEQHLTGKKNSPHEGKLIWWKDNKN KTWEIGKVITWGRGFACVSPGENQLPVWIRTRHLKFYNEPIGDAKKSTSAETETPQSSTV DSQDEQNGDVRRTDEVAIHQEGRAANLGTTKEADAVSYKISREHKGDTNPREYAACSLDD CINGGAAAANYTYWAYVPFPPLIRAVTWMDNPIEVYVNDSVWVPGPTDDRCPAKPEEEGM MINISIGYRYPPICLGRAPGCLMPAVQNWLVEVPTVSPISRFTYHMSAGVPNSSEETATI ENGP >gi568815588r:99614590_99832049|GENSCAN_predicted_CDS_9|1095_bp atgggagttccggaaaaaatcaaaactgacaatggaccaggatattgtagtaaagctttc caaaaattcttaagtcagtggaaaatttcacatacaacaggaatgccttataattcccaa ggacaggccatagttgaaagaactaatagaacactcaaaactcaattagttaaacaaaaa gaagcaggagacagtaaggagtgtaccactcctcagatgcaacttaatctagcactctat actttaaattttttaaacatttatagaaatcagactactacttctgcagaacaacatctt actggtaaaaagaacagcccacatgaaggaaaactaatttggtggaaagataataaaaat aagacatgggaaatagggaaggtgataacgtgggggagaggttttgcttgtgtttcacca ggagaaaatcagcttcctgtttggatacgcactagacatttgaagttctacaatgaaccc atcggagatgcaaagaaaagcacctccgcggagacggagacaccacagtcgagcaccgtt gactcacaagatgaacaaaatggtgacgtcagaagaacagatgaagttgccatccaccaa gaaggcagagccgccaacttgggcacaactaaagaagctgacgcagttagctacaaaata tctagagaacacaaaggtgacacaaaccccagagagtatgctgcttgcagccttgatgat tgtatcaacggtggagcagctgcagctaactatacctactgggcctatgtgcctttcccg cccttaattcgggcagtcacatggatggataatcctatagaagtatatgttaatgatagc gtatgggtacctggccccacagatgatcgctgccctgccaaacctgaggaagaagggatg atgataaatatttccattgggtatcgttatcctcctatttgcctagggagagcaccagga tgtttaatgcctgcagtccaaaattggttggtagaagtacctactgtcagtcccatcagt agattcacttatcacatgtctgcaggtgtacccaacagctccgaagagacagcgaccatc gagaacgggccatga >gi568815588r:99614590_99832049|GENSCAN_predicted_peptide_10|1295_aa MEEGSNLSLPADYVALYKQEVKAKAEMVKVRSSVVIEHKSTRDPEYIYSQPYEQGTLDLK DWKRIGKELKQAGRKGNIIPLIVWNDWAIIKAALEPFQTEEDSVSVPDAPRSCIVDCNEK TRKKSQKEMESLHCEYVAEPVMAQSTQNVDYNQLQEVIYPETLKLEGKGPELVGPSESKP RGPSPLPAGQVPVTLQPQTQVKENKTQPPVAYQYWPPAELQYRPPPESQYGYPGMPPAPQ GRAPYPQPPTRRLNPTAPPSRQGSELHEIIDKSRKEGDTEAWQFPVTLEPMPPGEGAQEG EPLTVEARYKSFSIKMLKDMKEGVKQYGPKSPYMRTLLDSIAHGHRLIPYDWEILAKSSL SPSQFLQFKTWWIDGVQEQVRRNRAANPPVNIDADQLLGISQNWSTISQQALMQNEAIEQ VRAICLRAWEKIQDPGSACPSFNTVRQGSKEPYPDFVARLQDVAQKSIANEKARKVIVEL MAYENANPECQSAIKPLKGKVPAGSDVISEYVKACDGIRGAMHKAMLMAQAITGVVLGGQ VRTFGGKCYNCGQIGHLKKNCPVSNKQNITIQATTTGEPPQKIPTEVYGPLPERTVGLIL GRSSLNLKGVQIHTGVVDSDYKGEIQLVISSSIPWSASPGDRIAELLLLPYIKGGNSEIK RTGGFGSTDPTGKAAYWASQVSENRPVCKAIIQGKQFEGLVDTGADVSIIALNRWPKNWP KQKAVTGFVGIGTASEVYQSTEILHCLGPDNQESTVQPMITSIPLNLWGRDLLQQWEKPV WVNQWPLPKQKLEALHLLANEQLEKGHIEPSFSPWNSPVFVIQKKSGKWRMLTDLRAVNA VIQPMGPLQPGLPSLAMIPKDWPLIIIDLKDCFFTIPLAEQDCEKFAFTIPAINNKEPAT RFQWKVLPQGMLNSPTICQTFVGRALQPVRDKFSDCYIIHYIDDILCAAETKDKLIDCYT FLQAEVANAGLAIASDKIQTSTPFHYLGMQIENRKIKPQKIKIRKDTLKTLNYFQKLLGD INWIRPTLGIPTYAMSNLFSILRGDSDVNSKRMLTPEATKEIKLVEEKIQSAQINRIDPL APLQLLIFATAHSPTGIIIQNTDLVEWSFLPHSTVKTFTLYLDQIATLIGQTRLRIIKLC GNDPDKIVVPLTKEQVRQAFINSGAWQIGLANFVGIIDNHYPKTKIFQFLKLTTWILPKI TRREPLENALTVFTDGSSNGKAAYTGPKERVIKTPHQSAQRAELVAVITVLQDFDQPINI ISDSAYVVQATRDVETALIKYSMDDQLNQLFNLLQ >gi568815588r:99614590_99832049|GENSCAN_predicted_CDS_10|3888_bp atggaagagggctctaatctttcacttccggctgactatgtggctctgtataagcaggaa gtgaaggctaaggcagagatggtgaaggtacgctcgagcgtggtcattgagcacaagtcg acgagagatcccgagtacatatacagtcagccgtacgaacaaggaactttagatctaaaa gactggaaaagaattggtaaggaactaaaacaagcaggtaggaagggtaatatcattcca cttatagtatggaatgattgggccattattaaagcagctttagaaccatttcaaacagaa gaagatagcgtttcagttcctgacgcccctagaagctgtatagtagattgtaatgaaaag acaaggaaaaaatcccagaaagaaatggaaagtttacattgcgaatatgtagcagagccg gtaatggctcagtcaacgcaaaatgttgactataatcaattacaggaggtgatatatcct gaaacgttaaaattagaaggaaaaggtccagaattagtggggccatcagagtctaagcca cgagggccaagtcctcttccagcaggtcaggtgcccgtaacactacaacctcaaacgcag gttaaagaaaataagacccaaccgccagtagcttatcaatactggccgccagccgaactt cagtatcggccacccccagaaagtcagtatggatatccaggaatgcccccagcaccacag ggcagggcgccataccctcagccgcccactaggagacttaatcctacggcaccacctagt agacagggtagtgaattacatgaaattattgataaatcaagaaaggaaggagatactgag gcgtggcaattcccagtaacgttagaaccgatgccacctggagaaggagcccaagaggga gagcctctcacagttgaggccagatacaagtctttttcgataaaaatgctaaaagatatg aaagagggagtaaaacagtatggacccaaatccccttatatgaggacattattagattcc attgctcatggacatagactcattccttatgattgggagattctggcaaaatcgtctctc tcaccctctcaatttttacaatttaagacttggtggattgatggggtacaagaacaggtc cgaagaaatagggctgccaatcctccagttaacatagatgcagatcaactattaggaata agtcaaaattggagtactattagtcaacaagcattaatgcaaaatgaggccattgagcaa gttagagctatctgccttagagcctgggaaaaaatccaagacccaggaagcgcctgcccc tcatttaatacagtaagacaaggttcgaaagagccctaccctgattttgtggcaaggctc caagatgttgctcaaaagtcaattgccaatgaaaaagcccgtaaggtcatagtggagttg atggcatatgaaaacgccaatcctgagtgtcaatcagccattaagccattaaaaggaaag gttcccgcaggatcagatgtaatctcagaatatgtaaaagcctgtgatggaatcagagga gctatgcataaagctatgcttatggctcaagcaataacaggagttgttttaggaggacaa gttagaacatttggaggaaaatgttacaattgtggtcaaattggtcacttaaaaaagaat tgcccagtctcaaataaacagaatataactattcaagcaactacaacaggggagccccca caaaaaatccccacagaggtatatggcccactgcctgagaggactgtaggactaatcttg ggaagatcaagtctaaatctaaaaggagttcaaattcatactggtgtggttgattcagac tataaaggcgaaattcagttggttattagctcttcaattccttggagtgccagtccagga gacaggattgctgaattattactcctgccatatattaagggtggaaatagtgaaataaaa agaacaggagggtttggaagcactgatccgacaggaaaggctgcatattgggcaagtcag gtctcagagaacagacctgtgtgtaaggccattattcaaggaaaacagtttgaagggttg gtagacactggagcagatgtctctatcattgctttaaatcggtggccaaaaaattggcct aaacaaaaggctgttacaggatttgtcggcataggcacagcctcagaagtgtatcaaagt actgagattttacattgcttagggccagataatcaagaaagtactgttcagccaatgatt acttcaattcctcttaatctgtggggtcgagatttattacaacaatgggaaaaaccggtg tgggtaaatcagtggccgctaccaaaacaaaaactggaggctttacatttattagcaaat gaacagttagaaaagggtcacattgagccttcgttctcgccttggaattctcctgtgttt gtaattcagaagaaatcaggcaaatggcgtatgttaactgacttaagggccgtaaacgcc gtaattcaacccatggggcctctccaacccgggttgccctctctggccatgatcccaaaa gactggcctttaattataattgatctaaaggattgcttttttaccatccctctggcggag caggattgcgaaaaatttgcctttactataccagccataaataataaagaaccagccacc aggtttcagtggaaagtgttacctcagggaatgcttaatagtccaactatttgtcagact tttgtaggtcgagctcttcaaccagttagagacaagttttcagactgttatattattcat tatattgatgatattttatgtgctgcagaaacaaaagataaattaattgactgttataca tttctgcaagcagaggttgccaatgcaggactggcaatagcatctgataagatccaaacc tctactccttttcattatttagggatgcagatagaaaatagaaaaattaagccacaaaaa ataaaaataagaaaagacacattaaaaacactaaattattttcaaaaattgctgggagat attaattggattcggccaactctaggcattcctacttatgccatgtcaaatttgttctct atcttaagaggagactcagacgtaaatagtaaaagaatgttaaccccagaggcaacaaaa gaaattaaattagtggaagaaaaaattcagtcagcgcaaataaatagaatagatccctta gccccactccaacttttgatttttgccactgcacattctccaacaggcatcattattcaa aatactgatcttgtggagtggtcattccttcctcacagtacagttaagacttttacattg tacttggatcaaatagctacattaattggtcagacaagattacgaataataaaattatgt ggaaatgacccagacaaaatagttgtccctttaaccaaggaacaagttagacaagccttt atcaattctggtgcatggcagattggtcttgctaattttgtgggaattattgataatcat tacccaaaaacaaagatcttccagttcttaaaattgactacttggattctacctaaaatt accagacgtgaacctttagaaaatgctctaacagtatttactgatggttccagcaatgga aaagcagcttacacagggccgaaggaacgagtaatcaaaactccacatcaatcggctcaa agagcagagttggttgcagtcattacagtgttacaagattttgaccaacctatcaatatt atatcagattctgcatatgtagtacaggctacaagggatgttgagacagctctaattaaa tatagcatggatgatcagttaaaccagctattcaatttattacaataa >gi568815588r:99614590_99832049|GENSCAN_predicted_peptide_11|222_aa MLYVHLGSLAETSKIRGGFSLLVHRTDRDLCSSFPNLYCVSLVHDGSEEEDDDYGLISSV EEIPEDAASITMRRENSFRRTLSRRSNGRHLKSLRNSLKTRNVNSLKEDEELVKGQKLIK KEFIETGKVIPYQKPLGSSTNIEVGTLQVKFSIYLEYLQAIGLFSIFFIILAFVMNSVAF IGSNLWLSAWTSDSKIFNSTDYPASQRDMRVGVYGALGLAQX >gi568815588r:99614590_99832049|GENSCAN_predicted_CDS_11|666_bp atgctgtatgtacatctgggatcccttgctgaaaccagcaagatcagaggaggcttctct ctccttgttcataggactgacagggatctatgcagctctttccctaacctctactgtgtc tccctagtccatgatggcagtgaagaagaagacgatgactatgggctgatatccagtgtg gaagagatccccgaagatgcagcctccataaccatgagaagagagaacagctttcgtcga acacttagccgcaggtccaatggcaggcatctgaagtccctgagaaactccttgaaaact cggaatgtgaatagcctgaaggaagacgaagaactagtgaaaggacaaaaactaattaag aaggaattcatagaaactggaaaggtgattccttatcaaaagccattagggagttctact aatattgaggtggggactttgcaggtgaagttctccatctacctggagtacctacaagca ataggattgttttcgatattcttcatcatccttgcgtttgtgatgaattctgtggctttt attggatccaacctctggctcagtgcttggaccagtgactctaaaatcttcaatagcacc gactatccagcatctcagagggacatgagagttggagtctacggagctctgggattagcc caagnn