GENSCAN 1.0 Date run: 8-Nov-116 Time: 11:53:54 Sequence gi568815587r:47140119_47348789 : 208671 bp : 50.16% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 15058 15143 86 1 2 150 109 141 0.952 21.86 1.02 Intr + 16905 17100 196 1 1 65 94 331 0.989 29.87 1.03 Intr + 17695 17804 110 1 2 93 123 88 0.992 12.73 1.04 Intr + 20907 21072 166 1 1 33 87 204 0.814 13.82 1.05 Term + 21346 21520 175 1 1 106 55 111 0.936 6.83 1.06 PlyA + 22104 22109 6 1.05 2.16 PlyA - 24226 24221 6 1.05 2.15 Term - 25384 25364 21 1 0 122 42 57 0.933 2.81 2.14 Intr - 26268 26150 119 1 2 110 94 130 0.997 15.98 2.13 Intr - 26481 26388 94 0 1 74 91 78 0.992 6.24 2.12 Intr - 26768 26642 127 1 1 56 107 57 0.995 5.08 2.11 Intr - 27925 27791 135 0 0 108 75 92 0.999 9.58 2.10 Intr - 28133 28005 129 1 0 76 78 123 0.992 9.91 2.09 Intr - 31439 31308 132 1 0 80 100 175 0.983 17.66 2.08 Intr - 31682 31546 137 2 2 50 103 197 0.997 16.77 2.07 Intr - 32215 32163 53 2 2 110 61 85 0.974 6.63 2.06 Intr - 33722 33641 82 2 1 69 94 -3 0.738 -2.39 2.05 Intr - 34980 34897 84 0 0 91 89 52 0.954 5.62 2.04 Intr - 35195 35064 132 2 0 84 95 93 0.995 10.44 2.03 Intr - 35805 35733 73 2 1 27 45 67 0.815 -4.29 2.02 Intr - 36516 36398 119 0 2 90 96 123 0.995 12.56 2.01 Init - 36735 36664 72 0 0 86 100 61 0.981 8.17 2.00 Prom - 37425 37386 40 -10.45 3.10 PlyA - 37446 37441 6 -1.95 3.09 Term - 37928 37813 116 0 2 91 48 198 0.980 14.83 3.08 Intr - 38369 38248 122 1 2 113 80 188 0.999 20.64 3.07 Intr - 38912 38776 137 2 2 86 76 76 0.937 5.57 3.06 Intr - 39161 39023 139 1 1 35 23 106 0.938 -0.73 3.05 Intr - 39468 39260 209 0 2 81 19 343 0.783 24.58 3.04 Intr - 40223 40068 156 2 0 12 87 216 0.996 14.11 3.03 Intr - 40572 40337 236 1 2 106 78 388 0.964 37.01 3.02 Intr - 42441 42285 157 0 1 74 84 261 0.998 23.88 3.01 Init - 46431 46231 201 0 0 97 101 105 0.636 9.58 3.00 Prom - 47142 47103 40 -2.46 4.00 Prom + 74669 74708 40 -5.86 4.01 Init + 75019 75145 127 0 1 94 41 111 0.915 7.32 4.02 Intr + 76218 76354 137 1 2 53 110 -10 0.722 -2.01 4.03 Intr + 76740 76931 192 2 0 112 72 152 0.995 15.69 4.04 Intr + 92696 92841 146 1 2 77 89 104 0.986 8.48 4.05 Intr + 94455 94554 100 0 1 43 92 162 0.568 12.21 4.06 Intr + 94639 94816 178 0 1 67 64 128 0.878 7.79 4.07 Intr + 95152 95294 143 2 2 39 113 116 0.523 9.27 4.08 Intr + 97719 97883 165 2 0 94 85 229 0.999 23.36 4.09 Intr + 98020 98065 46 0 1 115 92 46 0.596 5.68 4.10 Term + 98682 98731 50 1 2 86 54 24 0.411 -3.73 4.11 PlyA + 99080 99085 6 1.05 5.12 PlyA - 99265 99260 6 -4.04 5.11 Term - 100131 99998 134 1 2 75 48 206 0.948 13.55 5.10 Intr - 102780 102605 176 2 2 79 99 166 0.999 16.38 5.09 Intr - 103099 102900 200 1 2 60 92 237 0.989 19.45 5.08 Intr - 103203 103121 83 1 2 59 101 47 0.995 2.46 5.07 Intr - 104749 104617 133 1 1 40 84 160 0.841 11.02 5.06 Intr - 105276 105187 90 0 0 46 83 109 0.988 6.39 5.05 Intr - 105454 105356 99 1 0 106 73 144 0.999 15.01 5.04 Intr - 105716 105564 153 2 0 82 90 205 0.999 20.37 5.03 Intr - 107609 107523 87 2 0 96 94 96 0.996 11.17 5.02 Intr - 108015 107920 96 0 0 64 98 89 0.920 7.71 5.01 Init - 108671 108558 114 2 0 84 23 230 0.877 14.31 5.00 Prom - 119956 119917 40 -6.06 6.00 Prom + 120095 120134 40 -3.46 6.01 Init + 120332 120557 226 1 1 76 65 316 0.489 26.73 6.02 Intr + 121123 121331 209 2 2 96 32 240 0.994 18.00 6.03 Intr + 121429 121900 472 0 1 64 94 359 0.674 26.75 6.04 Intr + 127795 127908 114 2 0 132 64 207 0.984 23.12 6.05 Intr + 128143 128237 95 2 2 78 84 83 0.966 6.58 6.06 Term + 128432 128578 147 1 0 108 42 139 0.992 9.20 6.07 PlyA + 128707 128712 6 -1.95 7.00 Prom + 131651 131690 40 -8.76 7.01 Init + 133797 133858 62 2 2 104 91 23 0.993 5.22 7.02 Intr + 134445 135041 597 0 0 106 87 615 0.981 54.86 7.03 Intr + 135781 136084 304 1 1 109 100 151 0.954 14.99 7.04 Intr + 136614 136745 132 2 0 40 109 120 0.986 10.14 7.05 Intr + 138047 138160 114 1 0 99 72 146 0.999 14.74 7.06 Intr + 138881 138961 81 1 0 81 96 68 0.977 6.73 7.07 Intr + 141457 141635 179 0 2 93 110 154 0.980 16.82 7.08 Intr + 142263 142498 236 0 2 84 79 220 0.983 18.03 7.09 Intr + 142695 142851 157 1 1 67 71 219 0.998 17.17 7.10 Intr + 144060 144163 104 0 2 59 100 53 0.960 3.42 7.11 Intr + 144257 144447 191 0 2 61 84 262 0.991 22.40 7.12 Intr + 144823 145076 254 0 2 70 89 232 0.998 17.73 7.13 Intr + 145333 145472 140 1 2 53 100 190 0.994 16.81 7.14 Intr + 146315 146416 102 0 0 62 88 91 0.895 6.65 7.15 Intr + 149273 149375 103 0 1 112 58 15 0.731 0.13 7.16 Intr + 149749 149935 187 1 1 -2 49 417 0.809 28.49 7.17 Intr + 150031 150181 151 0 1 62 97 145 0.980 12.54 7.18 Intr + 150492 150698 207 1 0 103 84 77 0.815 7.85 7.19 Intr + 153726 153865 140 1 2 29 72 111 0.277 3.78 7.20 Intr + 155378 155488 111 1 0 108 60 99 0.928 9.68 7.21 Intr + 155779 155937 159 0 0 157 52 165 0.997 19.98 7.22 Intr + 158226 158342 117 2 0 62 -7 122 0.489 0.76 7.23 Intr + 168473 168581 109 1 1 123 66 81 0.942 9.26 7.24 Intr + 169163 169283 121 0 1 58 80 70 0.969 2.75 7.25 Intr + 169389 169494 106 0 1 126 90 168 0.985 21.02 7.26 Intr + 171614 171724 111 1 0 64 107 189 0.979 18.98 7.27 Intr + 175102 175209 108 0 0 59 92 29 0.613 0.88 7.28 Intr + 183553 183717 165 0 0 91 107 224 0.996 24.76 7.29 Intr + 184147 184219 73 0 1 104 93 109 0.999 11.98 7.30 Intr + 184353 184459 107 1 2 102 96 130 0.972 15.13 7.31 Intr + 186620 186689 70 1 1 138 19 53 0.622 2.15 7.32 Intr + 188540 188618 79 0 1 69 74 104 0.877 5.71 7.33 Term + 188928 189117 190 0 1 111 55 55 0.454 1.42 7.34 PlyA + 189894 189899 6 1.05 8.26 PlyA - 191319 191314 6 1.05 8.25 Term - 191763 191753 11 1 2 95 47 0 0.670 -5.04 8.24 Intr - 192140 191954 187 2 1 74 78 211 0.717 17.96 8.23 Intr - 192584 192448 137 0 2 63 97 228 0.685 21.49 8.22 Intr - 192855 192696 160 0 1 105 53 275 0.838 25.26 8.21 Intr - 193215 193076 140 1 2 103 89 155 0.957 17.28 8.20 Intr - 193634 193439 196 2 1 91 70 364 0.981 33.89 8.19 Intr - 193892 193804 89 0 2 95 75 84 0.849 7.49 8.18 Intr - 195091 194924 168 2 0 108 66 165 0.875 16.22 8.17 Intr - 195893 195759 135 0 0 118 68 201 0.999 21.64 8.16 Intr - 197489 197273 217 2 1 -17 97 475 0.757 35.98 8.15 Intr - 197676 197516 161 1 2 63 36 174 0.466 9.41 8.14 Intr - 198561 198402 160 0 1 86 85 380 0.777 37.06 8.13 Intr - 199286 199206 81 2 0 95 113 88 0.970 11.83 8.12 Intr - 199672 199533 140 2 2 102 84 150 0.999 16.18 8.11 Intr - 200914 200885 30 2 0 121 110 1 0.929 3.70 8.10 Intr - 201126 201020 107 2 2 85 80 208 0.999 19.56 8.09 Intr - 202038 201873 166 1 1 90 101 299 0.999 30.42 8.08 Intr - 202626 202460 167 2 2 26 84 278 0.999 20.80 8.07 Intr - 202817 202712 106 0 1 65 101 129 0.999 11.17 8.06 Intr - 203027 202903 125 1 2 78 86 205 0.994 19.53 8.05 Intr - 203506 203374 133 2 1 -54 61 230 0.977 5.80 8.04 Intr - 206252 206089 164 1 2 -16 71 345 0.997 22.02 8.03 Intr - 207361 207308 54 0 0 84 105 42 0.908 3.59 8.02 Intr - 207787 207739 49 2 1 53 94 86 0.968 3.44 8.01 Intr - 208423 208306 118 1 1 106 89 204 0.993 22.34 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587r:47140119_47348789|GENSCAN_predicted_peptide_1|244_aa XIVLMDDAMDCLMSFSDFLFAFQIQFYYSEFLDSVAAIYEDLLSGKNPNTVIVPTSSSGQ HRQRPALGGAGTLEGVEASLFYQCLENLCDRHKYSCPPPALVKEALSNVQRLTFYGFLMA LSKHRGINQALGSAEQAAVKVRGMEESVAWLKGPQINYCAPNDLLGALPDKGDLMHDPAM DEELERLLAQVPGLVNSVTASPEASCLPSRTPPRVGSPWRPLHHSRKVDGESDGSTEETD ESET >gi568815587r:47140119_47348789|GENSCAN_predicted_CDS_1|735_bp nngattgtgctcatggacgatgccatggactgcttgatgtctttttcagatttcctcttt gccttccagatccagttttactactcagaattcctggacagtgtggctgccatctatgag gacctgctgtcaggcaagaaccccaacacagtgattgtgccgacgtcgtccagtgggcag caccgccaacgacctgccttgggcggggccggcacgctggagggcgtggaggcgtcgctg ttctaccagtgtctggaaaacctgtgtgatcggcacaagtacagctgcccacccccagca cttgtcaaagaggccctcagcaatgttcagagactgaccttctatggattcctcatggct ctctcaaagcaccgtggaatcaaccaagccctcgggtcagcagaacaggcagccgtgaag gtgaggggcatggaggaatctgttgcctggctgaagggccctcagattaactactgtgcc cccaatgatctcctaggagctttgcctgacaagggggatctgatgcacgacccagcaatg gatgaagagctggaacggctgctggcccaggtcccaggcctggtcaactcggtcacagcc agtccagaggccagttgcctgccttcccggacccctccccgggttggctctccctggaga cctctccatcattcccgaaaagtggatggagagagtgatggctccactgaagagacagac gagtcggagacttga >gi568815587r:47140119_47348789|GENSCAN_predicted_peptide_2|502_aa MAAEPNKTEIQTLFKRLRAVPTNKACFDCGAKNPSWASITYGVFLCIDCSGVHRSLGVHL SFIRSTELDSNWNWFQLRCMQVGGNANATAFFRQHGCTANDANTKYNSRAAQMYREKIRQ LGSAALARHGTDLWIDNMSSAVPNHSPEKKDSDFFTEHTQPPAWDAPATEPSGTQQPAPS TESSGLAQLKSSIIGKKKPAAAKKGLGAKKGLGAQKVSSQSFSEIERQAQVAEKLREQQA ADAKKQAEESMVASMRLAYQELQIDRKKEEKKLQNLEGKKREQAERLGMGLVSRSSVSHS VLSEMQVIEQETPVSAKSSRSQLDLFDDVGTFASGPPKYKDNPFSLGESFGSRWDTDAAW GMDRVEEKEPEVTISSIRPISERATNRREVESRSSGLESSEARQKFAGAKAISSDMFFGR EVDAEYEARSRLQQLSGSSAISSSDLFGDMDGAHGAGSVSLGNVLPTADIAQFKQGVKSV AGKMAVLANGVMNSLQDRYGSY >gi568815587r:47140119_47348789|GENSCAN_predicted_CDS_2|1509_bp atggcggcggagccgaacaagaccgaaatccagactctttttaagaggcttcgcgcagtt ccaaccaacaaggcctgtttcgactgcggcgccaagaatccgagttgggccagcatcacg tacggtgttttcttgtgcattgactgttccggggtgcaccgctccctgggcgtccatctg agcttcatcaggtccacagagttggattccaactggaactggttccagctgaggtgtatg caggtcggcgggaatgccaatgcgacggctttttttcgccaacatggatgcacagccaat gatgccaacaccaaatataatagccgagctgcccagatgtaccgggagaagatccggcag ctggggagtgcggccctggctaggcatggcactgatctttggatagacaacatgagtagt gccgttcctaatcactccccagagaagaaggactctgatttcttcacagaacacactcaa ccccctgcctgggatgcgccagccactgagccttcagggacccagcagccagccccgtct acagagagcagtggcctggcacaactgaaaagctccatcattggcaagaagaagccagca gcagctaagaaagggctgggtgccaagaaaggcctaggggcccagaaggtgagcagccag agcttcagtgagattgagcggcaggctcaggtggcagagaagctccgtgagcagcaggca gccgatgccaagaagcaggcggaggagtccatggtcgcctccatgcgtctggcctaccag gagctccagattgatcgtaagaaagaggaaaagaagctacagaatctggaagggaagaag cgagagcaggcagaaaggttgggcatgggcttggtatcccgaagctctgtctcccactcc gtgctgtctgagatgcaggtgattgagcaggaaaccccagtgagtgcaaaatcctctcgc tcgcagctggacttgtttgacgatgttggtactttcgcctctggacccccaaagtacaag gacaatcccttttccttaggggaaagctttggctcccgctgggatacagatgctgcctgg ggtatggacagggtagaggagaaggagccagaagtgaccatctcaagcatccggcctatt tcagaaagagccacaaaccggagggaagtggagagccggagctcaggcctcgagtctagt gaggcgcgtcagaaattcgcaggagccaaagccatctcatctgacatgttctttgggcgg gaggtggatgcagagtatgaggccaggtctcggctgcagcagctctcaggcagcagtgcc atcagctcttcagacctctttggggacatggatggagctcacggagcaggaagtgtatct ctggggaacgtgctgcctacagcggacattgcccagtttaagcagggtgtcaagtctgtg gctgggaaaatggctgtgctggccaatggtgtgatgaattccttgcaggatcgctacggt tcctactga >gi568815587r:47140119_47348789|GENSCAN_predicted_peptide_3|490_aa MPAAPLPRRVGLRQRRPGEGRGRGPGAGLPGGGPGALREAAARAQERQDSGRSVAGPPPA EGPREDAAGNYRRTVQRVEDGHRLCGDLVSCFQERARIEKAYAQQLADWARKWRGTVEKG PQYGTLEKAWHAFFTAAERLSALHLEVREKLQGQDSERVRAWQRGAFHRPVLGGFRESRA AEDGFRKAQKPWLKRLKEVEASKKSYHAARKDEKTAQTRESHAKADSAVSQEQLRKLQER VERCAKEAEKTKAQYEQTLAELHRYTPRYMEDMEQAFETCQAAERQRLLFFKDMLLTLHQ HLDLSSSEKYELPGGHGDLGFHELHRDLHQGIEAASDEEDLRWWRSTHGPGMAMNWPQFE ACVPGGEWSLDTQRTISRKEKGGRSPDEVTLTSIVPTRDGTAPPPQSPGSPGTGQDEEWS DEESPRKAATGVRVRALYDYAGQEADELSFRAGEELLKMSEEDEQGWCQGQLQSGRIGLY PANYVECVGA >gi568815587r:47140119_47348789|GENSCAN_predicted_CDS_3|1473_bp atgcctgcagccccgctgccccgccgcgtggggttgaggcagcggcggccaggagagggg cggggccgggggccgggggccgggcttcccgggggagggcccggcgcgctccgggaggcc gcggcgcgggcccaggagcggcaggactcgggccggagcgtggccggacccccacccgcc gaggggcccagggaggacgcggctggcaactacaggcgcacggtacagcgggtggaggac gggcaccggctgtgcggggacctggtcagctgcttccaggagcgcgcccgcatcgagaag gcttatgcccagcagttggctgactgggcccgaaagtggagggggaccgtggagaagggc ccccagtatggcacactggagaaggcctggcatgcctttttcacggcggctgagcggctg agcgcgctgcacctggaggtgcgggagaagctgcaagggcaggacagtgagcgggtgcgc gcctggcagcggggggctttccaccggcctgtgctgggcggcttccgcgagagccgggcg gccgaggacggcttccgcaaggcccagaagccctggctgaagaggctgaaggaggttgag gcttccaagaaaagctaccacgcagcccggaaggatgagaagaccgcccagacgagggag agccacgcaaaggcagacagcgccgtctcccaggagcagctgcgcaaactgcaggaacgg gtggaacgctgtgccaaggaggccgagaagacaaaagctcagtatgagcagacgctggca gagctgcatcgctacactccacgctacatggaggacatggaacaggcctttgagacctgc caggccgccgagcgccagcggcttcttttcttcaaggatatgctgctcaccttacaccag cacctggacctttccagcagtgagaagtatgaactgcctggtgggcatggagatctgggg ttccatgaactccaccgtgacttgcaccagggcattgaggcagccagtgacgaagaggat ctgcgctggtggcgcagcacccacgggccaggcatggccatgaactggccacagttcgag gcatgtgttccaggcggggagtggtccttggacacacagaggacaatcagccggaaagag aagggtggccggagccctgatgaggttaccctgaccagcattgtgcctacaagagatggc accgcacccccaccccagtccccggggtccccaggcacggggcaggatgaggagtggtca gatgaagagagtccccggaaggctgccaccggggttcgggtgagggcactctatgactac gctggccaggaagctgatgagctgagcttccgagcaggggaggagctgctgaagatgagt gaggaggacgagcagggctggtgccaaggccagttgcagagtggccgcattggcctgtac cctgccaactacgtggagtgtgtgggcgcctga >gi568815587r:47140119_47348789|GENSCAN_predicted_peptide_4|427_aa MAPKKRPETQKTSEIVLRPRNKRSRSPLELEPEAKKLCAKGSGPSRRCDSDCLWVGLAGP QILPPCRSIVRTLHQHKLGRASWPSVQQGLQQSFLHTLDSYRILQKAAPFDRRATSLAWH PTHPSTVAVGSKGGDIMLWNFGIKDKPTFIKGIGAGGSITGLKFNPLNTNQFYASSMEGT TRLQDFKGNILRVFASSDTINIWFCSLDVSASSRMVVTGDNVGNVILLNMDGKELWNLRM HKKKVTHVALNPCCDWFLATASVDQTVKIWDLRQVRGKASFLYSLPHRHPVNAACFSPDG ARLLTTDQKSEIRVYSASQWDCPLGLIPHPHRHFQHLTPIKAAWHPRYNLIVVGRYPDPN FKSCTPYELRTIDVFDGNSGKMMCQLYDPESSGISSLNEFNPMGDTLASAMGYHILIWSQ EEARTRK >gi568815587r:47140119_47348789|GENSCAN_predicted_CDS_4|1284_bp atggctcccaagaaacgcccagaaacccagaagacctccgagattgtattacgccccagg aacaagaggagcaggagtcccctggagctggagcccgaggccaagaagctctgtgcgaag ggctccggtcctagcagaagatgtgactcagactgcctctgggtggggctggctggccca cagatcctgccaccatgccgcagcatcgtcaggaccctccaccagcataagctgggcaga gcttcctggccatctgtccagcaggggctccagcagtcctttttgcacactctggattct taccggatattacaaaaggctgccccctttgacaggagggctacatccttggcgtggcac ccaactcaccccagcaccgtggctgtgggttccaaagggggagatatcatgctctggaat tttggcatcaaggacaaacccaccttcatcaaagggattggagctggagggagcatcact gggctgaagtttaaccctctcaataccaaccagttttacgcctcctcaatggagggaaca actaggctgcaagactttaaaggcaacattctacgagtttttgccagctcagacaccatc aacatctggttttgtagcctggatgtgtctgctagtagccgaatggtggtcacaggagac aacgtggggaacgtgatcctgctgaacatggacggcaaagagctttggaatctcagaatg cacaaaaagaaagtgacgcatgtggccctgaacccatgctgtgattggttcctggccaca gcctccgtagatcaaacagtgaaaatttgggacctgcgccaggttagagggaaagccagc ttcctctactcgctgccgcacaggcatcctgtcaacgcagcttgtttcagtcccgatgga gcccggctcctgaccacggaccagaagagcgagatccgagtttactctgcttcccagtgg gactgccccctgggcctgatcccgcaccctcaccgtcacttccagcacctcacacccatc aaggcagcctggcatcctcgctacaacctcattgttgtgggccgatacccagatcctaat ttcaaaagttgtaccccttatgaattgaggacgatcgacgtgttcgatggaaactcaggg aagatgatgtgtcagctctatgacccagaatcttctggcatcagttcgcttaatgaattc aatcccatgggggacacgctggcctctgcaatgggttaccacattctcatctggagccag gaggaagccaggacacggaagtga >gi568815587r:47140119_47348789|GENSCAN_predicted_peptide_5|454_aa MAGKRSGWSRAALLQLLLGVNLVVMPPTRARSLRFVTLLYRHGDRSPVKTYPKDPYQEEE WPQGFGQLTKEGMLQHWELGQALRQRYHGFLNTSYHRQEVYVRSTDFDRTLMSAEANLAG LFPPNGMQRFNPNISWQPIPVHTVPITEDRLLKFPLGPCPRYEQLQNETRQTPEYQNESS RNAQFLDMVANETGLTDLTLETVWNVYDTLFCEQTHGLRLPPWASPQTMQRLSRLKDFSF RFLFGIYQQAEKARLQGGVLLAQIRKNLTLMATTSQLPKLLVYSARAVSGKELDFFSVRF LRVWGTMLTGSPSPPQHDTTLVALQMALDVYNGEQAPYASCHIFELYQEDSGNFSVEMYF RNESDKAPWPLSLPGCPHRCPLQDFLRLTEPVVPKDWQQECQLASGPADTEVIVALAVCG SILFLLIVLLLTVLFRMQAQPPGYRHVADGEDHA >gi568815587r:47140119_47348789|GENSCAN_predicted_CDS_5|1365_bp atggcgggcaagcggtccggctggagccgggcggctctcctccagctccttctcggcgtg aacctggtggtgatgccgcccacccgggcccggagtctgcgcttcgttaccttgctgtac cgccatggagaccgttcaccagtgaagacatatcccaaggacccctatcaggaagaagaa tggccccaggggtttggtcagttaaccaaggaggggatgctacagcactgggaactgggc caggccctgcggcagcgctatcacggcttcctaaacacctcttatcaccggcaagaggtt tatgtgcgaagcacagactttgaccggactctcatgagtgctgaggccaacctggctgga ctcttccctcccaacgggatgcagcgcttcaacccgaacatctcgtggcagcctattcct gtgcacactgtgcccatcactgaggacaggctgctgaagttcccgttgggcccatgtccc cgttatgagcagctgcagaacgagacccggcagacaccagagtatcagaatgagagttct cggaatgcacaatttctggacatggtggccaacgagacagggcttacagacctgacactg gagaccgtctggaatgtctatgacacactcttctgtgagcaaacgcacgggctgcgcctg ccgccctgggcctcaccccaaaccatgcagcgtctcagccggctaaaggacttcagcttc cgcttcctcttcggaatctaccagcaggcggagaaggcccggcttcaggggggagtcctg ctggctcagataaggaagaacctgaccctaatggcgaccacctcccagctccccaagctg ctggtttactctgcgcgtgctgtgtcaggcaaggagctggatttcttctctgttcggttc ctgcgggtgtggggaacaatgctgacgggcagcccctctcccccgcagcacgacactacc ctggttgccctgcaaatggcactggatgtctacaatggtgaacaagccccctacgcctcc tgccacatatttgaactgtaccaggaagattctgggaatttctcagtggagatgtacttt cggaacgagagtgacaaggccccctggccgctcagcctgcctggctgccctcaccgctgc ccactgcaggacttccttcgcctcacagagcccgtcgtgcccaaggattggcagcaggag tgccagctggcaagcggtcctgcagacacagaggtgattgtggccttggctgtatgtggc tccatcctcttcctcctcatagtgctgctcctcaccgtcctcttccggatgcaggcccag cctcctggctaccgccacgtcgcagatggggaggaccacgcctga >gi568815587r:47140119_47348789|GENSCAN_predicted_peptide_6|420_aa MLGNELCSVCGDKASGFHYNVLSCEGCKGFFRRSVIKGAHYICHSGGHCPMDTYMRRKCQ ECRLRKCRQAGMREECVLSEEQIRLKKLKRQEEEQAHATSLPPRASSPPQILPQLSPEQL GMIEKLVAAQQQCNRRSFSDRLRVTPWPMAPDPHSREARQQRFAHFTELAIVSVQEIVDF AKQLPGFLQLSREDQIALLKTSAIEVAGEGQGMKGEAEWDYLWEGPPDIELGEPNLLGSR DEENRPPWKRPCSKTSPPSPRLRFAACVQVMLLETSRRYNPGSESITFLKDFSYNREDFA KAGLQVEFINPIFEFSRAMNELQLNDAEFALLIAISIFSADRPNVQDQLQVERLQHTYVE ALHAYVSIHHPHDRLMFPRMLMKLVSLRTLSSVHSEQVFALRLQDKKLPPLLSEIWDVHE >gi568815587r:47140119_47348789|GENSCAN_predicted_CDS_6|1263_bp atgctggggaacgagctatgcagcgtgtgtggggacaaggcctcgggcttccactacaat gttctgagctgcgagggctgcaagggattcttccgccgcagcgtcatcaagggagcgcac tacatctgccacagtggcggccactgccccatggacacctacatgcgtcgcaagtgccag gagtgtcggcttcgcaaatgccgtcaggctggcatgcgggaggagtgtgtcctgtcagaa gaacagatccgcctgaagaaactgaagcggcaagaggaggaacaggctcatgccacatcc ttgccccccagggcttcctcacccccccaaatcctgccccagctcagcccggaacaactg ggcatgatcgagaagctcgtcgctgcccagcaacagtgtaaccggcgctccttttctgac cggcttcgagtcacgccttggcccatggcaccagatccccatagccgggaggcccgtcag cagcgctttgcccacttcactgagctggccatcgtctctgtgcaggagatagttgacttt gctaaacagctacccggcttcctgcagctcagccgggaggaccagattgccctgctgaag acctctgcgatcgaggtggctggagaagggcaagggatgaagggagaagcagagtgggat tatctgtgggaggggcctccagacatcgagctgggagagccaaatctgctgggaagcagg gatgaggagaatcggcctccctggaagaggccatgctccaagaccagccctcctagtccc cgtttgaggtttgctgcttgtgtgcaggtgatgcttctggagacatctcggaggtacaac cctgggagtgagagtatcaccttcctcaaggatttcagttataaccgggaagactttgcc aaagcagggctgcaagtggaattcatcaaccccatcttcgagttctccagggccatgaat gagctgcaactcaatgatgccgagtttgccttgctcattgctatcagcatcttctctgca gaccggcccaacgtgcaggaccagctccaggtagagaggctgcagcacacatatgtggaa gccctgcatgcctacgtctccatccaccatccccatgaccgactgatgttcccacggatg ctaatgaaactggtgagcctccggaccctgagcagcgtccactcagagcaagtgtttgca ctgcgtctgcaggacaaaaagctcccaccgctgctctctgagatctgggatgtgcacgaa tga >gi568815587r:47140119_47348789|GENSCAN_predicted_peptide_7|1688_aa MVQKKKFCPRLLDYLVIVGARHPSSDSVAQTPELLRRYPLEDHTEFPLPPDVVFFCQPEG CLSVRQRRMSLRDDTSFVFTLTDKDTGVTRYGICVNFYRSFQKRISKEKGEGGAGSRGKE GTHATCASEEGGTESSESGSSLQPLSADSTPDVNQSPRGKRRAKAGSRSRNSTLTSLCVL SHYPFFSTFRECLYTLKRLVDCCSERLLGKKLGIPRGVQRDTMWRIFTGSLLVEEKSSAL LHDLREIEAWIYRLLRSPVPVSGQKRVDIEVLPQELQPALTFALPDPSRFTLVDFPLHLP LELLGVDACLQVLTCILLEHKVVLQSRDYNALSMSVMAFVAMIYPLEYMFPVIPLLPTCM ASAEQLLLAPTPYIIGVPASFFLYKLDFKMPDDVWLVDLDSNRVIAPTNAEVLPILPEPE SLELKKHLKQALASMSLNTQPILNLEKFHEGQEIPLLLGRPSNDLQSTPSTEFNPLIYGN DVDSVDVATRVAMVRFFNSANVLQGFQMHTRTLRLFPRPVVAFQAGSFLASRPRQTPFAE KLARTQAVEYFGEWILNPTNYAFQRIHNNMFDPALIGDKPKWYAHQLQPIHYRVYDSNSQ LAEALSVPPERDSDSEPTDDSGSDSMDYDDSSSSYSSLGDFVSEMMKCDINGDTPNVDPL THAALGDASEVEIDELQNQKEAEEPGPDSENSQENPPLRSSSSTTASSSPSTVIHGANSE PADSTEMDDKAAVGVSKPLPSVPPSIGKSNVDRRQAEIGEGSVRRRIYDNPYFEPQYGFP PEEDEDEQGESYTPRFSQHVSGNRAQKLLRPNSLRLASDSDAESDSRASSPNSTVSNTST EGFGGIMSFASSLYRNHSTSFSLSNLTLPTKGAREKATPFPSLKGNRRALVDQKSSVIKH SPTVKREPPSPQGRSSNSSENQQFLKEVVHSVLDGQGVGWLNMKKVRRLLESEQLRVFVL SKLNRMVQSEDDARQDIIPDVEISRKVYKGMLDLLKCTVLSLEQSYAHAGLGGMASIFGL LEIAQTHYYSKEPDKRKRSPTESVNTPVGKDPGLAGRGDPKAMAQLRVPQLGPRAPSATG KGPKELDTRSLKEENFIASIAFVHGVTEVFPYSGPEVIKPVFDLGETEEKKSQISADSGV SLTSSSQRTDQDSVIGVSPAVMIRSSSQDSEVSTVVGEHHTGILVSNSSGETLGADSDLS SNAGDGPGGEGSVHLASSRGTLSDSEIETNSATSTIFPHNCEDSNHQFLINGSGSENRLQ QFGFKACVEAHYSKTPGKAHSLKPSIKEKLAGSPIRTSEDVSQRVYLYEGLLGKERSTLW DQMQFWEDAFLDAVMLEREGMGMDQGPQEMIDRYLSLGEHDRKRLEDDEDRLLATLLHNL ISYMLLMKVNKNDIRKKVRRLMGKSHIGLVYSQQINEVLDQLANLNGRDLSIWSSGSRHM KKQTFVVHAGTDTNGDIFFMEVCDDCVVLRSNIGTVYERWWYEKLINMTYCPKTKVLCLW RRNGSETQLNKFYTKKCRELYYCVKDSMERAAARQQSIKPGPELGGEFPVQDLKTGEGGL LQVTLEGINLKFMHNQVFIELNHIKKCNTVRGVFVLEEFVPEIKEVVSHKYKTPMVSVPL NPDGPRPTKSATPYYVSSRTWLQFIAVRKISEPRPGLSLADGEGLRSCPSPGHAPGPLLF PSARCCCD >gi568815587r:47140119_47348789|GENSCAN_predicted_CDS_7|5067_bp atggtgcaaaagaagaagttctgtcctcggttacttgactatctagtgatcgtaggggcc aggcacccgagcagtgatagcgtggcccagactcctgaattgctacggcgataccccttg gaggatcacactgagtttcccctgcccccagatgtagtgttcttctgccagcccgagggc tgcctgagcgtgcggcagcggcgcatgagccttcgggatgatacctcttttgtcttcacc ctcactgacaaggacactggagtcacgcgatatggcatctgtgttaacttctaccgctcc ttccaaaagcgaatctctaaggagaagggggaaggtggggcagggtcccgtgggaaggaa ggaacccatgccacctgtgcctcagaagagggtggcactgagagctcagagagtggctca tccctgcagcctctcagtgctgactctacccctgatgtgaaccagtctcctcggggcaaa cgccgggccaaggcggggagccgctcccgcaacagtactctcacgtccctgtgcgtgctc agccactaccctttcttctccaccttccgagagtgtttgtatactctcaagcgcctggtg gactgctgtagtgagcgccttctgggcaagaaactgggcatccctcgaggcgtacaaagg gacaccatgtggcggatctttactggatcgctgctggtagaggagaagtcaagtgccctt ctgcatgaccttcgagagattgaggcctggatctatcgattgctgcgctccccagtaccc gtctctgggcagaagcgagtagacatcgaggtcctaccccaagagctccagccagctctg acctttgctcttccagacccatctcgattcaccctagtggatttcccactgcaccttccc ttggaacttctaggtgtggacgcctgtctccaggtgctaacctgcattctgttagagcac aaggtggtgctacagtcccgagactacaatgcactctccatgtctgtgatggcattcgtg gcaatgatctacccactggagtatatgtttcctgtcatcccgctgctacccacctgcatg gcatcagcagagcagctgctgttggctccaaccccgtacatcattggggttcctgccagc ttcttcctctacaaactggacttcaaaatgcctgatgatgtatggctagtggatctggac agcaatagggtgattgcccccaccaatgcagaagtgctgcctatcctgccagaaccagaa tcactagagctgaaaaagcatttaaagcaggccttggccagcatgagtctcaacacccag cccatcctcaatctggagaaatttcatgagggccaggagatcccccttctcttgggaagg ccttctaatgacctgcagtccacaccgtccactgaattcaacccactcatctatggcaat gatgtggattctgtggatgttgcaaccagggttgccatggtacggttcttcaattccgcc aacgtgctgcagggatttcagatgcacacgcgtaccctgcgcctctttcctcggcctgtg gtagcttttcaagctggctcctttctagcctcacgtccccggcagactccttttgccgag aaattggccaggactcaggctgtggagtactttggggaatggatccttaaccccaccaac tatgcctttcagcgaattcacaacaatatgtttgatccagccctgattggtgacaagcca aagtggtatgctcatcagctgcagcctatccactatcgcgtctatgacagcaattcccag ctggctgaggccctgagtgtaccaccagagcgggactctgactccgaacctactgatgat agtggcagtgatagtatggattatgacgattcaagctcttcttactcctcccttggtgac tttgtcagtgaaatgatgaaatgtgacattaatggtgatactcccaatgtggaccctctg acacatgcagcactgggggatgccagcgaggtggagattgacgagctgcagaatcagaag gaagcagaagagcctggcccagacagtgagaactctcaggaaaaccccccactgcgctcc agctctagcaccacagccagcagcagccccagcactgtcatccacggagccaactctgaa cctgctgactctacggagatggatgataaggcagcagtaggcgtctccaagcccctccct tccgtgcctcccagcattggcaaatcgaacgtggacagacgtcaggcagaaattggagag gggtcagtgcgccggcgaatctatgacaatccatacttcgagccccaatatggctttccc cctgaggaagatgaggatgagcagggggaaagttacactccccgattcagccaacatgtc agtggcaatcgggctcaaaagctgctgcggcccaacagcttgagactggcaagtgactca gatgcagagtcagactctcgggcaagctctcccaactccaccgtctccaacaccagcacc gagggcttcgggggcatcatgtcttttgccagcagcctctatcggaaccacagtaccagc ttcagtctttcaaacctcacactgcccaccaaaggtgcccgagagaaggccacgcccttc cccagtctgaaaggaaacaggagggcgttagtggatcagaagtcatctgtcattaaacac agcccaacagtgaaaagagaacctccatcaccccagggtcgatccagcaattctagtgag aaccagcagttcctgaaggaggtggtgcacagcgtgctggacggccagggagttggctgg ctcaacatgaaaaaggtgcgccggctgctggagagcgagcagctgcgagtctttgtcctg agcaagctgaaccgcatggtgcagtcagaggacgatgcccggcaggacatcatcccggat gtggagatcagtcggaaggtgtacaagggaatgttagacctcctcaagtgtacagtcctc agcttggagcagtcctatgcccacgcgggtctgggtggcatggccagcatctttgggctt ttggagattgcccagacccactactatagtaaagaaccagacaagcggaagagaagtcca acagaaagtgtaaataccccagttggcaaggatcctggcctagctgggcggggggaccca aaggctatggcacaactgagagttccacaactgggacctcgggcaccaagtgccacagga aagggtcctaaggaactggacaccagaagtttaaaggaagaaaattttatagcatctatt gcctttgtgcacggagtaacagaagtcttcccctactcagggcctgaagtaatcaaacct gtctttgaccttggtgagacagaggagaaaaagtcccagatcagcgcagacagtggtgtg agcctgacgtctagttcccagaggactgatcaagactctgtcatcggcgtgagtccagct gttatgatccgcagctcaagtcaggattctgaagttagcaccgtggtaggggaacaccac actggcatcttggtgagtaatagctctggagagacccttggagctgacagtgacttgagc agcaatgcaggtgatggaccaggtggcgagggcagtgttcacctggcaagctctcggggc actttgtctgatagtgaaattgagaccaactctgccacaagcaccatctttccgcataac tgtgaagatagcaaccatcagtttctcataaatggaagcggcagtgagaaccgtctacag cagtttggcttcaaggcctgtgtagaggcccattattccaaaacacctggtaaagcccac agcttgaagccaagcataaaggagaagctggcaggcagccccattcgtacttctgaagat gtgagccagcgagtctatctctatgagggactcctaggcaaagagcgttctactttatgg gaccaaatgcaattctgggaagatgccttcttagatgctgtgatgttggagagagaaggg atgggtatggaccagggtccccaggaaatgatcgacaggtacctgtcccttggagaacat gaccggaagcgcctggaagatgatgaagatcgcttgttggccacacttctgcacaacctc atctcctacatgctgctgatgaaggtaaataagaatgacatccgcaagaaggtgaggcgc ctaatgggaaagtcgcacattgggcttgtgtacagccagcaaatcaatgaggtgcttgat cagctggcgaacctgaatggacgcgatctctctatctggtccagtggcagccggcacatg aagaagcagacatttgtggtacatgcagggacagatacaaacggagatatctttttcatg gaggtgtgcgatgactgtgtggtgttgcgtagtaacatcggaacagtgtatgagcgctgg tggtacgagaagctcatcaacatgacctactgtcccaagacgaaggtgttgtgcttgtgg cgtagaaatggctctgagacccagctcaacaagttctatactaaaaagtgtcgggagctg tactactgtgtgaaggacagcatggagcgcgctgccgcccgacagcaaagcatcaaaccc ggacctgaattgggtggcgagttccctgtgcaggacctgaagactggtgagggtggcctg ctgcaggtgaccctggaagggatcaacctcaaattcatgcacaatcaggttttcatagag ctgaatcacattaaaaagtgcaatacagttcgaggcgtctttgtcctggaggaatttgtt cctgaaattaaagaagtggtgagccacaagtacaagacaccaatggtgagtgtgccgctc aaccctgatgggccacggcccacgaaatctgctactccgtattatgtctcttctcgtacg tggctgcagttcatagcagtgaggaagatctcagaaccccgccccggcctgtctctagct gatggagaggggctacgcagctgccccagcccagggcacgcccctggccccttgctgttc ccaagtgcacgatgctgctgtgactga >gi568815587r:47140119_47348789|GENSCAN_predicted_peptide_8|1066_aa VYLFELHITDAQPAFTGSYRCEVSTKDKFDCSNFNLTVHEAMGTGDLDLLSAFRRTDSHE DTGILDFSSLLKKRDSKLEAPAEEDVWEILRQAPPSEYERIAFQYGVTDLRGMLKRLKGM RRDEKKSTAFQKKLEPAYQVSKGHKIRLTVELADHDAEVKWLKNGQEIQMSGRYIFESIG AKRTLTISQCSLADDAAYQCVVGGEKCSTELFVKEPPVLITRPLEDQLVMVGQRVEFECE VSEEGAQVKWLKDGVELTREETFKYRFKKDGQRHHLIINEAMLEDAGHYALCTSGGQALA ELIVQEKKLEVYQSIADLMVGAKDQAVFKCEVSDENVRGVWLKNGKELVPDSRIKVSHIG RVHKLTIDDVTPADEADYSFVPEGFACNLSAKLHFMEVKIDFVPRQEPPKIHLDCPGRIP DTIVVVAGNKLRLDVPISGDPAPTVIWQKAITQGNKAPARPAPDAPEDTGDSDEWVFDKK LLCETEGRVRVETTKDRSIFTVEGAEKEDEGVYTVTVKNPVGEDQVNLTVKVIDVPDAPA APKISNVGEDSCTVQWEPPAYDGGQPILGECKGTGWRCEGAKQIRGKTRAATSPEPGYIL ERKKKKSYRWMRLNFDLIQELSHEARRMIEGVVYEMRVYAVNAIGMSRPSPASQPFMPIG PPSEPTHLAVEDVSDTTVSLKWRPPERVGAGGLDGYSVEYCPEGCSEWVAALQGLTEHTS ILVKDLPTGARLLFRVRAHNMAGPGAPVTTTEPVTVQEILQRPRLQLPRHLRQTIQKKVG EPVNLLIPFQGKPRPQVTWTKEGQPLAGEEVSIRNSPTDTILFIRAARRVHSGTYQVTVR IENMEDKATLVLQVVDKPSPPQDLRVTDAWGLNVALEWKPPQDVGNTELWGYTVQKADKK TMEWFTVLEHYRRTHCVVPELIIGNGYYFRVFSQNMVGFSDRAATTKEPVFIPRPGITYE PPNYKALDFSEAPSFTQPLVNRSVIAGYTAMLCCAVRGSPKPKISWFKNGLDLGEDARFR MFSKQGVLTLEIRKPCPFDGGIYVCRATNLQGEARCECRLEVRVPQ >gi568815587r:47140119_47348789|GENSCAN_predicted_CDS_8|3201_bp gtctatctgttcgagctgcacatcaccgatgcccagcctgccttcactggcagctaccgc tgtgaggtgtccaccaaggacaaatttgactgctccaacttcaatctcactgtccacgag gccatgggcaccggagacctggacctcctatcagccttccgccgcactgatagccatgag gacactgggattctggacttcagctcactgctgaaaaagagggactcgaagctggaggca ccagcagaggaggacgtgtgggagatcctacggcaggcacccccatctgagtacgagcgc atcgccttccagtacggcgtcactgacctgcgcggcatgctaaagaggctcaagggcatg aggcgcgatgagaagaagagcacagcctttcagaagaagctggagccggcctaccaggtg agcaaaggccacaagatccggctgaccgtggaactggctgaccatgacgctgaggtcaaa tggctcaagaatggccaggagatccagatgagcggcaggtacatctttgagtccatcggt gccaagcgtaccctgaccatcagccagtgctcattggcggacgacgcagcctaccagtgc gtggtgggtggcgagaagtgtagcacggagctctttgtgaaagagccccctgtgctcatc acgcgccccttggaggaccagctggtgatggtggggcagcgggtggagtttgagtgtgaa gtatcggaggagggggcgcaagtcaaatggctgaaggacggggtggagctgacccgggag gagaccttcaaataccggttcaagaaggacgggcagagacaccacctgatcatcaacgag gccatgctggaggacgcggggcactatgcactgtgcactagcgggggccaggcgctggct gagctcattgtgcaggaaaagaagctggaggtgtaccagagcatcgcagacctgatggtg ggcgcaaaggaccaggcggtgttcaaatgtgaggtctcagatgagaatgttcggggtgtg tggctgaagaatgggaaggagctggtgcccgacagccgcataaaggtgtcccacatcggg cgggtccacaaactgaccattgacgacgtcacacctgccgacgaggctgactacagcttt gtgcccgagggcttcgcctgcaacctgtcagccaagctccacttcatggaggtcaagatt gacttcgtacccaggcaggaacctcccaagatccacctggactgcccaggccgcatacca gacaccattgtggttgtagctggaaataagctacgtctggacgtccctatctctggggac cctgctcccactgtgatctggcagaaggctatcacgcaggggaataaggccccagccagg ccagccccagatgccccagaggacacaggtgacagcgatgagtgggtgtttgacaagaag ctgctgtgtgagaccgagggccgggtccgcgtggagaccaccaaggaccgcagcatcttc acggtcgagggggcagagaaggaagatgagggcgtctacacggtcacagtgaagaaccct gtgggcgaggaccaggtcaacctcacagtcaaggtcatcgacgtgccagacgcacctgcg gcccccaagatcagcaacgtgggagaggactcctgcacagtacagtgggagccgcctgcc tacgatggcgggcagcccatcctgggtgagtgcaagggcaccggatggaggtgtgagggc gccaaacagatccgagggaagaccagagctgccacctcccctgagccaggctacatcctg gagcgcaagaagaagaagagctaccggtggatgcggctgaacttcgacctgattcaggag ctgagtcatgaagcgcggcgcatgatcgagggcgtggtgtacgagatgcgcgtctacgcg gtcaacgccatcggcatgtccaggcccagccctgcctcccagcccttcatgcctatcggt ccccccagcgaacccacccacctggcagtagaggacgtctctgacaccacggtctccctc aagtggcggcccccagagcgcgtgggagcaggaggcctggatggctacagcgtggagtac tgcccagagggctgctcagagtgggtggctgccctgcaggggctgacagagcacacatcg atactggtgaaggacctgcccacgggggcccggctgcttttccgagtgcgggcacacaat atggcagggcctggagcccctgttaccaccacggagccggtgacagtgcaggagatcctg caacggccacggcttcagctgcccaggcacctgcgccagaccattcagaagaaggtcggg gagcctgtgaaccttctcatccctttccagggcaagccccggcctcaggtgacctggacc aaagaggggcagcccctggcaggcgaggaggtgagcatccgcaacagccccacagacacc atcctgttcatccgggccgctcgccgcgtgcattcaggcacttaccaggtgacggtgcgc attgagaacatggaggacaaggccacgctggtgctgcaggttgttgacaagccaagtcct ccccaggatctccgggtgactgacgcctggggtcttaatgtggctctggagtggaagcca ccccaggatgtcggcaacacggagctctgggggtacacagtgcagaaagccgacaagaag accatggagtggttcaccgtcttggagcattaccgccgcacccactgcgtggtgccagag ctcatcattggcaatggctactacttccgcgtcttcagccagaatatggttggctttagt gacagagcggccaccaccaaggagcccgtctttatccccagaccaggcatcacctatgag ccacccaactataaggccctggacttctccgaggccccaagcttcacccagcccctggtg aaccgctcggtcatcgcgggctacactgctatgctctgctgtgctgtccggggtagcccc aagcccaagatttcctggttcaagaatggcctggacctgggagaagacgcccgcttccgc atgttcagcaagcagggagtgttgactctggagattagaaagccctgcccctttgacggg ggcatctatgtctgcagggccaccaacttacagggcgaggcacggtgtgagtgccgcctg gaggtgcgagtgcctcagtga