GENSCAN 1.0 Date run: 6-Nov-116 Time: 06:35:55 Sequence gi568815587r:113633697_113873666 : 239970 bp : 42.69% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 2499 2614 116 2 2 91 78 85 0.536 7.53 1.02 Intr + 4146 4219 74 0 2 2 98 66 0.248 -2.87 1.03 Intr + 10915 11085 171 2 0 67 19 129 0.055 2.89 1.04 Term + 14802 15085 284 1 2 48 43 233 0.224 9.40 1.05 PlyA + 17039 17044 6 1.05 2.00 Prom + 20356 20395 40 -6.35 2.01 Sngl + 23943 24959 1017 2 0 88 43 795 0.988 71.77 2.02 PlyA + 25187 25192 6 1.05 3.00 Prom + 25356 25395 40 -6.15 3.01 Sngl + 26389 27714 1326 0 0 49 47 458 0.888 33.63 3.02 PlyA + 27814 27819 6 1.05 4.00 Prom + 28578 28617 40 -3.65 4.01 Init + 40499 40538 40 1 1 56 95 77 0.579 5.82 4.02 Term + 46513 46727 215 2 2 116 43 151 0.669 9.81 4.03 PlyA + 48146 48151 6 1.05 5.00 Prom + 48208 48247 40 -9.85 5.01 Init + 48270 48801 532 2 1 16 5 306 0.577 8.30 5.02 Intr + 49107 49270 164 1 2 97 21 185 0.603 11.47 5.03 Intr + 51066 51219 154 2 1 97 87 85 0.553 8.02 5.04 Intr + 53958 53991 34 1 1 87 94 9 0.578 -2.24 5.05 Intr + 56519 56748 230 2 2 88 76 232 0.703 18.59 5.06 Term + 57442 57635 194 2 2 51 38 132 0.620 1.10 5.07 PlyA + 57931 57936 6 -4.04 6.10 PlyA - 58049 58044 6 1.05 6.09 Term - 59769 59358 412 2 1 50 55 190 0.855 5.63 6.08 Intr - 60944 60665 280 0 1 84 90 221 0.909 17.51 6.07 Intr - 61747 61623 125 0 2 115 81 10 0.725 2.31 6.06 Intr - 63722 63587 136 0 1 29 92 68 0.838 0.01 6.05 Intr - 65331 65209 123 1 0 77 80 100 0.952 7.74 6.04 Intr - 65997 65899 99 1 0 74 68 88 0.839 4.56 6.03 Intr - 66472 66370 103 1 1 75 24 137 0.742 4.83 6.02 Intr - 69457 69211 247 0 1 87 37 103 0.035 1.64 6.01 Init - 75332 75253 80 2 2 73 76 45 0.054 2.43 6.00 Prom - 77218 77179 40 -7.15 7.03 PlyA - 78842 78837 6 1.05 7.02 Term - 80130 79697 434 1 2 29 54 323 0.076 17.47 7.01 Init - 88257 88182 76 0 1 61 64 136 0.182 9.80 7.00 Prom - 94033 93994 40 -7.05 8.18 PlyA - 98209 98204 6 1.05 8.17 Term - 100118 99998 121 1 1 84 36 133 0.936 4.67 8.16 Intr - 103126 102924 203 1 2 48 88 193 0.941 12.46 8.15 Intr - 104007 103876 132 0 0 101 94 19 0.925 3.72 8.14 Intr - 104698 104568 131 2 2 97 91 133 0.981 13.99 8.13 Intr - 105686 105517 170 1 2 74 21 76 0.731 -1.83 8.12 Intr - 110344 110106 239 1 2 37 68 223 0.784 10.69 8.11 Intr - 114017 113835 183 2 0 99 76 167 0.999 15.66 8.10 Intr - 114724 114561 164 2 2 33 88 168 0.093 10.07 8.09 Intr - 124157 123966 192 0 0 83 77 98 0.936 6.74 8.08 Intr - 125010 124858 153 1 0 103 87 62 0.953 6.72 8.07 Intr - 126672 126513 160 0 1 75 48 4 0.367 -6.26 8.06 Intr - 126894 126817 78 0 0 40 64 116 0.757 3.23 8.05 Intr - 127222 127121 102 1 0 27 78 132 0.804 5.55 8.04 Intr - 135271 135137 135 1 0 92 94 92 0.461 10.04 8.03 Intr - 140055 139866 190 2 1 85 100 155 0.326 15.07 8.02 Intr - 142989 142892 98 0 2 63 91 79 0.297 3.59 8.01 Init - 143628 143578 51 0 0 60 95 22 0.494 1.31 8.00 Prom - 144828 144789 40 -7.65 9.00 Prom + 145554 145593 40 -8.35 9.01 Sngl + 146100 146789 690 2 0 79 48 285 0.834 17.62 9.02 PlyA + 146816 146821 6 1.05 10.00 Prom + 148942 148981 40 -4.45 10.01 Init + 155708 155962 255 1 0 63 38 362 0.489 25.98 10.02 Intr + 156074 156186 113 1 2 -95 109 199 0.847 1.96 10.03 Intr + 156412 156560 149 1 2 90 -86 260 0.587 7.76 10.04 Term + 156608 157035 428 0 2 28 46 440 0.586 28.28 10.05 PlyA + 157739 157744 6 1.05 11.24 PlyA - 158507 158502 6 1.05 11.23 Term - 165719 165544 176 0 2 114 43 237 0.999 18.74 11.22 Intr - 167982 167787 196 0 1 88 76 110 0.999 7.87 11.21 Intr - 171055 170977 79 0 1 90 98 108 0.999 10.63 11.20 Intr - 171350 171172 179 2 2 81 100 196 0.999 17.90 11.19 Intr - 172888 172793 96 1 0 85 65 116 0.995 8.29 11.18 Intr - 174440 174255 186 2 0 90 77 202 0.998 18.26 11.17 Intr - 174741 174602 140 1 2 58 78 242 0.999 19.46 11.16 Intr - 175558 175367 192 2 0 109 65 176 0.999 15.94 11.15 Intr - 178808 178580 229 2 1 78 98 185 0.963 15.02 11.14 Intr - 180259 180189 71 2 2 74 75 60 0.977 1.28 11.13 Intr - 181686 181478 209 2 2 81 58 228 0.966 16.90 11.12 Intr - 184141 183962 180 0 0 53 90 138 0.892 8.56 11.11 Intr - 190004 189909 96 1 0 53 113 117 0.926 8.91 11.10 Intr - 193664 193537 128 2 2 45 109 47 0.754 1.06 11.09 Intr - 195649 195501 149 2 2 78 94 175 0.946 16.13 11.08 Intr - 197372 197171 202 2 1 59 94 67 0.679 2.34 11.07 Intr - 199861 199724 138 1 0 71 87 104 0.990 8.34 11.06 Intr - 200639 200553 87 2 0 69 98 60 0.919 4.35 11.05 Intr - 207061 206902 160 0 1 7 80 104 0.283 0.57 11.04 Intr - 208072 207967 106 2 1 95 30 114 0.355 4.65 11.03 Intr - 217320 217141 180 1 0 26 92 99 0.357 3.02 11.02 Intr - 218937 218805 133 0 1 111 58 173 0.450 15.90 11.01 Init - 236976 236896 81 0 0 62 66 75 0.645 3.72 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 80089 79697 393 1 0 88 54 323 0.819 24.89 S.002 Sngl + 121579 121923 345 0 0 30 54 254 0.867 11.99 S.003 Term - 122542 122418 125 2 2 102 37 43 0.888 -1.83 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587r:113633697_113873666|GENSCAN_predicted_peptide_1|214_aa MKKTVSRDKAGGGGLKSLTTFIKVQTLLRMAFFTNLEKVDSFGVGGKGREGKGRVDERQK LEDKPSGADTRGHQAGLLHSCGCEFGGPTGRATTTGQCLPLPGSLPRQCLVQLSLAEPLR GVKLETFVVSVTALKAAGPELFIPPGGFVLSLASGAKLQTLAVSVTAHKGSVDPKTEQQQ DLLQRTKKQPCHTVEGDLSRLPLLAWAACFYSLI >gi568815587r:113633697_113873666|GENSCAN_predicted_CDS_1|645_bp atgaagaagacagtcagcagagacaaggctggagggggtgggctgaagagccttacaacc ttcatcaaagttcagactctcctgaggatggcattctttactaatttggaaaaggttgac agctttggcgtgggagggaagggaagggaagggaagggaagggtagatgaacgtcagaaa ctggaagacaaaccttcaggagctgacaccagagggcatcaagcaggacttctccactcc tgtggctgtgaatttggagggcccactggaagggctactaccactggccagtgcctccca cttcctgggtctctcccacgtcagtgtctggttcagctcagcttggcagaacccctgcgt ggagtgaagctggagaccttcgtggtgagtgttacagctcttaaggcagcgggtccggag ttgttcattcctcctggtgggttcgtgctgtcgctggcctcaggagcgaagctgcagacc ttagcggtgagtgtaacagctcacaaaggcagtgtggacccaaagactgagcagcagcaa gatttactgcaaagaacgaagaaacaaccctgccacactgtggaaggggacctgagcagg ttgcctctgctggcttgggcagcctgcttttattcccttatctga >gi568815587r:113633697_113873666|GENSCAN_predicted_peptide_2|338_aa MGKKQSRKTGNSKKQSTSPPPKERSSSPATEQSWTENDFDEVREEGFRRSNYSELQEEIQ NKGKEVENFEKNLDECITRITNTEKCLKELMELKAKAREPREECRSLRSRYNQLEERVSV MEDEMNEMKREGKFREKRIKRNEQSLQEIWDYVKRPNLRLIGVPESDGENGTKLENTMKD IIQENFPNLARQANIQIQEIQRTPQRYSWRRATPRHIIVRFTKVEMKEKMLRAAREKGRV THKGKPIRLTVDLSAETLQARREWEPIFNILKEKNFQPRISYPAKLSFISEGEIKYFTDK QMLRDFVITRPALKELLKEVLNMERHNRYQLLQNHAKM >gi568815587r:113633697_113873666|GENSCAN_predicted_CDS_2|1017_bp atggggaaaaaacagagcagaaaaactggaaactctaaaaagcagagcacctctcctcct ccaaaggaacgcagttcctcaccagcaacggaacaaagctggacagagaatgactttgac gaggtgagagaagaaggcttcagacgatcaaactactccgagctacaggaggaaattcaa aacaaaggcaaagaagttgaaaactttgaaaaaaatttagacgaatgtataactagaata accaatacagagaagtgcttaaaggagctgatggagctgaaagccaaggctcgagaacca cgtgaagaatgcagaagcctcaggagccgatacaatcaactggaagaaagggtatcagtg atggaagatgaaatgaatgaaatgaagcgagaagggaagtttagagaaaaaagaataaaa agaaacgaacaaagcctccaagaaatatgggactatgtgaaaagaccaaatctgcgtctg attggtgtacctgaaagtgacggggagaatggaaccaagttggaaaacactatgaaggat attatccaggagaacttccccaatctagcaaggcaggccaacattcagattcaggaaata cagagaacgccacaaagatactcctggagaagagcaactccaagacacataattgtcaga ttcaccaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaaggtcgggtt acccacaaagggaagcccatcagactaacagtggatctctcggcagaaactctacaagcc agaagagagtgggagccaatattcaacattcttaaagaaaagaattttcaacccagaatt tcatatccagccaaactaagcttcataagtgaaggagaaataaaatactttacagacaag caaatgctgagagattttgtcatcaccaggcctgccctaaaagagctcctgaaggaagta ctaaacatggaaaggcacaaccggtaccagctgctgcaaaatcatgccaaaatgtaa >gi568815587r:113633697_113873666|GENSCAN_predicted_peptide_3|441_aa MVAIIISLPTKKSPGPDGFTAKIYQRYKEELVPFLLKLFQSIEKEGILPNSFYEASIFLI PKPGRDTTKKENFRPISLMNIDAKILNKILANRIQQHIKKLIHHDQVGFIPGMQGWFNIC KSINVIQHINRTKDKNHMIISIDAEKAFDKIQQRFMLNTLNKLGIDGMYLKIITAIYDKP TANIILNGQKLEAFPLKTGTRQGCPLSPFLFNIVLEVLARAMRQEKEIKGIQLGKEEVKL SLFADDMIVYLENPIVSAQNLLKLISNFSKVSGYQINVQKSQAFLYTNNRQTESQIMSEL PFTIASKRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKCKNIPCSWVGRINIVKMAI LPKVIYRFNAIPIKLPMTFFTELEKATFKFIWNQKRARIAKSTLSQKNKAGGIMLPDFKL YYKATVTKTAWYWYQNRDIDQ >gi568815587r:113633697_113873666|GENSCAN_predicted_CDS_3|1326_bp atggttgcaataatcattagcttaccaaccaaaaagagtccaggaccagatggattcaca gccaaaatctaccagaggtacaaggaggaactggtaccattccttctgaaactattccaa tcaatagaaaaagagggaatcctccctaactcattttatgaggccagcatcttcctgata ccaaagccgggcagagacacaaccaaaaaagagaattttagaccaatatccttgatgaac attgatgcaaaaatcctcaataaaatactggcaaaccgaatccagcagcacatcaaaaag cttatccaccatgatcaagtgggcttcatccctgggatgcaaggctggttcaatatatgc aaatcaataaatgtaatccagcatataaacagaaccaaagacaaaaaccacatgattatc tcaatagatgcagaaaaggcctttgacaaaattcaacaacgcttcatgctaaacactctc aataaattaggtattgatgggatgtatctcaaaataataacagctatctatgacaaaccc acagccaatatcatactgaatgggcaaaaactggaagcattccctttgaaaactggcaca agacagggatgccctctctcaccattcctattcaacatagtattggaagttctggccagg gcaatgaggcaggagaaggaaataaagggtattcaattaggaaaagaggaagtcaaattg tccctctttgcagatgacatgattgtatatctagaaaaccccattgtctcagcccaaaat ctccttaagctgataagcaacttcagcaaagtctcaggataccaaatcaatgtacaaaaa tcacaagcattcttatacaccaataacagacaaacagagagccaaatcatgagtgaactc ccattcacaattgcttcaaagagaataaaatacctaggaatccaacttacaagggatgtg aaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaagaggatacaaac aaatgcaagaacattccatgctcgtgggtaggaagaatcaatattgtgaaaatggccata ctgcccaaggtaatttatagattcaatgccatccccatcaaactaccaatgactttcttc acagaattggaaaaagctactttcaagttcatatggaaccaaaaaagagcccgcatcgcc aagtcaaccctaagccaaaagaacaaagctggaggcatcatgctacctgacttcaaacta tactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacagagatatagat caatga >gi568815587r:113633697_113873666|GENSCAN_predicted_peptide_4|84_aa MPVILAGAAVLIPDPEIPKVPEPASRGCTGTPAGHTPKQDGVSARLCRQPLAEGVGRKGG QKPTLFINQAPGQNPKIQGFKIWI >gi568815587r:113633697_113873666|GENSCAN_predicted_CDS_4|255_bp atgcctgtgattctggctggagctgccgtcctcatcccagatcctgaaattcccaaggtc cctgagcccgcttccagaggctgcacaggtacccctgctgggcatacccctaagcaggat ggagtgtctgcacggctgtgcagacagcccttagctgagggcgtggggagaaaaggtggc cagaagcctactctcttcatcaaccaagctccaggacagaatccaaaaattcaaggattc aaaatttggatctga >gi568815587r:113633697_113873666|GENSCAN_predicted_peptide_5|435_aa MIFGRQRPGEEAHPAPVRAAGPSLPPPEWAQPLEASTYNMLSPHRALGTRNQSDQGKLGH RYAWLAGVTLHLTHLPSLPQIAREVLGLGGPGCLRKTLGKVREGCPGCRLRSYHREVATR EAAGPEEGGALSLRASRARQHLLVSQPLQNSSDLENSQDLPPPTLCTRECFLKLRKDSMK KGWRKEWVLMASLGAVKENRKKESVHGLRGREEIPLAAFAELGPMRLQTEPGEQAQGNGA GQATRFPSLSKELDWQMLIRKVRDSASADLTGSLAKENKNFCIVYPPHTGPATASHHRPH LACISPSVQVASAKHAGGEGSAVHARAVAELSAEQGHHRVLEHIRAVSYERHREKLQGTK QGPGRQSSKRGEDSMQTAVCMAPSMGGQDPCCPLPTPGDGVGVKILSGHGHPTSNEDSVS KNVTYPMGAAMHDFR >gi568815587r:113633697_113873666|GENSCAN_predicted_CDS_5|1308_bp atgatctttggaaggcagaggccaggagaagaagcccatccagctcccgtacgggcggca ggcccttctcttcctcctcctgagtgggctcaaccactcgaagccagcacctataacatg ctgagcccgcaccgggctctaggaaccagaaatcagagtgaccaggggaaactgggacat cgatatgcatggcttgccggagtaacactacacctgactcaccttcctagcctcccgcaa attgccagagaagttctgggcctaggagggccaggctgtttgcgcaaaacattaggtaag gtgagggagggctgccctggctgcaggctcagaagctaccacagggaagtggcaaccagg gaagctgcaggccctgaggaaggaggggctctctccttaagggcctccagggcaaggcag catctgctggtgtcacagcctttgcaaaacagttcagaccttgagaattcccaggacctt cctcctccaactctgtgcaccagggagtgcttcctcaaactgaggaaggatagcatgaag aaaggctggaggaaggagtgggtcctcatggccagcttaggagctgtcaaggaaaacaga aagaaggaaagcgttcacggtcttcgtggtagagaggaaattcctttggctgcctttgcg gagctggggcccatgaggcttcagacagagcctggggagcaggctcaggggaacggagct ggccaggccactcgcttcccttccctttccaaagagctggactggcagatgctaataagg aaagtgagagattctgcatctgcagatctgactggtagcctggccaaagagaacaagaac ttctgtatagtgtacccaccccacacagggccagccacagccagccaccacaggccacac ctggcatgcatcagcccttccgtccaggtagccagcgcaaagcatgcggggggtgagggc tccgctgtacacgcaagagctgttgcagagctgagtgctgaacaagggcaccaccgtgtc ctggagcatatccgagctgtaagctatgagagacaccgagaaaaactgcagggcacaaag caaggcccaggcaggcagagcagcaagaggggagaggacagcatgcaaactgcagtgtgc atggcgccctctatgggtggacaggacccatgctgccccctccccacacctggggatgga gtgggggtgaaaattctttcaggccatgggcacccaacttctaatgaggacagtgtttcc aaaaacgtgacatatcccatgggtgcagccatgcatgattttaggtag >gi568815587r:113633697_113873666|GENSCAN_predicted_peptide_6|534_aa MPSTLRQDIDNSTLAFLLVQSLKLSPGAQPHFWLFSWAAAFPGSQCKLLVDLPFWGLEDG GLLFTAPLGSAPVGTLCGGSNPTFPFHTALSEDLHEGSIPVANICKGIQSLMLDDQPPME AQYAEEGPGPGIFRAEPGDQQHPISQAVCWRSMRRGCAVLGALGLLAGAGVGSWLLVLYL CPAASQPISGTLQDEEITLSCSEASAEEALLPALPKTVSFRINSEDFLLEAQVRDQPRWL LVCHEGWSPALGLQICWSLGHLRNNCTSGQVVSLRCSGESYLGLGEVAVRGRGVPPASSV EEPYECGARPLASRIVGGQSVAPGRWPWQASVALGFRHTCGGSVLAPRWVVTAAHCMHRQ VSVAAEDREPQLPFLFGCWYVLYQMFVSNCLQLFAPKGPAAFIKHLLHAQCLAGCWGDTK RHSLWGLGAHRLLTAPPPMAALASFLHLLPCLGTPLLPLPSPLSMAPVCSFRLARLSSWR VHAGLVSHSAVRPHQGALVERIIPHPLYSAQNHDYDVALLRLQTALNFSGAALA >gi568815587r:113633697_113873666|GENSCAN_predicted_CDS_6|1605_bp atgccttcaactctcagacaggacattgacaactccaccttagccttcctgcttgtgcag agccttaagcttagcccaggggcacagccccacttctggctgttttcatgggctgcagct tttccaggctcacagtgcaagctgttggtggatctaccattctggggtctggaggatggt ggccttcttttcacagctccactaggcagtgccccagtgggaactctgtgtgggggctcc aaccctacatttcccttccatactgccctatcagaggatctccatgagggctccatccct gtagcaaacatctgcaagggcatccagagcctgatgctggatgaccaaccccctatggag gcccagtatgcagaggagggcccaggacctgggatcttcagagcagagcctggagaccag cagcatcccatttctcaggcagtgtgctggcgttccatgcgacgtggctgtgcagtgctg ggagccctggggctgctggccggtgcaggtgttggctcatggctcctagtgctgtatctg tgtcctgctgcctctcagcccatttccgggaccttgcaggatgaggagataactttgagc tgctcagaggccagcgctgaggaagctctgctccctgcacttcccaaaacagtatctttc agaataaacagcgaagacttcttgctggaagcgcaagtgagggatcagccacgctggctc ctggtctgccatgagggctggagccccgccctggggctgcagatctgctggagccttggg catctcaggaacaactgcacttctggtcaagttgtttccctcagatgctctggtgagtca tatcttggtctaggggaggtggcggtgagaggaaggggtgttccccctgcctcaagtgtg gaagagccctatgagtgtggagcgaggcccctggcttcccggatagttggtgggcagtct gtggctcctgggcgctggccgtggcaggccagcgtggccctgggcttccggcacacgtgt gggggctctgtgctagcgccacgctgggtggtgactgctgcacattgtatgcacaggcaa gtctctgtggctgctgaggacagagagcctcagttgccctttctgttcggctgttggtat gtcctgtatcaaatgttcgtgtccaactgtctccagctttttgctccaaaaggtcctgca gcatttattaagcacctactgcatgcccagtgtcttgccggctgctggggtgatactaag aggcatagtctgtggggcctgggagctcaccgcctgctgacggcccccccaccaatggct gctctggccagcttcctccacctgctaccttgcttaggaacacctcttctgccccttcct tccccgctcagcatggcccctgtgtgcagtttcaggctggcccgcctgtccagctggcgg gttcatgcggggctggtcagccacagtgccgtcaggccccaccaaggggctctggtggag aggattatcccacaccccctctacagtgcccagaatcatgactacgacgtcgccctcctg aggctccagaccgctctcaacttctcaggtgcggcactggcttga >gi568815587r:113633697_113873666|GENSCAN_predicted_peptide_7|169_aa MNALDNQPEEFEFDPASNGEISEDLCHQHQRPKVDKTTKMGRNQSRKTENSKNQSTSSTP KDCTSLPATEQSWMENDELTEIGFRRSVIRNFSELKEDVLTHCKEAKNLEKRLDEWLTRI NSVEKILNDLTELKTMVQELRDARTSFNSRFDQVEERVSVIEDQINEIK >gi568815587r:113633697_113873666|GENSCAN_predicted_CDS_7|510_bp atgaatgcattggataatcagcctgaagaatttgaatttgatcctgccagcaatggcgaa atcagtgaagacttatgtcaccaacatcaaagaccaaaggtagataaaaccacaaagatg gggagaaaccagagcagaaaaactgaaaattctaaaaaccagagcacctcttccactcca aaggattgcacctccttgccagcaacagaacaaagctggatggagaatgacgagttgaca gaaataggcttcagaaggtcggtaataagaaacttctctgagctaaaggaggatgttcta acccattgcaaggaagctaaaaaccttgaaaaaagattagatgaatggttaactagaata aacagtgtagagaagatcttaaatgacctgacggagctgaaaaccatggtgcaagaacta cgtgatgcacgcacaagcttcaatagccgattcgatcaagtggaagaaagggtatcagtt attgaagatcaaattaatgaaataaaatga >gi568815587r:113633697_113873666|GENSCAN_predicted_peptide_8|833_aa MAMTFTKYTACKHFVNKQHDGKYCANVETLSDCTVLYQKYAFATLLLEYRPRVKTAGGTG AAALATRVTQSALVPVLAMASFVTEVLAHSGRLEKEDLGTRISRLTRRVEEIKGEVCNMI SKKYSEFLPSMQSAQGLITQVDKLSEDIDLLKSRIESEVRRDLHVSTGEFTDLKQQLERD SVVLSLLKQLQEFSTAIEEYNCALTEKKYVTGAQRLEEAQKCLKLLKSRKCFDLKILKSL SMELTIQKQNILYHLGEEWQKLIVWKFPPSKDTSSLESYLQTELHLYTEQSHKEEKTPMP PISSVLLAFSVLGELHSKLKSFGQMLLKYILRPLASCPSLHAVIESQPNIVIIRFESIMT NLEYPSPSEVFTKIRLVLEVLQKQLLDLPLDTDLENEKTSTVPLAEMLGDMIWEDLSECL IKNCLVYSIPTNSSKLQQYEEIIQSTEEFENALKEMRFLKGDTTDLLKYARNINSHFANK KCQDVIVAARNLMTSEIHNTVKIIPDSKINVPELPTPDEDNKLEVQKVSNTQYHEVMNLE PENTLDQHSFSLPTCRISESVKKLMELAYQTLLEATTSSDQWENLQKLPQLAAIHHNNCM YIAHHLLTLGHQFRLRLAPILCDGTATFVDLVPGFRRLGTECFLAQMRAQKGELLERLSS ARNFSNMDDEENYSAASKAVRQVLHQLKRLGIVWQDVLPVNIYCKAMGTLLNTAISEVIG KITALEDISTEDGDRLYSLCKTVMDEGPQVFAPLSEESKNKKYQEEVPVYVPKWMPFKEL MMMLQASLQEIGDRWADGKGPLAAAFSSSEVKALIRALFQNTERRAAALAKIK >gi568815587r:113633697_113873666|GENSCAN_predicted_CDS_8|2502_bp atggcgatgacttttaccaagtatactgcttgtaaacattttgttaacaagcaacatgat ggcaaatactgtgcaaacgtggaaacgctgtctgattgtactgtcctctaccagaaatat gcattcgctactttgcttcttgaatacaggcctcgcgtcaagacggccggcgggacggga gctgcggcgctggctacgagagtgacccagtcagcgttggttcccgtcttggccatggcc tcgttcgtgacagaagttttggcacactccgggaggctggaaaaggaggatctggggacc cggatcagccgcctgacccggcgggtggaggagatcaagggtgaggtgtgcaatatgatt agcaagaagtacagtgaattcctgcctagcatgcagagcgcgcagggcctgattacccag gtggataagctatctgaagacattgacctgctgaaatccaggatagagagtgaggtccgc cgggatcttcacgtatcaaccggtgaatttacagacttaaagcagcagttggaaagagac tcagttgtcctaagtttgcttaaacagttgcaggagttttccactgctattgaagaatat aattgtgcattaacagagaagaagtatgtcactggtgctcagcgtctggaagaggcacag aaatgcttgaagttattaaaatccagaaaatgctttgatttaaaaatattgaaatctctc agcatggagctcacaatacagaaacagaacatactttatcaccttggagaagagtggcag aagctgattgtatggaagttcccaccatcaaaagataccagcagtttggaatcttaccta caaactgaacttcatttatacactgaacaatcgcacaaagaggagaagacccctatgcca cccatcagttctgtcctcttggcattttctgttcttggagaactacacagcaagcttaaa tcatttggtcagatgctgctgaagtatatccttaggccgctggcatcttgcccatccctt catgctgtgatagaaagccagcctaacatagttattattcgttttgaatctataatgact aacttggaatatccatcaccatctgaagtttttacaaagatcagactggtactagaagtg ctccagaaacagcttctagatttgccacttgacactgacctggaaaatgaaaaaacatct actgtcccattggctgagatgcttggagacatgatctgggaggacttgtctgagtgcctc atcaaaaactgtttggtttattcgattccaacaaatagcagcaaattacagcaatatgaa gagatcatacagtccactgaagaatttgaaaatgccctaaaggaaatgagatttttaaaa ggagatactacagatttgctgaaatacgctcgtaacatcaattctcattttgcaaacaaa aagtgccaggatgtgattgtggcagccagaaatctaatgacctcagaaattcataacact gtgaagattattcctgattctaagataaatgtgccagagttacccactcctgatgaggat aacaaactggaagtacagaaagtatccaatactcagtaccacgaagtgatgaatttagag cctgaaaatacattggaccaacattccttttccttgcccacatgccgtatcagtgagtct gtgaagaaattaatggaactcgcctatcagactttactagaggcaacaaccagtagtgat caatgggagaaccttcaaaaacttccccagttggctgctattcatcacaacaactgtatg tacattgctcaccacttgctgaccctcgggcatcagttcagattgcgtcttgcccccatt ctttgtgatggcactgctacttttgtggatcttgtacctggcttcaggagacttgggaca gaatgctttttggcccaaatgcgggcacagaaaggtgaacttctggaaagattatcaagt gctaggaacttttcaaatatggacgatgaagagaattattctgcagcaagtaaagcagtc cggcaggtactgcaccaactaaagagacttggaattgtgtggcaggatgtcctgccagtg aatatatattgcaaggctatggggactttactcaatacagcaatttctgaggtcattggc aaaattactgccctagaggacatatctactgaagatggtgataggttatattccttatgc aaaacagtgatggatgaaggaccccaagtatttgcacctttatctgaagaaagcaagaac aagaaatatcaagaagaggttccagtctatgtgccaaaatggatgccattcaaggaattg atgatgatgctacaagccagcttgcaagaaattggggatcggtgggcagatggaaaagga cccctggcagctgcgttctcttccagtgaagtaaaagctttaattcgtgccttgtttcag aacacagaaagaagagcagctgcccttgctaaaattaaatag >gi568815587r:113633697_113873666|GENSCAN_predicted_peptide_9|229_aa MAWSFRAKVQLGGLLLSLLGWVCSCVTTILPQWKTLNLELNEMETWIMGIWEVCVDREEV ATVCKAFESFLSLPQELQVARILMVASHGLGLLGLLLCSFGSECFQFHRIRWVFKRRLGL LGRTLEASASATTLLPVSWVAHATIQDFWDDSIPDIIPRWEFGGALYLGWAAGIFLALGG LLLIFSACLGKEDVPFPLMAGPTVPLSCAPVEESDGSFHLMLRPRNLVI >gi568815587r:113633697_113873666|GENSCAN_predicted_CDS_9|690_bp atggcctggagtttccgtgcaaaagtccagctcggggggctacttctctccctccttggc tgggtctgctcctgtgttaccaccatcctgccccagtggaagactcttaatctggaactg aacgagatggagacctggatcatggggatttgggaggtctgcgtggatcgagaggaagtc gccactgtgtgcaaggcctttgaatccttcttgtctctgccccaggagctccaggtagcc cgcatcctcatggtagcctcccatgggctgggcctattggggcttttgctctgcagcttt gggtctgaatgcttccagtttcacaggatcagatgggtattcaagaggcggcttggtctc ctgggaaggactttggaggcatccgcttcagccactaccctccttccagtctcctgggtg gcccatgccacaatccaagacttctgggatgacagcatccctgacatcatacctcggtgg gagtttggaggtgccctctacttgggctgggctgctggtattttcctggctcttggtggg ctactcctcatcttctcggcctgcctgggaaaagaagatgtgccttttcctttgatggct ggtcccacagtccccctatcctgtgctccagtggaggagtcagatggctccttccacctc atgctaagacctaggaacctggtcatctag >gi568815587r:113633697_113873666|GENSCAN_predicted_peptide_10|314_aa MLARTIKYLQPNPASQAKLTMLNAVCKIRGQVKNPGYPQSEGLLSECLIRHQKELGNESN FSDALLDAGESMKHLAEVKDSLDIEKRQDKIPDEELRQALEKSEDSKEVAETSMRSWTPT LSRVHGHHGVSGAAVPVAALAFAAAAAAPGSAAAPPSGLSHGASQGIQQQRCSKHRNMTE MSFLSSEVLVGDLMSPFDQSGLGAEESLGLLDNYLEVAKHFKPHGFSRDKAKAGFSEWLA VDGLGSPSNNSKEDAFSGTDWMLEKMDLKEFVFDALLGIDNLETMPDELLTTLDDTCDLF APLVQETNKETPRR >gi568815587r:113633697_113873666|GENSCAN_predicted_CDS_10|945_bp atgctggcaaggaccatcaagtacctgcagcccaacccagcctcgcaggctaagctgacc atgctgaacgcggtgtgcaagatccggggccaggtgaagaaccccggctacccgcagtcg gaggggctcctgagcgagtgcctgatccgccaccagaaggagctaggcaacgagtccaac ttcagtgacgcactgctggatgccggcgagtccatgaagcacctggcagaggtgaaggac tccctggacatagagaagcggcaggacaagatccccgatgaggagctgcgccaggcgctg gagaagtctgaggactccaaggaggtagcagaaaccagcatgcggtcctggacaccgaca ttgagcagggtccacggccaccatggcgtatcaggggcagcagtacctgtggcagcattg gcctttgcagcggcggcagcagcaccaggctctgcagcggcaccccccagcggcttaagc catggcgcttctcagggcattcagcagcagcgttgctcaaagcaccgcaacatgaccgaa atgagcttcctgagcagcgaggtattggtgggggacttgatgtcccccttcgaccagtcg ggtttgggggctgaagaaagcctaggtctcttagataactacctggaggtggccaagcac ttcaaacctcatgggttctccagggacaaggctaaggcgggcttctccgaatggctggct gtggatgggttaggcagtccctccaataacagcaaggaggatgccttctccgggacagat tggatgttggagaaaatggatttgaaggagttcgtctttgatgccctgttgggtatagat aacctggaaaccatgccagatgaacttttgaccactttggatgacacttgtgatctcttt gcccccctagtccaggagactaataaggagacccccagacggtga >gi568815587r:113633697_113873666|GENSCAN_predicted_peptide_11|1130_aa MAVWTSLGSSKWLIVQRPAEGLTRQQQASNGDITQAVSLLTDERVKEPSQDTVATEPSEV EGSAANKEVLATTEAFNQDGNHCRWSRFEMVDEEISLGNTKFEMPVRHSRQKCQKDGGDP GAQFRGEVQAGEVIDLTHDNKDDLQAAIALSLLESPKIQADGRDLNRMHEATSAETKRSK RKRCEVWGENPNPNDWRRVDGWPVGLKNVGNTCWFSAVIQSLFQLPEFRRLVLSYSLPQN VLENCRSHTEKRNIMFMQELQYLFALMMGSNRKFVDPSAALDLLKGAFRSSEEQQIIPGS LGVDENALFLLLNDAWILCSTYLNPNNPELSPFPKSSSPRNKSENPMVQLFYGTFLTEGV REGKPFCNNETFGQYPLQVNGYRNLDECLEGAMVEGDVELLPSDHSVKYGQERWFTKLPP VLTFELSRFEFNQSLGQPEKIHNKLEFPQIIYMDRYMYRSKELIRNKRECIRKLKEEIKI LQQKLERYVKYGSGPARFPLPDMLKYVIEFASTKPASESCPPESDTHMTLPLSSVHCSVS DQTSKESTSTESSSQDVESTFSSPEDSLPKSKPLTSSRSSMEMPSQPAPRTVTDEEINFV KTCLQRWRSEIEQDIQDLKTCIASTTQTIEQMYCDPLLRQVPYRLHAVLVHEGQANAGHY WAYIYNQPRQSWLKYNDISVTESSWEEVERDSYGGLRNVSAYCLMYINDKLPYFNAEAAP TESDQMSEVEALSVELKHYIQEDNWRFEQEVEEWEEEQSCKIPQMESSTNSSSQDYSTSQ EPSVASSHGVRCLSSEHAVIVKEQTAQAIANTARAYEKSGVEAALSELKEAEPKKPMPQE TNLAEQSEQPPKANDAESTAQPNSEVSEVEIPSVGRILVRSDADGYDEEVMLSPAMQGVI LAIAKARQTFDRDGSEAGLIKAFHEEYSRLYQLAKETPTSHSDPRLQHVLVYFFQNEAPK RVVERTLLEQFADKNLSYDERSISIMKVAQAKLKEIGPDDMNMEEYKELNAKAASLFETN DDHSVTEGINVMNELIIPCIHLIINNDISKDDLDAIEVMRNHWCSYLGQDIAENLQLCLG EFLPRLLDPSAEIIVLKEPPTIRPNSPYDLCSRFAAVMESIQGVSTVTVK >gi568815587r:113633697_113873666|GENSCAN_predicted_CDS_11|3393_bp atggccgtgtggaccagtctgggctccagcaaatggctgattgttcagaggcctgcagag ggactgactaggcagcagcaggccagtaatggtgacattactcaggcagtcagccttctc actgatgagagagttaaggagcccagtcaagacactgttgctacagaaccatctgaagta gaggggagtgctgccaacaaggaagtattagcaacaactgaagcatttaaccaagatggg aatcactgtaggtggagcagatttgagatggtagatgaagagatcagtttggggaatact aaatttgaaatgcctgttagacattcacggcagaagtgtcagaaagatggtggagaccca ggtgcacagttcagaggagaggtccaggctggagaagttatagaccttactcatgataac aaagatgatcttcaggctgccattgctttgagtctactggagtctcccaaaattcaagct gatggaagagatcttaacaggatgcatgaagcaacctctgcagaaactaaacgctcaaag agaaaacgctgtgaagtctggggagaaaaccccaatcccaatgactggaggagagttgat ggttggccagttgggctgaaaaatgttggcaatacatgttggtttagtgctgttattcag tctctctttcaattgcctgaatttcgaagacttgttctcagttatagtctgccacaaaat gtacttgaaaattgtcgaagtcatacagaaaagagaaatatcatgtttatgcaagagctt cagtatttgtttgctctaatgatgggatcaaatagaaaatttgtagacccgtctgcagcc ctggatctattaaagggagcattccgatcatctgaggaacagcagatcatacctggctct ttgggggttgatgaaaatgcactatttttgcttttgaatgatgcttggattttatgtagc acatatcttaatccaaacaatccagaattgtctccttttcctaaaagcagcagtcccagg aacaaatctgaaaatccaatggtgcagctgttctatggtactttcctgactgaaggggtt cgtgaaggaaaacccttttgtaacaatgagaccttcggccagtatcctcttcaggtaaac ggttatcgcaacttagacgagtgtttggaaggggccatggtggagggtgatgttgagctt cttccctccgatcactcggtgaagtatggacaagagcgttggtttacaaagctacctcca gtgttgacctttgaactctcaagatttgagtttaatcagtcccttgggcagccagagaaa attcacaataagctggaatttcctcagattatttatatggacaggtacatgtacaggagc aaggagcttattcgaaataagagagagtgtattcgaaagttgaaggaggaaataaaaatt ctgcagcaaaaattggaaaggtatgtgaaatatggctcaggcccagctcggttcccgctc ccggacatgctgaaatatgttattgaatttgctagtacaaaacctgcctcagaaagctgt ccacctgaaagtgacacacatatgacattaccactttcttcagtgcactgctcggtttct gaccagacatccaaggaaagtacaagtacagaaagctcttctcaggatgttgaaagtacc ttttcttctcctgaagattctttacccaagtctaaaccactgacatcttctcggtcttcc atggaaatgccttcacagccagctccacgaacagtcacagatgaggagataaattttgtt aagacctgtcttcagagatggaggagtgagattgaacaagatatacaagatttaaagact tgtattgcaagtactactcagactattgaacagatgtactgcgatcctctccttcgtcag gtgccttatcgcttgcatgcagttcttgttcatgaaggacaagcaaatgctggacactat tgggcctatatctataatcaaccccgacagagctggctcaagtacaatgacatctctgtt actgaatcttcctgggaagaagttgaaagagattcctatggaggcctgagaaatgttagt gcttactgtctgatgtacattaatgacaaactaccctacttcaatgcagaggcagcccca actgaatcagatcaaatgtcagaagtggaagccctatctgtggaactcaagcattacatt caggaggataactggcggtttgagcaggaagtagaggagtgggaagaagagcagtcttgc aaaatccctcaaatggagtcctccaccaactcctcatcacaggactactctacatcacaa gagccttcagtagcctcttctcatggggttcgctgcttgtcgtctgagcatgctgtgatt gtaaaggagcaaactgcccaggctattgcaaacacagcccgtgcctatgagaagagcggt gtagaagcggcactgagtgagcttaaagaagctgaacccaagaagcccatgccccaggaa acaaaccttgcagagcagtcagaacagcccccaaaggctaatgatgcagagtctactgcc cagcctaattctgaggtctctgaagtcgagattcccagtgtgggaaggattctggttaga tctgatgcagatggatatgatgaggaggtgatgctgagccctgccatgcaaggggtcatc ctggccatagctaaagcccgtcagacctttgaccgagatgggtctgaagcagggctgatt aaggcattccatgaagaatactccaggctctatcagcttgccaaagagacccccacctct cacagtgatcctcgacttcagcatgtccttgtctactttttccaaaatgaagcacccaaa agggtagtagaacgaacccttctggaacagtttgcagataaaaatcttagctatgatgaa agatcaatcagcattatgaaggtggctcaagcgaaactgaaggaaattggtccagatgac atgaatatggaagagtacaaggagctgaatgccaaagcagcttctctttttgaaacaaat gatgatcactccgtaactgagggcattaatgtgatgaatgaactgatcatcccctgcatt caccttatcattaataatgacatttccaaggatgatctggatgccattgaggtcatgaga aaccattggtgctcttaccttgggcaagatattgcagaaaatctgcagctgtgcctaggg gagtttctacccagacttctagatccttctgcagaaatcatcgtcttgaaagagcctcca actattcgacccaattctccctatgacctatgtagccgatttgcagctgtcatggagtca attcagggagtttcaactgtgacagtgaaataa