GENSCAN 1.0 Date run: 5-Nov-116 Time: 09:58:46 Sequence gi568815591r:99393532_99600093 : 206562 bp : 48.01% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 11 117 107 1 2 68 34 108 0.816 3.15 1.02 Intr + 498 588 91 0 1 103 100 142 0.932 16.90 1.03 Term + 920 958 39 1 0 106 43 38 0.995 -1.71 1.04 PlyA + 1231 1236 6 1.05 2.09 PlyA - 1946 1941 6 1.05 2.08 Term - 3209 3151 59 0 2 100 36 57 0.823 -0.45 2.07 Intr - 4482 4331 152 2 2 59 116 241 0.705 23.81 2.06 Intr - 6893 6772 122 2 2 80 44 221 0.991 16.19 2.05 Intr - 9974 9867 108 2 0 54 84 170 0.693 13.68 2.04 Intr - 11422 11331 92 2 2 74 96 84 0.742 7.51 2.03 Intr - 15189 15005 185 2 2 -26 78 170 0.238 4.03 2.02 Intr - 15657 15503 155 0 2 48 84 29 0.066 -2.63 2.01 Init - 15932 15855 78 2 0 63 41 114 0.249 5.26 2.00 Prom - 16684 16645 40 -10.64 3.00 Prom + 17019 17058 40 -7.76 3.01 Init + 17562 17655 94 2 1 65 113 75 0.745 8.24 3.02 Intr + 22607 22729 123 0 0 109 76 163 0.999 17.76 3.03 Intr + 23898 24064 167 1 2 104 40 186 0.966 15.08 3.04 Term + 25860 25910 51 2 0 113 54 63 0.937 2.83 3.05 PlyA + 26063 26068 6 1.05 4.09 PlyA - 26078 26073 6 -4.73 4.08 Term - 26618 26436 183 2 0 52 45 290 0.967 18.64 4.07 Intr - 30426 30244 183 0 0 91 65 368 0.998 34.78 4.06 Intr - 32085 31264 822 0 0 111 113 856 0.984 82.02 4.05 Intr - 35673 35572 102 0 0 108 91 178 0.999 20.47 4.04 Intr - 36275 36057 219 2 0 112 81 444 0.993 44.60 4.03 Intr - 39194 39045 150 2 0 92 -20 119 0.600 1.86 4.02 Intr - 39887 39747 141 2 0 122 57 272 0.902 28.05 4.01 Init - 41711 41259 453 2 0 78 116 425 0.699 38.16 4.00 Prom - 43720 43681 40 -8.76 5.00 Prom + 44539 44578 40 -11.33 5.01 Init + 45552 45654 103 2 1 109 53 233 0.727 22.30 5.02 Intr + 51258 51308 51 1 0 119 116 19 0.218 6.68 5.03 Intr + 54590 54742 153 0 0 122 99 177 0.523 22.24 5.04 Intr + 55987 56131 145 2 1 58 55 100 0.929 3.04 5.05 Intr + 56638 56840 203 1 2 60 90 203 0.572 16.63 5.06 Intr + 57171 57264 94 1 1 86 73 18 0.947 -0.88 5.07 Intr + 58837 58909 73 1 1 38 81 92 0.960 2.91 5.08 Intr + 60435 60605 171 2 0 74 113 99 0.480 11.14 5.09 Intr + 64284 64483 200 2 2 80 47 127 0.072 5.95 5.10 Intr + 64971 65136 166 0 1 116 59 58 0.044 5.66 5.11 Intr + 72520 72657 138 0 0 68 45 80 0.133 2.36 5.12 Intr + 75264 75469 206 2 2 88 81 39 0.099 1.20 5.13 Intr + 77785 77944 160 1 1 66 74 26 0.075 -1.01 5.14 Intr + 82872 82949 78 2 0 100 105 29 0.400 5.55 5.15 Intr + 86130 86256 127 2 1 38 49 146 0.174 5.95 5.16 Term + 92906 93957 1052 0 2 124 41 381 0.419 29.30 5.17 PlyA + 94044 94049 6 1.05 6.04 PlyA - 97694 97689 6 1.05 6.03 Term - 101100 99998 1103 1 2 43 36 529 0.220 35.44 6.02 Intr - 105311 105185 127 2 1 86 72 76 0.978 6.05 6.01 Init - 106562 106107 456 2 0 68 94 500 0.691 44.41 6.00 Prom - 112171 112132 40 -1.66 7.00 Prom + 112441 112480 40 -7.56 7.01 Init + 112520 112927 408 1 0 58 109 508 0.534 46.45 7.02 Intr + 118922 119060 139 1 1 48 77 69 0.428 1.84 7.03 Intr + 126296 126378 83 0 2 122 99 2 0.971 4.06 7.04 Intr + 126638 126773 136 1 1 14 82 142 0.981 6.34 7.05 Intr + 132282 132887 606 1 0 109 57 450 0.642 36.42 7.06 Intr + 137577 138587 1011 1 0 108 85 553 0.593 47.67 7.07 Term + 139546 139646 101 2 2 68 41 74 0.663 -1.01 7.08 PlyA + 139897 139902 6 1.05 8.05 PlyA - 141964 141959 6 1.05 8.04 Term - 147276 147098 179 1 2 66 35 112 0.886 1.55 8.03 Intr - 150099 149778 322 0 1 41 99 96 0.530 1.43 8.02 Intr - 150258 150211 48 0 0 114 61 14 0.307 0.18 8.01 Init - 154876 154391 486 1 0 55 -8 247 0.145 7.05 8.00 Prom - 155161 155122 40 -8.86 9.00 Prom + 156046 156085 40 -2.46 9.01 Init + 164903 165216 314 1 2 78 25 199 0.489 9.40 9.02 Intr + 167022 167164 143 0 2 62 74 81 0.421 4.10 9.03 Intr + 168357 168442 86 1 2 140 99 -8 0.965 5.04 9.04 Intr + 168813 168963 151 2 1 74 82 121 0.937 9.84 9.05 Intr + 178157 178261 105 0 0 44 109 94 0.963 7.29 9.06 Term + 178714 180053 1340 2 2 33 42 677 0.980 49.29 9.07 PlyA + 180373 180378 6 1.05 10.05 PlyA - 180827 180822 6 1.05 10.04 Term - 186960 186774 187 2 1 43 41 180 0.674 5.76 10.03 Intr - 188856 188620 237 2 0 66 30 116 0.461 0.33 10.02 Intr - 189842 189734 109 0 1 110 12 70 0.844 0.94 10.01 Init - 195597 195546 52 0 1 76 44 52 0.749 0.92 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 65732 65616 117 0 0 99 70 169 0.892 16.64 S.002 Intr - 66662 66555 108 0 0 77 55 102 0.861 6.06 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591r:99393532_99600093|GENSCAN_predicted_peptide_1|78_aa MGKRLFKAQKLPATRVHPAANQRERELYGKTGQALRQISVLSGGKAKCSQFCTTGMDGGM SIWDVKSLESALKDLKIK >gi568815591r:99393532_99600093|GENSCAN_predicted_CDS_1|237_bp atggggaagaggctcttcaaagcccaaaagctgccagccacacgtgtccaccccgcggcg aatcagagggagcgggagctgtatggcaagactggccaagcccttcgccagatctcggtg ctcagcggcggcaaggccaagtgctcgcagttctgcaccactggcatggatggcggcatg agtatctgggatgtgaagagcttggagtcagccttgaaggacctcaagatcaaatga >gi568815591r:99393532_99600093|GENSCAN_predicted_peptide_2|316_aa MVGGNQIEEKEDVGGNQIEKKDVPGGKNERGAASSRELSPASRSKARTHPSGWLRVQPLI SPSRTVQTTGLPPQPLPRTVRLRAEASLRSGSLRPLRPLKRRRRLAQEEEVEPKLFPVPV SVRGAAAAAAAAGAAMPKGGRKGGHKGRARQYTSPEEIDAQLQAEKQKAREEEEQKEGGD GAAGDPKKEKKSLDSDESEDEEDDYQQKRKGVEGLIDIENPNRVAQTTKKVTQLDLDGPK ELSRREREEIEKQKAKERYMKMHLAGKTEQAKADLARLAIIRKQREEAARKKEEERKAKD DATLSGKRMQSLSLNK >gi568815591r:99393532_99600093|GENSCAN_predicted_CDS_2|951_bp atggtaggcggtaaccagatcgaggagaaagaggacgtaggaggtaatcagatcgagaag aaagatgtgccgggaggtaagaacgaacgtggcgccgcctcctctcgggagctctctccg gcctcaaggtccaaagcccgaacacatcccagtggctggctgagagtccagccactcata tctccctccaggacggtgcagaccaccggtctcccgcctcagccgttgccaaggacggtc cgacttcgtgcggaggcctccctgaggtccgggtccttgcggccactgcggccactgaag cggcggcggcggctggcccaggaggaagaagtcgagcccaagctatttccggttccggtg tcagttcgaggcgccgccgccgccgccgcagccgccggagccgcaatgcctaaaggagga agaaagggaggccacaaaggccgggcgaggcagtatacaagccctgaggagatcgacgcg cagctgcaggctgagaagcagaaggccagggaagaagaggagcaaaaagaaggtggagat ggggctgcaggtgaccccaaaaaggagaagaaatctctagactcagatgagagtgaggat gaagaagatgactaccagcaaaagcgcaaaggcgttgaagggctcatcgacatcgagaac cccaaccgggtggcacagacaaccaaaaaggtcacacaactggatctggacgggccaaag gagctttcgaggagagaacgagaagagattgagaagcagaaggcaaaagagcgttacatg aaaatgcacttggccgggaagacagagcaagccaaggctgacctggcccggctggccatc atccggaaacagcgggaggaggctgcccggaagaaggaagaggaaaggaaagcaaaagac gatgccacattgtcaggaaaacgaatgcagtcactctccctgaataagtaa >gi568815591r:99393532_99600093|GENSCAN_predicted_peptide_3|144_aa MPKVKRSRKAPPDGWELIEPTLDELDQKMREAETEPHEGKRKVESLWPIFRIHHQKTRYI FDLFYKRKAISRELYEYCIKEGYADKNLIAKWKKQGYENLCCLRCIQTRDTNFGTNCICR VPKSKLEVGRIIECTHCGCRGCSG >gi568815591r:99393532_99600093|GENSCAN_predicted_CDS_3|435_bp atgcctaaagtcaaaagaagccggaaagcacccccagatggctgggagttgattgagcca acactggatgaattagatcaaaagatgagagaagctgaaacagaaccgcatgagggaaag aggaaagtggaatctctgtggcccatcttcaggatccaccaccagaaaacccgctacatc ttcgacctcttttacaagcggaaagccatcagcagagaactctatgaatattgtattaaa gaaggctatgcagacaaaaacctgattgcaaaatggaaaaagcaaggatatgagaacttg tgctgcctgcggtgcattcagacacgggacaccaacttcgggacgaactgcatctgccgc gtgcccaaaagcaagctggaagtgggccgcatcatcgagtgcacacactgtggctgtcgt ggctgctctggctga >gi568815591r:99393532_99600093|GENSCAN_predicted_peptide_4|750_aa MDFVRLARLFARARPMGLFILQHLDPCRARWAGGREGLMRPMWAPFSSSSSQLPLGQERQ ENTGSLGSDPSHSNSTATQEEDEEEEESFGTLSDKYSSRRLFRKSAAQFHNLRFGERRDE QMEPEPKLWRGRRNTPYWYFLQCKHLIKEGKLVEALDLFERQMLKEERLQPMESNYTVLI GGCGRVGYLKKAFNLYNQVGRETEKRNKTQRQSIEKEQWSPGTGAQHTLSIPHTEDLRIR RTCAAQALMKKRDLEPSDATYTALFNVCAESPWKDSALQSALKLRQQLQAKNFELNLKTY HALLKMAAKCADLRMCLDVFKEIIHKGHVVTEETFSFLLMGCIQDKKTGFRYALQVWRLM LSLGLQPSRDSYNLLLVAARDCGLGDPQVASELLLKPREEATVLQPPVSRQRPRRTAQAK AGNLMSAMLHVEALERQLFLEPSQALGPPEPPEARVPGKAQPEVDTKAEPSHTAALTAVA LKPPPVELEVNLLTPGAVPPTVVSFGTVTTPADRLALIGGLEGFLSKMAEHRQQPDIRTL TLLAEVVESGSPAESLLLALLDEHQVEADLTFFNTLVRKKSKLGDLEGAKALLPVLAKRG LVPNLQTFCNLAIGCHRPKDGLQLLTDMKKSQVTPNTHIYSALINAAIRKLNYTYLISIL KDMKQNRVPVNEVVIRQLEFAAQYPPTFDRYQGKNTYLEKIDGFRAYYKQWLTVMPAEET PHPWQKFRTKPQGDQDTGKEADDGCALGGR >gi568815591r:99393532_99600093|GENSCAN_predicted_CDS_4|2253_bp atggacttcgtgagactcgctcgactgttcgccagggcccgccccatgggactgttcatc ctgcaacacctggacccctgtagagccaggtgggcaggaggcagggaggggctgatgcgg ccaatgtgggcgcccttcagcagctcctcctctcagctgcccctcggccaggagcgtcag gaaaacacgggcagcctgggctctgacccgagccactccaactccacggccacgcaggaa gaagacgaggaggaggaggagagttttgggaccctctctgacaaatactcctcccggaga ctattccgcaaatccgcagcccagttccataacctgcggtttggggaacggagagatgag caaatggaaccggagcccaaattatggcgaggccggagaaacaccccgtactggtacttc ttgcagtgcaaacacctgatcaaggaagggaagctggttgaagccctggacctgtttgag aggcagatgctgaaggaggagcgattgcagcccatggagagcaactacacggtgctgatt gggggctgcgggcgggttggctacctgaagaaggccttcaacctctacaaccaggtggga cgagagactgagaaaagaaataagacacagagacaaagtatagagaaagaacagtggtcc cctgggaccggtgctcagcatacgctcagcatcccgcatacggaggacctgcgcatacgg aggacctgcgctgcgcaggcactgatgaaaaagcgggacctggagccctcggacgccacc tacacggccctgttcaacgtctgtgccgagtccccctggaaggactcagctctacagagc gccctgaagctccggcagcagctgcaggccaaaaacttcgagctcaacttgaaaacatac cacgcgctgctgaagatggctgccaagtgcgcagaccttaggatgtgcctcgatgtgttc aaggaaatcatccacaaagggcacgtggtcacagaggagaccttcagtttcctgctcatg ggctgcatccaagacaagaagacaggcttccggtacgccctccaggtgtggcggctgatg ctgagtctagggctacagccgagccgggacagctacaacctgctgttggtggcagctcgg gactgtggcctaggggacccccaggtggcctcagagctgcttctgaagcccagggaggag gcgactgtgcttcagcccccagtgagcaggcagcggccaaggaggacagcccaggccaag gcaggcaacctcatgtcagccatgctgcatgtggaggccctggagaggcagctgtttctg gaaccttctcaggcacttgggcctccagagcctccggaagccagagtgcccggcaaggcc caaccagaggtggatactaaggcagagcccagccacacagcagccctcaccgcagtggcc ctgaagccacctcccgtggagctggaagtcaacctcctgacccccggggccgttccccct acagtggtctcctttggaacggtgaccaccccagctgaccggctggccttgatagggggc ctggagggcttcctgagcaagatggcagagcacaggcagcagcccgacatcaggaccctc acgctactggccgaggtggtggagtccgggagtcctgcagagtccttgctgctggccctc ctggatgagcaccaggtagaggccgacctgacattctttaacacgctggtgagaaagaag agcaagctgggagacctggagggggccaaggcgctgttgccggtcctggcaaagaggggc ctcgtccccaacctgcagacattctgcaacctggccatcgggtgccacaggccgaaggac ggtctacagcttctcacagacatgaagaagtcccaggtgacccccaacactcacatctac agtgccctcatcaacgcggccatcaggaagctgaactacacctatctcatcagcatcttg aaggacatgaagcagaacagggtcccggtgaacgaagtggtcatccgccagctggagttt gcagcccagtaccctcccacctttgaccggtaccaagggaagaacacctacctggagaag attgacggcttccgagcctattacaagcagtggctgacagtgatgcccgcagaggaaacc ccgcacccctggcagaagttccggaccaagccccagggggaccaggacaccggcaaggag gctgatgacggatgtgcccttgggggcaggtga >gi568815591r:99393532_99600093|GENSCAN_predicted_peptide_5|1039_aa MQEIIASVDHIKFDLEIAVEQQLGAQPLPFPGMDKSGAAVCEFFLKAACGKGGMCPFRHI SGEKTVVCKHWLRGLCKKGDQCEFLHEYDMTKMPECYFYSKFALVQDALSAASVTCTTDI GLRAPFPGTGGTMVKRSLPCEAFTLVVQGKRARQAKTGHQNLFSLDEAKVGLAPAFPVVS RLSFPAGECSNKECPFLHIDPESKIKDCPWYDRGFCKHGPLCRHRHTRRVICVNYLVGFC PEGPSCKFMHPRFELPMGTTEQPPLPQQTQPPAKQSNNPPLQRSSSLIQLTSQNSSPNQQ RTPQVIGVMQSQNSSAGNRGPRPLEQVTCYKGDAETQDAGHPCLEPQDSRGGIWRSQVRK PLCYTIKGQPTKGQNVCKCTCPQHSWQGTDFGTAASGRISRPGGHKDTSGANLPPGQPLS FQKEQREHTLRQTAPENPSFRGRTALVPQNGSECGSGPWTLAPASTTESKQPYLGPDTHQ LTPFWSPGVRCAGPREGCWAGHSRGRRAAGAQDPHTGIQGGPGKRPEGSYANARSPGVDG RTEGAGAVICICINANVLVNELGLFRGRDMDEAGGHQPQQTNTGTENQTQHVLIHKWELN IENAWTQRGEQHTPGPFGESGLSQTSRIPPLAKDQAVEAMFPPARGKELLSFEDVAMYFT REEWGHLNWGQKDLYRDVMLENYRNMVLLVDRSFTSFLVIWLGSEARHKMKKLTPKQKFS EDLESYKISVVMQESAEKLSEKLHKCKEFVDSCRLTFPTSGDEYSRGFLQNLNLIQDQNA QTRWKQGRYDEDGKPFNQRSLLLGHERILTRAKSYECSECGKVIRRKAWFDQHQRIHFLE NPFECKVCGQAFRQRSALTVHKQCHLQNKPYRCHDCGKCFRQLAYLVEHKRIHTKEKPYK CSKCEKTFSQNSTLIRHQVIHSGEKRHKCLECGKAFGRHSTLLCHQQIHSKPNTHKCSEC GQSFGRNVDLIQHQRIHTKEEFFQCGECGKTFSFKRNLFRHQVIHTGSQPYQCVICGKSF KWHTSFIKHQGTHKGQIST >gi568815591r:99393532_99600093|GENSCAN_predicted_CDS_5|3120_bp atgcaggaaatcatcgccagcgtggaccacatcaagtttgacttggagatcgcggtggag cagcagctgggggcgcagccgctgcccttccccggcatggacaagtcgggcgctgctgtc tgtgaattctttttgaaagctgcctgcggcaaagggggcatgtgtccgtttcgccacatc agtggtgagaagacagttgtgtgcaaacactggctgcgtggcctatgcaagaaaggggac cagtgtgagttcctgcatgagtatgacatgaccaagatgcccgagtgctacttctactcc aagttcgccctggtgcaggacgcactgtctgcagcttctgtcacctgcaccacagacatt gggttgagggccccctttccaggcactggagggacaatggtgaagaggtccttgccctgt gaagccttcactcttgtggtccagggaaagagggctcgtcaggccaaaactggacaccag aacctcttttccttggatgaagccaaggtgggcctggccccggccttcccagtggtctca cgtctgagtttccctgcaggggagtgcagcaacaaggaatgtcccttcctgcacatcgac cccgagtccaagatcaaggactgtccttggtatgaccgtggcttctgcaagcacggtccc ctctgcaggcaccggcacacacggagagtcatctgtgtgaattacctcgtgggattctgc ccggaggggccctcgtgtaaattcatgcaccctcgatttgaactgcccatgggaaccacc gagcagcccccactgccgcagcagacacagcctccagcaaagcaaagtaacaatccgcca ttacaaaggtcgtcctccttgatccagttaacgagtcagaactcttctcccaatcagcag agaaccccgcaggtcatcggggtcatgcagagtcaaaacagcagcgcgggcaaccgggga ccccggccactggagcaggtcacctgttacaagggtgatgcagaaacacaggatgcgggc catccctgcctggagccacaggacagccggggtgggatctggcgctctcaggtacgcaag cctttgtgttacaccataaagggccagccaactaaaggccagaatgtctgcaaatgcacc tgcccacagcactcatggcaagggacggacttcggcactgctgcttcgggaagaatcagc agacccggggggcacaaagacacctcaggagccaacctgccacctggccaaccactttct ttccagaaggaacagagggaacacacgctcagacaaacagccccagagaacccgtccttc agaggcaggaccgccctggtgccacagaatgggagtgagtgtggctctggaccctggacc ctggctcctgcttccaccacggagtccaaacagccttacctggggccggacactcaccaa ctgacgccattttggagtcctggtgtccgctgtgccggaccgcgcgagggctgctgggcc ggacactcgcggggacgcagagccgccggagcccaggatccgcacactggaatccaggga ggccccgggaagaggccggagggaagctacgccaatgcccgatcccctggggtggatgga aggactgagggcgcaggagctgtgatttgtatatgtataaatgccaacgttttagttaat gaactggggttgtttcggggaagggacatggatgaagctggtggccatcagcctcagcaa actaacacaggaacagaaaaccaaacacagcatgttctcattcataagtgggaattgaat attgagaacgcatggacacagagaggggaacaacacacacctgggccttttggggagtcg gggctcagccagacgtccaggatcccaccccttgcaaaagaccaggccgtggaagccatg ttcccaccagccagggggaaggagctgctctcgtttgaggatgtggcgatgtacttcacc agagaggagtggggccacctcaactggggtcagaaggacctctaccgagatgtgatgttg gagaactacaggaacatggtcttgctggtggataggtcatttacatcttttcttgttatt tggctaggttctgaagccagacacaagatgaaaaagctaactccaaaacagaaattttct gaagatttagagtcatataagatatcagtggtaatgcaggaatcagctgagaaactttca gaaaagttacataagtgtaaagaatttgtggacagttgcaggcttactttccctactagt ggtgatgaatacagcaggggcttccttcaaaaccttaaccttattcaagatcagaatgcg caaacaaggtggaagcagggcagatatgatgaggatggcaaacccttcaatcaaagatct ttgcttttggggcatgagcgaattctcacaagagcaaagtcttatgaatgcagtgaatgt ggaaaagtcattaggcgtaaggcatggtttgatcaacatcaaagaattcactttttagag aatccttttgagtgtaaggtctgtgggcaagccttcagacagcggtcagctcttacggtc cataaacagtgtcacctgcaaaacaagccatacagatgtcatgactgtggaaagtgtttt cggcagctcgcgtatcttgttgaacataagaggattcacaccaaagaaaaaccttataaa tgtagcaaatgtgaaaaaacgtttagtcagaattcaacccttattcgacatcaggtgatc catagtggagaaaaacgccataaatgccttgagtgtggaaaagcctttggccggcattca acccttctatgtcatcaacagattcacagtaaaccgaacacccataaatgcagtgaatgt ggacagtcctttggtaggaatgtggatctcattcagcatcaaagaatccatacaaaggag gaattctttcaatgtggagaatgtgggaaaacgtttagttttaagaggaatctttttcga catcaggtcattcacactggaagccaaccctaccaatgtgtcatatgtggaaaatctttc aagtggcacacaagctttattaagcaccagggcactcacaaaggacagatatccacatga >gi568815591r:99393532_99600093|GENSCAN_predicted_peptide_6|561_aa MNSSLTAQRRGSDAELGPWVMAARSKDAAPSQRDGLLPVKVEEDSPGSWEPNYPAASPDP ETSRLHFRQLRYQEVAGPEEALSRLRELCRRWLRPELLSKEQILELLVLEQFLTILPEEL QAWVREHCPESGEEAVAVVRALQRALDGTSSQGMVTFEDTAVSLTWEEWERLDPARRDFC RESAQKDSGSTVPPSLESRVENKELIPMQQILEEAEPQGQLQEAFQGKRPLFSKCGSTHE DRVEKQSGDPLPLKLENSPEAEGLNSISDVNKNGSIEGEDSKNNELQNSARCSNLVLCQH IPKAERPTDSEEHGNKCKQSFHMVTWHVLKPHKSDSGDSFHHSSLFETQRQLHEERPYKC GNCGKSFKQRSDLFRHQRIHTGEKPYGCQECGKSFSQSAALTKHQRTHTGEKPYTCLKCG ERFRQNSHLNRHQSTHSRDKHFKCEECGETCHISNLFRHQRLHKGERPYKCEECEKSFKQ RSDLFKHHRIHTGEKPYGCSVCGKRFNQSATLIKHQRIHTGEKPYKCLECGERFRQSTHL IRHQRIHQNKVLSAGRGGSRL >gi568815591r:99393532_99600093|GENSCAN_predicted_CDS_6|1686_bp atgaattccagcttgaccgcccagaggcgcggcagtgacgccgagttgggaccctgggtg atggctgcgaggtccaaggacgcggcgccgtcccaacgcgacggacttttgcccgtgaaa gtggaggaagactcacccggaagttgggagcccaactatcccgcggcttcgccggacccc gaaacttctcgactgcactttaggcagctgcgttaccaggaggtggctggaccggaagag gcgctgagccggctccgagaactctgtcgtcggtggctgagacccgagctgctctccaag gagcagatcctggagctgctggtgctggagcagttcctcaccatcctgcccgaggagctt caagcctgggtgcgagagcactgcccagagagcggggaggaggcggtggccgtggtgcgg gctctgcagcgagcgctcgatggaacctcatcccaggggatggtgactttcgaggacacg gctgtgtctctaacctgggaggagtgggagcgcctggacccagcacggagggacttctgc agagagagtgcgcagaaggattccgggagcacagttccgccgagtttggaaagcagagtg gagaacaaagagttgattccaatgcaacaaattttagaagaagcggagccacaggggcaa ctacaagaagcgttccaggggaagcgccccctgttttctaagtgtggcagtacccatgag gacagggtggaaaagcagtccggagaccccttgcccctgaaacttgaaaattctcctgaa gcagaaggactcaacagcatctcagatgtcaataagaatggttccatagaaggggaagac tctaaaaataatgaattgcagaacagtgccaggtgttccaaccttgttctatgtcagcac atcccgaaagcagagaggcccactgacagtgaggaacacgggaacaagtgcaagcaaagt ttccacatggtgacgtggcacgtgctgaaacctcacaagtctgacagtggagacagtttc catcattccagcctttttgagacccagaggcagctccatgaagaaagaccttataaatgt ggtaactgtgggaagagtttcaaacaacgctctgacctctttagacaccagagaatccac acaggtgagaaaccctatggctgccaagaatgtgggaaaagcttcagccagagtgctgcc ctgaccaagcaccagaggacacacacaggcgagaagccgtacacctgtctgaaatgtggg gagcgcttcaggcagaattcacacctaaatcgtcatcaaagtacccacagtagagacaaa cattttaaatgtgaggaatgcggggaaacctgtcatatttccaacctttttagacatcag agactacataaaggggaaagaccctataagtgtgaagaatgcgagaagagcttcaaacag cgctctgacctctttaaacaccacagaatccacactggggagaagccctatggatgttcc gtctgtgggaaacgcttcaatcagagtgcaaccctcattaaacaccagagaattcacact ggggaaaagccttacaaatgtcttgaatgtggggaaagatttagacaaagtacacacctt atccgacaccaaagaattcatcaaaataaagtgctgtcggctgggcgtggtggctcacgc ctataa >gi568815591r:99393532_99600093|GENSCAN_predicted_peptide_7|827_aa MTESREVIDLDPPAETSQEQEDLFIVKVEEEDCTWMQEYNPPTFETFYQRFRHFQYHEAS GPREALSQLRVLCCEWLRPELHTKEQILELLVLEQFLTILPEEFQPWVREHHPESGEEAV AVIENIQRELEERRQQIVACPDVLPRKMATPGAVQESCSPHPLTVDTQPEQAPQKPRLLE ENALPVLQVPSLPLKDSQELTASLLSTGSQKLVKIEEVADVAVSFILEEWGHLDQSQKSL YRDDRKENYGSITSMGYESRDNMELIVKQISDDSESHWVAPEHTERSVPQDPDFAEVSDL KGMVQRWQVNPTVGKSRQNPSQKRDLDAITDISPKQSTHGERGHRCSDCGKFFLQASNFI QHRRIHTGEKPFKCGECGKSYNQRVHLTQHQRVHTGEKPYKCQVCGKAFRVSSHLVQHHS VHSGERPYGCNECGKNFGRHSHLIEHLKRHFREKSQRCSDKRSKNTKLSVKKKISEYSEA DMELSGKTQRNVSQVQDFGEGCEFQGKLDRKQGIPMKEILGQPSSKRMNYSEVPYVHKKS STGERPHKCNECGKSFIQSAHLIQHQRIHTGEKPFRCEECGKSYNQRVHLTQHQRVHTGE KPYTCPLCGKAFRVRSHLVQHQSVHSGERPFKCNECGKGFGRRSHLAGHLRLHSREKSHQ CRECGEIFFQYVSLIEHQVLHMGQKNEKNGICEEAYSWNLTVIEDKKIELQEQPYQCDIC GKAFGYSSDLIQHYRTHTAEKPYQCDICRENVGQCSHTKQHQKIYSSTKSHQCHECGRGF TLKSHLNQHQRIHTGMRAAKPIEGVGPRRITSPGPSSMCADFWVPQK >gi568815591r:99393532_99600093|GENSCAN_predicted_CDS_7|2484_bp atgaccgaatcccgagaagttatagacttagaccccccagctgagacttcccaggagcag gaagaccttttcatagtgaaggtggaagaagaagactgcacctggatgcaggagtacaac ccgccaacgtttgagactttttaccagcgcttcaggcacttccagtaccatgaggcttca ggaccccgggaggctctcagccaactccgggtgctctgctgtgagtggctgaggcccgag ctgcacacgaaggagcagatcctggagctgctggtgctggagcagttcctgaccatcctg cctgaagagttccagccctgggtgagggaacatcaccctgaaagtggagaagaggcggtg gccgtgatagaaaatatacagcgagaacttgaggaacgcagacagcagattgttgcctgc cctgatgtgcttcctcggaagatggcaacacctggagcagtgcaggagtcctgcagcccc catcccctgaccgtggacacccagcctgagcaagcgccacagaagcctcgtctcctggag gaaaatgcccttcctgttctccaagttccttcccttcccctgaaggacagccaggagctg acagcttcacttctctcaactgggtcccagaagttggtgaaaattgaagaggtggctgat gtggctgtatccttcatcctggaggaatgggggcatttggaccagtcccagaagtccctt tatagggatgacaggaaggagaactatgggagtattacttccatgggttatgagtccagg gacaatatggagctcatagtgaagcagatttctgatgactctgaatcacactgggtggcg ccagaacacaccgaaaggagcgttcctcaggatccagactttgcagaagtcagtgacctt aaaggcatggtacaaaggtggcaggtcaaccccactgtggggaaatcaaggcagaatcct tcccagaaaagggatctggatgcaatcacagacatcagccctaagcaaagcacacatggc gagagagggcacagatgcagcgattgtggcaaattcttcctccaagcctcaaactttatt cagcatcggcgcatccacactggagaaaaaccgtttaagtgcggagaatgtgggaagagc tacaatcagcgggtgcacctcacccagcaccagcgcgtccacacaggggagaaaccctac aaatgtcaggtgtgcggaaaggctttccgggtgagttcccacctggttcagcaccacagt gtccacagcggagagaggccctatggctgcaatgagtgtgggaagaacttcggtcgccat tcgcatctgatcgaacacctaaaacgccacttcagggagaaatcccagagatgcagtgac aaaagaagtaagaacacaaaattaagtgttaagaagaaaatttcagaatattcagaagca gacatggaactatctggaaaaacccaaagaaatgtttctcaagttcaagattttggagaa ggctgtgagtttcaaggcaagctggatagaaagcagggaattcccatgaaagagatacta ggacaaccatcttcaaagaggatgaactacagtgaagtcccatatgtccacaaaaaatcc tccactggagagagaccacataaatgtaacgagtgtgggaaaagcttcattcagagtgca catcttattcaacatcaaagaatacacactggggagaaaccattcaggtgtgaggaatgt gggaaaagctacaaccaacgcgtgcacctaactcagcatcagcgcgtccacacaggtgag aagccctacacctgtcccttatgtgggaaagccttcagagtgaggtcccaccttgttcag catcagagcgtgcacagtggggagagacccttcaagtgtaacgaatgtgggaaaggcttt gggaggcgttcccacctggctggacatcttcgactccactcccgagagaaatcccatcag tgtcgtgaatgtggggaaatcttttttcagtacgttagcctaattgaacatcaggtgctc cacatgggtcagaaaaatgaaaaaaatggcatctgtgaggaagcatatagttggaacttg acagtgattgaagacaagaagattgagttacaagagcagccttatcagtgtgatatctgt ggaaaagcctttggttatagctcagacctcattcagcattacagaactcatacagcagag aagccctatcaatgtgatatatgtagagaaaatgttggccagtgttcccacaccaaacaa catcaaaaaatctactccagcacaaaatcccatcaatgtcatgaatgtggcagaggcttc actctgaagtcacatcttaatcaacatcagagaatccatactggaatgagggcagctaaa cccatagaaggagttggaccaaggcgaattacgagtcctggtcccagcagtatgtgtgct gacttctgggtgccccagaaatag >gi568815591r:99393532_99600093|GENSCAN_predicted_peptide_8|344_aa MTPESRDTTDLSPGGTQEMEGIVIVKVEEEDEEDHFQKERNKVESSPQVLSRSTTMNERA LLSSYLVAYRVAKEKMAHTAAEKIILPACMDMVRTIFDDKSADKLRTIPLSDNTISRRIC TIAKHLEAMLITRLQSGIDFAIQLDESTDIASCPTLLVYVRYALCSPTTLGPHADRLRTT DRAWAAEQNPHPGANHNSGLLPGIAKKGFGSGRTEGAGRFSSRSSARPPRSGKWRNRPSL PFTERHWLLVPLPRPQSRSRKIQSQNCSYTEDSSAPYPPWVARPAGSSTEEQFHPYGECG KSFTSSSCFTVHQESAWGETLQCNGCEKASSLSMTFVVDRREAL >gi568815591r:99393532_99600093|GENSCAN_predicted_CDS_8|1035_bp atgactcctgaatcaagggatactacagatttgtctccagggggtacccaggagatggaa ggcatcgtgatagtgaaggtggaggaggaagatgaagaagaccattttcaaaaggaaaga aacaaagtagagtcatcgccacaagttctcagtcgctctacaactatgaatgagagagcc ttattgtcatcgtatttagttgcatatagagtggcaaaagagaaaatggctcacacagcg gctgaaaaaattatccttccagcatgtatggacatggtacggacaatttttgatgacaaa tcagctgataaactaagaactatacctcttagtgataatacaatatctcgtcgaatctgt acgattgcaaaacatttggaagcaatgcttattacacggctgcagtccggtatagacttt gcaatccaactcgatgagagcactgatattgcaagttgtcccacactcttggtttatgtc agatatgccctctgcagccccaccacacttggtccccacgctgaccggctaaggaccacg gacagagcctgggctgcagaacagaacccacacccgggcgcaaaccacaactcagggctg cttcctgggatagcgaagaaaggctttgggagtgggcggaccgagggcgcagggcgcttc tcctccaggtcctcggctcggccaccccggtccggcaaatggcggaacagacccagcctg cccttcactgagcgccactggttgctggtgcccctgcctcggccacagtcgcgatctcgg aaaatccagagccagaactgtagttatacagaagactccagtgctccatacccaccctgg gtggcaaggcctgcaggtagctccactgaagagcaattccacccttacggcgagtgtggg aaaagcttcacgtcaagctcatgcttcactgtgcaccaagaatccgcctggggagagacc ctacagtgtaatgggtgtgaaaaggcttccagcctcagcatgacatttgtggtagacaga agagaagccctgtaa >gi568815591r:99393532_99600093|GENSCAN_predicted_peptide_9|712_aa MEGSGTGKRRGKAAKTSLRIMDARAQLLLRVPHPGPSLTSGALTHIRDPHPGLSPTSGTL MPGRRRGGPHSGPCTPSPEVPPRSAGLGAVYSGPGEAVVAPSASVAVMEEIPAQEAAGSP RVQFQSLETQSECLSPEPQFVQDTDMEQGLTGAPPVPQVPALPREGSPGDQAAALLTARY QSSQDVLFQEFVTFEDVAVHLTREEWGYLDPVQRDLYREVMLENYGNVVSLGFPISKPDG ISQLEQDLQVFDLETKTREVLRDDCSDGETREENKLLIPKQKISEEVHSYKVRVGRLKHD ITQVPETREVYKSEDRLERLQEILRKFLYLEREFRQITISKETFTSEKNNECHEPEKSFS LDSTIDADQRVLRIQNTDDNDKYDMSFNQNSASGKHEHLNLTEDFQSSECKESLMDLSHL NKWESIPNTEKSYKCDVCGKIFHQSSALTRHQRIHTREKPYKCKECEKSFSQSSSLSRHK RIHTREKPYKCEASDKSCEASDKSCSPSSGIIQHKKIHTRAKSYKCSSCERVFSRSVHLT QHQKIHKEMPCKCTVCGSDFCHTSYLLEHQRVHHEEKAYEYDEYGLAYIKQQGIHFREKP YTCSECGKDFRLNSHLIQHQRIHTGEKAHECNECGKAFSQTSCLIQHHKMHRKEKSYECN EYEGSFSHSSDLILQQEVLTRQKAFDCDVWEKNSSQRAHLVQHQSIHTKENS >gi568815591r:99393532_99600093|GENSCAN_predicted_CDS_9|2139_bp atggagggcagtgggaccggaaaaagacgtggaaaagctgcgaaaacgagccttcgaatc atggacgcgcgggcccagctcctcctccgagttcctcatccggggccgtcactcacatcc ggggccctcactcacatccgggaccctcatccggggctctcacccacatccgggaccctc atgcctgggcggaggagggggggccctcattcgggaccctgcactccgtcgccggaagtg ccaccgagaagcgccggcctcggggctgtctacagcggcccgggagaggctgtggtggcc ccgagcgcgagtgtagcagtgatggaggaaataccagcccaggaagcagcagggtcacca agggtccagtttcagtctttggagacccagtctgagtgtctgtccccagagcctcagttt gtgcaggacaccgacatggaacagggactcactggggctccacctgttcctcaggtgcct gctcttccccgtgagggaagcccaggagaccaggcagctgcgctcttgacagccaggtac cagagcagccaggatgtgttatttcaggagtttgtgacattcgaggatgtggctgtgcac cttactcgagaggaatggggatacctggaccctgttcagagggacctctacagagaagtg atgttagagaattatgggaacgtggtctcactgggatttccaatttccaagcctgatggg atctcccagctggaacaggatctacaggtctttgatctggaaactaagactagagaagtc ttaagagatgactgctcagatggagagaccagagaagagaacaagctgttgattcctaag cagaaaatttcggaagaagtgcattcatacaaagtgagagtaggaagactcaaacacgat attacccaagttcctgagactagagaagtgtataagtctgaggacagattagaaagactt caggaaattctaaggaaatttctgtacctggagagagagtttaggcaaataacaatcagc aaggaaaccttcaccagtgagaagaacaatgaatgtcatgaacccgaaaaaagcttcagt ctggactctactattgatgcagatcagagagttcttagaatacagaataccgatgacaat gataagtatgacatgagcttcaaccagaattcagcctctggtaaacatgaacacttaaat ctaacagaggattttcagagtagtgaatgtaaggaaagcttaatggatctctcccacctt aataaatgggagagcatccctaacactgagaaatcctataaatgtgatgtatgtgggaaa attttccatcagagctcagcccttactagacatcagagaatccatactagagagaagccc tacaaatgtaaagaatgtgaaaagtctttcagtcagagctcaagtcttagtcgacataaa agaatacacactagagaaaaaccttacaaatgtgaagcatctgataaatcctgtgaagcg tctgataaatcctgtagtccaagctcaggcataattcagcataagaaaattcacaccaga gccaaatcttacaaatgtagcagttgtgaaagagtcttcagtcgtagtgtccaccttact caacatcagaaaattcacaaagagatgccctgtaagtgtactgtatgtggcagtgacttc tgccatacttcatacctacttgaacatcagagggtccatcatgaagagaaagcctatgag tatgatgaatatgggttggcctatattaaacaacaaggaattcatttcagagaaaagccc tatacgtgtagtgaatgtggaaaagacttcagattgaattcacatcttattcagcatcaa agaattcacacaggagagaaagcacatgaatgtaatgaatgtggaaaagctttcagtcaa acctcatgccttattcagcatcacaaaatgcataggaaagagaaatcgtatgaatgtaat gagtatgagggcagtttcagtcatagctcagatcttatcctgcaacaagaagtcctcacc agacagaaagcctttgattgtgatgtatgggaaaagaactccagtcagagagcacatcta gttcaacatcagagcattcataccaaagagaactcatga >gi568815591r:99393532_99600093|GENSCAN_predicted_peptide_10|194_aa MQALYTFCDIRSNVQGRGGENDIIPNIAEGVHPHYDIAFNIQKGRGYYSQYRRGGRKDDI TPNSAGGVHPYFDIVVNIREREDDVNPNIAGSVHPPCDIVFNIGGGDNITPSFAKGVHTS RVVEDDITSNIAGRGWTKAKTPDRDCRNSARPLETGGGPRPCDQKPRKLGPAANCVSSRY GADPSSVLRTGAHS >gi568815591r:99393532_99600093|GENSCAN_predicted_CDS_10|585_bp atgcaggctctgtacaccttctgtgacattaggagtaatgtccaagggagaggtggagag aatgacattattcccaatatcgcagagggtgtacaccctcactatgatattgcttttaat atccagaaggggagaggatattactcccaatatcggaggggggggagaaaggatgacatt actcccaatagcgcagggggtgtacacccctactttgacattgtggttaatatcagggaa agagaggatgatgttaaccccaatatcgcagggagtgtacacccaccctgtgatattgtt tttaatattggggggggagacaatattactcccagttttgcaaaaggtgtacacacatcc agggtggtagaggatgatattacttccaatattgcagggaggggatggacgaaagcaaag acccctgaccgtgactgcaggaactctgccaggcccttagaaaccggcggcgggccccgc ccctgcgatcagaagccccgcaagctgggccccgccgccaactgcgtttccagccgctac ggggccgacccctcctccgttctccgcactggggcgcacagctag