GENSCAN 1.0    Date run: 3-Nov-116    Time: 02:23:33

Sequence gi568815591r:99319970_99535242 : 215273 bp : 49.74% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.00 Prom +   1810   1849   40                              -1.56
 1.01 Init +   5716   6035  320  0  2   69   97   126 0.525   6.90
 1.02 Intr +   6235   6350  116  1  2   32   21    86 0.068  -3.61
 1.03 Intr +  18212  18316  105  0  0   72  103   132 0.784  13.29
 1.04 Intr +  24324  24546  223  1  1   69  105   154 0.891  12.39
 1.05 Intr +  28883  28990  108  2  0  106   75    61 0.981   6.00
 1.06 Intr +  33940  34152  213  1  0   70   73   234 0.488  17.93
 1.07 Intr +  38275  38446  172  1  1   46   68    43 0.333  -1.95
 1.08 Intr +  39576  39769  194  2  2   19   63   355 0.471  24.39
 1.09 Intr +  54283  54702  420  1  0   85   41   235 0.026  11.56
 1.10 Intr +  65733  65809   77  0  2   82   86   132 0.082  11.56
 1.11 Intr +  66716  66820  105  0  0  104   91   186 0.997  20.69
 1.12 Intr +  68070  68292  223  1  1  121   68   488 0.999  47.29
 1.13 Intr +  69936  70043  108  0  0   94   89   192 0.999  19.30
 1.14 Intr +  70924  71130  207  1  0   86   69   220 0.999  18.09
 1.15 Intr +  71209  71284   76  1  1   82   77   102 0.998   8.02
 1.16 Intr +  72702  72907  206  2  2   90   94   404 0.999  39.20
 1.17 Intr +  74060  74150   91  2  1  103  100   142 0.993  16.90
 1.18 Term +  74482  74520   39  0  0  106   43    38 0.995  -1.71
 1.19 PlyA +  74793  74798    6                               1.05

 2.09 PlyA -  75508  75503    6                               1.05
 2.08 Term -  76771  76713   59  2  2  100   36    57 0.823  -0.45
 2.07 Intr -  78044  77893  152  1  2   59  116   241 0.705  23.81
 2.06 Intr -  80455  80334  122  1  2   80   44   221 0.991  16.19
 2.05 Intr -  83536  83429  108  1  0   54   84   170 0.693  13.68
 2.04 Intr -  84984  84893   92  1  2   74   96    84 0.742   7.51
 2.03 Intr -  88751  88567  185  1  2  -26   78   170 0.238   4.03
 2.02 Intr -  89219  89065  155  2  2   48   84    29 0.066  -2.63
 2.01 Init -  89494  89417   78  1  0   63   41   114 0.249   5.26
 2.00 Prom -  90246  90207   40                             -10.64

 3.00 Prom +  90581  90620   40                              -7.76
 3.01 Init +  91124  91217   94  1  1   65  113    75 0.745   8.24
 3.02 Intr +  96169  96291  123  2  0  109   76   163 0.999  17.76
 3.03 Intr +  97460  97626  167  0  2  104   40   186 0.966  15.08
 3.04 Term +  99422  99472   51  1  0  113   54    63 0.937   2.83
 3.05 PlyA +  99625  99630    6                               1.05

 4.09 PlyA -  99640  99635    6                              -4.73
 4.08 Term - 100180  99998  183  1  0   52   45   290 0.967  18.64
 4.07 Intr - 103988 103806  183  2  0   91   65   368 0.998  34.78
 4.06 Intr - 105647 104826  822  2  0  111  113   856 0.984  82.02
 4.05 Intr - 109235 109134  102  2  0  108   91   178 0.999  20.47
 4.04 Intr - 109837 109619  219  1  0  112   81   444 0.993  44.60
 4.03 Intr - 112756 112607  150  1  0   92  -20   119 0.600   1.86
 4.02 Intr - 113449 113309  141  1  0  122   57   272 0.902  28.05
 4.01 Init - 115273 114821  453  1  0   78  116   425 0.699  38.16
 4.00 Prom - 117282 117243   40                              -8.76

 5.00 Prom + 118101 118140   40                             -11.33
 5.01 Init + 119114 119216  103  1  1  109   53   233 0.727  22.30
 5.02 Intr + 124820 124870   51  0  0  119  116    19 0.218   6.68
 5.03 Intr + 128152 128304  153  2  0  122   99   177 0.523  22.24
 5.04 Intr + 129549 129693  145  1  1   58   55   100 0.929   3.04
 5.05 Intr + 130200 130402  203  0  2   60   90   203 0.572  16.63
 5.06 Intr + 130733 130826   94  0  1   86   73    18 0.947  -0.88
 5.07 Intr + 132399 132471   73  0  1   38   81    92 0.960   2.91
 5.08 Intr + 133997 134167  171  1  0   74  113    99 0.480  11.14
 5.09 Intr + 137846 138045  200  1  2   80   47   127 0.072   5.95
 5.10 Intr + 138533 138698  166  2  1  116   59    58 0.044   5.66
 5.11 Intr + 146082 146219  138  2  0   68   45    80 0.133   2.36
 5.12 Intr + 148826 149031  206  1  2   88   81    39 0.099   1.20
 5.13 Intr + 151347 151506  160  0  1   66   74    26 0.075  -1.01
 5.14 Intr + 156434 156511   78  1  0  100  105    29 0.400   5.55
 5.15 Intr + 159692 159818  127  1  1   38   49   146 0.174   5.95
 5.16 Term + 166468 167519 1052  2  2  124   41   381 0.419  29.30
 5.17 PlyA + 167606 167611    6                               1.05

 6.04 PlyA - 171256 171251    6                               1.05
 6.03 Term - 174662 173560 1103  0  2   43   36   529 0.220  35.44
 6.02 Intr - 178873 178747  127  1  1   86   72    76 0.978   6.05
 6.01 Init - 180124 179669  456  1  0   68   94   500 0.691  44.41
 6.00 Prom - 185733 185694   40                              -1.66

 7.00 Prom + 186003 186042   40                              -7.56
 7.01 Init + 186082 186489  408  0  0   58  109   508 0.534  46.45
 7.02 Intr + 192484 192622  139  0  1   48   77    69 0.428   1.84
 7.03 Intr + 199858 199940   83  2  2  122   99     2 0.971   4.06
 7.04 Intr + 200200 200335  136  0  1   14   82   142 0.981   6.34
 7.05 Intr + 205844 206449  606  0  0  109   57   450 0.638  36.42
 7.06 Intr + 211139 212149 1011  0  0  108   85   553 0.577  47.67
 7.07 Term + 213108 213208  101  1  2   68   41    74 0.675  -1.01
 7.08 PlyA + 213459 213464    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr - 139294 139178  117  2  0   99   70   169 0.892  16.64
S.002 Intr - 140224 140117  108  2  0   77   55   102 0.861   6.06

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591r:99319970_99535242|GENSCAN_predicted_peptide_1|1000_aa
MPLRQHLPGVPWGSISSEGPPTVPRQWNGVPSSTAACLTPDCAPELCACADRPRGRNAQA
PCPLPGSVSRSLWASVLRPRPTEPVRVDCPESANPPLRARPDSPDPRMFELLAASDARIN
GPFCAPRVLLAPLLGAFVTVSKIVSKIALSPNNHEVHIYKKNGSQWVKAHELKEHNGHIT
GIDWAPKSDRIVTCGADRNAYVWSQKDGVWKPTLVILRINRAATFVKWSPLENKFAVGSG
ARLISVCYFESENDWWVSKHIKKPIRSTVLSLDWHPNNVLLAAGSCDFKCRVFSAYIKEV
DEKPASTPWGSKMPFGQLMSEFGGSGTGGWVHGVSFSASGSRLAWVSHDSTVSVADASKS
VQSPCENGSFVIICVCKEANSLLHLGITHVLVFRVSTLKTEFLPLLSVSFVSENSVVAAG
HDCCPMLFNYDDRGCLTFVSKLDIPKQSIQRNMSAMERFRNMDKRATTEDRNTALETLHQ
NSITACTVGAHRHQREGSSSLAADGAAGLASRSPACERGARSTRRPGSEDVTPRPCARQS
PGAVRPAQARRSSPAAAQAARLAPSLSFPIGKALTVVADGFCGQSLGLRRLRGYRQHPHP
EGRAGPGSRDRCPRSPPPGPTGLRSQAAMAYHSFLVEPISCHAWNKDRTQIAICPNNHEV
HIYEKSGAKWTKVHELKEHNGQVTGIDWAPESNRIVTCGTDRNAYVWTLKGRTWKPTLVI
LRINRAARCVRWAPNENKFAVGSGSRVISICYFEQENDWWVCKHIKKPIRSTVLSLDWHP
NNVLLAAGSCDFKCRIFSAYIKEVEERPAPTPWGSKMPFGELMFESSSSCGWVHGVCFSA
SGSRVAWVSHDSTVCLADADKKMAVATLASETLPLLALTFITDNSLVAAGHDCFPVLFTY
DAAAGMLSFGGRLDVPKQSSQRGLTARERFQNLDKKASSEGGTAAGAGLDSLHKNSVSQI
SVLSGGKAKCSQFCTTGMDGGMSIWDVKSLESALKDLKIK

>gi568815591r:99319970_99535242|GENSCAN_predicted_CDS_1|3003_bp
atgccgcttcgccagcacttgccaggcgttccgtggggcagcatctcctctgaagggccg
ccgacagtaccacgccaatggaacggcgtaccctcctccactgcagcctgcctaaccccg
gactgcgcgcccgagctgtgcgcatgcgcggaccgtccgcgcggtcgcaacgcgcaggcg
ccgtgtccacttccgggatctgtcagccgctccctctgggcttccgtcctccgcccgcgc ccgacggagcctgttcgcgtcgactgcccagagtccgcgaatcctccgctccgagcccgt ccggactcccccgatcccaggatgttcgagctcctagccgcctccgatgcccgaattaac ggccccttctgcgcccccagggtcctacttgcacctctgctgggggcatttgtcaccgtc tctaaaattgtgtcaaagattgccctcagtcccaataatcacgaagtgcacatctataag aagaacgggagccagtgggtgaaagctcatgaactcaaggagcacaacggacacatcaca ggtattgactgggctcccaagagcgaccgcattgtcacttgtggggcagaccgcaatgcc tatgtctggagtcagaaagatggtgtttggaagccaaccctggtgatcctgagaattaat cgcgcagctacttttgtgaagtggtcccccctagagaacaaatttgctgtgggaagtgga gcacgactcatttctgtttgttactttgagtctgaaaatgactggtgggtgagcaagcac attaaaaagccgattcgctccacagtcctcagcttggattggcatcccaacaacgttttg ctggcagcaggatcatgtgacttcaaatgcagagtgttttctgcctacattaaagaagtg gatgaaaagccagccagcacgccctggggcagcaagatgccttttgggcagctgatgtca gagtttggtggcagtggcactggtggctgggtccacggggtaagcttctctgccagtggg agccgcctggcctgggtcagccacgacagcaccgtgtctgttgctgatgcctcaaaaagt gtgcaaagcccctgtgagaatgggtcctttgtgatcatctgtgtctgtaaagaagctaat agtctgttgcatcttgggataacacatgttcttgtgttcagggtctcgactctgaagaca gagttcctgccgctcctaagtgtgtcatttgtctcagagaacagcgtcgtggctgctggc catgactgctgcccaatgctctttaactacgatgaccgcggctgcctgaccttcgtctcc aagttagatattccaaaacagagcatccaacgcaacatgtctgccatggaacgcttccgc aacatggacaagagagccacaactgaggaccgcaacacggccttggagacgctgcaccag aatagcatcacggcctgcacagttggcgcccaccggcatcagcgtgaagggagctcctcg ctggctgccgacggggctgcagggctcgctagccgctcacctgcgtgtgagcgcggagcc cgcagtacccgccgtccgggctcagaggacgtgactccccgaccctgcgcccggcagtcc ccgggggccgtgcgcccggcccaggctcggaggtccagcccagcggcggctcaggctgcg cgcctggctcccagcctcagtttccccattggtaaagcattgacggtggttgcggacggc ttctgcggacagagccttgggctccgacgtctgcgcgggtaccggcagcacccccacccc gaagggagggcgggacctggatctcgcgaccgctgccccaggtccccgcccccgggccct accgggctgaggagccaagccgccatggcctaccacagcttcctggtggagcccatcagc tgccacgcctggaacaaggaccgcacccagattgccatctgccccaacaaccatgaggtg catatctatgaaaagagcggtgccaaatggaccaaggtgcacgagctcaaggagcacaac gggcaggtgacaggcatcgactgggcccccgagagtaaccgtattgtgacctgcggcaca 
gaccgcaacgcctacgtgtggacgctgaagggccgcacatggaagcccacgctggtcatc ctgcggatcaaccgggctgcccgctgcgtgcgctgggcccccaacgagaacaagtttgct gtgggcagcggctctcgtgtgatctccatctgttatttcgagcaggagaatgactggtgg gtttgcaagcacatcaagaagcccatccgctccaccgtcctcagcctggactggcacccc aacaatgtgctgctggctgccggctcctgtgacttcaagtgtcggatcttttcagcctac atcaaggaggtggaggaacggccggcacccaccccgtggggctccaagatgccctttggg gaactgatgttcgaatccagcagtagctgcggctgggtacatggcgtctgtttctcagcc agcgggagccgcgtggcctgggtaagccacgacagcaccgtctgcctggctgatgccgac aagaagatggccgtcgcgactctggcctctgaaacactaccactgctggcgctgaccttc atcacagacaacagcctggtggcagcgggccacgactgcttcccggtgctgttcacctat gacgccgccgcggggatgctgagcttcggcgggcggctggacgttcctaagcagagctcg cagcgtggcttgacggcccgcgagcgcttccagaacctggacaagaaggcgagctccgag ggtggcacggctgcgggcgcgggcctagactcgctgcacaagaacagcgtcagccagatc tcggtgctcagcggcggcaaggccaagtgctcgcagttctgcaccactggcatggatggc ggcatgagtatctgggatgtgaagagcttggagtcagccttgaaggacctcaagatcaaa tga >gi568815591r:99319970_99535242|GENSCAN_predicted_peptide_2|316_aa MVGGNQIEEKEDVGGNQIEKKDVPGGKNERGAASSRELSPASRSKARTHPSGWLRVQPLI SPSRTVQTTGLPPQPLPRTVRLRAEASLRSGSLRPLRPLKRRRRLAQEEEVEPKLFPVPV SVRGAAAAAAAAGAAMPKGGRKGGHKGRARQYTSPEEIDAQLQAEKQKAREEEEQKEGGD GAAGDPKKEKKSLDSDESEDEEDDYQQKRKGVEGLIDIENPNRVAQTTKKVTQLDLDGPK ELSRREREEIEKQKAKERYMKMHLAGKTEQAKADLARLAIIRKQREEAARKKEEERKAKD DATLSGKRMQSLSLNK >gi568815591r:99319970_99535242|GENSCAN_predicted_CDS_2|951_bp atggtaggcggtaaccagatcgaggagaaagaggacgtaggaggtaatcagatcgagaag aaagatgtgccgggaggtaagaacgaacgtggcgccgcctcctctcgggagctctctccg gcctcaaggtccaaagcccgaacacatcccagtggctggctgagagtccagccactcata tctccctccaggacggtgcagaccaccggtctcccgcctcagccgttgccaaggacggtc cgacttcgtgcggaggcctccctgaggtccgggtccttgcggccactgcggccactgaag cggcggcggcggctggcccaggaggaagaagtcgagcccaagctatttccggttccggtg tcagttcgaggcgccgccgccgccgccgcagccgccggagccgcaatgcctaaaggagga agaaagggaggccacaaaggccgggcgaggcagtatacaagccctgaggagatcgacgcg cagctgcaggctgagaagcagaaggccagggaagaagaggagcaaaaagaaggtggagat ggggctgcaggtgaccccaaaaaggagaagaaatctctagactcagatgagagtgaggat 
gaagaagatgactaccagcaaaagcgcaaaggcgttgaagggctcatcgacatcgagaac cccaaccgggtggcacagacaaccaaaaaggtcacacaactggatctggacgggccaaag gagctttcgaggagagaacgagaagagattgagaagcagaaggcaaaagagcgttacatg aaaatgcacttggccgggaagacagagcaagccaaggctgacctggcccggctggccatc atccggaaacagcgggaggaggctgcccggaagaaggaagaggaaaggaaagcaaaagac gatgccacattgtcaggaaaacgaatgcagtcactctccctgaataagtaa >gi568815591r:99319970_99535242|GENSCAN_predicted_peptide_3|144_aa MPKVKRSRKAPPDGWELIEPTLDELDQKMREAETEPHEGKRKVESLWPIFRIHHQKTRYI FDLFYKRKAISRELYEYCIKEGYADKNLIAKWKKQGYENLCCLRCIQTRDTNFGTNCICR VPKSKLEVGRIIECTHCGCRGCSG >gi568815591r:99319970_99535242|GENSCAN_predicted_CDS_3|435_bp atgcctaaagtcaaaagaagccggaaagcacccccagatggctgggagttgattgagcca acactggatgaattagatcaaaagatgagagaagctgaaacagaaccgcatgagggaaag aggaaagtggaatctctgtggcccatcttcaggatccaccaccagaaaacccgctacatc ttcgacctcttttacaagcggaaagccatcagcagagaactctatgaatattgtattaaa gaaggctatgcagacaaaaacctgattgcaaaatggaaaaagcaaggatatgagaacttg tgctgcctgcggtgcattcagacacgggacaccaacttcgggacgaactgcatctgccgc gtgcccaaaagcaagctggaagtgggccgcatcatcgagtgcacacactgtggctgtcgt ggctgctctggctga >gi568815591r:99319970_99535242|GENSCAN_predicted_peptide_4|750_aa MDFVRLARLFARARPMGLFILQHLDPCRARWAGGREGLMRPMWAPFSSSSSQLPLGQERQ ENTGSLGSDPSHSNSTATQEEDEEEEESFGTLSDKYSSRRLFRKSAAQFHNLRFGERRDE QMEPEPKLWRGRRNTPYWYFLQCKHLIKEGKLVEALDLFERQMLKEERLQPMESNYTVLI GGCGRVGYLKKAFNLYNQVGRETEKRNKTQRQSIEKEQWSPGTGAQHTLSIPHTEDLRIR RTCAAQALMKKRDLEPSDATYTALFNVCAESPWKDSALQSALKLRQQLQAKNFELNLKTY HALLKMAAKCADLRMCLDVFKEIIHKGHVVTEETFSFLLMGCIQDKKTGFRYALQVWRLM LSLGLQPSRDSYNLLLVAARDCGLGDPQVASELLLKPREEATVLQPPVSRQRPRRTAQAK AGNLMSAMLHVEALERQLFLEPSQALGPPEPPEARVPGKAQPEVDTKAEPSHTAALTAVA LKPPPVELEVNLLTPGAVPPTVVSFGTVTTPADRLALIGGLEGFLSKMAEHRQQPDIRTL TLLAEVVESGSPAESLLLALLDEHQVEADLTFFNTLVRKKSKLGDLEGAKALLPVLAKRG LVPNLQTFCNLAIGCHRPKDGLQLLTDMKKSQVTPNTHIYSALINAAIRKLNYTYLISIL KDMKQNRVPVNEVVIRQLEFAAQYPPTFDRYQGKNTYLEKIDGFRAYYKQWLTVMPAEET PHPWQKFRTKPQGDQDTGKEADDGCALGGR >gi568815591r:99319970_99535242|GENSCAN_predicted_CDS_4|2253_bp 
atggacttcgtgagactcgctcgactgttcgccagggcccgccccatgggactgttcatc ctgcaacacctggacccctgtagagccaggtgggcaggaggcagggaggggctgatgcgg ccaatgtgggcgcccttcagcagctcctcctctcagctgcccctcggccaggagcgtcag gaaaacacgggcagcctgggctctgacccgagccactccaactccacggccacgcaggaa gaagacgaggaggaggaggagagttttgggaccctctctgacaaatactcctcccggaga ctattccgcaaatccgcagcccagttccataacctgcggtttggggaacggagagatgag caaatggaaccggagcccaaattatggcgaggccggagaaacaccccgtactggtacttc ttgcagtgcaaacacctgatcaaggaagggaagctggttgaagccctggacctgtttgag aggcagatgctgaaggaggagcgattgcagcccatggagagcaactacacggtgctgatt gggggctgcgggcgggttggctacctgaagaaggccttcaacctctacaaccaggtggga cgagagactgagaaaagaaataagacacagagacaaagtatagagaaagaacagtggtcc cctgggaccggtgctcagcatacgctcagcatcccgcatacggaggacctgcgcatacgg aggacctgcgctgcgcaggcactgatgaaaaagcgggacctggagccctcggacgccacc tacacggccctgttcaacgtctgtgccgagtccccctggaaggactcagctctacagagc gccctgaagctccggcagcagctgcaggccaaaaacttcgagctcaacttgaaaacatac cacgcgctgctgaagatggctgccaagtgcgcagaccttaggatgtgcctcgatgtgttc aaggaaatcatccacaaagggcacgtggtcacagaggagaccttcagtttcctgctcatg ggctgcatccaagacaagaagacaggcttccggtacgccctccaggtgtggcggctgatg ctgagtctagggctacagccgagccgggacagctacaacctgctgttggtggcagctcgg gactgtggcctaggggacccccaggtggcctcagagctgcttctgaagcccagggaggag gcgactgtgcttcagcccccagtgagcaggcagcggccaaggaggacagcccaggccaag gcaggcaacctcatgtcagccatgctgcatgtggaggccctggagaggcagctgtttctg gaaccttctcaggcacttgggcctccagagcctccggaagccagagtgcccggcaaggcc caaccagaggtggatactaaggcagagcccagccacacagcagccctcaccgcagtggcc ctgaagccacctcccgtggagctggaagtcaacctcctgacccccggggccgttccccct acagtggtctcctttggaacggtgaccaccccagctgaccggctggccttgatagggggc ctggagggcttcctgagcaagatggcagagcacaggcagcagcccgacatcaggaccctc acgctactggccgaggtggtggagtccgggagtcctgcagagtccttgctgctggccctc ctggatgagcaccaggtagaggccgacctgacattctttaacacgctggtgagaaagaag agcaagctgggagacctggagggggccaaggcgctgttgccggtcctggcaaagaggggc ctcgtccccaacctgcagacattctgcaacctggccatcgggtgccacaggccgaaggac ggtctacagcttctcacagacatgaagaagtcccaggtgacccccaacactcacatctac 
agtgccctcatcaacgcggccatcaggaagctgaactacacctatctcatcagcatcttg aaggacatgaagcagaacagggtcccggtgaacgaagtggtcatccgccagctggagttt gcagcccagtaccctcccacctttgaccggtaccaagggaagaacacctacctggagaag attgacggcttccgagcctattacaagcagtggctgacagtgatgcccgcagaggaaacc ccgcacccctggcagaagttccggaccaagccccagggggaccaggacaccggcaaggag gctgatgacggatgtgcccttgggggcaggtga >gi568815591r:99319970_99535242|GENSCAN_predicted_peptide_5|1039_aa MQEIIASVDHIKFDLEIAVEQQLGAQPLPFPGMDKSGAAVCEFFLKAACGKGGMCPFRHI SGEKTVVCKHWLRGLCKKGDQCEFLHEYDMTKMPECYFYSKFALVQDALSAASVTCTTDI GLRAPFPGTGGTMVKRSLPCEAFTLVVQGKRARQAKTGHQNLFSLDEAKVGLAPAFPVVS RLSFPAGECSNKECPFLHIDPESKIKDCPWYDRGFCKHGPLCRHRHTRRVICVNYLVGFC PEGPSCKFMHPRFELPMGTTEQPPLPQQTQPPAKQSNNPPLQRSSSLIQLTSQNSSPNQQ RTPQVIGVMQSQNSSAGNRGPRPLEQVTCYKGDAETQDAGHPCLEPQDSRGGIWRSQVRK PLCYTIKGQPTKGQNVCKCTCPQHSWQGTDFGTAASGRISRPGGHKDTSGANLPPGQPLS FQKEQREHTLRQTAPENPSFRGRTALVPQNGSECGSGPWTLAPASTTESKQPYLGPDTHQ LTPFWSPGVRCAGPREGCWAGHSRGRRAAGAQDPHTGIQGGPGKRPEGSYANARSPGVDG RTEGAGAVICICINANVLVNELGLFRGRDMDEAGGHQPQQTNTGTENQTQHVLIHKWELN IENAWTQRGEQHTPGPFGESGLSQTSRIPPLAKDQAVEAMFPPARGKELLSFEDVAMYFT REEWGHLNWGQKDLYRDVMLENYRNMVLLVDRSFTSFLVIWLGSEARHKMKKLTPKQKFS EDLESYKISVVMQESAEKLSEKLHKCKEFVDSCRLTFPTSGDEYSRGFLQNLNLIQDQNA QTRWKQGRYDEDGKPFNQRSLLLGHERILTRAKSYECSECGKVIRRKAWFDQHQRIHFLE NPFECKVCGQAFRQRSALTVHKQCHLQNKPYRCHDCGKCFRQLAYLVEHKRIHTKEKPYK CSKCEKTFSQNSTLIRHQVIHSGEKRHKCLECGKAFGRHSTLLCHQQIHSKPNTHKCSEC GQSFGRNVDLIQHQRIHTKEEFFQCGECGKTFSFKRNLFRHQVIHTGSQPYQCVICGKSF KWHTSFIKHQGTHKGQIST >gi568815591r:99319970_99535242|GENSCAN_predicted_CDS_5|3120_bp atgcaggaaatcatcgccagcgtggaccacatcaagtttgacttggagatcgcggtggag cagcagctgggggcgcagccgctgcccttccccggcatggacaagtcgggcgctgctgtc tgtgaattctttttgaaagctgcctgcggcaaagggggcatgtgtccgtttcgccacatc agtggtgagaagacagttgtgtgcaaacactggctgcgtggcctatgcaagaaaggggac cagtgtgagttcctgcatgagtatgacatgaccaagatgcccgagtgctacttctactcc aagttcgccctggtgcaggacgcactgtctgcagcttctgtcacctgcaccacagacatt gggttgagggccccctttccaggcactggagggacaatggtgaagaggtccttgccctgt 
gaagccttcactcttgtggtccagggaaagagggctcgtcaggccaaaactggacaccag aacctcttttccttggatgaagccaaggtgggcctggccccggccttcccagtggtctca cgtctgagtttccctgcaggggagtgcagcaacaaggaatgtcccttcctgcacatcgac cccgagtccaagatcaaggactgtccttggtatgaccgtggcttctgcaagcacggtccc ctctgcaggcaccggcacacacggagagtcatctgtgtgaattacctcgtgggattctgc ccggaggggccctcgtgtaaattcatgcaccctcgatttgaactgcccatgggaaccacc gagcagcccccactgccgcagcagacacagcctccagcaaagcaaagtaacaatccgcca ttacaaaggtcgtcctccttgatccagttaacgagtcagaactcttctcccaatcagcag agaaccccgcaggtcatcggggtcatgcagagtcaaaacagcagcgcgggcaaccgggga ccccggccactggagcaggtcacctgttacaagggtgatgcagaaacacaggatgcgggc catccctgcctggagccacaggacagccggggtgggatctggcgctctcaggtacgcaag cctttgtgttacaccataaagggccagccaactaaaggccagaatgtctgcaaatgcacc tgcccacagcactcatggcaagggacggacttcggcactgctgcttcgggaagaatcagc agacccggggggcacaaagacacctcaggagccaacctgccacctggccaaccactttct ttccagaaggaacagagggaacacacgctcagacaaacagccccagagaacccgtccttc agaggcaggaccgccctggtgccacagaatgggagtgagtgtggctctggaccctggacc ctggctcctgcttccaccacggagtccaaacagccttacctggggccggacactcaccaa ctgacgccattttggagtcctggtgtccgctgtgccggaccgcgcgagggctgctgggcc ggacactcgcggggacgcagagccgccggagcccaggatccgcacactggaatccaggga ggccccgggaagaggccggagggaagctacgccaatgcccgatcccctggggtggatgga aggactgagggcgcaggagctgtgatttgtatatgtataaatgccaacgttttagttaat gaactggggttgtttcggggaagggacatggatgaagctggtggccatcagcctcagcaa actaacacaggaacagaaaaccaaacacagcatgttctcattcataagtgggaattgaat attgagaacgcatggacacagagaggggaacaacacacacctgggccttttggggagtcg gggctcagccagacgtccaggatcccaccccttgcaaaagaccaggccgtggaagccatg ttcccaccagccagggggaaggagctgctctcgtttgaggatgtggcgatgtacttcacc agagaggagtggggccacctcaactggggtcagaaggacctctaccgagatgtgatgttg gagaactacaggaacatggtcttgctggtggataggtcatttacatcttttcttgttatt tggctaggttctgaagccagacacaagatgaaaaagctaactccaaaacagaaattttct gaagatttagagtcatataagatatcagtggtaatgcaggaatcagctgagaaactttca gaaaagttacataagtgtaaagaatttgtggacagttgcaggcttactttccctactagt ggtgatgaatacagcaggggcttccttcaaaaccttaaccttattcaagatcagaatgcg 
caaacaaggtggaagcagggcagatatgatgaggatggcaaacccttcaatcaaagatct ttgcttttggggcatgagcgaattctcacaagagcaaagtcttatgaatgcagtgaatgt ggaaaagtcattaggcgtaaggcatggtttgatcaacatcaaagaattcactttttagag aatccttttgagtgtaaggtctgtgggcaagccttcagacagcggtcagctcttacggtc cataaacagtgtcacctgcaaaacaagccatacagatgtcatgactgtggaaagtgtttt cggcagctcgcgtatcttgttgaacataagaggattcacaccaaagaaaaaccttataaa tgtagcaaatgtgaaaaaacgtttagtcagaattcaacccttattcgacatcaggtgatc catagtggagaaaaacgccataaatgccttgagtgtggaaaagcctttggccggcattca acccttctatgtcatcaacagattcacagtaaaccgaacacccataaatgcagtgaatgt ggacagtcctttggtaggaatgtggatctcattcagcatcaaagaatccatacaaaggag gaattctttcaatgtggagaatgtgggaaaacgtttagttttaagaggaatctttttcga catcaggtcattcacactggaagccaaccctaccaatgtgtcatatgtggaaaatctttc aagtggcacacaagctttattaagcaccagggcactcacaaaggacagatatccacatga >gi568815591r:99319970_99535242|GENSCAN_predicted_peptide_6|561_aa MNSSLTAQRRGSDAELGPWVMAARSKDAAPSQRDGLLPVKVEEDSPGSWEPNYPAASPDP ETSRLHFRQLRYQEVAGPEEALSRLRELCRRWLRPELLSKEQILELLVLEQFLTILPEEL QAWVREHCPESGEEAVAVVRALQRALDGTSSQGMVTFEDTAVSLTWEEWERLDPARRDFC RESAQKDSGSTVPPSLESRVENKELIPMQQILEEAEPQGQLQEAFQGKRPLFSKCGSTHE DRVEKQSGDPLPLKLENSPEAEGLNSISDVNKNGSIEGEDSKNNELQNSARCSNLVLCQH IPKAERPTDSEEHGNKCKQSFHMVTWHVLKPHKSDSGDSFHHSSLFETQRQLHEERPYKC GNCGKSFKQRSDLFRHQRIHTGEKPYGCQECGKSFSQSAALTKHQRTHTGEKPYTCLKCG ERFRQNSHLNRHQSTHSRDKHFKCEECGETCHISNLFRHQRLHKGERPYKCEECEKSFKQ RSDLFKHHRIHTGEKPYGCSVCGKRFNQSATLIKHQRIHTGEKPYKCLECGERFRQSTHL IRHQRIHQNKVLSAGRGGSRL >gi568815591r:99319970_99535242|GENSCAN_predicted_CDS_6|1686_bp atgaattccagcttgaccgcccagaggcgcggcagtgacgccgagttgggaccctgggtg atggctgcgaggtccaaggacgcggcgccgtcccaacgcgacggacttttgcccgtgaaa gtggaggaagactcacccggaagttgggagcccaactatcccgcggcttcgccggacccc gaaacttctcgactgcactttaggcagctgcgttaccaggaggtggctggaccggaagag gcgctgagccggctccgagaactctgtcgtcggtggctgagacccgagctgctctccaag gagcagatcctggagctgctggtgctggagcagttcctcaccatcctgcccgaggagctt caagcctgggtgcgagagcactgcccagagagcggggaggaggcggtggccgtggtgcgg gctctgcagcgagcgctcgatggaacctcatcccaggggatggtgactttcgaggacacg 
gctgtgtctctaacctgggaggagtgggagcgcctggacccagcacggagggacttctgc agagagagtgcgcagaaggattccgggagcacagttccgccgagtttggaaagcagagtg gagaacaaagagttgattccaatgcaacaaattttagaagaagcggagccacaggggcaa ctacaagaagcgttccaggggaagcgccccctgttttctaagtgtggcagtacccatgag gacagggtggaaaagcagtccggagaccccttgcccctgaaacttgaaaattctcctgaa gcagaaggactcaacagcatctcagatgtcaataagaatggttccatagaaggggaagac tctaaaaataatgaattgcagaacagtgccaggtgttccaaccttgttctatgtcagcac atcccgaaagcagagaggcccactgacagtgaggaacacgggaacaagtgcaagcaaagt ttccacatggtgacgtggcacgtgctgaaacctcacaagtctgacagtggagacagtttc catcattccagcctttttgagacccagaggcagctccatgaagaaagaccttataaatgt ggtaactgtgggaagagtttcaaacaacgctctgacctctttagacaccagagaatccac acaggtgagaaaccctatggctgccaagaatgtgggaaaagcttcagccagagtgctgcc ctgaccaagcaccagaggacacacacaggcgagaagccgtacacctgtctgaaatgtggg gagcgcttcaggcagaattcacacctaaatcgtcatcaaagtacccacagtagagacaaa cattttaaatgtgaggaatgcggggaaacctgtcatatttccaacctttttagacatcag agactacataaaggggaaagaccctataagtgtgaagaatgcgagaagagcttcaaacag cgctctgacctctttaaacaccacagaatccacactggggagaagccctatggatgttcc gtctgtgggaaacgcttcaatcagagtgcaaccctcattaaacaccagagaattcacact ggggaaaagccttacaaatgtcttgaatgtggggaaagatttagacaaagtacacacctt atccgacaccaaagaattcatcaaaataaagtgctgtcggctgggcgtggtggctcacgc ctataa >gi568815591r:99319970_99535242|GENSCAN_predicted_peptide_7|827_aa MTESREVIDLDPPAETSQEQEDLFIVKVEEEDCTWMQEYNPPTFETFYQRFRHFQYHEAS GPREALSQLRVLCCEWLRPELHTKEQILELLVLEQFLTILPEEFQPWVREHHPESGEEAV AVIENIQRELEERRQQIVACPDVLPRKMATPGAVQESCSPHPLTVDTQPEQAPQKPRLLE ENALPVLQVPSLPLKDSQELTASLLSTGSQKLVKIEEVADVAVSFILEEWGHLDQSQKSL YRDDRKENYGSITSMGYESRDNMELIVKQISDDSESHWVAPEHTERSVPQDPDFAEVSDL KGMVQRWQVNPTVGKSRQNPSQKRDLDAITDISPKQSTHGERGHRCSDCGKFFLQASNFI QHRRIHTGEKPFKCGECGKSYNQRVHLTQHQRVHTGEKPYKCQVCGKAFRVSSHLVQHHS VHSGERPYGCNECGKNFGRHSHLIEHLKRHFREKSQRCSDKRSKNTKLSVKKKISEYSEA DMELSGKTQRNVSQVQDFGEGCEFQGKLDRKQGIPMKEILGQPSSKRMNYSEVPYVHKKS STGERPHKCNECGKSFIQSAHLIQHQRIHTGEKPFRCEECGKSYNQRVHLTQHQRVHTGE KPYTCPLCGKAFRVRSHLVQHQSVHSGERPFKCNECGKGFGRRSHLAGHLRLHSREKSHQ 
CRECGEIFFQYVSLIEHQVLHMGQKNEKNGICEEAYSWNLTVIEDKKIELQEQPYQCDIC GKAFGYSSDLIQHYRTHTAEKPYQCDICRENVGQCSHTKQHQKIYSSTKSHQCHECGRGF TLKSHLNQHQRIHTGMRAAKPIEGVGPRRITSPGPSSMCADFWVPQK >gi568815591r:99319970_99535242|GENSCAN_predicted_CDS_7|2484_bp atgaccgaatcccgagaagttatagacttagaccccccagctgagacttcccaggagcag gaagaccttttcatagtgaaggtggaagaagaagactgcacctggatgcaggagtacaac ccgccaacgtttgagactttttaccagcgcttcaggcacttccagtaccatgaggcttca ggaccccgggaggctctcagccaactccgggtgctctgctgtgagtggctgaggcccgag ctgcacacgaaggagcagatcctggagctgctggtgctggagcagttcctgaccatcctg cctgaagagttccagccctgggtgagggaacatcaccctgaaagtggagaagaggcggtg gccgtgatagaaaatatacagcgagaacttgaggaacgcagacagcagattgttgcctgc cctgatgtgcttcctcggaagatggcaacacctggagcagtgcaggagtcctgcagcccc catcccctgaccgtggacacccagcctgagcaagcgccacagaagcctcgtctcctggag gaaaatgcccttcctgttctccaagttccttcccttcccctgaaggacagccaggagctg acagcttcacttctctcaactgggtcccagaagttggtgaaaattgaagaggtggctgat gtggctgtatccttcatcctggaggaatgggggcatttggaccagtcccagaagtccctt tatagggatgacaggaaggagaactatgggagtattacttccatgggttatgagtccagg gacaatatggagctcatagtgaagcagatttctgatgactctgaatcacactgggtggcg ccagaacacaccgaaaggagcgttcctcaggatccagactttgcagaagtcagtgacctt aaaggcatggtacaaaggtggcaggtcaaccccactgtggggaaatcaaggcagaatcct tcccagaaaagggatctggatgcaatcacagacatcagccctaagcaaagcacacatggc gagagagggcacagatgcagcgattgtggcaaattcttcctccaagcctcaaactttatt cagcatcggcgcatccacactggagaaaaaccgtttaagtgcggagaatgtgggaagagc tacaatcagcgggtgcacctcacccagcaccagcgcgtccacacaggggagaaaccctac aaatgtcaggtgtgcggaaaggctttccgggtgagttcccacctggttcagcaccacagt gtccacagcggagagaggccctatggctgcaatgagtgtgggaagaacttcggtcgccat tcgcatctgatcgaacacctaaaacgccacttcagggagaaatcccagagatgcagtgac aaaagaagtaagaacacaaaattaagtgttaagaagaaaatttcagaatattcagaagca gacatggaactatctggaaaaacccaaagaaatgtttctcaagttcaagattttggagaa ggctgtgagtttcaaggcaagctggatagaaagcagggaattcccatgaaagagatacta ggacaaccatcttcaaagaggatgaactacagtgaagtcccatatgtccacaaaaaatcc tccactggagagagaccacataaatgtaacgagtgtgggaaaagcttcattcagagtgca 
catcttattcaacatcaaagaatacacactggggagaaaccattcaggtgtgaggaatgt gggaaaagctacaaccaacgcgtgcacctaactcagcatcagcgcgtccacacaggtgag aagccctacacctgtcccttatgtgggaaagccttcagagtgaggtcccaccttgttcag catcagagcgtgcacagtggggagagacccttcaagtgtaacgaatgtgggaaaggcttt gggaggcgttcccacctggctggacatcttcgactccactcccgagagaaatcccatcag tgtcgtgaatgtggggaaatcttttttcagtacgttagcctaattgaacatcaggtgctc cacatgggtcagaaaaatgaaaaaaatggcatctgtgaggaagcatatagttggaacttg acagtgattgaagacaagaagattgagttacaagagcagccttatcagtgtgatatctgt ggaaaagcctttggttatagctcagacctcattcagcattacagaactcatacagcagag aagccctatcaatgtgatatatgtagagaaaatgttggccagtgttcccacaccaaacaa catcaaaaaatctactccagcacaaaatcccatcaatgtcatgaatgtggcagaggcttc actctgaagtcacatcttaatcaacatcagagaatccatactggaatgagggcagctaaa cccatagaaggagttggaccaaggcgaattacgagtcctggtcccagcagtatgtgtgct gacttctgggtgccccagaaatag