GENSCAN 1.0	Date run: 6-Nov-116	Time: 23:32:56

Sequence gi568815579f:54870016_55100895 : 230880 bp : 50.39% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   4275   4308   34  2  1   93   97    35 0.358   3.70
 1.02 Intr +   5315   5350   36  0  0   81  115     5 0.224   0.73
 1.03 Intr +  15220  15510  291  2  0  118  110   296 0.560  31.91
 1.04 Intr +  17992  18279  288  2  0   89   96   174 0.939  15.52
 1.05 Term +  19634  19848  215  0  2   61   41   228 0.975  12.79
 1.06 PlyA +  21647  21652    6                               1.05

 2.00 Prom +  23712  23751   40                              -5.36
 2.01 Init +  36173  36206   34  1  1   56  109    53 0.959   2.59
 2.02 Intr +  36284  36394  111  0  0   78   60    73 0.918   3.85
 2.03 Intr +  36508  36792  285  2  0   72   93   235 0.991  19.71
 2.04 Intr +  39230  39508  279  0  0   55   94   160 0.852  10.75
 2.05 Intr +  40003  40050   48  2  0  110  105    39 0.880   6.45
 2.06 Intr +  47049  47150  102  1  0  108   41    54 0.416   2.85
 2.07 Term +  51129  51169   41  1  2   94   36    46 0.128  -2.65
 2.08 PlyA +  53482  53487    6                               1.05

 3.10 PlyA -  53519  53514    6                               1.05
 3.09 Term -  53857  53725  133  0  1   89   53    69 0.859   1.06
 3.08 Intr -  60651  60484  168  2  0   90   78    58 0.871   4.06
 3.07 Intr -  63724  63554  171  0  0   93   78   102 0.976   8.76
 3.06 Intr -  64644  64474  171  2  0   91   91   170 0.999  16.66
 3.05 Intr -  66416  66246  171  1  0   83   90   198 0.999  18.56
 3.04 Intr -  68142  68029  114  2  0   79   83    94 0.956   7.56
 3.03 Intr -  70451  68873 1579  0  1   81  116  1680 0.999 158.11
 3.02 Intr -  70990  70916   75  2  0   24   94    72 0.649   0.89
 3.01 Init -  71696  71420  277  2  1   62  106   240 0.657  20.35
 3.00 Prom -  74336  74297   40                              -6.66

 4.00 Prom +  77955  77994   40                              -4.26
 4.01 Init +  84221  84284   64  1  1   68   12    97 0.044  -0.59
 4.02 Intr +  99984 100280  297  1  0   93  105   365 0.697  35.55
 4.03 Intr + 104485 104529   45  2  0  107   97    -9 0.547   0.28
 4.04 Intr + 107737 107808   72  2  0   88   98   146 0.999  14.98
 4.05 Intr + 112147 113713 1567  2  1    9   78  1715 0.832 150.79
 4.06 Intr + 115032 115202  171  0  0  111   78    28 0.895   3.16
 4.07 Intr + 116136 116300  165  0  0  121   43    78 0.962   5.68
 4.08 Intr + 120007 120177  171  1  0  115   91    69 0.992   8.96
 4.09 Intr + 120487 120657  171  1  0   81   78   114 0.925   8.76
 4.10 Intr + 124254 124424  171  0  0   72   89    27 0.537   0.26
 4.11 Intr + 127302 127472  171  0  0  115   80    56 0.717   6.56
 4.12 Term + 130745 130883  139  2  1   97   41   149 0.999   8.54
 4.13 PlyA + 131100 131105    6                               1.05

 5.17 PlyA - 132604 132599    6                               1.05
 5.16 Term - 140235 140042  194  1  2   63   49   111 0.734   2.28
 5.15 Intr - 145150 144956  195  2  0  130   20    81 0.632   4.89
 5.14 Intr - 145718 145668   51  0  0   69  105    40 0.730   2.68
 5.13 Intr - 155256 155203   54  1  0   71   96    46 0.720   2.65
 5.12 Intr - 157847 157563  285  0  0  102   91   248 0.984  23.81
 5.11 Intr - 162381 162124  258  1  0  114   70   378 0.584  36.03
 5.10 Intr - 162523 162491   33  2  0  105   89     2 0.373   0.19
 5.09 Intr - 172878 172751  128  2  2   44   40   129 0.088   4.02
 5.08 Intr - 175062 174935  128  0  2   -9   42   107 0.317  -4.12
 5.07 Intr - 175294 175107  188  2  2   47   56   305 0.681  22.61
 5.06 Intr - 177473 177372  102  0  0  149  109   117 0.999  20.05
 5.05 Intr - 178526 178314  213  0  0   77  102   333 0.999  32.29
 5.04 Intr - 178748 178644  105  0  0  102   68   205 0.998  20.09
 5.03 Intr - 186793 186638  156  2  0   78   97   266 0.977  26.48
 5.02 Intr - 189260 189142  119  1  2  122   83   199 0.998  22.91
 5.01 Init - 193017 192953   65  0  2   81   89   235 0.531  21.52
 5.00 Prom - 195690 195651   40                              -4.96

 6.00 Prom + 200568 200607   40                              -8.06
 6.01 Init + 201000 201060   61  2  1   81   94    25 0.933   3.81
 6.02 Intr + 201786 201929  144  1  0   58    3   124 0.364   1.15
 6.03 Intr + 208984 209042   59  2  2  117   82    60 0.696   6.90
 6.04 Intr + 209675 209854  180  1  0   72   35   269 0.750  20.06
 6.05 Intr + 210114 210263  150  2  0   69   72   223 0.998  19.16
 6.06 Intr + 210757 210839   83  0  2   20   94   162 0.999   8.44
 6.07 Intr + 211216 211477  262  1  1  107   77   227 0.976  20.99
 6.08 Intr + 211758 211884  127  2  1  118   74   137 0.999  15.55
 6.09 Intr + 212077 212165   89  2  2   89   81   252 0.991  24.29
 6.10 Intr + 212260 212334   75  0  0   83   81   114 0.999   9.91
 6.11 Intr + 212439 212587  149  2  2   89   76   206 0.999  18.53
 6.12 Intr + 213363 213504  142  0  1   89   60   159 0.987  13.56
 6.13 Intr + 213601 213629   29  0  2  111   92    30 0.930   2.61
 6.14 Intr + 215826 215958  133  0  1   89   50   158 0.901  12.75
 6.15 Intr + 216046 216177  132  0  0   92   93   116 0.999  13.34
 6.16 Intr + 216377 216503  127  1  1  131   96    73 0.997  12.65
 6.17 Intr + 216699 216873  175  1  1   83  100   204 0.964  20.10
 6.18 Intr + 217288 217420  133  1  1   69   83   210 0.934  19.25
 6.19 Term + 217513 217599   87  0  0   50   43   130 0.766   2.36
 6.20 PlyA + 218978 218983    6                              -3.44

 7.20 PlyA - 219976 219971    6                               1.05
 7.19 Term - 221543 221457   87  2  0  107   46   151 0.999  10.46
 7.18 Intr - 221685 221635   51  0  0  113   81    97 0.775  10.60
 7.17 Intr - 221894 221844   51  2  0   98   69    38 0.800   2.00
 7.16 Intr - 222311 222207  105  2  0   95  110   253 0.992  28.61
 7.15 Intr - 222529 222427  103  0  1  108   66    91 0.997   9.08
 7.14 Intr - 222647 222607   41  2  2  100   91    24 0.978   0.92
 7.13 Intr - 222853 222768   86  2  2   70   78    20 0.348  -1.26
 7.12 Intr - 223061 223001   61  2  1  128   78    36 0.855   4.91
 7.11 Intr - 223218 223138   81  0  0  113   85    87 0.901  10.73
 7.10 Intr - 224420 224330   91  1  1   80   64    98 0.999   6.60
 7.09 Intr - 224729 224646   84  1  0   80   57    93 0.947   4.34
 7.08 Intr - 225343 225276   68  1  2  118   76    29 0.997   2.40
 7.07 Intr - 225585 225430  156  0  0   99  109    10 0.490   4.41
 7.06 Intr - 225925 225852   74  2  2   77   89    44 0.982   2.53
 7.05 Intr - 226163 226036  128  1  2  106   93   106 0.983  13.22
 7.04 Intr - 226320 226247   74  0  2  112   75    41 0.917   3.40
 7.03 Intr - 228843 228769   75  0  0   84   86   116 0.984  10.71
 7.02 Intr - 229080 228936  145  2  1  126   77   192 0.918  22.18
 7.01 Init - 229702 229644   59  1  2   75   41   140 0.623   6.78

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl -  10649  10239  411  2  0   82   42   215 0.987  12.59

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815579f:54870016_55100895|GENSCAN_predicted_peptide_1|287_aa
MDPKQTTLLCLVLCLGQRIQAQEGDFPMPFISAKSSPVIPLDGSVKIQCQAIREAYLTQL
MIIKNSTYREIGRRLKFWNETDPEFVIDHMDANKAGRYQCQYRIGHYRFRYSDTLELVVT
GLYGKPFLSADRGLVLMPGENISLTCSSAHIPFDRFSLAKEGELSLPQHQSGEHPANFSL
GPVDLNVSGIYRCYGWYNRSPYLWSFPSNALELVVTDSIHQDYTTQNLIRMAVAGLVLVA
LLAILVENWHSHTALNKEASADVAEPSWSQQMCQPGLTFARTPSVCK

>gi568815579f:54870016_55100895|GENSCAN_predicted_CDS_1|864_bp
atggaccccaaacagaccaccctcctgtgtcttgtgctctgtctgggccagaggattcag
gcacaggaaggggactttcccatgcctttcatatctgccaaatcgagtcctgtgattccc
ttggatggatctgtgaaaatccagtgccaggccattcgtgaagcttacctgacccagctg
atgatcataaaaaactccacgtaccgagagataggcagaagactgaagttttggaatgag
actgatcctgagttcgtcattgaccacatggacgcaaacaaggcagggcgctatcagtgc
caatataggatagggcactacaggttccggtacagtgacaccctggagctggtagtgaca
ggcttgtatggcaaacccttcctctctgcagatcggggtctggtgttgatgccaggagag
aatatttccctcacgtgcagctcagcacacatcccatttgatagattttcactggccaag
gagggagaactttctctgccacagcaccaaagtggggaacacccggccaacttctctttg
ggtcctgtggacctcaatgtctcagggatctacaggtgctacggttggtacaacaggagc
ccctacctgtggtccttccccagtaatgccttggagcttgtggtcacagactccatccac
caagattacacgacgcagaacttgatccgcatggccgtggcaggactggtcctcgtggct
ctcttggccatactggttgaaaattggcacagccatacggcactgaacaaggaagcctcg
gcagatgtggctgaaccgagctggagccaacagatgtgtcagccaggattgacctttgca
cgaacaccaagtgtctgcaagtaa

>gi568815579f:54870016_55100895|GENSCAN_predicted_peptide_2|299_aa
MSSTLPALLCVGLCLSQRISAQQRESFLQSPGSLFRIQAKLLPPKHGWETLPKPFIWAEP
HFMVPKEKQVTICCQGNYGAVEYQLHFEGSLFAVDRPKPPERINKVQFYIPDMNSRMAGQ
YSCIYRVGELWSEPSNLLDLVVTEMYDTPTLSVHPGPEVISGEKVTFYCRLDTATSMFLL
LKEGRSSHVQRGYGKVQAEFPLGPVTTAHRGTYRCFGSYNNHAWSFPSEPVKLLVTGDIE
NTSLAPEDPTFPVVFDTLTTSSSKDSLLWLSRTLLATRFLTVTVPTANSTRELHVHVEP

>gi568815579f:54870016_55100895|GENSCAN_predicted_CDS_2|900_bp
atgtcttccacactccctgccctgctctgcgtcgggctgtgtctgagtcagaggatcagc
gcccagcagcgtgagtccttccttcaaagcccagggtcactcttccggattcaggccaag ctccttccacccaagcacggctgggagactctcccaaaaccgttcatctgggccgagccc catttcatggttccaaaggaaaagcaagtgaccatctgttgccagggaaattatggggct gttgaataccagctgcactttgaaggaagcctttttgccgtggacagaccaaaaccccct gagcggattaacaaagtccaattctacatcccggacatgaactcccgcatggcagggcaa tacagctgcatctatcgggttggggagctctggtcagagcccagcaacttgctggatctg gtggtaacagaaatgtatgacacacccaccctctcggttcatcctggacccgaagtgatc tcgggagagaaggtgaccttctactgccgtctagacactgcaacaagcatgttcttactg ctcaaggagggaagatccagccacgtacagcgcggatacgggaaggtccaggcggagttc cccctgggccctgtgaccacagcccacagagggacataccgatgttttggctcctataac aaccatgcctggtctttccccagtgagccagtgaagctcctggtcacaggcgacattgag aacaccagccttgcacctgaagaccccacctttcctgttgtatttgacacgttgaccact tcctcctcgaaggactcacttctctggctttctcggacacttcttgctactcgttttctg acggttacagtaccaacagcaaattctacacgggaacttcatgtgcatgtagaaccctaa >gi568815579f:54870016_55100895|GENSCAN_predicted_peptide_3|952_aa MTSPQLEWTLQTLLEQLNEDELKSFKSLLWAFPLEDVLQKTPWSEVEEADGKKLAEILVN TSSENWIRNATVNILEEMNLTELCKMAKAEMMEDGQVQEIDNPELGDAEEDSELAKPGEK EGWRNSMEKQSLVWKNTFWQGDIDNFHDDVTLRNQRFIPFLNPRTPRKLTPYTVVLHGPA GVGKTTLAKKCMLDWTDCNLSPTLRYAFYLSCKELSRMGPCSFAELISKDWPELQDDIPS ILAQAQRILFVVDGLDELKVPPGALIQDICGDWEKKKPVPVLLGSLLKRKMLPRAALLVT TRPRALRDLQLLAQQPIYVRVEGFLEEDRRAYFLRHFGDEDQAMRAFELMRSNAALFQLG SAPAVCWIVCTTLKLQMEKGEDPVPTCLTRTGLFLRFLCSRFPQGAQLRGALRTLSLLAA QGLWAQMSVFHREDLERLGVQESDLRLFLDGDILRQDRVSKGCYSFIHLSFQQFLTALFY ALEKEEGEDRDGHAWDIGDVQKLLSGEERLKNPDLIQVGHFLFGLANEKRAKELEATFGC RMSPDIKQELLQCKAHLHANKPLSVTDLKEVLGCLYESQEEELAKVVVAPFKEISIHLTN TSEVMHCSFSLKHCQDLQKLSLQVAKGVFLENYMDFELDIEFESSNSNLKFLEVKQSFLS DSSVRILCDHVTRSTCHLQKVEIKNVTPDTAYRDFCLAFIGKKTLTHLTLAGHIEWERTM MLMLCDLLRNHKCNLQYLRLGGHCATPEQWAEFFYVLKANQSLKHLRLSANVLLDEGAML LYKTMTRPKHFLQMLSLENCRLTEASCKDLAAVLVVSKKLTHLCLAKNPIGDTGVKFLCE GLSYPDCKLQTLVLQQCSITKLGCRYLSEALQEACSLTNLDLSINQIARGLWILCQALEN PNCNLKHLRLKTYETNLEIKKLLEEVKEKNPKLTIDCNASGATAPPCCDFFC >gi568815579f:54870016_55100895|GENSCAN_predicted_CDS_3|2859_bp 
atgacatcgccccagctagagtggactctgcagacccttctggagcagctgaacgaggat gaattaaagagtttcaaatcccttttatgggcttttcccctcgaagacgtgctacagaag accccatggtctgaggtggaagaggctgatggcaagaaactggcagaaattctggtcaac acctcctcagaaaattggataaggaatgcgactgtgaacatcttggaagagatgaatctc acggaattgtgtaagatggcaaaggctgagatgatggaggacggacaggtgcaagaaata gataatcctgagctgggagatgcagaagaagactcggagttagcaaagccaggtgaaaag gaaggatggagaaattcaatggagaaacagtctttggtctggaagaacaccttttggcaa ggagacattgacaatttccatgacgacgtcactctgagaaaccaacggttcattccattc ttgaatcccagaacacccaggaagctaacaccttacacggtggtgctgcacggccccgca ggcgtggggaaaaccacgctggccaaaaagtgtatgctggactggacagactgcaacctc agcccgacgctcagatacgcgttctacctcagctgcaaggagctcagccgcatgggcccc tgcagttttgcagagctgatctccaaagactggcctgaattgcaggatgacattccaagc atcctagcccaagcacagagaatcctgttcgtggtcgatggccttgatgagctgaaagtc ccacctggggcgctgatccaggacatctgcggggactgggagaagaagaagccggtgccc gtcctcctggggagtttgctgaagaggaagatgttacccagggcagccttgctggtcacc acgcggcccagggcactgagggacctccagctcctggcgcagcagccgatctacgtaagg gtggagggcttcctggaggaggacaggagggcctatttcctgagacactttggagacgag gaccaagccatgcgtgcctttgagctaatgaggagcaacgcggccctgttccagctgggc tcggcccccgcggtgtgctggattgtgtgcacgactctgaagctgcagatggagaagggg gaggacccggtccccacctgcctcacccgcacggggctgttcctgcgtttcctctgcagc cggttcccgcagggcgcacagctgcggggcgcgctgcggacgctgagcctcctggccgcg cagggcctgtgggcgcagatgtccgtgttccaccgagaggacctggaaaggctcggggtg caggagtccgacctccgtctgttcctggacggagacatcctccgccaggacagagtctcc aaaggctgctactccttcatccacctcagcttccagcagtttctcactgccctgttctac gccctggagaaggaggagggggaggacagggacggccacgcctgggacatcggggacgta cagaagctgctttccggagaagaaagactcaagaaccccgacctgattcaagtaggacac ttcttattcggcctcgctaacgagaagagagccaaggagttggaggccacttttggctgc cggatgtcaccggacatcaaacaggaattgctgcaatgcaaagcacatcttcatgcaaat aagcccttatccgtgaccgacctgaaggaggtcttgggctgcctgtatgagtctcaggag gaggagctggcgaaggtggtggtggccccgttcaaggaaatttctattcacctgacaaat acttctgaagtgatgcattgttccttcagcctgaagcattgtcaagacttgcagaaactc tcactgcaggtagcaaagggggtgttcctggagaattacatggattttgaactggacatt 
gaatttgaaagctcaaacagcaacctcaagtttctggaagtgaaacaaagcttcctgagt gactcttctgtgcggattctttgtgaccacgtaacccgtagcacctgtcatctgcagaaa gtggagattaaaaacgtcacccctgacaccgcgtaccgggacttctgtcttgctttcatt gggaagaagaccctcacgcacctgaccctggcagggcacatcgagtgggaacgcacgatg atgctgatgctgtgtgacctgctcagaaatcataaatgcaacctgcagtacctgaggttg ggaggtcactgtgccaccccggagcagtgggctgaattcttctatgtcctcaaagccaac cagtccctgaagcacctgcgtctctcagccaatgtgctcctggatgagggtgccatgttg ctgtacaagaccatgacacgcccaaaacacttcctgcagatgttgtcgttggaaaactgt cgtcttacagaagccagttgcaaggaccttgctgctgtcttggttgtcagcaagaagctg acacacctgtgcttggccaagaaccccattggggatacaggggtgaagtttctgtgtgag ggcttgagttaccctgattgtaaactgcagaccttggtgttacagcaatgcagcataacc aagcttggctgtagatatctctcagaggcgctccaagaagcctgcagcctcacaaacctg gacttgagtatcaaccagatagctcgtggattgtggattctctgtcaggcattagagaat ccaaactgtaacctaaaacacctacggttgaagacctatgaaactaatttggaaatcaag aagctgttggaggaagtgaaagaaaagaatcccaagctgactattgattgcaatgcttcc ggggcaacggcacctccgtgctgtgactttttttgctga >gi568815579f:54870016_55100895|GENSCAN_predicted_peptide_4|1067_aa MGNRPGAVVTPVIPALWEAEAAPTWDKMVSSAQMGFNLQALLEQLSQDELSKFKYLITTF SLAHELQKIPHKEVDKADGKQLVEILTTHCDSYWVEMASLQVFEKMHRMDLSERAKDEVR EAALKSFNKRKPLSLGITRKERPPLDVDEMLERFKTEAQDKDNRCRYILKTKFREMWKSW PGDSKEVQVMAERYKMLIPFSNPRVLPGPFSYTVVLYGPAGLGKTTLAQKLMLDWAEDNL IHKFKYAFYLSCRELSRLGPCSFAELVFRDWPELQDDIPHILAQARKILFVIDGFDELGA APGALIEDICGDWEKKKPVPVLLGSLLNRVMLPKAALLVTTRPRALRDLRILAEEPIYIR VEGFLEEDRRAYFLRHFGDEDQAMRAFELMRSNAALFQLGSAPAVCWIVCTTLKLQMEKG EDPVPTCLTRTGLFLRFLCSRFPQGAQLRGALRTLSLLAAQGLWAQTSVLHREDLERLGV QESDLRLFLDGDILRQDRVSKGCYSFIHLSFQQFLTALFYTLEKEEEEDRDGHTWDIGDV QKLLSGVERLRNPDLIQAGYYSFGLANEKRAKELEATFGCRMSPDIKQELLRCDISCKGG HSTVTDLQELLGCLYESQEEELVKEVMAQFKEISLHLNAVDVVPSSFCVKHCRNLQKMSL QVIKENLPENVTASESDAEVERSQDDQHMLPFWTDLCSIFGSNKDLMGLAINDSFLSASL VRILCEQIASDTCHLQRVVFKNISPADAHRNLCLALRGHKTVTYLTLQGNDQDDMFPALC EVLRHPECNLRYLGLVSCSATTQQWADLSLALEVNQSLTCVNLSDNELLDEGAKLLYTTL RHPKCFLQRLSLENCHLTEANCKDLAAVLVVSRELTHLCLAKNPIGNTGVKFLCEGLRYP ECKLQTLVLWNCDITSDGCCDLTKLLQEKSSLLCLDLGLNHIGVKGMKFLCEALRKPLCN 
LRCLWLWGCSIPPFSCEDLCSALSCNQSLVTLDLGQNPLGSSGVKMLFETLTCSSGTLRT LRLKIDDFNDELNKLLEEIEEKNPQLIIDTEKHHPWAERPSSHDFMI >gi568815579f:54870016_55100895|GENSCAN_predicted_CDS_4|3204_bp atgggcaaccggccgggcgcggtggtcacgcctgtaatcccagcactttgggaggccgag gcggctcccacgtgggacaagatggtgtcttcggcgcagatgggcttcaacctgcaggct ctcctggagcagctcagccaggatgagttgagcaagttcaagtatctgatcacgaccttc tccctggcacacgagctccagaagatcccccacaaggaggtagacaaggctgatgggaag caactggtagaaatcctcaccacccattgtgacagctactgggtggagatggcgagcctc caggtctttgaaaagatgcaccgaatggatctgtctgagagagcaaaggatgaagtcaga gaagcagctttgaaatcctttaataaaaggaagcctctatcattagggataacacggaaa gaacgaccacctctagacgtggacgaaatgctggagcgcttcaaaacagaagcacaagac aaagacaataggtgcaggtatatattgaagacgaagttccgggagatgtggaagagctgg cctggagatagcaaagaggtccaggttatggctgagagatacaagatgctgatcccattc agcaaccccagggtgcttcccgggcccttctcatacacggtggtgctgtatggtcctgca ggccttgggaaaaccacgctggcccagaaactaatgctagactgggcagaggacaacctc atccacaaattcaaatatgcgttctacctcagctgcagggagctcagccgcctgggcccg tgcagttttgcagagctggtcttcagggactggcctgaattgcaggatgacattccacac atcctagcccaagcacggaaaatcttgttcgtgattgacggctttgatgagctgggagcc gcacctggggcgctgatcgaggacatctgcggggactgggagaagaagaagccggtgccc gtcctcctggggagtttgctgaacagggtgatgttacccaaggccgccctgctggtcacc acgcggcccagggccctgagggacctccggatcctggcggaggagccgatctacataagg gtggagggcttcctggaggaggacaggagggcctatttcctgagacactttggagacgag gaccaagccatgcgtgcctttgagctaatgaggagcaacgcggccctgttccagctgggc tcggcccccgcggtgtgctggatcgtgtgcacgactctgaagctgcagatggagaagggg gaggacccggtccccacctgcctcacccgcacggggctgttcctgcgtttcctctgcagc cggttcccgcagggcgcacagctgcggggcgcgctgcggacgctgagcctcctggccgcg cagggcctgtgggcgcagacgtccgtgcttcaccgagaggatctggaaaggctcggggtg caggagtccgacctccgtctgttcctggacggagacatcctccgccaggacagagtctcc aaaggctgctactccttcatccacctcagcttccagcagtttctcactgccctgttctac accctggagaaggaggaggaagaggatagggacggccacacctgggacattggggacgta cagaagctgctttccggagtagaaagactcaggaaccccgacctgatccaagcaggctac tactcctttggcctcgctaacgagaagagagccaaggagttggaggccacttttggctgc 
cggatgtcaccggacatcaaacaggaattgctgcgatgcgacataagttgtaagggtgga cattcaacggtgacagacctgcaggagctcctcggctgtctgtacgagtctcaggaggag gagctggtgaaggaggtgatggctcagttcaaagaaatatccctgcacttaaatgcagta gacgttgtgccatcttcattctgcgtcaagcactgtcgaaacctgcagaaaatgtcactg caggtaataaaggagaatctcccggagaatgtcactgcgtctgaatcagacgccgaggtt gagagatcccaggatgatcagcacatgcttcctttctggacggacctttgttccatattt ggatcaaataaggatctgatgggtctagcaatcaatgatagctttctcagtgcctcccta gtaaggatcctgtgtgaacaaatagcctctgacacctgtcatctccagagagtggtgttc aaaaacatttccccagctgatgctcatcggaacctctgcctagctcttcgaggtcacaag actgtaacgtatctgacccttcaaggcaatgaccaggatgatatgtttcccgcattgtgt gaggtcttgagacatccagaatgtaacctgcgatatctcgggttggtgtcttgttccgct accactcagcagtgggctgatctctccttggcccttgaagtcaaccagtccctgacgtgc gtaaacctctccgacaatgagcttctggatgagggtgctaagttgctgtacacaactttg agacaccccaagtgctttctgcagaggttgtcgttggaaaactgtcaccttacagaagcc aattgcaaggaccttgctgctgtgttggttgtcagccgggagctgacacacctgtgcttg gccaagaaccccattgggaatacaggggtgaagtttctgtgtgagggcttgaggtacccc gagtgtaaactgcagaccttggtgctttggaactgcgacataactagcgatggctgctgc gatctcacaaagcttctccaagaaaaatcaagcctgttgtgtttggatctggggctgaat cacataggagttaagggaatgaagttcctgtgtgaggctttgaggaaaccactgtgcaac ttgagatgtctgtggttgtggggatgttccatccctccgttcagttgtgaagacctctgc tctgccctcagctgcaaccagagcctcgtcactctggacctgggtcagaatcccttgggg tctagtggagtgaagatgctgtttgaaaccttgacatgttccagtggcaccctccggaca ctcaggttgaaaatcgatgactttaatgatgaactcaataagctgctggaagaaatagaa gaaaaaaacccacaactgattattgatactgagaaacatcatccctgggcagaaaggcct tcttctcatgacttcatgatctga >gi568815579f:54870016_55100895|GENSCAN_predicted_peptide_5|757_aa MSRYLLPLSALGTVAGAAVLLKDYVTGGACPSKATIPGKTVIVTGANTGIGKQTALELAR RGGNIILACRDMEKCEAAAKDIRGETLNHHVNARHLDLASLKSIREFAAKIIEEEERVDI LINNAGVMRCPHWTTEDGFEMQFGVNHLGHFLLTNLLLDKLKASAPSRIINLSSLAHVAG HIDFDDLNWQTRKYNTKAAYCQSKLAIVLFTKELSRRLQGSGVTVNALHPGVARTELGRH TGIHGSTFSSTTLGPIFWLLVKSPELAAQPSTYLAVAEELADVSGKYFDGLKQKAPAPEA EDEEVARRLWAESARLITSGADLKARMAPPDRGQLSAMPAASWHYLSRETQDWRPPCPHP LVLPSRWLVVMEPAAMVSGPVEKPGRNGDNLEKQNHTNNHVRLCLGRVPAQSGPLPKPSL 
QALPSSLVPLEKPVTLRCQGPPGVDLYRLEKLSSSRYQDQAVLFIPAMKRSLAGRYRCSY QNGSLWSLPSDQLELVATGVFAKPSLSAQPGPAVSSGGDVTLQCQTRYGFDQFALYKEGD PAPYKNPERWYRASFPIITVTAAHSGTYRCYSFSSRDPYLWSAPSDPLELVVTGTSVTPS RLPTEPPSPVAETSRSITASPKESDSPAGPARQYYTKGNLVRICLGAVILIILAGFLAED WHSRRKRLRHRGRAVQRPLPPLPPLPLTRKSNGGACYKCQKSDHQAKECLQPRIPPKPCP ICAGPHWKSDCSTHLAATPRAPGTLAQGSLTPSRLSG >gi568815579f:54870016_55100895|GENSCAN_predicted_CDS_5|2274_bp atgagccgctacctgctgccgctgtcggcgctgggcacggtagcaggcgccgccgtgctg ctcaaggactatgtcaccggtggggcttgccccagcaaggccaccatccctgggaagacg gtcatcgtgacgggtgccaacacaggcatcgggaagcagaccgccttggaactggccagg agaggaggcaacatcatcctggcctgccgagacatggagaagtgtgaggcggcagcaaag gacatccgcggggagaccctcaatcaccatgtcaacgcccggcacctggacttggcttcc ctcaagtctatccgagagtttgcagcaaagatcattgaagaggaggagcgagtggacatt ctaatcaacaacgcgggtgtgatgcggtgcccccactggaccaccgaggacggcttcgag atgcagtttggcgttaaccacctgggtcactttctcttgacaaacttgctgctggacaag ctgaaagcctcagccccttcgcggatcatcaacctctcgtccctggcccatgttgctggg cacatagactttgacgacttgaactggcagacgaggaagtataacaccaaagccgcctac tgccagagcaagctcgccatcgtcctcttcaccaaggagctgagccggcggctgcaaggc tctggtgtgactgtcaacgccctgcaccccggcgtggccaggacagagctgggcagacac acgggcatccatggctccaccttctccagcaccacactcgggcccatcttctggctgctg gtcaagagccccgagctggccgcccagcccagcacatacctggccgtggcggaggaactg gcggatgtttccggaaagtacttcgatggactcaaacagaaggccccggcccccgaggct gaggatgaggaggtggcccggaggctttgggctgaaagtgcccgcctgataacctctgga gcagatttgaaagccaggatggcgcctccagaccgaggacagctgtccgccatgcccgca gcttcctggcactacctgagccgggagacccaggactggcggccgccatgcccgcaccct cttgtcttgccctctcgctggctggtggtgatggagccggctgccatggtgagcggccct gtggagaagccagggaggaatggagacaatctggagaaacagaatcacaccaacaaccac gtgcggctgtgtctggggcgtgtgccagcgcagagtggaccgctccccaagccctccctc caggctctgcccagctccctggtgcccctggagaagccagtgaccctccggtgccaggga cctccgggcgtggacctgtaccgcctggagaagctgagttccagcaggtaccaggatcag gcagtcctcttcatcccggccatgaagagaagtctggctggacgctaccgctgctcctac cagaacggaagcctctggtccctgcccagcgaccagctggagctcgttgccacgggagtt tttgccaaaccctcgctctcagcccagcccggcccggcggtgtcgtcaggaggggacgta 
accctacagtgtcagactcggtatggctttgaccaatttgctctgtacaaggaaggggac cctgcgccctacaagaatcccgagagatggtacagggctagttttcccatcatcacggtg accgccgcccacagcggaacctaccgatgctacagcttctccagcagggacccatacctg tggtcagcccccagcgaccccctggagcttgtggtcacaggaacctctgtgacccccagc cggttaccaacagaaccaccttccccggtagcagagacttctaggagtatcaccgccagt ccaaaggagtcagactctccagctggtcctgcccgccagtactacaccaagggcaacctg gtccggatatgcctcggggctgtgatcctaataatcctggcggggtttctggcagaggac tggcacagccggaggaagcgcctgcggcacaggggcagggctgtgcagaggccgcttccg cccctcccgcccctcccgctgacccggaaatcaaacgggggagcttgctacaagtgccag aaatctgaccaccaggccaaggaatgcctgcagcccaggattcctcctaagccgtgtccc atctgtgcgggaccccactggaaatcggactgttcaactcacctggcagccactcccaga gcccctggaactctggcccaaggctctctgactccttctcggcttagcggctga >gi568815579f:54870016_55100895|GENSCAN_predicted_peptide_6|778_aa MGGELFKPQGRILFRSWDIPGTVKHRKYLWHECDENTELQKPKKEKRKGPPSTPTSPTRC YTHQDHLLEQRKRYSTVVMADVSQYPVNHLVTFCLGEDDGVHTVEDASRKLAVMDSQGRV WAQEMLLRVSPDHVTLLDPASKVPGGTWEELESYPLGAIVRCDAVMPPGRSRSLLLLVCQ EPERAQPDVHFFQGLRLGAELIREDIQGALHNYRSGRGERRAAALRATQEELQRDRSPAA ETPPLQRRPSVRAVISTVERGAGRGRPQAKPIPEAEEAQRPEPVGTSSNADSASPDLGPR GPDLAVLQAEREVDILNHVFDDVESFVSRLQKSAEAARVLEHRERGRRSRRRAAGEGLLT LRAKPPSEAEYTDVLQKIKYAFSLLARLRGNIADPSSPELLHFLFGPLQMIVNTSGGPEF ASSVRRPHLTSDAVALLRDNVTPRENELWTSLGDSWTRPGLELSPEEGPPYRPEFFSGWE PPVTDPQSRAWEDPVEKQLQHERRRRQQSAPQVAVNGHRDLEPESEPQLESETAGKWVLC NYDFQARNSSELSVKQRDVLEVLDDSRKWWKVRDPAGQEGYVPYNILTPYPGPRLHHSQS PARSLNSTPPPPPAPAPAPPPALARPRWDRPRWDSCDSLNGLDPSEKEKFSQMLIVNEEL QARLAQGRSGPSRAVPGPRAPEPQLSPGSDASEVRAWLQAKGFSSGTVDALGVLTGAQLF SLQKEELRAVSPEEGARVYSQVTVQRSLLEDKEKVSELEAVMEKQKKKVEGEVEMEVI >gi568815579f:54870016_55100895|GENSCAN_predicted_CDS_6|2337_bp atggggggtgagctcttcaagccccaggggagaattctgttccgttcctgggacatccca ggtacagtcaagcacagaaaatacttgtggcatgaatgtgatgagaacacagaattgcag aagccaaagaaagagaagcgtaagggccctccttccacccctacctcccccacccgctgc tacacgcaccaggaccacctgctggagcagaggaagcgttactccacagttgttatggct gatgtatcccagtacccagtcaatcacctggtgacgttctgcctgggtgaggacgatggc 
gtgcataccgtggaggatgcctccaggaagttggccgtcatggatagccagggccgagtc tgggcacaggagatgctgctgcgagtgtctcccgaccatgtcacgctgctcgacccggcc tccaaggtgccggggggcacgtgggaggagctggagtcgtacccactgggcgccatcgtg cgctgtgacgcggtgatgccacccggcaggagccgctcgttgctgctgctcgtgtgccag gaacccgagcgcgcgcagcccgacgtgcacttcttccagggcctgcgcctcggggcggag ctgatccgagaggacatccagggggctctgcacaattaccgctcgggccgcggggagcgc agggcggcggcgctcagggccacgcaggaggagttgcagcgcgaccgctcgcccgccgct gagaccccgcccctgcagcgccgcccgtcagtccgcgcagtgatcagcaccgtagagcgg ggcgcgggccgcggacgaccccaggcgaagcccattcccgaggcagaggaggcgcagagg cctgagccggtggggacctcgagcaacgctgactcggcctccccggacctgggtccccgg ggtcctgacctggcggttctgcaggcggagcgggaagtggacatcctgaaccacgtgttc gacgacgtagagagctttgtatcgaggctgcagaagtcggcggaggcggccagggtgctg gagcaccgggaacgcggccgcaggagccggcgccgggcggctggggagggcttgctgacg ctgcgggccaagccgccctcggaggccgagtacaccgacgtgctgcagaagatcaagtac gccttcagcctgctggcccggctgcgcggcaacatcgccgacccctcctctccggagctg ttgcacttccttttcgggcctctgcagatgattgtgaacacgtcgggggggccggagttc gcgagcagtgtgcggcggccgcatctgacatcggatgccgtggcgctgctgcgggacaac gtcactccacgtgaaaacgagctctggacctcgctgggggactcgtggacccgccccggg ctggagctgtccccggaggagggacccccatacagacccgagttcttcagcggctgggag ccgccggtcactgacccgcagagccgcgcctgggaggacccagttgagaaacagctacag cacgagcggaggcgccggcagcaaagcgccccccaggtcgctgtcaatggtcaccgagac ttggagccagaatctgagcctcagctggagtcagagacagcaggaaaatgggtcctgtgt aattatgacttccaggcccgcaacagcagtgagctgtcggtcaagcagcgggacgtactg gaggtcctggatgacagtcgtaagtggtggaaggttcgggacccagcggggcaggaggga tatgtgccctacaacatcctgacaccctaccccggaccccggctgcaccacagccaaagc cctgcccgcagcctgaacagcactcctcctccaccaccagccccagccccggccccacct ccagctctggctcggccccgctgggacaggccccgctgggacagctgcgatagcctcaac ggcttggaccccagcgagaaggagaaattctcccagatgctcatcgtcaacgaggaactg caggcgcgcctggcccagggccgctcgggaccgagccgcgcagtcccagggccccgcgcc ccggaaccgcagctcagcccgggctcggacgcctccgaggtccgcgcctggctgcaggcc aagggctttagctccgggaccgtggacgcgctgggtgtgctgaccggggcgcagcttttc tcgctgcagaaggaggagctgcgggcggtgagccccgaggagggggcacgtgtgtacagc 
caggtcaccgtgcagcgctcgctgctggaggacaaagagaaagtgtcagagctggaggca gtgatggagaagcaaaagaagaaggtggaaggcgaggtggaaatggaggtcatttga >gi568815579f:54870016_55100895|GENSCAN_predicted_peptide_7|539_aa MLGQVPLLVGLLLSALSAAGLLLQAGYDPELRDGDGWTPLHAAAHWGVEDACRLLAEHGG GMDSLTHAGQRPCDLADEEVLSLLEELARKQEDLRNQKEASQSRGQEPQAPSSSKHRRSS VCRLSSREKISLQDLSKERRPGGAGGPPIQDEDEGEEGPTEPPPAEPRTLNGVSSPPHPS PKSPVLEEAPFSRRFGLLKTGSSGALGPPERRTAEGAPGAGLQRSASSSWLEGTSTQAKE LRLARITPTPSPKLPEPSVLIPEPESPAKPNVPTASTAPPADSRDRRRSYQMPVRDEESE SQRKARSRLMRQSRRSTQGVTLTDLKEAEKAAGKAPESEKPAQSLDPSRRPRVPGVENSD SPAQRAEAPDGQGPGPQAAREHRKVGKEWRGPAEGEEAEPADRSQESSTLEGGPSARRQR WQRDLNPEPEPESEEPDGGFRTLYAELRRENERLREALTETTLRLAQLKVELERATQRQE RFAERPALLELERFERRALERKAAELEEELKALSDLRADNQRLKDENAALIRVISKLSK >gi568815579f:54870016_55100895|GENSCAN_predicted_CDS_7|1620_bp atgttgggtcaggtccctctccttgtggggctgctgctgagcgctctgagcgcagccggg ttgctccttcaggctggctacgacccagagctccgggacggggacggctggactcccctg cacgcagcggcacactggggcgtggaggatgcctgccgcctgctggccgagcatggcggg ggcatggactcactgacccatgcggggcagcgtccctgtgacctggccgatgaggaagta ctgagcctgttggaggaactggcccggaaacaggaggaccttcggaaccaaaaagaagct tcccagagccggggccaggagccccaagcgccctctagcagcaaacacagaaggagctct gtgtgtcgtctgagcagtcgcgagaagatttccctccaggacttgtccaaggagcgccgg cctggtggggctggggggccccccatccaggacgaggatgagggggaagaaggtcccacc gaaccaccccctgcagaacccagaaccctcaatggcgtctcctccccgccgcaccccagc cctaagagtcccgtgcttgaagaggcccccttctccaggcgctttggcctcctgaagaca gggagttctggtgccctgggtccccctgaaaggcggacagcggagggagcccctggggct gggctgcagcgctcggcttcctcctcctggctggaagggacctccactcaggccaaggag ctccgtcttgccagaattaccccgaccccctccccgaagctgccggagccctctgtcctg attccggagcctgaatccccagcgaagccaaacgtccccacagcctccacggcgccccca gcggactcccgggaccgacggaggtcctaccagatgcctgtgcgggatgaggagtcggaa tcccagagaaaagctcgctcccgtctcatgcgccagtctcggaggtccacacagggtgtg actcttacagacctgaaggaggcagagaaggctgcagggaaggccccagagtcagagaag ccggcgcagagcctggacccttcccgaaggccccgcgtccctggagtggagaactctgac agccctgcccagagagcagaggcgcccgacgggcagggtccgggaccgcaggcggccagg 
gagcaccgcaaggtcggaaaggagtggagggggcctgcggagggggaggaggcggagccg gctgaccgcagccaggagtccagcaccctggagggcggcccctcggcccgcaggcagcgg tggcagcgggacctcaacccagaacctgagccagaatcggaagagccagacggaggcttt aggacgctgtatgcagagctgcgcagggagaacgagcggcttcgcgaggccctgaccgag accacgctgcggctggcgcagctcaaggtggagctggagcgggccacgcagaggcaagaa cgcttcgctgagaggccagccctcctggaactggagagattcgagcgcagggccctggaa cgcaaggccgcagagctggaggaggagctgaaggccctgtctgacctccgcgctgacaac cagcgcctcaaggatgagaatgcagcgttgatccgcgtcatcagcaaactctccaagtga