GENSCAN 1.0	Date run: 4-Nov-116	Time: 01:37:38

Sequence gi568815575f:152973063_153174259 : 201197 bp : 47.21% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.15 PlyA -    481    476    6                               1.05
 1.14 Term -   1294   1248   47  2  2  125   33    14 0.108  -3.03
 1.13 Intr -   4887   4750  138  1  0   93   87    57 0.530   6.54
 1.12 Intr -   5474   5429   46  2  1  114   87    79 0.974   8.38
 1.11 Intr -  10928  10856   73  1  1  113   85   -17 0.403  -0.09
 1.10 Intr -  11051  11015   37  0  1   59  115    50 0.459   2.12
 1.09 Intr -  16910  16704  207  0  0   14   -4   188 0.349   1.25
 1.08 Intr -  18548  17480 1069  2  1  -27   31   792 0.033  51.83
 1.07 Intr -  18915  18840   76  2  1  110   69    38 0.044   3.62
 1.06 Intr -  19724  19595  130  0  1   62  105    -5 0.044  -1.65
 1.05 Intr -  21063  20971   93  1  0   93   96   108 0.131  12.04
 1.04 Intr -  21635  21549   87  0  0   46   82    87 0.620   3.84
 1.03 Intr -  26318  26224   95  1  2   73   36    58 0.390  -1.29
 1.02 Intr -  27883  27776  108  0  0   41   89   109 0.853   5.70
 1.01 Init -  29375  29200  176  2  2   83   47    79 0.822   2.12
 1.00 Prom -  40116  40077   40                              -4.96

 2.00 Prom +  42446  42485   40                              -4.36
 2.01 Init +  42938  43514  577  1  1   70   40   200 0.826   8.70
 2.02 Intr +  43665  44316  652  1  1   71   39   284 0.473  12.77
 2.03 Intr +  45947  45996   50  2  2   62   93    56 0.550   1.82
 2.04 Intr +  55652  55865  214  0  1   98   86   239 0.543  22.47
 2.05 Intr +  56547  56796  250  0  1   27   -5   247 0.237   6.74
 2.06 Intr +  56844  57312  469  2  1   24   62   259 0.599   9.58
 2.07 Intr +  59593  59823  231  2  0   49   52   126 0.276   2.84
 2.08 Intr +  59942  59999   58  0  1   64   86    54 0.487   0.74
 2.09 Intr +  60281  60408  128  2  2   12   67    96 0.060   0.22
 2.10 Intr +  83383  83576  194  2  2   55   87    83 0.077   4.11
 2.11 Term +  83973  85385 1413  2  0   44   44   764 0.097  58.64
 2.12 PlyA +  87378  87383    6                               1.05

 3.00 Prom +  91789  91828   40                              -5.56
 3.01 Init +  96066  96147   82  2  1   82   79    71 0.687   5.05
 3.02 Term +  99873 101200 1328  1  2  109   39  1022 0.964  91.12
 3.03 PlyA + 101935 101940    6                               1.05

 4.14 PlyA - 101947 101942    6                              -5.80
 4.13 Term - 104034 102707 1328  1  2  109   39  1001 0.961  89.02
 4.12 Intr - 108971 108854  118  2  1  -24   78    91 0.100  -3.06
 4.11 Intr - 109971 109789  183  0  0  110   35    49 0.551   1.78
 4.10 Intr - 110330 110200  131  0  2   26   75    95 0.336   2.41
 4.09 Intr - 110528 110450   79  2  1   36   56    73 0.405  -1.98
 4.08 Intr - 111368 111270   99  2  0   28   92    81 0.502   2.81
 4.07 Intr - 118996 118810  187  0  1   70   40    53 0.009  -1.61
 4.06 Intr - 123535 123432  104  1  2   62   77    70 0.461   2.27
 4.05 Intr - 126600 126508   93  0  0   31   50   102 0.421   0.86
 4.04 Intr - 130219 130041  179  2  2   57  116    50 0.841   4.34
 4.03 Intr - 134497 134373  125  0  2   69  100    78 0.956   7.33
 4.02 Intr - 137017 136851  167  1  2   78  105   105 0.646   9.96
 4.01 Init - 138554 138429  126  2  0   59   -4   113 0.156  -0.64
 4.00 Prom - 142429 142390   40                              -6.56

 5.03 PlyA - 143056 143051    6                               1.05
 5.02 Term - 153399 149079 4321  2  1   67   36  2508 0.195 228.96
 5.01 Init - 155261 153616 1646  2  2   87   47   803 0.202  67.66
 5.00 Prom - 156034 155995   40                              -6.36

 6.00 Prom + 158907 158946   40                              -3.76
 6.01 Init + 171075 171222  148  2  1   29   92   100 0.654   4.75
 6.02 Intr + 175535 175644  110  0  2   89  106    41 0.731   6.00
 6.03 Intr + 180273 180374  102  2  0   58   82    51 0.467   1.87
 6.04 Term + 192419 192502   84  1  0   69   32    94 0.116  -0.45
 6.05 PlyA + 193318 193323    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  21063  20885  179  1  2   93   50   133 0.826   7.85

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815575f:152973063_153174259|GENSCAN_predicted_peptide_1|793_aa
MAEGKEEQVTSYMDGSRQRDGSRQRERELVQGKPHDSIISHQVLPTTHGKYGSYKMRFRS
RWLTLGIDSANGYLPNGIALPQPEWLPDDIECPVRPLMKKFGTKSGGQSSVLSSPFSGVC
MPADQFVLSGRCKPQDLPSVEETCFGKRQEAVWVLASWRSQHGGGSRHRRVVVALRGNSG
LEAPPADPVKQDPSRLSQELPNWIELKSRASLKWLCYCSIRAASRANPARHPKHTQQSPL
DFEEEPTAGGSQQGVKVGAMALTLLEDWCKGMDMDPRKALLIVGIPMECSEVEIQDTVKA
GLQPLCAYRVLGRMFRREDNAKAVFIELADTVNYTTLPSHIPGKGGSWEVVVKPRNPDDE
FLSRLNYFLKDEGRSMTDVARALGCCSLPAESLDAEVMPQVRSPPLEPPKESMWYRKLKV
FSGTASPSPGEETFEDWLEQVTEIMPIWQVSEVEKRRRLLESLRGPALSIMRVLQANNDS
ITVEQCLDALKQIFGDKEDFRASQFRFLQTSPKIGEKVSTFLLRLEPLLQKAVHKSPLSV
RSTDMIRLKHLLARVAMTPALRGKLELLDQRGCPPNFLELMKLIRDEEEWENTEAVMKNK
EKPSGRGRGASDGDPSEAGAEEGVAGGSGLVLWTPPNIYPVDFELSMPTGTQATWDLEEP
AWGLPLPEGARAAEEELWGRSSSENLSIHYLVRIQHPLWPFWTPDVHLFLAWDHLLLPCE
EGACIPITFCHDSIATEIGLLLCLSPHEMVTPQIHRLSVLLIDVSPVPHRRLDALYKESL
CVTLSPIKSPTNT

>gi568815575f:152973063_153174259|GENSCAN_predicted_CDS_1|2382_bp
atggcagaaggcaaagaggagcaagtcacgtcttacatggatggcagcaggcaaagagat
ggcagcaggcaaagagagagagagcttgtgcagggaaaaccccatgattcaattatttcc
caccaggtccttcccacaacacatgggaaatatgggagctacaagatgagatttaggtcc
cgctggctgaccctgggcattgattctgccaatgggtacttgcctaatggaattgcccta
cctcaaccagagtggctccctgacgatattgagtgccctgtcaggcccctcatgaagaag
tttggaacaaagagtggtggtcagagttcagtcctcagctcccccttttctggggtctgc
atgccagctgatcagtttgtactttcgggccgctgcaagccgcaggaccttccctctgtg
gaagaaacctgctttgggaaacgacaggaagcagtttgggtactagcctcgtggcgcagt
cagcacggaggcggcagccgccatagacgtgtggtggtcgcgctgcgcgggaactccggc
ttggaggcacctccggcagatccagtgaagcaggacccatcacgcctgagtcaggaactg
cctaactggatagagttgaaaagcagagcgagtctgaagtggctgtgttattgcagcatc
cgcgctgccagcagggccaacccagcaaggcacccgaagcacacacagcagagtcctctg
gactttgaggaagaacccacagcaggaggaagtcagcagggagtgaaagtgggagcaatg
gcactgacactgctagaggattggtgcaaggggatggacatggaccccagaaaggccctg
ctgattgtaggcatccccatggagtgtagtgaggtggaaattcaggacactgtgaaggca ggcttacagcccctgtgcgcatacagggtcctagggagaatgttcaggagggaagacaat gccaaggcagtcttcattgaactggctgacactgtcaattacactactctgcccagtcac ataccaggaaagggtggctcctgggaagtggtggtaaaaccccgtaacccagatgatgag tttctcagtagactgaactacttcctgaaagatgagggccgaagtatgacagatgtggcc agagccctgggatgttgcagcctccctgccgagagcctggatgcagaggtcatgccccaa gttagatccccacctttagagcctccgaaagaaagtatgtggtacaggaaactgaaagtg ttttcgggaactgcttcccctagcccaggcgaagagacctttgaagactggctagagcag gtcactgagataatgcccatatggcaagtgtctgaggtggagaagaggcggcgtttgctg gagagcttacgtgggcctgctctgtcaatcatgcgggtgctccaggccaacaatgactcc ataactgtggagcagtgccttgacgccctaaagcagatctttggggataaagaggacttt agagcctctcagtttaggtttctgcagacctctccgaagattggagagaaagtctccact ttcctgctgcgcttagagcccctgctgcagaaagccgtgcacaagagccccttgtcagtg cgcagcacagacatgattcgtctgaaacatctcttagctcgggtcgccatgacccccgcc ctcaggggcaagctggagctcttggatcagcgagggtgtcctcccaattttctggagtta atgaagctcattcgagatgaagaagagtgggagaacactgaggcagtgatgaagaataag gagaagccatcagggagaggccggggggcctccgatggggaccccagcgaagcaggggct gaagaaggtgtggctgggggctctggcctggtcctctggacacccccaaatatctaccct gtggactttgagctgagcatgcccactggcacccaggccacgtgggacctggaggagcct gcctggggcctgcctctgccagaaggagccagggctgctgaggaagagctgtggggcaga agctcatctgagaacctgtcgattcactaccttgtaaggatccagcaccctctgtggcct ttctggactcccgacgtccacctgttcctggcctgggatcacctcctcttgccttgtgaa gaaggtgcctgcatccccatcaccttctgccatgattctattgcgactgaaataggattg ttgctttgtctttctccccatgagatggtcactccacaaattcacagattgtctgtcttg ctcattgatgtatccccagtgccccacagaaggcttgatgcactgtacaaggaaagtctt tgtgtcacactgtcaccaattaaaagtcctacaaatacctag >gi568815575f:152973063_153174259|GENSCAN_predicted_peptide_2|1411_aa MDKFLDTYTHPRLNQEEVESLNRPITDSEIEAIINSLPTRKSPGPDGFTAEFYQRYKQEL VPFLQKIFQSIEKEGILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDVKILNKILA NRIQQQIKKLIHHDQVGFIPGMQGWFNIHKSINVIQHINRTKDKNHMIISIDAEKAFDKI QQPFMLKSLNKLVLEVLARAIRQEKEMKGIQLGKEEVKLSLFADDMIVYLENPIISAQDL LKLISNFSKVSGYEINVQKSQASLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVK 
DLFKENYKPLLNEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRFNAILIKLPMTFFT ELKTTTLKFIWNQKRACIVKTILSQKNKAGGITLPDFKLYYKATVIKTACNMNGLDAVIR SELMWEGCVRDCNSQHTFLSDQAPSLKSSAMALAMLRDWCRRMGVNAERSLLILDIPDDC EEHEFQEAVRAALSPLGRAYQELRPFSGREQPGCEEESFESWMEHAKDMLQLWYHASERE RKRWLLESLGGPALDVVSGLLEEDPNLAALDCQALLDCWAAAEVPDLNAGALCLPGMPGR PAADGPGEGGRLPGPGQPPVTAVGAASGLPWRSTPEYPEGDALERRPPYFLGLLQLIQEM EAWAASPVRSQHVVAWPVATVESEDPVAAQAAPACGDAAQASSAQEDASQADPGVEDAAE TAPATKEAARSTPAIREASRLAGTTGVQPPYPEPQSQSVAVPAGKMRPSTKRNGHRGEFG LGGDSPEVTPALAARGRPRITNGPEELAAPSHAQGSPGGHGSGTPDNGRQGCPGTRLLNS GTRAAFITLYRFQFVMCSFCQQLASLLALLLWHESKESRGEQAPAHWPTCCASEGEPQSE RTSALGARGARPPSAAAAAAAHSPRRALWGPRPEGTLPWEKYPQRFEDMPLTLLQDWCRG EHLNTRRCMLILGIPEDCGEDEFEETLQEACRHLGRYRVIGRMFRREENAQAILLELAQD IDYALLPREIPGKGGPWEVIVKPRNSDGEFLNRLNRFLEEERRTVSDMNRVLGSDTNCSA PRVTISPEFWTWAQTLGAAVQPLLEQMLYRELRVFSGNTISIPGALAFDAWLEHTTEMLQ MWQVPEGEKRRRLMECLRGPALQVVSGLRASNASITVEECLAALQQVFGPVESHKIAQVK LCKAYQEAGEKVSSFVLRLEPLLQRAVENNVVSRRNVNQTRLKRVLSGATLPDKLRDKLK LMKQRRKPPGFLALVKLLREEEEWEATLGPDRESLEGLEVAPRPPARITGVGAVPLPASG NSFDARPSQGYRRRRGRGQHRRGGVARAGSRGSRKRKRHTFCYSCGEDGHIRVQCINPSN LLLVKQKKQAAVESGNGNWAWDKSHPKSKAK >gi568815575f:152973063_153174259|GENSCAN_predicted_CDS_2|4236_bp atggataaattcctggacacatacacccacccaagactaaaccaggaggaagttgaatcc ctgaatagaccaataacagactctgaaattgaggcaataattaatagcctaccaaccaga aaaagtccaggaccagacggattcacagccgaattctaccagaggtacaagcaggaactg gtaccattccttcagaaaatattccaatcaatagaaaaagagggaatcctccctaactca ttttatgaggccagcatcatcctgataccaaagcctggcagagacacaacaaaaaaagag aattttagaccaatatccctgatgaacatcgatgtaaaaattctcaataaaatactggca aacagaatccagcagcaaatcaaaaagcttatccaccacgatcaagtgggcttcatccca gggatgcaaggctggttcaacatacacaaatcaataaatgtaatccagcatataaacaga accaaagacaaaaaccacatgattatctcaatagatgcagaaaaggcctttgacaaaatt caacagcccttcatgctaaaatctctcaataaattagtgttggaagttctggccagggca atcaggcaggaaaaagaaatgaagggtattcagttaggaaaagaggaagtcaaactgtcc ctgtttgcagatgatatgattgtatatttagaaaaccccatcatctcagcccaagatctc cttaagctgataagcaacttcagcaaagtctcaggatacgaaattaatgtgcaaaaatca 
caagcatccttatacaccaataacagacaaacagagagccaaatcatgagtgaactccca ttcacaattgcttcaaagagaataaaatacctaggaatccaacttacaagagatgtgaag gacctcttcaaggagaactacaaaccactgctcaacgaaataaaagaggacacaaacaaa tggaagaacattccatgctcatgggtaggaagaatcaatattgtgaaaatggccatactg cccaaggtaatttatagattcaatgccatcctcatcaagttaccaatgactttcttcaca gaactgaaaacaactactttaaagttcatatggaaccaaaaaagagcctgcattgtcaag acaatcctaagccaaaagaacaaagctggaggcatcacgctacctgacttcaaactatac tacaaagctacagtaatcaaaacagcatgcaacatgaatggactggatgctgttatccga agtgaattgatgtgggaaggctgcgtcagagactgcaacagtcagcacacattcctgtct gatcaggctccctccctcaagtcctctgcgatggctctggcgatgcttcgggactggtgc aggaggatgggtgtgaacgcagagcgctctctgctcatcctggatatccctgacgactgc gaggaacatgagttccaggaggccgtgcgggctgccctgtcgcccctgggcagggcctac caggaactgagacccttttcagggagggagcagccaggctgcgaggaagagtcctttgag agctggatggagcacgccaaggatatgctgcagctgtggtaccatgcgtcggaaagggag aggaagagatggctgctggagagcttgggtggcccagccctggacgttgtgagcggcctc ctagaggaagatcccaacttggcggcactggactgccaggcgctgctggactgctgggca gcggctgaagttcctgacctgaacgcaggggccctttgccttcctggtatgcctggaagg cctgctgcagatggccctggagaagggggccgtctgcccggccctggccaaccacctgtg actgcggtaggtgcagcctcaggcctgccctggcgaagcactccagaataccctgagggg gatgcactggagaggaggccgccctacttcctggggctgcttcagctcatccaggagatg gaggcgtgggcggcctccccagtgaggagccagcatgttgtggcctggccagtggccaca gtggaaagtgaagatccagttgccgcccaggcagctcctgcctgtggagatgctgctcag gcctcctcagcccaggaagacgccagccaggctgaccctggcgtggaagatgctgctgag actgctcctgccaccaaagaggccgccaggagcacccctgccattagggaagcctcccga ctagctgggaccacaggcgtgcaaccaccatacccggaaccacagtcacagtcggtcgca gtgcccgctgggaaaatgcgaccgtccacaaagcgaaacggccatcggggcgagttcggc ctcggcggggattctcctgaggtcacgcccgctctggccgccagaggacgcccgaggatc acgaatgggcccgaggagctggctgccccttcccacgctcaagggagccctggcggccat ggctctgggacaccagacaacggtcgtcagggctgtccaggaaccagacttcttaattct gggacccgtgctgctttcattacactctaccgcttccagtttgtgatgtgctccttctgt caacagttagcaagcctgctggctctgttgctctggcatgagtccaaagagagccgcggc gagcaggcgccggcgcactggcccacgtgctgcgcgagcgagggagagccacagtctgag 
cgaacgtccgcgctgggagccaggggtgcccgacccccgtccgccgccgccgccgccgcc gcgcatagcccccggagagccctctggggaccccgaccagaagggaccttgccctgggag aagtatccgcagagattcgaggacatgccgttgaccttgttacaggactggtgtcggggg gaacacctgaacacccggaggtgcatgctcatcctggggatccccgaggactgtggcgag gatgagtttgaggagacactccaggaggcttgcaggcacctgggcagatacagggtgatt ggcaggatgtttaggagggaggagaacgcccaggcgattctactggagctggcacaagat atcgactatgctttgctcccaagggaaataccaggaaagggggggccctgggaagtgatt gtaaaaccccgtaactcagatggggaatttctcaacagactgaaccgcttcttagaggag gagaggcggaccgtgtcagatatgaaccgagtcctcgggtcggacaccaattgttcggct ccaagagtgactatatcaccagagttctggacctgggcccagactctgggggcagcagtg cagcctctgctagaacaaatgttgtaccgagaactaagagtgttttctgggaacaccata tccatcccaggtgcactggcctttgatgcctggcttgagcacaccactgagatgctacag atgtggcaggtgcccgagggggaaaagaggcggaggctgatggaatgcttacggggccct gctctccaggtggtcagtgggctccgggccagcaatgcttccataactgtggaggagtgc ctggctgccttgcagcaggtgttcggacctgtggagagccataaaattgcccaggtgaag ttgtgtaaagcctatcaggaggcaggagagaaagtatctagctttgtgttacgtttggaa cccctgctccaaagagctgtagaaaacaatgtggtatcacgtagaaacgtgaatcagact cgcctgaaacgagtcttaagtggggccacccttcctgacaaactccgagataagcttaag ctgatgaaacagcgaaggaagcctcctggtttcctggccctggtgaagctcctgcgtgag gaggaggaatgggaggccactttaggtccagatagggagagtctggaggggctggaagta gccccaaggccacctgccaggatcactggggttggggcagtacctctccctgcctctggc aacagttttgatgcgaggccttcccagggctaccggcgccggaggggcagaggccaacac cgaaggggtggtgtggcaagggctggctctcgaggctcaagaaaacggaaacgccacaca ttctgctatagctgtggggaagacggccacatcagggtacagtgcatcaacccctccaac ctgctcttggtaaagcagaagaaacaggctgcagttgagtcgggaaacgggaactgggct tgggacaagagccatcccaagtccaaggccaagtag >gi568815575f:152973063_153174259|GENSCAN_predicted_peptide_3|469_aa MVKGTKPSGLNILALKCLRGIHVGRWMGCGDLGSDPSSSPHTPAAPCHPGPRWEPYPAPR VSPALARIAGMAVTMLQDWCRWMGVNARRGLLILGIPEDCDDAEFQESLEAALRPMGHFT VLGKAFREEDNATAALVELDREVNYALVPREIPGTGGPWNVVFVPRCSGEEFLGLGRVFH FPEQEGQMVESVAGALGVGLRRVCWLRSIGQAVQPWVEAVRCQSLGVFSGRDQPAPGEES FEVWLDHTTEMLHVWQGVSERERRRRLLEGLRGTALQLVHALLAENPARTAQDCLAALAQ VFGDNESQATIRVKCLTAQQQSGERLSAFVLRLEVLLQKAMEKEALARASADRVRLRQML 
TRAHLTEPLDEALRKLRMAGRSPSFLEMLGLVRESEAWEASLARSVRAQTQEGAGARAGA QAVARASTKVEAVPGGPGREPEGLLQAGGQEAEELLQEGLKPVLEECDN >gi568815575f:152973063_153174259|GENSCAN_predicted_CDS_3|1410_bp atggtgaaggggacgaagccctccggtttgaacatcttagctctgaaatgtctgcggggc atccacgtgggcagatggatgggctgtggagacctgggctccgaccccagttcatccccc cacacccccgccgccccgtgccaccctggtccgcgctgggaaccctatcctgcccctcgt gtcagcccggcactggccagaatcgcgggcatggcggtgaccatgctgcaggactggtgc cggtggatgggggtcaacgctcgcaggggcctgctcatcctgggcatcccggaggactgt gatgatgccgaattccaagagtccctcgaggctgccctgaggcctatgggacactttaca gtgctaggcaaagcgtttcgagaggaggataatgccaccgcggccctggtcgagctcgac cgggaagtcaactatgctttggtccccagggaaatccccggcactgggggcccgtggaac gtggtctttgtgccccgttgctcaggcgaggagtttctcggtctcggtcgcgtgttccac ttcccggagcaagaggggcagatggtggagagcgtggccggcgccctgggtgtggggctg cgcagggtgtgctggctgcgatccatcggtcaggcggtccagccctgggtggaggccgtg aggtgccagagcctgggcgtgttttccgggagggaccagccagccccaggggaggagtcc tttgaggtctggctagaccacaccaccgaaatgctgcatgtgtggcagggggtctcggaa agggagaggaggaggaggctgctggaaggcttgcgtgggaccgccctgcagctcgtgcac gcgctcctggcggagaaccccgccaggacggcgcaggactgtctggcggccctggcccag gtgtttggagacaacgagtcccaggcgaccatccgggtgaagtgtctgaccgctcagcag cagtcaggcgagcgtctctcagctttcgtgttgcggctggaagtgctgctgcagaaggcc atggagaaggaggccctggccagagcatccgccgaccgcgtgcgcctgaggcagatgctc accagggcccaccttactgagcctctggatgaagcactgaggaagctgagaatggccggg aggtctccaagtttcttggagatgctggggctcgttcgggagtctgaggcatgggaggcc agtctagccaggagcgtgagagcccagacacaggaaggggccggtgcccgggctggtgcc caggctgttgccagagccagcactaaagtagaggcggtcccaggaggtcctggtcgggag ccagagggcctcctccaggcaggaggccaggaggctgaggagctcctccaggaggggctc aagcccgtcctggaggaatgtgataactag >gi568815575f:152973063_153174259|GENSCAN_predicted_peptide_4|972_aa MTAPQRGSRKQQLEVSGMCAKIPNGGSHGTTGGNGKGACPYRKRLSKSHEGTGVSGGRDP RYPELLVANLYASAVTAILASSEESIQLRGIGQKGGPRTFVIGFKAHSGNVGSSPLKMLN FMISAKTPFPNKVTFARSEGEEYRKGGHGAWYRPMGGFDMAMVTGKPKATCTVTRQPTLN TRKERRGTPLQARSIDRVKDLINLAFKVFNNREEAAKQQLISELQLLASAAHGFTLTQDW QIEFTHMPCVRKFKYLLVWVNTFTGKADERLFWDELVELSLEQRAGGRDVGTEPHGSPGV 
CVQIQEKEYVLPDKHTEASHLNSSTEMEWWNPFGEWWNTGKAVLEEEEETVSDELLQKKK GREPSGAESKRALEEEASEQSEQAAAEPGPSDIVVHSKEALLGSRADNAPTQPVEAVRDT SEVSQLGGNQVSNICWARWLCAVPSAGNQGDQAKKMEASGSQTKQGPPIQFLYLYQGVCL PTGQGWVLQGRAQDTILEGETEPPPDAEPTGALILGFPASGIGNKKFLFFCCGDLGSDPS SSPHTPAAPCHPGPRWEPYPAPRVSPALARIAGMAVTMLQDWCRWMGVNARRGLLILGIP EDCDDAEFQESLEAALRPMGHFTVLGKAFREEDNATAALVELDREVNYALVPREIPGTGG PWNVVFVPRCSGEEFLGLGRVFHFPEQEGQMVESVAGALGVGLRRVCWLRSIGQAVQPWV EAVRCQSLGVFSGRDQPAPGEESFEVWLDHTTEMLHVWQGVSERERRRRLLEGLRGTALQ LVHALLAENPARTAQDCLAALAQVFGDNESQATIRVKCLTAQQQSGERLSAFVLRLEVLL QKAMEKEALARASADRVRLRQMLTRAHLTEPLDEALRKLRMAGRSPSFLEMLGLVRESEA WEASLARSVRAQTQEGAGARAGAQAVARASTKVEAVPGGPGREPEGLLQAGGQEAEELLQ EGLKPVLEECDN >gi568815575f:152973063_153174259|GENSCAN_predicted_CDS_4|2919_bp atgacagccccccagaggggatcccgaaagcagcagctggaagtcagtgggatgtgtgct aagatcccaaacgggggaagccacgggactacaggaggcaacggcaagggcgcctgcccc taccgaaagagactttctaaatctcatgaaggaacaggtgtcagcggtggcagagatcct cgttaccccgagttactggtggcgaatctgtacgcgtctgcagtaactgcaattctcgcc tcctcagaagaaagcattcaactgaggggtatagggcagaaagggggaccgaggacattt gtgattggatttaaggcccactcgggtaatgtaggatcatctcctctcaaaatgctcaac ttcatgatctctgcaaagaccccttttccaaataaggtcacatttgcaagatcagaaggt gaggagtatagaaaaggaggccatggggcctggtatagaccaatgggagggtttgacatg gcaatggtgacagggaaaccaaaagcaacttgcactgttaccagacaaccaaccctcaac accagaaaggaaagaagaggaacacccctacaagccaggtccatagatagggtcaaggat ttaatcaaccttgccttcaaggtgttcaataacagagaagaagctgccaagcagcaactt atctctgagttacaactacttgcctccgctgctcatggatttactctgacacaagattgg cagattgagtttactcatatgccctgtgtccgtaaatttaagtatctcctggtttgggtc aacaccttcaccgggaaagcagatgaacgcttgttctgggatgagttggttgagctcagc cttgaacaaagagcagggggcagggatgtgggaacagaaccccatggcagccctggagta tgcgttcaaatacaagaaaaggagtatgtcctaccagacaaacacacggaggcatctcat cttaattcttcaactgagatggaatggtggaatccttttggggaatggtggaacactggg aaggcagtgctggaagaggaggaagaaaccgtgagtgatgagttattgcaaaagaagaag ggcagagagccgtcgggagcagaatcaaagagggcattggaggaagaagcttctgagcag tcagagcaggccgcggccgagcctgggccctcggacattgtggtgcacagcaaagaagcc 
cttcttggttcaagagcagacaatgctcccactcaacctgtggaggctgtaagggacacg tccgaggtcagccagctgggaggcaaccaggtgtccaacatctgctgggcacgctggctc tgtgcagtgccttctgctggcaatcagggggaccaagccaaaaagatggaagcctctggg agccaaaccaagcaggggccacctatacaattcctctacctttaccagggagtctgcctg cccacagggcagggctgggtgctgcagggcagggcacaagacaccatcttggaaggagag actgagcccccaccagatgctgaacctactggtgccttgatcttgggcttcccagcctcc ggaattgggaacaagaaatttctgttcttttgctgtggagacctgggctccgaccccagt tcatccccccacacccccgccgccccgtgccaccctggtccgcgctgggaaccctatcct gcccctcgtgtcagcccggcactggccagaatcgcgggcatggcggtgaccatgctgcag gactggtgccggtggatgggggtcaacgctcgcaggggcctgctcatcctgggcatcccg gaggactgtgatgatgccgaattccaagagtccctcgaggctgccctgaggcctatggga cactttacagtgctaggcaaagcgtttcgagaggaggataatgccaccgcggccctggtc gagctcgaccgggaagtcaactatgctttggtccccagggaaatccccggcactgggggc ccgtggaacgtggtctttgtgccccgttgctcaggcgaggagtttctcggtctcggtcgc gtgttccacttcccggagcaagaggggcagatggtggagagcgtggccggcgccctgggt gtggggctgcgcagggtgtgctggctgcgatccatcggtcaggcggtccagccctgggtg gaggccgtgaggtgccagagcctgggtgtgttttccgggagggaccagccagccccaggg gaggagtcctttgaggtctggctagaccacaccaccgaaatgctgcatgtgtggcagggg gtctcggaaagggagaggaggaggaggctgctggaaggcttgcgtgggaccgccctgcag ctcgtgcacgcgctcctggcggagaaccccgccaggacggcgcaggactgtctggcggcc ctggcccaggtgtttggagacaacgagtcccaggcgaccatccgggtgaagtgtctgacc gctcagcagcagtcaggcgagcgtctctcagctttcgtgttgcggctggaagtgctgctg cagaaggccatggagaaggaggccctggccagagcatccgccgaccgcgtgcgcctgagg cagatgctcaccagggcccaccttactgagcctctggatgaagcactgaggaagctgaga atggccgggaggtctccaagtttcttggagatgctggggctcgttcgggagtctgaggca tgggaggccagtctagccaggagcgtgagagcccagacacaggaaggggccggtgcccgg gctggtgcccaggctgttgccagagccagcactaaagtagaggcggtcccaggaggtcct ggtcgggagccagagggcctcctccaggcaggaggccaggaggctgaggagctcctccag gaggggctcaagcccgtcctggaggaatgtgataactag >gi568815575f:152973063_153174259|GENSCAN_predicted_peptide_5|1988_aa MDSEYVLCSWKGRLWPAKVLCTRGTSPKTKPEKAISLEVQILAVDEKIKVKSTDVKTPTK FEMEDIAASAAAQTKLGAPLREKMGYRGTLRVALEILKERTNLGGGRKPHELESTTPSQL 
SQKVPEKPASSVPREDDWRCKGDLRRSLGKRENPSSPTVPSESKRALRDDRSQEPTAIAP TPGALPGDRSGAPRAIAPTPGAMLSGRSRARRAIAPTPSALRGYRSWAHRAIAPTPGCLY SDRSRAHRAIAPARGTKHGGRSWACRSIAPKPGSLCGDRSQASRAIDPTLGARRGGRSRA HRAIAPTPGSLCGNRSRACGAIALTPGVLCGVRSRVPKDITPTPGALRGYKSWVCRAIAP TPGALRGDRSAARTAIVRTPGALGRDRSRARSAIASTPGTLQGNRSSVSKAIAPAPGALR GDRSAARTAFVPTPGALHRDRSRARSAIASTPGTLRGNTSSACKAIAPTPGALRGYKSWA RRAIAPNPGAWRGYRSTTGTAIAPNLGALGGNRSAARTDIAPTPGALRGYRSWTRRAIAP TPGTLSSYRSPVRRAIAPTPGTLSGYRSRARTAIAPTPGTLRGYRPRSRRAIASTPATLR GEKSRAHTSLAPAPGALRGDGSRARRAIVPTTCPLCEIWSRVGIGIAPIADALRRDRPPV RRAIAPTPGALRCDRSRELTAIDPTPGALCSDRSGASRAIAPTPGTLCSERSRVRRAIAP TPCALCGKGSQVGMGVAPTPGALRRDRSQAGRAIAPTPSALFRVGSRVGTGIALPAGALH RDRSPVRRAVAPTPGTLHCDGSRKCTATGSTPGALPGDRSGVSKATAPAPGALCSERSRA RRSIAPTPCLLCGDRSWVGMGIAPTPGALLGGKSRKCRAIAITPGALRGGRSQKRRVVAP TPEALHGDGSWTYMAIAPTPGALHGDSSPAHTSIIPSPGALHGDGPPAHMAFPSTPGTLH GDASHAHMAIAPTPGTMRGDSSTARTATAPSPGALRGDRSWKRKAIASTPGALHGNRSDR SRKCKAIASTPGTLHVERSPALRAIVPTPGTLGRDSSPGRTSIIPSPGALHGDRSPAHLD IASTPGALHGDSSQAHTAIAPTPGTMRGDSSTARMAIAPSAGALRGDRSWKRKAIASTPG ALRGNRSDRSRKRKAIASTPGALLGNRSDRSRKRKAIAPTPGAPRIDRSPACRAIAPTPG ALGDDSSTAIAPTPGTPRGDSSPANTAIASTPGALHGDTSQTHKAIAPTPGDLGGGSSSA HKAIAPSPGALHGDRSPAHTAIASTPGALHGDSSQVHTTIAPTPGALRDDKSWKRKAIAP TPGTLHCDSSRTCTAFAPTPGALHADRSPAHQDITLTSGALHCDSSRESRAVAPILGALH RVGSQAHKAIASTPGPLRGDSSPFHTAIAPMPGALHGTRSWKREAISQTPGTLCGDSSGE RMAIAPTPGALHSDRSQTHTAIDPTPSVLRSDSSPACMAIDPTPGALGRDRSQALMAIAP TPVGMQAHVLQSPRACQDSLTLSRHVCEKKGKKRANASTLMSLPPTVTEEGASLPPGLTS PAPPALKEETQDSRPKKALAASPESSPFSGNIQDPGEGAWKPGWAGMAASSGSRQHRLPS SLRLANRKRKRPGPDFQRRPQGPQTPGDAKLANPVTTIQRAGGKQDGQPPSLAFPQEPHP IERGTMVWFKFQDHPFWPAVVKSVSNTDKTARVLLLEANLHHGKRGIQVPLRRLKHLDCK EKEKLLKRAQKAYKQSVNWCFSLISHYREGLVRGSFRGSFLDYYAADISYPIRRAIQEGD LQIDFPKVNYGDLEDWEEETSLGGKRPCKKILPDRMRASWDRDNQKLVDFIVRRKGADPH LLDILQGRKQSRWLTAFLKPHRDLHCIETYLEDDDQLEVVAKHLQEIYKQIDKARLTLIR DDKVNFVLEVLLPEAMICTIAALDGLDYKAAEEKYLRGPPVHYREKELFDRNILKKARRE PATTHTAN >gi568815575f:152973063_153174259|GENSCAN_predicted_CDS_5|5967_bp 
atggactcggagtacgtcctatgctcttggaaaggccgactatggccagcaaaggttttg tgcacacgtgggacttcaccaaaaacgaagcctgaaaaggcgatttctctagaagttcaa atcctcgcagtagatgaaaaaatcaaggtgaaaagcacagacgtgaagaccccaactaag tttgagatggaagacattgccgcctctgcagcagcacagacgaagctcggtgccccactc agagagaagatggggtacagaggaacccttcgggtggccctggagattctgaaagagaga acaaatctgggtggaggaaggaaaccacatgaactagagagcaccacaccctctcagctt tctcaaaaggtgcccgaaaagccagccagttctgtccctcgtgaagatgactggagatgc aaaggcgacctaaggaggagtcttgggaagagggaaaacccaagctcaccgacggtccct tcagagagtaagcgtgccctgcgggatgacaggtcgcaggagcccacagccattgcccct actccaggcgccctgcccggggacaggtcaggggcgcccagggccattgcccctactcca ggagccatgctcagtggcaggtcacgggcacgcagggccattgcccctacaccaagcgcc ctgcgaggttacaggtcttgggcgcacagggccattgcccctaccccaggctgcctgtac agtgacaggtcacgggcgcacagggccattgcccctgctcgaggcaccaagcatggtggc aggtcatgggcatgcaggtccattgcccccaaaccaggctccctgtgcggggacaggtca caggcgagcagggccattgaccctaccttaggcgccaggcgcggtggcaggtcacgggcc cacagagccattgcccctactccaggctccctgtgcggcaacaggtcacgggcttgcgga gccattgcccttactccaggtgtcctgtgcggtgtcaggtcacgggtgccaaaggacatt acccctactccaggcgccctgcgaggttacaagtcatgggtgtgcagggccattgcccct actccaggtgccctgcgcggagacaggtcagcagcacgcacggccattgtccgtactcca ggtgcccttggcagggacaggtcacgggcacgcagcgccattgcttctactccagggacc ctgcagggaaacaggtcatctgtgtccaaggccattgcccctgctccaggtgccctgcgt ggagacaggtcagcagcgcgcacggcctttgtccctactccaggcgcccttcacagggac aggtcacgggcccgcagcgccattgcttctactccagggaccctgcggggaaacacgtca tctgcgtgcaaggccattgcccctactccaggtgccctgcgaggttacaagtcatgggcg cgcagggccattgcacctaacccaggtgcctggcgcggttacaggtcaacgacaggcacc gccattgcacctaatctgggcgccctgggcggcaacaggtcagcggcacgcacggacatt gcccctactccaggcgccctgcgaggttacaggtcatggacgcgcagggccattgcccct actccaggcactctgagcagttacaggtcaccagtgcgcagggccattgcccctactcca ggcactctgagcggttacaggtcacgggcacgcacggccattgcccctactccaggcacc ctgcgaggttacaggccacggtcccgcagggccattgcctctactccagccaccctgcgt ggtgaaaagtcacgggcgcacaccagccttgcccccgccccaggtgctttgcgcggtgac ggttcacgagcacgcagggccattgtccctactacatgcccattgtgcgagatatggtca 
cgggtgggcataggcattgcccctattgcagatgccctgcgccgtgacaggccaccagtg cgcagggccattgctcctactccaggcgccctgcgctgtgacaggtcacgagagctcaca gccattgatcctactccaggagctctgtgcagtgacaggtcaggggcaagcagggccatt gcccccactccaggcactttgtgcagtgaaaggtcccgggtgcgcagggccattgcccct actccatgcgcactgtgcgggaaggggtcacaggtgggcatgggcgttgcccctactcca ggtgcactgcgcagggacaggtcacaggcaggcagggccattgcacctactccatccgca ctgttcagggttgggtcacgggtgggcacaggcattgcccttcctgcaggtgccctgcac cgtgacaggtcaccggtgcgcagggccgttgctcccactccaggcaccctgcactgtgac gggtcacgaaaatgcacggccactggttctactccaggagccctgcccggcgacaggtca ggggtgagcaaggccactgcccctgctccaggcgccttgtgcagtgaaaggtcccgcgca cgcaggagcattgcccctactccatgtttactgtgcggggacaggtcatgggtgggtatg ggcattgcacctactccaggcgccctgcttggtggcaaatcaaggaaatgcagggccatt gctattactcccggcgccctgcgtggtggcaggtcacagaaacgcagggtcgttgctcct actccagaggccctgcacggtgacgggtcatggacttatatggccattgctcctactcca ggtgccctgcacggtgacagctcaccagcccacacgtccattattccctctccaggcgcc ctgcatggtgacgggccaccagcgcacatggcctttccttctactccaggcaccctgcat ggtgatgcctctcacgcacacatggccattgctcctactccaggcacgatgcgcggtgac agctcaacagcgcgcacggccactgccccatctccaggggccctgcgaggtgacaggtca tggaaacgcaaggccattgcttctactccaggtgccctgcatggtaacaggtctgacagg tcacggaaatgcaaggccattgcttctactccaggcaccctgcacgttgagaggtcacca gcactcagggccattgttccaactccaggcaccctgggccgtgacagctcaccagggcgc acgtccattattccttctccaggcgccctgcatggtgacaggtcaccagcacacctggac attgcttctactccaggcgccctgcacggtgacagctctcaggcacacacggccattgct cctactccaggcaccatgcgcggtgacagctcaacagctcgcatggccattgccccatct gcaggggccctgagaggtgacaggtcatggaaacgcaaggccattgcttctactccaggt gccctgcgtggtaacaggtctgacaggtcacggaaacgcaaagccattgcttctactcca ggtgccctgctcggtaacaggtctgacaggtcacggaaacgcaaggccattgctcctact ccaggtgccccgcgcattgacaggtcaccagcatgcagggccattgctcctactccaggc gccctgggtgatgacagctcaacggccattgcccctactccagggaccccgcgaggtgac agttcgccagcaaacacggccattgcttctactccaggcgccctgcacggtgacacctct cagacacacaaggccattgctcctactccaggtgacctgggcggtggcagctcatcagcg cacaaggccatcgctccttctccaggtgccctgcatggtgacaggtcaccagcacacacg 
gccattgcttctactccaggagccctgcatggtgacagctcccaggtgcacacgaccatt gctccaactccaggcgccctgcgcgatgacaagtcatggaaacgcaaggccattgctccc actccaggcaccctgcactgtgacagctcacgaacgtgcaccgcctttgctcctactcca ggcgccctgcatgctgacaggtcaccagcgcaccaggacattactcttacttcaggcgcc ctgcactgtgacagctcaagagaaagcagggccgttgctcctattctaggcgccctgcac cgtgtcggctctcaggcacacaaggctattgcttctactccaggccctctgcgcggtgac agctcaccatttcacacagccattgcacctatgccaggggccctgcatggtaccaggtca tggaaacgcgaagccatttctcagactccaggcaccctgtgtggtgacagctcaggagaa cgcatggccattgctcctactccaggcgccctgcacagtgacaggtcacagacacatacg gccatcgatcctactccaagtgtcctgcgcagtgacagctcaccagcgtgcatggccatt gatcctactccaggtgccctgggcagggacaggtcacaggcgctcatggccattgctcct actccagttggaatgcaagcacatgtgttgcaaagcccccgagcctgccaggattccctg acgctttcgcggcatgtttgtgagaaaaaggggaagaaaagggcaaacgcctcaactctt atgtccctgcctcccacagtaacggaggagggtgcatctctgcctccaggtctcaccagc cctgcaccccccgctctgaaggaagagacacaggacagccgcccgaagaaggccctggct gcatccccggaaagttctcccttctcggggaacattcaggaccccggagagggtgcctgg aagccaggctgggcaggtatggctgcatcctctgggtcccgtcagcacaggctgccttct tcactccggcttgccaatagaaaaaggaagcgtccaggtccagattttcagaggagacct caaggacctcagacgcctggtgacgctaagcttgctaatcctgtcaccaccattcaaagg gctggcggtaaacaggatgggcagccccccagccttgcttttccacaggagccacatccc atcgaaaggggaacgatggtctggttcaaatttcaagatcatccgttttggccagcagtg gtcaagagtgtcagcaacacagacaagaccgcgagggtgctcctgcttgaggccaacctg caccatggaaagcggggcattcaagttcctcttcgaaggctgaagcacctggattgtaag gagaaagagaaactgctgaagagagcccagaaggcctacaagcaaagcgtcaactggtgc ttctcactgatctcccactacagagaaggactggtccggggttctttccggggctctttc ctggactattatgccgcagacatcagctacccaatcaggagagccatccaagagggagac ctgcagattgactttccaaaggtgaattatggcgacctggaagactgggaggaggagacc tccctgggcgggaagaggccttgcaagaaaatcctcccggaccggatgagggcctcttgg gaccgagacaaccagaagctcgtggacttcatcgtgaggagaaagggggccgacccccac cttctggacatcttgcaaggcaggaagcagtccaggtggctgaccgcgtttctgaagcca cacagggatttgcactgcattgaaacatacctggaggatgacgatcagttggaagtcgtg gccaagcatttacaagaaatctacaagcaaattgacaaggccaggctgactctgataagg 
gatgacaaagtcaattttgttctggaagttcttctgccggaagccatgatttgtaccatc
gccgcacttgatgggctggattacaaggcagcagaggagaagtacctgcgaggaccacct
gtgcattaccgggagaaagagctgtttgacagaaatatcttaaagaaggcaagaagagaa
ccagcaaccacccatacagctaattag

>gi568815575f:152973063_153174259|GENSCAN_predicted_peptide_6|147_aa
MESIPGEDAVNVVVVEMTVRDLEYCINLVDTAVAGFKRVRYNFEGYSTMVQHGIQECTAI
PKVVILQKQMRKTCGNVFNVIVPPSKSIFLNSNKRLFQESEIVEGQRTKAFNALCEEFLQ
EVRDPERRDQLKPWQKNVDCEDFMDIY

>gi568815575f:152973063_153174259|GENSCAN_predicted_CDS_6|444_bp
atggaatccattcctggagaagatgctgtgaacgtagtagtagttgaaatgacagtacgg
gatttagagtattgcataaacttagttgatacagcagtggcagggtttaagagggttcgt
tacaactttgaaggatattcaactatggtgcagcacggaattcaggaatgcacagccatc
ccgaaagtagtcattttgcagaaacagatgaggaagacatgtggtaatgtgtttaatgta
attgtgcctcctagtaaaagcatatttctcaacagcaacaaaagactttttcaagaatct
gaaattgttgagggccagagaacaaaagcatttaatgcattatgtgaggaattcttacag
gaagtcagggaccctgaacggagggaccagttgaagccatggcagaagaacgtggattgt
gaagatttcatggacatttattag