GENSCAN 1.0 Date run: 4-Nov-116 Time: 01:37:41 Sequence gi568815575r:152975772_153176968 : 201197 bp : 47.12% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.13 Intr - 2178 2041 138 1 0 93 87 57 0.529 6.54 1.12 Intr - 2765 2720 46 2 1 114 87 79 0.965 8.38 1.11 Intr - 8219 8147 73 1 1 113 85 -17 0.402 -0.09 1.10 Intr - 8342 8306 37 0 1 59 115 50 0.458 2.12 1.09 Intr - 14201 13995 207 0 0 14 -4 188 0.349 1.25 1.08 Intr - 15839 14771 1069 2 1 -27 31 792 0.032 51.83 1.07 Intr - 16206 16131 76 2 1 110 69 38 0.044 3.62 1.06 Intr - 17015 16886 130 0 1 62 105 -5 0.044 -1.65 1.05 Intr - 18354 18262 93 1 0 93 96 108 0.131 12.04 1.04 Intr - 18926 18840 87 0 0 46 82 87 0.620 3.84 1.03 Intr - 23609 23515 95 1 2 73 36 58 0.390 -1.29 1.02 Intr - 25174 25067 108 0 0 41 89 109 0.853 5.70 1.01 Init - 26666 26491 176 2 2 83 47 79 0.822 2.12 1.00 Prom - 37407 37368 40 -4.96 2.00 Prom + 39737 39776 40 -4.36 2.01 Init + 40229 40805 577 1 1 70 40 200 0.826 8.70 2.02 Intr + 40956 41607 652 1 1 71 39 284 0.473 12.77 2.03 Intr + 43238 43287 50 2 2 62 93 56 0.550 1.82 2.04 Intr + 52943 53156 214 0 1 98 86 239 0.543 22.47 2.05 Intr + 53838 54087 250 0 1 27 -5 247 0.237 6.74 2.06 Intr + 54135 54603 469 2 1 24 62 259 0.599 9.58 2.07 Intr + 56884 57114 231 2 0 49 52 126 0.276 2.84 2.08 Intr + 57233 57290 58 0 1 64 86 54 0.487 0.74 2.09 Intr + 57572 57699 128 2 2 12 67 96 0.060 0.22 2.10 Intr + 80674 80867 194 2 2 55 87 83 0.077 4.11 2.11 Term + 81264 82676 1413 2 0 44 44 764 0.097 58.64 2.12 PlyA + 84669 84674 6 1.05 3.00 Prom + 89080 89119 40 -5.56 3.01 Init + 93357 93438 82 2 1 82 79 71 0.687 5.05 3.02 Term + 97164 98491 1328 1 2 109 39 1022 0.964 91.12 3.03 PlyA + 99226 99231 6 1.05 4.14 PlyA - 99238 99233 6 -5.80 4.13 Term - 101325 99998 1328 1 2 109 39 1001 0.961 89.02 4.12 Intr - 106262 106145 118 2 1 -24 78 91 0.100 -3.06 4.11 Intr - 107262 107080 183 0 0 110 35 49 0.551 1.78 4.10 Intr - 107621 107491 131 0 2 26 75 95 0.336 2.41 4.09 Intr - 107819 107741 79 2 1 36 56 73 0.405 -1.98 4.08 Intr - 108659 108561 99 2 0 28 92 81 0.502 2.81 4.07 Intr - 116287 116101 187 0 1 70 40 53 0.009 -1.61 4.06 Intr - 120826 120723 104 1 2 62 77 70 0.461 2.27 4.05 Intr - 123891 123799 93 0 0 31 50 102 0.421 0.86 4.04 Intr - 127510 127332 179 2 2 57 116 50 0.841 4.34 4.03 Intr - 131788 131664 125 0 2 69 100 78 0.956 7.33 4.02 Intr - 134308 134142 167 1 2 78 105 105 0.646 9.96 4.01 Init - 135845 135720 126 2 0 59 -4 113 0.156 -0.64 4.00 Prom - 139720 139681 40 -6.56 5.03 PlyA - 140347 140342 6 1.05 5.02 Term - 150690 146370 4321 2 1 67 36 2508 0.195 228.96 5.01 Init - 152552 150907 1646 2 2 87 47 803 0.202 67.66 5.00 Prom - 153325 153286 40 -6.36 6.00 Prom + 156198 156237 40 -3.76 6.01 Init + 168366 168513 148 2 1 29 92 100 0.654 4.75 6.02 Intr + 172826 172935 110 0 2 89 106 41 0.730 6.00 6.03 Intr + 177564 177665 102 2 0 58 82 51 0.468 1.87 6.04 Term + 189710 189793 84 1 0 69 32 94 0.127 -0.45 6.05 PlyA + 190609 190614 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 18354 18176 179 1 2 93 50 133 0.826 7.85 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575r:152975772_153176968|GENSCAN_predicted_peptide_1|779_aa MAEGKEEQVTSYMDGSRQRDGSRQRERELVQGKPHDSIISHQVLPTTHGKYGSYKMRFRS RWLTLGIDSANGYLPNGIALPQPEWLPDDIECPVRPLMKKFGTKSGGQSSVLSSPFSGVC MPADQFVLSGRCKPQDLPSVEETCFGKRQEAVWVLASWRSQHGGGSRHRRVVVALRGNSG LEAPPADPVKQDPSRLSQELPNWIELKSRASLKWLCYCSIRAASRANPARHPKHTQQSPL DFEEEPTAGGSQQGVKVGAMALTLLEDWCKGMDMDPRKALLIVGIPMECSEVEIQDTVKA GLQPLCAYRVLGRMFRREDNAKAVFIELADTVNYTTLPSHIPGKGGSWEVVVKPRNPDDE FLSRLNYFLKDEGRSMTDVARALGCCSLPAESLDAEVMPQVRSPPLEPPKESMWYRKLKV FSGTASPSPGEETFEDWLEQVTEIMPIWQVSEVEKRRRLLESLRGPALSIMRVLQANNDS ITVEQCLDALKQIFGDKEDFRASQFRFLQTSPKIGEKVSTFLLRLEPLLQKAVHKSPLSV RSTDMIRLKHLLARVAMTPALRGKLELLDQRGCPPNFLELMKLIRDEEEWENTEAVMKNK EKPSGRGRGASDGDPSEAGAEEGVAGGSGLVLWTPPNIYPVDFELSMPTGTQATWDLEEP AWGLPLPEGARAAEEELWGRSSSENLSIHYLVRIQHPLWPFWTPDVHLFLAWDHLLLPCE EGACIPITFCHDSIATEIGLLLCLSPHEMVTPQIHRLSVLLIDVSPVPHRRLDALYKEX >gi568815575r:152975772_153176968|GENSCAN_predicted_CDS_1|2337_bp atggcagaaggcaaagaggagcaagtcacgtcttacatggatggcagcaggcaaagagat ggcagcaggcaaagagagagagagcttgtgcagggaaaaccccatgattcaattatttcc caccaggtccttcccacaacacatgggaaatatgggagctacaagatgagatttaggtcc cgctggctgaccctgggcattgattctgccaatgggtacttgcctaatggaattgcccta cctcaaccagagtggctccctgacgatattgagtgccctgtcaggcccctcatgaagaag tttggaacaaagagtggtggtcagagttcagtcctcagctcccccttttctggggtctgc atgccagctgatcagtttgtactttcgggccgctgcaagccgcaggaccttccctctgtg gaagaaacctgctttgggaaacgacaggaagcagtttgggtactagcctcgtggcgcagt cagcacggaggcggcagccgccatagacgtgtggtggtcgcgctgcgcgggaactccggc ttggaggcacctccggcagatccagtgaagcaggacccatcacgcctgagtcaggaactg cctaactggatagagttgaaaagcagagcgagtctgaagtggctgtgttattgcagcatc cgcgctgccagcagggccaacccagcaaggcacccgaagcacacacagcagagtcctctg gactttgaggaagaacccacagcaggaggaagtcagcagggagtgaaagtgggagcaatg gcactgacactgctagaggattggtgcaaggggatggacatggaccccagaaaggccctg ctgattgtaggcatccccatggagtgtagtgaggtggaaattcaggacactgtgaaggca ggcttacagcccctgtgcgcatacagggtcctagggagaatgttcaggagggaagacaat gccaaggcagtcttcattgaactggctgacactgtcaattacactactctgcccagtcac ataccaggaaagggtggctcctgggaagtggtggtaaaaccccgtaacccagatgatgag tttctcagtagactgaactacttcctgaaagatgagggccgaagtatgacagatgtggcc agagccctgggatgttgcagcctccctgccgagagcctggatgcagaggtcatgccccaa gttagatccccacctttagagcctccgaaagaaagtatgtggtacaggaaactgaaagtg ttttcgggaactgcttcccctagcccaggcgaagagacctttgaagactggctagagcag gtcactgagataatgcccatatggcaagtgtctgaggtggagaagaggcggcgtttgctg gagagcttacgtgggcctgctctgtcaatcatgcgggtgctccaggccaacaatgactcc ataactgtggagcagtgccttgacgccctaaagcagatctttggggataaagaggacttt agagcctctcagtttaggtttctgcagacctctccgaagattggagagaaagtctccact ttcctgctgcgcttagagcccctgctgcagaaagccgtgcacaagagccccttgtcagtg cgcagcacagacatgattcgtctgaaacatctcttagctcgggtcgccatgacccccgcc ctcaggggcaagctggagctcttggatcagcgagggtgtcctcccaattttctggagtta atgaagctcattcgagatgaagaagagtgggagaacactgaggcagtgatgaagaataag gagaagccatcagggagaggccggggggcctccgatggggaccccagcgaagcaggggct gaagaaggtgtggctgggggctctggcctggtcctctggacacccccaaatatctaccct gtggactttgagctgagcatgcccactggcacccaggccacgtgggacctggaggagcct gcctggggcctgcctctgccagaaggagccagggctgctgaggaagagctgtggggcaga agctcatctgagaacctgtcgattcactaccttgtaaggatccagcaccctctgtggcct ttctggactcccgacgtccacctgttcctggcctgggatcacctcctcttgccttgtgaa gaaggtgcctgcatccccatcaccttctgccatgattctattgcgactgaaataggattg ttgctttgtctttctccccatgagatggtcactccacaaattcacagattgtctgtcttg ctcattgatgtatccccagtgccccacagaaggcttgatgcactgtacaaggaaann >gi568815575r:152975772_153176968|GENSCAN_predicted_peptide_2|1411_aa MDKFLDTYTHPRLNQEEVESLNRPITDSEIEAIINSLPTRKSPGPDGFTAEFYQRYKQEL VPFLQKIFQSIEKEGILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDVKILNKILA NRIQQQIKKLIHHDQVGFIPGMQGWFNIHKSINVIQHINRTKDKNHMIISIDAEKAFDKI QQPFMLKSLNKLVLEVLARAIRQEKEMKGIQLGKEEVKLSLFADDMIVYLENPIISAQDL LKLISNFSKVSGYEINVQKSQASLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVK DLFKENYKPLLNEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRFNAILIKLPMTFFT ELKTTTLKFIWNQKRACIVKTILSQKNKAGGITLPDFKLYYKATVIKTACNMNGLDAVIR SELMWEGCVRDCNSQHTFLSDQAPSLKSSAMALAMLRDWCRRMGVNAERSLLILDIPDDC EEHEFQEAVRAALSPLGRAYQELRPFSGREQPGCEEESFESWMEHAKDMLQLWYHASERE RKRWLLESLGGPALDVVSGLLEEDPNLAALDCQALLDCWAAAEVPDLNAGALCLPGMPGR PAADGPGEGGRLPGPGQPPVTAVGAASGLPWRSTPEYPEGDALERRPPYFLGLLQLIQEM EAWAASPVRSQHVVAWPVATVESEDPVAAQAAPACGDAAQASSAQEDASQADPGVEDAAE TAPATKEAARSTPAIREASRLAGTTGVQPPYPEPQSQSVAVPAGKMRPSTKRNGHRGEFG LGGDSPEVTPALAARGRPRITNGPEELAAPSHAQGSPGGHGSGTPDNGRQGCPGTRLLNS GTRAAFITLYRFQFVMCSFCQQLASLLALLLWHESKESRGEQAPAHWPTCCASEGEPQSE RTSALGARGARPPSAAAAAAAHSPRRALWGPRPEGTLPWEKYPQRFEDMPLTLLQDWCRG EHLNTRRCMLILGIPEDCGEDEFEETLQEACRHLGRYRVIGRMFRREENAQAILLELAQD IDYALLPREIPGKGGPWEVIVKPRNSDGEFLNRLNRFLEEERRTVSDMNRVLGSDTNCSA PRVTISPEFWTWAQTLGAAVQPLLEQMLYRELRVFSGNTISIPGALAFDAWLEHTTEMLQ MWQVPEGEKRRRLMECLRGPALQVVSGLRASNASITVEECLAALQQVFGPVESHKIAQVK LCKAYQEAGEKVSSFVLRLEPLLQRAVENNVVSRRNVNQTRLKRVLSGATLPDKLRDKLK LMKQRRKPPGFLALVKLLREEEEWEATLGPDRESLEGLEVAPRPPARITGVGAVPLPASG NSFDARPSQGYRRRRGRGQHRRGGVARAGSRGSRKRKRHTFCYSCGEDGHIRVQCINPSN LLLVKQKKQAAVESGNGNWAWDKSHPKSKAK >gi568815575r:152975772_153176968|GENSCAN_predicted_CDS_2|4236_bp atggataaattcctggacacatacacccacccaagactaaaccaggaggaagttgaatcc ctgaatagaccaataacagactctgaaattgaggcaataattaatagcctaccaaccaga aaaagtccaggaccagacggattcacagccgaattctaccagaggtacaagcaggaactg gtaccattccttcagaaaatattccaatcaatagaaaaagagggaatcctccctaactca ttttatgaggccagcatcatcctgataccaaagcctggcagagacacaacaaaaaaagag aattttagaccaatatccctgatgaacatcgatgtaaaaattctcaataaaatactggca aacagaatccagcagcaaatcaaaaagcttatccaccacgatcaagtgggcttcatccca gggatgcaaggctggttcaacatacacaaatcaataaatgtaatccagcatataaacaga accaaagacaaaaaccacatgattatctcaatagatgcagaaaaggcctttgacaaaatt caacagcccttcatgctaaaatctctcaataaattagtgttggaagttctggccagggca atcaggcaggaaaaagaaatgaagggtattcagttaggaaaagaggaagtcaaactgtcc ctgtttgcagatgatatgattgtatatttagaaaaccccatcatctcagcccaagatctc cttaagctgataagcaacttcagcaaagtctcaggatacgaaattaatgtgcaaaaatca caagcatccttatacaccaataacagacaaacagagagccaaatcatgagtgaactccca ttcacaattgcttcaaagagaataaaatacctaggaatccaacttacaagagatgtgaag gacctcttcaaggagaactacaaaccactgctcaacgaaataaaagaggacacaaacaaa tggaagaacattccatgctcatgggtaggaagaatcaatattgtgaaaatggccatactg cccaaggtaatttatagattcaatgccatcctcatcaagttaccaatgactttcttcaca gaactgaaaacaactactttaaagttcatatggaaccaaaaaagagcctgcattgtcaag acaatcctaagccaaaagaacaaagctggaggcatcacgctacctgacttcaaactatac tacaaagctacagtaatcaaaacagcatgcaacatgaatggactggatgctgttatccga agtgaattgatgtgggaaggctgcgtcagagactgcaacagtcagcacacattcctgtct gatcaggctccctccctcaagtcctctgcgatggctctggcgatgcttcgggactggtgc aggaggatgggtgtgaacgcagagcgctctctgctcatcctggatatccctgacgactgc gaggaacatgagttccaggaggccgtgcgggctgccctgtcgcccctgggcagggcctac caggaactgagacccttttcagggagggagcagccaggctgcgaggaagagtcctttgag agctggatggagcacgccaaggatatgctgcagctgtggtaccatgcgtcggaaagggag aggaagagatggctgctggagagcttgggtggcccagccctggacgttgtgagcggcctc ctagaggaagatcccaacttggcggcactggactgccaggcgctgctggactgctgggca gcggctgaagttcctgacctgaacgcaggggccctttgccttcctggtatgcctggaagg cctgctgcagatggccctggagaagggggccgtctgcccggccctggccaaccacctgtg actgcggtaggtgcagcctcaggcctgccctggcgaagcactccagaataccctgagggg gatgcactggagaggaggccgccctacttcctggggctgcttcagctcatccaggagatg gaggcgtgggcggcctccccagtgaggagccagcatgttgtggcctggccagtggccaca gtggaaagtgaagatccagttgccgcccaggcagctcctgcctgtggagatgctgctcag gcctcctcagcccaggaagacgccagccaggctgaccctggcgtggaagatgctgctgag actgctcctgccaccaaagaggccgccaggagcacccctgccattagggaagcctcccga ctagctgggaccacaggcgtgcaaccaccatacccggaaccacagtcacagtcggtcgca gtgcccgctgggaaaatgcgaccgtccacaaagcgaaacggccatcggggcgagttcggc ctcggcggggattctcctgaggtcacgcccgctctggccgccagaggacgcccgaggatc acgaatgggcccgaggagctggctgccccttcccacgctcaagggagccctggcggccat ggctctgggacaccagacaacggtcgtcagggctgtccaggaaccagacttcttaattct gggacccgtgctgctttcattacactctaccgcttccagtttgtgatgtgctccttctgt caacagttagcaagcctgctggctctgttgctctggcatgagtccaaagagagccgcggc gagcaggcgccggcgcactggcccacgtgctgcgcgagcgagggagagccacagtctgag cgaacgtccgcgctgggagccaggggtgcccgacccccgtccgccgccgccgccgccgcc gcgcatagcccccggagagccctctggggaccccgaccagaagggaccttgccctgggag aagtatccgcagagattcgaggacatgccgttgaccttgttacaggactggtgtcggggg gaacacctgaacacccggaggtgcatgctcatcctggggatccccgaggactgtggcgag gatgagtttgaggagacactccaggaggcttgcaggcacctgggcagatacagggtgatt ggcaggatgtttaggagggaggagaacgcccaggcgattctactggagctggcacaagat atcgactatgctttgctcccaagggaaataccaggaaagggggggccctgggaagtgatt gtaaaaccccgtaactcagatggggaatttctcaacagactgaaccgcttcttagaggag gagaggcggaccgtgtcagatatgaaccgagtcctcgggtcggacaccaattgttcggct ccaagagtgactatatcaccagagttctggacctgggcccagactctgggggcagcagtg cagcctctgctagaacaaatgttgtaccgagaactaagagtgttttctgggaacaccata tccatcccaggtgcactggcctttgatgcctggcttgagcacaccactgagatgctacag atgtggcaggtgcccgagggggaaaagaggcggaggctgatggaatgcttacggggccct gctctccaggtggtcagtgggctccgggccagcaatgcttccataactgtggaggagtgc ctggctgccttgcagcaggtgttcggacctgtggagagccataaaattgcccaggtgaag ttgtgtaaagcctatcaggaggcaggagagaaagtatctagctttgtgttacgtttggaa cccctgctccaaagagctgtagaaaacaatgtggtatcacgtagaaacgtgaatcagact cgcctgaaacgagtcttaagtggggccacccttcctgacaaactccgagataagcttaag ctgatgaaacagcgaaggaagcctcctggtttcctggccctggtgaagctcctgcgtgag gaggaggaatgggaggccactttaggtccagatagggagagtctggaggggctggaagta gccccaaggccacctgccaggatcactggggttggggcagtacctctccctgcctctggc aacagttttgatgcgaggccttcccagggctaccggcgccggaggggcagaggccaacac cgaaggggtggtgtggcaagggctggctctcgaggctcaagaaaacggaaacgccacaca ttctgctatagctgtggggaagacggccacatcagggtacagtgcatcaacccctccaac ctgctcttggtaaagcagaagaaacaggctgcagttgagtcgggaaacgggaactgggct tgggacaagagccatcccaagtccaaggccaagtag >gi568815575r:152975772_153176968|GENSCAN_predicted_peptide_3|469_aa MVKGTKPSGLNILALKCLRGIHVGRWMGCGDLGSDPSSSPHTPAAPCHPGPRWEPYPAPR VSPALARIAGMAVTMLQDWCRWMGVNARRGLLILGIPEDCDDAEFQESLEAALRPMGHFT VLGKAFREEDNATAALVELDREVNYALVPREIPGTGGPWNVVFVPRCSGEEFLGLGRVFH FPEQEGQMVESVAGALGVGLRRVCWLRSIGQAVQPWVEAVRCQSLGVFSGRDQPAPGEES FEVWLDHTTEMLHVWQGVSERERRRRLLEGLRGTALQLVHALLAENPARTAQDCLAALAQ VFGDNESQATIRVKCLTAQQQSGERLSAFVLRLEVLLQKAMEKEALARASADRVRLRQML TRAHLTEPLDEALRKLRMAGRSPSFLEMLGLVRESEAWEASLARSVRAQTQEGAGARAGA QAVARASTKVEAVPGGPGREPEGLLQAGGQEAEELLQEGLKPVLEECDN >gi568815575r:152975772_153176968|GENSCAN_predicted_CDS_3|1410_bp atggtgaaggggacgaagccctccggtttgaacatcttagctctgaaatgtctgcggggc atccacgtgggcagatggatgggctgtggagacctgggctccgaccccagttcatccccc cacacccccgccgccccgtgccaccctggtccgcgctgggaaccctatcctgcccctcgt gtcagcccggcactggccagaatcgcgggcatggcggtgaccatgctgcaggactggtgc cggtggatgggggtcaacgctcgcaggggcctgctcatcctgggcatcccggaggactgt gatgatgccgaattccaagagtccctcgaggctgccctgaggcctatgggacactttaca gtgctaggcaaagcgtttcgagaggaggataatgccaccgcggccctggtcgagctcgac cgggaagtcaactatgctttggtccccagggaaatccccggcactgggggcccgtggaac gtggtctttgtgccccgttgctcaggcgaggagtttctcggtctcggtcgcgtgttccac ttcccggagcaagaggggcagatggtggagagcgtggccggcgccctgggtgtggggctg cgcagggtgtgctggctgcgatccatcggtcaggcggtccagccctgggtggaggccgtg aggtgccagagcctgggcgtgttttccgggagggaccagccagccccaggggaggagtcc tttgaggtctggctagaccacaccaccgaaatgctgcatgtgtggcagggggtctcggaa agggagaggaggaggaggctgctggaaggcttgcgtgggaccgccctgcagctcgtgcac gcgctcctggcggagaaccccgccaggacggcgcaggactgtctggcggccctggcccag gtgtttggagacaacgagtcccaggcgaccatccgggtgaagtgtctgaccgctcagcag cagtcaggcgagcgtctctcagctttcgtgttgcggctggaagtgctgctgcagaaggcc atggagaaggaggccctggccagagcatccgccgaccgcgtgcgcctgaggcagatgctc accagggcccaccttactgagcctctggatgaagcactgaggaagctgagaatggccggg aggtctccaagtttcttggagatgctggggctcgttcgggagtctgaggcatgggaggcc agtctagccaggagcgtgagagcccagacacaggaaggggccggtgcccgggctggtgcc caggctgttgccagagccagcactaaagtagaggcggtcccaggaggtcctggtcgggag ccagagggcctcctccaggcaggaggccaggaggctgaggagctcctccaggaggggctc aagcccgtcctggaggaatgtgataactag >gi568815575r:152975772_153176968|GENSCAN_predicted_peptide_4|972_aa MTAPQRGSRKQQLEVSGMCAKIPNGGSHGTTGGNGKGACPYRKRLSKSHEGTGVSGGRDP RYPELLVANLYASAVTAILASSEESIQLRGIGQKGGPRTFVIGFKAHSGNVGSSPLKMLN FMISAKTPFPNKVTFARSEGEEYRKGGHGAWYRPMGGFDMAMVTGKPKATCTVTRQPTLN TRKERRGTPLQARSIDRVKDLINLAFKVFNNREEAAKQQLISELQLLASAAHGFTLTQDW QIEFTHMPCVRKFKYLLVWVNTFTGKADERLFWDELVELSLEQRAGGRDVGTEPHGSPGV CVQIQEKEYVLPDKHTEASHLNSSTEMEWWNPFGEWWNTGKAVLEEEEETVSDELLQKKK GREPSGAESKRALEEEASEQSEQAAAEPGPSDIVVHSKEALLGSRADNAPTQPVEAVRDT SEVSQLGGNQVSNICWARWLCAVPSAGNQGDQAKKMEASGSQTKQGPPIQFLYLYQGVCL PTGQGWVLQGRAQDTILEGETEPPPDAEPTGALILGFPASGIGNKKFLFFCCGDLGSDPS SSPHTPAAPCHPGPRWEPYPAPRVSPALARIAGMAVTMLQDWCRWMGVNARRGLLILGIP EDCDDAEFQESLEAALRPMGHFTVLGKAFREEDNATAALVELDREVNYALVPREIPGTGG PWNVVFVPRCSGEEFLGLGRVFHFPEQEGQMVESVAGALGVGLRRVCWLRSIGQAVQPWV EAVRCQSLGVFSGRDQPAPGEESFEVWLDHTTEMLHVWQGVSERERRRRLLEGLRGTALQ LVHALLAENPARTAQDCLAALAQVFGDNESQATIRVKCLTAQQQSGERLSAFVLRLEVLL QKAMEKEALARASADRVRLRQMLTRAHLTEPLDEALRKLRMAGRSPSFLEMLGLVRESEA WEASLARSVRAQTQEGAGARAGAQAVARASTKVEAVPGGPGREPEGLLQAGGQEAEELLQ EGLKPVLEECDN >gi568815575r:152975772_153176968|GENSCAN_predicted_CDS_4|2919_bp atgacagccccccagaggggatcccgaaagcagcagctggaagtcagtgggatgtgtgct aagatcccaaacgggggaagccacgggactacaggaggcaacggcaagggcgcctgcccc taccgaaagagactttctaaatctcatgaaggaacaggtgtcagcggtggcagagatcct cgttaccccgagttactggtggcgaatctgtacgcgtctgcagtaactgcaattctcgcc tcctcagaagaaagcattcaactgaggggtatagggcagaaagggggaccgaggacattt gtgattggatttaaggcccactcgggtaatgtaggatcatctcctctcaaaatgctcaac ttcatgatctctgcaaagaccccttttccaaataaggtcacatttgcaagatcagaaggt gaggagtatagaaaaggaggccatggggcctggtatagaccaatgggagggtttgacatg gcaatggtgacagggaaaccaaaagcaacttgcactgttaccagacaaccaaccctcaac accagaaaggaaagaagaggaacacccctacaagccaggtccatagatagggtcaaggat ttaatcaaccttgccttcaaggtgttcaataacagagaagaagctgccaagcagcaactt atctctgagttacaactacttgcctccgctgctcatggatttactctgacacaagattgg cagattgagtttactcatatgccctgtgtccgtaaatttaagtatctcctggtttgggtc aacaccttcaccgggaaagcagatgaacgcttgttctgggatgagttggttgagctcagc cttgaacaaagagcagggggcagggatgtgggaacagaaccccatggcagccctggagta tgcgttcaaatacaagaaaaggagtatgtcctaccagacaaacacacggaggcatctcat cttaattcttcaactgagatggaatggtggaatccttttggggaatggtggaacactggg aaggcagtgctggaagaggaggaagaaaccgtgagtgatgagttattgcaaaagaagaag ggcagagagccgtcgggagcagaatcaaagagggcattggaggaagaagcttctgagcag tcagagcaggccgcggccgagcctgggccctcggacattgtggtgcacagcaaagaagcc cttcttggttcaagagcagacaatgctcccactcaacctgtggaggctgtaagggacacg tccgaggtcagccagctgggaggcaaccaggtgtccaacatctgctgggcacgctggctc tgtgcagtgccttctgctggcaatcagggggaccaagccaaaaagatggaagcctctggg agccaaaccaagcaggggccacctatacaattcctctacctttaccagggagtctgcctg cccacagggcagggctgggtgctgcagggcagggcacaagacaccatcttggaaggagag actgagcccccaccagatgctgaacctactggtgccttgatcttgggcttcccagcctcc ggaattgggaacaagaaatttctgttcttttgctgtggagacctgggctccgaccccagt tcatccccccacacccccgccgccccgtgccaccctggtccgcgctgggaaccctatcct gcccctcgtgtcagcccggcactggccagaatcgcgggcatggcggtgaccatgctgcag gactggtgccggtggatgggggtcaacgctcgcaggggcctgctcatcctgggcatcccg gaggactgtgatgatgccgaattccaagagtccctcgaggctgccctgaggcctatggga cactttacagtgctaggcaaagcgtttcgagaggaggataatgccaccgcggccctggtc gagctcgaccgggaagtcaactatgctttggtccccagggaaatccccggcactgggggc ccgtggaacgtggtctttgtgccccgttgctcaggcgaggagtttctcggtctcggtcgc gtgttccacttcccggagcaagaggggcagatggtggagagcgtggccggcgccctgggt gtggggctgcgcagggtgtgctggctgcgatccatcggtcaggcggtccagccctgggtg gaggccgtgaggtgccagagcctgggtgtgttttccgggagggaccagccagccccaggg gaggagtcctttgaggtctggctagaccacaccaccgaaatgctgcatgtgtggcagggg gtctcggaaagggagaggaggaggaggctgctggaaggcttgcgtgggaccgccctgcag ctcgtgcacgcgctcctggcggagaaccccgccaggacggcgcaggactgtctggcggcc ctggcccaggtgtttggagacaacgagtcccaggcgaccatccgggtgaagtgtctgacc gctcagcagcagtcaggcgagcgtctctcagctttcgtgttgcggctggaagtgctgctg cagaaggccatggagaaggaggccctggccagagcatccgccgaccgcgtgcgcctgagg cagatgctcaccagggcccaccttactgagcctctggatgaagcactgaggaagctgaga atggccgggaggtctccaagtttcttggagatgctggggctcgttcgggagtctgaggca tgggaggccagtctagccaggagcgtgagagcccagacacaggaaggggccggtgcccgg gctggtgcccaggctgttgccagagccagcactaaagtagaggcggtcccaggaggtcct ggtcgggagccagagggcctcctccaggcaggaggccaggaggctgaggagctcctccag gaggggctcaagcccgtcctggaggaatgtgataactag >gi568815575r:152975772_153176968|GENSCAN_predicted_peptide_5|1988_aa MDSEYVLCSWKGRLWPAKVLCTRGTSPKTKPEKAISLEVQILAVDEKIKVKSTDVKTPTK FEMEDIAASAAAQTKLGAPLREKMGYRGTLRVALEILKERTNLGGGRKPHELESTTPSQL SQKVPEKPASSVPREDDWRCKGDLRRSLGKRENPSSPTVPSESKRALRDDRSQEPTAIAP TPGALPGDRSGAPRAIAPTPGAMLSGRSRARRAIAPTPSALRGYRSWAHRAIAPTPGCLY SDRSRAHRAIAPARGTKHGGRSWACRSIAPKPGSLCGDRSQASRAIDPTLGARRGGRSRA HRAIAPTPGSLCGNRSRACGAIALTPGVLCGVRSRVPKDITPTPGALRGYKSWVCRAIAP TPGALRGDRSAARTAIVRTPGALGRDRSRARSAIASTPGTLQGNRSSVSKAIAPAPGALR GDRSAARTAFVPTPGALHRDRSRARSAIASTPGTLRGNTSSACKAIAPTPGALRGYKSWA RRAIAPNPGAWRGYRSTTGTAIAPNLGALGGNRSAARTDIAPTPGALRGYRSWTRRAIAP TPGTLSSYRSPVRRAIAPTPGTLSGYRSRARTAIAPTPGTLRGYRPRSRRAIASTPATLR GEKSRAHTSLAPAPGALRGDGSRARRAIVPTTCPLCEIWSRVGIGIAPIADALRRDRPPV RRAIAPTPGALRCDRSRELTAIDPTPGALCSDRSGASRAIAPTPGTLCSERSRVRRAIAP TPCALCGKGSQVGMGVAPTPGALRRDRSQAGRAIAPTPSALFRVGSRVGTGIALPAGALH RDRSPVRRAVAPTPGTLHCDGSRKCTATGSTPGALPGDRSGVSKATAPAPGALCSERSRA RRSIAPTPCLLCGDRSWVGMGIAPTPGALLGGKSRKCRAIAITPGALRGGRSQKRRVVAP TPEALHGDGSWTYMAIAPTPGALHGDSSPAHTSIIPSPGALHGDGPPAHMAFPSTPGTLH GDASHAHMAIAPTPGTMRGDSSTARTATAPSPGALRGDRSWKRKAIASTPGALHGNRSDR SRKCKAIASTPGTLHVERSPALRAIVPTPGTLGRDSSPGRTSIIPSPGALHGDRSPAHLD IASTPGALHGDSSQAHTAIAPTPGTMRGDSSTARMAIAPSAGALRGDRSWKRKAIASTPG ALRGNRSDRSRKRKAIASTPGALLGNRSDRSRKRKAIAPTPGAPRIDRSPACRAIAPTPG ALGDDSSTAIAPTPGTPRGDSSPANTAIASTPGALHGDTSQTHKAIAPTPGDLGGGSSSA HKAIAPSPGALHGDRSPAHTAIASTPGALHGDSSQVHTTIAPTPGALRDDKSWKRKAIAP TPGTLHCDSSRTCTAFAPTPGALHADRSPAHQDITLTSGALHCDSSRESRAVAPILGALH RVGSQAHKAIASTPGPLRGDSSPFHTAIAPMPGALHGTRSWKREAISQTPGTLCGDSSGE RMAIAPTPGALHSDRSQTHTAIDPTPSVLRSDSSPACMAIDPTPGALGRDRSQALMAIAP TPVGMQAHVLQSPRACQDSLTLSRHVCEKKGKKRANASTLMSLPPTVTEEGASLPPGLTS PAPPALKEETQDSRPKKALAASPESSPFSGNIQDPGEGAWKPGWAGMAASSGSRQHRLPS SLRLANRKRKRPGPDFQRRPQGPQTPGDAKLANPVTTIQRAGGKQDGQPPSLAFPQEPHP IERGTMVWFKFQDHPFWPAVVKSVSNTDKTARVLLLEANLHHGKRGIQVPLRRLKHLDCK EKEKLLKRAQKAYKQSVNWCFSLISHYREGLVRGSFRGSFLDYYAADISYPIRRAIQEGD LQIDFPKVNYGDLEDWEEETSLGGKRPCKKILPDRMRASWDRDNQKLVDFIVRRKGADPH LLDILQGRKQSRWLTAFLKPHRDLHCIETYLEDDDQLEVVAKHLQEIYKQIDKARLTLIR DDKVNFVLEVLLPEAMICTIAALDGLDYKAAEEKYLRGPPVHYREKELFDRNILKKARRE PATTHTAN >gi568815575r:152975772_153176968|GENSCAN_predicted_CDS_5|5967_bp atggactcggagtacgtcctatgctcttggaaaggccgactatggccagcaaaggttttg tgcacacgtgggacttcaccaaaaacgaagcctgaaaaggcgatttctctagaagttcaa atcctcgcagtagatgaaaaaatcaaggtgaaaagcacagacgtgaagaccccaactaag tttgagatggaagacattgccgcctctgcagcagcacagacgaagctcggtgccccactc agagagaagatggggtacagaggaacccttcgggtggccctggagattctgaaagagaga acaaatctgggtggaggaaggaaaccacatgaactagagagcaccacaccctctcagctt tctcaaaaggtgcccgaaaagccagccagttctgtccctcgtgaagatgactggagatgc aaaggcgacctaaggaggagtcttgggaagagggaaaacccaagctcaccgacggtccct tcagagagtaagcgtgccctgcgggatgacaggtcgcaggagcccacagccattgcccct actccaggcgccctgcccggggacaggtcaggggcgcccagggccattgcccctactcca ggagccatgctcagtggcaggtcacgggcacgcagggccattgcccctacaccaagcgcc ctgcgaggttacaggtcttgggcgcacagggccattgcccctaccccaggctgcctgtac agtgacaggtcacgggcgcacagggccattgcccctgctcgaggcaccaagcatggtggc aggtcatgggcatgcaggtccattgcccccaaaccaggctccctgtgcggggacaggtca caggcgagcagggccattgaccctaccttaggcgccaggcgcggtggcaggtcacgggcc cacagagccattgcccctactccaggctccctgtgcggcaacaggtcacgggcttgcgga gccattgcccttactccaggtgtcctgtgcggtgtcaggtcacgggtgccaaaggacatt acccctactccaggcgccctgcgaggttacaagtcatgggtgtgcagggccattgcccct actccaggtgccctgcgcggagacaggtcagcagcacgcacggccattgtccgtactcca ggtgcccttggcagggacaggtcacgggcacgcagcgccattgcttctactccagggacc ctgcagggaaacaggtcatctgtgtccaaggccattgcccctgctccaggtgccctgcgt ggagacaggtcagcagcgcgcacggcctttgtccctactccaggcgcccttcacagggac aggtcacgggcccgcagcgccattgcttctactccagggaccctgcggggaaacacgtca tctgcgtgcaaggccattgcccctactccaggtgccctgcgaggttacaagtcatgggcg cgcagggccattgcacctaacccaggtgcctggcgcggttacaggtcaacgacaggcacc gccattgcacctaatctgggcgccctgggcggcaacaggtcagcggcacgcacggacatt gcccctactccaggcgccctgcgaggttacaggtcatggacgcgcagggccattgcccct actccaggcactctgagcagttacaggtcaccagtgcgcagggccattgcccctactcca ggcactctgagcggttacaggtcacgggcacgcacggccattgcccctactccaggcacc ctgcgaggttacaggccacggtcccgcagggccattgcctctactccagccaccctgcgt ggtgaaaagtcacgggcgcacaccagccttgcccccgccccaggtgctttgcgcggtgac ggttcacgagcacgcagggccattgtccctactacatgcccattgtgcgagatatggtca cgggtgggcataggcattgcccctattgcagatgccctgcgccgtgacaggccaccagtg cgcagggccattgctcctactccaggcgccctgcgctgtgacaggtcacgagagctcaca gccattgatcctactccaggagctctgtgcagtgacaggtcaggggcaagcagggccatt gcccccactccaggcactttgtgcagtgaaaggtcccgggtgcgcagggccattgcccct actccatgcgcactgtgcgggaaggggtcacaggtgggcatgggcgttgcccctactcca ggtgcactgcgcagggacaggtcacaggcaggcagggccattgcacctactccatccgca ctgttcagggttgggtcacgggtgggcacaggcattgcccttcctgcaggtgccctgcac cgtgacaggtcaccggtgcgcagggccgttgctcccactccaggcaccctgcactgtgac gggtcacgaaaatgcacggccactggttctactccaggagccctgcccggcgacaggtca ggggtgagcaaggccactgcccctgctccaggcgccttgtgcagtgaaaggtcccgcgca cgcaggagcattgcccctactccatgtttactgtgcggggacaggtcatgggtgggtatg ggcattgcacctactccaggcgccctgcttggtggcaaatcaaggaaatgcagggccatt gctattactcccggcgccctgcgtggtggcaggtcacagaaacgcagggtcgttgctcct actccagaggccctgcacggtgacgggtcatggacttatatggccattgctcctactcca ggtgccctgcacggtgacagctcaccagcccacacgtccattattccctctccaggcgcc ctgcatggtgacgggccaccagcgcacatggcctttccttctactccaggcaccctgcat ggtgatgcctctcacgcacacatggccattgctcctactccaggcacgatgcgcggtgac agctcaacagcgcgcacggccactgccccatctccaggggccctgcgaggtgacaggtca tggaaacgcaaggccattgcttctactccaggtgccctgcatggtaacaggtctgacagg tcacggaaatgcaaggccattgcttctactccaggcaccctgcacgttgagaggtcacca gcactcagggccattgttccaactccaggcaccctgggccgtgacagctcaccagggcgc acgtccattattccttctccaggcgccctgcatggtgacaggtcaccagcacacctggac attgcttctactccaggcgccctgcacggtgacagctctcaggcacacacggccattgct cctactccaggcaccatgcgcggtgacagctcaacagctcgcatggccattgccccatct gcaggggccctgagaggtgacaggtcatggaaacgcaaggccattgcttctactccaggt gccctgcgtggtaacaggtctgacaggtcacggaaacgcaaagccattgcttctactcca ggtgccctgctcggtaacaggtctgacaggtcacggaaacgcaaggccattgctcctact ccaggtgccccgcgcattgacaggtcaccagcatgcagggccattgctcctactccaggc gccctgggtgatgacagctcaacggccattgcccctactccagggaccccgcgaggtgac agttcgccagcaaacacggccattgcttctactccaggcgccctgcacggtgacacctct cagacacacaaggccattgctcctactccaggtgacctgggcggtggcagctcatcagcg cacaaggccatcgctccttctccaggtgccctgcatggtgacaggtcaccagcacacacg gccattgcttctactccaggagccctgcatggtgacagctcccaggtgcacacgaccatt gctccaactccaggcgccctgcgcgatgacaagtcatggaaacgcaaggccattgctccc actccaggcaccctgcactgtgacagctcacgaacgtgcaccgcctttgctcctactcca ggcgccctgcatgctgacaggtcaccagcgcaccaggacattactcttacttcaggcgcc ctgcactgtgacagctcaagagaaagcagggccgttgctcctattctaggcgccctgcac cgtgtcggctctcaggcacacaaggctattgcttctactccaggccctctgcgcggtgac agctcaccatttcacacagccattgcacctatgccaggggccctgcatggtaccaggtca tggaaacgcgaagccatttctcagactccaggcaccctgtgtggtgacagctcaggagaa cgcatggccattgctcctactccaggcgccctgcacagtgacaggtcacagacacatacg gccatcgatcctactccaagtgtcctgcgcagtgacagctcaccagcgtgcatggccatt gatcctactccaggtgccctgggcagggacaggtcacaggcgctcatggccattgctcct actccagttggaatgcaagcacatgtgttgcaaagcccccgagcctgccaggattccctg acgctttcgcggcatgtttgtgagaaaaaggggaagaaaagggcaaacgcctcaactctt atgtccctgcctcccacagtaacggaggagggtgcatctctgcctccaggtctcaccagc cctgcaccccccgctctgaaggaagagacacaggacagccgcccgaagaaggccctggct gcatccccggaaagttctcccttctcggggaacattcaggaccccggagagggtgcctgg aagccaggctgggcaggtatggctgcatcctctgggtcccgtcagcacaggctgccttct tcactccggcttgccaatagaaaaaggaagcgtccaggtccagattttcagaggagacct caaggacctcagacgcctggtgacgctaagcttgctaatcctgtcaccaccattcaaagg gctggcggtaaacaggatgggcagccccccagccttgcttttccacaggagccacatccc atcgaaaggggaacgatggtctggttcaaatttcaagatcatccgttttggccagcagtg gtcaagagtgtcagcaacacagacaagaccgcgagggtgctcctgcttgaggccaacctg caccatggaaagcggggcattcaagttcctcttcgaaggctgaagcacctggattgtaag gagaaagagaaactgctgaagagagcccagaaggcctacaagcaaagcgtcaactggtgc ttctcactgatctcccactacagagaaggactggtccggggttctttccggggctctttc ctggactattatgccgcagacatcagctacccaatcaggagagccatccaagagggagac ctgcagattgactttccaaaggtgaattatggcgacctggaagactgggaggaggagacc tccctgggcgggaagaggccttgcaagaaaatcctcccggaccggatgagggcctcttgg gaccgagacaaccagaagctcgtggacttcatcgtgaggagaaagggggccgacccccac cttctggacatcttgcaaggcaggaagcagtccaggtggctgaccgcgtttctgaagcca cacagggatttgcactgcattgaaacatacctggaggatgacgatcagttggaagtcgtg gccaagcatttacaagaaatctacaagcaaattgacaaggccaggctgactctgataagg gatgacaaagtcaattttgttctggaagttcttctgccggaagccatgatttgtaccatc gccgcacttgatgggctggattacaaggcagcagaggagaagtacctgcgaggaccacct gtgcattaccgggagaaagagctgtttgacagaaatatcttaaagaaggcaagaagagaa ccagcaaccacccatacagctaattag >gi568815575r:152975772_153176968|GENSCAN_predicted_peptide_6|147_aa MESIPGEDAVNVVVVEMTVRDLEYCINLVDTAVAGFKRVRYNFEGYSTMVQHGIQECTAI PKVVILQKQMRKTCGNVFNVIVPPSKSIFLNSNKRLFQESEIVEGQRTKAFNALCEEFLQ EVRDPERRDQLKPWQKNVDCEDFMDIY >gi568815575r:152975772_153176968|GENSCAN_predicted_CDS_6|444_bp atggaatccattcctggagaagatgctgtgaacgtagtagtagttgaaatgacagtacgg gatttagagtattgcataaacttagttgatacagcagtggcagggtttaagagggttcgt tacaactttgaaggatattcaactatggtgcagcacggaattcaggaatgcacagccatc ccgaaagtagtcattttgcagaaacagatgaggaagacatgtggtaatgtgtttaatgta attgtgcctcctagtaaaagcatatttctcaacagcaacaaaagactttttcaagaatct gaaattgttgagggccagagaacaaaagcatttaatgcattatgtgaggaattcttacag gaagtcagggaccctgaacggagggaccagttgaagccatggcagaagaacgtggattgt gaagatttcatggacatttattag