GENSCAN 1.0 Date run: 3-Nov-116 Time: 21:41:29 Sequence gi568815581r:29474705_29689378 : 214674 bp : 46.41% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 5664 5777 114 0 0 44 91 75 0.117 2.96 1.02 Intr + 7493 7584 92 2 2 73 87 50 0.173 3.04 1.03 Intr + 14960 15053 94 0 1 91 101 -20 0.293 -1.38 1.04 Intr + 17080 17161 82 1 1 71 86 15 0.446 -0.76 1.05 Intr + 20856 21023 168 2 0 74 94 178 0.996 17.14 1.06 Intr + 23614 23817 204 0 0 93 81 97 0.982 8.90 1.07 Intr + 27885 28019 135 2 0 113 84 -12 0.671 1.66 1.08 Intr + 33192 33428 237 2 0 57 24 216 0.876 9.91 1.09 Intr + 36160 36288 129 0 0 44 65 121 0.982 6.29 1.10 Intr + 42749 42952 204 1 0 77 107 138 0.998 14.00 1.11 Intr + 47633 47815 183 1 0 21 86 158 0.647 8.88 1.12 Intr + 55703 55915 213 1 0 88 78 125 0.998 10.41 1.13 Term + 59414 59602 189 1 0 108 42 171 0.490 11.95 1.14 PlyA + 60608 60613 6 1.05 2.05 PlyA - 61232 61227 6 1.05 2.04 Term - 68232 68053 180 0 0 104 46 170 0.978 12.01 2.03 Intr - 75160 75046 115 0 1 -26 111 76 0.034 -1.05 2.02 Intr - 88382 87861 522 1 0 54 62 516 0.142 37.66 2.01 Init - 92262 91382 881 0 2 89 94 1303 0.975 122.95 2.00 Prom - 93326 93287 40 -12.78 3.00 Prom + 93538 93577 40 -16.15 3.01 Init + 94055 94126 72 1 0 55 94 167 0.994 13.07 3.02 Intr + 94614 94655 42 2 0 101 109 -3 0.722 1.54 3.03 Intr + 96887 97015 129 1 0 117 79 66 0.998 9.49 3.04 Intr + 97438 97993 556 0 1 116 81 307 0.650 25.22 3.05 Term + 98108 98220 113 0 2 84 49 107 0.999 5.12 3.06 PlyA + 98433 98438 6 -1.75 4.20 PlyA - 98789 98784 6 1.05 4.19 Term - 100210 99998 213 1 0 65 43 538 0.999 44.23 4.18 Intr - 100438 100375 64 0 1 138 87 50 0.974 8.62 4.17 Intr - 100766 100584 183 1 0 31 84 141 0.988 6.90 4.16 Intr - 100999 100926 74 1 2 98 60 74 0.895 3.80 4.15 Intr - 101427 101374 54 0 0 62 102 86 0.989 6.58 4.14 Intr - 101746 101516 231 1 0 60 74 221 0.999 15.87 4.13 Intr - 101970 101818 153 0 0 88 78 342 0.609 33.47 4.12 Intr - 102292 102159 134 2 2 71 5 285 0.999 18.86 4.11 Intr - 102543 102432 112 0 1 119 75 186 0.996 20.35 4.10 Intr - 103038 102941 98 1 2 99 94 57 0.997 7.13 4.09 Intr - 103667 103595 73 2 1 93 109 178 0.999 19.38 4.08 Intr - 104075 104027 49 1 1 102 115 60 0.998 8.78 4.07 Intr - 106676 106634 43 0 1 65 109 57 0.997 2.80 4.06 Intr - 107132 107038 95 1 2 77 88 93 0.999 7.81 4.05 Intr - 107440 107223 218 1 2 64 94 247 0.999 20.20 4.04 Intr - 108099 107994 106 2 1 75 42 170 0.999 11.32 4.03 Intr - 108333 108221 113 0 2 126 81 107 0.999 12.98 4.02 Intr - 108912 108779 134 1 2 87 92 97 0.993 10.36 4.01 Init - 114674 114623 52 2 1 49 102 100 0.935 6.83 4.00 Prom - 116227 116188 40 -5.96 5.00 Prom + 116914 116953 40 -6.66 5.01 Init + 118492 119031 540 0 0 58 91 252 0.443 15.39 5.02 Intr + 133038 133173 136 2 1 124 80 142 0.999 17.14 5.03 Intr + 133282 133406 125 2 2 116 100 322 0.922 36.50 5.04 Intr + 133491 133536 46 2 1 119 86 60 0.911 6.88 5.05 Intr + 134147 134290 144 0 0 60 77 175 0.997 13.85 5.06 Intr + 134382 134571 190 1 1 90 90 267 0.963 25.74 5.07 Intr + 134651 134717 67 2 1 126 100 69 0.944 10.81 5.08 Intr + 135981 136062 82 2 1 106 90 103 0.749 11.51 5.09 Intr + 136875 136939 65 1 2 59 84 8 0.355 -4.06 5.10 Intr + 137172 137302 131 2 2 46 53 164 0.580 8.19 5.11 Intr + 137412 137569 158 0 2 33 77 234 0.863 16.45 5.12 Intr + 137698 137898 201 2 0 58 35 324 0.651 23.36 5.13 Intr + 137948 138111 164 0 2 83 96 241 0.993 24.09 5.14 Intr + 138183 138259 77 2 2 80 82 81 0.998 5.01 5.15 Term + 138650 138878 229 2 1 85 43 369 0.995 28.20 5.16 PlyA + 140035 140040 6 1.05 6.22 PlyA - 140359 140354 6 -3.74 6.21 Term - 141471 141028 444 0 0 53 41 721 0.766 59.14 6.20 Intr - 141632 141575 58 1 1 77 100 75 0.999 6.49 6.19 Intr - 142338 141998 341 0 2 62 65 394 0.915 28.77 6.18 Intr - 142915 142796 120 1 0 115 54 178 0.945 17.79 6.17 Intr - 143463 143344 120 0 0 2 82 114 0.743 2.89 6.16 Intr - 144267 144086 182 1 2 73 101 257 0.999 25.09 6.15 Intr - 144485 144356 130 2 1 117 41 131 0.999 11.57 6.14 Intr - 145069 144947 123 1 0 67 58 180 0.983 13.68 6.13 Intr - 146719 146520 200 2 2 99 100 308 0.999 32.17 6.12 Intr - 147239 147089 151 2 1 -26 28 153 0.247 -2.36 6.11 Intr - 148153 147984 170 2 2 7 80 186 0.120 9.37 6.10 Intr - 155699 155605 95 1 2 86 25 89 0.024 2.01 6.09 Intr - 158227 156267 1961 1 2 100 -6 827 0.054 61.83 6.08 Intr - 162098 161264 835 1 1 67 93 493 0.726 39.10 6.07 Intr - 173640 173440 201 2 0 27 113 188 0.387 13.60 6.06 Intr - 176096 175950 147 1 0 85 87 133 0.541 12.25 6.05 Intr - 180903 180857 47 0 2 116 98 70 0.986 8.01 6.04 Intr - 192291 192163 129 0 0 60 119 76 0.997 8.79 6.03 Intr - 192519 192426 94 2 1 85 94 76 0.997 7.87 6.02 Intr - 197425 197231 195 0 0 68 103 64 0.646 4.43 6.01 Intr - 209980 209859 122 1 2 87 80 81 0.732 6.49 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 88382 87857 526 1 1 54 48 506 0.816 36.94 S.002 Term - 158227 156137 2091 1 0 100 37 840 0.916 65.47 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:29474705_29689378|GENSCAN_predicted_peptide_1|681_aa XDIKAGNILLTEPGQVKLADFGSASMASPANSFVGTPYWMAPEVILAMDEGQYDGKVDVW SLGITCIELAERKPPLFNMNAMSALYHIAQNESPTLQSNEWSDYFRNFVDSCLQKIPQDR PTSEELLKHIFVLRERPETVLIDLIQRTKDAVRELDNLQYRKMKKLLFQEAHNGPAVEAQ EEEEEQDHGVGRTGTVNSVGSNQSIPSMSISASSQSSSVNSLPDVSDDKSELDMMEGDHT VMSNSSVIHLKPEEENYREEGDPRTRASDPQSPPQVSRHKSHYRNREHFATIRTASLVTR QMQEHEQDSELREQMSGYKRMRRQHQKQLMTLENKLKAEMDEHRLRLDKDLETQRNNFAA EMEKLIKKHQAAMEKEAKVMSNEEKKFQQHIQAQQKKELNSFLESQKREYKLRKEQLKEE LNENQSTPKKEKQEWLSKQKENIQHFQAEEEANLLRRQRQYLELECRRFKRRMLLGRHNL EQDLVREHESMQELEFRHLNTIQKMRCELIRLQHQTELTNQLEYNKRRERELRRKHVMEV RQQPKSLKSKELQIKKQFQDTCKIQTRQYKALRNHLLETTPKSEHKAVLKRLKEEQTRKL AILAEQYDHSINEMLSTQALRLDEAQEAECQVLKMQLQQELELLNAYQSKIKMQAEAQHD RELRELEQRVSLRRALLEQKV >gi568815581r:29474705_29689378|GENSCAN_predicted_CDS_1|2046_bp nnagatatcaaagcaggaaatatccttctgacagaaccaggccaggtgaaacttgctgac tttggctctgcttccatggcatcacctgccaattcctttgtgggaacgccgtattggatg gccccagaagtaattttagccatggatgaaggacaatatgatggcaaagtagatgtgtgg tctcttggaataacatgtattgaactagcggaaaggaagcctcctttatttaatatgaat gcaatgagtgccttatatcacatagcccaaaatgaatcccctacactacagtctaatgaa tggtctgattattttcgcaactttgtagattcttgcctccagaaaatccctcaagatcga cctacatcagaggaacttttaaagcacatatttgttcttcgggagcgccctgaaaccgtg ttaatagatctcattcagaggacaaaggatgcagtaagagagctggacaatctgcagtat cgaaagatgaagaaactccttttccaggaggcacataatggaccagcagtagaagcacag gaagaagaagaggaacaagatcatggtgttggccggacaggaacagttaatagtgttgga agtaatcaatccattcccagcatgtccatcagtgccagcagccaaagcagtagtgttaac agtcttccagatgtctcagatgacaagagtgagctagacatgatggagggagaccacaca gtgatgtctaacagttctgttatccatttaaaaccagaggaagaaaattacagagaagag ggagatcctagaacaagagcatcagatccacaatctccaccccaagtatctcgtcacaaa tcacactatcgtaatcgagaacactttgctactatacggacagcatcactggttacgagg caaatgcaagaacatgagcaggactctgagcttagagaacaaatgtctggctataagcga atgaggcgacaacatcaaaagcaactgatgactctggaaaacaagctaaaggctgagatg gatgaacatcgcctcagattagacaaagatcttgaaactcagcgtaacaattttgctgca gaaatggagaaacttatcaagaaacaccaggctgccatggagaaagaggctaaagtgatg tccaatgaagagaaaaaatttcagcaacatattcaggcccaacagaagaaagaactgaat agttttctcgagtcccagaaaagagagtataaacttcgaaaagagcagcttaaagaggag ctaaatgaaaaccagagtacccccaaaaaagaaaaacaggagtggctttcaaagcagaag gagaatatacagcatttccaagcagaagaagaagctaaccttcttcgacgtcaaagacaa tacctagagctggaatgccgtcgcttcaagagaagaatgttacttgggcgtcataactta gagcaggaccttgtcagggagcatgaatctatgcaagaactggagttccgccacctcaac acaattcagaagatgcgctgtgagttgatcagattacagcatcaaactgagctcactaac cagctggaatataataagcgaagagaacgagaactaagacgaaagcatgtcatggaagtt cgacaacagcctaagagtttgaagtctaaagaactccaaataaaaaagcagtttcaggat acctgcaaaatccaaaccagacagtacaaagcattaagaaatcacctgctggagactaca ccaaagagtgagcacaaagctgttctgaaacggctcaaggaggaacagacccggaaatta gctatcttggctgagcagtatgatcacagcattaatgaaatgctctccacacaagccctg cgtttggatgaagcacaggaagcagagtgccaggttttgaagatgcagctgcagcaggaa ctggagctgttgaatgcgtatcagagcaaaatcaagatgcaagctgaggcacaacatgat cgagagcttcgcgagcttgaacagagggtctccctccggagggcactcttagaacaaaag gtataa >gi568815581r:29474705_29689378|GENSCAN_predicted_peptide_2|565_aa MPPWGAALALILAVLALLGLLGPRLRGPWGRAVGERTLPGAQDRDDGEEADGGGPADQFS DGREPLPGGCSLVCKPSALAQCLLRALRRSEALEAGPRSWFSGPHLQTLCHFVLPVAPGP ELAREYLQLADDGLVALDWVVGPCVRGRRITSAGGLPAVLLVIPNAWGRLTRNVLGLCLL ALERGYYPVIFHRRGHHGCPLVSPRLQPFGDPSDLKEAVTYIRFRHPAAPLFAVSEGSGS ALLLSYLGECGSSSYVTGAACISPVLRCREWFEAGLPWPYERGFLLHQKIALSRYATALE DTVDTSRLFRSRSLREFEEALFCHTKSFPISWDTYWDRNDPLRDVDEAAVPVLCICSADD PVCGPPDHTLTTELFHSNPYFFLLLSRHGGHCGFLRQEPLPAWSHEVILESFRALTEFFR TEERIKGLSRHRASFLGGRRRGGALQRREVSSSSNLEEIFNWKRSYTRKKGGGFFDIVKC KEGQQHIWNGNQKRAVWQYTRTANILKLSASEPGGYCGLPYCYLEVPLALALKGDPMAGV HLALDGPKLVVGHPWDDPSEVLDPQ >gi568815581r:29474705_29689378|GENSCAN_predicted_CDS_2|1698_bp atgccgccgtggggcgccgccctcgcgctcatcttggccgtgctcgcccttctcggcctg ctcggcccgcggctccggggaccctgggggcgcgccgtcggagagaggaccctgccgggg gcccaagaccgagacgacggggaggaggcggacggcggaggcccggcggaccagttcagc gacgggcgcgagccactgccgggagggtgcagccttgtttgcaagccgtcggccctggcc cagtgcctgctgcgcgccctgcggcgctcagaggcgctggaggccggcccgcgctcctgg ttctccgggccccacctgcagaccctctgccacttcgtcctgcccgtagcgcctgggcct gagctggcccgggagtacctgcagttggcggacgatgggctagtggccctggactgggtg gtaggaccttgtgttcggggccgccggatcaccagcgccgggggccttcctgcggtgctt ctggtgatccccaatgcgtggggtcgcctcacccgcaacgtgctcggcctttgcttgctc gccctggagcgcggctactacccggtcatcttccatcgccgcggccaccacggttgccca ctggtcagcccccggctgcagcctttcggggacccgtccgacctcaaggaggcggtcaca tacatccgcttccgacacccggcggcgccgctgttcgcggtgagcgaaggctcgggctcg gcgctgctcctgtcctacctgggcgagtgcggctcctccagctacgtgacaggcgccgcc tgcatctcgcccgtgctgcgctgccgagagtggttcgaggccggcctgccctggccctac gagcggggctttctgctccaccagaagatcgccctcagcaggtatgccacagccctggag gacactgtggacaccagcagactgttcaggagccgttcccttcgagagtttgaggaggct ctcttctgccacaccaaaagcttccccatcagctgggatacctactgggaccgcaacgac ccgctccgggatgtcgatgaggcagccgtgcctgtgctgtgtatctgcagtgctgacgac cccgtgtgtggacccccagaccacactctgacaactgaactcttccacagcaacccctac ttcttcctcctgctcagtcgccacggaggccactgtggcttcctgcgccaggagcccttg ccagcctggagccatgaggtcatcttggagtccttccgggccttgactgagttcttccga acggaggagaggattaaagggctgagcaggcacagagcttccttccttgggggccgtcgt cgtgggggagccttgcagaggcgggaagtctcttcctcttccaacctggaggagatcttt aactggaagcgatcatacacaaggaagaagggtggtggtttctttgatattgtgaaatgc aaagaaggacagcagcacatttggaatgggaaccaaaagagagctgtctggcaatacacc aggacagccaacatcttgaagctgtccgcctcagagcctgggggctattgcggactccca tactgctacctcgaggtaccccttgcattggccctgaagggtgaccccatggctggggtc caccttgcattggatggccccaagcttgtggtgggccacccatgggatgaccccagtgag gtcctggacccccagtag >gi568815581r:29474705_29689378|GENSCAN_predicted_peptide_3|303_aa MAPPPPSPQLLLLAALARLLGPSEVSPRVTYTRVSPGQAEDVTFLYHPCAHPWLKLQLAL LAYACMANPSLTPDFSLTQDRALALAFALRSWRPPGTEVTSQGPRQPSSSGAKRRRLRAA LGPQPTRSALRFPSASPGSLKAKQSMAGIPGRESNAPSVPTVSLLPGAPGGNASSRTEAQ VPNGQGSPGGCVCSSQASPAPRAAAPPRAARGPTPRTEEAAWAAMALTFLLVLLTLATLC TRLHRNFRRGESIYWGPTADSQDTVAAVLKRRLLQPSRRVKRSRRRPLLPPTPDSGPEGE SSE >gi568815581r:29474705_29689378|GENSCAN_predicted_CDS_3|912_bp atggcgcctcctccgccttcgccccaactgcttctcctggcagccctcgcgaggctcctg ggtcccagcgaggtgtcaccaagagtgacctacacacgagtgagcccagggcaggctgag gatgtcaccttcctctaccacccctgtgcccatccctggctgaagctccagcttgccctc ctggcctatgcttgtatggctaacccttccctcacccctgacttcagcctcacgcaggat cgggccctggctctggcctttgctctgcggagctggcggccccctggcacagaggtgaca tctcaagggcccaggcagccctcttctagtggtgccaagaggcggaggctgcgggctgcc cttggtccccagcccactcgctcagccctgaggtttccctctgcttccccagggagcttg aaggccaagcagtccatggcgggaatccctggtagggagagtaatgccccatctgtgccc actgtctccctgctgccgggggcgcctggaggcaatgccagctccaggacagaggctcag gtgcccaacgggcaaggcagcccagggggctgtgtctgttcaagtcaggcttccccggcc cctcgcgcagcagcgcctccacgggcagcccggggccccaccccacgcactgaagaggcc gcctgggctgccatggccctgaccttcctgctggtgctgctcaccctggccacgctctgc acacggctgcacagaaacttccgacgcggggagagcatctactgggggcccacagcggac agccaggacacagtggctgctgtgctgaagcggaggctgctgcagccctcgcgccgggtc aagcgctcgcgccggagacccctcctcccgcccacgccggacagcggcccggaaggcgag agctcggagtga >gi568815581r:29474705_29689378|GENSCAN_predicted_peptide_4|732_aa MSRKGPRAEVCADCSAPDPGWASISRGVLVCDECCSVHRSLGRHISIVKHLRHSAWPPTL LQMVHTLASNGANSIWEHSLLDPAQVQSGRRKANPQDKVHPIKSEFIRAKYQMLAFVHKL PCRDDDGVTAKDLSKQLHSSVRTGNLETCLRLLSLGAQANFFHPEKGTTPLHVAAKAGQT LQAELLVVYGADPGSPDVNGRTPIDYARQAGHHELAERLVECQYELTDRLAFYLCGRKPD HKNGHYIIPQMADSLDLSELAKAAKKKLQALSNRLFEELAMDVYDEVDRRENDAVWLATQ NHSTLVTERSAVPFLPVNPEYSATRNQGRQKLARFNAREFATLIIDILSEAKRRQQGKSL SSPTDNLELSLRSQSDLDDQHDYDSVASDEDTDQEPLRSTGATRSNRARSMDSSDLSDGA VTLQEYLELKKALATSEAKVQQLMKVNSSLSDELRRLQREIHKLQAENLQLRQPPGPVPT PPLPSERAEHTPMAPGGSTHRRDRQAFSMYEPGSALKPFGGPPGDELTTRLQPFHSTELE DDAIYSVHVPAGLYRSKLSRHGSGADSDYENTQSGDPLLGLEGKRFLELGKEEDFHPELE SLDGDLDPGLPSTEDVILKTEQVTKNIQELLRAAQEFKHDSFVPCSEKIHLAVTEMASLF PKRPALEPVRSSLRLLNASAYRLQSECRKTVPPEPGAPVDFQLLTQQVIQCAYDIAKAAK QLVTITTREKKQ >gi568815581r:29474705_29689378|GENSCAN_predicted_CDS_4|2199_bp atgtcccgaaaggggccgcgagcggaggtgtgtgcggactgcagcgccccggaccctggc tgggcatccatcagcaggggtgtgctggtgtgtgacgagtgctgcagcgtgcaccggagc ctgggacgccacatctccattgtcaagcaccttcgccacagcgcctggcctcccacgctg ctgcagatggtgcacacgcttgccagcaacggggccaactccatctgggagcactccctg ctggaccccgcacaagtgcagagcggccggcgtaaagccaacccccaagacaaagtccac cccatcaagtcagagttcatcagggccaagtaccagatgctggcatttgtgcacaagctt ccctgccgggacgatgatggagtcaccgccaaagacctcagcaagcaactacactcgagc gtgcggacaggcaacctggagacatgtctgcgcctgctctccctgggtgcccaggccaac ttcttccacccagagaagggcaccacacctctgcacgtggctgccaaggcaggacagaca ctgcaggccgagctgcttgtagtgtatggggctgaccctggctcccctgatgttaatggc cgcacacccattgactatgccaggcaggcggggcaccatgagctggcggaaaggctggtt gagtgccaatatgagctcactgaccggctggccttctacctctgtggacgcaagccggat cacaagaatgggcattacatcatcccacagatggctgacagccttgacttatccgaattg gccaaagctgctaagaagaagctgcaggcgctcagcaaccggctttttgaggaactcgcc atggacgtgtatgacgaggtggatcgaagagaaaatgatgcagtgtggctggctacccaa aaccacagcactctggtgacagagcgcagtgccgtgcccttcctgcctgttaacccggaa tactcagccacgcggaatcaggggcgacaaaagctggcccgctttaatgcccgagagttt gccaccttgatcatcgacattctcagtgaggccaagcggagacagcagggcaagagcctg agcagccccacagacaacctcgagctgtctctgcggagccagagtgacctcgacgaccaa cacgactacgacagcgtggcctctgacgaggacacagaccaggagcccctgcgcagcacc ggcgccactcggagcaaccgggcccggagcatggactcctcggacttgtctgacggggct gtgacgctgcaggagtacctggagctgaagaaggccctggctacatcggaggcaaaggtg cagcagctcatgaaggtcaacagtagcctgagcgacgagctccggaggctgcagcgagag atccacaagctgcaggcggagaacctgcagctccggcagcctccagggccggtgcccaca cctccactccccagtgaacgggcggaacacacacccatggcgccaggcgggagcacacac cgcagggatcgccaggccttttccatgtatgaacctggctctgccctgaagccctttggg ggcccccctggggacgagctcactacgcggctgcagcctttccacagcactgagctagag gacgacgccatctattcagtgcacgtccctgctggcctttaccggagcaagctttcccgc cacggcagtggagccgacagtgactatgagaacacgcaaagtggggacccactgctgggg ctggaagggaagaggtttctagagctgggcaaagaggaagacttccacccagagctggaa agcctggatggagacctagatcctgggcttcccagcacagaggatgtcatcttgaagaca gagcaggtcaccaagaacattcaggaactgttgcgggcagcccaggagttcaagcatgac agcttcgtgccctgctcagagaagatccatttggctgtgaccgagatggcctccctcttc ccaaagaggccagccctggagccagtgcggagctcactgcggctgctcaacgccagcgcc taccggctgcagagtgagtgccggaagacagtgcccccagagcccggcgccccagtggac ttccagctgctgactcagcaggtgatccagtgcgcctatgacatcgccaaggctgccaag cagctggtcaccatcaccacccgagagaagaagcagtga >gi568815581r:29474705_29689378|GENSCAN_predicted_peptide_5|784_aa MQLWGPGGCGRASLRRRPRGPRPVRPRARCGRLLPPPSGVFVCGGVGGERESAQRRRVPA PSQGGWGPLRAARRRRLARTPVARAARTAPGAPPPPAAARTCPSRSPASRQRRPPAPRPR APAPAPSLLPGRAPRPRHEERAMIPANASARKGPEGKYPLHYLVWHNRHRELEKEVRAGQ VDIEQLDPRGRTPLHLATTLGHLECARVLLAHGADVGRENRSGWTVLQEAVSTRDLELVQ LVLRYRDYQRVVKRLAGIPVLLEKLRKAQDFYVEMKWEFTSWVPLVSKICPSDTYKVWKS GQNLRVDTTLLGFDHMTWQRGNRSFVFRGQDTSAVVMEIDHDRRVVYTETLALAGQDREL LLAAAQPTEEQVLSRLTAPVVTTQLDTKNISFERNKTGILGWRSEKTEMVNGYEAKVYGA SNVELITRTRTEHLSEQHKGKVKGCKTPLQSFLGIAEQHGGPQNGTLITQTLSQANPTAI TAEEYFNPNFELGNRDMGRPMELTTKTQKFKAKLWLCEEHPLSLCEQVAPIIDLMAVSNA LFAKLRDFITLRLPPGFPVKIEIPIFHILNARITFGNLNGCDEPVPSVRGSPSSETPSPG SDSSSVSSSSSTSEAPRENACPSALPGVASCRGCEISPALFEAPRGYSMMGGQREAATRD DDDDLLQFAIQQSLLEAGSEYDQVTIWEALTNSKPGTHPMSYEGRRQDRSAPPTPQRQPA PPASVPSPRPSSGPGSGGHVFRSYDEQLRLAMELSAQEQEERRRRARQEEEELERILRLS LTEQ >gi568815581r:29474705_29689378|GENSCAN_predicted_CDS_5|2355_bp atgcagctgtggggacccgggggctgcgggcgcgcgtccctgcggcggcgtccccggggc ccgcgtcccgtgcgcccccgcgcccgctgcgggcgcctgctccctccgccgagcggcgtc tttgtgtgcgggggtgtgggaggcgagcgcgagtccgcgcagcgccgccgagtgcccgct ccctcccagggcgggtgggggcctctccgcgccgcccgccgccgccgcctcgcgaggacg cccgtcgcccgcgccgcccgcaccgcgccgggcgcgccgcccccgcccgccgccgctcgc acatgcccgagccgcagccccgcgagcaggcagcgccggccccccgccccgcggccccgg gccccggctccggcgccgtccctcctccccggccgggcgccgcggccccggcatgaggag cgggcgatgatccccgccaacgcctccgccaggaaggggcccgagggcaagtatccgctg cactacctcgtgtggcacaaccgccaccgcgagctggagaaggaggtccgcgcgggccag gtggacatcgagcagctggatccccgcggccggactcccctgcacctggccaccacgctg gggcaccttgagtgtgcccgtgtgctcctggcgcacggcgcagacgtgggcagggagaat cgcagcggctggacagtgctccaggaggctgtgagtacccgggacctggagctggtgcag ctggtgcttcggtaccgggactaccagcgggtggtgaagcggctggcgggcatccccgtg ctcctggagaagctgcgcaaggcccaggacttctacgtggagatgaaatgggagttcact agctgggtgcccctggtgtccaagatctgccctagtgacacctacaaagtgtggaagagc ggccagaacctgagggtagacaccacactcctgggctttgaccacatgacctggcagcga gggaaccgcagctttgtcttcaggggccaagacacaagcgccgtggtcatggagattgac cacgaccgccgggtggtgtacacagagactctggcactggctgggcaggaccgggagctg ctgctggctgctgctcagcccactgaggaacaggtgctgagccggcttaccgcgcccgtc gtcaccactcagcttgacaccaagaatatctcctttgagaggaacaagactggcatcctg ggctggcgcagtgaaaagacggagatggtgaatgggtatgaagctaaggtgtatggggca tctaacgtggagctcatcacccgcacacggacagaacatctttcagaacagcacaagggc aaggtcaaaggctgtaagacacctttgcagtccttcctgggaatcgctgagcagcacggg ggcccccaaaatgggaccctgatcactcagactctgagccaagccaaccccactgccatc actgcagaagaatacttcaaccccaactttgagctgggcaaccgtgatatgggccgcccc atggaactgaccaccaagacacagaagttcaaggccaagctgtggctgtgtgaggagcat cccctgtccctgtgtgagcaggtggcccccatcattgacctcatggccgtcagcaatgcg ctttttgccaagctccgggacttcatcaccctgcgtctgcctcctggcttcccagttaag attgaaatcccgatcttccacatcctcaacgcccgcatcaccttcgggaacctcaacggc tgcgacgaaccggtgccatcggtgcgaggcagccccagcagcgagacgccttccccaggc agcgactcctccagcgtcagcagctccagctccacgagtgaggccccccgcgagaacgcc tgcccctcggctctccccggggtggcctcctgccgcggctgcgagatctccccagcgttg ttcgaggccccgcgcggctacagcatgatgggcggccagcgggaggcggcgacccgggac gacgacgacgacctgctgcaattcgccatccagcagagcctgcttgaggcgggcagtgag tatgaccaggtcaccatctgggaggcgctaaccaacagcaagccaggcacccaccccatg tcctacgagggtcgccgacaggacaggagcgccccgcccacgccgcagcgccagcctgcg cccccggcgtcagtgcccagccctcggcccagctcagggccaggttccggcggccacgtg ttccggagctacgacgagcagctgcggctggcgatggaactgtcggcgcaggagcaggag gagaggcggcggcgcgcgcgccaggaggaggaggagctggagcgcatcctgaggctctca ctgaccgagcagtag >gi568815581r:29474705_29689378|GENSCAN_predicted_peptide_6|1954_aa AVRLESTYQNRTRYMVVVSTNGRQDTEESIVLGMDFSSNDRSALQSLHKACEVARAHNYY PGSLFLTWVSYYESHINSDQSSVNEWNAMQDVQSHRPDSPALFTDIPTERERTERLIKTK LREIMMQKDLENITSKEIRTELEMQMVCNLREFKEFIDNEMIVILGQMDSPTQIFEHVFL GSEWNASNLEDLQNRGVRYILNVTREIDNFFPGVFEYHNIRVYDEEATDLLAYWNDTYKF ISKAKKHGSKCLVHCKMGVSRSASTVIAYAMKEYGWNLDRAYDYVKERRTVTKPNPSFMR QLEEYQGILLASKQRHNKLWRSHSDSDLSDHHEPICKPGLELNKKDITTSADQIAEVKTM ESHPPIPPVFVEHMVPQDANQKGLCTKERMICLEFTSREFHAGQIEDELNLNDINGCSSG CCLNESKFPLDNCHASKALIQPGHVPEMANKFPDLTVEDLETDALKADMNVHLLPMEELT SPLKDPPMSPDPESPSPQPSCQTEISDFSTDRIDFFSALEKFVELSQETRSRSFSHSRME ELGGGRNESCRLSVVEVAPSKVTADDQRSSSLSNTPHASEESSMDEEQSKAISELVSPDI FMQSHSENAISVKEIVTEIESISQGVGQIQLKGDILPNPCHTPKKNSIHELLLERAQTPE NKPGHMEQDEDSCTAQPELAKDSGMCNPEGCLTTHSSIADLEEGEPAEGEQELQGSGMHP GAKWYPGSVRRATLEFEERLRQEQEHHGAAPTCTSLSTRKNSKNDSSVADLAPKGKSDEA PPEHSFVLKEPEMSKGKGKYSGSEAGSLSHSEQNATVPAPRVLEFDHLPDPQEGPGSDTG TQQEGVLKDLRTVIPYQESETQAVPLPLPKRVEIIEYTHIVTSPNHTGPGSEIATSEKSG EQGLRKVNMEKSVTVLCTLDENLNRTLDPNQVSLHPQVLPLPHSSSPEHNRPTDHPTSIL SSPEDRGSSLSTALETAAPFVSHTTHLLSASLDYLHPQTMVHLEGFTEQSSTTDEPSAEQ VSWEESQESPLSSGSEVPYKDSQLSSADLSLISKLGDNTGELQEKMDPLPVACRLPHSSS SENIKSLSHSPGVVKERAKEIESRVVFQAGLTKPSQMRRSASLAKLGYLDLCKDCLPERE PASCESPHLKLLQPFLRTDSGMHAMEDQESLENPGAPHNPEPTKSFVEQLTTTECIVQSK PVERPLVQYAKEFGSSQQYLLPRAGLELTSSEGGLPVLQTQGLQPEASNGLKVSQKSEGS QPSSSRSPLEFLKEAESRRIGQSAELDTRVPDTTDTRRGGSFALKPSSAPYALGPLPQRI RSVSAPQPAQDQMRVRYPVVAAVLAPYLALSQDPMVKSSASGQGASGSYNHVREEMLIKA GGAMSRRVVRQSKFRHVFGQAAKADQAYEDIRVSKVTWDSSFCAVNPKFLAIIVEAGGGG AFIVLPLAKTGRVDKNYPLVTGHTAPVLDIDWCPHNDNVIASASDDTTIMVWQIPDYTPM RNITEPIITLEGHSKRVGILSWHPTARNVLLSAGGDNVIIIWNVGTGEVLLSLDDMHPDV IHSVCWNSNGSLLATTCKDKTLRIIDPRKGQVVAEQARPHEGARPLRAVFTADGKLLSTG FSRMSERQLALWDPERFAAHEGMRPMRAVFTRQGHIFTTGFTRMSQRELGLWDPNNFEEP VALQEMDTSNGVLLPFYDPDSSIVYLCGKVLTAGQGEQGTGWRCGGPCPGAPLNRLILQG DSSIRYFEITDEPPFVHYLNTFSSKEPQRGMGFMPKRGLDVSKCEIARFYKLHERKCEPI IMTVPRKSDLFQDDLYPDTPGPEPALEADEWLSGQDAEPVLISLRDGYVPPKHRELRVTK RNILDVRPPSGPRRSQSASDAPLSVRSALLHSGPIYISPNRPYTTRPLCPPPQQQHTLET LLEEIKALRERVQAQEQRITALENMLCELVDGTD >gi568815581r:29474705_29689378|GENSCAN_predicted_CDS_6|5865_bp gctgtaagactggaaagtacttaccagaatcgaacacgctatatggtagtggtttcaact aatggtagacaagacactgaagaaagcatcgtcctaggaatggatttctcctctaatgac aggtctgcactacagagcttacacaaggcttgtgaagtcgccagagcgcataactactac ccaggcagcctatttctcacttgggtgagttattatgagagccatatcaactcagatcaa tcctcagtcaatgaatggaatgcaatgcaagatgtacagtcccaccggcccgactctcca gctctcttcaccgacatacctactgaacgtgaacgaacagaaaggctaattaaaaccaaa ttaagggagatcatgatgcagaaggatttggagaatattacatccaaagagataagaaca gagttggaaatgcaaatggtgtgcaacttgcgggaattcaaggaatttatagacaatgaa atgatagtgatccttggtcaaatggatagccctacacagatatttgagcatgtgttcctg ggctcagaatggaatgcctccaacttagaggacttacagaaccgaggggtacggtatatc ttgaatgtcactcgagagatagataacttcttcccaggagtctttgagtatcataacatt cgggtatatgatgaagaggcaacggatctcctggcgtactggaatgacacttacaaattc atctctaaagcaaagaaacatggatctaaatgccttgtgcactgcaaaatgggggtgagt cgctcagcctccaccgtgattgcctatgcaatgaaggaatatggctggaatctggaccga gcctatgactatgtgaaagaaagacgaacggtaaccaagcccaacccaagcttcatgaga caactggaagagtatcaggggatcttgctggcaagcaaacagcggcataacaaactatgg agatctcattcagatagtgacctctcagaccaccacgaacccatctgcaaacctgggcta gaactcaacaagaaggatatcaccacctcagcagaccagattgctgaggtgaagaccatg gagagtcacccacccatacctcctgtctttgtggaacatatggtcccacaagatgcaaat cagaaaggcctgtgtaccaaagaaagaatgatctgcttggagtttacttctagggaattt catgctggacagattgaggatgaattaaacttaaatgacatcaatggatgctcatcaggg tgttgtctgaatgaatcaaaatttcctcttgacaattgccatgcatccaaagccttaatt cagcctggacatgtcccagaaatggccaacaagtttccagacttaacagtggaagatttg gagacagatgcactgaaagcagacatgaatgtccacctactgcctatggaagaattgaca tctccactgaaagacccccccatgtcccctgatcctgagtcaccaagcccccaacccagt tgccagactgaaatctcagatttcagtacagatcgcattgacttttttagtgccctagag aagtttgtggagctctcccaagaaacccggtcacgatctttttcccattcaaggatggag gaactgggtggaggaaggaatgagagctgtcgactgtcagtggtagaagtagccccttcc aaagtgacagctgatgaccagagaagcagctctttgagtaatactccccatgcatcagaa gaatcttcaatggatgaggaacagtcaaaggcaatttcagaactggtcagcccagacatc ttcatgcagtctcactcggaaaatgcaatttcagtcaaagaaattgtcactgaaattgag tccatcagtcaaggagttgggcagattcaactgaaaggagacatcttacccaacccatgc catacaccaaagaagaacagcatccatgagctgctccttgagagggcccagactccagag aacaaacctggacatatggagcaagatgaggactcctgcacagcccagcctgaactagcc aaagactcagggatgtgcaacccagaaggctgcctaaccacacactcatctatagcagac ttggaagaaggggaaccagctgagggggaacaagagctccagggctcagggatgcaccca ggtgccaagtggtaccctgggtctgtgaggcgagccaccttggagttcgaagagcgctta cggcaggagcaagagcatcatggtgctgccccaacatgtacctcattgtccactcgtaag aattcaaagaatgattcttctgtggcagacctagcaccaaaagggaaaagtgatgaagcc cccccagaacattcatttgtcctcaaggaaccagaaatgagcaaaggcaaagggaaatac agtgggtctgaggctggctcactgtcccattctgagcagaatgccactgttccagctccc agggtgctggagtttgaccacttgccagatcctcaggagggcccagggtcagatactgga acacagcaggaaggagtcctgaaggatctgaggactgtgattccataccaggagtctgaa acacaagcagtccctcttccccttcccaagagggtagaaatcattgaatatacccacata gttacatcacccaatcacactgggccagggagtgaaatagccaccagtgagaagagcgga gagcaagggctgaggaaagtgaacatggaaaaatctgtcactgtgctctgcacactggat gaaaatctaaacaggactctggaccccaaccaggtttctctgcacccccaagtgctacct ctgcctcattcttcctcccctgagcacaacagacccactgaccatccaacctccatcctg agtagccctgaagacagaggcagcagcctgtccacagccctggagacagcagcacctttt gtcagtcatacaacccatttactgtctgccagtttggattacctgcatccccagactatg gttcacctggagggcttcacagagcagagcagcactacagatgagccctctgcagaacag gttagctgggaagaaagtcaggagagccctctctccagtggcagtgaggtgccatataag gactcccagctaagtagcgcagacctaagtttaattagcaaacttggtgacaacactggg gagttacaggagaaaatggacccattgcctgtagcctgtcgactcccacatagctctagt agtgaaaacataaagagtctcagccacagccccggtgtggtgaaggagcgtgctaaagaa atcgagtctcgagtggttttccaggcagggctcaccaaaccatcccaaatgaggcgctca gcttctctcgccaaattaggttacttggacctctgtaaagactgcttaccagagagggag cctgcctcctgtgaatcccctcatctcaaactgcttcagcctttcctcagaacagactca ggcatgcacgcgatggaggaccaagagtccctagaaaacccaggtgccccccacaaccca gagcccaccaagtcttttgtagaacaactcacaacaacagagtgtattgtgcagagcaag ccagtggagaggccccttgtgcagtatgccaaagaatttggttctagtcagcagtatttg ctccccagggcaggacttgaattgactagttctgaaggaggccttcccgtgctacagacc cagggactgcaacctgaggcatcgaatgggctaaaggtcagccagaagtcagagggctct cagccttcatccagccgcagccctttagagttcctgaaggaggcagagagccgcaggatc ggccagagtgcggagctggacacccgggtcccagatactacagacacccggagaggtggc tccttcgccctgaagccttcctcggccccctacgcactcgggccccttccgcagaggatt cgcagcgtgagcgccccgcagcccgctcaggaccagatgcgagttcggtatcctgtggtg gctgcagtcttggccccatacctggctttaagccaagatccaatggtcaagtcttctgct tctggacagggtgcctctgggagctacaaccacgtccgtgaagagatgctcatcaaggct ggcggtgctatgagcagacgtgtggttcggcaaagcaagttccgccatgtgtttgggcag gcagcaaaggccgaccaggcctacgaggacatccgtgtgtccaaggtcacatgggacagc tccttctgtgccgtcaaccccaaattcctggccattattgtggaggctggaggcgggggt gccttcatcgtcctgcctctggccaagacagggcgagtggataagaactacccactggtc actgggcacactgcccctgtgctggatattgactggtgtccacacaatgacaacgttatc gccagtgcctcagacgacaccaccatcatggtgtggcagattccagactatacccccatg cgcaacattacggaacctatcatcacacttgagggccactccaagcgtgtgggcatcctc tcctggcaccctactgccaggaatgtcctgctcagtgcaggtggtgacaatgtgatcatc atctggaatgtgggcaccggggaggtgctgctgagcctggatgatatgcacccagacgtc atccacagtgtgtgctggaacagcaacggtagcctgctagccaccacctgcaaggacaag accttgcgcatcattgaccccagaaaaggccaagtggtggcggagcaagcccggcctcac gagggcgcccgcccgctgcgggctgtcttcaccgcagacgggaagctgctcagcaccggc ttcagcaggatgagtgagcggcaactcgcgctctgggacccggagaggtttgcggcccac gaggggatgaggcccatgcgggccgtcttcacgcgccagggccatatcttcaccacgggc ttcacccgcatgagccagcgagagctgggcctgtgggacccgaacaacttcgaggagcca gtggcactgcaggagatggacacaagcaacggggtcctattgcccttttacgatcccgac tccagcatcgtctacctgtgtggcaaggtgctcacggccgggcagggagaacagggcact ggatggagatgtggagggccttgtccgggcgcgcccctgaaccgactgattttgcagggc gacagcagcattcggtactttgagattaccgacgagccgcctttcgtgcactacctgaac acgttcagcagcaaagagccgcagcggggcatgggtttcatgcccaaaaggggactggat gtcagcaagtgtgagatcgcccggttctacaagctacacgaaagaaagtgtgaacctatc atcatgactgtgccccgcaagtcagacctcttccaggacgatctgtacccggatacgcca ggcccggagccggccctagaagcggacgaatggctatccggccaggacgccgaacccgtg ctcatttcgctgagggacggctatgtgccccccaagcaccgcgagctccgggtcacgaag cgcaacatcctggacgtgcgcccgccctccggcccccgccgcagccagtcggccagcgac gcccccttgtcggtaagatcggccctgctgcactctggcccaatctacatctcccccaac cgcccctatacgacacggcctctctgtcctccgccccagcagcagcacaccctggagacg ctgctggaagagatcaaggccctccgcgagcgggtgcaggcccaggagcagcgcatcacg gctctggagaacatgctgtgcgagctggtggacggcacggactag