GENSCAN 1.0 Date run: 4-Nov-116 Time: 19:59:27 Sequence gi568815597r:29094025_29330906 : 236882 bp : 49.25% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 2131 2580 450 0 0 72 67 326 0.951 22.40 1.02 Intr + 3783 3911 129 2 0 84 91 160 0.997 16.79 1.03 Intr + 11990 12121 132 1 0 100 91 -22 0.520 0.14 1.04 Intr + 15312 15413 102 2 0 99 87 63 0.992 7.67 1.05 Intr + 18344 18424 81 1 0 103 75 70 0.991 7.03 1.06 Term + 21675 21773 99 2 0 117 49 219 0.988 19.03 1.07 PlyA + 22474 22479 6 -0.45 2.03 PlyA - 23108 23103 6 1.05 2.02 Term - 27824 26881 944 0 2 118 49 865 0.852 77.86 2.01 Init - 30075 29832 244 0 1 68 99 78 0.719 3.57 2.00 Prom - 43047 43008 40 -4.26 3.09 PlyA - 43414 43409 6 1.05 3.08 Term - 55202 54386 817 1 1 78 36 501 0.808 36.51 3.07 Intr - 56168 56079 90 1 0 95 91 1 0.523 0.21 3.06 Intr - 60886 60672 215 1 2 110 79 123 0.944 11.11 3.05 Intr - 65462 65350 113 0 2 55 42 60 0.829 -1.80 3.04 Intr - 66493 66351 143 0 2 101 66 163 0.939 15.40 3.03 Intr - 87769 87622 148 2 1 52 76 307 0.067 25.19 3.02 Intr - 97586 97469 118 2 1 117 94 15 0.585 5.04 3.01 Init - 98196 98188 9 0 0 93 77 0 0.547 -0.21 3.00 Prom - 98472 98433 40 -10.15 4.13 PlyA - 98802 98797 6 1.05 4.12 Term - 100155 99998 158 1 2 109 43 169 0.993 12.60 4.11 Intr - 101989 101917 73 1 1 110 109 24 0.966 5.68 4.10 Intr - 102234 102174 61 2 1 84 91 50 0.970 3.64 4.09 Intr - 104135 103938 198 1 0 73 72 76 0.872 2.97 4.08 Intr - 106565 106492 74 2 2 123 94 71 0.985 9.40 4.07 Intr - 108021 107919 103 2 1 71 77 75 0.995 4.88 4.06 Intr - 109209 109107 103 1 1 78 96 158 0.999 14.83 4.05 Intr - 112881 112738 144 1 0 84 91 141 0.997 14.25 4.04 Intr - 122112 121981 132 1 0 118 95 114 0.971 15.72 4.03 Intr - 122661 122564 98 2 2 52 86 141 0.992 9.95 4.02 Intr - 130146 130075 72 2 0 57 110 26 0.421 0.22 4.01 Init - 136882 136707 176 1 2 75 54 198 0.815 11.92 4.00 Prom - 140418 140379 40 -4.66 5.03 PlyA - 141012 141007 6 1.05 5.02 Term - 143739 142369 1371 0 0 70 41 317 0.940 16.56 5.01 Init - 144682 144470 213 1 0 81 78 115 0.332 6.55 5.00 Prom - 147368 147329 40 -5.26 6.00 Prom + 152973 153012 40 -7.86 6.01 Init + 154087 154111 25 0 1 85 73 11 0.170 -0.88 6.02 Intr + 161251 161382 132 2 0 64 127 244 0.906 26.52 6.03 Intr + 164481 164752 272 1 2 137 80 287 0.807 30.06 6.04 Intr + 165237 165318 82 2 1 81 101 108 0.998 10.61 6.05 Intr + 165425 165540 116 0 2 63 63 145 0.999 9.67 6.06 Intr + 165846 166116 271 2 1 135 23 404 0.980 35.71 6.07 Intr + 166586 166879 294 0 0 111 94 561 0.999 55.98 6.08 Intr + 181424 181732 309 0 0 96 95 540 0.858 51.78 6.09 Intr + 184988 185097 110 0 2 111 90 127 0.741 15.20 6.10 Intr + 185432 185633 202 1 1 90 93 263 0.991 25.86 6.11 Intr + 186015 186117 103 1 1 94 75 141 0.946 12.63 6.12 Intr + 188652 188925 274 0 1 91 60 386 0.965 33.64 6.13 Intr + 189916 189952 37 0 1 95 115 50 0.969 6.24 6.14 Intr + 190707 190845 139 1 1 122 96 181 0.999 21.82 6.15 Intr + 197845 198002 158 1 2 98 84 382 0.995 38.45 6.16 Intr + 205198 205305 108 2 0 67 86 30 0.563 0.96 6.17 Intr + 209831 210021 191 0 2 76 40 209 0.981 14.20 6.18 Intr + 210750 210825 76 2 1 114 76 75 0.993 7.99 6.19 Intr + 211328 211404 77 0 2 119 76 140 0.940 15.13 6.20 Intr + 216720 216756 37 2 1 121 77 -5 0.901 -0.46 6.21 Intr + 217432 217529 98 2 2 85 98 135 0.953 13.93 6.22 Intr + 217619 217735 117 1 0 63 100 167 0.995 16.06 6.23 Intr + 218528 218682 155 1 2 107 75 184 0.999 17.87 6.24 Intr + 221348 221483 136 2 1 89 64 212 0.875 19.47 6.25 Intr + 221978 222127 150 1 0 107 75 204 0.826 21.36 6.26 Intr + 223724 223897 174 1 0 91 49 361 0.933 32.64 6.27 Intr + 226661 226801 141 1 0 93 68 280 0.998 27.05 6.28 Intr + 229347 229472 126 2 0 74 100 180 0.966 18.68 6.29 Intr + 229613 229764 152 1 2 68 76 246 0.525 20.36 6.30 Intr + 231167 231302 136 2 1 81 77 189 0.813 17.67 6.31 Term + 231575 231637 63 1 0 140 44 78 0.999 6.49 6.32 PlyA + 232753 232758 6 1.05 7.04 PlyA - 232892 232887 6 -5.12 7.03 Term - 233099 232932 168 2 0 58 48 149 0.922 5.78 7.02 Intr - 234307 234179 129 1 0 71 24 73 0.486 0.09 7.01 Init - 236291 236226 66 2 0 109 85 38 0.918 6.73 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:29094025_29330906|GENSCAN_predicted_peptide_1|330_aa SINGIRTEEVAVVTKGPSTNPDSEWEGPKHSVVPSKSQMTTSSESLQSFAFGSLSISSKE TEEKEEGAAGYLDIKEMPRGPTGGCIGVEEQASALKFSVTPASCQLQPGVKKAESSEEHV TPGEPPGKQNGSFLDFHVGNQFPTLIRSFQPPLVKTQTVTISDNANAVKSEIPTKDVPIV HTETKTITYEAAQCWDFRHEPPCLATNLVVGVSFILSKHLRVKLLGHRTNDQYEETLTDD NSGDLDPGVLLTAQTITSETPSSTTTTQITKTVKGGISETRIEKRIVITGDADIDHDQVL VQAIKEAKEQHPDMSVTKVVVHQETEIADE >gi568815597r:29094025_29330906|GENSCAN_predicted_CDS_1|993_bp agtataaatggcattcgcacagaggaggtggctgtcgtgacaaaggggccatctactaac cctgactctgaatgggagggtcccaagcattcggtagttcctagtaaaagccagatgacc acctcgtcggagtctctgcaaagctttgcctttggctccctctccataagcagcaaggag acagaagagaaggaggagggggcagctggctatcttgatattaaggagatgccaagaggc ccaactgggggatgtataggagtggaggaacaggccagtgccttaaagttctcagtaaca ccagcttcctgtcagctgcaacctggtgtaaaaaaggcagagagtagtgaagaacatgtt acaccaggagagccacctggaaaacaaaatggatcatttcttgactttcatgtgggtaac cagttccccaccctcattcgaagtttccagcctcccctggtgaagacacaaactgtcacc atctcagataatgccaatgctgtgaaaagtgaaatcccaaccaaagacgtccctattgtc cacactgagaccaagaccatcacttatgaggctgcccagtgctgggatttcaggcatgag ccaccgtgcctggccacaaatcttgttgtgggcgtttcatttatcttgagtaaacatcta agagtcaaattgctgggtcataggacaaatgaccagtatgaggaaacattgactgacgac aacagtggagacttggacccaggagtcttgctgacagctcaaactatcacatctgagacc ccaagcagcaccaccacaactcaaattaccaagactgtaaaaggtgggatttcagagaca cgtattgaaaagagaattgtgatcacaggagatgctgatattgaccatgatcaggtcctt gtacaagccatcaaggaggcaaaggagcagcacccagacatgtcagtgaccaaggtggtc gtccaccaggagaccgagattgctgatgagtga >gi568815597r:29094025_29330906|GENSCAN_predicted_peptide_2|395_aa MAELGPRVPGVTRRAQGAYLRSFDLSCRLSAAVGPGPARVGRSRGLEAGPGRGWGRAVET RAALEPPRADWDPDPRGAAAPERPDDGEMTAGSPEECGEVRRSPEGRVSRLGRRLGRRRR PRSPPEPLRVRARLRLRSPSGAFAALGALVVLVGMGIAVAGYWPHRAGAPGSRAANASSP QMSELRREGRGGGRAHGPHERLRLLGPVIMGVGLFVFICANTLLYENRDLETRRLRQGVL RAQALRPPDGPGWDCALLPSPGPRSPRAVGCAEPEIWDPSPRRGTSPVPSVRSLRSEPAN PRLGLPALLNSYPLKGPGLPPPWGPRTQTGHVIITVQPSGSCIEHSKSLDLGLGELLLGA PAARDCAHRSWPRLDRLSLGGYAKLGGGGDLGARV >gi568815597r:29094025_29330906|GENSCAN_predicted_CDS_2|1188_bp atggcagaactcgggccacgggtgcccggagtcaccaggcgcgcacagggcgcctacctg cgatcctttgacctgagctgccgcctttcagcggcggtggggccgggcccggcgcgggtc ggacggtcccgggggctggaggcggggccggggcggggctggggccgggccgtggagacc cgggcggctctggagcctccgcgcgcggactgggacccggacccgcgcggcgctgcggcg ccagagcgcccagacgacggcgagatgacggccgggagccccgaagaatgcggggaggtg cggaggagccccgagggccgcgtctctcgcttgggccgccgcctgggccgccgccggcgc ccgcgctccccgcccgagcctctgcgggtgcgggcgcggctgcggctgcgctcgccgtcg ggggcgttcgcggcgctgggggcgctcgtggtactggtgggtatgggcattgcagtggcc ggctactggccgcaccgggccggggccccagggtcccgggccgccaatgccagctcgccc cagatgagcgagctgcgacgcgagggtcgcggcgggggccgggctcacggcccgcacgag cggctgcggctcctcgggccggtgatcatgggcgtcggcctgttcgtgttcatctgcgcc aacacactgctgtatgagaaccgagacttggagacgcgacggctccgccagggggtgctg cgggcccaggcgctccggccccccgacggcccgggctgggactgcgccctccttcccagc cccggccctaggagtccccgagccgtaggctgcgcagagccagaaatctgggacccgtcc ccgcgtcggggtacttcacccgtcccgtcagtgcggagtctgcgttcagagcccgctaat cctcgcttggggttacctgccttgctcaacagctacccgctgaagggccccgggctgccc ccaccctggggtccacggacgcagactggccatgtgatcatcaccgtgcagccgtctggc tcctgcattgaacattccaagtctctggatctgggccttggggagctcctccttggggcc ccagcagctcgggactgtgctcaccgaagctggccacggctggaccgcctcagtcttggg ggctatgccaaattgggaggaggaggggacttgggggcccgggtctga >gi568815597r:29094025_29330906|GENSCAN_predicted_peptide_3|550_aa MGEVSGTSDCTDDQCRQVKKALEGGKAARGHRSKIKIRFFRPGGLGPGPAITAVAGMPRV YIGRLSYQARERDVERFFKGYGKILEVDLKNGYGFVEFDDLRDADDAVYELNGKDLCGER VIVEHARGPRRDGSYGSGRSGYGYRRSGRDKYGPPTRTEYRLIVENLSSRCSWQDLKDYM RQAGEVTYADAHKGRKNEGVIEFVSYSDMKRALEKLDGTEVNGRKIRLVEDKPGSRRRRS YSRSRSHSRSRSRSRHSRKSRSRSGSSKSSHSKSRSRSRSGSRSRSKSRSRSQSRSRSKK EKSRSPSKEKSRSRSHSAGKSRSKSKDQAEEKIQNNDNVGKPKSRSPSRHKSKSKSRSRS QERRVEEEKRGSVSRGRSQEKSLRQSRSRSRSKGGSRSRSRSRSKSKDKRKGRKRSREES RSRSRSRSKSERSRKRGSKRDSKAGSSKKKKKEDTDRSQSRSPSRSVSKEREHAKSESSQ REGRGESENAGTNQETRSRSRSNSKSKPNLPSESRSRSKSASKTRSRSKSRSRSASRSPS RSRSRSHSRS >gi568815597r:29094025_29330906|GENSCAN_predicted_CDS_3|1653_bp atgggtgaggttagtgggacaagtgattgtacagatgatcagtgcaggcaagtgaaaaaa gccttagaaggagggaaggctgccagaggccacaggagtaaaattaagattaggttcttt aggccaggggggctggggccggggccagccatcactgccgttgccgggatgccgcgggtg tacatcggccgcctgagctaccaggcccgggagcgcgatgtggagcgcttctttaagggc tacgggaagatcctggaggtggatctgaagaacggatatggttttgtggagtttgatgat ctgcgtgatgcagatgatgctgtttatgaactgaatggcaaagacctttgtggtgagcga gtaattgttgagcatgcccgcggcccacggcgagatggcagttacggttctggacgcagt ggatatggttatagaagaagtggccgagataaatatggccctcctactcgcacagagtac agacttattgtggagaatttgtcaagtcggtgcagctggcaagacctaaaggattatatg cgtcaggcaggagaagtgacttatgcagatgctcacaagggacgcaaaaatgaaggggtg attgaatttgtatcttattctgatatgaaaagagctttggaaaagttggatggaactgaa gtcaatgggagaaaaatcagattagttgaagacaagccaggttccagacgacgccggtcc tactccagaagccggagtcattcaaggtctcgctctcgaagcagacattcccgtaagagc agaagccgaagtggcagcagcaaaagcagtcattctaagagtagatctcggtccaggtcg ggctcccgctcccggagcaagagccggagccggagccagagtcggagccggagcaagaaa gagaaaagcaggagccccagcaaggaaaagagccgcagccgcagccatagcgctggcaag agccgcagcaagagcaaagaccaagctgaagagaagatccaaaacaatgacaatgtcggg aaacccaagagccggagtcctagcaggcataaaagtaagagcaaaagtcggagcaggagt caggagaggagagtggaggaggagaagcgagggagtgtgagcaggggcaggagccaggag aagagcctccgccagagtcggagccggagcaggagcaaagggggcagcaggagccggagc aggagccgcagcaagagcaaggacaagaggaagggcaggaagagaagcagagaggagagc cgcagtcgcagtcgcagccgcagcaagagtgagaggagcagaaagcgaggcagcaagcga gacagcaaggcgggcagcagcaagaagaagaagaaggaagacactgaccgctcccagtcc agatctccatcccgctccgtgtcaaaggagcgggaacatgccaagtctgaatccagccag agggaaggtcgaggagagagtgagaatgctggcaccaatcaggagacccggtccaggtcg agatccaattccaaatcgaaaccaaaccttccatcagaatcacgctccagatcaaagtca gcttcaaaaacccgatctcggtccaagtctagatccaggtctgcttccagatcgccctcc cgatctagatctaggtcccactcaaggtcctaa >gi568815597r:29094025_29330906|GENSCAN_predicted_peptide_4|463_aa MWVCSTLWRVRTPARQWRGLLPASGCHGPAASSYSASAEPARVRALVYGHHGDPAKVVDG VSLVSADVAYHGACRLSGPLSLGLKNLELAAVRGSDVRVKMLAAPINPSDINMIQGNYGF LPELPAVGGNEGVAQVVAVGSNVTGLKPGDWVIPANAGLGTWRTEAVFSEEALIQVPSDI PLQSAATLGVNPCTAYRMLMDFEQLQPGDSVIQNASNSGVGQAVIQIAAALGLRTINVVR DRPDIQKLSDRLKSLGAEHVITEEELRRPEMKNFFKDMPQPRLALNCVGGKSSTELLRQL ASVAAFDSHGHVNPIVKHAGKGSRSHAPYENLMPENLRRNSFIPKPSPPLQSMEKLSSMK PVLVPTRRGGTMVTYGGMAKQPVVASVSLLIFKDLKLRGFWLSQWKKDHSPDQFKELILT LCDLIRRGQLTAPACSQVPLQDYQSALEASMKPFISSKQILTM >gi568815597r:29094025_29330906|GENSCAN_predicted_CDS_4|1392_bp atgtgggtctgcagtaccctgtggcgggtgcgaacccccgcccggcagtggcgggggctg ctcccagcttctggctgtcacggacctgccgcctcctcctactccgcatccgccgagcct gcccgggtccgggcgcttgtctatgggcaccacggggatccagccaaggtcgtcgatgga gtttccctggtgtcagcagatgtagcatatcatggggcctgcagactttctggtcctctt agtttaggactcaagaacctggagctagctgctgtgagaggatcagatgtccgtgtgaag atgctggcggcccctatcaatccatctgacataaatatgatccaaggaaactacggattc cttcctgaactgcctgctgttggagggaacgaaggtgttgcacaggtggtagcggtgggc agcaatgtgaccgggctgaagccaggagactgggtgattccagcaaatgctggtttagga acctggcggaccgaggctgtgttcagcgaggaagcactgatccaagttccgagtgacatc cctcttcagagcgctgccaccctgggtgtcaatccctgcacagcctacaggatgttgatg gacttcgagcaactgcagccaggggattctgtcatccagaatgcatccaacagcggagtg gggcaagcagtcatccagatcgccgcagccctgggcctaagaaccatcaatgtggtccga gacagacctgatatccagaagctgagtgacagactgaagagtctgggggctgagcatgtc atcacagaagaggagctaagaaggcccgaaatgaaaaacttctttaaggacatgccccag ccacggcttgctctcaactgtgttggtgggaaaagctccacagagctgctgcggcagtta gcatcagtggcagcattcgattctcatgggcatgtgaaccctattgtgaagcacgcaggg aagggatctaggtcccatgctccttatgagaatctaatgcctgagaatctgaggcggaac agtttcatcccaaaaccatctcccccactccagtccatggaaaaattgtcttccatgaaa ccggtcctggtgccaacaaggcgtggaggaaccatggtaacctatggggggatggccaag cagcccgtcgtagcctctgtgagcctgctcatttttaaggatctcaaacttcgaggcttt tggttgtcccagtggaagaaggatcacagtccagaccagttcaaggagctgatcctcaca ctgtgcgatctcatccgccgaggccagctcacagcccctgcctgctcccaggtcccgctg caggactaccagtctgccttggaagcctccatgaagcccttcatatcttcaaagcagatt ctcaccatgtga >gi568815597r:29094025_29330906|GENSCAN_predicted_peptide_5|527_aa MGEKLGRMGWEVALGVLSGTGRSKLKLCPMAPELQGEAQALASGRRERWSHRSSGPRHRS QQQGRERDAGEPRRRLGSAARQRRAELAKQQTVRHARGGGRGARSSPEPRALPDPGSPGG AAATTCSGPARHTRAHAHARTHVLPPPPRAAPGPGHTRSRGPGALHTRNIPGRAASTHGH IRTHSHPFQTRSHTRIPGTLGIHSDPRGPRQHTPHTLDTPHTYTAHGHTRSGPVPTPSHM YTDLKSTHATHAFPDPHSGGIQRAPLATGITQTHNPLADTKQTALPARHTKCKQMQHISR YNSRNTTKLRCTHNPLGHTNTTHWQTHKTLSAHPMPRCTQHSPPTYSPTHITFSCAQNSR THNIRSHATPRAHNRPSYAQTHTGSTLVTPGHSTLTLSSELRHGPRGSAGQARSGRRALT CRSLGLRRAELEGECQHERLGTGHGWSVAARPGARRGGRRPRSPSPARGEPEPEPERSGA ARSGTGAESRARAGASGALTSGPAAGSAVSRRSLLATAGAAAGPAAR >gi568815597r:29094025_29330906|GENSCAN_predicted_CDS_5|1584_bp atgggggagaagttggggaggatggggtgggaggtcgccttgggcgtcctttcgggaacg gggaggtcaaagctaaagctgtgccctatggccccggagctgcagggcgaggcccaagcg ctggccagtggccggcgcgaacgctggagccacaggagttccggcccgagacaccggagc cagcagcagggccgcgaacgcgatgccggggagccccggcgccgcctcggatcagccgcc cgccagcgccgcgcggaacttgcgaaacaacaaacagtccgacatgcccgaggcggcggc cgcggggcccggtcctcccctgagcctcgggccctgcccgacccgggcagcccagggggc gccgccgcgacaacttgttccggcccagcgcgccacacacgcgcacacgcacacgcgcgc acacacgtcctgccaccgccccctcgggcggccccgggaccagggcacactcgcagccgc ggcccaggcgcgctccacactcgcaacattcctggacgcgcagccagcacacacggacac attcgcacacactcacatccgttccagactcgctcccacacacgcattcccggcacactt ggcattcactcggacccacgcggcccccggcagcacacaccacataccctagacactccc cacacgtacacagcgcacggacacacacgttccggacccgttcccacgccttcacacatg tacacggatctcaaaagcacacacgccacacacgcgttcccggatccacactcaggcggg atccagagagcacccttggccacaggcataacacaaacacacaacccactcgcagacaca aaacaaacagcactcccggccaggcatacaaaatgcaaacaaatgcaacacattagcaga tacaactcccgcaacacaaccaaactcagatgcacacacaacccacttggccacacaaac actacacactggcagacccacaagacactcagcgcacacccaatgcctcgatgcacacaa cattcacccccaacatactcgccgacacacatcaccttttcttgcgcacagaactcacgg acacacaacattcgcagtcacgcaacaccccgggcacacaatcgcccatcctacgctcag acacacacagggagcacgctcgtaacgcccggtcactcaacactcacactttcttccgag ctacgccacgggccccgaggctcggcggggcaggctcggtccggccgccgcgcgcttacc tgccggagtctcggtctccggcgcgcagagctggaaggtgagtgccagcacgagcgcctg ggcacgggccatggttggagcgtcgccgcccgtcccggggcccggcgcgggggacgccgc ccccggagcccgagcccagcccgaggcgagccggagcccgagccggagcggagcggcgcg gcgcggagcgggactggcgccgagtccagagcgcgagccggagcaagcggggcgctgacg tcaggcccggccgcgggttcggcggtctcgcggcgctccctgctggccacggcgggtgct gcagcgggtccggccgcccgctag >gi568815597r:29094025_29330906|GENSCAN_predicted_peptide_6|1486_aa MVLELGVVAGCTFEEASDPAVPCEYSQAQYDDFQWEQVRIHPGTRAPADLPHGSYLMVNT SQHAPGQRAHVIFQSLSENDTHCVQFSYFLYSRDGHSPGTLGVYVRVNGGPLGSAVWNMT GSHGRQWHQAELAVSTFWPNEYQVLFEALISPDRRGYMGLDDILLLSYPCAKAPHFSRLG DVEVNAGQNASFQCMAAGRAAEAERFLLQRQSGALVPAAGVRHISHRRFLATFPLAAVSR AEQDLYRCVSQAPRGAGVSNFAELIVKGQLVDAGERRDLTLEGRGRRRGRALPGGVAVGE PPTPIAPPQLLRAGPTYLIIQLNTNSIIGDGPIVRKEIEYRMARGPWAEVHAVSLQTYKL WHLDPDTEYEISVLLTRPGDGGTGRPGPPLISRTKCAEPMRAPKGLAFAEIQARQLTLQW EPLGYNVTRCHTYTVSLCYHYTLGSSHNQTIRECVKTEQGVSRYTIKNLLPYRNVHVRLV LTNPEGRKEGKEVTFQTDEDVPSGIAAESLTFTPLEDMIFLKWEEPQEPNGLITQYEISY QSIESSDPAVNVPGPRRTISKLRNETYHVFSNLHPGTTYLFSVRARTGKGFGQAALTEIT TNISAPSFDYADMPSPLGESENTITVLLRPAQGRGAPISVYQVIVEEERARRLRREPGGQ DCFPVPLTFEAALARGLVHYFGAELAASSLPEAMPFTVGDNQTYRGFWNPPLEPRKAYLI YFQAASHLKGETRLNCIRIARKAACKESKRPLEVSQRSEEMGLILGICAGGLAVLILLLG AIIVIIRKGKPVNMTKATVNYRQEKTHMMSAVDRSFTDQSTLQEDERLGLSFMDTHGYST RALPIKHQLSSGYSQKMFHSHDGARKSLGESQSGDFEGDQRSGGVTEASSLLGGSPRRPC GRKGSPYHTGQLHPAVRVADLLQHINQMKTAEGYGFKQEYESFFEGWDATKKKDKVKGSR QEPMPAYDRHRVKLHPMLGDPNADYINANYIDGYHRSNHFIATQGPKPEMVYDFWRMVWQ EHCSSIVMITKLVEVGRVKCSRYWPEDSDTYGDIKIMLVKTETLAEYVVRTFALERRGYS ARHEVRQFHFTAWPEHGVPYHATGLLAFIRRVKASTPPDAGPIVIHCSAGTGRTGCYIVL DVMLDMAECEGVVDIYNCVKTLCSRRVNMIQTEEQYIFIHDAILEACLCGETTIPVSEFK ATYKEMIRIDPQSNSSQLREEFQTLNSVTPPLDVEECSIALLPRNRDKNRSMDVLPPDRC LPFLISTDGDSNNYINAALTDSYTRSAAFIVTLHPLQSTTPDFWRLVYDYGCTSIVMLNQ LNQSNSAWPCLQYWPEPGRQQYGLMEVEFMSGTADEDLVARVFRVQNISREGHLLVRHFQ FLRWSAYRDTPDSKKAFLHLLAEVDKWQAESGDGRTIVHCLNGGGRSGTFCACATVLEMI RCHNLVDVFFAAKTLRNYKPNMVETMDQYHFCYDVALEYLEGLESR >gi568815597r:29094025_29330906|GENSCAN_predicted_CDS_6|4461_bp atggtcctagagctgggtgtggttgctggctgcaccttcgaggaggcaagtgacccagca gtgccctgcgagtacagccaggcccagtacgatgacttccagtgggagcaagtgcgaatc caccctggcacccgggcacctgcggacctgccccacggctcctacttgatggtcaacact tcccagcatgccccaggccagcgagcccatgtcatcttccagagcctgagcgagaatgat acccactgtgtgcagttcagctacttcctgtacagccgggacgggcacagcccgggcacc ctgggcgtctacgtgcgcgttaatgggggccccctgggcagtgctgtgtggaatatgact ggatcccacggccgtcagtggcaccaggctgagctggctgtcagcactttctggcccaat gaatatcaggtgctgtttgaggccctcatctccccagaccgcaggggctacatgggccta gatgacatcctgcttctcagctacccctgcgcaaaggccccacacttctcccgcctgggc gacgtggaggtcaacgcgggccagaacgcgtcgttccagtgcatggccgcgggcagagcg gccgaggccgaacgcttcctcttgcaacggcagagcggggcgctggtgccggcggcgggc gtgcggcacatcagccaccggcgcttcctggccactttcccgctggctgccgtgagccgc gccgagcaggacctgtaccgctgtgtgtcccaggccccgcgcggcgcgggcgtctctaac ttcgcggagctcatcgtcaagggtcagctggtggacgccggggagcgccgggacctcacc ctcgaggggcggggccggcgacgggggcgggctctgcccgggggcgtggccgtgggggag cccccaactcccatcgcgcccccacagctgctgcgtgctggccccacctacctcatcatc cagctcaacaccaactccatcattggcgacgggccgatcgtgcgcaaggagattgagtac cgcatggcgcgcgggccctgggctgaggtgcacgccgtcagcctgcagacctacaagctg tggcacctcgaccccgacacagagtatgagatcagcgtgctgctcacgcgtcccggagac ggcggcactggccgccctgggccacccctcatcagccgcaccaaatgcgcagagcccatg agggcccccaaaggcctggcttttgctgagatccaggcccgtcagctgaccctgcagtgg gaaccactgggctacaacgtgacgcgttgccacacctatactgtgtcgctgtgctatcac tacaccctgggcagcagccacaaccagaccatccgagagtgtgtgaagacagagcaaggt gtcagccgctacaccatcaagaacctgctgccctatcggaacgttcacgtgaggcttgtc ctcactaaccctgaggggcgcaaagagggcaaggaggtcactttccagacggatgaggat gtgcccagtgggattgcagccgagtccctgaccttcactccactggaggacatgatcttc ctcaagtgggaggagccccaggagcccaatggtctcatcacccagtatgagatcagctac cagagcatcgagtcatcagacccggcagtgaacgtgccaggcccacgacgtaccatctcc aagctccgcaatgagacctaccatgtcttctccaacctgcacccaggcaccacctacctg ttctccgtgcgggcccgcacaggcaaaggcttcggccaggcggcactcactgagataacc actaacatctctgctcccagctttgattatgccgacatgccgtcacccctgggcgagtct gagaacaccatcaccgtgctgctgaggccggcacagggccgcggtgcgcccatcagtgtg taccaggtgattgtggaggaggagcgggcgcggaggctgcggcgggagccaggtggacag gactgcttcccagtgccattgaccttcgaggcggcgctggcccgaggcctggtgcactac ttcggggccgaactggcggccagcagtctacctgaggccatgccctttaccgtgggtgac aaccagacctaccgaggcttctggaacccaccacttgagcctaggaaggcctatctcatc tacttccaggcagcaagccacctgaagggggagacccggctgaattgcatccgcattgcc aggaaagctgcctgcaaggaaagcaagcggcccctggaggtgtcccagagatcggaggag atggggcttatcctgggcatctgtgcaggggggcttgctgtcctcatccttctcctgggt gccatcattgtcatcatccgcaaagggaagccggtgaacatgaccaaggccaccgtcaac taccgccaggagaagacacacatgatgagcgccgtggaccgcagcttcacagaccagagc accctgcaggaggacgagcggctgggcctgtccttcatggacacccatggctacagcacc cgggccctccccataaagcatcagctgtccagtggttactcacagaagatgttccacagt cacgatggtgctaggaagagcctgggagagtcccaaagtggagactttgaaggagaccag cgcagcggtggggtcactgaggccagcagcctcctggggggctccccgaggcgtccctgt ggccggaagggctccccataccacacggggcagctgcaccctgcggtgcgtgtcgcagac cttctgcagcacatcaaccagatgaagacggccgagggttacggcttcaagcaggagtat gagagcttctttgaaggctgggacgccacaaagaagaaagacaaggtcaagggcagccgg caggagccaatgcctgcctatgatcggcaccgagtgaaactgcacccgatgctgggagac cccaatgccgactacattaatgccaactacatagatggttaccacaggtcaaaccacttc atagccactcaagggccgaagcctgagatggtctatgacttctggcgtatggtgtggcag gagcactgttccagcatcgtcatgatcaccaagctggtcgaggtgggcagggtgaaatgc tcacggtactggccggaggactcagacacctacggggacatcaagattatgctggtgaag acagagaccctggctgagtatgtcgtgcgcacttttgccctggagcggagaggctactct gcccggcacgaggtccgccagttccacttcacagcgtggccagagcatggcgtcccctac catgccacggggctgctggctttcatccggcgcgtgaaggcctccaccccacctgatgcc gggcccattgtcatccactgcagcgcgggcaccggccgcacaggttgctatatcgtcctg gatgtgatgctggacatggcagagtgtgagggcgtcgtggacatttacaactgtgtgaag actctctgctcccggcgtgtcaacatgatccagactgaggagcagtacatcttcattcat gatgcaatcctggaggcctgcctgtgtggggagaccaccatccctgtcagtgagttcaag gccacctacaaggagatgatccgcattgatcctcagagtaattcctcccagctgcgggaa gagttccagacgctgaactcggtcaccccgccgctggacgtggaggagtgcagcatcgcc ctgttgccccggaaccgcgacaagaaccgcagcatggacgtcctgccgcccgaccgctgc ctgcccttcctcatctccactgatggggactccaacaactacattaatgcagccctgact gacagctacacacggagtgcggccttcatcgtgaccctgcacccgctgcagagcaccacg cccgacttctggcggctggtctacgattacgggtgcacctccatcgtcatgctcaaccag ctgaaccagtccaactccgcctggccctgcctgcagtactggccagagccaggccggcag caatatggcctcatggaggtggagtttatgtcgggcacagctgatgaagacttagtggct cgagtcttccgggtgcagaacatctctcgggaggggcacctgctggtgcggcacttccag ttcctgcgctggtctgcataccgggacacacctgactccaagaaggccttcttgcacctg ctggctgaggtggacaagtggcaggccgagagtggggatgggcgcaccatcgtgcactgc ctaaacgggggaggacgcagcggcaccttctgcgcctgcgccacggtcctggagatgatc cgctgccacaacttggtggacgttttctttgctgccaaaaccctccggaactacaaaccc aacatggtggagaccatggatcagtaccacttttgctacgatgtggccctggagtacttg gaggggctggagtcaagatag >gi568815597r:29094025_29330906|GENSCAN_predicted_peptide_7|120_aa MPSATDTGGQDLHGPGLSLLSLGTSSMTSPLLGNGHKMAQKYSLPLRSQHRTEQRAEPHP DCSARKELNLIQGPQWSERQLCGPQGSRVDIPYLRNCFRGLDGDSNGQGPLEPTLKVEGP >gi568815597r:29094025_29330906|GENSCAN_predicted_CDS_7|363_bp atgcccagtgcaacagacacaggaggacaagacctacatggccctggcctctcgctgctc agtctgggcactagctccatgacaagccccttactgggcaatggacacaagatggcacag aaatactccctgcccttgaggagccagcaccgaacagagcaaagggccgagccgcatcca gactgctccgcacgcaaagagctgaacctgatccagggcccccagtggagtgagaggcag ctctgtggccctcagggatctcgagttgatataccctacctgaggaactgcttcagaggc ctggatggggacagcaatggccagggtcccttggaaccaactctgaaagtggaaggaccc tga