GENSCAN 1.0    Date run: 8-Nov-116    Time: 13:02:08

Sequence gi568815586r:120951_360052 : 239102 bp : 47.02% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   2568   2650   83  2  2   72   46    84 0.525   3.04
 1.02 Intr +   4683   4962  280  0  1  101   80   170 0.699  14.98
 1.03 Intr +  17317  18404 1088  0  2  138  103  1814 0.822 176.78
 1.04 Intr +  20174  20335  162  2  0   83   64   279 0.191  24.09
 1.05 Intr +  24196  24235   40  1  1   62   99   -13 0.000  -4.57
 1.06 Intr +  31108  31420  313  0  1   63   86   140 0.103   7.06
 1.07 Intr +  34040  34216  177  0  0  113   78    32 0.781   4.59
 1.08 Intr +  35880  35910   31  1  1   81   75    35 0.830  -1.31
 1.09 Intr +  36075  36197  123  0  0  117  110   267 0.999  31.50
 1.10 Intr +  36578  36752  175  2  1   89   64   468 0.602  44.44
 1.11 Intr +  40918  41115  198  0  0   75   79   318 0.678  29.15
 1.12 Intr +  42544  42669  126  0  0   91   89   197 0.989  20.98
 1.13 Intr +  44484  44583  100  2  1   79   84   137 0.956  12.08
 1.14 Intr +  44749  44940  192  2  0   81   56   335 0.517  29.06
 1.15 Intr +  48063  48155   93  1  0   69   99    64 0.846   5.54
 1.16 Intr +  48478  48639  162  2  0   65   76    48 0.607   1.25
 1.17 Intr +  49637  49722   86  0  2   44  101    99 0.936   6.34
 1.18 Term +  53649  54083  435  2  0   82   38   421 0.989  31.79
 1.19 PlyA +  57703  57708    6                               1.05

 2.41 PlyA -  60129  60124    6                              -3.24
 2.40 Term -  61207  61110   98  2  2   70   55    96 0.196   2.63
 2.39 Intr -  64810  64644  167  0  2   86   56    66 0.863   2.80
 2.38 Intr -  67008  66770  239  0  2   47  110    67 0.546   1.21
 2.37 Intr -  67305  67122  184  2  1   58   53   102 0.323   3.49
 2.36 Intr -  70261  70122  140  1  2  118    0    88 0.559   2.26
 2.35 Intr -  71698  71528  171  1  0  106  100   198 0.998  22.94
 2.34 Intr -  72427  72327  101  2  2   65   84   163 0.999  13.43
 2.33 Intr -  74377  74275  103  1  1  109  101    78 0.999  10.95
 2.32 Intr -  75311  75174  138  2  0   85   83   186 0.999  18.46
 2.31 Intr -  75932  75820  113  0  2   93   98    61 0.872   7.70
 2.30 Intr -  76551  76427  125  2  2  110   96   140 0.999  17.23
 2.29 Intr -  77053  76950  104  1  2   76   52   136 0.996   7.77
 2.28 Intr -  77981  77847  135  2  0   97   91   129 0.955  14.86
 2.27 Intr -  79833  79701  133  2  1   89  100   141 0.971  16.05
 2.26 Intr -  80899  80812   88  2  1   85   33    26 0.849  -4.17
 2.25 Intr -  81930  81790  141  1  0   27   68   193 0.979  11.52
 2.24 Intr -  83748  83614  135  1  0  132   75   182 0.996  21.84
 2.23 Intr -  89093  88823  271  2  1   81   89   266 0.810  23.11
 2.22 Intr -  93630  93603   28  2  1   78   99    -5 0.368  -2.28
 2.21 Intr -  98261  98143  119  2  2   10   58   147 0.606   3.16
 2.20 Intr - 100596 100426  171  0  0  138   75   118 0.917  15.64
 2.19 Intr - 101682 101582  101  1  2   90   61   124 0.995   9.73
 2.18 Intr - 102284 102182  103  2  1   91  113   136 0.998  16.15
 2.17 Intr - 103179 103042  138  0  0   62   78   184 0.895  15.46
 2.16 Intr - 103563 103451  113  1  2   98   69    43 0.798   3.50
 2.15 Intr - 105564 105440  125  2  2  117  109   232 0.999  28.43
 2.14 Intr - 106718 106615  104  2  2  123   60   120 0.953  11.67
 2.13 Intr - 111744 111616  129  0  0   74   50    64 0.641   2.09
 2.12 Intr - 114274 114140  135  1  0   50  110   122 0.930  11.36
 2.11 Intr - 116340 116208  133  2  1   93   83   102 0.979  10.85
 2.10 Intr - 117060 116976   85  1  1   30   83   125 0.854   5.08
 2.09 Intr - 118415 118098  318  0  0   27   42   241 0.711   9.33
 2.08 Intr - 121828 121664  165  2  0   72   80   301 0.547  27.63
 2.07 Intr - 122863 122729  135  2  0   75   96    84 0.975   8.44
 2.06 Intr - 131979 131899   81  1  0  111   59    29 0.140   1.91
 2.05 Intr - 139107 138901  207  1  0   66   78   155 0.021  11.35
 2.04 Intr - 143744 143649   96  0  0   39  111    34 0.012   0.78
 2.03 Intr - 152586 152428  159  1  0   55   26   111 0.105   1.56
 2.02 Intr - 164712 164577  136  0  1  111   42    69 0.195   4.74
 2.01 Init - 172210 171809  402  1  0   80  113   430 0.992  41.33
 2.00 Prom - 175531 175492   40                              -7.66

 3.00 Prom + 175566 175605   40                              -6.26
 3.01 Init + 177635 178029  395  1  2   87   50   255 0.718  17.81
 3.02 Term + 179169 179760  592  0  1   56   48   185 0.666   5.30
 3.03 PlyA + 179876 179881    6                               1.05

 4.16 PlyA - 183276 183271    6                               1.05
 4.15 Term - 184541 184512   30  2  0  112   33     8 0.321  -4.45
 4.14 Intr - 186139 185996  144  1  0   68   63   236 0.480  19.58
 4.13 Intr - 187055 186462  594  2  0   89   18   216 0.404   7.47
 4.12 Intr - 189014 188853  162  2  0   77   83   205 0.999  19.07
 4.11 Intr - 190114 190013  102  1  0   62   69   115 0.784   7.37
 4.10 Intr - 192244 192106  139  0  1   89  110   111 0.741  13.87
 4.09 Intr - 197511 197156  356  0  2   88   61   260 0.884  17.39
 4.08 Intr - 200159 200045  115  1  1   45   44    85 0.597   0.25
 4.07 Intr - 201617 201467  151  0  1   53   97   148 0.803  11.42
 4.06 Intr - 202256 202132  125  1  2  116   32    65 0.685   3.93
 4.05 Intr - 202831 202650  182  1  2   22   36    35 0.220  -9.63
 4.04 Intr - 208079 207885  195  2  0   46  101   164 0.918  13.11
 4.03 Intr - 210988 210869  120  1  0   52   17   156 0.946   5.59
 4.02 Intr - 212699 212537  163  1  1   52   76    97 0.681   4.88
 4.01 Init - 213439 213291  149  1  2   60   84    55 0.436   1.86
 4.00 Prom - 220321 220282   40                              -5.26

 5.00 Prom + 220381 220420   40                              -1.56
 5.01 Sngl + 222919 223248  330  0  0   58   54   237 0.493  13.32
 5.02 PlyA + 223265 223270    6                              -0.45

 6.00 Prom + 223295 223334   40                              -8.06
 6.01 Init + 224733 225381  649  2  1   66   72   249 0.636  16.57
 6.02 Term + 225426 225655  230  1  2  -11   32   181 0.600  -0.61
 6.03 PlyA + 225787 225792    6                               1.05

 7.05 PlyA - 227522 227517    6                               1.05
 7.04 Term - 229829 229632  198  2  0  109   34   187 0.946  12.80
 7.03 Intr - 231374 231255  120  2  0  115   84   129 0.997  15.89
 7.02 Intr - 234299 234208   92  0  2   51   78    31 0.720  -1.89
 7.01 Intr - 235587 235482  106  0  1   68   93    72 0.705   5.49

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 20174 20339 166 2 1 83 48 290 0.808 21.89 S.002 Term - 25059 24793 267 0 0 11 48 221 0.847 5.89 S.003 Init + 53026 53112 87 0 0 84 96 87 0.841 7.83 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:120951_360052|GENSCAN_predicted_peptide_1|1287_aa MVDIEKGSSNEDFSRPGPASKHSFICQSDGSCTQAGGGMEDSVVAAAAVAAGRPSAHAPK AQAQELQEEEERPGAGAASPRAGPQHKASPGRQQPALATALCPHAPAASDYELSLDLKNK QIEMLEHKYGGHLVSRRAACTIQTAFRQYQLSKNFEKIRNSLLESRLPRRISLRKVRSPT AESLAAEKALMEGYGLVGLPLVRSPSLPPTFAGTLTELEDSFTEQVQSLAKSIDDALSTW SLKTMCSLRESGAYQLHQALQAAAGPPGLEAEGRAPESAGPGPGDDAAETPGLPPAHSGT LMMAFRDVTVQIANQNISVSSSTALSVANCLGAQTVQAPAEPAAGKAEQGETSGREAPEA PAVGREDASAEDSCAEAAASGAADGATAPKTEEEEEEEETAEVGRGAEAEAGDLEQLSSS STSTKSAKSGSEASASASKDALQAMILSLPRYHCENPASCKSPTLSTDTLRKRLYRIGLN LFNINPDKGIQFLISRGFIPDTPIGVAHFLLQRKGLSRQMIGEFLGNSKKQFNRDVLEEE ISLSFLISKMLTETVLLATYTRSRAQPCTRGHSVLVAERVNQPLPQGTKSTRKGKAGGQR YGPCCGLQVPRPALNLQGDHRAPVLTAHACTKRLSCVTVSTGFSLASHRSPPSHLGSLCT HRTTASWSPALDSVTVWLCCAQLLTPDGLGYRNTRMWPHRAFGTHVGLQRVQGGGFGFRK GYCGSCVVDEMDFSSMELDEALRKFQAHIRVQGEAQKVERLIEAFSQRYCMCNPEVVQQF HNPDTIFILAFAIILLNTDMYSPNIKPDRKMMLEDFIRNLRGEEGSDTLPPNPTTTPAHS TLLGVDDGADIPRELVVGIYERIQQKELKSNEDHVTYVTKVEKSIVGMKTVLSVPHRRLV CCSRLFEVTDVNKLQKQAAHQREVFLFNDLLVILKLCPKKKSSSTYTFCKSVGLLGMQFQ LFENESHWGPGVCFPDYSHGITLVTPLSGSEKKQVLHFCALGSDEMQKFVEDLKESIAEV TELEQIRIEWELEKQQGTKTLSFKPCGAQGDPQSKQGSPTARLSGRGEVKRSPTRNTLQP GSVRGQGDQKDLDLALLCTVPSLMSHKHFLTPGSGPLLLCQAGQKALRVITEQLSALTVP GTAVSIHNRLQTSQHNSGLGAERGAPVPPPDLQPSPPRQQTPPLPPPPPTPPGTLVQCQQ IVKVIVLDKPCLARMEPLLSQALSCYTSSSSDSCGSTPLGGPGSPVKVTHQPPLPPPPPP YNHPHQFCPPGSLLHGHRYSSGSRSLV >gi568815586r:120951_360052|GENSCAN_predicted_CDS_1|3864_bp atggtggacatagaaaaaggcagcagcaatgaagatttttcacggccagggccagccagc aagcactcctttatctgccagagtgatggctcctgcacccaggccggtgggggcatggag gactccgtggtggcagcggcggcggtggcagccggcagacccagtgcccatgccccgaag gctcaagcccaggagctgcaggaggaggaggagcggccgggggcaggggctgcctcccca 
agggctggcccccagcacaaggcctcccccggccggcagcagcctgccctggcgacggcg ctgtgcccccacgcccctgccgcctccgattacgaactctcccttgacctaaagaataaa cagattgaaatgctagaacataagtacggcggtcacctggtgtcccggcgcgccgcttgc accatccaaaccgccttccgccaataccagctcagcaagaacttcgagaaaatccgcaac tcgcttctggagagccgcctgccacggcggatctccctgcgcaaggtgcggtcacccacg gccgagagcctggcggccgagaaagcgctcatggagggctacggcctcgtggggctgccg ctggtgcgctcgccctccctgccgcccaccttcgcaggcaccctcaccgagctggaggac tccttcaccgagcaggtgcaatccctggccaagtccatcgacgacgcgctcagcacgtgg agcctcaagaccatgtgctccctgcgggagagtggcgcttaccagctccaccaggccctg caggcggccgcggggcccccaggcctggaggccgaggggcgggcgccggagagcgcgggc cccgggcccggggatgacgccgcggagacccccggcctgcccccggcccacagcgggacc ctcatgatggctttccgggacgtcacggtgcagatcgccaaccagaacatatccgtctcc tcctccacggctctgtcggtggccaactgcctgggcgctcagacggtccaggcccccgca gagcccgcggcgggcaaggccgagcagggcgagacctctgggcgggaggccccggaagcc cccgccgtgggccgggaggacgcgtcagccgaggactcatgcgcagaggctgcggctagt ggggcggcggatggggccacagcccccaaaacagaggaggaagaggaggaggaggagacg gcggaggtggggagaggggccgaggccgaggcaggcgacttggagcagctgagcagcagc agcacgtccaccaagtccgccaagtcaggctcggaggcgtcggcctccgcctccaaggac gccctgcaggccatgatcctgagcctgccgcgctaccactgcgagaacccagccagctgc aagtcgcccacgctctccaccgacaccctgcgcaagcggctctaccgcatcggcctcaac ctcttcaacataaaccccgacaagggcatccagttcctgatctcacgcggcttcatcccg gacacccccatcggtgtggcccatttcctcctccagcgaaagggcctcagccgccagatg attggagagttcctgggcaacagcaagaagcagttcaaccgcgacgtgctggaagaagaa atcagcctcagtttcctcattagtaaaatgctgacagagactgtgttgctggccacttac accaggtcccgagcacagccttgcacacggggccactcagtgctcgttgctgaaagagtc aaccagccactgccacagggcacaaaaagcaccagaaaggggaaggcaggaggtcagcga tatggaccctgctgtgggctccaggtgcccaggccagctctgaacctgcagggtgaccac agagcccccgtgctgactgcccacgcatgcaccaagaggctctcctgtgtaactgtgtcc acgggcttctccttggcctctcacaggtccccaccaagccatctgggctctctctgtact catcggaccacagcctcttggtcaccagccctggacagtgtcactgtgtggctctgttgt gcccagttgctgactcctgatgggctgggctacaggaacacacggatgtggccgcatagg gcctttggcactcacgtagggctgcaaagggtgcagggtggagggtttgggttccgaaag 
ggctactgtggcagctgcgtggtggacgagatggacttctccagcatggagctggacgag gccctgcgcaagttccaggcacacatccgtgtgcagggggaggctcagaaggtggagcgg ctcattgaggccttcagccagcgctactgcatgtgcaaccccgaagtggttcagcagttc cacaaccccgacaccatcttcatcctcgccttcgccatcatcctcctcaacaccgacatg tacagccccaacatcaagcctgaccggaagatgatgctggaggacttcatccgaaacctt cgaggtgaggagggctctgacaccctccctcccaaccccaccaccacccctgcccactcc acgttgttaggtgtggacgatggcgctgacatccccagggagctggtggtaggcatctat gagaggatacagcagaaggagctcaagtccaatgaggaccacgtcacgtacgtcaccaag gtggagaagtccattgtgggcatgaagacagtgctgtccgtgccccaccgccgcctggtg tgctgcagccggctcttcgaggtgacggatgtgaacaagctgcagaagcaggcagcgcat cagagggaggtgttcctcttcaatgacctgctggtgattctcaaactttgcccgaagaag aagagctcctccacgtacaccttttgcaagtcagttggcctgctgggcatgcagttccag ctctttgagaacgagtcccactgggggcctggggtctgctttccagattactctcatggc atcacactggtgaccccgctctcgggctccgagaagaagcaggtgctgcatttctgtgcc ctgggctcggacgagatgcagaagttcgtggaggacctgaaggagtccattgctgaggtg acggagctggagcagatccgaatagagtgggagctggagaagcagcagggaacaaagaca ctctccttcaagccctgcggagcccagggggacccacagtcaaagcaaggatcgccgaca gccaggctgtctggacgaggcgaggtcaagagaagccccacgaggaacactctccaacca ggcagtgtaagagggcagggtgaccagaaggatctggatcttgctcttctctgtacagtc ccttctctaatgagccacaagcacttcctgacccctggctcagggcccctgctgctgtgc caggccggccagaaggccctcagagtaattacagagcagttatcggctctcaccgtgccc ggcactgcggtgtcaattcacaacaggcttcaaacgtcccagcacaactccgggctgggg gccgagaggggagcgccggtgccgccgccagacctgcagcctagccccccgagacagcag accccaccactgccgccgccgccacccacgcccccgggcaccctggtgcagtgccagcaa attgtcaaggtcattgtcctggacaagccctgcctggcccgcatggagcccctgctgagc caggctctctcctgctacacctcgtcgtcctctgactcctgcggctccacacccctgggc ggtcccggctctccggtcaaggtcacccaccagcctccgctgcccccgcccccacccccc tacaaccaccctcaccagttctgtcccccaggctccctgctgcacgggcaccgctactcc agtggctcaaggagcctggtgtag >gi568815586r:120951_360052|GENSCAN_predicted_peptide_2|1922_aa MEEKPLKVKGKDSSEKKRKRKLEKVEQLFGEGKQKSKELKKMDKPRKKKLKLGADKSKEL NKLAKKLAKEEERKKKKEKAAAAKVELVKESTEKKREKKVLDIPSKYDWSGAEESDDENA VCAAQNCQRPCKDKVDWVQCDGGCDEWFHQVCVGVSPEMAENEDYICINCAKKQGPVSPG 
SHKAVITVVVKAVSYLKFEEESTSELMWLLAAFSSFWVVGIKALDFVWPLAREEKEIGDN KDLERNKIGVNMVTWQQLCKNKRVGGMDSRVSGTTSNGETKPVYPVMEKKEEDGTLERGH WNNKMEFVLSVAGEIIGLGNVWRFPYLCYKNGGGPVAPPCRCGLCPSWQAAKELNEAEEG GAFFIPYLVFLFTCGIPVFLLETALGQYTSQGGVTAWRKICPIFEAQFCSFPAGIGYASQ MIVILLNVYYIIVLAWALFYLFSSFTIDLPWGGCYHEWNTGNRVDVVEPVAEGNGVGMDP THPTCAGNRVDVVEPVAEGNGVGMDPTQPMCAGNRVDVVEHVAEGAGNRVDVVEHVAEGN GVGMDPTHPTCAGNRVDVVEQVAEGNEHCMEFQKTNGSLNGTSENATSPVIEFWERRVLK ISDGIQHLGALRWELALCLLLAWVICYFCIWKGVKSTGKVVYFTATFPYLMLVVLLIRGV TLPGAAQGIQFYLYPNLTRLWDPQTAWRYRLLLPHVHAGYEGSGQALEPNCRSTTRVLEG FQCTFLIVWMDAGTQIFFSFAICLGCLTALGSYNKYHNNCYRDCIALCFLNSGTSFVAGF AIFSILGFMSQEQGVPISEVAESGPGLAFIAYPRAVVMLPFSPLWACCFFFMVVLLGLDS QFVCVESLVTALVDMYPHVFRKKNRREVLILGVSVVSFLVGLIMLTEGGMYVFQLFDYYA ASGMCLLFVAIFESLCVAWVYGAKRFYDNIEDMIGYRPWPLIKYCWLFLTPAVCTATFLF SLIKYTPLTYNKKYTYPWWGDALGWLLALSSMVCIPAWSLYRLGTLKGPFREVIIPVSIR MTKVAGTAGLSGNVGPRCDLSRIEELPDGGTRFSVYSLESLALALVILPTQRGFPTHHTA MDGKVAVQECGPPAVSWVPEEGEKLDQEDEDQVKDRGQWTNKMEFVLSVAGEIIGLGNVW RFPYLCYKNGGGAFFIPYFIFFFVCGIPVFFLEVALGQYTSQGSVTAWRKICPLFQGIGL ASVVIESYLNVYYIIILAWALFYLFSSFTSELPWTTCNNFWNTEHCTDFLNHSGAGTVTP FENFTSPVMEFWERRVLGITSGIHDLGSLRWELALCLLLAWVICYFCIWKGVKSTGKVVY FTATFPYLMLVILLIRGVTLPGAYQGIIYYLKPDLFRLKDPQVWMDAGTQIFFSFAICQG CLTALGSYNKYHNNCYKDCIALCFLNSATSFVAGFVVFSILGFMSQEQGVPISEVAESGP GLAFIAFPKAVTMMPLSQLWSCLFFIMLIFLGLDSQFVCVECLVTASIDMFPRQLRKSGR RELLILTIAVMCYLIGLFLVTEGGMYIFQLFDYYASSGICLLFLSLFEVVCISWVYGADR FYDNIEDMIGYRPWPLVKISWLFLTPGLCLATFLFSLSKYTPLKYNNVYVYPPWGYSIGW FLALSSMVCVPLFVVITLLKTRGPFRKRLRQLITPDSSLPQPKQHPCLDGSAGRNFGPSP TREGLIAGEKETHLARNLQLAMPEPPSPSMGSCAARASPTSAAPCSRAPSPIDHPRAEKC GRTEPDWEAAPPVAPLKVCKHTNQHPVSSSGFVNAPIHTLYLASLVGTWRTFVSSSGVVN TPIKTLSKRTNQRSVKWTNRLSVKWTNQQDVGGASLATAGADGLSGLRNLLARDWEVTGG ECKLPDDATDREGSLLPAKAESLQARVHPQGSALDGASFLPGRLQTFICDTGGHVAHPEE GE >gi568815586r:120951_360052|GENSCAN_predicted_CDS_2|5769_bp atggaagagaaaccactaaaagtgaaaggaaaggactcttcagagaagaaacggaaacgg aagctagaaaaggtagagcaactttttggagaaggaaaacagaagtccaaggagttaaag 
aaaatggacaaacctagaaagaagaaattaaaattaggtgcagacaaatcaaaggagctg aataaactggccaagaaactagcaaaagaagaagagagaaagaaaaagaaggagaaggct gctgcagccaaagttgaacttgtgaaagagagcactgaaaagaaaagagagaaaaaggtg ctggacatcccctcaaagtatgactggtcaggagcagaggagtctgatgatgagaatgct gtgtgcgcagcacagaactgccaaaggccctgcaaggacaaggtagactgggtacaatgt gatggtggctgtgatgagtggtttcatcaagtttgtgtgggtgtatctccagaaatggct gaaaatgaagattacatctgtataaactgtgcaaagaagcaggggccagttagcccaggg tctcacaaggctgtcatcacggtggtggttaaggctgtgtcatatctgaagtttgaggaa gaatctacttctgagctcatgtggttgttggcagcctttagttccttttgggtcgtcgga ataaaggccttagattttgtttggccactggccagagaggaaaaagaaattggggataac aaagatttggagaggaataaaataggagtcaacatggtaacatggcagcagttatgtaaa aacaaaagagttggtgggatggatagcagggtctcaggcacaaccagtaatggagagaca aaaccagtgtatccagtcatggaaaagaaggaggaagatggcaccctggagcgggggcac tggaacaacaagatggagtttgtgctgtcagtggctggggagatcattggcttaggcaac gtctggaggtttccctatctctgctacaaaaatgggggagggcctgtagcccctccttgc aggtgtggactttgtccctcttggcaggctgccaaggagctcaacgaggcagaggaagga ggtgccttcttcatcccctacctcgtcttcctctttacctgtggcattcctgtcttcctt ctggagacagcactaggccagtacactagccagggaggcgtcacagcctggaggaagatc tgccccatctttgaggcccaattctgcagtttccccgcaggcattggctatgcctcccag atgatcgtcatcctcctcaacgtctactacatcattgtgttggcctgggccctgttctac ctcttcagcagcttcaccatcgacctgccctggggcggctgctaccatgagtggaacaca gggaaccgggtggacgtggtagagccggtggctgaggggaatggtgtgggaatggatcca acacaccccacgtgtgcagggaacagggtggacgtggtagagccggtggctgaagggaat ggtgtgggaatggatccaacacaacccatgtgtgcagggaacagggtggacgtggtagag cacgtggctgaaggggcagggaacagggtggacgtggtagagcatgtggctgagggaaat ggtgtgggaatggatccaacacaccccacgtgtgcagggaacagggtggacgtggtagag caggtggctgaggggaatgaacactgtatggagttccagaagaccaacggctccctgaat ggtacctctgagaatgccacctctcctgtcatcgagttctgggagcggcgggtcttgaag atctctgatgggatccagcacctgggggccctgcgctgggagctggctctgtgcctcctg ctggcctgggtcatctgctacttctgcatctggaagggggtgaagtccacaggcaaggtg gtgtacttcacggccacatttccttacctcatgctggtggtcctgttaattcgaggggtg acgttgcctggggcagcccaaggaattcagttttacctgtacccaaacctcacgcgtctg 
tgggatccccagacagcatggagatacagattgctactgcctcatgtccacgcagggtac gagggcagcggtcaggctttggagccaaactgtagaagcacaaccagggtcctagaagga ttccagtgcacgttcctcattgtgtggatggatgcaggcacccagatattcttctccttc gccatctgtcttgggtgcctgacagccctgggcagctacaacaagtaccacaacaactgc tacagggactgcatcgccctctgcttcctcaacagcggcaccagctttgtggccggcttt gccatcttctccatcctgggcttcatgtctcaggagcagggggtgcccatttctgaggtg gccgagtcaggccctggcctggctttcatcgcttacccgcgggctgtggtgatgctgccc ttctctcctctctgggcctgctgtttcttcttcatggtcgttctcctgggactggatagc cagtttgtgtgtgtagaaagcctggtgacagcgctggtggacatgtaccctcacgtgttc cgcaagaagaaccggagggaagtcctcatccttggagtatctgtcgtctccttccttgtg gggctgatcatgctcacagagggcggaatgtacgtgttccagctctttgactactatgcg gccagtggcatgtgcctcctgttcgtggccatcttcgagtccctctgtgtggcttgggtt tacggagccaagcgcttctacgacaacatcgaagacatgattgggtacaggccatggcct cttatcaaatactgttggctcttcctcacaccagctgtgtgcacagccacctttctcttc tccctgataaagtacactccgctgacctacaacaagaagtacacgtacccgtggtggggc gatgccctgggctggctcctggctctgtcctccatggtctgcattcctgcctggagcctc tacagactcggaaccctcaagggccccttcagagaggttataatacctgtctccatccgt atgaccaaagttgccggaactgccggcctcagcgggaatgtgggacccagatgtgatctg agtcgcatcgaggagcttcctgatggtggaacacggttctcagtttatagcctggagtct ctggctttggccctcgtcatcctgcccacccagcggggcttcccaacccaccacacagcc atggacgggaaggtggcagtgcaagagtgtgggcctcctgcagtctcctgggtccccgag gagggagagaagttggaccaggaagacgaggaccaggtgaaggatcggggccaatggacc aacaagatggagtttgtgctgtcagtggccggggagatcattgggctgggcaatgtctgg aggtttccctatctctgctacaaaaacggaggtggagccttcttcatcccctacttcatc ttcttctttgtctgcggcatcccggtgttcttcctggaggtggcgttgggccaatacacc agccaagggagtgtcacagcctggaggaagatctgccccctcttccagggcattggtctg gcatctgtggtcatcgagtcatatttgaatgtctactacatcatcatccttgcctgggct ctcttctacctgttcagctccttcacttctgagctgccctggacgacctgcaacaacttt tggaacacagagcattgcacggactttctgaaccactcaggagccggcacagtgacccca tttgagaattttacctcacctgtcatggaattctgggagagacgagttctgggcatcacc tcgggcatccatgacctgggctccctgcgctgggagctggccctgtgcctcctgctcgcc tgggtcatctgctatttctgcatctggaagggggtcaagtccacaggcaaggtggtttat 
ttcacagccacgtttccgtacctgatgcttgtcattttgctgatcagaggtgtcaccctt cccggagcctaccagggcatcatctactacttgaagccagatttgttccgcctcaaggac cctcaggtgtggatggatgcgggcacccagatcttcttctcctttgccatctgccagggg tgcctgacagccctgggcagctacaacaagtatcacaacaactgctacaaggactgcatc gccctctgcttcctgaacagtgccaccagctttgtggctgggtttgttgtcttctccatc ctgggcttcatgtcccaagagcaaggggtgcccatttctgaagtggccgagtcaggtcct gggctggccttcatcgccttccccaaggctgtgactatgatgcccttatcccagctgtgg tcctgcctgttctttatcatgctcatattcctagggctggacagccagtttgtctgtgtg gagtgcctggtgacagcctccatagacatgttccccaggcagctccggaagagcgggcgg cgcgagctcctcatcctcaccatcgccgtcatgtgctacctgatagggcttttcctggtc accgagggcgggatgtacatcttccagctgtttgactactatgcttccagtggcatatgc ctgctgttcctgtcattgtttgaagtggtctgcataagctgggtgtatggggcggaccgt ttctatgacaacattgaggacatgattggctaccggccatggcccctggtgaagatctcc tggctcttcctgacccctggactttgcctggccactttcctcttctccttgagcaagtac acccccctcaagtacaacaacgtctatgtgtacccgccctggggatactccattggctgg ttcctggctctgtcctccatggtctgtgtcccactcttcgtcgtcatcaccctcctgaag actcggggtcctttcaggaagcgtctgcgtcagctcatcacccctgactccagtctgcca cagcccaagcaacatccctgcttggatggcagtgctggccggaactttgggccctcccca acaagggaaggactgatagccggggagaaggagacccatttggctcggaacctgcagctc gccatgcctgagcctccctccccctccatgggctcctgtgcggcccgagcctccccgacc agcgccgccccctgctccagggcacccagtcccatcgaccacccaagggctgagaagtgc gggcgcacggaaccggactgggaggcagctccacctgtggccccactcaaggtttgtaaa cacaccaatcagcaccctgtgtctagctcagggtttgtgaatgcaccaatccacactctg tatctagcttctctggtggggacttggagaacctttgtgtctagctcaggcgttgtaaac acaccaatcaaaaccctgtcaaaacggaccaatcagcgctctgtaaaatggaccaatcgg ctctctgtaaaatggaccaatcagcaggatgtgggtggggccagtctggccacagcaggg gcagatggcctgtcaggtctgaggaacctgctggcacgtgactgggaagtaacgggaggc gagtgcaagttgccagatgatgccacagacagagaggggagcttgctccctgccaaggct gagtccttgcaggctcgtgtccatccacagggctccgccctggacggagcgtccttcttg cctgggcgtcttcagaccttcatctgtgacaccgggggccatgtggctcaccccgaggaa ggtgagtga >gi568815586r:120951_360052|GENSCAN_predicted_peptide_3|328_aa MGRNQYKKAEHSKNQNASSPPEDQISSPAKEQNWMVNEFDELTEVGFRRWVITNSSELKD 
HVLTQYKEAKNLQERLEELLTRITSLEKNINELMELKNTARELREAYTSINSRINQAEER ISEIEDQFNKIKSMRWKINKDIQELNSALHQADLIDTYRTLHPKSAEYTFCSAPHHTYSK IDPIIGSKTPLSKCKRMEIITNSLSDHSAIILELRIKKLTQNHTTTRKLNNLLLNDYWVH KEMKAEIKTFFETNENKDTTYQNLWDTFKAVCRGKFIALNAHKRKERSKIDILTSQLKEP EKQEQTNSKASRRQEITKIRAELKEIET >gi568815586r:120951_360052|GENSCAN_predicted_CDS_3|987_bp atggggagaaaccagtacaaaaaggctgaacattccaaaaaccagaatgcctcttctcct ccagaggatcaaatctcctcgccagcaaaggaacaaaactggatggtgaatgagtttgat gaattgacagaagtaggcttcagaaggtgggtaataacaaactcctctgagctaaaggac catgttctaacccaatacaaggaagctaagaaccttcaggaaaggttagaggaattgcta actagaataaccagtttagaaaagaacataaatgaactgatggagctgaaaaacacagca cgagaacttcgtgaagcatacacaagtatcaatagccgaatcaatcaagcagaagaaagg atatcagagattgaagatcaattcaataaaataaaatcaatgagatggaaaattaacaag gatatccaggaactgaactcagctctgcaccaagcagacctaatagacacctacagaact ctccaccccaaatcagcagaatacacattctgctcagcaccacatcatacttattctaaa attgaccccataattggaagtaaaacacccctcagcaaatgcaaaagaatggaaataata acaaacagtctctcagaccacagtgcaatcatattagaactcaggattaagaaactcact caaaaccacacaactacacggaaacttaacaacctgctcctgaatgactactgggtacat aaagaaatgaaggcagaaataaagacattctttgaaaccaatgagaacaaagacacaaca taccagaatctctgggacacatttaaagcagtgtgtagagggaaatttatagcactaaat gcccacaagagaaaggaaagatctaaaatcgacatcctaacatcacaattaaaagaacca gagaagcaagagcaaacaaattcaaaagctagcagaaggcaagaaataactaagatcaga gcagaactgaaggagatagagacatga >gi568815586r:120951_360052|GENSCAN_predicted_peptide_4|908_aa MPVLEQSVLAHINVDISGMKVPWLYVGMCFSSFCWHIEDHWSYSINYLHWGEPKTWYGVP SHAAEQLEEVMRELAPELFESQPDLLHQLVTIMNPNVLMEHGVPVYRTNQCAGEFVVTFP RAYHSGFNQGYNFAEAVNFCTADWLPIGRQCVNHYRRLRRHCVFSHEELIFKMAADPECL DVGLAAMVCKELTLMTEEETRLRESVVQMGVLMSEEEVFELVPDDERQCSACRTTCFLSA LTCSCNPERLVCLYHPTDLCPCPMQKKCLRYRYPLEDLPSLLYGVKVRAQSYDTWVSRVT EALSANFNHKKDLIELRVMLEDAEDRKYPENDLFRKLRDAVKEAETCASVAQLLLSKKQK HRQSPDSGRTRTKLTVEELKAFVQQLFSLPCVISQARQVKNLLDDVEEFHERAQEAMMDE TPDSSKLQMLIDMGSSLYVELPELPRLKQELQQARWLDEVRLTLSDPQQVTLDVMKKLID SGVGLAPHHAVEKAMAELQELLTVSERWEEKAKVCLQARPRHSVASLESIVNEAKNIPAF 
LPNVLSLKEALQKAREWTAKVEAIQSGSNYAYLEQLESLSAKGRPIPVRLEALPQVESQV LSPRTDIGVYGSGKNRRKKVKELIEKEKEKDLDLEPLSDLEEGLEETRDTAMVVAVFKER EQKEIEAMHSLRAANLAKMTMVDRIEEVKFCICRKTASGFMLQCELCKDWFHNSCVPLPK SSSQKKGSSWQAKEVKFLCPLCMRSRRPRLETILSLLVSLQKLPVRLPEGEALQCLTERA MSWQDRARQALATDELSSALAKLSVLSQRMVEQAAREKTEKIISAELQKAAANPDLQASL GFTFFFYFMGYGHLPSFQQSAFNRVVSSVSSSPRQTMDYDDEETDSDEDIRETYGYDMKL ADQHGTAH >gi568815586r:120951_360052|GENSCAN_predicted_CDS_4|2727_bp atgcctgtcctggaacagtctgttcttgcacatattaatgtggacatctctggtatgaaa gtgccgtggctctatgtgggaatgtgcttctcttctttttgctggcacattgaggatcac tggagttattccatcaactacttgcactggggggagccaaagacatggtatggtgtgcca tctcatgctgcagagcaactggaggaggtgatgagagagctggcccccgagttatttgaa tcccagcctgatcttctgcatcagttagttaccatcatgaaccccaacgtgctaatggag catggtgtgcctgtgtacaggaccaatcagtgtgctggcgagtttgttgtgacatttcct cgtgcctatcactctggatttaaccagggctacaactttgctgaagctgtgaacttctgt actgctgactggttgcccattggacgtcaatgtgtaaatcattaccgacgcctaaggcgc cactgtgtcttttcacacgaggaactaattttcaagatggcagcagatccagaatgctta gatgtggggctggctgccatggtctgcaaagaattgactctcatgactgaagaagaaaca cgattaagagagtctgttgtacagatgggtgtcctgatgtcagaagaagaagtgtttgaa cttgttcctgatgatgagcggcagtgttcagcatgcagaaccacatgttttctctctgct ctcacatgttcctgtaatcctgagcggcttgtatgtctctaccatccaactgatctgtgc ccctgccccatgcagaagaaatgtcttagatatcgctacccattagaagacctcccttct ctgctatatggtgtaaaagtcagggcacagtcctatgacacttgggtcagtcgtgttaca gaagcattgtctgctaacttcaaccacaaaaaagatttgattgaattgcgagtaatgctg gaagatgctgaggataggaaatacccagagaatgatctctttcgaaaactcagggatgct gtaaaagaagctgagacctgtgcttctgtggctcagctgcttctgagcaaaaagcagaaa cacagacagagcccagatagtgggaggactcggaccaaactgacagtggaagaattgaag gcctttgtccaacaactttttagtcttccgtgtgtcatcagccaagctcggcaagtaaag aatctgctagatgatgtggaagagtttcatgaacgtgctcaggaggccatgatggatgaa accccagattcttccaaactccagatgttgatagatatgggctctagtctctatgtggaa ctccctgaattaccacgactgaagcaagagctacaacaggctcggtggttggacgaagta agactgaccttatcagatccgcaacaagtcactttggatgtcatgaagaagctgatagac tctggggtagggttggcaccccaccatgctgtggagaaagcaatggctgaactacaggag 
ctccttacagtctctgaacgatgggaagaaaaggctaaggtctgcctacaggcaagaccg aggcacagtgtggcaagtttagaaagcattgtgaatgaagccaagaacattccagccttt ctacccaatgtgttgtccttgaaagaagccttacaaaaggctcgagaatggaccgctaaa gtggaagctattcagagtggcagcaattacgcttatttggagcagcttgagagcttgtct gcgaaaggacgccctattcctgtgcgtcttgaagcactgccgcaagtggaatcacaggtg ctgagcccccggaccgacattggtgtatatgggagtggcaaaaataggaggaaaaaagta aaagaactaatagaaaaagaaaaagaaaaggatctggacctggagcctctgagtgatctg gaggaaggattggaggaaaccagagatacagccatggtggtggcagttttcaaagaacgg gagcaaaaagagattgaagccatgcattctctcagagcagccaacctagccaagatgaca atggtggaccgcatagaagaagtaaaattttgcatttgccgcaagacagccagtgggttt atgctacagtgtgagctctgcaaagactggttccataacagctgtgttcctcttcctaaa tcaagttcccaaaaaaaaggatccagctggcaagctaaagaagtaaaattcctttgccct ctttgtatgcggtctcgaaggcccaggctagagactattctgtcactcctggtatccctt cagaagttgcccgtacggttgcctgaaggagaggccctgcagtgtttgacagaacgtgct atgagttggcaagatagagcgcggcaggctctagccacagatgaactatcctctgccctg gccaaactatctgtgttgagccagcgtatggtggaacaggcggctcgagaaaaaactgaa aagatcatcagtgcagaactccaaaaagcagctgccaatccagacttacaggcaagttta ggatttacattctttttctatttcatgggatatggacacttacctagtttccagcagtct gcttttaaccgggtggtgagcagtgtgtcatcttctcctcgacaaacaatggactatgat gatgaagaaacagactctgatgaagacattcgagagacatatggctacgacatgaagctt gcagatcagcatggtactgctcattaa >gi568815586r:120951_360052|GENSCAN_predicted_peptide_5|109_aa MAAPCQQWNKAGWRMTDELTEVGFRRSVKTNFSELKEDVQTHRKEAKNLEKRLDKSLTRI NSVEKTLNDLMELKTMGRELHYTCTSFNSQFDQVEERVSVIEDQINEIK >gi568815586r:120951_360052|GENSCAN_predicted_CDS_5|330_bp atggcagctccttgccagcaatggaacaaagctggatggagaatgactgatgagttgaca gaagtagggttcagaaggtcagtaaaaacaaacttctccgagctaaaggaggatgttcaa acccatcgcaaggaagctaaaaaccttgaaaaaagattagacaaatcgctaactagaata aacagtgtagagaagaccttaaatgacctgatggagctgaaaaccatgggacgagaactt cattacacatgcacaagcttcaatagccaatttgatcaagtggaagaaagggtatcagtg attgaagatcaaattaatgaaataaagtga >gi568815586r:120951_360052|GENSCAN_predicted_peptide_6|292_aa MKAEIKMFFETNENKNTTYQNLWDTFKAVCRGKFIALNAHKRKQERSKMDILTSQLKELE 
KQEQANSKASRRQEITKIRAELKDRETQKTFKTSIKSRTWFFEKINKIDRPRARLIKKKG EKNQIDAIKNDKGDITTNPTEIQTTIREYYKHLYTNKLENLEEMDKFLDTYTLPRLNQEE VESLNRPIAGSEIEAIIAYQTKKVQDQMDSQPNSTRAIEKDGILPNSFYEANIILIPKPG RDTTKKENFRPISLMNIDVKIPNKILANRIQQHVKKLIIHHNRVGFIPGMQG >gi568815586r:120951_360052|GENSCAN_predicted_CDS_6|879_bp atgaaggcagaaataaagatgttctttgaaacaaatgagaacaaaaacacaacataccag aatctctgggacacatttaaagcagtgtgtagagggaaatttatagcactaaatgcccac aagagaaagcaggaaagatctaaaatggacatcctaacatcacaattaaaagaactagag aagcaagagcaagcaaattcaaaagctagcagaaggcaagaaataactaagatcagagca gaactgaaagacagagagacacaaaaaaccttcaaaacatcaattaaatccaggacctgg ttttttgaaaagatcaacaaaattgatagaccacgagcaagactaataaaaaagaaagga gagaagaatcaaatagatgcaataaaaaatgataaaggggatatcaccaccaatcccaca gaaatacaaactaccatcagagaatactataaacacctctacacaaataaactagaaaat ctagaagaaatggataaattcctggacacatacactctcccaagactaaaccaggaagaa gttgaatctctgaacagaccaatagcaggctctgaaattgaggcaataatagcctaccaa acaaaaaaagtccaggaccagatggattcacagccaaattctaccagagcaatagaaaaa gacggaatcctccctaactcattttatgaggccaacatcatcctgataccaaagcctggc agagacacaacaaaaaaagagaattttagaccaatatccctgatgaacatcgatgtgaaa atccccaataaaatattggcaaaccgaatccagcagcacgtcaaaaagcttattattcac cacaatcgagttggcttcatccccgggatgcaaggctag >gi568815586r:120951_360052|GENSCAN_predicted_peptide_7|171_aa SESGDVSRNTELKKLQIFGAGPKVVGLAMGTKDKEDEVTRRRKVTNRSDAFNMQMRQRKG TLSVNFECSKPREAFGFEQAVREYTLQSFGEMADNFKSDYFNMPVHMVPTELVEKEFWRL VSSIEEDVIVEYGADISSKDFGSGFPVKDGRRKILPEEEVQIYPNFCLPPS >gi568815586r:120951_360052|GENSCAN_predicted_CDS_7|516_bp tcagaatctggagatgtgagtagaaacacggaactgaagaaacttcagatttttggggct gggcccaaggttgtgggcttggcaatgggaacaaaagataaagaagatgaggtcacccga agacgaaaagttaccaacaggtcagacgcatttaacatgcaaatgagacaacggaaaggc actctctctgttaactttgaatgtagcaaacctcgagaagcctttggatttgaacaagct gtacgagagtatacacttcagagctttggagagatggcagataattttaagtctgattat tttaatatgccagtccatatggttcccacagaactagtagaaaaggaattttggcggctg gtaagcagcattgaagaagatgttattgtggaatatggagcagatatctcctcaaaagac tttggaagtggatttccggtgaaggatgggcggagaaagattctgccagaagaagaggtg 
cagatttatccaaacttttgcttacccccaagctga