GENSCAN 1.0 Date run: 4-Nov-116 Time: 17:26:40 Sequence gi568815576f:37749321_37988461 : 239141 bp : 52.45% C+G : Isochore 3 (51 - 57 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1821 1880 60 2 0 69 97 75 0.972 6.16 1.02 Intr + 2452 2508 57 0 0 129 94 69 0.977 11.17 1.03 Intr + 5781 5870 90 2 0 50 66 216 0.927 16.29 1.04 Intr + 6230 6339 110 1 2 66 70 154 0.990 10.98 1.05 Intr + 8293 8818 526 1 1 87 80 796 0.999 72.24 1.06 Intr + 9834 9944 111 2 0 113 110 69 0.971 12.68 1.07 Intr + 16350 16497 148 2 1 88 80 228 0.346 22.32 1.08 Intr + 18754 18856 103 2 1 94 63 141 0.999 11.93 1.09 Intr + 19708 19867 160 1 1 92 80 314 0.999 31.50 1.10 Intr + 19942 20055 114 0 0 93 105 152 0.997 18.55 1.11 Intr + 22330 22548 219 0 0 63 53 187 0.764 11.73 1.12 Intr + 23023 23223 201 0 0 64 16 96 0.354 0.00 1.13 Term + 23281 23442 162 0 0 138 44 337 0.928 32.55 1.14 PlyA + 25171 25176 6 1.05 2.04 PlyA - 27352 27347 6 1.05 2.03 Term - 34751 34637 115 1 1 73 47 74 0.281 0.15 2.02 Intr - 35680 35519 162 0 0 127 94 -31 0.290 1.01 2.01 Init - 41443 41292 152 1 2 114 66 95 0.729 9.38 2.00 Prom - 51105 51066 40 -2.11 3.00 Prom + 51224 51263 40 -3.11 3.01 Sngl + 56225 56809 585 1 0 116 48 1040 0.991 99.37 3.02 PlyA + 57318 57323 6 -0.45 4.00 Prom + 57890 57929 40 -0.51 4.01 Init + 58648 58843 196 0 1 72 82 301 0.587 24.97 4.02 Intr + 60707 60837 131 0 2 113 86 130 0.904 16.22 4.03 Intr + 63567 63668 102 2 0 94 85 153 0.968 16.47 4.04 Intr + 64143 64289 147 2 0 98 75 461 0.957 46.74 4.05 Intr + 65806 65960 155 0 2 96 56 217 0.958 18.68 4.06 Intr + 66098 66180 83 2 2 69 82 99 0.997 7.18 4.07 Intr + 66343 66514 172 2 1 122 64 225 0.754 23.02 4.08 Intr + 66880 67001 122 1 2 109 94 96 0.996 12.94 4.09 Intr + 67247 67368 122 0 2 93 3 231 0.002 15.82 4.10 Intr + 68458 68709 252 0 0 51 23 128 0.000 0.76 4.11 Intr + 69203 69351 149 1 2 16 49 98 0.000 -1.76 4.12 Intr + 74068 74445 378 1 0 95 85 595 0.039 54.54 4.13 Term + 75403 76150 748 1 1 75 40 1124 0.999 99.67 4.14 PlyA + 76507 76512 6 1.05 5.09 PlyA - 76689 76684 6 1.05 5.08 Term - 82697 82623 75 2 0 97 43 65 0.156 1.04 5.07 Intr - 83424 83317 108 0 0 63 99 90 0.983 8.58 5.06 Intr - 83789 83638 152 0 2 118 109 178 0.982 23.29 5.05 Intr - 83886 83839 48 1 0 96 87 80 0.987 7.84 5.04 Intr - 84435 84364 72 1 0 91 94 29 0.795 3.67 5.03 Intr - 89278 89180 99 2 0 58 97 126 0.995 11.08 5.02 Intr - 90914 90867 48 0 0 91 109 78 0.998 9.34 5.01 Init - 94918 94591 328 1 1 80 101 522 0.995 50.19 5.00 Prom - 97514 97475 40 -5.11 6.00 Prom + 99861 99900 40 -7.89 6.01 Init + 100130 100162 33 1 0 87 94 41 0.936 4.41 6.02 Intr + 100695 100743 49 2 1 135 94 118 0.995 15.84 6.03 Intr + 101960 102170 211 0 1 60 80 325 0.813 27.40 6.04 Intr + 106245 106314 70 0 1 51 52 53 0.345 -2.32 6.05 Intr + 113649 113718 70 2 1 122 64 72 0.967 7.45 6.06 Intr + 113952 114025 74 1 2 92 70 76 0.713 5.82 6.07 Intr + 120856 121027 172 0 1 86 105 221 0.870 23.63 6.08 Intr + 125050 125204 155 2 2 123 59 176 0.980 18.50 6.09 Intr + 126521 126691 171 1 0 87 91 275 0.984 28.35 6.10 Intr + 128354 128851 498 1 0 86 91 814 0.963 75.27 6.11 Intr + 137445 137525 81 2 0 132 78 130 0.995 16.73 6.12 Term + 139106 139144 39 1 0 126 45 37 0.968 0.68 6.13 PlyA + 140787 140792 6 1.05 7.00 Prom + 153393 153432 40 -1.11 7.01 Init + 156717 156862 146 2 2 84 7 172 0.951 6.26 7.02 Intr + 156913 157248 336 1 0 61 68 438 0.639 34.29 7.03 Intr + 162632 162680 49 2 1 138 111 75 0.961 13.97 7.04 Intr + 163031 163172 142 1 1 87 100 182 0.997 19.74 7.05 Intr + 168441 168475 35 1 2 66 96 41 0.421 1.13 7.06 Intr + 169716 169858 143 2 2 126 117 139 0.981 20.26 7.07 Intr + 172652 173106 455 2 2 107 101 366 0.950 33.28 7.08 Intr + 175340 175397 58 0 1 99 63 23 0.976 -0.67 7.09 Intr + 176341 176723 383 1 2 123 92 358 0.999 34.82 7.10 Intr + 178091 178506 416 0 2 68 113 248 0.566 19.70 7.11 Intr + 180349 180495 147 0 0 65 80 58 0.719 3.64 7.12 Intr + 182479 182613 135 0 0 113 76 143 0.990 16.87 7.13 Intr + 183233 183359 127 1 1 -2 50 241 0.599 12.06 7.14 Intr + 183478 183568 91 2 1 46 56 206 0.960 12.75 7.15 Intr + 183719 183792 74 2 2 64 109 133 0.524 12.54 7.16 Intr + 187760 187874 115 0 1 84 46 286 0.998 24.01 7.17 Intr + 188426 188499 74 2 2 76 93 120 0.926 10.84 7.18 Term + 191389 191510 122 2 2 109 42 98 0.491 6.24 7.19 PlyA + 192665 192670 6 1.05 8.08 PlyA - 193732 193727 6 1.05 8.07 Term - 194926 194855 72 1 0 111 33 22 0.402 -2.80 8.06 Intr - 195197 195097 101 0 2 78 91 136 0.976 13.23 8.05 Intr - 195853 195722 132 2 0 89 57 131 0.912 11.22 8.04 Intr - 198143 197961 183 0 0 99 100 71 0.637 9.68 8.03 Intr - 202202 202140 63 0 0 41 78 132 0.492 6.68 8.02 Intr - 203838 203646 193 0 1 94 48 209 0.224 17.09 8.01 Init - 204080 204012 69 2 0 102 3 28 0.302 -3.30 8.00 Prom - 204207 204168 40 -10.76 9.00 Prom + 204275 204314 40 -8.29 9.01 Init + 204468 204487 20 2 2 50 89 58 0.856 1.53 9.02 Intr + 207453 207522 70 0 1 92 78 163 0.812 15.38 9.03 Intr + 210026 210156 131 1 2 87 66 160 0.488 13.70 9.04 Intr + 217779 217850 72 0 0 114 108 94 0.903 13.02 9.05 Term + 218305 218395 91 1 1 106 49 231 0.999 18.49 9.06 PlyA + 218460 218465 6 1.05 10.07 PlyA - 221388 221383 6 1.05 10.06 Term - 224878 224175 704 2 2 126 43 754 0.999 68.83 10.05 Intr - 228815 228547 269 1 2 65 109 372 0.998 34.71 10.04 Intr - 232538 232116 423 1 0 90 48 137 0.631 3.45 10.03 Intr - 234548 234037 512 2 2 85 83 1006 0.888 92.16 10.02 Intr - 235121 235019 103 1 1 95 105 41 0.848 7.18 10.01 Intr - 235257 235192 66 2 0 148 79 -14 0.634 2.21 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 45905 45953 49 1 1 86 58 39 0.880 -0.25 S.002 Term + 46553 46674 122 0 2 108 49 81 0.962 5.14 S.003 Term + 67247 67398 152 0 2 93 49 246 0.998 19.58 S.004 Init + 74087 74445 359 1 2 90 85 584 0.959 54.90 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815576f:37749321_37988461|GENSCAN_predicted_peptide_1|686_aa MLQLVAPRPRGCAPLGGTQKPDLLNFKKGWMSILDEPGEADELDGEIDLRSCTDVTEYAV QRNYGFQIHTKDAVYTLSAMTSGIRRNWIEALRKTVRPTSAPDVTKLSDSNKENALHSYS TQKGPLKAGEQRAGSEVISRGGPRKADGQRQALDYVELSPLTQASPQRARTPARTPDRLA KQEELERDLAQRSEERRKWFEATDSRTPEVPAGEGPRRGLGAPLTEDQQNRLSEEIEKKW QELEKLPLRENKRVPLTALLNQSRGERRGPPSDGHEALEKEVQALRAQLEAWRLQGEAPQ SALRSQEDGHIPPGYISQEACERSLAEMESSHQQVMEELQRHHERELQRLQQEKEWLLAE ETAATASAIEAMKKAYQEELSRELSKTRSLQQGPDGLRKQHQSDVEALKRELQVLSEQYS QKCLEIGALMRQAEEREHTLRRCQQEGQELLRHNQELHGRLSEEIDQLRGFIASQGMGNG CGRSNERSSCELEVLLRVKENELQYLKKEVQCLRDELQMMQKVGPSAGLGAVGDSGAIWM PSCEHLLCAKPSTRFILPQALLCFSHPLTALRSWSKSSPPKQDEDDNDACVPGWYQGGGR GKPRTHGHLGEPPGGMKRGICGEVWLLVLPMCPDKRFTSGKYQDVYVELSHIKTRSEREI EQLKEHLRLAMAALQEKESMRNSLAE >gi568815576f:37749321_37988461|GENSCAN_predicted_CDS_1|2061_bp atgctgcagctggtagcccccagaccccggggctgtgcccccctgggcggcacccagaag cccgatctgctcaacttcaagaagggatggatgtcgatcttggacgagcctggagaggca gatgagctggatggtgagatcgacctgcgttcctgcacggatgtcactgagtacgcggtg cagcgcaactatggcttccagatccacaccaaggatgctgtctataccttgtcggccatg acctcaggcatccggcggaactggatcgaggctctgagaaagaccgtacgtccaacttca gccccagatgtcaccaagctctcggactctaacaaggagaacgcgctgcacagctacagc acccagaagggccccctgaaggcaggggagcagcgggcgggctctgaggtcatcagccgg ggtggccctcggaaggcggacgggcagcgtcaggccttggactacgtggagctctcgccg ctgacccaggcttccccgcagcgggcccgcaccccagcccgcactcctgaccgcctggcc aagcaggaggagctggagcgggacctggcccagcgctccgaggagcggcgcaagtggttt gaggccacagacagcaggaccccagaggtgcctgctggtgaggggccgcgccggggcctg ggtgcccccctgactgaggaccagcaaaaccggcttagtgaggagatcgagaagaagtgg caggagctggagaagctgcccctgcgggagaataagcgggtgcccctcactgccctgctc aaccaaagccgcggagagcgccgagggcccccaagtgacggccacgaggcactggagaag gaggttcaggctcttcgggcccagctggaggcgtggcgtctccaaggggaggctcctcag agtgcactgagatcccaggaggatggccacatccccccgggctacatctcacaggaggca tgtgagcgcagcctggcagagatggagtcctcgcaccagcaggtgatggaggagctgcag cggcaccacgagcgggagctgcagcgcctgcagcaggagaaggagtggctcctggctgag gagacggcagccacggcctcagccattgaagccatgaagaaggcctaccaggaagagctg agccgagagctgagcaaaacacggagtctccagcagggcccggatggcctccggaagcag caccagtcagatgtggaggcactgaagcgagagctgcaggtgctatcggagcagtactcg cagaagtgcctggagattggggcactcatgcggcaggctgaggagcgcgagcacacgctg cgccgctgccagcaggagggccaggagctgctgcgccacaaccaggagctgcatggccgc ctgtcagaggagatagaccagctgcgcggcttcattgcctcgcagggcatgggcaatggc tgcgggcgcagcaacgagcggagttcctgcgagctagaggtgctgcttcgcgtaaaagaa aacgaactccagtacctaaagaaggaggtgcagtgcctccgggacgagctccagatgatg cagaaggtaggtccttccgctgggctgggggccgtcggggactctggagccatctggatg ccatcctgtgagcacctgctctgtgccaagccctccactcgcttcatccttcctcaggct ctcctgtgcttctcccatccactcactgccctgcggtcttggtcaaaatcttctcccccg aaacaggatgaggatgacaatgacgcctgtgtccctgggtggtaccagggaggtgggagg ggtaagcccagaacccacggccatcttggggagccacctggagggatgaagcgaggtatc tgcggggaggtctggctgctggtgctgcccatgtgcccggacaagcgcttcacctcggga aagtaccaggacgtctatgtggagctgagccacatcaagacacggtctgagcgggagatc gagcagctgaaggagcacctgcgtcttgccatggccgccctccaggagaaggagtcgatg cgcaacagcctggctgagtag >gi568815576f:37749321_37988461|GENSCAN_predicted_peptide_2|142_aa MPGWENIFSSMNHTLNALTGAVTSTRDNRDCSLVRAMPNWGPNAAGISPCRVSAGTGRVP SCLCHSGAWNRLDQDINRNNSCYSDSVGLATLRASLNPISSSHLRHTWPVQPAVLLLSED TATWEQLECGSGDATVEVTASR >gi568815576f:37749321_37988461|GENSCAN_predicted_CDS_2|429_bp atgcccggctgggaaaatattttctcctccatgaaccatactctcaatgcactaactggt gctgtgacaagcaccagggacaacagggactgctctctggtgagggccatgcccaactgg ggcccaaatgcagctggcatcagtccctgcagggtgagcgcagggacagggcgtgtccct tcatgtctgtgtcactcaggtgcctggaacaggcttgaccaggatattaacaggaataac agctgctactcagactcagtgggcctggccacgctcagggcctcacttaatcccatttct tcatctcatctcagacacacttggccagtgcagccagctgtcctgctcttgagtgaggac acagccacgtgggagcagctggaatgtggctctggagatgccacagttgaggtcacagct agtagatga >gi568815576f:37749321_37988461|GENSCAN_predicted_peptide_3|194_aa MTENSTSAPAAKPKRAKASKKSTDHPKYSDMIVAAIQAEKNRAGSSRQSIQKYIKSHYKV GENADSQIKLSIKRLVTTGVLKQTKGVGASGSFRLAKSDEPKKSVAFKKTKKEIKKVATP KKASKPKKAASKAPTKKPKATPVKKAKKKLAATPKKAKKPKTVKAKPVKASKPKKAKPVK PKAKSSAKRAGKKK >gi568815576f:37749321_37988461|GENSCAN_predicted_CDS_3|585_bp atgaccgagaattccacgtccgcccctgcggccaagcccaagcgggccaaggcctccaag aagtccacagaccaccccaagtattcagacatgatcgtggctgccatccaggccgagaag aaccgcgctggctcctcgcgccagtccattcagaagtatatcaagagccactacaaggtg ggtgagaacgctgactcgcagatcaagttgtccatcaagcgcctggtcaccaccggtgtc ctcaagcagaccaaaggggtgggggcctcggggtccttccggctagccaagagcgacgaa cccaagaagtcagtggccttcaagaagaccaagaaggaaatcaagaaggtagccacgcca aagaaggcatccaagcccaagaaggctgcctccaaagccccaaccaagaaacccaaagcc accccggtcaagaaggccaagaagaagctggctgccacgcccaagaaagccaaaaaaccc aagactgtcaaagccaagccggtcaaggcatccaagcccaaaaaggccaaaccagtgaaa cccaaagcaaagtccagtgccaagagggccggcaagaagaagtga >gi568815576f:37749321_37988461|GENSCAN_predicted_peptide_4|918_aa MWPGNAWRAALFWVPRGRRAQSALAQLRGILEGELEGIRGAGTWKSERVITSRQGPHIRV DGVSGGILNFCANNYLGLSSHPEVIQAGLQALEEFGAGLSSVRFICGTQSIHKNLEAKIA RFHQREDAILYPSCYDANAGLFEALLTPEDAVLSDELNHASIIDGIRLCKAHKYRYRHLD MADLEAKLQEAQKHRLRLVATDGAFSMDGDIAPLQEICCLASRYGALVFMDECHATGFLG PTGRGTDELLGVMDQVTIINSTLGKALGGASGGYTTGPGPLVSLLRQRARPYLFSNSLPP AVVGCASKALDLLMGSNTIVQSMAAKTQRFRSKMEAAGFTISGASHPICPVMLGDARLAS RMADDMLKRGIFVIGFSYPVVPKGKARIRVQISAVHSEEDIDRCVEAFVEAYDDQPPGGA ASTGRGRRLAPPLQTGGLGPRLFRLPSSQGQERRFAAAKAPSSLVPHNGSPLSCWRGLRE EGGLSLATLWTPCGPAAHVKRHGQLRGDVWQSVCCTRRGQNRGPRSTQTDVRRHEARSRQ RRRKCPSDGEMADAQNISLDSPGSVGAVAVPVVFALIFLLGTVGNGLVLAVLLQPGPSAW QEPGSTTDLFILNLAVADLCFILCCVPFQATIYTLDAWLFGALVCKAVHLLIYLTMYASS FTLAAVSVDRYLAVRHPLRSRALRTPRNARAAVGLVWLLAALFSAPYLSYYGTVRYGALE LCVPAWEDARRRALDVATFAAGYLLPVAVVSLAYGRTLRFLWAAVGPAGAAAAEARRRAT GRAGRAMLAVAALYALCWGPHHALILCFWYGRFAFSPATYACRLASHCLAYANSCLNPLV YALASRHFRARFRRLWPCGRRRRHRARRALRRVRPASSGPPGCPGDARPSGRLLAGGGQG PEPREGPVHGGEAARGPE >gi568815576f:37749321_37988461|GENSCAN_predicted_CDS_4|2757_bp atgtggcctgggaacgcctggcgcgccgcactcttctgggtgccccgcggccgccgcgca cagtcagcgctggcccagctgcgtggcattctggagggggagctggaaggcatccgcgga gctggcacttggaagagtgagcgggtcatcacgtcccgtcaggggccgcacatccgcgtg gacggcgtctccggaggaatccttaacttctgtgccaacaactacctgggcctgagcagc caccctgaggtgatccaggcaggtctgcaggctctggaggagtttggagctggcctcagc tctgtccgctttatctgtggaacccagagcatccacaagaatctagaagcaaaaatagcc cgcttccaccagcgggaggatgccatcctctatcccagctgttatgacgccaacgccggc ctctttgaggccctgctgaccccagaggacgcagtcctgtcggacgagctgaaccatgcc tccatcatcgacggcatccggctgtgcaaggcccacaagtaccgctatcgccacctggac atggccgacctagaagccaagctgcaggaggcccagaagcatcggctgcgcctggtggcc actgatggggccttttccatggatggcgacatcgcacccctgcaggagatctgctgcctc gcctctagatatggtgccctggtcttcatggatgaatgccatgccactggcttcctgggg cccacaggacggggcacagatgagctgctgggtgtgatggaccaggtcaccatcatcaac tccaccctggggaaggccctgggtggagcatcagggggctacacgacagggcctgggccc ctggtgtccctgctgcggcagcgcgcccggccatacctcttctccaacagtctgccacct gctgtcgttggctgcgcctccaaggccctagatctgctgatggggagtaacaccattgtc cagtctatggctgccaagacccagaggttccgtagtaagatggaagctgctggcttcact atctcgggagccagtcaccccatctgccctgtgatgctgggtgatgcccggctggcctct cgcatggcggatgacatgctgaagagaggcatctttgtcatcgggttcagctaccccgtg gtccccaagggcaaggcccggatccgggtacagatctcagcagtgcatagcgaggaagac attgaccgctgcgtggaggccttcgtggaagcctacgatgatcagccaccagggggtgct gcgagcacgggccgcggccgccggctcgccccgcccctccagactgggggccttgggccg cggctgttcaggctgcccagcagtcaaggccaggagaggcgatttgctgctgccaaggcc ccatcctcccttgtgccccacaacggctcgccgctttcctgttggaggggcctgcgggag gagggcggtctctccctggcgaccttgtggaccccttgtgggccagcagctcatgttaag cgccacgggcagctgcggggtgacgtgtggcagtcggtgtgctgcacccggcggggccag aacagagggcccaggtccacccagaccgacgtgaggcggcacgaggcgagatccagacag cggcgcagaaagtgcccgtctgatggggagatggctgatgcccagaacatttcactggac agcccagggagtgtgggggccgtggcagtgcctgtggtctttgccctaatcttcctgctg ggcacagtgggcaatgggctggtgctggcagtgctcctgcagcctggcccgagtgcctgg caggagcctggcagcaccacggacctgttcatcctcaacctggcggtggctgacctctgc ttcatcctgtgctgcgtgcccttccaggccaccatctacacgctggatgcctggctcttt ggggccctcgtctgcaaggccgtgcacctgctcatctacctcaccatgtacgccagcagc tttacgctggctgctgtctccgtggacaggtacctggccgtgcggcacccgctgcgctcg cgcgccctgcgcacgccgcgtaacgcccgcgccgcagtggggctggtgtggctgctggcg gcgctcttctcggcgccctacctcagctactacggcaccgtgcgctacggcgcgctggag ctctgcgtgcccgcctgggaggacgcgcgccgccgcgccctggacgtggccaccttcgct gccggctacctgctgcccgtggctgtggtgagcctggcctacgggcgcacgctgcgcttc ctgtgggccgccgtgggtcccgcgggcgcggcggcggccgaggcgcggcggagggcgacg ggccgcgcggggcgcgccatgctggcggtggccgcgctctacgcgctctgctggggtccg caccacgcgctcatcctgtgcttctggtacggccgcttcgccttcagcccggccacctac gcctgccgcctggcctcacactgcctggcctacgccaactcctgcctcaacccgctcgtc tacgcgctcgcctcgcgccacttccgcgcgcgcttccgccgcctgtggccgtgcggccgc cgacgccgccaccgtgcccgccgcgccttgcgtcgcgtccgccccgcgtcctcgggccca cccggctgccccggagacgcccggcctagcgggaggctgctggctggtggcggccagggc ccggagcccagggagggacccgtccacggcggagaggctgcccgaggaccggaataa >gi568815576f:37749321_37988461|GENSCAN_predicted_peptide_5|309_aa MAAAAGDADDEPRSGHSSSEGECAVAPEPLTDAEGLFSFADFGSALGGGGAGLSGRASGG AQSPLRYLHVLWQQDAEPRDELRCKIPAGRLRRAARPHRRLGPTGKEVHALKRLRDSANA NDVETVQQLLEDGADPCAADDKGRTALHFASCNGNDQIVQLLLDHGADPNQRDGLGNTPL HLAACTNHVPVITTLLRGECSPQLVPAGARVDALDRAGRTPLHLAKSKLNILQEGHAQCL EAVRLEVKQIIHMLREYLERLGQHEQRERLDDLCTRLQMTSTKEQVDEVTDLLASFTSLS LQMQSMEKR >gi568815576f:37749321_37988461|GENSCAN_predicted_CDS_5|930_bp atggcagccgccgccggggacgcggacgacgagccgcgctcaggccactcgagctcggag ggcgagtgcgcggtggcgccggagccgctgactgacgctgagggcctcttctccttcgct gacttcgggtctgcgctgggcggcggcggcgcgggcctctcgggccgggcgtccggcggg gcccagtcgccgctgcgctacttgcacgtcctgtggcagcaggatgcggagccgcgcgac gagctgcgctgcaagatacccgctggccggctgaggcgcgctgccaggccccaccggcgg ctcgggcccacgggcaaggaggtgcacgctctgaagagactgagggactcggccaatgcc aatgatgtggaaacagtgcagcagctgctggaagatggcgcggatccctgtgcagctgat gacaagggccgcacagctctacactttgcctcatgcaatggcaatgaccagattgtgcag ctgctcctggaccatggtgctgatcctaaccagcgagatgggctggggaacacgccactg cacctggcggcctgcaccaaccacgttcctgtcatcaccacactgctacgaggagaatgc tcacctcagctggtacctgcaggggcccgtgtagatgccctggaccgagctggtcgcaca cccctgcacctggccaagtcaaagctgaatatcctgcaggagggccatgcccagtgccta gaggctgtgcgtctggaggtgaagcagatcatccatatgctgagggagtatctggagcgc ctagggcaacatgagcagcgagaacgcctggatgacctctgcacccgcctgcagatgacc agtaccaaagagcaggtggatgaagtgactgacctcctggccagcttcacctccctcagt ctgcagatgcagagcatggagaagaggtag >gi568815576f:37749321_37988461|GENSCAN_predicted_peptide_6|540_aa MSYPADDYESEAAYDPYAYPSDYDMHTGDPKQDLAYERQYEQQTYQVIPEVIKNFIQYFH KTVSDLIDQKVYELQASRVSSDVIDQKVYEIQDIYENSWTKLTERFFKNTPWPEAEAIAP QGGPSLEQRFESYYNYCNLFNYILNADGPAPLELPNQWLWDIIDEFIYQFQSFSQYRCKT AKKSEEEIDFLRSNPKIWNVHSVLNVLHSLVDKSNINRQLEVYTSGGDPESVAGEYGRHS LYKMLGYFSLVGLLRLHSLLGDYYQAIKVLENIELNKKSMYSRVPECQVTTYYYVGFAYL MMRRYQDAIRVFANILLYIQRTKSMFQRTTYKYEMINKQNEQMHALLAIALTMYPMRIDE SIHLQLREKYGDKMLRMQKGDPQVYEELFSYSCPKFLSPVVPNYDNVHPNYHKEPFLQQL KVFSDEVQQQAQLSTIRSFLKLYTTMPVAKLAGFLDLTEQEFRIQLLVFKHKMKNLVWTS GISALDGEFQSASEVDFYIDKDMIHIADTKVARRYGDFFIRQIHKFEELNRTLKKMGQRP >gi568815576f:37749321_37988461|GENSCAN_predicted_CDS_6|1623_bp atgtcttatcccgctgatgattatgagtctgaggcggcttatgacccctacgcttatccc agcgactatgatatgcacacaggagatccaaagcaggaccttgcttatgaacgtcagtat gaacagcaaacctatcaggtgatccctgaggtgatcaaaaacttcatccagtatttccac aaaactgtctcagatttgattgaccagaaagtgtatgagctacaggccagtcgtgtctcc agtgatgtcattgaccagaaggtgtatgagatccaggacatctatgagaacagctggacc aagctgactgaaagattcttcaagaatacaccttggcccgaggctgaagccattgctcca caggggggaccttccttggagcagaggtttgaatcctattacaactactgcaatctcttc aactacattcttaatgccgatggtcctgctccccttgaactacccaaccagtggctctgg gatattatcgatgagttcatctaccagtttcagtcattcagtcagtaccgctgtaagact gccaagaagtcagaggaggagattgactttcttcgttccaatcccaaaatctggaatgtt catagtgtcctcaatgtccttcattccctggtagacaaatccaacatcaaccgacagttg gaggtatacacaagcggaggtgaccctgagagtgtggctggggagtatgggcggcactcc ctctacaaaatgcttggttacttcagcctggtcgggcttctccgcctgcactccctgtta ggagattactaccaggccatcaaggtgctggagaacatcgaactgaacaagaagagtatg tattcccgtgtgccagagtgccaggtcaccacatactattatgttgggtttgcatatttg atgatgcgtcgttaccaggatgccatccgggtcttcgccaacatcctcctctacatccag aggaccaagagcatgttccagaggaccacgtacaagtatgagatgattaacaagcagaat gagcagatgcatgcgctgctggccattgccctcacgatgtaccccatgcgtattgatgag agcattcacctccagctgcgggagaaatatggggacaagatgttgcgcatgcagaaaggt gacccacaagtctatgaagaacttttcagttactcctgccccaagttcctgtcgcctgta gtgcccaactatgataatgtgcaccccaactaccacaaagagcccttcctgcagcagctg aaggtgttttctgatgaagtacagcagcaggcccagctttcaaccatccgcagcttcctg aagctctacaccaccatgcctgtggccaagctggctggcttcctggacctcacagagcag gagttccggatccagcttcttgtcttcaaacacaagatgaagaacctcgtgtggaccagc ggtatctcagccctggatggtgaatttcagtcagcctcagaggttgacttctacattgat aaggacatgatccacatcgcggacaccaaggtcgccaggcgttatggggatttcttcatc cgtcagatccacaaatttgaggagcttaatcgaaccctgaagaagatgggacagagacct tga >gi568815576f:37749321_37988461|GENSCAN_predicted_peptide_7|1015_aa MGRRPVLVGVGSALEAELLLGGARLRRGRRRRDARSHDRPTMRRAGSAELGQGRGAPERG HRGPASAPPLACRSAPELGAAAAAGNRARAAAAVPAKPGPRSQSRSRAGRGVMAGPRGAL LAWCRRQCEGYRGVEIRDLSSSFRDGLAFCAILHRHRPDLLDFDSLSKDNVFENNRLAFE VAEKELGIPALLDPNDMVSMSVPDCLSIMTYVSQYYNHFCSPGQAPTPVEPEDVAQGEEL SSGSLSEQGTGQTPSSTCAACQQHVHLVQRYLADGRLYHRHCFRCRRCSSTLLPGAYENG PEEGTFVCAEHCARLGPGTRSGTRPGPFSQPKQQHQQQLAEDAKDVPGGGPSSSAPAGAE ADGPKASPEARPQIPTKPRVPGKLQELASPPAGRPTPAPRKASESTTPAPPTPRPRSSLQ QENLVEQAGSSSLVNGRLHELPVPKPRGTPKPSEGTPAPRKDPPWITLVQAEPKKKPAPL PPSSSPGPPSQDSRQVENGGTEEVAQPSPTASLESKPYNPFEEEEEDKEEEAPAAPSLAT SPALGHPESTPKSLHPWYGITPTSSPKTKKRPAPRAPSASPLALHASRLSHSEPPSATPS PALSVESLSSESASQTAGAELLEPPAVPKSSSEPAVHAPGTPGNPVSLSTNSSLASSGEL VEPRVEQMPQASPGLAPRTRGSSGPQPAKPCSGATPTPLLLVGDRSPVPSPGSSSPQLQV KPHTENLGIAWAQYFPLGTQFMRALSCSHSAFGWPNALQASELPRLIWPQSSCKENPFNR KPSPAASPATKKATKGSKPVRPPAPGHGFPLIKRKVQADQYIPEEDIHGEMDTIERRLDA LEHRGVLLEEKLRGGLNEGREDDMLVDWFKLIHEKHLLVRRESELIYVFKQQNLEQRQAD VEYELRCLLNKPEKDWTEEDRAREKVLMQELVTLIEQRNAIINCLDEDRQREEEEDKMLE AMIKKKGEALAGDEAEFQREAEPEGKKKGKFKTMKMLKLLGNKRDAKSKSPRDKS >gi568815576f:37749321_37988461|GENSCAN_predicted_CDS_7|3048_bp atggggcgccggcccgtcttggtaggggtgggctccgcactggaggcggagctgctgctg ggtggggcgcgcctccggcgcggacggaggcggcgggacgcccgctcccacgaccggccc acaatgaggcgagcgggcagcgcggagttagggcagggccgcggggcgcccgagcgagga caccgcggccccgcctccgcccctcccctcgcctgccggtcggcgcccgagctcggagcc gcagccgcagccggaaaccgggcccgcgcggcggccgccgtcccggccaagccggggccc cgaagccagagccggagccgggcgggccgcggggtcatggctgggccgcggggcgcgctg ctggcctggtgccgccgccagtgcgagggctaccgcggcgtggagatccgcgacctgagc agctccttccgggacggcctggccttctgcgccatcctgcaccggcaccggcccgacctg ctagattttgattcgctttccaaggacaatgtcttcgagaataaccgtttggcctttgaa gtggctgagaaggagctggggatccccgctctcctggaccccaatgacatggtctccatg agcgtccctgactgcctcagcatcatgacctatgtgtcccagtattacaaccacttctgc agtcctggccaagcacccactccagtggaaccagaagatgtggctcagggcgaggagctc tcctcaggcagcctgtcagagcagggcaccggccagacccccagcagcacgtgcgcagcc tgccagcagcatgtgcacttggtgcagcgctacctggctgacggcaggctgtaccatcgc cactgcttccggtgtcggcggtgctccagcaccctgctccctggggcttatgagaatggg cctgaggagggcacctttgtgtgtgcagaacactgtgccaggctgggcccggggacacgg tcggggaccaggcctgggcccttctcacagccaaagcagcagcaccagcagcaactcgca gaagatgccaaggatgttccaggaggcggccccagctccagtgctcctgcaggggctgag gccgatggacccaaggccagccctgaggcccggccgcagatccctaccaagccccgggtt cctggcaaactacaggagctggccagcccccctgcgggccgccccacccctgcccccagg aaggcctctgagagcaccaccccagcaccccccacgccccggccccgctccagtctgcag caggagaacctggtggagcaggctggcagcagcagcctggtgaacgggagactgcacgaa ctgcctgtccccaagccgagggggacaccgaagccgtccgaggggacaccagcccccagg aaggaccccccatggatcacgctggtgcaggcagaaccaaagaagaagccagccccactt cccccaagcagcagcccggggccaccaagccaggacagcaggcaggtggagaatggaggc accgaggaggtggcccagccgagcccaacggccagcctggagtccaaaccctataacccc tttgaggaggaggaggaggacaaggaggaagaggctccagctgcacccagcctggccacc agccctgccctgggccacccggagtccacacccaagtccctgcacccctggtacggcatc acccctaccagcagccccaagacaaagaagcgccctgccccgcgcgcacccagcgcgtcc ccactggctctccacgcctcccgcctctcgcactcggagccgccctcggccacaccatcg ccagcgctcagcgtggagagcctgtcgtctgagagcgccagccagactgcaggtgcagag cttctggagccgccagctgtgcccaagagctcctcagagcctgctgtccatgcccctggt acccctggaaaccctgtcagcctctctaccaactcctccctggcctcctctggggaacta gtggagcctagagtggaacaaatgcctcaagccagccctggccttgcccccaggaccagg ggcagctcaggtccccagccagccaagccctgcagtggcgccaccccaacgcctctcttg ttggttggagacaggagcccggtgccttcccctggaagctcgtccccacagctgcaggta aagccccatactgagaacctggggattgcctgggctcagtactttcctctgggcacccag ttcatgagggccctgtcttgcagccacagcgcctttggatggcccaacgctctgcaggcc tctgagctcccaaggctcatctggccccagtcctcctgcaaggagaatccttttaaccgg aagccatcacctgcagcgtccccagccacaaagaaggccaccaagggatccaagccagtg aggccacctgcccctggacacggctttccactcatcaaacgcaaggtccaggctgaccag tacatccctgaggaggacatccatggagagatggataccattgagcgccggctggatgcc ctggagcaccgtggggtgctgctggaggagaagctgcgtggcggcctgaatgagggccgt gaggatgacatgctggtggactggttcaagctcatccacgagaagcacctactggtgcgg cgagagtccgagctcatctatgtcttcaagcagcagaacctggagcagcgccaggctgat gtcgagtatgagctccggtgcctcctcaataagccagaaaaggactggacggaggaggac cgggcccgggagaaggtgctgatgcaggagcttgtgaccctcattgagcagcgcaacgct atcatcaactgcctggatgaggaccggcagagggaggaagaggaagacaagatgttggaa gccatgatcaagaagaaaggtgaggcccttgctggggatgaggctgagttccagagggag gctgaacctgagggcaagaagaaggggaagttcaagaccatgaagatgttgaaactgcta ggaaacaaacgtgatgccaagagcaagtcccccagagacaagagctaa >gi568815576f:37749321_37988461|GENSCAN_predicted_peptide_8|270_aa MAVLTLPPEETSLPQALGQNEDCSPPMASQKQMEVVTKGTGFRRRPKTITYTPGTCELLR GQSVKAAIPAFLAPGTGFVEDNFSTDGVMMKESKLTNIQQRHIMDIMKRGDALPLQCSPT SSQRVLPSKQIASPIYLPPILAARPHLRPANMCQANGAYSREQFKPQATRDLEKEKQRLQ NIFATGKDMEERKRKAPPARQKAPAPELDRFEELVKEIQERKEFLADMEALGQGKQYRGI ILAEISQKLREMEDIDHRRSEELRKGLATT >gi568815576f:37749321_37988461|GENSCAN_predicted_CDS_8|813_bp atggctgtgttaacactccctcccgaggagacttccctgccacaggcgctgggtcagaat gaggactgtagtcctcccatggcttcacagaagcagatggaggtagtgaccaaaggaact gggttccggcgccgccccaagaccatcacttacaccccggggacctgcgagctgctcaga ggtcagtccgttaaagcagcgatcccagcctttttggcaccagggaccggtttcgtggaa gacaatttttccactgacggagtgatgatgaaggaatccaaactgacgaacatccagcag cgccacatcatggacatcatgaaaagaggagatgctttgcccctacagtgcagcccaaca tccagccagagagtcttaccttccaagcaaatagcctcgcccatctacctgcctcccatc ctcgcagcccgtccccacctccggcctgccaacatgtgtcaagccaatggggcctacagc cgggagcagttcaagcctcaagccaccagggatttggagaaggagaaacaaagactccaa aatatctttgccacagggaaggacatggaggaacggaaaagaaaggcccctcctgcacga cagaaggctccagcccctgagctagaccgatttgaagagctggtgaaggaaatccaggag aggaaagaattcctggctgacatggaggccctgggacagggcaaacagtaccgaggaatc atccttgctgaaatctcccagaaactccgggaaatggaagacattgaccacagaaggagt gaggaacttaggaagggtcttgccaccacttaa >gi568815576f:37749321_37988461|GENSCAN_predicted_peptide_9|127_aa MSDNEDNFDGDDFDDVEEDEGLDDLENAEEEGQENVEILPSGERPQANQKRITTPYMTKY ERARVLGTRALQIAMCAPVMVELEGETDPLLIAMKELKARKIPIIIRRYLPDGSYEDWGV DELIITD >gi568815576f:37749321_37988461|GENSCAN_predicted_CDS_9|384_bp atgtcagacaacgaggacaattttgatggcgacgactttgatgatgtggaggaggatgaa gggctagatgacttggagaatgccgaagaggaaggccaggagaatgtcgagatcctcccc tctggggagcgaccgcaggccaaccagaagcgaatcaccacaccatacatgaccaagtac gagcgagcccgcgtgctgggcacccgagcgctccagattgcgatgtgtgcccctgtgatg gtggagctggagggggagacagatcctctgctcattgccatgaaggaactcaaggcccga aagatccccatcatcattcgccgttacctgccagatgggagctatgaagactggggggtg gacgagctcatcatcaccgactga >gi568815576f:37749321_37988461|GENSCAN_predicted_peptide_10|692_aa XPPQRLRLSRSVSLGLSGQGGWCPPASPPSPELDRTPWDTVFHFLRTSPRLEERSEEVGV GLFARTPAAGPGEAAEAAAAAAGGDMAEEQDLSEVELSPVGSEEPRCLSPGSAPSLGPDG GGGGSGLRASPGPGELGKVKKEQQDGEADDDKFPVCIREAVSQVLSGYDWTLVPMPVRVN GASKSKPHVKRPMNAFMVWAQAARRKLADQYPHLHNAELSKTLGKLWRAGPGLSDQPAVW TQTAQFCPRAEVTASSSTQGPSRSGWGLAQASIFSARGQRASEKQTTRRDIQIGQTLGTM RRPQYEPNESQSKDGPHFPSISHCQGFKIGSSIVPVPRFPQPRTLGLDSLQEDLFLAITQ AVCTGASLRLLNESDKRPFIEEAERLRMQHKKDHPDYKYQPRRRKNGKAAQGEAECPGGE AEQGGTAAIQAHYKSAHLDHRHPGEGSPMSDGNPEHPSGQSHGPPTPPTTPKTELQSGKA DPKRDGRSMGEGGKPHIDFGNVDIGEISHEVMSNMETFDVAELDQYLPPNGHPGHVSSYS AAGYGLGSALAVASGHSAWISKPPGVALPTVSPPGVDAKAQVKTETAGPQGPPHYTDQPS TSQIAYTSLSLPHYGSAFPSISRPQFDYSDHQPSGPYYGHSGQASGLYSAFSYMGPSQRP LYTAISDPSPSGPQSHSPTHWEQPVYTTLSRP >gi568815576f:37749321_37988461|GENSCAN_predicted_CDS_10|2079_bp nnccctccgcagcggctcaggctcagtcgctcagtcagtctcgggctgtccggccagggt ggttggtgcccaccagcgtcacctcccagccccgagctggaccgcacaccttgggacacg gttttccacttcctaaggacgagccccagactggaggagaggtccgaggaggtgggcgtt ggactctttgcgaggaccccggcggctggcccgggggaggcggccgaggcggcggcggcg gcggccgggggcgacatggcggaggagcaggacctatcggaggtggagctgagccccgtg ggctcggaggagccccgctgcctgtccccggggagcgcgccctcgctagggcccgacggc ggcggcggcggatcgggcctgcgagccagcccggggccaggcgagctgggcaaggtcaag aaggagcagcaggacggcgaggcggacgatgacaagttccccgtgtgcatccgcgaggcc gtcagccaggtgctcagcggctacgactggacgctggtgcccatgcccgtgcgcgtcaac ggcgccagcaaaagcaagccgcacgtcaagcggcccatgaacgccttcatggtgtgggct caggcagcgcgcaggaagctcgcggaccagtacccgcacctgcacaacgctgagctcagc aagacgctgggcaagctctggagggcaggcccagggctcagcgaccagcccgctgtgtgg acccagacagcccagttctgcccaagagcagaggtcacagctagctcctccacccaaggt ccatccagatctgggtggggcttggcccaggccagtatctttagtgccaggggacaaaga gcaagtgagaaacagacaaccaggagagacatacagatagggcagactctggggaccatg agaagaccacagtacgagcccaacgagagtcaaagcaaggacggacctcattttccctcc atctcccattgccaagggtttaagattggctcctccattgtccctgtccccaggttcccc cagcccagaaccctgggcttggactcccttcaggaggacttattcttggccatcactcaa gctgtgtgcacaggggccagcctgaggctgctgaacgaaagtgacaagcgccccttcatc gaggaggctgagcggctccgtatgcagcacaagaaagaccacccggactacaagtaccag cccaggcggcggaagaacgggaaggccgcccagggcgaggcggagtgccccggtggggag gccgagcaaggtgggaccgccgccatccaggcccactacaagagcgcccacttggaccac cggcacccaggagagggctcccccatgtcagatgggaaccccgagcacccctcaggccag agccatggcccacccacccctccaaccaccccgaagacagagctgcagtcgggcaaggca gacccgaagcgggacgggcgctccatgggggagggcgggaagcctcacatcgacttcggc aacgtggacattggtgagatcagccacgaggtaatgtccaacatggagacctttgatgtg gctgagttggaccagtacctgccgcccaatgggcacccaggccatgtgagcagctactca gcagccggctatgggctgggcagtgccctggccgtggccagtggacactccgcctggatc tccaagccaccaggcgtggctctgcccacggtctcaccacctggtgtggatgccaaagcc caggtgaagacagagaccgcggggccccaggggcccccacactacaccgaccagccatcc acctcacagatcgcctacacctccctcagcctgccccactatggctcagccttcccctcc atctcccgcccccagtttgactactctgaccatcagccctcaggaccctattatggccac tcgggccaggcctctggcctctactcggccttctcctatatggggccctcgcagcggccc ctctacacggccatctctgaccccagcccctcagggccccagtcccacagccccacacac tgggagcagccagtatatacgacactgtcccggccctaa