GENSCAN 1.0    Date run: 3-Nov-116    Time: 05:14:36

Sequence gi568815579r:42768101_42979581 : 211481 bp : 42.91% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.02 PlyA -    922    917    6                               1.05
 1.01 Sngl -   4234   2765 1470  1  0   66   43   539 0.719  42.50
 1.00 Prom -   4695   4656   40                              -6.55

 2.02 PlyA -   4863   4858    6                               1.05
 2.01 Sngl -   6107   5658  450  2  0   88   53   364 0.848  28.86
 2.00 Prom -   8596   8557   40                              -8.05

 3.04 PlyA -   9349   9344    6                               1.05
 3.03 Term -   9828   9599  230  1  2   68   48   236 0.116  13.41
 3.02 Intr -  17947  17611  337  1  1  -12  119   214 0.538   8.67
 3.01 Init -  18579  18100  480  0  0   52   57   240 0.401  12.72
 3.00 Prom -  22420  22381   40                              -6.75

 4.04 PlyA -  22800  22795    6                               1.05
 4.03 Term -  24514  24249  266  2  2   -3   48   266 0.457   8.19
 4.02 Intr -  29042  28935  108  0  0   80   87    19 0.294   0.34
 4.01 Init -  31908  31842   67  0  1   45   90    47 0.410   1.89
 4.00 Prom -  33520  33481   40                              -2.95

 5.02 PlyA -  33917  33912    6                               1.05
 5.01 Sngl -  40111  37934 2178  1  0   70   48   562 0.887  44.15
 5.00 Prom -  40442  40403   40                             -11.54

 6.00 Prom +  40489  40528   40                              -6.05
 6.01 Init +  41348  41733  386  1  2   88   44   429 0.777  34.86
 6.02 Term +  41812  42364  553  1  1  -48   43   286 0.530   3.30
 6.03 PlyA +  42588  42593    6                               1.05

 7.02 PlyA -  43210  43205    6                               1.05
 7.01 Sngl -  49555  49253  303  1  0   43   32   251 0.887  10.58
 7.00 Prom -  50090  50051   40                              -9.25

 8.06 PlyA -  50638  50633    6                               1.05
 8.05 Term -  51824  51702  123  2  0   79   32   113 0.673   2.20
 8.04 Intr -  53069  52944  126  2  0    4  109   141 0.727   7.76
 8.03 Intr -  54839  54695  145  1  1   93   72   139 0.848  12.16
 8.02 Intr -  55715  55420  296  2  2   76   19   209 0.778   7.68
 8.01 Init -  58562  58467   96  2  0   48   66    92 0.604   3.36
 8.00 Prom -  66153  66114   40                              -1.65

 9.16 PlyA -  66385  66380    6                               1.05
 9.15 Term -  76513  76274  240  1  0   94   48   120 0.462   3.64
 9.14 Intr -  76885  76763  123  1  0   66   57    72 0.465   1.76
 9.13 Intr -  77194  77049  146  2  2   72   65   114 0.698   6.58
 9.12 Intr -  77863  77819   45  2  0  124  108    -1 0.865   2.96
 9.11 Intr -  79522  79268  255  2  0  117  105    41 0.927   5.19
 9.10 Intr -  80201  79902  300  0  0   71   39   179 0.262   7.08
 9.09 Intr -  84318  84229   90  1  0   62   42    98 0.110   1.65
 9.08 Intr -  86025  85849  177  1  0   19  103   127 0.208   6.27
 9.07 Intr -  87631  87456  176  0  2   41   94   108 0.132   5.36
 9.06 Intr -  88978  88761  218  1  2   61  -12   178 0.096   1.48
 9.05 Intr -  90421  90298  124  0  1   38   84    70 0.227   1.27
 9.04 Intr -  91338  91270   69  2  0   74   60   107 0.163   3.88
 9.03 Intr -  92156  92028  129  1  0   31   72   116 0.054   3.19
 9.02 Intr -  94486  94203  284  1  2   21   31   269 0.189   9.79
 9.01 Init -  94818  94720   99  0  0   61   40   134 0.731   6.21
 9.00 Prom -  97034  96995   40                              -8.05

10.33 PlyA -  97796  97791    6                               1.05
10.32 Term -  98200  98046  155  2  2  102   48    84 0.916   2.90
10.31 Intr - 100255 100001  255  2  0   97  119   153 0.748  15.79
10.30 Intr - 100934 100656  279  0  0   72   65   356 0.995  28.23
10.29 Intr - 101603 101559   45  0  0  114  108    -1 0.762   1.96
10.28 Intr - 103945 103667  279  2  0   60   75   216 0.187  14.03
10.27 Intr - 110178 109813  366  1  0   85  103   250 0.982  20.39
10.26 Intr - 111589 111418  172  1  1    5   94   139 0.669   4.79
10.25 Intr - 112874 112568  307  1  1   10   71   136 0.075  -0.37
10.24 Intr - 117841 117653  189  0  0   47   42   178 0.047   6.88
10.23 Intr - 120741 120601  141  2  0   44   17   131 0.008   0.15
10.22 Intr - 127899 127747  153  2  0   77   23    94 0.013   0.07
10.21 Intr - 132605 132416  190  0  1   26   17   211 0.401   5.52
10.20 Intr - 139076 138822  255  0  0  112  119   136 0.958  15.59
10.19 Intr - 139837 139476  362  0  2   38   65   431 0.534  29.74
10.18 Intr - 141306 141122  185  0  2  -15   81   220 0.432   8.66
10.17 Intr - 142758 142523  236  1  2   72   81   179 0.130  12.08
10.16 Intr - 143188 143097   92  0  2   56   80    16 0.382  -3.68
10.15 Intr - 147603 147306  298  1  1   90   68   192 0.842  12.41
10.14 Intr - 148387 148025  363  2  0   78   74   272 0.943  19.03
10.13 Intr - 157927 157673  255  2  0  109  119   134 0.762  15.09
10.12 Intr - 158616 158338  279  1  0   66   65   360 0.998  28.03
10.11 Intr - 159285 159241   45  1  0  110  108    -1 0.744   1.56
10.10 Intr - 161023 160688  336  2  0   54  119    70 0.436   1.27
10.09 Intr - 161620 161342  279  2  0   72   75   207 0.490  14.33
10.08 Intr - 162050 161959   92  1  2   51   61    66 0.375  -1.08
10.07 Intr - 166782 166587  196  1  1   85   69   100 0.876   5.35
10.06 Intr - 167669 167304  366  0  0   82   84   303 0.634  23.49
10.05 Intr - 169897 169839   59  0  2   93   37    47 0.212  -2.29
10.04 Intr - 174924 174787  138  2  0  148   48    65 0.230   7.16
10.03 Intr - 178703 178624   80  2  2   74   71    54 0.116  -0.27
10.02 Intr - 181739 181365  375  2  0   -7   72   185 0.120   1.69
10.01 Init - 182367 181894  474  0  0   62   57   224 0.288  12.28
10.00 Prom - 189060 189021   40                              -2.95

11.03 PlyA - 189456 189451    6                               1.05
11.02 Term - 198375 198270  106  2  1   72   39   130 0.667   3.40
11.01 Intr - 201431 201369   63  1  0   69   75    64 0.243   0.11

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815579r:42768101_42979581|GENSCAN_predicted_peptide_1|489_aa MKAEIKMFFETNENKNTPYQNLWDAFKAVCTGKFIARNPHKRKQERSKIDTLTSQLKELE KQEQTHSKASRRQEITKIRAELKEIETQKTLQKINESRSWFFERINKIDRLLARLIKKKR EKNQIDAIKNDKGVITSDSIEIKTTIREYYKHIYANKLENLEEMDKFLDTYTLRRLNQEE VESLNRPITGSEIVAIINCLPNKKSPGPDGFTAEFYHTYKQELVPFFLKLFQSIEKEGIL PNSFHEASIILIPKPGRDTTKKENFRPISLINFGAKILNKILANQIQQHIKKLIHHDQVG FIPGMQGWFNTRKSINAIQHINRTKDKNHMIISIDAEKAFDKIQQCFMLKTLHKLGIDGT YLKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQEKEI KGIQLGKEEVKLSLFADDGIVYLENPIVSAQNLLKLISNFSSLRIQNQSTKTTSILIHQQ QTNREPNHQ >gi568815579r:42768101_42979581|GENSCAN_predicted_CDS_1|1470_bp atgaaggcagaaataaagatgttctttgaaaccaacgagaacaaaaacacaccataccag aatctctgggatgcattcaaagcagtgtgtacagggaaatttatagcacgaaatccccac aagagaaagcaggaaagatccaaaattgacaccctaacttcacaattaaaagaactagaa aagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaaaatcagagca gaactgaaggaaatagagacacaaaaaacccttcaaaaaattaatgaatccaggagctgg ttttttgaaaggatcaacaaaattgatagactgctagcaagactaataaagaagaaaaga gagaagaatcaaatagacgcaataaaaaatgataaaggggttatcaccagcgattccata gaaataaagactaccatcagagaatactacaagcacatctatgcaaataaactagaaaat
ctagaagaaatggataaattcctcgacacatacaccctccgaagactgaaccaggaagaa gttgaatctctgaatagaccaataacaggatctgaaattgtggcaataatcaattgctta ccaaacaaaaagagtccaggaccagatggattcacagccgaattttaccacacgtacaag caggaactggtaccattctttctgaaactattccaatcaatagaaaaagagggaatcctc cctaactcatttcatgaggccagcatcatcctgataccaaagccaggcagagacacaacc aaaaaagagaatttcagaccaatatccttgatcaactttggtgcaaaaatcctcaataaa atactggcaaaccaaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggc ttcatccctgggatgcaaggctggttcaatacacgcaaatcaataaatgcaatccagcat ataaacagaaccaaagacaagaaccacatgattatctcaatagatgcagaaaaggccttt gacaaaattcaacaatgcttcatgctaaaaactctccataaattaggtattgatgggaca tatctcaaaataataagagctatctatgacaaacccacagccaatatcatactgaatggg caaaaactggaagcattccctttgaaaactggcacaagacagggatgccctctctcacca ctcctattcaacatagtgttggaagttctggccagggcaattaggcaggagaaggaaata aagggtatccaattaggaaaagaggaagtcaaattgtccctgtttgcagacgatgggatt gtatatctagaaaaccccattgtctcagcccaaaatctccttaagctgataagcaacttc agcagtctcaggatacaaaatcaatctacaaaaaccacaagcattcttatacaccaacaa cagacaaacagagagccaaatcatcagtga >gi568815579r:42768101_42979581|GENSCAN_predicted_peptide_2|149_aa MGKKHSRKNANSKKQSASPPPKERSSSPETEQSWTENDFDELREEGFRQSNYSKLWEDIQ TKGKEVENFEKNLEECIIRITNTEKCLKELMELKTKARELREECRSLRSRCDQLEERVSV MEDEMNEMKREWKFREKRIKRNEQSLQEI >gi568815579r:42768101_42979581|GENSCAN_predicted_CDS_2|450_bp atggggaaaaaacacagcagaaaaaatgcaaactctaaaaagcagagtgcctctcctcct ccaaaggaacgcagttcctcaccagaaacggaacaaagctggacggagaacgactttgac gagctgagagaagaaggcttcagacaatcaaattactccaagctatgggaggacattcaa accaaaggcaaagaagttgaaaactttgaaaaaaatttagaagaatgtataattagaata accaatacagagaagtgcttaaaggagctaatggaactgaaaaccaaggctcgagaacta cgtgaagaatgcagaagcctcaggagccgatgcgatcaactggaagaaagagtatcagtg atggaagatgaaatgaatgaaatgaagcgagaatggaagtttagagaaaaaagaataaaa agaaatgaacaaagcctccaagaaatatga >gi568815579r:42768101_42979581|GENSCAN_predicted_peptide_3|348_aa MLMEKSVVPHRGIFSVICTANVPKPYVTSNPTEDEEAVVLTCEPDIHGTIYMWWVNGHRL TLSPRLKMSNDNRILALLSITRSDTGLYECQRKNVVSTSQIDPVTWMFSVSILFSFMDKA AGPNPHNMRPGLSVPLRFKYEDTYPWTPKLAMTSCPRKTWEHCAPFTDQELPIPSDNITC 
GIILFAPDGPDEPTTYSSDTYYYPGSNLNLSCLMGSNPSAEYSWLLNGNNQQIGQELFIP QVTTENSGDYLCYVHNPVTNGKNFATKKIRVPVTMPKKMTWTSSRYAAIPVYPSPAAGPY QPKAQIFSCRAEREHGIPSITYLLPVHCSGYRHRPCTREDTHLLTHGF >gi568815579r:42768101_42979581|GENSCAN_predicted_CDS_3|1047_bp atgctcatggagaaatcagtggtgccacacaggggaatcttctctgttatctgcacagcg aatgttcccaagccctatgtcaccagcaaccccacagaggatgaggaggctgtggtctta acctgtgaacctgacattcatggcacaatctacatgtggtgggtaaatggtcatagactc acactcagtcccaggctaaagatgtccaatgacaacaggatccttgctctactcagtatc acaaggagtgacacaggactctatgaatgtcaaaggaagaatgtagtgagtaccagccaa attgacccagtcacctggatgttctctgtgagtatcctcttttccttcatggacaaggct gccggcccaaatccacataacatgaggccaggcctctcagtccctctcaggttcaagtat gaagacacttacccctggacccccaagctggccatgacttcctgccccagaaaaacctgg gaacactgtgcccctttcacagaccaggagcttcccattccctctgataacattacctgt ggcattattctctttgctccagatggcccagatgaacccacaacttattcttcagacacc tattactatccagggtcaaacctcaacctctcctgcctcatgggctctaacccatcagca gagtattcttggctgctgaatgggaataaccagcaaataggacaagagctctttatcccc caagtcactactgagaatagtggggactatctgtgttatgtccataacccagtcactaat ggcaaaaacttcgcaaccaagaaaatcagagtccctgtaacaatgcctaagaagatgaca tggacttcgtcccgatatgcagccattcctgtgtacccttcccctgctgcagggccgtac cagcccaaggcccaaatcttcagctgcagagctgagagagaacatgggatacccagcatc acttaccttcttccagtccactgcagtggctaccggcatcgcccatgtacccgtgaggac acccatctgctgacccacggtttctaa >gi568815579r:42768101_42979581|GENSCAN_predicted_peptide_4|146_aa MSHSELTNMFDTITTDTQTENQATVSFYLVVYQCATLALFLQLPLYKMSSSTIGFLRDYN PDRYIEAFQNLSQVFHLTWKDVMLLLNQTLTVTEKQPALQAPENFEDEQHISYNTPKGKE GDTESEEIAETPFQIRSEAVPLGNPD >gi568815579r:42768101_42979581|GENSCAN_predicted_CDS_4|441_bp atgagtcactcggagctcaccaatatgtttgacaccataactacagacactcaaactgaa aaccaggcaacagtgagtttttatctggttgtttaccagtgtgcaacccttgctctcttc ctccagctcccattatataagatgtcttcctccaccattggctttctccgtgattacaat cctgataggtatatagaagctttccagaatttaagtcaggtgtttcacctcacatggaag gatgttatgctgctcctaaaccaaaccctaactgtaactgaaaaacagccagctctacaa gcaccagagaattttgaagatgagcaacatatctcctataatacaccaaaagggaaggaa 
ggagatacagaaagtgaagaaatagcagaaacaccattccaaataagaagtgaagcagta cctcttggcaaccctgattga >gi568815579r:42768101_42979581|GENSCAN_predicted_peptide_5|725_aa MDKFLDTYTLPRLNQEEVESLNRPITGAEIVAIINSLPTKKSPGPDGFTAEFYQRYKEEL VPFLLKLFQSIEKERLLPNSFYKASIILIPKPGRDTTKKENFRPISLINIDAKILNKILA NQIQQHIKKLIHHDQVGFIPGIQGWFNICKSINVIQHINRTKDKNHMIISIDAEKAFDKI QQPFMLKNLHKLGIDGTYLKIIRAIYEKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLF NIVLEVLARAIRQEKEIKGIQLGKEEFKLSLFADDMIVSLENPIVSAQNLLKLISNFSKV SGFKINVQKSQAFLYTNNRQTESQIMSEIPFTIASKRIKYLGIQLTRDVKDLFKENYKPL LGEIKEDTNKWKNIPRSWVGRINIVKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFI WNQKRAHIAKSILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEIT LLIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLTLYTKINSRWIKDLH VRPKTIKTLEENLGITIQDIGMGKDFMYKTPKAMAAKDKIDKWDLIKLKSFCTAKETTIR VNRQPTKWEKIFTTYSSDKGLISRIYNELKQIYKKKTNNPIKKWAKDMNRHFSKADIYAA KKHMKKCSSSLAIREMQIKTTIRYHLTPVRMAIIKKSGNNRCRRGCGEIGRLLHCWWDCK LVQPL >gi568815579r:42768101_42979581|GENSCAN_predicted_CDS_5|2178_bp atggataaattccttgacacatacactctcccaagactaaaccaggaagaagttgaatct ctgaatagaccaataacaggagctgaaattgtggcaataatcaatagtttaccaaccaaa aagagtccaggaccagatggattcacagctgaattctaccagaggtacaaggaggaactg gtaccattccttctgaaactattccaatcaatagaaaaagagagactcctccctaactca ttttataaggccagcatcattctgataccaaagccgggcagagacacaaccaaaaaagag aattttagaccaatatccttgatcaacattgatgcaaaaatcctcaataaaatactggca aaccaaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggcttcatccct gggatacaaggctggttcaatatatgcaaatcaataaatgtaatccagcatataaacaga accaaagacaagaaccacatgattatctcaatagatgcagaaaaagcctttgacaaaatt caacagcccttcatgctaaaaaatctccataaattaggtattgatgggacatatctcaaa ataataagagctatctatgagaaacccacagccaatatcatactgaatgggcaaaaactg gaagcattccctttgaaaactggcacaagacagggatgccctctctcaccactcctattc aacatagtgttggaagttctggccagggcaattaggcaggagaaggaaataaagggtatt caattaggaaaagaggaattcaaattgtccctgtttgcagacgacatgattgtatctcta gaaaaccccattgtctcagcccaaaatctccttaagctgataagcaacttcagcaaagtc tcaggattcaaaatcaatgtacaaaaatcacaagcattcttatacaccaacaacagacaa acagagagccaaatcatgagtgaaatcccattcacaattgcttcaaagagaataaaatac 
ctaggaatccaacttacaagggatgtgaaggacctcttcaaggagaactacaaaccactg cttggggaaataaaagaggatacaaacaaatggaagaatattccacgctcatgggtagga agaatcaatatcgtgaaaatggccatactgcccaaggtaatttatagattcaatgccatt ccaatcaagctaccaatgactttcttcacagaattggaaaaaactactttaaagttcata tggaaccaaaaaagagcccacatcgccaagtcaatcctaagccaaaagaacaaagctgga ggcatcacactacctgacttcaaactatactacaaggctacagtaaccaaaacagcatgg tactggtaccaaaacagagatatagatcaatggaacagaacagagccctcagaaataacg ctgcttatctacaactatctgatctttgacaaacctgagaaaaacaagcaatggggaaag gattccctatttaataaatggtgctgggaaaactggctagccatatgtagaaagctgaaa ctggatcccttccttacactttatacaaaaatcaattcaagatggattaaagacttacat gttagacctaaaactataaaaaccctagaagaaaacctaggcattaccattcaggacata ggcatgggcaaggacttcatgtataaaacaccaaaagcaatggcagcaaaagacaaaatt gacaaatgggatctaattaaactaaagagcttctgcacagcaaaagaaactaccatcaga gtgaacaggcaacctacaaagtgggagaaaattttcacaacctactcatctgacaaaggg ctgatatccagaatctacaatgaactcaaacaaatttacaagaaaaaaacaaacaacccc atcaaaaagtgggcgaaggacatgaacagacacttctcaaaagcagacatttatgcagcc aaaaaacacatgaaaaaatgctcatcatcactggccatcagagaaatgcaaatcaaaacc acaataagataccatcttacaccagttagaatggcaatcattaaaaagtcaggaaacaac aggtgcaggagaggatgtggagaaataggaagacttttacactgttggtgggactgtaaa ctagttcaaccattgtga >gi568815579r:42768101_42979581|GENSCAN_predicted_peptide_6|312_aa MGKKQNRKTGNSKTQSASPPPKERSSSPATEQSWMENDFDELREEGFRRSNYSELREDVQ TKGKEVENFEKNLEECITRITNTEKCLKELMELKTKARELHEECRSLRSQCDQLEERVSA MEDEMNEMKPNLRLIGVPESDGENGTKLENTLQDIIQENSPNLARQAKVQIQEIQRTPQR YSSRRATPRHIIVRFTKVEMKEKMLRAAREKGRVTLKGKPIRLTADLLAETLQARREWGP IFNILKEKNFQPRISYPAQLSFISEGEIKYLRDKQILRDFVTTRPALKDLLKEALNMERN NWYQPLQNHAKM >gi568815579r:42768101_42979581|GENSCAN_predicted_CDS_6|939_bp atggggaaaaaacaaaacagaaaaactggaaactctaaaacacagagcgcctctcctcct ccaaaggaacgcagttcctcaccagcaacggaacaaagctggatggagaatgactttgac gagctgagagaagaaggcttcagacgatcaaattactctgagctacgggaggacgttcaa accaaaggcaaagaagttgaaaactttgaaaaaaatttagaagaatgtataactagaata accaatacagagaagtgcttaaaggagctgatggagctgaaaaccaaggctcgagaacta catgaagaatgcagaagcctcaggagccaatgcgatcaactggaagaaagggtatcagca 
atggaagatgaaatgaatgaaatgaaaccaaatctacgtctgattggtgtacctgaaagt gacggggagaatggaaccaagttggaaaacactctgcaggatattatccaggagaactcc cccaatctagcaaggcaggccaaagttcagattcaggaaatacagagaacgccacaaaga tactcctcgagaagagcaactccaagacacataattgtcagattcaccaaagttgaaatg aaggaaaaaatgttaagggcagccagagagaaaggtcgggttaccctcaaagggaagccc atcagactaacagcggatctcttggcagaaaccctacaagccagaagagagtgggggcca atattcaacattcttaaagaaaagaattttcaacccagaatttcatatccagcccaacta agcttcataagtgaaggagaaataaaatacttaagagacaagcaaatactgagagatttt gtcaccaccaggcctgccctaaaagacctcctgaaggaagcgctaaacatggaaaggaac aactggtaccagccactgcaaaatcatgccaaaatgtaa >gi568815579r:42768101_42979581|GENSCAN_predicted_peptide_7|100_aa MRVTCGLDRINAGTIGHQGQTGEGCTEEAEIMNVESLRSFSNQESQDMGLSLQGKMEPAG SHAKLFVSFEVWCDMRDSKRVDRGGEAFKLQMMKELRKES >gi568815579r:42768101_42979581|GENSCAN_predicted_CDS_7|303_bp atgagggtcacctgtggcttggacaggattaatgcagggaccattggacatcagggacaa actggagaaggttgcactgaagaagcagagatcatgaatgtagaaagcttgagaagtttt tccaatcaggaaagtcaagacatggggctgtcactacagggcaaaatggagcctgcaggg agtcatgcaaaattatttgtctcttttgaagtttggtgtgacatgagggattcaaagaga gttgacagaggaggggaagcttttaaactccagatgatgaaagagctcagaaaggaatct taa >gi568815579r:42768101_42979581|GENSCAN_predicted_peptide_8|261_aa MLEAAVHAPGNSDVTRTRCATQVLKPLVSYGQAETGFSVPQLKYPPWDGLLHRDIPPGTL PVEHSPCSLAALQFLNATSTHGECSAFARRKLESSFSQHSLQEGHRHGSRLSPSGAPVKD SGSRTAPGSRGGQQHPSQLLLCGRDDLFQASSGHGPFLLLFPELPSPRDALQLLREASKF PTDKRQGRSWSVAITNTAATSLEWVPFHAFAIPRSTLVSQQSPTANSPYDPSPPPPSSTA TSENSYYLNFTVTKISLGPVG >gi568815579r:42768101_42979581|GENSCAN_predicted_CDS_8|786_bp atgctggaggctgctgtgcacgccccaggtaactcagatgtcaccaggacacgatgtgcc actcaggtcctaaagcccttggtgtcctatggtcaggcagaaaccggcttctcagttcct cagctcaaatacccaccttgggacgggcttctgcaccgggacatcccgccaggaactctc ccggtggaacacagtccatgttctctggctgccctgcagtttttaaatgccaccagtaca catggggaatgttctgccttcgcaaggaggaagctggaaagttcattttcccagcactcc ttgcaggagggacaccgacacgggtccaggctttccccatcaggtgctcccgtgaaagac tcagggtcccgaacagcaccaggaagcagagggggccagcagcatccgagccagcttctt 
ctctgtggcagagatgatctttttcaggcctcctccggccatggccctttcctgctcctc tttccagagctcccaagtccccgggatgcactccagctgctaagagaggcctcgaagttc cctacggataaacgtcaaggtcgttcctggtctgtggccatcaccaacactgccgcgact agtctagaatgggtgccatttcatgcatttgcaattcccagaagtacactggttagtcag cagtctcccactgctaattctccatatgacccatccccaccaccaccatcctcaacagcc accagtgagaattcttactacctgaacttcactgtgacaaaaatcagccttggacctgtg ggctaa >gi568815579r:42768101_42979581|GENSCAN_predicted_peptide_9|824_aa MAGYSSETKLPEERSDSSIRSSQKSAVLQPPLLERSSSPAMEQSWMENDFHKLREEGFRC SNYSELQEEIQTKGKEVKNFEKNLDEGITRITNTEKCLKELMELKAKTRELREECRSLRS RCDQLEEGLNQEEVESLNRPITGSEIVAIINSLPTKKSPGPDGFTAEFYQRWELNNENTR TQEGEHHTLGTVVGTRDHKTTAPAGAQPAQVPQGSVNVRPWMACEATAIQPQRESAKPSA DTEQRLPEDPQPPRESVRHPVTCLPAAPRGAQSACDGHTQAPSGMWPPTPSCPLGRQTSP VCLLGEGGKRLSAEGGRTAQPTAVLRKFLEPRLISTEENTQAAETMGPLSAPPCTQHIKW KGVLLTDGQRITYGPAYSGRETVYSNASLLIQNVTREDGVSYTLHIIQRGDGTRGVTGNF TFTLYPTPPHATLKLFGHRRVLRAPWSWTEEMGAPVETPKPSISSSKLNPREIMEAVILI CDPATPNASYLWWMNGQNLPTTQRLQLSKTNRTLFIFGATKYTAGPYECEIRNPVSASRS DPFTLNLLRDYLLFLYGLDAPTISSSYTYYHTREVPKLSCLTDSHPLAEHSWLIDGKFQQ SAQVLFIPQITKTYRGVYVCFIHNSAIGGTNLIIKRIIVPDHSLHSALSLEVTGSTKLPK PYITINNSKPRENKDVLPFTCDPKSENYTYTWWLNGQSLPVSPRDTSLTLNDHKPVLSVF LRSKYRHLYFWTSELAMTPCPGKSWMVQTSPEFTLHSPITVQDKTFTCPASRNVTHRHST LGQLMGSFSNQNKSSLSPKLLQSIEGSMLALFVTQPLARKAPNP >gi568815579r:42768101_42979581|GENSCAN_predicted_CDS_9|2475_bp atggccgggtactcctctgagacaaaacttccagaggaacgatcagacagcagcattcgc agttcacaaaaatccgctgttctgcagccaccgctgctggaacgcagttcctcaccagca atggaacaaagctggatggagaatgactttcataagttgagagaagaaggcttcagatgt tcaaactactctgagctccaggaggaaattcaaaccaaaggcaaagaagttaaaaacttt gaaaaaaatttagatgaaggtataactagaataaccaatacagagaagtgcttgaaggag ctgatggagctgaaagccaagactcgagaactacgtgaagaatgcagaagcctcaggagc cgatgcgatcaactggaagaaggactaaaccaggaagaagttgaatctctgaatagacca ataacaggatctgaaattgtggcaataatcaatagcttaccaaccaaaaagagtccagga ccagatggattcacagccgaattctaccagaggtgggaactgaacaatgagaacacacga acacaggaaggggaacatcacactctggggactgttgtgggaaccagagaccataaaaca 
acagctccagcaggggctcagccagcacaggtcccacaaggcagcgtcaatgtcaggccc tggatggcatgtgaggccacagcaatacaaccacaacgtgaaagtgccaagcctagcgcc gacacagaacagcggctgcctgaggacccacagcctccccgggaatcagtcaggcaccct gtgacctgcctgcccgctgcaccccggggtgctcagagtgcgtgtgatggtcacacacag gcaccatccgggatgtggccacctacccccagctgtcccttgggaagacagacctctcct gtgtgcttgctgggagaaggggggaagaggctcagcgcagaaggaggaaggacagcacag cctacagccgtgctcaggaagtttctggaacctaggctcatctccacagaggagaacaca caagcagcagagaccatggggcccctctcagcccctccctgcacacagcacatcaaatgg aagggggtcctgctcacagacggtcaaagaattacatatgggcctgcatacagtggacga gaaacagtatattccaatgcatccctgctgatccagaatgtcacccgggaggacggagta tcctacaccttacacatcatacagcgaggtgatgggactagaggagtaactggaaatttc accttcaccttataccccaccccacctcatgcaactctgaagctctttggccaccggagg gttctcagggctccttggtcctggactgaggaaatgggggcacctgtggagactcccaag ccctccatctccagcagcaaattaaaccccagggagatcatggaggctgtgatcttaatc tgtgatcctgcaactccgaacgcaagctacctgtggtggatgaatggtcagaacctccct accactcaaaggttgcagctgtccaaaaccaacaggaccctctttatatttggtgccaca aagtatactgcaggaccctatgaatgtgaaatacggaacccagtgagtgccagccgcagt gacccattcaccctgaatctcctccgtgactatcttctgttcctctatggcctggatgcc cccaccatttcttcctcatacacctattaccatacaagggaagtccccaagctctcctgc ctcacagactctcacccactagcagagcattcttggctgattgatgggaagttccagcaa tcagcacaagtgttatttataccccaaatcaccaaaacatatagaggggtctatgtctgt ttcatccataactcagccattggtggaacaaatctcataatcaagaggatcatagtccct gatcattccttgcactctgctctatctttagaggtcactggttcaacgaagctgcccaag ccctacatcaccatcaacaactcaaaacccagggagaataaggatgtcttacccttcacc tgtgaccctaaaagtgagaactacacctacacgtggtggctaaatggtcagagcctccca gtcagtcccagggacaccagcttaactctaaatgaccacaagccagtcctctcagtcttt ctccggtccaagtatagacacctttacttctggacatccgagctggccatgactccctgc cctgggaaatcctggatggtccagacctccccagaatttacccttcattcacctattacc gttcaggacaaaacctttacttgtcctgcttcgcggaatgtaacccaccggcatagtact cttggacaattaatgggaagtttcagcaatcagaacaaaagctctttatcccccaaatta ctacaaagcatagagggctctatgcttgctctgttcgtaactcagccactggcaaggaaa gctccaaatccatga >gi568815579r:42768101_42979581|GENSCAN_predicted_peptide_10|2431_aa 
MEKSMMPHRGIFSVICTANLPKPYVTSNSAEDEEAVVLTCEPETHGTIYMWWVNSHRLTL SPRLKMSNDNRILALLSVTRSDKGLYERQRKNIVSTSHSDPVSWMFSVSILFSFMDKAAS PNPHNMRPGLSVPLRSKYEDTYPWTPKLAMTSCPRQTWEHYAPFTDQELPIPSDNITCGF ILFAPDGPDEPTTYSSDTCYYPGSNLNLSCLTGSNPSAEYSWLLNGNNQQIGQELFIPQV TTENSGDYLCYVHNPVTNGKNFATKKIRVPCKWITGALAICFQLHLSLGVSVVAAMNGLH GKDLTDRGPRCHCLSRHLFQLQRTAKPRPRHQRINKDFKTYCVQRRCPLLGTKPPREGVV TTGQSAGHIREDIKDASLLNFWNPPTTAQVTIEAQPPKVSEGKDVLLLVHNLPQNLTGYI WYKGQIRDLYHYVTSYIVDGQIIKYGPAYSGRETVYSNASLLIQNVTQEDTGSYTLHIIK RGDGTGGVTGRFTFTLYPVHQPGLSPQSLIWARTELSSPDTQSGEDRKTSFVGHQPTALG GLGHSIESLISPEAETEERRCTRMSLEREAEEKTYHRGQSKIETKKIPALNAPMETPKPS ISSSNFNPREATEAVILTCDPETPDASYLWWMNGQSLPMTHSLQLSETNRTLYLFGVTNY TAGPYECEIRNPVSASRSDPVTLNLLRSLWPFHRPGFYLPSDNITCDIILFAPHGLDAPT IFSSYTYYHTREVPKLPCLTDSHPLAEHSWLIDGKFQQSVQVFFIPQFTKTYRGVYVSFI HNSATGGTNLIIKRIIVPDHSLHSALSLEVTGSTKLPKPYITINNLNPRENKDVSTFTCE PKSENYTYIWWLNGQSLPVSPRVKRRIENRILILPSVTRNETGPYQCEIRDRYGGIRSDP VTLNVLYGPDLPRIYPSFTYYHSGQNLYLSCFADSNPPAQYSWTINGKFQLSGQKLSIPQ ITTKHSGLYACSVRNSATGKESSKSVTVRVSASLLNFWNLPTTAQVIIEAKPPKVSEGKD VLLLVHNLPQNLTGYIWYKGQMTDLYHYITSYVVHGQIIYGPAYSGRETVYSNASLLIQN VTQEDAGSYTLHIIKRGDGTGGVTGYFTVTLYCKAEEPVPGLGCGFFGRAYWDQGFTSCL RTVSPGAVHQPGLSPQSLIWARTELSSPDTQSGEDRKTSFVGHQPTALGGLGHSIESLIS PEAETEERRCTRMSLAREADEKTYLRGQSETETNKIPALHAPTETPKPSISSSNLNPREV MEAVRLICDPETPDASYLWLLNGQNLPMTHRLQLSKTNRTLYLFGVTKYIAGPYECEIRN PALLNDCHNTNTEKKKQPEKVESSDDIENSNQPFSHPKAFKNIRVQHGHYGIHRKLITKL ETWWEMPTLNEGCLWRNQRCHTGQSSVIHTAKLPMPYITINNLNPREKKDVLAFTCEPKS RNYTYIWWLNGQSLPVSPRVKRPIENRILILPSVTRNETGPYQCEIRDRYGGIRSNPVTL NVLYGPDLPRIYPSFTYYRSGENLDLSCFADSNPPAEYSWTINGKFQLSGQKLFIPQITT NHSGLYACSVRNSATGKEISKSMIVKVSEKQMSVALGKECDPYGEPINDPKSLTKLPPLS LNLIINAGVSSALAASQDQKAVTPLDTAFPILDMDETGNDHSQQTNTGTENQTLHVLTYK WELNNENTWTQGGEHHTLGPVEGHFKNMHPWAFATLFSYVGTHLVVTEDTLNIPKSEETE AQFDYPLMKWINAGTIGDQRQAGEGCTEEAEIMNAESLRSFSNQESQDMGLSLQGKTEPA GSHAKVFVSFEVCQAPCDLPARCTLGAQSACDGHTQAPSGMWPPNTSCPLGRQTSPVCLR EEGECYCSLISEHNSPPPALSGESNVPPDRLSGHCLFLLYTQQLGQVKTIRTDPWRLSTE 
RGRTAQLTAVLREFLDPRLISTEENTQAAETMGTLSAPPCTQRIKWKGLLLTASLLNFWN LPTTAQVTIEAEPTKVSEGKDVLLLVHNLPQNLTGYIWYKGQMRDLYHYITSYVVDGEII IYGPAYSGRETAYSNASLLIQNVTREDAGSYTLHIIKGDDGTRGVTGRFTFTLHLETPKP SISSSNLNPRETMEAVSLTCDPETPDASYLWWMNGQSLPMTHSLKLSETNRTLFLLGVTK YTAGPYECEIRNPVSASRSDPVTLNLLHHSLHSALSLEVTGSTKLPKPYITINNLNPREN KDVLNFTCEPKSENYTYIWWLNGQSLPVSPRVKRPIENRILILPSVTRNETGPYQCEIRD RYGGIRSDPVTLNVLYGPDLPRIYPSFTYYRSGEVLYLSCSADSNPPAQYSWTINEKFQL PGQKLFIRHITTKHSGLYVCSVRNSATGKESSKSMTVEVSGPYKPMAQIFSCRAETEHGI PGIPYLLPVHCNVYWHGPFTPEDTHLLTHSF >gi568815579r:42768101_42979581|GENSCAN_predicted_CDS_10|7296_bp atggagaaatcaatgatgccacacaggggaatcttctctgttatctgcacagccaatctg cccaagccctatgtcaccagcaactccgcagaggatgaggaggctgtggtcttaacctgt gaacctgagactcatggcacaatctacatgtggtgggtaaatagtcatagactcacactc agtcccaggctaaagatgtccaatgataacaggatccttgctctactcagtgtcacaagg agtgacaaaggactctatgaacgtcaaaggaagaatatagtgagcaccagccacagtgac ccagtctcctggatgttctctgtgagtatcctcttttccttcatggacaaggctgccagc ccaaatccacataacatgaggccaggcctctcagtccctctcaggtccaagtatgaagac acttacccctggacccccaagctggccatgacttcctgtcccaggcaaacctgggaacac tatgcccctttcacagaccaggagcttcccattccctctgataacattacctgtggcttt attctctttgctccagatggcccagatgaacccacaacttattcttcagacacctgttac tatccagggtcaaacctcaacctctcctgcctcacgggctctaatccatcagcagagtat tcttggctgctgaatgggaataaccaacaaataggacaagagctctttatcccccaagtc actactgaaaatagtggggactatctgtgttatgtccataacccagtcactaatggcaaa aacttcgcaaccaagaaaatcagagtcccttgtaagtggatcactggagcattggcaata tgctttcagttgcatctttctctcggtgtctcagttgtggcagccatgaatggcctccat ggcaaggacttgactgacagagggcccaggtgtcattgccttagcagacacctctttcag ctgcagcgaacagctaagccaagacccagacaccagaggataaacaaggattttaaaacc tactgtgtccaacggagatgcccacttctgggcaccaagccaccaagggaaggtgtggtg accacaggacaatcagctgggcacataagagaggacatcaaagatgcatcacttttaaac ttctggaacccgcccaccacagcccaagtcacgattgaagcccagccaccaaaagtttcc gaggggaaggatgttcttctacttgtccacaatttgccccagaatcttactggctacatc tggtacaaaggacaaatcagggacctctaccattatgttacatcatatatagtagacggt caaataattaaatatgggcctgcatacagtggacgagaaacagtatattccaatgcatcc 
ctgctgatccagaatgtcacccaggaagacacaggatcctacactttacacatcataaag cgaggtgatgggactggaggagtaactggacgtttcaccttcaccttataccctgttcac cagccagggctcagccctcagagcctcatctgggcaaggacagagctttcttcacctgac actcagagtggagaggacagaaagacaagctttgtaggccatcagccaactgccttagga ggcctaggacactccatagaaagtctaatatccccagaagcagaaacagaagagagaaga tgtaccaggatgtccttggaaagggaagctgaagagaaaacataccacagggggcaaagt aagattgaaactaagaagattccagcactgaatgctccaatggagactcccaaaccctcc atctccagcagcaatttcaaccccagggaggccacggaggctgtgattttaacctgtgat cctgagactccagatgcaagctacctgtggtggatgaatggtcagagcctccctatgact cacagcttgcagctgtctgaaaccaacaggaccctctacctatttggtgtcacaaactat actgcaggaccctatgaatgtgaaatacggaacccagtgagtgccagccgcagtgaccca gtcaccctgaatctcctccggtcactgtggcccttccacagaccaggattttaccttccc tctgacaatatcacctgtgacattattctctttgctccacatggcctggatgcccccacc attttttcctcatacacctattaccatacaagagaagtccccaagctcccctgcctcaca gattctcatccactggccgagcattcttggctgattgatgggaagttccagcaatcagta caagtgttctttataccccaattcactaaaacatatagaggggtctatgtctctttcatc cataactcagccactggtggaacaaatctcataatcaagaggatcatagtccctgatcat tccttgcactctgctctatctttagaggtcactggttcaacgaagctgcccaagccctac atcaccatcaataacttaaaccccagggagaataaggatgtctcaaccttcacctgtgaa cctaagagtgagaactacacctacatttggtggctaaatggtcagagcctcccggtcagt cccagggtaaagcgacgcattgaaaacaggatcctcattctacccagtgtcacgagaaat gaaacaggaccctatcaatgtgaaatacgggaccgatatggtggcatccgcagtgaccca gtcaccctgaatgtcctctatggtccagacctccccagaatttacccttcattcacctat taccattcaggacaaaacctctacttgtcctgctttgcggactctaacccaccggcacag tattcttggacaattaatgggaagtttcagctatcaggacaaaagctttctatcccccag attactacaaagcatagcgggctctatgcttgctctgttcgtaactcagccactggcaag gaaagctccaaatccgtgacagtcagagtctctgcatcacttttaaacttctggaacctg cccaccactgcccaagtaataattgaagccaagccacccaaagtttccgaggggaaggat gttcttctacttgtccacaatttgccccagaatcttactggctacatctggtacaaaggg caaatgacggacctctaccattacattacatcatatgtagtacacggtcaaattatatat gggcctgcctacagtggacgagaaacagtatattccaatgcatccctgctgatccagaat gtcacacaggaggatgcaggatcctacaccttacacatcataaagcgaggcgatgggact 
ggaggagtaactggatatttcactgtcaccttatactgtaaagctgaggagcctgtgcca ggactgggttgtggcttctttggcagggcttactgggaccaaggatttaccagctgtctg aggactgtgtctcctggagctgttcaccagccagggctcagtcctcagagcctcatctgg gcaaggacagagctttcttcacctgacactcagagtggagaggacagaaagacaagcttt gtaggccatcagccaactgccttaggaggcctaggacactccatagaaagtctaatatcc ccagaagcagaaacagaagagagaagatgcaccaggatgtccttggcaagggaagctgat gagaaaacatacctcagggggcaaagtgagactgaaactaacaagattccagcactgcat gccccaacggagactcccaagccctccatctccagcagcaacttaaaccccagggaggtc atggaggctgtgcgcttaatctgtgatcctgagactccggatgcaagctacctgtggttg ctgaatggtcagaacctccctatgactcacaggttgcagctgtccaaaaccaacaggacc ctctatctatttggtgtcacaaagtatattgcaggaccctatgaatgtgaaatacggaac ccagctcttcttaatgactgccacaacacaaacactgagaaaaagaagcaaccagaaaag gtggaaagttctgatgacatagaaaatagcaatcagcctttctcacatcccaaagccttc aaaaatatacgagtgcagcatggccactatggaattcaccgaaaactaatcaccaagcta gaaacatggtgggagatgccaactctgaatgaaggatgcctgtggaggaatcaaaggtgc cacacaggacaatcttctgttatccacacagcgaagctgcccatgccttacatcaccatc aacaacttaaaccccagggagaagaaggatgtgttagccttcacctgtgaacctaagagt cggaactacacctacatttggtggctaaatggtcagagcctcccggtcagtccgagggta aagcgacccattgaaaacaggatactcattctacccagtgtcacgagaaatgaaacagga ccctatcaatgtgaaatacgggaccgatatggtggcatccgcagtaacccagtcaccctg aatgtcctctatggtccagacctccccagaatttacccttcattcacctattaccgttca ggagaaaacctcgacttgtcctgctttgcggactctaacccaccggcagagtattcttgg acaattaatgggaagtttcagctatcaggacaaaagctctttatcccccaaattactaca aatcatagcgggctctatgcttgctctgttcgtaactcagccactggcaaggaaatctcc aaatccatgatagtcaaagtctctgaaaagcagatgtctgtagctctgggaaaggaatgt gacccttatggagagcctataaatgaccctaaatccctcactaaactacccccactctca ctaaacttaataataaatgctggtgtatccagtgcattggcagcatcgcaggaccagaag gcggtgacacccctggacacagctttccctatcttggacatggatgaaactggaaatgac cattctcagcaaactaacacaggaacagaaaaccaaacactgcatgttcttacttataag tgggagttgaacaatgagaacacatggacacagggaggggaacatcacacactggggcct gttgaggggcactttaagaacatgcacccatgggcttttgctacactcttttcttatgtt ggtacccaccttgtggtgactgaggatactcttaatatccccaagtctgaggaaactgag 
gcacaattcgattatccacttatgaagtggattaatgcagggaccattggagatcagaga caagctggagaaggttgcactgaagaagcagagatcatgaatgcagaaagcttgagaagt ttttccaatcaggaaagccaagacatggggctgtcactacagggcaaaacggagcctgca gggagtcatgcaaaagtatttgtctcttttgaagtttgccaggcaccctgtgacctgcct gcccgctgcaccctgggggctcagagcgcgtgtgatggtcacacacaggcaccatccggg atgtggccacctaacaccagctgtcccttgggaagacagacctctcctgtgtgcttgcgg gaagaaggggaatgttattgctctttgatctcagaacacaactcacccccacccgccctc tcaggtgagagcaatgtccctccagacaggctctctggccactgcctgttcctcctctac acacagcagcttggccaggtcaaaaccatcaggacagacccctggaggctcagcacagaa agaggaaggacagcacagctgacagccgtgctcagagagtttctggatcctaggcttatc tccacagaggagaacacacaagcagcagagaccatgggaaccctctcagcccctccctgc acacagcgcatcaaatggaaggggctcctgctcacagcatcacttttaaacttctggaac ctgcccaccactgcccaagtcacgattgaagccgagccaaccaaagtttccgaggggaag gatgttcttctacttgtccacaatttgccccagaatcttaccggctacatctggtacaaa gggcaaatgagggacctctaccattacattacatcatatgtagtagacggtgaaataatt atatatgggcctgcatatagtggacgagaaacagcatattccaatgcatccctgctgatc cagaatgtcacccgggaggacgcaggatcctacaccttacacatcataaagggagatgat gggactagaggagtaactggacgtttcaccttcaccttacacctggagactcctaagccc tccatctccagcagcaacttaaatcccagggagaccatggaggctgtgagcttaacctgt gaccctgagactccagacgcaagctacctgtggtggatgaatggtcagagcctccctatg actcacagcttgaagctgtccgaaaccaacaggaccctctttctattgggtgtcacaaag tatactgcaggaccctatgaatgtgaaatacggaacccagtgagtgccagccgcagtgac ccagtcaccctgaatctcctccatcattccttgcactctgctctatctttagaggtcact ggttcaacgaagctgcccaagccctacatcaccatcaacaacttaaaccccagggagaat aaggatgtcttaaacttcacctgtgaacctaagagtgagaactacacctacatttggtgg ctaaatggtcagagcctcccggtcagtcccagggtaaagcgacccattgaaaacaggatc ctcattctacccagtgtcacgagaaatgaaacaggaccctatcaatgtgaaatacgggac cgatatggtggcatccgcagtgacccagtcaccctgaatgtcctctatggtccagacctc cccagaatttacccttcattcacctattaccgttcaggagaagtcctctacttgtcctgt tctgcggactctaacccaccggcacagtattcttggacaattaatgaaaagtttcagcta ccaggacaaaagctctttatccgccatattactacaaagcatagcgggctctatgtttgc tctgttcgtaactcagccactggcaaggaaagctccaaatccatgacagtcgaagtctct 
gggccgtacaagcccatggcccaaatcttcagctgcagagctgagacagaacatgggata cctggcatcccttaccttcttccagtccactgcaatgtctactggcatggcccatttacc cctgaggacacccatctgctgacccatagtttctaa >gi568815579r:42768101_42979581|GENSCAN_predicted_peptide_11|56_aa XILASVALGSLNLVWAGGHGESSPETLSKADKTQLIPMTRDPPNQYSQITTFAAES >gi568815579r:42768101_42979581|GENSCAN_predicted_CDS_11|171_bp nncatcctggcctccgtggcattagggtcactgaacctcgtgtgggctggaggtcatggg gaaagctcaccggagacactcagcaaagctgacaaaacacaacttattcccatgacaaga gatcccccaaatcagtattcccaaattaccacatttgctgcagaatcatag