GENSCAN 1.0	Date run: 3-Nov-116	Time: 04:59:23

Sequence gi568815576f:40161830_40423238 : 261409 bp : 44.38% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.05 PlyA -     12      7    6                               1.05
 1.04 Term -  10562  10489   74  0  2   56   49    72 0.140  -1.83
 1.03 Intr -  24288  24123  166  0  1   92   89   172 0.759  17.23
 1.02 Intr -  28926  28805  122  1  2  112   -6   111 0.707   4.31
 1.01 Init -  34476  34413   64  0  1   82   94    49 0.938   4.36
 1.00 Prom -  45966  45927   40                              -1.96

 2.02 PlyA -  46929  46924    6                               1.05
 2.01 Sngl -  53804  53577  228  2  0   67   49   187 0.911   7.82
 2.00 Prom -  69388  69349   40                              -5.06

 3.04 PlyA -  69655  69650    6                               1.05
 3.03 Term -  77583  77450  134  1  2   62   45   156 0.823   6.95
 3.02 Intr -  84176  84112   65  1  2   65   89    50 0.586   1.16
 3.01 Init -  92018  91912  107  2  2   66   77    98 0.845   6.19
 3.00 Prom -  96786  96747   40                              -2.56

 4.00 Prom +  97011  97050   40                              -6.26
 4.01 Init +  98589  98596    8  2  2   88   81    11 0.307   0.12
 4.02 Intr + 100003 100157  155  1  2   64   40   202 0.508  12.72
 4.03 Intr + 100273 100344   72  2  0   46  110    51 0.475   2.48
 4.04 Intr + 102859 105207 2349  2  0   69  115   718 0.628  59.95
 4.05 Intr + 108293 108451  159  0  0   55   94    61 0.248   3.36
 4.06 Intr + 114625 114644   20  2  2   96   61    30 0.329  -2.57
 4.07 Intr + 116170 116215   46  0  1  120   75    49 0.941   4.78
 4.08 Intr + 118166 118314  149  0  2   85  113    76 0.984   9.75
 4.09 Intr + 123816 123941  126  2  0  110  115   -49 0.046   0.88
 4.10 Intr + 138626 138757  132  1  0   80  111    78 0.849  10.14
 4.11 Intr + 139081 139176   96  0  0   90   98   105 0.996  11.91
 4.12 Intr + 139321 139504  184  0  1   98  119   175 0.999  20.96
 4.13 Intr + 146683 146820  138  2  0   99   95    77 0.991   9.94
 4.14 Intr + 148988 149164  177  0  0   99   72   133 0.982  12.69
 4.15 Intr + 150676 150822  147  2  0   85   80   113 0.886  10.41
 4.16 Intr + 151073 151168   96  0  0   61   80    55 0.768   1.98
 4.17 Intr + 153454 153815  362  2  2   90   77    55 0.151  -0.46
 4.18 Intr + 159261 159400  140  2  2   63   84    70 0.183   3.36
 4.19 Intr + 161025 161209  185  0  2  112   60   164 0.822  15.43
 4.20 Intr + 172801 172912  112  2  1   41   48    94 0.006   0.14
 4.21 Intr + 184723 184882  160  1  1   39   97   220 0.041  17.99
 4.22 Intr + 188003 188206  204  1  0   90   17   219 0.661  14.40
 4.23 Intr + 192419 192498   80  1  2   71  100    -1 0.806  -2.25
 4.24 Intr + 197035 197206  172  1  1   86   61   185 0.846  15.55
 4.25 Intr + 197431 197477   47  0  2   72   92    38 0.999  -0.19
 4.26 Intr + 198573 198663   91  0  1   93  113    83 0.999  11.30
 4.27 Intr + 199444 199513   70  0  1   98   97   125 0.948  13.15
 4.28 Intr + 199659 199806  148  1  1   61   77   182 0.991  13.69
 4.29 Intr + 201152 201242   91  2  1   70   69    34 0.755  -0.30
 4.30 Intr + 202447 202536   90  0  0   89   72   188 0.999  17.49
 4.31 Intr + 203051 203227  177  1  0   78  115   107 0.982  12.52
 4.32 Intr + 208779 208859   81  2  0   37  109    64 0.819   3.23
 4.33 Intr + 208916 209092  177  1  0   43   66    91 0.641   2.52
 4.34 Intr + 209290 209346   57  0  0   92   82    25 0.472   1.38
 4.35 Term + 211517 211528   12  1  0  119   48    -3 0.276  -3.00
 4.36 PlyA + 212752 212757    6                               1.05

 5.00 Prom + 225686 225725   40                              -2.36
 5.01 Init + 229231 229313   83  0  2   87   82    89 0.986   8.64
 5.02 Intr + 235858 235951   94  1  1   99   68    36 0.634   2.67
 5.03 Intr + 240310 240376   67  0  1   73   92   111 0.992   8.48
 5.04 Intr + 242418 242626  209  1  2   79  110   140 0.998  14.10
 5.05 Intr + 242728 242835  108  0  0   85   94   245 0.997  25.28
 5.06 Intr + 243312 243455  144  2  0   16   89   241 0.998  17.48
 5.07 Intr + 243751 244015  265  0  1   69   87   390 0.821  34.09
 5.08 Intr + 244249 244394  146  2  2   88   42   277 0.906  23.10
 5.09 Intr + 244609 244833  225  0  0   31   37   433 0.957  30.68
 5.10 Intr + 245188 245739  552  0  0  109   41   689 0.759  59.35
 5.11 Intr + 245960 246014   55  1  1  105   59    87 0.955   6.05
 5.12 Intr + 246242 246291   50  0  2   99   78    97 0.726   8.20
 5.13 Intr + 246448 246600  153  0  0   53  107   144 0.996  13.07
 5.14 Intr + 246798 246868   71  2  2  119   93    26 0.999   4.18
 5.15 Intr + 246965 247013   49  2  1   75   90   103 0.996   7.88
 5.16 Intr + 247104 247189   86  2  2   79   89   113 0.998   9.12
 5.17 Intr + 247421 247543  123  2  0   87   93   133 0.972  13.40
 5.18 Intr + 247636 247696   61  1  1   91   90    28 0.813   2.04
 5.19 Term + 247853 247930   78  1  0   63   46   134 0.970   4.46
 5.20 PlyA + 248254 248259    6                               1.05

 6.08 PlyA - 248478 248473    6                               1.05
 6.07 Term - 250078 249561  518  2  2   53   36   475 0.889  33.28
 6.06 Intr - 255217 255157   61  1  1  130   76    95 0.967  10.81
 6.05 Intr - 255619 255512  108  1  0   73   94   114 0.441  10.98
 6.04 Intr - 257555 256545 1011  2  0  121   82  1100 0.998 103.60
 6.03 Intr - 258747 258576  172  2  1  116   83   152 0.999  17.45
 6.02 Intr - 259271 259018  254  2  2   92   30   534 0.999  44.23
 6.01 Init - 260862 260737  126  0  0   46   73    98 0.631   2.28

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  44958  44819  140  1  2   70   43   142 0.930   6.03
S.002 Term + 119367 119456   90  2  0  113   42    37 0.801  -0.68
S.003 Term - 128500 128301  200  2  2   33   41   189 0.804   6.26
S.004 Init + 184730 184882  153  1  0   71   97   221 0.930  21.28

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815576f:40161830_40423238|GENSCAN_predicted_peptide_1|141_aa
MVVEPVAPATLVAEMEGLLEPALFEKEKEQALLGQPLPQPADSIITPGQARELLAMALLI
TGSHGDLILSLTFDYHFHVEAFQVFISTSDLFLKLQIHTASCLLTSSIAKSHQILQFNSV
LSARDAAVKQKVLVYMELTSS

>gi568815576f:40161830_40423238|GENSCAN_predicted_CDS_1|426_bp
atggtggtggagcctgtagccccagctactctggtagctgagatggaaggactgcttgag
cccgcactcttcgaaaaggagaaggagcaggccttgctggggcagccactcccacaacca
gctgatagcattatcacaccagggcaggccagagagctgctggccatggccctcctgatt
acaggatcccatggtgacctcatcctctcccttaccttcgactatcacttccacgtggaa
gccttccaagtctttatctccacttctgacctctttctcaagctgcagatccacactgcc
agctgcctgctaaccagttccattgccaagtcccaccaaattctccaattcaactctgtg
ctaagtgctagggatgcagcagtgaagcagaaggttctcgtctatatggagctgacatcc
tcatag

>gi568815576f:40161830_40423238|GENSCAN_predicted_peptide_2|75_aa
MELWASVNTPEDAAKMDLQVLGKLQNGFRCIYRKELLLLWGKKALLKCPHGNRKQQVPSS
FSGHAVTLSCPVGTA

>gi568815576f:40161830_40423238|GENSCAN_predicted_CDS_2|228_bp
atggagctttgggccagtgtcaacacccctgaggatgcagcgaagatggacctccaagta
ttgggaaaactgcaaaatggattcaggtgcatctacaggaaggaactgctgctgctgtgg
ggcaaaaaagcattgctcaagtgccctcatgggaacaggaagcagcaagttccttcttcc
ttctccggccatgcagtcaccctctcctgccccgttggcactgcctaa

>gi568815576f:40161830_40423238|GENSCAN_predicted_peptide_3|101_aa
MSAEVREEEGDLQYQGGLTGNKDEDVAVNLDPTIESNIITEGYTSTFTIHFRSKLNACLK
ADDTGTGSIRAPEVTFCGMAAFTRVVTGTDRKAQTDLALFY

>gi568815576f:40161830_40423238|GENSCAN_predicted_CDS_3|306_bp
atgtcagctgaggtaagagaagaagaaggagatttacagtaccaaggtgggcttactggg
aataaagatgaagatgtggctgtaaacctagacccaacaatagaaagtaacatcataaca
gaaggttatacatctacattcacaatccatttccgcagcaaattgaatgcttgtctcaag
gcagatgacaccggtactggcagcatcagagcacctgaagtgactttctgcggcatggct
gcttttaccagggtggtaacaggcacagaccggaaggcacagactgacttggctctcttc
tactag

>gi568815576f:40161830_40423238|GENSCAN_predicted_peptide_4|2169_aa
MACARSDETKFKPTNGRQPNWQLSIATSQWWQQCQKGGSAERTTAKRRPLHASGAPGANP
NNAQVTGALLQSESGTAPDSTLGGAAASNYANSTWGSGASSNNGTSPNPIHIWDKVIVDG
SDMEEWPCIASKDTESSSENTTDNNSASNPGSEKSTLPGSTTSNKGKGSQCQSASSGNEC
NLGVWKSDPKAKSVQSSNSTTENNNGLGNWRNVSGQDRIGPGSGFSNFNPNSNPSAWPAL
VQEGTSRKGALETDNSNSSAQVSTVGQTSREQQSKMENAGVNFVVSGREQAQIHNTDGPK
NGNTNSLNLSSPNPMENKGMPFGMGLGNTSRSTDAPSQSTGDRKTGSVGSWGAARGPSGT
DTVSGQSNSGNNGNNGKEREDSWKGASVQKSTGSKNDSWDNNNRSTGGSWNFGPQDSNDN
KWGEGNKMTSGVSQGEWKQPTGSDELKIGEWSGPNQPNSSTGAWDNQKGHPLPENQGNAQ
APCWGRSSSSTGSEVGGQSTGSNHKAGSSDSHNSGRRSYRPTHPDCQAVLQTLLSRTDLD
PRVLSNTGWGQTQIKQDTVWDIEEVPRPEGKSDKGTEGWESAATQTKNSGGWGDAPSQSN
QMKSGWGELSASTEWKDPKNTGGWNDYKNNNSSNWGGGRPDEKTPSSWNENPSKDQGWGG
GRQPNQGWSSGKNGWGEEVDQTKNSNWESSASKPVSGWGEGGQNEIGTWGNGGNASLASK
GGWEDCKRSPAWNETGRQPNSWNKQHQQQQPPQQPPPPQPEASGSWGGPPPPPPGNVRPS
NSSWSSGPQPATPKDEEPSGWEEPSPQSISRKMDIDDGTSAWGDPNSYNYKNVNLWDKNS
QGGPAPREPNLPTPMTSKSASVWSKSTPPAPDNGTSAWGEPNESSPGWGEMDDTGASTTG
WGNTPANAPNAMKPKVKVELMSQTEDNPSSKMDLSVGSLSDKKFDVDKRAMNLGDFNDIM
RKDRSGFRPPNSKDMGTTDSGPYFEKGGSHGLFGNSTAQSRGLHTPVQPLNSSPSLRAQV
PPQFISPQVSASMLKQFPNSGLSPGLFNVGPQLSPQQIAMLSQLPQIPQFQLACQLLLQQ
QQQQQLLQNQRKISQAVRQQQEQQLARMVSALQQQQQQQQRQPGMKHSPSHPVGPKPHLD
NMVPNALNVGLPDLQTKGPIPGYGSGFSSGGMDYGMVGGKEAGTESRFKQWTSMMEGLPS
VATQEANMHKNGAIVAPGKTRGGSPYNQFDIIPGDTLGGHTGPAGDSWLPAKSPPTNKIG
SKSSNASWPPEFQPGVPWKGIQNIDPESDPYVTPGSVLGGTATSPIVDTDHQLLRDNTTG
SNSSLNTSLPSPGAWPYSASDNSFTNVHSTSAKFPDYKSTWSPDPIGHNPTHLSNKMWKN
HISSRNTTPLPRPPPGLTNPKPSSPWSSTAPRSVRGWGTQDSRLASGEEDLPKGTPLFIH
KGLMLTQMVKTQRGLPKRGSHGCLLFLLVEEKIDGSTLRTICMQHGPLLTFHLNLTQGTA
LIRYSTKQEAAKAQTALHMCVLGNTTILAEFATDDEVSRFLAQAQPPTPAATPSAPAAGW
QSLETGQNQSDPVGPALNLFGGYVYIVKYILPNQETVNNGSHFTETQEDLGGVAEEDRVG
MAAGGDHGSPDSYRSPLASRYASPEMCFVFSDRYKFRTWRQLWLWLAEAEQTLGLPITDE
QIQEMKSNLENIDFKMAAEEEKRLRHDVMAHVHTFGHCCPKAAGIIHLGATSCYVGDNTL
ARVISRLADFAKERASLPTLGFTHFQPAQLTTVGKRCCLWIQDLCMDLQNLKRVRDDLRF
RGVKGTTGTQASFLQLFEGDDHKVEQLDKMVTEKAGFKRAFIITGQTYTRKVDIEVLSVL
ASLGASVHKICTDIRLLANLKEMEEPFEKQQIGSSAMPYKRNPMRSERCCSLARHLMTLV
MDPLQTASVQWFERTLDDSANRRICLAEAFLTADTILNTLQNISEGLVVYPKVIERRIRQ
ELPFMATENIIMAMVKAGGSRQDCHEKIRVLSQQAASVVKQEGGDNDLIERIQVDAYFSP
IHSQLDHLLDPSSFTGRASQQAPLSADWVRVGKLLTRPGLALRWPVGQAGAAGRLWSDCP
TEGRMGRRRPREKHRLGPESRRADLARVAVGMGAGDAAGRGGRLRPPIQIQVQNPGSDPG
PRLGTPTCP

>gi568815576f:40161830_40423238|GENSCAN_predicted_CDS_4|6510_bp
atggcctgtgcccgaagtgacgaaaccaagtttaagccaaccaacggccgccagcccaat
tggcagctctccatcgccaccagtcaatggtggcaacaatgccaaaagggtggcagtgcc
gaacggacaaccgccaagcgccgcccgctacatgcctcgggagcacctggagcaaaccca
aacaacgcacaagtgacaggagcgctgctgcagagtgagagtgggactgcgccagactca
acccttggaggtgctgctgcttcaaattatgcaaattccacttggggctcgggagcctcc
tccaacaacggcacctcccccaacccaattcacatctgggacaaggtgattgtagacggg
tctgacatggaagagtggccttgtattgccagcaaagacactgaatcttcttccgaaaac
accaccgataacaacagtgcctcgaaccctggctctgagaagagcactctgccaggaagc
accactagtaacaaaggaaaagggagccagtgccagtctgcaagttctgggaacgaatgt
aatcttggggtctggaaatctgaccctaaggctaaatctgttcaatcttccaactctact
acagagaacaacaatggactaggaaattggaggaatgtgagtggtcaggatagaattgga
cctggctctggcttcagcaactttaacccaaatagcaacccatctgcctggccagcactg
gtccaagaaggaacttctaggaaaggggcattggaaacagataatagtaattccagtgca
caggttagcacagtaggtcagacatccagggaacagcagtcaaagatggaaaatgcgggt
gttaattttgttgtctctggcagagaacaggctcaaattcataacactgatggaccaaaa
aatggaaacactaactccttgaacttaagttcaccaaaccccatggagaataagggaatg
ccctttggaatgggcttggggaacacctccaggagcactgatgccccttcacaaagcact
ggagatcgaaagactgggagtgttggatcttggggtgcagctagggggccttctggaact
gacacagtctctggacaaagcaattctggaaacaatgggaacaatggaaaagagagagag
gactcctggaaaggagcttctgttcagaaatcaactgggtcaaaaaatgactcttgggac
aacaataacaggtctacgggtgggtcctggaactttggcccccaggactctaatgacaac
aaatggggtgaagggaacaaaatgacatctggggtctctcagggagaatggaaacagccg
actgggtctgatgagttgaaaattggagaatggagtggtccaaaccaaccaaattctagc
actggagcatgggacaatcaaaagggccaccccctccctgaaaaccaaggcaatgcccag
gctccctgttggggaagatcttccagctccacaggaagtgaagttggaggtcaaagcact
ggaagcaaccacaaagcaggaagtagtgacagtcataactctggccgtcggtcgtacagg
cccacacatcctgattgtcaggctgtcttgcagactcttttgagccgaactgatttggac
cccagggtgctctcaaacactggctggggccaaactcaaattaagcaggacacagtgtgg
gacattgaagaggtgccaaggcctgaggggaaatctgacaaaggaactgaggggtgggag
agcgctgccacacagaccaagaactcagggggctggggagatgcacccagccaaagcaat
caaatgaagtctggatggggggagctctcagcctctacagagtggaaagaccccaagaac
acaggaggctggaatgactacaagaacaacaactcttccaactggggaggaggacgacct
gatgaaaagacaccttcctcttggaatgagaatcccagcaaggatcaggggtggggaggt
ggacgccagcccaatcaaggatggtcttctggaaagaatggttggggggaggaagtcgat
cagacaaaaaacagcaattgggaaagttctgcaagtaaacctgtgtctgggtggggtgaa
ggagggcagaatgaaatcgggacttggggtaatggtggcaatgcaagcctagcttcaaaa
ggtgggtgggaggattgcaaaagatccccagcatggaatgagacgggccgacagcccaat
tcctggaataaacaacaccaacagcagcagcccccacagcagccgccgccaccacaacca
gaggcttctggttcgtggggaggcccacccccaccacctccaggcaacgttcgaccttcc
aattccagctggagcagcgggccacagcctgcaacacctaaggatgaggaacccagtggt
tgggaagagccatccccacagtcaattagtcggaaaatggacattgatgatggcacttca
gcatggggagaccctaacagttataactacaagaatgtgaatctgtgggataagaattcc
caagggggcccagcacctcgagaaccaaacctgcccaccccaatgaccagtaaatcggca
tcagtctggagcaaaagcacaccacctgctccagataatggtacttccgcttggggtgag
ccaaatgaaagcagtcctgggtggggcgagatggatgatacaggagcatcgaccacaggc
tgggggaacacgcccgccaacgctcccaatgccatgaagcctaaagtgaaagtggagctc
atgagtcagactgaagataatccaagcagcaaaatggatttgtctgtaggaagcctttca
gataaaaaatttgatgtggacaagcgagcgatgaatctcggggattttaatgatatcatg
aggaaggatcgatctgggttccgtccacctaattccaaagacatgggaaccacagatagt
gggccttattttgagaagggcggtagtcatggtttgtttggaaacagcacagcacaatcg
agaggtctgcacacacccgtgcagccactaaattcttctcccagtctccgggcgcaagtg
cctccccagtttatttccccccaggtttctgcctcaatgctcaagcagtttcccaacagt
ggcctgagtccaggtcttttcaatgtggggccccagttatctcctcaacaaattgccatg
ctgagccagcttccacaaattccccagtttcagttggcatgtcagcttctcttgcagcag
cagcaacagcagcagttgttacagaaccagagaaagatttctcaagctgtacgccaacag
caagagcagcagctggctcgaatggtgagtgcactgcagcagcagcagcagcagcagcag
aggcagccaggcatgaagcactcgccctctcatcctgttgggcccaagccgcatctggac
aacatggtacccaacgcattgaatgtggggctcccagaccttcaaaccaaagggccaata
cctggatatggttctggcttcagctctggcggcatggactatggcatggttggtgggaag
gaggctggaaccgagtctcgctttaaacagtggacctccatgatggaggggctgccctct
gtagccacacaggaagccaatatgcacaaaaatggcgctatagtggcccctggtaaaacc
cggggagggtcaccgtacaaccagtttgatatcatccctggtgacacactgggtggccat
acgggtcctgctggtgatagctggttacctgccaaatctccaccaacaaataaaatcgga
agtaaatccagcaatgccagttggcctccagaattccaaccaggagtgccatggaaaggt
atccaaaacattgaccctgaatctgacccctatgtcaccccaggaagtgtgctggggggt
acagccacatctcccattgtagatactgaccaccaactgctgcgggataacaccacaggg
tctaattcttccctcaacacctcgctgccttcacctggtgcctggccctacagtgcctct
gacaactcctttaccaacgttcatagcacttcagcaaagttccctgattacaaatcaaca
tggtccccagatcccataggacacaaccccactcatctctccaacaagatgtggaaaaac
catatttcctccaggaacactacaccgctgccccgcccacctcctggtctgaccaacccc
aaaccatcatctccctggagcagcacagcaccccgatcagtcagggggtgggggacacag
gactcacggctcgcctcgggtgaggaggatctgcctaaaggaacaccattgttcatccac
aaggggcttatgttaacacagatggtgaagacacaaagaggtcttcccaagagaggaagt
catggttgtctgctttttcttctggtagaggaaaagattgatgggtcaaccttgagaacg
atctgcatgcagcatggcccactgctgacattccatctgaatctaacccagggcactgcc
ctgatccgatacagcaccaaacaggaggcggccaaggcccaaactgcactgcacatgtgt
gtgttgggaaacactaccatccttgctgagtttgccactgatgatgaagtcagccgcttt
ctggcacaagctcagccccctacacctgcagcaaccccaagtgcgccagctgcggggtgg
cagtcgctggagaccggccagaaccagtcagatcccgtgggacctgctctgaatcttttt
gggggctatgtttacatagtaaaatacatcctaccaaatcaggaaacagtgaataatggg
agtcacttcacagagacgcaggaggacctggggggcgtagcagaggaggatagggttggg
atggcggctggaggcgatcatggttcgcccgacagctaccgctcacctcttgcctcccgc
tatgccagcccggagatgtgcttcgtgtttagcgacaggtataaattccggacatggcgg
cagctgtggctgtggctggcggaggccgagcagacattgggtttgcctatcacagatgaa
caaatccaggagatgaaatcaaacctggagaacatcgacttcaagatggcagctgaggaa
gagaaacgtttacgacatgatgtgatggctcacgtgcacacatttggccactgctgtcca
aaagctgcaggcattattcaccttggtgctacttcttgctatgttggagacaatactctt
gccagagtgatctctcggcttgccgactttgctaaggaacgagccagtctacccacatta
ggtttcacacatttccagcctgcacagctgaccacagttgggaaacgttgctgtctttgg
attcaggatctttgcatggatctccagaacttgaagcgtgtccgagatgacctgcgcttc
cggggagtaaagggtaccactggcactcaggccagtttcctgcagctctttgagggagat
gaccataaggtagagcagcttgacaagatggtgacagaaaaggcaggatttaagagagct
ttcatcatcacagggcagacatatacacgaaaagtggatattgaagtactgtctgtgctg
gctagcttgggggcatcagtgcacaagatttgcaccgacatacgcctcctggcaaacctc
aaggagatggaggaaccctttgaaaaacagcagattggctcaagtgcgatgccatataag
cggaatcccatgcgttcagaacgttgctgcagtcttgcccgccacctgatgacccttgtc
atggacccgctacagacagcatctgtccagtggtttgaacgcacactggatgatagtgcc
aaccgacggatctgtttggccgaggcatttcttaccgcagatactatattgaatacgctg
cagaacatttctgaaggattggtcgtgtaccccaaagtaattgaacggcgcattcggcaa
gagctgcctttcatggccacagagaacatcatcatggccatggtcaaagctggaggtagc
cgccaggattgccatgagaaaatcagagtgctttctcagcaggcagcttctgtggttaag
caggaagggggtgacaatgacctcatagagcgtatccaggttgatgcctacttcagtccc
attcactcccagttggatcatttactggatccttcttctttcactggtcgtgcctcccag
caggcgccgctgagcgctgactgggtgcgagtggggaagctgctaacccgacccggattg
gcgctgaggtggcccgtggggcaggcgggggctgcggggcggctgtggtcggactgtccg
accgagggtcgtatgggccgaaggcgcccacgagagaagcaccgcctcgggccggagagc
cgccgggccgaccttgctcgcgtcgccgttggaatgggggccggggacgcggcggggcgg
ggaggtcggctgcgtcctccgattcagatccaggtgcagaacccgggctcggatccaggc
ccgagactcggaacaccgacttgtccttaa

>gi568815576f:40161830_40423238|GENSCAN_predicted_peptide_5|872_aa
MEVGKKGLLEEATSELEFERIMVKEEFSPSSLLAALQLFQHAQCAPNLGLLLPGSPFPQK
EESAEQPEFYYDEFGFRVYKEEGDEPGSSLLANSPLMEDAPQRLRWQAHLEFTHNHDVGD
LTWDKIAVSLPRSEKLRSLVLAGIPHGMRPQLWMRLSGALQKKRNSELSYREIVKNSSND
ETIAAKQIEKDLLRTMPSNACFASMGSIGVPRLRRVLRALAWLYPEIGYCQGTGMGRGTF
SNLHKDLGRRPLSLCPGQVAACLLLFLEEEDAFWMMSAIIEDLLPASYFSTTLLGVQTDQ
RVLRHLIVQYLPRLDKLLQEHDIELSLITLHWFLTAFASVVDIKLLLRIWDLFFYEGSRV
LFQLTLGMLHLKEEELIQSENSASIFNTLSDIPSQMEDAELLLGVAMRLAGSLTDVAVET
QRRKHLAYLIADQGQLLGAGTLTNLSQVVRRRTQRRKSTITALLFGESSASARQCGHAGV
CPHAHVDVELPPRGPGVGCGHHGSLGLLGEDDLEALKAKNIKQTELVADLREAILRVARH
FQCTDPKNCSVVSRQLPGLLPNTALTPPTPLVGLCSLWQELTPDYSMESHQRDHENYVAC
SRSHRRRAKALLDFERHDDDELGFRKNDIITIVSQKDEHCWVGELNGLRGWFPAKFVEVL
DERSKEYSIAGDDSVTEGVTDLVRGTLCPALKALFEHGLKKPSLLGGACHPWLFIEEAAG
REVERDFASVYSRLVLCKTFRLDEDGKVLTPEELLYRAVQSVNVTHDAVHAQMDVKLRSL
ICVGLNEQVLHLWLEVLCSSLPTVEKWYQPWSFLRSPGWVQIKCELRVLCCFAFSLSQDW
ELPAKREAQQPLKEGVRDMLVKHHLFSWDVDG

>gi568815576f:40161830_40423238|GENSCAN_predicted_CDS_5|2619_bp
atggaagttggaaagaaaggcttgttagaggaggccacatctgagctagaatttgaaaga
ataatggtcaaagaggaattcagcccctccagcctgctggctgccttgcagctgtttcag
catgcccagtgtgctcccaacttggggctgcttctccctggatctcctttccctcagaag
gaagagtcagcagagcaaccagagttctactacgatgagtttggtttccgtgtgtacaag
gaagaaggtgatgagcctggctccagtctgctggcgaactcccctctgatggaggatgct
ccacagaggctgcggtggcaggcccacctggagttcacccataaccacgatgtgggggat
ctcacctgggacaagattgccgtctccctaccccgctctgagaagctccgctccctggtg
ctggccggcatcccacatggcatgaggccacagctgtggatgcggctctctggggccctg
cagaagaagaggaactctgagctgtcctaccgcgagattgtgaagaacagctccaacgat
gagaccatcgctgccaagcagatcgagaaggacctgctccgcaccatgcccagcaacgcc
tgcttcgccagcatgggtagcatcggggtgccccgcctgcgcagggtgctccgggccctg
gcctggctctacccagagatcggctactgccagggcaccggcatgggtaggggcaccttt
tctaatctgcacaaagacctgggtagaaggcccttgtccctgtgccctggccaggtggcc
gcctgcctcctgctgttcctggaggaggaggacgccttctggatgatgtctgccatcatc
gaggacctgctccccgcctcctacttcagcaccaccctgctgggtgtccagactgaccag
cgggtcctgcgccacctcattgtccagtacctgcctcgcctggacaagctgctccaggag
catgacattgagctgtccctgatcacactgcactggttcctcacggccttcgccagcgtg
gtggacatcaagctgctcctgcgcatctgggacctgtttttctacgagggctcccgggtg
ctgttccagctcacgctgggcatgctgcacctcaaggaggaagagctgatccagtcagag
aactcggcctccatcttcaacacgctatcggatatcccgtcgcagatggaggacgcggag
ctgcttctgggggtggccatgcggctggccggctccctcaccgatgtggccgtggagact
cagcgccgcaagcacctggcctatctcattgcagaccagggccagctcctgggggccggc
accctcaccaacctctctcaggttgttcgccgcaggacccagcggaggaagtccaccatc
actgctctgctcttcggtgagagctctgcgagtgccaggcagtgtgggcatgcgggagtc
tgtcctcacgctcatgtggacgtggagcttcctcctcgggggcctggagtgggctgtggt
caccatggttctctgggcctcctaggggaggatgacctggaggcactcaaggccaagaac
atcaagcagacggaactggtggctgacctccgggaagccatcctgcgcgtggcacgccac
ttccagtgcacagaccccaaaaactgcagcgtggtgagtcgccagctccctgggctgcta
ccaaacacggccctaactcctccaacccccttggtgggcctgtgttcactgtggcaggag
ctgactccagactatagcatggagagccaccagcgggaccacgagaactacgtggcgtgc
tcacgcagccaccggcgccgagccaaggccctgctggactttgagcggcacgacgacgac
gagctgggcttccgcaagaacgacatcatcacaatcgtgtctcagaaggacgagcactgc
tgggtgggggagctcaacggcctgcgaggctggtttccagccaagttcgtggaagtcctg
gatgagcgcagcaaagagtactccatcgcgggggatgactcggtgacggagggggtcaca
gacctcgtgcgagggaccctctgcccggcccttaaggccctgttcgaacatggactgaag
aagccatccctgcttgggggcgcctgccacccctggctgtttatcgaggaggctgcaggc
cgggaggtcgagagagactttgcctccgtgtattcccgtctggtgctctgtaagaccttc
aggttggatgaagatggcaaagtcctgaccccggaggagctgctctaccgggctgtgcag
tctgtgaacgtgacccacgatgcagtgcatgcacaaatggatgtgaagctccgctcactg
atctgcgtggggctcaatgagcaggtgctgcacctgtggctggaggtgctctgctccagc
ctgcccaccgtggagaagtggtaccagccctggtccttcctgcgcagcccgggctgggtc
cagatcaagtgtgagctccgagtcctctgctgctttgccttcagcctctcccaggactgg
gagctccctgcgaagagagaggcgcagcagcccctgaaggagggcgtccgggacatgctg
gtgaagcaccacctcttcagctgggatgtggacgggtga

>gi568815576f:40161830_40423238|GENSCAN_predicted_peptide_6|749_aa
MVTSPVPPLHMTLSSLLPLASSMSESHCGFLNVSLLFQALGAQSQPKSASEKSQRSKKAK
ELKPKVKKLKYHQYIPPDQKQDRGAPPMDSSYAKILQQQQLFLQLQILNQQQQQHHNYQA
ILPAPPKSAGEALGSSGTPPVRSLSTTNSSSSSGAPGPCGLARQNSTSLTGKPGALPANL
DDMKVAELKQELKLRSLPVSGTKTELIERLRAYQDQISPVPGAPKAPAATSILHKAGEVV
VAFPAARLSTGPALVAAGLAPAEVVVATVASSGVVKFGSTGSTPPVSPTPSERSLLSTGD
ENSTPGDTFGEMVTSPLTQLTLQASPLQILVKEEGPRAGSCCLSPGGRAELEGRDKDQML
QEKDKQIEALTRMLRQKQQLVERLKLQLEQEKRAQQPAPAPAPLGTPVKQENSFSSCQLS
QQPLGPAHPFNPSLAAPATNHIDPCAVAPGPPSVVVKQEALQPEPEPVPAPQLLLGPQGP
SLIKGVAPPTLITDSTGTHLVLTVTNKNADSPGLSSGSPQQMDLEHPLQPLFGTPTSLLK
KEPPGYEEAMSQQPKQQENGSSSQQMDDLFDILIQSGEISADFKEPPSLPGKEKPSPKTV
CGSPLAAQPSPSAELPQAAPPPPGSPSLPGRLEDFLESSTGLPLLTSGHDGPEPLSLIDD
LHSQMLSSTAILDHPPSPMDTSELHFVPEPSSTMGLDLADGHLDSMDWLELSSGGPVLSL
APLSTTAPSLFSTDFLDGHDLQLHWDSCL

>gi568815576f:40161830_40423238|GENSCAN_predicted_CDS_6|2250_bp
atggtgacatccccagtcccgcctctgcacatgaccctgtcatctctgctgcctcttgct
tcctccatgtcagaaagtcactgtggcttcctgaatgtcagtctcctctttcaggccttg
ggagcacaaagccaacccaagtctgccagtgagaagtcacagcgcagcaagaaggccaag
gagctgaagccaaaggtgaagaagctcaagtaccaccagtacatccccccggaccagaag
caggacaggggggcaccccccatggactcatcctacgccaagatcctgcagcagcagcag
ctcttcctccagctgcagatcctcaaccagcagcagcagcagcaccacaactaccaggcc
atcctgcctgccccgccaaagtcagcaggcgaggccctgggaagcagcgggaccccccca
gtacgcagcctctccactaccaatagcagctccagctcgggcgcccctgggccctgtggg
ctggcacgtcagaacagcacctcactgactggcaagccgggagccctgccggccaacctg
gacgacatgaaggtggcagagctgaagcaggagctgaagttgcgatcactgcctgtctcg
ggcaccaaaactgagctgattgagcgccttcgagcctatcaagaccaaatcagccctgtg
ccaggagcccccaaggcccctgccgccacctctatcctgcacaaggctggcgaggtggtg
gtagccttcccagcggcccggctgagcacggggccagccctggtggcagcaggcctggct
ccagctgaggtggtggtggccacggtggccagcagtggggtggtgaagtttggcagcacg
ggctccacgccccccgtgtctcccaccccctcggagcgctcactgctcagcacgggcgat
gaaaactccacccccggggacacctttggtgagatggtgacatcacctctgacgcagctg
accctgcaggcctcgccactgcagatcctcgtgaaggaggagggcccccgggccgggtcc
tgttgcctgagccctggggggcgggcggagctagaggggcgcgacaaggaccagatgctg
caggagaaagacaagcagatcgaggcgctgacgcgcatgctccggcagaagcagcagctg
gtggagcggctcaagctgcagctggagcaggagaagcgagcccagcagcccgcccccgcc
cccgcccccctcggcacccccgtgaagcaggagaacagcttctccagctgccagctgagc
cagcagcccctgggccccgctcacccattcaaccccagcctggcggccccagccaccaac
cacatagacccttgtgctgtggccccggggcccccgtccgtggtggtgaagcaggaagcc
ttgcagcctgagcccgagccggtccccgccccccagttgcttctggggcctcagggcccc
agcctcatcaagggggttgcacctcccaccctcatcaccgactccacagggacccacctt
gtcctcaccgtgaccaataagaatgcagacagccctggcctgtccagtgggagcccccag
cagatggacctggagcacccactgcagcccctctttgggacccccacttctctgctgaag
aaggaaccacctggctatgaggaagccatgagccagcagcccaaacagcaggaaaatggt
tcctcaagccagcagatggacgacctgtttgacattctcattcagagcggagaaatttca
gcagatttcaaggagccgccatccctgccagggaaggagaagccatccccgaagacagtc
tgtgggtcccccctggcagcacagccatcaccttctgctgagctcccccaggctgcccca
cctcctccaggctcaccctccctccctggacgcctggaggacttcctggagagcagcacg
gggctgcccctgctgaccagtgggcatgacgggccagagcccctttccctcattgacgac
ctccatagccagatgctgagcagcactgccatcctggaccaccccccgtcacccatggac
acctcggaattgcactttgttcctgagcccagcagcaccatgggcctggacctggctgat
ggccacctggacagcatggactggctggagctgtcgtcaggtggtcccgtgctgagccta
gcccccctcagcaccacagcccccagcctcttctccacagacttcctcgatggccatgat
ttgcagctgcactgggattcctgcttgtag