GENSCAN 1.0 Date run: 4-Nov-116 Time: 10:32:16 Sequence gi568815591r:151139106_151375152 : 236047 bp : 49.60% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 874 1011 138 2 0 78 72 104 0.957 8.24 1.02 Intr + 2793 2947 155 1 2 37 98 202 0.497 15.89 1.03 Intr + 3058 3148 91 0 1 57 81 142 0.818 9.97 1.04 Intr + 3307 3529 223 2 1 86 99 195 0.990 17.59 1.05 Intr + 4236 4491 256 0 1 68 92 363 0.999 32.15 1.06 Term + 4632 4838 207 2 0 92 48 125 0.761 6.24 1.07 PlyA + 5308 5313 6 1.05 2.04 PlyA - 6925 6920 6 1.05 2.03 Term - 10037 9484 554 0 2 90 42 565 0.941 46.58 2.02 Intr - 27570 27523 48 1 0 90 63 39 0.143 0.25 2.01 Init - 28443 27906 538 0 1 73 131 684 0.965 66.23 2.00 Prom - 30443 30404 40 -6.66 3.07 PlyA - 30965 30960 6 1.05 3.06 Term - 37192 37007 186 1 0 120 43 193 0.569 15.49 3.05 Intr - 37571 37458 114 2 0 136 71 181 0.993 21.84 3.04 Intr - 42353 41834 520 1 1 87 86 616 0.978 54.26 3.03 Intr - 47554 47287 268 2 1 103 61 323 0.968 27.69 3.02 Intr - 48003 47710 294 1 0 19 14 279 0.222 10.58 3.01 Init - 48476 48347 130 2 1 86 60 187 0.862 16.04 3.00 Prom - 50762 50723 40 -7.26 4.34 PlyA - 51530 51525 6 -0.45 4.33 Term - 51982 51875 108 1 0 52 45 166 0.971 7.21 4.32 Intr - 52797 52533 265 2 1 106 100 337 0.806 34.22 4.31 Intr - 53348 52999 350 2 2 66 35 392 0.776 25.85 4.30 Intr - 53724 53638 87 0 0 71 78 74 0.938 4.87 4.29 Intr - 54385 54069 317 2 2 94 81 317 0.943 27.38 4.28 Intr - 55258 55151 108 2 0 96 77 93 0.890 9.26 4.27 Intr - 55493 55411 83 1 2 61 66 53 0.626 -0.32 4.26 Intr - 56380 56325 56 1 2 73 69 43 0.669 -1.32 4.25 Intr - 57449 57343 107 0 2 68 64 152 0.844 10.73 4.24 Intr - 57947 57872 76 2 1 36 60 37 0.452 -5.21 4.23 Intr - 58222 58100 123 1 0 75 78 190 0.426 17.48 4.22 Intr - 58801 58717 85 0 1 42 80 75 0.967 1.92 4.21 Intr - 59898 59693 206 0 2 110 26 259 0.909 19.90 4.20 Intr - 61349 61278 72 2 0 85 89 91 0.989 8.50 4.19 Intr - 63991 63806 186 1 0 79 86 156 0.999 14.39 4.18 Intr - 64409 64290 120 2 0 76 92 118 0.999 11.69 4.17 Intr - 65731 65404 328 0 1 62 62 735 0.937 64.00 4.16 Intr - 69002 68770 233 2 2 113 -1 421 0.281 32.27 4.15 Intr - 71304 71082 223 2 1 132 109 190 0.932 23.63 4.14 Intr - 75086 74953 134 2 2 63 41 222 0.658 14.44 4.13 Intr - 75977 75774 204 2 0 33 94 291 0.938 23.60 4.12 Intr - 76627 76499 129 1 0 73 82 183 0.990 17.09 4.11 Intr - 76924 76862 63 1 0 121 92 52 0.992 7.81 4.10 Intr - 79086 78976 111 0 0 70 64 69 0.862 3.28 4.09 Intr - 79545 79456 90 0 0 105 98 -14 0.730 1.49 4.08 Intr - 79768 79649 120 1 0 92 81 113 0.929 11.69 4.07 Intr - 80054 79959 96 2 0 69 97 129 0.977 12.11 4.06 Intr - 82575 82473 103 2 1 70 113 96 0.994 10.48 4.05 Intr - 83511 83416 96 2 0 106 74 138 0.999 13.32 4.04 Intr - 84744 84573 172 1 1 66 109 245 0.982 23.40 4.03 Intr - 85009 84827 183 2 0 94 54 263 0.984 23.26 4.02 Intr - 85883 85671 213 0 0 83 73 274 0.612 24.09 4.01 Init - 87353 87200 154 2 1 81 68 225 0.994 19.94 4.00 Prom - 91846 91807 40 -5.46 5.00 Prom + 92326 92365 40 -1.76 5.01 Init + 94907 95169 263 1 2 84 94 332 0.995 27.94 5.02 Intr + 95943 96507 565 0 1 87 61 571 0.999 47.10 5.03 Intr + 97303 97485 183 0 0 92 99 229 0.999 24.38 5.04 Term + 98269 99576 1308 0 0 65 41 1350 0.997 119.59 5.05 PlyA + 99696 99701 6 -1.95 6.22 PlyA - 99889 99884 6 -1.95 6.21 Term - 100051 99998 54 1 0 79 49 88 0.994 1.56 6.20 Intr - 100392 100291 102 0 0 98 91 163 0.999 17.97 6.19 Intr - 100641 100438 204 0 0 56 94 184 0.907 15.20 6.18 Intr - 101142 101007 136 2 1 37 81 193 0.752 14.07 6.17 Intr - 101492 101320 173 2 2 85 94 96 0.564 8.64 6.16 Intr - 102548 102387 162 2 0 56 91 200 0.870 17.27 6.15 Intr - 102873 102772 102 0 0 53 46 182 0.822 10.87 6.14 Intr - 103127 103032 96 2 0 108 94 149 0.999 17.71 6.13 Intr - 103498 103376 123 1 0 67 57 214 0.959 16.98 6.12 Intr - 103817 103616 202 1 1 78 46 199 0.387 13.99 6.11 Intr - 106566 106355 212 0 2 72 109 170 0.866 15.21 6.10 Intr - 109574 109380 195 2 0 27 89 95 0.674 3.11 6.09 Intr - 114798 114710 89 1 2 96 81 70 0.879 6.79 6.08 Intr - 115394 115190 205 2 1 16 100 74 0.007 0.17 6.07 Intr - 121673 121615 59 0 2 92 100 60 0.016 6.20 6.06 Intr - 132358 132152 207 2 0 109 98 -12 0.211 0.95 6.05 Intr - 138084 137972 113 2 2 107 99 14 0.487 4.42 6.04 Intr - 151389 151287 103 1 1 15 56 82 0.027 -3.07 6.03 Intr - 152414 152309 106 2 1 34 113 58 0.073 2.69 6.02 Intr - 162854 162658 197 0 2 94 52 88 0.039 4.93 6.01 Init - 163189 163096 94 1 1 87 75 44 0.075 3.33 6.00 Prom - 173122 173083 40 -4.66 7.00 Prom + 185741 185780 40 -4.96 7.01 Init + 195324 195399 76 2 1 42 93 67 0.550 3.85 7.02 Intr + 202568 202860 293 0 2 78 72 152 0.310 9.45 7.03 Intr + 209968 210135 168 0 0 66 87 171 0.993 14.94 7.04 Intr + 212319 212377 59 2 2 84 108 29 0.932 2.18 7.05 Intr + 213707 213777 71 2 2 77 77 48 0.928 1.43 7.06 Intr + 216663 216845 183 1 0 50 65 167 0.922 10.36 7.07 Intr + 217023 217117 95 1 2 33 106 11 0.414 -2.92 7.08 Intr + 221036 221142 107 1 2 84 70 29 0.226 -0.29 7.09 Intr + 227834 228020 187 2 1 108 53 81 0.820 6.29 7.10 Intr + 228756 228863 108 2 0 53 116 63 0.883 6.08 7.11 Intr + 229630 229782 153 0 0 101 65 67 0.861 5.97 7.12 Intr + 234992 235138 147 1 0 59 64 187 0.734 13.83 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 12736 12673 64 1 1 92 71 64 0.978 6.41 S.002 Init - 115298 115190 109 2 1 87 100 63 0.930 7.84 S.003 Intr + 168611 168737 127 1 1 55 119 16 0.828 1.65 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591r:151139106_151375152|GENSCAN_predicted_peptide_1|356_aa XSGPAEVLSSSPKLDPPPSPHSNRKKHRRKKSTGTPRPDGPSSATEEAEESFEFVVVSLT GQTWHFEASTAEERELWVQSVQAQILASLQGCRSAKDKTRLGNQNAALAVQAVRTVRGNS FCIDCDAPNPDWASLNLGALMCIECSGIHRHLGAHLSRVRSLDLDDWPPELLAVMTAMGN ALANSVWEGALGGYSKPGPDACREEKERWIRAKYEQKLFLAPLPSSDVPLGQQLLRAVVE DDLRLLVMLLAHGSKEEVNETYGDGDGRTALHLSSAMANVVFTQLLIWYGVDVRSRDARG LTPLAYARRAGSQECADILIQHGCPGEGCGLAPTPNREPANGTNPSAELHRSPSLL >gi568815591r:151139106_151375152|GENSCAN_predicted_CDS_1|1071_bp nccagtggcccagctgaggtactcagttccagccccaagctggatcctcccccatctccc cactccaaccggaagaagcaccggaggaaaaagagcaccgggaccccccgaccagacggc cccagcagtgctactgaagaggcagaggagtcgtttgaatttgtggtggtgtccctcact gggcagacgtggcacttcgaggcttcaacggcggaggagcgggagctgtgggttcagagt gtgcaggcccagatccttgccagcctgcaaggctgccgcagtgccaaggacaagactcga ctggggaaccagaacgcagctctggctgtgcaggccgtccgcaccgtccgcggcaacagc ttttgtatcgactgcgatgcacccaatccagactgggccagcctgaacctgggtgccctg atgtgcattgagtgctcaggcatccaccgacacctgggggctcacctgtcccgggtgcgc tcccttgacctcgatgactggccgcctgagctgctggctgtcatgactgccatgggcaat gccctcgccaacagcgtctgggagggggccttgggtggctactccaagccagggcctgat gcctgcagagaggagaaggaacgctggatacgggccaagtatgaacagaagctcttcctg gccccactgccaagctcagatgtgccactggggcagcagctgctccgggccgtggtggaa gatgacctgcggctgttggtgatgctcctggcacatggctccaaagaggaggtgaatgag acctatggggacggggacgggcggacggctctacatctctccagtgccatggccaacgtt gtcttcacgcagctgctcatctggtacggggtggacgtgaggagccgggacgcccggggc ctgactccactggcatatgctcgccgggccggcagccaggagtgtgcagacatcttgatc cagcatggctgccctggggagggctgtggcttagcgcctacccccaacagagagcctgcc aatggcaccaacccctctgctgagctgcaccgtagtcctagcctcctataa >gi568815591r:151139106_151375152|GENSCAN_predicted_peptide_2|379_aa MQRAGGGSAPGGNGGGGGGGPGTAFSIDSLIGPPPPRSGHLLYTGYPMFMPYRPLVLPQA LAPAPLPAGLPPLAPLASFAGRLTNTFCAGLGQAVPSMVALTTALPSFAEPPDAFYGPQE LAAAAAAAAATAARNNPEPGGRRPEGGLEADELLPAREKVAEPPPPPPPHFSETFPSLPG VDKLQGWDFRGHQDGAEGKVYSSDEEKLEASAGDPAGSEQEEEGSGGDSEDDGFLDSSAG GPGALLGPKPKLKGSLGTGAEEGAPVTAGVTAPGGKSRRRRTAFTSEQLLELEKEFHCKK YLSLTERSQIAHALKLSEVQVKIWFQNRRAKWKRIKAGNVSSRSGEPVRNPKIVVPIPVH VNRFAVRSQHQQMEQGARP >gi568815591r:151139106_151375152|GENSCAN_predicted_CDS_2|1140_bp atgcagcgggccggaggcggtagcgcccctgggggcaacggcgggggcggcggcgggggc ccgggcactgccttctccatcgactccctaatcgggccgccgccgccgcgctccggccac ttgctgtacaccggctaccccatgttcatgccctaccggccgctcgtgctgccgcaggcg ctggcccctgcgccgctgcccgctggcctcccgcccctcgccccgctagcctctttcgcc ggccgccttaccaacaccttctgcgcggggctgggtcaggctgtgccctcgatggtggcg ctgaccaccgcgctgcccagcttcgcggagccgcccgacgctttctacgggccccaggag ctcgccgccgccgctgccgccgccgccgccactgccgcccgaaacaaccccgagccaggc ggccgacgcccagagggtgggctggaagctgatgagctgctgccggcccgggagaaagtg gcagagcccccaccacctccgcctccgcacttctcagagacttttccaagtctgcccggg gtagacaagctacaaggatgggatttccgggggcaccaagatggggcagaggggaaggtg tacagctcagatgaggagaagctggaggcatcagcaggagacccagcaggcagcgaacag gaggaagagggctcaggcggtgacagcgaggatgacggtttcctggacagttctgcaggg ggcccaggggctcttctgggacctaaaccgaagctaaagggaagcctggggactggagct gaggagggggcaccggtgacagcaggggtcacagctcctggggggaaaagccgacggcgc cgcacagcatttaccagcgagcagcttttggaattggagaaggaatttcattgcaagaaa tacctgagcttgacagagcgctctcagatcgcccacgccctcaagctcagtgaggtgcag gtcaagatctggtttcagaatcgacgggccaagtggaagcgcatcaaagctggcaatgtg agcagccgttctggggagcccgtaagaaaccccaagattgttgtccccatacctgtgcat gtcaacaggtttgctgtgcggagccagcaccaacaaatggagcagggggcccggccctga >gi568815591r:151139106_151375152|GENSCAN_predicted_peptide_3|503_aa MALQNALYTGDLARLQELFPPHSTADLLLESRAAEPRWSSHQREECKGQGEPLDDRHPLC ARLVEKPSRGSEEHLKSGPGPIVTRTASGPALAFWQAVLAGDVGCVSRILADSSTGLAPD SVFDTSDPERWRDFRFNIRALRLWSLTYEEELTTPLHVAASRGHTEVLRLLLRRRARPDS APGGRTALHEACAAGHTACVHVLLVAGADPNIADQDGKRPLHLCRGPGTLECAELLLRFG ARVDGRSEEEEETPLHVAARLGHVELADLLLRRGACPDARNAEGWTPLLAACDVRCQSIT DAEATTARCLQLCSLLLSAGADADAADQDKQRPLHLACRRGHAAVVELLLSCGVSANTMD YGGHTPLHCALQGPAAALAQSPEHVVRALLNHGAVRVWPGALPKVLERWSTCPRTIEVLM NTYSVVQLPEEAVGLVTPETLQKHQRFYSSLFALVRQPRSLQHLSRCALRSHLEGSLPQA LPRLPLPPRLLRYLQLDFEGVLY >gi568815591r:151139106_151375152|GENSCAN_predicted_CDS_3|1512_bp atggccctgcagaatgccctctacaccggggacctggcaaggttgcaggagctgttcccc ccgcacagcacagccgacctgctgctggagagccgggccgcagagcctcgctggagcagc caccagagggaagagtgcaaggggcagggagagcccctcgatgacagacaccccctctgt gccaggctggtggagaagcccagcagagggtctgaggagcacctcaagtctggcccggga cccatcgtcacccgcacagcctcaggacctgcccttgccttctggcaggcagtgctggct ggggacgtgggctgtgtctcccgcatcctcgcggactccagtactggcctggctcctgat tccgtctttgataccagcgacccagagcgatggagggatttccgcttcaacatccgtgct ctgagactctggtctctgacatacgaagaggagctgaccaccccactgcatgtggcagcc agccgtggccacacggaagtcctgcggctgctgctgaggcggcgagcaaggccagacagt gcccctgggggccgcaccgccctgcacgaggcctgtgctgcaggccacactgcctgtgtt catgtgctgctggtggcaggagccgaccccaacatcgctgaccaggatgggaaacgcccc ctgcatctctgccgggggcctggcacccttgagtgtgcggagctgctcctcaggtttgga gcgagagtggatggtcggtccgaggaagaagaggagacccctttgcatgtggccgcccgg cttggccatgtggagctggcagatctgcttctaagacggggggcatgtcctgatgcccgc aatgccgaaggctggaccccactgctggctgcctgtgacgtccgctgccagtccatcacc gatgccgaggccaccaccgcccgctgcctgcagctgtgcagcttgctgctttcagctgga gcagacgctgatgctgcggaccaggacaagcagcgacccctgcacctggcctgccgccgt ggccatgcagctgtcgtggagctgctcctgtcctgtggtgtcagcgccaacaccatggac tatgggggacacacgcccctgcactgtgctctgcagggcccagctgcagccctggcccag agccccgagcacgtggttcgggctctgctcaaccatggcgccgtccgtgtctggccaggg gccctccccaaggtgctggagcgctggagcacgtgccctcggaccatcgaggtcctgatg aacacctacagtgttgtgcagcttcccgaggaggccgtcggcctggtgactcctgaaact ctgcagaaacatcagcgtttctactcctccctcttcgccttggtgaggcagcccaggtcg ctgcagcatttgagccgctgtgcgctccgctcccacctggagggcagcctgccccaagcg ctgccccgcctccccctgccaccgcgcctgctccgctacctgcagctggattttgagggc gtgctctactag >gi568815591r:151139106_151375152|GENSCAN_predicted_peptide_4|1666_aa MPSDLAKKKAAKKKEAAKARQRPRKGHEENGDVVTEPQVAEKNEANGRETTEVDLLTKEL EDFEMKKAAARAVTGVLASHPNSTDVHIINLSLTFHGQELLSDTKLELNSGRRYGLIGLN GIGKSMLLSAIGKREVPIPEHIDIYHLTREMPPSDKTPLHCVMEVDTERAMLEKEAERLA HEDAECEKLMELYERLEELDADKAEMRASRILHGLGFTPAMQRKKLKDFSGGWRMRVALA RALFIRPFMLLLDEPTNHLDLDACVWLEEELKTFKRILVLVSHSQDFLNGVCTNIIHMHN KKLKYYTGNYDQYVKTRLELEENQMKRFHWEQDQIAHMKNYIARFGHGSAKLARQAQSKE KTLQKMMASGLTERVVSDKTLSFYFPPCGKIPPPVIMVQNVSFKYTKDGPCIYNNLEFGI DLDTRVALVGPNGAGKSTLLKLLTGELLPTDGMIRKHSHVKIGRYHQHLQEQLDLDLSPL EYMMKCYPEIKEKEEMRKIIGRYGLTGKQQVSPIRNLSDGQKCRVCLAWLAWQNPHMLFL DEPTNHLDIETIDALADAINEFEGGMMLVSHDFRLIQQVAQEIWVCEKQTITKWPGDILA YKEHLKSKLVDEEPQLTKRTHNVCMSPATPGGAADLIKSSSKGVGEGYSRSRRMSAEYGQ RQQPGGRGGRSSGNKKSKKRCRRKESYSMYIYKVLKQVHPDIGISAKAMSIMNSFVNDVF EQLACEAARLAQYSGRTTLTSREVQTAVRLLLPGELAKHAVSEGTKAVTKYTSSKAYQRL WESSHATLQELLDQEQLLLEPAPDRERQSFQYRLASLYLHYLGLLRRFDTVYDQMVQPQK RRLLRRLLDGVAGRVLELKDELVRADLCENHCLDRVLQDFKLTPADLEVPIPKYFLLEQS TTVRERGLILAEILSRLEPVSSQKSFTGMHRTEAIILVQKAERARQGRLRATFMREIRRD EEQDGRIREDGWHKFSQGQAAVTIQKVWKGYLQRKRTQQDRRMEMEFIGMLPSPNQVEHL SIISQPCLVEDVQRLRQMEKEEEFRAAMVKAHDSLVETEGPDMKEKMKEQIRQWFIECHD LTGRFPDYPDASSGGSYSIFADKTPEQVRMELEMQMQENRKKEQEKSKEKGKDEKEKKKG KEEKAKKGEVDAVLQVLPSKCIPMICAGHEEYLNTWKNRCESIHPSQNYDSETLREEKRK EVELEIRIQKTPGKKTGKKKEKDLTSDRSVESLYEELVISGLLRKSESVALKDYIGDFLY LGSTLSLVKKLPMPSLFDIRQNVALYAVLRLGSPDIHIMAPLIRSILLVGPSGMGKKMLV KAVCTETGANLFDLSPENLLGKYPGRNGAQMMVHIVFKVRSPQLLAFAVLRPVATKEGGF EVTSPISLLGMISGAELVARLLQPSVIWIGNAEKNFYKKTPKEDKEMDPKRIKKDLTKAL RLLTPGDRVMLIGTTSRPQLAEMRGLCRVYERILFMPRPDYASRYGEAWGCGDGCAYQSL PTSRPIPHCCCEPSPPQPGLLEGLLDKQPSGLGPGFGSVDASRPHVFHMPAVLWKRMIEA RGIQPTQHLDISALAKVSDGYTPGHILQAIQSVLSERRFLQLSKRPLVASEFLGQLVKLD PVYREEEESLKDWYFKTPLGKKSMKHRMDQLEAEEAKLDKEKKKRK >gi568815591r:151139106_151375152|GENSCAN_predicted_CDS_4|5001_bp atgccctccgacctggccaagaagaaggcagccaaaaagaaggaggctgccaaagctcga cagcggcccagaaaaggacatgaagaaaatggagatgttgtcacagaaccacaggtggca gagaagaatgaggccaatggcagagagaccacagaagtagatttgctgaccaaggagcta gaggactttgagatgaagaaagctgctgctcgagctgtcactggcgtcctggcctctcac cccaacagtactgatgttcacatcatcaacctctcacttacctttcatggtcaagagctg ctcagtgacaccaaactggaattaaactcaggccgtcgttatggcctcattggtttaaat ggaattggaaagtccatgctgctctctgctattgggaagcgtgaagtgcccatccctgag cacatcgacatctaccatctgactcgagagatgccccctagtgacaagacacccttgcat tgtgtgatggaagtcgacacagagcgggccatgctggagaaagaggcagagcggctggct catgaggatgcggagtgtgagaagctcatggagctctacgagcgcctggaggagctggat gccgacaaggcagagatgagggcctcgcggatcttgcatggactgggtttcacacctgcc atgcagcgcaagaagctaaaagacttcagtgggggctggaggatgagggttgcccttgcc agagccctctttattcggcccttcatgctgctcctggatgagcctaccaaccacctggac ctagatgcttgcgtgtggttggaagaagaactaaaaacttttaagcgcatcttggtcctc gtctcccattcccaggattttctgaatggtgtctgtaccaatatcattcacatgcacaac aagaaactgaagtattatacgggtaattatgatcagtacgtgaagacgcggctagagctg gaggagaaccagatgaagaggtttcactgggagcaagatcagattgcacacatgaagaac tacattgcgaggtttggtcatggcagtgccaagctggcccggcaggcccagagcaaggag aagacgctacagaaaatgatggcatcaggactgacagagagggtcgtgagcgataagaca ctgtcattttatttcccaccatgtggcaagatccctccacctgtcattatggtgcaaaat gtgagcttcaagtatacaaaagatgggccttgcatctacaataatctagaatttggaatt gaccttgacacacgagtggctctggtagggcccaatggagcagggaagtcaactcttctg aagctgctaactggagagctactacccacagatggcatgatccgaaaacactctcatgtc aagatagggcgttaccatcagcatttacaagagcagctggacttagatctctcacctttg gagtacatgatgaagtgctacccagagatcaaggagaaggaagaaatgaggaagatcatt gggcgatacggtctcactgggaaacaacaggtgagcccaatccggaacttgtcagacggg cagaagtgccgagtgtgtctggcctggctggcctggcagaacccccacatgctcttcctg gatgaacccaccaatcacctggatatcgagaccatcgacgccctggcagatgccatcaat gagtttgagggtggtatgatgctggtcagccatgacttcagactcattcagcaggttgca caggaaatttgggtctgtgagaagcagacaatcaccaagtggcctggagacatcctggct tacaaggagcacctcaagtccaagctggtggatgaggagccccagctcaccaagaggacc cacaacgtgtgcatgtccccagccacacctggtggggcagctgacttgataaaaagtagc tccaagggggtgggtgagggctattcgagaagcaggaggatgagcgctgagtatggacag cggcagcagcccgggggccgtgggggtcgcagttctgggaacaaaaagtccaaaaagcgc tgtcggcgcaaggagtcctactccatgtatatctacaaggtgctgaagcaggtgcaccct gacattggcatctctgccaaggccatgagcatcatgaactcgtttgtaaacgacgtgttt gagcagctggcgtgtgaggctgcccggctggcccagtactcgggccggaccaccctgaca tcccgagaagtccagacggctgtgcgtctgctgctgcctggggagctggccaagcacgct gtgtctgagggcaccaaggctgtcaccaagtacaccagctccaaagcttaccagcgcctc tgggagtcctcccacgcgaccctgcaggagctgctggaccaagagcagcttctgctcgaa cctgcgcccgaccgggagcggcagtccttccagtacaggctcgcatcgctctacctgcac tacctggggctgctgcgccgcttcgacaccgtctatgaccagatggtgcagccgcagaag cggcggctgctgcgacgcctgctggacggcgtggcgggccgcgtgctggagctcaaggac gagctggtgcgcgccgacctgtgtgagaaccactgcctggaccgcgtgctgcaggatttc aagctcaccccagctgacctggaggttccaatccccaaatacttcctgctggagcagtcc accactgtgcgggagcgagggctgatactggccgagatcctgtccagactggagccagtg tcctcccagaagagctttacaggaatgcaccggactgaggccataattctagtgcaaaag gcagagcgggcgaggcaaggccggcttcgagccaccttcatgcgagagattcgaagagat gaggagcaggatgggaggattcgggaggatggatggcacaagttcagtcagggccaagca gctgtcaccatacagaaggtgtggaaaggctatctgcagaggaaacgcactcagcaggac cggcgaatggagatggagttcattggcatgctcccctcacccaaccaggtagagcacctg agcatcatctcccagccatgcctcgtggaggatgtccagaggctccgccagatggagaaa gaggaggagttccgggcagcaatggtgaaggcccatgattctctggtagagacagagggg cctgacatgaaggagaaaatgaaggagcaaatccgacagtggtttatcgagtgccatgac cttactggccggttccctgattatccagatgcgtcttcaggtggatcctactcaatcttt gcagacaaaacgccagaacaggtgagaatggaactggaaatgcagatgcaagagaacaga aaaaaggaacaagagaagagcaaggaaaaaggaaaggatgaaaaagagaagaagaaagga aaggaagaaaaggccaagaagggggaagtggatgcggtgttgcaagtgttgccatccaaa tgtatccctatgatctgtgccgggcatgaagaatacttaaatacatggaagaaccggtgt gagagcatacaccccagtcagaattatgactccgagaccctccgggaggagaagaggaaa gaggtggagctggagatccggatacagaaaacacctgggaagaaaactgggaaaaagaag gaaaaagatctgacctcagacaggtctgtggagtctctgtacgaagagcttgttatttct ggccttctaaggaagagtgagtcagtagcattgaaagactacataggtgacttcctgtat cttggatccactctgagtttggtgaagaagttgcccatgccatccctgtttgacatacga cagaatgtggctttgtatgcggtccttcggcttggctccccggatatccacatcatggcc ccactcatccgctccatcctcctggtgggcccctctggcatggggaagaagatgctggtc aaggcagtgtgcacagaaactggcgccaacctgttcgacctgtcgccggaaaacctgctg ggcaaatatcctggcaggaatggggcacagatgatggtgcatatagtctttaaggtccgg agcccccagcttttggcatttgcggttctcagacctgtggcaacaaaggaggggggcttt gaggtcacgtcacccatcagcctactgggtatgatttcaggtgctgaactggtagcccga ctcctacagccctctgtgatttggattggaaatgctgagaagaatttctataagaagacc cccaaagaagacaaggagatggacccaaagcggataaagaaggacctcaccaaggccttg cggctgctgactcccggagaccgtgtgatgctgattgggacaacctcccggccacagctg gccgagatgcggggtctgtgccgggtctacgagcggatcctcttcatgccccggcctgac tacgcatcccgctatggtgaggcctggggatgtggggacggctgcgcctaccaaagcctg cccacctcccgccccataccccattgctgctgcgagccctcacccccacagcccggtctt ctggaagggctgctggacaagcagcccagtggcctgggtcctggctttggctctgtggac gcgagcaggccccatgtcttccacatgcctgcagtgctctggaagcgtatgatagaggcc cggggcatccagccgacccagcacctggacatcagtgccctagccaaggtctccgatggc tacacgccgggtcacattctccaagccatccaatcggtgctgagtgagcggcggttcctg caactgtccaagcggcccctggtggcctctgagttcctgggacaactggtgaagctggat ccagtgtacagggaggaggaggagtccctgaaggactggtacttcaagaccccactgggc aagaagagcatgaaacacaggatggaccagttggaggccgaggaggccaagctggacaag gagaagaagaaaaggaagtga >gi568815591r:151139106_151375152|GENSCAN_predicted_peptide_5|772_aa MRLSSLLALLRPALPLILGLSLGCSLSLLRVSWIQGEGEDPCVEAVGERGGPQNPDSRAR LDQSDEDFKPRIVPYYRDPNKPYKKVLRTRYIQTELGSRERLLVAVLTSRATLSTLAVAV NRTVAHHFPRLLYFTGQRGARAPAGMQVVSHGDERPAWLMSETLRHLHTHFGADYDWFFI MQDDTYVQAPRLAALAGHLSINQDLYLGRAEEFIGAGEQARYCHGGFGYLLSRSLLLRLR PHLDGCRGDILSARPDEWLGRCLIDSLGVGCVSQHQGQQYRSFELAKNRDPEKEGSSAFL SAFAVHPVSEGTLMYRLHKRFSALELERAYSEIEQLQAQIRNLTVLTPEGEAGLSWPVGL PAPFTPHSRFEVLGWDYFTEQHTFSCADGAPKCPLQGASRADVGDALETALEQLNRRYQP RLRFQKQRLLNGYRRFDPARGMEYTLDLLLECVTQRGHRRALARRVSLLRPLSRVEILPM PYVTEATRVQLVLPLLVAEAAAAPAFLEAFAANVLEPREHALLTLLLVYGPREGGRGAPD PFLGVKAAAAELERRYPGTRLAWLAVRAEAPSQVRLMDVVSKKHPVDTLFFLTTVWTRPG PEVLNRCRMNAISGWQAFFPVHFQEFNPALSPQRSPPGPPGAGPDPPSPPGADPSRGAPI GGRFDRQASAEGCFYNADYLAARARLAGELAGQEEEEALEGLEVMDVFLRFSGLHLFRAV EPGLVQKFSLRDCSPRLSEELYHRCRLSNLEGLGGRAQLAMALFEQEQANST >gi568815591r:151139106_151375152|GENSCAN_predicted_CDS_5|2319_bp atgcgactgagctccctgttggctctgctgcggccagcgcttcccctcatcttagggctg tctctggggtgcagcctgagcctcctgcgggtttcctggatccagggggagggagaagat ccctgtgtcgaggctgtaggggagcgaggagggccacagaatccagattccagagctcgg ctagaccaaagtgatgaagacttcaaaccccggattgtcccctactacagggaccccaac aagccctacaagaaggtgctcaggactcggtacatccagacagagctgggctcccgtgag cggttgctggtggctgtcctgacctcccgagctacactgtccactttggccgtggctgtg aaccgtacggtggcccatcacttccctcggttactctacttcactgggcagcggggggcc cgggctccagcagggatgcaggtggtgtctcatggggatgagcggcccgcctggctcatg tcagagaccctgcgccaccttcacacacactttggggccgactacgactggttcttcatc atgcaggatgacacatatgtgcaggccccccgcctggcagcccttgctggccacctcagc atcaaccaagacctgtacttaggccgggcagaggagttcattggcgcaggcgagcaggcc cggtactgtcatgggggctttggctacctgttgtcacggagtctcctgcttcgtctgcgg ccacatctggatggctgccgaggagacattctcagtgcccgtcctgacgagtggcttgga cgctgcctcattgactctctgggcgtcggctgtgtctcacagcaccaggggcagcagtat cgctcatttgaactggccaaaaatagggaccctgagaaggaagggagctcggctttcctg agtgccttcgccgtgcaccctgtctccgaaggtaccctcatgtaccggctccacaaacgc ttcagcgctctggagttggagcgggcttacagtgaaatagaacaactgcaggctcagatc cggaacctgaccgtgctgacccccgaaggggaggcagggctgagctggcccgttgggctc cctgctcctttcacaccacactctcgctttgaggtgctgggctgggactacttcacagag cagcacaccttctcctgtgcagatggggctcccaagtgcccactacagggggctagcagg gcggacgtgggtgatgcgttggagactgccctggagcagctcaatcggcgctatcagccc cgcctgcgcttccagaagcagcgactgctcaacggctatcggcgcttcgacccagcacgg ggcatggagtacaccctggacctgctgttggaatgtgtgacacagcgtgggcaccggcgg gccctggctcgcagggtcagcctgctgcggccactgagccgggtggaaatcctacctatg ccctatgtcactgaggccacccgagtgcagctggtgctgccactcctggtggctgaagct gctgcagccccggctttcctcgaggcctttgcagccaatgtcctggagccacgagaacat gcattgctcaccctgttgctggtctacgggccacgagaaggtggccgtggagctccagac ccatttcttggggtgaaggctgcagcagcggagttagagcgacggtaccctgggacgagg ctggcctggctcgctgtgcgagcagaggccccttcccaggtgcgactcatggacgtggtc tcgaagaagcaccctgtggacactctcttcttccttaccaccgtgtggacaaggcctggg cccgaagtcctcaaccgctgtcgcatgaatgccatctctggctggcaggccttctttcca gtccatttccaggagttcaatcctgccctgtcaccacagagatcacccccagggcccccg ggggctggccctgaccccccctcccctcctggtgctgacccctcccggggggctcctata ggggggagatttgaccggcaggcttctgcggagggctgcttctacaacgctgactacctg gcggcccgagcccggctggcaggtgaactggcaggccaggaagaggaggaagccctggag gggctggaggtgatggatgttttcctccggttctcagggctccacctctttcgggccgta gagccagggctggtgcagaagttctccctgcgagactgcagcccacggctcagtgaagaa ctctaccaccgctgccgcctcagcaacctggaggggctagggggccgtgcccagctggct atggctctctttgagcaggagcaggccaatagcacttag >gi568815591r:151139106_151375152|GENSCAN_predicted_peptide_6|977_aa MSMSREVPTFRTTLAVFVSHLHQKTPLSSKADLSVHSFILLANTYCAPTVCEVLGMLQQS NGHQSPLTEKTYSKEERGLAVMTGSAGVDHGLGQGFQQADLYRLHYPSSLIFWLLVGFAN AGPSRRLEVWRKGQSSDMPFGVPMRINVKNTHKALNTVTATARDRQIRSLRLPRSGLQRR RRRAEPSAEQGAGGRAPGRGPGGAGPEPMEKQAQASVNLSARKFKLKRGLNLRPKCRRQV LLPNLGLLTWAWEQKEMICVLKQTRLEEECWPQGPVGSSKFSKPSTKWHQEGKHFQRQRL EGRQQHPHVTDEDRAPGRKRHSPGHMARHEQIPSRPEATRLPPRSLAANIWMQPASVERW EGPYSQQVRALPNSGSLDVALLPASRATPWKKCERAQQDSEGRVGGKKKAEKGKKEEERE RVRGAAEPTPMAADEVAGGARKATKSKLFEFLVHGVRPGMPSGARMPHQGAPMGPPGSPY MGSPAVRPGLAPAGMEPARKRAAPPPGQSQAQSQGQPVPTAPARSRRIAQHAYIPWYNPG LEPLTPALLLLLQIRELVPESQAYMDLLAFERKLDQTIMRKRVDIQEALKRPMKQKRKLR LYISNTFNPAKPDAEDSDGSIASWELRVEGKLLDDPSKQKRKFSSFFKSLVIELDKDLYG PDNHLVEWHRTPTTQETDGFQVKRPGDLSVRCTLLLMLDYQPPQFKLDPRLARLLGLHTQ SRSAIVQALWQYVKTNRLQDSHDKEYINGDKYFQQNKRIYNCMMISSRRVISLSPIVLLQ IFDCPRLKFSEIPQRLTALLLPPDPIVINHVISVDPSDQKKTACYDIDVEVEEPLKGQMS SFLLSTANQQEISALDSKIHETIESINQLKIQRDFMLSFSRDPKGYVQDLLRSQSRDLKV KRETQGVLEWKTSTAGKRVNVVLQGQVMTDVAGNPEEERRAEFYHQPWSQEAVSRYFYCK IQQRRQELEQSLVVRNT >gi568815591r:151139106_151375152|GENSCAN_predicted_CDS_6|2934_bp atgagcatgagtagagaggtgcccacgttccggaccactctcgctgtatttgtctcccat ctccatcagaagactccgctgagctccaaggcagatctttctgttcactcatttatcctc ttggcaaatacgtattgtgcacctactgtgtgcgaggtgctagggatgctgcagcaatcc aacgggcaccaatctcccctcacagagaagacatactccaaagaggaaaggggcctcgct gtgatgacaggcagtgcaggtgtagaccatggccttggacaggggttccaacaagctgac ctctacagactgcattaccccagctctcttatcttctggcttctggttggatttgccaat gcaggccccagcagaagactggaggtgtggaggaaagggcaatcatcagacatgcctttt ggagtgcctatgaggatcaacgtaaagaacacacataaagcactcaacacagtgacagcc acagccagggatcgacaaatacgctccctgcgcctcccccgctccgggctgcagcggcgc aggcgccgggccgagccgagcgccgagcagggagcgggcggccgcgctccgggccggggt cccgggggagcaggtcctgaacccatggaaaaacaggcacaggcatcagttaacttaagt gcaaggaaattcaagctgaagagaggcttgaacctgaggcccaagtgtagaaggcaggtc ctactccctaacctgggtttgcttacctgggcatgggaacagaaggagatgatctgtgtc ctaaaacagactcgtctagaagaggaatgctggccacaggggccagttggatcctccaag ttctccaaacctagcaccaagtggcaccaggaaggcaagcatttccagaggcagaggttg gagggaaggcagcagcaccctcatgtcacagatgaggacagagctccagggaggaaacgg cattccccgggccacatggcccgacatgagcagatcccaagtcgaccagaagccacaagg ctgccccccagatccctggcagccaacatctggatgcagccagcatctgtagagagatgg gaagggccatacagccagcaggtcagggcactgcccaattcagggtccctggacgtggcc ctattgcccgccagcagagccacaccctggaagaagtgtgagagagcccagcaggactca gaggggagagttggaggaaaaaaaaaggcagaaaagggaaagaaagaggaagagagagag agagtgagaggagccgctgagcccaccccgatggccgcggacgaagttgccggaggggcg cgcaaagccacgaaaagcaaactttttgagtttctggtccatggggtgcgccccgggatg ccgtctggagcccggatgccccaccagggggcgcccatgggccccccgggctccccgtac atgggcagccccgccgtgcgacccggcctggcccccgcgggcatggagcccgcccgcaag cgagcagcgcccccgcccgggcagagccaggcacagagccagggccagccggtgcccacc gcccccgcgcggagccgcaggattgcacaacatgcatacattccatggtacaaccctgga cttgagcctctgactcctgccctgctcctccttctccagattcgggagctggtccccgag tcccaggcttacatggacctcttggcatttgagaggaaactggatcaaaccatcatgcgg aagcgggtggacatccaggaggctctgaagaggcccatgaagcaaaagcggaagctgcga ctctatatctccaacacttttaaccctgcgaagcctgatgctgaggattccgacggcagc attgcctcctgggagctacgggtggaggggaagctcctggatgatcccagcaaacagaag cggaagttctcttctttcttcaagagtttggtcatcgagctggacaaagatctttatggc cctgacaaccacctcgttgagtggcatcggacacccacgacccaggagacggacggcttc caggtgaaacggcctggggacctgagtgtgcgctgcacgctgctcctcatgctggactac cagcctccccagttcaaactggatccccgcctagcccggctgctggggctgcacacacag agccgctcagccattgtccaggccctgtggcagtatgtgaagaccaacaggctgcaggac tcccatgacaaggaatacatcaatggggacaagtatttccagcagaacaaaaggatctac aactgcatgatgatctcttcaagaagagtgatctctctctccccaattgtcctgcttcag atttttgattgtccccggctgaagttttctgagattccccagcgcctcacagccctgcta ttgccccctgacccaattgtcatcaaccatgtcatcagcgtggacccttcagaccagaag aagacggcgtgctatgacattgacgtggaggtggaggagccattaaaggggcagatgagc agcttcctcctatccacggccaaccagcaggagatcagtgctctggacagtaagatccat gagacgattgagtccataaaccagctcaagatccagagggacttcatgctaagcttctcc agagaccccaaaggctatgtccaagacctgctccgctcccagagccgggacctcaaggtg aagagggagactcagggagtactggagtggaagacaagcacagcgggaaagagagtaaac gttgtactccaggggcaggtgatgacagatgtagccggcaaccctgaagaggagcgccgg gctgagttctaccaccagccctggtcccaggaggccgtcagtcgctacttctactgcaag atccagcagcgcaggcaggagctggagcagtcgctggttgtgcgcaacacctag >gi568815591r:151139106_151375152|GENSCAN_predicted_peptide_7|549_aa MVVWDADTQQVIPNGIQLAGLDKSHSGFALAPPTTLFPSGGGGGGAKATAAAGAGLASPG MKTNGGRCRIRALCWSRREWRGAGEDTAAECPRPQPQQHCLAPRFPVRLGTSPGQGWSGR GAGDLAKQYSDRLECCENEVEKVIEEIRCKAIERGTGNDNYRTTGIATIEVFLPPRLKKD RKNLLETRLHITGRELRSKIAETFGLQENYIKIVINKKQLQLGKTLEEQGVAHNVKAMVL ELKQSEEDARKNFQLEEEEQNEAKLKEKQIQRTKRGLEILAKRAAETVVDPEMTPYLDIA NQTGRSIRIPPSERKALMLAMGYHEKGRAFLKRKEYGIALPCLLDADKYFCECCRELLDT VDNYAVLQLDIVWCYFRLEQLECLDDAEKKLNLAQKCFKNCYGENHQRLVHIKGNCGKEK VLFLRLYLLQGIRNYHSGNDVEAYEYLNKARQLFKELYIDPSKVDNLLQLGFTAQEARLG LRACDGNVDHAATHITNRREELAQIRKEEKEKKRRRLENIRFLKGMGYSTHAAQQVLHAA SGNLDEALK >gi568815591r:151139106_151375152|GENSCAN_predicted_CDS_7|1647_bp atggttgtatgggatgcagacacacagcaggttattcccaatggaatacagcttgctgga ctggataaaagccactctggcttcgccttggccccgcccacaaccctctttccaagcggc ggcggcggcggcggcgcgaaggcgacagcggcggcgggggcggggctggcctcacccgga atgaaaacaaacggcggccgctgccgcatccgggcactctgctggtcgcggcgggagtgg cgtggcgcaggtgaggacacggcggccgagtgtcctcgaccccagcctcagcagcactgc ttggcgccccggttcccggtccgactgggcacctctcctggccaggggtggagcggccgc ggggcgggggaccttgctaagcagtactctgacagactagaatgctgtgaaaatgaagta gaaaaggtaatagaagaaatacgttgcaaggcaattgagcgtggaacaggaaatgacaat tatagaacaacgggaattgctacaatcgaggtgtttttaccaccaagactaaaaaaagat aggaaaaacttgttggagacccgattgcacatcactggcagagaactgaggtccaaaata gctgaaacctttggacttcaagaaaattatatcaaaattgtcataaataagaagcaacta caactagggaaaacccttgaagaacaaggcgtggctcacaatgtgaaagcgatggtgctt gaactaaaacaatctgaagaggacgcgaggaaaaacttccagttagaggaagaggagcaa aatgaggccaaactcaaagaaaaacaaattcagaggaccaagagaggactagaaatactg gcaaagagagcagcagagacagtggtggatccagaaatgacaccgtacttagacatagct aaccagacaggcagatcaatcagaattcccccatcagaaagaaaagcccttatgttagct atgggatatcatgagaagggcagagctttcctgaaaagaaaagaatatggaatagccttg ccatgtctgttggacgctgacaaatatttctgtgagtgttgcagagagctgctggacaca gtggataactacgccgtcctccagctggatatagtgtggtgttacttccgcctggaacag ctggaatgccttgatgatgcagaaaaaaaattaaacttggcccagaaatgctttaaaaat tgttacggagaaaatcatcagagactggtccacataaaaggaaattgtgggaaagagaag gtactgtttctaagactctacttacttcaagggatccgaaactatcacagtggaaatgat gtagaggcttatgagtatcttaacaaggcacgtcagctctttaaagagctatatattgat ccatcaaaagtggacaatttgttgcagttggggtttactgcccaggaagcccggcttggc ctgagggcgtgtgatgggaacgtggatcatgcggccactcatattaccaaccgcagagag gaactggcccaaataaggaaggaggaaaaagagaagaaaagacgccgcctcgagaacatc aggtttctgaaagggatgggctactccacgcacgcggcccagcaggtactccacgcagcc agcgggaacttggatgaggccctgaag