GENSCAN 1.0 Date run: 7-Nov-116 Time: 02:24:08 Sequence gi568815587r:57226410_57435121 : 208712 bp : 48.36% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 PlyA - 2927 2922 6 1.05 1.01 Sngl - 10595 9453 1143 2 0 99 49 1992 0.997 192.82 1.00 Prom - 20745 20706 40 -5.16 2.00 Prom + 22038 22077 40 -2.26 2.01 Init + 32137 32283 147 0 0 44 119 92 0.491 7.99 2.02 Intr + 36750 36780 31 2 1 126 78 -10 0.155 -0.50 2.03 Term + 43369 43478 110 2 2 102 42 54 0.210 0.87 2.04 PlyA + 45767 45772 6 1.05 3.14 PlyA - 48011 48006 6 1.05 3.13 Term - 54175 54003 173 2 2 106 43 87 0.555 3.99 3.12 Intr - 67255 67158 98 0 2 79 25 149 0.181 7.35 3.11 Intr - 74632 74475 158 1 2 83 73 100 0.985 6.81 3.10 Intr - 75534 75398 137 1 2 88 76 145 0.995 13.59 3.09 Intr - 75815 75665 151 2 1 30 91 199 0.895 14.14 3.08 Intr - 76416 76050 367 2 1 116 78 146 0.612 11.55 3.07 Intr - 84147 81986 2162 0 2 88 90 1082 0.975 95.74 3.06 Intr - 87480 86125 1356 0 0 114 78 427 0.256 33.01 3.05 Intr - 91478 91409 70 1 1 95 76 106 0.760 9.28 3.04 Intr - 94303 93670 634 2 1 101 57 431 0.965 32.49 3.03 Intr - 95541 95383 159 1 0 126 81 65 0.497 9.56 3.02 Intr - 98092 97982 111 2 0 50 28 105 0.582 1.05 3.01 Init - 98602 98431 172 1 1 38 77 219 0.990 13.40 3.00 Prom - 99037 98998 40 -9.06 4.18 PlyA - 99602 99597 6 1.05 4.17 Term - 100069 99998 72 1 0 74 43 90 0.990 1.01 4.16 Intr - 100480 100258 223 0 1 95 72 159 0.735 13.13 4.15 Intr - 101105 101017 89 2 2 7 105 112 0.999 3.57 4.14 Intr - 101473 101303 171 1 0 35 105 240 0.995 20.54 4.13 Intr - 102017 101888 130 1 1 100 46 321 0.987 29.80 4.12 Intr - 103729 103684 46 2 1 100 64 60 0.973 2.27 4.11 Intr - 104020 103882 139 1 1 89 80 274 0.979 26.74 4.10 Intr - 104518 104446 73 0 1 93 65 141 0.999 11.71 4.09 Intr - 105480 105259 222 2 0 53 78 386 0.989 31.34 4.08 Intr - 105871 105743 129 0 0 106 76 187 0.999 19.11 4.07 Intr - 106077 105974 104 0 2 54 72 127 0.997 6.67 4.06 Intr - 106446 106216 231 0 0 92 91 296 0.729 28.27 4.05 Intr - 106740 106550 191 1 2 48 78 218 0.962 16.10 4.04 Intr - 107131 107026 106 1 1 93 89 81 0.998 8.49 4.03 Intr - 108239 108054 186 2 0 87 89 209 0.998 20.79 4.02 Intr - 108751 108659 93 1 0 83 81 69 0.317 5.86 4.01 Init - 109114 108956 159 1 0 45 34 173 0.157 5.43 4.00 Prom - 111063 111024 40 -6.26 5.00 Prom + 111304 111343 40 -12.59 5.01 Init + 112142 112260 119 1 2 76 96 209 0.998 20.17 5.02 Intr + 119468 119503 36 2 0 103 98 38 0.927 3.68 5.03 Intr + 120135 120270 136 0 1 111 80 229 0.842 24.97 5.04 Intr + 120707 120778 72 1 0 107 78 88 0.998 9.30 5.05 Intr + 121006 121069 64 0 1 104 84 108 0.625 10.29 5.06 Intr + 121761 121854 94 1 1 62 83 156 0.999 11.52 5.07 Intr + 122218 122295 78 1 0 97 81 112 0.998 10.07 5.08 Intr + 123348 123489 142 0 1 142 89 155 0.999 21.36 5.09 Intr + 124353 124489 137 2 2 61 91 210 0.893 17.97 5.10 Intr + 141600 141693 94 0 1 74 82 248 0.950 22.77 5.11 Intr + 141963 142028 66 2 0 87 89 105 0.898 9.60 5.12 Intr + 142952 143029 78 1 0 113 105 175 0.999 21.45 5.13 Term + 143475 143588 114 2 0 109 49 164 0.999 13.17 5.14 PlyA + 144175 144180 6 1.05 6.09 PlyA - 147412 147407 6 1.05 6.08 Term - 150499 150441 59 2 2 57 46 85 0.804 -0.95 6.07 Intr - 151427 151316 112 2 1 93 100 17 0.882 3.35 6.06 Intr - 152403 152272 132 0 0 89 106 89 0.990 11.64 6.05 Intr - 153398 153085 314 0 2 87 111 313 0.951 29.40 6.04 Intr - 161456 161345 112 2 1 123 78 71 0.209 9.55 6.03 Intr - 162299 162168 132 2 0 61 103 94 0.998 9.04 6.02 Intr - 162908 162601 308 0 2 39 92 359 0.814 27.57 6.01 Init - 163535 163478 58 2 1 85 97 -25 0.259 -0.41 6.00 Prom - 165059 165020 40 -6.76 7.00 Prom + 168869 168908 40 -5.46 7.01 Init + 171583 171627 45 0 0 34 68 35 0.237 -3.22 7.02 Intr + 175278 175388 111 2 0 97 46 38 0.513 1.08 7.03 Term + 176987 177262 276 1 0 5 43 712 0.962 53.86 7.04 PlyA + 177793 177798 6 1.05 8.14 PlyA - 178162 178157 6 1.05 8.13 Term - 181487 181383 105 2 0 139 32 39 0.988 1.81 8.12 Intr - 182889 182766 124 2 1 93 83 150 0.999 15.59 8.11 Intr - 183712 183526 187 2 1 87 85 240 0.880 22.35 8.10 Intr - 188322 188206 117 1 0 111 94 136 0.999 16.94 8.09 Intr - 188697 188524 174 1 0 80 87 128 0.896 11.81 8.08 Intr - 190261 190164 98 0 2 60 105 87 0.998 7.25 8.07 Intr - 191478 191339 140 0 2 127 82 56 0.989 8.16 8.06 Intr - 194655 194563 93 0 0 50 94 55 0.452 2.46 8.05 Intr - 194964 194888 77 1 2 34 75 120 0.976 4.53 8.04 Intr - 197619 197573 47 2 2 93 109 76 0.991 8.25 8.03 Intr - 199261 199132 130 2 1 62 103 158 0.988 14.45 8.02 Intr - 199751 199580 172 2 1 26 95 185 0.440 12.52 8.01 Init - 200623 200477 147 1 0 113 41 188 0.633 14.69 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 16071 15997 75 0 0 133 92 65 0.848 11.11 S.002 Init - 154299 154239 61 0 1 45 106 83 0.886 5.21 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587r:57226410_57435121|GENSCAN_predicted_peptide_1|380_aa MEEGGDFDNYYGADNQSECEYTDWKSSGALIPAIYMLVFLLGTTGNGLVLWTVFRSSREK RRSADIFIASLAVADLTFVVTLPLWATYTYRDYDWPFGTFFCKLSSYLIFVNMYASVFCL TGLSFDRYLAIVRPVANARLRLRVSGAVATAVLWVLAALLAMPVMVLRTTGDLENTTKVQ CYMDYSMVATVSSEWAWEVGLGVSSTTVGFVVPFTIMLTCYFFIAQTIAGHFRKERIEGL RKRRRLLSIIVVLVVTFALCWMPYHLVKTLYMLGSLLHWPCDFDLFLMNIFPYCTCISYV NSCLNPFLYAFFDPRFRQACTSMLCCGQSRCAGTSHSSSGEKSASYSSGHSQGPGPNMGK GGEQMHEKSIPYSQETLVVD >gi568815587r:57226410_57435121|GENSCAN_predicted_CDS_1|1143_bp atggaggaaggtggtgattttgacaactactatggggcagacaaccagtctgagtgtgag tacacagactggaaatcctcgggggccctcatccctgccatctacatgttggtcttcctc ctgggcaccacgggcaacggtctggtgctctggaccgtgtttcggagcagccgggagaag aggcgctcagctgatatcttcattgctagcctggcggtggctgacctgaccttcgtggtg acgctgcccctgtgggctacctacacgtaccgggactatgactggccctttgggaccttc ttctgcaagctcagcagctacctcatcttcgtcaacatgtacgccagcgtcttctgcctc accggcctcagcttcgaccgctacctggccatcgtgaggccagtggccaatgctcggctg aggctgcgggtcagcggggccgtggccacggcagttctttgggtgctggccgccctcctg gccatgcctgtcatggtgttacgcaccaccggggacttggagaacaccactaaggtgcag tgctacatggactactccatggtggccactgtgagctcagagtgggcctgggaggtgggc cttggggtctcgtccaccaccgtgggctttgtggtgcccttcaccatcatgctgacctgt tacttcttcatcgcccaaaccatcgctggccacttccgcaaggaacgcatcgagggcctg cggaagcggcgccggctgctcagcatcatcgtggtgctggtggtgacctttgccctgtgc tggatgccctaccacctggtgaagacgctgtacatgctgggcagcctgctgcactggccc tgtgactttgacctcttcctcatgaacatcttcccctactgcacctgcatcagctacgtc aacagctgcctcaaccccttcctctatgcctttttcgacccccgcttccgccaggcctgc acctccatgctctgctgtggccagagcaggtgcgcaggcacctcccacagcagcagtggg gagaagtcagccagctactcttcggggcacagccaggggcccggccccaacatgggcaag ggtggagaacagatgcacgagaaatccatcccctacagccaggagacccttgtggttgac tag >gi568815587r:57226410_57435121|GENSCAN_predicted_peptide_2|95_aa MWYIYTMEYYTTINKNEIMSFAATWMELEVTTLSKLPQEQKIKYHMFSLEPWWAVPKHRG GAELRAWEWREQGEQQSKGGALNHCGHFPMGTQKF >gi568815587r:57226410_57435121|GENSCAN_predicted_CDS_2|288_bp atgtggtacatctataccatggaatactatacaaccataaataagaacgagatcatgtcc tttgcagcaacatggatggagctggaagtcactaccctaagcaaactaccacaggaacag aaaatcaaataccacatgttctcactggaaccttggtgggcagttcctaaacacagaggg ggagcagagctcagggcctgggaatggagggagcaaggagagcagcagagcaagggaggg gccctcaaccactgtggccacttccccatgggaacgcagaagttctga >gi568815587r:57226410_57435121|GENSCAN_predicted_peptide_3|1915_aa MKRPGGVDLGGVGAGPRRGSQWLRPAVTAAAAAVAAAAAAGTGIGAPSRAEPDPGRAPPS PGSFGTPPATPVRPHLIDVCSPPSVRRLLHSPLTGLQPPQLIPLCLPALKEGLPHVMKVS TLRESSAMASPLPREMEEELVPTGSEPGDTRAKPPVKPKPRALPAKPALPAKPSLLVPVG PRPPRGPLAELPSARKMNMLAGPQPYGGSKRPLPFAPRPAVEASTGGEATQETGKEEAGK EEPPPLTPPARCAAPGGVRKAPAPFRPASERFAATTVEEILAKMEQPRKEVLASPDRLWG SRLTFNHDGSSRYGPRTYGTTTAPRDEDGSTLFRGWSQEGPVKSPAECREEHSKTPEERS LPSDLAFNGDLAKAASSELPADISKPWIPSSPAPSSENGGPASPGLPAEASGSGPGSPHL HPPDKSSPCHSQLLEAQTPEASQASPCPAVTPSAPSAALPDEGSRHTPSPGLPAEGAPEA PRPSSPPPEVLEPHSLDQPPATSPRPLIEVGELLDLTRTFPSGGEEEAKGDAHLRPTSLV QRRFSEGVLQSPSQDQEKLGGSLAALPQGQGSQLALDRPFGAESNWSLSQSFEWTFPTRP SGLGVWRLDSPPPSPITEASEAAEAAEAGNLAVSSREEGVSQQGQGAGSAPSGSGSSWVQ GDDPSMSLTQKGDGESQPQFPAVPLEPLPTTEGTPGLPLQQAEERYESQEPLAGQESPLP LATREAALPILEPVLGQEQPAAPDQPCVLFADAPEPGQALPVEEEAVTLARAETTQARTE AQDLCRASPEPPGPESSSRWLDDLLASPPPSGGGARRGAGAELKDTQSPSTCSEGLLGWS QKDLQSEFGITGDPQPSSFSPSSWCQGASQDYGLGGASPRGDPGLGERDWTSKYGQGAGE GSTREWASRCGIGQEEMEASSSQDQSKVSAPGVLTAQDRVVGKPAQLGTQRSQEADVQDW EFRKRDSQGTYSSRDAELQDQEFGKRDSLGTYSSRDVSLGDWEFGKRDSLGAYASQDANE QGQDLGKRDHHGRYSSQDADEQDWEFQKRDVSLGTYGSRAAEPQEQEFGKSAWIRDYSSG GSSRTLDAQDRSFGTRPLSSGFSPEEAQQQDEEFEKKIPSVEDSLGEGSRDAGRPGERGS GGLFSPSTAHVPDGALGQRDQSSWQNSDASQEVGGHQERQQAGAQGPGSADLEDGEMGKR GWVGEFSLSVGPQREAAFSPGQQDWSRDFCIEASERSYQFGIIGNDRVSGAGFSPSSKME GGHFVPPGKTTAGSVDWTDQLGLRNLEVSSCVGSGGSSEARESAVGQMGWSGGLSLRDMN LTGCLESGGSEEPGGIGVGEKDWTSDVNVKSKDLAEVGEGGGHSQARESGVGQTDWSGVE AGEFLKSRERGVGQADWTPDLGLRNMAPGAVCSPGESKELGVGQMDWGNNLGLRDLEVTC DPDSGGSQGLRGCGVGQMDWTQDLAPQNVELFGAPSEAREHGVGGVSQCPEPGLRHNGSL SPGLEARDPLEARELGVGETSGPETQGEDYSSSSLEPHPADPGMETGEALSFGASPGRCP ARPPPSGSQGLLEEMLAASSSKAVARRESAASGLGGLLEEEGAGAGAAQEEVLEPGRDSP PSWRPQPDGEASQTEDVDGTWGSSAARWSDQGPAQTSRRPSQGPPARSPSQDFSFIEDTE ILDSAMYRSRANLGRKRGHRAPVIRPGGTLGLSEAADSDAHLFQDSTEPRASRVPSSDEE VVEEPQSRRTRMSLGTKGLKVNLFPGLSPSALKAKLRPRNRSAEEGELAESKSSQKESAV QRSKSCKVPGLGKPLTLPPKPEKSSGIQELQLVFGPKNELAFVHGALSIGVVVVGVRGEQ VSVGLEPMCDLAEFMPHPALQPYLVVVELCYSPLYMVSAGYSVSSSSNGPGHAGH >gi568815587r:57226410_57435121|GENSCAN_predicted_CDS_3|5748_bp atgaagcgccctgggggcgtggacctgggcggagttggggcggggccccgccggggctcg cagtggcttcgtcccgcggtgacggcggcggcggcggcggtagcagcggcggcggcggcg gggactggcatcggggccccgagccgagcggagccggaccccgggcgagcgcccccctcc cccggctccttcggcacgcccccagccaccccagtccgcccccacctgatcgacgtctgc agcccaccgtctgttcgccggctgcttcacagcccgctcacaggtctgcagccaccccag ctcatacctctctgcctccccgctctcaaggagggtctgccgcatgtgatgaaagtgtct actctcagggaaagctcagccatggcttccccactgccccgggagatggaggaggagctg gtgcctactggctctgagccaggtgacactcgggccaaaccccctgtcaagcccaaaccc cgggccctgcctgccaagccagccctgcctgccaaacccagcctgctggtgcctgttggg cctcggcctccccggggtcccctggctgagttgccttctgccaggaagatgaacatgctg gcaggaccccagccctatggtggcagcaagcgcccccttccctttgcaccaaggcctgcg gttgaggcctccactggaggagaagccacccaagagactgggaaagaggaggctgggaaa gaggagccaccccctttgacacccccagctcgatgtgcagccccagggggtgtacggaag gcccctgcccctttccgcccagcctcagagcgcttcgcggccaccacggtggaagagatc ctggccaagatggagcagcctcggaaggaggtccttgccagccccgaccgcctgtggggt tcccgcctcacctttaaccacgatggcagctcgcgatatggccccaggacctatggcacg accactgctcccagggatgaggatggcagcaccctcttcaggggatggtcccaggagggg ccagtaaagtctccagcagagtgccgggaagagcacagcaagacccctgaggagaggagc cttccttccgacctggccttcaacggggacctggctaaggcagccagctcggagctacct gctgatatttccaagccctggattccctcaagtccagccccctcctcagagaatggaggc cctgccagcccaggcctccccgcagaagcctcaggctcaggccctggctctccccatctt cacccgcctgataagagttctccctgccactcacagcttctggaagcccagactcctgaa gcttcccaggcttctccctgccccgctgtgactccatcagctccaagtgcagccctgcct gacgagggctcccgccacacccccagcccggggctccctgccgagggggctccagaggcc cccagacccagcagcccaccccctgaggtcttggagccccatagcctggatcagccccct gccacctcaccccggcccctgatcgaggtgggtgagttgctggatctcactcggacgttt ccatctggcggggaggaggaggccaagggtgacgcacacctccgccccaccagcctggtt cagcgccgattctctgaaggtgtgctccagtcacccagtcaggaccaggagaagctgggg ggctcgctggctgccctgccccaaggccaggggagccagttggccctggatcgtcccttt ggggcagagtccaactggagcttatcacagtccttcgaatggaccttccccacgaggccc tcgggtctgggcgtgtggcggctggactccccgcctccctcccccatcactgaagccagt gaggccgccgaggctgctgaggctggcaacttggccgtttccagcagggaagaaggagtg tctcagcaggggcaaggggctgggtcagctccaagtgggtcaggaagttcctgggtgcag ggggatgatccaagcatgtccctcacccagaagggcgatggggagagtcaacctcaattc ccagctgttccccttgagcccctgcctacaactgagggcacacctggattacctttgcag caggcagaggagagatacgagtcgcaggagcccttggctggacaggagtcccctctcccc ctggctaccagggaggcagccttgcccatcctggagccagtcctggggcaggagcagcca gcagcccctgaccagccctgtgttctctttgctgatgcccctgagcctggacaggcactg cctgttgaggaggaggccgtgaccctagcccgggctgagaccacccaagccaggacagag gctcaagacttgtgtagggcatcccccgagcctccaggccctgaaagcagctcccgctgg ctggacgacctcctggcttcaccaccacccagtggtggcggtgcaaggcggggagctgga gctgagctgaaggacacacagtccccaagtacctgctctgagggactccttggctggtcc cagaaagatctgcagagtgaatttgggatcacaggagacccacagcccagcagtttcagt ccttccagctggtgtcaaggtgcttctcaggactatggccttgggggtgcaagccctaga ggagacccaggtctcggagagagggactggaccagcaagtatgggcaaggagcaggggaa gggagcaccagggagtgggccagcaggtgtggcatcggccaggaggagatggaggccagc agcagccaagaccagagtaaagtgtctgccccaggggtgctcacagcccaggaccgggta gttggaaagccagcccagcttggcactcagcggagccaggaggcagatgttcaggactgg gagttcagaaagagggattcccagggcacttactccagccgggatgcagaactccaggac caggaattcggaaagagagattcactgggtacctacagtagtcgagatgtaagccttggg gactgggaatttgggaagagagattctctgggtgcttatgccagccaagatgccaacgag cagggccaagatttggggaagagggaccaccatggtaggtacagcagccaggatgccgat gagcaggactgggagtttcagaagagagatgtgtcactcggcacctatggcagccgggct gcggagccacaggaacaggagtttgggaagagcgcttggataagggactacagcagtggt ggcagctccaggacccttgacgcccaggacagaagctttggaacgagacccctgagctct gggttcagccccgaggaagcccagcaacaggatgaggaatttgagaagaagattccaagt gtggaagacagccttggagagggcagcagggatgctggccggccaggagagagaggatcc gggggcttgttcagtcctagcactgcccacgtgccggatggggcactcgggcagagagac cagagcagctggcaaaacagtgatgctagccaggaggtgggagggcatcaggagagacag caggcaggggctcagggccctggcagtgctgacctggaagatggggagatgggaaagcga ggctgggtcggtgagtttagcctcagtgttggcccccagcgagaggcagcatttagccca gggcagcaggactggagccgggacttctgcatcgaggccagtgagaggagctatcagttt ggcatcattggcaacgacagagtgagtggtgctggctttagcccttctagcaagatggaa ggtggtcactttgtgcctcctgggaagaccacagctggctcggtggactggactgaccag ctgggtctcaggaacttggaagtgtccagctgtgtgggttctgggggctcgagcgaggcc agggagagtgccgtgggacagatgggctggtcaggtggcctgagcttgagagacatgaac ctgaccggctgtttggaaagtggagggtctgaagagccggggggaatcggagttggggag aaggactggacttctgatgttaatgtgaagagcaaagatttggctgaggtcggggaggga ggaggccacagccaggccagagagagtggcgtggggcagactgactggtcaggtgtggag gccggagagttccttaaatcaagggagcgtggagttggacaggcagactggacacctgac cttgggctgagaaacatggccccaggggcagtctgcagtcctggagagtccaaagagctt ggggtgggccagatggactggggtaacaatctgggcctgagggatttggaggtgacctgt gacccagactctggaggttctcaggggctacggggatgtggagtggggcagatggactgg acccaggacttggcgccccagaatgtggagctctttggggctccaagtgaagccagggag catggggtgggcggggtgagccagtgcccagagcccggcctgaggcacaatggcagcttg tctcctggcctggaggccagagaccccttggaggccagggagctgggggttggtgagaca agtgggccagagacccagggtgaagattactcctcgtcttccttggagccacaccctgca gaccctggaatggagacaggagaagccctcagcttcggagcaagccctggcaggtgcccg gcccgccccccaccctccggctcccagggcctgctggaggagatgctggcagccagcagc tccaaggcggtggctcggagggagtcagcggcctcgggccttgggggcctgttggaggag gaaggagccggggcaggtgctgcccaagaggaggtgctggagcctggcagggactctcca ccctcctggaggccgcagcctgatggtgaggccagccagacagaagacgtggatggcacc tggggctcttcagcagccaggtggagcgatcaggggccagcacagacttctcggcgaccc tcccaaggccctcctgccagatcccccagtcaggacttctccttcattgaggacaccgag atcctcgacagtgccatgtatcggagccgtgccaacttggggcgcaagcgtgggcaccgg gccccggtcattcggcctgggggtaccttgggcctgtcggaggcagcagactcggatgca cacctgttccaggactctacagagccacgggcatctcgggtgccatcttcagatgaagag gtagtggaggaacctcagagccgccggacacggatgtcgttgggcaccaaggggctgaaa gtcaacctctttcctggcctgagcccctcagccctgaaggccaagctgcgcccccggaat cgctcagctgaggagggagagctggctgagagcaagtcgagccagaaggagtccgcggtc cagcgttcgaaatcctgcaaggtcccaggactgggaaagcccctcacgttacctcccaag ccagagaaatcctcagggatccaggagctgcaactggtgtttgggcccaagaacgagctg gcctttgtccatggagccctcagtattggggtggtggtggttggtgtcaggggagagcag gtgtctgtgggtctggagcccatgtgtgacctggcagagtttatgccacacccggcctta cagccatacctggtggtagtagagctgtgttactctcctttgtacatggtgtcagccggt tacagtgtatcctccagctccaatgggcctggccatgctggccactag >gi568815587r:57226410_57435121|GENSCAN_predicted_peptide_4|787_aa MCGLLADPGVPLLVAMAPAGAARAGVGLGVSGIPGPVLHPGLDLDPRALQRPSPGALPRP PQAGVDMAETLEFNDVYQEVKGSMNDGRLRLSRQGIIFKNSKTGKVDNIQAGELTEGIWR RVALGHGLKLLTKNGHVYKYDGFRESEFEKLSDFFKTHYRLELMEKDLCVKGWNWGTVKF GGQLLSFDIGDQPVFEIPLSNVSQCTTGKNEVTLEFHQNDDAEVSLMEVRFYVPPTQEDG VDPVEAFAQNVLSKADVIQATGDAICIFRELQCLTPRGRYDIRIYPTFLHLHGKTFDYKI PYTTVLRLFLLPHKDQRQMFFVISLDPPIKQGQTRYHFLILLFSKDEDISLTLNMNEEEV EKRFEGRLTKNMSGSLYEMVSRVMKALVNRKITVPGNFQGHSGAQCITCSYKASSGLLYP LERGFIYVHKPPVHIRFDEISFVNFARGTTTTRSFDFEIETKQGTQYTFSSIEREEYGKL FDFVNAKKLNIKNRGLKEGMNPSYDEYADSDEDQHDAYLERMKEEGKIREENANDSSDDS GEETDESFNPGEEEEDVAEEFDSNASASSSSNEGDSDRDEKKRKQLKKAKMAKDRKSRKK PVEVKKGKDPNAPKRPMSAYMLWLNASREKIKSDHPGISITDLSKKAGEIWKGMSKEKKE EWDRKAEDARRDYEKAMKEYEGGRGESSKRDKSKKKKKVKVKMEKKSTPSRGSSSKSSSR QLSESFKSKEFVSSDESSSGENKSKKKRRRSEVRQECGACGLGRDSEEEELASTPPSSED SASGSDE >gi568815587r:57226410_57435121|GENSCAN_predicted_CDS_4|2364_bp atgtgcgggctcctggcagaccctggggttccgctgctcgttgccatggctcccgccgga gcggccagagcgggcgtcgggttgggtgtcagcggtattccgggcccggttctccaccca ggcctggacctggaccctcgggcactgcagcgtccgtctcctggggctctgcccaggcca ccacaggcaggggtcgacatggcagagacactggagttcaacgacgtctatcaggaggtg aaaggttccatgaatgatggtcgactgaggttgagccgtcagggcatcatcttcaagaat agcaagacaggcaaagtggacaacatccaggctggggagttaacagaaggtatctggcgc cgtgttgctctgggccatggacttaaactgcttacaaagaatggccatgtctacaagtat gatggcttccgagaatcggagtttgagaaactctctgatttcttcaaaactcactatcgc cttgagctaatggagaaggacctttgtgtgaagggctggaactgggggacagtgaaattt ggtgggcagctgctttcctttgacattggtgaccagccagtctttgagatacccctcagc aatgtgtcccagtgcaccacaggcaagaatgaggtgacactggaattccaccaaaacgat gacgcagaggtgtctctcatggaggtgcgcttctacgtcccacccacccaggaggatggt gtggaccctgttgaggcctttgcccagaatgtgttgtcaaaggcggatgtaatccaggcc acgggagatgccatctgcatcttccgggagctgcagtgtctgactcctcgtggtcgttat gacattcggatctaccccacctttctgcacctgcatggcaagacctttgactacaagatc ccctacaccacagtactgcgtctgtttttgttaccccacaaggaccagcgccagatgttc tttgtgatcagcctggatcccccaatcaagcaaggccaaactcgctaccacttcctgatc ctcctcttctccaaggacgaggacatttcgttgactctgaacatgaacgaggaagaagtg gagaagcgctttgagggtcggctcaccaagaacatgtcaggatccctctatgagatggtc agccgggtcatgaaagcactggtaaaccgcaagatcacagtgccaggcaacttccaaggg cactcaggggcccagtgcattacctgttcctacaaggcaagctcaggactgctctacccg ctggagcggggcttcatctacgtccacaagccacctgtgcacatccgcttcgatgagatc tcctttgtcaactttgctcgtggtaccactactactcgttcctttgactttgaaattgag accaagcagggcactcagtataccttcagcagcattgagagggaggagtacgggaaactg tttgattttgtcaacgcgaaaaagctcaacatcaaaaaccgaggattgaaagagggcatg aacccaagctacgatgaatatgctgactctgatgaggaccagcatgatgcctacttggag aggatgaaggaggaaggcaagatccgggaggagaatgccaatgacagcagcgatgactca ggagaagaaaccgatgagtcattcaacccaggtgaagaggaggaagatgtggcagaggag tttgacagcaacgcctctgccagctcctccagtaatgagggtgacagtgaccgggatgag aagaagcggaaacagctcaaaaaggccaagatggccaaggaccgcaagagccgcaagaag cctgtggaggtgaagaagggcaaagaccccaatgcccccaagaggcccatgtctgcatac atgctgtggctcaatgccagccgagagaagatcaagtcagaccatcctggcatcagcatc acggatctttccaagaaggcaggcgagatctggaagggaatgtccaaagagaagaaagag gagtgggatcgcaaggctgaggatgccaggagggactatgaaaaagccatgaaagaatat gaagggggccgaggcgagtcttctaagagggacaagtcaaagaagaagaagaaagtaaag gtaaagatggaaaagaaatccacgccctctaggggctcatcatccaagtcgtcctcaagg cagctaagcgagagcttcaagagcaaagagtttgtgtctagtgatgagagctcttcggga gagaacaagagcaaaaagaagaggaggaggagcgaggtgcggcaggaatgtggggcctgt gggctgggcagggactctgaagaagaagaactagccagtactccccccagctcagaggac tcagcgtcaggatccgatgagtag >gi568815587r:57226410_57435121|GENSCAN_predicted_peptide_5|409_aa MNCISDFFTYETTKSVVVKSWTIGIINRVVQLLIISYFVGPPTFNRAPLPARWVFLHEKA YQVRDTAIESSVVTKVKGSGLYANRVMDVSDYVTPPQGTSVFVIITKMIVTENQMQGFCP ESEEKYRCVSDSQCGPERLPGGGILTGRCVNYSSVLRTCEIQGWCPTEVDTVETPIMMEA ENFTIFIKNSIRFPLFNFEKGNLLPNLTARDMKTCRFHPDKDPFCPILRVGDVVKFAGQD FAKLARTGGVLGIKIGWVCDLDKAWDQCIPKYSFTRLDSVSEKSSVSPGYNFRFAKYYKM ENGSEYRTLLKAFGIRFDVLVYGNAGKFNIIPTIISSVAAFTSVGVGTVLCDIILLNFLK GADQYKAKKFEEVNETTLKIAALTNPVYPSDQTTAEKQSTDSGAFSIGH >gi568815587r:57226410_57435121|GENSCAN_predicted_CDS_5|1230_bp atgaactgcatatccgacttcttcacctatgagaccaccaagtcggtggttgtgaagagc tggaccatcgggatcatcaaccgagtagttcagcttctgatcatctcctactttgtaggc ccccccaccttcaaccgagctcctttacctgccaggtgggttttcttgcacgagaaggct taccaggtacgggacacagccattgagtcctcggtggtaaccaaggtgaagggctccgga ctctacgccaacagagtcatggatgtgtctgattacgtgacgccacctcagggcacctcg gtctttgtcatcatcaccaagatgattgttactgaaaatcagatgcaaggattctgccca gagagtgaggagaaataccgctgtgtatcagacagccagtgcgggcctgagcgcttgcca ggtggggggatcctcactggccgctgcgtgaactacagctctgtgctccggacctgtgag atccagggctggtgccccacggaggtggacacagtggaaacgcccatcatgatggaagct gagaacttcactattttcatcaagaacagcatccgtttccccctcttcaactttgagaag ggaaacctccttcccaacctgacagccagggacatgaagacctgccgcttccacccggac aaggaccctttctgccccatcttgcgggtaggggacgtggtcaagtttgcggggcaggat tttgccaaactggcgcgcacggggggagttctgggcattaagatcggctgggtgtgcgac ttggacaaggcctgggaccagtgcatccccaaatactccttcacccggctcgacagcgtt tctgagaaaagcagcgtgtccccaggctacaacttcaggtttgccaagtactacaaaatg gaaaatggcagtgagtaccgcaccctcctgaaggcttttggcatccgcttcgacgtgctg gtatacgggaatgctggcaagttcaacatcatccccaccatcatcagctctgtggcggcc tttacttctgtgggagtgggaactgttctctgtgacatcatcctgctcaacttcctcaag ggggccgaccagtacaaagccaagaagtttgaggaggtgaatgagactacgctgaaaatc gcggctttgaccaacccagtgtaccccagcgaccagaccacagcggagaagcagtccacc gattcgggggccttctccataggccactag >gi568815587r:57226410_57435121|GENSCAN_predicted_peptide_6|408_aa MKLPLLLALLFGAVSALHLRSETSTFETPLGAKTLPEDEETPEQEMEETPCRELEEEEEW GSGSEDASKKDGAVESISVPDMVDKNLTCPEEEDTVKVVGIPGCQTCRYLLVRSLQTFSQ AWFTCRRCYRGNLVSIHNFNINYRIQCSVSALNQGQVWIGGRITGSGRCRRFQWVDGSRW NFAYWAAHQPWSRGGHCVALCTRENDAPHLESLETQADLGQDLDSSKEQERDLALTEEVI QAEGEEVKASACQDNFEDEEAMESDPAALDKDFQCPREEDIVEVQGSPRCKICRYLLVRT PKTFAEAQNVCSRCYGGNLVSIHDFNFNYRIQCCTSTVNQAQVWIGGNLRGWFLWKRFCW TDGSHWNFAYWSPGQPGNGQGSCVALCTKGGYWRRAQCDKQLPFVCSF >gi568815587r:57226410_57435121|GENSCAN_predicted_CDS_6|1227_bp atgaaactccccttacttctggctcttctatttggggcagtttctgctcttcatctaagg tctgagacttccacctttgagacccctttgggtgctaagacgctgcctgaggatgaggag acaccagagcaggagatggaggagaccccttgcagggagctggaggaagaggaggagtgg ggctctggaagtgaagatgcctccaagaaagatggggctgttgagtctatctcagtgcca gatatggtggacaaaaaccttacgtgtcctgaggaagaggacacagtaaaagtggtgggc atccctgggtgccagacctgccgctacctcctggtgagaagtcttcagacgtttagtcaa gcttggtttacttgccggaggtgctacaggggcaacctggtttccatccacaacttcaat attaattatcgaatccagtgttctgtcagcgcgctcaaccagggtcaagtctggattgga ggcaggatcacaggctcgggtcgctgcagacgctttcagtgggttgacggcagccgctgg aactttgcatactgggctgctcaccagccctggtcccgcggtggtcactgcgtggccctg tgtacccgagagaatgatgccccccatctggagagcctagagacacaggcagacctaggc caggatctggatagttcaaaggagcaggagagagacttggctctgacggaggaggtgatt caggcagagggagaggaggtcaaggcttctgcctgtcaagacaactttgaggatgaggaa gccatggagtcggacccagctgccttagacaaggacttccagtgccccagggaagaagac attgttgaagtgcagggaagtccaaggtgcaagatctgccgctacctattggtgcggact cctaaaacttttgcagaagctcagaatgtctgcagcagatgctacggaggcaaccttgtc tctatccatgacttcaacttcaactatcgcattcagtgctgcactagcacagtcaaccaa gcccaggtctggattggaggcaacctcaggggctggttcctgtggaagcggttttgctgg actgatgggagccactggaattttgcttactggtccccagggcaacctgggaatgggcaa ggctcctgtgtggccctatgcaccaaaggaggttattggcgacgagctcaatgcgacaag caactgcccttcgtctgctccttctaa >gi568815587r:57226410_57435121|GENSCAN_predicted_peptide_7|143_aa MVTQDASPLGCPMDIVRKLRFRVVKVTQLISACKAQSFPTHPTALASLQAWESKTPPQKK KKKKKKEEEEEEEEEEEEEEEEEEEEEEEKKRKKRKKRKKRKKRKKKKKKKKKKKKKKKK KKKKKKKKKKKKKKKKKRRRNVK >gi568815587r:57226410_57435121|GENSCAN_predicted_CDS_7|432_bp atggtcacccaagatgcttctccacttggatgtcccatggacattgtgaggaaactgagg ttcagggtggttaaggtcacacagctgatcagtgcctgcaaagcccagagctttcccaca caccctactgctctagccagcctgcaggcctgggagagcaagaccccacctcaaaaaaaa aagaagaagaagaagaaagaagaagaggaagaagaagaagaagaagaagaagaagaagag gaagaagaggaagaagaggaagaagaaaagaagaggaagaagaggaagaagaggaagaag aggaagaagaggaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaag aagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaaaagaagaaga aatgtaaagtaa >gi568815587r:57226410_57435121|GENSCAN_predicted_peptide_8|536_aa MGRRARPGPSGSSAGRAGRRLERGAEAGGERPPEAFLVEVDARKGGGDQGLPLHVATLLT GLLECLGFAGVLFGWPSLVFVFKNEDYFKDLCGPDAGPIGNATGQADCKAQDERFSLIFT LGSFMNNFMTFPTGYIFDRFKTTVARLIAIFFYTTATLIIAFTSAGSAVLLFLAMPMLTI GGILFLITNLQIGNLFGQHRSTIITLYNGAFDSSSAVFLIIKLLYEKGISLRASFIFISV CSTWHVARTFLLMPRGHIPYPLPPNYSYGLCPGNGTTKEEKETAEHENRELQSKEFLSAK EETPGAGQKQELRSFWSYAFSRRFAWHLVWLSVIQLWHYLFIGTLNSLLTNMAGGDMARV STYTNAFAFTQFGVLCAPWNGLLMDRLKQKYQKEARKTGSSTLAVALCSTVPSLALTSLL CLGFALCASVPILPLQYLTFILQVISRSFLYGSNAAFLTLAFPSEHFGKLFGLVMALSAV VSLLQFPIFTLIKGSLQNDPFYVNVMFMLAILLTFFHPFLVYRECRTWKESPSAIA >gi568815587r:57226410_57435121|GENSCAN_predicted_CDS_8|1611_bp atggggcgccgagcgcggcctggcccctcgggctcctctgcggggagggcaggccgcagg ctggagcggggtgcggaggctggcggggagcggcccccggaggctttcctggtagaagtt gatgcgaggaagggcggcggggaccagggcctgcccctgcacgtggccacactgctgact gggctgctggaatgcctgggctttgctggcgtcctctttggctggccttcactagtgttt gtcttcaagaatgaagattactttaaggatctgtgtggaccagatgctgggccgattggc aatgccacagggcaggctgactgcaaagcccaggatgagaggttctcactcatcttcacc ctggggtccttcatgaacaacttcatgacattccccactggctacatctttgaccggttc aagaccaccgtggcacgcctcatagccatatttttctacaccaccgccacactcatcata gccttcacctctgcaggctcagccgtgctgctcttcctggccatgccaatgctcaccatt gggggaatcctgtttctcatcaccaacctgcagattgggaacctatttggccaacaccgt tcgaccatcatcactctgtacaatggagcatttgactcttcctcggcagtcttccttatt attaagcttctttatgaaaaaggcatcagcctcagggcctccttcatcttcatctctgtc tgcagtacctggcatgtagcacgcactttcctcctgatgccccgggggcacatcccatac ccactgccccccaactacagctatggcctgtgccctgggaatggcaccacaaaggaagag aaggaaacagctgagcatgaaaacagggagctacagtcaaaggagttcctttcagcgaag gaagagaccccaggggcagggcagaagcaggaactccgctccttctggagctacgctttc tctcggcgctttgcctggcacctggtgtggctgtctgtgatacagttgtggcactacctc ttcattggcactctcaactccttgctgaccaacatggccggtggggacatggcacgagtc agcacctacacaaatgcctttgccttcactcagttcggagtgctgtgtgccccctggaat ggcctgctcatggaccggcttaaacagaagtaccagaaggaagcaagaaagacaggttcc tccactttggcggtggccctctgctcgacggtgccttcgctggccctgacatccctgctg tgcctgggcttcgccctctgtgcctcagtccccatcctccctctccagtacctcaccttc atcctgcaagtgatcagccgctccttcctctatgggagcaacgcggccttcctcaccctt gctttcccttcagagcactttggcaagctctttgggctggtgatggccttgtcggctgtg gtgtctctgctccagttccccatcttcaccctcatcaaaggctcccttcagaatgaccca ttttacgtgaatgtgatgttcatgcttgccattcttctgacattcttccacccctttctg gtatatcgggaatgccgtacttggaaagaaagtccctctgcaattgcatag