GENSCAN 1.0 Date run: 3-Nov-116 Time: 23:06:22

Sequence gi568815597f:43201072_43422719 : 221648 bp : 48.08% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   5649   5861  213  2  0  101   97   191 0.883  20.21
 1.02 Intr +   8672   8845  174  1  0   65  105   317 0.993  31.24
 1.03 Intr +  14184  14345  162  2  0  101  116   225 0.993  26.77
 1.04 Intr +  16640  16888  249  1  0   61   73    95 0.581   2.93
 1.05 Intr +  18311  18466  156  1  0   90   62   151 0.990  12.91
 1.06 Intr +  20301  20394   94  2  1   53   80    79 0.974   3.14
 1.07 Intr +  21034  21224  191  2  2  116   63   325 0.993  32.10
 1.08 Intr +  21753  21926  174  2  0   31   64   383 0.967  30.34
 1.09 Intr +  22975  23133  159  0  0   83   56   234 0.995  19.88
 1.10 Intr +  25912  26055  144  0  0  110   94   163 0.977  19.58
 1.11 Intr +  27151  27174   24  0  0   94   95    13 0.630   0.82
 1.12 Intr +  31437  31553  117  2  0  117   70   109 0.988  12.66
 1.13 Intr +  33208  33342  135  0  0   28  103   218 0.999  18.06
 1.14 Intr +  33424  33567  144  0  0   95   82   324 0.999  32.98
 1.15 Intr +  40394  40591  198  1  0  121   49    28 0.718   1.75
 1.16 Intr +  42156  42288  133  2  1  121   90   160 0.992  19.72
 1.17 Intr +  56460  56619  160  1  1   84   67    64 0.150   2.95
 1.18 Term +  66543  66750  208  0  1  100   39   140 0.019   7.11
 1.19 PlyA +  66946  66951    6                               1.05

 2.00 Prom +  67509  67548   40                              -8.36
 2.01 Sngl +  71652  72311  660  2  0   63   49   855 0.926  75.28
 2.02 PlyA +  72898  72903    6                               1.05

 3.07 PlyA -  74301  74296    6                               1.05
 3.06 Term -  74911  74849   63  1  0   99   40    63 0.503   0.49
 3.05 Intr -  75982  75825  158  2  2   87   83    70 0.670   6.13
 3.04 Intr -  82036  81740  297  2  0  120   31   224 0.532  16.75
 3.03 Intr -  84536  84373  164  1  2   64   61    61 0.292   0.62
 3.02 Intr -  84870  84662  209  0  2   72   60   135 0.593   6.98
 3.01 Init -  86226  86224    3  0  0   77  101     0 0.593   0.20
 3.00 Prom -  96053  96014   40                              -5.26

 4.00 Prom +  96964  97003   40                              -6.16
 4.01 Init + 100001 100058   58  1  1   42  106    69 0.868   3.59
 4.02 Intr + 103780 104094  315  2  0   42  105   491 0.997  42.24
 4.03 Intr + 104162 104272  111  0  0   85   55    93 0.953   6.05
 4.04 Intr + 105769 105924  156  2  0   91   24   271 0.458  20.98
 4.05 Intr + 106071 106202  132  1  0   31   86    66 0.543   1.32
 4.06 Intr + 106361 106501  141  0  0   68   91    26 0.410   1.22
 4.07 Intr + 106725 106853  129  1  0   95   75    59 0.840   5.97
 4.08 Intr + 107915 108060  146  0  2   78   53   188 0.911  14.30
 4.09 Intr + 108317 108461  145  1  1  108   86   198 0.997  21.46
 4.10 Intr + 110600 110758  159  0  0   27   92   197 0.997  13.96
 4.11 Intr + 110923 111060  138  2  0   39   85    73 0.852   2.54
 4.12 Intr + 111234 111530  297  1  0   61   68   179 0.998  10.05
 4.13 Intr + 112064 112354  291  0  0  100   93   404 0.983  39.21
 4.14 Intr + 112707 112897  191  1  2   88   69   251 0.978  22.50
 4.15 Intr + 116128 116338  211  0  1   51   31   395 0.998  28.59
 4.16 Intr + 116493 116603  111  1  0   36   83   110 0.884   5.65
 4.17 Intr + 116811 117001  191  1  2  128   89   288 0.855  32.20
 4.18 Intr + 118164 118277  114  2  0   75  101   155 0.999  16.14
 4.19 Intr + 118388 118458   71  1  2   96   56   110 0.790   6.58
 4.20 Intr + 120198 120238   41  0  2   70   97   -29 0.577  -5.93
 4.21 Intr + 120325 120421   97  2  1   87   65   214 0.653  18.07
 4.22 Intr + 120545 120644  100  2  1  121  100   129 0.999  17.51
 4.23 Term + 121580 121651   72  1  0   93   49   146 0.999   9.11
 4.24 PlyA + 122015 122020    6                               1.05

 5.00 Prom + 126866 126905   40                              -5.56
 5.01 Init + 136778 136856   79  1  1   77  100   133 0.995  12.63
 5.02 Intr + 137028 137160  133  1  1   18   78   164 0.741   8.10
 5.03 Intr + 137447 137649  203  2  2   16   82   168 0.890   7.93
 5.04 Intr + 138200 138490  291  0  0  111   19   154 0.579   7.91
 5.05 Intr + 138936 139055  120  1  0   17  109   128 0.580   8.27
 5.06 Intr + 139316 139442  127  0  1   66   94    31 0.988   1.24
 5.07 Intr + 145374 145558  185  0  2   65   99   180 0.973  16.23
 5.08 Intr + 145592 145863  272  0  2   62   86    87 0.742   3.16
 5.09 Intr + 147772 147931  160  0  1   75   93   138 0.719  12.56
 5.10 Intr + 148192 148288   97  2  1   85   86    70 0.688   5.57
 5.11 Intr + 151145 151232   88  2  1  129  117    -2 0.989   6.77
 5.12 Intr + 158106 158325  220  2  1   38   65   229 0.075  13.47
 5.13 Intr + 158419 158567  149  2  2  110   95   202 0.999  23.05
 5.14 Intr + 158654 158750   97  1  1   51   90    88 0.725   4.88
 5.15 Intr + 158898 159026  129  1  0   72   87   171 0.999  16.07
 5.16 Intr + 159122 159318  197  0  2   72   94   155 0.963  13.63
 5.17 Intr + 159428 159522   95  1  2   68   82   106 0.995   6.76
 5.18 Intr + 159662 159890  229  2  1  105  113   153 0.991  17.37
 5.19 Intr + 160049 160174  126  1  0  100   89    73 0.990   9.48
 5.20 Intr + 161124 161241  118  2  1   98  115   100 0.997  13.74
 5.21 Term + 161880 162058  179  1  2   77   49   150 0.998   7.85
 5.22 PlyA + 162096 162101    6                               1.05

 6.18 PlyA - 162125 162120    6                              -0.45
 6.17 Term - 163066 162845  222  1  0  116   54   262 0.932  22.52
 6.16 Intr - 163389 163253  137  1  2  122   81    61 0.993   9.09
 6.15 Intr - 163576 163471  106  1  1  109  100   -48 0.910  -1.71
 6.14 Intr - 163723 163667   57  1  0   53   79    88 0.844   3.48
 6.13 Intr - 163937 163857   81  2  0   21  110    96 0.967   4.93
 6.12 Intr - 164305 164115  191  2  2   85   64   189 0.998  15.50
 6.11 Intr - 164552 164493   60  0  0   85  110    34 0.051   4.01
 6.10 Intr - 166348 166206  143  0  2   23   88    54 0.007  -1.10
 6.09 Intr - 184035 183975   61  1  1  114  101    43 0.595   5.99
 6.08 Intr - 185155 184907  249  2  0   75  110    75 0.633   5.81
 6.07 Intr - 185599 185482  118  1  1   77   61    73 0.714   3.54
 6.06 Intr - 185927 185787  141  2  0   21   47   134 0.725   3.15
 6.05 Intr - 186576 186432  145  2  1   46   95   186 0.569  15.38
 6.04 Intr - 187418 187239  180  1  0   75   99   111 0.865   9.88
 6.03 Intr - 188818 188688  131  1  2   58   94    47 0.443   1.79
 6.02 Intr - 194809 194765   45  1  0   47  121    29 0.588   0.71
 6.01 Init - 195893 195828   66  2  0   84  121    45 0.956   8.47
 6.00 Prom - 196953 196914   40                              -6.06

 7.00 Prom + 201413 201452   40                              -6.96
 7.01 Init + 201557 201634   78  1  0   81   53     8 0.415  -2.34
 7.02 Intr + 202106 202231  126  1  0   85   81    96 0.906   9.48
 7.03 Intr + 202530 202703  174  2  0   72   48   101 0.921   4.64
 7.04 Intr + 203309 203479  171  1  0  122   72   152 0.999  17.14
 7.05 Intr + 214011 214142  132  2  0   99   91   173 0.970  19.54
 7.06 Intr + 214889 215030  142  1  1   53   82    60 0.979   1.83
 7.07 Intr + 215464 215570  107  2  2   90   94    98 0.999  10.53
 7.08 Intr + 218663 218873  211  1  1   68   81   117 0.943   7.49
 7.09 Intr + 219082 219252  171  2  0   82  116   228 0.995  24.91
 7.10 Intr + 219678 219912  235  1  1   86   78   286 0.989  24.15
 7.11 Intr + 220103 220232  130  2  1   58  117    99 0.984  10.50
 7.12 Intr + 221012 221154  143  1  2  114   44   121 0.826   9.45
 7.13 Intr + 221409 221561  153  0  0   87   80   125 0.991  10.79

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 158145 158325  181  2  1   64   65   231 0.872  17.82
S.002 Init - 164538 164493   46  0  1   99  110    48 0.922   8.95

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597f:43201072_43422719|GENSCAN_predicted_peptide_1|944_aa
IRSIVWNADDSKLISGGTDGAVYEWNLSTGKRETECVLKSCSYNCVTVSPDAKIIFAVGS
DHTLKEIADSLILREISAFDVTYTAIVISHSGRMMFVGTSVGTIRAMKYPLPLQKEFNEY
QAHAGPITKMLLTFDDQFLLTAAEDGCLFTWKVFDKDGRGIKREREVGFAEEVLVTKTDM
EEKLPPTHPSNSSSCLWSFLNLVRAAYRHWRDLYNTKVSLIGFDEDLWKPSKIKSKLPSM
VYRAPDGLTQSWLPLPIHPTCAISHAAQVMLELKTRVEELKMENEYQLRLKDMNYSEKIK
ELTDKFIQEMESLKTKNQVLRTEKEKQDVYHHEHIEDLLDKQSRELQDMECCNNQKLLLE
YEKYQELQLKSQRMQEEYEKQLRDNDETKSQALEELTEFYEAKLQEKTTLLEEAQEDVRQ
QLREFEETKKQIEEDEDREIQDIKTKYEKKLRDEKESNLRLKGETGIMRKKFSSLQKEIE
ERTNDIETLKGEQMKLQGVIKSLEKDIQGLKREIQERDETIQDKEKRIYDLKKKNQELGK
FKFVLDYKIKELKKQIEPRENEIRVMKEQIQEPTDFLTVPMEAELENFHKQNTQLELNIT
ELWQKLRATDQEMRRERQKERDLEALVKRFKTDLHNCVAYIQEPRLLKEKVRGLFEKYVQ
RADMVEIAGLNTDLQQEYTRQREHLERNLATLKKKVVKEGELHRTDYVRIMQVPPHSRGP
FVTAQGEQCLRPSCSLGSDSVATTREIMSQVEAQFVPGTGPFYVDLACNCLFWYTDPQEN
VSLIKEINELRRELKFTRSQVYDLEAALKLTKKVRPQEVSETEFWHQLWPSSVGYLAEAP
LVSCYLNTEKPALQKVLASLSCFLYKIWPAIAGPRSTPIHHLLVDLRVVRPHLPLCAHGH
LLLLTFTVTNLNYPNNLCFGDRVQQPCEEPAFPLDSKLLDKTQN

>gi568815597f:43201072_43422719|GENSCAN_predicted_CDS_1|2835_bp
attcgctcaattgtgtggaatgcagatgatagcaaactgatttctggtggcacagatggt
gctgtgtatgaatggaatctgtccacaggaaagagagagacagaatgcgtgctcaagtct
tgcagctacaactgtgttactgtctcccccgatgccaaaattatctttgctgttggatca
gaccacaccctcaaggagattgcagattccttgatccttcgagagatatcggcgtttgat
gtcacctacaccgccattgtcatctcgcattctggacgcatgatgtttgtgggcacctcg
gtgggaaccattcgtgccatgaagtaccctctgcctctgcagaaggaattcaatgagtac
caggcccatgccggtcctatcaccaagatgttgcttacctttgatgatcagttcctgctg actgctgctgaggatggctgcctgttcacctggaaggtctttgataaggatggccgggga atcaagcgagagagggaggtgggctttgccgaagaggtgcttgtgactaaaacagacatg gaagaaaagttgccgcccacccaccccagcaactcatcatcctgcctttggtccttcctt aatctagtccgtgctgcgtaccgccattggagagatctttataacaccaaagtcagcctg atcgggttcgatgaggacctttggaagccctctaagataaaatccaaacttcctagcatg gtctaccgggcccctgatggtctgacgcagtcttggctgccccttcctatacatccaacc tgtgccatatcacatgcagctcaggttatgttggagctaaagactcgtgtggaggaatta aaaatggagaatgagtatcaactccgactaaaggacatgaactattctgagaagattaag gagctaacagacaagttcatccaggaaatggagtccttgaaaacaaaaaaccaggtctta agaacagaaaaagagaagcaggatgtttatcaccatgagcacatagaagacctcctagac aagcaaagccgggaactgcaggacatggaatgttgcaacaaccaaaagttgcttctagaa tatgagaagtaccaggagctgcagctcaagtcccagaggatgcaggaagagtatgaaaaa cagctccgggataacgatgagaccaagagccaggccctggaggagctgactgagttttac gaggcaaaactgcaggagaaaaccacccttctggaagaggcacaggaagacgtcaggcag cagctgcgggagtttgaagagaccaagaagcagattgaggaagatgaagaccgagaaatc caagatatcaaaaccaagtatgagaaaaagcttcgggatgaaaaggaatcaaacctgcgg ctcaagggagaaacaggcatcatgaggaagaagttcagcagcctacagaaggagattgaa gaacgaaccaatgacatcgagaccctaaaaggagagcagatgaagctgcaaggagtcatt aagtctctggagaaggacatccaaggcctcaagcgagagatccaggaaagagacgagact attcaagacaaggagaagcgaatttatgatctgaaaaagaaaaatcaagaactagggaaa ttcaagtttgtgcttgactacaaaataaaggagctgaagaagcaaatagaacctcgagag aatgagatcagggtgatgaaggaacagattcaggagcccactgactttctgacagttcct atggaagctgaactggagaatttccataagcagaacactcaactggagctgaacatcaca gaattgtggcagaaactgagagccaccgatcaggagatgcgcagagagagacagaaggag cgagacttggaagcgctggtcaaaaggtttaaaacagacctccacaactgcgtagcctat attcaggaaccgcggctgctgaaggagaaggttcgaggtctctttgagaagtacgtgcag cgagcagacatggtggagatcgcagggctgaacacagacctgcagcaggagtacacccgg cagcgggagcacctggagaggaacctggccactctcaagaagaaggtggtcaaggagggc gagctgcaccgcacagactacgtccgcatcatgcaggtacctccccacagcagaggaccg tttgtcacagctcagggagaacagtgcttaagaccatcctgttctctgggctctgactcc gtagccaccactcgtgagataatgtcccaagtagaagcacagtttgttcctggcacaggt 
ccattttatgtagatttggcctgcaactgccttttctggtatacagaccctcaggaaaat gtctctctgatcaaggaaattaatgagctccgcagggagctgaagttcactcggtcccaa gtctatgaccttgaagcagctctgaaactgaccaagaaagtccgaccacaagaagtttca gagacagagttttggcatcagctgtggccaagcagtgtgggctacctagctgaggctccc ctggtgtcctgctatctcaacactgaaaagcctgccctacagaaggttctagcatcactg tcctgcttcttatacaaaatatggccagccattgctggtcccaggagtactcccattcac caccttcttgtggatctgcgggtggttcgtccgcaccttccgctttgtgcccacggccac ctcctcctcctgaccttcaccgtcaccaatctgaattaccccaacaacctctgctttggg gacagagttcaacagccctgtgaagagcctgcctttccacttgactccaagttacttgat aagacccagaactag >gi568815597f:43201072_43422719|GENSCAN_predicted_peptide_2|219_aa MSEQEAQAPGGRGLPPDMLAEQVELWWSQQPRRSALCFVVAVGLVAGCGAGGVALLSTTS SRSGEWRLATGTVLCLLALLVLVKQLMSSAVQDMNCIRQAHHVALLRSGGGADALVVLLS GLVLLVTGLTLAGLAAAPAPARPLAAMLSVGIALAALGSLLLLGLLLYQVGVSGHCPSIC MATPSTHSGHGGHGSIFSISGQLSAGRRHETTSSIASLI >gi568815597f:43201072_43422719|GENSCAN_predicted_CDS_2|660_bp atgtctgaacaggaggctcaagccccagggggccgggggctgcccccggacatgctggca gagcaggtggagctgtggtggtcccagcagccgcggcgctcggcgctctgcttcgtcgtg gccgtgggcctcgtggcaggctgtggcgcgggcggcgtggcactgctgtcaaccaccagc agccgctcaggtgaatggcggctagcaacgggcactgtgctctgtttgctggctctgctg gttctggtgaaacagctgatgagctcggctgtgcaggacatgaactgcatccgccaggcc caccatgtggccctgctgcgcagtggtggaggggccgacgccctcgtggtgctgctcagt ggcctcgtgctgctggtcaccggcctgaccctggccgggctggccgccgcccctgcccct gctcggccgctggccgccatgctgtctgtgggcattgctctggctgccttgggctcgctt ttgctgctgggcctgctgctgtatcaagtgggtgtgagcggacactgcccctccatctgt atggccactccctccacccacagtggccatggcggccatggcagcatcttcagcatctca ggacagttgtctgctggccggcgtcacgagaccacatccagcattgccagcctcatctga >gi568815597f:43201072_43422719|GENSCAN_predicted_peptide_3|297_aa MACKAPSSTLDFVPVTGRSVVYQPEHLPAEVSAPLGPAGRFSGSQLYKPQALGGEGEVYT HKVLGKSLGERFCRPQPEAYSADCAQFLKVGTEEKARGGRGGGTQALHSTQGASGSRKEA THQSWALVGPSELPTASAVAPGPGTGARAWPVLVGFVLGAVVLSLLIALAAKCHLCRRYH ASYRHRPLPETGRGGRPQVAEDEDDDGFIEDNYIQPGTGELGTEGQSLAPHRSQQVPNPI LATVPYSPGESLSMFSSVLRTKSKFLCWAAKAIGQVGESLVLTHSPTAAAQSDAYAR 
>gi568815597f:43201072_43422719|GENSCAN_predicted_CDS_3|894_bp atggcctgcaaagctccctcttcaactctggactttgtaccagtcaccggacgttcagtt gtttatcaacctgagcacctgcctgctgaggtgtctgcccctctagggcccgctggacgc ttctctggatcccagctgtacaaaccgcaagcattggggggtgagggtgaggtctacacc cataaagttcttggcaaaagcttgggggagaggttctgccggccccagccggaagcttac tccgcggattgtgcacagttcttgaaggtgggaacagaagagaaggcccgggggggccgg ggagggggtacccaggctctgcacagtacccaaggggcttctggcagcaggaaggaagct acacatcagagttgggcacttgttgggccttcggagctccccacagcgtctgctgtggcc cctggcccaggcactggggctcgggcatggcctgtgctggtaggatttgtgctgggggct gtggtcctctcgctcctcattgcacttgctgccaaatgccacctctgccgccgataccat gccagctaccggcaccgcccactgcctgagacaggaaggggaggccgcccacaggtggct gaagatgaggatgatgatggcttcatcgaggacaattacattcagcctgggactggcgag ctggggacagagggccaatccctggctccacaccgcagccagcaagtcccaaatcctatt ctggccacagtaccctactcccctggcgaaagtctttccatgttttccagtgtcctgagg acaaagtccaaattcctctgctgggctgccaaggccattggacaagtaggagagagcctg gtgctcacccactcacccacggcagctgctcagtcagatgcttatgccaggtaa >gi568815597f:43201072_43422719|GENSCAN_predicted_peptide_4|1138_aa MVWRVPPFLLPILFLASHVGAAVDLTLLANLRLTDPQRFFLTCVSGEAGAGRGSDAWGPP LLLEKDDRIVRTPPGPPLRLARNGSHQVTLRGFSKPSDLVGVFSCVGGAGARRTRVIYVH NSPGAHLLPDKVTHTVNKGDTAVLSARVHKEKQTDVIWKSNGSYFYTLDWHEAQDGRFLL QLPNVQPPSSGIYSATYLEASPLGSAFFRLIVRGCGAGRWGPGCTKECPGCLHGGVCHDH DGECVCPPGFTGTRCEQACREGRFGQSCQEQCPGISGCRGLTFCLPDPYGCSCGSGWRGS QCQEACAPGHFGADCRLQCQCQNGGTCDRFSGCVCPSGWHGVHCEKSDRIPQILNMASEL EFNLETMPRINCAAAGNPFPVRGSIELRKPDGTVLLSTKAIVEPEKTTAEFEVPRLVLAD SGFWECRVSTSGGQDSRRFKVNVKVPPVPLAAPRLLTKQSRQLVVSPLVSFSGDGPISTV RLHYRPQDSTMDWSTIVVDPSENVTLMNLRPKTGYSVRVQLSRPGEGGEGAWGPPTLMTT DCPEPLLQPWLEGWHVEGTDRLRVSWSLPLVPGPLVGDGFLLRLWDGTRGQERRENVSSP QARTALLTGLTPGTHYQLDVQLYHCTLLGPASPPAHVLLPPSGPPAPRHLHAQALSDSEI QLTWKHPEALPGPISKYVVEVQVAGGAGDPLWIDVDRPEETSTIIRGLNASTRYLFRMRA SIQGLGDWSNTVEESTLGNGLQAEGPVQESRAAEEGLDQQLILAVVGSVSATCLTILAAL LTLVCIRRSCLHRRRTFTYQSGSGEETILQFSSGTLTLTRRPKLQPEPLSYPVLEWEDIT FEDLIGEGNFGQVIRAMIKKDGLKMNAAIKMLKEYASENDHRDFAGELEVLCKLGHHPNI 
INLLGACKNRGYLYIAIEYAPYGNLLDFLRKSRVLETDPAFAREHGTASTLSSRQLLRFA SDAANGMQYLSEKQFIHRDLAARNVLVGENLASKIADFGLSRGEEVYVKKTMGRLPVRWM AIESLNYSVYTTKSDVWSFGVLLWEIVSLGGTPYCGMTCAELYEKLPQGYRMEQPRNCDD EVYELMRQCWRDRPYERPPFAQIALQLGRMLEARKAYVNMSLFENFTYAGIDATAEEA >gi568815597f:43201072_43422719|GENSCAN_predicted_CDS_4|3417_bp atggtctggcgggtgccccctttcttgctccccatcctcttcttggcttctcatgtgggc gcggcggtggacctgacgctgctggccaacctgcggctcacggacccccagcgcttcttc ctgacttgcgtgtctggggaggccggggcggggaggggctcggacgcctggggcccgccc ctgctgctggagaaggacgaccgtatcgtgcgcaccccgcccgggccacccctgcgcctg gcgcgcaacggttcgcaccaggtcacgcttcgcggcttctccaagccctcggacctcgtg ggcgtcttctcctgcgtgggcggtgctggggcgcggcgcacgcgcgtcatctacgtgcac aacagccctggagcccacctgcttccagacaaggtcacacacactgtgaacaaaggtgac accgctgtactttctgcacgtgtgcacaaggagaagcagacagacgtgatctggaagagc aacggatcctacttctacaccctggactggcatgaagcccaggatgggcggttcctgctg cagctcccaaatgtgcagccaccatcgagcggcatctacagtgccacttacctggaagcc agccccctgggcagcgccttctttcggctcatcgtgcggggttgtggggctgggcgctgg gggccaggctgtaccaaggagtgcccaggttgcctacatggaggtgtctgccacgaccat gacggcgaatgtgtatgcccccctggcttcactggcacccgctgtgaacaggcctgcaga gagggccgttttgggcagagctgccaggagcagtgcccaggcatatcaggctgccggggc ctcaccttctgcctcccagacccctatggctgctcttgtggatctggctggagaggaagc cagtgccaagaagcttgtgcccctggtcattttggggctgattgccgactccagtgccag tgtcagaatggtggcacttgtgaccggttcagtggttgtgtctgcccctctgggtggcat ggagtgcactgtgagaagtcagaccggatcccccagatcctcaacatggcctcagaactg gagttcaacttagagacgatgccccggatcaactgtgcagctgcagggaaccccttcccc gtgcggggcagcatagagctacgcaagccagacggcactgtgctcctgtccaccaaggcc attgtggagccagagaagaccacagctgagttcgaggtgccccgcttggttcttgcggac agtgggttctgggagtgccgtgtgtccacatctggcggccaagacagccggcgcttcaag gtcaatgtgaaagtgccccccgtgcccctggctgcacctcggctcctgaccaagcagagc cgccagcttgtggtctccccgctggtctcgttctctggggatggacccatctccactgtc cgcctgcactaccggccccaggacagtaccatggactggtcgaccattgtggtggacccc agtgagaacgtgacgttaatgaacctgaggccaaagacaggatacagtgttcgtgtgcag ctgagccggccaggggaaggaggagagggggcctgggggcctcccaccctcatgaccaca 
gactgtcctgagcctttgttgcagccgtggttggagggctggcatgtggaaggcactgac cggctgcgagtgagctggtccttgcccttggtgcccgggccactggtgggcgacggtttc ctgctgcgcctgtgggacgggacacgggggcaggagcggcgggagaacgtctcatccccc caggcccgcactgccctcctgacgggactcacgcctggcacccactaccagctggatgtg cagctctaccactgcaccctcctgggcccggcctcgccccctgcacacgtgcttctgccc cccagtgggcctccagccccccgacacctccacgcccaggccctctcagactccgagatc cagctgacatggaagcacccggaggctctgcctgggccaatatccaagtacgttgtggag gtgcaggtggctgggggtgcaggagacccactgtggatagacgtggacaggcctgaggag acaagcaccatcatccgtggcctcaacgccagcacgcgctacctcttccgcatgcgggcc agcattcaggggctcggggactggagcaacacagtagaagagtccaccctgggcaacggg ctgcaggctgagggcccagtccaagagagccgggcagctgaagagggcctggatcagcag ctgatcctggcggtggtgggctccgtgtctgccacctgcctcaccatcctggctgccctt ttaaccctggtgtgcatccgcagaagctgcctgcatcggagacgcaccttcacctaccag tcaggctcgggcgaggagaccatcctgcagttcagctcagggaccttgacacttacccgg cggccaaaactgcagcccgagcccctgagctacccagtgctagagtgggaggacatcacc tttgaggacctcatcggggaggggaacttcggccaggtcatccgggccatgatcaagaag gacgggctgaagatgaacgcagccatcaaaatgctgaaagagtatgcctctgaaaatgac catcgtgactttgcgggagaactggaagttctgtgcaaattggggcatcaccccaacatc atcaacctcctgggggcctgtaagaaccgaggttacttgtatatcgctattgaatatgcc ccctacgggaacctgctagattttctgcggaaaagccgggtcctagagactgacccagct tttgctcgagagcatgggacagcctctacccttagctcccggcagctgctgcgtttcgcc agtgatgcggccaatggcatgcagtacctgagtgagaagcagttcatccacagggacctg gctgcccggaatgtgctggtcggagagaacctagcctccaagattgcagacttcggcctt tctcggggagaggaggtttatgtgaagaagacgatggggcgtctccctgtgcgctggatg gccattgagtccctgaactacagtgtctataccaccaagagtgatgtctggtcctttgga gtccttctttgggagatagtgagccttggaggtacaccctactgtggcatgacctgtgcc gagctctatgaaaagctgccccagggctaccgcatggagcagcctcgaaactgtgacgat gaagtgtacgagctgatgcgtcagtgctggcgggaccgtccctatgagcgaccccccttt gcccagattgcgctacagctaggccgcatgctggaagccaggaaggcctatgtgaacatg tcgctgtttgagaacttcacttacgcgggcattgatgccacagctgaggaggcctga >gi568815597f:43201072_43422719|GENSCAN_predicted_peptide_5|1097_aa MPSWALFMVTSCLLLAPQNLAQVSSQDVSLLASDSEPLKCFSRTFEDLTCFWDEEEAAPS 
GTYQLLYAYPRRDLFYANREKPRACPLSSQSMPHFGTRYVCQFPDQEEVRLFFPLHLWVK NVFLNQTRTQRVLFVDSVGLPAPPSIIKAMGGSQPGELQISWEEPAPEISDFLRYELRYG PRDPKNSTGPTVIQLIATETCCPALQRPHSASALDQSPCAQPTMPWQDGPKQTSPRLQPG NSYWLQLRSEPDGISLGGSWGSWSLPVTVDLPGDAVALGLQCFTLDLKNVTCQWQQQDHA SSQGFFYHSRARCCPRDRYPIWENCEEEEKTNPGLQTPQFSRCHFKSRNDSIIHILVEVT TAPGTVHSYLGSPFWIHQAVPTPTESDPVPRIPNSDPSDRWLWWHNALCTEGLKLLPADI PVVRLPTPNLHWREISSGHLELEWQHPSSWAAQETCYQLRYTGEGHQDWKVLEPPLGARG GTLELRPRSRYRLQLRARLNGPTYQGPWSSWSDPTRVETATETAWISLVTALHLVLGLSA VLGLLLLRWQFPAHYRRLRHALWPSLPDLHRVLGQYLRDTAALSPAPTARTPPPAGAPMA QFAFESDLHSLLQLDAPIPNAPPARWQRKAKEAAGPAPSPMRAANRSHSAGRTPGRTPGK SSSKVQTTPSKPGGDRYIPHRSAAQMEVASFLLSKENQPENSQTPTKKEHQKAWALNLNG FDVEEAKILRLSGKPQNAPEGYQNRLKVLYSQKATPGSSRKTCRYIPSLPDRILDAPEIR NDYYLNLVDWSSGNVLAVALDNSVYLWSASSGDILQLLQMEQPGEYISSVAWIKEGNYLA VGTSSAEVQLWDVQQQKRLRNMTSHSARVGSLSWNSYILSSGSRSGHIHHHDVRVAEHHV ATLSGHSQEVCGLRWAPDGRHLASGGNDNLVNVWPSAPGEGGWVPLQTFTQHQGAVKAVA WCPWQSNVLATGGGTSDRHIRIWNVCSGACLSAVDAHSQVCSILWSPHYKELISGHGFAQ NQLVIWKYPTMAKVAELKGHTSRVLSLTMSPDGATVASAAADETLRLWRCFELDPARRRE REKASAAKSSLIHQGIR >gi568815597f:43201072_43422719|GENSCAN_predicted_CDS_5|3294_bp atgccctcctgggccctcttcatggtcacctcctgcctcctcctggcccctcaaaacctg gcccaagtcagcagccaagatgtctccttgctggcatcagactcagagcccctgaagtgt ttctcccgaacatttgaggacctcacttgcttctgggatgaggaagaggcagcgcccagt gggacataccagctgctgtatgcctacccgcggagggacctcttctatgccaacagggag aagccccgtgcttgccccctgagttcccagagcatgccccactttggaacccgatacgtg tgccagtttccagaccaggaggaagtgcgtctcttctttccgctgcacctctgggtgaag aatgtgttcctaaaccagactcggactcagcgagtcctctttgtggacagtgtaggcctg ccggctccccccagtatcatcaaggccatgggtgggagccagccaggggaacttcagatc agctgggaggagccagctccagaaatcagtgatttcctgaggtacgaactccgctatggc cccagagatcccaagaactccactggtcccacggtcatacagctgattgccacagaaacc tgctgccctgctctgcagaggcctcactcagcctctgctctggaccagtctccatgtgct cagcccacaatgccctggcaagatggaccaaagcagacctccccaagactccagcctggc aactcctactggctgcagctgcgcagcgaacctgatgggatctccctcggtggctcctgg ggatcctggtccctccctgtgactgtggacctgcctggagatgcagtggcacttggactg 
caatgctttaccttggacctgaagaatgttacctgtcaatggcagcaacaggaccatgct agctcccaaggcttcttctaccacagcagggcacggtgctgccccagagacaggtacccc atctgggagaactgcgaagaggaagagaaaacaaatccaggactacagaccccacagttc tctcgctgccacttcaagtcacgaaatgacagcattattcacatccttgtggaggtgacc acagccccgggtactgttcacagctacctgggctcccctttctggatccaccaggctgtt cccacccccactgaatctgaccctgtgcccaggatccccaactctgacccttctgaccga tggctctggtggcacaatgccttgtgcacagaaggacttaagctgctccctgctgacatc cctgtagtgcgcctccccaccccaaacttgcactggagggagatctccagtgggcatctg gaattggagtggcagcacccatcgtcctgggcagcccaagagacctgttatcaactccga tacacaggagaaggccatcaggactggaaggtgctggagccgcctctcggggcccgagga gggaccctggagctgcgcccgcgatctcgctaccgtttacagctgcgcgccaggctcaac ggccccacctaccaaggtccctggagctcgtggtcggacccaactagggtggagaccgcc accgagaccgcctggatctccttggtgaccgctctgcatctagtgctgggcctcagcgcc gtcctgggcctgctgctgctgaggtggcagtttcctgcacactacaggagactgaggcat gccctgtggccctcacttccagacctgcaccgggtcctaggccagtaccttagggacact gcagccctgagcccggcaccaactgcaaggacccctccccctgcgggcgctcccatggca cagttcgcgttcgagagtgacctgcactcgctgcttcagctggatgcacccatccccaat gcaccccctgcgcgctggcagcgcaaagccaaggaagccgcaggcccggccccctcaccc atgcgggccgccaaccgatcccacagcgccggcaggactccgggccgaactcctggcaaa tccagttccaaggttcagaccactcctagcaaacctggcggtgaccgctatatcccccat cgcagtgctgcccagatggaggtggccagcttcctcctgagcaaggagaaccagcctgaa aacagccagacgcccaccaagaaggaacatcagaaagcctgggctttgaacctgaacggt tttgatgtagaggaagccaagatccttcggctcagtggaaaaccacaaaatgcgccagag ggttatcagaacagactgaaagtactctacagccaaaaggccactcctggctccagccgg aagacctgccgttacattccttccctgccagaccgtatcctggatgcgcctgaaatccga aatgactattacctgaaccttgtggattggagttctgggaatgtactggccgtggcactg gacaacagtgtgtacctgtggagtgcaagctctggtgacatcctgcagcttttgcaaatg gagcagcctggggaatatatatcctctgtggcctggatcaaagagggcaactacttggct gtgggcaccagcagtgctgaggtgcagctatgggatgtgcagcagcagaaacggcttcga aatatgaccagtcactctgcccgagtgggctccctaagctggaacagctatatcctgtcc agtggttcacgttctggccacatccaccaccatgatgttcgggtagcagaacaccatgtg gccacactgagtggccacagccaggaagtgtgtgggctgcgctgggccccagatggacga 
catttggccagtggtggtaatgataacttggtcaatgtgtggcctagtgctcctggagag ggtggctgggttcctctgcagacattcacccagcatcaaggggctgtcaaggccgtagca tggtgtccctggcagtccaatgtcctggcaacaggagggggcaccagtgatcgacacatt cgcatctggaatgtgtgctctggggcctgtctgagtgccgtggatgcccattcccaggtg tgctccatcctctggtctccccattacaaggagctcatctcaggccatggctttgcacag aaccagctagttatttggaagtacccaaccatggccaaggtggctgaactcaaaggtcac acatcccgggtcctgagtctgaccatgagcccagatggggccacagtggcatccgcagca gcagatgagaccctgaggctatggcgctgttttgagttggaccctgcgcggcggcgggag cgggagaaggccagtgcagccaaaagcagcctcatccaccaaggcatccgctga >gi568815597f:43201072_43422719|GENSCAN_predicted_peptide_6|710_aa MVFVDEEFLSFIHFIFQSFMDKGAIMSSSNSKKNKGKQEPEVGGRAEFQFSDWWTKPSRR RLLTGKPEVKSWQPPRPPQCSPPFYVSYSFIWVTNLFLLVQREEKQLEASLDALLSQVAD LKNSLGSFICKLENEYGRLTWPSVLDSFALLSGQLNTLNKVLKHEKTPLFRNQVIIPLVL SPDRDEDLMRQTEGRVPVFSHEVVPDHLRTKPDPEVEEQEKQLTTDAARIGADAAQKQIQ SLNKMCSNLLEKISKEERESESGGMMGLVAKQRRRGLRPNKQTFNPTDTNALVAAVAFGK GLSNWRPSGSSGPGQAGQPGAGTILAGTSGLQQVQMAGAPSQQQPMLSGVQMAQAGQPGK MPSGIKTNIKSASMHPYQREEPVLAPSPRPGPCRRWAASKPDGGGASKEGIRAWPGSAHT CVSGPAESLARMEAVVNLYQEVMKHADPRIQGYPLMGSPLLMTSILLTYVYFVLSLGPRI MANRKPFQLRGFMIVYNFSLVALSLYIVYEFLMSGWLSTYTWRCDPVDYSNSPEALRMVR VAWLFLFSKFIELMDTVIFILRKKDGQVTFLHVFHHSVLPWSWWWGVKIAPGGMGSFHAM INSSVHVIMYLYYGLSAFGPVAQPYLWWKKHMTAIQLIQFVLVSLHISQYYFMSSCNYQY PVIIHLIWMYGTIFFMLFSNFWYHSYTKGKRLPRALQQNGAPGIAKVKAN >gi568815597f:43201072_43422719|GENSCAN_predicted_CDS_6|2133_bp atggtttttgttgacgaagagttcctgtcctttattcacttcatctttcaatcatttatg gacaagggtgccatcatgtcctcatcgaactccaaaaagaacaaaggcaagcaggaaccg gaagttggtgggagggccgagttccagttttctgattggtggacgaagccttctcgtagg cgtttgctgactggaaaaccggaagtgaaatcgtggcagccgcctcggccgccgcaatgc agccctcctttctatgtgtcatactcgtttatctgggtgaccaacctgttcctattggtg cagagagaggagaagcagcttgaggcatcattagatgcactgctgagtcaagtggctgat ctgaagaactctctggggagtttcatttgcaagttggagaacgagtatggccggctgacc tggccatctgtcctggacagctttgccttgctttctggacagctgaacactctgaacaag gtcttgaagcatgaaaaaacaccgctgttccgtaaccaggtcatcattcctctggtgttg 
tctccagaccgagatgaagatctcatgcggcagactgaaggacgggtgcctgttttcagc catgaggtagtccctgaccatctgagaaccaagcctgaccctgaagtggaagaacaggag aagcaactgacgacagatgctgcccgcattggtgcagatgcagcccagaagcagatccag agcttgaataaaatgtgttcaaaccttctggagaaaatcagcaaagaggagcgagaatca gagagtggaggtatgatgggactggtggctaaacagagaaggagaggtctccggccgaac aagcagacctttaaccctacagacactaatgccttggtggcagctgttgcctttgggaaa ggactatctaattggagaccttcaggcagcagtggtcctggccaggcaggccagccagga gctgggacgatccttgcaggaacctcaggattacagcaggtgcagatggcaggagctcca agccagcagcagccaatgctcagtggggtacaaatggctcaggcaggtcaaccagggaaa atgccaagtggaataaaaaccaacatcaagtcggcttccatgcatccctaccagcgggag gaacctgtgttggcgccctcgccccggcctgggccctgccggcgatgggcggccagcaag cccgatggtgggggagcaagtaaggaggggatccgagcgtggccaggcagcgcgcacacg tgtgtgagtggccccgcggagtccttagccaggatggaggctgttgtgaacttgtaccaa gaggtgatgaagcacgcagatccccggatccagggctaccctctgatggggtcccccttg ctaatgacctccattctcctgacctacgtgtacttcgttctctcacttgggcctcgcatc atggctaatcggaagcccttccagctccgtggcttcatgattgtctacaacttctcactg gtggcactctccctctacattgtctatgagttcctgatgtcgggctggctgagcacctat acctggcgctgtgaccctgtggactattccaacagccctgaggcacttaggatggttcgg gtggcctggctcttcctcttctccaagttcattgagctgatggacacagtgatctttatt ctccgaaagaaagacgggcaggtgaccttcctacatgtcttccatcactctgtgcttccc tggagctggtggtggggggtaaagattgccccgggaggaatgggctctttccatgccatg ataaactcttccgtgcatgtcataatgtacctgtactacggattatctgcctttggccct gtggcacaaccctacctttggtggaaaaagcacatgacagccattcagctgatccagttt gtcctggtctcactgcacatctcccagtactactttatgtccagctgtaactaccagtac ccagtcattattcacctcatctggatgtatggcaccatcttcttcatgctgttctccaac ttctggtatcactcttataccaagggcaagcggctgccccgtgcacttcagcaaaatgga gctccaggtattgccaaggtcaaggccaactga >gi568815597f:43201072_43422719|GENSCAN_predicted_peptide_7|658_aa MEARTRGLTTEKCGGGDCDPQRFDFEVEEAGQVFLLMKKDYRISRNVRLAWFLSHLHQTV QATPQEMLLQSEQELEVLSVLPPGWQPDEPVVPRPFLLVPSTRVTFLAWQYRFVIELDLS PSTGIVDDSTGEILFDEVFHALSRCLGGLLRPFRVPGSCIDFQPEIYVTIQAYSSIIGLQ SHQVLVQGCLLDPSQREVFLQQIYEQLCLFEDKVATMLQQQYDPQSQAEDQSPDSGDLLG RKVGVSMVTADLGLVSMIRQGILALQLLPSNSSAGIIVITDGVTSVPDVAVCETLLNQLR 
SGTVACSFVQVGGVYSYDCSFGHVPNVELMKFIAMATFGSYLSTCPEPEPGNLGLTVYHR AFLLYSFLRSGEALNPEYYCGSQHRLFNEHLVSASSNPALALRRKKHTEKEVPADLVSTV SVRLREGYSVREVTLAKGGSQLEVKLVLLWKHNMRIEYVAMAPWPLEPEGPRVTRVEVTM EGGYDILHDVSCALRQPIRSLYRTHVIRRFWNTLQSINQTDQMLAHLQSFSSVPEHFTLP DSTKSGVPLFYIPPGSTTPVLSLQPSGSDSSHAQFAAYWKPVLSMDANSWQRWLHMHRLV LILEHDTPIPKHLHTPGSNGRYSTIQCRISHSSLTSLLRDWSSFVLVEGYSYVKLLSS >gi568815597f:43201072_43422719|GENSCAN_predicted_CDS_7|1974_bp atggaggcaagaactagagggctgacaactgagaagtgcgggggtggggactgtgaccca cagaggtttgattttgaggtggaagaagctgggcaggtgttcctgttaatgaaaaaggat tatcgaatctcccgaaatgttcgcctggcttggttcctcagtcatctgcaccaaactgtg caggccacaccccaggagatgctgcttcagtctgaacaggaattggaagtcctcagtgtc ctgccccctgggtggcagccagatgaaccagtggtcccaaggccattcctcctggtacct tccacccgggtcaccttcctggcttggcagtatcggtttgtcattgagttggaccttagc ccatctactggcattgtggatgattccacaggggagatcttgtttgatgaagttttccat gccctgtcccgctgcttaggcgggctgcttcggcccttccgagtgcctggatcttgcatc gacttccagcctgagatctatgtaactatccaggcctactcctccatcattggactgcag tcccaccaggtgctggtacagggctgcctcttggacccttcccagcgggaggtgttcctg cagcagatatatgagcagctctgcctctttgaggataaggtggccaccatgctgcagcag cagtacgatccccagagccaggcagaagaccagtccccagactcaggggacctactgggc cggaaggtaggcgtctccatggtgacagctgatcttgggctggtcagtatgattcgtcag ggcatcttggcactgcagttactaccctcgaactctagtgcagggattatcgtgatcacg gatggggtgaccagtgtacctgatgttgctgtctgtgagacactgctgaaccagcttcgc agtggcactgtggcttgttcctttgtccaggtgggaggagtttactcttatgactgcagt tttggccatgtgcccaatgtggaattaatgaagttcatcgcaatggcaacatttgggtcc tacctgtccacttgtcctgagccggagccaggcaacctgggtctgactgtctaccaccgg gcatttctcctctattccttcctgcgcagtggggaagcactgaaccctgaatattactgc ggctctcagcaccgcctatttaatgagcacctggtctctgcaagcagcaaccctgccctg gccttgcgccggaagaagcacactgagaaggaggtgccagccgacttggtcagcactgtg tccgtacggcttcgagagggctacagtgtccgagaggtcacactggccaaaggagggtcc caattggaggtaaagctggtgctgctgtggaaacacaacatgcgcattgagtatgtggct atggcaccctggcccctggagcctgagggccctcgagtaacacgggtggaagtgacgatg gaaggcggctacgacattttgcatgatgtgtcctgtgcactaaggcagcccattcgttca 
ttgtatcgtacccatgttatccggcgtttctggaacacgctgcagagcatcaaccagaca gaccagatgcttgcccaccttcagtccttctcctcagtgcctgagcatttcacgcttcct gacagcaccaagagcggagtgccactcttctacatccctccaggctccaccaccccggtg ctctccctccagcccagtggttctgactcatcccatgcccagtttgctgcctactggaag ccagtgctgtccatggatgcaaattcctggcagcgatggctgcacatgcatcgcctggtg ctaatcctggagcatgacacaccaatccccaagcacttgcacaccccgggcagcaatggg cgctacagcactatccagtgcaggatctcccactcctccctgacctctctgctgcgggac tggagcagcttcgtactagtcgagggctattcttatgttaagctgctctccagn