GENSCAN 1.0    Date run: 16-Aug-121    Time: 14:36:12

Sequence gi568815581r:3797817_4016225 : 218409 bp : 52.71% C+G : Isochore 3 (51 - 57 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.17 Intr -    248    114  135  2  0   53   17   140 0.016   4.67
 1.16 Intr -   7758   7723   36  0  0   80   89    32 0.012   1.44
 1.15 Intr -  10664  10563  102  2  0   48   86    79 0.020   4.57
 1.14 Intr -  11470  11365  106  0  1   81   50    29 0.017  -0.88
 1.13 Intr -  15707  15464  244  0  1   77   91    70 0.257   3.29
 1.12 Intr -  16667  16506  162  0  0  118   77    39 0.924   6.26
 1.11 Intr -  18454  18300  155  0  2   71   84   121 0.914  10.23
 1.10 Intr -  20756  20447  310  0  1   74   66   434 0.905  35.72
 1.09 Intr -  23536  23433  104  0  2   69   28   112 0.971   3.62
 1.08 Intr -  24236  24137  100  0  1   43  101    58 0.973   2.27
 1.07 Intr -  27216  27126   91  0  1   54   94     6 0.544  -2.13
 1.06 Intr -  28027  27951   77  2  2   58   88   123 0.990   9.03
 1.05 Intr -  28399  28271  129  2  0   76   87    53 0.930   5.17
 1.04 Intr -  31552  31427  126  2  0   90   95    48 0.877   6.86
 1.03 Intr -  42389  42284  106  2  1  111   87   144 0.999  16.89
 1.02 Intr -  45335  45270   66  2  0   80  116    56 0.993   7.19
 1.01 Init -  48407  48225  183  2  0  108   80   375 0.994  35.79
 1.00 Prom -  51693  51654   40                              -6.90

 2.23 PlyA -  52463  52458    6                               1.05
 2.22 Term -  54224  54048  177  2  0   32   38   189 0.963   6.30
 2.21 Intr -  59462  59358  105  2  0  124   90    28 0.821   7.51
 2.20 Intr -  63331  63087  245  2  2   15   64   235 0.669  11.55
 2.19 Intr -  63734  63382  353  1  2   57   52   157 0.695   4.52
 2.18 Intr -  68452  68092  361  2  1   84   78   150 0.304   8.44
 2.17 Intr -  70554  70389  166  0  1   61   21    91 0.485  -0.25
 2.16 Intr -  71799  71671  129  0  0   94   83    95 0.984  11.00
 2.15 Intr -  72072  71985   88  2  1  121   58   159 0.978  16.67
 2.14 Intr -  74811  74738   74  0  2   93   64   131 0.999   9.90
 2.13 Intr -  75646  75593   54  1  0  104  113    71 0.985  10.86
 2.12 Intr -  78606  78407  200  1  2  106   57   350 0.999  33.29
 2.11 Intr -  79746  79672   75  1  0  121   44    35 0.810   2.38
 2.10 Intr -  82618  82530   89  0  2  115   74   209 0.849  22.31
 2.09 Intr -  83898  83811   88  1  1   50   84    39 0.492  -0.77
 2.08 Intr -  84554  84441  114  0  0   28   76   106 0.930   4.32
 2.07 Intr -  84748  84712   37  1  1   92   92    47 0.929   3.82
 2.06 Intr -  85359  85226  134  1  2   77   75   201 0.511  18.57
 2.05 Intr -  85664  85613   52  2  1  105   60    43 0.998   2.17
 2.04 Intr -  86148  86068   81  0  0   43   73   109 0.802   5.23
 2.03 Intr -  86611  86564   48  1  0  122   97   110 0.995  14.66
 2.02 Intr -  87914  87512  403  1  1  105   73   237 0.957  18.90
 2.01 Init -  90909  90722  188  0  2   81   77   120 0.962   6.71
 2.00 Prom -  93536  93497   40                              -3.91

 3.15 PlyA -  98796  98791    6                               1.05
 3.14 Term -  99656  99627   30  2  0   95   41    32 0.634  -2.66
 3.13 Intr - 100294 100193  102  1  0   83   95   127 0.918  13.77
 3.12 Intr - 100733 100668   66  2  0   72  103   139 0.998  13.39
 3.11 Intr - 101208 101118   91  2  1   88  113   201 0.993  23.10
 3.10 Intr - 101945 101818  128  2  2   85   39   203 0.999  15.08
 3.09 Intr - 105527 105386  142  1  1   90   86   165 0.992  17.36
 3.08 Intr - 105815 105735   81  1  0   46   96   106 0.987   6.45
 3.07 Intr - 106208 106112   97  0  1   49   34   253 0.957  15.57
 3.06 Intr - 106583 106514   70  2  1   63  116    57 0.994   5.25
 3.05 Intr - 107113 107042   72  1  0  116   78   143 0.999  16.20
 3.04 Intr - 107551 107404  148  0  1   85   99   201 0.999  21.65
 3.03 Intr - 113611 113560   52  2  1  113   44    12 0.248  -2.25
 3.02 Intr - 115078 114924  155  0  2   86   59    60 0.768   3.13
 3.01 Init - 118409 118273  137  2  2  104  100   297 0.969  32.08
 3.00 Prom - 119587 119548   40                              -6.50

 4.27 PlyA - 120865 120860    6                               1.05
 4.26 Term - 123597 123329  269  1  2   72   53   160 0.433   6.89
 4.25 Intr - 126849 126655  195  1  0   66   56   177 0.738  12.21
 4.24 Intr - 130964 130847  118  2  1  106  113   151 0.975  19.94
 4.23 Intr - 131629 131512  118  0  1   59   76   267 0.999  23.57
 4.22 Intr - 132618 132485  134  0  2   73   96   279 0.999  27.15
 4.21 Intr - 137461 137376   86  2  2  104   93   113 0.991  13.44
 4.20 Intr - 138653 138451  203  1  2   35  105   388 0.999  34.65
 4.19 Intr - 139820 139600  221  2  2   65   81   523 0.999  46.93
 4.18 Intr - 143490 143155  336  0  0  108   77   660 0.975  63.17
 4.17 Intr - 143838 143620  219  0  0  104   90   300 0.973  30.83
 4.16 Intr - 144915 144790  126  0  0   94   81   170 0.997  18.28
 4.15 Intr - 145706 145575  132  2  0   96  101   178 0.998  21.25
 4.14 Intr - 146990 146888  103  1  1   72   64   180 0.992  14.68
 4.13 Intr - 147332 147244   89  2  2   91   65   143 0.999  11.57
 4.12 Intr - 150039 149575  465  0  0   85  100   940 0.856  88.81
 4.11 Intr - 152780 152695   86  0  2   80   84   124 0.999  11.24
 4.10 Intr - 152957 152877   81  0  0   65   76   154 0.833  12.01
 4.09 Intr - 153573 153435  139  0  1  108   85   343 0.999  36.54
 4.08 Intr - 153869 153765  105  2  0   61   75   262 0.983  23.11
 4.07 Intr - 155613 155531   83  1  2   74   78   163 0.976  13.75
 4.06 Intr - 166534 166358  177  2  0   69  101   270 0.090  26.81
 4.05 Intr - 179442 179332  111  1  0   78   97    56 0.048   6.35
 4.04 Intr - 182899 182771  129  2  0   55   94    55 0.127   3.97
 4.03 Intr - 183635 183499  137  1  2   75   93   -18 0.730  -1.88
 4.02 Intr - 183849 183687  163  1  1  106   96    23 0.837   4.45
 4.01 Init - 184852 184777   76  1  1   93   46   134 0.986   8.90
 4.00 Prom - 199858 199819   40                              -2.81

 5.07 PlyA - 200010 200005    6                               1.05
 5.06 Term - 209154 209074   81  0  0  113   41   115 0.915   7.29
 5.05 Intr - 211138 211067   72  1  0  121   78   111 0.981  13.50
 5.04 Intr - 211941 211788  154  2  1   97   94   255 0.990  27.59
 5.03 Intr - 215798 215633  166  0  1   20  115    54 0.978   0.83
 5.02 Intr - 216372 216274   99  1  0   85   89   214 0.561  21.78
 5.01 Intr - 216699 216531  169  0  1  105   72   203 0.999  20.43

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  11374  11515  142  0  1   43   44   172 0.895   8.56
S.002 Init - 166702 166608   95  1  2   70   90   163 0.840  12.51
S.003 Intr + 178971 179071  101  2  2  100   71    37 0.846   2.61
S.004 Intr + 179103 179242  140  0  2   88   75   125 0.933  11.82
S.005 Term + 180707 180846  140  0  2  106   47    53 0.852   1.43

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815581r:3797817_4016225|GENSCAN_predicted_peptide_1|744_aa
MAAVRGLRVSVKAEAPAGPALGLPSPEAESGVDRGEPEPMEVEEGELEIVPVRRSLKELI
PDTSRRYENKAGSFITGIDVTSKEAIEKKEQRAKRFHFRSEVNLAQRNVALDRDMMKKAI
PKVRLETIYICGVDEMSTQDVFSYFKEYPPAHIEWLDDTSCNVVWLDEMTATRALINMSS
LPAQDKIRSRDASEDKSAEKRKKDKQEDSSDDDEAEEGEVEDENSSDVEVEEESLLRNDL
RPANKLAKGNRLFMRFATKDDKKELGAARRSQYYMKYGNPNYGGMKGILSNSWKRRYHSR
RIQRDVIKKRALIGDDVGLTSYKHRHSGLVNVPEEPIEEEEEEEEEEEEEEEEDQDMDAD
DRVVVEYHEELPALKQPRERSASRRSSASSSDSDEMDYDLELKMISTPSPKKSMKMTMYA
DEVESQLKNIRNSMRADSVSSSNIKNRIGNKLPPEKFADVRHLLDEKRQHSRPRPPVSST
KSDIRQRLGKRPHSPEKAFSSNPVVRREPSSDVHSRLGVPRQDSKGLYADTREKKSGRDL
TCPPTVRIPCWVQTLPFAGRGCGPGNEILPAQGCTVVAGSTDCWRFLHSRCWQCCCPLSA
TLVLSADCESEMPSHPNRNHQVVFPSSCIIFHSTSSVRGFQFLHISTNTHYFLEAVPNFQ
SQGIGYFYVSHGTATEESQLSPGFSSKSPTDLRPEDANMVWYKWRNAGIQSREAHGGGGE
ERGDLSAGLGELGVPAEGSGEPQK

>gi568815581r:3797817_4016225|GENSCAN_predicted_CDS_1|2232_bp
atggcggccgtacggggcctgcgggtgtcggtgaaggcggaggccccggcggggccggcc
ctggggctcccgtcccctgaggcggagtccggtgttgaccgtggcgagccggagcccatg
gaggtggaggagggcgagctggaaatcgtgcctgtgcggcgctcgctcaaggaactgatc
ccggacacgagcagaagatatgaaaacaaggctggcagcttcatcactggaattgatgtc
acctccaaggaagcaattgaaaagaaagagcagcgagccaagcgcttccattttcgatcg
gaagtaaatcttgcccaaagaaatgtagccttggaccgagacatgatgaagaaagcaatc
cccaaggtgagactggagacaatctatatttgcggagtagatgagatgagcacccaagat
gtcttttcctattttaaagaatatcctccagctcacatcgaatggttggatgatacctcc
tgtaatgtagtttggctggatgaaatgacagccacacgagcacttatcaatatgagctcc
ctgcctgcacaggataagatcagaagcagggatgccagtgaggacaagtcagctgagaaa
aggaaaaaagacaagcaggaagacagttcagatgatgatgaagctgaagaaggagaggtt
gaagatgagaactcaagtgatgtagaggtagaagaggagtctttgttaagaaacgatctt cgtccagctaacaaacttgctaaaggaaataggttattcatgagatttgctacaaaagat gacaaaaaggaacttggagcagccagaagaagtcagtattacatgaaatatgggaatcca aattatggaggcatgaaaggaattcttagcaattcatggaagcgaagatatcattcccgt cgtattcagcgggacgtgatcaagaagagagccctgattggggatgacgttggcttgacg tcgtataaacatcgacattctgggctagtgaatgttcccgaggaacccattgaagaggag gaagaggaggaggaggaggaagaggaagaggaagaagaagaccaggacatggatgcagat gacagagtggtggtagagtaccacgaggagctcccggctctcaagcagccccgggagcgg agcgcgtctagacgatccagtgccagcagctcagactcagatgaaatggactatgatcta gaactgaaaatgatttccacgccttcaccaaagaaaagcatgaaaatgactatgtatgct gacgaagtggaatctcagttgaaaaatattaggaactccatgagggcagatagtgtatct tcaagcaatatcaaaaaccgaattggtaacaaattaccacctgagaaatttgcagatgtc cgacatctattagatgagaaacgtcagcactcccgtccacggccaccagtcagcagtact aaatcagatatacgccagcggttaggaaaaagaccacattctccggaaaaggcttttagt agtaaccccgtcgttcggagagagccctcttctgatgtgcatagtaggctaggtgttccc aggcaggatagtaaaggcctctacgccgatactcgggagaagaaatcagggagggacctg acctgtccccccacggtgagaatcccgtgctgggtccagacccttccgtttgcgggcagg ggctgcggcccgggaaatgagattttgcctgcccaaggctgcactgtggtggcaggctcc acagactgctggcgtttcctgcacagcaggtgttggcagtgctgctgccccctcagtgct accctggttcttagcgctgactgcgaaagtgaaatgccatctcatccaaacaggaaccac caggtggtttttccaagcagctgtatcattttccattccaccagcagtgtgcgagggttc cagtttctccacatctccaccaacactcattattttctggaggcagtccccaacttccag agccagggtataggctacttctatgtgtcccatggtactgccacagaagagtctcagctc tctccaggattcagttctaagtcccccacagatctgaggccggaggatgccaacatggtg tggtataagtggaggaatgcgggcatccagtcccgagaggcccacggtggaggtggtgag gaacgtggggacctgagtgccggcctgggggagcttggagttcctgctgagggcagtggg gagccacagaag >gi568815581r:3797817_4016225|GENSCAN_predicted_peptide_2|1086_aa MGSLTSALAAAFRPPAHPSGPQRVPSNHNSSRTTTTHFYGLLSGRHCSERLTGVPSSASH HVRFPTRLRRRTPLTEAMEGGPAVCCQDPRAELVERVAAIDVTHLEEADGGPEPTRNGVD PPPRARAASVIPGSTSRLLPARPSLSARKLSLQERPAGSYLEAQAGPYATGPASHISPRA WRRPTIESHHVAISDAEDCVQLNQYKLQSEIGKVGLTDAYLQGAYGVVRLAYNESEDRHY AMKVLSKKKLLKQYGFPRRPPPRGSQAAQGGPAKQLLPLERVYQEIAILKKLDHVNVVKL 
IEVLDDPAEDNLYLALQNQAQNIQLDSTNIAKPHSLLPSEQQDSGSTWAARSGRDLGIGC FASQLHLTFLSFLSVFDLLRKGPVMEVPCDKPFSEEQARLYLRDVILGLEYSSWGPLVQM KVLAKEGAHRSMVAGWVHCQKIVHRDIKPSNLLLGDDGHVKIADFGVSNQFEGNDAQLSS TAGTPAFMAPEAISDSGQSFSGKALDVWATGVTLYCFVYGKCPFIDDFILALHRKIKNEP VVFPEEPEISEELKDLILKMLDKNPETRIGVPDIKLHPWVTKNGEEPLPSEEEHCSVVEV TEEEVKNSVRLIPSWTTVRPRINQAVPASPPDPVYQLDGACISPVPTYQLDGACFSPRAH VSVRRRLLLPQSPPWPCGPFMGWSRPPPHVPVASGSSPFQEYAPSCQRPVFLPLPFHAVL IEMGPALRPPWQLLSGNRSRGCVSGSLGPVLLTLPCPPQILVKSMLRKRSFGNPFEPQAR REERSMSAPGNLLVTAQDYLAAKWMELTFLDCVSHSALSGKWTERNQALSRGLKRKHHIV PAHLTLSLDDAPRAEAGEDTRADFDRGPWTRQACGPAHAEITGTQPLLLHRPTHLGPRAA LTLLQRRTNCPGGLCGNCLTLRTSAVSLLRAGVLERVQTQLFRSSLHLLSGISVRLADVA FADRMLIEVDPHRVTQDSDTALETWMDKGFWPQAQPFRRDTASPQSLPLLVKLDPALQGL FSSWKNPEGGSERRNGPALLHSKEDVQEAAGHAHLELEGEARTGNGDAGALSVWTAVELE AMRVMS >gi568815581r:3797817_4016225|GENSCAN_predicted_CDS_2|3261_bp atggggagcctgacgtcagccctcgcggccgccttccgcccgcccgcgcatccatctggg cctcagcgtgtcccgagcaatcacaacagcagccgcacaacaacaactcacttttacggc ctccttagtggcaggcactgttctgagcgccttacgggcgttccctcctcagcatctcac cacgtgcggttcccaacaaggctacgcagaagaacccccttgactgaagcaatggagggg ggtccagctgtctgctgccaggatcctcgggcagagctggtagaacgggtggcagccatc gatgtgactcacttggaggaggcagatggtggcccagagcctactagaaacggtgtggac cccccaccacgggccagagctgcctctgtgatccctggcagtacttcaagactgctccca gcccggcctagcctctcagccaggaagctttccctacaggagcggccagcaggaagctat ctggaggcgcaggctgggccttatgccacggggcctgccagccacatctccccccgggcc tggcggaggcccaccatcgagtcccaccacgtggccatctcagatgcagaggactgcgtg cagctgaaccagtacaagctgcagagtgagattggcaaggtggggctgactgatgcctat ctgcagggtgcctacggtgtggtgaggctggcctacaacgaaagtgaagacagacactat gcaatgaaagtcctttccaaaaagaagttactgaagcagtatggctttccacgtcgccct cccccgagagggtcccaggctgcccagggaggaccagccaagcagctgctgcccctggag cgggtgtaccaggagattgccatcctgaagaagctggaccacgtgaatgtggtcaaactg atcgaggtcctggatgacccagctgaggacaacctctatttggccctgcagaaccaggcc cagaatatccagttagattcaacaaatatcgccaagccccactccctgcttccctctgag cagcaagacagtggatccacgtgggctgcgcgctcagggagggaccttggcatcggctgc 
tttgccagccagctacaccttaccttcttgtcttttctttcagtgtttgacctcctgaga aaggggcccgtcatggaagtgccctgtgacaagcccttctcggaggagcaagctcgcctc tacctgcgggacgtcatcctgggcctcgagtactcatcctggggtccgttggtccagatg aaggtacttgccaaggagggagcccacaggtcgatggtcgcgggatgggtgcactgccag aagatcgtccacagggacatcaagccatccaacctgctcctgggggatgatgggcacgtg aagatcgccgactttggcgtcagcaaccagtttgaggggaacgacgctcagctgtccagc acggcgggaaccccagcattcatggcccccgaggccatttctgattccggccagagcttc agtgggaaggccttggatgtatgggccactggcgtcacgttgtactgctttgtctatggg aagtgcccattcatcgacgatttcatcctggccctccacaggaagatcaagaatgagccc gtggtgtttcctgaggagccagaaatcagcgaggagctcaaggacctgatcctgaagatg ttagacaagaatcccgagacgagaattggggtgccagacatcaagttgcacccttgggtg accaagaacggggaggagccccttccttcggaggaggagcactgcagcgtggtggaggtg acagaggaggaggttaagaactcagtcaggctcatccccagctggaccacggtgcgcccg cgtatcaatcaggcagtgcctgcgtctcccccagatcctgtgtatcagttagacggtgcc tgcatctcccccgtgcccacgtatcagttagacggcgcctgcttctcccccagagcccac gtatcagttagacggcgcctgcttctcccccagagcccaccctggccttgcggacccttc atgggctggtcccggccccctcctcatgtaccagtggcatccggctcctcaccattccag gaatatgcccccagctgccagcgccccgtgttcttgcctctgccatttcatgctgtgctg attgagatgggacccgcactgcggcccccttggcagctgctctcggggaatcggagcaga ggctgcgtgtctgggagcctgggacctgtgctcctcacgctgccttgtcctcctcagatc ctggtgaagtccatgctgaggaagcgttcctttgggaacccgtttgagccccaagcacgg agggaagagcgatccatgtctgctccaggaaacctactggtcacagcccaagattatttg gcagccaagtggatggaactaactttcctggactgtgtttcgcattcggcgttatctgga aagtggactgaacggaatcaagctctgagcagaggcctgaagcggaagcaccacatcgtc cctgcccatctcactctctcccttgatgatgcccctagagctgaggctggagaagacacc agggctgactttgaccgagggccatggacgcgacaggcctgtggccctgcgcatgctgaa ataactggaacccagcctctcctcctacaccggcctacccatctgggcccaagagctgca ctcacactcctacaacgaaggacaaactgtccaggtggcctctgcggcaattgcctcacc ctgaggacatcagcagtcagcctgctcagagcgggggtgctggagcgcgtgcagacacag ctcttccggagcagccttcaccttctctctgggatcagtgtccggctggccgacgtggca tttgctgaccgaatgctcatagaggttgacccccacagggtcacgcaggactcggacact gccctggaaacatggatggacaagggcttttggccacaggcccagccattccggcgggac 
actgcctcccctcaatccctgcccctgcttgtcaagcttgaccccgccctccaaggcctg ttcagcagctggaaaaatccagagggagggagcgaacgcaggaatggacccgccttgcta cattcaaaggaagatgtccaggaggcagctggacatgcgcatctggagctcgagggagag gcgagaactggaaatggagacgcgggggccctcagcgtgtggacggccgtggagttggaa gccatgcgtgtgatgagctag >gi568815581r:3797817_4016225|GENSCAN_predicted_peptide_3|456_aa MARRFQEELAAFLFEYDTPRMVLVRNKKVGVIFRLIQLVVLVYVIGPLEITCSEISKWRS ECTNQGEGWIEIIGGRNTSRDRFCSIFSMPGAVPGNRARNTWHSQSFRRLKRSGKWVFLY EKGYQTSSGLISSVSVKLKGLAVTQLPGLGPQVWDVADYVFPAQGDNSFVVMTNFIVTPK QTQGYCAEHPEGGICKEDSGCTPGKAKRKAQGIRTGKCVAFNDTVKTCEIFGWCPVEVDD DIPRPALLREAENFTLFIKNSISFPRFKVNRRNLVEEVNAAHMKTCLFHKTLHPLCPVFQ LGYVVQESGQNFSTLAEKGGVVGITIDWHCDLDWHVRHCRPIYEFHGLYEEKNLSPGFNF RFARHFVENGTNYRHLFKVFGIRFDILVDGKAGKFDIIPTMTTIGSGIGIFGVATVLCDL LLLHILPKRHYYKQKKFKYAEDMGPGAICALRCGSQ >gi568815581r:3797817_4016225|GENSCAN_predicted_CDS_3|1371_bp atggcacggcggttccaggaggagctggccgccttcctcttcgagtatgacaccccccgc atggtgctggtgcgtaataagaaggtgggcgttatcttccgactgatccagctggtggtc ctggtctacgtcatcggacctcttgaaatcacatgttctgagatctccaagtggagatca gagtgcaccaatcagggagaaggatggatagaaattattggggggagaaacacatccagg gaccgtttctgcagcatcttctccatgcctggtgctgtgccggggaacagagcacgtaac acctggcacagccagagcttccgacgtctcaagagaagtgggaagtgggtgtttctctat gagaagggctaccagacctcgagcggcctcatcagcagtgtctctgtgaaactcaagggc ctggccgtgacccagctccctggcctcggcccccaggtctgggatgtggctgactacgtc ttcccagcccagggggacaactccttcgtggtcatgaccaatttcatcgtgaccccgaag cagactcaaggctactgcgcagagcacccagaagggggcatatgcaaggaagacagtggc tgtacccctgggaaggccaagaggaaggcccaaggcatccgcacgggcaagtgtgtggcc ttcaacgacactgtgaagacgtgtgagatctttggctggtgccccgtggaggtggatgac gacatcccgcgccctgcccttctccgagaggccgagaacttcactcttttcatcaagaac agcatcagctttccacgcttcaaggtcaacaggcgcaacctggtggaggaggtgaatgct gcccacatgaagacctgcctctttcacaagaccctgcaccccctgtgcccagtcttccag cttggctacgtggtgcaagagtcaggccagaacttcagcaccctggctgagaagggtgga gtggttggcatcaccatcgactggcactgtgacctggactggcacgtacggcactgcaga cccatctatgagttccatgggctgtacgaagagaaaaatctctccccaggcttcaacttc 
aggtttgccaggcactttgtggagaacgggaccaactaccgtcacctcttcaaggtgttt gggattcgctttgacatcctggtggacggcaaggccgggaagtttgacatcatccctaca atgaccaccatcggctctggaattggcatctttggggtggccacagttctctgtgacctg ctgctgcttcacatcctgcctaagaggcactactacaagcagaagaagttcaaatacgct gaggacatggggccaggggcgatctgtgctctccgatgtggcagtcagtaa >gi568815581r:3797817_4016225|GENSCAN_predicted_peptide_4|1366_aa MEAGSRGGAVAPGVLAFPAALEEARGAFPCMSAELHHWRLGRLERRVFHQQALGSVLSAS AREATQHQGRRSPALFPLLSSTSLKQNKTKQNITKHKYGIFATQEATKKGLGAKNVFLSG SSILDSGEMSTSGNSDSNTTPSCVPSTTPESQHEARECAELPLKPPGRDSSSRPSNSQHP IATLGMTTLTCPADPLGPLGLRWVCPPRRRGGLDGADGADGRAGGMEAAHLLPAADVLRH FSVTAEGGLSPAQVTGARERYGPNGKSLWELVLEQFEDLLVRILLLAALVSFVLAWFEEG EETTTAFVEPLVIMLILVANAIVGVWQERNAESAIEALKEYEPEMGKVIRSDRKGVQRIR ARDIVPGDIVEVAVGDKVPADLRLIEIKSTTLRVDQSILTGESVSVTKHTEAIPDPRAVN QDKKNMLFSGTNITSGKAVGVAVATGLHTELGKIRSQMAAVEPERTPLQRKLDEFGRQLS HAISVICVAVWVINIGHFADPAHGGSWLRGAVYYFKIAVALAVAAIPEGLPAVITTCLAL GTRRMARKNAIVRSLPSVETLGCTSVICSDKTGTLTTNQMSVCRMFVVAEADAGSCLLHE FTISGTTYTPEGEVRQGDQPVRCGQFDGLVELATICALCNDSALDYNEAKGVYEKVGEAT ETALTCLVEKMNVFDTDLQALSRVERAGACNTVIKQLMRKEFTLEFSRDRKSMSVYCTPT RPHPTGQGSKMFVKGAPESVIERCSSVRVGSRTAPLTPTSREQILAKIRDWGSGSDTLRC LALATRDAPPRKEDMELDDCSKFVQYETDLTFVGCVGMLDPPRPEVAACITRCYQAGIRV VMITGDNKGTAVAICRRLGIFGDTEDVAGKAYTGREFDDLSPEQQRQACRTARCFARVEP AHKSRIVENLQSFNEITAMTGDGVNDAPALKKAEIGIAMGSGTAVAKSAAEMVLSDDNFA SIVAAVEEGRAIYSNMKQFIRYLISSNVGEVVCIFLTAILGLPEALIPVQLLWVNLVTDG LPATALGFNPPDLDIMEKLPRSPREALISGWLFFRYLAIGVYVGLATVAAATWWFVYDAE GPHINFYQLRNFLKCSEDNPLFAGIDCEVFESRFPTTMALSVLVTIEMCNALNSVSENQS LLRMPPWMNPWLLVAVAMSMALHFLILLVPPLPLIFQVTPLSGRQWVVVLQISLPVILLD EALKYLSRNHMHGFPVLSSGDVQSHTAARSATQRSNLPPASLVPETTDILRFLTVAPFYP AQCAAAALSAQLGASEPDPTLQSSQWWEGPRWGCVRRNGGTKSQQGGEGKLDMRTKAPEC PQRLHPAQASDIHEQMKDKQAAVRPRNREEPTTDARNCHPKPKPLG >gi568815581r:3797817_4016225|GENSCAN_predicted_CDS_4|4101_bp atggaggctggaagcaggggaggggctgtggccccaggggtcctggcgttcccggcagcc cttgaggaggccagaggtgcctttccgtgcatgagcgcagagctacatcactggcgctta ggacgcctggagcgacgcgtttttcaccaacaggccctgggctcagtgctgagcgcctcg 
gcccgcgaagccacacagcaccaggggaggcggagtccggctttattcccgttgctcagc tctacctccttgaaacaaaacaaaacaaaacaaaacataacaaaacacaagtatggaata tttgcaacccaggaagctacaaagaaaggcttaggggccaagaatgtgttcctctcaggg agttccatccttgactctggggaaatgagcacctctggcaattctgattcaaacacgaca ccaagctgtgttccttccaccactccagagtcgcagcatgaggcccgggaatgcgcagaa ttacccttgaaacctccaggaagagacagcagctcgaggcccagcaactcccagcaccca atagccactctgggaatgaccacgctcacctgccctgcagatcccctgggtccactgggc ctgcgctgggtgtgccccccccggcggagaggcggcctcgacggtgcggacggcgcagac ggccgggcgggcggcatggaggcggcgcatctgctcccggccgccgacgtgctgcgccac ttctcggtgacagccgagggcggcctgagcccggcgcaggtgaccggcgcgcgggagcgc tacggccccaacgggaagtccctgtgggagctggtgctggaacagtttgaggacctcctg gtgcgcatcctgctgctggctgcccttgtctcctttgtcctggcctggttcgaggagggc gaggagaccacgaccgccttcgtggagcccctggtcatcatgctgatcctcgtggccaac gccattgtgggcgtgtggcaggaacgcaacgccgagagtgccatcgaggccctgaaggag tatgagcctgagatgggcaaggtgatccgctcggaccgcaagggcgtgcagaggatccgt gcccgggacatcgtcccaggggacattgtagaagtggcagtgggggacaaagtgcctgct gacctccgcctcatcgagatcaagtccaccacgctgcgagtggaccagtccatcctgacg ggtgaatctgtgtccgtgaccaagcacacagaggccatcccagaccccagagctgtgaac caggacaagaagaacatgctgttttctggcaccaatatcacatcgggcaaagcggtgggt gtggccgtggccaccggcctgcacacggagctgggcaagatccggagccagatggcggca gtcgagcccgagcggacgccgctgcagcgcaagctggacgagtttggacggcagctgtcc cacgccatctctgtgatctgcgtggccgtgtgggtcatcaacatcggccacttcgccgac ccggcccacggtggctcctggctgcgtggcgctgtctactacttcaagatcgccgtggcc ctggcggtggcggccatccccgagggcctcccggctgtcatcactacatgcctggcactg ggcacgcggcgcatggcacgcaagaacgccatcgtgcgaagcctgccgtccgtggagacc ctgggctgcacctcagtcatctgctccgacaagacgggcacgctcaccaccaatcagatg tctgtctgccggatgttcgtggtagccgaggccgatgcgggctcctgccttttgcacgag ttcaccatctcgggtaccacgtatacccccgagggcgaagtgcggcagggggatcagcct gtgcgctgcggccagttcgacgggctggtggagctggcgaccatctgcgccctgtgcaac gactcggctctggactacaacgaggccaagggtgtgtatgagaaggtgggagaggccacg gagacagctctgacttgcctggtggagaagatgaacgtgttcgacaccgacctgcaggct ctgtcccgggtggagcgagctggcgcctgtaacacggtcatcaagcagctgatgcggaag 
gagttcaccctggagttctcccgagaccggaaatccatgtccgtgtactgcacgcccacc cgccctcaccctactggccagggcagcaagatgtttgtgaagggggctcctgagagtgtg atcgagcgctgtagctcagtccgcgtggggagccgcacagcacccctgacccccacctcc agggagcagatcctggcaaagatccgggattggggctcaggctcagacacgctgcgctgc ctggcactggccacccgggacgcgcccccaaggaaggaggacatggagctggacgactgc agcaagtttgtgcagtacgagacggacctgaccttcgtgggctgcgtaggcatgctggac ccgccgcgacctgaggtggctgcctgcatcacacgctgctaccaggcgggcatccgcgtg gtcatgatcacgggggataacaaaggcactgccgtggccatctgccgcaggcttggcatc tttggggacacggaagacgtggcgggcaaggcctacacgggccgcgagtttgatgacctc agccccgagcagcagcgccaggcctgccgcaccgcccgctgcttcgcccgcgtggagccc gcacacaagtcccgcatcgtggagaacctgcagtcctttaacgagatcactgctatgact ggcgatggagtgaacgacgcaccagccctgaagaaagcagagatcggcatcgccatgggc tcaggcacggccgtggccaagtcggcggcagagatggtgctgtcagatgacaactttgcc tccatcgtggctgcggtggaggagggccgggccatctacagcaacatgaagcaattcatc cgctacctcatctcctccaatgttggcgaggtcgtctgcatcttcctcacggcaattctg ggcctgcccgaagccctgatccctgtgcagctgctctgggtgaacctggtgacagacggc ctacctgccacggctctgggcttcaacccgccagacctggacatcatggagaagctgccc cggagcccccgagaagccctcatcagtggctggctcttcttccgatacctggctatcgga gtgtacgtaggcctggccacagtggctgccgccacctggtggtttgtgtatgacgccgag ggacctcacatcaacttctaccagctgaggaacttcctgaagtgctccgaagacaacccg ctctttgccggcatcgactgtgaggtgttcgagtcacgcttccccaccaccatggccttg tccgtgctcgtgaccattgaaatgtgcaatgccctcaacagcgtctcggagaaccagtcg ctgctgcggatgccgccctggatgaacccctggctgctggtggctgtggccatgtccatg gccctgcacttcctcatcctgctcgtgccgcccctgcctctcattttccaggtgacccca ctgagcgggcgccagtgggtggtggtgctccagatatctctgcctgtcatcctgctggat gaggccctcaagtacctgtcccggaaccacatgcacgggttccccgtcctgagctcggga gatgttcagagtcacactgccgcccggtctgccacgcagaggtccaacttgccacccgcg tccctggtacctgagaccaccgacatcctcaggttcctgaccgtggcgcccttctaccca gcccagtgtgcggccgccgcgctgtctgcacagctgggggcctctgagcctgaccccacg ctccagtcctcacagtggtgggagggaccccggtggggctgtgttaggagaaatggaggc acaaaatcacagcagggaggagaagggaagctggacatgcgcacaaaggcccccgagtgt ccacagcggcttcacccagcacaggcctcagatatccatgaacagatgaaggataagcaa 
gctgcagtacgtccacgcaaccgagaggaaccaaccactgatgcacgcaactgccacccg aagcctaaaccgctgggctga >gi568815581r:3797817_4016225|GENSCAN_predicted_peptide_5|246_aa DKVHIPGAIYLSIKFDSQCNTEEGCDELAMSSSSDFQQDRHSFSGSQQKWKDFELPGDTL YYRFTSDMSNTEWGYRFTVTAGHLGRFQTGFEILKQMLSEERVVPHLPLAKIWEWLVGVA CRQTGHQRLKAIHLLLRIVRCCGHSDLCDLALLKPLWQLFTHMEYGLFEDVTQPGILLPL HRALTELFFVTENRAQELGVLQDYLLALTTDDHLLRCAAQALQNIAAISLAINYPNKATR LWNVEC >gi568815581r:3797817_4016225|GENSCAN_predicted_CDS_5|741_bp gataaagttcacattcctggtgccatctacctctcaatcaaattcgactctcagtgcaac acagaggagggctgtgacgagttagccatgtccagcagcagtgacttccagcaagaccga cacagcttcagcgggtctcagcagaagtggaaagattttgaacttccaggagacactctg tattaccgcttcacctccgacatgagcaacaccgagtggggctacagattcaccgtgacg gccggacacctggggcggttccagacaggattcgagattttgaagcagatgttgtcagaa gaaagggtcgtgcctcatctcccattggcaaaaatttgggaatggctggtgggcgtggcc tgtcgccagactggccatcaacgattaaaagccatccacttacttctgaggattgtgcga tgctgcggccacagtgacctgtgtgaccttgcgctgttgaagcccctgtggcagctcttt acccacatggagtacggcctgtttgaggacgtgacgcagcccggcatcctccttcccctg catcgtgccctcactgagctcttcttcgtcaccgagaaccgtgcccaggagcttggcgtg ctgcaggattacctgctggccctaaccacggacgaccaccttctccgctgtgcggcacag gctctgcagaacattgctgccatcagcctggccatcaactacccaaacaaggccacccgc ctctggaatgtggagtgttag