GENSCAN 1.0 Date run: 3-Nov-116 Time: 13:12:02 Sequence gi568815581r:3828663_4064291 : 235629 bp : 51.92% C+G : Isochore 3 (51 - 57 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 162 286 125 2 2 90 113 57 0.903 8.29 1.02 Intr + 5387 5453 67 2 1 55 69 37 0.274 -2.10 1.03 Intr + 6750 6937 188 2 2 67 108 105 0.468 9.41 1.04 Term + 7347 7389 43 0 1 100 48 25 0.739 -3.38 1.05 PlyA + 8494 8499 6 1.05 2.05 PlyA - 9280 9275 6 -0.45 2.04 Term - 9738 9638 101 1 2 44 54 85 0.621 -0.71 2.03 Intr - 11543 11438 106 2 1 111 87 144 0.999 16.89 2.02 Intr - 14489 14424 66 2 0 80 116 56 0.993 7.19 2.01 Init - 17561 17379 183 2 0 108 80 375 0.994 35.79 2.00 Prom - 20847 20808 40 -6.90 3.23 PlyA - 21617 21612 6 1.05 3.22 Term - 23378 23202 177 2 0 32 38 189 0.963 6.30 3.21 Intr - 28616 28512 105 2 0 124 90 28 0.821 7.51 3.20 Intr - 32485 32241 245 2 2 15 64 235 0.669 11.55 3.19 Intr - 32888 32536 353 1 2 57 52 157 0.695 4.52 3.18 Intr - 37606 37246 361 2 1 84 78 150 0.304 8.44 3.17 Intr - 39708 39543 166 0 1 61 21 91 0.485 -0.25 3.16 Intr - 40953 40825 129 0 0 94 83 95 0.984 11.00 3.15 Intr - 41226 41139 88 2 1 121 58 159 0.978 16.67 3.14 Intr - 43965 43892 74 0 2 93 64 131 0.999 9.90 3.13 Intr - 44800 44747 54 1 0 104 113 71 0.985 10.86 3.12 Intr - 47760 47561 200 1 2 106 57 350 0.999 33.29 3.11 Intr - 48900 48826 75 1 0 121 44 35 0.810 2.38 3.10 Intr - 51772 51684 89 0 2 115 74 209 0.849 22.31 3.09 Intr - 53052 52965 88 1 1 50 84 39 0.492 -0.77 3.08 Intr - 53708 53595 114 0 0 28 76 106 0.930 4.32 3.07 Intr - 53902 53866 37 1 1 92 92 47 0.929 3.82 3.06 Intr - 54513 54380 134 1 2 77 75 201 0.511 18.57 3.05 Intr - 54818 54767 52 2 1 105 60 43 0.998 2.17 3.04 Intr - 55302 55222 81 0 0 43 73 109 0.802 5.23 3.03 Intr - 55765 55718 48 1 0 122 97 110 0.995 14.66 3.02 Intr - 57068 56666 403 1 1 105 73 237 0.957 18.90 3.01 Init - 60063 59876 188 0 2 81 77 120 0.962 6.71 3.00 Prom - 62690 62651 40 -3.91 4.15 PlyA - 67950 67945 6 1.05 4.14 Term - 68810 68781 30 2 0 95 41 32 0.634 -2.66 4.13 Intr - 69448 69347 102 1 0 83 95 127 0.918 13.77 4.12 Intr - 69887 69822 66 2 0 72 103 139 0.998 13.39 4.11 Intr - 70362 70272 91 2 1 88 113 201 0.993 23.10 4.10 Intr - 71099 70972 128 2 2 85 39 203 0.999 15.08 4.09 Intr - 74681 74540 142 1 1 90 86 165 0.992 17.36 4.08 Intr - 74969 74889 81 1 0 46 96 106 0.987 6.45 4.07 Intr - 75362 75266 97 0 1 49 34 253 0.957 15.57 4.06 Intr - 75737 75668 70 2 1 63 116 57 0.994 5.25 4.05 Intr - 76267 76196 72 1 0 116 78 143 0.999 16.20 4.04 Intr - 76705 76558 148 0 1 85 99 201 0.999 21.65 4.03 Intr - 82765 82714 52 2 1 113 44 12 0.248 -2.25 4.02 Intr - 84232 84078 155 0 2 86 59 60 0.768 3.13 4.01 Init - 87563 87427 137 2 2 104 100 297 0.969 32.08 4.00 Prom - 88741 88702 40 -6.50 5.27 PlyA - 90019 90014 6 1.05 5.26 Term - 92751 92483 269 1 2 72 53 160 0.433 6.89 5.25 Intr - 96003 95809 195 1 0 66 56 177 0.738 12.21 5.24 Intr - 100118 100001 118 2 1 106 113 151 0.975 19.94 5.23 Intr - 100783 100666 118 0 1 59 76 267 0.999 23.57 5.22 Intr - 101772 101639 134 0 2 73 96 279 0.999 27.15 5.21 Intr - 106615 106530 86 2 2 104 93 113 0.991 13.44 5.20 Intr - 107807 107605 203 1 2 35 105 388 0.999 34.65 5.19 Intr - 108974 108754 221 2 2 65 81 523 0.999 46.93 5.18 Intr - 112644 112309 336 0 0 108 77 660 0.975 63.17 5.17 Intr - 112992 112774 219 0 0 104 90 300 0.973 30.83 5.16 Intr - 114069 113944 126 0 0 94 81 170 0.997 18.28 5.15 Intr - 114860 114729 132 2 0 96 101 178 0.998 21.25 5.14 Intr - 116144 116042 103 1 1 72 64 180 0.992 14.68 5.13 Intr - 116486 116398 89 2 2 91 65 143 0.999 11.57 5.12 Intr - 119193 118729 465 0 0 85 100 940 0.856 88.81 5.11 Intr - 121934 121849 86 0 2 80 84 124 0.999 11.24 5.10 Intr - 122111 122031 81 0 0 65 76 154 0.833 12.01 5.09 Intr - 122727 122589 139 0 1 108 85 343 0.999 36.54 5.08 Intr - 123023 122919 105 2 0 61 75 262 0.983 23.11 5.07 Intr - 124767 124685 83 1 2 74 78 163 0.976 13.75 5.06 Intr - 135688 135512 177 2 0 69 101 270 0.090 26.81 5.05 Intr - 148596 148486 111 1 0 78 97 56 0.048 6.35 5.04 Intr - 152053 151925 129 2 0 55 94 55 0.127 3.97 5.03 Intr - 152789 152653 137 1 2 75 93 -18 0.730 -1.88 5.02 Intr - 153003 152841 163 1 1 106 96 23 0.837 4.45 5.01 Init - 154006 153931 76 1 1 93 46 134 0.986 8.90 5.00 Prom - 169012 168973 40 -2.81 6.27 PlyA - 169164 169159 6 1.05 6.26 Term - 178308 178228 81 0 0 113 41 115 0.915 7.29 6.25 Intr - 180292 180221 72 1 0 121 78 111 0.981 13.50 6.24 Intr - 181095 180942 154 2 1 97 94 255 0.990 27.59 6.23 Intr - 184952 184787 166 0 1 20 115 54 0.978 0.83 6.22 Intr - 185526 185428 99 1 0 85 89 214 0.561 21.78 6.21 Intr - 185853 185685 169 0 1 105 72 203 0.999 20.43 6.20 Intr - 187804 187661 144 1 0 106 99 209 0.998 24.79 6.19 Intr - 189068 188643 426 2 0 56 81 505 0.626 41.16 6.18 Intr - 189309 189174 136 2 1 107 76 77 0.999 9.48 6.17 Intr - 191107 191007 101 1 2 89 94 106 0.995 10.71 6.16 Intr - 192658 192467 192 1 0 82 62 279 0.992 24.91 6.15 Intr - 194166 194047 120 0 0 73 63 94 0.811 6.59 6.14 Intr - 196456 196257 200 2 2 67 90 146 0.998 12.29 6.13 Intr - 203596 203464 133 1 1 91 81 105 0.998 10.82 6.12 Intr - 204340 204166 175 0 1 65 81 231 0.980 20.56 6.11 Intr - 205630 205353 278 1 2 91 68 143 0.906 9.45 6.10 Intr - 213906 213767 140 1 2 46 99 83 0.338 5.89 6.09 Intr - 215712 215562 151 0 1 97 103 55 0.540 8.15 6.08 Intr - 221197 221046 152 2 2 84 -25 167 0.067 5.39 6.07 Intr - 222381 222119 263 2 2 90 57 288 0.799 23.57 6.06 Intr - 223474 223309 166 2 1 141 79 186 0.995 22.53 6.05 Intr - 225533 225395 139 2 1 82 67 179 0.752 15.74 6.04 Intr - 227683 227554 130 0 1 99 69 167 0.999 17.10 6.03 Intr - 229493 229332 162 1 0 81 110 62 0.993 7.31 6.02 Intr - 230628 230509 120 2 0 87 84 84 0.995 8.01 6.01 Init - 231318 231251 68 0 2 89 80 52 0.851 5.00 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 135856 135762 95 1 2 70 90 163 0.840 12.51 S.002 Intr + 148125 148225 101 2 2 100 71 37 0.846 2.61 S.003 Intr + 148257 148396 140 0 2 88 75 125 0.933 11.82 S.004 Term + 149861 150000 140 0 2 106 47 53 0.852 1.43 S.005 Term - 221197 221007 191 2 2 84 43 163 0.876 9.33 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:3828663_4064291|GENSCAN_predicted_peptide_1|140_aa VRDTGRVSPLSNSTEGSNDKVQGSENAAPSLRSWILSFVGKRPLFTIALPVTNKDFLGFP RKEKKEHSETAKRLEKGDQKVATHEAKLSEWAASLCFQHPKGYCMKGLHSHTVTDGQKLL EGNAQHNETEIINTFSSQGC >gi568815581r:3828663_4064291|GENSCAN_predicted_CDS_1|423_bp gtcagagacacaggcagagtcagccctctgagtaacagtacagaaggcagtaatgacaaa gtccaaggctctgaaaatgcagccccaagtctacgcagttggatcctctcctttgttggg aaaaggcctctgttcactattgcactgccggtcactaataaggatttcctggggttccct agaaaagagaagaaagaacattcagaaacggcaaagcgtttagagaagggagaccagaaa gttgcgacacatgaagcaaagttgagcgaatgggcagcaagcctgtgcttccagcacccg aagggctactgtatgaaaggacttcacagtcacacagtaactgacggacagaaattacta gaaggcaatgctcaacacaatgaaacggaaataatcaacaccttctcctcacagggctgt tga >gi568815581r:3828663_4064291|GENSCAN_predicted_peptide_2|151_aa MAAVRGLRVSVKAEAPAGPALGLPSPEAESGVDRGEPEPMEVEEGELEIVPVRRSLKELI PDTSRRYENKAGSFITGIDVTSKEAIEKKEQRAKRFHFRSEVNLAQRNVALDRDMMKKAG RKHTCESLSQLYREGVIALSHLSTSGPESIP >gi568815581r:3828663_4064291|GENSCAN_predicted_CDS_2|456_bp atggcggccgtacggggcctgcgggtgtcggtgaaggcggaggccccggcggggccggcc ctggggctcccgtcccctgaggcggagtccggtgttgaccgtggcgagccggagcccatg gaggtggaggagggcgagctggaaatcgtgcctgtgcggcgctcgctcaaggaactgatc ccggacacgagcagaagatatgaaaacaaggctggcagcttcatcactggaattgatgtc acctccaaggaagcaattgaaaagaaagagcagcgagccaagcgcttccattttcgatcg gaagtaaatcttgcccaaagaaatgtagccttggaccgagacatgatgaagaaagccggg cgcaagcatacctgtgagtctctctcacaactctaccgcgagggcgtcatagctttatca cacctgtcgacctctggtcccgagagcattccttga >gi568815581r:3828663_4064291|GENSCAN_predicted_peptide_3|1086_aa MGSLTSALAAAFRPPAHPSGPQRVPSNHNSSRTTTTHFYGLLSGRHCSERLTGVPSSASH HVRFPTRLRRRTPLTEAMEGGPAVCCQDPRAELVERVAAIDVTHLEEADGGPEPTRNGVD PPPRARAASVIPGSTSRLLPARPSLSARKLSLQERPAGSYLEAQAGPYATGPASHISPRA WRRPTIESHHVAISDAEDCVQLNQYKLQSEIGKVGLTDAYLQGAYGVVRLAYNESEDRHY AMKVLSKKKLLKQYGFPRRPPPRGSQAAQGGPAKQLLPLERVYQEIAILKKLDHVNVVKL IEVLDDPAEDNLYLALQNQAQNIQLDSTNIAKPHSLLPSEQQDSGSTWAARSGRDLGIGC FASQLHLTFLSFLSVFDLLRKGPVMEVPCDKPFSEEQARLYLRDVILGLEYSSWGPLVQM KVLAKEGAHRSMVAGWVHCQKIVHRDIKPSNLLLGDDGHVKIADFGVSNQFEGNDAQLSS TAGTPAFMAPEAISDSGQSFSGKALDVWATGVTLYCFVYGKCPFIDDFILALHRKIKNEP VVFPEEPEISEELKDLILKMLDKNPETRIGVPDIKLHPWVTKNGEEPLPSEEEHCSVVEV TEEEVKNSVRLIPSWTTVRPRINQAVPASPPDPVYQLDGACISPVPTYQLDGACFSPRAH VSVRRRLLLPQSPPWPCGPFMGWSRPPPHVPVASGSSPFQEYAPSCQRPVFLPLPFHAVL IEMGPALRPPWQLLSGNRSRGCVSGSLGPVLLTLPCPPQILVKSMLRKRSFGNPFEPQAR REERSMSAPGNLLVTAQDYLAAKWMELTFLDCVSHSALSGKWTERNQALSRGLKRKHHIV PAHLTLSLDDAPRAEAGEDTRADFDRGPWTRQACGPAHAEITGTQPLLLHRPTHLGPRAA LTLLQRRTNCPGGLCGNCLTLRTSAVSLLRAGVLERVQTQLFRSSLHLLSGISVRLADVA FADRMLIEVDPHRVTQDSDTALETWMDKGFWPQAQPFRRDTASPQSLPLLVKLDPALQGL FSSWKNPEGGSERRNGPALLHSKEDVQEAAGHAHLELEGEARTGNGDAGALSVWTAVELE AMRVMS >gi568815581r:3828663_4064291|GENSCAN_predicted_CDS_3|3261_bp atggggagcctgacgtcagccctcgcggccgccttccgcccgcccgcgcatccatctggg cctcagcgtgtcccgagcaatcacaacagcagccgcacaacaacaactcacttttacggc ctccttagtggcaggcactgttctgagcgccttacgggcgttccctcctcagcatctcac cacgtgcggttcccaacaaggctacgcagaagaacccccttgactgaagcaatggagggg ggtccagctgtctgctgccaggatcctcgggcagagctggtagaacgggtggcagccatc gatgtgactcacttggaggaggcagatggtggcccagagcctactagaaacggtgtggac cccccaccacgggccagagctgcctctgtgatccctggcagtacttcaagactgctccca gcccggcctagcctctcagccaggaagctttccctacaggagcggccagcaggaagctat ctggaggcgcaggctgggccttatgccacggggcctgccagccacatctccccccgggcc tggcggaggcccaccatcgagtcccaccacgtggccatctcagatgcagaggactgcgtg cagctgaaccagtacaagctgcagagtgagattggcaaggtggggctgactgatgcctat ctgcagggtgcctacggtgtggtgaggctggcctacaacgaaagtgaagacagacactat gcaatgaaagtcctttccaaaaagaagttactgaagcagtatggctttccacgtcgccct cccccgagagggtcccaggctgcccagggaggaccagccaagcagctgctgcccctggag cgggtgtaccaggagattgccatcctgaagaagctggaccacgtgaatgtggtcaaactg atcgaggtcctggatgacccagctgaggacaacctctatttggccctgcagaaccaggcc cagaatatccagttagattcaacaaatatcgccaagccccactccctgcttccctctgag cagcaagacagtggatccacgtgggctgcgcgctcagggagggaccttggcatcggctgc tttgccagccagctacaccttaccttcttgtcttttctttcagtgtttgacctcctgaga aaggggcccgtcatggaagtgccctgtgacaagcccttctcggaggagcaagctcgcctc tacctgcgggacgtcatcctgggcctcgagtactcatcctggggtccgttggtccagatg aaggtacttgccaaggagggagcccacaggtcgatggtcgcgggatgggtgcactgccag aagatcgtccacagggacatcaagccatccaacctgctcctgggggatgatgggcacgtg aagatcgccgactttggcgtcagcaaccagtttgaggggaacgacgctcagctgtccagc acggcgggaaccccagcattcatggcccccgaggccatttctgattccggccagagcttc agtgggaaggccttggatgtatgggccactggcgtcacgttgtactgctttgtctatggg aagtgcccattcatcgacgatttcatcctggccctccacaggaagatcaagaatgagccc gtggtgtttcctgaggagccagaaatcagcgaggagctcaaggacctgatcctgaagatg ttagacaagaatcccgagacgagaattggggtgccagacatcaagttgcacccttgggtg accaagaacggggaggagccccttccttcggaggaggagcactgcagcgtggtggaggtg acagaggaggaggttaagaactcagtcaggctcatccccagctggaccacggtgcgcccg cgtatcaatcaggcagtgcctgcgtctcccccagatcctgtgtatcagttagacggtgcc tgcatctcccccgtgcccacgtatcagttagacggcgcctgcttctcccccagagcccac gtatcagttagacggcgcctgcttctcccccagagcccaccctggccttgcggacccttc atgggctggtcccggccccctcctcatgtaccagtggcatccggctcctcaccattccag gaatatgcccccagctgccagcgccccgtgttcttgcctctgccatttcatgctgtgctg attgagatgggacccgcactgcggcccccttggcagctgctctcggggaatcggagcaga ggctgcgtgtctgggagcctgggacctgtgctcctcacgctgccttgtcctcctcagatc ctggtgaagtccatgctgaggaagcgttcctttgggaacccgtttgagccccaagcacgg agggaagagcgatccatgtctgctccaggaaacctactggtcacagcccaagattatttg gcagccaagtggatggaactaactttcctggactgtgtttcgcattcggcgttatctgga aagtggactgaacggaatcaagctctgagcagaggcctgaagcggaagcaccacatcgtc cctgcccatctcactctctcccttgatgatgcccctagagctgaggctggagaagacacc agggctgactttgaccgagggccatggacgcgacaggcctgtggccctgcgcatgctgaa ataactggaacccagcctctcctcctacaccggcctacccatctgggcccaagagctgca ctcacactcctacaacgaaggacaaactgtccaggtggcctctgcggcaattgcctcacc ctgaggacatcagcagtcagcctgctcagagcgggggtgctggagcgcgtgcagacacag ctcttccggagcagccttcaccttctctctgggatcagtgtccggctggccgacgtggca tttgctgaccgaatgctcatagaggttgacccccacagggtcacgcaggactcggacact gccctggaaacatggatggacaagggcttttggccacaggcccagccattccggcgggac actgcctcccctcaatccctgcccctgcttgtcaagcttgaccccgccctccaaggcctg ttcagcagctggaaaaatccagagggagggagcgaacgcaggaatggacccgccttgcta cattcaaaggaagatgtccaggaggcagctggacatgcgcatctggagctcgagggagag gcgagaactggaaatggagacgcgggggccctcagcgtgtggacggccgtggagttggaa gccatgcgtgtgatgagctag >gi568815581r:3828663_4064291|GENSCAN_predicted_peptide_4|456_aa MARRFQEELAAFLFEYDTPRMVLVRNKKVGVIFRLIQLVVLVYVIGPLEITCSEISKWRS ECTNQGEGWIEIIGGRNTSRDRFCSIFSMPGAVPGNRARNTWHSQSFRRLKRSGKWVFLY EKGYQTSSGLISSVSVKLKGLAVTQLPGLGPQVWDVADYVFPAQGDNSFVVMTNFIVTPK QTQGYCAEHPEGGICKEDSGCTPGKAKRKAQGIRTGKCVAFNDTVKTCEIFGWCPVEVDD DIPRPALLREAENFTLFIKNSISFPRFKVNRRNLVEEVNAAHMKTCLFHKTLHPLCPVFQ LGYVVQESGQNFSTLAEKGGVVGITIDWHCDLDWHVRHCRPIYEFHGLYEEKNLSPGFNF RFARHFVENGTNYRHLFKVFGIRFDILVDGKAGKFDIIPTMTTIGSGIGIFGVATVLCDL LLLHILPKRHYYKQKKFKYAEDMGPGAICALRCGSQ >gi568815581r:3828663_4064291|GENSCAN_predicted_CDS_4|1371_bp atggcacggcggttccaggaggagctggccgccttcctcttcgagtatgacaccccccgc atggtgctggtgcgtaataagaaggtgggcgttatcttccgactgatccagctggtggtc ctggtctacgtcatcggacctcttgaaatcacatgttctgagatctccaagtggagatca gagtgcaccaatcagggagaaggatggatagaaattattggggggagaaacacatccagg gaccgtttctgcagcatcttctccatgcctggtgctgtgccggggaacagagcacgtaac acctggcacagccagagcttccgacgtctcaagagaagtgggaagtgggtgtttctctat gagaagggctaccagacctcgagcggcctcatcagcagtgtctctgtgaaactcaagggc ctggccgtgacccagctccctggcctcggcccccaggtctgggatgtggctgactacgtc ttcccagcccagggggacaactccttcgtggtcatgaccaatttcatcgtgaccccgaag cagactcaaggctactgcgcagagcacccagaagggggcatatgcaaggaagacagtggc tgtacccctgggaaggccaagaggaaggcccaaggcatccgcacgggcaagtgtgtggcc ttcaacgacactgtgaagacgtgtgagatctttggctggtgccccgtggaggtggatgac gacatcccgcgccctgcccttctccgagaggccgagaacttcactcttttcatcaagaac agcatcagctttccacgcttcaaggtcaacaggcgcaacctggtggaggaggtgaatgct gcccacatgaagacctgcctctttcacaagaccctgcaccccctgtgcccagtcttccag cttggctacgtggtgcaagagtcaggccagaacttcagcaccctggctgagaagggtgga gtggttggcatcaccatcgactggcactgtgacctggactggcacgtacggcactgcaga cccatctatgagttccatgggctgtacgaagagaaaaatctctccccaggcttcaacttc aggtttgccaggcactttgtggagaacgggaccaactaccgtcacctcttcaaggtgttt gggattcgctttgacatcctggtggacggcaaggccgggaagtttgacatcatccctaca atgaccaccatcggctctggaattggcatctttggggtggccacagttctctgtgacctg ctgctgcttcacatcctgcctaagaggcactactacaagcagaagaagttcaaatacgct gaggacatggggccaggggcgatctgtgctctccgatgtggcagtcagtaa >gi568815581r:3828663_4064291|GENSCAN_predicted_peptide_5|1366_aa MEAGSRGGAVAPGVLAFPAALEEARGAFPCMSAELHHWRLGRLERRVFHQQALGSVLSAS AREATQHQGRRSPALFPLLSSTSLKQNKTKQNITKHKYGIFATQEATKKGLGAKNVFLSG SSILDSGEMSTSGNSDSNTTPSCVPSTTPESQHEARECAELPLKPPGRDSSSRPSNSQHP IATLGMTTLTCPADPLGPLGLRWVCPPRRRGGLDGADGADGRAGGMEAAHLLPAADVLRH FSVTAEGGLSPAQVTGARERYGPNGKSLWELVLEQFEDLLVRILLLAALVSFVLAWFEEG EETTTAFVEPLVIMLILVANAIVGVWQERNAESAIEALKEYEPEMGKVIRSDRKGVQRIR ARDIVPGDIVEVAVGDKVPADLRLIEIKSTTLRVDQSILTGESVSVTKHTEAIPDPRAVN QDKKNMLFSGTNITSGKAVGVAVATGLHTELGKIRSQMAAVEPERTPLQRKLDEFGRQLS HAISVICVAVWVINIGHFADPAHGGSWLRGAVYYFKIAVALAVAAIPEGLPAVITTCLAL GTRRMARKNAIVRSLPSVETLGCTSVICSDKTGTLTTNQMSVCRMFVVAEADAGSCLLHE FTISGTTYTPEGEVRQGDQPVRCGQFDGLVELATICALCNDSALDYNEAKGVYEKVGEAT ETALTCLVEKMNVFDTDLQALSRVERAGACNTVIKQLMRKEFTLEFSRDRKSMSVYCTPT RPHPTGQGSKMFVKGAPESVIERCSSVRVGSRTAPLTPTSREQILAKIRDWGSGSDTLRC LALATRDAPPRKEDMELDDCSKFVQYETDLTFVGCVGMLDPPRPEVAACITRCYQAGIRV VMITGDNKGTAVAICRRLGIFGDTEDVAGKAYTGREFDDLSPEQQRQACRTARCFARVEP AHKSRIVENLQSFNEITAMTGDGVNDAPALKKAEIGIAMGSGTAVAKSAAEMVLSDDNFA SIVAAVEEGRAIYSNMKQFIRYLISSNVGEVVCIFLTAILGLPEALIPVQLLWVNLVTDG LPATALGFNPPDLDIMEKLPRSPREALISGWLFFRYLAIGVYVGLATVAAATWWFVYDAE GPHINFYQLRNFLKCSEDNPLFAGIDCEVFESRFPTTMALSVLVTIEMCNALNSVSENQS LLRMPPWMNPWLLVAVAMSMALHFLILLVPPLPLIFQVTPLSGRQWVVVLQISLPVILLD EALKYLSRNHMHGFPVLSSGDVQSHTAARSATQRSNLPPASLVPETTDILRFLTVAPFYP AQCAAAALSAQLGASEPDPTLQSSQWWEGPRWGCVRRNGGTKSQQGGEGKLDMRTKAPEC PQRLHPAQASDIHEQMKDKQAAVRPRNREEPTTDARNCHPKPKPLG >gi568815581r:3828663_4064291|GENSCAN_predicted_CDS_5|4101_bp atggaggctggaagcaggggaggggctgtggccccaggggtcctggcgttcccggcagcc cttgaggaggccagaggtgcctttccgtgcatgagcgcagagctacatcactggcgctta ggacgcctggagcgacgcgtttttcaccaacaggccctgggctcagtgctgagcgcctcg gcccgcgaagccacacagcaccaggggaggcggagtccggctttattcccgttgctcagc tctacctccttgaaacaaaacaaaacaaaacaaaacataacaaaacacaagtatggaata tttgcaacccaggaagctacaaagaaaggcttaggggccaagaatgtgttcctctcaggg agttccatccttgactctggggaaatgagcacctctggcaattctgattcaaacacgaca ccaagctgtgttccttccaccactccagagtcgcagcatgaggcccgggaatgcgcagaa ttacccttgaaacctccaggaagagacagcagctcgaggcccagcaactcccagcaccca atagccactctgggaatgaccacgctcacctgccctgcagatcccctgggtccactgggc ctgcgctgggtgtgccccccccggcggagaggcggcctcgacggtgcggacggcgcagac ggccgggcgggcggcatggaggcggcgcatctgctcccggccgccgacgtgctgcgccac ttctcggtgacagccgagggcggcctgagcccggcgcaggtgaccggcgcgcgggagcgc tacggccccaacgggaagtccctgtgggagctggtgctggaacagtttgaggacctcctg gtgcgcatcctgctgctggctgcccttgtctcctttgtcctggcctggttcgaggagggc gaggagaccacgaccgccttcgtggagcccctggtcatcatgctgatcctcgtggccaac gccattgtgggcgtgtggcaggaacgcaacgccgagagtgccatcgaggccctgaaggag tatgagcctgagatgggcaaggtgatccgctcggaccgcaagggcgtgcagaggatccgt gcccgggacatcgtcccaggggacattgtagaagtggcagtgggggacaaagtgcctgct gacctccgcctcatcgagatcaagtccaccacgctgcgagtggaccagtccatcctgacg ggtgaatctgtgtccgtgaccaagcacacagaggccatcccagaccccagagctgtgaac caggacaagaagaacatgctgttttctggcaccaatatcacatcgggcaaagcggtgggt gtggccgtggccaccggcctgcacacggagctgggcaagatccggagccagatggcggca gtcgagcccgagcggacgccgctgcagcgcaagctggacgagtttggacggcagctgtcc cacgccatctctgtgatctgcgtggccgtgtgggtcatcaacatcggccacttcgccgac ccggcccacggtggctcctggctgcgtggcgctgtctactacttcaagatcgccgtggcc ctggcggtggcggccatccccgagggcctcccggctgtcatcactacatgcctggcactg ggcacgcggcgcatggcacgcaagaacgccatcgtgcgaagcctgccgtccgtggagacc ctgggctgcacctcagtcatctgctccgacaagacgggcacgctcaccaccaatcagatg tctgtctgccggatgttcgtggtagccgaggccgatgcgggctcctgccttttgcacgag ttcaccatctcgggtaccacgtatacccccgagggcgaagtgcggcagggggatcagcct gtgcgctgcggccagttcgacgggctggtggagctggcgaccatctgcgccctgtgcaac gactcggctctggactacaacgaggccaagggtgtgtatgagaaggtgggagaggccacg gagacagctctgacttgcctggtggagaagatgaacgtgttcgacaccgacctgcaggct ctgtcccgggtggagcgagctggcgcctgtaacacggtcatcaagcagctgatgcggaag gagttcaccctggagttctcccgagaccggaaatccatgtccgtgtactgcacgcccacc cgccctcaccctactggccagggcagcaagatgtttgtgaagggggctcctgagagtgtg atcgagcgctgtagctcagtccgcgtggggagccgcacagcacccctgacccccacctcc agggagcagatcctggcaaagatccgggattggggctcaggctcagacacgctgcgctgc ctggcactggccacccgggacgcgcccccaaggaaggaggacatggagctggacgactgc agcaagtttgtgcagtacgagacggacctgaccttcgtgggctgcgtaggcatgctggac ccgccgcgacctgaggtggctgcctgcatcacacgctgctaccaggcgggcatccgcgtg gtcatgatcacgggggataacaaaggcactgccgtggccatctgccgcaggcttggcatc tttggggacacggaagacgtggcgggcaaggcctacacgggccgcgagtttgatgacctc agccccgagcagcagcgccaggcctgccgcaccgcccgctgcttcgcccgcgtggagccc gcacacaagtcccgcatcgtggagaacctgcagtcctttaacgagatcactgctatgact ggcgatggagtgaacgacgcaccagccctgaagaaagcagagatcggcatcgccatgggc tcaggcacggccgtggccaagtcggcggcagagatggtgctgtcagatgacaactttgcc tccatcgtggctgcggtggaggagggccgggccatctacagcaacatgaagcaattcatc cgctacctcatctcctccaatgttggcgaggtcgtctgcatcttcctcacggcaattctg ggcctgcccgaagccctgatccctgtgcagctgctctgggtgaacctggtgacagacggc ctacctgccacggctctgggcttcaacccgccagacctggacatcatggagaagctgccc cggagcccccgagaagccctcatcagtggctggctcttcttccgatacctggctatcgga gtgtacgtaggcctggccacagtggctgccgccacctggtggtttgtgtatgacgccgag ggacctcacatcaacttctaccagctgaggaacttcctgaagtgctccgaagacaacccg ctctttgccggcatcgactgtgaggtgttcgagtcacgcttccccaccaccatggccttg tccgtgctcgtgaccattgaaatgtgcaatgccctcaacagcgtctcggagaaccagtcg ctgctgcggatgccgccctggatgaacccctggctgctggtggctgtggccatgtccatg gccctgcacttcctcatcctgctcgtgccgcccctgcctctcattttccaggtgacccca ctgagcgggcgccagtgggtggtggtgctccagatatctctgcctgtcatcctgctggat gaggccctcaagtacctgtcccggaaccacatgcacgggttccccgtcctgagctcggga gatgttcagagtcacactgccgcccggtctgccacgcagaggtccaacttgccacccgcg tccctggtacctgagaccaccgacatcctcaggttcctgaccgtggcgcccttctaccca gcccagtgtgcggccgccgcgctgtctgcacagctgggggcctctgagcctgaccccacg ctccagtcctcacagtggtgggagggaccccggtggggctgtgttaggagaaatggaggc acaaaatcacagcagggaggagaagggaagctggacatgcgcacaaaggcccccgagtgt ccacagcggcttcacccagcacaggcctcagatatccatgaacagatgaaggataagcaa gctgcagtacgtccacgcaaccgagaggaaccaaccactgatgcacgcaactgccacccg aagcctaaaccgctgggctga >gi568815581r:3828663_4064291|GENSCAN_predicted_peptide_6|1378_aa MRQRPGLTPTGIEEKAAVGVLPGYFGHLEGCGADLHKEIRDTYYQLVLFLVKAVKGFSSL NDRSLLPALSCVQTALLHLLDMGWEPNDLAFFVDIQLPDLLMKMSQENISVHDSVISQWS EEDELADAKQNSEWMDECQDGMFEAWYEKIAQEDPEKQRKMHMFIARYCDLLNVDISCDG CDEIAPWHRYRCLQCSDMDLCKTCFLGGVKPEGHGDDHEMVNMEFTCDHCQGLIIGRRMN CNVCDDFDLCYGCYAAKKYSYGHLPTHSITAHPMVTIRISDRQRLIQPYIHNYSWLLFAA LALYSAHLASAEDVDGEKLDPQTRSSATTLRSQCMQLVGDCLMKAHQGKGLKALALLGVL PDGDSSLEDQALPVTVPTGASEEQLEKKAVQGAELSEAGNGKRAVHEEIRPVDFKQRNKA DKGVSLSKDPSCQTQISDSPADASPPTGLPDAEDSEVSSQKPIEEKAVTPSPEQVFAECS QKRILGLLAAMLPPLKSGPTVPLIDLEHVLPLMFQVVISNAGHLNETYHLTLGLLGQLII RLLPAEVDAAVIKVLSAKHNLFAAGDSSIVPDGWKTTHLLFSLGAVCLDSRVGLDWACSM AEILRSLNSAPLWRDVIATFTDHCIKQLPFQLKHTNIFTLLVLVGFPQVLCVGTRCVYMD NANEPHNVIILKHFTEKNRAVIVDVKTRKRKTVKDYQLVQKGGGQECGDSRAQLSQYSQH FAFIASHLLQSSMDSHCPEAVEATWVLSLALKGLYKTLKAHGFEEIRATFLQTDLLKLLV KKCSKGTGFSKTWLLRDLEILSIMLYSSKKEINALAEHGDLELDERGDREEEVERPVSSP GDPEQKKLDPLEGLDEPTRICFLMAHDALNAPLHILRAIYELQMKKTDYFFLEVQKRFDG DELTTDERIRSLAQRWQPSKSLRLEEQSAKAVDTDMIILPCLSRPARCDQATAESNPVTQ KLISSTESELQQSYAKQRRSKSAALLHKELNCKSKRAVRDYLFRVNEATAVLYARHVLAS LLAEWPSHVPVSEDILELSGPAHMTYILDMFMQLEEKHEWEKVVMQTELVLTHQVLPLPH RLPPILQKVLQGCREDMLGTMALAACQFMEEPGMEVQVRESKHPYNNNTNFEDKVHIPGA IYLSIKFDSQCNTEEGCDELAMSSSSDFQQDRHSFSGSQQKWKDFELPGDTLYYRFTSDM SNTEWGYRFTVTAGHLGRFQTGFEILKQMLSEERVVPHLPLAKIWEWLVGVACRQTGHQR LKAIHLLLRIVRCCGHSDLCDLALLKPLWQLFTHMEYGLFEDVTQPGILLPLHRALTELF FVTENRAQELGVLQDYLLALTTDDHLLRCAAQALQNIAAISLAINYPNKATRLWNVEC >gi568815581r:3828663_4064291|GENSCAN_predicted_CDS_6|4137_bp atgaggcagcgtcctgggctgacgcccactggtattgaagaaaaggctgctgttggagtc ctacctggttattttggacacctggaaggctgtggtgctgatctacacaaagaaattcga gacacttactatcaacttgttctgtttttggtcaaagcagttaaaggatttagtagccta aatgacaggtccttgctccctgccttatcctgtgttcagacagccctgcttcatcttttg gatatgggctgggaacccaatgatctcgccttctttgttgatattcagttaccagatctc ctcatgaaaatgtcacaggagaatataagtgtccatgacagtgtgatcagccaatggagt gaagaagatgagcttgctgatgccaagcagaattcagaatggatggatgagtgtcaggat ggcatgtttgaggcctggtatgaaaaaatagcccaggaagatccagagaagcagaggaaa atgcacatgttcattgctcgctactgtgacctgttaaatgtggacatctcttgtgatggg tgtgatgagattgccccctggcatcgataccgctgtctgcagtgcagcgacatggatctc tgcaaaacttgcttcctaggtggggtgaagcctgagggccacggagacgaccatgaaatg gtcaacatggagtttacctgtgaccactgccagggtttgatcataggccggaggatgaac tgcaatgtttgcgatgactttgatctttgctacggatgctatgcagcgaagaaatactcc tacggccatttgcctacccacagcatcacggcccacccaatggtaaccattcggatcagt gaccggcagaggctcatccagccatatatccataactactcctggctgctctttgctgcc ctggctctctatagcgcccacctggccagtgcagaggatgtggatggggagaagctggac ccccagacgcgcagcagtgccaccaccctgcggagccagtgcatgcagctcgtcggggac tgtctgatgaaggctcatcagggaaaaggccttaaagctctagctttgctgggtgtattg ccagatggggactcgagcctagaagatcaggccctaccagtcactgtgcccaccggagcg tcagaggagcagctagagaagaaagctgtccagggtgctgagctgtcagaagcaggcaat ggaaagagagctgttcatgaggaaatcagacctgtagatttcaagcagagaaataaggca gataaaggtgtatcattatcgaaggatccttcatgccagacccaaatttcagattcacct gcagatgctagcccacctacaggacttccagatgctgaagattcagaagtgtcatctcag aagcccatagaggaaaaagcagttactccaagccctgagcaagtgtttgctgagtgttcc cagaagaggattttgggattactagcagccatgttacctcccttaaagtcgggccccacg gttcccctgatagacctggagcacgtccttccactcatgtttcaggttgtcatctcaaac gcaggccacctgaatgaaacctaccatctcaccctgggtcttctcggccagttaattatc cgtcttttgccagcagaggtagacgccgcagtgatcaaagtcctctcagccaaacacaac ctgtttgctgcaggggacagttccattgtgccagatggctggaaaaccacccacctgctc tttagcctgggagctgtgtgtctggacagccgggtgggcttggactgggcgtgctccatg gcagagatcctgcggtcactcaacagtgccccactgtggcgtgatgtcattgccaccttc acagaccactgcatcaagcagctgccattccagctgaagcacaccaacatcttcaccctg ctcgtgctggttggcttcccccaggtcctctgtgtgggaacccgctgcgtttatatggat aatgccaatgaaccccataatgtgatcatcttgaagcactttactgagaagaacagggct gtgattgttgatgtcaaaactcggaagaggaaaacagtgaaggactaccagctggtccag aagggaggaggacaagagtgtggtgactctcgggcccagctgagccagtactcccagcac tttgcctttatcgccagtcaccttctgcaaagcagcatggacagccattgtcccgaggca gtagaagcaacttgggtcctgtccctggccctgaaaggattgtataaaacactaaaggct cacggttttgaggagatccgtgctactttccttcagaccgatttgctgaagttgctggtg aaaaagtgcagcaaagggactggctttagtaaaacgtggctcctccgggacctggaaatt ttgtccatcatgctgtactcctcaaaaaaggagatcaacgctttggctgagcacggagac ctagagctggatgagcgaggggaccgagaggaagaggtggaacggccagtcagcagccct ggcgacccagagcagaaaaagctggacccccttgagggcctggatgagcccaccagaata tgtttcttgatggctcatgatgccctcaatgcccctctgcacattctccgggccatatac gaactgcagatgaaaaagaccgattatttcttcctggaggttcagaagaggtttgatggt gatgagctcaccacagatgaaaggatacggtccctggctcagcggtggcagcccagtaag agtctgaggctggaagaacagagcgccaaagctgtggatacagacatgattatcctgcca tgcttgtcccggcctgcacgctgtgaccaagccactgctgaatcgaaccctgtgacccag aagctgatctccagcacagagagcgaactgcagcagagctatgccaagcagcgccgtagc aagagcgccgccctcctgcacaaggagctgaactgcaagagtaagagggctgtccgggac tacctcttccgagtgaacgaggccacagctgtcctgtacgcccgccacgtgcttgcatcc ctgctcgccgagtggcctagccacgtgccagtgagcgaggacatcctggagctcagtggc cctgcccatatgacctacattttggatatgttcatgcagctggaagaaaagcatgagtgg gagaaggtagttatgcagactgagcttgtgctcacccaccaggttcttcctctgccccac aggcttcctccaatcctgcagaaagtgctccagggctgccgagaggacatgctggggacc atggccctggctgcatgccagttcatggaggagccaggaatggaggtgcaagtgagggag tcgaaacacccgtataacaacaacaccaacttcgaggataaagttcacattcctggtgcc atctacctctcaatcaaattcgactctcagtgcaacacagaggagggctgtgacgagtta gccatgtccagcagcagtgacttccagcaagaccgacacagcttcagcgggtctcagcag aagtggaaagattttgaacttccaggagacactctgtattaccgcttcacctccgacatg agcaacaccgagtggggctacagattcaccgtgacggccggacacctggggcggttccag acaggattcgagattttgaagcagatgttgtcagaagaaagggtcgtgcctcatctccca ttggcaaaaatttgggaatggctggtgggcgtggcctgtcgccagactggccatcaacga ttaaaagccatccacttacttctgaggattgtgcgatgctgcggccacagtgacctgtgt gaccttgcgctgttgaagcccctgtggcagctctttacccacatggagtacggcctgttt gaggacgtgacgcagcccggcatcctccttcccctgcatcgtgccctcactgagctcttc ttcgtcaccgagaaccgtgcccaggagcttggcgtgctgcaggattacctgctggcccta accacggacgaccaccttctccgctgtgcggcacaggctctgcagaacattgctgccatc agcctggccatcaactacccaaacaaggccacccgcctctggaatgtggagtgttag