GENSCAN 1.0 Date run: 4-Nov-116 Time: 03:29:25 Sequence gi568815576f:37539788_37755620 : 215833 bp : 54.32% C+G : Isochore 3 (51 - 57 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 7864 7981 118 0 1 76 15 126 0.242 4.43 1.02 Intr + 9262 9365 104 2 2 111 55 27 0.385 2.09 1.03 Intr + 11510 11576 67 1 1 97 95 0 0.418 0.57 1.04 Intr + 14472 14596 125 1 2 98 63 46 0.021 3.91 1.05 Intr + 20667 20801 135 2 0 34 92 92 0.947 5.47 1.06 Intr + 21795 22030 236 2 2 56 68 135 0.547 5.32 1.07 Intr + 22069 22213 145 1 1 86 61 20 0.700 -0.11 1.08 Intr + 24555 24758 204 2 0 63 73 93 0.777 5.22 1.09 Intr + 26530 27025 496 0 1 81 89 575 0.876 50.06 1.10 Term + 28321 29033 713 2 2 129 55 358 0.994 30.70 1.11 PlyA + 29594 29599 6 -1.95 2.06 PlyA - 30354 30349 6 -3.24 2.05 Term - 30625 30476 150 1 0 120 42 87 0.366 5.52 2.04 Intr - 30948 30789 160 2 1 141 100 145 0.990 21.50 2.03 Intr - 32285 32062 224 2 2 56 80 156 0.173 9.05 2.02 Intr - 35002 34865 138 1 0 87 44 116 0.307 8.27 2.01 Init - 40118 40113 6 2 0 110 89 7 0.416 3.45 2.00 Prom - 63719 63680 40 -0.31 3.03 PlyA - 63771 63766 6 1.05 3.02 Term - 69100 68929 172 0 1 74 43 191 0.984 10.81 3.01 Init - 69350 69331 20 2 2 96 80 11 0.970 0.69 3.00 Prom - 70740 70701 40 -7.40 4.00 Prom + 72375 72414 40 -4.91 4.01 Init + 73027 73054 28 0 1 77 77 -9 0.266 -3.26 4.02 Intr + 74403 74487 85 1 1 83 61 276 0.957 23.78 4.03 Intr + 77135 77210 76 2 1 117 93 92 0.999 12.61 4.04 Intr + 78661 78759 99 0 0 115 81 218 0.996 24.61 4.05 Intr + 80451 80574 124 2 1 17 72 254 0.999 17.36 4.06 Intr + 81026 81126 101 0 2 69 100 79 0.996 7.53 4.07 Intr + 81829 81909 81 0 0 120 59 101 0.862 10.73 4.08 Intr + 83540 83680 141 1 0 70 50 248 0.963 20.26 4.09 Intr + 83726 83846 121 1 1 99 71 167 0.438 16.67 4.10 Intr + 85182 85289 108 1 0 110 81 209 0.962 23.16 4.11 Intr + 86010 86162 153 1 0 107 68 127 0.989 13.16 4.12 Intr + 89675 89739 65 0 2 141 91 38 0.961 8.33 4.13 Intr + 90211 90383 173 0 2 111 65 183 0.979 17.56 4.14 Intr + 91116 91312 197 0 2 39 81 175 0.997 11.48 4.15 Intr + 92209 92378 170 2 2 113 56 325 0.999 31.98 4.16 Intr + 92618 92728 111 1 0 90 116 154 0.999 19.48 4.17 Intr + 92814 92913 100 2 1 133 20 195 0.235 17.38 4.18 Intr + 93932 94149 218 0 2 21 -39 160 0.004 -4.65 4.19 Intr + 99263 99483 221 1 2 42 71 77 0.066 -0.97 4.20 Intr + 99982 100059 78 1 0 76 113 113 0.134 11.76 4.21 Intr + 101339 101381 43 2 1 99 98 88 0.895 9.63 4.22 Intr + 101587 101691 105 0 0 120 89 203 0.995 24.51 4.23 Intr + 102752 102828 77 1 2 81 72 66 0.983 3.11 4.24 Intr + 103108 103219 112 1 1 79 83 143 0.999 13.78 4.25 Intr + 103311 103387 77 2 2 103 75 110 0.968 10.01 4.26 Intr + 103857 104001 145 0 1 55 72 194 0.930 15.29 4.27 Intr + 104850 104918 69 2 0 98 67 107 0.969 9.47 4.28 Intr + 105083 105173 91 1 1 93 75 106 0.926 9.87 4.29 Intr + 105578 105723 146 0 2 104 59 147 0.963 13.91 4.30 Intr + 107031 107142 112 2 1 90 51 139 0.999 10.86 4.31 Intr + 107480 107561 82 0 1 138 94 157 0.999 20.49 4.32 Intr + 107654 107734 81 2 0 91 99 116 0.999 12.25 4.33 Intr + 108532 108648 117 1 0 97 72 190 0.999 18.38 4.34 Intr + 110365 110462 98 1 2 72 57 131 0.841 8.55 4.35 Intr + 110755 110938 184 2 1 108 100 42 0.973 6.76 4.36 Intr + 113992 114086 95 1 2 131 64 27 0.869 4.71 4.37 Intr + 116139 116218 80 1 2 108 50 46 0.169 2.57 4.38 Intr + 116455 116515 61 0 1 106 37 25 0.056 -2.00 4.39 Intr + 118868 119569 702 0 0 74 93 1075 0.044 98.67 4.40 Term + 125768 126084 317 0 2 134 52 454 0.984 41.85 4.41 PlyA + 127123 127128 6 1.05 5.00 Prom + 129021 129060 40 -0.61 5.01 Init + 131712 131759 48 2 0 51 77 64 0.598 1.35 5.02 Intr + 133713 133832 120 2 0 41 105 58 0.489 3.99 5.03 Intr + 137199 137278 80 2 2 51 78 86 0.865 2.74 5.04 Intr + 138696 138867 172 0 1 75 64 287 0.952 25.46 5.05 Term + 139816 139962 147 0 0 113 44 362 0.959 32.51 5.06 PlyA + 139985 139990 6 1.05 6.00 Prom + 143578 143617 40 -3.31 6.01 Init + 146606 146688 83 1 2 55 101 149 0.879 13.30 6.02 Intr + 148123 148228 106 1 1 106 85 223 0.589 24.52 6.03 Intr + 148525 148573 49 0 1 96 106 46 0.999 5.94 6.04 Intr + 149063 149205 143 0 2 86 90 264 0.999 26.98 6.05 Intr + 150910 151007 98 0 2 93 101 53 0.969 6.41 6.06 Term + 151387 151549 163 1 1 101 53 127 0.555 8.22 6.07 PlyA + 151955 151960 6 1.05 7.00 Prom + 155849 155888 40 -4.51 7.01 Init + 157250 157302 53 1 2 90 77 63 0.820 4.21 7.02 Term + 157801 157891 91 1 1 110 42 60 0.846 1.09 7.03 PlyA + 158884 158889 6 1.05 8.00 Prom + 159699 159738 40 -5.81 8.01 Init + 161579 161692 114 1 0 82 80 100 0.801 8.77 8.02 Intr + 168913 169087 175 0 1 38 39 92 0.145 -0.67 8.03 Intr + 169654 169799 146 2 2 32 89 107 0.838 5.71 8.04 Intr + 170640 170779 140 2 2 50 90 110 0.993 7.17 8.05 Intr + 173423 173624 202 2 1 108 77 96 0.762 10.31 8.06 Intr + 175976 176147 172 1 1 87 96 -3 0.808 0.43 8.07 Intr + 177454 177589 136 2 1 33 -26 156 0.545 -1.17 8.08 Intr + 178064 178242 179 2 2 53 56 146 0.828 7.98 8.09 Intr + 178296 178723 428 1 2 43 -8 221 0.016 2.08 8.10 Intr + 183892 186716 2825 0 2 81 99 839 0.237 72.78 8.11 Intr + 190733 190834 102 2 0 74 61 60 0.456 1.69 8.12 Intr + 193511 193625 115 2 1 32 99 60 0.521 2.45 8.13 Intr + 194612 195655 1044 1 0 109 110 197 0.325 15.37 8.14 Intr + 201108 201235 128 2 2 38 43 128 0.003 3.28 8.15 Term + 208093 208306 214 1 1 53 43 146 0.148 3.63 8.16 PlyA + 209409 209414 6 1.05 9.00 Prom + 209855 209894 40 -7.79 9.01 Init + 211354 211413 60 0 0 69 97 75 0.980 6.16 9.02 Intr + 211985 212041 57 1 0 129 94 69 0.977 11.17 9.03 Intr + 215314 215403 90 0 0 50 66 216 0.873 16.29 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 200972 200828 145 2 1 96 91 114 0.964 12.87 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815576f:37539788_37755620|GENSCAN_predicted_peptide_1|780_aa MSNAVTIPALTEVNVMGEGGPLNKVMAVTRGTEQQHGREENARLASQHVWAFALQREFTP AQLVFAFDPPNSCGGEILGWQSCPLGGVVSFQSPKGSDSKYLESMAHVPYRCPLTRCSAQ GSAAATTHLPRVHGNCLLPAEPAARAAQTGPATLPGCQPGRAAAAAADDESALVPRPRGS RSQRLDFEEQEKSADRFNPLFQGPEQVWGSCRNHDRDPLSSGGGGRHPAGCPQEVGCVHP SSGLQDETWAPGQHCGLRVTPRPALGLPATTCASTATPVGPIPAPKLVLASFPPLSELRP CAQALTILYRQGSYVSLTNCDKVFSCGPSQRLEYAHAWRRQLEKATFDHRILIPGSSLKV DGSEEPLLWLFTQLHKALAWTSSCEQPELMPGPQGGRGAATMSLGKLSPVGWVSSSQGKR RLTADMISHPLGDFRHTMHVGRGGDVFGDTSFLSNHGGSSGSTHRSPRSFLAKKLQLVRR VGAPPRRMASPPAPSPAPPAISPIIKNAISLPQLNQAAYDSLVVGKLSFDSSPTSSTDGH SSYGLDSGFCTISRLPRSEKPHDRDRDGSFPSEPGLRRSDSLLSFRLDLDLGPSLLSELL GVMSLPEAPAAETPAPAANPPAPTANPTGPAANPPATTANPPAPAANPSAPAATPTGPAA NPPAPAASSTPHGHCPNGVTAGLGPVAEVKSSPVGGGPRGPAGPALGRHWGAGWDGGHHY PEMDARQERVEVLPQARASWESLDEEWRAPQAGSRTPVPSTVQANTFEFADAEEDDEVKV >gi568815576f:37539788_37755620|GENSCAN_predicted_CDS_1|2343_bp atgagcaacgcagtcaccatccctgccctcacggaggttaatgtcatgggggaaggtggt cccctgaacaaggtcatggcagtcaccagaggcacagagcagcagcacggaagggaggag aatgccaggctggccagccagcatgtgtgggcctttgccctacaaagggagttcactccg gcccaactcgtttttgcctttgatcctcccaacagctgtggaggagaaattcttggctgg cagagctgtcctcttgggggagtggtgagcttccagtcaccaaagggatcagactctaag tatcttgaaagtatggcccacgttccttaccgctgccccctcacaaggtgctcagcccag ggctctgctgcagcaactacccatttaccacgtgttcacggaaactgcctactgcccgcg gagcccgcggccagggcggcgcagaccggcccagcgactctcccgggctgccagccggga cgcgcggccgccgccgctgcagacgacgagtccgccctcgtcccgcgcccccggggctcg cggagccagcgtctggactttgaggagcaggagaagtcagccgaccggtttaacccctta ttccagggaccagagcaggtttgggggagctgccgtaaccatgacagggacccgctgtcc agcggtgggggggggcgtcacccagctggatgcccgcaggaagtgggctgtgtgcacccg agcagtggcctccaggatgagacgtgggcccctggccagcactgcggcctgcgggtgacc cccagacccgcccttggcctccccgcgaccacatgtgcctccacagcgacccctgttggc ccgatcccggctccaaaacttgttcttgcgtcattcccaccattgagtgagctcaggcca tgtgcccaggccctgactatcctctataggcaaggctcctacgtcagtttgacaaactgt gataaagtcttttcctgcgggccttctcagagacttgaatatgctcacgcatggaggagg cagctggaaaaagccacttttgaccacaggatccttattccaggaagctccttgaaggtg gatggttcagaggaacctctgctctggctgtttacccagcttcacaaagctttggcatgg actagcagctgtgagcagccagagctgatgcccggcccccaggggggcagaggcgccgcc accatgagcctgggcaagctctcgcctgtgggctgggtgtccagttcacagggaaagagg cggctgactgcagacatgatcagccacccactcggggacttccgccacaccatgcatgtg ggccgtggcggggatgtcttcggggacacgtccttcctcagcaaccacggtggcagctcc gggagcacccatcgctcaccccgcagcttcctggccaagaagctgcagctggtgcggagg gtgggggcgcccccccggaggatggcatctccccctgcaccctccccggctccaccggcc atctcccccatcatcaagaacgccatctccctgccccagctcaaccaggccgcctacgac agcctcgtggttggcaagctcagcttcgacagcagccccaccagctccacggacggccac tccagctacggcctggactctgggttctgcaccatctcccgcctgccccgctcggaaaag ccgcatgaccgagaccgggatggttccttcccctctgagcccgggcttcgccgctctgac tctctcttgtccttccgcctggacctcgaccttgggccctcactcctcagcgagctgcta ggggtcatgagcctcccagaagcccctgcagctgagactccagcccccgctgcaaacccc ccagcccctactgcaaaccccacgggtcctgctgcaaaccccccagccactactgcaaac cccccagcgcctgctgcaaacccctcagcacctgccgcaacccccacgggtcctgctgca aatcccccagcccctgccgcaagctccacaccccatggacactgtcccaatggggtaaca gctgggttgggcccagtggctgaggtgaagtccagcccagtgggagggggtccccgagga cctgctggccctgccctcggcaggcactggggagcaggctgggatggcggccaccactac ccagagatggatgcgcggcaggagcgggtggaggtgctgccccaagcccgggcctcctgg gagagcctggacgaagagtggagggcgccccaggcaggcagcaggaccccagtgcccagc acagtgcaagcaaacacctttgaatttgcggatgctgaggaggatgatgaggtcaaggtg tga >gi568815576f:37539788_37755620|GENSCAN_predicted_peptide_2|225_aa MTRGGGCQREGTEGPPMATAPLTSGKVPQSNELIQGISDHAFSGNKKSLGGLLRPHVGRG IETCSWVGMRWMKLLGKCGIDFLWGLEWNVSLLAQGELEVKNMDMKPGSTLKITGSIADG TDGFVINLGQGTDKLNLHFNPRFSESTIVCNSLDGSNWGQEQREDHLCFSPGSEVKFTVT FESDKFKVKLPDGHELTFPNRLGHSHLSYLSVRGGFNMSSFKLKE >gi568815576f:37539788_37755620|GENSCAN_predicted_CDS_2|678_bp atgacgagaggaggcggctgccagcgagagggcactgagggtcctcccatggccactgcc cccttgacttctggcaaagtgccccagtccaatgagctcattcagggcatctcagatcat gctttttctggaaataaaaagtcacttggtggtctcctgcgtcctcacgtgggcaggggg attgagacctgcagctgggttggcatgaggtggatgaagctgctgggcaagtgtgggatt gattttctgtggggactcgagtggaatgtttctctgttggcccagggggaacttgaggtt aagaacatggacatgaagccggggtcaaccctgaagatcacaggcagcatcgccgatggc actgatggctttgtaattaatctgggccaggggacagacaagctgaacctgcatttcaac cctcgcttcagcgaatccaccattgtctgcaactcattggacggcagcaactgggggcaa gaacaacgggaagatcacctgtgcttcagcccagggtcagaggtcaagttcacagtgacc tttgagagtgacaaattcaaggtgaagctgccagatgggcacgagctgacttttcccaac aggctgggtcacagccacctgagctacctgagcgtaaggggcgggttcaacatgtcctct ttcaagttaaaagaataa >gi568815576f:37539788_37755620|GENSCAN_predicted_peptide_3|63_aa MAQGSDRVSGSIAGSIRHRPPASAPPPRPPQLSRDLLGTALCRLKGQRPNGRARHHREAN ELA >gi568815576f:37539788_37755620|GENSCAN_predicted_CDS_3|192_bp atggcccagggatccgacagagtctccggctccatcgcgggctccatccgccaccggccc ccagcctcggcaccgcccccccgccccccgcagctctcgcgggacctcctgggcaccgcc ctctgccgattaaaggggcaacgtccgaacgggcgcgctcggcaccatagagaggccaac gagctcgcctag >gi568815576f:37539788_37755620|GENSCAN_predicted_peptide_4|1747_aa MGKWRFREAYRATNPLNKELDWASINGFCEQLNEDFEGPPLATRLLAHKIQSPQEWEAIQ ALTVLETCMKSCGKRFHDEVGKFRFLNELIKVVSPKYLGSRTSEKVKNKILELLYSWTVG LPEEVKIAEAYQMLKKQGIVKSDPKLPDDTTFPLPPPRPKNVIFEDEEKSKMLARLLKSS HPEDLRAANKLIKEMVQEDQKRMEKISKRVNAIEEVNNNVKLLTEMVMSHSQGGAAAGSS EDLMKPTRTLTRPSCCPQELYQRCERMRPTLFRLASDTEDNDEALAEILQANDNLTQVIN LYKQLVRGEEVNGDATAGSIPGSTSALLDLSGLDLPPAGTTYPAMPTRPGEQASPEQPSA SVSLLDDELMSLGLSDPTPPSGPSLDGTGWNSFQSSDATEPPAPALAQAPSMESRPPAQT SLPASSGLDDLDLLGKTLLQQSLPPESQQVRWEKQQPTPRLTLRDLQNKSSSCSSPSSSA TSLLHTVSPEPPRPPQQPVPTELSLASITVPLESIKPSNILPVTVYDQHGFRILFHFARD PLPGRSDVLVVVVSMLSTAPQPIRNIVFQSAVPKVMKVKLQPPSGTELPAFNPIVHPSAI TQVLLLANPQKEKVRLRYKLTFTMGDQTYNEMGDVDQFPPPETWGPAKGPVASAESPERA LGCGGPGPSQLEVGDRQGLAGAPAATDTEGEAPGCSRLPPAVLSPPHNLDAAAPQRENGQ AQDSWPIRPLETASPEFQGLVHGHTAARGPRKPKLPASHPVVGGLVPGGQRSSAEESARQ YPRQAQRGQDSPQLAPKMMKRQLHRMRQLAQTGSLGRTPETAEFLGEDLLQVEQRLEPAK RAAHNIHKRLQACLQGQSGADMDKRVKKLPLMALSTTMAESFKELDPDSSMGKALEMSCA IQNQLARILAEFEMTLERDVLQPLSRLSEEELPAILKHKKSLQKLVSDWNTLKSRLSQAT KNSGSSQGLGGSPGSHSHTTMANKVETLKEEEEELKRKVEQCRDEYLADLYHFVTKEDSY ANYFIRLLEIQADYHRRSLSSLDTALAELRENHGQADHSPSMTATHFPRVYGVSLATHLQ ELGREIALPIEACVMMLLSEGMKEEGLFRLAAGASVLKRLKQTMASDPHSLEEFCSDPHA VAGALKSYLRELPEPLMTFDLYDDWMRAASLKEPGARLQALQEVCSRLPPENLSNLRYLM KFLARLAEEQEVNKMTPSNIAIVLGPNLLWPPEKEGDQAQLDAASVSSIQVVGVVEALIQ SADTLFPGDINFNVSGLFSAVTLQDTVSDRLASEELPSTAVPTPATTPAPAPAPAPAPAP ALASAATKERTESEVPPRPASPKVTRSPPETAAPVEDMARRSPRGATGRKERFACSYGTD SSLADMFFEIPLPLSSEETEAQTEVTCQGHAVAPPPGPSSAPIGGCVERARESAPCPRRE RGAGGRRPAGCMARCERLRGAALRDVLGRAQGVLFDCDGVLWNGERAVPGAPELLERLAR AGKAALFVSNNSRRARPELALRFARLGFGGLRAEQLFSSALCAARLLRQRLPGPPDAPGA VFVLGGEGLRAELRAAGLRLAGDPSAGDGAAPRVRAVLVGYDEHFSFAKLREACAHLRDP ECLLVATDRDPWHPLSDGSRTPGTGSLAAAVETASGRQALVVGKPSPYMFECITENFSID PARTLMVGDRLETDILFGHRCGMTTVLTLTGVSRLEEAQAYLAAGQHDLVPHYYVESIAD LTEGLED >gi568815576f:37539788_37755620|GENSCAN_predicted_CDS_4|5244_bp atggggaaatggaggttcagggaagcatatagagccacgaaccccctgaacaaggagctc gactgggccagcatcaacggcttctgcgagcagctcaacgaggactttgaggggcctcca ctcgccacccggctgctggcccacaagatccagtccccacaggagtgggaggcgatccag gccttgacggtgctggaaacatgcatgaagagctgcggcaagcggttccacgacgaagtg ggcaagttccgctttctcaacgagctcatcaaggtcgtgtctcccaagtatctgggctct cggacatcggagaaggtgaagaacaagatcttggagctcctctacagctggacagtgggc ctgcccgaggaggtgaaaatcgcagaggcctaccagatgctaaagaagcaggggattgta aagtccgaccccaagcttccagatgacactacctttccccttcctcctccacggccgaag aatgtgatctttgaagatgaggagaaatccaagatgctggcccgcctgctgaagagctcc catcccgaagacctccgcgcagccaataagctcatcaaagagatggtgcaggaggaccag aagcggatggagaagatctcgaagagggtgaatgccatcgaggaggtgaacaacaatgtg aaactgctcacggagatggtgatgagccacagccagggcggcgcagcagctggcagcagc gaggacctcatgaagcccacgcggaccctgacccgcccatcctgctgccctcaggaactg taccagcgctgtgagcggatgcggcccacgctcttccgactggcgagtgacacagaggac aatgatgaggccttagcggagatcctgcaggccaatgacaacctcacccaggtgatcaac ctgtataagcagctggtgcggggtgaggaggtcaacggtgatgccacagccggctccatc cctgggagcacctcggccctgctggatctctcaggcctggatctcccgcctgcgggcacc acctacccagctatgcccacccgccctggcgagcaggccagccctgagcagcccagtgcc tcagtttccctgcttgacgacgagctcatgtctctgggcctcagtgaccccacaccccct tcaggcccaagcctggatggtaccggatggaacagcttccagtcgtcggatgccactgag cccccagcccctgctctggcccaggcccccagtatggaaagccgacccccagcgcagaca tccctgccagcaagcagcggtctggacgacctagacctcctggggaagaccctcctgcag cagtcgctgcccccggaatcccagcaagtgcggtgggagaagcagcagccaaccccccgg ctcacactccgggacctgcagaataagagcagcagctgcagctcccccagctccagcgcc accagccttctccacaccgtgtccccagagccccccaggcctccgcagcagcccgtacca accgagctctcactggccagcatcactgtgcccctggagtccatcaaacccagcaacatc ctgcccgtgactgtgtatgaccagcacggcttccgcatcctcttccattttgcccgggac ccactgccagggcgctccgacgtgctggtggtggtggtttccatgctgagcaccgccccc cagcccatccgcaacatcgtgttccagtcagctgtccccaaggttatgaaggtgaagctg cagccaccctcgggcacggagctgccagcttttaaccccatcgtccacccctcagcaatc acccaggtcctgctgcttgccaacccccagaaggagaaggttcgcctccgctacaagctc accttcaccatgggtgaccagacctacaacgagatgggggatgtggaccagttcccccca cctgaaacctggggccctgcaaaggggcctgtggccagtgctgagtcaccagagagggcg ctgggctgtggcggaccaggaccgtcccagctggaagtgggcgaccgccagggcctggca ggagccccagctgctacagacaccgagggggaggcccctggctgctcacgacttcctcct gctgtgctcagccctccacacaacctcgatgctgcagcaccccagagggaaaatgggcag gcccaggacagctggcccatcagaccattagaaacagcgagtccggagttccaggggctt gtccacggccacacagcagcccgtggccccaggaagccaaagctcccagccagtcatcca gtggtggggggtttagttccagggggccagaggtcctctgcggaagagagtgcaaggcag tatccgcggcaggcccagagaggccaggacagcccccagctcgcccccaagatgatgaag aggcagctgcaccgcatgcggcagctggcccagacgggcagcttgggacgcaccccggag accgctgagttcctgggtgaggacctgctgcaggtagaacagcggctggagccggccaag cgggcagcccacaacatccacaagcggctgcaggcctgtctgcagggccagagcggggca gacatggacaagcgggtgaagaagcttcccctcatggctctgtccaccacgatggctgag agcttcaaggagctggaccctgattccagcatggggaaggccttggagatgagctgtgcc atccagaatcagctggcccgcatcctggccgagtttgagatgaccctggagagggacgtc ctgcagccactcagcaggctgagtgaggaggagctgccagccatcctcaaacacaagaaa agcctccagaagctcgtgtccgactggaacacactcaagagcaggctcagtcaggcaacc aagaattcaggcagcagtcaaggcctaggaggcagcccgggtagtcacagccatacgacc atggccaacaaggtggagacgctgaaggaggaggaggaggagctgaagaggaaagtggag caatgcagggacgagtacttggctgacctgtaccactttgttaccaaggaggactcctat gccaactacttcattcgtctcctggagattcaggccgattaccatcgcaggtcactgagc tcgctggacacagccctggctgagctgagggagaaccacggccaagcagaccactcccct tcgatgacagccacccacttccccagggtgtatggggtgtcgctggcaacccacctgcaa gagctgggccgggagattgccctgcccatcgaggcctgcgtcatgatgctgctttctgag ggcatgaaggaagagggtctcttccgtctggctgctggggcctcggtgctgaagcgtctc aagcagacaatggcctcggacccccacagcctggaggagttctgctccgacccgcacgct gtggcaggtgccctcaagtcctatctgcgggagctgccagagcctctgatgaccttcgac ctctatgatgactggatgagggcagccagcctgaaggagccaggggcccggctgcaggcc ctccaagaggtgtgcagccgcctaccccccgagaacctcagcaacctcaggtacctgatg aagttcctggcacggctggccgaggagcaggaggtgaacaagatgacacccagcaacatc gccatagtcctgggacccaacttgctgtggccacctgagaaagaaggggaccaggcccag ctggatgcagcctccgtgtcttccatccaggtggtgggcgtcgtcgaggcgctgatccag agcgcagacaccctcttccctggagacatcaacttcaacgtgtcaggcctcttctcagct gttaccctccaggacacagtcagtgacaggctggcctctgaggaacttccgtccactgcc gtgcccaccccagccaccaccccggctccggctccggctccagctccagctccggcccca gccttggcttcagcagctaccaaggaaaggacagagtctgaggtgcctcccagaccagcc tcccccaaggtcaccaggagtcccccggagacagctgccccagtggaggacatggctcgg aggagtcctaggggagccaccggaaggaaggagaggtttgcctgctcctacgggactgat tcttctcttgccgacatgttttttgaaataccattacctcttagcagtgaggagactgag gcccagacagaagtgacctgccaaggccacgccgtcgccccgccccccggtccttccagc gcgccaattggcggctgcgtggaacgtgccagggagagcgcgccgtgcccgcggagagag cgcggcgcgggaggccggcggccggccggctgcatggcgcgctgcgagaggctgcgcgga gcggccctgcgcgacgtgctgggccgggcgcagggggtcctgttcgactgtgacggggtg ctgtggaacggcgagcgcgccgtgccgggcgccccggagctgctggagcggctggcgcgg gccggcaaggcggctctgtttgtgagcaacaacagccggcgcgcgcggcccgagctggcc ctgcgcttcgcgcgcctcggcttcggggggctgcgcgccgagcagctcttcagctccgcg ctgtgcgccgcgcgcctgctgcgccagcgcctgcccgggcctccggacgcgccgggcgcc gtgttcgtgctgggcggcgaggggctgcgcgccgagctgcgcgccgcggggctgcgcctg gccggggacccgagcgcgggggacggcgcggccccgcgcgtgcgcgccgtgcttgtgggc tacgacgagcacttctccttcgccaagctgagggaggcgtgcgcgcacctgcgcgacccc gagtgcctactcgtggccaccgaccgtgacccatggcacccgctgagcgacggcagccgg acccctggcaccgggagcctggccgctgcagtggagacagcctcgggacgccaggccctg gtggtgggcaagcccagcccctacatgttcgagtgcatcacggagaacttcagcatcgac cccgcacgcacgcttatggtgggtgaccgcctggagaccgacatcctctttggccaccgc tgcggcatgaccactgtgctcacgctcacaggagtctcccgcctagaagaggcccaggcc tacctagcggccggccagcacgacctcgtgccccattactatgtggagagcatcgcagac ttgacagaggggttggaggactga >gi568815576f:37539788_37755620|GENSCAN_predicted_peptide_5|188_aa MAPLWAALLQHRQLKEQFQKGDGQVEKEVPVTTPQHAPHFAQCSPRPFLHQVPECKGLVA SNLNLKPGECLRVRGEVAPDAKSFVLNLGKDSNNLCLHFNPRFNAHGDANTIVCNSKDGG AWGTEQREAVFPFQPGSVAEVCITFDQANLTVKLPDGYEFKFPNRLNLEAINYMAADGDF KIKCVAFD >gi568815576f:37539788_37755620|GENSCAN_predicted_CDS_5|567_bp atggcgcctctgtgggccgccctccttcagcaccgccagctgaaggagcagtttcagaag ggggacggccaggtggagaaggaggtccccgtgacaaccccccagcatgcccctcatttt gcccagtgctccccacgccccttcctccaccaggttcctgagtgtaagggtctggtcgcc agcaacctgaatctcaaacctggagagtgccttcgagtgcgaggcgaggtggctcctgac gctaagagcttcgtgctgaacctgggcaaagacagcaacaacctgtgcctgcacttcaac cctcgcttcaacgcccacggcgacgccaacaccatcgtgtgcaacagcaaggacggcggg gcctgggggaccgagcagcgggaggctgtctttcccttccagcctggaagtgttgcagag gtgtgcatcaccttcgaccaggccaacctgaccgtcaagctgccagatggatacgaattc aagttccccaaccgcctcaacctggaggccatcaactacatggcagctgacggtgacttc aagatcaaatgtgtggcctttgactga >gi568815576f:37539788_37755620|GENSCAN_predicted_peptide_6|213_aa MGRNKKKKRDGDDRRPRLVLSFDEEKRREYLTGFHKRKVERKKAAIEEIKQRLKEEQRKL REERHQEYLKMLAEREEALEEADELDRLVTAKTESVQYDHPNHTVTVTTISDLDLSGARL LGLTPPEGGAGDRSEEEASSTEKPTKALPRKSRDPLLSQRISSLTASLHAHSRKKVKRKH PRRAQDSKKPPRAPRTSKAQRRRLTGKARHSGE >gi568815576f:37539788_37755620|GENSCAN_predicted_CDS_6|642_bp atgggccgcaacaagaagaagaagcgagatggtgacgaccggcggccgaggctcgttctt agcttcgacgaggagaagaggcgggagtacctgacaggcttccacaagcggaaggtcgag cgaaagaaggcagccattgaggagattaagcagcggctgaaagaggagcagaggaagctt cgggaggagcgccaccaggaatacttgaagatgctggcagagagagaagaggctctggag gaggcagatgagctggaccggttggtgacagcaaagacggagtcggtgcagtatgaccac cccaaccacacagtcaccgtgaccaccatcagtgacctggacctctcgggggcccggctg ctcgggctgaccccacctgagggaggggctggagacaggtctgaggaggaggcgtcatcc acggagaaaccaaccaaagccttgcccaggaagtccagagaccccctgctctctcagcgg atctcctccctcacagcatcactacatgcacacagccgcaaaaaggtcaagaggaaacat ccccgacgggcccaggactccaaaaagcccccaagggcccctcgtaccagcaaggcccag cgccgccgtctcacaggcaaagcacggcacagcggggagtga >gi568815576f:37539788_37755620|GENSCAN_predicted_peptide_7|47_aa MTLALPPRQPLCVETQPRSDDGGALALAVPGEEVKFLSSPPRLATKA >gi568815576f:37539788_37755620|GENSCAN_predicted_CDS_7|144_bp atgaccctggcactcccaccccggcagcccctgtgcgtggaaacccagccaaggtctgat gatggaggagccttggccctggctgtcccaggggaggaggtgaaattcctcagctctcca ccaagattggccacaaaagcctga >gi568815576f:37539788_37755620|GENSCAN_predicted_peptide_8|2039_aa MEEVPGDALCEHFEANILTQNRCQNCFHPEEAHGARYQPPFPRGENNAPKGEVIGLRSPA GNRRSQDGARGPSEARASADPPDRKSELGTDRGAHSSFHCLGGCCHHSPESGPGRAAVWG PEPEPPGDEGADSRQPPPPPEPAAQELRSPSGAEVPYCDLPRCPPAPEDPLSASTSGCQS VVDPGLRPGPKRGPSPSAGLPEEGPTAAPRSRSRELEAVPYLEGLTTSLCGSCNEDPGSD PTSSPDSATPDDTSNSSSVDWDTVERQEEEAPSWDELAVMIPRRPREGPRADSSQRAPSL LTRSPVGGDAAGQKKEGVKLQTFVVSVTAHKGSVDPKSEQQQDLLQRAKEQSFHSMEEDP SGPWVVDGTGRCGAGAALIGEARAAQEPTEAGGSSGMAGCRSRALPGGKAAKARREIQRS AGAKPLIARGRQGQPAAPSAGPTKPTPTRNSSWPTSAARSLGSRSRLSLHTSVQAEGASS GLGQPRKGLPQCSGGLKGSSSAAKVGAQAEEAPRVSEGCEGRQHAVTSQYHPSDSASITE RKAPLGWGEREDTTFTKRLLSIPRRENPRTPCVQQDDPRASSPNRTTQRENSRTSCAQRD NPKASRTSSPNRATRDNPRTSCAQRDNPRASSPSRATRDNPTTSCAQRDNPRASRTSSPN RATRDNPRTSCAQRDNPRASSPSRATRDNPTTSCAQRDNPRASRTSSPNRATRDNPRTSC AQRDNPRASSPNRAARDNPTTSCAQRDNPRASRTSSPNRATRDNPRTSCAQRDNPRASSP NRATRDNPTTSCAQRDNPRASRTSSPNRATRDNPRTSCAQRDNPRASSPNRTTQQDSPRT SCARRDDPRASSPNRTIQQENPRTSCALRDNPRASSPSRTIQQENPRTSCAQRDDPRASS PNRTTQQENPRTSCARRDNPRASSRNRTIQRDNPRTSCAQRDNPRASSPNRTIQQENLRT SCTRQDNPRTSSPNRATRDNPRTSCAQRDNLRASSPIRATQQDNPRTCIQQNIPRSSSTQ QDNPKTSCTKRDNLRPTCTQRDRTQSFSFQRDNPGTSSSQCCTQKENLRPSSPHRSTQWN NPRNSSPHRTNKDIPWASFPLRPTQSDGPRTSSPSRSKQSEVPWASIALRPTQGDRPQTS SPSRPAQHDPPQSSFGPTQYNLPSRATSSSHNPGHQSTSRTSSPVYPAAYGAPLTSPEPS QPPCAVCIGHRDAPRASSPPRYLQHDPFPFFPEPRAPESEPPHHEPPYIPPAVCIGHRDA PRASSPPRHTQFDPFPFLPDTSDAEHQCQSPQHEPLQLPAPVCIGYRDAPRASSPPRQAP EPSLLFQDLPRASTESLVPSMDSLHECPHIPTPVCIGHRDAPSFSSPPRQAPEPSLFFQD PPGTSMESLAPSTDSLHGSPVLIPQVCIGHRDAPRASSPPRHPPSDLAFLAPSPSPGSSG GSRGSAPPGETRHNLEREEYTVLADLPPPRRLAQRQPGPQAQCSSGGRTHSPGRAEVERL FGQERREEQPTGSRLGSCFIEAPIPQFGGKFTLRPSALTEKSEAAGAFQAQDEGRSQQPS QGQSQLLRRQSSPAPSRQVTMLPAKQAELTRRSQAEPPHPWSPEKRPEGDRQLQGSPLPP RTSARTPERELRTQRPLESGQAGPRQPLGVWQSQEEPPGSQGPHRHLERSWSSQEGGLGP GGWWGCGEPSLGAAKAPEGAWGGTSREYKESWGQPEAWEEKPTHELPRELGKRSPLTSPP ENWGGPAESSQSWHSGTPTAVGWGAEGACPYPRGSERRPELDWRDLLGLLRAPGEGVWAR VPSLDWEGLLELLQARLPRKDPAGHRDDLARALGPELGPPGTNDVPEQESHSQPEGWAEA TPVNGHSPALQSQSPVQLPSPACTSTQWPKIKVTRGPATATLAGLEQTGPLGSRSTAKGP SLPELQADKRPAEGKAGSPLKGRLVTSWRMPGDRPTLFNPFLLSLGVLSCLSWGQHVLSK GKAAGSSVWGAWKMDSTSLLVAAAFLREVSSCDYRALSGGDGLARRQQCGQKRVVMFEK >gi568815576f:37539788_37755620|GENSCAN_predicted_CDS_8|6120_bp atggaggaggtgcctggggatgccctgtgtgaacactttgaggccaacatacttacccag aaccgctgtcaaaactgcttccaccctgaggaggcccatggagcaagataccagcctcca tttccgagaggagaaaacaatgctccgaaaggggaagtcatcggcctgaggtcaccagcc gggaaccggaggagccaggatggggcccgaggtccatctgaggccagagccagtgctgac ccgccagacaggaaatcagagctggggactgaccgtggggcccactcaagtttccactgc ctcggcggctgctgccaccacagcccggagtcggggcctgggagggcagcagtgtggggg cctgagccggagccccccggggacgagggtgctgacagtcgacagccaccaccaccacca gagcccgcagcccaggagctcaggagcccttcaggtgctgaggtgccctactgcgacctg cctcgatgtccacctgcccctgaggacccactcagcgcctcaacctccggctgccagtct gtggtggacccaggcctcaggccagggcccaagaggggcccatccccctcagcagggctc ccagaagagggtcccacagctgcccccaggagcaggagccgggagcttgaggcagtaccc tatctggagggcctgaccacttccttgtgtggcagctgcaacgaggaccccggctctgac cccacctccagccctgactccgccacccctgatgataccagcaactcgtcctctgtggac tgggacactgttgagaggcaggaggaggaggcccccagctgggacgagctcgcagtgatg atcccgaggaggcctcgggaggggccgagagctgacagctcccaaagggctccgtctctc ctcaccaggtcccctgtgggaggagatgctgcaggccagaaaaaggagggagtgaagctg cagaccttcgtggtgagcgttacagctcataaaggcagtgtggacccaaagagtgagcag cagcaagatttattgcaaagagcgaaagaacaaagcttccacagcatggaagaggacccg agcggcccttgggtggttgatgggactgggcgctgtggagcaggggcggcgctcatcggg gaggctcgggctgcacaggagcccacggaggcggggggaagctcaggcatggcgggctgc aggtcccgagccctgcccggtgggaaggcagctaaggcccggcgagaaatccagcgcagc gctggtgctaagcccctcattgcccggggccggcagggccagccggccgctccgagtgcg gggcccaccaagcccacgcccacccggaactccagctggcccacaagcgccgcgcgcagc ctcggttcccgctcgcgcctctccctccacacctccgtgcaagctgagggagccagctcc ggcctcggccagcccaggaaggggctcccacagtgcagcggtggcctgaagggctcctca agtgccgccaaagtgggagcccaggcagaggaggcgccgagagtgagcgagggctgtgag ggccgccagcatgctgtcacctctcagtatcacccctcagattctgcatcgattactgag agaaaggcacccttaggctggggagagcgggaagataccactttcaccaagaggctgctc agcatccccagacgggaaaaccccaggacaccctgtgtccagcaggacgatcccagagcc tcctctcccaacagaaccactcaacgagagaattccagaacatcctgtgcccagcgggac aatcccaaagcctccagaacctcctctcccaatagagccacacgagacaaccccagaaca tcctgcgcccagcgggacaatcccagagcctcctctcccagtagagctacacgagacaac cccacaacatcctgtgcccagcgggacaatcccagagcctccagaacctcctctcccaat agagccacacgagacaaccccagaacatcctgtgcccagcgggacaatcccagagcctcc tctcccagtagagctacacgagacaaccccacaacatcctgtgcccagcgggacaatccc agagcctccagaacctcctctcccaatagagccacacgagacaaccccagaacatcctgc gcccagcgggacaatcccagagcctcctctcccaatagagctgcacgagacaaccccaca acatcctgtgcccagcgggacaatcccagagcctccagaacctcctctcccaatagagcc acacgagacaaccccagaacatcctgtgcccagcgggacaatcccagagcctcctctccc aatagagctacacgagacaaccccacaacatcctgtgcccagcgggacaatcccagagcc tccagaacctcctctcccaatagagccacacgagataaccccagaacatcctgtgcccag cgggacaatcccagagcctcctctcccaacagaaccacccaacaagacagccccagaaca tcctgtgcccgacgggacgatcccagagcctcctctcctaacagaaccatccaacaagag aaccccagaacatcctgtgccctacgggacaatcccagagcctcctctcccagcagaacc atccaacaagagaaccccagaacatcctgtgcccaacgggacgatcccagagcctcctct cctaacagaaccacccaacaagagaaccccagaacatcctgtgcccgacgggacaatccc agagcctcctctcgcaacagaaccatccagcgagacaaccccagaacatcctgtgcccag cgggacaatcccagagcctcctctcctaacagaaccatccaacaagagaacctcagaaca tcctgtacccgacaggacaatcccaggacctcctctcccaatagagccacacgagacaac cccagaacatcctgtgcccagcgggacaatctcagagcctcctctcccatcagagccacc caacaggacaaccccagaacttgtattcaacagaacatccccagatcatcttctacccaa caagacaaccctaaaacctcttgtaccaaacgagataacctcagacccacttgtacacag cgggaccgcacacagtccttttcctttcaacgagacaaccctggaacctcctcatctcaa tgctgcacccaaaaggagaatctgagaccatcatctccccaccgctccactcaatggaac aatcccaggaattcatctccccatcgtactaacaaagacatcccctgggcctcgtttccc ctccggccaactcagagtgatggtccccgaacctcttccccatctcgctccaagcaaagc gaggttccctgggcatccatcgccctccggccaacccaaggtgacaggcctcagacatcc tctcccagcaggccagcccagcatgacccaccccagtcctcctttggccccacccagtac aacttgccatcccgggccacctcttcctcccataacccaggccaccagagcacctcccga acttcctcacctgtgtaccccgctgcctatggggctcccctgacctctcctgagccctcc cagcctccatgtgctgtgtgcattgggcaccgggatgcccctcgagcctcttcgccccct cgctatttgcagcacgaccccttccccttcttcccagagccccgcgcccctgagagtgaa ccgccccaccacgagcctccctatataccacctgctgtgtgcattggacaccgagatgcc ccccgggcgtcctcgcccccccgccacacccaatttgaccccttccccttcctcccagac acatcagatgccgagcatcagtgtcagtccccccaacacgagccccttcagctccctgca cctgtgtgtattgggtaccgagatgcaccccgggcctcctccccaccacgccaggcccca gagccttccctcttattccaggacctccccagggccagcacagagagccttgtcccttcc atggactctctgcacgagtgcccccacatccccacccctgtgtgcattgggcaccgggat gcaccctccttctcatccccaccacgccaggctcctgagccatccctcttcttccaggat ccccctggaactagtatggagagcctggccccctccactgactctctgcatggctcccca gtgctgatcccccaagtgtgcatcgggcaccgggatgcaccccgagcctcctccccaccc cgccacccacccagtgacctagcgttcctggcaccctcaccttcaccgggcagctctggg ggctcccggggctcagcgcctcccggggagaccaggcacaacttggagcgggaggagtac actgtgctggccgacctgcccccacccaggaggctggcccagagacagccagggccccag gcgcagtgcagcagcgggggccgcacccacagccctggccgtgcagaggtggagcgcctc ttcgggcaagagcgcagggaggaacagcccactgggtcacgtctgggctcttgcttcatc gaagctcctataccccagttcgggggaaaattcactctaaggccttcagctctcacagag aagtccgaggcagcgggggccttccaggcccaggacgagggacggtcacagcagcccagc caaggccagagccaacttctccgaagacagtccagccctgcccccagcaggcaggtgacc atgctccctgccaaacaggcagaactgacccggcggagccaagcagagccccctcatcct tggagtcctgagaagagacctgagggagatcggcagctccaggggtccccgctgcccccc aggacatcagccaggacccctgagagggagctgcggacacagagacctctggagagtggc caagcaggcccaagacagcctctgggggtgtggcagagtcaggaggaaccgccagggtcc cagggccctcatagacacctagaaaggagctggagcagccaggagggaggcctgggccct gggggctggtggggatgtggagagcccagcctgggggcagccaaagccccggagggagca tgggggggcacttccagggagtacaaggagagctgggggcagccagaggcctgggaggag aagcccactcatgagctccccagagaactaggaaagagaagcccactcacgagcccccct gagaactggggaggccccgcagagtcctcacaatcctggcactctgggacacccactgct gtgggctggggggcagagggagcgtgtccatacccgcgtggctctgagaggcgacccgag cttgactggagggatctgcttggccttctccgggcaccaggagagggggtctgggcccgt gtccccagcctggactgggagggcctcttggagctcctgcaggccaggctgccccgcaag gacccagctggacacagggatgacctggccagggctttagggccagagctgggtccccca ggcacaaacgatgtccctgagcaggagtcacacagccagccagaaggctgggccgaggcc accccagtcaatggacacagccccgcactgcagtcccagagcccggtccagctgcccagc cctgcctgcacctccacccagtggccaaagatcaaagtgacaagaggaccagcgaccgca actctggcaggcctggagcagacgggccccctggggagcaggagcactgcgaagggcccc agcttgccagagctgcaggcagacaagaggccagcagagggcaaggctgggagcccgctc aagggccgactggtgacctcatggcggatgcccggggaccggcccacgctgttcaatccg ttcctgctgtctctgggggtcctcagttgcctgtcctgggggcagcacgtgctgagcaag ggtaaggctgccggaagcagcgtgtggggtgcttggaagatggacagcacatccctgctg gtggcagcagccttcctgagggaggtgtcctcctgtgattatagggccttgtcaggtgga gatggactagcgaggagacagcagtgtggacagaaacgggtggtcatgtttgagaagtag >gi568815576f:37539788_37755620|GENSCAN_predicted_peptide_9|69_aa MLQLVAPRPRGCAPLGGTQKPDLLNFKKGWMSILDEPGEADELDGEIDLRSCTDVTEYAV QRNYGFQIH >gi568815576f:37539788_37755620|GENSCAN_predicted_CDS_9|207_bp atgctgcagctggtagcccccagaccccggggctgtgcccccctgggcggcacccagaag cccgatctgctcaacttcaagaagggatggatgtcgatcttggacgagcctggagaggca gatgagctggatggtgagatcgacctgcgttcctgcacggatgtcactgagtacgcggtg cagcgcaactatggcttccagatccac