GENSCAN 1.0 Date run: 3-Nov-116 Time: 23:46:02 Sequence gi568815595r:52129037_52335107 : 206071 bp : 51.57% C+G : Isochore 3 (51 - 57 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 1586 1653 68 0 2 113 42 70 0.312 3.29 1.02 PlyA + 3644 3649 6 1.05 2.11 PlyA - 4349 4344 6 1.05 2.10 Term - 8118 8035 84 0 0 85 46 79 0.975 1.35 2.09 Intr - 9266 9133 134 0 2 86 94 189 0.454 20.17 2.08 Intr - 13456 13373 84 2 0 65 82 31 0.466 0.49 2.07 Intr - 16925 16810 116 1 2 71 65 249 0.822 21.49 2.06 Intr - 18059 17952 108 1 0 84 100 158 0.998 16.50 2.05 Intr - 20353 20174 180 0 0 61 63 164 0.769 10.70 2.04 Intr - 20951 20780 172 0 1 32 56 171 0.049 7.82 2.03 Intr - 22064 21980 85 2 1 81 86 75 0.031 6.49 2.02 Intr - 25608 25319 290 1 2 82 103 187 0.018 17.11 2.01 Init - 28220 28172 49 2 1 86 58 28 0.016 -1.35 2.00 Prom - 33129 33090 40 -0.81 3.00 Prom + 37896 37935 40 -0.61 3.01 Init + 43788 43887 100 2 1 61 109 47 0.825 4.49 3.02 Intr + 46825 46939 115 2 1 75 115 33 0.060 4.71 3.03 Intr + 53202 53241 40 0 1 96 80 52 0.069 3.91 3.04 Intr + 56903 56938 36 1 0 93 62 35 0.190 0.34 3.05 Intr + 61597 61633 37 0 1 116 60 20 0.223 0.32 3.06 Intr + 64114 64691 578 2 2 54 92 600 0.268 50.05 3.07 Intr + 65144 65389 246 1 0 -16 75 256 0.833 12.09 3.08 Intr + 68943 69081 139 2 1 103 -15 174 0.956 9.14 3.09 Intr + 70174 70404 231 2 0 50 89 146 0.485 9.17 3.10 Intr + 73471 73698 228 2 0 100 116 82 0.980 10.57 3.11 Intr + 74827 74976 150 2 0 59 101 117 0.991 10.75 3.12 Intr + 75657 75879 223 1 1 66 92 176 0.667 13.51 3.13 Intr + 76803 76987 185 0 2 67 110 125 0.949 12.55 3.14 Intr + 77536 77715 180 2 0 43 115 168 0.939 15.36 3.15 Intr + 79047 79211 165 1 0 59 76 191 0.997 15.45 3.16 Intr + 82247 82515 269 0 2 75 62 364 0.981 30.29 3.17 Intr + 83222 83384 163 1 1 75 97 199 0.999 19.56 3.18 Term + 90207 90343 137 1 2 70 45 75 0.561 -0.11 3.19 PlyA + 91106 91111 6 1.05 4.02 PlyA - 92064 92059 6 1.05 4.01 Sngl - 95225 92181 3045 2 0 94 42 4187 0.285 405.23 4.00 Prom - 96634 96595 40 -4.81 5.11 PlyA - 97939 97934 6 1.05 5.10 Term - 100165 99998 168 1 0 98 49 343 0.989 29.59 5.09 Intr - 100746 100625 122 1 2 84 30 235 0.999 18.02 5.08 Intr - 101034 100884 151 0 1 71 95 239 0.999 23.15 5.07 Intr - 101959 101834 126 1 0 110 64 163 0.999 17.48 5.06 Intr - 102195 102091 105 0 0 92 90 65 0.923 8.01 5.05 Intr - 102503 102408 96 2 0 100 84 233 0.999 24.81 5.04 Intr - 103086 102908 179 1 2 76 86 240 0.986 22.76 5.03 Intr - 106070 105993 78 0 0 96 113 108 0.907 14.12 5.02 Intr - 110046 109956 91 0 1 75 89 100 0.642 8.87 5.01 Init - 111884 111828 57 2 0 65 53 15 0.348 -3.04 5.00 Prom - 112250 112211 40 -4.01 6.00 Prom + 114470 114509 40 -4.01 6.01 Init + 116789 117012 224 1 2 84 72 268 0.912 20.81 6.02 Intr + 117083 117226 144 2 0 72 36 88 0.613 1.91 6.03 Intr + 117659 117776 118 2 1 68 105 161 0.956 16.77 6.04 Intr + 117938 118192 255 1 0 49 56 329 0.897 23.97 6.05 Intr + 118646 118758 113 1 2 72 109 134 0.997 13.58 6.06 Intr + 119117 119201 85 2 1 110 82 133 0.999 15.22 6.07 Intr + 119254 119417 164 0 2 44 105 109 0.936 7.49 6.08 Intr + 119601 119664 64 0 1 66 82 50 0.935 1.41 6.09 Intr + 119920 120027 108 0 0 88 110 57 0.675 8.88 6.10 Intr + 120138 120286 149 2 2 5 84 189 0.899 9.74 6.11 Term + 120634 120778 145 1 1 50 55 71 0.548 -2.31 6.12 PlyA + 122729 122734 6 1.05 7.10 PlyA - 123848 123843 6 -0.45 7.09 Term - 128483 128454 30 2 0 110 46 25 0.445 -1.36 7.08 Intr - 129642 129500 143 1 2 65 80 100 0.768 7.48 7.07 Intr - 130230 130161 70 0 1 62 97 47 0.980 2.25 7.06 Intr - 130836 130681 156 0 0 111 72 207 0.970 22.12 7.05 Intr - 131465 131349 117 2 0 95 94 42 0.937 6.67 7.04 Intr - 132443 132344 100 1 1 52 95 65 0.942 4.21 7.03 Intr - 137982 137916 67 1 1 115 116 38 0.974 7.76 7.02 Intr - 141773 141676 98 1 2 102 115 53 0.998 9.55 7.01 Init - 149325 149165 161 0 2 111 89 341 0.914 35.77 7.00 Prom - 153398 153359 40 0.59 8.00 Prom + 153470 153509 40 -4.71 8.01 Init + 161307 161683 377 2 2 74 64 504 0.998 41.03 8.02 Intr + 161924 162075 152 2 2 106 64 232 0.944 22.92 8.03 Intr + 162711 162886 176 1 2 31 80 106 0.653 4.28 8.04 Intr + 163224 164043 820 2 1 91 81 693 0.106 60.56 8.05 Intr + 169345 169480 136 2 1 43 36 100 0.024 0.43 8.06 Intr + 170994 171109 116 0 2 45 109 48 0.058 3.19 8.07 Intr + 173754 173842 89 1 2 74 57 20 0.005 -2.31 8.08 Intr + 175360 175432 73 0 1 107 92 26 0.005 4.37 8.09 Intr + 181682 181871 190 0 1 49 43 93 0.001 0.06 8.10 Intr + 184202 184243 42 2 0 70 102 29 0.004 0.34 8.11 Intr + 185209 185289 81 1 0 46 81 83 0.433 2.65 8.12 Intr + 193373 193739 367 2 1 88 84 233 0.928 18.61 8.13 Intr + 194772 194844 73 2 1 53 78 65 0.582 1.47 8.14 Intr + 197104 197288 185 2 2 92 43 325 0.527 28.43 8.15 Intr + 197625 197855 231 2 0 79 105 220 0.592 21.30 8.16 Intr + 198089 198187 99 1 0 88 61 55 0.924 3.61 8.17 Intr + 198846 198978 133 2 1 94 89 157 0.999 17.12 8.18 Intr + 199631 199687 57 0 0 87 80 25 0.141 0.95 8.19 Intr + 202112 202273 162 0 0 66 86 259 0.202 23.96 8.20 Intr + 203106 203358 253 1 1 84 80 358 0.190 31.63 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 25335 25634 300 2 0 118 49 293 0.981 22.51 S.002 Intr + 46841 46939 99 2 0 98 115 43 0.908 7.72 S.003 Term + 163224 164090 867 2 0 91 45 712 0.886 60.35 S.004 Term - 178774 178610 165 1 0 93 32 137 0.935 6.83 S.005 Term + 203106 203362 257 1 2 84 47 363 0.804 27.88 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:52129037_52335107|GENSCAN_predicted_peptide_1|22_aa XGARPPKPGVTLLSIKTVLQNA >gi568815595r:52129037_52335107|GENSCAN_predicted_CDS_1|69_bp ngaggtgcccgaccacccaagccaggtgtgaccctcctgagcatcaaaacagtcctccaa aatgcctag >gi568815595r:52129037_52335107|GENSCAN_predicted_peptide_2|433_aa MGFHHVGQAGLELLTSGGTSGGGQRLLAALSVGCLGLGNWARRPWRRPPVIQLCFQSSGK KLRRFSFNCGGRGVPTRPALLLVAVGGAETERAATATAAAFGAASPAMAAPCAEDPSLER HFKGHRDAVTCVDFSINTKQLASGSMDSCLMVWHMKPQSRAYRFTGHKDAVTCVNFSPSG HLLASGSRDKTVRIWVPNVKGESTVFRAHTATVRSVHFCSDGQSFVTASDDKTVKVWATH RQKFLFSLSQHINWVRCAKFSPDGRLIVSASDDKTVKLWDKSSRECVHSYCEHGGFVTYV DFHPSGTCIAAAGMDNTVKVWDVRTHRLLQHYQWLLELKPTWGFGLWRQLEPFVLRWLTL LVHSAAVNGLSFHPSGNYLITASSDSTLKILDLMEGRLLYTLHGHQRHIQCAGLVGLFSL GQHIQRGWLLALV >gi568815595r:52129037_52335107|GENSCAN_predicted_CDS_2|1302_bp atggggtttcaccatgttggccaggctggtctcgaacttctgacctcaggcggaacgagt ggcggcggacagaggcttttggctgcgctgtctgtgggttgtctgggcttggggaactgg gcacggagaccgtggcgccggccgccggtcatacagctctgcttccagtcgtcgggcaaa aagctgaggcgcttctcctttaattgcggtgggcggggcgtgccgacgcggcccgcgttg ctattggtggccgttgggggcgcggaaactgaacgggccgcaacggccaccgcagctgcc ttcggcgccgccagccccgccatggctgcgccctgcgcggaggacccctcgctggaaagg cattttaagggccaccgagatgcagttacctgtgtggacttcagtatcaacacaaagcag ctggccagtggctccatggactcatgcctcatggtctggcacatgaagccgcagtcacgc gcctaccgcttcactggccacaaggatgccgtcacctgtgtgaacttctctccttcggga cacctgcttgcttccggctcccgagacaagactgtccgcatctgggtacccaatgtcaaa ggtgagtccactgtgtttcgtgcacacacagccacagtgaggagtgtccacttctgcagt gatggccagtccttcgtgacagcctctgacgacaagacagtcaaagtgtgggcaactcat cgccagaaattcctgttctccctgagccagcatatcaactgggtccgctgtgccaagttc tcccccgacgggcggctcatcgtgtctgccagtgatgacaagactgttaagctgtgggac aagagcagccgggaatgtgtccactcgtattgtgagcatggcggctttgtcacctatgtg gacttccaccccagtgggacgtgcattgccgctgccggcatggacaacacagtgaaggtg tgggacgtgcggactcaccggctgctgcagcattatcagtggttgttggagttgaagccc acttggggctttgggctatggaggcagctggagccatttgtcttgcgctggctgactctc ctagtgcacagtgcagcagtgaacgggctctctttccacccgtcgggaaactacctgatc acagcctccagtgactcaaccctgaagatcctggacctgatggagggccggctgctctac acactccacgggcatcagagacatatccagtgtgcaggactagtgggcctgttctccctg ggccagcacatccagagggggtggctgctggccctggtgtaa >gi568815595r:52129037_52335107|GENSCAN_predicted_peptide_3|1073_aa MQTELQSSGSQPQLLFRTPAGGAQTLAFYKSSPAPLPLRDCPAPHNPAQLQGNEGVIQVT KAPSDESSLGLRMPCKLIGVTVGSWVREMKAQSQWTPGGVLDQKPEGPGELANTSTMPYQ YPLLTPEQKELSDMTHRIVALGKGIPAADESTGSTAKWLQSIGTENTEENRCFYRQLWLT ADNRVNPCIKGVILFHETLYQKADDGRPFPQVIKSKGNVVSIKVDKGVVPLERANGETTT QGLDGLSERCAPYKKDGAHFAKWHHVLKIGKHTPSALTIIENANISWARWLTPVIPALWE AEYCQYVTEKVWLLYKALSDHDIYLEGTLLKSNMVTPGHACIQKFSHEEIAMATVTSTTL WGISGLEGSQLGQNRAHPDSSLRQASRGIPGSTDIRRAQEVGLSAPPPRLAEGYAARLRR RRQRPRRMRSVFPQEPAYFLNMESVVRRCPFLSRVPQAFLQKAGKSLLFYAQNCPKMMEV GAKPAPRALSTAAVHYQQIKETPPASEKDKTAKAKVQQTPDGSQQSPDGTQLPSGHPLPA TSQGTASKCPFLAAQMNQRGSSVFCKASLELQEDVQEMNAVRKEVAETSAGPSVVSVKTD GGDPSGLLKNFQDIMQKQRPERVSHLLQDNLPKSVSTFQYDRFFEKKIDEKKNDHTYRVF KTVNRRAHIFPMADDYSDSLITKKQVSVWCSNDYLGMSRHPRVCGAVMDTLKQHGAGAGG TRNISGTSKFHVDLERELADLHGKDAALLFSSCFVANDSTLFTLAKMMPGCEIYSDSGNH ASMIQGIRNSRVPKYIFRHNDVSHLRELLQRSDPSVPKIVAFETVHSMDGAVCPLEELCD VAHEFGAITFVDEVHAVGLYGARGGGIGDRDGVMPKMDIISGTLGKAFGCVGGYIASTSS LIDTVRSYAAGFIFTTSLPPMLLAGALESVRILKSAEGRVLRRQHQRNVKLMRQMLMDAG LPVVHCPSHIIPVRVADAAKNTEVCDELMSRHNIYVQAINYPTVPRGEELLRIAPTPHHT PQMMNYFLDEQITGIYSPPQEPLYPSQKEDGSIDFPHSHALGSRAKATLQGLP >gi568815595r:52129037_52335107|GENSCAN_predicted_CDS_3|3222_bp atgcagacagagcttcagagcagcggttctcagccccagctgctgttcagaacccctgca ggtggagctcagacactcgcattttataaaagctccccagctcccttacctctcagggac tgtcctgctccccacaacccagctcagctgcaaggaaacgagggggtcattcaggtgaca aaggccccctctgatgagtcatccttgggtctcaggatgccttgcaagcttatcggggtc actgtgggcagctgggtcagagaaatgaaggcacagtcccagtggaccccgggtggagtg ctggaccagaagccagaagggccaggagaacttgctaacaccagtaccatgccctaccaa tatccattgctgaccccggagcagaaggagctgtctgacatgactcatcgcatcgtagct ctgggcaagggcatcccggctgcagatgagtccactgggagcactgccaagtggctgcag tccattggcaccgagaataccgaggagaaccggtgcttctaccgccagctgtggctgaca gctgacaaccgcgtgaacccctgcatcaagggtgtcatcctcttccatgagacgctgtac caaaaggcggatgatgggcgtcccttcccccaagttatcaaatccaagggcaatgttgtg agcatcaaggtagacaagggggtggtacccctggaaagggcaaatggtgagaccaccacc caagggctggatgggctgtctgagcgctgtgccccatacaagaaggatggagcccacttt gccaagtggcatcatgtgctgaagattgggaaacacaccccctcagccctcaccatcatt gaaaatgccaacatcagctgggcgcggtggctcacgcctgtaatcccagcactctgggag gctgagtactgccagtatgtaaccgagaaggtgtggctgctctacaaggctctgagtgac catgacatctacctggaaggcaccttgctgaagtccaatatggtcaccccaggccatgcc tgcatccagaagttttctcatgaggagattgccatggcaactgtcacaagcactacactc tggggaataagcggcctagaaggatcacaactgggccagaacagggcccacccagactca tcactgaggcaggcgagccggggaatccccggcagcaccgatatccgcagagcccaagaa gtgggcctctctgccccgcccccccggctcgccgagggctacgcggcgcgcctgcgcaga agaaggcagcgcccaaggcgcatgcgcagcgtctttccacaggagccagcatacttcctg aacatggagagtgttgttcgccgctgcccattcttatcccgagtcccccaggcctttctg cagaaagcaggcaaatctctgttgttctatgcccaaaactgccccaagatgatggaagtt ggggccaagccagcccctcgggcattgtccactgcagcagtacactaccaacagatcaaa gaaacccctccggccagtgagaaagacaaaactgctaaggccaaggtccaacagactcct gatggatcccagcagagtccagatggcacacagcttccgtctggacaccccttgcctgcc acaagccagggcactgcaagcaaatgccctttcctggcagcacagatgaatcagagaggc agcagtgtcttctgcaaagccagtcttgagcttcaggaggatgtgcaggaaatgaatgcc gtgaggaaagaggttgctgaaacctcagcaggccccagtgtggttagtgtgaaaaccgat ggaggggatcccagtggactgctgaagaacttccaggacatcatgcaaaagcaaagacca gaaagagtgtctcatcttcttcaagataacttgccaaaatctgtttccacttttcagtat gatcgtttctttgagaaaaaaattgatgagaaaaagaatgaccacacctatcgagttttt aaaactgtgaaccggcgagcacacatcttccccatggcagatgactattcagactccctc atcaccaaaaagcaagtgtcagtctggtgcagtaatgactacctaggaatgagtcgccac ccacgggtgtgtggggcagttatggacactttgaaacaacatggtgctggggcaggtggt actagaaatatttctggaactagtaaattccatgtggacttagagcgggagctggcagac ctccatgggaaagatgccgcactcttgttttcctcgtgctttgtggccaatgactcaacc ctcttcaccctggctaagatgatgccaggctgtgagatttactctgattctgggaaccat gcctccatgatccaagggattcgaaacagccgagtgccaaagtacatcttccgccacaat gatgtcagccacctcagagaactgctgcaaagatctgacccctcagtccccaagattgtg gcatttgaaactgtccattcaatggatggggcggtgtgcccactggaagagctgtgtgat gtggcccatgagtttggagcaatcaccttcgtggatgaggtccacgcagtggggctttat ggggctcgaggcggagggattggggatcgggatggagtcatgccaaaaatggacatcatt tctggaacacttggcaaagcctttggttgtgttggagggtacatcgccagcacgagttct ctgattgacaccgtacggtcctatgctgctggcttcatcttcaccacctctctgccaccc atgctgctggctggagccctggagtctgtgcggatcctgaagagcgctgagggacgggtg cttcgccgccagcaccagcgcaacgtcaaactcatgagacagatgctaatggatgccggc ctccctgttgtccactgccccagccacatcatccctgtgcgggttgcagatgctgctaaa aacacagaagtctgtgatgaactaatgagcagacataacatctacgtgcaagcaatcaat taccctacggtgccccggggagaagagctcctacggattgcccccacccctcaccacaca ccccagatgatgaactacttccttgatgagcagataacagggatctacagcccaccacag gagcccctgtacccaagtcagaaagaggacggcagcatagacttcccccacagccatgcc ctggggagtagggcaaaagcaaccctccaagggctgccatga >gi568815595r:52129037_52335107|GENSCAN_predicted_peptide_4|1014_aa MLAMTLALGTLPAFLPCELQPHGLVNCNWLFLKSVPHFSMAAPRGNVTSLSLSSNRIHHL HDSDFAHLPSLRHLNLKWNCPPVGLSPMHFPCHMTIEPSTFLAVPTLEELNLSYNNIMTV PALPKSLISLSLSHTNILMLDSASLAGLHALRFLFMDGNCYYKNPCRQALEVAPGALLGL GNLTHLSLKYNNLTVVPRNLPSSLEYLLLSYNRIVKLAPEDLANLTALRVLDVGGNCRRC DHAPNPCMECPRHFPQLHPDTFSHLSRLEGLVLKDSSLSWLNASWFRGLGNLRVLDLSEN FLYKCITKTKAFQGLTQLRKLNLSFNYQKRVSFAHLSLAPSFGSLVALKELDMHGIFFRS LDETTLRPLARLPMLQTLRLQMNFINQAQLGIFRAFPGLRYVDLSDNRISGASELTATMG EADGGEKVWLQPGDLAPAPVDTPSSEDFRPNCSTLNFTLDLSRNNLVTVQPEMFAQLSHL QCLRLSHNCISQAVNGSQFLPLTGLQVLDLSHNKLDLYHEHSFTELPRLEALDLSYNSQP FGMQGVGHNFSFVAHLRTLRHLSLAHNNIHSQVSQQLCSTSLRALDFSGNALGHMWAEGD LYLHFFQGLSGLIWLDLSQNRLHTLLPQTLRNLPKSLQVLRLRDNYLAFFKWWSLHFLPK LEVLDLAGNQLKALTNGSLPAGTRLRRLDVSCNSISFVAPGFFSKAKELRELNLSANALK TVDHSWFGPLASALQILDVSANPLHCACGAAFMDFLLEVQAAVPGLPSRVKCGSPGQLQG LSIFAQDLRLCLDEALSWDCFALSLLAVALGLGVPMLHHLCGWDLWYCFHLCLAWLPWRG RQSGRDEDALPYDAFVVFDKTQSAVADWVYNELRGQLEECRGRWALRLCLEERDWLPGKT LFENLWASVYGSRKTLFVLAHTDRVSGLLRASFLLAQQRLLEDRKDVVVLVILSPDGRRS RYVRLRQRLCRQSVLLWPHQPSGQRSFWAQLGMALTRDNHHFYNRNFCQGPTAE >gi568815595r:52129037_52335107|GENSCAN_predicted_CDS_4|3045_bp atgctggccatgaccctggccctgggtaccttgcctgccttcctaccctgtgagctccag ccccacggcctggtgaactgcaactggctgttcctgaagtctgtgccccacttctccatg gcagcaccccgtggcaatgtcaccagcctttccttgtcctccaaccgcatccaccacctc catgattctgactttgcccacctgcccagcctgcggcatctcaacctcaagtggaactgc ccgccggttggcctcagccccatgcacttcccctgccacatgaccatcgagcccagcacc ttcttggctgtgcccaccctggaagagctaaacctgagctacaacaacatcatgactgtg cctgcgctgcccaaatccctcatatccctgtccctcagccataccaacatcctgatgcta gactctgccagcctcgccggcctgcatgccctgcgcttcctattcatggacggcaactgt tattacaagaacccctgcaggcaggcactggaggtggccccgggtgccctccttggcctg ggcaacctcacccacctgtcactcaagtacaacaacctcactgtggtgccccgcaacctg ccttccagcctggagtatctgctgttgtcctacaaccgcatcgtcaaactggcgcctgag gacctggccaatctgaccgccctgcgtgtgctcgatgtgggcggaaattgccgccgctgc gaccacgctcccaacccctgcatggagtgccctcgtcacttcccccagctacatcccgat accttcagccacctgagccgtcttgaaggcctggtgttgaaggacagttctctctcctgg ctgaatgccagttggttccgtgggctgggaaacctccgagtgctggacctgagtgagaac ttcctctacaaatgcatcactaaaaccaaggccttccagggcctaacacagctgcgcaag cttaacctgtccttcaattaccaaaagagggtgtcctttgcccacctgtctctggcccct tccttcgggagcctggtcgccctgaaggagctggacatgcacggcatcttcttccgctca ctcgatgagaccacgctccggccactggcccgcctgcccatgctccagactctgcgtctg cagatgaacttcatcaaccaggcccagctcggcatcttcagggccttccctggcctgcgc tacgtggacctgtcggacaaccgcatcagcggagcttcggagctgacagccaccatgggg gaggcagatggaggggagaaggtctggctgcagcctggggaccttgctccggccccagtg gacactcccagctctgaagacttcaggcccaactgcagcaccctcaacttcaccttggat ctgtcacggaacaacctggtgaccgtgcagccggagatgtttgcccagctctcgcacctg cagtgcctgcgcctgagccacaactgcatctcgcaggcagtcaatggctcccagttcctg ccgctgaccggtctgcaggtgctagacctgtcccacaataagctggacctctaccacgag cactcattcacggagctaccgcgactggaggccctggacctcagctacaacagccagccc tttggcatgcagggcgtgggccacaacttcagcttcgtggctcacctgcgcaccctgcgc cacctcagcctggcccacaacaacatccacagccaagtgtcccagcagctctgcagtacg tcgctgcgggccctggacttcagcggcaatgcactgggccatatgtgggccgagggagac ctctatctgcacttcttccaaggcctgagcggtttgatctggctggacttgtcccagaac cgcctgcacaccctcctgccccaaaccctgcgcaacctccccaagagcctacaggtgctg cgtctccgtgacaattacctggccttctttaagtggtggagcctccacttcctgcccaaa ctggaagtcctcgacctggcaggaaaccagctgaaggccctgaccaatggcagcctgcct gctggcacccggctccggaggctggatgtcagctgcaacagcatcagcttcgtggccccc ggcttcttttccaaggccaaggagctgcgagagctcaaccttagcgccaacgccctcaag acagtggaccactcctggtttgggcccctggcgagtgccctgcaaatactagatgtaagc gccaaccctctgcactgcgcctgtggggcggcctttatggacttcctgctggaggtgcag gctgccgtgcccggtctgcccagccgggtgaagtgtggcagtccgggccagctccagggc ctcagcatctttgcacaggacctgcgcctctgcctggatgaggccctctcctgggactgt ttcgccctctcgctgctggctgtggctctgggcctgggtgtgcccatgctgcatcacctc tgtggctgggacctctggtactgcttccacctgtgcctggcctggcttccctggcggggg cggcaaagtgggcgagatgaggatgccctgccctacgatgccttcgtggtcttcgacaaa acgcagagcgcagtggcagactgggtgtacaacgagcttcgggggcagctggaggagtgc cgtgggcgctgggcactccgcctgtgcctggaggaacgcgactggctgcctggcaaaacc ctctttgagaacctgtgggcctcggtctatggcagccgcaagacgctgtttgtgctggcc cacacggaccgggtcagtggtctcttgcgcgccagcttcctgctggcccagcagcgcctg ctggaggaccgcaaggacgtcgtggtgctggtgatcctgagccctgacggccgccgctcc cgctacgtgcggctgcgccagcgcctctgccgccagagtgtcctcctctggccccaccag cccagtggtcagcgcagcttctgggcccagctgggcatggccctgaccagggacaaccac cacttctataaccggaacttctgccagggacccacggccgaatag >gi568815595r:52129037_52335107|GENSCAN_predicted_peptide_5|390_aa MERVSWGDHAVPYFVTVHPRALPGQGQAEGSDGADGAKRRAMAHQTGIHATEELKEFFAK ARAGSVRLIKVVIEDEQLVLGASQEPVGRWDQDYDRAVLPLLDAQQPCYLLYRLDSQNAQ GFEWLFLAWSPDNSPVRLKMLYAATRATVKKEFGGGHIKDELFGTVKDDLSFAGYQKHLS SCAAPAPLTSAERELQQIRINEVKTEISVESKHQTLQGLAFPLQPEAQRALQQLKQKMVN YIQMKLDLERETIELVHTEPTDVAQLPSRVPRDAARYHFFLYKHTHEGDPLESVVFIYSM PGYKCSIKERMLYSSCKSRLLDSVEQDFHLEIAKKIEIGDGAELTAEFLYDEVHPKQHAF KQAFAKPKGPGGKRGHKRLIRGPGENGDDS >gi568815595r:52129037_52335107|GENSCAN_predicted_CDS_5|1173_bp atggagagggtcagctggggagaccacgcagtaccctactttgtcacagttcatccacga gcgttgccgggccagggtcaagcggagggctccgacggcgcggacggagcgaagcgccga gccatggcgcaccaaacgggcatccacgccacggaagagctgaaggaattctttgccaag gcacgggctggctctgtgcggctcatcaaggttgtgattgaggacgagcagctcgtgctg ggtgcctcgcaggagccagtaggccgctgggatcaggactatgacagggccgtgctgcca ctgctggacgcccagcagccctgctacctgctctaccgcctcgactcacagaatgctcag ggcttcgaatggctcttcctcgcctggtcgcctgataactcccccgtgcggctgaagatg ctgtacgcggccacgcgggccacagtgaaaaaggagtttggaggtggccacatcaaggat gagctcttcgggactgtgaaggatgacctctcttttgctgggtaccagaaacacctgtcg tcctgtgcggcacctgccccgctgacctcggctgagagagagctccagcagatccgcatt aacgaggtgaagacagagatcagtgtggaaagcaagcaccagaccctgcagggcctcgcc ttccccctgcagcctgaggcccagcgggcactccagcagctcaagcagaaaatggtcaac tacatccagatgaagctggacctagagcgggaaaccattgagctggtgcacacagagccc acggatgtggcccagctgccctcccgggtgccccgagatgctgcccgctaccacttcttc ctctacaagcacacccatgagggcgacccccttgagtctgtagtgttcatctactccatg ccggggtacaagtgcagcatcaaggagcgaatgctctactccagctgcaagagccgcctc ctcgactccgtggagcaggacttccatctggagatcgccaagaaaattgagattggcgat ggggcagagctgacggcagagttcctctacgacgaggtgcaccccaagcaacacgccttc aagcaggccttcgccaagcccaagggcccagggggcaagcggggccataagcgcctcatc cgcggcccgggtgaaaatggggatgacagctag >gi568815595r:52129037_52335107|GENSCAN_predicted_peptide_6|522_aa MSAGWFRRRFLPGEPLPAPRPPGPHASPVPYRRPRFLRGSSSSPGAADASRRPDSRPVRS PARGRTLPWNAGYAEKREELSDSPHPPVDSPFLPSPPNPDNFPVIPTQRFPWDSDLPLRE DRRIINAEKSEFNEDQAACGKLCIRRCEFGAEEEWLTLCPEEFLTGHYWALFDGHGGPAA AILAANTLHSCLRRQLEAVVEGLVATQPPMHLNGRCICPSDPQFVEEKGIRAEDLVIGAL ESAFQECDEVIGRELEASGQMGGCTALVAVSLQGKLYMANAGDSRAILVRRDEIRPLSFE FTPETERQRIQQLGHKKEYDLSAASPPQAFVYPELLAGEFTRLEFPRRLKGDDLGQKVLF RDHHMSGWSYKRVEKSDLKYPLIHGQGRQARLLGTLAVSRGLGDHQLRVLDTNIQLKPFL LSVPQVTVLDVDQLELQEDDVVVMATDGLWDVLSNEQVAWLVRSFLPGNQEDPHRFSKLA QMLIHSTQGKEDSLTEEGQVSYDDVSVFVIPLHSQGQESSDH >gi568815595r:52129037_52335107|GENSCAN_predicted_CDS_6|1569_bp atgtccgccggctggttccggcgccgcttcctgcctggggagccgctccccgcgccgcgg ccgcctgggccgcatgccagccccgtgccctaccgacggccccgcttccttcgcggctcc agctccagccccggggcggccgacgcctcgcgccgcccagactcccggcccgtgcgcagc cccgcacgaggacgcacgctaccctggaatgcaggctacgccgaaaagcgggaggagctg agtgacagtccccacccacctgtggacagccccttcttgccctcacctccgaatcctgac aacttcccggtcatcccgacccagcgttttccttgggattccgacctccccctcagagaa gacaggaggattatcaatgcagagaaatctgaattcaatgaggatcaagccgcctgtggg aagctgtgcatccggagatgtgagtttggggctgaagaagagtggctgaccctgtgccca gaggagttcctgacaggccattactgggcactgttcgatgggcacggcggtcctgcagca gccatcttggctgccaacaccctgcactcctgcttgcgccggcagctggaggccgtggtg gaaggcttggtggccactcagccccccatgcacctcaatggccgctgcatctgccccagt gaccctcagtttgtggaggaaaagggcatcagggcagaagacttggtgatcggggcattg gagagtgcctttcaggaatgtgatgaggtgatcgggcgggagctggaggcctcaggccag atgggcggctgcacagccctggtggctgtgtccctgcagggaaagctgtacatggccaat gctggggatagcagggccatcttggtgcggagagatgagatacggccactgagcttcgag ttcaccccagagactgagcggcagcggatccagcagctgggccataagaaggagtatgac ctgagcgcagcctccccaccccaggcctttgtctatcctgagcttctggctggtgagttc acccgactggagttccctcggcggctgaagggggatgacttgggacagaaggttttgttc agggatcaccacatgagtggctggagctacaaacgtgtggagaaatcggatctcaagtac ccactgatccatggacagggtaggcaggctcggttactaggaacactggctgtctcccgg ggcctgggagaccatcagctcagagtcctggacacaaacatccagctcaagcccttcttg ctctctgtgccacaggtgactgtgctggatgtggaccagctggagctacaggaggatgat gtggttgtcatggcaactgatggactctgggatgtactgtccaacgagcaggtggcatgg ctggtgcggagcttcctccctgggaaccaagaggacccacacaggttctcaaagctggcc cagatgctgatacacagcacacagggaaaggaagacagtctcacagaggaagggcaggtg tcctacgatgacgtctctgtgttcgtgattcccttgcacagtcagggccaagagagcagt gaccactga >gi568815595r:52129037_52335107|GENSCAN_predicted_peptide_7|313_aa MKLTDSVLRSFRVAKVFRENSDKINCFDFSPNGETVISSSDDDSIVLYDCQEGKPKRTLY SKKYGVDLIRYTHAANTVVYSSNKIDDTIRYLSLHDNKYIRYFPGHSKRVVALSMSPVDD TFISGSLDKTIRLWDLRSPNCQGLMHLQGKPVCSFDPEGLIFAAGVNSEMVKLYDLRSFD KGPFATFKMQYDRTCEWTGLKFSNDGKLILISTNGSFIRLIDAFKGVVMHTFGGYANSKA VTLEASFTPDSQFIMIGSEDGKIHVWNGESGIKVAVLDGKHTGPITCLQFNPKFMTFASA CSNMAFWLPTIDD >gi568815595r:52129037_52335107|GENSCAN_predicted_CDS_7|942_bp atgaagctgaccgacagcgtgttgcggagcttccgcgtcgctaaggtgttccgcgaaaac tcggacaagattaactgcttcgatttcagccccaacggcgagacggtcatctcgagtagc gacgacgactccatcgtgctctatgactgccaggagggcaaaccaaagagaaccctgtac agtaagaaatatggtgtggacctcatcagatacactcatgcagcaaacacagttgtttac agctctaacaaaatagacgatactattcgttacttgtccttgcatgacaacaaatacatc agatactttcctggacatagcaaaagggtggtggccttgtccatgtcacctgtggatgac actttcatttctgggtctcttgataagaccattcgactctgggatctccggtctcctaac tgccagggcctcatgcatctgcaggggaagccagtttgttcttttgatccagaagggtta attttcgctgcaggtgtcaactctgaaatggtcaagctttatgaccttcgttcttttgat aaggggccatttgctacctttaagatgcagtatgatcgaacttgtgagtggacaggactt aaattcagcaatgatggcaagctcatcctcatttccaccaacggcagcttcattcgtctg attgatgcattcaaaggagtggtgatgcacacatttgggggttatgccaacagcaaagct gtcacactggaggcttcatttactccagactctcagtttattatgattggttcagaggat ggcaagatccatgtctggaatggagagagcggtataaaagtagctgtgttggatggtaaa cacacaggcccgattacctgtttgcaattcaaccccaagttcatgacttttgccagtgcg tgttccaacatggccttttggttgcccaccattgatgactga >gi568815595r:52129037_52335107|GENSCAN_predicted_peptide_8|1271_aa MAAALQVLPRLARAPLHPLLWRGSVARLASSMALAEQARQLFESAVGAVLPGPMLHRALS LDPGGRQLKVRDRNFQLRQNLYLVGFGKAVLGMAAAAEELLGQHLVQGVISVPKGIRAAM ERAGKQEMLLKPHSRVQVFEGAEDNLPDRDALRAALAIQQLAEGLTADDLLLVLISGGGS ALLPAPIPPVTLEEKQTLTRLLAARGATIQELNTIRKALSQLKGGGLAQAAYPAQVVSLI LSDVVGDPVEVIASGPTVASSHNVQDCLHILNRYGLRAALPRSVKTVLSRADSDPHGPHT CGHVLNVIIGSNVLALAEAQRQAEALGYQAVVLSAAMQGDVKSMAQFYGLLAHVARTRLT PSMAGASVEEDAQLHELAAELQIPDLQLEEALETMAWGRGPVCLLAGGEPTVQLQGSGRG GRNQELALRVGAELRRWPLGPIDVLFLSGGTDGQDGPTEAAGAWVTPELASQAAAEGLDI ATFLAHNDSHTFFCCLQGGAHLLHTGMTVGVAPAHSSQPPTPLLAQALVRVTVPWTVPLL APSNPIAAAQSESGLPNAELMKDEIMSHFPAAALLLPSAAVTGKGTAWGSQLAHPSEAQG TPVPIPLKRFQPISTYSIYLIRVDKPRQQDPVKDSTKSQTPALVLAALVSRRNSLDLKDS GDHPPRALVTGDHQVPVGGQRAPQGKATTGAKECPEHGQGSHSGNAVSCGETETLATISV WRKSSWSPPGKCPVEDVFVSEPAIWRFGPGRSFGWGISLRSSMEQPNSKGYSLGRTPQGP ECSSAPAVQVGTHRGLEYNPGKILPGSDYGLGNPPALDPKLPHLPLPPAPPTLSDLGQPR KSPLTGTDKKYPLMKQRGFYSDILSPGTLDQLGEVCRGPRMSQNLLRQADLDKFTPRVGS FEVPEDFQERMEQQCIGSTTRLLAQTDFPLQAYEPKMQVPFQVLPGQHPRKIEIERYGWA LVDTHGATSHEPSHPEVPSLPLTRRKQQYLSLDIEQLLFSQGIDSNKLMPRHLDHQHPQT IEQGHDPIFPIYLPLKPVCLASASVPPMGNLDAAQLDCGVSSLTSARLRVFDNEDFDCRT PREWINMGLEPGSLDRKPVPGKALLPTDDFLGHGLPQRLWRLSHFLLWKAVPEDPKSQKL KYKWCEVGVLDYDEEKKLYLVHKTDEKGLVRDEMGRPILNAGVTTEGRPPLQVCQYWVPR IQLLFCAEDPCMFAQRVVQANALRKNTEALLLYNLYVDCMPSDGQHVISEQSLSKIKQWA LSTPRMRKGPS >gi568815595r:52129037_52335107|GENSCAN_predicted_CDS_8|3813_bp atggctgcagccctgcaggtcctgccccgcttggcccgagcccccttgcatccactcctc tggcggggctcagtggcccgtctggccagcagcatggccttggcagagcaggccaggcag ctgtttgagagtgctgtaggtgcagtgctgccgggccccatgctgcaccgggcactatcc ttggaccctggtggcagacagctgaaggtgcgggaccggaactttcagctgaggcaaaac ctctacctggtgggctttggcaaggctgtgctgggtatggcagctgcagctgaggaacta ctgggccagcatcttgtgcagggcgtgatcagcgttcccaaggggatccgtgctgccatg gagcgtgccggcaagcaggagatgctgctgaagccacatagccgtgtccaggtattcgag ggtgcggaggacaacctcccggaccgcgatgcgctgcgggctgcactggccatccagcaa ctggctgagggactcacagctgatgacctgctgctcgtgctgatctcaggtgggggttca gctctgctgcctgcccccatcccacctgtcacactggaggagaagcagacactcactaga ctgctggcagcccgtggagccaccatccaggagttgaacaccattcggaaggccctgtcc cagctcaagggtggggggctggctcaggccgcctaccctgcccaggtggtgagcctcatc ctgtcagatgtggtgggggaccctgtggaggtgattgccagtggccccaccgtggccagt tcccacaatgtgcaagattgcctgcatatcctcaatcgctacggcctccgtgcagccctg ccacgttctgtgaagactgtgctgtctcgggccgactctgacccccatgggccacacacc tgtggccatgtcctgaatgtgatcattggctctaatgtgctggcgctagctgaggcccag cggcaggccgaggcactgggctaccaggctgtggtgctgagtgcagccatgcaaggtgat gtaaaaagtatggcccagttctacgggctgctggcccatgtggctagaacccgcctcacc ccatccatggctggggcttctgtggaggaagatgcacagctccatgagctggcagctgag cttcagatcccagacctgcagctggaggaggctctggagaccatggcatggggaaggggc ccagtctgcctgctggctggtggcgagcccacagtacagctgcagggctcgggcaggggt ggccggaaccaggaactggccctgcgtgttggagcagagttgagaaggtggccgctgggg ccgatagatgtgctgtttttgagcggtggcaccgatgggcaggatgggcccacagaggct gctggggcctgggtcacacctgagcttgccagccaggctgcagctgagggcctggacata gccaccttcctagcccacaatgactcacataccttcttctgctgcctccagggtggggca cacctgctgcacacagggatgacagtgggagtagccccagcacattccagccagccgcca acccccctgctagcacaggcacttgtgagggtcacagtcccctggacggtgccgctgctg gcacccagcaaccccatagcagcagcccagtcagagtcagggctgcctaatgccgagcta atgaaggatgaaataatgagccacttccctgctgctgccctcctgctgccaagtgctgct gtgacaggcaagggcactgcctggggaagtcagctcgcccacccaagtgaagcccagggg acaccagtgcccatccccctgaagagatttcagccaattagcacgtacagtatttactta attagggtggataaaccaaggcagcaagacccagtaaaggacagcaccaagtcccaaacc ccggctttggtccttgcagcgcttgtgtcaaggaggaattctttagatttaaaggacagt ggtgaccatcctccccgagccctggtcaccggggaccaccaggttcctgttggaggacag agggcaccacaggggaaggcaacaacaggggcaaaggaatgcccagagcatgggcagggc agccattcaggcaatgccgtgtcatgtggggaaactgagactctagcaactatcagtgtc tggcgcaaaagctcctggagcccccctgggaaatgtcctgtggaagatgtctttgtctca gagcctgccatctggcgctttggcccaggaaggagcttcggctggggcatctccctgaga agcagcatggagcagcctaacagtaaaggctatagcctgggaaggacccctcagggccca gagtgcagcagtgctcctgcagtccaagtggggacccacaggggcctagagtataacccg gggaagattcttccaggatcagactatgggttgggaaatcctccagcccttgaccccaag ctcccacatttacccctgcccccggccccacccacactctcagacttggggcagccacgg aagtcacccctgacaggcactgataagaagtacccgctgatgaagcagcgtgggttctac tccgacatcctcagccctggaaccttagatcaacttggggaggtatgtcgtggcccccga atgagccagaacctcctgcggcaggctgaccttgacaagttcaccccaagagtcggaagc tttgaggttcctgaagacttccaggagcgcatggagcagcagtgcatcgggtccaccacc cggctgctcgcccagactgacttcccactgcaggcctacgagcccaagatgcaggtgcct ttccaggtgctgccaggccagcatcctcgcaagattgagatcgagaggtacggctgggcc cttgtggacacccatggagccacctcacacgagccaagccatcctgaggtcccctccctg cccttgaccaggaggaaacagcagtacctgagcctggacattgagcagttgctgttcagc cagggcatcgactccaacaagctcatgcccaggcacctggaccaccagcacccccaaacc atcgaacagggccatgacccaatcttccccatctacctcccactgaagcctgtatgcctg gcctcggcctccgtgccacccatgggcaacctagatgcggcccagctggactgtggagta tcctccctgaccagtgcccgcctcagggtatttgacaatgaggactttgactgccggact cccagagagtggatcaacatgggcttggagccagggtctctggacaggaaacctgtcccg ggaaaagccctcttgcccactgatgacttcctggggcatggcctgccccagcggctgtgg aggctctcccatttcctgctgtggaaagcagtgccagaggaccccaagagtcagaagctg aagtacaaatggtgcgaggtcggcgtcctggactacgacgaggagaagaagctatacctg gtacacaagacagacgagaaaggcctggtgcgagatgagatggggaggcccatcctgaat gcaggggtcaccactgaaggaaggccaccccttcaggtctgtcagtactgggtgccacgg atccagcttctcttctgcgctgaggacccttgcatgttcgcacaacgtgtggtccaggcc aacgccctgcgcaagaacacggaagcactgctgctctacaacttgtatgtggactgcatg ccctctgacggccagcatgtcatcagtgaacagagcctgagcaagatcaagcagtgggcc ctgagcacgcctcggatgcgcaaaggcccctcn