GENSCAN 1.0 Date run: 3-Nov-116 Time: 00:36:53 Sequence gi568815595r:47313857_47542993 : 229137 bp : 48.14% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 5797 5927 131 0 2 93 79 130 0.996 12.09 1.02 Intr + 8712 8852 141 0 0 94 89 221 0.984 22.27 1.03 Intr + 16095 16293 199 0 1 86 91 270 0.944 26.45 1.04 Intr + 19301 19461 161 1 2 58 109 141 0.990 11.99 1.05 Intr + 20827 20963 137 1 2 80 82 125 0.991 11.31 1.06 Intr + 22679 22901 223 0 1 44 61 339 0.801 23.99 1.07 Intr + 26716 26820 105 1 0 73 80 169 0.142 13.93 1.08 Intr + 28863 28974 112 0 1 41 65 4 0.182 -6.22 1.09 Term + 29699 30085 387 1 0 125 48 429 0.653 37.54 1.10 PlyA + 32939 32944 6 1.05 2.00 Prom + 36587 36626 40 -7.16 2.01 Init + 44729 44864 136 1 1 67 31 136 0.513 6.10 2.02 Intr + 48938 49052 115 0 1 79 75 52 0.200 2.51 2.03 Intr + 56196 56274 79 0 1 30 119 36 0.101 0.45 2.04 Term + 56340 56441 102 2 0 46 49 251 0.124 15.28 2.05 PlyA + 57319 57324 6 1.05 3.03 PlyA - 57754 57749 6 -0.45 3.02 Term - 57849 57831 19 2 1 75 44 4 0.059 -7.51 3.01 Init - 58414 58308 107 1 2 75 77 107 0.279 7.99 3.00 Prom - 59721 59682 40 1.24 4.00 Prom + 62001 62040 40 -6.16 4.01 Init + 67037 67324 288 1 0 63 105 257 0.636 20.02 4.02 Intr + 82287 82361 75 2 0 98 91 116 0.992 12.61 4.03 Intr + 90796 90923 128 0 2 105 85 138 0.990 14.68 4.04 Intr + 91149 91225 77 0 2 58 85 162 0.999 12.06 4.05 Intr + 91893 91942 50 1 2 128 78 60 0.998 7.40 4.06 Intr + 92059 92190 132 0 0 67 81 254 0.734 23.44 4.07 Intr + 92469 92549 81 2 0 44 91 87 0.887 4.43 4.08 Intr + 92625 92756 132 2 0 112 56 196 0.977 19.64 4.09 Intr + 92847 92894 48 2 0 70 88 92 0.630 6.28 4.10 Intr + 93274 93330 57 0 0 71 98 121 0.958 10.48 4.11 Intr + 93453 93511 59 2 2 96 87 57 0.993 4.08 4.12 Intr + 93649 93728 80 1 2 114 58 43 0.991 3.09 4.13 Intr + 93841 93955 115 2 1 87 94 100 0.996 10.01 4.14 Intr + 94034 94099 66 2 0 77 64 127 0.993 7.22 4.15 Intr + 94489 94634 146 1 2 105 113 206 0.999 24.73 4.16 Intr + 94920 95231 312 1 0 48 99 384 0.829 31.66 4.17 Intr + 95307 95470 164 1 2 88 80 299 0.641 28.79 4.18 Intr + 95561 95712 152 1 2 71 32 317 0.962 23.36 4.19 Intr + 95799 95978 180 0 0 73 72 327 0.924 28.58 4.20 Intr + 96072 96471 400 0 1 93 77 208 0.708 14.71 4.21 Intr + 97072 97830 759 0 0 74 100 820 0.438 73.47 4.22 Intr + 97927 98111 185 0 2 75 62 157 0.998 10.39 4.23 Intr + 98238 98342 105 0 0 99 80 208 0.999 20.43 4.24 Intr + 98427 98565 139 0 1 65 100 190 0.999 18.37 4.25 Intr + 98658 98771 114 2 0 67 100 42 0.893 3.94 4.26 Term + 98850 99329 480 2 0 116 47 276 0.925 21.10 4.27 PlyA + 99559 99564 6 1.05 5.26 PlyA - 99864 99859 6 -0.45 5.25 Term - 100243 99998 246 1 0 104 53 297 0.999 23.59 5.24 Intr - 100530 100324 207 0 0 95 80 200 0.999 19.17 5.23 Intr - 100796 100716 81 2 0 59 115 105 0.511 10.13 5.22 Intr - 101137 100971 167 2 2 97 70 55 0.551 4.28 5.21 Intr - 101324 101242 83 1 2 84 36 79 0.567 1.58 5.20 Intr - 103351 103266 86 1 2 32 105 115 0.988 6.22 5.19 Intr - 103970 103448 523 1 1 107 64 453 0.894 37.75 5.18 Intr - 104393 104278 116 2 2 66 73 147 0.883 10.25 5.17 Intr - 104666 104465 202 1 1 69 89 323 0.999 29.79 5.16 Intr - 104987 104799 189 1 0 69 80 136 0.998 9.60 5.15 Intr - 105848 105472 377 2 2 96 83 321 0.992 26.21 5.14 Intr - 106532 106476 57 2 0 70 92 33 0.560 0.98 5.13 Intr - 106916 106698 219 2 0 73 37 343 0.809 26.20 5.12 Intr - 107173 107075 99 1 0 55 75 92 0.863 4.91 5.11 Intr - 108680 108586 95 0 2 114 110 220 0.999 26.48 5.10 Intr - 110189 110077 113 1 2 125 92 134 0.999 17.52 5.09 Intr - 111755 111629 127 0 1 128 77 221 0.999 24.74 5.08 Intr - 112313 112141 173 1 2 65 46 276 0.637 20.69 5.07 Intr - 113406 113301 106 1 1 87 71 134 0.998 10.87 5.06 Intr - 113811 113591 221 2 2 83 63 222 0.854 17.15 5.05 Intr - 114814 114657 158 1 2 114 80 144 0.998 15.01 5.04 Intr - 117118 117040 79 0 1 91 97 14 0.581 2.15 5.03 Intr - 119478 119311 168 2 0 116 89 21 0.409 4.06 5.02 Intr - 121281 121156 126 2 0 83 11 132 0.455 4.79 5.01 Init - 129137 129016 122 2 2 63 84 79 0.845 4.66 5.00 Prom - 154348 154309 40 -2.46 6.00 Prom + 159268 159307 40 -2.46 6.01 Init + 177468 177588 121 2 1 87 106 12 0.346 3.25 6.02 Term + 179284 179312 29 2 2 121 43 23 0.626 -0.66 6.03 PlyA + 180254 180259 6 1.05 7.07 PlyA - 181807 181802 6 1.05 7.06 Term - 182341 182213 129 1 0 115 43 96 0.985 5.98 7.05 Intr - 184610 184430 181 1 1 76 85 172 0.712 15.57 7.04 Intr - 187995 187738 258 2 0 81 42 230 0.703 14.28 7.03 Intr - 190592 190474 119 2 2 83 82 100 0.053 8.16 7.02 Intr - 200165 199814 352 1 1 58 65 143 0.006 4.33 7.01 Intr - 228599 228469 131 2 2 42 80 106 0.059 4.69 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 67498 67596 99 0 0 94 54 67 0.887 2.03 S.002 Term - 196398 196289 110 1 2 83 48 131 0.912 7.27 S.003 Init - 199996 199814 183 1 0 60 65 154 0.880 8.75 S.004 Sngl - 207441 207289 153 0 0 91 44 151 0.946 5.30 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:47313857_47542993|GENSCAN_predicted_peptide_1|531_aa IGDHKFSAHRIVLAASIPYFHAMFTNDMMECKQDEIVMQGMDPSALEALINFAYNGNLAI DQQNVQSLLMGASFLQLQSIKDACCTFLRERLHPKNCLGVRQFAETMMCAVLYDAANSFI HQHFVEVSMSEEFLALPLEDVLELVSRDELNVKSEEQVFEAALAWVRYDREQRGPYLPEL LSNIRLPLCRPQFLSDRVQQDDLVRCCHKCRDLVDEAKDYHLMPERRPHLPAFRTRPRCC TSIAGLIYAVGGLNSAGDSLNVVEVFDPIANCWERCRPMTTARSRVGVAVVNGLLYAIGG YDGQLRLSTVEAYNPETDTWTRVGSMNSKRSAMGTVVLDGQIYVCGGYDGNSSLSSVETY SPETDKWTVVTSMSSNRSAAGVTVFEGRIYVSGGHDGLQIFSSVEHYNHHTATWHPAAGM LNKRCRHGAASLGSKMFVCGGYDGSGFLSIAEMYSSVADQWCLIVPMHTRRSRVSLVASC GRLYAVGGYDGQSNLSSVEMYDPETDCWTFMAPMACHEGGVGVGCIPLLTI >gi568815595r:47313857_47542993|GENSCAN_predicted_CDS_1|1596_bp attggggaccacaaattcagtgcccaccggattgtcttagcagcctcgatcccgtatttc catgctatgtttacaaatgacatgatggagtgcaagcaggatgagattgtaatgcaagga atggacccaagtgccctggaggctctgatcaactttgcctacaacggcaaccttgccatt gaccagcaaaatgtccagtcattgctgatgggggcgagcttcctgcagctgcagagcatc aaagacgcctgctgcacattccttcgagaacggcttcacccaaaaaactgcctgggtgtg cgccagtttgctgagacaatgatgtgtgctgtgctgtacgacgctgccaacagcttcatc caccagcactttgtggaggtgtccatgtcagaagagttcctggccctgcccttggaagac gtgcttgagctggtgtctcgggatgagctgaatgtcaaatctgaggagcaggtctttgaa gctgcattggcctgggtcagatacgaccgggagcagaggggtccctacctgcctgagctg ctgtccaatatccgcctgcccctctgtcggccccagttcctttcagacagagtacagcag gatgacctggtgcgttgctgccacaaatgcagggacctggtagacgaagcaaaggactac cacctcatgccagagcgccggccccacctgccagctttcagaacccggccacgctgctgc acatccatcgctggacttatctacgctgtagggggcctcaactcagcaggtgattccctg aatgtggtggaagtgttcgaccccattgccaattgctgggagagatgccgtcccatgaca acagcccgcagccgcgttggcgtggctgtggtgaacgggcttctctatgccatcggagga tatgacggccagctacggctgagcactgtggaggcctacaacccggagacagacacatgg accagagtggggagcatgaatagcaagagaagtgccatggggacagtcgtgctggatggg cagatctacgtctgtgggggctacgatggcaactcttccctcagctccgtggagacctac tcacctgagacggacaaatggacagtggtgacctcgatgagctcgaatcgcagtgctgct ggggttacagtctttgagggcaggatatatgtgtcaggcggccatgatggtttgcagatc ttcagcagtgtggaacactacaaccaccacacagccacctggcaccctgcagctggcatg ctcaacaagcgctgccggcacggagccgcctccctggggagcaagatgtttgtctgcggg ggctacgatggctctggcttcctcagcattgccgagatgtacagctctgtggcagaccag tggtgcctgattgtccccatgcacacgcgcaggagccgggtctccctggtggccagctgt gggcgcctctacgctgttgggggctacgacggacagtcaaacctaagctcagtggagatg tatgacccagagacagactgctggacattcatggcccccatggcgtgccatgagggaggg gtcggtgtgggctgcatccctctcctcaccatctaa >gi568815595r:47313857_47542993|GENSCAN_predicted_peptide_2|143_aa MRESLELLRDWLNGCDQNADSDMDNEVQAAKVSDGNEELIGNWSKTMQERPNTSHFLSLS LNFLLCVRKQEEESLPCVFSCHVSLGNKSETPSQKQQQQQQQKQDPGTELSEEERQSEEE RQSEEERQSEEERQSDVGLAAGA >gi568815595r:47313857_47542993|GENSCAN_predicted_CDS_2|432_bp atgagggaaagtttggaacttcttagagactggttaaatggctgtgaccaaaatgctgat agtgatatggacaatgaagtccaggctgccaaagtctcagatggaaatgaggaacttatt ggtaactggagcaaaacaatgcaagaacggcctaatacaagtcacttcctgtctttgtcc ctcaatttcctcctctgtgtgaggaaacaggaagaagaatccctgccctgcgtattcagc tgccacgtgagcctgggcaacaagagtgaaactccatctcaaaaacaacaacaacaacaa caacaaaaacaggacccaggcacagagctgtcagaggaggagcggcagtcagaggaggag cggcagtcagaggaggagcggcagtcagaggaggagcggcagtctgatgttggtttggct gctggggcctga >gi568815595r:47313857_47542993|GENSCAN_predicted_peptide_3|41_aa MSNLISNQENANPNYTEMPLDSHQNDEYTSVNNKCCHQPGS >gi568815595r:47313857_47542993|GENSCAN_predicted_CDS_3|126_bp atgtccaacctcatcagtaaccaggaaaatgcaaaccccaactacactgaaatgccactt gatagtcatcagaatgatgagtatacatctgtcaataataagtgttgccaccagccagga tcctag >gi568815595r:47313857_47542993|GENSCAN_predicted_peptide_4|1507_aa MRNRDSACAKDYASGWLGSLQLPAGRWHFSFPPVTSDFRHEGAGLGSWLSQQLQQLREWP GGRRVPAAMEAVPRMPMIWLDLKEAGDFHFQPAVKKFVLKNYGENPEAYNEELKKLELLR QNAVRVPRDFEGCSVLRKYLGQLHYLQSRVPMGSGQEAAVPVTWTEIFSGKSVAHEDIKY EQACILYNLGALHSMLGAMDKRVSEEGMKVSCTHFQCAAGAFAYLREHFPQAYSVDMSRQ ILTLNVNLMLGQAQECLLEKSMLDNRKSFLVARISAQVVDYYKEACRALENPDTASLLGR IQKDWKKLVQMKIYYFAAVAHLHMGKQAEEQQKFGERVAYFQSALDKLNEAIKLAKGQPD TVQDALRFTMDVIGGKYNSAKKDNDFIYHEAVPALDTLQPVKGAPLVKPLPVNPTDPAVT GPDIFAKLVPMAAHEASSLYSEEKAKLLREMMAKIEDKNEVLDQFMDSMQLDPETVDNLD AYSHIPPQLMEKCAALSVRPDTVRNLVQSMQVLSGVFTDVEASLKDIRDLLEEDELLEQK FQEAVGQAGAISITSKAELAEVRREWAKYMEVHEKASFTNSELHRAMNLHVGNLRLLSGP LDQVRAALPTPALSPEDKAVLQNLKRILAKVQEMRDQRVSLEQQLRELIQKDDITASLVT TDHSEMKVGWKLFEEQLKKYDQLKVYLEQNLAAQDRVLCALTEANVQYAAVRRVLSDLDQ KWNSTLQTLVASYEAYEDLMKKSQEGRDFYADLESKVAALLERTQSTCQAREAARQQLLD RELKKKPPPRPTAPKPLLPRREESEAVEAGDPPEELRSLPPDMVAGPRLPDTFLGSATPL HFPPSPFPSSTGPGPHYLSGPLPPGTYSGPTQLIQPRAPGPHAMPVAPGPALYPAPAYTP ELGLVPRSSPQHGVPPHPPLAYGPAPSTRPMGPQAAPLTIRGPSSAGQSTPSPHLVPSPA PSPGPGPVPPRPPAAEPPPCLRRGAAAADLLSSSPESQHGGTQSPGGGQPLLQPTKVDAA EGRRPQALRLIERDPYEHPERLRQLQQELEAFRGQLGDVGALDTVWRELQDAQEHDARGR SIAIARCYSLKNRHQDVMPYDSNRVVLRSGKDDYINASCVEGLSPYCPPLVATQAPLPGT AADFWLMVHEQKVSVIVMLVSEAEMEKQKVARYFPTERGQPMVHGALSLALSSVRSTETH VERVLSLQFRDQSLKRSLVHLHFPTWPELGLPDSPSNLLRFIQEVHAHYLHQRPLHTPII VHCSSGVGRTGAFALLYAAVQEVEAGNGIPELPQLVRRMRQQRKHMLQEKLHLRFCYEAV VRHVEQVLQRHGVPPPCKPLASASISQKNHLPQDSQDLVLGGDVPISSIQATIAKLSIRP PGGLESPVASLPGPAEPPGLPPASLPESTPIPSSSPPPLSSPLPEAPQPKEEPPVPEAPS SGPPSSSLELLASLTPEAFSLDSSLRGKQRMSKHNFLQAHNGQGLRATRPSDDPLSLLDP LWTLNKT >gi568815595r:47313857_47542993|GENSCAN_predicted_CDS_4|4524_bp atgcgcaacagggacagcgcctgcgcaaaagactacgcgagcgggtggctgggttcactg caactcccggcaggccgctggcatttttcattccctcccgtgaccagtgacttccggcac gaaggggcggggctgggctcgtggctgagccagcagctgcagcagctacgggagtggccg ggtggccggcgggtgccagccgccatggaggccgtgccccgcatgcccatgatctggctg gacctgaaggaggccggtgactttcacttccagccagctgtgaagaagtttgtcctgaag aattatggagagaacccagaagcctacaatgaagaactgaagaagctggagttgctcaga cagaatgctgtccgtgtcccacgagactttgagggctgtagtgtcctccgcaagtacctc ggccagcttcattacctgcagagtcgggtccccatgggctcgggccaggaggccgctgtc cctgtcacctggacagagatcttctcaggcaagtctgtggcccatgaggacatcaagtac gagcaggcctgtattctctacaaccttggagcgctgcactccatgctgggggccatggac aagcgggtgtctgaggagggcatgaaggtctcctgtacccatttccagtgcgcagccggc gccttcgcctacctacgggagcacttccctcaagcctacagcgtcgacatgagccgccag atccttacgctcaacgtcaacctcatgctgggccaggctcaggagtgcctcctggagaag tcgatgttggacaacaggaagagctttctggtggcccgcatcagtgcacaggtggtagat tactacaaggaggcatgccgggccttggagaaccccgacactgcctcactgctgggccgg atccagaaggactggaagaaacttgtgcagatgaagatctactacttcgcagccgtggct catctgcacatgggaaagcaggccgaggagcagcagaagttcggggagcgggttgcatac ttccagagcgccctggacaagctcaatgaagccatcaagttggccaagggccagcctgac actgtgcaagacgcgcttcgcttcactatggatgtcattgggggaaagtacaattctgcc aagaaggacaacgacttcatttaccatgaggctgtcccagcattggacactcttcagcct gtaaaaggagcccccttggtgaagcccttgccagtgaaccccacagacccagctgttaca ggccctgacatctttgccaaactggtacccatggctgcccacgaggcctcgtcactgtac agtgaggagaaggccaagctgctccgggagatgatggccaagattgaggacaagaatgag gtcctggaccagttcatggattcaatgcagttggatcccgagacggtggacaaccttgat gcctacagccacatcccaccccagctcatggagaagtgcgcggctctcagcgtccggccc gacactgtcaggaaccttgtacagtccatgcaagtgctgtcaggtgtgttcacggatgtg gaggcttccctgaaggacatcagagatctgttggaggaggatgagctgctagagcagaag tttcaggaggcggtgggccaggcaggggccatctccatcacctccaaggctgagctggca gaggtgaggcgagaatgggccaagtacatggaagtccatgagaaggcctccttcaccaac agtgagctgcaccgtgccatgaacctgcacgtcggcaacctgcgcctgctcagcgggccg cttgaccaggtccgggctgccctgcccacaccggccctctccccagaggacaaggccgtg ctgcaaaacctaaagcgcatcctggctaaggtgcaggagatgcgggaccagcgcgtgtcc ctggagcagcagctgcgtgagcttatccagaaagatgacatcactgcctcgctggtcacc acagaccactcagagatgaaggtgggctggaagttgttcgaggagcagctgaaaaagtat gaccagctgaaggtgtacctggagcagaacctggccgcccaggaccgtgtcctctgtgca ctgacagaggccaacgtgcagtacgcagccgtgcggcgggtactcagcgacttggaccaa aagtggaactccacgctgcagaccctggtggcctcgtatgaagcctatgaggacctgatg aagaagtcgcaggagggcagggacttctacgcagatctggagagcaaggtggctgctctg ctggagcgcacgcagtccacctgccaggcccgcgaggctgcccgccagcagctcctggac agggagctgaagaagaagccgccgccacggcccacagccccaaagccgctgctgccccgc agggaggagagtgaggcagtggaagcaggagacccccctgaggagctgcgcagcctcccc cctgacatggtggctggcccacgactgcctgacaccttcctgggaagtgccaccccgctc cactttcctcccagccccttccccagctccacaggcccaggaccccactatctctcaggc cccttgccccctggtacctactcgggccccacccagctgatacagcccagggccccaggg ccccatgcaatgcccgtagcacctgggcctgccctctacccagcccctgcctacacaccg gagctgggccttgtgccccgatcctccccacagcatggcgtgcctccccatcccccactg gcatatggtcctgccccttctaccagacccatgggcccccaggcagcccctcttaccatt cgagggccctcgtctgctggccagtccacccctagtccccacctggtgccttcacctgcc ccatctccagggcctggtccggtaccccctcgccccccagcagcagaaccacccccttgc ctgcgccgaggcgccgcagctgcagacctgctctcctccagcccggagagccagcatggc ggcactcagtctcctgggggtgggcagcccctgctgcagcccaccaaggtggatgcagct gagggtcgtcggccgcaggccctgcggctgattgagcgggacccctatgagcatcctgag aggctgcggcagttgcagcaggagctggaggcctttcggggtcagctgggggatgtggga gctctggacactgtctggcgagagctgcaagatgcgcaggaacatgatgcccgaggccgt tccatcgccattgcccgctgctactcactgaagaaccggcaccaggatgtcatgccctat gacagtaaccgtgtggtgctgcgctcaggcaaggatgactacatcaatgccagctgcgtg gaggggctctccccatactgccccccgctagtggcaacccaggccccactgcctggcaca gctgctgacttctggctcatggtccatgagcagaaagtgtcagtcattgtcatgctggtt tctgaggctgagatggagaagcaaaaagtggcacgctacttccccaccgagaggggccag cccatggtgcacggtgccctgagcctggcattgagcagcgtccgcagcaccgaaacccat gtggagcgcgtgctgagcctgcagttccgagaccagagcctcaagcgctctcttgtgcac ctgcacttccccacttggcctgagttaggcctgcccgacagccccagcaacttgctgcgc ttcatccaggaggtgcacgcacattacctgcatcagcggccgctgcacacgcccatcatt gtgcactgcagctctggtgtgggccgcacgggagcctttgcactgctctatgcagctgtg caggaggtggaggctgggaacggaatccctgagctgcctcagctggtgcggcgcatgcgg cagcagagaaagcacatgctgcaggagaagctgcacctcaggttctgctatgaggcagtg gtgagacacgtggagcaggtcctgcagcgccatggtgtgcctcctccatgcaaacccttg gccagtgcaagcatcagccagaagaaccaccttcctcaggactcccaggacctggtcctc ggtggggatgtgcccatcagctccatccaggccaccattgccaagctcagcattcggcct cctggggggttggagtccccggttgccagcttgccaggccctgcagagcccccaggcctc ccgccagccagcctcccagagtctaccccaatcccatcttcctccccgccccccctttcc tccccactacctgaggctccccagcctaaggaggagccgccagtgcctgaagcccccagc tcggggcccccctcctcctccctggaattgctggcctccttgaccccagaggccttctcc ctggacagctccctgcggggcaaacagcggatgagcaagcataactttctgcaggcccat aacgggcaagggctgcgggccacccggccctctgacgaccccctcagccttctggatcca ctctggacactcaacaagacctga >gi568815595r:47313857_47542993|GENSCAN_predicted_peptide_5|1379_aa MTLTERLREKISRAFYNHGLLCASYPIPIILFTGFCILACCYPLLKLPLPGTGPVEFTTP VKDYSPPPVDSDRKQGEPTEQPEIEKEMGPGQSPGVTQLSGQLSHLHWLSLIKIQASDTE VGGKDCFSYEEQEIQPKSEVPGGINLLCLVSILALAYAVTWSTALYVGAPVAYVQQIFVK SSVFPWHKNLLAVDVFRSPLSRAFQLVEEIRNHVLRDSSGIRSLEELCLQVTDLLPGLRK LRNLLPEHGCLLLSPGNFWQNDWERFHADPDIIGTIHQHEPKTLQTSATLKDLLFGVPGK YSGVSLYTRKRMVSYTITLVFQHYHAKFLGSLRARLMLLHPSPNCSLRAESLVHVHFKEE IGVAELIPLVTTYIILFAYIYFSTRKIDMVKSKWGLALAAVVTVLSSLLMSVGLCTLFGL TPTLNGGEIFPYLVVVIGLENVLVLTKSVVSTPVDLEVKLRIAQGLSSESWSIMKNMATE LGIILIGYFTLVPAIQEFCLFAVVGLVSDFFLQMLFFTTVLSIDIRRMELADLNKRLPPE ACLPSAKPVGQPTRYERQLAVRPSTPHTITLQPSSFRNLRLPKRLRVVYFLARTRLAQRL IMTLVASEEISLWEKQLQGSQAGTVVWIGILVYTDPAGLRNYLAAQVTEQSPLGEGALAP MPVPSGMLPPSHPDPAFSIFPPDAPKLPENQTSPGESPERGGPAEVVHDSPVPEVTWGPE DEELWRKLSFRHWPTLFSYYNITLAKRYISLLPVIPVTLRLNPREALEGRHPQDGRSAWP PPGPIPAGHWEAGPKGPGGVQAHGDVTLYKVAALGLATGIVLVLLLLCLYRVLCPRNYGQ LGGGPGRRRRGELPCDDYGYAPPETEIVPLVLRGHLMDIECLASDGMLLVSCCLAGHVCV WDAQTGDCLTRIPRPGRQRRDSGVGSGLEAQESWERLSDGGKAGPEEPGDSPPLRHRPRG PPPPSLFGDQPDLTCLIDTNFSAQPRSSQPTQPEPRHRAVCGRSRDSPGYDFSCLVQRVY QEEGLAAVCTPALRPPSPGPVLSQAPEDEGGSPEKGSPSLAWAPSAEGSIWSLELQGNLI VVGRSSGRLEVWDAIEGVLCCSSEEVSSGITALVFLDKRIVAARLNGSLDFFSLETHTAL SPLQFRGTPGRGSSPASPVYSSSDTVACHLTHTVPCAHQKPITALKAAAGRLVTGSQDHT LRVFRLEDSCCLFTLQGHSGAITTVYIDQTMVLASGGQDGAICLWDVLTGSRVSHVFAHR GDVTSLTCTTSCVISSGLDDLISIWDRSTGIKFYSIQQDLGCGASLGVISDNLLVTGGQG CVSFWDLNYGDLLQTVYLGKNSEAQPARQILVLDNAAIVCNFGSELSLVYVPSVLEKLD >gi568815595r:47313857_47542993|GENSCAN_predicted_CDS_5|4140_bp atgaccctgactgaaaggctgcgtgagaagatatctcgggccttctacaaccatgggctc ctctgtgcatcctatcccatccccatcatcctcttcacagggttctgcatcttagcctgc tgctacccactgctgaaactccccttgccaggaacaggacctgtggaattcaccacccct gtgaaggattactcgcccccacctgtggactctgaccgcaaacaaggagagcctactgag cagcctgagattgagaaggagatgggcccagggcagtcacctggtgtcacacagctcagt ggtcagttatctcacctccattggctgtccttgataaagatacaggccagcgatactgaa gtgggtggtaaggactgttttagttatgaagaacaggaaattcaacccaaatcagaggtc ccaggaggcattaacttgctgtgcttggtcagcatcttggcccttgcctatgctgtaacc tggagtactgcattgtatgtgggtgccccggtggcttatgtccagcagatatttgtgaag tcctcagtgtttccctggcacaagaacctcctggcagtagatgtatttcgttcacctttg tcccgggcattccaactggtggaggagatccggaaccacgtgctgagagacagctctggg atcaggagcttggaggagttgtgtctgcaagtgaccgacctgctgccaggccttaggaag ctcaggaacctactccctgagcatggatgcctgctgctgtcccctgggaacttctggcag aatgactgggaacgcttccatgctgatcctgacatcattgggaccatccaccagcacgag cctaaaaccctgcagacttcagccacactcaaagacttgttatttggtgttcctgggaag tacagcggggtgagcctctacaccaggaagaggatggtctcctacaccatcaccctggtc ttccagcactaccatgccaagttcctgggcagcctgcgtgcccgcctgatgcttctgcac cccagccccaactgcagccttcgggcggagagcctggtccacgtgcacttcaaggaggag attggtgtcgctgagctcatcccccttgtgaccacctacatcatcttgtttgcctacatc tacttctccacgcggaagatcgacatggtcaagtccaagtgggggctggccctggctgcc gtggtcacagtgctcagctcgctgctcatgtctgtgggactctgcacactcttcggcctg acgcccaccctcaatggcggcgagattttcccctaccttgtggtggttattgggttagag aatgtgttggtgctcaccaagtctgtggtctcaaccccggtagacctggaggtgaagctg cggatcgcccaaggcctaagcagcgagagctggtccatcatgaagaacatggccacggag ctgggcatcatcctcatcggctacttcaccctagtgcccgccatccaggagttctgtctc tttgctgtcgtggggctggtgtctgacttcttccttcagatgctgtttttcaccactgtc ctgtccattgacattcgccggatggagctagcagacctgaacaagcgactgccccctgag gcctgcctgccctcagccaagccagtgggacagccaacgcgctacgagcggcagctggct gtgaggccgtccacaccccacaccatcacgttgcagccgtcttccttccgaaacctgcgg ctccccaagaggctgcgtgttgtctacttcctggcccgcacccgcctggcacagcgcctc atcatgaccctagtggcttctgaggagataagcctgtgggagaagcagctccagggttct caggctggcaccgttgtctggattggcatcctggtatacacagacccagcagggctgcgc aactacctcgctgcccaggtgacggaacagagcccattgggtgagggagccctggctccc atgcccgtgcctagtggcatgctgccccccagccacccggaccctgccttctccatcttc ccacctgatgcccctaagctacctgagaaccagacgtcgccaggcgagtcacctgagcgt ggaggtccagcagaggttgtccatgacagcccagtcccagaggtaacctgggggcctgag gatgaggaactttggaggaaattgtccttccgccactggccgacgctcttcagctattac aacatcacactggccaagaggtacatcagcctgctgcccgtcatcccagtcacgctccgc ctgaacccgagggaggctctggagggccggcaccctcaggacggccgcagtgcctggccc ccaccggggcccatacctgctgggcactgggaagcaggacccaagggcccaggtggggtg caggcccatggagacgtcacgctgtacaaggtggcggcgctgggcctggccaccggcatc gtcttggtgctgctgctgctctgcctctaccgcgtgctatgcccgcgcaactacgggcag ctgggtggtgggcccgggcggcggaggcgcggggagctgccctgcgacgactacggctat gcgccacccgagacggagatcgtgccgcttgtgctgcgcggccacctcatggacatcgag tgcctggccagcgacggcatgctgctggtgagctgctgcctggcaggccacgtctgcgtg tgggacgcgcagaccggggattgcctaacgcgcattccgcgcccaggcaggcagcgccgg gacagtggcgtgggcagcgggcttgaggctcaggagagctgggaacgactttcagatggt gggaaggctggtccagaggagcctggggacagccctcccctgagacaccgcccccggggc cctccgccgccttccctcttcggggaccagcctgacctcacctgcttaattgacaccaac ttttcagcgcagcctcggtcctcacagcccactcagcccgagccccggcaccgggcggtc tgtggccgctctcgggactccccaggctatgacttcagctgcctggtgcagcgggtgtac caggaggaggggctggcggccgtctgcacaccagccctgcgcccaccctcgcctgggccg gtgctgtcccaggcccctgaggacgagggtggctcccccgagaaaggctccccttccctc gcctgggcccccagtgccgagggttccatctggagcttggagctgcagggcaacctcatc gtggtggggcggagcagcggccggctggaggtgtgggacgccattgaaggggtgctgtgc tgcagcagcgaggaggtctcctcaggcattaccgctctggtgttcttggacaaaaggatt gtggctgcacggctcaacggttcccttgatttcttctccttggagacccacactgccctc agccccctgcagtttagagggaccccagggcggggcagttcccctgcctctccagtgtac agcagcagcgacacagtggcctgtcacctgacccacacagtgccctgtgcacaccaaaaa cccatcacagccctgaaagccgctgctgggcgcttggtgactgggagccaagaccacaca ctgagagtgttccgtctggaggactcgtgctgcctcttcacccttcagggccactcaggg gccatcacgaccgtgtacattgaccagaccatggtgctggccagtggaggacaagatggg gccatctgcctgtgggatgtactgactggcagccgggtcagccatgtgtttgctcaccgt ggggatgtcacctcccttacctgtaccacctcctgtgtcatcagcagtggcctggatgac ctcatcagcatctgggaccgcagcacaggcatcaagttctactccattcagcaggacctg ggctgtggtgcaagcttgggtgtcatctcagacaacctgctggtgactggcggccagggc tgtgtctccttttgggacctaaactacggggacctgttacagacagtctacctggggaag aacagtgaggcccagcctgcccgccagatcctggtgctggacaacgctgccattgtctgc aactttggcagtgagctcagcctggtgtatgtgccctctgtgctggagaagctggactga >gi568815595r:47313857_47542993|GENSCAN_predicted_peptide_6|49_aa MAASASGEASRELSLVVEGKAGAGVVYGRSRTKMGSGVAVERKFYELCQ >gi568815595r:47313857_47542993|GENSCAN_predicted_CDS_6|150_bp atggcagcatctgcttctggggaggcctcaagggagctttcactcgtggtggaaggcaaa gcaggagcaggtgtcgtatatggcaggagcaggaccaagatggggtctggggtggcggtg gaacgtaagttctatgagctctgccagtga >gi568815595r:47313857_47542993|GENSCAN_predicted_peptide_7|389_aa RTIHVRTVTHPRDPANLQKSGGEHDQQEHTAGPSDTERDKLVTSPSSQECGAYRRRLTAT CGEKEPGAGQRARPREEEYSVAWSLGGPLGGLGRHADLRGMSRQRPLAVAQRSPSFPTTW DSTDVYARAQRSHRGTRLIGCGWLAKVALRMRASLREPGCRGVSLTMARERGQLVFLEGL KSAVDVVFQAQKEPHPLQFLREANAGNLKPLFEFVREALKPVDSGEARWTYPVLLVDDLS VLLSLGMGAVAVLDFIHYCRATVCWELKVLMELPSLRPPVTSKNSDSKEQASILYLMQGN MVVLVHDSGDAEDEENDILLNGLSHQSHLILRAEGLATGFCRDVHGQLRILWRRPSQPAV HRDQSFTYQYKIQDKSVSFFAKGMSPAVL >gi568815595r:47313857_47542993|GENSCAN_predicted_CDS_7|1170_bp agaactatccatgtgagaactgtcacccaccccagagaccctgccaacctgcaaaaaagt ggaggagagcatgaccagcaggagcacactgcaggcccctctgacacggagcgggacaag ctggtaaccagcccaagctcccaggagtgtggggcatacagaaggcgcttgacagccacg tgtggagaaaaggagcccggggctgggcagagagcgaggccaagggaagaagaatactcg gtcgcgtggagccttgggggcccactgggagggctcgggcgccacgcagatctgcgcgga atgagccgccagaggccgctggcagtcgcacagagaagcccatcgtttcctaccacctgg gactcgacggacgtctacgcgcgtgcgcagcggtcccatcgaggcacccgcctcattggc tgtggctggctcgcaaaagtggcattgcgcatgcgcgcttccttgcgcgagccgggctgt cggggtgtcagcctgaccatggcgcgggagcgtgggcagcttgtgttccttgagggactc aagtctgcagtggacgtcgtcttccaggctcaaaaggagccacaccccctgcagtttctc agggaggctaatgctgggaacttgaaaccattgtttgagtttgtacgggaggccctgaag ccagtagacagtggagaggctcggtggacgtacccggtgctgttggtggacgacctcagt gtgctcctgagcctgggcatgggggcggtggctgtgctagacttcattcactactgcaga gccaccgtgtgctgggaactaaaggtactcatggaactgccttctctgcgcccacctgtg acctccaagaactcagacagtaaggagcaggcttccatcttgtatctgatgcagggaaac atggtggtccttgtgcacgacagtggagatgcggaggatgaggagaatgacatcctgctg aatggcctcagtcatcagagccatctgatactgcgggctgagggcctggccactggcttc tgcagggatgtgcacgggcagctgaggatcctgtggaggagaccatcgcagcccgcagtc caccgggatcagagcttcacttaccagtataagatacaggacaaaagcgtgtcctttttt gccaaaggaatgtctcctgctgttctgtga