GENSCAN 1.0 Date run: 3-Nov-116 Time: 12:22:13 Sequence gi568815581r:41655747_41871854 : 216108 bp : 51.47% C+G : Isochore 3 (51 - 57 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1118 1175 58 1 1 90 21 135 0.748 6.45 1.02 Intr + 15724 15807 84 2 0 111 47 42 0.101 2.69 1.03 Term + 26380 26453 74 2 2 110 44 77 0.610 3.86 1.04 PlyA + 28593 28598 6 1.05 2.00 Prom + 31089 31128 40 -1.11 2.01 Init + 31842 31844 3 2 0 113 22 0 0.719 -4.17 2.02 Intr + 33260 33323 64 1 1 107 111 33 0.516 6.28 2.03 Intr + 34032 34195 164 1 2 92 56 181 0.999 15.51 2.04 Intr + 34342 34443 102 0 0 106 105 111 0.999 15.47 2.05 Term + 35036 35080 45 1 0 126 42 59 0.956 2.59 2.06 PlyA + 35235 35240 6 1.05 3.00 Prom + 37592 37631 40 0.29 3.01 Init + 49335 49372 38 2 2 74 94 1 0.120 -1.07 3.02 Intr + 54184 54328 145 1 1 74 15 104 0.080 2.49 3.03 Intr + 55097 55196 100 1 1 80 61 42 0.051 0.88 3.04 Intr + 59686 59901 216 2 0 130 91 122 0.090 15.70 3.05 Term + 60032 60126 95 0 2 96 42 154 0.987 9.89 3.06 PlyA + 60203 60208 6 1.05 4.18 PlyA - 60442 60437 6 1.05 4.17 Term - 67791 67745 47 1 2 111 54 19 0.506 -1.54 4.16 Intr - 68068 67976 93 2 0 103 77 0 0.396 0.83 4.15 Intr - 68306 68216 91 2 1 41 64 78 0.620 0.77 4.14 Intr - 68651 68599 53 0 2 138 64 25 0.863 4.22 4.13 Intr - 69408 69020 389 2 2 85 113 467 0.942 43.90 4.12 Intr - 70151 70113 39 1 0 80 121 20 0.834 2.33 4.11 Intr - 71398 71307 92 1 2 105 92 38 0.980 5.19 4.10 Intr - 71710 71552 159 1 0 6 81 208 0.489 12.60 4.09 Intr - 72090 72016 75 0 0 69 105 87 0.997 8.71 4.08 Intr - 72585 72455 131 1 2 106 101 225 0.936 26.52 4.07 Intr - 75813 75747 67 0 1 137 40 88 0.957 7.87 4.06 Intr - 75997 75892 106 0 1 144 78 55 0.999 10.82 4.05 Intr - 76372 76191 182 1 2 88 68 327 0.995 29.88 4.04 Intr - 76648 76484 165 1 0 121 76 158 0.994 18.57 4.03 Intr - 77052 76973 80 1 2 87 85 28 0.902 2.17 4.02 Intr - 77713 77633 81 2 0 96 50 54 0.846 2.51 4.01 Init - 78888 78420 469 0 1 69 94 416 0.853 34.09 4.00 Prom - 78936 78897 40 -5.41 5.00 Prom + 79994 80033 40 -6.10 5.01 Init + 80338 80419 82 0 1 96 45 65 0.717 2.08 5.02 Intr + 82383 82561 179 1 2 44 74 116 0.926 5.96 5.03 Intr + 83134 83228 95 0 2 117 92 14 0.724 3.96 5.04 Term + 86358 86463 106 0 1 83 44 62 0.510 -0.52 5.05 PlyA + 89950 89955 6 1.05 6.19 PlyA - 91219 91214 6 1.05 6.18 Term - 100149 99998 152 1 2 78 48 294 0.950 22.78 6.17 Intr - 100468 100429 40 1 1 135 115 59 0.998 11.58 6.16 Intr - 101790 101669 122 1 2 89 68 300 0.999 28.82 6.15 Intr - 102038 101888 151 2 1 109 93 289 0.999 31.75 6.14 Intr - 102772 102653 120 1 0 97 109 199 0.955 24.09 6.13 Intr - 103124 102969 156 2 0 87 92 213 0.966 22.32 6.12 Intr - 107575 107237 339 1 0 142 79 614 0.585 62.12 6.11 Intr - 109070 108967 104 0 2 98 94 178 0.951 19.79 6.10 Intr - 109321 109177 145 1 1 95 105 202 0.959 22.97 6.09 Intr - 111834 111633 202 2 1 73 86 495 0.997 47.61 6.08 Intr - 113461 113223 239 1 2 118 80 404 0.997 39.54 6.07 Intr - 113931 113672 260 1 2 103 42 520 0.992 46.62 6.06 Intr - 116116 115901 216 2 0 106 77 432 0.558 42.90 6.05 Intr - 117180 117108 73 0 1 104 47 22 0.007 -0.83 6.04 Intr - 120324 120260 65 1 2 128 55 12 0.013 0.83 6.03 Intr - 130270 130157 114 2 0 50 59 87 0.164 2.92 6.02 Intr - 130946 130820 127 2 1 21 74 97 0.407 2.36 6.01 Init - 131473 131372 102 1 0 74 91 56 0.744 4.91 6.00 Prom - 133877 133838 40 -5.31 7.00 Prom + 135146 135185 40 -1.11 7.01 Init + 136504 136541 38 0 2 91 71 52 0.582 2.04 7.02 Intr + 138252 138487 236 0 2 115 47 79 0.551 4.36 7.03 Term + 138800 138807 8 0 2 94 54 0 0.471 -4.49 7.04 PlyA + 138944 138949 6 1.05 8.09 PlyA - 141372 141367 6 1.05 8.08 Term - 147233 147211 23 0 2 119 54 20 0.763 0.46 8.07 Intr - 147685 147541 145 1 1 60 65 261 0.600 21.37 8.06 Intr - 151133 151050 84 2 0 132 105 195 0.626 26.11 8.05 Intr - 152258 152113 146 0 2 63 64 342 0.992 29.81 8.04 Intr - 154088 153960 129 0 0 103 39 256 0.863 23.37 8.03 Intr - 155288 155117 172 2 1 110 70 340 0.576 34.43 8.02 Intr - 155538 155386 153 0 0 78 59 340 0.933 30.88 8.01 Init - 156169 155708 462 1 0 95 81 1149 0.986 108.64 8.00 Prom - 157049 157010 40 -7.30 9.00 Prom + 157059 157098 40 -14.12 9.01 Init + 157289 157533 245 1 2 85 86 414 0.997 36.10 9.02 Intr + 161312 161457 146 2 2 110 72 200 0.999 21.04 9.03 Intr + 162343 162532 190 2 1 86 94 363 0.981 35.86 9.04 Intr + 162636 162895 260 0 2 112 81 229 0.820 22.24 9.05 Intr + 163464 163653 190 1 1 75 71 285 0.987 24.66 9.06 Intr + 163784 163929 146 2 2 102 58 213 0.999 20.14 9.07 Intr + 164095 164277 183 2 0 67 110 1 0.582 0.48 9.08 Intr + 164523 164715 193 1 1 99 30 270 0.983 21.37 9.09 Intr + 165201 165343 143 0 2 112 78 207 0.994 22.61 9.10 Intr + 165908 166071 164 0 2 104 90 212 0.989 23.21 9.11 Term + 166477 166662 186 0 0 94 55 380 0.667 33.11 9.12 PlyA + 167450 167455 6 1.05 10.09 PlyA - 168242 168237 6 -0.45 10.08 Term - 169911 169777 135 0 0 103 49 322 0.999 28.13 10.07 Intr - 171880 171680 201 1 0 81 98 227 0.861 23.00 10.06 Intr - 173206 173044 163 0 1 94 50 181 0.999 15.39 10.05 Intr - 175144 175055 90 0 0 86 115 38 0.983 5.91 10.04 Intr - 176731 176646 86 1 2 48 101 125 0.987 8.92 10.03 Intr - 179370 179324 47 1 2 72 90 57 0.936 2.92 10.02 Intr - 179526 179457 70 0 1 95 98 2 0.872 1.15 10.01 Init - 180199 180113 87 1 0 52 82 182 0.777 12.72 10.00 Prom - 180665 180626 40 -7.00 11.00 Prom + 181410 181449 40 -8.68 11.01 Init + 182187 182380 194 2 2 94 43 248 0.698 19.32 11.02 Intr + 186077 186566 490 2 1 93 86 526 0.971 46.50 11.03 Intr + 189380 189997 618 1 0 117 98 570 0.991 54.03 11.04 Intr + 191515 191664 150 0 0 67 115 4 0.784 1.87 11.05 Term + 192187 192561 375 0 0 106 52 266 0.990 19.90 11.06 PlyA + 192578 192583 6 1.05 12.03 PlyA - 192791 192786 6 1.05 12.02 Term - 199575 197994 1582 2 1 99 39 653 0.014 51.90 12.01 Init - 209624 209080 545 2 2 105 96 998 0.195 94.73 12.00 Prom - 209710 209671 40 -3.51 13.06 PlyA - 211199 211194 6 1.05 13.05 Term - 212158 212064 95 2 2 87 42 73 0.984 0.89 13.04 Intr - 213039 212963 77 2 2 76 97 62 0.864 5.55 13.03 Intr - 213379 213297 83 1 2 97 97 34 0.999 4.13 13.02 Intr - 213841 213728 114 1 0 76 87 111 0.996 10.95 13.01 Init - 216047 215943 105 2 0 76 109 203 0.981 21.38 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 59691 59901 211 2 1 81 91 197 0.898 16.25 S.002 Sngl - 199481 197994 1488 2 0 55 39 671 0.839 54.97 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:41655747_41871854|GENSCAN_predicted_peptide_1|71_aa MKTFALALLVALLCTERPQGTWMKLETIILSKLSQGQKTKHRMFSLIVKVKDLMSLAARL LLDELLETALI >gi568815581r:41655747_41871854|GENSCAN_predicted_CDS_1|216_bp atgaagacctttgccctggccctgctggtggccctgctgtgcacagagagaccccaaggg acatggatgaagctggaaaccatcattctcagcaaactatcacaaggacaaaaaaccaaa caccgcatgttctcactcatagtgaaagtgaaggacttgatgagtcttgctgctcgatta ctgctggatgaactcttggagacagcccttatatga >gi568815581r:41655747_41871854|GENSCAN_predicted_peptide_2|125_aa MAVSTEEKESYRMSAIQNLHSFDPFADASKGDDLLPAGTEDYIHIRIQQRNGRKTLTTVQ GIADDYDKKKLVKAFKKKFACNGTVIEHPEYGEVIQLQGDQRKNICQFLVEIGLAKDDQL KVHGF >gi568815581r:41655747_41871854|GENSCAN_predicted_CDS_2|378_bp atggccgtttccaccgaggaaaaggaatcgtatcgtatgtccgctatccagaacctccac tctttcgacccctttgctgatgcaagtaagggtgatgacctgcttcctgctggcactgag gattatatccatataagaattcaacagagaaacggcaggaagacccttactactgtccaa gggatcgctgatgattacgataaaaagaaactagtgaaggcgtttaagaaaaagtttgcc tgcaatggtactgtaattgagcatccggaatatggagaagtaattcagctacagggtgac caacgcaagaacatatgccagttcctcgtagagattggactggctaaggacgatcagctg aaggttcatgggttttaa >gi568815581r:41655747_41871854|GENSCAN_predicted_peptide_3|197_aa MTSAVCQGSFGIRGENSSEKERNMLKAQQLEKDRTRKPFLFHQTRSPHEENVQSVTGELL RMRVVIHNGYSAVSCHGHRTACFLHHPLILSKAGYEMQRLCVYVLIFALALAAFSEASWK PRSQQPDAPLGTGANRDLELPWLEQQGPASHHRRQLGPQGPPHLVADPSKKQGPWLEEEE EAYGWMDFGRRSAEDEN >gi568815581r:41655747_41871854|GENSCAN_predicted_CDS_3|594_bp atgacctctgccgtctgccagggatcatttggaatcagaggagaaaacagctcagagaag gagagaaatatgctcaaagctcaacagctggaaaaagacaggaccaggaaacccttcctt ttccatcagacgaggtcccctcatgaggagaacgtgcagtccgtgactggggagctcctc aggatgagagtggttattcacaacggttattcagccgtcagttgccatggacaccgcaca gcctgcttccttcatcacccattaattctcagcaaggctggctacgagatgcagcgactg tgtgtgtatgtgctgatctttgcactggctctggccgccttctctgaagcttcttggaag ccccgctcccagcagccagatgcacccttaggtacaggggccaacagggacctggagcta ccctggctggagcagcagggcccagcctctcatcatcgaaggcagctgggaccccagggt cccccacacctcgtggcagacccgtccaagaagcagggaccatggctggaggaagaagaa gaagcctatggatggatggacttcggccgccgcagtgctgaggatgagaactaa >gi568815581r:41655747_41871854|GENSCAN_predicted_peptide_4|772_aa MRPKRLGRCCAGSRLGPGDPAALTCAPSPSASPAPEPSAQPQARGTGQRVGSRATSGSQF LSEARTGARPASEAGAKAGARRPSAFSAIQGDVRSMPDNSDAPWTRFVFQGPFGSRATGR GTGKAAGIWKTPAAYVGRRPGVSGPERAAFIRELEEGARDPGVRRAQFLSLEDCDPLTSG SSRALCPNLPPPVKKITQEDVKVMLYLLEELLPPVWESVTYGMVLQRERDLNTAARIGQS LVKQNSVLMEENSKLEALLGSAKEEILYLRHQVNLRDELLQLYSDSDEEDEDEEEEEEEK EAEEEQEEEEAEEDLQCAHPCDAPKLISQEALLHQHHCPQLEALQEKLRLLEEENHQLRE EASQLDTLEDEEQMLILECVEQFSEASQQMAELSEVLVLRLENYERQQQEVARLQAQVLK LQQRCRMYGAETEKLQKQLASEKEIQMQLQEEQSVWVGSQLQDLREKYMDCGGMLIEMQE EVKTLRQQPPVSTGSATHYPYSVPLETLPGFQETLAEELRTSLRRMISDPVYFMERNYEM PRGDTSSLRYDFRYSEDREQVRGFEAEEGLMLAADIMRGEDFTPAEEFVPQEELGAAKKV PAEEGVMEEAELVSEETEGWEEVELELDEATRMNVVTSALEASGLGPSHLDMNYVLQQLA NWQDAHYRRQLRWKMLQKGSPTLQQWQQQSKQTRGAEPQDMKDDLVTLMHSSHCTCISSH QPHSVAVLSWYPSQPVPILMALWSLPGGIALASAGVSCFAVPQQPAEAPQTA >gi568815581r:41655747_41871854|GENSCAN_predicted_CDS_4|2319_bp atgcgcccgaagaggttgggccggtgctgcgcggggagccggctcggacccggggaccca gcagcactcacctgtgcaccttcgccctcagccagtcccgctccggagccctctgcgcag ccgcaggcacggggcactggacagagagtaggatcccgagccacctctggatcccagttc ctctcggaagcccgcaccggagctcgcccggcctcggaggctggagccaaggcaggagcc cggcgcccgtccgcattctcggccatccaaggggatgtccggtctatgcccgacaattcg gacgcgccgtggacccgcttcgtattccaagggccgtttggttcccgggccactggccgg gggactggaaaggcagcgggcatctggaagacgccagccgcctacgttggccggcgaccc ggggtgtccggccctgagcgcgccgcctttattcgggagctggaggaaggcgccagggac ccaggtgtccggagagcccagtttctgagcctggaggactgtgacccccttacctcaggc tcctccagggcactgtgtcctaacctacctccgccagtcaaaaagatcacccaggaagac gtcaaagtgatgttatatttgctggaggagcttctcccacctgtctgggagagcgttacc tatgggatggtcctgcagagagagagggacctgaacactgcagctcgcatcggccagtcc ctggtgaaacagaacagtgttttgatggaggagaacagcaagctggaagccctgctgggc tcagccaaggaggagattttatacctcagacaccaggtgaacttgcgggatgagctcctc cagctctactcagattctgatgaggaggatgaggatgaagaagaggaggaggaagaaaag gaggcagaagaggaacaggaagaagaagaagcagaggaagacctgcagtgtgctcatccc tgtgatgcccctaagctgatttcgcaggaggcattgctgcaccagcaccactgcccacag ctggaagccttgcaggagaagctgaggctgctggaggaggagaatcatcagctgagagaa gaggcctctcaactcgacactcttgaggatgaggaacagatgctcattctggagtgtgtg gagcagttttcggaggccagccaacagatggctgagctgtcggaggtgctggtgctcagg ctggaaaactatgaacggcagcagcaggaggtcgctcggctgcaggcccaggtgctgaag ctgcagcagcgctgccggatgtatggggctgagactgaaaagttgcagaagcagctggct tcggagaaggaaatccagatgcagctccaggaagagcagagcgtgtgggtgggttcccag ctgcaggacctgcgggagaagtacatggattgtgggggcatgctgattgagatgcaggag gaggtgaagaccctccgccagcaacccccagtgtccactgggtctgccacccattaccca tacagcgtgcctctggagactcttcctggtttccaggagacgctggctgaggagctcaga acgtctctaaggaggatgatctcagaccctgtgtattttatggagaggaattatgagatg cccagaggggacacatccagcctaaggtatgattttcgctacagtgaggatcgagagcag gtgcgggggtttgaggctgaggaagggttgatgctggcagcggatatcatgcggggggaa gatttcacgcctgcggaggagttcgtgccccaggaggagctgggggctgccaagaaggtg ccggctgaggaaggggtgatggaagaggcagagctggtgtcagaggagaccgagggctgg gaggaggtggaactggagctggatgaggcaacgcggatgaacgtggtgacatcagccctg gaggccagcggcttgggcccttcacacctggacatgaattatgtcctccagcagctggcc aactggcaagatgcccattacaggcggcagctgaggtggaagatgctccagaaaggctct ccaactctgcagcagtggcagcagcagtcaaaacaaacacggggggcggagccgcaggac atgaaagatgaccttgtgacactcatgcacagcagccattgcacgtgcatcagctcccat cagcctcattccgtggcagtgctgtcctggtacccgtcccagcctgtccccattctcatg gccctttggtcactgccgggtggaatagcactggcttcagcaggcgtgtcatgctttgct gttccccagcagcctgcagaggccccacagacagcttga >gi568815581r:41655747_41871854|GENSCAN_predicted_peptide_5|153_aa MASARAVLRGQAQWLTPVILALWEAKAGQRTGSQQLWLRPPEGSVSPYTSPRGFLTYRLS RPTESGVGTETPFYKWRNQGSGPVILEAFRGPSPISDPGLPLQNQAVAAIALTPTPEPRK PRASFTFSNHITPIGLQEGLGLPRRWSLTLSRV >gi568815581r:41655747_41871854|GENSCAN_predicted_CDS_5|462_bp atggcctcggcaagagcggtcttgcggggccaggcacagtggctcacacctgtaatccta gcactctgggaagccaaggcaggtcagcgcactggcagccagcaactctggctacggcca ccagagggcagtgtcagcccgtatacctccccacgtggcttcctcacctacaggctcagt cgccccacagaaagtggagtagggacagagactccattttacaaatggagaaaccagggc tcagggcccgtcattctggaagcctttcgggggccctcccccatctctgatccaggcctt cccctccagaaccaggcagtggccgccatagccctcacccccaccccagaacccaggaag cccagagcatccttcaccttctccaaccacatcacgcccattgggctacaggagggcctt gggctgcctcgcaggtggtcattgacactctcccgtgtgtag >gi568815581r:41655747_41871854|GENSCAN_predicted_peptide_6|908_aa MNKPKEASCLTLSWLAGEGVCPFPWYPYDLPVGQSPEQPPPDHAELSSLSAPAPTPARPR PGPVRPHTQVRAIGGAGERTESRGPALQLPEPRRYYLRRLPPRRWEPGSGFFSGGVLALG ALQPSLDVPIADCPWQPLTSAGQFPGQWSQTQTKREVHLPVATMEVMNLMEQPIKVTEWQ QTYTYDSGIHSGANTCVPSVSSKGIMEEDEACGRQYTLKKTTTYTQGVPPSQGDLEYQMS TTARAKRVREAMCPGVSGEDSSLLLATQVEGQATNLQRLAEPSQLLKSAIVHLINYQDDA ELATRALPELTKLLNDEDPVVVTKAAMIVNQLSKKEASRRALMGSPQLVAAVVRTMQNTS DLDTARCTTSILHNLSHHREGLLAIFKSGGIPALVRMLSSPVESVLFYAITTLHNLLLYQ EGAKMAVRLADGLQKMVPLLNKNNPKFLAITTDCLQLLAYGNQESKLIILANGGPQALVQ IMRNYSYEKLLWTTSRVLKVLSVCPSNKPAIVEAGGMQALGKHLTSNSPRLVQNCLWTLR NLSDVATKQEGLESVLKILVNQLSVDDVNVLTCATGTLSNLTCNNSKNKTLVTQNSGVEA LIHAILRAGDKDDITEPAVCALRHLTSRHPEAEMAQNSVRLNYGIPAIVKLLNQPNQWPL VKATIGLIRNLALCPANHAPLQEAAVIPRLVQLLVKAHQDAQRHVAAGTQQPYTDGVRME EIVEGCTGALHILARDPMNRMEIFRLNTIPLFVQLLYSSVENIQRVAAGVLCELAQDKEA ADAIDAEGASAPLMELLHSRNEGTATYAAAVLFRISEDKNPDYRKRVSVELTNSLFKHDP AAWEAAQSMIPINEPYGDDMDATYRPMYSSDVPLDPLEMHMDMDGDYPIDTYSDGLRPPY PTADHMLA >gi568815581r:41655747_41871854|GENSCAN_predicted_CDS_6|2727_bp atgaataagcccaaagaagcatcatgcctgacgctgagttggctggccggtgaaggcgtc tgtcccttcccttggtatccctatgacttacctgttggacagagtccggagcagccgccg cccgaccacgccgagctcagttcgctgtccgcgccggctcccaccccggcccgaccccga cccggcccggtcaggccccatactcaggtgcgggctatcgggggcgcaggtgagcgcacg gagtccagaggccctgccctccagctcccagagcctcgtcgatactacctgcggcgcctc cctccacgccggtgggagcccggctctggcttcttcagcggaggtgtcttggcacttggt gccctgcaacccagcctagatgtgcccattgctgactgtccttggcagcctctgacctct gcaggtcagttcccaggacagtggagccaaacacagaccaagcgggaagttcacctccct gtagccacgatggaggtgatgaacctgatggagcagcctatcaaggtgactgagtggcag cagacatacacctacgactcgggtatccactcgggcgccaacacctgcgtgccctccgtc agcagcaagggcatcatggaggaggatgaggcctgcgggcgccagtacacgctcaagaaa accaccacttacacccagggggtgccccccagccaaggtgatctggagtaccagatgtcc acaacagccagggccaaacgggtgcgggaggccatgtgccctggtgtgtcaggcgaggac agctcgcttctgctggccacccaggtggaggggcaggccaccaacctgcagcgactggcc gagccgtcccagctgctcaagtcggccattgtgcatctcatcaactaccaggacgatgcc gagctggccactcgcgccctgcccgagctcaccaaactgctcaacgacgaggacccggtg gtggtgaccaaggcggccatgattgtgaaccagctgtcgaagaaggaggcgtcgcggcgg gccctgatgggctcgccccagctggtggccgctgtcgtgcgtaccatgcagaataccagc gacctggacacagcccgctgcaccaccagcatcctgcacaacctctcccaccaccgggag gggctgctcgccatcttcaagtcgggtggcatccctgctctggtccgcatgctcagctcc cctgtggagtcggtcctgttctatgccatcaccacgctgcacaacctgctcctgtaccag gagggcgccaagatggccgtgcgcctggccgacgggctgcaaaagatggtgcccctgctc aacaagaacaaccccaagttcctggccatcaccaccgactgcctgcagctcctggcctac ggcaaccaggagagcaagctgatcatcctggccaatggtgggccccaggccctcgtgcag atcatgcgtaactacagttatgaaaagctgctctggaccaccagtcgtgtgctcaaggtg ctatccgtgtgtcccagcaataagcctgccattgtggaggctggtgggatgcaggccctg ggcaagcacctgaccagcaacagcccccgcctggtgcagaactgcctgtggaccctgcgc aacctctcagatgtggccaccaagcaggagggcctggagagtgtgctgaagattctggtg aatcagctgagtgtggatgacgtcaacgtcctcacctgtgccacgggcacactctccaac ctgacatgcaacaacagcaagaacaagacgctggtgacacagaacagcggtgtggaggct ctcatccatgccatcctgcgtgctggtgacaaggacgacatcacggagcctgccgtctgc gctctgcgccacctcactagccgccaccctgaggccgagatggcccagaactctgtgcgt ctcaactatggcatcccagccatcgtgaagctgctcaaccagcccaaccagtggccactg gtcaaggcaaccatcggcttgatcaggaatctggccctgtgcccagccaaccatgccccg ctgcaggaggcagcggtcatcccccgcctcgtccaactgctggtgaaggcccaccaggat gcccagcgccacgtagctgcaggcacacagcagccctacacggatggtgtgaggatggag gagattgtggagggctgcaccggagcactgcacatcctcgcccgggaccccatgaaccgc atggagatcttccggctcaacaccattcccctgtttgtgcagctcctgtactcgtcggtg gagaacatccagcgcgtggctgccggggtgctgtgtgagctggcccaggacaaggaggcg gccgacgccattgatgcagagggggcctcggccccactcatggagttgctgcactcccgc aacgagggcactgccacctacgctgctgccgtcctgttccgcatctccgaggacaagaac ccagactaccggaagcgcgtgtccgtggagctcaccaactccctcttcaagcatgacccg gctgcctgggaggctgcccagagcatgattcccatcaatgagccctatggagatgacatg gatgccacctaccgccccatgtactccagcgatgtgccccttgacccgctggagatgcac atggacatggatggagactaccccatcgacacctacagcgacggcctcaggcccccgtac cccactgcagaccacatgctggcctag >gi568815581r:41655747_41871854|GENSCAN_predicted_peptide_7|93_aa MDFQVIAGVQLSRSLSAGLHLAENWDRTQRPLALYFHPIFPSPIPRPPGNLRVEILEYFI STSASSGSSSQLTSPGAFMAPTENLEVARGRGL >gi568815581r:41655747_41871854|GENSCAN_predicted_CDS_7|282_bp atggactttcaggtcatcgcgggggtgcaactgtcaaggtctttaagcgctggactccac ctggctgagaactgggacaggactcagcggcccctggccttgtactttcatcccatcttt cccagtcccatcccacggcccccagggaatttgagagttgagatccttgaatatttcatc agcacttcagcctccagtggaagcagcagccaacttacatctcctggagcatttatggcc ccaacagagaacctggaagtagccagggggagggggctgtga >gi568815581r:41655747_41871854|GENSCAN_predicted_peptide_8|437_aa MARVAWGLLWLLLGSAGAQYEKYSFRGFPPEDLMPLAAAYGHALEQYEGESWRESARYLE AALRLHRLLRDSEAFCHANCSGPAPAAKPDPDGGRADEWACELRLFGRVLERAACLRRCK RTLPAFQVPYPPRQLLRDFQSRLPYQYLHYALFKANRLEKAVAAAYTFLQRNPKHELTAK YLNYYQGMLDVADESLTDLEAQPYEAVFLRAVKLYNSGDFRSSTEDMERALSEYLAVFAR CLAGCEGAHEQVDFKDFYPAIADLFAESLQCKVDCEANLTPNVGGYFVDKFVATMYHYLQ FAYYKLNDVRQAARSAASYMLFDPKDSVMQQNLVYYRFHRARWGLEEEDFQPREEAMLYH NQTAELRELLEFTHMYLQSDDEMELEETEPPLEPEDALSDAEFEGEGDYEEGMYADWWQE PDAKGDEAEAEPEPELA >gi568815581r:41655747_41871854|GENSCAN_predicted_CDS_8|1314_bp atggctcgggtggcgtgggggctgctgtggttgctgctgggcagcgccggggcgcagtac gagaagtacagcttccggggcttcccgcccgaggacctgatgccgctggccgcggcgtac gggcacgctctggagcagtacgagggagagagctggcgcgagagcgcgcgctacctggag gcggcgctgcggctgcaccggctcctgcgcgacagcgaggccttctgccacgccaactgc agcggccccgcgcccgcggccaagcccgatcccgacggcggccgcgcagacgagtgggcc tgcgagctgcggctcttcggccgcgtcctggagcgagccgcctgcctgcggcgctgcaag cggacgctgcccgccttccaggtgccctacccgccgcggcagctgctgcgtgacttccag agccgcctgccctaccagtacctgcactacgcgctgttcaaggctaaccggctggagaag gcggtggcggcggcctacaccttcctccagaggaacccgaagcacgagctgaccgccaag tatctcaactactatcaggggatgctggacgtcgccgacgagtccctcacggacctagag gcccagccctacgaggccgtgttcctccgggctgtgaagctctacaacagcggggatttc cgcagcagcacggaggacatggagcgggccttgtcagagtacctggcagtctttgcccgg tgcctggccggctgtgaaggggcccatgagcaggtggacttcaaggacttctacccggcc atagcagatctctttgcagagtccctgcagtgcaaggtggactgtgaggccaatttgacc cccaatgtgggtggctacttcgtggacaagttcgtggccaccatgtaccactacctgcag tttgcctactataagttgaatgatgtgcgccaggctgcccgcagcgccgccagctacatg ctcttcgaccccaaggacagcgtcatgcagcagaacctggtgtattaccggttccaccgg gctcgctggggcctggaagaggaggacttccagccccgggaggaggccatgctctaccac aaccagaccgccgagctgcgggagctgctggagttcacccacatgtacctgcagtcagat gatgagatggagctggaggagacagaaccgcccctggagcctgaggatgccctatctgac gccgagtttgagggggagggtgactacgaggagggcatgtatgctgactggtggcaggag ccggatgccaagggtgacgaggccgaggctgagccagagcctgaactcgcatga >gi568815581r:41655747_41871854|GENSCAN_predicted_peptide_9|681_aa MFPAGPPSHSLLRLPLLQLLLLVVQAVGRGLGRASPAGGPLEDVVIERYHIPRACPREVQ MGDFVRYHYNGTFEDGKKFDSSYDRNTLVAIVVGVGRLITGMDRGLMGMCVNERRRLIVP PHLGYGSIGLAGLIPPDATLYFDVVLLDVWNKEDTVQVSTLLRPPHCPRMVQDGDFVRYH YNGTLLDGTSFDTSYSKGGTYDTYVGSGWLIKGMDQGLLGMCPGERRKIIIPPFLAYGEK GYGEGGQGHKGKFRRRGKNQASTYSCSGCILHEGIQPRTQGTVIPPQASLVFHVLLIDVH NPKDAVQLETLELPPGCVRRAGAGDFMRYHYNGSLMDGTLFDSSYSRNHTYNTYIGQGYI IPGMDQGLQGACMGERRRITIPPHLAYGENGTDSIGFLQGSAPLRPFRSGEGQPSLGREG GYGKTEPAYPQDPAVLGASVSSPVKWASHADPQGDKIPGSAVLIFNVHVIDFHNPADVVE IRTLSRPSETCNETTKLGDFVRYHYNCSLLDGTQLFTSHDYGAPQEATLGANKVIEGLDT GLQGMCVGERRQLIVPPHLAHGESGARGVPGSAVLLFEVELVSREDGLPTGYLFVWHKDP PANLFEDMDLNKDGEVPPEEFSTFIKAQVSEGKGRLMPGQDPEKTIGDMFQNQDRNQDGK ITVDELKLKSDEDEERVHEEL >gi568815581r:41655747_41871854|GENSCAN_predicted_CDS_9|2046_bp atgttccccgcgggcccccccagccacagcctcctccggctccccctgctgcagttgctg ctactggtggtgcaggccgtggggagggggctgggccgcgccagcccggccgggggcccc ctggaagatgtggtcatcgagaggtaccacatccccagggcctgtccccgggaagtgcag atgggggattttgtgcgctaccactacaacggcacttttgaagatggcaagaagtttgat tcaagctatgatcgcaacaccttggtggccatcgtggtgggtgtggggcgcctcatcact ggcatggaccgaggcctcatgggcatgtgtgtcaacgagcggcgacgcctcattgtgcct ccccacctgggctatgggagcatcggcctggcggggctcattccaccggatgccaccctc tacttcgatgtggttctgctggatgtgtggaacaaggaagacaccgtgcaggtgagcaca ttgctgcgcccgccccactgcccccgcatggtccaggacggcgactttgtccgctaccac tacaatggcaccctgctggacggcacctccttcgacaccagctacagtaagggcggcact tatgacacctacgtcggctctggttggctgatcaagggcatggaccaggggctgctgggc atgtgtcctggagagagaaggaagattatcatccctccattcctggcctatggcgagaaa ggctatggtgagggtgggcaaggacacaaggggaaattccgcagaagagggaaaaaccag gcctccacatacagttgctcaggttgtatactgcacgagggcatccaaccaaggactcaa gggacagtgatccccccacaggcctcgctggtctttcacgtcctcctgattgacgtgcac aacccgaaggacgctgtccagctagagacgctggagctcccccccggctgtgtccgcaga gccggggccggggacttcatgcgctaccactacaatggctccttgatggacggcaccctc ttcgattccagctactcccgcaaccacacctacaatacctatatcgggcagggttacatc atccccgggatggaccaggggctgcagggtgcctgcatgggggaacgccggagaattacc atccccccgcacctcgcctatggggagaatggaactgactccatcggtttcctccagggc agcgccccacttcgccccttccgcagtggagaagggcagccaagtttggggagggagggt ggttatggaaaaacagaaccagcatacccccaggacccagctgtgctgggagcctcagtg tcctcacctgtcaagtgggcaagccatgctgatccgcagggagacaagatccctggctct gccgtgctaatcttcaacgtccatgtcattgacttccacaaccctgcggatgtggtggaa atcaggacactgtcccggccatctgagacctgcaatgagaccaccaagcttggggacttt gttcgataccattacaactgttctttgctggacggcacccagctgttcacctcgcatgac tacggggccccccaggaggcgactctcggggccaacaaggtgatcgaaggcctggacacg ggcctgcagggcatgtgtgtgggagagaggcggcagctcatcgtgcccccgcacctggcc cacggggagagtggagcccggggagtcccaggcagtgctgtgctgctgtttgaggtggag ctggtgtcccgggaggatgggctgcccacaggctacctgtttgtgtggcacaaggaccct cctgccaacctgtttgaagacatggacctcaacaaggatggcgaggtccctccggaggag ttctccaccttcatcaaggctcaagtgagtgagggcaaaggacgcctcatgcctgggcag gaccctgagaaaaccataggagacatgttccagaaccaggaccgcaaccaggacggcaag atcacagtcgacgagctcaagctgaagtcagatgaggacgaggagcgggtccacgaggag ctctga >gi568815581r:41655747_41871854|GENSCAN_predicted_peptide_10|292_aa MKATVLMRQPGRVQEIVGALRKGGGDRLQVISDFDMTLSRFAYNGKRCPSSYNILDNSKI ISEECRKELTALLHHYYPIEIDPHRTVKEKLPHMVEWWTKAHNLLCQQKIQKFQIAQVVR ESNAMLREGYKTFFNTLYHNNIPLFIFSAGIGDILEEIIRQMKVFHPNIHIVSNYMDFNE DGFLQGFKGQLIHTYNKNSSACENSGYFQQLEGKTNVILLGDSIGDLTMADGVPGVQNIL KIGFLNDKVEERRERYMDSYDIVLEKDETLDVVNGLLQHILCQGVQLEMQGP >gi568815581r:41655747_41871854|GENSCAN_predicted_CDS_10|879_bp atgaaggccacggtcctgatgcggcagcctgggcgggtgcaggagatcgtgggcgccctc cgcaagggcggcggagaccggttacaggtgatttctgattttgacatgaccttgagcagg tttgcatataatggaaagcgatgcccttcttcttacaatattctggataatagcaagatc atcagtgaggagtgtcggaaagagctcacagcgctccttcaccactattacccaattgag atcgacccacaccggaccgtcaaggagaagctacctcatatggtggaatggtggaccaaa gcacacaatctcctatgtcagcagaagattcagaagtttcagatagcccaggtggttaga gagtccaatgcaatgctcagggagggatataagaccttcttcaacacactctaccataac aacattccccttttcatcttttctgcgggcattggtgatatcctggaagaaattatccga cagatgaaagtgttccaccccaacatccacatcgtgtctaactacatggattttaatgaa gatggttttctccagggatttaagggccagctcatacacacatacaacaagaacagctct gcgtgtgagaactctggttacttccagcaacttgagggcaaaaccaatgtcatcctgctg ggagactctatcggggacctcaccatggccgatggggttcctggtgtgcagaacattctc aaaattggcttcctgaatgacaaggtggaggagcggcgggagcgctacatggactcctat gacatcgtgctggagaaggacgagactctggatgtggtcaacgggctactgcagcacatc ctgtgccagggggtccagctggagatgcaaggcccctga >gi568815581r:41655747_41871854|GENSCAN_predicted_peptide_11|608_aa MEMESAAASTRFHQPHMERKMSAMACEIFNELRLEGKLCDVVIKVNGFEFSAHKNILCSC SSYFRALFTSGWNNTEKKVYNIPGISPDMMKLIIEYAYTRTVPITPDNVEKLLAAADQFN IMGIVRGCCEFLKSELCLDNCIGICKFTDYYYCPELRQKAYMFILHNFEEMVKVSAEFLE LSVTELKDIIEKDELNVKQEDAVFEAILKWISHDPQNRKQHISILLPKVRLALMHAEYFM NNVKMNDYVKDSEECKPVIINALKAMYDLNMNGPSNSDFTNPLTRPRLPYAILFAIGGWS GGSPTNAIEAYDARADRWVNVTCEEESPRAYHGAAYLKGYVYIIGGFDSVDYFNSVKRFD PVKKTWHQVAPMHSRRCYVSVTVLGNFIYAMGGFDGYVRLNTAERYEPETNQWTLIAPMH EQRSDASATTLYGKVYICGGFNGNECLFTAEVYNTESNQWTVIAPMRSRRSGIGVIAYGE HVYAVGGFDGANRLRSAEAYSPVANTWRTIPTMFNPRSNFGIEVVDDLLFVVGGFNGFTT TFNVECYDEKTDEWYDAHDMSIYRSALSCCVVPGLANVEEYAARRDNFPGLALRDEVKYS ASTSTLPV >gi568815581r:41655747_41871854|GENSCAN_predicted_CDS_11|1827_bp atggagatggagagcgcggcggcctccacacgtttccaccagcctcacatggagaggaag atgagtgcgatggcctgtgagatcttcaacgagcttagactagagggcaagctctgcgac gtggtcatcaaggtcaatggctttgagttcagtgcccataagaacatcctctgtagctgc agttcctactttagagctttgtttacaagtggctggaacaacactgaaaagaaggtatac aacatccctggcatttctcccgacatgatgaagctaatcattgagtatgcatacacccgg accgtgcctatcacaccggacaatgtggagaaactgcttgctgctgcagaccagtttaac atcatgggtatcgtcaggggttgctgcgagttcctcaagtcagagctgtgcttggataat tgtatcggcatctgtaagttcacggactactactactgtcctgagctgaggcagaaggcc tacatgttcatactgcacaactttgaggagatggtgaaagtctcggcagaatttttagag ctctcggtcactgaacttaaggatatcattgagaaagatgagctcaatgtcaaacaggaa gatgctgtatttgaggccattttaaagtggatttctcatgacccccaaaatagaaagcag cacatttcaattttgcttcctaaggttcgcctggccctaatgcatgctgagtacttcatg aacaatgttaagatgaatgactatgtcaaagacagtgaggaatgcaaaccagtcatcatt aatgccctaaaggccatgtatgacctcaacatgaatggaccctctaattctgatttcacc aacccactcaccagaccacgcttgccctatgccatcctctttgcaattggtggctggagt ggtgggagccccaccaatgccattgaggcatatgacgctcgggcagacagatgggtgaat gttacttgtgaggaagagagtccccgtgcctaccatggggcagcctatttgaaaggctat gtgtatatcattggggggtttgatagtgtagactatttcaatagtgttaagcgttttgac ccagtcaagaaaacttggcatcaggtggccccgatgcactccagacgttgctatgtcagt gtgacagtcctcggcaattttatttatgccatgggaggatttgatggctacgtgcgtcta aacactgctgaacgttatgagccagagaccaatcaatggacactcatcgcccccatgcac gaacagaggagtgatgcaagcgccacaacactttatgggaaggtctacatatgtggtggg tttaatggaaacgagtgcctgttcacagcagaagtgtataacactgaaagtaatcagtgg acagtcatagcacccatgagaagcaggaggagtggaataggcgtgattgcttatggagaa catgtatatgcggtaggtggctttgatggagctaatcgacttaggagtgccgaagcctac agccctgtggctaacacttggcgcacaatccccactatgtttaatcctcgtagcaatttt ggcatcgaggtggtggatgacctcttgtttgtggtgggtggctttaatggttttaccacc acctttaatgttgagtgctatgatgaaaagaccgatgagtggtatgatgctcatgacatg agtatataccgcagtgctctgagctgctgtgtagtaccagggctggccaatgttgaggaa tatgcagctagacgggacaacttcccaggattagcactgcgagatgaagtaaaatattct gcttcgacaagtaccctacctgtatga >gi568815581r:41655747_41871854|GENSCAN_predicted_peptide_12|708_aa MAAAAVAAAAAAAAAASLQVLEMESMETAAAGSAGLAAEVRGSGTVDFGPGPGISAMEAS GGDPGPEAEDFECSSHCSELSWRQNEQRRQGLFCDITLCFGGAGGREFRAHRSVLAAATE YFTPLLSGQFSESRSGRVEMRKWSSEPGPEPDTVEAVIEYMYTGRIRVSTGSVHEVLELA DRFLLIRLKEFCGEFLKKKLHLSNCVAIHSLAHMYTLSQLALKAADMIRRNFHKVIQDEE FYTLPFHLIRDWLSDLEITVDSEEVLFETVLKWVQRNAEERERYFEELFKLLRLSQMKPT YLTRHVKPERLVANNEVCVKLVADAVERHALRAENIQSGTCQHPTSHVSLLPRYGQNMDV IMVIGGVSEGGDYLSECVGYFVDEDRWVNLPHIHNHLDGHAVAVTESYVYVAGSMEPGFA KTVERYNPNLNTWEHVCSLMTRKHSFGLTEVKGKLYSIGGHGNFSPGFKDVTVYNPELDK WHNLESAPKILRDVKALAIEDRFVYIAARTPVDRDTEDGLKAVITCYDTETRQWQDVESL PLIDNYCFFQMSVVNSNFYQTASCCPKSYCLENEEAVRKIASQVSDEILESLPPEVLSIE GAAICYYKDDVFIIGGWKNSDDIDKQYRKEAYRYCAERKRWMLLPPMPQPRCRATACHVR IPYRYLHGTQRYPMPQNLMWQKDRIRQMQEIHRHALNMRRVPSSQIEC >gi568815581r:41655747_41871854|GENSCAN_predicted_CDS_12|2127_bp atggcggctgcggcagtggcggcggcggcggcggcggccgcggctgcatctcttcaggta ctggagatggagagcatggagacggccgccgccggctcggcaggactggccgccgaggtc cgaggcagcggcacggtggacttcgggcctgggccggggatctctgcaatggaggcgagc gggggcgatccgggcccagaagccgaggatttcgagtgcagctctcactgctcagagctg tcctggcggcagaacgagcagcggcgccagggcctcttctgcgacattaccctgtgcttc ggcggggctggaggccgcgagttccgggcccaccgctcggtactggctgccgccaccgag tacttcacgcccctgctctcgggccagttttccgagtcccgctcgggacgggtggagatg cgcaagtggagctccgagccggggcccgaacccgacacagtggaagccgtaatcgagtac atgtacaccgggcgcatccgcgtcagcacgggcagcgtgcacgaggtgctggagttggcc gacaggttcctactcattcgtttaaaagaattttgtggagaatttctcaagaaaaaactt catctctcaaattgtgtggcaattcatagcttagcacacatgtacaccctgagccaactt gctctgaaggctgctgatatgatacggagaaatttccacaaagtgattcaggatgaagaa ttttatacgttacctttccatctcattagagactggctttcagatttggaaattacagtt gattctgaagaggttctctttgaaaccgttttgaaatgggttcagagaaatgctgaagag agagagagatactttgaagaactttttaaattgctcaggttgtcccagatgaaacctacc taccttactcgacatgtcaaaccagagaggctggtagccaataatgaagtttgtgtcaag ttggtcgctgacgcagtggagagacatgctctgagagctgagaatatacaatctggcaca tgccagcaccccacttctcatgtgtcactattgcctcgttatgggcaaaacatggatgtg atcatggttattggaggtgtgtcagaaggaggggactatttaagtgaatgtgtgggatac tttgttgatgaggacagatgggtaaatctgccacatattcataatcacctcgatggacat gctgttgcagtaacagaatcctacgtgtatgttgctggatcaatggagccagggtttgct aaaactgtagaaaggtataacccaaatttgaatacatgggaacatgtttgtagtctgatg acaagaaagcattcttttggactaacagaagtcaaagggaagctctatagcattggagga catggcaactttagtcctggttttaaagatgtgactgtttataatcctgagcttgataaa tggcacaacttggaatcggcaccaaagattcttcgagatgtcaaagcactagccattgaa gaccggtttgtatacattgccgcccgcactcctgtagaccgggacactgaagatggatta aaggctgtaattacttgctatgatacagagactcgacagtggcaagatgtggaatctttg ccgcttattgacaattactgctttttccaaatgtctgtggtcaattcaaacttttatcag acagcatcatgttgtcccaagagttattgtttagaaaacgaagaggcagtaagaaaaatt gccagccaagtgtctgatgagatccttgaaagcttgcctccagaagtcctaagcatcgaa ggagcagccatttgctattacaaagatgatgtcttcattataggaggctggaaaaacagt gatgatattgataaacagtatcggaaagaagcctaccgatattgtgcggagaggaagagg tggatgcttcttcctcctatgccacaacctcgttgtagagccactgcttgtcacgtgagg atcccataccggtacttgcatggcacacagagataccctatgcctcaaaacctgatgtgg cagaaggaccgcatcagacagatgcaagagatacatcgtcacgccctgaacatgaggcga gtgccaagctctcagattgaatgctag >gi568815581r:41655747_41871854|GENSCAN_predicted_peptide_13|157_aa MFSKAFDSGIIPMEFVNKMKKEGKLIMGIGHRVKSINNPDMRVQILKDYVRQHFPATPLL DYALEVEKITTSKKPNLILNVDGLIGVAFVDMLRNCGSFTREEADEYIDIGALNGIFVLG RSMGFIGHYLDQKRLKQGLYRHPWDDISYVLPEHMSM >gi568815581r:41655747_41871854|GENSCAN_predicted_CDS_13|474_bp atgttcagtaaagcctttgacagtggcattatccccatggagtttgtgaacaagatgaag aaggaagggaagctgatcatgggcattggtcaccgagtgaagtcgataaacaacccagac atgcgagtgcagatcctcaaagattacgtcaggcagcacttccctgccactcctctgctc gattatgcactggaagtagagaagattaccacctcgaagaagccaaatcttatcctgaat gtagatggtctcatcggagtcgcatttgtagacatgcttagaaactgtgggtcctttact cgggaggaagctgatgaatatattgacattggagccctcaatggcatctttgtgctggga aggagtatggggttcattggacactatcttgatcagaagaggctgaagcaggggctgtat cgtcatccgtgggatgatatttcatatgttcttccggaacacatgagcatgtaa