GENSCAN 1.0    Date run: 5-Nov-116    Time: 04:25:01

Sequence gi568815597r:11747113_11947684 : 200572 bp : 51.59% C+G : Isochore 3 (51 - 57 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +    328    433  106  1  1   96   89   140 0.992  15.62
 1.02 Intr +   1303   1498  196  0  1   68   81   377 0.707  34.51
 1.03 Term +   2965   3080  116  2  2   88   48    80 0.768   3.04
 1.04 PlyA +   7664   7669    6                               1.05

 2.00 Prom +   8815   8854   40                              -2.01
 2.01 Init +  14762  15162  401  1  2  100    7   233 0.035  11.26
 2.02 Intr +  17219  17412  194  2  2  109   40    43 0.094   1.26
 2.03 Intr +  18745  19438  694  2  1  108   50   246 0.847  14.00
 2.04 Intr +  20965  21207  243  1  0   85   54   209 0.892  14.34
 2.05 Intr +  24412  24524  113  1  2   57   85    87 0.779   5.83
 2.06 Intr +  24970  25147  178  2  1  127   91    22 0.971   5.89
 2.07 Intr +  28323  28498  176  0  2  107   99   178 0.999  20.90
 2.08 Intr +  29352  29526  175  1  1   67   58   179 0.212  12.31
 2.09 Intr +  31548  31704  157  0  1   78  100   106 0.202  11.33
 2.10 Intr +  31850  31968  119  1  2   56   54    56 0.353  -1.23
 2.11 Intr +  32702  32898  197  2  2   68   96   115 0.489   9.88
 2.12 Intr +  35077  35221  145  2  1   65   90   172 0.922  14.95
 2.13 Intr +  37062  37481  420  0  0   82  105   193 0.901  14.03
 2.14 Term +  38036  38285  250  2  1  123   48   253 0.287  20.31
 2.15 PlyA +  39142  39147    6                              -0.45

 3.12 PlyA -  39320  39315    6                              -0.45
 3.11 Term -  43786  43568  219  1  0   25   46   363 0.987  23.17
 3.10 Intr -  44214  44095  120  0  0   88   98   137 0.999  15.89
 3.09 Intr -  45267  45166  102  0  0  104  102   103 0.991  14.17
 3.08 Intr -  46977  46795  183  0  0   69   70   407 0.978  37.50
 3.07 Intr -  47426  47246  181  1  1   60   87   235 0.999  20.99
 3.06 Intr -  47751  47617  135  2  0   82   81   112 0.893  10.09
 3.05 Intr -  48236  47986  251  2  2   72   94   477 0.982  43.47
 3.04 Intr -  49287  49094  194  1  2   85   94   210 0.986  21.03
 3.03 Intr -  53210  53100  111  0  0   77  109   204 0.994  22.25
 3.02 Intr -  54287  54049  239  1  2   60   48   433 0.993  34.29
 3.01 Init -  56004  55769  236  0  2   96   92   147 0.765  13.50
 3.00 Prom -  57664  57625   40                              -8.09

 4.00 Prom +  57870  57909   40                              -7.40
 4.01 Init +  59151  59237   87  2  0   90   99    24 0.562   2.52
 4.02 Intr +  60019  60114   96  0  0   67   67    83 0.632   4.81
 4.03 Intr +  68734  68799   66  0  0  126   98   115 0.999  15.89
 4.04 Intr +  69503  69568   66  1  0   89   81    90 0.982   7.99
 4.05 Term +  72376  72507  132  0  0  113   43    77 0.918   4.10
 4.06 PlyA +  73040  73045    6                               1.05

 5.00 Prom +  75033  75072   40                              -2.21
 5.01 Init +  76568  76721  154  1  1   48   94    60 0.089   2.72
 5.02 Intr +  77374  77441   68  2  2   89  110    74 0.993   8.82
 5.03 Intr +  79044  79102   59  2  2   65   80    61 0.614   1.17
 5.04 Intr +  79977  80109  133  0  1  115   65    66 0.897   8.15
 5.05 Intr +  80994  81107  114  2  0   93   95    89 0.999  11.25
 5.06 Intr +  81346  81512  167  0  2   93   70   113 0.998   9.27
 5.07 Intr +  82084  82210  127  1  1   77   91    84 0.965   8.79
 5.08 Intr +  86403  86526  124  2  1  117  101   117 0.998  16.56
 5.09 Intr +  86765  86918  154  0  1   92   73    49 0.992   3.44
 5.10 Intr +  87124  87283  160  1  1   70   89   293 0.736  28.10
 5.11 Intr +  87372  87478  107  2  2  111   80    84 0.918   9.41
 5.12 Intr +  87911  88075  165  2  0  119   66    13 0.775   1.79
 5.13 Intr +  88855  89041  187  1  1   99   98   421 0.999  44.41
 5.14 Intr +  89887  90044  158  0  2   79   78   305 0.958  27.92
 5.15 Intr +  90231  90387  157  0  1   62  123   145 0.979  15.93
 5.16 Intr +  91223  91330  108  1  0    8   69   292 0.941  20.28
 5.17 Intr +  91423  91548  126  0  0   98   83   128 0.997  14.68
 5.18 Term +  93031  93111   81  0  0   91   48   119 0.886   6.19
 5.19 PlyA +  95999  96004    6                               1.05

 6.04 PlyA -  96393  96388    6                               1.05
 6.03 Term -  96764  96618  147  2  0   50   44   132 0.275   3.21
 6.02 Intr - 100327 100001  327  1  0   68   94   124 0.346   7.45
 6.01 Init - 100572 100450  123  0  0   94   97   153 0.996  15.01
 6.00 Prom - 100809 100770   40                              -0.21

 7.06 PlyA - 102343 102338    6                               1.05
 7.05 Term - 103498 103406   93  1  0  100   42    53 0.749   0.03
 7.04 Intr - 104210 104115   96  2  0   57   49    95 0.649   3.21
 7.03 Intr - 107734 107646   89  2  2   43   81   113 0.956   6.29
 7.02 Intr - 111357 111102  256  0  1   75  121   289 0.994  28.45
 7.01 Init - 111721 111590  132  1  0   84   94   131 0.976  11.43
 7.00 Prom - 112701 112662   40                              -3.81

 8.00 Prom + 113247 113286   40                              -1.81
 8.01 Sngl + 127020 127400  381  2  0   48   47   168 0.786   5.21
 8.02 PlyA + 128875 128880    6                               1.05

 9.00 Prom + 130629 130668   40                              -1.01
 9.01 Init + 130838 131002  165  1  0   66    4   225 0.698  11.50
 9.02 Intr + 131079 131216  138  2  0  -60   -2   238 0.627   1.17
 9.03 Intr + 131276 131604  329  1  2   46   53   225 0.350   9.85
 9.04 Intr + 131639 132005  367  2  1   28   36   537 0.497  38.21
 9.05 Term + 132243 132647  405  2  0    6   46   497 0.961  32.86
 9.06 PlyA + 133656 133661    6                               1.05

10.00 Prom + 137741 137780   40                              -3.81
10.01 Init + 140079 140156   78  2  0   94   75   103 0.429   8.62
10.02 Intr + 141633 141693   61  2  1   76   68    39 0.063  -0.50
10.03 Term + 151691 151905  215  0  2   65   55   157 0.781   7.82
10.04 PlyA + 155205 155210    6                               1.05

11.00 Prom + 158793 158832   40                              -2.81
11.01 Init + 159210 159296   87  2  0   61  113    55 0.473   5.97
11.02 Term + 161551 161805  255  0  0   36   55   159 0.712   3.32
11.03 PlyA + 163035 163040    6                               1.05

12.00 Prom + 163695 163734   40                              -4.91
12.01 Init + 163975 164072   98  0  2   76   74   110 0.993   8.38
12.02 Term + 165706 165796   91  1  1   66   47   102 0.551   1.39
12.03 PlyA + 166334 166339    6                              -0.45

13.05 PlyA - 167588 167583    6                               1.05
13.04 Term - 169948 169756  193  0  1   88   44    93 0.624   2.12
13.03 Intr - 174263 174130  134  2  2    1   60   103 0.509  -1.25
13.02 Intr - 176377 175524  854  2  2   97   97  1244 0.786 117.95
13.01 Init - 179125 178093 1033  1  1   54  109  1428 0.999 133.73
13.00 Prom - 181241 181202   40                              -6.01

14.00 Prom + 185179 185218   40                              -7.40
14.01 Init + 187668 187743   76  2  1   82   72   237 0.997  20.70
14.02 Intr + 189206 189291   86  0  2  105   22    44 0.483  -0.46
14.03 Intr + 190467 190509   43  2  1   57  117    38 0.412   1.90
14.04 Intr + 197826 198006  181  1  1   72   72    93 0.145   5.54

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl +  14762  15166  405  1  0  100   49   220 0.931  14.61
S.002 Init +  16850  16928   79  1  1   81   65    55 0.846   3.68

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597r:11747113_11947684|GENSCAN_predicted_peptide_1|139_aa
XGCIVFSGSYAWANFTILALGVWAVAQRDSIDAISMFLGGLLATIFLDIVHISIFYPRVS
LTDTGRFGVGMAILSLLLKPLSCCFVYHMYRERGGELLVHTGFLGSSQDRSAYQTIDSAE
APADPFAVPEGRSQDARGY

>gi568815597r:11747113_11947684|GENSCAN_predicted_CDS_1|420_bp
nngggctgcattgtattctcaggctcctatgcctgggccaacttcaccatcctggccttg
ggcgtgtgggctgtggctcagcgggactccatcgacgccataagcatgtttctgggtggc
ttgctggccaccatcttcctggacatcgtgcacatcagcatcttctacccgcgggtcagc
ctcacggacacgggccgctttggcgtgggcatggccatcctcagcttgctgctcaagccg
ctctcctgctgcttcgtctaccacatgtaccgggagcgcgggggtgagctcctggtccac
actggtttccttgggtcttctcaggaccgtagtgcctaccagacgattgactcagcagag
gcgcccgcagatccctttgcagtcccagagggcaggagtcaagatgcccgagggtactga

>gi568815597r:11747113_11947684|GENSCAN_predicted_peptide_2|1153_aa
MAGGIPDVRGLQEAALGAGRSQEEARLVEEAQTPVMLPQDSGQRVEEVPGDLMAKRMSLI
LHVQKLPWDHVPCLRRTRQNLYQDVGGHAHGSGLGGAKRGAARSALRRPLPPATCRPAGI
VSGPSPRLDSNPTGAHLIKQTRPLTVEWTKDTPVPEPMELRSDASHKENVSPKPAALPKP
GKRLSGPGWAVTGAGPSLEQRRFRRSLGIGLSGRHDQWVPGCQVERGGPAATPSPGAVLD
QEPCRVQTNLASPGPRLGLALKDTTGQLVNSSFWQQSNLQSLARRRQGKAREFAIQQSNL
SINETSSPHLCPEPGGSSGPHKLPWGPLLSQEPLARPSSCLRQSGLPAPGTPSGDFRPTE
AFAPLDGHTQPGLRSWGGLGSWRSRLVGEPLTLEDLAVPSQNQTQAPSRAAVHQLLASVH
CLAQEAARLSWQLLSRCFRSWRHLVKRQREPAAAAVALGRWQLLRKCLQALWLREAQLEA
AWGQYTKVLLVRSFREVSGLQVGPGGRVKQCPGSLREEEIAQRLLSHPRQRTDSRHERVQ
ILQALQLAVFFLWCQQKKRARQERETLRKATRATQRTGSFPQAWHSTAAGVAWVAPLSPQ
HQRAWLCRCFGAWQQFVQRGSRYRDHLADRRTGTLRKCLEQWVRMKQLRESDGAKVTQLS
LCRQKAGREAVYTAGPGACGLGAVGQAQGQQEQGRGSLQDACWTLALCWALLLWKMRLFQ
RQWANSFFQGLQQRMLQRSLRWWHLRALGPDATSSCTKTPSALEPLSSSTLQDSLEKGSL
LWAAGQRQQGQCLLLWQARAQQFQGTARWYQHTRQRRWSRWATAQWAWRELASHRAWDRT
CRAVLGLWRQRLLQSRLVEWWAQERGWRLARDALCHWHSCWQGQQFLHEKCQTWVQVHLQ
GLQKVVFRSWQQAAAHQRCTVTRPEQLLLQSYFQAWCEVVRDTGVLRAQHQAFQDGLRRR
ALGAVFATWREAQEVAAGAQEQRVAQASLARWRSCGQQGQEDGQQKKARAPQAFPAWPVA PGMHHEAQQQAGESAGAQAAQCWTWCWALWVHESCRGQVSRAHASWKPRAWVLEASVQSA VRGGVQRAILTQLRPAELRRFLRTVQLRVRLGLPGAGKVRPKPQTDLTLWPRMASRGSLL LDAPAPWKQVKGA >gi568815597r:11747113_11947684|GENSCAN_predicted_CDS_2|3462_bp atggctgggggtatcccagatgtgcgggggctccaggaggcagctctaggtgcgggaagg agccaagaggaggcacgcctggtggaggaagcccagaccccagtgatgctgccacaggat tcggggcagagggtggaggaggttcccggggacctgatggccaagagaatgtcgctcatc ttgcacgtgcaaaagttgccctgggaccacgtgccttgcctgcggaggacccggcaaaac ctgtaccaggacgtcggtggccacgcccacggctcagggctgggcggggccaagcgcggg gcagcacgctctgctctccgacgtcccctcccgcccgcgacctgccgacctgcggggatc gttagcggtcccagcccccgtctagattcaaatccgactggggcccatctgattaaacag accaggccactcactgtggagtggaccaaggatactcctgtccctgagcccatggagcta aggtctgatgccagccacaaggagaatgtgtccccaaagcctgcagcgctcccgaagcca ggcaagaggcttagtgggcctggatgggcagttacaggagcagggccctctctagagcaa cgaagattccggaggagcctgggcatcggcctgagtggtagacatgaccagtgggtgccc gggtgccaggtggagaggggagggcctgctgccacaccctccccaggggcagtgctagac caggagccctgccgagtccagaccaacctggccagccctggtccccgcctgggcctagct ctgaaggacacgactggccaactggtcaattcaagcttctggcaacagagcaacctgcag tccctggccaggaggcgccaagggaaggcccgagagtttgccatccagcagagcaacctg agcatcaacgagaccagcagcccccacctctgcccagagcctgggggaagctctgggccc cacaagcttccctggggtcctctcctatcccaagagccactggctcgcccatcttcctgc ctgaggcagtccgggctgccggccccaggcacccctagcggggacttcaggcccactgaa gcctttgcccctctcgatgggcatacacagccaggcctcagatcctggggtggtctgggg agctggaggtccaggctggtgggggaacctctcaccctggaggacctggctgtccccagt cagaaccagactcaggccccatcccgtgctgccgtccaccagctgctggcttctgtacat tgcctggcgcaggaggcagcccgactcagctggcagctgttgtccagatgttttcgatcc tggaggcacttggtgaagaggcagcgggagccagcggcggcggcagtggcactgggccgc tggcagctgttgcgaaagtgccttcaggccttgtggctccgggaggctcagctggaggca gcatgggggcagtacacaaaggttctgctggtccggagcttccgagaggtcagcggtctc caggttgggccagggggccgtgtgaagcagtgtccaggaagtctgagggaggaggagatt gctcagcggcttctgtcacatcctaggcagagaacagacagcagacacgagagagtccag atcctgcaggccctgcaactggctgtgttcttcctgtggtgccaacagaagaaacgggcc 
agacaggagagggagactctgcggaaggccaccagggccacacagaggacagggagcttc ccccaggcctggcactctactgctgcaggtgtagcctgggtggccccactgagcccccag caccagagagcttggctgtgcaggtgcttcggggcgtggcagcagttcgtgcaaagaggg tcccggtaccgagaccacctggctgaccgccggacggggaccctgaggaaatgcctggaa cagtgggtgcggatgaagcagctccgggaatcagatggggcaaaggtgacccagctgtcc ctctgccggcagaaagcaggacgtgaggctgtctacaccgcaggccctggagcctgtggc ctgggtgcagtgggccaggcccaggggcagcaggagcaaggccggggctccctgcaggat gcctgctggacactggccctctgctgggcgctgctgctgtggaagatgcggcttttccag cgccagtgggccaactccttcttccagggcctgcagcagcggatgctgcagcgcagcctg agatggtggcacttgagggcactgggcccagatgccacatcaagctgcaccaagaccccc tcggctctggagccactgagcagcagcacactccaagactctctggagaaggggagcctt ctgtgggcagctgggcagcggcagcaggggcagtgccttctgctctggcaggcacgggcc cagcagttccagggcacagccaggtggtaccagcatacccgccagaggcgctggagccgc tgggcaacagcccaatgggcctggagagagctggcttcccaccgggcctgggatcggacc tgcagggctgtgctgggcctgtggcgtcagcggctgctgcagtcacggctggtggagtgg tgggcccaggagcggggctggcggctggcacgagatgccctatgccactggcactcctgt tggcaggggcagcagttcctgcatgaaaagtgccagacatgggtgcaggtccacctccag ggcctgcagaaggtggtgttccggagctggcagcaggcagcagctcatcagagatgcaca gtgacccggccagagcagctgctactgcagagctacttccaggcctggtgtgaggttgta agagacacgggggtgctccgggcccagcatcaagcctttcaggatggcctgaggagaaga gcactgggggccgtgtttgccacatggcgggaagcccaggaagtggcagccggggcacag gagcagcgtgtggcccaggcctcccttgcccgctggagaagctgcgggcagcaaggccag gaagatgggcagcagaagaaggcccgggccccacaggccttcccagcatggccagtggcc ccgggcatgcaccatgaggcccagcagcaggcaggagagagcgctggggcccaggcagcc cagtgctggacttggtgctgggctctgtgggtgcatgagtcctgtcggggccaggtcagc cgagcccatgcttcctggaagccgcgagcctgggtcctagaggcctcggtgcagtcggcg gtgcgcggcggtgtccagcgagccatcctcacccagctccggccggctgagctcaggcgc ttcctgcggacagtgcagctcagggtgcggctgggactgccaggggccggcaaggtacgc cccaagccccagactgacctcacgctctggccccggatggccagcagggggagcctgctg ctggacgccccagccccttggaaacaggtgaagggagcctag >gi568815597r:11747113_11947684|GENSCAN_predicted_peptide_3|656_aa MVNEARGNSSLNPCLEGSASSGSESSKDSSRCSTPGLDPERHERLREKMRRRLESGDKWF 
SLEFFPPRTAEGAVNLISRFDRMAAGGPLYIDVTWHPAGDPGSDKETSSMMIASTAVNYC GLETILHMTCCRQRLEEITGHLHKAKQLGLKNIMALRGDPIGDQWEEEEGGFNYAVDLVK HIRSEFGDYFDICVAGYPKGHPEAGSFEADLKHLKEKVSAGADFIITQLFFEADTFFRFV KACTDMGITCPIVPGIFPIQGYHSLRQLVKLSKLEVPQEIKDVIEPIKDNDAAIRNYGIE LAVSLCQELLASGLVPGLHFYTLNREMATTEVLKRLGMWTEDPRRPLPWALSAHPKRREE DVRPIFWASRPKSYIYRTQEWDEFPNGRWGNSSSPAFGELKDYYLFYLKSKSPKEELLKM WGEELTSEESVFEVFVLYLSGEPNRNGHKVTCLPWNDEPLAAETSLLKEELLRVNRQGIL TINSQPNINGKPSSDPIVGWGPSGGYVFQKAYLEFFTSRETAEALLQVLKKYELRVNYHL VNVKGENITNAPELQPNAVTWGIFPGREIIQPTVVDPVSFMFWKDEAFALWIERWGKLYE EESPSRTIIQYIHDNYFLVNLVDNDFPLDNCLWQVVEDTLELLNRPTQNARETEAP >gi568815597r:11747113_11947684|GENSCAN_predicted_CDS_3|1971_bp atggtgaacgaagccagaggaaacagcagcctcaacccctgcttggagggcagtgccagc agtggcagtgagagctccaaagatagttcgagatgttccaccccgggcctggaccccgag cggcatgagagactccgggagaagatgaggcggcgattggaatctggtgacaagtggttc tccctggaattcttccctcctcgaactgctgagggagctgtcaatctcatctcaaggttt gaccggatggcagcaggtggccccctctacatagacgtgacctggcacccagcaggtgac cctggctcagacaaggagacctcctccatgatgatcgccagcaccgccgtgaactactgt ggcctggagaccatcctgcacatgacctgctgccgtcagcgcctggaggagatcacgggc catctgcacaaagctaagcagctgggcctgaagaacatcatggcgctgcggggagaccca ataggtgaccagtgggaagaggaggagggaggcttcaactacgcagtggacctggtgaag cacatccgaagtgagtttggtgactactttgacatctgtgtggcaggttaccccaaaggc caccccgaagcagggagctttgaggctgacctgaagcacttgaaggagaaggtgtctgcg ggagccgatttcatcatcacgcagcttttctttgaggctgacacattcttccgctttgtg aaggcatgcaccgacatgggcatcacttgccccatcgtccccgggatctttcccatccag ggctaccactcccttcggcagcttgtgaagctgtccaagctggaggtgccacaggagatc aaggacgtgattgagccaatcaaagacaacgatgctgccatccgcaactatggcatcgag ctggccgtgagcctgtgccaggagcttctggccagtggcttggtgccaggcctccacttc tacaccctcaaccgcgagatggctaccacagaggtgctgaagcgcctggggatgtggact gaggaccccaggcgtcccctaccctgggctctcagcgcccaccccaagcgccgagaggaa gatgtacgtcccatcttctgggcctccagaccaaagagttacatctaccgtacccaggag tgggacgagttccctaacggccgctggggcaattcctcttcccctgcctttggggagctg aaggactactacctcttctacctgaagagcaagtcccccaaggaggagctgctgaagatg 
tggggggaggagctgaccagtgaagaaagtgtctttgaagtcttcgttctttacctctcg ggagaaccaaaccggaatggtcacaaagtgacttgcctgccctggaacgatgagcccctg gcggctgagaccagcctgctgaaggaggagctgctgcgggtgaaccgccagggcatcctc accatcaactcacagcccaacatcaacgggaagccgtcctccgaccccatcgtgggctgg ggccccagcgggggctatgtcttccagaaggcctacttagagtttttcacttcccgcgag acagcggaagcacttctgcaagtgctgaagaagtacgagctccgggttaattaccacctt gtcaatgtgaagggtgaaaacatcaccaatgcccctgaactgcagccgaatgctgtcact tggggcatcttccctgggcgagagatcatccagcccaccgtagtggatcccgtcagcttc atgttctggaaggacgaggcctttgccctgtggattgagcggtggggaaagctgtatgag gaggagtccccgtcccgcaccatcatccagtacatccacgacaactacttcctggtcaac ctggtggacaatgacttcccactggacaactgcctctggcaggtggtggaagacacattg gagcttctcaacaggcccacccagaatgcgagagaaacggaggctccatga >gi568815597r:11747113_11947684|GENSCAN_predicted_peptide_4|148_aa MAGCRGSLCCCCRWCCCCGERETRTPEELTILGETQEEEDEILPRKDYEVSSFDTAWATK VSLDYDRCINDPYLEVLETMDNKKGRRYEAVKWMVVFAIGVCTGLVGLFVDFFVRLFTQL KFGVVQTCILFTVPGVRGLQWSPRFFLT >gi568815597r:11747113_11947684|GENSCAN_predicted_CDS_4|447_bp atggcggggtgcagggggtctctgtgctgctgctgcaggtggtgctgctgctgcggtgag cgtgagacccgcacccccgaggagctgaccatccttggagaaacacaggaggaggaggat gagattcttccaaggaaagactatgaggtgagctcctttgatactgcttgggcaactaaa gtaagtttggattatgatcgctgtatcaatgacccttacctggaagttttggagaccatg gataataagaaaggtcgaagatatgaggcggtgaagtggatggtggtgtttgccattgga gtctgcactggcctggtgggtctctttgtggacttttttgtgcgactcttcacccaactc aagttcggagtggtacagacatgtatccttttcacggttcctggtgttcgtggacttcag tggtcaccaagattctttcttacatag >gi568815597r:11747113_11947684|GENSCAN_predicted_peptide_5|782_aa MGVPALLHQPVAAGSGIPEVKCYLNGVKVPGIVRLRTLLCKVLGVLFSVAGGLFVEKEGP MIHSGSVVGAGLPQFQSISLRKIQFNFPYFRSDRDKRDFVSAGAAAGVAAAFGAPIGGTL FSLEEGSSFWNQGLTWKVLFCSMSATFTLNFFRSGIQFGSWGSFQLPGLLNFGEFKCSDS DKKCHLWTAMDLGFFVVMGVIGGLLGATFNCLNKRLAKYRMRNVHPKPKLVRVLESLLVS LVTTVVVFVASMVLGECRQMSSSSQIGNDSFQLQVTEDVNSSIKTFFCPNDTYNDMATLF FNPQESAILQLFHQDGTFSPVTLALFFVLYFLLACWTYGISVPSGLFVPSLLCGAAFGRL VANVLKSYIGLGHIYSGTFALIGAAAFLGGVVRMTISLTVILIESTNEITYGLPIMVTLM 
VAKWTGDFFNKGIYDIHVGLRGVPLLEWETEVEMDKYLGFLCLDSRQGPYGLTRGPASRF CGGPDRSLFSTFRAVWSLSRLLDSVIAAGKQLRASDIMEPNLTYVYPHTRIQSLVSILRT TVHHAFPVVTENRGNEKEFMKGNQLISNNIKFKKSSILTRAGEQRKRSQSMKSYPSSELR NMCDEHIASEEPAEKEDLLQQMLERRYTPYPNLYPDQSPSEDWTMEERFRPLTFHGLILR SQLVTLLVRGVCYSESQSSASQPRLSYAEMAEDYPRYPDIHDLDLTLLNPRMIVDVTPYM NPSPFTVSPNTHVSQVFNLFRTMGLRHLPVVNAVGEIVGIITRHNLTYEFLQARLRQHYQ TI >gi568815597r:11747113_11947684|GENSCAN_predicted_CDS_5|2349_bp atgggtgtgcctgctctcctccatcagccggtggcagcaggttccgggatacccgaggtc aaatgctatctgaatggcgtaaaggtgccaggaatcgtccgtctccggaccctgctctgc aaggtccttggagtgctgttcagtgtggctggagggctcttcgtggagaaggaaggcccc atgatccacagtggttcggtggtgggagctggcctccctcagtttcagagcatctcctta cggaagatccagtttaacttcccctatttccgaagcgacagagacaagagagactttgta tcagcaggagcggctgctggagttgctgcagctttcggggcgccaatcgggggtaccttg ttcagtctagaggagggttcgtccttctggaaccaagggctcacgtggaaagtgctcttt tgttccatgtctgccaccttcaccctcaacttcttccgttctgggattcagtttggaagc tggggttccttccagctccctggattgctgaactttggcgagtttaagtgctctgactct gataaaaaatgtcatctctggacagctatggatttgggtttcttcgtcgtgatgggggtc attgggggcctcctgggagccacattcaactgtctgaacaagaggcttgcaaagtaccgt atgcgaaacgtgcacccgaaacctaagctcgtcagagtcttagagagcctccttgtgtct ctggtaaccaccgtggtggtgtttgtggcctcgatggtgttaggagaatgccgacagatg tcctcttcgagtcaaatcggtaatgactcattccagctccaggtcacagaagatgtgaat tcaagtatcaagacatttttttgtcccaatgatacctacaatgacatggccacactcttc ttcaacccgcaggagtctgccatcctccagctcttccaccaggatggtactttcagcccc gtcactctggccttgttcttcgttctctatttcttgcttgcatgttggacttacggcatt tctgttccaagtggcctttttgtgccttctctgctgtgtggagctgcttttggacgttta gttgccaatgtcctaaaaagctacattggattgggccacatctattcggggacctttgcc ctgattggtgcagcggctttcttgggcggggtggtccgcatgaccatcagcctcacggtc atcctgatcgagtccaccaatgagatcacctacgggctccccatcatggtcacactgatg gtggccaaatggacaggggactttttcaataagggcatttatgatatccacgtgggcctg cgaggcgtgccgcttctggaatgggagacagaggtggaaatggacaagtacctgggcttc ctgtgtctagatagcaggcaaggcccttacgggctaacacgggggccagcaagccgtttc tgtggagggccagacaggagtttattttcaactttccgggctgtatggtctctgtcacga 
ctactcgactctgtcattgcagcaggaaagcagctgagagccagcgacatcatggagccc aacctgacctacgtctacccgcacacccgcatccagtctctggtgagcatcctgcgcacc acggtccaccatgccttcccggtggtcacagagaaccgcggtaacgagaaggagttcatg aagggcaaccagctcatcagcaacaacatcaagttcaagaaatccagcatcctcacccgg gctggcgagcagcgcaaacggagccagtccatgaagtcctacccatccagcgagctacgg aacatgtgtgatgagcacatcgcctctgaggagccagccgagaaggaggacctcctgcag cagatgctggaaaggagatacactccctaccccaacctataccctgaccagtccccaagt gaagactggaccatggaggagcggttccgccctctgaccttccacggcctgatccttcgg tcgcagcttgtcaccctgcttgtccgaggagtttgttactctgaaagccagtcgagcgcc agccagccgcgcctctcctatgccgagatggccgaggactacccgcggtaccccgacatc cacgacctggacctgacgctgctcaacccgcgcatgatcgtggatgtcaccccatacatg aacccttcgcctttcaccgtctcgcccaacacccacgtctcccaagtcttcaacctgttc agaacgatgggcctgcgccacctgcccgtggtgaacgctgtgggagagatcgtggggatc atcacacggcacaacctcacctatgaatttctgcaggcccggctgaggcagcactaccag accatctga >gi568815597r:11747113_11947684|GENSCAN_predicted_peptide_6|198_aa MSSFSTTTVSFLLLLAFQLLGQTRANPMYNAVSNADLMDFKNLLDHLEEKMPLEDEVVPP QVLSEPNEEAGAALSPLPEVPPWTGEVSPAQRDGGALGRGPWDSSDRSALLKSKLRALLT APRSLRRSSCFGGRMDRIGAQSGLGCNSFRTQQQFHFFWLDIGAICVWGNFLNTRKKQNG FPIRCSSRTRLLAGGDEF >gi568815597r:11747113_11947684|GENSCAN_predicted_CDS_6|597_bp atgagctccttctccaccaccaccgtgagcttcctccttttactggcattccagctccta ggtcagaccagagctaatcccatgtacaatgccgtgtccaacgcagacctgatggatttc aagaatttgctggaccatttggaagaaaagatgcctttagaagatgaggtcgtgccccca caagtgctcagtgagccgaatgaagaagcgggggctgctctcagccccctccctgaggtg cctccctggaccggggaagtcagcccagcccagagagatggaggtgccctcgggcggggc ccctgggactcctctgatcgatctgccctcctaaaaagcaagctgagggcgctgctcact gcccctcggagcctgcggagatccagctgcttcgggggcaggatggacaggattggagcc cagagcggactgggctgtaacagcttccggacccagcagcagtttcacttcttctggttg gacatcggggcaatctgtgtgtggggcaacttcctaaacaccaggaagaaacagaatgga ttccccatccgctgctcctccaggacgcggctgctagcaggaggagatgaattctag >gi568815597r:11747113_11947684|GENSCAN_predicted_peptide_7|221_aa MDPQTAPSRALLLLLFLHLAFLGGRSHPLGSPGSASDLETSGLQEQRNHLQGKLSELQVE QTSLEPLQESPRPTGVWKSREVATEGIRGHRKMVLYTLRAPRSPKMVQGSGCFGRKMDRI 
SSSSGLGCKARKEDPKVTCEGAKCDSWSWVAVAGRLLYPPSLEMTAAAANAFIAQGPQAK QGQIPDPQSTEGNCPQSSALSDLHIKPKKQNIIFKVSNDHP >gi568815597r:11747113_11947684|GENSCAN_predicted_CDS_7|666_bp atggatccccagacagcaccttcccgggcgctcctgctcctgctcttcttgcatctggct ttcctgggaggtcgttcccacccgctgggcagccccggttcagcctcggacttggaaacg tccgggttacaggagcagcgcaaccatttgcagggcaaactgtcggagctgcaggtggag cagacatccctggagcccctccaggagagcccccgtcccacaggtgtctggaagtcccgg gaggtagccaccgagggcatccgtgggcaccgcaaaatggtcctctacaccctgcgggca ccacgaagccccaagatggtgcaagggtctggctgctttgggaggaagatggaccggatc agctcctccagtggcctgggctgcaaagcaaggaaagaggaccccaaagtgacctgcgaa ggagccaaatgtgacagttggagctgggtagcggtagcgggccgcctgctgtacccacca agccttgagatgactgcagccgcagccaacgccttcattgcccaggggccccaggccaag cagggccagatccctgatccacagagcacagagggaaactgtccccagtcttcagccctt agtgacctccatattaaaccaaagaaacaaaacatcattttcaaagtcagcaacgatcat ccctaa >gi568815597r:11747113_11947684|GENSCAN_predicted_peptide_8|126_aa MGHCEQQTHGTLDPADTLAVEKGREPATPGGHSHEWRRVGATREDRKHEWPSQNIHMRVI NRAVPGEDALCLGGTTEEQSRLDRKKSCDMWEMREALAGYQRPSGCLSSLTLLPSEGGHC HSGSAF >gi568815597r:11747113_11947684|GENSCAN_predicted_CDS_8|381_bp atgggtcactgtgaacagcaaacacatggcaccttggatcctgcagacacgctggcagtg gagaaagggagggagccagccacgcctgggggtcactctcatgaatggcgtagagtgggg gccacacgtgaggacaggaagcatgaatggccgtcccagaacatccacatgagggttatt aatagagccgtcccaggagaggacgcactgtgcttgggagggacaaccgaggaacagagc cggctggaccgtaagaagtcctgtgacatgtgggaaatgagggaggcgctggcaggctac caaaggccctctggctgcctctcctccctgaccttactccccagtgaaggtggccactgc cactcgggctctgcgttctag >gi568815597r:11747113_11947684|GENSCAN_predicted_peptide_9|467_aa MSLLRSTTPPSWQHRGQLPPEDQGDEISVSEELEPSTLTPASALKPSDGMTISSLLFRIS LDNRMLTTNCCSYPGLLIVPQGVQDNALQWVSHCYYQNCFPGRGVLGLFKAQNAPSPGQS QVDSSSLEQEKYLQAVVSSMPHYADASGRNMLTGFSSAHMGSHIPSPRARVITLSNPMAP LPSRWTAPRSKWGSVQTSGRSSGLGANMGSWQQGPLAHPGFLRLQRVALYILGNKAQLKG VRPDSLQQWELVPIEVFEEQQVKANFKKLLKACVPGCPAAEPSPASFLCLLEDSERLTPI HKLLQVPVLVVEPLDSDSSVLVGLEDGWDITSQVPRLPPYVPPFRTFLLDSDPESIELGL LYEEKGERRGQLLCRSLWECVDRLSKRTPMFYNYMHAPEDTELPQPYSNVSNLKVWDFYT 
EEILAKGPPYDWELAQGPPGPLEEERSDGGAPQCRHRVAWPCYDSCP >gi568815597r:11747113_11947684|GENSCAN_predicted_CDS_9|1404_bp atgtcactcttaagaagtacaacccccccaagctggcagcaccggggccagctgccccct gaagaccaaggggacgagatctcagtgtcagaggagctggagcccagcacgctgaccccg gcctcagctctgaagccctcagacggcatgaccatcagcagcctgctcttccgcatttct ctggacaaccgcatgttgaccaccaactgctgcagctacccggggctgctgatcgtgccc cagggtgtccaggacaatgccctgcagtgggtgtcccactgctactaccagaactgcttc cccggcagaggtgtcctcggcctcttcaaggcccagaacgcaccttctccaggccagtcc caggtggactcgagtagcctggagcaggagaagtacctgcaggctgtggtcagttccatg ccccactatgccgacgcgtcgggacgcaacatgcttaccggcttctcctcagcccacatg ggcagtcacattcccagccccagagccagggtcatcacactgtccaaccccatggcgccc ttgccctccagatggactgcaccccgaagtaagtggggcagtgtccagaccagcgggcgc agcagtgggcttggtgccaatatgggctcctggcaacagggccccctcgcccacccgggc ttcctgcggctgcagcgagtagccctctacatcctcgggaacaaagcccagctcaagggt gtgcggccagactccctgcagcagtgggagctggtgcccattgaggtattcgaggaacag caggtgaaggccaacttcaagaagctgctgaaggcatgtgtcccgggctgccccgctgct gagcccagcccagcctccttcctgtgcttgctggaggactcggagcggctgaccccaatc cacaagctgctgcaggtgccagtgctggtggtggagcccctggattcagactcctctgtg ctggtgggcctggaggatggctgggacatcacctcccaggttcctcggctgccaccatat gtcccgccgttccggaccttcctacttgactctgaccctgagagcattgagctggggctg ctgtatgaggagaagggggaacgcaggggccagctgctgtgcaggtctttgtgggagtgt gtggaccggctgagcaagaggacgcccatgttctacaattacatgcatgcacccgaggac acagagctcccgcagccctacagcaacgtgtccaacctgaaggtgtgggatttctacact gaggagatactggccaagggccctccctatgactgggaactggcccaggggccccctggg cccctggaggaagaacggtctgacggaggtgctccccagtgcaggcaccgcgtggcgtgg ccctgctacgacagctgcccctga >gi568815597r:11747113_11947684|GENSCAN_predicted_peptide_10|117_aa MRLCPLALALAQCRREAVGASSSILQIRVLSGRKSPLKVPPAGRVADRRAHLGCRDPRAL QVGSAGRITCQVLGSRSTGTRGVQDGGATEVSKGTARAEIGQGGGQELVELRSRQWI >gi568815597r:11747113_11947684|GENSCAN_predicted_CDS_10|354_bp atgaggctctgtccgctcgcccttgcgctggctcagtgcagacgggaagcggttggagcc agttccagcattctgcagattcgtgttctcagcggcaggaaaagccctctaaaggtgcca ccagctggccgagtggcagatcgcagagcccatcttggctgccgggacccgagggcgctc 
caggttggctccgcgggtcgcatcacctgccaagtgctcgggagccgcagcacgggcacc aggggggtccaggatggaggggccacggaggtctccaagggaactgcccgggctgagatt ggccagggagggggccaggaactagtggagctacggagccgccagtggatctga >gi568815597r:11747113_11947684|GENSCAN_predicted_peptide_11|113_aa MSQMRLQAGWEDEGVWPLKEAPGMSWADTNLMADDEGHSFNLELSTPPTHLPPPLEKLPS MNWVPCARKVGDRWTIRSNEAGTRGELPRCSRRLLRVESKKAVPRALLPRIPA >gi568815597r:11747113_11947684|GENSCAN_predicted_CDS_11|342_bp atgagccagatgcggttgcaggcgggctgggaagatgagggcgtctggcctctcaaggaa gcccctgggatgtcgtgggctgacacgaatctaatggctgatgacgaggggcacagtttc aacctggaactgtccacaccccctacccaccttcccccgcccttggaaaaactgccttcc atgaactgggtcccttgtgccagaaaggttggggaccgctggactatccgatcaaatgaa gcagggactcggggcgagctcccccgctgctcccgccggcttctccgtgtagaatcgaag aaggcagtcccaagggctcttctgccccgtatccccgcctga >gi568815597r:11747113_11947684|GENSCAN_predicted_peptide_12|62_aa MKPGGPGKSAFSTFFCLLFLAVLAADWMVPPTLTSKRNQEYGKSMKVDKERDHGSTAPQG GV >gi568815597r:11747113_11947684|GENSCAN_predicted_CDS_12|189_bp atgaagcccggaggacccggcaagtcggctttttccaccttcttctgcctgctttttcta gctgtgctggcagccgactggatggtgcccccgacactaacttcaaagaggaaccaggag tacgggaagagcatgaaagtggacaaggagcgtgaccatggaagcacagcaccacaggga ggggtttag >gi568815597r:11747113_11947684|GENSCAN_predicted_peptide_13|737_aa MWLQQRLKGLPGLLSSSWARRLLCLLGLLLLLLWFGGSGARRAAGGLHLLPWSRGEPGAA EPSACLEAATRAWRGLRERGEVVPLGPGVPALVANGFLALDVAANRLWVTPGEREPAVAP DFVPFVQLRPLSALAEAGEAVLLLREGLLRRVRCLQLGSPGPGPVAAGPGPASVSGLAAG SGRDCVLLQEDFLAHRGRPHVYLQRIQLNNPTERVAALQTVGPTAGPAPKAFTSTLEKVG DHQFLLYSGRSPPTPTGLVHLVVVAAKKLVNRLQVAPKTQLDETVLWVVHVSGPINPQVL KSKAAKELKALQDLARKEMLELLDMPAAELLQDHQLLWAQLFSPGVEMKKITDTHTPSGL TVNLTLYYMLSCSPAPLLSPSLSHRERDQMESTLNYEDHCFSGHATMHAENLWPGRLSSV QQILQLSDLWRLTLQKRGCKGLVKVGAPGILQGMVLSFGGLQFTENHLQFQADPDVLHNS YALHGIRYKNDHINLAVLADAEGKPYLHVSVESRGQPVKIYACKAGCLDEPVELTSAPTG HTFSVMVTQPITPLLYISTDLTHLQDLRHTLHLKAILAHDEHMAQQDPGLPFLFWFSVAS LITLFHLFLFKLIYNEYCGPGAKPLFRSKHRTQDRSIALKFPETCLPLEAVILAIIAARL SSGQHLSISIAVAGKLRGTQAYVALKHVNDNTVWGHPSPVGHGASRHKLSSSVTGVDNSE AHFIRLLRRCYGLDMVC 
>gi568815597r:11747113_11947684|GENSCAN_predicted_CDS_13|2214_bp atgtggctgcagcagcggctcaaggggctgccgggactgctgtcgagcagctgggcccgc cgcctcctctgcctgcttggcctcctgctgctgcttctgtggtttggggggtccggcgcg cggcgggcggcgggcggcctgcacctgctgccctggtcccgcggtgagccgggcgccgcc gagccgtctgcctgcctggaggcggccacccgcgcctggcgcggcctgcgggagcgcggt gaggtggtaccgctgggtcctggagtgccggccctggtggccaacggcttcctggccctg gacgtggctgccaatcggctgtgggtgactcccggggagcgggagcccgccgtggcgccg gactttgtgcccttcgtgcagctgcgcccgctgagcgcgctggctgaagctggagaggcg gtgctgctgctgcgggaggggcttctgcgccgcgtgcgttgcctgcagctggggtcccca ggtcctggccccgtggccgccggccccgggcccgcctccgtctctggccttgccgcgggg tccggccgcgactgcgtgctgctgcaagaggactttctggcgcacaggggccgaccccac gtctacctgcagcgcatccagctcaacaaccccacggagcgcgtggccgcgctgcagact gtggggcccactgccggcccagcccccaaggccttcaccagtaccctggagaaggtcgga gaccatcagttcctcctctactcaggccggtccccgcctacgcccactgggttggtgcac ctggtggtggtggccgccaagaagctggtgaaccgcctccaagtggctcccaagacgcag ctggatgagacggtgctgtgggtggtgcacgtctctggccccattaacccccaggtgctc aaaagcaaagcagccaaggagctcaaggcgctgcaggacttggcacggaaggaaatgctg gagctcttggacatgccagcggcggagctgcttcaagaccaccagctcctctgggctcag ctcttcagcccaggagtggaaatgaagaagatcactgacacccacacgccgtctggcctc accgtgaacctgacgctctattacatgctctcctgctcgccagccccactgctcagcccc tccctgagccacagggagcgagaccagatggagtcgacgctcaactatgaagatcactgc ttcagcgggcacgccaccatgcacgccgagaacctgtggccggggcggctgtcctccgtc cagcagatcctgcagctctctgacctgtggaggctgaccctccagaagcgtggctgcaag gggctggtgaaggtgggtgccccaggcatcctgcaggggatggtgctcagctttgggggg ctgcagttcacagagaaccacctccagttccaggccgaccccgacgtgctgcacaacagc tatgcattgcatggcatccgctacaagaacgaccatatcaacctggccgtgctggcggat gccgagggcaagccctacctacacgtgtccgtggagtcccgtggccagcctgtcaagatc tatgcctgcaaggcaggctgcctggacgagccagtggagctgacctcggcgcccacgggc cacaccttctcggtcatggtgacacagcccatcacgccactgctctacatctccaccgac ctcacacacctgcaggacctgcggcacacgctgcacctcaaggccatcctggcccatgat gagcacatggcccagcaggaccccgggctgcccttcctcttctggttcagcgtggcctcc ctaatcaccctcttccacctcttcctcttcaagctcatctacaacgagtactgtgggcct 
ggagccaagcccctcttcaggagtaagcataggacacaggacagaagcattgccctcaag tttcctgagacatgtttgcccctggaagcggtgattttggccatcattgctgcaagactg tcatcggggcagcatcttagcatcagcattgctgtcgcggggaagctgcgaggcactcag gcatatgtagccttgaagcatgtgaatgacaacaccgtttggggccacccttcacccgtg ggacatggagccagtagacacaagctctcctcttctgtcactggcgtggacaattcggag gcacattttattcggctcctcagaaggtgctacggtttggatatggtttgttga >gi568815597r:11747113_11947684|GENSCAN_predicted_peptide_14|129_aa MRPLLLLALLGWLLLAEAKGDAKPEALPGMGHLQGCPEKGIDCGQSQQSQAEVATIPGEK PQLPMMKPGEKCKLQTLRELRLTQGNNDEHCSVGLASVAWRPERAGEIREGFIEVEGTSL SLALGPGGS >gi568815597r:11747113_11947684|GENSCAN_predicted_CDS_14|387_bp atgcggcccctgctgctactggccctgctgggctggctgctgctggccgaagcgaagggc gacgccaagccggaggccctccctggcatgggccatctccagggctgtcccgagaagggc atagactgcggacagtcccagcaaagccaagcagaagtggcgaccatcccaggagagaaa ccgcagctgcccatgatgaagcccggtgagaagtgtaagctgcagaccctcagggagcta agactcacccagggaaacaatgatgagcactgcagtgtgggtcttgcctctgtggcttgg aggccagagagggcaggagaaatccgagaaggcttcatagaggtggaagggacgtcattg tcactggccctgggacctggaggaagn