GENSCAN 1.0 Date run: 16-Jul-119 Time: 15:53:30 Sequence gi568815592f:33521603_33795777 : 274175 bp : 52.34% C+G : Isochore 3 (51 - 57 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 177 495 319 1 1 28 64 207 0.363 7.86 1.02 Term + 11034 11289 256 0 1 52 36 155 0.009 2.09 1.03 PlyA + 11759 11764 6 1.05 2.03 PlyA - 12669 12664 6 1.05 2.02 Term - 46224 46209 16 2 1 121 44 5 0.645 -2.61 2.01 Init - 47773 47697 77 1 2 65 105 65 0.854 6.41 2.00 Prom - 48337 48298 40 -0.31 3.11 PlyA - 49074 49069 6 1.05 3.10 Term - 52305 52201 105 0 0 141 41 135 0.886 12.81 3.09 Intr - 52612 52432 181 0 1 68 92 228 0.999 21.59 3.08 Intr - 53839 53696 144 0 0 115 80 302 0.995 32.01 3.07 Intr - 54326 54191 136 0 1 71 99 102 0.993 9.73 3.06 Intr - 63165 63105 61 0 1 120 94 24 0.000 5.00 3.05 Intr - 64896 64762 135 0 0 132 68 58 0.000 9.47 3.04 Intr - 65242 65162 81 1 0 66 102 12 0.000 0.73 3.03 Intr - 66733 66582 152 2 2 57 59 72 0.000 1.59 3.02 Intr - 71566 71462 105 2 0 89 76 63 0.175 5.89 3.01 Init - 72000 71649 352 0 1 38 70 180 0.218 6.63 3.00 Prom - 74523 74484 40 -4.91 4.00 Prom + 75591 75630 40 -2.31 4.01 Init + 79270 79398 129 0 0 81 74 16 0.019 -0.39 4.02 Intr + 86438 86467 30 1 0 84 80 27 0.373 0.31 4.03 Term + 87111 87239 129 2 0 124 49 44 0.289 2.59 4.04 PlyA + 87319 87324 6 1.05 5.00 Prom + 92501 92540 40 -2.61 5.01 Init + 100001 100089 89 1 2 93 84 175 0.967 17.67 5.02 Term + 102153 102264 112 0 1 87 41 121 0.966 5.63 5.03 PlyA + 104739 104744 6 -0.45 6.00 Prom + 105808 105847 40 -2.31 6.01 Init + 111853 111904 52 0 1 99 75 -12 0.072 -0.27 6.02 Intr + 112521 112572 52 1 1 78 76 41 0.063 0.35 6.03 Intr + 118882 118952 71 1 2 122 83 167 0.997 18.92 6.04 Intr + 134164 134285 122 2 2 102 91 237 0.994 26.12 6.05 Intr + 136330 136416 87 0 0 97 94 163 0.999 18.46 6.06 Intr + 137068 137226 159 0 0 61 64 304 0.991 26.00 6.07 Intr + 137419 137517 99 0 0 106 78 212 0.994 22.81 6.08 Intr + 137864 137947 84 1 0 104 74 96 0.980 10.31 6.09 Intr + 140926 141072 147 0 0 85 55 241 0.775 21.44 6.10 Intr + 141309 141404 96 2 0 106 83 142 0.993 16.21 6.11 Intr + 141898 141948 51 0 0 80 77 33 0.660 0.99 6.12 Intr + 142136 142278 143 1 2 36 80 205 0.976 14.16 6.13 Intr + 143268 143367 100 0 1 77 74 257 0.923 23.81 6.14 Intr + 143451 143611 161 2 2 100 80 453 0.999 45.00 6.15 Intr + 144233 144374 142 2 1 80 96 322 0.999 33.06 6.16 Intr + 145527 145688 162 2 0 107 85 326 0.999 34.89 6.17 Intr + 146190 146362 173 2 2 120 80 466 0.999 48.26 6.18 Intr + 146913 147032 120 0 0 84 46 203 0.999 15.81 6.19 Intr + 147354 147554 201 0 0 63 49 340 0.562 26.52 6.20 Intr + 148723 148974 252 1 0 114 27 522 0.999 45.78 6.21 Intr + 149069 149213 145 2 1 67 24 319 0.995 24.19 6.22 Intr + 149563 149704 142 0 1 132 65 320 0.984 34.54 6.23 Intr + 150427 150626 200 2 2 129 51 280 0.998 27.99 6.24 Intr + 151989 152118 130 2 1 127 74 213 0.999 24.57 6.25 Intr + 152606 152663 58 0 1 121 60 100 0.996 8.93 6.26 Intr + 154089 154254 166 0 1 101 66 361 0.968 35.68 6.27 Intr + 155166 155330 165 2 0 84 107 337 0.995 35.87 6.28 Intr + 155413 155487 75 0 0 132 69 139 0.999 16.61 6.29 Intr + 155902 156027 126 0 0 82 34 240 0.998 19.38 6.30 Intr + 156819 156941 123 2 0 82 60 222 0.863 20.09 6.31 Intr + 157037 157237 201 1 0 79 78 386 0.989 36.70 6.32 Intr + 158280 158531 252 2 0 88 59 532 0.841 48.46 6.33 Intr + 158727 158852 126 2 0 82 25 325 0.999 26.98 6.34 Intr + 158953 159078 126 0 0 107 99 217 0.999 26.08 6.35 Intr + 160922 161042 121 1 1 55 92 254 0.529 23.07 6.36 Intr + 161605 161795 191 2 2 74 94 299 0.996 28.93 6.37 Intr + 162418 162566 149 0 2 103 63 292 0.999 27.74 6.38 Intr + 162755 162863 109 2 1 64 100 282 0.968 27.79 6.39 Intr + 162996 163086 91 2 1 92 65 131 0.979 11.27 6.40 Intr + 163172 163341 170 0 2 51 64 337 0.845 27.78 6.41 Intr + 163757 163931 175 1 1 68 77 344 0.999 31.33 6.42 Intr + 164041 164225 185 2 2 82 109 267 0.999 28.23 6.43 Intr + 164451 164651 201 2 0 114 99 411 0.952 44.80 6.44 Intr + 164807 164917 111 1 0 79 81 248 0.868 24.28 6.45 Intr + 165407 165516 110 1 2 104 80 145 0.989 14.88 6.46 Intr + 165590 165725 136 2 1 54 81 236 0.979 20.58 6.47 Intr + 165876 165962 87 2 0 82 64 118 0.863 9.46 6.48 Intr + 166455 166565 111 2 0 97 64 177 0.994 17.28 6.49 Intr + 166637 166829 193 1 1 58 45 397 0.929 31.99 6.50 Intr + 167054 167179 126 0 0 103 94 230 0.999 26.26 6.51 Intr + 167636 167808 173 0 2 87 82 331 0.999 32.58 6.52 Intr + 168432 168596 165 2 0 88 81 357 0.999 35.67 6.53 Intr + 169315 169507 193 0 1 64 22 521 0.795 42.69 6.54 Intr + 170013 170117 105 1 0 96 100 103 0.915 12.99 6.55 Intr + 170199 170326 128 1 2 71 50 178 0.996 13.20 6.56 Intr + 171126 171291 166 2 1 114 65 319 0.936 32.25 6.57 Intr + 171943 172103 161 2 2 114 70 320 0.999 33.02 6.58 Intr + 173322 173483 162 2 0 86 69 337 0.999 32.29 6.59 Term + 174110 174178 69 1 0 73 55 93 0.983 2.73 6.60 PlyA + 174941 174946 6 1.05 7.05 PlyA - 175189 175184 6 1.05 7.04 Term - 176148 176051 98 1 2 133 49 111 0.998 10.13 7.03 Intr - 178911 178842 70 0 1 41 95 82 0.852 3.45 7.02 Intr - 179818 179744 75 1 0 126 113 75 0.999 14.01 7.01 Init - 190084 189947 138 1 0 105 60 179 0.943 16.91 7.00 Prom - 195588 195549 40 -1.51 8.06 PlyA - 196221 196216 6 1.05 8.05 Term - 201585 201118 468 0 0 29 47 428 0.996 28.16 8.04 Intr - 203952 203839 114 0 0 -8 96 147 0.262 7.05 8.03 Intr - 205304 205061 244 1 1 109 90 185 0.920 18.83 8.02 Intr - 206698 206485 214 2 1 132 78 237 0.998 25.40 8.01 Init - 213874 213676 199 1 1 106 113 275 0.936 30.84 8.00 Prom - 232807 232768 40 -0.21 9.14 PlyA - 234252 234247 6 1.05 9.13 Term - 236557 236537 21 1 0 116 33 11 0.051 -3.01 9.12 Intr - 240137 239896 242 0 2 100 64 56 0.055 2.20 9.11 Intr - 248996 248906 91 2 1 116 53 8 0.029 0.17 9.10 Intr - 250309 249931 379 0 1 67 33 211 0.728 9.03 9.09 Intr - 255454 255352 103 2 1 100 117 150 0.989 18.83 9.08 Intr - 255637 255536 102 2 0 96 82 25 0.906 3.35 9.07 Intr - 256328 256107 222 0 0 84 -19 137 0.594 1.23 9.06 Intr - 256785 256640 146 2 2 95 78 184 0.999 18.54 9.05 Intr - 258577 258498 80 1 2 101 105 122 0.984 14.04 9.04 Intr - 259551 259475 77 1 2 106 97 27 0.996 5.13 9.03 Intr - 262825 262750 76 1 1 107 105 100 0.975 13.18 9.02 Intr - 265172 265132 41 0 2 87 105 47 0.870 4.73 9.01 Init - 267631 266779 853 1 1 115 60 836 0.494 76.44 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 58467 58712 246 2 0 87 46 195 0.957 8.86 S.002 Intr + 65068 65207 140 2 2 93 101 159 0.860 18.39 S.003 Sngl - 99630 99169 462 0 0 57 43 232 0.838 10.11 S.004 Init + 133615 133741 127 0 1 62 63 92 0.861 2.40 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592f:33521603_33795777|GENSCAN_predicted_peptide_1|191_aa XGCSECGVCRARAHPELELARSPGSRLRLSLHTFSQAEGAGSGLGPPREGLPQCSGGLKG SSSVARADAEAEEVLSASEGRQHAVTSQYHLPEILYHHMLWRFLSEGAQDLQPAMPEPPT HSMGSCAARASRTSTTPCSRAASPIDHPRAEECERTAQDWQAAPPAAPVRDPLGEASWAP ESGGDVESLYV >gi568815592f:33521603_33795777|GENSCAN_predicted_CDS_1|576_bp nccggctgctccgagtgtggggtctgccgagcccgcgcccacccggaactcgagctggcg cgcagccctggttcccgcctgcgcctctccctccacaccttctcgcaagcagagggagcc ggctctggcctcggcccgcccagagaagggctcccacagtgcagtggcgggctgaagggc tcctcaagcgtggccagagcagatgctgaggccgaggaggtgctgagcgcgagcgagggc cgccagcacgctgtcacctctcaataccaccttcctgaaattctttatcatcacatgctg tggagattcctctcggagggggctcaggacctgcagcccgccatgcctgagcctcccacc cactccatgggctcctgtgccgcccgagcctcccggacgagcaccaccccctgctccagg gcggccagtcccattgaccacccaagggctgaggagtgcgagcgcacggcgcaggactgg caggcagctccacctgcagccccagtgcgggatccactgggtgaagccagctgggctcct gagtctggtggggacgtggagagtctttatgtctag >gi568815592f:33521603_33795777|GENSCAN_predicted_peptide_2|30_aa MDVGRHQMDACESVNVRVAKCVCGICKAYS >gi568815592f:33521603_33795777|GENSCAN_predicted_CDS_2|93_bp atggacgtgggcaggcaccaaatggatgcatgtgagagtgtgaacgtgcgtgtggctaag tgtgtatgtggcatctgtaaagcctattcctag >gi568815592f:33521603_33795777|GENSCAN_predicted_peptide_3|483_aa MVPSPGSLLDSSLLALLASFSALSKLYYSNIIIWFLTARVCYGPPTTDCEFLEARELPSA QEAPRSAAGRPLRARPEGTEMWHGSAVLRCERLRRYARSAGAWGGLAVVAETCLLHFGLR LFPDSRSQAAPLHPADDGDKVPHKPLIWGKGEEMTQASQRYSGNPRTVTPGTGQAGWGGD EKKEEGPERADTKIQTLRCEKTKEGVGGLARAKRTWPSTMLTSTTVISGRDVEPARYHPS FALPSPGSSLFEGLLWQRQHESLRLHPLGTACKGQVFFLQLQDLDKVAANPKAQSEEQVA QDTEEVFRSYVFYRHQQEQEAEGVAAPADPEMVTLPLQPSSTMGQVGRQLAIIGDDINRR YDSEFQTMLQHLQPTAENAYEYFTKIATSLFESGINWGRVVALLGFGYRLALHVYQHGLT GFLGQVTRFVVDFMLHHCIARWIAQRGGWVAALNLGNGPILNVLVVLGVVLLGQFVVRRF FKS >gi568815592f:33521603_33795777|GENSCAN_predicted_CDS_3|1452_bp atggtgccttctcctggaagccttctggattcctccctgctggcattgcttgcttccttc tctgcactctccaaactatactacagcaacatcatcatctggttcttaactgcgcgggtc tgttacggccctcccaccactgactgtgagtttctggaggccagggagctgccttcagca caggaggccccgcgctccgccgccggtcgacccctgagggcgaggccagagggaacggaa atgtggcacggttcggcagttctcagatgcgagcgcctgcgcaggtacgcacgctccgct ggagcctggggtggcctggcagtcgtggccgagacgtgtttgctgcacttcggccttcga ctcttcccggactccaggtcccaggccgccccgctccaccctgcggatgatggagacaaa gtcccccacaagcccctcatatggggcaagggggaagaaatgacccaggcctctcagcgc tactcaggcaaccccaggactgtgactcctgggacagggcaggctggctggggaggggat gagaagaaggaggagggaccagaaagggcagataccaagattcaaacccttcgatgtgaa aagacaaaagaaggggttggtggtctggcacgggccaagcgtacctggccatccacaatg ctcacctccaccacggtgatctctggccgggacgtagagccagccaggtaccacccctcc ttcgccctgccttccccaggaagctcactctttgagggccttctctggcagcggcagcat gagagcctgcgtcttcacccactggggacagcatgcaaagggcaggtattcttccttcaa ctgcaggatctggataaagtggctgctaatcccaaagcacagtcagaggagcaggtagcc caggacacagaggaggttttccgcagctacgttttttaccgccatcagcaggaacaggag gctgaaggggtggctgcccctgccgacccagagatggtcaccttacctctgcaacctagc agcaccatggggcaggtgggacggcagctcgccatcatcggggacgacatcaaccgacgc tatgactcagagttccagaccatgttgcagcacctgcagcccacggcagagaatgcctat gagtacttcaccaagattgccaccagcctgtttgagagtggcatcaattggggccgtgtg gtggctcttctgggcttcggctaccgtctggccctacacgtctaccagcatggcctgact ggcttcctaggccaggtgacccgcttcgtggtcgacttcatgctgcatcactgcattgcc cggtggattgcacagaggggtggctgggtggcagccctgaacttgggcaatggtcccatc ctgaacgtgctggtggttctgggtgtggttctgttgggccagtttgtggtacgaagattc ttcaaatcatga >gi568815592f:33521603_33795777|GENSCAN_predicted_peptide_4|95_aa MATGQAGGARELRPQKQPSSNDRPAVDEKPCWARWHITNPSDSPTLDKHRELLEHGNAGI NVPSTTLNQKAQELVVKQPAPHFGSPAPWWDNPKL >gi568815592f:33521603_33795777|GENSCAN_predicted_CDS_4|288_bp atggccacagggcaggctggaggtgccagggagttaagaccccagaagcagccttcaagc aatgaccgaccagcagttgatgaaaaaccctgctgggcacggtggcacataactaatcct agtgactcgccaactctggacaaacacagagagctgctggaacatgggaacgctgggatt aacgtccccagtactaccctcaaccaaaaagcacaggagttagtggttaaacaacctgct ccccactttggctctcccgccccttggtgggacaacccgaagctgtag >gi568815592f:33521603_33795777|GENSCAN_predicted_peptide_5|66_aa MSEMSSFLHIGDIVSLYAEGSVNGFISTLGNCFLQSTLILGQILVTSRKPFYAGVPPPSA NTSNIS >gi568815592f:33521603_33795777|GENSCAN_predicted_CDS_5|201_bp atgagtgaaatgtccagctttcttcacatcggggacatcgtctccctgtacgccgagggc tccgtcaatggcttcatcagcactttggggaactgcttcctccagtccaccttgatcctg gggcagatcctggtgacctctagaaagcccttctacgccggggtaccccctccctctgca aacacctccaacatctcctga >gi568815592f:33521603_33795777|GENSCAN_predicted_peptide_6|2698_aa MRKSRGGALRPGTSVPSEITFMGSLRPNEYAQRHGLVDDRCVVEPAAGDLDNPPKKFRDC LFKVCPMNRYSAQKQYWKAKQTKQDKEKIADVVLLQKLQHAAQMEQKQNDTENKKVHGDV VKYGSVIQLLHMKSNKYLTVNKRLPALLEKNAMRVTLDATGNEGSWLFIQPFWKLRSNGD NVVVGDKVILNPVNAGQPLHASNYELSDNAGCKEVNSVNCNTSWKINLFMQFRDHLEEVL KGGDVVRLFHAEQEKFLTCDEYKGKLQVFLRTTLRQSATSATSSNALWEVEVVHHDPCRG GAGHWNGLYRFKHLATGNYLAAEENPSYKGDASDPKAAGMGAQGRTGRRNAGEKIKYCLV AVPHGNDIASLFELDPTTLQKTDSFVPRNSYVRLRHLCTNTWIQSTNVPIDIEEERPIRL MLGTCPTKEDKEAFAIVSVPVSEIRDLDFANDASSMLASAVEKLNEGFISQNDRRFVIQL LEDLVFFVSDVPNNGQNVLDIMVTKPNRERQKLMREQNILKQVFGILKAPFREKGGEGPL VRLEELSDQKNAPYQHMFRLCYRVLRHSQEDYRKNQEHIAKQFGMMQSQIGYDILAEDTI TALLHNNRKLLEKHITKTEVETFVSLVRKNREPRFLDYLSDLCVSNHIAIPVTQELICKC VLDPKNSDILIRTELLVVCRLRPVKEMAQSHEYLSIEYSEEEVWLTWTDKNNEHHEKSVR QLAQEARAGNAHDENVLSYYRYQLKLFARMCLDRQYLAIDEISQQLGVDLIFLCMADEML PFDLRASFCHLMLHVHVDRDPQELVTPVKFARLWTEIPTAITIKDYDSNLNASRDDKKNK FANTMEFVEDYLNNVVSEAVPFANEEKNKLTFEVVSLAHNLIYFGFYSFSELLRLTRTLL GIIDCVQGPPAMLQAYEDPGGKNVRRSIQGVGHMMSTMVLSRKQSVFSAPSLSAGASAAE PLDRSKFEENEDIVVMETKLKILEILQFILNVRLDYRISYLLSVFKKEFVEVFPMQDSGA DGTAPAFDSTTANMNLDRIGEQAEAMFGVGKTSSMLEVDDEGGRMFLRVLIHLTMHDYAP LVSGALQLLFKHFSQRQEAMHTFKQVQLLISAQDVENYKVIKSELDRLRTMVEKSELWVD KKGSGKGEEVEAGAAKDKKERPTDEEGFLHPPGEKSSENYQIVKGILERLNKMCGVGEQM RKKQQRLLKNMDAHKVMLDLLQIPYDKGDAKMMEILRYTHQFLQKFCAGNPGNQALLHKH LHLFLTPGLLEAETMQHIFLNNYQLCSEISEPVLQHFVHLLATHGRHVQYLDFLHTVIKA EGKYVKKCQDMIMTELTNAGDDVVVFYNDKASLAHLLDMMKAARDGVEDHSPLMYHISLV DLLAACAEGKNVYTEIKCTSLLPLEDVVSVVTHEDCITEVKMAYVNFVNHCYVDTEVEMK EIYTSNHIWTLFENFTLDMARVCSKREKRVADPTLEKYVLSVVLDTINAFFSSPFSENST SLQTHQTIVVQLLQSTTRLLECPWLQQQHKGSVEACIRTLAMVAKGRAILLPMDLDAHIS SMLSSGASCAAAAQRNASSYKATTRAFPRVTPTANQWDYKNIIEKLQDIITALEERLKPL VQAELSVLVDVLHWPELLFLEGSEAYQRCESGGFLSKLIQHTKDLMESEEKLCIKVLRTL QQMLLKKTKYGDRGNQLRKMLLQNYLQNRKSTSRGDLPDPIGTGLDPDWSAIAATQCRLD KEGATKLVCDLITSTKNEKIFQESIGLAIHLLDGGNTEIQKSFHNLMMSDKKSERFFKVL HDRMKRAQQETKSTVAVNMNDLGSQPHEDREPVDPTTKGRVASFSIPGSSSRYSLGPSLR RGHEVSERVQSSEMGTSVLIMQPILRFLQLLCENHNRDLQNFLRCQNNKTNYNLVCETLQ FLDIMCGSTTGGLGLLGLYINEDNVGLVIQTLETLTEYCQGPCHENQTCIVTHESNGIDI ITALILNDISPLCKYRMDLVLQLKDNASKLLLALMESRHDSENAERILISLRPQELVRLG RKRALEQALPLQVDVIKKAYLQEEERENSEVSPREVGHNIYILALQLSRHNKQLQHLLKP VKRIQEEEAEGISSMLSLNNKQLSQMLKSSAPAQEEEEDPLAYYENHTSQIEIVRQDRSM EQIVFPVPGICQFLTEETKHRLFTTTEQDEQGSKVSDFFDQSSFLHNEMEWQRKLRSMPL IYWFSRRMTLWGSISFNLAVFINIIIAFFYPYMEGASTGVLDSPLISLLFWILICFSIAA LFTKRYSIRPLIVALILRSIYYLGIGPTLNILGALNLTNKIVFVVSFVGNRGTFIRGYKA MVMDMEFLYHVGYILTSVLGLFAHELFYSILLFDLIYREETLFNVIKSVTRNGRSILLTA LLALILVYLFSIVGFLFLKDDFILEVDRLPNNHSTASPLGMPHGAAAFVDTCSGDKMDCV SGLSVPEVLEEDRELDSTERACDTLLMCIVTVMNHGLRNGGGVGDILRKPSKDESLFPAR VVYDLLFFFIVIIIVLNLIFGVIIDTFADLRSEKQKKEEILKTTCFICGLERDKFDNKTV SFEEHIKLEHNMWNYLYFIVLVRVKNKTDYTGPESYVAQMIKNKNLDWFPRMRAMSLVSN EGEGEQNEIRILQDKLNSTMKLVSHLTAQLNELKEQMTEQRKRRQRLGFVDVQNCISR >gi568815592f:33521603_33795777|GENSCAN_predicted_CDS_6|8097_bp atgaggaagagtcggggtggggcgctaaggccagggacgtcggtgccatcagagatcacc ttcatgggatcccttaggcctaatgagtacgcacagagacacgggctggtggatgaccgc tgtgtggtggagcccgcggccggggacctggacaacccccctaagaagttccgtgactgc ctcttcaaggtgtgccccatgaaccgctactcggcccagaagcagtactggaaggccaag cagactaagcaggacaaggagaagatcgctgatgtggtgttgctgcagaagctgcagcat gcggcgcagatggagcagaagcaaaatgacacggagaacaagaaggtgcatggggatgtc gtgaagtatggcagtgtgatccagctcctgcacatgaagagcaacaagtacctgacagtg aacaagcggcttccggccttgctggagaagaacgccatgcgggtgactctggatgccaca ggcaacgagggttcctggctcttcatccagcccttctggaagctgcggagcaacggggac aacgtggtcgtgggggacaaggtgatcctgaatcctgtcaatgccgggcagcctctgcat gccagcaattacgagctcagcgacaacgccggctgcaaggaggtcaattctgtgaactgc aacaccagctggaagatcaacctgtttatgcagtttcgggaccacctggaggaggtgttg aaagggggagacgtggtgcggctgttccatgcggagcaggagaagttcctgacgtgtgac gagtacaagggcaagctgcaggtgttcctgcgaactacactgcgccagtctgccacctcg gccaccagctccaatgctctctgggaggtggaggtggtccaccacgacccctgccgtgga ggagctgggcactggaatggcttgtaccgcttcaagcacctggctacaggcaactacctg gctgctgaggagaaccccagttacaaaggtgatgcctcagatcccaaggcagcaggaatg ggggcacagggccgcacaggccgcaggaatgctggggagaagatcaagtactgcctggtg gctgtgcctcatggcaatgacatcgcctctctctttgagctggaccccaccaccttgcag aaaaccgactctttcgtgccccggaactcgtacgtccggctgcggcacctctgcaccaac acgtggattcagagcaccaatgtgcccattgacatcgaggaggagcggcccatccggctc atgctgggcacctgccccaccaaggaggacaaggaggcctttgccatcgtgtcagtgccc gtgtctgagatccgagacctggactttgccaatgacgccagctccatgctggccagtgcc gtggagaaactcaacgagggcttcatcagccagaatgaccgcaggtttgtcatccagctg ctggaagacctggtgttctttgtcagcgatgtccccaacaatgggcagaatgtcctggac atcatggtcactaagcccaaccgggaacggcagaagctgatgagggagcagaacatcctc aaacaggtctttggcattctgaaggccccgttccgtgagaaggggggtgaaggtcccctg gtgcggctggaggagctgtcagaccagaagaacgccccctaccagcacatgttccgcctg tgctaccgcgtgttgcggcattcccaggaggactaccgcaagaaccaggagcacattgcc aagcagtttgggatgatgcagtcccagattggctacgacatcctggccgaggacaccatc actgccctgctgcacaacaaccgcaagctcctggaaaagcacatcaccaagaccgaggtg gagaccttcgtcagccttgtgcgcaagaaccgggagcccaggttcctggactacctctct gacctgtgtgtgtccaaccacatcgccatccccgtcacccaagagctcatctgcaagtgt gtgctggaccccaagaacagtgacattctcatccggaccgagttgctggtggtctgcagg cttcggcccgtgaaggagatggcccaatcccacgagtacctgagcatcgagtactcagaa gaggaagtgtggctcacgtggactgacaagaataacgagcatcatgagaagagtgtgagg cagctggcccaggaggcgcgggccggcaacgcccacgacgagaatgtgctcagctactac aggtaccagctgaagctctttgcccgcatgtgcttggaccgccagtacttggccatcgac gagatctcccagcagctgggcgtggacctgattttcctgtgcatggcagacgagatgctg ccctttgacctgcgcgcctccttctgccacctgatgctgcacgtgcacgtggaccgtgac ccccaggagctggtcacgccggtcaagtttgcccgtctctggactgagatccccacagcc atcaccatcaaggactatgattccaacctcaacgcgtcccgagatgacaagaagaacaag tttgccaacaccatggagttcgtggaggactacctcaacaatgtagtcagcgaggccgtg ccctttgccaacgaggagaagaacaagctcacttttgaggtggtcagcctggcgcacaat ctcatctacttcggcttctacagcttcagcgagctgctgcggctcactcgcacactgctg ggcatcatcgactgtgtgcaggggcccccggccatgctgcaggcctatgaggaccccggt ggcaagaatgtgcggcggtccatccagggcgtggggcacatgatgtccaccatggtgctg agccgcaagcagtccgtcttcagtgcccccagcctgtctgctggggccagtgctgctgag ccgctggacagaagcaagtttgaggagaatgaggacattgtggtgatggagaccaagctg aagatcctggaaatccttcagttcatcctcaatgtccgcctggattaccgcatatcctac ctgctgtctgtcttcaagaaggagtttgtggaggtgtttcccatgcaggacagtggggct gatggcacagcccctgccttcgactctaccactgccaacatgaacctggatcgcatcggg gagcaggcggaggccatgtttggagtggggaagacaagcagcatgctggaggtggatgac gagggcggccgcatgttcctgcgcgtgctcatccacctcaccatgcacgactatgcgccg ctggtctcgggtgccctgcagctgctcttcaagcacttcagccagcgccaggaggccatg cacaccttcaagcaggttcagctgctgatctcagcgcaggacgtggagaactacaaggtg atcaagtcggagctggaccggctgcggaccatggtggagaagtcagagctgtgggtggac aagaagggcagtggcaagggtgaggaggtggaggcaggcgccgccaaggacaagaaagag cgtcccacggacgaggagggctttctgcacccaccaggggagaaaagcagtgagaactac cagatcgtcaagggcatcctggaaaggctgaacaagatgtgcggggttggggagcaaatg aggaagaagcagcaacggctgctgaagaacatggatgcccacaaggtcatgctggacctg ctgcagatcccctatgacaagggtgatgccaagatgatggagatcctgcgctacacgcac cagttcctgcagaagttctgtgcagggaaccccggcaaccaggccctgctgcacaaacac ctgcacctcttcctcacgccagggctcctggaggcagagaccatgcagcacatcttcctg aacaactatcagctctgctccgagatcagcgagcctgtgttgcagcacttcgtgcacctg ctggccacgcacgggcgccatgtgcagtacctggacttcctgcacaccgtcattaaggcc gagggcaagtacgtcaagaagtgccaggacatgatcatgactgagctgaccaatgcaggt gacgatgtggtcgtgttctacaatgataaggcatcgctggcccacctgctggacatgatg aaggccgcccgcgacggcgtggaggaccacagccccctcatgtaccacatttccctggtg gacctgctggccgcctgtgccgagggcaaaaacgtctacactgagatcaagtgcacctcc ctgctgccgctggaggacgtggtgtctgtggtgacgcatgaggactgcatcactgaggtg aaaatggcctatgtgaacttcgtgaaccactgctacgtggacacggaggtggagatgaag gagatctacaccagcaaccacatctggacgctctttgagaacttcaccctggacatggcc cgggtctgcagcaagcgtgagaagcgcgtggctgaccccaccttggagaagtacgtgctg agcgttgtgctggacaccatcaacgccttcttcagctccccattctctgagaacagcact tccctgcagacacaccagacgattgtggtgcagctgctgcagtctaccacacgcctcctc gagtgtccgtggctacagcagcagcacaagggctccgtggaggcctgcatccggaccctc gccatggtggccaagggccgggccatcttgctgcccatggacctggatgcccacatcagc tcgatgctcagcagtggagccagctgtgcagctgccgcccagcggaacgcctccagctac aaggcaaccacgcgggccttcccccgcgtcacccccaccgccaaccagtgggactacaag aacatcattgagaagctgcaggacatcatcacagccctggaggagcggctgaagcccctg gtacaggctgagctgtccgtgctggtggatgtcctgcactggcctgagctgctcttcctg gagggcagtgaggcctaccagcgctgcgagagtgggggcttcctgtccaagctgatccag cacaccaaggacctcatggagtcggaggagaagctgtgcatcaaggtgctgcggaccctg cagcagatgctgctcaagaagaccaagtacggggaccggggcaaccagctgcgcaagatg ctgctgcaaaactacctccagaaccggaagtccacctcgcggggggaccttcccgacccc ataggcactggcctggacccagactggtcggcaatcgcagccacccagtgccggctggac aaggagggggccaccaagttggtatgcgacctcatcaccagcaccaagaacgagaagatc ttccaggagagcatcggcctggccatccacctgctggatggtggcaacacagagatccag aaatccttccacaacctgatgatgagtgacaagaagtcagagcgcttcttcaaggtgctg cacgaccgcatgaagcgggcccagcaggagaccaagtccacggtggcagtcaacatgaat gacctgggcagccagccacatgaggaccgcgagccagtcgaccccaccaccaaaggccgc gtggcctccttctcgatacctggctcctcatcccgctactcgctgggccccagcctgcgc cgggggcacgaggtgagcgaacgtgtgcagagcagtgagatgggcacatccgtgctcatc atgcagcccatcctgcgctttctgcagctgctgtgtgagaaccacaaccgggacctgcag aacttcctgcgctgtcagaacaacaaaaccaactacaacttggtatgcgagacgctgcag ttcctggacatcatgtgcggcagcaccacgggcggcctggggctgctggggctctacatc aatgaggacaacgtgggcctcgtcatccagaccttggagaccctcactgagtactgccag ggcccctgccatgagaaccagacttgcattgtgactcacgagtccaatggcatagacatc atcaccgcactgatcctcaatgacatcagccccctgtgcaagtaccgcatggatctggtg ctgcagctcaaggacaatgcctccaagctgctcctggctctgatggagagccggcatgac agtgaaaatgctgagcgaatcctcatcagcctgcggccccaggagctggtgaggctgggc aggaagagagcattggaacaggcactgcccctccaggtggacgtcatcaagaaggcctac ctgcaggaggaagagcgtgagaactcggaggtgagcccacgtgaagtgggccataacatc tatatcctggcgctgcagctctccaggcacaataaacagctgcagcacctgctgaagccg gtgaagcgcattcaagaggaggaggccgagggtatctcttccatgctcagcctcaacaac aagcagctgtcacagatgctcaagtcctcagcgccagcacaggaggaggaggaagacccc ctggcctactatgagaaccacacgtcccagatcgagattgtgcggcaggaccgcagcatg gagcagatcgtgttcccagtgcccggcatctgccagttcctgacggaggaaaccaagcac cggctcttcaccactactgagcaggacgagcagggcagcaaagtgagcgacttcttcgac cagtcctccttcctgcacaacgagatggagtggcagcgcaagctccgcagcatgccgctg atctactggttctcccgccgcatgaccctgtggggcagcatctccttcaacctggccgtg tttatcaacatcatcattgccttcttctacccttacatggagggcgcgtccacaggcgtg ctggactcccctctcatctcattgctcttctggatcctcatctgcttctccatcgcggcc ctgttcaccaagcgctacagcatccgccccctcatcgtggcgctcatcctgcgctccatc tactatctgggcatcgggcccacactcaacatcctgggtgccctcaatctgaccaacaag atcgtgtttgtggtgagcttcgtgggcaaccgtggcaccttcatccggggctataaggcc atggtcatggacatggaattcctctaccacgtgggctacatcctgaccagtgtcctgggc ctctttgctcatgagctgttctacagcatcctgctctttgacctcatctaccgcgaggag acgctgttcaacgtcatcaagagtgtgacccgcaatggccgctccatcctgctgacagcc ctgctggccctcatcctggtctacctcttctccatcgtcggcttcctcttcctcaaggat gacttcattctcgaggtcgaccggctgcccaacaaccactccacagccagccccctgggg atgccacatggagctgctgcatttgtggacacctgcagtggggacaagatggactgtgtc tcagggctctcggtgcctgaggtcctggaagaggacagggagctggacagcacagagcgg gcctgtgacactctgttgatgtgcatcgtcactgtcatgaaccatgggctacgcaacggt ggtggcgtgggcgacattctccgcaagccctccaaagatgagtctctcttcccagcccga gtggtctatgacctcctgttcttcttcatcgtcatcatcattgtgctgaacctcatcttt ggggtaatcatcgacaccttcgctgacctgcgtagtgagaagcagaagaaggaggagatt cttaagacgacatgcttcatctgtggtctggagagggacaagtttgataacaagacagtg tcatttgaggaacacatcaagctggagcacaacatgtggaactacttgtacttcattgtg ctggtccgcgtgaagaacaagaccgactacacgggccctgagagctacgtggcccagatg atcaagaacaagaacctggactggttcccccggatgcgggccatgtcccttgtcagcaat gagggcgagggggagcagaatgagattcggattctccaggacaagctcaactccaccatg aagctggtgtcccacctcactgcccagctcaacgagctcaaggagcagatgacggagcag cggaaacgcaggcaacgcctaggctttgtggatgtccagaactgcattagccgctga >gi568815592f:33521603_33795777|GENSCAN_predicted_peptide_7|126_aa MAASRYRRFLKLCEEWPVDETKRGRDLGAYLRQRVAQAFREGENTQVAEPEACDQMYESL ARLHSNYYKHKYPRPRDTSFSGLSLEEYKLILSTDTLEELKEIDKGMWKKLQEKFAPKGP EEDHKA >gi568815592f:33521603_33795777|GENSCAN_predicted_CDS_7|381_bp atggcggccagccggtaccggcgttttcttaagctctgtgaggaatggccagtggacgag accaaacggggccgggacttgggcgcttacctgcgacagcgggtagcacaggcctttcgg gagggagagaatacccaggttgcagagcctgaggcctgtgatcagatgtacgagagctta gcgcgactccattcaaactactacaaacacaagtaccctcgccccagagacaccagcttc agtggcctgtcgttggaagagtacaagctgatcctgtccacagacaccttggaagagctt aaggaaatagataaaggcatgtggaagaaactgcaggagaagtttgcccccaagggtcct gaggaggatcataaggcctga >gi568815592f:33521603_33795777|GENSCAN_predicted_peptide_8|412_aa MVVQNSADAGDMRAGVQLEPFLHQVGGHMSVMKYDEHTVCKPLVSREQRFYESLPLAMKR FTPQYKGTVTVHLWKDSTGHLSLVANPVKESQEPFKVSTESAAVAIWQTLQQTTGSNGSD CTLAQWPHAQLARSPKESPAKALLRSEPHLNTPAFSLVEDTNGNQVERKSFNPWGLQCHQ AHLTRLCSEYPENKRHRILAIPLTCGPKGGATERHRGAEMGTRQHGDDASEEKKARHMRK CAQSTSACLGVRICGMQVYQTDKKYFLCKDKYYGRKLSVEGFRQALYQFLHNGSHLRREL LEPILHQLRALLSVIRSQSSYRFYSSSLLVIYDGQEPPERAPGSPHPHEAPQAAHGSSPG GLTKVDIRMIDFAHTTYKGYWNEHTTYDGPDPGYIFGLENLIRILQDIQEGE >gi568815592f:33521603_33795777|GENSCAN_predicted_CDS_8|1239_bp atggttgtgcaaaacagcgcagacgccggggacatgagggcaggcgtgcagctggagccc ttcctgcaccaggtcggggggcacatgagcgtgatgaagtatgacgagcatacggtgtgc aagcccctcgtctcccgggagcagaggttctatgaatccctgccgctggccatgaagcgg ttcaccccacagtacaaaggtaccgtcacagtgcacctctggaaagacagcacaggccat ctcagcttggttgccaacccagtgaaggagagccaggagcccttcaaggtctccacagag tcggcggcggtggccatatggcagacgctccagcagaccaccggcagcaatggcagcgac tgcacccttgcccagtggccgcatgcccagctggcacgctcacccaaggagagcccggcc aaggctcttctgaggtccgagccccacctcaacactccagccttctcgctggtggaagac accaacggaaaccaggttgagaggaagagcttcaacccgtggggcctgcaatgccaccag gcccacctgacccgcctgtgctccgagtacccagagaacaagcggcatcgtatccttgcc atccccctcacatgtggtcccaagggcggggcgacagagagacacagaggggcagagatg gggacccggcagcacggcgatgatgcatcggaggagaagaaggcccgccacatgaggaag tgtgcgcagagcacctcagcctgcctgggtgtgcgcatctgcggcatgcaggtttatcaa acagataagaagtactttctctgcaaagacaagtactatggaagaaaactctcagtggag gggttcagacaagccctctatcagttcctacataatggaagccacctccggagggagctc ctggagcccatcctgcaccagctccgggccctcctctctgtcattaggagccagagttca taccgcttctattccagctctctccttgtcatctatgatgggcaggaaccaccagaaaga gccccaggcagcccgcatcctcacgaggctccccaggcagcccacggtagctctcccggt ggtctcaccaaggttgacatccgcatgattgactttgctcataccacatacaagggctac tggaatgagcacaccacctacgatggaccagaccctggctatatttttggcctggaaaac ctcatcaggatcctgcaggatatccaagagggagaatga >gi568815592f:33521603_33795777|GENSCAN_predicted_peptide_9|810_aa MATDLPIMARGPARSAAPAGGSSSGCGARQGRAGGGVLAMAGLSDLELRRELQALGFQPG PITDTTRDVYRNKLRRLRGEARLRDEERLREEARPRGEERLREEARLREDAPLRARPAAA SPRAEPWLSQPASGSAYATPGAYGDIRPSAASWVGSRGLAYPARPAQLRRRASVRGSSEE DEDARTPDRATQGPGLAARRWWAASPAPARLPSSLLGPDPRPGLRATRAGPAGAARARPE VGRRLERWLSRLLLWASLGLLLVFLGILWVKMGKPSAPQEAEDNMKLLPVDCERKTDEFC QAKQKAALLELLHELYNFLAIQAGNFECGNPENLKSKCIPVMEAQEYIANVTSSSSAKFE AALTWILSSNKDVGIWLKGEDQSELVTTVDKVVCLESAHPRMGVGCRLSRALLTAVTNVL IFFWWIPVWTSFEQEAQQGLGLLLLPEKYKHLLESYRYLMKARRTQGAALKGGHQSYPLT SGAAFEQRPQGVSEGERPSLAFLWGLLILLKYRWRKLEEEEQAMYEMVKKIIDVVQDHYV DWEQDMERYPYVGILHVRDSLIPPQSRSCDSRPGLQPERRFRELGSPASGPRASHLLCAK IALLYCDHSPDAGLFNKNFSAAGVSVTPAAALGGGRGAVAAGCPLPATREQRPRRRPPAL PPAPQPAPWEAGSRSTPTGGLIKCEKPLQRVFPRPQGQSPQEGSQSCHAPWGLWAVVVTF PVHSLQSMGPETRLPWREGDRQSMKLSPTLERRGEPDTSLQPALVHKAPRAYYWGPQQKM RGLLLHLTLGGSHSERAAASPATEVVTYLR >gi568815592f:33521603_33795777|GENSCAN_predicted_CDS_9|2433_bp atggcgaccgaccttcccatcatggcgcgtggccccgcccgctccgccgcgcctgcggga gggagcagttccgggtgcggtgcgcgccaggggcgggcggggggcggcgtcctggccatg gccggcctgtcggacctggaactgcggcgggagctgcaggccctgggcttccagccagga cccatcaccgacaccacccgggatgtctaccgcaacaagctgcgccgcctgcggggcgag gcccggctgcgcgacgaggagcggctgcgggaggaggcccggccgcggggcgaggagcgg ttacgggaagaggcccggttacgcgaggatgcgccgctgcgcgcccggcccgccgcggcc tctccgcgggcggagccctggctctcccagccggcctcgggctcggcctacgcgacccct ggggcctacggtgatatccggccctccgcggcttcctgggtagggagccgcggcctcgcc tatcctgcccgcccggcgcaactcaggcgccgcgcctcggtccggggcagctccgaggag gacgaggacgcccggacgcccgacagggccacgcagggcccgggtctcgcggcccgccgc tggtgggcagcgtctcccgccccggcgcggctgccttcctccctcctcggtcccgacccg cgcccgggcctgcgggcgactcgagcgggccctgctggcgcggcgagggcccggcctgag gtggggcgccggctggagcgctggctctctcggcttctgctctgggccagcctagggcta ctgctcgtcttcctgggcatcctttgggtgaagatgggcaagccctcagcgccgcaggag gcggaggacaacatgaagttattgccagtggactgtgagagaaaaacagatgagttctgt caggccaagcagaaggcagccttgctggagctgctgcatgaactctacaatttcctggcc atccaagctggtaattttgagtgtggaaatccagagaatctaaaaagcaaatgcattcct gttatggaagcccaagaatatatagccaatgtgaccagcagctcctccgccaagtttgaa gccgcactgacctggatactgagcagtaacaaggacgtgggcatctggttgaaaggagaa gaccagtctgaattggtgacgactgtggacaaggtggtctgcctggaatctgcccacccc cgcatgggtgttggctgccgcctgagccgggccttgctcactgctgtcaccaacgtgctc atcttcttctggtggatcccagtgtggaccagcttcgagcaggaggcacagcaggggctt ggcctacttttactccctgaaaaatataaacatctcctggagagctacagatacctgatg aaagcaagaaggactcagggtgctgcactgaaaggagggcaccagtcttatcccctgaca tcgggggccgcctttgagcagagacctcagggcgtctctgaaggcgaaaggccaagcttg gcttttttgtgggggctcctaattctcctaaaatatcggtggcgaaagttagaagaggag gaacaagccatgtatgagatggtgaagaagattatagacgtggtccaggaccattacgtg gactgggagcaggacatggagcgctatccatatgtaggcatcctgcacgtgcgcgacagc ttgatccctccacagagccgaagctgcgactctagaccaggcctgcagccagaacgccga ttccgggagcttgggagccctgcgtcagggcccagagcctcgcacttgctgtgtgcgaag atcgccttgctttactgtgaccacagccccgacgcggggctgtttaacaagaacttcagc gcagccggcgtttctgttaccccggccgcggctcttggcggcgggagaggcgcagtggct gcaggctgccccctgccggccacaagggagcagcgtccgcgccgccgcccaccggccctc ccgccagctcctcagcctgctccctgggaggcaggctccagaagcaccccgacggggggc cttattaagtgcgagaagcctctccagcgagtcttccccaggccccaaggtcagtcaccc caagaaggcagccagtcctgccatgccccatggggcttgtgggctgttgtggtcaccttt cccgttcacagcctgcagagcatggggcctgagacaaggctgccctggagagaaggtgac aggcagagtatgaaactgtcccccactctggagaggaggggagagcctgatacatccctg cagcctgccctggtgcacaaagcccccagagcctactactgggggccccagcagaagatg aggggcttattgctgcatctgactcttggaggctcccattcagagagagcagctgcatcc ccagcgactgaggtagtcacatacctccggtaa