Miyakogusa Predicted Gene
- chr2.CM0031.300.nc
BLASTP 2.2.18 [Mar-02-2008]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= chr2.CM0031.300.nc + phase: 0
(927 letters)
Database: Medicago_aa2.0
38,834 sequences; 10,231,785 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
IMGA|AC131455_15.4 Argonaute and Dicer protein, PAZ; Stem cell s... 1583 0.0
IMGA|AC131455_31.4 Argonaute and Dicer protein, PAZ; Stem cell s... 1397 0.0
IMGA|CT030192_30.5 N-6 Adenine-specific DNA methylase; Argonaute... 660 0.0
IMGA|AC147429_4.4 Stem cell self-renewal protein Piwi chr00_pseu... 521 e-148
IMGA|AC160838_11.5 Argonaute and Dicer protein, PAZ; Stem cell s... 489 e-138
IMGA|CT030192_31.5 Stem cell self-renewal protein Piwi chr03_pse... 400 e-111
IMGA|CU179907_3.4 Argonaute and Dicer protein, PAZ; Stem cell se... 395 e-110
IMGA|AC136450_38.5 Argonaute and Dicer protein, PAZ chr02_pseudo... 211 1e-54
IMGA|CR931808_21.5 Ribonucleotide reductase chr05_pseudomolecule... 135 1e-31
IMGA|CU012043_14.5 Stem cell self-renewal protein Piwi chr03_pse... 116 4e-26
IMGA|AC202591_22.3 Stem cell self-renewal protein Piwi chr01_pse... 72 8e-13
IMGA|CU024897_18.4 Peptidase aspartic, active site; Saposin-like... 45 1e-04
>IMGA|AC131455_15.4 Argonaute and Dicer protein, PAZ; Stem cell
self-renewal protein Piwi chr05_pseudomolecule_IMGAG_V2
31662746-31670122 E EGN_Mt071002 20080227
Length = 908
Score = 1583 bits (4098), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 754/900 (83%), Positives = 828/900 (92%), Gaps = 9/900 (1%)
Query: 28 IVPADVEPVKVDLLDLPPEPVKKKLPTRLPIARKGLGSKGTKLPLLTNHFKVTVANSDGH 87
IVPAD+EP+K++ P+ VKKKLPT++P+AR+GLGSKG KLPLLTNHFKV V N+DG+
Sbjct: 16 IVPADIEPIKIE-----PQIVKKKLPTKVPMARRGLGSKGAKLPLLTNHFKVNVTNTDGY 70
Query: 88 FFQYSVALSYEDGRPVEGKGVGRKVIDKVQETYGSELNGKDFAYDGEKTLFTIGSLARNK 147
FFQYSVAL YEDGRPVEGKG GRK++D+VQETYGSELNGKD AYDGEKTLFTIGSLA+NK
Sbjct: 71 FFQYSVALFYEDGRPVEGKGAGRKILDRVQETYGSELNGKDLAYDGEKTLFTIGSLAQNK 130
Query: 148 LEFTVVLEDVISNRNNGNCSPDG-ASTNDSDKKRMRRPYHSKTFKVEISFAAKIPLQAIV 206
LEFTVVLEDV SNRNNGN SPDG S ND+D+KR+++ + SKT+KVEISFA+KIPLQAI
Sbjct: 131 LEFTVVLEDVTSNRNNGNASPDGHGSPNDTDRKRLKKSHRSKTYKVEISFASKIPLQAIA 190
Query: 207 NALRGQESENYQEAIRVLDIILRQHAAKQGCLLVRQSFFHNDPKNYADVGGGVLGCRGFH 266
NAL+G E+ENYQEAIRVLDIILRQHAAKQGCLLVRQ+FFHNDPKN+ DVGGGVLGCRG H
Sbjct: 191 NALKGHETENYQEAIRVLDIILRQHAAKQGCLLVRQNFFHNDPKNFTDVGGGVLGCRGLH 250
Query: 267 SSFRTTQSGLSLNIDVSTTMIIQPGPVVDFLIANQNVRDPFSLDWAKAKRTLKNLRIKAS 326
SSFRTTQSGLSLNIDVSTTMI+ PGPVVDFLIANQNVRDPFSLDW KAKRTLKNLRI S
Sbjct: 251 SSFRTTQSGLSLNIDVSTTMIVHPGPVVDFLIANQNVRDPFSLDWNKAKRTLKNLRITTS 310
Query: 327 PSNQEYKITGLSELPCKEQTFTMKKKGGNNGEEDATEEEITVYEYFVNYRKIDLRYSADL 386
P+NQEYKITGLSE+PCK+Q FT+KK+G GE+D EEITVY+YFVN RKI L+YSADL
Sbjct: 311 PTNQEYKITGLSEMPCKDQLFTLKKRGAVPGEDDT--EEITVYDYFVNRRKISLQYSADL 368
Query: 387 PCINVGKPKRPTYVPVELCSLVSLQRYTKALTTLQRSSLVEKSRQKPLERMNVLNQALKT 446
PCINVGKPKRPT+VPVELCSLVSLQRYTKAL+TLQRSSLVEKSRQKP ERM VL ALKT
Sbjct: 369 PCINVGKPKRPTFVPVELCSLVSLQRYTKALSTLQRSSLVEKSRQKPQERMRVLTDALKT 428
Query: 447 SNYGNEPMLKNCGITIASGFTQVEGRVLQAPRLKFGNGEDFNPRNGRWNLNNKKVVRPAK 506
S+YG+EPML+NCGI+I SGFTQV+GRVLQAPRLKFGNGEDFNPRNGRWN NNKK+V+P K
Sbjct: 429 SDYGSEPMLRNCGISITSGFTQVDGRVLQAPRLKFGNGEDFNPRNGRWNFNNKKIVQPVK 488
Query: 507 IEHWAVVNFSARCDVRGLVRDLIKCARLKGIPIDEPYEEIFEENGQFRRAPPLVRVEKMF 566
IE WAVVNFSARCDVRGLVRDLIKC +KGI +++P++ FEENGQFRRAPPLVRVEKMF
Sbjct: 489 IEKWAVVNFSARCDVRGLVRDLIKCGGMKGIHVEQPFD-CFEENGQFRRAPPLVRVEKMF 547
Query: 567 ERIQKELPGAPSFLLCLLPERKNSDLYGPWKKKNLAEYGIVTQCISPTRVNDQYLTNVLM 626
E +Q +LPGAP FLLCLL ERKNSDLYGPWKKKNLAE+GIVTQCI+PTRVNDQYLTNVL+
Sbjct: 548 EHVQSKLPGAPKFLLCLLSERKNSDLYGPWKKKNLAEFGIVTQCIAPTRVNDQYLTNVLL 607
Query: 627 KINAKLGGLNSVLGVEMNPSIPIVSKVPTIILGMDVSHGSPGQSDIPSIAAVVSSREWPL 686
KINAKLGG+NS+LGVE +PSIPIVSK PT+ILGMDVSHGSPGQ++IPSIAAVVSSR+WPL
Sbjct: 608 KINAKLGGMNSLLGVEHSPSIPIVSKAPTLILGMDVSHGSPGQTEIPSIAAVVSSRQWPL 667
Query: 687 ISKYRACVRTQSPKVEMIDNLFKQVSEKEDEGIIRELLIDFYSSSGKRKPDNIIIFRDGV 746
ISKYRACVRTQ KVEMIDNLFK VS+ EDEGIIRELLIDFY+SSG RKPDNIIIFRDGV
Sbjct: 668 ISKYRACVRTQGAKVEMIDNLFKPVSDTEDEGIIRELLIDFYNSSGNRKPDNIIIFRDGV 727
Query: 747 SESQFNQVLNIELNQIIEACKFLDETWNPKFLVIVAQKNHHTKFFQPGSPDNVPPGTVID 806
SESQFNQVLNIEL+QIIEACKFLDE WNPKFLVIVAQKNHHTKFFQPGSPDNVPPGTV+D
Sbjct: 728 SESQFNQVLNIELSQIIEACKFLDEKWNPKFLVIVAQKNHHTKFFQPGSPDNVPPGTVVD 787
Query: 807 NKICHPRNNDFYMCAHAGMIGTSRPTHYHVLLDDIGFSPDELQELVHSLSYVYQRSTTAI 866
NKICHPRN DFYMCAHAGMIGTSRPTHYHVLLD+IGFSPD+LQELVHSLSYVYQRSTTAI
Sbjct: 788 NKICHPRNYDFYMCAHAGMIGTSRPTHYHVLLDEIGFSPDDLQELVHSLSYVYQRSTTAI 847
Query: 867 SVVAPICYAHLAATQIGQFMKFEDKSDTSSSHGGLTAAGVAPVVPQLPKLQDSVSSSMFF 926
SVVAPICYAHLAA+Q+GQFMKFEDKS+TSSSHGG A +PQLPKL DSV +SMFF
Sbjct: 848 SVVAPICYAHLAASQVGQFMKFEDKSETSSSHGGSGRDINASPIPQLPKLMDSVCNSMFF 907
>IMGA|AC131455_31.4 Argonaute and Dicer protein, PAZ; Stem cell
self-renewal protein Piwi chr05_pseudomolecule_IMGAG_V2
31672438-31678434 E EGN_Mt071002 20080227
Length = 868
Score = 1397 bits (3617), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 681/881 (77%), Positives = 760/881 (86%), Gaps = 42/881 (4%)
Query: 46 EPVKKKLPTRLPIARKGLGSKGTKLPLLTNHFKVTVANSDGHFFQYSVALSYEDGRPVEG 105
E +KKKLPT+ P+AR+GLG+KG KLPLLTNHF+V VAN++ FFQYSVAL YEDGRPVEG
Sbjct: 29 EHLKKKLPTKAPMARRGLGTKGAKLPLLTNHFEVNVANTNRVFFQYSVALFYEDGRPVEG 88
Query: 106 KGVGRKVIDKVQETYGSELNGKDFAYDGEKTLFTIGSLARNKLEFTVVLEDVISNRNNGN 165
KG GRK+IDKVQETY SELNGKD AYDGE TL NN N
Sbjct: 89 KGAGRKIIDKVQETYDSELNGKDLAYDGE-TL------------------------NNAN 123
Query: 166 CSPDGASTNDSDKKRMRRPYHSKTFKVEISFAAKIPLQAIVNALRGQESENYQEAIRVLD 225
SP DKKR+R+ Y SKT+KVEI+FA +IPLQAI NAL+G E+ENYQEAIRVLD
Sbjct: 124 TSP--------DKKRIRKSYRSKTYKVEINFAKEIPLQAIANALKGHEAENYQEAIRVLD 175
Query: 226 IILRQHAAKQGCLLVRQSFFHNDPKNYADVGGGVLGCRGFHSSFRTTQSGLSLNIDVSTT 285
IILRQH+AKQGCLLVRQ+FFHNDP N DVGGGVL C+G HSSFRTTQSGLSLNIDVSTT
Sbjct: 176 IILRQHSAKQGCLLVRQNFFHNDPNNLNDVGGGVLSCKGLHSSFRTTQSGLSLNIDVSTT 235
Query: 286 MIIQPGPVVDFLIANQNVRDPFSLDWAKAKRTLKNLRIKASPSNQEYKITGLSELPCKEQ 345
MI++PGPVVDFLI NQNVRDPFSLDW KAKRTLKNLRI A PSNQEYKITGLSEL CK+Q
Sbjct: 236 MIVRPGPVVDFLIENQNVRDPFSLDWNKAKRTLKNLRITAKPSNQEYKITGLSELSCKDQ 295
Query: 346 TFTMKKKGGNNGEEDATEEEITVYEYFVNYRKIDLRYSADLPCINVGKPKRPTYVPVELC 405
FTMKK+G GE+D EEITVY+YFV+ RKIDL+YSA LPCINVGKPKRPTY+P+ELC
Sbjct: 296 LFTMKKRGAVAGEDDT--EEITVYDYFVHRRKIDLQYSAGLPCINVGKPKRPTYIPIELC 353
Query: 406 SLVSLQRYTKALTTLQRSSLVEKSRQKPLERMNVLNQALKTSNYGNEPMLKNCGITIASG 465
SL+SLQRYTKAL+T QRSSLVEKSRQKP+ERM VL+ ALK SNYG+EPML+NCGI+I S
Sbjct: 354 SLISLQRYTKALSTSQRSSLVEKSRQKPVERMRVLSNALKASNYGSEPMLRNCGISITSE 413
Query: 466 FTQVEGRVLQAPRLKFGNGEDFNPRNGRWNLNNKKVVRPAKIEHWAVVNFSARCDVRGLV 525
FTQV+GRVLQAPRLKFGN EDFNPRNGRWN NNKK V P + +W+VVNFSARCDVRGLV
Sbjct: 414 FTQVDGRVLQAPRLKFGN-EDFNPRNGRWNFNNKKFVEPVSLGNWSVVNFSARCDVRGLV 472
Query: 526 RDLIKCARLKGIPIDEPYEEIFEENGQFRRAPPLVRVEKMFERIQKELPGAPSFLLCLLP 585
RDLIKC +KGI +++P +++ EEN QF+ PP+ RVEKMF + K L PSFLLCLLP
Sbjct: 473 RDLIKCGGMKGILVEQP-KDVIEENRQFKGEPPVFRVEKMFADVLK-LSKRPSFLLCLLP 530
Query: 586 ERKNSDLYGPWKKKNLAEYGIVTQCISPTRVNDQYLTNVLMKINAKLGGLNSVLGVEMNP 645
ERKNSDLYGPWKKKNLAE+GIVTQCI+PTRVNDQYLTNVL+KINAKLGG+NS LGVE +
Sbjct: 531 ERKNSDLYGPWKKKNLAEFGIVTQCIAPTRVNDQYLTNVLLKINAKLGGMNSWLGVEHSR 590
Query: 646 SIPIVSKVPTIILGMDVSHGSPGQSDIPSIAAVVSSREWPLISKYRACVRTQSPKVEMID 705
SIPIVSKVPT+ILGMDVSHGSPGQ DIPSIAAVVSSR+WPLISKYRACVRTQ KVEMID
Sbjct: 591 SIPIVSKVPTLILGMDVSHGSPGQPDIPSIAAVVSSRKWPLISKYRACVRTQGSKVEMID 650
Query: 706 NLFKQVSEKEDEGIIRELLIDFYSSSGKRKPDNIIIFRDGVSESQFNQVLNIELNQIIEA 765
NLFK VS+KEDEGIIRELL+DF+ SS +R+P+NIIIFRDGVSESQFN+VLN+EL+QIIEA
Sbjct: 651 NLFKPVSDKEDEGIIRELLLDFFHSSEERRPENIIIFRDGVSESQFNEVLNVELSQIIEA 710
Query: 766 CKFLDETWNPKFLVIVAQKNHHTKFFQPGSPDNVPPGTVIDNKICHPRNNDFYMCAHAGM 825
CKFLDE WNPKF+VIVAQKNHHTKFFQP SPDNVPPGTV+D+KICHPRN DFYMCAHAGM
Sbjct: 711 CKFLDENWNPKFMVIVAQKNHHTKFFQPRSPDNVPPGTVVDSKICHPRNYDFYMCAHAGM 770
Query: 826 IGTSRPTHYHVLLDDIGFSPDELQELVHSLSYVYQRSTTAISVVAPICYAHLAATQIGQF 885
IGTSRPTHYHVLLD+IGFSPD+LQELVHSLSYVYQRSTTAISVVAPICYAHLAA+Q+GQF
Sbjct: 771 IGTSRPTHYHVLLDEIGFSPDDLQELVHSLSYVYQRSTTAISVVAPICYAHLAASQVGQF 830
Query: 886 MKFEDKSDTSSSHGGLTAAGVAPVVPQLPKLQDSVSSSMFF 926
MKFEDKS+TSSS GG+ A+ ++PQLP L V +SMFF
Sbjct: 831 MKFEDKSETSSSQGGINAS----LIPQLPNLHKRVCNSMFF 867
>IMGA|CT030192_30.5 N-6 Adenine-specific DNA methylase; Argonaute
and Dicer protein, PAZ; Stem cell self-renewal protein
Piwi chr03_pseudomolecule_IMGAG_V2 1284064-1280514 H
EGN_Mt071002 20080227
Length = 602
Score = 660 bits (1702), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 332/606 (54%), Positives = 431/606 (71%), Gaps = 30/606 (4%)
Query: 43 LPPEPVKKKLPT-RLPIARKGLGSKGTKLPLLTNHFKVTVANSDGHFFQYSVALSYEDGR 101
L E ++K L + +P+AR+GLGSKG K+ LL NHF+V ++ +DG+F+ Y+VAL Y+DG
Sbjct: 5 LNNEEMRKTLNSNHVPMARRGLGSKGAKIQLLANHFRVGLSKNDGYFYHYNVALCYQDGH 64
Query: 102 PVEGKGVGRKVIDKVQETYGSELNGKDFAYDGEKTLFTIGSLARNKLEFTVVLEDVISNR 161
VE KGVGRKVIDK+ ETY L K+FAYDGEK+LFT+ SL K EF VVLE+V S R
Sbjct: 65 AVEVKGVGRKVIDKLCETYDV-LRNKNFAYDGEKSLFTLRSLHHKKQEFIVVLEEVSSTR 123
Query: 162 NNGNCSPDGASTNDSDKKRMRRPYHSKTFKVEISFAAKIPLQAIVNALRGQESENYQEAI 221
N P A+ KRM+ SKTFKVEIS +KIPLQ I +ALRGQESE+YQEA
Sbjct: 124 VGSN--PSEAT------KRMKHQSRSKTFKVEISHVSKIPLQEITDALRGQESEHYQEAF 175
Query: 222 RVLDIILRQHAAKQGCLLVRQSFFHNDPKNYADVGGGVLGCRGFHSSFRTTQSGLSLNID 281
LD ILRQ+AAKQGCL + +S+FH++ KN ++ GG+ CRGFHSSFR TQ GLSLN+D
Sbjct: 176 NFLDTILRQNAAKQGCLRIHKSYFHDNQKNITNLEGGIQCCRGFHSSFRVTQRGLSLNVD 235
Query: 282 VSTTMIIQPGPVVDFLIANQNVRDPFSLDWAK----------AKRTLKNLRIKASPSNQE 331
VSTT++++PGPVVDFL+ NQNV+ P +DW K AKR LKNLRIKA+ N +
Sbjct: 236 VSTTLLVKPGPVVDFLLQNQNVQKPNLIDWTKVILLLHLEVEAKRMLKNLRIKAN--NTQ 293
Query: 332 YKITGLSELPCKEQTFTMKKKGGNNGEEDATEEEITVYEYFVNYRKIDLRYSADLPCINV 391
KITGLSE C Q F K NGE ++E IT+YEYF ++KI+L YS D+PCINV
Sbjct: 294 RKITGLSEKSCMTQNFLFKHGNDANGEVQSSE--ITIYEYFKRHKKIELCYSVDMPCINV 351
Query: 392 GKPKRPTYVPVELCSLVSLQRYTKALTTLQRSSLVEKSRQKPLERMNVLNQALKTSNYGN 451
GKPKRP Y P+ELC+LVSLQRYTK L QR+ L+ +SR P ER L +L+ S YG+
Sbjct: 352 GKPKRPIYYPMELCTLVSLQRYTKPLAHKQRAQLILESRTSPRERKEALQYSLRNSRYGD 411
Query: 452 EPMLKNCGITIASGFTQVEGRVLQAPRLKFGNGEDFNPRNGRWNLNNKKVVRPAKIEHWA 511
EPML++ GITI FTQV+GRVLQ P L G G++F PRNG WN N+KK++ P KI+ WA
Sbjct: 412 EPMLRSLGITIEPSFTQVDGRVLQPPTLIVGRGQNFCPRNGSWNFNDKKLIEPVKIKRWA 471
Query: 512 VVNFSARCDVRGLVRDLIKCARLKGIPIDEPYEEIFEENGQFRRAPPLVRVEKMFERIQK 571
+VNFS++CD + L + KC+ +KG+ ID P+ +IFEE+ + R P RV +M+E ++
Sbjct: 472 IVNFSSQCDTKHLCSMIKKCSEMKGMLIDPPF-DIFEEDIRHRNESPFARVARMYEMVKA 530
Query: 572 ELPGAPS-----FLLCLLPERKNSDLYGPWKKKNLAEYGIVTQCISPTRVNDQYLTNVLM 626
+LPG P+ LLC+LP +N ++YGPWK++ L + GI TQCI+PT++ND Y+ NVL+
Sbjct: 531 KLPGPPTHPLAQLLLCILPVSRNCNIYGPWKRRCLVDEGIATQCIAPTKINDHYIINVLL 590
Query: 627 KINAKL 632
KINAK+
Sbjct: 591 KINAKV 596
>IMGA|AC147429_4.4 Stem cell self-renewal protein Piwi
chr00_pseudomolecule_IMGAG_V2 2850348-2852944 H
EGN_Mt071002 20080227
Length = 298
Score = 521 bits (1342), Expect = e-148, Method: Compositional matrix adjust.
Identities = 250/301 (83%), Positives = 274/301 (91%), Gaps = 11/301 (3%)
Query: 626 MKINAKLGGLNSVLGVEMNPSIPIVSKVPTIILGMDVSHGSPGQSDIPSIAAVVSSREWP 685
+ I +LGGLNS+LGVE +PS+PIVSK PT+ILGMDVSHGSPGQ+DIPSIAAVVSSR+WP
Sbjct: 8 LSIVLQLGGLNSLLGVESSPSLPIVSKAPTLILGMDVSHGSPGQTDIPSIAAVVSSRQWP 67
Query: 686 LISKYRACVRTQSPKVEMIDNLFKQVSEKEDEGIIRELLIDFYSSSGKRKPDNIIIFRDG 745
LISKYRACVRTQS KVEMIDNLFK+VS+ EDEGI+RELL+DFY+SS RKPDNIIIFRDG
Sbjct: 68 LISKYRACVRTQSAKVEMIDNLFKKVSDTEDEGIMRELLLDFYTSSKNRKPDNIIIFRDG 127
Query: 746 VSESQFNQVLNIELNQIIEACKFLDETWNPKFLVIVAQKNHHTKFFQPGSPDNVPPGTVI 805
VSESQFNQVLNIEL+QIIEACKFLDE W PKF+VIVAQKNHHT+FFQP SPDNVPPG
Sbjct: 128 VSESQFNQVLNIELDQIIEACKFLDENWTPKFVVIVAQKNHHTRFFQPNSPDNVPPG--- 184
Query: 806 DNKICHPRNNDFYMCAHAGMIGTSRPTHYHVLLDDIGFSPDELQELVHSLSYVYQRSTTA 865
+N DFY+CAHAGMIGTSRPTHYHVLLD+IGFSPDELQELVHSLSYVYQRSTTA
Sbjct: 185 -------KNYDFYLCAHAGMIGTSRPTHYHVLLDEIGFSPDELQELVHSLSYVYQRSTTA 237
Query: 866 ISVVAPICYAHLAATQIGQFMKFEDKSDTSSSHGGLTAAGVAPVVPQLPKLQDSVSSSMF 925
ISVVAPICYAHLAATQ+GQFMKFEDKS+TSSSHGGL+AAG P VPQLPKLQD+V +SMF
Sbjct: 238 ISVVAPICYAHLAATQLGQFMKFEDKSETSSSHGGLSAAGAVP-VPQLPKLQDNVCNSMF 296
Query: 926 F 926
F
Sbjct: 297 F 297
>IMGA|AC160838_11.5 Argonaute and Dicer protein, PAZ; Stem cell
self-renewal protein Piwi chr08_pseudomolecule_IMGAG_V2
25391911-25397439 E EGN_Mt071002 20080227
Length = 876
Score = 489 bits (1258), Expect = e-138, Method: Compositional matrix adjust.
Identities = 323/902 (35%), Positives = 477/902 (52%), Gaps = 84/902 (9%)
Query: 59 ARKGLGSKGTKLPLLTNHFKVTVANSDGHFFQYSVALSYEDGRPVEGKGVGRKVIDKVQE 118
+R G GTK + N+F ++ SD Y V ++ E K + K++ Q
Sbjct: 26 SRPDYGKLGTKCVVKANYFLADISVSD--LSHYHVDITPEVISSKTRKAIIAKLVKFHQN 83
Query: 119 TYGSELNGKDFAYDGEKTLFTIGSLARNKLEFTVVL-EDVISNRNNGNCSPDGASTNDSD 177
T EL K YDG + L+T GSL EF ++L ED +G T
Sbjct: 84 T---ELGKKLPVYDGAENLYTAGSLPFTHKEFNILLIED-----------DEGFGTTRER 129
Query: 178 KKRMRRPYHSKTFKVEISFAAKIPLQAIVNALRGQESENYQEAIRVLDIILRQHAAKQGC 237
K F+V I F A + + + L G++ E QEAI +DI+L++ A+
Sbjct: 130 K-----------FEVAIKFLAHVSMHQLHELLSGKKVETPQEAINAIDIVLKELASHS-- 176
Query: 238 LLVRQSFFHNDP--KNYADVGGGVLGCRGFHSSFRTTQSGLSLNIDVSTTMIIQPGPVVD 295
V H P K + GG+ GF+ S R TQ GLSLN+D+++T I+P PV+D
Sbjct: 177 -YVSFGSLHYSPDLKKPHKLSGGLESWSGFYQSIRPTQMGLSLNVDMASTAFIEPLPVID 235
Query: 296 FL--IANQNVRD-PFS-LDWAKAKRTLKNLRIKAS---PSNQEYKITGLSELPCKEQTFT 348
I ++V P S D K K+ LK ++++ + ++Y+ITGL+ P +E +F
Sbjct: 236 IAAQILGKDVHSKPLSDADRIKIKKALKGVKVEVTYRGSFRRKYRITGLTSQPTRELSFP 295
Query: 349 MKKKGGNNGEEDATEEEITVYEYFVNYRKIDLRYSADLPCINVGKPKRPTYVPVELCSLV 408
+ +K I+V +YF + Y LPC+ VG K+ Y+P+E C +V
Sbjct: 296 LGEKMNM----------ISVIDYFQEMYGYKIMY-PHLPCLQVGSQKKVNYLPMEACKIV 344
Query: 409 SLQRYTKALTTLQRSSLVEKSRQKPLERMNVLNQALKTSNYGNEPMLKNCGITIASGFTQ 468
QRYTK L+ Q +S+++ S Q+P ER N + Q + ++Y P K GI+I +
Sbjct: 345 GGQRYTKGLSEKQITSMLKVSCQRPRERENDILQTIHQNDYDCNPYAKEFGISIGNELAS 404
Query: 469 VEGRVLQAPRLKF---GNGEDFNPRNGRWNLNNKKVVRPAKIEHWAVVNFSARCDVR--- 522
VE RVL AP LK+ G + P+ G+WN+ NKKVV +K+ +WA +NFS +
Sbjct: 405 VEARVLPAPWLKYHETGRDKKILPQVGQWNMTNKKVVNGSKVRYWACINFSRSVKEKTAS 464
Query: 523 GLVRDLIKCARLKGIPI-DEPYEEIFEEN-GQFRRAPPLVRVEKMFERIQKELPGAPSFL 580
+ L++ + G+ +EP ++ ++A V + + KEL +
Sbjct: 465 AFCQQLVQTCQSLGMEFSEEPVIPVYSARPDMVKKALKYVHSFSLNKLEGKEL----ELV 520
Query: 581 LCLLPERKNSDLYGPWKKKNLAEYGIVTQCISPT---RVNDQYLTNVLMKINAKLGGLNS 637
+ +LP+ N LYG KK + G+++QC ++N QYL+NV +KIN K+GG N+
Sbjct: 521 VAILPDN-NGSLYGDLKKICETDLGLISQCCLTKYVFKINRQYLSNVALKINVKMGGRNT 579
Query: 638 VLGVEMNPSIPIVSKVPTIILGMDVSHGSPGQSDIPSIAAVVSSREWPLISKYRACVRTQ 697
VL ++ IP+VS VPTII G DVSH G+ PSIAAVV+S++WP ++KY V Q
Sbjct: 580 VLLDAISCRIPLVSDVPTIIFGADVSHPESGEDVCPSIAAVVASQDWPEVTKYAGLVCAQ 639
Query: 698 SPKVEMIDNLFKQVSEKEDE----GIIRELLIDFYSSSGKRKPDNIIIFRDGVSESQFNQ 753
P+ E+I +LFK ++ G+IRELL+ F ++GK KP I+ +RDGVSE QF Q
Sbjct: 640 PPREEIIKDLFKCWNDPRRGIVYGGMIRELLLSFQKATGK-KPCRILFYRDGVSEGQFYQ 698
Query: 754 VLNIELNQIIEACKFLDETWNPKFLVIVAQKNHHTKFFQPGSPD--------NVPPGTVI 805
VL EL+ I +AC L+ + P +V QK HHT+ F D N+ PGTV+
Sbjct: 699 VLLYELDAIRKACASLEPGYQPPVTFVVVQKRHHTRLFSDNHNDRNSMDRSGNILPGTVV 758
Query: 806 DNKICHPRNNDFYMCAHAGMIGTSRPTHYHVLLDDIGFSPDELQELVHSLSYVYQRSTTA 865
D KICHP DFY+C+HAG+ GTS+P HYHV+ DD FS DE+Q L ++L Y Y R T +
Sbjct: 759 DTKICHPTEFDFYLCSHAGVQGTSKPAHYHVIWDDNKFSADEIQSLTNNLCYTYARCTRS 818
Query: 866 ISVVAPICYAHLAATQIGQFMKFEDKSDTSSSHGGLTAAGVAPVVPQLPKLQDSVSSSMF 925
+S+V P YAHLAA + +M+ + + S G V P LP L++ V MF
Sbjct: 819 VSLVPPAYYAHLAAYRARFYMEPDVHENAKSQVTGSKVESVRP----LPALKEKVKKVMF 874
Query: 926 FC 927
+C
Sbjct: 875 YC 876
>IMGA|CT030192_31.5 Stem cell self-renewal protein Piwi
chr03_pseudomolecule_IMGAG_V2 1280512-1277890 E
EGN_Mt071002 20080227
Length = 314
Score = 400 bits (1028), Expect = e-111, Method: Compositional matrix adjust.
Identities = 197/298 (66%), Positives = 238/298 (79%), Gaps = 7/298 (2%)
Query: 631 KLGGLNSVLGVEMNPSIPIVSKVPTIILGMDVSHGSPGQSDIPSIAAVVSSREWPLISKY 690
+LGG+NS L E SIP+ SK+PT+++GMDVSHGS GQS+ SIAAVVSSR WP IS+Y
Sbjct: 23 QLGGMNSFLLTEFKHSIPLFSKIPTLVIGMDVSHGSQGQSEALSIAAVVSSRCWPQISRY 82
Query: 691 RACVRTQSPKVEMIDNLFKQVSEKEDEGIIRELLIDFYSSSGKRKPDNIIIFRDGVSESQ 750
+A VRTQS KVE++ +LFK VS+ +D+GII ELL DF ++SG KP IIIFRDGVSESQ
Sbjct: 83 KAVVRTQSSKVEIVQSLFKPVSDTKDDGIISELLKDFQTTSGV-KPQQIIIFRDGVSESQ 141
Query: 751 FNQVLNIELNQIIEACKFLDETWNPKFLVIVAQKNHHTKFFQPGSP-DNVPPGTVIDNKI 809
FNQVLNIELN+II+ACK DE+W PKF +IVAQKNHHT+FF+ SP +NV PGTVIDN I
Sbjct: 142 FNQVLNIELNEIIKACKCYDESWCPKFTLIVAQKNHHTRFFKANSPQENVSPGTVIDNTI 201
Query: 810 CHPRNNDFYMCAHAGMIGTSRPTHYHVLLDDIGFSPDELQELVHSLSYVYQRSTTAISVV 869
CHP++NDFYMCAHAG IGTSRPTHYHVL D+IGFS D LQE VHSL YV+QRST AIS+V
Sbjct: 202 CHPKDNDFYMCAHAGRIGTSRPTHYHVLYDEIGFSADNLQEFVHSLCYVHQRSTNAISIV 261
Query: 870 APICYAHLAATQIGQFMKFEDKSDTSSSHGGLTAAGVAPVVPQLPKLQDSVSSSMFFC 927
API YA LAA QI QF+K+ D+S+ SSH ++ + +LP+L + V+ SMFFC
Sbjct: 262 APIYYADLAAAQIAQFIKY-DESENLSSHNEF----ISQIPTELPRLHERVADSMFFC 314
>IMGA|CU179907_3.4 Argonaute and Dicer protein, PAZ; Stem cell
self-renewal protein Piwi chr05_pseudomolecule_IMGAG_V2
17162220-17166459 E EGN_Mt071002 20080227
Length = 1016
Score = 395 bits (1016), Expect = e-110, Method: Compositional matrix adjust.
Identities = 286/934 (30%), Positives = 461/934 (49%), Gaps = 113/934 (12%)
Query: 49 KKKLPTRLP----IARK--GLGSKGTKLPLLTNHFKVTVANSDGHFFQYSVALSYEDGRP 102
KK + TR P +AR+ G +G + LL NHF V +S + Y+V ++ P
Sbjct: 141 KKLISTRKPHEVIVARRPDSGGQEGPVISLLANHFLVKF-DSSHKIYHYNVEIT-----P 194
Query: 103 VEGKGVGRKVIDKVQETYGSELNGKDFAYDGEKTLFTIGSLARNKLEFTVVLEDVISNRN 162
K V R++ K+ L+G AYDG K L++ +KLEF +
Sbjct: 195 HPSKDVAREIKHKLVNNNAEILSGALPAYDGRKNLYSPIEFQNDKLEFYI---------- 244
Query: 163 NGNCSPDGASTNDSDKKRMRRPYHSKTFKVEISFAAKIPLQAIVNALRGQESENY---QE 219
G P ST+ +K+ K F++ I +KI + + N L + E Q+
Sbjct: 245 -GLPIPTSKSTSPYEKREQH-----KLFRINIKLVSKIDGKGLTNYLSKEGDEGIPLPQD 298
Query: 220 AIRVLDIILRQHAAKQGCLLVRQSFFHNDPKNYADVGGGVLGCRGFHSSFRTTQSGLSLN 279
+ LD++LR+ + + C+ V +SF+ + D+GGG +G RGF S R TQ GL+LN
Sbjct: 299 YLHALDVVLRE-SPTEKCIPVGRSFYSSSMGRSKDIGGGAVGLRGFFQSLRPTQQGLALN 357
Query: 280 IDVSTTMIIQPGPVVDFL---------IANQNVRDPFSLDWAKAKRTLKNLRIKAS--PS 328
+D S T + V+ +L ++ + + + ++TLKN+R+ +
Sbjct: 358 VDFSVTAFHESIGVIPYLQKRLEFLRDLSQRQTTQLTCEERKEVEKTLKNIRVFVCHRET 417
Query: 329 NQEYKITGLSELPCKEQTFTMKKKGGNNGEEDATEEEITVYEYFVNYRKIDLRYSADLPC 388
Q Y++ GL+E + F D + + + YF ++ D+++ PC
Sbjct: 418 VQRYRVYGLTEEATENLWFP-----------DRDGKNLRLMSYFKDHYNYDIQFRK-WPC 465
Query: 389 INVGKPKRPTYVPVELCSLVSLQRYTKALTTLQRSSLVEKSRQKPLERMNVLNQALKTSN 448
+ + + K P Y+P+ELC + Q++ L+ Q + +++ Q+P ER ++ ++ N
Sbjct: 466 LQISRSK-PCYLPMELCVICEGQKFLGKLSDDQTAKILKMGCQRPGERKAIIEGVMR-GN 523
Query: 449 YG--NEPMLKNCGITIASGFTQVEGRVLQAPRLKFGNG---EDFNP-RNGR-WNLNNKKV 501
G + K + ++ T++ GR+L P+LK G+G + P R+ R WN + V
Sbjct: 524 VGPTSGDQEKEFKLQVSREMTKLTGRILYPPKLKLGDGGHVRNLTPSRHDRQWNFLDGHV 583
Query: 502 VRPAKIEHWAVVNFSARCDVRGLVRDLI-----KCARL-----KGIPIDEPYEEIFEENG 551
IE WA+++F + + + I +C +L K I +E I N
Sbjct: 584 FEGTTIERWALISFGGTPEQKSHIPRFINQLTQRCEQLGIFLNKNTIISPQFESIQVLNN 643
Query: 552 QFRRAPPLVRVEKMFERIQKELPGAPSFLLCLLPERKNSDLYGPWKKKNLAEYGIVTQC- 610
+ +E +RIQ L+C++ E+K+ Y K+ G+V+QC
Sbjct: 644 -------VTVLESKLKRIQSIASNNLQLLICIM-EKKHKG-YADLKRIAETSVGVVSQCC 694
Query: 611 ISPT--RVNDQYLTNVLMKINAKLGGLNSVLGVEMNPSIPIVSKV--PTIILGMDVSHGS 666
+ P +++ Q+L N+ +KINAK+GG L + +P + + P + +G DV+H
Sbjct: 695 LYPNLIKLSSQFLANLALKINAKVGGCTVALYNSLPSQLPRLFNIDEPVMFMGADVTHPH 754
Query: 667 PGQSDIPSIAAVVSSREWPLISKYRACVRTQSPKVEMIDNLFKQVSEKEDEGIIRELLID 726
P PS+AAVV S WP +KY + +R+Q+ + E+I +L ++ ELL D
Sbjct: 755 PLDDSSPSVAAVVGSMNWPTANKYISRIRSQTHRQEIIADL---------GAMVGELLED 805
Query: 727 FYSSSGKRKPDNIIIFRDGVSESQFNQVLNIELNQIIEACKFLDETWNPKFLVIVAQKNH 786
FY K P+ II FRDGVSE+QF +VL EL I +AC + P +V QK H
Sbjct: 806 FYQEVEKL-PNRIIFFRDGVSETQFYKVLQEELQSIKQACSSRFHGYKPFITFVVVQKRH 864
Query: 787 HTKFFQPGSPD-------------NVPPGTVIDNKICHPRNNDFYMCAHAGMIGTSRPTH 833
HT+ F P D N+PPGTV+D+ I HP+ DFY+C+H G+ GTSRPTH
Sbjct: 865 HTRLF-PADTDQSSMHNNFHFQYENIPPGTVVDSVITHPKEFDFYLCSHWGVKGTSRPTH 923
Query: 834 YHVLLDDIGFSPDELQELVHSLSYVYQRSTTAISVVAPICYAHLAATQIGQFMKFEDKSD 893
YHVLLD+ F+ DELQ+LV++L + + R T IS+V P YAHLAA + +++ +
Sbjct: 924 YHVLLDENKFTSDELQKLVYNLCFTFVRCTKPISLVPPAYYAHLAAYRGRLYLERSESLG 983
Query: 894 TSSSHGGLTAAGVAPVVPQLPKLQDSVSSSMFFC 927
S L+ A P P LPKL +++ MF+C
Sbjct: 984 LFRSASTLSRAA-TPKTPPLPKLSENIKKLMFYC 1016
>IMGA|AC136450_38.5 Argonaute and Dicer protein, PAZ
chr02_pseudomolecule_IMGAG_V2 14863531-14868049 H
EGN_Mt071002 20080227
Length = 506
Score = 211 bits (537), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 153/494 (30%), Positives = 247/494 (50%), Gaps = 69/494 (13%)
Query: 59 ARKGLGSKGTKLPLLTNHFKVTVANSDGHFFQYSVALSYEDGRPVEGKGVGRKVIDKVQE 118
R G G GTK + NHF V ++ SD Y+V + E K V + + V+
Sbjct: 51 CRPGYGQLGTKCLIKANHFLVDISVSD--LSHYNVKIIPEVCSSKTRKAV---ISELVRV 105
Query: 119 TYGSELNGKDFAYDGEKTLFTIGSLARNKLEFTVVL--EDVISNRNNGNCSPDGASTNDS 176
++L + YDG + L+T G L EF+V+L ED ++ T +
Sbjct: 106 HKNTDLANRLPVYDGGRNLYTAGLLPFTYKEFSVILSEEDYVT-----------GGTREQ 154
Query: 177 DKKRMRRPYHSKTFKVEISFAAKIPLQAIVNALRGQESENYQEAIRVLDIILRQHAAKQG 236
+ FKV I FA + +Q + L G++ + QEA+ V DI+L++ AA++
Sbjct: 155 E------------FKVGIKFATSVRMQQLRELLSGKQVDTPQEALSVFDIVLKEVAAQR- 201
Query: 237 CLLVRQSFFHNDPKNYADVGGGVLGCRGFHSSFRTTQSGLSLNIDVSTTMIIQPGPVVDF 296
P+ +GGG+ RGF+ S R TQ GLSLNID+S+ I+P PV+DF
Sbjct: 202 -----------KPQQ---LGGGIESWRGFYQSIRPTQMGLSLNIDMSSMAFIEPLPVIDF 247
Query: 297 L--IANQNVRD-PFS-LDWAKAKRTLKNLRIKASPSN---QEYKITGLSELPCKEQTFTM 349
+ I ++V P S D K K+ L+ ++++ + ++Y+I+GL+ P +E F +
Sbjct: 248 VAQILGKDVHSKPLSDADRVKIKKALRGVKVEVTHRGNFRRKYRISGLTSQPTRELIFPL 307
Query: 350 KKKGGNNGEEDATEEEITVYEYFVNYRKIDLRYSADLPCINVGKPKRPTYVPVELCSLVS 409
D +V +YF ++YS LPC+ VG ++ Y+P+E C +V
Sbjct: 308 ----------DEQMNMKSVVDYFQEMYGYTIKYS-HLPCLQVGSQRKLNYLPMEACKIVR 356
Query: 410 LQRYTKALTTLQRSSLVEKSRQKPLERMNVLNQALKTSNYGNEPMLKNCGITIASGFTQV 469
QR TK L Q +SL++ S Q+P E+ + Q ++ +NY N P K GI+I V
Sbjct: 357 GQRQTKGLNEKQITSLLKFSCQRPREQETDILQTIEQNNYENNPYAKEFGISIDKKLASV 416
Query: 470 EGRVLQAPRLKF---GNGEDFNPRNGRWNLNNKKVVRPAKIEHWAVVNFS---ARCDVRG 523
E RVL +P LK+ G ++ P+ G+WN+ NKKV+ + + +WA +NFS G
Sbjct: 417 EARVLPSPWLKYHDSGREKEHLPQVGQWNMLNKKVINGSNVRYWACINFSRSVQESTAHG 476
Query: 524 LVRDLIKCARLKGI 537
+ L++ ++ G+
Sbjct: 477 FCQQLVQMCQITGL 490
>IMGA|CR931808_21.5 Ribonucleotide reductase
chr05_pseudomolecule_IMGAG_V2 36705487-36704770 E
EGN_Mt071002 20080227
Length = 131
Score = 135 bits (339), Expect = 1e-31, Method: Composition-based stats.
Identities = 66/100 (66%), Positives = 75/100 (75%), Gaps = 12/100 (12%)
Query: 827 GTSRPTHYHVLLDDIGFSPDELQELVHSLSYVYQRSTTAISVVAPICYAHLAATQIGQFM 886
GTSRPTHYHVLLD+IGFSPD+LQELVHSLSYVYQ +APICY HLAA Q+ QFM
Sbjct: 7 GTSRPTHYHVLLDEIGFSPDDLQELVHSLSYVYQ--------IAPICYVHLAAAQVAQFM 58
Query: 887 KFEDKSDTSSSHGGLTAAGVAPVVPQLPKLQDSVSSSMFF 926
KFE+ S+TSSS GG A+ +PQLPK V +SMFF
Sbjct: 59 KFENISETSSSQGGNNASS----IPQLPKFHTKVWNSMFF 94
>IMGA|CU012043_14.5 Stem cell self-renewal protein Piwi
chr03_pseudomolecule_IMGAG_V2 29691565-29690362 H
EGN_Mt071002 20080227
Length = 176
Score = 116 bits (291), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 66/171 (38%), Positives = 94/171 (54%), Gaps = 32/171 (18%)
Query: 654 PTIILGMDVSHGSPGQSDIPSIAAVVSSREWPLISKYRACVRTQSPKVEMIDNLFKQVSE 713
PTII G DV+H G+ PS+AAVV+S++WP ++KY V Q+ + E+I +L+K +
Sbjct: 28 PTIIFGADVTHPENGEDSSPSMAAVVASQDWPEVTKYAGLVCAQAHRQELIQDLYKTWHD 87
Query: 714 KEDEGIIRELLIDFYSSSGKRKPDNIIIFRDGVSESQFNQVLNIELNQIIEACKFLDETW 773
+R+ + S G + RDGVSE QF QVL EL+ I +AC L+ +
Sbjct: 88 P-----VRDTV-----SGG--------MLRDGVSEGQFYQVLLYELDAIQKACASLEPNY 129
Query: 774 NPKFLVIVAQKNHHTKFFQPGSPDNVPPGTVIDNKICHPRNNDFYMCAHAG 824
P I++ N+ PGTV+D KICHP DFY+C+HAG
Sbjct: 130 QPPVTFIIS--------------GNILPGTVVDTKICHPTEFDFYLCSHAG 166
>IMGA|AC202591_22.3 Stem cell self-renewal protein Piwi
chr01_pseudomolecule_IMGAG_V2 9757199-9756732 H
EGN_Mt071002 20080227
Length = 155
Score = 72.4 bits (176), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 50/138 (36%), Positives = 69/138 (50%), Gaps = 13/138 (9%)
Query: 655 TIILGMDVSH-GSPGQSDIPSIAAVVSSREWPLISKYRACVRTQSPKVEMIDNLFKQVSE 713
+++G DV+H S + PSIAAVV++ WP +KY + + Q + E I N
Sbjct: 20 VMLIGADVNHPASRDRRGSPSIAAVVATVNWPAANKYASRICIQEGQSEKISNF------ 73
Query: 714 KEDEGIIRELLIDFYSSSGKRKPDNIIIFRDGVSESQFNQVLNIELNQIIEACKFLDETW 773
G I L+ Y + KP IIIFR GVS +F+ VLN EL + F +
Sbjct: 74 ----GEICFDLVGNYEKLNRTKPRKIIIFRVGVSREEFSMVLNDELEDLKR--DFGGFKY 127
Query: 774 NPKFLVIVAQKNHHTKFF 791
+P V+VA K H T FF
Sbjct: 128 HPTITVVVAVKGHRTHFF 145
>IMGA|CU024897_18.4 Peptidase aspartic, active site; Saposin-like
chr03_pseudomolecule_IMGAG_V2 10384400-10386431 E
EGN_Mt071002 20080227
Length = 157
Score = 45.4 bits (106), Expect = 1e-04, Method: Composition-based stats.
Identities = 27/67 (40%), Positives = 40/67 (59%), Gaps = 4/67 (5%)
Query: 192 VEISFAAKIPLQAIVNALRGQESENYQEAIRVLDIILRQHA-AKQGCLLVRQSFFHNDPK 250
+EISFAAK + AI G+ N++ + ++L+ ++ Q LLV Q FFHNDP
Sbjct: 80 LEISFAAKTSMDAIAMRYMGR---NHRISKKLLEFLISYLGNMMQAFLLVCQFFFHNDPN 136
Query: 251 NYADVGG 257
++ADV G
Sbjct: 137 DFADVWG 143