Miyakogusa Predicted Gene
- chr3.CM0396.280.nd
BLASTP 2.2.18 [Mar-02-2008]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= chr3.CM0396.280.nd - phase: 0
(970 letters)
Database: Medicago_aa2.0
38,834 sequences; 10,231,785 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
IMGA|AC160838_11.5 Argonaute and Dicer protein, PAZ; Stem cell s... 941 0.0
IMGA|CU179907_3.4 Argonaute and Dicer protein, PAZ; Stem cell se... 578 e-165
IMGA|AC131455_15.4 Argonaute and Dicer protein, PAZ; Stem cell s... 484 e-137
IMGA|AC131455_31.4 Argonaute and Dicer protein, PAZ; Stem cell s... 455 e-128
IMGA|AC136450_38.5 Argonaute and Dicer protein, PAZ chr02_pseudo... 432 e-121
IMGA|CT030192_31.5 Stem cell self-renewal protein Piwi chr03_pse... 253 2e-67
IMGA|AC147429_4.4 Stem cell self-renewal protein Piwi chr00_pseu... 235 7e-62
IMGA|CT030192_30.5 N-6 Adenine-specific DNA methylase; Argonaute... 207 1e-53
IMGA|CU012043_14.5 Stem cell self-renewal protein Piwi chr03_pse... 198 9e-51
IMGA|AC202591_22.3 Stem cell self-renewal protein Piwi chr01_pse... 79 9e-15
IMGA|AC140104_8.5 hypothetical protein chr04_pseudomolecule_IMGA... 64 2e-10
IMGA|CT573365_24.5 Protein argonaute. chr03_pseudomolecule_IMG... 60 5e-09
IMGA|CR931808_21.5 Ribonucleotide reductase chr05_pseudomolecule... 56 7e-08
IMGA|AC160838_28.5 hypothetical protein chr08_pseudomolecule_IMG... 45 1e-04
>IMGA|AC160838_11.5 Argonaute and Dicer protein, PAZ; Stem cell
self-renewal protein Piwi chr08_pseudomolecule_IMGAG_V2
25391911-25397439 E EGN_Mt071002 20080227
Length = 876
Score = 941 bits (2431), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 459/877 (52%), Positives = 612/877 (69%), Gaps = 25/877 (2%)
Query: 102 SSTKAIRFPDRPGFGRLGKKIQVRANHFQLQVAERDLHHYDVAITPEITSKKVTREVVSQ 161
SS K++ FP RP +G+LG K V+AN+F ++ DL HY V ITPE+ S K + ++++
Sbjct: 17 SSCKSLVFPSRPDYGKLGTKCVVKANYFLADISVSDLSHYHVDITPEVISSKTRKAIIAK 76
Query: 162 LIKMYKESVLGNRLPVFDGRKNLFTAGPLPFSSKEF-VVKLEDDRPXXXXXXXXXXXXRE 220
L+K ++ + LG +LPV+DG +NL+TAG LPF+ KEF ++ +EDD RE
Sbjct: 77 LVKFHQNTELGKKLPVYDGAENLYTAGSLPFTHKEFNILLIEDDE--------GFGTTRE 128
Query: 221 RQFKVTIRFAAKVDLHHLFQFLGRQQLDCPQNTIQALDVALRATASEKYNVVGRSFFSPE 280
R+F+V I+F A V +H L + L ++++ PQ I A+D+ L+ AS Y G +SP+
Sbjct: 129 RKFEVAIKFLAHVSMHQLHELLSGKKVETPQEAINAIDIVLKELASHSYVSFGSLHYSPD 188
Query: 281 LGQTGPLGSGTEYWRGYYQSLRPTQMGLSLNIDVSARAFFEPIPVTEFVPKHFRNINFSR 340
L + L G E W G+YQS+RPTQMGLSLN+D+++ AF EP+PV + + S+
Sbjct: 189 LKKPHKLSGGLESWSGFYQSIRPTQMGLSLNVDMASTAFIEPLPVIDIAAQILGKDVHSK 248
Query: 341 ---DQDRVKVKKALRGIRVDV-FLGECKRSYKISGVSREPVKDLMFTLDDQKTKKSVAQY 396
D DR+K+KKAL+G++V+V + G +R Y+I+G++ +P ++L F L ++ SV Y
Sbjct: 249 PLSDADRIKIKKALKGVKVEVTYRGSFRRKYRITGLTSQPTRELSFPLGEKMNMISVIDY 308
Query: 397 FTEKYKVTLKHANLPALQAGSDTKPIYLPMEVCVIAAGQRYTKRLNEEQVTALLRATCQR 456
F E Y + + +LP LQ GS K YLPME C I GQRYTK L+E+Q+T++L+ +CQR
Sbjct: 309 FQEMYGYKIMYPHLPCLQVGSQKKVNYLPMEACKIVGGQRYTKGLSEKQITSMLKVSCQR 368
Query: 457 PQDRENYIKQIVKQHNFNNDKFVREFGISVKEDPTLLNARVLPPPRLKYHESGKEPRVDP 516
P++REN I Q + Q++++ + + +EFGIS+ + + ARVLP P LKYHE+G++ ++ P
Sbjct: 369 PRERENDILQTIHQNDYDCNPYAKEFGISIGNELASVEARVLPAPWLKYHETGRDKKILP 428
Query: 517 WMGQWNMINKKMVDGGKVEHWSCLNFSSRLRPDLPSIFCDELRSMCTSKGMVFNPQPLVP 576
+GQWNM NKK+V+G KV +W+C+NFS ++ S FC +L C S GM F+ +P++P
Sbjct: 429 QVGQWNMTNKKVVNGSKVRYWACINFSRSVKEKTASAFCQQLVQTCQSLGMEFSEEPVIP 488
Query: 577 IKTVNPLQIESALQNLHKQSITNLANMKQQGRLQXXXXXXPDVKGS-YGKIKKICETELG 635
+ + P ++ AL+ +H S+ L + L+ PD GS YG +KKICET+LG
Sbjct: 489 VYSARPDMVKKALKYVHSFSLNKL----EGKELELVVAILPDNNGSLYGDLKKICETDLG 544
Query: 636 IVSQCCQPRQVQKLNKQYLENLALKINVKVGGRNTVLSDAFDRRIPHVSDKHTIIFGADV 695
++SQCC + V K+N+QYL N+ALKINVK+GGRNTVL DA RIP VSD TIIFGADV
Sbjct: 545 LISQCCLTKYVFKINRQYLSNVALKINVKMGGRNTVLLDAISCRIPLVSDVPTIIFGADV 604
Query: 696 THPQPGEDSSPSIAAVVASMDWPWVTKYKGTVSAQAHREEIIQDLFTTFEDPKRGLVQGG 755
+HP+ GED PSIAAVVAS DWP VTKY G V AQ REEII+DLF + DP+RG+V GG
Sbjct: 605 SHPESGEDVCPSIAAVVASQDWPEVTKYAGLVCAQPPREEIIKDLFKCWNDPRRGIVYGG 664
Query: 756 IIRELIRSFYIANGKRKPERIIFYRDGVSEGQFSQVLLYEMDAIRKACMSLEDGYLPRVT 815
+IREL+ SF A GK KP RI+FYRDGVSEGQF QVLLYE+DAIRKAC SLE GY P VT
Sbjct: 665 MIRELLLSFQKATGK-KPCRILFYRDGVSEGQFYQVLLYELDAIRKACASLEPGYQPPVT 723
Query: 816 FVVVQKRHHTRLFPADHRSRDQMDKSGNIMPGTVVDTSICHPREFDFYLNSHAGIQGTSR 875
FVVVQKRHHTRLF +H R+ MD+SGNI+PGTVVDT ICHP EFDFYL SHAG+QGTS+
Sbjct: 724 FVVVQKRHHTRLFSDNHNDRNSMDRSGNILPGTVVDTKICHPTEFDFYLCSHAGVQGTSK 783
Query: 876 PTHYHVLYDENNFTADELQGLTNNLCYTYARCTRSVSIVPPAYYAHLAAFRARSYIXXXX 935
P HYHV++D+N F+ADE+Q LTNNLCYTYARCTRSVS+VPPAYYAHLAA+RAR Y+
Sbjct: 784 PAHYHVIWDDNKFSADEIQSLTNNLCYTYARCTRSVSLVPPAYYAHLAAYRARFYM---- 839
Query: 936 XXXXXXXXXXXXTRSNVEI--KLPAIKDNVKDVMFYC 970
T S VE LPA+K+ VK VMFYC
Sbjct: 840 EPDVHENAKSQVTGSKVESVRPLPALKEKVKKVMFYC 876
>IMGA|CU179907_3.4 Argonaute and Dicer protein, PAZ; Stem cell
self-renewal protein Piwi chr05_pseudomolecule_IMGAG_V2
17162220-17166459 E EGN_Mt071002 20080227
Length = 1016
Score = 578 bits (1491), Expect = e-165, Method: Compositional matrix adjust.
Identities = 345/915 (37%), Positives = 509/915 (55%), Gaps = 68/915 (7%)
Query: 89 EVEQKLALRPAAPSSTKAIRFPDRPGFGRLGKKIQVRANHFQLQV-AERDLHHYDVAITP 147
EV+ K + P R PD G+ G I + ANHF ++ + ++HY+V ITP
Sbjct: 137 EVDGKKLISTRKPHEVIVARRPD--SGGQEGPVISLLANHFLVKFDSSHKIYHYNVEITP 194
Query: 148 EITSKKVTREVVSQLIKMYKESVLGNRLPVFDGRKNLFTAGPLPFSSKEFVVKLEDDRPX 207
SK V RE+ +L+ E +L LP +DGRKNL++ P+ F + + + P
Sbjct: 195 H-PSKDVAREIKHKLVNNNAE-ILSGALPAYDGRKNLYS--PIEFQNDKLEFYIGLPIPT 250
Query: 208 XXXXXXXXXXXRERQFKVTIRFAAKVDLHHLFQFL---GRQQLDCPQNTIQALDVALRAT 264
+ + F++ I+ +K+D L +L G + + PQ+ + ALDV LR +
Sbjct: 251 SKSTSPYEKREQHKLFRINIKLVSKIDGKGLTNYLSKEGDEGIPLPQDYLHALDVVLRES 310
Query: 265 ASEKYNVVGRSFFSPELGQTGPLGSGTEYWRGYYQSLRPTQMGLSLNIDVSARAFFEPIP 324
+EK VGRSF+S +G++ +G G RG++QSLRPTQ GL+LN+D S AF E I
Sbjct: 311 PTEKCIPVGRSFYSSSMGRSKDIGGGAVGLRGFFQSLRPTQQGLALNVDFSVTAFHESIG 370
Query: 325 VTEFVPKHFRNINFSRD-----------QDRVKVKKALRGIRVDVFLGECKRSYKISGVS 373
V ++ K + F RD ++R +V+K L+ IRV V E + Y++ G++
Sbjct: 371 VIPYLQKR---LEFLRDLSQRQTTQLTCEERKEVEKTLKNIRVFVCHRETVQRYRVYGLT 427
Query: 374 REPVKDLMFTLDDQKTKKSVAQYFTEKYKVTLKHANLPALQAGSDTKPIYLPMEVCVIAA 433
E ++L F D K + + YF + Y ++ P LQ S +KP YLPME+CVI
Sbjct: 428 EEATENLWFPDRDGKNLR-LMSYFKDHYNYDIQFRKWPCLQI-SRSKPCYLPMELCVICE 485
Query: 434 GQRYTKRLNEEQVTALLRATCQRPQDRENYIKQIVKQH-NFNNDKFVREFGISVKEDPTL 492
GQ++ +L+++Q +L+ CQRP +R+ I+ +++ + + +EF + V + T
Sbjct: 486 GQKFLGKLSDDQTAKILKMGCQRPGERKAIIEGVMRGNVGPTSGDQEKEFKLQVSREMTK 545
Query: 493 LNARVLPPPRLKYHESGKEPRVDPWMG--QWNMINKKMVDGGKVEHWSCLNF--SSRLRP 548
L R+L PP+LK + G + P QWN ++ + +G +E W+ ++F + +
Sbjct: 546 LTGRILYPPKLKLGDGGHVRNLTPSRHDRQWNFLDGHVFEGTTIERWALISFGGTPEQKS 605
Query: 549 DLPSIFCDELRSMCTSKGMVFNPQPLVP-----IKTVNPLQI-ESALQNLHKQSITNLAN 602
+P F ++L C G+ N ++ I+ +N + + ES L+ + QSI +
Sbjct: 606 HIPR-FINQLTQRCEQLGIFLNKNTIISPQFESIQVLNNVTVLESKLKRI--QSIAS--- 659
Query: 603 MKQQGRLQXXXXXXPDVKGSYGKIKKICETELGIVSQCCQPRQVQKLNKQYLENLALKIN 662
LQ Y +K+I ET +G+VSQCC + KL+ Q+L NLALKIN
Sbjct: 660 ----NNLQLLICIMEKKHKGYADLKRIAETSVGVVSQCCLYPNLIKLSSQFLANLALKIN 715
Query: 663 VKVGGRNTVLSDAFDRRIPHV--SDKHTIIFGADVTHPQPGEDSSPSIAAVVASMDWPWV 720
KVGG L ++ ++P + D+ + GADVTHP P +DSSPS+AAVV SM+WP
Sbjct: 716 AKVGGCTVALYNSLPSQLPRLFNIDEPVMFMGADVTHPHPLDDSSPSVAAVVGSMNWPTA 775
Query: 721 TKYKGTVSAQAHREEIIQDLFTTFEDPKRGLVQGGIIRELIRSFYIANGKRKPERIIFYR 780
KY + +Q HR+EII DL G ++ EL+ FY ++ P RIIF+R
Sbjct: 776 NKYISRIRSQTHRQEIIADL-------------GAMVGELLEDFY-QEVEKLPNRIIFFR 821
Query: 781 DGVSEGQFSQVLLYEMDAIRKACMSLEDGYLPRVTFVVVQKRHHTRLFPADHRSRDQMD- 839
DGVSE QF +VL E+ +I++AC S GY P +TFVVVQKRHHTRLFPAD +
Sbjct: 822 DGVSETQFYKVLQEELQSIKQACSSRFHGYKPFITFVVVQKRHHTRLFPADTDQSSMHNN 881
Query: 840 ---KSGNIMPGTVVDTSICHPREFDFYLNSHAGIQGTSRPTHYHVLYDENNFTADELQGL 896
+ NI PGTVVD+ I HP+EFDFYL SH G++GTSRPTHYHVL DEN FT+DELQ L
Sbjct: 882 FHFQYENIPPGTVVDSVITHPKEFDFYLCSHWGVKGTSRPTHYHVLLDENKFTSDELQKL 941
Query: 897 TNNLCYTYARCTRSVSIVPPAYYAHLAAFRARSYIXXXXXXXXXXXXXXXXTRSNVEI-K 955
NLC+T+ RCT+ +S+VPPAYYAHLAA+R R Y+ + +
Sbjct: 942 VYNLCFTFVRCTKPISLVPPAYYAHLAAYRGRLYLERSESLGLFRSASTLSRAATPKTPP 1001
Query: 956 LPAIKDNVKDVMFYC 970
LP + +N+K +MFYC
Sbjct: 1002 LPKLSENIKKLMFYC 1016
>IMGA|AC131455_15.4 Argonaute and Dicer protein, PAZ; Stem cell
self-renewal protein Piwi chr05_pseudomolecule_IMGAG_V2
31662746-31670122 E EGN_Mt071002 20080227
Length = 908
Score = 484 bits (1246), Expect = e-137, Method: Compositional matrix adjust.
Identities = 314/905 (34%), Positives = 463/905 (51%), Gaps = 87/905 (9%)
Query: 112 RPGFGRLGKKIQVRANHFQLQVAERD--LHHYDVAITPE----ITSKKVTREVVSQLIKM 165
R G G G K+ + NHF++ V D Y VA+ E + K R+++ ++ +
Sbjct: 43 RRGLGSKGAKLPLLTNHFKVNVTNTDGYFFQYSVALFYEDGRPVEGKGAGRKILDRVQET 102
Query: 166 YKESVLGNRLPVFDGRKNLFTAGPLPFSSKEFVVKLED---------------DRPXXXX 210
Y + G L +DG K LFT G L + EF V LED P
Sbjct: 103 YGSELNGKDL-AYDGEKTLFTIGSLAQNKLEFTVVLEDVTSNRNNGNASPDGHGSPNDTD 161
Query: 211 XXXXXXXXRERQFKVTIRFAAKVDLHHLFQFLGRQQLDCPQNTIQALDVALRATASEKYN 270
R + +KV I FA+K+ L + L + + Q I+ LD+ LR A+++
Sbjct: 162 RKRLKKSHRSKTYKVEISFASKIPLQAIANALKGHETENYQEAIRVLDIILRQHAAKQGC 221
Query: 271 VVGR-SFFSPELGQTGPLGSGTEYWRGYYQSLRPTQMGLSLNIDVSARAFFEPIPVTEFV 329
++ R +FF + +G G RG + S R TQ GLSLNIDVS P PV +F+
Sbjct: 222 LLVRQNFFHNDPKNFTDVGGGVLGCRGLHSSFRTTQSGLSLNIDVSTTMIVHPGPVVDFL 281
Query: 330 PKHFRNINFSRDQDRVKVKKALRGIRVDVFLGECKRSYKISGVSREPVKDLMFTL----- 384
+ +N+ D K K+ L+ +R+ + YKI+G+S P KD +FTL
Sbjct: 282 IAN-QNVRDPFSLDWNKAKRTLKNLRITT--SPTNQEYKITGLSEMPCKDQLFTLKKRGA 338
Query: 385 ---DDQKTKKSVAQYFTEKYKVTLKH-ANLPALQAGSDTKPIYLPMEVCVIAAGQRYTKR 440
+D + +V YF + K++L++ A+LP + G +P ++P+E+C + + QRYTK
Sbjct: 339 VPGEDDTEEITVYDYFVNRRKISLQYSADLPCINVGKPKRPTFVPVELCSLVSLQRYTKA 398
Query: 441 LNEEQVTALLRATCQRPQDRENYIKQIVKQHNFNNDKFVREFGISVKEDPTLLNARVLPP 500
L+ Q ++L+ + Q+PQ+R + +K ++ ++ +R GIS+ T ++ RVL
Sbjct: 399 LSTLQRSSLVEKSRQKPQERMRVLTDALKTSDYGSEPMLRNCGISITSGFTQVDGRVLQA 458
Query: 501 PRLKYHESGKEPRVDPWMGQWNMINKKMVDGGKVEHWSCLNFSSR-----LRPDLPSIFC 555
PRLK+ G +P G+WN NKK+V K+E W+ +NFS+R L DL I C
Sbjct: 459 PRLKF---GNGEDFNPRNGRWNFNNKKIVQPVKIEKWAVVNFSARCDVRGLVRDL--IKC 513
Query: 556 DELRSM-------CTSKGMVFNPQPLVPIKTVNPL--QIESALQNLHKQSITNLANMKQQ 606
++ + C + F P P+ V + ++S L K + L+ K
Sbjct: 514 GGMKGIHVEQPFDCFEENGQFRRAP--PLVRVEKMFEHVQSKLPGAPKFLLCLLSERKNS 571
Query: 607 GRLQXXXXXXPDVKGSYGKIKKICETELGIVSQCCQPRQVQKLNKQYLENLALKINVKVG 666
YG KK E GIV+QC P +V N QYL N+ LKIN K+G
Sbjct: 572 DL--------------YGPWKKKNLAEFGIVTQCIAPTRV---NDQYLTNVLLKINAKLG 614
Query: 667 GRNTVLSDAFDRRIPHVSDKHTIIFGADVTHPQPGEDSSPSIAAVVASMDWPWVTKYKGT 726
G N++L IP VS T+I G DV+H PG+ PSIAAVV+S WP ++KY+
Sbjct: 615 GMNSLLGVEHSPSIPIVSKAPTLILGMDVSHGSPGQTEIPSIAAVVSSRQWPLISKYRAC 674
Query: 727 VSAQAHREEIIQDLFTTFEDPKRGLVQGGIIRELIRSFYIANGKRKPERIIFYRDGVSEG 786
V Q + E+I +LF D + GIIREL+ FY ++G RKP+ II +RDGVSE
Sbjct: 675 VRTQGAKVEMIDNLFKPVSDTE----DEGIIRELLIDFYNSSGNRKPDNIIIFRDGVSES 730
Query: 787 QFSQVLLYEMDAIRKACMSLEDGYLPRVTFVVVQKRHHTRLFPADHRSRDQMDKSGNIMP 846
QF+QVL E+ I +AC L++ + P+ +V QK HHT+ F Q N+ P
Sbjct: 731 QFNQVLNIELSQIIEACKFLDEKWNPKFLVIVAQKNHHTKFF--------QPGSPDNVPP 782
Query: 847 GTVVDTSICHPREFDFYLNSHAGIQGTSRPTHYHVLYDENNFTADELQGLTNNLCYTYAR 906
GTVVD ICHPR +DFY+ +HAG+ GTSRPTHYHVL DE F+ D+LQ L ++L Y Y R
Sbjct: 783 GTVVDNKICHPRNYDFYMCAHAGMIGTSRPTHYHVLLDEIGFSPDDLQELVHSLSYVYQR 842
Query: 907 CTRSVSIVPPAYYAHLAAFRARSYIXXXXXXXXXXXXXXXXTRSNVE--IKLPAIKDNVK 964
T ++S+V P YAHLAA + ++ N +LP + D+V
Sbjct: 843 STTAISVVAPICYAHLAASQVGQFMKFEDKSETSSSHGGSGRDINASPIPQLPKLMDSVC 902
Query: 965 DVMFY 969
+ MF+
Sbjct: 903 NSMFF 907
>IMGA|AC131455_31.4 Argonaute and Dicer protein, PAZ; Stem cell
self-renewal protein Piwi chr05_pseudomolecule_IMGAG_V2
31672438-31678434 E EGN_Mt071002 20080227
Length = 868
Score = 455 bits (1170), Expect = e-128, Method: Compositional matrix adjust.
Identities = 295/838 (35%), Positives = 442/838 (52%), Gaps = 67/838 (7%)
Query: 112 RPGFGRLGKKIQVRANHFQLQVAE--RDLHHYDVAITPE----ITSKKVTREVVSQLIKM 165
R G G G K+ + NHF++ VA R Y VA+ E + K R+++ ++ +
Sbjct: 43 RRGLGTKGAKLPLLTNHFEVNVANTNRVFFQYSVALFYEDGRPVEGKGAGRKIIDKVQET 102
Query: 166 YKESVLGNRLPVFDGRKNLFTAGPLPFSSKEFVVKLEDDRPXXXXXXXXXXXXRERQFKV 225
Y + G L +DG + L A P K+ + K R + +KV
Sbjct: 103 YDSELNGKDL-AYDG-ETLNNANTSP--DKKRIRK----------------SYRSKTYKV 142
Query: 226 TIRFAAKVDLHHLFQFLGRQQLDCPQNTIQALDVALRA-TASEKYNVVGRSFFSPELGQT 284
I FA ++ L + L + + Q I+ LD+ LR +A + +V ++FF +
Sbjct: 143 EINFAKEIPLQAIANALKGHEAENYQEAIRVLDIILRQHSAKQGCLLVRQNFFHNDPNNL 202
Query: 285 GPLGSGTEYWRGYYQSLRPTQMGLSLNIDVSARAFFEPIPVTEFVPKHFRNINFSRDQDR 344
+G G +G + S R TQ GLSLNIDVS P PV +F+ ++ +N+ D
Sbjct: 203 NDVGGGVLSCKGLHSSFRTTQSGLSLNIDVSTTMIVRPGPVVDFLIEN-QNVRDPFSLDW 261
Query: 345 VKVKKALRGIRVDVFLGECKRSYKISGVSREPVKDLMFTL--------DDQKTKKSVAQY 396
K K+ L+ +R+ + YKI+G+S KD +FT+ +D + +V Y
Sbjct: 262 NKAKRTLKNLRITA--KPSNQEYKITGLSELSCKDQLFTMKKRGAVAGEDDTEEITVYDY 319
Query: 397 FTEKYKVTLKH-ANLPALQAGSDTKPIYLPMEVCVIAAGQRYTKRLNEEQVTALLRATCQ 455
F + K+ L++ A LP + G +P Y+P+E+C + + QRYTK L+ Q ++L+ + Q
Sbjct: 320 FVHRRKIDLQYSAGLPCINVGKPKRPTYIPIELCSLISLQRYTKALSTSQRSSLVEKSRQ 379
Query: 456 RPQDRENYIKQIVKQHNFNNDKFVREFGISVKEDPTLLNARVLPPPRLKYHESGKEPRVD 515
+P +R + +K N+ ++ +R GIS+ + T ++ RVL PRLK+ PR
Sbjct: 380 KPVERMRVLSNALKASNYGSEPMLRNCGISITSEFTQVDGRVLQAPRLKFGNEDFNPR-- 437
Query: 516 PWMGQWNMINKKMVDGGKVEHWSCLNFSSRLRPDLPSIFCDELRSMCTSKGMVFNPQPLV 575
G+WN NKK V+ + +WS +NFS+R D+ + D ++ C + QP
Sbjct: 438 --NGRWNFNNKKFVEPVSLGNWSVVNFSARC--DVRGLVRDLIK--CGGMKGILVEQPKD 491
Query: 576 PIKTVNPLQIESALQNLHKQSITNLANMKQQGRLQXXXXXXPDVKGS--YGKIKKICETE 633
I+ + E + + K L K R P+ K S YG KK E
Sbjct: 492 VIEENRQFKGEPPVFRVEKMFADVL---KLSKRPSFLLCLLPERKNSDLYGPWKKKNLAE 548
Query: 634 LGIVSQCCQPRQVQKLNKQYLENLALKINVKVGGRNTVLSDAFDRRIPHVSDKHTIIFGA 693
GIV+QC P +V N QYL N+ LKIN K+GG N+ L R IP VS T+I G
Sbjct: 549 FGIVTQCIAPTRV---NDQYLTNVLLKINAKLGGMNSWLGVEHSRSIPIVSKVPTLILGM 605
Query: 694 DVTHPQPGEDSSPSIAAVVASMDWPWVTKYKGTVSAQAHREEIIQDLFTTFEDPKRGLVQ 753
DV+H PG+ PSIAAVV+S WP ++KY+ V Q + E+I +LF D +
Sbjct: 606 DVSHGSPGQPDIPSIAAVVSSRKWPLISKYRACVRTQGSKVEMIDNLFKPVSDKE----D 661
Query: 754 GGIIRELIRSFYIANGKRKPERIIFYRDGVSEGQFSQVLLYEMDAIRKACMSLEDGYLPR 813
GIIREL+ F+ ++ +R+PE II +RDGVSE QF++VL E+ I +AC L++ + P+
Sbjct: 662 EGIIRELLLDFFHSSEERRPENIIIFRDGVSESQFNEVLNVELSQIIEACKFLDENWNPK 721
Query: 814 VTFVVVQKRHHTRLFPADHRSRDQMDKSGNIMPGTVVDTSICHPREFDFYLNSHAGIQGT 873
+V QK HHT+ F RS D N+ PGTVVD+ ICHPR +DFY+ +HAG+ GT
Sbjct: 722 FMVIVAQKNHHTKFFQP--RSPD------NVPPGTVVDSKICHPRNYDFYMCAHAGMIGT 773
Query: 874 SRPTHYHVLYDENNFTADELQGLTNNLCYTYARCTRSVSIVPPAYYAHLAAFRARSYI 931
SRPTHYHVL DE F+ D+LQ L ++L Y Y R T ++S+V P YAHLAA + ++
Sbjct: 774 SRPTHYHVLLDEIGFSPDDLQELVHSLSYVYQRSTTAISVVAPICYAHLAASQVGQFM 831
>IMGA|AC136450_38.5 Argonaute and Dicer protein, PAZ
chr02_pseudomolecule_IMGAG_V2 14863531-14868049 H
EGN_Mt071002 20080227
Length = 506
Score = 432 bits (1110), Expect = e-121, Method: Compositional matrix adjust.
Identities = 211/470 (44%), Positives = 307/470 (65%), Gaps = 25/470 (5%)
Query: 103 STKAIRFPDRPGFGRLGKKIQVRANHFQLQVAERDLHHYDVAITPEITSKKVTREVVSQL 162
S ++ FP RPG+G+LG K ++ANHF + ++ DL HY+V I PE+ S K + V+S+L
Sbjct: 43 SKSSLMFPCRPGYGQLGTKCLIKANHFLVDISVSDLSHYNVKIIPEVCSSKTRKAVISEL 102
Query: 163 IKMYKESVLGNRLPVFDGRKNLFTAGPLPFSSKEFVVKLEDDRPXXXXXXXXXXXXRERQ 222
++++K + L NRLPV+DG +NL+TAG LPF+ KEF V L ++ RE++
Sbjct: 103 VRVHKNTDLANRLPVYDGGRNLYTAGLLPFTYKEFSVILSEE-------DYVTGGTREQE 155
Query: 223 FKVTIRFAAKVDLHHLFQFLGRQQLDCPQNTIQALDVALRATASEKYNVVGRSFFSPELG 282
FKV I+FA V + L + L +Q+D PQ + D+ L+ A+++ P+
Sbjct: 156 FKVGIKFATSVRMQQLRELLSGKQVDTPQEALSVFDIVLKEVAAQR---------KPQ-- 204
Query: 283 QTGPLGSGTEYWRGYYQSLRPTQMGLSLNIDVSARAFFEPIPVTEFVPKHFRNINFSR-- 340
LG G E WRG+YQS+RPTQMGLSLNID+S+ AF EP+PV +FV + S+
Sbjct: 205 ---QLGGGIESWRGFYQSIRPTQMGLSLNIDMSSMAFIEPLPVIDFVAQILGKDVHSKPL 261
Query: 341 -DQDRVKVKKALRGIRVDV-FLGECKRSYKISGVSREPVKDLMFTLDDQKTKKSVAQYFT 398
D DRVK+KKALRG++V+V G +R Y+ISG++ +P ++L+F LD+Q KSV YF
Sbjct: 262 SDADRVKIKKALRGVKVEVTHRGNFRRKYRISGLTSQPTRELIFPLDEQMNMKSVVDYFQ 321
Query: 399 EKYKVTLKHANLPALQAGSDTKPIYLPMEVCVIAAGQRYTKRLNEEQVTALLRATCQRPQ 458
E Y T+K+++LP LQ GS K YLPME C I GQR TK LNE+Q+T+LL+ +CQRP+
Sbjct: 322 EMYGYTIKYSHLPCLQVGSQRKLNYLPMEACKIVRGQRQTKGLNEKQITSLLKFSCQRPR 381
Query: 459 DRENYIKQIVKQHNFNNDKFVREFGISVKEDPTLLNARVLPPPRLKYHESGKEPRVDPWM 518
++E I Q ++Q+N+ N+ + +EFGIS+ + + ARVLP P LKYH+SG+E P +
Sbjct: 382 EQETDILQTIEQNNYENNPYAKEFGISIDKKLASVEARVLPSPWLKYHDSGREKEHLPQV 441
Query: 519 GQWNMINKKMVDGGKVEHWSCLNFSSRLRPDLPSIFCDELRSMCTSKGMV 568
GQWNM+NKK+++G V +W+C+NFS ++ FC +L MC G+V
Sbjct: 442 GQWNMLNKKVINGSNVRYWACINFSRSVQESTAHGFCQQLVQMCQITGLV 491
>IMGA|CT030192_31.5 Stem cell self-renewal protein Piwi
chr03_pseudomolecule_IMGAG_V2 1280512-1277890 E
EGN_Mt071002 20080227
Length = 314
Score = 253 bits (647), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 131/308 (42%), Positives = 182/308 (59%), Gaps = 15/308 (4%)
Query: 663 VKVGGRNTVLSDAFDRRIPHVSDKHTIIFGADVTHPQPGEDSSPSIAAVVASMDWPWVTK 722
+++GG N+ L F IP S T++ G DV+H G+ + SIAAVV+S WP +++
Sbjct: 22 MQLGGMNSFLLTEFKHSIPLFSKIPTLVIGMDVSHGSQGQSEALSIAAVVSSRCWPQISR 81
Query: 723 YKGTVSAQAHREEIIQDLFTTFEDPKRGLVQGGIIRELIRSFYIANGKRKPERIIFYRDG 782
YK V Q+ + EI+Q LF D K GII EL++ F +G KP++II +RDG
Sbjct: 82 YKAVVRTQSSKVEIVQSLFKPVSDTK----DDGIISELLKDFQTTSGV-KPQQIIIFRDG 136
Query: 783 VSEGQFSQVLLYEMDAIRKACMSLEDGYLPRVTFVVVQKRHHTRLFPADHRSRDQMDKSG 842
VSE QF+QVL E++ I KAC ++ + P+ T +V QK HHTR F A+
Sbjct: 137 VSESQFNQVLNIELNEIIKACKCYDESWCPKFTLIVAQKNHHTRFFKAN-------SPQE 189
Query: 843 NIMPGTVVDTSICHPREFDFYLNSHAGIQGTSRPTHYHVLYDENNFTADELQGLTNNLCY 902
N+ PGTV+D +ICHP++ DFY+ +HAG GTSRPTHYHVLYDE F+AD LQ ++LCY
Sbjct: 190 NVSPGTVIDNTICHPKDNDFYMCAHAGRIGTSRPTHYHVLYDEIGFSADNLQEFVHSLCY 249
Query: 903 TYARCTRSVSIVPPAYYAHLAAFRARSYIXXXXXXXXXXXXXXXXTRSNVEIKLPAIKDN 962
+ R T ++SIV P YYA LAA + +I S + +LP + +
Sbjct: 250 VHQRSTNAISIVAPIYYADLAAAQIAQFIKYDESENLSSHNEFI---SQIPTELPRLHER 306
Query: 963 VKDVMFYC 970
V D MF+C
Sbjct: 307 VADSMFFC 314
>IMGA|AC147429_4.4 Stem cell self-renewal protein Piwi
chr00_pseudomolecule_IMGAG_V2 2850348-2852944 H
EGN_Mt071002 20080227
Length = 298
Score = 235 bits (599), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 126/312 (40%), Positives = 179/312 (57%), Gaps = 23/312 (7%)
Query: 659 LKINVKVGGRNTVLSDAFDRRIPHVSDKHTIIFGADVTHPQPGEDSSPSIAAVVASMDWP 718
L I +++GG N++L +P VS T+I G DV+H PG+ PSIAAVV+S WP
Sbjct: 8 LSIVLQLGGLNSLLGVESSPSLPIVSKAPTLILGMDVSHGSPGQTDIPSIAAVVSSRQWP 67
Query: 719 WVTKYKGTVSAQAHREEIIQDLFTTFEDPKRGLVQGGIIRELIRSFYIANGKRKPERIIF 778
++KY+ V Q+ + E+I +LF D + GI+REL+ FY ++ RKP+ II
Sbjct: 68 LISKYRACVRTQSAKVEMIDNLFKKVSDTE----DEGIMRELLLDFYTSSKNRKPDNIII 123
Query: 779 YRDGVSEGQFSQVLLYEMDAIRKACMSLEDGYLPRVTFVVVQKRHHTRLFPADHRSRDQM 838
+RDGVSE QF+QVL E+D I +AC L++ + P+ +V QK HHTR F Q
Sbjct: 124 FRDGVSESQFNQVLNIELDQIIEACKFLDENWTPKFVVIVAQKNHHTRFF--------QP 175
Query: 839 DKSGNIMPGTVVDTSICHPREFDFYLNSHAGIQGTSRPTHYHVLYDENNFTADELQGLTN 898
+ N+ PG + +DFYL +HAG+ GTSRPTHYHVL DE F+ DELQ L +
Sbjct: 176 NSPDNVPPG----------KNYDFYLCAHAGMIGTSRPTHYHVLLDEIGFSPDELQELVH 225
Query: 899 NLCYTYARCTRSVSIVPPAYYAHLAAFRARSYIXXXXXXXXXXXXXXXXTRSNVEI-KLP 957
+L Y Y R T ++S+V P YAHLAA + ++ V + +LP
Sbjct: 226 SLSYVYQRSTTAISVVAPICYAHLAATQLGQFMKFEDKSETSSSHGGLSAAGAVPVPQLP 285
Query: 958 AIKDNVKDVMFY 969
++DNV + MF+
Sbjct: 286 KLQDNVCNSMFF 297
>IMGA|CT030192_30.5 N-6 Adenine-specific DNA methylase; Argonaute
and Dicer protein, PAZ; Stem cell self-renewal protein
Piwi chr03_pseudomolecule_IMGAG_V2 1284064-1280514 H
EGN_Mt071002 20080227
Length = 602
Score = 207 bits (528), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 165/600 (27%), Positives = 285/600 (47%), Gaps = 72/600 (12%)
Query: 112 RPGFGRLGKKIQVRANHFQLQVAERD--LHHYDVAITPE----ITSKKVTREVVSQLIKM 165
R G G G KIQ+ ANHF++ +++ D +HY+VA+ + + K V R+V+ +L +
Sbjct: 23 RRGLGSKGAKIQLLANHFRVGLSKNDGYFYHYNVALCYQDGHAVEVKGVGRKVIDKLCET 82
Query: 166 YKESVLGNRLPVFDGRKNLFTAGPLPFSSKEFVVKLEDDRPX------XXXXXXXXXXXR 219
Y VL N+ +DG K+LFT L +EF+V LE+ R
Sbjct: 83 Y--DVLRNKNFAYDGEKSLFTLRSLHHKKQEFIVVLEEVSSTRVGSNPSEATKRMKHQSR 140
Query: 220 ERQFKVTIRFAAKVDLHHLFQFLGRQQLDCPQNTIQALDVALRATASEKYNV-VGRSFFS 278
+ FKV I +K+ L + L Q+ + Q LD LR A+++ + + +S+F
Sbjct: 141 SKTFKVEISHVSKIPLQEITDALRGQESEHYQEAFNFLDTILRQNAAKQGCLRIHKSYFH 200
Query: 279 PELGQTGPLGSGTEYWRGYYQSLRPTQMGLSLNIDVSARAFFEPIPVTEFVPKHFRNINF 338
L G + RG++ S R TQ GLSLN+DVS +P PV +F+ ++ +N+
Sbjct: 201 DNQKNITNLEGGIQCCRGFHSSFRVTQRGLSLNVDVSTTLLVKPGPVVDFLLQN-QNVQK 259
Query: 339 SRDQDRVKV----------KKALRGIRVDVFLGECKRSYKISGVSREP--VKDLMFTLDD 386
D KV K+ L+ +R+ +R KI+G+S + ++ +F +
Sbjct: 260 PNLIDWTKVILLLHLEVEAKRMLKNLRIKA--NNTQR--KITGLSEKSCMTQNFLFKHGN 315
Query: 387 ------QKTKKSVAQYFTEKYKVTLKHA-NLPALQAGSDTKPIYLPMEVCVIAAGQRYTK 439
Q ++ ++ +YF K+ L ++ ++P + G +PIY PME+C + + QRYTK
Sbjct: 316 DANGEVQSSEITIYEYFKRHKKIELCYSVDMPCINVGKPKRPIYYPMELCTLVSLQRYTK 375
Query: 440 RLNEEQVTALLRATCQRPQDRENYIKQIVKQHNFNNDKFVREFGISVKEDPTLLNARVLP 499
L +Q L+ + P++R+ ++ ++ + ++ +R GI+++ T ++ RVL
Sbjct: 376 PLAHKQRAQLILESRTSPRERKEALQYSLRNSRYGDEPMLRSLGITIEPSFTQVDGRVLQ 435
Query: 500 PPRLKYHESGKEPRVDPWMGQWNMINKKMVDGGKVEHWSCLNFSSRLRPDLPSIFCDELR 559
PP L G+ P G WN +KK+++ K++ W+ +NFSS+ C ++
Sbjct: 436 PPTLIV---GRGQNFCPRNGSWNFNDKKLIEPVKIKRWAIVNFSSQCDTKH---LCSMIK 489
Query: 560 SMCTSKGMVFNPQPLVPIKTVNPLQI-ESALQNLHKQSITNLANMKQQGR---------- 608
KGM+ +P P I E +++ ++ +A M + +
Sbjct: 490 KCSEMKGMLIDP----------PFDIFEEDIRHRNESPFARVARMYEMVKAKLPGPPTHP 539
Query: 609 -LQXXXXXXPDVKGS--YGKIKKICETELGIVSQCCQPRQVQKLNKQYLENLALKINVKV 665
Q P + YG K+ C + GI +QC P K+N Y+ N+ LKIN KV
Sbjct: 540 LAQLLLCILPVSRNCNIYGPWKRRCLVDEGIATQCIAP---TKINDHYIINVLLKINAKV 596
>IMGA|CU012043_14.5 Stem cell self-renewal protein Piwi
chr03_pseudomolecule_IMGAG_V2 29691565-29690362 H
EGN_Mt071002 20080227
Length = 176
Score = 198 bits (503), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 105/184 (57%), Positives = 118/184 (64%), Gaps = 44/184 (23%)
Query: 688 TIIFGADVTHPQPGEDSSPSIAAVVASMDWPWVTKYKGTVSAQAHREEIIQDLFTTFEDP 747
TIIFGADVTHP+ GEDSSPS+AAVVAS DWP VTKY G V AQAHR+E+IQDL+ T+ DP
Sbjct: 29 TIIFGADVTHPENGEDSSPSMAAVVASQDWPEVTKYAGLVCAQAHRQELIQDLYKTWHDP 88
Query: 748 KRGLVQGGIIRELIRSFYIANGKRKPERIIFYRDGVSEGQFSQVLLYEMDAIRKACMSLE 807
R V GG++ RDGVSEGQF QVLLYE+DAI+KAC SLE
Sbjct: 89 VRDTVSGGML----------------------RDGVSEGQFYQVLLYELDAIQKACASLE 126
Query: 808 DGYLPRVTFVVVQKRHHTRLFPADHRSRDQMDKSGNIMPGTVVDTSICHPREFDFYLNSH 867
Y P VTF++ SGNI+PGTVVDT ICHP EFDFYL SH
Sbjct: 127 PNYQPPVTFII----------------------SGNILPGTVVDTKICHPTEFDFYLCSH 164
Query: 868 AGIQ 871
AG Q
Sbjct: 165 AGNQ 168
>IMGA|AC202591_22.3 Stem cell self-renewal protein Piwi
chr01_pseudomolecule_IMGAG_V2 9757199-9756732 H
EGN_Mt071002 20080227
Length = 155
Score = 79.0 bits (193), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 57/168 (33%), Positives = 84/168 (50%), Gaps = 25/168 (14%)
Query: 665 VGGRNT--VLSDAFDRRIPHVSDKHTIIFGADVTHPQPGED-SSPSIAAVVASMDWPWVT 721
+GGRN L FD ++H ++ GADV HP + SPSIAAVVA+++WP
Sbjct: 1 MGGRNVDEGLLPFFD------YEEHVMLIGADVNHPASRDRRGSPSIAAVVATVNWPAAN 54
Query: 722 KYKGTVSAQAHREEIIQDLFTTFEDPKRGLVQGGIIRELIRSFYIANGKRKPERIIFYRD 781
KY + Q + E I + G I +L+ ++ N + KP +II +R
Sbjct: 55 KYASRICIQEGQSEKISNF-------------GEICFDLVGNYEKLN-RTKPRKIIIFRV 100
Query: 782 GVSEGQFSQVLLYEMDAIRKACMSLEDGYLPRVTFVVVQKRHHTRLFP 829
GVS +FS VL E++ +++ + Y P +T VV K H T FP
Sbjct: 101 GVSREEFSMVLNDELEDLKRDFGGFK--YHPTITVVVAVKGHRTHFFP 146
>IMGA|AC140104_8.5 hypothetical protein
chr04_pseudomolecule_IMGAG_V2 38146711-38147449 H
EGN_Mt071002 20080227
Length = 177
Score = 64.3 bits (155), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 32/52 (61%), Positives = 41/52 (78%), Gaps = 1/52 (1%)
Query: 84 DALSSEVEQKLALRPAAPSS-TKAIRFPDRPGFGRLGKKIQVRANHFQLQVA 134
+ALSS++ ++ +APSS K IRFPDRPGFG+ G+KI V+ANHFQLQVA
Sbjct: 113 EALSSKLTPEMVPEASAPSSQKKVIRFPDRPGFGQEGRKIPVQANHFQLQVA 164
>IMGA|CT573365_24.5 Protein argonaute.
chr03_pseudomolecule_IMGAG_V2 21061719-21060922 H
EGN_Mt071002 20080227
Length = 96
Score = 59.7 bits (143), Expect = 5e-09, Method: Composition-based stats.
Identities = 37/79 (46%), Positives = 47/79 (59%), Gaps = 3/79 (3%)
Query: 114 GFGRLGKKIQVRANHFQLQVAERDLHHY-DVAITPEITSKKVTREVVSQLIKMYKESVLG 172
G G GKK V AN F L +E L + V ITPE TS+ V V+ QL+++Y +S LG
Sbjct: 18 GKGSYGKKCVVMANQFFL--SELPLKNPPSVTITPEETSRGVNCVVMEQLLRLYHDSYLG 75
Query: 173 NRLPVFDGRKNLFTAGPLP 191
RLP +DG K L+T P P
Sbjct: 76 KRLPYYDGHKCLYTTCPSP 94
>IMGA|CR931808_21.5 Ribonucleotide reductase
chr05_pseudomolecule_IMGAG_V2 36705487-36704770 E
EGN_Mt071002 20080227
Length = 131
Score = 56.2 bits (134), Expect = 7e-08, Method: Composition-based stats.
Identities = 27/62 (43%), Positives = 36/62 (58%), Gaps = 8/62 (12%)
Query: 870 IQGTSRPTHYHVLYDENNFTADELQGLTNNLCYTYARCTRSVSIVPPAYYAHLAAFRARS 929
+ GTSRPTHYHVL DE F+ D+LQ L ++L Y Y + P Y HLAA +
Sbjct: 5 VDGTSRPTHYHVLLDEIGFSPDDLQELVHSLSYVYQ--------IAPICYVHLAAAQVAQ 56
Query: 930 YI 931
++
Sbjct: 57 FM 58
>IMGA|AC160838_28.5 hypothetical protein
chr08_pseudomolecule_IMGAG_V2 25420678-25421804 E
EGN_Mt071002 20080227
Length = 59
Score = 45.1 bits (105), Expect = 1e-04, Method: Composition-based stats.
Identities = 22/40 (55%), Positives = 28/40 (70%), Gaps = 1/40 (2%)
Query: 669 NTVLSDAFDRRIPHVSD-KHTIIFGADVTHPQPGEDSSPS 707
NT+L DA + RI VSD H I FGAD++ P+ GED+ PS
Sbjct: 6 NTLLLDALNCRISVVSDIPHYITFGADLSRPESGEDTCPS 45