Miyakogusa Predicted Gene - Lj5g3v0322430.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj5g3v0322430.1 Non Characterized Hit- tr|I1MXF9|I1MXF9_SOYBN
Uncharacterized protein OS=Glycine max PE=4 SV=1,78.25,0,FF,FF domain;
WW,WW/Rsp5/WWP; coiled-coil,NULL; FF domain,FF domain; WW
domain,WW/Rsp5/WWP; seg,NULL,CUFF.52864.1
(999 letters)
Database: Medicago_aa4.0v1
62,319 sequences; 21,947,249 total letters
                                                                      Score  E
Sequences producing significant alignments:                          (bits)  Value
Medtr1g017520.1 | pre-mRNA-processing 40A-like protein | HC | ch...    1140  0.0
Medtr1g106025.1 | pre-mRNA-processing 40A-like protein | HC | ch...     575  e-164
Medtr4g074440.1 | pre-mRNA-processing protein 40C | HC | chr4:28...     102  2e-21
Medtr4g074440.2 | pre-mRNA-processing protein 40C | HC | chr4:28...      62  2e-09
>Medtr1g017520.1 | pre-mRNA-processing 40A-like protein | HC |
chr1:4919114-4935894 | 20130731
Length = 1048
Score = 1140 bits (2949), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 590/953 (61%), Positives = 644/953 (67%), Gaps = 11/953 (1%)
Query: 34 HAGHAIASSNVGMPVVXXXXXXXXXXXXXLAPRPIQPGHPASSSQSMPMPYIQ-NRPLTS 92
HAGHA+ SSNVGMP + LAPR IQPGHP SSSQ +PMPYIQ NRPLTS
Sbjct: 77 HAGHAVPSSNVGMPAIQGQQLQYSQQMQQLAPRQIQPGHPVSSSQGIPMPYIQTNRPLTS 136
Query: 93 FPPHSQQTVPHPSNHMPGLPVSGAPTHSSYTFTPSYGQQQNNANALVQYQHPPHTHAPPA 152
P H+QQ VPH +NHMPGLPVSGAP S YTFTPSYGQQQ+NANAL QYQHPP HAPPA
Sbjct: 137 VPQHAQQAVPHINNHMPGLPVSGAPPQSLYTFTPSYGQQQDNANALPQYQHPPQMHAPPA 196
Query: 153 GQPWLXXXXXXXXXXXXXXXXGVQPSGTTSTDAATPATNQSSASDWQEHTSADGRRYYYN 212
GQPWL GVQ SGT STDAAT T+ +SASDWQEHT+ DGRRYYYN
Sbjct: 197 GQPWLSSVPKSAAAVTSVQPSGVQSSGTASTDAATNTTSNNSASDWQEHTAGDGRRYYYN 256
Query: 213 KRTRQSSWEKPLELMSPIERADASTVWKEFTSSDGRKYYYNKVTQQSTWSIPEELKLARE 272
K TRQSSWEKPLELMSP+ERADASTVWKEFTSS+GRKYYYNKVTQQS W+IPEELKLARE
Sbjct: 257 KSTRQSSWEKPLELMSPLERADASTVWKEFTSSEGRKYYYNKVTQQSVWTIPEELKLARE 316
Query: 273 QAHREMNQGMQSEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPSPVTPIAATDQ 332
QAH+ ++QGM SE PS VTP+ ATD
Sbjct: 317 QAHKTISQGMVSETSDTSNAAASSAATSTPPANAASSNTLTPNGLASSPSSVTPVVATDN 376
Query: 333 QLLVSG--------SIVTSNPTGVEPSNVATMSTVPTTVAGSSEVAAKLLDSKMPSIIEN 384
Q VSG S+VTS+ TGVEPS V T+ST PT VAGS V A LDSK+ SI+EN
Sbjct: 377 QRPVSGLSVASVSHSVVTSSTTGVEPSTVVTVSTAPTAVAGSLGVVANSLDSKINSIVEN 436
Query: 385 QASQD-LGSVNGASLQDVEEAKRGLPVVGKVNITPPEEKTNDDETLVYANKQEAKNAFKA 443
QA+ D SVNG LQD+EEAK+G+PVVG+ N+TP EEKTND ET VYANK EAKNAFKA
Sbjct: 437 QATHDSTSSVNGTPLQDMEEAKKGVPVVGQTNVTPSEEKTNDGETFVYANKLEAKNAFKA 496
Query: 444 FLESVNVQSDWTWEQAMREIINDKRYNALKTLGERKQAFNEYLGQRKKLEAEERRMKQKR 503
LESVNV SDWTWEQAMREIINDKRYNALKTLGERKQAFNEYLGQRKKLEAEERR+KQK+
Sbjct: 497 LLESVNVHSDWTWEQAMREIINDKRYNALKTLGERKQAFNEYLGQRKKLEAEERRIKQKK 556
Query: 504 AREEFTKMLEDCKELTSSTRWSKAIIMFENDERFNAVERPRDRADLFESYLVXXXXXXXX 563
AREEFTKMLE+CKELTSSTRWSKAI M ENDERFNAVER RDR DLFESY+V
Sbjct: 557 AREEFTKMLEECKELTSSTRWSKAISMLENDERFNAVERVRDREDLFESYMVELERKEKE 616
Query: 564 XXXXXXXXNIAEYRKFLESCDYVKVNSQWRKIQXXXXXXXXXXXXXXXXXXXVYQDYIRD 623
N+AEYRKFLESCD+VKVNS WRKIQ V+QDYIRD
Sbjct: 617 NAAEEHRRNLAEYRKFLESCDFVKVNSHWRKIQDRLEDDDRYSLLEKIDRLLVFQDYIRD 676
Query: 624 LEKEEEEQKRIHKDRVRRGERKNRDAFRKLLEEHVAAGVLTGRTQWREYCLKVRDLPQYQ 683
LEKEEEEQKRI K+RVRRGERKNRDAFRKLLEEH+A GVLT +TQWR+YCLKV++LPQYQ
Sbjct: 677 LEKEEEEQKRIQKERVRRGERKNRDAFRKLLEEHIADGVLTAKTQWRDYCLKVKELPQYQ 736
Query: 684 AVASNTSGSTPKDLFEDVAEDLEKQYHEDKILIKDIIKSGKXXXXXXSVFEDFKLAVLEE 743
AVASNTSGSTPKDLFEDV E+LEKQYHEDK LIKDI+KSGK SVFEDFK AV EE
Sbjct: 737 AVASNTSGSTPKDLFEDVFENLEKQYHEDKTLIKDILKSGKITVATTSVFEDFKSAVSEE 796
Query: 744 AACQRISEINXXXXXXXXXXXXXXXXXXXXXXXXXXXDDFTNLLYTFKEITITSKWEDCK 803
A C+ ISEIN DDFTNLLYT K+I +S WE+CK
Sbjct: 797 ATCKTISEINLKLLFEELLERAKEKEEKEAKKRQRLADDFTNLLYTLKDIITSSTWEECK 856
Query: 804 PLFEETQEYRSIGDESYSREIFEEYITYLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 863
LFE+TQEY SIG+ESYS+EIFEEYITYL
Sbjct: 857 ALFEDTQEYISIGNESYSKEIFEEYITYLKEKAKEKERKREEEKAKKEKEREEKEKRKEK 916
Query: 864 XXXXXXXXXXXXXXXXXXXXXXTDSDNEDITDSHGYXXXXXXXXXXXXX-XXXXQXXXXX 922
+DSDN+D+TD HGY Q
Sbjct: 917 EKKEKEREREKEKSKERHKKDESDSDNQDMTDGHGYREEKKKEKDKERKHRRRHQSSMDD 976
Query: 923 XXXXXXXXXXXXXXXXHGSDRKKSRKHANSPESDNESRHKRHKREHWDGSRRT 975
HGSDRKKSRKHANSPESDNESRHKRHKR+H DGSRR+
Sbjct: 977 VDSEKDEKEESRKSRRHGSDRKKSRKHANSPESDNESRHKRHKRDHGDGSRRS 1029
>Medtr1g106025.1 | pre-mRNA-processing 40A-like protein | HC |
chr1:47946865-47960298 | 20130731
Length = 1008
Score = 575 bits (1483), Expect = e-164, Method: Compositional matrix adjust.
Identities = 353/782 (45%), Positives = 449/782 (57%), Gaps = 41/782 (5%)
Query: 77 SQSMPMPYIQNRPLTSFPPHSQQTVPHPSNHMPG--LPVSGAP----THSSYTFTPSYGQ 130
SQ +PMP P + S+ +P P + P P G P + S + SYGQ
Sbjct: 95 SQMIPMPV----PRPNMQTSSESMMPQPDSQAPNGYTPGLGGPGMSISSSFMFASSSYGQ 150
Query: 131 Q-QNNANALVQYQHPPHTHAPPAGQPWLXXXXXXXXXXXXXXXXGVQPSGTTSTDAAT-- 187
QNN N+ QYQ P PP G G QP+ TT +AT
Sbjct: 151 APQNNFNSTGQYQPVPQIQ-PPTG-----SSSQSITPGTAPQSNGEQPTVTTVMPSATII 204
Query: 188 -PATNQSSASDWQEHTSADGRRYYYNKRTRQSSWEKPLELMSPIERADASTVWKEFTSSD 246
P + S+SDW EHTSA GRR+YYNKRT+ SSWEKP ELM+PIER DAST WKE+TS D
Sbjct: 205 QPHLAKGSSSDWIEHTSATGRRFYYNKRTKLSSWEKPFELMTPIERVDASTNWKEYTSPD 264
Query: 247 GRKYYYNKVTQQSTWSIPEELKLAREQAHREMNQGMQSEXXXXXXXXXXXXXXXXXXXXX 306
GRKYYYNK+T++S W IPEELK AREQ + M G E
Sbjct: 265 GRKYYYNKITKESKWLIPEELKFAREQVGKAMVNGTLPEPLLTPCTQPSANSVTEAMPSA 324
Query: 307 XXXXXXXXXXXXXXPSPVTPIAATD----QQLLVSGSIVTSNPTGV--------EPSNVA 354
P V P+ T Q + SGS +PT + EP
Sbjct: 325 DNSSVPAQGEQTS-PISVAPVVTTSPSNLQSEITSGS--RDSPTAITITGTEVDEPEVPV 381
Query: 355 TMSTVPTTVAGSSEVAAKLLDSKMPSI--IENQASQD-LGSVNGASLQDVEEAKRGLPVV 411
+ T + GS + +++ + + N ++QD +GS +G +D E+ K + +
Sbjct: 382 NIITPSDSSLGSDKAFVSDINTAATPMNDVSNVSAQDTVGSADGVLGEDKEDGK--IDSI 439
Query: 412 GK-VNITPPEEKTNDDETLVYANKQEAKNAFKAFLESVNVQSDWTWEQAMREIINDKRYN 470
G+ VN E K+ + E+ VYANK EAK+AFKA LESVNV SDW WE+AMR IINDKRY
Sbjct: 440 GENVNDVASETKSVEPESFVYANKMEAKDAFKALLESVNVGSDWNWERAMRLIINDKRYG 499
Query: 471 ALKTLGERKQAFNEYLGQRKKLEAEERRMKQKRAREEFTKMLEDCKELTSSTRWSKAIIM 530
ALK+LGERKQAFNEYL QRKK EAEE+RMK K+ARE+F KMLE+ ELTSS R+SKAI +
Sbjct: 500 ALKSLGERKQAFNEYLSQRKKQEAEEKRMKHKKAREDFRKMLEESTELTSSIRYSKAIAI 559
Query: 531 FENDERFNAVERPRDRADLFESYLVXXXXXXXXXXXXXXXXNIAEYRKFLESCDYVKVNS 590
FEND+RF AVER RDR D+ ES+L N EYRKFLESCD++K N+
Sbjct: 560 FENDDRFKAVERERDRKDMIESFLEELLNKERAKVLEERKRNTVEYRKFLESCDFIKANT 619
Query: 591 QWRKIQXXXXXXXXXXXXXXXXXXXVYQDYIRDLEKEEEEQKRIHKDRVRRGERKNRDAF 650
Q+RK+Q ++QDY+RDLEKEEEEQK+I K+ +R+ ERKNRD F
Sbjct: 620 QYRKVQDRLEADERCSQLEKIDRLEIFQDYLRDLEKEEEEQKKIQKEELRKTERKNRDEF 679
Query: 651 RKLLEEHVAAGVLTGRTQWREYCLKVRDLPQYQAVASNTSGSTPKDLFEDVAEDLEKQYH 710
RKL++EH +G+LT +T WR+Y +V+DLP Y AVASNTSGSTPK+LFEDV E+LEKQY
Sbjct: 680 RKLMDEHSTSGILTAKTHWRDYHSQVKDLPAYLAVASNTSGSTPKELFEDVVEELEKQYQ 739
Query: 711 EDKILIKDIIKSGKXXXXXXSVFEDFKLAVLEEAACQRISEINXXXXXXXXXXXXXXXXX 770
E+K IKD +KS K FEDFK A+ E + IS+ N
Sbjct: 740 EEKSQIKDAVKSAKITLSSTWTFEDFKSALSEHISSPPISDSNLKLVFDEVLERAREKEE 799
Query: 771 XXXXXXXXXXDDFTNLLYTFKEITITSKWEDCKPLFEETQEYRSIGDESYSREIFEEYIT 830
D F +LLY+ K+IT +SKWED + L E++QE+RS+GD S S+++FE Y+
Sbjct: 800 KEAKKRKRLADAFFHLLYSTKDITESSKWEDFRQLLEDSQEFRSVGDVSLSKQMFEVYVA 859
Query: 831 YL 832
L
Sbjct: 860 QL 861
>Medtr4g074440.1 | pre-mRNA-processing protein 40C | HC |
chr4:28340546-28354758 | 20130731
Length = 959
Score = 102 bits (254), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 130/601 (21%), Positives = 230/601 (38%), Gaps = 115/601 (19%)
Query: 186 ATPATNQSSASD----WQEHTSADGRRYYYNKRTRQSSWEKPLELM----------SPIE 231
AT N+ +A+D W H + G YYYN T QS+++KP +P+
Sbjct: 321 ATVTQNEDAANDQLDAWTAHKTEAGIVYYYNALTGQSTYDKPAGFKGEAHQVSVQPTPVS 380
Query: 232 RAD-ASTVWKEFTSSDGRKYYYNKVTQQSTWSIPEELKLAREQAHREMNQGMQSEXXXXX 290
D T W+ ++SDG+KYYYN T+ S W IP E+ +++ ++ +
Sbjct: 381 MVDLPGTDWQLVSTSDGKKYYYNNRTKTSCWQIPNEVAELKKKQDSDVTKDHP------- 433
Query: 291 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPSPVTPIAATDQQLLVSGSIVTSNPTGVEP 350
TP+ T+ +V N +
Sbjct: 434 ----------------------------------TPVPNTNVLSERGSGMVALNAPAITT 459
Query: 351 SNVATMSTVPTTVAGSSE----VAAKLLDSKMP---SIIENQASQDLGSVNGA------- 396
+++ P V S + KL +S P S I + Q NG+
Sbjct: 460 GGRDAVASKPFIVQSSPSALDLIKKKLQESGAPVTSSSIPTPSVQPGSESNGSKATDSTA 519
Query: 397 -SLQDVEEAKRGLPVVGKVNITPPEEKTNDDETLVYANKQEAKNAFKAFLESVNVQSDWT 455
SLQ+ + G N++ + D+++ +K+E N FK L+ V
Sbjct: 520 KSLQNDNSKDKQKDANGDANVSDTSSDSEDEDS--GPSKEECINQFKEMLKERGVAPFSK 577
Query: 456 WEQAMREIINDKRYNALKTLGERKQAFNEYLGQRKKLEAEERRMKQKRAREEFTKML--- 512
WE+ + +I+ D R+ A+ + R+ F Y+ R + E +E+R QK A E F ++L
Sbjct: 578 WEKELPKIVFDPRFKAIPSYSARRSLFEHYVKNRAEEERKEKRAAQKAAIEGFKQLLDEA 637
Query: 513 -EDCKELTSSTRWSKAIIMFENDERFNAVERPRDRADLFESYLVXXXXXXXXXXXXXXXX 571
ED + T S + K + ND RF A++R ++R L ++
Sbjct: 638 SEDIDDKTDSHTFRKK---WGNDPRFEALDR-KEREHLLNERVLPLKKATEEKAQAMRDA 693
Query: 572 NIAEYRKFLESCDYVKVNSQWRKIQXXXXXXXXXXXXXXXXXXXVYQDYIRDLE------ 625
++ L+ + NS+W +++ ++ +YI +L+
Sbjct: 694 AADSFKSMLKEQGEITFNSRWSRVKESLRDDPRYKSVKHEDRELLFNEYISELKAVEHAA 753
Query: 626 KEEEEQKRIHKD---------------------RVRRGERKNR--DAFRKLLEEHVAAGV 662
+ E KR +D RVR R+ +F+ LL E + +
Sbjct: 754 ERETRAKREEQDKLRERERELRKRKEREEHEMERVRLKIRRKEAVTSFQALLVERIKDPM 813
Query: 663 LTGRTQWREYCLKVRDLPQYQAVASNTSGSTPKDLFEDVAEDLEKQYHED-KILIKDIIK 721
+ W E K+ PQ +A S+ + + LF D + L+++ D + L+ + +
Sbjct: 814 AS----WTESKPKLEKDPQGRATNSDLDSADMEKLFRDHVKMLQERRARDFRALLAEFLT 869
Query: 722 S 722
S
Sbjct: 870 S 870
>Medtr4g074440.2 | pre-mRNA-processing protein 40C | HC |
chr4:28340546-28354758 | 20130731
Length = 711
Score = 62.4 bits (150), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 50/183 (27%), Positives = 84/183 (45%), Gaps = 13/183 (7%)
Query: 368 EVAAKLLDSKMPSIIENQASQDLGS----VNGASLQDVEEAKRGLPVVGKVNITPPEEKT 423
E A + S +P+ S+ GS SLQ+ + G N++ +
Sbjct: 488 ESGAPVTSSSIPTPSVQPGSESNGSKATDSTAKSLQNDNSKDKQKDANGDANVSDTSSDS 547
Query: 424 NDDETLVYANKQEAKNAFKAFLESVNVQSDWTWEQAMREIINDKRYNALKTLGERKQAFN 483
D+++ +K+E N FK L+ V WE+ + +I+ D R+ A+ + R+ F
Sbjct: 548 EDEDS--GPSKEECINQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPSYSARRSLFE 605
Query: 484 EYLGQRKKLEAEERRMKQKRAREEFTKML----EDCKELTSSTRWSKAIIMFENDERFNA 539
Y+ R + E +E+R QK A E F ++L ED + T S + K + ND RF A
Sbjct: 606 HYVKNRAEEERKEKRAAQKAAIEGFKQLLDEASEDIDDKTDSHTFRKK---WGNDPRFEA 662
Query: 540 VER 542
++R
Sbjct: 663 LDR 665
Score = 61.2 bits (147), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 35/97 (36%), Positives = 50/97 (51%), Gaps = 15/97 (15%)
Query: 186 ATPATNQSSASD----WQEHTSADGRRYYYNKRTRQSSWEKPLELM----------SPIE 231
AT N+ +A+D W H + G YYYN T QS+++KP +P+
Sbjct: 321 ATVTQNEDAANDQLDAWTAHKTEAGIVYYYNALTGQSTYDKPAGFKGEAHQVSVQPTPVS 380
Query: 232 RAD-ASTVWKEFTSSDGRKYYYNKVTQQSTWSIPEEL 267
D T W+ ++SDG+KYYYN T+ S W IP E+
Sbjct: 381 MVDLPGTDWQLVSTSDGKKYYYNNRTKTSCWQIPNEV 417