Miyakogusa Predicted Gene
- Lj0g3v0048119.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0048119.1 Non Chatacterized Hit- tr|I1M102|I1M102_SOYBN
Uncharacterized protein OS=Glycine max PE=4
SV=1,69.57,0,Homeodomain-like,Homeodomain-like; seg,NULL,CUFF.2242.1
(1337 letters)
Database: Glyma1.pep
75,778 sequences; 25,431,882 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Glyma13g27500.1 1490 0.0
Glyma15g11480.1 1444 0.0
>Glyma13g27500.1
Length = 1455
Score = 1490 bits (3858), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 824/1367 (60%), Positives = 923/1367 (67%), Gaps = 54/1367 (3%)
Query: 1 MNQNEVSRSDSGDRILK-HVXXXXXXXXXXXXXXXXXXFNPFLKGTXXXXXXXXXXXEVD 59
MNQN +S SDS R LK FNPFLK T EVD
Sbjct: 16 MNQNALSSSDSHGRTLKPEEDKEVGEEGEEEDEDADADFNPFLKETLSQEASSSLSSEVD 75
Query: 60 GLDGNVVNSASAE-------LSKEKNCATVDSEHGEEEIVLQSPGMISQSEANQEKDNGL 112
GLDGNVV S + +KE+ VD+EHGEEEI+LQS MISQSE NQEK N L
Sbjct: 76 GLDGNVVTSGPSGGSGLSKVTTKEQIYTVVDTEHGEEEIILQSSSMISQSEINQEKHNDL 135
Query: 113 TSLTDGNGFRRGELNKTTKPQSPVIDIDSDDAICKRTRARYSLASFXXXXXXXXXXXXXX 172
TS TDGNG R EL+ TK SPVIDID++DAIC RTRARYSLASF
Sbjct: 136 TSATDGNG-SRTELSNKTKSWSPVIDIDNEDAICMRTRARYSLASFTLDQLETFLQETDD 194
Query: 173 XXXXXXXXXXXXYKKFLAAVLQDGDGDGLSTHGXXXXXXXXXXXXXXXXXXXXXXXXXXX 232
YKKFLAAVLQ G+GDGL TH
Sbjct: 195 DDDLQNADDEEEYKKFLAAVLQGGNGDGLLTH-ENENLDDDEDNDADFEIELEELLESDA 253
Query: 233 XXNAVVKTQKEYDGSGRRPETRQNKRRKVSAQSERKPLGQFKRPLRPILPNWQNEPPASG 292
NA VK +KEYDG+GRRPETRQNKR+KVSAQ E+K L + KRPLRPILP W N P SG
Sbjct: 254 DDNATVKPRKEYDGAGRRPETRQNKRQKVSAQCEKKILKEVKRPLRPILP-WLNGPLPSG 312
Query: 293 KDLMPEANPSFQSSASGNAPVNGFTPAQIGQLHSLIHEHVQLLIQVFSLSVLDPPQKHTA 352
K L+P+A SFQSSASGN VNGFTP QIGQLH LIHEHVQLLIQVFSLSVL+P QK A
Sbjct: 313 KGLIPDATLSFQSSASGNGLVNGFTPQQIGQLHCLIHEHVQLLIQVFSLSVLEPSQKQVA 372
Query: 353 SQVQSLLFEMLHKRDEVLASKGMPYPTVCFTPSFACTSASNGTSKFVPGQLTKASASTQD 412
SQVQ LLFEMLHKRDE+LA K +PYP+VCFTPSFAC+S +G SKF+ Q + QD
Sbjct: 373 SQVQGLLFEMLHKRDEILALKRVPYPSVCFTPSFACSSVFDGGSKFIQAQCNIEYSPPQD 432
Query: 413 VXXXXXXXXXXXXXEDLNRQRSCFQDIESTLRVPSVH----SVLDVSPLNFVRRYVNGIY 468
E LNRQR FQ ES+ VP V S+LDVSPL+ +RRYV+ I
Sbjct: 433 AQNVWLSQSNQRSSEGLNRQRG-FQVTESSFWVPFVRGPVLSILDVSPLDLIRRYVDDIN 491
Query: 469 SASQEFRKRYIESGCEFS-FEKEPLFPSSSSFTEASTGVSSGTIPGAVRAVSTSPGQQQP 527
SA+QEFRKRYIESG S +KEPLFP SS EA+ +S GTI AV AVS S G+Q+P
Sbjct: 492 SAAQEFRKRYIESGSSDSPVQKEPLFPVSSPVAEANGEISRGTISRAVNAVSPSTGKQRP 551
Query: 528 KKTLAAMLVESTK-QSSALVPKEVAKLTQRFFSDLFNPELFPHKPPPPAVVNRILFTDSE 586
KKTLAAMLVESTK QS ALV KEVAKL QRF + LFNP LFPHKPPP AVVNRILFTDSE
Sbjct: 552 KKTLAAMLVESTKKQSIALVQKEVAKLAQRFLA-LFNPALFPHKPPPAAVVNRILFTDSE 610
Query: 587 DELLSLGILEYNTDWKAIQQRFLPCKSKHQIFVRQKNRSSAKASENPIKAVRRMKTSPLT 646
DELL+LGI+EYNTDWKAIQQRFLPCK+KHQIFVRQKNR S+KASENPIKAVRRMKTSPLT
Sbjct: 611 DELLALGIMEYNTDWKAIQQRFLPCKTKHQIFVRQKNRCSSKASENPIKAVRRMKTSPLT 670
Query: 647 AEEIACIQEGLKHYKFDWMSVWQFIVPYRDPALLPRQWRVACGTQKSYKVDASKKDKRRS 706
AEEIACIQEGLK YK DW VWQ+IVP+RDP+LLPRQWR+A GTQKSYK+DASK++KRR
Sbjct: 671 AEEIACIQEGLKLYKCDWTLVWQYIVPHRDPSLLPRQWRIALGTQKSYKIDASKREKRRL 730
Query: 707 YESEXXXXXXXXXXXXXXISDKEDCEAEIMGSENCMDYSDVPYVHQAFLADWRPDTSAGT 766
YES ISDKEDC+AEI GSE CM VPYVHQAFLADWRPDTS T
Sbjct: 731 YESNRRKSKALESWRA--ISDKEDCDAEIAGSE-CMYSEVVPYVHQAFLADWRPDTSTLT 787
Query: 767 YSEHISSTSGEGNLARDAVSRENIQFYRAINDYGLSGKVQHQN-YQHAFPFTPKFPQLFH 825
Y E IS+TSGEGN+A +A S+E+IQFYR +DYGLSGKV HQN Q A P K PQ FH
Sbjct: 788 YPERISTTSGEGNVAHNAFSQEDIQFYRGTHDYGLSGKVPHQNGNQSALPSVSKLPQPFH 847
Query: 826 TTSE-RSGTKGAPCADNPKNSVF--TSSTNYYYRPYRSRKVNNAHLVKLAPDLPPVNLPP 882
T S+ R+G KG P NPK VF TSS+ YY RPYRSR+ +NAHLVKLAPDLPPVNLPP
Sbjct: 848 TMSDLRNGMKGVPSTINPKKPVFDVTSSSKYYCRPYRSRRAHNAHLVKLAPDLPPVNLPP 907
Query: 883 SVRVVSQTAFKGFQCGTSKMYPPGGSVTGCRIDNIASQIPHGEKYGTAHPVEGAGPVLNG 942
SVRVVSQTAFKGFQCGTSK++PPG V CR D ASQ PHGEK HPV+GA P L
Sbjct: 908 SVRVVSQTAFKGFQCGTSKVHPPGAGVAACRKDYSASQTPHGEKSENVHPVKGARPTLED 967
Query: 943 SVTGSQVERSGTAEGRSTVAEKSTCSELQMHPLLFQATNEGNIPYYPLKXXXXXXXXXXX 1002
SVTGSQ+ERS T EG S VAEK T ++LQMHPLLFQ T +GN PY PLK
Sbjct: 968 SVTGSQLERSETVEGESLVAEKGTRTDLQMHPLLFQVTEDGNAPYCPLKFSSGTSSSFSF 1027
Query: 1003 XXXXQPQLNLSLFHSSQKQSSIDCASKSLRSKNSMLRSAGIDFHPLLQKFNDTQSQACSD 1062
QPQLNLSLFHSSQ+QS IDCA+KSL+SK+S LRS GIDFHPLLQK +DTQS D
Sbjct: 1028 FSGSQPQLNLSLFHSSQQQSHIDCANKSLKSKDSTLRSGGIDFHPLLQKSDDTQSPTSFD 1087
Query: 1063 DIQAESLVNSGVLATANRSSGLNDKSNELDLDIHLSSVSRNGKSVKSKQLEVQDPIGSKK 1122
IQ ESLVNSGV A ANRSSGLNDKSNELDL+IHLSSVS KSVKS+QL+ DP+GSKK
Sbjct: 1088 AIQPESLVNSGVQAIANRSSGLNDKSNELDLEIHLSSVSGREKSVKSRQLKAHDPVGSKK 1147
Query: 1123 TASTCGTAMKCQENSVPCRQQGVENPTTSCCELTSSAPLVVPDDNITRYDVDDVGDQSHP 1182
T + GT+MK QE++ P Q GVEN + CEL SSAPLVV DNITRYDVDD+GDQSHP
Sbjct: 1148 TVAISGTSMKPQEDTAPYCQHGVENLSAGSCELASSAPLVVSSDNITRYDVDDIGDQSHP 1207
Query: 1183 GIVMXXXXXXXXXXXXXXHVXXXXXXXXXXXXXXXXXXXNALKVQNKEV----------- 1231
IVM HV AL+VQNKEV
Sbjct: 1208 EIVMEQEELSDSEEDIEEHVEFECEEMTDSEGEDGSGCEQALEVQNKEVPISSEENVVKY 1267
Query: 1232 ---------PRDNPGTQV-----TNIATLDMGLTREGKDGKNKNSWLSLDSSSTDNFEFP 1277
PR N GT+V N TL++ LT EG+D ++ +SWLSLDS + DN
Sbjct: 1268 MDCMKKPCEPRANYGTEVDGGLLRNSTTLNIALTNEGQDDRSNSSWLSLDSCTADNPVLS 1327
Query: 1278 KAMLQRGADTIAEEDTASRNSTICKAVTEARHPMHIAQQRSAGPQAS 1324
KA+LQ+ + E +AS+N +I KAV E RH + + Q S GP S
Sbjct: 1328 KAILQQ---STLGEASASKNFSIGKAVREERHTVDMVHQLSVGPHVS 1371
>Glyma15g11480.1
Length = 1439
Score = 1444 bits (3739), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 818/1371 (59%), Positives = 923/1371 (67%), Gaps = 64/1371 (4%)
Query: 1 MNQNEVSRSDSGDRILKHVXXXXXXXXXXXXXXXXXXFNPFLKGTXXXXXXXXXXXEVDG 60
MNQN +S DS R LK FNPFLK T EVDG
Sbjct: 16 MNQNALSSPDSHGRTLK---PEEDKEVGEEEEDEDADFNPFLKETLSQEASSSLSSEVDG 72
Query: 61 LDGNVVNSA---SAELSK----EKNCATVDSEHGEEEIVLQSPGMISQSEANQEKDNGLT 113
LDGNVV S +ELSK E+ C V +EHGEEEI+LQS MISQSE NQEK N LT
Sbjct: 73 LDGNVVTSGPSGGSELSKVTTKEQICTVVHNEHGEEEIILQSSSMISQSEINQEKHNDLT 132
Query: 114 SLTDGNGFRRGELNKTTKPQSPVIDIDSDDAICKRTRARYSLASFXXXXXXXXXXXXXXX 173
S+TDGNG R GEL+ K +SPVIDID++DAIC RTRARYSLASF
Sbjct: 133 SVTDGNGSRIGELSNKIKSRSPVIDIDNEDAICMRTRARYSLASFTLDELETFLQETDDD 192
Query: 174 XXXXXXXXXXXYKKFLAAVLQDGDGDGLSTHGXXXXXXXXXXXXXXXXXXXXXXXXXXXX 233
YKKFLAAVLQ G+GDGLSTH
Sbjct: 193 DDLQNADDEEEYKKFLAAVLQGGEGDGLSTH-ENENLDDDEDNDADFEIELEELLESDAD 251
Query: 234 XNAVVKTQKEYDGSGRRPETRQNKRRKVSAQSERKPLGQFKRPLRPILPNWQNEPPASGK 293
NA VKT+KEYDG+GRRPETRQNKR+KVSAQ E+K LG+ KRPLRPILP W N P SGK
Sbjct: 252 DNATVKTRKEYDGAGRRPETRQNKRQKVSAQCEKKTLGEVKRPLRPILP-WLNGPLPSGK 310
Query: 294 DLMPEANPSFQSSASGNAPVNGFTPAQIGQLHSLIHEHVQLLIQVFSLSVLDPPQKHTAS 353
L+P+A SFQSS SGN VNGFTP QIGQLH LIHEHVQLLIQVFSLSVL+P QK AS
Sbjct: 311 GLIPDATLSFQSSTSGNGLVNGFTPQQIGQLHCLIHEHVQLLIQVFSLSVLEPSQKQVAS 370
Query: 354 QVQSLLFEMLHKRDEVLASKGMPYPTVCFTPSFACTSASNGTSKFVPGQLTKASASTQDV 413
QVQ LLFEMLHKRDE+LA K +PYP+VCFTPSFAC+S S+G SKFV Q + QD
Sbjct: 371 QVQGLLFEMLHKRDEILALKRVPYPSVCFTPSFACSSVSDGGSKFVQDQCNIEYSPPQDA 430
Query: 414 XXXXXXXXXXXXXEDLNRQRSCFQDIESTLRVPSVHSVLDVSPLNFVRRYVNGI-YSASQ 472
E LNRQR FQ ES+ VP V P+ + + I ++A+Q
Sbjct: 431 QNVWFSQSNQRSSEGLNRQRG-FQATESSFWVPFVRG-----PVQSILEALCEIPFTAAQ 484
Query: 473 EFRKRYIESGCEFSFEKEPLFPSSSSFTEASTGVSSGTIPGAVRAVSTSPGQQQPKKTLA 532
EFRKRYIESG + EKEPLF SS EA+ +S GTI AV AVSTS QQ+PKKTLA
Sbjct: 485 EFRKRYIESGSDSPVEKEPLFTFSSPVAEANGEISRGTISRAVNAVSTSTRQQRPKKTLA 544
Query: 533 AMLVESTK-QSSALVPKEVAKLTQRFFSDLFNPELFPHKPPPPAVVNRILFTDSEDE--- 588
AMLVESTK QS ALV KEVAKL QRF + LFNP LFPHKPPP AVVNRILFTDSEDE
Sbjct: 545 AMLVESTKKQSIALVQKEVAKLAQRFLA-LFNPALFPHKPPPAAVVNRILFTDSEDEYVC 603
Query: 589 ----LLSLGILEYNTDWKAIQQRFLPCKSKHQIFVRQKNRSSAKASENPIKAVRRMKTSP 644
LL+LGI+EYNTDWKAIQQRFLPCKSKHQIFVRQKN S+KA ENPIKAVRRMKTSP
Sbjct: 604 DVDRLLALGIMEYNTDWKAIQQRFLPCKSKHQIFVRQKNHCSSKALENPIKAVRRMKTSP 663
Query: 645 LTAEEIACIQEGLKHYKFDWMSVWQFIVPYRDPALLPRQWRVACGTQKSYKVDASKKDKR 704
LTAEEIACIQEGLK YK DW VWQ+IVP+RDP+LLPRQWR+A GTQKSYK+DASK++KR
Sbjct: 664 LTAEEIACIQEGLKIYKCDWTLVWQYIVPHRDPSLLPRQWRIALGTQKSYKIDASKREKR 723
Query: 705 RSYESEXXXXXXXXXXXXXXISDKEDCEAEIMGSENCMDYSDV-PYVHQAFLADWRPDTS 763
R YES ISDKEDC+AEI GSE CMDYS+V PYVHQAFLADWRP TS
Sbjct: 724 RLYESNRRKLKALESWRA--ISDKEDCDAEIAGSE-CMDYSEVVPYVHQAFLADWRPHTS 780
Query: 764 AGTYSEHISSTSGEGNLARDAVSRENIQFYRAINDYGLSGKVQHQN-YQHAFPFTPKFPQ 822
TY E IS+TS EGN+A +A S+++IQFYR +DYGLSGKV +N Q A P K PQ
Sbjct: 781 TLTYPECISTTSREGNVAHNAFSQKDIQFYRGTHDYGLSGKVPLENGNQSALPSVSKLPQ 840
Query: 823 LFHTTSE-RSGTKGAPCADNPKNSVF--TSSTNYYYRPYRSRKVNNAHLVKLAPDLPPVN 879
LFHTTS+ R+G KGAP NPK VF TSS+ YY RPYRSR+ +NAHLVKLAP LPPVN
Sbjct: 841 LFHTTSDLRNGMKGAPSTINPKKPVFDVTSSSKYYCRPYRSRRAHNAHLVKLAPGLPPVN 900
Query: 880 LPPSVRVVSQTAFKGFQCGTSKMYPPGGSVTGCRIDNIASQIPHGEKYGTAHPVEGAGPV 939
LPPSVR+VSQTAFKGFQCGTSK++ PG V CR DN +SQ PHGEK HPV+GA P
Sbjct: 901 LPPSVRIVSQTAFKGFQCGTSKVHLPGAGVAACRKDNSSSQTPHGEKSENVHPVKGARPT 960
Query: 940 LNGSVTGSQVERSGTAEGRSTVAEKSTCSELQMHPLLFQATNEGNIPYYPLKXXXXXXXX 999
L SVTGSQ+ RS T E S VAEK T S+LQMHPLLFQ T +GN+PYYPLK
Sbjct: 961 LEDSVTGSQLGRSDTVEDGSLVAEKGTSSDLQMHPLLFQVTEDGNVPYYPLKFSSGTSSS 1020
Query: 1000 XXXXXXXQPQLNLSLFHSSQKQSSIDCASKSLRSKNSMLRSAGIDFHPLLQKFNDTQSQA 1059
QPQLNLSLFHSSQ+QS IDCA+KSL+ K+S LRS GIDFHPLLQK +DTQS
Sbjct: 1021 FSFFSGSQPQLNLSLFHSSQQQSHIDCANKSLKLKDSTLRSGGIDFHPLLQKSDDTQSPT 1080
Query: 1060 CSDDIQAESLVNSGVLATANRSSGLNDKSNELDLDIHLSSVSRNGKSVKSKQLEVQDPIG 1119
D IQ ESLVNSGV A A+RSSGLNDKSNELDL+IHLSSVS KSVKS+QL+ DP+G
Sbjct: 1081 SFDAIQPESLVNSGVQAIASRSSGLNDKSNELDLEIHLSSVSGREKSVKSRQLKAHDPVG 1140
Query: 1120 SKKTASTCGTAMKCQENSVPCRQQGVENPTTSCCELTSSAPLVVPDDNITRYDVDDVGDQ 1179
SKKT + GTAMK QE++ P QQGVEN + CEL SSAPLVVP+DNITRYDVDD+GDQ
Sbjct: 1141 SKKTVAISGTAMKPQEDTAPYCQQGVENLSAGSCELASSAPLVVPNDNITRYDVDDIGDQ 1200
Query: 1180 SHPGIVMXXXXXXXXXXXXXXHVXXXXXXXXXXXXXXXXXXXNALKVQNKEV-------- 1231
SHP IVM HV AL+VQNKEV
Sbjct: 1201 SHPEIVMEQEELSDSEEDIEEHVEFECEEMTDSEGEDGSGCEQALEVQNKEVPISSEENV 1260
Query: 1232 ------------PRDNPGTQV-----TNIATLDMGLTREGKDGKNKNSWLSLDSSSTDNF 1274
PR N GT+V TN L++ LT +G+D ++ +SWLSLDS + DN
Sbjct: 1261 VKYMDCMKKPCEPRGNYGTEVDGGLLTNSTALNIALTNDGQDDRSSSSWLSLDSCTADNP 1320
Query: 1275 EFPKAMLQRGADTIAEEDTASRNSTICKAVTEARHPMHIAQQRSAGPQASV 1325
KA+LQ+ TI E +AS+ +I KAV E RH + + QQ S GP S+
Sbjct: 1321 VLSKAILQQS--TIGEA-SASKIFSIGKAVREERHTVDMIQQPSLGPHVSI 1368