Miyakogusa Predicted Gene
- Lj2g3v3022970.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj2g3v3022970.1 tr|E9KID2|E9KID2_MEDTR Root determined nodulation
1 OS=Medicago truncatula GN=RDN1 PE=2 SV=1,86.7,0,seg,NULL; SUBFAMILY
NOT NAMED,NULL; FAMILY NOT NAMED,NULL,CUFF.39629.1
(361 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

AT5G13500.3 | Symbols: | unknown protein; FUNCTIONS IN: molecul...    486   e-137
AT5G13500.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul...    486   e-137
AT5G13500.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...    486   e-137
AT2G25260.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...    436   e-122
AT5G25265.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...    430   e-121
AT3G01720.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...     69   4e-12
>AT5G13500.3 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; EXPRESSED IN: 22 plant structures; EXPRESSED
DURING: 13 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G25265.1);
Has 35333 Blast hits to 34131 proteins in 2444 species:
Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi -
991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610
(source: NCBI BLink). | chr5:4338676-4340827 FORWARD
LENGTH=358
Length = 358
Score = 486 bits (1251), Expect = e-137, Method: Compositional matrix adjust.
Identities = 235/358 (65%), Positives = 276/358 (77%), Gaps = 6/358 (1%)
Query: 7 MGRAKSXXXXXXXXGFFFATYNLVSMIMDHRAGNWVADGLESFDR------KMLGSASTN 60
MG+A GFF TYNL+++I+ +R+G +DG D + + S+
Sbjct: 1 MGKASGLLLFLLGFGFFVVTYNLLTLIVHNRSGVSNSDGSPLLDPVVQMPLNIRKAKSSP 60
Query: 61 AKYHVALTATDAAYSQWQCRIMYYWYKKVKDMPGSNMGKFTRILHSGRTDQLMDEIPTFV 120
A +HVALTATDA Y++WQCRIMYYWYK+ K +PGS+MG FTRILHSG +D LMDEIPTFV
Sbjct: 61 APFHVALTATDAPYNKWQCRIMYYWYKQKKALPGSDMGGFTRILHSGNSDNLMDEIPTFV 120
Query: 121 VDPLPEGLDRGYIVLNRPWAFVQWLEKADIEEEYILMAEPDHIFVNPLPNLASRTQPAGY 180
VDPLP GLDRGY+VLNRPWAFVQWLE+A I+E+Y+LMAEPDH+FVNPLPNLA PA +
Sbjct: 121 VDPLPPGLDRGYVVLNRPWAFVQWLERATIKEDYVLMAEPDHVFVNPLPNLAVGGFPAAF 180
Query: 181 PFFYIKPAENEKIIRKFYPKDKGPVTDVDPIGNSPVIIQKSLIEEIAPTWVNVSLRMKDD 240
PFFYI P + E I+RK+YP + GPVT++DPIGNSPVII K +E+IAPTW+NVSL MK+D
Sbjct: 181 PFFYITPEKYENIVRKYYPAEMGPVTNIDPIGNSPVIISKESLEKIAPTWMNVSLTMKND 240
Query: 241 PETDKAFGWVLEMYAYAVASALHGVKHILRKDFMLQPPWDRHVGKTFIIHYTYGCDYNLK 300
PETDKAFGWVLEMY YA+ASA+HGV+HILRKDFMLQPPWD FIIHYTYGCDYN+K
Sbjct: 241 PETDKAFGWVLEMYGYAIASAIHGVRHILRKDFMLQPPWDLSTKGKFIIHYTYGCDYNMK 300
Query: 301 GELTYGKIGEWRFDKRSYLMXXXXXXXXXXXXXXXESVVRLVKMVNEATANIPEWDSL 358
GELTYGKIGEWRFDKRS+L ESVV LVKMVNEATA IP WD+L
Sbjct: 301 GELTYGKIGEWRFDKRSHLRGPPPRNMSLPPPGVPESVVTLVKMVNEATATIPNWDTL 358
>AT5G13500.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; EXPRESSED IN: 22 plant structures; EXPRESSED
DURING: 13 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G25265.1);
Has 228 Blast hits to 200 proteins in 21 species: Archae
- 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 213;
Viruses - 0; Other Eukaryotes - 15 (source: NCBI BLink).
| chr5:4338676-4340827 FORWARD LENGTH=358
Length = 358
Score = 486 bits (1251), Expect = e-137, Method: Compositional matrix adjust.
Identities = 235/358 (65%), Positives = 276/358 (77%), Gaps = 6/358 (1%)
Query: 7 MGRAKSXXXXXXXXGFFFATYNLVSMIMDHRAGNWVADGLESFDR------KMLGSASTN 60
MG+A GFF TYNL+++I+ +R+G +DG D + + S+
Sbjct: 1 MGKASGLLLFLLGFGFFVVTYNLLTLIVHNRSGVSNSDGSPLLDPVVQMPLNIRKAKSSP 60
Query: 61 AKYHVALTATDAAYSQWQCRIMYYWYKKVKDMPGSNMGKFTRILHSGRTDQLMDEIPTFV 120
A +HVALTATDA Y++WQCRIMYYWYK+ K +PGS+MG FTRILHSG +D LMDEIPTFV
Sbjct: 61 APFHVALTATDAPYNKWQCRIMYYWYKQKKALPGSDMGGFTRILHSGNSDNLMDEIPTFV 120
Query: 121 VDPLPEGLDRGYIVLNRPWAFVQWLEKADIEEEYILMAEPDHIFVNPLPNLASRTQPAGY 180
VDPLP GLDRGY+VLNRPWAFVQWLE+A I+E+Y+LMAEPDH+FVNPLPNLA PA +
Sbjct: 121 VDPLPPGLDRGYVVLNRPWAFVQWLERATIKEDYVLMAEPDHVFVNPLPNLAVGGFPAAF 180
Query: 181 PFFYIKPAENEKIIRKFYPKDKGPVTDVDPIGNSPVIIQKSLIEEIAPTWVNVSLRMKDD 240
PFFYI P + E I+RK+YP + GPVT++DPIGNSPVII K +E+IAPTW+NVSL MK+D
Sbjct: 181 PFFYITPEKYENIVRKYYPAEMGPVTNIDPIGNSPVIISKESLEKIAPTWMNVSLTMKND 240
Query: 241 PETDKAFGWVLEMYAYAVASALHGVKHILRKDFMLQPPWDRHVGKTFIIHYTYGCDYNLK 300
PETDKAFGWVLEMY YA+ASA+HGV+HILRKDFMLQPPWD FIIHYTYGCDYN+K
Sbjct: 241 PETDKAFGWVLEMYGYAIASAIHGVRHILRKDFMLQPPWDLSTKGKFIIHYTYGCDYNMK 300
Query: 301 GELTYGKIGEWRFDKRSYLMXXXXXXXXXXXXXXXESVVRLVKMVNEATANIPEWDSL 358
GELTYGKIGEWRFDKRS+L ESVV LVKMVNEATA IP WD+L
Sbjct: 301 GELTYGKIGEWRFDKRSHLRGPPPRNMSLPPPGVPESVVTLVKMVNEATATIPNWDTL 358
>AT5G13500.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; EXPRESSED IN: 22 plant structures; EXPRESSED
DURING: 13 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G25265.1);
Has 1807 Blast hits to 1807 proteins in 277 species:
Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347;
Plants - 385; Viruses - 0; Other Eukaryotes - 339
(source: NCBI BLink). | chr5:4338676-4340827 FORWARD
LENGTH=358
Length = 358
Score = 486 bits (1251), Expect = e-137, Method: Compositional matrix adjust.
Identities = 235/358 (65%), Positives = 276/358 (77%), Gaps = 6/358 (1%)
Query: 7 MGRAKSXXXXXXXXGFFFATYNLVSMIMDHRAGNWVADGLESFDR------KMLGSASTN 60
MG+A GFF TYNL+++I+ +R+G +DG D + + S+
Sbjct: 1 MGKASGLLLFLLGFGFFVVTYNLLTLIVHNRSGVSNSDGSPLLDPVVQMPLNIRKAKSSP 60
Query: 61 AKYHVALTATDAAYSQWQCRIMYYWYKKVKDMPGSNMGKFTRILHSGRTDQLMDEIPTFV 120
A +HVALTATDA Y++WQCRIMYYWYK+ K +PGS+MG FTRILHSG +D LMDEIPTFV
Sbjct: 61 APFHVALTATDAPYNKWQCRIMYYWYKQKKALPGSDMGGFTRILHSGNSDNLMDEIPTFV 120
Query: 121 VDPLPEGLDRGYIVLNRPWAFVQWLEKADIEEEYILMAEPDHIFVNPLPNLASRTQPAGY 180
VDPLP GLDRGY+VLNRPWAFVQWLE+A I+E+Y+LMAEPDH+FVNPLPNLA PA +
Sbjct: 121 VDPLPPGLDRGYVVLNRPWAFVQWLERATIKEDYVLMAEPDHVFVNPLPNLAVGGFPAAF 180
Query: 181 PFFYIKPAENEKIIRKFYPKDKGPVTDVDPIGNSPVIIQKSLIEEIAPTWVNVSLRMKDD 240
PFFYI P + E I+RK+YP + GPVT++DPIGNSPVII K +E+IAPTW+NVSL MK+D
Sbjct: 181 PFFYITPEKYENIVRKYYPAEMGPVTNIDPIGNSPVIISKESLEKIAPTWMNVSLTMKND 240
Query: 241 PETDKAFGWVLEMYAYAVASALHGVKHILRKDFMLQPPWDRHVGKTFIIHYTYGCDYNLK 300
PETDKAFGWVLEMY YA+ASA+HGV+HILRKDFMLQPPWD FIIHYTYGCDYN+K
Sbjct: 241 PETDKAFGWVLEMYGYAIASAIHGVRHILRKDFMLQPPWDLSTKGKFIIHYTYGCDYNMK 300
Query: 301 GELTYGKIGEWRFDKRSYLMXXXXXXXXXXXXXXXESVVRLVKMVNEATANIPEWDSL 358
GELTYGKIGEWRFDKRS+L ESVV LVKMVNEATA IP WD+L
Sbjct: 301 GELTYGKIGEWRFDKRSHLRGPPPRNMSLPPPGVPESVVTLVKMVNEATATIPNWDTL 358
>AT2G25260.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; BEST Arabidopsis thaliana protein match is:
unknown protein (TAIR:AT5G25265.1); Has 35333 Blast hits
to 34131 proteins in 2444 species: Archae - 798;
Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants -
531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI
BLink). | chr2:10755617-10757766 REVERSE LENGTH=358
Length = 358
Score = 436 bits (1121), Expect = e-122, Method: Compositional matrix adjust.
Identities = 198/302 (65%), Positives = 240/302 (79%)
Query: 56 SASTNAKYHVALTATDAAYSQWQCRIMYYWYKKVKDMPGSNMGKFTRILHSGRTDQLMDE 115
S T +H A+TATD+ YS WQCR+MYYWY + +D PGS+MG +TRILHSGR D LMDE
Sbjct: 57 SKKTKRLFHTAVTATDSVYSTWQCRVMYYWYNRFRDEPGSDMGGYTRILHSGRPDGLMDE 116
Query: 116 IPTFVVDPLPEGLDRGYIVLNRPWAFVQWLEKADIEEEYILMAEPDHIFVNPLPNLASRT 175
IPTFV DPLP G+D+GY+VLNRPWAFVQWL++A IEE+YILMAEPDHI V P+PNLA
Sbjct: 117 IPTFVADPLPSGVDKGYVVLNRPWAFVQWLQQAHIEEDYILMAEPDHIIVKPIPNLARGN 176
Query: 176 QPAGYPFFYIKPAENEKIIRKFYPKDKGPVTDVDPIGNSPVIIQKSLIEEIAPTWVNVSL 235
A +PFFYI+P + E ++RKF+PK+ GP++ +DPIGNSPVI+ K+ + +IAPTW+NVSL
Sbjct: 177 LAAAFPFFYIEPKKYESVLRKFFPKENGPISRIDPIGNSPVIVTKNALMKIAPTWMNVSL 236
Query: 236 RMKDDPETDKAFGWVLEMYAYAVASALHGVKHILRKDFMLQPPWDRHVGKTFIIHYTYGC 295
MK+DP+TDKAFGWVLEMYAYAV+SALHGV +IL KDFM+QPPWD KTFIIHYTYGC
Sbjct: 237 AMKNDPQTDKAFGWVLEMYAYAVSSALHGVSNILHKDFMIQPPWDTETKKTFIIHYTYGC 296
Query: 296 DYNLKGELTYGKIGEWRFDKRSYLMXXXXXXXXXXXXXXXESVVRLVKMVNEATANIPEW 355
D+++KG++ GKIGEWRFDKRSY ESVV LV M+NEATANIP W
Sbjct: 297 DFDMKGKMMVGKIGEWRFDKRSYGDKPPPRNLTLPPRGVPESVVTLVTMINEATANIPNW 356
Query: 356 DS 357
+S
Sbjct: 357 ES 358
>AT5G25265.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: plasma membrane,
membrane; EXPRESSED IN: cultured cell, leaf; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT2G25260.1); Has 30201 Blast hits to 17322
proteins in 780 species: Archae - 12; Bacteria - 1396;
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
0; Other Eukaryotes - 2996 (source: NCBI BLink). |
chr5:8754794-8756855 REVERSE LENGTH=366
Length = 366
Score = 430 bits (1105), Expect = e-121, Method: Compositional matrix adjust.
Identities = 199/309 (64%), Positives = 244/309 (78%), Gaps = 6/309 (1%)
Query: 55 GSASTNAK----YHVALTATDAAYSQWQCRIMYYWYKKVKDM--PGSNMGKFTRILHSGR 108
GS + + K +H A+TA+D+ Y+ WQCR+MYYW+KK++ PGS MG FTRILHSG+
Sbjct: 58 GSRNNDGKRIRLFHTAVTASDSVYNTWQCRVMYYWFKKIQASAGPGSEMGGFTRILHSGK 117
Query: 109 TDQLMDEIPTFVVDPLPEGLDRGYIVLNRPWAFVQWLEKADIEEEYILMAEPDHIFVNPL 168
DQ MDEIPTFV PLP G+D+GY+VLNRPWAFVQWL++ DI+E+YILM+EPDHI V P+
Sbjct: 118 PDQYMDEIPTFVAQPLPSGMDQGYVVLNRPWAFVQWLQQTDIKEDYILMSEPDHIIVKPI 177
Query: 169 PNLASRTQPAGYPFFYIKPAENEKIIRKFYPKDKGPVTDVDPIGNSPVIIQKSLIEEIAP 228
PNLA A +PFFYI+P + EK++RK+YP+ +GPVT++DPIGNSPVI+ K +++IAP
Sbjct: 178 PNLAKDGLGAAFPFFYIEPKKYEKVLRKYYPEVRGPVTNIDPIGNSPVIVGKDALKKIAP 237
Query: 229 TWVNVSLRMKDDPETDKAFGWVLEMYAYAVASALHGVKHILRKDFMLQPPWDRHVGKTFI 288
TW+NVSL MK DPE DKAFGWVLEMYAYAV+SALHGV +IL KDFM+QPPWD VG +I
Sbjct: 238 TWMNVSLAMKKDPEADKAFGWVLEMYAYAVSSALHGVSNILHKDFMIQPPWDIEVGDKYI 297
Query: 289 IHYTYGCDYNLKGELTYGKIGEWRFDKRSYLMXXXXXXXXXXXXXXXESVVRLVKMVNEA 348
IHYTYGCDY++KG+LTYGKIGEWRFDKRSY +SVV LVKM+NEA
Sbjct: 298 IHYTYGCDYDMKGKLTYGKIGEWRFDKRSYDSKPPPRNLTMPPPGVSQSVVTLVKMINEA 357
Query: 349 TANIPEWDS 357
TANIP W S
Sbjct: 358 TANIPNWGS 366
>AT3G01720.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; EXPRESSED IN: 24 plant structures; EXPRESSED
DURING: 13 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G25265.1);
Has 374 Blast hits to 211 proteins in 23 species: Archae
- 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 316;
Viruses - 0; Other Eukaryotes - 58 (source: NCBI BLink).
| chr3:262412-265608 REVERSE LENGTH=802
Length = 802
Score = 68.9 bits (167), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 63/280 (22%), Positives = 119/280 (42%), Gaps = 42/280 (15%)
Query: 58 STNAKYHVALTATDAAYSQWQCRIMYYWYKKVKDMPGSNMGKFTRILHSGRTDQLM---- 113
T K H + Y WQ + +++ G TR+L TD+ +
Sbjct: 389 GTYPKIHTLFSTECTTYFDWQTVGFMHSFRQ-----SGQPGNITRLLSC--TDEALKNYK 441
Query: 114 --DEIPTFVVDPLPEGLDRG--YIVLNRPWAFVQWLEKADIEEEYILMAEPDHIF---VN 166
D PT V + G Y +N+P A V WL +I+ EY+++ + D I +
Sbjct: 442 GHDLAPTHYVPSMSRHPLTGDWYPAINKPAAVVHWLHHTNIDAEYVVILDADMILRGPIT 501
Query: 167 PLPNLASRTQPAGYPFFYIKPAENEKIIRKFYPKDKGPVTDVDPIGNSPVIIQKSLIEEI 226
P A+R +P P+ Y+ +N+ + + + ++ V + +I+ + +
Sbjct: 502 PWEFKAARGRPVSTPYDYLIGCDND--LARLHTRNPEACDKVGGV----IIMHIEDLRKF 555
Query: 227 APTWVNVSLRMKDDPE------TDKAF--GWVLEMYAYAVASALHGVKHILRKDFMLQPP 278
A W+ + ++ D E T + GW+ EMY Y+ +A ++H + K+ M+ P
Sbjct: 556 AMYWLLKTQEVRADKEHYGKELTGDIYESGWISEMYGYSFGAAELNLRHSINKEIMIYPG 615
Query: 279 WDRHVGKTFIIHYTYGCDYNLKGELTYGKIGEWRFDKRSY 318
+ G + + + YG ++ K+G W FDK ++
Sbjct: 616 YVPEPGADYRV-FHYGLEF---------KVGNWSFDKANW 645
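
The "Identities / Positives / Gaps" lines above each report a count, the aligned length, and a whole-number percentage. As a minimal sketch (not part of the BLAST output), the Python below parses one such line and recomputes the percentage. The function names `parse_stats` and `floor_percent` are hypothetical, and the round-down rule is an assumption inferred from the values in this report (for example, 235/358 is about 65.6% but is printed as 65%), not a statement about BLAST's source code.

```python
import re

# Matches one field of a statistics line, e.g. "Identities = 235/358 (65%)".
# The three field names are the ones that appear in this report.
_STAT = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_stats(line):
    """Return {field: (count, aligned_length, reported_percent)} for one line."""
    return {
        name: (int(count), int(total), int(pct))
        for name, count, total, pct in _STAT.findall(line)
    }


def floor_percent(count, total):
    """Whole-number percentage, rounded down (assumed rule, see lead-in)."""
    return 100 * count // total


if __name__ == "__main__":
    line = "Identities = 63/280 (22%), Positives = 119/280 (42%), Gaps = 42/280 (15%)"
    for name, (count, total, pct) in parse_stats(line).items():
        # Print whether the recomputed value agrees with the reported one.
        print(name, count, total, pct, floor_percent(count, total) == pct)
```

A line without a "Gaps" field, such as the one for AT2G25260.1, simply yields a dictionary without that key.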