Miyakogusa Predicted Gene

Lj2g3v3022970.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj2g3v3022970.1 tr|E9KID2|E9KID2_MEDTR Root determined nodulation
1 OS=Medicago truncatula GN=RDN1 PE=2 SV=1,86.7,0,seg,NULL; SUBFAMILY
NOT NAMED,NULL; FAMILY NOT NAMED,NULL,CUFF.39629.1
         (361 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G13500.3 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   486   e-137
AT5G13500.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   486   e-137
AT5G13500.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   486   e-137
AT2G25260.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   436   e-122
AT5G25265.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   430   e-121
AT3G01720.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    69   4e-12

>AT5G13500.3 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: endomembrane
           system; EXPRESSED IN: 22 plant structures; EXPRESSED
           DURING: 13 growth stages; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT5G25265.1);
           Has 35333 Blast hits to 34131 proteins in 2444 species:
           Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi -
           991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610
           (source: NCBI BLink). | chr5:4338676-4340827 FORWARD
           LENGTH=358
          Length = 358

 Score =  486 bits (1251), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 235/358 (65%), Positives = 276/358 (77%), Gaps = 6/358 (1%)

Query: 7   MGRAKSXXXXXXXXGFFFATYNLVSMIMDHRAGNWVADGLESFDR------KMLGSASTN 60
           MG+A          GFF  TYNL+++I+ +R+G   +DG    D        +  + S+ 
Sbjct: 1   MGKASGLLLFLLGFGFFVVTYNLLTLIVHNRSGVSNSDGSPLLDPVVQMPLNIRKAKSSP 60

Query: 61  AKYHVALTATDAAYSQWQCRIMYYWYKKVKDMPGSNMGKFTRILHSGRTDQLMDEIPTFV 120
           A +HVALTATDA Y++WQCRIMYYWYK+ K +PGS+MG FTRILHSG +D LMDEIPTFV
Sbjct: 61  APFHVALTATDAPYNKWQCRIMYYWYKQKKALPGSDMGGFTRILHSGNSDNLMDEIPTFV 120

Query: 121 VDPLPEGLDRGYIVLNRPWAFVQWLEKADIEEEYILMAEPDHIFVNPLPNLASRTQPAGY 180
           VDPLP GLDRGY+VLNRPWAFVQWLE+A I+E+Y+LMAEPDH+FVNPLPNLA    PA +
Sbjct: 121 VDPLPPGLDRGYVVLNRPWAFVQWLERATIKEDYVLMAEPDHVFVNPLPNLAVGGFPAAF 180

Query: 181 PFFYIKPAENEKIIRKFYPKDKGPVTDVDPIGNSPVIIQKSLIEEIAPTWVNVSLRMKDD 240
           PFFYI P + E I+RK+YP + GPVT++DPIGNSPVII K  +E+IAPTW+NVSL MK+D
Sbjct: 181 PFFYITPEKYENIVRKYYPAEMGPVTNIDPIGNSPVIISKESLEKIAPTWMNVSLTMKND 240

Query: 241 PETDKAFGWVLEMYAYAVASALHGVKHILRKDFMLQPPWDRHVGKTFIIHYTYGCDYNLK 300
           PETDKAFGWVLEMY YA+ASA+HGV+HILRKDFMLQPPWD      FIIHYTYGCDYN+K
Sbjct: 241 PETDKAFGWVLEMYGYAIASAIHGVRHILRKDFMLQPPWDLSTKGKFIIHYTYGCDYNMK 300

Query: 301 GELTYGKIGEWRFDKRSYLMXXXXXXXXXXXXXXXESVVRLVKMVNEATANIPEWDSL 358
           GELTYGKIGEWRFDKRS+L                ESVV LVKMVNEATA IP WD+L
Sbjct: 301 GELTYGKIGEWRFDKRSHLRGPPPRNMSLPPPGVPESVVTLVKMVNEATATIPNWDTL 358


>AT5G13500.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: endomembrane
           system; EXPRESSED IN: 22 plant structures; EXPRESSED
           DURING: 13 growth stages; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT5G25265.1);
           Has 228 Blast hits to 200 proteins in 21 species: Archae
           - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 213;
           Viruses - 0; Other Eukaryotes - 15 (source: NCBI BLink).
           | chr5:4338676-4340827 FORWARD LENGTH=358
          Length = 358

 Score =  486 bits (1251), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 235/358 (65%), Positives = 276/358 (77%), Gaps = 6/358 (1%)

Query: 7   MGRAKSXXXXXXXXGFFFATYNLVSMIMDHRAGNWVADGLESFDR------KMLGSASTN 60
           MG+A          GFF  TYNL+++I+ +R+G   +DG    D        +  + S+ 
Sbjct: 1   MGKASGLLLFLLGFGFFVVTYNLLTLIVHNRSGVSNSDGSPLLDPVVQMPLNIRKAKSSP 60

Query: 61  AKYHVALTATDAAYSQWQCRIMYYWYKKVKDMPGSNMGKFTRILHSGRTDQLMDEIPTFV 120
           A +HVALTATDA Y++WQCRIMYYWYK+ K +PGS+MG FTRILHSG +D LMDEIPTFV
Sbjct: 61  APFHVALTATDAPYNKWQCRIMYYWYKQKKALPGSDMGGFTRILHSGNSDNLMDEIPTFV 120

Query: 121 VDPLPEGLDRGYIVLNRPWAFVQWLEKADIEEEYILMAEPDHIFVNPLPNLASRTQPAGY 180
           VDPLP GLDRGY+VLNRPWAFVQWLE+A I+E+Y+LMAEPDH+FVNPLPNLA    PA +
Sbjct: 121 VDPLPPGLDRGYVVLNRPWAFVQWLERATIKEDYVLMAEPDHVFVNPLPNLAVGGFPAAF 180

Query: 181 PFFYIKPAENEKIIRKFYPKDKGPVTDVDPIGNSPVIIQKSLIEEIAPTWVNVSLRMKDD 240
           PFFYI P + E I+RK+YP + GPVT++DPIGNSPVII K  +E+IAPTW+NVSL MK+D
Sbjct: 181 PFFYITPEKYENIVRKYYPAEMGPVTNIDPIGNSPVIISKESLEKIAPTWMNVSLTMKND 240

Query: 241 PETDKAFGWVLEMYAYAVASALHGVKHILRKDFMLQPPWDRHVGKTFIIHYTYGCDYNLK 300
           PETDKAFGWVLEMY YA+ASA+HGV+HILRKDFMLQPPWD      FIIHYTYGCDYN+K
Sbjct: 241 PETDKAFGWVLEMYGYAIASAIHGVRHILRKDFMLQPPWDLSTKGKFIIHYTYGCDYNMK 300

Query: 301 GELTYGKIGEWRFDKRSYLMXXXXXXXXXXXXXXXESVVRLVKMVNEATANIPEWDSL 358
           GELTYGKIGEWRFDKRS+L                ESVV LVKMVNEATA IP WD+L
Sbjct: 301 GELTYGKIGEWRFDKRSHLRGPPPRNMSLPPPGVPESVVTLVKMVNEATATIPNWDTL 358


>AT5G13500.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: endomembrane
           system; EXPRESSED IN: 22 plant structures; EXPRESSED
           DURING: 13 growth stages; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT5G25265.1);
           Has 1807 Blast hits to 1807 proteins in 277 species:
           Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347;
           Plants - 385; Viruses - 0; Other Eukaryotes - 339
           (source: NCBI BLink). | chr5:4338676-4340827 FORWARD
           LENGTH=358
          Length = 358

 Score =  486 bits (1251), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 235/358 (65%), Positives = 276/358 (77%), Gaps = 6/358 (1%)

Query: 7   MGRAKSXXXXXXXXGFFFATYNLVSMIMDHRAGNWVADGLESFDR------KMLGSASTN 60
           MG+A          GFF  TYNL+++I+ +R+G   +DG    D        +  + S+ 
Sbjct: 1   MGKASGLLLFLLGFGFFVVTYNLLTLIVHNRSGVSNSDGSPLLDPVVQMPLNIRKAKSSP 60

Query: 61  AKYHVALTATDAAYSQWQCRIMYYWYKKVKDMPGSNMGKFTRILHSGRTDQLMDEIPTFV 120
           A +HVALTATDA Y++WQCRIMYYWYK+ K +PGS+MG FTRILHSG +D LMDEIPTFV
Sbjct: 61  APFHVALTATDAPYNKWQCRIMYYWYKQKKALPGSDMGGFTRILHSGNSDNLMDEIPTFV 120

Query: 121 VDPLPEGLDRGYIVLNRPWAFVQWLEKADIEEEYILMAEPDHIFVNPLPNLASRTQPAGY 180
           VDPLP GLDRGY+VLNRPWAFVQWLE+A I+E+Y+LMAEPDH+FVNPLPNLA    PA +
Sbjct: 121 VDPLPPGLDRGYVVLNRPWAFVQWLERATIKEDYVLMAEPDHVFVNPLPNLAVGGFPAAF 180

Query: 181 PFFYIKPAENEKIIRKFYPKDKGPVTDVDPIGNSPVIIQKSLIEEIAPTWVNVSLRMKDD 240
           PFFYI P + E I+RK+YP + GPVT++DPIGNSPVII K  +E+IAPTW+NVSL MK+D
Sbjct: 181 PFFYITPEKYENIVRKYYPAEMGPVTNIDPIGNSPVIISKESLEKIAPTWMNVSLTMKND 240

Query: 241 PETDKAFGWVLEMYAYAVASALHGVKHILRKDFMLQPPWDRHVGKTFIIHYTYGCDYNLK 300
           PETDKAFGWVLEMY YA+ASA+HGV+HILRKDFMLQPPWD      FIIHYTYGCDYN+K
Sbjct: 241 PETDKAFGWVLEMYGYAIASAIHGVRHILRKDFMLQPPWDLSTKGKFIIHYTYGCDYNMK 300

Query: 301 GELTYGKIGEWRFDKRSYLMXXXXXXXXXXXXXXXESVVRLVKMVNEATANIPEWDSL 358
           GELTYGKIGEWRFDKRS+L                ESVV LVKMVNEATA IP WD+L
Sbjct: 301 GELTYGKIGEWRFDKRSHLRGPPPRNMSLPPPGVPESVVTLVKMVNEATATIPNWDTL 358


>AT2G25260.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: endomembrane
           system; BEST Arabidopsis thaliana protein match is:
           unknown protein (TAIR:AT5G25265.1); Has 35333 Blast hits
           to 34131 proteins in 2444 species: Archae - 798;
           Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants -
           531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI
           BLink). | chr2:10755617-10757766 REVERSE LENGTH=358
          Length = 358

 Score =  436 bits (1121), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 198/302 (65%), Positives = 240/302 (79%)

Query: 56  SASTNAKYHVALTATDAAYSQWQCRIMYYWYKKVKDMPGSNMGKFTRILHSGRTDQLMDE 115
           S  T   +H A+TATD+ YS WQCR+MYYWY + +D PGS+MG +TRILHSGR D LMDE
Sbjct: 57  SKKTKRLFHTAVTATDSVYSTWQCRVMYYWYNRFRDEPGSDMGGYTRILHSGRPDGLMDE 116

Query: 116 IPTFVVDPLPEGLDRGYIVLNRPWAFVQWLEKADIEEEYILMAEPDHIFVNPLPNLASRT 175
           IPTFV DPLP G+D+GY+VLNRPWAFVQWL++A IEE+YILMAEPDHI V P+PNLA   
Sbjct: 117 IPTFVADPLPSGVDKGYVVLNRPWAFVQWLQQAHIEEDYILMAEPDHIIVKPIPNLARGN 176

Query: 176 QPAGYPFFYIKPAENEKIIRKFYPKDKGPVTDVDPIGNSPVIIQKSLIEEIAPTWVNVSL 235
             A +PFFYI+P + E ++RKF+PK+ GP++ +DPIGNSPVI+ K+ + +IAPTW+NVSL
Sbjct: 177 LAAAFPFFYIEPKKYESVLRKFFPKENGPISRIDPIGNSPVIVTKNALMKIAPTWMNVSL 236

Query: 236 RMKDDPETDKAFGWVLEMYAYAVASALHGVKHILRKDFMLQPPWDRHVGKTFIIHYTYGC 295
            MK+DP+TDKAFGWVLEMYAYAV+SALHGV +IL KDFM+QPPWD    KTFIIHYTYGC
Sbjct: 237 AMKNDPQTDKAFGWVLEMYAYAVSSALHGVSNILHKDFMIQPPWDTETKKTFIIHYTYGC 296

Query: 296 DYNLKGELTYGKIGEWRFDKRSYLMXXXXXXXXXXXXXXXESVVRLVKMVNEATANIPEW 355
           D+++KG++  GKIGEWRFDKRSY                 ESVV LV M+NEATANIP W
Sbjct: 297 DFDMKGKMMVGKIGEWRFDKRSYGDKPPPRNLTLPPRGVPESVVTLVTMINEATANIPNW 356

Query: 356 DS 357
           +S
Sbjct: 357 ES 358


>AT5G25265.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: plasma membrane,
           membrane; EXPRESSED IN: cultured cell, leaf; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT2G25260.1); Has 30201 Blast hits to 17322
           proteins in 780 species: Archae - 12; Bacteria - 1396;
           Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
           0; Other Eukaryotes - 2996 (source: NCBI BLink). |
           chr5:8754794-8756855 REVERSE LENGTH=366
          Length = 366

 Score =  430 bits (1105), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 199/309 (64%), Positives = 244/309 (78%), Gaps = 6/309 (1%)

Query: 55  GSASTNAK----YHVALTATDAAYSQWQCRIMYYWYKKVKDM--PGSNMGKFTRILHSGR 108
           GS + + K    +H A+TA+D+ Y+ WQCR+MYYW+KK++    PGS MG FTRILHSG+
Sbjct: 58  GSRNNDGKRIRLFHTAVTASDSVYNTWQCRVMYYWFKKIQASAGPGSEMGGFTRILHSGK 117

Query: 109 TDQLMDEIPTFVVDPLPEGLDRGYIVLNRPWAFVQWLEKADIEEEYILMAEPDHIFVNPL 168
            DQ MDEIPTFV  PLP G+D+GY+VLNRPWAFVQWL++ DI+E+YILM+EPDHI V P+
Sbjct: 118 PDQYMDEIPTFVAQPLPSGMDQGYVVLNRPWAFVQWLQQTDIKEDYILMSEPDHIIVKPI 177

Query: 169 PNLASRTQPAGYPFFYIKPAENEKIIRKFYPKDKGPVTDVDPIGNSPVIIQKSLIEEIAP 228
           PNLA     A +PFFYI+P + EK++RK+YP+ +GPVT++DPIGNSPVI+ K  +++IAP
Sbjct: 178 PNLAKDGLGAAFPFFYIEPKKYEKVLRKYYPEVRGPVTNIDPIGNSPVIVGKDALKKIAP 237

Query: 229 TWVNVSLRMKDDPETDKAFGWVLEMYAYAVASALHGVKHILRKDFMLQPPWDRHVGKTFI 288
           TW+NVSL MK DPE DKAFGWVLEMYAYAV+SALHGV +IL KDFM+QPPWD  VG  +I
Sbjct: 238 TWMNVSLAMKKDPEADKAFGWVLEMYAYAVSSALHGVSNILHKDFMIQPPWDIEVGDKYI 297

Query: 289 IHYTYGCDYNLKGELTYGKIGEWRFDKRSYLMXXXXXXXXXXXXXXXESVVRLVKMVNEA 348
           IHYTYGCDY++KG+LTYGKIGEWRFDKRSY                 +SVV LVKM+NEA
Sbjct: 298 IHYTYGCDYDMKGKLTYGKIGEWRFDKRSYDSKPPPRNLTMPPPGVSQSVVTLVKMINEA 357

Query: 349 TANIPEWDS 357
           TANIP W S
Sbjct: 358 TANIPNWGS 366


>AT3G01720.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: endomembrane
           system; EXPRESSED IN: 24 plant structures; EXPRESSED
           DURING: 13 growth stages; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT5G25265.1);
           Has 374 Blast hits to 211 proteins in 23 species: Archae
           - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 316;
           Viruses - 0; Other Eukaryotes - 58 (source: NCBI BLink).
           | chr3:262412-265608 REVERSE LENGTH=802
          Length = 802

 Score = 68.9 bits (167), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 63/280 (22%), Positives = 119/280 (42%), Gaps = 42/280 (15%)

Query: 58  STNAKYHVALTATDAAYSQWQCRIMYYWYKKVKDMPGSNMGKFTRILHSGRTDQLM---- 113
            T  K H   +     Y  WQ     + +++         G  TR+L    TD+ +    
Sbjct: 389 GTYPKIHTLFSTECTTYFDWQTVGFMHSFRQ-----SGQPGNITRLLSC--TDEALKNYK 441

Query: 114 --DEIPTFVVDPLPEGLDRG--YIVLNRPWAFVQWLEKADIEEEYILMAEPDHIF---VN 166
             D  PT  V  +      G  Y  +N+P A V WL   +I+ EY+++ + D I    + 
Sbjct: 442 GHDLAPTHYVPSMSRHPLTGDWYPAINKPAAVVHWLHHTNIDAEYVVILDADMILRGPIT 501

Query: 167 PLPNLASRTQPAGYPFFYIKPAENEKIIRKFYPKDKGPVTDVDPIGNSPVIIQKSLIEEI 226
           P    A+R +P   P+ Y+   +N+  + + + ++      V  +    +I+    + + 
Sbjct: 502 PWEFKAARGRPVSTPYDYLIGCDND--LARLHTRNPEACDKVGGV----IIMHIEDLRKF 555

Query: 227 APTWVNVSLRMKDDPE------TDKAF--GWVLEMYAYAVASALHGVKHILRKDFMLQPP 278
           A  W+  +  ++ D E      T   +  GW+ EMY Y+  +A   ++H + K+ M+ P 
Sbjct: 556 AMYWLLKTQEVRADKEHYGKELTGDIYESGWISEMYGYSFGAAELNLRHSINKEIMIYPG 615

Query: 279 WDRHVGKTFIIHYTYGCDYNLKGELTYGKIGEWRFDKRSY 318
           +    G  + + + YG ++         K+G W FDK ++
Sbjct: 616 YVPEPGADYRV-FHYGLEF---------KVGNWSFDKANW 645