Miyakogusa Predicted Gene

Lj0g3v0103069.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj0g3v0103069.1 tr|F2DWA7|F2DWA7_HORVD Predicted protein
OS=Hordeum vulgare var. distichum PE=2 SV=1,32.5,8e-19,seg,NULL;
DUF581,Protein of unknown function DUF581,CUFF.5823.1
         (285 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT3G22550.1 | Symbols:  | Protein of unknown function (DUF581) |...   142   2e-34
AT3G63210.1 | Symbols: MARD1 | Protein of unknown function (DUF5...   137   9e-33
AT5G11460.1 | Symbols:  | Protein of unknown function (DUF581) |...   102   3e-22
AT2G25690.2 | Symbols:  | Protein of unknown function (DUF581) |...    77   1e-14
AT2G25690.1 | Symbols:  | Protein of unknown function (DUF581) |...    77   1e-14
AT1G53903.1 | Symbols:  | Protein of unknown function (DUF581) |...    61   1e-09
AT1G53885.1 | Symbols:  | Protein of unknown function (DUF581) |...    61   1e-09
AT4G17670.1 | Symbols:  | Protein of unknown function (DUF581) |...    60   2e-09
AT5G20700.1 | Symbols:  | Protein of unknown function (DUF581) |...    59   3e-09
AT2G44670.1 | Symbols:  | Protein of unknown function (DUF581) |...    58   8e-09
AT1G78020.1 | Symbols:  | Protein of unknown function (DUF581) |...    57   1e-08
AT5G49120.1 | Symbols:  | Protein of unknown function (DUF581) |...    56   3e-08
AT5G47060.1 | Symbols:  | Protein of unknown function (DUF581) |...    55   6e-08
AT1G22160.1 | Symbols:  | Protein of unknown function (DUF581) |...    54   2e-07
AT1G79970.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    51   8e-07
AT1G79970.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    51   9e-07
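
The summary table above can be read programmatically. The following is a minimal sketch, assuming each hit line ends with two whitespace-separated numeric fields (bit score, then E-value) and begins with the subject identifier, as it does in this report. The function name `parse_hit_line` is hypothetical and not part of any BLAST tool.

```python
def parse_hit_line(line):
    """Return (subject_id, bit_score, e_value), or None for non-hit lines."""
    # Split off the last two fields from the right; the description on the
    # left may itself contain spaces and '|' characters.
    parts = line.rsplit(None, 2)
    if len(parts) != 3:
        return None
    description, score, evalue = parts
    try:
        return description.split()[0], float(score), float(evalue)
    except ValueError:
        # Header lines such as "... (bits) Value" end in non-numeric fields.
        return None


sample = [
    "AT3G22550.1 | Symbols:  | Protein of unknown function (DUF581) |...   142   2e-34",
    "AT3G63210.1 | Symbols: MARD1 | Protein of unknown function (DUF5...   137   9e-33",
    "AT1G79970.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    51   8e-07",
]
hits = [h for h in (parse_hit_line(s) for s in sample) if h is not None]
for subject_id, bits, evalue in hits:
    print(subject_id, bits, evalue)
```

Splitting from the right avoids having to model the truncated description column. E-values printed without a leading digit (a form some BLAST versions emit for very small values) would not convert with `float` and are skipped by this sketch.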

>AT3G22550.1 | Symbols:  | Protein of unknown function (DUF581) |
           chr3:7991827-7992805 REVERSE LENGTH=267
          Length = 267

 Score =  142 bits (359), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 107/295 (36%), Positives = 150/295 (50%), Gaps = 51/295 (17%)

Query: 2   MLRNRSRPVTKPSLMGDHTSSQQSPNKNYVRTTP--SLFGS-QKLRDFTMKCLSGGAEAL 58
           ML+ RSR  +K +LM +   SQ   N+   +TTP   LF +    + FT        +A+
Sbjct: 1   MLKKRSR--SKQALMAETNQSQ---NQKQSKTTPFPRLFTAFSSFKSFTEN------DAV 49

Query: 59  RSPTSILDTR---ALLSPHGSPISPAITSQRVHSKNTYSWDKVDSKGIGLALVGELKXXX 115
            SPTSILDT+    L +P GS            ++   +  K++ K IGLA+V  L    
Sbjct: 50  ASPTSILDTKPFSVLKNPFGS--------DNPKTQEPETRLKLEPKRIGLAIVDSLIQDE 101

Query: 116 XXXXAIHSDPHKPNKGKVLFGTKFKIKIPSLLPNSPFESKTCANADFGAKAKDSENLGTY 175
                       P  G +LFG++ +I++P    +SP  S     +DFG K ++S+     
Sbjct: 102 TPEPG-------PRSGTILFGSQLRIRVP----DSPISS-----SDFGIKTRNSQPETKK 145

Query: 176 RKDSDSLQAVPAATGVMRLSEMELSEEYTCVISHGVNPRTTHIFDNCVVES-----YF-- 228
                 L +    +G    S+MELSE+YTCV  HG NPRT HIFDNC+VES     +F  
Sbjct: 146 PGSESGLGSPRIISGYFPASDMELSEDYTCVTCHGPNPRTIHIFDNCIVESQPGVVFFRS 205

Query: 229 SLPNNQHS---AASVHFLNFCYTCKKHLEQTKDIFIYRGEKAFCSQECRHQEMVL 280
           S P N+     +    FL+ C  CKK L    DIF+YRG++AFCS ECR  EM++
Sbjct: 206 SDPVNESDSDYSPPDSFLSCCCNCKKSLGPRDDIFMYRGDRAFCSSECRSIEMMM 260


>AT3G63210.1 | Symbols: MARD1 | Protein of unknown function (DUF581)
           | chr3:23354019-23354906 REVERSE LENGTH=263
          Length = 263

 Score =  137 bits (345), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 115/303 (37%), Positives = 152/303 (50%), Gaps = 61/303 (20%)

Query: 2   MLRNRSRPVTKPS-----LMGDHTSSQQSPNKNYVRTTPSLFGSQKLRDFTMKCLSGGAE 56
           MLRN+ R           LM D       P  N    +PSLF S K R FT K +    +
Sbjct: 1   MLRNKPRAAVTTKKQTSLLMADQPPP---PKPNTCHCSPSLFSSPKFRFFTSKMMMTPFD 57

Query: 57  A---LRSPTSILDTRALL--SPHGSPIS---PAITS-QRVHSKNTYSWDKVDSKGIGLA- 106
           +   L SPTSIL+    +  S +  P+S   P I + QR HS + +          GLA 
Sbjct: 58  SDFSLVSPTSILEANPSIFSSKNPKPVSYFEPTIPNPQRFHSPDVF----------GLAD 107

Query: 107 LVGELKXXXXXXXAIHSDPHKPNKGKVLFGTKFKIKIPSLLPNSPFESKTCANADFGAKA 166
           LV +           HS   KP    VLFG+K +++IPS             +ADFG K 
Sbjct: 108 LVKDGDSNRD-----HS--RKPVNKMVLFGSKLRVQIPS-------------SADFGTKT 147

Query: 167 KDSENLGTYRKDSDSLQAVPAATGVMRLSEMELSEEYTCVISHGVNPRTTHIFDNCV-VE 225
                    R     L      T V+ +SE++ +E+YT VISHG NP  THIFDN V VE
Sbjct: 148 G-------IRYPPCQLSPC-VQTKVLAVSEIDQTEDYTRVISHGPNPTITHIFDNSVFVE 199

Query: 226 SY-FSLPNNQ---HSAASVHFLNFCYTCKKHLEQTKDIFIYRGEKAFCSQECRHQEMVLD 281
           +   S+P  Q    + ++  FL+ C+TCKK+L+Q +DI+IYRGEK FCS ECR+QEM+LD
Sbjct: 200 ATPCSVPLPQPAMETKSTESFLSRCFTCKKNLDQKQDIYIYRGEKGFCSSECRYQEMLLD 259

Query: 282 GAE 284
             E
Sbjct: 260 QME 262


>AT5G11460.1 | Symbols:  | Protein of unknown function (DUF581) |
           chr5:3657064-3658388 REVERSE LENGTH=344
          Length = 344

 Score =  102 bits (254), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 95/314 (30%), Positives = 137/314 (43%), Gaps = 58/314 (18%)

Query: 16  MGDHTSSQQSPNKNYVRTTPSL--FGSQKLRD--FTMKCLSGGAEALRSPTSILDTRALL 71
           M  H++ Q +   +Y  T P L    S KL    F  KC S   E+  SPTS LD R L 
Sbjct: 1   MSQHSNYQMTTASDYYSTKPVLSAIRSHKLISSVFEGKCPSD-YESAWSPTSPLDFR-LF 58

Query: 72  SPHGSPISPAITSQRVHSKNTYSWDKVDSKGIGLALVGELKXXXXXXXAIHSDPHKPNKG 131
           S  G+P + A +S+ +      SWD   S  +GL++V  L        +       P+  
Sbjct: 59  STLGNPFA-ASSSRSIWRGKQRSWD---SGKVGLSIVHSLVDDHHTDSSATIVLPSPDSK 114

Query: 132 KVLFGTKF--------------KIKIP-SLLPNSPFES----------KTCANADFGAKA 166
            ++FG+                K  +P  ++PN+ FE           +   + D  A  
Sbjct: 115 NIIFGSLMRSGQKPHLLSQPFTKALMPKDVIPNAVFEIGHDVIDVLELRKSGSVD-AAYC 173

Query: 167 KDSENLGTYRKDSDSLQAVPAATGVMRLSEMELSEEYTCVISHGVNPRTTHIFDNCVVES 226
             +EN           +  P +      S+ME+SE+YTCVISHG NP+TTH + + V+ES
Sbjct: 174 SGAENFSVNNNACQVTKQDPGSLNGGTESDMEISEDYTCVISHGPNPKTTHFYGDQVMES 233

Query: 227 Y-------FSLPNNQHSAASV---------------HFLNFCYTCKKHLEQTKDIFIYRG 264
                       N + S  +V                FL+FCY C K L   +DI++Y G
Sbjct: 234 VEREELKNRCCKNEKESIFAVAPLDLTTPVDVLPPKDFLSFCYGCSKKLGMGEDIYMYSG 293

Query: 265 EKAFCSQECRHQEM 278
            KAFCS ECR +E+
Sbjct: 294 YKAFCSSECRSKEI 307


>AT2G25690.2 | Symbols:  | Protein of unknown function (DUF581) |
           chr2:10940530-10941649 REVERSE LENGTH=324
          Length = 324

 Score = 77.4 bits (189), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 39/89 (43%), Positives = 58/89 (65%), Gaps = 9/89 (10%)

Query: 201 EEYTCVISHGVNPRTTHIFDNCVVESYFSL----PNNQHSAASV----HFLNFCYTCKKH 252
           E+YTC+I+HG NP+TTHI+ + V+E + +      +N+    SV    +FL  C  C K 
Sbjct: 218 EDYTCIIAHGPNPKTTHIYGDRVLECHKNELKGDEDNKEKFGSVFPSDNFLGICNFCNKK 277

Query: 253 LEQTKDIFIYRGEKAFCSQECRHQEMVLD 281
           L    DI++YR EK+FCS+ECR +EM++D
Sbjct: 278 LGGGDDIYMYR-EKSFCSEECRSEEMMID 305


>AT2G25690.1 | Symbols:  | Protein of unknown function (DUF581) |
           chr2:10940530-10941649 REVERSE LENGTH=324
          Length = 324

 Score = 77.4 bits (189), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 39/89 (43%), Positives = 58/89 (65%), Gaps = 9/89 (10%)

Query: 201 EEYTCVISHGVNPRTTHIFDNCVVESYFSL----PNNQHSAASV----HFLNFCYTCKKH 252
           E+YTC+I+HG NP+TTHI+ + V+E + +      +N+    SV    +FL  C  C K 
Sbjct: 218 EDYTCIIAHGPNPKTTHIYGDRVLECHKNELKGDEDNKEKFGSVFPSDNFLGICNFCNKK 277

Query: 253 LEQTKDIFIYRGEKAFCSQECRHQEMVLD 281
           L    DI++YR EK+FCS+ECR +EM++D
Sbjct: 278 LGGGDDIYMYR-EKSFCSEECRSEEMMID 305


>AT1G53903.1 | Symbols:  | Protein of unknown function (DUF581) |
           chr1:20132363-20132842 FORWARD LENGTH=126
          Length = 126

 Score = 60.8 bits (146), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 24/63 (38%), Positives = 40/63 (63%)

Query: 219 FDNCVVESYFSLPNNQHSAASVHFLNFCYTCKKHLEQTKDIFIYRGEKAFCSQECRHQEM 278
            +N V++S   L  +  + + + FL  C+ C K L Q KD+++YRG+  FCS+ECR  +M
Sbjct: 19  LNNIVIKSSLRLNRSNPNISELCFLKTCHLCNKQLHQDKDVYMYRGDLGFCSRECRESQM 78

Query: 279 VLD 281
           ++D
Sbjct: 79  LID 81


>AT1G53885.1 | Symbols:  | Protein of unknown function (DUF581) |
           chr1:20119798-20120277 FORWARD LENGTH=126
          Length = 126

 Score = 60.8 bits (146), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 24/63 (38%), Positives = 40/63 (63%)

Query: 219 FDNCVVESYFSLPNNQHSAASVHFLNFCYTCKKHLEQTKDIFIYRGEKAFCSQECRHQEM 278
            +N V++S   L  +  + + + FL  C+ C K L Q KD+++YRG+  FCS+ECR  +M
Sbjct: 19  LNNIVIKSSLRLNRSNPNISELCFLKTCHLCNKQLHQDKDVYMYRGDLGFCSRECRESQM 78

Query: 279 VLD 281
           ++D
Sbjct: 79  LID 81


>AT4G17670.1 | Symbols:  | Protein of unknown function (DUF581) |
           chr4:9833948-9834663 REVERSE LENGTH=159
          Length = 159

 Score = 60.1 bits (144), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 24/56 (42%), Positives = 35/56 (62%)

Query: 228 FSLPNNQHSAASVHFLNFCYTCKKHLEQTKDIFIYRGEKAFCSQECRHQEMVLDGA 283
           F   N+ +     HFL+ C+ CKK L   +DIF+YRG+  FCS+ECR +++  D A
Sbjct: 62  FRFDNSYYGYGQPHFLDSCFLCKKRLGDNRDIFMYRGDTPFCSEECREEQIERDEA 117


>AT5G20700.1 | Symbols:  | Protein of unknown function (DUF581) |
           chr5:7006178-7007003 REVERSE LENGTH=248
          Length = 248

 Score = 59.3 bits (142), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 59/216 (27%), Positives = 84/216 (38%), Gaps = 39/216 (18%)

Query: 72  SPHGSPISPAITSQRVHSKNTYSWDKVDSKGIGLALVGELKXXXXXXXAIHSDPHKPNKG 131
           SP    I P I SQR  SK  Y  +   S G+G+    E            S+P++P + 
Sbjct: 38  SPLDFKILPQI-SQRNSSKRFYDDNLGGSVGLGIVAALENSNTRRITSVCRSEPNQPGRS 96

Query: 132 K-VLFGTKFKIKIPSLLPNSPFESKTCANADFGAKAKDSENLGTYRKDSDSLQAV----- 185
             V F +                            + D E+   +  D +    V     
Sbjct: 97  DPVQFMSH-------------------------GGSTDGEDEEMFIMDEEDYTLVTCHHG 131

Query: 186 PAATGVMRLSEMELSEEYTCVISHGVNPRTTHIFDNCVVESYFSLPNNQHSAASVHFLNF 245
           P+ +   R+ +    + + C  S   + R   +F   VV+     P N      + FLN 
Sbjct: 132 PSGSCNTRVYD---KDGFECFSSKINDDRRERLF---VVDVVTESPENSPEFQGLGFLNS 185

Query: 246 CYTCKKHLEQTKDIFIYRGEKAFCSQECRHQEMVLD 281
           CY C+K L   +DIFIYRGEKAFCS ECR   +  D
Sbjct: 186 CYLCRKKL-HGQDIFIYRGEKAFCSTECRSSHIAND 220


>AT2G44670.1 | Symbols:  | Protein of unknown function (DUF581) |
           chr2:18425279-18425673 FORWARD LENGTH=93
          Length = 93

 Score = 57.8 bits (138), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 24/43 (55%), Positives = 30/43 (69%)

Query: 241 HFLNFCYTCKKHLEQTKDIFIYRGEKAFCSQECRHQEMVLDGA 283
           HFL  C  C+KHL    DIF+YRG+KAFCS ECR +++  D A
Sbjct: 15  HFLESCSLCRKHLGLNSDIFMYRGDKAFCSNECREEQIESDEA 57


>AT1G78020.1 | Symbols:  | Protein of unknown function (DUF581) |
           chr1:29338787-29339491 FORWARD LENGTH=162
          Length = 162

 Score = 57.0 bits (136), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 21/41 (51%), Positives = 29/41 (70%)

Query: 241 HFLNFCYTCKKHLEQTKDIFIYRGEKAFCSQECRHQEMVLD 281
           HFL  C  C++ L   +DI++YRG+KAFCS ECR ++M  D
Sbjct: 88  HFLRSCALCERLLVPGRDIYMYRGDKAFCSSECRQEQMAQD 128


>AT5G49120.1 | Symbols:  | Protein of unknown function (DUF581) |
           chr5:19908800-19909332 REVERSE LENGTH=150
          Length = 150

 Score = 55.8 bits (133), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 20/44 (45%), Positives = 33/44 (75%)

Query: 242 FLNFCYTCKKHLEQTKDIFIYRGEKAFCSQECRHQEMVLDGAEN 285
           FL  C+ C++ L   KDI++Y+G++AFCS ECR ++M++D  E+
Sbjct: 68  FLEHCFLCRRKLLPAKDIYMYKGDRAFCSVECRSKQMIMDEEES 111


>AT5G47060.1 | Symbols:  | Protein of unknown function (DUF581) |
           chr5:19116843-19117639 FORWARD LENGTH=177
          Length = 177

 Score = 54.7 bits (130), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 21/44 (47%), Positives = 32/44 (72%)

Query: 241 HFLNFCYTCKKHLEQTKDIFIYRGEKAFCSQECRHQEMVLDGAE 284
           HFL+ C+ CKK L   +DI++YRG+  FCS+ECR +++  D A+
Sbjct: 96  HFLDSCFLCKKPLGDNRDIYMYRGDTPFCSEECRQEQIERDEAK 139


>AT1G22160.1 | Symbols:  | Protein of unknown function (DUF581) |
           chr1:7823238-7823774 FORWARD LENGTH=147
          Length = 147

 Score = 53.5 bits (127), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 22/46 (47%), Positives = 31/46 (67%)

Query: 236 SAASVHFLNFCYTCKKHLEQTKDIFIYRGEKAFCSQECRHQEMVLD 281
           S  S  FL  C  CK+ L   +DI++YRG++AFCS ECR Q++ +D
Sbjct: 72  SDYSEDFLRSCSLCKRLLVHGRDIYMYRGDRAFCSLECRQQQITVD 117


>AT1G79970.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: chloroplast;
           EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13
           growth stages; BEST Arabidopsis thaliana protein match
           is: Protein of unknown function (DUF581)
           (TAIR:AT2G25690.2); Has 35333 Blast hits to 34131
           proteins in 2444 species: Archae - 798; Bacteria -
           22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
           - 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
           chr1:30082773-30083429 FORWARD LENGTH=218
          Length = 218

 Score = 51.2 bits (121), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 19/30 (63%), Positives = 27/30 (90%)

Query: 196 EMELSEEYTCVISHGVNPRTTHIFDNCVVE 225
           EM LSE+YTC+ISHG NP+TT+IF +C+++
Sbjct: 149 EMALSEDYTCIISHGPNPKTTYIFGDCILD 178


>AT1G79970.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: chloroplast;
           EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13
           growth stages; BEST Arabidopsis thaliana protein match
           is: Protein of unknown function (DUF581)
           (TAIR:AT2G25690.2); Has 30201 Blast hits to 17322
           proteins in 780 species: Archae - 12; Bacteria - 1396;
           Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
           0; Other Eukaryotes - 2996 (source: NCBI BLink). |
           chr1:30082773-30083592 FORWARD LENGTH=240
          Length = 240

 Score = 50.8 bits (120), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 19/30 (63%), Positives = 27/30 (90%)

Query: 196 EMELSEEYTCVISHGVNPRTTHIFDNCVVE 225
           EM LSE+YTC+ISHG NP+TT+IF +C+++
Sbjct: 149 EMALSEDYTCIISHGPNPKTTYIFGDCILD 178
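
The "Identities / Positives / Gaps" lines in the alignments above give each count over the alignment length, followed by a percentage. In this report the percentages appear to be truncated rather than rounded (for example, 150/295 is about 50.8% but is printed as 50%). The sketch below extracts those fields and rechecks the percentages with integer division. The name `parse_stats` is hypothetical, and the regular expression assumes the exact field layout seen in this output.

```python
import re

# Matches e.g. "Identities = 107/295 (36%)"; the Gaps field is optional in
# ungapped alignments, so findall simply returns fewer entries there.
STATS = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_stats(line):
    """Map each field name to (count, alignment_length, reported_percent)."""
    return {
        name: (int(count), int(total), int(pct))
        for name, count, total, pct in STATS.findall(line)
    }


line = " Identities = 107/295 (36%), Positives = 150/295 (50%), Gaps = 51/295 (17%)"
stats = parse_stats(line)
for name, (count, total, pct) in stats.items():
    # Integer division reproduces the truncated percentage shown in the report.
    print(name, count, total, pct, 100 * count // total)
```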