Miyakogusa Predicted Gene

Lj0g3v0171149.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj0g3v0171149.1 Non Chatacterized Hit- tr|I1HNA4|I1HNA4_BRADI
Uncharacterized protein OS=Brachypodium distachyon
GN=,35,3e-18,SUBFAMILY NOT NAMED,NULL; FAMILY NOT
NAMED,NULL,CUFF.10750.1
         (544 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT3G03560.1 | Symbols:  | unknown protein; LOCATED IN: plasma me...   571   e-163
AT5G23490.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   270   3e-72
AT5G08440.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   265   5e-71
AT5G08440.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   192   5e-49
AT5G23510.2 | Symbols:  | unknown protein; LOCATED IN: cellular_...   187   1e-47
AT5G23510.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   182   7e-46

>AT3G03560.1 | Symbols:  | unknown protein; LOCATED IN: plasma
           membrane; EXPRESSED IN: 22 plant structures; EXPRESSED
           DURING: 13 growth stages; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT5G23490.1);
           Has 157 Blast hits to 146 proteins in 38 species: Archae
           - 3; Bacteria - 14; Metazoa - 8; Fungi - 0; Plants -
           120; Viruses - 0; Other Eukaryotes - 12 (source: NCBI
           BLink). | chr3:853153-856486 REVERSE LENGTH=521
          Length = 521

 Score =  571 bits (1471), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 295/536 (55%), Positives = 382/536 (71%), Gaps = 33/536 (6%)

Query: 21  SEILGRHNFETQLAQSNFKSNDALNHMQDQDTMELYSQARGQEEEILSLHEQIAIACMKE 80
           SE + RH  E     S    +     +QD + M LY++ R QEEEI SL E+IA AC+K+
Sbjct: 7   SESIKRHEIEKDTIASRKLEDTNTKLIQDPEEMALYAKVRSQEEEIHSLQERIAAACLKD 66

Query: 81  MQLLNEKCKLERQFSELRMAVDDKQNEAITSASNELAQRKGYLEENLKLAHDLKVAEDER 140
           MQLLNEK  LER+ ++LR+A+D+KQNE++TSA NELA+RKG LEENLKLAHDLKV EDER
Sbjct: 67  MQLLNEKYGLERKCADLRVAIDEKQNESVTSALNELARRKGDLEENLKLAHDLKVTEDER 126

Query: 141 YIFMSSMLGLLAEYGLWPRVMNASSISNYVKHLHDQLQWRIRNSHDRIGELTAVLETHAD 200
           YIFM+S+LGLLAEYG+WPRV NA++IS+ +KHLHDQLQW+ +  +DRI EL++++E    
Sbjct: 127 YIFMTSLLGLLAEYGVWPRVANATAISSGIKHLHDQLQWKTKACNDRIRELSSIVENQ-- 184

Query: 201 NGNPVVESPGSGNLTSHIHN--EFMFQHNYPQ----QNLTGNEQIPQPMSNITGYMNPVL 254
                   PG+  ++   H+      Q +Y       +   NEQ+  PM N+T   NP  
Sbjct: 185 --------PGTDFISKDNHDPRNSKTQASYGSTDRGNDYQTNEQLLPPMENVT--RNP-- 232

Query: 255 NGGYMNPIIDSDINRTFQRLNQEISKADREVSSSFHHDS----IDKIGAHE--RTRERNF 308
              Y N + D++      R N +I    + +      ++    +  +   E  + RE   
Sbjct: 233 ---YHNIMQDTE----SLRFNNQIGGGSQGIFPQPKRENFGYPLSSVAGKEMIQEREEKA 285

Query: 309 VNGKLYQPPPEHDETASSVSEDGPGIENFQICGEAIPGEKLLGCGYPVRGTSLCMFQWVR 368
            N  ++     ++E AS V E+GPGI+ FQI G+AIPGEK+LGCG+PVRGT+LCMFQWVR
Sbjct: 286 ENSSMFDAYNGNEEFASHVYEEGPGIDGFQIIGDAIPGEKVLGCGFPVRGTTLCMFQWVR 345

Query: 369 HLQDGTRQYIEGATNPEYVVTADDVDKLIAVECIPMDDKGHQGELVRLFANDQNKIKCDP 428
           HL+DGTRQYIEGAT+PEY+VTADDVDKLIAVECIPMDD+G QGELVRLFANDQNKI+CD 
Sbjct: 346 HLEDGTRQYIEGATHPEYIVTADDVDKLIAVECIPMDDQGRQGELVRLFANDQNKIRCDT 405

Query: 429 EMQLEIDTNLAKGEATFSVLLLMDSSENWEQATLFLRRSGYQIKISGTEGPVVAEKFSKD 488
           EMQ EIDT +++G+A+F+V LLMDSSE+WE AT+ L+RS YQIK + TE  V++EK+SK+
Sbjct: 406 EMQTEIDTYISRGQASFNVQLLMDSSESWEPATVVLKRSSYQIKTNTTEAVVISEKYSKE 465

Query: 489 LSIKVPCGLSTQFVLTCSNGSSHPLSTYSVRMRDTLVLTMRIFQSKALDDKRKGRA 544
           L I+VP G STQFVL   +GSSHP+ST +VRMRDTLVLTMR+ QSKALD++RKGR 
Sbjct: 466 LQIRVPSGESTQFVLISYDGSSHPISTLNVRMRDTLVLTMRMLQSKALDERRKGRV 521


>AT5G23490.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT5G08440.1); Has 202 Blast hits to 197 proteins
           in 48 species: Archae - 0; Bacteria - 13; Metazoa - 25;
           Fungi - 9; Plants - 109; Viruses - 0; Other Eukaryotes -
           46 (source: NCBI BLink). | chr5:7919831-7926499 FORWARD
           LENGTH=729
          Length = 729

 Score =  270 bits (689), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 173/539 (32%), Positives = 281/539 (52%), Gaps = 51/539 (9%)

Query: 44  LNHMQDQDTMELYSQARGQEEEILSLHEQIAIACMKEMQLLNEKCKLERQFSELRMAVDD 103
           +NH  +    +L  + + QE+EI  L   +    +KE Q+ NEK  LE++ + +R+A D 
Sbjct: 195 VNHSGNAWKQDLIHKVQEQEQEISQLRRYLTDCSVKEAQIRNEKYVLEKRIAYMRLAFDQ 254

Query: 104 KQNEAITSASNELAQRKGYLEENLKLAHDLKVAEDERYIFMSSMLGLLAEYGLWPRVMNA 163
           +Q + + ++S  L+ R+  +EEN++L + L+  + ER  F+S +L LL+EY L P+V +A
Sbjct: 255 QQQDLVDASSKALSYRQEIIEENIRLTYALQATQQERSTFVSYLLPLLSEYSLQPQVSDA 314

Query: 164 SSISNYVKHLHDQLQWRIRNSHDRIGELTAVL-----ETHADNGNPVVESPGSGNLTSHI 218
            SI + VK L   LQ ++  +  ++ E    L     + +  N +P+  S  +G   +H 
Sbjct: 315 QSIVSNVKVLFKHLQEKLLLTETKLKESEYQLAPWQSDVNHSNDSPLAPSRSAGVALTHS 374

Query: 219 HNEFMFQHNYP---------QQNLTGNEQIPQPMSNITGYMNPVLNGG------YMNPII 263
             + M+ H++          QQ+  G+  +     + +   +P+ N        ++ P  
Sbjct: 375 TKDSMYSHDHTAIDWNLERQQQDEPGSSAVRNYHLDDSSTFSPLENSQSAAFEMHVQPGT 434

Query: 264 DSDINRTFQRLNQEISKADREVSSSFHHDSIDKIGAHERTRERNFVNGKLYQPPPE---- 319
             D +   +++++   K         H   ++ I        +N   G  +  P      
Sbjct: 435 SVDESPAHKKVDETPPK---------HVQFLEPISKTVVDDAQNPSYGSAFDDPSSSNSP 485

Query: 320 -----HDETASSVSEDG-----PGIENFQICGEAIPGEKLLGCGYPVRGTSLCMFQWVRH 369
                 +E +SS SE G     PGIE+ QI GE  PG +L  CGY + GT+ C F+WV H
Sbjct: 486 LLSPVFEEPSSSFSEGGDDDPLPGIEDLQISGEPYPGHELQACGYSINGTTSCNFEWVCH 545

Query: 370 LQDGTRQYIEGATNPEYVVTADDVDKLIAVECIPMDDKGHQGELVRLFANDQNKIKCDPE 429
           L+DG+  YI+GA  P Y+VTADDVD  +A+E  P+DD+  +GELV++FAND  KI C P+
Sbjct: 546 LEDGSVNYIDGAKQPNYLVTADDVDLYLAIEVQPLDDRNRKGELVKVFANDNRKIACHPD 605

Query: 430 MQLEIDTNLAKGEATFSVLLLMDSSENWEQATLFLRRSGYQIKISGTEGPVVAEKFSKDL 489
           MQ  I+  L  G A++ V L +   + WE ATL ++R GY IK        +AEKFS   
Sbjct: 606 MQSNIEKTLHTGHASYKVSLAVGFVDIWEAATLSIKREGYSIKC--ISDLTIAEKFSAST 663

Query: 490 SIKVPCGLSTQFVLTCSNGSSHPL-----STYSVRMRDTLVLTMRIFQSKALDDKRKGR 543
           ++ +P G   + V+  S+GS H L     S   +  RD +VLT+R+F  +AL  ++KG+
Sbjct: 664 TVTIPFGQPAELVIIGSDGSEHSLRADNGSPDLIGSRDEIVLTLRLFIKRAL-QRKKGK 721


>AT5G08440.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: plasma membrane;
           EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 13
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT5G23490.1); Has 141 Blast
           hits to 139 proteins in 35 species: Archae - 0; Bacteria
           - 9; Metazoa - 21; Fungi - 6; Plants - 94; Viruses - 0;
           Other Eukaryotes - 11 (source: NCBI BLink). |
           chr5:2721037-2726970 FORWARD LENGTH=726
          Length = 726

 Score =  265 bits (678), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 175/512 (34%), Positives = 264/512 (51%), Gaps = 46/512 (8%)

Query: 54  ELYSQARGQEEEILSLHEQIAIACMKEMQLLNEKCKLERQFSELRMAVDDKQNEAITSAS 113
           EL  + + Q++EIL L + +A    KE+Q+ NEK  LE++ + +R A D +Q + + +AS
Sbjct: 218 ELIHKVQEQDQEILRLRKYLADYSTKEVQIRNEKYVLEKRIAHMRSAFDQQQQDLVDAAS 277

Query: 114 NELAQRKGYLEENLKLAHDLKVAEDERYIFMSSMLGLLAEYGLWPRVMNASSISNYVKHL 173
             L+ R+  +EEN++L + L+ AE ER +F+S +L LL+EY L P++ ++ SI + VK L
Sbjct: 278 KALSYRQEIIEENIRLTYALQAAEQERSLFVSILLPLLSEYSLHPQISDSQSIVSSVKVL 337

Query: 174 HDQLQWRIRNSHDRIGELTAVLETHADNGNPVVESPGSGNLTSHIHNEFMFQHNYPQQNL 233
              LQ ++  +  ++ E    L     + N    SP S      +   +     +  Q+ 
Sbjct: 338 FRHLQEKLNVTETKLKETEYQLAPWQSDVNHSNASPLSPYQPVGVGLRYSTDSEHHHQDR 397

Query: 234 TG-----NEQIPQPMSNITGYMNPVLNGGYMNPIIDSDI-----NRTFQRLNQEISKADR 283
            G     N  +  P S    +  PV       P ++ D      NR   R          
Sbjct: 398 RGGSAASNYHLDGPESRSPAFQMPV------QPALNQDESHGPNNRVQFR---------E 442

Query: 284 EVSSSFHHDSIDKIGAHERTRERNFVNGKLYQPPPEH--------DETASSVSEDG---- 331
            +S++F  D+   + A   T   N     +  P P +        +E +SS SE      
Sbjct: 443 PLSNTFMDDAYADVQADSNTTLENSTYVAVDDPSPSNYPILAPVLEEPSSSFSEAADDDP 502

Query: 332 -PGIENFQICGEAIPGEKLLGCGYPVRGTSLCMFQWVRHLQDGTRQYIEGATNPEYVVTA 390
            PGI + QI GE  PG +L   G+ + GT+ C F+WVRHL+DG+  YI+GA  P+Y+VTA
Sbjct: 503 LPGIADLQISGEPFPGRELQVSGHSINGTTKCNFEWVRHLEDGSVNYIDGAKRPDYLVTA 562

Query: 391 DDVDKLIAVECIPMDDKGHQGELVRLFANDQNKIKCDPEMQLEIDTNLAKGEATFSVLLL 450
           DDVD  +A+E  P+DDK  +GELVR+FAN+  KI C PEMQ  I+ +L  G A F V   
Sbjct: 563 DDVDLYLAIEVHPLDDKNRKGELVRVFANENCKITCHPEMQSHIEKSLYNGHALFKVSYS 622

Query: 451 MDSSENWEQATLFLRRSGYQIKISGTEGPVVAEKFSKDLSIKVPCGLSTQFVLTCSNGSS 510
           +   + WE ATL +++ GY IK   T  PV+ EKFS   +I +P      FV+  ++G  
Sbjct: 623 IGYLDIWEAATLSIKKEGYSIK--PTNDPVITEKFSSSTNIVIPFDQPADFVIIGTDGEE 680

Query: 511 HPL------STYSVRMRDTLVLTMRIFQSKAL 536
           H        +T     RDT+VLT+R+F  K L
Sbjct: 681 HLCRVVDNDATDLSCSRDTIVLTLRLFLKKTL 712


>AT5G08440.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; EXPRESSED IN: 21 plant
           structures; EXPRESSED DURING: 13 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT5G23490.1). | chr5:2721037-2726970 FORWARD
           LENGTH=772
          Length = 772

 Score =  192 bits (488), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 108/271 (39%), Positives = 151/271 (55%), Gaps = 21/271 (7%)

Query: 285 VSSSFHHDSIDKIGAHERTRERNFVNGKLYQPPPEH--------DETASSVSEDG----- 331
           +S++F  D+   + A   T   N     +  P P +        +E +SS SE       
Sbjct: 490 LSNTFMDDAYADVQADSNTTLENSTYVAVDDPSPSNYPILAPVLEEPSSSFSEAADDDPL 549

Query: 332 PGIENFQICGEAIPGEKLLGCGYPVRGTSLCMFQWVRHLQDGTRQYIEGATNPEYVVTAD 391
           PGI + QI GE  PG +L   G+ + GT+ C F+WVRHL+DG+  YI+GA  P+Y+VTAD
Sbjct: 550 PGIADLQISGEPFPGRELQVSGHSINGTTKCNFEWVRHLEDGSVNYIDGAKRPDYLVTAD 609

Query: 392 DVDKLIAVECIPMDDKGHQGELVRLFANDQNKIKCDPEMQLEIDTNLAKGEATFSVLLLM 451
           DVD  +A+E  P+DDK  +GELVR+FAN+  KI C PEMQ  I+ +L  G A F V   +
Sbjct: 610 DVDLYLAIEVHPLDDKNRKGELVRVFANENCKITCHPEMQSHIEKSLYNGHALFKVSYSI 669

Query: 452 DSSENWEQATLFLRRSGYQIKISGTEGPVVAEKFSKDLSIKVPCGLSTQFVLTCSNGSSH 511
              + WE ATL +++ GY IK   T  PV+ EKFS   +I +P      FV+  ++G  H
Sbjct: 670 GYLDIWEAATLSIKKEGYSIK--PTNDPVITEKFSSSTNIVIPFDQPADFVIIGTDGEEH 727

Query: 512 PL------STYSVRMRDTLVLTMRIFQSKAL 536
                   +T     RDT+VLT+R+F  K L
Sbjct: 728 LCRVVDNDATDLSCSRDTIVLTLRLFLKKTL 758



 Score = 89.4 bits (220), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 46/118 (38%), Positives = 78/118 (66%)

Query: 54  ELYSQARGQEEEILSLHEQIAIACMKEMQLLNEKCKLERQFSELRMAVDDKQNEAITSAS 113
           EL  + + Q++EIL L + +A    KE+Q+ NEK  LE++ + +R A D +Q + + +AS
Sbjct: 218 ELIHKVQEQDQEILRLRKYLADYSTKEVQIRNEKYVLEKRIAHMRSAFDQQQQDLVDAAS 277

Query: 114 NELAQRKGYLEENLKLAHDLKVAEDERYIFMSSMLGLLAEYGLWPRVMNASSISNYVK 171
             L+ R+  +EEN++L + L+ AE ER +F+S +L LL+EY L P++ ++ SI + VK
Sbjct: 278 KALSYRQEIIEENIRLTYALQAAEQERSLFVSILLPLLSEYSLHPQISDSQSIVSSVK 335


>AT5G23510.2 | Symbols:  | unknown protein; LOCATED IN:
           cellular_component unknown; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT5G23490.1);
           Has 30201 Blast hits to 17322 proteins in 780 species:
           Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi -
           3422; Plants - 5037; Viruses - 0; Other Eukaryotes -
           2996 (source: NCBI BLink). | chr5:7927055-7929117
           FORWARD LENGTH=306
          Length = 306

 Score =  187 bits (476), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 95/215 (44%), Positives = 128/215 (59%), Gaps = 6/215 (2%)

Query: 332 PGIENFQICGEAIPGEKLLGCGYPVRGTSLCMFQWVRHLQDGTRQYIEGATNPEYVVTAD 391
           P +EN QI GE  PG +L  CGY + GT+ C F+WV HL+DG+  YI+GA  P Y+VTAD
Sbjct: 84  PALENLQISGEPYPGHELQACGYSINGTTSCNFEWVCHLEDGSVNYIDGAKKPNYLVTAD 143

Query: 392 DVDKLIAVECIPMDDKGHQGELVRLFANDQNKIKCDPEMQLEIDTNLAKGEATFSVLLLM 451
           DV   +A+E  P+DD+  +GELV++FAND  KI C PEMQ  ID  L  G A++ V L +
Sbjct: 144 DVGLCLAIEVQPLDDRNRKGELVKVFANDNRKIACHPEMQSNIDKTLHTGHASYKVSLAI 203

Query: 452 DSSENWEQATLFLRRSGYQIKISGTEGPVVAEKFSKDLSIKVPCGLSTQFVLTCSNGSSH 511
                WE ATL + R GY IK +      + EKFS   ++K+P     + V+  S+GS H
Sbjct: 204 GFVHIWEAATLSIEREGYTIKCNNDL--TITEKFSASTAVKIPFEKPAELVIIGSDGSEH 261

Query: 512 PLSTYS----VRMRDTLVLTMRIFQSKALDDKRKG 542
            L   +    +  RD +VLT+R F   AL   +KG
Sbjct: 262 CLRVDNEWPDISSRDEIVLTLRSFIKTALQRGKKG 296


>AT5G23510.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT5G23490.1); Has 35333 Blast hits to 34131
           proteins in 2444 species: Archae - 798; Bacteria -
           22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
           - 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
           chr5:7927266-7929016 FORWARD LENGTH=271
          Length = 271

 Score =  182 bits (461), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 91/204 (44%), Positives = 123/204 (60%), Gaps = 6/204 (2%)

Query: 332 PGIENFQICGEAIPGEKLLGCGYPVRGTSLCMFQWVRHLQDGTRQYIEGATNPEYVVTAD 391
           P +EN QI GE  PG +L  CGY + GT+ C F+WV HL+DG+  YI+GA  P Y+VTAD
Sbjct: 45  PALENLQISGEPYPGHELQACGYSINGTTSCNFEWVCHLEDGSVNYIDGAKKPNYLVTAD 104

Query: 392 DVDKLIAVECIPMDDKGHQGELVRLFANDQNKIKCDPEMQLEIDTNLAKGEATFSVLLLM 451
           DV   +A+E  P+DD+  +GELV++FAND  KI C PEMQ  ID  L  G A++ V L +
Sbjct: 105 DVGLCLAIEVQPLDDRNRKGELVKVFANDNRKIACHPEMQSNIDKTLHTGHASYKVSLAI 164

Query: 452 DSSENWEQATLFLRRSGYQIKISGTEGPVVAEKFSKDLSIKVPCGLSTQFVLTCSNGSSH 511
                WE ATL + R GY IK +      + EKFS   ++K+P     + V+  S+GS H
Sbjct: 165 GFVHIWEAATLSIEREGYTIKCNNDL--TITEKFSASTAVKIPFEKPAELVIIGSDGSEH 222

Query: 512 PLSTYS----VRMRDTLVLTMRIF 531
            L   +    +  RD +VLT+R F
Sbjct: 223 CLRVDNEWPDISSRDEIVLTLRSF 246