Miyakogusa Predicted Gene

Lj0g3v0163869.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj0g3v0163869.1 Non Chatacterized Hit- tr|I3SKM7|I3SKM7_LOTJA
Uncharacterized protein OS=Lotus japonicus PE=2 SV=1,98.68,0,seg,NULL;
LEA_2,Late embryogenesis abundant protein, LEA-14,CUFF.10230.1
         (302 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G42860.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   263   1e-70
AT1G45688.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   258   3e-69
AT1G45688.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   181   5e-46
AT4G35170.1 | Symbols:  | Late embryogenesis abundant (LEA) hydr...   167   1e-41
AT2G41990.1 | Symbols:  | CONTAINS InterPro DOMAIN/s: Late embry...   135   4e-32
AT3G24600.1 | Symbols:  | Late embryogenesis abundant protein, g...   126   1e-29
AT3G08490.1 | Symbols:  | BEST Arabidopsis thaliana protein matc...    59   3e-09

>AT5G42860.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: plasma membrane;
           EXPRESSED IN: 19 plant structures; EXPRESSED DURING: 11
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT1G45688.1); Has 1807 Blast
           hits to 1807 proteins in 277 species: Archae - 0;
           Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385;
           Viruses - 0; Other Eukaryotes - 339 (source: NCBI
           BLink). | chr5:17183339-17184857 REVERSE LENGTH=320
          Length = 320

 Score =  263 bits (671), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 152/321 (47%), Positives = 190/321 (59%), Gaps = 26/321 (8%)

Query: 1   MHAKTDSEVTSIXXXXXXXXXXXXXLYFVQSPSRDSHDGEKTVTTSFHSTPVLXXXXXXX 60
           MHAKTDSEVTS+              YFVQSPSRDSHDGEKT T SFHSTPVL       
Sbjct: 1   MHAKTDSEVTSLSASSPTRSPRRPA-YFVQSPSRDSHDGEKTAT-SFHSTPVLTSPMGSP 58

Query: 61  XXXXXXXXXXXXXKKDNPPHHHSLKPWKQIDVIEEEGLLQGEDRDR-TLSRRCYXXXXXX 119
                         K N          KQ  +IEEEGLL   DR++  L RRCY      
Sbjct: 59  PHSHSSSSRFS---KINGSKRKGHAGEKQFAMIEEEGLLDDGDREQEALPRRCYVLAFIV 115

Query: 120 XXXXXXXXXXXXXWGASRPMKPKIFIKSIKFDHVQVQAGSDSTGVATDMISMNSTLKFTY 179
                        + A++P KPKI +KSI F+ ++VQAG D+ G+ TDMI+MN+TL+  Y
Sbjct: 116 GFSLLFAFFSLILYAAAKPQKPKISVKSITFEQLKVQAGQDAGGIGTDMITMNATLRMLY 175

Query: 180 RNTGTFFGVHVASTPLELSYSEIVIAAGNMKEFYQHRRSHRLVSVAVMGNKIPLYGSGAS 239
           RNTGTFFGVHV S+P++LS+S+I I +G++K+FYQ R+S R V V V+G+KIPLYGSG++
Sbjct: 176 RNTGTFFGVHVTSSPIDLSFSQITIGSGSIKKFYQSRKSQRTVVVNVLGDKIPLYGSGST 235

Query: 240 LSST--------------------TGMPTVPVPLNLNFVLRSRAYVLGKLVKPKYYKRIQ 279
           L                          P  PVP+ LNF +RSRAYVLGKLV+PK+YKRI 
Sbjct: 236 LVPPPPPAPIPKPKKKKGPIVIVEPPAPPAPVPMRLNFTVRSRAYVLGKLVQPKFYKRIV 295

Query: 280 CSITLDPKKLSAPIPLKHSCT 300
           C I  + KKLS  IP+ ++CT
Sbjct: 296 CLINFEHKKLSKHIPITNNCT 316


>AT1G45688.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: plasma membrane;
           EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT5G42860.1); Has 258 Blast
           hits to 242 proteins in 39 species: Archae - 0; Bacteria
           - 11; Metazoa - 10; Fungi - 14; Plants - 198; Viruses -
           17; Other Eukaryotes - 8 (source: NCBI BLink). |
           chr1:17191502-17192870 FORWARD LENGTH=342
          Length = 342

 Score =  258 bits (660), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 153/340 (45%), Positives = 194/340 (57%), Gaps = 42/340 (12%)

Query: 1   MHAKTDSEVTSIXXXXXXXXXXXXXLYFVQSPSRDSHDGEKTVTTSFHSTPVLX------ 54
           MHAKTDSEVTS+             +Y+VQSPSRDSHDGEKT T SFHSTPVL       
Sbjct: 1   MHAKTDSEVTSLAASSPARSPRRP-VYYVQSPSRDSHDGEKTAT-SFHSTPVLSPMGSPP 58

Query: 55  -------XXXXXXXXXXXXXXXXXXXKKDNP------PHHHSLKPWKQIDVIEEEGLLQG 101
                                     +K NP        H   K WK+  VIEEEGLL  
Sbjct: 59  HSHSSMGRHSRESSSSRFSGSLKPGSRKVNPNDGSKRKGHGGEKQWKECAVIEEEGLLDD 118

Query: 102 EDRDRTLSRRCYXXXXXXXXXXXXXXXXXXXWGASRPMKPKIFIKSIKFDHVQVQAGSDS 161
            DRD  + RRCY                   +GA++PMKPKI +KSI F+ +++QAG D+
Sbjct: 119 GDRDGGVPRRCYVLAFIVGFFILFGFFSLILYGAAKPMKPKITVKSITFETLKIQAGQDA 178

Query: 162 TGVATDMISMNSTLKFTYRNTGTFFGVHVASTPLELSYSEIVIAAGNMKEFYQHRRSHRL 221
            GV TDMI+MN+TL+  YRNTGTFFGVHV STP++LS+S+I I +G++K+FYQ R+S R 
Sbjct: 179 GGVGTDMITMNATLRMLYRNTGTFFGVHVTSTPIDLSFSQIKIGSGSVKKFYQGRKSERT 238

Query: 222 VSVAVMGNKIPLYGSGAS---------------------LSSTTGMPTVPVPLNLNFVLR 260
           V V V+G KIPLYGSG++                            P  PVP+ L+FV+R
Sbjct: 239 VLVHVIGEKIPLYGSGSTLLPPAPPAPLPKPKKKKGAPVPIPDPPAPPAPVPMTLSFVVR 298

Query: 261 SRAYVLGKLVKPKYYKRIQCSITLDPKKLSAPIPLKHSCT 300
           SRAYVLGKLV+PK+YK+I+C I  + K L+  I +  +CT
Sbjct: 299 SRAYVLGKLVQPKFYKKIECDINFEHKNLNKHIVITKNCT 338


>AT1G45688.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: plasma membrane;
           EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT5G42860.1); Has 35333 Blast
           hits to 34131 proteins in 2444 species: Archae - 798;
           Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants -
           531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI
           BLink). | chr1:17191502-17192464 FORWARD LENGTH=248
          Length = 248

 Score =  181 bits (459), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 106/240 (44%), Positives = 134/240 (55%), Gaps = 25/240 (10%)

Query: 1   MHAKTDSEVTSIXXXXXXXXXXXXXLYFVQSPSRDSHDGEKTVTTSFHSTPVLX------ 54
           MHAKTDSEVTS+             +Y+VQSPSRDSHDGEKT T SFHSTPVL       
Sbjct: 1   MHAKTDSEVTSLAASSPARSPRRP-VYYVQSPSRDSHDGEKTAT-SFHSTPVLSPMGSPP 58

Query: 55  -------XXXXXXXXXXXXXXXXXXXKKDNPPH------HHSLKPWKQIDVIEEEGLLQG 101
                                     +K NP        H   K WK+  VIEEEGLL  
Sbjct: 59  HSHSSMGRHSRESSSSRFSGSLKPGSRKVNPNDGSKRKGHGGEKQWKECAVIEEEGLLDD 118

Query: 102 EDRDRTLSRRCYXXXXXXXXXXXXXXXXXXXWGASRPMKPKIFIKSIKFDHVQVQAGSDS 161
            DRD  + RRCY                   +GA++PMKPKI +KSI F+ +++QAG D+
Sbjct: 119 GDRDGGVPRRCYVLAFIVGFFILFGFFSLILYGAAKPMKPKITVKSITFETLKIQAGQDA 178

Query: 162 TGVATDMISMNSTLKFTYRNTGTFFGVHVASTPLELSYSEIVIAAGN----MKEFYQHRR 217
            GV TDMI+MN+TL+  YRNTGTFFGVHV STP++LS+S+I I +G+    +++ Y+ R 
Sbjct: 179 GGVGTDMITMNATLRMLYRNTGTFFGVHVTSTPIDLSFSQIKIGSGSVSLPIQKLYRMRE 238


>AT4G35170.1 | Symbols:  | Late embryogenesis abundant (LEA)
           hydroxyproline-rich glycoprotein family |
           chr4:16736839-16738186 FORWARD LENGTH=299
          Length = 299

 Score =  167 bits (422), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 78/169 (46%), Positives = 112/169 (66%), Gaps = 1/169 (0%)

Query: 133 WGASRPMKPKIFIKSIKFDHVQVQAGSDSTGVATDMISMNSTLKFTYRNTGTFFGVHVAS 192
           WG S+   P   +K +  +++ VQ+G+D +GV TDM+++NST++  YRN  TFF VHV S
Sbjct: 129 WGVSKSFAPIATLKEMVLENLNVQSGNDQSGVLTDMLTLNSTVRILYRNPATFFTVHVTS 188

Query: 193 TPLELSYSEIVIAAGNMKEFYQHRRSHRLVSVAVMGNKIPLYGSGASLSSTTGMP-TVPV 251
            PL+LSYS++++A+G M EF Q R+S R++   V G++IPLYG   +L      P  V +
Sbjct: 189 APLQLSYSQLILASGQMGEFSQRRKSERIIETKVFGDQIPLYGGVPALFGQRAEPDQVVL 248

Query: 252 PLNLNFVLRSRAYVLGKLVKPKYYKRIQCSITLDPKKLSAPIPLKHSCT 300
           PLNL F LR+RAYVLG+LVK  ++  I+CSIT    KL   + L  SC+
Sbjct: 249 PLNLTFTLRARAYVLGRLVKTTFHSNIKCSITFYGDKLGKTLDLSKSCS 297


>AT2G41990.1 | Symbols:  | CONTAINS InterPro DOMAIN/s: Late
           embryogenesis abundant protein, group 2
           (InterPro:IPR004864); BEST Arabidopsis thaliana protein
           match is: Late embryogenesis abundant (LEA)
           hydroxyproline-rich glycoprotein family
           (TAIR:AT4G35170.1); Has 172 Blast hits to 168 proteins
           in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 172; Viruses - 0; Other Eukaryotes -
           0 (source: NCBI BLink). | chr2:17527396-17528527 FORWARD
           LENGTH=297
          Length = 297

 Score =  135 bits (339), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 76/167 (45%), Positives = 105/167 (62%), Gaps = 5/167 (2%)

Query: 133 WGASRPMKPKIFIKSIKFDHVQVQAGSDSTGVATDMISMNSTLKFTYRNTGTFFGVHVAS 192
           WGAS+   PK+ +K +    + +QAG+D +GV TDM+S+NST++  YRN  TFF VHV +
Sbjct: 134 WGASKSYPPKVTVKGMLVRDLNLQAGNDLSGVPTDMLSLNSTVRIYYRNPSTFFAVHVTA 193

Query: 193 TPLELSYSEIVIAAGNMKEFYQHRRSHRLVSVAVMGNKIPLYGSGASLSSTTGMPTVPVP 252
           +PL L YS +++++G M +F   R     V   V G++IPLYG G S      + T+ +P
Sbjct: 194 SPLLLHYSNLLLSSGEMNKFTVGRNGETNVVTVVQGHQIPLYG-GVSFH----LDTLSLP 248

Query: 253 LNLNFVLRSRAYVLGKLVKPKYYKRIQCSITLDPKKLSAPIPLKHSC 299
           LNL  VL S+AY+LG+LV  K+Y RI CS TLD   L   I L  SC
Sbjct: 249 LNLTIVLHSKAYILGRLVTSKFYTRIICSFTLDANHLPKSISLLRSC 295


>AT3G24600.1 | Symbols:  | Late embryogenesis abundant protein,
           group 2 | chr3:8972195-8974867 REVERSE LENGTH=506
          Length = 506

 Score =  126 bits (317), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 64/165 (38%), Positives = 97/165 (58%), Gaps = 2/165 (1%)

Query: 133 WGASRPMKPKIFIKSIKFDHVQVQAGSDSTGVATDMISMNSTLKFTYRNTGTFFGVHVAS 192
           WGAS P  P + +KS+         G D TGVAT ++S NS++K T  +   +FG+HV+S
Sbjct: 336 WGASHPFSPIVSVKSVDIHSFYYGEGIDRTGVATKILSFNSSVKVTIDSPAPYFGIHVSS 395

Query: 193 TPLELSYSEIVIAAGNMKEFYQHRRSHRLVSVAVMGNKIPLYGSGASLSSTTGMPTVPVP 252
           +  +L++S + +A G +K +YQ R+S  +  V + G ++PLYG+G  L+++     VPV 
Sbjct: 396 STFKLTFSALTLATGQLKSYYQPRKSKHISIVKLTGAEVPLYGAGPHLAASDKKGKVPV- 454

Query: 253 LNLNFVLRSRAYVLGKLVKPKYYKRIQCSITLDPKKLSAPIPLKH 297
             L F +RSR  +LGKLVK K+   + CS  +   K S PI   H
Sbjct: 455 -KLEFEIRSRGNLLGKLVKSKHENHVSCSFFISSSKTSKPIEFTH 498



 Score = 91.3 bits (225), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 72/256 (28%), Positives = 115/256 (44%), Gaps = 13/256 (5%)

Query: 1   MHAKTDSEVTSIXXXXXXXXXXXXXLYFVQSPSRDSHDGEKTVTTSFHSTPVLXXXXXXX 60
           M+ K+DS+VTS+              Y+VQSPSRDS        T+  +TP         
Sbjct: 3   MYPKSDSDVTSLDLSSPKRPT-----YYVQSPSRDSDKSSSVALTTHQTTPTESPSHPSI 57

Query: 61  XXXXXXXXXXXXXKKDNPPHHHSLKPWKQIDVIEEEGLLQGED---RDRTLS-RRCYXXX 116
                         K    +H  +  W   D  EE G  + ED    +R +S   C    
Sbjct: 58  ASRVSNGGGGGFRWKGRRKYHGGI--WWPADK-EEGGDGRYEDLYEDNRGVSIVTCRLIL 114

Query: 117 XXXXXXXXXXXXXXXXWGASRPMKPKIFIKSIKFDHVQVQAGSDSTGVATDMISMNSTLK 176
                           +GAS+   P ++IK +         GSD+TGV T ++++  ++ 
Sbjct: 115 GVVATLSIFFLLCSVLFGASQSSPPIVYIKGVNVRSFYYGEGSDNTGVPTKIMNVKCSVV 174

Query: 177 FTYRNTGTFFGVHVASTPLELSYS-EIVIAAGNMKEFYQHRRSHRLVSVAVMGNKIPLYG 235
            T  N  T FG+HV+ST + L YS +  +A   +K ++Q ++S+    + ++G+K+PLYG
Sbjct: 175 ITTHNPSTLFGIHVSSTAVSLIYSRQFTLANARLKSYHQPKQSNHTSRINLIGSKVPLYG 234

Query: 236 SGASLSSTTGMPTVPV 251
           +GA L ++     VPV
Sbjct: 235 AGAELVASDNSGGVPV 250


>AT3G08490.1 | Symbols:  | BEST Arabidopsis thaliana protein match
           is: Late embryogenesis abundant protein, group 2
           (TAIR:AT3G24600.1); Has 161 Blast hits to 158 proteins
           in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 161; Viruses - 0; Other Eukaryotes -
           0 (source: NCBI BLink). | chr3:2574105-2575125 REVERSE
           LENGTH=271
          Length = 271

 Score = 59.3 bits (142), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 33/154 (21%), Positives = 69/154 (44%), Gaps = 1/154 (0%)

Query: 135 ASRPMKPKIFIKSIKFDHVQVQAGSDSTGVATDMISMNSTLKFTYRNTGTFFGVHVASTP 194
           A++P  P I  +  +F+   ++ G DS GV+T  ++ N + K    N    FG+H+    
Sbjct: 103 ATQPPHPNISFRIGRFNQFMLEEGVDSHGVSTKFLTFNCSTKLIIDNKSNVFGLHIHPPS 162

Query: 195 LELSYSEIVIAAGNMKEFYQHRRSHRLVSVAVMGNKIPLYGSGASLSSTTGMPTVPVPLN 254
           ++  +  +  A     + Y          + +      +YG+G  ++    +    +PL 
Sbjct: 163 IKFFFGPLNFAKAQGPKLYGLSHESTTFQLYIATTNRAMYGAGTEMNDML-LSRAGLPLI 221

Query: 255 LNFVLRSRAYVLGKLVKPKYYKRIQCSITLDPKK 288
           L   + S   V+  ++ PKY+ +++C + L  K+
Sbjct: 222 LRTSIISDYRVVWNIINPKYHHKVECLLLLADKE 255