Miyakogusa Predicted Gene

Lj6g3v1629690.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj6g3v1629690.1 tr|Q9LT74|Q9LT74_ARATH Similarity to late
embryogenesis abundant protein OS=Arabidopsis thaliana
GN=,52.86,0.0000000006,SUBFAMILY NOT NAMED,NULL; FAMILY NOT
NAMED,NULL; Root_cap,Root cap,CUFF.59714.1
         (370 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT3G19430.1 | Symbols:  | late embryogenesis abundant protein-re...   398   e-111
AT5G60530.1 | Symbols:  | late embryogenesis abundant protein-re...   256   2e-68
AT5G54370.1 | Symbols:  | Late embryogenesis abundant (LEA) prot...   251   7e-67
AT5G60520.1 | Symbols:  | Late embryogenesis abundant (LEA) prot...   246   1e-65
AT1G54890.1 | Symbols:  | Late embryogenesis abundant (LEA) prot...   233   2e-61
AT4G27400.1 | Symbols:  | Late embryogenesis abundant (LEA) prot...   226   3e-59

>AT3G19430.1 | Symbols:  | late embryogenesis abundant
           protein-related / LEA protein-related |
           chr3:6736382-6738330 REVERSE LENGTH=559
          Length = 559

 Score =  398 bits (1022), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 191/301 (63%), Positives = 237/301 (78%), Gaps = 9/301 (2%)

Query: 77  EKARCKNKKFTSCYNTEHVCPASCPGACEVDCTTCKPVCKCDKPGAVCQDPRFIGGDGIT 136
           ++ RCK K+ + CY  E+ CPA CP +C+VDC TCKPVC CDKPG+VCQDPRFIGGDG+T
Sbjct: 261 KRVRCK-KQRSPCYGVEYTCPADCPRSCQVDCVTCKPVCNCDKPGSVCQDPRFIGGDGLT 319

Query: 137 FYFHGKKDSNFCLVSDPNLHINAHFIGKRNHNMKRDFTWVQSIAILFDNHQLSVSALKTA 196
           FYFHGKKDSNFCL+SDPNLHINAHFIGKR   M RDFTWVQSIAILF  H+L V ALKTA
Sbjct: 320 FYFHGKKDSNFCLISDPNLHINAHFIGKRRAGMARDFTWVQSIAILFGTHRLYVGALKTA 379

Query: 197 TWEDSKDRLALTFDGEPIALRESEGAAWKSSSG----VSILR-DADTNSVVVEVEGKFRM 251
           TW+DS DR+A++FDG  I+L + +GA W SS G    VS+ R + DTN++ VEVEG  ++
Sbjct: 380 TWDDSVDRIAVSFDGNVISLPQLDGARWTSSPGVYPEVSVKRVNTDTNNLEVEVEGLLKI 439

Query: 252 TSKVVPITEEESRIHRYGITEEDCFAHLDVGFRFFSLSSEVSGVLGQTYKPDYVSRVNVG 311
           T++VVPIT E+SRIH Y + E+DC AHLD+GF+F  LS  V GVLGQTY+ +YVSRV +G
Sbjct: 440 TARVVPITMEDSRIHGYDVKEDDCLAHLDLGFKFQDLSDNVDGVLGQTYRSNYVSRVKIG 499

Query: 312 VKMPVLGGAGKEFETTSLFSPDCSVARFVGS--NNDEVVTLEMPAMNCVSGIDGQGVVCR 369
           V MPV+GG  +EF+TT LF+PDCS ARF G+  +N+    LE+P M+C SG+ G+GVVC+
Sbjct: 500 VHMPVMGG-DREFQTTGLFAPDCSAARFTGNGDSNNGRSKLELPEMSCASGLGGKGVVCK 558

Query: 370 R 370
           R
Sbjct: 559 R 559



 Score = 59.3 bits (142), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 24/38 (63%), Positives = 29/38 (76%)

Query: 36 YATCNIKKYKYCYNWPQECPASCPNKCKVDCATCKPIC 73
          +ATC IKKYK+CYN    CP  CP+ C V+CA+CKPIC
Sbjct: 14 HATCKIKKYKHCYNLEHVCPKFCPDSCHVECASCKPIC 51



 Score = 55.5 bits (132), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 22/37 (59%), Positives = 27/37 (72%)

Query: 79  ARCKNKKFTSCYNTEHVCPASCPGACEVDCTTCKPVC 115
           A CK KK+  CYN EHVCP  CP +C V+C +CKP+C
Sbjct: 15  ATCKIKKYKHCYNLEHVCPKFCPDSCHVECASCKPIC 51


>AT5G60530.1 | Symbols:  | late embryogenesis abundant
           protein-related / LEA protein-related |
           chr5:24334197-24335685 REVERSE LENGTH=439
          Length = 439

 Score =  256 bits (654), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 130/289 (44%), Positives = 176/289 (60%), Gaps = 23/289 (7%)

Query: 71  PICVPKEKARCKNKKFTSCYNTEHVCPASCPG----------ACEVDCTT-CKPVCK--- 116
           P+   +E+A C+ +   SCY    VCP  CP            C +DCT  C+  CK   
Sbjct: 143 PLPTGQEQAMCQGR--GSCYYKTLVCPGECPKRKPTKNKNTKGCFIDCTNKCEATCKWRK 200

Query: 117 --CDKPGAVCQDPRFIGGDGITFYFHGKKDSNFCLVSDPNLHINAHFIGKRNHNMKRDFT 174
             C+  G++C DPRF+GGDG+ FYFHG K  NF +VSD NL INAHFIG R     RDFT
Sbjct: 201 TNCNGYGSLCYDPRFVGGDGVMFYFHGSKGGNFAIVSDNNLQINAHFIGTRPVGRTRDFT 260

Query: 175 WVQSIAILFDNHQLSVSALKTATWEDSKDRLALTFDGEPIALRESEGAAWKSSSG----V 230
           WVQ++ ++F+NH+L ++A +   W+++ D   + +DGE I L E E + W+  SG    +
Sbjct: 261 WVQALNVMFENHKLVITANRVNQWDETSDAFTIRYDGELITLPEDEQSEWREISGQKKDI 320

Query: 231 SILRDADTNSVVVEVEGKFRMTSKVVPITEEESRIHRYGITEEDCFAHLDVGFRFFSLSS 290
            I R  + NSV V V    +M  +V PI +EE+R+H Y + ++D FAHL+  F+F  LS 
Sbjct: 321 IIERTDERNSVRVLVSDLVQMDIRVRPIGKEENRVHNYQLPQDDAFAHLETQFKFLDLSE 380

Query: 291 EVSGVLGQTYKPDYVSRVNVGVKMPVLGGAGKEFETTSLFSPDCSVARF 339
            V GVLG+TY+PDYVS    GV MPVLGG  K ++T SLFSP C + RF
Sbjct: 381 LVEGVLGKTYRPDYVSSAKTGVPMPVLGGEDK-YQTPSLFSPTCRLCRF 428


>AT5G54370.1 | Symbols:  | Late embryogenesis abundant (LEA)
           protein-related | chr5:22075334-22076567 FORWARD
           LENGTH=337
          Length = 337

 Score =  251 bits (640), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 130/306 (42%), Positives = 184/306 (60%), Gaps = 22/306 (7%)

Query: 86  FTSCYNTEHVCPASCPG---------ACEVDCT--TCKPVCK-----CDKPGAVCQDPRF 129
           +T CY     CP  CP           C  DC   TCK  C+     C++PG+ C DPRF
Sbjct: 33  YTRCYRKYIRCPEECPSKTAMNSKNKVCYADCDRPTCKSQCRMRKPNCNRPGSACYDPRF 92

Query: 130 IGGDGITFYFHGKKDSNFCLVSDPNLHINAHFIGKRNHNMKRDFTWVQSIAILFDNHQLS 189
           IGGDGI FYFHGK +  F LVSD +L IN  FIG R     RDFTW+Q++  LF++++ S
Sbjct: 93  IGGDGIVFYFHGKSNEEFSLVSDSDLQINGRFIGHRPAGRARDFTWIQALGFLFNSNKFS 152

Query: 190 VSALKTATWEDSKDRLALTFDGEPIALRESEGAAWKS-SSGVSILRDADTNSVVVEVEGK 248
           + A KTA+W++  D L  ++DG+ +++ E   + W S +  + I R +  NSV+V ++ K
Sbjct: 153 LEAAKTASWDNEIDHLKFSYDGQDLSVPEETLSTWYSPNKDIKIERVSMRNSVIVTIKDK 212

Query: 249 FRMTSKVVPITEEESRIHRYGITEEDCFAHLDVGFRFFSLSSEVSGVLGQTYKPDYVSRV 308
             +   VVP+T+E+ RIH Y +  +DCFAHL+V FRFF+LS +V G+LG+TY+PD+ +  
Sbjct: 213 AEIMINVVPVTKEDDRIHSYKVPSDDCFAHLEVQFRFFNLSPKVDGILGRTYRPDFQNPA 272

Query: 309 NVGVKMPVLGGAGKEFETTSLFSPDCSVARFVGSNN--DEVVT-LEMPAMNCVSGI-DGQ 364
             GV MPV+GG    F+T+SL S DC    F  S    D V + +E   ++C  G   G 
Sbjct: 273 KPGVAMPVVGGE-DSFKTSSLLSNDCKTCIFSESQAEIDSVKSEIEYATLDCTRGASSGY 331

Query: 365 GVVCRR 370
           G+VCR+
Sbjct: 332 GIVCRK 337


>AT5G60520.1 | Symbols:  | Late embryogenesis abundant (LEA)
           protein-related | chr5:24331787-24332947 REVERSE
           LENGTH=338
          Length = 338

 Score =  246 bits (629), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 127/290 (43%), Positives = 179/290 (61%), Gaps = 22/290 (7%)

Query: 71  PICVPKEKARCKNKKFTSCYNTEHVCPASCP----------GACEVDCTT-CKPVCK--- 116
           P+   +E+ +C  +   SC      CP  CP           AC +DC++ C+  CK   
Sbjct: 43  PLGSGQERVQCLAR--GSCNQKILTCPKECPERKPKMNKKKKACFIDCSSKCEVTCKWRK 100

Query: 117 --CDKPGAVCQDPRFIGGDGITFYFHGKKDSNFCLVSDPNLHINAHFIGKRNHNMKRDFT 174
             C+  G++C DPRF+GGDG+ FYFHG KD NF +VSD NL INAHFIG R     RDFT
Sbjct: 101 ANCNGYGSLCYDPRFVGGDGVMFYFHGNKDGNFAIVSDENLQINAHFIGTRPAGRTRDFT 160

Query: 175 WVQSIAILFDNHQLSVSALKTATWEDSKDRLALTFDGEPIALRESEGAAWK---SSSGVS 231
           WVQ+ +++FD+H L ++A K A+W+DS D L + ++GE + +     A W+       V 
Sbjct: 161 WVQAFSVMFDSHNLVIAAKKVASWDDSVDSLVVRWNGEEVEVPTEGEAEWRIDLDEREVI 220

Query: 232 ILRDADTNSVVVEVEGKFRMTSKVVPITEEESRIHRYGITEEDCFAHLDVGFRFFSLSSE 291
           + R  + N+V V V G  ++  +V PI +EE R+H+Y + ++D FAHL+  F+FF+LS  
Sbjct: 221 VERTDERNNVRVTVSGIVQIDIQVRPIGKEEDRVHKYQLPKDDAFAHLETQFKFFNLSDL 280

Query: 292 VSGVLGQTYKPDYVSRVNVGVKMPVLGGAGKEFETTSLFSPDCSVARFVG 341
           V GVLG+TY+P YVS V  GV MP++GG  K ++T SLFSP C+V RF G
Sbjct: 281 VEGVLGKTYRPGYVSPVKTGVPMPMMGGEDK-YQTPSLFSPLCNVCRFQG 329


>AT1G54890.1 | Symbols:  | Late embryogenesis abundant (LEA)
           protein-related | chr1:20463107-20464407 FORWARD
           LENGTH=347
          Length = 347

 Score =  233 bits (593), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 127/319 (39%), Positives = 181/319 (56%), Gaps = 27/319 (8%)

Query: 77  EKARCKNKKFTSCYNTEHVCPASCPG---------ACEVDC--TTCKPVCKCDKP----- 120
           E   C  +K + C+  +  CP  CP          AC +DC    CK  C+  KP     
Sbjct: 31  ETKTCFQRK-SPCFLKKQTCPKQCPSFSPPNGSTKACVIDCFNPICKATCRNRKPNCNGK 89

Query: 121 GAVCQDPRFIGGDGITFYFHGKKDSNFCLVSDPNLHINAHFIGKRNHNMKRDFTWVQSIA 180
           G+ C DPRFIGGDGI FYFHGK+D +F L+SD +  +NA FIG R +   RDFTW+QS+ 
Sbjct: 90  GSACLDPRFIGGDGIVFYFHGKRDEHFALISDVDFQVNARFIGLRPNGRARDFTWIQSLG 149

Query: 181 ILF--DNHQLSVSALKTATWEDSKDRLALTFDGEPIALRESEGAAWKSSSG--VSILRDA 236
           ++F  ++   S+ A K   W+   D L L+++G+ I+L + + + W    G  + I R +
Sbjct: 150 LIFGPNSKTFSLEATKAEKWDHQVDHLRLSYEGKEISLPKGDTSVWSPPLGDYIKIERTS 209

Query: 237 DTNSVVVEVEGKFRMTSKVVPITEEESRIHRYGITEEDCFAHLDVGFRFFSLSSEVSGVL 296
           D NSV+V ++    +   VVP+T+E+  IH+YGI E+DCFAHL+V FRF  LSS V GVL
Sbjct: 210 DINSVLVTLQDIAEIWINVVPVTKEDDIIHKYGIPEDDCFAHLEVQFRFLKLSSNVEGVL 269

Query: 297 GQTYKPDYVSRVNVGVKMPVLGGAGKEFETTSLFSPDCSVARFVGSNND----EVVTLEM 352
           G+TYK D+ +    GV MPV+GG  K + T SL    C+   + G +      E + L  
Sbjct: 270 GRTYKEDFKNPAKPGVAMPVVGGEDK-YRTASLLETSCNACVYSGGSRSLDKIEPLLLNQ 328

Query: 353 PAMNCVSG-IDGQGVVCRR 370
             ++C  G   G G+ CR+
Sbjct: 329 NTVDCTGGSSSGVGIFCRK 347


>AT4G27400.1 | Symbols:  | Late embryogenesis abundant (LEA)
           protein-related | chr4:13705341-13706637 FORWARD
           LENGTH=341
          Length = 341

 Score =  226 bits (575), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 120/299 (40%), Positives = 172/299 (57%), Gaps = 27/299 (9%)

Query: 96  CPASCPGA---------CEVDC--TTCKPVCKCDKP-----GAVCQDPRFIGGDGITFYF 139
           CP  CP           C VDC    C+ VC+  KP     G++C DPRFIGGDGI FYF
Sbjct: 46  CPEECPTEMFPNSQNKICWVDCFKPLCEAVCRAVKPNCESYGSICLDPRFIGGDGIVFYF 105

Query: 140 HGKKDSNFCLVSDPNLHINAHFIGKRNHNMKRDFTWVQSIAILFDNHQLSVSALKTATWE 199
           HGK + +F +VSDP+  INA F G R     RDFTW+Q++  LF++H+ S+   K ATW+
Sbjct: 106 HGKSNEHFSIVSDPDFQINARFTGHRPAGRTRDFTWIQALGFLFNSHKFSLETTKVATWD 165

Query: 200 DSKDRLALTFDGEPIALRESEGAAWKSSS-GVSILRDADTNSVVVEVEGKFRMTSKVVPI 258
            + D L  T DG+ + + +   + W SS   + I R  + NSV+V ++ K  +   VVP+
Sbjct: 166 SNLDHLKFTIDGQDLIIPQETLSTWYSSDKDIKIERLTEKNSVIVTIKDKAEIMVNVVPV 225

Query: 259 TEEESRIHRYGITEEDCFAHLDVGFRFFSLSSEVSGVLGQTYKPDYVSRVNVGVKMPVLG 318
           T+E+ RIH Y +  +DCFAH +V F+F +LS +V G+LG+TY+PD+ +    GV MPV+G
Sbjct: 226 TKEDDRIHNYKLPVDDCFAHFEVQFKFINLSPKVDGILGRTYRPDFKNPAKPGVVMPVVG 285

Query: 319 GAGKEFETTSLFSPDCSVARFVGSNNDEVVTLEMPAMNCVSGID-------GQGVVCRR 370
           G    F T+SL S  C    F  S +  V +  +   +  + +D       G G+VCR+
Sbjct: 286 GE-DSFRTSSLLSHVCKTCLF--SEDPAVASGSVKPKSTYALLDCSRGASSGYGLVCRK 341
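
The "Sequences producing significant alignments" table near the top of this report can be read programmatically. The following is a minimal sketch, not part of the BLAST output itself: the helper name parse_hit and the regular expression are illustrative assumptions based only on the line layout shown in this report (subject ID, a "|"-separated description, then bit score and E-value at the end of the line). Legacy BLAST prints some E-values without a mantissa (for example "e-111"), so the sketch prepends "1" before converting.

```python
import re

# Assumed layout of one summary line (taken from this report only):
#   <subject id> | <description...>   <bit score>   <E-value>
HIT_RE = re.compile(r"^(\S+)\s+\|.*?\s+(\d+(?:\.\d+)?)\s+(\S+)\s*$")


def parse_hit(line):
    """Return (subject_id, bit_score, e_value) or None if the line is not a hit row.

    Hypothetical helper for illustration; it is not part of any BLAST tool.
    """
    m = HIT_RE.match(line)
    if m is None:
        return None
    subject, bits, evalue = m.groups()
    # Legacy BLAST writes "e-111" with no leading mantissa; float() needs "1e-111".
    if evalue.startswith("e"):
        evalue = "1" + evalue
    return subject, float(bits), float(evalue)


if __name__ == "__main__":
    rows = [
        "AT3G19430.1 | Symbols:  | late embryogenesis abundant protein-re...   398   e-111",
        "AT5G60530.1 | Symbols:  | late embryogenesis abundant protein-re...   256   2e-68",
    ]
    for row in rows:
        print(parse_hit(row))
```

Lines that do not follow the assumed layout, such as the "Query=" header, yield None, so the helper can be applied to every line of the report and the None results discarded.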