Miyakogusa Predicted Gene

Lj0g3v0012219.1
BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj0g3v0012219.1 Non Characterized Hit- tr|I1MYS1|I1MYS1_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.56469
PE,71.69,0,NUCLEOLAR PROTEIN-RELATED,NULL; NUCLEOLAR PROTEIN
7/ESTROGEN RECEPTOR COACTIVATOR-RELATED,NULL;
doma,NODE_54831_length_1456_cov_62.771290.path1.1
         (371 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT4G39870.2 | Symbols:  | TLD-domain containing nucleolar protei...   325   4e-89
AT4G39870.1 | Symbols:  | TLD-domain containing nucleolar protei...   325   4e-89
AT2G05590.2 | Symbols:  | TLD-domain containing nucleolar protei...   222   3e-58
AT2G05590.1 | Symbols:  | TLD-domain containing nucleolar protei...   174   1e-43
AT5G06260.1 | Symbols:  | TLD-domain containing nucleolar protei...    69   4e-12
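The bit scores and E-values in the table above are linked by the standard relation E ~ m * n * 2^(-S'), where S' is the bit score, m the query length and n the database size. The sketch below is only a rough check under stated assumptions: it plugs in the raw query length (371 letters) and the raw database size (14,482,855 letters) from this report, whereas BLAST itself uses effective lengths, so the values come out somewhat larger than the reported ones. The function name `approx_evalue` is hypothetical.

```python
def approx_evalue(bit_score, query_len, db_letters):
    """Approximate E-value from a bit score: E ~ m * n * 2**(-S').

    Assumption: raw lengths are used instead of BLAST's effective
    lengths, so this slightly overestimates the reported E-value.
    """
    return query_len * db_letters * 2.0 ** (-bit_score)


QUERY_LEN = 371            # "(371 letters)" in the report
DB_LETTERS = 14_482_855    # "14,482,855 total letters" in the report

# Bit scores taken from the alignment headers of this report.
hits = [
    ("AT4G39870.2", 325.0),
    ("AT2G05590.2", 222.0),
    ("AT2G05590.1", 174.0),
    ("AT5G06260.1", 69.3),
]

for name, bits in hits:
    print(f"{name}: E ~ {approx_evalue(bits, QUERY_LEN, DB_LETTERS):.1e}")
```

The results should land within a small factor of the E-values listed in the table; the remaining gap reflects the effective-length correction that this sketch leaves out.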

>AT4G39870.2 | Symbols:  | TLD-domain containing nucleolar protein |
           chr4:18502234-18504275 FORWARD LENGTH=394
          Length = 394

 Score =  325 bits (832), Expect = 4e-89,   Method: Compositional matrix adjust.
 Identities = 190/401 (47%), Positives = 247/401 (61%), Gaps = 38/401 (9%)

Query: 1   MGKKPSLRTKA----TDFVYAVLNPISD--SNDHNXXXXXXXXXXXEEVGETEISASETS 54
           MGK  S R+KA    TD    +LNPISD  S+ H            +       +  E++
Sbjct: 1   MGKHKSFRSKAVHFVTDLTAGLLNPISDKPSSAHPPPPLPDEEDESKR------NQLEST 54

Query: 55  DEEGSHGLDGGPDTSSFTAFLYSFVSSSDTKTDKHGQNDEK---------------SEPD 99
             E    L   PDTSSF+AFL S +SS      K    +++               S+  
Sbjct: 55  TAEQPKDLVDEPDTSSFSAFLGSLLSSDPKDKRKDQDPEDEEDEEEDEEEDSEAETSDTS 114

Query: 100 NINPLPDSSLKEN----GRRKSLFSRGKQSLGRAIRHATRIGGFRHH----DRRKDNVEM 151
           + +  P  ++KE       +KS  S+ KQ   R    A +  G +      D   D+ E 
Sbjct: 115 SSSANPTRTMKETTSGGAAKKSFLSKYKQHF-RNFYQAVKFPGVKERKGNSDVIPDDEET 173

Query: 152 KYDDGHCSKISTVEPVKESVHRPL-VDLPEISEPSVLLSEGMRSVLYASLPPLVHGRKWL 210
           +Y DG   K      VKE V   +   +PEISEPS+LLSE  R  LY SLP LV GRKW+
Sbjct: 174 EYYDGLEMKPMQNNNVKEEVTVVVQAIIPEISEPSLLLSEQSRRSLYTSLPALVQGRKWI 233

Query: 211 LLYSTWRHGVTLSTLYRRSMLSPGSCLLVVGDQRGAVFGSLVEAPMRPSNRRKYQGTNNT 270
           LLYSTWRHG++LSTLYR+S+L PG  LLVVGD++G+VFG LVEAP+ P+++ KYQGTN+T
Sbjct: 234 LLYSTWRHGISLSTLYRKSLLWPGLSLLVVGDRKGSVFGGLVEAPLIPTDK-KYQGTNST 292

Query: 271 YVFTNISGHPVIYRPTGVNRYFTLCNTDYIAIGGGGHFALYLDGDLLNGSSSVSETYGNP 330
           +VFTN SG P IYRPTG NR++TLC+ +++A+GGGG FALYLD +LL+GSS+ SETYGN 
Sbjct: 293 FVFTNKSGQPTIYRPTGANRFYTLCSKEFLALGGGGRFALYLDSELLSGSSAYSETYGNS 352

Query: 331 CLANSQEFEVKEVELWGFVQTSKYEEVLALSRTETPGICRW 371
           CLA+SQ+F+VKEVELWGFV  SKY+E+LA S+T  PG+CRW
Sbjct: 353 CLADSQDFDVKEVELWGFVYGSKYDEILAHSKTMEPGLCRW 393


>AT4G39870.1 | Symbols:  | TLD-domain containing nucleolar protein |
           chr4:18502234-18504275 FORWARD LENGTH=394
          Length = 394

 Score =  325 bits (832), Expect = 4e-89,   Method: Compositional matrix adjust.
 Identities = 190/401 (47%), Positives = 247/401 (61%), Gaps = 38/401 (9%)

Query: 1   MGKKPSLRTKA----TDFVYAVLNPISD--SNDHNXXXXXXXXXXXEEVGETEISASETS 54
           MGK  S R+KA    TD    +LNPISD  S+ H            +       +  E++
Sbjct: 1   MGKHKSFRSKAVHFVTDLTAGLLNPISDKPSSAHPPPPLPDEEDESKR------NQLEST 54

Query: 55  DEEGSHGLDGGPDTSSFTAFLYSFVSSSDTKTDKHGQNDEK---------------SEPD 99
             E    L   PDTSSF+AFL S +SS      K    +++               S+  
Sbjct: 55  TAEQPKDLVDEPDTSSFSAFLGSLLSSDPKDKRKDQDPEDEEDEEEDEEEDSEAETSDTS 114

Query: 100 NINPLPDSSLKEN----GRRKSLFSRGKQSLGRAIRHATRIGGFRHH----DRRKDNVEM 151
           + +  P  ++KE       +KS  S+ KQ   R    A +  G +      D   D+ E 
Sbjct: 115 SSSANPTRTMKETTSGGAAKKSFLSKYKQHF-RNFYQAVKFPGVKERKGNSDVIPDDEET 173

Query: 152 KYDDGHCSKISTVEPVKESVHRPL-VDLPEISEPSVLLSEGMRSVLYASLPPLVHGRKWL 210
           +Y DG   K      VKE V   +   +PEISEPS+LLSE  R  LY SLP LV GRKW+
Sbjct: 174 EYYDGLEMKPMQNNNVKEEVTVVVQAIIPEISEPSLLLSEQSRRSLYTSLPALVQGRKWI 233

Query: 211 LLYSTWRHGVTLSTLYRRSMLSPGSCLLVVGDQRGAVFGSLVEAPMRPSNRRKYQGTNNT 270
           LLYSTWRHG++LSTLYR+S+L PG  LLVVGD++G+VFG LVEAP+ P+++ KYQGTN+T
Sbjct: 234 LLYSTWRHGISLSTLYRKSLLWPGLSLLVVGDRKGSVFGGLVEAPLIPTDK-KYQGTNST 292

Query: 271 YVFTNISGHPVIYRPTGVNRYFTLCNTDYIAIGGGGHFALYLDGDLLNGSSSVSETYGNP 330
           +VFTN SG P IYRPTG NR++TLC+ +++A+GGGG FALYLD +LL+GSS+ SETYGN 
Sbjct: 293 FVFTNKSGQPTIYRPTGANRFYTLCSKEFLALGGGGRFALYLDSELLSGSSAYSETYGNS 352

Query: 331 CLANSQEFEVKEVELWGFVQTSKYEEVLALSRTETPGICRW 371
           CLA+SQ+F+VKEVELWGFV  SKY+E+LA S+T  PG+CRW
Sbjct: 353 CLADSQDFDVKEVELWGFVYGSKYDEILAHSKTMEPGLCRW 393


>AT2G05590.2 | Symbols:  | TLD-domain containing nucleolar protein |
           chr2:2067196-2068951 FORWARD LENGTH=303
          Length = 303

 Score =  222 bits (566), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 99/175 (56%), Positives = 133/175 (76%)

Query: 180 EISEPSVLLSEGMRSVLYASLPPLVHGRKWLLLYSTWRHGVTLSTLYRRSMLSPGSCLLV 239
           E++E SV ++  +   L+ASLP +V G KW+LLYST +HG++L TL RRS   PG CLLV
Sbjct: 127 ELTESSVFITANLFEFLHASLPNIVRGCKWILLYSTLKHGISLRTLLRRSGELPGPCLLV 186

Query: 240 VGDQRGAVFGSLVEAPMRPSNRRKYQGTNNTYVFTNISGHPVIYRPTGVNRYFTLCNTDY 299
            GD++GAVFG+L+E P++P+ +RKYQGT+ T++FT I G P I+RPTG NRY+ +C  ++
Sbjct: 187 AGDKQGAVFGALLECPLQPTPKRKYQGTSQTFLFTTIYGEPRIFRPTGANRYYLMCMNEF 246

Query: 300 IAIGGGGHFALYLDGDLLNGSSSVSETYGNPCLANSQEFEVKEVELWGFVQTSKY 354
           +A GGGG+FAL LD DLL  +S  SET+GN CLA+S EFE+K VELWGF   S+Y
Sbjct: 247 LAFGGGGNFALCLDEDLLKATSGPSETFGNECLASSTEFELKNVELWGFAHASQY 301


>AT2G05590.1 | Symbols:  | TLD-domain containing nucleolar protein |
           chr2:2067196-2068650 FORWARD LENGTH=263
          Length = 263

 Score =  174 bits (440), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 76/137 (55%), Positives = 105/137 (76%)

Query: 180 EISEPSVLLSEGMRSVLYASLPPLVHGRKWLLLYSTWRHGVTLSTLYRRSMLSPGSCLLV 239
           E++E SV ++  +   L+ASLP +V G KW+LLYST +HG++L TL RRS   PG CLLV
Sbjct: 127 ELTESSVFITANLFEFLHASLPNIVRGCKWILLYSTLKHGISLRTLLRRSGELPGPCLLV 186

Query: 240 VGDQRGAVFGSLVEAPMRPSNRRKYQGTNNTYVFTNISGHPVIYRPTGVNRYFTLCNTDY 299
            GD++GAVFG+L+E P++P+ +RKYQGT+ T++FT I G P I+RPTG NRY+ +C  ++
Sbjct: 187 AGDKQGAVFGALLECPLQPTPKRKYQGTSQTFLFTTIYGEPRIFRPTGANRYYLMCMNEF 246

Query: 300 IAIGGGGHFALYLDGDL 316
           +A GGGG+FAL LD DL
Sbjct: 247 LAFGGGGNFALCLDEDL 263


>AT5G06260.1 | Symbols:  | TLD-domain containing nucleolar protein |
           chr5:1902755-1904835 REVERSE LENGTH=424
          Length = 424

 Score = 69.3 bits (168), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 57/207 (27%), Positives = 93/207 (44%), Gaps = 20/207 (9%)

Query: 162 STVEPVKESVHRPLVDLPEISEPSVLLSEGMRSVLYASLP--PLVHGRKWLLLYSTWRHG 219
           STV P  +  H    D   +S   +LL +     +  +LP   LV   +W LLY +  HG
Sbjct: 194 STVRPGYQVPHLLYED--SVSSDRLLLKKEYAWHIGGALPHHELV---EWKLLYHSSVHG 248

Query: 220 VTLST-LYRRSMLSPGSCLLVVGDQRGAVFGSLVEAPMRPSNRRKYQGTNNTYVFTNISG 278
            + +T L   S     + +L++ D  G V+G     P        + G   +++F  ++ 
Sbjct: 249 QSFNTFLGHTSNTGMSASVLIIKDTEGYVYGGYASQPWE--RYSDFYGDMKSFLF-QLNP 305

Query: 279 HPVIYRPTGVNRYFTLCNTDYIA------IGGGG---HFALYLDGDLLNGSSSVSETYGN 329
              IYRPTG N     C T++ +      IG GG   HF L++      G +    T+G+
Sbjct: 306 KAAIYRPTGANTNIQWCATNFTSENIPNGIGFGGKINHFGLFISASFDQGQTFECTTFGS 365

Query: 330 PCLANSQEFEVKEVELWGFVQTSKYEE 356
           P L+ +   + + +E WG VQ S  ++
Sbjct: 366 PSLSKTSRIQPEVIECWGIVQASNEQD 392
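
Each alignment above carries a statistics line of the form "Identities = a/L (p%), Positives = b/L (q%), Gaps = g/L (r%)", with the Gaps field absent for ungapped alignments. The sketch below shows one way to pull those counts out of the text with a regular expression and recompute the unrounded percent identity. The names `STATS_RE` and `parse_stats` are hypothetical, and the pattern is assumed to cover only the line layout seen in this report.

```python
import re

# Matches the per-alignment statistics line; the Gaps field is optional
# because ungapped alignments in this report omit it.
STATS_RE = re.compile(
    r"Identities = (\d+)/(\d+) \(\d+%\), "
    r"Positives = (\d+)/\d+ \(\d+%\)"
    r"(?:, Gaps = (\d+)/\d+ \(\d+%\))?"
)


def parse_stats(line):
    """Return alignment counts from a statistics line, or None if no match."""
    m = STATS_RE.search(line)
    if m is None:
        return None
    identities = int(m.group(1))
    length = int(m.group(2))
    return {
        "identities": identities,
        "length": length,
        "positives": int(m.group(3)),
        "gaps": int(m.group(4)) if m.group(4) else 0,
        "pct_identity": 100.0 * identities / length,
    }


gapped = " Identities = 190/401 (47%), Positives = 247/401 (61%), Gaps = 38/401 (9%)"
ungapped = " Identities = 99/175 (56%), Positives = 133/175 (76%)"

print(parse_stats(gapped))
print(parse_stats(ungapped))
```

Recomputing the percentage from the counts avoids relying on the whole-number rounding shown in the report.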