Miyakogusa Predicted Gene
- Lj0g3v0361769.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0361769.1 Non Characterized Hit- tr|I1MYS1|I1MYS1_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.56469
PE,71.96,0,NUCLEOLAR PROTEIN-RELATED,NULL; NUCLEOLAR PROTEIN
7/ESTROGEN RECEPTOR COACTIVATOR-RELATED,NULL;
doma,NODE_54831_length_1456_cov_62.771290.path2.1
(371 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                   Score     E
Sequences producing significant alignments:                       (bits)   Value
AT4G39870.2 | Symbols: | TLD-domain containing nucleolar protei...   325   4e-89
AT4G39870.1 | Symbols: | TLD-domain containing nucleolar protei...   325   4e-89
AT2G05590.2 | Symbols: | TLD-domain containing nucleolar protei...   222   3e-58
AT2G05590.1 | Symbols: | TLD-domain containing nucleolar protei...   174   1e-43
AT5G06260.1 | Symbols: | TLD-domain containing nucleolar protei...    69   4e-12
>AT4G39870.2 | Symbols: | TLD-domain containing nucleolar protein |
chr4:18502234-18504275 FORWARD LENGTH=394
Length = 394
Score = 325 bits (832), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 190/401 (47%), Positives = 247/401 (61%), Gaps = 38/401 (9%)
Query: 1 MGKKPSLRTKA----TDFVYAVLNPISD--SNDHNXXXXXXXXXXXEEVGETEISASETS 54
MGK S R+KA TD +LNPISD S+ H + + E++
Sbjct: 1 MGKHKSFRSKAVHFVTDLTAGLLNPISDKPSSAHPPPPLPDEEDESKR------NQLEST 54
Query: 55 DEEGSHGLDGGPDTSSFTAFLYSFVSSSDTKTDKHGQNDEK---------------SEPD 99
E L PDTSSF+AFL S +SS K +++ S+
Sbjct: 55 TAEQPKDLVDEPDTSSFSAFLGSLLSSDPKDKRKDQDPEDEEDEEEDEEEDSEAETSDTS 114
Query: 100 NINPLPDSSLKEN----GRRKSLFSRGKQSLGRAIRHATRIGGFRHH----DRRKDNVEM 151
+ + P ++KE +KS S+ KQ R A + G + D D+ E
Sbjct: 115 SSSANPTRTMKETTSGGAAKKSFLSKYKQHF-RNFYQAVKFPGVKERKGNSDVIPDDEET 173
Query: 152 KYDDGHCSKISTVEPVKESVHRPL-VDLPEISEPSVLLSEGMRSVLYASLPPLVHGRKWL 210
+Y DG K VKE V + +PEISEPS+LLSE R LY SLP LV GRKW+
Sbjct: 174 EYYDGLEMKPMQNNNVKEEVTVVVQAIIPEISEPSLLLSEQSRRSLYTSLPALVQGRKWI 233
Query: 211 LLYSTWRHGVTLSTLYRRSMLSPGSCLLVVGDQRGAVFGSLVEAPMRPSNRRKYQGTNNT 270
LLYSTWRHG++LSTLYR+S+L PG LLVVGD++G+VFG LVEAP+ P+++ KYQGTN+T
Sbjct: 234 LLYSTWRHGISLSTLYRKSLLWPGLSLLVVGDRKGSVFGGLVEAPLIPTDK-KYQGTNST 292
Query: 271 YVFTNISGHPVIYRPTGVNRYFTLCNTDYIAIGGGGHFALYLDGDLLNGSSSVSETYGNP 330
+VFTN SG P IYRPTG NR++TLC+ +++A+GGGG FALYLD +LL+GSS+ SETYGN
Sbjct: 293 FVFTNKSGQPTIYRPTGANRFYTLCSKEFLALGGGGRFALYLDSELLSGSSAYSETYGNS 352
Query: 331 CLANSQEFEVKEVELWGFVQTSKYEEVLALSRTETPGICRW 371
CLA+SQ+F+VKEVELWGFV SKY+E+LA S+T PG+CRW
Sbjct: 353 CLADSQDFDVKEVELWGFVYGSKYDEILAHSKTMEPGLCRW 393
>AT4G39870.1 | Symbols: | TLD-domain containing nucleolar protein |
chr4:18502234-18504275 FORWARD LENGTH=394
Length = 394
Score = 325 bits (832), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 190/401 (47%), Positives = 247/401 (61%), Gaps = 38/401 (9%)
Query: 1 MGKKPSLRTKA----TDFVYAVLNPISD--SNDHNXXXXXXXXXXXEEVGETEISASETS 54
MGK S R+KA TD +LNPISD S+ H + + E++
Sbjct: 1 MGKHKSFRSKAVHFVTDLTAGLLNPISDKPSSAHPPPPLPDEEDESKR------NQLEST 54
Query: 55 DEEGSHGLDGGPDTSSFTAFLYSFVSSSDTKTDKHGQNDEK---------------SEPD 99
E L PDTSSF+AFL S +SS K +++ S+
Sbjct: 55 TAEQPKDLVDEPDTSSFSAFLGSLLSSDPKDKRKDQDPEDEEDEEEDEEEDSEAETSDTS 114
Query: 100 NINPLPDSSLKEN----GRRKSLFSRGKQSLGRAIRHATRIGGFRHH----DRRKDNVEM 151
+ + P ++KE +KS S+ KQ R A + G + D D+ E
Sbjct: 115 SSSANPTRTMKETTSGGAAKKSFLSKYKQHF-RNFYQAVKFPGVKERKGNSDVIPDDEET 173
Query: 152 KYDDGHCSKISTVEPVKESVHRPL-VDLPEISEPSVLLSEGMRSVLYASLPPLVHGRKWL 210
+Y DG K VKE V + +PEISEPS+LLSE R LY SLP LV GRKW+
Sbjct: 174 EYYDGLEMKPMQNNNVKEEVTVVVQAIIPEISEPSLLLSEQSRRSLYTSLPALVQGRKWI 233
Query: 211 LLYSTWRHGVTLSTLYRRSMLSPGSCLLVVGDQRGAVFGSLVEAPMRPSNRRKYQGTNNT 270
LLYSTWRHG++LSTLYR+S+L PG LLVVGD++G+VFG LVEAP+ P+++ KYQGTN+T
Sbjct: 234 LLYSTWRHGISLSTLYRKSLLWPGLSLLVVGDRKGSVFGGLVEAPLIPTDK-KYQGTNST 292
Query: 271 YVFTNISGHPVIYRPTGVNRYFTLCNTDYIAIGGGGHFALYLDGDLLNGSSSVSETYGNP 330
+VFTN SG P IYRPTG NR++TLC+ +++A+GGGG FALYLD +LL+GSS+ SETYGN
Sbjct: 293 FVFTNKSGQPTIYRPTGANRFYTLCSKEFLALGGGGRFALYLDSELLSGSSAYSETYGNS 352
Query: 331 CLANSQEFEVKEVELWGFVQTSKYEEVLALSRTETPGICRW 371
CLA+SQ+F+VKEVELWGFV SKY+E+LA S+T PG+CRW
Sbjct: 353 CLADSQDFDVKEVELWGFVYGSKYDEILAHSKTMEPGLCRW 393
>AT2G05590.2 | Symbols: | TLD-domain containing nucleolar protein |
chr2:2067196-2068951 FORWARD LENGTH=303
Length = 303
Score = 222 bits (566), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 99/175 (56%), Positives = 133/175 (76%)
Query: 180 EISEPSVLLSEGMRSVLYASLPPLVHGRKWLLLYSTWRHGVTLSTLYRRSMLSPGSCLLV 239
E++E SV ++ + L+ASLP +V G KW+LLYST +HG++L TL RRS PG CLLV
Sbjct: 127 ELTESSVFITANLFEFLHASLPNIVRGCKWILLYSTLKHGISLRTLLRRSGELPGPCLLV 186
Query: 240 VGDQRGAVFGSLVEAPMRPSNRRKYQGTNNTYVFTNISGHPVIYRPTGVNRYFTLCNTDY 299
GD++GAVFG+L+E P++P+ +RKYQGT+ T++FT I G P I+RPTG NRY+ +C ++
Sbjct: 187 AGDKQGAVFGALLECPLQPTPKRKYQGTSQTFLFTTIYGEPRIFRPTGANRYYLMCMNEF 246
Query: 300 IAIGGGGHFALYLDGDLLNGSSSVSETYGNPCLANSQEFEVKEVELWGFVQTSKY 354
+A GGGG+FAL LD DLL +S SET+GN CLA+S EFE+K VELWGF S+Y
Sbjct: 247 LAFGGGGNFALCLDEDLLKATSGPSETFGNECLASSTEFELKNVELWGFAHASQY 301
>AT2G05590.1 | Symbols: | TLD-domain containing nucleolar protein |
chr2:2067196-2068650 FORWARD LENGTH=263
Length = 263
Score = 174 bits (440), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 76/137 (55%), Positives = 105/137 (76%)
Query: 180 EISEPSVLLSEGMRSVLYASLPPLVHGRKWLLLYSTWRHGVTLSTLYRRSMLSPGSCLLV 239
E++E SV ++ + L+ASLP +V G KW+LLYST +HG++L TL RRS PG CLLV
Sbjct: 127 ELTESSVFITANLFEFLHASLPNIVRGCKWILLYSTLKHGISLRTLLRRSGELPGPCLLV 186
Query: 240 VGDQRGAVFGSLVEAPMRPSNRRKYQGTNNTYVFTNISGHPVIYRPTGVNRYFTLCNTDY 299
GD++GAVFG+L+E P++P+ +RKYQGT+ T++FT I G P I+RPTG NRY+ +C ++
Sbjct: 187 AGDKQGAVFGALLECPLQPTPKRKYQGTSQTFLFTTIYGEPRIFRPTGANRYYLMCMNEF 246
Query: 300 IAIGGGGHFALYLDGDL 316
+A GGGG+FAL LD DL
Sbjct: 247 LAFGGGGNFALCLDEDL 263
>AT5G06260.1 | Symbols: | TLD-domain containing nucleolar protein |
chr5:1902755-1904835 REVERSE LENGTH=424
Length = 424
Score = 69.3 bits (168), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 57/207 (27%), Positives = 93/207 (44%), Gaps = 20/207 (9%)
Query: 162 STVEPVKESVHRPLVDLPEISEPSVLLSEGMRSVLYASLP--PLVHGRKWLLLYSTWRHG 219
STV P + H D +S +LL + + +LP LV +W LLY + HG
Sbjct: 194 STVRPGYQVPHLLYED--SVSSDRLLLKKEYAWHIGGALPHHELV---EWKLLYHSSVHG 248
Query: 220 VTLST-LYRRSMLSPGSCLLVVGDQRGAVFGSLVEAPMRPSNRRKYQGTNNTYVFTNISG 278
+ +T L S + +L++ D G V+G P + G +++F ++
Sbjct: 249 QSFNTFLGHTSNTGMSASVLIIKDTEGYVYGGYASQPWE--RYSDFYGDMKSFLF-QLNP 305
Query: 279 HPVIYRPTGVNRYFTLCNTDYIA------IGGGG---HFALYLDGDLLNGSSSVSETYGN 329
IYRPTG N C T++ + IG GG HF L++ G + T+G+
Sbjct: 306 KAAIYRPTGANTNIQWCATNFTSENIPNGIGFGGKINHFGLFISASFDQGQTFECTTFGS 365
Query: 330 PCLANSQEFEVKEVELWGFVQTSKYEE 356
P L+ + + + +E WG VQ S ++
Sbjct: 366 PSLSKTSRIQPEVIECWGIVQASNEQD 392