Miyakogusa Predicted Gene
- Lj0g3v0018539.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0018539.1 Non Characterized Hit- tr|I1MZZ5|I1MZZ5_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.10393
PE,80.17,0,UNCHARACTERIZED,Protein of unknown function DUF821,
CAP10-like; KDEL (LYS-ASP-GLU-LEU) CONTAINING - ,CUFF.1048.1
(469 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                      Score      E
Sequences producing significant alignments:                          (bits)  Value
AT1G07220.1 | Symbols: | Arabidopsis thaliana protein of unknow...      637    0.0
AT5G23850.1 | Symbols: | Arabidopsis thaliana protein of unknow...      382  e-106
AT3G48980.1 | Symbols: | Arabidopsis thaliana protein of unknow...      367  e-102
AT1G63420.1 | Symbols: | Arabidopsis thaliana protein of unknow...      352  3e-97
AT2G45830.1 | Symbols: DTA2 | downstream target of AGL15 2 | chr...     348  5e-96
AT3G61270.1 | Symbols: | Arabidopsis thaliana protein of unknow...      344  7e-95
AT2G45840.1 | Symbols: | Arabidopsis thaliana protein of unknow...      324  1e-88
AT3G61280.1 | Symbols: | Arabidopsis thaliana protein of unknow...      323  2e-88
AT2G45830.2 | Symbols: DTA2 | downstream target of AGL15 2 | chr...     320  1e-87
AT3G61290.1 | Symbols: | Arabidopsis thaliana protein of unknow...      306  2e-83
AT3G61280.2 | Symbols: | Arabidopsis thaliana protein of unknow...      204  9e-53
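The summary rows above can be read programmatically. The following is a minimal Python sketch, not part of the BLAST output itself; the function name parse_hit_row is hypothetical, and it assumes each row ends with a bit score and an E-value separated by whitespace. Note that this BLAST version prints some E-values without a mantissa (for example e-106), so the sketch prepends "1" before converting.

```python
def parse_hit_row(row):
    """Split one summary row into (identifier, bit score, E-value).

    Hypothetical helper: assumes the last two whitespace-separated
    fields are the bit score and the E-value, as in the rows above.
    """
    description, score, evalue = row.rsplit(None, 2)
    identifier = description.split("|", 1)[0].strip()
    # E-values such as "e-106" lack a leading mantissa; read them as 1e-106.
    if evalue.startswith("e"):
        evalue = "1" + evalue
    return identifier, int(score), float(evalue)


# Three rows copied from the summary table above.
rows = [
    "AT1G07220.1 | Symbols: | Arabidopsis thaliana protein of unknow... 637 0.0",
    "AT5G23850.1 | Symbols: | Arabidopsis thaliana protein of unknow... 382 e-106",
    "AT1G63420.1 | Symbols: | Arabidopsis thaliana protein of unknow... 352 3e-97",
]
hits = [parse_hit_row(r) for r in rows]
for identifier, score, evalue in hits:
    print(identifier, score, evalue)
```

Splitting from the right keeps the sketch independent of how the description is truncated or how many "|" separators it contains.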
>AT1G07220.1 | Symbols: | Arabidopsis thaliana protein of unknown
function (DUF821) | chr1:2217073-2219379 REVERSE
LENGTH=507
Length = 507
Score = 637 bits (1644), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 293/454 (64%), Positives = 356/454 (78%), Gaps = 12/454 (2%)
Query: 6 KHYPRSPTYLYPSVVALSLFSITVLLLYKVDDVVSRTGTVVGHNLEPTPWHVFPHKTFDE 65
K PRSP+YL V+ALS FS T LL YKVDD +++T T+ GHNLEPTPWH+FP K+F
Sbjct: 12 KSSPRSPSYLLLCVLALSFFSFTALLFYKVDDFIAQTKTLAGHNLEPTPWHIFPRKSFSA 71
Query: 66 ESRHGRAYKIIQCSYLTCRYSSPDGDXXXXXXXXXXXSGGG------DCPDFFRAIRKDL 119
++H +AY+I+QCSY +C Y + SG G CPDFFR I +DL
Sbjct: 72 ATKHSQAYRILQCSYFSCPYKA-----VVQPKSLHSESGSGRQTHQPQCPDFFRWIHRDL 126
Query: 120 EPWAATRISRAHVEEAQKYAAFRVVIVGGKMFVDWYYACVQSRAMFTVWGILQLLRKYPG 179
EPWA T +++ HV+ A+ AAFRVVI+ GK++VD YYACVQSR MFT+WGILQLL KYPG
Sbjct: 127 EPWAKTGVTKEHVKRAKANAAFRVVILSGKLYVDLYYACVQSRMMFTIWGILQLLTKYPG 186
Query: 180 LVPDVDLMFDCMDKPTINRTEHLSMPLPLFRYCTTKEHFDIPFPDWSFWGWSEINIRPWQ 239
+VPDVD+MFDCMDKP IN+TE+ S P+PLFRYCT + H DIPFPDWSFWGWSE N+RPW+
Sbjct: 187 MVPDVDMMFDCMDKPIINQTEYQSFPVPLFRYCTNEAHLDIPFPDWSFWGWSETNLRPWE 246
Query: 240 EEFPDIKRGSQAVSWKNKMAWAYWRGNPDVASPIRVELLNCNDTRMWGAQIMRQDWGEAA 299
EEF DIK+GS+ SW NK AYW+GNPDV SPIR+EL+ CN +R+WGAQIMRQDW E A
Sbjct: 247 EEFGDIKQGSRRRSWYNKQPRAYWKGNPDVVSPIRLELMKCNHSRLWGAQIMRQDWAEEA 306
Query: 300 RSGFKESKLSKQCNHRYKIYAEGYAWSVSLKYILSCGSVTLIISPSQYEDFFTRGLIPRQ 359
+ GF++SKLS QCNHRYKIYAEGYAWSVSLKYILSCGS+TLIISP +YEDFF+RGL+P++
Sbjct: 307 KGGFEQSKLSNQCNHRYKIYAEGYAWSVSLKYILSCGSMTLIISP-EYEDFFSRGLLPKE 365
Query: 360 NSWPVDPLNLCPSIKNAVDWGNQHPREAEAIGKRGQDFMESLSMDRIYDYMLHLISEYAK 419
N WP+ P +LC SIK AVDWGN +P EAE IGKRGQ +MESLSM+R+YDYM HLI+EY+K
Sbjct: 366 NYWPISPTDLCRSIKYAVDWGNSNPSEAETIGKRGQGYMESLSMNRVYDYMFHLITEYSK 425
Query: 420 LQDFKPSPPPTALEVCSESVLCFADDKQRMFLSK 453
LQ FKP P +A EVC+ S+LC A+ K+R L +
Sbjct: 426 LQKFKPEKPASANEVCAGSLLCIAEQKERELLER 459
>AT5G23850.1 | Symbols: | Arabidopsis thaliana protein of unknown
function (DUF821) | chr5:8038126-8040741 FORWARD
LENGTH=542
Length = 542
Score = 382 bits (981), Expect = e-106, Method: Compositional matrix adjust.
Identities = 178/353 (50%), Positives = 246/353 (69%), Gaps = 9/353 (2%)
Query: 108 CPDFFRAIRKDLEPWAATRISRAHVEEAQKYAAFRVVIVGGKMFVDWYYACVQSRAMFTV 167
CPD+FR I +DL PW+ T I+R +E A+K A FR+ IVGGK++V+ + Q+R +FT+
Sbjct: 139 CPDYFRWIHEDLRPWSRTGITREALERAKKTATFRLAIVGGKIYVEKFQDAFQTRDVFTI 198
Query: 168 WGILQLLRKYPGLVPDVDLMFDCMDKPTINRTE----HLSMPLPLFRYCTTKEHFDIPFP 223
WG LQLLRKYPG +PD++LMFDC+D P + TE + P PLFRYC +E DI FP
Sbjct: 199 WGFLQLLRKYPGKIPDLELMFDCVDWPVVRATEFAGANAPSPPPLFRYCGNEETLDIVFP 258
Query: 224 DWSFWGWSEINIRPWQEEFPDIKRGSQAVSWKNKMAWAYWRGNPDVASPIRVELLNCN-- 281
DWSFWGW+E+NI+PW+ +++ G++ W N+ +AYW+GNP VA R +L+ CN
Sbjct: 259 DWSFWGWAEVNIKPWESLLKELREGNERTKWINREPYAYWKGNPMVAE-TRQDLMKCNVS 317
Query: 282 DTRMWGAQIMRQDWGEAARSGFKESKLSKQCNHRYKIYAEGYAWSVSLKYILSCGSVTLI 341
+ W A++ QDW + ++ G+K+S L+ QC+HRYKIY EG AWSVS KYIL+C SVTL+
Sbjct: 318 EEHEWNARLYAQDWIKESKEGYKQSDLASQCHHRYKIYIEGSAWSVSEKYILACDSVTLL 377
Query: 342 ISPSQYEDFFTRGLIPRQNSWPVDPLNLCPSIKNAVDWGNQHPREAEAIGKRGQDFMES- 400
+ P Y DFFTRGL+P + WPV + C SIK AVDWGN H ++A+ IGK DF++
Sbjct: 378 VKP-HYYDFFTRGLLPAHHYWPVREHDKCRSIKFAVDWGNSHIQKAQDIGKAASDFIQQD 436
Query: 401 LSMDRIYDYMLHLISEYAKLQDFKPSPPPTALEVCSESVLCFADDKQRMFLSK 453
L MD +YDYM HL++EY+KL FKP P A+E+CSE++ C +R F+++
Sbjct: 437 LKMDYVYDYMYHLLTEYSKLLQFKPEIPRNAVEICSETMACLRSGNERKFMTE 489
>AT3G48980.1 | Symbols: | Arabidopsis thaliana protein of unknown
function (DUF821) | chr3:18155416-18158222 FORWARD
LENGTH=539
Length = 539
Score = 367 bits (943), Expect = e-102, Method: Compositional matrix adjust.
Identities = 169/356 (47%), Positives = 241/356 (67%), Gaps = 9/356 (2%)
Query: 103 SGGGDCPDFFRAIRKDLEPWAATRISRAHVEEAQKYAAFRVVIVGGKMFVDWYYACVQSR 162
S CPD+FR I +DL PW T I+R +E A A FR+ I+ G+++V+ + Q+R
Sbjct: 131 SPSATCPDYFRWIHEDLRPWEKTGITREALERANATAIFRLAIINGRIYVEKFREAFQTR 190
Query: 163 AMFTVWGILQLLRKYPGLVPDVDLMFDCMDKPTINRTEHLSM----PLPLFRYCTTKEHF 218
+FT+WG +QLLR+YPG +PD++LMFDC+D P + E + P PLFRYC E
Sbjct: 191 DVFTIWGFVQLLRRYPGKIPDLELMFDCVDWPVVKAAEFAGVDQPPPPPLFRYCANDETL 250
Query: 219 DIPFPDWSFWGWSEINIRPWQEEFPDIKRGSQAVSWKNKMAWAYWRGNPDVASPIRVELL 278
DI FPDWS+WGW+E+NI+PW+ +++ G+Q W ++ +AYW+GNP VA R++L+
Sbjct: 251 DIVFPDWSYWGWAEVNIKPWESLLKELREGNQRTKWIDREPYAYWKGNPTVAE-TRLDLM 309
Query: 279 NCNDTRM--WGAQIMRQDWGEAARSGFKESKLSKQCNHRYKIYAEGYAWSVSLKYILSCG 336
CN + + W A++ +QDW + ++ G+K+S L+ QC+HRYKIY EG AWSVS KYIL+C
Sbjct: 310 KCNLSEVYDWKARLYKQDWVKESKEGYKQSDLASQCHHRYKIYIEGSAWSVSEKYILACD 369
Query: 337 SVTLIISPSQYEDFFTRGLIPRQNSWPVDPLNLCPSIKNAVDWGNQHPREAEAIGKRGQD 396
SVTL++ P Y DFFTRG+ P + WPV + C SIK AVDWGN H R+A+ IGK+ +
Sbjct: 370 SVTLMVKP-HYYDFFTRGMFPGHHYWPVKEDDKCRSIKFAVDWGNLHMRKAQDIGKKASE 428
Query: 397 FM-ESLSMDRIYDYMLHLISEYAKLQDFKPSPPPTALEVCSESVLCFADDKQRMFL 451
F+ + L MD +YDYM HL+ +Y+KL FKP P + E+CSE++ C D +R F+
Sbjct: 429 FVQQELKMDYVYDYMFHLLIQYSKLLRFKPEIPQNSTELCSEAMACPRDGNERKFM 484
>AT1G63420.1 | Symbols: | Arabidopsis thaliana protein of unknown
function (DUF821) | chr1:23515874-23518777 FORWARD
LENGTH=578
Length = 578
Score = 352 bits (904), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 174/345 (50%), Positives = 236/345 (68%), Gaps = 13/345 (3%)
Query: 108 CPDFFRAIRKDLEPWAATRISRAHVEEAQKYAAFRVVIVGGKMFVDWYYACVQSRAMFTV 167
CPD+F+ I +DL+PW T I++ VE + A FR+VI+ GK+FV+ Y +Q+R FT+
Sbjct: 170 CPDYFKWIHEDLKPWRETGITKEMVERGKTTAHFRLVILNGKVFVENYKKSIQTRDAFTL 229
Query: 168 WGILQLLRKYPGLVPDVDLMFDCMDKPTI--------NRTEHLSMPLPLFRYCTTKEHFD 219
WGILQLLRKYPG +PDVDLMFDC D+P I NRT + P PLFRYC + D
Sbjct: 230 WGILQLLRKYPGKLPDVDLMFDCDDRPVIRSDGYNILNRTVE-NAPPPLFRYCGDRWTVD 288
Query: 220 IPFPDWSFWGWSEINIRPWQEEFPDIKRGSQAVSWKNKMAWAYWRGNPDVASPIRVELLN 279
I FPDWSFWGW EINIR W + +++ G + + + A+AYW+GNP VASP R +LL
Sbjct: 289 IVFPDWSFWGWQEINIREWSKVLKEMEEGKKKKKFMERDAYAYWKGNPFVASPSREDLLT 348
Query: 280 CNDTRM--WGAQIMRQDWGEAARSGFKESKLSKQCNHRYKIYAEGYAWSVSLKYILSCGS 337
CN + + W A+I QDW + GF+ S ++ QC +RYKIY EGYAWSVS KYIL+C S
Sbjct: 349 CNLSSLHDWNARIFIQDWISEGQRGFENSNVANQCTYRYKIYIEGYAWSVSEKYILACDS 408
Query: 338 VTLIISPSQYEDFFTRGLIPRQNSWPVDPLNLCPSIKNAVDWGNQHPREAEAIGKRGQDF 397
VTL++ P Y DFF+R L P Q+ WP+ + C SIK AVDW N H ++A+ IG+ +F
Sbjct: 409 VTLMVKPYYY-DFFSRTLQPLQHYWPIRDKDKCRSIKFAVDWLNNHTQKAQEIGREASEF 467
Query: 398 ME-SLSMDRIYDYMLHLISEYAKLQDFKPSPPPTALEVCSESVLC 441
M+ LSM+ +YDYM HL++EY+KL +KP P ++E+C+E+++C
Sbjct: 468 MQRDLSMENVYDYMFHLLNEYSKLLKYKPQVPKNSVELCTEALVC 512
>AT2G45830.1 | Symbols: DTA2 | downstream target of AGL15 2 |
chr2:18866324-18868344 FORWARD LENGTH=523
Length = 523
Score = 348 bits (893), Expect = 5e-96, Method: Compositional matrix adjust.
Identities = 167/353 (47%), Positives = 235/353 (66%), Gaps = 9/353 (2%)
Query: 108 CPDFFRAIRKDLEPWAATRISRAHVEEAQKYAAFRVVIVGGKMFVDWYYACVQSRAMFTV 167
CP +FR I +DL PW T ++R +E+A++ A FRVVI+ G+++V Y +Q+R +FT+
Sbjct: 119 CPSYFRWIHEDLRPWKETGVTRGMLEKARRTAHFRVVILDGRVYVKKYRKSIQTRDVFTL 178
Query: 168 WGILQLLRKYPGLVPDVDLMFDCMDKPTIN----RTEHLSMPLPLFRYCTTKEHFDIPFP 223
WGI+QLLR YPG +PD++LMFD D+PT+ + + P PLFRYC+ DI FP
Sbjct: 179 WGIVQLLRWYPGRLPDLELMFDPDDRPTVRSKDFQGQQHPAPPPLFRYCSDDASLDIVFP 238
Query: 224 DWSFWGWSEINIRPWQEEFPDIKRGSQAVSWKNKMAWAYWRGNPDVASPIRVELLNCNDT 283
DWSFWGW+E+NI+PW + I+ G++ WK+++A+AYWRGNP+VA P R +LL CN +
Sbjct: 239 DWSFWGWAEVNIKPWDKSLVAIEEGNKMTQWKDRVAYAYWRGNPNVA-PTRRDLLRCNVS 297
Query: 284 RM--WGAQIMRQDWGEAARSGFKESKLSKQCNHRYKIYAEGYAWSVSLKYILSCGSVTLI 341
W ++ QDW +R GFK S L QC HRYKIY EG+AWSVS KYI++C S+TL
Sbjct: 298 AQEDWNTRLYIQDWDRESREGFKNSNLENQCTHRYKIYIEGWAWSVSEKYIMACDSMTLY 357
Query: 342 ISPSQYEDFFTRGLIPRQNSWPVDPLNLCPSIKNAVDWGNQHPREAEAIGKRGQDFM-ES 400
+ P Y DF+ RG++P Q+ WP+ + C S+K AV WGN H +A IG+ G F+ E
Sbjct: 358 VRPMFY-DFYVRGMMPLQHYWPIRDTSKCTSLKFAVHWGNTHLDQASKIGEEGSRFIREE 416
Query: 401 LSMDRIYDYMLHLISEYAKLQDFKPSPPPTALEVCSESVLCFADDKQRMFLSK 453
+ M+ +YDYM HL++EYAKL FKP P A E+ + + C A + R F+ +
Sbjct: 417 VKMEYVYDYMFHLMNEYAKLLKFKPEIPWGATEITPDIMGCSATGRWRDFMEE 469
>AT3G61270.1 | Symbols: | Arabidopsis thaliana protein of unknown
function (DUF821) | chr3:22678270-22680212 FORWARD
LENGTH=498
Length = 498
Score = 344 bits (883), Expect = 7e-95, Method: Compositional matrix adjust.
Identities = 179/457 (39%), Positives = 261/457 (57%), Gaps = 23/457 (5%)
Query: 1 MGHSSKHYPRSPTYLYPSVVALSLFSITVLLLYKVDDVVSRTGTVVGHNLEPTPWHVFPH 60
M ++ + ++ T+ S+V ++F + + + + D++ ++ F
Sbjct: 3 MNNNDGQHNKTVTFPRKSIVKATVFIVVLFISAAILDLLGYLD-----------FNAFAG 51
Query: 61 KTFDEESRHGRAYKIIQCSYLTCRYS-SPDGDXXXXXXXXXXXSGGGDCPDFFRAIRKDL 119
+++ Y C ++ + S +P S CP +FR I +DL
Sbjct: 52 LKLTTKTKEPNPYG---CDFVQNQSSQTPISQNRKSRLNPNNSSKSSTCPSYFRWIHEDL 108
Query: 120 EPWAATRISRAHVEEAQKYAAFRVVIVGGKMFVDWYYACVQSRAMFTVWGILQLLRKYPG 179
PW T I+R +EEA + A FR+VI GK +V Y +Q+R FT+WGILQLLR YPG
Sbjct: 109 RPWKQTGITRGMIEEASRTAHFRLVIRNGKAYVKRYKKSIQTRDEFTLWGILQLLRWYPG 168
Query: 180 LVPDVDLMFDCMDKPTINRTEHLSM---PLPLFRYCTTKEHFDIPFPDWSFWGWSEINIR 236
+PD++LMFD D+P + + + P P+FRYC+ DI FPDWSFWGW+E+N++
Sbjct: 169 KLPDLELMFDADDRPVVRSVDFIGQQKEPPPVFRYCSDDASLDIVFPDWSFWGWAEVNVK 228
Query: 237 PWQEEFPDIKRGSQAVSWKNKMAWAYWRGNPDVASPIRVELLNCNDTRM--WGAQIMRQD 294
PW + IK G+ WK+++A+AYWRGNP V P R +LL CN T W ++ QD
Sbjct: 229 PWGKSLEAIKEGNSMTQWKDRVAYAYWRGNPYV-DPGRGDLLKCNATEHEEWNTRLYIQD 287
Query: 295 WGEAARSGFKESKLSKQCNHRYKIYAEGYAWSVSLKYILSCGSVTLIISPSQYEDFFTRG 354
W + + GFK S L QC HRYKIY EG+AWSVS KYI++C S+TL + P Y DF+ RG
Sbjct: 288 WDKETKEGFKNSNLENQCTHRYKIYIEGWAWSVSEKYIMACDSMTLYVKPRFY-DFYIRG 346
Query: 355 LIPRQNSWPVDPLNLCPSIKNAVDWGNQHPREAEAIGKRGQDFM-ESLSMDRIYDYMLHL 413
++P Q+ WP+ + C S+K AV WGN H +A IG+ G F+ E ++M +YDYM HL
Sbjct: 347 MMPLQHYWPIRDDSKCTSLKFAVHWGNTHEDKAREIGEVGSRFIREEVNMQYVYDYMFHL 406
Query: 414 ISEYAKLQDFKPSPPPTALEVCSESVLCFADDKQRMF 450
+ EYA L FKP P A E+ +S+ C A ++ R F
Sbjct: 407 LKEYATLLKFKPEIPLDAEEITPDSMGCPATERWRDF 443
>AT2G45840.1 | Symbols: | Arabidopsis thaliana protein of unknown
function (DUF821) | chr2:18869286-18871487 FORWARD
LENGTH=523
Length = 523
Score = 324 bits (830), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 161/343 (46%), Positives = 215/343 (62%), Gaps = 14/343 (4%)
Query: 108 CPDFFRAIRKDLEPWAATRISRAHVEEAQKYAAFRVVIVGGKMFVDWYYACVQSRAMFTV 167
CPD+FR I KDLE W T I+R +E A A FR++I GG+++V Y Q+R +FT+
Sbjct: 119 CPDYFRWIHKDLEAWRETGITRETLERASDKAHFRLIIKGGRVYVHQYKKSFQTRDVFTI 178
Query: 168 WGILQLLRKYPGLVPDVDLMFDCMDKPTINRTEHLSM--------PLPLFRYCTTKEHFD 219
WGI+QLLR YPG VPD++L+F C D P I R ++ P PLF YC FD
Sbjct: 179 WGIVQLLRMYPGQVPDLELLFMCHDSPEIWRRDYRPRPGVNVTWPPPPLFHYCGHSGAFD 238
Query: 220 IPFPDWSFWGWSEINIRPWQEEFPDIKRGSQAVSWKNKMAWAYWRGNPDVASPIRVELLN 279
I FPDWSFWGW EINI+ W ++ I G + V W+ + +AYW+GNP VA +R +L++
Sbjct: 239 IVFPDWSFWGWPEINIKEWNKQSELISEGIKKVKWEEREPYAYWKGNPGVAM-VRRDLMH 297
Query: 280 CNDTRMWGAQIMRQDWGEAARSGFKESKLSKQCNHRYKIYAEGYAWSVSLKYILSCGSVT 339
C+D + + RQDW R G++ S L QC HRYKIY EG AWSVS KYIL+C S+T
Sbjct: 298 CHDPMV---HLYRQDWSREGRIGYRTSNLEDQCTHRYKIYVEGRAWSVSEKYILACDSMT 354
Query: 340 LIISPSQYEDFFTRGLIPRQNSWPVDPLNLCPSIKNAVDWGNQHPREAEAIGKRGQDFM- 398
L++ P Y DFFTR L+P ++ WP+ P C I AV WGN + ++A AIG+ G ++
Sbjct: 355 LLVKPF-YFDFFTRSLVPMEHYWPIRPQEKCSDIVFAVHWGNNNTKKARAIGRNGSGYVR 413
Query: 399 ESLSMDRIYDYMLHLISEYAKLQDFKPSPPPTALEVCSESVLC 441
++L M +YDYMLHL+ Y KL P A EVC E++ C
Sbjct: 414 KNLKMKYVYDYMLHLLQSYGKLMKMNVEVPQGAKEVCPETMAC 456
>AT3G61280.1 | Symbols: | Arabidopsis thaliana protein of unknown
function (DUF821) | chr3:22681145-22683589 FORWARD
LENGTH=536
Length = 536
Score = 323 bits (828), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 157/356 (44%), Positives = 229/356 (64%), Gaps = 11/356 (3%)
Query: 103 SGGGDCPDFFRAIRKDLEPWAATRISRAHVEEAQKYAAFRVVIVGGKMFVDWYYACVQSR 162
S CPD+F+ I +DL+ W T I+R +E A+ A FR+VI G+++V Y Q+R
Sbjct: 126 SSSETCPDYFKWIHRDLKVWQKTGITRETLERARPNAHFRIVIKSGRLYVHQYEKAFQTR 185
Query: 163 AMFTVWGILQLLRKYPGLVPDVDLMFDCMDKPTINRTEHLSM------PLPLFRYCTTKE 216
+FT+WGILQLLR YPG +PD++L+F C D+P I + + P PLF YC ++
Sbjct: 186 DVFTIWGILQLLRMYPGQIPDLELLFLCHDRPAIWKRDLKKKRKDTWPPPPLFHYCGHRD 245
Query: 217 HFDIPFPDWSFWGWSEINIRPWQEEFPDIKRGSQAVSWKNKMAWAYWRGNPDVASPIRVE 276
+DI FPDWSFWGW E+NI+ W + +K G++ V W++++ +AYW+GNP V SPIR +
Sbjct: 246 AYDIVFPDWSFWGWPELNIKEWNKLSVALKEGNKKVKWEDRVPYAYWKGNPHV-SPIRGD 304
Query: 277 LLNCNDTRMWG--AQIMRQDWGEAARSGFKESKLSKQCNHRYKIYAEGYAWSVSLKYILS 334
L+ CN + + ++ QDW +GF+ S L QC HRYKIY EG AWSVS KYILS
Sbjct: 305 LMRCNFSDKYDPMVRLYVQDWRSEIEAGFRGSNLEDQCTHRYKIYIEGNAWSVSEKYILS 364
Query: 335 CGSVTLIISPSQYEDFFTRGLIPRQNSWPVDPLNLCPSIKNAVDWGNQHPREAEAIGKRG 394
C S+TL++ P +Y DFF R ++P ++ WP+ N C +K AV+WGN + +A+ IG++G
Sbjct: 365 CDSMTLLVKP-EYYDFFFRSMVPMKHFWPIRQNNKCGDLKFAVEWGNNNTEKAQIIGRQG 423
Query: 395 QDF-MESLSMDRIYDYMLHLISEYAKLQDFKPSPPPTALEVCSESVLCFADDKQRM 449
++ M++L M +YDYML+++ Y KL + P A EVCSE++ C D R+
Sbjct: 424 SEYMMKNLKMKYVYDYMLYVLQGYGKLMKLDVTVPENATEVCSETMACSITDGGRI 479
>AT2G45830.2 | Symbols: DTA2 | downstream target of AGL15 2 |
chr2:18866850-18868344 FORWARD LENGTH=382
Length = 382
Score = 320 bits (821), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 156/329 (47%), Positives = 220/329 (66%), Gaps = 9/329 (2%)
Query: 132 VEEAQKYAAFRVVIVGGKMFVDWYYACVQSRAMFTVWGILQLLRKYPGLVPDVDLMFDCM 191
+E+A++ A FRVVI+ G+++V Y +Q+R +FT+WGI+QLLR YPG +PD++LMFD
Sbjct: 2 LEKARRTAHFRVVILDGRVYVKKYRKSIQTRDVFTLWGIVQLLRWYPGRLPDLELMFDPD 61
Query: 192 DKPTIN----RTEHLSMPLPLFRYCTTKEHFDIPFPDWSFWGWSEINIRPWQEEFPDIKR 247
D+PT+ + + P PLFRYC+ DI FPDWSFWGW+E+NI+PW + I+
Sbjct: 62 DRPTVRSKDFQGQQHPAPPPLFRYCSDDASLDIVFPDWSFWGWAEVNIKPWDKSLVAIEE 121
Query: 248 GSQAVSWKNKMAWAYWRGNPDVASPIRVELLNCNDTRM--WGAQIMRQDWGEAARSGFKE 305
G++ WK+++A+AYWRGNP+VA P R +LL CN + W ++ QDW +R GFK
Sbjct: 122 GNKMTQWKDRVAYAYWRGNPNVA-PTRRDLLRCNVSAQEDWNTRLYIQDWDRESREGFKN 180
Query: 306 SKLSKQCNHRYKIYAEGYAWSVSLKYILSCGSVTLIISPSQYEDFFTRGLIPRQNSWPVD 365
S L QC HRYKIY EG+AWSVS KYI++C S+TL + P Y DF+ RG++P Q+ WP+
Sbjct: 181 SNLENQCTHRYKIYIEGWAWSVSEKYIMACDSMTLYVRPMFY-DFYVRGMMPLQHYWPIR 239
Query: 366 PLNLCPSIKNAVDWGNQHPREAEAIGKRGQDFM-ESLSMDRIYDYMLHLISEYAKLQDFK 424
+ C S+K AV WGN H +A IG+ G F+ E + M+ +YDYM HL++EYAKL FK
Sbjct: 240 DTSKCTSLKFAVHWGNTHLDQASKIGEEGSRFIREEVKMEYVYDYMFHLMNEYAKLLKFK 299
Query: 425 PSPPPTALEVCSESVLCFADDKQRMFLSK 453
P P A E+ + + C A + R F+ +
Sbjct: 300 PEIPWGATEITPDIMGCSATGRWRDFMEE 328
>AT3G61290.1 | Symbols: | Arabidopsis thaliana protein of unknown
function (DUF821) | chr3:22684407-22686772 FORWARD
LENGTH=455
Length = 455
Score = 306 bits (785), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 153/348 (43%), Positives = 219/348 (62%), Gaps = 12/348 (3%)
Query: 108 CPDFFRAIRKDLEPWAATRISRAHVEEAQKYAAFRVVIVGGKMFVDWYYACVQSRAMFTV 167
CPD+FR I++DL+ W T I+R +E A+ A FR+VI G+++V Y +SR + T+
Sbjct: 44 CPDYFRWIQQDLKVWEETGITRETLERAKPKAHFRLVIKSGRLYVHQYDKAYESRDVLTI 103
Query: 168 WGILQLLRKYPGLVPDVDLMFDCMDKPTINRTEHLS-------MPLPLFRYCTTKEHFDI 220
WGILQLLR YPG VPD++L+F C D P I + + P PLF+YC +E + I
Sbjct: 104 WGILQLLRMYPGQVPDLELLFFCHDIPAIWKRDFRQPEPNATWPPPPLFQYCGHREAYGI 163
Query: 221 PFPDWSFWGWSEINIRPWQEEFPDIKRGSQAVSWKNKMAWAYWRGNPDVASPIRVELLNC 280
FPDWSFWGW E+NI+ W + I+ ++ V W +++ +AYW+GN V R L+ C
Sbjct: 164 VFPDWSFWGWPEVNIKEWTKLSVAIREANKRVKWNDRVPYAYWKGNSGVHRE-RGNLMKC 222
Query: 281 NDTRMWG--AQIMRQDWGEAARSGFKESKLSKQCNHRYKIYAEGYAWSVSLKYILSCGSV 338
N + + ++ QDWG+ GFK S L QC HRYKIY EG AWSVS KYIL+C S+
Sbjct: 223 NFSDKYDPMVRLYEQDWGKEREIGFKSSNLEDQCTHRYKIYIEGRAWSVSKKYILACDSM 282
Query: 339 TLIISPSQYEDFFTRGLIPRQNSWPVDPLNLCPSIKNAVDWGNQHPREAEAIGKRGQDF- 397
TL+I ++Y DFF R L+P ++ WP+ C +K AV+WGN + ++A+ IG++G D+
Sbjct: 283 TLLIK-AEYFDFFGRSLVPLEHYWPIKSHEKCGDLKFAVEWGNNNTKKAQVIGRQGSDYI 341
Query: 398 MESLSMDRIYDYMLHLISEYAKLQDFKPSPPPTALEVCSESVLCFADD 445
M++L M +YDYML+++ Y KL + P A EVCSE++ C D
Sbjct: 342 MKNLEMKYVYDYMLYVLQGYGKLMKLDVTVPENATEVCSETMACPITD 389
>AT3G61280.2 | Symbols: | Arabidopsis thaliana protein of unknown
function (DUF821) | chr3:22681145-22682869 FORWARD
LENGTH=378
Length = 378
Score = 204 bits (520), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 97/242 (40%), Positives = 153/242 (63%), Gaps = 11/242 (4%)
Query: 103 SGGGDCPDFFRAIRKDLEPWAATRISRAHVEEAQKYAAFRVVIVGGKMFVDWYYACVQSR 162
S CPD+F+ I +DL+ W T I+R +E A+ A FR+VI G+++V Y Q+R
Sbjct: 126 SSSETCPDYFKWIHRDLKVWQKTGITRETLERARPNAHFRIVIKSGRLYVHQYEKAFQTR 185
Query: 163 AMFTVWGILQLLRKYPGLVPDVDLMFDCMDKPTINRTEHLSM------PLPLFRYCTTKE 216
+FT+WGILQLLR YPG +PD++L+F C D+P I + + P PLF YC ++
Sbjct: 186 DVFTIWGILQLLRMYPGQIPDLELLFLCHDRPAIWKRDLKKKRKDTWPPPPLFHYCGHRD 245
Query: 217 HFDIPFPDWSFWGWSEINIRPWQEEFPDIKRGSQAVSWKNKMAWAYWRGNPDVASPIRVE 276
+DI FPDWSFWGW E+NI+ W + +K G++ V W++++ +AYW+GNP V SPIR +
Sbjct: 246 AYDIVFPDWSFWGWPELNIKEWNKLSVALKEGNKKVKWEDRVPYAYWKGNPHV-SPIRGD 304
Query: 277 LLNCNDTRMWG--AQIMRQDWGEAARSGFKESKLSKQCNHRY--KIYAEGYAWSVSLKYI 332
L+ CN + + ++ QDW +GF+ S L QC HRY +I++ + + ++++++
Sbjct: 305 LMRCNFSDKYDPMVRLYVQDWRSEIEAGFRGSNLEDQCTHRYMCRIHSLDHVYLINIRFV 364
Query: 333 LS 334
+
Sbjct: 365 FN 366
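The Identities / Positives / Gaps lines in the hit records above can be checked the same way. This is a small Python sketch under stated assumptions: the regular expression and the name parse_stats are illustrative, not part of BLAST. In this report the printed percentages match integer division of the two counts (for example, 246/353 is shown as 69%), so the sketch compares against the truncated value rather than a rounded one.

```python
import re

# Matches entries such as "Identities = 293/454 (64%)".
STATS = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_stats(line):
    """Return {name: (count, alignment_length, printed_percent)} for one line."""
    return {
        name: (int(num), int(den), int(pct))
        for name, num, den, pct in STATS.findall(line)
    }


# Statistics line copied from the first hit (AT1G07220.1).
line = "Identities = 293/454 (64%), Positives = 356/454 (78%), Gaps = 12/454 (2%)"
stats = parse_stats(line)
for name, (num, den, pct) in stats.items():
    # Recompute the percentage by integer division and compare with the report.
    print(name, num, den, pct, num * 100 // den == pct)
```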