Miyakogusa Predicted Gene

Lj0g3v0273289.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj0g3v0273289.1 tr|G7KQ38|G7KQ38_MEDTR KTEL motif-containing
protein OS=Medicago truncatula GN=MTR_6g031170 PE=4
SV=,74.27,0,seg,NULL; coiled-coil,NULL; Putative
lipopolysaccharide-modifying
enzyme,Lipopolysaccharide-modifyin,CUFF.18076.1
         (515 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT2G45830.1 | Symbols: DTA2 | downstream target of AGL15 2 | chr...   605   e-173
AT5G23850.1 | Symbols:  | Arabidopsis thaliana protein of unknow...   582   e-166
AT3G61270.1 | Symbols:  | Arabidopsis thaliana protein of unknow...   580   e-166
AT3G48980.1 | Symbols:  | Arabidopsis thaliana protein of unknow...   572   e-163
AT2G45830.2 | Symbols: DTA2 | downstream target of AGL15 2 | chr...   559   e-159
AT1G63420.1 | Symbols:  | Arabidopsis thaliana protein of unknow...   527   e-150
AT3G61280.1 | Symbols:  | Arabidopsis thaliana protein of unknow...   496   e-140
AT2G45840.1 | Symbols:  | Arabidopsis thaliana protein of unknow...   477   e-135
AT3G61290.1 | Symbols:  | Arabidopsis thaliana protein of unknow...   468   e-132
AT1G07220.1 | Symbols:  | Arabidopsis thaliana protein of unknow...   390   e-108
AT3G61280.2 | Symbols:  | Arabidopsis thaliana protein of unknow...   295   7e-80

>AT2G45830.1 | Symbols: DTA2 | downstream target of AGL15 2 |
           chr2:18866324-18868344 FORWARD LENGTH=523
          Length = 523

 Score =  605 bits (1561), Expect = e-173,   Method: Compositional matrix adjust.
 Identities = 284/441 (64%), Positives = 344/441 (78%), Gaps = 6/441 (1%)

Query: 74  PQESQEFSLRCTKSKNNKET--KQTCPRDNYPTSHSPSNPQNLTCPSYFRWIHEDLKPWK 131
           P  SQ F  +C   +N  +   +    R+N     S S     TCPSYFRWIHEDL+PWK
Sbjct: 77  PITSQRFPNQCGVVQNQTQLFPQNGSSRNNDKPRSSHSRIS--TCPSYFRWIHEDLRPWK 134

Query: 132 EKGGITRQMLEGVRRTAHFRLVIVDGKVYVEKFRQSIQTRDVFTLWGILQLLRVYPGKLP 191
           E G +TR MLE  RRTAHFR+VI+DG+VYV+K+R+SIQTRDVFTLWGI+QLLR YPG+LP
Sbjct: 135 ETG-VTRGMLEKARRTAHFRVVILDGRVYVKKYRKSIQTRDVFTLWGIVQLLRWYPGRLP 193

Query: 192 DLELMFDCDDRPVIHLANFQGPK-AAPPPLFRYCSDQGSLDIVFPDWSFWGWAETNIRPW 250
           DLELMFD DDRP +   +FQG +  APPPLFRYCSD  SLDIVFPDWSFWGWAE NI+PW
Sbjct: 194 DLELMFDPDDRPTVRSKDFQGQQHPAPPPLFRYCSDDASLDIVFPDWSFWGWAEVNIKPW 253

Query: 251 REVLKDIKEGNKRTKWEDRVPYAYWKGNPHVARTRQNLLKCNVTPQNDWNTRLYIQDWSQ 310
            + L  I+EGNK T+W+DRV YAYW+GNP+VA TR++LL+CNV+ Q DWNTRLYIQDW +
Sbjct: 254 DKSLVAIEEGNKMTQWKDRVAYAYWRGNPNVAPTRRDLLRCNVSAQEDWNTRLYIQDWDR 313

Query: 311 ESNQGYKKSNVADQCTHRYKIYIEGWAWSVSEKYIMACNSMSLYVKSSYHDFFIRGMKPL 370
           ES +G+K SN+ +QCTHRYKIYIEGWAWSVSEKYIMAC+SM+LYV+  ++DF++RGM PL
Sbjct: 314 ESREGFKNSNLENQCTHRYKIYIEGWAWSVSEKYIMACDSMTLYVRPMFYDFYVRGMMPL 373

Query: 371 QHYWPIRDNSKCTSLKYAVEWGNNHTDKAQAIGEAASRFIQEDLDMDHVYDYMFHLLNEY 430
           QHYWPIRD SKCTSLK+AV WGN H D+A  IGE  SRFI+E++ M++VYDYMFHL+NEY
Sbjct: 374 QHYWPIRDTSKCTSLKFAVHWGNTHLDQASKIGEEGSRFIREEVKMEYVYDYMFHLMNEY 433

Query: 431 AKLLKFKPTVPPGAVEFCPETLACAVNGTQRRFMEESMVKFPSHSNPCTIPPPYDPSTFQ 490
           AKLLKFKP +P GA E  P+ + C+  G  R FMEESMV FPS  +PC +P P++P   +
Sbjct: 434 AKLLKFKPEIPWGATEITPDIMGCSATGRWRDFMEESMVMFPSEESPCEMPSPFNPHDLK 493

Query: 491 SFQEEKANATRQVEIWEDKYW 511
              E K N TRQVE WED+Y+
Sbjct: 494 EILERKTNLTRQVEWWEDQYF 514


>AT5G23850.1 | Symbols:  | Arabidopsis thaliana protein of unknown
           function (DUF821) | chr5:8038126-8040741 FORWARD
           LENGTH=542
          Length = 542

 Score =  582 bits (1499), Expect = e-166,   Method: Compositional matrix adjust.
 Identities = 266/443 (60%), Positives = 341/443 (76%), Gaps = 11/443 (2%)

Query: 79  EFSLRCTKSKNNKETKQTCPRDNYPTSHS-----PSNPQNLTCPSYFRWIHEDLKPWKEK 133
           EF+L C+ +    ET  +CP + YPT+ S      ++P   TCP YFRWIHEDL+PW  +
Sbjct: 101 EFTLHCSAN----ETTASCPSNKYPTTTSFEDDDTNHPPTATCPDYFRWIHEDLRPWS-R 155

Query: 134 GGITRQMLEGVRRTAHFRLVIVDGKVYVEKFRQSIQTRDVFTLWGILQLLRVYPGKLPDL 193
            GITR+ LE  ++TA FRL IV GK+YVEKF+ + QTRDVFT+WG LQLLR YPGK+PDL
Sbjct: 156 TGITREALERAKKTATFRLAIVGGKIYVEKFQDAFQTRDVFTIWGFLQLLRKYPGKIPDL 215

Query: 194 ELMFDCDDRPVIHLANFQGPKA-APPPLFRYCSDQGSLDIVFPDWSFWGWAETNIRPWRE 252
           ELMFDC D PV+    F G  A +PPPLFRYC ++ +LDIVFPDWSFWGWAE NI+PW  
Sbjct: 216 ELMFDCVDWPVVRATEFAGANAPSPPPLFRYCGNEETLDIVFPDWSFWGWAEVNIKPWES 275

Query: 253 VLKDIKEGNKRTKWEDRVPYAYWKGNPHVARTRQNLLKCNVTPQNDWNTRLYIQDWSQES 312
           +LK+++EGN+RTKW +R PYAYWKGNP VA TRQ+L+KCNV+ +++WN RLY QDW +ES
Sbjct: 276 LLKELREGNERTKWINREPYAYWKGNPMVAETRQDLMKCNVSEEHEWNARLYAQDWIKES 335

Query: 313 NQGYKKSNVADQCTHRYKIYIEGWAWSVSEKYIMACNSMSLYVKSSYHDFFIRGMKPLQH 372
            +GYK+S++A QC HRYKIYIEG AWSVSEKYI+AC+S++L VK  Y+DFF RG+ P  H
Sbjct: 336 KEGYKQSDLASQCHHRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGLLPAHH 395

Query: 373 YWPIRDNSKCTSLKYAVEWGNNHTDKAQAIGEAASRFIQEDLDMDHVYDYMFHLLNEYAK 432
           YWP+R++ KC S+K+AV+WGN+H  KAQ IG+AAS FIQ+DL MD+VYDYM+HLL EY+K
Sbjct: 396 YWPVREHDKCRSIKFAVDWGNSHIQKAQDIGKAASDFIQQDLKMDYVYDYMYHLLTEYSK 455

Query: 433 LLKFKPTVPPGAVEFCPETLACAVNGTQRRFMEESMVKFPSHSNPCTIPPPYDPSTFQSF 492
           LL+FKP +P  AVE C ET+AC  +G +R+FM ES+VK P+ S PC +PPPYDP+T+   
Sbjct: 456 LLQFKPEIPRNAVEICSETMACLRSGNERKFMTESLVKQPADSGPCAMPPPYDPATYYEV 515

Query: 493 QEEKANATRQVEIWEDKYWLKKN 515
            + K +   ++  WE KYW K+N
Sbjct: 516 VKRKQSTNMRILQWEMKYWSKQN 538


>AT3G61270.1 | Symbols:  | Arabidopsis thaliana protein of unknown
           function (DUF821) | chr3:22678270-22680212 FORWARD
           LENGTH=498
          Length = 498

 Score =  580 bits (1494), Expect = e-166,   Method: Compositional matrix adjust.
 Identities = 267/421 (63%), Positives = 328/421 (77%), Gaps = 2/421 (0%)

Query: 92  ETKQTCPRDNYPTSHSPSNP-QNLTCPSYFRWIHEDLKPWKEKGGITRQMLEGVRRTAHF 150
           ++ QT    N  +  +P+N  ++ TCPSYFRWIHEDL+PWK+ G ITR M+E   RTAHF
Sbjct: 72  QSSQTPISQNRKSRLNPNNSSKSSTCPSYFRWIHEDLRPWKQTG-ITRGMIEEASRTAHF 130

Query: 151 RLVIVDGKVYVEKFRQSIQTRDVFTLWGILQLLRVYPGKLPDLELMFDCDDRPVIHLANF 210
           RLVI +GK YV+++++SIQTRD FTLWGILQLLR YPGKLPDLELMFD DDRPV+   +F
Sbjct: 131 RLVIRNGKAYVKRYKKSIQTRDEFTLWGILQLLRWYPGKLPDLELMFDADDRPVVRSVDF 190

Query: 211 QGPKAAPPPLFRYCSDQGSLDIVFPDWSFWGWAETNIRPWREVLKDIKEGNKRTKWEDRV 270
            G +  PPP+FRYCSD  SLDIVFPDWSFWGWAE N++PW + L+ IKEGN  T+W+DRV
Sbjct: 191 IGQQKEPPPVFRYCSDDASLDIVFPDWSFWGWAEVNVKPWGKSLEAIKEGNSMTQWKDRV 250

Query: 271 PYAYWKGNPHVARTRQNLLKCNVTPQNDWNTRLYIQDWSQESNQGYKKSNVADQCTHRYK 330
            YAYW+GNP+V   R +LLKCN T   +WNTRLYIQDW +E+ +G+K SN+ +QCTHRYK
Sbjct: 251 AYAYWRGNPYVDPGRGDLLKCNATEHEEWNTRLYIQDWDKETKEGFKNSNLENQCTHRYK 310

Query: 331 IYIEGWAWSVSEKYIMACNSMSLYVKSSYHDFFIRGMKPLQHYWPIRDNSKCTSLKYAVE 390
           IYIEGWAWSVSEKYIMAC+SM+LYVK  ++DF+IRGM PLQHYWPIRD+SKCTSLK+AV 
Sbjct: 311 IYIEGWAWSVSEKYIMACDSMTLYVKPRFYDFYIRGMMPLQHYWPIRDDSKCTSLKFAVH 370

Query: 391 WGNNHTDKAQAIGEAASRFIQEDLDMDHVYDYMFHLLNEYAKLLKFKPTVPPGAVEFCPE 450
           WGN H DKA+ IGE  SRFI+E+++M +VYDYMFHLL EYA LLKFKP +P  A E  P+
Sbjct: 371 WGNTHEDKAREIGEVGSRFIREEVNMQYVYDYMFHLLKEYATLLKFKPEIPLDAEEITPD 430

Query: 451 TLACAVNGTQRRFMEESMVKFPSHSNPCTIPPPYDPSTFQSFQEEKANATRQVEIWEDKY 510
           ++ C      R F  ESM+  PS  +PC + PPYDP   +   E KAN TRQVE+WE++Y
Sbjct: 431 SMGCPATERWRDFKAESMIISPSEESPCEMLPPYDPLALKEVLERKANLTRQVELWENQY 490

Query: 511 W 511
           +
Sbjct: 491 F 491


>AT3G48980.1 | Symbols:  | Arabidopsis thaliana protein of unknown
           function (DUF821) | chr3:18155416-18158222 FORWARD
           LENGTH=539
          Length = 539

 Score =  572 bits (1474), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 261/449 (58%), Positives = 338/449 (75%), Gaps = 11/449 (2%)

Query: 75  QESQEFSLRCTKSKNNKETKQTCPRDNYPTSHSPSNPQ-------NLTCPSYFRWIHEDL 127
           ++ +EF+L C     N     TCP+DNYPTS   S  +       + TCP YFRWIHEDL
Sbjct: 90  EKPKEFTLNCAAFSGND--TGTCPKDNYPTSFRSSAGEGESDRSPSATCPDYFRWIHEDL 147

Query: 128 KPWKEKGGITRQMLEGVRRTAHFRLVIVDGKVYVEKFRQSIQTRDVFTLWGILQLLRVYP 187
           +PW EK GITR+ LE    TA FRL I++G++YVEKFR++ QTRDVFT+WG +QLLR YP
Sbjct: 148 RPW-EKTGITREALERANATAIFRLAIINGRIYVEKFREAFQTRDVFTIWGFVQLLRRYP 206

Query: 188 GKLPDLELMFDCDDRPVIHLANFQG-PKAAPPPLFRYCSDQGSLDIVFPDWSFWGWAETN 246
           GK+PDLELMFDC D PV+  A F G  +  PPPLFRYC++  +LDIVFPDWS+WGWAE N
Sbjct: 207 GKIPDLELMFDCVDWPVVKAAEFAGVDQPPPPPLFRYCANDETLDIVFPDWSYWGWAEVN 266

Query: 247 IRPWREVLKDIKEGNKRTKWEDRVPYAYWKGNPHVARTRQNLLKCNVTPQNDWNTRLYIQ 306
           I+PW  +LK+++EGN+RTKW DR PYAYWKGNP VA TR +L+KCN++   DW  RLY Q
Sbjct: 267 IKPWESLLKELREGNQRTKWIDREPYAYWKGNPTVAETRLDLMKCNLSEVYDWKARLYKQ 326

Query: 307 DWSQESNQGYKKSNVADQCTHRYKIYIEGWAWSVSEKYIMACNSMSLYVKSSYHDFFIRG 366
           DW +ES +GYK+S++A QC HRYKIYIEG AWSVSEKYI+AC+S++L VK  Y+DFF RG
Sbjct: 327 DWVKESKEGYKQSDLASQCHHRYKIYIEGSAWSVSEKYILACDSVTLMVKPHYYDFFTRG 386

Query: 367 MKPLQHYWPIRDNSKCTSLKYAVEWGNNHTDKAQAIGEAASRFIQEDLDMDHVYDYMFHL 426
           M P  HYWP++++ KC S+K+AV+WGN H  KAQ IG+ AS F+Q++L MD+VYDYMFHL
Sbjct: 387 MFPGHHYWPVKEDDKCRSIKFAVDWGNLHMRKAQDIGKKASEFVQQELKMDYVYDYMFHL 446

Query: 427 LNEYAKLLKFKPTVPPGAVEFCPETLACAVNGTQRRFMEESMVKFPSHSNPCTIPPPYDP 486
           L +Y+KLL+FKP +P  + E C E +AC  +G +R+FM ES+VK P+ + PC +PPPYDP
Sbjct: 447 LIQYSKLLRFKPEIPQNSTELCSEAMACPRDGNERKFMMESLVKRPAETGPCAMPPPYDP 506

Query: 487 STFQSFQEEKANATRQVEIWEDKYWLKKN 515
           ++F S  + + + T ++E WE KYW K+N
Sbjct: 507 ASFYSVLKRRQSTTSRIEQWESKYWRKQN 535


>AT2G45830.2 | Symbols: DTA2 | downstream target of AGL15 2 |
           chr2:18866850-18868344 FORWARD LENGTH=382
          Length = 382

 Score =  559 bits (1440), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 254/372 (68%), Positives = 306/372 (82%), Gaps = 1/372 (0%)

Query: 140 MLEGVRRTAHFRLVIVDGKVYVEKFRQSIQTRDVFTLWGILQLLRVYPGKLPDLELMFDC 199
           MLE  RRTAHFR+VI+DG+VYV+K+R+SIQTRDVFTLWGI+QLLR YPG+LPDLELMFD 
Sbjct: 1   MLEKARRTAHFRVVILDGRVYVKKYRKSIQTRDVFTLWGIVQLLRWYPGRLPDLELMFDP 60

Query: 200 DDRPVIHLANFQGPK-AAPPPLFRYCSDQGSLDIVFPDWSFWGWAETNIRPWREVLKDIK 258
           DDRP +   +FQG +  APPPLFRYCSD  SLDIVFPDWSFWGWAE NI+PW + L  I+
Sbjct: 61  DDRPTVRSKDFQGQQHPAPPPLFRYCSDDASLDIVFPDWSFWGWAEVNIKPWDKSLVAIE 120

Query: 259 EGNKRTKWEDRVPYAYWKGNPHVARTRQNLLKCNVTPQNDWNTRLYIQDWSQESNQGYKK 318
           EGNK T+W+DRV YAYW+GNP+VA TR++LL+CNV+ Q DWNTRLYIQDW +ES +G+K 
Sbjct: 121 EGNKMTQWKDRVAYAYWRGNPNVAPTRRDLLRCNVSAQEDWNTRLYIQDWDRESREGFKN 180

Query: 319 SNVADQCTHRYKIYIEGWAWSVSEKYIMACNSMSLYVKSSYHDFFIRGMKPLQHYWPIRD 378
           SN+ +QCTHRYKIYIEGWAWSVSEKYIMAC+SM+LYV+  ++DF++RGM PLQHYWPIRD
Sbjct: 181 SNLENQCTHRYKIYIEGWAWSVSEKYIMACDSMTLYVRPMFYDFYVRGMMPLQHYWPIRD 240

Query: 379 NSKCTSLKYAVEWGNNHTDKAQAIGEAASRFIQEDLDMDHVYDYMFHLLNEYAKLLKFKP 438
            SKCTSLK+AV WGN H D+A  IGE  SRFI+E++ M++VYDYMFHL+NEYAKLLKFKP
Sbjct: 241 TSKCTSLKFAVHWGNTHLDQASKIGEEGSRFIREEVKMEYVYDYMFHLMNEYAKLLKFKP 300

Query: 439 TVPPGAVEFCPETLACAVNGTQRRFMEESMVKFPSHSNPCTIPPPYDPSTFQSFQEEKAN 498
            +P GA E  P+ + C+  G  R FMEESMV FPS  +PC +P P++P   +   E K N
Sbjct: 301 EIPWGATEITPDIMGCSATGRWRDFMEESMVMFPSEESPCEMPSPFNPHDLKEILERKTN 360

Query: 499 ATRQVEIWEDKY 510
            TRQVE WED+Y
Sbjct: 361 LTRQVEWWEDQY 372


>AT1G63420.1 | Symbols:  | Arabidopsis thaliana protein of unknown
           function (DUF821) | chr1:23515874-23518777 FORWARD
           LENGTH=578
          Length = 578

 Score =  527 bits (1358), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 258/450 (57%), Positives = 332/450 (73%), Gaps = 13/450 (2%)

Query: 74  PQESQEFSLRCTKSKNNKETKQTCPRDNYPTSHSPSNPQNLTCPSYFRWIHEDLKPWKEK 133
           P+E+   S+ C+ S  N+    +C R      +      N +CP YF+WIHEDLKPW+E 
Sbjct: 130 PEETGS-SVDCS-SFLNQNRSGSCSRTLQSGYNQNQTESNRSCPDYFKWIHEDLKPWRET 187

Query: 134 GGITRQMLEGVRRTAHFRLVIVDGKVYVEKFRQSIQTRDVFTLWGILQLLRVYPGKLPDL 193
           G IT++M+E  + TAHFRLVI++GKV+VE +++SIQTRD FTLWGILQLLR YPGKLPD+
Sbjct: 188 G-ITKEMVERGKTTAHFRLVILNGKVFVENYKKSIQTRDAFTLWGILQLLRKYPGKLPDV 246

Query: 194 ELMFDCDDRPVIHLANF----QGPKAAPPPLFRYCSDQGSLDIVFPDWSFWGWAETNIRP 249
           +LMFDCDDRPVI    +    +  + APPPLFRYC D+ ++DIVFPDWSFWGW E NIR 
Sbjct: 247 DLMFDCDDRPVIRSDGYNILNRTVENAPPPLFRYCGDRWTVDIVFPDWSFWGWQEINIRE 306

Query: 250 WREVLKDIKEGNKRTKWEDRVPYAYWKGNPHVAR-TRQNLLKCNVTPQNDWNTRLYIQDW 308
           W +VLK+++EG K+ K+ +R  YAYWKGNP VA  +R++LL CN++  +DWN R++IQDW
Sbjct: 307 WSKVLKEMEEGKKKKKFMERDAYAYWKGNPFVASPSREDLLTCNLSSLHDWNARIFIQDW 366

Query: 309 SQESNQGYKKSNVADQCTHRYKIYIEGWAWSVSEKYIMACNSMSLYVKSSYHDFFIRGMK 368
             E  +G++ SNVA+QCT+RYKIYIEG+AWSVSEKYI+AC+S++L VK  Y+DFF R ++
Sbjct: 367 ISEGQRGFENSNVANQCTYRYKIYIEGYAWSVSEKYILACDSVTLMVKPYYYDFFSRTLQ 426

Query: 369 PLQHYWPIRDNSKCTSLKYAVEWGNNHTDKAQAIGEAASRFIQEDLDMDHVYDYMFHLLN 428
           PLQHYWPIRD  KC S+K+AV+W NNHT KAQ IG  AS F+Q DL M++VYDYMFHLLN
Sbjct: 427 PLQHYWPIRDKDKCRSIKFAVDWLNNHTQKAQEIGREASEFMQRDLSMENVYDYMFHLLN 486

Query: 429 EYAKLLKFKPTVPPGAVEFCPETLACA-----VNGTQRRFMEESMVKFPSHSNPCTIPPP 483
           EY+KLLK+KP VP  +VE C E L C      VNG  ++FM  S+V  P  S PC++PPP
Sbjct: 487 EYSKLLKYKPQVPKNSVELCTEALVCPSEGEDVNGVDKKFMIGSLVSRPHASGPCSLPPP 546

Query: 484 YDPSTFQSFQEEKANATRQVEIWEDKYWLK 513
           +D +  + F  +K N  RQVE WED YW K
Sbjct: 547 FDSNGLEKFHRKKLNLIRQVEKWEDSYWQK 576


>AT3G61280.1 | Symbols:  | Arabidopsis thaliana protein of unknown
           function (DUF821) | chr3:22681145-22683589 FORWARD
           LENGTH=536
          Length = 536

 Score =  496 bits (1276), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 234/443 (52%), Positives = 311/443 (70%), Gaps = 9/443 (2%)

Query: 74  PQESQEFSLRCTKSKNNKETKQTCPRDNYPTSHSPSNPQNLTCPSYFRWIHEDLKPWKEK 133
           P+ + +  L CT   +N  T QTCP  NYPT   P+   + TCP YF+WIH DLK W +K
Sbjct: 92  PKIAIKIPLNCTSLNSN--TTQTCP-SNYPTKFEPAISSSETCPDYFKWIHRDLKVW-QK 147

Query: 134 GGITRQMLEGVRRTAHFRLVIVDGKVYVEKFRQSIQTRDVFTLWGILQLLRVYPGKLPDL 193
            GITR+ LE  R  AHFR+VI  G++YV ++ ++ QTRDVFT+WGILQLLR+YPG++PDL
Sbjct: 148 TGITRETLERARPNAHFRIVIKSGRLYVHQYEKAFQTRDVFTIWGILQLLRMYPGQIPDL 207

Query: 194 ELMFDCDDRPVIHLANFQGPKAA---PPPLFRYCSDQGSLDIVFPDWSFWGWAETNIRPW 250
           EL+F C DRP I   + +  +     PPPLF YC  + + DIVFPDWSFWGW E NI+ W
Sbjct: 208 ELLFLCHDRPAIWKRDLKKKRKDTWPPPPLFHYCGHRDAYDIVFPDWSFWGWPELNIKEW 267

Query: 251 REVLKDIKEGNKRTKWEDRVPYAYWKGNPHVARTRQNLLKCNVTPQNDWNTRLYIQDWSQ 310
            ++   +KEGNK+ KWEDRVPYAYWKGNPHV+  R +L++CN + + D   RLY+QDW  
Sbjct: 268 NKLSVALKEGNKKVKWEDRVPYAYWKGNPHVSPIRGDLMRCNFSDKYDPMVRLYVQDWRS 327

Query: 311 ESNQGYKKSNVADQCTHRYKIYIEGWAWSVSEKYIMACNSMSLYVKSSYHDFFIRGMKPL 370
           E   G++ SN+ DQCTHRYKIYIEG AWSVSEKYI++C+SM+L VK  Y+DFF R M P+
Sbjct: 328 EIEAGFRGSNLEDQCTHRYKIYIEGNAWSVSEKYILSCDSMTLLVKPEYYDFFFRSMVPM 387

Query: 371 QHYWPIRDNSKCTSLKYAVEWGNNHTDKAQAIGEAASRFIQEDLDMDHVYDYMFHLLNEY 430
           +H+WPIR N+KC  LK+AVEWGNN+T+KAQ IG   S ++ ++L M +VYDYM ++L  Y
Sbjct: 388 KHFWPIRQNNKCGDLKFAVEWGNNNTEKAQIIGRQGSEYMMKNLKMKYVYDYMLYVLQGY 447

Query: 431 AKLLKFKPTVPPGAVEFCPETLACAV--NGTQRRFMEESMVKFPSHSNPCTIPPPYDPST 488
            KL+K   TVP  A E C ET+AC++   G  R+ M++S+V  PS    C +PP Y    
Sbjct: 448 GKLMKLDVTVPENATEVCSETMACSITDGGRIRQCMDDSLVMSPSVKAACDLPPSYGDYE 507

Query: 489 FQSFQEEKANATRQVEIWEDKYW 511
            + F++++ +A R+VE W +KYW
Sbjct: 508 LKKFRKKQESAERKVEQWTNKYW 530


>AT2G45840.1 | Symbols:  | Arabidopsis thaliana protein of unknown
           function (DUF821) | chr2:18869286-18871487 FORWARD
           LENGTH=523
          Length = 523

 Score =  477 bits (1228), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 232/456 (50%), Positives = 307/456 (67%), Gaps = 20/456 (4%)

Query: 63  NQNLPATHLAKPQESQEFSLRCTKSKNNKETKQTCPRDNYPTSHSPSNPQNLTCPSYFRW 122
           N  +P   L  P     F+L+C+  +N     QTCP  N P    PS  +  TCP YFRW
Sbjct: 72  NATIPKEKLTTPLN---FTLQCSLDQNI--ATQTCPASN-PEKSQPSKDEPETCPDYFRW 125

Query: 123 IHEDLKPWKEKGGITRQMLEGVRRTAHFRLVIVDGKVYVEKFRQSIQTRDVFTLWGILQL 182
           IH+DL+ W+E G ITR+ LE     AHFRL+I  G+VYV ++++S QTRDVFT+WGI+QL
Sbjct: 126 IHKDLEAWRETG-ITRETLERASDKAHFRLIIKGGRVYVHQYKKSFQTRDVFTIWGIVQL 184

Query: 183 LRVYPGKLPDLELMFDCDDRPVIHLANFQGPKAA------PPPLFRYCSDQGSLDIVFPD 236
           LR+YPG++PDLEL+F C D P I   +++ P+        PPPLF YC   G+ DIVFPD
Sbjct: 185 LRMYPGQVPDLELLFMCHDSPEIWRRDYR-PRPGVNVTWPPPPLFHYCGHSGAFDIVFPD 243

Query: 237 WSFWGWAETNIRPWREVLKDIKEGNKRTKWEDRVPYAYWKGNPHVARTRQNLLKCNVTPQ 296
           WSFWGW E NI+ W +  + I EG K+ KWE+R PYAYWKGNP VA  R++L+ C+    
Sbjct: 244 WSFWGWPEINIKEWNKQSELISEGIKKVKWEEREPYAYWKGNPGVAMVRRDLMHCH---- 299

Query: 297 NDWNTRLYIQDWSQESNQGYKKSNVADQCTHRYKIYIEGWAWSVSEKYIMACNSMSLYVK 356
            D    LY QDWS+E   GY+ SN+ DQCTHRYKIY+EG AWSVSEKYI+AC+SM+L VK
Sbjct: 300 -DPMVHLYRQDWSREGRIGYRTSNLEDQCTHRYKIYVEGRAWSVSEKYILACDSMTLLVK 358

Query: 357 SSYHDFFIRGMKPLQHYWPIRDNSKCTSLKYAVEWGNNHTDKAQAIGEAASRFIQEDLDM 416
             Y DFF R + P++HYWPIR   KC+ + +AV WGNN+T KA+AIG   S +++++L M
Sbjct: 359 PFYFDFFTRSLVPMEHYWPIRPQEKCSDIVFAVHWGNNNTKKARAIGRNGSGYVRKNLKM 418

Query: 417 DHVYDYMFHLLNEYAKLLKFKPTVPPGAVEFCPETLACAVNGTQ-RRFMEESMVKFPSHS 475
            +VYDYM HLL  Y KL+K    VP GA E CPET+AC +NG + R+ M++S+V  PS  
Sbjct: 419 KYVYDYMLHLLQSYGKLMKMNVEVPQGAKEVCPETMACPINGGRMRQSMDDSLVMSPSVK 478

Query: 476 NPCTIPPPYDPSTFQSFQEEKANATRQVEIWEDKYW 511
             C +PPP++    + F E+K +  ++VE W ++YW
Sbjct: 479 ATCEMPPPFEEDELKKFLEKKESVEKEVEKWTNEYW 514


>AT3G61290.1 | Symbols:  | Arabidopsis thaliana protein of unknown
           function (DUF821) | chr3:22684407-22686772 FORWARD
           LENGTH=455
          Length = 455

 Score =  468 bits (1204), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 224/427 (52%), Positives = 294/427 (68%), Gaps = 8/427 (1%)

Query: 95  QTCPRDNYPTSHSPSNPQNLTCPSYFRWIHEDLKPWKEKGGITRQMLEGVRRTAHFRLVI 154
           QTCP   YP+   P      TCP YFRWI +DLK W+E G ITR+ LE  +  AHFRLVI
Sbjct: 24  QTCP-STYPSRLEPMISSLETCPDYFRWIQQDLKVWEETG-ITRETLERAKPKAHFRLVI 81

Query: 155 VDGKVYVEKFRQSIQTRDVFTLWGILQLLRVYPGKLPDLELMFDCDDRPVIHLANFQGPK 214
             G++YV ++ ++ ++RDV T+WGILQLLR+YPG++PDLEL+F C D P I   +F+ P+
Sbjct: 82  KSGRLYVHQYDKAYESRDVLTIWGILQLLRMYPGQVPDLELLFFCHDIPAIWKRDFRQPE 141

Query: 215 A----APPPLFRYCSDQGSLDIVFPDWSFWGWAETNIRPWREVLKDIKEGNKRTKWEDRV 270
                 PPPLF+YC  + +  IVFPDWSFWGW E NI+ W ++   I+E NKR KW DRV
Sbjct: 142 PNATWPPPPLFQYCGHREAYGIVFPDWSFWGWPEVNIKEWTKLSVAIREANKRVKWNDRV 201

Query: 271 PYAYWKGNPHVARTRQNLLKCNVTPQNDWNTRLYIQDWSQESNQGYKKSNVADQCTHRYK 330
           PYAYWKGN  V R R NL+KCN + + D   RLY QDW +E   G+K SN+ DQCTHRYK
Sbjct: 202 PYAYWKGNSGVHRERGNLMKCNFSDKYDPMVRLYEQDWGKEREIGFKSSNLEDQCTHRYK 261

Query: 331 IYIEGWAWSVSEKYIMACNSMSLYVKSSYHDFFIRGMKPLQHYWPIRDNSKCTSLKYAVE 390
           IYIEG AWSVS+KYI+AC+SM+L +K+ Y DFF R + PL+HYWPI+ + KC  LK+AVE
Sbjct: 262 IYIEGRAWSVSKKYILACDSMTLLIKAEYFDFFGRSLVPLEHYWPIKSHEKCGDLKFAVE 321

Query: 391 WGNNHTDKAQAIGEAASRFIQEDLDMDHVYDYMFHLLNEYAKLLKFKPTVPPGAVEFCPE 450
           WGNN+T KAQ IG   S +I ++L+M +VYDYM ++L  Y KL+K   TVP  A E C E
Sbjct: 322 WGNNNTKKAQVIGRQGSDYIMKNLEMKYVYDYMLYVLQGYGKLMKLDVTVPENATEVCSE 381

Query: 451 TLACAV--NGTQRRFMEESMVKFPSHSNPCTIPPPYDPSTFQSFQEEKANATRQVEIWED 508
           T+AC +   G  R+ M++S+V  PS  + C +P PY     + F E++ +A R+VE W +
Sbjct: 382 TMACPITDGGLIRQCMDDSLVMSPSVKSACDLPRPYRDDELKRFLEKQESAERKVEKWTN 441

Query: 509 KYWLKKN 515
           +YW  +N
Sbjct: 442 EYWEAQN 448


>AT1G07220.1 | Symbols:  | Arabidopsis thaliana protein of unknown
           function (DUF821) | chr1:2217073-2219379 REVERSE
           LENGTH=507
          Length = 507

 Score =  390 bits (1002), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 182/385 (47%), Positives = 256/385 (66%), Gaps = 9/385 (2%)

Query: 98  PRDNYPTSHSPSNPQNLTCPSYFRWIHEDLKPWKEKGGITRQMLEGVRRTAHFRLVIVDG 157
           P+  +  S S        CP +FRWIH DL+PW  K G+T++ ++  +  A FR+VI+ G
Sbjct: 97  PKSLHSESGSGRQTHQPQCPDFFRWIHRDLEPWA-KTGVTKEHVKRAKANAAFRVVILSG 155

Query: 158 KVYVEKFRQSIQTRDVFTLWGILQLLRVYPGKLPDLELMFDCDDRPVIHLANFQGPKAAP 217
           K+YV+ +   +Q+R +FT+WGILQLL  YPG +PD+++MFDC D+P+I+   +Q   + P
Sbjct: 156 KLYVDLYYACVQSRMMFTIWGILQLLTKYPGMVPDVDMMFDCMDKPIINQTEYQ---SFP 212

Query: 218 PPLFRYCSDQGSLDIVFPDWSFWGWAETNIRPWREVLKDIKEGNKRTKWEDRVPYAYWKG 277
            PLFRYC+++  LDI FPDWSFWGW+ETN+RPW E   DIK+G++R  W ++ P AYWKG
Sbjct: 213 VPLFRYCTNEAHLDIPFPDWSFWGWSETNLRPWEEEFGDIKQGSRRRSWYNKQPRAYWKG 272

Query: 278 NPHVAR-TRQNLLKCNVTPQNDWNTRLYIQDWSQESNQGYKKSNVADQCTHRYKIYIEGW 336
           NP V    R  L+KCN +    W  ++  QDW++E+  G+++S +++QC HRYKIY EG+
Sbjct: 273 NPDVVSPIRLELMKCNHS--RLWGAQIMRQDWAEEAKGGFEQSKLSNQCNHRYKIYAEGY 330

Query: 337 AWSVSEKYIMACNSMSLYVKSSYHDFFIRGMKPLQHYWPIRDNSKCTSLKYAVEWGNNHT 396
           AWSVS KYI++C SM+L +   Y DFF RG+ P ++YWPI     C S+KYAV+WGN++ 
Sbjct: 331 AWSVSLKYILSCGSMTLIISPEYEDFFSRGLLPKENYWPISPTDLCRSIKYAVDWGNSNP 390

Query: 397 DKAQAIGEAASRFIQEDLDMDHVYDYMFHLLNEYAKLLKFKPTVPPGAVEFCPETLACAV 456
            +A+ IG+    ++ E L M+ VYDYMFHL+ EY+KL KFKP  P  A E C  +L C  
Sbjct: 391 SEAETIGKRGQGYM-ESLSMNRVYDYMFHLITEYSKLQKFKPEKPASANEVCAGSLLCIA 449

Query: 457 NGTQRRFMEESMVKFPSHSNPCTIP 481
              +R  +E S V  PS   PC  P
Sbjct: 450 EQKERELLERSRV-VPSLDQPCKFP 473


>AT3G61280.2 | Symbols:  | Arabidopsis thaliana protein of unknown
           function (DUF821) | chr3:22681145-22682869 FORWARD
           LENGTH=378
          Length = 378

 Score =  295 bits (754), Expect = 7e-80,   Method: Compositional matrix adjust.
 Identities = 141/265 (53%), Positives = 182/265 (68%), Gaps = 7/265 (2%)

Query: 74  PQESQEFSLRCTKSKNNKETKQTCPRDNYPTSHSPSNPQNLTCPSYFRWIHEDLKPWKEK 133
           P+ + +  L CT   +N  T QTCP  NYPT   P+   + TCP YF+WIH DLK W +K
Sbjct: 92  PKIAIKIPLNCTSLNSN--TTQTCP-SNYPTKFEPAISSSETCPDYFKWIHRDLKVW-QK 147

Query: 134 GGITRQMLEGVRRTAHFRLVIVDGKVYVEKFRQSIQTRDVFTLWGILQLLRVYPGKLPDL 193
            GITR+ LE  R  AHFR+VI  G++YV ++ ++ QTRDVFT+WGILQLLR+YPG++PDL
Sbjct: 148 TGITRETLERARPNAHFRIVIKSGRLYVHQYEKAFQTRDVFTIWGILQLLRMYPGQIPDL 207

Query: 194 ELMFDCDDRPVIHLANFQGPKAA---PPPLFRYCSDQGSLDIVFPDWSFWGWAETNIRPW 250
           EL+F C DRP I   + +  +     PPPLF YC  + + DIVFPDWSFWGW E NI+ W
Sbjct: 208 ELLFLCHDRPAIWKRDLKKKRKDTWPPPPLFHYCGHRDAYDIVFPDWSFWGWPELNIKEW 267

Query: 251 REVLKDIKEGNKRTKWEDRVPYAYWKGNPHVARTRQNLLKCNVTPQNDWNTRLYIQDWSQ 310
            ++   +KEGNK+ KWEDRVPYAYWKGNPHV+  R +L++CN + + D   RLY+QDW  
Sbjct: 268 NKLSVALKEGNKKVKWEDRVPYAYWKGNPHVSPIRGDLMRCNFSDKYDPMVRLYVQDWRS 327

Query: 311 ESNQGYKKSNVADQCTHRYKIYIEG 335
           E   G++ SN+ DQCTHRY   I  
Sbjct: 328 EIEAGFRGSNLEDQCTHRYMCRIHS 352