Miyakogusa Predicted Gene
- Lj6g3v0920310.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj6g3v0920310.1 Non Chatacterized Hit- tr|I1KY61|I1KY61_SOYBN
Uncharacterized protein (Fragment) OS=Glycine max
PE=4,64.44,0,SUBFAMILY NOT NAMED,NULL; FAMILY NOT NAMED,NULL;
seg,NULL; DUF4220,Domain of unknown function DUF422,CUFF.58498.1
(799 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT5G45530.1 | Symbols: | Protein of unknown function (DUF594) |... 482 e-136
AT5G45470.1 | Symbols: | Protein of unknown function (DUF594) |... 357 3e-98
AT5G45480.1 | Symbols: | Protein of unknown function (DUF594) |... 348 9e-96
AT5G45460.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 345 7e-95
AT5G45540.1 | Symbols: | Protein of unknown function (DUF594) |... 341 1e-93
AT4G19090.1 | Symbols: | Protein of unknown function (DUF594) |... 179 8e-45
AT4G19080.1 | Symbols: | Protein of unknown function (DUF594) |... 166 8e-41
>AT5G45530.1 | Symbols: | Protein of unknown function (DUF594) |
chr5:18454316-18457222 REVERSE LENGTH=798
Length = 798
Score = 482 bits (1241), Expect = e-136, Method: Compositional matrix adjust.
Identities = 312/881 (35%), Positives = 445/881 (50%), Gaps = 179/881 (20%)
Query: 5 IPEFLKNTWSNWNIRIVILTSLCLQITLIFLAPFRKRTRKRTLAILLWSTYLLADYTANF 64
IP +K WNIR +++ SL Q +LIFLAP RKRT K+ LA +LW+ YLLAD+TAN+
Sbjct: 4 IPPAIKKILDKWNIRGLVIMSLLFQTSLIFLAPMRKRTSKKLLAAVLWTAYLLADWTANY 63
Query: 65 CIGLISNKYGDKDTNISNI--NDFVLTLWAPFLLLHLGGPDTITAFALEDNELWLRHMLG 122
+ I+ G K+T + N +L LWAPFLLLHLGGPDTITA ALEDN LW RH+ G
Sbjct: 64 AVSQITKNQG-KETEPGDPPKNKKLLALWAPFLLLHLGGPDTITALALEDNALWQRHLFG 122
Query: 123 LMVQLCLTSYVFILTLPENTLWIPTVLVFMSGIIKYAERTRSLQLASIDNFRQSMTRVPD 182
L+ Q Y + +L EN LW P L+F++G IKY ERTR+L AS+D F+ M + D
Sbjct: 123 LVSQALAGVYAVVQSL-ENVLWPPITLLFITGTIKYVERTRALYSASLDKFKDRMLQRAD 181
Query: 183 PGPNYAKLMEEYKSRKDAGLPTKIVSMPELAEQGLITPMVGPDVXXXXXXXXXXXXXXXX 242
G NYAKLMEE+ SRK + LPT+I E + +V PD
Sbjct: 182 AGSNYAKLMEEFASRKMSNLPTEIFLTDEPDKHERPPTLVKPD----------------- 224
Query: 243 IEKQSSEVADSDSASKHEQEKLDDVDVVRDAYHHFNTFKGLIVDMIFSFRERNESREYFL 302
L D+++V+ + FNTFKGL+VD+IFSFRER+ESR++F
Sbjct: 225 -------------------RDLTDLEIVQYGFKFFNTFKGLVVDLIFSFRERDESRDFFK 265
Query: 303 SQTAVDALRVIEVELNFMYEAFYTKASVVRDKVG-------------------------- 336
+ALR+IE EL F+YE+ YTK +++ +G
Sbjct: 266 ELKPGEALRIIETELGFLYESMYTKTAILHTGIGTLFRLISFGSLLSSFFVFHRRPLKSE 325
Query: 337 ----------YLFRFVGVCSIVAALVLFVLD----------------------------- 357
Y+ VG+ +A++V+F+L
Sbjct: 326 DFHGADVVITYVLFIVGIALDLASMVIFLLSDWTFAVLRNLKDDPEEKSTSIDSLFNWFL 385
Query: 358 -------QKHGCDKFDI--MVTYTLFYGAVALDLVSLLMLIFSDHTYASFAHPQRNYCTR 408
+KH C+ +++ F + + + F S H +RN C
Sbjct: 386 EFRKPRWKKHTCNGNQTHEVLSTGFFTRRWSGTIYGFNFIGFCLKAKVSRIHQKRN-CNL 444
Query: 409 GIGKFAISVFKTYLKLKRPT--WQEKPERKGKILSTRTLSRRWSGSISEFNLVSYILNK- 465
+ + +S+F ++ + W + R S R++ R+WS Y L
Sbjct: 445 LVWDYVVSLFDLVIRRIQMMIGWIKNVNR-----SIRSVLRQWSKKNPMIRCTVYPLYLV 499
Query: 466 ---------RGSWVDKVIDYI-GATELVEQWRYEKKKPLLQDLWIFIFKELERKSGDADD 515
R W K ID I T ++ R+ ++PL ++ W FIF E++ KSG A+
Sbjct: 500 FFAGIPEVFRVLW--KYIDRIFSVTSYLDGIRFISREPLTKNQWEFIFNEVKDKSGFAET 557
Query: 516 VETIQRICSSRGAWVIQEGQVLRDDINRLMPYVDANTVTFDQSLILWHIATDLLFYEEKV 575
E +++ +RG W +++ +++ +++ LM Y++ V +DQSL+LWHIAT+L F +E+
Sbjct: 558 PEVAKKVSWARGEWALRDSKLM--EVDTLMRYIEK--VDYDQSLLLWHIATELCFQKEEG 613
Query: 576 IQVSNIDHETDEDVQRNFSKLLSDYMLYLLIMQPTMMSAVRGIGQIRFQDTCAEAIKFFE 635
++ + E +D R FSK++SDYM+YLLIM+P +MS V GIG IRF+DT AEA +FF+
Sbjct: 614 GKMEKLSREGYDD--REFSKIISDYMMYLLIMRPKLMSEVAGIGTIRFRDTKAEAERFFK 671
Query: 636 KRDIMAEKEKGNDAFSKNFWNRLIDLGKKKFHKSTRKIEKNIEHQKEACSKLSGVCTEQE 695
R I K++ K A + V + E
Sbjct: 672 GRQI-----------------------------------KDLRDMKRASETVLLVSNDIE 696
Query: 696 PSSVKGDRSKSVLFDGCRLAYXXXXXXXXXX---XWKIMAQVWVELLSYAACSCRPITHV 752
P VKGDRSKSVLFD LA W+++++VWVELL YAA C+ HV
Sbjct: 697 PILVKGDRSKSVLFDASMLAKELQNLKESSNEDGKWRVLSKVWVELLCYAASHCKATEHV 756
Query: 753 QQLSKGGEFMSLVWLLMTHLGLAKQFQIKEGHARAKLIVGE 793
QLS+GGE ++ VWLLM H GL QFQI +G ARAKL+VGE
Sbjct: 757 AQLSRGGELLNFVWLLMAHFGLGDQFQINKGDARAKLVVGE 797
>AT5G45470.1 | Symbols: | Protein of unknown function (DUF594) |
chr5:18422164-18424764 REVERSE LENGTH=866
Length = 866
Score = 357 bits (915), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 190/403 (47%), Positives = 260/403 (64%), Gaps = 34/403 (8%)
Query: 1 MVNPIPEFLKNTWSNWNIRIVILTSLCLQITLIFLAPFRKRTRKRTLAILLWSTYLLADY 60
MV IP+ +K+ W WNIR ++ SL LQ LI +P RKRT +R L +L+WS+YLLAD+
Sbjct: 1 MVEVIPKHIKDVWDRWNIRGAVILSLTLQAILICFSPLRKRTPRRLLIVLVWSSYLLADW 60
Query: 61 TANFCIGLISNKYG-DKDTNISNINDFVLTLWAPFLLLHLGGPDTITAFALEDNELWLRH 119
+ANF +GLIS G D + + V+ LWAPFLLLHLGGPDTITAFALEDN LWLRH
Sbjct: 61 SANFAVGLISKNQGKDLKPDDPPQDKKVMALWAPFLLLHLGGPDTITAFALEDNALWLRH 120
Query: 120 MLGLMVQLCLTSYVFILTLPENTLWIPTVLVFMSGIIKYAERTRSLQLASIDNFRQSMTR 179
+ GL+ Q YV +++LP N+LW+ VLVF+SG IKY ERT +L AS+D FR SM +
Sbjct: 121 VFGLVFQAIAGVYVVVMSLP-NSLWVVIVLVFVSGTIKYLERTTALYSASLDKFRDSMIQ 179
Query: 180 VPDPGPNYAKLMEEYKSRKDAGLPTKIVSMPELAEQGLITPMVGPDVXXXXXXXXXXXXX 239
PDPGPNYAKLMEEYK++K+A LPTKIV + E PD
Sbjct: 180 APDPGPNYAKLMEEYKAKKEARLPTKIVLIDE------------PDK------------- 214
Query: 240 XXXIEKQSSEVADSDSASKHEQEKLDDVDVVRDAYHHFNTFKGLIVDMIFSFRERNESRE 299
E + ++ ASK ++ L D+++V+ AY FNTFKGL+V++IFSFRER+ES E
Sbjct: 215 ----ENRPKKLEHPALASKKRKKDLTDLEIVQYAYKFFNTFKGLVVNLIFSFRERDESLE 270
Query: 300 YFLS-QTAVDALRVIEVELNFMYEAFYTKASVVRDKVGYLFRFVGVCSIVAALVLFVLDQ 358
F + +ALR+IE+EL F+Y+A +TK +++ +G + R ++VAA ++F
Sbjct: 271 IFENLNDPEEALRIIEIELGFLYDALFTKIAILHTGIGTVSRVFASGTLVAAFIIFHKKP 330
Query: 359 KHGCD--KFDIMVTYTLFYGAVALDLVSLLMLIFSDHTYASFA 399
G D D++VTYTLF + LD +S+L+ +FSD T A+++
Sbjct: 331 NKGTDFHGADVVVTYTLFAVGLVLDFISILLFLFSDWTCAAYS 373
Score = 233 bits (594), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 131/316 (41%), Positives = 181/316 (57%), Gaps = 40/316 (12%)
Query: 481 ELVEQWRYEKKKPLLQDLWIFIFKELERKSGDADDVETIQRICSSRGAWVIQEGQVLRDD 540
++V + + +P+ ++LW FIF+EL+ KS D E +RI +RG W ++E + +
Sbjct: 588 DMVHKTLFVHGEPMTRELWKFIFEELKNKSKYGDSPENAKRISLARGEWTLRENLPVDAE 647
Query: 541 INRLMPYVDANTVTFDQSLILWHIATDLLFYE-EKVIQVSNIDHETDEDVQRNFSKLLSD 599
+L+ YV V +DQSL++WHIAT+L + + EK D + R FSK++SD
Sbjct: 648 REKLVRYV--TKVDYDQSLLMWHIATELCYQQHEKETIPEGYDEQRKHYSNREFSKIISD 705
Query: 600 YMLYLLIMQPTMMSAVRGIGQIRFQDTCAEAIKFFEKRDIMAEKEKGNDAFSKNFWNRLI 659
YM+YLLI+QP +MS V GIG+IRF+DT AE KFF++R I
Sbjct: 706 YMMYLLILQPGLMSEVAGIGKIRFRDTLAETHKFFQRRHI-------------------- 745
Query: 660 DLGKKKFHKSTRKIEKNIEHQKEACSKLSGVCTEQEPSSVKGDRSKSVLFDGCRLA--YX 717
++ R +E A + V +E EP VKGDRSKSVLFD RLA
Sbjct: 746 --------ENDRSVET-------ATLNILDVESEIEPMGVKGDRSKSVLFDASRLAKDLA 790
Query: 718 XXXXXXXXXXWKIMAQVWVELLSYAACSCRPITHVQQLSKGGEFMSLVWLLMTHLGLAKQ 777
W+I+++VWVELL YAAC C HV+QLS+GGE ++ VWLLM H GL Q
Sbjct: 791 EMEKTHNKDKWEILSKVWVELLCYAACHCDSTAHVEQLSRGGELINFVWLLMAHFGLTDQ 850
Query: 778 FQIKEGHARAKLIVGE 793
FQI +G ARAKLI+G+
Sbjct: 851 FQINKGDARAKLIIGK 866
>AT5G45480.1 | Symbols: | Protein of unknown function (DUF594) |
chr5:18426296-18428929 REVERSE LENGTH=877
Length = 877
Score = 348 bits (893), Expect = 9e-96, Method: Compositional matrix adjust.
Identities = 177/396 (44%), Positives = 243/396 (61%), Gaps = 38/396 (9%)
Query: 1 MVNPIPEFLKNTWSNWNIRIVILTSLCLQITLIFLAPFRKRTRKRTLAILLWSTYLLADY 60
MVN IP+ +K+ W W+IR ++ SL LQ LIF AP RKR+ ++ L +WS YLLAD+
Sbjct: 1 MVNAIPKPIKDIWDEWSIRSTLIFSLSLQTFLIFFAPQRKRSSRKVLLSFIWSAYLLADW 60
Query: 61 TANFCIGLISNKYGD-KDTNISNINDFVLTLWAPFLLLHLGGPDTITAFALEDNELWLRH 119
+ANF G IS+ GD + + + W PFLLLHLGGPDTITA ALEDNELWLRH
Sbjct: 61 SANFAAGQISDSQGDDPEPGEPKKSAELFAFWVPFLLLHLGGPDTITALALEDNELWLRH 120
Query: 120 MLGLMVQLCLTSYVFILTLPENTLWIPTVLVFMSGIIKYAERTRSLQLASIDNFRQSMTR 179
+LGL Q T YV + +LP N LW P +LVF +G+IKY ERT +L LAS+D F+ SM +
Sbjct: 121 LLGLFFQSVATVYVLLQSLP-NALWKPILLVFATGVIKYVERTLALYLASLDKFKDSMIQ 179
Query: 180 VPDPGPNYAKLMEEYKSRKDAGLPTKIVSMPELAEQGLITPMVGPDVXXXXXXXXXXXXX 239
PDPGPNYAKLMEEY ++KD +PT+I+ + E
Sbjct: 180 RPDPGPNYAKLMEEYAAKKDMKMPTQIIKVGE---------------------------- 211
Query: 240 XXXIEKQSSEVADSDSASKHEQEKLDDVDVVRDAYHHFNTFKGLIVDMIFSFRERNESRE 299
E D A + +++++ AY +FN FKGL+VD+IF+F++R ES+
Sbjct: 212 --------PEKDPRDDAPVKPPDGFTPLNILQYAYKYFNIFKGLVVDLIFTFQQRAESKR 263
Query: 300 YFLSQTAVDALRVIEVELNFMYEAFYTKASVVRDKVGYLFRFVGVCSIVAALVLFVLDQK 359
+F S A +ALR++EVELNF+Y A YTKA ++ + +G+LFRF+ + + AAL +F K
Sbjct: 264 FFDSLKAEEALRILEVELNFIYAALYTKAEILHNWIGFLFRFIALGCLAAALRIFQYKSK 323
Query: 360 HGCDKFDIMVTYTLFYGAVALDLVSLLMLIFSDHTY 395
FD+ +TY L G +ALD ++L+M SD T+
Sbjct: 324 KDYSGFDVGLTYALLLGGIALDCIALIMFCASDWTF 359
Score = 244 bits (623), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 147/348 (42%), Positives = 191/348 (54%), Gaps = 51/348 (14%)
Query: 456 FNLVSYILNKRGSWVDKVIDYIGATELVEQWRYEKKKPLLQDLWIFIFKELERKSGDADD 515
F + YI + G W+ +++ G +++ Y L D+W IF E++ KS ADD
Sbjct: 571 FFCIFYIPSLPGRWIKSFMEFFGIRAQLDEVIYTSSDRLTLDMWEHIFGEVKAKSRFADD 630
Query: 516 VETIQRICSSRGAWV---IQEGQVLRDDINRLMPYVDANTVTFDQSLILWHIATDLLFYE 572
E+ R+ S+RG W IQ +L+ YV + +DQSL++WHIAT+LL+
Sbjct: 631 SESAMRVSSARGDWTLRDIQGDPETEKKREKLLRYV--MEMDYDQSLLVWHIATELLYQT 688
Query: 573 EKVIQVSNIDHETDEDVQRNFSKLLSDYMLYLLIMQPTMMSAVRGIGQIRFQDTCAEAIK 632
+K + ++ R FSK+LSDYM+YLL+MQPT+MSAV GIG+IRF+DTC EA +
Sbjct: 689 KKGTKANH--------SAREFSKILSDYMMYLLMMQPTLMSAVVGIGKIRFRDTCEEAQR 740
Query: 633 FFEKRDIMA-EKEKGNDAFSKNFWNRLIDLGKKKFHKSTRKIEKNIEHQKEACSKLSGVC 691
FF++R IM +K DA KEA + V
Sbjct: 741 FFDRRHIMGISAKKAPDA-------------------------------KEASVAILSVA 769
Query: 692 --TEQEPSSVKGDRSKSVLFDGCRLAYXXXXXXXX----XXXWKIMAQVWVELLSYAACS 745
+ EP VKGDRSKSVLFDG LA WKIM+QVWVELLSYAA
Sbjct: 770 VPAKAEPIDVKGDRSKSVLFDGAMLAKELKGLRKNKEDDSEMWKIMSQVWVELLSYAATK 829
Query: 746 CRPITHVQQLSKGGEFMSLVWLLMTHLGLAKQFQIKEGHARAKLIVGE 793
C I H QLSKGGE +S VWLLM H GL QFQI +G ARAKLI+G+
Sbjct: 830 CGAIEHAAQLSKGGELISFVWLLMAHFGLGDQFQINQGDARAKLIIGK 877
>AT5G45460.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: Protein of unknown function
(DUF594) (TAIR:AT5G45470.1); Has 30201 Blast hits to
17322 proteins in 780 species: Archae - 12; Bacteria -
1396; Metazoa - 17338; Fungi - 3422; Plants - 5037;
Viruses - 0; Other Eukaryotes - 2996 (source: NCBI
BLink). | chr5:18417154-18419265 REVERSE LENGTH=703
Length = 703
Score = 345 bits (885), Expect = 7e-95, Method: Compositional matrix adjust.
Identities = 204/497 (41%), Positives = 291/497 (58%), Gaps = 63/497 (12%)
Query: 1 MVNPIPEFLKNTWSNWNIRIVILTSLCLQITLIFLAPFRKRTRKRTLAILLWSTYLLADY 60
MV+ IP+ +K+ W WNIR I SL LQ LI +P RKRT +R L I++WS+YLLAD+
Sbjct: 1 MVDVIPKHIKDAWDRWNIRGTIFLSLTLQAFLICFSPLRKRTPRRHLIIVIWSSYLLADW 60
Query: 61 TANFCIGLISNKYG-DKDTNISNINDFVLTLWAPFLLLHLGGPDTITAFALEDNELWLRH 119
+ANF +GLIS G D + + ++ LWAPFLLLHLGGPDTITAFALEDN LWLR+
Sbjct: 61 SANFAVGLISKNQGKDLKPDDPPQDKKLMALWAPFLLLHLGGPDTITAFALEDNALWLRN 120
Query: 120 MLGLMVQLCLTSYVFILTLPENTLWIPTVLVFMSGIIKYAERTRSLQLASIDNFRQSMTR 179
+ GL+ Q YV + +LP N+LW+ +LVF+SG IKY ERT +L AS+D FR SM +
Sbjct: 121 VFGLVFQAIAGVYVVLQSLP-NSLWVTILLVFISGTIKYLERTTALYSASLDKFRDSMIQ 179
Query: 180 VPDPGPNYAKLMEEYKSRKDAGLPTKIVSMPELAEQGLITPMVGPDVXXXXXXXXXXXXX 239
PDPGPNYAKLMEEYK++K+A LPTKI+ + E PD
Sbjct: 180 GPDPGPNYAKLMEEYKAKKEAKLPTKIILIDE------------PDK------------- 214
Query: 240 XXXIEKQSSEVADSDSASKHEQEKLDDVDVVRDAYHHFNTFKGLIVDMIFSFRERNESRE 299
E + ++ AS+ ++++L +++ + AY FNTFKGL+V++IFSFRER++S E
Sbjct: 215 ----EHRPKKLEHPSLASETKRKELTHLEIAQYAYKFFNTFKGLVVNLIFSFRERDQSIE 270
Query: 300 YFLS-QTAVDALRVIEVELNFMYEAFYTKASVVRDKVGYLFRFVGVCSIVAALVLFVLDQ 358
F + + +ALR+IE+EL F+Y+A +TK +V+ +G + R V S+VAA ++F
Sbjct: 271 IFQNLEDPEEALRIIEIELGFLYDALFTKNAVLHTVLGTVSRVVASGSLVAAFIIFHKIS 330
Query: 359 KHGCD--KFDIMVTYTLFYGAVALDLVSLLMLIFSDHTYASFAHPQRNYCTRGIGKFAIS 416
G D D+++TY LF + LD +S+L+ +FSD T A+ + + + +
Sbjct: 331 NKGRDFHGADVVITYILFAVGLVLDFISILLFLFSDWTCAALSSLKDD--PDEPLSWKDR 388
Query: 417 VFKTYLKLKRPTWQ----------------EKPERK-----------GKILSTRTLSRRW 449
F L+ ++ W+ EKP K +L+T RRW
Sbjct: 389 FFNCLLEFRKLRWKMQECHNKGEHKCTKEGEKPCIKEGDDKCSMVHVHHMLTTPFFFRRW 448
Query: 450 SGSISEFNLVSYILNKR 466
GSI+ FN ++Y KR
Sbjct: 449 CGSINVFNFLAYATKKR 465
Score = 73.2 bits (178), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 46/143 (32%), Positives = 77/143 (53%), Gaps = 15/143 (10%)
Query: 457 NLVSYILNKRGSWVDKVIDYIG---ATELVEQWRYEKKKPLLQDLWIFIFKELERKSGDA 513
N + Y+ N +I+++G A +LVE+ + +P+ ++LW FIF+EL+ KS
Sbjct: 543 NFIKYLWNV-------LIEFLGIPDALDLVEKTLFVHGEPMTKELWKFIFEELKNKSKYG 595
Query: 514 DDVETIQRICSSRGAWVIQEGQVLRDDINRLMPYVDANTVTFDQSLILWHIATDLLFYEE 573
D E +RI +RG W +++ + +L+ YV V +DQSL++WHIAT+ + E
Sbjct: 596 DSPENAKRISLARGEWALRDNLPEDAEREKLVRYV--TKVDYDQSLLMWHIATEFCYQEH 653
Query: 574 KVIQVSNIDHETDEDVQRNFSKL 596
K I DE ++N +L
Sbjct: 654 K---KETIPEGYDEQCKQNHLRL 673
>AT5G45540.1 | Symbols: | Protein of unknown function (DUF594) |
chr5:18458294-18460705 REVERSE LENGTH=803
Length = 803
Score = 341 bits (875), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 192/471 (40%), Positives = 277/471 (58%), Gaps = 46/471 (9%)
Query: 1 MVNPIPEFLKNTWSNWNIRIVILTSLCLQITLIFLAPFRKRTRKRTLAILLWSTYLLADY 60
M + IP L+ W WNIR VI+ SL LQ LIF AP R+RT K+ +L+WS YLLAD+
Sbjct: 1 MTDMIPPHLRKLWDKWNIRGVIILSLFLQTILIFFAPSRRRTAKKLFLVLIWSAYLLADW 60
Query: 61 TANFCIGLISNKYGDK-DTNISNINDFVLTLWAPFLLLHLGGPDTITAFALEDNELWLRH 119
A++ +G IS+ ++ ++N + N +L W+PFLLLHLGGPDTITA ALEDNELW RH
Sbjct: 61 AADYAVGQISDSQEEEAESNKPSKNRELLAFWSPFLLLHLGGPDTITALALEDNELWDRH 120
Query: 120 MLGLMVQLCLTSYVFILTLPENTLWIPTVLVFMSGIIKYAERTRSLQLASIDNFRQSMTR 179
+ L+ Q T YV +L++P N L PT+++F+ G+IKY ERT +L AS+D F+ SM
Sbjct: 121 LFSLVCQAVATVYVILLSIP-NRLLTPTLIMFVGGVIKYVERTAALFSASLDKFKDSMLD 179
Query: 180 VPDPGPNYAKLMEEYKSRKDAGLPTKIVSM--PELAEQGLITPMVGPDVXXXXXXXXXXX 237
PDPG NYAKLMEEY++RK +PT ++ + PE +G TP V PD
Sbjct: 180 DPDPGANYAKLMEEYEARKKMNMPTDVIVVKDPEKGREG-NTP-VRPD------------ 225
Query: 238 XXXXXIEKQSSEVADSDSASKHEQEKLDDVDVVRDAYHHFNTFKGLIVDMIFSFRERNES 297
+L + V++ AY +FN FKGLIVD+IF+ +ER+ES
Sbjct: 226 ------------------------NELTALQVIQYAYKYFNIFKGLIVDLIFTNQERDES 261
Query: 298 REYFLSQTAVDALRVIEVELNFMYEAFYTKASVVRDKVGYLFRFVGVCSIVAALVLFVLD 357
R++F TA +ALR+IEVEL +Y+ +TKA ++ + G +FRF+ + +VA+L LF ++
Sbjct: 262 RKFFDKLTAEEALRIIEVELGLIYDCLFTKAEILHNWTGAVFRFIALGCLVASLCLFKMN 321
Query: 358 QKHGCDKFDIMVTYTLFYGAVALDLVSLLMLIFSDHTYASFAHPQRNYCTRGIGKFAISV 417
+K D FD+++TY L +ALD ++LLM SD T A + + + V
Sbjct: 322 KKDQYDGFDVVLTYALLICGIALDSIALLMFCVSDWTIARLRKLKEDLEEKDT--LTDRV 379
Query: 418 FKTYLKLKRPTWQE-KPERKG-KILSTRTLSRRWSGSISEFNLVSYILNKR 466
L K W+ K + G ++L+ + RRWS + +NL+ + L R
Sbjct: 380 LNWILDFKTLRWKRSKCSQDGHQVLNRNFMFRRWSEYVHAYNLIGFCLGIR 430
Score = 237 bits (605), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 138/335 (41%), Positives = 192/335 (57%), Gaps = 43/335 (12%)
Query: 467 GSWVDKVIDYIGATELVEQWRYEKKKPLLQDLWIFIFKELERKSGDADDVETIQRICSSR 526
G + +D+ G +LVE+ R+ L ++LW FIF E+++K A+D E+ + I S+R
Sbjct: 502 GLLIYNFLDFFGIKDLVEEIRFTVSDRLTRELWEFIFTEVQQKHRFAEDQESAKGISSAR 561
Query: 527 GAWVIQEGQVLRD----DINRLMPYVDANTVTFDQSLILWHIATDLLF---YEEKVIQVS 579
G W + E + D +L+ YV +DQS++LWHIAT+LL+ ++KV +
Sbjct: 562 GNWTLLETSSKKKEDGTDHTKLLQYVTEKD--YDQSILLWHIATELLYQKPIDKKVTEKE 619
Query: 580 --NIDHETDEDVQRNFSKLLSDYMLYLLIMQPTMMSAVRGIGQIRFQDTCAEAIKFFEKR 637
+ + E +E R FSK+LSDYM+YLLI+QPT+MSAV GI +IRF+DTC EA FF++R
Sbjct: 620 EHSTNREKEEHSNREFSKILSDYMMYLLIVQPTLMSAVSGIAKIRFRDTCEEAKDFFQRR 679
Query: 638 DIMAEKEKGNDAFSKNFWNRLIDLGKKKFHKSTRKIEKNIEHQKEACSKLSGVCTEQEPS 697
+ +R ++KN+ KEAC + V TE +P
Sbjct: 680 HV----------------------------DKSRYVKKNL--MKEACRAILSVNTEIDPM 709
Query: 698 SVKGDRSKSVLFDGCRLAYXXXXXXXXXXXWKIMAQVWVELLSYAACSCRPITHVQQLSK 757
+VKGDRSKSVLFD LA W+++++VWVELL YA+ C H QLSK
Sbjct: 710 AVKGDRSKSVLFDASVLA--KELMNEGENMWEVVSKVWVELLCYASLHCDSQEHASQLSK 767
Query: 758 GGEFMSLVWLLMTHLGLAKQFQIKEGHARAKLIVG 792
GGE ++ VWLLM H GL QFQI ARAKLIV
Sbjct: 768 GGELINFVWLLMAHFGLGDQFQINRDDARAKLIVA 802
>AT4G19090.1 | Symbols: | Protein of unknown function (DUF594) |
chr4:10449900-10452757 FORWARD LENGTH=751
Length = 751
Score = 179 bits (454), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 130/388 (33%), Positives = 197/388 (50%), Gaps = 71/388 (18%)
Query: 84 NDFVLTLWAPFLLLHLGGPDTITAFALEDNELWLRHMLGLMVQLCLTSYVFILTLPENTL 143
N ++ LWAPFLLLHLGGPDTITAF+LEDN LW RH LGL+ Q YV + +LP N L
Sbjct: 79 NKKLVALWAPFLLLHLGGPDTITAFSLEDNALWNRHFLGLVFQALAGVYVVVQSLP-NVL 137
Query: 144 WIPTVLVFMSGIIKYAERTRSLQLASIDNFRQSMTRVPDPGPNYAKLMEEYKSRKDAGLP 203
+ +L+F++G KY ERT +L LAS D +R SM + + +Y +D +
Sbjct: 138 SVIILLLFIAGTSKYLERTIALYLASSDKYRNSMLQASNSRFDYT------DQTRDLDMD 191
Query: 204 TKIVSMPELAE-QGLITP--MVGPDVXXXXXXXXXXXXXXXXIEKQSSEVADSDSASKHE 260
TK+ S + E +G P ++ P
Sbjct: 192 TKLASEMNMKEHRGQPKPLKLLQP------------------------------------ 215
Query: 261 QEKLDDVDVVRDAYHHFNTFKGLIVDMIFSFRERNESREYFLS--QTAVDALRVIEVELN 318
++L +++++ A+ F E + + F S Q +A +IE EL+
Sbjct: 216 HKELTHLEILQYAF----------------FLELRDESKAFFSALQLKDEAFCIIEAELD 259
Query: 319 FMYEAFYTKASVVRDKVGYLFRFVGVCSIVAALVLFVL--DQKHGCDKFDIMVTYTLFYG 376
F+YE YTK SV+ VG + RF+ + S+++A ++ ++ K DI++TYTLF
Sbjct: 260 FIYEGLYTKGSVLHSWVGLVSRFISLGSLLSAFTIYHYRHNKIQEFHKADIVITYTLFLV 319
Query: 377 AVALDLVSLLMLIFSDHTYASFAHPQRNYCTRGIGKFAISVFKTYLKLKRPTWQEKPERK 436
+ALD++S+ M + SD T A A + + R GK I L LKRP W+ + R+
Sbjct: 320 GIALDVISIHMFMVSDWTTAILAKLKDDPDERYSGKDHI--LNWILFLKRPKWKWQTCRE 377
Query: 437 G---KILSTRTLSRRWSGSISEFNLVSY 461
G ++L+T L RRW+GSI+ N ++Y
Sbjct: 378 GDQQEVLNTPFLLRRWTGSITMLNFLTY 405
Score = 157 bits (396), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 104/308 (33%), Positives = 152/308 (49%), Gaps = 57/308 (18%)
Query: 470 VDKVIDYIGATELVEQWRYEKKKPLLQDLWIFIFKELERKSGDADDVETIQRICSSRGAW 529
V VI+++G +L + R L++ S +E S+R W
Sbjct: 498 VTSVINFLGINDLFYEIRL-----------------LQKNSQILYSLENTTTKSSARRDW 540
Query: 530 VIQEGQVLRDDINRLMPYVDANTVTFDQSLILWHIATDLLFYEEKVIQVSNIDHETDEDV 589
Q+ D + L+ Y+ V ++ SL++WHIAT+L + EE + N D +++
Sbjct: 541 ASSHTQIQIADHDMLLHYI--TDVDYEHSLLIWHIATELCYQEEDSAK-ENCD-KSEYHT 596
Query: 590 QRNFSKLLSDYMLYLLIMQPTMMSAVRGIGQIRFQDTCAEAIKFFEKRDIMAEKEKGNDA 649
R SK++SDYM+YLLIMQP +MS V GIG+IRF+DT AEA +FF+K I+
Sbjct: 597 NRKISKIISDYMMYLLIMQPKLMSEVAGIGKIRFRDTLAEADRFFKKMGII--------- 647
Query: 650 FSKNFWNRLIDLGKKKFHKSTRKIEKNIEHQKEACSKLSGVCTEQEPSSVKGDRSKSVLF 709
+ +R + K A ++ T EP VKG+ SKSVLF
Sbjct: 648 ------------------RDSRNV-------KLASKEILSADTSIEPREVKGNHSKSVLF 682
Query: 710 DGCRLA--YXXXXXXXXXXXWKIMAQVWVELLSYAACSCRPITHVQQLSKGGEFMSLVWL 767
+ LA WKI+++VW+E L +AA C T ++ LSKGGEF++ VWL
Sbjct: 683 EASSLAKELQRVEKNFGEDKWKILSKVWLEFLFHAASHCDATTRMELLSKGGEFINFVWL 742
Query: 768 LMTHLGLA 775
LM H GL
Sbjct: 743 LMAHFGLG 750
>AT4G19080.1 | Symbols: | Protein of unknown function (DUF594) |
chr4:10448813-10449796 REVERSE LENGTH=327
Length = 327
Score = 166 bits (419), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 102/327 (31%), Positives = 169/327 (51%), Gaps = 44/327 (13%)
Query: 470 VDKVIDYIGATELVEQWRYEKKKPLLQDLWIFIFKELERKSGDADDVETIQRICSSRGAW 529
++ ++++ ++ V + + + L ++LW FIF+E++ +S E+ I S+RG W
Sbjct: 31 INYLLEFFSISDQVNEIVFTSRTRLTKELWEFIFEEVKHRSLLVVGSESASHIYSARGDW 90
Query: 530 VIQEGQVLRDDINRLMPYVDANTVTFDQSLILWHIATDLLFYEEKVIQVSNIDHETDEDV 589
++++ Q+ D +L+PYV V +DQ +++WH+AT+LL E +D+
Sbjct: 91 ILRDMQI-EDYYEKLLPYV--TEVEYDQGILVWHVATELLHQTE-------VDNAARNVR 140
Query: 590 QRNFSKLLSDYMLYLLIMQPTMMSAVRGIGQIRFQDTCAEAIKFFEKRDIMAEKEKGNDA 649
+ +SK +SDYM+YLL+ Q ++MS V GI +I+F+D AEA N
Sbjct: 141 SKEYSKTISDYMMYLLVAQSSLMSTVAGIDKIKFKDAIAEA---------------KNSK 185
Query: 650 FSKNFWNRLIDLGKKKFHKSTRKIEKNIEHQKEACSKLSGVCTEQEPSSVKGDR--SKSV 707
+K + ++ G + K+AC+ + TE E + R SKS+
Sbjct: 186 EAKKLFQKMHVEGSRD--------------AKKACAAIVDSFTEFELGNGNARRYQSKSM 231
Query: 708 LFDGCRLAYXXXXXXXXX---XXWKIMAQVWVELLSYAACSCRPITHVQQLSKGGEFMSL 764
LF LA WK++++VWVE+L YAA C H QL+KGGE ++
Sbjct: 232 LFQASMLAKELLHITNERGNDAMWKVVSKVWVEMLCYAATHCDSKQHAAQLNKGGELINF 291
Query: 765 VWLLMTHLGLAKQFQIKEGHARAKLIV 791
VWLLM H GL +QF+ + +RA LI+
Sbjct: 292 VWLLMAHFGLGEQFRTTKEDSRATLIL 318