Miyakogusa Predicted Gene
- Lj5g3v1737110.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj5g3v1737110.1 Non Chatacterized Hit- tr|I1IVH4|I1IVH4_BRADI
Uncharacterized protein OS=Brachypodium distachyon
GN=,29.72,0.000000000000001,SUBFAMILY NOT NAMED,NULL; FAMILY NOT
NAMED,NULL; DUF668,Protein of unknown function DUF668;
DUF3475,,CUFF.55839.1
(591 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT5G04550.1 | Symbols: | Protein of unknown function (DUF668) |... 544 e-155
AT5G51670.1 | Symbols: | Protein of unknown function (DUF668) |... 191 9e-49
AT3G23160.1 | Symbols: | Protein of unknown function (DUF668) |... 191 1e-48
AT1G34320.1 | Symbols: | Protein of unknown function (DUF668) |... 87 5e-17
AT5G08660.1 | Symbols: | Protein of unknown function (DUF668) |... 78 1e-14
AT1G30755.1 | Symbols: | Protein of unknown function (DUF668) |... 60 3e-09
>AT5G04550.1 | Symbols: | Protein of unknown function (DUF668) |
chr5:1303757-1305556 REVERSE LENGTH=599
Length = 599
Score = 544 bits (1402), Expect = e-155, Method: Compositional matrix adjust.
Identities = 295/596 (49%), Positives = 377/596 (63%), Gaps = 21/596 (3%)
Query: 1 MVAESWFQKLWKTPRKDDSNSEKVVIGVLAFEVARLMSKLVNLWQSLSEKQVARLREEIT 60
MV+E+WF+ LW+ P+K D + +K V+GVLAFEVA L+SKLV+LWQSLS+K VARLR+EIT
Sbjct: 1 MVSETWFRNLWRFPKKHDGHKDKAVLGVLAFEVASLLSKLVHLWQSLSDKNVARLRDEIT 60
Query: 61 NSVGIKKLVSDDENFVVRLISLEMLENIAHVAESVARIGKKSSDPSLKGFEDALVKCITF 120
+S GIKKLVS+D++F+VRLI EM+EN+ +VA++VAR+ +K +DP LK FE+ +
Sbjct: 61 HSTGIKKLVSEDDDFIVRLIRDEMMENVENVAKAVARLARKCNDPKLKCFENCFSDMMKT 120
Query: 121 GDDSHGWKFTSXXXXXXXXXXXXFVSTNASLYQEMEVLTDLEQTLKRMKAYTESDGLNLI 180
G D +GW+F F+S+NASLYQE E+L DLEQT KRMK+ ES NL+
Sbjct: 121 GADPYGWQFGWKKMDKKAKKMERFISSNASLYQETEILADLEQTFKRMKS-NESATDNLL 179
Query: 181 EYQKRVEWSRVEVKNLKANSLWNRTYDYTVHLLARSLFTIFSRINNLFGIQEVIDVGKTK 240
EYQK+V W R EVKNL+ SLWNRTYDYTV LL RS+FTI SR ++FGI ++
Sbjct: 180 EYQKKVTWKRHEVKNLRDVSLWNRTYDYTVILLVRSVFTILSRTKHVFGISYRVEASDVS 239
Query: 241 NPSVFYSDYIRGSQSVSEILQPSVHASDKN-VARFTSGPLGTFAAKSSQNVRTDKT---- 295
+ SD+I S SVS IL P H S+ + + RF SGPLG F +S + T T
Sbjct: 240 SAD---SDFIGRSHSVSTILTPVSHKSESSGLPRFASGPLGRFTGPASGSAATRSTKMSD 296
Query: 296 ----NIXXXXXXXXXXXXXXXXXXXXXXXPLGRNLKKPASDYGTNKNSSILKSHGHSTTI 351
++ LG+ K G KN+ + +I
Sbjct: 297 FLSGSLSAESPKSGPLVAEKHKRFKFYSGQLGKITSKSGPLIGMGKNNKKMGQTPERPSI 356
Query: 352 SGKEINTRHSRMTQVGPFKGCMVAESSSVIDCHSSPNDAQLATQNHKDADSNILSPGNED 411
S + + +R+TQVGPFKGCMV+ + N A+ ++ H + N N
Sbjct: 357 SSVKKQLKANRLTQVGPFKGCMVSHDGITPLSTRARNGARNSSAEHHILEDN----SNSV 412
Query: 412 HRTQSAFSSLCKLQPPS-QSLGAASLALHYANVIAVIEKLAASPHLIGLDARDDLYKMLP 470
H S KL + +LG A LALHYANVI VIE+ ASPHLIG DARDDLY MLP
Sbjct: 413 HVENLTLPSRPKLSDAAPNTLGTACLALHYANVIIVIERFVASPHLIGDDARDDLYNMLP 472
Query: 471 RRVRAALRSKLKPYSM---ASAVYDASLAEEWSEAMAGILEWLAPLAHNMLRWQSERSFE 527
VR +LR +LKPYS +S VYD LA EW++AMAGILEWL PLAHNM++WQSERS+E
Sbjct: 473 ASVRTSLRERLKPYSKNLSSSTVYDPGLAREWTDAMAGILEWLGPLAHNMIKWQSERSYE 532
Query: 528 QQCFVSRTKVLLVQTLYFADLEKTEAIITELLVGLNYVWRYVRELNSIGLVECGSS 583
Q VSRT ++L QTL+FA+ +KTEAIITELLVGLNYVWR+ RELN+ L EC SS
Sbjct: 533 HQSLVSRTHIVLAQTLFFANQQKTEAIITELLVGLNYVWRFGRELNAKALQECTSS 588
>AT5G51670.1 | Symbols: | Protein of unknown function (DUF668) |
chr5:20993560-20995096 FORWARD LENGTH=474
Length = 474
Score = 191 bits (486), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 101/186 (54%), Positives = 125/186 (67%), Gaps = 13/186 (6%)
Query: 406 SPGNEDH---RTQSAF--SSLCKLQPPSQSLGAASLALHYANVIAVIEKLAASPHLIGLD 460
SP +E+ T SAF S L+PP +LG A +ALHYAN+I V+EK+ P L+GLD
Sbjct: 285 SPNDEERDKTTTSSAFLEESSRLLKPPETTLGGAGVALHYANLIVVMEKMIKQPQLVGLD 344
Query: 461 ARDDLYKMLPRRVRAALRSKLKPYSMASAVYDASLAEEWSEAMAGILEWLAPLAHNMLRW 520
ARDDLY MLP VR++LRS+LK + D LA EW A+ IL WL PLA NM+RW
Sbjct: 345 ARDDLYSMLPASVRSSLRSRLKGVGFTAT--DGGLATEWKAALGRILRWLLPLAQNMIRW 402
Query: 521 QSERSFEQQCFVSRT----KVLLVQTLYFADLEKTEAIITELLVGLNYVWRYVRELNSIG 576
QSERSFEQQ + T +V+LVQTL FAD KTEA ITELLVGLNY+WR+ RE+ +
Sbjct: 403 QSERSFEQQHMATATNSQNRVMLVQTLVFADKVKTEAAITELLVGLNYIWRFEREMTAKA 462
Query: 577 L--VEC 580
L ++C
Sbjct: 463 LFNLQC 468
Score = 129 bits (325), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 69/215 (32%), Positives = 118/215 (54%), Gaps = 12/215 (5%)
Query: 26 IGVLAFEVARLMSKLVNLWQSLSEKQVARLREEITNSVGIKKLVSDDENFVVRLISLEML 85
+GVL+FEVAR+M+KL++L SL++ + R+ + G+ K+V+ DE F + L+ E+
Sbjct: 37 VGVLSFEVARVMTKLLHLTHSLTDSNLLTPRDHSLSLEGLTKIVNGDETFHLSLVCAELA 96
Query: 86 ENIAHVAESVARIGKKSSDPSLKGFEDALVKCITFGDDSHGWKFTSXXXXXXXXXXXXFV 145
+++AH A SV+R+ + + SL+ F + G D HGW +V
Sbjct: 97 DSLAHAANSVSRLSNRCTTASLRSFHRLFHEFADMGRDPHGWVMNCKDTEAKNKKIERYV 156
Query: 146 STNASLYQEMEVLTDLEQTLKRMK------------AYTESDGLNLIEYQKRVEWSRVEV 193
S +LY+EME + LE +L++ + D + +I+ Q ++E + V
Sbjct: 157 SVTTALYREMEEMAILENSLRKQSLQIGIEFEEEEDYENKKDVMKVIDLQNKIERQKQHV 216
Query: 194 KNLKANSLWNRTYDYTVHLLARSLFTIFSRINNLF 228
K LK SLWN+++D V +LARS+FT +R+ ++F
Sbjct: 217 KYLKDRSLWNKSFDTVVLILARSVFTALARLKSVF 251
>AT3G23160.1 | Symbols: | Protein of unknown function (DUF668) |
chr3:8260059-8261654 REVERSE LENGTH=531
Length = 531
Score = 191 bits (485), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 92/168 (54%), Positives = 121/168 (72%), Gaps = 3/168 (1%)
Query: 417 AFSSLCKLQPPSQSLGAASLALHYANVIAVIEKLAASPHLIGLDARDDLYKMLPRRVRAA 476
F S + ++G ++L+LHYANV+ V+EKL PHLIG +ARDDLY+MLP ++
Sbjct: 347 GFKSRLTQHASASTIGGSALSLHYANVVIVVEKLLKYPHLIGEEARDDLYQMLPTSLKTT 406
Query: 477 LRSKLKPYSMASAVYDASLAEEWSEAMAGILEWLAPLAHNMLRWQSERSFEQQC-FVSRT 535
L++ L+ Y ++YDA LA +W E + GIL WLAPLAHNM+RWQSER+FEQQ V RT
Sbjct: 407 LKASLRSYLKNISIYDAPLAHDWKETIDGILSWLAPLAHNMIRWQSERNFEQQNQIVKRT 466
Query: 536 KVLLVQTLYFADLEKTEAIITELLVGLNYVWRYVRELNSIGLVECGSS 583
VLL+QTLYFAD EKTEA I +LLVGLNY+ Y ++ N+ L++C SS
Sbjct: 467 NVLLLQTLYFADREKTEAAICKLLVGLNYICHYEQQQNA--LLDCASS 512
Score = 152 bits (383), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 81/210 (38%), Positives = 131/210 (62%), Gaps = 7/210 (3%)
Query: 23 KVVIGVLAFEVARLMSKLVNLWQSLSEKQVARLREEITNSVGIKKLVSDDENFVVRLISL 82
K IG+L+FEVA +MSK ++L +SLS+ ++++L+ E+ +S G++KLVS DEN ++ L
Sbjct: 40 KQTIGILSFEVANVMSKTIHLHRSLSDTEISKLKAEVFHSEGVRKLVSSDENHLLDLSVS 99
Query: 83 EMLENIAHVAESVARIGKKSSDPSLKGFEDALVKCITFGDDSHGWKFTSXXXXXXXXXXX 142
E L++++ VA V+R+GKK ++P+L+GFE + D F
Sbjct: 100 EKLDDLSRVASVVSRLGKKCNEPALQGFEHVYEDIVNGAIDFRKLGFLVKDMESMVKKME 159
Query: 143 XFVSTNASLYQEMEVLTDLEQT---LKRMKAYTESDGLNLIEYQKRVEWSRVEVKNLKAN 199
FV+ SLY EMEV+ +LEQ L+R + + ES + +++++ W R +VK+L+
Sbjct: 160 RFVNATCSLYCEMEVMNELEQAIVKLQRSQQHQES----VKAFEQKLMWQRQDVKSLRDG 215
Query: 200 SLWNRTYDYTVHLLARSLFTIFSRINNLFG 229
SLWN+TYD V +LAR++ TI+ RI +FG
Sbjct: 216 SLWNQTYDKVVEMLARTVCTIYGRIETVFG 245
>AT1G34320.1 | Symbols: | Protein of unknown function (DUF668) |
chr1:12520713-12524046 FORWARD LENGTH=657
Length = 657
Score = 86.7 bits (213), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 56/164 (34%), Positives = 85/164 (51%), Gaps = 17/164 (10%)
Query: 426 PP--SQSLGAASLALHYANVIAVIEKLAASPHLIGLDARDDLYKMLPRRVRAALRSKLKP 483
PP + LG+A LALHYAN+I I+ L + + RD LY+ LP +++ALRS+++
Sbjct: 373 PPINHKKLGSAGLALHYANIITQIDTLVSRSSTMPASTRDALYQGLPPSIKSALRSRIQS 432
Query: 484 YSMASAVYDASLAEEWSEAMAGILEWLAPLAHNMLR----------WQSERSFEQQCFVS 533
+ + + + E M L+WL P+A N + W S S Q
Sbjct: 433 FQVKEELTVPQIKAE----MEKTLQWLVPVATNTTKAHHGFGWVGEWASSGSEANQRPAG 488
Query: 534 RTKVLLVQTLYFADLEKTEAIITELLVGLNYVWRYVRELNSIGL 577
+T +L + TL+ AD EKTEA I +L+V L+++ VR GL
Sbjct: 489 QT-ILRIDTLHHADKEKTEAYILDLVVWLHHLVTQVRATTGYGL 531
Score = 70.9 bits (172), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 57/214 (26%), Positives = 92/214 (42%), Gaps = 16/214 (7%)
Query: 26 IGVLAFEVARLMSKLVNLWQSLSEKQVARLREEITNSVGIKKLVSDDENFVVRLISLEML 85
I +L+FEVA + K NL SLS+ + L+E + S G++ L+S D + ++R+ + +
Sbjct: 155 ISILSFEVANTIVKGANLMHSLSKDSITHLKEVVLPSEGVQNLISKDMDELLRIAAADKR 214
Query: 86 ENIAHVAESVARIGKKSSDP---SLKGFEDALVKCITFGDDSHGWKFTSXXXXXXXXXXX 142
E + + V R G + DP +L F D L G + K
Sbjct: 215 EELRIFSGEVVRFGNRCKDPQYHNLDRFFDRL------GSEFTPQKHLKQEAETIMHQMM 268
Query: 143 XFVSTNASLYQEMEVLTDLEQTLKRMKAYTESD-------GLNLIEYQKRVEWSRVEVKN 195
FV A LY E+ L EQ +R E+ G L + ++ + V+N
Sbjct: 269 SFVHFTADLYHELHALDRFEQDYQRKIQEEENPSTAQRGVGDTLAILRTELKSQKKHVRN 328
Query: 196 LKANSLWNRTYDYTVHLLARSLFTIFSRINNLFG 229
LK SLW+R + + L + + I+ FG
Sbjct: 329 LKKKSLWSRILEEVMEKLVDVVHFLHLEIHEAFG 362
>AT5G08660.1 | Symbols: | Protein of unknown function (DUF668) |
chr5:2814526-2817845 FORWARD LENGTH=649
Length = 649
Score = 78.2 bits (191), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 51/144 (35%), Positives = 75/144 (52%), Gaps = 13/144 (9%)
Query: 431 LGAASLALHYANVIAVIEKLAASPHLIGLDARDDLYKMLPRRVRAALRSKLKPYSMASAV 490
LG A LALHYAN+I I+ L A I +ARD LY+ LP ++ ALRSK+K +++ +
Sbjct: 372 LGPAGLALHYANIIVQIDTLVARASSITSNARDSLYQSLPPGIKLALRSKIKSFNVDKEL 431
Query: 491 YDASLAEEWSEAMAGILEWLAPL------AHNMLRWQSERSFEQQCFVSRT---KVLLVQ 541
+ +E M L WL P+ AH+ W E + F S+ +L ++
Sbjct: 432 SVTQIKDE----MERTLHWLVPVAGNTTKAHHGFGWVGEWANTGTDFTSKPSGGDILRIE 487
Query: 542 TLYFADLEKTEAIITELLVGLNYV 565
TLY A EKTE I ++ L ++
Sbjct: 488 TLYHASKEKTEIYILGQIIWLQHL 511
Score = 67.0 bits (162), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 60/214 (28%), Positives = 109/214 (50%), Gaps = 18/214 (8%)
Query: 26 IGVLAFEVARLMSKLVNLWQSLSEKQVARLREEITNSVGIKKLVSDDENFVVRLISLEML 85
+G+LAFEVA + K NL +SLS++ + L+ I S G++ LVS+D + ++RL++ +
Sbjct: 149 LGILAFEVANTIVKSSNLIESLSKRNIEHLKGTILYSEGVQNLVSNDFDELLRLVAADKR 208
Query: 86 ENIAHVAESVARIGKKSSD---PSLKGFEDALVKCITFGDDSHGWKFTSXXXXXXXXXXX 142
+ + + V R G +S D +L+ + D + K +T +
Sbjct: 209 QELQVFSGEVVRFGNRSKDFQWHNLQRYFDRISKELT------PQRQLKEDAVLVVDQLM 262
Query: 143 XFVSTNASLYQEMEVLTDLEQTLKRMKAYTES-------DGLNLIEYQKRVEWSRVEVKN 195
V A LYQE++VL LE+ ++ + E+ DGL +++ + + + R VK+
Sbjct: 263 VLVQYTAELYQELQVLYRLEKDYEQKRREEENSANSSKGDGLAILKTELKAQ--RKVVKS 320
Query: 196 LKANSLWNRTYDYTVHLLARSLFTIFSRINNLFG 229
LK SLW+R ++ + L + + I+N+FG
Sbjct: 321 LKKKSLWSRGFEEVMEKLVDIVHFLLLEIHNIFG 354
>AT1G30755.1 | Symbols: | Protein of unknown function (DUF668) |
chr1:10905991-10908773 REVERSE LENGTH=615
Length = 615
Score = 60.5 bits (145), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 43/152 (28%), Positives = 70/152 (46%), Gaps = 17/152 (11%)
Query: 429 QSLGAASLALHYANVIAVIEKLAASPHLIGLDARDDLYKMLPRRVRAALRSKLKPYSMAS 488
+ LG A L+LHYAN+I I+ +A+ P + + RD LY LP V+ ALR +L+
Sbjct: 353 ERLGEAGLSLHYANLIQQIDNIASRPSSLPSNVRDTLYNALPATVKTALRPRLQTLDQEE 412
Query: 489 AVYDASLAEEWSEAMAGILEWLAPLAHNMLRWQSERSFEQQCFVSRTKV----------- 537
+ + E M L+WL P A N + + + SR +
Sbjct: 413 ELSVPEIKAE----MEKSLQWLVPFAENTTKAHQGFGWVGEWANSRIEFGKGKGKGENNG 468
Query: 538 --LLVQTLYFADLEKTEAIITELLVGLNYVWR 567
+QTL+ AD ++ + EL+V L+ + +
Sbjct: 469 NPTRLQTLHHADKPIVDSYVLELVVWLHRLMK 500
Score = 56.6 bits (135), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 52/209 (24%), Positives = 90/209 (43%), Gaps = 8/209 (3%)
Query: 26 IGVLAFEVARLMSKLVNLWQSLSEKQVARLREEITNSVGIKKLVSDDENFVVRLISLEML 85
+ +LAFEVA ++K L QSLSE+ + +++++ +S +KKLVS D + L + +
Sbjct: 134 VTILAFEVANTIAKGAALLQSLSEENLKFMKKDMLHSEEVKKLVSTDTTELQILAASDKR 193
Query: 86 ENIAHVAESVARIGKKSSDPSLKGFEDALVKCITFGDDSHGWKFTSXXXXXXXXXXXXFV 145
E + + V R G D + +K T ++ K
Sbjct: 194 EELDLFSGEVIRFGNMCKDLQWHNLDRYFMKLDT---ENSQHKLLKDDAEARMQELVTLA 250
Query: 146 STNASLYQEMEVLTDLEQTLKRMKAYTES-----DGLNLIEYQKRVEWSRVEVKNLKANS 200
+ LY E++ L EQ +R A ES G ++ Q ++ + VK+L+ S
Sbjct: 251 RITSELYHELQALDRFEQDYRRKLAEVESLNLPRRGEGIVILQNELKQQKKLVKSLQKKS 310
Query: 201 LWNRTYDYTVHLLARSLFTIFSRINNLFG 229
LW++ + L + I I +FG
Sbjct: 311 LWSQNLAEIIEKLVDVVSYIRQTIVEVFG 339