Miyakogusa Predicted Gene
- Lj0g3v0171149.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0171149.1 Non Chatacterized Hit- tr|I1HNA4|I1HNA4_BRADI
Uncharacterized protein OS=Brachypodium distachyon
GN=,35,3e-18,SUBFAMILY NOT NAMED,NULL; FAMILY NOT
NAMED,NULL,CUFF.10750.1
(544 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT3G03560.1 | Symbols: | unknown protein; LOCATED IN: plasma me... 571 e-163
AT5G23490.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 270 3e-72
AT5G08440.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 265 5e-71
AT5G08440.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 192 5e-49
AT5G23510.2 | Symbols: | unknown protein; LOCATED IN: cellular_... 187 1e-47
AT5G23510.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 182 7e-46
>AT3G03560.1 | Symbols: | unknown protein; LOCATED IN: plasma
membrane; EXPRESSED IN: 22 plant structures; EXPRESSED
DURING: 13 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G23490.1);
Has 157 Blast hits to 146 proteins in 38 species: Archae
- 3; Bacteria - 14; Metazoa - 8; Fungi - 0; Plants -
120; Viruses - 0; Other Eukaryotes - 12 (source: NCBI
BLink). | chr3:853153-856486 REVERSE LENGTH=521
Length = 521
Score = 571 bits (1471), Expect = e-163, Method: Compositional matrix adjust.
Identities = 295/536 (55%), Positives = 382/536 (71%), Gaps = 33/536 (6%)
Query: 21 SEILGRHNFETQLAQSNFKSNDALNHMQDQDTMELYSQARGQEEEILSLHEQIAIACMKE 80
SE + RH E S + +QD + M LY++ R QEEEI SL E+IA AC+K+
Sbjct: 7 SESIKRHEIEKDTIASRKLEDTNTKLIQDPEEMALYAKVRSQEEEIHSLQERIAAACLKD 66
Query: 81 MQLLNEKCKLERQFSELRMAVDDKQNEAITSASNELAQRKGYLEENLKLAHDLKVAEDER 140
MQLLNEK LER+ ++LR+A+D+KQNE++TSA NELA+RKG LEENLKLAHDLKV EDER
Sbjct: 67 MQLLNEKYGLERKCADLRVAIDEKQNESVTSALNELARRKGDLEENLKLAHDLKVTEDER 126
Query: 141 YIFMSSMLGLLAEYGLWPRVMNASSISNYVKHLHDQLQWRIRNSHDRIGELTAVLETHAD 200
YIFM+S+LGLLAEYG+WPRV NA++IS+ +KHLHDQLQW+ + +DRI EL++++E
Sbjct: 127 YIFMTSLLGLLAEYGVWPRVANATAISSGIKHLHDQLQWKTKACNDRIRELSSIVENQ-- 184
Query: 201 NGNPVVESPGSGNLTSHIHN--EFMFQHNYPQ----QNLTGNEQIPQPMSNITGYMNPVL 254
PG+ ++ H+ Q +Y + NEQ+ PM N+T NP
Sbjct: 185 --------PGTDFISKDNHDPRNSKTQASYGSTDRGNDYQTNEQLLPPMENVT--RNP-- 232
Query: 255 NGGYMNPIIDSDINRTFQRLNQEISKADREVSSSFHHDS----IDKIGAHE--RTRERNF 308
Y N + D++ R N +I + + ++ + + E + RE
Sbjct: 233 ---YHNIMQDTE----SLRFNNQIGGGSQGIFPQPKRENFGYPLSSVAGKEMIQEREEKA 285
Query: 309 VNGKLYQPPPEHDETASSVSEDGPGIENFQICGEAIPGEKLLGCGYPVRGTSLCMFQWVR 368
N ++ ++E AS V E+GPGI+ FQI G+AIPGEK+LGCG+PVRGT+LCMFQWVR
Sbjct: 286 ENSSMFDAYNGNEEFASHVYEEGPGIDGFQIIGDAIPGEKVLGCGFPVRGTTLCMFQWVR 345
Query: 369 HLQDGTRQYIEGATNPEYVVTADDVDKLIAVECIPMDDKGHQGELVRLFANDQNKIKCDP 428
HL+DGTRQYIEGAT+PEY+VTADDVDKLIAVECIPMDD+G QGELVRLFANDQNKI+CD
Sbjct: 346 HLEDGTRQYIEGATHPEYIVTADDVDKLIAVECIPMDDQGRQGELVRLFANDQNKIRCDT 405
Query: 429 EMQLEIDTNLAKGEATFSVLLLMDSSENWEQATLFLRRSGYQIKISGTEGPVVAEKFSKD 488
EMQ EIDT +++G+A+F+V LLMDSSE+WE AT+ L+RS YQIK + TE V++EK+SK+
Sbjct: 406 EMQTEIDTYISRGQASFNVQLLMDSSESWEPATVVLKRSSYQIKTNTTEAVVISEKYSKE 465
Query: 489 LSIKVPCGLSTQFVLTCSNGSSHPLSTYSVRMRDTLVLTMRIFQSKALDDKRKGRA 544
L I+VP G STQFVL +GSSHP+ST +VRMRDTLVLTMR+ QSKALD++RKGR
Sbjct: 466 LQIRVPSGESTQFVLISYDGSSHPISTLNVRMRDTLVLTMRMLQSKALDERRKGRV 521
>AT5G23490.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G08440.1); Has 202 Blast hits to 197 proteins
in 48 species: Archae - 0; Bacteria - 13; Metazoa - 25;
Fungi - 9; Plants - 109; Viruses - 0; Other Eukaryotes -
46 (source: NCBI BLink). | chr5:7919831-7926499 FORWARD
LENGTH=729
Length = 729
Score = 270 bits (689), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 173/539 (32%), Positives = 281/539 (52%), Gaps = 51/539 (9%)
Query: 44 LNHMQDQDTMELYSQARGQEEEILSLHEQIAIACMKEMQLLNEKCKLERQFSELRMAVDD 103
+NH + +L + + QE+EI L + +KE Q+ NEK LE++ + +R+A D
Sbjct: 195 VNHSGNAWKQDLIHKVQEQEQEISQLRRYLTDCSVKEAQIRNEKYVLEKRIAYMRLAFDQ 254
Query: 104 KQNEAITSASNELAQRKGYLEENLKLAHDLKVAEDERYIFMSSMLGLLAEYGLWPRVMNA 163
+Q + + ++S L+ R+ +EEN++L + L+ + ER F+S +L LL+EY L P+V +A
Sbjct: 255 QQQDLVDASSKALSYRQEIIEENIRLTYALQATQQERSTFVSYLLPLLSEYSLQPQVSDA 314
Query: 164 SSISNYVKHLHDQLQWRIRNSHDRIGELTAVL-----ETHADNGNPVVESPGSGNLTSHI 218
SI + VK L LQ ++ + ++ E L + + N +P+ S +G +H
Sbjct: 315 QSIVSNVKVLFKHLQEKLLLTETKLKESEYQLAPWQSDVNHSNDSPLAPSRSAGVALTHS 374
Query: 219 HNEFMFQHNYP---------QQNLTGNEQIPQPMSNITGYMNPVLNGG------YMNPII 263
+ M+ H++ QQ+ G+ + + + +P+ N ++ P
Sbjct: 375 TKDSMYSHDHTAIDWNLERQQQDEPGSSAVRNYHLDDSSTFSPLENSQSAAFEMHVQPGT 434
Query: 264 DSDINRTFQRLNQEISKADREVSSSFHHDSIDKIGAHERTRERNFVNGKLYQPPPE---- 319
D + +++++ K H ++ I +N G + P
Sbjct: 435 SVDESPAHKKVDETPPK---------HVQFLEPISKTVVDDAQNPSYGSAFDDPSSSNSP 485
Query: 320 -----HDETASSVSEDG-----PGIENFQICGEAIPGEKLLGCGYPVRGTSLCMFQWVRH 369
+E +SS SE G PGIE+ QI GE PG +L CGY + GT+ C F+WV H
Sbjct: 486 LLSPVFEEPSSSFSEGGDDDPLPGIEDLQISGEPYPGHELQACGYSINGTTSCNFEWVCH 545
Query: 370 LQDGTRQYIEGATNPEYVVTADDVDKLIAVECIPMDDKGHQGELVRLFANDQNKIKCDPE 429
L+DG+ YI+GA P Y+VTADDVD +A+E P+DD+ +GELV++FAND KI C P+
Sbjct: 546 LEDGSVNYIDGAKQPNYLVTADDVDLYLAIEVQPLDDRNRKGELVKVFANDNRKIACHPD 605
Query: 430 MQLEIDTNLAKGEATFSVLLLMDSSENWEQATLFLRRSGYQIKISGTEGPVVAEKFSKDL 489
MQ I+ L G A++ V L + + WE ATL ++R GY IK +AEKFS
Sbjct: 606 MQSNIEKTLHTGHASYKVSLAVGFVDIWEAATLSIKREGYSIKC--ISDLTIAEKFSAST 663
Query: 490 SIKVPCGLSTQFVLTCSNGSSHPL-----STYSVRMRDTLVLTMRIFQSKALDDKRKGR 543
++ +P G + V+ S+GS H L S + RD +VLT+R+F +AL ++KG+
Sbjct: 664 TVTIPFGQPAELVIIGSDGSEHSLRADNGSPDLIGSRDEIVLTLRLFIKRAL-QRKKGK 721
>AT5G08440.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: plasma membrane;
EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 13
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT5G23490.1); Has 141 Blast
hits to 139 proteins in 35 species: Archae - 0; Bacteria
- 9; Metazoa - 21; Fungi - 6; Plants - 94; Viruses - 0;
Other Eukaryotes - 11 (source: NCBI BLink). |
chr5:2721037-2726970 FORWARD LENGTH=726
Length = 726
Score = 265 bits (678), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 175/512 (34%), Positives = 264/512 (51%), Gaps = 46/512 (8%)
Query: 54 ELYSQARGQEEEILSLHEQIAIACMKEMQLLNEKCKLERQFSELRMAVDDKQNEAITSAS 113
EL + + Q++EIL L + +A KE+Q+ NEK LE++ + +R A D +Q + + +AS
Sbjct: 218 ELIHKVQEQDQEILRLRKYLADYSTKEVQIRNEKYVLEKRIAHMRSAFDQQQQDLVDAAS 277
Query: 114 NELAQRKGYLEENLKLAHDLKVAEDERYIFMSSMLGLLAEYGLWPRVMNASSISNYVKHL 173
L+ R+ +EEN++L + L+ AE ER +F+S +L LL+EY L P++ ++ SI + VK L
Sbjct: 278 KALSYRQEIIEENIRLTYALQAAEQERSLFVSILLPLLSEYSLHPQISDSQSIVSSVKVL 337
Query: 174 HDQLQWRIRNSHDRIGELTAVLETHADNGNPVVESPGSGNLTSHIHNEFMFQHNYPQQNL 233
LQ ++ + ++ E L + N SP S + + + Q+
Sbjct: 338 FRHLQEKLNVTETKLKETEYQLAPWQSDVNHSNASPLSPYQPVGVGLRYSTDSEHHHQDR 397
Query: 234 TG-----NEQIPQPMSNITGYMNPVLNGGYMNPIIDSDI-----NRTFQRLNQEISKADR 283
G N + P S + PV P ++ D NR R
Sbjct: 398 RGGSAASNYHLDGPESRSPAFQMPV------QPALNQDESHGPNNRVQFR---------E 442
Query: 284 EVSSSFHHDSIDKIGAHERTRERNFVNGKLYQPPPEH--------DETASSVSEDG---- 331
+S++F D+ + A T N + P P + +E +SS SE
Sbjct: 443 PLSNTFMDDAYADVQADSNTTLENSTYVAVDDPSPSNYPILAPVLEEPSSSFSEAADDDP 502
Query: 332 -PGIENFQICGEAIPGEKLLGCGYPVRGTSLCMFQWVRHLQDGTRQYIEGATNPEYVVTA 390
PGI + QI GE PG +L G+ + GT+ C F+WVRHL+DG+ YI+GA P+Y+VTA
Sbjct: 503 LPGIADLQISGEPFPGRELQVSGHSINGTTKCNFEWVRHLEDGSVNYIDGAKRPDYLVTA 562
Query: 391 DDVDKLIAVECIPMDDKGHQGELVRLFANDQNKIKCDPEMQLEIDTNLAKGEATFSVLLL 450
DDVD +A+E P+DDK +GELVR+FAN+ KI C PEMQ I+ +L G A F V
Sbjct: 563 DDVDLYLAIEVHPLDDKNRKGELVRVFANENCKITCHPEMQSHIEKSLYNGHALFKVSYS 622
Query: 451 MDSSENWEQATLFLRRSGYQIKISGTEGPVVAEKFSKDLSIKVPCGLSTQFVLTCSNGSS 510
+ + WE ATL +++ GY IK T PV+ EKFS +I +P FV+ ++G
Sbjct: 623 IGYLDIWEAATLSIKKEGYSIK--PTNDPVITEKFSSSTNIVIPFDQPADFVIIGTDGEE 680
Query: 511 HPL------STYSVRMRDTLVLTMRIFQSKAL 536
H +T RDT+VLT+R+F K L
Sbjct: 681 HLCRVVDNDATDLSCSRDTIVLTLRLFLKKTL 712
>AT5G08440.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; EXPRESSED IN: 21 plant
structures; EXPRESSED DURING: 13 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT5G23490.1). | chr5:2721037-2726970 FORWARD
LENGTH=772
Length = 772
Score = 192 bits (488), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 108/271 (39%), Positives = 151/271 (55%), Gaps = 21/271 (7%)
Query: 285 VSSSFHHDSIDKIGAHERTRERNFVNGKLYQPPPEH--------DETASSVSEDG----- 331
+S++F D+ + A T N + P P + +E +SS SE
Sbjct: 490 LSNTFMDDAYADVQADSNTTLENSTYVAVDDPSPSNYPILAPVLEEPSSSFSEAADDDPL 549
Query: 332 PGIENFQICGEAIPGEKLLGCGYPVRGTSLCMFQWVRHLQDGTRQYIEGATNPEYVVTAD 391
PGI + QI GE PG +L G+ + GT+ C F+WVRHL+DG+ YI+GA P+Y+VTAD
Sbjct: 550 PGIADLQISGEPFPGRELQVSGHSINGTTKCNFEWVRHLEDGSVNYIDGAKRPDYLVTAD 609
Query: 392 DVDKLIAVECIPMDDKGHQGELVRLFANDQNKIKCDPEMQLEIDTNLAKGEATFSVLLLM 451
DVD +A+E P+DDK +GELVR+FAN+ KI C PEMQ I+ +L G A F V +
Sbjct: 610 DVDLYLAIEVHPLDDKNRKGELVRVFANENCKITCHPEMQSHIEKSLYNGHALFKVSYSI 669
Query: 452 DSSENWEQATLFLRRSGYQIKISGTEGPVVAEKFSKDLSIKVPCGLSTQFVLTCSNGSSH 511
+ WE ATL +++ GY IK T PV+ EKFS +I +P FV+ ++G H
Sbjct: 670 GYLDIWEAATLSIKKEGYSIK--PTNDPVITEKFSSSTNIVIPFDQPADFVIIGTDGEEH 727
Query: 512 PL------STYSVRMRDTLVLTMRIFQSKAL 536
+T RDT+VLT+R+F K L
Sbjct: 728 LCRVVDNDATDLSCSRDTIVLTLRLFLKKTL 758
Score = 89.4 bits (220), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 46/118 (38%), Positives = 78/118 (66%)
Query: 54 ELYSQARGQEEEILSLHEQIAIACMKEMQLLNEKCKLERQFSELRMAVDDKQNEAITSAS 113
EL + + Q++EIL L + +A KE+Q+ NEK LE++ + +R A D +Q + + +AS
Sbjct: 218 ELIHKVQEQDQEILRLRKYLADYSTKEVQIRNEKYVLEKRIAHMRSAFDQQQQDLVDAAS 277
Query: 114 NELAQRKGYLEENLKLAHDLKVAEDERYIFMSSMLGLLAEYGLWPRVMNASSISNYVK 171
L+ R+ +EEN++L + L+ AE ER +F+S +L LL+EY L P++ ++ SI + VK
Sbjct: 278 KALSYRQEIIEENIRLTYALQAAEQERSLFVSILLPLLSEYSLHPQISDSQSIVSSVK 335
>AT5G23510.2 | Symbols: | unknown protein; LOCATED IN:
cellular_component unknown; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G23490.1);
Has 30201 Blast hits to 17322 proteins in 780 species:
Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi -
3422; Plants - 5037; Viruses - 0; Other Eukaryotes -
2996 (source: NCBI BLink). | chr5:7927055-7929117
FORWARD LENGTH=306
Length = 306
Score = 187 bits (476), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 95/215 (44%), Positives = 128/215 (59%), Gaps = 6/215 (2%)
Query: 332 PGIENFQICGEAIPGEKLLGCGYPVRGTSLCMFQWVRHLQDGTRQYIEGATNPEYVVTAD 391
P +EN QI GE PG +L CGY + GT+ C F+WV HL+DG+ YI+GA P Y+VTAD
Sbjct: 84 PALENLQISGEPYPGHELQACGYSINGTTSCNFEWVCHLEDGSVNYIDGAKKPNYLVTAD 143
Query: 392 DVDKLIAVECIPMDDKGHQGELVRLFANDQNKIKCDPEMQLEIDTNLAKGEATFSVLLLM 451
DV +A+E P+DD+ +GELV++FAND KI C PEMQ ID L G A++ V L +
Sbjct: 144 DVGLCLAIEVQPLDDRNRKGELVKVFANDNRKIACHPEMQSNIDKTLHTGHASYKVSLAI 203
Query: 452 DSSENWEQATLFLRRSGYQIKISGTEGPVVAEKFSKDLSIKVPCGLSTQFVLTCSNGSSH 511
WE ATL + R GY IK + + EKFS ++K+P + V+ S+GS H
Sbjct: 204 GFVHIWEAATLSIEREGYTIKCNNDL--TITEKFSASTAVKIPFEKPAELVIIGSDGSEH 261
Query: 512 PLSTYS----VRMRDTLVLTMRIFQSKALDDKRKG 542
L + + RD +VLT+R F AL +KG
Sbjct: 262 CLRVDNEWPDISSRDEIVLTLRSFIKTALQRGKKG 296
>AT5G23510.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G23490.1); Has 35333 Blast hits to 34131
proteins in 2444 species: Archae - 798; Bacteria -
22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
- 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
chr5:7927266-7929016 FORWARD LENGTH=271
Length = 271
Score = 182 bits (461), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 91/204 (44%), Positives = 123/204 (60%), Gaps = 6/204 (2%)
Query: 332 PGIENFQICGEAIPGEKLLGCGYPVRGTSLCMFQWVRHLQDGTRQYIEGATNPEYVVTAD 391
P +EN QI GE PG +L CGY + GT+ C F+WV HL+DG+ YI+GA P Y+VTAD
Sbjct: 45 PALENLQISGEPYPGHELQACGYSINGTTSCNFEWVCHLEDGSVNYIDGAKKPNYLVTAD 104
Query: 392 DVDKLIAVECIPMDDKGHQGELVRLFANDQNKIKCDPEMQLEIDTNLAKGEATFSVLLLM 451
DV +A+E P+DD+ +GELV++FAND KI C PEMQ ID L G A++ V L +
Sbjct: 105 DVGLCLAIEVQPLDDRNRKGELVKVFANDNRKIACHPEMQSNIDKTLHTGHASYKVSLAI 164
Query: 452 DSSENWEQATLFLRRSGYQIKISGTEGPVVAEKFSKDLSIKVPCGLSTQFVLTCSNGSSH 511
WE ATL + R GY IK + + EKFS ++K+P + V+ S+GS H
Sbjct: 165 GFVHIWEAATLSIEREGYTIKCNNDL--TITEKFSASTAVKIPFEKPAELVIIGSDGSEH 222
Query: 512 PLSTYS----VRMRDTLVLTMRIF 531
L + + RD +VLT+R F
Sbjct: 223 CLRVDNEWPDISSRDEIVLTLRSF 246