Miyakogusa Predicted Gene
- Lj0g3v0164069.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0164069.1 Non Chatacterized Hit- tr|C5X2U4|C5X2U4_SORBI
Putative uncharacterized protein Sb02g008135
(Fragment,45.45,1e-18,seg,NULL; SUBFAMILY NOT NAMED,NULL; FAMILY NOT
NAMED,NULL; DUF793,Protein BYPASS-related,CUFF.10250.1
(386 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT1G18740.1 | Symbols: | Protein of unknown function (DUF793) |... 454 e-128
AT1G74450.1 | Symbols: | Protein of unknown function (DUF793) |... 447 e-126
AT1G43630.1 | Symbols: | Protein of unknown function (DUF793) |... 425 e-119
AT1G63930.1 | Symbols: ROH1 | from the Czech 'roh' meaning 'corn... 275 5e-74
AT4G11300.1 | Symbols: | Protein of unknown function (DUF793) |... 206 3e-53
AT4G23530.1 | Symbols: | Protein of unknown function (DUF793) |... 193 2e-49
>AT1G18740.1 | Symbols: | Protein of unknown function (DUF793) |
chr1:6464125-6465273 FORWARD LENGTH=382
Length = 382
Score = 454 bits (1167), Expect = e-128, Method: Compositional matrix adjust.
Identities = 231/394 (58%), Positives = 295/394 (74%), Gaps = 30/394 (7%)
Query: 5 AEFQGS----ILSFRGAQVHSM----------EGSTFDPELDSFQKHVTDRFSELSSVSN 50
+FQGS +LS R QV S E ST + ELDSFQ+ V ++F +L++ SN
Sbjct: 4 TDFQGSFGRSLLSLRRDQVDSSTVVSGSSSHHEPSTMEVELDSFQRQVAEKFIDLNASSN 63
Query: 51 DELLSLSWIRKLLDSFICCQEEFRMILHNHKALVVKSPLDRMVGDFFERSVKALDVCNAI 110
D LLSL WI KLLDSF+CCQEEFR I+ NH++ + KSP+DR++ D+FERS+KALDVCNAI
Sbjct: 64 D-LLSLEWIGKLLDSFLCCQEEFRAIVFNHRSQISKSPMDRLISDYFERSIKALDVCNAI 122
Query: 111 RDGIEQIRQWKKLLEIVLCALDQKKNIGEGQFRRAKKALVDLAIGMLDDKD-SNANSIAH 169
RDGIEQIRQW+KL +IV+ ALD + IGEGQ RRAKKAL+DLAIGMLD+KD + ++AH
Sbjct: 123 RDGIEQIRQWEKLADIVISALDSHRPIGEGQLRRAKKALIDLAIGMLDEKDHPSGTNLAH 182
Query: 170 RNRSFGRXXXXXXXXXXXXXXXLLGSSSTFGQFRSLSWSVSRNWSAARQLQAIGNNLSPP 229
RNRSFGR + G FRSLSWSVSR+WSA++QLQA+ +NL+ P
Sbjct: 183 RNRSFGRVKD--------------SHHRSIGHFRSLSWSVSRSWSASKQLQALASNLATP 228
Query: 230 KASEIVATNGLALVVYTMSSILLFTMWALVAAIPCQDRGLNLHFSVPRQFLWAAPMTSLH 289
+ +++VA+NGLA+ VYTM+S+LLF MW LVAAIPCQDRGL ++F VPR F WAAP+ SLH
Sbjct: 229 RPNDVVASNGLAVPVYTMTSVLLFVMWVLVAAIPCQDRGLQVNFFVPRHFQWAAPVMSLH 288
Query: 290 DRILEESRKRERKNSCGLMREIQQIEKCARAMNELGDSVQFPLSEKKEEEVRHRVQDVVS 349
D+I+EES++R+RKN CGL++EI +IEK +R MNEL DS+ FPL++ KE EV+ RV ++V
Sbjct: 289 DKIVEESKRRDRKNCCGLLKEIDRIEKSSRLMNELIDSIHFPLNDDKEVEVKQRVDELVQ 348
Query: 350 VCEDLKEGLDPLECQVREVFRRIVCSRMDGLDSL 383
V E L+ GLDP E +VREVF RIV SR + LDSL
Sbjct: 349 VREALRNGLDPFERKVREVFHRIVRSRTESLDSL 382
>AT1G74450.1 | Symbols: | Protein of unknown function (DUF793) |
chr1:27982737-27983930 FORWARD LENGTH=397
Length = 397
Score = 447 bits (1149), Expect = e-126, Method: Compositional matrix adjust.
Identities = 229/387 (59%), Positives = 295/387 (76%), Gaps = 29/387 (7%)
Query: 15 RGAQVHSMEGSTFDPELD-------SFQKHVTDRFSELSSVSNDELLSLSWIRKLLDSFI 67
R V+S+E +T PEL SFQ+ V +RF +L++ S ++LLSL W+ KLLDSF+
Sbjct: 19 RDTAVNSVESTTVTPELTQMEAELVSFQRKVAERFIDLNASSCEDLLSLEWVGKLLDSFL 78
Query: 68 CCQEEFRMILHNHKALVVKSPLDRMVGDFFERSVKALDVCNAIRDGIEQIRQWKKLLEIV 127
CQEEFR I+ NH++++ K P+DR+V D+FERSVKALDVCNAIRDG+EQIRQW+KL+EIV
Sbjct: 79 SCQEEFRSIVINHRSMITKPPMDRLVSDYFERSVKALDVCNAIRDGVEQIRQWQKLIEIV 138
Query: 128 LCALDQ-------KKNIGEGQFRRAKKALVDLAIGMLDDKDSNANSIA--HRNRSFGRXX 178
+CA + K+ +GEGQFRRA+K L++LAIGMLD+KDS+++S++ HRNRSFGR
Sbjct: 139 ICAFNNNGGGSSGKRPLGEGQFRRARKTLIELAIGMLDEKDSSSSSVSSQHRNRSFGRNK 198
Query: 179 XXXXXXXXXXXXXLLGSSSTFGQFRSLSWSVSRNWSAARQLQAIGNNLSPPKASEIVATN 238
T G FRSLSWSVSR+WSA++QLQAIGNNL+ P+AS+I ATN
Sbjct: 199 EQL-------------HHRTIGHFRSLSWSVSRSWSASKQLQAIGNNLATPRASDITATN 245
Query: 239 GLALVVYTMSSILLFTMWALVAAIPCQDRGLNLHFSVPRQFLWAAPMTSLHDRILEESRK 298
GL + VYTM+++LLF MWALVAAIPCQDRGL +HF+VPR + W + SLHDRI+EES+K
Sbjct: 246 GLIVPVYTMTTVLLFVMWALVAAIPCQDRGLQVHFNVPRNYQWGGSLMSLHDRIIEESKK 305
Query: 299 RERKNSCGLMREIQQIEKCARAMNELGDSVQFPLSEKKEEEVRHRVQDVVSVCEDLKEGL 358
RERKN+CGL++EI Q EK +R MNEL DSVQFPLSE+KE EVR RV+++ + E LK GL
Sbjct: 306 RERKNTCGLLKEIHQFEKTSRLMNELVDSVQFPLSEEKEMEVRERVEELGKLQEALKNGL 365
Query: 359 DPLECQVREVFRRIVCSRMDGLDSLGR 385
DP E +VREVF RIV SR +GLD++G+
Sbjct: 366 DPFERKVREVFHRIVRSRTEGLDTVGK 392
>AT1G43630.1 | Symbols: | Protein of unknown function (DUF793) |
chr1:16436445-16437686 REVERSE LENGTH=413
Length = 413
Score = 425 bits (1093), Expect = e-119, Method: Compositional matrix adjust.
Identities = 217/393 (55%), Positives = 293/393 (74%), Gaps = 28/393 (7%)
Query: 3 TKAEFQGSILSFRGAQVHSMEGSTF------DPELDSFQKHVTDRFSELSSVSND-ELLS 55
T+ F S LS R Q H M+ ++F + ELDSFQ+ V ++F +L++ +++ E+LS
Sbjct: 34 TEYSFGRSFLSLRRDQAHLMDPTSFSEPMTMEVELDSFQRQVAEKFIDLNASADEAEILS 93
Query: 56 LSWIRKLLDSFICCQEEFRMILHNHKALVVKSPLDRMVGDFFERSVKALDVCNAIRDGIE 115
L WI KLLDSF+CCQE+FR+I+ NHK ++K P+DR++ ++FERSVKALDVCNAIRDGIE
Sbjct: 94 LEWIGKLLDSFLCCQEDFRVIIFNHKPQLLKQPMDRLIEEYFERSVKALDVCNAIRDGIE 153
Query: 116 QIRQWKKLLEIVLCALD-QKKNIGEGQFRRAKKALVDLAIGMLDDKDSNANSIAHRNRSF 174
QIRQW+KL+EIV+ ALD ++ +GEG+ RAKKAL+DLAIGMLD+KDS S HRNRSF
Sbjct: 154 QIRQWQKLIEIVISALDTNQRQLGEGEIHRAKKALIDLAIGMLDEKDS---SNTHRNRSF 210
Query: 175 GRXXXXXXXXXXXXXXXLLGSSSTFGQFRSLSWSVSRNWSAARQLQAIGNNLSPPKASEI 234
R + G RSLSWSVSR+WSA+RQLQ IGNNL+ P+AS++
Sbjct: 211 TRNK---------------DHNQHIGYIRSLSWSVSRSWSASRQLQGIGNNLATPRASDV 255
Query: 235 VATNGLALVVYTMSSILLFTMWALVAAIPCQDRGLNLHFSVPRQFLWAAPMTSLHDRILE 294
+ATNGLAL VYTM+SILLF W LVAAIPCQDRGL++HF PR F WA P+ SLHD+I++
Sbjct: 256 MATNGLALTVYTMTSILLFVTWVLVAAIPCQDRGLHVHFYFPRHFQWAVPVMSLHDKIMD 315
Query: 295 ES-RKRERKNSCGLMREIQQIEKCARAMNELGDSVQFPLSEKK-EEEVRHRVQDVVSVCE 352
ES ++ ++K CGL+REI QIE+ +R +++L DS F L+++K EV+ RVQ++++VCE
Sbjct: 316 ESKKRDKKKKGCGLLREINQIERNSRMLSDLIDSDNFSLTDEKCTLEVKERVQELMNVCE 375
Query: 353 DLKEGLDPLECQVREVFRRIVCSRMDGLDSLGR 385
+KEGLDP + +VR+VF +IV +R + LDSLG+
Sbjct: 376 AIKEGLDPFDRKVRDVFHQIVRTRTEALDSLGK 408
>AT1G63930.1 | Symbols: ROH1 | from the Czech 'roh' meaning 'corner'
| chr1:23727927-23729174 FORWARD LENGTH=415
Length = 415
Score = 275 bits (702), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 150/416 (36%), Positives = 237/416 (56%), Gaps = 45/416 (10%)
Query: 1 MRTKAEFQGSIL---SFRGAQVHSMEGSTFDPELDSFQKHVTDRFSEL------------ 45
MR + QGS L S R Q + +L+ FQKH+ DRF+EL
Sbjct: 1 MRPAQDNQGSFLGRISIRRNQFVDVNNEQEQEDLELFQKHIADRFTELLSPPQPPPSDEI 60
Query: 46 ----SSVSNDELLSLSWIRKLLDSFICCQEEFRMIL--HNHKALVVKSPLDRMVGDFFER 99
S + ++++S++W+RKL+D F+CC+ EF+ IL + K P DR+V + +R
Sbjct: 61 NTVASVAATEQIMSVTWLRKLMDVFLCCEAEFKAILLMGRDPTQISKPPFDRLVPEMLDR 120
Query: 100 SVKALDVCNAIRDGIEQIRQWKKLLEIVLCALDQKKNIGEGQFRRAKKALVDLAIGM-LD 158
S+KALD+C A+ +GI+ +R +++L EI + AL+Q+ +G+G RRAK+AL +L + + L+
Sbjct: 121 SIKALDICTAVVNGIDSVRHYQRLAEIAVTALEQRP-LGDGNVRRAKRALANLVVALSLE 179
Query: 159 DKDS---------NANSIAHRNRSFGRXXXXXXXXXXXXXXXLLGSSSTFGQFRSLSWSV 209
DK++ N R+ SFGR +T GQ +S SW+V
Sbjct: 180 DKENVSGGGGGGGGGNKTTERSWSFGRRSGGSSAASK--------GGATIGQLKSSSWAV 231
Query: 210 SRNWSAARQLQAIGNNLSPPKASEIVATNGLALVVYTMSSILLFTMWALVAAIPCQDR-G 268
RNWSAA+Q+ A+ NL+PP+ +E GL ++ MS++++F MW L AA+PCQ+R G
Sbjct: 232 GRNWSAAKQIHAMTANLTPPRGNEAA---GLPQPMFIMSTVMVFVMWVLTAAVPCQERSG 288
Query: 269 LNLHFSVPRQFL-WAAPMTSLHDRILEESRKRERKNSCGLMREIQQIEKCARAMNELGDS 327
L H VP + L WA + +H++I +E +K+E+K S GLM E+ ++EK ++ E D
Sbjct: 289 LANHLPVPPKHLNWAQSLIGIHEKIGDEWKKKEKKGSAGLMEEMTRMEKLGHSLMEFADG 348
Query: 328 VQFPLSEKKEEEVRHRVQDVVSVCEDLKEGLDPLECQVREVFRRIVCSRMDGLDSL 383
+P + E +V ++ +C ++E L PL+ Q+REVF RIV SR + L+ L
Sbjct: 349 FHYPAEKDAAESAAVQVAEMAEICRRMEEELVPLQQQIREVFHRIVRSRAEILEVL 404
>AT4G11300.1 | Symbols: | Protein of unknown function (DUF793) |
chr4:6872345-6873460 REVERSE LENGTH=371
Length = 371
Score = 206 bits (523), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 134/392 (34%), Positives = 209/392 (53%), Gaps = 32/392 (8%)
Query: 1 MRTKAEFQGSILS--FRGAQVHSME--GSTFDPELDSFQKHVTDRFSELSSVSNDE---- 52
M E Q S LS R QV SME EL+ FQKHV +RF+EL S+
Sbjct: 1 MAPATEHQSSFLSRLSRRNQVVSMEVNHEQEQEELEDFQKHVAERFAELLPPSDSPESYP 60
Query: 53 LLSLSWIRKLLDSFICCQEEFRMILHNHKALVVKSPLDRMVGDFFERSVKALDVCNAIRD 112
+LS+ W+RKLLD F+ + EF +L ++ + + K PLD++V + +R VKALD+C A+ +
Sbjct: 61 ILSIQWLRKLLDVFMSIESEFHSVLTSNPSQISKPPLDKLVPEMLDRIVKALDICTAVVN 120
Query: 113 GIEQIRQWKKLLEIVLCALDQKKNIGEGQFRRAKKALVDLAIGMLDDKDSNANSIAHRNR 172
G++ +RQ ++ EI + AL Q + +G RRAK+AL L + DK N
Sbjct: 121 GVDSVRQIQRCAEIAVTALKQTP-LSDGSVRRAKRALTSLLAALNADK----------NS 169
Query: 173 SFGRXXXXXXXXXXXXXXXLLGSSSTFGQFRSLSWSVSRNWSAARQLQAIGNNLSPPKAS 232
S + G + VS+NWSAA+Q+QA+ NL P+
Sbjct: 170 GSSGGGSGRRSSTDQWSSFGRRSGGSSGGGGGGAGCVSKNWSAAKQIQAMTANLVAPRGG 229
Query: 233 EIVATNGLALVVYTMSSILLFTMWALVAAIPCQ-DRGLNLHFSVPRQFLWAAPMTSLHDR 291
E A +Y MSS+++ MW LV A+PCQ GL +H +P+ +WA S+ +R
Sbjct: 230 E-------ASPMYIMSSVMVMVMWTLVVAVPCQTSNGLMVHVPLPKNQVWANAAVSISER 282
Query: 292 ILEESRKRERKNSCGLMREIQQIEKCARAMNELGDSVQFPLSEKKEEEVRHRVQDVVSVC 351
+ EE +++E + GLM E+Q++E+ + E + +F EE+V V ++ +C
Sbjct: 283 VGEEMKRKETRGG-GLMEEMQRMERIGLKLMEFSEGFRF----NGEEDVVAEVAEMEEIC 337
Query: 352 EDLKEGLDPLECQVREVFRRIVCSRMDGLDSL 383
+++GL+ L+ +VREVF R+V SR + L+ +
Sbjct: 338 RKMEDGLEGLQRRVREVFHRLVKSRSEILEVI 369
>AT4G23530.1 | Symbols: | Protein of unknown function (DUF793) |
chr4:12279171-12280361 FORWARD LENGTH=396
Length = 396
Score = 193 bits (491), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 143/410 (34%), Positives = 229/410 (55%), Gaps = 57/410 (13%)
Query: 1 MRTKAEFQGSILS---FRGAQVHSMEGSTFDP--ELDSFQKHVTDRFSELSSVSN----- 50
M EFQGS LS R Q+ SM+ + EL+ FQKHV +RFS+L + +
Sbjct: 1 MARATEFQGSFLSRISIRRNQIVSMDVNHEQELEELEYFQKHVAERFSDLITSPSPPPSS 60
Query: 51 ---------DELLSLSWIRKLLDSFICCQEEFRMILHNHKALVVKSP-LDRMVGDFFERS 100
D +LS+ W++ LLD F+ C+ EF+ +L + + KSP L+R++ + +R
Sbjct: 61 SSSAVSQPSDPILSIPWLQNLLDVFMSCEAEFKAVLSTTQ--ISKSPSLERVLPEMLDRI 118
Query: 101 VKALDVCNAIRDGIEQIRQWKKLLEIVLCALDQKKNIGEGQFRRAKKALVDLAIGMLDDK 160
+KALD+CNA+ +GI+ +RQ ++ EI + AL Q+ + +G RRAK+AL L IG+ D+
Sbjct: 119 LKALDLCNAVVNGIDSVRQSRRFAEIAVTALKQRP-LCDGSVRRAKRALTSLLIGLNADE 177
Query: 161 DSNANS----------IAHRNRSFGRXXXXXXXXXXXXXXXLLGSSSTFGQFRSLSWSVS 210
+ NS R+ SFG + +GQ VS
Sbjct: 178 RRDRNSGGSGCSNQRRTTSRSWSFGTRSN-------------VTGGGLYGQV------VS 218
Query: 211 RNWSAARQLQAIGNNLSPPKASEIVATNGLALVVYTMSSILLFTMWALVAAIPCQDRG-L 269
+NWSA++Q+QA+ NL P+ +E +G A+ VY MSS+++ MW LVAA+PCQ L
Sbjct: 219 KNWSASKQIQAMVANLVLPRGAE---ASGPAMPVYIMSSVMVLVMWVLVAAVPCQTSSVL 275
Query: 270 NLHFSVPRQFLWAAPMTSLHDRILEESRKRERK-NSCGLMREIQQIEKCARAMNELGDSV 328
+P+ WA+ S+ +RI EE +++E++ GLM E+Q++EK ++ E +
Sbjct: 276 VAPLPLPKHQNWASAAMSIQERIGEEIKRKEKRCGGGGLMEEMQRMEKIGLSLMEFAERF 335
Query: 329 QFPLSEKKEEEVRHRVQDVVSVCEDLKEGLDPLECQVREVFRRIVCSRMD 378
+FP E++E EV +V ++ +C ++ GL+ L+ QVR+VF R+V SR++
Sbjct: 336 RFPADEEEEVEVAEKVDEMEEICRRMEVGLEDLQRQVRQVFHRLVRSRIE 385