Miyakogusa Predicted Gene
- Lj5g3v1949730.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj5g3v1949730.1 Non Chatacterized Hit- tr|C5Y570|C5Y570_SORBI
Putative uncharacterized protein Sb05g003910
OS=Sorghu,61.96,3e-18,DUF793,Protein BYPASS-related; SUBFAMILY NOT
NAMED,NULL; FAMILY NOT NAMED,NULL; seg,NULL,CUFF.56206.1
(400 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT1G74450.1 | Symbols: | Protein of unknown function (DUF793) |... 495 e-140
AT1G18740.1 | Symbols: | Protein of unknown function (DUF793) |... 484 e-137
AT1G43630.1 | Symbols: | Protein of unknown function (DUF793) |... 459 e-129
AT1G63930.1 | Symbols: ROH1 | from the Czech 'roh' meaning 'corn... 277 1e-74
AT4G11300.1 | Symbols: | Protein of unknown function (DUF793) |... 233 2e-61
AT4G23530.1 | Symbols: | Protein of unknown function (DUF793) |... 209 4e-54
>AT1G74450.1 | Symbols: | Protein of unknown function (DUF793) |
chr1:27982737-27983930 FORWARD LENGTH=397
Length = 397
Score = 495 bits (1274), Expect = e-140, Method: Compositional matrix adjust.
Identities = 252/415 (60%), Positives = 320/415 (77%), Gaps = 33/415 (7%)
Query: 1 MPATDYQGSSPSSSLTQFGRSIFSTNRRDQVHSMEGGST------LEHELESFQQHVTDR 54
MPAT+YQ S FGRS + R V+S+E + +E EL SFQ+ V +R
Sbjct: 1 MPATEYQRS--------FGRSFLNLRRDTAVNSVESTTVTPELTQMEAELVSFQRKVAER 52
Query: 55 FLDLSSAGPDDLLSLSWVGKLLDHFLSCQEEFKAILNAHKAQALRPPLDRMVADYFERSV 114
F+DL+++ +DLLSL WVGKLLD FLSCQEEF++I+ H++ +PP+DR+V+DYFERSV
Sbjct: 53 FIDLNASSCEDLLSLEWVGKLLDSFLSCQEEFRSIVINHRSMITKPPMDRLVSDYFERSV 112
Query: 115 KALDVCNAIRDGIEQIRQWQKLLEIVLFALGGN------QRSIGEGQFRRAKKALIDLAI 168
KALDVCNAIRDG+EQIRQWQKL+EIV+ A N +R +GEGQFRRA+K LI+LAI
Sbjct: 113 KALDVCNAIRDGVEQIRQWQKLIEIVICAFNNNGGGSSGKRPLGEGQFRRARKTLIELAI 172
Query: 169 GMLDDKDSNASV---SHRNRSFGKSRDXXXXXXXXXXXXXXRSLGHFRSLSWSVSRTWSA 225
GMLD+KDS++S HRNRSFG++++ R++GHFRSLSWSVSR+WSA
Sbjct: 173 GMLDEKDSSSSSVSSQHRNRSFGRNKEQLHH----------RTIGHFRSLSWSVSRSWSA 222
Query: 226 ARQLQAIGNNTYPPRASDLVASHGLAMPVFIMNSILLFVMWALVAAIPCQDRGLHAHFTI 285
++QLQAIGNN PRASD+ A++GL +PV+ M ++LLFVMWALVAAIPCQDRGL HF +
Sbjct: 223 SKQLQAIGNNLATPRASDITATNGLIVPVYTMTTVLLFVMWALVAAIPCQDRGLQVHFNV 282
Query: 286 PRNYTWAVPLLSLHERIFEESKKRERKNACGLLKEIYQIEKCARAMNELADSVQFPLTEE 345
PRNY W L+SLH+RI EESKKRERKN CGLLKEI+Q EK +R MNEL DSVQFPL+EE
Sbjct: 283 PRNYQWGGSLMSLHDRIIEESKKRERKNTCGLLKEIHQFEKTSRLMNELVDSVQFPLSEE 342
Query: 346 REGEVRKRVQEVSQVCDALKDGLDPLERQVREVFHRIVRSRTEGLDSLGRPNNAE 400
+E EVR+RV+E+ ++ +ALK+GLDP ER+VREVFHRIVRSRTEGLD++G+ + +E
Sbjct: 343 KEMEVRERVEELGKLQEALKNGLDPFERKVREVFHRIVRSRTEGLDTVGKHHGSE 397
>AT1G18740.1 | Symbols: | Protein of unknown function (DUF793) |
chr1:6464125-6465273 FORWARD LENGTH=382
Length = 382
Score = 484 bits (1246), Expect = e-137, Method: Compositional matrix adjust.
Identities = 246/404 (60%), Positives = 312/404 (77%), Gaps = 33/404 (8%)
Query: 1 MPATDYQGSSPSSSLTQFGRSIFSTNRRDQVHSMEGG---------STLEHELESFQQHV 51
MPATD+QGS FGRS+ S RRDQV S ST+E EL+SFQ+ V
Sbjct: 1 MPATDFQGS--------FGRSLLSL-RRDQVDSSTVVSGSSSHHEPSTMEVELDSFQRQV 51
Query: 52 TDRFLDLSSAGPDDLLSLSWVGKLLDHFLSCQEEFKAILNAHKAQALRPPLDRMVADYFE 111
++F+DL+ A +DLLSL W+GKLLD FL CQEEF+AI+ H++Q + P+DR+++DYFE
Sbjct: 52 AEKFIDLN-ASSNDLLSLEWIGKLLDSFLCCQEEFRAIVFNHRSQISKSPMDRLISDYFE 110
Query: 112 RSVKALDVCNAIRDGIEQIRQWQKLLEIVLFALGGNQRSIGEGQFRRAKKALIDLAIGML 171
RS+KALDVCNAIRDGIEQIRQW+KL +IV+ AL + R IGEGQ RRAKKALIDLAIGML
Sbjct: 111 RSIKALDVCNAIRDGIEQIRQWEKLADIVISALDSH-RPIGEGQLRRAKKALIDLAIGML 169
Query: 172 DDKD--SNASVSHRNRSFGKSRDXXXXXXXXXXXXXXRSLGHFRSLSWSVSRTWSAARQL 229
D+KD S +++HRNRSFG+ +D RS+GHFRSLSWSVSR+WSA++QL
Sbjct: 170 DEKDHPSGTNLAHRNRSFGRVKDSHH-----------RSIGHFRSLSWSVSRSWSASKQL 218
Query: 230 QAIGNNTYPPRASDLVASHGLAMPVFIMNSILLFVMWALVAAIPCQDRGLHAHFTIPRNY 289
QA+ +N PR +D+VAS+GLA+PV+ M S+LLFVMW LVAAIPCQDRGL +F +PR++
Sbjct: 219 QALASNLATPRPNDVVASNGLAVPVYTMTSVLLFVMWVLVAAIPCQDRGLQVNFFVPRHF 278
Query: 290 TWAVPLLSLHERIFEESKKRERKNACGLLKEIYQIEKCARAMNELADSVQFPLTEEREGE 349
WA P++SLH++I EESK+R+RKN CGLLKEI +IEK +R MNEL DS+ FPL +++E E
Sbjct: 279 QWAAPVMSLHDKIVEESKRRDRKNCCGLLKEIDRIEKSSRLMNELIDSIHFPLNDDKEVE 338
Query: 350 VRKRVQEVSQVCDALKDGLDPLERQVREVFHRIVRSRTEGLDSL 393
V++RV E+ QV +AL++GLDP ER+VREVFHRIVRSRTE LDSL
Sbjct: 339 VKQRVDELVQVREALRNGLDPFERKVREVFHRIVRSRTESLDSL 382
>AT1G43630.1 | Symbols: | Protein of unknown function (DUF793) |
chr1:16436445-16437686 REVERSE LENGTH=413
Length = 413
Score = 459 bits (1180), Expect = e-129, Method: Compositional matrix adjust.
Identities = 235/408 (57%), Positives = 305/408 (74%), Gaps = 33/408 (8%)
Query: 1 MPATDYQGSSPSSSLTQFGRSIFSTNRRDQVHSMEGGS-----TLEHELESFQQHVTDRF 55
MP T+Y FGRS S RRDQ H M+ S T+E EL+SFQ+ V ++F
Sbjct: 31 MPETEYS----------FGRSFLSL-RRDQAHLMDPTSFSEPMTMEVELDSFQRQVAEKF 79
Query: 56 LDL-SSAGPDDLLSLSWVGKLLDHFLSCQEEFKAILNAHKAQALRPPLDRMVADYFERSV 114
+DL +SA ++LSL W+GKLLD FL CQE+F+ I+ HK Q L+ P+DR++ +YFERSV
Sbjct: 80 IDLNASADEAEILSLEWIGKLLDSFLCCQEDFRVIIFNHKPQLLKQPMDRLIEEYFERSV 139
Query: 115 KALDVCNAIRDGIEQIRQWQKLLEIVLFALGGNQRSIGEGQFRRAKKALIDLAIGMLDDK 174
KALDVCNAIRDGIEQIRQWQKL+EIV+ AL NQR +GEG+ RAKKALIDLAIGMLD+K
Sbjct: 140 KALDVCNAIRDGIEQIRQWQKLIEIVISALDTNQRQLGEGEIHRAKKALIDLAIGMLDEK 199
Query: 175 DSNASVSHRNRSFGKSRDXXXXXXXXXXXXXXRSLGHFRSLSWSVSRTWSAARQLQAIGN 234
DS S +HRNRSF +++D + +G+ RSLSWSVSR+WSA+RQLQ IGN
Sbjct: 200 DS--SNTHRNRSFTRNKD------------HNQHIGYIRSLSWSVSRSWSASRQLQGIGN 245
Query: 235 NTYPPRASDLVASHGLAMPVFIMNSILLFVMWALVAAIPCQDRGLHAHFTIPRNYTWAVP 294
N PRASD++A++GLA+ V+ M SILLFV W LVAAIPCQDRGLH HF PR++ WAVP
Sbjct: 246 NLATPRASDVMATNGLALTVYTMTSILLFVTWVLVAAIPCQDRGLHVHFYFPRHFQWAVP 305
Query: 295 LLSLHERIFEES-KKRERKNACGLLKEIYQIEKCARAMNELADSVQFPLTEER-EGEVRK 352
++SLH++I +ES K+ ++K CGLL+EI QIE+ +R +++L DS F LT+E+ EV++
Sbjct: 306 VMSLHDKIMDESKKRDKKKKGCGLLREINQIERNSRMLSDLIDSDNFSLTDEKCTLEVKE 365
Query: 353 RVQEVSQVCDALKDGLDPLERQVREVFHRIVRSRTEGLDSLGRPNNAE 400
RVQE+ VC+A+K+GLDP +R+VR+VFH+IVR+RTE LDSLG+ N E
Sbjct: 366 RVQELMNVCEAIKEGLDPFDRKVRDVFHQIVRTRTEALDSLGKLQNQE 413
>AT1G63930.1 | Symbols: ROH1 | from the Czech 'roh' meaning 'corner'
| chr1:23727927-23729174 FORWARD LENGTH=415
Length = 415
Score = 277 bits (708), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 154/423 (36%), Positives = 247/423 (58%), Gaps = 52/423 (12%)
Query: 2 PATDYQGSSPSSSLTQFGRSIFSTNRRDQVHSMEGGSTLEHELESFQQHVTDRFLDL--- 58
PA D QGS GR N+ V++ + + +LE FQ+H+ DRF +L
Sbjct: 3 PAQDNQGSF-------LGRISIRRNQFVDVNNEQE----QEDLELFQKHIADRFTELLSP 51
Query: 59 -------------SSAGPDDLLSLSWVGKLLDHFLSCQEEFKAIL--NAHKAQALRPPLD 103
S A + ++S++W+ KL+D FL C+ EFKAIL Q +PP D
Sbjct: 52 PQPPPSDEINTVASVAATEQIMSVTWLRKLMDVFLCCEAEFKAILLMGRDPTQISKPPFD 111
Query: 104 RMVADYFERSVKALDVCNAIRDGIEQIRQWQKLLEIVLFALGGNQRSIGEGQFRRAKKAL 163
R+V + +RS+KALD+C A+ +GI+ +R +Q+L EI + AL QR +G+G RRAK+AL
Sbjct: 112 RLVPEMLDRSIKALDICTAVVNGIDSVRHYQRLAEIAVTAL--EQRPLGDGNVRRAKRAL 169
Query: 164 IDLAIGM-LDDKDSNA----------SVSHRNRSFGKSRDXXXXXXXXXXXXXXRSLGHF 212
+L + + L+DK++ + + R+ SFG+ ++G
Sbjct: 170 ANLVVALSLEDKENVSGGGGGGGGGNKTTERSWSFGRRSGGSSAASKGGA-----TIGQL 224
Query: 213 RSLSWSVSRTWSAARQLQAIGNNTYPPRASDLVASHGLAMPVFIMNSILLFVMWALVAAI 272
+S SW+V R WSAA+Q+ A+ N PPR ++ + GL P+FIM+++++FVMW L AA+
Sbjct: 225 KSSSWAVGRNWSAAKQIHAMTANLTPPRGNE---AAGLPQPMFIMSTVMVFVMWVLTAAV 281
Query: 273 PCQDR-GLHAHFTIP-RNYTWAVPLLSLHERIFEESKKRERKNACGLLKEIYQIEKCARA 330
PCQ+R GL H +P ++ WA L+ +HE+I +E KK+E+K + GL++E+ ++EK +
Sbjct: 282 PCQERSGLANHLPVPPKHLNWAQSLIGIHEKIGDEWKKKEKKGSAGLMEEMTRMEKLGHS 341
Query: 331 MNELADSVQFPLTEEREGEVRKRVQEVSQVCDALKDGLDPLERQVREVFHRIVRSRTEGL 390
+ E AD +P ++ +V E++++C +++ L PL++Q+REVFHRIVRSR E L
Sbjct: 342 LMEFADGFHYPAEKDAAESAAVQVAEMAEICRRMEEELVPLQQQIREVFHRIVRSRAEIL 401
Query: 391 DSL 393
+ L
Sbjct: 402 EVL 404
>AT4G11300.1 | Symbols: | Protein of unknown function (DUF793) |
chr4:6872345-6873460 REVERSE LENGTH=371
Length = 371
Score = 233 bits (595), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 145/398 (36%), Positives = 227/398 (57%), Gaps = 37/398 (9%)
Query: 2 PATDYQGSSPSSSLTQFGRSIFSTNRRDQVHSMEGGSTLEHE-LESFQQHVTDRFLDL-- 58
PAT++Q SS L++ +RR+QV SME E E LE FQ+HV +RF +L
Sbjct: 3 PATEHQ----SSFLSRL-------SRRNQVVSMEVNHEQEQEELEDFQKHVAERFAELLP 51
Query: 59 SSAGPDD--LLSLSWVGKLLDHFLSCQEEFKAILNAHKAQALRPPLDRMVADYFERSVKA 116
S P+ +LS+ W+ KLLD F+S + EF ++L ++ +Q +PPLD++V + +R VKA
Sbjct: 52 PSDSPESYPILSIQWLRKLLDVFMSIESEFHSVLTSNPSQISKPPLDKLVPEMLDRIVKA 111
Query: 117 LDVCNAIRDGIEQIRQWQKLLEIVLFALGGNQRSIGEGQFRRAKKALIDLAIGMLDDKDS 176
LD+C A+ +G++ +RQ Q+ EI + AL Q + +G RRAK+AL L + + D
Sbjct: 112 LDICTAVVNGVDSVRQIQRCAEIAVTAL--KQTPLSDGSVRRAKRALTSLLAAL--NADK 167
Query: 177 NASVSHRNRSFGKSRDXXXXXXXXXXXXXXRSLGHFRSLSWSVSRTWSAARQLQAIGNNT 236
N+ S S D G VS+ WSAA+Q+QA+ N
Sbjct: 168 NSGSSGGGSGRRSSTDQWSSFGRRSGGSSGGGGGGAG----CVSKNWSAAKQIQAMTANL 223
Query: 237 YPPRASDLVASHGLAMPVFIMNSILLFVMWALVAAIPCQ-DRGLHAHFTIPRNYTWAVPL 295
PR G A P++IM+S+++ VMW LV A+PCQ GL H +P+N WA
Sbjct: 224 VAPRG-------GEASPMYIMSSVMVMVMWTLVVAVPCQTSNGLMVHVPLPKNQVWANAA 276
Query: 296 LSLHERIFEESKKRERKNACGLLKEIYQIEKCARAMNELADSVQFPLTEEREGEVRKRVQ 355
+S+ ER+ EE K++E + GL++E+ ++E+ + E ++ +F E +V V
Sbjct: 277 VSISERVGEEMKRKETRGG-GLMEEMQRMERIGLKLMEFSEGFRF----NGEEDVVAEVA 331
Query: 356 EVSQVCDALKDGLDPLERQVREVFHRIVRSRTEGLDSL 393
E+ ++C ++DGL+ L+R+VREVFHR+V+SR+E L+ +
Sbjct: 332 EMEEICRKMEDGLEGLQRRVREVFHRLVKSRSEILEVI 369
>AT4G23530.1 | Symbols: | Protein of unknown function (DUF793) |
chr4:12279171-12280361 FORWARD LENGTH=396
Length = 396
Score = 209 bits (531), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 150/403 (37%), Positives = 235/403 (58%), Gaps = 52/403 (12%)
Query: 16 TQFGRSIFS--TNRRDQVHSMEGGSTLE-HELESFQQHVTDRFLDLSSAGP--------- 63
T+F S S + RR+Q+ SM+ E ELE FQ+HV +RF DL ++
Sbjct: 5 TEFQGSFLSRISIRRNQIVSMDVNHEQELEELEYFQKHVAERFSDLITSPSPPPSSSSSA 64
Query: 64 -----DDLLSLSWVGKLLDHFLSCQEEFKAILNAHKAQALRPPLDRMVADYFERSVKALD 118
D +LS+ W+ LLD F+SC+ EFKA+L+ + P L+R++ + +R +KALD
Sbjct: 65 VSQPSDPILSIPWLQNLLDVFMSCEAEFKAVLSTTQISK-SPSLERVLPEMLDRILKALD 123
Query: 119 VCNAIRDGIEQIRQWQKLLEIVLFALGGNQRSIGEGQFRRAKKALIDLAIGMLDD--KDS 176
+CNA+ +GI+ +RQ ++ EI + AL QR + +G RRAK+AL L IG+ D +D
Sbjct: 124 LCNAVVNGIDSVRQSRRFAEIAVTAL--KQRPLCDGSVRRAKRALTSLLIGLNADERRDR 181
Query: 177 NAS---------VSHRNRSFGKSRDXXXXXXXXXXXXXXRSLGHFRSLSWSVSRTWSAAR 227
N+ + R+ SFG + G + + VS+ WSA++
Sbjct: 182 NSGGSGCSNQRRTTSRSWSFGTRSNVTGG-------------GLYGQV---VSKNWSASK 225
Query: 228 QLQAIGNNTYPPRASDLVASHGLAMPVFIMNSILLFVMWALVAAIPCQDRG-LHAHFTIP 286
Q+QA+ N PR ++ + G AMPV+IM+S+++ VMW LVAA+PCQ L A +P
Sbjct: 226 QIQAMVANLVLPRGAE---ASGPAMPVYIMSSVMVLVMWVLVAAVPCQTSSVLVAPLPLP 282
Query: 287 RNYTWAVPLLSLHERIFEESKKRERK-NACGLLKEIYQIEKCARAMNELADSVQFPLTEE 345
++ WA +S+ ERI EE K++E++ GL++E+ ++EK ++ E A+ +FP EE
Sbjct: 283 KHQNWASAAMSIQERIGEEIKRKEKRCGGGGLMEEMQRMEKIGLSLMEFAERFRFPADEE 342
Query: 346 REGEVRKRVQEVSQVCDALKDGLDPLERQVREVFHRIVRSRTE 388
E EV ++V E+ ++C ++ GL+ L+RQVR+VFHR+VRSR E
Sbjct: 343 EEVEVAEKVDEMEEICRRMEVGLEDLQRQVRQVFHRLVRSRIE 385