Miyakogusa Predicted Gene
- Lj4g3v0244130.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj4g3v0244130.1 tr|Q1I184|Q1I184_PEA WD-40 repeat protein
OS=Pisum sativum GN=MSI1 PE=2 SV=1,94.69,0,WD_REPEATS_2,WD40 repeat;
WD_REPEATS_REGION,WD40-repeat-containing domain; WD40 repeats,WD40
repeat;,CUFF.46754.1
(223 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                     Score  E
Sequences producing significant alignments:                         (bits)  Value
AT5G58230.1 | Symbols: MSI1, MEE70, ATMSI1 | Transducin/WD40 rep...    392  e-109
AT2G16780.1 | Symbols: MSI2, MSI02, NFC02, NFC2 | Transducin fam...    208  2e-54
AT4G35050.1 | Symbols: MSI3, NFC3 | Transducin family protein / ...    202  1e-52
AT2G19520.1 | Symbols: FVE, ACG1, MSI4, NFC4, NFC04, ATMSI4 | Tr...     92  3e-19
AT4G29730.1 | Symbols: NFC5, MSI5 | nucleosome/chromatin assembl...     86  2e-17
AT2G19540.1 | Symbols: | Transducin family protein / WD-40 repe...      59  3e-09
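
The summary lines above end with a bit score and an E-value, and the top hit prints its E-value as "e-109" with no leading digit. A minimal sketch of reading such lines in Python follows. The function name parse_hit_line is hypothetical, and the sketch assumes the last two whitespace-separated fields are always the score and the E-value; it is not part of BLAST or of any library.

```python
def parse_hit_line(line):
    """Split one summary line into (identifier, bit score, E-value)."""
    # Assumption: the final two whitespace-separated fields are score and E-value.
    description, bits, evalue = line.rsplit(None, 2)
    # This report prints values such as "e-109" without a leading digit,
    # which float() rejects, so a "1" is prepended before conversion.
    if evalue.startswith("e"):
        evalue = "1" + evalue
    # The identifier is the text before the first "|" separator.
    identifier = description.split("|", 1)[0].strip()
    return identifier, float(bits), float(evalue)


hits = [
    parse_hit_line("AT5G58230.1 | Symbols: MSI1, MEE70, ATMSI1 | Transducin/WD40 rep... 392 e-109"),
    parse_hit_line("AT2G19540.1 | Symbols: | Transducin family protein / WD-40 repe... 59 3e-09"),
]
for identifier, bits, evalue in hits:
    print(identifier, bits, evalue)
```

Splitting from the right keeps the sketch independent of how many spaces separate the columns and of any "|" characters inside the description.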
>AT5G58230.1 | Symbols: MSI1, MEE70, ATMSI1 | Transducin/WD40
repeat-like superfamily protein | chr5:23556112-23557994
FORWARD LENGTH=424
Length = 424
Score = 392 bits (1007), Expect = e-109, Method: Compositional matrix adjust.
Identities = 188/226 (83%), Positives = 202/226 (89%), Gaps = 4/226 (1%)
Query: 1 MGKXXXXXXXXXXXRLINEEYKIWKKNSPFLYDLVITHALEWPSLTVEWLPDRHEPPGKD 60
MGK RLINEEYKIWKKN+PFLYDLVITHALEWPSLTVEWLPDR EP GKD
Sbjct: 1 MGKDEEEMRGEIEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPSGKD 60
Query: 61 YSLQKVILGTHTSENEPNYLMLAQVQLPLHDSENDARHYDDD---LGGFGCANGKVQIIQ 117
YS+QK+ILGTHTSE+EPNYLMLAQVQLPL D+E++AR YDDD GGFGCA GKVQIIQ
Sbjct: 61 YSVQKMILGTHTSESEPNYLMLAQVQLPLDDTESEARQYDDDRSEFGGFGCATGKVQIIQ 120
Query: 118 QINHDGEVNRARYMPQNPFIIATKTISAEVYVFDYSKHPSKPPLDGSCNPDLRLRGHNTE 177
QINHDGEVNRARYMPQNPFIIATKT++AEVYVFDYSKHPSKPPLDG+CNPDL+LRGH++E
Sbjct: 121 QINHDGEVNRARYMPQNPFIIATKTVNAEVYVFDYSKHPSKPPLDGACNPDLKLRGHSSE 180
Query: 178 GYGLSWSKFKEGHLLSGSDDAQICLWDINGGTPKNKSLDAMQIFKV 223
GYGLSWSKFK+GHLLSGSDDAQICLWDIN TPKNKSLDA QIFK
Sbjct: 181 GYGLSWSKFKQGHLLSGSDDAQICLWDIN-ATPKNKSLDAQQIFKA 225
>AT2G16780.1 | Symbols: MSI2, MSI02, NFC02, NFC2 | Transducin family
protein / WD-40 repeat family protein |
chr2:7281615-7283583 REVERSE LENGTH=415
Length = 415
Score = 208 bits (530), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 107/213 (50%), Positives = 144/213 (67%), Gaps = 19/213 (8%)
Query: 17 INEEYKIWKKNSPFLYDLVITHALEWPSLTVEWLPDRHEPPGKD--YSLQKVILGTHTSE 74
+ E++ +WKKN+PFLYDL+I+H LEWPSLTV W+P P D + + K+ILGTHTS
Sbjct: 14 VEEDFSVWKKNTPFLYDLLISHPLEWPSLTVHWVPSTPNPYVADSYFGVHKLILGTHTSG 73
Query: 75 NEPNYLMLAQVQLPLHDSENDARHYDDDLGGFGCANG-----KVQIIQQINHDGEVNRAR 129
+ ++LM+A V P ++E G G AN KV+I Q+I DGEVNRAR
Sbjct: 74 SAQDFLMVADVVTPTPNAE----------PGIGGANQDPFIPKVEIRQRIRVDGEVNRAR 123
Query: 130 YMPQNPFIIATKTISAEVYVFDYSKHPSKPPLDGSCNPDLRLRGHNTEGYGLSWSKFKEG 189
MPQ P ++ KT EV++FDY+KH +K C+PDLRL GH+ EGYGLSWS FKEG
Sbjct: 124 CMPQKPTLVGAKTSGCEVFLFDYAKHAAKSQT-SECDPDLRLVGHDKEGYGLSWSPFKEG 182
Query: 190 HLLSGSDDAQICLWDINGGTPKNKSLDAMQIFK 222
+LLSGS D +ICLWD++ TP++K L+AM +++
Sbjct: 183 YLLSGSQDQKICLWDVS-ATPQDKVLNAMFVYE 214
>AT4G35050.1 | Symbols: MSI3, NFC3 | Transducin family protein /
WD-40 repeat family protein | chr4:16682752-16684751
REVERSE LENGTH=424
Length = 424
Score = 202 bits (514), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 102/208 (49%), Positives = 140/208 (67%), Gaps = 9/208 (4%)
Query: 17 INEEYKIWKKNSPFLYDLVITHALEWPSLTVEWLPDRHEPPGKD--YSLQKVILGTHTSE 74
+ EE+ IWK+N+PFLYDL+I+H LEWPSLT+ W+P P KD +++ K+ILGTHTS
Sbjct: 15 VEEEFSIWKRNTPFLYDLMISHPLEWPSLTLHWVPSTPIPYSKDPYFAVHKLILGTHTSG 74
Query: 75 NEPNYLMLAQVQLPLHDSENDARHYDDDLGGFGCANGKVQIIQQINHDGEVNRARYMPQN 134
++LM+A V +P D+E D + KV+I Q+I DGEVNRAR MPQ
Sbjct: 75 GAQDFLMVADVVIPTPDAEPGLGGRDQE-----PIVPKVEIKQKIRVDGEVNRARCMPQK 129
Query: 135 PFIIATKTISAEVYVFDYSKHPSKPPLDGSCNPDLRLRGHNTEGYGLSWSKFKEGHLLSG 194
P ++ KT +EV++FDY++ KP C+PDLRL GH EGYGL+WS FKEG+LLSG
Sbjct: 130 PTLVGAKTSGSEVFLFDYARLSGKPQT-SECDPDLRLMGHEQEGYGLAWSSFKEGYLLSG 188
Query: 195 SDDAQICLWDINGGTPKNKSLDAMQIFK 222
S D +ICLWD++ T +K L+ M +++
Sbjct: 189 SQDQRICLWDVS-ATATDKVLNPMHVYE 215
>AT2G19520.1 | Symbols: FVE, ACG1, MSI4, NFC4, NFC04, ATMSI4 |
Transducin family protein / WD-40 repeat family protein
| chr2:8456006-8459235 FORWARD LENGTH=507
Length = 507
Score = 92.0 bits (227), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 63/195 (32%), Positives = 99/195 (50%), Gaps = 15/195 (7%)
Query: 17 INEEYKIWKKNSPFLYDLVITHALEWPSLTVEWLPDRHEPPGKDYSLQKVILGTHTSENE 76
++E+Y WK P LYD + H L WPSL+ W P + K+ Q++ L T +
Sbjct: 64 VDEKYSQWKGLVPILYDWLANHNLVWPSLSCRWGPQLEQATYKNR--QRLYLSEQTDGSV 121
Query: 77 PNYLMLAQVQL---PLHDSENDARHYDDDLGGFGCANGKVQIIQQINHDGEVNRARYMPQ 133
PN L++A ++ + +E+ ++ ++ F V+ + I H GEVNR R +PQ
Sbjct: 122 PNTLVIANCEVVKPRVAAAEHISQFNEEARSPF------VKKYKTIIHPGEVNRIRELPQ 175
Query: 134 NPFIIATKTISAEVYVFDYSKHPSKPPLDGSCN--PDLRLRGHNTEG-YGLSWSKFKEGH 190
N I+AT T S +V ++D P++ + G+ N PDL L GH + L+ E
Sbjct: 176 NSKIVATHTDSPDVLIWDVETQPNRHAVLGAANSRPDLILTGHQDNAEFALAMCP-TEPF 234
Query: 191 LLSGSDDAQICLWDI 205
+LSG D + LW I
Sbjct: 235 VLSGGKDKSVVLWSI 249
>AT4G29730.1 | Symbols: NFC5, MSI5 | nucleosome/chromatin assembly
factor group C5 | chr4:14559255-14562522 REVERSE
LENGTH=487
Length = 487
Score = 86.3 bits (212), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 59/192 (30%), Positives = 94/192 (48%), Gaps = 19/192 (9%)
Query: 17 INEEYKIWKKNSPFLYDLVITHALEWPSLTVEWLPDRHEPPGKDYSLQKVILGTHTSENE 76
+++ Y WK P LYD + H L WPSL+ W P + K Q++ L T+ +
Sbjct: 54 VDDTYSQWKTLLPILYDSFVNHTLVWPSLSCRWGPQLEQAGSK---TQRLYLSEQTNGSV 110
Query: 77 PNYLMLAQVQLPLHDSENDARHYDDDLGGFGCANGKVQIIQQINHDGEVNRARYMPQNPF 136
PN L++A + ++ N+ H + V+ + I H GEVNR R +PQN
Sbjct: 111 PNTLVIANCET-VNRQLNEKAH-----------SPFVKKYKTIIHPGEVNRIRELPQNSK 158
Query: 137 IIATKTISAEVYVFDYSKHPSKPPLDGS--CNPDLRLRGHNTEG-YGLSWSKFKEGHLLS 193
I+AT T S ++ +++ P + + G+ PDL L GH + + L+ E +LS
Sbjct: 159 IVATHTDSPDILIWNTETQPDRYAVLGAPDSRPDLLLIGHQDDAEFALAMCP-TEPFVLS 217
Query: 194 GSDDAQICLWDI 205
G D + LW+I
Sbjct: 218 GGKDKSVILWNI 229
>AT2G19540.1 | Symbols: | Transducin family protein / WD-40 repeat
family protein | chr2:8461804-8464347 FORWARD LENGTH=469
Length = 469
Score = 58.5 bits (140), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 36/107 (33%), Positives = 50/107 (46%), Gaps = 15/107 (14%)
Query: 116 IQQINHDGEVNRARYMPQNPFIIATKTISAEVYVFDYSKH-------------PSKPPLD 162
++++ H G VNR R MPQN I + S V V+D S H + P L+
Sbjct: 153 VRRVAHHGCVNRIRAMPQNSHICVSWADSGHVQVWDMSSHLNALAESETEGKDGTSPVLN 212
Query: 163 GSCNPDLRLRGHNTEGYGLSWSKFKEGHLLSGSDDAQICLWDINGGT 209
+ P + GH EGY + WS G LLSG + I LW+ G+
Sbjct: 213 QA--PLVNFSGHKDEGYAIDWSPATAGRLLSGDCKSMIHLWEPASGS 257