Miyakogusa Predicted Gene
- Lj4g3v3061490.3
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj4g3v3061490.3 tr|I0Z2S6|I0Z2S6_9CHLO WD40 repeat-like protein
OS=Coccomyxa subellipsoidea C-169 PE=4 SV=1,25,0.00000000001,WD40,WD40
repeat; WD40 repeats,WD40 repeat; F-BOX AND WD40 DOMAIN PROTEIN,NULL;
WD_REPEATS_2,WD40 re,CUFF.52218.3
(310 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
                                                                    Score     E
Sequences producing significant alignments:                        (bits)   Value
AT3G06880.2 | Symbols: | Transducin/WD40 repeat-like superfamil...    143   1e-34
AT3G06880.1 | Symbols: | Transducin/WD40 repeat-like superfamil...    138   4e-33
AT5G51980.2 | Symbols: | Transducin/WD40 repeat-like superfamil...     57   2e-08
AT5G51980.1 | Symbols: | Transducin/WD40 repeat-like superfamil...     56   2e-08
AT1G24130.1 | Symbols: | Transducin/WD40 repeat-like superfamil...     55   5e-08
AT4G25440.1 | Symbols: ZFWD1 | zinc finger WD40 repeat protein 1...    52   5e-07
AT1G49450.1 | Symbols: | Transducin/WD40 repeat-like superfamil...     49   4e-06
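The hit summary lines above can also be read programmatically. What follows is a minimal sketch in Python, not part of the BLAST output itself. It assumes each summary line has the shape `<id> | <description>... <bits> <E-value>`, as in the list above. The names `HIT_LINE` and `parse_hit_line` are hypothetical helpers introduced only for illustration, and only the numeric forms that appear in this report (for example `143` and `1e-34`) are handled.

```python
import re

# Assumed shape of one hit-summary line (taken from the list above):
#   "<id> | <description>... <bits> <E-value>"
# Any run of whitespace is accepted between the three trailing fields.
HIT_LINE = re.compile(
    r"^(?P<id>\S+) \| (?P<desc>.*?)\s+"
    r"(?P<bits>\d+(?:\.\d+)?)\s+"
    r"(?P<evalue>\d+(?:\.\d+)?(?:e[-+]?\d+)?)$"
)


def parse_hit_line(line):
    """Return a dict for one summary line, or None if the line does not match."""
    m = HIT_LINE.match(line.strip())
    if m is None:
        return None
    return {
        "id": m["id"],
        "description": m["desc"],
        "bits": float(m["bits"]),
        "evalue": float(m["evalue"]),
    }


if __name__ == "__main__":
    sample = ("AT4G25440.1 | Symbols: ZFWD1 | zinc finger WD40 repeat "
              "protein 1... 52 5e-07")
    print(parse_hit_line(sample))
```

Lines that do not follow this shape, such as the header or the `(310 letters)` line, yield `None`, so the function can be mapped over the whole report and the non-matches filtered out.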
>AT3G06880.2 | Symbols: | Transducin/WD40 repeat-like superfamily
protein | chr3:2170516-2175686 REVERSE LENGTH=1264
Length = 1264
Score = 143 bits (361), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 86/294 (29%), Positives = 161/294 (54%), Gaps = 7/294 (2%)
Query: 17 IHKELIQVECRENGGVLSVICFKDKVFSGHADGSIKVWKLKDNLFHLLQEIQEHTKAVTN 76
+H + +++ +G V ++I K +FSG +DGSI+VW + + LL +I+EH VT
Sbjct: 972 VHTQTVEMHQSGSGAVTALIYHKGLLFSGFSDGSIRVWNVNKKIATLLWDIKEHKSTVTC 1031
Query: 77 LVISESGDRLYSGSLDRTTKIWSIGKTAIHCAQVHDMKDQIHKLVVTNSIACFISQGAGV 136
+SE+G+ + SGS D+T ++W I K + CA+V KD I KL ++ I++G +
Sbjct: 1032 FSLSETGECVLSGSADKTIRVWQIVKGKLECAEVIKTKDSIRKLEAFGNMIFVITKGHKM 1091
Query: 137 KVQPLNGESKLLNSNKNVKCLTHANGKLYCGCHDSSVQEIHLATGTVSSIQSGSKKLLGK 196
K+ + S+ + K VK + A GK+Y GC D+S+QE+ +A I++ ++ +
Sbjct: 1092 KLLDSSRISQSIFKGKGVKSMVSAQGKIYIGCIDTSIQELIVANKREKEIKAPTRSWRLQ 1151
Query: 197 AYPIHALQIHGELIYAAGSSLDGSAVK-IWNNSNYSLVGSLQTGSEVRAMAVSSELIYLG 255
PI+++ ++ +++Y++ + ++ S +K + N + + + GS + AM V + IYL
Sbjct: 1152 NKPINSVVVYKDMLYSSSTYVEMSNIKDLRRNYEPQMSITAEKGSNIVAMGVVEDFIYLN 1211
Query: 256 CKGGA--VEIWDKKKHSRVDTLQVGTNCKVNCMALDSIEEILVIGTSDGQIQAW 307
A ++IW ++ +V L G+ +L + +I+ GT G I+ W
Sbjct: 1212 RSSSANTLQIWLRRTQQKVGRLSAGS----KITSLLTANDIVFCGTEAGVIKGW 1261
>AT3G06880.1 | Symbols: | Transducin/WD40 repeat-like superfamily
protein | chr3:2169899-2175686 REVERSE LENGTH=1261
Length = 1261
Score = 138 bits (348), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 84/294 (28%), Positives = 160/294 (54%), Gaps = 7/294 (2%)
Query: 17 IHKELIQVECRENGGVLSVICFKDKVFSGHADGSIKVWKLKDNLFHLLQEIQEHTKAVTN 76
+H + +++ +G V ++I K +FSG +DGSI+VW + + LL +I+EH VT
Sbjct: 972 VHTQTVEMHQSGSGAVTALIYHKGLLFSGFSDGSIRVWNVNKKIATLLWDIKEHKSTVTC 1031
Query: 77 LVISESGDRLYSGSLDRTTKIWSIGKTAIHCAQVHDMKDQIHKLVVTNSIACFISQGAGV 136
+SE+G+ + SGS D+T ++W I K + CA+V KD I KL ++ I++G +
Sbjct: 1032 FSLSETGECVLSGSADKTIRVWQIVKGKLECAEVIKTKDSIRKLEAFGNMIFVITKGHKM 1091
Query: 137 KVQPLNGESKLLNSNKNVKCLTHANGKLYCGCHDSSVQEIHLATGTVSSIQSGSKKLLGK 196
K+ + S+ + K VK + A GK+Y GC D+S+QE+ +A I++ ++ +
Sbjct: 1092 KLLDSSRISQSIFKGKGVKSMVSAQGKIYIGCIDTSIQELIVANKREKEIKAPTRSWRLQ 1151
Query: 197 AYPIHALQIHGELIYAAGSSLDGSAVK-IWNNSNYSLVGSLQTGSEVRAMAVSSELIYLG 255
PI+++ ++ +++Y++ + ++ S +K + N + + + GS + AM V + IYL
Sbjct: 1152 NKPINSVVVYKDMLYSSSTYVEMSNIKDLRRNYEPQMSITAEKGSNIVAMGVVEDFIYLN 1211
Query: 256 CKGGA--VEIWDKKKHSRVDTLQVGTNCKVNCMALDSIEEILVIGTSDGQIQAW 307
A ++IW ++ +V L G+ +L + +I+ GT G + +
Sbjct: 1212 RSSSANTLQIWLRRTQQKVGRLSAGS----KITSLLTANDIVFCGTEAGLMDPF 1261
>AT5G51980.2 | Symbols: | Transducin/WD40 repeat-like superfamily
protein | chr5:21113650-21115902 REVERSE LENGTH=443
Length = 443
Score = 56.6 bits (135), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 57/210 (27%), Positives = 95/210 (45%), Gaps = 22/210 (10%)
Query: 30 GGVLSVICFKDKVFSGHADGSIKVWKLK--DNLFHLLQEIQEHTKAVTNLVISESGDRLY 87
G V S++ D +F+G DGSI W+ N F + HT AV L + +RLY
Sbjct: 231 GQVYSLVVGTDLLFAGTQDGSILAWRYNAATNCFEPSASLTGHTLAVVTLYV--GANRLY 288
Query: 88 SGSLDRTTKIWSIGKTAIHCAQ-VHDMKDQIHKLVVTNS--IACFISQGAGVKVQPLNGE 144
SGS+D+T K+WS+ + C Q + D + L+ + ++C + + G
Sbjct: 289 SGSMDKTIKVWSLDN--LQCIQTLTDHSSVVMSLICWDQFLLSCSLDNTVKIWAAIEGGN 346
Query: 145 SKLLNSNKN-----VKCLTH---ANGKLYCGCHDSSVQEIHLATGTVSSIQSGSKKLLGK 196
++ ++K C H A L C C+D++++ L + + + K+ K
Sbjct: 347 LEVTYTHKEEHGVLALCGVHDAEAKPVLLCACNDNTLRLYDLPSLGLFIRFTERGKIFAK 406
Query: 197 AYPIHALQIHGELIYAAGSSLDGSA-VKIW 225
I A+QI I+ G DG+ VK+W
Sbjct: 407 Q-EIRAIQIGPGGIFFTG---DGTGQVKVW 432
>AT5G51980.1 | Symbols: | Transducin/WD40 repeat-like superfamily
protein | chr5:21113650-21115902 REVERSE LENGTH=437
Length = 437
Score = 56.2 bits (134), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 58/210 (27%), Positives = 94/210 (44%), Gaps = 28/210 (13%)
Query: 30 GGVLSVICFKDKVFSGHADGSIKVWKLK--DNLFHLLQEIQEHTKAVTNLVISESGDRLY 87
G V S++ D +F+G DGSI W+ N F + HT AV L + +RLY
Sbjct: 231 GQVYSLVVGTDLLFAGTQDGSILAWRYNAATNCFEPSASLTGHTLAVVTLYV--GANRLY 288
Query: 88 SGSLDRTTKIWSIGKTAIHCAQ-VHDMKDQIHKLVVTNS--IACFISQGAGVKVQPLNGE 144
SGS+D+T K+WS+ + C Q + D + L+ + ++C + + G
Sbjct: 289 SGSMDKTIKVWSLDN--LQCIQTLTDHSSVVMSLICWDQFLLSCSLDNTVKIWAAIEGGN 346
Query: 145 SKLLNSNKN-----VKCLTH---ANGKLYCGCHDSSVQEIHLATGTVSSIQSGSKKLLGK 196
++ ++K C H A L C C+D++++ L + T K+ K
Sbjct: 347 LEVTYTHKEEHGVLALCGVHDAEAKPVLLCACNDNTLRLYDLPSFTERG------KIFAK 400
Query: 197 AYPIHALQIHGELIYAAGSSLDGSA-VKIW 225
I A+QI I+ G DG+ VK+W
Sbjct: 401 Q-EIRAIQIGPGGIFFTG---DGTGQVKVW 426
>AT1G24130.1 | Symbols: | Transducin/WD40 repeat-like superfamily
protein | chr1:8534183-8535430 REVERSE LENGTH=415
Length = 415
Score = 55.1 bits (131), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 34/88 (38%), Positives = 48/88 (54%), Gaps = 4/88 (4%)
Query: 12 TSDIWIHKELIQVECRENGGVLSVICFKDK-VFSGHADGSIKVWKLKDNLFHLLQEIQEH 70
TSD K L +E + + +++ KD V++G AD IKVW KD L+ + +H
Sbjct: 221 TSDF---KCLDSIEKAHDDAINAIVVSKDGFVYTGSADKKIKVWNKKDKKHSLVATLTKH 277
Query: 71 TKAVTNLVISESGDRLYSGSLDRTTKIW 98
AV L ISE G LYSG+ DR+ +W
Sbjct: 278 LSAVNALAISEDGKVLYSGACDRSILVW 305
>AT4G25440.1 | Symbols: ZFWD1 | zinc finger WD40 repeat protein 1 |
chr4:13007107-13009381 REVERSE LENGTH=430
Length = 430
Score = 52.0 bits (123), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 59/216 (27%), Positives = 97/216 (44%), Gaps = 28/216 (12%)
Query: 30 GGVLSVICFKDKVFSGHADGSIKVWKLKD--NLFHLLQEIQEHTKAVTNLVISESGDRLY 87
G V S++ D +F+G DGSI VW+ + F + HT AV +L + +RLY
Sbjct: 224 GQVYSLVVGTDLLFAGTQDGSILVWRYNSTTSCFDPAASLLGHTLAVVSLYV--GANRLY 281
Query: 88 SGSLDRTTKIWSIGKTAIHCAQ-VHDMKDQIHKLVVTNS--IACFISQGAGVKVQPLNGE 144
SG++D + K+WS+ + C Q + + + L+ + ++C + + G
Sbjct: 282 SGAMDNSIKVWSLDN--LQCIQTLTEHTSVVMSLICWDQFLLSCSLDNTVKIWAATEGGN 339
Query: 145 SKLLNSNKN-----VKCLTH---ANGKLYCGCHDSSVQEIHLATGTVSSIQSGSKKLLGK 196
++ ++K C H A L C C+D+S+ L + T K+L K
Sbjct: 340 LEVTYTHKEEYGVLALCGVHDAEAKPVLLCSCNDNSLHLYDLPSFTERG------KILAK 393
Query: 197 AYPIHALQIHGELIYAAGSSLDGSA-VKIWNNSNYS 231
I ++QI I+ G DGS VK+W S S
Sbjct: 394 Q-EIRSIQIGPGGIFFTG---DGSGQVKVWKWSTES 425
>AT1G49450.1 | Symbols: | Transducin/WD40 repeat-like superfamily
protein | chr1:18305684-18307099 FORWARD LENGTH=471
Length = 471
Score = 49.3 bits (116), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 45/149 (30%), Positives = 68/149 (45%), Gaps = 14/149 (9%)
Query: 21 LIQVECRENGGVLSVICFKDKVFSGHADGSIKVWKL----KDNLFHLLQEIQEHTKAVTN 76
L +E ++ V F D VF+G ADG++KVWK K+ L+Q + + AVT
Sbjct: 280 LESIEAHDDAVNTVVSGFDDLVFTGSADGTLKVWKREVQGKEMKHVLVQVLMKQENAVTA 339
Query: 77 LVISESGDRLYSGSLDRTTKIWSIGKTAIHCAQVHDMKDQIHKLVVTNSIACFISQGA-- 134
L ++ + +Y GS D T W K H +H + + L S+ +S GA
Sbjct: 340 LAVNLTDAVVYCGSSDGTVNFWERQKYLTHKGTIHGHRMAVLCLATAGSL--LLSGGADK 397
Query: 135 GVKVQPLNGE------SKLLNSNKNVKCL 157
+ V NG+ S L++ VKCL
Sbjct: 398 NICVWKRNGDGSHTCLSVLMDHEGPVKCL 426