Miyakogusa Predicted Gene
- Lj5g3v0780800.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj5g3v0780800.1 Non Chatacterized Hit- tr|O82492|O82492_ARATH
Putative uncharacterized protein AT4g10700
OS=Arabidop,46.59,1e-18,FBOX,F-box domain, cyclin-like; F-box
domain,F-box domain, cyclin-like; no description,NULL;
F-box-l,CUFF.53951.1
(376 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT1G57790.1 | Symbols: | F-box family protein | chr1:21404578-2... 318 5e-87
AT3G56470.1 | Symbols: | F-box family protein | chr3:20934815-2... 305 3e-83
AT4G00893.1 | Symbols: | unknown protein; LOCATED IN: chloropla... 297 6e-81
AT4G12370.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 261 7e-70
AT4G10660.1 | Symbols: | CDC68-related | chr4:6582535-6583488 F... 252 2e-67
AT1G49360.1 | Symbols: | F-box family protein | chr1:18267338-1... 206 3e-53
AT3G18720.1 | Symbols: | F-box family protein | chr3:6444433-64... 195 5e-50
AT4G10695.1 | Symbols: | CDC68-related | chr4:6593774-6594478 R... 189 2e-48
AT4G12382.2 | Symbols: | F-box family protein | chr4:7334352-73... 148 5e-36
AT4G12382.1 | Symbols: | F-box family protein | chr4:7334352-73... 148 5e-36
AT2G44735.1 | Symbols: | BEST Arabidopsis thaliana protein matc... 114 1e-25
AT4G10700.1 | Symbols: | BEST Arabidopsis thaliana protein matc... 102 6e-22
AT4G12810.1 | Symbols: | F-box family protein | chr4:7522831-75... 69 8e-12
AT4G22660.1 | Symbols: | F-box family protein with a domain of ... 67 2e-11
AT4G22180.1 | Symbols: | F-box family protein with a domain of ... 62 7e-10
AT5G66830.1 | Symbols: | F-box family protein with a domain of ... 61 2e-09
AT4G22060.1 | Symbols: | F-box family protein with a domain of ... 59 5e-09
AT4G22170.1 | Symbols: | F-box family protein with a domain of ... 58 9e-09
AT4G18320.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 57 3e-08
AT4G22165.1 | Symbols: | F-box family protein with a domain of ... 57 3e-08
AT4G18320.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 56 4e-08
AT2G33190.1 | Symbols: | F-box family protein with a domain of ... 55 1e-07
AT1G69090.1 | Symbols: | Protein of unknown function (DUF295) |... 54 2e-07
AT3G61590.2 | Symbols: HWS | Galactose oxidase/kelch repeat supe... 54 2e-07
AT3G61590.1 | Symbols: HWS, HS | Galactose oxidase/kelch repeat ... 54 2e-07
AT4G22400.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 53 3e-07
AT4G22030.1 | Symbols: | F-box family protein with a domain of ... 53 4e-07
AT3G22345.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 52 5e-07
AT2G33200.1 | Symbols: | F-box family protein | chr2:14069541-1... 52 6e-07
AT5G25290.1 | Symbols: | F-box family protein with a domain of ... 51 1e-06
AT1G67160.1 | Symbols: | F-box family protein with a domain of ... 51 2e-06
>AT1G57790.1 | Symbols: | F-box family protein |
chr1:21404578-21405636 REVERSE LENGTH=352
Length = 352
Score = 318 bits (814), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 163/340 (47%), Positives = 227/340 (66%), Gaps = 9/340 (2%)
Query: 36 WSDLPAELLELVMTRLALDDNVRASVVCKRWHSVATAVRVVNQSPWLMYFPKYGDLYEFY 95
W DLP ELL VMT L + DNVRASVVCK W A +VRV+++SPWLMYFP+ + Y+FY
Sbjct: 13 WKDLPLELLSSVMTFLEIKDNVRASVVCKSWFEAAVSVRVIDKSPWLMYFPETKNTYDFY 72
Query: 96 DPVQRKTYSLQMPE-LSGSRVCYTKDGWLLLYRPRTHRVFFFNPFTRDIIKLPRFEM--T 152
DP K Y++++P+ L G V Y+KDGWLL+ + + FNPFT D++ LP +
Sbjct: 73 DPSNCKKYTMELPKSLVGFIVRYSKDGWLLMSQEDSSHFVLFNPFTMDVVALPFLHLFTY 132
Query: 153 YQIVAFSCAPTSPNCVLFTVKHVSPTVVAISTCYPGATEWTTVNFQNRLPFVSSIWNKLV 212
YQ+V FS APTS CV+FT+K P V I T PG T WT++ +++ F+ N +V
Sbjct: 133 YQLVGFSSAPTSSECVVFTIKDYDPGHVTIRTWSPGQTMWTSMQVESQ--FLDVDHNNVV 190
Query: 213 FCNGHFYCLSLTGWLGVFDPSERTWSVLSVPPPKCPENFFAKNWWKGKFMTEHEGDVIVI 272
F NG FYCL+ + VFDPS RTW+VL VPPP+CP++ K+W +GKFM ++GD++VI
Sbjct: 191 FSNGVFYCLNQRNHVAVFDPSLRTWNVLDVPPPRCPDD---KSWNEGKFMVGYKGDILVI 247
Query: 273 YTCSSENPIIFKLDQTLMEWEEMRTLDGVTLFASFLSSHSRTELP-GMMRNSVYFSKVRF 331
T +++P++FKLD T WEE TL +T+F S S SRT + GM+RNSVYF ++ +
Sbjct: 248 RTYENKDPLVFKLDLTRGIWEEKDTLGSLTIFVSRKSCESRTYVKDGMLRNSVYFPELCY 307
Query: 332 YGKRCISFSLDDFRYYPRKQWHDWGEQDPYENIWVEPPKD 371
K+ + +S D+ RY+ R+ DWG+Q +NIW+EPPK+
Sbjct: 308 NEKQSVVYSFDEGRYHLREHDLDWGKQLSSDNIWIEPPKN 347
>AT3G56470.1 | Symbols: | F-box family protein |
chr3:20934815-20936017 FORWARD LENGTH=367
Length = 367
Score = 305 bits (782), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 160/348 (45%), Positives = 218/348 (62%), Gaps = 17/348 (4%)
Query: 34 QSWSDLPAELLELVMTRLALDDNVRASVVCKRWHSVATAVRVVNQSPWLMYFPKYGDLYE 93
Q++ +LP +LL+LV++RL L DN+RAS VCK WH ++RV++ SPWL+YF K D YE
Sbjct: 27 QTFINLPCDLLQLVISRLPLKDNIRASAVCKTWHEACVSLRVIHTSPWLIYFSKTDDSYE 86
Query: 94 FYDPVQRKTYSLQMPELSGSRVCYTKDGWLLLYRPRTHRVFFFNPFTRDIIKLPRFEMTY 153
YDP +K +L PELSG RVCY+KDGWLL+Y P ++++ FFNPFTRD + +P M Y
Sbjct: 87 LYDPSMQKNCNLHFPELSGFRVCYSKDGWLLMYNPNSYQLLFFNPFTRDCVPMPTLWMAY 146
Query: 154 -QIVAFSCAPTSPNCVLFTVKHVSPTVVAISTCYPGATEWTTVNFQNRLPFVSSIWNKLV 212
Q +AFSCAPTS +C+LFTV V+ + I T + A EW T F+NRL + + ++V
Sbjct: 147 DQRMAFSCAPTSTSCLLFTVTSVTWNYITIKTYFANAKEWKTSVFKNRLQRNFNTFEQIV 206
Query: 213 FCNGHFYCLSLTGWLGVFDPSERTWSVLSVPPPKCPENFFAKNWWKGKFMTEHEGDVIVI 272
F NG FYCL+ TG L +FDPS W+VL PPK P + G FMTEH+G++ +I
Sbjct: 207 FSNGVFYCLTNTGCLALFDPSLNYWNVLPGRPPKRPGS-------NGCFMTEHQGEIFLI 259
Query: 273 YTCSSENPIIFKLDQTLMEWEEMRTLDGVTLFASFLSSHSRTELPGM--MRNSVYFSKVR 330
Y NP + KLD T EW E +TL G+T++AS LSS SR E + N + S
Sbjct: 260 YMYRHMNPTVLKLDLTSFEWAERKTLGGLTIYASALSSESRAEQQKQSGIWNCLCLSVFH 319
Query: 331 FYGKRCISFSLDDFRYYPRKQWHDWGEQDPYENIWVEPPK---DFPGF 375
+ + CI + +D+ + W +Q+PYENIW+ PP D P F
Sbjct: 320 GFKRTCIYYKVDE----ESEVCFKWKKQNPYENIWIMPPLNLIDLPLF 363
>AT4G00893.1 | Symbols: | unknown protein; LOCATED IN: chloroplast;
BEST Arabidopsis thaliana protein match is: unknown
protein (TAIR:AT4G12370.1); Has 220 Blast hits to 215
proteins in 11 species: Archae - 0; Bacteria - 0;
Metazoa - 0; Fungi - 0; Plants - 220; Viruses - 0; Other
Eukaryotes - 0 (source: NCBI BLink). |
chr4:379971-381396 REVERSE LENGTH=388
Length = 388
Score = 297 bits (761), Expect = 6e-81, Method: Compositional matrix adjust.
Identities = 158/363 (43%), Positives = 230/363 (63%), Gaps = 14/363 (3%)
Query: 17 ADSRRSGTEVKNENLELQSWSDLPAELLELVMTRLALDDNVRASVVCKRWHSVATAVRVV 76
A + R + ++ + S++DLP+ L+E +M L L DN+RAS CK W+ +VRVV
Sbjct: 26 ATASRKRFQDSSKKIMNPSFADLPSSLIEEIMLLLVLKDNIRASAACKSWYEAGVSVRVV 85
Query: 77 NQSPWLMYFPKYGDLYEFYDPVQRKTYSLQMPELSGSRVCYTKDGWLLLYRPRTHRVFFF 136
++ PWLM FPK G+L+EF DP+ K ++L +PEL+ S VCY++ GWLL+ + ++ VFFF
Sbjct: 86 DKHPWLMCFPKRGNLFEFRDPLHWKLHTLDLPELAESTVCYSRFGWLLMRKASSNDVFFF 145
Query: 137 NPFTRDIIKLPRFEMTYQIVAFSCAPTSPNCVLFTVKHVSPTV--VAISTCYPGATEWTT 194
NPF+RDII LP E+ +Q +AFSC PTS +CVL +K V V V +STC PGAT+W T
Sbjct: 146 NPFSRDIISLPMCELDFQQIAFSCPPTSDDCVLLAIKFVPGEVNRVTVSTCNPGATKWIT 205
Query: 195 VNFQN--RLPFVSSIWNKLVFCNGHFYCLSLTGWLGVFDPSERTWSVLSVPPPKCPENFF 252
+F RL ++ S LV+ FYC + G L F+PS R WS + +CP
Sbjct: 206 NDFPTFLRLFYMQS---NLVYRRDRFYCFNAEGTLYSFEPSYREWSYICADKLRCPYVHE 262
Query: 253 AKNWWKGK--FMTEHEGDVIVIYTCSSENPIIFKLDQTLMEWEEMR--TLDGVTLFASFL 308
+ W GK F+ E +G++ V++TCS+E P+++KL M+W+E+ TLDG+T F SF
Sbjct: 263 NQYMWCGKAVFLVEKKGELFVMFTCSNEKPMVYKLFS--MKWKELSRTTLDGMTFFVSFY 320
Query: 309 SSHSRTELPGMMRNSVYFSKVRFYGKRCISFSLDDFRYYPRKQWHDWGEQDPYENIWVEP 368
+S R LP MRN+VYFS+ + K C+SFS D+ RY K+W W E P +++W++
Sbjct: 321 NSELRNNLPW-MRNNVYFSRFGYNRKHCVSFSFDESRYNTPKEWEQWVELCPPQSLWIDT 379
Query: 369 PKD 371
PK+
Sbjct: 380 PKN 382
>AT4G12370.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G00893.1); Has 118 Blast hits to 113 proteins
in 12 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 118; Viruses - 0; Other Eukaryotes -
0 (source: NCBI BLink). | chr4:7331972-7332874 FORWARD
LENGTH=300
Length = 300
Score = 261 bits (666), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 137/302 (45%), Positives = 184/302 (60%), Gaps = 12/302 (3%)
Query: 83 MYFPKYGDLYEFYDPVQRKTYSLQMPELSGSRVCYTKDGWLLLYRPRTHRVFFFNPFTRD 142
MY PK G+L+E YDP+ +K Y+L +PEL+ S VCY++DGWLL+ + + +FFFNPFTR+
Sbjct: 1 MYLPKRGNLFELYDPLHQKMYTLNLPELAKSTVCYSRDGWLLMRKTISREMFFFNPFTRE 60
Query: 143 IIKLPRFEMTYQIVAFSCAPTSPNCVLFTVKHVSPTVVAISTCYPGATEWTTVNFQNRLP 202
+I +P+ ++Y +AFSCAPTS CVL KHVS + STC+P ATEW T + Q
Sbjct: 61 LINVPKCTLSYDAIAFSCAPTSGTCVLLAFKHVSYRITTTSTCHPKATEWVTEDLQFHRR 120
Query: 203 FVSSIWN--KLVFCNGHFYCLSLTGWLGVFDPSERTWS---VLSVPPPKCPENFFAKNWW 257
F S N +V+ FYCL G L FDPS R W +P P + F +
Sbjct: 121 FRSETLNHSNVVYAKRRFYCLDGQGSLYYFDPSSRRWDFSYTYLLPCPYISDRFSYQYER 180
Query: 258 KGK--FMTEHEGDVIVIYTCSSENPIIFKLDQTLMEWEEMR--TLDGVTLFASFLSSHSR 313
K K F+ +G I+TC E PI+ KL+ + WEE+ T+DG+T+F SS R
Sbjct: 181 KKKRIFLAVRKGVFFKIFTCDGEKPIVHKLED--INWEEINSTTIDGLTIFTGLYSSEVR 238
Query: 314 TELPGMMRNSVYFSKVRFYGKRCISFSLDDFRYYPRKQWHDWGEQDPYENIWVEPPKDFP 373
LP MRNSVYF ++RF KRC+S+SLD+ RYYPRKQW + + P EN+W+ PPK
Sbjct: 239 LNLP-WMRNSVYFPRLRFNVKRCVSYSLDEERYYPRKQWQEQEDLCPIENLWIRPPKKAV 297
Query: 374 GF 375
F
Sbjct: 298 DF 299
>AT4G10660.1 | Symbols: | CDC68-related | chr4:6582535-6583488
FORWARD LENGTH=317
Length = 317
Score = 252 bits (644), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 133/294 (45%), Positives = 188/294 (63%), Gaps = 9/294 (3%)
Query: 64 KRWHSVATAVRVVNQSPWLMYFPKYGDLYEFYDPVQRKTYSLQMPELSGSRVCYTKDGWL 123
K+ + A +VRVV + PW++ FP + DL +DP++RK Y+L +PEL G+ VCY+KDGWL
Sbjct: 3 KKALTTAESVRVVEKHPWVITFPNHEDLTFLFDPLERKRYTLNLPELVGTDVCYSKDGWL 62
Query: 124 LLYRPRTHRVFFFNPFTRDIIKLPRFEMTYQIVAFSCAPTSPNCVLFTVKHVSPTVVAIS 183
L+ R +FFFNP+TR++I LP+ E+ +Q +AFS APTS CV+ ++ + ++ IS
Sbjct: 63 LMRRSSLVDMFFFNPYTRELINLPKCELAFQAIAFSSAPTSGTCVVLALRPFTRYIIRIS 122
Query: 184 TCYPGATEWTTVNFQNRLPFVSSIWNKLVFCNGHFYCLSLTGWLGVFDPSERTWSVLSVP 243
CYPGATEW T F L F + + LV+ N HFYC S G L FD + RT S +
Sbjct: 123 ICYPGATEWITQEFSCSLRFDPYMHSNLVYANDHFYCFSSGGVLVDFDVASRTMSEQAWN 182
Query: 244 PPKCP----ENFFAKNWWKGKFMTEHEGDVIVIYTCSSENPIIFKLDQTLMEWEEMR--T 297
+C +N N K ++ E +G++ ++YTCSSE P+++KL + WEE+ T
Sbjct: 183 EHRCAYMRNDNAEWFNLPKRIYLAEQKGELFLMYTCSSEIPMVYKLVSS--NWEEINSTT 240
Query: 298 LDGVTLFASFLSSHSRTELPGMMRNSVYFSKVRFYGKRCISFSLDDFRYYPRKQ 351
LDGVT+FAS SS +R ++ G MRNSVYF K K C+S+S D+ RYYPRKQ
Sbjct: 241 LDGVTIFASMYSSETRLDVLG-MRNSVYFPKYGLDCKGCVSYSFDEARYYPRKQ 293
>AT1G49360.1 | Symbols: | F-box family protein |
chr1:18267338-18269423 REVERSE LENGTH=481
Length = 481
Score = 206 bits (523), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 145/395 (36%), Positives = 216/395 (54%), Gaps = 56/395 (14%)
Query: 15 AIADSRR--------SGT--EVKNENLELQSWSDLPAELLELVMTRLALDDNVRASVVCK 64
A+AD +R SGT E K E L+ + LP++L+ L+++RL+ DN+R+S VCK
Sbjct: 79 AVADVKRLRTILENNSGTSEECKGEMLKEDLF--LPSDLVRLILSRLSFKDNIRSSTVCK 136
Query: 65 RWHSVATAVRVVNQSPWLMY---FPKYGDLYEFYDPVQ-RKTYSLQMPELS-GSRVCYTK 119
W +A +VRV ++ WL+Y F G Y F+DPV+ +KT + +PELS S + Y+K
Sbjct: 137 AWGDIAASVRVKSRRCWLLYHDAFQDKGVSYGFFDPVEKKKTKEMNLPELSKSSGILYSK 196
Query: 120 DGWLLLYRPRT--HRVFFFNPFTRDIIKLPR---FEMTYQIVAFSCAPTSPNCVLFTVKH 174
DGWLL+ + ++FFNPFTR+ I LPR E + AFSCAPT +C++F + +
Sbjct: 197 DGWLLMNDSLSLIADMYFFNPFTRERIDLPRNRIMESVHTNFAFSCAPTKKSCLVFGINN 256
Query: 175 VSPTV-VAISTCYPGATEWTTVNFQNRLPFVSSIWNKLVFCNGHFYCLSLTGWLGVFDPS 233
+S +V + IST PGAT W +F N P +++ +G FY S T LGVFDP+
Sbjct: 257 ISSSVAIKISTWRPGATTWLHEDFPNLFPSYFRRLGNILYSDGLFYTASETA-LGVFDPT 315
Query: 234 ERTWSVLSVPP-PKCPENFFAKNWWKGKFMTEHEGDVIVIYTCSSENPIIFKLDQTLMEW 292
RTW+VL V P P P + ++MTE+EG + ++ SS P++++L++ W
Sbjct: 316 ARTWNVLPVQPIPMAPRSI--------RWMTEYEGHIFLV-DASSLEPMVYRLNRLESVW 366
Query: 293 EEMRTLDGVTLFASFLSSHSRTELPGMMRNSVYFSKVRFYGKRCIS-----FSLD---DF 344
E+ TLDG ++F S S L G M N +YF RF +R + FS + +
Sbjct: 367 EKKETLDGSSIFLSDGSCVMTYGLTGSMSNILYFWS-RFINERRSTKSPCPFSRNHPYKY 425
Query: 345 RYYPRKQWHD----------WGEQDPYENIWVEPP 369
Y R D WG++ +W+EPP
Sbjct: 426 SLYSRSSCEDPEGYYFEYLTWGQK---VGVWIEPP 457
>AT3G18720.1 | Symbols: | F-box family protein |
chr3:6444433-6445751 REVERSE LENGTH=380
Length = 380
Score = 195 bits (495), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 130/340 (38%), Positives = 181/340 (53%), Gaps = 27/340 (7%)
Query: 38 DLPAELLELVMTRLALDDNVRASVVCKRWHSVATAVRVVNQSPWLMY------FPKYGDL 91
+P +LL+ +++RL L N+ AS+VCK W A +VR PWL Y PK GD
Sbjct: 51 QIPTDLLQEILSRLGLKANIHASLVCKTWLKEAVSVRKFQSRPWLFYPQSQRGGPKEGD- 109
Query: 92 YEFYDPVQRKTYSLQMPELSGSR--VCYTKDGWLLLYRPRTHRVFFFNPFTRDIIKLPRF 149
Y ++P + +T+ L+ PEL+G R + KDGWLL+ + VFF NPFT + I LP+
Sbjct: 110 YVLFNPSRSQTHHLKFPELTGYRNKLACAKDGWLLVVKDNPDVVFFLNPFTGERICLPQV 169
Query: 150 --EMTYQIVAFSCAPTSPNC--VLFTVKHVSPTVVAISTCYPGATEWTTVNFQNRLPFVS 205
T + FS APTS +C + FT + VV + T PG + WTT +F +
Sbjct: 170 PQNSTRDCLTFSAAPTSTSCCVISFTPQSFLYAVVKVDTWRPGESVWTTHHFDQKR--YG 227
Query: 206 SIWNKLVFCNGHFYCLSLTGWLGVFDPSERTWSVLSVPPPKCPENFFAKNWWKGKFMTEH 265
+ N+ +F NG FYCLS +G L VFDPS TW+VL V P C + FMTEH
Sbjct: 228 EVINRCIFSNGMFYCLSTSGRLSVFDPSRETWNVLPVKP--CRAFRRKIMLVRQVFMTEH 285
Query: 266 EGDVIVIYTCSSENP--IIFKLDQTLMEWEEMRTLDGVTLFASFLSSHSRTELPGMMRNS 323
EGD+ V+ T N + FKL+ WEEM+ +G+T+F+S +S +R LP RN
Sbjct: 286 EGDIFVVTTRRVNNRKLLAFKLNLQGNVWEEMKVPNGLTVFSSDATSLTRAGLPEEERNI 345
Query: 324 VYFSKVRFYGKRCISFSLDDFRYYPRKQWHDWGEQDPYEN 363
+Y S + + K S F YY W Q P++N
Sbjct: 346 LYSSDIDDFVKS----SHPTFYYYDCSAWL----QPPHDN 377
>AT4G10695.1 | Symbols: | CDC68-related | chr4:6593774-6594478
REVERSE LENGTH=234
Length = 234
Score = 189 bits (481), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 89/201 (44%), Positives = 128/201 (63%)
Query: 48 MTRLALDDNVRASVVCKRWHSVATAVRVVNQSPWLMYFPKYGDLYEFYDPVQRKTYSLQM 107
M+ L L DN+RAS VC+ W A +VRVV + PW++ FP++ + +DP+ RK+Y+L +
Sbjct: 1 MSYLGLKDNIRASAVCRAWRKAAESVRVVEKHPWVISFPRHYGVTILFDPLGRKSYTLNL 60
Query: 108 PELSGSRVCYTKDGWLLLYRPRTHRVFFFNPFTRDIIKLPRFEMTYQIVAFSCAPTSPNC 167
PEL G+ VCY+KDGWLL+ R +FF NP+TR++I LP+ E+++Q VAFS PTS C
Sbjct: 61 PELVGTDVCYSKDGWLLMRRSSLVDMFFLNPYTRELINLPKCELSFQAVAFSSVPTSGTC 120
Query: 168 VLFTVKHVSPTVVAISTCYPGATEWTTVNFQNRLPFVSSIWNKLVFCNGHFYCLSLTGWL 227
+ ++ + ++ IS C+PGATEW T +F F + + LV+ NGHFYC S G L
Sbjct: 121 AVIALRPFTRFIIRISICFPGATEWITQDFSCSHGFEPYMHSNLVYANGHFYCFSSGGVL 180
Query: 228 GVFDPSERTWSVLSVPPPKCP 248
FD + RT S + CP
Sbjct: 181 VDFDLASRTMSHQAWNEHICP 201
>AT4G12382.2 | Symbols: | F-box family protein |
chr4:7334352-7334768 FORWARD LENGTH=138
Length = 138
Score = 148 bits (374), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 64/126 (50%), Positives = 94/126 (74%)
Query: 35 SWSDLPAELLELVMTRLALDDNVRASVVCKRWHSVATAVRVVNQSPWLMYFPKYGDLYEF 94
S++DLP+ L+E++M+ LAL +N+RAS CK W+ V +VRVV + PWL+ FPK G+L+EF
Sbjct: 9 SFADLPSSLIEVIMSHLALKNNIRASAACKSWYEVGVSVRVVEKHPWLICFPKRGNLFEF 68
Query: 95 YDPVQRKTYSLQMPELSGSRVCYTKDGWLLLYRPRTHRVFFFNPFTRDIIKLPRFEMTYQ 154
DP+ K Y+L +PEL+ S VCY++ GWLL+ + + VFFFNPF+RDII LP+ E+ ++
Sbjct: 69 RDPLHWKLYTLGLPELAESTVCYSRFGWLLMRKATSKDVFFFNPFSRDIISLPKCELAFE 128
Query: 155 IVAFSC 160
+ F C
Sbjct: 129 HITFYC 134
>AT4G12382.1 | Symbols: | F-box family protein |
chr4:7334352-7334768 FORWARD LENGTH=138
Length = 138
Score = 148 bits (374), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 64/126 (50%), Positives = 94/126 (74%)
Query: 35 SWSDLPAELLELVMTRLALDDNVRASVVCKRWHSVATAVRVVNQSPWLMYFPKYGDLYEF 94
S++DLP+ L+E++M+ LAL +N+RAS CK W+ V +VRVV + PWL+ FPK G+L+EF
Sbjct: 9 SFADLPSSLIEVIMSHLALKNNIRASAACKSWYEVGVSVRVVEKHPWLICFPKRGNLFEF 68
Query: 95 YDPVQRKTYSLQMPELSGSRVCYTKDGWLLLYRPRTHRVFFFNPFTRDIIKLPRFEMTYQ 154
DP+ K Y+L +PEL+ S VCY++ GWLL+ + + VFFFNPF+RDII LP+ E+ ++
Sbjct: 69 RDPLHWKLYTLGLPELAESTVCYSRFGWLLMRKATSKDVFFFNPFSRDIISLPKCELAFE 128
Query: 155 IVAFSC 160
+ F C
Sbjct: 129 HITFYC 134
>AT2G44735.1 | Symbols: | BEST Arabidopsis thaliana protein match
is: F-box family protein (TAIR:AT3G18720.1); Has 82
Blast hits to 82 proteins in 10 species: Archae - 0;
Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 82;
Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).
| chr2:18438623-18441641 FORWARD LENGTH=367
Length = 367
Score = 114 bits (284), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 71/162 (43%), Positives = 99/162 (61%), Gaps = 11/162 (6%)
Query: 136 FNPFTRDIIKLP--RFEMTYQIVAFSCAPTSPNCVLFTVKHV-SPTVVAISTCYPGATEW 192
NPFTR+ LP R E + +AFS APTSP+C++ + + S V I T PG TEW
Sbjct: 110 LNPFTRESFYLPPRRHEHRSRFLAFSAAPTSPSCMVISYTQLRSCGSVLIDTWRPGETEW 169
Query: 193 TTVNFQNRLPFVSSIWNKLVFCNGHFYCLSLTGWLGVFDPSERTWSVLSVPPPKCPENF- 251
TT F+N+LPF W+K VF NG F+CLS G+LGVFDPS+ TW++L V P CP +
Sbjct: 170 TTHCFENQLPF--RYWSKCVFSNGMFFCLSECGYLGVFDPSKATWNILPVKP--CPAFYQ 225
Query: 252 FAKNWWKGK-FMTEHEGDV--IVIYTCSSENPIIFKLDQTLM 290
F ++ FMTEHEGD+ I+++ + ++DQ ++
Sbjct: 226 FEFDYTHNPVFMTEHEGDIFDIIMFRLCNCRSYCGEVDQDIV 267
>AT4G10700.1 | Symbols: | BEST Arabidopsis thaliana protein match
is: CDC68-related (TAIR:AT4G10660.1); Has 1807 Blast
hits to 1807 proteins in 277 species: Archae - 0;
Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385;
Viruses - 0; Other Eukaryotes - 339 (source: NCBI
BLink). | chr4:6597863-6598658 REVERSE LENGTH=247
Length = 247
Score = 102 bits (253), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 54/121 (44%), Positives = 76/121 (62%), Gaps = 7/121 (5%)
Query: 255 NWWKGKFMTEHEGDVIVIYTCSSENPIIFKLDQTLMEWEEMRTLDGVTLFASFLSSHSRT 314
N K ++ E +G++ ++YTCSSE P+++KL + +E TLDGVT+FAS SS +R
Sbjct: 119 NLPKRIYLAEQKGELFLMYTCSSEIPMVYKLVSSNLEEMNSTTLDGVTIFASMYSSETRL 178
Query: 315 ELPGMMRNSVYFSKVRFYGKRCISFSLDDFRYYPRKQ------WHDWGEQDPYENIWVEP 368
++ G MRNSVYF K K C+S+S D+ RYYPRKQ W E P ++W+EP
Sbjct: 179 DVLG-MRNSVYFPKYGLDCKGCVSYSFDEARYYPRKQFPKPTMWQMQKELCPLRSLWIEP 237
Query: 369 P 369
P
Sbjct: 238 P 238
Score = 99.8 bits (247), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 41/88 (46%), Positives = 65/88 (73%)
Query: 64 KRWHSVATAVRVVNQSPWLMYFPKYGDLYEFYDPVQRKTYSLQMPELSGSRVCYTKDGWL 123
K+ + A +VRVV + PW++ FP + D+ +DP++RK Y+L +PE+ G+ VCY+KDGWL
Sbjct: 3 KKALTTAESVRVVEKHPWVITFPNHEDMTFLFDPLERKRYTLNLPEVVGTDVCYSKDGWL 62
Query: 124 LLYRPRTHRVFFFNPFTRDIIKLPRFEM 151
L+ R +FFFNP+TR++I LP+F++
Sbjct: 63 LMRRSSLVDMFFFNPYTRELINLPKFDL 90
>AT4G12810.1 | Symbols: | F-box family protein |
chr4:7522831-7523979 REVERSE LENGTH=382
Length = 382
Score = 68.6 bits (166), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 91/336 (27%), Positives = 136/336 (40%), Gaps = 49/336 (14%)
Query: 22 SGTEVKNENLELQSWSDLPAELLELVMTRLALDDNVRASVVCKRWHSVA-TAVRVVN-QS 79
S +E K N E S L +L+ L++ RL+ D RA V W+ + T + V N +
Sbjct: 11 SDSEKKTFN-EDSKHSILAVDLVRLILERLSFVDFHRARCVSSIWYIASKTVIGVTNPTT 69
Query: 80 PWLMYFPKYGDLY------EFYDPVQRKTYSLQMP--ELSGSRVCYTKDGWLLLYRPRTH 131
PWL+ FPK GD+ + YDP + KTY ++ +L SR + W L+ RT
Sbjct: 70 PWLILFPK-GDVEIKKDSCKLYDPHENKTYIVRDLGFDLVTSRCLASSGSWFLMLDHRTE 128
Query: 132 RVFFFNPFTRDIIKLPRFEMTYQIVAFSCAPTSPNCVLFTVKHVSPTVVA--ISTCY--- 186
N FTR I LP E T N VL+ + +V IS+ +
Sbjct: 129 -FHLLNLFTRVRIPLPSLESTR-----GSDIKIGNAVLWVDEQRKDYLVVWNISSLFGYH 182
Query: 187 -PGATEWTTVN-FQNRLPFVSSIWNKLVFCNGHFYCLSLTGWLGVFDPSERTWSVLSVPP 244
G W +N ++ +VF Y LS+ G + VF S V
Sbjct: 183 KKGDDRWKVFKPLENERCIIA-----MVFKENKLYVLSVDGNVDVFYFSGNDSPVRCATL 237
Query: 245 PKCPENFFAKNWWKG-KFMTEHEGDVIVIYTCSSENP-------IIFKLDQTLMEWEEMR 296
P P KG K + G+V++I P ++K+D WE ++
Sbjct: 238 PSSPLR-------KGHKVVVTLSGEVLIIVAKVEPYPRTRLCFFAVYKMDPKSSRWETIK 290
Query: 297 TLDGVTLFASFLSSHSRTELPGMMRNSVYFSKVRFY 332
+L G L T +M+N +YFS +F+
Sbjct: 291 SLAGEALILDL----GITVEAKVMKNCIYFSNDQFH 322
>AT4G22660.1 | Symbols: | F-box family protein with a domain of
unknown function (DUF295) | chr4:11914981-11916171
REVERSE LENGTH=396
Length = 396
Score = 67.0 bits (162), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 46/133 (34%), Positives = 65/133 (48%), Gaps = 12/133 (9%)
Query: 27 KNENLELQSWSDLPAELLELVMTRLALDDNVRASVVCKRWHSVATAVRVVNQSPWLMYFP 86
KN+N +WSDLP +LL LV RL+ + +A VC W+S + NQ PWLM FP
Sbjct: 3 KNQNP--NTWSDLPLDLLNLVFKRLSFANFRQAKSVCSSWYSASKQSVPKNQIPWLMLFP 60
Query: 87 KYGDLYE------FYDPVQRKTYSLQMPELS---GSRVCYTKDGWLLLYRPRTHRVFFFN 137
K + + F++P + K Q +L VC G LL + + ++ N
Sbjct: 61 KDKNNNKNSSCTIFFNP-EDKDQLYQTQDLGVEFAKSVCLATYGSWLLMQDSKYNLYILN 119
Query: 138 PFTRDIIKLPRFE 150
PFT + I LP E
Sbjct: 120 PFTYEKIGLPAIE 132
>AT4G22180.1 | Symbols: | F-box family protein with a domain of
unknown function (DUF295) | chr4:11738574-11739782
FORWARD LENGTH=402
Length = 402
Score = 62.0 bits (149), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 43/128 (33%), Positives = 62/128 (48%), Gaps = 7/128 (5%)
Query: 34 QSWSDLPAELLELVMTRLALDDNVRASVVCKRWHSVATAVRVVNQSPWLMYFPKY--GDL 91
SWS+LP +LL V RL+ + RA VC WHS + V Q PWL+ FP+Y +
Sbjct: 19 NSWSELPLDLLTAVFERLSYANFQRAKSVCSSWHS-GSRQSVPIQIPWLILFPEYDNNNS 77
Query: 92 YEFYDPVQRKTYSLQMPELS---GSRVCYTKDGWLLLYRPRTHRVFFFNPFTRDIIKLPR 148
++P + K +M +L VC G LL R + ++ N FT + + LP
Sbjct: 78 CTLFNP-EEKGQVYKMKDLGVEFSKSVCTATYGSWLLMRDPLYNLYILNLFTHERVNLPP 136
Query: 149 FEMTYQIV 156
FE +V
Sbjct: 137 FESQLGMV 144
>AT5G66830.1 | Symbols: | F-box family protein with a domain of
unknown function (DUF295) | chr5:26691304-26692488
FORWARD LENGTH=394
Length = 394
Score = 60.8 bits (146), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 82/344 (23%), Positives = 136/344 (39%), Gaps = 63/344 (18%)
Query: 36 WSDLPAELLELVMTRLALDDNVRASVVCKRWHSVATAVRVVNQSPWLMYFPKYGDLYEFY 95
WS LP++L++ V RL D RA VC W SV+ + NQ PW++ FPK + +
Sbjct: 20 WSKLPSDLMQFVFDRLGFADFQRAKSVCSSWLSVSRNSQPNNQIPWMIRFPKDNNHCLLF 79
Query: 96 DPVQRKTYSLQMPELS---GSRVCYTKDGWLLLYRPRT-----------HRVFFFNPFTR 141
+P + + + P L C G LL +P + + ++ + TR
Sbjct: 80 NP-EEEDKMYKTPNLGNDFAKSSCIASYGSWLLMQPESEYMEEDLDHQCNNLYILDLLTR 138
Query: 142 DIIKLPRFEMTYQIVAFSCAPTSPNCVLFTVKHVSPTVVAISTCYPGATEWTTVNFQ--- 198
+ I LP + + + C + S + I A E ++F+
Sbjct: 139 ERINLPILQPEFGLT----------CPILWTDEKSKDHLVIGM----AHEELAISFKKGD 184
Query: 199 ---NRLPFVSSIWN--KLVFCNGHFYCLSLTGWLGVFDPS--------ERTWSVLSVPP- 244
++P +S I +VF + YCLS L VFD S + + S L P
Sbjct: 185 SSWKQIPTLSGIEECFSMVFKDHKLYCLS-NYKLKVFDFSGDIPVKVFKTSVSKLLNNPL 243
Query: 245 --------PKCPENFFAKNWWKGKFMTEHEGDVIVIYTCS--SENPI----IFKLDQTLM 290
P P N +K + G V+++ C S + I I+K++
Sbjct: 244 CISMRMRLPGIPMK-DQLNHFKDDMVVTLAGHVLIV-KCHRPSLSKIWSFEIYKMEGNNN 301
Query: 291 EWEEMRTLDGVTLFASFLSSHSRTELPGMMRNSVYFSKVRFYGK 334
+WE+ +L T+ + ++ G+ NS+YFS Y K
Sbjct: 302 KWEKTVSLGDETILLDLGITVLAKDMQGIKANSIYFSNPTPYFK 345
>AT4G22060.1 | Symbols: | F-box family protein with a domain of
unknown function (DUF295) | chr4:11687620-11688819
FORWARD LENGTH=399
Length = 399
Score = 59.3 bits (142), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 40/123 (32%), Positives = 56/123 (45%), Gaps = 5/123 (4%)
Query: 35 SWSDLPAELLELVMTRLALDDNVRASVVCKRWHSVATAVRVVNQSPWLMYFPKYGDLY-- 92
SWS LP +LL +V RL D R VC W + Q PWL+ FP+ G +
Sbjct: 12 SWSKLPLDLLIMVFERLGFVDFQRTKSVCLAWLYASRMSAPNKQIPWLIMFPEKGKDFCL 71
Query: 93 EFYDPVQRKTYSLQM--PELSGSRVCYTKDGWLLLYRPRTHRVFFFNPFTRDIIKLPRFE 150
F + K Y +Q E + S WL + PR ++++ N FTR+ I LP E
Sbjct: 72 LFNSEEKEKIYRIQNLGVEFANSHCLAIYGSWLFMRDPR-YKLYIMNLFTRERINLPSVE 130
Query: 151 MTY 153
+
Sbjct: 131 SQF 133
>AT4G22170.1 | Symbols: | F-box family protein with a domain of
unknown function (DUF295) | chr4:11736653-11737744
FORWARD LENGTH=363
Length = 363
Score = 58.2 bits (139), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 43/129 (33%), Positives = 61/129 (47%), Gaps = 9/129 (6%)
Query: 34 QSWSDLPAELLELVMTRLALDDNVRASVVCKRWHSVATAVRVVNQSPWLMYFP-----KY 88
SWSDLP +LL LV RL+ + RA VC W+S + NQ WL+ FP K
Sbjct: 8 NSWSDLPHDLLNLVFERLSFANFNRARSVCSSWYSASRQSVPKNQIHWLILFPEDNNNKN 67
Query: 89 GDLYEFYDPVQR-KTYSLQ-MPELSGSRVCYTKDG-WLLLYRPRTHRVFFFNPFTRDIIK 145
++P ++ K Y Q + E VC G W L+ P ++ N FTR+ I
Sbjct: 68 NSSCTLFNPDEKDKLYKTQHLDEEFAKSVCRATYGSWFLMVDP-LFNLYILNLFTRERIN 126
Query: 146 LPRFEMTYQ 154
L E+ ++
Sbjct: 127 LHPVELLWK 135
>AT4G18320.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G18310.1); Has 30201 Blast hits to 17322
proteins in 780 species: Archae - 12; Bacteria - 1396;
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
0; Other Eukaryotes - 2996 (source: NCBI BLink). |
chr4:10124728-10125978 FORWARD LENGTH=307
Length = 307
Score = 56.6 bits (135), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 34/126 (26%), Positives = 60/126 (47%), Gaps = 11/126 (8%)
Query: 33 LQSWSDLPAELLELVMTRLALDDNVRASV--VCKRWHSVATAVR---VVNQSPWLMYFPK 87
+ SW LP +LL L + ++ V W + + R V + PWL+Y +
Sbjct: 1 MDSWGSLPQDLLRSSTNLLNPNFTEGGALRGVNTHWRNTFLSFREGGFVEERPWLLYRER 60
Query: 88 YGDLYEFYDPVQRKTYSLQMPELSGSRVCYTKDGWLLL------YRPRTHRVFFFNPFTR 141
+ FYDPV+ + + + P L G++ + GW+++ +R + H+ F +NPFT
Sbjct: 61 GSRVAHFYDPVREELHHGKDPYLEGAKFLGSTLGWVVMSNSSAIHRRQDHQAFLYNPFTS 120
Query: 142 DIIKLP 147
+LP
Sbjct: 121 QRYQLP 126
>AT4G22165.1 | Symbols: | F-box family protein with a domain of
unknown function (DUF295) | chr4:11734328-11735419
FORWARD LENGTH=363
Length = 363
Score = 56.6 bits (135), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 89/334 (26%), Positives = 136/334 (40%), Gaps = 68/334 (20%)
Query: 34 QSWSDLPAELLELVMTRLALDDNVRASVVCKRWHSVATAVRVVNQSPWLMYFPKYGD--- 90
+WS+LP +LL LV RL+L + RA VC +SV+ Q L+ FPK +
Sbjct: 8 NTWSELPLDLLNLVFKRLSLVNFQRAKSVCSTRYSVSRQCVPERQIALLILFPKEDNTDN 67
Query: 91 -LYEFYDPVQR-KTYSLQ--MPELSGSRVCYTKDGWLLLYRPRTHRVFFFNPFTRDIIKL 146
+ ++P ++ K Y +Q E + S T WLL+ + H ++ N FTR I L
Sbjct: 68 STCKLFNPDEKDKLYKMQDLGVEFAKSVCRATYGSWLLMQDSKYH-LYILNIFTRKRINL 126
Query: 147 PRFEMTYQIVA---------------FSCAPTSPNCVLFTVKHVSPT--------VVAIS 183
P E +V +S + +SP +F + S V +
Sbjct: 127 PPVESQLGMVKIERTIYDWFHFSHGHYSFSLSSP---VFWIDEESKDYIVMWGLGVYCVV 183
Query: 184 TCYPGATEWTTVNFQNRLPFVSSIWNKLVFCNGHFYCLSLTGWLGVFDPSE----RTWSV 239
G T W N++P S ++ +V+ + Y LS TG + D SE +T V
Sbjct: 184 YAKKGDTSW------NQIPQTSYFYD-MVYKDHKLYFLSSTGTFQILDFSEEMDNKTSKV 236
Query: 240 LSVPPPKCPENFFAK-----NWWKGKFMTEHEGDVIVIYTCSSENPIIFKLDQTLMEWEE 294
+ + K K W+ + T V I + E KLD +E
Sbjct: 237 VCLLDRKLVVTVTGKALKVAKMWRPTYRT-WSFRVFKISSSGYE-----KLDSL---GDE 287
Query: 295 MRTLD-GVTLFASFLSSHSRTELPGMMRNSVYFS 327
LD G+T+ AS ++ G RNS+YFS
Sbjct: 288 ALLLDLGITVLAS--------DVEGFKRNSIYFS 313
>AT4G18320.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: chloroplast;
BEST Arabidopsis thaliana protein match is: unknown
protein (TAIR:AT4G18310.1); Has 37 Blast hits to 37
proteins in 9 species: Archae - 0; Bacteria - 0; Metazoa
- 0; Fungi - 0; Plants - 37; Viruses - 0; Other
Eukaryotes - 0 (source: NCBI BLink). |
chr4:10125016-10125978 FORWARD LENGTH=320
Length = 320
Score = 55.8 bits (133), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 34/124 (27%), Positives = 59/124 (47%), Gaps = 11/124 (8%)
Query: 35 SWSDLPAELLELVMTRLALDDNVRASV--VCKRWHSVATAVR---VVNQSPWLMYFPKYG 89
SW LP +LL L + ++ V W + + R V + PWL+Y +
Sbjct: 16 SWGSLPQDLLRSSTNLLNPNFTEGGALRGVNTHWRNTFLSFREGGFVEERPWLLYRERGS 75
Query: 90 DLYEFYDPVQRKTYSLQMPELSGSRVCYTKDGWLLL------YRPRTHRVFFFNPFTRDI 143
+ FYDPV+ + + + P L G++ + GW+++ +R + H+ F +NPFT
Sbjct: 76 RVAHFYDPVREELHHGKDPYLEGAKFLGSTLGWVVMSNSSAIHRRQDHQAFLYNPFTSQR 135
Query: 144 IKLP 147
+LP
Sbjct: 136 YQLP 139
>AT2G33190.1 | Symbols: | F-box family protein with a domain of
unknown function (DUF295) | chr2:14067972-14069111
FORWARD LENGTH=379
Length = 379
Score = 54.7 bits (130), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 38/123 (30%), Positives = 58/123 (47%), Gaps = 8/123 (6%)
Query: 34 QSWSDLPAELLELVMTRLALDDNVRASVVCKRWHSVATAVRVVNQSPWLMYFPKYGDLYE 93
WS L +LL + L+ D RA VC W++V+ R PW + F G
Sbjct: 6 NGWSKLYPDLLRSIFESLSCLDFHRAGTVCSNWYAVS---RSCPLYPWRIVFR--GKNSV 60
Query: 94 FYDPVQRKTYSLQMPELSGSRV-CYTKDG-WLLLYRPRTHRVFFFNPFTRDIIKLPRFEM 151
+DP+Q K Y+ + + S++ C G W+L+ PR + N FTR+ I LP E
Sbjct: 61 LFDPIQDKIYTKNLLGIDLSKIHCLASYGNWILIVDPRLD-FYLLNVFTRETINLPSLES 119
Query: 152 TYQ 154
+ +
Sbjct: 120 SLR 122
>AT1G69090.1 | Symbols: | Protein of unknown function (DUF295) |
chr1:25977501-25978706 REVERSE LENGTH=401
Length = 401
Score = 53.9 bits (128), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 41/129 (31%), Positives = 61/129 (47%), Gaps = 15/129 (11%)
Query: 36 WSDLPAELLELVMTRLALDDNVRASVVCKRWHSVATAVRVVNQSPWLMYFPKYGDLYEFY 95
WS LP +L++LV RLA D RA VC W + + NQ PW++ FP + +
Sbjct: 29 WSKLPLDLMQLVFERLAFLDFERAKSVCSSWQFGSKQSKPNNQIPWMILFPTDKNYCLLF 88
Query: 96 DPVQR-KTYSLQM--PELSGSRVCYTKDGWLLLYRPRTHRV------FFFNPFTRDI--- 143
+P + K Y Q + + S V T WLL+ +PR + F+ + +D+
Sbjct: 89 NPEDKEKLYKTQHLGDDFAKSIVLATYRSWLLM-QPRYEELEDQTLDQEFHLYIKDLLTC 147
Query: 144 --IKLPRFE 150
I LP FE
Sbjct: 148 ERINLPAFE 156
>AT3G61590.2 | Symbols: HWS | Galactose oxidase/kelch repeat
superfamily protein | chr3:22792914-22794149 FORWARD
LENGTH=411
Length = 411
Score = 53.5 bits (127), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 42/146 (28%), Positives = 67/146 (45%), Gaps = 16/146 (10%)
Query: 21 RSGTEVKNENLELQSWSDLPAELLELVMTRLALDDNVRASVVCKRWHSVATAVRV----- 75
+S E K E + S LP +LLE +++ L + RA VCKRW+ + ++ R
Sbjct: 27 QSDDEAKVETFSMDSL--LPDDLLERILSFLPIASIFRAGTVCKRWNEIVSSRRFLCNFS 84
Query: 76 ---VNQSPWLMYFPKYGDLYEF-YDPVQRKTYSLQMPELSGSRVCYTKDGWLLLYRPRT- 130
V+Q PW F D + YDP+ RK YS +P + S L+ +
Sbjct: 85 NNSVSQRPWYFMFTTTDDPSGYAYDPIIRKWYSFDLPCIETSNWFVASSCGLVCFMDNDC 144
Query: 131 -HRVFFFNPFT---RDIIKLPRFEMT 152
++++ NP T R +I+ P + T
Sbjct: 145 RNKIYVSNPITKQWRTLIEPPGHKST 170
>AT3G61590.1 | Symbols: HWS, HS | Galactose oxidase/kelch repeat
superfamily protein | chr3:22792914-22794149 FORWARD
LENGTH=411
Length = 411
Score = 53.5 bits (127), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 42/146 (28%), Positives = 67/146 (45%), Gaps = 16/146 (10%)
Query: 21 RSGTEVKNENLELQSWSDLPAELLELVMTRLALDDNVRASVVCKRWHSVATAVRV----- 75
+S E K E + S LP +LLE +++ L + RA VCKRW+ + ++ R
Sbjct: 27 QSDDEAKVETFSMDSL--LPDDLLERILSFLPIASIFRAGTVCKRWNEIVSSRRFLCNFS 84
Query: 76 ---VNQSPWLMYFPKYGDLYEF-YDPVQRKTYSLQMPELSGSRVCYTKDGWLLLYRPRT- 130
V+Q PW F D + YDP+ RK YS +P + S L+ +
Sbjct: 85 NNSVSQRPWYFMFTTTDDPSGYAYDPIIRKWYSFDLPCIETSNWFVASSCGLVCFMDNDC 144
Query: 131 -HRVFFFNPFT---RDIIKLPRFEMT 152
++++ NP T R +I+ P + T
Sbjct: 145 RNKIYVSNPITKQWRTLIEPPGHKST 170
>AT4G22400.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G18320.2); Has 44 Blast hits to 44 proteins in
7 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi
- 0; Plants - 44; Viruses - 0; Other Eukaryotes - 0
(source: NCBI BLink). | chr4:11816829-11817812 FORWARD
LENGTH=327
Length = 327
Score = 53.1 bits (126), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 59/126 (46%), Gaps = 11/126 (8%)
Query: 37 SDLPAELLELVMTRLALD--DNVRASVVCKRWHSVATAVR---VVNQSPWLMYFPKYGDL 91
+LPA+L+ + L+ + R + K W + R + PWL+Y +
Sbjct: 18 GNLPADLIRKCTDLMDLNFAEGQRLRTLNKNWKLALPSFRKGAYREERPWLLYRERGSGE 77
Query: 92 YEFYDPVQRKTYSLQMPELSGSRVCYTKDGWLLL-----YRPR-THRVFFFNPFTRDIIK 145
F+DPV+ + + P L+ +R + GWL++ PR TH+ F +NPF ++ +
Sbjct: 78 TRFFDPVRERVHRGNDPRLADARFLGSTLGWLVMSESLDLNPRKTHQTFLYNPFISELQQ 137
Query: 146 LPRFEM 151
LP +
Sbjct: 138 LPELTL 143
>AT4G22030.1 | Symbols: | F-box family protein with a domain of
unknown function (DUF295) | chr4:11672396-11675575
FORWARD LENGTH=626
Length = 626
Score = 52.8 bits (125), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 27/73 (36%), Positives = 38/73 (52%), Gaps = 4/73 (5%)
Query: 35 SWSDLPAELLELVMTRLALDDNVRASVVCKRWHSVATAVRVVNQSPWLMYFPKYGDLYEF 94
SWS LP++LL +V RL D RA VC W + NQ PWL+ FP+ G F
Sbjct: 409 SWSKLPSDLLNMVFERLGFADFQRAKSVCPSWLDASRQSASKNQIPWLIMFPEKG----F 464
Query: 95 YDPVQRKTYSLQM 107
+ + R+ + Q+
Sbjct: 465 FGDIPRQIFETQV 477
>AT3G22345.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; BEST Arabidopsis thaliana
protein match is: F-box family protein with a domain of
unknown function (DUF295) (TAIR:AT3G03730.1); Has 30201
Blast hits to 17322 proteins in 780 species: Archae -
12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422;
Plants - 5037; Viruses - 0; Other Eukaryotes - 2996
(source: NCBI BLink). | chr3:7899215-7900219 FORWARD
LENGTH=334
Length = 334
Score = 52.4 bits (124), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 36/115 (31%), Positives = 52/115 (45%), Gaps = 4/115 (3%)
Query: 39 LPAELLELVMTRLALDDNVRASVVCKRWHSVATAVRVVNQSPWLMYFPKYGDLY-EFYDP 97
L +L+ + RL + RA + W+S A N +PWL+ F Y + +DP
Sbjct: 22 LAPDLIRSIFERLNFAEFHRAMSISLDWYSTAELCYRQNPTPWLILFSNYRHISCRLFDP 81
Query: 98 VQRKTYSLQMPELSGSRVC--YTKDGWLLLYRPRTHRVFFFNPFTRDIIKLPRFE 150
+ KTY ++ R C T WLL+ RT + N FTR+ I LP E
Sbjct: 82 LHDKTYVIRDLGFDFHRSCCLATSGSWLLMLDHRTD-FYLLNLFTRERICLPTLE 135
>AT2G33200.1 | Symbols: | F-box family protein |
chr2:14069541-14070671 FORWARD LENGTH=376
Length = 376
Score = 52.0 bits (123), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 35/117 (29%), Positives = 52/117 (44%), Gaps = 6/117 (5%)
Query: 36 WSDLPAELLELVMTRLALDDNVRASVVCKRWHSVATAVRVVNQSPWLMYFPKYGDLYEFY 95
WS L ++L L++ L D RA VC W++ +T + PW + F K +
Sbjct: 8 WSKLCHDILRLILESLHYKDYHRARTVCSNWYTASTTCK-RPLYPWRIKFNKIST--SLF 64
Query: 96 DPVQRKTYSLQMP--ELSGSRVCYTKDGWLLLYRPRTHRVFFFNPFTRDIIKLPRFE 150
DP + K + +Q P E S V + W L+ + N FTR+ I LP E
Sbjct: 65 DPREDKIHEIQHPGIEFSDRNVLASCSNWFLMVDSGL-EFYLLNAFTRERINLPSME 120
>AT5G25290.1 | Symbols: | F-box family protein with a domain of
unknown function (DUF295) | chr5:8778592-8779785 FORWARD
LENGTH=397
Length = 397
Score = 51.2 bits (121), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 33/113 (29%), Positives = 57/113 (50%), Gaps = 4/113 (3%)
Query: 36 WSDLPAELLELVMTRLALDDNVRASVVCKRWHSVATAVRV-VNQSPWLMYFPKYGDLYEF 94
WS++P ++L V RL+ D RA +VC W+S + + +SP ++ F GD
Sbjct: 14 WSEIPMDILRSVFERLSFVDLHRAKIVCSHWYSCSKQSFLRKTRSPLVILFSDDGDC-TL 72
Query: 95 YDPVQRKTYSLQMPELSGSRVCYTKDGWLLLYRPRTHRVFFFNPFTRDIIKLP 147
Y+P + + Y + +LS R W L+ PR++ ++ + F+ I LP
Sbjct: 73 YNPEEARVYKSKR-DLSRYRFLANSGNWFLVLDPRSN-LYIIDLFSEKKINLP 123
>AT1G67160.1 | Symbols: | F-box family protein with a domain of
unknown function (DUF295) | chr1:25124819-25127098
REVERSE LENGTH=450
Length = 450
Score = 50.8 bits (120), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 36/135 (26%), Positives = 65/135 (48%), Gaps = 9/135 (6%)
Query: 24 TEVKNENLELQSWSDLPAELLELVMTRLALDDNVRASVVCKRWHSVATAVRVVNQ--SPW 81
++ +N+E + + +L+ L++ RL+ D RA V W+ + +V V +PW
Sbjct: 2 SDSDGKNMEGRFEAAYIVDLVRLILERLSFVDFHRARCVSSTWYVASKSVIGVTNPTTPW 61
Query: 82 LMYFP----KYGDLYEFYDPVQRKTYSLQMP--ELSGSRVCYTKDGWLLLYRPRTHRVFF 135
++ FP + + +DP + KTY ++ ++S SR + W L++ R
Sbjct: 62 IILFPNKNVENNGSCKLFDPHENKTYIIRDLGFDMSTSRCLASSGSWFLMFDHRAD-FHL 120
Query: 136 FNPFTRDIIKLPRFE 150
N FTR+ I LP E
Sbjct: 121 LNLFTRERILLPSLE 135