Miyakogusa Predicted Gene
- Lj1g3v4107070.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v4107070.1 Non Chatacterized Hit- tr|C5YRP2|C5YRP2_SORBI
Putative uncharacterized protein Sb08g002930
OS=Sorghu,27.75,2e-18,DUF668,Protein of unknown function DUF668;
DUF3475,Protein of unknown function DUF3475; SUBFAMILY NO,CUFF.31950.1
(451 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT5G51670.1 | Symbols: | Protein of unknown function (DUF668) |... 250 2e-66
AT5G04550.1 | Symbols: | Protein of unknown function (DUF668) |... 221 5e-58
AT3G23160.1 | Symbols: | Protein of unknown function (DUF668) |... 161 9e-40
AT1G34320.1 | Symbols: | Protein of unknown function (DUF668) |... 110 2e-24
AT1G30755.1 | Symbols: | Protein of unknown function (DUF668) |... 82 1e-15
AT5G08660.1 | Symbols: | Protein of unknown function (DUF668) |... 75 1e-13
>AT5G51670.1 | Symbols: | Protein of unknown function (DUF668) |
chr5:20993560-20995096 FORWARD LENGTH=474
Length = 474
Score = 250 bits (638), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 155/433 (35%), Positives = 231/433 (53%), Gaps = 49/433 (11%)
Query: 24 ISISSFEIGSLMLKLVNLWQSLSDKQIFKLRKEITNSFGIRKLVSGDEYFINRLICAELH 83
+ + SFE+ +M KL++L SL+D + R + G+ K+V+GDE F L+CAEL
Sbjct: 37 VGVLSFEVARVMTKLLHLTHSLTDSNLLTPRDHSLSLEGLTKIVNGDETFHLSLVCAELA 96
Query: 84 ENLAHVADSVAWHGKKCSDPILKGFEKAFKKFITMGDDQYGWEFHGEVMEKNIKKMEGFI 143
++LAH A+SV+ +C+ L+ F + F +F MG D +GW + + E KK+E ++
Sbjct: 97 DSLAHAANSVSRLSNRCTTASLRSFHRLFHEFADMGRDPHGWVMNCKDTEAKNKKIERYV 156
Query: 144 SSNGSLFEALVE----QNPIR-------------LTDNGESGVLKSIAYQKIYRWKRQEV 186
S +L+ + E +N +R + V+K I Q ++Q V
Sbjct: 157 SVTTALYREMEEMAILENSLRKQSLQIGIEFEEEEDYENKKDVMKVIDLQNKIERQKQHV 216
Query: 187 KRLKNISLWTRTYDYTINLLARSLFTIYRKINHVFG----------------------IH 224
K LK+ SLW +++D + +LARS+FT ++ VF
Sbjct: 217 KYLKDRSLWNKSFDTVVLILARSVFTALARLKSVFSSAAATGYMVPTVVSSLPRSLSSSS 276
Query: 225 EMVHLGGSVLNSDLTYRSQSISALLQYLSHPLQNILPPKTLGAAALALHYANIIIVIEKL 284
++L N + ++ + SA L+ S L+ P TLG A +ALHYAN+I+V+EK+
Sbjct: 277 SSMNLVHPSPNDEERDKTTTSSAFLEESSRLLKP--PETTLGGAGVALHYANLIVVMEKM 334
Query: 285 SASPHSISLEAREDLYNMLPRCLRASLKAKLKPCTKTMASSSLYDPSLEEWTQELSGILE 344
P + L+AR+DLY+MLP +R+SL+++LK T L EW L IL
Sbjct: 335 IKQPQLVGLDARDDLYSMLPASVRSSLRSRLKGVGFTATDGGLA----TEWKAALGRILR 390
Query: 345 WLALLAHHMISWHSEKSFNQQIFV----SQRNVLLVQTLYFADQEKTEKVITELLVCLNY 400
WL LA +MI W SE+SF QQ SQ V+LVQTL FAD+ KTE ITELLV LNY
Sbjct: 391 WLLPLAQNMIRWQSERSFEQQHMATATNSQNRVMLVQTLVFADKVKTEAAITELLVGLNY 450
Query: 401 VCKYGRELNAKAL 413
+ ++ RE+ AKAL
Sbjct: 451 IWRFEREMTAKAL 463
>AT5G04550.1 | Symbols: | Protein of unknown function (DUF668) |
chr5:1303757-1305556 REVERSE LENGTH=599
Length = 599
Score = 221 bits (564), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 128/262 (48%), Positives = 167/262 (63%), Gaps = 9/262 (3%)
Query: 1 MVAESWFHRLWRIPQKFGA--DSPRISISSFEIGSLMLKLVNLWQSLSDKQIFKLRKEIT 58
MV+E+WF LWR P+K D + + +FE+ SL+ KLV+LWQSLSDK + +LR EIT
Sbjct: 1 MVSETWFRNLWRFPKKHDGHKDKAVLGVLAFEVASLLSKLVHLWQSLSDKNVARLRDEIT 60
Query: 59 NSFGIRKLVSGDEYFINRLICAELHENLAHVADSVAWHGKKCSDPILKGFEKAFKKFITM 118
+S GI+KLVS D+ FI RLI E+ EN+ +VA +VA +KC+DP LK FE F +
Sbjct: 61 HSTGIKKLVSEDDDFIVRLIRDEMMENVENVAKAVARLARKCNDPKLKCFENCFSDMMKT 120
Query: 119 GDDQYGWEFHGEVMEKNIKKMEGFISSNGSLFE-----ALVEQNPIRLTDNGESGVLKSI 173
G D YGW+F + M+K KKME FISSN SL++ A +EQ R+ N ES +
Sbjct: 121 GADPYGWQFGWKKMDKKAKKMERFISSNASLYQETEILADLEQTFKRMKSN-ESATDNLL 179
Query: 174 AYQKIYRWKRQEVKRLKNISLWTRTYDYTINLLARSLFTIYRKINHVFGIHEMVHLGG-S 232
YQK WKR EVK L+++SLW RTYDYT+ LL RS+FTI + HVFGI V S
Sbjct: 180 EYQKKVTWKRHEVKNLRDVSLWNRTYDYTVILLVRSVFTILSRTKHVFGISYRVEASDVS 239
Query: 233 VLNSDLTYRSQSISALLQYLSH 254
+SD RS S+S +L +SH
Sbjct: 240 SADSDFIGRSHSVSTILTPVSH 261
Score = 209 bits (532), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 101/174 (58%), Positives = 129/174 (74%), Gaps = 1/174 (0%)
Query: 249 LQYLSHPLQNILPPKTLGAAALALHYANIIIVIEKLSASPHSISLEAREDLYNMLPRCLR 308
L S P + P TLG A LALHYAN+IIVIE+ ASPH I +AR+DLYNMLP +R
Sbjct: 417 LTLPSRPKLSDAAPNTLGTACLALHYANVIIVIERFVASPHLIGDDARDDLYNMLPASVR 476
Query: 309 ASLKAKLKPCTKTMASSSLYDPSL-EEWTQELSGILEWLALLAHHMISWHSEKSFNQQIF 367
SL+ +LKP +K ++SS++YDP L EWT ++GILEWL LAH+MI W SE+S+ Q
Sbjct: 477 TSLRERLKPYSKNLSSSTVYDPGLAREWTDAMAGILEWLGPLAHNMIKWQSERSYEHQSL 536
Query: 368 VSQRNVLLVQTLYFADQEKTEKVITELLVCLNYVCKYGRELNAKALAECGSFKV 421
VS+ +++L QTL+FA+Q+KTE +ITELLV LNYV ++GRELNAKAL EC S K
Sbjct: 537 VSRTHIVLAQTLFFANQQKTEAIITELLVGLNYVWRFGRELNAKALQECTSSKT 590
>AT3G23160.1 | Symbols: | Protein of unknown function (DUF668) |
chr3:8260059-8261654 REVERSE LENGTH=531
Length = 531
Score = 161 bits (407), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 84/157 (53%), Positives = 111/157 (70%), Gaps = 7/157 (4%)
Query: 264 TLGAAALALHYANIIIVIEKLSASPHSISLEAREDLYNMLPRCLRASLKAKLKPCTKTMA 323
T+G +AL+LHYAN++IV+EKL PH I EAR+DLY MLP L+ +LKA L+ K +
Sbjct: 360 TIGGSALSLHYANVVIVVEKLLKYPHLIGEEARDDLYQMLPTSLKTTLKASLRSYLKNI- 418
Query: 324 SSSLYD-PSLEEWTQELSGILEWLALLAHHMISWHSEKSFNQQIFVSQR-NVLLVQTLYF 381
S+YD P +W + + GIL WLA LAH+MI W SE++F QQ + +R NVLL+QTLYF
Sbjct: 419 --SIYDAPLAHDWKETIDGILSWLAPLAHNMIRWQSERNFEQQNQIVKRTNVLLLQTLYF 476
Query: 382 ADQEKTEKVITELLVCLNYVCKYGRELNAKALAECGS 418
AD+EKTE I +LLV LNY+C Y ++ N AL +C S
Sbjct: 477 ADREKTEAAICKLLVGLNYICHYEQQQN--ALLDCAS 511
Score = 135 bits (340), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 79/217 (36%), Positives = 131/217 (60%), Gaps = 8/217 (3%)
Query: 12 RIPQKFGADSPRISISSFEIGSLMLKLVNLWQSLSDKQIFKLRKEITNSFGIRKLVSGDE 71
+ P+ I I SFE+ ++M K ++L +SLSD +I KL+ E+ +S G+RKLVS DE
Sbjct: 31 KTPKPVSLPKQTIGILSFEVANVMSKTIHLHRSLSDTEISKLKAEVFHSEGVRKLVSSDE 90
Query: 72 YFINRLICAELHENLAHVADSVAWHGKKCSDPILKGFEKAFKKFITMGDDQYGWEFHGEV 131
+ L +E ++L+ VA V+ GKKC++P L+GFE ++ + D F +
Sbjct: 91 NHLLDLSVSEKLDDLSRVASVVSRLGKKCNEPALQGFEHVYEDIVNGAIDFRKLGFLVKD 150
Query: 132 MEKNIKKMEGFISSNGSLFEAL-----VEQNPIRLTDNGESGVLKSI-AYQKIYRWKRQE 185
ME +KKME F+++ SL+ + +EQ ++L + + +S+ A+++ W+RQ+
Sbjct: 151 MESMVKKMERFVNATCSLYCEMEVMNELEQAIVKLQRSQQHQ--ESVKAFEQKLMWQRQD 208
Query: 186 VKRLKNISLWTRTYDYTINLLARSLFTIYRKINHVFG 222
VK L++ SLW +TYD + +LAR++ TIY +I VFG
Sbjct: 209 VKSLRDGSLWNQTYDKVVEMLARTVCTIYGRIETVFG 245
>AT1G34320.1 | Symbols: | Protein of unknown function (DUF668) |
chr1:12520713-12524046 FORWARD LENGTH=657
Length = 657
Score = 110 bits (275), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 107/426 (25%), Positives = 183/426 (42%), Gaps = 83/426 (19%)
Query: 23 RISISSFEIGSLMLKLVNLWQSLSDKQIFKLRKEITNSFGIRKLVSGDEYFINRLICAEL 82
+ISI SFE+ + ++K NL SLS I L++ + S G++ L+S D + R+ A+
Sbjct: 154 KISILSFEVANTIVKGANLMHSLSKDSITHLKEVVLPSEGVQNLISKDMDELLRIAAADK 213
Query: 83 HENLAHVADSVAWHGKKCSDPILKGFEKAFKKFITMGDDQYGWEFHGEV-----MEKNIK 137
E L + V G +C DP ++ F D+ G EF + E +
Sbjct: 214 REELRIFSGEVVRFGNRCKDPQYHNLDRFF--------DRLGSEFTPQKHLKQEAETIMH 265
Query: 138 KMEGFISSNGSLFEAL---------------VEQNPIRLTDNGESGVLKSIAYQKI-YRW 181
+M F+ L+ L E+NP + GV ++A + +
Sbjct: 266 QMMSFVHFTADLYHELHALDRFEQDYQRKIQEEENP----STAQRGVGDTLAILRTELKS 321
Query: 182 KRQEVKRLKNISLWTRTYDYTINLLARSLFTIYRKINHVFGIHEMVHLGGSVLNSDLTYR 241
+++ V+ LK SLW+R + + L + ++ +I+ FG +D
Sbjct: 322 QKKHVRNLKKKSLWSRILEEVMEKLVDVVHFLHLEIHEAFG------------GADPDKP 369
Query: 242 SQSISALLQYLSHPLQNILPP---KTLGAAALALHYANIIIVIEKLSASPHSISLEARED 298
+ PP K LG+A LALHYANII I+ L + ++ R+
Sbjct: 370 AND----------------PPINHKKLGSAGLALHYANIITQIDTLVSRSSTMPASTRDA 413
Query: 299 LYNMLPRCLRASLKAKLKPCTKTMASSSLYDPSLEEWTQELSGILEWLALL------AHH 352
LY LP ++++L+++++ L P ++ E+ L+WL + AHH
Sbjct: 414 LYQGLPPSIKSALRSRIQSF---QVKEELTVPQIK---AEMEKTLQWLVPVATNTTKAHH 467
Query: 353 MISWHSE-----KSFNQQIFVSQRNVLLVQTLYFADQEKTEKVITELLVCLNYVCKYGRE 407
W E NQ+ + + +L + TL+ AD+EKTE I +L+V L+++ R
Sbjct: 468 GFGWVGEWASSGSEANQR--PAGQTILRIDTLHHADKEKTEAYILDLVVWLHHLVTQVRA 525
Query: 408 LNAKAL 413
L
Sbjct: 526 TTGYGL 531
>AT1G30755.1 | Symbols: | Protein of unknown function (DUF668) |
chr1:10905991-10908773 REVERSE LENGTH=615
Length = 615
Score = 81.6 bits (200), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 98/409 (23%), Positives = 170/409 (41%), Gaps = 61/409 (14%)
Query: 23 RISISSFEIGSLMLKLVNLWQSLSDKQIFKLRKEITNSFGIRKLVSGDEYFINRLICAEL 82
+++I +FE+ + + K L QSLS++ + ++K++ +S ++KLVS D + L ++
Sbjct: 133 KVTILAFEVANTIAKGAALLQSLSEENLKFMKKDMLHSEEVKKLVSTDTTELQILAASDK 192
Query: 83 HENLAHVADSVAWHGKKCSDPILKGFEKAFKKFITMGDDQYGWEFHGEVMEKNIKKMEGF 142
E L + V G C D ++ F K T + E + + +
Sbjct: 193 REELDLFSGEVIRFGNMCKDLQWHNLDRYFMKLDTENSQHKLLKDDAEARMQELVTLARI 252
Query: 143 ISSNGSLFEAL-----------VEQNPIRLTDNGESGVLKSIAYQKIYRWKRQEVKRLKN 191
S +AL E + L GE V+ Q + +++ VK L+
Sbjct: 253 TSELYHELQALDRFEQDYRRKLAEVESLNLPRRGEGIVI----LQNELKQQKKLVKSLQK 308
Query: 192 ISLWTRTYDYTINLLARSLFTIYRKINHVFGIHEMVHLGGSVLNSDLTYRSQSISALLQY 251
SLW++ I L + I + I VFG + G N R +
Sbjct: 309 KSLWSQNLAEIIEKLVDVVSYIRQTIVEVFGNN------GLRDNEGEQGRER-------- 354
Query: 252 LSHPLQNILPPKTLGAAALALHYANIIIVIEKLSASPHSISLEAREDLYNMLPRCLRASL 311
LG A L+LHYAN+I I+ +++ P S+ R+ LYN LP A++
Sbjct: 355 -------------LGEAGLSLHYANLIQQIDNIASRPSSLPSNVRDTLYNALP----ATV 397
Query: 312 KAKLKPCTKTMASSSLYDPSLEEWTQELSGILEWLALL------AHHMISWHSEKSFNQQ 365
K L+P +T+ + S+ E E+ L+WL AH W E + ++
Sbjct: 398 KTALRPRLQTLDQEE--ELSVPEIKAEMEKSLQWLVPFAENTTKAHQGFGWVGEWANSRI 455
Query: 366 IFVSQR-------NVLLVQTLYFADQEKTEKVITELLVCLNYVCKYGRE 407
F + N +QTL+ AD+ + + EL+V L+ + K ++
Sbjct: 456 EFGKGKGKGENNGNPTRLQTLHHADKPIVDSYVLELVVWLHRLMKSSKK 504
>AT5G08660.1 | Symbols: | Protein of unknown function (DUF668) |
chr5:2814526-2817845 FORWARD LENGTH=649
Length = 649
Score = 74.7 bits (182), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 51/151 (33%), Positives = 78/151 (51%), Gaps = 19/151 (12%)
Query: 263 KTLGAAALALHYANIIIVIEKLSASPHSISLEAREDLYNMLPRCLRASLKAKLKP--CTK 320
K LG A LALHYANII+ I+ L A SI+ AR+ LY LP ++ +L++K+K K
Sbjct: 370 KRLGPAGLALHYANIIVQIDTLVARASSITSNARDSLYQSLPPGIKLALRSKIKSFNVDK 429
Query: 321 TMASSSLYDPSLEEWTQELSGILEWLALL------AHHMISWHSEKSFNQQIFVSQR--- 371
++ + + D E+ L WL + AHH W E + F S+
Sbjct: 430 ELSVTQIKD--------EMERTLHWLVPVAGNTTKAHHGFGWVGEWANTGTDFTSKPSGG 481
Query: 372 NVLLVQTLYFADQEKTEKVITELLVCLNYVC 402
++L ++TLY A +EKTE I ++ L ++
Sbjct: 482 DILRIETLYHASKEKTEIYILGQIIWLQHLV 512