Miyakogusa Predicted Gene
- Lj0g3v0082249.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0082249.1 Non Chatacterized Hit- tr|K4CQS4|K4CQS4_SOLLC
Uncharacterized protein OS=Solanum lycopersicum
GN=Sol,48,4e-18,DUF668,Protein of unknown function DUF668;
DUF3475,Protein of unknown function DUF3475; SUBFAMILY NO,CUFF.4286.1
(603 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT3G23160.1 | Symbols: | Protein of unknown function (DUF668) |... 587 e-168
AT5G04550.1 | Symbols: | Protein of unknown function (DUF668) |... 190 2e-48
AT5G51670.1 | Symbols: | Protein of unknown function (DUF668) |... 174 1e-43
AT1G34320.1 | Symbols: | Protein of unknown function (DUF668) |... 82 1e-15
AT5G08660.1 | Symbols: | Protein of unknown function (DUF668) |... 72 1e-12
AT1G30755.1 | Symbols: | Protein of unknown function (DUF668) |... 71 2e-12
>AT3G23160.1 | Symbols: | Protein of unknown function (DUF668) |
chr3:8260059-8261654 REVERSE LENGTH=531
Length = 531
Score = 587 bits (1513), Expect = e-168, Method: Compositional matrix adjust.
Identities = 320/606 (52%), Positives = 397/606 (65%), Gaps = 78/606 (12%)
Query: 1 MVAEAWIVKMGRQVSANLKHALLLEPSAKRKQGPKRQDNKVMIEILSFEVANVMSKIVHL 60
MV+EAWIVKM QVS+NLKHALLLE S+ +K K I ILSFEVANVMSK +HL
Sbjct: 1 MVSEAWIVKMRNQVSSNLKHALLLESSSTKKTPKPVSLPKQTIGILSFEVANVMSKTIHL 60
Query: 61 HKSLSESEITKLRNEILNSEGVRNLVSSXXXXXXXXXXXXXXXXXNRVASVVSRLGKKCS 120
H+SLS++EI+KL+ E+ +SEGVR LVSS +RVASVVSRLGKKC+
Sbjct: 61 HRSLSDTEISKLKAEVFHSEGVRKLVSSDENHLLDLSVSEKLDDLSRVASVVSRLGKKCN 120
Query: 121 VPALQGFEHVYXXXXXXXXXXKELGFLVKHMEGMVRKLDRYVSTTRNLYREMEVLNQLEQ 180
PALQGFEHVY ++LGFLVK ME MV+K++R+V+ T +LY EMEV+N+LEQ
Sbjct: 121 EPALQGFEHVYEDIVNGAIDFRKLGFLVKDMESMVKKMERFVNATCSLYCEMEVMNELEQ 180
Query: 181 AVKKLQHS-QHEESRRAFEQKLAWQKQDVRHLKEISLWNQTFDKVVELLARTVCTIYARI 239
A+ KLQ S QH+ES +AFEQKL WQ+QDV+ L++ SLWNQT+DKVVE+LARTVCTIY RI
Sbjct: 181 AIVKLQRSQQHQESVKAFEQKLMWQRQDVKSLRDGSLWNQTYDKVVEMLARTVCTIYGRI 240
Query: 240 CMIFGDSAGRKHSLGLGGGSPPMQNECGLMSPGIDFQMTSDKLKRNRSKKNGYQLSSTRR 299
+FG LGL G LKR+RSK + ++R
Sbjct: 241 ETVFG-------GLGLRGKKDVT-------------------LKRDRSKNEASKAVNSRS 274
Query: 300 VAAETRGAISRTHIDLRREEL-AYLQFEDLGFPCGTSPGRLFMECLSLSSSVSNFXXXXX 358
VA D RR E + + D FPCGT+PGR+FMECL+++ ++ +
Sbjct: 275 VAG---------FKDSRRSEADEFTRAGDFNFPCGTNPGRMFMECLAMNRTIGDDDDDDD 325
Query: 359 XXXYVIDHEEQFCQIPGIGNSVKKRETMCLSGGLNHAQTVVPSTAGDPRQVRSGVQSCSA 418
I T STA R +RS +
Sbjct: 326 EDGGRI--------------------------------TFPLSTA---RMIRS-----NK 345
Query: 419 FGPKSGLAVYAPPSTLGGCALALHYANVIIVIEKLLSYSHLVGDEARDDLYQMLPTSLRL 478
FG KS L +A ST+GG AL+LHYANV+IV+EKLL Y HL+G+EARDDLYQMLPTSL+
Sbjct: 346 FGFKSRLTQHASASTIGGSALSLHYANVVIVVEKLLKYPHLIGEEARDDLYQMLPTSLKT 405
Query: 479 SLKAKLRTYVKNLAIYDAPLAHEWKVKLDSILRWLSPLAHSMIRWQSERNFE-QHQIVSK 537
+LKA LR+Y+KN++IYDAPLAH+WK +D IL WL+PLAH+MIRWQSERNFE Q+QIV +
Sbjct: 406 TLKASLRSYLKNISIYDAPLAHDWKETIDGILSWLAPLAHNMIRWQSERNFEQQNQIVKR 465
Query: 538 TNVLLFQTLYFADRGKTEDAMCELLIGLNYICRYEQQQNALLGCASSFDFEDCMRRQLRC 597
TNVLL QTLYFADR KTE A+C+LL+GLNYIC YEQQQNALL CASSFD+EDC Q +C
Sbjct: 466 TNVLLLQTLYFADREKTEAAICKLLVGLNYICHYEQQQNALLDCASSFDYEDCFEWQSQC 525
Query: 598 GASFLN 603
A++L+
Sbjct: 526 RAAYLD 531
>AT5G04550.1 | Symbols: | Protein of unknown function (DUF668) |
chr5:1303757-1305556 REVERSE LENGTH=599
Length = 599
Score = 190 bits (483), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 90/168 (53%), Positives = 122/168 (72%), Gaps = 5/168 (2%)
Query: 429 APPSTLGGCALALHYANVIIVIEKLLSYSHLVGDEARDDLYQMLPTSLRLSLKAKLRTYV 488
A P+TLG LALHYANVIIVIE+ ++ HL+GD+ARDDLY MLP S+R SL+ +L+ Y
Sbjct: 428 AAPNTLGTACLALHYANVIIVIERFVASPHLIGDDARDDLYNMLPASVRTSLRERLKPYS 487
Query: 489 KNLA---IYDAPLAHEWKVKLDSILRWLSPLAHSMIRWQSERNFEQHQIVSKTNVLLFQT 545
KNL+ +YD LA EW + IL WL PLAH+MI+WQSER++E +VS+T+++L QT
Sbjct: 488 KNLSSSTVYDPGLAREWTDAMAGILEWLGPLAHNMIKWQSERSYEHQSLVSRTHIVLAQT 547
Query: 546 LYFADRGKTEDAMCELLIGLNYICRYEQQQN--ALLGCASSFDFEDCM 591
L+FA++ KTE + ELL+GLNY+ R+ ++ N AL C SS E C+
Sbjct: 548 LFFANQQKTEAIITELLVGLNYVWRFGRELNAKALQECTSSKTLEKCL 595
Score = 132 bits (333), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 77/239 (32%), Positives = 136/239 (56%), Gaps = 19/239 (7%)
Query: 34 PKRQD---NKVMIEILSFEVANVMSKIVHLHKSLSESEITKLRNEILNSEGVRNLVSSXX 90
PK+ D +K ++ +L+FEVA+++SK+VHL +SLS+ + +LR+EI +S G++ LVS
Sbjct: 14 PKKHDGHKDKAVLGVLAFEVASLLSKLVHLWQSLSDKNVARLRDEITHSTGIKKLVSEDD 73
Query: 91 XXXXXXXXXXXXXXXNRVASVVSRLGKKCSVPALQGFEHVYXXXXXXXXXXKELGFLVKH 150
VA V+RL +KC+ P L+ FE+ + F K
Sbjct: 74 DFIVRLIRDEMMENVENVAKAVARLARKCNDPKLKCFENCFSDMMKTGADPYGWQFGWKK 133
Query: 151 MEGMVRKLDRYVSTTRNLYREMEVLNQLEQAVKKLQHSQH-EESRRAFEQKLAWQKQDVR 209
M+ +K++R++S+ +LY+E E+L LEQ K+++ ++ ++ +++K+ W++ +V+
Sbjct: 134 MDKKAKKMERFISSNASLYQETEILADLEQTFKRMKSNESATDNLLEYQKKVTWKRHEVK 193
Query: 210 HLKEISLWNQTFDKVVELLARTVCTIYARICMIFG---------------DSAGRKHSL 253
+L+++SLWN+T+D V LL R+V TI +R +FG D GR HS+
Sbjct: 194 NLRDVSLWNRTYDYTVILLVRSVFTILSRTKHVFGISYRVEASDVSSADSDFIGRSHSV 252
>AT5G51670.1 | Symbols: | Protein of unknown function (DUF668) |
chr5:20993560-20995096 FORWARD LENGTH=474
Length = 474
Score = 174 bits (442), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 88/172 (51%), Positives = 115/172 (66%), Gaps = 6/172 (3%)
Query: 410 RSGVQSCSAFGPKSGLAVYAPPSTLGGCALALHYANVIIVIEKLLSYSHLVGDEARDDLY 469
R + SAF +S + P +TLGG +ALHYAN+I+V+EK++ LVG +ARDDLY
Sbjct: 291 RDKTTTSSAFLEESSRLLKPPETTLGGAGVALHYANLIVVMEKMIKQPQLVGLDARDDLY 350
Query: 470 QMLPTSLRLSLKAKLRTYVKNLAIYDAPLAHEWKVKLDSILRWLSPLAHSMIRWQSERNF 529
MLP S+R SL+++L+ D LA EWK L ILRWL PLA +MIRWQSER+F
Sbjct: 351 SMLPASVRSSLRSRLKGV--GFTATDGGLATEWKAALGRILRWLLPLAQNMIRWQSERSF 408
Query: 530 EQHQIVSKTN----VLLFQTLYFADRGKTEDAMCELLIGLNYICRYEQQQNA 577
EQ + + TN V+L QTL FAD+ KTE A+ ELL+GLNYI R+E++ A
Sbjct: 409 EQQHMATATNSQNRVMLVQTLVFADKVKTEAAITELLVGLNYIWRFEREMTA 460
Score = 111 bits (278), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 79/264 (29%), Positives = 130/264 (49%), Gaps = 24/264 (9%)
Query: 1 MVAEAWIVKMGRQVSANLKHALLLEPSAKR--KQGPKRQDNKVMIEILSFEVANVMSKIV 58
M E +++K LK+A+ +P+++R + P + +LSFEVA VM+K++
Sbjct: 1 MALETFLIK--------LKNAISSKPTSRRPHRSSPPISTTTSSVGVLSFEVARVMTKLL 52
Query: 59 HLHKSLSESEITKLRNEILNSEGVRNLVSSXXXXXXXXXXXXXXXXXNRVASVVSRLGKK 118
HL SL++S + R+ L+ EG+ +V+ A+ VSRL +
Sbjct: 53 HLTHSLTDSNLLTPRDHSLSLEGLTKIVNGDETFHLSLVCAELADSLAHAANSVSRLSNR 112
Query: 119 CSVPALQGFEHVYXXXXXXXXXXKELGFLVKHMEGMVRKLDRYVSTTRNLYREMEVLNQL 178
C+ +L+ F ++ K E +K++RYVS T LYREME + L
Sbjct: 113 CTTASLRSFHRLFHEFADMGRDPHGWVMNCKDTEAKNKKIERYVSVTTALYREMEEMAIL 172
Query: 179 EQAVKK--LQ------------HSQHEESRRAFEQKLAWQKQDVRHLKEISLWNQTFDKV 224
E +++K LQ + + + K+ QKQ V++LK+ SLWN++FD V
Sbjct: 173 ENSLRKQSLQIGIEFEEEEDYENKKDVMKVIDLQNKIERQKQHVKYLKDRSLWNKSFDTV 232
Query: 225 VELLARTVCTIYARICMIFGDSAG 248
V +LAR+V T AR+ +F +A
Sbjct: 233 VLILARSVFTALARLKSVFSSAAA 256
>AT1G34320.1 | Symbols: | Protein of unknown function (DUF668) |
chr1:12520713-12524046 FORWARD LENGTH=657
Length = 657
Score = 82.0 bits (201), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 69/240 (28%), Positives = 109/240 (45%), Gaps = 18/240 (7%)
Query: 43 IEILSFEVANVMSKIVHLHKSLSESEITKLRNEILNSEGVRNLVSSXXXXXXXXXXXXXX 102
I ILSFEVAN + K +L SLS+ IT L+ +L SEGV+NL+S
Sbjct: 155 ISILSFEVANTIVKGANLMHSLSKDSITHLKEVVLPSEGVQNLISKDMDELLRIAAADKR 214
Query: 103 XXXNRVASVVSRLGKKCSVPALQGFEHVYXXXXXXXXXXKELGFLVKHMEGMVRKLDRYV 162
+ V R G +C P + + K L + E ++ ++ +V
Sbjct: 215 EELRIFSGEVVRFGNRCKDPQYHNLDRFFDRLGSEFTPQKHLK---QEAETIMHQMMSFV 271
Query: 163 STTRNLYREMEVLNQLEQAV-KKLQHSQHEES-RRAFEQKLAW-------QKQDVRHLKE 213
T +LY E+ L++ EQ +K+Q ++ + +R LA QK+ VR+LK+
Sbjct: 272 HFTADLYHELHALDRFEQDYQRKIQEEENPSTAQRGVGDTLAILRTELKSQKKHVRNLKK 331
Query: 214 ISLWNQTFDKVVELLARTVCTIYARICMIFGDSAGRKHSLGLGGGSPPMQNECGLMSPGI 273
SLW++ ++V+E L V ++ I FG + K PP+ N L S G+
Sbjct: 332 KSLWSRILEEVMEKLVDVVHFLHLEIHEAFGGADPDK-----PANDPPI-NHKKLGSAGL 385
Score = 75.5 bits (184), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 48/152 (31%), Positives = 78/152 (51%), Gaps = 16/152 (10%)
Query: 430 PP---STLGGCALALHYANVIIVIEKLLSYSHLVGDEARDDLYQMLPTSLRLSLKAKLRT 486
PP LG LALHYAN+I I+ L+S S + RD LYQ LP S+K+ LR+
Sbjct: 373 PPINHKKLGSAGLALHYANIITQIDTLVSRSSTMPASTRDALYQGLPP----SIKSALRS 428
Query: 487 YVKNLAIYDAPLAHEWKVKLDSILRWLSPL------AHSMIRWQSE---RNFEQHQIVSK 537
+++ + + + K +++ L+WL P+ AH W E E +Q +
Sbjct: 429 RIQSFQVKEELTVPQIKAEMEKTLQWLVPVATNTTKAHHGFGWVGEWASSGSEANQRPAG 488
Query: 538 TNVLLFQTLYFADRGKTEDAMCELLIGLNYIC 569
+L TL+ AD+ KTE + +L++ L+++
Sbjct: 489 QTILRIDTLHHADKEKTEAYILDLVVWLHHLV 520
>AT5G08660.1 | Symbols: | Protein of unknown function (DUF668) |
chr5:2814526-2817845 FORWARD LENGTH=649
Length = 649
Score = 72.0 bits (175), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 50/159 (31%), Positives = 81/159 (50%), Gaps = 19/159 (11%)
Query: 422 KSGLAVYAPPSTLGGCALALHYANVIIVIEKLLSYSHLVGDEARDDLYQMLPTSLRLSLK 481
K G A Y LG LALHYAN+I+ I+ L++ + + ARD LYQ LP ++L+L+
Sbjct: 362 KKGAAEY--DKRLGPAGLALHYANIIVQIDTLVARASSITSNARDSLYQSLPPGIKLALR 419
Query: 482 AKLRTY--VKNLAIYDAPLAHEWKVKLDSILRWLSPL------AHSMIRWQSERNFEQHQ 533
+K++++ K L++ + K +++ L WL P+ AH W E
Sbjct: 420 SKIKSFNVDKELSV------TQIKDEMERTLHWLVPVAGNTTKAHHGFGWVGEWANTGTD 473
Query: 534 IVSKT---NVLLFQTLYFADRGKTEDAMCELLIGLNYIC 569
SK ++L +TLY A + KTE + +I L ++
Sbjct: 474 FTSKPSGGDILRIETLYHASKEKTEIYILGQIIWLQHLV 512
Score = 54.3 bits (129), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 66/213 (30%), Positives = 95/213 (44%), Gaps = 22/213 (10%)
Query: 45 ILSFEVANVMSKIVHLHKSLSESEITKLRNEILNSEGVRNLVSSXXXXXXXXXXXXXXXX 104
IL+FEVAN + K +L +SLS+ I L+ IL SEGV+NLVS+
Sbjct: 151 ILAFEVANTIVKSSNLIESLSKRNIEHLKGTILYSEGVQNLVSNDFDELLRLVAADKRQE 210
Query: 105 XNRVASVVSRLG---KKCSVPALQGFEHVYXXXXXXXXXXKELGFLVKHMEGMVRKLDRY 161
+ V R G K LQ + KE LV V +L
Sbjct: 211 LQVFSGEVVRFGNRSKDFQWHNLQRYFDRISKELTPQRQLKEDAVLV------VDQLMVL 264
Query: 162 VSTTRNLYREMEVLNQLEQAVKKLQHSQHEESRRA----------FEQKLAWQKQDVRHL 211
V T LY+E++VL +LE K + + EE A + +L Q++ V+ L
Sbjct: 265 VQYTAELYQELQVLYRLE---KDYEQKRREEENSANSSKGDGLAILKTELKAQRKVVKSL 321
Query: 212 KEISLWNQTFDKVVELLARTVCTIYARICMIFG 244
K+ SLW++ F++V+E L V + I IFG
Sbjct: 322 KKKSLWSRGFEEVMEKLVDIVHFLLLEIHNIFG 354
>AT1G30755.1 | Symbols: | Protein of unknown function (DUF668) |
chr1:10905991-10908773 REVERSE LENGTH=615
Length = 615
Score = 70.9 bits (172), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 55/216 (25%), Positives = 95/216 (43%), Gaps = 10/216 (4%)
Query: 43 IEILSFEVANVMSKIVHLHKSLSESEITKLRNEILNSEGVRNLVSSXXXXXXXXXXXXXX 102
+ IL+FEVAN ++K L +SLSE + ++ ++L+SE V+ LVS+
Sbjct: 134 VTILAFEVANTIAKGAALLQSLSEENLKFMKKDMLHSEEVKKLVSTDTTELQILAASDKR 193
Query: 103 XXXNRVASVVSRLGKKCSVPALQGFEHVYXXXXXXXXXXKELGFLVKHMEGMVRKLDRYV 162
+ + V R G C + + K L E +++L
Sbjct: 194 EELDLFSGEVIRFGNMCKDLQWHNLDRYFMKLDTENSQHK---LLKDDAEARMQELVTLA 250
Query: 163 STTRNLYREMEVLNQLEQ-------AVKKLQHSQHEESRRAFEQKLAWQKQDVRHLKEIS 215
T LY E++ L++ EQ V+ L + E + +L QK+ V+ L++ S
Sbjct: 251 RITSELYHELQALDRFEQDYRRKLAEVESLNLPRRGEGIVILQNELKQQKKLVKSLQKKS 310
Query: 216 LWNQTFDKVVELLARTVCTIYARICMIFGDSAGRKH 251
LW+Q +++E L V I I +FG++ R +
Sbjct: 311 LWSQNLAEIIEKLVDVVSYIRQTIVEVFGNNGLRDN 346
Score = 53.9 bits (128), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 39/155 (25%), Positives = 75/155 (48%), Gaps = 17/155 (10%)
Query: 434 LGGCALALHYANVIIVIEKLLSYSHLVGDEARDDLYQMLPTSLRLSLKAKLRTYVKNLAI 493
LG L+LHYAN+I I+ + S + RD LY LP +++ +L+ +L+T + +
Sbjct: 355 LGEAGLSLHYANLIQQIDNIASRPSSLPSNVRDTLYNALPATVKTALRPRLQTLDQEEEL 414
Query: 494 YDAPLAHEWKVKLDSILRWLSPLAHSMIR----------WQSER-NFEQHQIVSKT--NV 540
E K +++ L+WL P A + + W + R F + + + N
Sbjct: 415 S----VPEIKAEMEKSLQWLVPFAENTTKAHQGFGWVGEWANSRIEFGKGKGKGENNGNP 470
Query: 541 LLFQTLYFADRGKTEDAMCELLIGLNYICRYEQQQ 575
QTL+ AD+ + + EL++ L+ + + +++
Sbjct: 471 TRLQTLHHADKPIVDSYVLELVVWLHRLMKSSKKR 505