Miyakogusa Predicted Gene
- Lj1g3v4694280.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v4694280.1 Non Chatacterized Hit- tr|I1JPV9|I1JPV9_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.42611
PE,75.18,0,seg,NULL; SUBFAMILY NOT NAMED,NULL; FAMILY NOT NAMED,NULL;
DUF668,Protein of unknown function DUF668,CUFF.32912.1
(563 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT5G04550.1 | Symbols: | Protein of unknown function (DUF668) |... 354 1e-97
AT3G23160.1 | Symbols: | Protein of unknown function (DUF668) |... 270 3e-72
AT5G51670.1 | Symbols: | Protein of unknown function (DUF668) |... 156 4e-38
AT1G34320.1 | Symbols: | Protein of unknown function (DUF668) |... 78 2e-14
AT5G08660.1 | Symbols: | Protein of unknown function (DUF668) |... 67 3e-11
AT1G30755.1 | Symbols: | Protein of unknown function (DUF668) |... 67 4e-11
>AT5G04550.1 | Symbols: | Protein of unknown function (DUF668) |
chr5:1303757-1305556 REVERSE LENGTH=599
Length = 599
Score = 354 bits (908), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 220/601 (36%), Positives = 320/601 (53%), Gaps = 67/601 (11%)
Query: 10 SWLSALWPVSRKGASDNKAVVVGVLGSEVAGLMLKVVNLWQSLSDEEVLSLREGIVNSAG 69
+W LW +K V+GVL EVA L+ K+V+LWQSLSD+ V LR+ I +S G
Sbjct: 5 TWFRNLWRFPKKHDGHKDKAVLGVLAFEVASLLSKLVHLWQSLSDKNVARLRDEITHSTG 64
Query: 70 VRKLVSEDDDYLMELVLNEVLDNFQSVARTVARLGRRCVDPVYHRFEHFVRSPAQNYFQW 129
++KLVSEDDD+++ L+ +E+++N ++VA+ VARL R+C DP FE+ +
Sbjct: 65 IKKLVSEDDDFIVRLIRDEMMENVENVAKAVARLARKCNDPKLKCFENCFSDMMKTGADP 124
Query: 130 SGWDYRGXXXXXXXXXXXXFVAAMTQLCQELEVLAEVEQTLRRMQANPVLHRGKLPEFQK 189
GW + F+++ L QE E+LA++EQT +RM++N L E+QK
Sbjct: 125 YGWQFGWKKMDKKAKKMERFISSNASLYQETEILADLEQTFKRMKSNESA-TDNLLEYQK 183
Query: 190 KVTLQRQEVRNLRDMSPWNRSYDYVVRLLVRSLFTILERIILVFGNNHLATKQNENESPN 249
KVT +R EV+NLRD+S WNR+YDY V LLVRS+FTIL R VFG ++ + ++ +
Sbjct: 184 KVTWKRHEVKNLRDVSLWNRTYDYTVILLVRSVFTILSRTKHVFG---ISYRVEASDVSS 240
Query: 250 TTANNLLRSQSFSVFMHSSVHPSEND-LYGFHSGHLGR---------------------- 286
++ + RS S S + H SE+ L F SG LGR
Sbjct: 241 ADSDFIGRSHSVSTILTPVSHKSESSGLPRFASGPLGRFTGPASGSAATRSTKMSDFLSG 300
Query: 287 ----------------------------RLASNSGILVDXXXXXXXXXXALHPPALCRDR 318
++ S SG L+ P++ +
Sbjct: 301 SLSAESPKSGPLVAEKHKRFKFYSGQLGKITSKSGPLIGMGKNNKKMGQTPERPSISSVK 360
Query: 319 LNSESKQFGPIGPFKSCMSVANNSPVIDSFVQTNGGSMRLTDSH-MENVDKMKRVEKSSL 377
++ + +GPFK CM V+++ S NG + H +E+ VE +L
Sbjct: 361 KQLKANRLTQVGPFKGCM-VSHDGITPLSTRARNGARNSSAEHHILEDNSNSVHVENLTL 419
Query: 378 SNRIRIYSKLSMSNRLKPASFTLGDAALTLHYANMIVLIEQMVSSPHLTDPEPRDDLYKM 437
+R KLS A TLG A L LHYAN+I++IE+ V+SPHL + RDDLY M
Sbjct: 420 PSR----PKLS-----DAAPNTLGTACLALHYANVIIVIERFVASPHLIGDDARDDLYNM 470
Query: 438 LPATIRAVLRHKLKGRKKS-KSSWVYNAEIAAEWSAVLAQILDWLAPLAHDTIRWHSVRS 496
LPA++R LR +LK K+ SS VY+ +A EW+ +A IL+WL PLAH+ I+W S RS
Sbjct: 471 LPASVRTSLRERLKPYSKNLSSSTVYDPGLAREWTDAMAGILEWLGPLAHNMIKWQSERS 530
Query: 497 FEKEHATLKANILLVQTLYFANQVKTEAAIVDLLVGLNYVCRIDTKVGVRDRVECASARS 556
+E + + +I+L QTL+FANQ KTEA I +LLVGLNYV R ++ + EC S+++
Sbjct: 531 YEHQSLVSRTHIVLAQTLFFANQQKTEAIITELLVGLNYVWRFGRELNAKALQECTSSKT 590
Query: 557 F 557
Sbjct: 591 L 591
>AT3G23160.1 | Symbols: | Protein of unknown function (DUF668) |
chr3:8260059-8261654 REVERSE LENGTH=531
Length = 531
Score = 270 bits (689), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 174/532 (32%), Positives = 280/532 (52%), Gaps = 62/532 (11%)
Query: 30 VVGVLGSEVAGLMLKVVNLWQSLSDEEVLSLREGIVNSAGVRKLVSEDDDYLMELVLNEV 89
+G+L EVA +M K ++L +SLSD E+ L+ + +S GVRKLVS D+++L++L ++E
Sbjct: 42 TIGILSFEVANVMSKTIHLHRSLSDTEISKLKAEVFHSEGVRKLVSSDENHLLDLSVSEK 101
Query: 90 LDNFQSVARTVARLGRRCVDPVYHRFEHFVRSPAQNYFQWSGWDYRGXXXXXXXXXXXXF 149
LD+ VA V+RLG++C +P FEH + + F
Sbjct: 102 LDDLSRVASVVSRLGKKCNEPALQGFEHVYEDIVNGAIDFRKLGFLVKDMESMVKKMERF 161
Query: 150 VAAMTQLCQELEVLAEVEQTLRRMQANPVLHRGKLPEFQKKVTLQRQEVRNLRDMSPWNR 209
V A L E+EV+ E+EQ + ++Q + H+ + F++K+ QRQ+V++LRD S WN+
Sbjct: 162 VNATCSLYCEMEVMNELEQAIVKLQRSQQ-HQESVKAFEQKLMWQRQDVKSLRDGSLWNQ 220
Query: 210 SYDYVVRLLVRSLFTILERIILVFGNNHLATKQN---ENESPNTTANNLLRSQSFSVFMH 266
+YD VV +L R++ TI RI VFG L K++ + + A+ + S+S + F
Sbjct: 221 TYDKVVEMLARTVCTIYGRIETVFGGLGLRGKKDVTLKRDRSKNEASKAVNSRSVAGFKD 280
Query: 267 SSVHPSENDLYGFHSGHLGRRLASNSGILVDXXXXXXXXXXALHPPALCRDRLNSESKQF 326
S SE D + +G +N G +
Sbjct: 281 S--RRSEADEFT-RAGDFNFPCGTNPGRM------------------------------- 306
Query: 327 GPIGPFKSCMSVANNSPVIDSFVQTNGGSMRLTDSHMENVDKMKRVEKSSLSNRIRIYSK 386
F C+ A N + D + R+T + +M R K +R+ ++
Sbjct: 307 -----FMECL--AMNRTIGDDDDDDDEDGGRIT--FPLSTARMIRSNKFGFKSRLTQHAS 357
Query: 387 LSMSNRLKPASFTLGDAALTLHYANMIVLIEQMVSSPHLTDPEPRDDLYKMLPATIRAVL 446
S T+G +AL+LHYAN+++++E+++ PHL E RDDLY+MLP +++ L
Sbjct: 358 AS----------TIGGSALSLHYANVVIVVEKLLKYPHLIGEEARDDLYQMLPTSLKTTL 407
Query: 447 RHKLKGRKKSKSSWVYNAEIAAEWSAVLAQILDWLAPLAHDTIRWHSVRSFEKEHATLK- 505
+ L+ K+ S +Y+A +A +W + IL WLAPLAH+ IRW S R+FE+++ +K
Sbjct: 408 KASLRSYLKNIS--IYDAPLAHDWKETIDGILSWLAPLAHNMIRWQSERNFEQQNQIVKR 465
Query: 506 ANILLVQTLYFANQVKTEAAIVDLLVGLNYVCRIDTKVGVRDRVECASARSF 557
N+LL+QTLYFA++ KTEAAI LLVGLNY+C + + ++CAS+ +
Sbjct: 466 TNVLLLQTLYFADREKTEAAICKLLVGLNYICHYEQQQNA--LLDCASSFDY 515
>AT5G51670.1 | Symbols: | Protein of unknown function (DUF668) |
chr5:20993560-20995096 FORWARD LENGTH=474
Length = 474
Score = 156 bits (394), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 78/161 (48%), Positives = 107/161 (66%), Gaps = 8/161 (4%)
Query: 390 SNRLKPASFTLGDAALTLHYANMIVLIEQMVSSPHLTDPEPRDDLYKMLPATIRAVLRHK 449
S LKP TLG A + LHYAN+IV++E+M+ P L + RDDLY MLPA++R+ LR +
Sbjct: 305 SRLLKPPETTLGGAGVALHYANLIVVMEKMIKQPQLVGLDARDDLYSMLPASVRSSLRSR 364
Query: 450 LKGRKKSKSSWVYNAEIAAEWSAVLAQILDWLAPLAHDTIRWHSVRSFEKEHATLKAN-- 507
LKG + + + +A EW A L +IL WL PLA + IRW S RSFE++H N
Sbjct: 365 LKGVGFTAT----DGGLATEWKAALGRILRWLLPLAQNMIRWQSERSFEQQHMATATNSQ 420
Query: 508 --ILLVQTLYFANQVKTEAAIVDLLVGLNYVCRIDTKVGVR 546
++LVQTL FA++VKTEAAI +LLVGLNY+ R + ++ +
Sbjct: 421 NRVMLVQTLVFADKVKTEAAITELLVGLNYIWRFEREMTAK 461
Score = 103 bits (256), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 68/224 (30%), Positives = 111/224 (49%), Gaps = 26/224 (11%)
Query: 31 VGVLGSEVAGLMLKVVNLWQSLSDEEVLSLREGIVNSAGVRKLVSEDDDYLMELVLNEVL 90
VGVL EVA +M K+++L SL+D +L+ R+ ++ G+ K+V+ D+ + + LV E+
Sbjct: 37 VGVLSFEVARVMTKLLHLTHSLTDSNLLTPRDHSLSLEGLTKIVNGDETFHLSLVCAELA 96
Query: 91 DNFQSVARTVARLGRRCVDP-------VYHRFEHFVRSPAQNYFQWSGWDYRGXXXXXXX 143
D+ A +V+RL RC ++H F R P GW
Sbjct: 97 DSLAHAANSVSRLSNRCTTASLRSFHRLFHEFADMGRDP-------HGWVMNCKDTEAKN 149
Query: 144 XXXXXFVAAMTQLCQELEVLAEVEQTLRRMQANPVLHRG------------KLPEFQKKV 191
+V+ T L +E+E +A +E +LR+ + K+ + Q K+
Sbjct: 150 KKIERYVSVTTALYREMEEMAILENSLRKQSLQIGIEFEEEEDYENKKDVMKVIDLQNKI 209
Query: 192 TLQRQEVRNLRDMSPWNRSYDYVVRLLVRSLFTILERIILVFGN 235
Q+Q V+ L+D S WN+S+D VV +L RS+FT L R+ VF +
Sbjct: 210 ERQKQHVKYLKDRSLWNKSFDTVVLILARSVFTALARLKSVFSS 253
>AT1G34320.1 | Symbols: | Protein of unknown function (DUF668) |
chr1:12520713-12524046 FORWARD LENGTH=657
Length = 657
Score = 78.2 bits (191), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 60/219 (27%), Positives = 97/219 (44%), Gaps = 10/219 (4%)
Query: 23 ASDNKAVVVGVLGSEVAGLMLKVVNLWQSLSDEEVLSLREGIVNSAGVRKLVSEDDDYLM 82
A+ K + +L EVA ++K NL SLS + + L+E ++ S GV+ L+S+D D L+
Sbjct: 147 ATTVKGNKISILSFEVANTIVKGANLMHSLSKDSITHLKEVVLPSEGVQNLISKDMDELL 206
Query: 83 ELVLNEVLDNFQSVARTVARLGRRCVDPVYHRFEHFVRSPAQNYFQWSGWDYRGXXXXXX 142
+ + + + + V R G RC DP YH + F +++ +
Sbjct: 207 RIAAADKREELRIFSGEVVRFGNRCKDPQYHNLDRFFDRLGS---EFTPQKHLKQEAETI 263
Query: 143 XXXXXXFVAAMTQLCQELEVLAEVEQTLRRM---QANP-VLHRG---KLPEFQKKVTLQR 195
FV L EL L EQ +R + NP RG L + ++ Q+
Sbjct: 264 MHQMMSFVHFTADLYHELHALDRFEQDYQRKIQEEENPSTAQRGVGDTLAILRTELKSQK 323
Query: 196 QEVRNLRDMSPWNRSYDYVVRLLVRSLFTILERIILVFG 234
+ VRNL+ S W+R + V+ LV + + I FG
Sbjct: 324 KHVRNLKKKSLWSRILEEVMEKLVDVVHFLHLEIHEAFG 362
Score = 75.5 bits (184), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 52/164 (31%), Positives = 80/164 (48%), Gaps = 18/164 (10%)
Query: 400 LGDAALTLHYANMIVLIEQMVSSPHLTDPEPRDDLYKMLPATIRAVLRHKLKGRKKSKSS 459
LG A L LHYAN+I I+ +VS RD LY+ LP +I++ LR ++ +S
Sbjct: 380 LGSAGLALHYANIITQIDTLVSRSSTMPASTRDALYQGLPPSIKSALRSRI------QSF 433
Query: 460 WVYNAEIAAEWSAVLAQILDWLAPLAHDTIRWH---------SVRSFEKEHATLKANILL 510
V + A + + L WL P+A +T + H + E IL
Sbjct: 434 QVKEELTVPQIKAEMEKTLQWLVPVATNTTKAHHGFGWVGEWASSGSEANQRPAGQTILR 493
Query: 511 VQTLYFANQVKTEAAIVDLLVGLNYV---CRIDTKVGVRDRVEC 551
+ TL+ A++ KTEA I+DL+V L+++ R T G+R V+
Sbjct: 494 IDTLHHADKEKTEAYILDLVVWLHHLVTQVRATTGYGLRSPVKS 537
>AT5G08660.1 | Symbols: | Protein of unknown function (DUF668) |
chr5:2814526-2817845 FORWARD LENGTH=649
Length = 649
Score = 67.0 bits (162), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 77/159 (48%), Gaps = 18/159 (11%)
Query: 400 LGDAALTLHYANMIVLIEQMVSSPHLTDPEPRDDLYKMLPATIRAVLRHKLKGRKKSKSS 459
LG A L LHYAN+IV I+ +V+ RD LY+ LP I+ LR K+K K
Sbjct: 372 LGPAGLALHYANIIVQIDTLVARASSITSNARDSLYQSLPPGIKLALRSKIKSFNVDKEL 431
Query: 460 WVYNAEIAAEWSAVLAQILDWLAPLAHDTIRWHSVRSFEKEHATL---------KANILL 510
V +I E + + L WL P+A +T + H + E A +IL
Sbjct: 432 SV--TQIKDE----MERTLHWLVPVAGNTTKAHHGFGWVGEWANTGTDFTSKPSGGDILR 485
Query: 511 VQTLYFANQVKTEAAIVDLLVGLNYV---CRIDTKVGVR 546
++TLY A++ KTE I+ ++ L ++ + D + G R
Sbjct: 486 IETLYHASKEKTEIYILGQIIWLQHLVTKAKSDARGGPR 524
Score = 56.6 bits (135), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 57/210 (27%), Positives = 95/210 (45%), Gaps = 8/210 (3%)
Query: 31 VGVLGSEVAGLMLKVVNLWQSLSDEEVLSLREGIVNSAGVRKLVSEDDDYLMELVLNEVL 90
+G+L EVA ++K NL +SLS + L+ I+ S GV+ LVS D D L+ LV +
Sbjct: 149 LGILAFEVANTIVKSSNLIESLSKRNIEHLKGTILYSEGVQNLVSNDFDELLRLVAADKR 208
Query: 91 DNFQSVARTVARLGRRCVDPVYHRFEHFVRSPAQNYFQWSGWDYRGXXXXXXXXXXXXFV 150
Q + V R G R D +H + + ++ +
Sbjct: 209 QELQVFSGEVVRFGNRSKDFQWHNLQRYFDRISKELTPQRQLKEDAVLVVDQLMVLVQYT 268
Query: 151 AAMTQLCQELEVL----AEVEQTLRRMQANPVLHRGK-LPEFQKKVTLQRQEVRNLRDMS 205
A +L QEL+VL + EQ R + + +G L + ++ QR+ V++L+ S
Sbjct: 269 A---ELYQELQVLYRLEKDYEQKRREEENSANSSKGDGLAILKTELKAQRKVVKSLKKKS 325
Query: 206 PWNRSYDYVVRLLVRSLFTILERIILVFGN 235
W+R ++ V+ LV + +L I +FG
Sbjct: 326 LWSRGFEEVMEKLVDIVHFLLLEIHNIFGG 355
>AT1G30755.1 | Symbols: | Protein of unknown function (DUF668) |
chr1:10905991-10908773 REVERSE LENGTH=615
Length = 615
Score = 66.6 bits (161), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 52/177 (29%), Positives = 84/177 (47%), Gaps = 32/177 (18%)
Query: 400 LGDAALTLHYANMIVLIEQMVSSPHLTDPEPRDDLYKMLPATIRAVLRHKLKGRKKSKSS 459
LG+A L+LHYAN+I I+ + S P RD LY LPAT++ LR +L+ + +
Sbjct: 355 LGEAGLSLHYANLIQQIDNIASRPSSLPSNVRDTLYNALPATVKTALRPRLQTLDQEEEL 414
Query: 460 WVYNAEIAAEWSAVLAQILDWLAPLAHDTIRWHSVRSFEKEHATLK-------------A 506
V EI AE + + L WL P A +T + H + E A +
Sbjct: 415 SV--PEIKAE----MEKSLQWLVPFAENTTKAHQGFGWVGEWANSRIEFGKGKGKGENNG 468
Query: 507 NILLVQTLYFANQVKTEAAIVDLLVGLNYVCRIDTKVGVRDRVECASARSFHGVGLR 563
N +QTL+ A++ ++ +++L+V L+ R+ +S + HGV L+
Sbjct: 469 NPTRLQTLHHADKPIVDSYVLELVVWLH-------------RLMKSSKKRAHGVKLQ 512
Score = 66.2 bits (160), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 61/231 (26%), Positives = 107/231 (46%), Gaps = 12/231 (5%)
Query: 22 GASDNKAVVVGVLGSEVAGLMLKVVNLWQSLSDEEVLSLREGIVNSAGVRKLVSEDDDYL 81
G + ++ V +L EVA + K L QSLS+E + +++ +++S V+KLVS D L
Sbjct: 125 GVTSSRGGKVTILAFEVANTIAKGAALLQSLSEENLKFMKKDMLHSEEVKKLVSTDTTEL 184
Query: 82 MELVLNEVLDNFQSVARTVARLGRRCVDPVYHRFE-HFVRSPAQNYFQWSGWDYRGXXXX 140
L ++ + + V R G C D +H + +F++ +N S
Sbjct: 185 QILAASDKREELDLFSGEVIRFGNMCKDLQWHNLDRYFMKLDTEN----SQHKLLKDDAE 240
Query: 141 XXXXXXXXFVAAMTQLCQELEVLAEVEQTLRRMQAN----PVLHRGK-LPEFQKKVTLQR 195
++L EL+ L EQ RR A + RG+ + Q ++ Q+
Sbjct: 241 ARMQELVTLARITSELYHELQALDRFEQDYRRKLAEVESLNLPRRGEGIVILQNELKQQK 300
Query: 196 QEVRNLRDMSPWNRSYDYVVRLLVRSLFTILERIILVFGNNHLATKQNENE 246
+ V++L+ S W+++ ++ LV + I + I+ VFGNN L + NE E
Sbjct: 301 KLVKSLQKKSLWSQNLAEIIEKLVDVVSYIRQTIVEVFGNNGL--RDNEGE 349