Miyakogusa Predicted Gene
- Lj3g3v0641400.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj3g3v0641400.1 Non Chatacterized Hit- tr|I1KJC7|I1KJC7_SOYBN
Uncharacterized protein OS=Glycine max PE=4
SV=1,87.5,0,DUF1005,Protein of unknown function DUF1005; SUBFAMILY NOT
NAMED,NULL; FAMILY NOT NAMED,NULL;
seg,NU,NODE_61152_length_1741_cov_33.342907.path1.1
(430 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT5G17640.1 | Symbols: | Protein of unknown function (DUF1005) ... 585 e-167
AT1G10020.1 | Symbols: | Protein of unknown function (DUF1005) ... 295 3e-80
AT4G29310.1 | Symbols: | Protein of unknown function (DUF1005) ... 286 2e-77
AT3G19680.1 | Symbols: | Protein of unknown function (DUF1005) ... 237 2e-62
AT1G50040.1 | Symbols: | Protein of unknown function (DUF1005) ... 223 2e-58
>AT5G17640.1 | Symbols: | Protein of unknown function (DUF1005) |
chr5:5811047-5812429 REVERSE LENGTH=432
Length = 432
Score = 585 bits (1507), Expect = e-167, Method: Compositional matrix adjust.
Identities = 298/421 (70%), Positives = 344/421 (81%), Gaps = 7/421 (1%)
Query: 1 MDPQAFIRLSIGSLGLRCTGIEL-STGKSGVHR-FSSSYVCEIRLRGFPVQTSSVPLITS 58
MDPQAFIRLS+GSL LR + + ST KS + FSS CEI+LRGFPVQT+S+PL+ S
Sbjct: 1 MDPQAFIRLSVGSLALRIPKVLINSTSKSNEKKNFSSQCSCEIKLRGFPVQTTSIPLMPS 60
Query: 59 SEVIPDTQNIASSFYLEESDLKALLAPGCFYNSNACLEIAVFSGRKGSSCGVSIKRQQIG 118
+ PD +I++SFYLEESDL+ALL PGCFY+ +A LEI+VF+G+K +CGV KRQQIG
Sbjct: 61 LDAAPDHHSISTSFYLEESDLRALLTPGCFYSPHAHLEISVFTGKKSLNCGVGGKRQQIG 120
Query: 119 IFKMQVGPEWGQGKPLILFNGWIGIGKNKQENGKLGAELHLKVKLDPDPRFVFQFEDVTT 178
+FK++VGPEWG+GKP+ILFNGWI IGK K++ AELHLKVKLDPDPR+VFQFEDVTT
Sbjct: 121 MFKLEVGPEWGEGKPMILFNGWISIGKTKRDGA---AELHLKVKLDPDPRYVFQFEDVTT 177
Query: 179 LSPQIVQLQGSIKQPIFSCKFSKDRVSHIDPLSTYWSDSMDISDFETERRERKGWKVKIH 238
LSPQIVQL+GS+KQPIFSCKFS+DRVS +DPL+ YWS S D ++ E+ERRERKGWKVKIH
Sbjct: 178 LSPQIVQLRGSVKQPIFSCKFSRDRVSQVDPLNGYWSSSGDGTELESERRERKGWKVKIH 237
Query: 239 DLSGSAVAAAFITTPFVPSSGCDWVARSNPGAWLIVRPDVGRSESWQPWGKLEAWRERGL 298
DLSGSAVAAAFITTPFVPS+GCDWVA+SNPGAWL+VRPD R SWQPWGKLEAWRERG+
Sbjct: 238 DLSGSAVAAAFITTPFVPSTGCDWVAKSNPGAWLVVRPDPSRPNSWQPWGKLEAWRERGI 297
Query: 299 RDSVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTEKHMRNXXXXXXXXXXXXGDF 358
RDSVCCRFHLLS E G++LMSEI I+AEKGGEF IDT+K M GDF
Sbjct: 298 RDSVCCRFHLLSNGLEVGDVLMSEILISAEKGGEFLIDTDKQMLTVAATPIPSPQSSGDF 357
Query: 359 GALSPVV--GGFVMCCRVQGEGKRGKPLVQLALRHVTCVEDAAIFMALAAAVDLSIEACR 416
L V GGFVM RVQGEGK KP+VQLA+RHVTCVEDAAIFMALAAAVDLSI AC+
Sbjct: 358 SGLGQCVSGGGFVMSSRVQGEGKSSKPVVQLAMRHVTCVEDAAIFMALAAAVDLSILACK 417
Query: 417 P 417
P
Sbjct: 418 P 418
>AT1G10020.1 | Symbols: | Protein of unknown function (DUF1005) |
chr1:3269939-3271732 REVERSE LENGTH=461
Length = 461
Score = 295 bits (756), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 187/464 (40%), Positives = 271/464 (58%), Gaps = 43/464 (9%)
Query: 1 MDPQAFIRLSIGSLGLRCTGIELSTGKSGVHRFSSSYVCEIRLRGFPVQTSSVPLIT-SS 59
MDP FIRL+IG+L L+ + T S VH SS C+I+L+ FP QT+++P I +
Sbjct: 1 MDPCPFIRLTIGNLALKVP-LAAKTTSSVVHPSSSPCFCKIKLKNFPPQTAAIPYIPLET 59
Query: 60 EVIPDTQNIASSFYLEESDLKALLAPGCFYNSNACLEIAVFSGRKGSSCGVSIKRQQIGI 119
P+ Q +A++F+L SD++ L A + S CL+I +++GR G++CGV R +
Sbjct: 60 TQFPEIQTLAATFHLSSSDIQRL-ASRSIFTSKPCLKILIYTGRAGAACGVHSGRL-LAK 117
Query: 120 FKMQVGPEWGQGKPLILFNGWIGIGKNKQENGKLGAELHLKVKLDPDPRFVFQFEDVTTL 179
+ + Q KP + NGWI +GK ++ A+ HL VK +PDPRFVFQF+
Sbjct: 118 VSVPLDLSGTQSKPCVFHNGWISVGKGAGKSSS-SAQFHLNVKAEPDPRFVFQFDGEPEC 176
Query: 180 SPQIVQLQGSIKQPIFSCKFS----KDRVSHIDPLSTYWSDSMD-ISDFETER----RER 230
SPQ+VQ+QG+I+QP+F+CKFS DR L T S S ++ F +ER +ER
Sbjct: 177 SPQVVQIQGNIRQPVFTCKFSCRHTGDRTQRSRSLPTETSVSRSWLNSFGSERERPGKER 236
Query: 231 KGWKVKIHDLSGSAVAAAFITTPFVPSSGCDWVARSNPGAWLIVRPDVGRSESWQPWGKL 290
KGW + +HDLSGS VA A I TPFV S G D V+RSNPG+WLI+RP +W+PWG+L
Sbjct: 237 KGWSITVHDLSGSPVAMASIVTPFVASPGTDRVSRSNPGSWLILRPG---DCTWRPWGRL 293
Query: 291 EAWRER-GLRDSVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFID------------- 336
EAWRER G D + RF L+ + G ++++E I++ +GG+F I+
Sbjct: 294 EAWRERGGATDGLGYRFELIPDGSSGAGIVLAESTISSHRGGKFSIELGSSPSSSSPTSV 353
Query: 337 ---------TEKHMRNXXXXXXXXXXXXGDFG-ALSP--VVGGFVMCCRVQGEGKRGKPL 384
GD+G L P V GFVM V+GEGK KP
Sbjct: 354 VNRSRSRRGGSSGSGGGASPANSPRGGSGDYGYGLWPWNVYKGFVMSASVEGEGKCSKPC 413
Query: 385 VQLALRHVTCVEDAAIFMALAAAVDLSIEACRPFRRKIRRGKQH 428
V+++++HV+C+EDAA ++AL+AA+DLS++ACR F +++R+ H
Sbjct: 414 VEVSVQHVSCMEDAAAYVALSAAIDLSMDACRLFNQRMRKELCH 457
>AT4G29310.1 | Symbols: | Protein of unknown function (DUF1005) |
chr4:14437892-14439609 REVERSE LENGTH=424
Length = 424
Score = 286 bits (732), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 175/438 (39%), Positives = 251/438 (57%), Gaps = 39/438 (8%)
Query: 1 MDPQAFIRLSIGSLGLRCTGIELSTGKS---GVHRFSSSYVCEIRLRGFPVQTSSVPLIT 57
MDP F+RL+I SL LR E +T K VH S+ C++R++ FP Q + +PL +
Sbjct: 1 MDPCPFVRLTIDSLALRLP--ETATNKQIGGEVHPSSTPCYCKLRIKHFPSQKALLPLSS 58
Query: 58 SSEVI--PDTQNIASSFYLEESDLKALLAPGCFYNSNACLEIAVFSGRKGSSCGVSIKRQ 115
S+ P++ A F+L+ ++ + L ++V++GR G +CGV+ +
Sbjct: 59 FSDASSPPESSTSAPGFHLDADAIRRI------SGKKISLRVSVYAGRTGHTCGVA-SGK 111
Query: 116 QIGIFKMQVGPEWGQGKPLILFNGWIGIGKNKQENGKLGAELHLKVKLDPDPRFVFQFED 175
+G ++ V + + NGW +G + K A LHL V +PDPRFVFQF
Sbjct: 112 LLGKVEVAVDLAAALSRTVAFHNGWKKLGGD---GDKPSARLHLLVCAEPDPRFVFQFGG 168
Query: 176 VTTLSPQIVQLQGSIKQPIFSCKFSKDRVSHIDPLS---TYWSD---SMDISDFETER-- 227
SP + Q+Q ++KQP+FSCKFS DR L TY S + +S + E+
Sbjct: 169 EPECSPVVYQIQDNLKQPVFSCKFSSDRNGRSRSLPSGFTYSSRGWITRTLSGDQWEKKQ 228
Query: 228 -RERKGWKVKIHDLSGSAVAAAFITTPFVPSSGCDWVARSNPGAWLIVRPDVGRSESWQP 286
RERKGW + IHDLSGS VAAA + TPFV S G D V+RSNPGAWLI+RP SW+P
Sbjct: 229 ARERKGWMITIHDLSGSPVAAASMITPFVASPGSDRVSRSNPGAWLILRPHGTCVSSWKP 288
Query: 287 WGKLEAWRERGLRDSVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDTEKHMRNXXX 346
WG+LEAWRERG D + +F L+ + + ++E ++ ++GG+F ID
Sbjct: 289 WGRLEAWRERGAIDGLGYKFELVRDNSTSTGIPIAEGTMSTKQGGKFSIDRR-------- 340
Query: 347 XXXXXXXXXGDFGALSPVVGGFVMCCRVQGEGKRGKPLVQLALRHVTCVEDAAIFMALAA 406
G+ A+S V GFVM V+GEGK KP+V + +HVTC+ DAA+F+AL+A
Sbjct: 341 -----VSGQGESPAISSPVKGFVMGSSVEGEGKVSKPVVHVGAQHVTCMADAALFVALSA 395
Query: 407 AVDLSIEACRPFRRKIRR 424
AVDLS++AC+ F RK+R+
Sbjct: 396 AVDLSVDACQLFSRKLRK 413
>AT3G19680.1 | Symbols: | Protein of unknown function (DUF1005) |
chr3:6840448-6842107 FORWARD LENGTH=491
Length = 491
Score = 237 bits (604), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 171/491 (34%), Positives = 258/491 (52%), Gaps = 78/491 (15%)
Query: 1 MDPQAFIRLSIGSLGLR------CTGIELSTGKSGVHRFSSSYVCEIRLRGFPVQTSSVP 54
MDP +F+R+ +G+L +R + SG++ + + C+IR + FP + SVP
Sbjct: 1 MDPCSFVRIIVGNLAVRFPSSSSSSSSSSGPSVSGINPTAPNCYCKIRFKNFPREIVSVP 60
Query: 55 LITSSEVIPDTQ--------NIASSFYLEESDLKALLAPGCFYNSNACLEIAVFS----- 101
++ +E +T+ +A+ F L ++ ++A L F + L + +S
Sbjct: 61 VMFRTESESETRCSSSGNVSTVAACFSLSKAQIEASLKKPKF----SVLSVEAYSRGNSD 116
Query: 102 ---GRKGSSCGVSIKRQQI-GIFKMQVGPEWGQGKPLILFNGWIGIGKNKQENGKLGA-- 155
G G+SCG++ +++ G F++ + + + K + NGW+ + +K+ K G+
Sbjct: 117 GDDGVSGASCGLATAGEKLLGRFEVSLDLKSAETKSFLAHNGWVAL-PSKKTKSKTGSDP 175
Query: 156 ELHLKVKLDPDPRFVFQFEDVTTLSPQIVQLQGSIKQPIFSCKF------SKDR-VSHID 208
ELH+ V+++PDPRFVFQF+ SPQ+ Q+QG+ KQ +F+CKF S DR + H
Sbjct: 176 ELHVSVRVEPDPRFVFQFDGEPECSPQVFQVQGNTKQAVFTCKFGSRNSNSGDRNLLHSS 235
Query: 209 PLSTYWSDSMD-ISDFETER----RERKGWKVKIHDLSGSAVAAAFITTPFVPSSGCDWV 263
+ + S + IS +E+ +ERKGW + +HDLSGS VA A + TPFVPS G + V
Sbjct: 236 SMMSEISSTRSCISSMNSEKEQPSKERKGWSITVHDLSGSPVAMASMVTPFVPSPGSNRV 295
Query: 264 ARSNPGAWLIVRPDVGRSESWQPWGKLEAWRERGLRDSVCCRFHLLSEAQEG-GELLMSE 322
RS+PGAWLI+RPD +W+PWG+LEAWRE G D++ RF L Q+G + +
Sbjct: 296 TRSSPGAWLILRPD---GCTWKPWGRLEAWREAGYSDTLGYRFELF---QDGIATAVSAS 349
Query: 323 IHINAEKGGEFFIDTEKHMR----------------------NXXXXXXXXXXXXGDFGA 360
I+ + GG F ID DFG
Sbjct: 350 SSISLKNGGSFVIDVTGGTSTTASTPTTSPQGSWDLGSGSSAGSRPASRPGSGSGSDFGY 409
Query: 361 LSP-------VVGGFVMCCRVQGEGKRGKPLVQLALRHVTCVEDAAIFMALAAAVDLSIE 413
L P GFVM V+G GKR KP V++ + HVTC EDAA +ALAAAVDLS++
Sbjct: 410 LLPQHPSAAAQNRGFVMSATVEGVGKRSKPEVEVGVTHVTCTEDAAAHVALAAAVDLSLD 469
Query: 414 ACRPFRRKIRR 424
ACR F K+R+
Sbjct: 470 ACRLFSHKLRK 480
>AT1G50040.1 | Symbols: | Protein of unknown function (DUF1005) |
chr1:18542236-18543823 FORWARD LENGTH=460
Length = 460
Score = 223 bits (568), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 167/465 (35%), Positives = 250/465 (53%), Gaps = 59/465 (12%)
Query: 1 MDPQAFIRLSIGSLGLR-------CTGIELSTGKSGVHRFSSSYVCEIRLRGFPVQTSSV 53
MDP +F+R+ +G+L +R + S+G S S + C+I+ + FP Q SV
Sbjct: 1 MDPCSFVRIIVGNLAVRFPRSPSSSSSSSSSSGPSVSDVSSGNCYCKIKFKSFPRQIVSV 60
Query: 54 PLITSSEVIPDTQ-------NIASSFYLEESDLKALLAPGCFYNSNACLEIAVFSGRKGS 106
P++ +E +++ +A+ F L +S ++ L + + L + V+S R+ +
Sbjct: 61 PVLLRTESESESRCCSGNVSTVAACFSLSKSQIETSLKKAKW----SVLSVEVYS-RRSA 115
Query: 107 SCG--VSIKRQQIGIFKMQVGPEWGQGKPLILFNGWIGIGKNKQENGKLGA--ELHLKVK 162
SCG + + IG F++ + + + K + NGW+ +G + N K G+ ELH+ V+
Sbjct: 116 SCGFVAASGEKLIGRFQVTLDLKAAESKTCLAHNGWVDLGTKSKNNKKSGSDPELHVSVR 175
Query: 163 LDPDPRFVFQFEDVTTLSPQIVQLQGSIKQPIFSCKF----SKDRVSHIDPLSTYWSDSM 218
++PD RFVFQF+ SPQ+ Q+QG+ KQ +F+CKF S DR + S S
Sbjct: 176 VEPDTRFVFQFDGEPECSPQVFQVQGNAKQAVFTCKFGFRNSGDR-------NLSLSLSS 228
Query: 219 DISDFETERRERKGWKVKIHDLSGSAVAAAFITTPFVPSSGCDWVARSNPGAWLIVRPDV 278
S E +ERKGW + IHDLSGS VA A + TPFVPS G + V+RS+PGAWLI+RPD
Sbjct: 229 VTSGKEQFSKERKGWSITIHDLSGSPVAMASMVTPFVPSPGSNRVSRSSPGAWLILRPD- 287
Query: 279 GRSESWQPWGKLEAWRERGLRDSVCCRFHLLSEAQEGGELLMSEIHINAEKGGEFFIDT- 337
+W+PW +L+AWRE G+ D + RF L + + + I+ + GG F ID
Sbjct: 288 --GYTWKPWVRLQAWREPGVSDVLGYRFELYKDGIA--VAVSASSSISTKLGGSFIIDGS 343
Query: 338 ----------------EKHMRNXXXXXXXXXXXXGDFG-ALSPVVG--GFVMCCRVQGEG 378
+ + DF +LS GFVM RVQG
Sbjct: 344 TSTTTTASWSSSEGSFDLSSWSSIRSSRTDSGSGSDFRFSLSQAQQNLGFVMSTRVQGVE 403
Query: 379 KRGKPLVQLALRHVTCVEDAAIFMALAAAVDLSIEACRPFRRKIR 423
K+ KP V++ ++HVTC EDAA +ALAAAVDLS++ACR F +K+R
Sbjct: 404 KQSKPKVEVGVKHVTCTEDAAAHVALAAAVDLSMDACRLFSQKLR 448