Miyakogusa Predicted Gene
- Lj0g3v0248759.4
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0248759.4 Non Chatacterized Hit- tr|I1JLE4|I1JLE4_SOYBN
Uncharacterized protein OS=Glycine max PE=4
SV=1,80.91,0,DUF3453,Protein of unknown function DUF3453;
SYMPLEKIN,NULL; FAMILY NOT NAMED,NULL; ARM repeat,Armad,CUFF.16295.4
(895 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT5G01400.1 | Symbols: ESP4 | HEAT repeat-containing protein | c... 758 0.0
AT1G27595.1 | Symbols: | CONTAINS InterPro DOMAIN/s: Symplekin ... 135 1e-31
AT1G27590.1 | Symbols: | CONTAINS InterPro DOMAIN/s: Protein of... 110 6e-24
AT1G27570.1 | Symbols: | phosphatidylinositol 3- and 4-kinase f... 79 2e-14
>AT5G01400.1 | Symbols: ESP4 | HEAT repeat-containing protein |
chr5:162803-171072 REVERSE LENGTH=1467
Length = 1467
Score = 758 bits (1957), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 424/885 (47%), Positives = 556/885 (62%), Gaps = 56/885 (6%)
Query: 11 SRERLASLVNSANMAIDIPSKLESLRQLRLELPQEDPVXXXXXXXXXXXXXSDHFSPVRK 70
SR RL L NSA A ++P KL+ LR +R +L ++D V SD F VRK
Sbjct: 5 SRARLKDLANSAKSATELPPKLQRLRYMRRDLQKDDSVFPTELLPHLFDLLSDQFGAVRK 64
Query: 71 FVTEMVGEIGLKNTEFLSXXXXXXXXXXXXXXXXXXRQAILCGIDLFRSTLQKLAIQGIY 130
FV E++GEIGLK E + RQ I CG DLFRSTL+++A+QG++
Sbjct: 65 FVAEILGEIGLKYVELIPEIVPLLIKSLEDETPAVARQVIACGADLFRSTLERVAVQGLH 124
Query: 131 SSDLDGALESAWSWMVRFKDKVYTLAFQHGSSGAKLLALKFVEAVIHLYTPDPNGSSEPT 190
SS+L+ LES+W+W+++FKD++ ++AF+ G+SG KL A+KFVEA+I LYTP
Sbjct: 125 SSELNDLLESSWTWLIKFKDEICSVAFKQGNSGVKLCAMKFVEALILLYTP--------- 175
Query: 191 PHQGRNMEFNISWLRRGHPILNVGDLSIEASHSLGLLLDQLRFPTVKSLSNAVIIVVIKS 250
H+G +FNIS LR GHP+L +GDLSIEAS LGLLLDQLR P KSL+++ IIV+I S
Sbjct: 176 -HEGIEADFNISILRGGHPVLKIGDLSIEASQKLGLLLDQLRHPAAKSLNSSTIIVLINS 234
Query: 251 LSAIAINRPAFYGRIXXXXXXXXXXXXXXNGVCVTAVHLALKNAFLSCSKCTHPSAAPWR 310
LS++A RPA+ GRI GV A +LALK FLSC KCTHP+AAP
Sbjct: 235 LSSVAKKRPAYCGRILPVLLSLDPLSFL-KGVYAAATNLALKTVFLSCLKCTHPAAAP-- 291
Query: 311 DRLAGALKEMQSEGKADRVFHLISPSNGSIQREK--DDQPVIKEEEPAVNSSDSVHIKLA 368
DRL ALKE++ G+A + L +NGSIQ + +D V EE P SSD L+
Sbjct: 292 DRLTSALKEIEGGGQAAKAKDLFYKTNGSIQDKDSVEDTKVSVEENPLCASSDVAESNLS 351
Query: 369 RKRSGSQDRGDLSVDEDVPGKRVRTIVGLTEPKKE----LDDCAVNSQDDAHSTPTSSKG 424
RKRSGS+ DL+ D GKR R ++E + D ++ + P+ S+G
Sbjct: 352 RKRSGSEYNIDLNGDAS-DGKRARITPSVSEESTDGLNGNDGVSLPRVASTSTGPSDSRG 410
Query: 425 DVENGPVQQLVAMFGALVAQGEKAVXXXXXXXXXXXXXXXXXVVMANMRYLPPN-YPHAE 483
++GP QQLV +FG LV+QGEKA+ VVMANM +PPN +A+
Sbjct: 411 VSDSGPAQQLVGLFGTLVSQGEKAIGSLEILISSISADLLTDVVMANMHNIPPNCSSYAD 470
Query: 484 GNDEQLHEICIFGSDDKAKYPPSFVAGVMSLSTTFPPVASLLETHRSVSNDVVKSHEEEE 543
G DE + +CI GSD + KYPPSFVAGV+SLST FPP+A+L+ H +E+EE
Sbjct: 471 GTDELVMNMCIVGSDAQIKYPPSFVAGVLSLSTAFPPIAALINPH----------NEDEE 520
Query: 544 ISATGVGSAVVHSGMNLSAVNVSSPPDF-PSSDACIPGVENGSTAMASDIHDVGNLESGI 602
+ + V + A + +PP + D P E +T ++H +GN ESGI
Sbjct: 521 VYSVHVDQQM------FPAEDARTPPGLLATCDTSFPENEESNTVSPQNVHYIGNRESGI 574
Query: 603 PGLDSCGRSDXXXXXXXXXXXXXXXVQLEDADQEQVTSLDQRSSLNLVPSISTDRSEELS 662
PGL+S + D +E A + Q S + ++++PS+S D+ EE S
Sbjct: 575 PGLESSAQHDGSGALVTNVLSS---TNVEAASKNQNASFSGKLLVDVIPSMSVDKLEEFS 631
Query: 663 PKGVVTDANSIVSSTATSAVLPSRLVLPKMIAPVVDLADEQKDDLQMSCFMRIIDALKQI 722
PK V T A++ S+ VLPK+ APVVDL+DE+KD LQ F+RI++A KQI
Sbjct: 632 PKAVGTVASA------------SQFVLPKISAPVVDLSDEEKDSLQKLVFLRIVEAYKQI 679
Query: 723 AEAGGSKVRFSILAYLGVEFPLDLDPWKLLQKHILIDYTSHEGHELTLRVLYRLFGEAEE 782
+ +GGS++RFS+LA+LGVEFP +LDPWK+LQ+H+L DY +HEGHELT+RVLYRL+GEAE
Sbjct: 680 SMSGGSQLRFSLLAHLGVEFPSELDPWKILQEHVLSDYLNHEGHELTVRVLYRLYGEAEA 739
Query: 783 ETDFFSSTTAASVYERFLLTVAEALRDSFPPSDKSLSKLLGESPYLPKSVLKILENMCSP 842
E DFFSSTTAAS YE FLLTVAEALRDSFPPSDKSLSKLLG+SP+LPKSVL +LE+ C P
Sbjct: 740 EQDFFSSTTAASAYESFLLTVAEALRDSFPPSDKSLSKLLGDSPHLPKSVLMLLESFCCP 799
Query: 843 GNGDKVEKESLSFNADRVTQGLSAVWSLILLRPPIRDTCLKIALQ 887
G+G+ VEK+ + DRVTQGLSAVWSLIL+RP IR+ CL IALQ
Sbjct: 800 GSGE-VEKDLQ--HGDRVTQGLSAVWSLILMRPGIRNDCLNIALQ 841
>AT1G27595.1 | Symbols: | CONTAINS InterPro DOMAIN/s: Symplekin
tight junction protein C-terminal (InterPro:IPR022075);
BEST Arabidopsis thaliana protein match is: HEAT
repeat-containing protein (TAIR:AT5G01400.1); Has 593
Blast hits to 297 proteins in 111 species: Archae - 0;
Bacteria - 3; Metazoa - 139; Fungi - 49; Plants - 57;
Viruses - 0; Other Eukaryotes - 345 (source: NCBI
BLink). | chr1:9596475-9602917 FORWARD LENGTH=961
Length = 961
Score = 135 bits (340), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 78/192 (40%), Positives = 115/192 (59%), Gaps = 5/192 (2%)
Query: 697 VDLADEQKDDLQMSCFMRIIDALKQIAEAGGSKVRFSILAYLGVEFPLDLDPWKLLQKHI 756
V+L EQ + RII++ + + +K+R +++A L D +L++ I
Sbjct: 341 VELTTEQSKTVGKLAIERIIESNRHVFGFDCNKIRMALIARLIARIDAGSDVATILRELI 400
Query: 757 LIDYTSHEGHELTLRVLYRLFGEAEEETDFFSSTTAASVYERFLLTVAEALRDSFPPSDK 816
+D+ +GH+L L VLY L A +TD S+ A+VYE FL++VA + D+ P SDK
Sbjct: 401 SVDHREFKGHDLVLHVLYHLHSMAILDTD--ESSFYATVYENFLISVARSFLDALPASDK 458
Query: 817 SLSKLLGESPYLPKSVLKILENMCSPGNGDKVEKESLSFNADRVTQGLSAVWSLILLRPP 876
S S+L GE+P+LP S + +L+ +CS D V KE +++RVTQGL AVWSLIL+RP
Sbjct: 459 SFSRLFGEAPHLPDSAINLLDELCS-TRHDPVGKEVC--DSERVTQGLGAVWSLILVRPN 515
Query: 877 IRDTCLKIALQC 888
R CL IAL+C
Sbjct: 516 ERKACLAIALKC 527
>AT1G27590.1 | Symbols: | CONTAINS InterPro DOMAIN/s: Protein of
unknown function DUF3453 (InterPro:IPR021850); BEST
Arabidopsis thaliana protein match is:
phosphatidylinositol 3- and 4-kinase family protein
(TAIR:AT1G27570.1); Has 109 Blast hits to 109 proteins
in 38 species: Archae - 0; Bacteria - 0; Metazoa - 65;
Fungi - 4; Plants - 38; Viruses - 0; Other Eukaryotes -
2 (source: NCBI BLink). | chr1:9592293-9593912 FORWARD
LENGTH=264
Length = 264
Score = 110 bits (274), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 71/222 (31%), Positives = 110/222 (49%), Gaps = 3/222 (1%)
Query: 27 DIPSKLESLRQLRLELPQEDPVXXXXXXXXXXXXXSDHFSPVRKFVTEMVGEIGLKNTEF 86
D+ KL SLRQ++ L +P VRK + E++ E+GL+ +
Sbjct: 25 DLAVKLSSLRQVKEILLSLEPSLSAEIFPYLAELHLSREILVRKSLIEIIEEVGLRMLDH 84
Query: 87 LSXXXXXXXXXXXXXXXXXXRQAILCGIDLFRSTLQKLAIQGIYSSDLDGALESAWSWMV 146
+++I G F + L+++A+Q + +D W+WMV
Sbjct: 85 SYVLVSVLLVLSRDEDPTVAKKSISVGTTFFCTILEEMAMQFHHRGKVDRWCGELWTWMV 144
Query: 147 RFKDKVYTLAFQHGSSGAKLLALKFVEAVIHLYTPDPNGSSEPTPHQGRNMEFNISWLRR 206
+FKD V+ A + G G K+LALKF+E I L+TPD + + + R+M FNISWL
Sbjct: 145 KFKDTVFATALEPGCVGVKVLALKFMETFILLFTPDASDPEKASSEGSRHM-FNISWLAG 203
Query: 207 GHPILNVGDLSIEASHSLGLLLDQLRFPTVKSLSNAVIIVVI 248
GHPILN L EA+ + G+L+D ++ + L A+ I VI
Sbjct: 204 GHPILNPATLMSEANRTFGILVDFIQ--SANRLPGALTISVI 243
>AT1G27570.1 | Symbols: | phosphatidylinositol 3- and 4-kinase
family protein | chr1:9575538-9582376 FORWARD LENGTH=649
Length = 649
Score = 78.6 bits (192), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 65/209 (31%), Positives = 97/209 (46%), Gaps = 15/209 (7%)
Query: 68 VRKFVTEMVGEIGLKNTEFLSXXXXXXXXXXXXXXXXXXRQAILCGIDLFRSTLQKLAIQ 127
VR+ + E++ E+GL+ E ++I G FRS L+K+ Q
Sbjct: 66 VRRSLIEIIEEVGLRMLEHSYVLVSVLIHLVGDNDPTVAEKSISTGTTFFRSILEKMETQ 125
Query: 128 GIYSSDLDGALESAWSWMVRFKDKVYTLAFQHGSS---GAKLLALKFVEAVIHLYTP--- 181
+ +D + W+ M+ FKD V+ +A G K+LALKF+E I L TP
Sbjct: 126 FHHRGKVDRWCVNLWTLMLMFKDAVFNIALDLEPGRVVGVKVLALKFMETFILLITPHAS 185
Query: 182 DPNGSSEPTPHQGRNMEFNISWLRRGHPILNVGDLSIEASHSLGLLLDQLRFPTVKSLSN 241
DP S T +G NIS L G P+LN+ L E + +L L L+ PT+ + +
Sbjct: 186 DPEKVS--TSSEGSRQMINISSLAAGLPMLNLTGLMSEVNQTLVRLGSFLQAPTL--IQD 241
Query: 242 AVIIVVIK-----SLSAIAINRPAFYGRI 265
A+ I VI SL+ +A RP Y +
Sbjct: 242 ALPIAVIDCSLSFSLAVVARKRPVHYDTV 270