Miyakogusa Predicted Gene

Lj0g3v0248759.4
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj0g3v0248759.4 Non Chatacterized Hit- tr|I1JLE4|I1JLE4_SOYBN
Uncharacterized protein OS=Glycine max PE=4
SV=1,80.91,0,DUF3453,Protein of unknown function DUF3453;
SYMPLEKIN,NULL; FAMILY NOT NAMED,NULL; ARM repeat,Armad,CUFF.16295.4
         (895 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G01400.1 | Symbols: ESP4 | HEAT repeat-containing protein | c...   758   0.0  
AT1G27595.1 | Symbols:  | CONTAINS InterPro DOMAIN/s: Symplekin ...   135   1e-31
AT1G27590.1 | Symbols:  | CONTAINS InterPro DOMAIN/s: Protein of...   110   6e-24
AT1G27570.1 | Symbols:  | phosphatidylinositol 3- and 4-kinase f...    79   2e-14

>AT5G01400.1 | Symbols: ESP4 | HEAT repeat-containing protein |
           chr5:162803-171072 REVERSE LENGTH=1467
          Length = 1467

 Score =  758 bits (1957), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 424/885 (47%), Positives = 556/885 (62%), Gaps = 56/885 (6%)

Query: 11  SRERLASLVNSANMAIDIPSKLESLRQLRLELPQEDPVXXXXXXXXXXXXXSDHFSPVRK 70
           SR RL  L NSA  A ++P KL+ LR +R +L ++D V             SD F  VRK
Sbjct: 5   SRARLKDLANSAKSATELPPKLQRLRYMRRDLQKDDSVFPTELLPHLFDLLSDQFGAVRK 64

Query: 71  FVTEMVGEIGLKNTEFLSXXXXXXXXXXXXXXXXXXRQAILCGIDLFRSTLQKLAIQGIY 130
           FV E++GEIGLK  E +                   RQ I CG DLFRSTL+++A+QG++
Sbjct: 65  FVAEILGEIGLKYVELIPEIVPLLIKSLEDETPAVARQVIACGADLFRSTLERVAVQGLH 124

Query: 131 SSDLDGALESAWSWMVRFKDKVYTLAFQHGSSGAKLLALKFVEAVIHLYTPDPNGSSEPT 190
           SS+L+  LES+W+W+++FKD++ ++AF+ G+SG KL A+KFVEA+I LYTP         
Sbjct: 125 SSELNDLLESSWTWLIKFKDEICSVAFKQGNSGVKLCAMKFVEALILLYTP--------- 175

Query: 191 PHQGRNMEFNISWLRRGHPILNVGDLSIEASHSLGLLLDQLRFPTVKSLSNAVIIVVIKS 250
            H+G   +FNIS LR GHP+L +GDLSIEAS  LGLLLDQLR P  KSL+++ IIV+I S
Sbjct: 176 -HEGIEADFNISILRGGHPVLKIGDLSIEASQKLGLLLDQLRHPAAKSLNSSTIIVLINS 234

Query: 251 LSAIAINRPAFYGRIXXXXXXXXXXXXXXNGVCVTAVHLALKNAFLSCSKCTHPSAAPWR 310
           LS++A  RPA+ GRI               GV   A +LALK  FLSC KCTHP+AAP  
Sbjct: 235 LSSVAKKRPAYCGRILPVLLSLDPLSFL-KGVYAAATNLALKTVFLSCLKCTHPAAAP-- 291

Query: 311 DRLAGALKEMQSEGKADRVFHLISPSNGSIQREK--DDQPVIKEEEPAVNSSDSVHIKLA 368
           DRL  ALKE++  G+A +   L   +NGSIQ +   +D  V  EE P   SSD     L+
Sbjct: 292 DRLTSALKEIEGGGQAAKAKDLFYKTNGSIQDKDSVEDTKVSVEENPLCASSDVAESNLS 351

Query: 369 RKRSGSQDRGDLSVDEDVPGKRVRTIVGLTEPKKE----LDDCAVNSQDDAHSTPTSSKG 424
           RKRSGS+   DL+ D    GKR R    ++E   +     D  ++       + P+ S+G
Sbjct: 352 RKRSGSEYNIDLNGDAS-DGKRARITPSVSEESTDGLNGNDGVSLPRVASTSTGPSDSRG 410

Query: 425 DVENGPVQQLVAMFGALVAQGEKAVXXXXXXXXXXXXXXXXXVVMANMRYLPPN-YPHAE 483
             ++GP QQLV +FG LV+QGEKA+                 VVMANM  +PPN   +A+
Sbjct: 411 VSDSGPAQQLVGLFGTLVSQGEKAIGSLEILISSISADLLTDVVMANMHNIPPNCSSYAD 470

Query: 484 GNDEQLHEICIFGSDDKAKYPPSFVAGVMSLSTTFPPVASLLETHRSVSNDVVKSHEEEE 543
           G DE +  +CI GSD + KYPPSFVAGV+SLST FPP+A+L+  H          +E+EE
Sbjct: 471 GTDELVMNMCIVGSDAQIKYPPSFVAGVLSLSTAFPPIAALINPH----------NEDEE 520

Query: 544 ISATGVGSAVVHSGMNLSAVNVSSPPDF-PSSDACIPGVENGSTAMASDIHDVGNLESGI 602
           + +  V   +        A +  +PP    + D   P  E  +T    ++H +GN ESGI
Sbjct: 521 VYSVHVDQQM------FPAEDARTPPGLLATCDTSFPENEESNTVSPQNVHYIGNRESGI 574

Query: 603 PGLDSCGRSDXXXXXXXXXXXXXXXVQLEDADQEQVTSLDQRSSLNLVPSISTDRSEELS 662
           PGL+S  + D                 +E A + Q  S   +  ++++PS+S D+ EE S
Sbjct: 575 PGLESSAQHDGSGALVTNVLSS---TNVEAASKNQNASFSGKLLVDVIPSMSVDKLEEFS 631

Query: 663 PKGVVTDANSIVSSTATSAVLPSRLVLPKMIAPVVDLADEQKDDLQMSCFMRIIDALKQI 722
           PK V T A++            S+ VLPK+ APVVDL+DE+KD LQ   F+RI++A KQI
Sbjct: 632 PKAVGTVASA------------SQFVLPKISAPVVDLSDEEKDSLQKLVFLRIVEAYKQI 679

Query: 723 AEAGGSKVRFSILAYLGVEFPLDLDPWKLLQKHILIDYTSHEGHELTLRVLYRLFGEAEE 782
           + +GGS++RFS+LA+LGVEFP +LDPWK+LQ+H+L DY +HEGHELT+RVLYRL+GEAE 
Sbjct: 680 SMSGGSQLRFSLLAHLGVEFPSELDPWKILQEHVLSDYLNHEGHELTVRVLYRLYGEAEA 739

Query: 783 ETDFFSSTTAASVYERFLLTVAEALRDSFPPSDKSLSKLLGESPYLPKSVLKILENMCSP 842
           E DFFSSTTAAS YE FLLTVAEALRDSFPPSDKSLSKLLG+SP+LPKSVL +LE+ C P
Sbjct: 740 EQDFFSSTTAASAYESFLLTVAEALRDSFPPSDKSLSKLLGDSPHLPKSVLMLLESFCCP 799

Query: 843 GNGDKVEKESLSFNADRVTQGLSAVWSLILLRPPIRDTCLKIALQ 887
           G+G+ VEK+    + DRVTQGLSAVWSLIL+RP IR+ CL IALQ
Sbjct: 800 GSGE-VEKDLQ--HGDRVTQGLSAVWSLILMRPGIRNDCLNIALQ 841


>AT1G27595.1 | Symbols:  | CONTAINS InterPro DOMAIN/s: Symplekin
           tight junction protein C-terminal (InterPro:IPR022075);
           BEST Arabidopsis thaliana protein match is: HEAT
           repeat-containing protein (TAIR:AT5G01400.1); Has 593
           Blast hits to 297 proteins in 111 species: Archae - 0;
           Bacteria - 3; Metazoa - 139; Fungi - 49; Plants - 57;
           Viruses - 0; Other Eukaryotes - 345 (source: NCBI
           BLink). | chr1:9596475-9602917 FORWARD LENGTH=961
          Length = 961

 Score =  135 bits (340), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 78/192 (40%), Positives = 115/192 (59%), Gaps = 5/192 (2%)

Query: 697 VDLADEQKDDLQMSCFMRIIDALKQIAEAGGSKVRFSILAYLGVEFPLDLDPWKLLQKHI 756
           V+L  EQ   +      RII++ + +     +K+R +++A L        D   +L++ I
Sbjct: 341 VELTTEQSKTVGKLAIERIIESNRHVFGFDCNKIRMALIARLIARIDAGSDVATILRELI 400

Query: 757 LIDYTSHEGHELTLRVLYRLFGEAEEETDFFSSTTAASVYERFLLTVAEALRDSFPPSDK 816
            +D+   +GH+L L VLY L   A  +TD   S+  A+VYE FL++VA +  D+ P SDK
Sbjct: 401 SVDHREFKGHDLVLHVLYHLHSMAILDTD--ESSFYATVYENFLISVARSFLDALPASDK 458

Query: 817 SLSKLLGESPYLPKSVLKILENMCSPGNGDKVEKESLSFNADRVTQGLSAVWSLILLRPP 876
           S S+L GE+P+LP S + +L+ +CS    D V KE    +++RVTQGL AVWSLIL+RP 
Sbjct: 459 SFSRLFGEAPHLPDSAINLLDELCS-TRHDPVGKEVC--DSERVTQGLGAVWSLILVRPN 515

Query: 877 IRDTCLKIALQC 888
            R  CL IAL+C
Sbjct: 516 ERKACLAIALKC 527


>AT1G27590.1 | Symbols:  | CONTAINS InterPro DOMAIN/s: Protein of
           unknown function DUF3453 (InterPro:IPR021850); BEST
           Arabidopsis thaliana protein match is:
           phosphatidylinositol 3- and 4-kinase family protein
           (TAIR:AT1G27570.1); Has 109 Blast hits to 109 proteins
           in 38 species: Archae - 0; Bacteria - 0; Metazoa - 65;
           Fungi - 4; Plants - 38; Viruses - 0; Other Eukaryotes -
           2 (source: NCBI BLink). | chr1:9592293-9593912 FORWARD
           LENGTH=264
          Length = 264

 Score =  110 bits (274), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 71/222 (31%), Positives = 110/222 (49%), Gaps = 3/222 (1%)

Query: 27  DIPSKLESLRQLRLELPQEDPVXXXXXXXXXXXXXSDHFSPVRKFVTEMVGEIGLKNTEF 86
           D+  KL SLRQ++  L   +P                    VRK + E++ E+GL+  + 
Sbjct: 25  DLAVKLSSLRQVKEILLSLEPSLSAEIFPYLAELHLSREILVRKSLIEIIEEVGLRMLDH 84

Query: 87  LSXXXXXXXXXXXXXXXXXXRQAILCGIDLFRSTLQKLAIQGIYSSDLDGALESAWSWMV 146
                               +++I  G   F + L+++A+Q  +   +D      W+WMV
Sbjct: 85  SYVLVSVLLVLSRDEDPTVAKKSISVGTTFFCTILEEMAMQFHHRGKVDRWCGELWTWMV 144

Query: 147 RFKDKVYTLAFQHGSSGAKLLALKFVEAVIHLYTPDPNGSSEPTPHQGRNMEFNISWLRR 206
           +FKD V+  A + G  G K+LALKF+E  I L+TPD +   + +    R+M FNISWL  
Sbjct: 145 KFKDTVFATALEPGCVGVKVLALKFMETFILLFTPDASDPEKASSEGSRHM-FNISWLAG 203

Query: 207 GHPILNVGDLSIEASHSLGLLLDQLRFPTVKSLSNAVIIVVI 248
           GHPILN   L  EA+ + G+L+D ++  +   L  A+ I VI
Sbjct: 204 GHPILNPATLMSEANRTFGILVDFIQ--SANRLPGALTISVI 243


>AT1G27570.1 | Symbols:  | phosphatidylinositol 3- and 4-kinase
           family protein | chr1:9575538-9582376 FORWARD LENGTH=649
          Length = 649

 Score = 78.6 bits (192), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 65/209 (31%), Positives = 97/209 (46%), Gaps = 15/209 (7%)

Query: 68  VRKFVTEMVGEIGLKNTEFLSXXXXXXXXXXXXXXXXXXRQAILCGIDLFRSTLQKLAIQ 127
           VR+ + E++ E+GL+  E                      ++I  G   FRS L+K+  Q
Sbjct: 66  VRRSLIEIIEEVGLRMLEHSYVLVSVLIHLVGDNDPTVAEKSISTGTTFFRSILEKMETQ 125

Query: 128 GIYSSDLDGALESAWSWMVRFKDKVYTLAFQHGSS---GAKLLALKFVEAVIHLYTP--- 181
             +   +D    + W+ M+ FKD V+ +A         G K+LALKF+E  I L TP   
Sbjct: 126 FHHRGKVDRWCVNLWTLMLMFKDAVFNIALDLEPGRVVGVKVLALKFMETFILLITPHAS 185

Query: 182 DPNGSSEPTPHQGRNMEFNISWLRRGHPILNVGDLSIEASHSLGLLLDQLRFPTVKSLSN 241
           DP   S  T  +G     NIS L  G P+LN+  L  E + +L  L   L+ PT+  + +
Sbjct: 186 DPEKVS--TSSEGSRQMINISSLAAGLPMLNLTGLMSEVNQTLVRLGSFLQAPTL--IQD 241

Query: 242 AVIIVVIK-----SLSAIAINRPAFYGRI 265
           A+ I VI      SL+ +A  RP  Y  +
Sbjct: 242 ALPIAVIDCSLSFSLAVVARKRPVHYDTV 270