Miyakogusa Predicted Gene

Lj1g3v4752850.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj1g3v4752850.1 Non Characterized Hit- tr|I1NAV7|I1NAV7_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.43823 PE,75.76,0,FAMILY
NOT NAMED,NULL; seg,NULL; NT-C2,EEIG1/EHBP1 N-terminal
domain,CUFF.33113.1
         (753 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT3G11760.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   745   0.0  
AT5G04860.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   570   e-162
AT2G10560.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   253   3e-67
AT2G25460.1 | Symbols:  | CONTAINS InterPro DOMAIN/s: C2 calcium...   138   1e-32
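
The summary table above can be read programmatically. The following is a minimal sketch, not part of BLAST itself: the names HIT_LINE, parse_evalue and parse_hit_line are hypothetical, and the regular expression assumes each row ends with a bit score followed by an E-value, as in the four rows shown. Note that this legacy report prints "e-162" without a leading digit, which Python's float() does not accept as written.

```python
import re

# Hypothetical pattern: hit ID, free-text description, bit score, E-value.
HIT_LINE = re.compile(r"^(\S+)\s.*?\s+(\d+(?:\.\d+)?)\s+(\S+)\s*$")


def parse_evalue(text):
    """Convert an E-value string to float.

    Legacy BLAST abbreviates 1e-162 as "e-162"; prepend the missing "1".
    """
    if text.startswith("e"):
        text = "1" + text
    return float(text)


def parse_hit_line(line):
    """Return (hit_id, bit_score, e_value), or None if the line does not match."""
    m = HIT_LINE.match(line)
    if m is None:
        return None
    return m.group(1), float(m.group(2)), parse_evalue(m.group(3))


if __name__ == "__main__":
    row = ("AT5G04860.1 | Symbols:  | unknown protein; "
           "BEST Arabidopsis thal...   570   e-162")
    print(parse_hit_line(row))
```

For production use, an established parser is likely a better choice than a hand-written regular expression.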

>AT3G11760.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 23 plant
           structures; EXPRESSED DURING: 14 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT5G04860.1); Has 84 Blast hits to 73 proteins in
           13 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi
           - 0; Plants - 84; Viruses - 0; Other Eukaryotes - 0
           (source: NCBI BLink). | chr3:3718529-3721123 FORWARD
           LENGTH=702
          Length = 702

 Score =  745 bits (1923), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 413/745 (55%), Positives = 507/745 (68%), Gaps = 73/745 (9%)

Query: 1   MVVKMMRWRPWPP-PSKRFHVRLTVRKLTGCDLLRDD--SRSKLTLEIRWKGPKSSLPSL 57
           MVVKMM+WRPWPP  ++++ V+L+V+KL G DL+R+    + +LT+EIRWKGPK++L SL
Sbjct: 1   MVVKMMKWRPWPPLVTRKYEVKLSVKKLEGWDLVREGVPEKDRLTVEIRWKGPKATLGSL 60

Query: 58  RWNSVARNFTAEAAVDTTAGAVTW-DEEFQSLCNLTADKHNAFHPWEIAFTLF-NGLNQN 115
           R  SV RNFT EA  ++    V+W DEEFQSLC+LT+ K + F+PWEI F++F NG+ Q 
Sbjct: 61  R-RSVKRNFTKEAVGESDV--VSWEDEEFQSLCSLTSYKDSLFYPWEITFSVFTNGMKQG 117

Query: 116 IKRKVPIIGTALLNIAEFASPTDQKDFDLNIPLTLPGG-SVEPSPSLCISISLVEISGAQ 174
            K K P++GTA LN+AE+A  TD+K+FD+NIPLTL    + E  P L +S+SL+E+    
Sbjct: 118 QKNKAPVVGTAFLNLAEYACVTDKKEFDINIPLTLSACVASETHPLLFVSLSLLELRTTP 177

Query: 175 GSLESVHRTIVPVSSPPAQSG----ETTMAEKGDELSAIKAGLRKVKIFTEYVXXXXXXX 230
            + +S  +T V     P+ S     ET   EK D +SAIKAGLRKVKIFTE+V       
Sbjct: 178 ETSDSAAQTAVVPLPLPSPSPQQPTETHSVEKED-VSAIKAGLRKVKIFTEFVSTRKAKK 236

Query: 231 XXXXXXXXXXXXXXXXXDGECNYPVXXXXXXXXXXXXXXXXXXXXXFRKSFSYGPLAYAN 290
                            +G  +                         RKSFSYGPL+YAN
Sbjct: 237 ACREE------------EGRFSSFESSESLDDFETDFDEGKEELMSMRKSFSYGPLSYAN 284

Query: 291 A-GGAFCSNMRVNCDDEGWVYYSHRMSD--AGCLRMEDSTLSSSEPNVQSSMRSILSWRK 347
             G +     +V+ +DE WVYYSHR SD  AGC   EDS             RSIL WRK
Sbjct: 285 GVGTSLNCGAKVSDEDEDWVYYSHRKSDVGAGCSDAEDSAAGLVYEASLLPRRSILPWRK 344

Query: 348 RKLSFRSPKKANKGEPLLKKAYAEEGGDDIDFDRRQLSSDESLSLRLYKNEDDSCAN-RS 406
           RKLSFRSPK  +KGEPLLKK   EEGGDDIDFDRRQLSSDE+      K ++DS AN R+
Sbjct: 345 RKLSFRSPK--SKGEPLLKKDNGEEGGDDIDFDRRQLSSDEAHPPFGSKIDEDSSANPRT 402

Query: 407 SISEFGDDNFAVGSWEQKEVMSRDGHMKLQTQVFFASIDQRSERAAGESACTALVAVIAD 466
           S SEFG+D+FA+GSWE+KEV+SRDGHMKLQT VF ASIDQRSERAAGESACTALVAVIAD
Sbjct: 403 SFSEFGEDSFAIGSWEEKEVISRDGHMKLQTSVFLASIDQRSERAAGESACTALVAVIAD 462

Query: 467 WFQNSPDLMPIKSQFDSLIREGSSEWRSMCDNETYRERFPDKHFDLETVIQAKIRPLSVV 526
           WFQ + +LMPIKSQFDSLIREGS EWR++C+NETY ++FPDKHFDL+TV+QAKIRPL+V+
Sbjct: 463 WFQKNGNLMPIKSQFDSLIREGSLEWRNLCENETYMQKFPDKHFDLDTVLQAKIRPLTVI 522

Query: 527 PSKSFIGFFHPEGM-DEEKFDILHGAMSFDNIWDEISGSGHESLSNGE-------PHVYI 578
           P KSF+GFFHP+GM +E +F+ L GAMSFD+IW EI  S  ES +NG+       PHVYI
Sbjct: 523 PGKSFVGFFHPDGMINEGRFEFLQGAMSFDSIWAEII-SLEESSANGDSYDDDSPPHVYI 581

Query: 579 VSWNDHFFILKVEADCYYIIDTLGERLYEGCNQAYILKFDSSTVIHKMQNAAKSSLEDKT 638
           VSWNDHFF+LKVE + YYIIDTLGERLYEGC+QAY+LKFD  TVIHK+ +  ++      
Sbjct: 582 VSWNDHFFVLKVEKEAYYIIDTLGERLYEGCDQAYVLKFDHKTVIHKILHTEEAG----- 636

Query: 639 TSNQQTVAEVLERDNSKEVDSSSGVAEQQEEEVVLCRGKEACKEYIKSFLAAIPIRELQA 698
                                    +E + E  +L RGKE+CKEYIK+FLAAIPIRELQ 
Sbjct: 637 -------------------------SESEPESEILSRGKESCKEYIKNFLAAIPIRELQE 671

Query: 699 DVKKGLVLMSSTQVHHRLQIEFHYT 723
           D+KKGL   S+  VHHRLQIEFHYT
Sbjct: 672 DIKKGLA--STAPVHHRLQIEFHYT 694


>AT5G04860.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G11760.1); Has 1807 Blast hits to 1807 proteins
           in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
           Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
           - 339 (source: NCBI BLink). | chr5:1411760-1414459
           REVERSE LENGTH=782
          Length = 782

 Score =  570 bits (1470), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 359/793 (45%), Positives = 464/793 (58%), Gaps = 74/793 (9%)

Query: 1   MVVKM---MRWRPWPPP-SKRFHVRLTVRKLTGC---DLLRDDS------------RSKL 41
           MVVKM   MRW PWPP  + +F V + V ++ G    D   DDS            R + 
Sbjct: 1   MVVKMKQIMRWPPWPPLFAVKFDVIVVVHQMDGLLDSDGGGDDSTDQSQRGGGTTTRKRP 60

Query: 42  TLEIRWKGPKSSLPSLRWNSVARNFTAEAAVDTTAGAVTWDEEFQSLCNLTADKHNAFHP 101
            +EI+WKGPKS   +L+  SV RN T E       G V W+EEF+ +C  +  K  +F P
Sbjct: 61  VVEIKWKGPKSV--TLK-RSVVRNLTEEGGF-RGDGVVEWNEEFKRVCEFSVYKEGSFLP 116

Query: 102 WEIAFTLFNGLNQNIKRKVPIIGTALLNIAEFASPTDQKDFDLNIPLTLPGGSVEPSPSL 161
           W ++ T+F+GLNQ  K KV   G A LNIAE+ S   + D  + +PL     S   SP +
Sbjct: 117 WFVSLTVFSGLNQGSKEKVRSFGKASLNIAEYFSLMKEDDVQVKVPLKDCDSSSVRSPHV 176

Query: 162 CISISLVEISGAQGSLESVHRTIVPVSSPPAQSGETTMAEKGDELSAIKAGLRKVKIFTE 221
            IS+        + SL    R+ +PV   P  S E   AE     S +K GLRK+K F  
Sbjct: 177 HISLQF----SPKESLPERQRSALPVLWSPL-SAEAEKAE-----SVVKVGLRKMKTFNN 226

Query: 222 YVXXXXXXXXXXXXXXXXXXXX-----XXXXDGECNYPVXXXXXXX--XXXXXXXXXXXX 274
            +                             D + +YP                      
Sbjct: 227 CMSSTQASEKESEKDGSSGSGSDGKSPERNLDSDSSYPFDTDSLDEGDAADESEENKENE 286

Query: 275 XXFRKSFSYGPLAYAN-AGGAFCSNMRVNCDDEGWVYYSHR--MSDAGCLRME--DSTLS 329
                  +Y  L  AN A G+F  +   N +DE  +YYSHR  +++ G    E  +  +S
Sbjct: 287 SSLADPVNYKTLRSANWARGSF--HTVTNPEDEDLIYYSHRSPLAETGHCSDEVSNDVVS 344

Query: 330 SSEPNVQSSMRSILSWRKRKLSFRSPKKANKGEPLLKKAYAEEGGDDIDFDRRQLSSDES 389
             +   Q S + +LSW+KRKLSFRSPK+  KGEPLLKK   EEGGDDIDFDRRQLSS + 
Sbjct: 345 LEQAKGQMSKKRMLSWKKRKLSFRSPKQ--KGEPLLKKDCLEEGGDDIDFDRRQLSSSDE 402

Query: 390 LSLRLYKNEDDSCANRSSISEFGDDNFAVGSWEQKEVMSRDGHMKLQTQVFFASIDQRSE 449
            +   Y+++D   A    +S+FGDD+F VGSWE KE++SRDG MKL  +VF ASIDQRSE
Sbjct: 403 SNSDWYRSDD---AIMKPLSQFGDDDFVVGSWETKEIISRDGLMKLTARVFLASIDQRSE 459

Query: 450 RAAGESACTALVAVIADWFQNSPDLMPIKSQFDSLIREGSSEWRSMCDNETYRERFPDKH 509
           RAAGESACTALVAV+A W  ++ D++P +S+FDSLIREGSSEWR+MC+NE YRERFPDKH
Sbjct: 460 RAAGESACTALVAVMAHWLGSNRDIIPTRSEFDSLIREGSSEWRNMCENEEYRERFPDKH 519

Query: 510 FDLETVIQAKIRPLSVVPSKSFIGFFHP------EGMDEEKFDILHGAMSFDNIWDEISG 563
           FDLETV+QAK+RP+ VVP +SFIGFFHP      EG ++   D L G MSFD+IW+E+  
Sbjct: 520 FDLETVLQAKVRPICVVPERSFIGFFHPEKSEEEEGKEDASLDFLKGVMSFDSIWEELMK 579

Query: 564 SGHESLSNGEPHVYIVSWNDHFFILKVEADCYYIIDTLGERLYEGCNQAYILKFDSSTVI 623
              E  S  EP +YIVSWNDHFF+L V  D YYIIDTLGERLYEGCNQAY+LKFD    I
Sbjct: 580 QEPEE-SASEPVIYIVSWNDHFFVLLVNHDAYYIIDTLGERLYEGCNQAYVLKFDKDAEI 638

Query: 624 HKMQNAAKSSLEDKTTSNQQTVAEVLERDNSKEVDSSSGVAEQQEEEVVLCRGKEACKEY 683
            ++ +  K +  D    NQ+   +     N  E    S  +E+QEEE V+CRGKE+C+EY
Sbjct: 639 KRLPSVIKDNKAD--MGNQKQGGK-----NKSEQPERSKESEEQEEEEVVCRGKESCREY 691

Query: 684 IKSFLAAIPIRELQADVKKGLVLMSSTQVHHRLQIEFHYTQLLQ----SCPATPAVELEA 739
           IKSFLAAIPI++++AD+KKGLV    + +HHRLQIE HYT+ L     +   + A E+  
Sbjct: 692 IKSFLAAIPIQQVKADMKKGLV----SSLHHRLQIELHYTKHLHHHQPNMFESSATEVTV 747

Query: 740 SIAATPETLALAI 752
           S AA   T+A ++
Sbjct: 748 SEAAVSVTVAWSL 760


>AT2G10560.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT5G04860.1); Has 70 Blast hits to 70 proteins in
           12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi
           - 0; Plants - 70; Viruses - 0; Other Eukaryotes - 0
           (source: NCBI BLink). | chr2:4109862-4110698 REVERSE
           LENGTH=278
          Length = 278

 Score =  253 bits (646), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 129/239 (53%), Positives = 166/239 (69%), Gaps = 18/239 (7%)

Query: 495 MCDNETYRERFPDKHFDLETVIQAKIRPLSVVPSKSFIGFFH------PEGMDEEKFDIL 548
           MC+NE YRERFPDKHFDLETV+QAK+RP+ VVP ++FIGFFH       E  ++   D L
Sbjct: 1   MCENEEYRERFPDKHFDLETVLQAKVRPICVVPERTFIGFFHREKSKEEEEKEDVSLDFL 60

Query: 549 HGAMSFDNIWDEISGSGHESLSNGEPHVYIVSWNDHFFILKVEADCYYIIDTLGERLYEG 608
            G MSFD+IW+EI     E  S  E  +YIVSWNDH+F+L V  D YYIIDTLGER+YEG
Sbjct: 61  KGVMSFDSIWEEIMKQEPEE-SASEHVIYIVSWNDHYFVLLVNHDAYYIIDTLGERVYEG 119

Query: 609 CNQAYILKFDSSTVIHKMQNAAKSSLEDKTTSNQQTVAEVLERDNSKEVDSSSGVAEQQE 668
           CNQAY+LKFD    I ++ +  K +  D  +  Q    +  + + SKE       +E+Q 
Sbjct: 120 CNQAYVLKFDQDAEIKRLPSVIKDNKADMGSQKQGGKNKYEQPERSKE-------SEEQG 172

Query: 669 EEVVLCRGKEACKEYIKSFLAAIPIRELQADVKKGLVLMSSTQVHHRLQIEFHYTQLLQ 727
           EEVV+CRGKE+C+EYIKSFLAAIPI++++AD+K+GLV    +  HHRLQIE +YT+ L 
Sbjct: 173 EEVVVCRGKESCREYIKSFLAAIPIQQVKADMKEGLV----SSFHHRLQIELYYTKHLH 227


>AT2G25460.1 | Symbols:  | CONTAINS InterPro DOMAIN/s: C2
           calcium-dependent membrane targeting
           (InterPro:IPR000008); BEST Arabidopsis thaliana protein
           match is: unknown protein (TAIR:AT5G04860.1); Has 108
           Blast hits to 69 proteins in 11 species: Archae - 0;
           Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 108;
           Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).
           | chr2:10833175-10835374 REVERSE LENGTH=423
          Length = 423

 Score =  138 bits (348), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 76/143 (53%), Positives = 102/143 (71%), Gaps = 6/143 (4%)

Query: 421 WEQKEVMSRDGHMKLQTQVFFASIDQRSERAAGESACTALVAVIADWFQNSPDLM-PIKS 479
           W  K+++SRDG  KL+++V+ ASIDQRSE+AAGE+AC A+  V+A WF  +P L+ P  +
Sbjct: 282 WVMKDLVSRDGKSKLKSEVYLASIDQRSEQAAGEAACAAVAVVVAHWFHANPKLINPSGT 341

Query: 480 QFDSLIREGSSEWRSMCDNETYRERFPDKHFDLETVIQAKIRPLSVVPSKSFIGFFHPEG 539
            FDSLI +GSS W+S+CD E+Y   FP++HFDLET++ A +RP+ V   KSF G F P  
Sbjct: 342 AFDSLITQGSSLWQSLCDKESYLRLFPNRHFDLETIVSANLRPVRVCTDKSFTGLFSP-- 399

Query: 540 MDEEKFDILHGAMSFDNIWDEIS 562
              E+F  L G MSFD IWDE+S
Sbjct: 400 ---ERFASLDGLMSFDQIWDELS 419



 Score = 64.7 bits (156), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 43/164 (26%), Positives = 80/164 (48%), Gaps = 18/164 (10%)

Query: 16  KRFHVRLTVRKLTGCD-LLRDDSRSK---LTLEIRWKGPKSS-----LPSLRWNSVARNF 66
           ++ HV +   +L G   +L D++  K     +E++WKGP S      +P  R N    N 
Sbjct: 6   RKLHVTVKPVRLDGLPAILGDETAGKNLSAMVEVKWKGPVSGFGLGFVPFYRSNRPV-NH 64

Query: 67  TAEAAVDTTAGAVTWDEEFQSLCNLTADKHNAFHPWEIAFTLFNGLNQNIKRKVPIIGTA 126
           T+   +   +  V W+EEF+ +C +         PW ++F +F G N + K K  +IG A
Sbjct: 65  TSSKPIALGSNHVEWEEEFERVCCIVG-------PWNLSFNVFYGENMDAKNKKSLIGKA 117

Query: 127 LLNIAEFASPTDQKDFDLNIPLTLPGGSVEPSPSLCISISLVEI 170
            L+++E AS   +   +  +P+   G  +    +L ++++  E+
Sbjct: 118 SLDLSELAS-KQESTVERKLPIRSKGSVLSKEATLVVNVTFSEV 160