Miyakogusa Predicted Gene

Lj5g3v0841340.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj5g3v0841340.1 Non Chatacterized Hit- tr|G7I7Z4|G7I7Z4_MEDTR
Putative uncharacterized protein OS=Medicago
truncatul,79,0,coiled-coil,NULL; NT-C2,EEIG1/EHBP1 N-terminal domain;
FAMILY NOT NAMED,NULL; seg,NULL,CUFF.54067.1
         (774 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT3G11760.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   776   0.0  
AT5G04860.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   583   e-166
AT2G10560.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   240   2e-63
AT2G25460.1 | Symbols:  | CONTAINS InterPro DOMAIN/s: C2 calcium...   133   4e-31

>AT3G11760.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 23 plant
           structures; EXPRESSED DURING: 14 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT5G04860.1); Has 84 Blast hits to 73 proteins in
           13 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi
           - 0; Plants - 84; Viruses - 0; Other Eukaryotes - 0
           (source: NCBI BLink). | chr3:3718529-3721123 FORWARD
           LENGTH=702
          Length = 702

 Score =  776 bits (2005), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 432/762 (56%), Positives = 515/762 (67%), Gaps = 87/762 (11%)

Query: 1   MVVKMMKKWRPWPPLVSRKYEVKLLVRTLQGCDLLREGAREG-MFAVEIRWKGPKLALSS 59
           MVVKMMK WRPWPPLV+RKYEVKL V+ L+G DL+REG  E     VEIRWKGPK  L S
Sbjct: 1   MVVKMMK-WRPWPPLVTRKYEVKLSVKKLEGWDLVREGVPEKDRLTVEIRWKGPKATLGS 59

Query: 60  LRRSAVARNFTKEAAAGCDGDNNNDVVLW-DEEFQSFCTLSAYKDNNNAFHPWEIAFTVF 118
           LRRS V RNFTKEA         +DVV W DEEFQS C+L++YKD+   F+PWEI F+VF
Sbjct: 60  LRRS-VKRNFTKEAVG------ESDVVSWEDEEFQSLCSLTSYKDS--LFYPWEITFSVF 110

Query: 119 -NGL--NQRPKVPVIGTASLNLAEFASVIDQKDFDLNIPLTIPGGSAXXXXXXXXXXXXX 175
            NG+   Q+ K PV+GTA LNLAE+A V D+K+FD+NIPLT+    A             
Sbjct: 111 TNGMKQGQKNKAPVVGTAFLNLAEYACVTDKKEFDINIPLTLSACVASETHPLLFVSLSL 170

Query: 176 XXLRAAQESSELVQKSVVPVASPLAQT----GETNLAEKDEVSTIKAGLRKVKILTEFVX 231
             LR   E+S+   ++ V      + +     ET+  EK++VS IKAGLRKVKI TEFV 
Sbjct: 171 LELRTTPETSDSAAQTAVVPLPLPSPSPQQPTETHSVEKEDVSAIKAGLRKVKIFTEFVS 230

Query: 232 XXXXXXXXXXXXXXXXNLSARSEDGEYNYPFDSDSLDDFEEGESDEVKED-PNVRKSFSY 290
                             + R E+G ++    S+SLDDFE  + DE KE+  ++RKSFSY
Sbjct: 231 TRKAKK------------ACREEEGRFSSFESSESLDDFET-DFDEGKEELMSMRKSFSY 277

Query: 291 GKLAYANAEGSFYSSIRVKSDDDVDEGWVYYSNHISDTGXXXXXXXXXXXXXXXXXXXQS 350
           G L+YAN  G+  +     SD+D D  WVYYS+  SD G                     
Sbjct: 278 GPLSYANGVGTSLNCGAKVSDEDED--WVYYSHRKSDVGAGCSDAEDSAAGLVYEASLLP 335

Query: 351 SKRSILPWRKRKLSFRSPKSKGEPLLKKAYGEEGGDDIDFDRRQLSSDESLSP--GKTED 408
            +RSILPWRKRKLSFRSPKSKGEPLLKK  GEEGGDDIDFDRRQLSSDE+  P   K ++
Sbjct: 336 -RRSILPWRKRKLSFRSPKSKGEPLLKKDNGEEGGDDIDFDRRQLSSDEAHPPFGSKIDE 394

Query: 409 DSCAN-RTSISEFGDDNFAVGSWEQKEVMSRDGHMKLQAQVFFASIDQRSERAAGESACT 467
           DS AN RTS SEFG+D+FA+GSWE+KEV+SRDGHMKLQ  VF ASIDQRSERAAGESACT
Sbjct: 395 DSSANPRTSFSEFGEDSFAIGSWEEKEVISRDGHMKLQTSVFLASIDQRSERAAGESACT 454

Query: 468 ALVAVIADWFQNNHDLMPIKSQFDSLIREGSLEWRNLCENQTYMERFPDKHFDLETVIQA 527
           ALVAVIADWFQ N +LMPIKSQFDSLIREGSLEWRNLCEN+TYM++FPDKHFDL+TV+QA
Sbjct: 455 ALVAVIADWFQKNGNLMPIKSQFDSLIREGSLEWRNLCENETYMQKFPDKHFDLDTVLQA 514

Query: 528 KTRPLSVVPGKSFIGFFHPEGM-DEGRFDFLHGAMSFDNIWDEI-----SHNAGHDCTYN 581
           K RPL+V+PGKSF+GFFHP+GM +EGRF+FL GAMSFD+IW EI     S   G     +
Sbjct: 515 KIRPLTVIPGKSFVGFFHPDGMINEGRFEFLQGAMSFDSIWAEIISLEESSANGDSYDDD 574

Query: 582 GEPQVYIISWNDHFFILKVEVDAYYIIDTLGERLYEGCNQAYILKFDSNTLIHKMPEVAQ 641
             P VYI+SWNDHFF+LKVE +AYYIIDTLGERLYEGC+QAY+LKFD  T+IHK+    +
Sbjct: 575 SPPHVYIVSWNDHFFVLKVEKEAYYIIDTLGERLYEGCDQAYVLKFDHKTVIHKILHTEE 634

Query: 642 SSDEKTITDQQQTVADVLENNDKQIQQVNAKEADSVAAXXXXXXXXXXXXXXVVCRGKDA 701
           +  E                           E +S                 ++ RGK++
Sbjct: 635 AGSE--------------------------SEPES----------------EILSRGKES 652

Query: 702 CKEYIKSFLAAIPIRELQADVKKGLVSSTPLHHRLQIEFHYT 743
           CKEYIK+FLAAIPIRELQ D+KKGL S+ P+HHRLQIEFHYT
Sbjct: 653 CKEYIKNFLAAIPIRELQEDIKKGLASTAPVHHRLQIEFHYT 694


>AT5G04860.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G11760.1); Has 1807 Blast hits to 1807 proteins
           in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
           Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
           - 339 (source: NCBI BLink). | chr5:1411760-1414459
           REVERSE LENGTH=782
          Length = 782

 Score =  583 bits (1502), Expect = e-166,   Method: Compositional matrix adjust.
 Identities = 363/813 (44%), Positives = 476/813 (58%), Gaps = 97/813 (11%)

Query: 1   MVVKM--MKKWRPWPPLVSRKYEVKLLVRTLQGC--------DLLREGAREGMFA----- 45
           MVVKM  + +W PWPPL + K++V ++V  + G         D   +  R G        
Sbjct: 1   MVVKMKQIMRWPPWPPLFAVKFDVIVVVHQMDGLLDSDGGGDDSTDQSQRGGGTTTRKRP 60

Query: 46  -VEIRWKGPKLALSSLRRSAVARNFTKEAAAGCDGDNNNDVVLWDEEFQSFCTLSAYKDN 104
            VEI+WKGPK    +L+RS V RN T+E     DG     VV W+EEF+  C  S YK+ 
Sbjct: 61  VVEIKWKGPKSV--TLKRSVV-RNLTEEGGFRGDG-----VVEWNEEFKRVCEFSVYKEG 112

Query: 105 NNAFHPWEIAFTVFNGLNQ--RPKVPVIGTASLNLAEFASVIDQKDFDLNIPLTIPGGSA 162
             +F PW ++ TVF+GLNQ  + KV   G ASLN+AE+ S++ + D  + +PL     S+
Sbjct: 113 --SFLPWFVSLTVFSGLNQGSKEKVRSFGKASLNIAEYFSLMKEDDVQVKVPLKDCDSSS 170

Query: 163 XXXXXXXXXXXXXXXLRAAQESSELVQKSVVPVA-SPLAQTGETNLAEKDEVSTIKAGLR 221
                            + +ES    Q+S +PV  SPL+       AEK E S +K GLR
Sbjct: 171 VRSPHVHISLQF-----SPKESLPERQRSALPVLWSPLSAE-----AEKAE-SVVKVGLR 219

Query: 222 KVKILTEFVXXXXXXXXXXXXXXXXXNLSA-----RSEDGEYNYPFDSDSLD--DFEEGE 274
           K+K     +                 + S      R+ D + +YPFD+DSLD  D  +  
Sbjct: 220 KMKTFNNCMSSTQASEKESEKDGSSGSGSDGKSPERNLDSDSSYPFDTDSLDEGDAADES 279

Query: 275 SDEVKEDPNVRKSFSYGKLAYAN-AEGSFYSSIRVKSDDDVDEGWVYYSNH--ISDTGXX 331
            +  + + ++    +Y  L  AN A GSF++    +     DE  +YYS+   +++TG  
Sbjct: 280 EENKENESSLADPVNYKTLRSANWARGSFHTVTNPE-----DEDLIYYSHRSPLAETGHC 334

Query: 332 XXXXXXXXXXXXXXXXXQSSKRSILPWRKRKLSFRSPKSKGEPLLKKAYGEEGGDDIDFD 391
                            Q SK+ +L W+KRKLSFRSPK KGEPLLKK   EEGGDDIDFD
Sbjct: 335 SDEVSNDVVSLEQAKG-QMSKKRMLSWKKRKLSFRSPKQKGEPLLKKDCLEEGGDDIDFD 393

Query: 392 RRQLSS-DESLSPGKTEDDSCANRTSISEFGDDNFAVGSWEQKEVMSRDGHMKLQAQVFF 450
           RRQLSS DES S     DD+      +S+FGDD+F VGSWE KE++SRDG MKL A+VF 
Sbjct: 394 RRQLSSSDESNSDWYRSDDAIMK--PLSQFGDDDFVVGSWETKEIISRDGLMKLTARVFL 451

Query: 451 ASIDQRSERAAGESACTALVAVIADWFQNNHDLMPIKSQFDSLIREGSLEWRNLCENQTY 510
           ASIDQRSERAAGESACTALVAV+A W  +N D++P +S+FDSLIREGS EWRN+CEN+ Y
Sbjct: 452 ASIDQRSERAAGESACTALVAVMAHWLGSNRDIIPTRSEFDSLIREGSSEWRNMCENEEY 511

Query: 511 MERFPDKHFDLETVIQAKTRPLSVVPGKSFIGFFHP------EGMDEGRFDFLHGAMSFD 564
            ERFPDKHFDLETV+QAK RP+ VVP +SFIGFFHP      EG ++   DFL G MSFD
Sbjct: 512 RERFPDKHFDLETVLQAKVRPICVVPERSFIGFFHPEKSEEEEGKEDASLDFLKGVMSFD 571

Query: 565 NIWDEISHNAGHDCTYNGEPQVYIISWNDHFFILKVEVDAYYIIDTLGERLYEGCNQAYI 624
           +IW+E+      +     EP +YI+SWNDHFF+L V  DAYYIIDTLGERLYEGCNQAY+
Sbjct: 572 SIWEELMKQEPEESA--SEPVIYIVSWNDHFFVLLVNHDAYYIIDTLGERLYEGCNQAYV 629

Query: 625 LKFDSNTLIHKMPEVAQSSDEKTITDQQQTVADVLENNDKQIQQVNAKEADSVAAXXXXX 684
           LKFD +  I ++P V        I D +  + +  +    + +Q    +           
Sbjct: 630 LKFDKDAEIKRLPSV--------IKDNKADMGNQKQGGKNKSEQPERSKESEEQE----- 676

Query: 685 XXXXXXXXXVVCRGKDACKEYIKSFLAAIPIRELQADVKKGLVSSTPLHHRLQIEFHYTQ 744
                    VVCRGK++C+EYIKSFLAAIPI++++AD+KKGLVSS  LHHRLQIE HYT+
Sbjct: 677 ------EEEVVCRGKESCREYIKSFLAAIPIQQVKADMKKGLVSS--LHHRLQIELHYTK 728

Query: 745 LLQSYDI---------VPVAEASMTVPETLALA 768
            L  +           V V+EA+++V    +LA
Sbjct: 729 HLHHHQPNMFESSATEVTVSEAAVSVTVAWSLA 761


>AT2G10560.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT5G04860.1); Has 70 Blast hits to 70 proteins in
           12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi
           - 0; Plants - 70; Viruses - 0; Other Eukaryotes - 0
           (source: NCBI BLink). | chr2:4109862-4110698 REVERSE
           LENGTH=278
          Length = 278

 Score =  240 bits (613), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 136/280 (48%), Positives = 177/280 (63%), Gaps = 40/280 (14%)

Query: 504 LCENQTYMERFPDKHFDLETVIQAKTRPLSVVPGKSFIGFFH------PEGMDEGRFDFL 557
           +CEN+ Y ERFPDKHFDLETV+QAK RP+ VVP ++FIGFFH       E  ++   DFL
Sbjct: 1   MCENEEYRERFPDKHFDLETVLQAKVRPICVVPERTFIGFFHREKSKEEEEKEDVSLDFL 60

Query: 558 HGAMSFDNIWDEISHNAGHDCTYNGEPQVYIISWNDHFFILKVEVDAYYIIDTLGERLYE 617
            G MSFD+IW+EI      +     E  +YI+SWNDH+F+L V  DAYYIIDTLGER+YE
Sbjct: 61  KGVMSFDSIWEEIMKQEPEESA--SEHVIYIVSWNDHYFVLLVNHDAYYIIDTLGERVYE 118

Query: 618 GCNQAYILKFDSNTLIHKMPEVAQSSDEKTITDQQQTVADVLENNDKQIQQVNAKEADSV 677
           GCNQAY+LKFD +  I ++P V +  ++  +  Q+Q         +K  Q   +KE++  
Sbjct: 119 GCNQAYVLKFDQDAEIKRLPSVIK-DNKADMGSQKQG------GKNKYEQPERSKESEEQ 171

Query: 678 AAXXXXXXXXXXXXXXVVCRGKDACKEYIKSFLAAIPIRELQADVKKGLVSSTPLHHRLQ 737
                           VVCRGK++C+EYIKSFLAAIPI++++AD+K+GLVSS   HHRLQ
Sbjct: 172 G------------EEVVVCRGKESCREYIKSFLAAIPIQQVKADMKEGLVSS--FHHRLQ 217

Query: 738 IEFHYTQLLQ---------SYDIVPVAEA--SMTVPETLA 766
           IE +YT+ L          S   V V+EA  SMTV   LA
Sbjct: 218 IELYYTKHLHHRQPNMFESSTTKVTVSEATVSMTVAWLLA 257


>AT2G25460.1 | Symbols:  | CONTAINS InterPro DOMAIN/s: C2
           calcium-dependent membrane targeting
           (InterPro:IPR000008); BEST Arabidopsis thaliana protein
           match is: unknown protein (TAIR:AT5G04860.1); Has 108
           Blast hits to 69 proteins in 11 species: Archae - 0;
           Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 108;
           Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).
           | chr2:10833175-10835374 REVERSE LENGTH=423
          Length = 423

 Score =  133 bits (335), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 74/143 (51%), Positives = 100/143 (69%), Gaps = 6/143 (4%)

Query: 430 WEQKEVMSRDGHMKLQAQVFFASIDQRSERAAGESACTALVAVIADWFQNNHDLM-PIKS 488
           W  K+++SRDG  KL+++V+ ASIDQRSE+AAGE+AC A+  V+A WF  N  L+ P  +
Sbjct: 282 WVMKDLVSRDGKSKLKSEVYLASIDQRSEQAAGEAACAAVAVVVAHWFHANPKLINPSGT 341

Query: 489 QFDSLIREGSLEWRNLCENQTYMERFPDKHFDLETVIQAKTRPLSVVPGKSFIGFFHPEG 548
            FDSLI +GS  W++LC+ ++Y+  FP++HFDLET++ A  RP+ V   KSF G F PE 
Sbjct: 342 AFDSLITQGSSLWQSLCDKESYLRLFPNRHFDLETIVSANLRPVRVCTDKSFTGLFSPE- 400

Query: 549 MDEGRFDFLHGAMSFDNIWDEIS 571
               RF  L G MSFD IWDE+S
Sbjct: 401 ----RFASLDGLMSFDQIWDELS 419