Miyakogusa Predicted Gene

Lj1g3v3975890.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj1g3v3975890.1 Non Characterized Hit- tr|I1IJ35|I1IJ35_BRADI
Uncharacterized protein OS=Brachypodium distachyon
GN=,47.17,2e-19,DDE_4,NULL; UNCHARACTERIZED,Harbinger
transposase-derived nuclease,gene.g35858.t1.1
         (177 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT3G55350.1 | Symbols:  | PIF / Ping-Pong family of plant transp...   165   1e-41
AT3G63270.1 | Symbols:  | CONTAINS InterPro DOMAIN/s: Putative h...   117   3e-27
AT4G29780.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    64   8e-11
AT5G12010.1 | Symbols:  | unknown protein; INVOLVED IN: response...    59   2e-09
AT1G72270.1 | Symbols:  | CONTAINS InterPro DOMAIN/s: Ribosome 6...    56   1e-08
AT1G72270.2 | Symbols:  | LOCATED IN: mitochondrion; EXPRESSED I...    56   1e-08

>AT3G55350.1 | Symbols:  | PIF / Ping-Pong family of plant
           transposases | chr3:20518518-20520690 FORWARD LENGTH=406
          Length = 406

 Score =  165 bits (418), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 91/211 (43%), Positives = 117/211 (55%), Gaps = 38/211 (18%)

Query: 1   MCLASTEPHNDVWLDHEKKHSMVLQAIVHPDMR--------------------------- 33
           M L + EP N VWLD EK  SM LQA+V PDMR                           
Sbjct: 194 MNLPAVEPSNKVWLDGEKNFSMTLQAVVDPDMRFLDVIAGWPGSLNDDVVLKNSGFYKLV 253

Query: 34  -----LNGRIIHLPYESKIREYIIGDSGFPLLPYLIVPYERKEEKLLGVQADFNRRHFKM 88
                LNG  + L   +++REYI+GDSGFPLLP+L+ PY+ K   L   Q +FN+RH + 
Sbjct: 254 EKGKRLNGEKLPLSERTELREYIVGDSGFPLLPWLLTPYQGKPTSL--PQTEFNKRHSEA 311

Query: 89  QMVAQRALARLKEMWGIIQGTMWRPDKHRLPSIIHVCCILHNIVIDMGDEVQDEQLSNLP 148
              AQ AL++LK+ W II G MW PD++RLP II VCC+LHNI+IDM D+  D+Q   L 
Sbjct: 312 TKAAQMALSKLKDRWRIINGVMWMPDRNRLPRIIFVCCLLHNIIIDMEDQTLDDQ--PLS 369

Query: 149 INHDSGYHQLICGVEDTQGLLLR--LTKPVC 177
             HD  Y Q  C + D    +LR  L+  +C
Sbjct: 370 QQHDMNYRQRSCKLADEASSVLRDELSDQLC 400


>AT3G63270.1 | Symbols:  | CONTAINS InterPro DOMAIN/s: Putative
           harbinger transposase-derived nuclease
           (InterPro:IPR006912); BEST Arabidopsis thaliana protein
           match is: PIF / Ping-Pong family of plant transposases
           (TAIR:AT3G55350.1); Has 30201 Blast hits to 17322
           proteins in 780 species: Archae - 12; Bacteria - 1396;
           Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
           0; Other Eukaryotes - 2996 (source: NCBI BLink). |
           chr3:23375932-23377398 REVERSE LENGTH=396
          Length = 396

 Score =  117 bits (294), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 73/203 (35%), Positives = 101/203 (49%), Gaps = 37/203 (18%)

Query: 1   MCLASTEPHNDVWLDHEKKHSMVLQAIVHPDMR--------------------------- 33
           M L + +  +D W D EK +SM LQ +   +MR                           
Sbjct: 190 MTLPAVQASDD-WCDQEKNYSMFLQGVFDHEMRFLNMVTGWPGGMTVSKLLKFSGFFKLC 248

Query: 34  -----LNGRIIHLPYESKIREYIIGDSGFPLLPYLIVPYERKEEKLLGVQADFNRRHFKM 88
                L+G    L   ++IREY++G   +PLLP+LI P++        V   FN RH K+
Sbjct: 249 ENAQILDGNPKTLSQGAQIREYVVGGISYPLLPWLITPHDSDHPSDSMVA--FNERHEKV 306

Query: 89  QMVAQRALARLKEMWGIIQGTMWRPDKHRLPSIIHVCCILHNIVIDMGDEVQDEQLSNLP 148
           + VA  A  +LK  W I+   MWRPD+ +LPSII VCC+LHNI+ID GD +Q++    L 
Sbjct: 307 RSVAATAFQQLKGSWRILSKVMWRPDRRKLPSIILVCCLLHNIIIDCGDYLQED--VPLS 364

Query: 149 INHDSGYHQLICGVEDTQGLLLR 171
            +HDSGY    C   +  G  LR
Sbjct: 365 GHHDSGYADRYCKQTEPLGSELR 387


>AT4G29780.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT5G12010.1); Has 945 Blast hits to 944 proteins
           in 87 species: Archae - 0; Bacteria - 0; Metazoa - 519;
           Fungi - 43; Plants - 365; Viruses - 0; Other Eukaryotes
           - 18 (source: NCBI BLink). | chr4:14579859-14581481
           FORWARD LENGTH=540
          Length = 540

 Score = 63.5 bits (153), Expect = 8e-11,   Method: Composition-based stats.
 Identities = 33/82 (40%), Positives = 45/82 (54%), Gaps = 3/82 (3%)

Query: 50  YIIGDSGFPLLPYLIVPYERKEEKLLGVQADFNRRHFKMQMVAQRALARLKEMWGIIQGT 109
           +I+G+SGFPL  YL+VPY R  + L   Q  FN    ++Q +A  A  RLK  W  +Q  
Sbjct: 408 WIVGNSGFPLTDYLLVPYTR--QNLTWTQHAFNESIGEIQGIATAAFERLKGRWACLQKR 465

Query: 110 MWRPDKHRLPSIIHVCCILHNI 131
                   LP ++  CC+LHNI
Sbjct: 466 T-EVKLQDLPYVLGACCVLHNI 486


>AT5G12010.1 | Symbols:  | unknown protein; INVOLVED IN: response to
           salt stress; LOCATED IN: chloroplast, plasma membrane,
           membrane; EXPRESSED IN: 23 plant structures; EXPRESSED
           DURING: 13 growth stages; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT4G29780.1);
           Has 1807 Blast hits to 1807 proteins in 277 species:
           Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347;
           Plants - 385; Viruses - 0; Other Eukaryotes - 339
           (source: NCBI BLink). | chr5:3877975-3879483 REVERSE
           LENGTH=502
          Length = 502

 Score = 58.9 bits (141), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 38/137 (27%), Positives = 63/137 (45%), Gaps = 23/137 (16%)

Query: 15  DHEKKHSMVLQAIVHPDMRLNGRIIHLPY---ESKIRE-----------------YIIGD 54
           + +  +S+ +QA+V+P        I  P    + K+ E                 ++ G 
Sbjct: 315 NQKTSYSITIQAVVNPKGVFTDLCIGWPGSMPDDKVLEKSLLYQRANNGGLLKGMWVAGG 374

Query: 55  SGFPLLPYLIVPYERKEEKLLGVQADFNRRHFKMQMVAQRALARLKEMWGIIQGTMWRPD 114
            G PLL +++VPY   ++ L   Q  FN +  ++Q VA+ A  RLK  W  +Q       
Sbjct: 375 PGHPLLDWVLVPY--TQQNLTWTQHAFNEKMSEVQGVAKEAFGRLKGRWACLQKRT-EVK 431

Query: 115 KHRLPSIIHVCCILHNI 131
              LP+++  CC+LHNI
Sbjct: 432 LQDLPTVLGACCVLHNI 448


>AT1G72270.1 | Symbols:  | CONTAINS InterPro DOMAIN/s: Ribosome 60S
           biogenesis N-terminal (InterPro:IPR021714); BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT4G27010.1); Has 772 Blast hits to 657 proteins
           in 120 species: Archae - 0; Bacteria - 0; Metazoa - 344;
           Fungi - 94; Plants - 322; Viruses - 0; Other Eukaryotes
           - 12 (source: NCBI BLink). | chr1:27199733-27211122
           REVERSE LENGTH=2845
          Length = 2845

 Score = 55.8 bits (133), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 28/93 (30%), Positives = 51/93 (54%), Gaps = 3/93 (3%)

Query: 47  IREYIIGDSGFPLLPYLIVPYERKEEKLLGVQADFNRRHFKMQMVAQRALARLKEMWGII 106
           +  YI+GDS  PLLP+L+ PY+   ++    + +FN          + A A+++  W I+
Sbjct: 262 VPRYILGDSCLPLLPWLVTPYDLTSDE-ESFREEFNNVVHTGLHSVEIAFAKVRARWRIL 320

Query: 107 QGTMWRPDK-HRLPSIIHVCCILHNIVIDMGDE 138
               W+P+    +P +I   C+LHN +++ GD+
Sbjct: 321 D-KKWKPETIEFMPFVITTGCLLHNFLVNSGDD 352


>AT1G72270.2 | Symbols:  | LOCATED IN: mitochondrion; EXPRESSED IN:
           shoot apex, embryo, flower, seed; EXPRESSED DURING:
           petal differentiation and expansion stage, E expanded
           cotyledon stage, D bilateral stage; BEST Arabidopsis
           thaliana protein match is: PIF / Ping-Pong family of
           plant transposases (TAIR:AT3G55350.1). |
           chr1:27209890-27211122 REVERSE LENGTH=410
          Length = 410

 Score = 55.8 bits (133), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 29/90 (32%), Positives = 51/90 (56%), Gaps = 3/90 (3%)

Query: 50  YIIGDSGFPLLPYLIVPYERKEEKLLGVQADFNRRHFKMQMVAQRALARLKEMWGIIQGT 109
           YI+GDS  PLLP+L+ PY+   ++    +   N  H  +  V + A A+++  W I+   
Sbjct: 265 YILGDSCLPLLPWLVTPYDLTSDEESFREEFNNVVHTGLHSV-EIAFAKVRARWRILD-K 322

Query: 110 MWRPDK-HRLPSIIHVCCILHNIVIDMGDE 138
            W+P+    +P +I   C+LHN +++ GD+
Sbjct: 323 KWKPETIEFMPFVITTGCLLHNFLVNSGDD 352
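

Note: the "Sequences producing significant alignments" table in this report lists one hit per line: identifier, truncated description, bit score, E-value. The Python sketch below shows one way such lines might be pulled into tuples for sorting or filtering. The function name `parse_hit_table` and the regular expression are illustrative assumptions, not part of BLAST or of any library, and the pattern is only known to fit the layout seen in this report.

```python
import re

# Assumed layout of a summary line (taken from this report only):
#   <id> | <free-text description, possibly truncated>   <bits>   <E-value>
HIT_LINE = re.compile(
    r"^(?P<id>\S+)\s+\|.*\s+(?P<bits>\d+(?:\.\d+)?)\s+(?P<evalue>\S+)\s*$"
)


def parse_hit_table(report_text):
    """Return (identifier, bit_score, e_value) tuples from the summary table."""
    hits = []
    in_table = False
    for line in report_text.splitlines():
        if line.startswith("Sequences producing significant alignments"):
            in_table = True
            continue
        if not in_table:
            continue
        if line.startswith(">"):
            # The first alignment record marks the end of the summary table.
            break
        match = HIT_LINE.match(line)
        if match:
            evalue = match.group("evalue")
            # Legacy BLAST may print very small E-values as "e-100".
            if evalue.startswith("e"):
                evalue = "1" + evalue
            hits.append(
                (match.group("id"), float(match.group("bits")), float(evalue))
            )
    return hits


# Two summary lines copied from the report above, plus the start of a record.
sample = """\
Sequences producing significant alignments:                      (bits) Value

AT3G55350.1 | Symbols:  | PIF / Ping-Pong family of plant transp...   165   1e-41
AT3G63270.1 | Symbols:  | CONTAINS InterPro DOMAIN/s: Putative h...   117   3e-27

>AT3G55350.1 | Symbols:  | PIF / Ping-Pong family of plant
"""

hits = parse_hit_table(sample)
for identifier, bits, evalue in hits:
    print(identifier, bits, evalue)
```

Keeping the E-value as a float makes threshold filtering straightforward, for example `[h for h in hits if h[2] < 1e-20]`.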