Miyakogusa Predicted Gene
- Lj1g3v3975890.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v3975890.1 Non Characterized Hit- tr|I1IJ35|I1IJ35_BRADI
Uncharacterized protein OS=Brachypodium distachyon
GN=,47.17,2e-19,DDE_4,NULL; UNCHARACTERIZED,Harbinger
transposase-derived nuclease,gene.g35858.t1.1
(177 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                    Score   E
Sequences producing significant alignments:                        (bits)   Value
AT3G55350.1 | Symbols: | PIF / Ping-Pong family of plant transp...    165   1e-41
AT3G63270.1 | Symbols: | CONTAINS InterPro DOMAIN/s: Putative h...    117   3e-27
AT4G29780.1 | Symbols: | unknown protein; BEST Arabidopsis thal...     64   8e-11
AT5G12010.1 | Symbols: | unknown protein; INVOLVED IN: response...     59   2e-09
AT1G72270.1 | Symbols: | CONTAINS InterPro DOMAIN/s: Ribosome 6...     56   1e-08
AT1G72270.2 | Symbols: | LOCATED IN: mitochondrion; EXPRESSED I...     56   1e-08
>AT3G55350.1 | Symbols: | PIF / Ping-Pong family of plant
transposases | chr3:20518518-20520690 FORWARD LENGTH=406
Length = 406
Score = 165 bits (418), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 91/211 (43%), Positives = 117/211 (55%), Gaps = 38/211 (18%)
Query: 1 MCLASTEPHNDVWLDHEKKHSMVLQAIVHPDMR--------------------------- 33
M L + EP N VWLD EK SM LQA+V PDMR
Sbjct: 194 MNLPAVEPSNKVWLDGEKNFSMTLQAVVDPDMRFLDVIAGWPGSLNDDVVLKNSGFYKLV 253
Query: 34 -----LNGRIIHLPYESKIREYIIGDSGFPLLPYLIVPYERKEEKLLGVQADFNRRHFKM 88
LNG + L +++REYI+GDSGFPLLP+L+ PY+ K L Q +FN+RH +
Sbjct: 254 EKGKRLNGEKLPLSERTELREYIVGDSGFPLLPWLLTPYQGKPTSL--PQTEFNKRHSEA 311
Query: 89 QMVAQRALARLKEMWGIIQGTMWRPDKHRLPSIIHVCCILHNIVIDMGDEVQDEQLSNLP 148
AQ AL++LK+ W II G MW PD++RLP II VCC+LHNI+IDM D+ D+Q L
Sbjct: 312 TKAAQMALSKLKDRWRIINGVMWMPDRNRLPRIIFVCCLLHNIIIDMEDQTLDDQ--PLS 369
Query: 149 INHDSGYHQLICGVEDTQGLLLR--LTKPVC 177
HD Y Q C + D +LR L+ +C
Sbjct: 370 QQHDMNYRQRSCKLADEASSVLRDELSDQLC 400
>AT3G63270.1 | Symbols: | CONTAINS InterPro DOMAIN/s: Putative
harbinger transposase-derived nuclease
(InterPro:IPR006912); BEST Arabidopsis thaliana protein
match is: PIF / Ping-Pong family of plant transposases
(TAIR:AT3G55350.1); Has 30201 Blast hits to 17322
proteins in 780 species: Archae - 12; Bacteria - 1396;
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
0; Other Eukaryotes - 2996 (source: NCBI BLink). |
chr3:23375932-23377398 REVERSE LENGTH=396
Length = 396
Score = 117 bits (294), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 73/203 (35%), Positives = 101/203 (49%), Gaps = 37/203 (18%)
Query: 1 MCLASTEPHNDVWLDHEKKHSMVLQAIVHPDMR--------------------------- 33
M L + + +D W D EK +SM LQ + +MR
Sbjct: 190 MTLPAVQASDD-WCDQEKNYSMFLQGVFDHEMRFLNMVTGWPGGMTVSKLLKFSGFFKLC 248
Query: 34 -----LNGRIIHLPYESKIREYIIGDSGFPLLPYLIVPYERKEEKLLGVQADFNRRHFKM 88
L+G L ++IREY++G +PLLP+LI P++ V FN RH K+
Sbjct: 249 ENAQILDGNPKTLSQGAQIREYVVGGISYPLLPWLITPHDSDHPSDSMVA--FNERHEKV 306
Query: 89 QMVAQRALARLKEMWGIIQGTMWRPDKHRLPSIIHVCCILHNIVIDMGDEVQDEQLSNLP 148
+ VA A +LK W I+ MWRPD+ +LPSII VCC+LHNI+ID GD +Q++ L
Sbjct: 307 RSVAATAFQQLKGSWRILSKVMWRPDRRKLPSIILVCCLLHNIIIDCGDYLQED--VPLS 364
Query: 149 INHDSGYHQLICGVEDTQGLLLR 171
+HDSGY C + G LR
Sbjct: 365 GHHDSGYADRYCKQTEPLGSELR 387
>AT4G29780.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G12010.1); Has 945 Blast hits to 944 proteins
in 87 species: Archae - 0; Bacteria - 0; Metazoa - 519;
Fungi - 43; Plants - 365; Viruses - 0; Other Eukaryotes
- 18 (source: NCBI BLink). | chr4:14579859-14581481
FORWARD LENGTH=540
Length = 540
Score = 63.5 bits (153), Expect = 8e-11, Method: Composition-based stats.
Identities = 33/82 (40%), Positives = 45/82 (54%), Gaps = 3/82 (3%)
Query: 50 YIIGDSGFPLLPYLIVPYERKEEKLLGVQADFNRRHFKMQMVAQRALARLKEMWGIIQGT 109
+I+G+SGFPL YL+VPY R + L Q FN ++Q +A A RLK W +Q
Sbjct: 408 WIVGNSGFPLTDYLLVPYTR--QNLTWTQHAFNESIGEIQGIATAAFERLKGRWACLQKR 465
Query: 110 MWRPDKHRLPSIIHVCCILHNI 131
LP ++ CC+LHNI
Sbjct: 466 T-EVKLQDLPYVLGACCVLHNI 486
>AT5G12010.1 | Symbols: | unknown protein; INVOLVED IN: response to
salt stress; LOCATED IN: chloroplast, plasma membrane,
membrane; EXPRESSED IN: 23 plant structures; EXPRESSED
DURING: 13 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT4G29780.1);
Has 1807 Blast hits to 1807 proteins in 277 species:
Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347;
Plants - 385; Viruses - 0; Other Eukaryotes - 339
(source: NCBI BLink). | chr5:3877975-3879483 REVERSE
LENGTH=502
Length = 502
Score = 58.9 bits (141), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 38/137 (27%), Positives = 63/137 (45%), Gaps = 23/137 (16%)
Query: 15 DHEKKHSMVLQAIVHPDMRLNGRIIHLPY---ESKIRE-----------------YIIGD 54
+ + +S+ +QA+V+P I P + K+ E ++ G
Sbjct: 315 NQKTSYSITIQAVVNPKGVFTDLCIGWPGSMPDDKVLEKSLLYQRANNGGLLKGMWVAGG 374
Query: 55 SGFPLLPYLIVPYERKEEKLLGVQADFNRRHFKMQMVAQRALARLKEMWGIIQGTMWRPD 114
G PLL +++VPY ++ L Q FN + ++Q VA+ A RLK W +Q
Sbjct: 375 PGHPLLDWVLVPY--TQQNLTWTQHAFNEKMSEVQGVAKEAFGRLKGRWACLQKRT-EVK 431
Query: 115 KHRLPSIIHVCCILHNI 131
LP+++ CC+LHNI
Sbjct: 432 LQDLPTVLGACCVLHNI 448
>AT1G72270.1 | Symbols: | CONTAINS InterPro DOMAIN/s: Ribosome 60S
biogenesis N-terminal (InterPro:IPR021714); BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT4G27010.1); Has 772 Blast hits to 657 proteins
in 120 species: Archae - 0; Bacteria - 0; Metazoa - 344;
Fungi - 94; Plants - 322; Viruses - 0; Other Eukaryotes
- 12 (source: NCBI BLink). | chr1:27199733-27211122
REVERSE LENGTH=2845
Length = 2845
Score = 55.8 bits (133), Expect = 1e-08, Method: Composition-based stats.
Identities = 28/93 (30%), Positives = 51/93 (54%), Gaps = 3/93 (3%)
Query: 47 IREYIIGDSGFPLLPYLIVPYERKEEKLLGVQADFNRRHFKMQMVAQRALARLKEMWGII 106
+ YI+GDS PLLP+L+ PY+ ++ + +FN + A A+++ W I+
Sbjct: 262 VPRYILGDSCLPLLPWLVTPYDLTSDE-ESFREEFNNVVHTGLHSVEIAFAKVRARWRIL 320
Query: 107 QGTMWRPDK-HRLPSIIHVCCILHNIVIDMGDE 138
W+P+ +P +I C+LHN +++ GD+
Sbjct: 321 D-KKWKPETIEFMPFVITTGCLLHNFLVNSGDD 352
>AT1G72270.2 | Symbols: | LOCATED IN: mitochondrion; EXPRESSED IN:
shoot apex, embryo, flower, seed; EXPRESSED DURING:
petal differentiation and expansion stage, E expanded
cotyledon stage, D bilateral stage; BEST Arabidopsis
thaliana protein match is: PIF / Ping-Pong family of
plant transposases (TAIR:AT3G55350.1). |
chr1:27209890-27211122 REVERSE LENGTH=410
Length = 410
Score = 55.8 bits (133), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 29/90 (32%), Positives = 51/90 (56%), Gaps = 3/90 (3%)
Query: 50 YIIGDSGFPLLPYLIVPYERKEEKLLGVQADFNRRHFKMQMVAQRALARLKEMWGIIQGT 109
YI+GDS PLLP+L+ PY+ ++ + N H + V + A A+++ W I+
Sbjct: 265 YILGDSCLPLLPWLVTPYDLTSDEESFREEFNNVVHTGLHSV-EIAFAKVRARWRILD-K 322
Query: 110 MWRPDK-HRLPSIIHVCCILHNIVIDMGDE 138
W+P+ +P +I C+LHN +++ GD+
Sbjct: 323 KWKPETIEFMPFVITTGCLLHNFLVNSGDD 352
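
The following is a minimal Python sketch, not part of the BLAST output, of one way the "Sequences producing significant alignments" summary lines in this report could be parsed and filtered by E-value. The names HIT_RE, parse_hits, and filter_hits are hypothetical. The regular expression assumes each summary line ends with a bit score and an E-value separated by whitespace, and that the E-value is written in a form Python's float() accepts.

```python
import re

# Summary lines copied from the hit table in this report.
SUMMARY = """\
AT3G55350.1 | Symbols: | PIF / Ping-Pong family of plant transp... 165 1e-41
AT3G63270.1 | Symbols: | CONTAINS InterPro DOMAIN/s: Putative h... 117 3e-27
AT4G29780.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 64 8e-11
AT5G12010.1 | Symbols: | unknown protein; INVOLVED IN: response... 59 2e-09
AT1G72270.1 | Symbols: | CONTAINS InterPro DOMAIN/s: Ribosome 6... 56 1e-08
AT1G72270.2 | Symbols: | LOCATED IN: mitochondrion; EXPRESSED I... 56 1e-08
"""

# Hypothetical pattern: subject ID, a '|' separator, free text,
# then the bit score and the E-value as the last two tokens.
HIT_RE = re.compile(
    r"^(?P<id>\S+)\s+\|.*\s(?P<bits>\d+(?:\.\d+)?)\s+(?P<evalue>\S+)$"
)


def parse_hits(text):
    """Return a list of (subject_id, bit_score, e_value) tuples."""
    hits = []
    for line in text.splitlines():
        match = HIT_RE.match(line.strip())
        if match:
            hits.append(
                (match["id"], float(match["bits"]), float(match["evalue"]))
            )
    return hits


def filter_hits(hits, max_evalue):
    """Keep only hits whose E-value is at or below max_evalue."""
    return [hit for hit in hits if hit[2] <= max_evalue]


if __name__ == "__main__":
    for subject_id, bits, evalue in filter_hits(parse_hits(SUMMARY), 1e-20):
        print(subject_id, bits, evalue)
```

With a threshold of 1e-20, only the two hits with the smallest E-values in the table (AT3G55350.1 and AT3G63270.1) pass the filter.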