Miyakogusa Predicted Gene - Lj4g3v0575050.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj4g3v0575050.1 Non Characterized Hit- tr|G7JMG5|G7JMG5_MEDTR
Putative uncharacterized protein OS=Medicago
truncatul,86.36,0,seg,NULL; DDE_4,NULL; UNCHARACTERIZED,Harbinger
transposase-derived nuclease,CUFF.47595.1
(436 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Sequences producing significant alignments:                         Score (bits)  E value
AT3G19120.1 | Symbols: | PIF / Ping-Pong family of plant transp...           609  e-174
AT5G12010.1 | Symbols: | unknown protein; INVOLVED IN: response...           100  2e-21
AT4G29780.1 | Symbols: | unknown protein; BEST Arabidopsis thal...            84  2e-16
AT3G63270.1 | Symbols: | CONTAINS InterPro DOMAIN/s: Putative h...            78  1e-14
AT3G55350.1 | Symbols: | PIF / Ping-Pong family of plant transp...            73  3e-13
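Note on reading the hit list programmatically: each summary line above ends with a bit score followed by an E-value, and this legacy BLAST version prints very small E-values without the leading "1" (for example "e-174"). The following is a minimal Python sketch, not part of the BLAST output, that extracts the identifier, bit score and E-value from one such line. The names HIT_LINE, parse_evalue and parse_hit_line are hypothetical helpers introduced only for illustration.

```python
import re

# One summary line: identifier, free-text description, bit score, E-value.
# The description is matched lazily so the last two tokens are the numbers.
HIT_LINE = re.compile(
    r"^(?P<id>\S+)\s+(?P<desc>.*?)\s+(?P<bits>\d+(?:\.\d+)?)\s+(?P<evalue>\S+)\s*$"
)


def parse_evalue(text):
    """Convert a BLAST E-value string to float, accepting the 'e-174' shorthand."""
    # Assumption: a value starting with 'e' stands for '1e...'.
    return float("1" + text if text.startswith("e") else text)


def parse_hit_line(line):
    """Return (identifier, bit_score, evalue), or None if the line is not a hit."""
    match = HIT_LINE.match(line)
    if match is None:
        return None
    try:
        evalue = parse_evalue(match.group("evalue"))
    except ValueError:
        return None
    return match.group("id"), float(match.group("bits")), evalue


if __name__ == "__main__":
    sample = "AT5G12010.1 | Symbols: | unknown protein; INVOLVED IN: response... 100 2e-21"
    print(parse_hit_line(sample))
```

Header lines such as "Sequences producing significant alignments:" do not end in a number followed by an E-value, so the helper returns None for them and they can be skipped.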
>AT3G19120.1 | Symbols: | PIF / Ping-Pong family of plant
transposases | chr3:6609678-6611018 REVERSE LENGTH=446
Length = 446
Score = 609 bits (1570), Expect = e-174, Method: Compositional matrix adjust.
Identities = 295/398 (74%), Positives = 339/398 (85%), Gaps = 3/398 (0%)
Query: 42 APLLFFTMASVLSYVASTKANNASDNREGNHRRQPAPNASD--YSVSAFRALSTEHIWSL 99
APLLFFT+AS+LS++A +++ S + + P P +D YSV+AFRAL+T+HIWSL
Sbjct: 49 APLLFFTLASLLSFLAVNRSSTESSSSSESPSPSPPPPLADGDYSVAAFRALTTDHIWSL 108
Query: 100 EAPLRDAHWRSLYGLSYPVFTTVVDKLKPHIALSNLSLPSDYAVAMVLSRLAHGLSATTV 159
+APLRDA WRSLYGLSYPVF TVVDKLKP I SNLSLP+DYAVAMVLSRLAHG SA T+
Sbjct: 109 DAPLRDARWRSLYGLSYPVFITVVDKLKPFITASNLSLPADYAVAMVLSRLAHGCSAKTL 168
Query: 160 AARYSLDPYLVSKITNMVTRLLATKLYPEFIKIPVGRRRLLETTQAFEELTSLPNMCGAI 219
A+RYSLDPYL+SKITNMVTRLLATKLYPEFIKIPVG+RRL+ETTQ FEELTSLPN+CGAI
Sbjct: 169 ASRYSLDPYLISKITNMVTRLLATKLYPEFIKIPVGKRRLIETTQGFEELTSLPNICGAI 228
Query: 220 DTSPVKLRSTPTSNPAT-YLCRYGYPSVLLQVVSDHKKIFWDVCVKAPGGTDDATHFRDS 278
D++PVKLR NP Y C+YGY +VLLQVV+DHKKIFWDVCVKAPGG DD++HFRDS
Sbjct: 229 DSTPVKLRRRTKLNPRNIYGCKYGYDAVLLQVVADHKKIFWDVCVKAPGGEDDSSHFRDS 288
Query: 279 LLYHRLTSGDVVWDKVINVRGHHVRPYVVGDWCYXXXXXXXXXXXXXGMGTPAQNLFDGM 338
LLY RLTSGD+VW+KVIN+RGHHVRPY+VGDWCY G GTP +NLFDGM
Sbjct: 289 LLYKRLTSGDIVWEKVINIRGHHVRPYIVGDWCYPLLSFLMTPFSPNGSGTPPENLFDGM 348
Query: 339 LMKGRSVVVEAIALLKGRWKILQDLNVGLHHAPQTIVACCVLHNLCQIAREPEPELWKEP 398
LMKGRSVVVEAI LLK RWKILQ LNVG++HAPQTIVACCVLHNLCQIAREPEPE+WK+P
Sbjct: 349 LMKGRSVVVEAIGLLKARWKILQSLNVGVNHAPQTIVACCVLHNLCQIAREPEPEIWKDP 408
Query: 399 DESGPQPRVLDSEKSFYFFGESLRQALADDLHQKLSSR 436
DE+G RVL+SE+ FY++GESLRQALA+DLHQ+LSSR
Sbjct: 409 DEAGTPARVLESERQFYYYGESLRQALAEDLHQRLSSR 446
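Note on the statistics line of each alignment: Identities, Positives and Gaps are given as a count over the alignment length plus a whole-number percentage. In this report the percentages appear to be truncated rather than rounded (3/398 is shown as 0%, and 146/326 in the next alignment as 44%). The sketch below, in Python, re-derives them under that assumption; FRACTION and parse_alignment_stats are hypothetical names used for illustration only.

```python
import re

# Matches e.g. "Identities = 295/398 (74%)".
FRACTION = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_alignment_stats(line):
    """Parse a BLAST statistics line into per-field counts and percentages."""
    stats = {}
    for name, count, length, reported in FRACTION.findall(line):
        count, length = int(count), int(length)
        stats[name] = {
            "count": count,
            "length": length,
            "reported_pct": int(reported),
            # Integer division: assumes the report truncates percentages.
            "floor_pct": 100 * count // length,
            "exact_pct": 100.0 * count / length,
        }
    return stats


if __name__ == "__main__":
    line = "Identities = 295/398 (74%), Positives = 339/398 (85%), Gaps = 3/398 (0%)"
    for name, values in parse_alignment_stats(line).items():
        print(name, values["floor_pct"], round(values["exact_pct"], 2))
```

Comparing floor_pct with reported_pct is a quick consistency check when the report has passed through text extraction.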
>AT5G12010.1 | Symbols: | unknown protein; INVOLVED IN: response to
salt stress; LOCATED IN: chloroplast, plasma membrane,
membrane; EXPRESSED IN: 23 plant structures; EXPRESSED
DURING: 13 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT4G29780.1);
Has 1807 Blast hits to 1807 proteins in 277 species:
Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347;
Plants - 385; Viruses - 0; Other Eukaryotes - 339
(source: NCBI BLink). | chr5:3877975-3879483 REVERSE
LENGTH=502
Length = 502
Score = 100 bits (250), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 89/326 (27%), Positives = 146/326 (44%), Gaps = 31/326 (9%)
Query: 94 EHIWSLEAPLRDAHWRSLYGLSYPVFTTVVDKLKPHIALSNLSL----PSDYAVAMVLSR 149
E L+ P D ++ + +S F + D+L +A + +L P VA+ + R
Sbjct: 163 EECSRLDYPEED--FKKAFRMSKSTFELICDELNSAVAKEDTALRNAIPVRQRVAVCIWR 220
Query: 150 LAHGLSATTVAARYSLDPYLVSKITNMVTRLLATKLYPEFIKIPVGRRRLLETTQAFEEL 209
LA G V+ ++ L K+ V + + L P++++ P L + FE +
Sbjct: 221 LATGEPLRLVSKKFGLGISTCHKLVLEVCKAIKDVLMPKYLQWP-DDESLRNIRERFESV 279
Query: 210 TSLPNMCGAIDTSPVKLRSTPTSNPATYLCRYGYP-------SVLLQVVSDHKKIFWDVC 262
+ +PN+ G++ T+ + + + P + A+Y + S+ +Q V + K +F D+C
Sbjct: 280 SGIPNVVGSMYTTHIPIIA-PKISVASYFNKRHTERNQKTSYSITIQAVVNPKGVFTDLC 338
Query: 263 VKAPGGTDDATHFRDSLLYHRLTSGDVVWDKVINVRGHHVRPYVVGDWCYXXXXXXXXXX 322
+ PG D SLLY R +G ++ K + V G P + DW
Sbjct: 339 IGWPGSMPDDKVLEKSLLYQRANNGGLL--KGMWVAGGPGHPLL--DWVLVPYTQQNL-- 392
Query: 323 XXXGMGTPAQNLFDGMLMKGRSVVVEAIALLKGRWKILQD-LNVGLHHAPQTIVACCVLH 381
T Q+ F+ + + + V EA LKGRW LQ V L P + ACCVLH
Sbjct: 393 ------TWTQHAFNEKMSEVQGVAKEAFGRLKGRWACLQKRTEVKLQDLPTVLGACCVLH 446
Query: 382 NLCQIAREP-EPELWKE--PDESGPQ 404
N+C++ E EPEL E DE P+
Sbjct: 447 NICEMREEKMEPELMVEVIDDEVLPE 472
>AT4G29780.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G12010.1); Has 945 Blast hits to 944 proteins
in 87 species: Archae - 0; Bacteria - 0; Metazoa - 519;
Fungi - 43; Plants - 365; Viruses - 0; Other Eukaryotes
- 18 (source: NCBI BLink). | chr4:14579859-14581481
FORWARD LENGTH=540
Length = 540
Score = 84.0 bits (206), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 76/302 (25%), Positives = 129/302 (42%), Gaps = 29/302 (9%)
Query: 107 HWRSLYGLSYPVFTTVVDKLKPHIALSNL----SLPSDYAVAMVLSRLAHGLSATTVAAR 162
+R + +S F + ++L + N ++P+ V + + RLA G V+ R
Sbjct: 212 EFRREFRMSKSTFNLICEELDTTVTKKNTMLRDAIPAPKRVGVCVWRLATGAPLRHVSER 271
Query: 163 YSLDPYLVSKITNMVTRLLATKLYPEFIKIPVGRRRLLETTQAFEELTSLPNMCGAIDTS 222
+ L K+ V R + L P+++ P + T FE + +PN+ G+I T+
Sbjct: 272 FGLGISTCHKLVIEVCRAIYDVLMPKYLLWP-SDSEINSTKAKFESVHKIPNVVGSIYTT 330
Query: 223 PVKLRSTPTSNPATYLCRYGYP-------SVLLQVVSDHKKIFWDVCVKAPGG-TDDATH 274
+ + + P + A Y + S+ +Q V + IF DVC+ PG TDD
Sbjct: 331 HIPIIA-PKVHVAAYFNKRHTERNQKTSYSITVQGVVNADGIFTDVCIGNPGSLTDDQIL 389
Query: 275 FRDSLLYHRLTSGDVVWDKVINVRGHHVRPYVVGDWCYXXXXXXXXXXXXXGMGTPAQNL 334
+ SL R G + ++ G + Y++ + T Q+
Sbjct: 390 EKSSLSRQRAARGMLRDSWIVGNSGFPLTDYLLVPYTRQNL-------------TWTQHA 436
Query: 335 FDGMLMKGRSVVVEAIALLKGRWKILQD-LNVGLHHAPQTIVACCVLHNLCQIAREPE-P 392
F+ + + + + A LKGRW LQ V L P + ACCVLHN+C++ +E P
Sbjct: 437 FNESIGEIQGIATAAFERLKGRWACLQKRTEVKLQDLPYVLGACCVLHNICEMRKEEMLP 496
Query: 393 EL 394
EL
Sbjct: 497 EL 498
>AT3G63270.1 | Symbols: | CONTAINS InterPro DOMAIN/s: Putative
harbinger transposase-derived nuclease
(InterPro:IPR006912); BEST Arabidopsis thaliana protein
match is: PIF / Ping-Pong family of plant transposases
(TAIR:AT3G55350.1); Has 30201 Blast hits to 17322
proteins in 780 species: Archae - 12; Bacteria - 1396;
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
0; Other Eukaryotes - 2996 (source: NCBI BLink). |
chr3:23375932-23377398 REVERSE LENGTH=396
Length = 396
Score = 77.8 bits (190), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 71/265 (26%), Positives = 112/265 (42%), Gaps = 13/265 (4%)
Query: 127 KPHIALSNLS---LPSDYAVAMVLSRLAHGLSATTVAARYSLDPYLVSKITNMVTRLLAT 183
+P L N+ L + VA+ L RLA G S +V A + + VS++T L
Sbjct: 90 RPPSGLINIEGRLLSVEKQVAIALRRLASGDSQVSVGAAFGVGQSTVSQVTWRFIEALEE 149
Query: 184 KLYPEFIKIPVGRRRLLETTQAFEELTSLPNMCGAIDTSPVKLRSTPTSNPATYLCRYGY 243
+ ++ P R+ E FEE+ LPN CGAIDT+ + + + +
Sbjct: 150 RA-KHHLRWP-DSDRIEEIKSKFEEMYGLPNCCGAIDTTHIIMTLPAVQASDDWCDQEKN 207
Query: 244 PSVLLQVVSDHKKIFWDVCVKAPGGTDDATHFRDSLLYHRLTSGDVVWDKVINV-RGHHV 302
S+ LQ V DH+ F ++ PGG + + S + + ++ + +G +
Sbjct: 208 YSMFLQGVFDHEMRFLNMVTGWPGGMTVSKLLKFSGFFKLCENAQILDGNPKTLSQGAQI 267
Query: 303 RPYVVGDWCYXXXXXXXXXXXXXGMGTPAQNL--FDGMLMKGRSVVVEAIALLKGRWKIL 360
R YVVG Y P+ ++ F+ K RSV A LKG W+IL
Sbjct: 268 REYVVGGISYPLLPWLITPHDS---DHPSDSMVAFNERHEKVRSVAATAFQQLKGSWRIL 324
Query: 361 QDL--NVGLHHAPQTIVACCVLHNL 383
+ P I+ CC+LHN+
Sbjct: 325 SKVMWRPDRRKLPSIILVCCLLHNI 349
>AT3G55350.1 | Symbols: | PIF / Ping-Pong family of plant
transposases | chr3:20518518-20520690 FORWARD LENGTH=406
Length = 406
Score = 73.2 bits (178), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 65/251 (25%), Positives = 107/251 (42%), Gaps = 21/251 (8%)
Query: 143 VAMVLSRLAHGLSATTVAARYSLDPYLVSKIT-----NMVTRLLATKLYPEFIKIPVGRR 197
VA+ L RL G S + + + ++ VS+IT +M R + +P
Sbjct: 115 VAVALRRLGSGESLSVIGETFGMNQSTVSQITWRFVESMEERAIHHLSWPS--------- 165
Query: 198 RLLETTQAFEELTSLPNMCGAIDTSPVKLRSTPTSNPATYLCRYGYP--SVLLQVVSDHK 255
+L E FE+++ LPN CGAID + + + + P P+ + G S+ LQ V D
Sbjct: 166 KLDEIKSKFEKISGLPNCCGAIDITHIVM-NLPAVEPSNKVWLDGEKNFSMTLQAVVDPD 224
Query: 256 KIFWDVCVKAPGGTDDATHFRDSLLYHRLTSGDVV-WDKVINVRGHHVRPYVVGDWCYXX 314
F DV PG +D ++S Y + G + +K+ +R Y+VGD +
Sbjct: 225 MRFLDVIAGWPGSLNDDVVLKNSGFYKLVEKGKRLNGEKLPLSERTELREYIVGDSGFPL 284
Query: 315 XXXXXXXXXXXGMGTPAQNLFDGMLMKGRSVVVEAIALLKGRWKILQDLNV--GLHHAPQ 372
P Q F+ + A++ LK RW+I+ + + P+
Sbjct: 285 LPWLLTPYQGKPTSLP-QTEFNKRHSEATKAAQMALSKLKDRWRIINGVMWMPDRNRLPR 343
Query: 373 TIVACCVLHNL 383
I CC+LHN+
Sbjct: 344 IIFVCCLLHNI 354