Miyakogusa Predicted Gene
- Lj0g3v0163869.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0163869.1 Non Chatacterized Hit- tr|I3SKM7|I3SKM7_LOTJA
Uncharacterized protein OS=Lotus japonicus PE=2 SV=1,98.68,0,seg,NULL;
LEA_2,Late embryogenesis abundant protein, LEA-14,CUFF.10230.1
(302 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT5G42860.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 263 1e-70
AT1G45688.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 258 3e-69
AT1G45688.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 181 5e-46
AT4G35170.1 | Symbols: | Late embryogenesis abundant (LEA) hydr... 167 1e-41
AT2G41990.1 | Symbols: | CONTAINS InterPro DOMAIN/s: Late embry... 135 4e-32
AT3G24600.1 | Symbols: | Late embryogenesis abundant protein, g... 126 1e-29
AT3G08490.1 | Symbols: | BEST Arabidopsis thaliana protein matc... 59 3e-09
>AT5G42860.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: plasma membrane;
EXPRESSED IN: 19 plant structures; EXPRESSED DURING: 11
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT1G45688.1); Has 1807 Blast
hits to 1807 proteins in 277 species: Archae - 0;
Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385;
Viruses - 0; Other Eukaryotes - 339 (source: NCBI
BLink). | chr5:17183339-17184857 REVERSE LENGTH=320
Length = 320
Score = 263 bits (671), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 152/321 (47%), Positives = 190/321 (59%), Gaps = 26/321 (8%)
Query: 1 MHAKTDSEVTSIXXXXXXXXXXXXXLYFVQSPSRDSHDGEKTVTTSFHSTPVLXXXXXXX 60
MHAKTDSEVTS+ YFVQSPSRDSHDGEKT T SFHSTPVL
Sbjct: 1 MHAKTDSEVTSLSASSPTRSPRRPA-YFVQSPSRDSHDGEKTAT-SFHSTPVLTSPMGSP 58
Query: 61 XXXXXXXXXXXXXKKDNPPHHHSLKPWKQIDVIEEEGLLQGEDRDR-TLSRRCYXXXXXX 119
K N KQ +IEEEGLL DR++ L RRCY
Sbjct: 59 PHSHSSSSRFS---KINGSKRKGHAGEKQFAMIEEEGLLDDGDREQEALPRRCYVLAFIV 115
Query: 120 XXXXXXXXXXXXXWGASRPMKPKIFIKSIKFDHVQVQAGSDSTGVATDMISMNSTLKFTY 179
+ A++P KPKI +KSI F+ ++VQAG D+ G+ TDMI+MN+TL+ Y
Sbjct: 116 GFSLLFAFFSLILYAAAKPQKPKISVKSITFEQLKVQAGQDAGGIGTDMITMNATLRMLY 175
Query: 180 RNTGTFFGVHVASTPLELSYSEIVIAAGNMKEFYQHRRSHRLVSVAVMGNKIPLYGSGAS 239
RNTGTFFGVHV S+P++LS+S+I I +G++K+FYQ R+S R V V V+G+KIPLYGSG++
Sbjct: 176 RNTGTFFGVHVTSSPIDLSFSQITIGSGSIKKFYQSRKSQRTVVVNVLGDKIPLYGSGST 235
Query: 240 LSST--------------------TGMPTVPVPLNLNFVLRSRAYVLGKLVKPKYYKRIQ 279
L P PVP+ LNF +RSRAYVLGKLV+PK+YKRI
Sbjct: 236 LVPPPPPAPIPKPKKKKGPIVIVEPPAPPAPVPMRLNFTVRSRAYVLGKLVQPKFYKRIV 295
Query: 280 CSITLDPKKLSAPIPLKHSCT 300
C I + KKLS IP+ ++CT
Sbjct: 296 CLINFEHKKLSKHIPITNNCT 316
>AT1G45688.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: plasma membrane;
EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT5G42860.1); Has 258 Blast
hits to 242 proteins in 39 species: Archae - 0; Bacteria
- 11; Metazoa - 10; Fungi - 14; Plants - 198; Viruses -
17; Other Eukaryotes - 8 (source: NCBI BLink). |
chr1:17191502-17192870 FORWARD LENGTH=342
Length = 342
Score = 258 bits (660), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 153/340 (45%), Positives = 194/340 (57%), Gaps = 42/340 (12%)
Query: 1 MHAKTDSEVTSIXXXXXXXXXXXXXLYFVQSPSRDSHDGEKTVTTSFHSTPVLX------ 54
MHAKTDSEVTS+ +Y+VQSPSRDSHDGEKT T SFHSTPVL
Sbjct: 1 MHAKTDSEVTSLAASSPARSPRRP-VYYVQSPSRDSHDGEKTAT-SFHSTPVLSPMGSPP 58
Query: 55 -------XXXXXXXXXXXXXXXXXXXKKDNP------PHHHSLKPWKQIDVIEEEGLLQG 101
+K NP H K WK+ VIEEEGLL
Sbjct: 59 HSHSSMGRHSRESSSSRFSGSLKPGSRKVNPNDGSKRKGHGGEKQWKECAVIEEEGLLDD 118
Query: 102 EDRDRTLSRRCYXXXXXXXXXXXXXXXXXXXWGASRPMKPKIFIKSIKFDHVQVQAGSDS 161
DRD + RRCY +GA++PMKPKI +KSI F+ +++QAG D+
Sbjct: 119 GDRDGGVPRRCYVLAFIVGFFILFGFFSLILYGAAKPMKPKITVKSITFETLKIQAGQDA 178
Query: 162 TGVATDMISMNSTLKFTYRNTGTFFGVHVASTPLELSYSEIVIAAGNMKEFYQHRRSHRL 221
GV TDMI+MN+TL+ YRNTGTFFGVHV STP++LS+S+I I +G++K+FYQ R+S R
Sbjct: 179 GGVGTDMITMNATLRMLYRNTGTFFGVHVTSTPIDLSFSQIKIGSGSVKKFYQGRKSERT 238
Query: 222 VSVAVMGNKIPLYGSGAS---------------------LSSTTGMPTVPVPLNLNFVLR 260
V V V+G KIPLYGSG++ P PVP+ L+FV+R
Sbjct: 239 VLVHVIGEKIPLYGSGSTLLPPAPPAPLPKPKKKKGAPVPIPDPPAPPAPVPMTLSFVVR 298
Query: 261 SRAYVLGKLVKPKYYKRIQCSITLDPKKLSAPIPLKHSCT 300
SRAYVLGKLV+PK+YK+I+C I + K L+ I + +CT
Sbjct: 299 SRAYVLGKLVQPKFYKKIECDINFEHKNLNKHIVITKNCT 338
>AT1G45688.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: plasma membrane;
EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT5G42860.1); Has 35333 Blast
hits to 34131 proteins in 2444 species: Archae - 798;
Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants -
531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI
BLink). | chr1:17191502-17192464 FORWARD LENGTH=248
Length = 248
Score = 181 bits (459), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 106/240 (44%), Positives = 134/240 (55%), Gaps = 25/240 (10%)
Query: 1 MHAKTDSEVTSIXXXXXXXXXXXXXLYFVQSPSRDSHDGEKTVTTSFHSTPVLX------ 54
MHAKTDSEVTS+ +Y+VQSPSRDSHDGEKT T SFHSTPVL
Sbjct: 1 MHAKTDSEVTSLAASSPARSPRRP-VYYVQSPSRDSHDGEKTAT-SFHSTPVLSPMGSPP 58
Query: 55 -------XXXXXXXXXXXXXXXXXXXKKDNPPH------HHSLKPWKQIDVIEEEGLLQG 101
+K NP H K WK+ VIEEEGLL
Sbjct: 59 HSHSSMGRHSRESSSSRFSGSLKPGSRKVNPNDGSKRKGHGGEKQWKECAVIEEEGLLDD 118
Query: 102 EDRDRTLSRRCYXXXXXXXXXXXXXXXXXXXWGASRPMKPKIFIKSIKFDHVQVQAGSDS 161
DRD + RRCY +GA++PMKPKI +KSI F+ +++QAG D+
Sbjct: 119 GDRDGGVPRRCYVLAFIVGFFILFGFFSLILYGAAKPMKPKITVKSITFETLKIQAGQDA 178
Query: 162 TGVATDMISMNSTLKFTYRNTGTFFGVHVASTPLELSYSEIVIAAGN----MKEFYQHRR 217
GV TDMI+MN+TL+ YRNTGTFFGVHV STP++LS+S+I I +G+ +++ Y+ R
Sbjct: 179 GGVGTDMITMNATLRMLYRNTGTFFGVHVTSTPIDLSFSQIKIGSGSVSLPIQKLYRMRE 238
>AT4G35170.1 | Symbols: | Late embryogenesis abundant (LEA)
hydroxyproline-rich glycoprotein family |
chr4:16736839-16738186 FORWARD LENGTH=299
Length = 299
Score = 167 bits (422), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 78/169 (46%), Positives = 112/169 (66%), Gaps = 1/169 (0%)
Query: 133 WGASRPMKPKIFIKSIKFDHVQVQAGSDSTGVATDMISMNSTLKFTYRNTGTFFGVHVAS 192
WG S+ P +K + +++ VQ+G+D +GV TDM+++NST++ YRN TFF VHV S
Sbjct: 129 WGVSKSFAPIATLKEMVLENLNVQSGNDQSGVLTDMLTLNSTVRILYRNPATFFTVHVTS 188
Query: 193 TPLELSYSEIVIAAGNMKEFYQHRRSHRLVSVAVMGNKIPLYGSGASLSSTTGMP-TVPV 251
PL+LSYS++++A+G M EF Q R+S R++ V G++IPLYG +L P V +
Sbjct: 189 APLQLSYSQLILASGQMGEFSQRRKSERIIETKVFGDQIPLYGGVPALFGQRAEPDQVVL 248
Query: 252 PLNLNFVLRSRAYVLGKLVKPKYYKRIQCSITLDPKKLSAPIPLKHSCT 300
PLNL F LR+RAYVLG+LVK ++ I+CSIT KL + L SC+
Sbjct: 249 PLNLTFTLRARAYVLGRLVKTTFHSNIKCSITFYGDKLGKTLDLSKSCS 297
>AT2G41990.1 | Symbols: | CONTAINS InterPro DOMAIN/s: Late
embryogenesis abundant protein, group 2
(InterPro:IPR004864); BEST Arabidopsis thaliana protein
match is: Late embryogenesis abundant (LEA)
hydroxyproline-rich glycoprotein family
(TAIR:AT4G35170.1); Has 172 Blast hits to 168 proteins
in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 172; Viruses - 0; Other Eukaryotes -
0 (source: NCBI BLink). | chr2:17527396-17528527 FORWARD
LENGTH=297
Length = 297
Score = 135 bits (339), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 76/167 (45%), Positives = 105/167 (62%), Gaps = 5/167 (2%)
Query: 133 WGASRPMKPKIFIKSIKFDHVQVQAGSDSTGVATDMISMNSTLKFTYRNTGTFFGVHVAS 192
WGAS+ PK+ +K + + +QAG+D +GV TDM+S+NST++ YRN TFF VHV +
Sbjct: 134 WGASKSYPPKVTVKGMLVRDLNLQAGNDLSGVPTDMLSLNSTVRIYYRNPSTFFAVHVTA 193
Query: 193 TPLELSYSEIVIAAGNMKEFYQHRRSHRLVSVAVMGNKIPLYGSGASLSSTTGMPTVPVP 252
+PL L YS +++++G M +F R V V G++IPLYG G S + T+ +P
Sbjct: 194 SPLLLHYSNLLLSSGEMNKFTVGRNGETNVVTVVQGHQIPLYG-GVSFH----LDTLSLP 248
Query: 253 LNLNFVLRSRAYVLGKLVKPKYYKRIQCSITLDPKKLSAPIPLKHSC 299
LNL VL S+AY+LG+LV K+Y RI CS TLD L I L SC
Sbjct: 249 LNLTIVLHSKAYILGRLVTSKFYTRIICSFTLDANHLPKSISLLRSC 295
>AT3G24600.1 | Symbols: | Late embryogenesis abundant protein,
group 2 | chr3:8972195-8974867 REVERSE LENGTH=506
Length = 506
Score = 126 bits (317), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 64/165 (38%), Positives = 97/165 (58%), Gaps = 2/165 (1%)
Query: 133 WGASRPMKPKIFIKSIKFDHVQVQAGSDSTGVATDMISMNSTLKFTYRNTGTFFGVHVAS 192
WGAS P P + +KS+ G D TGVAT ++S NS++K T + +FG+HV+S
Sbjct: 336 WGASHPFSPIVSVKSVDIHSFYYGEGIDRTGVATKILSFNSSVKVTIDSPAPYFGIHVSS 395
Query: 193 TPLELSYSEIVIAAGNMKEFYQHRRSHRLVSVAVMGNKIPLYGSGASLSSTTGMPTVPVP 252
+ +L++S + +A G +K +YQ R+S + V + G ++PLYG+G L+++ VPV
Sbjct: 396 STFKLTFSALTLATGQLKSYYQPRKSKHISIVKLTGAEVPLYGAGPHLAASDKKGKVPV- 454
Query: 253 LNLNFVLRSRAYVLGKLVKPKYYKRIQCSITLDPKKLSAPIPLKH 297
L F +RSR +LGKLVK K+ + CS + K S PI H
Sbjct: 455 -KLEFEIRSRGNLLGKLVKSKHENHVSCSFFISSSKTSKPIEFTH 498
Score = 91.3 bits (225), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 72/256 (28%), Positives = 115/256 (44%), Gaps = 13/256 (5%)
Query: 1 MHAKTDSEVTSIXXXXXXXXXXXXXLYFVQSPSRDSHDGEKTVTTSFHSTPVLXXXXXXX 60
M+ K+DS+VTS+ Y+VQSPSRDS T+ +TP
Sbjct: 3 MYPKSDSDVTSLDLSSPKRPT-----YYVQSPSRDSDKSSSVALTTHQTTPTESPSHPSI 57
Query: 61 XXXXXXXXXXXXXKKDNPPHHHSLKPWKQIDVIEEEGLLQGED---RDRTLS-RRCYXXX 116
K +H + W D EE G + ED +R +S C
Sbjct: 58 ASRVSNGGGGGFRWKGRRKYHGGI--WWPADK-EEGGDGRYEDLYEDNRGVSIVTCRLIL 114
Query: 117 XXXXXXXXXXXXXXXXWGASRPMKPKIFIKSIKFDHVQVQAGSDSTGVATDMISMNSTLK 176
+GAS+ P ++IK + GSD+TGV T ++++ ++
Sbjct: 115 GVVATLSIFFLLCSVLFGASQSSPPIVYIKGVNVRSFYYGEGSDNTGVPTKIMNVKCSVV 174
Query: 177 FTYRNTGTFFGVHVASTPLELSYS-EIVIAAGNMKEFYQHRRSHRLVSVAVMGNKIPLYG 235
T N T FG+HV+ST + L YS + +A +K ++Q ++S+ + ++G+K+PLYG
Sbjct: 175 ITTHNPSTLFGIHVSSTAVSLIYSRQFTLANARLKSYHQPKQSNHTSRINLIGSKVPLYG 234
Query: 236 SGASLSSTTGMPTVPV 251
+GA L ++ VPV
Sbjct: 235 AGAELVASDNSGGVPV 250
>AT3G08490.1 | Symbols: | BEST Arabidopsis thaliana protein match
is: Late embryogenesis abundant protein, group 2
(TAIR:AT3G24600.1); Has 161 Blast hits to 158 proteins
in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 161; Viruses - 0; Other Eukaryotes -
0 (source: NCBI BLink). | chr3:2574105-2575125 REVERSE
LENGTH=271
Length = 271
Score = 59.3 bits (142), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 33/154 (21%), Positives = 69/154 (44%), Gaps = 1/154 (0%)
Query: 135 ASRPMKPKIFIKSIKFDHVQVQAGSDSTGVATDMISMNSTLKFTYRNTGTFFGVHVASTP 194
A++P P I + +F+ ++ G DS GV+T ++ N + K N FG+H+
Sbjct: 103 ATQPPHPNISFRIGRFNQFMLEEGVDSHGVSTKFLTFNCSTKLIIDNKSNVFGLHIHPPS 162
Query: 195 LELSYSEIVIAAGNMKEFYQHRRSHRLVSVAVMGNKIPLYGSGASLSSTTGMPTVPVPLN 254
++ + + A + Y + + +YG+G ++ + +PL
Sbjct: 163 IKFFFGPLNFAKAQGPKLYGLSHESTTFQLYIATTNRAMYGAGTEMNDML-LSRAGLPLI 221
Query: 255 LNFVLRSRAYVLGKLVKPKYYKRIQCSITLDPKK 288
L + S V+ ++ PKY+ +++C + L K+
Sbjct: 222 LRTSIISDYRVVWNIINPKYHHKVECLLLLADKE 255