Miyakogusa Predicted Gene
- Lj3g3v0311080.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj3g3v0311080.1 Non Chatacterized Hit- tr|B9S0S2|B9S0S2_RICCO
Putative uncharacterized protein OS=Ricinus communis
G,33.51,5e-19,LEA_2,Late embryogenesis abundant protein, LEA-14;
seg,NULL; SUBFAMILY NOT NAMED,NULL; FAMILY NOT NA,CUFF.40505.1
(256 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT1G17620.1 | Symbols: | Late embryogenesis abundant (LEA) hydr... 131 4e-31
AT5G11890.1 | Symbols: | FUNCTIONS IN: molecular_function unkno... 98 5e-21
AT2G27080.2 | Symbols: | Late embryogenesis abundant (LEA) hydr... 64 1e-10
AT2G27080.1 | Symbols: | Late embryogenesis abundant (LEA) hydr... 64 1e-10
AT5G36970.1 | Symbols: NHL25 | NDR1/HIN1-like 25 | chr5:14604367... 60 2e-09
AT1G65690.1 | Symbols: | Late embryogenesis abundant (LEA) hydr... 58 5e-09
>AT1G17620.1 | Symbols: | Late embryogenesis abundant (LEA)
hydroxyproline-rich glycoprotein family |
chr1:6062313-6063107 FORWARD LENGTH=264
Length = 264
Score = 131 bits (329), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 93/232 (40%), Positives = 112/232 (48%), Gaps = 9/232 (3%)
Query: 31 QLYGANRPAYRPQP--LHXXXXXXXXXXXXXWXXXXXXXXXXXXGGAGVAFYLLYRPHHP 88
QLY ANRPAYRP W A YL+YRP P
Sbjct: 33 QLYNANRPAYRPPAGRRRTSHTRGCCCRCCCWTIFVIILLLLIVAAASAVVYLIYRPQRP 92
Query: 89 TFSVXXXXXXXXXXXXXXXXXXKFDLTVAATNPNKKNIAFSYLPTSITILSG------DV 142
+F+V L+V A NPNK N+ F Y T IT+ DV
Sbjct: 93 SFTVSELKISTLNFTSAVRLTTAISLSVIARNPNK-NVGFIYDVTDITLYKASTGGDDDV 151
Query: 143 DVGDGTIPTFHHGKKNTTLLKSSISSSGHEXXXXXXXXXXXXXXXXNGLPLKVKLDTKVK 202
+G GTI F HGKKNTT L+S+I S E + +K+ L++KVK
Sbjct: 152 VIGKGTIAAFSHGKKNTTTLRSTIGSPPDELDEISAGKLKGDLKAKKAVAIKIVLNSKVK 211
Query: 203 ATMGKLKTPRVGIRVSCDGIRVTLPTGKKPATASTSNAKCNVDVRFKIWKWT 254
MG LKTP+ GIRV+C+GI+V PTGKK TA+TS AKC VD RFKIWK T
Sbjct: 212 VKMGALKTPKSGIRVTCEGIKVVAPTGKKATTATTSAAKCKVDPRFKIWKIT 263
>AT5G11890.1 | Symbols: | FUNCTIONS IN: molecular_function unknown;
LOCATED IN: plasma membrane; EXPRESSED IN: 12 plant
structures; EXPRESSED DURING: 6 growth stages; BEST
Arabidopsis thaliana protein match is: Late
embryogenesis abundant (LEA) hydroxyproline-rich
glycoprotein family (TAIR:AT1G17620.1); Has 1807 Blast
hits to 1807 proteins in 277 species: Archae - 0;
Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385;
Viruses - 0; Other Eukaryotes - 339 (source: NCBI
BLink). | chr5:3831770-3832633 FORWARD LENGTH=287
Length = 287
Score = 98.2 bits (243), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 68/240 (28%), Positives = 102/240 (42%), Gaps = 17/240 (7%)
Query: 31 QLY-GANRPAYRPQPLHXX-------XXXXXXXXXXXWXXXXXXXXXXXXGGAGVAFYLL 82
Q+Y ANRP YRPQP W A A Y++
Sbjct: 48 QVYIPANRPVYRPQPYSRRHHHQSRPSCRRICCCCCFWSILIILILALMTAIAATAMYVI 107
Query: 83 YRPHHPTFSVXXXXXXXXXXXXXXXXXXK-----FDLTVAATNPNKKNIAFSYLPTSITI 137
Y P P+FSV F+ T+ + NPN+ +++FSY P ++T+
Sbjct: 108 YHPRPPSFSVPSIRISRVNLTTSSDSSVSHLSSFFNFTLISENPNQ-HLSFSYDPFTVTV 166
Query: 138 LSGD--VDVGDGTIPTFHHGKKNTTLLKSSISSSGHEXXXXXXXXXXXXXXXXNG-LPLK 194
S +G+GT+P F N T I++S + +
Sbjct: 167 NSAKSGTMLGNGTVPAFFSDNGNKTSFHGVIATSTAARELDPDEAKHLRSDLTRARVGYE 226
Query: 195 VKLDTKVKATMGKLKTPRVGIRVSCDGIRVTLPTGKKPATASTSNAKCNVDVRFKIWKWT 254
+++ TKVK MGKLK+ V I+V+C+G T+P GK P A++ KC D+ K+WKW+
Sbjct: 227 IEMRTKVKMIMGKLKSEGVEIKVTCEGFEGTIPKGKTPIVATSKKTKCKSDLSVKVWKWS 286
>AT2G27080.2 | Symbols: | Late embryogenesis abundant (LEA)
hydroxyproline-rich glycoprotein family |
chr2:11566383-11567165 FORWARD LENGTH=260
Length = 260
Score = 63.5 bits (153), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 46/175 (26%), Positives = 76/175 (43%), Gaps = 9/175 (5%)
Query: 75 AGVAF---YLLYRPHHPTFSVXXXXXXXXXXXXXXXXXXKFDLTVAATNPNKKNIAFSYL 131
AG++F YL+YRP P +S+ F++TV + N N K +
Sbjct: 89 AGISFAVLYLIYRPEAPKYSIEGFSVSGINLNSTSPISPSFNVTVRSRNGNGKIGVYYEK 148
Query: 132 PTSITILSGDVDVGDGTIPTFHHGKKNTTLLKSSISSSGHEXXXXXXXXXXXXXXXXNGL 191
+S+ + DVD+ +G +P F+ KN T++K +S S + +
Sbjct: 149 ESSVDVYYNDVDISNGVMPVFYQPAKNVTVVKLVLSGSKIQLTSGMRKEMRNEVSKKT-V 207
Query: 192 PLKVKLDTKVKATMGKLKTPRVGIRVSCDGIRVTLPTGKKPATASTSNAKCNVDV 246
P K+K+ VK G +KT + + V CD + K A + + KC+ DV
Sbjct: 208 PFKLKIKAPVKIKFGSVKTWTMIVNVDCD-----VTVDKLTAPSRIVSRKCSHDV 257
>AT2G27080.1 | Symbols: | Late embryogenesis abundant (LEA)
hydroxyproline-rich glycoprotein family |
chr2:11566383-11567165 FORWARD LENGTH=260
Length = 260
Score = 63.5 bits (153), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 46/175 (26%), Positives = 76/175 (43%), Gaps = 9/175 (5%)
Query: 75 AGVAF---YLLYRPHHPTFSVXXXXXXXXXXXXXXXXXXKFDLTVAATNPNKKNIAFSYL 131
AG++F YL+YRP P +S+ F++TV + N N K +
Sbjct: 89 AGISFAVLYLIYRPEAPKYSIEGFSVSGINLNSTSPISPSFNVTVRSRNGNGKIGVYYEK 148
Query: 132 PTSITILSGDVDVGDGTIPTFHHGKKNTTLLKSSISSSGHEXXXXXXXXXXXXXXXXNGL 191
+S+ + DVD+ +G +P F+ KN T++K +S S + +
Sbjct: 149 ESSVDVYYNDVDISNGVMPVFYQPAKNVTVVKLVLSGSKIQLTSGMRKEMRNEVSKKT-V 207
Query: 192 PLKVKLDTKVKATMGKLKTPRVGIRVSCDGIRVTLPTGKKPATASTSNAKCNVDV 246
P K+K+ VK G +KT + + V CD + K A + + KC+ DV
Sbjct: 208 PFKLKIKAPVKIKFGSVKTWTMIVNVDCD-----VTVDKLTAPSRIVSRKCSHDV 257
>AT5G36970.1 | Symbols: NHL25 | NDR1/HIN1-like 25 |
chr5:14604367-14605194 REVERSE LENGTH=248
Length = 248
Score = 60.1 bits (144), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 40/173 (23%), Positives = 75/173 (43%), Gaps = 8/173 (4%)
Query: 79 FYLLYRPHHPTFSVXXXXXXXXXXXXXXXXXXKFDLTVAATNPNKKNIAFSYLPTSITIL 138
YL++RP P +++ F++T+ A NPN+K + + I++L
Sbjct: 83 LYLVFRPKFPDYNIDRLQLTRFQLNQDLSLSTAFNVTITAKNPNEKIGIYYEDGSKISVL 142
Query: 139 SGDVDVGDGTIPTFHHGKKNTTLLKSSISSSGHEXXXXXXXXXXXXXXXXNG-LPLKVKL 197
+ +G++P F+ G +NTT++ + +G G +PL++++
Sbjct: 143 YMQTRISNGSLPKFYQGHENTTII--LVEMTGFTQNATSLMTTLQEQQRLTGSIPLRIRV 200
Query: 198 DTKVKATMGKLKTPRVGIRVSCDGIRVTLPTGKKPATASTSNAKCNVDVRFKI 250
V+ +GKLK +V V C G+ V +SN K RF++
Sbjct: 201 TQPVRIKLGKLKLMKVRFLVRC-GVSVDSLAANSVIRVRSSNCK----YRFRL 248
>AT1G65690.1 | Symbols: | Late embryogenesis abundant (LEA)
hydroxyproline-rich glycoprotein family |
chr1:24431642-24432898 REVERSE LENGTH=252
Length = 252
Score = 58.2 bits (139), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 40/176 (22%), Positives = 74/176 (42%), Gaps = 5/176 (2%)
Query: 74 GAGVA-FYLLYRPHHPTFSVXXXXXXXXXXXXXXXXXXKFDLTVAATNPNKKNIAFSYLP 132
GA + YL+++P P +S+ F++T+ A NPN+K +
Sbjct: 81 GASIGILYLVFKPKLPDYSIDRLQLTRFALNQDSSLTTAFNVTITAKNPNEKIGIYYEDG 140
Query: 133 TSITILSGDVDVGDGTIPTFHHGKKNTTLLKSSISSSGHEXXXXXXXXXXXXXXXXNGLP 192
+ IT+ + + +G++P F+ G +NTT++ ++ N +P
Sbjct: 141 SKITVWYMEHQLSNGSLPKFYQGHENTTVIYVEMTGQTQNASGLRTTLEEQQQRTGN-IP 199
Query: 193 LKVKLDTKVKATMGKLKTPRVGIRVSCDGIRVTLPTGKKPATASTSNAKCNVDVRF 248
L+++++ V+ GKLK V V C +L T S+S C +R
Sbjct: 200 LRIRVNQPVRVKFGKLKLFEVRFLVRCGVFVDSLATNNVIKIQSSS---CKFRLRL 252