Miyakogusa Predicted Gene
- Lj1g3v5060670.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v5060670.1 tr|Q2HTB8|Q2HTB8_MEDTR Harpin-induced 1
OS=Medicago truncatula GN=MTR_7g118270 PE=4 SV=1,69.79,0,seg,NULL;
LEA_2,Late embryogenesis abundant protein, LEA-14; SUBFAMILY NOT
NAMED,NULL; FAMILY NOT NA,CUFF.33959.1
(231 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT4G01410.1 | Symbols: | Late embryogenesis abundant (LEA) hydr... 201 3e-52
AT3G52470.1 | Symbols: | Late embryogenesis abundant (LEA) hydr... 120 7e-28
AT3G44220.1 | Symbols: | Late embryogenesis abundant (LEA) hydr... 115 3e-26
AT2G35960.1 | Symbols: NHL12 | NDR1/HIN1-like 12 | chr2:15107150... 112 2e-25
AT4G09590.1 | Symbols: NHL22 | NDR1/HIN1-like 22 | chr4:6066128-... 109 2e-24
AT2G35970.1 | Symbols: | Late embryogenesis abundant (LEA) hydr... 107 6e-24
AT3G11660.1 | Symbols: NHL1 | NDR1/HIN1-like 1 | chr3:3679031-36... 107 8e-24
AT5G06330.1 | Symbols: | Late embryogenesis abundant (LEA) hydr... 102 3e-22
AT5G22200.1 | Symbols: | Late embryogenesis abundant (LEA) hydr... 100 9e-22
AT5G53730.1 | Symbols: | Late embryogenesis abundant (LEA) hydr... 100 1e-21
AT4G05220.1 | Symbols: | Late embryogenesis abundant (LEA) hydr... 88 5e-18
AT5G22870.1 | Symbols: | Late embryogenesis abundant (LEA) hydr... 83 2e-16
AT2G27080.2 | Symbols: | Late embryogenesis abundant (LEA) hydr... 80 1e-15
AT2G27080.1 | Symbols: | Late embryogenesis abundant (LEA) hydr... 80 1e-15
AT1G61760.1 | Symbols: | Late embryogenesis abundant (LEA) hydr... 74 6e-14
AT2G35980.1 | Symbols: YLS9, NHL10, ATNHL10 | Late embryogenesis... 74 7e-14
AT3G11650.1 | Symbols: NHL2 | NDR1/HIN1-like 2 | chr3:3676264-36... 70 9e-13
AT5G06320.1 | Symbols: NHL3 | NDR1/HIN1-like 3 | chr5:1931016-19... 69 3e-12
AT1G17620.1 | Symbols: | Late embryogenesis abundant (LEA) hydr... 69 3e-12
AT3G52460.1 | Symbols: | hydroxyproline-rich glycoprotein famil... 60 1e-09
AT2G35460.1 | Symbols: | Late embryogenesis abundant (LEA) hydr... 56 2e-08
AT5G05657.1 | Symbols: | BEST Arabidopsis thaliana protein matc... 55 5e-08
AT5G21130.1 | Symbols: | Late embryogenesis abundant (LEA) hydr... 54 7e-08
AT4G26490.1 | Symbols: | Late embryogenesis abundant (LEA) hydr... 53 2e-07
AT1G65690.1 | Symbols: | Late embryogenesis abundant (LEA) hydr... 50 1e-06
AT5G56050.1 | Symbols: | FUNCTIONS IN: molecular_function unkno... 48 6e-06
AT5G11890.1 | Symbols: | FUNCTIONS IN: molecular_function unkno... 47 1e-05
>AT4G01410.1 | Symbols: | Late embryogenesis abundant (LEA)
hydroxyproline-rich glycoprotein family |
chr4:578308-578991 FORWARD LENGTH=227
Length = 227
Score = 201 bits (511), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 99/187 (52%), Positives = 135/187 (72%)
Query: 45 KRAACTFITVFLLAVGITLLVLWLVYRPHKPRFTVVGAAVYALNTTSPPLLSAALQFNVL 104
+RA C I L+ +GI L+LWLVYRPHKPR TVVGAA+Y LN T+PPL+S ++QF+VL
Sbjct: 41 RRAICGAIFTILVILGIIALILWLVYRPHKPRLTVVGAAIYDLNFTAPPLISTSVQFSVL 100
Query: 105 IRNPNKRVSIYYDRFSAFVSYRNQPITQQVLLPPLFLEKHSQVSLSPVIGGTPMPVTVEV 164
RNPN+RVSI+YD+ S +V+Y++Q IT + LPPL L S V ++PV+GG +PV+ EV
Sbjct: 101 ARNPNRRVSIHYDKLSMYVTYKDQIITPPLPLPPLRLGHKSTVVIAPVMGGNGIPVSPEV 160
Query: 165 ANGLAMDESYGVVGLKLVFLGRVKWKAGGLRTWHHGLYVKCDLLVGLKKGFVGQVPLLQA 224
ANGL DE+YGVV +++V GR++WKAG ++T +G Y +CD+ + GQVPLL
Sbjct: 161 ANGLKNDEAYGVVLMRVVIFGRLRWKAGAIKTGRYGFYARCDVWLRFNPSSNGQVPLLAP 220
Query: 225 QACDVDL 231
C VD+
Sbjct: 221 STCKVDV 227
>AT3G52470.1 | Symbols: | Late embryogenesis abundant (LEA)
hydroxyproline-rich glycoprotein family |
chr3:19450750-19451376 FORWARD LENGTH=208
Length = 208
Score = 120 bits (301), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 65/193 (33%), Positives = 103/193 (53%), Gaps = 8/193 (4%)
Query: 46 RAACTFITVFLLAVGITLLVLWLVYRPHKPRFTVVGAAVYALNTTSPPLLSAALQFNVLI 105
R C I F++ V IT+ ++W++ RP KPRF + A VYA N + P LL++ Q +
Sbjct: 17 RKLCAAIIAFIVIVLITIFLVWVILRPTKPRFVLQDATVYAFNLSQPNLLTSNFQVTIAS 76
Query: 106 RNPNKRVSIYYDRFSAFVSYRNQPITQQVLLPPLFLEKHSQVSL-SPVIGGTPMPVTVEV 164
RNPN ++ IYYDR + +Y NQ IT + +PP + + H +V++ SP + GT +P+
Sbjct: 77 RNPNSKIGIYYDRLHVYATYMNQQITLRTAIPPTY-QGHKEVNVWSPFVYGTAVPIAPYN 135
Query: 165 ANGLAMDESYGVVGLKLVFLGRVKWKAGGLRTWHHGLYVKCDLLVGLKKGFVG------Q 218
+ L ++ G VGL + G V+WK L T + ++V+C + L G
Sbjct: 136 SVALGEEKDRGFVGLMIRADGTVRWKVRTLITGKYHIHVRCQAFINLGNKAAGVLVGDNA 195
Query: 219 VPLLQAQACDVDL 231
V A C V++
Sbjct: 196 VKYTLANKCSVNV 208
>AT3G44220.1 | Symbols: | Late embryogenesis abundant (LEA)
hydroxyproline-rich glycoprotein family |
chr3:15928216-15929645 FORWARD LENGTH=206
Length = 206
Score = 115 bits (287), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 64/181 (35%), Positives = 98/181 (54%), Gaps = 5/181 (2%)
Query: 55 FLLAVGITLLVLWLVYRPHKPRFTVVGAAVYALNTTSPPLLSAALQFNVLIRNPNKRVSI 114
FL AV + ++W + PH PRF + A +YA N + P L++ LQ + RNPN ++ I
Sbjct: 27 FLAAVLFVVFLVWAILHPHGPRFVLQDATIYAFNVSQPNYLTSNLQVTLSSRNPNDKIGI 86
Query: 115 YYDRFSAFVSYRNQPITQQVLLPPLFLEKHSQVSL-SPVIGGTPMPVTVEVANGLAMDES 173
+YDR + SYRNQ +T LLP + + H V++ SP + GT +PV + L+ D +
Sbjct: 87 FYDRLDIYASYRNQQVTLATLLPATY-QGHLDVTIWSPFLYGTTVPVAPYFSPALSQDLT 145
Query: 174 YGVVGLKLVFLGRVKWKAGGLRTWHHGLYVKCDLLVGLKKGFVGQVPLLQ---AQACDVD 230
G+V L + G V+WK G + + L+V C + L F G P ++ Q C VD
Sbjct: 146 AGMVLLNIKIDGWVRWKVGTWVSGRYRLHVNCPAYITLAGHFSGDGPAVKYQLVQRCAVD 205
Query: 231 L 231
+
Sbjct: 206 V 206
>AT2G35960.1 | Symbols: NHL12 | NDR1/HIN1-like 12 |
chr2:15107150-15107782 FORWARD LENGTH=210
Length = 210
Score = 112 bits (281), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 55/156 (35%), Positives = 84/156 (53%)
Query: 62 TLLVLWLVYRPHKPRFTVVGAAVYALNTTSPPLLSAALQFNVLIRNPNKRVSIYYDRFSA 121
T+ ++W++ +P KPRF + A VYA N + P LL++ Q + RN N R+ IYYDR
Sbjct: 35 TIFLVWIILQPTKPRFILQDATVYAFNLSQPNLLTSNFQITIASRNRNSRIGIYYDRLHV 94
Query: 122 FVSYRNQPITQQVLLPPLFLEKHSQVSLSPVIGGTPMPVTVEVANGLAMDESYGVVGLKL 181
+ +YRNQ IT + +PP + SP + G +P+ A L +++ G V L +
Sbjct: 95 YATYRNQQITLRTAIPPTYQGHKEDNVWSPFVYGNSVPIAPFNAVALGDEQNRGFVTLII 154
Query: 182 VFLGRVKWKAGGLRTWHHGLYVKCDLLVGLKKGFVG 217
GRV+WK G L T + L+V+C + L G
Sbjct: 155 RADGRVRWKVGTLITGKYHLHVRCQAFINLADKAAG 190
>AT4G09590.1 | Symbols: NHL22 | NDR1/HIN1-like 22 |
chr4:6066128-6066763 FORWARD LENGTH=211
Length = 211
Score = 109 bits (272), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 55/163 (33%), Positives = 91/163 (55%), Gaps = 2/163 (1%)
Query: 49 CTFITVFLLAVGITLLVLWLVYRPHKPRFTVVGAAVYALNTTSPPLLSAALQFNVLIRNP 108
C I F++ V +T+ ++W++ +P P F + VYA N + P LL++ Q + RN
Sbjct: 23 CGAIIGFIIIVLMTIFLVWIILQPKNPEFILQDTTVYAFNLSQPNLLTSKFQITIASRNR 82
Query: 109 NKRVSIYYDRFSAFVSYRNQPITQQVLLPPLFLEKHSQVSL-SPVIGGTPMPVTVEVANG 167
N + IYYD A+ SYRNQ IT LPP + ++H + S+ SP++ G +P+ A
Sbjct: 83 NSNIGIYYDHLHAYASYRNQQITLASDLPPTY-QRHKEDSVWSPLLYGNQVPIAPFNAVA 141
Query: 168 LAMDESYGVVGLKLVFLGRVKWKAGGLRTWHHGLYVKCDLLVG 210
L +++ GV L + G+V+WK G L ++ L+V+C +
Sbjct: 142 LGDEQNSGVFTLTICVDGQVRWKVGTLTIGNYHLHVRCQAFIN 184
>AT2G35970.1 | Symbols: | Late embryogenesis abundant (LEA)
hydroxyproline-rich glycoprotein family |
chr2:15109007-15109642 FORWARD LENGTH=211
Length = 211
Score = 107 bits (268), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 56/162 (34%), Positives = 91/162 (56%), Gaps = 2/162 (1%)
Query: 49 CTFITVFLLAVGITLLVLWLVYRPHKPRFTVVGAAVYALNTTSPPLLSAALQFNVLIRNP 108
C I F++ V +T+ ++ ++ +P KP F + VYA N + P LL++ Q + RN
Sbjct: 23 CGAIIGFIIIVLMTIFLVSIILQPKKPEFILQDTTVYAFNLSQPNLLTSKFQITIASRNR 82
Query: 109 NKRVSIYYDRFSAFVSYRNQPITQQVLLPPLFLEKHSQVSL-SPVIGGTPMPVTVEVANG 167
N + IYYD A+ SYRNQ IT LPP + ++H + S+ SP++ G +P+ A
Sbjct: 83 NSNIGIYYDHLHAYASYRNQQITLASDLPPTY-QRHKENSVWSPLLYGNQVPIAPFNAVA 141
Query: 168 LAMDESYGVVGLKLVFLGRVKWKAGGLRTWHHGLYVKCDLLV 209
L +++ GV L + GRV+WK G L ++ L+V+C +
Sbjct: 142 LGDEQNSGVFTLTICVDGRVRWKVGTLTIGNYHLHVRCQAFI 183
>AT3G11660.1 | Symbols: NHL1 | NDR1/HIN1-like 1 |
chr3:3679031-3679660 REVERSE LENGTH=209
Length = 209
Score = 107 bits (266), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 59/157 (37%), Positives = 88/157 (56%), Gaps = 4/157 (2%)
Query: 52 ITVFLLAVGITLLVLWLVYRPHKPRFTVVGAAVYALNTTSPP--LLSAALQFNVLIRNPN 109
I L + +T+L++W + +P KPRF + A VYA N + P LL++ Q + RNPN
Sbjct: 22 IIFVLFIIFLTILLIWAILQPSKPRFILQDATVYAFNVSGNPPNLLTSNFQITLSSRNPN 81
Query: 110 KRVSIYYDRFSAFVSYRNQPITQQVLLPPLFLEKHSQVSL-SPVIGGTPMPVTVEVANGL 168
++ IYYDR + +YR+Q IT +PP + + H V + SP + GT +P+ L
Sbjct: 82 NKIGIYYDRLDVYATYRSQQITFPTSIPPTY-QGHKDVDIWSPFVYGTSVPIAPFNGVSL 140
Query: 169 AMDESYGVVGLKLVFLGRVKWKAGGLRTWHHGLYVKC 205
D+ GVV L + GRV+WK G T + L+VKC
Sbjct: 141 DTDKDNGVVLLIIRADGRVRWKVGTFITGKYHLHVKC 177
>AT5G06330.1 | Symbols: | Late embryogenesis abundant (LEA)
hydroxyproline-rich glycoprotein family |
chr5:1934961-1935584 REVERSE LENGTH=207
Length = 207
Score = 102 bits (253), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 60/173 (34%), Positives = 89/173 (51%), Gaps = 12/173 (6%)
Query: 67 WLVYRPHKPRFTVVGAAVYALNTTSPP--LLSAALQFNVLIRNPNKRVSIYYDRFSAFVS 124
W + +P KPRF + A V+ N + P LL++ QF + RNPN ++ IYYDR + S
Sbjct: 39 WAILQPSKPRFVLQDATVFNFNVSGNPPNLLTSNFQFTLSSRNPNDKIGIYYDRLDVYAS 98
Query: 125 YRNQPITQQVLLPPLFL---EKHSQVSL-SPVIGGTPMPVTVEVANGLAMDESYGVVGLK 180
YR +QQ+ LP L + H +V++ SP +GG +PV A L D S G + L
Sbjct: 99 YR----SQQITLPSPMLTTYQGHKEVNVWSPFVGGYSVPVAPYNAFYLDQDHSSGAIMLM 154
Query: 181 LVFLGRVKWKAGGLRTWHHGLYVKCDLLVGLKKGFVGQV--PLLQAQACDVDL 231
L GRV+WK G T + L+V+C L+ G + + + C V +
Sbjct: 155 LHLDGRVRWKVGSFITGKYHLHVRCHALINFGSSAAGVIVGKYMLTETCSVSV 207
>AT5G22200.1 | Symbols: | Late embryogenesis abundant (LEA)
hydroxyproline-rich glycoprotein family |
chr5:7355688-7356871 FORWARD LENGTH=210
Length = 210
Score = 100 bits (249), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 56/169 (33%), Positives = 89/169 (52%), Gaps = 5/169 (2%)
Query: 67 WLVYRPHKPRFTVVGAAVYALNTTSPPLLSAALQFNVLIRNPNKRVSIYYDRFSAFVSYR 126
W + PH PRF + + N + P LS+ LQ V RNPN ++ I+YDR +V+YR
Sbjct: 43 WAILHPHGPRFVLQDVTINDFNVSQPNFLSSNLQVTVSSRNPNDKIGIFYDRLDIYVTYR 102
Query: 127 NQPITQQVLLPPLFLEKHSQVSL-SPVIGGTPMPVTVEVANGLAMDESYGVVGLKLVFLG 185
NQ +T LLP + + H +V++ SP + G+ +PV +++ L D G+V L + G
Sbjct: 103 NQEVTLARLLPSTY-QGHLEVTVWSPFLIGSAVPVAPYLSSALNEDLFAGLVLLNIKIDG 161
Query: 186 RVKWKAGGLRTWHHGLYVKCDLLVGLKKGFVGQVPLLQ---AQACDVDL 231
V+WK G + + L+V C + + G P ++ Q C VD+
Sbjct: 162 WVRWKVGSWVSGSYRLHVNCPAFITVTGKLTGTGPAIKYQLVQRCAVDV 210
>AT5G53730.1 | Symbols: | Late embryogenesis abundant (LEA)
hydroxyproline-rich glycoprotein family |
chr5:21808072-21808713 REVERSE LENGTH=213
Length = 213
Score = 99.8 bits (247), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 51/157 (32%), Positives = 90/157 (57%), Gaps = 10/157 (6%)
Query: 61 ITLLVLWLVYRPHKPRFTVVGAAVYALN--TTSPPLLSAALQFNVLIRNPNKRVSIYYDR 118
+ + ++WL+ P +P F++ A +Y+LN T+S LL++++Q + +NPNK+V IYYD+
Sbjct: 40 LIIFLVWLILHPERPEFSLTEADIYSLNLTTSSTHLLNSSVQLTLFSKNPNKKVGIYYDK 99
Query: 119 FSAFVSYRNQPITQQVLLPPLFLEKHSQVS-LSPVIGGTPMPVTVEVANGLAMDESYGVV 177
+ +YR Q IT + LPP F + H +++ L+ + GT +PV ++ + S G +
Sbjct: 100 LLVYAAYRGQQITSEASLPP-FYQSHEEINLLTAFLQGTELPVAQSFGYQISRERSTGKI 158
Query: 178 GLKLVFLGRVKWKAGGLRTWHHGLY---VKCDLLVGL 211
+ + G+++WK G TW G Y V C +V
Sbjct: 159 IIGMKMDGKLRWKIG---TWVSGAYRFNVNCLAIVAF 192
>AT4G05220.1 | Symbols: | Late embryogenesis abundant (LEA)
hydroxyproline-rich glycoprotein family |
chr4:2685104-2685784 REVERSE LENGTH=226
Length = 226
Score = 87.8 bits (216), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 48/181 (26%), Positives = 89/181 (49%), Gaps = 4/181 (2%)
Query: 49 CTFITVFLLAVGITLLVLWLVYRPHKPRFTVVGAAVYALNTTSPPLLSAALQFNVLIRNP 108
C + L VG+ +LWL RPH+PRF + V L+ + + +A + FNV I NP
Sbjct: 47 CAMFLLVLFFVGVIAFILWLSLRPHRPRFHIQDFVVQGLDQPTG-VENARIAFNVTILNP 105
Query: 109 NKRVSIYYDRFSAFVSYRNQPITQQVLLPPLFLEKHSQVSLSPVIGGTPMPVTVEVANGL 168
N+ + +Y+D + Y++Q + LL P F + + ++ + G + V
Sbjct: 106 NQHMGVYFDSMEGSIYYKDQRVGLIPLLNPFFQQPTNTTIVTGTLTGASLTVNSNRWTEF 165
Query: 169 AMDESYGVVGLKLVFLGRVKWKAGGLRTWHHGLYVKCDLLVGLKKGFVGQVPLLQAQACD 228
+ D + G VG +L + +++K + HH ++ C+++VG + G + +P + C
Sbjct: 166 SNDRAQGTVGFRLDIVSTIRFKLHRWISKHHRMHANCNIVVG-RDGLI--LPKFNHKRCP 222
Query: 229 V 229
V
Sbjct: 223 V 223
>AT5G22870.1 | Symbols: | Late embryogenesis abundant (LEA)
hydroxyproline-rich glycoprotein family |
chr5:7647056-7647679 REVERSE LENGTH=207
Length = 207
Score = 82.8 bits (203), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 50/179 (27%), Positives = 83/179 (46%), Gaps = 9/179 (5%)
Query: 54 VFLLAVGITLLVLWLVYRPHKPRFTVVGAAVYALNTTSPPLLSAALQFNVLIRNPNKRVS 113
+F+ AVG L+ WL +P K R+TV A+V N T+ +SA QF + NPN R+S
Sbjct: 37 IFMAAVG--FLITWLETKPKKLRYTVENASVQNFNLTNDNHMSATFQFTIQSHNPNHRIS 94
Query: 114 IYYDRFSAFVSYRNQPITQQVLLPPLFLEKHSQVSLSPVIGGTPMPVTVEVANGLAMDES 173
+YY FV +++Q + + P + + + + + V+ L S
Sbjct: 95 VYYSSVEIFVKFKDQTLAFDT-VEPFHQPRMNVKQIDETLIAENVAVSKSNGKDLRSQNS 153
Query: 174 YGVVGLKLVFLGRVKWKAGGLRTWHHGLYVKCD-LLVGLKKGFVGQVPLLQAQACDVDL 231
G +G ++ RV++K G ++ H +KC + V L Q Q +CD D+
Sbjct: 154 LGKIGFEVFVKARVRFKVGIWKSSHRTAKIKCSHVTVSL-----SQPNKSQNSSCDADI 207
>AT2G27080.2 | Symbols: | Late embryogenesis abundant (LEA)
hydroxyproline-rich glycoprotein family |
chr2:11566383-11567165 FORWARD LENGTH=260
Length = 260
Score = 79.7 bits (195), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 53/202 (26%), Positives = 95/202 (47%), Gaps = 8/202 (3%)
Query: 11 QNQHHHLPPPNKIQMNNHKAXXXXXXXXXXXXXXKRAACTFIT-VFLLAV--GITLLVLW 67
++Q + +PPP N H+ + C+F+ VF+L V GI+ VL+
Sbjct: 43 KDQIYRIPPPE----NAHR-FEQLSRKKTNRSNCRCCFCSFLAAVFILIVLAGISFAVLY 97
Query: 68 LVYRPHKPRFTVVGAAVYALNTTSPPLLSAALQFNVLIRNPNKRVSIYYDRFSAFVSYRN 127
L+YRP P++++ G +V +N S +S + V RN N ++ +YY++ S+ Y N
Sbjct: 98 LIYRPEAPKYSIEGFSVSGINLNSTSPISPSFNVTVRSRNGNGKIGVYYEKESSVDVYYN 157
Query: 128 QPITQQVLLPPLFLEKHSQVSLSPVIGGTPMPVTVEVANGLAMDESYGVVGLKLVFLGRV 187
++P + + + V+ G+ + +T + + + S V KL V
Sbjct: 158 DVDISNGVMPVFYQPAKNVTVVKLVLSGSKIQLTSGMRKEMRNEVSKKTVPFKLKIKAPV 217
Query: 188 KWKAGGLRTWHHGLYVKCDLLV 209
K K G ++TW + V CD+ V
Sbjct: 218 KIKFGSVKTWTMIVNVDCDVTV 239
>AT2G27080.1 | Symbols: | Late embryogenesis abundant (LEA)
hydroxyproline-rich glycoprotein family |
chr2:11566383-11567165 FORWARD LENGTH=260
Length = 260
Score = 79.7 bits (195), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 53/202 (26%), Positives = 95/202 (47%), Gaps = 8/202 (3%)
Query: 11 QNQHHHLPPPNKIQMNNHKAXXXXXXXXXXXXXXKRAACTFIT-VFLLAV--GITLLVLW 67
++Q + +PPP N H+ + C+F+ VF+L V GI+ VL+
Sbjct: 43 KDQIYRIPPPE----NAHR-FEQLSRKKTNRSNCRCCFCSFLAAVFILIVLAGISFAVLY 97
Query: 68 LVYRPHKPRFTVVGAAVYALNTTSPPLLSAALQFNVLIRNPNKRVSIYYDRFSAFVSYRN 127
L+YRP P++++ G +V +N S +S + V RN N ++ +YY++ S+ Y N
Sbjct: 98 LIYRPEAPKYSIEGFSVSGINLNSTSPISPSFNVTVRSRNGNGKIGVYYEKESSVDVYYN 157
Query: 128 QPITQQVLLPPLFLEKHSQVSLSPVIGGTPMPVTVEVANGLAMDESYGVVGLKLVFLGRV 187
++P + + + V+ G+ + +T + + + S V KL V
Sbjct: 158 DVDISNGVMPVFYQPAKNVTVVKLVLSGSKIQLTSGMRKEMRNEVSKKTVPFKLKIKAPV 217
Query: 188 KWKAGGLRTWHHGLYVKCDLLV 209
K K G ++TW + V CD+ V
Sbjct: 218 KIKFGSVKTWTMIVNVDCDVTV 239
>AT1G61760.1 | Symbols: | Late embryogenesis abundant (LEA)
hydroxyproline-rich glycoprotein family |
chr1:22807440-22808114 REVERSE LENGTH=224
Length = 224
Score = 74.3 bits (181), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 74/165 (44%), Gaps = 7/165 (4%)
Query: 49 CTFITVFLLAVGITLLVLWLVYRPHKPRFTVVGAAVYALNTTSPPLLSAALQFNVLIRNP 108
C LL +GI +LW+ +PH+PR + G ++ L + ++ + F + NP
Sbjct: 45 CAIFLSLLLCLGIITFILWISLQPHRPRVHIRGFSISGL-SRPDGFETSHISFKITAHNP 103
Query: 109 NKRVSIYYDRFSAFVSYRNQPITQQVLLPPLFLEKHSQVSLSPVIGGTPMPVTVEVANGL 168
N+ V IYYD V Y+ + I L P + + + S+ + M V + +
Sbjct: 104 NQNVGIYYDSMEGSVYYKEKRIGSTKLTNPFYQDPKNTSSIDGALSRPAMAVNKDRWMEM 163
Query: 169 AMDESYGVVGLKLVFLGRVKWKAGGLRTWH---HGLYVKCDLLVG 210
D + G + +L +++K + TWH H +Y C + +G
Sbjct: 164 ERDRNQGKIMFRLKVRSMIRFK---VYTWHSKSHKMYASCYIEIG 205
>AT2G35980.1 | Symbols: YLS9, NHL10, ATNHL10 | Late embryogenesis
abundant (LEA) hydroxyproline-rich glycoprotein family |
chr2:15110635-15111318 FORWARD LENGTH=227
Length = 227
Score = 74.3 bits (181), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 49/172 (28%), Positives = 81/172 (47%), Gaps = 12/172 (6%)
Query: 47 AACTFITVF-------LLAVGITLLVLWLVYRPHKPRFTVVGAAVYALNTTSPP-LLSAA 98
C +++F ++ +G+ L+ WL+ RP +F V A++ + TSP +L
Sbjct: 33 CGCCLLSLFVKVIISLIVILGVAALIFWLIVRPRAIKFHVTDASLTRFDHTSPDNILRYN 92
Query: 99 LQFNVLIRNPNKRVSIYYDRFSAFVSYRNQPITQQVLLPPLFLEKHSQVS-LSPVIGGTP 157
L V +RNPNKR+ +YYDR A Y + + L P F + H + L+P G
Sbjct: 93 LALTVPVRNPNKRIGLYYDRIEAHAYYEGKRFSTITLTP--FYQGHKNTTVLTPTFQGQN 150
Query: 158 MPV-TVEVANGLAMDESYGVVGLKLVFLGRVKWKAGGLRTWHHGLYVKCDLL 208
+ + + L + GV +++ F RV++K G L+ V CD L
Sbjct: 151 LVIFNAGQSRTLNAERISGVYNIEIKFRLRVRFKLGDLKFRRIKPKVDCDDL 202
>AT3G11650.1 | Symbols: NHL2 | NDR1/HIN1-like 2 |
chr3:3676264-3676986 REVERSE LENGTH=240
Length = 240
Score = 70.5 bits (171), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 50/193 (25%), Positives = 82/193 (42%), Gaps = 19/193 (9%)
Query: 49 CTFITVFLLAVGITLLVLWLVYRPHKPRFTVVGAAVYALNTTSPPLLSAALQFNVLIRNP 108
C + + +G+ L+LWL++RP+ +F V A + + L +L N IRNP
Sbjct: 56 CNILIAVAVILGVAALILWLIFRPNAVKFYVADANLNRFSFDPNNNLHYSLDLNFTIRNP 115
Query: 109 NKRVSIYYDRFSAFVSYRNQPITQQVLLPPLFLEKHSQVSLSPVIGGTPMPVTVEVANGL 168
N+RV +YYD FS Y +Q + K++ V L+ + G + + L
Sbjct: 116 NQRVGVYYDEFSVSGYYGDQRFGSANVSSFYQGHKNTTVILTKIEGQNLVVLGDGARTDL 175
Query: 169 AMDESYGVVGLKLVFLGRVKWKAGGLRTWHHGLYVKCDLLVGLKKGFVGQVPL------- 221
DE G+ + V++K +++W +KCD L ++PL
Sbjct: 176 KDDEKSGIYRINAKLRLSVRFKFWFIKSWKLKPKIKCDDL---------KIPLGSSNSTG 226
Query: 222 ---LQAQACDVDL 231
Q CD DL
Sbjct: 227 GFKFQPVQCDFDL 239
>AT5G06320.1 | Symbols: NHL3 | NDR1/HIN1-like 3 |
chr5:1931016-1931711 REVERSE LENGTH=231
Length = 231
Score = 68.9 bits (167), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 49/185 (26%), Positives = 83/185 (44%), Gaps = 11/185 (5%)
Query: 52 ITVFLLAVGITLLVLWLVYRPHKPRFTVVGAAVYALNTTSPPLLSAALQFNVLIRNPNKR 111
IT+ +L +GI L++WL++RP+ +F V A + L L N IRNPN+R
Sbjct: 53 ITIAVL-LGIAALIIWLIFRPNAIKFHVTDAKLTEFTLDPTNNLRYNLDLNFTIRNPNRR 111
Query: 112 VSIYYDRFSAFVSYRNQPITQQVLLPPLFL-EKHSQVSLSPVIGGTPMPVTVEVANGLAM 170
+ +YYD Y +Q + + K++ V + ++G + + L
Sbjct: 112 IGVYYDEIEVRGYYGDQRFGMSNNISKFYQGHKNTTVVGTKLVGQQLVLLDGGERKDLNE 171
Query: 171 DESYGVVGLKLVFLGRVKWKAGGLRTWHHGLYVKCDLLVGL----KKGFVGQVPLLQAQA 226
D + + + ++++K G +++W +KCDL V L GFV Q
Sbjct: 172 DVNSQIYRIDAKLRLKIRFKFGLIKSWRFKPKIKCDLKVPLTSNSTSGFV-----FQPTK 226
Query: 227 CDVDL 231
CDVD
Sbjct: 227 CDVDF 231
>AT1G17620.1 | Symbols: | Late embryogenesis abundant (LEA)
hydroxyproline-rich glycoprotein family |
chr1:6062313-6063107 FORWARD LENGTH=264
Length = 264
Score = 68.9 bits (167), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 45/148 (30%), Positives = 70/148 (47%), Gaps = 6/148 (4%)
Query: 65 VLWLVYRPHKPRFTVVGAAVYALNTTSPPLLSAALQFNVLIRNPNKRVSIYYD-----RF 119
V++L+YRP +P FTV + LN TS L+ A+ +V+ RNPNK V YD +
Sbjct: 82 VVYLIYRPQRPSFTVSELKISTLNFTSAVRLTTAISLSVIARNPNKNVGFIYDVTDITLY 141
Query: 120 SAFVSYRNQPITQQVLLPPLFLEKHSQVSLSPVIGGTPMPVTVEVANGLAMD-ESYGVVG 178
A + + + + K + +L IG P + A L D ++ V
Sbjct: 142 KASTGGDDDVVIGKGTIAAFSHGKKNTTTLRSTIGSPPDELDEISAGKLKGDLKAKKAVA 201
Query: 179 LKLVFLGRVKWKAGGLRTWHHGLYVKCD 206
+K+V +VK K G L+T G+ V C+
Sbjct: 202 IKIVLNSKVKVKMGALKTPKSGIRVTCE 229
>AT3G52460.1 | Symbols: | hydroxyproline-rich glycoprotein family
protein | chr3:19446970-19447872 FORWARD LENGTH=300
Length = 300
Score = 60.1 bits (144), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 51/179 (28%), Positives = 75/179 (41%), Gaps = 11/179 (6%)
Query: 46 RAACTFITVFLLAVGITLLVLWLVYRPHKPRFTVVGAAVYALNTTSPPLLSAALQFNVLI 105
R T + V ++ + I+ + WLV RP P F+V +V N T P + SA N+ I
Sbjct: 107 RGIFTGLIVLVVLLCISTTITWLVLRPQIPLFSVNNFSVSNFNVTGP-VFSAQWTANLTI 165
Query: 106 RNPNKRVSIYYDRFSAFVSYRNQPITQQVL----LPPLFLEKHSQVSLSPVI--GGTPMP 159
N N ++ Y+DR V ++N + L P+F+E V + + G P
Sbjct: 166 ENQNTKLKGYFDRIQGLVYHQNAVGEDEFLATAFFQPVFVETKKSVVIGETLTAGDKEQP 225
Query: 160 -VTVEVANGLAMDESYGVVGLKLVFLGRVKWKAGGLRTWHHGLYVKCDLLVGLKKGFVG 217
V V + + + G V L V +K G GL V C LK GF G
Sbjct: 226 KVPSWVVDEMKKERETGTVTFSLRMAVWVTFKTDGWAARESGLKVFCG---KLKVGFEG 281
>AT2G35460.1 | Symbols: | Late embryogenesis abundant (LEA)
hydroxyproline-rich glycoprotein family |
chr2:14905788-14906504 FORWARD LENGTH=238
Length = 238
Score = 55.8 bits (133), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 44/185 (23%), Positives = 78/185 (42%), Gaps = 3/185 (1%)
Query: 49 CTFITVFLLAVGITLLVLWLVYRPHKPRFTVVGAAV--YALNTTSPPLLSAALQFNVLIR 106
C + L+ +G+ L+LW + RP+ +F V A + + + S L + N IR
Sbjct: 55 CNILIGVLVCLGVVALILWFILRPNVVKFQVTEADLTRFEFDPRSHN-LHYNISLNFSIR 113
Query: 107 NPNKRVSIYYDRFSAFVSYRNQPITQQVLLPPLFLEKHSQVSLSPVIGGTPMPVTVEVAN 166
NPN+R+ I+YD+ Y +Q + + K++ V + + G + +
Sbjct: 114 NPNQRLGIHYDQLEVRGYYGDQRFSAANMTSFYQGHKNTTVVGTELNGQKLVLLGAGGRR 173
Query: 167 GLAMDESYGVVGLKLVFLGRVKWKAGGLRTWHHGLYVKCDLLVGLKKGFVGQVPLLQAQA 226
D GV + + ++++K G L +W +KC L V L +
Sbjct: 174 DFREDRRSGVYRIDVKLRFKLRFKFGFLNSWAVRPKIKCHLKVPLSTSSSDERFQFHPTK 233
Query: 227 CDVDL 231
C VDL
Sbjct: 234 CHVDL 238
>AT5G05657.1 | Symbols: | BEST Arabidopsis thaliana protein match
is: Late embryogenesis abundant (LEA)
hydroxyproline-rich glycoprotein family
(TAIR:AT5G06330.1); Has 30201 Blast hits to 17322
proteins in 780 species: Archae - 12; Bacteria - 1396;
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
0; Other Eukaryotes - 2996 (source: NCBI BLink). |
chr5:1689013-1689426 FORWARD LENGTH=137
Length = 137
Score = 54.7 bits (130), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 39/127 (30%), Positives = 59/127 (46%), Gaps = 18/127 (14%)
Query: 72 PHKPRFTVVGAAVYALNTTSPP--LLSAALQFNVLIRNPNKRVSIYYDRFSAFVSYRNQP 129
P KPRF V+ N + P L + +QFN+ RNPN + IYYD + Y N
Sbjct: 15 PSKPRFIFQDVTVFNFNVSGNPSDLNTPVVQFNLSFRNPNANIRIYYDTLDVYAFYGNG- 73
Query: 130 ITQQVLLP---PLFLEKHSQVSL-SPVIGGTPMPVTVEVANGLAMDESYGVVGLKLVFL- 184
+QQ+++P P + H + S+ SP I P N L +D+ + ++ L
Sbjct: 74 -SQQIIIPTPMPSTYQGHKEDSVWSPYIPVVPY-------NALYLDDQHHSRDGNMLMLH 125
Query: 185 --GRVKW 189
GR+ W
Sbjct: 126 LDGRISW 132
>AT5G21130.1 | Symbols: | Late embryogenesis abundant (LEA)
hydroxyproline-rich glycoprotein family |
chr5:7185968-7186813 FORWARD LENGTH=281
Length = 281
Score = 54.3 bits (129), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 43/202 (21%), Positives = 81/202 (40%), Gaps = 8/202 (3%)
Query: 11 QNQHHHLPPPNKIQMNNHKAXXXXXXXXXXXXXXKRAACTFITVFLLAV---GITLLVLW 67
++Q + +PPP N H+ +R C ++ L+ + I +
Sbjct: 64 KDQIYRVPPPE----NAHR-YEYLSRRKTNKSCCRRCLCYSLSALLIIIVLAAIAFGFFY 118
Query: 68 LVYRPHKPRFTVVGAAVYALNTTSPPLLSAALQFNVLIRNPNKRVSIYYDRFSAFVSYRN 127
LVY+PHKP+F+V G +V +N TS S ++ + +N ++ + Y++ + + N
Sbjct: 119 LVYQPHKPQFSVSGVSVTGINLTSSSPFSPVIRIKLRSQNVKGKLGLIYEKGNEADVFFN 178
Query: 128 QPITQQVLLPPLFLEKHSQVSLSPVIGGTPMPVTVEVANGLAMDESYGVVGLKLVFLGRV 187
+ + V+ G+ + + L + G V L V
Sbjct: 179 GTKLGNGEFTAFKQPAGNVTVIVTVLKGSSVKLKSSSRKELTESQKKGKVPFGLRIKAPV 238
Query: 188 KWKAGGLRTWHHGLYVKCDLLV 209
K+K G + TW + V C + V
Sbjct: 239 KFKVGSVTTWTMTITVDCKITV 260
>AT4G26490.1 | Symbols: | Late embryogenesis abundant (LEA)
hydroxyproline-rich glycoprotein family |
chr4:13380425-13381231 FORWARD LENGTH=268
Length = 268
Score = 52.8 bits (125), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 40/165 (24%), Positives = 77/165 (46%), Gaps = 14/165 (8%)
Query: 49 CTFITVFLLAVGITLLVLWLVYRPHKPRFTVVGAAVYALNTTSPPLLSAALQFNVLIRNP 108
C ++ L+ I L+++L RP P F + A ++ + +P + L V NP
Sbjct: 93 CFVFSLLLIFFAIATLIVFLAIRPRIPVFDIPNANLHTIYFDTPEFFNGDLSMLVNFTNP 152
Query: 109 NKRVSIYYDRFSAFVSYRNQPITQQVLLPPLFLEKHSQVSLSPV-----IGGTPMPVTVE 163
NK++ + +++ + + N+ I QV+ P FL+K + L P+ + G P+ VE
Sbjct: 153 NKKIEVKFEKLRIELFFFNRLIAAQVVQP--FLQKKHETRLEPIRLISSLVGLPVNHAVE 210
Query: 164 VANGLAMDESYGVVGLKLVFLGRVKWKAG-GLRTWHHGLYVKCDL 207
+ L ++ ++ G K KA G+ + + L+ +C L
Sbjct: 211 LRRQLENNK------IEYEIRGTFKVKAHFGMIHYSYQLHGRCQL 249
>AT1G65690.1 | Symbols: | Late embryogenesis abundant (LEA)
hydroxyproline-rich glycoprotein family |
chr1:24431642-24432898 REVERSE LENGTH=252
Length = 252
Score = 50.1 bits (118), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 46/189 (24%), Positives = 87/189 (46%), Gaps = 18/189 (9%)
Query: 51 FITVFLLAVGITLLVLWLVYRPHKPRFTV--VGAAVYALNTTSPPLLSAALQFNVLIRNP 108
F+ + ++AVG ++ +L+LV++P P +++ + +ALN S L+ A + +NP
Sbjct: 72 FLLLLVVAVGASIGILYLVFKPKLPDYSIDRLQLTRFALNQDSS--LTTAFNVTITAKNP 129
Query: 109 NKRVSIYYDRFSAF-VSYRNQPITQQVLLPPLFLEKHSQVSLSPVIGGTPMPVTVEVANG 167
N+++ IYY+ S V Y ++ L P F + H ++ V M + A+G
Sbjct: 130 NEKIGIYYEDGSKITVWYMEHQLSNGSL--PKFYQGHENTTVIYV----EMTGQTQNASG 183
Query: 168 L-----AMDESYGVVGLKLVFLGRVKWKAGGLRTWHHGLYVKCDLLVGLKKGFVGQVPLL 222
L + G + L++ V+ K G L+ + V+C + V V +
Sbjct: 184 LRTTLEEQQQRTGNIPLRIRVNQPVRVKFGKLKLFEVRFLVRCGVFVD--SLATNNVIKI 241
Query: 223 QAQACDVDL 231
Q+ +C L
Sbjct: 242 QSSSCKFRL 250
>AT5G56050.1 | Symbols: | FUNCTIONS IN: molecular_function unknown;
INVOLVED IN: biological_process unknown; LOCATED IN:
chloroplast; BEST Arabidopsis thaliana protein match is:
Late embryogenesis abundant (LEA) hydroxyproline-rich
glycoprotein family (TAIR:AT4G26490.1); Has 1807 Blast
hits to 1807 proteins in 277 species: Archae - 0;
Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385;
Viruses - 0; Other Eukaryotes - 339 (source: NCBI
BLink). | chr5:22701167-22702018 REVERSE LENGTH=283
Length = 283
Score = 47.8 bits (112), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 25/106 (23%), Positives = 52/106 (49%), Gaps = 2/106 (1%)
Query: 47 AACTFITVFLLAVGITLLVLWLVYRPHKPRFTVVGAAVYALNTTSPPLLSAALQFNVLIR 106
A C ++ L+ GI L+L+L +P P F + A + + SP + + +
Sbjct: 105 ALCFIFSILLIVFGIATLILYLAVKPRTPVFDISNAKLNTILFESPVYFNGDMLLQLNFT 164
Query: 107 NPNKRVSIYYDRFSAFVSYRNQPITQQVLLPPLFLEKHSQVSLSPV 152
NPNK++++ ++ + + + I Q +LP F +++ + L P+
Sbjct: 165 NPNKKLNVRFENLMVELWFADTKIATQGVLP--FSQRNGKTRLEPI 208
>AT5G11890.1 | Symbols: | FUNCTIONS IN: molecular_function unknown;
LOCATED IN: plasma membrane; EXPRESSED IN: 12 plant
structures; EXPRESSED DURING: 6 growth stages; BEST
Arabidopsis thaliana protein match is: Late
embryogenesis abundant (LEA) hydroxyproline-rich
glycoprotein family (TAIR:AT1G17620.1); Has 1807 Blast
hits to 1807 proteins in 277 species: Archae - 0;
Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385;
Viruses - 0; Other Eukaryotes - 339 (source: NCBI
BLink). | chr5:3831770-3832633 FORWARD LENGTH=287
Length = 287
Score = 47.0 bits (110), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 42/182 (23%), Positives = 76/182 (41%), Gaps = 9/182 (4%)
Query: 59 VGITLLVLWLVYRPHKPRFTVVGAAVYALNTTSPPLLSAA-----LQFNVLIRNPNKRVS 113
I ++++Y P P F+V + +N T+ S + F ++ NPN+ +S
Sbjct: 97 TAIAATAMYVIYHPRPPSFSVPSIRISRVNLTTSSDSSVSHLSSFFNFTLISENPNQHLS 156
Query: 114 IYYDRFSAFV-SYRNQPITQQVLLPPLFLEKHSQVSLSPVIGGTPMPVTV--EVANGLAM 170
YD F+ V S ++ + +P F + ++ S VI + + + A L
Sbjct: 157 FSYDPFTVTVNSAKSGTMLGNGTVPAFFSDNGNKTSFHGVIATSTAARELDPDEAKHLRS 216
Query: 171 DESYGVVGLKLVFLGRVKWKAGGLRTWHHGLYVKCDLLVG-LKKGFVGQVPLLQAQACDV 229
D + VG ++ +VK G L++ + V C+ G + KG V + C
Sbjct: 217 DLTRARVGYEIEMRTKVKMIMGKLKSEGVEIKVTCEGFEGTIPKGKTPIVATSKKTKCKS 276
Query: 230 DL 231
DL
Sbjct: 277 DL 278