Automatically load unmapped fields from _source?

This relates to #80504 where we would like users to think about field values and make Elasticsearch responsible for figuring out the best way to retrieve them, e.g. `doc_values` when available with a fallback to `_source` otherwise. One challenge with this vision is `text` fields (#81246), another one is unmapped fields that this issue focuses on.

Almost all Elasticsearch APIs only consider fields that exist in the mappings. If a field isn't mapped, queries, aggregations, etc. will treat the field the same way as if it didn't exist. This is one of the reasons why scripting users have to worry about `doc` vs. `_source` today, unmapped fields are only available in `_source`.

One way to address this discrepancy would consist of introducing an abstraction layer in our scripting API so that if a field is unmapped, then the script will look up values from `_source` instead.

Another option would be to address this more broadly by improving mappings so that they would create transient runtime fields whenever a field that is used in a query, aggregation might exist in `_source` documents but isn't mapped. This would be quite similar to `flattened` fields where sub fields are not explicitly mapped, and Elasticsearch creates transient field entries at runtime that look and feel like regular keyword fields. This way, unmapped fields that exist in `_source` could not only be used in scripts, but also in any query or aggregation using the same semantics as fields of the `keyword` family (e.g. `wildcard`.)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Automatically load unmapped fields from _source? #81357

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Automatically load unmapped fields from _source? #81357

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions