
MachineLearning.GetTrainedModelsStats always throws a deserialization exception #8271


Closed
svalbuena opened this issue Jul 26, 2024 · 7 comments
Labels
8.x (Relates to an 8.x client version) · Category: Bug

Comments


svalbuena commented Jul 26, 2024

Elastic.Clients.Elasticsearch version: 8.14.6

Elasticsearch version: 8.14.3

.NET runtime version: 8.0

Operating system version: Windows

Description of the problem including expected versus actual behavior:
I'm using the MachineLearning.GetTrainedModelsStatsAsync call to get the stats of a trained model. The request always fails with System.Text.Json.JsonException: The JSON value could not be converted to System.Int32. Path: $.trained_model_stats[0].model_size_stats.required_native_memory_bytes. Invoking the route manually (http://localhost:9200/_ml/trained_models/intfloat__e5-small-v2/_stats) works.
Upon inspecting the model class the response is deserialized into, I found that required_native_memory_bytes is defined as an int rather than ByteSize like the other byte-size fields:

namespace Elastic.Clients.Elasticsearch.MachineLearning;

public sealed partial class TrainedModelSizeStats
{
	/// <summary>
	/// <para>The size of the model in bytes.</para>
	/// </summary>
	[JsonInclude, JsonPropertyName("model_size_bytes")]
	public Elastic.Clients.Elasticsearch.ByteSize ModelSizeBytes { get; init; }

	/// <summary>
	/// <para>The amount of memory required to load the model in bytes.</para>
	/// </summary>
	[JsonInclude, JsonPropertyName("required_native_memory_bytes")]
	public int RequiredNativeMemoryBytes { get; init; }
}

The required_native_memory_bytes value is far larger than an int can hold.

Steps to reproduce:

  1. Deploy an ML model
  2. Use the .NET Elasticsearch client to request the model's stats, e.g. await client.MachineLearning.GetTrainedModelsStatsAsync(new Ids("intfloat__e5-small-v2"), cancellationToken); (a full repro sketch follows this list)
  3. The code throws a deserialization exception
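
For reference, a minimal repro sketch, assuming a .NET 8 console app with top-level statements and a local Elasticsearch node at http://localhost:9200 that already has the intfloat__e5-small-v2 model deployed:

using System;
using System.Threading;
using Elastic.Clients.Elasticsearch;

var settings = new ElasticsearchClientSettings(new Uri("http://localhost:9200"));
var client = new ElasticsearchClient(settings);

// With Elastic.Clients.Elasticsearch 8.14.6 this call throws
// System.Text.Json.JsonException while deserializing
// trained_model_stats[0].model_size_stats.required_native_memory_bytes,
// so execution never gets past the await.
var stats = await client.MachineLearning.GetTrainedModelsStatsAsync(
	new Ids("intfloat__e5-small-v2"), CancellationToken.None);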

Expected behavior
The TrainedModelSizeStats class should type the RequiredNativeMemoryBytes field as Elastic.Clients.Elasticsearch.ByteSize, and the GetTrainedModelsStatsAsync call should succeed.
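
A sketch of the expected shape of the generated class, assuming only the property type changes (the actual fix lands through the spec and code generation):

namespace Elastic.Clients.Elasticsearch.MachineLearning;

public sealed partial class TrainedModelSizeStats
{
	/// <summary>
	/// <para>The size of the model in bytes.</para>
	/// </summary>
	[JsonInclude, JsonPropertyName("model_size_bytes")]
	public Elastic.Clients.Elasticsearch.ByteSize ModelSizeBytes { get; init; }

	/// <summary>
	/// <para>The amount of memory required to load the model in bytes.</para>
	/// </summary>
	[JsonInclude, JsonPropertyName("required_native_memory_bytes")]
	public Elastic.Clients.Elasticsearch.ByteSize RequiredNativeMemoryBytes { get; init; }
}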


svalbuena commented Jul 26, 2024

spec issue: elastic/elasticsearch-specification#2739

flobernd (Member) commented

@svalbuena Thank you for creating the spec PR! The fix should already be included in the latest patch release, which I published on Monday :-)

svalbuena (Contributor, Author) commented

Thanks @flobernd!
I had a look at how the code generation works, and I really like the spec-first approach where both the server API (mostly the controllers and DTO classes) and all clients are generated from the spec. I'm not sure what happens on the server side at Elastic, but for the clients it looked to me as if the spec is generated from the TS client and the rest of the clients are generated from the spec. I'm curious why it's done this way instead of generating the TS client from the spec as well, do you know?


flobernd commented Aug 1, 2024

@svalbuena I think you did not get this completely right. The spec is written by hand (*.ts files) and processed into a schema.json file by the spec compiler. There is another step, the compiler-ts, which produces *.ts output from the schema.json again. These files are not part of any official TS client; they are only used internally to validate the structure against actual ES JSON requests/responses.

The TS bindings for the official JS client are generated from the spec as well 🙂

Regarding the server side: currently the controllers etc. are not generated from the spec, but I agree that such an approach could have a lot of benefits. For a new project I would always choose this path as well, but it's often hard to change workflows for software like Elasticsearch that has existed for such a long time.

svalbuena (Contributor, Author) commented

@flobernd Ah, gotcha, so the .ts files are just used to generate the spec. Why not edit the spec .json directly? Is it to get type-safety/compilation checks that you wouldn't have working against the JSON directly?


flobernd commented Aug 1, 2024

@svalbuena Yes, mainly because writing TS code is far more convenient than writing raw JSON 🙂 The compiled schema.json has more than 200,000 lines right now, and on top of that we organize the code by grouping multiple *.ts files into directories, etc. JSON also doesn't easily allow comments.

We also explored other projects, e.g. Microsoft TypeSpec, as a potential replacement for our custom TS-based spec format, but it currently does not fulfill our requirements.

svalbuena (Contributor, Author) commented

@flobernd I've opened a follow-up PR to this: elastic/elasticsearch-specification#2763. It seems there were two other type errors failing the request; this time I manually edited the .NET client locally and verified there are no other errors left.
