Model AI chat events as a list of request/response messages, with each message containing a list of parts

### Area(s)

area:gen-ai

### What's missing?

- As pointed out in https://github.com/open-telemetry/semantic-conventions/issues/1883, the current events don't match the API request structure when a message contains a combination of text and tool call responses.
- The naming and separation of events is messy and confusing:
  - There's different events for user and system messages, but the distinction is very artificial. The bodies have the same structure, the only difference is the role, which is also present in the body anyway, so a single event name would have worked.
  - https://github.com/open-telemetry/semantic-conventions/issues/1877 shows that these roles change over time and aren't reliable enough to be embedded into the event name which needs to be very stable. One day new developers working with OpenAI will only be familiar with the role being called `developer` instead of `system` and won't understand why the event is called `gen_ai.system.message`.
  - Assistant message events can simultaneously contain text content with any number of tool calls, but for user messages these are split into multiple events. Why the inconsistency? Why not an event per tool call?
  - The event name `gen_ai.tool.message` doesn't make it clear that it means the result of a tool call, rather than the tool call itself. In other words, it's not clear at a glance whether it's sent by the user or assistant.
- There's no clear place for multi-modal content (https://github.com/open-telemetry/semantic-conventions/issues/1556), e.g. a message containing both text and images.
- Tool calls have a `type` field which should apparently always be `function`, so its purpose is not clear.


### Describe the solution you'd like

There needs to be a conceptual hierarchy, where a request consists of a list of messages, and a message consists of a list of 'parts'. Here's one way the events could look:

- One event per message in the request, each with the same event name, e.g. `gen_ai.message`.
- Each message event has `role` and `content` keys in the body, similar to the current events.
- `role` is required and is used to distinguish between user, system, and assistant messages.
- `content` in the body is an array of parts.
- Each part is an object with a `type` field. Some possible values for `type` are `text`, `image`, `tool_call`, and `tool_response`.
  - If needed and possible, this field should account for multiple different types of tool call that the existing `type` field seems to be meant for.
- The separate `tool_calls` array in the bodies of assistant and choice events are removed in favour of `tool_call` parts in the `content` array.
- User and assistant messages (including the response `choice` events) all have the same structure, except that the choice events have additional fields (index, finish_reason) that don't make sense in the request events.

Support for this data model:

- The structure of Google's Gemini/Vertex request messages: https://github.com/googleapis/googleapis/blob/58be301346758c9a342de5632c3f9284d05c4b95/google/cloud/aiplatform/v1/content.proto#L80-L100
- OpenAI uses an array of parts in each message for image and audio inputs: https://platform.openai.com/docs/api-reference/chat/create
- `pydantic_ai` has [`ModelMessage`](https://ai.pydantic.dev/api/messages/#pydantic_ai.messages.ModelMessage) which is either `ModelRequest` or `ModelResponse`, each of which has a `parts` field which is a list of things like `TextPart` or `ToolCallPart`. This may seem like a biased example but the point is that `pydantic_ai` works with many different underlying AI APIs from different companies, and it translates between all those schemas and `ModelMessage` out of necessity.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Model AI chat events as a list of request/response messages, with each message containing a list of parts #1913

Area(s)

What's missing?

Describe the solution you'd like

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Model AI chat events as a list of request/response messages, with each message containing a list of parts #1913

Description

Area(s)

What's missing?

Describe the solution you'd like

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions