Yemohyle/dedup messages telemetry #952

yemohyle · 2025-09-08T18:36:47Z

Refactor Microsoft internal telemetry for agent mode so that it is sent in smaller events, to avoid a high drop rate. This specifically targets replacing the engine.messages events, which attempt to send the entire input of each model call. That input includes every message in the conversation history up to that point and contains many duplicates.

roblourens

It's really sad that we have to do this at this layer but thanks for doing it! I worry that telemetry could still be too lossy to reconstruct the trajectories but I hope it works. I think I mostly follow it but just a few questions.

roblourens · 2025-09-09T00:51:08Z

src/platform/networking/node/chatStream.ts

 	telemetryService.sendInternalMSFTTelemetryEvent('engine.messages', multiplexProperties(telemetryDataWithPrompt.properties), telemetryDataWithPrompt.measurements);

+	// Send all model telemetry events (model.request.added, model.message.added, model.modelCall.input/output, model.request.options.added)
+	// Comment out the line below to disable the new deduplicated model telemetry events


Why the comment about disabling it? Is that something somebody needs to do?

Hopefully not. At the moment, the existing engine.messages event and the model.... events I am adding are substitutes for each other. Once we confirm that the new schema works better, we would disable the engine.messages events. If the new schema (shorter events, but more of them) still does not help with the drop rate, it may be disabled instead.

roblourens · 2025-09-09T00:54:44Z

src/platform/networking/node/chatStream.ts

+ * If it's the same conversationId, increments the turn.
+ * Returns the current conversationTurn for the conversationId.
+ */
+function updateConversationTracker(conversationId: string): number {


Isn't this wrong when you have more than one conversation going at a time?

Are parallel conversations allowed by the UI? If so, it would be wrong. I changed the conversationTracker to allow for parallel conversations.
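A minimal sketch of what a tracker that tolerates parallel conversations could look like. This is not the code from the PR: the `conversationTurns` map and the choice to start counting at turn 1 are assumptions for illustration. Only the function name and signature come from the diff above.

```typescript
// Hypothetical sketch: one turn counter per conversationId, so interleaved
// conversations neither reset nor advance each other's turn numbers.
const conversationTurns = new Map<string, number>();

function updateConversationTracker(conversationId: string): number {
	// The first call for an id yields turn 1 (an assumption); later calls
	// increment the counter for that id only.
	const nextTurn = (conversationTurns.get(conversationId) ?? 0) + 1;
	conversationTurns.set(conversationId, nextTurn);
	return nextTurn;
}

// Two conversations interleaved: each keeps its own turn count.
const turnA1 = updateConversationTracker('conversation-a');
const turnB1 = updateConversationTracker('conversation-b');
const turnA2 = updateConversationTracker('conversation-a');
```

A plain Map grows without bound. The commit history mentions converting maps to an LRU cache to avoid memory accumulation, so a real implementation would presumably cap the number of tracked conversations.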

roblourens · 2025-09-09T00:59:56Z

src/platform/networking/node/chatStream.ts

+		return;
+	}
+
+	// Check if this is a conversation mode (has conversationId) or supplementary mode


When do we not have a conversationId?

I do not see conversationId available when semantic_search has its own model call, or edit healing ... I called it supplementary mode.

roblourens · 2025-09-09T01:01:41Z

src/platform/networking/node/chatStream.ts

+		}).toString();
+
+		// Get existing UUID for this message content + headerRequestId combination, or generate a new one
+		let messageUuid = messageHashToUuid.get(messageHash);


Can we send the hashes instead of having to map them to UUIDs? That way we could even track conversations that continue after reloading the window, etc.

But I have no idea if that would be allowed.

I am not sure I follow here. If the question is why we don't use the hash itself instead of creating messageUuids: I was not sure how large those hashes could be, so I used a standard UUID.
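To make the trade-off in this thread concrete, here is a rough sketch of a hash-to-id mapping with least-recently-used eviction. Everything in it is an assumption for illustration: the `MessageIdCache` class, the FNV-1a hash (a stand-in for whatever hash function the PR uses), and the counter-based ids (a stand-in for real UUIDs).

```typescript
// Hypothetical sketch, not the PR's code.

// FNV-1a 32-bit hash, rendered as a fixed-width 8-character hex string.
function fnv1a(text: string): string {
	let h = 0x811c9dc5;
	for (let i = 0; i < text.length; i++) {
		h ^= text.charCodeAt(i);
		h = Math.imul(h, 0x01000193) >>> 0;
	}
	return ('0000000' + h.toString(16)).slice(-8);
}

class MessageIdCache {
	private readonly ids = new Map<string, string>();
	private readonly capacity: number;
	private counter = 0;

	constructor(capacity: number) {
		this.capacity = capacity;
	}

	// Returns the id for this (headerRequestId, content) pair and whether it is new.
	getOrCreate(headerRequestId: string, content: string): { id: string; isNew: boolean } {
		const key = fnv1a(headerRequestId + '\u0000' + content);
		const existing = this.ids.get(key);
		if (existing !== undefined) {
			// Re-insert so the entry counts as most recently used.
			this.ids.delete(key);
			this.ids.set(key, existing);
			return { id: existing, isNew: false };
		}
		// Stand-in for a real UUID generator.
		const id = `msg-${++this.counter}`;
		this.ids.set(key, id);
		if (this.ids.size > this.capacity) {
			// A Map iterates in insertion order, so the first key is the least recently used.
			const oldest = this.ids.keys().next().value;
			if (oldest !== undefined) {
				this.ids.delete(oldest);
			}
		}
		return { id, isNew: true };
	}
}
```

Ids generated this way exist only in memory, so they cannot link a conversation across a window reload. A content hash is reproducible, which is the property the reviewer's question is after. A fixed-width digest like the one above also has a known, bounded length.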

roblourens · 2025-09-09T01:08:39Z

src/platform/networking/node/chatStream.ts

+}
+
+function sendModelCallTelemetry(telemetryService: ITelemetryService, messageData: Array<{ uuid: string; headerRequestId: string }>, telemetryData: TelemetryData, messageDirection: 'input' | 'output', logService?: ILogService) {
+	// Get the unique model call ID


I don't think I understand what this function is doing, what's a "model call"?

For every model call, there’s an input array of messages and an output array containing a single message.
This function doesn’t send the full messages directly. Instead, it sends events with an array of messageUuids (unique message identifiers).
For example, if the agent takes n turns, there are n model calls. A system message will appear at the start of each input array across all n turns. Rather than sending the entire system message n times, we send it once in a model.message.added event. Then, in each of the model.modelCall events, we only include the corresponding messageUuid and its position in the array.
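The scheme described above can be sketched roughly as follows. The event names mirror those mentioned in the PR, but the payload fields, the `buildDedupedEvents` helper, and the counter-style ids are assumptions for illustration, not the actual implementation.

```typescript
// Hypothetical event shapes; only the event names come from the PR.
type TelemetryEvent =
	| { name: 'model.message.added'; messageUuid: string; content: string }
	| { name: 'model.modelCall.input'; modelCallId: number; messageUuids: string[] };

function buildDedupedEvents(modelCalls: string[][]): TelemetryEvent[] {
	const uuidByContent = new Map<string, string>();
	const events: TelemetryEvent[] = [];
	modelCalls.forEach((inputMessages, callIndex) => {
		const messageUuids = inputMessages.map(content => {
			let uuid = uuidByContent.get(content);
			if (uuid === undefined) {
				// First time this content is seen: send the full message once.
				uuid = `m${uuidByContent.size + 1}`; // stand-in for a real UUID
				uuidByContent.set(content, uuid);
				events.push({ name: 'model.message.added', messageUuid: uuid, content });
			}
			return uuid;
		});
		// The model call itself carries only ids; array order encodes position.
		events.push({ name: 'model.modelCall.input', modelCallId: callIndex + 1, messageUuids });
	});
	return events;
}

// Three agent turns: each input repeats the system message and prior history.
const calls = [
	['system', 'user: fix the bug'],
	['system', 'user: fix the bug', 'assistant: reading file'],
	['system', 'user: fix the bug', 'assistant: reading file', 'tool: file contents'],
];
const events = buildDedupedEvents(calls);
```

In this example, the three inputs contain nine messages in total, but only four are distinct, so four model.message.added events carry content and the three model.modelCall.input events carry ids only.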

Copilot

Pull Request Overview

This PR refactors internal Microsoft telemetry for agent mode to reduce event size and eliminate duplicates by sending telemetry in smaller, deduplicated events instead of the large engine.messages events that contain entire conversation histories.

Key Changes

Introduces new granular telemetry events (model.request.added, model.message.added, model.modelCall.input/output, model.request.options.added) that use deduplication via LRU caches and UUID mapping
Comments out the internal MSFT engine.messages event to reduce telemetry volume while keeping the enhanced GitHub telemetry
Updates the existing engine.messages.length telemetry to use a new TelemetryData.createAndMarkAsIssued pattern instead of extendedBy

Reviewed Changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.

File	Description
src/platform/networking/node/chatStream.ts	Adds comprehensive deduplication logic for model telemetry events with LRU caches and UUID tracking, plus modifications to existing length telemetry
src/extension/conversation/vscode-node/test/interactiveSessionProvider.telemetry.test.ts	Updates integration test to verify the new model telemetry events are sent correctly

Comments suppressed due to low confidence (1)

src/platform/networking/node/chatStream.ts:1

This TODO comment indicates that telemetry functions are misplaced in a chat stream file. Consider moving these telemetry functions to a dedicated telemetry module to improve code organization and separation of concerns.

/*---------------------------------------------------------------------------------------------

Copilot · 2025-09-09T17:39:31Z

src/platform/networking/node/chatStream.ts

+	//telemetryService.sendInternalMSFTTelemetryEvent('engine.messages', multiplexProperties(telemetryDataWithPrompt.properties), telemetryDataWithPrompt.measurements);
+


Replace the commented-out code with a clear explanation of why this telemetry event was disabled, or remove it entirely if it's permanently disabled. Commented-out code can cause confusion and should be avoided in production.

Suggested change

//telemetryService.sendInternalMSFTTelemetryEvent('engine.messages', multiplexProperties(telemetryDataWithPrompt.properties), telemetryDataWithPrompt.measurements);

Copilot · 2025-09-09T17:39:32Z

src/platform/networking/node/chatStream.ts

+	//telemetryService.sendInternalMSFTTelemetryEvent('engine.messages', multiplexProperties(telemetryDataWithPrompt.properties), telemetryDataWithPrompt.measurements);
+
+	// Send all model telemetry events (model.request.added, model.message.added, model.modelCall.input/output, model.request.options.added)
+	// Comment out the line below to disable the new deduplicated model telemetry events


This comment appears to be development instructions rather than production documentation. Consider either removing this comment or replacing it with proper documentation explaining the purpose of the telemetry events.

Suggested change

// Comment out the line below to disable the new deduplicated model telemetry events

// The following line sends deduplicated model telemetry events to track user requests, message additions, model calls, and request options.

Yevhen Mohylevskyy and others added 30 commits August 25, 2025 12:49

add message.added event

2f1286a

fix message.added event logging

c2dc597

fix message.added event logging

b79fd19

fix message.added event logging

cedc9ae

fix message.added event logging

cbf2484

fix message.added event logging

69090c3

fix message.added event logging

a3db43a

fix message.added event logging

c8870e8

fix message.added event logging

b46256a

fix message.added event logging

bb7eb63

chunk modelCall messages list

8e7197e

Revert "chunk modelCall messages list"

91c624c

This reverts commit 8e7197e.

skip retry calls

3ea782a

add trajectory id

0991682

add trajectory id

8ff9ff5

Merge branch 'microsoft:main' into yemohyle/dedup_messages_telemetry

0eb0697

another try on trajectoryId

8c68089

Revert "another try on trajectoryId"

8c90ced

This reverts commit 8c68089.

Revert "add trajectory id"

f6bec37

This reverts commit 8ff9ff5.

Revert "add trajectory id"

5417dcc

This reverts commit 0991682.

chunk messageUuids list into 8000 chars

9b257d9

adding request.options events

dddb7b5

chunk request options

27d32d3

save only unique request options

5a77a82

save only unique request options

9279c41

convert maps to LRUcacheto avoid memory accumulation

3da1c47

Merge branch 'microsoft:main' into yemohyle/dedup_messages_telemetry

40b9275

change json.stringify() to vscode hash() for hashing

78839d3

add turn Data to modelCalls event

d1058d8

add engine.request.added with model request non specific info

2d2dd42

Yevhen Mohylevskyy and others added 9 commits September 5, 2025 17:11

add headerRequestId to model.message.added hash

b69737e

rename conversatinTracker to mainHeaderRequestIdTracker

74d78ca

add headerRequestIdTracker

9711ed0

fix headerRequestIdTracker

52c8831

add conversationTrucker and remove redundant preocessedHeaderRequest

4c24a6d

IdTrucker

remove redundant isRetry check from sendNewRequestAddedTelemetry

ea803dd

Merge branch 'microsoft:main' into yemohyle/dedup_messages_telemetry

96ce190

remove logservice messages

7f42602

Merge branch 'microsoft:main' into yemohyle/dedup_messages_telemetry

9a6750b

vs-code-engineering bot assigned TylerLeonhardt Sep 8, 2025

vs-code-engineering bot added the triage-needed label Sep 8, 2025

Merge branch 'microsoft:main' into yemohyle/dedup_messages_telemetry

f0839aa

TylerLeonhardt assigned lramos15 and unassigned TylerLeonhardt Sep 8, 2025

lramos15 requested a review from roblourens September 8, 2025 20:34

Merge branch 'microsoft:main' into yemohyle/dedup_messages_telemetry

3331b61

roblourens reviewed Sep 9, 2025

change conversationTracker to allow for parallel conversations

88277c7

roblourens previously approved these changes Sep 9, 2025

vs-code-engineering bot added this to the September 2025 milestone Sep 9, 2025

Disable internal MSFT telemetry event as it is the same but duplicate…

384bb31

…d version of new model... events Comment out the internal MSFT telemetry event sending.

Copilot AI review requested due to automatic review settings September 9, 2025 17:38

yemohyle dismissed roblourens’s stale review via 384bb31 September 9, 2025 17:38

Copilot AI reviewed Sep 9, 2025

Comment out legacy telemetry event for testing

008d0f7

Comment out the old telemetry event sending method for testing.

roblourens approved these changes Sep 9, 2025

chrmarti approved these changes Sep 9, 2025

roblourens added this pull request to the merge queue Sep 9, 2025

Merged via the queue into microsoft:main with commit 37b4318 Sep 9, 2025
6 checks passed
