
doc(skills): add LLMObs integration and testing skills#7655

Merged
crysmags merged 9 commits into master from feat/add-llmobs-skills
Mar 19, 2026

Conversation

@crysmags
Collaborator

@crysmags crysmags commented Mar 3, 2026

Summary

Adds two new agent skills for LLM Observability (LLMObs) instrumentation to support agents creating LLMObs integrations.

Changes

New Skills Added

1. llmobs-integration/

  • Package category system with 4 categories:
    • LLM_CLIENT: Direct LLM API clients (OpenAI, Anthropic, etc.)
    • MULTI_PROVIDER: Abstraction layers over multiple providers
    • ORCHESTRATION: Workflow/chain orchestration libraries
    • INFRASTRUCTURE: Vector databases, retrievers, embeddings
  • Category detection decision tree
  • Message extraction patterns for different providers (OpenAI, Anthropic, Google GenAI)
  • LLMObsSpanKind enum (llm, workflow, agent, tool, embedding, retrieval)
  • Plugin architecture patterns and helper methods
  • Reference implementation examples
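As a rough illustration of the category system listed above, a lookup-based classifier might look like the following. This is only a sketch: `classifyPackage` and `KNOWN_PACKAGES` are hypothetical names that do not exist in dd-trace-js, the skill itself describes a decision tree rather than a static table, and the placement of `langchain` under MULTI_PROVIDER is an assumption based on the category description.

```javascript
'use strict'

// The four categories named in the skill description.
const PackageCategory = Object.freeze({
  LLM_CLIENT: 'LLM_CLIENT',
  MULTI_PROVIDER: 'MULTI_PROVIDER',
  ORCHESTRATION: 'ORCHESTRATION',
  INFRASTRUCTURE: 'INFRASTRUCTURE',
})

// Hypothetical lookup table, for illustration only.
const KNOWN_PACKAGES = new Map([
  ['openai', PackageCategory.LLM_CLIENT],
  ['@anthropic-ai/sdk', PackageCategory.LLM_CLIENT],
  ['langchain', PackageCategory.MULTI_PROVIDER], // assumed placement
  ['@langchain/langgraph', PackageCategory.ORCHESTRATION],
])

// Returns the category for a known package name, or null when the
// package is not in the table and needs manual classification.
function classifyPackage (name) {
  return KNOWN_PACKAGES.get(name) ?? null
}
```

Returning `null` for unknown packages keeps the sketch honest: anything outside the table still has to go through the decision tree.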

2. llmobs-testing/

  • Category-specific test strategies:
    • LLM_CLIENT/MULTI_PROVIDER: VCR cassettes with real API calls
    • ORCHESTRATION: Pure functions, no VCR, actual LLM as node
    • INFRASTRUCTURE: Mock servers
  • VCR cassette recording system and location patterns
  • Assertion helpers (assertLlmObsSpanEvent, MOCK_* matchers)
  • Test structure patterns per category
  • Error handling validation strategies
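The category-to-strategy mapping above can be sketched as a small table. The `TEST_STRATEGY` object, the strategy labels, and `pickTestStrategy` are hypothetical names chosen for this example; they are not helpers from the repository.

```javascript
'use strict'

// Strategy per category, as summarized in the list above.
const TEST_STRATEGY = Object.freeze({
  LLM_CLIENT: 'vcr',
  MULTI_PROVIDER: 'vcr',
  ORCHESTRATION: 'pure-functions',
  INFRASTRUCTURE: 'mock-server',
})

// Look up the strategy for a category and fail loudly on anything unknown,
// so a misclassified package is noticed before tests are written.
function pickTestStrategy (category) {
  if (!Object.prototype.hasOwnProperty.call(TEST_STRATEGY, category)) {
    throw new Error(`Unknown package category: ${category}`)
  }
  return TEST_STRATEGY[category]
}
```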

Files Added

.agents/skills/llmobs-integration/
├── SKILL.md (184 lines)
└── references/
    ├── category-detection.md
    ├── message-extraction.md
    ├── plugin-architecture.md
    └── reference-implementations.md

.agents/skills/llmobs-testing/
├── SKILL.md (259 lines)
└── references/
    ├── assertion-helpers.md
    ├── category-strategies.md
    ├── test-structure.md
    └── vcr-cassettes.md

Why This Change?

Without these skills, agents have no guidance on:

  • How to classify LLM packages into correct categories
  • How to extract messages from different provider formats
  • Which test strategy to use (VCR vs pure functions vs mocks)
  • How to validate LLMObs span events and tags

These skills ensure agents create consistent, properly-tested LLMObs integrations with correct category classification and appropriate testing patterns.
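To make the message-extraction point concrete, here is a minimal sketch of normalizing two provider request shapes into a common `{ role, content }` list. The function names are hypothetical and are not the helpers used by the real integrations; the sketch only assumes the publicly documented request shapes, where OpenAI chat requests carry a `messages` array and Anthropic requests carry a top-level `system` field plus `messages` whose content may be a string or an array of typed blocks.

```javascript
'use strict'

// Flatten string-or-parts content into plain text, keeping only text parts.
function textOf (content) {
  if (typeof content === 'string') return content
  if (!Array.isArray(content)) return ''
  return content
    .filter(part => part && part.type === 'text')
    .map(part => part.text)
    .join('')
}

// OpenAI-style chat request: every message, including system, is in `messages`.
function extractOpenAIMessages (request) {
  return (request.messages ?? []).map(m => ({ role: m.role, content: textOf(m.content) }))
}

// Anthropic-style request: the system prompt is a separate top-level field.
function extractAnthropicMessages (request) {
  const messages = []
  if (request.system) {
    messages.push({ role: 'system', content: textOf(request.system) })
  }
  for (const m of request.messages ?? []) {
    messages.push({ role: m.role, content: textOf(m.content) })
  }
  return messages
}
```

The sketch drops non-text parts (images, tool calls) entirely; a real integration would need to decide how to represent them.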

Testing

Skills are referenced by LLMObs workflow agents during integration creation. Content verified against existing LLMObs integrations (OpenAI, Anthropic, LangChain, etc.).

🤖 Generated with Claude Code

@crysmags crysmags requested review from a team as code owners March 3, 2026 20:39
@crysmags crysmags requested review from khanayan123 and removed request for a team March 3, 2026 20:39
@github-actions
Contributor

github-actions bot commented Mar 3, 2026

Overall package size

Self size: 5.04 MB
Deduped: 5.88 MB
No deduping: 5.88 MB

Dependency sizes

| name | version | self size | total size |
|------|---------|-----------|------------|
| import-in-the-middle | 3.0.0 | 81.15 kB | 815.98 kB |
| dc-polyfill | 0.1.10 | 26.73 kB | 26.73 kB |

🤖 This report was automatically generated by heaviest-objects-in-the-universe

@crysmags crysmags force-pushed the feat/add-llmobs-skills branch from a009de0 to e5bb825 Compare March 3, 2026 20:51
@pr-commenter

pr-commenter bot commented Mar 3, 2026

Benchmarks

Benchmark execution time: 2026-03-18 20:33:32

Comparing candidate commit 88b214e in PR branch feat/add-llmobs-skills with baseline commit 8b014c4 in branch master.

Found 0 performance improvements and 0 performance regressions! Performance is the same for 229 metrics, 31 unstable metrics.

@codecov

codecov bot commented Mar 3, 2026

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 80.45%. Comparing base (8b014c4) to head (88b214e).
⚠️ Report is 107 commits behind head on master.

Additional details and impacted files
@@            Coverage Diff             @@
##           master    #7655      +/-   ##
==========================================
+ Coverage   80.31%   80.45%   +0.14%     
==========================================
  Files         739      748       +9     
  Lines       31946    32405     +459     
==========================================
+ Hits        25657    26072     +415     
- Misses       6289     6333      +44     
Flag Coverage Δ
aiguard-macos 39.25% <ø> (+0.25%) ⬆️
aiguard-ubuntu 39.36% <ø> (+0.25%) ⬆️
aiguard-windows 39.06% <ø> (+0.21%) ⬆️
apm-capabilities-tracing-macos 48.92% <ø> (+0.16%) ⬆️
apm-capabilities-tracing-ubuntu 48.96% <ø> (+0.16%) ⬆️
apm-capabilities-tracing-windows 48.70% <ø> (+0.16%) ⬆️
apm-integrations-child-process 38.55% <ø> (-0.03%) ⬇️
apm-integrations-couchbase-18 37.46% <ø> (-0.05%) ⬇️
apm-integrations-couchbase-eol 37.90% <ø> (-0.08%) ⬇️
apm-integrations-oracledb 37.87% <ø> (+0.06%) ⬆️
appsec-express 55.27% <ø> (-0.12%) ⬇️
appsec-fastify 51.61% <ø> (-0.09%) ⬇️
appsec-graphql 51.79% <ø> (-0.10%) ⬇️
appsec-kafka 44.39% <ø> (+<0.01%) ⬆️
appsec-ldapjs 44.01% <ø> (-0.07%) ⬇️
appsec-lodash 43.64% <ø> (-0.10%) ⬇️
appsec-macos 58.22% <ø> (-0.15%) ⬇️
appsec-mongodb-core 48.68% <ø> (-0.21%) ⬇️
appsec-mongoose 49.45% <ø> (-0.11%) ⬇️
appsec-mysql 50.89% <ø> (-0.03%) ⬇️
appsec-node-serialize 43.19% <ø> (-0.06%) ⬇️
appsec-passport 47.65% <ø> (-0.05%) ⬇️
appsec-postgres 50.61% <ø> (-0.04%) ⬇️
appsec-sourcing 42.62% <ø> (-0.04%) ⬇️
appsec-template 43.36% <ø> (-0.06%) ⬇️
appsec-ubuntu 58.30% <ø> (-0.15%) ⬇️
appsec-windows 58.06% <ø> (-0.15%) ⬇️
instrumentations-instrumentation-bluebird 32.34% <ø> (-0.06%) ⬇️
instrumentations-instrumentation-body-parser 40.53% <ø> (-0.02%) ⬇️
instrumentations-instrumentation-child_process 37.84% <ø> (-0.07%) ⬇️
instrumentations-instrumentation-cookie-parser 34.30% <ø> (-0.08%) ⬇️
instrumentations-instrumentation-express 34.62% <ø> (-0.09%) ⬇️
instrumentations-instrumentation-express-mongo-sanitize 34.43% <ø> (-0.08%) ⬇️
instrumentations-instrumentation-express-session 40.16% <ø> (-0.02%) ⬇️
instrumentations-instrumentation-fs 31.96% <ø> (-0.05%) ⬇️
instrumentations-instrumentation-generic-pool 29.52% <ø> (-0.19%) ⬇️
instrumentations-instrumentation-http 39.81% <ø> (-0.10%) ⬇️
instrumentations-instrumentation-knex 32.34% <ø> (-0.06%) ⬇️
instrumentations-instrumentation-mongoose 33.46% <ø> (-0.07%) ⬇️
instrumentations-instrumentation-multer 40.28% <ø> (-0.02%) ⬇️
instrumentations-instrumentation-mysql2 38.29% <ø> (-0.08%) ⬇️
instrumentations-instrumentation-passport 44.06% <ø> (-0.01%) ⬇️
instrumentations-instrumentation-passport-http 43.74% <ø> (+<0.01%) ⬆️
instrumentations-instrumentation-passport-local 44.27% <ø> (-0.01%) ⬇️
instrumentations-instrumentation-pg 37.73% <ø> (-0.07%) ⬇️
instrumentations-instrumentation-promise 32.27% <ø> (-0.06%) ⬇️
instrumentations-instrumentation-promise-js 32.28% <ø> (-0.06%) ⬇️
instrumentations-instrumentation-q 32.32% <ø> (-0.06%) ⬇️
instrumentations-instrumentation-url 32.24% <ø> (-0.05%) ⬇️
instrumentations-instrumentation-when 32.29% <ø> (-0.06%) ⬇️
llmobs-ai 42.26% <ø> (+0.07%) ⬆️
llmobs-anthropic 40.30% <ø> (-0.06%) ⬇️
llmobs-bedrock 39.25% <ø> (-0.06%) ⬇️
llmobs-google-genai 39.79% <ø> (-0.10%) ⬇️
llmobs-langchain 40.16% <ø> (+0.22%) ⬆️
llmobs-openai 43.98% <ø> (-0.21%) ⬇️
llmobs-vertex-ai 40.05% <ø> (-0.03%) ⬇️
platform-core 31.47% <ø> (-0.07%) ⬇️
platform-esbuild 34.42% <ø> (-0.07%) ⬇️
platform-instrumentations-misc 48.41% <ø> (+2.87%) ⬆️
platform-shimmer 37.56% <ø> (-0.07%) ⬇️
platform-unit-guardrails 32.89% <ø> (-0.07%) ⬇️
plugins-azure-durable-functions 25.74% <ø> (?)
plugins-azure-event-hubs 25.90% <ø> (+0.07%) ⬆️
plugins-azure-service-bus 25.26% <ø> (+0.07%) ⬆️
plugins-bullmq 44.19% <ø> (+0.20%) ⬆️
plugins-cassandra 37.78% <ø> (-0.08%) ⬇️
plugins-cookie 26.96% <ø> (+0.07%) ⬆️
plugins-cookie-parser 26.75% <ø> (+0.07%) ⬆️
plugins-crypto 26.73% <ø> (-0.06%) ⬇️
plugins-dd-trace-api 38.31% <ø> (-0.13%) ⬇️
plugins-express-mongo-sanitize 26.89% <ø> (+0.07%) ⬆️
plugins-express-session 26.70% <ø> (+0.07%) ⬆️
plugins-fastify 42.25% <ø> (-0.04%) ⬇️
plugins-fetch 38.36% <ø> (-0.03%) ⬇️
plugins-fs 38.65% <ø> (-0.03%) ⬇️
plugins-generic-pool 25.94% <ø> (+0.07%) ⬆️
plugins-google-cloud-pubsub 45.47% <ø> (+0.05%) ⬆️
plugins-grpc 40.91% <ø> (-0.08%) ⬇️
plugins-handlebars 26.94% <ø> (+0.07%) ⬆️
plugins-hapi 40.16% <ø> (-0.03%) ⬇️
plugins-hono 40.42% <ø> (-0.03%) ⬇️
plugins-ioredis 38.46% <ø> (-0.03%) ⬇️
plugins-knex 26.57% <ø> (+0.07%) ⬆️
plugins-langgraph 38.47% <ø> (?)
plugins-ldapjs 24.43% <ø> (+0.07%) ⬆️
plugins-light-my-request 26.30% <ø> (+0.07%) ⬆️
plugins-limitd-client 32.61% <ø> (-0.07%) ⬇️
plugins-lodash 26.03% <ø> (+0.07%) ⬆️
plugins-mariadb 39.50% <ø> (-0.05%) ⬇️
plugins-memcached 38.16% <ø> (-0.08%) ⬇️
plugins-microgateway-core 39.23% <ø> (+<0.01%) ⬆️
plugins-moleculer 40.55% <ø> (-0.02%) ⬇️
plugins-mongodb 39.17% <ø> (-0.09%) ⬇️
plugins-mongodb-core 39.01% <ø> (-0.09%) ⬇️
plugins-mongoose 38.88% <ø> (-0.03%) ⬇️
plugins-multer 26.70% <ø> (+0.07%) ⬆️
plugins-mysql 39.20% <ø> (-0.04%) ⬇️
plugins-mysql2 39.29% <ø> (-0.04%) ⬇️
plugins-node-serialize 27.00% <ø> (+0.07%) ⬆️
plugins-opensearch 37.61% <ø> (-0.08%) ⬇️
plugins-passport-http 26.76% <ø> (+0.07%) ⬆️
plugins-postgres 35.58% <ø> (-0.20%) ⬇️
plugins-process 26.73% <ø> (-0.06%) ⬇️
plugins-pug 26.96% <ø> (+0.07%) ⬆️
plugins-redis 38.92% <ø> (-0.03%) ⬇️
plugins-router 43.11% <ø> (+0.09%) ⬆️
plugins-sequelize 25.55% <ø> (+0.07%) ⬆️
plugins-test-and-upstream-amqp10 38.51% <ø> (-0.05%) ⬇️
plugins-test-and-upstream-amqplib 43.90% <ø> (+0.02%) ⬆️
plugins-test-and-upstream-apollo 39.14% <ø> (+0.05%) ⬆️
plugins-test-and-upstream-avsc 38.61% <ø> (-0.16%) ⬇️
plugins-test-and-upstream-bunyan 33.87% <ø> (-0.09%) ⬇️
plugins-test-and-upstream-connect 40.82% <ø> (-0.03%) ⬇️
plugins-test-and-upstream-graphql 40.10% <ø> (-0.10%) ⬇️
plugins-test-and-upstream-koa 40.41% <ø> (-0.02%) ⬇️
plugins-test-and-upstream-protobufjs 38.83% <ø> (-0.16%) ⬇️
plugins-test-and-upstream-rhea 44.08% <ø> (+<0.01%) ⬆️
plugins-undici 39.13% <ø> (-0.04%) ⬇️
plugins-url 26.73% <ø> (-0.06%) ⬇️
plugins-valkey 38.27% <ø> (+0.12%) ⬆️
plugins-vm 26.73% <ø> (-0.06%) ⬇️
plugins-winston 34.06% <ø> (-0.09%) ⬇️
plugins-ws 41.90% <ø> (-0.03%) ⬇️
profiling-macos 40.56% <ø> (+0.66%) ⬆️
profiling-ubuntu 40.68% <ø> (+0.66%) ⬆️
profiling-windows 42.23% <ø> (+1.00%) ⬆️
serverless-azure-functions-client 25.62% <ø> (+0.07%) ⬆️
serverless-azure-functions-eventhubs 25.62% <ø> (+0.07%) ⬆️
serverless-azure-functions-servicebus 25.62% <ø> (+0.07%) ⬆️



Comment on lines +14 to +18
- `openai` - OpenAI API client
- `@google/generative-ai` - Google GenAI client
- `@anthropic-ai/sdk` - Anthropic Claude client
- `@mistralai/mistralai` - Mistral AI client
- `cohere-ai` - Cohere API client
Contributor

verify with @sabrenner about best reference integrations to point at.

Collaborator

i think the google genai and anthropic ones are best, openai is a bit too complicated probably. we also don't have mistral/cohere integrations

crysmags and others added 5 commits March 5, 2026 14:53
Adds two new agent skills for LLM Observability (LLMObs) instrumentation:

1. **llmobs-integration** - Creating LLMObs plugins
   - Package category system (LLM_CLIENT, MULTI_PROVIDER, ORCHESTRATION, INFRASTRUCTURE)
   - Category detection decision tree
   - Message extraction patterns (OpenAI, Anthropic, Google GenAI formats)
   - LLMObsSpanKind enum (llm, workflow, agent, tool, embedding, retrieval)
   - Plugin architecture patterns (getLLMObsSpanRegisterOptions, setLLMObsTags)
   - Reference implementations

2. **llmobs-testing** - Testing LLMObs plugins
   - Category-specific test strategies
   - VCR cassette system for API call recording
   - Assertion helpers (assertLlmObsSpanEvent, MOCK_* matchers)
   - Test structure patterns
   - Error handling validation

These skills provide comprehensive guidance for agents creating LLMObs
integrations, ensuring proper category classification, message handling,
and testing strategies.

Files added:
- .agents/skills/llmobs-integration/SKILL.md
- .agents/skills/llmobs-integration/references/category-detection.md
- .agents/skills/llmobs-integration/references/message-extraction.md
- .agents/skills/llmobs-integration/references/plugin-architecture.md
- .agents/skills/llmobs-integration/references/reference-implementations.md
- .agents/skills/llmobs-testing/SKILL.md
- .agents/skills/llmobs-testing/references/assertion-helpers.md
- .agents/skills/llmobs-testing/references/category-strategies.md
- .agents/skills/llmobs-testing/references/test-structure.md
- .agents/skills/llmobs-testing/references/vcr-cassettes.md

Co-Authored-By: Claude Sonnet 4.5 <[email protected]>
Claude Code discovers skills via .claude/skills/ directory.
Add symlinks to make llmobs-integration and llmobs-testing
auto-discoverable, following the pattern from apm-integrations.

Co-Authored-By: Claude Sonnet 4.5 <[email protected]>
The skills should be accessed from .agents/skills/ directory, not .claude/skills/.
Toolkit now uses symlinks in anubis_apm/agent/skills/dd_trace_js/ to access these.

Co-Authored-By: Claude Sonnet 4.5 <[email protected]>
These symlinks created circular references when skill loader tried to copy
.agents/skills/* to .claude/skills/*. Skills are accessed via toolkit's
anubis_apm/agent/skills/dd_trace_js/ symlinks instead.

Co-Authored-By: Claude Sonnet 4.5 <[email protected]>
Skills should only be accessed via toolkit symlinks in
anubis_apm/agent/skills/dd_trace_js/, not from .claude/skills/.
This directory should remain empty to avoid confusion.

Co-Authored-By: Claude Sonnet 4.5 <[email protected]>
@crysmags crysmags force-pushed the feat/add-llmobs-skills branch from 9e86463 to 1813645 Compare March 6, 2026 16:38
@BridgeAR BridgeAR changed the title feat(skills): Add LLMObs integration and testing skills doc(skills): add LLMObs integration and testing skills Mar 6, 2026
- Strengthen descriptions with keyword-rich frontmatter for both skills
- Expand Purpose section to cover streaming, embeddings, agent runs, orchestration
- Convert all bare references to proper MD links
- Remove mistral/cohere examples (no integrations exist); promote genai/anthropic
- Remove toolkit-specific Multi-Signal Heuristics scoring section
- Remove toolkit-specific Output Format JSON section
- Remove anubis_apm module path references

Co-Authored-By: Claude Sonnet 4.6 <[email protected]>
…ons symlink

- plugin-architecture.md: replace code blocks with lightweight prose descriptions,
  pointing to real implementation files for reference
- message-extraction.md: replace provider-specific code blocks with explanation
  of what varies per provider and pointers to reference implementations
- assertion-helpers.md: replace synthetic "Complete Test Example" with links
  to real test files (anthropic, google-genai)
- Restore deleted .claude/skills/apm-integrations symlink

Co-Authored-By: Claude Sonnet 4.6 <[email protected]>
@wconti27 wconti27 self-requested a review March 11, 2026 17:01
wconti27 previously approved these changes Mar 11, 2026
@gh-worker-ownership-write-b05516 gh-worker-ownership-write-b05516 bot removed the request for review from a team March 11, 2026 17:01
…esting

Documents that instrumented sub-packages (e.g. @openai/agents-openai as
a dep of @openai/agents-core) must be required before the parent package
so RITM can patch them before they are cached by the transitive import.

Co-Authored-By: Claude Sonnet 4.6 <[email protected]>
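The require-order rule from this commit message can be sketched as a small test-setup helper. `requireSubPackagesFirst` is a hypothetical name, not an existing utility in the repository; the require function is injected so the helper can be exercised without the real packages installed.

```javascript
'use strict'

// Require instrumented sub-packages first, then the parent that depends on
// them, so the sub-packages are loaded explicitly rather than only through
// the parent's transitive import.
function requireSubPackagesFirst (requireFn, subPackages, parentPackage) {
  for (const name of subPackages) {
    requireFn(name)
  }
  return requireFn(parentPackage)
}
```

In a test file this might be called as `requireSubPackagesFirst(require, ['@openai/agents-openai'], '@openai/agents-core')`, using the package names from the commit message.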
Collaborator

@sabrenner sabrenner left a comment

cool! left some comments on some inconsistencies/incorrect things, but looks good overall

**Examples:**
- `@ai-sdk/vercel` - Vercel AI SDK
- `langchain` - LangChain framework
- `llamaindex` - LlamaIndex framework
Collaborator

we don't have llamaindex support here yet so we can probs remove this so it's only referencing integrations we already support


**Examples:**
- `@langchain/langgraph` - LangGraph workflow engine
- `crewai` - CrewAI multi-agent framework
Collaborator

again, maybe we can also exclude this crewai ref as well since we don't have it instrumented yet


**Observable signs:**
- Package name suggests orchestration (langgraph, crew, workflow, graph)
- Has graph/workflow/chain execution methods (`invoke`, `stream`, `run`)
Collaborator

nit: i guess this could be true for non-orchestration libraries, we might be able to remove this point

**Definition:** Communication protocols, server frameworks, infrastructure layers.

**Examples:**
- MCP (Model Context Protocol) clients/servers
Collaborator

i think this one we're good to leave as we don't have any examples to point to 👍


## Required Methods

### getLLMObsSpanRegisterOptions(ctx)
Collaborator

we should specify that this method can return null to not record an LLMObs span for a given ctx
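A minimal sketch of the null-return behavior described in this comment, under stated assumptions: `ExampleLLMObsPlugin` is a standalone stand-in rather than a subclass of the real LLMObs base plugin, and the `ctx.methodName` and `ctx.model` fields, the returned option names, and the "chat only" rule are all hypothetical.

```javascript
'use strict'

// Standalone stand-in for an LLMObs plugin. A real plugin would extend the
// dd-trace-js LLMObs base plugin, which is not imported here.
class ExampleLLMObsPlugin {
  getLLMObsSpanRegisterOptions (ctx) {
    // Hypothetical rule: only chat calls produce an LLMObs span.
    // Returning null means no LLMObs span is recorded for this ctx.
    if (ctx.methodName !== 'chat') return null

    return {
      kind: 'llm', // one of the span kinds listed in the PR description
      name: 'example.chat',
      modelName: ctx.model,
      modelProvider: 'example',
    }
  }
}
```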

- `asyncEnd(ctx)` - Calls setLLMObsTags
- `end(ctx)` - Restores context

**Inherited helpers:**
Collaborator

ditto on these not being inherited methods but instead methods on the tagger (see above comment)

const CompositePlugin = require('../../plugins/composite')

module.exports = [
CompositePlugin.createPlugin([TracingPlugin, LLMObsPlugin])
Collaborator

idt this is a function, we should spell it out more directly that they need to aggregate the plugins and then create a CompositePlugin class with a static plugins field like

'use strict'

const CompositePlugin = require('../../dd-trace/src/plugins/composite')
const VercelAILLMObsPlugin = require('../../dd-trace/src/llmobs/plugins/ai')
const VercelAITracingPlugin = require('./tracing')

class VercelAIPlugin extends CompositePlugin {
  static get id () { return 'ai' }
  static get plugins () {
    return {
      llmobs: VercelAILLMObsPlugin,
      tracing: VercelAITracingPlugin,
    }
  }
}

module.exports = VercelAIPlugin

Collaborator Author

instead of adding a code block I pointed to an example of the composite plugin structure.


**Signature:**
```javascript
assertLlmObsSpanEvent(actual, expected)
```
Collaborator

this function does have a docstring with types for it - do we still need the description below if it can read the docstring, or is the below table/parameters still necessary?


```javascript
const client = new MyLLMClient({
  apiKey: 'test-api-key', // Any value works for recording
```
Collaborator

this is wrong - not any value works for recording, it needs to be a real API key for the first go. this can be pulled from process.env. i think we can give an example like

apiKey: process.env.OPENAI_API_KEY ?? 'test-api-key'

this way locally it always works on the first go as long as the api key is set in the process env, otherwise in ci where it's not set it still has a value

Collaborator

although i see we specify it below, but we should maybe clarify the comment here anyhow
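The fallback suggested in this thread can be sketched as a tiny helper. `resolveApiKey` is a hypothetical name; the environment variable and placeholder value are the ones from the comment above.

```javascript
'use strict'

// Prefer a real key from the environment (needed when first recording a
// cassette); fall back to a placeholder where the variable is not set,
// for example in CI.
function resolveApiKey (env = process.env) {
  return env.OPENAI_API_KEY ?? 'test-api-key'
}
```

Note that `??` only falls back on `null` or `undefined`, so an empty-string `OPENAI_API_KEY` would be passed through unchanged.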

- Cassettes are outdated

**Process:**
1. Delete old cassettes: `rm -rf test/llmobs/plugins/{provider}/cassettes/`
Collaborator

we don't want to delete the whole folder's contents, just the one associated with a specific test. i don't have a great heuristic for doing this, usually it's one in the git diff in whatever PR is being worked on
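One way to narrow the deletion to cassettes touched by the current PR, following the git-diff heuristic mentioned above, is sketched below. `selectChangedCassettes` is a hypothetical helper, the directory pattern is taken from the command under review, and the list of changed files is assumed to come from something like `git diff --name-only`.

```javascript
'use strict'

// Pick the cassette files for one provider out of a list of changed paths.
// Only these candidates would be considered for deletion and re-recording,
// rather than the whole cassettes directory.
function selectChangedCassettes (changedFiles, provider) {
  const cassetteDir = `test/llmobs/plugins/${provider}/cassettes/`
  return changedFiles.filter(file => file.includes(cassetteDir))
}
```

The result is still only a candidate list; as the comment says, there is no exact heuristic, so it should be reviewed before anything is deleted.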

@crysmags crysmags merged commit ac15a16 into master Mar 19, 2026
787 checks passed
@crysmags crysmags deleted the feat/add-llmobs-skills branch March 19, 2026 15:42
dd-octo-sts bot pushed a commit that referenced this pull request Mar 20, 2026
* feat(skills): Add LLMObs integration and testing skills

Adds two new agent skills for LLM Observability (LLMObs) instrumentation. These skills provide comprehensive guidance for agents creating LLMObs integrations, ensuring proper category classification, message handling, and testing strategies.


Co-Authored-By: Claude Sonnet 4.5 <[email protected]>
@dd-octo-sts dd-octo-sts bot mentioned this pull request Mar 20, 2026
bengl pushed a commit that referenced this pull request Mar 23, 2026
* feat(skills): Add LLMObs integration and testing skills

Adds two new agent skills for LLM Observability (LLMObs) instrumentation. These skills provide comprehensive guidance for agents creating LLMObs integrations, ensuring proper category classification, message handling, and testing strategies.


Co-Authored-By: Claude Sonnet 4.5 <[email protected]>

4 participants