Added DocumentRetrievalEvaluator to Azure AI Evaluation to support evaluation of document search by abhahn · Pull Request #39929 · Azure/azure-sdk-for-python

abhahn · 2025-03-04T00:39:19Z

Description

This PR includes a new class, DocumentRetrievalEvaluator, to produce document retrieval evaluator metrics over a set of input document, measured against a set of input ground-truth documents.

All SDK Contribution checklist:

The pull request does not introduce [breaking changes]
CHANGELOG is updated for new features, bug fixes or other significant changes.
I have read the contribution guidelines.

General Guidelines and Best Practices

Title of the pull request is clear and informative.
There are a small number of commits, each of which have an informative message. This means that previously merged commits do not appear in the history of the PR. For more information on cleaning up the commits in your PR, see this page.

Testing Guidelines

Pull request includes test coverage for the included changes.

azure-sdk · 2025-03-04T01:03:59Z

API change check

API changes are not detected in this pull request.

…r input and output schemas

…ions

Copilot

Pull Request Overview

This PR introduces a new evaluator class, DocumentRetrievalEvaluator, to compute document retrieval metrics such as NDCG, XDCG, fidelity, and top-K relevance for document search queries. Key changes include:

Adding an init.py file that exposes DocumentRetrievalEvaluator.
Implementing DocumentRetrievalEvaluator with methods to compute metrics and perform input validation.

Reviewed Changes

Copilot reviewed 2 out of 4 changed files in this pull request and generated no comments.

File	Description
sdk/evaluation/azure-ai-evaluation/azure/ai/evaluation/_evaluators/_document_retrieval/init.py	Exposes the DocumentRetrievalEvaluator class
sdk/evaluation/azure-ai-evaluation/azure/ai/evaluation/_evaluators/_document_retrieval/_document_retrieval.py	Implements the evaluator methods and metric computations

Files not reviewed (2)

sdk/evaluation/azure-ai-evaluation/azure/ai/evaluation/_evaluators/_document_retrieval/input.schema: Language not supported
sdk/evaluation/azure-ai-evaluation/azure/ai/evaluation/_evaluators/_document_retrieval/metrics.schema: Language not supported

Comments suppressed due to low confidence (3)

sdk/evaluation/azure-ai-evaluation/azure/ai/evaluation/_evaluators/_document_retrieval/_document_retrieval.py:42

The call to super().init() is unnecessary since DocumentRetrievalEvaluator does not extend a base class. Consider removing it to avoid confusion.

super().__init__()

sdk/evaluation/azure-ai-evaluation/azure/ai/evaluation/_evaluators/_document_retrieval/_document_retrieval.py:166

The output key 'ratioholes' does not match the TypedDict definition which specifies 'holes_ratio'. Consider updating it for consistency.

"ratioholes": 0,

sdk/evaluation/azure-ai-evaluation/azure/ai/evaluation/_evaluators/_document_retrieval/_document_retrieval.py:211

The key 'ratioholes' is inconsistent with the TypedDict definition that uses 'holes_ratio'. Update it accordingly.

"ratioholes": ratioholes,

...ure-ai-evaluation/azure/ai/evaluation/_evaluators/_document_retrieval/_document_retrieval.py

…into abhahn/document_retrieval_evaluator

...ure-ai-evaluation/azure/ai/evaluation/_evaluators/_document_retrieval/_document_retrieval.py

sdk/evaluation/azure-ai-evaluation/tests/unittests/test_document_retrieval_evaluator.py

…aluation of document search (#39929) * Added new evaluator code for Azure AI Evaluation * Added TypedDict for input validation and created json schema specs for input and output schemas * Added a temporary hack to make the example runnable; updated schema * Implementation improvements to align with applied science recommendations * Added docstrings and cleaned up input schema file * Updates based on in-person feedback * Addressed comments from the PR and SDK review * small fix for threshold dict update * Updates to support complex object inputs in DocumentRetrievalEvaluator * Silence cspell errors for metric names' * Updates to cspell.json * Some updates for style enforcement; removed json schema files * Reformatted with black * Added tests, addressed a few comments and handled some edge cases * Updates to tests and a few code fixes * Docstring updates and added samples * PR comments * A few small test updates --------- Co-authored-by: Abby Hartman <[email protected]>

Added new evaluator code for Azure AI Evaluation

e34f636

github-actions bot added the Evaluation Issues related to the client library for Azure AI Evaluation label Mar 4, 2025

Abby Hartman added 4 commits March 5, 2025 17:30

Added TypedDict for input validation and created json schema specs fo…

0a45b3d

…r input and output schemas

Added a temporary hack to make the example runnable; updated schema

c31fb7d

Implementation improvements to align with applied science recommendat…

982ff66

…ions

Added docstrings and cleaned up input schema file

eb2ec90

abhahn marked this pull request as ready for review March 18, 2025 20:41

Copilot AI review requested due to automatic review settings March 18, 2025 20:41

abhahn requested a review from a team as a code owner March 18, 2025 20:41

Copilot AI reviewed Mar 18, 2025

View reviewed changes

Updates based on in-person feedback

73b61b5

singankit reviewed Apr 2, 2025

View reviewed changes

...ure-ai-evaluation/azure/ai/evaluation/_evaluators/_document_retrieval/_document_retrieval.py Outdated Show resolved Hide resolved

singankit reviewed Apr 2, 2025

View reviewed changes

...ure-ai-evaluation/azure/ai/evaluation/_evaluators/_document_retrieval/_document_retrieval.py Outdated Show resolved Hide resolved

abhahn commented Apr 8, 2025

View reviewed changes

...ure-ai-evaluation/azure/ai/evaluation/_evaluators/_document_retrieval/_document_retrieval.py Outdated Show resolved Hide resolved

johanste reviewed Apr 8, 2025

View reviewed changes

...ure-ai-evaluation/azure/ai/evaluation/_evaluators/_document_retrieval/_document_retrieval.py Outdated Show resolved Hide resolved

johanste reviewed Apr 8, 2025

View reviewed changes

...ure-ai-evaluation/azure/ai/evaluation/_evaluators/_document_retrieval/_document_retrieval.py Outdated Show resolved Hide resolved

Abby Hartman added 8 commits April 11, 2025 10:27

Addressed comments from the PR and SDK review

8b42def

small fix for threshold dict update

3696adb

Merge branch 'main' of https://github.com/Azure/azure-sdk-for-python …

a7460ec

…into abhahn/document_retrieval_evaluator

Updates to support complex object inputs in DocumentRetrievalEvaluator

1349129

Silence cspell errors for metric names'

da7f888

Updates to cspell.json

75d9ab6

Some updates for style enforcement; removed json schema files

e89d172

Reformatted with black

8433f64

singankit reviewed Apr 18, 2025

View reviewed changes

...ure-ai-evaluation/azure/ai/evaluation/_evaluators/_document_retrieval/_document_retrieval.py Show resolved Hide resolved

singankit reviewed Apr 18, 2025

View reviewed changes

...ure-ai-evaluation/azure/ai/evaluation/_evaluators/_document_retrieval/_document_retrieval.py Show resolved Hide resolved

singankit reviewed Apr 18, 2025

View reviewed changes

...ure-ai-evaluation/azure/ai/evaluation/_evaluators/_document_retrieval/_document_retrieval.py Show resolved Hide resolved

singankit reviewed Apr 18, 2025

View reviewed changes

...ure-ai-evaluation/azure/ai/evaluation/_evaluators/_document_retrieval/_document_retrieval.py Outdated Show resolved Hide resolved

singankit reviewed Apr 18, 2025

View reviewed changes

...ure-ai-evaluation/azure/ai/evaluation/_evaluators/_document_retrieval/_document_retrieval.py Show resolved Hide resolved

singankit reviewed Apr 18, 2025

View reviewed changes

...ure-ai-evaluation/azure/ai/evaluation/_evaluators/_document_retrieval/_document_retrieval.py Show resolved Hide resolved

singankit reviewed Apr 18, 2025

View reviewed changes

...ure-ai-evaluation/azure/ai/evaluation/_evaluators/_document_retrieval/_document_retrieval.py Show resolved Hide resolved

Added tests, addressed a few comments and handled some edge cases

47d8fb4

changliu2 suggested changes Apr 18, 2025

View reviewed changes

...ure-ai-evaluation/azure/ai/evaluation/_evaluators/_document_retrieval/_document_retrieval.py Outdated Show resolved Hide resolved

...ure-ai-evaluation/azure/ai/evaluation/_evaluators/_document_retrieval/_document_retrieval.py Show resolved Hide resolved

Abby Hartman added 2 commits April 18, 2025 13:20

Updates to tests and a few code fixes

fc506e1

Docstring updates and added samples

a0b1335

singankit reviewed Apr 18, 2025

View reviewed changes

...ure-ai-evaluation/azure/ai/evaluation/_evaluators/_document_retrieval/_document_retrieval.py Outdated Show resolved Hide resolved

changliu2 reviewed Apr 18, 2025

View reviewed changes

sdk/evaluation/azure-ai-evaluation/tests/unittests/test_document_retrieval_evaluator.py Outdated Show resolved Hide resolved

Abby Hartman added 3 commits April 18, 2025 14:11

PR comments

8b3abac

A few small test updates

130eed5

cspell update

8de9c48

singankit approved these changes Apr 18, 2025

View reviewed changes

singankit enabled auto-merge (squash) April 18, 2025 22:16

singankit approved these changes Apr 19, 2025

View reviewed changes

singankit merged commit 1100a73 into main Apr 19, 2025
19 checks passed

singankit deleted the abhahn/document_retrieval_evaluator branch April 19, 2025 01:11

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added DocumentRetrievalEvaluator to Azure AI Evaluation to support evaluation of document search#39929

Added DocumentRetrievalEvaluator to Azure AI Evaluation to support evaluation of document search#39929
singankit merged 20 commits intomainfrom
abhahn/document_retrieval_evaluator

abhahn commented Mar 4, 2025

Uh oh!

azure-sdk commented Mar 4, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

Conversation

abhahn commented Mar 4, 2025

Description

All SDK Contribution checklist:

General Guidelines and Best Practices

Testing Guidelines

Uh oh!

azure-sdk commented Mar 4, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants