[Model Monitoring] Optimize `drift-over-time` API for V3IO TSDB #9135

Eyal-Danieli · 2025-12-29T15:01:58Z

Optimize the drift-over-time API for V3IO TSDB. The main issue was that we iterated over the data multiple times after retrieving it, even though the work can be done in a single pass. We also made several smaller optimizations to the code logic (details below).

🛠️ Changes Made

Use integers representing nanoseconds instead of datetime objects for bucketing. This also means working in nanoseconds rather than microseconds for better performance during this phase.
Filter out invalid data early, before any processing. Previously, we processed everything first and filtered later. With the new design, we avoid doing unnecessary work.
Reduce multiple separate iterations over the data into a single pass that aggregates directly while iterating.
Perform datetime conversions only at the very end, after all filtering and processing is done.
Create fewer objects, reducing memory allocations.
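
The changes listed above can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: the function name `drift_counts_per_bucket`, the `(timestamp_ns, endpoint_id, status)` row shape, the status codes 0/1/2, and the `(datetime, suspected, detected)` output tuples are all assumptions made for the example.

```python
from collections import defaultdict
from datetime import datetime, timedelta, timezone

EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)

# Assumed status codes, for this sketch only.
NO_DRIFT, SUSPECTED, DETECTED = 0, 1, 2
VALID_STATUSES = frozenset({NO_DRIFT, SUSPECTED, DETECTED})


def ns_to_datetime(ns: int) -> datetime:
    """Convert integer nanoseconds since the epoch to an aware datetime."""
    return EPOCH + timedelta(microseconds=ns // 1_000)


def drift_counts_per_bucket(rows, start_ns: int, interval_ns: int):
    """Aggregate the max status per endpoint per time bucket in one pass.

    `rows` is an iterable of (timestamp_ns, endpoint_id, status) tuples.
    """
    bucket_endpoint_status = defaultdict(dict)

    for timestamp_ns, endpoint_id, status in rows:
        # Filter invalid rows first, before doing any other work on them.
        if status not in VALID_STATUSES or timestamp_ns < start_ns:
            continue

        # Pure integer arithmetic: no datetime objects are created here.
        bucket_start_ns = timestamp_ns - (timestamp_ns - start_ns) % interval_ns

        # Aggregate while iterating: keep the max status per endpoint.
        bucket = bucket_endpoint_status[bucket_start_ns]
        bucket[endpoint_id] = max(bucket.get(endpoint_id, status), status)

    # Convert to datetime only at the very end, once per bucket.
    values = []
    for bucket_start_ns in sorted(bucket_endpoint_status):
        statuses = list(bucket_endpoint_status[bucket_start_ns].values())
        values.append(
            (
                ns_to_datetime(bucket_start_ns),
                statuses.count(SUSPECTED),
                statuses.count(DETECTED),
            )
        )
    return values
```

With this shape, the number of datetime conversions depends on the number of buckets rather than the number of rows, which is the point of deferring them.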

✅ Checklist

I updated the documentation (if applicable)
I have tested the changes in this PR
I confirmed whether my changes are covered by system tests
- If yes, I ran all relevant system tests and ensured they passed before submitting this PR
- I updated existing system tests and/or added new ones if needed to cover my changes
If I introduced a deprecation:
- I followed the Deprecation Guidelines
- I updated the relevant Jira ticket for documentation

🧪 Testing

TestMonitoringAppFlow passed (includes testing for drift over time values).

🔗 References

Ticket link: https://iguazio.atlassian.net/browse/ML-11820
Design docs links:
External links:

🚨 Breaking Changes?

Yes (explain below)
No

🔍️ Additional Notes

mlrun/model_monitoring/db/tsdb/v3io/v3io_connector.py

danielperezz

Looks great :)
Had two comments

danielperezz · 2025-12-30T10:29:39Z

mlrun/model_monitoring/db/tsdb/v3io/v3io_connector.py

+        return mm_schemas.ModelEndpointDriftValues(values=values)

    @staticmethod
    def _convert_drift_data_to_values(


I think this function can be removed now

danielperezz · 2025-12-30T10:41:00Z

mlrun/model_monitoring/db/tsdb/v3io/v3io_connector.py

+                if bucket_start_ns not in bucket_endpoint_status:
+                    bucket_endpoint_status[bucket_start_ns] = {}
+
+                # Update max status for this endpoint in this bucket
+                if endpoint_id not in bucket_endpoint_status[bucket_start_ns]:
+                    bucket_endpoint_status[bucket_start_ns][endpoint_id] = status
+                elif status > bucket_endpoint_status[bucket_start_ns][endpoint_id]:
+                    bucket_endpoint_status[bucket_start_ns][endpoint_id] = status


This can be simplified using defaultdict and get:

Suggested change

-                if bucket_start_ns not in bucket_endpoint_status:
-                    bucket_endpoint_status[bucket_start_ns] = {}
-
-                # Update max status for this endpoint in this bucket
-                if endpoint_id not in bucket_endpoint_status[bucket_start_ns]:
-                    bucket_endpoint_status[bucket_start_ns][endpoint_id] = status
-                elif status > bucket_endpoint_status[bucket_start_ns][endpoint_id]:
-                    bucket_endpoint_status[bucket_start_ns][endpoint_id] = status
+                bucket_endpoint_status = defaultdict(dict)  # place above instead of the current initialization
+                bucket = bucket_endpoint_status[bucket_start_ns]
+                bucket[endpoint_id] = max(bucket.get(endpoint_id, status), status)
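
To check that the suggestion preserves behavior, here is a small standalone sketch comparing the two update styles on the same input. The helper names `update_explicit` and `update_with_defaultdict` and the sample events are hypothetical and do not appear in the PR.

```python
from collections import defaultdict


def update_explicit(events):
    """Original style: explicit membership checks at both dict levels."""
    bucket_endpoint_status = {}
    for bucket_start_ns, endpoint_id, status in events:
        if bucket_start_ns not in bucket_endpoint_status:
            bucket_endpoint_status[bucket_start_ns] = {}
        if endpoint_id not in bucket_endpoint_status[bucket_start_ns]:
            bucket_endpoint_status[bucket_start_ns][endpoint_id] = status
        elif status > bucket_endpoint_status[bucket_start_ns][endpoint_id]:
            bucket_endpoint_status[bucket_start_ns][endpoint_id] = status
    return bucket_endpoint_status


def update_with_defaultdict(events):
    """Suggested style: defaultdict creates buckets, max() keeps the peak."""
    bucket_endpoint_status = defaultdict(dict)
    for bucket_start_ns, endpoint_id, status in events:
        bucket = bucket_endpoint_status[bucket_start_ns]
        # Defaulting to `status` makes the first write a no-op comparison.
        bucket[endpoint_id] = max(bucket.get(endpoint_id, status), status)
    # Return a plain dict so the result compares equal to the explicit one.
    return dict(bucket_endpoint_status)


events = [(0, "a", 1), (0, "a", 2), (0, "b", 1), (60, "a", 0), (60, "a", 1), (0, "a", 1)]
same = update_explicit(events) == update_with_defaultdict(events)
```

One difference to keep in mind: reading a missing key from a `defaultdict` inserts it, so lookups outside the update loop should use `.get()` or `in` if empty buckets must not appear.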

optimize query data drift values for v3io

902d1fa

Eyal-Danieli requested a review from a team as a code owner December 29, 2025 15:01

github-actions bot added the area/sdk label Dec 29, 2025

fix lint

fbe6dd9

assaf758 reviewed Dec 29, 2025

mlrun/model_monitoring/db/tsdb/v3io/v3io_connector.py (outdated, resolved)

danielperezz reviewed Dec 30, 2025

fix according to review

61cb190

Eyal-Danieli requested review from assaf758 and danielperezz December 30, 2025 17:11

danielperezz approved these changes Dec 31, 2025

remove unused variables

5d36b04

assaf758 approved these changes Dec 31, 2025

assaf758 merged commit fb236f0 into mlrun:development Dec 31, 2025
13 checks passed
