lakeFS Acquires DVC, Uniting Data Version Control Pioneers to Accelerate AI-Ready Data
Bridging the AI infrastructure gap
The Control Plane for
AI-Ready Data
Built on our highly scalable data version control architecture, lakeFS manages the data lifecycle, provenance, and unified access for AI and data engineering teams
Trusted By:
FREE PLAYBOOK
Building AI Factories at Enterprise Scale
Discover how to accelerate AI delivery, ensure reproducibility, reduce data friction, and support compliance
Ensure
Data Quality
- Enforce data quality + compliance standards to catch errors before they impact production
- Test pipeline / model changes in isolation on production data, with zero data copy
- Instantly rollback from data incidents
Make
Training Reproducible
- Track which data was used in experiments or model training
- Gain full visibility into data history with built-in audit trail
- Automatically satisfy model governance requirements
Reduce
Data Access Friction
- Keep your GPUs busy without waiting for data
- Manage access permissions across all storage from one place
- Work with any tool on remote data as if it were local
Curious how lakeFS can help you deliver AI projects faster and more efficiently?
CASE STUDY:
How Arm Powers Its Data Management Infrastructure with lakeFS
With lakeFS, Arm implemented automated data cleaning, avoided costly data duplication, streamlined engineering workflows, and established a robust governance framework to manage data across distributed teams. The result: faster go-to-market, reduced storage costs, improved development velocity, and stronger data governance. Read full case study
Faster
go-to-market
Improved
development
velocity
Reduced
storage costs
Stronger data
governance
lakeFS saved us from the analysis paralysis of overthinking how to test new software on our data lake at Netflix scale. In less than 20 min I had lakeFS up and running, and was able to run tests against my production data in isolation and validate the software change thoroughly before pushing to production. With lakeFS, we improved the robustness and flexibility of our data systems
Holden Karau,
Open Source Engineer
Open Source Engineer
With lakeFS, we have streamlined data science and MLOps workflows, adapted data access controls for different teams, accelerated productivity and reduced time-to-insight for ML engineering projects.
Leonard Aukea
Head of ML Engineering & Operations
Head of ML Engineering & Operations
Transparent, traceable and repeatable development of AI is critical to us. What’s important for Lockheed Martin is that we don’t just focus on what we’re building but also on the how.
Greg Forrest
Director of AI Foundations
Director of AI Foundations
lakeFS allows managing versions for any type of feed. Some files are tabular; some are not. Tracking feeds in lakeFS is pretty fast.
Vara Ghanta
Principal Software Engineering Manager
Principal Software Engineering Manager
Moving to a data branching solution has paid off quickly for us. A few days after completing the migration, we’ve already reduced testing time by 80% on two different projects. And we’re excited to see how data branching increases our product velocity.
Ryan Green
CTO
CTO
With lakeFS we can easily achieve advanced use cases with data, such as running parallel pipelines with different logic to experiment or conduct what-if analysis, compare large result sets for data science and machine learning, and more.
Stephen Seewald,
Raghvendra Verma,
Cory Matheson
Raghvendra Verma,
Cory Matheson
Our partners
Seamlessly integrate with your
data and AI stack
lakeFS
Object Storage
Compute Engines
Ingest Technologies
Data Storage Formats
Orchestration & Workflow
Research and ML
Data Quality
lakeFS
Object Storage
Compute Engines
Ingest Technologies
Data Storage Formats
Orchestration & Workflow
Research and ML
Data Quality
lakeFS connects to every object storage that uses the S3 interface
lakeFS supports all broadly used compute engines
All common ingest technologies are integrated into lakeFS
Data Quality is mandatory for your data lake health. Ensure/maintain the highest data quality together with lakeFS
Learn more about lakeFS and AI-ready data
Product, Thought Leadership
lakeFS Named a Representative Vendor in the 2025 Gartner® Market Guide for DataOps Tools
Read More >
We're excited to share that lakeFS has been named a Representative Vendor in the 2025 Gartner® Market Guide for DataOps Tools. We...
ebook Essential Guide Data Version Control
Read More >
The Essential Guide to Data Version Control In the race to build production-ready AI systems, most enterprises hit the same wall: data...
How Department of Energy Ensures Proper Data Governance for AI Model Development
Read More >
This case study is a summary of the talk presented at Data+AI Summit 2024: Project Alexandria: A Digital Library for Research Data...
StorageGRID