Case Study

How Snorkel Accelerates Frontier-Grade AI Training Datasets With Daytona

1,073,756

sandboxes sandboxes provisioned with Daytona

96%

faster evaluation cycles

Headquarters

Redwood City, CA

Industry

Science Data Infrastructure Technology

Department

Artificial Intelligence Data Science Engineering

Key Features

Sandbox Creation Speed Sandbox Statefulness Infrastructure Scale

Learn how this frontier AI data lab partnered with Daytona to provision configurable, isolated sandboxes, enable efficient compute allocation, and accelerate agent evaluation cycles.

Daytona delivers scalable sandbox infrastructure that works out of the box. With it, we turn multi‑day runs into hours, which directly improves how fast we ship new datasets and environments.

Rustem Feyzkhanov

AI Platform Engineering Manager at Snorkel

01 -- CHALLENGE

Scaling Agent Simulations Required Flexible Runtime Infrastructure

Snorkel was built to deliver frontier-grade datasets, training environments, and evaluations to leading AI teams that rely on agent simulations to improve models in real-world conditions. To produce reliable results, each simulation must run in its own configurable, isolated sandbox. 

For Rustem Feyzkhanov, AI Platform Engineering Manager, and his team, provisioning those sandboxes at a scalable cadence was a standing priority. Snorkel’s agent tasks range from writing Python code to navigating filesystems, and simulations often run for long periods depending on complexity. With thousands of examples in Snorkel’s evaluation pipeline, running these tasks sequentially wasn't an option.

This need for parallelization came with strict isolation requirements. By design, agents can take unpredictable execution paths and interfere with each other’s workloads, corrupting evaluation outputs. Because these cases are hard to spot in production, Rustem needed to ensure datasets wouldn’t require complex debugging later.

Each task also carried its own CPU, RAM, and dependency requirements, so sandboxes had to be configurable. A one-size-fits-all runtime could bottleneck complex workloads or skew simulation outcomes. Runtime performance wasn't the only consideration. Snorkel's customers scrutinize every layer of the infrastructure stack, including their partners’ tools. For a runtime platform to pass compliance reviews, its codebase had to be transparent.

Rustem knew building an in-house runtime solution to meet these needs would require complex, resource-intensive VM networking and workload scheduling. To keep Snorkel’s focus on producing frontier training datasets and environments, he started looking for a dedicated runtime platform that would deliver fast, scalable, and stateful infrastructure.

That search ultimately led him to Daytona. Their open-source runtime infrastructure provided the flexibility and performance Snorkel needed to parallelize agent simulations.

Building a runtime solution in-house would’ve pulled valuable focus away from our core initiatives. Daytona provides us with a flexible way to sandbox agent workloads without the operational burden.

Rustem Feyzkhanov

AI Platform Engineering Manager at Snorkel

02 -- SOLUTION

A Scalable Runtime Platform That Provisions Isolated Sandboxes on Demand

Rustem integrated Daytona into Snorkel’s infrastructure via a simple API call, enabling on‑demand sandbox provisioning at scale.

With Daytona, each sandbox runs in full isolation, so agents never touch each other’s files, processes, or network activity. This isolation ensures each example reflects the real outcome of its simulation, keeping labels accurate. In turn, Rustem and his team ship datasets faster with the confidence that clients can rely on them for critical AI training.

Because Daytona is optimized for sandbox orchestration, Rustem and his team run long-running sandboxes in parallel, evaluating thousands of outcomes in a single pass. This setup saves hours every week, which is redirected toward refining evaluation rubrics and collaborating on new expert datasets.

Through Daytona's Snapshots, Rustem and his engineers pre‑configure runtimes for each agent workload. By specifying CPU, memory, disk, and dependencies up front, they right‑size each sandbox and ensure it mirrors the target deployment context. As a result, compute stays lean, and evaluation outputs can be fully audited and reproduced.

While Daytona’s runtimes help Snorkel power its datasets at scale, its open-source codebase provides full visibility into the runtime stack.

Beyond the platform itself, what solidified Rustem’s confidence in the partnership was Daytona’s commitment to ongoing collaboration. “All our questions are answered on the same day, and sandbox limit increases are approved almost instantaneously,” shares Rustem. The ability to scale sandbox provisioning on demand has proven essential as Snorkel's workloads spike during busy evaluation cycles.

Daytona approved our sandbox limits right away, and continues to do so as we scale. We run hundreds of concurrent simulations without worrying about infrastructure bottlenecks.

Rustem Feyzkhanov

AI Platform Engineering Manager at Snorkel

03 -- RESULT

Snorkel Evaluates Agent Simulations 96% Faster With Daytona

With Daytona, Snorkel has the flexible runtime infrastructure needed to deliver frontier-grade AI training datasets at scale. Today, Rustem and his team spin up thousands of isolated sandboxes in parallel, ensuring agent evaluations are fast, reliable, and efficient.

  • 1,073,756 sandboxes provisioned with Daytona

  • 96% faster evaluation cycles

Looking ahead, Rustem plans to integrate more of Daytona’s features to execute more complex tasks and scale evaluations, expanding the scope of Snorkel’s platform.

Daytona moves fast, and their platform is incredibly well supported.

Rustem Feyzkhanov

AI Platform Engineering Manager at Snorkel

NEWSLETTER
Subscribe to DotFiles Insider, a fortnightly newsletter for developers covering stories, techniques, guides and the latest product innovations.
NEWSLETTER
Subscribe to DotFiles Insider, a fortnightly newsletter for developers covering stories, techniques, guides and the latest product innovations.
NEWSLETTER
Subscribe to DotFiles Insider, a fortnightly newsletter for developers covering stories, techniques, guides and the latest product innovations.