: Towards Lab Automation
with Computer-Using Agents

Automating the computational layer of laboratory work by operating computers the way real scientists do.

Chang Ma*, Linh T. Trinh*, Matt Bucci, Aviv Regev, Hanchen Wang

*Equal contribution

Genentech The University of Hong Kong Stanford University
Biomedical imaging: light microscopy, digital pathology, and fluorescence cell imaging
The Challenge

Laboratory discovery is bottlenecked by its computational layer

Modern discovery increasingly depends on computational workflows that connect experimental data to analysis, interpretation, and follow-up hypotheses. Yet these workflows remain labor-intensive โ€” built on specialized software, visual inspection through graphical user interfaces, and the integration of knowledge across many scattered sources.

Most AI agents for biology are confined to text interfaces or predefined APIs. They cannot click through a figure, pan across a whole-slide image, or operate the interactive scientific tools that real discovery depends on. To automate this computational layer, an agent needs to do what a scientist does โ€” use a computer.

We introduce Orion, a computer-using AI agent that combines code execution with GUI-based control. It navigates digital figures with complex hierarchical structure, extracts detail from thousands of multimodal inputs, and connects evidence across steps to generate coherent hypotheses โ€” sustaining fully automated pipelines of more than 300 sequential actions, and moving towards genuine lab automation.

Overview of Orion operating within a computer environment with GUI and terminal interfaces.
Overview of Orion. Orion operates a standard computer through both a GUI and a terminal โ€” like a human scientist โ€” driving scientific software while inspecting figures and web databases.
How Orion Works

Two interfaces, one shared workspace

Orion mirrors the skills of a human scientist through two integrated interfaces. A terminal navigates the file system and executes Python for data processing; a GUI interface perceives the screen and operates off-the-shelf software through mouse and keyboard. A hierarchical architecture keeps long-horizon work tractable โ€” a controller manages global strategy while specialized GUI subagents are spawned on demand and intermediate results are externalized to disk.

Visual-Centric Deep Research

Seeing the evidence, not just reading about it

Orion reviewing primary visual evidence, clicking on precise coordinates to zoom into high-resolution images.
Example: Orion reviews primary visual evidence, and zoom into high-resolution images for details.

Many biological databases expose details only through a GUI, and scientific figures carry information that text-only agents simply cannot reach. Orion browses the web and inspects images through the same interfaces a researcher uses โ€” zooming into high-resolution staining, navigating multi-panel figures, and querying databases directly.

Orion achieved over 90% accuracy on biomedical database and literature retrieval โ€” outperforming text-only deep-research agents that never see the underlying images.

Orion solving a MicroVQA question by navigating the Human Protein Atlas and zooming into staining.
Orion outperforms strong baselines on database and literature search.
Self-Evolving Skill Learning

Learning to operate software it has never seen

Using public tutorials, Orion taught itself to operate unfamiliar scientific software โ€” from CellProfiler for microscopy image analysis to QuPath for digital pathology โ€” drafting workflows programmatically and debugging them visually when results fell short. It verifies its outputs against ground truth, corrects errors, and distills each solved task into a reusable skill, building a curriculum from simple tasks toward advanced analysis.

Orion solving a MicroVQA question by navigating the Human Protein Atlas and zooming into staining.
Self-Evolving Skill Learning on QuPath.
Digital Pathology

Navigating whole-slide images from tissue to cell

Whole-slide images span centimeters of tissue down to micron-scale subcellular detail. Operating QuPath, Orion learned the intuitive "click-hold-drag" motor skills needed to define regions of interest, and developed a hierarchical browsing pattern โ€” alternating broad tissue exploration with fine-grained local inspection.

Localizing and segmenting breast-cancer metastases in lymph-node slides, Orion beat a SAM baseline by +12.3% F1 and +15.5% spatial IoU, performing comparably to a human annotator.

Open-Ended Discovery

100 hours of autonomous discovery on the JUMP dataset

We challenged Orion with hypothesis generation on JUMP โ€” the largest public optical screening dataset, with Cell Painting profiles for over 15,000 genetic perturbations. Across four days of autonomous operation, Orion mined the data for disease-associated genes and distinctive phenotypes, integrating imaging evidence with knowledge from KEGG, the Human Protein Atlas, and the literature.

It generated 52 detailed reports; human review prioritized 22 plausible mechanistic hypotheses โ€” spanning mitochondrial dysfunction, cell-cycle arrest, and organelle depletion, linked to cancer, metabolic, and neurodevelopmental disease.

Orion's iterative discovery workflow, action-transition patterns, and human review pipeline on JUMP.
Hypotheses are first proposed from observed experimental phenotypic data, and then contextualized through systematic searches of biological knowledge bases and the literature to identify supporting evidence and research gaps. Finally, hypotheses are refined (right) and verified (left) using complementary databases.
Case Studies

From morphology to mechanism

With light-touch human feedback, Orion went beyond description to propose biologically grounded, mechanistically distinct hypotheses โ€” correcting its own segmentation and normalization mistakes along the way.